Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 1241 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 41.3 KiB |
Average record size in memory | 34.1 B |
Variable types
Numeric | 2 |
---|---|
Text | 2 |
Dataset
Description | 2022년 기준의 데이터로, 연구개발특구진흥재단의 연구소기업 운영 현황에 관한 데이터입니다.연구소기업명과 등록연도 등의 데이터를 보유하고 있습니다.해당 데이터가 보유한 칼럼은 다음과 같습니다.칼럼명 : 구분, 기업명, 사업자등록번호, 등록연도 |
---|---|
Author | (재)연구개발특구진흥재단 |
URL | https://www.data.go.kr/data/15089826/fileData.do |
Reproduction
Analysis started | 2023-12-12 04:10:39.604379 |
---|---|
Analysis finished | 2023-12-12 04:10:40.524483 |
Duration | 0.92 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
구분
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 1241 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 621 |
Minimum | 1 |
---|---|
Maximum | 1241 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 11.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 63 |
Q1 | 311 |
median | 621 |
Q3 | 931 |
95-th percentile | 1179 |
Maximum | 1241 |
Range | 1240 |
Interquartile range (IQR) | 620 |
Descriptive statistics
Standard deviation | 358.39015 |
---|---|
Coefficient of variation (CV) | 0.57711779 |
Kurtosis | -1.2 |
Mean | 621 |
Median Absolute Deviation (MAD) | 310 |
Skewness | 0 |
Sum | 770661 |
Variance | 128443.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.1% |
826 | 1 | 0.1% |
833 | 1 | 0.1% |
832 | 1 | 0.1% |
831 | 1 | 0.1% |
830 | 1 | 0.1% |
829 | 1 | 0.1% |
828 | 1 | 0.1% |
827 | 1 | 0.1% |
825 | 1 | 0.1% |
Other values (1231) | 1231 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
1241 | 1 | |
1240 | 1 | |
1239 | 1 | |
1238 | 1 | |
1237 | 1 | |
1236 | 1 | |
1235 | 1 | |
1234 | 1 | |
1233 | 1 | |
1232 | 1 |
기업명
Text
Distinct | 1238 |
---|---|
Distinct (%) | 99.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.8 KiB |
Value | Count | Frequency (%) |
농업회사법인 | 7 | 0.6% |
유한회사 | 3 | 0.2% |
㈜그린코어 | 2 | 0.2% |
㈜케이에스씨 | 2 | 0.2% |
㈜헬스텍 | 2 | 0.2% |
㈜제이에스컴퍼니 | 1 | 0.1% |
㈜비놀로지 | 1 | 0.1% |
㈜휴엔씨네이쳐 | 1 | 0.1% |
㈜테라프릭스 | 1 | 0.1% |
㈜에쓰큐씨 | 1 | 0.1% |
Other values (1238) | 1238 |
Most occurring characters
Value | Count | Frequency (%) |
㈜ | 1218 | 16.3% |
이 | 555 | 7.4% |
스 | 378 | 5.0% |
에 | 274 | 3.7% |
아 | 137 | 1.8% |
오 | 132 | 1.8% |
디 | 120 | 1.6% |
지 | 111 | 1.5% |
리 | 110 | 1.5% |
크 | 96 | 1.3% |
Other values (463) | 4360 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 6217 | |
Other Symbol | 1218 | 16.3% |
Space Separator | 30 | 0.4% |
Close Punctuation | 10 | 0.1% |
Open Punctuation | 9 | 0.1% |
Lowercase Letter | 5 | 0.1% |
Decimal Number | 1 | < 0.1% |
Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 555 | 8.9% |
스 | 378 | 6.1% |
에 | 274 | 4.4% |
아 | 137 | 2.2% |
오 | 132 | 2.1% |
디 | 120 | 1.9% |
지 | 111 | 1.8% |
리 | 110 | 1.8% |
크 | 96 | 1.5% |
트 | 93 | 1.5% |
Other values (452) | 4211 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 1 | |
l | 1 | |
u | 1 | |
a | 1 | |
d | 1 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 1218 |
Space Separator
Value | Count | Frequency (%) |
30 |
Close Punctuation
Value | Count | Frequency (%) |
) | 10 |
Open Punctuation
Value | Count | Frequency (%) |
( | 9 |
Decimal Number
Value | Count | Frequency (%) |
2 | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 7435 | |
Common | 50 | 0.7% |
Latin | 6 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
㈜ | 1218 | 16.4% |
이 | 555 | 7.5% |
스 | 378 | 5.1% |
에 | 274 | 3.7% |
아 | 137 | 1.8% |
오 | 132 | 1.8% |
디 | 120 | 1.6% |
지 | 111 | 1.5% |
리 | 110 | 1.5% |
크 | 96 | 1.3% |
Other values (453) | 4304 |
Latin
Value | Count | Frequency (%) |
e | 1 | |
l | 1 | |
u | 1 | |
a | 1 | |
d | 1 | |
N | 1 |
Common
Value | Count | Frequency (%) |
30 | ||
) | 10 | 20.0% |
( | 9 | 18.0% |
2 | 1 | 2.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 6217 | |
None | 1218 | 16.3% |
ASCII | 56 | 0.7% |
Most frequent character per block
None
Value | Count | Frequency (%) |
㈜ | 1218 |
Hangul
Value | Count | Frequency (%) |
이 | 555 | 8.9% |
스 | 378 | 6.1% |
에 | 274 | 4.4% |
아 | 137 | 2.2% |
오 | 132 | 2.1% |
디 | 120 | 1.9% |
지 | 111 | 1.8% |
리 | 110 | 1.8% |
크 | 96 | 1.5% |
트 | 93 | 1.5% |
Other values (452) | 4211 |
ASCII
Value | Count | Frequency (%) |
30 | ||
) | 10 | 17.9% |
( | 9 | 16.1% |
e | 1 | 1.8% |
l | 1 | 1.8% |
u | 1 | 1.8% |
a | 1 | 1.8% |
d | 1 | 1.8% |
2 | 1 | 1.8% |
N | 1 | 1.8% |
사업자등록번호
Text
UNIQUE
 
Distinct | 1241 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 9.8 KiB |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 12.002417 |
Min length | 12 |
Characters and Unicode
Total characters | 14895 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1241 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 120-86-42098 |
---|---|
2nd row | 314-81-38438 |
3rd row | 314-81-57605 |
4th row | 314-86-01557 |
5th row | 314-86-31949 |
Value | Count | Frequency (%) |
120-86-42098 | 1 | 0.1% |
572-87-02052 | 1 | 0.1% |
838-81-02080 | 1 | 0.1% |
515-81-56306 | 1 | 0.1% |
572-87-02125 | 1 | 0.1% |
489-87-01770 | 1 | 0.1% |
695-87-01910 | 1 | 0.1% |
516-87-01806 | 1 | 0.1% |
579-86-01800 | 1 | 0.1% |
569-81-01278 | 1 | 0.1% |
Other values (1231) | 1231 |
Most occurring characters
Value | Count | Frequency (%) |
- | 2482 | |
8 | 2284 | |
0 | 2051 | |
1 | 1595 | |
6 | 1155 | |
7 | 1079 | |
2 | 1033 | |
4 | 903 | 6.1% |
3 | 840 | 5.6% |
5 | 830 | 5.6% |
Other values (2) | 643 | 4.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 12410 | |
Dash Punctuation | 2482 | 16.7% |
Space Separator | 3 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
8 | 2284 | |
0 | 2051 | |
1 | 1595 | |
6 | 1155 | |
7 | 1079 | |
2 | 1033 | |
4 | 903 | 7.3% |
3 | 840 | 6.8% |
5 | 830 | 6.7% |
9 | 640 | 5.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2482 |
Space Separator
Value | Count | Frequency (%) |
3 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 14895 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 2482 | |
8 | 2284 | |
0 | 2051 | |
1 | 1595 | |
6 | 1155 | |
7 | 1079 | |
2 | 1033 | |
4 | 903 | 6.1% |
3 | 840 | 5.6% |
5 | 830 | 5.6% |
Other values (2) | 643 | 4.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 14895 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 2482 | |
8 | 2284 | |
0 | 2051 | |
1 | 1595 | |
6 | 1155 | |
7 | 1079 | |
2 | 1033 | |
4 | 903 | 6.1% |
3 | 840 | 5.6% |
5 | 830 | 5.6% |
Other values (2) | 643 | 4.3% |
등록연도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 13 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2019.3102 |
Minimum | 2008 |
---|---|
Maximum | 2022 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 11.0 KiB |
Quantile statistics
Minimum | 2008 |
---|---|
5-th percentile | 2016 |
Q1 | 2018 |
median | 2020 |
Q3 | 2021 |
95-th percentile | 2022 |
Maximum | 2022 |
Range | 14 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.2485766 |
---|---|
Coefficient of variation (CV) | 0.001113537 |
Kurtosis | 0.76779048 |
Mean | 2019.3102 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.8268029 |
Sum | 2505964 |
Variance | 5.0560968 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
2022 | 236 | |
2021 | 224 | |
2020 | 205 | |
2019 | 161 | |
2018 | 132 | |
2017 | 124 | |
2016 | 99 | |
2015 | 30 | 2.4% |
2014 | 21 | 1.7% |
2009 | 3 | 0.2% |
Other values (3) | 6 | 0.5% |
Value | Count | Frequency (%) |
2008 | 1 | 0.1% |
2009 | 3 | 0.2% |
2012 | 3 | 0.2% |
2013 | 2 | 0.2% |
2014 | 21 | 1.7% |
2015 | 30 | 2.4% |
2016 | 99 | |
2017 | 124 | |
2018 | 132 | |
2019 | 161 |
Value | Count | Frequency (%) |
2022 | 236 | |
2021 | 224 | |
2020 | 205 | |
2019 | 161 | |
2018 | 132 | |
2017 | 124 | |
2016 | 99 | |
2015 | 30 | 2.4% |
2014 | 21 | 1.7% |
2013 | 2 | 0.2% |
구분 | 등록연도 | |
---|---|---|
구분 | 1.000 | 0.853 |
등록연도 | 0.853 | 1.000 |
구분 | 등록연도 | |
---|---|---|
구분 | 1.000 | 0.989 |
등록연도 | 0.989 | 1.000 |
구분 | 기업명 | 사업자등록번호 | 등록연도 | |
---|---|---|---|---|
0 | 1 | ㈜비티웍스 | 120-86-42098 | 2008 |
1 | 2 | ㈜라스테크 | 314-81-38438 | 2009 |
2 | 3 | 서울프로폴리스㈜ | 314-81-57605 | 2009 |
3 | 4 | ㈜케이에너지 | 314-86-01557 | 2009 |
4 | 5 | 호전에이블 | 314-86-31949 | 2012 |
5 | 6 | ㈜세이프텍리서치 | 314-86-39131 | 2012 |
6 | 7 | ㈜뉴런 | 504-86-00879 | 2012 |
7 | 8 | ㈜그린모빌리티 | 514-81-84847 | 2013 |
8 | 9 | ㈜에스엠나노바이오 | 314-86-51514 | 2013 |
9 | 10 | ㈜디지엠텍 | 514-81-91464 | 2014 |
구분 | 기업명 | 사업자등록번호 | 등록연도 | |
---|---|---|---|---|
1231 | 1232 | ㈜스마트세이프티랩 | 784-86-02755 | 2022 |
1232 | 1233 | ㈜케이컨스 | 682-88-02623 | 2022 |
1233 | 1234 | ㈜광명이엔지 | 138-81-50783 | 2022 |
1234 | 1235 | ㈜에스에스월드 | 535-88-01857 | 2022 |
1235 | 1236 | ㈜케이에스씨 | 217-81-50086 | 2022 |
1236 | 1237 | ㈜골다공인공지능 | 571-88-02546 | 2022 |
1237 | 1238 | 유한회사 케이에듀 | 545-87-01328 | 2022 |
1238 | 1239 | ㈜유엔에스바이오 | 793-88-02734 | 2022 |
1239 | 1240 | ㈜에어트러스트 | 160-87-02131 | 2022 |
1240 | 1241 | 케이유융합소프트웨어연구센터㈜ | 326-87-02032 | 2022 |