Overview

Dataset statistics

Number of variables: 14
Number of observations: 1377
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 153.4 KiB
Average record size in memory: 114.1 B
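
The average record size follows from the total memory size and the number of observations. A minimal arithmetic sketch, assuming 1 KiB = 1024 bytes; the variable names are illustrative and not part of the report:

```python
# Figures copied from the "Dataset statistics" block.
TOTAL_SIZE_KIB = 153.4
N_OBSERVATIONS = 1377
N_VARIABLES = 14

total_bytes = TOTAL_SIZE_KIB * 1024              # assumption: 1 KiB = 1024 bytes
avg_record_bytes = total_bytes / N_OBSERVATIONS  # bytes per row
total_cells = N_VARIABLES * N_OBSERVATIONS       # denominator of "Missing cells (%)"

print(f"average record size: {avg_record_bytes:.1f} B")
print(f"total cells: {total_cells}")
```

The cell total is the denominator behind the "Missing cells (%)" figure.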

Variable types

Categorical: 11
Numeric: 1
Text: 2

Dataset

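Description: This dataset records the status of companies that received Development Technology Commercialization funding (개발기술사업화자금). It includes the reason for application, a company overview, and the loan amount level.
URL: https://www.data.go.kr/data/15120131/fileData.do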

Alerts

지원연도 has constant value "2022" (Constant)
자금명 has constant value "개발기술사업화" (Constant)
업력구분(중진공) is highly overall correlated with 특허담보 여부 (High correlation)
매출액 is highly overall correlated with 특허담보 여부 (High correlation)
대여금액(합계_백만원) is highly overall correlated with 특허담보 여부 (High correlation)
업종 is highly overall correlated with 특허담보 여부 (High correlation)
특허담보 여부 is highly overall correlated with 업체번호 and 8 other fields (High correlation)
자산규모 is highly overall correlated with 특허담보 여부 (High correlation)
신청사유 is highly overall correlated with 특허담보 여부 (High correlation)
종업원규모 is highly overall correlated with 특허담보 여부 (High correlation)
지역 is highly overall correlated with 특허담보 여부 (High correlation)
업체번호 is highly overall correlated with 특허담보 여부 (High correlation)
특허담보 여부 is highly imbalanced (66.3%) (Imbalance)
업체번호 has unique values (Unique)
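
The 66.3% imbalance figure for 특허담보 여부 can be reproduced from its two value counts (1291 and 86) if the score is taken to be one minus the normalized Shannon entropy. That definition is an assumption here, since this report does not state it, and the helper name `imbalance_score` is hypothetical:

```python
import math

# Value counts for 특허담보 여부, copied from its "Common Values" table.
counts = {"<NA>": 1291, "특허": 86}


def imbalance_score(value_counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits.

    Assumed definition: 0.0 for a perfectly even split, close to 1.0 when a
    single class dominates.
    """
    k = len(value_counts)
    if k < 2:
        raise ValueError("the score needs at least two classes")
    total = sum(value_counts.values())
    entropy = -sum(
        (n / total) * math.log2(n / total) for n in value_counts.values() if n > 0
    )
    return 1.0 - entropy / math.log2(k)


print(f"{imbalance_score(counts):.1%}")
```

Under this definition a 50/50 split scores 0, so the score measures how far the distribution is from uniform rather than the share of the majority class (93.8%).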

Reproduction

Analysis started: 2023-12-12 13:32:21.119281
Analysis finished: 2023-12-12 13:32:23.283470
Duration: 2.16 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
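
A report of this kind could be regenerated roughly as follows. This is a sketch under stated assumptions, not the configuration actually used: the CSV path, the `cp949` encoding, the report title, and the function name `build_report` are all hypothetical, and the packages `pandas` and `ydata-profiling` must be installed.

```python
def build_report(csv_path, output_path="report.html", encoding="cp949"):
    """Profile the CSV at csv_path and write an HTML report to output_path."""
    # Imported lazily so this module loads even without the optional packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is a CSV in a Korean legacy encoding.
    df = pd.read_csv(csv_path, encoding=encoding)

    report = ProfileReport(
        df,
        title="Development Technology Commercialization funding (2022)",  # hypothetical title
        dataset={
            "description": "Status of companies that received the funding.",
            "url": "https://www.data.go.kr/data/15120131/fileData.do",
        },
    )
    report.to_file(output_path)
    return report
```

Calling `build_report("funding_2022.csv")` with a local copy of the file (the file name is a placeholder) would write `report.html` next to the script.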

Variables

지원연도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB
2022: 1377

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

ValueCountFrequency (%)
2022 1377
100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

ValueCountFrequency (%)
2022 1377
100.0%

업체번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 1377
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 689
Minimum: 1
Maximum: 1377
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 12.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 69.8
Q1: 345
Median: 689
Q3: 1033
95-th percentile: 1308.2
Maximum: 1377
Range: 1376
Interquartile range (IQR): 688

Descriptive statistics

Standard deviation: 397.64997
Coefficient of variation (CV): 0.57714074
Kurtosis: -1.2
Mean: 689
Median Absolute Deviation (MAD): 344
Skewness: 0
Sum: 948753
Variance: 158125.5
Monotonicity: Strictly increasing
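
These statistics are what one would expect if 업체번호 were simply a running index from 1 to 1377. A short check under that assumption (the report itself only states the summary values), using the sample variance with n - 1 in the denominator:

```python
import math

# Assumption: 업체번호 takes each integer from 1 to 1377 exactly once.
ids = range(1, 1378)
n = len(ids)

total = sum(ids)
mean = total / n
variance = sum((x - mean) ** 2 for x in ids) / (n - 1)  # sample variance
std = math.sqrt(variance)
cv = std / mean                                          # coefficient of variation

print(total, mean, variance)
print(f"{std:.5f} {cv:.8f}")
```

If the values match, the column carries no information beyond row order, which is worth keeping in mind when reading its correlation figures.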
[Figure: histogram with fixed size bins (bins=50)]
ValueCountFrequency (%)
1 1
 
0.1%
916 1
 
0.1%
924 1
 
0.1%
923 1
 
0.1%
922 1
 
0.1%
921 1
 
0.1%
920 1
 
0.1%
919 1
 
0.1%
918 1
 
0.1%
917 1
 
0.1%
Other values (1367) 1367
99.3%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1377 1
0.1%
1376 1
0.1%
1375 1
0.1%
1374 1
0.1%
1373 1
0.1%
1372 1
0.1%
1371 1
0.1%
1370 1
0.1%
1369 1
0.1%
1368 1
0.1%

지역
Categorical

HIGH CORRELATION 

Distinct: 17
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB
경기: 379
서울: 175
경남: 113
경북: 110
부산: 86
Other values (12): 514

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 충남
2nd row: 충남
3rd row: 경기
4th row: 강원
5th row: 강원

Common Values

ValueCountFrequency (%)
경기 379
27.5%
서울 175
12.7%
경남 113
 
8.2%
경북 110
 
8.0%
부산 86
 
6.2%
충북 67
 
4.9%
대구 64
 
4.6%
충남 62
 
4.5%
인천 61
 
4.4%
전남 53
 
3.8%
Other values (7) 207
15.0%

Length

[Figure: histogram of lengths of the category]
ValueCountFrequency (%)
경기 379
27.5%
서울 175
12.7%
경남 113
 
8.2%
경북 110
 
8.0%
부산 86
 
6.2%
충북 67
 
4.9%
대구 64
 
4.6%
충남 62
 
4.5%
인천 61
 
4.4%
전남 53
 
3.8%
Other values (7) 207
15.0%

업력구분(중진공)
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB
10년미만: 286
20년이상: 214
15년미만: 213
5년미만: 167
3년미만: 159
Other values (3): 338

Length

Max length: 5
Median length: 5
Mean length: 4.6318083
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 3년미만
2nd row: 3년미만
3rd row: 15년미만
4th row: 10년미만
5th row: 1년미만

Common Values

ValueCountFrequency (%)
10년미만 286
20.8%
20년이상 214
15.5%
15년미만 213
15.5%
5년미만 167
12.1%
3년미만 159
11.5%
20년미만 157
11.4%
7년미만 133
9.7%
1년미만 48
 
3.5%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

ValueCountFrequency (%)
10년미만 286
20.8%
20년이상 214
15.5%
15년미만 213
15.5%
5년미만 167
12.1%
3년미만 159
11.5%
20년미만 157
11.4%
7년미만 133
9.7%
1년미만 48
 
3.5%

자금명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB
개발기술사업화: 1377

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 개발기술사업화
2nd row: 개발기술사업화
3rd row: 개발기술사업화
4th row: 개발기술사업화
5th row: 개발기술사업화

Common Values

ValueCountFrequency (%)
개발기술사업화 1377
100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

ValueCountFrequency (%)
개발기술사업화 1377
100.0%

신청사유
Categorical

HIGH CORRELATION 

Distinct: 15
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB
특허, 실용신안 또는 저작권 등록 기술: 805
기업부설연구소 보유 기업이 개발한 기술: 192
<NA>: 91
연구개발전담부서 보유 기업이 개발한 기술: 79
벤처기업: 47
Other values (10): 163

Length

Max length: 73
Median length: 21
Mean length: 19.501089
Min length: 4

Unique

Unique: 3
Unique (%): 0.2%

Sample

1st row: 벤처기업
2nd row: 벤처기업
3rd row: Inno-Biz기업
4th row: 기업부설연구소 보유 기업이 개발한 기술
5th row: 국내/외 대학, 연구기관, 기업, 기술거래기관 등으로부터 이전 받은 기술

Common Values

ValueCountFrequency (%)
특허, 실용신안 또는 저작권 등록 기술 805
58.5%
기업부설연구소 보유 기업이 개발한 기술 192
 
13.9%
<NA> 91
 
6.6%
연구개발전담부서 보유 기업이 개발한 기술 79
 
5.7%
벤처기업 47
 
3.4%
Main-Biz기업 46
 
3.3%
Inno-Biz기업 42
 
3.1%
중기부 R&D사업에 참여하여 기술개발에 성공(완료)한 기술 42
 
3.1%
중기부 외 R&D사업에 참여하여 기술개발에 성공(완료)한 기술 13
 
0.9%
정부 및 정부 공인기관이 인증한 기술 (신기술(NET), 전력신기술, 건설신기술, 보건신기술(HT),공공기관 통합기술마켓 인증 등) 8
 
0.6%
Other values (5) 12
 
0.9%

Length

[Figure: histogram of lengths of the category]
ValueCountFrequency (%)
기술 1149
16.6%
특허 805
11.6%
등록 805
11.6%
실용신안 805
11.6%
저작권 805
11.6%
또는 805
11.6%
개발한 272
 
3.9%
보유 271
 
3.9%
기업이 271
 
3.9%
기업부설연구소 192
 
2.8%
Other values (42) 759
10.9%

특허담보 여부
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB
<NA>: 1291
특허: 86

Length

Max length: 4
Median length: 4
Mean length: 3.8750908
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

ValueCountFrequency (%)
<NA> 1291
93.8%
특허 86
 
6.2%
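
The reported mean length of 3.8750908 is consistent with `<NA>` being stored as a literal four-character string rather than a true missing value, which would also explain why the column reports 0 missing cells. A small check of that reading (an inference from the figures, not something the report states):

```python
# Counts from the 특허담보 여부 "Common Values" table.
counts = {"<NA>": 1291, "특허": 86}

total = sum(counts.values())
# Weighted mean of string lengths: len("<NA>") == 4, len("특허") == 2.
mean_length = sum(len(value) * n for value, n in counts.items()) / total
placeholder_share = counts["<NA>"] / total

print(f"{mean_length:.7f}")
print(f"{placeholder_share:.1%}")
```

If the placeholder were treated as missing, the only remaining value would be 특허 and the column would effectively be a flag.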

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

ValueCountFrequency (%)
na 1291
93.8%
특허 86
 
6.2%

대여금액(합계_백만원)
Categorical

HIGH CORRELATION 

Distinct: 11
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB
2억이하: 525
1억이하: 495
3억이하: 202
5억이하: 87
4억이하: 30
Other values (6): 38

Length

Max length: 5
Median length: 4
Mean length: 4.0210603
Min length: 4

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 3억이하
2nd row: 2억이하
3rd row: 5억이하
4th row: 3억이하
5th row: 5억이하

Common Values

ValueCountFrequency (%)
2억이하 525
38.1%
1억이하 495
35.9%
3억이하 202
 
14.7%
5억이하 87
 
6.3%
4억이하 30
 
2.2%
10억이하 11
 
0.8%
7억이하 9
 
0.7%
20억이하 7
 
0.5%
15억이하 7
 
0.5%
30억이하 3
 
0.2%

Length

[Figure: histogram of lengths of the category]
ValueCountFrequency (%)
2억이하 525
38.1%
1억이하 495
35.9%
3억이하 202
 
14.7%
5억이하 87
 
6.3%
4억이하 30
 
2.2%
10억이하 11
 
0.8%
7억이하 9
 
0.7%
20억이하 7
 
0.5%
15억이하 7
 
0.5%
30억이하 3
 
0.2%

업종
Categorical

HIGH CORRELATION 

Distinct: 11
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB
기계: 269
금속: 188
정보: 146
화공: 143
전자: 135
Other values (6): 496

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전자
2nd row: 기타
3rd row: 금속
4th row: 기타
5th row: 식료

Common Values

ValueCountFrequency (%)
기계 269
19.5%
금속 188
13.7%
정보 146
10.6%
화공 143
10.4%
전자 135
9.8%
잡화 107
 
7.8%
전기 101
 
7.3%
기타 98
 
7.1%
식료 87
 
6.3%
유통 65
 
4.7%

Length

[Figure: histogram of lengths of the category]
ValueCountFrequency (%)
기계 269
19.5%
금속 188
13.7%
정보 146
10.6%
화공 143
10.4%
전자 135
9.8%
잡화 107
 
7.8%
전기 101
 
7.3%
기타 98
 
7.1%
식료 87
 
6.3%
유통 65
 
4.7%

주생산품
Text

Distinct: 1299
Distinct (%): 94.3%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB

Length

Max length: 57
Median length: 30
Mean length: 10.578794
Min length: 2

Characters and Unicode

Total characters: 14567
Distinct characters: 622
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
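
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one sample value from this column; the mapping to readable names and the helper `category_counts` are illustrative and do not reflect how the profiling tool computes its tables.

```python
import unicodedata
from collections import Counter

# Readable names for the Unicode general-category codes seen in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
}


def category_counts(text):
    """Count characters in text by Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    # Fall back to the raw two-letter code for categories not named above.
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


sample = "송어양식, PDRN필러 등"  # 4th sample row of this column
for name, count in category_counts(sample).most_common():
    print(name, count)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the tables for this column.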

Unique

Unique: 1234
Unique (%): 89.6%

Sample

1st row: 무선통신장비제조, 스마트팜 구축 등
2nd row: 중전기 절연진단 서비스
3rd row: 에어샤워기
4th row: 송어양식, PDRN필러 등
5th row: 건강기능식품 원료
ValueCountFrequency (%)
147
 
4.5%
제조 110
 
3.4%
97
 
3.0%
소프트웨어 32
 
1.0%
개발 31
 
0.9%
부품 23
 
0.7%
시스템 22
 
0.7%
플랫폼 20
 
0.6%
가공 18
 
0.6%
산업용 18
 
0.6%
Other values (1950) 2748
84.1%

Most occurring characters

ValueCountFrequency (%)
2023
 
13.9%
, 475
 
3.3%
387
 
2.7%
286
 
2.0%
271
 
1.9%
240
 
1.6%
230
 
1.6%
208
 
1.4%
179
 
1.2%
173
 
1.2%
Other values (612) 10095
69.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11142
76.5%
Space Separator 2023
 
13.9%
Other Punctuation 523
 
3.6%
Uppercase Letter 489
 
3.4%
Lowercase Letter 130
 
0.9%
Open Punctuation 120
 
0.8%
Close Punctuation 120
 
0.8%
Decimal Number 14
 
0.1%
Dash Punctuation 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
387
 
3.5%
286
 
2.6%
271
 
2.4%
240
 
2.2%
230
 
2.1%
208
 
1.9%
179
 
1.6%
173
 
1.6%
172
 
1.5%
171
 
1.5%
Other values (555) 8825
79.2%
Uppercase Letter
ValueCountFrequency (%)
S 52
 
10.6%
C 48
 
9.8%
E 41
 
8.4%
P 38
 
7.8%
D 34
 
7.0%
L 31
 
6.3%
T 31
 
6.3%
I 29
 
5.9%
A 27
 
5.5%
R 27
 
5.5%
Other values (13) 131
26.8%
Lowercase Letter
ValueCountFrequency (%)
o 16
12.3%
e 12
 
9.2%
a 11
 
8.5%
l 10
 
7.7%
i 10
 
7.7%
s 8
 
6.2%
n 8
 
6.2%
p 8
 
6.2%
r 7
 
5.4%
t 6
 
4.6%
Other values (12) 34
26.2%
Other Punctuation
ValueCountFrequency (%)
, 475
90.8%
/ 41
 
7.8%
. 4
 
0.8%
& 2
 
0.4%
· 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
3 9
64.3%
2 4
28.6%
4 1
 
7.1%
Space Separator
ValueCountFrequency (%)
2023
100.0%
Open Punctuation
ValueCountFrequency (%)
( 120
100.0%
Close Punctuation
ValueCountFrequency (%)
) 120
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11142
76.5%
Common 2806
 
19.3%
Latin 619
 
4.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
387
 
3.5%
286
 
2.6%
271
 
2.4%
240
 
2.2%
230
 
2.1%
208
 
1.9%
179
 
1.6%
173
 
1.6%
172
 
1.5%
171
 
1.5%
Other values (555) 8825
79.2%
Latin
ValueCountFrequency (%)
S 52
 
8.4%
C 48
 
7.8%
E 41
 
6.6%
P 38
 
6.1%
D 34
 
5.5%
L 31
 
5.0%
T 31
 
5.0%
I 29
 
4.7%
A 27
 
4.4%
R 27
 
4.4%
Other values (35) 261
42.2%
Common
ValueCountFrequency (%)
2023
72.1%
, 475
 
16.9%
( 120
 
4.3%
) 120
 
4.3%
/ 41
 
1.5%
3 9
 
0.3%
- 6
 
0.2%
. 4
 
0.1%
2 4
 
0.1%
& 2
 
0.1%
Other values (2) 2
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11142
76.5%
ASCII 3424
 
23.5%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2023
59.1%
, 475
 
13.9%
( 120
 
3.5%
) 120
 
3.5%
S 52
 
1.5%
C 48
 
1.4%
/ 41
 
1.2%
E 41
 
1.2%
P 38
 
1.1%
D 34
 
1.0%
Other values (46) 432
 
12.6%
Hangul
ValueCountFrequency (%)
387
 
3.5%
286
 
2.6%
271
 
2.4%
240
 
2.2%
230
 
2.1%
208
 
1.9%
179
 
1.6%
173
 
1.6%
172
 
1.5%
171
 
1.5%
Other values (555) 8825
79.2%
None
ValueCountFrequency (%)
· 1
100.0%

산업품목코드명
Text

Distinct: 391
Distinct (%): 28.4%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB

Length

Max length: 30
Median length: 25
Mean length: 15.801743
Min length: 3

Characters and Unicode

Total characters: 21759
Distinct characters: 332
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 159
Unique (%): 11.5%

Sample

1st row: 방송장비 제조업
2nd row: 기타 기술 시험, 검사 및 분석업
3rd row: 금속 위생용품 제조업
4th row: 내수면 양식 어업
5th row: 건강 기능식품 제조업
ValueCountFrequency (%)
제조업 961
 
14.6%
648
 
9.8%
기타 408
 
6.2%
201
 
3.1%
199
 
3.0%
금속 109
 
1.7%
소프트웨어 101
 
1.5%
개발 94
 
1.4%
공급업 94
 
1.4%
기계 81
 
1.2%
Other values (672) 3690
56.0%

Most occurring characters

ValueCountFrequency (%)
5209
23.9%
1455
 
6.7%
1285
 
5.9%
1135
 
5.2%
902
 
4.1%
648
 
3.0%
458
 
2.1%
419
 
1.9%
383
 
1.8%
301
 
1.4%
Other values (322) 9564
44.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16337
75.1%
Space Separator 5209
 
23.9%
Other Punctuation 178
 
0.8%
Open Punctuation 14
 
0.1%
Close Punctuation 14
 
0.1%
Decimal Number 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1455
 
8.9%
1285
 
7.9%
1135
 
6.9%
902
 
5.5%
648
 
4.0%
458
 
2.8%
419
 
2.6%
383
 
2.3%
301
 
1.8%
286
 
1.8%
Other values (317) 9065
55.5%
Space Separator
ValueCountFrequency (%)
5209
100.0%
Other Punctuation
ValueCountFrequency (%)
, 178
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Decimal Number
ValueCountFrequency (%)
1 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16337
75.1%
Common 5422
 
24.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1455
 
8.9%
1285
 
7.9%
1135
 
6.9%
902
 
5.5%
648
 
4.0%
458
 
2.8%
419
 
2.6%
383
 
2.3%
301
 
1.8%
286
 
1.8%
Other values (317) 9065
55.5%
Common
ValueCountFrequency (%)
5209
96.1%
, 178
 
3.3%
( 14
 
0.3%
) 14
 
0.3%
1 7
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16302
74.9%
ASCII 5422
 
24.9%
Compat Jamo 35
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5209
96.1%
, 178
 
3.3%
( 14
 
0.3%
) 14
 
0.3%
1 7
 
0.1%
Hangul
ValueCountFrequency (%)
1455
 
8.9%
1285
 
7.9%
1135
 
7.0%
902
 
5.5%
648
 
4.0%
458
 
2.8%
419
 
2.6%
383
 
2.3%
301
 
1.8%
286
 
1.8%
Other values (316) 9030
55.4%
Compat Jamo
ValueCountFrequency (%)
35
100.0%

종업원규모
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB
20인미만: 357
10인미만: 345
5인미만: 307
50인미만: 283
100인미만: 69

Length

Max length: 6
Median length: 5
Mean length: 4.83878
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 10인미만
2nd row: 10인미만
3rd row: 20인미만
4th row: 20인미만
5th row: 5인미만

Common Values

ValueCountFrequency (%)
20인미만 357
25.9%
10인미만 345
25.1%
5인미만 307
22.3%
50인미만 283
20.6%
100인미만 69
 
5.0%
300인미만 16
 
1.2%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

ValueCountFrequency (%)
20인미만 357
25.9%
10인미만 345
25.1%
5인미만 307
22.3%
50인미만 283
20.6%
100인미만 69
 
5.0%
300인미만 16
 
1.2%

자산규모
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB
30억미만: 380
10억미만: 318
70억미만: 313
100억미만: 133
200억미만: 122
Other values (2): 111

Length

Max length: 6
Median length: 5
Mean length: 5.1801017
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 30억미만
2nd row: 10억미만
3rd row: 30억미만
4th row: 70억미만
5th row: 10억미만

Common Values

ValueCountFrequency (%)
30억미만 380
27.6%
10억미만 318
23.1%
70억미만 313
22.7%
100억미만 133
 
9.7%
200억미만 122
 
8.9%
<NA> 59
 
4.3%
200억이상 52
 
3.8%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

ValueCountFrequency (%)
30억미만 380
27.6%
10억미만 318
23.1%
70억미만 313
22.7%
100억미만 133
 
9.7%
200억미만 122
 
8.9%
na 59
 
4.3%
200억이상 52
 
3.8%

매출액
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 10.9 KiB
50억미만: 614
100억미만: 229
5억미만: 172
10억미만: 147
300억미만: 137
Other values (2): 78

Length

Max length: 6
Median length: 5
Mean length: 5.1118373
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 50억미만
2nd row: 10억미만
3rd row: 50억미만
4th row: 5억미만
5th row: 5억미만

Common Values

ValueCountFrequency (%)
50억미만 614
44.6%
100억미만 229
 
16.6%
5억미만 172
 
12.5%
10억미만 147
 
10.7%
300억미만 137
 
9.9%
<NA> 59
 
4.3%
300억이상 19
 
1.4%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

ValueCountFrequency (%)
50억미만 614
44.6%
100억미만 229
 
16.6%
5억미만 172
 
12.5%
10억미만 147
 
10.7%
300억미만 137
 
9.9%
na 59
 
4.3%
300억이상 19
 
1.4%

Interactions

[Figure: interactions plot]

Correlations

Variable | 업체번호 | 지역 | 업력구분(중진공) | 신청사유 | 대여금액(합계_백만원) | 업종 | 종업원규모 | 자산규모 | 매출액
업체번호 | 1.000 | 0.194 | 0.057 | 0.054 | 0.096 | 0.037 | 0.146 | 0.159 | 0.143
지역 | 0.194 | 1.000 | 0.078 | 0.311 | 0.117 | 0.437 | 0.067 | 0.159 | 0.134
업력구분(중진공) | 0.057 | 0.078 | 1.000 | 0.216 | 0.306 | 0.150 | 0.368 | 0.462 | 0.468
신청사유 | 0.054 | 0.311 | 0.216 | 1.000 | 0.000 | 0.264 | 0.129 | 0.196 | 0.189
대여금액(합계_백만원) | 0.096 | 0.117 | 0.306 | 0.000 | 1.000 | 0.086 | 0.439 | 0.494 | 0.497
업종 | 0.037 | 0.437 | 0.150 | 0.264 | 0.086 | 1.000 | 0.053 | 0.213 | 0.193
종업원규모 | 0.146 | 0.067 | 0.368 | 0.129 | 0.439 | 0.053 | 1.000 | 0.791 | 0.776
자산규모 | 0.159 | 0.159 | 0.462 | 0.196 | 0.494 | 0.213 | 0.791 | 1.000 | 0.858
매출액 | 0.143 | 0.134 | 0.468 | 0.189 | 0.497 | 0.193 | 0.776 | 0.858 | 1.000

Variable | 업력구분(중진공) | 매출액 | 대여금액(합계_백만원) | 업종 | 특허담보 여부 | 자산규모 | 신청사유 | 종업원규모 | 지역
업력구분(중진공) | 1.000 | 0.282 | 0.149 | 0.071 | 1.000 | 0.278 | 0.096 | 0.214 | 0.032
매출액 | 0.282 | 1.000 | 0.281 | 0.100 | 1.000 | 0.482 | 0.094 | 0.384 | 0.063
대여금액(합계_백만원) | 0.149 | 0.281 | 1.000 | 0.025 | 1.000 | 0.279 | 0.000 | 0.242 | 0.044
업종 | 0.071 | 0.100 | 0.025 | 1.000 | 1.000 | 0.110 | 0.106 | 0.027 | 0.178
특허담보 여부 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
자산규모 | 0.278 | 0.482 | 0.279 | 0.110 | 1.000 | 1.000 | 0.097 | 0.400 | 0.075
신청사유 | 0.096 | 0.094 | 0.000 | 0.106 | 1.000 | 0.097 | 1.000 | 0.063 | 0.111
종업원규모 | 0.214 | 0.384 | 0.242 | 0.027 | 1.000 | 0.400 | 0.063 | 1.000 | 0.031
지역 | 0.032 | 0.063 | 0.044 | 0.178 | 1.000 | 0.075 | 0.111 | 0.031 | 1.000

Variable | 업체번호 | 지역 | 업력구분(중진공) | 신청사유 | 특허담보 여부 | 대여금액(합계_백만원) | 업종 | 종업원규모 | 자산규모 | 매출액
업체번호 | 1.000 | 0.076 | 0.027 | 0.022 | 1.000 | 0.041 | 0.016 | 0.077 | 0.084 | 0.075
지역 | 0.076 | 1.000 | 0.032 | 0.111 | 1.000 | 0.044 | 0.178 | 0.031 | 0.075 | 0.063
업력구분(중진공) | 0.027 | 0.032 | 1.000 | 0.096 | 1.000 | 0.149 | 0.071 | 0.214 | 0.278 | 0.282
신청사유 | 0.022 | 0.111 | 0.096 | 1.000 | 1.000 | 0.000 | 0.106 | 0.063 | 0.097 | 0.094
특허담보 여부 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
대여금액(합계_백만원) | 0.041 | 0.044 | 0.149 | 0.000 | 1.000 | 1.000 | 0.025 | 0.242 | 0.279 | 0.281
업종 | 0.016 | 0.178 | 0.071 | 0.106 | 1.000 | 0.025 | 1.000 | 0.027 | 0.110 | 0.100
종업원규모 | 0.077 | 0.031 | 0.214 | 0.063 | 1.000 | 0.242 | 0.027 | 1.000 | 0.400 | 0.384
자산규모 | 0.084 | 0.075 | 0.278 | 0.097 | 1.000 | 0.279 | 0.110 | 0.400 | 1.000 | 0.482
매출액 | 0.075 | 0.063 | 0.282 | 0.094 | 1.000 | 0.281 | 0.100 | 0.384 | 0.482 | 1.000
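
This extract does not say which coefficient each matrix uses. For pairs of categorical columns, one common association measure is Cramér's V. The sketch below shows the uncorrected textbook form on hypothetical toy data. It is not claimed to be the formula behind the matrices above, and a profiling tool may apply a bias correction that this version omits.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for r, r_count in row_totals.items():
        for c, c_count in col_totals.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        # A constant column makes V undefined; returning 0.0 is a convention
        # chosen for this sketch.
        return 0.0
    return math.sqrt(chi2 / (n * k))


# Hypothetical toy data, not taken from the dataset.
perfectly_linked = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(perfectly_linked, independent)
```

V ranges from 0 (no association) to 1 (one variable fully determines the other), which is the scale the matrices above appear to share.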

Missing values

[Figure: a simple visualization of nullity by column]
[Figure: nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion]
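
The report counts 0 missing cells, yet several Common Values tables list `<NA>` as a value. If those entries are literal placeholder strings (an assumption inferred from the length statistics, not stated by the report), the effective share of missing cells can be estimated from the reported counts. Only columns whose tables show `<NA>` are included; the two text columns are not covered.

```python
# "<NA>" counts copied from the Common Values tables of each column.
placeholder_counts = {
    "신청사유": 91,
    "특허담보 여부": 1291,
    "자산규모": 59,
    "매출액": 59,
}

N_ROWS, N_COLS = 1377, 14
total_cells = N_ROWS * N_COLS

placeholder_cells = sum(placeholder_counts.values())
placeholder_share = placeholder_cells / total_cells

print(f"{placeholder_cells} of {total_cells} cells ({placeholder_share:.1%})")
```

Converting the placeholder to a real missing value before profiling would make the Missing values section reflect these gaps.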

Sample

First rows

Index | 지원연도 | 업체번호 | 지역 | 업력구분(중진공) | 자금명 | 신청사유 | 특허담보 여부 | 대여금액(합계_백만원) | 업종 | 주생산품 | 산업품목코드명 | 종업원규모 | 자산규모 | 매출액
0 | 2022 | 1 | 충남 | 3년미만 | 개발기술사업화 | 벤처기업 | <NA> | 3억이하 | 전자 | 무선통신장비제조, 스마트팜 구축 등 | 방송장비 제조업 | 10인미만 | 30억미만 | 50억미만
1 | 2022 | 2 | 충남 | 3년미만 | 개발기술사업화 | 벤처기업 | <NA> | 2억이하 | 기타 | 중전기 절연진단 서비스 | 기타 기술 시험, 검사 및 분석업 | 10인미만 | 10억미만 | 10억미만
2 | 2022 | 3 | 경기 | 15년미만 | 개발기술사업화 | Inno-Biz기업 | <NA> | 5억이하 | 금속 | 에어샤워기 | 금속 위생용품 제조업 | 20인미만 | 30억미만 | 50억미만
3 | 2022 | 4 | 강원 | 10년미만 | 개발기술사업화 | 기업부설연구소 보유 기업이 개발한 기술 | <NA> | 3억이하 | 기타 | 송어양식, PDRN필러 등 | 내수면 양식 어업 | 20인미만 | 70억미만 | 5억미만
4 | 2022 | 5 | 강원 | 1년미만 | 개발기술사업화 | 국내/외 대학, 연구기관, 기업, 기술거래기관 등으로부터 이전 받은 기술 | <NA> | 5억이하 | 식료 | 건강기능식품 원료 | 건강 기능식품 제조업 | 5인미만 | 10억미만 | 5억미만
5 | 2022 | 6 | 강원 | 1년미만 | 개발기술사업화 | 국내/외 대학, 연구기관, 기업, 기술거래기관 등으로부터 이전 받은 기술 | <NA> | 5억이하 | 식료 | 건강기능식품 원료 | 건강 기능식품 제조업 | 5인미만 | 10억미만 | 5억미만
6 | 2022 | 7 | 부산 | 7년미만 | 개발기술사업화 | 중기부 R&D사업에 참여하여 기술개발에 성공(완료)한 기술 | <NA> | 2억이하 | 전기 | 차단기 | 전기회로 개폐, 보호 장치 제조업 | 5인미만 | 30억미만 | 50억미만
7 | 2022 | 8 | 부산 | 20년미만 | 개발기술사업화 | 특허, 실용신안 또는 저작권 등록 기술 | <NA> | 1억이하 | 전기 | 태양광 발전기 부품(인버터) | 기타 전기 변환장치 제조업 | 20인미만 | 200억미만 | 100억미만
8 | 2022 | 9 | 인천 | 15년미만 | 개발기술사업화 | 특허, 실용신안 또는 저작권 등록 기술 | <NA> | 5억이하 | 화공 | 화장품용 방부제 | 그 외 기타 분류 안된 화학제품 제조업 | 5인미만 | 30억미만 | 50억미만
9 | 2022 | 10 | 인천 | 15년미만 | 개발기술사업화 | 특허, 실용신안 또는 저작권 등록 기술 | <NA> | 1억이하 | 화공 | 화장품용 방부제 | 그 외 기타 분류 안된 화학제품 제조업 | 5인미만 | 30억미만 | 50억미만
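
Last rows

Index | 지원연도 | 업체번호 | 지역 | 업력구분(중진공) | 자금명 | 신청사유 | 특허담보 여부 | 대여금액(합계_백만원) | 업종 | 주생산품 | 산업품목코드명 | 종업원규모 | 자산규모 | 매출액
1367 | 2022 | 1368 | 경북 | 10년미만 | 개발기술사업화 | <NA> | <NA> | 3억이하 | 화공 | 페인트 | 일반용 도료 및 관련제품 제조업 | 20인미만 | 100억미만 | 100억미만
1368 | 2022 | 1369 | 경북 | 20년미만 | 개발기술사업화 | <NA> | <NA> | 2억이하 | 섬유 | 부직포,이불솜 | 그 외 기타 분류 안된 섬유제품 제조업 | 5인미만 | 70억미만 | 50억미만
1369 | 2022 | 1370 | 전남 | 20년미만 | 개발기술사업화 | <NA> | <NA> | 1억이하 | 전기 | 전기,통신,소방공사 | 일반 전기 공사업 | 20인미만 | 70억미만 | 50억미만
1370 | 2022 | 1371 | 경북 | 10년미만 | 개발기술사업화 | <NA> | <NA> | 2억이하 | 화공 | 화학코팅약품, 접착제 | 접착제 및 젤라틴 제조업 | 20인미만 | 70억미만 | 50억미만
1371 | 2022 | 1372 | 경남 | 20년이상 | 개발기술사업화 | <NA> | <NA> | 1억이하 | 화공 | 자동차페달 덮개용 패드, 스폰지 패드 | 그 외 기타 고무제품 제조업 | 50인미만 | 200억미만 | 100억미만
1372 | 2022 | 1373 | 광주 | 20년이상 | 개발기술사업화 | <NA> | <NA> | 1억이하 | 전자 | 산업 처리공정 제어 장비 제조업 | 산업 처리공정 제어장비 제조업 | 20인미만 | 70억미만 | 100억미만
1373 | 2022 | 1374 | 경북 | 10년미만 | 개발기술사업화 | <NA> | <NA> | 2억이하 | 기계 | 특장차용 부품(손잡이, 힌지 등) | 자동차 차체용 신품 부품 제조업 | 20인미만 | 30억미만 | 50억미만
1374 | 2022 | 1375 | 충북 | 10년미만 | 개발기술사업화 | <NA> | <NA> | 2억이하 | 식료 | 육포, 훈제닭가슴살 | 가금류 가공 및 저장 처리업 | 50인미만 | 70억미만 | 50억미만
1375 | 2022 | 1376 | 경기 | 15년미만 | 개발기술사업화 | <NA> | <NA> | 2억이하 | 섬유 | 부직포, 펠트 | 부직포 및 펠트 제조업 | 100인미만 | 200억이상 | 300억이상
1376 | 2022 | 1377 | 전북 | 5년미만 | 개발기술사업화 | <NA> | <NA> | 2억이하 | 잡화 | 매트리스침대, 흙침대 | 매트리스 및 침대 제조업 | 20인미만 | 30억미만 | 10억미만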