Overview

Dataset statistics

Number of variables13
Number of observations10000
Missing cells54
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 MiB
Average record size in memory113.0 B

Variable types

Numeric1
Text5
Categorical5
DateTime2

Dataset

Description"24년1월 기준 벤처기업명단입니다. 업체명, 대표자, 확인유형, 지역, 주소, 업종분류, 업종명, 주생산품, 유효시작일, 유효만료일, 확인기관 항목으로 이루어짐
Author중소벤처기업부
URLhttps://www.data.go.kr/data/15084581/fileData.do

Alerts

벤처확인기관 has constant value ""Constant
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 21:02:26.937245
Analysis finished2024-03-14 21:02:31.100954
Duration4.16 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20209.894
Minimum2
Maximum40548
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T06:02:31.300663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile1942.4
Q110176.75
median20221
Q330235.25
95-th percentile38497.3
Maximum40548
Range40546
Interquartile range (IQR)20058.5

Descriptive statistics

Standard deviation11637.139
Coefficient of variation (CV)0.57581396
Kurtosis-1.1795379
Mean20209.894
Median Absolute Deviation (MAD)10024
Skewness-0.0011597446
Sum2.0209894 × 108
Variance1.35423 × 108
MonotonicityNot monotonic
2024-03-15T06:02:31.749571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11248 1
 
< 0.1%
31961 1
 
< 0.1%
3448 1
 
< 0.1%
13888 1
 
< 0.1%
19994 1
 
< 0.1%
16482 1
 
< 0.1%
9233 1
 
< 0.1%
4071 1
 
< 0.1%
2251 1
 
< 0.1%
555 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
2 1
< 0.1%
17 1
< 0.1%
18 1
< 0.1%
26 1
< 0.1%
28 1
< 0.1%
31 1
< 0.1%
36 1
< 0.1%
51 1
< 0.1%
52 1
< 0.1%
53 1
< 0.1%
ValueCountFrequency (%)
40548 1
< 0.1%
40547 1
< 0.1%
40546 1
< 0.1%
40545 1
< 0.1%
40544 1
< 0.1%
40540 1
< 0.1%
40537 1
< 0.1%
40535 1
< 0.1%
40531 1
< 0.1%
40513 1
< 0.1%
Distinct9972
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T06:02:32.342015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length37
Mean length8.26
Min length1

Characters and Unicode

Total characters82600
Distinct characters914
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9944 ?
Unique (%)99.4%

Sample

1st row주식회사 바이오엔티
2nd row㈜삼우디티피
3rd row주식회사 팜하이테크
4th row㈜오티아이코리아
5th row베스트핀 주식회사
ValueCountFrequency (%)
주식회사 4419
29.3%
114
 
0.8%
농업회사법인 62
 
0.4%
예비창업자 54
 
0.4%
inc 36
 
0.2%
유한회사 36
 
0.2%
ltd 29
 
0.2%
co 28
 
0.2%
co.,ltd 26
 
0.2%
농업법인회사 8
 
0.1%
Other values (10179) 10293
68.1%
2024-03-15T06:02:33.440361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5738
 
6.9%
5163
 
6.3%
4956
 
6.0%
4796
 
5.8%
4685
 
5.7%
3656
 
4.4%
3259
 
3.9%
2959
 
3.6%
1640
 
2.0%
) 1251
 
1.5%
Other values (904) 44497
53.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 69284
83.9%
Space Separator 5163
 
6.3%
Other Symbol 3259
 
3.9%
Close Punctuation 1252
 
1.5%
Open Punctuation 1250
 
1.5%
Uppercase Letter 1140
 
1.4%
Lowercase Letter 930
 
1.1%
Other Punctuation 239
 
0.3%
Decimal Number 70
 
0.1%
Dash Punctuation 13
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5738
 
8.3%
4956
 
7.2%
4796
 
6.9%
4685
 
6.8%
3656
 
5.3%
2959
 
4.3%
1640
 
2.4%
1097
 
1.6%
907
 
1.3%
870
 
1.3%
Other values (833) 37980
54.8%
Uppercase Letter
ValueCountFrequency (%)
C 122
 
10.7%
I 102
 
8.9%
L 99
 
8.7%
T 94
 
8.2%
E 83
 
7.3%
O 70
 
6.1%
N 69
 
6.1%
S 63
 
5.5%
A 57
 
5.0%
M 44
 
3.9%
Other values (16) 337
29.6%
Lowercase Letter
ValueCountFrequency (%)
o 122
13.1%
t 101
10.9%
n 98
10.5%
e 97
10.4%
d 69
 
7.4%
c 68
 
7.3%
a 58
 
6.2%
i 49
 
5.3%
r 41
 
4.4%
s 35
 
3.8%
Other values (15) 192
20.6%
Decimal Number
ValueCountFrequency (%)
1 13
18.6%
2 11
15.7%
3 10
14.3%
5 9
12.9%
4 7
10.0%
6 7
10.0%
8 5
 
7.1%
7 3
 
4.3%
9 3
 
4.3%
0 2
 
2.9%
Other Punctuation
ValueCountFrequency (%)
. 164
68.6%
, 61
 
25.5%
& 14
 
5.9%
Close Punctuation
ValueCountFrequency (%)
) 1251
99.9%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1249
99.9%
[ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
5163
100.0%
Other Symbol
ValueCountFrequency (%)
3259
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 72543
87.8%
Common 7987
 
9.7%
Latin 2070
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5738
 
7.9%
4956
 
6.8%
4796
 
6.6%
4685
 
6.5%
3656
 
5.0%
3259
 
4.5%
2959
 
4.1%
1640
 
2.3%
1097
 
1.5%
907
 
1.3%
Other values (834) 38850
53.6%
Latin
ValueCountFrequency (%)
C 122
 
5.9%
o 122
 
5.9%
I 102
 
4.9%
t 101
 
4.9%
L 99
 
4.8%
n 98
 
4.7%
e 97
 
4.7%
T 94
 
4.5%
E 83
 
4.0%
O 70
 
3.4%
Other values (41) 1082
52.3%
Common
ValueCountFrequency (%)
5163
64.6%
) 1251
 
15.7%
( 1249
 
15.6%
. 164
 
2.1%
, 61
 
0.8%
& 14
 
0.2%
1 13
 
0.2%
- 13
 
0.2%
2 11
 
0.1%
3 10
 
0.1%
Other values (9) 38
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 69284
83.9%
ASCII 10057
 
12.2%
None 3259
 
3.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5738
 
8.3%
4956
 
7.2%
4796
 
6.9%
4685
 
6.8%
3656
 
5.3%
2959
 
4.3%
1640
 
2.4%
1097
 
1.6%
907
 
1.3%
870
 
1.3%
Other values (833) 37980
54.8%
ASCII
ValueCountFrequency (%)
5163
51.3%
) 1251
 
12.4%
( 1249
 
12.4%
. 164
 
1.6%
C 122
 
1.2%
o 122
 
1.2%
I 102
 
1.0%
t 101
 
1.0%
L 99
 
1.0%
n 98
 
1.0%
Other values (60) 1586
 
15.8%
None
ValueCountFrequency (%)
3259
100.0%
Distinct471
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T06:02:34.742705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length3
Mean length3.2395
Min length3

Characters and Unicode

Total characters32395
Distinct characters131
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique284 ?
Unique (%)2.8%

Sample

1st row정**
2nd row남**
3rd row우**
4th row박**
5th row주**
ValueCountFrequency (%)
1997
20.0%
1444
 
14.4%
803
 
8.0%
436
 
4.4%
402
 
4.0%
274
 
2.7%
264
 
2.6%
205
 
2.1%
193
 
1.9%
182
 
1.8%
Other values (461) 3800
38.0%
2024-03-15T06:02:36.068530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 21197
65.4%
2233
 
6.9%
1610
 
5.0%
909
 
2.8%
, 599
 
1.8%
501
 
1.5%
459
 
1.4%
302
 
0.9%
290
 
0.9%
234
 
0.7%
Other values (121) 4061
 
12.5%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 21796
67.3%
Other Letter 10560
32.6%
Uppercase Letter 39
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2233
21.1%
1610
15.2%
909
 
8.6%
501
 
4.7%
459
 
4.3%
302
 
2.9%
290
 
2.7%
234
 
2.2%
215
 
2.0%
212
 
2.0%
Other values (104) 3595
34.0%
Uppercase Letter
ValueCountFrequency (%)
K 9
23.1%
L 8
20.5%
P 4
10.3%
Y 3
 
7.7%
C 3
 
7.7%
S 2
 
5.1%
H 2
 
5.1%
Z 1
 
2.6%
E 1
 
2.6%
I 1
 
2.6%
Other values (5) 5
12.8%
Other Punctuation
ValueCountFrequency (%)
* 21197
97.3%
, 599
 
2.7%

Most occurring scripts

ValueCountFrequency (%)
Common 21796
67.3%
Hangul 10560
32.6%
Latin 39
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2233
21.1%
1610
15.2%
909
 
8.6%
501
 
4.7%
459
 
4.3%
302
 
2.9%
290
 
2.7%
234
 
2.2%
215
 
2.0%
212
 
2.0%
Other values (104) 3595
34.0%
Latin
ValueCountFrequency (%)
K 9
23.1%
L 8
20.5%
P 4
10.3%
Y 3
 
7.7%
C 3
 
7.7%
S 2
 
5.1%
H 2
 
5.1%
Z 1
 
2.6%
E 1
 
2.6%
I 1
 
2.6%
Other values (5) 5
12.8%
Common
ValueCountFrequency (%)
* 21197
97.3%
, 599
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 21835
67.4%
Hangul 10560
32.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 21197
97.1%
, 599
 
2.7%
K 9
 
< 0.1%
L 8
 
< 0.1%
P 4
 
< 0.1%
Y 3
 
< 0.1%
C 3
 
< 0.1%
S 2
 
< 0.1%
H 2
 
< 0.1%
Z 1
 
< 0.1%
Other values (7) 7
 
< 0.1%
Hangul
ValueCountFrequency (%)
2233
21.1%
1610
15.2%
909
 
8.6%
501
 
4.7%
459
 
4.3%
302
 
2.9%
290
 
2.7%
234
 
2.2%
215
 
2.0%
212
 
2.0%
Other values (104) 3595
34.0%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
혁신성장유형
6524 
벤처투자유형
1748 
연구개발유형
1672 
예비벤처유형
 
56

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row연구개발유형
2nd row연구개발유형
3rd row혁신성장유형
4th row혁신성장유형
5th row벤처투자유형

Common Values

ValueCountFrequency (%)
혁신성장유형 6524
65.2%
벤처투자유형 1748
 
17.5%
연구개발유형 1672
 
16.7%
예비벤처유형 56
 
0.6%

Length

2024-03-15T06:02:36.283936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T06:02:36.563358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
혁신성장유형 6524
65.2%
벤처투자유형 1748
 
17.5%
연구개발유형 1672
 
16.7%
예비벤처유형 56
 
0.6%

지역
Categorical

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기
3085 
서울
2928 
인천
445 
부산
429 
대전
391 
Other values (12)
2722 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기
2nd row경기
3rd row경기
4th row부산
5th row서울

Common Values

ValueCountFrequency (%)
경기 3085
30.9%
서울 2928
29.3%
인천 445
 
4.5%
부산 429
 
4.3%
대전 391
 
3.9%
경북 365
 
3.6%
경남 362
 
3.6%
대구 356
 
3.6%
충남 342
 
3.4%
충북 253
 
2.5%
Other values (7) 1044
 
10.4%

Length

2024-03-15T06:02:36.930597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기 3085
30.9%
서울 2928
29.3%
인천 445
 
4.5%
부산 429
 
4.3%
대전 391
 
3.9%
경북 365
 
3.6%
경남 362
 
3.6%
대구 356
 
3.6%
충남 342
 
3.4%
충북 253
 
2.5%
Other values (7) 1044
 
10.4%
Distinct243
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T06:02:38.195380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length8.1778
Min length7

Characters and Unicode

Total characters81778
Distinct characters156
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)0.3%

Sample

1st row경기도 화성시
2nd row경기도 안산시
3rd row경기도 화성시
4th row부산광역시 동래구
5th row서울특별시 영등포구
ValueCountFrequency (%)
경기도 3085
 
15.4%
서울특별시 2928
 
14.6%
강남구 648
 
3.2%
인천광역시 445
 
2.2%
성남시 444
 
2.2%
부산광역시 429
 
2.1%
화성시 407
 
2.0%
대전광역시 391
 
2.0%
경상북도 365
 
1.8%
경상남도 362
 
1.8%
Other values (228) 10496
52.5%
2024-03-15T06:02:39.753236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10000
 
12.2%
9719
 
11.9%
5313
 
6.5%
5127
 
6.3%
3919
 
4.8%
3795
 
4.6%
3112
 
3.8%
3099
 
3.8%
3055
 
3.7%
3055
 
3.7%
Other values (146) 31584
38.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 71763
87.8%
Space Separator 10000
 
12.2%
Decimal Number 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9719
 
13.5%
5313
 
7.4%
5127
 
7.1%
3919
 
5.5%
3795
 
5.3%
3112
 
4.3%
3099
 
4.3%
3055
 
4.3%
3055
 
4.3%
2379
 
3.3%
Other values (141) 29190
40.7%
Decimal Number
ValueCountFrequency (%)
7 9
60.0%
1 4
26.7%
4 1
 
6.7%
3 1
 
6.7%
Space Separator
ValueCountFrequency (%)
10000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 71763
87.8%
Common 10015
 
12.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9719
 
13.5%
5313
 
7.4%
5127
 
7.1%
3919
 
5.5%
3795
 
5.3%
3112
 
4.3%
3099
 
4.3%
3055
 
4.3%
3055
 
4.3%
2379
 
3.3%
Other values (141) 29190
40.7%
Common
ValueCountFrequency (%)
10000
99.9%
7 9
 
0.1%
1 4
 
< 0.1%
4 1
 
< 0.1%
3 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 71763
87.8%
ASCII 10015
 
12.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10000
99.9%
7 9
 
0.1%
1 4
 
< 0.1%
4 1
 
< 0.1%
3 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
9719
 
13.5%
5313
 
7.4%
5127
 
7.1%
3919
 
5.5%
3795
 
5.3%
3112
 
4.3%
3099
 
4.3%
3055
 
4.3%
3055
 
4.3%
2379
 
3.3%
Other values (141) 29190
40.7%
Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
제조업
5726 
정보처리S/W
2229 
기타
1013 
도소매업
 
380
연구개발서비스
 
367
Other values (2)
 
285

Length

Max length8
Median length3
Mean length4.0252
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row연구개발서비스
2nd row제조업
3rd row제조업
4th row제조업
5th row정보처리S/W

Common Values

ValueCountFrequency (%)
제조업 5726
57.3%
정보처리S/W 2229
 
22.3%
기타 1013
 
10.1%
도소매업 380
 
3.8%
연구개발서비스 367
 
3.7%
건설운수 231
 
2.3%
농,어,임,광업 54
 
0.5%

Length

2024-03-15T06:02:40.270016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T06:02:40.580113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제조업 5726
57.3%
정보처리s/w 2229
 
22.3%
기타 1013
 
10.1%
도소매업 380
 
3.8%
연구개발서비스 367
 
3.7%
건설운수 231
 
2.3%
농,어,임,광업 54
 
0.5%
Distinct684
Distinct (%)6.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T06:02:41.771046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length24
Mean length15.5283
Min length3

Characters and Unicode

Total characters155283
Distinct characters394
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique140 ?
Unique (%)1.4%

Sample

1st row의학 및 약학 연구개발업
2nd row날염 가공업
3rd row천막, 텐트 및 유사 제품 제조업
4th row그 외 기타 특수목적용 기계 제조업
5th row컴퓨터 프로그래밍 서비스업
ValueCountFrequency (%)
제조업 5398
 
11.9%
4601
 
10.1%
기타 3276
 
7.2%
1825
 
4.0%
1818
 
4.0%
서비스업 1372
 
3.0%
소프트웨어 1233
 
2.7%
공급업 1221
 
2.7%
개발 1210
 
2.7%
응용 791
 
1.7%
Other values (1020) 22729
50.0%
2024-03-15T06:02:43.256059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35478
22.8%
10302
 
6.6%
7340
 
4.7%
6743
 
4.3%
6123
 
3.9%
4601
 
3.0%
3300
 
2.1%
2739
 
1.8%
2539
 
1.6%
2223
 
1.4%
Other values (384) 73895
47.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 118820
76.5%
Space Separator 35478
 
22.8%
Other Punctuation 899
 
0.6%
Close Punctuation 30
 
< 0.1%
Open Punctuation 30
 
< 0.1%
Decimal Number 26
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10302
 
8.7%
7340
 
6.2%
6743
 
5.7%
6123
 
5.2%
4601
 
3.9%
3300
 
2.8%
2739
 
2.3%
2539
 
2.1%
2223
 
1.9%
2174
 
1.8%
Other values (378) 70736
59.5%
Other Punctuation
ValueCountFrequency (%)
, 881
98.0%
. 18
 
2.0%
Space Separator
ValueCountFrequency (%)
35478
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Decimal Number
ValueCountFrequency (%)
1 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 118820
76.5%
Common 36463
 
23.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10302
 
8.7%
7340
 
6.2%
6743
 
5.7%
6123
 
5.2%
4601
 
3.9%
3300
 
2.8%
2739
 
2.3%
2539
 
2.1%
2223
 
1.9%
2174
 
1.8%
Other values (378) 70736
59.5%
Common
ValueCountFrequency (%)
35478
97.3%
, 881
 
2.4%
) 30
 
0.1%
( 30
 
0.1%
1 26
 
0.1%
. 18
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 118732
76.5%
ASCII 36463
 
23.5%
Compat Jamo 88
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
35478
97.3%
, 881
 
2.4%
) 30
 
0.1%
( 30
 
0.1%
1 26
 
0.1%
. 18
 
< 0.1%
Hangul
ValueCountFrequency (%)
10302
 
8.7%
7340
 
6.2%
6743
 
5.7%
6123
 
5.2%
4601
 
3.9%
3300
 
2.8%
2739
 
2.3%
2539
 
2.1%
2223
 
1.9%
2174
 
1.8%
Other values (377) 70648
59.5%
Compat Jamo
ValueCountFrequency (%)
88
100.0%
Distinct8981
Distinct (%)90.3%
Missing54
Missing (%)0.5%
Memory size156.2 KiB
2024-03-15T06:02:44.517833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length115
Median length87
Mean length13.156344
Min length1

Characters and Unicode

Total characters130853
Distinct characters942
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8642 ?
Unique (%)86.9%

Sample

1st row의료기기 의약품
2nd row섬유디지털프린트및가고편발수가공 요가복요가매트외
3rd rowLED, OLED 스마트 조명기구
4th row이화학용 멸균기 및 살균기
5th row담보대출 비교 플랫폼
ValueCountFrequency (%)
1443
 
5.4%
소프트웨어 470
 
1.8%
개발 426
 
1.6%
서비스 361
 
1.4%
307
 
1.2%
플랫폼 302
 
1.1%
275
 
1.0%
솔루션 210
 
0.8%
제조 197
 
0.7%
시스템 184
 
0.7%
Other values (11796) 22434
84.3%
2024-03-15T06:02:46.210474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17110
 
13.1%
, 3808
 
2.9%
3225
 
2.5%
2480
 
1.9%
1761
 
1.3%
1658
 
1.3%
1598
 
1.2%
1534
 
1.2%
1507
 
1.2%
1487
 
1.1%
Other values (932) 94685
72.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 95621
73.1%
Space Separator 17110
 
13.1%
Uppercase Letter 7260
 
5.5%
Lowercase Letter 5033
 
3.8%
Other Punctuation 4456
 
3.4%
Close Punctuation 522
 
0.4%
Open Punctuation 431
 
0.3%
Decimal Number 303
 
0.2%
Dash Punctuation 104
 
0.1%
Math Symbol 8
 
< 0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3225
 
3.4%
2480
 
2.6%
1761
 
1.8%
1658
 
1.7%
1598
 
1.7%
1534
 
1.6%
1507
 
1.6%
1487
 
1.6%
1467
 
1.5%
1462
 
1.5%
Other values (849) 77442
81.0%
Uppercase Letter
ValueCountFrequency (%)
S 708
 
9.8%
C 612
 
8.4%
E 545
 
7.5%
D 527
 
7.3%
I 520
 
7.2%
A 503
 
6.9%
T 453
 
6.2%
P 438
 
6.0%
R 427
 
5.9%
L 381
 
5.2%
Other values (16) 2146
29.6%
Lowercase Letter
ValueCountFrequency (%)
e 587
11.7%
o 489
 
9.7%
a 487
 
9.7%
i 407
 
8.1%
t 391
 
7.8%
r 378
 
7.5%
n 290
 
5.8%
s 285
 
5.7%
l 266
 
5.3%
p 192
 
3.8%
Other values (16) 1261
25.1%
Decimal Number
ValueCountFrequency (%)
3 92
30.4%
2 83
27.4%
0 37
12.2%
1 27
 
8.9%
5 18
 
5.9%
6 13
 
4.3%
4 13
 
4.3%
9 9
 
3.0%
8 6
 
2.0%
7 5
 
1.7%
Other Punctuation
ValueCountFrequency (%)
, 3808
85.5%
/ 355
 
8.0%
. 124
 
2.8%
& 93
 
2.1%
; 54
 
1.2%
: 12
 
0.3%
· 9
 
0.2%
% 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 517
99.0%
] 3
 
0.6%
} 2
 
0.4%
Math Symbol
ValueCountFrequency (%)
| 5
62.5%
+ 2
 
25.0%
= 1
 
12.5%
Open Punctuation
ValueCountFrequency (%)
( 427
99.1%
[ 4
 
0.9%
Space Separator
ValueCountFrequency (%)
17110
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 104
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 95610
73.1%
Common 22939
 
17.5%
Latin 12293
 
9.4%
Han 11
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3225
 
3.4%
2480
 
2.6%
1761
 
1.8%
1658
 
1.7%
1598
 
1.7%
1534
 
1.6%
1507
 
1.6%
1487
 
1.6%
1467
 
1.5%
1462
 
1.5%
Other values (847) 77431
81.0%
Latin
ValueCountFrequency (%)
S 708
 
5.8%
C 612
 
5.0%
e 587
 
4.8%
E 545
 
4.4%
D 527
 
4.3%
I 520
 
4.2%
A 503
 
4.1%
o 489
 
4.0%
a 487
 
4.0%
T 453
 
3.7%
Other values (42) 6862
55.8%
Common
ValueCountFrequency (%)
17110
74.6%
, 3808
 
16.6%
) 517
 
2.3%
( 427
 
1.9%
/ 355
 
1.5%
. 124
 
0.5%
- 104
 
0.5%
& 93
 
0.4%
3 92
 
0.4%
2 83
 
0.4%
Other values (21) 226
 
1.0%
Han
ValueCountFrequency (%)
10
90.9%
1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 95610
73.1%
ASCII 35221
 
26.9%
CJK 11
 
< 0.1%
None 9
 
< 0.1%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17110
48.6%
, 3808
 
10.8%
S 708
 
2.0%
C 612
 
1.7%
e 587
 
1.7%
E 545
 
1.5%
D 527
 
1.5%
I 520
 
1.5%
) 517
 
1.5%
A 503
 
1.4%
Other values (70) 9784
27.8%
Hangul
ValueCountFrequency (%)
3225
 
3.4%
2480
 
2.6%
1761
 
1.8%
1658
 
1.7%
1598
 
1.7%
1534
 
1.6%
1507
 
1.6%
1487
 
1.6%
1467
 
1.5%
1462
 
1.5%
Other values (847) 77431
81.0%
CJK
ValueCountFrequency (%)
10
90.9%
1
 
9.1%
None
ValueCountFrequency (%)
· 9
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct821
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2021-02-08 00:00:00
Maximum2024-01-31 00:00:00
2024-03-15T06:02:46.454911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T06:02:46.861632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct821
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2024-02-07 00:00:00
Maximum2027-01-30 00:00:00
2024-03-15T06:02:47.378387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T06:02:47.831480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

벤처확인기관
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
벤처기업확인기관
10000 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row벤처기업확인기관
2nd row벤처기업확인기관
3rd row벤처기업확인기관
4th row벤처기업확인기관
5th row벤처기업확인기관

Common Values

ValueCountFrequency (%)
벤처기업확인기관 10000
100.0%

Length

2024-03-15T06:02:48.290575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T06:02:48.597702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
벤처기업확인기관 10000
100.0%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
재확인
6299 
신규
3701 

Length

Max length3
Median length3
Mean length2.6299
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신규
2nd row재확인
3rd row신규
4th row재확인
5th row신규

Common Values

ValueCountFrequency (%)
재확인 6299
63.0%
신규 3701
37.0%

Length

2024-03-15T06:02:49.049948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T06:02:49.384858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
재확인 6299
63.0%
신규 3701
37.0%

Interactions

2024-03-15T06:02:29.805044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T06:02:49.677160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번벤처확인유형지역업종분류(기보)신규_재확인코드
연번1.0000.0970.0440.0620.433
벤처확인유형0.0971.0000.2840.2860.281
지역0.0440.2841.0000.4100.112
업종분류(기보)0.0620.2860.4101.0000.179
신규_재확인코드0.4330.2810.1120.1791.000
2024-03-15T06:02:49.962908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신규_재확인코드업종분류(기보)지역벤처확인유형
신규_재확인코드1.0000.1910.1010.187
업종분류(기보)0.1911.0000.1980.200
지역0.1010.1981.0000.161
벤처확인유형0.1870.2000.1611.000
2024-03-15T06:02:50.249127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번벤처확인유형지역업종분류(기보)신규_재확인코드
연번1.0000.0580.0170.0310.333
벤처확인유형0.0581.0000.1610.2000.187
지역0.0170.1611.0000.1980.101
업종분류(기보)0.0310.2000.1981.0000.191
신규_재확인코드0.3330.1870.1010.1911.000

Missing values

2024-03-15T06:02:30.249737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T06:02:30.838224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명대표자명벤처확인유형지역간략주소업종분류(기보)업종명(10차)주생산품벤처유효시작일벤처유효종료일벤처확인기관신규_재확인코드
1124711248주식회사 바이오엔티정**연구개발유형경기경기도 화성시연구개발서비스의학 및 약학 연구개발업의료기기 의약품2021-11-242024-11-23벤처기업확인기관신규
2127721278㈜삼우디티피남**연구개발유형경기경기도 안산시제조업날염 가공업섬유디지털프린트및가고편발수가공 요가복요가매트외2022-04-022025-04-01벤처기업확인기관재확인
2839728398주식회사 팜하이테크우**혁신성장유형경기경기도 화성시제조업천막, 텐트 및 유사 제품 제조업LED, OLED 스마트 조명기구2022-10-192025-10-18벤처기업확인기관신규
74947495㈜오티아이코리아박**혁신성장유형부산부산광역시 동래구제조업그 외 기타 특수목적용 기계 제조업이화학용 멸균기 및 살균기2021-06-242024-06-23벤처기업확인기관재확인
1383813839베스트핀 주식회사주**벤처투자유형서울서울특별시 영등포구정보처리S/W컴퓨터 프로그래밍 서비스업담보대출 비교 플랫폼2021-11-242024-11-23벤처기업확인기관신규
2913529136주식회사 누지박**연구개발유형서울서울특별시 송파구제조업그 외 기타 가구 제조업스마트 헬스케어 체어2022-12-142025-12-13벤처기업확인기관신규
76907691㈜에너캠프최**연구개발유형대구대구광역시 달서구제조업에너지 저장장치 제조업스마트 배터리 충전기2021-06-272024-06-26벤처기업확인기관재확인
3054130542주식회사 나이스웨더노**벤처투자유형서울서울특별시 강남구도소매업상품 종합 도매업의류, 식품, 화장품, 잡화2022-11-092025-11-08벤처기업확인기관신규
2861728618주식회사 더블유앤피P**벤처투자유형울산울산광역시 남구기타기타 엔지니어링 서비스업수소 연료전지용 멤브레인 개발 및 LNG 탱크 컨테이너 개발2022-09-282025-09-27벤처기업확인기관신규
1636116362보금냉열천**혁신성장유형경북경상북도 경산시제조업산업용 냉장 및 냉동 장비 제조업저온저장고 실외기2022-03-302025-03-29벤처기업확인기관신규
연번업체명대표자명벤처확인유형지역간략주소업종분류(기보)업종명(10차)주생산품벤처유효시작일벤처유효종료일벤처확인기관신규_재확인코드
29962997아이디어플러스 주식회사박**혁신성장유형경기경기도 안산시제조업일반용 전기 조명장치 제조업LED조명2021-05-232024-05-22벤처기업확인기관재확인
74357436주식회사 지티김**벤처투자유형울산울산광역시 울주군제조업그 외 기타 일반목적용 기계 제조업지티 Metal-Co2 시스템2021-07-212024-07-20벤처기업확인기관신규
1552715528농업회사법인미래㈜조**혁신성장유형경기경기도 김포시제조업김치류 제조업김치류2022-01-302025-01-29벤처기업확인기관재확인
3295132952주식회사 시걸컴즈백**연구개발유형서울서울특별시 성동구제조업가방 및 기타 보호용 케이스 제조업가방 및 핸드백2022-12-072025-12-06벤처기업확인기관재확인
2200522006주식회사 농업회사법인 콩아저씨 두부가게복**혁신성장유형충북충청북도 음성군제조업두부 및 유사식품 제조업두부, 순두부, 청국장 등2022-06-222025-06-21벤처기업확인기관신규
93429343㈜우진주**혁신성장유형대구대구광역시 동구제조업그 외 기타 일반목적용 기계 제조업교반기 응집기2021-09-252024-09-24벤처기업확인기관재확인
1197311974농업법인회사 영인바이오최**혁신성장유형전북전라북도 군산시제조업과실 및 그 외 채소 절임식품 제조업순살꽃게장 외 수산물 가공식품2021-10-072024-10-06벤처기업확인기관재확인
1519115192㈜비에스지에이치앤비김**혁신성장유형서울서울특별시 마포구제조업화장품 제조업화장품2022-01-052025-01-04벤처기업확인기관신규
3904539046예비창업자 이상민이**예비벤처유형인천인천광역시 미추홀구제조업기타 가정용 전기기기 제조업<NA>2023-10-042026-10-03벤처기업확인기관신규
1861718618(주)알파머티리얼즈박**연구개발유형경기경기도 화성시제조업전기용 탄소제품 및 절연제품 제조업방열소재 및 기타응용제품2022-02-272025-02-26벤처기업확인기관재확인