Overview

Dataset statistics

Number of variables: 5
Number of observations: 324
Missing cells: 36
Missing cells (%): 2.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.8 KiB
Average record size in memory: 40.4 B
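
The two missing-value percentages in this report (2.2% of all cells here, 11.1% of the 우편번호 column in the Alerts section) both follow from the same count of 36 missing values. A minimal sketch of the arithmetic, using only figures reported in this document; the variable names are illustrative and not part of the profiling tool:

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 5
n_observations = 324
missing_cells = 36  # all of them fall in the 우편번호 column, per the Alerts section

# Share of missing values across every cell of the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Share of missing values within the single affected column.
column_missing_pct = 100 * missing_cells / n_observations

print(f"{total_cells} cells, {missing_pct:.1f}% missing overall")
print(f"{column_missing_pct:.1f}% missing in the affected column")
```

The same 36 values look small at the dataset level and noticeably larger at the column level, which is why the report raises a column-level alert.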

Variable types

Text: 5

Dataset

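Description: This dataset lists small and medium-sized enterprises (SMEs) whose "Inter-SME Collaboration Business Plan" (「중소기업간 협업사업계획」) has been approved. It consists of five fields: 승인번호 (approval number), 업체명 (company name), 우편번호 (postal code), 업체주소 (company address), and 대표자 (representative). Compiled as of 31 July 2022.
Author: 중소벤처기업부 (Ministry of SMEs and Startups)
URL: https://www.data.go.kr/data/3034764/fileData.do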

Alerts

우편번호 (postal code) has 36 (11.1%) missing values [Missing]
승인번호 (approval number) has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 08:46:20.390701
Analysis finished: 2023-12-12 08:46:21.084459
Duration: 0.69 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

승인번호 (approval number)
Text

UNIQUE 

Distinct: 324
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

Length

Max length: 10
Median length: 9
Mean length: 9.382716
Min length: 9

Characters and Unicode

Total characters: 3040
Distinct characters: 13
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
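
As an illustration of those character properties, the sketch below counts Unicode general categories for the five sample values of 승인번호 listed in this report, using Python's standard `unicodedata` module. This is not necessarily how the profiling tool computes its tables. The standard library exposes the general category (`Nd`, `Lo`, `Pd`, which the report labels Decimal Number, Other Letter, and Dash Punctuation) but not scripts or blocks, so only categories are shown.

```python
import unicodedata
from collections import Counter

# The five sample values shown for 승인번호 in this report.
samples = ["제2012-12호", "제2012-16호", "제2012-18호", "제2012-30호", "제2013-12호"]

# Tally the Unicode general category of every character in every sample.
category_counts = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)

for category, count in category_counts.most_common():
    print(category, count)
```

Each sample contributes six digits, two Hangul syllables, and one hyphen, which mirrors the three categories reported for the full column.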

Unique

Unique: 324
Unique (%): 100.0%

Sample

1st row: 제2012-12호
2nd row: 제2012-16호
3rd row: 제2012-18호
4th row: 제2012-30호
5th row: 제2013-12호
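
The sample values suggest a pattern of `제` + four-digit year + `-` + serial number + `호`, with the serial written in two or three digits (which would account for lengths of 9 and 10). The pattern is inferred from the values shown in this report, not from a published specification, and the helper name `parse_approval` is hypothetical. A minimal sketch of parsing it:

```python
import re

# Assumed pattern, inferred from the sample values only:
# "제" + 4-digit year + "-" + serial number + "호".
APPROVAL = re.compile(r"제(\d{4})-(\d+)호")

def parse_approval(value):
    """Return (year, serial) as integers, or None if the value does not match."""
    match = APPROVAL.fullmatch(value)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))

print(parse_approval("제2012-12호"))
print(parse_approval("제2022-071호"))
```

Values that return `None` would be worth inspecting by hand before relying on the assumed pattern.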
Value | Count | Frequency (%)
제2012-12호 | 1 | 0.3%
제2022-071호 | 1 | 0.3%
제2022-095호 | 1 | 0.3%
제2022-094호 | 1 | 0.3%
제2022-093호 | 1 | 0.3%
제2022-092호 | 1 | 0.3%
제2022-091호 | 1 | 0.3%
제2022-090호 | 1 | 0.3%
제2022-089호 | 1 | 0.3%
제2022-096호 | 1 | 0.3%
Other values (314) | 314 | 96.9%

Most occurring characters

Value | Count | Frequency (%)
2 | 806 | 26.5%
0 | 519 | 17.1%
제 | 324 | 10.7%
- | 324 | 10.7%
호 | 324 | 10.7%
1 | 294 | 9.7%
3 | 119 | 3.9%
4 | 64 | 2.1%
5 | 59 | 1.9%
6 | 55 | 1.8%
Other values (3) | 152 | 5.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2068 | 68.0%
Other Letter | 648 | 21.3%
Dash Punctuation | 324 | 10.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 806
39.0%
0 519
25.1%
1 294
 
14.2%
3 119
 
5.8%
4 64
 
3.1%
5 59
 
2.9%
6 55
 
2.7%
8 54
 
2.6%
7 49
 
2.4%
9 49
 
2.4%
Other Letter
ValueCountFrequency (%)
324
50.0%
324
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 324
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2392
78.7%
Hangul 648
 
21.3%

Most frequent character per script

Common
ValueCountFrequency (%)
2 806
33.7%
0 519
21.7%
- 324
13.5%
1 294
 
12.3%
3 119
 
5.0%
4 64
 
2.7%
5 59
 
2.5%
6 55
 
2.3%
8 54
 
2.3%
7 49
 
2.0%
Hangul
ValueCountFrequency (%)
324
50.0%
324
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2392
78.7%
Hangul 648
 
21.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 806
33.7%
0 519
21.7%
- 324
13.5%
1 294
 
12.3%
3 119
 
5.0%
4 64
 
2.7%
5 59
 
2.5%
6 55
 
2.3%
8 54
 
2.3%
7 49
 
2.0%
Hangul
ValueCountFrequency (%)
324
50.0%
324
50.0%

업체명 (company name)
Text

Distinct: 308
Distinct (%): 95.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

Length

Max length: 20
Median length: 15
Mean length: 7.632716
Min length: 2

Characters and Unicode

Total characters: 2473
Distinct characters: 304
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 294
Unique (%): 90.7%

Sample

1st row: (주)에너테크
2nd row: (주)비엔씨넷
3rd row: 농업법인자연드림(주)
4th row: 신한코아(주)
5th row: 엔에스엘이디(주)
Value | Count | Frequency (%)
주식회사 | 53 | 13.8%
농업회사법인 | 6 | 1.6%
주)오존에이드 | 3 | 0.8%
주)모현씨앤디 | 3 | 0.8%
주)금명 | 2 | 0.5%
주)에너테크 | 2 | 0.5%
주)휴마스터 | 2 | 0.5%
주)한창산업 | 2 | 0.5%
매이크앤(주 | 2 | 0.5%
주)가나테크 | 2 | 0.5%
Other values (302) | 308 | 80.0%

Most occurring characters

ValueCountFrequency (%)
246
 
9.9%
) 178
 
7.2%
( 176
 
7.1%
99
 
4.0%
80
 
3.2%
78
 
3.2%
75
 
3.0%
72
 
2.9%
61
 
2.5%
55
 
2.2%
Other values (294) 1353
54.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2024
81.8%
Close Punctuation 178
 
7.2%
Open Punctuation 176
 
7.1%
Space Separator 61
 
2.5%
Uppercase Letter 22
 
0.9%
Other Symbol 7
 
0.3%
Other Punctuation 3
 
0.1%
Dash Punctuation 1
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
246
 
12.2%
99
 
4.9%
80
 
4.0%
78
 
3.9%
75
 
3.7%
72
 
3.6%
55
 
2.7%
35
 
1.7%
32
 
1.6%
30
 
1.5%
Other values (272) 1222
60.4%
Uppercase Letter
ValueCountFrequency (%)
I 3
13.6%
N 2
9.1%
C 2
9.1%
E 2
9.1%
S 2
9.1%
T 2
9.1%
M 2
9.1%
H 1
 
4.5%
D 1
 
4.5%
B 1
 
4.5%
Other values (4) 4
18.2%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
& 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 178
100.0%
Open Punctuation
ValueCountFrequency (%)
( 176
100.0%
Space Separator
ValueCountFrequency (%)
61
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2031
82.1%
Common 420
 
17.0%
Latin 22
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
246
 
12.1%
99
 
4.9%
80
 
3.9%
78
 
3.8%
75
 
3.7%
72
 
3.5%
55
 
2.7%
35
 
1.7%
32
 
1.6%
30
 
1.5%
Other values (273) 1229
60.5%
Latin
ValueCountFrequency (%)
I 3
13.6%
N 2
9.1%
C 2
9.1%
E 2
9.1%
S 2
9.1%
T 2
9.1%
M 2
9.1%
H 1
 
4.5%
D 1
 
4.5%
B 1
 
4.5%
Other values (4) 4
18.2%
Common
ValueCountFrequency (%)
) 178
42.4%
( 176
41.9%
61
 
14.5%
. 2
 
0.5%
- 1
 
0.2%
& 1
 
0.2%
3 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2024
81.8%
ASCII 442
 
17.9%
None 7
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
246
 
12.2%
99
 
4.9%
80
 
4.0%
78
 
3.9%
75
 
3.7%
72
 
3.6%
55
 
2.7%
35
 
1.7%
32
 
1.6%
30
 
1.5%
Other values (272) 1222
60.4%
ASCII
ValueCountFrequency (%)
) 178
40.3%
( 176
39.8%
61
 
13.8%
I 3
 
0.7%
N 2
 
0.5%
C 2
 
0.5%
E 2
 
0.5%
. 2
 
0.5%
S 2
 
0.5%
T 2
 
0.5%
Other values (11) 12
 
2.7%
None
ValueCountFrequency (%)
7
100.0%

우편번호 (postal code)
Text

MISSING 

Distinct: 256
Distinct (%): 88.9%
Missing: 36
Missing (%): 11.1%
Memory size: 2.7 KiB

Length

Max length: 7
Median length: 5
Mean length: 5.4097222
Min length: 5
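
The length range of 5 to 7 is consistent with two formats appearing in the same column: hyphenated six-digit codes such as `462-807` (7 characters) and five-digit codes such as `34054`. The sketch below checks that reading against the totals reported for this variable (1558 characters in total, 59 hyphens, 324 - 36 non-missing values). It assumes every hyphen belongs to exactly one seven-character code, and the helper `postal_format` is a hypothetical name for illustration:

```python
import re

# Totals reported for 우편번호 in this report.
non_missing = 324 - 36       # observations minus missing values
total_characters = 1558
hyphen_count = 59            # "Dash Punctuation" count for this variable

# Assumption: each hyphen marks one 7-character "NNN-NNN" code,
# and every remaining value is a 5-digit code.
old_format = hyphen_count
new_format = non_missing - old_format
assert old_format * 7 + new_format * 5 == total_characters

OLD = re.compile(r"\d{3}-\d{3}")
NEW = re.compile(r"\d{5}")

def postal_format(code):
    """Classify a postal code string as 'old', 'new', or 'other'."""
    if OLD.fullmatch(code):
        return "old"
    if NEW.fullmatch(code):
        return "new"
    return "other"

print(old_format, new_format)
print(postal_format("462-807"), postal_format("34054"))
```

Under that assumption the character total is reproduced exactly, and the implied mean length of 1558 / 288 matches the reported 5.4097222.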

Characters and Unicode

Total characters: 1558
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 232
Unique (%): 80.6%

Sample

1st row: 462-807
2nd row: 331-816
3rd row: 363-883
4th row: 429-450
5th row: 235-200
Value | Count | Frequency (%)
34054 | 4 | 1.4%
462-807 | 4 | 1.4%
34013 | 4 | 1.4%
34036 | 3 | 1.0%
57118 | 3 | 1.0%
12918 | 2 | 0.7%
10048 | 2 | 0.7%
51347 | 2 | 0.7%
48002 | 2 | 0.7%
63243 | 2 | 0.7%
Other values (246) | 260 | 90.3%

Most occurring characters

ValueCountFrequency (%)
4 205
13.2%
1 205
13.2%
0 188
12.1%
3 182
11.7%
2 175
11.2%
5 148
9.5%
8 121
7.8%
7 100
6.4%
6 98
6.3%
9 77
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1499
96.2%
Dash Punctuation 59
 
3.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 205
13.7%
1 205
13.7%
0 188
12.5%
3 182
12.1%
2 175
11.7%
5 148
9.9%
8 121
8.1%
7 100
6.7%
6 98
6.5%
9 77
 
5.1%
Dash Punctuation
ValueCountFrequency (%)
- 59
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1558
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 205
13.2%
1 205
13.2%
0 188
12.1%
3 182
11.7%
2 175
11.2%
5 148
9.5%
8 121
7.8%
7 100
6.4%
6 98
6.3%
9 77
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1558
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 205
13.2%
1 205
13.2%
0 188
12.1%
3 182
11.7%
2 175
11.2%
5 148
9.5%
8 121
7.8%
7 100
6.4%
6 98
6.3%
9 77
 
4.9%

업체주소 (company address)
Text

Distinct: 318
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

Length

Max length: 57
Median length: 48
Mean length: 27.324074
Min length: 10

Characters and Unicode

Total characters: 8853
Distinct characters: 353
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 312
Unique (%): 96.3%

Sample

1st row: 경기도 성남시 중원구 둔촌대로 545
2nd row: 충청남도 천안시 서북구 직산로 136-0 천안밸리
3rd row: 충청북도 청원군 오창읍 연구단지로 76 충북테크노파크 미래융합기술관419호
4th row: 경기도 시흥시 공단2대로318번길 14-0 신한코아(주)
5th row: 강원도 태백시 철암공단길
Value | Count | Frequency (%)
경기도 | 47 | 2.5%
경기 | 32 | 1.7%
대전 | 28 | 1.5%
경남 | 26 | 1.4%
유성구 | 25 | 1.3%
서울 | 24 | 1.3%
창원시 | 13 | 0.7%
성남시 | 13 | 0.7%
서구 | 12 | 0.6%
서울특별시 | 12 | 0.6%
Other values (1111) | 1634 | 87.6%

Most occurring characters

ValueCountFrequency (%)
1557
 
17.6%
1 304
 
3.4%
273
 
3.1%
237
 
2.7%
230
 
2.6%
2 208
 
2.3%
3 188
 
2.1%
182
 
2.1%
0 179
 
2.0%
( 146
 
1.6%
Other values (343) 5349
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5142
58.1%
Decimal Number 1564
 
17.7%
Space Separator 1557
 
17.6%
Open Punctuation 146
 
1.6%
Close Punctuation 146
 
1.6%
Dash Punctuation 129
 
1.5%
Other Punctuation 116
 
1.3%
Uppercase Letter 49
 
0.6%
Math Symbol 2
 
< 0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
273
 
5.3%
237
 
4.6%
230
 
4.5%
182
 
3.5%
143
 
2.8%
140
 
2.7%
140
 
2.7%
131
 
2.5%
111
 
2.2%
107
 
2.1%
Other values (307) 3448
67.1%
Uppercase Letter
ValueCountFrequency (%)
B 10
20.4%
A 7
14.3%
C 5
10.2%
F 3
 
6.1%
D 3
 
6.1%
I 3
 
6.1%
S 3
 
6.1%
K 2
 
4.1%
T 2
 
4.1%
O 2
 
4.1%
Other values (7) 9
18.4%
Decimal Number
ValueCountFrequency (%)
1 304
19.4%
2 208
13.3%
3 188
12.0%
0 179
11.4%
4 141
9.0%
5 135
8.6%
6 125
8.0%
7 121
 
7.7%
8 94
 
6.0%
9 69
 
4.4%
Other Punctuation
ValueCountFrequency (%)
, 115
99.1%
/ 1
 
0.9%
Other Symbol
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
1557
100.0%
Open Punctuation
ValueCountFrequency (%)
( 146
100.0%
Close Punctuation
ValueCountFrequency (%)
) 146
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 129
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5143
58.1%
Common 3661
41.4%
Latin 49
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
273
 
5.3%
237
 
4.6%
230
 
4.5%
182
 
3.5%
143
 
2.8%
140
 
2.7%
140
 
2.7%
131
 
2.5%
111
 
2.2%
107
 
2.1%
Other values (308) 3449
67.1%
Common
ValueCountFrequency (%)
1557
42.5%
1 304
 
8.3%
2 208
 
5.7%
3 188
 
5.1%
0 179
 
4.9%
( 146
 
4.0%
) 146
 
4.0%
4 141
 
3.9%
5 135
 
3.7%
- 129
 
3.5%
Other values (8) 528
 
14.4%
Latin
ValueCountFrequency (%)
B 10
20.4%
A 7
14.3%
C 5
10.2%
F 3
 
6.1%
D 3
 
6.1%
I 3
 
6.1%
S 3
 
6.1%
K 2
 
4.1%
T 2
 
4.1%
O 2
 
4.1%
Other values (7) 9
18.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5142
58.1%
ASCII 3709
41.9%
None 1
 
< 0.1%
Enclosed Alphanum 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1557
42.0%
1 304
 
8.2%
2 208
 
5.6%
3 188
 
5.1%
0 179
 
4.8%
( 146
 
3.9%
) 146
 
3.9%
4 141
 
3.8%
5 135
 
3.6%
- 129
 
3.5%
Other values (24) 576
 
15.5%
Hangul
ValueCountFrequency (%)
273
 
5.3%
237
 
4.6%
230
 
4.5%
182
 
3.5%
143
 
2.8%
140
 
2.7%
140
 
2.7%
131
 
2.5%
111
 
2.2%
107
 
2.1%
Other values (307) 3448
67.1%
None
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%

대표자 (representative)
Text

Distinct: 306
Distinct (%): 94.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

Length

Max length: 8
Median length: 3
Mean length: 3.1203704
Min length: 2

Characters and Unicode

Total characters: 1011
Distinct characters: 148
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 290
Unique (%): 89.5%

Sample

1st row: 박훈양
2nd row: 최준호
3rd row: 신주열
4th row: 안중산
5th row: 김영진
Value | Count | Frequency (%)
홍명기 | 3 | 0.9%
최승선 | 3 | 0.9%
정명환 | 2 | 0.6%
이재연 | 2 | 0.6%
박훈양 | 2 | 0.6%
유수남 | 2 | 0.6%
구태규 | 2 | 0.6%
유승구 | 2 | 0.6%
홍복용 | 2 | 0.6%
지용수 | 2 | 0.6%
Other values (300) | 308 | 93.3%

Most occurring characters

ValueCountFrequency (%)
59
 
5.8%
55
 
5.4%
33
 
3.3%
30
 
3.0%
29
 
2.9%
23
 
2.3%
22
 
2.2%
21
 
2.1%
21
 
2.1%
20
 
2.0%
Other values (138) 698
69.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 985
97.4%
Space Separator 20
 
2.0%
Other Punctuation 6
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
59
 
6.0%
55
 
5.6%
33
 
3.4%
30
 
3.0%
29
 
2.9%
23
 
2.3%
22
 
2.2%
21
 
2.1%
21
 
2.1%
20
 
2.0%
Other values (136) 672
68.2%
Space Separator
ValueCountFrequency (%)
20
100.0%
Other Punctuation
ValueCountFrequency (%)
, 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 985
97.4%
Common 26
 
2.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
59
 
6.0%
55
 
5.6%
33
 
3.4%
30
 
3.0%
29
 
2.9%
23
 
2.3%
22
 
2.2%
21
 
2.1%
21
 
2.1%
20
 
2.0%
Other values (136) 672
68.2%
Common
ValueCountFrequency (%)
20
76.9%
, 6
 
23.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 985
97.4%
ASCII 26
 
2.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
59
 
6.0%
55
 
5.6%
33
 
3.4%
30
 
3.0%
29
 
2.9%
23
 
2.3%
22
 
2.2%
21
 
2.1%
21
 
2.1%
20
 
2.0%
Other values (136) 672
68.2%
ASCII
ValueCountFrequency (%)
20
76.9%
, 6
 
23.1%

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

# | 승인번호 | 업체명 | 우편번호 | 업체주소 | 대표자
0 | 제2012-12호 | (주)에너테크 | 462-807 | 경기도 성남시 중원구 둔촌대로 545 | 박훈양
1 | 제2012-16호 | (주)비엔씨넷 | 331-816 | 충청남도 천안시 서북구 직산로 136-0 천안밸리 | 최준호
2 | 제2012-18호 | 농업법인자연드림(주) | 363-883 | 충청북도 청원군 오창읍 연구단지로 76 충북테크노파크 미래융합기술관419호 | 신주열
3 | 제2012-30호 | 신한코아(주) | 429-450 | 경기도 시흥시 공단2대로318번길 14-0 신한코아(주) | 안중산
4 | 제2013-12호 | 엔에스엘이디(주) | 235-200 | 강원도 태백시 철암공단길 | 김영진
5 | 제2013-18호 | 에피텍(주) | 150-863 | 서울특별시 영등포구 양평로 22길 21 | 송석용
6 | 제2013-21호 | (주)한양엔지니어링 | 462-807 | 경기도 성남시 중원구 사기막골로90번길 36-0 | 송인진
7 | 제2013-22호 | (주)우리미래 | 425-780 | 경기도 안산시 단원구 산단로 342-0 유통상가 | 박한열
8 | 제2013-24호 | 주식회사 와이앤피 | 302-845 | 대전 서구 용문동 255-59 | 양승은
9 | 제2014-04호 | 메플릿산업주식회사 | 425-834 | 경기도 안산시 단원구 성곡로146번길 9-0 A동 | 이관희

Last rows

# | 승인번호 | 업체명 | 우편번호 | 업체주소 | 대표자
314 | 제2023-032호 | 에스이랩스 | 51391 | 경남 창원시 의창구 차룡로48번길 54 (팔용동) | 송기석
315 | 제2023-033호 | 성신전자 | 51390 | 경남 창원시 의창구 죽전로74번길 33 (팔용동) | 박선옥
316 | 제2023-034호 | 앤드박스(END-BOX) | <NA> | 서울 노원구 한글비석로24바길 20, 2층 | 윤일
317 | 제2023-035호 | 주식회사 에너파이브 | 63200 | 제주특별자치도 제주시 도남로 37 (도남동, 대우빌딩) 2층 | 김은태
318 | 제2023-036호 | (주)복용 | 14557 | 경기 부천시 부천로198번길 18 (춘의동, 춘의테크노파크II) 202동 1401호 | 홍복용
319 | 제2023-037호 | (주)애드위너 | 44412 | 울산 중구 종가로 15 (다운동, 울산테크노파크) 정밀화학소재기술연구소동 213호 | 조양래
320 | 제2023-038호 | (주)중부화물터미널 | 31212 | 충남 천안시 동남구 청당산업길 33 중부화물터미널 | 김현희
321 | 제2023-039호 | (주)광명바이오산업 | 62415 | 광주광역시 광산구 평동로 803번길 117-84번지 | 정광우
322 | 제2023-040호 | 주식회사 씨엔컴퍼니 | 48741 | 부산 동구 조방로 14 (범일동, 동일타워) 807호 | 박창준
323 | 제2023-041호 | 세연시스템즈 | <NA> | 서울 구로구 디지털로30길 28 (구로동) | 박노성