Overview

Dataset statistics

Number of variables9
Number of observations4349
Missing cells4416
Missing cells (%)11.3%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory310.2 KiB
Average record size in memory73.0 B

Variable types

Categorical4
Text2
DateTime2
Unsupported1

Alerts

집계년월 has constant value ""Constant
Dataset has 1 (< 0.1%) duplicate rowsDuplicates
수리일 has 67 (1.5%) missing valuesMissing
대표자성명 has 4349 (100.0%) missing valuesMissing
대표자성명 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-03 18:44:41.101362
Analysis finished2024-05-03 18:44:44.146755
Duration3.05 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

집계년월
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
2024-03
4349 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-03
2nd row2024-03
3rd row2024-03
4th row2024-03
5th row2024-03

Common Values

ValueCountFrequency (%)
2024-03 4349
100.0%

Length

2024-05-03T18:44:44.315864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-03T18:44:44.712700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-03 4349
100.0%

시군명
Categorical

Distinct31
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
수원시
381 
성남시
318 
화성시
306 
고양시
297 
용인시
 
228
Other values (26)
2819 

Length

Max length4
Median length3
Mean length3.0758795
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
수원시 381
 
8.8%
성남시 318
 
7.3%
화성시 306
 
7.0%
고양시 297
 
6.8%
용인시 228
 
5.2%
부천시 211
 
4.9%
안산시 207
 
4.8%
남양주시 186
 
4.3%
시흥시 183
 
4.2%
파주시 175
 
4.0%
Other values (21) 1857
42.7%

Length

2024-05-03T18:44:45.004205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수원시 381
 
8.8%
성남시 318
 
7.3%
화성시 306
 
7.0%
고양시 297
 
6.8%
용인시 228
 
5.2%
부천시 211
 
4.9%
안산시 207
 
4.8%
남양주시 186
 
4.3%
시흥시 183
 
4.2%
파주시 175
 
4.0%
Other values (21) 1857
42.7%
Distinct4207
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
2024-05-03T18:44:45.664928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length27
Mean length9.8753736
Min length5

Characters and Unicode

Total characters42948
Distinct characters832
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4083 ?
Unique (%)93.9%

Sample

1st row에코피아가평발효팜협동조합
2nd row재즈팜장아찌협동조합
3rd row가평엘피지협동조합
4th row달샘협동조합
5th row가평민들레교육협동조합
ValueCountFrequency (%)
우리협동조합 5
 
0.1%
한마음협동조합 5
 
0.1%
생명살리기안심먹거리협동조합 4
 
0.1%
어울림협동조합 3
 
0.1%
다온협동조합 3
 
0.1%
마루협동조합 3
 
0.1%
홍익협동조합 3
 
0.1%
한국수입자동차정비협동조합 3
 
0.1%
다올협동조합 3
 
0.1%
한국강섬유공업협동조합 3
 
0.1%
Other values (4191) 4314
99.2%
2024-05-03T18:44:46.774509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4562
 
10.6%
4423
 
10.3%
4410
 
10.3%
4377
 
10.2%
512
 
1.2%
483
 
1.1%
427
 
1.0%
421
 
1.0%
376
 
0.9%
364
 
0.8%
Other values (822) 22593
52.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42126
98.1%
Uppercase Letter 260
 
0.6%
Lowercase Letter 189
 
0.4%
Decimal Number 138
 
0.3%
Close Punctuation 91
 
0.2%
Open Punctuation 91
 
0.2%
Other Punctuation 46
 
0.1%
Dash Punctuation 6
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4562
 
10.8%
4423
 
10.5%
4410
 
10.5%
4377
 
10.4%
512
 
1.2%
483
 
1.1%
427
 
1.0%
421
 
1.0%
376
 
0.9%
364
 
0.9%
Other values (753) 21771
51.7%
Uppercase Letter
ValueCountFrequency (%)
A 25
 
9.6%
S 23
 
8.8%
C 19
 
7.3%
D 17
 
6.5%
O 16
 
6.2%
M 16
 
6.2%
E 15
 
5.8%
K 15
 
5.8%
T 14
 
5.4%
I 13
 
5.0%
Other values (14) 87
33.5%
Lowercase Letter
ValueCountFrequency (%)
o 23
12.2%
e 19
10.1%
i 17
 
9.0%
t 16
 
8.5%
a 14
 
7.4%
c 12
 
6.3%
r 12
 
6.3%
n 12
 
6.3%
l 9
 
4.8%
u 9
 
4.8%
Other values (12) 46
24.3%
Decimal Number
ValueCountFrequency (%)
1 41
29.7%
2 25
18.1%
3 20
14.5%
0 15
 
10.9%
4 12
 
8.7%
5 11
 
8.0%
6 7
 
5.1%
7 3
 
2.2%
9 3
 
2.2%
8 1
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 20
43.5%
, 9
19.6%
& 6
 
13.0%
" 4
 
8.7%
· 3
 
6.5%
' 2
 
4.3%
2
 
4.3%
Close Punctuation
ValueCountFrequency (%)
) 89
97.8%
] 2
 
2.2%
Open Punctuation
ValueCountFrequency (%)
( 89
97.8%
[ 2
 
2.2%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42124
98.1%
Latin 449
 
1.0%
Common 373
 
0.9%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4562
 
10.8%
4423
 
10.5%
4410
 
10.5%
4377
 
10.4%
512
 
1.2%
483
 
1.1%
427
 
1.0%
421
 
1.0%
376
 
0.9%
364
 
0.9%
Other values (751) 21769
51.7%
Latin
ValueCountFrequency (%)
A 25
 
5.6%
S 23
 
5.1%
o 23
 
5.1%
C 19
 
4.2%
e 19
 
4.2%
i 17
 
3.8%
D 17
 
3.8%
O 16
 
3.6%
t 16
 
3.6%
M 16
 
3.6%
Other values (36) 258
57.5%
Common
ValueCountFrequency (%)
) 89
23.9%
( 89
23.9%
1 41
11.0%
2 25
 
6.7%
. 20
 
5.4%
3 20
 
5.4%
0 15
 
4.0%
4 12
 
3.2%
5 11
 
2.9%
, 9
 
2.4%
Other values (13) 42
11.3%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42124
98.1%
ASCII 817
 
1.9%
None 5
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4562
 
10.8%
4423
 
10.5%
4410
 
10.5%
4377
 
10.4%
512
 
1.2%
483
 
1.1%
427
 
1.0%
421
 
1.0%
376
 
0.9%
364
 
0.9%
Other values (751) 21769
51.7%
ASCII
ValueCountFrequency (%)
) 89
 
10.9%
( 89
 
10.9%
1 41
 
5.0%
A 25
 
3.1%
2 25
 
3.1%
S 23
 
2.8%
o 23
 
2.8%
. 20
 
2.4%
3 20
 
2.4%
C 19
 
2.3%
Other values (57) 443
54.2%
None
ValueCountFrequency (%)
· 3
60.0%
2
40.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

유형명
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
사업자
3287 
다중이해관계자
774 
직원
 
170
소비자
 
118

Length

Max length7
Median length3
Mean length3.6727983
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업자
2nd row사업자
3rd row사업자
4th row사업자
5th row다중이해관계자

Common Values

ValueCountFrequency (%)
사업자 3287
75.6%
다중이해관계자 774
 
17.8%
직원 170
 
3.9%
소비자 118
 
2.7%

Length

2024-05-03T18:44:47.231102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-03T18:44:47.658717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업자 3287
75.6%
다중이해관계자 774
 
17.8%
직원 170
 
3.9%
소비자 118
 
2.7%

업종명
Categorical

Distinct20
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
도매 및 소매업
815 
교육 서비스업
759 
제조업
409 
예술, 스포츠 및 여가관련 서비스업
365 
농업, 어업 및 임업
325 
Other values (15)
1676 

Length

Max length31
Median length24
Mean length11.255231
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row예술, 스포츠 및 여가관련 서비스업
2nd row예술, 스포츠 및 여가관련 서비스업
3rd row전기, 가스, 증기 및 수도사업
4th row도매 및 소매업
5th row교육 서비스업

Common Values

ValueCountFrequency (%)
도매 및 소매업 815
18.7%
교육 서비스업 759
17.5%
제조업 409
9.4%
예술, 스포츠 및 여가관련 서비스업 365
8.4%
농업, 어업 및 임업 325
 
7.5%
전문, 과학 및 기술 서비스업 242
 
5.6%
협회 및 단체, 수리 및 기타 개인 서비스업 199
 
4.6%
보건업 및 사회복지서비스업 181
 
4.2%
출판, 영상, 방송통신 및 정보서비스업 171
 
3.9%
숙박 및 음식점업 168
 
3.9%
Other values (10) 715
16.4%

Length

2024-05-03T18:44:48.105057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
3149
21.5%
서비스업 1724
 
11.8%
도매 815
 
5.6%
소매업 815
 
5.6%
교육 759
 
5.2%
제조업 409
 
2.8%
예술 365
 
2.5%
스포츠 365
 
2.5%
여가관련 365
 
2.5%
농업 325
 
2.2%
Other values (48) 5574
38.0%
Distinct3200
Distinct (%)73.6%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
2024-05-03T18:44:48.757572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length212
Median length100
Mean length12.85146
Min length1

Characters and Unicode

Total characters55891
Distinct characters692
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2884 ?
Unique (%)66.3%

Sample

1st row농가공식품
2nd row농가공식품
3rd row엘피지(LPG) 가스
4th row조청, 들기름, 농산물 등
5th row교육 서비스
ValueCountFrequency (%)
1115
 
8.3%
498
 
3.7%
사업 383
 
2.8%
교육 352
 
2.6%
판매 206
 
1.5%
서비스 134
 
1.0%
서비스업 133
 
1.0%
교육서비스 112
 
0.8%
유통 111
 
0.8%
관련 107
 
0.8%
Other values (4235) 10322
76.6%
2024-05-03T18:44:49.949957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9124
 
16.3%
, 2677
 
4.8%
2500
 
4.5%
1558
 
2.8%
1157
 
2.1%
1026
 
1.8%
964
 
1.7%
951
 
1.7%
861
 
1.5%
847
 
1.5%
Other values (682) 34226
61.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43110
77.1%
Space Separator 9124
 
16.3%
Other Punctuation 2997
 
5.4%
Uppercase Letter 184
 
0.3%
Close Punctuation 169
 
0.3%
Open Punctuation 169
 
0.3%
Decimal Number 67
 
0.1%
Lowercase Letter 43
 
0.1%
Dash Punctuation 27
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2500
 
5.8%
1558
 
3.6%
1157
 
2.7%
1026
 
2.4%
964
 
2.2%
951
 
2.2%
861
 
2.0%
847
 
2.0%
796
 
1.8%
749
 
1.7%
Other values (621) 31701
73.5%
Uppercase Letter
ValueCountFrequency (%)
E 20
10.9%
D 20
10.9%
T 15
 
8.2%
P 14
 
7.6%
C 14
 
7.6%
L 13
 
7.1%
M 13
 
7.1%
A 12
 
6.5%
I 10
 
5.4%
R 10
 
5.4%
Other values (11) 43
23.4%
Lowercase Letter
ValueCountFrequency (%)
o 6
14.0%
n 6
14.0%
i 5
11.6%
e 4
9.3%
s 4
9.3%
d 3
7.0%
g 3
7.0%
w 2
 
4.7%
r 2
 
4.7%
a 2
 
4.7%
Other values (5) 6
14.0%
Other Punctuation
ValueCountFrequency (%)
, 2677
89.3%
· 132
 
4.4%
/ 102
 
3.4%
. 51
 
1.7%
11
 
0.4%
# 7
 
0.2%
? 7
 
0.2%
" 6
 
0.2%
& 2
 
0.1%
* 2
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 15
22.4%
3 15
22.4%
2 7
10.4%
4 7
10.4%
5 6
 
9.0%
6 5
 
7.5%
0 5
 
7.5%
7 4
 
6.0%
8 2
 
3.0%
9 1
 
1.5%
Space Separator
ValueCountFrequency (%)
9124
100.0%
Close Punctuation
ValueCountFrequency (%)
) 169
100.0%
Open Punctuation
ValueCountFrequency (%)
( 169
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43109
77.1%
Common 12554
 
22.5%
Latin 227
 
0.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2500
 
5.8%
1558
 
3.6%
1157
 
2.7%
1026
 
2.4%
964
 
2.2%
951
 
2.2%
861
 
2.0%
847
 
2.0%
796
 
1.8%
749
 
1.7%
Other values (620) 31700
73.5%
Latin
ValueCountFrequency (%)
E 20
 
8.8%
D 20
 
8.8%
T 15
 
6.6%
P 14
 
6.2%
C 14
 
6.2%
L 13
 
5.7%
M 13
 
5.7%
A 12
 
5.3%
I 10
 
4.4%
R 10
 
4.4%
Other values (26) 86
37.9%
Common
ValueCountFrequency (%)
9124
72.7%
, 2677
 
21.3%
) 169
 
1.3%
( 169
 
1.3%
· 132
 
1.1%
/ 102
 
0.8%
. 51
 
0.4%
- 27
 
0.2%
1 15
 
0.1%
3 15
 
0.1%
Other values (15) 73
 
0.6%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43092
77.1%
ASCII 12638
 
22.6%
None 143
 
0.3%
Compat Jamo 17
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9124
72.2%
, 2677
 
21.2%
) 169
 
1.3%
( 169
 
1.3%
/ 102
 
0.8%
. 51
 
0.4%
- 27
 
0.2%
E 20
 
0.2%
D 20
 
0.2%
T 15
 
0.1%
Other values (49) 264
 
2.1%
Hangul
ValueCountFrequency (%)
2500
 
5.8%
1558
 
3.6%
1157
 
2.7%
1026
 
2.4%
964
 
2.2%
951
 
2.2%
861
 
2.0%
847
 
2.0%
796
 
1.8%
749
 
1.7%
Other values (619) 31683
73.5%
None
ValueCountFrequency (%)
· 132
92.3%
11
 
7.7%
Compat Jamo
ValueCountFrequency (%)
17
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct2166
Distinct (%)49.8%
Missing0
Missing (%)0.0%
Memory size34.1 KiB
Minimum2012-12-03 00:00:00
Maximum2024-03-29 00:00:00
2024-05-03T18:44:50.288095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-03T18:44:50.666360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

수리일
Date

MISSING 

Distinct2055
Distinct (%)48.0%
Missing67
Missing (%)1.5%
Memory size34.1 KiB
Minimum2012-03-15 00:00:00
Maximum2024-04-24 00:00:00
2024-05-03T18:44:51.017466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-03T18:44:51.358526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

대표자성명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4349
Missing (%)100.0%
Memory size38.4 KiB

Correlations

2024-05-03T18:44:51.695571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명유형명업종명
시군명1.0000.3400.434
유형명0.3401.0000.256
업종명0.4340.2561.000
2024-05-03T18:44:51.968635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명유형명시군명
업종명1.0000.1420.129
유형명0.1421.0000.182
시군명0.1290.1821.000
2024-05-03T18:44:52.228908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명유형명업종명
시군명1.0000.1820.129
유형명0.1821.0000.142
업종명0.1290.1421.000

Missing values

2024-05-03T18:44:43.239010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-03T18:44:43.921444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

집계년월시군명협동조합사업장명유형명업종명품목내역신청일수리일대표자성명
02024-03가평군에코피아가평발효팜협동조합사업자예술, 스포츠 및 여가관련 서비스업농가공식품2015-01-232015-01-23<NA>
12024-03가평군재즈팜장아찌협동조합사업자예술, 스포츠 및 여가관련 서비스업농가공식품2015-01-232015-01-23<NA>
22024-03가평군가평엘피지협동조합사업자전기, 가스, 증기 및 수도사업엘피지(LPG) 가스2017-11-282017-12-05<NA>
32024-03가평군달샘협동조합사업자도매 및 소매업조청, 들기름, 농산물 등2017-11-222017-11-27<NA>
42024-03가평군가평민들레교육협동조합다중이해관계자교육 서비스업교육 서비스2017-07-282017-08-10<NA>
52024-03가평군가평북면언론협동조합사업자출판, 영상, 방송통신 및 정보서비스업소식지발행, 지역농산물, 신문발행, 관광상품, 비영리 자원봉사, 소외계층 및 지역발전사회 돌봄2017-06-082017-07-31<NA>
62024-03가평군가평군주민복지협동조합소비자협회 및 단체, 수리 및 기타 개인 서비스업방문세차서비스, 소분 및 반제품 납품2017-06-222017-07-03<NA>
72024-03가평군아홉마지기마을전통한과협동조합사업자도매 및 소매업한과2017-02-142017-03-24<NA>
82024-03가평군관광문화콘텐츠협동조합가치가사업자예술, 스포츠 및 여가관련 서비스업여행, 관광2017-03-082017-03-23<NA>
92024-03가평군가평농산물가공협동조합사업자제조업고추장, 조청, 매실고2016-12-282017-01-18<NA>
집계년월시군명협동조합사업장명유형명업종명품목내역신청일수리일대표자성명
43392024-03화성시천지명1협동조합다중이해관계자전문, 과학 및 기술 서비스업재생에너지, 태양광2021-08-022021-08-02<NA>
43402024-03화성시천지명4협동조합다중이해관계자전문, 과학 및 기술 서비스업재생에너지, 태양광2021-08-022021-08-02<NA>
43412024-03화성시사회복지준법연구소협동조합다중이해관계자교육 서비스업행정법률 자문2021-07-292021-07-29<NA>
43422024-03화성시리필락(RefillLock)협동조합사업자제조업소공인 신제품 연구, 개발2021-07-232021-07-23<NA>
43432024-03화성시가온나눔협동조합다중이해관계자전문, 과학 및 기술 서비스업재생에너지, 태양광2021-07-232021-07-23<NA>
43442024-03화성시이동목욕협동조합사업자보건업 및 사회복지서비스업보건업 ( 목욕문화서비스 사업)2018-06-122018-06-12<NA>
43452024-03화성시한국우드리싸이클협동조합다중이해관계자제조업목질원재료, 현장소모품2021-07-192021-07-19<NA>
43462024-03화성시한울나눔협동조합다중이해관계자전문, 과학 및 기술 서비스업재생에너지, 태양광2021-07-012021-07-01<NA>
43472024-03화성시해솔나눔협동조합다중이해관계자전문, 과학 및 기술 서비스업재생에너지, 태양광2021-07-012021-07-01<NA>
43482024-03화성시사랑나눔협동조합다중이해관계자전문, 과학 및 기술 서비스업재생에너지, 태양광2021-07-012021-07-01<NA>

Duplicate rows

Most frequently occurring

집계년월시군명협동조합사업장명유형명업종명품목내역신청일수리일# duplicates
02024-03부천시한국낚시협동조합사업자예술, 스포츠 및 여가관련 서비스업낚시용품 공동구매 공동판매2014-08-062014-08-062