Overview

Dataset statistics

Number of variables7
Number of observations1326
Missing cells1593
Missing cells (%)17.2%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory75.2 KiB
Average record size in memory58.1 B

Variable types

Numeric2
Text4
Categorical1

Dataset

Description계룡시 관내의 계룡사랑상품권 가맹점 현황(사업자등록번호,가맹점명, 업종, 주소)에 관한 공공데이터를 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=74&beforeMenuCd=DOM_000000201001001000&publicdatapk=15108270

Alerts

Dataset has 1 (0.1%) duplicate rowsDuplicates
가맹점상세주소 has 727 (54.8%) missing valuesMissing
업종상세(중분류) has 866 (65.3%) missing valuesMissing

Reproduction

Analysis started2024-01-09 21:33:42.992906
Analysis finished2024-01-09 21:33:44.120155
Duration1.13 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업자등록번호
Real number (ℝ)

Distinct1324
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.2027135 × 109
Minimum1.0107844 × 109
Maximum8.991701 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.8 KiB
2024-01-10T06:33:44.177233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.0107844 × 109
5-th percentile1.4466846 × 109
Q13.0804402 × 109
median3.0890643 × 109
Q35.6121266 × 109
95-th percentile8.4311512 × 109
Maximum8.991701 × 109
Range7.9809167 × 109
Interquartile range (IQR)2.5316864 × 109

Descriptive statistics

Standard deviation2.0825098 × 109
Coefficient of variation (CV)0.49551553
Kurtosis-0.4687209
Mean4.2027135 × 109
Median Absolute Deviation (MAD)9.6686958 × 108
Skewness0.78827664
Sum5.5727982 × 1012
Variance4.3368472 × 1018
MonotonicityNot monotonic
2024-01-10T06:33:44.288800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3088100621 2
 
0.2%
7660302899 2
 
0.2%
2202501436 1
 
0.1%
3088114668 1
 
0.1%
3080726897 1
 
0.1%
3080697449 1
 
0.1%
1093981300 1
 
0.1%
3080853687 1
 
0.1%
3081151296 1
 
0.1%
2150951201 1
 
0.1%
Other values (1314) 1314
99.1%
ValueCountFrequency (%)
1010784355 1
0.1%
1011246582 1
0.1%
1012360784 1
0.1%
1018660120 1
0.1%
1020805063 1
0.1%
1021695177 1
0.1%
1040460502 1
0.1%
1042550203 1
0.1%
1043651168 1
0.1%
1046500392 1
0.1%
ValueCountFrequency (%)
8991701033 1
0.1%
8990301583 1
0.1%
8980200487 1
0.1%
8978200303 1
0.1%
8974400164 1
0.1%
8942801465 1
0.1%
8941001359 1
0.1%
8932800819 1
0.1%
8931500861 1
0.1%
8926100579 1
0.1%
Distinct1320
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size10.5 KiB
2024-01-10T06:33:44.513317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length22
Mean length6.8423831
Min length1

Characters and Unicode

Total characters9073
Distinct characters702
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1314 ?
Unique (%)99.1%

Sample

1st row아침산 저녁바다 계룡점
2nd row방앗간순대국
3rd row본가 왕대박
4th row라온점핑클럽 점핑하이 대실점
5th row바른정육
ValueCountFrequency (%)
계룡점 55
 
3.0%
주식회사 15
 
0.8%
계룡 13
 
0.7%
계룡엄사점 13
 
0.7%
엄사점 7
 
0.4%
계룡대점 6
 
0.3%
세븐일레븐 6
 
0.3%
카페 5
 
0.3%
농업회사법인 5
 
0.3%
씨유 4
 
0.2%
Other values (1596) 1689
92.9%
2024-01-10T06:33:44.846924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
494
 
5.4%
261
 
2.9%
247
 
2.7%
199
 
2.2%
177
 
2.0%
152
 
1.7%
147
 
1.6%
125
 
1.4%
102
 
1.1%
102
 
1.1%
Other values (692) 7067
77.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7942
87.5%
Space Separator 494
 
5.4%
Uppercase Letter 200
 
2.2%
Lowercase Letter 159
 
1.8%
Decimal Number 84
 
0.9%
Close Punctuation 77
 
0.8%
Open Punctuation 77
 
0.8%
Other Punctuation 40
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
261
 
3.3%
247
 
3.1%
199
 
2.5%
177
 
2.2%
152
 
1.9%
147
 
1.9%
125
 
1.6%
102
 
1.3%
102
 
1.3%
95
 
1.2%
Other values (628) 6335
79.8%
Uppercase Letter
ValueCountFrequency (%)
S 22
 
11.0%
B 17
 
8.5%
G 17
 
8.5%
E 16
 
8.0%
A 16
 
8.0%
M 14
 
7.0%
C 14
 
7.0%
T 10
 
5.0%
L 9
 
4.5%
I 8
 
4.0%
Other values (13) 57
28.5%
Lowercase Letter
ValueCountFrequency (%)
e 25
15.7%
a 19
11.9%
r 14
 
8.8%
n 11
 
6.9%
c 10
 
6.3%
i 10
 
6.3%
f 9
 
5.7%
o 8
 
5.0%
m 7
 
4.4%
p 7
 
4.4%
Other values (11) 39
24.5%
Decimal Number
ValueCountFrequency (%)
2 23
27.4%
5 20
23.8%
1 11
13.1%
0 6
 
7.1%
4 5
 
6.0%
3 5
 
6.0%
6 5
 
6.0%
9 5
 
6.0%
8 3
 
3.6%
7 1
 
1.2%
Other Punctuation
ValueCountFrequency (%)
, 13
32.5%
& 10
25.0%
. 6
15.0%
/ 3
 
7.5%
# 3
 
7.5%
; 3
 
7.5%
' 2
 
5.0%
Space Separator
ValueCountFrequency (%)
494
100.0%
Close Punctuation
ValueCountFrequency (%)
) 77
100.0%
Open Punctuation
ValueCountFrequency (%)
( 77
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7941
87.5%
Common 772
 
8.5%
Latin 359
 
4.0%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
261
 
3.3%
247
 
3.1%
199
 
2.5%
177
 
2.2%
152
 
1.9%
147
 
1.9%
125
 
1.6%
102
 
1.3%
102
 
1.3%
95
 
1.2%
Other values (627) 6334
79.8%
Latin
ValueCountFrequency (%)
e 25
 
7.0%
S 22
 
6.1%
a 19
 
5.3%
B 17
 
4.7%
G 17
 
4.7%
E 16
 
4.5%
A 16
 
4.5%
M 14
 
3.9%
C 14
 
3.9%
r 14
 
3.9%
Other values (34) 185
51.5%
Common
ValueCountFrequency (%)
494
64.0%
) 77
 
10.0%
( 77
 
10.0%
2 23
 
3.0%
5 20
 
2.6%
, 13
 
1.7%
1 11
 
1.4%
& 10
 
1.3%
0 6
 
0.8%
. 6
 
0.8%
Other values (10) 35
 
4.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7941
87.5%
ASCII 1131
 
12.5%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
494
43.7%
) 77
 
6.8%
( 77
 
6.8%
e 25
 
2.2%
2 23
 
2.0%
S 22
 
1.9%
5 20
 
1.8%
a 19
 
1.7%
B 17
 
1.5%
G 17
 
1.5%
Other values (54) 340
30.1%
Hangul
ValueCountFrequency (%)
261
 
3.3%
247
 
3.1%
199
 
2.5%
177
 
2.2%
152
 
1.9%
147
 
1.9%
125
 
1.6%
102
 
1.3%
102
 
1.3%
95
 
1.2%
Other values (627) 6334
79.8%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct671
Distinct (%)50.6%
Missing0
Missing (%)0.0%
Memory size10.5 KiB
2024-01-10T06:33:45.085534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length35
Mean length21.888386
Min length18

Characters and Unicode

Total characters29024
Distinct characters216
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique437 ?
Unique (%)33.0%

Sample

1st row충청남도 계룡시 엄사면 전원로 5
2nd row충청남도 계룡시 금암2길 37(금암동)
3rd row충청남도 계룡시 엄사면 전원로 12
4th row충청남도 계룡시 두마면 농소로 63(계룡 푸르지오 더 퍼스트)
5th row충청남도 계룡시 두마면 농소로 38
ValueCountFrequency (%)
충청남도 1291
19.8%
계룡시 1257
19.3%
엄사면 786
 
12.1%
번영로 202
 
3.1%
엄사중앙로 179
 
2.7%
금암동 108
 
1.7%
두마면 107
 
1.6%
장안로 97
 
1.5%
번영3길 75
 
1.2%
계룡대로 73
 
1.1%
Other values (662) 2346
36.0%
2024-01-10T06:33:45.421266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5195
17.9%
1434
 
4.9%
1421
 
4.9%
1383
 
4.8%
1308
 
4.5%
1304
 
4.5%
1298
 
4.5%
1296
 
4.5%
1015
 
3.5%
972
 
3.3%
Other values (206) 12398
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19153
66.0%
Space Separator 5195
 
17.9%
Decimal Number 3723
 
12.8%
Close Punctuation 448
 
1.5%
Open Punctuation 448
 
1.5%
Other Punctuation 45
 
0.2%
Lowercase Letter 6
 
< 0.1%
Uppercase Letter 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1434
 
7.5%
1421
 
7.4%
1383
 
7.2%
1308
 
6.8%
1304
 
6.8%
1298
 
6.8%
1296
 
6.8%
1015
 
5.3%
972
 
5.1%
966
 
5.0%
Other values (189) 6756
35.3%
Decimal Number
ValueCountFrequency (%)
1 830
22.3%
3 480
12.9%
2 417
11.2%
4 394
10.6%
6 354
9.5%
5 269
 
7.2%
0 265
 
7.1%
7 262
 
7.0%
9 260
 
7.0%
8 192
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
L 3
50.0%
H 3
50.0%
Space Separator
ValueCountFrequency (%)
5195
100.0%
Close Punctuation
ValueCountFrequency (%)
) 448
100.0%
Open Punctuation
ValueCountFrequency (%)
( 448
100.0%
Other Punctuation
ValueCountFrequency (%)
, 45
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19153
66.0%
Common 9859
34.0%
Latin 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1434
 
7.5%
1421
 
7.4%
1383
 
7.2%
1308
 
6.8%
1304
 
6.8%
1298
 
6.8%
1296
 
6.8%
1015
 
5.3%
972
 
5.1%
966
 
5.0%
Other values (189) 6756
35.3%
Common
ValueCountFrequency (%)
5195
52.7%
1 830
 
8.4%
3 480
 
4.9%
) 448
 
4.5%
( 448
 
4.5%
2 417
 
4.2%
4 394
 
4.0%
6 354
 
3.6%
5 269
 
2.7%
0 265
 
2.7%
Other values (4) 759
 
7.7%
Latin
ValueCountFrequency (%)
e 6
50.0%
L 3
25.0%
H 3
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19153
66.0%
ASCII 9871
34.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5195
52.6%
1 830
 
8.4%
3 480
 
4.9%
) 448
 
4.5%
( 448
 
4.5%
2 417
 
4.2%
4 394
 
4.0%
6 354
 
3.6%
5 269
 
2.7%
0 265
 
2.7%
Other values (7) 771
 
7.8%
Hangul
ValueCountFrequency (%)
1434
 
7.5%
1421
 
7.4%
1383
 
7.2%
1308
 
6.8%
1304
 
6.8%
1298
 
6.8%
1296
 
6.8%
1015
 
5.3%
972
 
5.1%
966
 
5.0%
Other values (189) 6756
35.3%

가맹점상세주소
Text

MISSING 

Distinct299
Distinct (%)49.9%
Missing727
Missing (%)54.8%
Memory size10.5 KiB
2024-01-10T06:33:45.617566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length22
Mean length6.1903172
Min length1

Characters and Unicode

Total characters3708
Distinct characters163
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique262 ?
Unique (%)43.7%

Sample

1st row1층
2nd row1층
3rd row상가동 1층 102호 103호
4th row1층 102호 103호
5th row1층
ValueCountFrequency (%)
1층 219
24.7%
2층 59
 
6.6%
102호 43
 
4.8%
101호 38
 
4.3%
3층 29
 
3.3%
103호 26
 
2.9%
상가동 20
 
2.3%
201호 15
 
1.7%
105호 15
 
1.7%
104호 14
 
1.6%
Other values (254) 410
46.2%
2024-01-10T06:33:45.919564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 673
18.1%
368
 
9.9%
357
 
9.6%
0 339
 
9.1%
289
 
7.8%
2 253
 
6.8%
3 119
 
3.2%
( 95
 
2.6%
) 95
 
2.6%
89
 
2.4%
Other values (153) 1031
27.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1629
43.9%
Decimal Number 1577
42.5%
Space Separator 289
 
7.8%
Open Punctuation 95
 
2.6%
Close Punctuation 95
 
2.6%
Other Punctuation 13
 
0.4%
Uppercase Letter 7
 
0.2%
Math Symbol 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
368
22.6%
357
21.9%
89
 
5.5%
75
 
4.6%
72
 
4.4%
46
 
2.8%
45
 
2.8%
38
 
2.3%
36
 
2.2%
36
 
2.2%
Other values (134) 467
28.7%
Decimal Number
ValueCountFrequency (%)
1 673
42.7%
0 339
21.5%
2 253
 
16.0%
3 119
 
7.5%
4 76
 
4.8%
5 45
 
2.9%
6 23
 
1.5%
8 21
 
1.3%
7 17
 
1.1%
9 11
 
0.7%
Uppercase Letter
ValueCountFrequency (%)
B 5
71.4%
D 1
 
14.3%
S 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
, 12
92.3%
. 1
 
7.7%
Space Separator
ValueCountFrequency (%)
289
100.0%
Open Punctuation
ValueCountFrequency (%)
( 95
100.0%
Close Punctuation
ValueCountFrequency (%)
) 95
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2072
55.9%
Hangul 1629
43.9%
Latin 7
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
368
22.6%
357
21.9%
89
 
5.5%
75
 
4.6%
72
 
4.4%
46
 
2.8%
45
 
2.8%
38
 
2.3%
36
 
2.2%
36
 
2.2%
Other values (134) 467
28.7%
Common
ValueCountFrequency (%)
1 673
32.5%
0 339
16.4%
289
13.9%
2 253
 
12.2%
3 119
 
5.7%
( 95
 
4.6%
) 95
 
4.6%
4 76
 
3.7%
5 45
 
2.2%
6 23
 
1.1%
Other values (6) 65
 
3.1%
Latin
ValueCountFrequency (%)
B 5
71.4%
D 1
 
14.3%
S 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2079
56.1%
Hangul 1629
43.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 673
32.4%
0 339
16.3%
289
13.9%
2 253
 
12.2%
3 119
 
5.7%
( 95
 
4.6%
) 95
 
4.6%
4 76
 
3.7%
5 45
 
2.2%
6 23
 
1.1%
Other values (9) 72
 
3.5%
Hangul
ValueCountFrequency (%)
368
22.6%
357
21.9%
89
 
5.5%
75
 
4.6%
72
 
4.4%
46
 
2.8%
45
 
2.8%
38
 
2.3%
36
 
2.2%
36
 
2.2%
Other values (134) 467
28.7%

우편번호
Real number (ℝ)

Distinct104
Distinct (%)7.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32610.624
Minimum2821
Maximum54575
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.8 KiB
2024-01-10T06:33:46.044924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2821
5-th percentile32801
Q132805
median32809
Q332826
95-th percentile32838
Maximum54575
Range51754
Interquartile range (IQR)21

Descriptive statistics

Standard deviation2462.558
Coefficient of variation (CV)0.075513981
Kurtosis92.840805
Mean32610.624
Median Absolute Deviation (MAD)8
Skewness-8.1565332
Sum43241687
Variance6064192
MonotonicityNot monotonic
2024-01-10T06:33:46.150876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
32826 153
 
11.5%
32805 146
 
11.0%
32804 133
 
10.0%
32808 78
 
5.9%
32801 74
 
5.6%
32824 63
 
4.8%
32809 62
 
4.7%
32806 55
 
4.1%
32803 43
 
3.2%
32812 32
 
2.4%
Other values (94) 487
36.7%
ValueCountFrequency (%)
2821 1
 
0.1%
4566 1
 
0.1%
6564 1
 
0.1%
7337 3
0.2%
8291 1
 
0.1%
8606 1
 
0.1%
8828 1
 
0.1%
14238 1
 
0.1%
14548 1
 
0.1%
17766 1
 
0.1%
ValueCountFrequency (%)
54575 1
 
0.1%
44774 1
 
0.1%
41757 1
 
0.1%
35310 1
 
0.1%
35221 1
 
0.1%
35214 1
 
0.1%
34433 1
 
0.1%
34139 1
 
0.1%
34126 8
0.6%
34016 1
 
0.1%

가맹점유형
Categorical

Distinct8
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size10.5 KiB
음식점업
440 
소매업
377 
개인서비스업
280 
교육서비스업
114 
제조업
 
39
Other values (3)
76 

Length

Max length11
Median length6
Mean length4.2488688
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row음식점업
2nd row음식점업
3rd row음식점업
4th row개인서비스업
5th row소매업

Common Values

ValueCountFrequency (%)
음식점업 440
33.2%
소매업 377
28.4%
개인서비스업 280
21.1%
교육서비스업 114
 
8.6%
제조업 39
 
2.9%
기타 38
 
2.9%
보건업 29
 
2.2%
스포츠여가관련서비스업 9
 
0.7%

Length

2024-01-10T06:33:46.246753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:33:46.331442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
음식점업 440
33.2%
소매업 377
28.4%
개인서비스업 280
21.1%
교육서비스업 114
 
8.6%
제조업 39
 
2.9%
기타 38
 
2.9%
보건업 29
 
2.2%
스포츠여가관련서비스업 9
 
0.7%
Distinct265
Distinct (%)57.6%
Missing866
Missing (%)65.3%
Memory size10.5 KiB
2024-01-10T06:33:46.523145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length18
Mean length4.7217391
Min length1

Characters and Unicode

Total characters2172
Distinct characters280
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique210 ?
Unique (%)45.7%

Sample

1st row한식
2nd row한식
3rd row점핑
4th row육류
5th row김치,절임식품외
ValueCountFrequency (%)
한식 70
 
11.9%
22
 
3.8%
커피 17
 
2.9%
분식 11
 
1.9%
의류 10
 
1.7%
편의점 9
 
1.5%
소매업 9
 
1.5%
치킨 9
 
1.5%
피자 8
 
1.4%
8
 
1.4%
Other values (286) 413
70.5%
2024-01-10T06:33:46.825226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
133
 
6.1%
127
 
5.8%
75
 
3.5%
, 66
 
3.0%
57
 
2.6%
46
 
2.1%
41
 
1.9%
41
 
1.9%
40
 
1.8%
36
 
1.7%
Other values (270) 1510
69.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1944
89.5%
Space Separator 127
 
5.8%
Other Punctuation 66
 
3.0%
Open Punctuation 10
 
0.5%
Close Punctuation 10
 
0.5%
Decimal Number 9
 
0.4%
Uppercase Letter 5
 
0.2%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
133
 
6.8%
75
 
3.9%
57
 
2.9%
46
 
2.4%
41
 
2.1%
41
 
2.1%
40
 
2.1%
36
 
1.9%
35
 
1.8%
32
 
1.6%
Other values (257) 1408
72.4%
Decimal Number
ValueCountFrequency (%)
5 4
44.4%
8 1
 
11.1%
0 1
 
11.1%
1 1
 
11.1%
2 1
 
11.1%
4 1
 
11.1%
Uppercase Letter
ValueCountFrequency (%)
P 3
60.0%
C 2
40.0%
Space Separator
ValueCountFrequency (%)
127
100.0%
Other Punctuation
ValueCountFrequency (%)
, 66
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1944
89.5%
Common 222
 
10.2%
Latin 6
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
133
 
6.8%
75
 
3.9%
57
 
2.9%
46
 
2.4%
41
 
2.1%
41
 
2.1%
40
 
2.1%
36
 
1.9%
35
 
1.8%
32
 
1.6%
Other values (257) 1408
72.4%
Common
ValueCountFrequency (%)
127
57.2%
, 66
29.7%
( 10
 
4.5%
) 10
 
4.5%
5 4
 
1.8%
8 1
 
0.5%
0 1
 
0.5%
1 1
 
0.5%
2 1
 
0.5%
4 1
 
0.5%
Latin
ValueCountFrequency (%)
P 3
50.0%
C 2
33.3%
c 1
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1943
89.5%
ASCII 228
 
10.5%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
133
 
6.8%
75
 
3.9%
57
 
2.9%
46
 
2.4%
41
 
2.1%
41
 
2.1%
40
 
2.1%
36
 
1.9%
35
 
1.8%
32
 
1.6%
Other values (256) 1407
72.4%
ASCII
ValueCountFrequency (%)
127
55.7%
, 66
28.9%
( 10
 
4.4%
) 10
 
4.4%
5 4
 
1.8%
P 3
 
1.3%
C 2
 
0.9%
8 1
 
0.4%
0 1
 
0.4%
1 1
 
0.4%
Other values (3) 3
 
1.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Interactions

2024-01-10T06:33:43.743238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:33:43.582039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:33:43.825296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:33:43.667761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:33:46.900514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자등록번호우편번호가맹점유형
사업자등록번호1.0000.0920.155
우편번호0.0921.0000.000
가맹점유형0.1550.0001.000
2024-01-10T06:33:46.966522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자등록번호우편번호가맹점유형
사업자등록번호1.0000.0130.074
우편번호0.0131.0000.000
가맹점유형0.0740.0001.000

Missing values

2024-01-10T06:33:43.915001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:33:44.006943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T06:33:44.081933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

사업자등록번호가맹점명가맹점기본주소(도로명)가맹점상세주소우편번호가맹점유형업종상세(중분류)
02202501436아침산 저녁바다 계룡점충청남도 계룡시 엄사면 전원로 51층32805음식점업한식
11633201072방앗간순대국충청남도 계룡시 금암2길 37(금암동)1층32823음식점업한식
28974400164본가 왕대박충청남도 계룡시 엄사면 전원로 12<NA>32803음식점업<NA>
33335300532라온점핑클럽 점핑하이 대실점충청남도 계룡시 두마면 농소로 63(계룡 푸르지오 더 퍼스트)상가동 1층 102호 103호32844개인서비스업점핑
47660302899바른정육충청남도 계룡시 두마면 농소로 381층 102호 103호32844소매업육류
54672001312오롯이 사골곰탕충청남도 계룡시 엄사면 번영로 461층32805소매업김치,절임식품외
67660302899바른정육충청남도 계룡시 두마면 농소로 381층 102호 103호32844소매업육류
73192901281아너(Honer)필라테스충청남도 계룡시 두마면 농소로 325층 506호 507호32844교육서비스업기타 스포츠 교육기관
87381300556요가 숨충청남도 계룡시 엄사면 엄사중앙로 985층32804개인서비스업<NA>
95486200780에쏘르아뜰리에충청남도 계룡시 엄사면 계룡대로 5184층32801교육서비스업교습소, 공방
사업자등록번호가맹점명가맹점기본주소(도로명)가맹점상세주소우편번호가맹점유형업종상세(중분류)
13162131801240판돈충청남도 계룡시 엄사면 번영10길 2916<NA>32808소매업<NA>
13172193012223팔복떡집충청남도 계룡시 엄사면 번영로 54<NA>32805제조업<NA>
13185943800148풍년닭갈비충청남도 계룡시 엄사면 번영2길 51<NA>32806음식점업<NA>
13193089046518풍성한과일충청남도 계룡시 엄사면 번영로 44<NA>32805소매업<NA>
13204315600026하나미술충청남도 계룡시 엄사면 엄사중앙로 66 (성원아파트)<NA>32812교육서비스업<NA>
13213080742693한우뱅이충청남도 계룡시 엄사면 전원로 177<NA>32805음식점업<NA>
13223080608664함스헤어충청남도 계룡시 엄사면 번영9길 34<NA>32808개인서비스업<NA>
13236763200486향적산 한상충청남도 계룡시 엄사면 광석향한길 2211<NA>32816음식점업<NA>
13243089048229현대내과의원충청남도 계룡시 엄사면 엄사중앙로 96<NA>32804보건업<NA>
13253140537610현대상사충청남도 계룡시 엄사면 엄사중앙로 97<NA>32805소매업<NA>

Duplicate rows

Most frequently occurring

사업자등록번호가맹점명가맹점기본주소(도로명)가맹점상세주소우편번호가맹점유형업종상세(중분류)# duplicates
07660302899바른정육충청남도 계룡시 두마면 농소로 381층 102호 103호32844소매업육류2