Overview

Dataset statistics

Number of variables8
Number of observations990
Missing cells1312
Missing cells (%)16.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory62.0 KiB
Average record size in memory64.1 B

Variable types

Text8

Dataset

Description충청남도 보령시 통신 판매 업체의 법인 또는 상호, 대표자명, 소재지 우편번호, 소재지 주소, 도메인, 취급품목, 데이터기준일 안내 데이터입니다
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=410&beforeMenuCd=DOM_000000201001001000&publicdatapk=15037781

Alerts

전화번호 has 454 (45.9%) missing valuesMissing
소재지우편번호 has 156 (15.8%) missing valuesMissing
Unnamed: 7 has 700 (70.7%) missing valuesMissing

Reproduction

Analysis started2024-01-09 20:06:32.805015
Analysis finished2024-01-09 20:06:34.491556
Duration1.69 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct973
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2024-01-10T05:06:34.719536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length24
Mean length6.3949495
Min length2

Characters and Unicode

Total characters6331
Distinct characters626
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique958 ?
Unique (%)96.8%

Sample

1st row감성점빵
2nd row지니하우스
3rd row삼디라이프
4th row코드네임
5th row포도나라
ValueCountFrequency (%)
주식회사 36
 
3.0%
펜션 15
 
1.2%
9
 
0.7%
농업회사법인 8
 
0.7%
유한회사 6
 
0.5%
보령 6
 
0.5%
농장 4
 
0.3%
컴퍼니 3
 
0.2%
블루모텔 3
 
0.2%
협동조합 3
 
0.2%
Other values (1071) 1110
92.3%
2024-01-10T05:06:35.252053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
218
 
3.4%
123
 
1.9%
121
 
1.9%
115
 
1.8%
106
 
1.7%
105
 
1.7%
105
 
1.7%
97
 
1.5%
95
 
1.5%
94
 
1.5%
Other values (616) 5152
81.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5541
87.5%
Space Separator 218
 
3.4%
Lowercase Letter 197
 
3.1%
Uppercase Letter 157
 
2.5%
Open Punctuation 81
 
1.3%
Close Punctuation 81
 
1.3%
Decimal Number 31
 
0.5%
Dash Punctuation 10
 
0.2%
Other Punctuation 10
 
0.2%
Other Symbol 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
123
 
2.2%
121
 
2.2%
115
 
2.1%
106
 
1.9%
105
 
1.9%
105
 
1.9%
97
 
1.8%
95
 
1.7%
94
 
1.7%
94
 
1.7%
Other values (553) 4486
81.0%
Uppercase Letter
ValueCountFrequency (%)
O 17
 
10.8%
A 16
 
10.2%
P 11
 
7.0%
S 10
 
6.4%
D 10
 
6.4%
K 8
 
5.1%
B 7
 
4.5%
L 7
 
4.5%
E 6
 
3.8%
F 6
 
3.8%
Other values (14) 59
37.6%
Lowercase Letter
ValueCountFrequency (%)
e 23
11.7%
a 20
10.2%
i 20
10.2%
o 19
9.6%
n 16
 
8.1%
r 15
 
7.6%
m 12
 
6.1%
c 9
 
4.6%
t 8
 
4.1%
u 8
 
4.1%
Other values (12) 47
23.9%
Decimal Number
ValueCountFrequency (%)
0 8
25.8%
1 8
25.8%
2 5
16.1%
8 4
12.9%
3 3
 
9.7%
4 2
 
6.5%
5 1
 
3.2%
Other Punctuation
ValueCountFrequency (%)
. 6
60.0%
& 2
 
20.0%
1
 
10.0%
/ 1
 
10.0%
Close Punctuation
ValueCountFrequency (%)
) 80
98.8%
1
 
1.2%
Space Separator
ValueCountFrequency (%)
218
100.0%
Open Punctuation
ValueCountFrequency (%)
( 81
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5545
87.6%
Common 431
 
6.8%
Latin 354
 
5.6%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
123
 
2.2%
121
 
2.2%
115
 
2.1%
106
 
1.9%
105
 
1.9%
105
 
1.9%
97
 
1.7%
95
 
1.7%
94
 
1.7%
94
 
1.7%
Other values (553) 4490
81.0%
Latin
ValueCountFrequency (%)
e 23
 
6.5%
a 20
 
5.6%
i 20
 
5.6%
o 19
 
5.4%
O 17
 
4.8%
A 16
 
4.5%
n 16
 
4.5%
r 15
 
4.2%
m 12
 
3.4%
P 11
 
3.1%
Other values (36) 185
52.3%
Common
ValueCountFrequency (%)
218
50.6%
( 81
 
18.8%
) 80
 
18.6%
- 10
 
2.3%
0 8
 
1.9%
1 8
 
1.9%
. 6
 
1.4%
2 5
 
1.2%
8 4
 
0.9%
3 3
 
0.7%
Other values (6) 8
 
1.9%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5540
87.5%
ASCII 783
 
12.4%
None 7
 
0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
218
27.8%
( 81
 
10.3%
) 80
 
10.2%
e 23
 
2.9%
a 20
 
2.6%
i 20
 
2.6%
o 19
 
2.4%
O 17
 
2.2%
A 16
 
2.0%
n 16
 
2.0%
Other values (50) 273
34.9%
Hangul
ValueCountFrequency (%)
123
 
2.2%
121
 
2.2%
115
 
2.1%
106
 
1.9%
105
 
1.9%
105
 
1.9%
97
 
1.8%
95
 
1.7%
94
 
1.7%
94
 
1.7%
Other values (552) 4485
81.0%
None
ValueCountFrequency (%)
5
71.4%
1
 
14.3%
1
 
14.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct907
Distinct (%)91.6%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2024-01-10T05:06:35.641623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length3
Mean length3.0959596
Min length2

Characters and Unicode

Total characters3065
Distinct characters222
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique837 ?
Unique (%)84.5%

Sample

1st row박정연
2nd row지유진
3rd row김경태
4th row조혜진
5th row이충원
ValueCountFrequency (%)
한아름 6
 
0.6%
김청한 4
 
0.4%
최요한 3
 
0.3%
유동균 3
 
0.3%
김무경 3
 
0.3%
안종철 3
 
0.3%
최민순 3
 
0.3%
김성수 3
 
0.3%
최성선 3
 
0.3%
정기화 2
 
0.2%
Other values (902) 963
96.7%
2024-01-10T05:06:36.192438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
228
 
7.4%
143
 
4.7%
98
 
3.2%
77
 
2.5%
77
 
2.5%
67
 
2.2%
66
 
2.2%
65
 
2.1%
62
 
2.0%
58
 
1.9%
Other values (212) 2124
69.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2999
97.8%
Uppercase Letter 35
 
1.1%
Other Punctuation 17
 
0.6%
Lowercase Letter 8
 
0.3%
Space Separator 6
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
228
 
7.6%
143
 
4.8%
98
 
3.3%
77
 
2.6%
77
 
2.6%
67
 
2.2%
66
 
2.2%
65
 
2.2%
62
 
2.1%
58
 
1.9%
Other values (187) 2058
68.6%
Uppercase Letter
ValueCountFrequency (%)
O 6
17.1%
N 5
14.3%
H 3
8.6%
T 3
8.6%
A 2
 
5.7%
I 2
 
5.7%
E 2
 
5.7%
S 2
 
5.7%
L 2
 
5.7%
B 1
 
2.9%
Other values (7) 7
20.0%
Lowercase Letter
ValueCountFrequency (%)
u 2
25.0%
i 2
25.0%
h 1
12.5%
y 1
12.5%
e 1
12.5%
n 1
12.5%
Other Punctuation
ValueCountFrequency (%)
17
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2999
97.8%
Latin 43
 
1.4%
Common 23
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
228
 
7.6%
143
 
4.8%
98
 
3.3%
77
 
2.6%
77
 
2.6%
67
 
2.2%
66
 
2.2%
65
 
2.2%
62
 
2.1%
58
 
1.9%
Other values (187) 2058
68.6%
Latin
ValueCountFrequency (%)
O 6
14.0%
N 5
 
11.6%
H 3
 
7.0%
T 3
 
7.0%
u 2
 
4.7%
i 2
 
4.7%
A 2
 
4.7%
I 2
 
4.7%
E 2
 
4.7%
S 2
 
4.7%
Other values (13) 14
32.6%
Common
ValueCountFrequency (%)
17
73.9%
6
 
26.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2999
97.8%
ASCII 49
 
1.6%
None 17
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
228
 
7.6%
143
 
4.8%
98
 
3.3%
77
 
2.6%
77
 
2.6%
67
 
2.2%
66
 
2.2%
65
 
2.2%
62
 
2.1%
58
 
1.9%
Other values (187) 2058
68.6%
None
ValueCountFrequency (%)
17
100.0%
ASCII
ValueCountFrequency (%)
O 6
 
12.2%
6
 
12.2%
N 5
 
10.2%
H 3
 
6.1%
T 3
 
6.1%
u 2
 
4.1%
i 2
 
4.1%
A 2
 
4.1%
I 2
 
4.1%
E 2
 
4.1%
Other values (14) 16
32.7%

전화번호
Text

MISSING 

Distinct496
Distinct (%)92.5%
Missing454
Missing (%)45.9%
Memory size7.9 KiB
2024-01-10T05:06:36.503079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.992537
Min length9

Characters and Unicode

Total characters6428
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique463 ?
Unique (%)86.4%

Sample

1st row041-934-1111
2nd row041-935-7090
3rd row041-931-9800
4th row042-538-0900
5th row041-934-2259
ValueCountFrequency (%)
041-935-1711 6
 
1.1%
041-931-1100 3
 
0.6%
041-935-1701 3
 
0.6%
041-933-7471 3
 
0.6%
041-932-2806 2
 
0.4%
041-931-5200 2
 
0.4%
041-933-7749 2
 
0.4%
041-934-7654 2
 
0.4%
041-934-4005 2
 
0.4%
041-936-2929 2
 
0.4%
Other values (486) 509
95.0%
2024-01-10T05:06:37.014893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1071
16.7%
1 917
14.3%
0 824
12.8%
4 820
12.8%
3 776
12.1%
9 707
11.0%
2 336
 
5.2%
5 273
 
4.2%
6 248
 
3.9%
7 241
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5357
83.3%
Dash Punctuation 1071
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 917
17.1%
0 824
15.4%
4 820
15.3%
3 776
14.5%
9 707
13.2%
2 336
 
6.3%
5 273
 
5.1%
6 248
 
4.6%
7 241
 
4.5%
8 215
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 1071
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6428
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1071
16.7%
1 917
14.3%
0 824
12.8%
4 820
12.8%
3 776
12.1%
9 707
11.0%
2 336
 
5.2%
5 273
 
4.2%
6 248
 
3.9%
7 241
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6428
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1071
16.7%
1 917
14.3%
0 824
12.8%
4 820
12.8%
3 776
12.1%
9 707
11.0%
2 336
 
5.2%
5 273
 
4.2%
6 248
 
3.9%
7 241
 
3.7%

소재지우편번호
Text

MISSING 

Distinct175
Distinct (%)21.0%
Missing156
Missing (%)15.8%
Memory size7.9 KiB
2024-01-10T05:06:37.334309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.7410072
Min length5

Characters and Unicode

Total characters4788
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)5.3%

Sample

1st row33430
2nd row33486
3rd row33507
4th row33477
5th row33496
ValueCountFrequency (%)
355-943 77
 
9.2%
33488 49
 
5.9%
33491 33
 
4.0%
33489 32
 
3.8%
33430 17
 
2.0%
355-851 17
 
2.0%
33490 17
 
2.0%
355-931 16
 
1.9%
33508 15
 
1.8%
355-938 13
 
1.6%
Other values (165) 548
65.7%
2024-01-10T05:06:37.831148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 1636
34.2%
5 790
16.5%
4 688
14.4%
9 353
 
7.4%
8 318
 
6.6%
- 309
 
6.5%
1 202
 
4.2%
0 178
 
3.7%
2 133
 
2.8%
6 92
 
1.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4479
93.5%
Dash Punctuation 309
 
6.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 1636
36.5%
5 790
17.6%
4 688
15.4%
9 353
 
7.9%
8 318
 
7.1%
1 202
 
4.5%
0 178
 
4.0%
2 133
 
3.0%
6 92
 
2.1%
7 89
 
2.0%
Dash Punctuation
ValueCountFrequency (%)
- 309
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4788
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 1636
34.2%
5 790
16.5%
4 688
14.4%
9 353
 
7.4%
8 318
 
6.6%
- 309
 
6.5%
1 202
 
4.2%
0 178
 
3.7%
2 133
 
2.8%
6 92
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4788
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 1636
34.2%
5 790
16.5%
4 688
14.4%
9 353
 
7.4%
8 318
 
6.6%
- 309
 
6.5%
1 202
 
4.2%
0 178
 
3.7%
2 133
 
2.8%
6 92
 
1.9%
Distinct902
Distinct (%)91.1%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2024-01-10T05:06:38.216501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length53
Mean length25.594949
Min length15

Characters and Unicode

Total characters25339
Distinct characters327
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique839 ?
Unique (%)84.7%

Sample

1st row충청남도 보령시 한내로터리길 170, 108동 101호 (동대동, e편한세상 보령)
2nd row충청남도 보령시 성주면 성주산로 673-47, 개화허브랜드
3rd row충청남도 보령시 웅천읍 이청1길 51
4th row충청남도 보령시 주공로 33, 124동 207호 (동대동, 동대주공 1차, 2차, 3차 아파트)
5th row충청남도 보령시 남포면 충서로 1705
ValueCountFrequency (%)
충청남도 990
 
18.1%
보령시 985
 
18.0%
신흑동 214
 
3.9%
대천동 111
 
2.0%
웅천읍 82
 
1.5%
동대동 77
 
1.4%
남포면 71
 
1.3%
고잠2길 47
 
0.9%
천북면 42
 
0.8%
주교면 41
 
0.7%
Other values (1190) 2815
51.4%
2024-01-10T05:06:38.784055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4492
 
17.7%
1112
 
4.4%
1064
 
4.2%
1052
 
4.2%
1037
 
4.1%
1026
 
4.0%
1016
 
4.0%
1013
 
4.0%
1 930
 
3.7%
894
 
3.5%
Other values (317) 11703
46.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14926
58.9%
Space Separator 4492
 
17.7%
Decimal Number 4069
 
16.1%
Close Punctuation 599
 
2.4%
Open Punctuation 599
 
2.4%
Other Punctuation 340
 
1.3%
Dash Punctuation 296
 
1.2%
Lowercase Letter 10
 
< 0.1%
Uppercase Letter 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1112
 
7.5%
1064
 
7.1%
1052
 
7.0%
1037
 
6.9%
1026
 
6.9%
1016
 
6.8%
1013
 
6.8%
894
 
6.0%
555
 
3.7%
436
 
2.9%
Other values (290) 5721
38.3%
Decimal Number
ValueCountFrequency (%)
1 930
22.9%
2 602
14.8%
3 448
11.0%
0 438
10.8%
4 350
 
8.6%
6 318
 
7.8%
5 285
 
7.0%
7 265
 
6.5%
9 225
 
5.5%
8 208
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
B 2
25.0%
D 1
12.5%
A 1
12.5%
C 1
12.5%
P 1
12.5%
R 1
12.5%
F 1
12.5%
Lowercase Letter
ValueCountFrequency (%)
e 4
40.0%
k 2
20.0%
s 2
20.0%
y 2
20.0%
Other Punctuation
ValueCountFrequency (%)
339
99.7%
@ 1
 
0.3%
Space Separator
ValueCountFrequency (%)
4492
100.0%
Close Punctuation
ValueCountFrequency (%)
) 599
100.0%
Open Punctuation
ValueCountFrequency (%)
( 599
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 296
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14926
58.9%
Common 10395
41.0%
Latin 18
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1112
 
7.5%
1064
 
7.1%
1052
 
7.0%
1037
 
6.9%
1026
 
6.9%
1016
 
6.8%
1013
 
6.8%
894
 
6.0%
555
 
3.7%
436
 
2.9%
Other values (290) 5721
38.3%
Common
ValueCountFrequency (%)
4492
43.2%
1 930
 
8.9%
2 602
 
5.8%
) 599
 
5.8%
( 599
 
5.8%
3 448
 
4.3%
0 438
 
4.2%
4 350
 
3.4%
339
 
3.3%
6 318
 
3.1%
Other values (6) 1280
 
12.3%
Latin
ValueCountFrequency (%)
e 4
22.2%
B 2
11.1%
k 2
11.1%
s 2
11.1%
y 2
11.1%
D 1
 
5.6%
A 1
 
5.6%
C 1
 
5.6%
P 1
 
5.6%
R 1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14926
58.9%
ASCII 10074
39.8%
None 339
 
1.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4492
44.6%
1 930
 
9.2%
2 602
 
6.0%
) 599
 
5.9%
( 599
 
5.9%
3 448
 
4.4%
0 438
 
4.3%
4 350
 
3.5%
6 318
 
3.2%
- 296
 
2.9%
Other values (16) 1002
 
9.9%
Hangul
ValueCountFrequency (%)
1112
 
7.5%
1064
 
7.1%
1052
 
7.0%
1037
 
6.9%
1026
 
6.9%
1016
 
6.8%
1013
 
6.8%
894
 
6.0%
555
 
3.7%
436
 
2.9%
Other values (290) 5721
38.3%
None
ValueCountFrequency (%)
339
100.0%
Distinct727
Distinct (%)73.5%
Missing1
Missing (%)0.1%
Memory size7.9 KiB
2024-01-10T05:06:39.033577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length101
Median length39
Mean length16.671385
Min length1

Characters and Unicode

Total characters16488
Distinct characters234
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique686 ?
Unique (%)69.4%

Sample

1st rowsmartstore.naver.com/gamseongjeombbang
2nd rowsmartstore.naver.com/jinihouse019
3rd row-
4th row코드네임
5th row네이버 스마트스토어
ValueCountFrequency (%)
135
 
12.8%
www.auction.co.kr 25
 
2.4%
http://mall.epost.go.kr 22
 
2.1%
옥션 17
 
1.6%
www.gmarket.co.kr 13
 
1.2%
www.11st.co.kr 12
 
1.1%
www.naver.com 11
 
1.0%
11번가 10
 
1.0%
g마켓 10
 
1.0%
http://sell.storefarm.naver.com 8
 
0.8%
Other values (729) 789
75.0%
2024-01-10T05:06:39.474207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 1825
 
11.1%
w 1618
 
9.8%
o 1386
 
8.4%
c 877
 
5.3%
r 863
 
5.2%
a 830
 
5.0%
m 827
 
5.0%
t 806
 
4.9%
e 773
 
4.7%
n 658
 
4.0%
Other values (224) 6025
36.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 12715
77.1%
Other Punctuation 2568
 
15.6%
Other Letter 507
 
3.1%
Decimal Number 414
 
2.5%
Dash Punctuation 163
 
1.0%
Space Separator 75
 
0.5%
Uppercase Letter 35
 
0.2%
Connector Punctuation 6
 
< 0.1%
Close Punctuation 3
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
4.3%
21
 
4.1%
19
 
3.7%
18
 
3.6%
15
 
3.0%
13
 
2.6%
13
 
2.6%
13
 
2.6%
11
 
2.2%
11
 
2.2%
Other values (160) 351
69.2%
Lowercase Letter
ValueCountFrequency (%)
w 1618
12.7%
o 1386
 
10.9%
c 877
 
6.9%
r 863
 
6.8%
a 830
 
6.5%
m 827
 
6.5%
t 806
 
6.3%
e 773
 
6.1%
n 658
 
5.2%
k 509
 
4.0%
Other values (16) 3568
28.1%
Uppercase Letter
ValueCountFrequency (%)
G 9
25.7%
N 4
11.4%
W 3
 
8.6%
B 3
 
8.6%
E 2
 
5.7%
L 2
 
5.7%
I 2
 
5.7%
V 1
 
2.9%
T 1
 
2.9%
D 1
 
2.9%
Other values (7) 7
20.0%
Decimal Number
ValueCountFrequency (%)
1 127
30.7%
2 55
13.3%
0 53
12.8%
3 32
 
7.7%
5 30
 
7.2%
9 28
 
6.8%
4 24
 
5.8%
7 23
 
5.6%
8 22
 
5.3%
6 20
 
4.8%
Other Punctuation
ValueCountFrequency (%)
. 1825
71.1%
/ 539
 
21.0%
: 199
 
7.7%
@ 3
 
0.1%
! 1
 
< 0.1%
# 1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 163
100.0%
Space Separator
ValueCountFrequency (%)
75
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 12750
77.3%
Common 3231
 
19.6%
Hangul 507
 
3.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
4.3%
21
 
4.1%
19
 
3.7%
18
 
3.6%
15
 
3.0%
13
 
2.6%
13
 
2.6%
13
 
2.6%
11
 
2.2%
11
 
2.2%
Other values (160) 351
69.2%
Latin
ValueCountFrequency (%)
w 1618
12.7%
o 1386
 
10.9%
c 877
 
6.9%
r 863
 
6.8%
a 830
 
6.5%
m 827
 
6.5%
t 806
 
6.3%
e 773
 
6.1%
n 658
 
5.2%
k 509
 
4.0%
Other values (33) 3603
28.3%
Common
ValueCountFrequency (%)
. 1825
56.5%
/ 539
 
16.7%
: 199
 
6.2%
- 163
 
5.0%
1 127
 
3.9%
75
 
2.3%
2 55
 
1.7%
0 53
 
1.6%
3 32
 
1.0%
5 30
 
0.9%
Other values (11) 133
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15981
96.9%
Hangul 507
 
3.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 1825
 
11.4%
w 1618
 
10.1%
o 1386
 
8.7%
c 877
 
5.5%
r 863
 
5.4%
a 830
 
5.2%
m 827
 
5.2%
t 806
 
5.0%
e 773
 
4.8%
n 658
 
4.1%
Other values (54) 5518
34.5%
Hangul
ValueCountFrequency (%)
22
 
4.3%
21
 
4.1%
19
 
3.7%
18
 
3.6%
15
 
3.0%
13
 
2.6%
13
 
2.6%
13
 
2.6%
11
 
2.2%
11
 
2.2%
Other values (160) 351
69.2%
Distinct222
Distinct (%)22.4%
Missing1
Missing (%)0.1%
Memory size7.9 KiB
2024-01-10T05:06:39.704964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length76
Mean length4.9615774
Min length1

Characters and Unicode

Total characters4907
Distinct characters278
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique180 ?
Unique (%)18.2%

Sample

1st row주방용품 유아용품
2nd row악세사리 건어물
3rd row3d프린터 대리 출력
4th row의류/패션/잡화/뷰티
5th row농산물
ValueCountFrequency (%)
기타 254
22.0%
건강/식품 202
17.5%
숙박업 73
 
6.3%
의류/패션/잡화/뷰티 71
 
6.1%
레져/여행/공연 45
 
3.9%
수산물 38
 
3.3%
펜션 29
 
2.5%
종합몰 28
 
2.4%
숙박 25
 
2.2%
농산물 24
 
2.1%
Other values (234) 367
31.7%
2024-01-10T05:06:40.541042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 573
 
11.7%
274
 
5.6%
268
 
5.5%
257
 
5.2%
220
 
4.5%
212
 
4.3%
211
 
4.3%
204
 
4.2%
114
 
2.3%
109
 
2.2%
Other values (268) 2465
50.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4074
83.0%
Other Punctuation 575
 
11.7%
Space Separator 211
 
4.3%
Close Punctuation 19
 
0.4%
Open Punctuation 19
 
0.4%
Lowercase Letter 5
 
0.1%
Uppercase Letter 3
 
0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
274
 
6.7%
268
 
6.6%
257
 
6.3%
220
 
5.4%
212
 
5.2%
204
 
5.0%
114
 
2.8%
109
 
2.7%
106
 
2.6%
102
 
2.5%
Other values (255) 2208
54.2%
Lowercase Letter
ValueCountFrequency (%)
c 2
40.0%
d 1
20.0%
v 1
20.0%
t 1
20.0%
Uppercase Letter
ValueCountFrequency (%)
D 1
33.3%
E 1
33.3%
L 1
33.3%
Other Punctuation
ValueCountFrequency (%)
/ 573
99.7%
. 2
 
0.3%
Space Separator
ValueCountFrequency (%)
211
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4074
83.0%
Common 825
 
16.8%
Latin 8
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
274
 
6.7%
268
 
6.6%
257
 
6.3%
220
 
5.4%
212
 
5.2%
204
 
5.0%
114
 
2.8%
109
 
2.7%
106
 
2.6%
102
 
2.5%
Other values (255) 2208
54.2%
Latin
ValueCountFrequency (%)
c 2
25.0%
d 1
12.5%
v 1
12.5%
t 1
12.5%
D 1
12.5%
E 1
12.5%
L 1
12.5%
Common
ValueCountFrequency (%)
/ 573
69.5%
211
 
25.6%
) 19
 
2.3%
( 19
 
2.3%
. 2
 
0.2%
3 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4074
83.0%
ASCII 833
 
17.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 573
68.8%
211
 
25.3%
) 19
 
2.3%
( 19
 
2.3%
c 2
 
0.2%
. 2
 
0.2%
3 1
 
0.1%
d 1
 
0.1%
v 1
 
0.1%
t 1
 
0.1%
Other values (3) 3
 
0.4%
Hangul
ValueCountFrequency (%)
274
 
6.7%
268
 
6.6%
257
 
6.3%
220
 
5.4%
212
 
5.2%
204
 
5.0%
114
 
2.8%
109
 
2.7%
106
 
2.6%
102
 
2.5%
Other values (255) 2208
54.2%

Unnamed: 7
Text

MISSING 

Distinct194
Distinct (%)66.9%
Missing700
Missing (%)70.7%
Memory size7.9 KiB
2024-01-10T05:06:40.913002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length17
Mean length5.7206897
Min length1

Characters and Unicode

Total characters1659
Distinct characters253
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique167 ?
Unique (%)57.6%

Sample

1st row농산물
2nd row숙박
3rd row가공식품
4th row
5th row달걀
ValueCountFrequency (%)
조미김 28
 
6.7%
24
 
5.7%
의류 18
 
4.3%
수산물 17
 
4.0%
맛김 16
 
3.8%
농산물 12
 
2.9%
건어물 11
 
2.6%
10
 
2.4%
숙박업 7
 
1.7%
악세사리 6
 
1.4%
Other values (216) 271
64.5%
2024-01-10T05:06:41.529258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
194
 
11.7%
87
 
5.2%
79
 
4.8%
61
 
3.7%
) 50
 
3.0%
( 49
 
3.0%
44
 
2.7%
43
 
2.6%
41
 
2.5%
38
 
2.3%
Other values (243) 973
58.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1331
80.2%
Space Separator 194
 
11.7%
Close Punctuation 50
 
3.0%
Open Punctuation 49
 
3.0%
Other Punctuation 22
 
1.3%
Uppercase Letter 8
 
0.5%
Decimal Number 3
 
0.2%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
87
 
6.5%
79
 
5.9%
61
 
4.6%
44
 
3.3%
43
 
3.2%
41
 
3.1%
38
 
2.9%
38
 
2.9%
30
 
2.3%
26
 
2.0%
Other values (228) 844
63.4%
Uppercase Letter
ValueCountFrequency (%)
D 4
50.0%
C 1
 
12.5%
V 1
 
12.5%
P 1
 
12.5%
L 1
 
12.5%
Other Punctuation
ValueCountFrequency (%)
/ 13
59.1%
. 8
36.4%
? 1
 
4.5%
Decimal Number
ValueCountFrequency (%)
3 2
66.7%
1 1
33.3%
Lowercase Letter
ValueCountFrequency (%)
p 1
50.0%
m 1
50.0%
Space Separator
ValueCountFrequency (%)
194
100.0%
Close Punctuation
ValueCountFrequency (%)
) 50
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1331
80.2%
Common 318
 
19.2%
Latin 10
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
87
 
6.5%
79
 
5.9%
61
 
4.6%
44
 
3.3%
43
 
3.2%
41
 
3.1%
38
 
2.9%
38
 
2.9%
30
 
2.3%
26
 
2.0%
Other values (228) 844
63.4%
Common
ValueCountFrequency (%)
194
61.0%
) 50
 
15.7%
( 49
 
15.4%
/ 13
 
4.1%
. 8
 
2.5%
3 2
 
0.6%
1 1
 
0.3%
? 1
 
0.3%
Latin
ValueCountFrequency (%)
D 4
40.0%
p 1
 
10.0%
m 1
 
10.0%
C 1
 
10.0%
V 1
 
10.0%
P 1
 
10.0%
L 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1331
80.2%
ASCII 328
 
19.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
194
59.1%
) 50
 
15.2%
( 49
 
14.9%
/ 13
 
4.0%
. 8
 
2.4%
D 4
 
1.2%
3 2
 
0.6%
1 1
 
0.3%
? 1
 
0.3%
p 1
 
0.3%
Other values (5) 5
 
1.5%
Hangul
ValueCountFrequency (%)
87
 
6.5%
79
 
5.9%
61
 
4.6%
44
 
3.3%
43
 
3.2%
41
 
3.1%
38
 
2.9%
38
 
2.9%
30
 
2.3%
26
 
2.0%
Other values (228) 844
63.4%

Missing values

2024-01-10T05:06:34.014062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:06:34.198910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T05:06:34.387319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

법인또는상호대표자명전화번호소재지우편번호소재지주소도메인명취급품목세부사항Unnamed: 7
0감성점빵박정연<NA>33430충청남도 보령시 한내로터리길 170, 108동 101호 (동대동, e편한세상 보령)smartstore.naver.com/gamseongjeombbang주방용품 유아용품<NA>
1지니하우스지유진<NA>33486충청남도 보령시 성주면 성주산로 673-47, 개화허브랜드smartstore.naver.com/jinihouse019악세사리 건어물<NA>
2삼디라이프김경태<NA>33507충청남도 보령시 웅천읍 이청1길 51-3d프린터 대리 출력<NA>
3코드네임조혜진<NA>33477충청남도 보령시 주공로 33, 124동 207호 (동대동, 동대주공 1차, 2차, 3차 아파트)코드네임의류/패션/잡화/뷰티<NA>
4포도나라이충원<NA>33496충청남도 보령시 남포면 충서로 1705네이버 스마트스토어농산물<NA>
5참조은수산유서준<NA>33502충청남도 보령시 남포면 용두욕장길 48-8-수산물<NA>
6컴바인 드론고재범<NA>33472충청남도 보령시 대해로 29-11 (궁촌동)worlddrone.1.kr<NA>
7대유맛김김기선<NA>33454충청남도 보령시 해태로 3 (대천동)dykim.rubi.co.kr펜션<NA>
8관촌수필펜션이채원<NA>33502충청남도 보령시 남포면 열린바다로 30관촌수필펜션숙박<NA>
9대천쥬얼리펜션오지순<NA>33488충청남도 보령시 고잠2길 26-14 (신흑동)daecheonj.com건어물<NA>
법인또는상호대표자명전화번호소재지우편번호소재지주소도메인명취급품목세부사항Unnamed: 7
980보령냉열기최종육041-931-3700<NA>충청남도 보령시 보령북로 110 (대천동)www.coonaircin.co.kr가전냉열기
981동이농산영농조합법인백이호041-933-7471<NA>충청남도 보령시 남포면 보령남로 410-8www.dongyi.co.kr건강/식품건강식품
982고내미전통장이계화041-932-4993<NA>충청남도 보령시 주포면 고남길 77www.gonemi.com건강/식품조미식품
983보령광천수산영어조합법인전성기041-934-7667355-090충청남도 보령시 장벌길 37 (남곡동)www.epost.go.kr건강/식품젓갈류
984청라은행한과공장이순희041-934-9674<NA>충청남도 보령시 청라면 원자울길 40-2www.rda.go.kr건강/식품한과
985대천김(주)최민순041-935-859533491충청남도 보령시 대해로 425-9 (요암동)www.15889293.com건강/식품맛김 수산물
986보령식품김재범041-933-1770<NA>충청남도 보령시 오천면 가숭구지길 77www.boryeongfood.co.kr건강/식품액젓
987동이식품장동현041-936-2425<NA>충청남도 보령시 주포면 배재길 46www.gooykim.com건강/식품맛김 수산물
988현대수산맛김유광호041-934-1415355-923충청남도 보령시 주교면 은포길 340-14www.hyundaekim.com건강/식품해태 건어물 김
989중앙맛김김희환041-931-7995<NA>충청남도 보령시 남포면 대야실2길 26www.joongangkim.com건강/식품건어물 고추 마늘 김