Overview

Dataset statistics

Number of variables15
Number of observations10000
Missing cells2857
Missing cells (%)1.9%
Duplicate rows56
Duplicate rows (%)0.6%
Total size in memory1.2 MiB
Average record size in memory128.0 B

Variable types

Categorical4
Text8
Unsupported3

Dataset

Description전주시 내 업종별 음식점 언어별(한.영.일.중어) 데이터를 현장 수집하여 다양한 목적으로 활용을 위한 목적 데이터로언어구분, 업소명, 업소위치, 전화번호, 지번주소, 도로명주소등의 항목을 제공합니다.
Author전북특별자치도 전주시
URLhttps://www.data.go.kr/data/15097768/fileData.do

Alerts

Dataset has 56 (0.6%) duplicate rowsDuplicates
주차장 is highly overall correlated with 언어 and 2 other fieldsHigh correlation
언어 is highly overall correlated with 휴일 and 2 other fieldsHigh correlation
휴일 is highly overall correlated with 언어 and 2 other fieldsHigh correlation
와이파이 is highly overall correlated with 언어 and 2 other fieldsHigh correlation
홈페이지 has 1713 (17.1%) missing valuesMissing
반려견 동반입장 여부 has 1144 (11.4%) missing valuesMissing
시작영업시간 is an unsupported type, check if it needs cleaning or further analysisUnsupported
종료영업시간 is an unsupported type, check if it needs cleaning or further analysisUnsupported
가격 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-14 18:37:38.062636
Analysis finished2024-03-14 18:37:42.975366
Duration4.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

언어
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
KO
2540 
CN
2497 
EN
2494 
JP
2469 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowJP
2nd rowEN
3rd rowCN
4th rowCN
5th rowEN

Common Values

ValueCountFrequency (%)
KO 2540
25.4%
CN 2497
25.0%
EN 2494
24.9%
JP 2469
24.7%

Length

2024-03-15T03:37:43.175805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T03:37:43.498143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ko 2540
25.4%
cn 2497
25.0%
en 2494
24.9%
jp 2469
24.7%
Distinct2829
Distinct (%)28.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T03:37:44.860221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length100
Median length59
Mean length11.9637
Min length1

Characters and Unicode

Total characters119637
Distinct characters1488
Distinct categories11 ?
Distinct scripts6 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique573 ?
Unique (%)5.7%

Sample

1st rowジャ?ジャ?一番地
2nd rowDon Donjeong Branch
3rd row星巴克 全州西新店
4th row名牌海?面的松川店
5th rowUga Yangpyeong Haejangguk Hyoja Branch
ValueCountFrequency (%)
branch 871
 
5.0%
jeonju 531
 
3.0%
coffee 193
 
1.1%
hyoja 143
 
0.8%
songcheon 141
 
0.8%
chicken 132
 
0.8%
city 131
 
0.8%
cafe 113
 
0.6%
103
 
0.6%
new 100
 
0.6%
Other values (3246) 14966
85.9%
2024-03-15T03:37:46.630839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7427
 
6.2%
? 6321
 
5.3%
n 5116
 
4.3%
e 5053
 
4.2%
a 4944
 
4.1%
o 4662
 
3.9%
i 2752
 
2.3%
r 2354
 
2.0%
2316
 
1.9%
_ 2289
 
1.9%
Other values (1478) 76403
63.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 48788
40.8%
Lowercase Letter 44669
37.3%
Uppercase Letter 8464
 
7.1%
Space Separator 7427
 
6.2%
Other Punctuation 6740
 
5.6%
Connector Punctuation 2289
 
1.9%
Decimal Number 1058
 
0.9%
Dash Punctuation 146
 
0.1%
Open Punctuation 26
 
< 0.1%
Close Punctuation 26
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2316
 
4.7%
1286
 
2.6%
1271
 
2.6%
1205
 
2.5%
1028
 
2.1%
752
 
1.5%
661
 
1.4%
614
 
1.3%
427
 
0.9%
397
 
0.8%
Other values (1402) 38831
79.6%
Lowercase Letter
ValueCountFrequency (%)
n 5116
11.5%
e 5053
11.3%
a 4944
11.1%
o 4662
 
10.4%
i 2752
 
6.2%
r 2354
 
5.3%
h 2130
 
4.8%
c 2053
 
4.6%
u 1871
 
4.2%
s 1759
 
3.9%
Other values (16) 11975
26.8%
Uppercase Letter
ValueCountFrequency (%)
B 1243
14.7%
S 957
11.3%
C 883
10.4%
J 840
9.9%
H 610
 
7.2%
P 439
 
5.2%
M 402
 
4.7%
T 365
 
4.3%
G 348
 
4.1%
D 315
 
3.7%
Other values (16) 2062
24.4%
Decimal Number
ValueCountFrequency (%)
1 265
25.0%
0 140
13.2%
2 139
13.1%
9 117
11.1%
8 110
10.4%
5 102
 
9.6%
4 84
 
7.9%
6 45
 
4.3%
7 34
 
3.2%
3 22
 
2.1%
Other Punctuation
ValueCountFrequency (%)
? 6321
93.8%
' 119
 
1.8%
& 86
 
1.3%
· 86
 
1.3%
. 84
 
1.2%
% 20
 
0.3%
, 18
 
0.3%
6
 
0.1%
Space Separator
ValueCountFrequency (%)
7427
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2289
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 146
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Math Symbol
ValueCountFrequency (%)
+ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 53133
44.4%
Han 19163
 
16.0%
Common 17716
 
14.8%
Hangul 17222
 
14.4%
Katakana 11191
 
9.4%
Hiragana 1212
 
1.0%

Most frequent character per script

Han
ValueCountFrequency (%)
2316
 
12.1%
1205
 
6.3%
1028
 
5.4%
661
 
3.4%
377
 
2.0%
西 365
 
1.9%
337
 
1.8%
327
 
1.7%
319
 
1.7%
285
 
1.5%
Other values (739) 11943
62.3%
Hangul
ValueCountFrequency (%)
1271
 
7.4%
752
 
4.4%
614
 
3.6%
362
 
2.1%
353
 
2.0%
338
 
2.0%
329
 
1.9%
305
 
1.8%
284
 
1.6%
255
 
1.5%
Other values (516) 12359
71.8%
Katakana
ValueCountFrequency (%)
1286
 
11.5%
427
 
3.8%
397
 
3.5%
379
 
3.4%
366
 
3.3%
356
 
3.2%
350
 
3.1%
312
 
2.8%
271
 
2.4%
264
 
2.4%
Other values (67) 6783
60.6%
Hiragana
ValueCountFrequency (%)
199
 
16.4%
84
 
6.9%
79
 
6.5%
72
 
5.9%
50
 
4.1%
49
 
4.0%
45
 
3.7%
45
 
3.7%
40
 
3.3%
39
 
3.2%
Other values (50) 510
42.1%
Latin
ValueCountFrequency (%)
n 5116
 
9.6%
e 5053
 
9.5%
a 4944
 
9.3%
o 4662
 
8.8%
i 2752
 
5.2%
r 2354
 
4.4%
h 2130
 
4.0%
c 2053
 
3.9%
u 1871
 
3.5%
s 1759
 
3.3%
Other values (42) 20439
38.5%
Common
ValueCountFrequency (%)
7427
41.9%
? 6321
35.7%
_ 2289
 
12.9%
1 265
 
1.5%
- 146
 
0.8%
0 140
 
0.8%
2 139
 
0.8%
' 119
 
0.7%
9 117
 
0.7%
8 110
 
0.6%
Other values (14) 643
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 70757
59.1%
CJK 19163
 
16.0%
Hangul 17222
 
14.4%
Katakana 11191
 
9.4%
Hiragana 1212
 
1.0%
None 92
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7427
 
10.5%
? 6321
 
8.9%
n 5116
 
7.2%
e 5053
 
7.1%
a 4944
 
7.0%
o 4662
 
6.6%
i 2752
 
3.9%
r 2354
 
3.3%
_ 2289
 
3.2%
h 2130
 
3.0%
Other values (64) 27709
39.2%
CJK
ValueCountFrequency (%)
2316
 
12.1%
1205
 
6.3%
1028
 
5.4%
661
 
3.4%
377
 
2.0%
西 365
 
1.9%
337
 
1.8%
327
 
1.7%
319
 
1.7%
285
 
1.5%
Other values (739) 11943
62.3%
Katakana
ValueCountFrequency (%)
1286
 
11.5%
427
 
3.8%
397
 
3.5%
379
 
3.4%
366
 
3.3%
356
 
3.2%
350
 
3.1%
312
 
2.8%
271
 
2.4%
264
 
2.4%
Other values (67) 6783
60.6%
Hangul
ValueCountFrequency (%)
1271
 
7.4%
752
 
4.4%
614
 
3.6%
362
 
2.1%
353
 
2.0%
338
 
2.0%
329
 
1.9%
305
 
1.8%
284
 
1.6%
255
 
1.5%
Other values (516) 12359
71.8%
Hiragana
ValueCountFrequency (%)
199
 
16.4%
84
 
6.9%
79
 
6.5%
72
 
5.9%
50
 
4.1%
49
 
4.0%
45
 
3.7%
45
 
3.7%
40
 
3.3%
39
 
3.2%
Other values (50) 510
42.1%
None
ValueCountFrequency (%)
· 86
93.5%
6
 
6.5%
Distinct139
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T03:37:47.841461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length30
Mean length12.2819
Min length6

Characters and Unicode

Total characters122819
Distinct characters111
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row完山區 西新洞
2nd rowGosa-dong, Wansan-gu
3rd row完山區 西新洞
4th row德津區 松川1洞
5th rowHyoja-dong, Wansan-gu
ValueCountFrequency (%)
完山區 3538
16.7%
완산구 1802
 
8.5%
wansan-gu 1769
 
8.3%
德津區 1428
 
6.7%
孝子洞3街 831
 
3.9%
덕진구 738
 
3.5%
deokjin-gu 725
 
3.4%
孝子洞2街 666
 
3.1%
西新洞 544
 
2.6%
中華山洞2街 535
 
2.5%
Other values (120) 8664
40.8%
2024-03-15T03:37:49.529534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11240
 
9.2%
n 8821
 
7.2%
g 6533
 
5.3%
a 6183
 
5.0%
- 5639
 
4.6%
o 5432
 
4.4%
4966
 
4.0%
4966
 
4.0%
4174
 
3.4%
3538
 
2.9%
Other values (101) 61327
49.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 49634
40.4%
Lowercase Letter 42982
35.0%
Space Separator 11240
 
9.2%
Decimal Number 5842
 
4.8%
Dash Punctuation 5639
 
4.6%
Uppercase Letter 4988
 
4.1%
Other Punctuation 2494
 
2.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4966
 
10.0%
4966
 
10.0%
4174
 
8.4%
3538
 
7.1%
3033
 
6.1%
2540
 
5.1%
2540
 
5.1%
2140
 
4.3%
1830
 
3.7%
1830
 
3.7%
Other values (67) 18077
36.4%
Lowercase Letter
ValueCountFrequency (%)
n 8821
20.5%
g 6533
15.2%
a 6183
14.4%
o 5432
12.6%
u 2899
 
6.7%
s 2545
 
5.9%
d 2494
 
5.8%
j 1811
 
4.2%
e 1662
 
3.9%
i 1261
 
2.9%
Other values (7) 3341
 
7.8%
Uppercase Letter
ValueCountFrequency (%)
W 1769
35.5%
H 1028
20.6%
D 936
18.8%
S 600
 
12.0%
J 378
 
7.6%
U 106
 
2.1%
P 94
 
1.9%
G 72
 
1.4%
T 5
 
0.1%
Decimal Number
ValueCountFrequency (%)
2 2437
41.7%
3 1283
22.0%
1 1272
21.8%
5 551
 
9.4%
4 299
 
5.1%
Space Separator
ValueCountFrequency (%)
11240
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5639
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2494
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 47970
39.1%
Han 33382
27.2%
Common 25215
20.5%
Hangul 16252
 
13.2%

Most frequent character per script

Han
ValueCountFrequency (%)
4966
14.9%
4966
14.9%
4174
12.5%
3538
10.6%
3033
9.1%
1830
 
5.5%
1830
 
5.5%
1711
 
5.1%
1711
 
5.1%
739
 
2.2%
Other values (30) 4884
14.6%
Hangul
ValueCountFrequency (%)
2540
15.6%
2540
15.6%
2140
13.2%
1802
11.1%
938
 
5.8%
938
 
5.8%
892
 
5.5%
892
 
5.5%
770
 
4.7%
358
 
2.2%
Other values (27) 2442
15.0%
Latin
ValueCountFrequency (%)
n 8821
18.4%
g 6533
13.6%
a 6183
12.9%
o 5432
11.3%
u 2899
 
6.0%
s 2545
 
5.3%
d 2494
 
5.2%
j 1811
 
3.8%
W 1769
 
3.7%
e 1662
 
3.5%
Other values (16) 7821
16.3%
Common
ValueCountFrequency (%)
11240
44.6%
- 5639
22.4%
, 2494
 
9.9%
2 2437
 
9.7%
3 1283
 
5.1%
1 1272
 
5.0%
5 551
 
2.2%
4 299
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 73185
59.6%
CJK 33382
27.2%
Hangul 16252
 
13.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11240
15.4%
n 8821
12.1%
g 6533
 
8.9%
a 6183
 
8.4%
- 5639
 
7.7%
o 5432
 
7.4%
u 2899
 
4.0%
s 2545
 
3.5%
, 2494
 
3.4%
d 2494
 
3.4%
Other values (24) 18905
25.8%
CJK
ValueCountFrequency (%)
4966
14.9%
4966
14.9%
4174
12.5%
3538
10.6%
3033
9.1%
1830
 
5.5%
1830
 
5.5%
1711
 
5.1%
1711
 
5.1%
739
 
2.2%
Other values (30) 4884
14.6%
Hangul
ValueCountFrequency (%)
2540
15.6%
2540
15.6%
2140
13.2%
1802
11.1%
938
 
5.8%
938
 
5.8%
892
 
5.5%
892
 
5.5%
770
 
4.7%
358
 
2.2%
Other values (27) 2442
15.0%
Distinct2176
Distinct (%)21.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T03:37:51.379016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length112
Median length99
Mean length35.3858
Min length18

Characters and Unicode

Total characters353858
Distinct characters243
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique356 ?
Unique (%)3.6%

Sample

1st row全羅北道 全州市 完山區 西新洞 860-8
2nd row186, Gosa-dong, Wansan-gu, Jeonju-si, Jeollabuk-do
3rd row全羅北道 全州市 完山區 西新洞 964-6
4th row全羅北道 全州市 德津區 松川洞2街 468-5
5th row382, Hyoja-dong 1-ga, Wansan-gu, Jeonju-si, Jeollabuk-do
ValueCountFrequency (%)
全州市 4971
 
8.9%
全羅北道 4966
 
8.9%
完山區 3538
 
6.3%
전주시 2540
 
4.5%
전북특별자치도 2540
 
4.5%
jeollabuk-do 2494
 
4.5%
jeonju-si 2494
 
4.5%
완산구 1802
 
3.2%
wansan-gu 1769
 
3.2%
德津區 1428
 
2.6%
Other values (1081) 27371
49.0%
2024-03-15T03:37:53.826621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
46096
 
13.0%
- 21251
 
6.0%
1 15026
 
4.2%
o 14323
 
4.0%
n 11681
 
3.3%
, 10528
 
3.0%
a 10114
 
2.9%
9948
 
2.8%
2 8911
 
2.5%
g 8010
 
2.3%
Other values (233) 197970
55.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 113457
32.1%
Lowercase Letter 93201
26.3%
Decimal Number 57210
16.2%
Space Separator 46096
13.0%
Dash Punctuation 21251
 
6.0%
Other Punctuation 11993
 
3.4%
Uppercase Letter 10636
 
3.0%
Math Symbol 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9948
 
8.8%
5114
 
4.5%
4989
 
4.4%
4974
 
4.4%
4966
 
4.4%
4966
 
4.4%
4966
 
4.4%
4966
 
4.4%
4966
 
4.4%
4179
 
3.7%
Other values (171) 59423
52.4%
Lowercase Letter
ValueCountFrequency (%)
o 14323
15.4%
n 11681
12.5%
a 10114
10.9%
g 8010
8.6%
u 7973
8.6%
e 6769
7.3%
l 5551
 
6.0%
s 5353
 
5.7%
d 5214
 
5.6%
j 4311
 
4.6%
Other values (14) 13902
14.9%
Uppercase Letter
ValueCountFrequency (%)
J 5439
51.1%
W 1769
 
16.6%
D 974
 
9.2%
H 937
 
8.8%
S 669
 
6.3%
R 152
 
1.4%
B 136
 
1.3%
U 106
 
1.0%
G 106
 
1.0%
P 73
 
0.7%
Other values (11) 275
 
2.6%
Decimal Number
ValueCountFrequency (%)
1 15026
26.3%
2 8911
15.6%
3 7261
12.7%
5 4563
 
8.0%
6 4429
 
7.7%
7 4290
 
7.5%
4 4039
 
7.1%
9 2993
 
5.2%
8 2873
 
5.0%
0 2825
 
4.9%
Other Punctuation
ValueCountFrequency (%)
, 10528
87.8%
? 1425
 
11.9%
# 28
 
0.2%
. 12
 
0.1%
Space Separator
ValueCountFrequency (%)
46096
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21251
100.0%
Math Symbol
ValueCountFrequency (%)
~ 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 136564
38.6%
Latin 103837
29.3%
Han 69245
19.6%
Hangul 44212
 
12.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5114
 
11.6%
3483
 
7.9%
2571
 
5.8%
2561
 
5.8%
2541
 
5.7%
2540
 
5.7%
2540
 
5.7%
2540
 
5.7%
2540
 
5.7%
2540
 
5.7%
Other values (104) 15242
34.5%
Han
ValueCountFrequency (%)
9948
14.4%
4989
 
7.2%
4974
 
7.2%
4966
 
7.2%
4966
 
7.2%
4966
 
7.2%
4966
 
7.2%
4966
 
7.2%
4179
 
6.0%
3846
 
5.6%
Other values (57) 16479
23.8%
Latin
ValueCountFrequency (%)
o 14323
13.8%
n 11681
11.2%
a 10114
9.7%
g 8010
 
7.7%
u 7973
 
7.7%
e 6769
 
6.5%
l 5551
 
5.3%
J 5439
 
5.2%
s 5353
 
5.2%
d 5214
 
5.0%
Other values (35) 23410
22.5%
Common
ValueCountFrequency (%)
46096
33.8%
- 21251
15.6%
1 15026
 
11.0%
, 10528
 
7.7%
2 8911
 
6.5%
3 7261
 
5.3%
5 4563
 
3.3%
6 4429
 
3.2%
7 4290
 
3.1%
4 4039
 
3.0%
Other values (7) 10170
 
7.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 240401
67.9%
CJK 69245
 
19.6%
Hangul 44212
 
12.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
46096
19.2%
- 21251
 
8.8%
1 15026
 
6.3%
o 14323
 
6.0%
n 11681
 
4.9%
, 10528
 
4.4%
a 10114
 
4.2%
2 8911
 
3.7%
g 8010
 
3.3%
u 7973
 
3.3%
Other values (52) 86488
36.0%
CJK
ValueCountFrequency (%)
9948
14.4%
4989
 
7.2%
4974
 
7.2%
4966
 
7.2%
4966
 
7.2%
4966
 
7.2%
4966
 
7.2%
4966
 
7.2%
4179
 
6.0%
3846
 
5.6%
Other values (57) 16479
23.8%
Hangul
ValueCountFrequency (%)
5114
 
11.6%
3483
 
7.9%
2571
 
5.8%
2561
 
5.8%
2541
 
5.7%
2540
 
5.7%
2540
 
5.7%
2540
 
5.7%
2540
 
5.7%
2540
 
5.7%
Other values (104) 15242
34.5%
Distinct2171
Distinct (%)21.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T03:37:55.416482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length111
Median length90
Mean length31.6687
Min length17

Characters and Unicode

Total characters316687
Distinct characters392
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique351 ?
Unique (%)3.5%

Sample

1st row全羅北道 全州市 完山區 全?路 40
2nd row74-50, Jeonjugaeksa 4-gil, Wansan-gu, Jeonju-si, Jeollabuk-do
3rd row全羅北道 全州市 完山區 西新路 104
4th row全羅北道 全州市 德津區 松川中央路 234
5th row180, Geomapyeong-ro, Wansan-gu, Jeonju-si, Jeollabuk-do
ValueCountFrequency (%)
全州市 4971
 
9.0%
全羅北道 4966
 
9.0%
完山區 3548
 
6.5%
전주시 2540
 
4.6%
전북특별자치도 2540
 
4.6%
jeollabuk-do 2494
 
4.5%
jeonju-si 2494
 
4.5%
완산구 1808
 
3.3%
wansan-gu 1775
 
3.2%
德津區 1428
 
2.6%
Other values (1265) 26404
48.0%
2024-03-15T03:37:57.239398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45011
 
14.2%
o 13923
 
4.4%
- 12828
 
4.1%
, 11009
 
3.5%
n 10630
 
3.4%
10333
 
3.3%
1 9700
 
3.1%
u 8354
 
2.6%
a 8223
 
2.6%
e 8215
 
2.6%
Other values (382) 178461
56.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 108706
34.3%
Lowercase Letter 91157
28.8%
Space Separator 45011
14.2%
Decimal Number 33679
 
10.6%
Other Punctuation 14776
 
4.7%
Dash Punctuation 12828
 
4.1%
Uppercase Letter 10516
 
3.3%
Math Symbol 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10333
 
9.5%
5385
 
5.0%
5238
 
4.8%
5083
 
4.7%
4976
 
4.6%
4971
 
4.6%
4966
 
4.6%
4966
 
4.6%
4579
 
4.2%
3548
 
3.3%
Other values (320) 54661
50.3%
Lowercase Letter
ValueCountFrequency (%)
o 13923
15.3%
n 10630
11.7%
u 8354
9.2%
a 8223
9.0%
e 8215
9.0%
l 6862
7.5%
g 5677
 
6.2%
s 5628
 
6.2%
i 4856
 
5.3%
j 3775
 
4.1%
Other values (13) 15014
16.5%
Uppercase Letter
ValueCountFrequency (%)
J 5284
50.2%
W 1800
 
17.1%
D 859
 
8.2%
S 660
 
6.3%
H 449
 
4.3%
B 302
 
2.9%
G 235
 
2.2%
M 197
 
1.9%
R 168
 
1.6%
Y 127
 
1.2%
Other values (12) 435
 
4.1%
Decimal Number
ValueCountFrequency (%)
1 9700
28.8%
2 4981
14.8%
3 4149
12.3%
4 3149
 
9.4%
5 2503
 
7.4%
0 2113
 
6.3%
7 2091
 
6.2%
6 1863
 
5.5%
8 1627
 
4.8%
9 1503
 
4.5%
Other Punctuation
ValueCountFrequency (%)
, 11009
74.5%
? 3716
 
25.1%
# 40
 
0.3%
. 11
 
0.1%
Space Separator
ValueCountFrequency (%)
45011
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12828
100.0%
Math Symbol
ValueCountFrequency (%)
~ 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 106308
33.6%
Latin 101673
32.1%
Han 65205
20.6%
Hangul 43501
13.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5385
 
12.4%
2660
 
6.1%
2605
 
6.0%
2605
 
6.0%
2567
 
5.9%
2552
 
5.9%
2545
 
5.9%
2541
 
5.8%
2540
 
5.8%
2540
 
5.8%
Other values (185) 14961
34.4%
Han
ValueCountFrequency (%)
10333
15.8%
5238
 
8.0%
5083
 
7.8%
4976
 
7.6%
4971
 
7.6%
4966
 
7.6%
4966
 
7.6%
4579
 
7.0%
3548
 
5.4%
2755
 
4.2%
Other values (125) 13790
21.1%
Latin
ValueCountFrequency (%)
o 13923
13.7%
n 10630
10.5%
u 8354
 
8.2%
a 8223
 
8.1%
e 8215
 
8.1%
l 6862
 
6.7%
g 5677
 
5.6%
s 5628
 
5.5%
J 5284
 
5.2%
i 4856
 
4.8%
Other values (35) 24021
23.6%
Common
ValueCountFrequency (%)
45011
42.3%
- 12828
 
12.1%
, 11009
 
10.4%
1 9700
 
9.1%
2 4981
 
4.7%
3 4149
 
3.9%
? 3716
 
3.5%
4 3149
 
3.0%
5 2503
 
2.4%
0 2113
 
2.0%
Other values (7) 7149
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 207981
65.7%
CJK 65205
 
20.6%
Hangul 43501
 
13.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
45011
21.6%
o 13923
 
6.7%
- 12828
 
6.2%
, 11009
 
5.3%
n 10630
 
5.1%
1 9700
 
4.7%
u 8354
 
4.0%
a 8223
 
4.0%
e 8215
 
3.9%
l 6862
 
3.3%
Other values (52) 73226
35.2%
CJK
ValueCountFrequency (%)
10333
15.8%
5238
 
8.0%
5083
 
7.8%
4976
 
7.6%
4971
 
7.6%
4966
 
7.6%
4966
 
7.6%
4579
 
7.0%
3548
 
5.4%
2755
 
4.2%
Other values (125) 13790
21.1%
Hangul
ValueCountFrequency (%)
5385
 
12.4%
2660
 
6.1%
2605
 
6.0%
2605
 
6.0%
2567
 
5.9%
2552
 
5.9%
2545
 
5.9%
2541
 
5.8%
2540
 
5.8%
2540
 
5.8%
Other values (185) 14961
34.4%
Distinct830
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T03:37:58.148183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length12
Mean length12.6649
Min length1

Characters and Unicode

Total characters126649
Distinct characters21
Distinct categories5 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)0.2%

Sample

1st row063-255-2164
2nd row063-282-6551
3rd row1522-3232
4th row063-252-6663
5th row0507-1444-5950
ValueCountFrequency (%)
1522-3232 120
 
1.2%
063-275-0007 55
 
0.5%
063-255-2177 49
 
0.5%
063-271-1710 44
 
0.4%
063-224-0222 42
 
0.4%
0507-0090-0668 42
 
0.4%
063-223-6325 41
 
0.4%
063-214-8999 41
 
0.4%
063-222-3480 40
 
0.4%
063-254-1010 40
 
0.4%
Other values (822) 9671
95.0%
2024-03-15T03:37:59.545487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 20200
15.9%
0 19714
15.6%
2 16771
13.2%
3 13988
11.0%
6 11879
9.4%
7 9058
7.2%
5 8933
7.1%
1 7935
 
6.3%
8 6130
 
4.8%
9 5862
 
4.6%
Other values (11) 6179
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 106049
83.7%
Dash Punctuation 20200
 
15.9%
Other Punctuation 187
 
0.1%
Space Separator 185
 
0.1%
Other Letter 28
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 19714
18.6%
2 16771
15.8%
3 13988
13.2%
6 11879
11.2%
7 9058
8.5%
5 8933
8.4%
1 7935
7.5%
8 6130
 
5.8%
9 5862
 
5.5%
4 5779
 
5.4%
Other Letter
ValueCountFrequency (%)
8
28.6%
8
28.6%
6
21.4%
2
 
7.1%
2
 
7.1%
1
 
3.6%
1
 
3.6%
Other Punctuation
ValueCountFrequency (%)
, 185
98.9%
? 2
 
1.1%
Dash Punctuation
ValueCountFrequency (%)
- 20200
100.0%
Space Separator
ValueCountFrequency (%)
185
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 126621
> 99.9%
Hangul 16
 
< 0.1%
Han 10
 
< 0.1%
Hiragana 2
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 20200
16.0%
0 19714
15.6%
2 16771
13.2%
3 13988
11.0%
6 11879
9.4%
7 9058
7.2%
5 8933
7.1%
1 7935
 
6.3%
8 6130
 
4.8%
9 5862
 
4.6%
Other values (4) 6151
 
4.9%
Han
ValueCountFrequency (%)
6
60.0%
2
 
20.0%
1
 
10.0%
1
 
10.0%
Hangul
ValueCountFrequency (%)
8
50.0%
8
50.0%
Hiragana
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 126621
> 99.9%
Hangul 16
 
< 0.1%
CJK 10
 
< 0.1%
Hiragana 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 20200
16.0%
0 19714
15.6%
2 16771
13.2%
3 13988
11.0%
6 11879
9.4%
7 9058
7.2%
5 8933
7.1%
1 7935
 
6.3%
8 6130
 
4.8%
9 5862
 
4.6%
Other values (4) 6151
 
4.9%
Hangul
ValueCountFrequency (%)
8
50.0%
8
50.0%
CJK
ValueCountFrequency (%)
6
60.0%
2
 
20.0%
1
 
10.0%
1
 
10.0%
Hiragana
ValueCountFrequency (%)
2
100.0%

시작영업시간
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size156.2 KiB

종료영업시간
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size156.2 KiB

휴일
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
不可
4815 
불가
2467 
impossible
2422 
可能
 
151
가능
 
73

Length

Max length10
Median length2
Mean length3.9808
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row不可
2nd rowimpossible
3rd row不可
4th row不可
5th rowimpossible

Common Values

ValueCountFrequency (%)
不可 4815
48.1%
불가 2467
24.7%
impossible 2422
24.2%
可能 151
 
1.5%
가능 73
 
0.7%
Possible 72
 
0.7%

Length

2024-03-15T03:38:00.009342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T03:38:00.372901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
不可 4815
48.1%
불가 2467
24.7%
impossible 2422
24.2%
可能 151
 
1.5%
가능 73
 
0.7%
possible 72
 
0.7%

주차장
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
제공
2025 
offer
1983 
??
1964 
提供
1949 
未提供
1053 
Other values (2)
1026 

Length

Max length11
Median length2
Mean length3.2116
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row提供
2nd rowoffer
3rd row??
4th row未提供
5th rowoffer

Common Values

ValueCountFrequency (%)
제공 2025
20.2%
offer 1983
19.8%
?? 1964
19.6%
提供 1949
19.5%
未提供 1053
10.5%
미제공 515
 
5.1%
Not provide 511
 
5.1%

Length

2024-03-15T03:38:00.768780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T03:38:01.113617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제공 2025
19.3%
offer 1983
18.9%
1964
18.7%
提供 1949
18.5%
未提供 1053
10.0%
미제공 515
 
4.9%
not 511
 
4.9%
provide 511
 
4.9%

와이파이
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
可能
2811 
不可
2155 
가능
1480 
Possible
1440 
불가
1060 

Length

Max length10
Median length2
Mean length3.7072
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row不可
2nd rowPossible
3rd row可能
4th row不可
5th rowPossible

Common Values

ValueCountFrequency (%)
可能 2811
28.1%
不可 2155
21.6%
가능 1480
14.8%
Possible 1440
14.4%
불가 1060
 
10.6%
impossible 1054
 
10.5%

Length

2024-03-15T03:38:01.551183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T03:38:01.915169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
可能 2811
28.1%
不可 2155
21.6%
가능 1480
14.8%
possible 1440
14.4%
불가 1060
 
10.6%
impossible 1054
 
10.5%

홈페이지
Text

MISSING 

Distinct212
Distinct (%)2.6%
Missing1713
Missing (%)17.1%
Memory size156.2 KiB
2024-03-15T03:38:03.455024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length66
Median length36
Mean length3.2000724
Min length1

Characters and Unicode

Total characters26519
Distinct characters144
Distinct categories10 ?
Distinct scripts5 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)0.5%

Sample

1st row無し
2nd row
3rd row星期一
4th rowSunday
5th row
ValueCountFrequency (%)
1748
18.4%
없음 1746
18.4%
無し 1659
17.4%
sunday 465
 
4.9%
日曜日 462
 
4.9%
일요일 444
 
4.7%
星期天 420
 
4.4%
月曜日 186
 
2.0%
monday 174
 
1.8%
월요일 160
 
1.7%
Other values (161) 2046
21.5%
2024-03-15T03:38:05.334071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1748
 
6.6%
1746
 
6.6%
1746
 
6.6%
1659
 
6.3%
1659
 
6.3%
1340
 
5.1%
1283
 
4.8%
1223
 
4.6%
d 970
 
3.7%
a 899
 
3.4%
Other values (134) 12246
46.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17050
64.3%
Lowercase Letter 6089
 
23.0%
Space Separator 1223
 
4.6%
Uppercase Letter 950
 
3.6%
Decimal Number 628
 
2.4%
Other Punctuation 556
 
2.1%
Math Symbol 15
 
0.1%
Dash Punctuation 4
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1748
10.3%
1746
10.2%
1746
10.2%
1659
9.7%
1659
9.7%
1340
 
7.9%
1283
 
7.5%
808
 
4.7%
792
 
4.6%
761
 
4.5%
Other values (78) 3508
20.6%
Lowercase Letter
ValueCountFrequency (%)
d 970
15.9%
a 899
14.8%
y 853
14.0%
n 784
12.9%
u 602
9.9%
e 387
 
6.4%
o 355
 
5.8%
s 240
 
3.9%
r 225
 
3.7%
h 203
 
3.3%
Other values (11) 571
9.4%
Uppercase Letter
ValueCountFrequency (%)
S 510
53.7%
M 175
 
18.4%
T 146
 
15.4%
W 35
 
3.7%
F 21
 
2.2%
C 21
 
2.2%
E 10
 
1.1%
H 8
 
0.8%
B 6
 
0.6%
D 5
 
0.5%
Other values (4) 13
 
1.4%
Decimal Number
ValueCountFrequency (%)
3 177
28.2%
1 140
22.3%
4 121
19.3%
2 117
18.6%
0 44
 
7.0%
5 13
 
2.1%
9 9
 
1.4%
7 6
 
1.0%
6 1
 
0.2%
Other Punctuation
ValueCountFrequency (%)
, 483
86.9%
: 26
 
4.7%
? 26
 
4.7%
/ 9
 
1.6%
. 6
 
1.1%
' 4
 
0.7%
2
 
0.4%
Space Separator
ValueCountFrequency (%)
1223
100.0%
Math Symbol
ValueCountFrequency (%)
~ 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Open Punctuation
ValueCountFrequency (%)
2
100.0%
Close Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Han 8823
33.3%
Latin 7039
26.5%
Hangul 6568
24.8%
Common 2430
 
9.2%
Hiragana 1659
 
6.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1746
26.6%
1746
26.6%
1283
19.5%
792
12.1%
213
 
3.2%
162
 
2.5%
86
 
1.3%
76
 
1.2%
44
 
0.7%
40
 
0.6%
Other values (39) 380
 
5.8%
Han
ValueCountFrequency (%)
1748
19.8%
1659
18.8%
1340
15.2%
808
9.2%
761
8.6%
758
8.6%
447
 
5.1%
201
 
2.3%
195
 
2.2%
150
 
1.7%
Other values (28) 756
8.6%
Latin
ValueCountFrequency (%)
d 970
13.8%
a 899
12.8%
y 853
12.1%
n 784
11.1%
u 602
8.6%
S 510
7.2%
e 387
 
5.5%
o 355
 
5.0%
s 240
 
3.4%
r 225
 
3.2%
Other values (25) 1214
17.2%
Common
ValueCountFrequency (%)
1223
50.3%
, 483
 
19.9%
3 177
 
7.3%
1 140
 
5.8%
4 121
 
5.0%
2 117
 
4.8%
0 44
 
1.8%
: 26
 
1.1%
? 26
 
1.1%
~ 15
 
0.6%
Other values (11) 58
 
2.4%
Hiragana
ValueCountFrequency (%)
1659
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9463
35.7%
CJK 8823
33.3%
Hangul 6568
24.8%
Hiragana 1659
 
6.3%
None 6
 
< 0.1%

Most frequent character per block

CJK
ValueCountFrequency (%)
1748
19.8%
1659
18.8%
1340
15.2%
808
9.2%
761
8.6%
758
8.6%
447
 
5.1%
201
 
2.3%
195
 
2.2%
150
 
1.7%
Other values (28) 756
8.6%
Hangul
ValueCountFrequency (%)
1746
26.6%
1746
26.6%
1283
19.5%
792
12.1%
213
 
3.2%
162
 
2.5%
86
 
1.3%
76
 
1.2%
44
 
0.7%
40
 
0.6%
Other values (39) 380
 
5.8%
Hiragana
ValueCountFrequency (%)
1659
100.0%
ASCII
ValueCountFrequency (%)
1223
12.9%
d 970
10.3%
a 899
 
9.5%
y 853
 
9.0%
n 784
 
8.3%
u 602
 
6.4%
S 510
 
5.4%
, 483
 
5.1%
e 387
 
4.1%
o 355
 
3.8%
Other values (43) 2397
25.3%
None
ValueCountFrequency (%)
2
33.3%
2
33.3%
2
33.3%
Distinct368
Distinct (%)4.2%
Missing1144
Missing (%)11.4%
Memory size156.2 KiB
2024-03-15T03:38:06.244604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length96
Median length62
Mean length18.888663
Min length1

Characters and Unicode

Total characters167278
Distinct characters96
Distinct categories9 ?
Distinct scripts5 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)0.1%

Sample

1st row無し
2nd rowhttps://blog.naver.com/zoqtir/221556367827
3rd rowhttp://www.starbucks.co.kr/
4th row
5th rowhttp://woogayp.co.kr
ValueCountFrequency (%)
1150
 
13.0%
없음 1145
 
12.9%
無し 1132
 
12.8%
http://www.starbucks.co.kr 120
 
1.4%
http://www.starpizza.co.kr/html/index 80
 
0.9%
http://www.congsan.co.kr 78
 
0.9%
http://www.youngdabang.com 74
 
0.8%
http://www.megacoffee.me 72
 
0.8%
http://www.ediya.com 67
 
0.8%
http://www.yogerpresso.co.kr 59
 
0.7%
Other values (352) 4899
55.2%
2024-03-15T03:38:07.678357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 14778
 
8.8%
t 13916
 
8.3%
. 11431
 
6.8%
o 11421
 
6.8%
w 9614
 
5.7%
a 8962
 
5.4%
c 6953
 
4.2%
m 6755
 
4.0%
h 6528
 
3.9%
p 6404
 
3.8%
Other values (86) 70516
42.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 120486
72.0%
Other Punctuation 31584
 
18.9%
Decimal Number 6593
 
3.9%
Other Letter 6021
 
3.6%
Connector Punctuation 1556
 
0.9%
Uppercase Letter 615
 
0.4%
Math Symbol 225
 
0.1%
Dash Punctuation 178
 
0.1%
Space Separator 20
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1150
19.1%
1145
19.0%
1145
19.0%
1132
18.8%
1132
18.8%
38
 
0.6%
21
 
0.3%
21
 
0.3%
21
 
0.3%
21
 
0.3%
Other values (19) 195
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
t 13916
 
11.5%
o 11421
 
9.5%
w 9614
 
8.0%
a 8962
 
7.4%
c 6953
 
5.8%
m 6755
 
5.6%
h 6528
 
5.4%
p 6404
 
5.3%
n 6216
 
5.2%
r 6140
 
5.1%
Other values (16) 37577
31.2%
Uppercase Letter
ValueCountFrequency (%)
B 99
16.1%
S 75
12.2%
E 71
11.5%
C 68
11.1%
F 62
10.1%
D 37
 
6.0%
O 35
 
5.7%
I 35
 
5.7%
M 31
 
5.0%
V 25
 
4.1%
Other values (7) 77
12.5%
Other Punctuation
ValueCountFrequency (%)
/ 14778
46.8%
. 11431
36.2%
: 4901
 
15.5%
? 285
 
0.9%
% 108
 
0.3%
, 24
 
0.1%
& 23
 
0.1%
@ 22
 
0.1%
; 11
 
< 0.1%
# 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 1160
17.6%
0 905
13.7%
1 734
11.1%
5 622
9.4%
4 585
8.9%
6 571
8.7%
7 532
8.1%
3 531
8.1%
9 488
7.4%
8 465
7.1%
Connector Punctuation
ValueCountFrequency (%)
_ 1556
100.0%
Math Symbol
ValueCountFrequency (%)
= 225
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 178
100.0%
Space Separator
ValueCountFrequency (%)
20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 121101
72.4%
Common 40156
 
24.0%
Hangul 2607
 
1.6%
Han 2282
 
1.4%
Hiragana 1132
 
0.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 13916
 
11.5%
o 11421
 
9.4%
w 9614
 
7.9%
a 8962
 
7.4%
c 6953
 
5.7%
m 6755
 
5.6%
h 6528
 
5.4%
p 6404
 
5.3%
n 6216
 
5.1%
r 6140
 
5.1%
Other values (33) 38192
31.5%
Hangul
ValueCountFrequency (%)
1145
43.9%
1145
43.9%
38
 
1.5%
21
 
0.8%
21
 
0.8%
21
 
0.8%
21
 
0.8%
15
 
0.6%
15
 
0.6%
15
 
0.6%
Other values (16) 150
 
5.8%
Common
ValueCountFrequency (%)
/ 14778
36.8%
. 11431
28.5%
: 4901
 
12.2%
_ 1556
 
3.9%
2 1160
 
2.9%
0 905
 
2.3%
1 734
 
1.8%
5 622
 
1.5%
4 585
 
1.5%
6 571
 
1.4%
Other values (14) 2913
 
7.3%
Han
ValueCountFrequency (%)
1150
50.4%
1132
49.6%
Hiragana
ValueCountFrequency (%)
1132
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 161257
96.4%
Hangul 2607
 
1.6%
CJK 2282
 
1.4%
Hiragana 1132
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 14778
 
9.2%
t 13916
 
8.6%
. 11431
 
7.1%
o 11421
 
7.1%
w 9614
 
6.0%
a 8962
 
5.6%
c 6953
 
4.3%
m 6755
 
4.2%
h 6528
 
4.0%
p 6404
 
4.0%
Other values (57) 64495
40.0%
CJK
ValueCountFrequency (%)
1150
50.4%
1132
49.6%
Hangul
ValueCountFrequency (%)
1145
43.9%
1145
43.9%
38
 
1.5%
21
 
0.8%
21
 
0.8%
21
 
0.8%
21
 
0.8%
15
 
0.6%
15
 
0.6%
15
 
0.6%
Other values (16) 150
 
5.8%
Hiragana
ValueCountFrequency (%)
1132
100.0%

메뉴
Text

Distinct7537
Distinct (%)75.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T03:38:09.190508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length239
Median length85
Mean length11.22
Min length1

Characters and Unicode

Total characters112200
Distinct characters1597
Distinct categories13 ?
Distinct scripts6 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6510 ?
Unique (%)65.1%

Sample

1st rowもち米酢豚(大)
2nd rowSet Menu of Offal Hot Pot with Rice
3rd row?年?茶 Tall
4th row牡?炒?面
5th rowSoju/Beer/Unrefined Rice Wine
ValueCountFrequency (%)
238
 
1.3%
rice 216
 
1.2%
chicken 193
 
1.0%
latte 187
 
1.0%
tea 163
 
0.9%
and 148
 
0.8%
pork 141
 
0.8%
hot 131
 
0.7%
soup 122
 
0.7%
spicy 119
 
0.6%
Other values (7066) 16804
91.0%
2024-03-15T03:38:11.087471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8546
 
7.6%
e 6815
 
6.1%
? 5601
 
5.0%
a 4674
 
4.2%
i 3841
 
3.4%
o 3631
 
3.2%
r 3168
 
2.8%
t 2783
 
2.5%
n 2729
 
2.4%
l 2535
 
2.3%
Other values (1587) 67877
60.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 47390
42.2%
Other Letter 37048
33.0%
Uppercase Letter 9036
 
8.1%
Space Separator 8546
 
7.6%
Other Punctuation 6854
 
6.1%
Decimal Number 1084
 
1.0%
Open Punctuation 893
 
0.8%
Close Punctuation 892
 
0.8%
Dash Punctuation 233
 
0.2%
Math Symbol 214
 
0.2%
Other values (3) 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
735
 
2.0%
521
 
1.4%
494
 
1.3%
433
 
1.2%
427
 
1.2%
418
 
1.1%
402
 
1.1%
386
 
1.0%
382
 
1.0%
372
 
1.0%
Other values (1498) 32478
87.7%
Lowercase Letter
ValueCountFrequency (%)
e 6815
14.4%
a 4674
 
9.9%
i 3841
 
8.1%
o 3631
 
7.7%
r 3168
 
6.7%
t 2783
 
5.9%
n 2729
 
5.8%
l 2535
 
5.3%
c 2024
 
4.3%
s 1907
 
4.0%
Other values (16) 13283
28.0%
Uppercase Letter
ValueCountFrequency (%)
S 1448
16.0%
C 1097
12.1%
B 780
 
8.6%
P 631
 
7.0%
M 486
 
5.4%
R 485
 
5.4%
T 478
 
5.3%
L 448
 
5.0%
G 419
 
4.6%
H 390
 
4.3%
Other values (16) 2374
26.3%
Other Punctuation
ValueCountFrequency (%)
? 5601
81.7%
/ 708
 
10.3%
, 245
 
3.6%
. 122
 
1.8%
& 66
 
1.0%
' 24
 
0.4%
24
 
0.4%
23
 
0.3%
· 14
 
0.2%
: 13
 
0.2%
Other values (3) 14
 
0.2%
Decimal Number
ValueCountFrequency (%)
0 358
33.0%
1 260
24.0%
2 137
 
12.6%
5 111
 
10.2%
3 75
 
6.9%
8 44
 
4.1%
4 40
 
3.7%
6 33
 
3.0%
7 19
 
1.8%
9 5
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 845
94.7%
46
 
5.2%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 844
94.5%
48
 
5.4%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
+ 206
96.3%
~ 8
 
3.7%
Space Separator
ValueCountFrequency (%)
8546
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 233
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 8
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 56426
50.3%
Common 18726
 
16.7%
Hangul 14183
 
12.6%
Katakana 10973
 
9.8%
Han 10211
 
9.1%
Hiragana 1681
 
1.5%

Most frequent character per script

Han
ValueCountFrequency (%)
295
 
2.9%
287
 
2.8%
206
 
2.0%
189
 
1.9%
183
 
1.8%
182
 
1.8%
158
 
1.5%
154
 
1.5%
154
 
1.5%
144
 
1.4%
Other values (711) 8259
80.9%
Hangul
ValueCountFrequency (%)
402
 
2.8%
386
 
2.7%
372
 
2.6%
352
 
2.5%
231
 
1.6%
210
 
1.5%
190
 
1.3%
177
 
1.2%
175
 
1.2%
170
 
1.2%
Other values (634) 11518
81.2%
Katakana
ValueCountFrequency (%)
735
 
6.7%
521
 
4.7%
494
 
4.5%
433
 
3.9%
427
 
3.9%
418
 
3.8%
382
 
3.5%
355
 
3.2%
320
 
2.9%
305
 
2.8%
Other values (70) 6583
60.0%
Hiragana
ValueCountFrequency (%)
255
 
15.2%
139
 
8.3%
95
 
5.7%
83
 
4.9%
66
 
3.9%
62
 
3.7%
58
 
3.5%
52
 
3.1%
50
 
3.0%
48
 
2.9%
Other values (53) 773
46.0%
Latin
ValueCountFrequency (%)
e 6815
 
12.1%
a 4674
 
8.3%
i 3841
 
6.8%
o 3631
 
6.4%
r 3168
 
5.6%
t 2783
 
4.9%
n 2729
 
4.8%
l 2535
 
4.5%
c 2024
 
3.6%
s 1907
 
3.4%
Other values (42) 22319
39.6%
Common
ValueCountFrequency (%)
8546
45.6%
? 5601
29.9%
) 845
 
4.5%
( 844
 
4.5%
/ 708
 
3.8%
0 358
 
1.9%
1 260
 
1.4%
, 245
 
1.3%
- 233
 
1.2%
+ 206
 
1.1%
Other values (27) 880
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 74983
66.8%
Hangul 14182
 
12.6%
Katakana 10973
 
9.8%
CJK 10206
 
9.1%
Hiragana 1681
 
1.5%
None 168
 
0.1%
CJK Compat Ideographs 5
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Box Drawing 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8546
 
11.4%
e 6815
 
9.1%
? 5601
 
7.5%
a 4674
 
6.2%
i 3841
 
5.1%
o 3631
 
4.8%
r 3168
 
4.2%
t 2783
 
3.7%
n 2729
 
3.6%
l 2535
 
3.4%
Other values (68) 30660
40.9%
Katakana
ValueCountFrequency (%)
735
 
6.7%
521
 
4.7%
494
 
4.5%
433
 
3.9%
427
 
3.9%
418
 
3.8%
382
 
3.5%
355
 
3.2%
320
 
2.9%
305
 
2.8%
Other values (70) 6583
60.0%
Hangul
ValueCountFrequency (%)
402
 
2.8%
386
 
2.7%
372
 
2.6%
352
 
2.5%
231
 
1.6%
210
 
1.5%
190
 
1.3%
177
 
1.2%
175
 
1.2%
170
 
1.2%
Other values (633) 11517
81.2%
CJK
ValueCountFrequency (%)
295
 
2.9%
287
 
2.8%
206
 
2.0%
189
 
1.9%
183
 
1.8%
182
 
1.8%
158
 
1.5%
154
 
1.5%
154
 
1.5%
144
 
1.4%
Other values (710) 8254
80.9%
Hiragana
ValueCountFrequency (%)
255
 
15.2%
139
 
8.3%
95
 
5.7%
83
 
4.9%
66
 
3.9%
62
 
3.7%
58
 
3.5%
52
 
3.1%
50
 
3.0%
48
 
2.9%
Other values (53) 773
46.0%
None
ValueCountFrequency (%)
48
28.6%
46
27.4%
24
14.3%
23
13.7%
· 14
 
8.3%
6
 
3.6%
3
 
1.8%
2
 
1.2%
1
 
0.6%
1
 
0.6%
CJK Compat Ideographs
ValueCountFrequency (%)
5
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Box Drawing
ValueCountFrequency (%)
1
100.0%

가격
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size156.2 KiB

Correlations

2024-03-15T03:38:11.351179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
언어휴일주차장와이파이
언어1.0000.9250.9600.925
휴일0.9251.0000.7980.938
주차장0.9600.7981.0000.813
와이파이0.9250.9380.8131.000
2024-03-15T03:38:11.608891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주차장와이파이언어휴일
주차장1.0000.6530.9640.632
와이파이0.6531.0000.8160.639
언어0.9640.8161.0000.816
휴일0.6320.6390.8161.000
2024-03-15T03:38:11.863317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
언어휴일주차장와이파이
언어1.0000.8160.9640.816
휴일0.8161.0000.6320.639
주차장0.9640.6321.0000.653
와이파이0.8160.6390.6531.000

Missing values

2024-03-15T03:37:41.712977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T03:37:42.336850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-15T03:37:42.793246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

언어업소명업소위치전화번호지번주소도로명주소시작영업시간종료영업시간휴일주차장와이파이홈페이지반려견 동반입장 여부메뉴가격
62748JPジャ?ジャ?一番地完山區 西新洞全羅北道 全州市 完山區 西新洞 860-8全羅北道 全州市 完山區 全?路 40063-255-216412:00:0021:00:00不可提供不可無し無しもち米酢豚(大)28000
90243ENDon Donjeong BranchGosa-dong, Wansan-gu186, Gosa-dong, Wansan-gu, Jeonju-si, Jeollabuk-do74-50, Jeonjugaeksa 4-gil, Wansan-gu, Jeonju-si, Jeollabuk-do063-282-655111:00:0022:00:00impossibleofferPossible<NA>https://blog.naver.com/zoqtir/221556367827Set Menu of Offal Hot Pot with Rice14000
48483CN星巴克 全州西新店完山區 西新洞全羅北道 全州市 完山區 西新洞 964-6全羅北道 全州市 完山區 西新路 1041522-3232B : 星期一~星期五 07:30, 星期六~星期天 08:00B : 平日 21:30, 周末 22:00不可??可能http://www.starbucks.co.kr/?年?茶 Tall4100
74662CN名牌海?面的松川店德津區 松川1洞全羅北道 全州市 德津區 松川洞2街 468-5全羅北道 全州市 德津區 松川中央路 234063-252-666311:00:0020:30:00不可未提供不可星期一牡?炒?面9000
49791ENUga Yangpyeong Haejangguk Hyoja BranchHyoja-dong, Wansan-gu382, Hyoja-dong 1-ga, Wansan-gu, Jeonju-si, Jeollabuk-do180, Geomapyeong-ro, Wansan-gu, Jeonju-si, Jeollabuk-do0507-1444-595009:00:0023:00:00impossibleofferPossible<NA>http://woogayp.co.krSoju/Beer/Unrefined Rice Wine4000
41725ENChante JoursHyoja-dong 3-ga, Wansan-gu1631-12, Hyoja-dong 3-ga, Wansan-gu, Jeonju-si, Jeollabuk-do11-3, Majeon 4-gil, Wansan-gu, Jeonju-si, Jeollabuk-do063-223-720011:30:0021:30:00impossibleofferPossibleSunday<NA>Slice of Cake5000
78413ENBless Roll Jeonju New town BranchHyoja 5-dong, Wansan-gu1st floor, 1155-5, Hyoja-dong 2-ga, Wansan-gu, Jeonju-si, Jeollabuk-do1st floor, 56 Hongsannam-ro, Wansan-gu, Jeonju-si, Jeollabuk-do063-227-007009:00:0001:00:00impossibleofferPossible<NA>https://youtu.be/8irI3mpOEokStrawberry Soft Ice Cream / Choco Soft Ice Cream3800
96845CN??尼全州松川店德津區 松川洞全羅北道 全州市 德津區 松川洞2街 523-7全羅北道 全州市 德津區 ?川3街 10507-1419-868709:30:0022:00:00不可??不可almond affogato4200
33805JPボベ飯店_全州西新店完山區 西新洞全羅北道 全州市 完山區 西新洞 832-1 1?全羅北道 全州市 完山區 西新路 40 1?063-275-295211:25:0022:00:00不可提供可能無しhttp://bobaebanjum.co.kr/海鮮チャンポン18000
92944ENPaekdabang Jeonju Jesus Hospital BranchJunghwasan-dong 1-ga, Wansan-gu321-3, Junghwasan-dong 1-ga, Wansan-gu, Jeonju-si, Jeollabuk-do345, Seowon-ro, Wansan-gu, Jeonju-si, Jeollabuk-do0507-1389-1011B : Monday to Friday 08:30, Saturday to Sunday 09:3022:00:00impossibleofferPossible<NA>https://paikdabang.com/Sweet Potato Latte2000
언어업소명업소위치전화번호지번주소도로명주소시작영업시간종료영업시간휴일주차장와이파이홈페이지반려견 동반입장 여부메뉴가격
13969KO옥류관_송천점덕진구 송천1동전북특별자치도 전주시 덕진구 송천동2가 614-11전북특별자치도 전주시 덕진구 시천로 114063-255-165611:30:0021:30:00불가미제공가능없음없음회냉면 회사리5000
65528KO변산반도횟집완산구 중화산동2가전북특별자치도 전주시 완산구 중화산동2가 597-8전북특별자치도 전주시 완산구 중산6길 15-6050-7744-338311:00:0022:00:00가능제공가능없음없음해삼20000
77988CN海?完山區 西新洞全羅北道 全州市 完山區 西新洞 948-17全羅北道 全州市 完山區 古沙坪路 16063-274-247817:00:0022:00:00不可??不可2,4周 星期一半半四人120000
45335CN莫蒙特?·斯?德?斯完山區 孝子洞3街全羅北道 全州市 完山區 孝子洞3街 1615-6全羅北道 全州市 完山區 文?大5街 10010-2463-1722B : 平日 08:30, 周末 09:3021:00:00不可??可能香草茶5000
56219KO한솥도시락_전북대정문점덕진구 덕진동전북특별자치도 전주시 덕진구 덕진동1가 1312-70전북특별자치도 전주시 덕진구 명륜5길 20063-252-855508:00:0021:00:00불가제공가능없음http://www.hsd.co.kr국물떡볶이 튀김세트4800
74060JP手際完山區 中華山洞1街全羅北道 全州市 完山區 中華山洞1街 303-7 102?全羅北道 全州市 完山區 ?院路 356-1 102毫063-285-993310:00:0020:00:00不可提供可能日曜日無しチ?ズキンパプ3500
58533JPコンサン_松川店德津區 松川洞全羅北道 全州市 德津區 松川洞1街 138-6全羅北道 全州市 德津區 松溪路 119063-285-535309:00:0023:00:00不可提供不可無しhttp://www.congsan.co.kr/スイ?トポテトラテ4500
21435CN木制收音机 客舍店完山區 高士洞全羅北道 全州市 完山區 高士洞 392-4全羅北道 全州市 完山區 全州客舍3街 46-5063-232-700712:00:0022:00:00不可??不可星期三http://instagram.com/caf?_namooradio香草茶6000
47940KO엘린스샌드위치_전주효자동점완산구 효자동전북특별자치도 전주시 완산구 효자동2가 1210-1전북특별자치도 전주시 완산구 소태정3길 15010-6514-315008:00:0016:00:00불가미제공불가일요일http://blog.naver.com/ellins_jeonju베이직 푸드박스10000
94726ENWonjo Hamheung Naengmyeon Main BranchGosa-dong, Wansan-gu367-3, Gosa-dong, Wansan-gu, Jeonju-si, Jeollabuk-do59, Jeonjugaeksa 4-gil, Wansan-gu, Jeonju-si, Jeollabuk-do063-282-994611:00:0022:00:00impossibleNot provideimpossible<NA><NA>Boiled Mandu4000

Duplicate rows

Most frequently occurring

언어업소명업소위치전화번호지번주소도로명주소휴일주차장와이파이홈페이지반려견 동반입장 여부메뉴# duplicates
21CN札幌路?集德津區 牛牙洞2街全羅北道 全州市 德津區 牛牙洞2街 866-5全羅北道 全州市 德津區 乾山路 284063-245-6970不可??不可星期天???冬面3
0CN??尼全州松川店德津區 松川洞全羅北道 全州市 德津區 松川洞2街 523-7全羅北道 全州市 德津區 ?川3街 10507-1419-8687不可??不可拿?2
1CN??尼全州松川店德津區 松川洞全羅北道 全州市 德津區 松川洞2街 523-7全羅北道 全州市 德津區 ?川3街 10507-1419-8687不可??不可焦糖?奇?2
2CN??弗洛特德津區 松川洞全羅北道 全州市 德津區 松川洞2街 179-119全羅北道 全州市 德津區 沙斤1街 31063-252-9224不可??不可Strawberry Latte2
3CN??水?完山區 孝子洞3街全羅北道 全州市 完山區 孝子洞3街 1480-1全羅北道 全州市 完山區 西谷7街 10507-1310-4002不可未提供不可http://dokdosusan.mtsolution.co.kr海?2
4CN??男人家?泡茶德津區 松川洞全羅北道 全州市 德津區 松川洞1街 98 102?全羅北道 全州市 德津區 五松1街 37-5 102毫010-9279-6359不可未提供可能星期天http://instagram.com/two_man_beer???2
5CN?姆士 全州回?城市店德津區 松川洞全羅北道 全州市 德津區 松川洞2街 1330-14全羅北道 全州市 德津區 洗兵2街 32-18063-252-1445不可??可能http://tomntoms.com/?布奇?2
6CN?家蛤?火?全州新街分店完山區 孝子洞3街全羅北道 全州市 完山區 孝子洞3街 1630-4全羅北道 全州市 完山區 ?田3街 20507-1483-3852不可未提供可能http://www.택이네.comTiger Shrimp2
7CN?家面屋全州?店完山區 中華山洞2街全羅北道 全州市 完山區 中華山洞2街 728-3 街全羅北道 全州市 完山區 油然路 296063-224-0492不可??可能http://jangga.7x7.kr?酒2
8CN?是好紫菜包?天堂新市?完山區 孝子洞3街全羅北道 全州市 完山區 孝子洞3街 1609-15全羅北道 全州市 完山區 洪山路 377063-223-6325不可未提供不可油豆腐辛奇?冬面2