Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 9330 |
Missing cells | 1923 |
Missing cells (%) | 2.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 601.5 KiB |
Average record size in memory | 66.0 B |
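The derived figures in this table can be cross-checked from the raw counts. The following is a minimal sketch of that arithmetic, not the profiler's own code. It assumes that 1 KiB is 1024 bytes and that the per-column missing counts are the ones reported in the variable sections of this report (업종, 식당명, 도로명주소).

```python
# Figures copied from the report; nothing here reads the actual dataset.
n_vars = 8
n_obs = 9330
total_kib = 601.5
missing_by_column = {"업종": 1908, "식당명": 6, "도로명주소": 9}

# Total missing cells and their share of all cells (variables x observations).
missing_cells = sum(missing_by_column.values())
missing_pct = 100 * missing_cells / (n_vars * n_obs)

# Average bytes per record, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_kib * 1024 / n_obs

print(missing_cells)                 # 1923
print(f"{missing_pct:.1f}%")         # 2.6%
print(f"{avg_record_bytes:.1f} B")   # 66.0 B
```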
Variable types
Numeric | 2 |
---|---|
Categorical | 1 |
Text | 4 |
DateTime | 1 |
Dataset
Description | Multilingual restaurant information (8 fields, including restaurant name, business type, address, and language type) |
---|---|
Author | Jeollanam-do (전라남도) |
URL | https://www.data.go.kr/data/15076623/fileData.do |
Reproduction
Analysis started | 2023-12-12 21:42:56.561108 |
---|---|
Analysis finished | 2023-12-12 21:42:58.280514 |
Duration | 1.72 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
다국어식당정보ID
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 9330 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4665.5 |
Minimum | 1 |
---|---|
Maximum | 9330 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 82.1 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 467.45 |
Q1 | 2333.25 |
median | 4665.5 |
Q3 | 6997.75 |
95-th percentile | 8863.55 |
Maximum | 9330 |
Range | 9329 |
Interquartile range (IQR) | 4664.5 |
Descriptive statistics
Standard deviation | 2693.4833 |
---|---|
Coefficient of variation (CV) | 0.57731933 |
Kurtosis | -1.2 |
Mean | 4665.5 |
Median Absolute Deviation (MAD) | 2332.5 |
Skewness | 0 |
Sum | 43529115 |
Variance | 7254852.5 |
Monotonicity | Strictly increasing |
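The statistics for this column (9330 distinct values, minimum 1, maximum 9330, strictly increasing) are consistent with the consecutive integers 1 through 9330. The following is a minimal sketch that assumes exactly that sequence, a sample variance (n − 1 denominator), and linearly interpolated percentiles; the `percentile` helper is illustrative and not taken from the profiler.

```python
from math import sqrt
from statistics import mean, median, variance

n = 9330
ids = list(range(1, n + 1))  # assumption: the IDs are exactly 1..n


def percentile(sorted_values, q):
    """Linearly interpolated percentile of a sorted list, q in [0, 1]."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])


total = sum(ids)            # n(n+1)/2
avg = mean(ids)             # (n+1)/2
var = variance(ids)         # n(n+1)/12 for consecutive integers
std = sqrt(var)
p5 = percentile(ids, 0.05)
q1 = percentile(ids, 0.25)
q3 = percentile(ids, 0.75)
med = median(ids)
mad = median([abs(x - med) for x in ids])  # median absolute deviation

print(total, avg, var)       # 43529115 4665.5 7254852.5
print(round(std, 4))         # 2693.4833
print(round(p5, 2), q1, q3)  # 467.45 2333.25 6997.75
print(q3 - q1, mad)          # 4664.5 2332.5
```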
Common values
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
6198 | 1 | < 0.1% |
6218 | 1 | < 0.1% |
6219 | 1 | < 0.1% |
6220 | 1 | < 0.1% |
6221 | 1 | < 0.1% |
6222 | 1 | < 0.1% |
6223 | 1 | < 0.1% |
6224 | 1 | < 0.1% |
6225 | 1 | < 0.1% |
Other values (9320) | 9320 |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Maximum 10 values
Value | Count | Frequency (%) |
9330 | 1 | |
9329 | 1 | |
9328 | 1 | |
9327 | 1 | |
9326 | 1 | |
9325 | 1 | |
9324 | 1 | |
9323 | 1 | |
9322 | 1 | |
9321 | 1 |
식당ID
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 3110 |
---|---|
Distinct (%) | 33.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 753151.39 |
Minimum | 2858 |
---|---|
Maximum | 865303 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 82.1 KiB |
Quantile statistics
Minimum | 2858 |
---|---|
5-th percentile | 197468 |
Q1 | 857711 |
median | 858711.5 |
Q3 | 859674 |
95-th percentile | 864297 |
Maximum | 865303 |
Range | 862445 |
Interquartile range (IQR) | 1963 |
Descriptive statistics
Standard deviation | 219197.56 |
---|---|
Coefficient of variation (CV) | 0.29104051 |
Kurtosis | 2.3446412 |
Mean | 753151.39 |
Median Absolute Deviation (MAD) | 985.5 |
Skewness | -1.9170256 |
Sum | 7.0269025 × 10^9 |
Variance | 4.8047572 × 10^10 |
Monotonicity | Increasing |
Common values
Value | Count | Frequency (%) |
2858 | 3 | < 0.1% |
859360 | 3 | < 0.1% |
859328 | 3 | < 0.1% |
859330 | 3 | < 0.1% |
859333 | 3 | < 0.1% |
859334 | 3 | < 0.1% |
859336 | 3 | < 0.1% |
859337 | 3 | < 0.1% |
859338 | 3 | < 0.1% |
859339 | 3 | < 0.1% |
Other values (3100) | 9300 |
Minimum 10 values
Value | Count | Frequency (%) |
2858 | 3 | |
3820 | 3 | |
4419 | 3 | |
4751 | 3 | |
5075 | 3 | |
6215 | 3 | |
10302 | 3 | |
11705 | 3 | |
12433 | 3 | |
16676 | 3 |
Maximum 10 values
Value | Count | Frequency (%) |
865303 | 3 | |
865302 | 3 | |
865301 | 3 | |
865300 | 3 | |
865299 | 3 | |
865297 | 3 | |
865295 | 3 | |
865288 | 3 | |
865287 | 3 | |
865282 | 3 |
언어타입
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 73.0 KiB |
en | |
---|---|
ja | |
zh-Hans |
Length
Max length | 7 |
---|---|
Median length | 2 |
Mean length | 3.6666667 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | en |
---|---|
2nd row | ja |
3rd row | zh-Hans |
4th row | en |
5th row | ja |
Common Values
Value | Count | Frequency (%) |
en | 3110 | |
ja | 3110 | |
zh-Hans | 3110 |
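The length statistics reported for this column follow directly from the three equally frequent language codes. The following is a minimal sketch using only the counts in the Common Values table.

```python
from statistics import mean, median

# Counts copied from the Common Values table.
counts = {"en": 3110, "ja": 3110, "zh-Hans": 3110}

# One length entry per row of the dataset.
lengths = [len(code) for code, count in counts.items() for _ in range(count)]

n_rows = sum(counts.values())
print(n_rows)                         # 9330
print(min(lengths), max(lengths))     # 2 7
print(median(lengths))                # 2.0
print(round(mean(lengths), 7))        # 3.6666667
```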
업종
Text
MISSING
 
Distinct | 222 |
---|---|
Distinct (%) | 3.0% |
Missing | 1908 |
Missing (%) | 20.5% |
Memory size | 73.0 KiB |
Value | Count | Frequency (%) |
korean | 1254 | 12.4% |
cuisine | 1243 | 12.3% |
韩餐 | 1165 | 11.5% |
韓食 | 1165 | 11.5% |
fish | 231 | 2.3% |
生鱼片 | 216 | 2.1% |
刺身 | 216 | 2.1% |
sliced | 216 | 2.1% |
raw | 216 | 2.1% |
soup | 182 | 1.8% |
Other values (227) | 3994 |
Most occurring characters
Value | Count | Frequency (%) |
e | 4271 | 9.1% |
i | 3538 | 7.6% |
n | 2840 | 6.1% |
(space) | 2712 | 5.8% |
a | 2167 | 4.6% |
o | 2073 | 4.4% |
s | 1987 | 4.3% |
c | 1873 | 4.0% |
r | 1703 | 3.6% |
u | 1649 | 3.5% |
Other values (262) | 21931 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 26309 | |
Other Letter | 13563 | |
Uppercase Letter | 3587 | 7.7% |
Space Separator | 2712 | 5.8% |
Other Punctuation | 160 | 0.3% |
Modifier Letter | 154 | 0.3% |
Close Punctuation | 88 | 0.2% |
Open Punctuation | 88 | 0.2% |
Dash Punctuation | 83 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
食 | 1414 | 10.4% |
韩 | 1254 | 9.2% |
韓 | 1254 | 9.2% |
餐 | 1239 | 9.1% |
肉 | 641 | 4.7% |
理 | 302 | 2.2% |
牛 | 297 | 2.2% |
料 | 281 | 2.1% |
鱼 | 271 | 2.0% |
汤 | 228 | 1.7% |
Other values (210) | 6382 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 4271 | |
i | 3538 | |
n | 2840 | |
a | 2167 | |
o | 2073 | |
s | 1987 | |
c | 1873 | |
r | 1703 | 6.5% |
u | 1649 | 6.3% |
l | 740 | 2.8% |
Other values (13) | 3468 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 1254 | |
S | 624 | |
R | 321 | 8.9% |
B | 268 | 7.5% |
F | 256 | 7.1% |
D | 159 | 4.4% |
C | 153 | 4.3% |
G | 134 | 3.7% |
P | 104 | 2.9% |
M | 84 | 2.3% |
Other values (10) | 230 | 6.4% |
Other Punctuation
Value | Count | Frequency (%) |
' | 82 | |
, | 78 |
Close Punctuation
Value | Count | Frequency (%) |
) | 45 | |
) | 43 |
Open Punctuation
Value | Count | Frequency (%) |
( | 45 | |
( | 43 |
Space Separator
Value | Count | Frequency (%) |
(space) | 2712 |
Modifier Letter
Value | Count | Frequency (%) |
ー | 154 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 83 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 29896 | |
Han | 11158 | 23.9% |
Common | 3285 | 7.0% |
Katakana | 1938 | 4.1% |
Hiragana | 467 | 1.0% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
食 | 1414 | 12.7% |
韩 | 1254 | 11.2% |
韓 | 1254 | 11.2% |
餐 | 1239 | 11.1% |
肉 | 641 | 5.7% |
理 | 302 | 2.7% |
牛 | 297 | 2.7% |
料 | 281 | 2.5% |
鱼 | 271 | 2.4% |
汤 | 228 | 2.0% |
Other values (143) | 3977 |
Katakana
Value | Count | Frequency (%) |
カ | 162 | 8.4% |
ッ | 162 | 8.4% |
ン | 126 | 6.5% |
プ | 125 | 6.4% |
ク | 105 | 5.4% |
タ | 95 | 4.9% |
パ | 94 | 4.9% |
ス | 89 | 4.6% |
ム | 79 | 4.1% |
ャ | 76 | 3.9% |
Other values (38) | 825 |
Latin
Value | Count | Frequency (%) |
e | 4271 | |
i | 3538 | |
n | 2840 | |
a | 2167 | 7.2% |
o | 2073 | 6.9% |
s | 1987 | 6.6% |
c | 1873 | 6.3% |
r | 1703 | 5.7% |
u | 1649 | 5.5% |
K | 1254 | 4.2% |
Other values (33) | 6541 |
Hiragana
Value | Count | Frequency (%) |
き | 115 | |
し | 45 | 9.6% |
う | 28 | 6.0% |
ど | 28 | 6.0% |
ょ | 26 | 5.6% |
じ | 26 | 5.6% |
の | 23 | 4.9% |
ふ | 23 | 4.9% |
ぐ | 23 | 4.9% |
ゃ | 22 | 4.7% |
Other values (9) | 108 |
Common
Value | Count | Frequency (%) |
(space) | 2712 | |
ー | 154 | 4.7% |
- | 83 | 2.5% |
' | 82 | 2.5% |
, | 78 | 2.4% |
) | 45 | 1.4% |
( | 45 | 1.4% |
) | 43 | 1.3% |
( | 43 | 1.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 32937 | |
CJK | 11137 | 23.8% |
Katakana | 2092 | 4.5% |
Hiragana | 467 | 1.0% |
None | 90 | 0.2% |
CJK Compat Ideographs | 21 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 4271 | |
i | 3538 | |
n | 2840 | 8.6% |
(space) | 2712 | 8.2% |
a | 2167 | 6.6% |
o | 2073 | 6.3% |
s | 1987 | 6.0% |
c | 1873 | 5.7% |
r | 1703 | 5.2% |
u | 1649 | 5.0% |
Other values (39) | 8124 |
CJK
Value | Count | Frequency (%) |
食 | 1414 | 12.7% |
韩 | 1254 | 11.3% |
韓 | 1254 | 11.3% |
餐 | 1239 | 11.1% |
肉 | 641 | 5.8% |
理 | 302 | 2.7% |
牛 | 297 | 2.7% |
料 | 281 | 2.5% |
鱼 | 271 | 2.4% |
汤 | 228 | 2.0% |
Other values (142) | 3956 |
Katakana
Value | Count | Frequency (%) |
カ | 162 | 7.7% |
ッ | 162 | 7.7% |
ー | 154 | 7.4% |
ン | 126 | 6.0% |
プ | 125 | 6.0% |
ク | 105 | 5.0% |
タ | 95 | 4.5% |
パ | 94 | 4.5% |
ス | 89 | 4.3% |
ム | 79 | 3.8% |
Other values (39) | 901 |
Hiragana
Value | Count | Frequency (%) |
き | 115 | |
し | 45 | 9.6% |
う | 28 | 6.0% |
ど | 28 | 6.0% |
ょ | 26 | 5.6% |
じ | 26 | 5.6% |
の | 23 | 4.9% |
ふ | 23 | 4.9% |
ぐ | 23 | 4.9% |
ゃ | 22 | 4.7% |
Other values (9) | 108 |
None
Value | Count | Frequency (%) |
) | 45 | |
( | 45 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
料 | 21 |
식당명
Text
Distinct | 2944 |
---|---|
Distinct (%) | 31.6% |
Missing | 6 |
Missing (%) | 0.1% |
Memory size | 73.0 KiB |
Length
Max length | 68 |
---|---|
Median length | 46 |
Mean length | 17.121943 |
Min length | 2 |
Characters and Unicode
Total characters | 159645 |
---|---|
Distinct characters | 81 |
Distinct categories | 10 |
Distinct scripts | 4 |
Distinct blocks | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Hoseongdang |
---|---|
2nd row | Hoseongdang |
3rd row | Hoseongdang |
4th row | Hwanggeum Dokki |
5th row | Hwanggeum Dokki |
Value | Count | Frequency (%) |
sikdang | 1125 | 5.5% |
hoetjip | 378 | 1.8% |
garden | 357 | 1.7% |
galbi | 279 | 1.4% |
gukbap | 252 | 1.2% |
hoegwan | 219 | 1.1% |
hanu | 201 | 1.0% |
jangeo | 168 | 0.8% |
sutbul | 150 | 0.7% |
gamjatang | 138 | 0.7% |
Other values (2895) | 17334 |
Most occurring characters
Value | Count | Frequency (%) |
a | 16878 | 10.6% |
n | 16488 | 10.3% |
o | 13005 | 8.1% |
e | 12171 | 7.6% |
g | 11982 | 7.5% |
(space) | 11277 | 7.1% |
i | 7467 | 4.7% |
u | 7461 | 4.7% |
m | 4878 | 3.1% |
k | 4602 | 2.9% |
Other values (71) | 53436 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 125424 | |
Uppercase Letter | 22260 | 13.9% |
Space Separator | 11277 | 7.1% |
Decimal Number | 294 | 0.2% |
Other Punctuation | 138 | 0.1% |
Open Punctuation | 105 | 0.1% |
Close Punctuation | 105 | 0.1% |
Other Letter | 27 | < 0.1% |
Dash Punctuation | 9 | < 0.1% |
Math Symbol | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 16878 | |
n | 16488 | |
o | 13005 | |
e | 12171 | |
g | 11982 | |
i | 7467 | 6.0% |
u | 7461 | 5.9% |
m | 4878 | 3.9% |
k | 4602 | 3.7% |
j | 3699 | 2.9% |
Other values (17) | 26793 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 3705 | |
G | 2931 | |
J | 2562 | |
H | 2433 | |
M | 1506 | |
B | 1446 | 6.5% |
D | 1347 | 6.1% |
C | 993 | 4.5% |
N | 969 | 4.4% |
Y | 954 | 4.3% |
Other values (16) | 3414 |
Decimal Number
Value | Count | Frequency (%) |
1 | 63 | |
2 | 51 | |
4 | 33 | |
9 | 30 | |
5 | 24 | 8.2% |
3 | 24 | 8.2% |
0 | 24 | 8.2% |
6 | 18 | 6.1% |
8 | 15 | 5.1% |
7 | 12 | 4.1% |
Other Punctuation
Value | Count | Frequency (%) |
& | 57 | |
. | 42 | |
, | 18 | 13.0% |
' | 9 | 6.5% |
· | 6 | 4.3% |
! | 3 | 2.2% |
: | 3 | 2.2% |
Other Letter
Value | Count | Frequency (%) |
家 | 6 | |
大 | 6 | |
福 | 6 | |
李 | 3 | |
海 | 3 | |
ㄱ | 3 |
Space Separator
Value | Count | Frequency (%) |
(space) | 11277 |
Open Punctuation
Value | Count | Frequency (%) |
( | 105 |
Close Punctuation
Value | Count | Frequency (%) |
) | 105 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9 |
Math Symbol
Value | Count | Frequency (%) |
+ | 6 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 147684 | |
Common | 11934 | 7.5% |
Han | 24 | < 0.1% |
Hangul | 3 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 16878 | 11.4% |
n | 16488 | 11.2% |
o | 13005 | 8.8% |
e | 12171 | 8.2% |
g | 11982 | 8.1% |
i | 7467 | 5.1% |
u | 7461 | 5.1% |
m | 4878 | 3.3% |
k | 4602 | 3.1% |
S | 3705 | 2.5% |
Other values (43) | 49047 |
Common
Value | Count | Frequency (%) |
(space) | 11277 | |
( | 105 | 0.9% |
) | 105 | 0.9% |
1 | 63 | 0.5% |
& | 57 | 0.5% |
2 | 51 | 0.4% |
. | 42 | 0.4% |
4 | 33 | 0.3% |
9 | 30 | 0.3% |
5 | 24 | 0.2% |
Other values (12) | 147 | 1.2% |
Han
Value | Count | Frequency (%) |
家 | 6 | |
大 | 6 | |
福 | 6 | |
李 | 3 | |
海 | 3 |
Hangul
Value | Count | Frequency (%) |
ㄱ | 3 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 159558 | |
None | 60 | < 0.1% |
CJK | 21 | < 0.1% |
CJK Compat Ideographs | 3 | < 0.1% |
Compat Jamo | 3 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 16878 | 10.6% |
n | 16488 | 10.3% |
o | 13005 | 8.2% |
e | 12171 | 7.6% |
g | 11982 | 7.5% |
(space) | 11277 | 7.1% |
i | 7467 | 4.7% |
u | 7461 | 4.7% |
m | 4878 | 3.1% |
k | 4602 | 2.9% |
Other values (62) | 53349 |
None
Value | Count | Frequency (%) |
é | 42 | |
É | 12 | 20.0% |
· | 6 | 10.0% |
CJK
Value | Count | Frequency (%) |
家 | 6 | |
大 | 6 | |
福 | 6 | |
海 | 3 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
李 | 3 |
Compat Jamo
Value | Count | Frequency (%) |
ㄱ | 3 |
도로명주소
Text
Distinct | 8511 |
---|---|
Distinct (%) | 91.3% |
Missing | 9 |
Missing (%) | 0.1% |
Memory size | 73.0 KiB |
Length
Max length | 70 |
---|---|
Median length | 60 |
Mean length | 31.594893 |
Min length | 11 |
Characters and Unicode
Total characters | 294496 |
---|---|
Distinct characters | 324 |
Distinct categories | 6 |
Distinct scripts | 4 |
Distinct blocks | 4 |
Unique
Unique | 8163 |
---|---|
Unique (%) | 87.6% |
Sample
1st row | 17 Jangpyeong-ro Suncheon-si Jeollanam-do |
---|---|
2nd row | 全羅南道 スンチョン市 チャンピョンロ17 |
3rd row | 全罗南道 顺天市 Jangpyeong路17 |
4th row | 10 Galti-ro Gwangyang-eup Gwangyang-si Jeollanam-do |
5th row | 全羅南道 クァンヤン市 クァンヤン邑 カルティロ10 |
Value | Count | Frequency (%) |
全羅南道 | 3107 | 8.3% |
jeollanam-do | 3107 | 8.3% |
全罗南道 | 3107 | 8.3% |
suncheon-si | 470 | 1.3% |
顺天市 | 470 | 1.3% |
スンチョン市 | 470 | 1.3% |
ヨス市 | 426 | 1.1% |
麗水市 | 426 | 1.1% |
yeosu-si | 426 | 1.1% |
naju-si | 253 | 0.7% |
Other values (8309) | 25110 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 28363 | 9.6% |
n | 18713 | 6.4% |
o | 16763 | 5.7% |
a | 15729 | 5.3% |
- | 13633 | 4.6% |
e | 12667 | 4.3% |
g | 11657 | 4.0% |
ン | 8714 | 3.0% |
l | 8513 | 2.9% |
u | 7599 | 2.6% |
Other values (314) | 152145 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 125933 | |
Other Letter | 86723 | |
Space Separator | 28363 | 9.6% |
Decimal Number | 25777 | 8.8% |
Uppercase Letter | 14067 | 4.8% |
Dash Punctuation | 13633 | 4.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
ン | 8714 | 10.0% |
南 | 6463 | 7.5% |
道 | 6318 | 7.3% |
全 | 6214 | 7.2% |
郡 | 3510 | 4.0% |
罗 | 3380 | 3.9% |
羅 | 3107 | 3.6% |
市 | 2724 | 3.1% |
邑 | 2412 | 2.8% |
ル | 1950 | 2.2% |
Other values (261) | 41931 |
Lowercase Letter
Value | Count | Frequency (%) |
n | 18713 | |
o | 16763 | |
a | 15729 | |
e | 12667 | |
g | 11657 | |
l | 8513 | |
u | 7599 | 6.0% |
m | 6487 | 5.2% |
i | 4601 | 3.7% |
s | 4154 | 3.3% |
Other values (11) | 19050 |
Uppercase Letter
Value | Count | Frequency (%) |
J | 4238 | |
S | 1841 | |
G | 1547 | 11.0% |
Y | 1347 | 9.6% |
H | 1110 | 7.9% |
D | 774 | 5.5% |
B | 724 | 5.1% |
M | 668 | 4.7% |
N | 628 | 4.5% |
C | 242 | 1.7% |
Other values (10) | 948 | 6.7% |
Decimal Number
Value | Count | Frequency (%) |
1 | 5914 | |
2 | 3751 | |
3 | 3077 | |
4 | 2316 | 9.0% |
5 | 2223 | 8.6% |
6 | 2013 | 7.8% |
7 | 1926 | 7.5% |
8 | 1590 | 6.2% |
0 | 1533 | 5.9% |
9 | 1434 | 5.6% |
Space Separator
Value | Count | Frequency (%) |
(space) | 28363 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 13633 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 140000 | |
Common | 67773 | |
Han | 48798 | 16.6% |
Katakana | 37925 | 12.9% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
南 | 6463 | |
道 | 6318 | |
全 | 6214 | |
郡 | 3510 | 7.2% |
罗 | 3380 | 6.9% |
羅 | 3107 | 6.4% |
市 | 2724 | 5.6% |
邑 | 2412 | 4.9% |
面 | 1706 | 3.5% |
路 | 1540 | 3.2% |
Other values (190) | 11424 |
Katakana
Value | Count | Frequency (%) |
ン | 8714 | |
ル | 1950 | 5.1% |
ム | 1839 | 4.8% |
チ | 1689 | 4.5% |
ギ | 1655 | 4.4% |
ョ | 1506 | 4.0% |
ロ | 1492 | 3.9% |
ク | 1364 | 3.6% |
ヨ | 1182 | 3.1% |
ス | 1176 | 3.1% |
Other values (61) | 15358 |
Latin
Value | Count | Frequency (%) |
n | 18713 | |
o | 16763 | |
a | 15729 | |
e | 12667 | 9.0% |
g | 11657 | 8.3% |
l | 8513 | 6.1% |
u | 7599 | 5.4% |
m | 6487 | 4.6% |
i | 4601 | 3.3% |
J | 4238 | 3.0% |
Other values (31) | 33033 |
Common
Value | Count | Frequency (%) |
(space) | 28363 | |
- | 13633 | |
1 | 5914 | 8.7% |
2 | 3751 | 5.5% |
3 | 3077 | 4.5% |
4 | 2316 | 3.4% |
5 | 2223 | 3.3% |
6 | 2013 | 3.0% |
7 | 1926 | 2.8% |
8 | 1590 | 2.3% |
Other values (2) | 2967 | 4.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 207773 | |
CJK | 47872 | 16.3% |
Katakana | 37925 | 12.9% |
CJK Compat Ideographs | 926 | 0.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 28363 | |
n | 18713 | 9.0% |
o | 16763 | 8.1% |
a | 15729 | 7.6% |
- | 13633 | 6.6% |
e | 12667 | 6.1% |
g | 11657 | 5.6% |
l | 8513 | 4.1% |
u | 7599 | 3.7% |
m | 6487 | 3.1% |
Other values (43) | 67649 |
Katakana
Value | Count | Frequency (%) |
ン | 8714 | |
ル | 1950 | 5.1% |
ム | 1839 | 4.8% |
チ | 1689 | 4.5% |
ギ | 1655 | 4.4% |
ョ | 1506 | 4.0% |
ロ | 1492 | 3.9% |
ク | 1364 | 3.6% |
ヨ | 1182 | 3.1% |
ス | 1176 | 3.1% |
Other values (61) | 15358 |
CJK
Value | Count | Frequency (%) |
南 | 6463 | |
道 | 6318 | |
全 | 6214 | |
郡 | 3510 | 7.3% |
罗 | 3380 | 7.1% |
羅 | 3107 | 6.5% |
市 | 2724 | 5.7% |
邑 | 2412 | 5.0% |
面 | 1706 | 3.6% |
路 | 1540 | 3.2% |
Other values (180) | 10498 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
靈 | 471 | |
麗 | 426 | |
樂 | 10 | 1.1% |
綾 | 6 | 0.6% |
立 | 4 | 0.4% |
栗 | 3 | 0.3% |
老 | 2 | 0.2% |
金 | 2 | 0.2% |
臨 | 1 | 0.1% |
蘆 | 1 | 0.1% |
지번주소
Text
Distinct | 7164 |
---|---|
Distinct (%) | 76.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 73.0 KiB |
Length
Max length | 64 |
---|---|
Median length | 55 |
Mean length | 28.15134 |
Min length | 11 |
Characters and Unicode
Total characters | 262652 |
---|---|
Distinct characters | 473 |
Distinct categories | 6 |
Distinct scripts | 4 |
Distinct blocks | 4 |
Unique
Unique | 6645 |
---|---|
Unique (%) | 71.2% |
Sample
1st row | 558-2 Namjeong-dong Suncheon-si Jeollanam-do |
---|---|
2nd row | 全羅南道 スンチョン市 ナムジョン洞558-2 |
3rd row | 全罗南道 顺天市 南井洞558-2 |
4th row | 1794-3 Deongnye-ri Gwangyang-eup Gwangyang-si Jeollanam-do |
5th row | 全羅南道 グァンヤン市 クァンヤン邑 トクレェ 里1794-3 |
Value | Count | Frequency (%) |
全罗南道 | 3110 | 9.0% |
jeollanam-do | 3110 | 9.0% |
全羅南道 | 3110 | 9.0% |
顺天市 | 472 | 1.4% |
suncheon-si | 472 | 1.4% |
スンチョン市 | 472 | 1.4% |
ヨス市 | 426 | 1.2% |
麗水市 | 426 | 1.2% |
yeosu-si | 426 | 1.2% |
ナジュ市 | 253 | 0.7% |
Other values (7544) | 22199 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 25274 | 9.6% |
- | 16665 | 6.3% |
n | 14684 | 5.6% |
o | 12984 | 4.9% |
a | 12217 | 4.7% |
e | 9581 | 3.6% |
g | 8020 | 3.1% |
ン | 7346 | 2.8% |
南 | 6681 | 2.5% |
1 | 6606 | 2.5% |
Other values (463) | 142594 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 96779 | |
Other Letter | 83247 | |
Decimal Number | 29892 | 11.4% |
Space Separator | 25274 | 9.6% |
Dash Punctuation | 16665 | 6.3% |
Uppercase Letter | 10795 | 4.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
ン | 7346 | 8.8% |
南 | 6681 | 8.0% |
道 | 6314 | 7.6% |
全 | 6220 | 7.5% |
郡 | 3539 | 4.3% |
罗 | 3386 | 4.1% |
羅 | 3110 | 3.7% |
市 | 2728 | 3.3% |
里 | 2702 | 3.2% |
邑 | 2495 | 3.0% |
Other values (411) | 38726 |
Lowercase Letter
Value | Count | Frequency (%) |
n | 14684 | |
o | 12984 | |
a | 12217 | |
e | 9581 | |
g | 8020 | |
l | 6483 | |
u | 5867 | 6.1% |
m | 5517 | 5.7% |
d | 4586 | 4.7% |
i | 3614 | 3.7% |
Other values (11) | 13226 |
Uppercase Letter
Value | Count | Frequency (%) |
J | 3845 | |
S | 1301 | 12.1% |
G | 1188 | 11.0% |
Y | 1135 | 10.5% |
H | 837 | 7.8% |
D | 529 | 4.9% |
B | 526 | 4.9% |
N | 466 | 4.3% |
M | 391 | 3.6% |
W | 146 | 1.4% |
Other values (9) | 431 | 4.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 6606 | |
2 | 3819 | |
3 | 2913 | |
4 | 2526 | 8.5% |
6 | 2481 | 8.3% |
5 | 2481 | 8.3% |
7 | 2442 | 8.2% |
8 | 2346 | 7.8% |
0 | 2226 | 7.4% |
9 | 2052 | 6.9% |
Space Separator
Value | Count | Frequency (%) |
(space) | 25274 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 16665 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 107574 | |
Common | 71831 | |
Han | 54735 | |
Katakana | 28512 | 10.9% |
Most frequent character per script
Han
Value | Count | Frequency (%) |
南 | 6681 | |
道 | 6314 | |
全 | 6220 | |
郡 | 3539 | 6.5% |
罗 | 3386 | 6.2% |
羅 | 3110 | 5.7% |
市 | 2728 | 5.0% |
里 | 2702 | 4.9% |
邑 | 2495 | 4.6% |
洞 | 2101 | 3.8% |
Other values (336) | 15459 |
Katakana
Value | Count | Frequency (%) |
ン | 7346 | |
ム | 1697 | 6.0% |
チ | 1391 | 4.9% |
ク | 1254 | 4.4% |
ヨ | 1183 | 4.1% |
ョ | 1161 | 4.1% |
ス | 1135 | 4.0% |
ァ | 863 | 3.0% |
ソ | 860 | 3.0% |
ナ | 827 | 2.9% |
Other values (65) | 10795 |
Latin
Value | Count | Frequency (%) |
n | 14684 | |
o | 12984 | |
a | 12217 | |
e | 9581 | |
g | 8020 | 7.5% |
l | 6483 | 6.0% |
u | 5867 | 5.5% |
m | 5517 | 5.1% |
d | 4586 | 4.3% |
J | 3845 | 3.6% |
Other values (30) | 23790 |
Common
Value | Count | Frequency (%) |
(space) | 25274 | |
- | 16665 | |
1 | 6606 | 9.2% |
2 | 3819 | 5.3% |
3 | 2913 | 4.1% |
4 | 2526 | 3.5% |
6 | 2481 | 3.5% |
5 | 2481 | 3.5% |
7 | 2442 | 3.4% |
8 | 2346 | 3.3% |
Other values (2) | 4278 | 6.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 179405 | |
CJK | 53658 | 20.4% |
Katakana | 28512 | 10.9% |
CJK Compat Ideographs | 1077 | 0.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 25274 | |
- | 16665 | 9.3% |
n | 14684 | 8.2% |
o | 12984 | 7.2% |
a | 12217 | 6.8% |
e | 9581 | 5.3% |
g | 8020 | 4.5% |
1 | 6606 | 3.7% |
l | 6483 | 3.6% |
u | 5867 | 3.3% |
Other values (42) | 61024 |
Katakana
Value | Count | Frequency (%) |
ン | 7346 | |
ム | 1697 | 6.0% |
チ | 1391 | 4.9% |
ク | 1254 | 4.4% |
ヨ | 1183 | 4.1% |
ョ | 1161 | 4.1% |
ス | 1135 | 4.0% |
ァ | 863 | 3.0% |
ソ | 860 | 3.0% |
ナ | 827 | 2.9% |
Other values (65) | 10795 |
CJK
Value | Count | Frequency (%) |
南 | 6681 | |
道 | 6314 | |
全 | 6220 | |
郡 | 3539 | 6.6% |
罗 | 3386 | 6.3% |
羅 | 3110 | 5.8% |
市 | 2728 | 5.1% |
里 | 2702 | 5.0% |
邑 | 2495 | 4.6% |
洞 | 2101 | 3.9% |
Other values (320) | 14382 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
靈 | 475 | |
麗 | 459 | |
蓮 | 93 | 8.6% |
樂 | 10 | 0.9% |
金 | 8 | 0.7% |
栗 | 7 | 0.6% |
綾 | 6 | 0.6% |
麟 | 4 | 0.4% |
立 | 4 | 0.4% |
綠 | 3 | 0.3% |
Other values (6) | 8 | 0.7% |
등록일시
Date
Distinct | 546 |
---|---|
Distinct (%) | 5.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 73.0 KiB |
Minimum | 2021-01-21 13:23:55 |
---|---|
Maximum | 2021-01-21 13:33:45 |
 | 다국어식당정보ID | 식당ID | 언어타입 |
---|---|---|---|
다국어식당정보ID | 1.000 | 0.867 | 0.000 |
식당ID | 0.867 | 1.000 | 0.000 |
언어타입 | 0.000 | 0.000 | 1.000 |
 | 다국어식당정보ID | 식당ID | 언어타입 |
---|---|---|---|
다국어식당정보ID | 1.000 | 1.000 | 0.000 |
식당ID | 1.000 | 1.000 | 0.000 |
언어타입 | 0.000 | 0.000 | 1.000 |
 | 다국어식당정보ID | 식당ID | 언어타입 | 업종 | 식당명 | 도로명주소 | 지번주소 | 등록일시 |
---|---|---|---|---|---|---|---|---|
0 | 1 | 2858 | en | Bakery | Hoseongdang | 17 Jangpyeong-ro Suncheon-si Jeollanam-do | 558-2 Namjeong-dong Suncheon-si Jeollanam-do | 2021-01-21 13:23:55 |
1 | 2 | 2858 | ja | ベーカリー | Hoseongdang | 全羅南道 スンチョン市 チャンピョンロ17 | 全羅南道 スンチョン市 ナムジョン洞558-2 | 2021-01-21 13:23:55 |
2 | 3 | 2858 | zh-Hans | 面包店 | Hoseongdang | 全罗南道 顺天市 Jangpyeong路17 | 全罗南道 顺天市 南井洞558-2 | 2021-01-21 13:23:55 |
3 | 4 | 3820 | en | Japanese (cuisine) | Hwanggeum Dokki | 10 Galti-ro Gwangyang-eup Gwangyang-si Jeollanam-do | 1794-3 Deongnye-ri Gwangyang-eup Gwangyang-si Jeollanam-do | 2021-01-21 13:23:55 |
4 | 5 | 3820 | ja | 和食 | Hwanggeum Dokki | 全羅南道 クァンヤン市 クァンヤン邑 カルティロ10 | 全羅南道 グァンヤン市 クァンヤン邑 トクレェ 里1794-3 | 2021-01-21 13:23:55 |
5 | 6 | 3820 | zh-Hans | 日本料理 | Hwanggeum Dokki | 全罗南道 光阳市 光阳邑 Galti路10 | 全罗南道 光阳市 光阳邑 德礼里1794-3 | 2021-01-21 13:23:55 |
6 | 7 | 4419 | en | Japanese (cuisine) | Geobukseon Hoetjip | 11 Bonghwa 2-gil Suncheon-si Jeollanam-do | 1714-1 Jorye-dong Suncheon-si Jeollanam-do | 2021-01-21 13:23:55 |
7 | 8 | 4419 | ja | 和食 | Geobukseon Hoetjip | 全羅南道 スンチョン市 ポンファ2ギル11 | 全羅南道 スンチョン市 チョリェ洞1714-1 | 2021-01-21 13:23:55 |
8 | 9 | 4419 | zh-Hans | 日本料理 | Geobukseon Hoetjip | 全罗南道 顺天市 Bonghwa2街11 | 全罗南道 顺天市 照礼洞1714-1 | 2021-01-21 13:23:55 |
9 | 10 | 4751 | en | Korean cuisine | Hwawon Imone Sikdang | 34 Daejukseo-ro 15beon-gil Samhyang-eup Muan-gun Jeollanam-do | 2159 Namak-ri Samhyang-eup Muan-gun Jeollanam-do | 2021-01-21 13:23:55 |
 | 다국어식당정보ID | 식당ID | 언어타입 | 업종 | 식당명 | 도로명주소 | 지번주소 | 등록일시 |
---|---|---|---|---|---|---|---|---|
9320 | 9321 | 865300 | zh-Hans | 韩餐 | Seoneo Sikdang | 全罗南道 罗州市 Naju路168 | 全罗南道 罗州市 中央洞72-4 | 2021-01-21 13:33:44 |
9321 | 9322 | 865301 | en | restaurant | Okcheon Gwitturami | 19 Jeojeon 1-gil Suncheon-si Jeollanam-do | 121-6 Jeojeon-dong Suncheon-si Jeollanam-do | 2021-01-21 13:33:45 |
9322 | 9323 | 865301 | ja | 飲食店 | Okcheon Gwitturami | 全羅南道 スンチョン市 チョジョン1ギル19 | 全羅南道 スンチョン市 チョジョン洞121-6 | 2021-01-21 13:33:45 |
9323 | 9324 | 865301 | zh-Hans | 餐厅 | Okcheon Gwitturami | 全罗南道 顺天市 Jeojeon1街19 | 全罗南道 顺天市 楮田洞121-6 | 2021-01-21 13:33:45 |
9324 | 9325 | 865302 | en | Korean cuisine | Geurin Jjigae Bapsang | 6 Naedong-gil Naju-si Jeollanam-do | 1098-7 Songwol-dong Naju-si Jeollanam-do | 2021-01-21 13:33:45 |
9325 | 9326 | 865302 | ja | 韓食 | Geurin Jjigae Bapsang | 全羅南道 ナジュ市 ネドンギル6 | 全羅南道 ナジュ市 ソンウォル洞1098-7 | 2021-01-21 13:33:45 |
9326 | 9327 | 865302 | zh-Hans | 韩餐 | Geurin Jjigae Bapsang | 全罗南道 罗州市 Naedong街6 | 全罗南道 罗州市 松月洞1098-7 | 2021-01-21 13:33:45 |
9327 | 9328 | 865303 | en | Rice Soup | Ompanggol Kongnamul Gukbap | 101 Honam-gil Suncheon-si Jeollanam-do | 142-6 Jeojeon-dong Suncheon-si Jeollanam-do | 2021-01-21 13:33:45 |
9328 | 9329 | 865303 | ja | 牛肉クッパ | Ompanggol Kongnamul Gukbap | 全羅南道 スンチョン市 ホナムギル101 | 全羅南道 スンチョン市 チョジョン洞142-6 | 2021-01-21 13:33:45 |
9329 | 9330 | 865303 | zh-Hans | 汤饭 | Ompanggol Kongnamul Gukbap | 全罗南道 顺天市 Honam街101 | 全罗南道 顺天市 楮田洞142-6 | 2021-01-21 13:33:45 |
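The sample rows show that the table is in long form: each 식당ID appears once per 언어타입 (en, ja, zh-Hans). The following is a minimal sketch of grouping such rows into one record per restaurant, using six rows copied from the first sample. The `by_restaurant` structure and its key names are illustrative assumptions, not part of the dataset.

```python
# (식당ID, 언어타입, 업종, 식당명) copied from the first sample rows.
rows = [
    (2858, "en", "Bakery", "Hoseongdang"),
    (2858, "ja", "ベーカリー", "Hoseongdang"),
    (2858, "zh-Hans", "面包店", "Hoseongdang"),
    (3820, "en", "Japanese (cuisine)", "Hwanggeum Dokki"),
    (3820, "ja", "和食", "Hwanggeum Dokki"),
    (3820, "zh-Hans", "日本料理", "Hwanggeum Dokki"),
]

# Collapse the long-form rows into one record per 식당ID,
# keeping the business type (업종) per language code.
by_restaurant = {}
for restaurant_id, lang, category, name in rows:
    record = by_restaurant.setdefault(restaurant_id, {"name": name, "category": {}})
    record["category"][lang] = category

for restaurant_id, record in by_restaurant.items():
    print(restaurant_id, record["name"], record["category"])
```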