Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells508
Missing cells (%)0.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Categorical1
Text2
DateTime1

Dataset

Description다국어 메뉴 정보(메뉴명, 언어정류, 메뉴태그정보 등 6개 항목)
Author전라남도
URLhttps://www.data.go.kr/data/15076626/fileData.do

Alerts

메뉴태그정보 has 499 (5.0%) missing valuesMissing
다국어메뉴정보ID has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:17:01.424229
Analysis finished2023-12-11 23:17:03.433823
Duration2.01 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

다국어메뉴정보ID
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44193.683
Minimum15
Maximum88508
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T08:17:03.518936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile4319.7
Q122318.75
median43973.5
Q366311.75
95-th percentile83581.95
Maximum88508
Range88493
Interquartile range (IQR)43993

Descriptive statistics

Standard deviation25400.454
Coefficient of variation (CV)0.57475306
Kurtosis-1.1925091
Mean44193.683
Median Absolute Deviation (MAD)22025
Skewness-0.010774887
Sum4.4193683 × 108
Variance6.4518309 × 108
MonotonicityNot monotonic
2023-12-12T08:17:03.694061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50293 1
 
< 0.1%
88179 1
 
< 0.1%
52166 1
 
< 0.1%
45622 1
 
< 0.1%
13673 1
 
< 0.1%
14611 1
 
< 0.1%
75861 1
 
< 0.1%
4498 1
 
< 0.1%
47660 1
 
< 0.1%
19702 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
15 1
< 0.1%
35 1
< 0.1%
43 1
< 0.1%
52 1
< 0.1%
58 1
< 0.1%
60 1
< 0.1%
61 1
< 0.1%
75 1
< 0.1%
84 1
< 0.1%
97 1
< 0.1%
ValueCountFrequency (%)
88508 1
< 0.1%
88506 1
< 0.1%
88502 1
< 0.1%
88491 1
< 0.1%
88488 1
< 0.1%
88462 1
< 0.1%
88450 1
< 0.1%
88449 1
< 0.1%
88444 1
< 0.1%
88443 1
< 0.1%

메뉴ID
Real number (ℝ)

Distinct8903
Distinct (%)89.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean151913.15
Minimum76
Maximum533242
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T08:17:03.842643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum76
5-th percentile4121
Q119182
median106361
Q3209116.25
95-th percentile360688
Maximum533242
Range533166
Interquartile range (IQR)189934.25

Descriptive statistics

Standard deviation128424.52
Coefficient of variation (CV)0.84538122
Kurtosis-1.0051416
Mean151913.15
Median Absolute Deviation (MAD)95901.5
Skewness0.51475968
Sum1.5191315 × 109
Variance1.6492858 × 1010
MonotonicityNot monotonic
2023-12-12T08:17:03.974587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
73075 3
 
< 0.1%
8541 3
 
< 0.1%
99856 3
 
< 0.1%
80573 3
 
< 0.1%
206884 3
 
< 0.1%
407505 3
 
< 0.1%
205850 3
 
< 0.1%
16324 3
 
< 0.1%
19733 3
 
< 0.1%
17956 3
 
< 0.1%
Other values (8893) 9970
99.7%
ValueCountFrequency (%)
76 1
< 0.1%
80 1
< 0.1%
86 1
< 0.1%
89 1
< 0.1%
98 1
< 0.1%
102 1
< 0.1%
103 1
< 0.1%
105 1
< 0.1%
183 1
< 0.1%
188 1
< 0.1%
ValueCountFrequency (%)
533242 1
< 0.1%
533234 1
< 0.1%
533233 1
< 0.1%
533205 1
< 0.1%
533193 1
< 0.1%
533180 1
< 0.1%
533177 2
< 0.1%
533167 1
< 0.1%
413761 1
< 0.1%
413757 1
< 0.1%

언어타입
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
ja
3365 
en
3319 
zh-Hans
3316 

Length

Max length7
Median length2
Mean length3.658
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowen
2nd rowja
3rd rowzh-Hans
4th rowja
5th rowen

Common Values

ValueCountFrequency (%)
ja 3365
33.7%
en 3319
33.2%
zh-Hans 3316
33.2%

Length

2023-12-12T08:17:04.097549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:17:04.186118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ja 3365
33.7%
en 3319
33.2%
zh-hans 3316
33.2%
Distinct5288
Distinct (%)52.9%
Missing9
Missing (%)0.1%
Memory size156.2 KiB
2023-12-12T08:17:04.435991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length42
Mean length15.604644
Min length1

Characters and Unicode

Total characters155906
Distinct characters77
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3907 ?
Unique (%)39.1%

Sample

1st rowTtalgi Yogurt
2nd rowMaekju
3rd rowMaechwisun
4th rowBangeo
5th rowOmeurice
ValueCountFrequency (%)
soju 289
 
1.7%
maekju 278
 
1.7%
eumnyosu 229
 
1.4%
gonggibap 181
 
1.1%
bokbunja 167
 
1.0%
naengmyeon 135
 
0.8%
hanu 125
 
0.7%
chuga 123
 
0.7%
bokkeumbap 123
 
0.7%
makgeolli 119
 
0.7%
Other values (4388) 14910
89.4%
2023-12-12T08:17:04.880223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 14442
 
9.3%
n 12074
 
7.7%
e 11040
 
7.1%
o 10421
 
6.7%
g 9612
 
6.2%
u 9221
 
5.9%
i 8585
 
5.5%
6689
 
4.3%
m 5102
 
3.3%
k 5026
 
3.2%
Other values (67) 63694
40.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 115877
74.3%
Uppercase Letter 19137
 
12.3%
Space Separator 6689
 
4.3%
Decimal Number 4258
 
2.7%
Open Punctuation 4254
 
2.7%
Close Punctuation 4225
 
2.7%
Other Punctuation 996
 
0.6%
Math Symbol 313
 
0.2%
Connector Punctuation 83
 
0.1%
Dash Punctuation 73
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 14442
12.5%
n 12074
10.4%
e 11040
9.5%
o 10421
 
9.0%
g 9612
 
8.3%
u 9221
 
8.0%
i 8585
 
7.4%
m 5102
 
4.4%
k 5026
 
4.3%
j 4118
 
3.6%
Other values (16) 26236
22.6%
Uppercase Letter
ValueCountFrequency (%)
S 2933
15.3%
M 2139
11.2%
G 1855
9.7%
J 1705
8.9%
B 1542
 
8.1%
H 1330
 
6.9%
C 1126
 
5.9%
D 985
 
5.1%
T 801
 
4.2%
L 686
 
3.6%
Other values (14) 4035
21.1%
Decimal Number
ValueCountFrequency (%)
1 1169
27.5%
0 1161
27.3%
2 795
18.7%
3 353
 
8.3%
5 319
 
7.5%
4 222
 
5.2%
8 112
 
2.6%
6 64
 
1.5%
7 38
 
0.9%
9 25
 
0.6%
Other Punctuation
ValueCountFrequency (%)
/ 859
86.2%
, 40
 
4.0%
. 38
 
3.8%
: 19
 
1.9%
& 14
 
1.4%
' 11
 
1.1%
% 8
 
0.8%
· 6
 
0.6%
* 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
+ 199
63.6%
~ 114
36.4%
Space Separator
ValueCountFrequency (%)
6689
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4254
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4225
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 83
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 73
100.0%
Other Letter
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 135014
86.6%
Common 20891
 
13.4%
Hangul 1
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 14442
 
10.7%
n 12074
 
8.9%
e 11040
 
8.2%
o 10421
 
7.7%
g 9612
 
7.1%
u 9221
 
6.8%
i 8585
 
6.4%
m 5102
 
3.8%
k 5026
 
3.7%
j 4118
 
3.1%
Other values (40) 45373
33.6%
Common
ValueCountFrequency (%)
6689
32.0%
( 4254
20.4%
) 4225
20.2%
1 1169
 
5.6%
0 1161
 
5.6%
/ 859
 
4.1%
2 795
 
3.8%
3 353
 
1.7%
5 319
 
1.5%
4 222
 
1.1%
Other values (16) 845
 
4.0%
Hangul
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 155899
> 99.9%
None 6
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 14442
 
9.3%
n 12074
 
7.7%
e 11040
 
7.1%
o 10421
 
6.7%
g 9612
 
6.2%
u 9221
 
5.9%
i 8585
 
5.5%
6689
 
4.3%
m 5102
 
3.3%
k 5026
 
3.2%
Other values (65) 63687
40.9%
None
ValueCountFrequency (%)
· 6
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

메뉴태그정보
Text

MISSING 

Distinct4389
Distinct (%)46.2%
Missing499
Missing (%)5.0%
Memory size156.2 KiB
2023-12-12T08:17:05.059724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length177
Median length116
Mean length31.239659
Min length6

Characters and Unicode

Total characters296808
Distinct characters764
Distinct categories9 ?
Distinct scripts5 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3049 ?
Unique (%)32.1%

Sample

1st rowIngredient : Strawberry,Yogurt
2nd row主食材 : ビール / ソース : お酒
3rd row主料 : 青梅,酒
4th rowIngredient : Egg,SteamedRice,Vegetables / Cooking : Stir-fry
5th rowIngredient : Ribs / Cooking : Meat
ValueCountFrequency (%)
26251
42.2%
主食材 2965
 
4.8%
ingredient 2962
 
4.8%
主料 2932
 
4.7%
cooking 1700
 
2.7%
調理法 1658
 
2.7%
烹饪法 1647
 
2.6%
调味汁 905
 
1.5%
sauce 904
 
1.5%
ソース 890
 
1.4%
Other values (3729) 19360
31.1%
2023-12-12T08:17:05.385316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52673
 
17.7%
e 18501
 
6.2%
: 17876
 
6.0%
, 11001
 
3.7%
i 9900
 
3.3%
n 9448
 
3.2%
o 9131
 
3.1%
/ 8375
 
2.8%
r 7432
 
2.5%
t 7313
 
2.5%
Other values (754) 145158
48.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 107374
36.2%
Other Letter 77014
25.9%
Space Separator 52673
17.7%
Other Punctuation 37289
 
12.6%
Uppercase Letter 20181
 
6.8%
Modifier Letter 1641
 
0.6%
Dash Punctuation 510
 
0.2%
Close Punctuation 63
 
< 0.1%
Open Punctuation 63
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5897
 
7.7%
3305
 
4.3%
3089
 
4.0%
3041
 
3.9%
2973
 
3.9%
2283
 
3.0%
1821
 
2.4%
1701
 
2.2%
調 1658
 
2.2%
1653
 
2.1%
Other values (693) 49593
64.4%
Lowercase Letter
ValueCountFrequency (%)
e 18501
17.2%
i 9900
9.2%
n 9448
8.8%
o 9131
 
8.5%
r 7432
 
6.9%
t 7313
 
6.8%
a 6256
 
5.8%
d 6134
 
5.7%
g 6027
 
5.6%
s 3925
 
3.7%
Other values (16) 23307
21.7%
Uppercase Letter
ValueCountFrequency (%)
S 4578
22.7%
I 3082
15.3%
C 2658
13.2%
B 1474
 
7.3%
P 1332
 
6.6%
R 1121
 
5.6%
M 745
 
3.7%
V 711
 
3.5%
F 705
 
3.5%
E 589
 
2.9%
Other values (14) 3186
15.8%
Other Punctuation
ValueCountFrequency (%)
: 17876
47.9%
, 11001
29.5%
/ 8375
22.5%
' 37
 
0.1%
Modifier Letter
ValueCountFrequency (%)
1640
99.9%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
58
92.1%
( 5
 
7.9%
Space Separator
ValueCountFrequency (%)
52673
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 510
100.0%
Close Punctuation
ValueCountFrequency (%)
) 63
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 127555
43.0%
Common 92238
31.1%
Han 60875
20.5%
Katakana 10670
 
3.6%
Hiragana 5470
 
1.8%

Most frequent character per script

Han
ValueCountFrequency (%)
5897
 
9.7%
3305
 
5.4%
3089
 
5.1%
3041
 
5.0%
2973
 
4.9%
2283
 
3.8%
1821
 
3.0%
1701
 
2.8%
調 1658
 
2.7%
1653
 
2.7%
Other values (558) 33454
55.0%
Katakana
ValueCountFrequency (%)
1315
 
12.3%
1014
 
9.5%
595
 
5.6%
473
 
4.4%
450
 
4.2%
446
 
4.2%
374
 
3.5%
288
 
2.7%
262
 
2.5%
261
 
2.4%
Other values (67) 5192
48.7%
Hiragana
ValueCountFrequency (%)
720
13.2%
703
12.9%
480
 
8.8%
478
 
8.7%
394
 
7.2%
330
 
6.0%
225
 
4.1%
217
 
4.0%
157
 
2.9%
142
 
2.6%
Other values (49) 1624
29.7%
Latin
ValueCountFrequency (%)
e 18501
14.5%
i 9900
 
7.8%
n 9448
 
7.4%
o 9131
 
7.2%
r 7432
 
5.8%
t 7313
 
5.7%
a 6256
 
4.9%
d 6134
 
4.8%
g 6027
 
4.7%
S 4578
 
3.6%
Other values (40) 42835
33.6%
Common
ValueCountFrequency (%)
52673
57.1%
: 17876
 
19.4%
, 11001
 
11.9%
/ 8375
 
9.1%
1640
 
1.8%
- 510
 
0.6%
) 63
 
0.1%
58
 
0.1%
' 37
 
< 0.1%
( 5
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 218095
73.5%
CJK 60874
 
20.5%
Katakana 12310
 
4.1%
Hiragana 5470
 
1.8%
None 59
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
52673
24.2%
e 18501
 
8.5%
: 17876
 
8.2%
, 11001
 
5.0%
i 9900
 
4.5%
n 9448
 
4.3%
o 9131
 
4.2%
/ 8375
 
3.8%
r 7432
 
3.4%
t 7313
 
3.4%
Other values (48) 66445
30.5%
CJK
ValueCountFrequency (%)
5897
 
9.7%
3305
 
5.4%
3089
 
5.1%
3041
 
5.0%
2973
 
4.9%
2283
 
3.8%
1821
 
3.0%
1701
 
2.8%
調 1658
 
2.7%
1653
 
2.7%
Other values (557) 33453
55.0%
Katakana
ValueCountFrequency (%)
1640
 
13.3%
1315
 
10.7%
1014
 
8.2%
595
 
4.8%
473
 
3.8%
450
 
3.7%
446
 
3.6%
374
 
3.0%
288
 
2.3%
262
 
2.1%
Other values (68) 5453
44.3%
Hiragana
ValueCountFrequency (%)
720
13.2%
703
12.9%
480
 
8.8%
478
 
8.7%
394
 
7.2%
330
 
6.0%
225
 
4.1%
217
 
4.0%
157
 
2.9%
142
 
2.6%
Other values (49) 1624
29.7%
None
ValueCountFrequency (%)
58
98.3%
1
 
1.7%
Distinct214
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2021-01-21 13:23:55
Maximum2021-01-21 13:27:46
2023-12-12T08:17:05.520511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:17:05.659006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T08:17:02.835291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:17:02.631355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:17:02.944961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T08:17:02.724945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:17:05.737891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
다국어메뉴정보ID메뉴ID언어타입
다국어메뉴정보ID1.0000.4050.000
메뉴ID0.4051.0000.000
언어타입0.0000.0001.000
2023-12-12T08:17:05.826141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
다국어메뉴정보ID메뉴ID언어타입
다국어메뉴정보ID1.0000.3420.000
메뉴ID0.3421.0000.000
언어타입0.0000.0001.000

Missing values

2023-12-12T08:17:03.118623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:17:03.254612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T08:17:03.370764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

다국어메뉴정보ID메뉴ID언어타입메뉴명메뉴태그정보등록일시
5029250293199584enTtalgi YogurtIngredient : Strawberry,Yogurt2021-01-21 13:26:06
885018850273717jaMaekju主食材 : ビール / ソース : お酒2021-01-21 13:27:46
373673736872417zh-HansMaechwisun主料 : 青梅,酒2021-01-21 13:25:36
8034180342349058jaBangeo<NA>2021-01-21 13:27:27
6081960820407988enOmeuriceIngredient : Egg,SteamedRice,Vegetables / Cooking : Stir-fry2021-01-21 13:26:34
7300873009107360enOdolgalbi(150g)Ingredient : Ribs / Cooking : Meat2021-01-21 13:27:08
7670476705209805enOmeuriceIngredient : Egg,SteamedRice,Vegetables / Cooking : Stir-fry2021-01-21 13:27:18
8486084861344183zh-HansBokbunja主料 : 覆盆子 / 调味汁 : 酒2021-01-21 13:27:38
2768527686360565jaMaeuntang(L)主食材 : 野菜,魚 / 調理法 : 湯 / ソース : コチュジャン2021-01-21 13:25:16
6618366184209143enMaekjuIngredient : Beer / Sauce : Liquor2021-01-21 13:26:44
다국어메뉴정보ID메뉴ID언어타입메뉴명메뉴태그정보등록일시
6390639181915enDeungsim(1inbun/150g)Etc : Sirloin2021-01-21 13:24:11
584415844283164jaModeum Hoe+ Hamo Shabushabu Set(Doldom/S)主食材 : 刺身,盛り合わせ / 調理法 : しゃぶしゃぶ / ソース : 出汁 / そのた : セット2021-01-21 13:26:29
200542005577950zh-HansGomtang主料 : 牛肉 / 烹饪法 : 汤2021-01-21 13:24:55
584885848983133enUreok(S)Ingredient : Rockfish2021-01-21 13:26:29
2664426645152318jaModeum Jeopsisuyuk(L)主食材 : 盛り合わせ / 調理法 : お肉,ゆで肉,茹で2021-01-21 13:25:13
204112041276319zh-HansHanbangju(1jan)主料 : 鸡肉,药膳,人参 / 烹饪法 : 汤2021-01-21 13:24:56
179971799821521enMaekjuIngredient : Beer / Sauce : Liquor2021-01-21 13:24:50
6891568916357909zh-HansSandeuljoeun Jungsik(Handon/Sunhanmat/2in isang)主料 : 套餐2021-01-21 13:26:59
61286129205606zh-HansEumnyosu主料 : 饮料2021-01-21 13:24:10
3848838489340144jaKkanpung Pyogo Saeu Yo-Ri(M)主食材 : 海老,海鮮,シイタケ / ソース : カンプンソース2021-01-21 13:25:38