Overview

Dataset statistics

Number of variables4
Number of observations691
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory21.7 KiB
Average record size in memory32.2 B

Variable types

Text4

Dataset

Description해당 데이터는 재료(면, 떡, 채소류, 젓갈 등) 및 조리법(찜, 구이, 조림, 등) 별로 분류한 약 700여가지의 한국음식에 대한 정보를 영문으로 제공합니다.
Author한국국제교류재단
URLhttps://www.data.go.kr/data/15044203/fileData.do

Alerts

음식명 has unique valuesUnique
영문 has unique valuesUnique
음식설명 has unique valuesUnique

Reproduction

Analysis started2024-03-14 16:11:14.064319
Analysis finished2024-03-14 16:11:15.066567
Duration1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Text

Distinct57
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2024-03-15T01:11:15.728776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length2.0448625
Min length1

Characters and Unicode

Total characters1413
Distinct characters78
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)1.4%

Sample

1st row
2nd row
3rd row
4th row구이
5th row젓갈
ValueCountFrequency (%)
국,탕 71
 
10.3%
구이 41
 
5.9%
전,적 34
 
4.9%
31
 
4.5%
30
 
4.3%
30
 
4.3%
김치 30
 
4.3%
29
 
4.2%
27
 
3.9%
한과 25
 
3.6%
Other values (46) 343
49.6%
2024-03-15T01:11:16.960729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 109
 
7.7%
90
 
6.4%
74
 
5.2%
72
 
5.1%
52
 
3.7%
45
 
3.2%
41
 
2.9%
41
 
2.9%
39
 
2.8%
36
 
2.5%
Other values (68) 814
57.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1301
92.1%
Other Punctuation 111
 
7.9%
Space Separator 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
90
 
6.9%
74
 
5.7%
72
 
5.5%
52
 
4.0%
45
 
3.5%
41
 
3.2%
41
 
3.2%
39
 
3.0%
36
 
2.8%
33
 
2.5%
Other values (65) 778
59.8%
Other Punctuation
ValueCountFrequency (%)
, 109
98.2%
/ 2
 
1.8%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1301
92.1%
Common 112
 
7.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
90
 
6.9%
74
 
5.7%
72
 
5.5%
52
 
4.0%
45
 
3.5%
41
 
3.2%
41
 
3.2%
39
 
3.0%
36
 
2.8%
33
 
2.5%
Other values (65) 778
59.8%
Common
ValueCountFrequency (%)
, 109
97.3%
/ 2
 
1.8%
1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1301
92.1%
ASCII 112
 
7.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 109
97.3%
/ 2
 
1.8%
1
 
0.9%
Hangul
ValueCountFrequency (%)
90
 
6.9%
74
 
5.7%
72
 
5.5%
52
 
4.0%
45
 
3.5%
41
 
3.2%
41
 
3.2%
39
 
3.0%
36
 
2.8%
33
 
2.5%
Other values (65) 778
59.8%

음식명
Text

UNIQUE 

Distinct691
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2024-03-15T01:11:18.060675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length3.642547
Min length1

Characters and Unicode

Total characters2517
Distinct characters330
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique691 ?
Unique (%)100.0%

Sample

1st row가락국수
2nd row가래떡
3rd row가오리찜
4th row가자미구이
5th row가자미식해
ValueCountFrequency (%)
16
 
2.2%
가락국수 1
 
0.1%
오이소박이(김치 1
 
0.1%
오징어채무침 1
 
0.1%
오분자기 1
 
0.1%
오분자기젓 1
 
0.1%
오분자기찜 1
 
0.1%
오이냉국 1
 
0.1%
오이무침 1
 
0.1%
오이볶음 1
 
0.1%
Other values (698) 698
96.5%
2024-03-15T01:11:19.380090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
69
 
2.7%
66
 
2.6%
50
 
2.0%
50
 
2.0%
46
 
1.8%
45
 
1.8%
43
 
1.7%
40
 
1.6%
39
 
1.5%
37
 
1.5%
Other values (320) 2032
80.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2418
96.1%
Space Separator 32
 
1.3%
Other Punctuation 23
 
0.9%
Open Punctuation 22
 
0.9%
Close Punctuation 22
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
69
 
2.9%
66
 
2.7%
50
 
2.1%
50
 
2.1%
46
 
1.9%
45
 
1.9%
43
 
1.8%
40
 
1.7%
39
 
1.6%
37
 
1.5%
Other values (316) 1933
79.9%
Space Separator
ValueCountFrequency (%)
32
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2418
96.1%
Common 99
 
3.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
69
 
2.9%
66
 
2.7%
50
 
2.1%
50
 
2.1%
46
 
1.9%
45
 
1.9%
43
 
1.8%
40
 
1.7%
39
 
1.6%
37
 
1.5%
Other values (316) 1933
79.9%
Common
ValueCountFrequency (%)
32
32.3%
/ 23
23.2%
( 22
22.2%
) 22
22.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2418
96.1%
ASCII 99
 
3.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
69
 
2.9%
66
 
2.7%
50
 
2.1%
50
 
2.1%
46
 
1.9%
45
 
1.9%
43
 
1.8%
40
 
1.7%
39
 
1.6%
37
 
1.5%
Other values (316) 1933
79.9%
ASCII
ValueCountFrequency (%)
32
32.3%
/ 23
23.2%
( 22
22.2%
) 22
22.2%

영문
Text

UNIQUE 

Distinct691
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2024-03-15T01:11:21.010047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length41
Mean length19.836469
Min length4

Characters and Unicode

Total characters13707
Distinct characters61
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique691 ?
Unique (%)100.0%

Sample

1st rowThick Noodles in Clear Broth
2nd rowCylindrical Rice Pasta
3rd rowSteamed Stingray
4th rowGrilled Sole
5th rowSpicy Fermented Sole
ValueCountFrequency (%)
rice 86
 
4.0%
soup 81
 
3.8%
with 65
 
3.0%
beef 56
 
2.6%
and 50
 
2.3%
kimchi 43
 
2.0%
grilled 35
 
1.6%
pan-fried 28
 
1.3%
braised 27
 
1.3%
dried 26
 
1.2%
Other values (406) 1650
76.9%
2024-03-15T01:11:23.080235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1675
 
12.2%
e 1643
 
12.0%
i 931
 
6.8%
a 854
 
6.2%
o 649
 
4.7%
r 589
 
4.3%
d 580
 
4.2%
l 547
 
4.0%
t 531
 
3.9%
n 488
 
3.6%
Other values (51) 5220
38.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 9911
72.3%
Uppercase Letter 1979
 
14.4%
Space Separator 1675
 
12.2%
Dash Punctuation 59
 
0.4%
Open Punctuation 30
 
0.2%
Close Punctuation 30
 
0.2%
Decimal Number 15
 
0.1%
Other Punctuation 8
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 1643
16.6%
i 931
 
9.4%
a 854
 
8.6%
o 649
 
6.5%
r 589
 
5.9%
d 580
 
5.9%
l 547
 
5.5%
t 531
 
5.4%
n 488
 
4.9%
s 473
 
4.8%
Other values (16) 2626
26.5%
Uppercase Letter
ValueCountFrequency (%)
S 444
22.4%
P 233
11.8%
B 199
10.1%
C 194
9.8%
R 159
 
8.0%
G 93
 
4.7%
D 87
 
4.4%
M 84
 
4.2%
F 58
 
2.9%
N 58
 
2.9%
Other values (14) 370
18.7%
Other Punctuation
ValueCountFrequency (%)
' 3
37.5%
. 2
25.0%
, 2
25.0%
/ 1
 
12.5%
Decimal Number
ValueCountFrequency (%)
1 7
46.7%
2 7
46.7%
3 1
 
6.7%
Space Separator
ValueCountFrequency (%)
1675
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 59
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 11890
86.7%
Common 1817
 
13.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 1643
 
13.8%
i 931
 
7.8%
a 854
 
7.2%
o 649
 
5.5%
r 589
 
5.0%
d 580
 
4.9%
l 547
 
4.6%
t 531
 
4.5%
n 488
 
4.1%
s 473
 
4.0%
Other values (40) 4605
38.7%
Common
ValueCountFrequency (%)
1675
92.2%
- 59
 
3.2%
( 30
 
1.7%
) 30
 
1.7%
1 7
 
0.4%
2 7
 
0.4%
' 3
 
0.2%
. 2
 
0.1%
, 2
 
0.1%
/ 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13707
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1675
 
12.2%
e 1643
 
12.0%
i 931
 
6.8%
a 854
 
6.2%
o 649
 
4.7%
r 589
 
4.3%
d 580
 
4.2%
l 547
 
4.0%
t 531
 
3.9%
n 488
 
3.6%
Other values (51) 5220
38.1%

음식설명
Text

UNIQUE 

Distinct691
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2024-03-15T01:11:24.471978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length556
Median length215
Mean length132.96816
Min length24

Characters and Unicode

Total characters91881
Distinct characters71
Distinct categories11 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique691 ?
Unique (%)100.0%

Sample

1st rowNoodles in a clear broth, topped with vegetables such as mushrooms and green onions.
2nd rowCylindrical rice pasta sliced diagonally into ovals. Cooked in a beef broth to make a traditional soup for New Year's Day.
3rd rowMarinated stingray steamed and served with thinly sliced mushrooms, vegetables, and julienned egg garnish.
4th rowSole seasoned with salt then grilled, often topped with a spicy soy sauce.
5th rowA spicy dish of sole mixed with cooked millet and white radish, and seasoned with red chili pepper powder and garlic. Famous in the Hamgyeong-do region of North Korea.
ValueCountFrequency (%)
and 904
 
5.7%
a 671
 
4.3%
of 501
 
3.2%
with 494
 
3.1%
in 440
 
2.8%
the 389
 
2.5%
made 227
 
1.4%
is 226
 
1.4%
rice 212
 
1.3%
sauce 204
 
1.3%
Other values (1394) 11462
72.9%
2024-03-15T01:11:26.357927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15047
16.4%
e 9884
 
10.8%
a 6239
 
6.8%
s 5736
 
6.2%
i 5709
 
6.2%
o 4794
 
5.2%
n 4679
 
5.1%
d 4557
 
5.0%
r 4494
 
4.9%
t 4454
 
4.8%
Other values (61) 26288
28.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 72868
79.3%
Space Separator 15047
 
16.4%
Other Punctuation 2208
 
2.4%
Uppercase Letter 1258
 
1.4%
Open Punctuation 164
 
0.2%
Close Punctuation 159
 
0.2%
Dash Punctuation 155
 
0.2%
Decimal Number 15
 
< 0.1%
Format 5
 
< 0.1%
Control 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 9884
13.6%
a 6239
 
8.6%
s 5736
 
7.9%
i 5709
 
7.8%
o 4794
 
6.6%
n 4679
 
6.4%
d 4557
 
6.3%
r 4494
 
6.2%
t 4454
 
6.1%
h 3120
 
4.3%
Other values (16) 19202
26.4%
Uppercase Letter
ValueCountFrequency (%)
A 309
24.6%
T 226
18.0%
S 141
11.2%
K 114
 
9.1%
C 77
 
6.1%
B 54
 
4.3%
P 45
 
3.6%
R 37
 
2.9%
D 34
 
2.7%
F 28
 
2.2%
Other values (13) 193
15.3%
Other Punctuation
ValueCountFrequency (%)
, 1081
49.0%
. 1072
48.6%
/ 18
 
0.8%
' 16
 
0.7%
" 6
 
0.3%
: 6
 
0.3%
; 6
 
0.3%
% 3
 
0.1%
Decimal Number
ValueCountFrequency (%)
0 4
26.7%
5 3
20.0%
7 2
13.3%
1 2
13.3%
2 2
13.3%
4 1
 
6.7%
6 1
 
6.7%
Space Separator
ValueCountFrequency (%)
15047
100.0%
Open Punctuation
ValueCountFrequency (%)
( 164
100.0%
Close Punctuation
ValueCountFrequency (%)
) 159
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 155
100.0%
Format
ValueCountFrequency (%)
­ 5
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 74126
80.7%
Common 17755
 
19.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 9884
13.3%
a 6239
 
8.4%
s 5736
 
7.7%
i 5709
 
7.7%
o 4794
 
6.5%
n 4679
 
6.3%
d 4557
 
6.1%
r 4494
 
6.1%
t 4454
 
6.0%
h 3120
 
4.2%
Other values (39) 20460
27.6%
Common
ValueCountFrequency (%)
15047
84.7%
, 1081
 
6.1%
. 1072
 
6.0%
( 164
 
0.9%
) 159
 
0.9%
- 155
 
0.9%
/ 18
 
0.1%
' 16
 
0.1%
" 6
 
< 0.1%
: 6
 
< 0.1%
Other values (12) 31
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 91876
> 99.9%
None 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15047
16.4%
e 9884
 
10.8%
a 6239
 
6.8%
s 5736
 
6.2%
i 5709
 
6.2%
o 4794
 
5.2%
n 4679
 
5.1%
d 4557
 
5.0%
r 4494
 
4.9%
t 4454
 
4.8%
Other values (60) 26283
28.6%
None
ValueCountFrequency (%)
­ 5
100.0%

Missing values

2024-03-15T01:11:14.663712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T01:11:14.955861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분음식명영문음식설명
0가락국수Thick Noodles in Clear BrothNoodles in a clear broth, topped with vegetables such as mushrooms and green onions.
1가래떡Cylindrical Rice PastaCylindrical rice pasta sliced diagonally into ovals. Cooked in a beef broth to make a traditional soup for New Year's Day.
2가오리찜Steamed StingrayMarinated stingray steamed and served with thinly sliced mushrooms, vegetables, and julienned egg garnish.
3구이가자미구이Grilled SoleSole seasoned with salt then grilled, often topped with a spicy soy sauce.
4젓갈가자미식해Spicy Fermented SoleA spicy dish of sole mixed with cooked millet and white radish, and seasoned with red chili pepper powder and garlic. Famous in the Hamgyeong-do region of North Korea.
5조림가자미조림Braised SoleSole seasoned with soy sauce, sugar, green onions, garlic and red chili pepper powder, then slowly simmered over low heat to reduce the liquid.
6나물가지나물Eggplant NamulEggplant steamed then marinated in soy sauce, green onions, garlic, red chili pepper powder, sesame oil and vinegar.
7가지선Stuffed EggplantEggplants cut lengthwise, filled with seasonings, and simmered in broth.
8전,적가지전Pan-fried EggplantEggplant slices coated with flour, dipped in egg, and pan-fried.
9가지찜Stuffed Eggplant with BeefEggplant cut lengthwise, filled with seasoned minced beef, and cooked in boiling water.
구분음식명영문음식설명
681전,적화양적Beef and Vegetable Brochette (3)Brochettes of beef, bellflower roots (doraji), carrots, shiitake mushrooms, cucumbers and other ingredients cut into strips, sauteed in soy sauce and sesame oil, then put onto skewers and pan-fried.
682화전Pan-fried Sweet Rice Cake with Flower PetalsRound, fiat rice cakes made of glutinous rice flour, decorated with minced jujubes (Korean dates) or petals of colorful flowers such as azalea, rose, chrysanthemum, and pan-fried. The fried cakes are sprinkled with sugar while still warm.
683음청류화채Korean Fruit PunchA sweet drink made of various fruits in omija (fruit of the Chinese magnolia vine) punch sweetened with sugar or honey. Azalea petals are sometimes added for color.
684구이황태구이Seasoned and Grilled Dried PollackFreeze-dried pollack grilled with a seasoning of soy sauce, red chili pepper paste, green onions, garlic, sesame oil, sugar and black pepper.
685Raw SeafoodThe word raw fish (hoe) usually refers to raw fish dishes similar to Japanese sashimi. However, hoe includes other raw or lightly cooked dishes including other seafoods and meats. Yukhoe refers to a raw beef (see Korean beef steak tartare) and sukhoe are dishes that are only lightly cooked (see parboild squid).
686회냉면Spicy Chilled Noodles with Raw SkateHamheung-style buckwheat noodles mixed with a spicy sauce and topped with raw or fermented skate. Served cold.
687회덮밥Raw Fish and Vegetables over RiceSlices of raw fish and fresh vegetables served in a bowl over rice with vinegared red chili pepper paste. Mixed thoroughly before eating.
688흑임자죽Black Sesame and Rice PorridgeA porridge made of white rice and ground black sesame seeds. Popular as a high energy food.
689흰떡White Rice CakeRice cake made from white rice flour. The rice cakes are steamed then beaten or kneaded to give them texture. The cakes can be made into cylindrical rice pasta, garaetteok or into square jeolpyeon.
690흰죽Rice PorridgeA porridge made of white rice only. The rice is soaked in water, added to plenty of water (about 6-7 times of the amount of rice) then simmered over medium heat until thick.