Overview

Dataset statistics

Number of variables: 22
Number of observations: 10000
Missing cells: 40181
Missing cells (%): 18.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.8 MiB
Average record size in memory: 187.0 B
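
The missing-cell percentage follows from the counts above: missing cells divided by the total number of cells (variables × observations). A minimal Python check, using only the figures reported in this section (the variable names are illustrative):

```python
# Figures copied from the dataset statistics above.
n_variables = 22
n_observations = 10_000
missing_cells = 40_181

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{missing_pct:.1f}%")
```

Rounded to one decimal place, the result agrees with the reported 18.3%.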

Variable types

Numeric: 3
Text: 19

Dataset

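Description: Provides a file for user services containing the traditional food code, the original text of the source literature, its translation, the recipe, ingredient names, and ingredient quantities and units.
Author: Korea Food Research Institute (한국식품연구원)
URL: https://www.data.go.kr/data/15047800/fileData.do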

Alerts

출전문헌 해당페이지 has 6812 (68.1%) missing values [Missing]
분석정보 KTKRC has 8878 (88.8%) missing values [Missing]
전통식품명 (영문-음가) has 9166 (91.7%) missing values [Missing]
전통식품명 (영문-번역) has 9166 (91.7%) missing values [Missing]
조리법 (가공기술) has 1496 (15.0%) missing values [Missing]
조리기기 및 도구 has 4649 (46.5%) missing values [Missing]
전통 식품코드 has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 08:38:54.761218
Analysis finished: 2023-12-12 08:39:08.462362
Duration: 13.7 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

대표식품 코드
Real number (ℝ)

Distinct: 3355
Distinct (%): 33.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 102465.03
Minimum: 100003
Maximum: 105003
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 100003
5-th percentile: 100225
Q1: 101213.75
Median: 102496
Q3: 103707
95-th percentile: 104654.1
Maximum: 105003
Range: 5000
Interquartile range (IQR): 2493.25

Descriptive statistics

Standard deviation: 1423.2037
Coefficient of variation (CV): 0.013889652
Kurtosis: -1.2102762
Mean: 102465.03
Median Absolute Deviation (MAD): 1252.5
Skewness: -0.017895387
Sum: 1.0246503 × 10^9
Variance: 2025508.7
Monotonicity: Not monotonic
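
Several of the figures reported for 대표식품 코드 are derived from others and can be cross-checked. The sketch below assumes the usual definitions (range = maximum − minimum, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared) and uses only numbers listed in this report; the variable names are illustrative, and small differences are expected because the report rounds its output.

```python
# Figures as reported for 대표식품 코드.
minimum, maximum = 100003, 105003
q1, q3 = 101213.75, 103707
mean = 102465.03
std = 1423.2037

value_range = maximum - minimum   # reported: 5000
iqr = q3 - q1                     # reported: 2493.25
cv = std / mean                   # reported: 0.013889652
variance = std ** 2               # reported: 2025508.7

print(value_range, iqr, cv, variance)
```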
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
102896               139    1.4%
100861               54     0.5%
103817               47     0.5%
103101               41     0.4%
101315               40     0.4%
103488               40     0.4%
103097               35     0.4%
102190               34     0.3%
101423               33     0.3%
104613               32     0.3%
Other values (3345)  9505   95.0%

Value                Count  Frequency (%)
100003               1      < 0.1%
100004               3      < 0.1%
100005               4      < 0.1%
100009               1      < 0.1%
100010               9      0.1%
100016               1      < 0.1%
100018               1      < 0.1%
100019               1      < 0.1%
100020               2      < 0.1%
100021               1      < 0.1%

Value                Count  Frequency (%)
105003               1      < 0.1%
105002               1      < 0.1%
105001               1      < 0.1%
104926               1      < 0.1%
104924               4      < 0.1%
104922               11     0.1%
104921               1      < 0.1%
104920               1      < 0.1%
104919               1      < 0.1%
104918               1      < 0.1%
Distinct: 3355
Distinct (%): 33.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 11
Median length: 9
Mean length: 3.1802
Min length: 1

Characters and Unicode

Total characters: 31802
Distinct characters: 482
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
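
As a rough illustration of how such per-character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module over the five sample rows listed under Sample for this variable. The helper name `category_counts` is hypothetical, and this is not necessarily how the profiling tool builds its tables. `unicodedata.category` returns two-letter codes such as `Lo` (Other Letter), `Zs` (Space Separator), and `Cc` (Control).

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in values."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Sample rows copied from this variable's Sample section.
sample = ["소주", "사슴꼬리육포", "두부조림", "더덕구이", "주악"]
counts = category_counts(sample)
print(counts)
```

For these samples every character is a Hangul syllable, so only the `Lo` category appears, in line with the "Other Letter" share above 99.9% reported for this variable.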

Unique

Unique: 1856
Unique (%): 18.6%

Sample

1st row: 소주
2nd row: 사슴꼬리육포
3rd row: 두부조림
4th row: 더덕구이
5th row: 주악
Value                Count  Frequency (%)
(not recovered)      139    1.4%
다식                 54     0.5%
(not recovered)      47     0.5%
약밥                 41     0.4%
두텁떡               40     0.4%
완자탕               40     0.4%
약과                 35     0.3%
빙사과               34     0.3%
만두                 33     0.3%
팥죽                 32     0.3%
Other values (3346)  9506   95.1%

Most occurring characters

ValueCountFrequency (%)
958
 
3.0%
792
 
2.5%
604
 
1.9%
597
 
1.9%
592
 
1.9%
546
 
1.7%
532
 
1.7%
509
 
1.6%
491
 
1.5%
485
 
1.5%
Other values (472) 25696
80.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31794
> 99.9%
Control 7
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
958
 
3.0%
792
 
2.5%
604
 
1.9%
597
 
1.9%
592
 
1.9%
546
 
1.7%
532
 
1.7%
509
 
1.6%
491
 
1.5%
485
 
1.5%
Other values (470) 25688
80.8%
Control
ValueCountFrequency (%)
ž 7
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31794
> 99.9%
Common 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
958
 
3.0%
792
 
2.5%
604
 
1.9%
597
 
1.9%
592
 
1.9%
546
 
1.7%
532
 
1.7%
509
 
1.6%
491
 
1.5%
485
 
1.5%
Other values (470) 25688
80.8%
Common
ValueCountFrequency (%)
ž 7
87.5%
1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31794
> 99.9%
None 7
 
< 0.1%
ASCII 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
958
 
3.0%
792
 
2.5%
604
 
1.9%
597
 
1.9%
592
 
1.9%
546
 
1.7%
532
 
1.7%
509
 
1.6%
491
 
1.5%
485
 
1.5%
Other values (470) 25688
80.8%
None
ValueCountFrequency (%)
ž 7
100.0%
ASCII
ValueCountFrequency (%)
1
100.0%

전통 식품코드
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100630.55
Minimum: 10002
Maximum: 185003
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 10002
5-th percentile: 11163.9
Q1: 103590.75
Median: 107271.5
Q3: 122220.25
95-th percentile: 183428.1
Maximum: 185003
Range: 175001
Interquartile range (IQR): 18629.5

Descriptive statistics

Standard deviation: 51808.036
Coefficient of variation (CV): 0.51483407
Kurtosis: -0.3706821
Mean: 100630.55
Median Absolute Deviation (MAD): 14463
Skewness: -0.51967777
Sum: 1.0063055 × 10^9
Variance: 2.6840726 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count  Frequency (%)
10058                1      < 0.1%
120177               1      < 0.1%
106165               1      < 0.1%
103897               1      < 0.1%
105109               1      < 0.1%
105754               1      < 0.1%
125629               1      < 0.1%
127083               1      < 0.1%
12881                1      < 0.1%
109310               1      < 0.1%
Other values (9990)  9990   99.9%

Value                Count  Frequency (%)
10002                1      < 0.1%
10003                1      < 0.1%
10004                1      < 0.1%
10005                1      < 0.1%
10012                1      < 0.1%
10014                1      < 0.1%
10016                1      < 0.1%
10017                1      < 0.1%
10021                1      < 0.1%
10023                1      < 0.1%

Value                Count  Frequency (%)
185003               1      < 0.1%
185002               1      < 0.1%
185001               1      < 0.1%
184030               1      < 0.1%
184029               1      < 0.1%
184028               1      < 0.1%
184027               1      < 0.1%
184026               1      < 0.1%
184025               1      < 0.1%
184024               1      < 0.1%
Distinct: 117
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 13
Median length: 10
Mean length: 5.6772
Min length: 2

Characters and Unicode

Total characters: 56772
Distinct characters: 150
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 24
Unique (%): 0.2%

Sample

1st row: 연행록
2nd row: 오주연문장전산고
3rd row: 이조궁정요리통고
4th row: 시의전서
5th row: 반찬등속
Value                 Count  Frequency (%)
진연의궤              1078   8.5%
만드는                990    7.8%
(not recovered)       990    7.8%
임원십육지            718    5.7%
조선요리제법          619    4.9%
음식                  555    4.4%
우리나라              555    4.4%
조선무쌍신식요리제법  545    4.3%
산림경제              487    3.8%
조선음식              435    3.4%
Other values (110)    5689   44.9%

Most occurring characters

ValueCountFrequency (%)
3558
 
6.3%
3201
 
5.6%
2764
 
4.9%
2715
 
4.8%
2661
 
4.7%
2483
 
4.4%
2399
 
4.2%
1884
 
3.3%
1564
 
2.8%
1499
 
2.6%
Other values (140) 32044
56.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53935
95.0%
Space Separator 2661
 
4.7%
Dash Punctuation 176
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3558
 
6.6%
3201
 
5.9%
2764
 
5.1%
2715
 
5.0%
2483
 
4.6%
2399
 
4.4%
1884
 
3.5%
1564
 
2.9%
1499
 
2.8%
1217
 
2.3%
Other values (138) 30651
56.8%
Space Separator
ValueCountFrequency (%)
2661
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 176
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53935
95.0%
Common 2837
 
5.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3558
 
6.6%
3201
 
5.9%
2764
 
5.1%
2715
 
5.0%
2483
 
4.6%
2399
 
4.4%
1884
 
3.5%
1564
 
2.9%
1499
 
2.8%
1217
 
2.3%
Other values (138) 30651
56.8%
Common
ValueCountFrequency (%)
2661
93.8%
- 176
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53935
95.0%
ASCII 2837
 
5.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3558
 
6.6%
3201
 
5.9%
2764
 
5.1%
2715
 
5.0%
2483
 
4.6%
2399
 
4.4%
1884
 
3.5%
1564
 
2.9%
1499
 
2.8%
1217
 
2.3%
Other values (138) 30651
56.8%
ASCII
ValueCountFrequency (%)
2661
93.8%
- 176
 
6.2%
Distinct: 120
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 10
Median length: 9
Mean length: 5.5729
Min length: 2

Characters and Unicode

Total characters: 55729
Distinct characters: 262
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 4

Unique

Unique: 24
Unique (%): 0.2%

Sample

1st row: 燕行錄
2nd row: 五洲衍文長箋散稿
3rd row: 李朝宮庭料理通考
4th row: 是議全書
5th row: 반 등속(饌饍繕冊)
Value                 Count  Frequency (%)
進宴儀軌              1078   9.6%
林園十六志            718    6.4%
우리나라음식만드는법  555    4.9%
朝鮮無雙新式料理製法  545    4.9%
山林經濟              487    4.3%
조선음식              435    3.9%
만드는                435    3.9%
(not recovered)       435    3.9%
農政會要              429    3.8%
조선요리제법          392    3.5%
Other values (118)    5714   50.9%

Most occurring characters

ValueCountFrequency (%)
1712
 
3.1%
1705
 
3.1%
1530
 
2.7%
1499
 
2.7%
1476
 
2.6%
1435
 
2.6%
1331
 
2.4%
1304
 
2.3%
1223
 
2.2%
1205
 
2.2%
Other values (252) 41309
74.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 54256
97.4%
Space Separator 1223
 
2.2%
Dash Punctuation 176
 
0.3%
Open Punctuation 37
 
0.1%
Close Punctuation 37
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1712
 
3.2%
1705
 
3.1%
1530
 
2.8%
1499
 
2.8%
1476
 
2.7%
1435
 
2.6%
1331
 
2.5%
1304
 
2.4%
1205
 
2.2%
1187
 
2.2%
Other values (248) 39872
73.5%
Space Separator
ValueCountFrequency (%)
1223
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 176
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Han 37856
67.9%
Hangul 16400
29.4%
Common 1473
 
2.6%

Most frequent character per script

Han
ValueCountFrequency (%)
1712
 
4.5%
1705
 
4.5%
1331
 
3.5%
1304
 
3.4%
1205
 
3.2%
1187
 
3.1%
1166
 
3.1%
1156
 
3.1%
1078
 
2.8%
1078
 
2.8%
Other values (207) 24934
65.9%
Hangul
ValueCountFrequency (%)
1530
 
9.3%
1499
 
9.1%
1476
 
9.0%
1435
 
8.8%
1059
 
6.5%
1059
 
6.5%
1026
 
6.3%
1026
 
6.3%
1026
 
6.3%
852
 
5.2%
Other values (31) 4412
26.9%
Common
ValueCountFrequency (%)
1223
83.0%
- 176
 
11.9%
( 37
 
2.5%
) 37
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
CJK 37124
66.6%
Hangul 16400
29.4%
ASCII 1473
 
2.6%
CJK Compat Ideographs 732
 
1.3%

Most frequent character per block

CJK
ValueCountFrequency (%)
1712
 
4.6%
1705
 
4.6%
1331
 
3.6%
1304
 
3.5%
1205
 
3.2%
1187
 
3.2%
1166
 
3.1%
1156
 
3.1%
1078
 
2.9%
1078
 
2.9%
Other values (202) 24202
65.2%
Hangul
ValueCountFrequency (%)
1530
 
9.3%
1499
 
9.1%
1476
 
9.0%
1435
 
8.8%
1059
 
6.5%
1059
 
6.5%
1026
 
6.3%
1026
 
6.3%
1026
 
6.3%
852
 
5.2%
Other values (31) 4412
26.9%
ASCII
ValueCountFrequency (%)
1223
83.0%
- 176
 
11.9%
( 37
 
2.5%
) 37
 
2.5%
CJK Compat Ideographs
ValueCountFrequency (%)
525
71.7%
176
 
24.0%
24
 
3.3%
5
 
0.7%
2
 
0.3%
Distinct: 88
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 28
Median length: 8
Mean length: 7.4629
Min length: 2

Characters and Unicode

Total characters: 74629
Distinct characters: 264
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 4

Unique

Unique: 23
Unique (%): 0.2%

Sample

1st row: 최덕중(崔德中)
2nd row: 이규경(李圭景)
3rd row: 한희순(韓熙順), 황혜성(黃慧性), 이혜경(李惠卿)
4th row: 저자미상
5th row: 미상
Value               Count  Frequency (%)
저자미상            2492   21.9%
방신영(方信榮       1609   14.1%
서유구(徐有榘       894    7.9%
李用基              545    4.8%
이용기              545    4.8%
홍만선(洪萬選       487    4.3%
최한기(崔漢綺       429    3.8%
전순의(全循義       391    3.4%
조자호(趙慈鎬       360    3.2%
미상                352    3.1%
Other values (90)   3276   28.8%

Most occurring characters

ValueCountFrequency (%)
) 7502
 
10.1%
( 7500
 
10.0%
2852
 
3.8%
2851
 
3.8%
2844
 
3.8%
2492
 
3.3%
1742
 
2.3%
1619
 
2.2%
1611
 
2.2%
1610
 
2.2%
Other values (254) 42006
56.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 57320
76.8%
Close Punctuation 7502
 
10.1%
Open Punctuation 7500
 
10.0%
Space Separator 1742
 
2.3%
Other Punctuation 565
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2852
 
5.0%
2851
 
5.0%
2844
 
5.0%
2492
 
4.3%
1619
 
2.8%
1611
 
2.8%
1610
 
2.8%
1609
 
2.8%
1609
 
2.8%
1609
 
2.8%
Other values (250) 36614
63.9%
Close Punctuation
ValueCountFrequency (%)
) 7502
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7500
100.0%
Space Separator
ValueCountFrequency (%)
1742
100.0%
Other Punctuation
ValueCountFrequency (%)
, 565
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34428
46.1%
Han 22892
30.7%
Common 17309
23.2%

Most frequent character per script

Han
ValueCountFrequency (%)
1610
 
7.0%
1609
 
7.0%
1609
 
7.0%
1225
 
5.4%
935
 
4.1%
894
 
3.9%
894
 
3.9%
545
 
2.4%
545
 
2.4%
516
 
2.3%
Other values (143) 12510
54.6%
Hangul
ValueCountFrequency (%)
2852
 
8.3%
2851
 
8.3%
2844
 
8.3%
2492
 
7.2%
1619
 
4.7%
1611
 
4.7%
1609
 
4.7%
1438
 
4.2%
1417
 
4.1%
1084
 
3.1%
Other values (97) 14611
42.4%
Common
ValueCountFrequency (%)
) 7502
43.3%
( 7500
43.3%
1742
 
10.1%
, 565
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34428
46.1%
CJK 21858
29.3%
ASCII 17309
23.2%
CJK Compat Ideographs 1034
 
1.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 7502
43.3%
( 7500
43.3%
1742
 
10.1%
, 565
 
3.3%
Hangul
ValueCountFrequency (%)
2852
 
8.3%
2851
 
8.3%
2844
 
8.3%
2492
 
7.2%
1619
 
4.7%
1611
 
4.7%
1609
 
4.7%
1438
 
4.2%
1417
 
4.1%
1084
 
3.1%
Other values (97) 14611
42.4%
CJK
ValueCountFrequency (%)
1610
 
7.4%
1609
 
7.4%
1609
 
7.4%
1225
 
5.6%
894
 
4.1%
894
 
4.1%
545
 
2.5%
545
 
2.5%
516
 
2.4%
488
 
2.2%
Other values (138) 11923
54.5%
CJK Compat Ideographs
ValueCountFrequency (%)
935
90.4%
95
 
9.2%
2
 
0.2%
1
 
0.1%
1
 
0.1%
Distinct: 67
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 13
Median length: 12
Mean length: 5.2605
Min length: 4

Characters and Unicode

Total characters: 52605
Distinct characters: 35
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 3
Unique (%): < 0.1%

Sample

1st row: 17세기
2nd row: 19세기
3rd row: 1957년
4th row: 1800년대말
5th row: 1913년
Value               Count  Frequency (%)
1902년              1078   9.7%
1835년경            718    6.5%
1954년              555    5.0%
18세기              549    5.0%
1936                545    4.9%
1946                517    4.7%
19세기              516    4.7%
(not recovered)     434    3.9%
1830년경            429    3.9%
1934                392    3.5%
Other values (57)   5345   48.2%

Most occurring characters

ValueCountFrequency (%)
1 10504
20.0%
9 5578
10.6%
5409
10.3%
0 4852
9.2%
8 2962
 
5.6%
4 2708
 
5.1%
3 2530
 
4.8%
5 2390
 
4.5%
6 1992
 
3.8%
7 1640
 
3.1%
Other values (25) 12040
22.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 36618
69.6%
Other Letter 14257
 
27.1%
Space Separator 1086
 
2.1%
Other Punctuation 360
 
0.7%
Math Symbol 177
 
0.3%
Open Punctuation 49
 
0.1%
Close Punctuation 49
 
0.1%
Dash Punctuation 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5409
37.9%
1581
 
11.1%
1509
 
10.6%
1448
 
10.2%
1357
 
9.5%
547
 
3.8%
515
 
3.6%
317
 
2.2%
222
 
1.6%
199
 
1.4%
Other values (9) 1153
 
8.1%
Decimal Number
ValueCountFrequency (%)
1 10504
28.7%
9 5578
15.2%
0 4852
13.3%
8 2962
 
8.1%
4 2708
 
7.4%
3 2530
 
6.9%
5 2390
 
6.5%
6 1992
 
5.4%
7 1640
 
4.5%
2 1462
 
4.0%
Space Separator
ValueCountFrequency (%)
1086
100.0%
Other Punctuation
ValueCountFrequency (%)
. 360
100.0%
Math Symbol
ValueCountFrequency (%)
~ 177
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 38348
72.9%
Hangul 14257
 
27.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5409
37.9%
1581
 
11.1%
1509
 
10.6%
1448
 
10.2%
1357
 
9.5%
547
 
3.8%
515
 
3.6%
317
 
2.2%
222
 
1.6%
199
 
1.4%
Other values (9) 1153
 
8.1%
Common
ValueCountFrequency (%)
1 10504
27.4%
9 5578
14.5%
0 4852
12.7%
8 2962
 
7.7%
4 2708
 
7.1%
3 2530
 
6.6%
5 2390
 
6.2%
6 1992
 
5.2%
7 1640
 
4.3%
2 1462
 
3.8%
Other values (6) 1730
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 38348
72.9%
Hangul 14257
 
27.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 10504
27.4%
9 5578
14.5%
0 4852
12.7%
8 2962
 
7.7%
4 2708
 
7.1%
3 2530
 
6.6%
5 2390
 
6.2%
6 1992
 
5.2%
7 1640
 
4.3%
2 1462
 
3.8%
Other values (6) 1730
 
4.5%
Hangul
ValueCountFrequency (%)
5409
37.9%
1581
 
11.1%
1509
 
10.6%
1448
 
10.2%
1357
 
9.5%
547
 
3.8%
515
 
3.6%
317
 
2.2%
222
 
1.6%
199
 
1.4%
Other values (9) 1153
 
8.1%
Distinct: 703
Distinct (%): 22.1%
Missing: 6812
Missing (%): 68.1%
Memory size: 156.2 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.6386449
Min length: 1

Characters and Unicode

Total characters: 8412
Distinct characters: 20
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 4

Unique

Unique: 102
Unique (%): 3.2%

Sample

1st row: 117
2nd row: 三三九
3rd row: 三四
4th row: 二五○
5th row: 七六
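
The sample rows mix ASCII digits with Han numerals written positionally (for example 三三九 for 339, with a circle standing for zero). If these values need to be compared or sorted as numbers, one possible normalisation is sketched below. It assumes purely positional notation (no 十 or 百 multipliers), which is what the samples show but is not confirmed for the whole column; `page_to_int` and `HAN_DIGITS` are hypothetical names.

```python
import unicodedata

# Han digits one through nine, used positionally in the samples.
HAN_DIGITS = {ch: value for value, ch in enumerate("一二三四五六七八九", start=1)}
# Zero appears as a circle; both U+25CB and U+3007 are accepted here.
HAN_DIGITS["\u25cb"] = 0
HAN_DIGITS["\u3007"] = 0


def page_to_int(text):
    """Return the page number as an int, or None if it cannot be parsed."""
    # NFKC folds CJK compatibility ideographs into their unified forms.
    normalized = unicodedata.normalize("NFKC", text.strip())
    digits = []
    for ch in normalized:
        if "0" <= ch <= "9":
            digits.append(ch)
        elif ch in HAN_DIGITS:
            digits.append(str(HAN_DIGITS[ch]))
        else:
            return None  # unexpected character: leave the value unparsed
    return int("".join(digits)) if digits else None


samples = ["117", "三三九", "三四", "二五\u25cb", "七六"]
pages = [page_to_int(s) for s in samples]
print(pages)
```

Returning `None` for anything unexpected keeps unparseable cells visible instead of silently guessing a value.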
Value               Count  Frequency (%)
133                 15     0.5%
205                 13     0.4%
134                 12     0.4%
128                 12     0.4%
172                 12     0.4%
126                 11     0.3%
一○二              11     0.3%
148                 11     0.3%
169                 11     0.3%
120                 11     0.3%
Other values (693)  3069   96.3%

Most occurring characters

ValueCountFrequency (%)
1 1046
 
12.4%
832
 
9.9%
2 682
 
8.1%
681
 
8.1%
438
 
5.2%
386
 
4.6%
4 342
 
4.1%
3 338
 
4.0%
5 330
 
3.9%
329
 
3.9%
Other values (10) 3008
35.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4243
50.4%
Other Letter 3897
46.3%
Other Symbol 272
 
3.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1046
24.7%
2 682
16.1%
4 342
 
8.1%
3 338
 
8.0%
5 330
 
7.8%
8 319
 
7.5%
7 315
 
7.4%
0 301
 
7.1%
6 291
 
6.9%
9 279
 
6.6%
Other Letter
ValueCountFrequency (%)
832
21.3%
681
17.5%
438
11.2%
386
9.9%
329
 
8.4%
326
 
8.4%
318
 
8.2%
314
 
8.1%
273
 
7.0%
Other Symbol
ValueCountFrequency (%)
272
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4515
53.7%
Han 3897
46.3%

Most frequent character per script

Common
ValueCountFrequency (%)
1 1046
23.2%
2 682
15.1%
4 342
 
7.6%
3 338
 
7.5%
5 330
 
7.3%
8 319
 
7.1%
7 315
 
7.0%
0 301
 
6.7%
6 291
 
6.4%
9 279
 
6.2%
Han
ValueCountFrequency (%)
832
21.3%
681
17.5%
438
11.2%
386
9.9%
329
 
8.4%
326
 
8.4%
318
 
8.2%
314
 
8.1%
273
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4243
50.4%
CJK 3571
42.5%
CJK Compat Ideographs 326
 
3.9%
Geometric Shapes 272
 
3.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1046
24.7%
2 682
16.1%
4 342
 
8.1%
3 338
 
8.0%
5 330
 
7.8%
8 319
 
7.5%
7 315
 
7.4%
0 301
 
7.1%
6 291
 
6.9%
9 279
 
6.6%
CJK
ValueCountFrequency (%)
832
23.3%
681
19.1%
438
12.3%
386
10.8%
329
 
9.2%
318
 
8.9%
314
 
8.8%
273
 
7.6%
CJK Compat Ideographs
ValueCountFrequency (%)
326
100.0%
Geometric Shapes
ValueCountFrequency (%)
272
100.0%
Distinct: 545
Distinct (%): 5.5%
Missing: 4
Missing (%): < 0.1%
Memory size: 156.2 KiB

Length

Max length: 46
Median length: 45
Mean length: 18.787715
Min length: 8

Characters and Unicode

Total characters: 187802
Distinct characters: 26
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 192
Unique (%): 1.9%

Sample

1st row: C12G 3/12
2nd row: A23L 1/318
3rd row: A23L 1/39, A23L 1/212
4th row: A23L 1/214, A23L 1/01
5th row: A23L 7/10, A23P 1/08
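
The sample rows suggest that each cell holds one or more classification codes separated by commas, each made of a four-character subclass and a group (for example `A23L 1/39`). Assuming that layout, a multi-valued cell can be split and tallied as sketched below; `split_codes` is a hypothetical helper, and the rows are the five samples above.

```python
from collections import Counter


def split_codes(cell):
    """Split a comma-separated cell into individual, trimmed codes."""
    return [code.strip() for code in cell.split(",") if code.strip()]


# Sample rows copied from this variable's Sample section.
rows = [
    "C12G 3/12",
    "A23L 1/318",
    "A23L 1/39, A23L 1/212",
    "A23L 1/214, A23L 1/01",
    "A23L 7/10, A23P 1/08",
]

codes = [code for row in rows for code in split_codes(row)]
# The subclass is the part before the first space, e.g. "A23L".
subclass_counts = Counter(code.split()[0] for code in codes)
print(subclass_counts.most_common())
```

Counting at the subclass level mirrors the token frequencies the report lists for this variable, where subclasses and groups appear as separate entries.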
Value               Count  Frequency (%)
a23l                13674  37.4%
1/10                2583   7.1%
c12g                1988   5.4%
1/39                1480   4.0%
1/212               1325   3.6%
3/00                1306   3.6%
3/02                1071   2.9%
1/325               826    2.3%
a23g                730    2.0%
a23p                685    1.9%
Other values (170)  10903  29.8%

Most occurring characters

ValueCountFrequency (%)
26575
14.2%
2 26330
14.0%
3 25584
13.6%
1 23962
12.8%
/ 18340
9.8%
A 16049
8.5%
L 13777
7.3%
0 11937
6.4%
, 8343
 
4.4%
G 2726
 
1.5%
Other values (16) 14179
7.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 97864
52.1%
Uppercase Letter 36679
 
19.5%
Other Punctuation 26684
 
14.2%
Space Separator 26575
 
14.2%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 16049
43.8%
L 13777
37.6%
G 2726
 
7.4%
C 2383
 
6.5%
P 686
 
1.9%
B 532
 
1.5%
J 172
 
0.5%
F 105
 
0.3%
K 97
 
0.3%
D 56
 
0.2%
Other values (2) 96
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 26330
26.9%
3 25584
26.1%
1 23962
24.5%
0 11937
12.2%
8 2475
 
2.5%
9 1951
 
2.0%
5 1938
 
2.0%
4 1595
 
1.6%
7 1310
 
1.3%
6 782
 
0.8%
Other Punctuation
ValueCountFrequency (%)
/ 18340
68.7%
, 8343
31.3%
. 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
26575
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 151123
80.5%
Latin 36679
 
19.5%

Most frequent character per script

Common
ValueCountFrequency (%)
26575
17.6%
2 26330
17.4%
3 25584
16.9%
1 23962
15.9%
/ 18340
12.1%
0 11937
7.9%
, 8343
 
5.5%
8 2475
 
1.6%
9 1951
 
1.3%
5 1938
 
1.3%
Other values (4) 3688
 
2.4%
Latin
ValueCountFrequency (%)
A 16049
43.8%
L 13777
37.6%
G 2726
 
7.4%
C 2383
 
6.5%
P 686
 
1.9%
B 532
 
1.5%
J 172
 
0.5%
F 105
 
0.3%
K 97
 
0.3%
D 56
 
0.2%
Other values (2) 96
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 187802
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
26575
14.2%
2 26330
14.0%
3 25584
13.6%
1 23962
12.8%
/ 18340
9.8%
A 16049
8.5%
L 13777
7.3%
0 11937
6.4%
, 8343
 
4.4%
G 2726
 
1.5%
Other values (16) 14179
7.5%

분석정보 KTKRC
Text

MISSING 

Distinct: 80
Distinct (%): 7.1%
Missing: 8878
Missing (%): 88.8%
Memory size: 156.2 KiB

Length

Max length: 38
Median length: 18
Mean length: 14.052585
Min length: 7

Characters and Unicode

Total characters: 15767
Distinct characters: 17
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 37
Unique (%): 3.3%

Sample

1st row: A21C 03/, A21B 05/
2nd row: A21C 03/
3rd row: A24K 11/, A21C 07/
4th row: A21C 07/, A24K 11/
5th row: A21C 01/, A24K 11/
Value             Count  Frequency (%)
a21c              1076   29.9%
03                518    14.4%
a21b              405    11.2%
07                363    10.1%
11                286    7.9%
a24k              267    7.4%
01                205    5.7%
09                192    5.3%
05                192    5.3%
a21d              52     1.4%
Other values (4)  47     1.3%

Most occurring characters

ValueCountFrequency (%)
2481
15.7%
1 2356
14.9%
2 1801
11.4%
A 1800
11.4%
/ 1800
11.4%
0 1472
9.3%
C 1077
6.8%
, 679
 
4.3%
3 562
 
3.6%
B 405
 
2.6%
Other values (7) 1334
8.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7205
45.7%
Uppercase Letter 3602
22.8%
Space Separator 2481
 
15.7%
Other Punctuation 2479
 
15.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2356
32.7%
2 1801
25.0%
0 1472
20.4%
3 562
 
7.8%
7 363
 
5.0%
4 267
 
3.7%
9 192
 
2.7%
5 192
 
2.7%
Uppercase Letter
ValueCountFrequency (%)
A 1800
50.0%
C 1077
29.9%
B 405
 
11.2%
K 267
 
7.4%
D 52
 
1.4%
J 1
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
/ 1800
72.6%
, 679
 
27.4%
Space Separator
ValueCountFrequency (%)
2481
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12165
77.2%
Latin 3602
 
22.8%

Most frequent character per script

Common
ValueCountFrequency (%)
2481
20.4%
1 2356
19.4%
2 1801
14.8%
/ 1800
14.8%
0 1472
12.1%
, 679
 
5.6%
3 562
 
4.6%
7 363
 
3.0%
4 267
 
2.2%
9 192
 
1.6%
Latin
ValueCountFrequency (%)
A 1800
50.0%
C 1077
29.9%
B 405
 
11.2%
K 267
 
7.4%
D 52
 
1.4%
J 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15767
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2481
15.7%
1 2356
14.9%
2 1801
11.4%
A 1800
11.4%
/ 1800
11.4%
0 1472
9.3%
C 1077
6.8%
, 679
 
4.3%
3 562
 
3.6%
B 405
 
2.6%
Other values (7) 1334
8.5%
Distinct: 8789
Distinct (%): 88.0%
Missing: 10
Missing (%): 0.1%
Memory size: 156.2 KiB

Length

Max length: 182
Median length: 101
Mean length: 24.061562
Min length: 2

Characters and Unicode

Total characters: 240375
Distinct characters: 2345
Distinct categories: 12
Distinct scripts: 5
Distinct blocks: 14

Unique

Unique: 8427
Unique (%): 84.4%

Sample

1st row: 소주, 燒酒
2nd row: 사슴꼬리육포, 醃鹿尾, 암록미 , 사슴꼬리, 육포
3rd row: 두부조림, 두부조리개, 두부, 소고기, 조림
4th row: 더덕구이, 沙蔘炙, 사삼자, 더덕, 파, 깨소금, 꿀, 구이
5th row: 주악, 쥬왁, 주왁, 밀가루, 거피팥고물
Value                 Count  Frequency (%)
(not recovered)       1015   1.8%
(not recovered)       812    1.4%
소금                  779    1.4%
만드는                743    1.3%
밀가루                695    1.2%
(not recovered)       676    1.2%
(not recovered)       639    1.1%
찹쌀                  625    1.1%
간장                  495    0.9%
기름                  433    0.8%
Other values (15118)  50552  88.0%

Most occurring characters

ValueCountFrequency (%)
47525
 
19.8%
, 39719
 
16.5%
3280
 
1.4%
3179
 
1.3%
3150
 
1.3%
2895
 
1.2%
2742
 
1.1%
2672
 
1.1%
2140
 
0.9%
1836
 
0.8%
Other values (2335) 131237
54.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 151300
62.9%
Space Separator 47528
 
19.8%
Other Punctuation 39750
 
16.5%
Open Punctuation 687
 
0.3%
Close Punctuation 687
 
0.3%
Private Use 203
 
0.1%
Decimal Number 131
 
0.1%
Math Symbol 72
 
< 0.1%
Control 8
 
< 0.1%
Other Symbol 5
 
< 0.1%
Other values (2) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3280
 
2.2%
3179
 
2.1%
3150
 
2.1%
2895
 
1.9%
2742
 
1.8%
2672
 
1.8%
2140
 
1.4%
1836
 
1.2%
1697
 
1.1%
1562
 
1.0%
Other values (2251) 126147
83.4%
Private Use
ValueCountFrequency (%)
42
20.7%
32
15.8%
27
13.3%
21
10.3%
8
 
3.9%
6
 
3.0%
5
 
2.5%
4
 
2.0%
4
 
2.0%
4
 
2.0%
Other values (33) 50
24.6%
Decimal Number
ValueCountFrequency (%)
2 28
21.4%
1 25
19.1%
4 12
9.2%
7 12
9.2%
5 12
9.2%
3 11
 
8.4%
9 9
 
6.9%
6 8
 
6.1%
0 8
 
6.1%
8 6
 
4.6%
Other Punctuation
ValueCountFrequency (%)
, 39719
99.9%
. 14
 
< 0.1%
8
 
< 0.1%
: 5
 
< 0.1%
2
 
< 0.1%
/ 1
 
< 0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 623
90.7%
38
 
5.5%
22
 
3.2%
[ 3
 
0.4%
{ 1
 
0.1%
Other Symbol
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Close Punctuation
ValueCountFrequency (%)
) 620
90.2%
38
 
5.5%
26
 
3.8%
] 3
 
0.4%
Math Symbol
ValueCountFrequency (%)
43
59.7%
19
26.4%
~ 9
 
12.5%
+ 1
 
1.4%
Space Separator
ValueCountFrequency (%)
47525
> 99.9%
  3
 
< 0.1%
Lowercase Letter
ValueCountFrequency (%)
h 2
66.7%
t 1
33.3%
Control
ValueCountFrequency (%)
ž 8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 129219
53.8%
Common 88869
37.0%
Han 22081
 
9.2%
Unknown 203
 
0.1%
Latin 3
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
1214
 
5.5%
824
 
3.7%
814
 
3.7%
365
 
1.7%
355
 
1.6%
332
 
1.5%
313
 
1.4%
311
 
1.4%
282
 
1.3%
260
 
1.2%
Other values (1424) 17011
77.0%
Hangul
ValueCountFrequency (%)
3280
 
2.5%
3179
 
2.5%
3150
 
2.4%
2895
 
2.2%
2742
 
2.1%
2672
 
2.1%
2140
 
1.7%
1836
 
1.4%
1697
 
1.3%
1562
 
1.2%
Other values (817) 104066
80.5%
Unknown
ValueCountFrequency (%)
42
20.7%
32
15.8%
27
13.3%
21
10.3%
8
 
3.9%
6
 
3.0%
5
 
2.5%
4
 
2.0%
4
 
2.0%
4
 
2.0%
Other values (33) 50
24.6%
Common
ValueCountFrequency (%)
47525
53.5%
, 39719
44.7%
( 623
 
0.7%
) 620
 
0.7%
43
 
< 0.1%
38
 
< 0.1%
38
 
< 0.1%
2 28
 
< 0.1%
26
 
< 0.1%
1 25
 
< 0.1%
Other values (29) 184
 
0.2%
Latin
ValueCountFrequency (%)
h 2
66.7%
t 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 129179
53.7%
ASCII 88659
36.9%
CJK 21797
 
9.1%
CJK Compat Ideographs 246
 
0.1%
PUA 203
 
0.1%
None 145
 
0.1%
Math Operators 62
 
< 0.1%
Compat Jamo 40
 
< 0.1%
CJK Ext A 30
 
< 0.1%
CJK Ext B 8
 
< 0.1%
Other values (4) 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
47525
53.6%
, 39719
44.8%
( 623
 
0.7%
) 620
 
0.7%
2 28
 
< 0.1%
1 25
 
< 0.1%
. 14
 
< 0.1%
4 12
 
< 0.1%
7 12
 
< 0.1%
5 12
 
< 0.1%
Other values (15) 69
 
0.1%
Hangul
ValueCountFrequency (%)
3280
 
2.5%
3179
 
2.5%
3150
 
2.4%
2895
 
2.2%
2742
 
2.1%
2672
 
2.1%
2140
 
1.7%
1836
 
1.4%
1697
 
1.3%
1562
 
1.2%
Other values (813) 104026
80.5%
CJK
ValueCountFrequency (%)
1214
 
5.6%
824
 
3.8%
814
 
3.7%
365
 
1.7%
355
 
1.6%
332
 
1.5%
313
 
1.4%
311
 
1.4%
282
 
1.3%
260
 
1.2%
Other values (1370) 16727
76.7%
CJK Compat Ideographs
ValueCountFrequency (%)
58
23.6%
54
22.0%
21
 
8.5%
14
 
5.7%
11
 
4.5%
10
 
4.1%
10
 
4.1%
9
 
3.7%
9
 
3.7%
7
 
2.8%
Other values (25) 43
17.5%
Math Operators
ValueCountFrequency (%)
43
69.4%
19
30.6%
PUA
ValueCountFrequency (%)
42
20.7%
32
15.8%
27
13.3%
21
10.3%
8
 
3.9%
6
 
3.0%
5
 
2.5%
4
 
2.0%
4
 
2.0%
4
 
2.0%
Other values (33) 50
24.6%
None
ValueCountFrequency (%)
38
26.2%
38
26.2%
26
17.9%
22
15.2%
8
 
5.5%
ž 8
 
5.5%
  3
 
2.1%
2
 
1.4%
Compat Jamo
ValueCountFrequency (%)
33
82.5%
5
 
12.5%
1
 
2.5%
1
 
2.5%
CJK Ext A
ValueCountFrequency (%)
11
36.7%
4
 
13.3%
3
 
10.0%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (4) 4
 
13.3%
CJK Ext B
ValueCountFrequency (%)
𦙫 3
37.5%
𩝊 2
25.0%
𩜶 1
 
12.5%
𩼧 1
 
12.5%
𪌳 1
 
12.5%
Specials
ValueCountFrequency (%)
1
100.0%
Block Elements
ValueCountFrequency (%)
1
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Punctuation
ValueCountFrequency (%)
1
100.0%

DB 구축년도
Real number (ℝ)

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2012.5891
Minimum: 2011
Maximum: 2018
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2011
5-th percentile: 2011
Q1: 2012
Median: 2012
Q3: 2013
95-th percentile: 2016
Maximum: 2018
Range: 7
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.4278883
Coefficient of variation (CV): 0.00070947832
Kurtosis: 0.46100425
Mean: 2012.5891
Median Absolute Deviation (MAD): 1
Skewness: 1.0757749
Sum: 20125891
Variance: 2.0388651
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=7)
Value  Count  Frequency (%)
2012   3829   38.3%
2011   2130   21.3%
2013   2090   20.9%
2016   826    8.3%
2014   757    7.6%
2015   365    3.6%
2018   3      < 0.1%

Value  Count  Frequency (%)
2011   2130   21.3%
2012   3829   38.3%
2013   2090   20.9%
2014   757    7.6%
2015   365    3.6%
2016   826    8.3%
2018   3      < 0.1%

Value  Count  Frequency (%)
2018   3      < 0.1%
2016   826    8.3%
2015   365    3.6%
2014   757    7.6%
2013   2090   20.9%
2012   3829   38.3%
2011   2130   21.3%
Distinct5906
Distinct (%)59.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:39:17.897553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length: 44
Median length: 34
Mean length: 5.4145
Min length: 1

Characters and Unicode

Total characters: 54145
Distinct characters: 667
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
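
As an illustration of the character properties mentioned here, the following sketch counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's implementation. The helper `category_counts` and the `CATEGORY_LABELS` mapping are hypothetical names, and the input is the five sample rows shown for this variable. Script and block lookups are left out because the standard library does not expose them.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_LABELS = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Nd": "Decimal Number",
    "Sm": "Math Symbol",
    "Cc": "Control",
    "No": "Other Number",
}


def category_counts(values):
    """Count characters per Unicode general category (hypothetical helper)."""
    counts = Counter()
    for text in values:
        for ch in text:
            counts[unicodedata.category(ch)] += 1
    return counts


# The five sample rows listed for this variable.
SAMPLE_NAMES = ["소주", "사슴꼬리육포", "두부조림", "더덕구이", "주악"]

counts = category_counts(SAMPLE_NAMES)
for code, count in counts.most_common():
    print(CATEGORY_LABELS.get(code, code), count)
```

Every character in these samples is a Hangul syllable, so they all fall under Other Letter. The other categories in the report come from spaces, parentheses, digits and punctuation elsewhere in the column.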

Unique

Unique: 4571
Unique (%): 45.7%

Sample

1st row: 소주
2nd row: 사슴꼬리육포
3rd row: 두부조림
4th row: 더덕구이
5th row: 주악
ValueCountFrequency (%)
만드는 836
 
5.3%
673
 
4.2%
방법 552
 
3.5%
사철 232
 
1.5%
178
 
1.1%
각색 154
 
1.0%
여름철 85
 
0.5%
겨울철 82
 
0.5%
가을철 72
 
0.5%
삼색 59
 
0.4%
Other values (5272) 12933
81.6%

Most occurring characters

ValueCountFrequency (%)
5855
 
10.8%
1514
 
2.8%
1336
 
2.5%
1073
 
2.0%
919
 
1.7%
845
 
1.6%
840
 
1.6%
( 830
 
1.5%
) 829
 
1.5%
740
 
1.4%
Other values (657) 39364
72.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46096
85.1%
Space Separator 5856
 
10.8%
Close Punctuation 831
 
1.5%
Open Punctuation 830
 
1.5%
Other Punctuation 251
 
0.5%
Decimal Number 214
 
0.4%
Math Symbol 60
 
0.1%
Control 6
 
< 0.1%
Other Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1514
 
3.3%
1336
 
2.9%
1073
 
2.3%
919
 
2.0%
845
 
1.8%
840
 
1.8%
740
 
1.6%
728
 
1.6%
681
 
1.5%
662
 
1.4%
Other values (634) 36758
79.7%
Decimal Number
ValueCountFrequency (%)
2 60
28.0%
1 53
24.8%
3 20
 
9.3%
4 17
 
7.9%
9 15
 
7.0%
7 12
 
5.6%
6 11
 
5.1%
5 11
 
5.1%
0 8
 
3.7%
8 7
 
3.3%
Other Punctuation
ValueCountFrequency (%)
, 247
98.4%
. 3
 
1.2%
/ 1
 
0.4%
Math Symbol
ValueCountFrequency (%)
37
61.7%
20
33.3%
~ 3
 
5.0%
Space Separator
ValueCountFrequency (%)
5855
> 99.9%
  1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 829
99.8%
2
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 830
100.0%
Control
ValueCountFrequency (%)
ž 6
100.0%
Other Number
ValueCountFrequency (%)
½ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46082
85.1%
Common 8049
 
14.9%
Han 14
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1514
 
3.3%
1336
 
2.9%
1073
 
2.3%
919
 
2.0%
845
 
1.8%
840
 
1.8%
740
 
1.6%
728
 
1.6%
681
 
1.5%
662
 
1.4%
Other values (620) 36744
79.7%
Common
ValueCountFrequency (%)
5855
72.7%
( 830
 
10.3%
) 829
 
10.3%
, 247
 
3.1%
2 60
 
0.7%
1 53
 
0.7%
37
 
0.5%
3 20
 
0.2%
20
 
0.2%
4 17
 
0.2%
Other values (13) 81
 
1.0%
Han
ValueCountFrequency (%)
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (4) 4
28.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46060
85.1%
ASCII 7982
 
14.7%
Math Operators 57
 
0.1%
Compat Jamo 22
 
< 0.1%
CJK 14
 
< 0.1%
None 10
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5855
73.4%
( 830
 
10.4%
) 829
 
10.4%
, 247
 
3.1%
2 60
 
0.8%
1 53
 
0.7%
3 20
 
0.3%
4 17
 
0.2%
9 15
 
0.2%
7 12
 
0.2%
Other values (7) 44
 
0.6%
Hangul
ValueCountFrequency (%)
1514
 
3.3%
1336
 
2.9%
1073
 
2.3%
919
 
2.0%
845
 
1.8%
840
 
1.8%
740
 
1.6%
728
 
1.6%
681
 
1.5%
662
 
1.4%
Other values (619) 36722
79.7%
Math Operators
ValueCountFrequency (%)
37
64.9%
20
35.1%
Compat Jamo
ValueCountFrequency (%)
22
100.0%
None
ValueCountFrequency (%)
ž 6
60.0%
2
 
20.0%
½ 1
 
10.0%
  1
 
10.0%
CJK
ValueCountFrequency (%)
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (4) 4
28.6%
Distinct: 7346
Distinct (%): 73.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 54
Median length: 38
Mean length: 4.8265
Min length: 1

Characters and Unicode

Total characters: 48265
Distinct characters: 2063
Distinct categories: 14
Distinct scripts: 5
Distinct blocks: 16
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6189
Unique (%): 61.9%

Sample

1st row: 燒酒
2nd row: 醃鹿尾
3rd row: 두부조리개
4th row: 沙蔘炙 더덕구이
5th row: 쥬왁
ValueCountFrequency (%)
사철 246
 
2.1%
83
 
0.7%
겨울철 70
 
0.6%
가을철 65
 
0.5%
여름철 51
 
0.4%
봄철 31
 
0.3%
藥飯 28
 
0.2%
1 27
 
0.2%
2 27
 
0.2%
豆粥 26
 
0.2%
Other values (7520) 11216
94.5%

Most occurring characters

ValueCountFrequency (%)
2029
 
4.2%
( 2009
 
4.2%
) 2009
 
4.2%
1149
 
2.4%
805
 
1.7%
778
 
1.6%
777
 
1.6%
458
 
0.9%
425
 
0.9%
400
 
0.8%
Other values (2053) 37426
77.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41242
85.4%
Close Punctuation 2054
 
4.3%
Open Punctuation 2053
 
4.3%
Space Separator 2029
 
4.2%
Private Use 413
 
0.9%
Other Punctuation 268
 
0.6%
Decimal Number 179
 
0.4%
Other Symbol 9
 
< 0.1%
Letter Number 7
 
< 0.1%
Math Symbol 5
 
< 0.1%
Other values (4) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1149
 
2.8%
805
 
2.0%
778
 
1.9%
777
 
1.9%
458
 
1.1%
425
 
1.0%
400
 
1.0%
362
 
0.9%
357
 
0.9%
352
 
0.9%
Other values (1947) 35379
85.8%
Private Use
ValueCountFrequency (%)
67
16.2%
66
16.0%
37
 
9.0%
29
 
7.0%
22
 
5.3%
16
 
3.9%
13
 
3.1%
11
 
2.7%
7
 
1.7%
7
 
1.7%
Other values (57) 138
33.4%
Decimal Number
ValueCountFrequency (%)
2 79
44.1%
1 72
40.2%
3 22
 
12.3%
4 2
 
1.1%
6 1
 
0.6%
7 1
 
0.6%
8 1
 
0.6%
9 1
 
0.6%
Other Punctuation
ValueCountFrequency (%)
, 254
94.8%
5
 
1.9%
: 4
 
1.5%
2
 
0.7%
/ 1
 
0.4%
1
 
0.4%
1
 
0.4%
Other Symbol
ValueCountFrequency (%)
3
33.3%
2
22.2%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
Open Punctuation
ValueCountFrequency (%)
( 2009
97.9%
27
 
1.3%
15
 
0.7%
[ 2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 2009
97.8%
27
 
1.3%
16
 
0.8%
] 2
 
0.1%
Letter Number
ValueCountFrequency (%)
3
42.9%
3
42.9%
1
 
14.3%
Math Symbol
ValueCountFrequency (%)
4
80.0%
+ 1
 
20.0%
Space Separator
ValueCountFrequency (%)
2029
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Control
ValueCountFrequency (%)
ž 1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Han 20823
43.1%
Hangul 20419
42.3%
Common 6603
 
13.7%
Unknown 413
 
0.9%
Latin 7
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
1149
 
5.5%
805
 
3.9%
777
 
3.7%
352
 
1.7%
343
 
1.6%
322
 
1.5%
294
 
1.4%
286
 
1.4%
283
 
1.4%
246
 
1.2%
Other values (1337) 15966
76.7%
Hangul
ValueCountFrequency (%)
778
 
3.8%
458
 
2.2%
425
 
2.1%
400
 
2.0%
362
 
1.8%
357
 
1.7%
339
 
1.7%
319
 
1.6%
285
 
1.4%
284
 
1.4%
Other values (600) 16412
80.4%
Unknown
ValueCountFrequency (%)
67
16.2%
66
16.0%
37
 
9.0%
29
 
7.0%
22
 
5.3%
16
 
3.9%
13
 
3.1%
11
 
2.7%
7
 
1.7%
7
 
1.7%
Other values (57) 138
33.4%
Common
ValueCountFrequency (%)
2029
30.7%
( 2009
30.4%
) 2009
30.4%
, 254
 
3.8%
2 79
 
1.2%
1 72
 
1.1%
27
 
0.4%
27
 
0.4%
3 22
 
0.3%
16
 
0.2%
Other values (26) 59
 
0.9%
Latin
ValueCountFrequency (%)
3
42.9%
3
42.9%
1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
CJK 20561
42.6%
Hangul 20376
42.2%
ASCII 6493
 
13.5%
PUA 413
 
0.9%
CJK Compat Ideographs 237
 
0.5%
None 94
 
0.2%
Compat Jamo 43
 
0.1%
CJK Ext A 20
 
< 0.1%
Number Forms 7
 
< 0.1%
CJK Ext B 5
 
< 0.1%
Other values (6) 16
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2029
31.2%
( 2009
30.9%
) 2009
30.9%
, 254
 
3.9%
2 79
 
1.2%
1 72
 
1.1%
3 22
 
0.3%
: 4
 
0.1%
- 3
 
< 0.1%
] 2
 
< 0.1%
Other values (8) 10
 
0.2%
CJK
ValueCountFrequency (%)
1149
 
5.6%
805
 
3.9%
777
 
3.8%
352
 
1.7%
343
 
1.7%
322
 
1.6%
294
 
1.4%
286
 
1.4%
283
 
1.4%
246
 
1.2%
Other values (1289) 15704
76.4%
Hangul
ValueCountFrequency (%)
778
 
3.8%
458
 
2.2%
425
 
2.1%
400
 
2.0%
362
 
1.8%
357
 
1.8%
339
 
1.7%
319
 
1.6%
285
 
1.4%
284
 
1.4%
Other values (597) 16369
80.3%
PUA
ValueCountFrequency (%)
67
16.2%
66
16.0%
37
 
9.0%
29
 
7.0%
22
 
5.3%
16
 
3.9%
13
 
3.1%
11
 
2.7%
7
 
1.7%
7
 
1.7%
Other values (57) 138
33.4%
CJK Compat Ideographs
ValueCountFrequency (%)
58
24.5%
51
21.5%
21
 
8.9%
14
 
5.9%
10
 
4.2%
10
 
4.2%
9
 
3.8%
9
 
3.8%
8
 
3.4%
6
 
2.5%
Other values (23) 41
17.3%
Compat Jamo
ValueCountFrequency (%)
33
76.7%
9
 
20.9%
1
 
2.3%
None
ValueCountFrequency (%)
27
28.7%
27
28.7%
16
17.0%
15
16.0%
5
 
5.3%
2
 
2.1%
1
 
1.1%
ž 1
 
1.1%
CJK Ext A
ValueCountFrequency (%)
9
45.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Math Operators
ValueCountFrequency (%)
4
100.0%
Specials
ValueCountFrequency (%)
3
100.0%
Number Forms
ValueCountFrequency (%)
3
42.9%
3
42.9%
1
 
14.3%
CJK Ext B
ValueCountFrequency (%)
𩝊 2
40.0%
𩜶 1
20.0%
𪌳 1
20.0%
𦙫 1
20.0%
Geometric Shapes
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Block Elements
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Box Drawing
ValueCountFrequency (%)
1
100.0%
Distinct: 6266
Distinct (%): 62.7%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 44
Median length: 41
Mean length: 4.293
Min length: 1

Characters and Unicode

Total characters: 42930
Distinct characters: 593
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 4831
Unique (%): 48.3%

Sample

1st row: 소주
2nd row: 암록미
3rd row: 두부조림
4th row: 사삼자
5th row: 주왁
ValueCountFrequency (%)
사철 239
 
2.1%
112
 
1.0%
겨울철 80
 
0.7%
가을철 72
 
0.6%
여름철 72
 
0.6%
36
 
0.3%
완자탕 35
 
0.3%
만드는 34
 
0.3%
화전 33
 
0.3%
봄철 30
 
0.3%
Other values (6154) 10678
93.5%

Most occurring characters

ValueCountFrequency (%)
1474
 
3.4%
1425
 
3.3%
1125
 
2.6%
( 1104
 
2.6%
) 1102
 
2.6%
858
 
2.0%
796
 
1.9%
793
 
1.8%
775
 
1.8%
748
 
1.7%
Other values (583) 32730
76.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 38761
90.3%
Space Separator 1425
 
3.3%
Open Punctuation 1136
 
2.6%
Close Punctuation 1135
 
2.6%
Other Punctuation 279
 
0.6%
Decimal Number 156
 
0.4%
Math Symbol 34
 
0.1%
Other Number 1
 
< 0.1%
Final Punctuation 1
 
< 0.1%
Initial Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1474
 
3.8%
1125
 
2.9%
858
 
2.2%
796
 
2.1%
793
 
2.0%
775
 
2.0%
748
 
1.9%
657
 
1.7%
574
 
1.5%
566
 
1.5%
Other values (553) 30395
78.4%
Decimal Number
ValueCountFrequency (%)
2 46
29.5%
1 39
25.0%
3 15
 
9.6%
4 15
 
9.6%
9 10
 
6.4%
5 10
 
6.4%
6 9
 
5.8%
0 5
 
3.2%
8 4
 
2.6%
7 3
 
1.9%
Open Punctuation
ValueCountFrequency (%)
( 1104
97.2%
27
 
2.4%
4
 
0.4%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1102
97.1%
26
 
2.3%
6
 
0.5%
] 1
 
0.1%
Other Punctuation
ValueCountFrequency (%)
, 274
98.2%
. 2
 
0.7%
2
 
0.7%
/ 1
 
0.4%
Math Symbol
ValueCountFrequency (%)
20
58.8%
~ 8
 
23.5%
6
 
17.6%
Space Separator
ValueCountFrequency (%)
1425
100.0%
Other Number
ValueCountFrequency (%)
½ 1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Control
ValueCountFrequency (%)
ž 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 38749
90.3%
Common 4169
 
9.7%
Han 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1474
 
3.8%
1125
 
2.9%
858
 
2.2%
796
 
2.1%
793
 
2.0%
775
 
2.0%
748
 
1.9%
657
 
1.7%
574
 
1.5%
566
 
1.5%
Other values (542) 30383
78.4%
Common
ValueCountFrequency (%)
1425
34.2%
( 1104
26.5%
) 1102
26.4%
, 274
 
6.6%
2 46
 
1.1%
1 39
 
0.9%
27
 
0.6%
26
 
0.6%
20
 
0.5%
3 15
 
0.4%
Other values (20) 91
 
2.2%
Han
ValueCountFrequency (%)
2
16.7%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%
1
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 38740
90.2%
ASCII 4074
 
9.5%
None 67
 
0.2%
Math Operators 26
 
0.1%
CJK 11
 
< 0.1%
Compat Jamo 9
 
< 0.1%
Punctuation 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1474
 
3.8%
1125
 
2.9%
858
 
2.2%
796
 
2.1%
793
 
2.0%
775
 
2.0%
748
 
1.9%
657
 
1.7%
574
 
1.5%
566
 
1.5%
Other values (541) 30374
78.4%
ASCII
ValueCountFrequency (%)
1425
35.0%
( 1104
27.1%
) 1102
27.0%
, 274
 
6.7%
2 46
 
1.1%
1 39
 
1.0%
3 15
 
0.4%
4 15
 
0.4%
9 10
 
0.2%
5 10
 
0.2%
Other values (9) 34
 
0.8%
None
ValueCountFrequency (%)
27
40.3%
26
38.8%
6
 
9.0%
4
 
6.0%
2
 
3.0%
½ 1
 
1.5%
ž 1
 
1.5%
Math Operators
ValueCountFrequency (%)
20
76.9%
6
 
23.1%
Compat Jamo
ValueCountFrequency (%)
9
100.0%
CJK
ValueCountFrequency (%)
2
18.2%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct: 206
Distinct (%): 24.7%
Missing: 9166
Missing (%): 91.7%
Memory size: 156.2 KiB

Length

Max length: 25
Median length: 17
Mean length: 9.2110312
Min length: 4

Characters and Unicode

Total characters: 7682
Distinct characters: 37
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
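
The length figures for this variable are consistent with the mean length being the total character count divided by the number of non-missing rows (10000 observations minus 9166 missing). This is an inference from the reported numbers, not documented profiler behaviour, and the helper name `mean_length` is hypothetical.

```python
def mean_length(total_characters, observations, missing):
    """Mean string length over non-missing rows (hypothetical helper)."""
    non_missing = observations - missing
    if non_missing <= 0:
        raise ValueError("no non-missing rows to average over")
    return total_characters / non_missing


# Figures reported for this variable: 7682 characters, 9166 of 10000 rows missing.
result = mean_length(total_characters=7682, observations=10000, missing=9166)
print(f"{result:.7f}")
```

The same relation holds for the fully populated text variables in this report, where the divisor is simply the 10000 observations.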

Unique

Unique: 70
Unique (%): 8.4%

Sample

1st row: Dubu-jorim
2nd row: Deodeok-gui
3rd row: Saengseon-hoe
4th row: Yakgwa
5th row: Bibim-guksu
ValueCountFrequency (%)
namul 34
 
3.3%
gui 32
 
3.1%
patjuk 29
 
2.8%
hwajeon 27
 
2.6%
guksu 26
 
2.5%
jjim 20
 
1.9%
yakgwa 19
 
1.8%
dongchimi 18
 
1.7%
sujeonggwa 17
 
1.6%
songpyeon 16
 
1.5%
Other values (195) 807
77.2%

Most occurring characters

ValueCountFrequency (%)
a 711
 
9.3%
o 633
 
8.2%
g 631
 
8.2%
n 588
 
7.7%
e 585
 
7.6%
u 540
 
7.0%
i 477
 
6.2%
k 475
 
6.2%
j 394
 
5.1%
m 295
 
3.8%
Other values (27) 2353
30.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 6781
88.3%
Uppercase Letter 467
 
6.1%
Dash Punctuation 223
 
2.9%
Space Separator 211
 
2.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 711
10.5%
o 633
9.3%
g 631
9.3%
n 588
8.7%
e 585
8.6%
u 540
 
8.0%
i 477
 
7.0%
k 475
 
7.0%
j 394
 
5.8%
m 295
 
4.4%
Other values (11) 1452
21.4%
Uppercase Letter
ValueCountFrequency (%)
S 85
18.2%
G 51
10.9%
B 47
10.1%
Y 47
10.1%
J 43
9.2%
D 38
8.1%
H 32
 
6.9%
P 30
 
6.4%
M 27
 
5.8%
K 23
 
4.9%
Other values (4) 44
9.4%
Dash Punctuation
ValueCountFrequency (%)
- 223
100.0%
Space Separator
ValueCountFrequency (%)
211
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 7248
94.4%
Common 434
 
5.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 711
 
9.8%
o 633
 
8.7%
g 631
 
8.7%
n 588
 
8.1%
e 585
 
8.1%
u 540
 
7.5%
i 477
 
6.6%
k 475
 
6.6%
j 394
 
5.4%
m 295
 
4.1%
Other values (25) 1919
26.5%
Common
ValueCountFrequency (%)
- 223
51.4%
211
48.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7682
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 711
 
9.3%
o 633
 
8.2%
g 631
 
8.2%
n 588
 
7.7%
e 585
 
7.6%
u 540
 
7.0%
i 477
 
6.2%
k 475
 
6.2%
j 394
 
5.1%
m 295
 
3.8%
Other values (27) 2353
30.6%
Distinct: 201
Distinct (%): 24.1%
Missing: 9166
Missing (%): 91.7%
Memory size: 156.2 KiB

Length

Max length: 48
Median length: 29
Mean length: 17.053957
Min length: 1

Characters and Unicode

Total characters: 14223
Distinct characters: 51
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 67
Unique (%): 8.0%

Sample

1st row: Braised Tofu
2nd row: Grilled Deodeok
3rd row: Sliced Raw Fish
4th row: Honey Cookie
5th row: Spicy Noodles
ValueCountFrequency (%)
rice 152
 
6.7%
soup 83
 
3.7%
bean 69
 
3.0%
kimchi 67
 
3.0%
noodles 58
 
2.6%
porridge 57
 
2.5%
cake 49
 
2.2%
beef 48
 
2.1%
braised 46
 
2.0%
grilled 46
 
2.0%
Other values (190) 1593
70.2%

Most occurring characters

ValueCountFrequency (%)
e 1714
 
12.1%
1460
 
10.3%
i 963
 
6.8%
a 958
 
6.7%
o 850
 
6.0%
r 673
 
4.7%
d 610
 
4.3%
n 599
 
4.2%
l 548
 
3.9%
s 499
 
3.5%
Other values (41) 5349
37.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 10472
73.6%
Uppercase Letter 2197
 
15.4%
Space Separator 1460
 
10.3%
Dash Punctuation 86
 
0.6%
Decimal Number 7
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 1714
16.4%
i 963
 
9.2%
a 958
 
9.1%
o 850
 
8.1%
r 673
 
6.4%
d 610
 
5.8%
n 599
 
5.7%
l 548
 
5.2%
s 499
 
4.8%
c 482
 
4.6%
Other values (15) 2576
24.6%
Uppercase Letter
ValueCountFrequency (%)
S 440
20.0%
R 299
13.6%
B 261
11.9%
P 251
11.4%
C 210
9.6%
M 88
 
4.0%
K 84
 
3.8%
N 80
 
3.6%
H 69
 
3.1%
G 69
 
3.1%
Other values (12) 346
15.7%
Space Separator
ValueCountFrequency (%)
1460
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 86
100.0%
Decimal Number
ValueCountFrequency (%)
0 7
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 12669
89.1%
Common 1554
 
10.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 1714
 
13.5%
i 963
 
7.6%
a 958
 
7.6%
o 850
 
6.7%
r 673
 
5.3%
d 610
 
4.8%
n 599
 
4.7%
l 548
 
4.3%
s 499
 
3.9%
c 482
 
3.8%
Other values (37) 4773
37.7%
Common
ValueCountFrequency (%)
1460
94.0%
- 86
 
5.5%
0 7
 
0.5%
' 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14223
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 1714
 
12.1%
1460
 
10.3%
i 963
 
6.8%
a 958
 
6.7%
o 850
 
6.0%
r 673
 
4.7%
d 610
 
4.3%
n 599
 
4.2%
l 548
 
3.9%
s 499
 
3.5%
Other values (41) 5349
37.6%

원문
Text

Distinct: 9883
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 1024
Median length: 732
Mean length: 157.7702
Min length: 2

Characters and Unicode

Total characters: 1577702
Distinct characters: 6799
Distinct categories: 19
Distinct scripts: 5
Distinct blocks: 21
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9801
Unique (%): 98.0%

Sample

1st row○宗班正使則隔二日一供。㺚羊一隻,燒酒一甁,冬則燒肉炭十五斤,烤手炭三十斤。 書狀官。 水稻米二升,魚二尾,豆腐二斤,醃菜一斤,白鹽二兩,茶葉二兩,柴十五斤。
2nd row刀剃去尾根上毛,剔去骨,用塩一錢蕪荑半錢填尾內,杖夾風吹乾。
3rd row재료 두부 2모 소고기 20匁 녹말가루 조금 기름 조금 실고추 조금 파 조금 깨소금 조금 양념(간장, 설탕, 참기름, 깨소금, 후추가루, 파, 마늘) 조리법 두부는 반듯반듯하게 썰어서 녹말가루를 씨워 번철에 지진다. 소고기는 양념(간장, 설탕, 참기름, 후추가루, 깨소금, 파, 마늘)하여 두부지진것과 함께 남비에 담고 간장, 설탕, 파, 실고추를 넣고 조린다. 두부는 녹말가루를 씨우지 않고 누룻누룻하게 지져서 조리기도 하나 녹말가루를 묻혀서 지진 것은 부서지지 않고 좋다.
4th row더덕을 물에 담가 물에 부른 후 겁질을 졍히 글거 씨셔 건져 도마에 노코 칼노 근 근 두다혀 젹쇠에 언고 발 깃 셔로 유 발나 굽다가 파 다져 쇼곰 기름 고쵸가로 합 여 그릇 담아 함담 보아 굽든 더덕을 너허 간 물너 굽되 오 구으면 양념이 타셔 못쓰니 구어 여 치 기릐식 너 졉시에 담고 우희에 쇼곰 려씨라
5th row쥬왁은 밀가루로 반쥭 여 송편갓치 비지되 계피고물을 조고맛치 느어셔 송편보 즉게 여 작 게 여서 기름에 밧삭 지지여 게라
ValueCountFrequency (%)
넣고 2341
 
0.7%
2022
 
0.6%
조금 1778
 
0.6%
1603
 
0.5%
재료 1519
 
0.5%
1416
 
0.4%
1 1197
 
0.4%
1176
 
0.4%
마늘 1164
 
0.4%
1162
 
0.4%
Other values (81115) 306686
95.2%

Most occurring characters

ValueCountFrequency (%)
312445
 
19.8%
33397
 
2.1%
22728
 
1.4%
, 21300
 
1.4%
20780
 
1.3%
19202
 
1.2%
18797
 
1.2%
18771
 
1.2%
18286
 
1.2%
. 16193
 
1.0%
Other values (6789) 1075803
68.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1144275
72.5%
Space Separator 312447
 
19.8%
Other Punctuation 78284
 
5.0%
Decimal Number 15390
 
1.0%
Private Use 7434
 
0.5%
Close Punctuation 7213
 
0.5%
Open Punctuation 7156
 
0.5%
Lowercase Letter 3083
 
0.2%
Other Number 815
 
0.1%
Other Symbol 402
 
< 0.1%
Other values (9) 1203
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33397
 
2.9%
20780
 
1.8%
19202
 
1.7%
18797
 
1.6%
18771
 
1.6%
18286
 
1.6%
15573
 
1.4%
14666
 
1.3%
11570
 
1.0%
10944
 
1.0%
Other values (6234) 962289
84.1%
Private Use
ValueCountFrequency (%)
 217
 
2.9%
214
 
2.9%
168
 
2.3%
163
 
2.2%
159
 
2.1%
154
 
2.1%
150
 
2.0%
144
 
1.9%
134
 
1.8%
128
 
1.7%
Other values (407) 5803
78.1%
Other Punctuation
ValueCountFrequency (%)
22728
29.0%
, 21300
27.2%
. 16193
20.7%
14067
18.0%
1464
 
1.9%
: 1015
 
1.3%
440
 
0.6%
/ 327
 
0.4%
302
 
0.4%
207
 
0.3%
Other values (12) 241
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
g 509
16.5%
s 474
15.4%
t 461
15.0%
a 351
11.4%
c 342
11.1%
m 274
8.9%
l 171
 
5.5%
b 160
 
5.2%
d 83
 
2.7%
e 79
 
2.6%
Other values (12) 179
 
5.8%
Uppercase Letter
ValueCountFrequency (%)
C 164
45.8%
S 77
21.5%
T 69
19.3%
P 11
 
3.1%
V 9
 
2.5%
A 5
 
1.4%
R 3
 
0.8%
G 3
 
0.8%
E 3
 
0.8%
B 2
 
0.6%
Other values (9) 12
 
3.4%
Other Symbol
ValueCountFrequency (%)
103
25.6%
95
23.6%
80
19.9%
54
13.4%
40
 
10.0%
10
 
2.5%
8
 
2.0%
3
 
0.7%
° 3
 
0.7%
1
 
0.2%
Other values (5) 5
 
1.2%
Other Number
ValueCountFrequency (%)
254
31.2%
246
30.2%
151
18.5%
67
 
8.2%
36
 
4.4%
21
 
2.6%
13
 
1.6%
9
 
1.1%
½ 8
 
1.0%
3
 
0.4%
Other values (4) 7
 
0.9%
Decimal Number
ValueCountFrequency (%)
1 3811
24.8%
2 2738
17.8%
0 2625
17.1%
3 2108
13.7%
5 1503
 
9.8%
4 1248
 
8.1%
6 552
 
3.6%
7 379
 
2.5%
8 282
 
1.8%
9 144
 
0.9%
Close Punctuation
ValueCountFrequency (%)
) 4686
65.0%
1171
 
16.2%
889
 
12.3%
] 438
 
6.1%
11
 
0.2%
8
 
0.1%
7
 
0.1%
} 3
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 4629
64.7%
1171
 
16.4%
888
 
12.4%
[ 438
 
6.1%
12
 
0.2%
8
 
0.1%
7
 
0.1%
{ 3
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 238
78.8%
× 18
 
6.0%
> 17
 
5.6%
< 17
 
5.6%
+ 7
 
2.3%
3
 
1.0%
2
 
0.7%
Space Separator
ValueCountFrequency (%)
312445
> 99.9%
  2
 
< 0.1%
Initial Punctuation
ValueCountFrequency (%)
203
91.4%
19
 
8.6%
Final Punctuation
ValueCountFrequency (%)
202
91.4%
19
 
8.6%
Dash Punctuation
ValueCountFrequency (%)
- 51
83.6%
10
 
16.4%
Format
ValueCountFrequency (%)
 8
80.0%
2
 
20.0%
Letter Number
ValueCountFrequency (%)
15
100.0%
Control
ValueCountFrequency (%)
ž 13
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 819698
52.0%
Common 422537
26.8%
Han 324592
 
20.6%
Unknown 7434
 
0.5%
Latin 3441
 
0.2%

Most frequent character per script

Han
ValueCountFrequency (%)
8143
 
2.5%
4884
 
1.5%
4008
 
1.2%
3870
 
1.2%
3766
 
1.2%
3682
 
1.1%
3639
 
1.1%
3545
 
1.1%
3493
 
1.1%
3402
 
1.0%
Other values (4660) 282160
86.9%
Hangul
ValueCountFrequency (%)
33397
 
4.1%
20780
 
2.5%
19202
 
2.3%
18797
 
2.3%
18771
 
2.3%
18286
 
2.2%
15573
 
1.9%
14666
 
1.8%
11570
 
1.4%
10944
 
1.3%
Other values (1565) 637712
77.8%
Unknown
ValueCountFrequency (%)
 217
 
2.9%
214
 
2.9%
168
 
2.3%
163
 
2.2%
159
 
2.1%
154
 
2.1%
150
 
2.0%
144
 
1.9%
134
 
1.8%
128
 
1.7%
Other values (407) 5803
78.1%
Common
ValueCountFrequency (%)
312445
73.9%
22728
 
5.4%
, 21300
 
5.0%
. 16193
 
3.8%
14067
 
3.3%
) 4686
 
1.1%
( 4629
 
1.1%
1 3811
 
0.9%
2 2738
 
0.6%
0 2625
 
0.6%
Other values (86) 17315
 
4.1%
Latin
ValueCountFrequency (%)
g 509
14.8%
s 474
13.8%
t 461
13.4%
a 351
10.2%
c 342
9.9%
m 274
8.0%
l 171
 
5.0%
C 164
 
4.8%
b 160
 
4.6%
d 83
 
2.4%
Other values (31) 452
13.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 818438
51.9%
ASCII 380690
24.1%
CJK 323571
 
20.5%
None 43454
 
2.8%
PUA 7434
 
0.5%
Compat Jamo 1254
 
0.1%
CJK Compat Ideographs 829
 
0.1%
Enclosed Alphanum 803
 
0.1%
Punctuation 642
 
< 0.1%
Geometric Shapes 304
 
< 0.1%
Other values (11) 283
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
312445
82.1%
, 21300
 
5.6%
. 16193
 
4.3%
) 4686
 
1.2%
( 4629
 
1.2%
1 3811
 
1.0%
2 2738
 
0.7%
0 2625
 
0.7%
3 2108
 
0.6%
5 1503
 
0.4%
Other values (63) 8652
 
2.3%
Hangul
ValueCountFrequency (%)
33397
 
4.1%
20780
 
2.5%
19202
 
2.3%
18797
 
2.3%
18771
 
2.3%
18286
 
2.2%
15573
 
1.9%
14666
 
1.8%
11570
 
1.4%
10944
 
1.3%
Other values (1546) 636452
77.8%
None
ValueCountFrequency (%)
22728
52.3%
14067
32.4%
1464
 
3.4%
1171
 
2.7%
1171
 
2.7%
889
 
2.0%
888
 
2.0%
440
 
1.0%
302
 
0.7%
207
 
0.5%
Other values (17) 127
 
0.3%
CJK
ValueCountFrequency (%)
8143
 
2.5%
4884
 
1.5%
4008
 
1.2%
3870
 
1.2%
3766
 
1.2%
3682
 
1.1%
3639
 
1.1%
3545
 
1.1%
3493
 
1.1%
3402
 
1.1%
Other values (4527) 281139
86.9%
Compat Jamo
ValueCountFrequency (%)
768
61.2%
389
31.0%
29
 
2.3%
24
 
1.9%
15
 
1.2%
7
 
0.6%
5
 
0.4%
5
 
0.4%
5
 
0.4%
2
 
0.2%
Other values (4) 5
 
0.4%
CJK Compat Ideographs
ValueCountFrequency (%)
375
45.2%
65
 
7.8%
58
 
7.0%
34
 
4.1%
28
 
3.4%
24
 
2.9%
17
 
2.1%
16
 
1.9%
14
 
1.7%
14
 
1.7%
Other values (66) 184
22.2%
Enclosed Alphanum
ValueCountFrequency (%)
254
31.6%
246
30.6%
151
18.8%
67
 
8.3%
36
 
4.5%
21
 
2.6%
13
 
1.6%
9
 
1.1%
3
 
0.4%
2
 
0.2%
PUA
ValueCountFrequency (%)
 217
 
2.9%
214
 
2.9%
168
 
2.3%
163
 
2.2%
159
 
2.1%
154
 
2.1%
150
 
2.0%
144
 
1.9%
134
 
1.8%
128
 
1.7%
Other values (407) 5803
78.1%
Punctuation
ValueCountFrequency (%)
203
31.6%
202
31.5%
88
13.7%
54
 
8.4%
42
 
6.5%
19
 
3.0%
19
 
3.0%
10
 
1.6%
2
 
0.3%
2
 
0.3%
Geometric Shapes
ValueCountFrequency (%)
103
33.9%
95
31.2%
54
17.8%
40
 
13.2%
8
 
2.6%
3
 
1.0%
1
 
0.3%
Block Elements
ValueCountFrequency (%)
80
100.0%
CJK Ext A
ValueCountFrequency (%)
40
27.4%
19
13.0%
9
 
6.2%
9
 
6.2%
9
 
6.2%
6
 
4.1%
5
 
3.4%
5
 
3.4%
4
 
2.7%
3
 
2.1%
Other values (31) 37
25.3%
CJK Ext B
ValueCountFrequency (%)
𥹸 10
32.3%
𥻓 5
16.1%
𩼧 2
 
6.5%
𣼬 2
 
6.5%
𤩱 2
 
6.5%
𡋺 1
 
3.2%
𪁈 1
 
3.2%
𦢸 1
 
3.2%
𦙳 1
 
3.2%
𨄔 1
 
3.2%
Other values (5) 5
16.1%
Letterlike Symbols
ValueCountFrequency (%)
10
100.0%
Math Operators
ValueCountFrequency (%)
3
60.0%
2
40.0%
Jamo
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Box Drawing
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%
CJK Compat Forms
ValueCountFrequency (%)
1
100.0%
Specials
ValueCountFrequency (%)
1
100.0%
Distinct: 9804
Distinct (%): 98.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 1024
Median length: 721
Mean length: 219.4307
Min length: 12

Characters and Unicode

Total characters: 2194307
Distinct characters: 3758
Distinct categories: 17
Distinct scripts: 5
Distinct blocks: 18
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9660
Unique (%): 96.6%

Sample

1st row종반이 정사일 적에는 이틀 간격으로 달양(㺚羊) 1짝, 소주(燒酒) 1병을 1차례씩 공급하는데, 겨울에는 고기 굽는 숯이 15근, 손 쬐는 숯이 30근이다. 서장관에게는 논벼쌀 2되, 생선 2마리, 두부 2근, 엄채(醃菜) 1근, 흰 소금 2냥, 찻잎 2냥, 땔나무 15근.
2nd row<사슴꼬리>를 칼로 꼬리부분 털을 제거하고 뼈도 제거한 다음 소금 1돈과 무이 1/2돈을 꼬리 속에 넣은 후 꼬챙이에 껴서 바람에 말린다.
3rd row재료 두부 2모 소고기 20돈 녹말가루 조금 기름 조금 실고추 조금 파 조금 깨소금 조금 양념(간장, 설탕, 참기름, 깨소금, 후춧가루, 파, 마늘) 조리법 두부는 반듯반듯하게 썰어서 녹말가루를 씌워 번철에 지진다. 소고기는 간장, 설탕, 참기름, 후춧가루, 깨소금, 파, 마늘로 양념하여 두부 지진 것과 함께 냄비에 담고 간장, 설탕, 파, 실고추를 넣고 조린다. 두부는 녹말가루를 씌우지 않고 누룻누룻하게 지져서 조리기도 하나 녹말가루를 묻혀서 지진 것이 부서지지 않고 좋다.
4th row더덕을 물에 담가 불린 후 껍질을 깨끗이 긁어내고 씻는다. 칼로 더덕을 자근자근 두드리고 석쇠에 얹어 꿩의 깃으로 기름장을 발라 굽는다. 다진 파, 깨소금, 기름, 꿀, 고춧가루를 합하여 그릇에 담고 간을 맞춘다. 여기에 애벌구이한 더덕을 넣고 잠깐 버무려 알맞게 굽되 오래 구우면 양념이 타서 못 쓴다. 구운 더덕을 1치 길이로 잘라 접시에 담고 깨소금을 위에 뿌린다.
5th row주악은 밀가루로 반죽하여 송편같이 빚는다. 거피팥고물을 조금 넣어 송편보다 작고 납작하게 하여 기름에 바싹 지진다.
ValueCountFrequency (%)
넣고 6796
 
1.2%
3493
 
0.6%
물에 3041
 
0.5%
다음 2970
 
0.5%
조금 2809
 
0.5%
2784
 
0.5%
2616
 
0.5%
넣어 2547
 
0.4%
한다 2486
 
0.4%
물을 2436
 
0.4%
Other values (57616) 548882
94.5%

Most occurring characters

ValueCountFrequency (%)
570915
26.0%
50361
 
2.3%
50214
 
2.3%
41912
 
1.9%
, 40357
 
1.8%
. 40266
 
1.8%
38303
 
1.7%
34821
 
1.6%
33132
 
1.5%
30968
 
1.4%
Other values (3748) 1263058
57.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1444373
65.8%
Space Separator 570931
 
26.0%
Other Punctuation 84259
 
3.8%
Decimal Number 56663
 
2.6%
Close Punctuation 14633
 
0.7%
Open Punctuation 13116
 
0.6%
Lowercase Letter 4446
 
0.2%
Math Symbol 3847
 
0.2%
Dash Punctuation 554
 
< 0.1%
Uppercase Letter 407
 
< 0.1%
Other values (7) 1078
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50361
 
3.5%
50214
 
3.5%
41912
 
2.9%
38303
 
2.7%
34821
 
2.4%
33132
 
2.3%
30968
 
2.1%
24999
 
1.7%
24273
 
1.7%
22682
 
1.6%
Other values (3639) 1092708
75.7%
Other Punctuation
ValueCountFrequency (%)
, 40357
47.9%
. 40266
47.8%
: 1267
 
1.5%
/ 1147
 
1.4%
' 395
 
0.5%
336
 
0.4%
" 144
 
0.2%
132
 
0.2%
88
 
0.1%
37
 
< 0.1%
Other values (9) 90
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 18374
32.4%
2 9983
17.6%
3 6921
 
12.2%
5 6300
 
11.1%
0 5859
 
10.3%
4 4055
 
7.2%
7 1842
 
3.3%
6 1714
 
3.0%
8 1058
 
1.9%
9 544
 
1.0%
Other values (4) 13
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
232
82.6%
13
 
4.6%
8
 
2.8%
6
 
2.1%
5
 
1.8%
4
 
1.4%
3
 
1.1%
2
 
0.7%
° 2
 
0.7%
2
 
0.7%
Other values (4) 4
 
1.4%
Math Symbol
ValueCountFrequency (%)
~ 1447
37.6%
< 985
25.6%
> 981
25.5%
247
 
6.4%
99
 
2.6%
26
 
0.7%
26
 
0.7%
× 19
 
0.5%
14
 
0.4%
2
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
g 1954
43.9%
m 691
 
15.5%
s 517
 
11.6%
c 477
 
10.7%
t 445
 
10.0%
a 145
 
3.3%
l 145
 
3.3%
k 63
 
1.4%
x 6
 
0.1%
p 3
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 10248
70.0%
] 2828
 
19.3%
1513
 
10.3%
16
 
0.1%
10
 
0.1%
10
 
0.1%
} 5
 
< 0.1%
2
 
< 0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 8734
66.6%
[ 2833
 
21.6%
1512
 
11.5%
16
 
0.1%
10
 
0.1%
4
 
< 0.1%
{ 4
 
< 0.1%
2
 
< 0.1%
1
 
< 0.1%
Other Number
ValueCountFrequency (%)
½ 37
67.3%
5
 
9.1%
4
 
7.3%
4
 
7.3%
¼ 3
 
5.5%
1
 
1.8%
1
 
1.8%
Uppercase Letter
ValueCountFrequency (%)
L 181
44.5%
C 148
36.4%
T 73
17.9%
A 3
 
0.7%
S 1
 
0.2%
B 1
 
0.2%
Space Separator
ValueCountFrequency (%)
570915
> 99.9%
  16
 
< 0.1%
Initial Punctuation
ValueCountFrequency (%)
298
82.8%
62
 
17.2%
Final Punctuation
ValueCountFrequency (%)
298
83.9%
57
 
16.1%
Dash Punctuation
ValueCountFrequency (%)
- 554
100.0%
Letter Number
ValueCountFrequency (%)
14
100.0%
Control
ValueCountFrequency (%)
ž 12
100.0%
Private Use
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1426814
65.0%
Common 745059
34.0%
Han 17580
 
0.8%
Latin 4853
 
0.2%
Unknown 1
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
350
 
2.0%
324
 
1.8%
244
 
1.4%
243
 
1.4%
204
 
1.2%
203
 
1.2%
172
 
1.0%
130
 
0.7%
120
 
0.7%
120
 
0.7%
Other values (2316) 15470
88.0%
Hangul
ValueCountFrequency (%)
50361
 
3.5%
50214
 
3.5%
41912
 
2.9%
38303
 
2.7%
34821
 
2.4%
33132
 
2.3%
30968
 
2.2%
24999
 
1.8%
24273
 
1.7%
22682
 
1.6%
Other values (1316) 1075149
75.4%
Common
ValueCountFrequency (%)
570915
76.6%
, 40357
 
5.4%
. 40266
 
5.4%
1 18374
 
2.5%
) 10248
 
1.4%
2 9983
 
1.3%
( 8734
 
1.2%
3 6921
 
0.9%
5 6300
 
0.8%
0 5859
 
0.8%
Other values (79) 27102
 
3.6%
Latin
ValueCountFrequency (%)
g 1954
40.3%
m 691
 
14.2%
s 517
 
10.7%
c 477
 
9.8%
t 445
 
9.2%
L 181
 
3.7%
C 148
 
3.0%
a 145
 
3.0%
l 145
 
3.0%
T 73
 
1.5%
Other values (6) 77
 
1.6%
Unknown
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1424381
64.9%
ASCII 744657
33.9%
CJK 17385
 
0.8%
None 3738
 
0.2%
Compat Jamo 2426
 
0.1%
Punctuation 851
 
< 0.1%
Math Operators 400
 
< 0.1%
Misc Symbols 232
 
< 0.1%
CJK Compat Ideographs 150
 
< 0.1%
CJK Ext A 25
 
< 0.1%
Other values (8) 62
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
570915
76.7%
, 40357
 
5.4%
. 40266
 
5.4%
1 18374
 
2.5%
) 10248
 
1.4%
2 9983
 
1.3%
( 8734
 
1.2%
3 6921
 
0.9%
5 6300
 
0.8%
0 5859
 
0.8%
Other values (38) 26700
 
3.6%
Hangul
ValueCountFrequency (%)
50361
 
3.5%
50214
 
3.5%
41912
 
2.9%
38303
 
2.7%
34821
 
2.4%
33132
 
2.3%
30968
 
2.2%
24999
 
1.8%
24273
 
1.7%
22682
 
1.6%
Other values (1310) 1072716
75.3%
Compat Jamo
ValueCountFrequency (%)
2410
99.3%
7
 
0.3%
7
 
0.3%
2
 
0.1%
None
ValueCountFrequency (%)
1513
40.5%
1512
40.4%
336
 
9.0%
132
 
3.5%
½ 37
 
1.0%
21
 
0.6%
× 19
 
0.5%
16
 
0.4%
  16
 
0.4%
16
 
0.4%
Other values (21) 120
 
3.2%
CJK
ValueCountFrequency (%)
350
 
2.0%
324
 
1.9%
244
 
1.4%
243
 
1.4%
204
 
1.2%
203
 
1.2%
172
 
1.0%
130
 
0.7%
120
 
0.7%
120
 
0.7%
Other values (2239) 15275
87.9%
Punctuation
ValueCountFrequency (%)
298
35.0%
298
35.0%
88
 
10.3%
62
 
7.3%
57
 
6.7%
37
 
4.3%
7
 
0.8%
4
 
0.5%
Math Operators
ValueCountFrequency (%)
247
61.8%
99
24.8%
26
 
6.5%
26
 
6.5%
2
 
0.5%
Misc Symbols
ValueCountFrequency (%)
232
100.0%
Letterlike Symbols
ValueCountFrequency (%)
13
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
10
 
6.7%
8
 
5.3%
8
 
5.3%
8
 
5.3%
7
 
4.7%
7
 
4.7%
6
 
4.0%
5
 
3.3%
5
 
3.3%
5
 
3.3%
Other values (46) 81
54.0%
Geometric Shapes
ValueCountFrequency (%)
8
38.1%
6
28.6%
4
19.0%
2
 
9.5%
1
 
4.8%
Enclosed Alphanum
ValueCountFrequency (%)
5
35.7%
4
28.6%
4
28.6%
1
 
7.1%
CJK Ext A
ValueCountFrequency (%)
5
20.0%
4
16.0%
3
12.0%
2
 
8.0%
2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (4) 4
16.0%
Specials
ValueCountFrequency (%)
3
100.0%
PUA
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK Compat
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
CJK Ext B
ValueCountFrequency (%)
𦙳 1
16.7%
𪁈 1
16.7%
𥻓 1
16.7%
𤩱 1
16.7%
𪌳 1
16.7%
𩛩 1
16.7%
Distinct8487
Distinct (%)99.8%
Missing1496
Missing (%)15.0%
Memory size156.2 KiB

Length

Max length1024
Median length676
Mean length209.81279
Min length11

Characters and Unicode

Total characters1784248
Distinct characters1397
Distinct categories17
Distinct scripts5
Distinct blocks17
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
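The per-category counts reported for this variable can be approximated with the Python standard library. This is a minimal sketch, not the way ydata-profiling computes its tables internally; `category_counts` and `CATEGORY_NAMES` are hypothetical names introduced here, and the mapping covers only some of the categories that appear in the report.

```python
import unicodedata
from collections import Counter

# Partial, illustrative mapping from Unicode two-letter category codes
# to the long names shown in the report (assumption: not exhaustive).
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(text):
    """Count the characters of `text` by Unicode general category."""
    codes = Counter(unicodedata.category(ch) for ch in text)
    # Fall back to the raw two-letter code when no long name is mapped.
    return Counter({CATEGORY_NAMES.get(code, code): n for code, n in codes.items()})


# A short value taken from the tools column of this dataset.
counts = category_counts("칼, 꼬챙이")
print(counts.most_common())
```

Summing such counters over every non-missing cell of a column should give totals comparable to the "Most occurring categories" table, although small differences are possible if the profiler normalises text differently.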

Unique

Unique8474
Unique (%)99.6%

Sample

1st row1) 사슴꼬리는 칼로 털과 뼈를 제거한다. 2) 소금 1돈과 무이 1/2돈을 꼬리 속에 집어 넣어 채운다. 3) 꼬챙이로 꿰어서 바람이 통하는 곳에서 말린다.
2nd row1) 두부는 반듯반듯하게 썰어서 녹말가루를 씌워 번철에 지진다. 2) 소고기는 간장, 설탕, 참기름, 후춧가루, 깨소금, 파, 마늘로 양념한다. 3) 1), 2) 를 냄비에 담고 간장, 설탕, 파, 실고추를 넣고 조린다. - 두부는 녹말가루를 씌우지 않고 누룻누룻하게 지져서 조리기도 하나 녹말가루를 묻혀서 지진 것이 부서지지 않고 좋다.
3rd row1) 더덕을 물에 담가 껍질을 깨끗이 긁어내고 씻는다. 2) 씻은 더덕을 칼로 자근자근 두드려 석쇠에 얹는다. 3) 석쇠에 올린 더덕을 꿩의 깃으로 기름장을 발라 굽는다. 4) 다진 파, 깨소금, 기름, 꿀, 고춧가루를 섞어 그릇에 담고 간을 맞춘다. 5) 여기에 애벌 구이한 더덕을 넣어 잠깐 버무린다. 6) 양념한 더덕을 알맞게 다시 굽는다. 7) 구운 더덕을 1치 길이로 잘라 접시에 담고 깨소금을 위에 뿌린다.
4th row1) 밀가루를 반죽한다. 2) 거피팥고물을 조금 넣어 송편보다 작고 납작하게 빚는다. 3) 기름에 바싹 지진다.
5th row1) 오가피 껍질을 벗겨 그늘에서 말리고 칼로 잘게 썰어서 독 밑에 넣는다. 2) 쌀 5말로 하려면 먼저 쌀의 반을 깨끗이 씻어서 가루를 내고 범벅을 개어 식힌 다음 가루누룩 5되를 섞어 빚는다. 3) 술이 다 괴면 남은 쌀을 깨끗이 씻어 익게 찌고 차게 식힌 다음...
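The sample values above number their steps as `1)`, `2)`, and so on inside a single string. The sketch below splits such a cell into steps. It assumes that a step starts at any marker whose number is larger than the previous step number, so back-references such as `3) 1), 2) 를 ...` stay inside step 3. `split_steps` is a hypothetical helper, and cells that restart numbering or carry text before the first marker are not handled.

```python
import re

# A step marker is one or more digits immediately followed by ")".
STEP_MARKER = re.compile(r"(\d+)\)")


def split_steps(recipe):
    """Split a recipe cell into steps at increasing 'N)' markers."""
    starts = []
    last = 0
    for match in STEP_MARKER.finditer(recipe):
        number = int(match.group(1))
        # Only a number larger than the current step opens a new step;
        # smaller numbers are treated as references to earlier steps.
        if number > last:
            starts.append(match.start())
            last = number
    if not starts:
        stripped = recipe.strip()
        return [stripped] if stripped else []
    bounds = starts + [len(recipe)]
    return [recipe[a:b].strip() for a, b in zip(bounds, bounds[1:])]


# The 4th sample row of this variable.
steps = split_steps(
    "1) 밀가루를 반죽한다. 2) 거피팥고물을 조금 넣어 송편보다 작고 납작하게 빚는다. 3) 기름에 바싹 지진다."
)
for step in steps:
    print(step)
```

Text that precedes the first marker is dropped by this sketch, which is a deliberate simplification rather than a property of the data.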
ValueCountFrequency (%)
1 10526
 
2.1%
2 10002
 
2.0%
넣고 7103
 
1.4%
3 7002
 
1.4%
6384
 
1.3%
4 4397
 
0.9%
물에 3150
 
0.6%
2831
 
0.6%
놓는다 2719
 
0.6%
5 2690
 
0.5%
Other values (33081) 433297
88.4%

Most occurring characters

ValueCountFrequency (%)
481645
27.0%
56485
 
3.2%
. 44413
 
2.5%
) 41752
 
2.3%
38915
 
2.2%
36354
 
2.0%
33862
 
1.9%
27413
 
1.5%
24710
 
1.4%
22762
 
1.3%
Other values (1387) 975937
54.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1116029
62.5%
Space Separator 481645
27.0%
Decimal Number 64769
 
3.6%
Other Punctuation 64153
 
3.6%
Close Punctuation 42168
 
2.4%
Dash Punctuation 6812
 
0.4%
Math Symbol 6148
 
0.3%
Lowercase Letter 1200
 
0.1%
Open Punctuation 1153
 
0.1%
Other Number 75
 
< 0.1%
Other values (7) 96
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
56485
 
5.1%
38915
 
3.5%
36354
 
3.3%
33862
 
3.0%
27413
 
2.5%
24710
 
2.2%
22762
 
2.0%
21922
 
2.0%
21675
 
1.9%
18337
 
1.6%
Other values (1294) 813594
72.9%
Decimal Number
ValueCountFrequency (%)
1 19435
30.0%
2 14260
22.0%
3 10648
16.4%
4 6445
 
10.0%
5 5853
 
9.0%
6 2432
 
3.8%
7 2137
 
3.3%
0 1948
 
3.0%
8 967
 
1.5%
9 606
 
0.9%
Other values (10) 38
 
0.1%
Other Punctuation
ValueCountFrequency (%)
. 44413
69.2%
, 17953
28.0%
1456
 
2.3%
/ 238
 
0.4%
: 42
 
0.1%
' 22
 
< 0.1%
% 15
 
< 0.1%
6
 
< 0.1%
3
 
< 0.1%
3
 
< 0.1%
Other values (2) 2
 
< 0.1%
Other Number
ValueCountFrequency (%)
½ 15
20.0%
14
18.7%
14
18.7%
10
13.3%
6
 
8.0%
5
 
6.7%
4
 
5.3%
3
 
4.0%
1
 
1.3%
1
 
1.3%
Other values (2) 2
 
2.7%
Math Symbol
ValueCountFrequency (%)
> 2307
37.5%
< 2307
37.5%
~ 1337
21.7%
108
 
1.8%
43
 
0.7%
23
 
0.4%
22
 
0.4%
+ 1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
12
34.3%
8
22.9%
4
 
11.4%
4
 
11.4%
2
 
5.7%
2
 
5.7%
° 2
 
5.7%
1
 
2.9%
Lowercase Letter
ValueCountFrequency (%)
m 624
52.0%
c 495
41.2%
g 47
 
3.9%
s 20
 
1.7%
x 7
 
0.6%
t 7
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 41752
99.0%
] 366
 
0.9%
46
 
0.1%
3
 
< 0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 778
67.5%
[ 367
31.8%
4
 
0.3%
3
 
0.3%
1
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
T 14
45.2%
L 13
41.9%
C 2
 
6.5%
S 1
 
3.2%
B 1
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 6681
98.1%
125
 
1.8%
6
 
0.1%
Private Use
ValueCountFrequency (%)
11
91.7%
1
 
8.3%
Final Punctuation
ValueCountFrequency (%)
7
87.5%
1
 
12.5%
Initial Punctuation
ValueCountFrequency (%)
5
83.3%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
481645
100.0%
Control
ValueCountFrequency (%)
ž 3
100.0%
Format
ValueCountFrequency (%)
­ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1115801
62.5%
Common 666974
37.4%
Latin 1231
 
0.1%
Han 230
 
< 0.1%
Unknown 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
56485
 
5.1%
38915
 
3.5%
36354
 
3.3%
33862
 
3.0%
27413
 
2.5%
24710
 
2.2%
22762
 
2.0%
21922
 
2.0%
21675
 
1.9%
18337
 
1.6%
Other values (1151) 813366
72.9%
Han
ValueCountFrequency (%)
21
 
9.1%
6
 
2.6%
6
 
2.6%
6
 
2.6%
4
 
1.7%
4
 
1.7%
4
 
1.7%
3
 
1.3%
3
 
1.3%
3
 
1.3%
Other values (134) 170
73.9%
Common
ValueCountFrequency (%)
481645
72.2%
. 44413
 
6.7%
) 41752
 
6.3%
1 19435
 
2.9%
, 17953
 
2.7%
2 14260
 
2.1%
3 10648
 
1.6%
- 6681
 
1.0%
4 6445
 
1.0%
5 5853
 
0.9%
Other values (69) 17889
 
2.7%
Latin
ValueCountFrequency (%)
m 624
50.7%
c 495
40.2%
g 47
 
3.8%
s 20
 
1.6%
T 14
 
1.1%
L 13
 
1.1%
x 7
 
0.6%
t 7
 
0.6%
C 2
 
0.2%
S 1
 
0.1%
Unknown
ValueCountFrequency (%)
11
91.7%
1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1115779
62.5%
ASCII 666186
37.3%
None 1750
 
0.1%
CJK 227
 
< 0.1%
Math Operators 151
 
< 0.1%
Enclosed Alphanum 58
 
< 0.1%
Punctuation 29
 
< 0.1%
Compat Jamo 20
 
< 0.1%
Geometric Shapes 14
 
< 0.1%
Letterlike Symbols 12
 
< 0.1%
Other values (7) 22
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
481645
72.3%
. 44413
 
6.7%
) 41752
 
6.3%
1 19435
 
2.9%
, 17953
 
2.7%
2 14260
 
2.1%
3 10648
 
1.6%
- 6681
 
1.0%
4 6445
 
1.0%
5 5853
 
0.9%
Other values (27) 17101
 
2.6%
Hangul
ValueCountFrequency (%)
56485
 
5.1%
38915
 
3.5%
36354
 
3.3%
33862
 
3.0%
27413
 
2.5%
24710
 
2.2%
22762
 
2.0%
21922
 
2.0%
21675
 
1.9%
18337
 
1.6%
Other values (1147) 813344
72.9%
None
ValueCountFrequency (%)
1456
83.2%
125
 
7.1%
46
 
2.6%
23
 
1.3%
22
 
1.3%
15
 
0.9%
½ 15
 
0.9%
5
 
0.3%
4
 
0.2%
4
 
0.2%
Other values (18) 35
 
2.0%
Math Operators
ValueCountFrequency (%)
108
71.5%
43
 
28.5%
CJK
ValueCountFrequency (%)
21
 
9.3%
6
 
2.6%
6
 
2.6%
6
 
2.6%
4
 
1.8%
4
 
1.8%
4
 
1.8%
3
 
1.3%
3
 
1.3%
3
 
1.3%
Other values (131) 167
73.6%
Compat Jamo
ValueCountFrequency (%)
18
90.0%
1
 
5.0%
1
 
5.0%
Enclosed Alphanum
ValueCountFrequency (%)
14
24.1%
14
24.1%
10
17.2%
6
10.3%
5
 
8.6%
4
 
6.9%
3
 
5.2%
1
 
1.7%
1
 
1.7%
Letterlike Symbols
ValueCountFrequency (%)
12
100.0%
PUA
ValueCountFrequency (%)
11
91.7%
1
 
8.3%
Geometric Shapes
ValueCountFrequency (%)
8
57.1%
4
28.6%
2
 
14.3%
Punctuation
ValueCountFrequency (%)
7
24.1%
6
20.7%
6
20.7%
5
17.2%
3
10.3%
1
 
3.4%
1
 
3.4%
Misc Symbols
ValueCountFrequency (%)
4
100.0%
CJK Ext A
ValueCountFrequency (%)
1
100.0%
Katakana
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%
Distinct2669
Distinct (%)49.9%
Missing4649
Missing (%)46.5%
Memory size156.2 KiB

Length

Max length120
Median length76
Mean length7.894786
Min length1

Characters and Unicode

Total characters42245
Distinct characters476
Distinct categories7
Distinct scripts3
Distinct blocks5

Unique

Unique2268
Unique (%)42.4%

Sample

1st row칼, 꼬챙이
2nd row번철, 냄비
3rd row칼, 석쇠, 그릇, 꿩의 깃
4th row칼, 독
5th row항아리
ValueCountFrequency (%)
항아리 1267
 
9.7%
800
 
6.2%
552
 
4.2%
그릇 520
 
4.0%
냄비 468
 
3.6%
번철 368
 
2.8%
359
 
2.8%
시루 335
 
2.6%
방법 217
 
1.7%
189
 
1.5%
Other values (1172) 7932
61.0%

Most occurring characters

ValueCountFrequency (%)
7669
 
18.2%
, 5257
 
12.4%
1591
 
3.8%
1403
 
3.3%
1354
 
3.2%
884
 
2.1%
766
 
1.8%
756
 
1.8%
750
 
1.8%
736
 
1.7%
Other values (466) 21079
49.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27994
66.3%
Space Separator 7669
 
18.2%
Other Punctuation 5265
 
12.5%
Math Symbol 523
 
1.2%
Decimal Number 272
 
0.6%
Close Punctuation 261
 
0.6%
Open Punctuation 261
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1591
 
5.7%
1403
 
5.0%
1354
 
4.8%
884
 
3.2%
766
 
2.7%
756
 
2.7%
750
 
2.7%
736
 
2.6%
677
 
2.4%
666
 
2.4%
Other values (443) 18411
65.8%
Decimal Number
ValueCountFrequency (%)
1 102
37.5%
2 96
35.3%
3 34
 
12.5%
4 13
 
4.8%
0 11
 
4.0%
5 10
 
3.7%
6 3
 
1.1%
8 2
 
0.7%
7 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
, 5257
99.8%
. 2
 
< 0.1%
/ 2
 
< 0.1%
: 2
 
< 0.1%
2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
< 258
49.3%
> 258
49.3%
~ 6
 
1.1%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 257
98.5%
] 4
 
1.5%
Open Punctuation
ValueCountFrequency (%)
( 257
98.5%
[ 4
 
1.5%
Space Separator
ValueCountFrequency (%)
7669
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27973
66.2%
Common 14251
33.7%
Han 21
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1591
 
5.7%
1403
 
5.0%
1354
 
4.8%
884
 
3.2%
766
 
2.7%
756
 
2.7%
750
 
2.7%
736
 
2.6%
677
 
2.4%
666
 
2.4%
Other values (424) 18390
65.7%
Common
ValueCountFrequency (%)
7669
53.8%
, 5257
36.9%
< 258
 
1.8%
> 258
 
1.8%
) 257
 
1.8%
( 257
 
1.8%
1 102
 
0.7%
2 96
 
0.7%
3 34
 
0.2%
4 13
 
0.1%
Other values (13) 50
 
0.4%
Han
ValueCountFrequency (%)
2
 
9.5%
2
 
9.5%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Other values (9) 9
42.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27973
66.2%
ASCII 14248
33.7%
CJK 21
 
< 0.1%
None 2
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7669
53.8%
, 5257
36.9%
< 258
 
1.8%
> 258
 
1.8%
) 257
 
1.8%
( 257
 
1.8%
1 102
 
0.7%
2 96
 
0.7%
3 34
 
0.2%
4 13
 
0.1%
Other values (11) 47
 
0.3%
Hangul
ValueCountFrequency (%)
1591
 
5.7%
1403
 
5.0%
1354
 
4.8%
884
 
3.2%
766
 
2.7%
756
 
2.7%
750
 
2.7%
736
 
2.6%
677
 
2.4%
666
 
2.4%
Other values (424) 18390
65.7%
CJK
ValueCountFrequency (%)
2
 
9.5%
2
 
9.5%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Other values (9) 9
42.9%
None
ValueCountFrequency (%)
2
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Distinct9122
Distinct (%)91.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB

Length

Max length702
Median length569
Mean length158.2324
Min length16

Characters and Unicode

Total characters1582324
Distinct characters839
Distinct categories13
Distinct scripts5
Distinct blocks8

Unique

Unique8756
Unique (%)87.6%

Sample

1st row[누룩, , , , , , , ,주재료]
2nd row[사슴꼬리, , , , ,1,개, ,주재료],[소금, , , , ,1,전, ,부재료],[무이, , , , ,0.5,전,반 전,부재료]
3rd row[두부, , , , , , , ,주재료],[녹말가루, , , , , , , ,부재료],[소고기, , , , , , , ,부재료],[간장, , , , , , ,(양념),부재료],[설탕, , , , , , ,(양념),부재료],[참기름, , , , , , ,(양념),부재료],[후추가루, , , , , , ,(양념),부재료],[깨소금, , , , , , ,(양념),부재료],[파, , , , , , ,(양념),부재료],[마늘, , , , , , ,(양념),부재료],[간장, , , , , , , ,부재료],[설탕, , , , , , , ,부재료],[파, , , , , , , ,부재료],[실고추, , , , , , , ,부재료]
4th row[더덕, , , , , , , ,주재료],[다진 파, , , , , , , ,부재료],[깨소금, , , , , , , ,부재료],[기름, , , , , , , ,부재료],[꿀, , , , , , , ,부재료],[고춧가루, , , , , , , ,부재료]
5th row[밀가루, , , , , , , ,부재료],[거피팥고물, , , , , , , ,부재료],[기름, , , , , , , ,부재료]
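Each sample value above is a comma-separated list of bracketed records, and every record shown has nine comma-separated fields ending in 주재료 (main ingredient) or 부재료 (secondary ingredient). The sketch below parses that layout under those assumptions. The English field names are hypothetical translations of the column header, `parse_ingredients` is an illustrative helper, and records whose fields themselves contain commas or square brackets are skipped or mis-split rather than handled.

```python
import re
from typing import NamedTuple


class Ingredient(NamedTuple):
    # Field names are assumed English renderings of the column header.
    name: str       # 식재료명
    original: str   # 원문
    reading: str    # 독음
    alias: str      # 이명
    alias_en: str   # 이명(영문)
    quantity: str   # 수량 (kept as text; values such as "약간" are not numeric)
    unit: str       # 단위
    note: str       # 기타
    role: str       # 주재료 or 부재료


# One record is the text between a "[" and the next "]".
RECORD = re.compile(r"\[([^\[\]]*)\]")


def parse_ingredients(cell):
    """Parse one ingredient cell into a list of Ingredient records."""
    items = []
    for match in RECORD.finditer(cell):
        fields = [field.strip() for field in match.group(1).split(",")]
        if len(fields) != 9:
            # Skip records that do not match the assumed nine-field layout.
            continue
        items.append(Ingredient(*fields))
    return items


# The 2nd sample row of this variable.
items = parse_ingredients(
    "[사슴꼬리, , , , ,1,개, ,주재료],[소금, , , , ,1,전, ,부재료],[무이, , , , ,0.5,전,반 전,부재료]"
)
for item in items:
    print(item.name, item.quantity, item.unit, item.role)
```

Quantities are left as strings on purpose, since the column mixes numbers with words; converting them would need a separate, explicitly chosen rule.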
ValueCountFrequency (%)
276279
72.4%
부재료 5999
 
1.6%
방법 2759
 
0.7%
주재료 1333
 
0.3%
부재료],[파 1323
 
0.3%
부재료],[소금 1046
 
0.3%
부재료],[기름 971
 
0.3%
부재료],[간장 882
 
0.2%
주재료],[소금 769
 
0.2%
부재료],[마늘 762
 
0.2%
Other values (18679) 89335
 
23.4%

Most occurring characters

ValueCountFrequency (%)
, 550750
34.8%
371462
23.5%
[ 62606
 
4.0%
] 62563
 
4.0%
62191
 
3.9%
62115
 
3.9%
44805
 
2.8%
1 21292
 
1.3%
19085
 
1.2%
) 13713
 
0.9%
Other values (829) 311742
19.7%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 553018
34.9%
Other Letter 433742
27.4%
Space Separator 371462
23.5%
Open Punctuation 76290
 
4.8%
Close Punctuation 76277
 
4.8%
Decimal Number 53158
 
3.4%
Math Symbol 15304
 
1.0%
Lowercase Letter 2810
 
0.2%
Uppercase Letter 247
 
< 0.1%
Other Symbol 12
 
< 0.1%
Other values (3) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62191
 
14.3%
62115
 
14.3%
44805
 
10.3%
19085
 
4.4%
9347
 
2.2%
8187
 
1.9%
8160
 
1.9%
7251
 
1.7%
7170
 
1.7%
5620
 
1.3%
Other values (781) 199811
46.1%
Lowercase Letter
ValueCountFrequency (%)
g 1833
65.2%
s 415
 
14.8%
t 350
 
12.5%
m 109
 
3.9%
k 67
 
2.4%
c 10
 
0.4%
o 6
 
0.2%
l 6
 
0.2%
p 3
 
0.1%
a 3
 
0.1%
Other values (3) 8
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 21292
40.1%
2 8644
16.3%
5 6465
 
12.2%
0 6176
 
11.6%
3 5658
 
10.6%
4 2271
 
4.3%
6 1095
 
2.1%
7 727
 
1.4%
8 614
 
1.2%
9 216
 
0.4%
Other Punctuation
ValueCountFrequency (%)
, 550750
99.6%
. 2125
 
0.4%
/ 110
 
< 0.1%
: 24
 
< 0.1%
* 5
 
< 0.1%
% 4
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
< 7499
49.0%
> 7492
49.0%
~ 240
 
1.6%
73
 
0.5%
Open Punctuation
ValueCountFrequency (%)
[ 62606
82.1%
( 13683
 
17.9%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
] 62563
82.0%
) 13713
 
18.0%
1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
5
41.7%
4
33.3%
3
25.0%
Uppercase Letter
ValueCountFrequency (%)
L 181
73.3%
T 66
 
26.7%
Space Separator
ValueCountFrequency (%)
371462
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Private Use
ValueCountFrequency (%)
1
100.0%
Other Number
ValueCountFrequency (%)
½ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1145515
72.4%
Hangul 433687
 
27.4%
Latin 3057
 
0.2%
Han 64
 
< 0.1%
Unknown 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62191
 
14.3%
62115
 
14.3%
44805
 
10.3%
19085
 
4.4%
9347
 
2.2%
8187
 
1.9%
8160
 
1.9%
7251
 
1.7%
7170
 
1.7%
5620
 
1.3%
Other values (742) 199756
46.1%
Han
ValueCountFrequency (%)
5
 
7.8%
4
 
6.2%
3
 
4.7%
3
 
4.7%
3
 
4.7%
3
 
4.7%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
Other values (31) 35
54.7%
Common
ValueCountFrequency (%)
, 550750
48.1%
371462
32.4%
[ 62606
 
5.5%
] 62563
 
5.5%
1 21292
 
1.9%
) 13713
 
1.2%
( 13683
 
1.2%
2 8644
 
0.8%
< 7499
 
0.7%
> 7492
 
0.7%
Other values (20) 25811
 
2.3%
Latin
ValueCountFrequency (%)
g 1833
60.0%
s 415
 
13.6%
t 350
 
11.4%
L 181
 
5.9%
m 109
 
3.6%
k 67
 
2.2%
T 66
 
2.2%
c 10
 
0.3%
o 6
 
0.2%
l 6
 
0.2%
Other values (5) 14
 
0.5%
Unknown
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1148493
72.6%
Hangul 433663
 
27.4%
Math Operators 73
 
< 0.1%
CJK 64
 
< 0.1%
Compat Jamo 15
 
< 0.1%
None 12
 
< 0.1%
Specials 3
 
< 0.1%
PUA 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 550750
48.0%
371462
32.3%
[ 62606
 
5.5%
] 62563
 
5.4%
1 21292
 
1.9%
) 13713
 
1.2%
( 13683
 
1.2%
2 8644
 
0.8%
< 7499
 
0.7%
> 7492
 
0.7%
Other values (30) 28789
 
2.5%
Hangul
ValueCountFrequency (%)
62191
 
14.3%
62115
 
14.3%
44805
 
10.3%
19085
 
4.4%
9347
 
2.2%
8187
 
1.9%
8160
 
1.9%
7251
 
1.7%
7170
 
1.7%
5620
 
1.3%
Other values (739) 199732
46.1%
Math Operators
ValueCountFrequency (%)
73
100.0%
Compat Jamo
ValueCountFrequency (%)
15
100.0%
CJK
ValueCountFrequency (%)
5
 
7.8%
4
 
6.2%
3
 
4.7%
3
 
4.7%
3
 
4.7%
3
 
4.7%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
Other values (31) 35
54.7%
None
ValueCountFrequency (%)
5
41.7%
4
33.3%
1
 
8.3%
1
 
8.3%
½ 1
 
8.3%
Specials
ValueCountFrequency (%)
3
100.0%
PUA
ValueCountFrequency (%)
1
100.0%

Sample

대표식품 코드대표 식품명전통 식품코드출전문헌 현대어출전문헌 원어출전문헌 저자명출전문헌 간행년도출전문헌 해당페이지분석정보 IPC분석정보 KTKRC분석정보 KEYWORDDB 구축년도전통식품명전통식품명 (원문)전통식품명 (독음)전통식품명 (영문-음가)전통식품명 (영문-번역)원문번역문조리법 (가공기술)조리기기 및 도구식재료 및 배합비 [식재료명,원문,독음,이명,이명(영문),수량,단위,기타,주재료,부재료]
5916102728소주10058연행록燕行錄최덕중(崔德中)17세기<NA>C12G 3/12<NA>소주, 燒酒2011소주燒酒소주<NA><NA>○宗班正使則隔二日一供。㺚羊一隻,燒酒一甁,冬則燒肉炭十五斤,烤手炭三十斤。 書狀官。 水稻米二升,魚二尾,豆腐二斤,醃菜一斤,白鹽二兩,茶葉二兩,柴十五斤。종반이 정사일 적에는 이틀 간격으로 달양(㺚羊) 1짝, 소주(燒酒) 1병을 1차례씩 공급하는데, 겨울에는 고기 굽는 숯이 15근, 손 쬐는 숯이 30근이다. 서장관에게는 논벼쌀 2되, 생선 2마리, 두부 2근, 엄채(醃菜) 1근, 흰 소금 2냥, 찻잎 2냥, 땔나무 15근.<NA><NA>[누룩, , , , , , , ,주재료]
4960102224사슴꼬리육포103428오주연문장전산고五洲衍文長箋散稿이규경(李圭景)19세기<NA>A23L 1/318<NA>사슴꼬리육포, 醃鹿尾, 암록미 , 사슴꼬리, 육포2012사슴꼬리육포醃鹿尾암록미<NA><NA>刀剃去尾根上毛,剔去骨,用塩一錢蕪荑半錢填尾內,杖夾風吹乾。<사슴꼬리>를 칼로 꼬리부분 털을 제거하고 뼈도 제거한 다음 소금 1돈과 무이 1/2돈을 꼬리 속에 넣은 후 꼬챙이에 껴서 바람에 말린다.1) 사슴꼬리는 칼로 털과 뼈를 제거한다. 2) 소금 1돈과 무이 1/2돈을 꼬리 속에 집어 넣어 채운다. 3) 꼬챙이로 꿰어서 바람이 통하는 곳에서 말린다.칼, 꼬챙이[사슴꼬리, , , , ,1,개, ,주재료],[소금, , , , ,1,전, ,부재료],[무이, , , , ,0.5,전,반 전,부재료]
2923101304두부조림121570이조궁정요리통고李朝宮庭料理通考한희순(韓熙順), 황혜성(黃慧性), 이혜경(李惠卿)1957년117A23L 1/39, A23L 1/212<NA>두부조림, 두부조리개, 두부, 소고기, 조림2013두부조림두부조리개두부조림Dubu-jorimBraised Tofu재료 두부 2모 소고기 20匁 녹말가루 조금 기름 조금 실고추 조금 파 조금 깨소금 조금 양념(간장, 설탕, 참기름, 깨소금, 후추가루, 파, 마늘) 조리법 두부는 반듯반듯하게 썰어서 녹말가루를 씨워 번철에 지진다. 소고기는 양념(간장, 설탕, 참기름, 후추가루, 깨소금, 파, 마늘)하여 두부지진것과 함께 남비에 담고 간장, 설탕, 파, 실고추를 넣고 조린다. 두부는 녹말가루를 씨우지 않고 누룻누룻하게 지져서 조리기도 하나 녹말가루를 묻혀서 지진 것은 부서지지 않고 좋다.재료 두부 2모 소고기 20돈 녹말가루 조금 기름 조금 실고추 조금 파 조금 깨소금 조금 양념(간장, 설탕, 참기름, 깨소금, 후춧가루, 파, 마늘) 조리법 두부는 반듯반듯하게 썰어서 녹말가루를 씌워 번철에 지진다. 소고기는 간장, 설탕, 참기름, 후춧가루, 깨소금, 파, 마늘로 양념하여 두부 지진 것과 함께 냄비에 담고 간장, 설탕, 파, 실고추를 넣고 조린다. 두부는 녹말가루를 씌우지 않고 누룻누룻하게 지져서 조리기도 하나 녹말가루를 묻혀서 지진 것이 부서지지 않고 좋다.1) 두부는 반듯반듯하게 썰어서 녹말가루를 씌워 번철에 지진다. 2) 소고기는 간장, 설탕, 참기름, 후춧가루, 깨소금, 파, 마늘로 양념한다. 3) 1), 2) 를 냄비에 담고 간장, 설탕, 파, 실고추를 넣고 조린다. - 두부는 녹말가루를 씌우지 않고 누룻누룻하게 지져서 조리기도 하나 녹말가루를 묻혀서 지진 것이 부서지지 않고 좋다.번철, 냄비[두부, , , , , , , ,주재료],[녹말가루, , , , , , , ,부재료],[소고기, , , , , , , ,부재료],[간장, , , , , , ,(양념),부재료],[설탕, , , , , , ,(양념),부재료],[참기름, , , , , , ,(양념),부재료],[후추가루, , , , , , ,(양념),부재료],[깨소금, , , , , , ,(양념),부재료],[파, , , , , , ,(양념),부재료],[마늘, , , , , , ,(양념),부재료],[간장, , , , , , , ,부재료],[설탕, , , , , , , ,부재료],[파, , , , , , , ,부재료],[실고추, , , , , , , ,부재료]
2392101065더덕구이107298시의전서是議全書저자미상1800년대말<NA>A23L 1/214, A23L 1/01<NA>더덕구이, 沙蔘炙, 사삼자, 더덕, 파, 깨소금, 꿀, 구이2012더덕구이沙蔘炙 더덕구이사삼자Deodeok-guiGrilled Deodeok더덕을 물에 담가 물에 부른 후 겁질을 졍히 글거 씨셔 건져 도마에 노코 칼노 근 근 두다혀 젹쇠에 언고 발 깃 셔로 유 발나 굽다가 파 다져 쇼곰 기름 고쵸가로 합 여 그릇 담아 함담 보아 굽든 더덕을 너허 간 물너 굽되 오 구으면 양념이 타셔 못쓰니 구어 여 치 기릐식 너 졉시에 담고 우희에 쇼곰 려씨라더덕을 물에 담가 불린 후 껍질을 깨끗이 긁어내고 씻는다. 칼로 더덕을 자근자근 두드리고 석쇠에 얹어 꿩의 깃으로 기름장을 발라 굽는다. 다진 파, 깨소금, 기름, 꿀, 고춧가루를 합하여 그릇에 담고 간을 맞춘다. 여기에 애벌구이한 더덕을 넣고 잠깐 버무려 알맞게 굽되 오래 구우면 양념이 타서 못 쓴다. 구운 더덕을 1치 길이로 잘라 접시에 담고 깨소금을 위에 뿌린다.1) 더덕을 물에 담가 껍질을 깨끗이 긁어내고 씻는다. 2) 씻은 더덕을 칼로 자근자근 두드려 석쇠에 얹는다. 3) 석쇠에 올린 더덕을 꿩의 깃으로 기름장을 발라 굽는다. 4) 다진 파, 깨소금, 기름, 꿀, 고춧가루를 섞어 그릇에 담고 간을 맞춘다. 5) 여기에 애벌 구이한 더덕을 넣어 잠깐 버무린다. 6) 양념한 더덕을 알맞게 다시 굽는다. 7) 구운 더덕을 1치 길이로 잘라 접시에 담고 깨소금을 위에 뿌린다.칼, 석쇠, 그릇, 꿩의 깃[더덕, , , , , , , ,주재료],[다진 파, , , , , , , ,부재료],[깨소금, , , , , , , ,부재료],[기름, , , , , , , ,부재료],[꿀, , , , , , , ,부재료],[고춧가루, , , , , , , ,부재료]
9075104036주악183274반찬등속반 등속(饌饍繕冊)미상1913년<NA>A23L 7/10, A23P 1/08<NA>주악, 쥬왁, 주왁, 밀가루, 거피팥고물2016주악쥬왁주왁<NA><NA>쥬왁은 밀가루로 반쥭 여 송편갓치 비지되 계피고물을 조고맛치 느어셔 송편보 즉게 여 작 게 여서 기름에 밧삭 지지여 게라주악은 밀가루로 반죽하여 송편같이 빚는다. 거피팥고물을 조금 넣어 송편보다 작고 납작하게 하여 기름에 바싹 지진다.1) 밀가루를 반죽한다. 2) 거피팥고물을 조금 넣어 송편보다 작고 납작하게 빚는다. 3) 기름에 바싹 지진다.<NA>[밀가루, , , , , , , ,부재료],[거피팥고물, , , , , , , ,부재료],[기름, , , , , , , ,부재료]
7522103356오가피주10807보감녹보감녹저자미상1927년<NA>C12G 3/02<NA>오가피주, 오가피쥬법2011오가피주오가피쥬법오가피주<NA><NA>오가피물 모을 에 만히 벗겨 음건 야 유협도의 게 쓰러 독 밋테 너코  닷 말 랴면 반식 셰 야 말 야 범벅 여 식힌 후 로누록 닷 되 섯거 비져서 다 괴거던 남은  세 야 닉게  게 식혀 탕수을 다른 슐 물과 갓치 잡아 졍히 엿다가 다 게 데여 공심의 알맛게 먹그면 풍병과 반신불슈을 고칠 불 안이라 옛 윤공도와 작소란 람이 이 슐을 먹그니 나흔 삼 식 살고 아달은 서흔을 나흐니라 오가피 일명은 금념이오 일명은 문쟝초니 우흐로 오셩 졍기을 응 고로 닙피 오츌이니 고인이 왈 만일 모슴 오가피를 어드면 금옥이 슈에 득 거살 쓰지 안이 여기니라 고 우 왈 문쟝초로 슐을 면 금이 귀 믈 니아지 못 다 니라오가피 껍질을 벗겨 그늘에서 말리고 칼로 잘게 썰어서 독 밑에 넣는다. 쌀 5말로 하려면 먼저 쌀의 반을 깨끗이 씻어서 가루를 내고 범벅을 개어 식힌 다음 가루누룩 5되를 섞어 빚는다. 술이 다 괴면 남은 쌀을 깨끗이 씻어 익게 찌고 차게 식힌 다음... 물을 끓이고 술을 동량으로 섞어 알맞은 양을 공복에 따스하게 마시면 풍병과 반신불수를 고칠 뿐 아니라 옛날 사람 윤공도와 맹작소란 사람은 이 술을 먹고 300세를 누리고 아들을 30명을 낳았다고 한다. 오가피의 다른 이름은 금념과 문장초이다. 하늘의 오차성(五車星)의 정기를 받아서 잎이 다섯이 난다고 한다. 옛 사람이 말하기를 만일 한 모금의 오가피를 얻으면 금과 옥이 가득한 수레도 마음에 들어오지 않는다 하고 또 말하기를 오가피로 빚은 술은 금보다 더 귀하다 하였다.1) 오가피 껍질을 벗겨 그늘에서 말리고 칼로 잘게 썰어서 독 밑에 넣는다. 2) 쌀 5말로 하려면 먼저 쌀의 반을 깨끗이 씻어서 가루를 내고 범벅을 개어 식힌 다음 가루누룩 5되를 섞어 빚는다. 3) 술이 다 괴면 남은 쌀을 깨끗이 씻어 익게 찌고 차게 식힌 다음...칼, 독[오가피, , , , , , , ,주재료],[쌀, , , , ,5,말, ,주재료],[가루누룩, , , , ,5,되, ,주재료]
630310289610674주찬酒饌저자미상1800연대초엽<NA>C12G 3/02, C12G 3/00<NA>삼키기 아까울 정도로 향이 좋은 술, 石炭香, 석탄향2011삼키기 아까울 정도로 향이 좋은 술石炭香석탄향<NA><NA>白米二升,百洗作末,水二瓶作粥,待冷,曲一升調置,不有他水氣。冬七日春秋五日夏三日,浄精粘米一斗,百洗浸宿,翌日熟烝,待冷,本酒調釀合,七日後垂之,味甚烈美。쌀 2되를 여러 번 씻어서 가루를 내고 물 2병으로 죽을 만들어서 차게 식힌 후, 누룩 1되를 딴 물을 넣지 않고 섞어 둔다. 겨울이면 7일, 봄∙가을이면 5일, 여름이며 3일 후에 깨끗이 찧은 찹쌀 1말을 여러 번 씻어서 물에 하룻밤 담가 두었다가 다음날 푹 쪄서 차게 식힌 다음, 밑술에 섞어 빚는다. 7일 후에 용수를 박는다. 맛이 매우 독하고 좋다.<밑술 빚기> 1) 쌀 2되를 여러 번 씻어서 가루를 낸 다음 물 2병을 부어 죽을 만들어 차게 식힌다. 2) 누룩 1되를 1)과 섞어 빚는다. <덧술 빚기>(겨울이면 7일, 봄,가을이면 5일, 여름이며 3일 후) 1) 찹쌀 1말을 여러 번 씻어서 물에 하룻밤 담가 두었다가 다음날 푹 쪄서 차게 식힌다. 2) 식힌 고두밥과 앞서 빚은 밑술을 섞어 빚는다. 3) 7일 후에 용수를 박는다. - 맛이 매우 독하고 좋다.<NA>[쌀, , , , ,2,되,<밑술>,주재료],[물, , , , ,2,병,<밑술>,부재료],[누룩, , , , ,1,되,<밑술>,주재료],[찹쌀, , , , ,1,말,<덧술>,주재료]
9270104116무지게미절임121849농정회요農政會要최한기(崔漢綺)1830년경<NA>A23L 1/218<NA>술지게미무절임 만드는 방법, 糟蘿葍方, 조라복방, 술지게미무절임, 술지게미, 무2013술지게미무절임 만드는 방법糟蘿葍方조라복방<NA><NA>蘿葍一斤,塩三兩,以蘿葍不要見水揩净,帶須半根晒乹。糟與塩拌過,次入蘿葍又拌過,入瓮。此方非暴吃者。무[蘿葍] 1근, 소금 3냥. 무는 물기 없도록 깨끗이 닦아내고 잔뿌리 달린 무를 햇볕에 반쯤 말린다. 술지게미[糟]와 소금을 잘 섞은 다음 무를 넣고 또 골고루 버무린 뒤에 항아리에 넣는다. 이것은 바로 먹는 것이 아니다.1) 무 1근을 물기 없이 깨끗이 닦는다. 3) 잔뿌리가 달린 무를 햇볕에 반쯤 말린다. 4) 술지게미와 소금 3냥을 잘 섞은 다음 무를 넣고 골고루 버무려 항아리에 넣는다. - 이것은 바로 먹는 것이 아니다.항아리[무, , , , ,1,근, ,부재료],[술지게미, , , , , , , ,부재료],[소금, , , , ,3,냥, ,부재료]
9220104094중박계106999조선요리제법조선요리제법방신영(方信榮)1934三三九A23G 3/00, A23G 3/34<NA>중박계, 중백기, 약과, 밀가루, 엿, 꿀2012중박계중백기중박계<NA><NA>중백기는 약과 만드는 법과 꼭 같은것인데 반만 익혀서 건지는 것이라 약과는 속까지 검은빛이 나도록 익히고 중백기는 것만 노랗게 익히는 것이니 반죽해서 닷분 두께로 밀어가지고 한치 길이 팔푼 넓이로 베어서 지져서 아무것도 바르지 않고 그대로 접시에 놓는것이니라중백기는 약과 만드는 법과 꼭 같은 방법으로 하는 것이다. 반만 익혀서 건지는 것이 다르다. 약과는 속까지 검은빛이 나도록 익히고 중백기는 겉만 노랗게 익히는 것이다. 반죽해서 닷 분 두께로 밀어가지고 한 치 길이, 팔 푼 너비로 베어서 지져서 아무것도 바르지 않고 그대로 접시에 놓는다.1) 중백기는 약과 만드는 법과 꼭 같고 반만 익혀서 건지는 것만 다르다. 또한 약과는 속까지 검은빛이 나도록 익히지만 중백기는 겉만 노랗게 익힌다. 2) 반죽해서 닷 분 두께로 밀고 한 치 길이, 팔 푼 너비로 베어서 지지고 아무것도 바르지 않고 그대로 접시에 놓는다.과줄판, 판자, 밀방망이, 번철, 그릇,[밀가루, , , , ,1,근, ,주재료],[설탕, , , , ,80,돈, ,부재료],[기름, , , , ,1,종자, ,주재료],[물, , , , ,1.5,종자, ,부재료],[계핏가루, , , , , , , ,부재료],[술, , , , , , , ,부재료]
9622104326천문동125651고사신서攷事新書서명응(徐命膺)1771년<NA>A23L 1/315, A23L 1/39A21C 03/, A21B 05/천문동, 天門冬, 천문동 뿌리2014천문동天門冬천문동<NA><NA>天門冬取根蒸熟,去皮心食之,甚香羙。荒年取啖,足以斷穀止飢。천문동(天門冬) 뿌리를 취해 쪄서 익혀 껍질과 속을 제거하고 먹으면 아주 향긋하고 맛이 좋다. 흉년에 취해 먹으면 곡식을 끊고도 배고픔을 견딜 수 있다.1) 천문동 뿌리를 취하여 쪄서 익힌다. 2) 익힌 천문동 뿌리는 껍질과 속을 제거한다.<NA>[천문동 뿌리, , , , , , , ,주재료]
대표식품 코드대표 식품명전통 식품코드출전문헌 현대어출전문헌 원어출전문헌 저자명출전문헌 간행년도출전문헌 해당페이지분석정보 IPC분석정보 KTKRC분석정보 KEYWORDDB 구축년도전통식품명전통식품명 (원문)전통식품명 (독음)전통식품명 (영문-음가)전통식품명 (영문-번역)원문번역문조리법 (가공기술)조리기기 및 도구식재료 및 배합비 [식재료명,원문,독음,이명,이명(영문),수량,단위,기타,주재료,부재료]
3130101361마늘104202박해통고博海通攷저자미상18세기 이후<NA>A23L 1/315, A23L 1/39<NA>마늘보관법, 藏大蒜法, 장대산법 , 마늘, 초2012마늘보관법藏大蒜法장대산법<NA><NA>剝去皮浸好醋中,屡過月,愈久愈新,味佳臭醎。마늘의 껍질을 벗겨버리고 양질의 초에 담근다. 여러 달 담그는데 시간이 길면 길수록 신선하고 향이 좋고 간도 알맞아 좋다.1) 마늘의 껍질을 벗겨 초에 담근다. 2) 한 달 이상 담그는데 시간이 길면 길수록 신선하고 향이 좋고 간도 알맞아 맛이 좋다.<NA>[마늘, , , , , , , ,주재료],[초, , , , , , , ,부재료]
9907104447치즈120993임원십육지林園十六志서유구(徐有榘)1835년경<NA>A23C 19/08, A23C 9/00<NA>간장을 넣어 만든 치즈 만드는 방법, 造乳團法, 조유단법2013간장을 넣어 만든 치즈 만드는 방법造乳團法조유단법<NA><NA>用酪五升煎滚,入冷醬水半升,必自成塊,未成更入醬一盞,以帛包搦如乳餅樣,收之。《 臞仙神隱書 》타락[酪] 5되를 팔팔 끓여 차가운 간장물 0.5되를 넣으면 반드시 저절로 덩어리가 진다. 덩어리 지지 않으면 다시 간장 1잔을 넣고 명주천으로 싸서 눌러서 유병모양을 만들어 보관한다. 《 구선신은서 》1) 우유 5되를 팔팔 끓여 차가운 간장 0.5되를 넣는다. 2) 저절로 덩어리가 엉겨진다. - 덩어리 지지 않으면 다시 간장 1잔을 넣는다. 3) 명주천으로 싸서 짓눌러서 유병모양을 만든다.명주천[우유, , , , ,5,되, ,부재료],[차가운 간장, , , , ,0.5,되, ,부재료]
5557102540석박지121615이조궁정요리통고李朝宮庭料理通考한희순(韓熙順), 황혜성(黃慧性), 이혜경(李惠卿)1957년142A23B 7/10, A23L 1/23<NA>석박지, 무, 배추, 미나리2013석박지석박지석박지<NA><NA>재료 무우 20개 배추 20통 갓, 미나리, 배, 밤 파, 마늘, 생강, 청각, 굴, 고추 젓국 조리법 배추와 무를 썰어서 심심하게 절여서 여러 가지 양념을 넣고 버무려서 비늘김치(무나 오이비늘김치)를 넣으면서 독에 담고 돌로 누르고 젓국을 붓고 꼭 봉한다.재료 무 20개 배추 20통 갓, 미나리, 배, 밤 파, 마늘, 생강, 청각, 굴, 고추 젓국 조리법 배추와 무를 썰어서 심심하게 절여서 여러 가지 양념을 넣고 버무려서 비늘 김치 또는 무나 오이비늘 김치를 넣으면서 독에 담고 돌로 누르고 젓국을 붓고 꼭 봉한다.1) 배추, 무는 썰어서 심심하게 절인다. 2) 1)과 여러 가지 양념을 넣고 버무려서 비늘 김치 또는 무나 오이비늘 김치를 넣으면서 독에 담고 돌로 누른다. 3) 젓국을 붓고 꼭 봉한다.[배추, , , , , , , ,부재료],[무, , , , , , , ,부재료],[소금, , , , , , , ,부재료],[여러 가지 양념, , , , , , , ,부재료],[비늘 김치, , , , , , ,또는 무나 오이비늘 김치,부재료],[젓국, , , , , , , ,부재료]
7917103564원미109237조선요리법朝鮮料理法조자호(趙慈鎬)19.43一五○A23L 1/10<NA>원미, 멥쌀, 설탕, 물, 수조2012원미원미원미<NA><NA>재료 멥쌀, 설탕, 얼음, 약소주. 만드는 법 멥쌀을 정히 씻어 일어서 건저 말려가지고 매에다 반씩 쪼개지게 타서 홑체로 가루는 쳐버리고 싸라기만 물을 먼저 끓이다가 알맞히 놓고 죽을 쑤는데 보통 죽보다 되직하게 쑤어 가지고 찬물에 채워 훨씬 식힙니다. 그런 후 쑤어 놓은 죽을 자실만큼 뜨고 소주를 조금 타고 설탕을 적당히 타고 얼음을 잘게 깨 처질러 잡수십시오.재료 멥쌀, 설탕, 얼음, 약소주 만드는 법 멥쌀을 깨끗이 씻어 일어서 건져 말려가지고 매에다 반씩 쪼개지게 만들어 홑체로 가루는 쳐버리고 싸라기만 물을 먼저 끓이다가 알맞게 놓고 죽을 쑤는데 보통 죽보다 되직하게 쑤어 가지고 찬물에 채워 식힌다. 그런 후 쑤어 놓은 죽을 자실만큼 뜨고 소주를 조금 타고 설탕도 적당히 타고 얼음을 잘게 깨어서 곁들인다.1) 멥쌀을 깨끗이 씻어 일어서 건져 말려 맷돌에 반씩 쪼개지게 만들어 홑체로 가루는 쳐 버린다. 2) 싸라기만 물을 먼저 끓이다가 알맞게 넣고 죽을 쑤는데 보통 죽보다 되직하게 쑨다. 3) 찬물에 채워 식힌다. 4) 쑤어 놓은 죽을 먹을 만큼 뜨고 소주를 조금 타고 설탕을 적당히 탄다. 5) 얼음을 잘게 깨어 넣는다.맷돌, 홑체[멥쌀, , , , , , , ,부재료],[설탕, , , , , , , ,부재료],[얼음, , , , , , , ,부재료],[약소주, , , , , , , ,부재료]
10684104767호두경단183147거가필용사류전집居家必用事類全集미상원나라 시대<NA>A23L 7/10, A23P 1/08<NA>設克兒疋剌, 설극아필랄, 호두육, 꿀, 떡2016호두경단設克兒疋剌설극아필랄<NA><NA>胡桃肉溫水退皮二斤, 淨控乾, 下擂盆搗碎. 入熟蜜一斤, 曲呂車燒餠揉碎一斤. 三件拌勻, 掿作小團塊. 用曲呂車燒餠劑包餡, 捏作糝孛撒樣, 入爐貼熟爲度.호두육 2근을 온수에 넣고 떫은 껍질을 벗겨내고 깨끗이 씻어 물기를 빼내고 절구에 넣어 잘게 부순다. 익은 꿀 1근, 곡려차(曲呂車))로 늘려서 구운 떡을 주물러 부순 것 1근을 준비한다. 이상의 3가지를 잘 반죽하여 손으로 주물러 작은 경단을 만든다. 곡려차로 늘려서 구운 떡으로 경단을 싸고 손으로 집어서 삼패살(糝孛撒, 밀가루로 만든 피에 소를 싼 것)처럼 만들고 익을 때까지 화로에 넣어 굽는다.1) 호두육 2근을 온수에 넣고 떫은 껍질을 벗겨낸다. 2) 1)을 깨끗이 씻어 물기를 빼내고 절구에 넣어 잘게 부순다. 3) 익은 꿀 1근, 곡려차로 늘려서 구운 떡을 주물러 부순 것 1근을 준비한다. 4) 2)와 3)을 잘 반죽하여 손으로 주물러 작은 경단을 만든다. 5) 곡려차로 늘려서 구운 떡으로 경단을 싸고 손으로 집어서 밀가루로 만든 피에 소를 싼 것처럼 만든다. 6) 익을 때까지 화로에 넣어 굽는다.절구, 화로[호두육, , , , ,2,근, ,주재료],[꿀, , , , ,1,근, ,부재료],[구운 떡, , , , ,1,근, ,부재료]
8405103791잡탕109315조선요리법朝鮮料理法조자호(趙慈鎬)19.43一九七A23L 1/39<NA>잡탕, 양, 사태, 계란, 정육, 깨소금2012잡탕잡탕잡탕japtangMixed Seafood Stew재료 양 사태 곤자소니 뼈도가니 흘떠러기 각각마음대로, 무도알맞히, 실백, 계란, 미나리, 밀가루, 정육, 간장, 깨소금, 후추가루, 파, 마늘. 만드는법 양은 끓는물에 튀해서 껍질을벗기고 곤자소니 뼈도가니 흘떠러기등을 정하게씻어서 무도씻어넣고 한데 보통곰국처럼 푹끕니다. 다 익거든 건저서 보통 곰국건디기보다는 조금자잘하게썰어서 갖은양념을하고 무도보통국에무보다 잘게썰어 양념해서 국물에다시넣어 끓여서놓고 미나리는 지단을붙여서 나붓나붓하게썰고 고기는 조그만치다저서 갖은양념을해서 모루기를만들고 지간을 노른가 흰자를 각각붙여서 미나리는 지단과같은치수로 썰어놓았다가 국을뜨고 모루기를넣고 지단부처썬것과 실백을얹읍니다. 이것은 특히 비빔국수에 제일격이고, 보통 밥상에 놓아도 관계없읍니다.재료 양, 사태, 곤자소니, 뼈도가니, 홀떼기, 무, 잣, 달걀, 미라니, 밀가루, 정육, 간장, 깨소금, 후춧가루, 파, 마늘 만드는 법 양은 끓는 물에 데쳐 껍질을 벗기고 곤자소니, 뼈도가니, 흘떠러기 등은 깨끗하게 씻고 무도 씻어 넣고 같이 보통 곰국처럼 푹 끓인다. 다 익으면 건져서 보통 곰국 건더기보다는 조금 자잘하게 썰어서 갖은 양념을 하고 무도 보통 국의 무보다 잘게 썰어 양념해서 국물에 다시 넣어 끓여놓고 미나리는 지단을 부쳐서 납작하게 썰고 고기는 작게 다져서 갖은 양념을 해서 완자를 만들고 노른자, 흰자 지간을 각각 부치고 미나리는 지단과 같은 치수로 썰어놓았다가 국을 뜨고 완자를 넣고 지단 부쳐 썬 것과 잣을 얹는다. 이것은 특히 비빔국수에 제격이고, 보통 밥상에 놓아도 관계없다.1) 양은 끓는 물에 데쳐 껍질을 벗기고 곤자소니, 뼈도가니, 홀떼기 등은 깨끗하게 씻어 놓는다. 2) 무는 씻어 1)과 함께 보통 곰국처럼 푹 끓인다. 3) 2)가 다 익으면 건더기를 건져 조금 자잘하게 썰고 갖은 양념을 하고 다시 국물에 넣어 끓인다. 4) 고기는 작게 다져 갖은 양념을 하여 완자를 만든다. 5) 달걀은 황백 지단을 부치고, 미나리는 초대를 만들어 썰어놓는다. 6) 그릇에 3)을 담고, 4), 5), 잣을 고명으로 얹는다.<NA>[양, , , , , , , ,주재료],[사태, , , , , , , ,주재료],[곤자소니, , , , , , , ,주재료],[뼈도가니, , , , , , , ,주재료],[홀떼기, , , , , , , ,주재료],[무, , , , , , , ,주재료],[잣, , , , , , , ,부재료],[달걀, , , , , , , ,부재료],[미나리, , , , , , , ,주재료],[밀가루, , , , , , , ,부재료],[정육, , , , , , , ,주재료],[간장, , , , , , , ,부재료],[깨소금, , , , , , , ,부재료],[후추가루, , , , , , , ,부재료],[파, , , , , , , ,부재료],[마늘, , , , , , , ,부재료]
10674104765호두12154진연의궤進宴儀軌저자미상1902년<NA>A61K 36/52<NA>호두, 實胡桃, 실호도2011호두實胡桃실호도<NA><NA>實胡桃一器: 高一尺五寸 實胡桃五斗실호도 1그릇: 고임높이 1자 5치, 호두 5말<NA><NA>[호두, , , , ,5,말,(1그릇),주재료]
1745100792녹두차103769산림경제山林經濟홍만선(洪萬選)1700년대<NA>A23L 2/38, A23L 2/04<NA>녹두차, 菉豆茶法, 녹두차법, 녹두, 차, 꿀2012녹두차菉豆茶法녹두차법<NA><NA>菉豆暫烹,取水和蜜飲之,色绿味清,觧胃熱。녹두를 잠깐 삶아 낸 물에 꿀을 타서 마신다. 색상이 푸르고 맛이 담백하다. 위 열을 제거해준다.1) 녹두를 잠깐 삶아 낸다. 2) 녹두 삶은 물에 꿀을 타서 마신다. - 색상이 푸르고 맛이 담백하다.<NA>[녹두, , , , , , , ,주재료],[꿀, , , , , , , ,부재료]
898100410곰국105792우리음식우리음식손정규(孫貞圭)194861A23L 1/313, A23L 1/39, A23L 1/312<NA>곰국(肉湯), 육탕 , 사태, 허파, 양, 곱창, 무, 파2012곰국곰국(肉湯)곰국(육탕)gomgukBeef Bone Soup재료 사태 200그람. 꼬리 200그람. 허파 120그람. 양 120그람. 곱창 120그람. 무 또는 파 600그람. 장 약간. 물 600릿틀. 여러 가지 고기를 모두 씻어서 통 채 끓인다. 거진 익었을 때 무와 파를 통 채 넣고, 장으로 간을 하여 다시 끓인다. 푹 무른 뒤에 고기와 무를 건져 잘게 썰어 후추와 다진 파로 양념을 하여, 뜨거운 국물에 넣어서 먹는다.재료 사태 200g, 꼬리 200g, 허파 120g, 양 120g, 곱창 120g, 무 또는 파 600g, 간장 약간, 물 600L 여러 가지 고기를 모두 씻어서 통째로 끓인다. 거의 익었을 때 무와 파를 통째로 넣고 장으로 간을 하여 다시 끓인다. 푹 무른 뒤에 고기와 무를 건져 잘게 썰고 후추와 다진 파로 양념을 한 후 뜨거운 국물에 넣어 먹는다.1) 여러 가지 고기를 모두 씻어서 통째로 끓인다. 2) 거의 익었을 때 무와 파를 통째로 넣고 장으로 간을 하여 다시 끓인다. 3) 푹 무른 뒤에 고기와 무를 건져 잘게 썰고 후추와 다진 파로 양념을 한 후 뜨거운 국물에 넣어 먹는다.<NA>[사태, , , , ,200,g, ,주재료],[꼬리, , , , ,200,g, ,주재료],[허파, , , , ,120,g, ,주재료],[양, , , , ,120,g, ,주재료],[곱창, , , , ,120,g, ,주재료],[무, , , , ,600,g,또는 파,부재료],[간장, , , , ,약간, , ,부재료],[물, , , , ,600,L, ,부재료]
1683100770녹두국수120294임원십육지林園十六志서유구(徐有榘)1835년경<NA>A23L 1/315, A23L 1/39<NA>녹두국수, 真珠麫方, 진주면방2013녹두국수真珠麫方진주면방<NA><NA>雉鷄鵝鴨中取膏而軟者, 暫烹去水氣, 剉作緑豆大或黃豆大, 拖緑豆粉中拌勻, 令不相粘着, 以淡醬水烹之, 入芝麻油雞卵菌蕈石茸等交胎用之。《三山方》꿩, 닭, 오리, 거위의 연한 살찐 고기를 취하여 삶아 물기를 빼고 녹두 혹은 대두 크기로 썰어 녹두가루에 넣어 서로 붙지 않게 골고루 섞은 다음 묽은 장물에 넣어 끓여 참기름과 계란 그리고 석이버섯 등의 버섯류를 넣는다. 《삼산방》1) 꿩, 닭, 오리, 거위의 연한 살찐 고기부위를 살짝 삶아 물기를 뺀다. 2) 물기를 뺀 고기를 녹두 혹은 대두 크기로 썰어 녹두가루에 넣고 서로 붙지 않게 골고루 섞는다. 3) 묽은 장물에 2)를 넣고 끓인다. 4) 참기름, 계란, 석이버섯 등의 버섯류를 넣는다.<NA>[꿩, , , , , , , ,부재료],[닭, , , , , , , ,부재료],[오리, , , , , , , ,부재료],[거위의 연한 살찐 고기부위, , , , , , , ,부재료],[녹두가루, , , , , , , ,주재료],[묽은 장물, , , , , , , ,부재료],[참기름, , , , , , , ,부재료],[계란, , , , , , , ,부재료],[석이버섯, , , , , , ,등의 버섯류,부재료]