Overview

Dataset statistics

Number of variables: 8
Number of observations: 8752
Missing cells: 5
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 555.7 KiB
Average record size in memory: 65.0 B
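
The derived figures in this block follow from the raw counts. The sketch below recomputes the missing-cell percentage and the average record size from the values reported above; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 8
n_observations = 8752
missing_cells = 5
total_size_kib = 555.7

# Every observation has one cell per variable.
total_cells = n_variables * n_observations

# Share of missing cells, as a percentage of all cells.
missing_pct = 100 * missing_cells / total_cells

# Average record size in bytes (1 KiB = 1024 bytes).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

The missing share is far below the report's display threshold, which is why it is shown as "< 0.1%" rather than as a number.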

Variable types

Categorical: 4
Text: 3
Numeric: 1

Dataset

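Description: 대분류명 (major category name), 중분류명 (middle category name), 소분류명 (minor category name), 식재료코드 (ingredient code), 식재료명 (ingredient name), 브랜드명 (brand name), 생산연도 (production year), 규격 (specification)
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-20919/S/1/datasetView.do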

Alerts

대분류명 is highly overall correlated with 식재료코드 and 3 other fields (High correlation)
브랜드명 is highly overall correlated with 식재료코드 and 3 other fields (High correlation)
중분류명 is highly overall correlated with 대분류명 and 1 other fields (High correlation)
식재료코드 is highly overall correlated with 대분류명 and 1 other fields (High correlation)
생산연도 is highly overall correlated with 대분류명 and 1 other fields (High correlation)
브랜드명 is highly imbalanced (82.3%) (Imbalance)
식재료코드 has unique values (Unique)
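
The report does not define the imbalance percentage. The sketch below assumes the score is one minus the Shannon entropy of the category counts divided by log2 of the number of categories. That assumption is consistent with the counts shown in this report but is not stated in it. The name `imbalance_score` is hypothetical. The example uses the four 대분류명 counts, because the full list of 브랜드명 counts is not shown here.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the base-2 Shannon entropy of the counts.

    0.0 means perfectly balanced categories; values near 1.0 mean one
    category dominates. This definition is an assumption, not quoted
    from the report.
    """
    k = len(counts)
    if k <= 1:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# 대분류명 counts as listed in this report.
major_category_counts = [4181, 1909, 1650, 1012]
score = imbalance_score(major_category_counts)
print(f"imbalance score: {score:.3f}")
```

Under this definition, a column such as 브랜드명, where one value covers roughly 90% of rows, scores close to 1, while the more even 대분류명 split scores close to 0.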

Reproduction

Analysis started: 2024-05-11 01:00:38.901354
Analysis finished: 2024-05-11 01:00:42.479521
Duration: 3.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
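
The reported duration is the difference between the two timestamps. A minimal check using only the standard library, with the timestamps copied from the block above:

```python
from datetime import datetime

# Timestamps as printed in the "Reproduction" block.
started = datetime.fromisoformat("2024-05-11 01:00:38.901354")
finished = datetime.fromisoformat("2024-05-11 01:00:42.479521")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```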

Variables

대분류명
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 68.5 KiB

축산물: 4181
수산물: 1909
농산물: 1650
가공식품: 1012

Length

Max length: 4
Median length: 3
Mean length: 3.1156307
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가공식품
2nd row: 가공식품
3rd row: 가공식품
4th row: 농산물
5th row: 가공식품

Common Values

Value | Count | Frequency (%)
축산물 | 4181 | 47.8%
수산물 | 1909 | 21.8%
농산물 | 1650 | 18.9%
가공식품 | 1012 | 11.6%
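
The frequency percentages and the mean length for 대분류명 can be recomputed from the four counts. The sketch below assumes that length is counted in Unicode code points, one per precomposed Hangul syllable; that assumption reproduces the reported mean.

```python
# Category counts for 대분류명 as listed in this report.
counts = {"축산물": 4181, "수산물": 1909, "농산물": 1650, "가공식품": 1012}

total = sum(counts.values())

# Frequency of each category as a percentage of all observations.
frequencies = {value: 100 * count / total for value, count in counts.items()}
for value, pct in frequencies.items():
    print(f"{value}\t{counts[value]}\t{pct:.1f}%")

# Count-weighted mean string length (in code points).
mean_length = sum(len(value) * count for value, count in counts.items()) / total
print(f"mean length: {mean_length:.7f}")
```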

Length

Histogram of lengths of the category

Common Values (Plot)


Words

Value | Count | Frequency (%)
축산물 | 4181 | 47.8%
수산물 | 1909 | 21.8%
농산물 | 1650 | 18.9%
가공식품 | 1012 | 11.6%

중분류명
Categorical

HIGH CORRELATION 

Distinct: 26
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 68.5 KiB

한우: 1565
돈육: 1300
농산물가공식품: 1013
가금류: 824
건어물: 718
Other values (21): 3332

Length

Max length: 7
Median length: 5
Mean length: 3.1242002
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 농산물가공식품
2nd row: 농산물가공식품
3rd row: 농산물가공식품
4th row: 버섯류
5th row: 농산물가공식품

Common Values

Value | Count | Frequency (%)
한우 | 1565 | 17.9%
돈육 | 1300 | 14.9%
농산물가공식품 | 1013 | 11.6%
가금류 | 824 | 9.4%
건어물 | 718 | 8.2%
어류 | 690 | 7.9%
육우 | 492 | 5.6%
양곡부류 | 381 | 4.4%
과일류 | 322 | 3.7%
연체류 | 239 | 2.7%
Other values (16) | 1208 | 13.8%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
한우 | 1565 | 17.9%
돈육 | 1300 | 14.9%
농산물가공식품 | 1013 | 11.6%
가금류 | 824 | 9.4%
건어물 | 718 | 8.2%
어류 | 690 | 7.9%
육우 | 492 | 5.6%
양곡부류 | 381 | 4.4%
과일류 | 322 | 3.7%
연체류 | 239 | 2.7%
Other values (16) | 1208 | 13.8%

소분류명
Text

Distinct: 374
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Memory size: 68.5 KiB

Length

Max length: 10
Median length: 9
Mean length: 4.2346892
Min length: 1

Characters and Unicode

Total characters: 37062
Distinct characters: 282
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 45
Unique (%): 0.5%

Sample

1st row: 가시오가피
2nd row: 가지말랭이
3rd row: 가지말랭이
4th row: 새송이버섯
5th row: 감말랭이

Words

Value | Count | Frequency (%)
닭고기 | 652 | 7.4%
한우(설도 | 301 | 3.4%
한우(우둔 | 249 | 2.8%
돈육(등심 | 240 | 2.7%
돈육(전지 | 179 | 2.0%
돈육(삼겹 | 178 | 2.0%
오렌지 | 166 | 1.9%
돈육(후지 | 162 | 1.9%
돈육(안심 | 160 | 1.8%
돈육(목심 | 160 | 1.8%
Other values (364) | 6305 | 72.0%

Most occurring characters

ValueCountFrequency (%)
( 3500
 
9.4%
) 3500
 
9.4%
2671
 
7.2%
1792
 
4.8%
1565
 
4.2%
1300
 
3.5%
1091
 
2.9%
969
 
2.6%
882
 
2.4%
790
 
2.1%
Other values (272) 19002
51.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30062
81.1%
Open Punctuation 3500
 
9.4%
Close Punctuation 3500
 
9.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2671
 
8.9%
1792
 
6.0%
1565
 
5.2%
1300
 
4.3%
1091
 
3.6%
969
 
3.2%
882
 
2.9%
790
 
2.6%
652
 
2.2%
650
 
2.2%
Other values (270) 17700
58.9%
Open Punctuation
ValueCountFrequency (%)
( 3500
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3500
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 30062
81.1%
Common 7000
 
18.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2671
 
8.9%
1792
 
6.0%
1565
 
5.2%
1300
 
4.3%
1091
 
3.6%
969
 
3.2%
882
 
2.9%
790
 
2.6%
652
 
2.2%
650
 
2.2%
Other values (270) 17700
58.9%
Common
ValueCountFrequency (%)
( 3500
50.0%
) 3500
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 30062
81.1%
ASCII 7000
 
18.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 3500
50.0%
) 3500
50.0%
Hangul
ValueCountFrequency (%)
2671
 
8.9%
1792
 
6.0%
1565
 
5.2%
1300
 
4.3%
1091
 
3.6%
969
 
3.2%
882
 
2.9%
790
 
2.6%
652
 
2.2%
650
 
2.2%
Other values (270) 17700
58.9%

식재료코드
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 8752
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1122937.6
Minimum: 1118269
Maximum: 1128233
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 77.1 KiB

Quantile statistics

Minimum: 1118269
5-th percentile: 1118729.1
Q1: 1120544.8
median: 1122738.5
Q3: 1125422.2
95-th percentile: 1127241.4
Maximum: 1128233
Range: 9964
Interquartile range (IQR): 4877.5

Descriptive statistics

Standard deviation: 2780.3063
Coefficient of variation (CV): 0.0024759223
Kurtosis: -1.1993959
Mean: 1122937.6
Median Absolute Deviation (MAD): 2434
Skewness: 0.087843196
Sum: 9.82795 × 10^9
Variance: 7730103.2
Monotonicity: Not monotonic
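
Several of the 식재료코드 statistics are derived from others in the same block. The sketch below recomputes the range, IQR, coefficient of variation, variance and sum from the reported values. Because the inputs are already rounded for display, the results match the report only to within rounding.

```python
# Values copied from the 식재료코드 statistics in this report.
n_observations = 8752
minimum, maximum = 1118269, 1128233
q1, q3 = 1120544.8, 1125422.2
mean, std = 1122937.6, 2780.3063

# Range is the spread between the extreme values.
value_range = maximum - minimum

# IQR is Q3 minus Q1; the displayed quartiles are rounded to one decimal,
# so this can differ from the reported IQR in the last digit.
iqr = q3 - q1

# Coefficient of variation is the standard deviation relative to the mean.
cv = std / mean

# Variance is the square of the standard deviation.
variance = std ** 2

# Sum is the mean multiplied by the number of observations.
total = mean * n_observations

print(f"range: {value_range}")
print(f"IQR: {iqr:.1f}")
print(f"CV: {cv:.10f}")
print(f"variance: {variance:.1f}")
print(f"sum: {total:.5e}")
```

A coefficient of variation this small reflects that the codes occupy a narrow band of about ten thousand values around 1.12 million, which is typical of an identifier column rather than a measured quantity.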
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1118269 | 1 | < 0.1%
1124648 | 1 | < 0.1%
1124642 | 1 | < 0.1%
1124643 | 1 | < 0.1%
1124644 | 1 | < 0.1%
1124645 | 1 | < 0.1%
1124646 | 1 | < 0.1%
1124647 | 1 | < 0.1%
1124649 | 1 | < 0.1%
1124691 | 1 | < 0.1%
Other values (8742) | 8742 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
1118269 | 1 | < 0.1%
1118270 | 1 | < 0.1%
1118271 | 1 | < 0.1%
1118272 | 1 | < 0.1%
1118273 | 1 | < 0.1%
1118274 | 1 | < 0.1%
1118275 | 1 | < 0.1%
1118276 | 1 | < 0.1%
1118277 | 1 | < 0.1%
1118278 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
1128233 | 1 | < 0.1%
1128225 | 1 | < 0.1%
1128224 | 1 | < 0.1%
1128223 | 1 | < 0.1%
1128222 | 1 | < 0.1%
1128221 | 1 | < 0.1%
1128220 | 1 | < 0.1%
1128219 | 1 | < 0.1%
1128218 | 1 | < 0.1%
1128217 | 1 | < 0.1%

식재료명
Text

Distinct: 1352
Distinct (%): 15.4%
Missing: 0
Missing (%): 0.0%
Memory size: 68.5 KiB

Length

Max length: 33
Median length: 28
Mean length: 6.7390311
Min length: 1

Characters and Unicode

Total characters: 58980
Distinct characters: 459
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 614
Unique (%): 7.0%

Sample

1st row: 가시오가피/율성맑은아침
2nd row: 가지말랭이/산림조합
3rd row: 가지말랭이/청채마루
4th row: 새송이버섯
5th row: 감말랭이/산림조합

Words

Value | Count | Frequency (%)
한우(설도 | 300 | 3.4%
한우(우둔 | 249 | 2.8%
돈육(등심 | 240 | 2.7%
닭고기 | 235 | 2.7%
돈육(전지 | 179 | 2.0%
돈육(후지 | 162 | 1.9%
돈육(안심 | 160 | 1.8%
돈육(목심 | 160 | 1.8%
돈육(삼겹 | 160 | 1.8%
한우(전각 | 142 | 1.6%
Other values (1341) | 6765 | 77.3%

Most occurring characters

ValueCountFrequency (%)
( 5346
 
9.1%
) 5345
 
9.1%
2727
 
4.6%
/ 2400
 
4.1%
2134
 
3.6%
1953
 
3.3%
1300
 
2.2%
1133
 
1.9%
1068
 
1.8%
956
 
1.6%
Other values (449) 34618
58.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43740
74.2%
Open Punctuation 5367
 
9.1%
Close Punctuation 5366
 
9.1%
Other Punctuation 2760
 
4.7%
Decimal Number 1299
 
2.2%
Lowercase Letter 304
 
0.5%
Dash Punctuation 85
 
0.1%
Uppercase Letter 24
 
< 0.1%
Math Symbol 22
 
< 0.1%
Connector Punctuation 13
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2727
 
6.2%
2134
 
4.9%
1953
 
4.5%
1300
 
3.0%
1133
 
2.6%
1068
 
2.4%
956
 
2.2%
722
 
1.7%
683
 
1.6%
636
 
1.5%
Other values (414) 30428
69.6%
Decimal Number
ValueCountFrequency (%)
0 380
29.3%
9 273
21.0%
1 157
12.1%
2 134
 
10.3%
5 117
 
9.0%
3 95
 
7.3%
4 49
 
3.8%
7 43
 
3.3%
8 37
 
2.8%
6 14
 
1.1%
Lowercase Letter
ValueCountFrequency (%)
g 190
62.5%
k 54
 
17.8%
c 19
 
6.2%
a 16
 
5.3%
p 16
 
5.3%
m 5
 
1.6%
x 2
 
0.7%
l 2
 
0.7%
Other Punctuation
ValueCountFrequency (%)
/ 2400
87.0%
, 217
 
7.9%
% 122
 
4.4%
& 10
 
0.4%
* 7
 
0.3%
. 4
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
F 10
41.7%
S 10
41.7%
G 4
 
16.7%
Open Punctuation
ValueCountFrequency (%)
( 5346
99.6%
[ 21
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 5345
99.6%
] 21
 
0.4%
Math Symbol
ValueCountFrequency (%)
~ 20
90.9%
+ 2
 
9.1%
Dash Punctuation
ValueCountFrequency (%)
- 85
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43737
74.2%
Common 14912
 
25.3%
Latin 328
 
0.6%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2727
 
6.2%
2134
 
4.9%
1953
 
4.5%
1300
 
3.0%
1133
 
2.6%
1068
 
2.4%
956
 
2.2%
722
 
1.7%
683
 
1.6%
636
 
1.5%
Other values (412) 30425
69.6%
Common
ValueCountFrequency (%)
( 5346
35.9%
) 5345
35.8%
/ 2400
16.1%
0 380
 
2.5%
9 273
 
1.8%
, 217
 
1.5%
1 157
 
1.1%
2 134
 
0.9%
% 122
 
0.8%
5 117
 
0.8%
Other values (14) 421
 
2.8%
Latin
ValueCountFrequency (%)
g 190
57.9%
k 54
 
16.5%
c 19
 
5.8%
a 16
 
4.9%
p 16
 
4.9%
F 10
 
3.0%
S 10
 
3.0%
m 5
 
1.5%
G 4
 
1.2%
x 2
 
0.6%
Han
ValueCountFrequency (%)
2
66.7%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43737
74.2%
ASCII 15240
 
25.8%
CJK 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 5346
35.1%
) 5345
35.1%
/ 2400
15.7%
0 380
 
2.5%
9 273
 
1.8%
, 217
 
1.4%
g 190
 
1.2%
1 157
 
1.0%
2 134
 
0.9%
% 122
 
0.8%
Other values (25) 676
 
4.4%
Hangul
ValueCountFrequency (%)
2727
 
6.2%
2134
 
4.9%
1953
 
4.5%
1300
 
3.0%
1133
 
2.6%
1068
 
2.4%
956
 
2.2%
722
 
1.7%
683
 
1.6%
636
 
1.5%
Other values (412) 30425
69.6%
CJK
ValueCountFrequency (%)
2
66.7%
1
33.3%

브랜드명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 33
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 68.5 KiB

<NA>: 7847
체리부로: 141
하늘농가: 129
닭터의자연: 117
수덕푸드: 56
Other values (28): 462

Length

Max length: 10
Median length: 4
Mean length: 4.0933501
Min length: 2

Unique

Unique: 4
Unique (%): < 0.1%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 7847 | 89.7%
체리부로 | 141 | 1.6%
하늘농가 | 129 | 1.5%
닭터의자연 | 117 | 1.3%
수덕푸드 | 56 | 0.6%
농업회사법인㈜푸르원 | 51 | 0.6%
자연품은 | 50 | 0.6%
주식회사 올품 | 50 | 0.6%
자연실록 | 44 | 0.5%
주원산오리 | 43 | 0.5%
Other values (23) | 224 | 2.6%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
na | 7847 | 89.1%
체리부로 | 141 | 1.6%
하늘농가 | 129 | 1.5%
닭터의자연 | 117 | 1.3%
자연실록 | 59 | 0.7%
수덕푸드 | 56 | 0.6%
농업회사법인㈜푸르원 | 51 | 0.6%
올품 | 50 | 0.6%
주식회사 | 50 | 0.6%
자연품은 | 50 | 0.6%
Other values (25) | 261 | 3.0%

생산연도
Categorical

HIGH CORRELATION 

Distinct: 19
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 68.5 KiB

2024: 2489
2023년: 2318
2024년: 1710
2023: 615
당해년도: 426
Other values (14): 1194

Length

Max length: 9
Median length: 6
Mean length: 4.5305073
Min length: 1

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 2023년
2nd row: 2023년
3rd row: 2023년
4th row: 2024
5th row: 2023년

Common Values

Value | Count | Frequency (%)
2024 | 2489 | 28.4%
2023년 | 2318 | 26.5%
2024년 | 1710 | 19.5%
2023 | 615 | 7.0%
당해년도 | 426 | 4.9%
별도표기 | 395 | 4.5%
0 | 285 | 3.3%
2023/2024 | 246 | 2.8%
해당년도 | 113 | 1.3%
<NA> | 67 | 0.8%
Other values (9) | 88 | 1.0%
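
The common values show the same year written in more than one way ("2024" and "2024년"), alongside free-text placeholders such as 당해년도 (roughly "the current year") and 별도표기 ("indicated separately"). The sketch below shows one possible way to normalize the single-year forms. The function name `normalize_year` is hypothetical, and mapping every other value (including "0", "2023/2024" and the placeholders) to `None` is an assumption about how such values might be handled, not something the report prescribes.

```python
import re

# A four-digit year, optionally followed by the Korean suffix 년 ("year").
YEAR_PATTERN = re.compile(r"^(\d{4})년?$")


def normalize_year(raw):
    """Return the year as an int, or None when the value is not a single year."""
    if raw is None:
        return None
    match = YEAR_PATTERN.match(raw.strip())
    return int(match.group(1)) if match else None


# Values taken from the 생산연도 common-values table.
samples = ["2024", "2023년", "2024년", "2023", "당해년도",
           "별도표기", "0", "2023/2024", "해당년도", "<NA>"]
normalized = {value: normalize_year(value) for value in samples}
for value, year in normalized.items():
    print(f"{value!r} -> {year}")
```

Merging the suffixed and bare forms this way would fold the four most frequent values into two years; whether ranges such as "2023/2024" should be kept, split or dropped depends on the intended analysis.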

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
2024 | 2489 | 28.4%
2023년 | 2318 | 26.5%
2024년 | 1710 | 19.5%
2023 | 615 | 7.0%
당해년도 | 426 | 4.9%
별도표기 | 395 | 4.5%
0 | 285 | 3.3%
2023/2024 | 246 | 2.8%
해당년도 | 117 | 1.3%
na | 67 | 0.8%
Other values (9) | 85 | 1.0%

규격
Text

Distinct: 4488
Distinct (%): 51.3%
Missing: 5
Missing (%): 0.1%
Memory size: 68.5 KiB

Length

Max length: 60
Median length: 46
Mean length: 23.085744
Min length: 1

Characters and Unicode

Total characters: 201931
Distinct characters: 519
Distinct categories: 12
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2905
Unique (%): 33.2%

Sample

1st row: 가시오가피/0.1kg
2nd row: 가지말랭이/1kg
3rd row: 가지말랭이/0.3kg
4th row: 새송이버섯/0.3kg
5th row: 감말랭이/1kg
ValueCountFrequency (%)
413
 
2.5%
냉동(손질 248
 
1.5%
cm)/3kg 196
 
1.2%
cm)/1kg 192
 
1.2%
cm)/5kg 192
 
1.2%
cm)/0.5kg 180
 
1.1%
구이용 180
 
1.1%
cm)/0.3kg 180
 
1.1%
99%),토막당 163
 
1.0%
냉동(포 146
 
0.9%
Other values (3729) 14268
87.2%

Most occurring characters

ValueCountFrequency (%)
/ 27010
 
13.4%
0 10099
 
5.0%
g 9866
 
4.9%
7655
 
3.8%
k 6547
 
3.2%
( 5818
 
2.9%
) 5810
 
2.9%
1 5345
 
2.6%
3 4738
 
2.3%
. 4364
 
2.2%
Other values (509) 114679
56.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 92832
46.0%
Other Punctuation 37360
18.5%
Decimal Number 30174
 
14.9%
Lowercase Letter 20508
 
10.2%
Space Separator 7655
 
3.8%
Open Punctuation 5818
 
2.9%
Close Punctuation 5810
 
2.9%
Math Symbol 844
 
0.4%
Connector Punctuation 273
 
0.1%
Other Symbol 269
 
0.1%
Other values (2) 388
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4079
 
4.4%
3938
 
4.2%
2923
 
3.1%
2800
 
3.0%
2726
 
2.9%
2417
 
2.6%
2219
 
2.4%
2105
 
2.3%
1917
 
2.1%
1910
 
2.1%
Other values (459) 65798
70.9%
Uppercase Letter
ValueCountFrequency (%)
A 50
22.7%
C 48
21.8%
P 48
21.8%
K 33
15.0%
L 10
 
4.5%
S 10
 
4.5%
O 6
 
2.7%
M 4
 
1.8%
G 3
 
1.4%
X 3
 
1.4%
Other values (2) 5
 
2.3%
Lowercase Letter
ValueCountFrequency (%)
g 9866
48.1%
k 6547
31.9%
m 2040
 
9.9%
c 1980
 
9.7%
x 34
 
0.2%
l 20
 
0.1%
p 7
 
< 0.1%
e 6
 
< 0.1%
t 6
 
< 0.1%
h 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 10099
33.5%
1 5345
17.7%
3 4738
15.7%
5 3755
 
12.4%
2 3363
 
11.1%
4 1303
 
4.3%
9 569
 
1.9%
7 566
 
1.9%
8 219
 
0.7%
6 217
 
0.7%
Other Punctuation
ValueCountFrequency (%)
/ 27010
72.3%
. 4364
 
11.7%
* 3192
 
8.5%
, 2566
 
6.9%
% 221
 
0.6%
& 7
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 603
71.4%
× 130
 
15.4%
= 90
 
10.7%
± 16
 
1.9%
+ 5
 
0.6%
Space Separator
ValueCountFrequency (%)
7655
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5818
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5810
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 273
100.0%
Other Symbol
ValueCountFrequency (%)
269
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 168
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 92803
46.0%
Common 88371
43.8%
Latin 20728
 
10.3%
Han 29
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4079
 
4.4%
3938
 
4.2%
2923
 
3.1%
2800
 
3.0%
2726
 
2.9%
2417
 
2.6%
2219
 
2.4%
2105
 
2.3%
1917
 
2.1%
1910
 
2.1%
Other values (456) 65769
70.9%
Common
ValueCountFrequency (%)
/ 27010
30.6%
0 10099
 
11.4%
7655
 
8.7%
( 5818
 
6.6%
) 5810
 
6.6%
1 5345
 
6.0%
3 4738
 
5.4%
. 4364
 
4.9%
5 3755
 
4.2%
2 3363
 
3.8%
Other values (17) 10414
 
11.8%
Latin
ValueCountFrequency (%)
g 9866
47.6%
k 6547
31.6%
m 2040
 
9.8%
c 1980
 
9.6%
A 50
 
0.2%
C 48
 
0.2%
P 48
 
0.2%
x 34
 
0.2%
K 33
 
0.2%
l 20
 
0.1%
Other values (13) 62
 
0.3%
Han
ValueCountFrequency (%)
13
44.8%
10
34.5%
6
20.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 108684
53.8%
Hangul 92803
46.0%
CJK Compat 269
 
0.1%
None 146
 
0.1%
CJK 29
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 27010
24.9%
0 10099
 
9.3%
g 9866
 
9.1%
7655
 
7.0%
k 6547
 
6.0%
( 5818
 
5.4%
) 5810
 
5.3%
1 5345
 
4.9%
3 4738
 
4.4%
. 4364
 
4.0%
Other values (37) 21432
19.7%
Hangul
ValueCountFrequency (%)
4079
 
4.4%
3938
 
4.2%
2923
 
3.1%
2800
 
3.0%
2726
 
2.9%
2417
 
2.6%
2219
 
2.4%
2105
 
2.3%
1917
 
2.1%
1910
 
2.1%
Other values (456) 65769
70.9%
CJK Compat
ValueCountFrequency (%)
269
100.0%
None
ValueCountFrequency (%)
× 130
89.0%
± 16
 
11.0%
CJK
ValueCountFrequency (%)
13
44.8%
10
34.5%
6
20.7%

Interactions


Correlations

 | 대분류명 | 중분류명 | 식재료코드 | 브랜드명 | 생산연도
대분류명 | 1.000 | 1.000 | 0.787 | 1.000 | 0.782
중분류명 | 1.000 | 1.000 | 0.826 | 0.998 | 0.803
식재료코드 | 0.787 | 0.826 | 1.000 | 0.912 | 0.679
브랜드명 | 1.000 | 0.998 | 0.912 | 1.000 | 0.958
생산연도 | 0.782 | 0.803 | 0.679 | 0.958 | 1.000

 | 생산연도 | 대분류명 | 브랜드명 | 중분류명
생산연도 | 1.000 | 0.558 | 0.772 | 0.362
대분류명 | 0.558 | 1.000 | 0.978 | 0.998
브랜드명 | 0.772 | 0.978 | 1.000 | 0.911
중분류명 | 0.362 | 0.998 | 0.911 | 1.000

 | 식재료코드 | 대분류명 | 중분류명 | 브랜드명 | 생산연도
식재료코드 | 1.000 | 0.607 | 0.478 | 0.620 | 0.338
대분류명 | 0.607 | 1.000 | 0.998 | 0.978 | 0.558
중분류명 | 0.478 | 0.998 | 1.000 | 0.911 | 0.362
브랜드명 | 0.620 | 0.978 | 0.911 | 1.000 | 0.772
생산연도 | 0.338 | 0.558 | 0.362 | 0.772 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

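First rows

(index) | 대분류명 | 중분류명 | 소분류명 | 식재료코드 | 식재료명 | 브랜드명 | 생산연도 | 규격
0 | 가공식품 | 농산물가공식품 | 가시오가피 | 1118269 | 가시오가피/율성맑은아침 | <NA> | 2023년 | 가시오가피/0.1kg
1 | 가공식품 | 농산물가공식품 | 가지말랭이 | 1118270 | 가지말랭이/산림조합 | <NA> | 2023년 | 가지말랭이/1kg
2 | 가공식품 | 농산물가공식품 | 가지말랭이 | 1118271 | 가지말랭이/청채마루 | <NA> | 2023년 | 가지말랭이/0.3kg
3 | 농산물 | 버섯류 | 새송이버섯 | 1118272 | 새송이버섯 | <NA> | 2024 | 새송이버섯/0.3kg
4 | 가공식품 | 농산물가공식품 | 감말랭이 | 1118273 | 감말랭이/산림조합 | <NA> | 2023년 | 감말랭이/1kg
5 | 가공식품 | 농산물가공식품 | 감말랭이 | 1118274 | 감말랭이/율성맑은아침 | <NA> | 2023년 | 감말랭이/0.1kg
6 | 가공식품 | 농산물가공식품 | 감말랭이 | 1118275 | 감말랭이/푸르원 | 농업회사법인㈜푸르원 | 2023년 | 감말랭이/1kg
7 | 가공식품 | 농산물가공식품 | 감자전분 | 1118276 | 감자전분/함양농협 | <NA> | 2023년 | 감자전분(1kg)
8 | 가공식품 | 농산물가공식품 | 감자전분 | 1118277 | 감자전분(500g) | <NA> | 2023년 | 감자전분(500g)
9 | 가공식품 | 농산물가공식품 | 감초 | 1118278 | 감초/율성맑은아침 | <NA> | 2023년 | 감초/0.1kg

Last rows

(index) | 대분류명 | 중분류명 | 소분류명 | 식재료코드 | 식재료명 | 브랜드명 | 생산연도 | 규격
8742 | 농산물 | 양채류 | 파프리카 | 1128217 | 파프리카(노랑) | <NA> | 2024 | 파프리카(노랑)/2개들입_
8743 | 농산물 | 양채류 | 파프리카 | 1128218 | 파프리카(적색) | <NA> | 2024 | 파프리카(적색)/2개들입_
8744 | 농산물 | 양채류 | 파프리카 | 1128219 | 파프리카(주황) | <NA> | 2024 | 파프리카(주황)/2개들입_
8745 | 농산물 | 양채류 | 피망 | 1128220 | 피망 | <NA> | 2024 | 피망/청/2개들입_
8746 | 농산물 | 조미채소류 | 양파 | 1128221 | 양파(깐것) | <NA> | 2024 | 양파(깐것)/1kg_
8747 | 농산물 | 조미채소류 | 고추 | 1128222 | 고추(청양) | <NA> | 2024 | 청양고추/0.15kg_
8748 | 농산물 | 조미채소류 | 대파 | 1128223 | 대파(흙) | <NA> | 2024 | 대파(흙)/0.5kg_
8749 | 농산물 | 조미채소류 | 대파 | 1128224 | 대파(깐것) | <NA> | 2024 | 대파(깐것)/0.5kg_
8750 | 농산물 | 양채류 | 양상추 | 1128225 | 깐양상추 | <NA> | 2024 | 깐양상추/0.3kg_
8751 | 농산물 | 양곡부류 | 찹쌀 | 1128233 | 찹쌀 | <NA> | <NA> | 찹쌀/국내산/1kg-대체사입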