Overview

Dataset statistics

Number of variables12
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.0 MiB
Average record size in memory107.0 B

Variable types

Numeric3
Categorical5
DateTime1
Text3

Dataset

Description세종특별자치시 공공급식 통합수발주 발주현황을 제공합니다. 데이터는거래처, 견적일, 식재료코드, 식재료명, 대분류, 중분류, 규격, 단위, 발주량, 인증구분, 원산지 로 구성되어 있습니다.
Author공공데이터포털
URLhttps://www.data.go.kr/data/15117965/fileData.do

Alerts

연번 is highly overall correlated with 거래처High correlation
거래처 is highly overall correlated with 연번High correlation
대분류 is highly overall correlated with 단위 and 1 other fieldsHigh correlation
단위 is highly overall correlated with 대분류High correlation
원산지 is highly overall correlated with 대분류High correlation
단위 is highly imbalanced (54.1%)Imbalance
인증구분 is highly imbalanced (76.4%)Imbalance
원산지 is highly imbalanced (53.2%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-20 22:38:36.860815
Analysis finished2024-04-20 22:38:41.542555
Duration4.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32963.002
Minimum7
Maximum65533
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T07:38:41.667340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7
5-th percentile3357.6
Q116713.25
median33242.5
Q349215.25
95-th percentile62373.95
Maximum65533
Range65526
Interquartile range (IQR)32502

Descriptive statistics

Standard deviation18910.107
Coefficient of variation (CV)0.57367673
Kurtosis-1.1895377
Mean32963.002
Median Absolute Deviation (MAD)16297
Skewness-0.019373785
Sum3.2963002 × 108
Variance3.5759216 × 108
MonotonicityNot monotonic
2024-04-21T07:38:41.914800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20669 1
 
< 0.1%
11083 1
 
< 0.1%
171 1
 
< 0.1%
63718 1
 
< 0.1%
20682 1
 
< 0.1%
5257 1
 
< 0.1%
7437 1
 
< 0.1%
36529 1
 
< 0.1%
54479 1
 
< 0.1%
49540 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
7 1
< 0.1%
14 1
< 0.1%
21 1
< 0.1%
22 1
< 0.1%
26 1
< 0.1%
27 1
< 0.1%
44 1
< 0.1%
50 1
< 0.1%
61 1
< 0.1%
63 1
< 0.1%
ValueCountFrequency (%)
65533 1
< 0.1%
65516 1
< 0.1%
65508 1
< 0.1%
65506 1
< 0.1%
65489 1
< 0.1%
65479 1
< 0.1%
65474 1
< 0.1%
65471 1
< 0.1%
65459 1
< 0.1%
65453 1
< 0.1%

거래처
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
고운고등학교
 
590
고운유치원
 
472
고운초등학교
 
461
가락초등학교
 
451
다빛유치원
 
450
Other values (23)
7576 

Length

Max length10
Median length5
Mean length5.5262
Min length5

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row고운고등학교
2nd row고운중학교
3rd row늘봄유치원
4th row가득초등학교
5th row다빛유치원

Common Values

ValueCountFrequency (%)
고운고등학교 590
 
5.9%
고운유치원 472
 
4.7%
고운초등학교 461
 
4.6%
가락초등학교 451
 
4.5%
다빛유치원 450
 
4.5%
늘봄유치원 449
 
4.5%
가온유치원 444
 
4.4%
금남초등학교 426
 
4.3%
나성유치원 425
 
4.2%
가득유치원 425
 
4.2%
Other values (18) 5407
54.1%

Length

2024-04-21T07:38:42.153137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
고운고등학교 590
 
5.9%
고운유치원 472
 
4.7%
고운초등학교 461
 
4.6%
가락초등학교 451
 
4.5%
다빛유치원 450
 
4.5%
늘봄유치원 449
 
4.5%
가온유치원 444
 
4.4%
금남초등학교 426
 
4.3%
나성유치원 425
 
4.2%
가득유치원 425
 
4.2%
Other values (18) 5407
54.1%
Distinct143
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-01-02 00:00:00
Maximum2023-07-31 00:00:00
2024-04-21T07:38:42.370434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:38:42.615569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

식재료코드
Real number (ℝ)

Distinct2664
Distinct (%)26.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1049097.2
Minimum1003987
Maximum1108216
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T07:38:42.882126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1003987
5-th percentile1005179.6
Q11009484
median1060512
Q31076563
95-th percentile1103867
Maximum1108216
Range104229
Interquartile range (IQR)67079

Descriptive statistics

Standard deviation36041.97
Coefficient of variation (CV)0.034355224
Kurtosis-1.4884767
Mean1049097.2
Median Absolute Deviation (MAD)38784
Skewness0.0925767
Sum1.0490972 × 1010
Variance1.2990236 × 109
MonotonicityNot monotonic
2024-04-21T07:38:43.145416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1067252 305
 
3.0%
1099296 213
 
2.1%
1103867 197
 
2.0%
1067253 149
 
1.5%
1063814 142
 
1.4%
1009381 115
 
1.1%
1008692 108
 
1.1%
1008798 107
 
1.1%
1103866 98
 
1.0%
1103144 95
 
0.9%
Other values (2654) 8471
84.7%
ValueCountFrequency (%)
1003987 1
 
< 0.1%
1003996 1
 
< 0.1%
1004093 7
0.1%
1004115 1
 
< 0.1%
1004116 1
 
< 0.1%
1004122 3
< 0.1%
1004123 3
< 0.1%
1004129 2
 
< 0.1%
1004132 1
 
< 0.1%
1004137 1
 
< 0.1%
ValueCountFrequency (%)
1108216 2
 
< 0.1%
1108215 1
 
< 0.1%
1108180 1
 
< 0.1%
1108174 1
 
< 0.1%
1108159 1
 
< 0.1%
1108100 2
 
< 0.1%
1107848 1
 
< 0.1%
1107809 11
0.1%
1107783 1
 
< 0.1%
1107758 1
 
< 0.1%
Distinct1200
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T07:38:44.009902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length20
Mean length8.8876
Min length2

Characters and Unicode

Total characters88876
Distinct characters523
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique374 ?
Unique (%)3.7%

Sample

1st row피망(홍피망)+일반
2nd row돼지고기(갈비)+일반
3rd row블루베리+국산
4th row느타리버섯+친환경
5th row피망(홍피망)+일반
ValueCountFrequency (%)
마늘+껍질제거(깐것)꼭지제거 305
 
3.0%
당근+일반 257
 
2.6%
양파+일반 241
 
2.4%
파(대파)+일반 218
 
2.2%
생강+일반껍질제거(깐것 196
 
2.0%
양파+껍질제거(깐것 149
 
1.5%
무(조선무)+일반 146
 
1.5%
김치(포기김치 139
 
1.4%
멸치(큰멸치대멸 138
 
1.4%
달걀 122
 
1.2%
Other values (1190) 8089
80.9%
2024-04-21T07:38:45.373781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 7162
 
8.1%
( 7162
 
8.1%
+ 6154
 
6.9%
2733
 
3.1%
2727
 
3.1%
2040
 
2.3%
1831
 
2.1%
1620
 
1.8%
1579
 
1.8%
1535
 
1.7%
Other values (513) 54333
61.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67928
76.4%
Close Punctuation 7162
 
8.1%
Open Punctuation 7162
 
8.1%
Math Symbol 6154
 
6.9%
Uppercase Letter 381
 
0.4%
Decimal Number 57
 
0.1%
Lowercase Letter 30
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2733
 
4.0%
2727
 
4.0%
2040
 
3.0%
1831
 
2.7%
1620
 
2.4%
1579
 
2.3%
1535
 
2.3%
1325
 
2.0%
1251
 
1.8%
1245
 
1.8%
Other values (496) 50042
73.7%
Decimal Number
ValueCountFrequency (%)
8 13
22.8%
1 11
19.3%
0 10
17.5%
4 10
17.5%
6 6
10.5%
7 2
 
3.5%
3 2
 
3.5%
5 2
 
3.5%
2 1
 
1.8%
Uppercase Letter
ValueCountFrequency (%)
A 127
33.3%
P 127
33.3%
G 127
33.3%
Close Punctuation
ValueCountFrequency (%)
) 7162
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7162
100.0%
Math Symbol
ValueCountFrequency (%)
+ 6154
100.0%
Lowercase Letter
ValueCountFrequency (%)
m 30
100.0%
Other Punctuation
ValueCountFrequency (%)
% 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67876
76.4%
Common 20537
 
23.1%
Latin 411
 
0.5%
Han 52
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2733
 
4.0%
2727
 
4.0%
2040
 
3.0%
1831
 
2.7%
1620
 
2.4%
1579
 
2.3%
1535
 
2.3%
1325
 
2.0%
1251
 
1.8%
1245
 
1.8%
Other values (493) 49990
73.6%
Common
ValueCountFrequency (%)
) 7162
34.9%
( 7162
34.9%
+ 6154
30.0%
8 13
 
0.1%
1 11
 
0.1%
0 10
 
< 0.1%
4 10
 
< 0.1%
6 6
 
< 0.1%
% 2
 
< 0.1%
7 2
 
< 0.1%
Other values (3) 5
 
< 0.1%
Latin
ValueCountFrequency (%)
A 127
30.9%
P 127
30.9%
G 127
30.9%
m 30
 
7.3%
Han
ValueCountFrequency (%)
37
71.2%
13
 
25.0%
2
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67876
76.4%
ASCII 20948
 
23.6%
CJK 52
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 7162
34.2%
( 7162
34.2%
+ 6154
29.4%
A 127
 
0.6%
P 127
 
0.6%
G 127
 
0.6%
m 30
 
0.1%
8 13
 
0.1%
1 11
 
0.1%
0 10
 
< 0.1%
Other values (7) 25
 
0.1%
Hangul
ValueCountFrequency (%)
2733
 
4.0%
2727
 
4.0%
2040
 
3.0%
1831
 
2.7%
1620
 
2.4%
1579
 
2.3%
1535
 
2.3%
1325
 
2.0%
1251
 
1.8%
1245
 
1.8%
Other values (493) 49990
73.6%
CJK
ValueCountFrequency (%)
37
71.2%
13
 
25.0%
2
 
3.8%

대분류
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
농산물
5142 
가공식품
3543 
축산물
665 
수산물
650 

Length

Max length4
Median length3
Mean length3.3543
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row농산물
2nd row축산물
3rd row농산물
4th row농산물
5th row농산물

Common Values

ValueCountFrequency (%)
농산물 5142
51.4%
가공식품 3543
35.4%
축산물 665
 
6.7%
수산물 650
 
6.5%

Length

2024-04-21T07:38:45.810267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:38:46.144051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
농산물 5142
51.4%
가공식품 3543
35.4%
축산물 665
 
6.7%
수산물 650
 
6.5%
Distinct53
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T07:38:46.818660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length3
Mean length3.4477
Min length2

Characters and Unicode

Total characters34477
Distinct characters82
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row채소류
2nd row돼지고기
3rd row과일류
4th row버섯류
5th row채소류
ValueCountFrequency (%)
근채류 1495
 
14.9%
채소류 1214
 
12.1%
조미식품류 626
 
6.3%
절임+조림류 597
 
6.0%
과일류 508
 
5.1%
곡류 478
 
4.8%
양채류 329
 
3.3%
엽채류 301
 
3.0%
버섯류 260
 
2.6%
어류 234
 
2.3%
Other values (43) 3958
39.6%
2024-04-21T07:38:47.940444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9498
27.5%
3339
 
9.7%
1663
 
4.8%
1495
 
4.3%
1332
 
3.9%
1317
 
3.8%
1214
 
3.5%
+ 782
 
2.3%
657
 
1.9%
642
 
1.9%
Other values (72) 12538
36.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 33695
97.7%
Math Symbol 782
 
2.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9498
28.2%
3339
 
9.9%
1663
 
4.9%
1495
 
4.4%
1332
 
4.0%
1317
 
3.9%
1214
 
3.6%
657
 
1.9%
642
 
1.9%
626
 
1.9%
Other values (71) 11912
35.4%
Math Symbol
ValueCountFrequency (%)
+ 782
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 33695
97.7%
Common 782
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9498
28.2%
3339
 
9.9%
1663
 
4.9%
1495
 
4.4%
1332
 
4.0%
1317
 
3.9%
1214
 
3.6%
657
 
1.9%
642
 
1.9%
626
 
1.9%
Other values (71) 11912
35.4%
Common
ValueCountFrequency (%)
+ 782
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 33695
97.7%
ASCII 782
 
2.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9498
28.2%
3339
 
9.9%
1663
 
4.9%
1495
 
4.4%
1332
 
4.0%
1317
 
3.9%
1214
 
3.6%
657
 
1.9%
642
 
1.9%
626
 
1.9%
Other values (71) 11912
35.4%
ASCII
ValueCountFrequency (%)
+ 782
100.0%

규격
Text

Distinct2270
Distinct (%)22.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T07:38:48.856008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length94
Median length68
Mean length28.6642
Min length17

Characters and Unicode

Total characters286642
Distinct characters825
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1251 ?
Unique (%)12.5%

Sample

1st row1000g+kg+국산+일반+홍피망
2nd row1000g+kg+국산+일반+HACCP+1등급이상+돼지고기+갈비+냉동
3rd row1000g+kg+국산+일반+블루베리
4th row1000g+kg+국산+친환경+느타리버섯
5th row1000g+kg+국산+일반+홍피망
ValueCountFrequency (%)
1000g+kg+국산+일반+껍질제거(깐것)||꼭지제거+깐마늘 305
 
2.9%
1000g+kg+국산+일반+흙당근 257
 
2.4%
1000g+kg+국산+일반+피양파 241
 
2.3%
1000g+kg+국산+일반+흙대파 218
 
2.1%
1000g+kg+국산+일반+껍질제거(깐것)+깐양파 149
 
1.4%
1000g+kg+국산+일반+조선무 146
 
1.4%
1000g+kg+국내산+일반+배식용+포기김치 139
 
1.3%
1800g+판+국내산+무항생제+1등급+30구+달걀+냉장 108
 
1.0%
1000g+kg+국산+일반+껍질제거(깐것)+haccp+깐생강 108
 
1.0%
1000g+kg+국산+일반+껍질제거(깐것)+haccp+깐대파 107
 
1.0%
Other values (2441) 8804
83.2%
2024-04-21T07:38:50.284739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
+ 55602
19.4%
0 30421
 
10.6%
g 17311
 
6.0%
10402
 
3.6%
1 10229
 
3.6%
9983
 
3.5%
8904
 
3.1%
8823
 
3.1%
k 6747
 
2.4%
( 3435
 
1.2%
Other values (815) 124785
43.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 139759
48.8%
Math Symbol 56822
19.8%
Decimal Number 46609
 
16.3%
Lowercase Letter 26033
 
9.1%
Uppercase Letter 8669
 
3.0%
Open Punctuation 3524
 
1.2%
Close Punctuation 3524
 
1.2%
Other Punctuation 943
 
0.3%
Space Separator 582
 
0.2%
Dash Punctuation 113
 
< 0.1%
Other values (2) 64
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10402
 
7.4%
9983
 
7.1%
8904
 
6.4%
8823
 
6.3%
2704
 
1.9%
2617
 
1.9%
2385
 
1.7%
2310
 
1.7%
2284
 
1.6%
2074
 
1.5%
Other values (739) 87273
62.4%
Uppercase Letter
ValueCountFrequency (%)
C 2628
30.3%
A 1981
22.9%
P 1305
15.1%
H 1139
13.1%
E 698
 
8.1%
J 278
 
3.2%
G 157
 
1.8%
O 87
 
1.0%
S 75
 
0.9%
X 65
 
0.7%
Other values (12) 256
 
3.0%
Lowercase Letter
ValueCountFrequency (%)
g 17311
66.5%
k 6747
 
25.9%
m 720
 
2.8%
l 592
 
2.3%
e 214
 
0.8%
a 209
 
0.8%
c 79
 
0.3%
s 28
 
0.1%
x 24
 
0.1%
t 23
 
0.1%
Other values (10) 86
 
0.3%
Decimal Number
ValueCountFrequency (%)
0 30421
65.3%
1 10229
 
21.9%
5 1449
 
3.1%
2 1437
 
3.1%
3 1112
 
2.4%
8 670
 
1.4%
4 509
 
1.1%
9 314
 
0.7%
6 303
 
0.7%
7 165
 
0.4%
Other Punctuation
ValueCountFrequency (%)
* 623
66.1%
. 210
 
22.3%
% 64
 
6.8%
& 33
 
3.5%
: 4
 
0.4%
# 3
 
0.3%
3
 
0.3%
" 2
 
0.2%
! 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
+ 55602
97.9%
| 792
 
1.4%
~ 279
 
0.5%
± 124
 
0.2%
× 25
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
26
66.7%
8
 
20.5%
5
 
12.8%
Open Punctuation
ValueCountFrequency (%)
( 3435
97.5%
[ 89
 
2.5%
Close Punctuation
ValueCountFrequency (%)
) 3435
97.5%
] 89
 
2.5%
Space Separator
ValueCountFrequency (%)
582
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 113
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 139677
48.7%
Common 112155
39.1%
Latin 34702
 
12.1%
Han 108
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10402
 
7.4%
9983
 
7.1%
8904
 
6.4%
8823
 
6.3%
2704
 
1.9%
2617
 
1.9%
2385
 
1.7%
2310
 
1.7%
2284
 
1.6%
2074
 
1.5%
Other values (734) 87191
62.4%
Latin
ValueCountFrequency (%)
g 17311
49.9%
k 6747
 
19.4%
C 2628
 
7.6%
A 1981
 
5.7%
P 1305
 
3.8%
H 1139
 
3.3%
m 720
 
2.1%
E 698
 
2.0%
l 592
 
1.7%
J 278
 
0.8%
Other values (32) 1303
 
3.8%
Common
ValueCountFrequency (%)
+ 55602
49.6%
0 30421
27.1%
1 10229
 
9.1%
( 3435
 
3.1%
) 3435
 
3.1%
5 1449
 
1.3%
2 1437
 
1.3%
3 1112
 
1.0%
| 792
 
0.7%
8 670
 
0.6%
Other values (23) 3573
 
3.2%
Han
ValueCountFrequency (%)
45
41.7%
39
36.1%
15
 
13.9%
5
 
4.6%
2
 
1.9%
2
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 146692
51.2%
Hangul 139651
48.7%
None 178
 
0.1%
CJK 108
 
< 0.1%
CJK Compat 13
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
+ 55602
37.9%
0 30421
20.7%
g 17311
 
11.8%
1 10229
 
7.0%
k 6747
 
4.6%
( 3435
 
2.3%
) 3435
 
2.3%
C 2628
 
1.8%
A 1981
 
1.4%
5 1449
 
1.0%
Other values (60) 13454
 
9.2%
Hangul
ValueCountFrequency (%)
10402
 
7.4%
9983
 
7.1%
8904
 
6.4%
8823
 
6.3%
2704
 
1.9%
2617
 
1.9%
2385
 
1.7%
2310
 
1.7%
2284
 
1.6%
2074
 
1.5%
Other values (733) 87165
62.4%
None
ValueCountFrequency (%)
± 124
69.7%
26
 
14.6%
× 25
 
14.0%
3
 
1.7%
CJK
ValueCountFrequency (%)
45
41.7%
39
36.1%
15
 
13.9%
5
 
4.6%
2
 
1.9%
2
 
1.9%
CJK Compat
ValueCountFrequency (%)
8
61.5%
5
38.5%

단위
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
kg
6510 
1717 
696 
 
339
 
303
Other values (8)
 
435

Length

Max length3
Median length2
Mean length1.6634
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowkg
2nd rowkg
3rd rowkg
4th rowkg
5th rowkg

Common Values

ValueCountFrequency (%)
kg 6510
65.1%
1717
 
17.2%
696
 
7.0%
339
 
3.4%
303
 
3.0%
214
 
2.1%
138
 
1.4%
BOX 56
 
0.6%
13
 
0.1%
세트 8
 
0.1%
Other values (3) 6
 
0.1%

Length

2024-04-21T07:38:50.533532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
kg 6510
65.1%
1717
 
17.2%
696
 
7.0%
339
 
3.4%
303
 
3.0%
214
 
2.1%
138
 
1.4%
box 56
 
0.6%
13
 
0.1%
세트 8
 
0.1%
Other values (3) 6
 
0.1%

발주량
Real number (ℝ)

Distinct286
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.924029
Minimum0.05
Maximum1470
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T07:38:50.771100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.05
5-th percentile0.3
Q11
median2
Q37
95-th percentile43
Maximum1470
Range1469.95
Interquartile range (IQR)6

Descriptive statistics

Standard deviation75.04748
Coefficient of variation (CV)5.028634
Kurtosis172.10998
Mean14.924029
Median Absolute Deviation (MAD)1.5
Skewness12.133471
Sum149240.29
Variance5632.1242
MonotonicityNot monotonic
2024-04-21T07:38:51.018693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.0 2178
21.8%
2.0 1081
 
10.8%
3.0 760
 
7.6%
4.0 514
 
5.1%
5.0 503
 
5.0%
0.5 418
 
4.2%
6.0 280
 
2.8%
0.3 257
 
2.6%
10.0 241
 
2.4%
0.2 239
 
2.4%
Other values (276) 3529
35.3%
ValueCountFrequency (%)
0.05 2
 
< 0.1%
0.06 1
 
< 0.1%
0.1 185
1.8%
0.13 1
 
< 0.1%
0.15 2
 
< 0.1%
0.17 1
 
< 0.1%
0.18 1
 
< 0.1%
0.2 239
2.4%
0.21 1
 
< 0.1%
0.25 3
 
< 0.1%
ValueCountFrequency (%)
1470.0 2
< 0.1%
1460.0 2
< 0.1%
1420.0 1
 
< 0.1%
1200.0 1
 
< 0.1%
1160.0 1
 
< 0.1%
1140.0 4
< 0.1%
1120.0 3
< 0.1%
1100.0 4
< 0.1%
970.0 1
 
< 0.1%
940.0 1
 
< 0.1%

인증구분
Categorical

IMBALANCE 

Distinct11
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
해당사항없음
8577 
친환경
 
803
HACCP
 
435
GAP
 
115
무항생제
 
35
Other values (6)
 
35

Length

Max length11
Median length6
Mean length5.6658
Min length1

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row해당사항없음
2nd row해당사항없음
3rd row해당사항없음
4th row친환경
5th row해당사항없음

Common Values

ValueCountFrequency (%)
해당사항없음 8577
85.8%
친환경 803
 
8.0%
HACCP 435
 
4.3%
GAP 115
 
1.1%
무항생제 35
 
0.4%
유기농 18
 
0.2%
무농약 8
 
0.1%
전통식품 5
 
0.1%
어린이기호식품품질인증 2
 
< 0.1%
동물복지인증 1
 
< 0.1%

Length

2024-04-21T07:38:51.249422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
해당사항없음 8577
85.8%
친환경 803
 
8.0%
haccp 435
 
4.4%
gap 115
 
1.2%
무항생제 35
 
0.4%
유기농 18
 
0.2%
무농약 8
 
0.1%
전통식품 5
 
0.1%
어린이기호식품품질인증 2
 
< 0.1%
동물복지인증 1
 
< 0.1%

원산지
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
국내산
5399 
포장지에별도표기
2994 
세종산
1300 
외국산
 
283
베트남산
 
6
Other values (5)
 
18

Length

Max length8
Median length3
Mean length4.4983
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row국내산
2nd row세종산
3rd row국내산
4th row세종산
5th row국내산

Common Values

ValueCountFrequency (%)
국내산 5399
54.0%
포장지에별도표기 2994
29.9%
세종산 1300
 
13.0%
외국산 283
 
2.8%
베트남산 6
 
0.1%
러시아산 5
 
0.1%
미국산 5
 
0.1%
수입산 4
 
< 0.1%
중국산 3
 
< 0.1%
인도네시아 1
 
< 0.1%

Length

2024-04-21T07:38:51.455112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T07:38:51.665354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내산 5399
54.0%
포장지에별도표기 2994
29.9%
세종산 1300
 
13.0%
외국산 283
 
2.8%
베트남산 6
 
0.1%
러시아산 5
 
< 0.1%
미국산 5
 
< 0.1%
수입산 4
 
< 0.1%
중국산 3
 
< 0.1%
인도네시아 1
 
< 0.1%

Interactions

2024-04-21T07:38:40.342768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:38:39.342211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:38:39.841519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:38:40.511352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:38:39.502068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:38:40.009727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:38:40.681767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:38:39.669942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T07:38:40.170022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T07:38:51.835452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번거래처식재료코드대분류중분류단위발주량인증구분원산지
연번1.0000.9900.1120.0430.1520.0830.1750.0820.055
거래처0.9901.0000.1900.1480.3060.1740.2760.5620.072
식재료코드0.1120.1901.0000.5450.7650.5260.1710.3220.591
대분류0.0430.1480.5451.0001.0000.7220.1570.3810.716
중분류0.1520.3060.7651.0001.0000.8950.4800.7050.855
단위0.0830.1740.5260.7220.8951.0000.3420.4670.636
발주량0.1750.2760.1710.1570.4800.3421.0000.0000.185
인증구분0.0820.5620.3220.3810.7050.4670.0001.0000.284
원산지0.0550.0720.5910.7160.8550.6360.1850.2841.000
2024-04-21T07:38:52.035709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단위인증구분대분류원산지거래처
단위1.0000.2070.5170.3260.056
인증구분0.2071.0000.2390.1240.229
대분류0.5170.2391.0000.5190.071
원산지0.3260.1240.5191.0000.026
거래처0.0560.2290.0710.0261.000
2024-04-21T07:38:52.208951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번식재료코드발주량거래처대분류단위인증구분원산지
연번1.000-0.0040.0320.9280.0250.0340.0350.017
식재료코드-0.0041.0000.0630.0690.3570.2490.1430.217
발주량0.0320.0631.0000.1020.0940.1490.0000.058
거래처0.9280.0690.1021.0000.0710.0560.2290.026
대분류0.0250.3570.0940.0711.0000.5170.2390.519
단위0.0340.2490.1490.0560.5171.0000.2070.326
인증구분0.0350.1430.0000.2290.2390.2071.0000.124
원산지0.0170.2170.0580.0260.5190.3260.1241.000

Missing values

2024-04-21T07:38:41.121377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T07:38:41.413866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번거래처견적일식재료코드식재료명대분류중분류규격단위발주량인증구분원산지
2066820669고운고등학교2023-04-211009519피망(홍피망)+일반농산물채소류1000g+kg+국산+일반+홍피망kg1.0해당사항없음국내산
2492324924고운중학교2023-05-081085044돼지고기(갈비)+일반축산물돼지고기1000g+kg+국산+일반+HACCP+1등급이상+돼지고기+갈비+냉동kg67.0해당사항없음세종산
5860958610늘봄유치원2023-07-121070850블루베리+국산농산물과일류1000g+kg+국산+일반+블루베리kg2.0해당사항없음국내산
44854486가득초등학교2023-05-181008555느타리버섯+친환경농산물버섯류1000g+kg+국산+친환경+느타리버섯kg2.0친환경세종산
6550765508다빛유치원2023-06-291009519피망(홍피망)+일반농산물채소류1000g+kg+국산+일반+홍피망kg0.5해당사항없음국내산
4942849429나루초등학교2023-05-121009519피망(홍피망)+일반농산물채소류1000g+kg+국산+일반+홍피망kg1.5해당사항없음국내산
2089520896고운고등학교2023-03-281097790후추(검은색)가공식품조미식품류200g+봉+외국산+일반+청정원+순후추+상온0.5해당사항없음포장지에별도표기
2804628047고운초등학교2023-06-291063809무(조선무)+일반농산물근채류1000g+kg+국산+일반+조선무kg6.0해당사항없음국내산
5795057951늘봄유치원2023-06-271050354두부가공식품두부+묵류1000g+판+국내산+일반+풀무원+풀무원매일아침신선두부+냉장1.0해당사항없음포장지에별도표기
5823458235늘봄유치원2023-07-191103154메추리알+껍질제거(깐것)축산물난류1000g+판+국내산+껍질제거(깐것)+100구+깐메추리알+냉장1.0해당사항없음국내산
연번거래처견적일식재료코드식재료명대분류중분류규격단위발주량인증구분원산지
4472944730나래유치원2023-01-301004795한우잡뼈+일반축산물부산물1000g+kg+국산+일반+HACCP+1등급이상+한우+잡뼈+냉동kg1.0해당사항없음국내산
2714027141고운초등학교2023-05-151088963깐쇼칠리소스가공식품조미식품류2000g+EA(개)+외국산+일반+오뚜기+오쉐프깐풍칠리소스+냉장6.0해당사항없음포장지에별도표기
52685269가득초등학교2023-06-161052983브로콜리+일반농산물양채류1000g+kg+국산+일반+브로콜리kg3.5해당사항없음세종산
6326363264다빛유치원2023-07-171008689당근+세척한것농산물근채류1000g+kg+국산+일반+세척한것+통+HACCP+세척당근kg0.5해당사항없음국내산
6242462425늘봄초등학교2023-06-211009491파프리카(노랑파프리카)+일반농산물양채류1000g+kg+국산+일반+파프리카(노랑)kg1.0해당사항없음국내산
4135641357금호중학교2023-04-101099296양파+일반농산물근채류1000g+kg+국산+일반+피양파kg7.5해당사항없음국내산
6506965070다빛유치원2023-04-261063831토마토+일반농산물과일류1000g+kg+국산+일반+2번과+완숙토마토kg2.0해당사항없음국내산
1054610547가락초등학교2023-05-111041469밀가루+국산가공식품분말류1000g+봉+국산+일반+아이사랑+우리밀백밀가루+실온1.0해당사항없음포장지에별도표기
2502025021고운중학교2023-07-051087451된장(일본된장)가공식품장류14000g+통+외국산+일반+오복+백된장14kg+상온1.0해당사항없음포장지에별도표기
1144211443가락초등학교2023-05-121035286청주가공식품주류1800ml+병+국내산+일반+1.8L+롯데주류+백화수복+상온1.0해당사항없음포장지에별도표기