Overview

Dataset statistics

Number of variables18
Number of observations10000
Missing cells15870
Missing cells (%)8.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 MiB
Average record size in memory157.0 B

Variable types

Categorical10
Text2
Numeric5
Boolean1

Dataset

Description매일 조사하는 일반농산물 도매 69품목 116품종, 소매 90품목 143품종 및 친환경농산물 38품목 44종 대상 가격자료
URLhttps://www.data.go.kr/data/15072357/fileData.do

Alerts

시군구명 is highly overall correlated with 시장명 and 1 other fieldsHigh correlation
시장명 is highly overall correlated with 시도명 and 2 other fieldsHigh correlation
소매출하단위명 is highly overall correlated with 친환경농산물출하단위크기 and 2 other fieldsHigh correlation
조사구분명 is highly overall correlated with 시장명High correlation
산지출하단위명 is highly overall correlated with 산지출하단위크기 and 3 other fieldsHigh correlation
도매출하단위명 is highly overall correlated with 친환경농산물출하단위크기 and 3 other fieldsHigh correlation
친환경농산물출하단위명 is highly overall correlated with 품목가격 and 4 other fieldsHigh correlation
시도명 is highly overall correlated with 시장명 and 1 other fieldsHigh correlation
품목가격 is highly overall correlated with 친환경농산물출하단위명High correlation
산지출하단위크기 is highly overall correlated with 도매출하단위크기 and 2 other fieldsHigh correlation
도매출하단위크기 is highly overall correlated with 산지출하단위크기High correlation
소매출하단위크기 is highly overall correlated with 산지출하단위크기 and 3 other fieldsHigh correlation
친환경농산물출하단위크기 is highly overall correlated with 소매출하단위크기 and 4 other fieldsHigh correlation
산물등급명 is highly imbalanced (55.1%)Imbalance
산지출하단위명 is highly imbalanced (52.4%)Imbalance
도매출하단위명 is highly imbalanced (57.7%)Imbalance
할인가격여부 is highly imbalanced (67.3%)Imbalance
산지출하단위크기 has 7530 (75.3%) missing valuesMissing
도매출하단위크기 has 1415 (14.1%) missing valuesMissing
소매출하단위크기 has 161 (1.6%) missing valuesMissing
친환경농산물출하단위크기 has 6764 (67.6%) missing valuesMissing

Reproduction

Analysis started2023-12-12 03:29:49.194950
Analysis finished2023-12-12 03:29:57.236306
Duration8.04 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2021-01-21
 
537
2021-01-22
 
533
2021-01-25
 
532
2021-01-19
 
530
2021-01-04
 
523
Other values (15)
7345 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-01-14
2nd row2021-01-19
3rd row2021-01-18
4th row2021-01-11
5th row2021-01-11

Common Values

ValueCountFrequency (%)
2021-01-21 537
 
5.4%
2021-01-22 533
 
5.3%
2021-01-25 532
 
5.3%
2021-01-19 530
 
5.3%
2021-01-04 523
 
5.2%
2021-01-12 519
 
5.2%
2021-01-15 519
 
5.2%
2021-01-27 511
 
5.1%
2021-01-14 511
 
5.1%
2021-01-28 509
 
5.1%
Other values (10) 4776
47.8%

Length

2023-12-12T12:29:57.380277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2021-01-21 537
 
5.4%
2021-01-22 533
 
5.3%
2021-01-25 532
 
5.3%
2021-01-19 530
 
5.3%
2021-01-04 523
 
5.2%
2021-01-12 519
 
5.2%
2021-01-15 519
 
5.2%
2021-01-27 511
 
5.1%
2021-01-14 511
 
5.1%
2021-01-28 509
 
5.1%
Other values (10) 4776
47.8%

시장명
Categorical

HIGH CORRELATION 

Distinct33
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
C-유통
1569 
A-유통
1489 
E-유통
820 
B-유통
533 
역전
510 
Other values (28)
5079 

Length

Max length6
Median length4
Mean length3.2512
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동구
2nd rowC-유통
3rd rowA-유통
4th row중앙
5th row중앙

Common Values

ValueCountFrequency (%)
C-유통 1569
15.7%
A-유통 1489
14.9%
E-유통 820
 
8.2%
B-유통 533
 
5.3%
역전 510
 
5.1%
중앙 363
 
3.6%
부전 332
 
3.3%
양동 273
 
2.7%
서부도매 270
 
2.7%
가락도매 269
 
2.7%
Other values (23) 3572
35.7%

Length

2023-12-12T12:29:57.501352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
c-유통 1569
15.7%
a-유통 1489
14.9%
e-유통 820
 
8.2%
b-유통 533
 
5.3%
역전 510
 
5.1%
중앙 363
 
3.6%
부전 332
 
3.3%
양동 273
 
2.7%
서부도매 270
 
2.7%
가락도매 269
 
2.7%
Other values (23) 3572
35.7%

시도명
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
서울
1652 
부산
945 
대전
926 
대구
915 
광주
864 
Other values (11)
4698 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구
2nd row전북
3rd row울산
4th row강원
5th row강원

Common Values

ValueCountFrequency (%)
서울 1652
16.5%
부산 945
9.4%
대전 926
9.3%
대구 915
9.2%
광주 864
8.6%
강원 722
 
7.2%
경기 592
 
5.9%
경북 532
 
5.3%
울산 415
 
4.2%
인천 414
 
4.1%
Other values (6) 2023
20.2%

Length

2023-12-12T12:29:57.633131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울 1652
16.5%
부산 945
9.4%
대전 926
9.3%
대구 915
9.2%
광주 864
8.6%
강원 722
 
7.2%
경기 592
 
5.9%
경북 532
 
5.3%
울산 415
 
4.2%
인천 414
 
4.1%
Other values (6) 2023
20.2%

시군구명
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
서울
1652 
부산
945 
대전
926 
대구
915 
광주
864 
Other values (14)
4698 

Length

Max length3
Median length2
Mean length2.018
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구
2nd row전주
3rd row울산
4th row춘천
5th row강릉

Common Values

ValueCountFrequency (%)
서울 1652
16.5%
부산 945
 
9.4%
대전 926
 
9.3%
대구 915
 
9.2%
광주 864
 
8.6%
울산 415
 
4.2%
인천 414
 
4.1%
수원 412
 
4.1%
순천 398
 
4.0%
강릉 390
 
3.9%
Other values (9) 2669
26.7%

Length

2023-12-12T12:29:57.765344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울 1652
16.5%
부산 945
 
9.4%
대전 926
 
9.3%
대구 915
 
9.2%
광주 864
 
8.6%
울산 415
 
4.2%
인천 414
 
4.1%
수원 412
 
4.1%
순천 398
 
4.0%
전주 390
 
3.9%
Other values (9) 2669
26.7%
Distinct88
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T12:29:58.065637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length2.738
Min length1

Characters and Unicode

Total characters27380
Distinct characters130
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row풋고추
2nd row시금치
3rd row깐마늘(국산)
4th row
5th row녹두
ValueCountFrequency (%)
쇠고기 410
 
4.1%
돼지고기 396
 
4.0%
풋고추 381
 
3.8%
상추 260
 
2.6%
호박 217
 
2.2%
오이 212
 
2.1%
209
 
2.1%
땅콩 191
 
1.9%
느타리버섯 180
 
1.8%
방울토마토 167
 
1.7%
Other values (78) 7377
73.8%
2023-12-12T12:29:58.515819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2025
 
7.4%
1349
 
4.9%
1073
 
3.9%
619
 
2.3%
619
 
2.3%
577
 
2.1%
532
 
1.9%
525
 
1.9%
494
 
1.8%
448
 
1.6%
Other values (120) 19119
69.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 26938
98.4%
Close Punctuation 221
 
0.8%
Open Punctuation 221
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2025
 
7.5%
1349
 
5.0%
1073
 
4.0%
619
 
2.3%
619
 
2.3%
577
 
2.1%
532
 
2.0%
525
 
1.9%
494
 
1.8%
448
 
1.7%
Other values (118) 18677
69.3%
Close Punctuation
ValueCountFrequency (%)
) 221
100.0%
Open Punctuation
ValueCountFrequency (%)
( 221
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 26938
98.4%
Common 442
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2025
 
7.5%
1349
 
5.0%
1073
 
4.0%
619
 
2.3%
619
 
2.3%
577
 
2.1%
532
 
2.0%
525
 
1.9%
494
 
1.8%
448
 
1.7%
Other values (118) 18677
69.3%
Common
ValueCountFrequency (%)
) 221
50.0%
( 221
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 26938
98.4%
ASCII 442
 
1.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2025
 
7.5%
1349
 
5.0%
1073
 
4.0%
619
 
2.3%
619
 
2.3%
577
 
2.1%
532
 
2.0%
525
 
1.9%
494
 
1.8%
448
 
1.7%
Other values (118) 18677
69.3%
ASCII
ValueCountFrequency (%)
) 221
50.0%
( 221
50.0%
Distinct106
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T12:29:58.905602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length17
Mean length3.8773
Min length1

Characters and Unicode

Total characters38773
Distinct characters191
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row청양고추
2nd row시금치
3rd row깐마늘(국산)
4th row붉은 팥(국산)
5th row국산
ValueCountFrequency (%)
수입 862
 
7.4%
국산 492
 
4.2%
월동 291
 
2.5%
일반계 213
 
1.8%
209
 
1.8%
154
 
1.3%
151
 
1.3%
청양고추 147
 
1.3%
꽈리고추 144
 
1.2%
생선 143
 
1.2%
Other values (120) 8877
76.0%
2023-12-12T12:29:59.479586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1683
 
4.3%
1391
 
3.6%
1322
 
3.4%
1214
 
3.1%
1163
 
3.0%
909
 
2.3%
( 867
 
2.2%
) 867
 
2.2%
747
 
1.9%
737
 
1.9%
Other values (181) 27873
71.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32898
84.8%
Space Separator 1683
 
4.3%
Decimal Number 1385
 
3.6%
Open Punctuation 867
 
2.2%
Close Punctuation 867
 
2.2%
Lowercase Letter 603
 
1.6%
Math Symbol 179
 
0.5%
Uppercase Letter 174
 
0.4%
Other Punctuation 117
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1391
 
4.2%
1322
 
4.0%
1214
 
3.7%
1163
 
3.5%
909
 
2.8%
747
 
2.3%
737
 
2.2%
711
 
2.2%
665
 
2.0%
532
 
1.6%
Other values (164) 23507
71.5%
Decimal Number
ValueCountFrequency (%)
0 362
26.1%
1 299
21.6%
3 237
17.1%
2 179
12.9%
5 125
 
9.0%
8 117
 
8.4%
4 66
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
L 60
34.5%
J 57
32.8%
C 57
32.8%
Lowercase Letter
ValueCountFrequency (%)
g 424
70.3%
k 179
29.7%
Space Separator
ValueCountFrequency (%)
1683
100.0%
Open Punctuation
ValueCountFrequency (%)
( 867
100.0%
Close Punctuation
ValueCountFrequency (%)
) 867
100.0%
Math Symbol
ValueCountFrequency (%)
× 179
100.0%
Other Punctuation
ValueCountFrequency (%)
. 117
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32898
84.8%
Common 5098
 
13.1%
Latin 777
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1391
 
4.2%
1322
 
4.0%
1214
 
3.7%
1163
 
3.5%
909
 
2.8%
747
 
2.3%
737
 
2.2%
711
 
2.2%
665
 
2.0%
532
 
1.6%
Other values (164) 23507
71.5%
Common
ValueCountFrequency (%)
1683
33.0%
( 867
17.0%
) 867
17.0%
0 362
 
7.1%
1 299
 
5.9%
3 237
 
4.6%
2 179
 
3.5%
× 179
 
3.5%
5 125
 
2.5%
8 117
 
2.3%
Other values (2) 183
 
3.6%
Latin
ValueCountFrequency (%)
g 424
54.6%
k 179
23.0%
L 60
 
7.7%
J 57
 
7.3%
C 57
 
7.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32898
84.8%
ASCII 5696
 
14.7%
None 179
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1683
29.5%
( 867
15.2%
) 867
15.2%
g 424
 
7.4%
0 362
 
6.4%
1 299
 
5.2%
3 237
 
4.2%
2 179
 
3.1%
k 179
 
3.1%
5 125
 
2.2%
Other values (6) 474
 
8.3%
Hangul
ValueCountFrequency (%)
1391
 
4.2%
1322
 
4.0%
1214
 
3.7%
1163
 
3.5%
909
 
2.8%
747
 
2.3%
737
 
2.2%
711
 
2.2%
665
 
2.0%
532
 
1.6%
Other values (164) 23507
71.5%
None
ValueCountFrequency (%)
× 179
100.0%

조사구분명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
소매
8441 
도매
1559 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row소매
2nd row소매
3rd row소매
4th row소매
5th row소매

Common Values

ValueCountFrequency (%)
소매 8441
84.4%
도매 1559
 
15.6%

Length

2023-12-12T12:29:59.672359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:29:59.796979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소매 8441
84.4%
도매 1559
 
15.6%

산물등급명
Categorical

IMBALANCE 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
상품
5623 
중품
3841 
1등급
 
199
1+등급
 
128
M과
 
66
Other values (3)
 
143

Length

Max length4
Median length2
Mean length2.0455
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중품
2nd row상품
3rd row상품
4th row상품
5th row상품

Common Values

ValueCountFrequency (%)
상품 5623
56.2%
중품 3841
38.4%
1등급 199
 
2.0%
1+등급 128
 
1.3%
M과 66
 
0.7%
S과 60
 
0.6%
냉장 42
 
0.4%
냉동 41
 
0.4%

Length

2023-12-12T12:29:59.961119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:30:00.138786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상품 5623
56.2%
중품 3841
38.4%
1등급 199
 
2.0%
1+등급 128
 
1.3%
m과 66
 
0.7%
s과 60
 
0.6%
냉장 42
 
0.4%
냉동 41
 
0.4%

품목가격
Real number (ℝ)

HIGH CORRELATION 

Distinct1126
Distinct (%)11.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23465.378
Minimum150
Maximum1075000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T12:30:00.320607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum150
5-th percentile630
Q11980
median5330
Q312900
95-th percentile61015
Maximum1075000
Range1074850
Interquartile range (IQR)10920

Descriptive statistics

Standard deviation89686.151
Coefficient of variation (CV)3.8220629
Kurtosis68.216495
Mean23465.378
Median Absolute Deviation (MAD)3930
Skewness7.923776
Sum2.3465378 × 108
Variance8.0436056 × 109
MonotonicityNot monotonic
2023-12-12T12:30:00.520613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2500 132
 
1.3%
2000 125
 
1.2%
1000 122
 
1.2%
5000 119
 
1.2%
1980 105
 
1.1%
4000 104
 
1.0%
6000 102
 
1.0%
8000 100
 
1.0%
3000 84
 
0.8%
1500 84
 
0.8%
Other values (1116) 8923
89.2%
ValueCountFrequency (%)
150 1
 
< 0.1%
160 4
 
< 0.1%
190 2
 
< 0.1%
200 4
 
< 0.1%
210 6
0.1%
220 8
0.1%
230 8
0.1%
240 10
0.1%
250 14
0.1%
257 1
 
< 0.1%
ValueCountFrequency (%)
1075000 1
 
< 0.1%
1050000 2
 
< 0.1%
1025000 3
 
< 0.1%
1000000 5
 
0.1%
960000 4
 
< 0.1%
950000 1
 
< 0.1%
925000 3
 
< 0.1%
920000 2
 
< 0.1%
910000 2
 
< 0.1%
900000 13
0.1%

산지출하단위크기
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct8
Distinct (%)0.3%
Missing7530
Missing (%)75.3%
Infinite0
Infinite (%)0.0%
Mean53.064777
Minimum1
Maximum600
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T12:30:00.672995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median10
Q340
95-th percentile600
Maximum600
Range599
Interquartile range (IQR)39

Descriptive statistics

Standard deviation146.17989
Coefficient of variation (CV)2.7547443
Kurtosis9.8950596
Mean53.064777
Median Absolute Deviation (MAD)9
Skewness3.4132316
Sum131070
Variance21368.561
MonotonicityNot monotonic
2023-12-12T12:30:00.828336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1 1111
 
11.1%
15 399
 
4.0%
40 354
 
3.5%
600 162
 
1.6%
20 151
 
1.5%
80 104
 
1.0%
10 101
 
1.0%
3 88
 
0.9%
(Missing) 7530
75.3%
ValueCountFrequency (%)
1 1111
11.1%
3 88
 
0.9%
10 101
 
1.0%
15 399
 
4.0%
20 151
 
1.5%
40 354
 
3.5%
80 104
 
1.0%
600 162
 
1.6%
ValueCountFrequency (%)
600 162
 
1.6%
80 104
 
1.0%
40 354
 
3.5%
20 151
 
1.5%
15 399
 
4.0%
10 101
 
1.0%
3 88
 
0.9%
1 1111
11.1%

산지출하단위명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
7530 
kg
2207 
g
 
162
 
101

Length

Max length4
Median length4
Mean length3.4797
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 7530
75.3%
kg 2207
 
22.1%
g 162
 
1.6%
101
 
1.0%

Length

2023-12-12T12:30:01.015009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:30:01.155333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 7530
75.3%
kg 2207
 
22.1%
g 162
 
1.6%
101
 
1.0%

도매출하단위크기
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct19
Distinct (%)0.2%
Missing1415
Missing (%)14.1%
Infinite0
Infinite (%)0.0%
Mean12.131974
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T12:30:01.319843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median10
Q320
95-th percentile40
Maximum100
Range99
Interquartile range (IQR)18

Descriptive statistics

Standard deviation14.549364
Coefficient of variation (CV)1.1992578
Kurtosis14.107515
Mean12.131974
Median Absolute Deviation (MAD)8.5
Skewness3.0478189
Sum104153
Variance211.684
MonotonicityNot monotonic
2023-12-12T12:30:01.466601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
1.0 1906
19.1%
10.0 1594
15.9%
20.0 1233
12.3%
2.0 720
 
7.2%
4.0 643
 
6.4%
30.0 480
 
4.8%
5.0 472
 
4.7%
40.0 288
 
2.9%
8.0 236
 
2.4%
15.0 191
 
1.9%
Other values (9) 822
8.2%
(Missing) 1415
14.1%
ValueCountFrequency (%)
1.0 1906
19.1%
1.5 88
 
0.9%
2.0 720
 
7.2%
4.0 643
 
6.4%
5.0 472
 
4.7%
8.0 236
 
2.4%
10.0 1594
15.9%
12.0 145
 
1.5%
13.0 125
 
1.2%
15.0 191
 
1.9%
ValueCountFrequency (%)
100.0 103
 
1.0%
50.0 51
 
0.5%
45.0 23
 
0.2%
40.0 288
 
2.9%
35.0 128
 
1.3%
30.0 480
 
4.8%
20.0 1233
12.3%
18.0 37
 
0.4%
17.0 122
 
1.2%
15.0 191
 
1.9%

도매출하단위명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
kg
7941 
<NA>
1415 
 
496
마리
 
99
 
49

Length

Max length4
Median length2
Mean length2.2285
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowkg
2nd rowkg
3rd rowkg
4th rowkg
5th rowkg

Common Values

ValueCountFrequency (%)
kg 7941
79.4%
<NA> 1415
 
14.1%
496
 
5.0%
마리 99
 
1.0%
49
 
0.5%

Length

2023-12-12T12:30:01.670402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:30:01.807276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kg 7941
79.4%
na 1415
 
14.1%
496
 
5.0%
마리 99
 
1.0%
49
 
0.5%

소매출하단위크기
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct11
Distinct (%)0.1%
Missing161
Missing (%)1.6%
Infinite0
Infinite (%)0.0%
Mean70.483687
Minimum1
Maximum600
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T12:30:01.921789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median10
Q3100
95-th percentile500
Maximum600
Range599
Interquartile range (IQR)99

Descriptive statistics

Standard deviation124.37565
Coefficient of variation (CV)1.7646019
Kurtosis8.3966189
Mean70.483687
Median Absolute Deviation (MAD)9
Skewness2.9147635
Sum693489
Variance15469.301
MonotonicityNot monotonic
2023-12-12T12:30:02.069570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
1 3929
39.3%
100 3267
32.7%
10 1314
 
13.1%
500 409
 
4.1%
5 240
 
2.4%
600 162
 
1.6%
200 132
 
1.3%
20 104
 
1.0%
30 101
 
1.0%
150 101
 
1.0%
(Missing) 161
 
1.6%
ValueCountFrequency (%)
1 3929
39.3%
2 80
 
0.8%
5 240
 
2.4%
10 1314
 
13.1%
20 104
 
1.0%
30 101
 
1.0%
100 3267
32.7%
150 101
 
1.0%
200 132
 
1.3%
500 409
 
4.1%
ValueCountFrequency (%)
600 162
 
1.6%
500 409
 
4.1%
200 132
 
1.3%
150 101
 
1.0%
100 3267
32.7%
30 101
 
1.0%
20 104
 
1.0%
10 1314
13.1%
5 240
 
2.4%
2 80
 
0.8%

소매출하단위명
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
g
4071 
kg
2265 
2202 
마리
667 
포기
 
264
Other values (5)
531 

Length

Max length4
Median length1
Mean length1.385
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowg
2nd rowkg
3rd rowkg
4th rowg
5th rowg

Common Values

ValueCountFrequency (%)
g 4071
40.7%
kg 2265
22.7%
2202
22.0%
마리 667
 
6.7%
포기 264
 
2.6%
<NA> 161
 
1.6%
133
 
1.3%
묶음 122
 
1.2%
66
 
0.7%
리터 49
 
0.5%

Length

2023-12-12T12:30:02.217042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:30:02.361856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
g 4071
40.7%
kg 2265
22.7%
2202
22.0%
마리 667
 
6.7%
포기 264
 
2.6%
na 161
 
1.6%
133
 
1.3%
묶음 122
 
1.2%
66
 
0.7%
리터 49
 
0.5%

친환경농산물출하단위크기
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct6
Distinct (%)0.2%
Missing6764
Missing (%)67.6%
Infinite0
Infinite (%)0.0%
Mean46.776576
Minimum1
Maximum200
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T12:30:02.505999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median10
Q3100
95-th percentile150
Maximum200
Range199
Interquartile range (IQR)99

Descriptive statistics

Standard deviation57.22
Coefficient of variation (CV)1.2232618
Kurtosis-0.13922055
Mean46.776576
Median Absolute Deviation (MAD)9
Skewness0.95697948
Sum151369
Variance3274.1284
MonotonicityNot monotonic
2023-12-12T12:30:02.652143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 1329
 
13.3%
100 997
 
10.0%
10 576
 
5.8%
200 132
 
1.3%
30 101
 
1.0%
150 101
 
1.0%
(Missing) 6764
67.6%
ValueCountFrequency (%)
1 1329
13.3%
10 576
5.8%
30 101
 
1.0%
100 997
10.0%
150 101
 
1.0%
200 132
 
1.3%
ValueCountFrequency (%)
200 132
 
1.3%
150 101
 
1.0%
100 997
10.0%
30 101
 
1.0%
10 576
5.8%
1 1329
13.3%

친환경농산물출하단위명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
6764 
g
1230 
kg
1168 
708 
포기
 
130

Length

Max length4
Median length4
Mean length3.159
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd rowkg
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 6764
67.6%
g 1230
 
12.3%
kg 1168
 
11.7%
708
 
7.1%
포기 130
 
1.3%

Length

2023-12-12T12:30:02.818194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:30:02.935818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 6764
67.6%
g 1230
 
12.3%
kg 1168
 
11.7%
708
 
7.1%
포기 130
 
1.3%

할인가격여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
9401 
True
 
599
ValueCountFrequency (%)
False 9401
94.0%
True 599
 
6.0%
2023-12-12T12:30:03.064646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-12T12:29:55.470999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:52.667641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:53.461515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:54.156837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:54.802381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:55.619743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:52.794999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:53.621416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:54.258563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:54.939899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:55.760618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:52.967609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:53.774140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:54.385662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:55.071276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:55.945294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:53.130708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:53.902979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:54.517347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:55.203631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:56.098473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:53.294452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:54.020167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:54.661056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:29:55.336722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:30:03.164498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
가격등록일자시장명시도명시군구명품목명조사구분명산물등급명품목가격산지출하단위크기산지출하단위명도매출하단위크기도매출하단위명소매출하단위크기소매출하단위명친환경농산물출하단위크기친환경농산물출하단위명할인가격여부
가격등록일자1.0000.0360.0470.0380.0000.0300.0150.0000.0000.0430.0000.0000.0100.0000.0800.0490.018
시장명0.0361.0000.9670.9650.6420.9810.2780.5860.4350.3720.3830.3130.3740.3000.0720.1190.371
시도명0.0470.9671.0001.0000.2200.5430.1470.1350.1440.1240.0720.0500.0560.0540.0000.0240.109
시군구명0.0380.9651.0001.0000.2300.4820.1340.1340.1380.1230.0790.0510.0520.0630.0000.0000.102
품목명0.0000.6420.2200.2301.0000.4300.9010.7291.0001.0000.9850.9961.0001.0001.0001.0000.385
조사구분명0.0300.9810.5430.4820.4301.0000.2360.5140.0550.0770.1150.0510.1350.0610.0650.0850.168
산물등급명0.0150.2780.1470.1340.9010.2361.0000.0290.0940.1340.1740.2190.2540.4000.2250.3380.067
품목가격0.0000.5860.1350.1340.7290.5140.0291.0000.6160.6160.4660.0000.4500.109NaNNaN0.040
산지출하단위크기0.0000.4350.1440.1381.0000.0550.0940.6161.0000.9430.5090.0410.6820.4590.1030.2030.051
산지출하단위명0.0430.3720.1240.1231.0000.0770.1340.6160.9431.0000.4491.0000.6810.4211.0000.2700.022
도매출하단위크기0.0000.3830.0720.0790.9850.1150.1740.4660.5090.4491.0000.4680.8310.5550.2710.6680.094
도매출하단위명0.0000.3130.0500.0510.9960.0510.2190.0000.0411.0000.4681.0000.2250.8180.4380.8900.012
소매출하단위크기0.0100.3740.0560.0521.0000.1350.2540.4500.6820.6810.8310.2251.0000.7090.9630.8840.055
소매출하단위명0.0000.3000.0540.0631.0000.0610.4000.1090.4590.4210.5550.8180.7091.0000.6551.0000.092
친환경농산물출하단위크기0.0800.0720.0000.0001.0000.0650.225NaN0.1031.0000.2710.4380.9630.6551.0000.6720.024
친환경농산물출하단위명0.0490.1190.0240.0001.0000.0850.338NaN0.2030.2700.6680.8900.8841.0000.6721.0000.040
할인가격여부0.0180.3710.1090.1020.3850.1680.0670.0400.0510.0220.0940.0120.0550.0920.0240.0401.000
2023-12-12T12:30:03.363923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구명산물등급명시장명소매출하단위명조사구분명산지출하단위명할인가격여부가격등록일자도매출하단위명친환경농산물출하단위명시도명
시군구명1.0000.0560.6920.0250.4290.0640.0900.0110.0280.0001.000
산물등급명0.0561.0000.1110.2090.1770.2210.0500.0060.1000.2260.052
시장명0.6920.1111.0000.1140.9560.1850.3150.0090.1650.0620.728
소매출하단위명0.0250.2090.1141.0000.0610.3510.0920.0000.6710.9860.022
조사구분명0.4290.1770.9560.0611.0000.1280.1070.0230.0340.0570.429
산지출하단위명0.0640.2210.1850.3510.1281.0000.0370.0221.0000.4380.067
할인가격여부0.0900.0500.3150.0920.1070.0371.0000.0140.0080.0260.086
가격등록일자0.0110.0060.0090.0000.0230.0220.0141.0000.0000.0230.014
도매출하단위명0.0280.1000.1650.6710.0341.0000.0080.0001.0000.6990.023
친환경농산물출하단위명0.0000.2260.0620.9860.0570.4380.0260.0230.6991.0000.011
시도명1.0000.0520.7280.0220.4290.0670.0860.0140.0230.0111.000
2023-12-12T12:30:03.567823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목가격산지출하단위크기도매출하단위크기소매출하단위크기친환경농산물출하단위크기가격등록일자시장명시도명시군구명조사구분명산물등급명산지출하단위명도매출하단위명소매출하단위명친환경농산물출하단위명할인가격여부
품목가격1.0000.4810.218-0.258-0.3640.0000.2490.0530.0500.3960.0140.3370.0000.0491.0000.031
산지출하단위크기0.4811.0000.6350.651-0.0030.0000.2240.0780.0720.0920.1560.7080.0680.3910.3330.085
도매출하단위크기0.2180.6351.0000.032-0.1570.0000.1760.0340.0370.0830.0970.4430.3190.2280.3180.068
소매출하단위크기-0.2580.6510.0321.0000.9640.0040.1710.0270.0240.0970.1440.7130.1470.4460.5580.039
친환경농산물출하단위크기-0.364-0.003-0.1570.9641.0000.0340.0340.0000.0000.0800.2750.9990.5330.5850.6040.030
가격등록일자0.0000.0000.0000.0040.0341.0000.0090.0140.0110.0230.0060.0220.0000.0000.0230.014
시장명0.2490.2240.1760.1710.0340.0091.0000.7280.6920.9560.1110.1850.1650.1140.0620.315
시도명0.0530.0780.0340.0270.0000.0140.7281.0001.0000.4290.0520.0670.0230.0220.0110.086
시군구명0.0500.0720.0370.0240.0000.0110.6921.0001.0000.4290.0560.0640.0280.0250.0000.090
조사구분명0.3960.0920.0830.0970.0800.0230.9560.4290.4291.0000.1770.1280.0340.0610.0570.107
산물등급명0.0140.1560.0970.1440.2750.0060.1110.0520.0560.1771.0000.2210.1000.2090.2260.050
산지출하단위명0.3370.7080.4430.7130.9990.0220.1850.0670.0640.1280.2211.0001.0000.3510.4380.037
도매출하단위명0.0000.0680.3190.1470.5330.0000.1650.0230.0280.0340.1001.0001.0000.6710.6990.008
소매출하단위명0.0490.3910.2280.4460.5850.0000.1140.0220.0250.0610.2090.3510.6711.0000.9860.092
친환경농산물출하단위명1.0000.3330.3180.5580.6040.0230.0620.0110.0000.0570.2260.4380.6990.9861.0000.026
할인가격여부0.0310.0850.0680.0390.0300.0140.3150.0860.0900.1070.0500.0370.0080.0920.0261.000

Missing values

2023-12-12T12:29:56.335720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:29:56.933600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T12:29:57.122293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

가격등록일자시장명시도명시군구명품목명품종명조사구분명산물등급명품목가격산지출하단위크기산지출하단위명도매출하단위크기도매출하단위명소매출하단위크기소매출하단위명친환경농산물출하단위크기친환경농산물출하단위명할인가격여부
414592021-01-14동구대구대구풋고추청양고추소매중품650<NA><NA>10.0kg100g<NA><NA>N
565842021-01-19C-유통전북전주시금치시금치소매상품9520<NA><NA>4.0kg1kg1kgN
505602021-01-18A-유통울산울산깐마늘(국산)깐마늘(국산)소매상품11970<NA><NA>20.0kg1kg<NA><NA>N
272212021-01-11중앙강원춘천붉은 팥(국산)소매상품6660<NA><NA>40.0kg500g<NA><NA>N
273812021-01-11중앙강원강릉녹두국산소매상품10000<NA><NA>40.0kg500g<NA><NA>N
218872021-01-08A-유통울산울산양파양파소매상품29301kg20.0kg1kg1kgN
445282021-01-15칠성대구대구건미역건미역도매상품14000<NA><NA>1.0kg100g<NA><NA>N
23452021-01-04양동광주광주돼지고기삼겹살(수입냉동)소매중품970<NA><NA>1.0kg100g<NA><NA>N
510072021-01-18B-유통경기수원두부풀무원 국산콩 300g×2소매상품5690<NA><NA><NA><NA>1<NA><NA>N
220292021-01-08신정울산울산파인애플수입소매중품5000<NA><NA>12.0kg1<NA><NA>N
가격등록일자시장명시도명시군구명품목명품종명조사구분명산물등급명품목가격산지출하단위크기산지출하단위명도매출하단위크기도매출하단위명소매출하단위크기소매출하단위명친환경농산물출하단위크기친환경농산물출하단위명할인가격여부
706592021-01-22B-유통충북청주월동소매상품17801kg20.0kg1<NA><NA>Y
756932021-01-25A-유통전남순천얼갈이배추얼갈이배추소매상품3040<NA><NA>4.0kg1kg<NA><NA>N
685872021-01-22A-유통대구대구소매상품279601kg1.0kg1kg1kgN
143862021-01-07가락도매서울서울당근세척(수입)도매중품8000<NA><NA>10.0kg<NA><NA><NA><NA>N
19602021-01-04인동대전대전붉은 팥(국산)도매중품460000<NA><NA>40.0kg500g<NA><NA>N
457542021-01-15A-유통울산울산붉은 팥(국산)소매상품7140<NA><NA>40.0kg500g<NA><NA>Y
526722021-01-19가락도매서울서울딸기딸기도매중품22000<NA><NA>2.0kg100g100gN
14022021-01-04C-유통세종세종건포도수입소매중품1000<NA><NA><NA><NA>100g<NA><NA>N
386842021-01-14E-유통경북포항피망소매상품1480<NA><NA>10.0kg100g100gN
518202021-01-18A-유통전남순천월동소매중품22901kg20.0kg1<NA><NA>N