Overview

Dataset statistics

Number of variables: 9
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): < 0.1%
Total size in memory: 839.8 KiB
Average record size in memory: 86.0 B
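
The average record size and the duplicate percentage follow directly from the other figures in this block. A minimal check, assuming 1 KiB = 1024 bytes (the variable names are illustrative and not part of the report):

```python
# Figures copied from the dataset statistics above.
total_size_kib = 839.8
n_observations = 10_000
n_duplicate_rows = 1

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations
duplicate_pct = n_duplicate_rows / n_observations * 100

print(f"{avg_record_bytes:.1f} B")  # 86.0 B
print(f"{duplicate_pct:.2f}%")      # 0.01%, reported as "< 0.1%"
```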

Variable types

Categorical: 6
Numeric: 3

Dataset

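Description: 품목 (item), 단위(등급) (unit (grade)), 전년가격 (previous-year price), 검색전일대비등락율 (rate of change versus the previous day), 검색일전년대비등락율 (rate of change versus the previous year), 단위 (unit), 검색일가격 (price on the search date), 검색전일가격 (price on the day before the search date), 해당일자 (date)
Author: 서울시농수산식품공사 (Seoul Agro-Fisheries & Food Corporation)
URL: https://data.seoul.go.kr/dataList/OA-13448/S/1/datasetView.do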

Alerts

전년가격 has constant value "0" [Constant]
검색전일대비등락율 has constant value "100" [Constant]
검색일전년대비등락율 has constant value "0" [Constant]
Dataset has 1 (< 0.1%) duplicate row [Duplicates]
단위 is highly overall correlated with 검색일가격 and 2 other fields [High correlation]
품목 is highly overall correlated with 검색일가격 and 3 other fields [High correlation]
검색일가격 is highly overall correlated with 검색전일가격 and 2 other fields [High correlation]
검색전일가격 is highly overall correlated with 검색일가격 and 2 other fields [High correlation]
단위(등급) is highly overall correlated with 품목 [High correlation]
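
The constant-value alerts, and the fact that 검색일가격 and 검색전일가격 report identical statistics, can be checked with a few lines of plain Python. The sketch below is not how ydata-profiling computes its alerts. It uses three rows read from the Sample section of this report, omits 단위(등급), and assumes the constant columns are stored as strings; the helper name `constant_columns` is hypothetical.

```python
# Three rows read from the report's Sample section (단위(등급) omitted).
rows = [
    {"품목": "양상추 수입", "전년가격": "0", "검색전일대비등락율": "100",
     "검색일전년대비등락율": "0", "검색일가격": 19685, "검색전일가격": 19685},
    {"품목": "양파 수입", "전년가격": "0", "검색전일대비등락율": "100",
     "검색일전년대비등락율": "0", "검색일가격": 1254, "검색전일가격": 1254},
    {"품목": "망고 수입", "전년가격": "0", "검색전일대비등락율": "100",
     "검색일전년대비등락율": "0", "검색일가격": 28720, "검색전일가격": 28720},
]


def constant_columns(records):
    """Return the columns that hold a single distinct value across all records."""
    return [col for col in records[0] if len({r[col] for r in records}) == 1]


constant = constant_columns(rows)
prices_identical = all(r["검색일가격"] == r["검색전일가격"] for r in rows)

print(constant)
print(prices_identical)
```

Constant columns carry no information, and two identical price columns are redundant, so both are usual candidates for removal before modelling.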

Reproduction

Analysis started: 2024-05-18 05:12:08.019119
Analysis finished: 2024-05-18 05:12:12.329620
Duration: 4.31 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

품목
Categorical

HIGH CORRELATION

Distinct: 36
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
바나나 수입 | 1038
당근 수입 | 977
양파 수입 | 832
망고 수입 | 810
냉동 가자미 수입 | 797
Other values (31) | 5546

Length

Max length: 9
Median length: 8
Mean length: 6.2702
Min length: 4

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 양상추 수입
2nd row: (냉)고등어 수입
3rd row: 양파 수입
4th row: 브로콜리 수입
5th row: 망고 수입

Common Values

Value | Count | Frequency (%)
바나나 수입 | 1038 | 10.4%
당근 수입 | 977 | 9.8%
양파 수입 | 832 | 8.3%
망고 수입 | 810 | 8.1%
냉동 가자미 수입 | 797 | 8.0%
양상추 수입 | 645 | 6.5%
생표고 수입 | 621 | 6.2%
브로콜리 수입 | 617 | 6.2%
단호박 수입 | 539 | 5.4%
(선)명태 수입 | 430 | 4.3%
Other values (26) | 2694 | 26.9%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
수입 | 9751 | 46.0%
냉동 | 1122 | 5.3%
바나나 | 1038 | 4.9%
당근 | 977 | 4.6%
생표고 | 842 | 4.0%
양파 | 832 | 3.9%
망고 | 810 | 3.8%
가자미 | 797 | 3.8%
양상추 | 645 | 3.0%
브로콜리 | 617 | 2.9%
Other values (28) | 3768 | 17.8%
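
The table just above appears to count whitespace-separated tokens rather than whole category values, which would explain why 수입 dominates it. A minimal sketch of such a count, assuming plain whitespace splitting (the report's own tokenization is not stated). The input is limited to the five most common 품목 values and their counts, so the totals are smaller than the report's.

```python
from collections import Counter

# Five most common 품목 values and their counts, copied from the report.
value_counts = {
    "바나나 수입": 1038,
    "당근 수입": 977,
    "양파 수입": 832,
    "망고 수입": 810,
    "냉동 가자미 수입": 797,
}

# Split each value on whitespace and credit every token with the value's count.
token_counts = Counter()
for value, count in value_counts.items():
    for token in value.split():
        token_counts[token] += count

print(token_counts.most_common(3))
```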

단위(등급)
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
3567
2761
2077
1418
177

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

Value | Count | Frequency (%)
 | 3567 | 35.7%
 | 2761 | 27.6%
 | 2077 | 20.8%
 | 1418 | 14.2%
 | 177 | 1.8%

Length

Histogram of lengths of the category

전년가격
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
0 | 10000

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 10000 | 100.0%

Length

Histogram of lengths of the category

검색전일대비등락율
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
100 | 10000

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 100
2nd row: 100
3rd row: 100
4th row: 100
5th row: 100

Common Values

Value | Count | Frequency (%)
100 | 10000 | 100.0%

Length

Histogram of lengths of the category

검색일전년대비등락율
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
0 | 10000

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 10000 | 100.0%

Length

Histogram of lengths of the category

단위
Categorical

HIGH CORRELATION

Distinct: 20
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
10키로상자 | 2027
5키로상자 | 1786
1키로 | 1172
4키로상자 | 975
13키로상자 | 954
Other values (15) | 3086

Length

Max length: 8
Median length: 8
Mean length: 7.7656
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 7.5키로상자
2nd row: 10키로상자
3rd row: 1키로
4th row: 8키로상자
5th row: 5키로상자

Common Values

Value | Count | Frequency (%)
10키로상자 | 2027 | 20.3%
5키로상자 | 1786 | 17.9%
1키로 | 1172 | 11.7%
4키로상자 | 975 | 9.8%
13키로상자 | 954 | 9.5%
8키로상자 | 876 | 8.8%
10키로망대 | 539 | 5.4%
7.5키로상자 | 506 | 5.1%
3키로상자 | 280 | 2.8%
12키로상자 | 242 | 2.4%
Other values (10) | 643 | 6.4%

Length

Histogram of lengths of the category

검색일가격
Real number (ℝ)

HIGH CORRELATION

Distinct: 5556
Distinct (%): 55.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37879.353
Minimum: 44
Maximum: 612500
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 44
5-th percentile: 1157.95
Q1: 10132.5
median: 20521
Q3: 30624.5
95-th percentile: 128462
Maximum: 612500
Range: 612456
Interquartile range (IQR): 20492

Descriptive statistics

Standard deviation: 75852.139
Coefficient of variation (CV): 2.0024666
Kurtosis: 30.216572
Mean: 37879.353
Median Absolute Deviation (MAD): 10253.5
Skewness: 5.2051768
Sum: 3.7879354 × 10^8
Variance: 5.753547 × 10^9
Monotonicity: Not monotonic
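
Several of the figures above are derived from others in the same block: the range from the minimum and maximum, the IQR from the quartiles, the coefficient of variation from the standard deviation and mean, and the variance from the standard deviation. A short consistency check using only the reported values (this is arithmetic on the published numbers, not a recomputation from the raw data):

```python
import math

# Values reported for 검색일가격 in the report.
minimum, maximum = 44, 612500
q1, q3 = 10132.5, 30624.5
mean, std = 37879.353, 75852.139

value_range = maximum - minimum  # reported Range: 612456
iqr = q3 - q1                    # reported IQR: 20492
cv = std / mean                  # reported CV: 2.0024666
variance = std ** 2              # reported Variance: 5.753547e9

print(value_range, iqr)
print(math.isclose(cv, 2.0024666, rel_tol=1e-6))
print(math.isclose(variance, 5.753547e9, rel_tol=1e-6))
```

A CV of about 2 together with a skewness above 5 indicates a long right tail: a small number of very expensive items pulls the mean (37879) well above the median (20521).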
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
30000 | 145 | 1.5%
28720 | 123 | 1.2%
30250 | 118 | 1.2%
30500 | 113 | 1.1%
16000 | 111 | 1.1%
10000 | 82 | 0.8%
17000 | 65 | 0.7%
18000 | 64 | 0.6%
210000 | 63 | 0.6%
241000 | 62 | 0.6%
Other values (5546) | 9054 | 90.5%

Minimum 10 values

Value | Count | Frequency (%)
44 | 1 | < 0.1%
55 | 1 | < 0.1%
284 | 1 | < 0.1%
331 | 1 | < 0.1%
357 | 1 | < 0.1%
403 | 1 | < 0.1%
441 | 1 | < 0.1%
448 | 1 | < 0.1%
464 | 1 | < 0.1%
490 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
612500 | 52 | 0.5%
532000 | 11 | 0.1%
512000 | 60 | 0.6%
508000 | 12 | 0.1%
365000 | 14 | 0.1%
360500 | 17 | 0.2%
358000 | 3 | < 0.1%
350000 | 32 | 0.3%
340000 | 2 | < 0.1%
265000 | 30 | 0.3%

검색전일가격
Real number (ℝ)

HIGH CORRELATION

Distinct: 5556
Distinct (%): 55.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37879.353
Minimum: 44
Maximum: 612500
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 44
5-th percentile: 1157.95
Q1: 10132.5
median: 20521
Q3: 30624.5
95-th percentile: 128462
Maximum: 612500
Range: 612456
Interquartile range (IQR): 20492

Descriptive statistics

Standard deviation: 75852.139
Coefficient of variation (CV): 2.0024666
Kurtosis: 30.216572
Mean: 37879.353
Median Absolute Deviation (MAD): 10253.5
Skewness: 5.2051768
Sum: 3.7879354 × 10^8
Variance: 5.753547 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
30000 | 145 | 1.5%
28720 | 123 | 1.2%
30250 | 118 | 1.2%
30500 | 113 | 1.1%
16000 | 111 | 1.1%
10000 | 82 | 0.8%
17000 | 65 | 0.7%
18000 | 64 | 0.6%
210000 | 63 | 0.6%
241000 | 62 | 0.6%
Other values (5546) | 9054 | 90.5%

Minimum 10 values

Value | Count | Frequency (%)
44 | 1 | < 0.1%
55 | 1 | < 0.1%
284 | 1 | < 0.1%
331 | 1 | < 0.1%
357 | 1 | < 0.1%
403 | 1 | < 0.1%
441 | 1 | < 0.1%
448 | 1 | < 0.1%
464 | 1 | < 0.1%
490 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
612500 | 52 | 0.5%
532000 | 11 | 0.1%
512000 | 60 | 0.6%
508000 | 12 | 0.1%
365000 | 14 | 0.1%
360500 | 17 | 0.2%
358000 | 3 | < 0.1%
350000 | 32 | 0.3%
340000 | 2 | < 0.1%
265000 | 30 | 0.3%

해당일자
Real number (ℝ)

Distinct: 300
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20234298
Minimum: 20230517
Maximum: 20240511
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 20230517
5-th percentile: 20230530
Q1: 20230804
median: 20231108
Q3: 20240220
95-th percentile: 20240425
Maximum: 20240511
Range: 9994
Interquartile range (IQR): 9416.25

Descriptive statistics

Standard deviation: 4541.8868
Coefficient of variation (CV): 0.00022446476
Kurtosis: -1.6747239
Mean: 20234298
Median Absolute Deviation (MAD): 482
Skewness: 0.56379484
Sum: 2.0234298 × 10^11
Variance: 20628736
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
20240405 | 69 | 0.7%
20240408 | 67 | 0.7%
20240404 | 66 | 0.7%
20240403 | 62 | 0.6%
20231010 | 58 | 0.6%
20240325 | 58 | 0.6%
20230823 | 56 | 0.6%
20230519 | 55 | 0.5%
20230518 | 52 | 0.5%
20240207 | 52 | 0.5%
Other values (290) | 9405 | 94.0%

Minimum 10 values

Value | Count | Frequency (%)
20230517 | 49 | 0.5%
20230518 | 52 | 0.5%
20230519 | 55 | 0.5%
20230520 | 45 | 0.4%
20230522 | 46 | 0.5%
20230523 | 42 | 0.4%
20230524 | 41 | 0.4%
20230525 | 45 | 0.4%
20230526 | 42 | 0.4%
20230527 | 31 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
20240511 | 35 | 0.4%
20240510 | 29 | 0.3%
20240509 | 35 | 0.4%
20240508 | 37 | 0.4%
20240507 | 31 | 0.3%
20240506 | 32 | 0.3%
20240504 | 36 | 0.4%
20240503 | 31 | 0.3%
20240502 | 37 | 0.4%
20240501 | 39 | 0.4%
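
해당일자 is profiled as a real number because its values look like dates encoded as YYYYMMDD integers (an assumption based on the values shown, such as 20230517). Statistics such as the range, standard deviation and IQR therefore describe the integer encoding, not elapsed time. A minimal sketch of parsing the encoding and computing the actual span in days; the helper name `parse_yyyymmdd` is hypothetical:

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Convert an integer such as 20230517 into a date (assumes YYYYMMDD)."""
    return datetime.strptime(str(value), "%Y%m%d").date()


first = parse_yyyymmdd(20230517)  # reported Minimum
last = parse_yyyymmdd(20240511)   # reported Maximum

numeric_range = 20240511 - 20230517  # 9994, the "Range" shown in the report
span_days = (last - first).days      # 360 calendar days

print(numeric_range, span_days)
```

Converting the column to a date type before profiling would give date-aware summaries instead of the numeric ones above.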

Interactions


Correlations

 | 품목 | 단위(등급) | 단위 | 검색일가격 | 검색전일가격 | 해당일자
품목 | 1.000 | 0.842 | 0.996 | 0.974 | 0.974 | 0.315
단위(등급) | 0.842 | 1.000 | 0.564 | 0.259 | 0.259 | 0.003
단위 | 0.996 | 0.564 | 1.000 | 0.955 | 0.955 | 0.182
검색일가격 | 0.974 | 0.259 | 0.955 | 1.000 | 1.000 | 0.136
검색전일가격 | 0.974 | 0.259 | 0.955 | 1.000 | 1.000 | 0.136
해당일자 | 0.315 | 0.003 | 0.182 | 0.136 | 0.136 | 1.000

 | 단위 | 단위(등급) | 품목
단위 | 1.000 | 0.279 | 0.936
단위(등급) | 0.279 | 1.000 | 0.582
품목 | 0.936 | 0.582 | 1.000

 | 검색일가격 | 검색전일가격 | 해당일자 | 품목 | 단위(등급) | 단위
검색일가격 | 1.000 | 1.000 | -0.105 | 0.834 | 0.162 | 0.784
검색전일가격 | 1.000 | 1.000 | -0.105 | 0.834 | 0.162 | 0.784
해당일자 | -0.105 | -0.105 | 1.000 | 0.251 | 0.000 | 0.144
품목 | 0.834 | 0.834 | 0.251 | 1.000 | 0.582 | 0.936
단위(등급) | 0.162 | 0.162 | 0.000 | 0.582 | 1.000 | 0.279
단위 | 0.784 | 0.784 | 0.144 | 0.936 | 0.279 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

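Index | 품목 | 단위(등급) | 전년가격 | 검색전일대비등락율 | 검색일전년대비등락율 | 단위 | 검색일가격 | 검색전일가격 | 해당일자
8903 | 양상추 수입 |  | 0 | 100 | 0 | 7.5키로상자 | 19685 | 19685 | 20230920
9557 | (냉)고등어 수입 |  | 0 | 100 | 0 | 10키로상자 | 37575 | 37575 | 20230901
3967 | 양파 수입 |  | 0 | 100 | 0 | 1키로 | 1254 | 1254 | 20240202
9495 | 브로콜리 수입 |  | 0 | 100 | 0 | 8키로상자 | 19215 | 19215 | 20230902
9175 | 망고 수입 |  | 0 | 100 | 0 | 5키로상자 | 28720 | 28720 | 20230912
1984 | 생표고 수입 |  | 0 | 100 | 0 | 4키로상자 | 23691 | 23691 | 20240329
5214 | 바나나 수입 |  | 0 | 100 | 0 | 13키로상자 | 22569 | 22569 | 20231228
3123 | (선)명태 수입 |  | 0 | 100 | 0 | 10키로상자 | 85670 | 85670 | 20240302
8205 | 바나나 수입 |  | 0 | 100 | 0 | 13키로상자 | 20331 | 20331 | 20231010
2256 | 바나나 수입 |  | 0 | 100 | 0 | 13키로상자 | 30333 | 30333 | 20240323

Index | 품목 | 단위(등급) | 전년가격 | 검색전일대비등락율 | 검색일전년대비등락율 | 단위 | 검색일가격 | 검색전일가격 | 해당일자
5714 | 냉동 조기 수입 |  | 0 | 100 | 0 | 4키로상자 | 28092 | 28092 | 20231215
11463 | 브로콜리 수입 |  | 0 | 100 | 0 | 8키로상자 | 20354 | 20354 | 20230712
7291 | 망고 수입 |  | 0 | 100 | 0 | 5키로상자 | 30250 | 30250 | 20231101
7100 | 바나나 수입 |  | 0 | 100 | 0 | 13키로상자 | 23000 | 23000 | 20231106
4566 | 냉동 가자미 수입 |  | 0 | 100 | 0 | 5키로상자 | 20000 | 20000 | 20240117
7295 | 홍어 수입 |  | 0 | 100 | 0 | 10키로상자 | 84308 | 84308 | 20231101
2484 | 활 점성어 수입 |  | 0 | 100 | 0 | 1키로 | 9695 | 9695 | 20240318
11236 | 파인애플 수입 |  | 0 | 100 | 0 | 12키로상자 | 20654 | 20654 | 20230718
4765 | 양상추 수입 |  | 0 | 100 | 0 | 7.5키로상자 | 17000 | 17000 | 20240111
5004 | 브로콜리 수입 |  | 0 | 100 | 0 | 8키로상자 | 13187 | 13187 | 20240105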

Duplicate rows

Most frequently occurring

Index | 품목 | 단위(등급) | 전년가격 | 검색전일대비등락율 | 검색일전년대비등락율 | 단위 | 검색일가격 | 검색전일가격 | 해당일자 | # duplicates
0 | 당근 수입 |  | 0 | 100 | 0 | 10키로상자 | 7000 | 7000 | 20240423 | 2
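
The "# duplicates" value of 2 means the row occurs twice, which appears to be why the overview reports a single duplicate row: one occurrence is surplus. A minimal sketch of counting repeated rows with the standard library, using the duplicated row above plus one unrelated row from the Sample section; 단위(등급) is omitted and the tuple layout is illustrative:

```python
from collections import Counter

# (품목, 전년가격, 검색전일대비등락율, 검색일전년대비등락율, 단위, 검색일가격, 검색전일가격, 해당일자)
rows = [
    ("당근 수입", "0", "100", "0", "10키로상자", 7000, 7000, 20240423),
    ("양파 수입", "0", "100", "0", "1키로", 1254, 1254, 20240202),
    ("당근 수입", "0", "100", "0", "10키로상자", 7000, 7000, 20240423),
]

# Count every distinct row, then keep those seen more than once.
occurrences = Counter(rows)
duplicated = {row: n for row, n in occurrences.items() if n > 1}
surplus_rows = sum(n - 1 for n in duplicated.values())

for row, n in duplicated.items():
    print(row[0], row[-1], n)
print(surplus_rows)
```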