Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1656
Duplicate rows (%)16.6%
Total size in memory722.7 KiB
Average record size in memory74.0 B

Variable types

Categorical3
Text3
Numeric2

Dataset

Description삼산농산물도매시장의 실시간 농산물 경락가격을 제공함으로써 시장 투명성 확보에 기여<br/>삼산농산물도매시장 실시간 경락정보(경매일자,법인명,품목,품종,거래물량(kg),경락단가(원), 산지,비고)등의 데이터 입니다.<br/>
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=3044829&srcSe=7661IVAWM27C61E190

Alerts

Dataset has 1656 (16.6%) duplicate rowsDuplicates
거래물량(kg) is highly overall correlated with 경락단가(원)High correlation
경락단가(원) is highly overall correlated with 거래물량(kg)High correlation
비고 is highly imbalanced (51.3%)Imbalance

Reproduction

Analysis started2024-04-29 13:44:21.144291
Analysis finished2024-04-29 13:44:22.184475
Duration1.04 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

경매일자
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-10-05
1568 
2023-10-06
1517 
2023-10-09
1473 
2023-10-04
1453 
2023-10-07
1425 
Other values (2)
2564 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-10-04
2nd row2023-10-03
3rd row2023-10-02
4th row2023-10-04
5th row2023-10-09

Common Values

ValueCountFrequency (%)
2023-10-05 1568
15.7%
2023-10-06 1517
15.2%
2023-10-09 1473
14.7%
2023-10-04 1453
14.5%
2023-10-07 1425
14.2%
2023-10-03 1390
13.9%
2023-10-02 1174
11.7%

Length

2024-04-29T22:44:22.234759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:44:22.328737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-05 1568
15.7%
2023-10-06 1517
15.2%
2023-10-09 1473
14.7%
2023-10-04 1453
14.5%
2023-10-07 1425
14.2%
2023-10-03 1390
13.9%
2023-10-02 1174
11.7%

법인명
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
(주)부평농산
6102 
(주)경인농산
2016 
인천원예농협
1882 

Length

Max length7
Median length7
Mean length6.8118
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row(주)부평농산
2nd row(주)부평농산
3rd row(주)부평농산
4th row(주)부평농산
5th row(주)경인농산

Common Values

ValueCountFrequency (%)
(주)부평농산 6102
61.0%
(주)경인농산 2016
 
20.2%
인천원예농협 1882
 
18.8%

Length

2024-04-29T22:44:22.448512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:44:22.550688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주)부평농산 6102
61.0%
주)경인농산 2016
 
20.2%
인천원예농협 1882
 
18.8%

품목
Text

Distinct136
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-29T22:44:22.792779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length2
Mean length2.8203
Min length1

Characters and Unicode

Total characters28203
Distinct characters177
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)0.1%

Sample

1st row수박
2nd row새송이버섯
3rd row깻잎
4th row감자
5th row당근
ValueCountFrequency (%)
포도 767
 
7.6%
떫은감 451
 
4.5%
깻잎 439
 
4.4%
풋고추 437
 
4.3%
단감 408
 
4.0%
오이 370
 
3.7%
고구마 353
 
3.5%
호박 341
 
3.4%
느타리버섯 248
 
2.5%
바나나 243
 
2.4%
Other values (128) 6019
59.7%
2024-04-29T22:44:23.150737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2009
 
7.1%
1376
 
4.9%
1262
 
4.5%
961
 
3.4%
894
 
3.2%
894
 
3.2%
849
 
3.0%
786
 
2.8%
767
 
2.7%
746
 
2.6%
Other values (167) 17659
62.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27739
98.4%
Close Punctuation 183
 
0.6%
Open Punctuation 183
 
0.6%
Space Separator 76
 
0.3%
Other Punctuation 22
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2009
 
7.2%
1376
 
5.0%
1262
 
4.5%
961
 
3.5%
894
 
3.2%
894
 
3.2%
849
 
3.1%
786
 
2.8%
767
 
2.8%
746
 
2.7%
Other values (163) 17195
62.0%
Close Punctuation
ValueCountFrequency (%)
) 183
100.0%
Open Punctuation
ValueCountFrequency (%)
( 183
100.0%
Space Separator
ValueCountFrequency (%)
76
100.0%
Other Punctuation
ValueCountFrequency (%)
, 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27739
98.4%
Common 464
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2009
 
7.2%
1376
 
5.0%
1262
 
4.5%
961
 
3.5%
894
 
3.2%
894
 
3.2%
849
 
3.1%
786
 
2.8%
767
 
2.8%
746
 
2.7%
Other values (163) 17195
62.0%
Common
ValueCountFrequency (%)
) 183
39.4%
( 183
39.4%
76
16.4%
, 22
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27739
98.4%
ASCII 464
 
1.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2009
 
7.2%
1376
 
5.0%
1262
 
4.5%
961
 
3.5%
894
 
3.2%
894
 
3.2%
849
 
3.1%
786
 
2.8%
767
 
2.8%
746
 
2.7%
Other values (163) 17195
62.0%
ASCII
ValueCountFrequency (%)
) 183
39.4%
( 183
39.4%
76
16.4%
, 22
 
4.7%

품종
Text

Distinct244
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-29T22:44:23.407205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length4.7297
Min length2

Characters and Unicode

Total characters47297
Distinct characters247
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)0.4%

Sample

1st row수박(일반)(꼭지절단)
2nd row새송이버섯(일반)
3rd row깻잎(일반)
4th row기타
5th row흙당근
ValueCountFrequency (%)
기타 1072
 
10.7%
샤인마스캇 540
 
5.4%
백다다기 317
 
3.2%
약시 311
 
3.1%
깻잎(일반 301
 
3.0%
송본 293
 
2.9%
청양 266
 
2.7%
바나나(수입 243
 
2.4%
새송이버섯(일반 236
 
2.4%
애호박 228
 
2.3%
Other values (236) 6215
62.0%
2024-04-29T22:44:23.775697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 4032
 
8.5%
( 4032
 
8.5%
3399
 
7.2%
3274
 
6.9%
1598
 
3.4%
1374
 
2.9%
1221
 
2.6%
1012
 
2.1%
1008
 
2.1%
905
 
1.9%
Other values (237) 25442
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39028
82.5%
Close Punctuation 4032
 
8.5%
Open Punctuation 4032
 
8.5%
Decimal Number 182
 
0.4%
Space Separator 22
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3399
 
8.7%
3274
 
8.4%
1598
 
4.1%
1374
 
3.5%
1221
 
3.1%
1012
 
2.6%
1008
 
2.6%
905
 
2.3%
874
 
2.2%
825
 
2.1%
Other values (231) 23538
60.3%
Decimal Number
ValueCountFrequency (%)
1 180
98.9%
3 2
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 4032
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4032
100.0%
Space Separator
ValueCountFrequency (%)
22
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39028
82.5%
Common 8269
 
17.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3399
 
8.7%
3274
 
8.4%
1598
 
4.1%
1374
 
3.5%
1221
 
3.1%
1012
 
2.6%
1008
 
2.6%
905
 
2.3%
874
 
2.2%
825
 
2.1%
Other values (231) 23538
60.3%
Common
ValueCountFrequency (%)
) 4032
48.8%
( 4032
48.8%
1 180
 
2.2%
22
 
0.3%
3 2
 
< 0.1%
, 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39028
82.5%
ASCII 8269
 
17.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 4032
48.8%
( 4032
48.8%
1 180
 
2.2%
22
 
0.3%
3 2
 
< 0.1%
, 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
3399
 
8.7%
3274
 
8.4%
1598
 
4.1%
1374
 
3.5%
1221
 
3.1%
1012
 
2.6%
1008
 
2.6%
905
 
2.3%
874
 
2.2%
825
 
2.1%
Other values (231) 23538
60.3%

거래물량(kg)
Real number (ℝ)

HIGH CORRELATION 

Distinct192
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.8033
Minimum1
Maximum800
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-29T22:44:23.900772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median8
Q320
95-th percentile114
Maximum800
Range799
Interquartile range (IQR)17

Descriptive statistics

Standard deviation61.6398
Coefficient of variation (CV)2.2997094
Kurtosis37.270109
Mean26.8033
Median Absolute Deviation (MAD)6
Skewness5.4666497
Sum268033
Variance3799.465
MonotonicityNot monotonic
2024-04-29T22:44:24.038718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 999
 
10.0%
5 984
 
9.8%
10 905
 
9.0%
2 869
 
8.7%
3 675
 
6.8%
4 516
 
5.2%
6 373
 
3.7%
20 351
 
3.5%
8 334
 
3.3%
7 272
 
2.7%
Other values (182) 3722
37.2%
ValueCountFrequency (%)
1 999
10.0%
2 869
8.7%
3 675
6.8%
4 516
5.2%
5 984
9.8%
6 373
 
3.7%
7 272
 
2.7%
8 334
 
3.3%
9 195
 
1.9%
10 905
9.0%
ValueCountFrequency (%)
800 1
 
< 0.1%
700 1
 
< 0.1%
690 1
 
< 0.1%
640 1
 
< 0.1%
600 2
 
< 0.1%
576 1
 
< 0.1%
560 2
 
< 0.1%
550 24
0.2%
540 1
 
< 0.1%
530 4
 
< 0.1%

경락단가(원)
Real number (ℝ)

HIGH CORRELATION 

Distinct968
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean243197.64
Minimum700
Maximum8800000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-29T22:44:24.168455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum700
5-th percentile13000
Q150000
median120000
Q3285000
95-th percentile855150
Maximum8800000
Range8799300
Interquartile range (IQR)235000

Descriptive statistics

Standard deviation390560.31
Coefficient of variation (CV)1.6059379
Kurtosis71.541251
Mean243197.64
Median Absolute Deviation (MAD)87000
Skewness6.1311714
Sum2.4319764 × 109
Variance1.5253736 × 1011
MonotonicityNot monotonic
2024-04-29T22:44:24.290390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
120000 168
 
1.7%
60000 150
 
1.5%
70000 128
 
1.3%
90000 124
 
1.2%
150000 122
 
1.2%
40000 115
 
1.1%
80000 115
 
1.1%
140000 112
 
1.1%
180000 107
 
1.1%
100000 107
 
1.1%
Other values (958) 8752
87.5%
ValueCountFrequency (%)
700 1
 
< 0.1%
1800 1
 
< 0.1%
2000 4
 
< 0.1%
2500 2
 
< 0.1%
3000 8
0.1%
3500 1
 
< 0.1%
4000 10
0.1%
4100 1
 
< 0.1%
4300 2
 
< 0.1%
4400 1
 
< 0.1%
ValueCountFrequency (%)
8800000 1
< 0.1%
7680000 1
< 0.1%
7000000 1
< 0.1%
5910000 1
< 0.1%
5400000 1
< 0.1%
4752000 1
< 0.1%
4514000 1
< 0.1%
4200000 1
< 0.1%
4024000 1
< 0.1%
3801000 1
< 0.1%

산지
Text

Distinct131
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-29T22:44:24.547682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length6
Mean length5.564
Min length2

Characters and Unicode

Total characters55640
Distinct characters100
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row충북 음성군
2nd row전남 장성군
3rd row충남 논산시
4th row강원 평창군
5th row강원 평창군
ValueCountFrequency (%)
경기 1843
 
9.8%
충남 1754
 
9.3%
경북 1602
 
8.5%
강원 1407
 
7.5%
국외 951
 
5.0%
논산시 858
 
4.6%
청도군 645
 
3.4%
인천 482
 
2.6%
경남 469
 
2.5%
평창군 451
 
2.4%
Other values (124) 8385
44.5%
2024-04-29T22:44:24.929111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8847
15.9%
4455
 
8.0%
4304
 
7.7%
3746
 
6.7%
2720
 
4.9%
2282
 
4.1%
2127
 
3.8%
1926
 
3.5%
1916
 
3.4%
1699
 
3.1%
Other values (90) 21618
38.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46793
84.1%
Space Separator 8847
 
15.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4455
 
9.5%
4304
 
9.2%
3746
 
8.0%
2720
 
5.8%
2282
 
4.9%
2127
 
4.5%
1926
 
4.1%
1916
 
4.1%
1699
 
3.6%
1608
 
3.4%
Other values (89) 20010
42.8%
Space Separator
ValueCountFrequency (%)
8847
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46793
84.1%
Common 8847
 
15.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4455
 
9.5%
4304
 
9.2%
3746
 
8.0%
2720
 
5.8%
2282
 
4.9%
2127
 
4.5%
1926
 
4.1%
1916
 
4.1%
1699
 
3.6%
1608
 
3.4%
Other values (89) 20010
42.8%
Common
ValueCountFrequency (%)
8847
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46793
84.1%
ASCII 8847
 
15.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8847
100.0%
Hangul
ValueCountFrequency (%)
4455
 
9.5%
4304
 
9.2%
3746
 
8.0%
2720
 
5.8%
2282
 
4.9%
2127
 
4.5%
1926
 
4.1%
1916
 
4.1%
1699
 
3.6%
1608
 
3.4%
Other values (89) 20010
42.8%

비고
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경매
8941 
정가수의
1059 

Length

Max length4
Median length2
Mean length2.2118
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경매
2nd row경매
3rd row경매
4th row경매
5th row경매

Common Values

ValueCountFrequency (%)
경매 8941
89.4%
정가수의 1059
 
10.6%

Length

2024-04-29T22:44:25.056970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:44:25.150419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경매 8941
89.4%
정가수의 1059
 
10.6%

Interactions

2024-04-29T22:44:21.814022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:44:21.635924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:44:21.926008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:44:21.730700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-29T22:44:25.206502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
경매일자법인명거래물량(kg)경락단가(원)비고
경매일자1.0000.1210.0350.0520.052
법인명0.1211.0000.0570.0410.167
거래물량(kg)0.0350.0571.0000.8860.071
경락단가(원)0.0520.0410.8861.0000.013
비고0.0520.1670.0710.0131.000
2024-04-29T22:44:25.289944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인명경매일자비고
법인명1.0000.0810.275
경매일자0.0811.0000.055
비고0.2750.0551.000
2024-04-29T22:44:25.574740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
거래물량(kg)경락단가(원)경매일자법인명비고
거래물량(kg)1.0000.7470.0170.0340.054
경락단가(원)0.7471.0000.0260.0240.010
경매일자0.0170.0261.0000.0810.055
법인명0.0340.0240.0811.0000.275
비고0.0540.0100.0550.2751.000

Missing values

2024-04-29T22:44:22.026626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-29T22:44:22.131104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

경매일자법인명품목품종거래물량(kg)경락단가(원)산지비고
340002023-10-04(주)부평농산수박수박(일반)(꼭지절단)20360000충북 음성군경매
139602023-10-03(주)부평농산새송이버섯새송이버섯(일반)1073000전남 장성군경매
49272023-10-02(주)부평농산깻잎깻잎(일반)121000충남 논산시경매
328162023-10-04(주)부평농산감자기타18270000강원 평창군경매
983462023-10-09(주)경인농산당근흙당근15675000강원 평창군경매
61122023-10-02(주)부평농산당근기타755400000경남 김해시경매
935342023-10-09(주)부평농산풋고추청양256000광주 남구경매
457052023-10-05(주)부평농산깻잎깻잎순16500충남 논산시경매
913592023-10-09(주)부평농산떫은감약시424000경북 청도군경매
272152023-10-04(주)부평농산오이백다다기18000경기 안성시경매
경매일자법인명품목품종거래물량(kg)경락단가(원)산지비고
987122023-10-09인천원예농협대파대파(일반)5501265000강원 횡성군경매
208552023-10-03(주)경인농산양상추양상추(일반)3114000전남 영암군경매
50852023-10-02(주)부평농산기타150390000충남 논산시경매
580612023-10-06(주)부평농산감자기타4164000강원 양구군경매
811842023-10-07(주)경인농산포도샤인마스캇44308000경북 김천시경매
785252023-10-07(주)부평농산감귤조생귤26156000경남 함안군경매
965782023-10-09(주)경인농산팽이버섯팽이1호590000경북경매
407542023-10-05(주)부평농산피망(단고추)피망(일반)118500강원 평창군경매
143272023-10-03(주)부평농산만가닥만가닥(일반)565000경북 청도군경매
987552023-10-09인천원예농협부추일반부추100185000인천 계양구경매

Duplicate rows

Most frequently occurring

경매일자법인명품목품종거래물량(kg)경락단가(원)산지비고# duplicates
8752023-10-05(주)부평농산표고버섯표고버섯(수입)211000국외경매13
3772023-10-03(주)부평농산파인애플파인애플(수입)3120000국외정가수의12
552023-10-02(주)부평농산느타리버섯느타리버섯(일반)557500경기 양평군경매10
1582023-10-02(주)부평농산팽이버섯팽이1호5120000경북 청도군경매10
7982023-10-05(주)부평농산블루베리블루베리(수입)7266000국외정가수의10
15722023-10-09(주)부평농산양상추양상추(일반)5175000강원 횡성군경매10
16192023-10-09(주)부평농산포도샤인마스캇40340000경북 경산시경매10
2892023-10-03(주)부평농산레몬레몬(수입)3195000국외정가수의9
5082023-10-04(주)부평농산느타리버섯느타리버섯(일반)537500경기 여주군경매9
7412023-10-05(주)부평농산느타리버섯느타리버섯(일반)535000경기 여주군경매9