Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1703
Duplicate rows (%)17.0%
Total size in memory722.7 KiB
Average record size in memory74.0 B

Variable types

Categorical3
Text3
Numeric2

Dataset

Description삼산농산물도매시장의 실시간 농산물 경락가격을 제공함으로써 시장 투명성 확보에 기여<br/>삼산농산물도매시장 실시간 경락정보(경매일자,법인명,품목,품종,거래물량(kg),경락단가(원), 산지,비고)등의 데이터 입니다.<br/>
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=3044829&srcSe=7661IVAWM27C61E190

Alerts

Dataset has 1703 (17.0%) duplicate rowsDuplicates
거래물량(kg) is highly overall correlated with 경락단가(원)High correlation
경락단가(원) is highly overall correlated with 거래물량(kg)High correlation
비고 is highly imbalanced (51.2%)Imbalance

Reproduction

Analysis started2024-04-29 13:44:15.132007
Analysis finished2024-04-29 13:44:16.109831
Duration0.98 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

경매일자
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-10-05
1570 
2023-10-06
1559 
2023-10-07
1500 
2023-10-09
1489 
2023-10-04
1379 
Other values (2)
2503 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-10-05
2nd row2023-10-05
3rd row2023-10-06
4th row2023-10-07
5th row2023-10-06

Common Values

ValueCountFrequency (%)
2023-10-05 1570
15.7%
2023-10-06 1559
15.6%
2023-10-07 1500
15.0%
2023-10-09 1489
14.9%
2023-10-04 1379
13.8%
2023-10-03 1304
13.0%
2023-10-02 1199
12.0%

Length

2024-04-29T22:44:16.163518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:44:16.259961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-05 1570
15.7%
2023-10-06 1559
15.6%
2023-10-07 1500
15.0%
2023-10-09 1489
14.9%
2023-10-04 1379
13.8%
2023-10-03 1304
13.0%
2023-10-02 1199
12.0%

법인명
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
(주)부평농산
6142 
인천원예농협
1933 
(주)경인농산
1925 

Length

Max length7
Median length7
Mean length6.8067
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row(주)부평농산
2nd row(주)경인농산
3rd row(주)부평농산
4th row(주)경인농산
5th row(주)경인농산

Common Values

ValueCountFrequency (%)
(주)부평농산 6142
61.4%
인천원예농협 1933
 
19.3%
(주)경인농산 1925
 
19.2%

Length

2024-04-29T22:44:16.583471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:44:16.671420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주)부평농산 6142
61.4%
인천원예농협 1933
 
19.3%
주)경인농산 1925
 
19.2%

품목
Text

Distinct139
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-29T22:44:16.905235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length2.876
Min length1

Characters and Unicode

Total characters28760
Distinct characters186
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)0.1%

Sample

1st row얼갈이배추
2nd row양배추
3rd row감귤
4th row포도
5th row포도
ValueCountFrequency (%)
포도 740
 
7.3%
풋고추 484
 
4.8%
단감 436
 
4.3%
깻잎 432
 
4.3%
떫은감 417
 
4.1%
고구마 396
 
3.9%
오이 336
 
3.3%
호박 314
 
3.1%
새송이버섯 254
 
2.5%
느타리버섯 242
 
2.4%
Other values (131) 6059
59.9%
2024-04-29T22:44:17.274977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2081
 
7.2%
1491
 
5.2%
1208
 
4.2%
931
 
3.2%
906
 
3.2%
877
 
3.0%
877
 
3.0%
757
 
2.6%
740
 
2.6%
732
 
2.5%
Other values (176) 18160
63.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28196
98.0%
Open Punctuation 209
 
0.7%
Close Punctuation 209
 
0.7%
Space Separator 110
 
0.4%
Other Punctuation 36
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2081
 
7.4%
1491
 
5.3%
1208
 
4.3%
931
 
3.3%
906
 
3.2%
877
 
3.1%
877
 
3.1%
757
 
2.7%
740
 
2.6%
732
 
2.6%
Other values (172) 17596
62.4%
Open Punctuation
ValueCountFrequency (%)
( 209
100.0%
Close Punctuation
ValueCountFrequency (%)
) 209
100.0%
Space Separator
ValueCountFrequency (%)
110
100.0%
Other Punctuation
ValueCountFrequency (%)
, 36
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28196
98.0%
Common 564
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2081
 
7.4%
1491
 
5.3%
1208
 
4.3%
931
 
3.3%
906
 
3.2%
877
 
3.1%
877
 
3.1%
757
 
2.7%
740
 
2.6%
732
 
2.6%
Other values (172) 17596
62.4%
Common
ValueCountFrequency (%)
( 209
37.1%
) 209
37.1%
110
19.5%
, 36
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28196
98.0%
ASCII 564
 
2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2081
 
7.4%
1491
 
5.3%
1208
 
4.3%
931
 
3.3%
906
 
3.2%
877
 
3.1%
877
 
3.1%
757
 
2.7%
740
 
2.6%
732
 
2.6%
Other values (172) 17596
62.4%
ASCII
ValueCountFrequency (%)
( 209
37.1%
) 209
37.1%
110
19.5%
, 36
 
6.4%

품종
Text

Distinct243
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-29T22:44:17.479769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length4.7466
Min length2

Characters and Unicode

Total characters47466
Distinct characters250
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)0.3%

Sample

1st row알배기
2nd row양배추(일반)
3rd row하우스감귤
4th row샤인마스캇
5th row샤인마스캇
ValueCountFrequency (%)
기타 1081
 
10.8%
샤인마스캇 514
 
5.1%
송본 322
 
3.2%
깻잎(일반 302
 
3.0%
약시 302
 
3.0%
백다다기 295
 
2.9%
청양 285
 
2.8%
새송이버섯(일반 254
 
2.5%
바나나(수입 230
 
2.3%
양배추(일반 197
 
2.0%
Other values (234) 6255
62.3%
2024-04-29T22:44:17.817199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 4065
 
8.6%
) 4065
 
8.6%
3383
 
7.1%
3280
 
6.9%
1592
 
3.4%
1397
 
2.9%
1207
 
2.5%
1023
 
2.2%
1014
 
2.1%
983
 
2.1%
Other values (240) 25457
53.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39150
82.5%
Open Punctuation 4065
 
8.6%
Close Punctuation 4065
 
8.6%
Decimal Number 148
 
0.3%
Space Separator 37
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3383
 
8.6%
3280
 
8.4%
1592
 
4.1%
1397
 
3.6%
1207
 
3.1%
1023
 
2.6%
1014
 
2.6%
983
 
2.5%
888
 
2.3%
868
 
2.2%
Other values (234) 23515
60.1%
Decimal Number
ValueCountFrequency (%)
1 145
98.0%
3 3
 
2.0%
Open Punctuation
ValueCountFrequency (%)
( 4065
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4065
100.0%
Space Separator
ValueCountFrequency (%)
37
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39150
82.5%
Common 8316
 
17.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3383
 
8.6%
3280
 
8.4%
1592
 
4.1%
1397
 
3.6%
1207
 
3.1%
1023
 
2.6%
1014
 
2.6%
983
 
2.5%
888
 
2.3%
868
 
2.2%
Other values (234) 23515
60.1%
Common
ValueCountFrequency (%)
( 4065
48.9%
) 4065
48.9%
1 145
 
1.7%
37
 
0.4%
3 3
 
< 0.1%
, 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39150
82.5%
ASCII 8316
 
17.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 4065
48.9%
) 4065
48.9%
1 145
 
1.7%
37
 
0.4%
3 3
 
< 0.1%
, 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
3383
 
8.6%
3280
 
8.4%
1592
 
4.1%
1397
 
3.6%
1207
 
3.1%
1023
 
2.6%
1014
 
2.6%
983
 
2.5%
888
 
2.3%
868
 
2.2%
Other values (234) 23515
60.1%

거래물량(kg)
Real number (ℝ)

HIGH CORRELATION 

Distinct193
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.8928
Minimum1
Maximum1098
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-29T22:44:17.939943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median8
Q320
95-th percentile120
Maximum1098
Range1097
Interquartile range (IQR)17

Descriptive statistics

Standard deviation63.364888
Coefficient of variation (CV)2.3562027
Kurtosis44.398942
Mean26.8928
Median Absolute Deviation (MAD)6
Skewness5.7552344
Sum268928
Variance4015.109
MonotonicityNot monotonic
2024-04-29T22:44:18.053020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1034
 
10.3%
5 968
 
9.7%
10 947
 
9.5%
2 835
 
8.3%
3 691
 
6.9%
4 517
 
5.2%
6 361
 
3.6%
20 355
 
3.5%
8 333
 
3.3%
7 302
 
3.0%
Other values (183) 3657
36.6%
ValueCountFrequency (%)
1 1034
10.3%
2 835
8.3%
3 691
6.9%
4 517
5.2%
5 968
9.7%
6 361
 
3.6%
7 302
 
3.0%
8 333
 
3.3%
9 201
 
2.0%
10 947
9.5%
ValueCountFrequency (%)
1098 1
 
< 0.1%
836 1
 
< 0.1%
750 3
 
< 0.1%
700 1
 
< 0.1%
690 1
 
< 0.1%
668 1
 
< 0.1%
600 5
0.1%
550 10
0.1%
540 1
 
< 0.1%
530 4
 
< 0.1%

경락단가(원)
Real number (ℝ)

HIGH CORRELATION 

Distinct963
Distinct (%)9.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean241531.81
Minimum1500
Maximum10868000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-29T22:44:18.177355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1500
5-th percentile13000
Q150000
median120000
Q3270000
95-th percentile876200
Maximum10868000
Range10866500
Interquartile range (IQR)220000

Descriptive statistics

Standard deviation397228.88
Coefficient of variation (CV)1.6446234
Kurtosis115.75358
Mean241531.81
Median Absolute Deviation (MAD)85000
Skewness7.1944497
Sum2.4153181 × 109
Variance1.5779078 × 1011
MonotonicityNot monotonic
2024-04-29T22:44:18.300482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
60000 167
 
1.7%
120000 160
 
1.6%
90000 129
 
1.3%
40000 129
 
1.3%
100000 120
 
1.2%
24000 120
 
1.2%
70000 110
 
1.1%
80000 110
 
1.1%
140000 110
 
1.1%
240000 107
 
1.1%
Other values (953) 8738
87.4%
ValueCountFrequency (%)
1500 1
 
< 0.1%
2000 7
0.1%
2500 3
 
< 0.1%
3000 9
0.1%
3100 1
 
< 0.1%
3500 1
 
< 0.1%
3750 1
 
< 0.1%
4000 8
0.1%
4200 1
 
< 0.1%
4300 3
 
< 0.1%
ValueCountFrequency (%)
10868000 1
< 0.1%
10020000 1
< 0.1%
7000000 1
< 0.1%
5910000 1
< 0.1%
4860000 1
< 0.1%
4200000 1
< 0.1%
4024000 1
< 0.1%
3801000 1
< 0.1%
3800000 1
< 0.1%
3500000 1
< 0.1%

산지
Text

Distinct133
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-29T22:44:18.570982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length6
Mean length5.5718
Min length2

Characters and Unicode

Total characters55718
Distinct characters107
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.1%

Sample

1st row강원 평창군
2nd row충남 서산시
3rd row제주 제주시
4th row경북 김천시
5th row경북 김천시
ValueCountFrequency (%)
경기 1851
 
9.8%
충남 1789
 
9.5%
경북 1510
 
8.0%
강원 1405
 
7.4%
국외 980
 
5.2%
논산시 862
 
4.6%
청도군 581
 
3.1%
경남 501
 
2.7%
인천 500
 
2.6%
평창군 433
 
2.3%
Other values (124) 8460
44.8%
2024-04-29T22:44:18.946607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8872
15.9%
4503
 
8.1%
4248
 
7.6%
3682
 
6.6%
2801
 
5.0%
2310
 
4.1%
2067
 
3.7%
1963
 
3.5%
1912
 
3.4%
1691
 
3.0%
Other values (97) 21669
38.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46846
84.1%
Space Separator 8872
 
15.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4503
 
9.6%
4248
 
9.1%
3682
 
7.9%
2801
 
6.0%
2310
 
4.9%
2067
 
4.4%
1963
 
4.2%
1912
 
4.1%
1691
 
3.6%
1654
 
3.5%
Other values (96) 20015
42.7%
Space Separator
ValueCountFrequency (%)
8872
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46846
84.1%
Common 8872
 
15.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4503
 
9.6%
4248
 
9.1%
3682
 
7.9%
2801
 
6.0%
2310
 
4.9%
2067
 
4.4%
1963
 
4.2%
1912
 
4.1%
1691
 
3.6%
1654
 
3.5%
Other values (96) 20015
42.7%
Common
ValueCountFrequency (%)
8872
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46846
84.1%
ASCII 8872
 
15.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8872
100.0%
Hangul
ValueCountFrequency (%)
4503
 
9.6%
4248
 
9.1%
3682
 
7.9%
2801
 
6.0%
2310
 
4.9%
2067
 
4.4%
1963
 
4.2%
1912
 
4.1%
1691
 
3.6%
1654
 
3.5%
Other values (96) 20015
42.7%

비고
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경매
8938 
정가수의
1062 

Length

Max length4
Median length2
Mean length2.2124
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경매
2nd row경매
3rd row경매
4th row경매
5th row경매

Common Values

ValueCountFrequency (%)
경매 8938
89.4%
정가수의 1062
 
10.6%

Length

2024-04-29T22:44:19.084101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:44:19.174432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경매 8938
89.4%
정가수의 1062
 
10.6%

Interactions

2024-04-29T22:44:15.771833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:44:15.619892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:44:15.848940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:44:15.692145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-29T22:44:19.232518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
경매일자법인명거래물량(kg)경락단가(원)비고
경매일자1.0000.1220.0500.0510.068
법인명0.1221.0000.1300.0490.166
거래물량(kg)0.0500.1301.0000.6410.058
경락단가(원)0.0510.0490.6411.0000.046
비고0.0680.1660.0580.0461.000
2024-04-29T22:44:19.313112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인명경매일자비고
법인명1.0000.0820.273
경매일자0.0821.0000.073
비고0.2730.0731.000
2024-04-29T22:44:19.395868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
거래물량(kg)경락단가(원)경매일자법인명비고
거래물량(kg)1.0000.7470.0260.0570.058
경락단가(원)0.7471.0000.0270.0310.035
경매일자0.0260.0271.0000.0820.073
법인명0.0570.0310.0821.0000.273
비고0.0580.0350.0730.2731.000

Missing values

2024-04-29T22:44:15.951903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-29T22:44:16.056225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

경매일자법인명품목품종거래물량(kg)경락단가(원)산지비고
406892023-10-05(주)부평농산얼갈이배추알배기20140000강원 평창군경매
509192023-10-05(주)경인농산양배추양배추(일반)14126000충남 서산시경매
587302023-10-06(주)부평농산감귤하우스감귤16112000제주 제주시경매
817222023-10-07(주)경인농산포도샤인마스캇116500경북 김천시경매
665852023-10-06(주)경인농산포도샤인마스캇53450500경북 김천시경매
83882023-10-02(주)경인농산오이백다다기222000경기 연천군경매
81872023-10-02(주)경인농산깻잎깻잎(일반)9225000충남 논산시경매
651822023-10-06(주)경인농산얼갈이배추알배기17119000전남 해남군경매
959222023-10-09(주)경인농산미나리돌미나리256000경기 시흥시경매
649662023-10-06(주)경인농산대파대파(일반)150495000경기 양주군경매
경매일자법인명품목품종거래물량(kg)경락단가(원)산지비고
894692023-10-09(주)부평농산만가닥만가닥(일반)222000경북 청도군경매
487152023-10-05(주)경인농산대파대파(일반)4501215000경기경매
621812023-10-06(주)부평농산풋고추청양264000강원 평창군경매
496202023-10-05(주)경인농산새송이버섯새송이버섯(일반)10120000충북 음성군경매
605762023-10-06(주)부평농산꽈리고추꽈리고추(일반)225000강원 평창군경매
536592023-10-05인천원예농협토마토완숙토마토16000강원 춘천시경매
214042023-10-03(주)경인농산풋고추청양252000강원 홍천군경매
7342023-10-02(주)부평농산실파실파(일반)60210000경기 고양시 일산서구경매
207972023-10-03(주)경인농산깻잎깻잎(일반)10210000충남 논산시경매
572682023-10-06(주)부평농산단감송본590000경남 창원시경매

Duplicate rows

Most frequently occurring

경매일자법인명품목품종거래물량(kg)경락단가(원)산지비고# duplicates
12422023-10-07(주)부평농산느타리버섯느타리버섯(일반)550000경기 양평군경매12
3452023-10-03(주)부평농산양상추양상추(일반)5190000강원 횡성군경매11
3682023-10-03(주)부평농산파인애플파인애플(수입)3120000국외정가수의11
5462023-10-04(주)부평농산브로코리(녹색꽃양배추)브로코리(수입)234000국외경매11
3162023-10-03(주)부평농산바나나바나나(수입)10310000국외정가수의10
4872023-10-04(주)부평농산느타리버섯느타리버섯(일반)537500경기 여주군경매10
5962023-10-04(주)부평농산팽이버섯팽이1호575000경북 청도군경매10
7802023-10-05(주)부평농산브로코리(녹색꽃양배추)브로코리(수입)235000국외경매10
8672023-10-05(주)부평농산표고버섯표고버섯(수입)211000국외경매10
1312023-10-02(주)부평농산양상추양상추(일반)10380000강원 횡성군경매9