Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1711
Duplicate rows (%)17.1%
Total size in memory722.7 KiB
Average record size in memory74.0 B

Variable types

Categorical3
Text3
Numeric2

Dataset

Description삼산농산물도매시장의 실시간 농산물 경락가격을 제공함으로써 시장 투명성 확보에 기여<br/>삼산농산물도매시장 실시간 경락정보(경매일자,법인명,품목,품종,거래물량(kg),경락단가(원), 산지,비고)등의 데이터 입니다.<br/>
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=3044829&srcSe=7661IVAWM27C61E190

Alerts

Dataset has 1711 (17.1%) duplicate rowsDuplicates
거래물량(kg) is highly overall correlated with 경락단가(원)High correlation
경락단가(원) is highly overall correlated with 거래물량(kg)High correlation

Reproduction

Analysis started2024-04-29 13:44:45.427934
Analysis finished2024-04-29 13:44:46.676434
Duration1.25 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

경매일자
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-10-05
1608 
2023-10-06
1585 
2023-10-07
1552 
2023-10-03
1461 
2023-10-04
1418 
Other values (2)
2376 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-10-03
2nd row2023-10-05
3rd row2023-10-04
4th row2023-10-03
5th row2023-10-03

Common Values

ValueCountFrequency (%)
2023-10-05 1608
16.1%
2023-10-06 1585
15.8%
2023-10-07 1552
15.5%
2023-10-03 1461
14.6%
2023-10-04 1418
14.2%
2023-10-02 1233
12.3%
2023-10-09 1143
11.4%

Length

2024-04-29T22:44:46.727440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:44:46.826272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-05 1608
16.1%
2023-10-06 1585
15.8%
2023-10-07 1552
15.5%
2023-10-03 1461
14.6%
2023-10-04 1418
14.2%
2023-10-02 1233
12.3%
2023-10-09 1143
11.4%

법인명
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
(주)부평농산
6222 
인천원예농협
1949 
(주)경인농산
1829 

Length

Max length7
Median length7
Mean length6.8051
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인천원예농협
2nd row인천원예농협
3rd row(주)부평농산
4th row(주)경인농산
5th row(주)경인농산

Common Values

ValueCountFrequency (%)
(주)부평농산 6222
62.2%
인천원예농협 1949
 
19.5%
(주)경인농산 1829
 
18.3%

Length

2024-04-29T22:44:46.932823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:44:47.027564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주)부평농산 6222
62.2%
인천원예농협 1949
 
19.5%
주)경인농산 1829
 
18.3%

품목
Text

Distinct135
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-29T22:44:47.279357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length2.8412
Min length1

Characters and Unicode

Total characters28412
Distinct characters180
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)0.1%

Sample

1st row메론
2nd row호박
3rd row풋고추
4th row꽈리고추
5th row쑥갓
ValueCountFrequency (%)
포도 715
 
7.1%
풋고추 476
 
4.7%
떫은감 475
 
4.7%
단감 439
 
4.4%
깻잎 390
 
3.9%
오이 365
 
3.6%
고구마 352
 
3.5%
호박 321
 
3.2%
느타리버섯 258
 
2.6%
새송이버섯 252
 
2.5%
Other values (127) 6017
59.8%
2024-04-29T22:44:47.657952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2120
 
7.5%
1370
 
4.8%
1282
 
4.5%
981
 
3.5%
912
 
3.2%
912
 
3.2%
830
 
2.9%
736
 
2.6%
722
 
2.5%
715
 
2.5%
Other values (170) 17832
62.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27922
98.3%
Close Punctuation 199
 
0.7%
Open Punctuation 199
 
0.7%
Space Separator 60
 
0.2%
Other Punctuation 32
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2120
 
7.6%
1370
 
4.9%
1282
 
4.6%
981
 
3.5%
912
 
3.3%
912
 
3.3%
830
 
3.0%
736
 
2.6%
722
 
2.6%
715
 
2.6%
Other values (166) 17342
62.1%
Close Punctuation
ValueCountFrequency (%)
) 199
100.0%
Open Punctuation
ValueCountFrequency (%)
( 199
100.0%
Space Separator
ValueCountFrequency (%)
60
100.0%
Other Punctuation
ValueCountFrequency (%)
, 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27922
98.3%
Common 490
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2120
 
7.6%
1370
 
4.9%
1282
 
4.6%
981
 
3.5%
912
 
3.3%
912
 
3.3%
830
 
3.0%
736
 
2.6%
722
 
2.6%
715
 
2.6%
Other values (166) 17342
62.1%
Common
ValueCountFrequency (%)
) 199
40.6%
( 199
40.6%
60
 
12.2%
, 32
 
6.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27922
98.3%
ASCII 490
 
1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2120
 
7.6%
1370
 
4.9%
1282
 
4.6%
981
 
3.5%
912
 
3.3%
912
 
3.3%
830
 
3.0%
736
 
2.6%
722
 
2.6%
715
 
2.6%
Other values (166) 17342
62.1%
ASCII
ValueCountFrequency (%)
) 199
40.6%
( 199
40.6%
60
 
12.2%
, 32
 
6.5%

품종
Text

Distinct244
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-29T22:44:47.903507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length4.707
Min length2

Characters and Unicode

Total characters47070
Distinct characters253
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)0.3%

Sample

1st row네트계
2nd row애호박
3rd row오이고추
4th row꽈리고추(일반)
5th row쑥갓(일반)
ValueCountFrequency (%)
기타 1137
 
11.3%
샤인마스캇 495
 
4.9%
약시 350
 
3.5%
백다다기 315
 
3.1%
송본 311
 
3.1%
청양 274
 
2.7%
깻잎(일반 271
 
2.7%
새송이버섯(일반 252
 
2.5%
바나나(수입 235
 
2.3%
애호박 212
 
2.1%
Other values (236) 6174
61.6%
2024-04-29T22:44:48.287386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 4019
 
8.5%
) 4019
 
8.5%
3349
 
7.1%
3236
 
6.9%
1640
 
3.5%
1456
 
3.1%
1245
 
2.6%
1014
 
2.2%
989
 
2.1%
887
 
1.9%
Other values (243) 25216
53.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 38808
82.4%
Open Punctuation 4019
 
8.5%
Close Punctuation 4019
 
8.5%
Decimal Number 195
 
0.4%
Space Separator 26
 
0.1%
Other Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3349
 
8.6%
3236
 
8.3%
1640
 
4.2%
1456
 
3.8%
1245
 
3.2%
1014
 
2.6%
989
 
2.5%
887
 
2.3%
863
 
2.2%
771
 
2.0%
Other values (237) 23358
60.2%
Decimal Number
ValueCountFrequency (%)
1 192
98.5%
3 3
 
1.5%
Open Punctuation
ValueCountFrequency (%)
( 4019
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4019
100.0%
Space Separator
ValueCountFrequency (%)
26
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 38808
82.4%
Common 8262
 
17.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3349
 
8.6%
3236
 
8.3%
1640
 
4.2%
1456
 
3.8%
1245
 
3.2%
1014
 
2.6%
989
 
2.5%
887
 
2.3%
863
 
2.2%
771
 
2.0%
Other values (237) 23358
60.2%
Common
ValueCountFrequency (%)
( 4019
48.6%
) 4019
48.6%
1 192
 
2.3%
26
 
0.3%
, 3
 
< 0.1%
3 3
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 38808
82.4%
ASCII 8262
 
17.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 4019
48.6%
) 4019
48.6%
1 192
 
2.3%
26
 
0.3%
, 3
 
< 0.1%
3 3
 
< 0.1%
Hangul
ValueCountFrequency (%)
3349
 
8.6%
3236
 
8.3%
1640
 
4.2%
1456
 
3.8%
1245
 
3.2%
1014
 
2.6%
989
 
2.5%
887
 
2.3%
863
 
2.2%
771
 
2.0%
Other values (237) 23358
60.2%

거래물량(kg)
Real number (ℝ)

HIGH CORRELATION 

Distinct190
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.4788
Minimum1
Maximum836
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-29T22:44:48.418633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median8
Q320
95-th percentile120
Maximum836
Range835
Interquartile range (IQR)17

Descriptive statistics

Standard deviation63.740085
Coefficient of variation (CV)2.3196095
Kurtosis34.893956
Mean27.4788
Median Absolute Deviation (MAD)6
Skewness5.2929045
Sum274788
Variance4062.7984
MonotonicityNot monotonic
2024-04-29T22:44:48.528198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1017
 
10.2%
5 1015
 
10.2%
10 1005
 
10.1%
2 832
 
8.3%
3 654
 
6.5%
4 548
 
5.5%
20 347
 
3.5%
6 339
 
3.4%
8 322
 
3.2%
7 297
 
3.0%
Other values (180) 3624
36.2%
ValueCountFrequency (%)
1 1017
10.2%
2 832
8.3%
3 654
6.5%
4 548
5.5%
5 1015
10.2%
6 339
 
3.4%
7 297
 
3.0%
8 322
 
3.2%
9 203
 
2.0%
10 1005
10.1%
ValueCountFrequency (%)
836 1
 
< 0.1%
690 2
 
< 0.1%
640 2
 
< 0.1%
600 3
 
< 0.1%
576 1
 
< 0.1%
560 3
 
< 0.1%
550 23
0.2%
540 1
 
< 0.1%
530 1
 
< 0.1%
522 1
 
< 0.1%

경락단가(원)
Real number (ℝ)

HIGH CORRELATION 

Distinct969
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean245130.64
Minimum1000
Maximum10868000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-29T22:44:48.666669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile13000
Q148000
median120000
Q3280000
95-th percentile896000
Maximum10868000
Range10867000
Interquartile range (IQR)232000

Descriptive statistics

Standard deviation410059.97
Coefficient of variation (CV)1.6728222
Kurtosis102.11825
Mean245130.64
Median Absolute Deviation (MAD)86000
Skewness7.0290241
Sum2.4513064 × 109
Variance1.6814918 × 1011
MonotonicityNot monotonic
2024-04-29T22:44:48.794459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
120000 171
 
1.7%
60000 162
 
1.6%
90000 130
 
1.3%
180000 125
 
1.2%
40000 119
 
1.2%
100000 118
 
1.2%
80000 118
 
1.2%
140000 118
 
1.2%
30000 114
 
1.1%
36000 113
 
1.1%
Other values (959) 8712
87.1%
ValueCountFrequency (%)
1000 2
 
< 0.1%
1500 4
 
< 0.1%
2000 1
 
< 0.1%
2500 2
 
< 0.1%
2700 1
 
< 0.1%
3000 8
 
0.1%
4000 10
 
0.1%
4250 1
 
< 0.1%
4500 8
 
0.1%
5000 27
0.3%
ValueCountFrequency (%)
10868000 1
< 0.1%
9128000 1
< 0.1%
7680000 1
< 0.1%
5400000 2
< 0.1%
5280000 1
< 0.1%
5250000 1
< 0.1%
4800000 1
< 0.1%
4620000 1
< 0.1%
3895000 1
< 0.1%
3801000 1
< 0.1%

산지
Text

Distinct135
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-29T22:44:49.050439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length6
Mean length5.5598
Min length2

Characters and Unicode

Total characters55598
Distinct characters108
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)0.1%

Sample

1st row경남 진주시
2nd row경기 연천군
3rd row강원 양구군
4th row강원 평창군
5th row경기
ValueCountFrequency (%)
경기 1869
 
9.9%
충남 1691
 
9.0%
경북 1580
 
8.4%
강원 1410
 
7.5%
국외 994
 
5.3%
논산시 822
 
4.4%
청도군 682
 
3.6%
경남 532
 
2.8%
인천 490
 
2.6%
평창군 426
 
2.3%
Other values (128) 8352
44.3%
2024-04-29T22:44:49.421065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8848
15.9%
4418
 
7.9%
4354
 
7.8%
3780
 
6.8%
2745
 
4.9%
2178
 
3.9%
2080
 
3.7%
1947
 
3.5%
1928
 
3.5%
1641
 
3.0%
Other values (98) 21679
39.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46750
84.1%
Space Separator 8848
 
15.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4418
 
9.5%
4354
 
9.3%
3780
 
8.1%
2745
 
5.9%
2178
 
4.7%
2080
 
4.4%
1947
 
4.2%
1928
 
4.1%
1641
 
3.5%
1568
 
3.4%
Other values (97) 20111
43.0%
Space Separator
ValueCountFrequency (%)
8848
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46750
84.1%
Common 8848
 
15.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4418
 
9.5%
4354
 
9.3%
3780
 
8.1%
2745
 
5.9%
2178
 
4.7%
2080
 
4.4%
1947
 
4.2%
1928
 
4.1%
1641
 
3.5%
1568
 
3.4%
Other values (97) 20111
43.0%
Common
ValueCountFrequency (%)
8848
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46750
84.1%
ASCII 8848
 
15.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8848
100.0%
Hangul
ValueCountFrequency (%)
4418
 
9.5%
4354
 
9.3%
3780
 
8.1%
2745
 
5.9%
2178
 
4.7%
2080
 
4.4%
1947
 
4.2%
1928
 
4.1%
1641
 
3.5%
1568
 
3.4%
Other values (97) 20111
43.0%

비고
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경매
8871 
정가수의
1129 

Length

Max length4
Median length2
Mean length2.2258
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경매
2nd row경매
3rd row경매
4th row경매
5th row경매

Common Values

ValueCountFrequency (%)
경매 8871
88.7%
정가수의 1129
 
11.3%

Length

2024-04-29T22:44:49.550334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:44:49.644651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경매 8871
88.7%
정가수의 1129
 
11.3%

Interactions

2024-04-29T22:44:46.116019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:44:45.950176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:44:46.203541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:44:46.034621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-29T22:44:49.701239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
경매일자법인명거래물량(kg)경락단가(원)비고
경매일자1.0000.2220.0590.0630.057
법인명0.2221.0000.0560.0500.188
거래물량(kg)0.0590.0561.0000.7370.072
경락단가(원)0.0630.0500.7371.0000.000
비고0.0570.1880.0720.0001.000
2024-04-29T22:44:49.790002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인명경매일자비고
법인명1.0000.1530.309
경매일자0.1531.0000.061
비고0.3090.0611.000
2024-04-29T22:44:49.862873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
거래물량(kg)경락단가(원)경매일자법인명비고
거래물량(kg)1.0000.7460.0300.0330.055
경락단가(원)0.7461.0000.0340.0320.000
경매일자0.0300.0341.0000.1530.061
법인명0.0330.0320.1531.0000.309
비고0.0550.0000.0610.3091.000

Missing values

2024-04-29T22:44:46.519421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-29T22:44:46.618348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

경매일자법인명품목품종거래물량(kg)경락단가(원)산지비고
249222023-10-03인천원예농협메론네트계25425000경남 진주시경매
531812023-10-05인천원예농협호박애호박642000경기 연천군경매
289092023-10-04(주)부평농산풋고추오이고추6234000강원 양구군경매
210882023-10-03(주)경인농산꽈리고추꽈리고추(일반)12132000강원 평창군경매
212612023-10-03(주)경인농산쑥갓쑥갓(일반)15150000경기경매
156452023-10-03(주)부평농산기타1251875000강원 홍천군경매
906212023-10-09(주)부평농산바나나바나나(수입)274000국외정가수의
61652023-10-02(주)부평농산알타리무알타리무(일반)200240000경기 화성시경매
672262023-10-06(주)경인농산고구마호박고구마5140000충남 당진군경매
187642023-10-03(주)부평농산당근세척당근(수입)20140000국외경매
경매일자법인명품목품종거래물량(kg)경락단가(원)산지비고
51282023-10-02(주)부평농산무순무순(일반)3012000경기도 수원시 장안구경매
190292023-10-03(주)부평농산토마토토마토(일반)50950000강원 철원군경매
383122023-10-04인천원예농협가지가지(일반)318000경기 여주군경매
458992023-10-05(주)부평농산표고버섯표고버섯(수입)211000국외경매
774002023-10-07(주)부평농산단감송본456000경남 창원시경매
199122023-10-03(주)부평농산단감송본136000경남 창원시경매
399842023-10-05(주)부평농산표고버섯표고버섯(일반)896000경기 양주시경매
953692023-10-09(주)부평농산포도캠벨얼리40660000경기 화성시경매
574762023-10-06(주)부평농산쪽파깐쪽파30330000충남 아산시경매
95252023-10-02(주)경인농산포도샤인마스캇801040000경북 김천시경매

Duplicate rows

Most frequently occurring

경매일자법인명품목품종거래물량(kg)경락단가(원)산지비고# duplicates
10062023-10-06(주)부평농산느타리버섯느타리버섯(일반)535000경기 여주군경매14
3672023-10-03(주)부평농산양상추양상추(일반)5190000강원 횡성군경매13
14082023-10-07(주)부평농산팽이버섯팽이1호590000경북 청도군경매13
1062023-10-02(주)부평농산배추기타1401540000충북 제천시경매12
3952023-10-03(주)부평농산파인애플파인애플(수입)3120000국외정가수의12
8632023-10-05(주)부평농산파인애플파인애플(수입)142000국외정가수의12
11472023-10-06(주)부평농산팽이버섯팽이1호585000경북 청도군경매12
2172023-10-03(주)경인농산느타리버섯애느타리1080000충남 천안시경매11
15252023-10-09(주)부평농산고구마기타492000경기 여주군경매11
2742023-10-03(주)부평농산느타리버섯느타리버섯(일반)545000경기 양평군경매10