Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1698
Duplicate rows (%)17.0%
Total size in memory722.7 KiB
Average record size in memory74.0 B

Variable types

Categorical3
Text3
Numeric2

Dataset

Description삼산농산물도매시장의 실시간 농산물 경락가격을 제공함으로써 시장 투명성 확보에 기여<br/>삼산농산물도매시장 실시간 경락정보(경매일자,법인명,품목,품종,거래물량(kg),경락단가(원), 산지,비고)등의 데이터 입니다.<br/>
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=3044829&srcSe=7661IVAWM27C61E190

Alerts

Dataset has 1698 (17.0%) duplicate rowsDuplicates
거래물량(kg) is highly overall correlated with 경락단가(원)High correlation
경락단가(원) is highly overall correlated with 거래물량(kg)High correlation
비고 is highly imbalanced (51.5%)Imbalance

Reproduction

Analysis started2024-04-29 13:44:39.575274
Analysis finished2024-04-29 13:44:40.682512
Duration1.11 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

경매일자
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-10-05
1535 
2023-10-06
1531 
2023-10-07
1497 
2023-10-04
1435 
2023-10-09
1433 
Other values (2)
2569 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-10-06
2nd row2023-10-09
3rd row2023-10-05
4th row2023-10-05
5th row2023-10-02

Common Values

ValueCountFrequency (%)
2023-10-05 1535
15.3%
2023-10-06 1531
15.3%
2023-10-07 1497
15.0%
2023-10-04 1435
14.3%
2023-10-09 1433
14.3%
2023-10-03 1359
13.6%
2023-10-02 1210
12.1%

Length

2024-04-29T22:44:40.743681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:44:40.835433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-05 1535
15.3%
2023-10-06 1531
15.3%
2023-10-07 1497
15.0%
2023-10-04 1435
14.3%
2023-10-09 1433
14.3%
2023-10-03 1359
13.6%
2023-10-02 1210
12.1%

법인명
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
(주)부평농산
6068 
인천원예농협
1986 
(주)경인농산
1946 

Length

Max length7
Median length7
Mean length6.8014
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인천원예농협
2nd row(주)부평농산
3rd row(주)경인농산
4th row(주)부평농산
5th row(주)부평농산

Common Values

ValueCountFrequency (%)
(주)부평농산 6068
60.7%
인천원예농협 1986
 
19.9%
(주)경인농산 1946
 
19.5%

Length

2024-04-29T22:44:40.946727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:44:41.026421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주)부평농산 6068
60.7%
인천원예농협 1986
 
19.9%
주)경인농산 1946
 
19.5%

품목
Text

Distinct136
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-29T22:44:41.268479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length2.8508
Min length1

Characters and Unicode

Total characters28508
Distinct characters182
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)0.1%

Sample

1st row새싹
2nd row고구마
3rd row
4th row호박
5th row브로코리(녹색꽃양배추)
ValueCountFrequency (%)
포도 740
 
7.4%
풋고추 483
 
4.8%
떫은감 444
 
4.4%
깻잎 421
 
4.2%
단감 397
 
3.9%
고구마 381
 
3.8%
오이 348
 
3.5%
호박 346
 
3.4%
느타리버섯 255
 
2.5%
사과 244
 
2.4%
Other values (128) 6009
59.7%
2024-04-29T22:44:41.623688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2058
 
7.2%
1491
 
5.2%
1213
 
4.3%
957
 
3.4%
906
 
3.2%
906
 
3.2%
892
 
3.1%
759
 
2.7%
740
 
2.6%
733
 
2.6%
Other values (172) 17853
62.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27996
98.2%
Open Punctuation 212
 
0.7%
Close Punctuation 212
 
0.7%
Space Separator 68
 
0.2%
Other Punctuation 20
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2058
 
7.4%
1491
 
5.3%
1213
 
4.3%
957
 
3.4%
906
 
3.2%
906
 
3.2%
892
 
3.2%
759
 
2.7%
740
 
2.6%
733
 
2.6%
Other values (168) 17341
61.9%
Open Punctuation
ValueCountFrequency (%)
( 212
100.0%
Close Punctuation
ValueCountFrequency (%)
) 212
100.0%
Space Separator
ValueCountFrequency (%)
68
100.0%
Other Punctuation
ValueCountFrequency (%)
, 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27996
98.2%
Common 512
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2058
 
7.4%
1491
 
5.3%
1213
 
4.3%
957
 
3.4%
906
 
3.2%
906
 
3.2%
892
 
3.2%
759
 
2.7%
740
 
2.6%
733
 
2.6%
Other values (168) 17341
61.9%
Common
ValueCountFrequency (%)
( 212
41.4%
) 212
41.4%
68
 
13.3%
, 20
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27996
98.2%
ASCII 512
 
1.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2058
 
7.4%
1491
 
5.3%
1213
 
4.3%
957
 
3.4%
906
 
3.2%
906
 
3.2%
892
 
3.2%
759
 
2.7%
740
 
2.6%
733
 
2.6%
Other values (168) 17341
61.9%
ASCII
ValueCountFrequency (%)
( 212
41.4%
) 212
41.4%
68
 
13.3%
, 20
 
3.9%

품종
Text

Distinct241
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-29T22:44:41.865730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length4.7311
Min length2

Characters and Unicode

Total characters47311
Distinct characters247
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)0.3%

Sample

1st row새싹(일반)
2nd row기타
3rd row갓(일반)
4th row애호박
5th row브로코리(수입)
ValueCountFrequency (%)
기타 1073
 
10.7%
샤인마스캇 521
 
5.2%
약시 314
 
3.1%
백다다기 298
 
3.0%
청양 287
 
2.9%
송본 280
 
2.8%
깻잎(일반 276
 
2.8%
새송이버섯(일반 232
 
2.3%
바나나(수입 230
 
2.3%
애호박 225
 
2.2%
Other values (232) 6285
62.7%
2024-04-29T22:44:42.228804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 4024
 
8.5%
) 4024
 
8.5%
3380
 
7.1%
3265
 
6.9%
1564
 
3.3%
1395
 
2.9%
1182
 
2.5%
1042
 
2.2%
1020
 
2.2%
930
 
2.0%
Other values (237) 25485
53.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39055
82.5%
Open Punctuation 4024
 
8.5%
Close Punctuation 4024
 
8.5%
Decimal Number 186
 
0.4%
Space Separator 21
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3380
 
8.7%
3265
 
8.4%
1564
 
4.0%
1395
 
3.6%
1182
 
3.0%
1042
 
2.7%
1020
 
2.6%
930
 
2.4%
868
 
2.2%
868
 
2.2%
Other values (231) 23541
60.3%
Decimal Number
ValueCountFrequency (%)
1 183
98.4%
3 3
 
1.6%
Open Punctuation
ValueCountFrequency (%)
( 4024
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4024
100.0%
Space Separator
ValueCountFrequency (%)
21
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39055
82.5%
Common 8256
 
17.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3380
 
8.7%
3265
 
8.4%
1564
 
4.0%
1395
 
3.6%
1182
 
3.0%
1042
 
2.7%
1020
 
2.6%
930
 
2.4%
868
 
2.2%
868
 
2.2%
Other values (231) 23541
60.3%
Common
ValueCountFrequency (%)
( 4024
48.7%
) 4024
48.7%
1 183
 
2.2%
21
 
0.3%
3 3
 
< 0.1%
, 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39055
82.5%
ASCII 8256
 
17.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 4024
48.7%
) 4024
48.7%
1 183
 
2.2%
21
 
0.3%
3 3
 
< 0.1%
, 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
3380
 
8.7%
3265
 
8.4%
1564
 
4.0%
1395
 
3.6%
1182
 
3.0%
1042
 
2.7%
1020
 
2.6%
930
 
2.4%
868
 
2.2%
868
 
2.2%
Other values (231) 23541
60.3%

거래물량(kg)
Real number (ℝ)

HIGH CORRELATION 

Distinct193
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.6258
Minimum1
Maximum800
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-29T22:44:42.378731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median8
Q320
95-th percentile120
Maximum800
Range799
Interquartile range (IQR)17

Descriptive statistics

Standard deviation60.876747
Coefficient of variation (CV)2.2863819
Kurtosis35.322412
Mean26.6258
Median Absolute Deviation (MAD)6
Skewness5.3146567
Sum266258
Variance3705.9784
MonotonicityNot monotonic
2024-04-29T22:44:42.543463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1111
 
11.1%
5 1015
 
10.2%
10 929
 
9.3%
2 818
 
8.2%
3 624
 
6.2%
4 537
 
5.4%
6 365
 
3.6%
20 333
 
3.3%
8 296
 
3.0%
7 295
 
2.9%
Other values (183) 3677
36.8%
ValueCountFrequency (%)
1 1111
11.1%
2 818
8.2%
3 624
6.2%
4 537
5.4%
5 1015
10.2%
6 365
 
3.6%
7 295
 
2.9%
8 296
 
3.0%
9 192
 
1.9%
10 929
9.3%
ValueCountFrequency (%)
800 1
 
< 0.1%
690 1
 
< 0.1%
601 1
 
< 0.1%
600 2
 
< 0.1%
576 1
 
< 0.1%
560 1
 
< 0.1%
550 17
0.2%
540 1
 
< 0.1%
530 2
 
< 0.1%
522 1
 
< 0.1%

경락단가(원)
Real number (ℝ)

HIGH CORRELATION 

Distinct972
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean240386.88
Minimum700
Maximum9675000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-29T22:44:42.679821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum700
5-th percentile13000
Q148000
median120000
Q3276000
95-th percentile876200
Maximum9675000
Range9674300
Interquartile range (IQR)228000

Descriptive statistics

Standard deviation381118.84
Coefficient of variation (CV)1.5854394
Kurtosis70.035105
Mean240386.88
Median Absolute Deviation (MAD)86100
Skewness5.8018543
Sum2.4038688 × 109
Variance1.4525157 × 1011
MonotonicityNot monotonic
2024-04-29T22:44:42.819455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
60000 160
 
1.6%
120000 156
 
1.6%
30000 126
 
1.3%
180000 122
 
1.2%
100000 121
 
1.2%
140000 118
 
1.2%
24000 113
 
1.1%
40000 107
 
1.1%
36000 106
 
1.1%
80000 105
 
1.1%
Other values (962) 8766
87.7%
ValueCountFrequency (%)
700 1
 
< 0.1%
1000 1
 
< 0.1%
1500 2
 
< 0.1%
2000 3
 
< 0.1%
2500 1
 
< 0.1%
3000 9
0.1%
3100 1
 
< 0.1%
3800 1
 
< 0.1%
4000 20
0.2%
4200 1
 
< 0.1%
ValueCountFrequency (%)
9675000 1
< 0.1%
5910000 1
< 0.1%
5760000 1
< 0.1%
5400000 1
< 0.1%
4676000 1
< 0.1%
4514000 1
< 0.1%
4320000 1
< 0.1%
4200000 1
< 0.1%
4148000 1
< 0.1%
3975000 1
< 0.1%

산지
Text

Distinct130
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-29T22:44:43.107082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length6
Mean length5.5607
Min length2

Characters and Unicode

Total characters55607
Distinct characters103
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)0.1%

Sample

1st row전남 나주시
2nd row경기 여주군
3rd row경기 고양시
4th row경기 양주시
5th row국외
ValueCountFrequency (%)
경기 1924
 
10.2%
충남 1705
 
9.0%
경북 1595
 
8.5%
강원 1440
 
7.6%
국외 947
 
5.0%
논산시 797
 
4.2%
청도군 648
 
3.4%
인천 485
 
2.6%
경남 464
 
2.5%
평창군 441
 
2.3%
Other values (122) 8400
44.6%
2024-04-29T22:44:43.489811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8846
15.9%
4369
 
7.9%
4351
 
7.8%
3836
 
6.9%
2674
 
4.8%
2237
 
4.0%
2130
 
3.8%
1984
 
3.6%
1948
 
3.5%
1743
 
3.1%
Other values (93) 21489
38.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46761
84.1%
Space Separator 8846
 
15.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4369
 
9.3%
4351
 
9.3%
3836
 
8.2%
2674
 
5.7%
2237
 
4.8%
2130
 
4.6%
1984
 
4.2%
1948
 
4.2%
1743
 
3.7%
1550
 
3.3%
Other values (92) 19939
42.6%
Space Separator
ValueCountFrequency (%)
8846
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46761
84.1%
Common 8846
 
15.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4369
 
9.3%
4351
 
9.3%
3836
 
8.2%
2674
 
5.7%
2237
 
4.8%
2130
 
4.6%
1984
 
4.2%
1948
 
4.2%
1743
 
3.7%
1550
 
3.3%
Other values (92) 19939
42.6%
Common
ValueCountFrequency (%)
8846
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46761
84.1%
ASCII 8846
 
15.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8846
100.0%
Hangul
ValueCountFrequency (%)
4369
 
9.3%
4351
 
9.3%
3836
 
8.2%
2674
 
5.7%
2237
 
4.8%
2130
 
4.6%
1984
 
4.2%
1948
 
4.2%
1743
 
3.7%
1550
 
3.3%
Other values (92) 19939
42.6%

비고
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경매
8950 
정가수의
1050 

Length

Max length4
Median length2
Mean length2.21
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정가수의
2nd row경매
3rd row경매
4th row경매
5th row경매

Common Values

ValueCountFrequency (%)
경매 8950
89.5%
정가수의 1050
 
10.5%

Length

2024-04-29T22:44:43.640805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-29T22:44:43.748270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경매 8950
89.5%
정가수의 1050
 
10.5%

Interactions

2024-04-29T22:44:40.308151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:44:40.111882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:44:40.396377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-29T22:44:40.217145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-29T22:44:43.803995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
경매일자법인명거래물량(kg)경락단가(원)비고
경매일자1.0000.1250.0360.0470.056
법인명0.1251.0000.0840.0350.160
거래물량(kg)0.0360.0841.0000.4850.085
경락단가(원)0.0470.0350.4851.0000.043
비고0.0560.1600.0850.0431.000
2024-04-29T22:44:43.890261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인명경매일자비고
법인명1.0000.0840.264
경매일자0.0841.0000.060
비고0.2640.0601.000
2024-04-29T22:44:43.965238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
거래물량(kg)경락단가(원)경매일자법인명비고
거래물량(kg)1.0000.7590.0180.0500.065
경락단가(원)0.7591.0000.0250.0220.033
경매일자0.0180.0251.0000.0840.060
법인명0.0500.0220.0841.0000.264
비고0.0650.0330.0600.2641.000

Missing values

2024-04-29T22:44:40.499227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-29T22:44:40.618881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

경매일자법인명품목품종거래물량(kg)경락단가(원)산지비고
702432023-10-06인천원예농협새싹새싹(일반)208000전남 나주시정가수의
887662023-10-09(주)부평농산고구마기타8168000경기 여주군경매
502042023-10-05(주)경인농산갓(일반)5080000경기 고양시경매
464362023-10-05(주)부평농산호박애호박354000경기 양주시경매
11102023-10-02(주)부평농산브로코리(녹색꽃양배추)브로코리(수입)250000국외경매
376862023-10-04인천원예농협감자수미13585000강원 평창군경매
329312023-10-04(주)부평농산조미가공미역줄기221000경기 광명시정가수의
477172023-10-05(주)부평농산대추기타330000충남 논산시경매
216792023-10-03(주)경인농산열무열무(일반)510000경기 동두천시경매
433102023-10-05(주)부평농산대추기타30510000충남 논산시경매
경매일자법인명품목품종거래물량(kg)경락단가(원)산지비고
352142023-10-04(주)경인농산부추일반부추60108000강원 평창군경매
192222023-10-03(주)부평농산감자기타8352000강원 평창군경매
879662023-10-09(주)부평농산상추꽃적상추13299000충남 논산시경매
755302023-10-07(주)부평농산참다래(키위)키위(수입)141000국외정가수의
142372023-10-03(주)부평농산상추꽃적상추11407000충남 논산시경매
484042023-10-05(주)부평농산포도샤인마스캇63504000경북 김천시경매
979842023-10-09(주)경인농산사과시나노골드155000충북 충주시경매
905882023-10-09(주)부평농산포도샤인마스캇12192000경북 경산시경매
647452023-10-06(주)경인농산치커리치커리(일반)36126000인천경매
696262023-10-06인천원예농협포도기타27432000경북 경산시경매

Duplicate rows

Most frequently occurring

경매일자법인명품목품종거래물량(kg)경락단가(원)산지비고# duplicates
3182023-10-03(주)부평농산바나나바나나(수입)10310000국외정가수의16
3812023-10-03(주)부평농산파인애플파인애플(수입)3120000국외정가수의16
3572023-10-03(주)부평농산양상추양상추(일반)5190000강원 횡성군경매12
5092023-10-04(주)부평농산느타리버섯느타리버섯(일반)535000경기 양평군경매11
1552023-10-02(주)부평농산팽이버섯팽이1호5120000경북 청도군경매10
8702023-10-05(주)부평농산표고버섯표고버섯(수입)211000국외경매10
15102023-10-09(주)부평농산느타리버섯느타리버섯(일반)547500경기 양평군경매10
15952023-10-09(주)부평농산브로코리(녹색꽃양배추)브로코리(수입)5140000국외경매10
472023-10-02(주)부평농산느타리버섯느타리버섯(일반)557500경기 양평군경매9
1222023-10-02(주)부평농산양상추양상추(일반)5190000강원 횡성군경매9