Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 632 |
Duplicate rows (%) | 6.3% |
Total size in memory | 488.3 KiB |
Average record size in memory | 50.0 B |
Variable types
Text | 1 |
---|---|
Categorical | 2 |
Numeric | 2 |
Dataset
Description | 인천광역시 남촌농산물도매시장 월간 경락가격에 대한 데이터로 품목, 등급, 단량, 단위, 평균가등을 볼 수 있습니다. |
---|---|
Author | 인천광역시 |
URL | https://data.incheon.go.kr/findData/publicDataDetail?dataId=15051664&srcSe=7661IVAWM27C61E190 |
단위 has constant value "" | Constant |
Dataset has 632 (6.3%) duplicate rows | Duplicates |
단량 is highly overall correlated with 평균가 | High correlation |
평균가 is highly overall correlated with 단량 | High correlation |
등급 is highly imbalanced (54.6%) | Imbalance |
Reproduction
Analysis started | 2024-01-28 15:45:30.252272 |
---|---|
Analysis finished | 2024-01-28 15:45:31.000099 |
Duration | 0.75 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
품목
Text
Distinct | 403 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 22 |
---|---|
Median length | 17 |
Mean length | 9.3083 |
Min length | 5 |
Characters and Unicode
Total characters | 93083 |
---|---|
Distinct characters | 297 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 59 ? |
---|---|
Unique (%) | 0.6% |
Sample
1st row | 로메인(통로메인) |
---|---|
2nd row | 고구마(호박고구마) |
3rd row | 표고버섯(표고버섯(일반)) |
4th row | 파세리(향미나리)(파세리(일반)) |
5th row | 양송이(기타) |
Value | Count | Frequency (%) |
표고버섯(생표고 | 284 | 2.8% |
오이(백다다기 | 210 | 2.0% |
수박(수박(일반)(꼭지절단 | 184 | 1.8% |
기타(엽경채류(기타 | 181 | 1.8% |
표고버섯(표고버섯(일반 | 156 | 1.5% |
시금치(시금치(일반 | 154 | 1.5% |
가지(가지(일반 | 141 | 1.4% |
새송이(새송이(일반 | 133 | 1.3% |
풋고추(청양 | 130 | 1.3% |
밤(밤(일반 | 113 | 1.1% |
Other values (397) | 8580 |
Most occurring characters
Value | Count | Frequency (%) |
( | 14132 | 15.2% |
) | 14132 | 15.2% |
반 | 3559 | 3.8% |
일 | 3516 | 3.8% |
고 | 3051 | 3.3% |
추 | 2880 | 3.1% |
기 | 2419 | 2.6% |
타 | 2129 | 2.3% |
마 | 1360 | 1.5% |
리 | 1307 | 1.4% |
Other values (287) | 44598 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 64407 | |
Open Punctuation | 14132 | 15.2% |
Close Punctuation | 14132 | 15.2% |
Space Separator | 266 | 0.3% |
Other Punctuation | 125 | 0.1% |
Decimal Number | 21 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
반 | 3559 | 5.5% |
일 | 3516 | 5.5% |
고 | 3051 | 4.7% |
추 | 2880 | 4.5% |
기 | 2419 | 3.8% |
타 | 2129 | 3.3% |
마 | 1360 | 2.1% |
리 | 1307 | 2.0% |
이 | 1258 | 2.0% |
박 | 1108 | 1.7% |
Other values (281) | 41820 |
Decimal Number
Value | Count | Frequency (%) |
1 | 15 | |
8 | 6 | 28.6% |
Open Punctuation
Value | Count | Frequency (%) |
( | 14132 |
Close Punctuation
Value | Count | Frequency (%) |
) | 14132 |
Space Separator
Value | Count | Frequency (%) |
266 |
Other Punctuation
Value | Count | Frequency (%) |
, | 125 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 64407 | |
Common | 28676 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
반 | 3559 | 5.5% |
일 | 3516 | 5.5% |
고 | 3051 | 4.7% |
추 | 2880 | 4.5% |
기 | 2419 | 3.8% |
타 | 2129 | 3.3% |
마 | 1360 | 2.1% |
리 | 1307 | 2.0% |
이 | 1258 | 2.0% |
박 | 1108 | 1.7% |
Other values (281) | 41820 |
Common
Value | Count | Frequency (%) |
( | 14132 | |
) | 14132 | |
266 | 0.9% | |
, | 125 | 0.4% |
1 | 15 | 0.1% |
8 | 6 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 64407 | |
ASCII | 28676 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
( | 14132 | |
) | 14132 | |
266 | 0.9% | |
, | 125 | 0.4% |
1 | 15 | 0.1% |
8 | 6 | < 0.1% |
Hangul
Value | Count | Frequency (%) |
반 | 3559 | 5.5% |
일 | 3516 | 5.5% |
고 | 3051 | 4.7% |
추 | 2880 | 4.5% |
기 | 2419 | 3.8% |
타 | 2129 | 3.3% |
마 | 1360 | 2.1% |
리 | 1307 | 2.0% |
이 | 1258 | 2.0% |
박 | 1108 | 1.7% |
Other values (281) | 41820 |
등급
Categorical
IMBALANCE
 
Distinct | 10 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
특(1등 | |
---|---|
상(2등 | |
보통(3 | 416 |
4등 | 201 |
9등(등 | 183 |
Other values (5) | 239 |
Length
Max length | 17 |
---|---|
Median length | 16 |
Mean length | 16.0359 |
Min length | 16 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 특(1등 |
---|---|
2nd row | 특(1등 |
3rd row | 특(1등 |
4th row | 특(1등 |
5th row | 특(1등 |
Common Values
Value | Count | Frequency (%) |
특(1등 | 6378 | |
상(2등 | 2583 | |
보통(3 | 416 | 4.2% |
4등 | 201 | 2.0% |
9등(등 | 183 | 1.8% |
없음 | 81 | 0.8% |
5등 | 49 | 0.5% |
8등 | 44 | 0.4% |
6등 | 40 | 0.4% |
7등 | 25 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
특(1등 | 6378 | |
상(2등 | 2583 | |
보통(3 | 416 | 4.2% |
4등 | 201 | 2.0% |
9등(등 | 183 | 1.8% |
없음 | 81 | 0.8% |
5등 | 49 | 0.5% |
8등 | 44 | 0.4% |
6등 | 40 | 0.4% |
7등 | 25 | 0.2% |
단량
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 90 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.506306 |
Minimum | 0.01 |
---|---|
Maximum | 102 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0.01 |
---|---|
5-th percentile | 0.5 |
Q1 | 3 |
median | 5 |
Q3 | 10 |
95-th percentile | 16 |
Maximum | 102 |
Range | 101.99 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 5.0374768 |
---|---|
Coefficient of variation (CV) | 0.7742453 |
Kurtosis | 32.747612 |
Mean | 6.506306 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 2.7626948 |
Sum | 65063.06 |
Variance | 25.376173 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10.0 | 2193 | |
4.0 | 2041 | |
2.0 | 1156 | |
5.0 | 779 | 7.8% |
8.0 | 743 | 7.4% |
1.0 | 464 | 4.6% |
15.0 | 317 | 3.2% |
3.0 | 314 | 3.1% |
20.0 | 278 | 2.8% |
0.5 | 248 | 2.5% |
Other values (80) | 1467 |
Value | Count | Frequency (%) |
0.01 | 17 | 0.2% |
0.02 | 1 | < 0.1% |
0.05 | 39 | |
0.06 | 11 | 0.1% |
0.1 | 20 | 0.2% |
0.12 | 6 | 0.1% |
0.15 | 3 | < 0.1% |
0.16 | 10 | 0.1% |
0.2 | 58 | |
0.25 | 4 | < 0.1% |
Value | Count | Frequency (%) |
102.0 | 1 | < 0.1% |
89.0 | 1 | < 0.1% |
85.0 | 2 | < 0.1% |
51.0 | 1 | < 0.1% |
40.0 | 3 | < 0.1% |
25.0 | 5 | 0.1% |
21.0 | 2 | < 0.1% |
20.0 | 278 | |
19.0 | 1 | < 0.1% |
18.0 | 89 | 0.9% |
단위
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
kg |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | kg |
---|---|
2nd row | kg |
3rd row | kg |
4th row | kg |
5th row | kg |
Common Values
Value | Count | Frequency (%) |
kg | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
kg | 10000 |
평균가
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 4512 |
---|---|
Distinct (%) | 45.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 18319.487 |
Minimum | 100 |
---|---|
Maximum | 1062500 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 100 |
---|---|
5-th percentile | 1500 |
Q1 | 6000 |
median | 12000 |
Q3 | 22228.75 |
95-th percentile | 55000 |
Maximum | 1062500 |
Range | 1062400 |
Interquartile range (IQR) | 16228.75 |
Descriptive statistics
Standard deviation | 25682.765 |
---|---|
Coefficient of variation (CV) | 1.4019369 |
Kurtosis | 340.52663 |
Mean | 18319.487 |
Median Absolute Deviation (MAD) | 7400 |
Skewness | 11.806945 |
Sum | 1.8319487 × 108 |
Variance | 6.5960441 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10000 | 227 | 2.3% |
4000 | 182 | 1.8% |
8000 | 173 | 1.7% |
15000 | 163 | 1.6% |
13000 | 154 | 1.5% |
3000 | 154 | 1.5% |
5000 | 145 | 1.5% |
7000 | 142 | 1.4% |
6000 | 140 | 1.4% |
12000 | 140 | 1.4% |
Other values (4502) | 8380 |
Value | Count | Frequency (%) |
100 | 1 | < 0.1% |
150 | 1 | < 0.1% |
200 | 5 | 0.1% |
250 | 1 | < 0.1% |
276 | 1 | < 0.1% |
300 | 26 | |
336 | 1 | < 0.1% |
350 | 10 | 0.1% |
400 | 14 | |
424 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1062500 | 1 | < 0.1% |
604200 | 1 | < 0.1% |
531200 | 1 | < 0.1% |
406200 | 3 | < 0.1% |
240000 | 5 | |
215000 | 1 | < 0.1% |
210000 | 1 | < 0.1% |
200000 | 4 | |
195000 | 9 | |
180000 | 2 | < 0.1% |
등급 | 단량 | 평균가 | |
---|---|---|---|
등급 | 1.000 | 0.034 | 0.000 |
단량 | 0.034 | 1.000 | 0.965 |
평균가 | 0.000 | 0.965 | 1.000 |
단량 | 평균가 | 등급 | |
---|---|---|---|
단량 | 1.000 | 0.582 | 0.017 |
평균가 | 0.582 | 1.000 | 0.000 |
등급 | 0.017 | 0.000 | 1.000 |
품목 | 등급 | 단량 | 단위 | 평균가 | |
---|---|---|---|---|---|
4952 | 로메인(통로메인) | 특(1등 | 2.0 | kg | 4000 |
1560 | 고구마(호박고구마) | 특(1등 | 10.0 | kg | 9760 |
14197 | 표고버섯(표고버섯(일반)) | 특(1등 | 16.0 | kg | 68000 |
12885 | 파세리(향미나리)(파세리(일반)) | 특(1등 | 4.0 | kg | 35000 |
9699 | 양송이(기타) | 특(1등 | 2.0 | kg | 12458 |
7084 | 브로코리(녹색꽃양배추)(브로코리(일반)) | 특(1등 | 8.0 | kg | 12301 |
8022 | 새송이(새송이(일반)) | 상(2등 | 2.0 | kg | 3988 |
10960 | 오이(취청) | 특(1등 | 10.0 | kg | 14628 |
13011 | 파프리카(빨강파프리카) | 특(1등 | 5.0 | kg | 18000 |
953 | 강낭콩(강낭콩(일반)) | 상(2등 | 5.0 | kg | 13750 |
품목 | 등급 | 단량 | 단위 | 평균가 | |
---|---|---|---|---|---|
5381 | 멜론(기타) | 상(2등 | 8.0 | kg | 10000 |
15245 | 풋고추(청초(일반)) | 특(1등 | 15.0 | kg | 20000 |
13398 | 포도(마스캇베리에이) | 상(2등 | 3.0 | kg | 10945 |
14850 | 풋고추(오이맛고추) | 상(2등 | 4.0 | kg | 8500 |
14737 | 풋고추(애기초) | 특(1등 | 6.0 | kg | 26000 |
15358 | 피망(단고추)(청피망) | 상(2등 | 10.0 | kg | 25111 |
4997 | 마(기타) | 특(1등 | 10.0 | kg | 50000 |
3374 | 느타리버섯(느타리버섯(일반)) | 상(2등 | 2.0 | kg | 7121 |
8434 | 속새(속새(일반)) | 특(1등 | 4.0 | kg | 20000 |
14560 | 풋고추(롱그린) | 특(1등 | 10.0 | kg | 59800 |
Most frequently occurring
품목 | 등급 | 단량 | 단위 | 평균가 | # duplicates | |
---|---|---|---|---|---|---|
218 | 미역(줄기미역) | 특(1등 | 7.5 | kg | 11000 | 18 |
69 | 곡물제조(두부) | 특(1등 | 7.0 | kg | 7500 | 17 |
183 | 무순(무순(일반)) | 특(1등 | 0.05 | kg | 300 | 17 |
352 | 어묵,어분,어비(기타) | 특(1등 | 3.0 | kg | 8250 | 17 |
72 | 곡물제조(순두부) | 특(1등 | 16.0 | kg | 17800 | 16 |
65 | 곡물제조(두부) | 특(1등 | 0.5 | kg | 1230 | 15 |
309 | 숙주나물(숙주나물(일반)) | 특(1등 | 3.5 | kg | 4500 | 15 |
41 | 고구마순(생고구마순) | 특(1등 | 2.0 | kg | 4000 | 14 |
101 | 꼬시래기(꼬시래기(일반)) | 특(1등 | 8.0 | kg | 10500 | 14 |
191 | 무청(건무청) | 특(1등 | 10.0 | kg | 20000 | 14 |