Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 735 |
Duplicate rows (%) | 7.3% |
Total size in memory | 488.3 KiB |
Average record size in memory | 50.0 B |
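The two percentage and per-record figures in this table can be recomputed from the raw counts. The following is a minimal arithmetic check, assuming 1 KiB = 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table.
n_rows = 10_000
duplicate_rows = 735
total_kib = 488.3

# Share of duplicate rows, as a fraction of all observations.
duplicate_share = duplicate_rows / n_rows

# Average record size: total memory converted to bytes, divided by row count.
avg_record_bytes = total_kib * 1024 / n_rows

print(f"duplicate share: {duplicate_share:.2%}")
print(f"average record size: {avg_record_bytes:.1f} B")
```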
Variable types
Text | 1 |
---|---|
Categorical | 2 |
Numeric | 2 |
Dataset
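Description | Monthly auction price data for the Namchon Agricultural Products Wholesale Market in Incheon Metropolitan City, including item (품목), grade (등급), unit quantity (단량), unit (단위), and average price (평균가). |
---|---|
Author | Incheon Metropolitan City (인천광역시) |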
URL | https://www.data.go.kr/data/15051664/fileData.do |
Dataset has 735 (7.3%) duplicate rows | Duplicates |
단량 is highly overall correlated with 평균가 | High correlation |
평균가 is highly overall correlated with 단량 | High correlation |
단위 is highly imbalanced (99.9%) | Imbalance |
Reproduction
Analysis started | 2024-04-21 01:18:50.272605 |
---|---|
Analysis finished | 2024-04-21 01:18:52.401096 |
Duration | 2.13 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
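A report like this one is typically produced with a few lines of Python. The sketch below is an assumption about how the run could be repeated, not the configuration actually used: the file paths, the `cp949` encoding, and the report title are hypothetical, while `ProfileReport` and `to_file` are the ydata-profiling entry points.

```python
def build_report(csv_path, html_path, encoding="cp949"):
    """Profile a CSV file and write the HTML report (paths are hypothetical)."""
    # Imported lazily so this module loads even where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is illustrative and does not come from the original run.
    profile = ProfileReport(df, title="Namchon wholesale market auction prices")
    profile.to_file(html_path)
    return profile
```

Calling `build_report("prices.csv", "report.html")` would need both pandas and ydata-profiling installed, and timings and memory figures will differ from those listed above.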
품목
Text
Distinct | 332 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 22 |
---|---|
Median length | 18 |
Mean length | 9.2846 |
Min length | 5 |
Characters and Unicode
Total characters | 92846 |
---|---|
Distinct characters | 281 |
Distinct categories | 6 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 44 |
---|---|
Unique (%) | 0.4% |
Sample
1st row | 전분 및 사료제조(청포묵) |
---|---|
2nd row | 달래(달래(일반)) |
3rd row | 양파(양파(일반)) |
4th row | 파프리카(파프리카(일반)) |
5th row | 마늘(풋마늘) |
Value | Count | Frequency (%) |
딸기(설향 | 507 | 4.9% |
딸기(기타 | 307 | 3.0% |
표고버섯(생표고 | 236 | 2.3% |
시금치(시금치(일반 | 211 | 2.0% |
새송이(새송이(일반 | 165 | 1.6% |
딸기(금실 | 146 | 1.4% |
냉이(일반냉이 | 143 | 1.4% |
곡물제조(두부 | 137 | 1.3% |
수박(수박(일반)(꼭지절단 | 133 | 1.3% |
오이(백다다기 | 126 | 1.2% |
Other values (326) | 8223 |
Most occurring characters
Value | Count | Frequency (%) |
( | 14099 | 15.2% |
) | 14099 | 15.2% |
일 | 3472 | 3.7% |
반 | 3466 | 3.7% |
기 | 3250 | 3.5% |
추 | 2148 | 2.3% |
타 | 2043 | 2.2% |
고 | 1976 | 2.1% |
리 | 1555 | 1.7% |
이 | 1409 | 1.5% |
Other values (271) | 45329 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 64151 | 69.1% |
Open Punctuation | 14099 | 15.2% |
Close Punctuation | 14099 | 15.2% |
Space Separator | 334 | 0.4% |
Other Punctuation | 142 | 0.2% |
Decimal Number | 21 | < 0.1% |
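The category names in this table correspond to Unicode general categories (for example, "Other Letter" is `Lo` and "Open Punctuation" is `Ps`). As a rough illustration, the sketch below counts categories for the first sample row using Python's standard `unicodedata` module; the helper name and the name mapping are written for this example and are not taken from the profiling tool.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general category codes mapped to the names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Nd": "Decimal Number",
}

def category_counts(text):
    """Count characters per Unicode general category, using readable names."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)

sample = "전분 및 사료제조(청포묵)"  # first sample row of 품목
counts = category_counts(sample)
for name, count in counts.most_common():
    print(name, count)
```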
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
일 | 3472 | 5.4% |
반 | 3466 | 5.4% |
기 | 3250 | 5.1% |
추 | 2148 | 3.3% |
타 | 2043 | 3.2% |
고 | 1976 | 3.1% |
리 | 1555 | 2.4% |
이 | 1409 | 2.2% |
마 | 1294 | 2.0% |
딸 | 1203 | 1.9% |
Other values (266) | 42335 |
Open Punctuation
Value | Count | Frequency (%) |
( | 14099 |
Close Punctuation
Value | Count | Frequency (%) |
) | 14099 |
Space Separator
Value | Count | Frequency (%) |
(space) | 334 |
Other Punctuation
Value | Count | Frequency (%) |
, | 142 |
Decimal Number
Value | Count | Frequency (%) |
1 | 21 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 64151 | 69.1% |
Common | 28695 | 30.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
일 | 3472 | 5.4% |
반 | 3466 | 5.4% |
기 | 3250 | 5.1% |
추 | 2148 | 3.3% |
타 | 2043 | 3.2% |
고 | 1976 | 3.1% |
리 | 1555 | 2.4% |
이 | 1409 | 2.2% |
마 | 1294 | 2.0% |
딸 | 1203 | 1.9% |
Other values (266) | 42335 |
Common
Value | Count | Frequency (%) |
( | 14099 | 49.1% |
) | 14099 | 49.1% |
(space) | 334 | 1.2% |
, | 142 | 0.5% |
1 | 21 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 64151 | 69.1% |
ASCII | 28695 | 30.9% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
( | 14099 | 49.1% |
) | 14099 | 49.1% |
(space) | 334 | 1.2% |
, | 142 | 0.5% |
1 | 21 | 0.1% |
Hangul
Value | Count | Frequency (%) |
일 | 3472 | 5.4% |
반 | 3466 | 5.4% |
기 | 3250 | 5.1% |
추 | 2148 | 3.3% |
타 | 2043 | 3.2% |
고 | 1976 | 3.1% |
리 | 1555 | 2.4% |
이 | 1409 | 2.2% |
마 | 1294 | 2.0% |
딸 | 1203 | 1.9% |
Other values (266) | 42335 |
등급
Categorical
Distinct | 10 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 17 |
---|---|
Median length | 16 |
Mean length | 16.046 |
Min length | 16 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 특(1등 |
---|---|
2nd row | 상(2등 |
3rd row | 상(2등 |
4th row | 상(2등 |
5th row | 특(1등 |
Common Values
Value | Count | Frequency (%) |
특(1등 | 6100 | 61.0% |
상(2등 | 2405 | 24.1% |
보통(3 | 671 | 6.7% |
4등 | 300 | 3.0% |
9등(등 | 197 | 2.0% |
없음 | 167 | 1.7% |
5등 | 73 | 0.7% |
6등 | 46 | 0.5% |
7등 | 21 | 0.2% |
8등 | 20 | 0.2% |
단량
Real number (ℝ)
HIGH CORRELATION
Distinct | 78 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.598883 |
Minimum | 0.01 |
---|---|
Maximum | 102 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0.01 |
---|---|
5-th percentile | 0.5 |
Q1 | 2 |
median | 4 |
Q3 | 10 |
95-th percentile | 17 |
Maximum | 102 |
Range | 101.99 |
Interquartile range (IQR) | 8 |
Descriptive statistics
Standard deviation | 5.199428 |
---|---|
Coefficient of variation (CV) | 0.92865453 |
Kurtosis | 15.017707 |
Mean | 5.598883 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 2.0717996 |
Sum | 55988.83 |
Variance | 27.034052 |
Monotonicity | Not monotonic |
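Several of the descriptive statistics above are derived from others in the same tables. The short check below recomputes them from the published (rounded) values, so small differences in the last digits are expected; the variable names are illustrative.

```python
# Values copied from the 단량 tables above.
mean, std = 5.598883, 5.199428
q1, q3 = 2, 10
minimum, maximum = 0.01, 102

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"CV={cv:.6f} variance={variance:.4f} IQR={iqr} range={value_range:.2f}")
```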
Value | Count | Frequency (%) |
4.0 | 1836 | 18.4% |
10.0 | 1597 | 16.0% |
2.0 | 1231 | 12.3% |
1.0 | 969 | 9.7% |
0.5 | 831 | 8.3% |
8.0 | 631 | 6.3% |
5.0 | 502 | 5.0% |
20.0 | 323 | 3.2% |
3.0 | 260 | 2.6% |
9.0 | 167 | 1.7% |
Other values (68) | 1653 |
Value | Count | Frequency (%) |
0.01 | 22 | 0.2% |
0.05 | 40 | 0.4% |
0.1 | 14 | 0.1% |
0.12 | 1 | < 0.1% |
0.15 | 22 | 0.2% |
0.16 | 15 | 0.1% |
0.2 | 87 | 0.9% |
0.25 | 32 | 0.3% |
0.3 | 54 | 0.5% |
0.35 | 8 | 0.1% |
Value | Count | Frequency (%) |
102.0 | 1 | < 0.1% |
51.0 | 1 | < 0.1% |
50.0 | 1 | < 0.1% |
40.0 | 6 | 0.1% |
30.0 | 17 | 0.2% |
25.0 | 4 | < 0.1% |
22.0 | 1 | < 0.1% |
20.0 | 323 | 3.2% |
19.0 | 6 | 0.1% |
18.0 | 86 | 0.9% |
단위
Categorical
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 1.9999 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | kg |
---|---|
2nd row | kg |
3rd row | kg |
4th row | kg |
5th row | kg |
Common Values
Value | Count | Frequency (%) |
kg | 9999 | > 99.9% |
g | 1 | < 0.1% |
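The 99.9% imbalance alert for 단위 can be reproduced from these two counts. The sketch below assumes the score is one minus the Shannon entropy normalized by its maximum for the number of classes. The report does not spell out this definition, but the assumption yields a matching value here. The function name is illustrative.

```python
import math

def imbalance_score(counts):
    """1 - normalized Shannon entropy: 0 for a uniform split, near 1 when one class dominates."""
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(n_classes)

score = imbalance_score([9999, 1])  # kg vs g
print(f"{score:.1%}")
```

A perfectly balanced column such as `[5000, 5000]` scores 0 under the same definition.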
평균가
Real number (ℝ)
HIGH CORRELATION
Distinct | 4802 |
---|---|
Distinct (%) | 48.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 23101.864 |
Minimum | 200 |
---|---|
Maximum | 1062500 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 200 |
---|---|
5-th percentile | 2000 |
Q1 | 5946.75 |
median | 14245 |
Q3 | 29200 |
95-th percentile | 70277.15 |
Maximum | 1062500 |
Range | 1062300 |
Interquartile range (IQR) | 23253.25 |
Descriptive statistics
Standard deviation | 31298.011 |
---|---|
Coefficient of variation (CV) | 1.3547829 |
Kurtosis | 183.29076 |
Mean | 23101.864 |
Median Absolute Deviation (MAD) | 9755 |
Skewness | 8.2386325 |
Sum | 2.3101864 × 10⁸ |
Variance | 9.7956548 × 10⁸ |
Monotonicity | Not monotonic |
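The mean (23101.864) sits well above the median (14245), and the skewness and kurtosis are large, which points to a long right tail. One common way to flag the tail is Tukey's 1.5 × IQR rule. The sketch below applies it to the quartiles above; the profiling tool itself does not report this rule, so treat it as an illustrative add-on.

```python
# Quartiles copied from the 평균가 quantile table.
q1, q3 = 5946.75, 29200
iqr = q3 - q1

# Tukey fences: values outside [lower, upper] are conventionally treated as outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# Largest values listed in the report's extreme-value table.
largest = [1062500, 833625, 531200, 274500, 272000,
           260000, 256000, 255000, 246500, 242000]
flagged = [v for v in largest if v > upper_fence]

print(f"upper fence: {upper_fence}")
print(f"flagged {len(flagged)} of {len(largest)} listed maxima")
```

Since the 95th percentile (70277.15) also exceeds the upper fence, more than 5% of rows would be flagged under this rule.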
Value | Count | Frequency (%) |
4000 | 150 | 1.5% |
10000 | 148 | 1.5% |
2000 | 133 | 1.3% |
8000 | 130 | 1.3% |
15000 | 129 | 1.3% |
3000 | 126 | 1.3% |
13000 | 119 | 1.2% |
20000 | 105 | 1.1% |
7000 | 104 | 1.0% |
5000 | 103 | 1.0% |
Other values (4792) | 8753 |
Value | Count | Frequency (%) |
200 | 1 | < 0.1% |
300 | 8 | 0.1% |
334 | 1 | < 0.1% |
335 | 1 | < 0.1% |
337 | 1 | < 0.1% |
338 | 1 | < 0.1% |
342 | 2 | < 0.1% |
343 | 1 | < 0.1% |
345 | 1 | < 0.1% |
347 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1062500 | 1 | < 0.1% |
833625 | 1 | < 0.1% |
531200 | 1 | < 0.1% |
274500 | 1 | < 0.1% |
272000 | 1 | < 0.1% |
260000 | 1 | < 0.1% |
256000 | 1 | < 0.1% |
255000 | 2 | < 0.1% |
246500 | 1 | < 0.1% |
242000 | 1 | < 0.1% |
Variable | 등급 | 단량 | 단위 | 평균가 |
---|---|---|---|---|
등급 | 1.000 | 0.051 | 0.092 | 0.091 |
단량 | 0.051 | 1.000 | 0.000 | 0.902 |
단위 | 0.092 | 0.000 | 1.000 | 0.000 |
평균가 | 0.091 | 0.902 | 0.000 | 1.000 |
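The matrix above reports 0.902 between 단량 and 평균가 but does not say which coefficient was used. A common choice for two numeric columns is Spearman's rank correlation, and the plain-Python sketch below shows how that coefficient is computed. It runs on only the ten sample rows listed in this report, so its result is not expected to match a figure computed on all 10,000 rows. The function names are illustrative.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))

# 단량 and 평균가 from the ten sample rows in this report.
unit_qty = [4.0, 4.0, 15.0, 5.0, 20.0, 10.0, 10.0, 7.5, 4.0, 4.0]
avg_price = [7778, 28000, 23500, 35909, 52500, 22500, 62000, 11000, 12500, 16338]
rho = spearman(unit_qty, avg_price)
print(f"Spearman rho on 10 sample rows: {rho:.3f}")
```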
Variable | 등급 | 단위 |
---|---|---|
등급 | 1.000 | 0.071 |
단위 | 0.071 | 1.000 |
Variable | 단량 | 평균가 | 등급 | 단위 |
---|---|---|---|---|
단량 | 1.000 | 0.749 | 0.027 | 0.000 |
평균가 | 0.749 | 1.000 | 0.048 | 0.000 |
등급 | 0.027 | 0.048 | 1.000 | 0.071 |
단위 | 0.000 | 0.000 | 0.071 | 1.000 |
Index | 품목 | 등급 | 단량 | 단위 | 평균가 |
---|---|---|---|---|---|
10291 | 전분 및 사료제조(청포묵) | 특(1등 | 4.0 | kg | 7778 |
2441 | 달래(달래(일반)) | 상(2등 | 4.0 | kg | 28000 |
8696 | 양파(양파(일반)) | 상(2등 | 15.0 | kg | 23500 |
11914 | 파프리카(파프리카(일반)) | 상(2등 | 5.0 | kg | 35909 |
5002 | 마늘(풋마늘) | 특(1등 | 20.0 | kg | 52500 |
6882 | 사과(후지) | 상(2등 | 10.0 | kg | 22500 |
4761 | 마늘(깐마늘 남도) | 보통(3 | 10.0 | kg | 62000 |
5947 | 미역(줄기미역) | 특(1등 | 7.5 | kg | 11000 |
11327 | 콩(기타) | 특(1등 | 4.0 | kg | 12500 |
10682 | 참나물(참나물(일반)) | 특(1등 | 4.0 | kg | 16338 |
Index | 품목 | 등급 | 단량 | 단위 | 평균가 |
---|---|---|---|---|---|
694 | 고구마(밤고구마) | 보통(3 | 10.0 | kg | 26317 |
13427 | 호박(쥬키니호박) | 특(1등 | 10.0 | kg | 22948 |
4069 | 딸기(설향) | 4등 | 1.0 | kg | 3168 |
6080 | 방울양배추(스프로스)(방울양배추(일반)) | 특(1등 | 0.5 | kg | 1700 |
11749 | 파인애플(파인애플(수입)) | 특(1등 | 11.5 | kg | 27000 |
2563 | 당근(기타) | 특(1등 | 10.0 | kg | 7167 |
1797 | 깻잎(깻잎(일반)) | 상(2등 | 2.0 | kg | 18511 |
5311 | 머위대(머위잎) | 특(1등 | 3.0 | kg | 26000 |
12701 | 풋고추(롱그린) | 상(2등 | 12.0 | kg | 72000 |
3039 | 딸기(금실) | 상(2등 | 0.5 | kg | 5226 |
Most frequently occurring
Index | 품목 | 등급 | 단량 | 단위 | 평균가 | # duplicates |
---|---|---|---|---|---|---|
51 | 곡물제조(순두부) | 특(1등 | 16.0 | kg | 17800 | 22 |
324 | 미역(줄기미역) | 특(1등 | 5.5 | kg | 8000 | 22 |
39 | 곡물제조(두부) | 특(1등 | 0.5 | kg | 1230 | 21 |
43 | 곡물제조(두부) | 특(1등 | 3.0 | kg | 5300 | 20 |
640 | 콩나물(콩나물(일반)) | 특(1등 | 5.0 | kg | 7500 | 20 |
83 | 꼬시래기(꼬시래기(일반)) | 특(1등 | 8.0 | kg | 13500 | 19 |
53 | 곡물제조(연두부) | 특(1등 | 12.0 | kg | 17800 | 17 |
47 | 곡물제조(두부) | 특(1등 | 7.0 | kg | 7500 | 16 |
302 | 무순(무순(일반)) | 특(1등 | 0.15 | kg | 800 | 16 |
328 | 미역(줄기미역) | 특(1등 | 7.5 | kg | 11000 | 16 |
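A duplicates table like the one above can be built by counting identical rows. The sketch below uses only the standard library and a small toy list. The row values are borrowed from this report, but the repetition counts are made up for illustration. It returns both the number of distinct rows that repeat and the number of surplus copies, since the report does not state which of the two the "Duplicate rows" figure of 735 represents.

```python
from collections import Counter

def duplicate_summary(rows):
    """Return (distinct rows that repeat, surplus copies beyond the first, most common row)."""
    counts = Counter(rows)
    repeated = {row: n for row, n in counts.items() if n > 1}
    surplus = sum(n - 1 for n in repeated.values())
    return len(repeated), surplus, counts.most_common(1)[0]

# Toy data: rows are tuples of (품목, 등급, 단량, 단위, 평균가); counts are hypothetical.
rows = [
    ("곡물제조(순두부)", "특(1등", 16.0, "kg", 17800),
    ("곡물제조(순두부)", "특(1등", 16.0, "kg", 17800),
    ("곡물제조(순두부)", "특(1등", 16.0, "kg", 17800),
    ("미역(줄기미역)", "특(1등", 5.5, "kg", 8000),
    ("미역(줄기미역)", "특(1등", 5.5, "kg", 8000),
    ("사과(후지)", "상(2등", 10.0, "kg", 22500),
]
distinct_repeated, surplus, top = duplicate_summary(rows)
print(distinct_repeated, surplus, top[1])
```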