Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 2062 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 139.1 KiB |
Average record size in memory | 69.1 B |
Variable types
Text | 1 |
---|---|
Categorical | 3 |
Numeric | 4 |
Dataset
Description | 품목명,등급,수량,단위,최고가,최저가,평균가,조사일 |
---|---|
Author | 서울시농수산식품공사 |
URL | https://data.seoul.go.kr/dataList/OA-2664/S/1/datasetView.do |
조사일 has constant value "" | Constant |
최고가 is highly overall correlated with 최저가 and 1 other fields | High correlation |
최저가 is highly overall correlated with 최고가 and 1 other fields | High correlation |
평균가 is highly overall correlated with 최고가 and 1 other fields | High correlation |
단위 is highly imbalanced (66.2%) | Imbalance |
최고가 has 1289 (62.5%) zeros | Zeros |
최저가 has 1289 (62.5%) zeros | Zeros |
평균가 has 1289 (62.5%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-11 05:54:52.975868 |
---|---|
Analysis finished | 2024-05-11 05:54:58.536549 |
Duration | 5.56 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
품목명
Text
Distinct | 429 |
---|---|
Distinct (%) | 20.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.2 KiB |
Value | Count | Frequency (%) |
복숭아 | 292 | 8.8% |
수입 | 171 | 5.1% |
사과 | 140 | 4.2% |
국산 | 75 | 2.2% |
포도 | 60 | 1.8% |
딸기 | 56 | 1.7% |
만감 | 52 | 1.6% |
감귤 | 48 | 1.4% |
양파 | 44 | 1.3% |
자두 | 44 | 1.3% |
Other values (421) | 2352 |
Most occurring characters
Value | Count | Frequency (%) |
1272 | 10.9% | |
( | 488 | 4.2% |
) | 488 | 4.2% |
아 | 353 | 3.0% |
복 | 302 | 2.6% |
숭 | 298 | 2.6% |
백 | 282 | 2.4% |
수 | 261 | 2.2% |
도 | 195 | 1.7% |
사 | 192 | 1.6% |
Other values (313) | 7537 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 9390 | |
Space Separator | 1272 | 10.9% |
Open Punctuation | 488 | 4.2% |
Close Punctuation | 488 | 4.2% |
Uppercase Letter | 24 | 0.2% |
Decimal Number | 6 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 353 | 3.8% |
복 | 302 | 3.2% |
숭 | 298 | 3.2% |
백 | 282 | 3.0% |
수 | 261 | 2.8% |
도 | 195 | 2.1% |
사 | 192 | 2.0% |
감 | 189 | 2.0% |
입 | 173 | 1.8% |
자 | 170 | 1.8% |
Other values (306) | 6975 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 8 | |
M | 8 | |
B | 8 |
Space Separator
Value | Count | Frequency (%) |
1272 |
Open Punctuation
Value | Count | Frequency (%) |
( | 488 |
Close Punctuation
Value | Count | Frequency (%) |
) | 488 |
Decimal Number
Value | Count | Frequency (%) |
5 | 6 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 9390 | |
Common | 2254 | 19.3% |
Latin | 24 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 353 | 3.8% |
복 | 302 | 3.2% |
숭 | 298 | 3.2% |
백 | 282 | 3.0% |
수 | 261 | 2.8% |
도 | 195 | 2.1% |
사 | 192 | 2.0% |
감 | 189 | 2.0% |
입 | 173 | 1.8% |
자 | 170 | 1.8% |
Other values (306) | 6975 |
Common
Value | Count | Frequency (%) |
1272 | ||
( | 488 | 21.7% |
) | 488 | 21.7% |
5 | 6 | 0.3% |
Latin
Value | Count | Frequency (%) |
A | 8 | |
M | 8 | |
B | 8 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 9390 | |
ASCII | 2278 | 19.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1272 | ||
( | 488 | 21.4% |
) | 488 | 21.4% |
A | 8 | 0.4% |
M | 8 | 0.4% |
B | 8 | 0.4% |
5 | 6 | 0.3% |
Hangul
Value | Count | Frequency (%) |
아 | 353 | 3.8% |
복 | 302 | 3.2% |
숭 | 298 | 3.2% |
백 | 282 | 3.0% |
수 | 261 | 2.8% |
도 | 195 | 2.1% |
사 | 192 | 2.0% |
감 | 189 | 2.0% |
입 | 173 | 1.8% |
자 | 170 | 1.8% |
Other values (306) | 6975 |
등급
Categorical
Distinct | 6 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.2 KiB |
상 | |
---|---|
중 | |
하 | |
특 | |
대 | 18 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 상 |
---|---|
2nd row | 중 |
3rd row | 상 |
4th row | 상 |
5th row | 상 |
Common Values
Value | Count | Frequency (%) |
상 | 607 | |
중 | 566 | |
하 | 498 | |
특 | 360 | |
대 | 18 | 0.9% |
소 | 13 | 0.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
상 | 607 | |
중 | 566 | |
하 | 498 | |
특 | 360 | |
대 | 18 | 0.9% |
소 | 13 | 0.6% |
수량
Real number (ℝ)
Distinct | 42 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 43.075558 |
Minimum | 0.05 |
---|---|
Maximum | 10000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 18.3 KiB |
Quantile statistics
Minimum | 0.05 |
---|---|
5-th percentile | 1 |
Q1 | 3.125 |
median | 5 |
Q3 | 10 |
95-th percentile | 20 |
Maximum | 10000 |
Range | 9999.95 |
Interquartile range (IQR) | 6.875 |
Descriptive statistics
Standard deviation | 494.19904 |
---|---|
Coefficient of variation (CV) | 11.472841 |
Kurtosis | 337.44189 |
Mean | 43.075558 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 17.861002 |
Sum | 88821.8 |
Variance | 244232.69 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10.0 | 480 | |
5.0 | 276 | |
1.0 | 213 | |
4.0 | 206 | |
2.0 | 140 | 6.8% |
8.0 | 107 | 5.2% |
15.0 | 100 | 4.8% |
20.0 | 94 | 4.6% |
3.0 | 68 | 3.3% |
4.5 | 64 | 3.1% |
Other values (32) | 314 |
Value | Count | Frequency (%) |
0.05 | 3 | 0.1% |
0.2 | 3 | 0.1% |
0.25 | 4 | 0.2% |
0.5 | 3 | 0.1% |
0.75 | 3 | 0.1% |
1.0 | 213 | |
1.5 | 31 | 1.5% |
1.6 | 4 | 0.2% |
2.0 | 140 | |
2.5 | 44 | 2.1% |
Value | Count | Frequency (%) |
10000.0 | 4 | 0.2% |
5000.0 | 4 | 0.2% |
750.0 | 3 | 0.1% |
700.0 | 3 | 0.1% |
500.0 | 11 | |
400.0 | 5 | |
200.0 | 3 | 0.1% |
150.0 | 3 | 0.1% |
100.0 | 11 | |
50.0 | 12 |
단위
Categorical
IMBALANCE
 
Distinct | 16 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.2 KiB |
kg상자 | |
---|---|
kg | |
Kg그물망 | 45 |
개 | 30 |
g단 | 23 |
Other values (11) | 89 |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 3.5800194 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | kg상자 |
---|---|
2nd row | kg상자 |
3rd row | kg상자 |
4th row | kg상자 |
5th row | kg상자 |
Common Values
Value | Count | Frequency (%) |
kg상자 | 1537 | |
kg | 338 | 16.4% |
Kg그물망 | 45 | 2.2% |
개 | 30 | 1.5% |
g단 | 23 | 1.1% |
kg단 | 22 | 1.1% |
kg개 | 20 | 1.0% |
속 | 10 | 0.5% |
마리 | 8 | 0.4% |
kgPAN | 6 | 0.3% |
Other values (6) | 23 | 1.1% |
Length
Value | Count | Frequency (%) |
kg상자 | 1537 | |
kg | 338 | 16.4% |
kg그물망 | 45 | 2.2% |
개 | 30 | 1.5% |
g단 | 23 | 1.1% |
kg단 | 22 | 1.1% |
kg개 | 20 | 1.0% |
속 | 10 | 0.5% |
kgpp대 | 9 | 0.4% |
마리 | 8 | 0.4% |
Other values (5) | 20 | 1.0% |
최고가
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 293 |
---|---|
Distinct (%) | 14.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10561.476 |
Minimum | 0 |
---|---|
Maximum | 210000 |
Zeros | 1289 |
Zeros (%) | 62.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 18.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 14000 |
95-th percentile | 52000 |
Maximum | 210000 |
Range | 210000 |
Interquartile range (IQR) | 14000 |
Descriptive statistics
Standard deviation | 22678.624 |
---|---|
Coefficient of variation (CV) | 2.1472968 |
Kurtosis | 19.591209 |
Mean | 10561.476 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.8406181 |
Sum | 21777764 |
Variance | 5.1432 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1289 | |
20000 | 23 | 1.1% |
16000 | 17 | 0.8% |
12000 | 16 | 0.8% |
25000 | 15 | 0.7% |
30000 | 14 | 0.7% |
10000 | 13 | 0.6% |
14000 | 13 | 0.6% |
14500 | 12 | 0.6% |
15000 | 12 | 0.6% |
Other values (283) | 638 |
Value | Count | Frequency (%) |
0 | 1289 | |
292 | 1 | < 0.1% |
350 | 2 | 0.1% |
800 | 1 | < 0.1% |
900 | 1 | < 0.1% |
1000 | 1 | < 0.1% |
1050 | 2 | 0.1% |
1100 | 1 | < 0.1% |
1150 | 2 | 0.1% |
1200 | 1 | < 0.1% |
Value | Count | Frequency (%) |
210000 | 1 | < 0.1% |
190000 | 1 | < 0.1% |
180000 | 1 | < 0.1% |
178000 | 1 | < 0.1% |
175000 | 1 | < 0.1% |
173000 | 1 | < 0.1% |
170000 | 1 | < 0.1% |
160000 | 3 | |
147000 | 1 | < 0.1% |
145000 | 2 |
최저가
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 269 |
---|---|
Distinct (%) | 13.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7955.5572 |
Minimum | 0 |
---|---|
Maximum | 200000 |
Zeros | 1289 |
Zeros (%) | 62.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 18.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 8900 |
95-th percentile | 40000 |
Maximum | 200000 |
Range | 200000 |
Interquartile range (IQR) | 8900 |
Descriptive statistics
Standard deviation | 18575.057 |
---|---|
Coefficient of variation (CV) | 2.3348531 |
Kurtosis | 25.157777 |
Mean | 7955.5572 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.331056 |
Sum | 16404359 |
Variance | 3.4503275 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1289 | |
20000 | 22 | 1.1% |
10000 | 20 | 1.0% |
12000 | 19 | 0.9% |
16000 | 15 | 0.7% |
4000 | 15 | 0.7% |
3000 | 14 | 0.7% |
6000 | 13 | 0.6% |
25000 | 13 | 0.6% |
15000 | 12 | 0.6% |
Other values (259) | 630 |
Value | Count | Frequency (%) |
0 | 1289 | |
100 | 1 | < 0.1% |
200 | 1 | < 0.1% |
250 | 1 | < 0.1% |
292 | 1 | < 0.1% |
350 | 1 | < 0.1% |
400 | 1 | < 0.1% |
500 | 3 | 0.1% |
530 | 1 | < 0.1% |
533 | 1 | < 0.1% |
Value | Count | Frequency (%) |
200000 | 1 | < 0.1% |
170000 | 1 | < 0.1% |
160000 | 1 | < 0.1% |
150000 | 2 | |
145000 | 1 | < 0.1% |
140000 | 1 | < 0.1% |
126667 | 2 | |
125000 | 4 | |
121000 | 1 | < 0.1% |
120000 | 2 |
평균가
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 730 |
---|---|
Distinct (%) | 35.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9236.5213 |
Minimum | 0 |
---|---|
Maximum | 202500 |
Zeros | 1289 |
Zeros (%) | 62.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 18.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 11848 |
95-th percentile | 45864.1 |
Maximum | 202500 |
Range | 202500 |
Interquartile range (IQR) | 11848 |
Descriptive statistics
Standard deviation | 20223.823 |
---|---|
Coefficient of variation (CV) | 2.1895498 |
Kurtosis | 21.68513 |
Mean | 9236.5213 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.0118876 |
Sum | 19045707 |
Variance | 4.0900302 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1289 | |
20000 | 5 | 0.2% |
45000 | 4 | 0.2% |
30000 | 4 | 0.2% |
3500 | 3 | 0.1% |
16000 | 2 | 0.1% |
13860 | 2 | 0.1% |
10839 | 2 | 0.1% |
21557 | 2 | 0.1% |
20559 | 2 | 0.1% |
Other values (720) | 747 |
Value | Count | Frequency (%) |
0 | 1289 | |
264 | 1 | < 0.1% |
347 | 1 | < 0.1% |
350 | 1 | < 0.1% |
715 | 1 | < 0.1% |
742 | 1 | < 0.1% |
846 | 1 | < 0.1% |
852 | 1 | < 0.1% |
908 | 1 | < 0.1% |
932 | 1 | < 0.1% |
Value | Count | Frequency (%) |
202500 | 1 | |
173333 | 1 | |
169190 | 1 | |
165429 | 1 | |
160833 | 1 | |
155295 | 1 | |
152305 | 1 | |
143334 | 1 | |
140940 | 1 | |
136423 | 1 |
조사일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 16.2 KiB |
20240511 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20240511 |
---|---|
2nd row | 20240511 |
3rd row | 20240511 |
4th row | 20240511 |
5th row | 20240511 |
Common Values
Value | Count | Frequency (%) |
20240511 | 2062 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20240511 | 2062 |
등급 | 수량 | 단위 | 최고가 | 최저가 | 평균가 | |
---|---|---|---|---|---|---|
등급 | 1.000 | 0.000 | 0.266 | 0.075 | 0.119 | 0.055 |
수량 | 0.000 | 1.000 | 0.096 | 0.000 | 0.000 | 0.000 |
단위 | 0.266 | 0.096 | 1.000 | 0.000 | 0.000 | 0.000 |
최고가 | 0.075 | 0.000 | 0.000 | 1.000 | 0.960 | 0.988 |
최저가 | 0.119 | 0.000 | 0.000 | 0.960 | 1.000 | 0.983 |
평균가 | 0.055 | 0.000 | 0.000 | 0.988 | 0.983 | 1.000 |
단위 | 등급 | |
---|---|---|
단위 | 1.000 | 0.131 |
등급 | 0.131 | 1.000 |
수량 | 최고가 | 최저가 | 평균가 | 등급 | 단위 | |
---|---|---|---|---|---|---|
수량 | 1.000 | -0.010 | -0.018 | -0.009 | 0.000 | 0.052 |
최고가 | -0.010 | 1.000 | 0.994 | 0.999 | 0.039 | 0.000 |
최저가 | -0.018 | 0.994 | 1.000 | 0.997 | 0.062 | 0.000 |
평균가 | -0.009 | 0.999 | 0.997 | 1.000 | 0.029 | 0.000 |
등급 | 0.000 | 0.039 | 0.062 | 0.029 | 1.000 | 0.131 |
단위 | 0.052 | 0.000 | 0.000 | 0.000 | 0.131 | 1.000 |
품목명 | 등급 | 수량 | 단위 | 최고가 | 최저가 | 평균가 | 조사일 | |
---|---|---|---|---|---|---|---|---|
0 | (냉)갈치 | 상 | 10.0 | kg상자 | 0 | 0 | 0 | 20240511 |
1 | (냉)갈치 | 중 | 10.0 | kg상자 | 0 | 0 | 0 | 20240511 |
2 | (냉)고등어 | 상 | 10.0 | kg상자 | 45000 | 18300 | 33024 | 20240511 |
3 | (냉)고등어 | 상 | 20.0 | kg상자 | 0 | 0 | 0 | 20240511 |
4 | (냉)고등어 수입 | 상 | 10.0 | kg상자 | 55000 | 26667 | 36025 | 20240511 |
5 | (선)갈치 | 상 | 3.0 | kg상자 | 0 | 0 | 0 | 20240511 |
6 | (선)갈치 | 중 | 3.0 | kg상자 | 0 | 0 | 0 | 20240511 |
7 | (선)갈치 | 하 | 3.0 | kg상자 | 0 | 0 | 0 | 20240511 |
8 | (선)갈치 | 상 | 5.0 | kg상자 | 210000 | 200000 | 202500 | 20240511 |
9 | (선)갈치 | 중 | 5.0 | kg상자 | 175000 | 170000 | 173333 | 20240511 |
품목명 | 등급 | 수량 | 단위 | 최고가 | 최저가 | 평균가 | 조사일 | |
---|---|---|---|---|---|---|---|---|
2052 | 황색멜론 | 중 | 5.0 | kg상자 | 0 | 0 | 0 | 20240511 |
2053 | 황색멜론 | 하 | 5.0 | kg상자 | 0 | 0 | 0 | 20240511 |
2054 | 황색멜론 | 특 | 8.0 | kg상자 | 0 | 0 | 0 | 20240511 |
2055 | 황색멜론 | 상 | 8.0 | kg상자 | 0 | 0 | 0 | 20240511 |
2056 | 황색멜론 | 중 | 8.0 | kg상자 | 0 | 0 | 0 | 20240511 |
2057 | 황색멜론 | 하 | 8.0 | kg상자 | 0 | 0 | 0 | 20240511 |
2058 | 황색멜론 | 특 | 10.0 | kg상자 | 0 | 0 | 0 | 20240511 |
2059 | 황색멜론 | 상 | 10.0 | kg상자 | 0 | 0 | 0 | 20240511 |
2060 | 황색멜론 | 중 | 10.0 | kg상자 | 0 | 0 | 0 | 20240511 |
2061 | 황색멜론 | 하 | 10.0 | kg상자 | 0 | 0 | 0 | 20240511 |