Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 7274 |
Missing cells | 2440 |
Missing cells (%) | 8.4% |
Duplicate rows | 1 |
Duplicate rows (%) | < 0.1% |
Total size in memory | 241.6 KiB |
Average record size in memory | 34.0 B |
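
The percentage and per-record figures in the table above follow from the raw counts. A minimal sketch of that arithmetic, using only numbers copied from the table (the variable names are illustrative, not part of the report):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 4
n_observations = 7274
missing_cells = 2440
total_size_kib = 241.6

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations

# Share of cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

# 1 KiB = 1024 bytes; divide the total size by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Rounded to one decimal place, both results agree with the reported 8.4% and 34.0 B.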
Variable types
DateTime | 1 |
---|---|
Text | 1 |
Numeric | 2 |
Dataset
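Description | Auction price information for agricultural products traded at the Incheon Namchon Agricultural Products Wholesale Market (인천광역시 남촌농산물도매시장), including transaction date (일자), item (품목), volume (물량), and amount (금액). |
---|---|
Author | Incheon Metropolitan City (인천광역시) |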
URL | https://www.data.go.kr/data/15051663/fileData.do |
Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
물량 is highly overall correlated with 금액 | High correlation |
금액 is highly overall correlated with 물량 | High correlation |
일자 has 610 (8.4%) missing values | Missing |
품목 has 610 (8.4%) missing values | Missing |
물량 has 610 (8.4%) missing values | Missing |
금액 has 610 (8.4%) missing values | Missing |
Reproduction
Analysis started | 2024-03-15 00:46:02.387421 |
---|---|
Analysis finished | 2024-03-15 00:46:04.776089 |
Duration | 2.39 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
일자
Date
MISSING
 
Distinct | 26 |
---|---|
Distinct (%) | 0.4% |
Missing | 610 |
Missing (%) | 8.4% |
Memory size | 57.0 KiB |
Minimum | 2023-12-01 00:00:00 |
---|---|
Maximum | 2023-12-30 00:00:00 |
품목
Text
MISSING
 
Distinct | 475 |
---|---|
Distinct (%) | 7.1% |
Missing | 610 |
Missing (%) | 8.4% |
Memory size | 57.0 KiB |
Value | Count | Frequency (%) |
및 | 195 | 2.7% |
전분 | 195 | 2.7% |
마늘(깐마늘 | 52 | 0.7% |
곡물제조(순두부(수입 | 26 | 0.4% |
새싹(기타 | 26 | 0.4% |
미역(줄기미역 | 26 | 0.4% |
양배추(양배추(일반 | 26 | 0.4% |
쑥갓(쑥갓(일반 | 26 | 0.4% |
시금치(시금치(일반 | 26 | 0.4% |
숙주나물(숙주나물(일반 | 26 | 0.4% |
Other values (469) | 6485 |
Most occurring characters
Value | Count | Frequency (%) |
) | 9684 | 15.0% |
( | 9684 | 15.0% |
일 | 1959 | 3.0% |
반 | 1931 | 3.0% |
추 | 1548 | 2.4% |
기 | 1522 | 2.4% |
타 | 1369 | 2.1% |
리 | 1343 | 2.1% |
고 | 1130 | 1.7% |
나 | 907 | 1.4% |
Other values (297) | 33526 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 44707 | 69.2% |
Close Punctuation | 9684 | 15.0% |
Open Punctuation | 9684 | 15.0% |
Space Separator | 445 | 0.7% |
Other Punctuation | 57 | 0.1% |
Decimal Number | 26 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
일 | 1959 | 4.4% |
반 | 1931 | 4.3% |
추 | 1548 | 3.5% |
기 | 1522 | 3.4% |
타 | 1369 | 3.1% |
리 | 1343 | 3.0% |
고 | 1130 | 2.5% |
나 | 907 | 2.0% |
수 | 892 | 2.0% |
파 | 779 | 1.7% |
Other values (292) | 31327 |
Close Punctuation
Value | Count | Frequency (%) |
) | 9684 |
Open Punctuation
Value | Count | Frequency (%) |
( | 9684 |
Space Separator
Value | Count | Frequency (%) |
(space) | 445 |
Other Punctuation
Value | Count | Frequency (%) |
, | 57 |
Decimal Number
Value | Count | Frequency (%) |
1 | 26 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 44707 | 69.2% |
Common | 19896 | 30.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
일 | 1959 | 4.4% |
반 | 1931 | 4.3% |
추 | 1548 | 3.5% |
기 | 1522 | 3.4% |
타 | 1369 | 3.1% |
리 | 1343 | 3.0% |
고 | 1130 | 2.5% |
나 | 907 | 2.0% |
수 | 892 | 2.0% |
파 | 779 | 1.7% |
Other values (292) | 31327 |
Common
Value | Count | Frequency (%) |
) | 9684 | 48.7% |
( | 9684 | 48.7% |
(space) | 445 | 2.2% |
, | 57 | 0.3% |
1 | 26 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 44707 | 69.2% |
ASCII | 19896 | 30.8% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
) | 9684 | 48.7% |
( | 9684 | 48.7% |
(space) | 445 | 2.2% |
, | 57 | 0.3% |
1 | 26 | 0.1% |
Hangul
Value | Count | Frequency (%) |
일 | 1959 | 4.4% |
반 | 1931 | 4.3% |
추 | 1548 | 3.5% |
기 | 1522 | 3.4% |
타 | 1369 | 3.1% |
리 | 1343 | 3.0% |
고 | 1130 | 2.5% |
나 | 907 | 2.0% |
수 | 892 | 2.0% |
파 | 779 | 1.7% |
Other values (292) | 31327 |
물량
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 2487 |
---|---|
Distinct (%) | 37.3% |
Missing | 610 |
Missing (%) | 8.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1797.8513 |
Minimum | 0.02 |
---|---|
Maximum | 79650 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 64.1 KiB |
Quantile statistics
Minimum | 0.02 |
---|---|
5-th percentile | 8 |
Q1 | 50 |
median | 240 |
Q3 | 1180.5 |
95-th percentile | 9097.1 |
Maximum | 79650 |
Range | 79649.98 |
Interquartile range (IQR) | 1130.5 |
Descriptive statistics
Standard deviation | 5064.0148 |
---|---|
Coefficient of variation (CV) | 2.8167039 |
Kurtosis | 48.283732 |
Mean | 1797.8513 |
Median Absolute Deviation (MAD) | 224 |
Skewness | 5.9524166 |
Sum | 11980881 |
Variance | 25644246 |
Monotonicity | Not monotonic |
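
Several of the 물량 statistics above are derived from one another. The sketch below rechecks those relationships from the reported values; the variable names are illustrative, and small differences are expected because the report rounds its figures.

```python
import math

# Values copied from the 물량 tables.
mean = 1797.8513
std = 5064.0148
q1, q3 = 50, 1180.5
minimum, maximum = 0.02, 79650
total = 11980881
n_observations, n_missing = 7274, 610

n_valid = n_observations - n_missing  # rows with a non-missing 물량
cv = std / mean                       # coefficient of variation
variance = std ** 2                   # variance is the squared standard deviation
iqr = q3 - q1                         # interquartile range
value_range = maximum - minimum       # range
mean_from_sum = total / n_valid       # mean recomputed from the sum

print(f"valid rows: {n_valid}")
print(f"CV: {cv:.4f}, variance: {variance:.0f}")
print(f"IQR: {iqr}, range: {value_range:.2f}")
print(f"mean from sum: {mean_from_sum:.4f}")
```

A coefficient of variation well above 1, together with a mean far above the median (240), is consistent with the strong right skew the report shows for this column.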
Value | Count | Frequency (%) |
20.0 | 171 | 2.4% |
40.0 | 150 | 2.1% |
10.0 | 142 | 2.0% |
50.0 | 91 | 1.3% |
60.0 | 91 | 1.3% |
30.0 | 79 | 1.1% |
4.0 | 71 | 1.0% |
8.0 | 70 | 1.0% |
16.0 | 69 | 0.9% |
100.0 | 68 | 0.9% |
Other values (2477) | 5662 | |
(Missing) | 610 | 8.4% |
Value | Count | Frequency (%) |
0.02 | 1 | < 0.1% |
0.03 | 1 | < 0.1% |
0.08 | 1 | < 0.1% |
0.1 | 2 | < 0.1% |
0.12 | 1 | < 0.1% |
0.2 | 5 | |
0.3 | 3 | |
0.5 | 5 | |
0.6 | 2 | < 0.1% |
0.7 | 1 | < 0.1% |
Value | Count | Frequency (%) |
79650.0 | 1 | |
79588.0 | 1 | |
68085.0 | 1 | |
58095.0 | 1 | |
54860.0 | 1 | |
54350.0 | 1 | |
49280.0 | 1 | |
49020.0 | 1 | |
47040.0 | 1 | |
45115.0 | 1 |
금액
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 4025 |
---|---|
Distinct (%) | 60.4% |
Missing | 610 |
Missing (%) | 8.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4436054.1 |
Minimum | 600 |
---|---|
Maximum | 2.175825 × 10^8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 64.1 KiB |
Quantile statistics
Minimum | 600 |
---|---|
5-th percentile | 25000 |
Q1 | 170000 |
median | 842000 |
Q3 | 3546750 |
95-th percentile | 18086725 |
Maximum | 2.175825 × 10^8 |
Range | 2.175819 × 10^8 |
Interquartile range (IQR) | 3376750 |
Descriptive statistics
Standard deviation | 12700395 |
---|---|
Coefficient of variation (CV) | 2.8629937 |
Kurtosis | 78.868159 |
Mean | 4436054.1 |
Median Absolute Deviation (MAD) | 785000 |
Skewness | 7.6574623 |
Sum | 2.9561864 × 10^10 |
Variance | 1.6130003 × 10^14 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20000 | 55 | 0.8% |
60000 | 52 | 0.7% |
30000 | 45 | 0.6% |
40000 | 45 | 0.6% |
100000 | 39 | 0.5% |
90000 | 34 | 0.5% |
120000 | 30 | 0.4% |
180000 | 28 | 0.4% |
26000 | 27 | 0.4% |
45000 | 26 | 0.4% |
Other values (4015) | 6283 | |
(Missing) | 610 | 8.4% |
Value | Count | Frequency (%) |
600 | 1 | < 0.1% |
900 | 1 | < 0.1% |
1000 | 1 | < 0.1% |
1500 | 2 | |
2000 | 1 | < 0.1% |
2800 | 1 | < 0.1% |
3000 | 4 | |
3600 | 1 | < 0.1% |
4000 | 4 | |
4500 | 3 |
Value | Count | Frequency (%) |
217582500 | 1 | |
213994000 | 1 | |
173048000 | 1 | |
171916500 | 1 | |
164225000 | 1 | |
156260000 | 1 | |
152450200 | 1 | |
151830000 | 1 | |
150809500 | 1 | |
149018500 | 1 |
 | 일자 | 물량 | 금액 |
---|---|---|---|
일자 | 1.000 | 0.000 | 0.000 |
물량 | 0.000 | 1.000 | 0.552 |
금액 | 0.000 | 0.552 | 1.000 |
 | 물량 | 금액 |
---|---|---|
물량 | 1.000 | 0.925 |
금액 | 0.925 | 1.000 |
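
The two matrices above give different values for the 물량/금액 pair (0.552 and 0.925), and this extract does not say which coefficient each one uses. One common reason for such a gap on heavily skewed data is the difference between a linear (Pearson) and a rank-based (Spearman) coefficient. The sketch below illustrates that effect only; it assumes nothing about how the report computed its matrices, and the `volume` and `amount` lists are hypothetical toy values, not rows from the dataset.

```python
import math

def pearson(xs, ys):
    """Linear (Pearson) correlation of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result

def spearman(xs, ys):
    """Rank (Spearman) correlation: Pearson applied to the ranks."""
    return pearson(ranks(xs), ranks(ys))

# Hypothetical toy data: strictly increasing, but far from linear.
volume = list(range(1, 9))
amount = [10 ** v for v in volume]

print(f"Pearson:  {pearson(volume, amount):.3f}")
print(f"Spearman: {spearman(volume, amount):.3f}")
```

On this toy data the rank coefficient is essentially 1 because the relationship is perfectly monotonic, while the linear coefficient is noticeably lower because a few very large values dominate.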
 | 일자 | 품목 | 물량 | 금액 |
---|---|---|---|---|
0 | 2023-12-01 | 가지(가지(일반)) | 1685.0 | 5430000 |
1 | 2023-12-01 | 감귤(기타) | 26625.0 | 51449000 |
2 | 2023-12-01 | 감귤(조생귤) | 25090.0 | 45312500 |
3 | 2023-12-01 | 감귤(황금향) | 2337.6 | 9971800 |
4 | 2023-12-01 | 감자(기타) | 10380.0 | 20116000 |
5 | 2023-12-01 | 감자(수미) | 7390.0 | 11408000 |
6 | 2023-12-01 | 갓(갓(일반)) | 654.0 | 2600500 |
7 | 2023-12-01 | 갓(돌산갓) | 370.0 | 794000 |
8 | 2023-12-01 | 갓(반청갓) | 5378.5 | 12258500 |
9 | 2023-12-01 | 갓(청갓) | 5320.0 | 13313000 |
 | 일자 | 품목 | 물량 | 금액 |
---|---|---|---|---|
7264 | <NA> | <NA> | <NA> | <NA> |
7265 | <NA> | <NA> | <NA> | <NA> |
7266 | <NA> | <NA> | <NA> | <NA> |
7267 | <NA> | <NA> | <NA> | <NA> |
7268 | <NA> | <NA> | <NA> | <NA> |
7269 | <NA> | <NA> | <NA> | <NA> |
7270 | <NA> | <NA> | <NA> | <NA> |
7271 | <NA> | <NA> | <NA> | <NA> |
7272 | <NA> | <NA> | <NA> | <NA> |
7273 | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
 | 일자 | 품목 | 물량 | 금액 | # duplicates |
---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | 610 |
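
The last rows of the dataset and the table above both show records in which all four fields are `<NA>`, and the count of 610 matches the number of missing values reported for every column. This suggests the missing cells come from fully empty rows rather than scattered gaps. A minimal sketch of filtering such rows with the standard library, assuming the source is a CSV file with these four column headers (the inline sample and the helper name `is_empty` are hypothetical):

```python
import csv
import io

# Hypothetical extract: two real-looking rows followed by one empty record.
sample = io.StringIO(
    "일자,품목,물량,금액\n"
    "2023-12-01,가지(가지(일반)),1685.0,5430000\n"
    "2023-12-01,감귤(기타),26625.0,51449000\n"
    ",,,\n"
)

def is_empty(row):
    """True when every field in the row is blank or missing."""
    return all(not (value or "").strip() for value in row.values())

rows = list(csv.DictReader(sample))
kept = [row for row in rows if not is_empty(row)]
dropped = len(rows) - len(kept)

print(f"rows read: {len(rows)}, kept: {len(kept)}, dropped: {dropped}")
```

Dropping such rows before profiling would remove the missing-value alerts only if the empty records really are the sole source of missing cells, which should be confirmed against the original file.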