Overview

Dataset statistics

Number of variables4
Number of observations7274
Missing cells2440
Missing cells (%)8.4%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory241.6 KiB
Average record size in memory34.0 B

Variable types

DateTime1
Text1
Numeric2

Dataset

Description인천광역시 남촌농산물도매시장에서 거래되는 농산물에 대한 경매 가격 정보로 거래일자, 품목, 물량, 금액 등을 볼 수 있습니다
Author인천광역시
URLhttps://www.data.go.kr/data/15051663/fileData.do

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
물량 is highly overall correlated with 금액High correlation
금액 is highly overall correlated with 물량High correlation
일자 has 610 (8.4%) missing valuesMissing
품목 has 610 (8.4%) missing valuesMissing
물량 has 610 (8.4%) missing valuesMissing
금액 has 610 (8.4%) missing valuesMissing

Reproduction

Analysis started2024-03-15 00:46:02.387421
Analysis finished2024-03-15 00:46:04.776089
Duration2.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일자
Date

MISSING 

Distinct26
Distinct (%)0.4%
Missing610
Missing (%)8.4%
Memory size57.0 KiB
Minimum2023-12-01 00:00:00
Maximum2023-12-30 00:00:00
2024-03-15T09:46:05.048165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T09:46:05.271553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)

품목
Text

MISSING 

Distinct475
Distinct (%)7.1%
Missing610
Missing (%)8.4%
Memory size57.0 KiB
2024-03-15T09:46:06.122131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length17
Mean length9.6943277
Min length5

Characters and Unicode

Total characters64603
Distinct characters307
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)0.8%

Sample

1st row가지(가지(일반))
2nd row감귤(기타)
3rd row감귤(조생귤)
4th row감귤(황금향)
5th row감자(기타)
ValueCountFrequency (%)
195
 
2.7%
전분 195
 
2.7%
마늘(깐마늘 52
 
0.7%
곡물제조(순두부(수입 26
 
0.4%
새싹(기타 26
 
0.4%
미역(줄기미역 26
 
0.4%
양배추(양배추(일반 26
 
0.4%
쑥갓(쑥갓(일반 26
 
0.4%
시금치(시금치(일반 26
 
0.4%
숙주나물(숙주나물(일반 26
 
0.4%
Other values (469) 6485
91.2%
2024-03-15T09:46:07.531001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 9684
 
15.0%
( 9684
 
15.0%
1959
 
3.0%
1931
 
3.0%
1548
 
2.4%
1522
 
2.4%
1369
 
2.1%
1343
 
2.1%
1130
 
1.7%
907
 
1.4%
Other values (297) 33526
51.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 44707
69.2%
Close Punctuation 9684
 
15.0%
Open Punctuation 9684
 
15.0%
Space Separator 445
 
0.7%
Other Punctuation 57
 
0.1%
Decimal Number 26
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1959
 
4.4%
1931
 
4.3%
1548
 
3.5%
1522
 
3.4%
1369
 
3.1%
1343
 
3.0%
1130
 
2.5%
907
 
2.0%
892
 
2.0%
779
 
1.7%
Other values (292) 31327
70.1%
Close Punctuation
ValueCountFrequency (%)
) 9684
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9684
100.0%
Space Separator
ValueCountFrequency (%)
445
100.0%
Other Punctuation
ValueCountFrequency (%)
, 57
100.0%
Decimal Number
ValueCountFrequency (%)
1 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 44707
69.2%
Common 19896
30.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1959
 
4.4%
1931
 
4.3%
1548
 
3.5%
1522
 
3.4%
1369
 
3.1%
1343
 
3.0%
1130
 
2.5%
907
 
2.0%
892
 
2.0%
779
 
1.7%
Other values (292) 31327
70.1%
Common
ValueCountFrequency (%)
) 9684
48.7%
( 9684
48.7%
445
 
2.2%
, 57
 
0.3%
1 26
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 44707
69.2%
ASCII 19896
30.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 9684
48.7%
( 9684
48.7%
445
 
2.2%
, 57
 
0.3%
1 26
 
0.1%
Hangul
ValueCountFrequency (%)
1959
 
4.4%
1931
 
4.3%
1548
 
3.5%
1522
 
3.4%
1369
 
3.1%
1343
 
3.0%
1130
 
2.5%
907
 
2.0%
892
 
2.0%
779
 
1.7%
Other values (292) 31327
70.1%

물량
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct2487
Distinct (%)37.3%
Missing610
Missing (%)8.4%
Infinite0
Infinite (%)0.0%
Mean1797.8513
Minimum0.02
Maximum79650
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size64.1 KiB
2024-03-15T09:46:07.894286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.02
5-th percentile8
Q150
median240
Q31180.5
95-th percentile9097.1
Maximum79650
Range79649.98
Interquartile range (IQR)1130.5

Descriptive statistics

Standard deviation5064.0148
Coefficient of variation (CV)2.8167039
Kurtosis48.283732
Mean1797.8513
Median Absolute Deviation (MAD)224
Skewness5.9524166
Sum11980881
Variance25644246
MonotonicityNot monotonic
2024-03-15T09:46:08.360863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20.0 171
 
2.4%
40.0 150
 
2.1%
10.0 142
 
2.0%
50.0 91
 
1.3%
60.0 91
 
1.3%
30.0 79
 
1.1%
4.0 71
 
1.0%
8.0 70
 
1.0%
16.0 69
 
0.9%
100.0 68
 
0.9%
Other values (2477) 5662
77.8%
(Missing) 610
 
8.4%
ValueCountFrequency (%)
0.02 1
 
< 0.1%
0.03 1
 
< 0.1%
0.08 1
 
< 0.1%
0.1 2
 
< 0.1%
0.12 1
 
< 0.1%
0.2 5
0.1%
0.3 3
< 0.1%
0.5 5
0.1%
0.6 2
 
< 0.1%
0.7 1
 
< 0.1%
ValueCountFrequency (%)
79650.0 1
< 0.1%
79588.0 1
< 0.1%
68085.0 1
< 0.1%
58095.0 1
< 0.1%
54860.0 1
< 0.1%
54350.0 1
< 0.1%
49280.0 1
< 0.1%
49020.0 1
< 0.1%
47040.0 1
< 0.1%
45115.0 1
< 0.1%

금액
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct4025
Distinct (%)60.4%
Missing610
Missing (%)8.4%
Infinite0
Infinite (%)0.0%
Mean4436054.1
Minimum600
Maximum2.175825 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size64.1 KiB
2024-03-15T09:46:08.819569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum600
5-th percentile25000
Q1170000
median842000
Q33546750
95-th percentile18086725
Maximum2.175825 × 108
Range2.175819 × 108
Interquartile range (IQR)3376750

Descriptive statistics

Standard deviation12700395
Coefficient of variation (CV)2.8629937
Kurtosis78.868159
Mean4436054.1
Median Absolute Deviation (MAD)785000
Skewness7.6574623
Sum2.9561864 × 1010
Variance1.6130003 × 1014
MonotonicityNot monotonic
2024-03-15T09:46:09.271987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20000 55
 
0.8%
60000 52
 
0.7%
30000 45
 
0.6%
40000 45
 
0.6%
100000 39
 
0.5%
90000 34
 
0.5%
120000 30
 
0.4%
180000 28
 
0.4%
26000 27
 
0.4%
45000 26
 
0.4%
Other values (4015) 6283
86.4%
(Missing) 610
 
8.4%
ValueCountFrequency (%)
600 1
 
< 0.1%
900 1
 
< 0.1%
1000 1
 
< 0.1%
1500 2
< 0.1%
2000 1
 
< 0.1%
2800 1
 
< 0.1%
3000 4
0.1%
3600 1
 
< 0.1%
4000 4
0.1%
4500 3
< 0.1%
ValueCountFrequency (%)
217582500 1
< 0.1%
213994000 1
< 0.1%
173048000 1
< 0.1%
171916500 1
< 0.1%
164225000 1
< 0.1%
156260000 1
< 0.1%
152450200 1
< 0.1%
151830000 1
< 0.1%
150809500 1
< 0.1%
149018500 1
< 0.1%

Interactions

2024-03-15T09:46:03.422122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T09:46:02.852037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T09:46:03.707068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T09:46:03.151593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T09:46:09.532951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일자물량금액
일자1.0000.0000.000
물량0.0001.0000.552
금액0.0000.5521.000
2024-03-15T09:46:09.769258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
물량금액
물량1.0000.925
금액0.9251.000

Missing values

2024-03-15T09:46:04.034331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T09:46:04.332022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-15T09:46:04.656801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

일자품목물량금액
02023-12-01가지(가지(일반))1685.05430000
12023-12-01감귤(기타)26625.051449000
22023-12-01감귤(조생귤)25090.045312500
32023-12-01감귤(황금향)2337.69971800
42023-12-01감자(기타)10380.020116000
52023-12-01감자(수미)7390.011408000
62023-12-01갓(갓(일반))654.02600500
72023-12-01갓(돌산갓)370.0794000
82023-12-01갓(반청갓)5378.512258500
92023-12-01갓(청갓)5320.013313000
일자품목물량금액
7264<NA><NA><NA><NA>
7265<NA><NA><NA><NA>
7266<NA><NA><NA><NA>
7267<NA><NA><NA><NA>
7268<NA><NA><NA><NA>
7269<NA><NA><NA><NA>
7270<NA><NA><NA><NA>
7271<NA><NA><NA><NA>
7272<NA><NA><NA><NA>
7273<NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

일자품목물량금액# duplicates
0<NA><NA><NA><NA>610