Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells140
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory673.8 KiB
Average record size in memory69.0 B

Variable types

DateTime1
Categorical2
Numeric4

Dataset

Description2020년 낙엽기 화성시 포도 스마트팜 데이터 입니다.
Author경기도 화성시
URLhttps://www.data.go.kr/data/15097680/fileData.do

Alerts

code is highly overall correlated with 농가번호 and 3 other fieldsHigh correlation
농장번호 is highly overall correlated with 농가번호 and 3 other fieldsHigh correlation
농가번호 is highly overall correlated with Y and 2 other fieldsHigh correlation
X is highly overall correlated with code and 1 other fieldsHigh correlation
Y is highly overall correlated with 농가번호 and 2 other fieldsHigh correlation
Value has 140 (1.4%) missing valuesMissing

Reproduction

Analysis started2024-04-17 12:59:34.338785
Analysis finished2024-04-17 12:59:36.847666
Duration2.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct7276
Distinct (%)72.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-10-01 00:09:59.962000
Maximum2020-12-31 23:19:59.981000
2024-04-17T21:59:36.907461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:37.034815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

code
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1_1
1342 
16_0
1322 
15_0
1321 
14_1
1321 
12_0
1293 
Other values (3)
3401 

Length

Max length4
Median length4
Mean length3.8658
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row17_0
2nd row1_1
3rd row15_0
4th row12_0
5th row17_0

Common Values

ValueCountFrequency (%)
1_1 1342
13.4%
16_0 1322
13.2%
15_0 1321
13.2%
14_1 1321
13.2%
12_0 1293
12.9%
13_0 1286
12.9%
11_1 1246
12.5%
17_0 869
8.7%

Length

2024-04-17T21:59:37.158961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T21:59:37.261626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1_1 1342
13.4%
16_0 1322
13.2%
15_0 1321
13.2%
14_1 1321
13.2%
12_0 1293
12.9%
13_0 1286
12.9%
11_1 1246
12.5%
17_0 869
8.7%

농가번호
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.1516
Minimum1
Maximum17
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-17T21:59:37.361782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q111
median13
Q315
95-th percentile17
Maximum17
Range16
Interquartile range (IQR)4

Descriptive statistics

Standard deviation4.7408507
Coefficient of variation (CV)0.39014209
Kurtosis1.283823
Mean12.1516
Median Absolute Deviation (MAD)2
Skewness-1.5479816
Sum121516
Variance22.475665
MonotonicityNot monotonic
2024-04-17T21:59:37.458965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1 1342
13.4%
16 1322
13.2%
15 1321
13.2%
14 1321
13.2%
12 1293
12.9%
13 1286
12.9%
11 1246
12.5%
17 869
8.7%
ValueCountFrequency (%)
1 1342
13.4%
11 1246
12.5%
12 1293
12.9%
13 1286
12.9%
14 1321
13.2%
15 1321
13.2%
16 1322
13.2%
17 869
8.7%
ValueCountFrequency (%)
17 869
8.7%
16 1322
13.2%
15 1321
13.2%
14 1321
13.2%
13 1286
12.9%
12 1293
12.9%
11 1246
12.5%
1 1342
13.4%

농장번호
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
6091 
1
3909 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 6091
60.9%
1 3909
39.1%

Length

2024-04-17T21:59:37.569131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T21:59:37.672381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 6091
60.9%
1 3909
39.1%

X
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean932911.43
Minimum928164
Maximum940379
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-17T21:59:37.755351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum928164
5-th percentile928164
Q1929197
median932196
Q3937806
95-th percentile940379
Maximum940379
Range12215
Interquartile range (IQR)8609

Descriptive statistics

Standard deviation4080.9294
Coefficient of variation (CV)0.0043744017
Kurtosis-0.87674833
Mean932911.43
Median Absolute Deviation (MAD)2999
Skewness0.66505513
Sum9.3291143 × 109
Variance16653985
MonotonicityNot monotonic
2024-04-17T21:59:37.848682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
933410 1342
13.4%
929197 1322
13.2%
940379 1321
13.2%
932196 1321
13.2%
928872 1293
12.9%
937806 1286
12.9%
931608 1246
12.5%
928164 869
8.7%
ValueCountFrequency (%)
928164 869
8.7%
928872 1293
12.9%
929197 1322
13.2%
931608 1246
12.5%
932196 1321
13.2%
933410 1342
13.4%
937806 1286
12.9%
940379 1321
13.2%
ValueCountFrequency (%)
940379 1321
13.2%
937806 1286
12.9%
933410 1342
13.4%
932196 1321
13.2%
931608 1246
12.5%
929197 1322
13.2%
928872 1293
12.9%
928164 869
8.7%

Y
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1913882.3
Minimum1910119
Maximum1916594
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-17T21:59:37.943057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1910119
5-th percentile1910119
Q11911584
median1913982
Q31916477
95-th percentile1916594
Maximum1916594
Range6475
Interquartile range (IQR)4893

Descriptive statistics

Standard deviation2128.8001
Coefficient of variation (CV)0.0011122942
Kurtosis-0.89216306
Mean1913882.3
Median Absolute Deviation (MAD)2398
Skewness-0.37236144
Sum1.9138823 × 1010
Variance4531789.9
MonotonicityNot monotonic
2024-04-17T21:59:38.042298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1916594 1342
13.4%
1913483 1322
13.2%
1910119 1321
13.2%
1913982 1321
13.2%
1916477 1293
12.9%
1911584 1286
12.9%
1914754 1246
12.5%
1914162 869
8.7%
ValueCountFrequency (%)
1910119 1321
13.2%
1911584 1286
12.9%
1913483 1322
13.2%
1913982 1321
13.2%
1914162 869
8.7%
1914754 1246
12.5%
1916477 1293
12.9%
1916594 1342
13.4%
ValueCountFrequency (%)
1916594 1342
13.4%
1916477 1293
12.9%
1914754 1246
12.5%
1914162 869
8.7%
1913982 1321
13.2%
1913483 1322
13.2%
1911584 1286
12.9%
1910119 1321
13.2%

Value
Real number (ℝ)

MISSING 

Distinct4470
Distinct (%)45.3%
Missing140
Missing (%)1.4%
Infinite0
Infinite (%)0.0%
Mean76.930733
Minimum22.92
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-17T21:59:38.156728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum22.92
5-th percentile39.66
Q160.575
median82.625
Q394.89
95-th percentile100
Maximum100
Range77.08
Interquartile range (IQR)34.315

Descriptive statistics

Standard deviation20.232317
Coefficient of variation (CV)0.26299395
Kurtosis-0.87487417
Mean76.930733
Median Absolute Deviation (MAD)14.675
Skewness-0.60157934
Sum758537.03
Variance409.34667
MonotonicityNot monotonic
2024-04-17T21:59:38.294313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100.0 802
 
8.0%
99.94 16
 
0.2%
95.7 11
 
0.1%
92.86 11
 
0.1%
99.98 11
 
0.1%
95.69 10
 
0.1%
99.96 9
 
0.1%
98.27 9
 
0.1%
93.02 9
 
0.1%
99.93 9
 
0.1%
Other values (4460) 8963
89.6%
(Missing) 140
 
1.4%
ValueCountFrequency (%)
22.92 1
< 0.1%
23.41 2
< 0.1%
23.91 1
< 0.1%
24.01 1
< 0.1%
24.77 1
< 0.1%
25.1 1
< 0.1%
25.83 1
< 0.1%
26.07 1
< 0.1%
26.11 1
< 0.1%
26.21 2
< 0.1%
ValueCountFrequency (%)
100.0 802
8.0%
99.99 7
 
0.1%
99.98 11
 
0.1%
99.97 8
 
0.1%
99.96 9
 
0.1%
99.95 1
 
< 0.1%
99.94 16
 
0.2%
99.93 9
 
0.1%
99.92 6
 
0.1%
99.91 3
 
< 0.1%

Interactions

2024-04-17T21:59:36.262960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:34.873539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:35.531325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:35.902905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:36.359038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:34.964959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:35.626350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:35.994565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:36.455518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:35.055709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:35.718674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:36.085037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:36.553897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:35.442541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:35.809615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T21:59:36.169978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T21:59:38.385874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
code농가번호농장번호XYValue
code1.0001.0001.0001.0001.0000.195
농가번호1.0001.0000.5580.9320.9770.270
농장번호1.0000.5581.0001.0000.5120.142
X1.0000.9321.0001.0000.9080.218
Y1.0000.9770.5120.9081.0000.286
Value0.1950.2700.1420.2180.2861.000
2024-04-17T21:59:38.477734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
code농장번호
code1.0001.000
농장번호1.0001.000
2024-04-17T21:59:38.552964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
농가번호XYValuecode농장번호
농가번호1.000-0.194-0.6680.0191.0000.675
X-0.1941.000-0.490-0.0391.0001.000
Y-0.668-0.4901.0000.0261.0000.667
Value0.019-0.0390.0261.0000.0940.109
code1.0001.0001.0000.0941.0001.000
농장번호0.6751.0000.6670.1091.0001.000

Missing values

2024-04-17T21:59:36.701209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T21:59:36.803734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

측정일시code농가번호농장번호XYValue
989822020-11-10 14:19:59.98117_0170928164191416241.46
103362020-10-21 05:10:00.0191_111933410191659496.73
749502020-10-26 18:40:00.01915_0150940379191011990.42
373572020-10-15 05:39:59.99012_0120928872191647784.59
961862020-11-30 00:20:00.01017_0170928164191416285.83
250512020-10-09 17:50:00.03811_1111931608191475464.49
439412020-11-29 09:49:59.98113_0130937806191158457.62
580302020-11-22 10:09:59.99014_11419321961913982100.0
949822020-12-08 09:00:00.00017_0170928164191416270.4
734242020-11-06 09:00:00.00015_0150940379191011962.93
측정일시code농가번호농장번호XYValue
148922020-12-19 10:20:00.03811_1111931608191475437.29
558782020-12-07 08:50:00.03814_1141932196191398284.27
588412020-11-16 19:00:00.02914_1141932196191398298.32
69842020-11-13 11:50:00.0381_111933410191659466.46
51532020-11-26 04:59:59.9711_111933410191659499.33
871452020-11-01 22:39:59.96216_01609291971913483100.0
145912020-12-21 12:29:59.97111_1111931608191475446.19
938422020-12-16 07:00:00.02917_0170928164191416275.69
876092020-10-29 17:19:59.98116_0160929197191348355.08
265532020-12-29 07:20:00.03812_01209288721916477100.0