Overview

Dataset statistics

Number of variables8
Number of observations346
Missing cells346
Missing cells (%)12.5%
Duplicate rows12
Duplicate rows (%)3.5%
Total size in memory23.4 KiB
Average record size in memory69.4 B

Variable types

Categorical3
Numeric4
Unsupported1

Dataset

Description농업용 면세유 사용위반(용도외사용) 정보 현황
Author농림축산식품부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220217000000002052

Alerts

VIOLT_TY has constant value ""Constant
Dataset has 12 (3.5%) duplicate rowsDuplicates
VIOLT_QY is highly overall correlated with VIOLT_AMTHigh correlation
VIOLT_AMT is highly overall correlated with VIOLT_QYHigh correlation
RTRVL_QY is highly overall correlated with RTRVL_AMTHigh correlation
RTRVL_AMT is highly overall correlated with RTRVL_QYHigh correlation
FRCN_KND is highly overall correlated with OIL_KNDHigh correlation
OIL_KND is highly overall correlated with FRCN_KNDHigh correlation
SPT_MANAGT has 346 (100.0%) missing valuesMissing
SPT_MANAGT is an unsupported type, check if it needs cleaning or further analysisUnsupported
VIOLT_QY has 53 (15.3%) zerosZeros
VIOLT_AMT has 68 (19.7%) zerosZeros
RTRVL_QY has 169 (48.8%) zerosZeros
RTRVL_AMT has 181 (52.3%) zerosZeros

Reproduction

Analysis started2023-12-11 03:38:04.985980
Analysis finished2023-12-11 03:38:07.554369
Duration2.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

VIOLT_TY
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
농업용도 외 사용
346 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row농업용도 외 사용
2nd row농업용도 외 사용
3rd row농업용도 외 사용
4th row농업용도 외 사용
5th row농업용도 외 사용

Common Values

ValueCountFrequency (%)
농업용도 외 사용 346
100.0%

Length

2023-12-11T12:38:07.651492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:38:07.794270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
농업용도 346
33.3%
346
33.3%
사용 346
33.3%

FRCN_KND
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)8.4%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
온풍난방기-간접식(유류용)
100 
동력경운기
51 
온풍난방기-직화식(유류용)-대포형
36 
농산물건조기(선반형)
28 
농업용 트랙터
23 
Other values (24)
108 

Length

Max length21
Median length17
Mean length11.419075
Min length3

Unique

Unique9 ?
Unique (%)2.6%

Sample

1st row온풍난방기-간접식(유류용)
2nd row동력경운기
3rd row예도형 동력예취기
4th row예도형 동력예취기
5th row온풍난방기-간접식(유류용)

Common Values

ValueCountFrequency (%)
온풍난방기-간접식(유류용) 100
28.9%
동력경운기 51
14.7%
온풍난방기-직화식(유류용)-대포형 36
 
10.4%
농산물건조기(선반형) 28
 
8.1%
농업용 트랙터 23
 
6.6%
관리기(보행형) 22
 
6.4%
온수보일러(유류용) 20
 
5.8%
농산물건조기(시설형-유류용)_온풍난방기 9
 
2.6%
동력이앙기(보행형) 7
 
2.0%
고속분무기(SS기) 6
 
1.7%
Other values (19) 44
12.7%

Length

2023-12-11T12:38:07.922985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
온풍난방기-간접식(유류용 100
26.2%
동력경운기 51
13.4%
온풍난방기-직화식(유류용)-대포형 36
 
9.4%
농산물건조기(선반형 28
 
7.3%
농업용 23
 
6.0%
트랙터 23
 
6.0%
관리기(보행형 22
 
5.8%
온수보일러(유류용 20
 
5.2%
농산물건조기(시설형-유류용)_온풍난방기 9
 
2.4%
동력이앙기(보행형 7
 
1.8%
Other values (24) 62
16.3%

OIL_KND
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
경유
218 
휘발유
76 
실내등유
48 
중유
 
4

Length

Max length4
Median length2
Mean length2.4971098
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경유
2nd row경유
3rd row휘발유
4th row휘발유
5th row경유

Common Values

ValueCountFrequency (%)
경유 218
63.0%
휘발유 76
 
22.0%
실내등유 48
 
13.9%
중유 4
 
1.2%

Length

2023-12-11T12:38:08.074473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:38:08.231106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경유 218
63.0%
휘발유 76
 
22.0%
실내등유 48
 
13.9%
중유 4
 
1.2%

VIOLT_QY
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct175
Distinct (%)50.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2525.9595
Minimum0
Maximum66498
Zeros53
Zeros (%)15.3%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-11T12:38:08.377952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q151.5
median457.5
Q32655.75
95-th percentile11459
Maximum66498
Range66498
Interquartile range (IQR)2604.25

Descriptive statistics

Standard deviation5654.7909
Coefficient of variation (CV)2.2386704
Kurtosis53.897037
Mean2525.9595
Median Absolute Deviation (MAD)457.5
Skewness5.9927611
Sum873982
Variance31976660
MonotonicityNot monotonic
2023-12-11T12:38:08.580199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 53
 
15.3%
400 13
 
3.8%
1000 10
 
2.9%
360 8
 
2.3%
3000 7
 
2.0%
40 7
 
2.0%
200 7
 
2.0%
1500 6
 
1.7%
300 6
 
1.7%
2000 6
 
1.7%
Other values (165) 223
64.5%
ValueCountFrequency (%)
0 53
15.3%
8 1
 
0.3%
10 3
 
0.9%
13 1
 
0.3%
15 1
 
0.3%
16 1
 
0.3%
18 1
 
0.3%
20 4
 
1.2%
24 1
 
0.3%
28 2
 
0.6%
ValueCountFrequency (%)
66498 1
0.3%
40000 1
0.3%
28000 1
0.3%
21000 1
0.3%
20000 1
0.3%
18538 1
0.3%
18298 1
0.3%
17214 1
0.3%
15900 1
0.3%
14754 1
0.3%

VIOLT_AMT
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct247
Distinct (%)71.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2743272.5
Minimum0
Maximum66498000
Zeros68
Zeros (%)19.7%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-11T12:38:08.763931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q136348.75
median443000
Q33027000
95-th percentile12965975
Maximum66498000
Range66498000
Interquartile range (IQR)2990651.2

Descriptive statistics

Standard deviation6063800.4
Coefficient of variation (CV)2.2104258
Kurtosis42.163594
Mean2743272.5
Median Absolute Deviation (MAD)443000
Skewness5.2976428
Sum9.491723 × 108
Variance3.6769675 × 1013
MonotonicityNot monotonic
2023-12-11T12:38:08.926055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 68
 
19.7%
405360 5
 
1.4%
1156500 4
 
1.2%
1030400 3
 
0.9%
225200 3
 
0.9%
248400 3
 
0.9%
3390000 3
 
0.9%
100000 2
 
0.6%
563000 2
 
0.6%
460000 2
 
0.6%
Other values (237) 251
72.5%
ValueCountFrequency (%)
0 68
19.7%
850 1
 
0.3%
10000 1
 
0.3%
11500 1
 
0.3%
15000 1
 
0.3%
15310 1
 
0.3%
16000 1
 
0.3%
18900 1
 
0.3%
20000 1
 
0.3%
21000 1
 
0.3%
ValueCountFrequency (%)
66498000 1
0.3%
43320000 1
0.3%
29400000 1
0.3%
23520000 1
0.3%
21000000 1
0.3%
20932910 1
0.3%
19796100 1
0.3%
19650280 1
0.3%
19485000 1
0.3%
17490000 1
0.3%

RTRVL_QY
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct135
Distinct (%)39.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3086.6908
Minimum0
Maximum88000
Zeros169
Zeros (%)48.8%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-11T12:38:09.075536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median12.5
Q3563
95-th percentile15693.5
Maximum88000
Range88000
Interquartile range (IQR)563

Descriptive statistics

Standard deviation9668.0792
Coefficient of variation (CV)3.1321827
Kurtosis35.385738
Mean3086.6908
Median Absolute Deviation (MAD)12.5
Skewness5.4904126
Sum1067995
Variance93471756
MonotonicityNot monotonic
2023-12-11T12:38:09.247459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 169
48.8%
360 13
 
3.8%
137 9
 
2.6%
563 6
 
1.7%
40 4
 
1.2%
68 3
 
0.9%
172 3
 
0.9%
15 2
 
0.6%
4180 2
 
0.6%
52 2
 
0.6%
Other values (125) 133
38.4%
ValueCountFrequency (%)
0 169
48.8%
2 1
 
0.3%
6 1
 
0.3%
10 1
 
0.3%
11 1
 
0.3%
14 2
 
0.6%
15 2
 
0.6%
16 1
 
0.3%
20 1
 
0.3%
22 1
 
0.3%
ValueCountFrequency (%)
88000 1
0.3%
71031 1
0.3%
64758 1
0.3%
60441 1
0.3%
55998 1
0.3%
50501 1
0.3%
34577 1
0.3%
27000 1
0.3%
24000 2
0.6%
23398 1
0.3%

RTRVL_AMT
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct151
Distinct (%)43.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3055559
Minimum0
Maximum92400000
Zeros181
Zeros (%)52.3%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-11T12:38:09.419879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q3515306.25
95-th percentile16517381
Maximum92400000
Range92400000
Interquartile range (IQR)515306.25

Descriptive statistics

Standard deviation9819958.9
Coefficient of variation (CV)3.2138011
Kurtosis38.117098
Mean3055559
Median Absolute Deviation (MAD)0
Skewness5.6387239
Sum1.0572234 × 109
Variance9.6431594 × 1013
MonotonicityNot monotonic
2023-12-11T12:38:09.600492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 181
52.3%
405360 6
 
1.7%
413640 5
 
1.4%
147000 2
 
0.6%
173720 2
 
0.6%
138370 2
 
0.6%
646887 2
 
0.6%
1434975 2
 
0.6%
850 2
 
0.6%
86000 1
 
0.3%
Other values (141) 141
40.8%
ValueCountFrequency (%)
0 181
52.3%
850 2
 
0.6%
11000 1
 
0.3%
14700 1
 
0.3%
15135 1
 
0.3%
15750 1
 
0.3%
16000 1
 
0.3%
20000 1
 
0.3%
22000 1
 
0.3%
25000 1
 
0.3%
ValueCountFrequency (%)
92400000 1
0.3%
72529200 1
0.3%
71233800 1
0.3%
58797900 1
0.3%
55551100 1
0.3%
38034700 1
0.3%
30000000 1
0.3%
28350000 1
0.3%
26795000 1
0.3%
26400000 1
0.3%

SPT_MANAGT
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing346
Missing (%)100.0%
Memory size3.2 KiB

Interactions

2023-12-11T12:38:06.768815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:05.245483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:05.858837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:06.249574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:06.878075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:05.330038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:05.955803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:06.357629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:07.007751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:05.413586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:06.057562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:06.515131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:07.119328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:05.493484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:06.152270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:38:06.643436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T12:38:09.743022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
FRCN_KNDOIL_KNDVIOLT_QYVIOLT_AMTRTRVL_QYRTRVL_AMT
FRCN_KND1.0000.8660.4490.2790.0000.000
OIL_KND0.8661.0000.1040.1550.5700.735
VIOLT_QY0.4490.1041.0000.9980.6530.221
VIOLT_AMT0.2790.1550.9981.0000.6510.175
RTRVL_QY0.0000.5700.6530.6511.0000.940
RTRVL_AMT0.0000.7350.2210.1750.9401.000
2023-12-11T12:38:09.896199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
FRCN_KNDOIL_KND
FRCN_KND1.0000.630
OIL_KND0.6301.000
2023-12-11T12:38:10.038409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
VIOLT_QYVIOLT_AMTRTRVL_QYRTRVL_AMTFRCN_KNDOIL_KND
VIOLT_QY1.0000.9310.0670.0710.1960.071
VIOLT_AMT0.9311.0000.1000.1200.1140.106
RTRVL_QY0.0670.1001.0000.9370.0000.400
RTRVL_AMT0.0710.1200.9371.0000.0000.402
FRCN_KND0.1960.1140.0000.0001.0000.630
OIL_KND0.0710.1060.4000.4020.6301.000

Missing values

2023-12-11T12:38:07.280427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:38:07.468481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

VIOLT_TYFRCN_KNDOIL_KNDVIOLT_QYVIOLT_AMTRTRVL_QYRTRVL_AMTSPT_MANAGT
0농업용도 외 사용온풍난방기-간접식(유류용)경유1585168010000<NA>
1농업용도 외 사용동력경운기경유202300000<NA>
2농업용도 외 사용예도형 동력예취기휘발유202100000<NA>
3농업용도 외 사용예도형 동력예취기휘발유403600000<NA>
4농업용도 외 사용온풍난방기-간접식(유류용)경유1006115650000<NA>
5농업용도 외 사용온풍난방기-간접식(유류용)경유1150132250000<NA>
6농업용도 외 사용온풍난방기-간접식(유류용)실내등유896103040000<NA>
7농업용도 외 사용온풍난방기-간접식(유류용)실내등유896103040000<NA>
8농업용도 외 사용온풍난방기-간접식(유류용)실내등유896103040000<NA>
9농업용도 외 사용온풍난방기-직화식(유류용)-대포형경유1006115650000<NA>
VIOLT_TYFRCN_KNDOIL_KNDVIOLT_QYVIOLT_AMTRTRVL_QYRTRVL_AMTSPT_MANAGT
336농업용도 외 사용동력이앙기(보행형)휘발유001111000<NA>
337농업용도 외 사용동력이앙기(보행형)휘발유18153106850<NA>
338농업용도 외 사용동력이앙기(보행형)휘발유1085010850<NA>
339농업용도 외 사용동력이앙기(승용형)휘발유53451005345100<NA>
340농업용도 외 사용버섯재배소독기(살균솥-병/봉지)경유5211327771000<NA>
341농업용도 외 사용예도형 동력예취기휘발유002020000<NA>
342농업용도 외 사용농업용 트랙터경유80080000000<NA>
343농업용도 외 사용온풍난방기-간접식(유류용)경유105001050000000<NA>
344농업용도 외 사용온풍난방기-간접식(유류용)경유110001100000000<NA>
345농업용도 외 사용온풍난방기-직화식(유류용)-대포형경유664986649800000<NA>

Duplicate rows

Most frequently occurring

VIOLT_TYFRCN_KNDOIL_KNDVIOLT_QYVIOLT_AMTRTRVL_QYRTRVL_AMT# duplicates
2농업용도 외 사용농산물건조기(선반형)실내등유3604053603604053605
4농업용도 외 사용동력경운기휘발유00004
7농업용도 외 사용온풍난방기-간접식(유류용)실내등유8961030400003
8농업용도 외 사용온풍난방기-직화식(유류용)-대포형경유216248400003
10농업용도 외 사용온풍난방기-직화식(유류용)-대포형경유10061156500003
0농업용도 외 사용관리기(보행형)휘발유4037800002
1농업용도 외 사용농산물건조기(선반형)실내등유1001126003604136402
3농업용도 외 사용동력경운기경유001721737202
5농업용도 외 사용온풍난방기-간접식(유류용)경유00120002
6농업용도 외 사용온풍난방기-간접식(유류용)경유68807451040132514349752