Overview

Dataset statistics

Number of variables: 8
Number of observations: 346
Missing cells: 346
Missing cells (%): 12.5%
Duplicate rows: 12
Duplicate rows (%): 3.5%
Total size in memory: 23.4 KiB
Average record size in memory: 69.4 B
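
The two percentages above follow directly from the counts. A minimal sketch of that arithmetic, assuming the missing-cell share is taken over all cells (rows × variables) and the duplicate share over rows; the helper name `pct` is illustrative only:

```python
# Counts copied from the "Dataset statistics" block.
n_rows = 346
n_cols = 8
missing_cells = 346
duplicate_rows = 12


def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


missing_pct = pct(missing_cells, n_rows * n_cols)  # share of all cells
duplicate_pct = pct(duplicate_rows, n_rows)        # share of all rows
print(missing_pct, duplicate_pct)
```

The 346 missing cells equal the number of observations, which is consistent with a single column (현장조치) being empty in every row.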

Variable types

Categorical: 3
Numeric: 4
Unsupported: 1

Dataset

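Description: Status of violations in the use of tax-exempt agricultural fuel (use for non-designated purposes); original title: 농업용 면세유 사용위반(용도외사용) 정보 현황
Author: Ministry of Agriculture, Food and Rural Affairs (농림축산식품부)
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220217000000002052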

Alerts

위반유형 has constant value "농업용도 외 사용" [Constant]
Dataset has 12 (3.5%) duplicate rows [Duplicates]
위반물량 is highly overall correlated with 위반금액 [High correlation]
위반금액 is highly overall correlated with 위반물량 [High correlation]
회수물량 is highly overall correlated with 회수금액 [High correlation]
회수금액 is highly overall correlated with 회수물량 [High correlation]
기종종 is highly overall correlated with 유종 [High correlation]
유종 is highly overall correlated with 기종종 [High correlation]
현장조치 has 346 (100.0%) missing values [Missing]
현장조치 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
위반물량 has 53 (15.3%) zeros [Zeros]
위반금액 has 68 (19.7%) zeros [Zeros]
회수물량 has 169 (48.8%) zeros [Zeros]
회수금액 has 181 (52.3%) zeros [Zeros]
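
The zero alerts can be cross-checked against the row count, and they also imply a small consistency check between each volume column and its amount column. The sketch below uses only the counts from the alerts; the dictionary keys "위반" and "회수" are labels chosen here for illustration:

```python
n_rows = 346
zeros = {"위반물량": 53, "위반금액": 68, "회수물량": 169, "회수금액": 181}

# Share of zeros per column, rounded as in the report.
zero_pct = {col: round(100 * count / n_rows, 1) for col, count in zeros.items()}

# Rows with a zero amount split into (volume zero) and (volume non-zero).
# The first group cannot exceed the number of zero volumes, so the difference
# is a lower bound on rows with a non-zero volume but a zero amount.
min_nonzero_volume_zero_amount = {
    "위반": zeros["위반금액"] - zeros["위반물량"],
    "회수": zeros["회수금액"] - zeros["회수물량"],
}
print(zero_pct)
print(min_nonzero_volume_zero_amount)
```

Under this reading, at least 15 violation records and at least 12 recovery records carry a volume without an amount, which may be worth checking in the source data.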

Reproduction

Analysis started: 2023-12-11 03:38:11.373370
Analysis finished: 2023-12-11 03:38:13.764335
Duration: 2.39 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
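
A report like this one can be regenerated roughly as follows. This is a minimal sketch, assuming the dataset has been downloaded as a CSV file; the function name, the file names in the usage note, and the report title are hypothetical, and the original `config.json` settings are not reproduced here.

```python
def build_report(csv_path: str, html_path: str) -> None:
    """Profile a CSV file and write the HTML report to html_path."""
    # Imported lazily so this module loads even without the optional packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Agricultural tax-exempt fuel violations")
    report.to_file(html_path)
```

A call such as `build_report("violations.csv", "report.html")` would write the HTML file; timings and memory figures will differ from the values listed above.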

Variables

위반유형 (violation type)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

농업용도 외 사용 (use for non-agricultural purposes): 346

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 농업용도 외 사용
2nd row: 농업용도 외 사용
3rd row: 농업용도 외 사용
4th row: 농업용도 외 사용
5th row: 농업용도 외 사용

Common Values

Value | Count | Frequency (%)
농업용도 외 사용 | 346 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Value | Count | Frequency (%)
농업용도 | 346 | 33.3%
외 | 346 | 33.3%
사용 | 346 | 33.3%
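
The 33.3% figures come from counting individual words rather than whole values. A small sketch of that count, assuming the words are obtained by splitting each value on whitespace (the report's actual tokenizer may differ):

```python
from collections import Counter

# The column is constant, so every one of the 346 rows holds the same string.
values = ["농업용도 외 사용"] * 346

# Count each whitespace-separated word across all rows.
word_counts = Counter(word for value in values for word in value.split())
total_words = sum(word_counts.values())
word_share = {w: round(100 * c / total_words, 1) for w, c in word_counts.items()}
print(word_counts, word_share)
```

Each of the three words appears once per row, so each accounts for one third of the 1,038 words.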

기종종 (equipment type)
Categorical

HIGH CORRELATION

Distinct: 29
Distinct (%): 8.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

온풍난방기-간접식(유류용): 100
동력경운기: 51
온풍난방기-직화식(유류용)-대포형: 36
농산물건조기(선반형): 28
농업용 트랙터: 23
Other values (24): 108

Length

Max length: 21
Median length: 17
Mean length: 11.419075
Min length: 3

Unique

Unique: 9
Unique (%): 2.6%

Sample

1st row: 온풍난방기-간접식(유류용)
2nd row: 동력경운기
3rd row: 예도형 동력예취기
4th row: 예도형 동력예취기
5th row: 온풍난방기-간접식(유류용)

Common Values

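Value [English gloss] | Count | Frequency (%)
온풍난방기-간접식(유류용) [hot-air heater, indirect type, oil-fired] | 100 | 28.9%
동력경운기 [power tiller] | 51 | 14.7%
온풍난방기-직화식(유류용)-대포형 [hot-air heater, direct-fired, oil-fired, cannon type] | 36 | 10.4%
농산물건조기(선반형) [agricultural product dryer, shelf type] | 28 | 8.1%
농업용 트랙터 [agricultural tractor] | 23 | 6.6%
관리기(보행형) [cultivator, walk-behind] | 22 | 6.4%
온수보일러(유류용) [hot-water boiler, oil-fired] | 20 | 5.8%
농산물건조기(시설형-유류용)_온풍난방기 [agricultural product dryer, facility type, oil-fired; hot-air heater] | 9 | 2.6%
동력이앙기(보행형) [power rice transplanter, walk-behind] | 7 | 2.0%
고속분무기(SS기) [speed sprayer] | 6 | 1.7%
Other values (19) | 44 | 12.7%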

Length

[Figure: histogram of lengths of the category]

Value | Count | Frequency (%)
온풍난방기-간접식(유류용 | 100 | 26.2%
동력경운기 | 51 | 13.4%
온풍난방기-직화식(유류용)-대포형 | 36 | 9.4%
농산물건조기(선반형 | 28 | 7.3%
농업용 | 23 | 6.0%
트랙터 | 23 | 6.0%
관리기(보행형 | 22 | 5.8%
온수보일러(유류용 | 20 | 5.2%
농산물건조기(시설형-유류용)_온풍난방기 | 9 | 2.4%
동력이앙기(보행형 | 7 | 1.8%
Other values (24) | 62 | 16.3%

유종 (fuel type)
Categorical

HIGH CORRELATION

Distinct: 4
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

경유 (diesel): 218
휘발유 (gasoline): 76
실내등유 (indoor kerosene): 48
중유 (heavy fuel oil): 4

Length

Max length: 4
Median length: 2
Mean length: 2.4971098
Min length: 2
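
These length statistics can be reproduced from the four value counts reported for 유종. A short sketch, assuming length is measured in characters (Python's `len` on a string):

```python
from statistics import median

# Value counts for 유종 as reported in this section.
counts = {"경유": 218, "휘발유": 76, "실내등유": 48, "중유": 4}

# One length entry per row of the column.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

mean_length = sum(lengths) / len(lengths)
median_length = median(lengths)
print(max(lengths), median_length, mean_length, min(lengths))
```

The mean works out to 864 / 346, which matches the reported 2.4971098.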

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경유
2nd row: 경유
3rd row: 휘발유
4th row: 휘발유
5th row: 경유

Common Values

Value | Count | Frequency (%)
경유 | 218 | 63.0%
휘발유 | 76 | 22.0%
실내등유 | 48 | 13.9%
중유 | 4 | 1.2%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

위반물량 (violation volume)
Real number (ℝ)

HIGH CORRELATION  ZEROS

Distinct: 175
Distinct (%): 50.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2525.9595
Minimum: 0
Maximum: 66498
Zeros: 53
Zeros (%): 15.3%
Negative: 0
Negative (%): 0.0%
Memory size: 3.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 51.5
Median: 457.5
Q3: 2655.75
95-th percentile: 11459
Maximum: 66498
Range: 66498
Interquartile range (IQR): 2604.25

Descriptive statistics

Standard deviation: 5654.7909
Coefficient of variation (CV): 2.2386704
Kurtosis: 53.897037
Mean: 2525.9595
Median Absolute Deviation (MAD): 457.5
Skewness: 5.9927611
Sum: 873982
Variance: 31976660
Monotonicity: Not monotonic
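
Several of these figures are derived from one another, which gives a quick consistency check. A minimal sketch using the reported values for 위반물량, relying on the standard definitions (mean = sum / n, CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 − Q1):

```python
n = 346                # number of observations
total = 873982         # Sum
std = 5654.7909        # Standard deviation
q1, q3 = 51.5, 2655.75

mean = total / n
cv = std / mean        # coefficient of variation
variance = std ** 2
iqr = q3 - q1
print(mean, cv, variance, iqr)
```

The results agree with the reported mean, CV, variance, and IQR up to the rounding of the displayed standard deviation.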
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0 | 53 | 15.3%
400 | 13 | 3.8%
1000 | 10 | 2.9%
360 | 8 | 2.3%
3000 | 7 | 2.0%
40 | 7 | 2.0%
200 | 7 | 2.0%
1500 | 6 | 1.7%
300 | 6 | 1.7%
2000 | 6 | 1.7%
Other values (165) | 223 | 64.5%

Minimum 10 values

Value | Count | Frequency (%)
0 | 53 | 15.3%
8 | 1 | 0.3%
10 | 3 | 0.9%
13 | 1 | 0.3%
15 | 1 | 0.3%
16 | 1 | 0.3%
18 | 1 | 0.3%
20 | 4 | 1.2%
24 | 1 | 0.3%
28 | 2 | 0.6%

Maximum 10 values

Value | Count | Frequency (%)
66498 | 1 | 0.3%
40000 | 1 | 0.3%
28000 | 1 | 0.3%
21000 | 1 | 0.3%
20000 | 1 | 0.3%
18538 | 1 | 0.3%
18298 | 1 | 0.3%
17214 | 1 | 0.3%
15900 | 1 | 0.3%
14754 | 1 | 0.3%

위반금액 (violation amount)
Real number (ℝ)

HIGH CORRELATION  ZEROS

Distinct: 247
Distinct (%): 71.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2743272.5
Minimum: 0
Maximum: 66498000
Zeros: 68
Zeros (%): 19.7%
Negative: 0
Negative (%): 0.0%
Memory size: 3.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 36348.75
Median: 443000
Q3: 3027000
95-th percentile: 12965975
Maximum: 66498000
Range: 66498000
Interquartile range (IQR): 2990651.2

Descriptive statistics

Standard deviation: 6063800.4
Coefficient of variation (CV): 2.2104258
Kurtosis: 42.163594
Mean: 2743272.5
Median Absolute Deviation (MAD): 443000
Skewness: 5.2976428
Sum: 9.491723 × 10^8
Variance: 3.6769675 × 10^13
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0 | 68 | 19.7%
405360 | 5 | 1.4%
1156500 | 4 | 1.2%
1030400 | 3 | 0.9%
225200 | 3 | 0.9%
248400 | 3 | 0.9%
3390000 | 3 | 0.9%
100000 | 2 | 0.6%
563000 | 2 | 0.6%
460000 | 2 | 0.6%
Other values (237) | 251 | 72.5%

Minimum 10 values

Value | Count | Frequency (%)
0 | 68 | 19.7%
850 | 1 | 0.3%
10000 | 1 | 0.3%
11500 | 1 | 0.3%
15000 | 1 | 0.3%
15310 | 1 | 0.3%
16000 | 1 | 0.3%
18900 | 1 | 0.3%
20000 | 1 | 0.3%
21000 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
66498000 | 1 | 0.3%
43320000 | 1 | 0.3%
29400000 | 1 | 0.3%
23520000 | 1 | 0.3%
21000000 | 1 | 0.3%
20932910 | 1 | 0.3%
19796100 | 1 | 0.3%
19650280 | 1 | 0.3%
19485000 | 1 | 0.3%
17490000 | 1 | 0.3%

회수물량 (recovered volume)
Real number (ℝ)

HIGH CORRELATION  ZEROS

Distinct: 135
Distinct (%): 39.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3086.6908
Minimum: 0
Maximum: 88000
Zeros: 169
Zeros (%): 48.8%
Negative: 0
Negative (%): 0.0%
Memory size: 3.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 12.5
Q3: 563
95-th percentile: 15693.5
Maximum: 88000
Range: 88000
Interquartile range (IQR): 563

Descriptive statistics

Standard deviation: 9668.0792
Coefficient of variation (CV): 3.1321827
Kurtosis: 35.385738
Mean: 3086.6908
Median Absolute Deviation (MAD): 12.5
Skewness: 5.4904126
Sum: 1067995
Variance: 93471756
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0 | 169 | 48.8%
360 | 13 | 3.8%
137 | 9 | 2.6%
563 | 6 | 1.7%
40 | 4 | 1.2%
68 | 3 | 0.9%
172 | 3 | 0.9%
15 | 2 | 0.6%
4180 | 2 | 0.6%
52 | 2 | 0.6%
Other values (125) | 133 | 38.4%

Minimum 10 values

Value | Count | Frequency (%)
0 | 169 | 48.8%
2 | 1 | 0.3%
6 | 1 | 0.3%
10 | 1 | 0.3%
11 | 1 | 0.3%
14 | 2 | 0.6%
15 | 2 | 0.6%
16 | 1 | 0.3%
20 | 1 | 0.3%
22 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
88000 | 1 | 0.3%
71031 | 1 | 0.3%
64758 | 1 | 0.3%
60441 | 1 | 0.3%
55998 | 1 | 0.3%
50501 | 1 | 0.3%
34577 | 1 | 0.3%
27000 | 1 | 0.3%
24000 | 2 | 0.6%
23398 | 1 | 0.3%

회수금액 (recovered amount)
Real number (ℝ)

HIGH CORRELATION  ZEROS

Distinct: 151
Distinct (%): 43.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3055559
Minimum: 0
Maximum: 92400000
Zeros: 181
Zeros (%): 52.3%
Negative: 0
Negative (%): 0.0%
Memory size: 3.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 515306.25
95-th percentile: 16517381
Maximum: 92400000
Range: 92400000
Interquartile range (IQR): 515306.25

Descriptive statistics

Standard deviation: 9819958.9
Coefficient of variation (CV): 3.2138011
Kurtosis: 38.117098
Mean: 3055559
Median Absolute Deviation (MAD): 0
Skewness: 5.6387239
Sum: 1.0572234 × 10^9
Variance: 9.6431594 × 10^13
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0 | 181 | 52.3%
405360 | 6 | 1.7%
413640 | 5 | 1.4%
147000 | 2 | 0.6%
173720 | 2 | 0.6%
138370 | 2 | 0.6%
646887 | 2 | 0.6%
1434975 | 2 | 0.6%
850 | 2 | 0.6%
86000 | 1 | 0.3%
Other values (141) | 141 | 40.8%

Minimum 10 values

Value | Count | Frequency (%)
0 | 181 | 52.3%
850 | 2 | 0.6%
11000 | 1 | 0.3%
14700 | 1 | 0.3%
15135 | 1 | 0.3%
15750 | 1 | 0.3%
16000 | 1 | 0.3%
20000 | 1 | 0.3%
22000 | 1 | 0.3%
25000 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
92400000 | 1 | 0.3%
72529200 | 1 | 0.3%
71233800 | 1 | 0.3%
58797900 | 1 | 0.3%
55551100 | 1 | 0.3%
38034700 | 1 | 0.3%
30000000 | 1 | 0.3%
28350000 | 1 | 0.3%
26795000 | 1 | 0.3%
26400000 | 1 | 0.3%

현장조치 (on-site action)
Unsupported

MISSING  REJECTED  UNSUPPORTED

Missing: 346
Missing (%): 100.0%
Memory size: 3.2 KiB

Interactions

[Figures: 16 pairwise interaction scatter plots of the four numeric variables (위반물량, 위반금액, 회수물량, 회수금액)]

Correlations

[Figure: correlation heatmap]

Variable | 기종종 | 유종 | 위반물량 | 위반금액 | 회수물량 | 회수금액
기종종 | 1.000 | 0.866 | 0.449 | 0.279 | 0.000 | 0.000
유종 | 0.866 | 1.000 | 0.104 | 0.155 | 0.570 | 0.735
위반물량 | 0.449 | 0.104 | 1.000 | 0.998 | 0.653 | 0.221
위반금액 | 0.279 | 0.155 | 0.998 | 1.000 | 0.651 | 0.175
회수물량 | 0.000 | 0.570 | 0.653 | 0.651 | 1.000 | 0.940
회수금액 | 0.000 | 0.735 | 0.221 | 0.175 | 0.940 | 1.000

[Figure: correlation heatmap]

Variable | 유종 | 기종종
유종 | 1.000 | 0.630
기종종 | 0.630 | 1.000

[Figure: correlation heatmap]

Variable | 위반물량 | 위반금액 | 회수물량 | 회수금액 | 기종종 | 유종
위반물량 | 1.000 | 0.931 | 0.067 | 0.071 | 0.196 | 0.071
위반금액 | 0.931 | 1.000 | 0.100 | 0.120 | 0.114 | 0.106
회수물량 | 0.067 | 0.100 | 1.000 | 0.937 | 0.000 | 0.400
회수금액 | 0.071 | 0.120 | 0.937 | 1.000 | 0.000 | 0.402
기종종 | 0.196 | 0.114 | 0.000 | 0.000 | 1.000 | 0.630
유종 | 0.071 | 0.106 | 0.400 | 0.402 | 0.630 | 1.000
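
The six "High correlation" alerts can be related to the first (6 × 6) matrix by looking up, for each variable, the other variable with the largest coefficient. This is only an observation about the numbers shown here, not a statement of the rule the profiling tool applies; the helper name `strongest_partner` is illustrative.

```python
names = ["기종종", "유종", "위반물량", "위반금액", "회수물량", "회수금액"]

# First correlation matrix from this section, one row per variable.
matrix = [
    [1.000, 0.866, 0.449, 0.279, 0.000, 0.000],
    [0.866, 1.000, 0.104, 0.155, 0.570, 0.735],
    [0.449, 0.104, 1.000, 0.998, 0.653, 0.221],
    [0.279, 0.155, 0.998, 1.000, 0.651, 0.175],
    [0.000, 0.570, 0.653, 0.651, 1.000, 0.940],
    [0.000, 0.735, 0.221, 0.175, 0.940, 1.000],
]


def strongest_partner(i: int) -> tuple[str, float]:
    """Return the off-diagonal variable with the largest coefficient in row i."""
    candidates = [(matrix[i][j], names[j]) for j in range(len(names)) if j != i]
    value, partner = max(candidates)
    return partner, value


partners = {names[i]: strongest_partner(i) for i in range(len(names))}
for name, (partner, value) in partners.items():
    print(f"{name} -> {partner} ({value:.3f})")
```

The resulting pairs (위반물량 with 위반금액, 회수물량 with 회수금액, 기종종 with 유종) are the same pairs named in the alerts.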

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

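Index | 위반유형 | 기종종 | 유종 | 위반물량 | 위반금액 | 회수물량 | 회수금액 | 현장조치
0 | 농업용도 외 사용 | 온풍난방기-간접식(유류용) | 경유 | 1585 | 1680100 | 0 | 0 | <NA>
1 | 농업용도 외 사용 | 동력경운기 | 경유 | 20 | 23000 | 0 | 0 | <NA>
2 | 농업용도 외 사용 | 예도형 동력예취기 | 휘발유 | 20 | 21000 | 0 | 0 | <NA>
3 | 농업용도 외 사용 | 예도형 동력예취기 | 휘발유 | 40 | 36000 | 0 | 0 | <NA>
4 | 농업용도 외 사용 | 온풍난방기-간접식(유류용) | 경유 | 1006 | 1156500 | 0 | 0 | <NA>
5 | 농업용도 외 사용 | 온풍난방기-간접식(유류용) | 경유 | 1150 | 1322500 | 0 | 0 | <NA>
6 | 농업용도 외 사용 | 온풍난방기-간접식(유류용) | 실내등유 | 896 | 1030400 | 0 | 0 | <NA>
7 | 농업용도 외 사용 | 온풍난방기-간접식(유류용) | 실내등유 | 896 | 1030400 | 0 | 0 | <NA>
8 | 농업용도 외 사용 | 온풍난방기-간접식(유류용) | 실내등유 | 896 | 1030400 | 0 | 0 | <NA>
9 | 농업용도 외 사용 | 온풍난방기-직화식(유류용)-대포형 | 경유 | 1006 | 1156500 | 0 | 0 | <NA>

Index | 위반유형 | 기종종 | 유종 | 위반물량 | 위반금액 | 회수물량 | 회수금액 | 현장조치
336 | 농업용도 외 사용 | 동력이앙기(보행형) | 휘발유 | 0 | 0 | 11 | 11000 | <NA>
337 | 농업용도 외 사용 | 동력이앙기(보행형) | 휘발유 | 18 | 15310 | 6 | 850 | <NA>
338 | 농업용도 외 사용 | 동력이앙기(보행형) | 휘발유 | 10 | 850 | 10 | 850 | <NA>
339 | 농업용도 외 사용 | 동력이앙기(승용형) | 휘발유 | 53 | 45100 | 53 | 45100 | <NA>
340 | 농업용도 외 사용 | 버섯재배소독기(살균솥-병/봉지) | 경유 | 5211 | 3277710 | 0 | 0 | <NA>
341 | 농업용도 외 사용 | 예도형 동력예취기 | 휘발유 | 0 | 0 | 20 | 20000 | <NA>
342 | 농업용도 외 사용 | 농업용 트랙터 | 경유 | 800 | 800000 | 0 | 0 | <NA>
343 | 농업용도 외 사용 | 온풍난방기-간접식(유류용) | 경유 | 10500 | 10500000 | 0 | 0 | <NA>
344 | 농업용도 외 사용 | 온풍난방기-간접식(유류용) | 경유 | 11000 | 11000000 | 0 | 0 | <NA>
345 | 농업용도 외 사용 | 온풍난방기-직화식(유류용)-대포형 | 경유 | 66498 | 66498000 | 0 | 0 | <NA>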

Duplicate rows

Most frequently occurring

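Index | 위반유형 | 기종종 | 유종 | 위반물량 | 위반금액 | 회수물량 | 회수금액 | # duplicates
2 | 농업용도 외 사용 | 농산물건조기(선반형) | 실내등유 | 360 | 405360 | 360 | 405360 | 5
4 | 농업용도 외 사용 | 동력경운기 | 휘발유 | 0 | 0 | 0 | 0 | 4
7 | 농업용도 외 사용 | 온풍난방기-간접식(유류용) | 실내등유 | 896 | 1030400 | 0 | 0 | 3
8 | 농업용도 외 사용 | 온풍난방기-직화식(유류용)-대포형 | 경유 | 216 | 248400 | 0 | 0 | 3
10 | 농업용도 외 사용 | 온풍난방기-직화식(유류용)-대포형 | 경유 | 1006 | 1156500 | 0 | 0 | 3
0 | 농업용도 외 사용 | 관리기(보행형) | 휘발유 | 40 | 37800 | 0 | 0 | 2
1 | 농업용도 외 사용 | 농산물건조기(선반형) | 실내등유 | 100 | 112600 | 360 | 413640 | 2
3 | 농업용도 외 사용 | 동력경운기 | 경유 | 0 | 0 | 172 | 173720 | 2
5 | 농업용도 외 사용 | 온풍난방기-간접식(유류용) | 경유 | 0 | 0 | 1200 | 0 | 2
6 | 농업용도 외 사용 | 온풍난방기-간접식(유류용) | 경유 | 6880 | 7451040 | 1325 | 1434975 | 2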
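
A "# duplicates" column of this kind can be obtained by grouping identical rows and keeping the groups that occur more than once. The sketch below shows the idea on a few toy rows shaped like the table above (기종종, 유종, and the four numeric columns); the rows are illustrative, not the full dataset, and the exact definition the report uses for its duplicate count is not stated in the source.

```python
from collections import Counter

# Toy rows: (기종종, 유종, 위반물량, 위반금액, 회수물량, 회수금액)
rows = [
    ("동력경운기", "휘발유", 0, 0, 0, 0),
    ("동력경운기", "휘발유", 0, 0, 0, 0),
    ("동력경운기", "경유", 0, 0, 172, 173720),
    ("농업용 트랙터", "경유", 800, 800000, 0, 0),
]

# Count how often each distinct row occurs, then keep the repeated ones.
counts = Counter(rows)
duplicate_groups = {row: n for row, n in counts.items() if n > 1}

for row, n in sorted(duplicate_groups.items(), key=lambda item: -item[1]):
    print(row, n)
```

Sorting by the count in descending order gives the "most frequently occurring" ordering used in the table.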