Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 553
Duplicate rows (%): 5.5%
Total size in memory: 644.5 KiB
Average record size in memory: 66.0 B

Variable types

Categorical: 4
Text: 2
Numeric: 1

Dataset

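Description: Allocation of tax-exempt fuel by agricultural cooperative (NongHyup) and farm machinery type, 2013-14 (2013~14년 농협별 농기계기종별 면세유 배정현황)
Author: Ministry of Agriculture, Food and Rural Affairs (농림축산식품부)
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220217000000002051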

Alerts

배정년도 has constant value "2013" (Constant)
Dataset has 553 (5.5%) duplicate rows (Duplicates)
지역본부 is highly overall correlated with 시지부 (High correlation)
시지부 is highly overall correlated with 지역본부 (High correlation)
배정량(L) has 525 (5.2%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-11 03:40:15.368494
Analysis finished: 2023-12-11 03:40:16.365468
Duration: 1 second
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

지역본부
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
강원지역본부: 5876
경기지역본부: 4124

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 강원지역본부
2nd row: 경기지역본부
3rd row: 경기지역본부
4th row: 강원지역본부
5th row: 강원지역본부

Common Values

Value | Count | Frequency (%)
강원지역본부 | 5876 | 58.8%
경기지역본부 | 4124 | 41.2%

Length

Histogram of lengths of the category

시지부
Categorical

HIGH CORRELATION 

Distinct: 50
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
춘천시농정지원단: 1033
홍천군농정지원단: 969
원주시농정지원단: 585
화성시농정지원단: 518
안성시농정지원단: 448
Other values (45): 6447

Length

Max length: 9
Median length: 8
Mean length: 7.977
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 홍천군농정지원단
2nd row: 김포시농정지원단
3rd row: 고양시농정지원단
4th row: 횡성군농정지원단
5th row: 춘천시농정지원단

Common Values

Value | Count | Frequency (%)
춘천시농정지원단 | 1033 | 10.3%
홍천군농정지원단 | 969 | 9.7%
원주시농정지원단 | 585 | 5.9%
화성시농정지원단 | 518 | 5.2%
안성시농정지원단 | 448 | 4.5%
횡성군농정지원단 | 401 | 4.0%
철원군농정지원단 | 396 | 4.0%
강릉시농정지원단 | 376 | 3.8%
이천시농정지원단 | 359 | 3.6%
용인시농정지원단 | 350 | 3.5%
Other values (40) | 4565 | 45.6%

Length

Histogram of lengths of the category

지역농협
Text

Distinct: 362
Distinct (%): 3.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 13
Median length: 12
Mean length: 7.3278
Min length: 4

Characters and Unicode

Total characters: 73278
Distinct characters: 207
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
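
As an illustration of the sentence above, the following minimal sketch tallies Unicode general categories with Python's standard unicodedata module. The three input strings are copied from the Sample rows of the two text variables. The helper name category_counts is made up for this example, and this is not necessarily how the report itself derives its figures.

```python
import unicodedata
from collections import Counter

# Sample values copied from the report's Sample rows (two text variables).
values = ["서홍천농협", "신김포농협 하성지점", "<<농산물건조기(일반)>>"]


def category_counts(strings):
    """Tally characters by Unicode general category code (e.g. 'Lo', 'Zs')."""
    counts = Counter()
    for text in strings:
        for ch in text:
            counts[unicodedata.category(ch)] += 1
    return counts


counts = category_counts(values)
for code, n in counts.most_common():
    print(code, n)
```

The codes Lo, Zs, Sm, Ps and Pe correspond to the categories the report lists as Other Letter, Space Separator, Math Symbol, Open Punctuation and Close Punctuation.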

Unique

Unique: 9
Unique (%): 0.1%

Sample

1st row: 서홍천농협
2nd row: 신김포농협 하성지점
3rd row: 일산농협
4th row: 동횡성농협 갑천지점
5th row: 남산농협 경제사업소

Words

Value | Count | Frequency (%)
경제사업소 | 861 | 5.5%
남산농협 | 622 | 3.9%
서홍천농협 | 470 | 3.0%
서안성농협 | 267 | 1.7%
공도중앙지점 | 251 | 1.6%
원삼농협 | 242 | 1.5%
원일지점 | 242 | 1.5%
원주농협 | 239 | 1.5%
강릉농협 | 226 | 1.4%
일동농협 | 223 | 1.4%
Other values (386) | 12116 | 76.9%
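
The counts in the table above sum to 15759, which equals the 10000 rows plus the 5759 space characters reported in the character statistics for this variable. That is consistent with the table counting whitespace-separated tokens rather than whole values. A minimal sketch of that kind of count, using the five Sample rows as input and assuming plain whitespace splitting (the report's actual tokenizer may differ):

```python
from collections import Counter

# The five Sample rows of 지역농협, copied from the report.
rows = [
    "서홍천농협",
    "신김포농협 하성지점",
    "일산농협",
    "동횡성농협 갑천지점",
    "남산농협 경제사업소",
]

# Split every value on whitespace and count the resulting tokens.
words = Counter(token for row in rows for token in row.split())
total_tokens = sum(words.values())

# With single spaces between tokens, tokens = rows + spaces.
total_spaces = sum(row.count(" ") for row in rows)
print(total_tokens, len(rows) + total_spaces)
```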

Most occurring characters

ValueCountFrequency (%)
10089
 
13.8%
10000
 
13.6%
5759
 
7.9%
3932
 
5.4%
3703
 
5.1%
1444
 
2.0%
1423
 
1.9%
1418
 
1.9%
1377
 
1.9%
1247
 
1.7%
Other values (197) 32886
44.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 66963 | 91.4%
Space Separator | 5759 | 7.9%
Math Symbol | 556 | 0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10089
 
15.1%
10000
 
14.9%
3932
 
5.9%
3703
 
5.5%
1444
 
2.2%
1423
 
2.1%
1418
 
2.1%
1377
 
2.1%
1247
 
1.9%
1247
 
1.9%
Other values (194) 31083
46.4%
Math Symbol
Value | Count | Frequency (%)
> | 278 | 50.0%
< | 278 | 50.0%
Space Separator
Value | Count | Frequency (%)
(space) | 5759 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 66963 | 91.4%
Common | 6315 | 8.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10089
 
15.1%
10000
 
14.9%
3932
 
5.9%
3703
 
5.5%
1444
 
2.2%
1423
 
2.1%
1418
 
2.1%
1377
 
2.1%
1247
 
1.9%
1247
 
1.9%
Other values (194) 31083
46.4%
Common
Value | Count | Frequency (%)
(space) | 5759 | 91.2%
> | 278 | 4.4%
< | 278 | 4.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 66963 | 91.4%
ASCII | 6315 | 8.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10089
 
15.1%
10000
 
14.9%
3932
 
5.9%
3703
 
5.5%
1444
 
2.2%
1423
 
2.1%
1418
 
2.1%
1377
 
2.1%
1247
 
1.9%
1247
 
1.9%
Other values (194) 31083
46.4%
ASCII
Value | Count | Frequency (%)
(space) | 5759 | 91.2%
> | 278 | 4.4%
< | 278 | 4.4%

배정년도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
2013: 10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2013
2nd row: 2013
3rd row: 2013
4th row: 2013
5th row: 2013

Common Values

Value | Count | Frequency (%)
2013 | 10000 | 100.0%

Length

Histogram of lengths of the category

유종명
Categorical

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
경유: 4550
휘발유: 3777
실내등유: 1523
가스(차량): 112
중유: 20
Other values (2): 18

Length

Max length: 6
Median length: 5
Mean length: 2.7334
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경유
2nd row: 경유
3rd row: 경유
4th row: 경유
5th row: 경유

Common Values

Value | Count | Frequency (%)
경유 | 4550 | 45.5%
휘발유 | 3777 | 37.8%
실내등유 | 1523 | 15.2%
가스(차량) | 112 | 1.1%
중유 | 20 | 0.2%
보일러등유 | 9 | 0.1%
가스(난방) | 9 | 0.1%

Length

Histogram of lengths of the category

농기계기종명
Text

Distinct: 76
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 21
Median length: 18
Mean length: 9.8369
Min length: 3

Characters and Unicode

Total characters: 98369
Distinct characters: 149
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 7
Unique (%): 0.1%

Sample

1st row: <<농산물건조기(일반)>>
2nd row: 콤바인(자탈형-4조)
3rd row: 동력경운기
4th row: 농업용 트랙터
5th row: 동력경운기

Words

Value | Count | Frequency (%)
온풍난방기 | 1019 | 8.8%
동력경운기 | 988 | 8.6%
농업용 | 777 | 6.7%
트랙터 | 777 | 6.7%
휴대형)동력예취기 | 571 | 4.9%
관리기 | 447 | 3.9%
온풍난방기-직화식(유류용)-대포형 | 377 | 3.3%
농산물건조기(일반 | 368 | 3.2%
화물자동차(1톤이하)-농기계 | 366 | 3.2%
화물자동차(1톤이하 | 366 | 3.2%
Other values (74) | 5491 | 47.6%

Most occurring characters

ValueCountFrequency (%)
8613
 
8.8%
< 5924
 
6.0%
> 5924
 
6.0%
) 4441
 
4.5%
( 4441
 
4.5%
4044
 
4.1%
3311
 
3.4%
2456
 
2.5%
2269
 
2.3%
2229
 
2.3%
Other values (139) 54717
55.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 72323 | 73.5%
Math Symbol | 11848 | 12.0%
Close Punctuation | 4441 | 4.5%
Open Punctuation | 4441 | 4.5%
Dash Punctuation | 2186 | 2.2%
Space Separator | 1547 | 1.6%
Decimal Number | 1204 | 1.2%
Uppercase Letter | 276 | 0.3%
Other Punctuation | 93 | 0.1%
Connector Punctuation | 10 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8613
 
11.9%
4044
 
5.6%
3311
 
4.6%
2456
 
3.4%
2269
 
3.1%
2229
 
3.1%
2188
 
3.0%
2076
 
2.9%
1725
 
2.4%
1634
 
2.3%
Other values (125) 41778
57.8%
Decimal Number
Value | Count | Frequency (%)
1 | 763 | 63.4%
4 | 206 | 17.1%
3 | 91 | 7.6%
5 | 90 | 7.5%
2 | 54 | 4.5%
Math Symbol
Value | Count | Frequency (%)
< | 5924 | 50.0%
> | 5924 | 50.0%
Close Punctuation
Value | Count | Frequency (%)
) | 4441 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 4441 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 2186 | 100.0%
Space Separator
Value | Count | Frequency (%)
(space) | 1547 | 100.0%
Uppercase Letter
Value | Count | Frequency (%)
S | 276 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
/ | 93 | 100.0%
Connector Punctuation
Value | Count | Frequency (%)
_ | 10 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 72323 | 73.5%
Common | 25770 | 26.2%
Latin | 276 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8613
 
11.9%
4044
 
5.6%
3311
 
4.6%
2456
 
3.4%
2269
 
3.1%
2229
 
3.1%
2188
 
3.0%
2076
 
2.9%
1725
 
2.4%
1634
 
2.3%
Other values (125) 41778
57.8%
Common
Value | Count | Frequency (%)
< | 5924 | 23.0%
> | 5924 | 23.0%
) | 4441 | 17.2%
( | 4441 | 17.2%
- | 2186 | 8.5%
(space) | 1547 | 6.0%
1 | 763 | 3.0%
4 | 206 | 0.8%
/ | 93 | 0.4%
3 | 91 | 0.4%
Other values (3) | 154 | 0.6%
Latin
Value | Count | Frequency (%)
S | 276 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 72323 | 73.5%
ASCII | 26046 | 26.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8613
 
11.9%
4044
 
5.6%
3311
 
4.6%
2456
 
3.4%
2269
 
3.1%
2229
 
3.1%
2188
 
3.0%
2076
 
2.9%
1725
 
2.4%
1634
 
2.3%
Other values (125) 41778
57.8%
ASCII
Value | Count | Frequency (%)
< | 5924 | 22.7%
> | 5924 | 22.7%
) | 4441 | 17.1%
( | 4441 | 17.1%
- | 2186 | 8.4%
(space) | 1547 | 5.9%
1 | 763 | 2.9%
S | 276 | 1.1%
4 | 206 | 0.8%
/ | 93 | 0.4%
Other values (4) | 245 | 0.9%

배정량(L)
Real number (ℝ)

ZEROS 

Distinct: 3584
Distinct (%): 35.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6198.9376
Minimum: 0
Maximum: 1280183
Zeros: 525
Zeros (%): 5.2%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 58
median: 422
Q3: 2435
95-th percentile: 28969.6
Maximum: 1280183
Range: 1280183
Interquartile range (IQR): 2377

Descriptive statistics

Standard deviation: 28595.451
Coefficient of variation (CV): 4.61296
Kurtosis: 554.12018
Mean: 6198.9376
Median Absolute Deviation (MAD): 407
Skewness: 18.004098
Sum: 61989376
Variance: 8.1769984 × 10^8
Monotonicity: Not monotonic
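
Several of the derived figures above follow directly from the primary ones. The sketch below only redoes the arithmetic with the rounded values printed in the report, so small differences in the last digits are expected. The variable names are chosen for this example.

```python
# Values copied from the report for 배정량(L).
mean = 6198.9376
std = 28595.451
q1, q3 = 58, 2435
minimum, maximum = 0, 1280183
n_rows = 10000

cv = std / mean                  # coefficient of variation = std / mean
iqr = q3 - q1                    # interquartile range = Q3 - Q1
value_range = maximum - minimum  # range = max - min
variance = std ** 2              # variance = std squared
total = mean * n_rows            # sum = mean * number of observations

print(f"CV={cv:.5f} IQR={iqr} range={value_range} "
      f"variance={variance:.4e} sum={total:.0f}")
```

A coefficient of variation well above 1, together with a mean (6198.9) far above the median (422), reflects the strong right skew reported for this variable.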
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
0 | 525 | 5.2%
137 | 242 | 2.4%
172 | 235 | 2.4%
25 | 202 | 2.0%
379 | 200 | 2.0%
43 | 192 | 1.9%
14 | 183 | 1.8%
18 | 92 | 0.9%
11 | 85 | 0.9%
28 | 83 | 0.8%
Other values (3574) | 7961 | 79.6%

Minimum 10 values

Value | Count | Frequency (%)
0 | 525 | 5.2%
1 | 15 | 0.1%
2 | 16 | 0.2%
3 | 15 | 0.1%
4 | 16 | 0.2%
5 | 8 | 0.1%
6 | 7 | 0.1%
7 | 30 | 0.3%
8 | 9 | 0.1%
9 | 29 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
1280183 | 1 | < 0.1%
769482 | 1 | < 0.1%
711710 | 1 | < 0.1%
549398 | 1 | < 0.1%
548478 | 1 | < 0.1%
532463 | 1 | < 0.1%
486144 | 1 | < 0.1%
416223 | 1 | < 0.1%
377438 | 1 | < 0.1%
371066 | 1 | < 0.1%

Interactions


Correlations

 | 지역본부 | 시지부 | 유종명 | 농기계기종명 | 배정량(L)
지역본부 | 1.000 | 1.000 | 0.048 | 0.265 | 0.028
시지부 | 1.000 | 1.000 | 0.286 | 0.401 | 0.000
유종명 | 0.048 | 0.286 | 1.000 | 0.842 | 0.057
농기계기종명 | 0.265 | 0.401 | 0.842 | 1.000 | 0.000
배정량(L) | 0.028 | 0.000 | 0.057 | 0.000 | 1.000

 | 지역본부 | 유종명 | 시지부
지역본부 | 1.000 | 0.052 | 0.998
유종명 | 0.052 | 1.000 | 0.112
시지부 | 0.998 | 0.112 | 1.000

 | 배정량(L) | 지역본부 | 시지부 | 유종명
배정량(L) | 1.000 | 0.021 | 0.000 | 0.030
지역본부 | 0.021 | 1.000 | 0.998 | 0.052
시지부 | 0.000 | 0.998 | 1.000 | 0.112
유종명 | 0.030 | 0.052 | 0.112 | 1.000
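
The extracted text does not say which association measure each matrix uses. One common measure for two categorical columns is Cramér's V. The following minimal sketch, assuming the plain (uncorrected) formula and a small toy subset of 지역본부/시지부 pairs taken from the Sample rows, shows why a column that is fully determined by another yields a value of 1. The function name cramers_v is made up for this example, and the report's own computation may differ (for instance by applying a bias correction).

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    x_totals = Counter(xs)
    y_totals = Counter(ys)
    chi2 = 0.0
    for x, nx in x_totals.items():
        for y, ny in y_totals.items():
            expected = nx * ny / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_totals), len(y_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else float("nan")


# Toy subset (illustrative only): each 시지부 belongs to exactly one 지역본부.
region = ["강원지역본부"] * 3 + ["경기지역본부"] * 3
branch = ["춘천시농정지원단", "춘천시농정지원단", "홍천군농정지원단",
          "김포시농정지원단", "김포시농정지원단", "고양시농정지원단"]

v_nested = cramers_v(region, branch)

# Two unrelated columns for contrast.
v_independent = cramers_v(["A", "A", "B", "B"], ["x", "y", "x", "y"])
print(v_nested, v_independent)
```

Because every 시지부 maps to a single 지역본부, knowing the branch fixes the regional headquarters, which matches the High correlation alerts for this pair.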

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 지역본부 | 시지부 | 지역농협 | 배정년도 | 유종명 | 농기계기종명 | 배정량(L)
11438 | 강원지역본부 | 홍천군농정지원단 | 서홍천농협 | 2013 | 경유 | <<농산물건조기(일반)>> | 577
16425 | 경기지역본부 | 김포시농정지원단 | 신김포농협 하성지점 | 2013 | 경유 | 콤바인(자탈형-4조) | 18979
12895 | 경기지역본부 | 고양시농정지원단 | 일산농협 | 2013 | 경유 | 동력경운기 | 137
5928 | 강원지역본부 | 횡성군농정지원단 | 동횡성농협 갑천지점 | 2013 | 경유 | 농업용 트랙터 | 2884
7481 | 강원지역본부 | 춘천시농정지원단 | 남산농협 경제사업소 | 2013 | 경유 | 동력경운기 | 171
4812 | 강원지역본부 | 원주시농정지원단 | 문막농협 부론지점 | 2013 | 경유 | 농업용 트랙터 | 39211
11040 | 강원지역본부 | 홍천군농정지원단 | 서홍천농협 | 2013 | 휘발유 | <<관리기>> | 43
3795 | 강원지역본부 | 강릉시농정지원단 | 주문진농협 | 2013 | 휘발유 | 병충해방제기-동력살분무기(살분기) | 331
13429 | 경기지역본부 | 화성시농정지원단 | 비봉농협 경제사업장 | 2013 | 휘발유 | 예도형 동력예취기 | 18
5864 | 강원지역본부 | 삼척시농정지원단 | 원덕농협 | 2013 | 휘발유 | 동력이앙기(승용형) | 1839

Index | 지역본부 | 시지부 | 지역농협 | 배정년도 | 유종명 | 농기계기종명 | 배정량(L)
11028 | 강원지역본부 | 홍천군농정지원단 | 내면농협 | 2013 | 경유 | 화물자동차(1톤이하) | 4386
19599 | 경기지역본부 | 가평군농정지원단 | 가평군농협 상면지점 | 2013 | 실내등유 | 곡물건조기(순환식) | 2900
16352 | 경기지역본부 | 이천시농정지원단 | 신둔농협 도암지점 | 2013 | 경유 | 동력경운기 | 137
16190 | 경기지역본부 | 광주시농정지원단 | 광주농협 | 2013 | 휘발유 | <<관리기>> | 52
19805 | 경기지역본부 | 포천시농정지원단 | 일동농협 경제사업소 | 2013 | 경유 | 동력경운기 | 137
13692 | 경기지역본부 | 포천시농정지원단 | 일동농협 경제사업소 | 2013 | 휘발유 | <<동력이앙기>> | 7
17123 | 경기지역본부 | 안성시농정지원단 | 일죽농협 | 2013 | 실내등유 | <<온풍난방기>> | 3288
18168 | 경기지역본부 | 파주시농정지원단 | 광탄농협 주유소 | 2013 | 경유 | 농업용 트랙터 | 559
19528 | 경기지역본부 | 안산시농정지원단 | 군자농협 대부경제사업소 | 2013 | 휘발유 | 예도형 동력예취기 | 36
11306 | 강원지역본부 | 고성군농정지원단 | 거진농협 | 2013 | 실내등유 | <<농산물건조기(일반)>> | 563

Duplicate rows

Most frequently occurring

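Index | 지역본부 | 시지부 | 지역농협 | 배정년도 | 유종명 | 농기계기종명 | 배정량(L) | # duplicates
110 | 강원지역본부 | 춘천시농정지원단 | 남산농협 경제사업소 | 2013 | 경유 | 동력경운기 | 172 | 74
320 | 경기지역본부 | 안산시농정지원단 | 군자농협 대부경제사업소 | 2013 | 경유 | 동력경운기 | 137 | 51
116 | 강원지역본부 | 춘천시농정지원단 | 남산농협 경제사업소 | 2013 | 경유 | 화물자동차(1톤이하) | 379 | 41
119 | 강원지역본부 | 춘천시농정지원단 | 남산농협 경제사업소 | 2013 | 경유 | 화물자동차(1톤이하)-농기계 | 379 | 41
109 | 강원지역본부 | 춘천시농정지원단 | 남산농협 경제사업소 | 2013 | 경유 | 동력경운기 | 171 | 35
488 | 경기지역본부 | 포천시농정지원단 | 일동농협 경제사업소 | 2013 | 휘발유 | (휴대형)동력예취기 | 25 | 35
485 | 경기지역본부 | 포천시농정지원단 | 일동농협 경제사업소 | 2013 | 경유 | 동력경운기 | 172 | 33
201 | 강원지역본부 | 홍천군농정지원단 | 서홍천농협 | 2013 | 경유 | 동력경운기 | 0 | 29
364 | 경기지역본부 | 안성시농정지원단 | 서안성농협 공도중앙지점 | 2013 | 휘발유 | 예도형 동력예취기 | 18 | 28
326 | 경기지역본부 | 안산시농정지원단 | 군자농협 대부경제사업소 | 2013 | 휘발유 | <<관리기>> | 43 | 27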
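
A table like the one above can be produced by grouping identical rows and counting how often each occurs. The following is a minimal sketch with toy rows shaped like the seven columns of this dataset. The row contents are illustrative, and whether the report's "# duplicates" column counts all occurrences or only the extra copies is not stated in the extracted text, so the sketch simply reports total occurrences.

```python
from collections import Counter

# Toy rows (illustrative only), one tuple per record with the seven columns.
row_a = ("강원지역본부", "춘천시농정지원단", "남산농협 경제사업소",
         "2013", "경유", "동력경운기", 172)
row_b = ("경기지역본부", "안산시농정지원단", "군자농협 대부경제사업소",
         "2013", "경유", "동력경운기", 137)
row_c = ("강원지역본부", "홍천군농정지원단", "서홍천농협",
         "2013", "경유", "<<농산물건조기(일반)>>", 577)
rows = [row_a, row_b, row_a, row_c, row_a, row_b]

# Count occurrences of each distinct row and keep those seen more than once.
occurrences = Counter(rows)
duplicated = {row: count for row, count in occurrences.items() if count > 1}

# Sort the duplicated groups from most to least frequent.
most_frequent = sorted(duplicated.items(), key=lambda item: item[1], reverse=True)
for row, count in most_frequent:
    print(count, row)
```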