Overview

Dataset statistics

Number of variables10
Number of observations3445
Missing cells6891
Missing cells (%)20.0%
Duplicate rows2
Duplicate rows (%)0.1%
Total size in memory286.1 KiB
Average record size in memory85.0 B

Variable types

Categorical4
Text2
Numeric2
Unsupported2

Dataset

DescriptionFTA 피해보전직불제 신청현황 정보 제공
Author농림축산식품부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20211220000000001652

Alerts

사업연도 has constant value ""Constant
분야 has constant value ""Constant
Dataset has 2 (0.1%) duplicate rowsDuplicates
농가수 is highly overall correlated with 신청(마리수)High correlation
신청(마리수) is highly overall correlated with 농가수High correlation
신청(㎡) has 3445 (100.0%) missing valuesMissing
신청(m) has 3445 (100.0%) missing valuesMissing
신청(㎡) is an unsupported type, check if it needs cleaning or further analysisUnsupported
신청(m) is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-11 03:33:16.795630
Analysis finished2023-12-11 03:33:18.118318
Duration1.32 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업연도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size27.0 KiB
2013
3445 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2013
2nd row2013
3rd row2013
4th row2013
5th row2013

Common Values

ValueCountFrequency (%)
2013 3445
100.0%

Length

2023-12-11T12:33:18.188932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:33:18.296341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2013 3445
100.0%

시도
Categorical

Distinct16
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size27.0 KiB
경상북도
563 
전라남도
498 
경상남도
466 
경기도
399 
충청남도
363 
Other values (11)
1156 

Length

Max length7
Median length4
Mean length3.9114659
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
경상북도 563
16.3%
전라남도 498
14.5%
경상남도 466
13.5%
경기도 399
11.6%
충청남도 363
10.5%
전라북도 358
10.4%
강원도 302
8.8%
충청북도 246
7.1%
제주특별자치도 45
 
1.3%
대구광역시 43
 
1.2%
Other values (6) 162
 
4.7%

Length

2023-12-11T12:33:18.414803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경상북도 563
16.3%
전라남도 498
14.5%
경상남도 466
13.5%
경기도 399
11.6%
충청남도 363
10.5%
전라북도 358
10.4%
강원도 302
8.8%
충청북도 246
7.1%
제주특별자치도 45
 
1.3%
대구광역시 43
 
1.2%
Other values (6) 162
 
4.7%
Distinct168
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size27.0 KiB
2023-12-11T12:33:18.765124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.9939042
Min length2

Characters and Unicode

Total characters10314
Distinct characters120
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row금정구
2nd row금정구
3rd row강서구
4th row강서구
5th row강서구
ValueCountFrequency (%)
청주시 46
 
1.3%
상주시 46
 
1.3%
진주시 40
 
1.2%
남원시 40
 
1.2%
경주시 39
 
1.1%
익산시 39
 
1.1%
정읍시 39
 
1.1%
안동시 38
 
1.1%
김천시 38
 
1.1%
창원시 38
 
1.1%
Other values (158) 3042
88.3%
2023-12-11T12:33:19.275466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1766
 
17.1%
1624
 
15.7%
544
 
5.3%
430
 
4.2%
313
 
3.0%
301
 
2.9%
276
 
2.7%
228
 
2.2%
196
 
1.9%
160
 
1.6%
Other values (110) 4476
43.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10314
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1766
 
17.1%
1624
 
15.7%
544
 
5.3%
430
 
4.2%
313
 
3.0%
301
 
2.9%
276
 
2.7%
228
 
2.2%
196
 
1.9%
160
 
1.6%
Other values (110) 4476
43.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10314
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1766
 
17.1%
1624
 
15.7%
544
 
5.3%
430
 
4.2%
313
 
3.0%
301
 
2.9%
276
 
2.7%
228
 
2.2%
196
 
1.9%
160
 
1.6%
Other values (110) 4476
43.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10314
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1766
 
17.1%
1624
 
15.7%
544
 
5.3%
430
 
4.2%
313
 
3.0%
301
 
2.9%
276
 
2.7%
228
 
2.2%
196
 
1.9%
160
 
1.6%
Other values (110) 4476
43.4%
Distinct1594
Distinct (%)46.3%
Missing0
Missing (%)0.0%
Memory size27.0 KiB
2023-12-11T12:33:19.596632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.053701
Min length2

Characters and Unicode

Total characters10520
Distinct characters303
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique112 ?
Unique (%)3.3%

Sample

1st row금성동
2nd row금성동
3rd row대저2동
4th row대저2동
5th row강동동
ValueCountFrequency (%)
남면 24
 
0.7%
서면 20
 
0.6%
북면 15
 
0.4%
동면 10
 
0.3%
금성면 10
 
0.3%
봉산면 8
 
0.2%
산내면 8
 
0.2%
옥산면 8
 
0.2%
삼산면 8
 
0.2%
대산면 8
 
0.2%
Other values (1584) 3326
96.5%
2023-12-11T12:33:20.005033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2295
21.8%
905
 
8.6%
440
 
4.2%
331
 
3.1%
184
 
1.7%
184
 
1.7%
180
 
1.7%
144
 
1.4%
135
 
1.3%
126
 
1.2%
Other values (293) 5596
53.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10372
98.6%
Decimal Number 144
 
1.4%
Other Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2295
22.1%
905
 
8.7%
440
 
4.2%
331
 
3.2%
184
 
1.8%
184
 
1.8%
180
 
1.7%
144
 
1.4%
135
 
1.3%
126
 
1.2%
Other values (286) 5448
52.5%
Decimal Number
ValueCountFrequency (%)
1 58
40.3%
2 49
34.0%
3 23
 
16.0%
4 9
 
6.2%
5 3
 
2.1%
6 2
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10372
98.6%
Common 148
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2295
22.1%
905
 
8.7%
440
 
4.2%
331
 
3.2%
184
 
1.8%
184
 
1.8%
180
 
1.7%
144
 
1.4%
135
 
1.3%
126
 
1.2%
Other values (286) 5448
52.5%
Common
ValueCountFrequency (%)
1 58
39.2%
2 49
33.1%
3 23
 
15.5%
4 9
 
6.1%
. 4
 
2.7%
5 3
 
2.0%
6 2
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10372
98.6%
ASCII 148
 
1.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2295
22.1%
905
 
8.7%
440
 
4.2%
331
 
3.2%
184
 
1.8%
184
 
1.8%
180
 
1.7%
144
 
1.4%
135
 
1.3%
126
 
1.2%
Other values (286) 5448
52.5%
ASCII
ValueCountFrequency (%)
1 58
39.2%
2 49
33.1%
3 23
 
15.5%
4 9
 
6.1%
. 4
 
2.7%
5 3
 
2.0%
6 2
 
1.4%

분야
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size27.0 KiB
축산
3445 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row축산
2nd row축산
3rd row축산
4th row축산
5th row축산

Common Values

ValueCountFrequency (%)
축산 3445
100.0%

Length

2023-12-11T12:33:20.132801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:33:20.219214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
축산 3445
100.0%

품목
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size27.0 KiB
한우
1758 
송아지
1687 

Length

Max length3
Median length2
Mean length2.4896952
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한우
2nd row송아지
3rd row한우
4th row송아지
5th row한우

Common Values

ValueCountFrequency (%)
한우 1758
51.0%
송아지 1687
49.0%

Length

2023-12-11T12:33:20.312254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:33:20.417655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한우 1758
51.0%
송아지 1687
49.0%

농가수
Real number (ℝ)

HIGH CORRELATION 

Distinct267
Distinct (%)7.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54.485051
Minimum1
Maximum458
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.4 KiB
2023-12-11T12:33:20.596021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q113
median38
Q376
95-th percentile167.6
Maximum458
Range457
Interquartile range (IQR)63

Descriptive statistics

Standard deviation56.842699
Coefficient of variation (CV)1.0432715
Kurtosis5.2895416
Mean54.485051
Median Absolute Deviation (MAD)29
Skewness1.9406243
Sum187701
Variance3231.0925
MonotonicityNot monotonic
2023-12-11T12:33:20.727472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 191
 
5.5%
2 104
 
3.0%
3 88
 
2.6%
4 80
 
2.3%
7 72
 
2.1%
5 55
 
1.6%
13 48
 
1.4%
6 46
 
1.3%
8 45
 
1.3%
10 44
 
1.3%
Other values (257) 2672
77.6%
ValueCountFrequency (%)
1 191
5.5%
2 104
3.0%
3 88
2.6%
4 80
2.3%
5 55
 
1.6%
6 46
 
1.3%
7 72
 
2.1%
8 45
 
1.3%
9 41
 
1.2%
10 44
 
1.3%
ValueCountFrequency (%)
458 1
< 0.1%
426 1
< 0.1%
369 2
0.1%
368 1
< 0.1%
365 1
< 0.1%
362 1
< 0.1%
345 1
< 0.1%
343 1
< 0.1%
338 1
< 0.1%
333 1
< 0.1%

신청(㎡)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3445
Missing (%)100.0%
Memory size30.4 KiB

신청(마리수)
Real number (ℝ)

HIGH CORRELATION 

Distinct848
Distinct (%)24.6%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean263.55226
Minimum1
Maximum3924
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size30.4 KiB
2023-12-11T12:33:21.142418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q151
median162
Q3344
95-th percentile879
Maximum3924
Range3923
Interquartile range (IQR)293

Descriptive statistics

Standard deviation330.91059
Coefficient of variation (CV)1.2555786
Kurtosis16.399693
Mean263.55226
Median Absolute Deviation (MAD)127
Skewness3.140687
Sum907674
Variance109501.82
MonotonicityNot monotonic
2023-12-11T12:33:21.338921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 52
 
1.5%
3 48
 
1.4%
2 42
 
1.2%
4 39
 
1.1%
9 29
 
0.8%
6 26
 
0.8%
5 25
 
0.7%
7 24
 
0.7%
11 24
 
0.7%
15 23
 
0.7%
Other values (838) 3112
90.3%
ValueCountFrequency (%)
1 52
1.5%
2 42
1.2%
3 48
1.4%
4 39
1.1%
5 25
0.7%
6 26
0.8%
7 24
0.7%
8 21
0.6%
9 29
0.8%
10 22
0.6%
ValueCountFrequency (%)
3924 1
< 0.1%
3593 1
< 0.1%
3082 1
< 0.1%
2653 1
< 0.1%
2523 1
< 0.1%
2501 1
< 0.1%
2450 1
< 0.1%
2440 1
< 0.1%
2320 1
< 0.1%
2202 1
< 0.1%

신청(m)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3445
Missing (%)100.0%
Memory size30.4 KiB

Interactions

2023-12-11T12:33:17.512213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:17.263834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:17.633919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:33:17.375701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T12:33:21.441291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도품목농가수신청(마리수)
시도1.0000.0000.2620.156
품목0.0001.0000.0220.249
농가수0.2620.0221.0000.683
신청(마리수)0.1560.2490.6831.000
2023-12-11T12:33:21.552865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목시도
품목1.0000.000
시도0.0001.000
2023-12-11T12:33:21.640695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
농가수신청(마리수)시도품목
농가수1.0000.9200.1060.017
신청(마리수)0.9201.0000.0650.248
시도0.1060.0651.0000.000
품목0.0170.2480.0001.000

Missing values

2023-12-11T12:33:17.846112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:33:18.050673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업연도시도시군구읍면동분야품목농가수신청(㎡)신청(마리수)신청(m)
02013부산광역시금정구금성동축산한우1<NA>1<NA>
12013부산광역시금정구금성동축산송아지6<NA>24<NA>
22013부산광역시강서구대저2동축산한우2<NA>11<NA>
32013부산광역시강서구대저2동축산송아지3<NA>20<NA>
42013부산광역시강서구강동동축산한우5<NA>17<NA>
52013부산광역시강서구강동동축산송아지8<NA>24<NA>
62013부산광역시강서구가락동축산한우4<NA>11<NA>
72013부산광역시강서구가락동축산송아지2<NA>3<NA>
82013부산광역시강서구가덕도동축산한우1<NA>2<NA>
92013부산광역시기장군기장읍축산한우3<NA>12<NA>
사업연도시도시군구읍면동분야품목농가수신청(㎡)신청(마리수)신청(m)
34352013세종특별자치시세종시장군면축산한우140<NA>1213<NA>
34362013세종특별자치시세종시장군면축산송아지103<NA>524<NA>
34372013세종특별자치시세종시연서면축산한우110<NA>1368<NA>
34382013세종특별자치시세종시연서면축산송아지74<NA>366<NA>
34392013세종특별자치시세종시전의면축산한우60<NA>409<NA>
34402013세종특별자치시세종시전의면축산송아지51<NA>233<NA>
34412013세종특별자치시세종시전동면축산한우102<NA>1207<NA>
34422013세종특별자치시세종시전동면축산송아지70<NA>328<NA>
34432013세종특별자치시세종시소정면축산한우5<NA>25<NA>
34442013세종특별자치시세종시소정면축산송아지3<NA>15<NA>

Duplicate rows

Most frequently occurring

사업연도시도시군구읍면동분야품목농가수신청(마리수)# duplicates
02013세종특별자치시세종시소정면축산송아지3152
12013세종특별자치시세종시전의면축산송아지512332