Overview

Dataset statistics

Number of variables7
Number of observations1166
Missing cells571
Missing cells (%)7.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory66.2 KiB
Average record size in memory58.1 B

Variable types

Numeric2
Text2
Categorical3

Dataset

Description경상남도 거창군 축산농장(한우, 육우, 젖소, 돼지, 육계, 산란계, 오리 등) 데이터로 사업장 명칭, 주사육업종, 두수, 사업장 소재지 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15031907/fileData.do

Alerts

기준일자 has constant value ""Constant
주사육업종 is highly overall correlated with 등록축종High correlation
등록축종 is highly overall correlated with 주사육업종High correlation
주사육업종 is highly imbalanced (78.1%)Imbalance
등록축종 is highly imbalanced (78.8%)Imbalance
사업장소재지(도로명) has 571 (49.0%) missing valuesMissing
순번 has unique valuesUnique
사육두수 has 42 (3.6%) zerosZeros

Reproduction

Analysis started2023-12-12 06:49:31.764157
Analysis finished2023-12-12 06:49:32.761074
Duration1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct1166
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean583.5
Minimum1
Maximum1166
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.4 KiB
2023-12-12T15:49:32.846365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile59.25
Q1292.25
median583.5
Q3874.75
95-th percentile1107.75
Maximum1166
Range1165
Interquartile range (IQR)582.5

Descriptive statistics

Standard deviation336.73951
Coefficient of variation (CV)0.57710285
Kurtosis-1.2
Mean583.5
Median Absolute Deviation (MAD)291.5
Skewness0
Sum680361
Variance113393.5
MonotonicityStrictly increasing
2023-12-12T15:49:33.017632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
803 1
 
0.1%
783 1
 
0.1%
782 1
 
0.1%
781 1
 
0.1%
780 1
 
0.1%
779 1
 
0.1%
778 1
 
0.1%
777 1
 
0.1%
776 1
 
0.1%
Other values (1156) 1156
99.1%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1166 1
0.1%
1165 1
0.1%
1164 1
0.1%
1163 1
0.1%
1162 1
0.1%
1161 1
0.1%
1160 1
0.1%
1159 1
0.1%
1158 1
0.1%
1157 1
0.1%
Distinct341
Distinct (%)29.2%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
2023-12-12T15:49:33.238922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length1
Mean length2.2084048
Min length1

Characters and Unicode

Total characters2575
Distinct characters243
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique307 ?
Unique (%)26.3%

Sample

1st row해성농장
2nd row
3rd row한우
4th row
5th row
ValueCountFrequency (%)
732
61.2%
없음 52
 
4.3%
농장 6
 
0.5%
대룡축산 4
 
0.3%
대경축산 4
 
0.3%
한우리 3
 
0.3%
대박축산 3
 
0.3%
행복한 3
 
0.3%
개미농장 3
 
0.3%
태양축산 3
 
0.3%
Other values (345) 384
32.1%
2023-12-12T15:49:33.553499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
738
28.7%
202
 
7.8%
186
 
7.2%
169
 
6.6%
148
 
5.7%
52
 
2.0%
52
 
2.0%
49
 
1.9%
34
 
1.3%
31
 
1.2%
Other values (233) 914
35.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2523
98.0%
Space Separator 31
 
1.2%
Decimal Number 13
 
0.5%
Open Punctuation 4
 
0.2%
Close Punctuation 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
738
29.3%
202
 
8.0%
186
 
7.4%
169
 
6.7%
148
 
5.9%
52
 
2.1%
52
 
2.1%
49
 
1.9%
34
 
1.3%
30
 
1.2%
Other values (227) 863
34.2%
Decimal Number
ValueCountFrequency (%)
2 9
69.2%
1 3
 
23.1%
3 1
 
7.7%
Space Separator
ValueCountFrequency (%)
31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2523
98.0%
Common 52
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
738
29.3%
202
 
8.0%
186
 
7.4%
169
 
6.7%
148
 
5.9%
52
 
2.1%
52
 
2.1%
49
 
1.9%
34
 
1.3%
30
 
1.2%
Other values (227) 863
34.2%
Common
ValueCountFrequency (%)
31
59.6%
2 9
 
17.3%
( 4
 
7.7%
) 4
 
7.7%
1 3
 
5.8%
3 1
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2523
98.0%
ASCII 52
 
2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
738
29.3%
202
 
8.0%
186
 
7.4%
169
 
6.7%
148
 
5.9%
52
 
2.1%
52
 
2.1%
49
 
1.9%
34
 
1.3%
30
 
1.2%
Other values (227) 863
34.2%
ASCII
ValueCountFrequency (%)
31
59.6%
2 9
 
17.3%
( 4
 
7.7%
) 4
 
7.7%
1 3
 
5.8%
3 1
 
1.9%

주사육업종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct9
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
한우
1059 
돼지
 
28
오리
 
22
종계/산란계
 
21
젖소
 
9
Other values (4)
 
27

Length

Max length6
Median length2
Mean length2.0806175
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row돼지
2nd row한우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 1059
90.8%
돼지 28
 
2.4%
오리 22
 
1.9%
종계/산란계 21
 
1.8%
젖소 9
 
0.8%
육계 9
 
0.8%
염소 7
 
0.6%
육우 6
 
0.5%
육용오리 5
 
0.4%

Length

2023-12-12T15:49:33.677668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:49:33.777859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한우 1059
90.8%
돼지 28
 
2.4%
오리 22
 
1.9%
종계/산란계 21
 
1.8%
젖소 9
 
0.8%
육계 9
 
0.8%
염소 7
 
0.6%
육우 6
 
0.5%
육용오리 5
 
0.4%

등록축종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct15
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
한우
1042 
돼지
 
26
종계/산란계
 
21
오리
 
20
<NA>
 
19
Other values (10)
 
38

Length

Max length10
Median length2
Mean length2.1457976
Min length2

Unique

Unique3 ?
Unique (%)0.3%

Sample

1st row돼지
2nd row한우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 1042
89.4%
돼지 26
 
2.2%
종계/산란계 21
 
1.8%
오리 20
 
1.7%
<NA> 19
 
1.6%
육계 8
 
0.7%
젖소 6
 
0.5%
육용오리 6
 
0.5%
염소 6
 
0.5%
육우 4
 
0.3%
Other values (5) 8
 
0.7%

Length

2023-12-12T15:49:33.898722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 1050
89.4%
돼지 27
 
2.3%
종계/산란계 21
 
1.8%
오리 20
 
1.7%
na 19
 
1.6%
젖소 10
 
0.9%
육계 9
 
0.8%
육우 7
 
0.6%
육용오리 6
 
0.5%
염소 6
 
0.5%
Distinct555
Distinct (%)93.3%
Missing571
Missing (%)49.0%
Memory size9.2 KiB
2023-12-12T15:49:34.118220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length31
Mean length22.366387
Min length19

Characters and Unicode

Total characters13308
Distinct characters185
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique522 ?
Unique (%)87.7%

Sample

1st row경상남도 거창군 고제면 고제로 303-47, 해성농장
2nd row경상남도 거창군 마리면 원말흘1길 49-28
3rd row경상남도 거창군 마리면 토점길 73
4th row경상남도 거창군 마리면 엄대1길 149-15
5th row경상남도 거창군 마리면 동편길 18
ValueCountFrequency (%)
경상남도 595
19.8%
거창군 595
19.8%
거창읍 91
 
3.0%
남상면 86
 
2.9%
가조면 84
 
2.8%
신원면 82
 
2.7%
마리면 66
 
2.2%
위천면 54
 
1.8%
남하면 38
 
1.3%
북상면 25
 
0.8%
Other values (781) 1283
42.8%
2023-12-12T15:49:34.490243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2405
18.1%
752
 
5.7%
749
 
5.6%
702
 
5.3%
695
 
5.2%
607
 
4.6%
596
 
4.5%
595
 
4.5%
1 551
 
4.1%
507
 
3.8%
Other values (175) 5149
38.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7936
59.6%
Decimal Number 2552
 
19.2%
Space Separator 2405
 
18.1%
Dash Punctuation 388
 
2.9%
Other Punctuation 11
 
0.1%
Open Punctuation 8
 
0.1%
Close Punctuation 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
752
 
9.5%
749
 
9.4%
702
 
8.8%
695
 
8.8%
607
 
7.6%
596
 
7.5%
595
 
7.5%
507
 
6.4%
462
 
5.8%
146
 
1.8%
Other values (160) 2125
26.8%
Decimal Number
ValueCountFrequency (%)
1 551
21.6%
2 379
14.9%
3 255
10.0%
4 251
9.8%
5 225
8.8%
6 204
 
8.0%
7 180
 
7.1%
9 176
 
6.9%
0 169
 
6.6%
8 162
 
6.3%
Space Separator
ValueCountFrequency (%)
2405
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 388
100.0%
Other Punctuation
ValueCountFrequency (%)
, 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7936
59.6%
Common 5372
40.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
752
 
9.5%
749
 
9.4%
702
 
8.8%
695
 
8.8%
607
 
7.6%
596
 
7.5%
595
 
7.5%
507
 
6.4%
462
 
5.8%
146
 
1.8%
Other values (160) 2125
26.8%
Common
ValueCountFrequency (%)
2405
44.8%
1 551
 
10.3%
- 388
 
7.2%
2 379
 
7.1%
3 255
 
4.7%
4 251
 
4.7%
5 225
 
4.2%
6 204
 
3.8%
7 180
 
3.4%
9 176
 
3.3%
Other values (5) 358
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7936
59.6%
ASCII 5372
40.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2405
44.8%
1 551
 
10.3%
- 388
 
7.2%
2 379
 
7.1%
3 255
 
4.7%
4 251
 
4.7%
5 225
 
4.2%
6 204
 
3.8%
7 180
 
3.4%
9 176
 
3.3%
Other values (5) 358
 
6.7%
Hangul
ValueCountFrequency (%)
752
 
9.5%
749
 
9.4%
702
 
8.8%
695
 
8.8%
607
 
7.6%
596
 
7.5%
595
 
7.5%
507
 
6.4%
462
 
5.8%
146
 
1.8%
Other values (160) 2125
26.8%

사육두수
Real number (ℝ)

ZEROS 

Distinct182
Distinct (%)15.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2483.7985
Minimum0
Maximum450000
Zeros42
Zeros (%)3.6%
Negative0
Negative (%)0.0%
Memory size10.4 KiB
2023-12-12T15:49:34.621063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q17
median18.5
Q350
95-th percentile2012.5
Maximum450000
Range450000
Interquartile range (IQR)43

Descriptive statistics

Standard deviation20868.017
Coefficient of variation (CV)8.4016547
Kurtosis254.21731
Mean2483.7985
Median Absolute Deviation (MAD)13.5
Skewness14.563372
Sum2896109
Variance4.3547413 × 108
MonotonicityNot monotonic
2023-12-12T15:49:34.751081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 59
 
5.1%
3 49
 
4.2%
6 47
 
4.0%
30 45
 
3.9%
20 44
 
3.8%
7 42
 
3.6%
0 42
 
3.6%
4 40
 
3.4%
15 36
 
3.1%
5 36
 
3.1%
Other values (172) 726
62.3%
ValueCountFrequency (%)
0 42
3.6%
1 6
 
0.5%
2 35
3.0%
3 49
4.2%
4 40
3.4%
5 36
3.1%
6 47
4.0%
7 42
3.6%
8 35
3.0%
9 23
2.0%
ValueCountFrequency (%)
450000 1
0.1%
300000 1
0.1%
289000 1
0.1%
160000 1
0.1%
150000 1
0.1%
140000 1
0.1%
120000 1
0.1%
88000 1
0.1%
80640 1
0.1%
80000 2
0.2%

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
2023-07-03
1166 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-03
2nd row2023-07-03
3rd row2023-07-03
4th row2023-07-03
5th row2023-07-03

Common Values

ValueCountFrequency (%)
2023-07-03 1166
100.0%

Length

2023-12-12T15:49:34.886210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:49:35.002800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-03 1166
100.0%

Interactions

2023-12-12T15:49:32.357488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:49:32.147181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:49:32.479954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:49:32.247561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:49:35.091635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번주사육업종등록축종사육두수
순번1.0000.3050.2960.113
주사육업종0.3051.0000.9790.613
등록축종0.2960.9791.0000.620
사육두수0.1130.6130.6201.000
2023-12-12T15:49:35.225761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주사육업종등록축종
주사육업종1.0000.917
등록축종0.9171.000
2023-12-12T15:49:35.342339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사육두수주사육업종등록축종
순번1.000-0.1770.1430.123
사육두수-0.1771.0000.3590.363
주사육업종0.1430.3591.0000.917
등록축종0.1230.3630.9171.000

Missing values

2023-12-12T15:49:32.597687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:49:32.701711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번사업장명칭주사육업종등록축종사업장소재지(도로명)사육두수기준일자
01해성농장돼지돼지경상남도 거창군 고제면 고제로 303-47, 해성농장17002023-07-03
12한우한우경상남도 거창군 마리면 원말흘1길 49-2822023-07-03
23한우한우한우경상남도 거창군 마리면 토점길 73352023-07-03
34한우한우경상남도 거창군 마리면 엄대1길 149-15212023-07-03
45한우한우경상남도 거창군 마리면 동편길 18202023-07-03
56한우한우경상남도 거창군 마리면 거안로 931-8202023-07-03
67금귀농장돼지돼지경상남도 거창군 거창읍 구례길 242-11718002023-07-03
78새싹농장돼지돼지경상남도 거창군 웅양면 원촌3길 109-20510002023-07-03
89개미농장돼지돼지<NA>36002023-07-03
910오성농장2돼지돼지경상남도 거창군 남하면 양항길 124-5640002023-07-03
순번사업장명칭주사육업종등록축종사업장소재지(도로명)사육두수기준일자
11561157없음한우한우경상남도 거창군 남상면 고척길 46022023-07-03
11571158월평축산1한우한우<NA>802023-07-03
11581159없음염소염소<NA>382023-07-03
11591160없음한우한우<NA>212023-07-03
11601161유담농장한우한우<NA>542023-07-03
11611162강동댁축산한우한우<NA>92023-07-03
11621163없음한우한우<NA>62023-07-03
11631164없음한우한우<NA>122023-07-03
11641165대원축산한우<NA><NA>02023-07-03
11651166없음한우<NA><NA>02023-07-03