Overview

Dataset statistics

Number of variables: 6
Number of observations: 1270
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 62.1 KiB
Average record size in memory: 50.1 B

Variable types

Numeric: 2
Text: 2
Categorical: 2

Dataset

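Description: Livestock farm data for Geochang-gun, Gyeongsangnam-do (Hanwoo cattle, beef cattle, dairy cattle, pigs, broilers, laying hens, ducks, and others). The dataset provides the business name (사업장명칭), main livestock type (주사육업종), head count (두수), and business location (사업장소재지).
Author: Geochang-gun, Gyeongsangnam-do (경상남도 거창군)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15031907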

Alerts

기준일자 (reference date) has constant value "2021-06-30" (Constant)
주사육업종 (main livestock type) is highly imbalanced (77.8%) (Imbalance)
순번 (serial number) has unique values (Unique)
두수 (head count) has 23 (1.8%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-11 00:29:28.489960
Analysis finished: 2023-12-11 00:29:29.337644
Duration: 0.85 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
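
A report of this kind can usually be regenerated in a few lines of Python. The following is a minimal sketch, not the exact procedure used for this report: the helper name `build_report`, the file names in the usage comment, the report title, and the `cp949` encoding (common for Korean public CSV files) are all assumptions.

```python
def build_report(csv_path: str, out_path: str, encoding: str = "cp949") -> None:
    """Profile a CSV file and write the resulting HTML report to out_path."""
    # Imported lazily so this helper can be defined without the optional
    # dependencies (pandas, ydata-profiling) being installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="Geochang-gun livestock farms")
    profile.to_file(out_path)


# Hypothetical usage (file names are placeholders):
# build_report("geochang_livestock.csv", "report.html")
```

Pinning the ydata-profiling version to the one listed above makes it more likely that the alerts and statistics come out the same.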

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 1270
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 635.5
Minimum: 1
Maximum: 1270
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 11.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 64.45
Q1: 318.25
Median: 635.5
Q3: 952.75
95-th percentile: 1206.55
Maximum: 1270
Range: 1269
Interquartile range (IQR): 634.5

Descriptive statistics

Standard deviation: 366.76173
Coefficient of variation (CV): 0.5771231
Kurtosis: -1.2
Mean: 635.5
Median Absolute Deviation (MAD): 317.5
Skewness: 0
Sum: 807085
Variance: 134514.17
Monotonicity: Strictly increasing
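
Because 순번 is a gap-free serial number from 1 to 1270, its statistics can be recomputed without the source file. The sketch below does so with the standard library. It assumes the report uses the sample variance (n - 1 denominator) and linear interpolation for quantiles, and the helper `quantile` is illustrative.

```python
import math
import statistics


def quantile(sorted_values, p):
    """Quantile with linear interpolation between the two nearest ranks."""
    pos = (len(sorted_values) - 1) * p
    lower = math.floor(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


values = list(range(1, 1271))  # 순번: 1, 2, ..., 1270

mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = math.sqrt(variance)

summary = {
    "sum": sum(values),
    "mean": mean,
    "variance": variance,
    "std": std,
    "cv": std / mean,
    "mad": statistics.median(abs(v - median) for v in values),
    "p5": quantile(values, 0.05),
    "q1": quantile(values, 0.25),
    "q3": quantile(values, 0.75),
    "p95": quantile(values, 0.95),
}

for name, value in summary.items():
    print(f"{name}: {value:.5f}")
```

If these values agree with the figures listed above, the column is a plain row index and carries no information about the farms themselves.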
Histogram with fixed size bins (bins=50)
Common values
Value                  Count   Frequency (%)
1                      1       0.1%
846                    1       0.1%
853                    1       0.1%
852                    1       0.1%
851                    1       0.1%
850                    1       0.1%
849                    1       0.1%
848                    1       0.1%
847                    1       0.1%
845                    1       0.1%
Other values (1260)    1260    99.2%

Minimum 10 values
Value   Count   Frequency (%)
1       1       0.1%
2       1       0.1%
3       1       0.1%
4       1       0.1%
5       1       0.1%
6       1       0.1%
7       1       0.1%
8       1       0.1%
9       1       0.1%
10      1       0.1%

Maximum 10 values
Value   Count   Frequency (%)
1270    1       0.1%
1269    1       0.1%
1268    1       0.1%
1267    1       0.1%
1266    1       0.1%
1265    1       0.1%
1264    1       0.1%
1263    1       0.1%
1262    1       0.1%
1261    1       0.1%

사업장명칭
Text

Distinct: 330
Distinct (%): 26.0%
Missing: 0
Missing (%): 0.0%
Memory size: 10.1 KiB

Length

Max length: 18
Median length: 1
Mean length: 2.0440945
Min length: 1

Characters and Unicode

Total characters: 2596
Distinct characters: 235
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 295
Unique (%): 23.2%
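
"Distinct" and "Unique" are easy to confuse. Under the usual reading, assumed here, distinct counts how many different values occur, while unique counts the values that occur exactly once. With 330 distinct and 295 unique values, that would leave 35 names shared by more than one row. The sketch below uses a small made-up list built from names that appear in this report. The helper `distinct_and_unique` is illustrative.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Toy list for illustration only; the repetition counts are not from the dataset.
names = ["대룡축산", "대룡축산", "해성농장", "금귀농장", "한우리", "한우리", "한우리"]
distinct, unique = distinct_and_unique(names)
print(distinct, unique)
```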

Sample

1st row: 해성농장
2nd row: 금귀농장
3rd row: 새싹농장
4th row: 에덴농장
5th row: 홍일농장
Value                 Count   Frequency (%)
(not shown)           895     68.7%
농장 (farm)           6       0.5%
대룡축산              5       0.4%
대경축산              4       0.3%
대박축산              3       0.2%
없음 (none)           3       0.2%
오성농장              3       0.2%
태양축산              3       0.2%
한우리                3       0.2%
개미농장              3       0.2%
Other values (335)    374     28.7%

Most occurring characters

ValueCountFrequency (%)
901
34.7%
203
 
7.8%
186
 
7.2%
157
 
6.0%
137
 
5.3%
50
 
1.9%
35
 
1.3%
32
 
1.2%
31
 
1.2%
24
 
0.9%
Other values (225) 840
32.4%

Most occurring categories

Value                Count   Frequency (%)
Other Letter         2546    98.1%
Space Separator      32      1.2%
Decimal Number       10      0.4%
Open Punctuation     4       0.2%
Close Punctuation    4       0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
901
35.4%
203
 
8.0%
186
 
7.3%
157
 
6.2%
137
 
5.4%
50
 
2.0%
35
 
1.4%
31
 
1.2%
24
 
0.9%
23
 
0.9%
Other values (220) 799
31.4%
Decimal Number
Value     Count   Frequency (%)
2         8       80.0%
1         2       20.0%
Space Separator
Value     Count   Frequency (%)
(space)   32      100.0%
Open Punctuation
Value     Count   Frequency (%)
(         4       100.0%
Close Punctuation
Value     Count   Frequency (%)
)         4       100.0%

Most occurring scripts

Value    Count   Frequency (%)
Hangul   2546    98.1%
Common   50      1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
901
35.4%
203
 
8.0%
186
 
7.3%
157
 
6.2%
137
 
5.4%
50
 
2.0%
35
 
1.4%
31
 
1.2%
24
 
0.9%
23
 
0.9%
Other values (220) 799
31.4%
Common
Value     Count   Frequency (%)
(space)   32      64.0%
2         8       16.0%
(         4       8.0%
)         4       8.0%
1         2       4.0%

Most occurring blocks

Value    Count   Frequency (%)
Hangul   2546    98.1%
ASCII    50      1.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
901
35.4%
203
 
8.0%
186
 
7.3%
157
 
6.2%
137
 
5.4%
50
 
2.0%
35
 
1.4%
31
 
1.2%
24
 
0.9%
23
 
0.9%
Other values (220) 799
31.4%
ASCII
Value     Count   Frequency (%)
(space)   32      64.0%
2         8       16.0%
(         4       8.0%
)         4       8.0%
1         2       4.0%

주사육업종
Categorical

IMBALANCE 

Distinct: 10
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 10.1 KiB
한우 (Hanwoo cattle): 1141
돼지 (pigs): 38
종계/산란계 (breeder/laying hens): 30
오리 (ducks): 26
육계 (broilers): 11
Other values (5): 24

Length

Max length: 6
Median length: 2
Mean length: 2.1007874
Min length: 2

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 돼지
2nd row: 돼지
3rd row: 돼지
4th row: 돼지
5th row: 돼지

Common Values

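Value                               Count   Frequency (%)
한우 (Hanwoo cattle)                1141    89.8%
돼지 (pigs)                         38      3.0%
종계/산란계 (breeder/laying hens)   30      2.4%
오리 (ducks)                        26      2.0%
육계 (broilers)                     11      0.9%
젖소 (dairy cattle)                 11      0.9%
염소 (goats)                        4       0.3%
육용오리 (meat ducks)               4       0.3%
육우 (beef cattle)                  4       0.3%
타조 (ostriches)                    1       0.1%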
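
The 77.8% imbalance figure in the alert can be approximated from the counts above. One common definition, assumed here rather than confirmed from the report, is 1 minus the Shannon entropy of the class distribution divided by the maximum possible entropy, log(k) for k classes. The function name `imbalance_score` is illustrative.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k): 0 for a uniform split, near 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Shannon entropy of the observed class proportions (natural log).
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Counts of the ten 주사육업종 categories, taken from the Common Values table.
counts = [1141, 38, 30, 26, 11, 11, 4, 4, 4, 1]
score = imbalance_score(counts)
print(f"{score:.1%}")
```

Under this definition the score is driven almost entirely by the 한우 share of 89.8%. The nine remaining categories together cover about 10% of the rows.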

Length

Histogram of lengths of the category

두수
Real number (ℝ)

ZEROS 

Distinct: 168
Distinct (%): 13.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2044.3512
Minimum: 0
Maximum: 300000
Zeros: 23
Zeros (%): 1.8%
Negative: 0
Negative (%): 0.0%
Memory size: 11.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 7
Median: 16
Q3: 41.75
95-th percentile: 3000
Maximum: 300000
Range: 300000
Interquartile range (IQR): 34.75

Descriptive statistics

Standard deviation: 13920.103
Coefficient of variation (CV): 6.8090565
Kurtosis: 206.36409
Mean: 2044.3512
Median Absolute Deviation (MAD): 12
Skewness: 12.493807
Sum: 2596326
Variance: 1.9376926 × 10^8
Monotonicity: Not monotonic
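
The figures for 두수 can be cross-checked against each other using only the numbers printed in this report. The sketch below tests three identities (mean = sum / n, CV = standard deviation / mean, variance = standard deviation squared) and computes the mean-to-median ratio as a rough indicator of skew. The variable names are illustrative.

```python
import math

# Values copied from the 두수 statistics in this report.
n = 1270                # number of observations
total = 2596326         # Sum
mean = 2044.3512        # Mean
median = 16             # median
std = 13920.103         # Standard deviation
cv = 6.8090565          # Coefficient of variation
variance = 1.9376926e8  # Variance

checks = {
    "mean == sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "cv == std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "variance == std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
}
mean_to_median = mean / median

for name, ok in checks.items():
    print(f"{name}: {ok}")
print(f"mean / median = {mean_to_median:.1f}")
```

A mean more than a hundred times the median, together with the high skewness, suggests that a few very large flocks (likely poultry, although the report does not break 두수 down by livestock type) dominate the total. The median and IQR are probably the more representative summaries here.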
Histogram with fixed size bins (bins=50)
Common values
Value                 Count   Frequency (%)
10                    66      5.2%
5                     61      4.8%
3                     61      4.8%
6                     53      4.2%
20                    52      4.1%
4                     49      3.9%
30                    46      3.6%
7                     45      3.5%
2                     43      3.4%
8                     42      3.3%
Other values (158)    752     59.2%

Minimum 10 values
Value   Count   Frequency (%)
0       23      1.8%
1       17      1.3%
2       43      3.4%
3       61      4.8%
4       49      3.9%
5       61      4.8%
6       53      4.2%
7       45      3.5%
8       42      3.3%
9       27      2.1%

Maximum 10 values
Value     Count   Frequency (%)
300000    1       0.1%
160000    1       0.1%
150000    1       0.1%
140000    1       0.1%
120000    2       0.2%
80640     1       0.1%
80000     2       0.2%
70000     1       0.1%
60000     1       0.1%
53000     1       0.1%

사업장소재지
Text

Distinct: 1217
Distinct (%): 95.8%
Missing: 0
Missing (%): 0.0%
Memory size: 10.1 KiB

Length

Max length: 47
Median length: 46
Mean length: 24.04252
Min length: 16

Characters and Unicode

Total characters: 30534
Distinct characters: 134
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1170
Unique (%): 92.1%

Sample

1st row: 경상남도 거창군 고제면 궁항리 1713
2nd row: 경상남도 거창군 거창읍 학리 152번지 1호
3rd row: 경상남도 거창군 웅양면 노현리 772번지
4th row: 경상남도 거창군 위천면 남산리 601번지
5th row: 경상남도 거창군 신원면 과정리 880번지 1
Value                  Count   Frequency (%)
경상남도               1270    17.9%
거창군                 1270    17.9%
남상면                 213     3.0%
1호                    185     2.6%
거창읍                 183     2.6%
가조면                 171     2.4%
신원면                 151     2.1%
위천면                 124     1.8%
마리면                 104     1.5%
(not shown)            92      1.3%
Other values (1079)    3321    46.9%

Most occurring characters

ValueCountFrequency (%)
5814
19.0%
1632
 
5.3%
1601
 
5.2%
1468
 
4.8%
1459
 
4.8%
1383
 
4.5%
1375
 
4.5%
1304
 
4.3%
1273
 
4.2%
1270
 
4.2%
Other values (124) 11955
39.2%

Most occurring categories

Value                Count   Frequency (%)
Other Letter         19640   64.3%
Space Separator      5814    19.0%
Decimal Number       4924    16.1%
Dash Punctuation     112     0.4%
Other Punctuation    31      0.1%
Open Punctuation     6       < 0.1%
Close Punctuation    6       < 0.1%
Math Symbol          1       < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1632
 
8.3%
1601
 
8.2%
1468
 
7.5%
1459
 
7.4%
1383
 
7.0%
1375
 
7.0%
1304
 
6.6%
1273
 
6.5%
1270
 
6.5%
1158
 
5.9%
Other values (107) 5717
29.1%
Decimal Number
Value     Count   Frequency (%)
1         1059    21.5%
2         550     11.2%
3         479     9.7%
4         470     9.5%
5         426     8.7%
0         417     8.5%
8         408     8.3%
6         407     8.3%
7         366     7.4%
9         342     6.9%
Other Punctuation
Value     Count   Frequency (%)
,         30      96.8%
*         1       3.2%
Space Separator
Value     Count   Frequency (%)
(space)   5814    100.0%
Dash Punctuation
Value     Count   Frequency (%)
-         112     100.0%
Open Punctuation
Value     Count   Frequency (%)
(         6       100.0%
Close Punctuation
Value     Count   Frequency (%)
)         6       100.0%
Math Symbol
Value     Count   Frequency (%)
~         1       100.0%

Most occurring scripts

Value    Count   Frequency (%)
Hangul   19640   64.3%
Common   10894   35.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1632
 
8.3%
1601
 
8.2%
1468
 
7.5%
1459
 
7.4%
1383
 
7.0%
1375
 
7.0%
1304
 
6.6%
1273
 
6.5%
1270
 
6.5%
1158
 
5.9%
Other values (107) 5717
29.1%
Common
Value               Count   Frequency (%)
(space)             5814    53.4%
1                   1059    9.7%
2                   550     5.0%
3                   479     4.4%
4                   470     4.3%
5                   426     3.9%
0                   417     3.8%
8                   408     3.7%
6                   407     3.7%
7                   366     3.4%
Other values (7)    498     4.6%

Most occurring blocks

Value    Count   Frequency (%)
Hangul   19640   64.3%
ASCII    10894   35.7%

Most frequent character per block

ASCII
Value               Count   Frequency (%)
(space)             5814    53.4%
1                   1059    9.7%
2                   550     5.0%
3                   479     4.4%
4                   470     4.3%
5                   426     3.9%
0                   417     3.8%
8                   408     3.7%
6                   407     3.7%
7                   366     3.4%
Other values (7)    498     4.6%
Hangul
ValueCountFrequency (%)
1632
 
8.3%
1601
 
8.2%
1468
 
7.5%
1459
 
7.4%
1383
 
7.0%
1375
 
7.0%
1304
 
6.6%
1273
 
6.5%
1270
 
6.5%
1158
 
5.9%
Other values (107) 5717
29.1%

기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.1 KiB
2021-06-30: 1270

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-06-30
2nd row: 2021-06-30
3rd row: 2021-06-30
4th row: 2021-06-30
5th row: 2021-06-30

Common Values

Value        Count   Frequency (%)
2021-06-30   1270    100.0%

Length

Histogram of lengths of the category

Interactions

[Figures: pairwise interaction plots of the numeric variables 순번 and 두수]

Correlations

              순번      주사육업종   두수
순번          1.000    0.754       0.285
주사육업종    0.754    1.000       0.585
두수          0.285    0.585       1.000

              순번      두수      주사육업종
순번          1.000    -0.384    0.323
두수          -0.384   1.000     0.345
주사육업종    0.323    0.345     1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index    순번    사업장명칭    주사육업종    두수    사업장소재지    기준일자
0    1    해성농장    돼지    1700    경상남도 거창군 고제면 궁항리 1713    2021-06-30
1    2    금귀농장    돼지    1800    경상남도 거창군 거창읍 학리 152번지 1호    2021-06-30
2    3    새싹농장    돼지    1000    경상남도 거창군 웅양면 노현리 772번지    2021-06-30
3    4    에덴농장    돼지    1300    경상남도 거창군 위천면 남산리 601번지    2021-06-30
4    5    홍일농장    돼지    3000    경상남도 거창군 신원면 과정리 880번지 1    2021-06-30
5    6    개미농장    돼지    3600    경상남도 거창군 위천면 남산리 618번지 외 4필지    2021-06-30
6    7    (blank)    돼지    450    경상남도 거창군 웅양면 노현리 1229번지 1호    2021-06-30
7    8    샘터농장    돼지    600    경상남도 거창군 위천면 모동리 377번지    2021-06-30
8    9    오성농장2    돼지    4000    경상남도 거창군 남하면 양항리 404번지    2021-06-30
9    10    삼정농장    돼지    1200    경상남도 거창군 거창읍 학리 1018번지    2021-06-30

Last rows
Index    순번    사업장명칭    주사육업종    두수    사업장소재지    기준일자
1260    1261    (blank)    한우    0    경상남도 거창군 남상면 둔동리 644번지 외 1필지(644-2)    2021-06-30
1261    1262    (blank)    한우    0    경상남도 거창군 마리면 고학리 223번지    2021-06-30
1262    1263    (blank)    한우    3    경상남도 거창군 웅양면 산포리 1044번지 1호    2021-06-30
1263    1264    (blank)    한우    13    경상남도 거창군 마리면 월계리 757번지 1호    2021-06-30
1264    1265    (blank)    한우    0    경상남도 거창군 고제면 봉계리 26번지 5호    2021-06-30
1265    1266    시비네축사    한우    6    경상남도 거창군 거창읍 장팔리 201번지 3호    2021-06-30
1266    1267    (blank)    한우    34    경상남도 거창군 위천면 당산리 585번지 1호    2021-06-30
1267    1268    (blank)    한우    0    경상남도 거창군 남상면 무촌리 476번지 1호    2021-06-30
1268    1269    (blank)    한우    28    경상남도 거창군 남상면 무촌리 476번지 2호    2021-06-30
1269    1270    한우리    한우    30    경상남도 거창군 거창읍 가지리 810번지    2021-06-30
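
The last ten sample rows give a small, concrete view of two data-quality patterns seen in this report: zero head counts and missing business names. The sketch below copies those rows by hand and filters them. It represents blank names as empty strings, which is an assumption about how they are stored in the source file.

```python
# (순번, 사업장명칭, 주사육업종, 두수), copied from the last ten sample rows.
# Blank names are written as "" here; the stored representation is an assumption.
tail_rows = [
    (1261, "", "한우", 0),
    (1262, "", "한우", 0),
    (1263, "", "한우", 3),
    (1264, "", "한우", 13),
    (1265, "", "한우", 0),
    (1266, "시비네축사", "한우", 6),
    (1267, "", "한우", 34),
    (1268, "", "한우", 0),
    (1269, "", "한우", 28),
    (1270, "한우리", "한우", 30),
]

# Serial numbers of rows with a head count of zero.
zero_head = [serial for serial, _name, _kind, head in tail_rows if head == 0]
# Serial numbers of rows with no visible business name.
unnamed = [serial for serial, name, _kind, _head in tail_rows if not name.strip()]

print("zero head count:", zero_head)
print("no business name:", unnamed)
```

Whether a zero means an empty barn on the reference date or an unrecorded value cannot be determined from the report. Such rows are worth reviewing before head counts are aggregated.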