Overview

Dataset statistics

Number of variables6
Number of observations1165
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory57.0 KiB
Average record size in memory50.1 B

Variable types

Numeric2
Text2
Categorical2

Dataset

Description경상남도 거창군 축산농장(한우, 육우, 젖소, 돼지, 육계, 산란계, 오리 등) 데이터로 사업장 명칭, 주사육업종, 두수, 사업장 소재지 항목을 제공합니다.
Author경상남도 거창군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15031907

Alerts

기준일자 has constant value ""Constant
주사육업종 is highly imbalanced (79.3%)Imbalance
순번 has unique valuesUnique
두수 has 31 (2.7%) zerosZeros

Reproduction

Analysis started2023-12-11 00:29:23.420311
Analysis finished2023-12-11 00:29:24.269139
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct1165
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean583
Minimum1
Maximum1165
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.4 KiB
2023-12-11T09:29:24.354383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile59.2
Q1292
median583
Q3874
95-th percentile1106.8
Maximum1165
Range1164
Interquartile range (IQR)582

Descriptive statistics

Standard deviation336.45084
Coefficient of variation (CV)0.57710264
Kurtosis-1.2
Mean583
Median Absolute Deviation (MAD)291
Skewness0
Sum679195
Variance113199.17
MonotonicityStrictly increasing
2023-12-11T09:29:24.520367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
802 1
 
0.1%
782 1
 
0.1%
781 1
 
0.1%
780 1
 
0.1%
779 1
 
0.1%
778 1
 
0.1%
777 1
 
0.1%
776 1
 
0.1%
775 1
 
0.1%
Other values (1155) 1155
99.1%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1165 1
0.1%
1164 1
0.1%
1163 1
0.1%
1162 1
0.1%
1161 1
0.1%
1160 1
0.1%
1159 1
0.1%
1158 1
0.1%
1157 1
0.1%
1156 1
0.1%
Distinct333
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
2023-12-11T09:29:24.776121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length1
Mean length2.1321888
Min length1

Characters and Unicode

Total characters2484
Distinct characters235
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique300 ?
Unique (%)25.8%

Sample

1st row해성농장
2nd row
3rd row한우
4th row
5th row
ValueCountFrequency (%)
791
66.1%
농장 6
 
0.5%
대룡축산 4
 
0.3%
대경축산 4
 
0.3%
개미농장 3
 
0.3%
축산 3
 
0.3%
행복한 3
 
0.3%
대박축산 3
 
0.3%
초계농장 3
 
0.3%
한우리 3
 
0.3%
Other values (337) 373
31.2%
2023-12-11T09:29:25.286572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
797
32.1%
200
 
8.1%
180
 
7.2%
161
 
6.5%
141
 
5.7%
51
 
2.1%
34
 
1.4%
31
 
1.2%
28
 
1.1%
24
 
1.0%
Other values (225) 837
33.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2434
98.0%
Space Separator 31
 
1.2%
Decimal Number 11
 
0.4%
Close Punctuation 4
 
0.2%
Open Punctuation 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
797
32.7%
200
 
8.2%
180
 
7.4%
161
 
6.6%
141
 
5.8%
51
 
2.1%
34
 
1.4%
28
 
1.2%
24
 
1.0%
22
 
0.9%
Other values (220) 796
32.7%
Decimal Number
ValueCountFrequency (%)
2 9
81.8%
1 2
 
18.2%
Space Separator
ValueCountFrequency (%)
31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2434
98.0%
Common 50
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
797
32.7%
200
 
8.2%
180
 
7.4%
161
 
6.6%
141
 
5.8%
51
 
2.1%
34
 
1.4%
28
 
1.2%
24
 
1.0%
22
 
0.9%
Other values (220) 796
32.7%
Common
ValueCountFrequency (%)
31
62.0%
2 9
 
18.0%
) 4
 
8.0%
( 4
 
8.0%
1 2
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2434
98.0%
ASCII 50
 
2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
797
32.7%
200
 
8.2%
180
 
7.4%
161
 
6.6%
141
 
5.8%
51
 
2.1%
34
 
1.4%
28
 
1.2%
24
 
1.0%
22
 
0.9%
Other values (220) 796
32.7%
ASCII
ValueCountFrequency (%)
31
62.0%
2 9
 
18.0%
) 4
 
8.0%
( 4
 
8.0%
1 2
 
4.0%

주사육업종
Categorical

IMBALANCE 

Distinct14
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
한우
1047 
돼지
 
27
종계/산란계
 
24
오리
 
23
육계
 
8
Other values (9)
 
36

Length

Max length6
Median length2
Mean length2.1304721
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row돼지
2nd row한우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 1047
89.9%
돼지 27
 
2.3%
종계/산란계 24
 
2.1%
오리 23
 
2.0%
육계 8
 
0.7%
젖소 7
 
0.6%
<NA> 7
 
0.6%
육용오리 5
 
0.4%
염소 5
 
0.4%
육우 4
 
0.3%
Other values (4) 8
 
0.7%

Length

2023-12-11T09:29:25.449363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 1055
89.9%
돼지 29
 
2.5%
종계/산란계 24
 
2.0%
오리 23
 
2.0%
젖소 10
 
0.9%
육계 9
 
0.8%
na 7
 
0.6%
육우 6
 
0.5%
육용오리 5
 
0.4%
염소 5
 
0.4%

두수
Real number (ℝ)

ZEROS 

Distinct178
Distinct (%)15.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2362.8249
Minimum0
Maximum450000
Zeros31
Zeros (%)2.7%
Negative0
Negative (%)0.0%
Memory size10.4 KiB
2023-12-11T09:29:25.581529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q17
median19
Q350
95-th percentile2500
Maximum450000
Range450000
Interquartile range (IQR)43

Descriptive statistics

Standard deviation19261.54
Coefficient of variation (CV)8.1519117
Kurtosis310.07842
Mean2362.8249
Median Absolute Deviation (MAD)14
Skewness15.779507
Sum2752691
Variance3.7100692 × 108
MonotonicityNot monotonic
2023-12-11T09:29:25.717421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 59
 
5.1%
30 50
 
4.3%
3 48
 
4.1%
6 46
 
3.9%
20 45
 
3.9%
5 41
 
3.5%
7 41
 
3.5%
4 40
 
3.4%
15 37
 
3.2%
2 35
 
3.0%
Other values (168) 723
62.1%
ValueCountFrequency (%)
0 31
2.7%
1 10
 
0.9%
2 35
3.0%
3 48
4.1%
4 40
3.4%
5 41
3.5%
6 46
3.9%
7 41
3.5%
8 34
2.9%
9 21
1.8%
ValueCountFrequency (%)
450000 1
0.1%
300000 1
0.1%
160000 1
0.1%
150000 1
0.1%
140000 1
0.1%
120000 1
0.1%
100000 1
0.1%
80640 1
0.1%
80000 2
0.2%
70000 1
0.1%
Distinct1130
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
2023-12-11T09:29:26.050765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length56
Mean length26.248069
Min length4

Characters and Unicode

Total characters30579
Distinct characters131
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1098 ?
Unique (%)94.2%

Sample

1st row경상남도 거창군 고제면 궁항리 1713 외 2필지
2nd row경상남도 거창군 마리면 말흘리 763번지
3rd row경상남도 거창군 마리면 월계리 757번지 1호
4th row경상남도 거창군 마리면 대동리 987번지 3호
5th row경상남도 거창군 마리면 대동리 389번지
ValueCountFrequency (%)
경상남도 1164
 
17.7%
거창군 1164
 
17.7%
남상면 192
 
2.9%
1호 178
 
2.7%
거창읍 171
 
2.6%
가조면 164
 
2.5%
신원면 128
 
1.9%
위천면 111
 
1.7%
마리면 105
 
1.6%
101
 
1.5%
Other values (1047) 3100
47.1%
2023-12-11T09:29:26.559043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7539
24.7%
1492
 
4.9%
1460
 
4.8%
1347
 
4.4%
1337
 
4.4%
1273
 
4.2%
1266
 
4.1%
1198
 
3.9%
1167
 
3.8%
1164
 
3.8%
Other values (121) 11336
37.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18066
59.1%
Space Separator 7539
24.7%
Decimal Number 4739
 
15.5%
Dash Punctuation 143
 
0.5%
Other Punctuation 61
 
0.2%
Close Punctuation 15
 
< 0.1%
Open Punctuation 15
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1492
 
8.3%
1460
 
8.1%
1347
 
7.5%
1337
 
7.4%
1273
 
7.0%
1266
 
7.0%
1198
 
6.6%
1167
 
6.5%
1164
 
6.4%
1055
 
5.8%
Other values (105) 5307
29.4%
Decimal Number
ValueCountFrequency (%)
1 1014
21.4%
2 533
11.2%
3 457
9.6%
4 454
9.6%
6 416
8.8%
5 415
8.8%
0 404
 
8.5%
8 390
 
8.2%
7 342
 
7.2%
9 314
 
6.6%
Space Separator
ValueCountFrequency (%)
7539
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 143
100.0%
Other Punctuation
ValueCountFrequency (%)
, 61
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18066
59.1%
Common 12513
40.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1492
 
8.3%
1460
 
8.1%
1347
 
7.5%
1337
 
7.4%
1273
 
7.0%
1266
 
7.0%
1198
 
6.6%
1167
 
6.5%
1164
 
6.4%
1055
 
5.8%
Other values (105) 5307
29.4%
Common
ValueCountFrequency (%)
7539
60.2%
1 1014
 
8.1%
2 533
 
4.3%
3 457
 
3.7%
4 454
 
3.6%
6 416
 
3.3%
5 415
 
3.3%
0 404
 
3.2%
8 390
 
3.1%
7 342
 
2.7%
Other values (6) 549
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18066
59.1%
ASCII 12513
40.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7539
60.2%
1 1014
 
8.1%
2 533
 
4.3%
3 457
 
3.7%
4 454
 
3.6%
6 416
 
3.3%
5 415
 
3.3%
0 404
 
3.2%
8 390
 
3.1%
7 342
 
2.7%
Other values (6) 549
 
4.4%
Hangul
ValueCountFrequency (%)
1492
 
8.3%
1460
 
8.1%
1347
 
7.5%
1337
 
7.4%
1273
 
7.0%
1266
 
7.0%
1198
 
6.6%
1167
 
6.5%
1164
 
6.4%
1055
 
5.8%
Other values (105) 5307
29.4%

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.2 KiB
2022-06-30
1165 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-06-30
2nd row2022-06-30
3rd row2022-06-30
4th row2022-06-30
5th row2022-06-30

Common Values

ValueCountFrequency (%)
2022-06-30 1165
100.0%

Length

2023-12-11T09:29:26.698062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:29:26.784217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-06-30 1165
100.0%

Interactions

2023-12-11T09:29:23.923790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:23.744468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:24.009525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:29:23.842350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:29:26.837871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번주사육업종두수
순번1.0000.3060.130
주사육업종0.3061.0000.597
두수0.1300.5971.000
2023-12-11T09:29:26.925912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번두수주사육업종
순번1.000-0.1470.131
두수-0.1471.0000.346
주사육업종0.1310.3461.000

Missing values

2023-12-11T09:29:24.109720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:29:24.222181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번사업장명칭주사육업종두수사업장소재지기준일자
01해성농장돼지1800경상남도 거창군 고제면 궁항리 1713 외 2필지2022-06-30
12한우2경상남도 거창군 마리면 말흘리 763번지2022-06-30
23한우한우35경상남도 거창군 마리면 월계리 757번지 1호2022-06-30
34한우21경상남도 거창군 마리면 대동리 987번지 3호2022-06-30
45한우20경상남도 거창군 마리면 대동리 389번지2022-06-30
56한우20경상남도 거창군 마리면 영승리 575번지 1호2022-06-30
67금귀농장돼지1800경상남도 거창군 거창읍 학리 152번지 1호2022-06-30
78새싹농장돼지1000경상남도 거창군 웅양면 노현리 772번지2022-06-30
89에덴농장돼지1300경상남도 거창군 위천면 남산리 601번지2022-06-30
910개미농장돼지3600경상남도 거창군 위천면 남산리 618번지 외 4필지2022-06-30
순번사업장명칭주사육업종두수사업장소재지기준일자
11551156한우39경상남도 거창군 남상면 둔동리 256번지 외 1필지(254-3)2022-06-30
11561157한우136경상남도 거창군 남하면 둔마리 186번지 1호 외 1필지(186)2022-06-30
11571158한우24경상남도 거창군 신원면 양지리 489번지2022-06-30
11581159한우45경상남도 거창군 위천면 황산리 203번지 3호 ,203-2,645-1,645-32022-06-30
11591160한우137경상남도 거창군 가북면 박암리 884번지 4호 외 2필지(884-5, 884-6)2022-06-30
11601161대원농장육용오리10000경상남도 거창군 가조면 동례리 92번지 3호 , 92-42022-06-30
11611162맹양네축산한우66경상남도 거창군 위천면 황산리 186번지2022-06-30
11621163우보농장한우26경상남도 거창군 거창읍 학리 264번지2022-06-30
11631164한우14경상남도 거창군 남하면 양항리 614번지2022-06-30
11641165천하축산한우174경상남도 거창군 남상면 대산리 1457번지 4호2022-06-30