Overview

Dataset statistics

Number of variables: 5
Number of observations: 1973
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 9
Duplicate rows (%): 0.5%
Total size in memory: 79.1 KiB
Average record size in memory: 41.1 B
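The percentage and per-record figures above follow from the raw counts. The sketch below recomputes them as a sanity check; the variable names are illustrative, and the memory figure is taken from the rounded 79.1 KiB shown in the report, so the result is only approximate.

```python
def percent(part: int, total: int) -> float:
    """Return part/total as a percentage (0.0 when total is 0)."""
    return 100.0 * part / total if total else 0.0

# Figures copied from the dataset statistics panel
n_rows = 1973          # number of observations
n_cols = 5             # number of variables
duplicate_rows = 9
missing_cells = 0
total_kib = 79.1       # rounded value as printed in the report

dup_pct = percent(duplicate_rows, n_rows)
missing_pct = percent(missing_cells, n_rows * n_cols)
avg_record_bytes = total_kib * 1024 / n_rows   # 1 KiB = 1024 bytes

print(f"Duplicate rows (%): {dup_pct:.1f}%")
print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```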

Variable types

Text: 2
Categorical: 1
Numeric: 1
DateTime: 1

Dataset

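Description: Data on livestock farms in Gongju-si, Chungcheongnam-do. It provides fields such as the farm name, the livestock type and head count, and the farm location.
Author: Gongju-si, Chungcheongnam-do (충청남도 공주시)
URL: https://www.data.go.kr/data/15028263/fileData.do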

Alerts

데이터기준일 (data reference date) has a constant value, 2019-02-13 (Constant)
Dataset has 9 (0.5%) duplicate rows (Duplicates)
주사육업종 (main livestock type) is highly imbalanced, 68.6% (Imbalance)
사육두수 (head count) has 55 (2.8%) zeros (Zeros)
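The 68.6% imbalance figure can be reproduced from the category counts of 주사육업종 listed later in the report. The sketch below assumes the score is one minus the normalised Shannon entropy, 1 - H / log(k) for k classes; that formula is an assumption about how the profiler derives the number, not something the report states. The three "Other values" categories total 3 rows, so each must occur exactly once.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/log(k): 0 for a uniform distribution, near 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k <= 1 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

# Counts for the 13 distinct values of 주사육업종 (last three are the singletons)
counts = [1615, 99, 89, 70, 36, 28, 13, 9, 9, 2, 1, 1, 1]
score = imbalance_score(counts)
print(f"Imbalance: {score:.1%}")
```

Under this assumption the result should land close to the value in the alert; a perfectly balanced column would score 0.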

Reproduction

Analysis started: 2023-12-12 08:39:34.333508
Analysis finished: 2023-12-12 08:39:34.820373
Duration: 0.49 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

사업장명칭 (farm name)
Text

Distinct: 1627
Distinct (%): 82.5%
Missing: 0
Missing (%): 0.0%
Memory size: 15.5 KiB

Length

Max length: 14
Median length: 13
Mean length: 3.4586923
Min length: 1

Characters and Unicode

Total characters: 6824
Distinct characters: 374
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
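
As an illustration of how such per-character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. This is a stand-in chosen for the example; the report's own implementation may use a different library. The sample names are values that appear in this report.

```python
import unicodedata
from collections import Counter

# Short codes returned by unicodedata.category, mapped to the long names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not listed above
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

names = ["구룡", "제2농장", "우리농장"]
counts = category_counts(names)
for name, n in counts.most_common():
    print(name, n)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the tables for the text variables.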

Unique

Unique: 1416
Unique (%): 71.8%

Sample

1st row: 구룡
2nd row: 명승
3rd row: 주윤
4th row: 정모
5th row: 환만

Value  Count  Frequency (%)
농장  91  4.3%
대성  16  0.8%
우리농장  10  0.5%
제2농장  10  0.5%
축사  8  0.4%
내산  8  0.4%
목장  8  0.4%
한우  7  0.3%
경천  7  0.3%
보흥  6  0.3%
Other values (1607)  1936  91.9%

Most occurring characters

Value  Count  Frequency (%)
(missing)  986  14.4%
(missing)  859  12.6%
(missing)  156  2.3%
(missing)  150  2.2%
(missing)  149  2.2%
(space)  134  2.0%
(missing)  131  1.9%
(missing)  95  1.4%
(missing)  95  1.4%
(missing)  91  1.3%
Other values (364)  3978  58.3%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  6626  97.1%
Space Separator  134  2.0%
Decimal Number  36  0.5%
Uppercase Letter  21  0.3%
Open Punctuation  3  < 0.1%
Close Punctuation  3  < 0.1%
Other Punctuation  1  < 0.1%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
(missing)  986  14.9%
(missing)  859  13.0%
(missing)  156  2.4%
(missing)  150  2.3%
(missing)  149  2.2%
(missing)  131  2.0%
(missing)  95  1.4%
(missing)  95  1.4%
(missing)  91  1.4%
(missing)  87  1.3%
Other values (346)  3827  57.8%

Uppercase Letter
Value  Count  Frequency (%)
K  7  33.3%
O  5  23.8%
C  2  9.5%
M  2  9.5%
A  2  9.5%
B  1  4.8%
Y  1  4.8%
J  1  4.8%

Decimal Number
Value  Count  Frequency (%)
2  24  66.7%
3  4  11.1%
1  4  11.1%
8  2  5.6%
6  1  2.8%
7  1  2.8%

Space Separator
Value  Count  Frequency (%)
(space)  134  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  3  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  3  100.0%

Other Punctuation
Value  Count  Frequency (%)
.  1  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  6626  97.1%
Common  177  2.6%
Latin  21  0.3%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
(missing)  986  14.9%
(missing)  859  13.0%
(missing)  156  2.4%
(missing)  150  2.3%
(missing)  149  2.2%
(missing)  131  2.0%
(missing)  95  1.4%
(missing)  95  1.4%
(missing)  91  1.4%
(missing)  87  1.3%
Other values (346)  3827  57.8%

Common
Value  Count  Frequency (%)
(space)  134  75.7%
2  24  13.6%
3  4  2.3%
1  4  2.3%
(  3  1.7%
)  3  1.7%
8  2  1.1%
6  1  0.6%
7  1  0.6%
.  1  0.6%

Latin
Value  Count  Frequency (%)
K  7  33.3%
O  5  23.8%
C  2  9.5%
M  2  9.5%
A  2  9.5%
B  1  4.8%
Y  1  4.8%
J  1  4.8%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  6626  97.1%
ASCII  198  2.9%

Most frequent character per block

Hangul
Value  Count  Frequency (%)
(missing)  986  14.9%
(missing)  859  13.0%
(missing)  156  2.4%
(missing)  150  2.3%
(missing)  149  2.2%
(missing)  131  2.0%
(missing)  95  1.4%
(missing)  95  1.4%
(missing)  91  1.4%
(missing)  87  1.3%
Other values (346)  3827  57.8%

ASCII
Value  Count  Frequency (%)
(space)  134  67.7%
2  24  12.1%
K  7  3.5%
O  5  2.5%
3  4  2.0%
1  4  2.0%
(  3  1.5%
)  3  1.5%
C  2  1.0%
M  2  1.0%
Other values (8)  10  5.1%

주사육업종 (main livestock type)
Categorical

IMBALANCE

Distinct: 13
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 15.5 KiB

한우: 1615
육계: 99
돼지: 89
젖소: 70
사슴: 36
Other values (8): 64

Length

Max length: 6
Median length: 2
Mean length: 2.0172326
Min length: 2

Unique

Unique: 3
Unique (%): 0.2%

Sample

1st row: 돼지
2nd row: 산란계
3rd row: 한우
4th row: 돼지
5th row: 돼지

Common Values

Value  Count  Frequency (%)
한우  1615  81.9%
육계  99  5.0%
돼지  89  4.5%
젖소  70  3.5%
사슴  36  1.8%
산란계  28  1.4%
염소  13  0.7%
육우  9  0.5%
산양  9  0.5%
메추리  2  0.1%
Other values (3)  3  0.2%

Length

Histogram of lengths of the category

사업장소재지 (farm location)
Text

Distinct: 423
Distinct (%): 21.4%
Missing: 0
Missing (%): 0.0%
Memory size: 15.5 KiB

Length

Max length: 28
Median length: 26
Mean length: 19.303599
Min length: 13

Characters and Unicode

Total characters: 38086
Distinct characters: 139
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 152
Unique (%): 7.7%

Sample

1st row: 충청남도 공주시 이인면 신영리
2nd row: 충청남도 공주시 계룡면 경천리
3rd row: 충청남도 공주시 이인면 만수리
4th row: 충청남도 공주시 정안면 인풍리
5th row: 충청남도 공주시 우성면 상서리

Value  Count  Frequency (%)
충청남도  1973  27.0%
공주시  1973  27.0%
우성면  367  5.0%
탄천면  221  3.0%
이인면  209  2.9%
계룡면  178  2.4%
유구읍  166  2.3%
정안면  125  1.7%
의당면  124  1.7%
사곡면  104  1.4%
Other values (178)  1866  25.5%

Most occurring characters

Value  Count  Frequency (%)
(space)  14428  37.9%
(missing)  2008  5.3%
(missing)  1998  5.2%
(missing)  1996  5.2%
(missing)  1993  5.2%
(missing)  1974  5.2%
(missing)  1973  5.2%
(missing)  1973  5.2%
(missing)  1589  4.2%
(missing)  1418  3.7%
Other values (129)  6736  17.7%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  23611  62.0%
Space Separator  14428  37.9%
Dash Punctuation  32  0.1%
Other Punctuation  13  < 0.1%
Open Punctuation  1  < 0.1%
Close Punctuation  1  < 0.1%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
(missing)  2008  8.5%
(missing)  1998  8.5%
(missing)  1996  8.5%
(missing)  1993  8.4%
(missing)  1974  8.4%
(missing)  1973  8.4%
(missing)  1973  8.4%
(missing)  1589  6.7%
(missing)  1418  6.0%
(missing)  432  1.8%
Other values (124)  6257  26.5%

Space Separator
Value  Count  Frequency (%)
(space)  14428  100.0%

Dash Punctuation
Value  Count  Frequency (%)
-  32  100.0%

Other Punctuation
Value  Count  Frequency (%)
,  13  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  1  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  1  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  23611  62.0%
Common  14475  38.0%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
(missing)  2008  8.5%
(missing)  1998  8.5%
(missing)  1996  8.5%
(missing)  1993  8.4%
(missing)  1974  8.4%
(missing)  1973  8.4%
(missing)  1973  8.4%
(missing)  1589  6.7%
(missing)  1418  6.0%
(missing)  432  1.8%
Other values (124)  6257  26.5%

Common
Value  Count  Frequency (%)
(space)  14428  99.7%
-  32  0.2%
,  13  0.1%
(  1  < 0.1%
)  1  < 0.1%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  23611  62.0%
ASCII  14475  38.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
(space)  14428  99.7%
-  32  0.2%
,  13  0.1%
(  1  < 0.1%
)  1  < 0.1%

Hangul
Value  Count  Frequency (%)
(missing)  2008  8.5%
(missing)  1998  8.5%
(missing)  1996  8.5%
(missing)  1993  8.4%
(missing)  1974  8.4%
(missing)  1973  8.4%
(missing)  1973  8.4%
(missing)  1589  6.7%
(missing)  1418  6.0%
(missing)  432  1.8%
Other values (124)  6257  26.5%

사육두수 (head count)
Real number (ℝ)

ZEROS

Distinct: 223
Distinct (%): 11.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2893.3345
Minimum: 0
Maximum: 600000
Zeros: 55
Zeros (%): 2.8%
Negative: 0
Negative (%): 0.0%
Memory size: 17.5 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 9
Median: 21
Q3: 52
95-th percentile: 13000
Maximum: 600000
Range: 600000
Interquartile range (IQR): 43

Descriptive statistics

Standard deviation: 18640.182
Coefficient of variation (CV): 6.4424564
Kurtosis: 553.81733
Mean: 2893.3345
Median Absolute Deviation (MAD): 15
Skewness: 19.297608
Sum: 5708549
Variance: 3.4745637 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
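
Several of the statistics for 사육두수 are derived from one another. The sketch below rechecks them from the printed inputs (sum, count, standard deviation, quartiles, minimum, maximum), assuming the usual definitions: mean = sum / n, variance = std², CV = std / mean, IQR = Q3 - Q1. The variable names are illustrative, and small differences in the last digits are expected because the inputs are already rounded.

```python
# Inputs copied from the report for 사육두수
n = 1973
total = 5708549
std = 18640.182
q1, q3 = 9, 52
minimum, maximum = 0, 600000

mean = total / n                  # arithmetic mean
variance = std ** 2               # variance is the squared standard deviation
cv = std / mean                   # coefficient of variation
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum   # range

print(f"Mean: {mean:.4f}")
print(f"Variance: {variance:.4e}")
print(f"CV: {cv:.4f}")
print(f"IQR: {iqr}, Range: {value_range}")
```

A CV above 6 together with a median of 21 and a maximum of 600000 points to a heavily right-skewed distribution, consistent with a few very large poultry farms among many small cattle farms.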
Common values

Value  Count  Frequency (%)
10  90  4.6%
20  85  4.3%
30  83  4.2%
15  66  3.3%
5  62  3.1%
2  59  3.0%
7  58  2.9%
0  55  2.8%
11  50  2.5%
4  50  2.5%
Other values (213)  1315  66.6%

Minimum 10 values

Value  Count  Frequency (%)
0  55  2.8%
1  32  1.6%
2  59  3.0%
3  44  2.2%
4  50  2.5%
5  62  3.1%
6  49  2.5%
7  58  2.9%
8  43  2.2%
9  45  2.3%

Maximum 10 values

Value  Count  Frequency (%)
600000  1  0.1%
240000  1  0.1%
140000  2  0.1%
130000  1  0.1%
110000  1  0.1%
100980  1  0.1%
90500  1  0.1%
90000  1  0.1%
85000  1  0.1%
80000  5  0.3%

데이터기준일 (data reference date)
Date

CONSTANT

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.5 KiB
Minimum: 2019-02-13 00:00:00
Maximum: 2019-02-13 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

Variable  주사육업종  사육두수
주사육업종  1.000  0.650
사육두수  0.650  1.000

Variable  사육두수  주사육업종
사육두수  1.000  0.424
주사육업종  0.424  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index  사업장명칭  주사육업종  사업장소재지  사육두수  데이터기준일
0  구룡  돼지  충청남도 공주시 이인면 신영리  340  2019-02-13
1  명승  산란계  충청남도 공주시 계룡면 경천리  23000  2019-02-13
2  주윤  한우  충청남도 공주시 이인면 만수리  8  2019-02-13
3  정모  돼지  충청남도 공주시 정안면 인풍리  100  2019-02-13
4  환만  돼지  충청남도 공주시 우성면 상서리  400  2019-02-13
5  병수  한우  충청남도 공주시 우성면 도천리  5  2019-02-13
6  미성  돼지  충청남도 공주시 계룡면 향지리  1500  2019-02-13
7  평화  돼지  충청남도 공주시 정안면 장원리  1200  2019-02-13
8  석진  한우  충청남도 공주시 우성면 방문리  35  2019-02-13
9  순자  돼지  충청남도 공주시 탄천면 장선리  400  2019-02-13

Last rows

Index  사업장명칭  주사육업종  사업장소재지  사육두수  데이터기준일
1963  녹천농장  한우  충청남도 공주시 유구읍 녹천리  1  2019-02-13
1964  윤진농장  한우  충청남도 공주시 이인면 신흥리  70  2019-02-13
1965  가온누리  한우  충청남도 공주시 유구읍 입석리  15  2019-02-13
1966  들풀농장  육계  충청남도 공주시 우성면 방문리  140000  2019-02-13
1967  달산농장  한우  충청남도 공주시 이인면 달리  55  2019-02-13
1968  예동농장  한우  충청남도 공주시 이인면 구암리  9  2019-02-13
1969  부자농장  한우  충청남도 공주시 우성면 방문리  150  2019-02-13
1970  현대농장  한우  충청남도 공주시 신풍면 쌍대리  20  2019-02-13
1971  행운농장  한우  충청남도 공주시 우성면 상서리  8  2019-02-13
1972  좌성축산  한우  충청남도 공주시 계룡면 상성리  15  2019-02-13

Duplicate rows

Most frequently occurring

Index  사업장명칭  주사육업종  사업장소재지  사육두수  데이터기준일  # duplicates
0  계룡농장  육계  충청남도 공주시 계룡면 월곡리  40000  2019-02-13  2
1  금강제2농장  한우  충청남도 공주시 의당면 수촌리  110  2019-02-13  2
2  남산  한우  충청남도 공주시 탄천면 남리  26  2019-02-13  2
3  내산  한우  충청남도 공주시 우성면 내리  20  2019-02-13  2
4  우리농장  한우  충청남도 공주시  30  2019-02-13  2
5  우리농장  한우  충청남도 공주시 우성면 방흥리  3  2019-02-13  2
6  축사  한우  충청남도 공주시  11  2019-02-13  2
7  텃골  한우  충청남도 공주시 우성면 보흥리  20  2019-02-13  2
8  황금  육계  충청남도 공주시 우성면 목천리  36000  2019-02-13  2
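
A duplicate-row listing like this one can be produced by counting identical records. The sketch below uses `collections.Counter` on a handful of rows taken from this report; it is an illustration of the idea under the assumption that a duplicate means all five fields match exactly, not a description of how the profiler implements it.

```python
from collections import Counter

# (사업장명칭, 주사육업종, 사업장소재지, 사육두수, 데이터기준일)
rows = [
    ("내산", "한우", "충청남도 공주시 우성면 내리", 20, "2019-02-13"),
    ("황금", "육계", "충청남도 공주시 우성면 목천리", 36000, "2019-02-13"),
    ("내산", "한우", "충청남도 공주시 우성면 내리", 20, "2019-02-13"),
    ("구룡", "돼지", "충청남도 공주시 이인면 신영리", 340, "2019-02-13"),
    ("황금", "육계", "충청남도 공주시 우성면 목천리", 36000, "2019-02-13"),
]

# Tuples are hashable, so each full record can serve as a Counter key
occurrences = Counter(rows)
duplicates = {row: n for row, n in occurrences.items() if n > 1}

for row, n in duplicates.items():
    print(row[0], row[1], row[3], "occurs", n, "times")
```

Because the comparison is exact, rows that differ only in whitespace within the address would not be grouped together; normalising such fields first is a separate design decision.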