Overview

Dataset statistics

Number of variables6
Number of observations156
Missing cells10
Missing cells (%)1.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.6 KiB
Average record size in memory49.8 B

Variable types

Text3
Categorical1
Numeric1
DateTime1

Dataset

Description충청북도 단양군 내 축산업 등록현황으로 사업장명칭 사업장 소재지(지번주소), 사업장소재지(도로명주소), 사육업종, 사육두수 등의 항목을 제공
URLhttps://www.data.go.kr/data/15006930/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
주사육업종 is highly imbalanced (59.9%)Imbalance
사업장소재지(도로명) has 10 (6.4%) missing valuesMissing

Reproduction

Analysis started2023-12-12 09:58:26.636099
Analysis finished2023-12-12 09:58:27.174095
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct154
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T18:58:27.360586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length9
Mean length5.0064103
Min length3

Characters and Unicode

Total characters781
Distinct characters168
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique152 ?
Unique (%)97.4%

Sample

1st row부유농원
2nd row맹도재농장
3rd row심원식 농장
4th row유규열 농장
5th row밤재 농장
ValueCountFrequency (%)
농장 69
28.7%
목장 4
 
1.7%
축산 3
 
1.2%
ok 2
 
0.8%
협업농장 2
 
0.8%
농원 2
 
0.8%
용소 2
 
0.8%
김천일 1
 
0.4%
부유농원 1
 
0.4%
시온농장 1
 
0.4%
Other values (153) 153
63.7%
2023-12-12T18:58:27.828821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
142
18.2%
136
 
17.4%
84
 
10.8%
18
 
2.3%
11
 
1.4%
11
 
1.4%
10
 
1.3%
10
 
1.3%
9
 
1.2%
8
 
1.0%
Other values (158) 342
43.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 692
88.6%
Space Separator 84
 
10.8%
Uppercase Letter 4
 
0.5%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
142
20.5%
136
19.7%
18
 
2.6%
11
 
1.6%
11
 
1.6%
10
 
1.4%
10
 
1.4%
9
 
1.3%
8
 
1.2%
8
 
1.2%
Other values (154) 329
47.5%
Uppercase Letter
ValueCountFrequency (%)
K 2
50.0%
O 2
50.0%
Space Separator
ValueCountFrequency (%)
84
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 692
88.6%
Common 85
 
10.9%
Latin 4
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
142
20.5%
136
19.7%
18
 
2.6%
11
 
1.6%
11
 
1.6%
10
 
1.4%
10
 
1.4%
9
 
1.3%
8
 
1.2%
8
 
1.2%
Other values (154) 329
47.5%
Common
ValueCountFrequency (%)
84
98.8%
1 1
 
1.2%
Latin
ValueCountFrequency (%)
K 2
50.0%
O 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 692
88.6%
ASCII 89
 
11.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
142
20.5%
136
19.7%
18
 
2.6%
11
 
1.6%
11
 
1.6%
10
 
1.4%
10
 
1.4%
9
 
1.3%
8
 
1.2%
8
 
1.2%
Other values (154) 329
47.5%
ASCII
ValueCountFrequency (%)
84
94.4%
K 2
 
2.2%
O 2
 
2.2%
1 1
 
1.1%

주사육업종
Categorical

IMBALANCE 

Distinct8
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
한우
123 
종계/산란계
16 
육계
 
6
염소
 
5
돼지
 
2
Other values (3)
 
4

Length

Max length6
Median length2
Mean length2.4102564
Min length2

Unique

Unique2 ?
Unique (%)1.3%

Sample

1st row종계/산란계
2nd row한우
3rd row종계/산란계
4th row종계/산란계
5th row종계/산란계

Common Values

ValueCountFrequency (%)
한우 123
78.8%
종계/산란계 16
 
10.3%
육계 6
 
3.8%
염소 5
 
3.2%
돼지 2
 
1.3%
사슴 2
 
1.3%
산양 1
 
0.6%
젖소 1
 
0.6%

Length

2023-12-12T18:58:28.066102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:58:28.251635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한우 123
78.8%
종계/산란계 16
 
10.3%
육계 6
 
3.8%
염소 5
 
3.2%
돼지 2
 
1.3%
사슴 2
 
1.3%
산양 1
 
0.6%
젖소 1
 
0.6%
Distinct116
Distinct (%)74.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T18:58:28.748145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length31
Mean length20.121795
Min length4

Characters and Unicode

Total characters3139
Distinct characters99
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique114 ?
Unique (%)73.1%

Sample

1st row충청북도 단양군 영춘면 유암리 425번지
2nd row충청북도 단양군 가곡면 보발리 414번지
3rd row충청북도 단양군 영춘면 상리 4번지
4th row충청북도 단양군 영춘면 상리 187번지
5th row충청북도 단양군 영춘면 하리 54번지 2호
ValueCountFrequency (%)
충청북도 116
18.0%
단양군 116
18.0%
영춘면 47
 
7.3%
1호 20
 
3.1%
가곡면 16
 
2.5%
어상천면 15
 
2.3%
대강면 14
 
2.2%
동대리 12
 
1.9%
만종리 12
 
1.9%
적성면 11
 
1.7%
Other values (172) 267
41.3%
2023-12-12T18:58:29.302218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
916
29.2%
121
 
3.9%
120
 
3.8%
119
 
3.8%
119
 
3.8%
118
 
3.8%
117
 
3.7%
116
 
3.7%
116
 
3.7%
116
 
3.7%
Other values (89) 1161
37.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1831
58.3%
Space Separator 916
29.2%
Decimal Number 389
 
12.4%
Other Punctuation 2
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
121
 
6.6%
120
 
6.6%
119
 
6.5%
119
 
6.5%
118
 
6.4%
117
 
6.4%
116
 
6.3%
116
 
6.3%
116
 
6.3%
116
 
6.3%
Other values (76) 653
35.7%
Decimal Number
ValueCountFrequency (%)
1 74
19.0%
4 55
14.1%
2 51
13.1%
3 43
11.1%
5 34
8.7%
8 33
8.5%
9 28
 
7.2%
0 27
 
6.9%
6 22
 
5.7%
7 22
 
5.7%
Space Separator
ValueCountFrequency (%)
916
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1831
58.3%
Common 1308
41.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
121
 
6.6%
120
 
6.6%
119
 
6.5%
119
 
6.5%
118
 
6.4%
117
 
6.4%
116
 
6.3%
116
 
6.3%
116
 
6.3%
116
 
6.3%
Other values (76) 653
35.7%
Common
ValueCountFrequency (%)
916
70.0%
1 74
 
5.7%
4 55
 
4.2%
2 51
 
3.9%
3 43
 
3.3%
5 34
 
2.6%
8 33
 
2.5%
9 28
 
2.1%
0 27
 
2.1%
6 22
 
1.7%
Other values (3) 25
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1831
58.3%
ASCII 1308
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
916
70.0%
1 74
 
5.7%
4 55
 
4.2%
2 51
 
3.9%
3 43
 
3.3%
5 34
 
2.6%
8 33
 
2.5%
9 28
 
2.1%
0 27
 
2.1%
6 22
 
1.7%
Other values (3) 25
 
1.9%
Hangul
ValueCountFrequency (%)
121
 
6.6%
120
 
6.6%
119
 
6.5%
119
 
6.5%
118
 
6.4%
117
 
6.4%
116
 
6.3%
116
 
6.3%
116
 
6.3%
116
 
6.3%
Other values (76) 653
35.7%
Distinct144
Distinct (%)98.6%
Missing10
Missing (%)6.4%
Memory size1.3 KiB
2023-12-12T18:58:29.679415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length29
Mean length22.410959
Min length19

Characters and Unicode

Total characters3272
Distinct characters118
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique142 ?
Unique (%)97.3%

Sample

1st row충청북도 단양군 영춘면 명전길 9-53
2nd row충청북도 단양군 가곡면 보발곰절길 94-4
3rd row충청북도 단양군 영춘면 강변로 850-31
4th row충청북도 단양군 영춘면 강변로 824
5th row충청북도 단양군 영춘면 영부로 2700
ValueCountFrequency (%)
충청북도 146
19.8%
단양군 146
19.8%
영춘면 62
 
8.4%
가곡면 30
 
4.1%
대강면 15
 
2.0%
적성면 14
 
1.9%
어상천면 13
 
1.8%
영부로 9
 
1.2%
매포읍 9
 
1.2%
별방만종로 7
 
1.0%
Other values (218) 285
38.7%
2023-12-12T18:58:30.293446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
590
18.0%
155
 
4.7%
154
 
4.7%
150
 
4.6%
149
 
4.6%
146
 
4.5%
146
 
4.5%
146
 
4.5%
135
 
4.1%
1 117
 
3.6%
Other values (108) 1384
42.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2024
61.9%
Space Separator 590
 
18.0%
Decimal Number 569
 
17.4%
Dash Punctuation 82
 
2.5%
Other Punctuation 5
 
0.2%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
155
 
7.7%
154
 
7.6%
150
 
7.4%
149
 
7.4%
146
 
7.2%
146
 
7.2%
146
 
7.2%
135
 
6.7%
98
 
4.8%
75
 
3.7%
Other values (93) 670
33.1%
Decimal Number
ValueCountFrequency (%)
1 117
20.6%
2 95
16.7%
3 61
10.7%
4 59
10.4%
5 47
8.3%
9 45
 
7.9%
8 44
 
7.7%
7 42
 
7.4%
6 31
 
5.4%
0 28
 
4.9%
Space Separator
ValueCountFrequency (%)
590
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 82
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2024
61.9%
Common 1248
38.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
155
 
7.7%
154
 
7.6%
150
 
7.4%
149
 
7.4%
146
 
7.2%
146
 
7.2%
146
 
7.2%
135
 
6.7%
98
 
4.8%
75
 
3.7%
Other values (93) 670
33.1%
Common
ValueCountFrequency (%)
590
47.3%
1 117
 
9.4%
2 95
 
7.6%
- 82
 
6.6%
3 61
 
4.9%
4 59
 
4.7%
5 47
 
3.8%
9 45
 
3.6%
8 44
 
3.5%
7 42
 
3.4%
Other values (5) 66
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2024
61.9%
ASCII 1248
38.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
590
47.3%
1 117
 
9.4%
2 95
 
7.6%
- 82
 
6.6%
3 61
 
4.9%
4 59
 
4.7%
5 47
 
3.8%
9 45
 
3.6%
8 44
 
3.5%
7 42
 
3.4%
Other values (5) 66
 
5.3%
Hangul
ValueCountFrequency (%)
155
 
7.7%
154
 
7.6%
150
 
7.4%
149
 
7.4%
146
 
7.2%
146
 
7.2%
146
 
7.2%
135
 
6.7%
98
 
4.8%
75
 
3.7%
Other values (93) 670
33.1%

사육두수
Real number (ℝ)

Distinct64
Distinct (%)41.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2840.0256
Minimum1
Maximum60000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-12T18:58:30.513948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q15
median16
Q344.25
95-th percentile14500
Maximum60000
Range59999
Interquartile range (IQR)39.25

Descriptive statistics

Standard deviation9422.681
Coefficient of variation (CV)3.3178155
Kurtosis18.753723
Mean2840.0256
Median Absolute Deviation (MAD)13
Skewness4.2342651
Sum443044
Variance88786918
MonotonicityNot monotonic
2023-12-12T18:58:30.713432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 10
 
6.4%
2 10
 
6.4%
3 8
 
5.1%
13 7
 
4.5%
20 6
 
3.8%
4 6
 
3.8%
5 6
 
3.8%
8 6
 
3.8%
11 5
 
3.2%
10 5
 
3.2%
Other values (54) 87
55.8%
ValueCountFrequency (%)
1 10
6.4%
2 10
6.4%
3 8
5.1%
4 6
3.8%
5 6
3.8%
6 3
 
1.9%
7 2
 
1.3%
8 6
3.8%
9 1
 
0.6%
10 5
3.2%
ValueCountFrequency (%)
60000 1
0.6%
53000 1
0.6%
45000 1
0.6%
42000 1
0.6%
40000 1
0.6%
35000 1
0.6%
17000 1
0.6%
16000 1
0.6%
14000 1
0.6%
13000 2
1.3%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
Minimum2023-04-19 00:00:00
Maximum2023-04-19 00:00:00
2023-12-12T18:58:30.875579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:58:30.985441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T18:58:26.909127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:58:31.054773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주사육업종사육두수
주사육업종1.0000.868
사육두수0.8681.000
2023-12-12T18:58:31.483940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육두수주사육업종
사육두수1.0000.470
주사육업종0.4701.000

Missing values

2023-12-12T18:58:27.027020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:58:27.133734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명칭주사육업종사업장소재지(지번)사업장소재지(도로명)사육두수데이터기준일자
0부유농원종계/산란계충청북도 단양군 영춘면 유암리 425번지충청북도 단양군 영춘면 명전길 9-53170002023-04-19
1맹도재농장한우충청북도 단양군 가곡면 보발리 414번지충청북도 단양군 가곡면 보발곰절길 94-482023-04-19
2심원식 농장종계/산란계충청북도 단양군 영춘면 상리 4번지충청북도 단양군 영춘면 강변로 850-31110002023-04-19
3유규열 농장종계/산란계충청북도 단양군 영춘면 상리 187번지충청북도 단양군 영춘면 강변로 82480002023-04-19
4밤재 농장종계/산란계충청북도 단양군 영춘면 하리 54번지 2호충청북도 단양군 영춘면 영부로 2700130002023-04-19
5북벽 농장종계/산란계충청북도 단양군 영춘면 상리 545번지충청북도 단양군 영춘면 강변로 727-1990002023-04-19
6나봉주 농장종계/산란계충청북도 단양군 영춘면 동대리 91번지 양계장충청북도 단양군 영춘면 영부로 2191-37, 양계장30002023-04-19
7박명종 농장종계/산란계충청북도 단양군 영춘면 동대리 531번지충청북도 단양군 영춘면 동대3길 29140002023-04-19
8양지농장종계/산란계충청북도 단양군 영춘면 동대리 750번지 2호 양계장충청북도 단양군 영춘면 동대2길 26, 양계장160002023-04-19
9병두 농장종계/산란계충청북도 단양군 영춘면 용진리 522번지 , 434번지충청북도 단양군 영춘면 용진3길 35 (, 434번지)120002023-04-19
사업장명칭주사육업종사업장소재지(지번)사업장소재지(도로명)사육두수데이터기준일자
146율곡 농장한우충청북도 단양군 어상천면 율곡리 269번지 4호충청북도 단양군 어상천면 어상천로 484-1732023-04-19
147팔형제 농장한우충청북도 단양군 대강면 직티리 247번지 5호<NA>172023-04-19
148명전 농장한우충청북도 단양군 영춘면 유암리 536번지충청북도 단양군 영춘면 명전1길 94-12352023-04-19
149호 농장한우충청북도 단양군 영춘면 동대리 271번지충청북도 단양군 영춘면 동대5길 132-3202023-04-19
150자연인 농장한우충청북도 단양군 단성면 대잠리 319번지<NA>22023-04-19
151경희 농장한우충청북도 단양군 매포읍 응실리 202번지 외 1필지충청북도 단양군 매포읍 단양로 1381-5432023-04-19
152평강농장한우충청북도 단양군 영춘면 만종리 306번지충청북도 단양군 영춘면 만종2길 6142023-04-19
153우리농장한우충청북도 단양군 영춘면 사이곡리 65번지 3호충청북도 단양군 영춘면 사이곡3길 24-742023-04-19
154마추피추 농장종계/산란계충청북도 단양군 단성면 가산리 92번지 1호충청북도 단양군 단성면 사인암로 42-29402023-04-19
155백자리한우농장한우충청북도 단양군 영춘면 백자리 50번지 3호충청북도 단양군 영춘면 구인사로 1186-122023-04-19