Overview

Dataset statistics

Number of variables: 5
Number of observations: 340
Missing cells: 43
Missing cells (%): 2.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 13.7 KiB
Average record size in memory: 41.4 B
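
As a quick sanity check, the missing-cell percentage can be reproduced from the counts above. This is a minimal sketch that assumes the percentage is taken over every cell in the table (variables times observations); the variable names are illustrative only.

```python
# Counts copied from the dataset statistics above.
n_variables = 5
n_observations = 340
missing_cells = 43

# Assumption: the percentage is computed over all cells in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```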

Variable types

Text: 3
Categorical: 1
Numeric: 1

Dataset

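Description: Data showing the livestock farms (ranches) operating in Jincheon-gun, with each site's main livestock type, address (lot number and road name), and number of head raised.
Author: Jincheon-gun, Chungcheongbuk-do (충청북도 진천군)
URL: https://www.data.go.kr/data/15043096/fileData.do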

Alerts

사업장소재지(도로명) (site address, road name) has 42 (12.4%) missing values [Missing]
사육두수 (number of head raised) has 4 (1.2%) zeros [Zeros]

Reproduction

Analysis started: 2024-03-16 06:35:28.810901
Analysis finished: 2024-03-16 06:35:31.371125
Duration: 2.56 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
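
The reported duration follows directly from the two timestamps. A small sketch using only the Python standard library, with the timestamps copied from the block above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2024-03-16 06:35:28.810901")
finished = datetime.fromisoformat("2024-03-16 06:35:31.371125")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```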

Variables

사업장명칭
Text

Distinct: 316
Distinct (%): 92.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 10
Median length: 4
Mean length: 4.35
Min length: 3

Characters and Unicode

Total characters: 1479
Distinct characters: 219
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
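
The category breakdown reported for text variables can be approximated with Python's standard unicodedata module. The sketch below is an illustration, not the profiler's own code: it tallies the Unicode general category of each character (Lo = Other Letter, Nl = Letter Number, Zs = Space Separator). The standard library exposes general categories but not scripts or blocks, so only the category view is covered. The helper name category_counts is hypothetical.

```python
import unicodedata
from collections import Counter

def category_counts(text: str) -> Counter:
    """Count the Unicode general category (e.g. 'Lo', 'Zs') of each character."""
    return Counter(unicodedata.category(ch) for ch in text)

# Two names taken from the sample rows of this report.
# "\u2161" is the Roman numeral two, which falls in category Nl (Letter Number).
print(category_counts("두레목장\u2161"))
# This name contains a space, which falls in category Zs (Space Separator).
print(category_counts("걸포 목장"))
```

A Roman numeral in a farm name is enough to explain why a column of Hangul names also reports the Letter Number category.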

Unique

Unique: 297
Unique (%): 87.4%

Sample

1st row: 양산농장
2nd row: 수지농장
3rd row: 지구목장
4th row: 용강농장
5th row: 하백목장
Value | Count | Frequency (%)
농장 | 34 | 8.8%
양산농장 | 5 | 1.3%
목장 | 4 | 1.0%
금성농장 | 3 | 0.8%
구암농장 | 3 | 0.8%
은혜목장 | 2 | 0.5%
구정농장 | 2 | 0.5%
개미목장 | 2 | 0.5%
우리농장 | 2 | 0.5%
현승목장 | 2 | 0.5%
Other values (316) | 327 | 84.7%

Most occurring characters

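Value | Count | Frequency (%)
(not preserved) | 334 | 22.6%
(not preserved) | 220 | 14.9%
(not preserved) | 113 | 7.6%
(space) | 46 | 3.1%
(not preserved) | 26 | 1.8%
(not preserved) | 20 | 1.4%
(not preserved) | 19 | 1.3%
(not preserved) | 18 | 1.2%
(not preserved) | 18 | 1.2%
(not preserved) | 18 | 1.2%
Other values (209) | 647 | 43.7%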

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1417 | 95.8%
Space Separator | 46 | 3.1%
Decimal Number | 8 | 0.5%
Uppercase Letter | 4 | 0.3%
Letter Number | 4 | 0.3%

Most frequent character per category

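Other Letter
Value | Count | Frequency (%)
(not preserved) | 334 | 23.6%
(not preserved) | 220 | 15.5%
(not preserved) | 113 | 8.0%
(not preserved) | 26 | 1.8%
(not preserved) | 20 | 1.4%
(not preserved) | 19 | 1.3%
(not preserved) | 18 | 1.3%
(not preserved) | 18 | 1.3%
(not preserved) | 18 | 1.3%
(not preserved) | 16 | 1.1%
Other values (202) | 615 | 43.4%

Decimal Number
Value | Count | Frequency (%)
2 | 7 | 87.5%
1 | 1 | 12.5%

Uppercase Letter
Value | Count | Frequency (%)
Y | 2 | 50.0%
K | 2 | 50.0%

Letter Number
Value | Count | Frequency (%)
(not preserved) | 2 | 50.0%
(not preserved) | 2 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 46 | 100.0%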

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1417 | 95.8%
Common | 54 | 3.7%
Latin | 8 | 0.5%

Most frequent character per script

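Hangul
Value | Count | Frequency (%)
(not preserved) | 334 | 23.6%
(not preserved) | 220 | 15.5%
(not preserved) | 113 | 8.0%
(not preserved) | 26 | 1.8%
(not preserved) | 20 | 1.4%
(not preserved) | 19 | 1.3%
(not preserved) | 18 | 1.3%
(not preserved) | 18 | 1.3%
(not preserved) | 18 | 1.3%
(not preserved) | 16 | 1.1%
Other values (202) | 615 | 43.4%

Latin
Value | Count | Frequency (%)
Y | 2 | 25.0%
(not preserved) | 2 | 25.0%
(not preserved) | 2 | 25.0%
K | 2 | 25.0%

Common
Value | Count | Frequency (%)
(space) | 46 | 85.2%
2 | 7 | 13.0%
1 | 1 | 1.9%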

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1417 | 95.8%
ASCII | 58 | 3.9%
Number Forms | 4 | 0.3%

Most frequent character per block

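Hangul
Value | Count | Frequency (%)
(not preserved) | 334 | 23.6%
(not preserved) | 220 | 15.5%
(not preserved) | 113 | 8.0%
(not preserved) | 26 | 1.8%
(not preserved) | 20 | 1.4%
(not preserved) | 19 | 1.3%
(not preserved) | 18 | 1.3%
(not preserved) | 18 | 1.3%
(not preserved) | 18 | 1.3%
(not preserved) | 16 | 1.1%
Other values (202) | 615 | 43.4%

ASCII
Value | Count | Frequency (%)
(space) | 46 | 79.3%
2 | 7 | 12.1%
Y | 2 | 3.4%
K | 2 | 3.4%
1 | 1 | 1.7%

Number Forms
Value | Count | Frequency (%)
(not preserved) | 2 | 50.0%
(not preserved) | 2 | 50.0%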

주사육업종
Categorical

Distinct: 2
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
한우 (Hanwoo beef cattle): 281
젖소 (dairy cattle): 59

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 한우
2nd row: 한우
3rd row: 한우
4th row: 한우
5th row: 한우

Common Values

Value | Count | Frequency (%)
한우 | 281 | 82.6%
젖소 | 59 | 17.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
한우 | 281 | 82.6%
젖소 | 59 | 17.4%

사업장소재지(지번)
Text

Distinct: 336
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 60
Median length: 57
Mean length: 26.194118
Min length: 4

Characters and Unicode

Total characters: 8906
Distinct characters: 96
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 333
Unique (%): 97.9%

Sample

1st row: 충청북도 진천군 초평면 용기리 732번지
2nd row: 충청북도 진천군 덕산읍 화상리 486번지 18호 , 486-1, 486-11, 486-12
3rd row: 충청북도 진천군 백곡면 사송리 790번지
4th row: 충청북도 진천군 초평면 용산리 297번지
5th row: 충청북도 진천군 백곡면 양백리 233번지 3호
Value | Count | Frequency (%)
충청북도 | 337 | 17.8%
진천군 | 337 | 17.8%
초평면 | 88 | 4.6%
이월면 | 61 | 3.2%
1호 | 51 | 2.7%
진천읍 | 40 | 2.1%
문백면 | 39 | 2.1%
광혜원면 | 39 | 2.1%
용기리 | 38 | 2.0%
백곡면 | 37 | 2.0%
Other values (406) | 826 | 43.6%

Most occurring characters

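Value | Count | Frequency (%)
(space) | 2211 | 24.8%
(not preserved) | 389 | 4.4%
(not preserved) | 378 | 4.2%
(not preserved) | 356 | 4.0%
(not preserved) | 339 | 3.8%
(not preserved) | 339 | 3.8%
(not preserved) | 337 | 3.8%
(not preserved) | 337 | 3.8%
(not preserved) | 337 | 3.8%
(not preserved) | 337 | 3.8%
Other values (86) | 3546 | 39.8%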

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5294 | 59.4%
Space Separator | 2211 | 24.8%
Decimal Number | 1326 | 14.9%
Dash Punctuation | 32 | 0.4%
Other Punctuation | 27 | 0.3%
Close Punctuation | 8 | 0.1%
Open Punctuation | 8 | 0.1%

Most frequent character per category

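Other Letter
Value | Count | Frequency (%)
(not preserved) | 389 | 7.3%
(not preserved) | 378 | 7.1%
(not preserved) | 356 | 6.7%
(not preserved) | 339 | 6.4%
(not preserved) | 339 | 6.4%
(not preserved) | 337 | 6.4%
(not preserved) | 337 | 6.4%
(not preserved) | 337 | 6.4%
(not preserved) | 337 | 6.4%
(not preserved) | 336 | 6.3%
Other values (71) | 1809 | 34.2%

Decimal Number
Value | Count | Frequency (%)
1 | 231 | 17.4%
2 | 178 | 13.4%
3 | 150 | 11.3%
7 | 132 | 10.0%
4 | 125 | 9.4%
5 | 119 | 9.0%
6 | 113 | 8.5%
8 | 107 | 8.1%
9 | 89 | 6.7%
0 | 82 | 6.2%

Space Separator
Value | Count | Frequency (%)
(space) | 2211 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 32 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 27 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 8 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 8 | 100.0%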

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5294 | 59.4%
Common | 3612 | 40.6%

Most frequent character per script

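Hangul
Value | Count | Frequency (%)
(not preserved) | 389 | 7.3%
(not preserved) | 378 | 7.1%
(not preserved) | 356 | 6.7%
(not preserved) | 339 | 6.4%
(not preserved) | 339 | 6.4%
(not preserved) | 337 | 6.4%
(not preserved) | 337 | 6.4%
(not preserved) | 337 | 6.4%
(not preserved) | 337 | 6.4%
(not preserved) | 336 | 6.3%
Other values (71) | 1809 | 34.2%

Common
Value | Count | Frequency (%)
(space) | 2211 | 61.2%
1 | 231 | 6.4%
2 | 178 | 4.9%
3 | 150 | 4.2%
7 | 132 | 3.7%
4 | 125 | 3.5%
5 | 119 | 3.3%
6 | 113 | 3.1%
8 | 107 | 3.0%
9 | 89 | 2.5%
Other values (5) | 157 | 4.3%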

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5294 | 59.4%
ASCII | 3612 | 40.6%

Most frequent character per block

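ASCII
Value | Count | Frequency (%)
(space) | 2211 | 61.2%
1 | 231 | 6.4%
2 | 178 | 4.9%
3 | 150 | 4.2%
7 | 132 | 3.7%
4 | 125 | 3.5%
5 | 119 | 3.3%
6 | 113 | 3.1%
8 | 107 | 3.0%
9 | 89 | 2.5%
Other values (5) | 157 | 4.3%

Hangul
Value | Count | Frequency (%)
(not preserved) | 389 | 7.3%
(not preserved) | 378 | 7.1%
(not preserved) | 356 | 6.7%
(not preserved) | 339 | 6.4%
(not preserved) | 339 | 6.4%
(not preserved) | 337 | 6.4%
(not preserved) | 337 | 6.4%
(not preserved) | 337 | 6.4%
(not preserved) | 337 | 6.4%
(not preserved) | 336 | 6.3%
Other values (71) | 1809 | 34.2%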

사업장소재지(도로명)
Text

Distinct: 290
Distinct (%): 97.3%
Missing: 42
Missing (%): 12.4%
Memory size: 2.8 KiB

Length

Max length: 32
Median length: 29
Mean length: 21.996644
Min length: 18

Characters and Unicode

Total characters: 6555
Distinct characters: 120
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 283
Unique (%): 95.0%

Sample

1st row: 충청북도 진천군 초평면 용전2길 40
2nd row: 충청북도 진천군 덕산읍 습지길 114-33
3rd row: 충청북도 진천군 백곡면 지구길 27-20
4th row: 충청북도 진천군 초평면 대구동길 13
5th row: 충청북도 진천군 백곡면 양백안길 6-8
Value | Count | Frequency (%)
충청북도 | 298 | 19.9%
진천군 | 298 | 19.9%
초평면 | 75 | 5.0%
이월면 | 54 | 3.6%
진천읍 | 38 | 2.5%
문백면 | 35 | 2.3%
광혜원면 | 34 | 2.3%
백곡면 | 32 | 2.1%
덕산읍 | 30 | 2.0%
구정로 | 13 | 0.9%
Other values (384) | 591 | 39.5%

Most occurring characters

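Value | Count | Frequency (%)
(space) | 1201 | 18.3%
(not preserved) | 356 | 5.4%
(not preserved) | 337 | 5.1%
(not preserved) | 304 | 4.6%
(not preserved) | 299 | 4.6%
(not preserved) | 298 | 4.5%
(not preserved) | 298 | 4.5%
(not preserved) | 298 | 4.5%
1 | 253 | 3.9%
(not preserved) | 230 | 3.5%
Other values (110) | 2681 | 40.9%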

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3965 | 60.5%
Space Separator | 1201 | 18.3%
Decimal Number | 1183 | 18.0%
Dash Punctuation | 197 | 3.0%
Other Punctuation | 3 | < 0.1%
Open Punctuation | 3 | < 0.1%
Close Punctuation | 3 | < 0.1%

Most frequent character per category

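Other Letter
Value | Count | Frequency (%)
(not preserved) | 356 | 9.0%
(not preserved) | 337 | 8.5%
(not preserved) | 304 | 7.7%
(not preserved) | 299 | 7.5%
(not preserved) | 298 | 7.5%
(not preserved) | 298 | 7.5%
(not preserved) | 298 | 7.5%
(not preserved) | 230 | 5.8%
(not preserved) | 183 | 4.6%
(not preserved) | 115 | 2.9%
Other values (95) | 1247 | 31.5%

Decimal Number
Value | Count | Frequency (%)
1 | 253 | 21.4%
2 | 158 | 13.4%
3 | 141 | 11.9%
4 | 127 | 10.7%
6 | 105 | 8.9%
9 | 90 | 7.6%
5 | 85 | 7.2%
7 | 84 | 7.1%
0 | 72 | 6.1%
8 | 68 | 5.7%

Space Separator
Value | Count | Frequency (%)
(space) | 1201 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 197 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 3 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%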

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3965 | 60.5%
Common | 2590 | 39.5%

Most frequent character per script

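Hangul
Value | Count | Frequency (%)
(not preserved) | 356 | 9.0%
(not preserved) | 337 | 8.5%
(not preserved) | 304 | 7.7%
(not preserved) | 299 | 7.5%
(not preserved) | 298 | 7.5%
(not preserved) | 298 | 7.5%
(not preserved) | 298 | 7.5%
(not preserved) | 230 | 5.8%
(not preserved) | 183 | 4.6%
(not preserved) | 115 | 2.9%
Other values (95) | 1247 | 31.5%

Common
Value | Count | Frequency (%)
(space) | 1201 | 46.4%
1 | 253 | 9.8%
- | 197 | 7.6%
2 | 158 | 6.1%
3 | 141 | 5.4%
4 | 127 | 4.9%
6 | 105 | 4.1%
9 | 90 | 3.5%
5 | 85 | 3.3%
7 | 84 | 3.2%
Other values (5) | 149 | 5.8%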

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3965 | 60.5%
ASCII | 2590 | 39.5%

Most frequent character per block

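ASCII
Value | Count | Frequency (%)
(space) | 1201 | 46.4%
1 | 253 | 9.8%
- | 197 | 7.6%
2 | 158 | 6.1%
3 | 141 | 5.4%
4 | 127 | 4.9%
6 | 105 | 4.1%
9 | 90 | 3.5%
5 | 85 | 3.3%
7 | 84 | 3.2%
Other values (5) | 149 | 5.8%

Hangul
Value | Count | Frequency (%)
(not preserved) | 356 | 9.0%
(not preserved) | 337 | 8.5%
(not preserved) | 304 | 7.7%
(not preserved) | 299 | 7.5%
(not preserved) | 298 | 7.5%
(not preserved) | 298 | 7.5%
(not preserved) | 298 | 7.5%
(not preserved) | 230 | 5.8%
(not preserved) | 183 | 4.6%
(not preserved) | 115 | 2.9%
Other values (95) | 1247 | 31.5%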

사육두수
Real number (ℝ)

ZEROS 

Distinct: 138
Distinct (%): 40.7%
Missing: 1
Missing (%): 0.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 79.120944
Minimum: 0
Maximum: 500
Zeros: 4
Zeros (%): 1.2%
Negative: 0
Negative (%): 0.0%
Memory size: 3.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 5.9
Q1: 20
Median: 54
Q3: 110
95th percentile: 215.2
Maximum: 500
Range: 500
Interquartile range (IQR): 90

Descriptive statistics

Standard deviation: 77.16627
Coefficient of variation (CV): 0.97529512
Kurtosis: 4.6851931
Mean: 79.120944
Median Absolute Deviation (MAD): 39
Skewness: 1.8533012
Sum: 26822
Variance: 5954.6333
Monotonicity: Not monotonic
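
Several of the figures reported for 사육두수 are linked by simple identities, which makes them easy to cross-check. The sketch below uses only values copied from this report and assumes that the mean and sum are taken over the 339 non-missing values (340 observations minus 1 missing); the variable names are illustrative.

```python
import math

# Values copied from the report for 사육두수.
n_non_missing = 340 - 1           # observations minus missing values (assumption)
mean = 79.120944
std_dev = 77.16627
q1, q3 = 20, 110
minimum, maximum = 0, 500

cv = std_dev / mean               # coefficient of variation
variance = std_dev ** 2           # variance is the squared standard deviation
total = mean * n_non_missing      # should land on the reported Sum
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum

print(f"CV={cv:.8f} variance={variance:.4f} sum={total:.0f} "
      f"IQR={iqr} range={value_range}")
```

Agreement with the reported CV, Variance, Sum, IQR and Range also confirms how the fused quantile labels split: Q1 is 20 and Q3 is 110.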
[Figure: Histogram with fixed size bins (bins=50)]
Common values
Value | Count | Frequency (%)
20 | 14 | 4.1%
50 | 13 | 3.8%
15 | 12 | 3.5%
80 | 11 | 3.2%
100 | 10 | 2.9%
150 | 8 | 2.4%
40 | 8 | 2.4%
10 | 7 | 2.1%
30 | 7 | 2.1%
60 | 7 | 2.1%
Other values (128) | 242 | 71.2%

Minimum 10 values
Value | Count | Frequency (%)
0 | 4 | 1.2%
2 | 4 | 1.2%
3 | 2 | 0.6%
4 | 2 | 0.6%
5 | 5 | 1.5%
6 | 1 | 0.3%
7 | 4 | 1.2%
8 | 3 | 0.9%
9 | 3 | 0.9%
10 | 7 | 2.1%

Maximum 10 values
Value | Count | Frequency (%)
500 | 1 | 0.3%
411 | 1 | 0.3%
400 | 1 | 0.3%
381 | 1 | 0.3%
330 | 1 | 0.3%
300 | 3 | 0.9%
297 | 1 | 0.3%
273 | 1 | 0.3%
271 | 1 | 0.3%
270 | 1 | 0.3%

Interactions

[Figure: interactions plot]

Correlations

Variable | 주사육업종 | 사육두수
주사육업종 | 1.000 | 0.285
사육두수 | 0.285 | 1.000

Variable | 사육두수 | 주사육업종
사육두수 | 1.000 | 0.152
주사육업종 | 0.152 | 1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix: a data-dense display that lets you quickly pick out patterns in data completion.
[Figure] Correlation heatmap: measures nullity correlation, that is, how strongly the presence or absence of one variable affects the presence of another.
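
To make the idea of nullity correlation concrete, one common approach (assumed here, not confirmed as what this report uses) is the Pearson correlation between per-column "is missing" indicators. The sketch applies it to the 20 sample rows shown in this report only, not to the full 340-row dataset, so the number it prints is purely illustrative. The helper name pearson is hypothetical.

```python
import math

def pearson(xs, ys):
    """Plain Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Row indices of the 20 sample rows in this report (first 10 and last 10).
rows = list(range(10)) + list(range(330, 340))
road_name_missing = {5, 7, 334, 335, 338}   # rows where 사업장소재지(도로명) is <NA>
head_count_missing = {336}                  # rows where 사육두수 is <NA>

# 1 where the value is missing, 0 where it is present.
road_flags = [int(r in road_name_missing) for r in rows]
count_flags = [int(r in head_count_missing) for r in rows]

r = pearson(road_flags, count_flags)
print(f"nullity correlation on the sample rows: {r:.3f}")
```

A value near zero or slightly negative, as here, means that a missing road-name address says little about whether the head count is also missing.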

Sample

Index | 사업장명칭 | 주사육업종 | 사업장소재지(지번) | 사업장소재지(도로명) | 사육두수
0 | 양산농장 | 한우 | 충청북도 진천군 초평면 용기리 732번지 | 충청북도 진천군 초평면 용전2길 40 | 138
1 | 수지농장 | 한우 | 충청북도 진천군 덕산읍 화상리 486번지 18호 , 486-1, 486-11, 486-12 | 충청북도 진천군 덕산읍 습지길 114-33 | 411
2 | 지구목장 | 한우 | 충청북도 진천군 백곡면 사송리 790번지 | 충청북도 진천군 백곡면 지구길 27-20 | 132
3 | 용강농장 | 한우 | 충청북도 진천군 초평면 용산리 297번지 | 충청북도 진천군 초평면 대구동길 13 | 98
4 | 하백목장 | 한우 | 충청북도 진천군 백곡면 양백리 233번지 3호 | 충청북도 진천군 백곡면 양백안길 6-8 | 15
5 | 구암농장 | 한우 | 충청북도 진천군 초평면 용기리 808번지 | <NA> | 12
6 | 용정농장 | 한우 | 충청북도 진천군 초평면 용정리 376번지 | 충청북도 진천군 초평면 양촌길 39-421
7 | 용동목장 | 한우 | 충청북도 진천군 초평면 신통리 304번지 | <NA> | 35
8 | 용산농장 | 한우 | 충청북도 진천군 초평면 용산리 105번지 4호 | 충청북도 진천군 초평면 금성길 35-540
9 | 영광농장 | 한우 | 충청북도 진천군 초평면 용산리 218번지 4호 | 충청북도 진천군 초평면 대구동길 2359

Index | 사업장명칭 | 주사육업종 | 사업장소재지(지번) | 사업장소재지(도로명) | 사육두수
330 | 허브목장 | 젖소 | 충청북도 진천군 이월면 동성리 951번지 외 2필지 | 충청북도 진천군 이월면 성중로 316-42200
331 | 현승목장 | 젖소 | 충청북도 진천군 초평면 신통리 250번지 | 충청북도 진천군 초평면 초동로 397-46120
332 | 걸포 목장 | 젖소 | 충청북도 진천군 진천읍 가산리 666번지 | 충청북도 진천군 진천읍 가산1길 126-29199
333 | 두레목장Ⅱ | 젖소 | 충청북도 진천군 이월면 신월리 676번지 | 충청북도 진천군 이월면 산삼로 442150
334 | 구름송이 | 젖소 | 충청북도 진천군 광혜원면 실원리 423번지 1호 | <NA> | 35
335 | 두산목장Ⅲ | 젖소 | 충청북도 진천군 덕산읍 인산리 955번지 | <NA> | 214
336 | 눈설농장 | 젖소 | 충청북도 진천군 이월면 동성리 789번지 | 충청북도 진천군 이월면 성중로 316-83 | <NA>
337 | 둥이목장 | 젖소 | 충청북도 진천군 초평면 금곡리 505번지 | 충청북도 진천군 초평면 초동로 101-2880
338 | 우천농장 | 젖소 | 충청북도 진천군 이월면 사곡리 752번지 | <NA> | 39
339 | 동인목장 | 젖소 | 충청북도 진천군 문백면 은탄리 276번지 외 1필지(279) | 충청북도 진천군 문백면 농다리로 40910