Overview

Dataset statistics

Number of variables7
Number of observations214
Missing cells214
Missing cells (%)14.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.0 KiB
Average record size in memory57.6 B

Variable types

Text4
Categorical1
Numeric1
DateTime1

Dataset

Description관내 소 사육농가에 대한 데이터로 사업장 명칭, 대표자, 사업장 주소, 사육 마리 수, 사육 축종을 구분한 자료입니다.
Author충청남도 금산군
URLhttps://www.data.go.kr/data/15042690/fileData.do

Alerts

소(우) 종류 구분 has constant value ""Constant
데이터기준일 has constant value ""Constant
지번주소 has 135 (63.1%) missing valuesMissing
도로명 주소 has 79 (36.9%) missing valuesMissing
사육두수 has 4 (1.9%) zerosZeros

Reproduction

Analysis started2024-03-15 00:52:01.435074
Analysis finished2024-03-15 00:52:03.200410
Duration1.77 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct207
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2024-03-15T09:52:04.003288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length4
Mean length4.4626168
Min length2

Characters and Unicode

Total characters955
Distinct characters202
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique201 ?
Unique (%)93.9%

Sample

1st row연흥농장
2nd row정농장
3rd row무내미농장
4th rowe-greenfarm
5th row행복한농장
ValueCountFrequency (%)
한우농장 5
 
2.3%
한울농장 2
 
0.9%
버들농장 2
 
0.9%
역평농장 2
 
0.9%
우진농장 2
 
0.9%
남매농장 2
 
0.9%
진우농장 2
 
0.9%
수당농장 1
 
0.5%
의총농장 1
 
0.5%
영주농장 1
 
0.5%
Other values (199) 199
90.9%
2024-03-15T09:52:05.290698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
206
21.6%
189
19.8%
29
 
3.0%
29
 
3.0%
16
 
1.7%
15
 
1.6%
14
 
1.5%
11
 
1.2%
10
 
1.0%
10
 
1.0%
Other values (192) 426
44.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 917
96.0%
Decimal Number 12
 
1.3%
Lowercase Letter 10
 
1.0%
Space Separator 5
 
0.5%
Close Punctuation 5
 
0.5%
Open Punctuation 5
 
0.5%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
206
22.5%
189
20.6%
29
 
3.2%
29
 
3.2%
16
 
1.7%
15
 
1.6%
14
 
1.5%
11
 
1.2%
10
 
1.1%
10
 
1.1%
Other values (177) 388
42.3%
Lowercase Letter
ValueCountFrequency (%)
e 3
30.0%
r 2
20.0%
g 1
 
10.0%
m 1
 
10.0%
a 1
 
10.0%
f 1
 
10.0%
n 1
 
10.0%
Decimal Number
ValueCountFrequency (%)
2 7
58.3%
1 2
 
16.7%
3 2
 
16.7%
5 1
 
8.3%
Space Separator
ValueCountFrequency (%)
5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 917
96.0%
Common 28
 
2.9%
Latin 10
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
206
22.5%
189
20.6%
29
 
3.2%
29
 
3.2%
16
 
1.7%
15
 
1.6%
14
 
1.5%
11
 
1.2%
10
 
1.1%
10
 
1.1%
Other values (177) 388
42.3%
Common
ValueCountFrequency (%)
2 7
25.0%
5
17.9%
) 5
17.9%
( 5
17.9%
1 2
 
7.1%
3 2
 
7.1%
5 1
 
3.6%
- 1
 
3.6%
Latin
ValueCountFrequency (%)
e 3
30.0%
r 2
20.0%
g 1
 
10.0%
m 1
 
10.0%
a 1
 
10.0%
f 1
 
10.0%
n 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 917
96.0%
ASCII 38
 
4.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
206
22.5%
189
20.6%
29
 
3.2%
29
 
3.2%
16
 
1.7%
15
 
1.6%
14
 
1.5%
11
 
1.2%
10
 
1.1%
10
 
1.1%
Other values (177) 388
42.3%
ASCII
ValueCountFrequency (%)
2 7
18.4%
5
13.2%
) 5
13.2%
( 5
13.2%
e 3
7.9%
1 2
 
5.3%
3 2
 
5.3%
r 2
 
5.3%
5 1
 
2.6%
g 1
 
2.6%
Other values (5) 5
13.2%
Distinct208
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2024-03-15T09:52:06.471568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length3
Mean length3.0934579
Min length2

Characters and Unicode

Total characters662
Distinct characters140
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique202 ?
Unique (%)94.4%

Sample

1st row배상성
2nd row정봉구
3rd row박병춘
4th row이윤근
5th row전이순
ValueCountFrequency (%)
한기종 2
 
0.9%
최영임 2
 
0.9%
김영순 2
 
0.9%
김영태 2
 
0.9%
이명임 2
 
0.9%
박병운 2
 
0.9%
이춘배 1
 
0.5%
이용석 1
 
0.5%
박복철 1
 
0.5%
이성철 1
 
0.5%
Other values (199) 199
92.6%
2024-03-15T09:52:07.831071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
 
8.2%
37
 
5.6%
22
 
3.3%
20
 
3.0%
16
 
2.4%
15
 
2.3%
12
 
1.8%
12
 
1.8%
12
 
1.8%
12
 
1.8%
Other values (130) 450
68.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 659
99.5%
Space Separator 1
 
0.2%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
8.2%
37
 
5.6%
22
 
3.3%
20
 
3.0%
16
 
2.4%
15
 
2.3%
12
 
1.8%
12
 
1.8%
12
 
1.8%
12
 
1.8%
Other values (127) 447
67.8%
Space Separator
ValueCountFrequency (%)
1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 659
99.5%
Common 3
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
8.2%
37
 
5.6%
22
 
3.3%
20
 
3.0%
16
 
2.4%
15
 
2.3%
12
 
1.8%
12
 
1.8%
12
 
1.8%
12
 
1.8%
Other values (127) 447
67.8%
Common
ValueCountFrequency (%)
1
33.3%
( 1
33.3%
) 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 659
99.5%
ASCII 3
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
 
8.2%
37
 
5.6%
22
 
3.3%
20
 
3.0%
16
 
2.4%
15
 
2.3%
12
 
1.8%
12
 
1.8%
12
 
1.8%
12
 
1.8%
Other values (127) 447
67.8%
ASCII
ValueCountFrequency (%)
1
33.3%
( 1
33.3%
) 1
33.3%

지번주소
Text

MISSING 

Distinct79
Distinct (%)100.0%
Missing135
Missing (%)63.1%
Memory size1.8 KiB
2024-03-15T09:52:09.039783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length38
Mean length25.620253
Min length20

Characters and Unicode

Total characters2024
Distinct characters85
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique79 ?
Unique (%)100.0%

Sample

1st row충청남도 금산군 진산면 막현리 133번지
2nd row충청남도 금산군 남일면 마장리 751번지
3rd row충청남도 금산군 군북면 산안리 521번지
4th row충청남도 금산군 제원면 제원리 416-6
5th row충청남도 금산군 군북면 동편리 186
ValueCountFrequency (%)
충청남도 79
18.2%
금산군 79
18.2%
1호 18
 
4.1%
복수면 15
 
3.4%
군북면 13
 
3.0%
금성면 12
 
2.8%
부리면 8
 
1.8%
진산면 8
 
1.8%
추부면 7
 
1.6%
도곡리 5
 
1.1%
Other values (135) 191
43.9%
2024-03-15T09:52:10.457441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
493
24.4%
96
 
4.7%
96
 
4.7%
92
 
4.5%
87
 
4.3%
86
 
4.2%
86
 
4.2%
84
 
4.2%
79
 
3.9%
79
 
3.9%
Other values (75) 746
36.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1220
60.3%
Space Separator 493
24.4%
Decimal Number 295
 
14.6%
Dash Punctuation 7
 
0.3%
Open Punctuation 4
 
0.2%
Close Punctuation 4
 
0.2%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
96
 
7.9%
96
 
7.9%
92
 
7.5%
87
 
7.1%
86
 
7.0%
86
 
7.0%
84
 
6.9%
79
 
6.5%
79
 
6.5%
75
 
6.1%
Other values (60) 360
29.5%
Decimal Number
ValueCountFrequency (%)
1 57
19.3%
4 43
14.6%
2 42
14.2%
6 34
11.5%
5 31
10.5%
3 23
7.8%
8 23
7.8%
7 16
 
5.4%
0 14
 
4.7%
9 12
 
4.1%
Space Separator
ValueCountFrequency (%)
493
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1220
60.3%
Common 804
39.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
96
 
7.9%
96
 
7.9%
92
 
7.5%
87
 
7.1%
86
 
7.0%
86
 
7.0%
84
 
6.9%
79
 
6.5%
79
 
6.5%
75
 
6.1%
Other values (60) 360
29.5%
Common
ValueCountFrequency (%)
493
61.3%
1 57
 
7.1%
4 43
 
5.3%
2 42
 
5.2%
6 34
 
4.2%
5 31
 
3.9%
3 23
 
2.9%
8 23
 
2.9%
7 16
 
2.0%
0 14
 
1.7%
Other values (5) 28
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1220
60.3%
ASCII 804
39.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
493
61.3%
1 57
 
7.1%
4 43
 
5.3%
2 42
 
5.2%
6 34
 
4.2%
5 31
 
3.9%
3 23
 
2.9%
8 23
 
2.9%
7 16
 
2.0%
0 14
 
1.7%
Other values (5) 28
 
3.5%
Hangul
ValueCountFrequency (%)
96
 
7.9%
96
 
7.9%
92
 
7.5%
87
 
7.1%
86
 
7.0%
86
 
7.0%
84
 
6.9%
79
 
6.5%
79
 
6.5%
75
 
6.1%
Other values (60) 360
29.5%

도로명 주소
Text

MISSING 

Distinct133
Distinct (%)98.5%
Missing79
Missing (%)36.9%
Memory size1.8 KiB
2024-03-15T09:52:11.893440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length30
Mean length21.585185
Min length18

Characters and Unicode

Total characters2914
Distinct characters150
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique132 ?
Unique (%)97.8%

Sample

1st row충청남도 금산군 복수면 수심대길 93
2nd row충청남도 금산군 추부면 추풍로 98
3rd row충청남도 금산군 남일면 마장길 144-14
4th row충청남도 금산군 부리면 무금로 1736-45
5th row충청남도 금산군 금성면 큰말길 23
ValueCountFrequency (%)
충청남도 135
19.9%
금산군 135
19.9%
금성면 29
 
4.3%
부리면 20
 
2.9%
군북면 17
 
2.5%
복수면 17
 
2.5%
남일면 12
 
1.8%
추부면 10
 
1.5%
진산면 8
 
1.2%
남이면 8
 
1.2%
Other values (220) 288
42.4%
2024-03-15T09:52:13.845824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
546
18.7%
180
 
6.2%
157
 
5.4%
155
 
5.3%
153
 
5.3%
135
 
4.6%
135
 
4.6%
135
 
4.6%
128
 
4.4%
98
 
3.4%
Other values (140) 1092
37.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1851
63.5%
Space Separator 546
 
18.7%
Decimal Number 454
 
15.6%
Dash Punctuation 52
 
1.8%
Close Punctuation 4
 
0.1%
Open Punctuation 4
 
0.1%
Other Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
180
 
9.7%
157
 
8.5%
155
 
8.4%
153
 
8.3%
135
 
7.3%
135
 
7.3%
135
 
7.3%
128
 
6.9%
98
 
5.3%
39
 
2.1%
Other values (125) 536
29.0%
Decimal Number
ValueCountFrequency (%)
1 97
21.4%
2 66
14.5%
3 48
10.6%
4 48
10.6%
5 47
10.4%
7 43
9.5%
6 38
 
8.4%
0 26
 
5.7%
9 23
 
5.1%
8 18
 
4.0%
Space Separator
ValueCountFrequency (%)
546
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1851
63.5%
Common 1063
36.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
180
 
9.7%
157
 
8.5%
155
 
8.4%
153
 
8.3%
135
 
7.3%
135
 
7.3%
135
 
7.3%
128
 
6.9%
98
 
5.3%
39
 
2.1%
Other values (125) 536
29.0%
Common
ValueCountFrequency (%)
546
51.4%
1 97
 
9.1%
2 66
 
6.2%
- 52
 
4.9%
3 48
 
4.5%
4 48
 
4.5%
5 47
 
4.4%
7 43
 
4.0%
6 38
 
3.6%
0 26
 
2.4%
Other values (5) 52
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1851
63.5%
ASCII 1063
36.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
546
51.4%
1 97
 
9.1%
2 66
 
6.2%
- 52
 
4.9%
3 48
 
4.5%
4 48
 
4.5%
5 47
 
4.4%
7 43
 
4.0%
6 38
 
3.6%
0 26
 
2.4%
Other values (5) 52
 
4.9%
Hangul
ValueCountFrequency (%)
180
 
9.7%
157
 
8.5%
155
 
8.4%
153
 
8.3%
135
 
7.3%
135
 
7.3%
135
 
7.3%
128
 
6.9%
98
 
5.3%
39
 
2.1%
Other values (125) 536
29.0%

소(우) 종류 구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
한우
214 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한우
2nd row한우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 214
100.0%

Length

2024-03-15T09:52:14.292317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T09:52:14.522669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한우 214
100.0%

사육두수
Real number (ℝ)

ZEROS 

Distinct87
Distinct (%)40.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean59.635514
Minimum0
Maximum750
Zeros4
Zeros (%)1.9%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2024-03-15T09:52:14.795008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1.65
Q15
median20
Q370
95-th percentile231.55
Maximum750
Range750
Interquartile range (IQR)65

Descriptive statistics

Standard deviation100.91701
Coefficient of variation (CV)1.69223
Kurtosis14.967719
Mean59.635514
Median Absolute Deviation (MAD)18
Skewness3.4475347
Sum12762
Variance10184.242
MonotonicityNot monotonic
2024-03-15T09:52:15.446690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 18
 
8.4%
5 13
 
6.1%
3 8
 
3.7%
25 8
 
3.7%
50 7
 
3.3%
1 7
 
3.3%
10 7
 
3.3%
8 7
 
3.3%
6 6
 
2.8%
100 5
 
2.3%
Other values (77) 128
59.8%
ValueCountFrequency (%)
0 4
 
1.9%
1 7
 
3.3%
2 18
8.4%
3 8
3.7%
4 5
 
2.3%
5 13
6.1%
6 6
 
2.8%
7 4
 
1.9%
8 7
 
3.3%
9 2
 
0.9%
ValueCountFrequency (%)
750 1
0.5%
500 2
0.9%
450 1
0.5%
430 1
0.5%
400 2
0.9%
319 1
0.5%
306 1
0.5%
250 1
0.5%
240 1
0.5%
227 1
0.5%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
Minimum2024-01-09 00:00:00
Maximum2024-01-09 00:00:00
2024-03-15T09:52:15.807430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T09:52:16.105037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-15T09:52:02.003927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T09:52:16.311332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지번주소사육두수
지번주소1.0001.000
사육두수1.0001.000

Missing values

2024-03-15T09:52:02.365995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T09:52:02.766140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-15T09:52:03.050287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

사업장명칭대표자명지번주소도로명 주소소(우) 종류 구분사육두수데이터기준일
0연흥농장배상성<NA>충청남도 금산군 복수면 수심대길 93한우1002024-01-09
1정농장정봉구<NA>충청남도 금산군 추부면 추풍로 98한우212024-01-09
2무내미농장박병춘<NA>충청남도 금산군 남일면 마장길 144-14한우762024-01-09
3e-greenfarm이윤근<NA>충청남도 금산군 부리면 무금로 1736-45한우3192024-01-09
4행복한농장전이순<NA>충청남도 금산군 금성면 큰말길 23한우272024-01-09
5정호농장박병운<NA>충청남도 금산군 금성면 큰말길 63-7한우1332024-01-09
6매현농장김일순충청남도 금산군 진산면 막현리 133번지<NA>한우252024-01-09
7산흥농장김진수<NA>충청남도 금산군 추부면 추풍로 234-22한우1252024-01-09
8경훤농장박훤용<NA>충청남도 금산군 금성면 왜벌길 5한우2202024-01-09
9한우농장김태문<NA>충청남도 금산군 금산읍 금산천2길 25-7한우902024-01-09
사업장명칭대표자명지번주소도로명 주소소(우) 종류 구분사육두수데이터기준일
204다올농장최영임충청남도 금산군 부리면 관천리 173번지<NA>한우2102024-01-09
205우진농장 2호김봉수충청남도 금산군 금성면 하류리 656번지 외 1필지(657)<NA>한우1262024-01-09
206영진김태영충청남도 금산군 금성면 하류리 614번지 1호 외1필지(614-2)<NA>한우32024-01-09
207기복농장김기복충청남도 금산군 추부면 서대리 480번지 2호<NA>한우92024-01-09
208수농장소병구충청남도 금산군 금성면 도곡리 458번지<NA>한우402024-01-09
209금홍한우3농장박선미충청남도 금산군 금성면 도곡리 446번지 외 2필지(446-1, 446-2)<NA>한우1342024-01-09
210오케이한우2농장이상신충청남도 금산군 부리면 선원리 223번지 11호<NA>한우22024-01-09
211금수소김금수충청남도 금산군 부리면 관천리 578번지<NA>한우252024-01-09
212다금2농장명노관충청남도 금산군 부리면 선원리 223번지 12호<NA>한우102024-01-09
213남매농장김영숙충청남도 금산군 금성면 두곡리 189번지<NA>한우22024-01-09