Overview

Dataset statistics

Number of variables7
Number of observations354
Missing cells214
Missing cells (%)8.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory19.8 KiB
Average record size in memory57.4 B

Variable types

Text4
Categorical1
Numeric1
DateTime1

Dataset

Description이 데이터는 충청남도 금산군 축산 및 가금류 농장현황(농장명, 축종, 사육수, 소재지, 대표자, 연락처)을 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=401&beforeMenuCd=DOM_000000201001001000&publicdatapk=15034228

Alerts

데이터기준일자 has constant value ""Constant
축종 is highly imbalanced (54.7%)Imbalance
연락처 has 214 (60.5%) missing valuesMissing
사육수 has 5 (1.4%) zerosZeros

Reproduction

Analysis started2024-01-09 20:45:10.605917
Analysis finished2024-01-09 20:45:11.206849
Duration0.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct334
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2024-01-10T05:45:11.407075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length4
Mean length4.4971751
Min length2

Characters and Unicode

Total characters1592
Distinct characters250
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique316 ?
Unique (%)89.3%

Sample

1st row창평농장
2nd row연흥농장
3rd row정농장
4th row무내미농장
5th rowe-greenfarm
ValueCountFrequency (%)
한우농장 4
 
1.1%
추부농장 3
 
0.8%
장대농장 2
 
0.6%
농장 2
 
0.6%
금성농장 2
 
0.6%
한울농장 2
 
0.6%
진우농장 2
 
0.6%
상진농장 2
 
0.6%
영태농장 2
 
0.6%
양지농장 2
 
0.6%
Other values (331) 340
93.7%
2024-01-10T05:45:11.777038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
336
21.1%
309
19.4%
35
 
2.2%
31
 
1.9%
30
 
1.9%
26
 
1.6%
21
 
1.3%
19
 
1.2%
18
 
1.1%
18
 
1.1%
Other values (240) 749
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1539
96.7%
Open Punctuation 11
 
0.7%
Close Punctuation 11
 
0.7%
Lowercase Letter 10
 
0.6%
Space Separator 9
 
0.6%
Decimal Number 9
 
0.6%
Uppercase Letter 2
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
336
21.8%
309
20.1%
35
 
2.3%
31
 
2.0%
30
 
1.9%
26
 
1.7%
21
 
1.4%
19
 
1.2%
18
 
1.2%
18
 
1.2%
Other values (223) 696
45.2%
Lowercase Letter
ValueCountFrequency (%)
e 3
30.0%
r 2
20.0%
n 1
 
10.0%
f 1
 
10.0%
g 1
 
10.0%
a 1
 
10.0%
m 1
 
10.0%
Decimal Number
ValueCountFrequency (%)
2 4
44.4%
3 2
22.2%
1 2
22.2%
5 1
 
11.1%
Uppercase Letter
ValueCountFrequency (%)
J 1
50.0%
K 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1539
96.7%
Common 41
 
2.6%
Latin 12
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
336
21.8%
309
20.1%
35
 
2.3%
31
 
2.0%
30
 
1.9%
26
 
1.7%
21
 
1.4%
19
 
1.2%
18
 
1.2%
18
 
1.2%
Other values (223) 696
45.2%
Latin
ValueCountFrequency (%)
e 3
25.0%
r 2
16.7%
n 1
 
8.3%
f 1
 
8.3%
J 1
 
8.3%
K 1
 
8.3%
g 1
 
8.3%
a 1
 
8.3%
m 1
 
8.3%
Common
ValueCountFrequency (%)
( 11
26.8%
) 11
26.8%
9
22.0%
2 4
 
9.8%
3 2
 
4.9%
1 2
 
4.9%
- 1
 
2.4%
5 1
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1539
96.7%
ASCII 53
 
3.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
336
21.8%
309
20.1%
35
 
2.3%
31
 
2.0%
30
 
1.9%
26
 
1.7%
21
 
1.4%
19
 
1.2%
18
 
1.2%
18
 
1.2%
Other values (223) 696
45.2%
ASCII
ValueCountFrequency (%)
( 11
20.8%
) 11
20.8%
9
17.0%
2 4
 
7.5%
e 3
 
5.7%
3 2
 
3.8%
1 2
 
3.8%
r 2
 
3.8%
n 1
 
1.9%
f 1
 
1.9%
Other values (7) 7
13.2%

축종
Categorical

IMBALANCE 

Distinct17
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
한우
250 
산양
 
19
염소
 
17
육계
 
16
돼지
 
13
Other values (12)
39 

Length

Max length6
Median length2
Mean length2.220339
Min length2

Unique

Unique6 ?
Unique (%)1.7%

Sample

1st row돼지
2nd row한우, 육우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 250
70.6%
산양 19
 
5.4%
염소 17
 
4.8%
육계 16
 
4.5%
돼지 13
 
3.7%
젖소 11
 
3.1%
종계/산란계 9
 
2.5%
사슴 4
 
1.1%
육우 3
 
0.8%
<NA> 3
 
0.8%
Other values (7) 9
 
2.5%

Length

2024-01-10T05:45:11.919176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 258
71.1%
산양 20
 
5.5%
염소 19
 
5.2%
육계 17
 
4.7%
돼지 14
 
3.9%
젖소 12
 
3.3%
종계/산란계 9
 
2.5%
육우 6
 
1.7%
사슴 5
 
1.4%
na 3
 
0.8%

사육수
Real number (ℝ)

ZEROS 

Distinct119
Distinct (%)33.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2722.774
Minimum0
Maximum165000
Zeros5
Zeros (%)1.4%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2024-01-10T05:45:12.040927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q15
median20
Q375.25
95-th percentile3000
Maximum165000
Range165000
Interquartile range (IQR)70.25

Descriptive statistics

Standard deviation15468.074
Coefficient of variation (CV)5.6809982
Kurtosis67.221756
Mean2722.774
Median Absolute Deviation (MAD)18
Skewness7.6891278
Sum963862
Variance2.3926132 × 108
MonotonicityNot monotonic
2024-01-10T05:45:12.162334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 31
 
8.8%
1 20
 
5.6%
4 16
 
4.5%
5 16
 
4.5%
3 12
 
3.4%
50 11
 
3.1%
7 9
 
2.5%
10 9
 
2.5%
20 9
 
2.5%
8 8
 
2.3%
Other values (109) 213
60.2%
ValueCountFrequency (%)
0 5
 
1.4%
1 20
5.6%
2 31
8.8%
3 12
 
3.4%
4 16
4.5%
5 16
4.5%
6 6
 
1.7%
7 9
 
2.5%
8 8
 
2.3%
9 5
 
1.4%
ValueCountFrequency (%)
165000 1
 
0.3%
156000 1
 
0.3%
90000 1
 
0.3%
80000 1
 
0.3%
70000 1
 
0.3%
60000 3
0.8%
35000 1
 
0.3%
34000 1
 
0.3%
30000 3
0.8%
11000 1
 
0.3%
Distinct350
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2024-01-10T05:45:12.376896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length59
Mean length26.288136
Min length18

Characters and Unicode

Total characters9306
Distinct characters130
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique348 ?
Unique (%)98.3%

Sample

1st row충청남도 금산군 부리면 창평리 139번지 외 1필지(115번지)
2nd row충청남도 금산군 복수면 곡남리 304번지 1호
3rd row충청남도 금산군 추부면 장대리 466번지 2호
4th row충청남도 금산군 남일면 마장리 327번지 외 1필(332)
5th row충청남도 금산군 부리면 선원리 807번지 외 2필
ValueCountFrequency (%)
충청남도 354
 
17.6%
금산군 354
 
17.6%
1호 74
 
3.7%
금성면 52
 
2.6%
복수면 50
 
2.5%
부리면 48
 
2.4%
군북면 44
 
2.2%
추부면 43
 
2.1%
진산면 34
 
1.7%
2호 27
 
1.3%
Other values (453) 932
46.3%
2024-01-10T05:45:12.722181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2321
24.9%
432
 
4.6%
429
 
4.6%
398
 
4.3%
398
 
4.3%
396
 
4.3%
386
 
4.1%
367
 
3.9%
354
 
3.8%
354
 
3.8%
Other values (120) 3471
37.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5534
59.5%
Space Separator 2321
24.9%
Decimal Number 1366
 
14.7%
Dash Punctuation 36
 
0.4%
Other Punctuation 20
 
0.2%
Close Punctuation 14
 
0.2%
Open Punctuation 14
 
0.2%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
432
 
7.8%
429
 
7.8%
398
 
7.2%
398
 
7.2%
396
 
7.2%
386
 
7.0%
367
 
6.6%
354
 
6.4%
354
 
6.4%
348
 
6.3%
Other values (104) 1672
30.2%
Decimal Number
ValueCountFrequency (%)
1 273
20.0%
2 177
13.0%
5 157
11.5%
4 156
11.4%
3 151
11.1%
8 103
 
7.5%
7 100
 
7.3%
6 98
 
7.2%
9 82
 
6.0%
0 69
 
5.1%
Space Separator
ValueCountFrequency (%)
2321
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 36
100.0%
Other Punctuation
ValueCountFrequency (%)
, 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5534
59.5%
Common 3772
40.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
432
 
7.8%
429
 
7.8%
398
 
7.2%
398
 
7.2%
396
 
7.2%
386
 
7.0%
367
 
6.6%
354
 
6.4%
354
 
6.4%
348
 
6.3%
Other values (104) 1672
30.2%
Common
ValueCountFrequency (%)
2321
61.5%
1 273
 
7.2%
2 177
 
4.7%
5 157
 
4.2%
4 156
 
4.1%
3 151
 
4.0%
8 103
 
2.7%
7 100
 
2.7%
6 98
 
2.6%
9 82
 
2.2%
Other values (6) 154
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5534
59.5%
ASCII 3771
40.5%
CJK Compat 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2321
61.5%
1 273
 
7.2%
2 177
 
4.7%
5 157
 
4.2%
4 156
 
4.1%
3 151
 
4.0%
8 103
 
2.7%
7 100
 
2.7%
6 98
 
2.6%
9 82
 
2.2%
Other values (5) 153
 
4.1%
Hangul
ValueCountFrequency (%)
432
 
7.8%
429
 
7.8%
398
 
7.2%
398
 
7.2%
396
 
7.2%
386
 
7.0%
367
 
6.6%
354
 
6.4%
354
 
6.4%
348
 
6.3%
Other values (104) 1672
30.2%
CJK Compat
ValueCountFrequency (%)
1
100.0%
Distinct346
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2024-01-10T05:45:13.035690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length3
Mean length3.1723164
Min length3

Characters and Unicode

Total characters1123
Distinct characters168
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique339 ?
Unique (%)95.8%

Sample

1st row윤석권
2nd row배상성
3rd row정봉구
4th row박병춘
5th row이윤근
ValueCountFrequency (%)
부자양계영농조합 3
 
0.8%
농업회사법인 3
 
0.8%
김영태 2
 
0.6%
김은주 2
 
0.6%
김도연 2
 
0.6%
한기종 2
 
0.6%
이명임 2
 
0.6%
이상진 2
 
0.6%
주식회사 2
 
0.6%
김칠배 1
 
0.3%
Other values (338) 338
94.2%
2024-01-10T05:45:13.493458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
79
 
7.0%
50
 
4.5%
38
 
3.4%
36
 
3.2%
34
 
3.0%
23
 
2.0%
20
 
1.8%
19
 
1.7%
19
 
1.7%
19
 
1.7%
Other values (158) 786
70.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1114
99.2%
Space Separator 5
 
0.4%
Close Punctuation 2
 
0.2%
Open Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
79
 
7.1%
50
 
4.5%
38
 
3.4%
36
 
3.2%
34
 
3.1%
23
 
2.1%
20
 
1.8%
19
 
1.7%
19
 
1.7%
19
 
1.7%
Other values (155) 777
69.7%
Space Separator
ValueCountFrequency (%)
5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1114
99.2%
Common 9
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
79
 
7.1%
50
 
4.5%
38
 
3.4%
36
 
3.2%
34
 
3.1%
23
 
2.1%
20
 
1.8%
19
 
1.7%
19
 
1.7%
19
 
1.7%
Other values (155) 777
69.7%
Common
ValueCountFrequency (%)
5
55.6%
) 2
 
22.2%
( 2
 
22.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1114
99.2%
ASCII 9
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
79
 
7.1%
50
 
4.5%
38
 
3.4%
36
 
3.2%
34
 
3.1%
23
 
2.1%
20
 
1.8%
19
 
1.7%
19
 
1.7%
19
 
1.7%
Other values (155) 777
69.7%
ASCII
ValueCountFrequency (%)
5
55.6%
) 2
 
22.2%
( 2
 
22.2%

연락처
Text

MISSING 

Distinct137
Distinct (%)97.9%
Missing214
Missing (%)60.5%
Memory size2.9 KiB
2024-01-10T05:45:13.718075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1680
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)96.4%

Sample

1st row041-754-6233
2nd row041-753-6130
3rd row041-752-6092
4th row042-753-8465
5th row041-752-2290
ValueCountFrequency (%)
041-754-9654 3
 
2.1%
042-582-7025 2
 
1.4%
041-751-3632 1
 
0.7%
041-753-0925 1
 
0.7%
041-753-2314 1
 
0.7%
041-752-9127 1
 
0.7%
041-752-4634 1
 
0.7%
041-752-3138 1
 
0.7%
041-752-2965 1
 
0.7%
041-754-5095 1
 
0.7%
Other values (127) 127
90.7%
2024-01-10T05:45:14.062666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 280
16.7%
4 211
12.6%
5 204
12.1%
0 199
11.8%
7 193
11.5%
1 192
11.4%
2 125
7.4%
3 106
 
6.3%
8 60
 
3.6%
9 55
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1400
83.3%
Dash Punctuation 280
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 211
15.1%
5 204
14.6%
0 199
14.2%
7 193
13.8%
1 192
13.7%
2 125
8.9%
3 106
7.6%
8 60
 
4.3%
9 55
 
3.9%
6 55
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 280
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1680
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 280
16.7%
4 211
12.6%
5 204
12.1%
0 199
11.8%
7 193
11.5%
1 192
11.4%
2 125
7.4%
3 106
 
6.3%
8 60
 
3.6%
9 55
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1680
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 280
16.7%
4 211
12.6%
5 204
12.1%
0 199
11.8%
7 193
11.5%
1 192
11.4%
2 125
7.4%
3 106
 
6.3%
8 60
 
3.6%
9 55
 
3.3%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
Minimum2020-11-24 00:00:00
Maximum2020-11-24 00:00:00
2024-01-10T05:45:14.175102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:45:14.261431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-01-10T05:45:10.975898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:45:14.336932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
축종사육수
축종1.0000.611
사육수0.6111.000
2024-01-10T05:45:14.437238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육수축종
사육수1.0000.331
축종0.3311.000

Missing values

2024-01-10T05:45:11.076388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:45:11.168086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

농장명축종사육수소재지대표자연락처데이터기준일자
0창평농장돼지3500충청남도 금산군 부리면 창평리 139번지 외 1필지(115번지)윤석권041-754-62332020-11-24
1연흥농장한우, 육우100충청남도 금산군 복수면 곡남리 304번지 1호배상성041-753-61302020-11-24
2정농장한우21충청남도 금산군 추부면 장대리 466번지 2호정봉구041-752-60922020-11-24
3무내미농장한우76충청남도 금산군 남일면 마장리 327번지 외 1필(332)박병춘042-753-84652020-11-24
4e-greenfarm한우319충청남도 금산군 부리면 선원리 807번지 외 2필이윤근<NA>2020-11-24
5행복한농장한우27충청남도 금산군 금성면 하류리 434번지 외 1필지전이순<NA>2020-11-24
6부자양계영농조합(1농장)종계/산란계165000충청남도 금산군 금성면 하류리 448번지 3호부자양계영농조합<NA>2020-11-24
7정호농장한우133충청남도 금산군 금성면 하류리 416번지 4호 (외 1필지 하류리 127-3)박병운041-752-22902020-11-24
8매현농장한우25충청남도 금산군 진산면 막현리 133번지김일순<NA>2020-11-24
9산흥농장한우125충청남도 금산군 추부면 자부리 488번지 6호김진수041-753-86892020-11-24
농장명축종사육수소재지대표자연락처데이터기준일자
344성현농장한우3충청남도 금산군 진산면 읍내리 89번지 2호엄해룡041-752-41562020-11-24
345부미목장젖소165충청남도 금산군 금성면 양전리 657번지 2호손부미<NA>2020-11-24
346형제농장한우3충청남도 금산군 제원면 명암리 30번지 2호이완순<NA>2020-11-24
347다복염소농장염소30충청남도 금산군 복수면 다복리 80번지 1호김길용<NA>2020-11-24
348동일한우농장(설성한우)한우240충청남도 금산군 금성면 하류리 493번지 2호박은자<NA>2020-11-24
349금홍한우농장(2농장)한우500충청남도 금산군 금성면 하류리 535번지 1호김은주041-754-96542020-11-24
350자유농장(설성농장)한우240충청남도 금산군 금성면 하류리 531번지김미경<NA>2020-11-24
351현내흑염소염소30충청남도 금산군 부리면 현내리 343번지 11호엄정현<NA>2020-11-24
352수영농장육계70000충청남도 금산군 복수면 수영리 692번지이인수041-753-32042020-11-24
353선진농장한우100충청남도 금산군 금성면 하류리 626번지임선애<NA>2020-11-24