Overview

Dataset statistics

Number of variables6
Number of observations950
Missing cells0
Missing cells (%)0.0%
Duplicate rows17
Duplicate rows (%)1.8%
Total size in memory45.6 KiB
Average record size in memory49.1 B

Variable types

Categorical4
Text1
Numeric1

Dataset

Description충청북도 충주시 가축사육업 현황에 관련된 데이터를 제공합니다.(자치단체, 농장이름, 축종, 농장소재지 읍면동, 사육두수)
Author충청북도 충주시
URLhttps://www.data.go.kr/data/15034230/fileData.do

Alerts

자치단체 has constant value ""Constant
데이터기준일자 has constant value ""Constant
Dataset has 17 (1.8%) duplicate rowsDuplicates
사육두수 is highly overall correlated with 축종High correlation
축종 is highly overall correlated with 사육두수High correlation
축종 is highly imbalanced (59.6%)Imbalance

Reproduction

Analysis started2023-12-12 11:04:02.818475
Analysis finished2023-12-12 11:04:03.709641
Duration0.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

자치단체
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
충청북도 충주시
950 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청북도 충주시
2nd row충청북도 충주시
3rd row충청북도 충주시
4th row충청북도 충주시
5th row충청북도 충주시

Common Values

ValueCountFrequency (%)
충청북도 충주시 950
100.0%

Length

2023-12-12T20:04:03.808521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:04:03.952446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청북도 950
50.0%
충주시 950
50.0%
Distinct859
Distinct (%)90.4%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
2023-12-12T20:04:04.289305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length4
Mean length4.3073684
Min length2

Characters and Unicode

Total characters4092
Distinct characters330
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique785 ?
Unique (%)82.6%

Sample

1st row부인농원
2nd row나나농장
3rd row화계농장
4th row현기농장
5th row가금농장
ValueCountFrequency (%)
농장 40
 
3.8%
우리농장 7
 
0.7%
2 6
 
0.6%
농업회사법인 6
 
0.6%
중원농장 5
 
0.5%
한우 5
 
0.5%
사슴농장 4
 
0.4%
한우농장 4
 
0.4%
덕해농장 4
 
0.4%
목장 4
 
0.4%
Other values (866) 963
91.9%
2023-12-12T20:04:04.973046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
705
 
17.2%
655
 
16.0%
98
 
2.4%
84
 
2.1%
76
 
1.9%
70
 
1.7%
69
 
1.7%
54
 
1.3%
49
 
1.2%
46
 
1.1%
Other values (320) 2186
53.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3944
96.4%
Space Separator 98
 
2.4%
Decimal Number 28
 
0.7%
Open Punctuation 9
 
0.2%
Close Punctuation 9
 
0.2%
Uppercase Letter 3
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
705
 
17.9%
655
 
16.6%
84
 
2.1%
76
 
1.9%
70
 
1.8%
69
 
1.7%
54
 
1.4%
49
 
1.2%
46
 
1.2%
45
 
1.1%
Other values (308) 2091
53.0%
Decimal Number
ValueCountFrequency (%)
2 19
67.9%
1 3
 
10.7%
4 2
 
7.1%
3 2
 
7.1%
5 2
 
7.1%
Uppercase Letter
ValueCountFrequency (%)
K 1
33.3%
A 1
33.3%
L 1
33.3%
Space Separator
ValueCountFrequency (%)
98
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3944
96.4%
Common 145
 
3.5%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
705
 
17.9%
655
 
16.6%
84
 
2.1%
76
 
1.9%
70
 
1.8%
69
 
1.7%
54
 
1.4%
49
 
1.2%
46
 
1.2%
45
 
1.1%
Other values (308) 2091
53.0%
Common
ValueCountFrequency (%)
98
67.6%
2 19
 
13.1%
( 9
 
6.2%
) 9
 
6.2%
1 3
 
2.1%
4 2
 
1.4%
3 2
 
1.4%
5 2
 
1.4%
/ 1
 
0.7%
Latin
ValueCountFrequency (%)
K 1
33.3%
A 1
33.3%
L 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3944
96.4%
ASCII 148
 
3.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
705
 
17.9%
655
 
16.6%
84
 
2.1%
76
 
1.9%
70
 
1.8%
69
 
1.7%
54
 
1.4%
49
 
1.2%
46
 
1.2%
45
 
1.1%
Other values (308) 2091
53.0%
ASCII
ValueCountFrequency (%)
98
66.2%
2 19
 
12.8%
( 9
 
6.1%
) 9
 
6.1%
1 3
 
2.0%
4 2
 
1.4%
3 2
 
1.4%
5 2
 
1.4%
K 1
 
0.7%
A 1
 
0.7%
Other values (2) 2
 
1.4%

축종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct28
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
한우
668 
산양
 
54
육계
 
51
염소
 
46
돼지
 
25
Other values (23)
106 

Length

Max length14
Median length2
Mean length2.3010526
Min length1

Unique

Unique8 ?
Unique (%)0.8%

Sample

1st row육계
2nd row종계/산란계
3rd row한우
4th row한우, 돼지
5th row육계

Common Values

ValueCountFrequency (%)
한우 668
70.3%
산양 54
 
5.7%
육계 51
 
5.4%
염소 46
 
4.8%
돼지 25
 
2.6%
종계/산란계 22
 
2.3%
사슴 16
 
1.7%
젖소 12
 
1.3%
육우 11
 
1.2%
한우, 젖소 8
 
0.8%
Other values (18) 37
 
3.9%

Length

2023-12-12T20:04:05.223452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 693
70.0%
산양 64
 
6.5%
육계 58
 
5.9%
염소 51
 
5.2%
돼지 28
 
2.8%
종계/산란계 28
 
2.8%
사슴 21
 
2.1%
젖소 21
 
2.1%
육우 14
 
1.4%
산란육성계 5
 
0.5%
Other values (4) 7
 
0.7%
Distinct20
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
주덕읍
177 
대소원면
116 
신니면
108 
금가면
106 
동량면
78 
Other values (15)
365 

Length

Max length6
Median length3
Mean length3.2210526
Min length3

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row주덕읍
2nd row산척면
3rd row주덕읍
4th row주덕읍
5th row중앙탑면

Common Values

ValueCountFrequency (%)
주덕읍 177
18.6%
대소원면 116
12.2%
신니면 108
11.4%
금가면 106
11.2%
동량면 78
8.2%
살미면 56
 
5.9%
중앙탑면 53
 
5.6%
엄정면 51
 
5.4%
산척면 49
 
5.2%
노은면 36
 
3.8%
Other values (10) 120
12.6%

Length

2023-12-12T20:04:05.437803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
주덕읍 177
18.6%
대소원면 116
12.2%
신니면 108
11.4%
금가면 106
11.2%
동량면 78
8.2%
살미면 56
 
5.9%
중앙탑면 53
 
5.6%
엄정면 51
 
5.4%
산척면 49
 
5.2%
노은면 36
 
3.8%
Other values (10) 120
12.6%

사육두수
Real number (ℝ)

HIGH CORRELATION 

Distinct229
Distinct (%)24.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3891.9242
Minimum0
Maximum641160
Zeros3
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size8.5 KiB
2023-12-12T20:04:05.658863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q111
median30
Q392.75
95-th percentile20902
Maximum641160
Range641160
Interquartile range (IQR)81.75

Descriptive statistics

Standard deviation27556.228
Coefficient of variation (CV)7.0803608
Kurtosis344.62949
Mean3891.9242
Median Absolute Deviation (MAD)23
Skewness16.828882
Sum3697328
Variance7.5934569 × 108
MonotonicityNot monotonic
2023-12-12T20:04:05.880905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4 35
 
3.7%
30 28
 
2.9%
5 27
 
2.8%
25 27
 
2.8%
20 26
 
2.7%
9 26
 
2.7%
6 26
 
2.7%
10 25
 
2.6%
100 24
 
2.5%
7 23
 
2.4%
Other values (219) 683
71.9%
ValueCountFrequency (%)
0 3
 
0.3%
1 10
 
1.1%
2 23
2.4%
3 19
2.0%
4 35
3.7%
5 27
2.8%
6 26
2.7%
7 23
2.4%
8 18
1.9%
9 26
2.7%
ValueCountFrequency (%)
641160 1
 
0.1%
374694 1
 
0.1%
267840 1
 
0.1%
100000 1
 
0.1%
85000 1
 
0.1%
75000 1
 
0.1%
72288 1
 
0.1%
70000 3
0.3%
65000 1
 
0.1%
62064 1
 
0.1%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
2023-08-31
950 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-31
2nd row2023-08-31
3rd row2023-08-31
4th row2023-08-31
5th row2023-08-31

Common Values

ValueCountFrequency (%)
2023-08-31 950
100.0%

Length

2023-12-12T20:04:06.073972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:04:06.194422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-31 950
100.0%

Interactions

2023-12-12T20:04:03.242714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:04:06.303231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
축종농장 소재지 읍면동사육두수
축종1.0000.3950.812
농장 소재지 읍면동0.3951.0000.311
사육두수0.8120.3111.000
2023-12-12T20:04:06.428317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
축종농장 소재지 읍면동
축종1.0000.113
농장 소재지 읍면동0.1131.000
2023-12-12T20:04:06.556426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육두수축종농장 소재지 읍면동
사육두수1.0000.5510.137
축종0.5511.0000.113
농장 소재지 읍면동0.1370.1131.000

Missing values

2023-12-12T20:04:03.464268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:04:03.640110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자치단체농장이름축종농장 소재지 읍면동사육두수데이터기준일자
0충청북도 충주시부인농원육계주덕읍500002023-08-31
1충청북도 충주시나나농장종계/산란계산척면122402023-08-31
2충청북도 충주시화계농장한우주덕읍982023-08-31
3충청북도 충주시현기농장한우, 돼지주덕읍1402023-08-31
4충청북도 충주시가금농장육계중앙탑면500002023-08-31
5충청북도 충주시꼬꼬농장육계봉방동700002023-08-31
6충청북도 충주시농업회사법인(주) 다비육종돼지중앙탑면1092023-08-31
7충청북도 충주시대성목장한우동량면512023-08-31
8충청북도 충주시자원한우한우주덕읍1822023-08-31
9충청북도 충주시만년농장한우주덕읍422023-08-31
자치단체농장이름축종농장 소재지 읍면동사육두수데이터기준일자
940충청북도 충주시화치 농장한우신니면212023-08-31
941충청북도 충주시한마음농장 2한우금가면762023-08-31
942충청북도 충주시화영농장2한우주덕읍2852023-08-31
943충청북도 충주시우다농장한우수안보면122023-08-31
944충청북도 충주시동해농장염소엄정면2002023-08-31
945충청북도 충주시삼형제 농장한우대소원면82023-08-31
946충청북도 충주시효재목장한우동량면262023-08-31
947충청북도 충주시승태농원염소금가면602023-08-31
948충청북도 충주시이지영염소대소원면352023-08-31
949충청북도 충주시정상섭한우대소원면22023-08-31

Duplicate rows

Most frequently occurring

자치단체농장이름축종농장 소재지 읍면동사육두수데이터기준일자# duplicates
0충청북도 충주시거리농장한우대소원면162023-08-312
1충청북도 충주시달천농장염소연수동152023-08-312
2충청북도 충주시덕해농장한우산척면42023-08-312
3충청북도 충주시독동농장한우대소원면42023-08-312
4충청북도 충주시동해농장염소엄정면2002023-08-312
5충청북도 충주시모남농장한우신니면42023-08-312
6충청북도 충주시산돌농장염소대소원면302023-08-312
7충청북도 충주시삼형제 농장한우대소원면82023-08-312
8충청북도 충주시승태농원염소금가면602023-08-312
9충청북도 충주시우다농장한우수안보면122023-08-312