Overview

Dataset statistics

Number of variables5
Number of observations1234
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory50.7 KiB
Average record size in memory42.1 B

Variable types

Numeric2
Text1
Categorical2

Dataset

Description충청북도 제천시의 가축사육업 현황입니다. 사업장명칭, 주사육업종, 사육두수, 데이터기준일자의 자료를 제공합니다.
Author충청북도 제천시
URLhttps://www.data.go.kr/data/15033911/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
주사육업종 is highly imbalanced (59.2%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 11:59:45.569740
Analysis finished2024-03-14 11:59:47.514561
Duration1.94 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1234
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean617.5
Minimum1
Maximum1234
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.0 KiB
2024-03-14T20:59:47.746883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile62.65
Q1309.25
median617.5
Q3925.75
95-th percentile1172.35
Maximum1234
Range1233
Interquartile range (IQR)616.5

Descriptive statistics

Standard deviation356.36942
Coefficient of variation (CV)0.57711648
Kurtosis-1.2
Mean617.5
Median Absolute Deviation (MAD)308.5
Skewness0
Sum761995
Variance126999.17
MonotonicityStrictly increasing
2024-03-14T20:59:48.198886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
822 1
 
0.1%
829 1
 
0.1%
828 1
 
0.1%
827 1
 
0.1%
826 1
 
0.1%
825 1
 
0.1%
824 1
 
0.1%
823 1
 
0.1%
821 1
 
0.1%
Other values (1224) 1224
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1234 1
0.1%
1233 1
0.1%
1232 1
0.1%
1231 1
0.1%
1230 1
0.1%
1229 1
0.1%
1228 1
0.1%
1227 1
0.1%
1226 1
0.1%
1225 1
0.1%
Distinct986
Distinct (%)79.9%
Missing0
Missing (%)0.0%
Memory size9.8 KiB
2024-03-14T20:59:49.555781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length3
Mean length3.6904376
Min length2

Characters and Unicode

Total characters4554
Distinct characters303
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique787 ?
Unique (%)63.8%

Sample

1st row영일농장
2nd row아름농장
3rd row보래농장
4th row송왕농장
5th row적덕농장
ValueCountFrequency (%)
부자농장 6
 
0.5%
형제농장 6
 
0.5%
우리농장 5
 
0.4%
지선목장 5
 
0.4%
장평농장 5
 
0.4%
초원농장 4
 
0.3%
명도농장 4
 
0.3%
행운농장 4
 
0.3%
우성축산 3
 
0.2%
양화목장 3
 
0.2%
Other values (981) 1196
96.4%
2024-03-14T20:59:51.099874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
567
 
12.5%
489
 
10.7%
142
 
3.1%
115
 
2.5%
91
 
2.0%
88
 
1.9%
72
 
1.6%
64
 
1.4%
63
 
1.4%
58
 
1.3%
Other values (293) 2805
61.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4505
98.9%
Decimal Number 32
 
0.7%
Space Separator 7
 
0.2%
Close Punctuation 5
 
0.1%
Open Punctuation 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
567
 
12.6%
489
 
10.9%
142
 
3.2%
115
 
2.6%
91
 
2.0%
88
 
2.0%
72
 
1.6%
64
 
1.4%
63
 
1.4%
58
 
1.3%
Other values (285) 2756
61.2%
Decimal Number
ValueCountFrequency (%)
2 24
75.0%
3 3
 
9.4%
1 3
 
9.4%
5 1
 
3.1%
4 1
 
3.1%
Space Separator
ValueCountFrequency (%)
7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4505
98.9%
Common 49
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
567
 
12.6%
489
 
10.9%
142
 
3.2%
115
 
2.6%
91
 
2.0%
88
 
2.0%
72
 
1.6%
64
 
1.4%
63
 
1.4%
58
 
1.3%
Other values (285) 2756
61.2%
Common
ValueCountFrequency (%)
2 24
49.0%
7
 
14.3%
) 5
 
10.2%
( 5
 
10.2%
3 3
 
6.1%
1 3
 
6.1%
5 1
 
2.0%
4 1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4505
98.9%
ASCII 49
 
1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
567
 
12.6%
489
 
10.9%
142
 
3.2%
115
 
2.6%
91
 
2.0%
88
 
2.0%
72
 
1.6%
64
 
1.4%
63
 
1.4%
58
 
1.3%
Other values (285) 2756
61.2%
ASCII
ValueCountFrequency (%)
2 24
49.0%
7
 
14.3%
) 5
 
10.2%
( 5
 
10.2%
3 3
 
6.1%
1 3
 
6.1%
5 1
 
2.0%
4 1
 
2.0%

주사육업종
Categorical

IMBALANCE 

Distinct20
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size9.8 KiB
닭-토종닭
611 
소-한우
429 
염소
129 
칠면조
 
21
돼지
 
15
Other values (15)
 
29

Length

Max length14
Median length5
Mean length4.3079417
Min length2

Unique

Unique9 ?
Unique (%)0.7%

Sample

1st row돼지
2nd row돼지
3rd row돼지
4th row돼지
5th row돼지

Common Values

ValueCountFrequency (%)
닭-토종닭 611
49.5%
소-한우 429
34.8%
염소 129
 
10.5%
칠면조 21
 
1.7%
돼지 15
 
1.2%
닭-육계 4
 
0.3%
닭-산란계 4
 
0.3%
소-한우,소-육우 4
 
0.3%
사슴-엘크 3
 
0.2%
소-젖소 3
 
0.2%
Other values (10) 11
 
0.9%

Length

2024-03-14T20:59:51.350713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
닭-토종닭 611
49.5%
소-한우 429
34.8%
염소 129
 
10.5%
칠면조 21
 
1.7%
돼지 15
 
1.2%
닭-육계 4
 
0.3%
닭-산란계 4
 
0.3%
소-한우,소-육우 4
 
0.3%
사슴-엘크 3
 
0.2%
소-젖소 3
 
0.2%
Other values (10) 11
 
0.9%

사육두수
Real number (ℝ)

Distinct128
Distinct (%)10.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean529.14668
Minimum1
Maximum150000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.0 KiB
2024-03-14T20:59:51.579707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q15
median11
Q325
95-th percentile90
Maximum150000
Range149999
Interquartile range (IQR)20

Descriptive statistics

Standard deviation6200.5528
Coefficient of variation (CV)11.718023
Kurtosis327.25095
Mean529.14668
Median Absolute Deviation (MAD)7
Skewness16.53142
Sum652967
Variance38446855
MonotonicityNot monotonic
2024-03-14T20:59:51.841046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 88
 
7.1%
2 80
 
6.5%
5 77
 
6.2%
4 70
 
5.7%
3 66
 
5.3%
6 62
 
5.0%
7 54
 
4.4%
15 53
 
4.3%
20 48
 
3.9%
8 43
 
3.5%
Other values (118) 593
48.1%
ValueCountFrequency (%)
1 32
 
2.6%
2 80
6.5%
3 66
5.3%
4 70
5.7%
5 77
6.2%
6 62
5.0%
7 54
4.4%
8 43
3.5%
9 24
 
1.9%
10 88
7.1%
ValueCountFrequency (%)
150000 1
0.1%
79600 1
0.1%
74200 1
0.1%
64000 1
0.1%
45000 1
0.1%
44500 1
0.1%
44000 1
0.1%
42000 1
0.1%
27500 1
0.1%
25000 1
0.1%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.8 KiB
2024-02-28
1234 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-02-28
2nd row2024-02-28
3rd row2024-02-28
4th row2024-02-28
5th row2024-02-28

Common Values

ValueCountFrequency (%)
2024-02-28 1234
100.0%

Length

2024-03-14T20:59:52.138993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:59:52.340311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-02-28 1234
100.0%

Interactions

2024-03-14T20:59:46.432099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:59:45.889642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:59:46.705593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:59:46.155725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T20:59:52.442163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주사육업종사육두수
연번1.0000.8510.007
주사육업종0.8511.0000.753
사육두수0.0070.7531.000
2024-03-14T20:59:52.589454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사육두수주사육업종
연번1.000-0.2750.449
사육두수-0.2751.0000.471
주사육업종0.4490.4711.000

Missing values

2024-03-14T20:59:47.064960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T20:59:47.382734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업장명칭주사육업종사육두수데이터기준일자
01영일농장돼지1702024-02-28
12아름농장돼지362024-02-28
23보래농장돼지772024-02-28
34송왕농장돼지502024-02-28
45적덕농장돼지250002024-02-28
56선경농장돼지4002024-02-28
67부성농장돼지796002024-02-28
78마루축산돼지402024-02-28
89대원축산돼지11002024-02-28
910용호농장돼지872024-02-28
연번사업장명칭주사육업종사육두수데이터기준일자
12241225고석광칠면조42024-02-28
12251226현대농장칠면조102024-02-28
12261227이희천칠면조192024-02-28
12271228양천용칠면조12024-02-28
12281229심현옥칠면조12024-02-28
12291230박문수칠면조52024-02-28
12301231우성축산칠면조12024-02-28
12311232주금선칠면조22024-02-28
12321233엄영복칠면조32024-02-28
12331234박영진칠면조32024-02-28