Overview

Dataset statistics

Number of variables: 8
Number of observations: 828
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 55.1 KiB
Average record size in memory: 68.2 B

Variable types

Categorical: 5
Numeric: 3

Dataset

Description: Livestock industry permit and registration statistics by region (축산업 허가등록 지역별 통계 현황)
Author: Gyeonggi-do (경기도)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=J1UO35AILRBV6J7U8M7W28179654&infSeq=1

Alerts

등록일자 has constant value "20220601" [Constant]
축종명 is highly overall correlated with 업종명 [High correlation]
업종명 is highly overall correlated with 축종명 [High correlation]
개소 is highly overall correlated with 규모 and 1 other field [High correlation]
규모 is highly overall correlated with 개소 and 1 other field [High correlation]
동수 is highly overall correlated with 개소 and 1 other field [High correlation]
업종명 is highly imbalanced (64.1%) [Imbalance]
규모 has 35 (4.2%) zeros [Zeros]
동수 has 40 (4.8%) zeros [Zeros]
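
The zero percentages in the alerts are the zero counts divided by the 828 observations. A minimal Python check using only figures from this report (the helper name `zero_share` is illustrative, not part of any library):

```python
# Figures taken from the report: 828 observations, 35 zeros in 규모, 40 in 동수.
N_OBSERVATIONS = 828


def zero_share(zero_count: int, total: int = N_OBSERVATIONS) -> float:
    """Return the share of zero values as a percentage of all observations."""
    return 100.0 * zero_count / total


for column, zeros in {"규모": 35, "동수": 40}.items():
    print(f"{column}: {zeros} zeros = {zero_share(zeros):.1f}%")
```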

Reproduction

Analysis started: 2023-12-10 21:29:22.592163
Analysis finished: 2023-12-10 21:29:24.540429
Duration: 1.95 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
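
A report of this kind could be regenerated roughly as follows. This is a sketch under stated assumptions, not the configuration actually used: the CSV file name, the report title, and the choice to read 등록일자 as text are all hypothetical, and the real settings are in the downloadable config.json.

```python
def build_report(csv_path: str, html_path: str) -> None:
    """Profile a CSV export of the dataset and write an HTML report."""
    # Imported lazily so this module loads even where the packages are missing.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: 등록일자 is read as text, matching its Categorical type in the report.
    df = pd.read_csv(csv_path, dtype={"등록일자": str})

    # Hypothetical title; the original report title is not recorded here.
    report = ProfileReport(df, title="Livestock permit statistics")
    report.to_file(html_path)


# Example call with hypothetical file names:
# build_report("livestock_permits.csv", "livestock_permits_report.html")
```

Pinning the library to the version listed above (v4.5.1) is likely to keep alerts and correlation defaults comparable between runs.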

Variables

등록일자 (registration date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.6 KiB

Value  Count
20220601  828

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20220601
2nd row: 20220601
3rd row: 20220601
4th row: 20220601
5th row: 20220601

Common Values

Value  Count  Frequency (%)
20220601  828  100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: Plot of the common values

시군명 (city/county name)
Categorical

Distinct: 31
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Memory size: 6.6 KiB

Value  Count
화성시  46
여주시  46
이천시  46
안성시  45
고양시  44
Other values (26)  601

Length

Max length: 4
Median length: 3
Mean length: 3.076087
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가평군
2nd row: 가평군
3rd row: 가평군
4th row: 가평군
5th row: 가평군

Common Values

Value  Count  Frequency (%)
화성시  46  5.6%
여주시  46  5.6%
이천시  46  5.6%
안성시  45  5.4%
고양시  44  5.3%
포천시  43  5.2%
평택시  43  5.2%
파주시  43  5.2%
용인시  42  5.1%
양평군  39  4.7%
Other values (21)  391  47.2%

Length

Figure: Histogram of lengths of the category

업종명 (business type)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 4
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 6.6 KiB

Value  Count
가축사육업  721
종축업  66
부화업  32
정액등처리업  9

Length

Max length: 27
Median length: 25
Mean length: 25.225845
Min length: 24

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가축사육업
2nd row: 가축사육업
3rd row: 가축사육업
4th row: 가축사육업
5th row: 가축사육업

Common Values

Value  Count  Frequency (%)
가축사육업 (livestock raising)  721  87.1%
종축업 (breeding stock)  66  8.0%
부화업 (hatchery)  32  3.9%
정액등처리업 (semen processing)  9  1.1%
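
The 64.1% imbalance figure for 업종명 can be approximated from the four counts listed for this variable. The sketch below assumes the score is one minus the Shannon entropy of the class shares, normalized by the logarithm of the number of classes. That assumption reproduces the reported value, but the function name and formula here are illustrative rather than taken from the library source.

```python
import math
from typing import Sequence


def imbalance_score(counts: Sequence[int]) -> float:
    """Return 1 - H / log(k), with H the Shannon entropy of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# 업종명 counts: 가축사육업, 종축업, 부화업, 정액등처리업.
score = imbalance_score([721, 66, 32, 9])
print(f"imbalance score: {score:.4f}")
```

A score of 0 corresponds to perfectly balanced classes and a score near 1 to a single dominant class. With 87.1% of rows in one category, a value of roughly 0.64 is in line with the alert.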

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: Plot of the common values

축종명 (livestock species)
Categorical

HIGH CORRELATION

Distinct: 25
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 6.6 KiB

Value  Count
한우  103
젖소  91
육계  83
돼지  81
육우  80
Other values (20)  390

Length

Max length: 6
Median length: 2
Mean length: 2.3429952
Min length: 1

Unique

Unique: 2
Unique (%): 0.2%

Sample

1st row: 돼지
2nd row: 돼지
3rd row: 돼지
4th row: 돼지
5th row: 면양

Common Values

Value  Count  Frequency (%)
한우 (Hanwoo cattle)  103  12.4%
젖소 (dairy cattle)  91  11.0%
육계 (broiler chickens)  83  10.0%
돼지 (pigs)  81  9.8%
육우 (beef cattle)  80  9.7%
산란계 (laying hens)  71  8.6%
오리 (ducks)  44  5.3%
<NA>  42  5.1%
사슴 (deer)  42  5.1%
염소 (goats)  40  4.8%
Other values (15)  151  18.2%

Length

Figure: Histogram of lengths of the category

영업상태 (business status)
Categorical

Distinct: 6
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 6.6 KiB

Value  Count
정상  304
폐업  281
휴업  118
말소  117
행정처분  6

Length

Max length: 4
Median length: 2
Mean length: 2.0193237
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 말소
2nd row: 정상
3rd row: 폐업
4th row: 휴업
5th row: 정상

Common Values

Value  Count  Frequency (%)
정상 (operating normally)  304  36.7%
폐업 (closed)  281  33.9%
휴업 (temporarily suspended)  118  14.3%
말소 (registration cancelled)  117  14.1%
행정처분 (administrative action)  6  0.7%
<NA>  2  0.2%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: Plot of the common values

개소 (number of establishments)
Real number (ℝ)

HIGH CORRELATION

Distinct: 131
Distinct (%): 15.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 26.471014
Minimum: 1
Maximum: 910
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 7.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 4
Q3: 16
95-th percentile: 143.65
Maximum: 910
Range: 909
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 66.828665
Coefficient of variation (CV): 2.5245978
Kurtosis: 52.939619
Mean: 26.471014
Median Absolute Deviation (MAD): 3
Skewness: 5.9293311
Sum: 21918
Variance: 4466.0705
Monotonicity: Not monotonic
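
Several of the descriptive statistics for 개소 are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. A short consistency check using only the reported figures (variable names are illustrative):

```python
# Figures copied from the 개소 statistics in this report.
n_observations = 828
total = 21918
mean = 26.471014
std_dev = 66.828665

derived_mean = total / n_observations  # Sum / number of observations
derived_cv = std_dev / mean            # Coefficient of variation (CV)
derived_variance = std_dev ** 2        # Variance = standard deviation squared

print(f"mean={derived_mean:.6f}  cv={derived_cv:.7f}  variance={derived_variance:.4f}")
```

The derived values should agree with the reported Mean, CV, and Variance up to rounding of the inputs.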
Figure: Histogram with fixed size bins (bins=50)

Common Values

Value  Count  Frequency (%)
1  216  26.1%
2  93  11.2%
3  74  8.9%
4  40  4.8%
6  39  4.7%
5  36  4.3%
9  24  2.9%
8  21  2.5%
7  20  2.4%
11  12  1.4%
Other values (121)  253  30.6%

Minimum 10 values

Value  Count  Frequency (%)
1  216  26.1%
2  93  11.2%
3  74  8.9%
4  40  4.8%
5  36  4.3%
6  39  4.7%
7  20  2.4%
8  21  2.5%
9  24  2.9%
10  11  1.3%

Maximum 10 values

Value  Count  Frequency (%)
910  1  0.1%
636  1  0.1%
498  1  0.1%
405  1  0.1%
384  1  0.1%
362  1  0.1%
350  1  0.1%
335  1  0.1%
334  1  0.1%
300  1  0.1%

규모 (scale)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 769
Distinct (%): 92.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 30169.637
Minimum: 0
Maximum: 941133.87
Zeros: 35
Zeros (%): 4.2%
Negative: 0
Negative (%): 0.0%
Memory size: 7.4 KiB

Quantile statistics

Minimum: 0
5-th percentile: 39.15
Q1: 668.25
Median: 3469.01
Q3: 17431.72
95-th percentile: 168666.94
Maximum: 941133.87
Range: 941133.87
Interquartile range (IQR): 16763.47
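
The Range and Interquartile range entries follow directly from the other quantiles: the range is maximum minus minimum, and the IQR is Q3 minus Q1. A small check with the 규모 figures from this report (variable names are illustrative):

```python
# Quantiles copied from the 규모 statistics in this report.
minimum, q1, q3, maximum = 0.0, 668.25, 17431.72, 941133.87

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)

print(f"range={value_range:.2f}  iqr={iqr:.2f}")
```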

Descriptive statistics

Standard deviation: 77545.608
Coefficient of variation (CV): 2.5703196
Kurtosis: 35.640675
Mean: 30169.637
Median Absolute Deviation (MAD): 3289.01
Skewness: 5.0311847
Sum: 24980460
Variance: 6.0133214 × 10^9
Monotonicity: Not monotonic

Figure: Histogram with fixed size bins (bins=50)

Common Values

Value  Count  Frequency (%)
0.0  35  4.2%
198.0  4  0.5%
50.0  4  0.5%
192.0  4  0.5%
150.0  3  0.4%
60.0  2  0.2%
500.0  2  0.2%
641.0  2  0.2%
125.0  2  0.2%
400.0  2  0.2%
Other values (759)  768  92.8%

Minimum 10 values

Value  Count  Frequency (%)
0.0  35  4.2%
18.91  1  0.1%
19.84  1  0.1%
20.0  1  0.1%
27.0  1  0.1%
30.0  1  0.1%
31.5  1  0.1%
36.0  1  0.1%
45.0  1  0.1%
48.0  1  0.1%

Maximum 10 values

Value  Count  Frequency (%)
941133.87  1  0.1%
556572.17  1  0.1%
503244.14  1  0.1%
491826.33  1  0.1%
483235.69  1  0.1%
469674.54  1  0.1%
459303.8  1  0.1%
456661.85  1  0.1%
449381.85  1  0.1%
391680.83  1  0.1%

동수 (number of buildings)
Real number (ℝ)

HIGH CORRELATION, ZEROS

Distinct: 205
Distinct (%): 24.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 78.63285
Minimum: 0
Maximum: 2390
Zeros: 40
Zeros (%): 4.8%
Negative: 0
Negative (%): 0.0%
Memory size: 7.4 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 3
Median: 11
Q3: 43.25
95-th percentile: 423.65
Maximum: 2390
Range: 2390
Interquartile range (IQR): 40.25

Descriptive statistics

Standard deviation: 199.48532
Coefficient of variation (CV): 2.5369209
Kurtosis: 35.486421
Mean: 78.63285
Median Absolute Deviation (MAD): 10
Skewness: 5.0403667
Sum: 65108
Variance: 39794.395
Monotonicity: Not monotonic

Figure: Histogram with fixed size bins (bins=50)
Common Values

Value  Count  Frequency (%)
1  79  9.5%
2  61  7.4%
3  49  5.9%
0  40  4.8%
5  35  4.2%
4  31  3.7%
7  30  3.6%
6  26  3.1%
8  19  2.3%
10  18  2.2%
Other values (195)  440  53.1%

Minimum 10 values

Value  Count  Frequency (%)
0  40  4.8%
1  79  9.5%
2  61  7.4%
3  49  5.9%
4  31  3.7%
5  35  4.2%
6  26  3.1%
7  30  3.6%
8  19  2.3%
9  13  1.6%

Maximum 10 values

Value  Count  Frequency (%)
2390  1  0.1%
1662  1  0.1%
1268  1  0.1%
1242  1  0.1%
1207  1  0.1%
1191  1  0.1%
1177  1  0.1%
1066  1  0.1%
995  1  0.1%
984  1  0.1%

Interactions

Figure: Interaction plots (9 panels)

Correlations

Variable  시군명  업종명  축종명  영업상태  개소  규모  동수
시군명  1.000  0.000  0.000  0.000  0.000  0.000  0.000
업종명  0.000  1.000  1.000  0.102  0.000  0.000  0.000
축종명  0.000  1.000  1.000  0.285  0.045  0.000  0.000
영업상태  0.000  0.102  0.285  1.000  0.055  0.163  0.156
개소  0.000  0.000  0.045  0.055  1.000  0.786  0.920
규모  0.000  0.000  0.000  0.163  0.786  1.000  0.867
동수  0.000  0.000  0.000  0.156  0.920  0.867  1.000

Variable  시군명  축종명  업종명  영업상태
시군명  1.000  0.000  0.000  0.000
축종명  0.000  1.000  0.986  0.140
업종명  0.000  0.986  1.000  0.083
영업상태  0.000  0.140  0.083  1.000

Variable  개소  규모  동수  시군명  업종명  축종명  영업상태
개소  1.000  0.806  0.890  0.000  0.000  0.015  0.034
규모  0.806  1.000  0.944  0.000  0.000  0.000  0.104
동수  0.890  0.944  1.000  0.000  0.000  0.000  0.095
시군명  0.000  0.000  0.000  1.000  0.000  0.000  0.000
업종명  0.000  0.000  0.000  0.000  1.000  0.986  0.083
축종명  0.015  0.000  0.000  0.000  0.986  1.000  0.140
영업상태  0.034  0.104  0.095  0.000  0.083  0.140  1.000
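
The High correlation alerts can be read off a matrix like the last one above by listing the off-diagonal pairs whose coefficient exceeds a cut-off. The sketch below assumes a threshold of 0.5, which the report does not state, and hard-codes the matrix values rather than recomputing them from the data.

```python
from itertools import combinations

THRESHOLD = 0.5  # assumed cut-off; the report itself does not state one

columns = ["개소", "규모", "동수", "시군명", "업종명", "축종명", "영업상태"]
# Rows of the last correlation matrix, in the same column order.
matrix = [
    [1.000, 0.806, 0.890, 0.000, 0.000, 0.015, 0.034],
    [0.806, 1.000, 0.944, 0.000, 0.000, 0.000, 0.104],
    [0.890, 0.944, 1.000, 0.000, 0.000, 0.000, 0.095],
    [0.000, 0.000, 0.000, 1.000, 0.000, 0.000, 0.000],
    [0.000, 0.000, 0.000, 0.000, 1.000, 0.986, 0.083],
    [0.015, 0.000, 0.000, 0.000, 0.986, 1.000, 0.140],
    [0.034, 0.104, 0.095, 0.000, 0.083, 0.140, 1.000],
]

# Visit each unordered pair once (upper triangle) and keep the strong ones.
high_pairs = [
    (columns[i], columns[j], matrix[i][j])
    for i, j in combinations(range(len(columns)), 2)
    if abs(matrix[i][j]) >= THRESHOLD
]

for left, right, value in high_pairs:
    print(f"{left} ~ {right}: {value:.3f}")
```

Under that assumed threshold the surviving pairs are the three numeric columns with each other and 업종명 with 축종명, which matches the variables named in the alerts.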

Missing values

Figure: A simple visualization of nullity by column.
Figure: The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.

Sample

Index  등록일자  시군명  업종명  축종명  영업상태  개소/규모/동수 (unseparated)
0  20220601  가평군  가축사육업  돼지  말소  63870.615
1  20220601  가평군  가축사육업  돼지  정상  512671.8445
2  20220601  가평군  가축사육업  돼지  폐업  159319.1933
3  20220601  가평군  가축사육업  돼지  휴업  25596.1914
4  20220601  가평군  가축사육업  면양  정상  1139.51
5  20220601  가평군  가축사육업  부화용알생산  폐업  1360.04
6  20220601  가평군  가축사육업  사슴  정상  101871.2313
7  20220601  가평군  가축사육업  사슴  폐업  6471.66
8  20220601  가평군  가축사육업  산란계  말소  2716.82
9  20220601  가평군  가축사육업  산란계  정상  616139.6925

Index  등록일자  시군명  업종명  축종명  영업상태  개소/규모/동수 (unseparated)
818  20220601  화성시  가축사육업  한우  휴업  2710074.7842
819  20220601  화성시  부화업  <NA>  정상  20.00
820  20220601  화성시  부화업  <NA>  폐업  50.00
821  20220601  화성시  부화업  <NA>  휴업  10.00
822  20220601  화성시  종축업  종계업  말소  13910.08
823  20220601  화성시  종축업  종계업  정상  2062221.0185
824  20220601  화성시  종축업  종계업  폐업  28117968.09149
825  20220601  화성시  종축업  종계업  휴업  37442.4212
826  20220601  화성시  종축업  종돈업  정상  213669.0113
827  20220601  화성시  종축업  종돈업  폐업  11585.03