Overview

Dataset statistics

Number of variables5
Number of observations61
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory43.2 B

Variable types

Text2
Categorical2
Numeric1

Dataset

Description2022년 5월 3일 기준 동해시 가축사육 축산업 등록 현황에 대한 데이터로 사업장명, 사업장주소, 주사육업종, 사육두수 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15006405/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
사육두수 is highly overall correlated with 주사육업종High correlation
주사육업종 is highly overall correlated with 사육두수High correlation
주사육업종 is highly imbalanced (62.7%)Imbalance
사업장주소 has unique valuesUnique
사육두수 has 7 (11.5%) zerosZeros

Reproduction

Analysis started2023-12-12 12:16:54.552222
Analysis finished2023-12-12 12:16:55.216370
Duration0.66 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct58
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size620.0 B
2023-12-12T21:16:55.468227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length4
Mean length4.5901639
Min length3

Characters and Unicode

Total characters280
Distinct characters104
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)90.2%

Sample

1st row창성축산
2nd row상현농장
3rd row운기농장
4th row윤성농장
5th row일출농장
ValueCountFrequency (%)
농장 4
 
6.0%
쇄운농장 2
 
3.0%
새들농장 2
 
3.0%
푸른농장 2
 
3.0%
느릅재농장 2
 
3.0%
지가농장 1
 
1.5%
창성축산 1
 
1.5%
만우농장 1
 
1.5%
윤씨농장 1
 
1.5%
래동농장 1
 
1.5%
Other values (50) 50
74.6%
2023-12-12T21:16:55.928205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
57
20.4%
57
20.4%
6
 
2.1%
6
 
2.1%
5
 
1.8%
5
 
1.8%
4
 
1.4%
4
 
1.4%
4
 
1.4%
3
 
1.1%
Other values (94) 129
46.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 271
96.8%
Space Separator 6
 
2.1%
Decimal Number 3
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
21.0%
57
21.0%
6
 
2.2%
5
 
1.8%
5
 
1.8%
4
 
1.5%
4
 
1.5%
4
 
1.5%
3
 
1.1%
3
 
1.1%
Other values (90) 123
45.4%
Decimal Number
ValueCountFrequency (%)
5 1
33.3%
8 1
33.3%
6 1
33.3%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 271
96.8%
Common 9
 
3.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
21.0%
57
21.0%
6
 
2.2%
5
 
1.8%
5
 
1.8%
4
 
1.5%
4
 
1.5%
4
 
1.5%
3
 
1.1%
3
 
1.1%
Other values (90) 123
45.4%
Common
ValueCountFrequency (%)
6
66.7%
5 1
 
11.1%
8 1
 
11.1%
6 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 271
96.8%
ASCII 9
 
3.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
57
21.0%
57
21.0%
6
 
2.2%
5
 
1.8%
5
 
1.8%
4
 
1.5%
4
 
1.5%
4
 
1.5%
3
 
1.1%
3
 
1.1%
Other values (90) 123
45.4%
ASCII
ValueCountFrequency (%)
6
66.7%
5 1
 
11.1%
8 1
 
11.1%
6 1
 
11.1%

주사육업종
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size620.0 B
한우
51 
육계
 
5
종계/산란계
 
2
사슴
 
1
면양
 
1

Length

Max length6
Median length2
Mean length2.1311475
Min length2

Unique

Unique3 ?
Unique (%)4.9%

Sample

1st row한우
2nd row한우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 51
83.6%
육계 5
 
8.2%
종계/산란계 2
 
3.3%
사슴 1
 
1.6%
면양 1
 
1.6%
타조 1
 
1.6%

Length

2023-12-12T21:16:56.134122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:16:56.294353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한우 51
83.6%
육계 5
 
8.2%
종계/산란계 2
 
3.3%
사슴 1
 
1.6%
면양 1
 
1.6%
타조 1
 
1.6%

사업장주소
Text

UNIQUE 

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size620.0 B
2023-12-12T21:16:56.568741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length23
Mean length20.245902
Min length17

Characters and Unicode

Total characters1235
Distinct characters52
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique61 ?
Unique (%)100.0%

Sample

1st row강원도 동해시 괴란동 166번지 1호
2nd row강원도 동해시 괴란동 240번지 2호
3rd row강원도 동해시 괴란동 153번지
4th row강원도 동해시 괴란동 280번지 3호
5th row강원도 동해시 심곡동 47번지 3호
ValueCountFrequency (%)
강원도 61
22.2%
동해시 61
22.2%
1호 17
 
6.2%
괴란동 12
 
4.4%
심곡동 9
 
3.3%
쇄운동 7
 
2.5%
초구동 7
 
2.5%
망상동 5
 
1.8%
발한동 5
 
1.8%
만우동 3
 
1.1%
Other values (78) 88
32.0%
2023-12-12T21:16:56.947096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
332
26.9%
122
 
9.9%
64
 
5.2%
61
 
4.9%
61
 
4.9%
61
 
4.9%
61
 
4.9%
61
 
4.9%
61
 
4.9%
1 41
 
3.3%
Other values (42) 310
25.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 702
56.8%
Space Separator 332
26.9%
Decimal Number 201
 
16.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
122
17.4%
64
9.1%
61
8.7%
61
8.7%
61
8.7%
61
8.7%
61
8.7%
61
8.7%
29
 
4.1%
12
 
1.7%
Other values (31) 109
15.5%
Decimal Number
ValueCountFrequency (%)
1 41
20.4%
2 30
14.9%
4 27
13.4%
8 21
10.4%
6 19
9.5%
3 17
8.5%
0 14
 
7.0%
7 13
 
6.5%
5 12
 
6.0%
9 7
 
3.5%
Space Separator
ValueCountFrequency (%)
332
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 702
56.8%
Common 533
43.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
122
17.4%
64
9.1%
61
8.7%
61
8.7%
61
8.7%
61
8.7%
61
8.7%
61
8.7%
29
 
4.1%
12
 
1.7%
Other values (31) 109
15.5%
Common
ValueCountFrequency (%)
332
62.3%
1 41
 
7.7%
2 30
 
5.6%
4 27
 
5.1%
8 21
 
3.9%
6 19
 
3.6%
3 17
 
3.2%
0 14
 
2.6%
7 13
 
2.4%
5 12
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 702
56.8%
ASCII 533
43.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
332
62.3%
1 41
 
7.7%
2 30
 
5.6%
4 27
 
5.1%
8 21
 
3.9%
6 19
 
3.6%
3 17
 
3.2%
0 14
 
2.6%
7 13
 
2.4%
5 12
 
2.3%
Hangul
ValueCountFrequency (%)
122
17.4%
64
9.1%
61
8.7%
61
8.7%
61
8.7%
61
8.7%
61
8.7%
61
8.7%
29
 
4.1%
12
 
1.7%
Other values (31) 109
15.5%

사육두수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct35
Distinct (%)57.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean162.52459
Minimum0
Maximum7500
Zeros7
Zeros (%)11.5%
Negative0
Negative (%)0.0%
Memory size681.0 B
2023-12-12T21:16:57.092710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15
median15
Q328
95-th percentile161
Maximum7500
Range7500
Interquartile range (IQR)23

Descriptive statistics

Standard deviation961.48119
Coefficient of variation (CV)5.9159121
Kurtosis59.32463
Mean162.52459
Median Absolute Deviation (MAD)11
Skewness7.6606073
Sum9914
Variance924446.09
MonotonicityNot monotonic
2023-12-12T21:16:57.229811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
0 7
 
11.5%
5 5
 
8.2%
3 4
 
6.6%
22 3
 
4.9%
7 3
 
4.9%
16 2
 
3.3%
100 2
 
3.3%
23 2
 
3.3%
17 2
 
3.3%
2 2
 
3.3%
Other values (25) 29
47.5%
ValueCountFrequency (%)
0 7
11.5%
2 2
 
3.3%
3 4
6.6%
4 2
 
3.3%
5 5
8.2%
6 1
 
1.6%
7 3
4.9%
9 2
 
3.3%
10 1
 
1.6%
11 1
 
1.6%
ValueCountFrequency (%)
7500 1
1.6%
820 1
1.6%
263 1
1.6%
161 1
1.6%
100 2
3.3%
94 1
1.6%
90 1
1.6%
75 1
1.6%
58 1
1.6%
38 2
3.3%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size620.0 B
2023-04-20
61 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-04-20
2nd row2023-04-20
3rd row2023-04-20
4th row2023-04-20
5th row2023-04-20

Common Values

ValueCountFrequency (%)
2023-04-20 61
100.0%

Length

2023-12-12T21:16:57.363383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:16:57.778886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-04-20 61
100.0%

Interactions

2023-12-12T21:16:54.850649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:16:57.838100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업장명주사육업종사업장주소사육두수
사업장명1.0001.0001.0001.000
주사육업종1.0001.0001.0000.839
사업장주소1.0001.0001.0001.000
사육두수1.0000.8391.0001.000
2023-12-12T21:16:57.929042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육두수주사육업종
사육두수1.0000.514
주사육업종0.5141.000

Missing values

2023-12-12T21:16:55.007431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:16:55.157673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명주사육업종사업장주소사육두수데이터기준일자
0창성축산한우강원도 동해시 괴란동 166번지 1호1612023-04-20
1상현농장한우강원도 동해시 괴란동 240번지 2호942023-04-20
2운기농장한우강원도 동해시 괴란동 153번지232023-04-20
3윤성농장한우강원도 동해시 괴란동 280번지 3호172023-04-20
4일출농장한우강원도 동해시 심곡동 47번지 3호222023-04-20
5두암농장한우강원도 동해시 괴란동 68번지272023-04-20
6동물농장한우강원도 동해시 망상동 468번지222023-04-20
7한교농장한우강원도 동해시 초구동 242번지02023-04-20
8지경상한우농장한우강원도 동해시 심곡동 357번지152023-04-20
9동해농장한우강원도 동해시 괴란동 22번지252023-04-20
사업장명주사육업종사업장주소사육두수데이터기준일자
51매밑농장한우강원도 동해시 심곡동 58번지 2호102023-04-20
52한솔농장육계강원도 동해시 지흥동 210번지 1호752023-04-20
53늘햇살농원면양강원도 동해시 만우동 68번지32023-04-20
54동해타조시티타조강원도 동해시 심곡동 419번지 1호02023-04-20
55일출한우농장한우강원도 동해시 지가동 212번지232023-04-20
56새들농장한우강원도 동해시 초구동 42번지382023-04-20
57비천농장한우강원도 동해시 비천동 270번지02023-04-20
58단봉청계농장육계강원도 동해시 단봉동 463번지 8호902023-04-20
59685 느릅재농장한우강원도 동해시 발한동 685번지72023-04-20
60매내골 농장한우강원도 동해시 이로동 1246번지 4호52023-04-20