Overview

Dataset statistics

Number of variables: 5
Number of observations: 536
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 21.6 KiB
Average record size in memory: 41.2 B

Variable types

Text: 2
Categorical: 2
Numeric: 1

Dataset

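Description: Status of livestock farming business permits and registrations within Seocheon-gun. (Provides the business name, main livestock type, number of animals raised, and business location.)
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=404&beforeMenuCd=DOM_000000201001001000&publicdatapk=15034221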

Alerts

데이터기준일자 has constant value "2023-12-15" [Constant]
주사육업종 is highly imbalanced (59.6%) [Imbalance]
사육두수 has 31 (5.8%) zeros [Zeros]

Reproduction

Analysis started: 2024-01-09 23:00:19.280800
Analysis finished: 2024-01-09 23:00:19.804843
Duration: 0.52 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
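
A report of this kind can be regenerated with a few lines of Python. The sketch below is a minimal example and not the exact invocation used for this report: `build_report` is a hypothetical helper, and the CSV file name, the `utf-8` encoding, and the report title are all assumptions.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "utf-8") -> None:
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so the function can be defined without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; Korean public datasets are sometimes cp949.
    df = pd.read_csv(csv_path, encoding=encoding)
    # The title is an assumed label, not taken from the original report.
    profile = ProfileReport(df, title="Seocheon-gun livestock farms")
    profile.to_file(html_path)


# Example call with assumed file names:
# build_report("seocheon_livestock.csv", "report.html")
```

Passing a different `encoding` is the first thing to try if the Korean column names come out garbled.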

Variables

사업장명칭
Text

Distinct: 524
Distinct (%): 97.8%
Missing: 0
Missing (%): 0.0%
Memory size: 4.3 KiB

Length

Max length: 20
Median length: 5
Mean length: 4.9757463
Min length: 3

Characters and Unicode

Total characters: 2667
Distinct characters: 262
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
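
As a rough illustration of how characters can be grouped by Unicode block, the sketch below classifies each character by its code point range. It is an assumption-laden approximation: only the ASCII range and the Hangul Syllables block (U+AC00 to U+D7AF) are recognised, the block names may not match the labels the profiling tool uses, and `block_of` and `block_counts` are hypothetical helpers.

```python
from collections import Counter


def block_of(ch: str) -> str:
    """Return a coarse Unicode block label for one character."""
    cp = ord(ch)
    if cp < 0x80:
        return "ASCII"
    if 0xAC00 <= cp <= 0xD7AF:
        return "Hangul Syllables"
    return "Other"  # anything this sketch does not recognise


def block_counts(values):
    """Count characters per block across an iterable of strings."""
    return Counter(block_of(ch) for value in values for ch in value)


# Three business names taken from the sample rows of this report.
names = ["랑평농장", "우애3농장", "이창재 농장"]
counts = block_counts(names)
print(counts)
```

Digits and spaces fall in the ASCII range, which is why a column of Korean names can still report two distinct blocks.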

Unique

Unique: 514
Unique (%): 95.9%

Sample

1st row: 랑평농장
2nd row: 백영기농장
3rd row: 회현한돈영농조합
4th row: 나송학목장
5th row: 백운농장

Value | Count | Frequency (%)
농장 | 32 | 5.5%
목장 | 4 | 0.7%
제2농장 | 4 | 0.7%
에덴농장 | 3 | 0.5%
판교농장 | 3 | 0.5%
온새미로 | 2 | 0.3%
한우 | 2 | 0.3%
이권희 | 2 | 0.3%
함남주 | 2 | 0.3%
에코원흑염소 | 2 | 0.3%
Other values (523) | 531 | 90.5%

Most occurring characters

ValueCountFrequency (%)
516
 
19.3%
448
 
16.8%
68
 
2.5%
60
 
2.2%
54
 
2.0%
51
 
1.9%
38
 
1.4%
38
 
1.4%
30
 
1.1%
28
 
1.0%
Other values (252) 1336
50.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2587
97.0%
Space Separator 51
 
1.9%
Decimal Number 23
 
0.9%
Open Punctuation 3
 
0.1%
Close Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
516
19.9%
448
 
17.3%
68
 
2.6%
60
 
2.3%
54
 
2.1%
38
 
1.5%
38
 
1.5%
30
 
1.2%
28
 
1.1%
25
 
1.0%
Other values (246) 1282
49.6%
Decimal Number
ValueCountFrequency (%)
2 18
78.3%
3 4
 
17.4%
1 1
 
4.3%
Space Separator
ValueCountFrequency (%)
51
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2587
97.0%
Common 80
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
516
19.9%
448
 
17.3%
68
 
2.6%
60
 
2.3%
54
 
2.1%
38
 
1.5%
38
 
1.5%
30
 
1.2%
28
 
1.1%
25
 
1.0%
Other values (246) 1282
49.6%
Common
ValueCountFrequency (%)
51
63.7%
2 18
 
22.5%
3 4
 
5.0%
( 3
 
3.8%
) 3
 
3.8%
1 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2587
97.0%
ASCII 80
 
3.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
516
19.9%
448
 
17.3%
68
 
2.6%
60
 
2.3%
54
 
2.1%
38
 
1.5%
38
 
1.5%
30
 
1.2%
28
 
1.1%
25
 
1.0%
Other values (246) 1282
49.6%
ASCII
ValueCountFrequency (%)
51
63.7%
2 18
 
22.5%
3 4
 
5.0%
( 3
 
3.8%
) 3
 
3.8%
1 1
 
1.2%

주사육업종
Categorical

IMBALANCE 

Distinct: 9
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 4.3 KiB

한우: 422
육계: 42
젖소: 25
돼지: 15
산양: 10
Other values (4): 22

Length

Max length: 6
Median length: 2
Mean length: 2.0522388
Min length: 2

Unique

Unique: 1
Unique (%): 0.2%

Sample

1st row: 한우
2nd row: 육계
3rd row: 돼지
4th row: 한우
5th row: 한우

Common Values

Value | Count | Frequency (%)
한우 | 422 | 78.7%
육계 | 42 | 7.8%
젖소 | 25 | 4.7%
돼지 | 15 | 2.8%
산양 | 10 | 1.9%
종계/산란계 | 7 | 1.3%
사슴 | 7 | 1.3%
염소 | 7 | 1.3%
육우 | 1 | 0.2%
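
The 59.6% imbalance figure reported for this variable is consistent with an entropy-based score: one minus the Shannon entropy of the class frequencies divided by the maximum possible entropy for nine classes. The sketch below assumes that definition (it is checked only against the counts above, not against the tool's source), and `imbalance_score` is a hypothetical helper.

```python
import math

# Class counts copied from the common-values table for 주사육업종.
counts = {
    "한우": 422, "육계": 42, "젖소": 25, "돼지": 15, "산양": 10,
    "종계/산란계": 7, "사슴": 7, "염소": 7, "육우": 1,
}


def imbalance_score(class_counts):
    """Return 1 - H / log2(k): 0 for a uniform split, near 1 for one dominant class."""
    total = sum(class_counts)
    k = len(class_counts)
    if k < 2:
        return 0.0  # a single class has no spread to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in class_counts if c > 0)
    return 1.0 - entropy / math.log2(k)


score = imbalance_score(list(counts.values()))
print(f"{score:.1%}")  # 59.6%
```

Because 한우 alone accounts for 422 of 536 rows, the entropy sits far below its maximum, which pushes the score well above zero.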

Length

Histogram of lengths of the category


사업장소재지(지번)
Text

Distinct: 516
Distinct (%): 96.3%
Missing: 0
Missing (%): 0.0%
Memory size: 4.3 KiB

Length

Max length: 70
Median length: 64
Mean length: 26.643657
Min length: 4

Characters and Unicode

Total characters: 14281
Distinct characters: 137
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
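
The category breakdown for a text column can be approximated with Python's standard `unicodedata` module, which returns a two-letter general category per character. The sketch below is illustrative only: the mapping to long names covers just the seven categories listed in this report, `category_counts` is a hypothetical helper, and the input is a single address string of the kind this column holds.

```python
import unicodedata
from collections import Counter

# Long names for the general categories that occur in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the two-letter code for unmapped categories.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


address = "충청남도 서천군 종천면 랑평리 86번지 , 300-4"
counts = category_counts([address])
for name, n in counts.most_common():
    print(name, n)
```

Hangul syllables are classified as Other Letter, so addresses mix that category with digits, spaces, commas, and hyphens.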

Unique

Unique: 506
Unique (%): 94.4%

Sample

1st row: 충청남도 서천군 종천면 랑평리 86번지 , 300-4
2nd row: 충청남도 서천군 서면 신합리 252번지 3호
3rd row: 충청남도 서천군 장항읍 원수리 16번지 11호
4th row: 충청남도 서천군 마서면 남전리 517번지 2호
5th row: 충청남도 서천군 종천면 종천리 76번지 외 3필지

Value | Count | Frequency (%)
충청남도 | 524 | 17.1%
서천군 | 524 | 17.1%
1호 | 85 | 2.8%
판교면 | 74 | 2.4%
화양면 | 63 | 2.1%
마서면 | 54 | 1.8%
2호 | 54 | 1.8%
마산면 | 52 | 1.7%
기산면 | 49 | 1.6%
문산면 | 45 | 1.5%
Other values (631) | 1545 | 50.3%

Most occurring characters

ValueCountFrequency (%)
3538
24.8%
641
 
4.5%
586
 
4.1%
544
 
3.8%
541
 
3.8%
540
 
3.8%
536
 
3.8%
524
 
3.7%
524
 
3.7%
524
 
3.7%
Other values (127) 5783
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8117
56.8%
Space Separator 3538
24.8%
Decimal Number 2344
 
16.4%
Other Punctuation 146
 
1.0%
Dash Punctuation 132
 
0.9%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
641
 
7.9%
586
 
7.2%
544
 
6.7%
541
 
6.7%
540
 
6.7%
536
 
6.6%
524
 
6.5%
524
 
6.5%
524
 
6.5%
497
 
6.1%
Other values (112) 2660
32.8%
Decimal Number
ValueCountFrequency (%)
1 463
19.8%
2 365
15.6%
3 286
12.2%
4 279
11.9%
5 220
9.4%
7 180
 
7.7%
6 176
 
7.5%
9 144
 
6.1%
0 127
 
5.4%
8 104
 
4.4%
Space Separator
ValueCountFrequency (%)
3538
100.0%
Other Punctuation
ValueCountFrequency (%)
, 146
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 132
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8117
56.8%
Common 6164
43.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
641
 
7.9%
586
 
7.2%
544
 
6.7%
541
 
6.7%
540
 
6.7%
536
 
6.6%
524
 
6.5%
524
 
6.5%
524
 
6.5%
497
 
6.1%
Other values (112) 2660
32.8%
Common
ValueCountFrequency (%)
3538
57.4%
1 463
 
7.5%
2 365
 
5.9%
3 286
 
4.6%
4 279
 
4.5%
5 220
 
3.6%
7 180
 
2.9%
6 176
 
2.9%
, 146
 
2.4%
9 144
 
2.3%
Other values (5) 367
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8117
56.8%
ASCII 6164
43.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3538
57.4%
1 463
 
7.5%
2 365
 
5.9%
3 286
 
4.6%
4 279
 
4.5%
5 220
 
3.6%
7 180
 
2.9%
6 176
 
2.9%
, 146
 
2.4%
9 144
 
2.3%
Other values (5) 367
 
6.0%
Hangul
ValueCountFrequency (%)
641
 
7.9%
586
 
7.2%
544
 
6.7%
541
 
6.7%
540
 
6.7%
536
 
6.6%
524
 
6.5%
524
 
6.5%
524
 
6.5%
497
 
6.1%
Other values (112) 2660
32.8%

사육두수
Real number (ℝ)

ZEROS 

Distinct: 151
Distinct (%): 28.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3172.5019
Minimum: 0
Maximum: 100000
Zeros: 31
Zeros (%): 5.8%
Negative: 0
Negative (%): 0.0%
Memory size: 4.8 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 7
Median: 25.5
Q3: 80
95-th percentile: 26250
Maximum: 100000
Range: 100000
Interquartile range (IQR): 73

Descriptive statistics

Standard deviation: 13035.793
Coefficient of variation (CV): 4.1089947
Kurtosis: 25.099277
Mean: 3172.5019
Median Absolute Deviation (MAD): 21.5
Skewness: 4.8589092
Sum: 1700461
Variance: 1.699319 × 10^8
Monotonicity: Not monotonic
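
Several of the figures above are derived from one another, so they can be cross-checked with simple arithmetic. The sketch below recomputes the mean, coefficient of variation, variance, IQR, range, and zero percentage from the reported sum, row count, standard deviation, and quartiles. The inputs are the rounded values printed in this report, so the results agree only up to rounding.

```python
# Reported inputs for 사육두수, copied from this report.
n_rows = 536
total = 1700461
std_dev = 13035.793
q1, q3 = 7, 80
minimum, maximum = 0, 100000
zeros = 31

mean = total / n_rows              # sum divided by number of observations
cv = std_dev / mean                # coefficient of variation
variance = std_dev ** 2            # variance is the squared standard deviation
iqr = q3 - q1                      # interquartile range
value_range = maximum - minimum
zeros_pct = 100 * zeros / n_rows   # share of rows equal to zero

print(f"mean={mean:.4f} cv={cv:.4f} variance={variance:.6e}")
print(f"iqr={iqr} range={value_range} zeros={zeros_pct:.1f}%")
```

A CV above 4 alongside a median of 25.5 reflects a heavily right-skewed column: a small number of very large values pull the mean far above the typical farm.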
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
0 | 31 | 5.8%
30 | 25 | 4.7%
3 | 23 | 4.3%
5 | 23 | 4.3%
2 | 20 | 3.7%
20 | 19 | 3.5%
4 | 18 | 3.4%
10 | 14 | 2.6%
40 | 13 | 2.4%
25 | 12 | 2.2%
Other values (141) | 338 | 63.1%

Minimum 10 values

Value | Count | Frequency (%)
0 | 31 | 5.8%
1 | 5 | 0.9%
2 | 20 | 3.7%
3 | 23 | 4.3%
4 | 18 | 3.4%
5 | 23 | 4.3%
6 | 6 | 1.1%
7 | 12 | 2.2%
8 | 11 | 2.1%
9 | 7 | 1.3%

Maximum 10 values

Value | Count | Frequency (%)
100000 | 2 | 0.4%
88000 | 1 | 0.2%
80000 | 2 | 0.4%
75050 | 1 | 0.2%
70000 | 1 | 0.2%
65000 | 1 | 0.2%
60000 | 4 | 0.7%
50027 | 1 | 0.2%
50000 | 1 | 0.2%
45000 | 1 | 0.2%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.3 KiB

2023-12-15: 536

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-12-15
2nd row: 2023-12-15
3rd row: 2023-12-15
4th row: 2023-12-15
5th row: 2023-12-15

Common Values

Value | Count | Frequency (%)
2023-12-15 | 536 | 100.0%

Length

Histogram of lengths of the category


Interactions


Correlations

Variable | 주사육업종 | 사육두수
주사육업종 | 1.000 | 0.562
사육두수 | 0.562 | 1.000

Variable | 사육두수 | 주사육업종
사육두수 | 1.000 | 0.296
주사육업종 | 0.296 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 사업장명칭 | 주사육업종 | 사업장소재지(지번) | 사육두수 | 데이터기준일자
0랑평농장한우충청남도 서천군 종천면 랑평리 86번지 , 300-4252023-12-15
1백영기농장육계충청남도 서천군 서면 신합리 252번지 3호300002023-12-15
2회현한돈영농조합돼지충청남도 서천군 장항읍 원수리 16번지 11호10002023-12-15
3나송학목장한우충청남도 서천군 마서면 남전리 517번지 2호282023-12-15
4백운농장한우충청남도 서천군 종천면 종천리 76번지 외 3필지1202023-12-15
5두혁중농장돼지충청남도 서천군 장항읍 옥남리 580번지 4호21502023-12-15
6형제농장한우충청남도 서천군 종천면 산천리 462번지2502023-12-15
7장용돈농장육계충청남도 서천군 종천면 당정리 295번지 2호500272023-12-15
8박갑수농원한우충청남도 서천군 기산면 화산리 247번지 51호 247-48, 247-63, 247-70, 247-52742023-12-15
9나상헌농장한우충청남도 서천군 기산면 신산리 413번지 1호252023-12-15

Index | 사업장명칭 | 주사육업종 | 사업장소재지(지번) | 사육두수 | 데이터기준일자
526와요팜한우충청남도 서천군 화양면 장상리 99번지 1호 , 99-23232023-12-15
527이창재 농장한우충청남도 서천군 시초면 봉선리 306번지32023-12-15
528김동환농장한우충청남도 서천군 서천읍 화금리 328번지 2호32023-12-15
529이형재농장한우충청남도 서천군 서면 주항리 188번지 1호162023-12-15
530동산농장염소충청남도 서천군 한산면 동지리 452번지602023-12-15
531우애3농장한우충청남도 서천군 화양면 봉명리 593번지1632023-12-15
532금강2농장한우충청남도 서천군 화양면 봉명리 570번지 2호1632023-12-15
533금강3농장한우충청남도 서천군 화양면 봉명리 507번지 3호52023-12-15
534김훈경농장한우충청남도 서천군 마서면 송석리 372번지 7호 , 372-6242023-12-15
535저거리농장한우충청남도 서천군 화양면 기복리 260번지32023-12-15