Overview

Dataset statistics

Number of variables5
Number of observations167
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.8 KiB
Average record size in memory41.8 B

Variable types

Text3
Categorical1
Numeric1

Dataset

Description2024년 2월 8일 현재 안성시 양돈농가 등록현황입니다.양돈농가의 사업장명, 주소, 사육두수, 축종 등을 제공합니다
Author경기도 안성시
URLhttps://www.data.go.kr/data/15127163/fileData.do

Alerts

축종명 has constant value ""Constant
사육두수 has 15 (9.0%) zerosZeros

Reproduction

Analysis started2024-03-16 04:11:43.180763
Analysis finished2024-03-16 04:11:44.104418
Duration0.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct158
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-16T13:11:44.279706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length4
Mean length5.3832335
Min length3

Characters and Unicode

Total characters899
Distinct characters175
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique152 ?
Unique (%)91.0%

Sample

1st row신하농장
2nd row농업회사법인 (주) 대덕
3rd row은진농장
4th row승양농장
5th row네남매농장
ValueCountFrequency (%)
농업회사법인 7
 
3.7%
태광농장 4
 
2.1%
대유농장 3
 
1.6%
농장 3
 
1.6%
주식회사 3
 
1.6%
고은농장 2
 
1.1%
농업회사법인(주 2
 
1.1%
삼죽농장 2
 
1.1%
바우농장 2
 
1.1%
해돈농장 2
 
1.1%
Other values (158) 159
84.1%
2024-03-16T13:11:44.866799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
150
 
16.7%
137
 
15.2%
22
 
2.4%
20
 
2.2%
19
 
2.1%
18
 
2.0%
17
 
1.9%
17
 
1.9%
16
 
1.8%
) 15
 
1.7%
Other values (165) 468
52.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 838
93.2%
Space Separator 22
 
2.4%
Close Punctuation 15
 
1.7%
Open Punctuation 15
 
1.7%
Decimal Number 6
 
0.7%
Uppercase Letter 2
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
150
 
17.9%
137
 
16.3%
20
 
2.4%
19
 
2.3%
18
 
2.1%
17
 
2.0%
17
 
2.0%
16
 
1.9%
14
 
1.7%
13
 
1.6%
Other values (157) 417
49.8%
Decimal Number
ValueCountFrequency (%)
2 5
83.3%
1 1
 
16.7%
Uppercase Letter
ValueCountFrequency (%)
F 1
50.0%
C 1
50.0%
Space Separator
ValueCountFrequency (%)
22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Other Punctuation
ValueCountFrequency (%)
· 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 838
93.2%
Common 59
 
6.6%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
150
 
17.9%
137
 
16.3%
20
 
2.4%
19
 
2.3%
18
 
2.1%
17
 
2.0%
17
 
2.0%
16
 
1.9%
14
 
1.7%
13
 
1.6%
Other values (157) 417
49.8%
Common
ValueCountFrequency (%)
22
37.3%
) 15
25.4%
( 15
25.4%
2 5
 
8.5%
· 1
 
1.7%
1 1
 
1.7%
Latin
ValueCountFrequency (%)
F 1
50.0%
C 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 838
93.2%
ASCII 60
 
6.7%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
150
 
17.9%
137
 
16.3%
20
 
2.4%
19
 
2.3%
18
 
2.1%
17
 
2.0%
17
 
2.0%
16
 
1.9%
14
 
1.7%
13
 
1.6%
Other values (157) 417
49.8%
ASCII
ValueCountFrequency (%)
22
36.7%
) 15
25.0%
( 15
25.0%
2 5
 
8.3%
F 1
 
1.7%
C 1
 
1.7%
1 1
 
1.7%
None
ValueCountFrequency (%)
· 1
100.0%

축종명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
돼지
167 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row돼지
2nd row돼지
3rd row돼지
4th row돼지
5th row돼지

Common Values

ValueCountFrequency (%)
돼지 167
100.0%

Length

2024-03-16T13:11:45.085464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-16T13:11:45.235040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
돼지 167
100.0%

사육두수
Real number (ℝ)

ZEROS 

Distinct89
Distinct (%)53.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2224.018
Minimum0
Maximum23000
Zeros15
Zeros (%)9.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2024-03-16T13:11:45.421425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1800
median1500
Q32768.5
95-th percentile6300
Maximum23000
Range23000
Interquartile range (IQR)1968.5

Descriptive statistics

Standard deviation2781.8002
Coefficient of variation (CV)1.2507994
Kurtosis27.801546
Mean2224.018
Median Absolute Deviation (MAD)900
Skewness4.4843756
Sum371411
Variance7738412.5
MonotonicityNot monotonic
2024-03-16T13:11:45.634050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 15
 
9.0%
1500 8
 
4.8%
2000 7
 
4.2%
3000 6
 
3.6%
800 6
 
3.6%
2500 5
 
3.0%
1000 5
 
3.0%
600 5
 
3.0%
1800 4
 
2.4%
1400 4
 
2.4%
Other values (79) 102
61.1%
ValueCountFrequency (%)
0 15
9.0%
70 1
 
0.6%
112 1
 
0.6%
300 2
 
1.2%
390 1
 
0.6%
400 2
 
1.2%
500 1
 
0.6%
550 1
 
0.6%
554 1
 
0.6%
600 5
 
3.0%
ValueCountFrequency (%)
23000 1
0.6%
20000 1
0.6%
9800 1
0.6%
9000 1
0.6%
7543 1
0.6%
7477 1
0.6%
7000 1
0.6%
6396 1
0.6%
6300 2
1.2%
6000 1
0.6%
Distinct165
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-16T13:11:45.942937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length61
Mean length23.796407
Min length18

Characters and Unicode

Total characters3974
Distinct characters127
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique163 ?
Unique (%)97.6%

Sample

1st row경기도 안성시 미양면 보촌4길 128
2nd row경기도 안성시 대덕면 신령로 223, (외 11필지)
3rd row경기도 안성시 죽산면 장계길 14-60
4th row경기도 안성시 죽산면 능앞길 181-78
5th row경기도 안성시 죽산면 장계길 14-108
ValueCountFrequency (%)
경기도 167
18.8%
안성시 167
18.8%
일죽면 69
 
7.8%
죽산면 23
 
2.6%
삼죽면 20
 
2.2%
미양면 14
 
1.6%
보개면 13
 
1.5%
13
 
1.5%
죽화로 12
 
1.3%
장암로 10
 
1.1%
Other values (283) 381
42.9%
2024-03-16T13:11:46.391827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
737
18.5%
1 190
 
4.8%
174
 
4.4%
170
 
4.3%
170
 
4.3%
168
 
4.2%
167
 
4.2%
167
 
4.2%
166
 
4.2%
- 137
 
3.4%
Other values (117) 1728
43.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2194
55.2%
Decimal Number 821
 
20.7%
Space Separator 737
 
18.5%
Dash Punctuation 137
 
3.4%
Close Punctuation 29
 
0.7%
Open Punctuation 29
 
0.7%
Other Punctuation 27
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
174
 
7.9%
170
 
7.7%
170
 
7.7%
168
 
7.7%
167
 
7.6%
167
 
7.6%
166
 
7.6%
129
 
5.9%
90
 
4.1%
76
 
3.5%
Other values (102) 717
32.7%
Decimal Number
ValueCountFrequency (%)
1 190
23.1%
2 114
13.9%
4 81
9.9%
3 81
9.9%
7 77
9.4%
5 69
 
8.4%
6 60
 
7.3%
8 60
 
7.3%
0 45
 
5.5%
9 44
 
5.4%
Space Separator
ValueCountFrequency (%)
737
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 137
100.0%
Close Punctuation
ValueCountFrequency (%)
) 29
100.0%
Open Punctuation
ValueCountFrequency (%)
( 29
100.0%
Other Punctuation
ValueCountFrequency (%)
, 27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2194
55.2%
Common 1780
44.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
174
 
7.9%
170
 
7.7%
170
 
7.7%
168
 
7.7%
167
 
7.6%
167
 
7.6%
166
 
7.6%
129
 
5.9%
90
 
4.1%
76
 
3.5%
Other values (102) 717
32.7%
Common
ValueCountFrequency (%)
737
41.4%
1 190
 
10.7%
- 137
 
7.7%
2 114
 
6.4%
4 81
 
4.6%
3 81
 
4.6%
7 77
 
4.3%
5 69
 
3.9%
6 60
 
3.4%
8 60
 
3.4%
Other values (5) 174
 
9.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2194
55.2%
ASCII 1780
44.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
737
41.4%
1 190
 
10.7%
- 137
 
7.7%
2 114
 
6.4%
4 81
 
4.6%
3 81
 
4.6%
7 77
 
4.3%
5 69
 
3.9%
6 60
 
3.4%
8 60
 
3.4%
Other values (5) 174
 
9.8%
Hangul
ValueCountFrequency (%)
174
 
7.9%
170
 
7.7%
170
 
7.7%
168
 
7.7%
167
 
7.6%
167
 
7.6%
166
 
7.6%
129
 
5.9%
90
 
4.1%
76
 
3.5%
Other values (102) 717
32.7%
Distinct166
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2024-03-16T13:11:46.736157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length68
Median length58
Mean length29.48503
Min length19

Characters and Unicode

Total characters4924
Distinct characters100
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique165 ?
Unique (%)98.8%

Sample

1st row경기도 안성시 미양면 고지리 460번지
2nd row경기도 안성시 대덕면 신령리 111번지 5호 (외 11필지)
3rd row경기도 안성시 죽산면 장계리 109번지
4th row경기도 안성시 죽산면 장계리 362번지 1호
5th row경기도 안성시 죽산면 장계리 330번지 , 330-1, 330-2, 331, 331-1, 332
ValueCountFrequency (%)
경기도 167
 
15.1%
안성시 167
 
15.1%
일죽면 69
 
6.2%
1호 39
 
3.5%
38
 
3.4%
죽산면 23
 
2.1%
22
 
2.0%
삼죽면 20
 
1.8%
미양면 14
 
1.3%
장암리 13
 
1.2%
Other values (322) 535
48.3%
2024-03-16T13:11:47.660831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1198
24.3%
228
 
4.6%
1 207
 
4.2%
173
 
3.5%
172
 
3.5%
170
 
3.5%
168
 
3.4%
168
 
3.4%
168
 
3.4%
167
 
3.4%
Other values (90) 2105
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2602
52.8%
Space Separator 1198
24.3%
Decimal Number 948
 
19.3%
Other Punctuation 79
 
1.6%
Dash Punctuation 61
 
1.2%
Close Punctuation 18
 
0.4%
Open Punctuation 18
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
228
 
8.8%
173
 
6.6%
172
 
6.6%
170
 
6.5%
168
 
6.5%
168
 
6.5%
168
 
6.5%
167
 
6.4%
167
 
6.4%
166
 
6.4%
Other values (75) 855
32.9%
Decimal Number
ValueCountFrequency (%)
1 207
21.8%
3 124
13.1%
4 112
11.8%
2 108
11.4%
5 83
8.8%
6 74
 
7.8%
8 72
 
7.6%
0 63
 
6.6%
7 55
 
5.8%
9 50
 
5.3%
Space Separator
ValueCountFrequency (%)
1198
100.0%
Other Punctuation
ValueCountFrequency (%)
, 79
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 61
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2602
52.8%
Common 2322
47.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
228
 
8.8%
173
 
6.6%
172
 
6.6%
170
 
6.5%
168
 
6.5%
168
 
6.5%
168
 
6.5%
167
 
6.4%
167
 
6.4%
166
 
6.4%
Other values (75) 855
32.9%
Common
ValueCountFrequency (%)
1198
51.6%
1 207
 
8.9%
3 124
 
5.3%
4 112
 
4.8%
2 108
 
4.7%
5 83
 
3.6%
, 79
 
3.4%
6 74
 
3.2%
8 72
 
3.1%
0 63
 
2.7%
Other values (5) 202
 
8.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2602
52.8%
ASCII 2322
47.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1198
51.6%
1 207
 
8.9%
3 124
 
5.3%
4 112
 
4.8%
2 108
 
4.7%
5 83
 
3.6%
, 79
 
3.4%
6 74
 
3.2%
8 72
 
3.1%
0 63
 
2.7%
Other values (5) 202
 
8.7%
Hangul
ValueCountFrequency (%)
228
 
8.8%
173
 
6.6%
172
 
6.6%
170
 
6.5%
168
 
6.5%
168
 
6.5%
168
 
6.5%
167
 
6.4%
167
 
6.4%
166
 
6.4%
Other values (75) 855
32.9%

Interactions

2024-03-16T13:11:43.446930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-03-16T13:11:43.578237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-16T13:11:44.040308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

농장명축종명사육두수소재지도로명주소소재지지번주소
0신하농장돼지2500경기도 안성시 미양면 보촌4길 128경기도 안성시 미양면 고지리 460번지
1농업회사법인 (주) 대덕돼지3600경기도 안성시 대덕면 신령로 223, (외 11필지)경기도 안성시 대덕면 신령리 111번지 5호 (외 11필지)
2은진농장돼지1300경기도 안성시 죽산면 장계길 14-60경기도 안성시 죽산면 장계리 109번지
3승양농장돼지2511경기도 안성시 죽산면 능앞길 181-78경기도 안성시 죽산면 장계리 362번지 1호
4네남매농장돼지4059경기도 안성시 죽산면 장계길 14-108경기도 안성시 죽산면 장계리 330번지 , 330-1, 330-2, 331, 331-1, 332
5진영농장돼지0경기도 안성시 죽산면 목동길 47-23경기도 안성시 죽산면 당목리 666번지 1호 외 8필지
6진성농장돼지1000경기도 안성시 고삼면 한내로 553-22경기도 안성시 고삼면 대갈리 398번지
7문호농장돼지1400경기도 안성시 일죽면 소라태길 86경기도 안성시 일죽면 월정리 402번지 외 5필지
8민우농장돼지0경기도 안성시 일죽면 일생로 205-99경기도 안성시 일죽면 금산리 510번지 4호 , 510-5
9학열 농장돼지1680경기도 안성시 원곡면 성주리 221번지경기도 안성시 원곡면 성주리 221번지
농장명축종명사육두수소재지도로명주소소재지지번주소
157새마을농장돼지0경기도 안성시 일죽면 죽화로 433경기도 안성시 일죽면 신흥리 686번지 4호 번지 외 3필지
158농업회사법인(주) 팜스월드 광일농장돼지2800경기도 안성시 삼죽면 동아예대길 99경기도 안성시 삼죽면 진촌리 310번지 3호
159C·F돼지554경기도 안성시 죽산면 걸미로 478-12경기도 안성시 죽산면 당목리 970번지 3호
160청해농장돼지614경기도 안성시 죽산면 능앞길 129-16경기도 안성시 죽산면 장능리 69번지 1호
161행복농장돼지0경기도 안성시 일죽면 장암로 93경기도 안성시 일죽면 장암리 425번지 1호
162동산비육장돼지0경기도 안성시 삼죽면 상삼로 121경기도 안성시 삼죽면 내장리 210번지 1호 , 210-2, 209-5, 209-6, 209-7
163세민농장2돼지300경기도 안성시 고삼면 쌍지길 215-42경기도 안성시 고삼면 쌍지리 302번지 4호
164덕산농장돼지1400경기도 안성시 삼죽면 삼죽초교길 157경기도 안성시 삼죽면 덕산리 455번지
165능국농장돼지660경기도 안성시 일죽면 지내동길 20경기도 안성시 일죽면 능국리 745-2, 745-3
166민근농장돼지2818경기도 안성시 미양면 중앙배수1길 128-37경기도 안성시 미양면 정동리 506번지 2호