Overview

Dataset statistics

Number of variables7
Number of observations201
Missing cells20
Missing cells (%)1.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.3 KiB
Average record size in memory57.7 B

Variable types

Categorical3
Text3
Numeric1

Dataset

Description경기도 포천시에서 제공하는 가금류 축산농가 현황(시군명, 농장명, 축종명, 구분, 사육두수, 소재지도로명주소, 소재지지번주소 등) 데이터 입니다.
Author경기도 포천시
URLhttps://www.data.go.kr/data/15034236/fileData.do

Alerts

시군명 has constant value ""Constant
축종명 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 축종명High correlation
축종명 is highly imbalanced (83.6%)Imbalance
소재지도로명주소 has 20 (10.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 16:12:29.354673
Analysis finished2023-12-12 16:12:30.071327
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
포천시
201 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row포천시
2nd row포천시
3rd row포천시
4th row포천시
5th row포천시

Common Values

ValueCountFrequency (%)
포천시 201
100.0%

Length

2023-12-13T01:12:30.131393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:12:30.246861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
포천시 201
100.0%
Distinct167
Distinct (%)83.1%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T01:12:30.491685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length4
Mean length4.2686567
Min length1

Characters and Unicode

Total characters858
Distinct characters190
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique153 ?
Unique (%)76.1%

Sample

1st row희망농장
2nd row흥연농장
3rd row화현농장
4th row화현농장
5th row화현농장
ValueCountFrequency (%)
16
 
7.8%
계림농장 4
 
1.9%
대성농장 4
 
1.9%
화현농장 3
 
1.5%
자일농장 3
 
1.5%
햇빛농장 2
 
1.0%
영농조합법인 2
 
1.0%
초원농장 2
 
1.0%
하늘농장 2
 
1.0%
늘푸른농장 2
 
1.0%
Other values (160) 166
80.6%
2023-12-13T01:12:30.956573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
169
19.7%
155
 
18.1%
19
 
2.2%
18
 
2.1%
- 16
 
1.9%
15
 
1.7%
14
 
1.6%
12
 
1.4%
11
 
1.3%
11
 
1.3%
Other values (180) 418
48.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 817
95.2%
Dash Punctuation 16
 
1.9%
Close Punctuation 6
 
0.7%
Open Punctuation 6
 
0.7%
Decimal Number 6
 
0.7%
Space Separator 5
 
0.6%
Uppercase Letter 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
169
20.7%
155
19.0%
19
 
2.3%
18
 
2.2%
15
 
1.8%
14
 
1.7%
12
 
1.5%
11
 
1.3%
11
 
1.3%
11
 
1.3%
Other values (170) 382
46.8%
Decimal Number
ValueCountFrequency (%)
3 2
33.3%
2 2
33.3%
1 1
16.7%
5 1
16.7%
Uppercase Letter
ValueCountFrequency (%)
L 1
50.0%
A 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 817
95.2%
Common 39
 
4.5%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
169
20.7%
155
19.0%
19
 
2.3%
18
 
2.2%
15
 
1.8%
14
 
1.7%
12
 
1.5%
11
 
1.3%
11
 
1.3%
11
 
1.3%
Other values (170) 382
46.8%
Common
ValueCountFrequency (%)
- 16
41.0%
) 6
 
15.4%
( 6
 
15.4%
5
 
12.8%
3 2
 
5.1%
2 2
 
5.1%
1 1
 
2.6%
5 1
 
2.6%
Latin
ValueCountFrequency (%)
L 1
50.0%
A 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 817
95.2%
ASCII 41
 
4.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
169
20.7%
155
19.0%
19
 
2.3%
18
 
2.2%
15
 
1.8%
14
 
1.7%
12
 
1.5%
11
 
1.3%
11
 
1.3%
11
 
1.3%
Other values (170) 382
46.8%
ASCII
ValueCountFrequency (%)
- 16
39.0%
) 6
 
14.6%
( 6
 
14.6%
5
 
12.2%
3 2
 
4.9%
2 2
 
4.9%
1 1
 
2.4%
L 1
 
2.4%
A 1
 
2.4%
5 1
 
2.4%

축종명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
192 
오리
 
5
타조
 
2
메추리
 
2

Length

Max length3
Median length1
Mean length1.0547264
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
192
95.5%
오리 5
 
2.5%
타조 2
 
1.0%
메추리 2
 
1.0%

Length

2023-12-13T01:12:31.120179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:12:31.280028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
192
95.5%
오리 5
 
2.5%
타조 2
 
1.0%
메추리 2
 
1.0%

구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
육계
100 
산란계
96 
오리
 
5

Length

Max length3
Median length2
Mean length2.4776119
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row산란계
2nd row육계
3rd row육계
4th row육계
5th row육계

Common Values

ValueCountFrequency (%)
육계 100
49.8%
산란계 96
47.8%
오리 5
 
2.5%

Length

2023-12-13T01:12:31.402338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:12:31.514597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
육계 100
49.8%
산란계 96
47.8%
오리 5
 
2.5%

사육두수
Real number (ℝ)

Distinct97
Distinct (%)48.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47965.766
Minimum0
Maximum451000
Zeros2
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-13T01:12:31.670958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile60
Q114000
median30000
Q351300
95-th percentile180000
Maximum451000
Range451000
Interquartile range (IQR)37300

Descriptive statistics

Standard deviation64490.322
Coefficient of variation (CV)1.3445073
Kurtosis11.976259
Mean47965.766
Median Absolute Deviation (MAD)20000
Skewness3.1198278
Sum9641119
Variance4.1590017 × 109
MonotonicityNot monotonic
2023-12-13T01:12:31.843108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
30000 18
 
9.0%
50000 14
 
7.0%
20000 12
 
6.0%
25000 10
 
5.0%
40000 8
 
4.0%
35000 7
 
3.5%
60000 6
 
3.0%
10000 5
 
2.5%
70000 5
 
2.5%
2000 4
 
2.0%
Other values (87) 112
55.7%
ValueCountFrequency (%)
0 2
1.0%
6 1
0.5%
20 1
0.5%
30 1
0.5%
34 1
0.5%
40 1
0.5%
48 1
0.5%
52 1
0.5%
54 1
0.5%
60 2
1.0%
ValueCountFrequency (%)
451000 1
0.5%
350000 1
0.5%
308000 1
0.5%
270000 1
0.5%
264600 1
0.5%
250000 1
0.5%
246230 1
0.5%
220000 1
0.5%
200000 1
0.5%
185800 1
0.5%
Distinct165
Distinct (%)91.2%
Missing20
Missing (%)10.0%
Memory size1.7 KiB
2023-12-13T01:12:32.149552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length27
Mean length22.243094
Min length17

Characters and Unicode

Total characters4026
Distinct characters108
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique150 ?
Unique (%)82.9%

Sample

1st row경기도 포천시 창수면 옥수로214번길 86
2nd row경기도 포천시 일동면 새낭로 218
3rd row경기도 포천시 일동면 정자골1길 64-132
4th row경기도 포천시 일동면 정자골1길 64-132
5th row경기도 포천시 화현면 금강로 4035-83
ValueCountFrequency (%)
경기도 181
20.1%
포천시 181
20.1%
영북면 37
 
4.1%
창수면 26
 
2.9%
영중면 25
 
2.8%
신북면 22
 
2.4%
호국로 17
 
1.9%
가산면 13
 
1.4%
일동면 12
 
1.3%
이동면 11
 
1.2%
Other values (246) 376
41.7%
2023-12-13T01:12:32.631211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
720
17.9%
190
 
4.7%
190
 
4.7%
184
 
4.6%
182
 
4.5%
181
 
4.5%
181
 
4.5%
168
 
4.2%
1 167
 
4.1%
2 130
 
3.2%
Other values (98) 1733
43.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2348
58.3%
Decimal Number 889
 
22.1%
Space Separator 720
 
17.9%
Dash Punctuation 69
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
190
 
8.1%
190
 
8.1%
184
 
7.8%
182
 
7.8%
181
 
7.7%
181
 
7.7%
168
 
7.2%
129
 
5.5%
120
 
5.1%
81
 
3.4%
Other values (86) 742
31.6%
Decimal Number
ValueCountFrequency (%)
1 167
18.8%
2 130
14.6%
3 105
11.8%
4 91
10.2%
7 79
8.9%
5 76
8.5%
6 72
8.1%
8 63
 
7.1%
9 61
 
6.9%
0 45
 
5.1%
Space Separator
ValueCountFrequency (%)
720
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 69
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2348
58.3%
Common 1678
41.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
190
 
8.1%
190
 
8.1%
184
 
7.8%
182
 
7.8%
181
 
7.7%
181
 
7.7%
168
 
7.2%
129
 
5.5%
120
 
5.1%
81
 
3.4%
Other values (86) 742
31.6%
Common
ValueCountFrequency (%)
720
42.9%
1 167
 
10.0%
2 130
 
7.7%
3 105
 
6.3%
4 91
 
5.4%
7 79
 
4.7%
5 76
 
4.5%
6 72
 
4.3%
- 69
 
4.1%
8 63
 
3.8%
Other values (2) 106
 
6.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2348
58.3%
ASCII 1678
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
720
42.9%
1 167
 
10.0%
2 130
 
7.7%
3 105
 
6.3%
4 91
 
5.4%
7 79
 
4.7%
5 76
 
4.5%
6 72
 
4.3%
- 69
 
4.1%
8 63
 
3.8%
Other values (2) 106
 
6.3%
Hangul
ValueCountFrequency (%)
190
 
8.1%
190
 
8.1%
184
 
7.8%
182
 
7.8%
181
 
7.7%
181
 
7.7%
168
 
7.2%
129
 
5.5%
120
 
5.1%
81
 
3.4%
Other values (86) 742
31.6%
Distinct190
Distinct (%)94.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T01:12:33.009857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length37
Mean length22.761194
Min length16

Characters and Unicode

Total characters4575
Distinct characters105
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique179 ?
Unique (%)89.1%

Sample

1st row경기도 포천시 창수면 주원리 19번지
2nd row경기도 포천시 일동면 사직리 954-9번지
3rd row경기도 포천시 일동면 길명리 82번지 (외1필지(82))
4th row경기도 포천시 일동면 길명리 82번지
5th row경기도 포천시 화현면 화현리 282번지
ValueCountFrequency (%)
경기도 201
19.4%
포천시 201
19.4%
영북면 40
 
3.9%
창수면 30
 
2.9%
영중면 28
 
2.7%
자일리 25
 
2.4%
신북면 22
 
2.1%
주원리 18
 
1.7%
일동면 15
 
1.4%
가산면 13
 
1.3%
Other values (261) 443
42.8%
2023-12-13T01:12:33.478901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
835
18.3%
218
 
4.8%
202
 
4.4%
202
 
4.4%
201
 
4.4%
201
 
4.4%
201
 
4.4%
201
 
4.4%
201
 
4.4%
195
 
4.3%
Other values (95) 1918
41.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2870
62.7%
Space Separator 835
 
18.3%
Decimal Number 734
 
16.0%
Dash Punctuation 85
 
1.9%
Open Punctuation 23
 
0.5%
Close Punctuation 23
 
0.5%
Other Punctuation 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
218
 
7.6%
202
 
7.0%
202
 
7.0%
201
 
7.0%
201
 
7.0%
201
 
7.0%
201
 
7.0%
201
 
7.0%
195
 
6.8%
185
 
6.4%
Other values (80) 863
30.1%
Decimal Number
ValueCountFrequency (%)
1 119
16.2%
4 102
13.9%
2 99
13.5%
3 84
11.4%
5 64
8.7%
7 64
8.7%
6 57
7.8%
8 57
7.8%
9 52
7.1%
0 36
 
4.9%
Space Separator
ValueCountFrequency (%)
835
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 85
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2870
62.7%
Common 1705
37.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
218
 
7.6%
202
 
7.0%
202
 
7.0%
201
 
7.0%
201
 
7.0%
201
 
7.0%
201
 
7.0%
201
 
7.0%
195
 
6.8%
185
 
6.4%
Other values (80) 863
30.1%
Common
ValueCountFrequency (%)
835
49.0%
1 119
 
7.0%
4 102
 
6.0%
2 99
 
5.8%
- 85
 
5.0%
3 84
 
4.9%
5 64
 
3.8%
7 64
 
3.8%
6 57
 
3.3%
8 57
 
3.3%
Other values (5) 139
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2870
62.7%
ASCII 1705
37.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
835
49.0%
1 119
 
7.0%
4 102
 
6.0%
2 99
 
5.8%
- 85
 
5.0%
3 84
 
4.9%
5 64
 
3.8%
7 64
 
3.8%
6 57
 
3.3%
8 57
 
3.3%
Other values (5) 139
 
8.2%
Hangul
ValueCountFrequency (%)
218
 
7.6%
202
 
7.0%
202
 
7.0%
201
 
7.0%
201
 
7.0%
201
 
7.0%
201
 
7.0%
201
 
7.0%
195
 
6.8%
185
 
6.4%
Other values (80) 863
30.1%

Interactions

2023-12-13T01:12:29.753597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:12:33.573624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
축종명구분사육두수
축종명1.0000.6780.265
구분0.6781.0000.133
사육두수0.2650.1331.000
2023-12-13T01:12:33.656596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분축종명
구분1.0000.707
축종명0.7071.000
2023-12-13T01:12:33.740053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사육두수축종명구분
사육두수1.0000.1700.056
축종명0.1701.0000.707
구분0.0560.7071.000

Missing values

2023-12-13T01:12:29.878695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:12:30.017059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명농장명축종명구분사육두수소재지도로명주소소재지지번주소
0포천시희망농장산란계35000경기도 포천시 창수면 옥수로214번길 86경기도 포천시 창수면 주원리 19번지
1포천시흥연농장육계39000경기도 포천시 일동면 새낭로 218경기도 포천시 일동면 사직리 954-9번지
2포천시화현농장육계25000경기도 포천시 일동면 정자골1길 64-132경기도 포천시 일동면 길명리 82번지 (외1필지(82))
3포천시화현농장육계30000경기도 포천시 일동면 정자골1길 64-132경기도 포천시 일동면 길명리 82번지
4포천시화현농장육계1200경기도 포천시 화현면 금강로 4035-83경기도 포천시 화현면 화현리 282번지
5포천시홍문화농장육계60경기도 포천시 영중면 금화봉2길 117경기도 포천시 영중면 거사리 180번지 1호
6포천시혜옥농장육계10000경기도 포천시 화현면 봉화로 543경기도 포천시 화현면 명덕리 469-2번지
7포천시협동농장산란계30000경기도 포천시 신북면 호국로 2446-20경기도 포천시 신북면 만세교리 203-3번지
8포천시햇빛농장산란계15000경기도 포천시 신북면 탑신로 1181경기도 포천시 신북면 금동리 455-4번지
9포천시햇빛농장육계5000경기도 포천시 신북면 호국로 2297경기도 포천시 신북면 만세교리 354번지
시군명농장명축종명구분사육두수소재지도로명주소소재지지번주소
191포천시-육계35000경기도 포천시 영중면 물안길 53경기도 포천시 영중면 금주리 508번지 ((508))
192포천시-육계5000<NA>경기도 포천시 영중면 영송리 165번지 1호
193포천시-산란계25000경기도 포천시 영북면 호국로4350번길 21-26경기도 포천시 영북면 자일리 334번지
194포천시-육계80경기도 포천시 내촌면 금강로2753번길 22-8경기도 포천시 내촌면 소학리 172번지
195포천시-육계200경기도 포천시 신북면 삼성당4길 92-74경기도 포천시 신북면 삼성당리 27번지
196포천시-육계20경기도 포천시 영북면 호국로4350번길 154-151경기도 포천시 영북면 자일리 5번지
197포천시-육계30경기도 포천시 내촌면 작은넙고개길 77경기도 포천시 내촌면 진목리 324-1번지
198포천시-육계60경기도 포천시 소흘읍 소흘로 73경기도 포천시 소흘읍 무봉리 235-13번지
199포천시-육계34<NA>경기도 포천시 일동면 길명리 691번지 1호
200포천시-육계40경기도 포천시 일동면 정자골1길 86-93경기도 포천시 일동면 길명리 107-5번지