Overview

Dataset statistics

Number of variables4
Number of observations204
Missing cells9
Missing cells (%)1.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.5 KiB
Average record size in memory32.6 B

Variable types

Text3
Categorical1

Dataset

Description경기도 포천시 양돈농가 데이터로 사업장명, 등록축종, 소재지주소(도로명), 소재지주소(지번) 정보를 제공합니다.
Author경기도 포천시
URLhttps://www.data.go.kr/data/15127148/fileData.do

Alerts

등록축종 has constant value ""Constant
소재지주소(지번) has 9 (4.4%) missing valuesMissing

Reproduction

Analysis started2024-03-16 04:15:02.723517
Analysis finished2024-03-16 04:15:03.475813
Duration0.75 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct186
Distinct (%)91.2%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-03-16T13:15:03.775141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length4
Mean length4.9607843
Min length1

Characters and Unicode

Total characters1012
Distinct characters196
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique177 ?
Unique (%)86.8%

Sample

1st row내가리농장
2nd row팔팔농장
3rd row보성농장
4th row지성농장
5th row성운축산
ValueCountFrequency (%)
10
 
4.5%
농업회사법인 7
 
3.2%
주식회사 4
 
1.8%
한비축산 3
 
1.4%
신흥농장 2
 
0.9%
동암영농조합법인 2
 
0.9%
브니엘농장 2
 
0.9%
새여리팜 2
 
0.9%
양문농장 2
 
0.9%
정상농장 2
 
0.9%
Other values (182) 186
83.8%
2024-03-16T13:15:04.453964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
167
 
16.5%
148
 
14.6%
28
 
2.8%
21
 
2.1%
21
 
2.1%
19
 
1.9%
18
 
1.8%
18
 
1.8%
17
 
1.7%
16
 
1.6%
Other values (186) 539
53.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 941
93.0%
Space Separator 18
 
1.8%
Decimal Number 12
 
1.2%
Dash Punctuation 11
 
1.1%
Open Punctuation 9
 
0.9%
Close Punctuation 9
 
0.9%
Lowercase Letter 7
 
0.7%
Uppercase Letter 5
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
167
 
17.7%
148
 
15.7%
28
 
3.0%
21
 
2.2%
21
 
2.2%
19
 
2.0%
18
 
1.9%
17
 
1.8%
16
 
1.7%
15
 
1.6%
Other values (168) 471
50.1%
Decimal Number
ValueCountFrequency (%)
2 7
58.3%
1 2
 
16.7%
9 1
 
8.3%
5 1
 
8.3%
3 1
 
8.3%
Uppercase Letter
ValueCountFrequency (%)
F 1
20.0%
C 1
20.0%
J 1
20.0%
D 1
20.0%
Y 1
20.0%
Lowercase Letter
ValueCountFrequency (%)
r 2
28.6%
m 2
28.6%
a 2
28.6%
f 1
14.3%
Space Separator
ValueCountFrequency (%)
18
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 941
93.0%
Common 59
 
5.8%
Latin 12
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
167
 
17.7%
148
 
15.7%
28
 
3.0%
21
 
2.2%
21
 
2.2%
19
 
2.0%
18
 
1.9%
17
 
1.8%
16
 
1.7%
15
 
1.6%
Other values (168) 471
50.1%
Common
ValueCountFrequency (%)
18
30.5%
- 11
18.6%
( 9
15.3%
) 9
15.3%
2 7
 
11.9%
1 2
 
3.4%
9 1
 
1.7%
5 1
 
1.7%
3 1
 
1.7%
Latin
ValueCountFrequency (%)
r 2
16.7%
m 2
16.7%
a 2
16.7%
F 1
8.3%
C 1
8.3%
J 1
8.3%
f 1
8.3%
D 1
8.3%
Y 1
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 941
93.0%
ASCII 71
 
7.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
167
 
17.7%
148
 
15.7%
28
 
3.0%
21
 
2.2%
21
 
2.2%
19
 
2.0%
18
 
1.9%
17
 
1.8%
16
 
1.7%
15
 
1.6%
Other values (168) 471
50.1%
ASCII
ValueCountFrequency (%)
18
25.4%
- 11
15.5%
( 9
12.7%
) 9
12.7%
2 7
 
9.9%
1 2
 
2.8%
r 2
 
2.8%
m 2
 
2.8%
a 2
 
2.8%
F 1
 
1.4%
Other values (8) 8
11.3%

등록축종
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
돼지
204 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row돼지
2nd row돼지
3rd row돼지
4th row돼지
5th row돼지

Common Values

ValueCountFrequency (%)
돼지 204
100.0%

Length

2024-03-16T13:15:04.745928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-16T13:15:04.913870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
돼지 204
100.0%
Distinct203
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2024-03-16T13:15:05.306966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length59
Mean length28.882353
Min length17

Characters and Unicode

Total characters5892
Distinct characters103
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique202 ?
Unique (%)99.0%

Sample

1st row경기도 포천시 소흘읍 고모리 488번지 3호
2nd row경기도 포천시 창수면 가양리 155번지 2호
3rd row경기도 포천시 창수면 주원리 85번지 4호
4th row경기도 포천시 창수면 주원리 85번지 2호
5th row경기도 포천시 창수면 가양리 256번지 4호
ValueCountFrequency (%)
경기도 204
 
16.2%
포천시 204
 
16.2%
일동면 41
 
3.3%
창수면 38
 
3.0%
1호 37
 
2.9%
이동면 29
 
2.3%
영중면 24
 
1.9%
주원리 24
 
1.9%
2호 23
 
1.8%
영북면 21
 
1.7%
Other values (328) 615
48.8%
2024-03-16T13:15:06.013099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1400
23.8%
244
 
4.1%
1 227
 
3.9%
209
 
3.5%
207
 
3.5%
204
 
3.5%
204
 
3.5%
204
 
3.5%
204
 
3.5%
204
 
3.5%
Other values (93) 2585
43.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3074
52.2%
Space Separator 1400
23.8%
Decimal Number 1103
 
18.7%
Dash Punctuation 138
 
2.3%
Other Punctuation 123
 
2.1%
Open Punctuation 27
 
0.5%
Close Punctuation 27
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
244
 
7.9%
209
 
6.8%
207
 
6.7%
204
 
6.6%
204
 
6.6%
204
 
6.6%
204
 
6.6%
204
 
6.6%
202
 
6.6%
200
 
6.5%
Other values (77) 992
32.3%
Decimal Number
ValueCountFrequency (%)
1 227
20.6%
3 138
12.5%
4 130
11.8%
2 130
11.8%
5 112
10.2%
6 98
8.9%
7 78
 
7.1%
8 72
 
6.5%
0 63
 
5.7%
9 55
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 117
95.1%
. 6
 
4.9%
Space Separator
ValueCountFrequency (%)
1400
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 138
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3074
52.2%
Common 2818
47.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
244
 
7.9%
209
 
6.8%
207
 
6.7%
204
 
6.6%
204
 
6.6%
204
 
6.6%
204
 
6.6%
204
 
6.6%
202
 
6.6%
200
 
6.5%
Other values (77) 992
32.3%
Common
ValueCountFrequency (%)
1400
49.7%
1 227
 
8.1%
- 138
 
4.9%
3 138
 
4.9%
4 130
 
4.6%
2 130
 
4.6%
, 117
 
4.2%
5 112
 
4.0%
6 98
 
3.5%
7 78
 
2.8%
Other values (6) 250
 
8.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3074
52.2%
ASCII 2818
47.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1400
49.7%
1 227
 
8.1%
- 138
 
4.9%
3 138
 
4.9%
4 130
 
4.6%
2 130
 
4.6%
, 117
 
4.2%
5 112
 
4.0%
6 98
 
3.5%
7 78
 
2.8%
Other values (6) 250
 
8.9%
Hangul
ValueCountFrequency (%)
244
 
7.9%
209
 
6.8%
207
 
6.7%
204
 
6.6%
204
 
6.6%
204
 
6.6%
204
 
6.6%
204
 
6.6%
202
 
6.6%
200
 
6.5%
Other values (77) 992
32.3%
Distinct188
Distinct (%)96.4%
Missing9
Missing (%)4.4%
Memory size1.7 KiB
2024-03-16T13:15:06.491314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length47
Mean length24.866667
Min length18

Characters and Unicode

Total characters4849
Distinct characters127
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique181 ?
Unique (%)92.8%

Sample

1st row경기도 포천시 소흘읍 죽엽산로237번길 21-41
2nd row경기도 포천시 창수면 가영로150번길 74
3rd row경기도 포천시 창수면 옥수로298번길 52
4th row경기도 포천시 창수면 옥수로298번길 56
5th row경기도 포천시 창수면 가영로 281-26
ValueCountFrequency (%)
경기도 195
19.2%
포천시 195
19.2%
일동면 37
 
3.6%
창수면 37
 
3.6%
이동면 28
 
2.8%
영중면 22
 
2.2%
영북면 21
 
2.1%
관인면 15
 
1.5%
신북면 12
 
1.2%
무리울길 11
 
1.1%
Other values (302) 441
43.5%
2024-03-16T13:15:07.280154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
819
 
16.9%
1 209
 
4.3%
201
 
4.1%
201
 
4.1%
198
 
4.1%
196
 
4.0%
195
 
4.0%
195
 
4.0%
191
 
3.9%
2 168
 
3.5%
Other values (117) 2276
46.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2669
55.0%
Decimal Number 1119
23.1%
Space Separator 819
 
16.9%
Dash Punctuation 123
 
2.5%
Other Punctuation 43
 
0.9%
Close Punctuation 38
 
0.8%
Open Punctuation 38
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
201
 
7.5%
201
 
7.5%
198
 
7.4%
196
 
7.3%
195
 
7.3%
195
 
7.3%
191
 
7.2%
160
 
6.0%
126
 
4.7%
98
 
3.7%
Other values (98) 908
34.0%
Decimal Number
ValueCountFrequency (%)
1 209
18.7%
2 168
15.0%
3 130
11.6%
5 110
9.8%
4 94
8.4%
6 93
8.3%
8 87
7.8%
7 83
 
7.4%
9 73
 
6.5%
0 72
 
6.4%
Other Punctuation
ValueCountFrequency (%)
, 40
93.0%
. 2
 
4.7%
* 1
 
2.3%
Close Punctuation
ValueCountFrequency (%)
) 37
97.4%
] 1
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 37
97.4%
[ 1
 
2.6%
Space Separator
ValueCountFrequency (%)
819
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 123
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2669
55.0%
Common 2180
45.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
201
 
7.5%
201
 
7.5%
198
 
7.4%
196
 
7.3%
195
 
7.3%
195
 
7.3%
191
 
7.2%
160
 
6.0%
126
 
4.7%
98
 
3.7%
Other values (98) 908
34.0%
Common
ValueCountFrequency (%)
819
37.6%
1 209
 
9.6%
2 168
 
7.7%
3 130
 
6.0%
- 123
 
5.6%
5 110
 
5.0%
4 94
 
4.3%
6 93
 
4.3%
8 87
 
4.0%
7 83
 
3.8%
Other values (9) 264
 
12.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2669
55.0%
ASCII 2180
45.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
819
37.6%
1 209
 
9.6%
2 168
 
7.7%
3 130
 
6.0%
- 123
 
5.6%
5 110
 
5.0%
4 94
 
4.3%
6 93
 
4.3%
8 87
 
4.0%
7 83
 
3.8%
Other values (9) 264
 
12.1%
Hangul
ValueCountFrequency (%)
201
 
7.5%
201
 
7.5%
198
 
7.4%
196
 
7.3%
195
 
7.3%
195
 
7.3%
191
 
7.2%
160
 
6.0%
126
 
4.7%
98
 
3.7%
Other values (98) 908
34.0%

Missing values

2024-03-16T13:15:03.215551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-16T13:15:03.407245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명등록축종소재지주소(도로명)소재지주소(지번)
0내가리농장돼지경기도 포천시 소흘읍 고모리 488번지 3호경기도 포천시 소흘읍 죽엽산로237번길 21-41
1팔팔농장돼지경기도 포천시 창수면 가양리 155번지 2호경기도 포천시 창수면 가영로150번길 74
2보성농장돼지경기도 포천시 창수면 주원리 85번지 4호경기도 포천시 창수면 옥수로298번길 52
3지성농장돼지경기도 포천시 창수면 주원리 85번지 2호경기도 포천시 창수면 옥수로298번길 56
4성운축산돼지경기도 포천시 창수면 가양리 256번지 4호경기도 포천시 창수면 가영로 281-26
5덕암농장돼지경기도 포천시 창수면 운산리 18번지 1호 ,-2경기도 포천시 창수면 방골길 182
6세일농장돼지경기도 포천시 창수면 가양리 256번지 2호 -1, -14경기도 포천시 창수면 가영로 281-38
7덕영농장돼지경기도 포천시 일동면 길명리 538번지 1호 ,-4,-5,-6,-8,-10,537번지경기도 포천시 일동면 정자골길 92
8수용농장돼지경기도 포천시 영북면 자일리 461번지 등 3필지(-1,459-2)경기도 포천시 영북면 호국로4232번길 45-29, 등 3필지(자일리 461,-1,459-2)
9샘터농장돼지경기도 포천시 화현면 지현리 134번지경기도 포천시 화현면 봉화로709번길 32
사업장명등록축종소재지주소(도로명)소재지주소(지번)
194다돈농장돼지경기도 포천시 영중면 영송리 204번지 24호경기도 포천시 영중면 은잿말길 226
195한탄강스마트팜돼지경기도 포천시 영북면 자일리 52번지 1호경기도 포천시 영북면 호국로4350번길 136-51
196호암농장돼지경기도 포천시 일동면 사직리 1626번지경기도 포천시 일동면 앵바위길 222-153
197지앤알팜사계돼지경기도 포천시 일동면 사직리 1424번지경기도 포천시 일동면 새낭로215번길 68-213
198무리울농장돼지경기도 포천시 일동면 화대리 14번지 1호 14-2, 13, 13-2, 15-1경기도 포천시 일동면 무리울길 245-14
199진우농장돼지경기도 포천시 창수면 주원리 619번지 ,-1,621,-3,871-56,-58경기도 포천시 창수면 옥수로327번길 51-1
20095Farm돼지경기도 포천시 창수면 주원리 29번지 4호경기도 포천시 창수면 옥수로214번길 112
201(주)로고농업회사법인돼지경기도 포천시 창수면 주원리 465번지 ,-1,467,산186,산187-3<NA>
202동암농장돼지경기도 포천시 영중면 영송리 685번지 7호 684-9,-13경기도 포천시 영중면 가영로535번길 63, [*미고시]
203삼성농장2돼지경기도 포천시 일동면 사직리 164번지 5호경기도 포천시 일동면 사당말6길 88