Overview

Dataset statistics

Number of variables3
Number of observations1047
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory24.7 KiB
Average record size in memory24.1 B

Variable types

Text2
Categorical1

Dataset

Description충청남도 당진시 축산 관련 정보입니다.(제공 컬럼 : 축산 농장명, 축종,농장주소)제공 날짜 : 2024-04-25
Author충청남도 당진시
URLhttps://www.data.go.kr/data/15021573/fileData.do

Alerts

Dataset has 1 (0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2024-04-29 22:24:38.357446
Analysis finished2024-04-29 22:24:39.088244
Duration0.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct981
Distinct (%)93.7%
Missing0
Missing (%)0.0%
Memory size8.3 KiB
2024-04-30T07:24:39.267814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length4
Mean length4.5931232
Min length2

Characters and Unicode

Total characters4809
Distinct characters315
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique926 ?
Unique (%)88.4%

Sample

1st row서경한우목장
2nd row만수목장
3rd row화곡목장
4th row호윤선농장
5th row샛터농장
ValueCountFrequency (%)
농장 26
 
2.4%
목장 5
 
0.5%
대성농장 5
 
0.5%
서해농장 5
 
0.5%
농업회사법인 4
 
0.4%
행정농장 3
 
0.3%
운곡농장 3
 
0.3%
우리농장 3
 
0.3%
혜훈*농장 3
 
0.3%
주식회사 3
 
0.3%
Other values (980) 1034
94.5%
2024-04-30T07:24:39.641232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
983
20.4%
827
 
17.2%
178
 
3.7%
83
 
1.7%
76
 
1.6%
55
 
1.1%
* 55
 
1.1%
54
 
1.1%
52
 
1.1%
51
 
1.1%
Other values (305) 2395
49.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4676
97.2%
Other Punctuation 56
 
1.2%
Space Separator 47
 
1.0%
Uppercase Letter 12
 
0.2%
Open Punctuation 9
 
0.2%
Close Punctuation 9
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
983
21.0%
827
 
17.7%
178
 
3.8%
83
 
1.8%
76
 
1.6%
55
 
1.2%
54
 
1.2%
52
 
1.1%
51
 
1.1%
51
 
1.1%
Other values (291) 2266
48.5%
Uppercase Letter
ValueCountFrequency (%)
O 3
25.0%
E 2
16.7%
R 1
 
8.3%
T 1
 
8.3%
N 1
 
8.3%
B 1
 
8.3%
F 1
 
8.3%
M 1
 
8.3%
K 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
* 55
98.2%
. 1
 
1.8%
Space Separator
ValueCountFrequency (%)
47
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4676
97.2%
Common 121
 
2.5%
Latin 12
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
983
21.0%
827
 
17.7%
178
 
3.8%
83
 
1.8%
76
 
1.6%
55
 
1.2%
54
 
1.2%
52
 
1.1%
51
 
1.1%
51
 
1.1%
Other values (291) 2266
48.5%
Latin
ValueCountFrequency (%)
O 3
25.0%
E 2
16.7%
R 1
 
8.3%
T 1
 
8.3%
N 1
 
8.3%
B 1
 
8.3%
F 1
 
8.3%
M 1
 
8.3%
K 1
 
8.3%
Common
ValueCountFrequency (%)
* 55
45.5%
47
38.8%
( 9
 
7.4%
) 9
 
7.4%
. 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4676
97.2%
ASCII 133
 
2.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
983
21.0%
827
 
17.7%
178
 
3.8%
83
 
1.8%
76
 
1.6%
55
 
1.2%
54
 
1.2%
52
 
1.1%
51
 
1.1%
51
 
1.1%
Other values (291) 2266
48.5%
ASCII
ValueCountFrequency (%)
* 55
41.4%
47
35.3%
( 9
 
6.8%
) 9
 
6.8%
O 3
 
2.3%
E 2
 
1.5%
R 1
 
0.8%
T 1
 
0.8%
N 1
 
0.8%
B 1
 
0.8%
Other values (4) 4
 
3.0%

축종
Categorical

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size8.3 KiB
810 
돼지
127 
110 

Length

Max length2
Median length1
Mean length1.1212989
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
810
77.4%
돼지 127
 
12.1%
110
 
10.5%

Length

2024-04-30T07:24:39.798276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:24:39.904698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
810
77.4%
돼지 127
 
12.1%
110
 
10.5%
Distinct797
Distinct (%)76.1%
Missing0
Missing (%)0.0%
Memory size8.3 KiB
2024-04-30T07:24:40.186545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length95
Median length85
Mean length34.690544
Min length14

Characters and Unicode

Total characters36321
Distinct characters129
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique647 ?
Unique (%)61.8%

Sample

1st row충청남도 당진시 대호지면 두산리 산 ***번지
2nd row충청남도 당진시 합덕읍 소소리 ***번지 *호
3rd row충청남도 당진시 합덕읍 석우리 ***번지 **호
4th row충청남도 당진시 고대면 장항리 ***-*번지 외*필지
5th row충청남도 당진시 석문면 초락도리 ***번지 *호 ,***-*,***-*,***-*,***-*
ValueCountFrequency (%)
1578
20.8%
충청남도 1047
13.8%
당진시 1047
13.8%
번지 1042
13.8%
720
9.5%
고대면 161
 
2.1%
순성면 145
 
1.9%
합덕읍 142
 
1.9%
신평면 104
 
1.4%
면천면 74
 
1.0%
Other values (154) 1514
20.0%
2024-04-30T07:24:40.598861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 9218
25.4%
8107
22.3%
, 1330
 
3.7%
1141
 
3.1%
1110
 
3.1%
1101
 
3.0%
- 1100
 
3.0%
1089
 
3.0%
1079
 
3.0%
1058
 
2.9%
Other values (119) 9988
27.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16508
45.5%
Other Punctuation 10548
29.0%
Space Separator 8107
22.3%
Dash Punctuation 1100
 
3.0%
Close Punctuation 28
 
0.1%
Open Punctuation 28
 
0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1141
 
6.9%
1110
 
6.7%
1101
 
6.7%
1089
 
6.6%
1079
 
6.5%
1058
 
6.4%
1051
 
6.4%
1047
 
6.3%
1042
 
6.3%
975
 
5.9%
Other values (111) 5815
35.2%
Other Punctuation
ValueCountFrequency (%)
* 9218
87.4%
, 1330
 
12.6%
Uppercase Letter
ValueCountFrequency (%)
A 1
50.0%
C 1
50.0%
Space Separator
ValueCountFrequency (%)
8107
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1100
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 19811
54.5%
Hangul 16508
45.5%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1141
 
6.9%
1110
 
6.7%
1101
 
6.7%
1089
 
6.6%
1079
 
6.5%
1058
 
6.4%
1051
 
6.4%
1047
 
6.3%
1042
 
6.3%
975
 
5.9%
Other values (111) 5815
35.2%
Common
ValueCountFrequency (%)
* 9218
46.5%
8107
40.9%
, 1330
 
6.7%
- 1100
 
5.6%
) 28
 
0.1%
( 28
 
0.1%
Latin
ValueCountFrequency (%)
A 1
50.0%
C 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 19813
54.5%
Hangul 16508
45.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 9218
46.5%
8107
40.9%
, 1330
 
6.7%
- 1100
 
5.6%
) 28
 
0.1%
( 28
 
0.1%
A 1
 
< 0.1%
C 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
1141
 
6.9%
1110
 
6.7%
1101
 
6.7%
1089
 
6.6%
1079
 
6.5%
1058
 
6.4%
1051
 
6.4%
1047
 
6.3%
1042
 
6.3%
975
 
5.9%
Other values (111) 5815
35.2%

Missing values

2024-04-30T07:24:38.972802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T07:24:39.054583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

농장명축종농장주소
0서경한우목장충청남도 당진시 대호지면 두산리 산 ***번지
1만수목장충청남도 당진시 합덕읍 소소리 ***번지 *호
2화곡목장충청남도 당진시 합덕읍 석우리 ***번지 **호
3호윤선농장충청남도 당진시 고대면 장항리 ***-*번지 외*필지
4샛터농장충청남도 당진시 석문면 초락도리 ***번지 *호 ,***-*,***-*,***-*,***-*
5도문한우농장충청남도 당진시 송산면 도문리 ***번지 *호 , ***, ***-*, ***, ***-*, ***-*
6손영호농장충청남도 당진시 고대면 장항리 ***-*, ***-*번지
7철탑농장충청남도 당진시 석문면 교로리 ***번지
8아라농장충청남도 당진시 석문면 교로리 ***번지 , ***
9지희목장충청남도 당진시 합덕읍 소소리 **번지 **호
농장명축종농장주소
1037바른농장충청남도 당진시 고대면 당진포리 ****번지 ****, ****, ****, ****, ****, ****
1038제경농장충청남도 당진시 대호지면 송전리 산 **번지
1039대명농장충청남도 당진시 대호지면 마중리 *번지 *호 양계장
1040호선농장충청남도 당진시 정미면 승산리 ***번지 *호 ,***-**, ***-**, ***-**
1041일성농장충청남도 당진시 합덕읍 도곡리 ***번지 *호 , ***-**, ***-**
1042기린 고대농장충청남도 당진시 고대면 당진포리 ***번지 *호 , ***-*, ***-*, ***-*, ***-*, ***-**
1043효원농장충청남도 당진시 면천면 문봉리 ***번지 **호 , ***-**, ***-**
1044수훈농장충청남도 당진시 신평면 부수리 ***번지 *호 , ***-**, ***-**, ***-**, ***-**
1045원농장충청남도 당진시 합덕읍 성동리 산 **번지 *호
1046잠언농장충청남도 당진시 순성면 본리 **번지 **호

Duplicate rows

Most frequently occurring

농장명축종농장주소# duplicates
0영선*농장충청남도 당진시 순성면 중방리 **번지 *호2