Overview

Dataset statistics

Number of variables3
Number of observations2260
Missing cells1280
Missing cells (%)18.9%
Duplicate rows26
Duplicate rows (%)1.2%
Total size in memory53.1 KiB
Average record size in memory24.1 B

Variable types

Text3

Dataset

Description지역별 인공수정소 현황
Author농림축산식품부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220215000000001897

Alerts

Dataset has 26 (1.2%) duplicate rowsDuplicates
주소 has 50 (2.2%) missing valuesMissing
전화번호 has 1230 (54.4%) missing valuesMissing

Reproduction

Analysis started2024-04-17 20:18:08.720659
Analysis finished2024-04-17 20:18:09.341561
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

이름
Text

Distinct1739
Distinct (%)76.9%
Missing0
Missing (%)0.0%
Memory size17.8 KiB
2024-04-18T05:18:09.500944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length9
Mean length8.9075221
Min length1

Characters and Unicode

Total characters20131
Distinct characters428
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1511 ?
Unique (%)66.9%

Sample

1st row부안가축인공수정소
2nd row강화가축인공수정소
3rd row인천가축인공수정소
4th row광역가축인공수정소
5th row중앙가축인공수정소
ValueCountFrequency (%)
가축인공수정소 173
 
6.6%
인공수정소 66
 
2.5%
제일가축인공수정소 53
 
2.0%
중앙가축인공수정소 29
 
1.1%
현대가축인공수정소 26
 
1.0%
수정소 22
 
0.8%
우리가축인공수정소 19
 
0.7%
동물병원 16
 
0.6%
가축 15
 
0.6%
초원가축인공수정소 12
 
0.5%
Other values (1749) 2186
83.5%
2024-04-18T05:18:09.816174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2032
 
10.1%
2028
 
10.1%
2004
 
10.0%
1922
 
9.5%
1904
 
9.5%
1879
 
9.3%
1837
 
9.1%
358
 
1.8%
292
 
1.5%
268
 
1.3%
Other values (418) 5607
27.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19625
97.5%
Space Separator 358
 
1.8%
Lowercase Letter 55
 
0.3%
Uppercase Letter 50
 
0.2%
Decimal Number 19
 
0.1%
Close Punctuation 8
 
< 0.1%
Open Punctuation 8
 
< 0.1%
Other Punctuation 7
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2032
 
10.4%
2028
 
10.3%
2004
 
10.2%
1922
 
9.8%
1904
 
9.7%
1879
 
9.6%
1837
 
9.4%
292
 
1.5%
268
 
1.4%
225
 
1.1%
Other values (373) 5234
26.7%
Uppercase Letter
ValueCountFrequency (%)
A 10
20.0%
I 9
18.0%
E 5
10.0%
T 5
10.0%
C 4
 
8.0%
V 3
 
6.0%
B 2
 
4.0%
H 2
 
4.0%
R 2
 
4.0%
F 1
 
2.0%
Other values (7) 7
14.0%
Lowercase Letter
ValueCountFrequency (%)
i 10
18.2%
o 7
12.7%
s 6
10.9%
n 6
10.9%
r 5
9.1%
e 5
9.1%
a 3
 
5.5%
m 3
 
5.5%
k 2
 
3.6%
v 2
 
3.6%
Other values (5) 6
10.9%
Decimal Number
ValueCountFrequency (%)
3 6
31.6%
2 3
15.8%
1 3
15.8%
5 3
15.8%
8 2
 
10.5%
4 1
 
5.3%
6 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 5
71.4%
& 2
 
28.6%
Space Separator
ValueCountFrequency (%)
358
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19625
97.5%
Common 401
 
2.0%
Latin 105
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2032
 
10.4%
2028
 
10.3%
2004
 
10.2%
1922
 
9.8%
1904
 
9.7%
1879
 
9.6%
1837
 
9.4%
292
 
1.5%
268
 
1.4%
225
 
1.1%
Other values (373) 5234
26.7%
Latin
ValueCountFrequency (%)
A 10
 
9.5%
i 10
 
9.5%
I 9
 
8.6%
o 7
 
6.7%
s 6
 
5.7%
n 6
 
5.7%
r 5
 
4.8%
E 5
 
4.8%
e 5
 
4.8%
T 5
 
4.8%
Other values (22) 37
35.2%
Common
ValueCountFrequency (%)
358
89.3%
) 8
 
2.0%
( 8
 
2.0%
3 6
 
1.5%
. 5
 
1.2%
2 3
 
0.7%
1 3
 
0.7%
5 3
 
0.7%
& 2
 
0.5%
8 2
 
0.5%
Other values (3) 3
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19615
97.4%
ASCII 506
 
2.5%
Compat Jamo 10
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2032
 
10.4%
2028
 
10.3%
2004
 
10.2%
1922
 
9.8%
1904
 
9.7%
1879
 
9.6%
1837
 
9.4%
292
 
1.5%
268
 
1.4%
225
 
1.1%
Other values (372) 5224
26.6%
ASCII
ValueCountFrequency (%)
358
70.8%
A 10
 
2.0%
i 10
 
2.0%
I 9
 
1.8%
) 8
 
1.6%
( 8
 
1.6%
o 7
 
1.4%
3 6
 
1.2%
s 6
 
1.2%
n 6
 
1.2%
Other values (35) 78
 
15.4%
Compat Jamo
ValueCountFrequency (%)
10
100.0%

주소
Text

MISSING 

Distinct1543
Distinct (%)69.8%
Missing50
Missing (%)2.2%
Memory size17.8 KiB
2024-04-18T05:18:10.110308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length16
Mean length15.302262
Min length10

Characters and Unicode

Total characters33818
Distinct characters309
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1167 ?
Unique (%)52.8%

Sample

1st row전라북도 부안군 주산면 사산리
2nd row인천광역시 강화군 강화읍 남산리
3rd row인천광역시 남동구 만수동
4th row인천광역시 강화군 길상면 온수리
5th row경기도 동두천시 생연동
ValueCountFrequency (%)
경상북도 404
 
4.7%
전라남도 336
 
3.9%
경상남도 283
 
3.3%
충청남도 279
 
3.3%
경기도 270
 
3.2%
전라북도 214
 
2.5%
강원도 174
 
2.0%
충청북도 154
 
1.8%
화성시 67
 
0.8%
경주시 49
 
0.6%
Other values (2225) 6312
73.9%
2024-04-18T05:18:10.537168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6332
 
18.7%
2241
 
6.6%
1837
 
5.4%
1197
 
3.5%
1177
 
3.5%
1138
 
3.4%
1106
 
3.3%
1064
 
3.1%
874
 
2.6%
813
 
2.4%
Other values (299) 16039
47.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27479
81.3%
Space Separator 6332
 
18.7%
Decimal Number 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2241
 
8.2%
1837
 
6.7%
1197
 
4.4%
1177
 
4.3%
1138
 
4.1%
1106
 
4.0%
1064
 
3.9%
874
 
3.2%
813
 
3.0%
771
 
2.8%
Other values (295) 15261
55.5%
Decimal Number
ValueCountFrequency (%)
1 5
71.4%
3 1
 
14.3%
2 1
 
14.3%
Space Separator
ValueCountFrequency (%)
6332
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27479
81.3%
Common 6339
 
18.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2241
 
8.2%
1837
 
6.7%
1197
 
4.4%
1177
 
4.3%
1138
 
4.1%
1106
 
4.0%
1064
 
3.9%
874
 
3.2%
813
 
3.0%
771
 
2.8%
Other values (295) 15261
55.5%
Common
ValueCountFrequency (%)
6332
99.9%
1 5
 
0.1%
3 1
 
< 0.1%
2 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27479
81.3%
ASCII 6339
 
18.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6332
99.9%
1 5
 
0.1%
3 1
 
< 0.1%
2 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
2241
 
8.2%
1837
 
6.7%
1197
 
4.4%
1177
 
4.3%
1138
 
4.1%
1106
 
4.0%
1064
 
3.9%
874
 
3.2%
813
 
3.0%
771
 
2.8%
Other values (295) 15261
55.5%

전화번호
Text

MISSING 

Distinct990
Distinct (%)96.1%
Missing1230
Missing (%)54.4%
Memory size17.8 KiB
2024-04-18T05:18:10.767710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length8
Mean length9.8805825
Min length3

Characters and Unicode

Total characters10177
Distinct characters16
Distinct categories6 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique952 ?
Unique (%)92.4%

Sample

1st row032-933-7998
2nd row681-2406
3rd row041-934-4467
4th row772-6516
5th row031)211-5390
ValueCountFrequency (%)
055)964-0601 3
 
0.3%
671-3079 3
 
0.3%
041-867-5919 2
 
0.2%
323-1503 2
 
0.2%
041-864-0448 2
 
0.2%
055-271-9200 2
 
0.2%
061-862-0277 2
 
0.2%
041-864-5800 2
 
0.2%
041-867-7778 2
 
0.2%
863-2377 2
 
0.2%
Other values (980) 1008
97.9%
2024-04-18T05:18:11.093804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1470
14.4%
3 1323
13.0%
5 1089
10.7%
0 1062
10.4%
2 913
9.0%
4 898
8.8%
6 819
8.0%
1 783
7.7%
7 709
7.0%
8 614
6.0%
Other values (6) 497
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8676
85.3%
Dash Punctuation 1470
 
14.4%
Close Punctuation 21
 
0.2%
Open Punctuation 6
 
0.1%
Math Symbol 2
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 1323
15.2%
5 1089
12.6%
0 1062
12.2%
2 913
10.5%
4 898
10.4%
6 819
9.4%
1 783
9.0%
7 709
8.2%
8 614
7.1%
9 466
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 1
50.0%
. 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 1470
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 10177
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1470
14.4%
3 1323
13.0%
5 1089
10.7%
0 1062
10.4%
2 913
9.0%
4 898
8.8%
6 819
8.0%
1 783
7.7%
7 709
7.0%
8 614
6.0%
Other values (6) 497
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10177
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1470
14.4%
3 1323
13.0%
5 1089
10.7%
0 1062
10.4%
2 913
9.0%
4 898
8.8%
6 819
8.0%
1 783
7.7%
7 709
7.0%
8 614
6.0%
Other values (6) 497
 
4.9%

Missing values

2024-04-18T05:18:09.240722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-18T05:18:09.304437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

이름주소전화번호
0부안가축인공수정소전라북도 부안군 주산면 사산리<NA>
1강화가축인공수정소인천광역시 강화군 강화읍 남산리<NA>
2인천가축인공수정소인천광역시 남동구 만수동<NA>
3광역가축인공수정소인천광역시 강화군 길상면 온수리032-933-7998
4중앙가축인공수정소경기도 동두천시 생연동<NA>
5조문재가축인공수정소경기도 동두천시 생연동<NA>
6해동가축인공수정소경상남도 밀양시 무안면 무안리<NA>
7고금가축인공수정소전라남도 완도군 고금면 회룡리<NA>
8대원가축인공수정소경기도 평택시 비전동<NA>
9청북 가축인공수정소경기도 평택시 청북면 삼계리<NA>
이름주소전화번호
2250원동물병원가축인공수정소울산광역시 울주군 언양읍 동부리052-264-7872
2251호계막곡가축인공수정소경상북도 문경시 흥덕동<NA>
2252의령 가축인공수정소경상남도 의령군 의령읍 동동리573-5727
2253송동물병원경상남도 의령군 부림면 신반리055-574-5656
2254지정가축인공수정소경상남도 의령군 지정면 봉곡리572-5140
2255궁유 가축인공수정소경상남도 의령군 정곡면 중교리572-1197
2256제일동물병원병설가축인공수정소경상남도 의령군 부림면 신반리055-573-1617
2257대산전라북도 고창군 대산면 매산리<NA>
2258흥덕전라북도 고창군 흥덕면 흥덕리563-4974
2259현대전라북도 고창군 신림면 세곡리563-3163

Duplicate rows

Most frequently occurring

이름주소전화번호# duplicates
0ㄴㄴ경기도 남양주시 가운동<NA>5
1강릉가축인공수정소강원도 강릉시 교동<NA>2
2김의수가축인공수정소충청남도 연기군 조치원읍 정리041-864-04482
3나주종합동물병원전라남도 나주시 성북동<NA>2
4다인가축인공수정소경상북도 의성군 다인면 서릉리<NA>2
5대평가축인공수정소충청남도 연기군 금남면 용포리866-66832
6동강가축인공수정소전라남도 고흥군 동강면 유둔리<NA>2
7동송가축인공수정소강원도 철원군 동송읍 이평리<NA>2
8동원가축인공수정소경기도 양평군 용문면 다문리<NA>2
9둔내가축인공수정소강원도 횡성군 둔내면 둔방내리<NA>2