Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 796
Duplicate rows (%): 8.0%
Total size in memory: 546.9 KiB
Average record size in memory: 56.0 B
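The average record size and the duplicate percentage both follow from the other figures in this table. A quick check of that arithmetic, assuming 1 KiB = 1024 bytes (the variable names are illustrative and not part of the report):

```python
# Figures reported in the dataset statistics.
total_size_kib = 546.9
n_observations = 10000
n_duplicate_rows = 796

# Assumption: KiB means 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations
duplicate_pct = 100 * n_duplicate_rows / n_observations

print(f"{avg_record_bytes:.1f} B")   # average record size
print(f"{duplicate_pct:.1f}%")       # duplicate rows (%)
```

Both results round to the values shown in the table (56.0 B and 8.0%).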

Variable types

Categorical: 3
Text: 2
DateTime: 1

Dataset

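Description: Data on the discovery locations of animals listed on the Daejeon Metropolitan City Animal Protection Center's abandoned-animal notice board (2017-2019).
Author: Daejeon Metropolitan City (대전광역시)
URL: https://www.data.go.kr/data/15065130/fileData.do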

Alerts

시도 has constant value "대전광역시" (Constant)
Dataset has 796 (8.0%) duplicate rows (Duplicates)
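How a duplicates figure like this is read depends on the counting convention. The sketch below uses only the standard library and made-up rows to show two common conventions: the number of distinct rows that occur more than once, and the number of surplus copies. Which convention the report applies is not stated here, so the function name and the toy data are purely illustrative.

```python
from collections import Counter

def duplicate_summary(rows):
    """Return (distinct duplicated rows, surplus copies) for a list of row tuples."""
    counts = Counter(rows)
    duplicated_groups = sum(1 for c in counts.values() if c > 1)
    surplus_copies = sum(c - 1 for c in counts.values() if c > 1)
    return duplicated_groups, surplus_copies

# Toy rows (hypothetical, not taken from the dataset).
rows = [
    ("cat", "A", "2018-01-01"),
    ("cat", "A", "2018-01-01"),
    ("cat", "A", "2018-01-01"),
    ("dog", "B", "2018-01-02"),
    ("dog", "B", "2018-01-02"),
    ("dog", "C", "2018-01-03"),
]
groups, surplus = duplicate_summary(rows)
print(groups, surplus)
```

Here two distinct rows are duplicated, and removing three surplus copies would leave every row unique.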

Reproduction

Analysis started: 2023-12-12 12:13:52.047700
Analysis finished: 2023-12-12 12:13:53.115132
Duration: 1.07 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
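The reported duration can be checked from the two timestamps. A small sketch with the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 12:13:52.047700", FMT)
finished = datetime.strptime("2023-12-12 12:13:53.115132", FMT)

# Elapsed wall-clock time between the two timestamps, in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```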

Variables

종류
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
(not captured): 5837
고양이: 4017
기타: 146

Length

Max length: 3
Median length: 1
Mean length: 1.818
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: (not captured)
2nd row: (not captured)
3rd row: (not captured)
4th row: (not captured)
5th row: 고양이

Common Values

Value | Count | Frequency (%)
(not captured) | 5837 | 58.4%
고양이 | 4017 | 40.2%
기타 | 146 | 1.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; same counts and frequencies as the Common Values table]

품종
Text

Distinct: 144
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 11
Median length: 10
Mean length: 4.1199
Min length: 1

Characters and Unicode

Total characters: 41199
Distinct characters: 168
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
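As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point (for example `Lo` for Other Letter and `Zs` for Space Separator). The sketch below tallies categories for one breed string from this report; it does not cover scripts or blocks, which `unicodedata` does not provide, and it is not the profiler's own implementation.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of text by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# "진도 믹스" appears as a breed value in this report.
counts = category_counts("진도 믹스")
print(counts)
```

The four Hangul syllables fall under `Lo` and the single space under `Zs`, matching the two categories listed for this variable.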

Unique

Unique: 36
Unique (%): 0.4%

Sample

1st row: 푸들
2nd row: 말티즈
3rd row: 말티즈
4th row: 리트리버
5th row: 코리안숏헤어

Value | Count | Frequency (%)
코리안숏헤어 | 3635 | 35.6%
믹스 | 1665 | 16.3%
푸들 | 865 | 8.5%
말티즈 | 832 | 8.1%
진도 | 470 | 4.6%
포메라이언 | 403 | 3.9%
시츄 | 239 | 2.3%
스피츠 | 223 | 2.2%
치와와 | 177 | 1.7%
요크셔테리어 | 169 | 1.7%
Other values (102) | 1542 | 15.1%
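The counts in this table sum to 10220, more than the 10000 observations, which suggests that values were split into whitespace-separated words before counting. A minimal sketch of that kind of word tally, assuming plain whitespace splitting (the report's actual tokenizer is not shown, and the function name is illustrative):

```python
from collections import Counter

def word_frequencies(values):
    """Split each value on whitespace and return (word, count, percent) rows."""
    words = [w for v in values for w in v.split()]
    total = len(words)
    return [(w, c, 100 * c / total) for w, c in Counter(words).most_common()]

# A few breed strings that appear in this report.
values = ["푸들", "말티즈", "말티즈", "진도 믹스", "믹스"]
rows = word_frequencies(values)
for word, count, pct in rows:
    print(word, count, f"{pct:.1f}%")
```

Under this convention "진도 믹스" contributes one count each to 진도 and 믹스, so percentages are taken over the number of words rather than the number of rows.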

Most occurring characters

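Value | Count | Frequency (%)
(not captured) | 4254 | 10.3%
(not captured) | 3901 | 9.5%
(not captured) | 3841 | 9.3%
(not captured) | 3832 | 9.3%
(not captured) | 3658 | 8.9%
(not captured) | 3658 | 8.9%
(not captured) | 2221 | 5.4%
(not captured) | 1667 | 4.0%
(not captured) | 869 | 2.1%
(not captured) | 869 | 2.1%
Other values (158) | 12429 | 30.2%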

Most occurring categories

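Value | Count | Frequency (%)
Other Letter | 40978 | 99.5%
Space Separator | 221 | 0.5%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 4254 | 10.4%
(not captured) | 3901 | 9.5%
(not captured) | 3841 | 9.4%
(not captured) | 3832 | 9.4%
(not captured) | 3658 | 8.9%
(not captured) | 3658 | 8.9%
(not captured) | 2221 | 5.4%
(not captured) | 1667 | 4.1%
(not captured) | 869 | 2.1%
(not captured) | 869 | 2.1%
Other values (157) | 12208 | 29.8%

Space Separator
Value | Count | Frequency (%)
(space) | 221 | 100.0%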

Most occurring scripts

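Value | Count | Frequency (%)
Hangul | 40978 | 99.5%
Common | 221 | 0.5%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not captured) | 4254 | 10.4%
(not captured) | 3901 | 9.5%
(not captured) | 3841 | 9.4%
(not captured) | 3832 | 9.4%
(not captured) | 3658 | 8.9%
(not captured) | 3658 | 8.9%
(not captured) | 2221 | 5.4%
(not captured) | 1667 | 4.1%
(not captured) | 869 | 2.1%
(not captured) | 869 | 2.1%
Other values (157) | 12208 | 29.8%

Common
Value | Count | Frequency (%)
(space) | 221 | 100.0%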

Most occurring blocks

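Value | Count | Frequency (%)
Hangul | 40978 | 99.5%
ASCII | 221 | 0.5%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not captured) | 4254 | 10.4%
(not captured) | 3901 | 9.5%
(not captured) | 3841 | 9.4%
(not captured) | 3832 | 9.4%
(not captured) | 3658 | 8.9%
(not captured) | 3658 | 8.9%
(not captured) | 2221 | 5.4%
(not captured) | 1667 | 4.1%
(not captured) | 869 | 2.1%
(not captured) | 869 | 2.1%
Other values (157) | 12208 | 29.8%

ASCII
Value | Count | Frequency (%)
(space) | 221 | 100.0%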

시도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
대전광역시: 10000

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대전광역시
2nd row: 대전광역시
3rd row: 대전광역시
4th row: 대전광역시
5th row: 대전광역시

Common Values

Value | Count | Frequency (%)
대전광역시 | 10000 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; same counts and frequencies as the Common Values table]

시군구
Categorical

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
서구: 2971
유성구: 2333
동구: 1718
중구: 1636
대덕구: 1342

Length

Max length: 3
Median length: 2
Mean length: 2.3675
Min length: 2
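These length statistics can be recomputed from the value counts reported for 시군구 in this section. The sketch below counts length in Unicode code points, which is an assumption about how the report measures string length:

```python
from statistics import mean, median

# Value counts reported for 시군구 in this section.
value_counts = {"서구": 2971, "유성구": 2333, "동구": 1718, "중구": 1636, "대덕구": 1342}

# Expand to one string length per observation.
lengths = [len(value) for value, n in value_counts.items() for _ in range(n)]

print(max(lengths), median(lengths), mean(lengths), min(lengths))
```

With 6325 two-character values and 3675 three-character values, the mean is (2 × 6325 + 3 × 3675) / 10000 = 2.3675.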

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 유성구
2nd row: 동구
3rd row: 동구
4th row: 중구
5th row: 서구

Common Values

Value | Count | Frequency (%)
서구 | 2971 | 29.7%
유성구 | 2333 | 23.3%
동구 | 1718 | 17.2%
중구 | 1636 | 16.4%
대덕구 | 1342 | 13.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; same counts and frequencies as the Common Values table]

발생장소
Text

Distinct: 7143
Distinct (%): 71.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 30
Median length: 25
Mean length: 11.7901
Min length: 3

Characters and Unicode

Total characters: 117901
Distinct characters: 661
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5730
Unique (%): 57.3%

Sample

1st row: 화산2교 밑 천변
2nd row: 자양동 84-85번지 주변
3rd row: 우송대 서캠퍼스 W5 주변
4th row: 대흥동 246-8번지 주변
5th row: 만년남로 3번길 5 지하

Value | Count | Frequency (%)
주변 | 4182 | 14.2%
인근 | 3689 | 12.5%
(not captured) | 1360 | 4.6%
갈마동 | 246 | 0.8%
중리동 | 183 | 0.6%
가양동 | 171 | 0.6%
오정동 | 157 | 0.5%
도마동 | 155 | 0.5%
유천동 | 154 | 0.5%
용전동 | 145 | 0.5%
Other values (6629) | 18976 | 64.5%

Most occurring characters

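Value | Count | Frequency (%)
(space) | 19419 | 16.5%
(not captured) | 7600 | 6.4%
(not captured) | 4763 | 4.0%
(not captured) | 4495 | 3.8%
(not captured) | 3849 | 3.3%
(not captured) | 3816 | 3.2%
1 | 3655 | 3.1%
- | 2753 | 2.3%
2 | 2334 | 2.0%
(not captured) | 2226 | 1.9%
Other values (651) | 62991 | 53.4%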

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 77282 | 65.5%
Space Separator | 19419 | 16.5%
Decimal Number | 16648 | 14.1%
Dash Punctuation | 2753 | 2.3%
Other Punctuation | 1445 | 1.2%
Uppercase Letter | 258 | 0.2%
Close Punctuation | 51 | < 0.1%
Open Punctuation | 40 | < 0.1%
Other Symbol | 3 | < 0.1%
Lowercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 7600 | 9.8%
(not captured) | 4763 | 6.2%
(not captured) | 4495 | 5.8%
(not captured) | 3849 | 5.0%
(not captured) | 3816 | 4.9%
(not captured) | 2226 | 2.9%
(not captured) | 1689 | 2.2%
(not captured) | 1658 | 2.1%
(not captured) | 1440 | 1.9%
(not captured) | 1114 | 1.4%
Other values (607) | 44632 | 57.8%

Uppercase Letter
Value | Count | Frequency (%)
C | 40 | 15.5%
S | 34 | 13.2%
K | 32 | 12.4%
I | 25 | 9.7%
T | 25 | 9.7%
G | 24 | 9.3%
B | 14 | 5.4%
A | 9 | 3.5%
M | 8 | 3.1%
J | 7 | 2.7%
Other values (13) | 40 | 15.5%

Decimal Number
Value | Count | Frequency (%)
1 | 3655 | 22.0%
2 | 2334 | 14.0%
3 | 1974 | 11.9%
4 | 1592 | 9.6%
5 | 1356 | 8.1%
0 | 1315 | 7.9%
6 | 1270 | 7.6%
8 | 1080 | 6.5%
7 | 1074 | 6.5%
9 | 998 | 6.0%

Other Punctuation
Value | Count | Frequency (%)
@ | 1431 | 99.0%
: | 6 | 0.4%
& | 5 | 0.3%
, | 3 | 0.2%

Lowercase Letter
Value | Count | Frequency (%)
k | 1 | 50.0%
s | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 19419 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2753 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 51 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 40 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not captured) | 3 | 100.0%

Most occurring scripts

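Value | Count | Frequency (%)
Hangul | 77285 | 65.6%
Common | 40356 | 34.2%
Latin | 260 | 0.2%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not captured) | 7600 | 9.8%
(not captured) | 4763 | 6.2%
(not captured) | 4495 | 5.8%
(not captured) | 3849 | 5.0%
(not captured) | 3816 | 4.9%
(not captured) | 2226 | 2.9%
(not captured) | 1689 | 2.2%
(not captured) | 1658 | 2.1%
(not captured) | 1440 | 1.9%
(not captured) | 1114 | 1.4%
Other values (608) | 44635 | 57.8%

Latin
Value | Count | Frequency (%)
C | 40 | 15.4%
S | 34 | 13.1%
K | 32 | 12.3%
I | 25 | 9.6%
T | 25 | 9.6%
G | 24 | 9.2%
B | 14 | 5.4%
A | 9 | 3.5%
M | 8 | 3.1%
J | 7 | 2.7%
Other values (15) | 42 | 16.2%

Common
Value | Count | Frequency (%)
(space) | 19419 | 48.1%
1 | 3655 | 9.1%
- | 2753 | 6.8%
2 | 2334 | 5.8%
3 | 1974 | 4.9%
4 | 1592 | 3.9%
@ | 1431 | 3.5%
5 | 1356 | 3.4%
0 | 1315 | 3.3%
6 | 1270 | 3.1%
Other values (8) | 3257 | 8.1%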

Most occurring blocks

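Value | Count | Frequency (%)
Hangul | 77282 | 65.5%
ASCII | 40616 | 34.4%
None | 3 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 19419 | 47.8%
1 | 3655 | 9.0%
- | 2753 | 6.8%
2 | 2334 | 5.7%
3 | 1974 | 4.9%
4 | 1592 | 3.9%
@ | 1431 | 3.5%
5 | 1356 | 3.3%
0 | 1315 | 3.2%
6 | 1270 | 3.1%
Other values (33) | 3517 | 8.7%

Hangul
Value | Count | Frequency (%)
(not captured) | 7600 | 9.8%
(not captured) | 4763 | 6.2%
(not captured) | 4495 | 5.8%
(not captured) | 3849 | 5.0%
(not captured) | 3816 | 4.9%
(not captured) | 2226 | 2.9%
(not captured) | 1689 | 2.2%
(not captured) | 1658 | 2.1%
(not captured) | 1440 | 1.9%
(not captured) | 1114 | 1.4%
Other values (607) | 44632 | 57.8%

None
Value | Count | Frequency (%)
(not captured) | 3 | 100.0%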

구조일자
DateTime

Distinct: 1090
Distinct (%): 10.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2017-01-01 00:00:00
Maximum: 2019-12-31 00:00:00

[Figure: Histogram with fixed size bins (bins=50)]
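To put the distinct count and the bin setting in context, the sketch below works out how many calendar days lie between the reported minimum and maximum, what share of them appear in the data, and how wide each of 50 bins would be. It assumes equal-width bins spanning exactly the minimum to the maximum, which the report does not spell out.

```python
from datetime import date

first_day = date(2017, 1, 1)    # reported Minimum
last_day = date(2019, 12, 31)   # reported Maximum
distinct_dates = 1090           # reported Distinct
n_bins = 50                     # from the histogram caption

calendar_days = (last_day - first_day).days + 1
coverage = distinct_dates / calendar_days
# Assumption: bins divide the min-to-max span into equal widths.
bin_width_days = (last_day - first_day).days / n_bins

print(calendar_days, f"{coverage:.1%}", f"{bin_width_days:.2f} days per bin")
```

Under these assumptions, 1090 of the 1095 calendar days in the range occur at least once, and each bin covers roughly three weeks.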

Correlations

[Figure: correlation plot]
(variable) | 종류 | 시군구
종류 | 1.000 | 0.078
시군구 | 0.078 | 1.000

[Figure: correlation plot]
(variable) | 종류 | 시군구
종류 | 1.000 | 0.058
시군구 | 0.058 | 1.000

[Figure: correlation plot]
(variable) | 종류 | 시군구
종류 | 1.000 | 0.058
시군구 | 0.058 | 1.000
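The report does not say which coefficient each matrix shows. One measure often used for the association between two categorical variables is Cramér's V. The sketch below implements its plain (uncorrected) form from the chi-squared statistic using only the standard library. It is an illustration on toy data under that assumption, not a reproduction of the values above, and `cramers_v` is a hypothetical helper name.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long sequences of categories."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    x_counts = Counter(xs)
    y_counts = Counter(ys)
    # Chi-squared statistic over every cell of the contingency table.
    chi2 = 0.0
    for x, nx in x_counts.items():
        for y, ny in y_counts.items():
            expected = nx * ny / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_counts), len(y_counts)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Toy data: perfectly associated, then independent.
perfect = cramers_v(["a", "a", "b", "b"], ["u", "u", "v", "v"])
independent = cramers_v(["a", "a", "b", "b"], ["u", "v", "u", "v"])
print(perfect, independent)
```

Values near 0, like those reported for 종류 and 시군구, indicate a weak association between the two variables.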

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

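(index) | 종류 | 품종 | 시도 | 시군구 | 발생장소 | 구조일자
4249 | (not captured) | 푸들 | 대전광역시 | 유성구 | 화산2교 밑 천변 | 2017-10-25
9296 | (not captured) | 말티즈 | 대전광역시 | 동구 | 자양동 84-85번지 주변 | 2018-10-01
3407 | (not captured) | 말티즈 | 대전광역시 | 동구 | 우송대 서캠퍼스 W5 주변 | 2017-09-05
10346 | (not captured) | 리트리버 | 대전광역시 | 중구 | 대흥동 246-8번지 주변 | 2018-12-19
1266 | 고양이 | 코리안숏헤어 | 대전광역시 | 서구 | 만년남로 3번길 5 지하 | 2017-05-11
3998 | 고양이 | 코리안숏헤어 | 대전광역시 | 중구 | 용두동 용두@ 입구 주변 | 2017-10-10
14547 | 고양이 | 코리안숏헤어 | 대전광역시 | 동구 | 대전대학교 정문 인근 | 2019-10-23
9446 | (not captured) | 포메라이언 | 대전광역시 | 중구 | 유천동 301-41번지 주변 | 2018-10-13
6256 | (not captured) | 믹스 | 대전광역시 | 유성구 | 덕명네거리 인근 | 2018-04-21
9909 | 고양이 | 코리안숏헤어 | 대전광역시 | 유성구 | 화암동 155-47 내 | 2018-11-04

(index) | 종류 | 품종 | 시도 | 시군구 | 발생장소 | 구조일자
4529 | 고양이 | 코리안숏헤어 | 대전광역시 | 서구 | 월평동 637 내 | 2017-11-10
11500 | 고양이 | 코리안숏헤어 | 대전광역시 | 유성구 | 궁동 482-4 포커스타운 내 | 2019-04-09
3443 | (not captured) | 닥스훈트 | 대전광역시 | 중구 | 선화동 다연빌라 주변 | 2017-09-09
10812 | (not captured) | 믹스 | 대전광역시 | 유성구 | 송강동 한솔@ 101 인근 | 2019-01-27
9513 | (not captured) | 요크셔테리어 | 대전광역시 | 유성구 | 관평동 하모니동물병원 | 2018-10-17
11631 | 고양이 | 코리안숏헤어 | 대전광역시 | 유성구 | 봉명동 607-3 인근 | 2019-04-17
1674 | 고양이 | 코리안숏헤어 | 대전광역시 | 유성구 | 봉명동 센트럴시티@ 지하 | 2017-05-31
14762 | 고양이 | 코리안숏헤어 | 대전광역시 | 유성구 | 도룡동 스마트시티@ 213 인근 | 2019-11-04
13047 | 고양이 | 페르시안 | 대전광역시 | 서구 | 계백로 1385 신원상가 인근 | 2019-07-15
6469 | 고양이 | 코리안숏헤어 | 대전광역시 | 유성구 | 신성로 120길 12 지하 | 2018-05-03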

Duplicate rows

Most frequently occurring

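(index) | 종류 | 품종 | 시도 | 시군구 | 발생장소 | 구조일자 | # duplicates
33 | (not captured) | 말티즈 | 대전광역시 | 유성구 | 중세동교차로 인근 | 2018-08-08 | 9
472 | 고양이 | 코리안숏헤어 | 대전광역시 | 동구 | 판암동 418-1번지 주변 | 2017-05-12 | 9
794 | 기타 | 햄스터 | 대전광역시 | 유성구 | 원신흥동 인스빌리베라@ 내 | 2019-10-28 | 8
198 | (not captured) | 진도 믹스 | 대전광역시 | 서구 | 우명동 144-4 인근 | 2017-08-02 | 7
243 | (not captured) | 푸들 | 대전광역시 | 동구 | 모암로 168번길 주변 | 2018-02-27 | 7
496 | 고양이 | 코리안숏헤어 | 대전광역시 | 서구 | 갈마동 392-10번지 내 | 2017-04-23 | 7
620 | 고양이 | 코리안숏헤어 | 대전광역시 | 유성구 | 반석마을7단지 내 | 2018-07-21 | 7
649 | 고양이 | 코리안숏헤어 | 대전광역시 | 유성구 | 장대동 269-11 공원 내 | 2018-09-08 | 7
25 | (not captured) | 말티즈 | 대전광역시 | 유성구 | 둔곡교차로 인근 | 2018-06-17 | 6
42 | (not captured) | 믹스 | 대전광역시 | 대덕구 | 신대동 448-1번지 주변 | 2018-12-22 | 6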