Overview

Dataset statistics

Number of variables: 6
Number of observations: 4364
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 209.0 KiB
Average record size in memory: 49.0 B

Variable types

Numeric: 1
DateTime: 1
Categorical: 2
Text: 2

Dataset

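Description: Data on the locations where abandoned animals were found (rescued) in Daejeon Metropolitan City, 2021-2023, with fields such as rescue date, species, and breed.
Author: Daejeon Metropolitan City (대전광역시)
URL: https://www.data.go.kr/data/15077510/fileData.do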

Alerts

번호 has unique values (Unique)

Reproduction

Analysis started: 2024-04-06 08:09:38.905262
Analysis finished: 2024-04-06 08:09:40.372048
Duration: 1.47 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
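
A report of this kind can be regenerated with ydata-profiling. The following is a minimal sketch rather than the exact invocation used here: the function name `build_report`, the `cp949` default encoding, and the report title are assumptions; only the `구조일자` column name and the library itself come from the report.

```python
def build_report(csv_path, html_path, encoding="cp949"):
    """Profile a CSV export of the dataset and write an HTML report.

    Third-party imports are deferred so this module still loads where
    pandas or ydata-profiling are not installed.
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the download is a CSV whose date column is named 구조일자.
    df = pd.read_csv(csv_path, encoding=encoding, parse_dates=["구조일자"])

    # The title is a hypothetical label, not taken from the original report.
    report = ProfileReport(df, title="Daejeon abandoned animal rescue locations")
    report.to_file(html_path)
    return report
```

If the file is UTF-8 rather than CP949, pass `encoding="utf-8"`; the numbers in the regenerated report should not depend on that choice.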

Variables

번호 (record number)
Real number (ℝ)

UNIQUE

Distinct: 4364
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2182.5
Minimum: 1
Maximum: 4364
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 38.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 219.15
Q1: 1091.75
Median: 2182.5
Q3: 3273.25
95-th percentile: 4145.85
Maximum: 4364
Range: 4363
Interquartile range (IQR): 2181.5
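
The fractional quantiles above are what linear interpolation between neighbouring ranks gives for the integers 1 to 4364. The sketch below assumes 번호 is exactly that sequence (consistent with 4364 distinct values, minimum 1, maximum 4364); the helper `quantile` is illustrative and not part of the profiling library.

```python
def quantile(sorted_values, q):
    """Quantile with linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


ids = list(range(1, 4365))  # assumed contents of the 번호 column

p05 = quantile(ids, 0.05)
q1 = quantile(ids, 0.25)
median = quantile(ids, 0.50)
q3 = quantile(ids, 0.75)
p95 = quantile(ids, 0.95)
iqr = q3 - q1

print(p05, q1, median, q3, p95, iqr)
```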

Descriptive statistics

Standard deviation: 1259.9226
Coefficient of variation (CV): 0.57728413
Kurtosis: -1.2
Mean: 2182.5
Median Absolute Deviation (MAD): 1091
Skewness: 0
Sum: 9524430
Variance: 1587405
Monotonicity: Strictly increasing
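
These figures can be cross-checked with the standard library, again assuming 번호 is the sequence 1 to 4364. The reported variance matches the sample variance (n - 1 denominator), which is what `statistics.variance` computes.

```python
import statistics

ids = list(range(1, 4365))  # assumed contents of the 번호 column

total = sum(ids)
mean = statistics.mean(ids)
variance = statistics.variance(ids)  # sample variance, n - 1 denominator
std = statistics.stdev(ids)
cv = std / mean                      # coefficient of variation
med = statistics.median(ids)
mad = statistics.median(abs(x - med) for x in ids)  # median absolute deviation
strictly_increasing = all(a < b for a, b in zip(ids, ids[1:]))

print(total, mean, variance, round(std, 4), round(cv, 8), mad, strictly_increasing)
```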
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2909 | 1 | < 0.1%
2915 | 1 | < 0.1%
2914 | 1 | < 0.1%
2913 | 1 | < 0.1%
2912 | 1 | < 0.1%
2911 | 1 | < 0.1%
2910 | 1 | < 0.1%
2908 | 1 | < 0.1%
3002 | 1 | < 0.1%
Other values (4354) | 4354 | 99.8%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
4364 | 1 | < 0.1%
4363 | 1 | < 0.1%
4362 | 1 | < 0.1%
4361 | 1 | < 0.1%
4360 | 1 | < 0.1%
4359 | 1 | < 0.1%
4358 | 1 | < 0.1%
4357 | 1 | < 0.1%
4356 | 1 | < 0.1%
4355 | 1 | < 0.1%

구조일자 (rescue date)
DateTime

Distinct: 847
Distinct (%): 19.4%
Missing: 0
Missing (%): 0.0%
Memory size: 34.2 KiB
Minimum: 2021-01-01 00:00:00
Maximum: 2023-12-30 00:00:00
Histogram with fixed size bins (bins=50)
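
The distinct-date count can be put in context with a little date arithmetic. This sketch uses only the minimum, maximum, distinct count, and observation count reported above; the derived ratios are illustrative and do not appear in the report.

```python
from datetime import date

first_day = date(2021, 1, 1)    # reported minimum
last_day = date(2023, 12, 30)   # reported maximum
distinct_dates = 847
observations = 4364

# Inclusive number of calendar days between the first and last rescue date.
calendar_days = (last_day - first_day).days + 1

coverage = distinct_dates / calendar_days          # share of days with a rescue
distinct_share = distinct_dates / observations     # the report's "Distinct (%)"
rescues_per_active_day = observations / distinct_dates

print(calendar_days, round(100 * coverage, 1), round(100 * distinct_share, 1))
```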

종류 (species)
Categorical

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 34.2 KiB

Length

Max length: 3
Median length: 1
Mean length: 1.4917507
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 고양이 (cat)
2nd row: 고양이 (cat)
3rd row: [value not preserved]
4th row: [value not preserved]
5th row: [value not preserved]

Common Values

Value | Count | Frequency (%)
[value not preserved] | 3219 | 73.8%
고양이 (cat) | 1001 | 22.9%
기타 (other) | 144 | 3.3%
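
The frequency percentages and the mean length follow directly from the three category counts. In this sketch the label of the largest category is a placeholder (`"?"`), because it does not survive in the text; its length of 1 is inferred from the reported minimum length, since the two visible labels have 3 and 2 characters.

```python
# (label, character length, count) for each category of 종류.
# "?" is a placeholder for the label that is missing from the text.
categories = [("?", 1, 3219), ("고양이", 3, 1001), ("기타", 2, 144)]

total = sum(count for _, _, count in categories)

# Share of each category in percent, rounded as in the report.
shares = {label: round(100 * count / total, 1) for label, _, count in categories}

# Count-weighted mean label length.
mean_length = sum(length * count for _, length, count in categories) / total

print(total, shares, mean_length)
```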

Length

Histogram of lengths of the category


품종 (breed)
Text

Distinct: 144
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 34.2 KiB

Length

Max length: 13
Median length: 2
Mean length: 3.4030706
Min length: 1

Characters and Unicode

Total characters: 14851
Distinct characters: 175
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 52
Unique (%): 1.2%

Sample

1st row: 코리안숏헤어 (Korean Shorthair)
2nd row: 코리안숏헤어 (Korean Shorthair)
3rd row: 믹스 (mixed breed)
4th row: 말티즈 (Maltese)
5th row: 사모예드 (Samoyed)

Value | Count | Frequency (%)
믹스 (mixed breed) | 1572 | 35.4%
코리안숏헤어 (Korean Shorthair) | 876 | 19.7%
말티즈 (Maltese) | 270 | 6.1%
푸들 (Poodle) | 254 | 5.7%
진도 (Jindo) | 252 | 5.7%
포메라이언 (Pomeranian) | 161 | 3.6%
시츄 (Shih Tzu) | 84 | 1.9%
리트리버 (Retriever) | 84 | 1.9%
치와와 (Chihuahua) | 68 | 1.5%
시바 (Shiba) | 66 | 1.5%
Other values (111) | 756 | 17.0%

Most occurring characters

ValueCountFrequency (%)
1793
 
12.1%
1603
 
10.8%
1185
 
8.0%
971
 
6.5%
941
 
6.3%
930
 
6.3%
880
 
5.9%
880
 
5.9%
294
 
2.0%
280
 
1.9%
Other values (165) 5094
34.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 14772 | 99.5%
Space Separator | 79 | 0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1793
 
12.1%
1603
 
10.9%
1185
 
8.0%
971
 
6.6%
941
 
6.4%
930
 
6.3%
880
 
6.0%
880
 
6.0%
294
 
2.0%
280
 
1.9%
Other values (164) 5015
33.9%
Space Separator
ValueCountFrequency (%)
79
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 14772 | 99.5%
Common | 79 | 0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1793
 
12.1%
1603
 
10.9%
1185
 
8.0%
971
 
6.6%
941
 
6.4%
930
 
6.3%
880
 
6.0%
880
 
6.0%
294
 
2.0%
280
 
1.9%
Other values (164) 5015
33.9%
Common
ValueCountFrequency (%)
79
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 14772 | 99.5%
ASCII | 79 | 0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1793
 
12.1%
1603
 
10.9%
1185
 
8.0%
971
 
6.6%
941
 
6.4%
930
 
6.3%
880
 
6.0%
880
 
6.0%
294
 
2.0%
280
 
1.9%
Other values (164) 5015
33.9%
ASCII
ValueCountFrequency (%)
79
100.0%

시군구 (district)
Categorical

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 34.2 KiB

Length

Max length: 3
Median length: 2
Mean length: 2.0002291
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 동구
2nd row: 대덕
3rd row: 유성
4th row: 중구
5th row: 서구

Common Values

Value | Count | Frequency (%)
서구 | 967 | 22.2%
유성 | 932 | 21.4%
중구 | 883 | 20.2%
동구 | 872 | 20.0%
대덕 | 709 | 16.2%
대덕구 | 1 | < 0.1%
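
The single 대덕구 record next to 709 대덕 records looks like a spelling variant of the same district (Daedeok-gu), which would make the true number of distinct districts 5 rather than 6. A minimal sketch of merging the two follows; treating them as the same district is an assumption, and the `ALIASES` mapping is hypothetical.

```python
from collections import Counter

# Counts as listed in the Common Values table for 시군구.
reported = Counter({"서구": 967, "유성": 932, "중구": 883,
                    "동구": 872, "대덕": 709, "대덕구": 1})

# Assumption: "대덕구" is a variant spelling of "대덕".
ALIASES = {"대덕구": "대덕"}

normalized = Counter()
for name, count in reported.items():
    normalized[ALIASES.get(name, name)] += count

print(normalized)
```

An explicit alias map is used instead of stripping a trailing 구, because 서구, 중구, and 동구 end in the same character as part of their short names.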

Length

Histogram of lengths of the category


발생장소 (location found)
Text

Distinct: 3329
Distinct (%): 76.3%
Missing: 0
Missing (%): 0.0%
Memory size: 34.2 KiB

Length

Max length: 30
Median length: 27
Mean length: 13.27429
Min length: 3

Characters and Unicode

Total characters: 57929
Distinct characters: 566
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
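
The category counts reported for this variable correspond to Unicode general categories, which Python exposes through `unicodedata.category`. The sketch below applies it to one of the variable's sample values; the helper name `category_counts` is illustrative. The two-letter codes map to the report's names as follows: `Lo` Other Letter, `Nd` Decimal Number, `Zs` Space Separator, `Pd` Dash Punctuation, `Po` Other Punctuation, `Ll` Lowercase Letter, `Lu` Uppercase Letter, `Ps` Open Punctuation, `Pe` Close Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of text by Unicode general category."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "법동 e편한세상@ 106동 주변"  # a sample value of 발생장소
counts = category_counts(sample)

print(counts)
```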

Unique

Unique: 2774
Unique (%): 63.6%

Sample

1st row: 성남동 17-9번지 인근
2nd row: 법동 e편한세상@ 106동 주변
3rd row: 도룡동 380-37번지 주변
4th row: 산성로23번길 주변
5th row: 도마동 172-6

Value | Count | Frequency (%)
주변 | 2380 | 17.3%
인근 | 1519 | 11.0%
부근 | 153 | 1.1%
갈마동 | 118 | 0.9%
가양동 | 92 | 0.7%
오정동 | 78 | 0.6%
도마동 | 70 | 0.5%
문화동 | 67 | 0.5%
중리동 | 64 | 0.5%
금고동 | 59 | 0.4%
Other values (3991) | 9166 | 66.6%
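
The three most frequent words (주변, 인근, 부근) all mean roughly "near" or "in the vicinity of" and appear as the last word of a location string. A minimal sketch of separating that trailing word from the place description, using the five sample rows; `split_location` and `PROXIMITY_WORDS` are hypothetical names, not part of the dataset or the profiling tool.

```python
from collections import Counter

PROXIMITY_WORDS = {"주변", "인근", "부근"}  # all roughly "near" / "vicinity"


def split_location(text):
    """Split a location string into (place, trailing proximity word or None)."""
    tokens = text.split()
    if tokens and tokens[-1] in PROXIMITY_WORDS:
        return " ".join(tokens[:-1]), tokens[-1]
    return " ".join(tokens), None


samples = [
    "성남동 17-9번지 인근",
    "법동 e편한세상@ 106동 주변",
    "도룡동 380-37번지 주변",
    "산성로23번길 주변",
    "도마동 172-6",
]

parsed = [split_location(s) for s in samples]
suffix_counts = Counter(suffix for _, suffix in parsed)

print(parsed)
print(suffix_counts)
```

Stripping the proximity word first would let the remaining text be grouped by neighbourhood (for example 갈마동 or 가양동) without the suffix dominating the word counts.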

Most occurring characters

ValueCountFrequency (%)
9402
 
16.2%
3732
 
6.4%
2656
 
4.6%
2542
 
4.4%
1 2070
 
3.6%
1829
 
3.2%
1720
 
3.0%
- 1363
 
2.4%
1308
 
2.3%
2 1291
 
2.2%
Other values (556) 30016
51.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 36717 | 63.4%
Decimal Number | 9902 | 17.1%
Space Separator | 9402 | 16.2%
Dash Punctuation | 1363 | 2.4%
Other Punctuation | 410 | 0.7%
Lowercase Letter | 72 | 0.1%
Uppercase Letter | 56 | 0.1%
Close Punctuation | 4 | < 0.1%
Open Punctuation | 3 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3732
 
10.2%
2656
 
7.2%
2542
 
6.9%
1829
 
5.0%
1720
 
4.7%
1308
 
3.6%
1181
 
3.2%
852
 
2.3%
807
 
2.2%
565
 
1.5%
Other values (506) 19525
53.2%
Lowercase Letter
Value | Count | Frequency (%)
c | 18 | 25.0%
i | 8 | 11.1%
e | 7 | 9.7%
k | 6 | 8.3%
a | 5 | 6.9%
o | 4 | 5.6%
t | 4 | 5.6%
b | 4 | 5.6%
u | 3 | 4.2%
n | 2 | 2.8%
Other values (8) | 11 | 15.3%
Uppercase Letter
Value | Count | Frequency (%)
C | 15 | 26.8%
I | 7 | 12.5%
T | 6 | 10.7%
K | 6 | 10.7%
A | 6 | 10.7%
U | 4 | 7.1%
G | 3 | 5.4%
S | 2 | 3.6%
N | 2 | 3.6%
H | 1 | 1.8%
Other values (4) | 4 | 7.1%
Decimal Number
Value | Count | Frequency (%)
1 | 2070 | 20.9%
2 | 1291 | 13.0%
3 | 1104 | 11.1%
4 | 975 | 9.8%
5 | 869 | 8.8%
6 | 841 | 8.5%
7 | 758 | 7.7%
0 | 701 | 7.1%
8 | 680 | 6.9%
9 | 613 | 6.2%
Other Punctuation
Value | Count | Frequency (%)
@ | 371 | 90.5%
, | 37 | 9.0%
: | 1 | 0.2%
& | 1 | 0.2%
Space Separator
Value | Count | Frequency (%)
(space) | 9402 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 1363 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 4 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 36717 | 63.4%
Common | 21084 | 36.4%
Latin | 128 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3732
 
10.2%
2656
 
7.2%
2542
 
6.9%
1829
 
5.0%
1720
 
4.7%
1308
 
3.6%
1181
 
3.2%
852
 
2.3%
807
 
2.2%
565
 
1.5%
Other values (506) 19525
53.2%
Latin
ValueCountFrequency (%)
c 18
14.1%
C 15
 
11.7%
i 8
 
6.2%
I 7
 
5.5%
e 7
 
5.5%
k 6
 
4.7%
T 6
 
4.7%
K 6
 
4.7%
A 6
 
4.7%
a 5
 
3.9%
Other values (22) 44
34.4%
Common
ValueCountFrequency (%)
9402
44.6%
1 2070
 
9.8%
- 1363
 
6.5%
2 1291
 
6.1%
3 1104
 
5.2%
4 975
 
4.6%
5 869
 
4.1%
6 841
 
4.0%
7 758
 
3.6%
0 701
 
3.3%
Other values (8) 1710
 
8.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 36717 | 63.4%
ASCII | 21212 | 36.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9402
44.3%
1 2070
 
9.8%
- 1363
 
6.4%
2 1291
 
6.1%
3 1104
 
5.2%
4 975
 
4.6%
5 869
 
4.1%
6 841
 
4.0%
7 758
 
3.6%
0 701
 
3.3%
Other values (40) 1838
 
8.7%
Hangul
ValueCountFrequency (%)
3732
 
10.2%
2656
 
7.2%
2542
 
6.9%
1829
 
5.0%
1720
 
4.7%
1308
 
3.6%
1181
 
3.2%
852
 
2.3%
807
 
2.2%
565
 
1.5%
Other values (506) 19525
53.2%


Correlations

Variable | 번호 | 종류 | 시군구
번호 | 1.000 | 0.226 | 0.109
종류 | 0.226 | 1.000 | 0.208
시군구 | 0.109 | 0.208 | 1.000

Variable | 시군구 | 종류
시군구 | 1.000 | 0.088
종류 | 0.088 | 1.000

Variable | 번호 | 종류 | 시군구
번호 | 1.000 | 0.138 | 0.057
종류 | 0.138 | 1.000 | 0.088
시군구 | 0.057 | 0.088 | 1.000
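
The text does not say which association measure each matrix uses. For two categorical variables such as 종류 and 시군구, one common choice is Cramér's V, which is derived from the chi-squared statistic of their contingency table. The sketch below shows the plain (not bias-corrected) form on hypothetical tables; whether any matrix above was computed this way is an assumption, and the function name `cramers_v` is illustrative.

```python
import math


def cramers_v(table):
    """Cramér's V for a contingency table given as a list of rows of counts.

    Assumes every row and every column contains at least one observation
    and that the table has at least two rows and two columns.
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared: squared deviation from the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Hypothetical tables, not taken from the dataset.
independent = [[10, 20], [20, 40]]   # columns proportional across rows
dependent = [[50, 0], [0, 50]]       # each row maps to exactly one column

v_independent = cramers_v(independent)
v_dependent = cramers_v(dependent)
print(v_independent, v_dependent)
```

Values near 0, such as the 0.088 reported for 시군구 and 종류, would indicate that the species mix hardly varies by district under a measure of this kind.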

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

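First rows

Index | 번호 | 구조일자 | 종류 | 품종 | 시군구 | 발생장소
0 | 1 | 2021-01-01 | 고양이 | 코리안숏헤어 | 동구 | 성남동 17-9번지 인근
1 | 2 | 2021-01-02 | 고양이 | 코리안숏헤어 | 대덕 | 법동 e편한세상@ 106동 주변
2 | 3 | 2021-01-02 | [value not preserved] | 믹스 | 유성 | 도룡동 380-37번지 주변
3 | 4 | 2021-01-02 | [value not preserved] | 말티즈 | 중구 | 산성로23번길 주변
4 | 5 | 2021-01-02 | [value not preserved] | 사모예드 | 서구 | 도마동 172-6
5 | 6 | 2021-01-03 | [value not preserved] | 말티즈 | 대덕 | 송촌동 선비마을 3단지@ 테니스장 주변
6 | 7 | 2021-01-03 | [value not preserved] | 말티즈 | 유성 | 갑동 만나식당 주변
7 | 8 | 2021-01-03 | [value not preserved] | 믹스 | 동구 | 대성동삼거리 인근
8 | 9 | 2021-01-04 | [value not preserved] | 진도 | 동구 | 삼성동 103-9 인근
9 | 10 | 2021-01-04 | [value not preserved] | 스피츠 | 서구 | 내동중학교 인근

Last rows

Index | 번호 | 구조일자 | 종류 | 품종 | 시군구 | 발생장소
4354 | 4355 | 2023-12-26 | [value not preserved] | 믹스 | 유성 | 지족로190번길 15 노은지구 1단지@ 주변
4355 | 4356 | 2023-12-26 | [value not preserved] | 믹스 | 대덕 | 중리동 연세의원 주변
4356 | 4357 | 2023-12-28 | 고양이 | 먼치킨 | 서구 | 월평동 632 인근
4357 | 4358 | 2023-12-28 | 고양이 | 스코티쉬폴드 | 동구 | 우암로296번길 72 인근
4358 | 4359 | 2023-12-28 | [value not preserved] | 푸들 | 대덕 | 오정동 행정복지센터 주변
4359 | 4360 | 2023-12-29 | [value not preserved] | 믹스 | 중구 | 문화동 서대전공원 주변
4360 | 4361 | 2023-12-29 | [value not preserved] | 푸들 | 중구 | 옥계동 17-12 은혜아파트 a동 주변
4361 | 4362 | 2023-12-29 | [value not preserved] | 푸들 | 중구 | 평촌로 111 주변
4362 | 4363 | 2023-12-30 | [value not preserved] | 울프독 | 대덕 | 덕암동 신탄진IC 주변
4363 | 4364 | 2023-12-30 | [value not preserved] | 믹스 | 동구 | 비룡동 519 인근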