Overview

Dataset statistics

Number of variables4
Number of observations5914
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory196.5 KiB
Average record size in memory34.0 B

Variable types

Numeric2
Text1
Categorical1

Dataset

Description서울특별시 마포구의 반려동물 이름 현황 데이터(2023)입니다. 동물이름, 동물 수 데이터를 제공합니다. 경제진흥과에서 관리하는 데이터입니다. ## LINK 미리보기 [![미리보기](http://curate.gimi9.com/linkview/www-data-go-kr-data-filedata-15042010?url=http%3A//www.mapo.go.kr/site/main/openData/view%3FdataId%3D43&version=d7)](https://www.data.go.kr/data/15042010/fileData.do)
Author서울특별시 마포구
URLhttps://www.data.go.kr/data/15042010/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 횟수High correlation
횟수 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
동물이름 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:18:20.025447
Analysis finished2023-12-12 05:18:21.386500
Duration1.36 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct5914
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2957.5
Minimum1
Maximum5914
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size52.1 KiB
2023-12-12T14:18:21.478105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile296.65
Q11479.25
median2957.5
Q34435.75
95-th percentile5618.35
Maximum5914
Range5913
Interquartile range (IQR)2956.5

Descriptive statistics

Standard deviation1707.3691
Coefficient of variation (CV)0.57730146
Kurtosis-1.2
Mean2957.5
Median Absolute Deviation (MAD)1478.5
Skewness0
Sum17490655
Variance2915109.2
MonotonicityStrictly increasing
2023-12-12T14:18:21.647366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
3941 1
 
< 0.1%
3950 1
 
< 0.1%
3949 1
 
< 0.1%
3948 1
 
< 0.1%
3947 1
 
< 0.1%
3946 1
 
< 0.1%
3945 1
 
< 0.1%
3944 1
 
< 0.1%
3943 1
 
< 0.1%
Other values (5904) 5904
99.8%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
5914 1
< 0.1%
5913 1
< 0.1%
5912 1
< 0.1%
5911 1
< 0.1%
5910 1
< 0.1%
5909 1
< 0.1%
5908 1
< 0.1%
5907 1
< 0.1%
5906 1
< 0.1%
5905 1
< 0.1%

동물이름
Text

UNIQUE 

Distinct5914
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size46.3 KiB
2023-12-12T14:18:22.041719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length2
Mean length2.4147785
Min length1

Characters and Unicode

Total characters14281
Distinct characters909
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5914 ?
Unique (%)100.0%

Sample

1st row코코
2nd row보리
3rd row초코
4th row콩이
5th row호두
ValueCountFrequency (%)
사랑 3
 
0.1%
코코 2
 
< 0.1%
제이크 2
 
< 0.1%
xiao 2
 
< 0.1%
깜비 2
 
< 0.1%
태민 2
 
< 0.1%
로마 2
 
< 0.1%
2
 
< 0.1%
란이 2
 
< 0.1%
2
 
< 0.1%
Other values (5844) 5916
99.6%
2023-12-12T14:18:22.958929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1167
 
8.2%
327
 
2.3%
196
 
1.4%
183
 
1.3%
171
 
1.2%
160
 
1.1%
129
 
0.9%
125
 
0.9%
122
 
0.9%
120
 
0.8%
Other values (899) 11581
81.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13192
92.4%
Lowercase Letter 554
 
3.9%
Uppercase Letter 340
 
2.4%
Space Separator 110
 
0.8%
Close Punctuation 27
 
0.2%
Open Punctuation 27
 
0.2%
Decimal Number 25
 
0.2%
Other Punctuation 4
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1167
 
8.8%
327
 
2.5%
196
 
1.5%
183
 
1.4%
171
 
1.3%
160
 
1.2%
129
 
1.0%
125
 
0.9%
122
 
0.9%
120
 
0.9%
Other values (838) 10492
79.5%
Lowercase Letter
ValueCountFrequency (%)
i 70
12.6%
e 61
11.0%
a 54
 
9.7%
o 52
 
9.4%
n 48
 
8.7%
l 32
 
5.8%
y 28
 
5.1%
r 22
 
4.0%
m 22
 
4.0%
s 19
 
3.4%
Other values (14) 146
26.4%
Uppercase Letter
ValueCountFrequency (%)
A 30
 
8.8%
B 30
 
8.8%
I 29
 
8.5%
M 29
 
8.5%
O 29
 
8.5%
L 17
 
5.0%
N 16
 
4.7%
J 16
 
4.7%
T 13
 
3.8%
C 13
 
3.8%
Other values (14) 118
34.7%
Decimal Number
ValueCountFrequency (%)
0 6
24.0%
2 5
20.0%
1 5
20.0%
9 3
12.0%
3 2
 
8.0%
4 2
 
8.0%
5 1
 
4.0%
7 1
 
4.0%
Space Separator
ValueCountFrequency (%)
110
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Other Punctuation
ValueCountFrequency (%)
. 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13192
92.4%
Latin 894
 
6.3%
Common 195
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1167
 
8.8%
327
 
2.5%
196
 
1.5%
183
 
1.4%
171
 
1.3%
160
 
1.2%
129
 
1.0%
125
 
0.9%
122
 
0.9%
120
 
0.9%
Other values (838) 10492
79.5%
Latin
ValueCountFrequency (%)
i 70
 
7.8%
e 61
 
6.8%
a 54
 
6.0%
o 52
 
5.8%
n 48
 
5.4%
l 32
 
3.6%
A 30
 
3.4%
B 30
 
3.4%
I 29
 
3.2%
M 29
 
3.2%
Other values (38) 459
51.3%
Common
ValueCountFrequency (%)
110
56.4%
) 27
 
13.8%
( 27
 
13.8%
0 6
 
3.1%
2 5
 
2.6%
1 5
 
2.6%
. 4
 
2.1%
9 3
 
1.5%
3 2
 
1.0%
4 2
 
1.0%
Other values (3) 4
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13192
92.4%
ASCII 1089
 
7.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1167
 
8.8%
327
 
2.5%
196
 
1.5%
183
 
1.4%
171
 
1.3%
160
 
1.2%
129
 
1.0%
125
 
0.9%
122
 
0.9%
120
 
0.9%
Other values (838) 10492
79.5%
ASCII
ValueCountFrequency (%)
110
 
10.1%
i 70
 
6.4%
e 61
 
5.6%
a 54
 
5.0%
o 52
 
4.8%
n 48
 
4.4%
l 32
 
2.9%
A 30
 
2.8%
B 30
 
2.8%
I 29
 
2.7%
Other values (51) 573
52.6%

횟수
Real number (ℝ)

HIGH CORRELATION 

Distinct86
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.411904
Minimum1
Maximum299
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size52.1 KiB
2023-12-12T14:18:23.148343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile11
Maximum299
Range298
Interquartile range (IQR)1

Descriptive statistics

Standard deviation10.842137
Coefficient of variation (CV)3.1777381
Kurtosis213.28207
Mean3.411904
Median Absolute Deviation (MAD)0
Skewness12.030631
Sum20178
Variance117.55194
MonotonicityDecreasing
2023-12-12T14:18:23.331620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 3937
66.6%
2 800
 
13.5%
3 306
 
5.2%
4 184
 
3.1%
5 114
 
1.9%
6 83
 
1.4%
7 63
 
1.1%
8 50
 
0.8%
9 35
 
0.6%
10 27
 
0.5%
Other values (76) 315
 
5.3%
ValueCountFrequency (%)
1 3937
66.6%
2 800
 
13.5%
3 306
 
5.2%
4 184
 
3.1%
5 114
 
1.9%
6 83
 
1.4%
7 63
 
1.1%
8 50
 
0.8%
9 35
 
0.6%
10 27
 
0.5%
ValueCountFrequency (%)
299 1
< 0.1%
258 1
< 0.1%
198 1
< 0.1%
165 1
< 0.1%
147 1
< 0.1%
137 1
< 0.1%
136 1
< 0.1%
131 1
< 0.1%
128 1
< 0.1%
123 1
< 0.1%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size46.3 KiB
2023-02-16
5914 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-02-16
2nd row2023-02-16
3rd row2023-02-16
4th row2023-02-16
5th row2023-02-16

Common Values

ValueCountFrequency (%)
2023-02-16 5914
100.0%

Length

2023-12-12T14:18:23.464388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:18:23.542132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-02-16 5914
100.0%

Interactions

2023-12-12T14:18:20.897369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:18:20.655302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:18:21.026989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:18:20.788360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:18:23.595869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번횟수
연번1.0000.285
횟수0.2851.000
2023-12-12T14:18:23.671080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번횟수
연번1.000-0.838
횟수-0.8381.000

Missing values

2023-12-12T14:18:21.177027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:18:21.331754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번동물이름횟수데이터기준일자
01코코2992023-02-16
12보리2582023-02-16
23초코1982023-02-16
34콩이1652023-02-16
45호두1472023-02-16
56토리1372023-02-16
67사랑이1362023-02-16
78쿠키1312023-02-16
89해피1282023-02-16
910별이1232023-02-16
연번동물이름횟수데이터기준일자
59045905히딩크12023-02-16
59055906히바12023-02-16
59065907히야12023-02-16
59075908히어로12023-02-16
59085909히찌12023-02-16
59095910히킨12023-02-16
59105911히포12023-02-16
59115912히피12023-02-16
59125913힌둥이12023-02-16
59135914힘찬12023-02-16