Overview

Dataset statistics

Number of variables6
Number of observations256
Missing cells123
Missing cells (%)8.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.4 KiB
Average record size in memory49.5 B

Variable types

Numeric1
Text3
Categorical2

Dataset

Description인천광역시 서구에 위치한 동물관련업 현황에 관한 데이터셋입니다. 인천광역시 서구에 위치한 동물관련업 현황의 사업장명칭, 영업내용, 소재지 주소, 전화번호에 관한 정보를 포함하고 있습니다.
Author인천광역시 서구
URLhttps://www.data.go.kr/data/15090796/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 영업내용High correlation
영업내용 is highly overall correlated with 연번High correlation
전화번호 has 123 (48.0%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 03:42:29.887995
Analysis finished2023-12-12 03:42:30.616892
Duration0.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct256
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean128.5
Minimum1
Maximum256
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2023-12-12T12:42:30.707531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.75
Q164.75
median128.5
Q3192.25
95-th percentile243.25
Maximum256
Range255
Interquartile range (IQR)127.5

Descriptive statistics

Standard deviation74.045031
Coefficient of variation (CV)0.57622592
Kurtosis-1.2
Mean128.5
Median Absolute Deviation (MAD)64
Skewness0
Sum32896
Variance5482.6667
MonotonicityStrictly increasing
2023-12-12T12:42:30.886122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
130 1
 
0.4%
164 1
 
0.4%
165 1
 
0.4%
166 1
 
0.4%
167 1
 
0.4%
168 1
 
0.4%
169 1
 
0.4%
170 1
 
0.4%
171 1
 
0.4%
Other values (246) 246
96.1%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
256 1
0.4%
255 1
0.4%
254 1
0.4%
253 1
0.4%
252 1
0.4%
251 1
0.4%
250 1
0.4%
249 1
0.4%
248 1
0.4%
247 1
0.4%
Distinct245
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T12:42:31.192705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length21
Mean length6.7265625
Min length2

Characters and Unicode

Total characters1722
Distinct characters284
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique235 ?
Unique (%)91.8%

Sample

1st row럭셔리하개
2nd row도그샵
3rd row애견용품 할인매장
4th row인천조류원
5th row25시파랑새동물병원
ValueCountFrequency (%)
동물병원 11
 
3.2%
약국 9
 
2.6%
24시 4
 
1.2%
고양이 4
 
1.2%
강아지분양 3
 
0.9%
3
 
0.9%
인천점 3
 
0.9%
청라점 3
 
0.9%
청라바다약국 3
 
0.9%
온누리 3
 
0.9%
Other values (275) 295
86.5%
2023-12-12T12:42:31.642963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
144
 
8.4%
144
 
8.4%
85
 
4.9%
62
 
3.6%
59
 
3.4%
56
 
3.3%
41
 
2.4%
33
 
1.9%
32
 
1.9%
31
 
1.8%
Other values (274) 1035
60.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1593
92.5%
Space Separator 85
 
4.9%
Decimal Number 20
 
1.2%
Uppercase Letter 12
 
0.7%
Open Punctuation 5
 
0.3%
Close Punctuation 5
 
0.3%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
144
 
9.0%
144
 
9.0%
62
 
3.9%
59
 
3.7%
56
 
3.5%
41
 
2.6%
33
 
2.1%
32
 
2.0%
31
 
1.9%
25
 
1.6%
Other values (255) 966
60.6%
Uppercase Letter
ValueCountFrequency (%)
K 3
25.0%
S 1
 
8.3%
J 1
 
8.3%
W 1
 
8.3%
O 1
 
8.3%
D 1
 
8.3%
G 1
 
8.3%
I 1
 
8.3%
P 1
 
8.3%
H 1
 
8.3%
Decimal Number
ValueCountFrequency (%)
2 5
25.0%
5 4
20.0%
3 4
20.0%
4 4
20.0%
6 3
15.0%
Space Separator
ValueCountFrequency (%)
85
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Other Punctuation
ValueCountFrequency (%)
& 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1593
92.5%
Common 117
 
6.8%
Latin 12
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
144
 
9.0%
144
 
9.0%
62
 
3.9%
59
 
3.7%
56
 
3.5%
41
 
2.6%
33
 
2.1%
32
 
2.0%
31
 
1.9%
25
 
1.6%
Other values (255) 966
60.6%
Latin
ValueCountFrequency (%)
K 3
25.0%
S 1
 
8.3%
J 1
 
8.3%
W 1
 
8.3%
O 1
 
8.3%
D 1
 
8.3%
G 1
 
8.3%
I 1
 
8.3%
P 1
 
8.3%
H 1
 
8.3%
Common
ValueCountFrequency (%)
85
72.6%
( 5
 
4.3%
) 5
 
4.3%
2 5
 
4.3%
5 4
 
3.4%
3 4
 
3.4%
4 4
 
3.4%
6 3
 
2.6%
& 2
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1593
92.5%
ASCII 129
 
7.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
144
 
9.0%
144
 
9.0%
62
 
3.9%
59
 
3.7%
56
 
3.5%
41
 
2.6%
33
 
2.1%
32
 
2.0%
31
 
1.9%
25
 
1.6%
Other values (255) 966
60.6%
ASCII
ValueCountFrequency (%)
85
65.9%
( 5
 
3.9%
) 5
 
3.9%
2 5
 
3.9%
5 4
 
3.1%
3 4
 
3.1%
4 4
 
3.1%
K 3
 
2.3%
6 3
 
2.3%
& 2
 
1.6%
Other values (9) 9
 
7.0%

영업내용
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
동물약국
144 
판매업
66 
동물병원
46 

Length

Max length4
Median length4
Mean length3.7421875
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row판매업
2nd row판매업
3rd row판매업
4th row판매업
5th row판매업

Common Values

ValueCountFrequency (%)
동물약국 144
56.2%
판매업 66
25.8%
동물병원 46
 
18.0%

Length

2023-12-12T12:42:31.805416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:42:31.931163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
동물약국 144
56.2%
판매업 66
25.8%
동물병원 46
 
18.0%
Distinct247
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T12:42:32.260566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length70
Median length46
Mean length32.632812
Min length20

Characters and Unicode

Total characters8354
Distinct characters255
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique238 ?
Unique (%)93.0%

Sample

1st row인천광역시 서구 승학로 457, 상가동 1층 101호 (검암동)
2nd row인천광역시 서구 승학로 283 (연희동)
3rd row인천광역시 서구 가정로 298 (석남동)
4th row인천광역시 서구 검단로609번길 3, 4층 (마전동)
5th row인천광역시 서구 완정로 182 (마전동)
ValueCountFrequency (%)
인천광역시 256
 
15.3%
서구 256
 
15.3%
청라동 47
 
2.8%
1층 42
 
2.5%
가정동 24
 
1.4%
가정로 24
 
1.4%
당하동 23
 
1.4%
가좌동 23
 
1.4%
마전동 22
 
1.3%
석남동 20
 
1.2%
Other values (486) 935
55.9%
2023-12-12T12:42:32.879928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1428
 
17.1%
1 372
 
4.5%
291
 
3.5%
276
 
3.3%
267
 
3.2%
264
 
3.2%
263
 
3.1%
260
 
3.1%
256
 
3.1%
256
 
3.1%
Other values (245) 4421
52.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4702
56.3%
Space Separator 1428
 
17.1%
Decimal Number 1374
 
16.4%
Open Punctuation 251
 
3.0%
Close Punctuation 251
 
3.0%
Other Punctuation 240
 
2.9%
Uppercase Letter 49
 
0.6%
Dash Punctuation 32
 
0.4%
Lowercase Letter 20
 
0.2%
Math Symbol 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
291
 
6.2%
276
 
5.9%
267
 
5.7%
264
 
5.6%
263
 
5.6%
260
 
5.5%
256
 
5.4%
256
 
5.4%
256
 
5.4%
155
 
3.3%
Other values (210) 2158
45.9%
Uppercase Letter
ValueCountFrequency (%)
B 14
28.6%
A 5
 
10.2%
S 5
 
10.2%
K 5
 
10.2%
E 4
 
8.2%
L 3
 
6.1%
I 3
 
6.1%
W 3
 
6.1%
V 3
 
6.1%
J 2
 
4.1%
Decimal Number
ValueCountFrequency (%)
1 372
27.1%
0 192
14.0%
2 177
12.9%
3 116
 
8.4%
5 101
 
7.4%
6 96
 
7.0%
8 90
 
6.6%
4 85
 
6.2%
7 79
 
5.7%
9 66
 
4.8%
Lowercase Letter
ValueCountFrequency (%)
e 6
30.0%
d 3
15.0%
a 3
15.0%
r 3
15.0%
s 3
15.0%
c 1
 
5.0%
b 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 237
98.8%
' 3
 
1.2%
Space Separator
ValueCountFrequency (%)
1428
100.0%
Open Punctuation
ValueCountFrequency (%)
( 251
100.0%
Close Punctuation
ValueCountFrequency (%)
) 251
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4702
56.3%
Common 3583
42.9%
Latin 69
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
291
 
6.2%
276
 
5.9%
267
 
5.7%
264
 
5.6%
263
 
5.6%
260
 
5.5%
256
 
5.4%
256
 
5.4%
256
 
5.4%
155
 
3.3%
Other values (210) 2158
45.9%
Latin
ValueCountFrequency (%)
B 14
20.3%
e 6
 
8.7%
A 5
 
7.2%
S 5
 
7.2%
K 5
 
7.2%
E 4
 
5.8%
L 3
 
4.3%
I 3
 
4.3%
W 3
 
4.3%
V 3
 
4.3%
Other values (8) 18
26.1%
Common
ValueCountFrequency (%)
1428
39.9%
1 372
 
10.4%
( 251
 
7.0%
) 251
 
7.0%
, 237
 
6.6%
0 192
 
5.4%
2 177
 
4.9%
3 116
 
3.2%
5 101
 
2.8%
6 96
 
2.7%
Other values (7) 362
 
10.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4702
56.3%
ASCII 3652
43.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1428
39.1%
1 372
 
10.2%
( 251
 
6.9%
) 251
 
6.9%
, 237
 
6.5%
0 192
 
5.3%
2 177
 
4.8%
3 116
 
3.2%
5 101
 
2.8%
6 96
 
2.6%
Other values (25) 431
 
11.8%
Hangul
ValueCountFrequency (%)
291
 
6.2%
276
 
5.9%
267
 
5.7%
264
 
5.6%
263
 
5.6%
260
 
5.5%
256
 
5.4%
256
 
5.4%
256
 
5.4%
155
 
3.3%
Other values (210) 2158
45.9%

전화번호
Text

MISSING 

Distinct126
Distinct (%)94.7%
Missing123
Missing (%)48.0%
Memory size2.1 KiB
2023-12-12T12:42:33.232355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.052632
Min length12

Characters and Unicode

Total characters1603
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique119 ?
Unique (%)89.5%

Sample

1st row032-567-2575
2nd row032-561-7508
3rd row032-561-9004
4th row070-4155-1119
5th row032-568-6333
ValueCountFrequency (%)
032-562-4001 2
 
1.5%
032-565-8270 2
 
1.5%
032-561-7508 2
 
1.5%
032-568-5538 2
 
1.5%
032-567-2575 2
 
1.5%
032-566-0075 2
 
1.5%
032-565-5250 2
 
1.5%
032-563-8875 1
 
0.8%
032-565-0416 1
 
0.8%
070-7670-1009 1
 
0.8%
Other values (116) 116
87.2%
2023-12-12T12:42:33.795462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 266
16.6%
0 235
14.7%
5 210
13.1%
3 187
11.7%
2 187
11.7%
6 144
9.0%
7 123
7.7%
1 81
 
5.1%
8 71
 
4.4%
9 52
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1337
83.4%
Dash Punctuation 266
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 235
17.6%
5 210
15.7%
3 187
14.0%
2 187
14.0%
6 144
10.8%
7 123
9.2%
1 81
 
6.1%
8 71
 
5.3%
9 52
 
3.9%
4 47
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 266
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1603
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 266
16.6%
0 235
14.7%
5 210
13.1%
3 187
11.7%
2 187
11.7%
6 144
9.0%
7 123
7.7%
1 81
 
5.1%
8 71
 
4.4%
9 52
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1603
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 266
16.6%
0 235
14.7%
5 210
13.1%
3 187
11.7%
2 187
11.7%
6 144
9.0%
7 123
7.7%
1 81
 
5.1%
8 71
 
4.4%
9 52
 
3.2%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-10-17
256 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-10-17
2nd row2023-10-17
3rd row2023-10-17
4th row2023-10-17
5th row2023-10-17

Common Values

ValueCountFrequency (%)
2023-10-17 256
100.0%

Length

2023-12-12T12:42:34.005742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:42:34.139997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-17 256
100.0%

Interactions

2023-12-12T12:42:30.289556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:42:34.230933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번영업내용
연번1.0000.944
영업내용0.9441.000
2023-12-12T12:42:34.358450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번영업내용
연번1.0000.920
영업내용0.9201.000

Missing values

2023-12-12T12:42:30.435511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:42:30.561348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업장명칭영업내용소재지 주소전화번호데이터기준일자
01럭셔리하개판매업인천광역시 서구 승학로 457, 상가동 1층 101호 (검암동)<NA>2023-10-17
12도그샵판매업인천광역시 서구 승학로 283 (연희동)<NA>2023-10-17
23애견용품 할인매장판매업인천광역시 서구 가정로 298 (석남동)<NA>2023-10-17
34인천조류원판매업인천광역시 서구 검단로609번길 3, 4층 (마전동)<NA>2023-10-17
4525시파랑새동물병원판매업인천광역시 서구 완정로 182 (마전동)032-567-25752023-10-17
56행복한동물병원판매업인천광역시 서구 당하동 824-6 삼정프라자 101호032-561-75082023-10-17
67아지와 옹이판매업인천광역시 서구 승학로 446 (검암동)032-561-90042023-10-17
78메인독판매업인천광역시 서구 신석로111번길 15 (석남동)070-4155-11192023-10-17
89핫도그판매업인천광역시 서구 승학로 574 (검암동)032-568-63332023-10-17
910해피펫판매업인천광역시 서구 검단로501번안길 16 (마전동)032-561-74502023-10-17
연번사업장명칭영업내용소재지 주소전화번호데이터기준일자
246247이기쁨동물병원동물병원인천광역시 서구 가정로 437, 301동 B119호 (가정동, 루원시티 SK Leaders' VIEW)032-579-77782023-10-17
24724824시 더원 동물병원동물병원인천광역시 서구 발산로 41, 4층 (원당동)032-569-66772023-10-17
248249검단이담동물의료센터동물병원인천광역시 서구 이음1로 383, 세중시그니쳐 2층 201~3호 (원당동)032-710-24402023-10-17
249250아이비유동물병원동물병원인천광역시 서구 이음5로 65, 상가동 103호 (원당동, 검단 금호어울림 센트럴)<NA>2023-10-17
250251아람동물의료센터동물병원인천광역시 서구 발산로 6, 아인시티 주차타워 2층 201~3호 (원당동)032-866-11112023-10-17
251252우신&스퀘어 동물병원동물병원인천광역시 서구 담지로8번길 18, 1층 (청라동)<NA>2023-10-17
252253이영수 고양이 병원동물병원인천광역시 서구 봉오재3로 120, 301,302호 (가정동)032-205-02792023-10-17
253254루원 센트럴 동물의료센터동물병원인천광역시 서구 봉오대로 270, 302동 B121호 (가정동, 루원시티2차 SK Leaders' VIEW)032-579-75822023-10-17
254255W 인천심장동물병원동물병원인천광역시 서구 청라커낼로260번길 27, 청라한신더휴커낼웨이 2층 212,213호 (청라동)032-565-82702023-10-17
255256브리즈 동물병원동물병원인천광역시 서구 서곶로 821, 2층 (당하동)032-710-85592023-10-17