Overview

Dataset statistics

Number of variables6
Number of observations111
Missing cells13
Missing cells (%)2.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.4 KiB
Average record size in memory50.2 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description서대문구 다중이용시설 실내공기질 관리대상 현황 정보(2019.3.12기준)
Author서울특별시 서대문구
URLhttps://www.data.go.kr/data/15048877/fileData.do

Alerts

자치구 has constant value ""Constant
연번 is highly overall correlated with 시설군High correlation
시설군 is highly overall correlated with 연번High correlation
전화번호 has 13 (11.7%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:33:09.519785
Analysis finished2023-12-12 18:33:10.201685
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct111
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean56
Minimum1
Maximum111
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-13T03:33:10.297085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.5
Q128.5
median56
Q383.5
95-th percentile105.5
Maximum111
Range110
Interquartile range (IQR)55

Descriptive statistics

Standard deviation32.186954
Coefficient of variation (CV)0.57476703
Kurtosis-1.2
Mean56
Median Absolute Deviation (MAD)28
Skewness0
Sum6216
Variance1036
MonotonicityStrictly increasing
2023-12-13T03:33:10.484896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
2 1
 
0.9%
83 1
 
0.9%
82 1
 
0.9%
81 1
 
0.9%
80 1
 
0.9%
79 1
 
0.9%
78 1
 
0.9%
77 1
 
0.9%
76 1
 
0.9%
Other values (101) 101
91.0%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
111 1
0.9%
110 1
0.9%
109 1
0.9%
108 1
0.9%
107 1
0.9%
106 1
0.9%
105 1
0.9%
104 1
0.9%
103 1
0.9%
102 1
0.9%

자치구
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1020.0 B
서대문구
111 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서대문구
2nd row서대문구
3rd row서대문구
4th row서대문구
5th row서대문구

Common Values

ValueCountFrequency (%)
서대문구 111
100.0%

Length

2023-12-13T03:33:10.635464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:33:10.766550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서대문구 111
100.0%

시설군
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)12.6%
Missing0
Missing (%)0.0%
Memory size1020.0 B
실내주차장
35 
어린이집
19 
PC영업시설
12 
의료기관
지하역사
Other values (9)
31 

Length

Max length6
Median length5
Mean length4.5225225
Min length2

Unique

Unique2 ?
Unique (%)1.8%

Sample

1st row지하역사
2nd row지하역사
3rd row지하역사
4th row지하역사
5th row지하역사

Common Values

ValueCountFrequency (%)
실내주차장 35
31.5%
어린이집 19
17.1%
PC영업시설 12
 
10.8%
의료기관 8
 
7.2%
지하역사 6
 
5.4%
목욕장 6
 
5.4%
대규모점포 6
 
5.4%
영화상영관 4
 
3.6%
박물관 4
 
3.6%
학원 4
 
3.6%
Other values (4) 7
 
6.3%

Length

2023-12-13T03:33:10.882878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
실내주차장 35
31.5%
어린이집 19
17.1%
pc영업시설 12
 
10.8%
의료기관 8
 
7.2%
지하역사 6
 
5.4%
목욕장 6
 
5.4%
대규모점포 6
 
5.4%
영화상영관 4
 
3.6%
박물관 4
 
3.6%
학원 4
 
3.6%
Other values (4) 7
 
6.3%
Distinct107
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size1020.0 B
2023-12-13T03:33:11.214985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length17
Mean length8.3873874
Min length2

Characters and Unicode

Total characters931
Distinct characters243
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)92.8%

Sample

1st row충정로역(2호선)
2nd row충정로역(5호선)
3rd row독립문역
4th row홍제역
5th row무악재역
ValueCountFrequency (%)
pc방 7
 
4.6%
학교법인연세대학교의과대학세브란스병원 3
 
2.0%
학교법인연세대학교치과대학치과병원 2
 
1.3%
충정로사옥 2
 
1.3%
본관 2
 
1.3%
그랜드힐튼호텔 2
 
1.3%
nh농협생명빌딩 2
 
1.3%
산후조리원 2
 
1.3%
현대 2
 
1.3%
예스에이피엠 2
 
1.3%
Other values (124) 126
82.9%
2023-12-13T03:33:11.697575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
44
 
4.7%
27
 
2.9%
21
 
2.3%
19
 
2.0%
19
 
2.0%
16
 
1.7%
16
 
1.7%
15
 
1.6%
15
 
1.6%
P 14
 
1.5%
Other values (233) 725
77.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 785
84.3%
Uppercase Letter 47
 
5.0%
Space Separator 44
 
4.7%
Lowercase Letter 18
 
1.9%
Close Punctuation 11
 
1.2%
Open Punctuation 11
 
1.2%
Decimal Number 9
 
1.0%
Dash Punctuation 3
 
0.3%
Other Punctuation 2
 
0.2%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
27
 
3.4%
21
 
2.7%
19
 
2.4%
19
 
2.4%
16
 
2.0%
16
 
2.0%
15
 
1.9%
15
 
1.9%
14
 
1.8%
14
 
1.8%
Other values (197) 609
77.6%
Uppercase Letter
ValueCountFrequency (%)
P 14
29.8%
C 13
27.7%
K 3
 
6.4%
V 3
 
6.4%
H 2
 
4.3%
N 2
 
4.3%
G 2
 
4.3%
B 2
 
4.3%
U 2
 
4.3%
R 1
 
2.1%
Other values (3) 3
 
6.4%
Lowercase Letter
ValueCountFrequency (%)
e 4
22.2%
p 2
11.1%
x 2
11.1%
l 2
11.1%
o 2
11.1%
s 1
 
5.6%
k 1
 
5.6%
n 1
 
5.6%
f 1
 
5.6%
a 1
 
5.6%
Decimal Number
ValueCountFrequency (%)
4 3
33.3%
2 3
33.3%
1 1
 
11.1%
3 1
 
11.1%
5 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
& 1
50.0%
· 1
50.0%
Space Separator
ValueCountFrequency (%)
44
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 786
84.4%
Common 80
 
8.6%
Latin 65
 
7.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
 
3.4%
21
 
2.7%
19
 
2.4%
19
 
2.4%
16
 
2.0%
16
 
2.0%
15
 
1.9%
15
 
1.9%
14
 
1.8%
14
 
1.8%
Other values (198) 610
77.6%
Latin
ValueCountFrequency (%)
P 14
21.5%
C 13
20.0%
e 4
 
6.2%
K 3
 
4.6%
V 3
 
4.6%
H 2
 
3.1%
N 2
 
3.1%
G 2
 
3.1%
B 2
 
3.1%
p 2
 
3.1%
Other values (14) 18
27.7%
Common
ValueCountFrequency (%)
44
55.0%
) 11
 
13.8%
( 11
 
13.8%
- 3
 
3.8%
4 3
 
3.8%
2 3
 
3.8%
1 1
 
1.2%
& 1
 
1.2%
· 1
 
1.2%
3 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 785
84.3%
ASCII 144
 
15.5%
None 2
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
44
30.6%
P 14
 
9.7%
C 13
 
9.0%
) 11
 
7.6%
( 11
 
7.6%
e 4
 
2.8%
- 3
 
2.1%
4 3
 
2.1%
K 3
 
2.1%
2 3
 
2.1%
Other values (24) 35
24.3%
Hangul
ValueCountFrequency (%)
27
 
3.4%
21
 
2.7%
19
 
2.4%
19
 
2.4%
16
 
2.0%
16
 
2.0%
15
 
1.9%
15
 
1.9%
14
 
1.8%
14
 
1.8%
Other values (197) 609
77.6%
None
ValueCountFrequency (%)
1
50.0%
· 1
50.0%

주소
Text

Distinct104
Distinct (%)93.7%
Missing0
Missing (%)0.0%
Memory size1020.0 B
2023-12-13T03:33:12.045736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length43
Mean length27.585586
Min length18

Characters and Unicode

Total characters3062
Distinct characters156
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)90.1%

Sample

1st row서울특별시 서대문구 서소문로 17 (충정로3가)
2nd row서울특별시 서대문구 충정로 28-1 (충정로3가)
3rd row서울특별시 서대문구 통일로 247 (현저동)
4th row서울특별시 서대문구 통일로 440-1 (홍제동)
5th row서울특별시 서대문구 통일로 361 (홍제동)
ValueCountFrequency (%)
서울특별시 111
 
18.6%
서대문구 110
 
18.4%
창천동 18
 
3.0%
통일로 15
 
2.5%
연세로 14
 
2.3%
충정로 11
 
1.8%
신촌로 10
 
1.7%
충정로3가 10
 
1.7%
홍제동 10
 
1.7%
신촌동 9
 
1.5%
Other values (195) 280
46.8%
2023-12-13T03:33:12.832917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
492
 
16.1%
227
 
7.4%
126
 
4.1%
120
 
3.9%
113
 
3.7%
113
 
3.7%
113
 
3.7%
111
 
3.6%
111
 
3.6%
111
 
3.6%
Other values (146) 1425
46.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1919
62.7%
Space Separator 492
 
16.1%
Decimal Number 369
 
12.1%
Close Punctuation 101
 
3.3%
Open Punctuation 101
 
3.3%
Other Punctuation 49
 
1.6%
Dash Punctuation 14
 
0.5%
Uppercase Letter 10
 
0.3%
Math Symbol 7
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
227
 
11.8%
126
 
6.6%
120
 
6.3%
113
 
5.9%
113
 
5.9%
113
 
5.9%
111
 
5.8%
111
 
5.8%
111
 
5.8%
88
 
4.6%
Other values (122) 686
35.7%
Decimal Number
ValueCountFrequency (%)
1 71
19.2%
2 58
15.7%
3 54
14.6%
5 43
11.7%
4 38
10.3%
0 26
 
7.0%
8 23
 
6.2%
7 23
 
6.2%
6 21
 
5.7%
9 12
 
3.3%
Uppercase Letter
ValueCountFrequency (%)
B 3
30.0%
V 1
 
10.0%
E 1
 
10.0%
R 1
 
10.0%
T 1
 
10.0%
I 1
 
10.0%
G 1
 
10.0%
O 1
 
10.0%
Space Separator
ValueCountFrequency (%)
492
100.0%
Close Punctuation
ValueCountFrequency (%)
) 101
100.0%
Open Punctuation
ValueCountFrequency (%)
( 101
100.0%
Other Punctuation
ValueCountFrequency (%)
, 49
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1919
62.7%
Common 1133
37.0%
Latin 10
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
227
 
11.8%
126
 
6.6%
120
 
6.3%
113
 
5.9%
113
 
5.9%
113
 
5.9%
111
 
5.8%
111
 
5.8%
111
 
5.8%
88
 
4.6%
Other values (122) 686
35.7%
Common
ValueCountFrequency (%)
492
43.4%
) 101
 
8.9%
( 101
 
8.9%
1 71
 
6.3%
2 58
 
5.1%
3 54
 
4.8%
, 49
 
4.3%
5 43
 
3.8%
4 38
 
3.4%
0 26
 
2.3%
Other values (6) 100
 
8.8%
Latin
ValueCountFrequency (%)
B 3
30.0%
V 1
 
10.0%
E 1
 
10.0%
R 1
 
10.0%
T 1
 
10.0%
I 1
 
10.0%
G 1
 
10.0%
O 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1919
62.7%
ASCII 1143
37.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
492
43.0%
) 101
 
8.8%
( 101
 
8.8%
1 71
 
6.2%
2 58
 
5.1%
3 54
 
4.7%
, 49
 
4.3%
5 43
 
3.8%
4 38
 
3.3%
0 26
 
2.3%
Other values (14) 110
 
9.6%
Hangul
ValueCountFrequency (%)
227
 
11.8%
126
 
6.6%
120
 
6.3%
113
 
5.9%
113
 
5.9%
113
 
5.9%
111
 
5.8%
111
 
5.8%
111
 
5.8%
88
 
4.6%
Other values (122) 686
35.7%

전화번호
Text

MISSING 

Distinct91
Distinct (%)92.9%
Missing13
Missing (%)11.7%
Memory size1020.0 B
2023-12-13T03:33:13.211712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length11.438776
Min length11

Characters and Unicode

Total characters1121
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique87 ?
Unique (%)88.8%

Sample

1st row02-6110-2431
2nd row02-6311-5311
3rd row02-6110-3261
4th row02-6110-3241
5th row02-6110-3251
ValueCountFrequency (%)
02-2228-1452 5
 
5.1%
02-2287-8294 2
 
2.0%
02-3145-3650 2
 
2.0%
02-360-5271 2
 
2.0%
02-6110-2431 1
 
1.0%
02-365-5484 1
 
1.0%
02-363-9533 1
 
1.0%
070-7438-9421 1
 
1.0%
02-362-3115 1
 
1.0%
02-364-4732 1
 
1.0%
Other values (81) 81
82.7%
2023-12-13T03:33:13.850151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 196
17.5%
2 178
15.9%
0 174
15.5%
3 142
12.7%
1 90
8.0%
7 73
 
6.5%
6 72
 
6.4%
5 59
 
5.3%
4 52
 
4.6%
9 46
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 925
82.5%
Dash Punctuation 196
 
17.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 178
19.2%
0 174
18.8%
3 142
15.4%
1 90
9.7%
7 73
7.9%
6 72
7.8%
5 59
 
6.4%
4 52
 
5.6%
9 46
 
5.0%
8 39
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 196
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1121
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 196
17.5%
2 178
15.9%
0 174
15.5%
3 142
12.7%
1 90
8.0%
7 73
 
6.5%
6 72
 
6.4%
5 59
 
5.3%
4 52
 
4.6%
9 46
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1121
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 196
17.5%
2 178
15.9%
0 174
15.5%
3 142
12.7%
1 90
8.0%
7 73
 
6.5%
6 72
 
6.4%
5 59
 
5.3%
4 52
 
4.6%
9 46
 
4.1%

Interactions

2023-12-13T03:33:09.888550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:33:14.021142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설군전화번호
연번1.0000.9020.948
시설군0.9021.0000.996
전화번호0.9480.9961.000
2023-12-13T03:33:14.181862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설군
연번1.0000.644
시설군0.6441.000

Missing values

2023-12-13T03:33:10.016205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:33:10.151107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번자치구시설군시설명주소전화번호
01서대문구지하역사충정로역(2호선)서울특별시 서대문구 서소문로 17 (충정로3가)02-6110-2431
12서대문구지하역사충정로역(5호선)서울특별시 서대문구 충정로 28-1 (충정로3가)02-6311-5311
23서대문구지하역사독립문역서울특별시 서대문구 통일로 247 (현저동)02-6110-3261
34서대문구지하역사홍제역서울특별시 서대문구 통일로 440-1 (홍제동)02-6110-3241
45서대문구지하역사무악재역서울특별시 서대문구 통일로 361 (홍제동)02-6110-3251
56서대문구지하역사가좌역서울특별시 서대문구 수색로 27 (남가좌동)02-322-7898
67서대문구산후조리원동그라미 산후조리원(구 후 산후조리원)서울특별시 서대문구 신촌역로 10, 혜우빌딩 6층02-362-7500
78서대문구산후조리원제이 산후조리원서울특별시 서대문구 통일로 44202-2153-7788
89서대문구의료기관효담요양병원서울특별시 서대문구 통일로 544 (홍은동)02-395-8880
910서대문구의료기관가자연세병원서울특별시 서대문구 수색로 18, 2~5층 (남가좌동, 영보웨딩부페)02-304-5660
연번자치구시설군시설명주소전화번호
101102서대문구실내주차장성공타워서울특별시 서대문구 수색로 56(북가좌동)02-3152-0831
102103서대문구실내주차장동아일보사 충정로사옥서울특별시 서대문구 충정로 29 (충정로3가)02-361-1167
103104서대문구실내주차장신촌푸르지오시티서울특별시 서대문구 신촌역로 7 (대현동)02-362-4500
104105서대문구실내주차장서대문우체국서울특별시 서대문구 성산로20길 9 (창천동)02-390-9211
105106서대문구실내주차장브라운스톤 연희서울특별시 서대문구 연희로 82 (연희동)<NA>
106107서대문구실내주차장홍성교회서울특별시 서대문구 포방터길 28 (홍제동)02-391-4567
107108서대문구실내주차장홍은1동 제4공영주차장서울특별시 서대문구 홍은중앙로 125 (홍은1동)02-330-1865
108109서대문구실내주차장신촌가이아서울특별시 서대문구 신촌역로 16 (대현동)02-363-3655
109110서대문구실내주차장아현중앙교회서울특별시 서대문구 신촌로29길 11 (북아현동)02-363-1452
110111서대문구실내주차장국제오피스텔서울특별시 서대문구 성산로 543(대신동)02-362-9713