Overview

Dataset statistics

Number of variables5
Number of observations503
Missing cells5
Missing cells (%)0.2%
Duplicate rows65
Duplicate rows (%)12.9%
Total size in memory20.3 KiB
Average record size in memory41.3 B

Variable types

Categorical2
Text2
Numeric1

Dataset

Description서울특별시 서대문구 관내에 위치한 다중이용시설 실내공기질 관리 대상 현황(시설명, 시설군, 주소, 연면적)에 대한 데이터를 제공합니다.
Author서울특별시 서대문구
URLhttps://www.data.go.kr/data/15048876/fileData.do

Alerts

자치구 has constant value ""Constant
Dataset has 65 (12.9%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 00:45:14.770297
Analysis finished2023-12-12 00:45:15.548622
Duration0.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

자치구
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.1 KiB
서대문구
503 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서대문구
2nd row서대문구
3rd row서대문구
4th row서대문구
5th row서대문구

Common Values

ValueCountFrequency (%)
서대문구 503
100.0%

Length

2023-12-12T09:45:15.628009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:45:15.735361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서대문구 503
100.0%

시설군
Categorical

Distinct15
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size4.1 KiB
실내주차장
151 
보육시설
89 
PC영업시설
35 
학원
34 
의료기관
33 
Other values (10)
161 

Length

Max length6
Median length5
Mean length4.3697813
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row어린이집
2nd row어린이집
3rd row어린이집
4th row어린이집
5th row어린이집

Common Values

ValueCountFrequency (%)
실내주차장 151
30.0%
보육시설 89
17.7%
PC영업시설 35
 
7.0%
학원 34
 
6.8%
의료기관 33
 
6.6%
지하역사 30
 
6.0%
대규모점포 28
 
5.6%
목욕장 26
 
5.2%
영화상영관 20
 
4.0%
어린이집 14
 
2.8%
Other values (5) 43
 
8.5%

Length

2023-12-12T09:45:15.888877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
실내주차장 151
30.0%
보육시설 89
17.7%
pc영업시설 35
 
7.0%
학원 34
 
6.8%
의료기관 33
 
6.6%
지하역사 30
 
6.0%
대규모점포 28
 
5.6%
목욕장 26
 
5.2%
영화상영관 20
 
4.0%
어린이집 14
 
2.8%
Other values (5) 43
 
8.5%
Distinct183
Distinct (%)36.4%
Missing0
Missing (%)0.0%
Memory size4.1 KiB
2023-12-12T09:45:16.196267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length18
Mean length7.8290258
Min length2

Characters and Unicode

Total characters3938
Distinct characters266
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)11.1%

Sample

1st row홍제어린이집
2nd row연세대 유진어린이집
3rd row북아현어린이집
4th row서대문구청 직장어린이집
5th row독립문어린이집
ValueCountFrequency (%)
pc방 17
 
2.6%
산후조리원 11
 
1.7%
현대백화점신촌점 10
 
1.5%
예스에이피엠 10
 
1.5%
현대 10
 
1.5%
u-plex 8
 
1.2%
임광빌딩 8
 
1.2%
그랜드힐튼호텔 7
 
1.1%
학교법인연세대학교의과대학세브란스병원 7
 
1.1%
치과병원 6
 
0.9%
Other values (199) 556
85.5%
2023-12-12T09:45:16.603461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
151
 
3.8%
120
 
3.0%
98
 
2.5%
94
 
2.4%
85
 
2.2%
75
 
1.9%
70
 
1.8%
63
 
1.6%
62
 
1.6%
59
 
1.5%
Other values (256) 3061
77.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3445
87.5%
Space Separator 151
 
3.8%
Uppercase Letter 146
 
3.7%
Lowercase Letter 68
 
1.7%
Decimal Number 44
 
1.1%
Close Punctuation 31
 
0.8%
Open Punctuation 31
 
0.8%
Dash Punctuation 12
 
0.3%
Other Punctuation 5
 
0.1%
Other Symbol 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
120
 
3.5%
98
 
2.8%
94
 
2.7%
85
 
2.5%
75
 
2.2%
70
 
2.0%
63
 
1.8%
62
 
1.8%
59
 
1.7%
57
 
1.7%
Other values (221) 2662
77.3%
Uppercase Letter
ValueCountFrequency (%)
P 44
30.1%
C 40
27.4%
K 13
 
8.9%
G 10
 
6.8%
U 10
 
6.8%
V 7
 
4.8%
B 6
 
4.1%
T 5
 
3.4%
Z 5
 
3.4%
N 2
 
1.4%
Other values (3) 4
 
2.7%
Lowercase Letter
ValueCountFrequency (%)
e 16
23.5%
l 10
14.7%
o 8
11.8%
x 8
11.8%
p 7
10.3%
k 5
 
7.4%
n 5
 
7.4%
s 5
 
7.4%
c 2
 
2.9%
f 1
 
1.5%
Decimal Number
ValueCountFrequency (%)
2 15
34.1%
4 15
34.1%
1 6
 
13.6%
5 5
 
11.4%
3 3
 
6.8%
Space Separator
ValueCountFrequency (%)
151
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Other Punctuation
ValueCountFrequency (%)
& 5
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3450
87.6%
Common 274
 
7.0%
Latin 214
 
5.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
120
 
3.5%
98
 
2.8%
94
 
2.7%
85
 
2.5%
75
 
2.2%
70
 
2.0%
63
 
1.8%
62
 
1.8%
59
 
1.7%
57
 
1.7%
Other values (222) 2667
77.3%
Latin
ValueCountFrequency (%)
P 44
20.6%
C 40
18.7%
e 16
 
7.5%
K 13
 
6.1%
G 10
 
4.7%
l 10
 
4.7%
U 10
 
4.7%
o 8
 
3.7%
x 8
 
3.7%
p 7
 
3.3%
Other values (14) 48
22.4%
Common
ValueCountFrequency (%)
151
55.1%
) 31
 
11.3%
( 31
 
11.3%
2 15
 
5.5%
4 15
 
5.5%
- 12
 
4.4%
1 6
 
2.2%
& 5
 
1.8%
5 5
 
1.8%
3 3
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3445
87.5%
ASCII 488
 
12.4%
None 5
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
151
30.9%
P 44
 
9.0%
C 40
 
8.2%
) 31
 
6.4%
( 31
 
6.4%
e 16
 
3.3%
2 15
 
3.1%
4 15
 
3.1%
K 13
 
2.7%
- 12
 
2.5%
Other values (24) 120
24.6%
Hangul
ValueCountFrequency (%)
120
 
3.5%
98
 
2.8%
94
 
2.7%
85
 
2.5%
75
 
2.2%
70
 
2.0%
63
 
1.8%
62
 
1.8%
59
 
1.7%
57
 
1.7%
Other values (221) 2662
77.3%
None
ValueCountFrequency (%)
5
100.0%

주소
Text

Distinct367
Distinct (%)73.0%
Missing0
Missing (%)0.0%
Memory size4.1 KiB
2023-12-12T09:45:16.942013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length39
Mean length25.11332
Min length1

Characters and Unicode

Total characters12632
Distinct characters169
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique257 ?
Unique (%)51.1%

Sample

1st row서울시 서대문구 통일로31길 3-4
2nd row서울시 서대문구 연세로 50
3rd row서울시 서대문구 북아현로5라길 26
4th row서울시 서대문구 홍제천로 182(연희동)
5th row서울시 서대문구 독립문로 12
ValueCountFrequency (%)
서대문구 496
20.3%
서울특별시 493
20.2%
창천동 62
 
2.5%
연세로 51
 
2.1%
통일로 45
 
1.8%
충정로 40
 
1.6%
신촌로 36
 
1.5%
충정로3가 35
 
1.4%
신촌동 30
 
1.2%
홍제동 24
 
1.0%
Other values (334) 1130
46.3%
2023-12-12T09:45:17.430073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1991
 
15.8%
1108
 
8.8%
657
 
5.2%
625
 
4.9%
595
 
4.7%
503
 
4.0%
499
 
4.0%
494
 
3.9%
494
 
3.9%
476
 
3.8%
Other values (159) 5190
41.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8266
65.4%
Space Separator 1991
 
15.8%
Decimal Number 1558
 
12.3%
Close Punctuation 350
 
2.8%
Open Punctuation 350
 
2.8%
Dash Punctuation 75
 
0.6%
Uppercase Letter 26
 
0.2%
Math Symbol 16
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1108
13.4%
657
 
7.9%
625
 
7.6%
595
 
7.2%
503
 
6.1%
499
 
6.0%
494
 
6.0%
494
 
6.0%
476
 
5.8%
324
 
3.9%
Other values (134) 2491
30.1%
Decimal Number
ValueCountFrequency (%)
1 300
19.3%
2 243
15.6%
3 227
14.6%
5 181
11.6%
0 138
8.9%
4 137
8.8%
7 100
 
6.4%
8 97
 
6.2%
6 79
 
5.1%
9 56
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
B 7
26.9%
E 3
11.5%
V 2
 
7.7%
G 2
 
7.7%
R 2
 
7.7%
T 2
 
7.7%
I 2
 
7.7%
O 2
 
7.7%
A 2
 
7.7%
C 2
 
7.7%
Space Separator
ValueCountFrequency (%)
1991
100.0%
Close Punctuation
ValueCountFrequency (%)
) 350
100.0%
Open Punctuation
ValueCountFrequency (%)
( 350
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 75
100.0%
Math Symbol
ValueCountFrequency (%)
~ 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8266
65.4%
Common 4340
34.4%
Latin 26
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1108
13.4%
657
 
7.9%
625
 
7.6%
595
 
7.2%
503
 
6.1%
499
 
6.0%
494
 
6.0%
494
 
6.0%
476
 
5.8%
324
 
3.9%
Other values (134) 2491
30.1%
Common
ValueCountFrequency (%)
1991
45.9%
) 350
 
8.1%
( 350
 
8.1%
1 300
 
6.9%
2 243
 
5.6%
3 227
 
5.2%
5 181
 
4.2%
0 138
 
3.2%
4 137
 
3.2%
7 100
 
2.3%
Other values (5) 323
 
7.4%
Latin
ValueCountFrequency (%)
B 7
26.9%
E 3
11.5%
V 2
 
7.7%
G 2
 
7.7%
R 2
 
7.7%
T 2
 
7.7%
I 2
 
7.7%
O 2
 
7.7%
A 2
 
7.7%
C 2
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8266
65.4%
ASCII 4366
34.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1991
45.6%
) 350
 
8.0%
( 350
 
8.0%
1 300
 
6.9%
2 243
 
5.6%
3 227
 
5.2%
5 181
 
4.1%
0 138
 
3.2%
4 137
 
3.1%
7 100
 
2.3%
Other values (15) 349
 
8.0%
Hangul
ValueCountFrequency (%)
1108
13.4%
657
 
7.9%
625
 
7.6%
595
 
7.2%
503
 
6.1%
499
 
6.0%
494
 
6.0%
494
 
6.0%
476
 
5.8%
324
 
3.9%
Other values (134) 2491
30.1%

연면적(제곱미터)
Real number (ℝ)

Distinct246
Distinct (%)49.4%
Missing5
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean6910.8254
Minimum0
Maximum283843.2
Zeros4
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size4.5 KiB
2023-12-12T09:45:17.562166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile397.12
Q1593.035
median2543.255
Q37371.03
95-th percentile21800
Maximum283843.2
Range283843.2
Interquartile range (IQR)6777.995

Descriptive statistics

Standard deviation19058.185
Coefficient of variation (CV)2.7577292
Kurtosis113.93718
Mean6910.8254
Median Absolute Deviation (MAD)2051.255
Skewness9.5636462
Sum3441591
Variance3.6321441 × 108
MonotonicityNot monotonic
2023-12-12T09:45:17.685079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3472.0 26
 
5.2%
13623.59 11
 
2.2%
590.77 10
 
2.0%
397.12 8
 
1.6%
2096.43 6
 
1.2%
35472.0 5
 
1.0%
397.0 5
 
1.0%
745.67 5
 
1.0%
499.0 5
 
1.0%
14133.0 4
 
0.8%
Other values (236) 413
82.1%
(Missing) 5
 
1.0%
ValueCountFrequency (%)
0.0 4
0.8%
189.0 2
 
0.4%
333.63 3
 
0.6%
336.15 1
 
0.2%
346.72 4
0.8%
347.0 1
 
0.2%
366.43 2
 
0.4%
396.93 1
 
0.2%
397.0 5
1.0%
397.12 8
1.6%
ValueCountFrequency (%)
283843.2 1
 
0.2%
171490.7 2
 
0.4%
105800.5 1
 
0.2%
105800.0 1
 
0.2%
68657.24 2
 
0.4%
38184.0 1
 
0.2%
35500.0 1
 
0.2%
35472.0 5
1.0%
29991.9 2
 
0.4%
28700.0 1
 
0.2%

Interactions

2023-12-12T09:45:15.224115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:45:17.765273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설군연면적(제곱미터)
시설군1.0000.330
연면적(제곱미터)0.3301.000
2023-12-12T09:45:17.840677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연면적(제곱미터)시설군
연면적(제곱미터)1.0000.159
시설군0.1591.000

Missing values

2023-12-12T09:45:15.391654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:45:15.502304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자치구시설군시설명주소연면적(제곱미터)
0서대문구어린이집홍제어린이집서울시 서대문구 통일로31길 3-4<NA>
1서대문구어린이집연세대 유진어린이집서울시 서대문구 연세로 50<NA>
2서대문구어린이집북아현어린이집서울시 서대문구 북아현로5라길 26<NA>
3서대문구어린이집서대문구청 직장어린이집서울시 서대문구 홍제천로 182(연희동)<NA>
4서대문구어린이집독립문어린이집서울시 서대문구 독립문로 12<NA>
5서대문구실내주차장국제오피스텔서울특별시 서대문구 성산로 543(대신동)5821.23
6서대문구실내주차장아현중앙교회서울특별시 서대문구 신촌로29길 11 (북아현동)6688.02
7서대문구실내주차장신촌가이아서울특별시 서대문구 신촌역로 16 (대현동)2487.16
8서대문구실내주차장홍은1동 제4공영주차장서울특별시 서대문구 홍은중앙로 125 (홍은1동)3472.5
9서대문구실내주차장홍성교회서울특별시 서대문구 포방터길 28 (홍제동)5604.0
자치구시설군시설명주소연면적(제곱미터)
493서대문구도서관서대문도서관서울특별시 서대문구 모래내로 4123910.0
494서대문구지하역사무악재역서울특별시 서대문구 홍제4동 26-17640.0
495서대문구지하역사홍제역서울특별시 서대문구 홍제1동 330-6611600.0
496서대문구지하역사독립문역서울특별시 서대문구 현저동 1015970.0
497서대문구보육시설동우어린이나라서울특별시 서대문구 북아현로22나길 85482.0
498서대문구지하역사충정로역(5호선)서울특별시 서대문구 충정로3가 5811400.0
499서대문구지하역사충정로역(2호선)서울특별시 서대문구 충정로3가 295-6010200.0
500서대문구보육시설홍제서울특별시 서대문구 모래내로 26길 17-10438.0
501서대문구보육시설푸른누리서울특별시 서대문구 연희로20길 27번845.0
502서대문구대규모점포신촌밀리오레서울특별시 서대문구 연희로20길 27번13400.0

Duplicate rows

Most frequently occurring

자치구시설군시설명주소연면적(제곱미터)# duplicates
0서대문구PC영업시설3pop PC방서울특별시 서대문구 수색로 144 (북가좌동)333.632
1서대문구PC영업시설세븐 PC방서울특별시 서대문구 수색로 56 210 211212호 (북가좌동 성공타워1)366.432
2서대문구PC영업시설아토즈 PC방서울특별시 서대문구 증가로 259 3층(북가좌동 선정오피스텔)346.722
3서대문구PC영업시설인터라켄PC방서울특별시 서대문구 명물길 6 B1층 (창천동)397.122
4서대문구노인요양시설구립서대문노인전문요양센터서울특별시 서대문구 독립문로8길 57(홍은동)2096.432
5서대문구노인요양시설연희시니어스서울특별시 서대문구 연희로31길 8-7(연희동)1543.782
6서대문구대규모점포신촌밀리오레서울특별시 서대문구 신촌역로 30 (신촌동)13437.02
7서대문구대규모점포예스에이피엠서울특별시 서대문구 이화여대1길 10 (대현동)14133.02
8서대문구대규모점포현대백화점신촌점서울특별시 서대문구 신촌로 83 (창천동)35472.02
9서대문구도서관서대문도서관서울특별시 서대문구 모래내로 412 (연희동)3909.02