Overview

Dataset statistics

Number of variables: 3
Number of observations: 154
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.7 KiB
Average record size in memory: 24.9 B

Variable types

Text: 2
Categorical: 1

Dataset

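Description: A list of multi-use facilities in Seongdong-gu, Seoul, that are subject to the Indoor Air Quality Control Act. It includes information such as facility category, facility name, road-name address, and telephone number.
URL: https://www.data.go.kr/data/15035134/fileData.do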

Reproduction

Analysis started: 2023-12-11 23:21:57.656277
Analysis finished: 2023-12-11 23:21:57.962608
Duration: 0.31 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시설명
Text

Distinct: 151
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 24
Median length: 19
Mean length: 9.3896104
Min length: 4
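
The length figures are summary statistics over the character count of each value. The mean is consistent with the totals reported elsewhere for this variable: 1446 total characters over 154 observations gives 9.3896104. A minimal sketch of how such statistics could be computed with the Python standard library follows; `length_stats` and `sample_names` are illustrative names, and this is not necessarily how the profiler computes them internally.

```python
from statistics import mean, median


def length_stats(values):
    """Return max, median, mean and min string length for a text column."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


# The five sample rows listed in this report for the facility-name column.
sample_names = [
    "금호노인요양원",
    "시립동부노인전문요양센터",
    "이암요양원",
    "(주)신세계이마트성수점",
    "(주)신세계이마트왕십리역점",
]
stats = length_stats(sample_names)
print(stats)

# Cross-check of the reported mean: total characters / number of observations.
reported_mean = round(1446 / 154, 7)
print(reported_mean)
```

Note that `len` counts Unicode code points, so a precomposed Hangul syllable counts as one character.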

Characters and Unicode

Total characters: 1446
Distinct characters: 275
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
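
As a rough illustration of per-character properties, the sketch below counts characters by Unicode general category using Python's standard `unicodedata` module. The two-letter codes correspond to the category names used in this report (for example `Lo` is Other Letter, `Zs` is Space Separator, `Ps` and `Pe` are Open and Close Punctuation). The function name `category_counts` is an assumption for illustration, and the profiler may rely on a different Unicode library; `unicodedata` also does not expose script or block, so only categories are shown.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters per Unicode general category code (e.g. 'Lo', 'Zs')."""
    counts = Counter()
    for value in values:
        counts.update(unicodedata.category(ch) for ch in value)
    return counts


# Two facility names taken from the sample rows of this report.
counts = category_counts(["(주)신세계이마트성수점", "서울숲 더 샵(엔터식스 파크에비뉴 한양대점)"])
for code, n in counts.most_common():
    print(code, n)
```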

Unique

Unique: 148
Unique (%): 96.1%
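
"Distinct" and "Unique" measure different things: distinct is the number of different values, while unique is the number of values that occur exactly once. With 154 observations, 151 distinct values and 148 unique values, 3 values are repeated and together they account for the remaining 6 rows. A small sketch of the distinction, using a hypothetical helper `distinct_and_unique` and made-up data:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Hypothetical column: "b" is repeated, so it is distinct but not unique.
print(distinct_and_unique(["a", "b", "b", "c"]))
```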

Sample

1st row: 금호노인요양원
2nd row: 시립동부노인전문요양센터
3rd row: 이암요양원
4th row: (주)신세계이마트성수점
5th row: (주)신세계이마트왕십리역점

Value | Count | Frequency (%)
구립 | 17 | 7.7%
서울숲 | 8 | 3.6%
서울숲코오롱디지털타워 | 3 | 1.4%
메가박스성수 | 2 | 0.9%
한양대학교 | 2 | 0.9%
오피스텔 | 2 | 0.9%
새활용플라자 | 2 | 0.9%
한라시그마밸리 | 2 | 0.9%
성수역 | 2 | 0.9%
tower | 2 | 0.9%
Other values (177) | 180 | 81.1%
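
The percentages in the word table are each word's count divided by the total number of words; the counts listed sum to 222, so 17 occurrences correspond to about 7.7%. The sketch below shows one way to build such a table. It assumes words are split on whitespace and lowercased (the lowercase "tower" entry suggests case folding, but the report does not state the tokenizer), and `word_frequencies` is an illustrative name.

```python
from collections import Counter


def word_frequencies(values):
    """Split on whitespace, lowercase, and return (word, count, percent) rows."""
    counts = Counter(word.lower() for v in values for word in v.split())
    total = sum(counts.values())
    return [(w, n, round(100 * n / total, 1)) for w, n in counts.most_common()]


# Two facility names taken from the sample table of this report.
for row in word_frequencies(["서울숲SR타워 오피스텔", "케이타워 오피스텔"]):
    print(row)
```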

Most occurring characters

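Value | Count | Frequency (%)
(space) | 68 | 4.7%
(not preserved) | 41 | 2.8%
(not preserved) | 34 | 2.4%
(not preserved) | 31 | 2.1%
(not preserved) | 30 | 2.1%
(not preserved) | 28 | 1.9%
(not preserved) | 27 | 1.9%
(not preserved) | 27 | 1.9%
) | 26 | 1.8%
( | 26 | 1.8%
Other values (265) | 1108 | 76.6%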

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1218 | 84.2%
Uppercase Letter | 69 | 4.8%
Space Separator | 68 | 4.7%
Close Punctuation | 26 | 1.8%
Open Punctuation | 26 | 1.8%
Decimal Number | 23 | 1.6%
Lowercase Letter | 9 | 0.6%
Dash Punctuation | 3 | 0.2%
Other Symbol | 2 | 0.1%
Other Punctuation | 1 | 0.1%

Most frequent character per category

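Other Letter
Value | Count | Frequency (%)
(not preserved) | 41 | 3.4%
(not preserved) | 34 | 2.8%
(not preserved) | 31 | 2.5%
(not preserved) | 30 | 2.5%
(not preserved) | 28 | 2.3%
(not preserved) | 27 | 2.2%
(not preserved) | 27 | 2.2%
(not preserved) | 25 | 2.1%
(not preserved) | 25 | 2.1%
(not preserved) | 24 | 2.0%
Other values (223) | 926 | 76.0%

Uppercase Letter
Value | Count | Frequency (%)
T | 8 | 11.6%
C | 7 | 10.1%
R | 7 | 10.1%
E | 7 | 10.1%
P | 5 | 7.2%
S | 4 | 5.8%
O | 4 | 5.8%
K | 4 | 5.8%
I | 3 | 4.3%
V | 3 | 4.3%
Other values (13) | 17 | 24.6%

Decimal Number
Value | Count | Frequency (%)
1 | 6 | 26.1%
5 | 6 | 26.1%
3 | 4 | 17.4%
2 | 3 | 13.0%
6 | 2 | 8.7%
9 | 1 | 4.3%
4 | 1 | 4.3%

Lowercase Letter
Value | Count | Frequency (%)
o | 2 | 22.2%
w | 2 | 22.2%
e | 2 | 22.2%
r | 2 | 22.2%
t | 1 | 11.1%

Space Separator
Value | Count | Frequency (%)
(space) | 68 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 26 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 26 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 3 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not preserved) | 2 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
/ | 1 | 100.0%

Letter Number
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%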

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1220 | 84.4%
Common | 147 | 10.2%
Latin | 79 | 5.5%

Most frequent character per script

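Hangul
Value | Count | Frequency (%)
(not preserved) | 41 | 3.4%
(not preserved) | 34 | 2.8%
(not preserved) | 31 | 2.5%
(not preserved) | 30 | 2.5%
(not preserved) | 28 | 2.3%
(not preserved) | 27 | 2.2%
(not preserved) | 27 | 2.2%
(not preserved) | 25 | 2.0%
(not preserved) | 25 | 2.0%
(not preserved) | 24 | 2.0%
Other values (224) | 928 | 76.1%

Latin
Value | Count | Frequency (%)
T | 8 | 10.1%
C | 7 | 8.9%
R | 7 | 8.9%
E | 7 | 8.9%
P | 5 | 6.3%
S | 4 | 5.1%
O | 4 | 5.1%
K | 4 | 5.1%
I | 3 | 3.8%
V | 3 | 3.8%
Other values (19) | 27 | 34.2%

Common
Value | Count | Frequency (%)
(space) | 68 | 46.3%
) | 26 | 17.7%
( | 26 | 17.7%
1 | 6 | 4.1%
5 | 6 | 4.1%
3 | 4 | 2.7%
2 | 3 | 2.0%
- | 3 | 2.0%
6 | 2 | 1.4%
9 | 1 | 0.7%
Other values (2) | 2 | 1.4%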

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1218 | 84.2%
ASCII | 225 | 15.6%
None | 2 | 0.1%
Number Forms | 1 | 0.1%

Most frequent character per block

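ASCII
Value | Count | Frequency (%)
(space) | 68 | 30.2%
) | 26 | 11.6%
( | 26 | 11.6%
T | 8 | 3.6%
C | 7 | 3.1%
R | 7 | 3.1%
E | 7 | 3.1%
1 | 6 | 2.7%
5 | 6 | 2.7%
P | 5 | 2.2%
Other values (30) | 59 | 26.2%

Hangul
Value | Count | Frequency (%)
(not preserved) | 41 | 3.4%
(not preserved) | 34 | 2.8%
(not preserved) | 31 | 2.5%
(not preserved) | 30 | 2.5%
(not preserved) | 28 | 2.3%
(not preserved) | 27 | 2.2%
(not preserved) | 27 | 2.2%
(not preserved) | 25 | 2.1%
(not preserved) | 25 | 2.1%
(not preserved) | 24 | 2.0%
Other values (223) | 926 | 76.0%

None
Value | Count | Frequency (%)
(not preserved) | 2 | 100.0%

Number Forms
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%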

주소
Text

Distinct: 145
Distinct (%): 94.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 64
Median length: 34
Mean length: 26.798701
Min length: 15

Characters and Unicode

Total characters: 4127
Distinct characters: 115
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 138
Unique (%): 89.6%

Sample

1st row: 서울특별시 성동구 금호로 45(금호동4가)
2nd row: 서울특별시 성동구 마장로23길 12(홍익동, 시설관리팀)
3rd row: 서울특별시 성동구 마장로 125(상왕십리동)
4th row: 서울특별시 성동구 뚝섬로 377(성수2가1동)
5th row: 서울특별시 성동구 왕십리광장로 17(행당동)

Value | Count | Frequency (%)
성동구 | 152 | 21.5%
서울특별시 | 151 | 21.3%
관리사무소 | 45 | 6.4%
왕십리로 | 17 | 2.4%
지하 | 9 | 1.3%
광나루로 | 5 | 0.7%
천호대로 | 5 | 0.7%
행당로 | 5 | 0.7%
성수이로 | 5 | 0.7%
아차산로 | 5 | 0.7%
Other values (234) | 309 | 43.6%

Most occurring characters

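Value | Count | Frequency (%)
(space) | 556 | 13.5%
(not preserved) | 306 | 7.4%
(not preserved) | 228 | 5.5%
(not preserved) | 163 | 3.9%
(not preserved) | 161 | 3.9%
(not preserved) | 155 | 3.8%
(not preserved) | 154 | 3.7%
(not preserved) | 151 | 3.7%
(not preserved) | 151 | 3.7%
) | 129 | 3.1%
Other values (105) | 1973 | 47.8%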

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2669 | 64.7%
Space Separator | 556 | 13.5%
Decimal Number | 556 | 13.5%
Close Punctuation | 129 | 3.1%
Open Punctuation | 129 | 3.1%
Other Punctuation | 70 | 1.7%
Dash Punctuation | 13 | 0.3%
Math Symbol | 3 | 0.1%
Uppercase Letter | 1 | < 0.1%
Other Symbol | 1 | < 0.1%

Most frequent character per category

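Other Letter
Value | Count | Frequency (%)
(not preserved) | 306 | 11.5%
(not preserved) | 228 | 8.5%
(not preserved) | 163 | 6.1%
(not preserved) | 161 | 6.0%
(not preserved) | 155 | 5.8%
(not preserved) | 154 | 5.8%
(not preserved) | 151 | 5.7%
(not preserved) | 151 | 5.7%
(not preserved) | 116 | 4.3%
(not preserved) | 81 | 3.0%
Other values (87) | 1003 | 37.6%

Decimal Number
Value | Count | Frequency (%)
1 | 119 | 21.4%
2 | 111 | 20.0%
3 | 59 | 10.6%
4 | 44 | 7.9%
0 | 43 | 7.7%
6 | 42 | 7.6%
5 | 41 | 7.4%
7 | 41 | 7.4%
8 | 32 | 5.8%
9 | 24 | 4.3%

Space Separator
Value | Count | Frequency (%)
(space) | 556 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 129 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 129 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 70 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 13 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 3 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
C | 1 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%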

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2670 | 64.7%
Common | 1456 | 35.3%
Latin | 1 | < 0.1%

Most frequent character per script

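Hangul
Value | Count | Frequency (%)
(not preserved) | 306 | 11.5%
(not preserved) | 228 | 8.5%
(not preserved) | 163 | 6.1%
(not preserved) | 161 | 6.0%
(not preserved) | 155 | 5.8%
(not preserved) | 154 | 5.8%
(not preserved) | 151 | 5.7%
(not preserved) | 151 | 5.7%
(not preserved) | 116 | 4.3%
(not preserved) | 81 | 3.0%
Other values (88) | 1004 | 37.6%

Common
Value | Count | Frequency (%)
(space) | 556 | 38.2%
) | 129 | 8.9%
( | 129 | 8.9%
1 | 119 | 8.2%
2 | 111 | 7.6%
, | 70 | 4.8%
3 | 59 | 4.1%
4 | 44 | 3.0%
0 | 43 | 3.0%
6 | 42 | 2.9%
Other values (6) | 154 | 10.6%

Latin
Value | Count | Frequency (%)
C | 1 | 100.0%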

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2669 | 64.7%
ASCII | 1457 | 35.3%
None | 1 | < 0.1%

Most frequent character per block

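ASCII
Value | Count | Frequency (%)
(space) | 556 | 38.2%
) | 129 | 8.9%
( | 129 | 8.9%
1 | 119 | 8.2%
2 | 111 | 7.6%
, | 70 | 4.8%
3 | 59 | 4.0%
4 | 44 | 3.0%
0 | 43 | 3.0%
6 | 42 | 2.9%
Other values (7) | 155 | 10.6%

Hangul
Value | Count | Frequency (%)
(not preserved) | 306 | 11.5%
(not preserved) | 228 | 8.5%
(not preserved) | 163 | 6.1%
(not preserved) | 161 | 6.0%
(not preserved) | 155 | 5.8%
(not preserved) | 154 | 5.8%
(not preserved) | 151 | 5.7%
(not preserved) | 151 | 5.7%
(not preserved) | 116 | 4.3%
(not preserved) | 81 | 3.0%
Other values (87) | 1003 | 37.6%

None
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%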

시설군
Categorical

Distinct: 16
Distinct (%): 10.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
실내주차장 | 83
어린이집 | 23
대규모점포 | 10
지하역사 | 9
의료기관 | 8
Other values (11) | 21

Length

Max length: 6
Median length: 5
Mean length: 4.5324675
Min length: 2

Unique

Unique: 6
Unique (%): 3.9%

Sample

1st row: 노인요양시설
2nd row: 노인요양시설
3rd row: 노인요양시설
4th row: 대규모점포
5th row: 대규모점포

Common Values

Value | Count | Frequency (%)
실내주차장 | 83 | 53.9%
어린이집 | 23 | 14.9%
대규모점포 | 10 | 6.5%
지하역사 | 9 | 5.8%
의료기관 | 8 | 5.2%
목욕장 | 5 | 3.2%
노인요양시설 | 3 | 1.9%
학원 | 3 | 1.9%
영화관 | 2 | 1.3%
pc방 | 2 | 1.3%
Other values (6) | 6 | 3.9%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
실내주차장 | 83 | 53.9%
어린이집 | 23 | 14.9%
대규모점포 | 10 | 6.5%
지하역사 | 9 | 5.8%
의료기관 | 8 | 5.2%
목욕장 | 5 | 3.2%
노인요양시설 | 3 | 1.9%
학원 | 3 | 1.9%
pc방 | 3 | 1.9%
영화관 | 2 | 1.3%
Other values (5) | 5 | 3.2%

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completeness.
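
The nullity views summarize, per column, how many cells are missing; this dataset reports 0 missing cells across 154 rows and 3 columns. As a minimal sketch of the underlying count, assuming rows are held as dictionaries and that only `None` counts as missing (the profiler's own rules may differ), a hypothetical helper `missing_summary` could look like this:

```python
def missing_summary(rows, columns):
    """Count missing (None) cells per column and the overall missing share in %."""
    per_column = {c: sum(1 for r in rows if r.get(c) is None) for c in columns}
    total_cells = len(rows) * len(columns)
    missing_pct = 100 * sum(per_column.values()) / total_cells if total_cells else 0.0
    return per_column, missing_pct


# Two complete rows taken from the sample table of this report.
columns = ["시설명", "주소", "시설군"]
rows = [
    {"시설명": "이암요양원", "주소": "서울특별시 성동구 마장로 125(상왕십리동)", "시설군": "노인요양시설"},
    {"시설명": "성수불막사우나", "주소": "서울특별시 성동구 동일로 143(성수동2가)", "시설군": "목욕장"},
]
per_column, missing_pct = missing_summary(rows, columns)
print(per_column, missing_pct)
```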

Sample

Index | 시설명 | 주소 | 시설군
0 | 금호노인요양원 | 서울특별시 성동구 금호로 45(금호동4가) | 노인요양시설
1 | 시립동부노인전문요양센터 | 서울특별시 성동구 마장로23길 12(홍익동, 시설관리팀) | 노인요양시설
2 | 이암요양원 | 서울특별시 성동구 마장로 125(상왕십리동) | 노인요양시설
3 | (주)신세계이마트성수점 | 서울특별시 성동구 뚝섬로 377(성수2가1동) | 대규모점포
4 | (주)신세계이마트왕십리역점 | 서울특별시 성동구 왕십리광장로 17(행당동) | 대규모점포
5 | 대림리빙프라자 | 서울특별시 성동구 행당로 75(행당동), 관리사무소 | 대규모점포
6 | 삼성쉐르빌상가 | 서울특별시 성동구 무학로6길 50(도선동), 관리사무소 | 대규모점포
7 | 성수쇼핑센터 ㈜인트랩 | 서울특별시 성동구 아차산로7길 28(성수2가3동), ㈜ 인트램(관리사무소) | 대규모점포
8 | 용답자동차매매센터 | 자동차시장1길 70(용답동), C동 3층 가-6호(관리사무소) | 대규모점포
9 | 서울숲 더 샵(엔터식스 파크에비뉴 한양대점) | 서울특별시 성동구 왕십리로 241(행당동), 관리사무소 | 대규모점포

Index | 시설명 | 주소 | 시설군
144 | 하니삐아제어린이집 | 서울특별시 성동구 독서당로272 금호4가대우아파트관리동 | 어린이집
145 | 에벤에셀516타워 | 서울특별시 성동구 옥수동 265-1 외1필지 | 실내주차장
146 | 위너스오피스텔 | 서울특별시 성동구 도선동 285-1 | 실내주차장
147 | CORNER19 | 서울특별시 성동구 성수동2가 314-19 | 실내주차장
148 | 서울숲에이원센터 | 서울특별시 성동구 성수동1가 13-209 외5필지 | 실내주차장
149 | 성수에이원센터 | 서울특별시 성동구 성수동2가 269-63 | 실내주차장
150 | 서울숲SR타워 오피스텔 | 서울특별시 성동구 도선동 126 외1필지 | 실내주차장
151 | 케이타워 오피스텔 | 서울특별시 성동구 용답동 229-2 | 실내주차장
152 | 동부자동차써비스 | 서울특별시 성동구 성수동2가 329 | 실내주차장
153 | 성수불막사우나 | 서울특별시 성동구 동일로 143(성수동2가) | 목욕장