Overview

Dataset statistics

Number of variables5
Number of observations299
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.1 KiB
Average record size in memory41.4 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description서울특별시 중구 관내 다중이용시설 실내공기질 관리시설 현황입니다. 서울시 중구 관내 시설명, 시설군, 주소 등이 제공됩니다.
URLhttps://www.data.go.kr/data/3078582/fileData.do

Alerts

자치구 has constant value ""Constant
연번 is highly overall correlated with 시설군High correlation
시설군 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:05:45.198249
Analysis finished2023-12-12 14:05:46.084542
Duration0.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct299
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean150
Minimum1
Maximum299
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 KiB
2023-12-12T23:05:46.172421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile15.9
Q175.5
median150
Q3224.5
95-th percentile284.1
Maximum299
Range298
Interquartile range (IQR)149

Descriptive statistics

Standard deviation86.458082
Coefficient of variation (CV)0.57638722
Kurtosis-1.2
Mean150
Median Absolute Deviation (MAD)75
Skewness0
Sum44850
Variance7475
MonotonicityStrictly increasing
2023-12-12T23:05:46.343067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
207 1
 
0.3%
205 1
 
0.3%
204 1
 
0.3%
203 1
 
0.3%
202 1
 
0.3%
201 1
 
0.3%
200 1
 
0.3%
199 1
 
0.3%
198 1
 
0.3%
Other values (289) 289
96.7%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
299 1
0.3%
298 1
0.3%
297 1
0.3%
296 1
0.3%
295 1
0.3%
294 1
0.3%
293 1
0.3%
292 1
0.3%
291 1
0.3%
290 1
0.3%

자치구
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
서울특별시 중구
299 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시 중구
2nd row서울특별시 중구
3rd row서울특별시 중구
4th row서울특별시 중구
5th row서울특별시 중구

Common Values

ValueCountFrequency (%)
서울특별시 중구 299
100.0%

Length

2023-12-12T23:05:46.514803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:05:46.636685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 299
50.0%
중구 299
50.0%

시설군
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
실내주차장
151 
대규모점포
61 
지하역사
23 
어린이집
21 
지하도상가
 
11
Other values (10)
32 

Length

Max length6
Median length5
Mean length4.7391304
Min length2

Unique

Unique5 ?
Unique (%)1.7%

Sample

1st row지하역사
2nd row지하역사
3rd row지하역사
4th row지하역사
5th row지하역사

Common Values

ValueCountFrequency (%)
실내주차장 151
50.5%
대규모점포 61
20.4%
지하역사 23
 
7.7%
어린이집 21
 
7.0%
지하도상가 11
 
3.7%
의료기관 9
 
3.0%
영화상영관 8
 
2.7%
목욕장 6
 
2.0%
미술관 2
 
0.7%
박물관 2
 
0.7%
Other values (5) 5
 
1.7%

Length

2023-12-12T23:05:46.786474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
실내주차장 151
50.5%
대규모점포 61
20.4%
지하역사 23
 
7.7%
어린이집 21
 
7.0%
지하도상가 11
 
3.7%
의료기관 9
 
3.0%
영화상영관 8
 
2.7%
목욕장 6
 
2.0%
미술관 2
 
0.7%
박물관 2
 
0.7%
Other values (5) 5
 
1.7%
Distinct292
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T23:05:47.095149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length8.6822742
Min length3

Characters and Unicode

Total characters2596
Distinct characters309
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique286 ?
Unique (%)95.7%

Sample

1st row서울역(1호선)
2nd row시청역(1호선)
3rd row시청역(2호선)
4th row을지로입구(2호선)
5th row을지로3가(2호선)
ValueCountFrequency (%)
지하상가(시설관리공단 4
 
1.2%
지하도상가(시설관리공단 4
 
1.2%
동대문디자인플라자 3
 
0.9%
굿모닝시티 3
 
0.9%
주차장 2
 
0.6%
한화빌딩주차장 2
 
0.6%
어린이집 2
 
0.6%
남산타운 2
 
0.6%
cgv 2
 
0.6%
명동 2
 
0.6%
Other values (311) 319
92.5%
2023-12-12T23:05:47.574088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
130
 
5.0%
123
 
4.7%
118
 
4.5%
) 54
 
2.1%
( 51
 
2.0%
47
 
1.8%
47
 
1.8%
46
 
1.8%
44
 
1.7%
39
 
1.5%
Other values (299) 1897
73.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2312
89.1%
Uppercase Letter 73
 
2.8%
Close Punctuation 54
 
2.1%
Open Punctuation 51
 
2.0%
Space Separator 47
 
1.8%
Decimal Number 39
 
1.5%
Other Symbol 16
 
0.6%
Other Punctuation 2
 
0.1%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
130
 
5.6%
123
 
5.3%
118
 
5.1%
47
 
2.0%
46
 
2.0%
44
 
1.9%
39
 
1.7%
35
 
1.5%
32
 
1.4%
31
 
1.3%
Other values (267) 1667
72.1%
Uppercase Letter
ValueCountFrequency (%)
C 10
13.7%
S 8
11.0%
K 7
9.6%
A 6
 
8.2%
E 6
 
8.2%
M 5
 
6.8%
J 5
 
6.8%
I 4
 
5.5%
G 3
 
4.1%
N 3
 
4.1%
Other values (9) 16
21.9%
Decimal Number
ValueCountFrequency (%)
2 10
25.6%
4 8
20.5%
3 8
20.5%
5 5
12.8%
6 4
 
10.3%
1 3
 
7.7%
0 1
 
2.6%
Close Punctuation
ValueCountFrequency (%)
) 54
100.0%
Open Punctuation
ValueCountFrequency (%)
( 51
100.0%
Space Separator
ValueCountFrequency (%)
47
100.0%
Other Symbol
ValueCountFrequency (%)
16
100.0%
Other Punctuation
ValueCountFrequency (%)
& 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2328
89.7%
Common 195
 
7.5%
Latin 73
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
130
 
5.6%
123
 
5.3%
118
 
5.1%
47
 
2.0%
46
 
2.0%
44
 
1.9%
39
 
1.7%
35
 
1.5%
32
 
1.4%
31
 
1.3%
Other values (268) 1683
72.3%
Latin
ValueCountFrequency (%)
C 10
13.7%
S 8
11.0%
K 7
9.6%
A 6
 
8.2%
E 6
 
8.2%
M 5
 
6.8%
J 5
 
6.8%
I 4
 
5.5%
G 3
 
4.1%
N 3
 
4.1%
Other values (9) 16
21.9%
Common
ValueCountFrequency (%)
) 54
27.7%
( 51
26.2%
47
24.1%
2 10
 
5.1%
4 8
 
4.1%
3 8
 
4.1%
5 5
 
2.6%
6 4
 
2.1%
1 3
 
1.5%
& 2
 
1.0%
Other values (2) 3
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2312
89.1%
ASCII 268
 
10.3%
None 16
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
130
 
5.6%
123
 
5.3%
118
 
5.1%
47
 
2.0%
46
 
2.0%
44
 
1.9%
39
 
1.7%
35
 
1.5%
32
 
1.4%
31
 
1.3%
Other values (267) 1667
72.1%
ASCII
ValueCountFrequency (%)
) 54
20.1%
( 51
19.0%
47
17.5%
2 10
 
3.7%
C 10
 
3.7%
S 8
 
3.0%
4 8
 
3.0%
3 8
 
3.0%
K 7
 
2.6%
A 6
 
2.2%
Other values (21) 59
22.0%
None
ValueCountFrequency (%)
16
100.0%

주소
Text

Distinct260
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T23:05:47.934727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length35
Mean length18.297659
Min length14

Characters and Unicode

Total characters5471
Distinct characters159
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique227 ?
Unique (%)75.9%

Sample

1st row서울특별시 중구 세종대로 지하2(남대문로5가)
2nd row서울특별시 중구 세종대로 지하101(정동)
3rd row서울특별시 중구 서소문로 지하131(서소문동)
4th row서울특별시 중구 을지로 지하42
5th row서울특별시 중구 을지로 지하106
ValueCountFrequency (%)
중구 298
24.1%
서울특별시 296
23.9%
을지로 36
 
2.9%
퇴계로 25
 
2.0%
청계천로 18
 
1.5%
남대문로 18
 
1.5%
세종대로 14
 
1.1%
소공로 12
 
1.0%
장충단로 12
 
1.0%
서소문로 10
 
0.8%
Other values (295) 497
40.2%
2023-12-12T23:05:48.404524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
941
17.2%
319
 
5.8%
310
 
5.7%
302
 
5.5%
301
 
5.5%
299
 
5.5%
297
 
5.4%
297
 
5.4%
295
 
5.4%
1 165
 
3.0%
Other values (149) 1945
35.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3537
64.6%
Space Separator 941
 
17.2%
Decimal Number 849
 
15.5%
Close Punctuation 50
 
0.9%
Open Punctuation 50
 
0.9%
Uppercase Letter 18
 
0.3%
Other Punctuation 16
 
0.3%
Dash Punctuation 10
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
319
 
9.0%
310
 
8.8%
302
 
8.5%
301
 
8.5%
299
 
8.5%
297
 
8.4%
297
 
8.4%
295
 
8.3%
86
 
2.4%
81
 
2.3%
Other values (121) 950
26.9%
Uppercase Letter
ValueCountFrequency (%)
S 4
22.2%
K 2
11.1%
T 2
11.1%
L 1
 
5.6%
U 1
 
5.6%
D 1
 
5.6%
I 1
 
5.6%
O 1
 
5.6%
W 1
 
5.6%
G 1
 
5.6%
Other values (3) 3
16.7%
Decimal Number
ValueCountFrequency (%)
1 165
19.4%
2 137
16.1%
3 102
12.0%
4 75
8.8%
0 74
8.7%
5 66
 
7.8%
6 65
 
7.7%
7 59
 
6.9%
9 56
 
6.6%
8 50
 
5.9%
Space Separator
ValueCountFrequency (%)
941
100.0%
Close Punctuation
ValueCountFrequency (%)
) 50
100.0%
Open Punctuation
ValueCountFrequency (%)
( 50
100.0%
Other Punctuation
ValueCountFrequency (%)
, 16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3537
64.6%
Common 1916
35.0%
Latin 18
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
319
 
9.0%
310
 
8.8%
302
 
8.5%
301
 
8.5%
299
 
8.5%
297
 
8.4%
297
 
8.4%
295
 
8.3%
86
 
2.4%
81
 
2.3%
Other values (121) 950
26.9%
Common
ValueCountFrequency (%)
941
49.1%
1 165
 
8.6%
2 137
 
7.2%
3 102
 
5.3%
4 75
 
3.9%
0 74
 
3.9%
5 66
 
3.4%
6 65
 
3.4%
7 59
 
3.1%
9 56
 
2.9%
Other values (5) 176
 
9.2%
Latin
ValueCountFrequency (%)
S 4
22.2%
K 2
11.1%
T 2
11.1%
L 1
 
5.6%
U 1
 
5.6%
D 1
 
5.6%
I 1
 
5.6%
O 1
 
5.6%
W 1
 
5.6%
G 1
 
5.6%
Other values (3) 3
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3537
64.6%
ASCII 1934
35.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
941
48.7%
1 165
 
8.5%
2 137
 
7.1%
3 102
 
5.3%
4 75
 
3.9%
0 74
 
3.8%
5 66
 
3.4%
6 65
 
3.4%
7 59
 
3.1%
9 56
 
2.9%
Other values (18) 194
 
10.0%
Hangul
ValueCountFrequency (%)
319
 
9.0%
310
 
8.8%
302
 
8.5%
301
 
8.5%
299
 
8.5%
297
 
8.4%
297
 
8.4%
295
 
8.3%
86
 
2.4%
81
 
2.3%
Other values (121) 950
26.9%

Interactions

2023-12-12T23:05:45.799013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:05:48.517792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설군
연번1.0000.880
시설군0.8801.000
2023-12-12T23:05:48.617496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설군
연번1.0000.567
시설군0.5671.000

Missing values

2023-12-12T23:05:45.929830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:05:46.041602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번자치구시설군시설명주소
01서울특별시 중구지하역사서울역(1호선)서울특별시 중구 세종대로 지하2(남대문로5가)
12서울특별시 중구지하역사시청역(1호선)서울특별시 중구 세종대로 지하101(정동)
23서울특별시 중구지하역사시청역(2호선)서울특별시 중구 서소문로 지하131(서소문동)
34서울특별시 중구지하역사을지로입구(2호선)서울특별시 중구 을지로 지하42
45서울특별시 중구지하역사을지로3가(2호선)서울특별시 중구 을지로 지하106
56서울특별시 중구지하역사을지로4가(2호선)서울특별시 중구 을지로 지하178
67서울특별시 중구지하역사동대문역사문화공원역(2호선)서울특별시 중구 을지로 지하279
78서울특별시 중구지하역사신당역(2호선)서울특별시 중구 퇴계로 지하431-1
89서울특별시 중구지하역사을지로3가(3호선)서울특별시 중구 을지로 지하129
910서울특별시 중구지하역사충무로(3호선)서울특별시 중구 퇴계로 지하199
연번자치구시설군시설명주소
289290서울특별시 중구목욕장황금스파서울특별시 중구 청계천로 400
290291서울특별시 중구미술관서울시립미술관서울특별시 중구 덕수궁길 61(서소문동)
291292서울특별시 중구미술관덕수궁미술관서울특별시 중구 세종대로 99
292293서울특별시 중구학원종로학원서울특별시 중구 청파로 456
293294서울특별시 중구전시시설동대문디자인플라자서울시 중구 을지로 281
294295서울특별시 중구박물관한국은행 화폐박물관서울특별시 중구 남대문로 39 (남대문로3가, 한국은행)
295296서울특별시 중구박물관이화백주년기념관(박물관)서울특별시 중구 정동길 26
296297서울특별시 중구노인요양시설구립중구노인요양센터서울특별시 중구 필동로 96
297298서울특별시 중구산후조리원제일병원(산후조리원)서울특별시 중구 서애로1길 17(묵정동)
298299서울특별시 중구도서관서울도서관서울특별시 중구 세종대로 110