Overview

Dataset statistics

Number of variables5
Number of observations366
Missing cells21
Missing cells (%)1.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.8 KiB
Average record size in memory41.4 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description실내공기질 관리법의 적용대상 영등포구 다중이용시설의 목록입니다. (시설군, 시설명, 주소, 전화번호 등 포함)
URLhttps://www.data.go.kr/data/15035088/fileData.do

Alerts

연번 is highly overall correlated with 시설군High correlation
시설군 is highly overall correlated with 연번High correlation
전화번호 has 21 (5.7%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:51:59.614220
Analysis finished2023-12-12 05:52:00.340662
Duration0.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct366
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean183.5
Minimum1
Maximum366
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2023-12-12T14:52:00.416858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile19.25
Q192.25
median183.5
Q3274.75
95-th percentile347.75
Maximum366
Range365
Interquartile range (IQR)182.5

Descriptive statistics

Standard deviation105.79934
Coefficient of variation (CV)0.57656315
Kurtosis-1.2
Mean183.5
Median Absolute Deviation (MAD)91.5
Skewness0
Sum67161
Variance11193.5
MonotonicityStrictly increasing
2023-12-12T14:52:00.845283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
253 1
 
0.3%
251 1
 
0.3%
250 1
 
0.3%
249 1
 
0.3%
248 1
 
0.3%
247 1
 
0.3%
246 1
 
0.3%
245 1
 
0.3%
244 1
 
0.3%
Other values (356) 356
97.3%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
366 1
0.3%
365 1
0.3%
364 1
0.3%
363 1
0.3%
362 1
0.3%
361 1
0.3%
360 1
0.3%
359 1
0.3%
358 1
0.3%
357 1
0.3%

시설군
Categorical

HIGH CORRELATION 

Distinct18
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
실내주차장
204 
어린이집
48 
의료기관
37 
대규모점포
21 
지하역사
 
17
Other values (13)
39 

Length

Max length7
Median length5
Mean length4.6147541
Min length2

Unique

Unique5 ?
Unique (%)1.4%

Sample

1st row의료기관
2nd row의료기관
3rd row의료기관
4th row의료기관
5th row의료기관

Common Values

ValueCountFrequency (%)
실내주차장 204
55.7%
어린이집 48
 
13.1%
의료기관 37
 
10.1%
대규모점포 21
 
5.7%
지하역사 17
 
4.6%
PC방 10
 
2.7%
지하도상가 5
 
1.4%
목욕장 5
 
1.4%
산후조리원 4
 
1.1%
목욕장업 3
 
0.8%
Other values (8) 12
 
3.3%

Length

2023-12-12T14:52:00.966928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
실내주차장 204
55.7%
어린이집 48
 
13.1%
의료기관 37
 
10.1%
대규모점포 21
 
5.7%
지하역사 17
 
4.6%
pc방 10
 
2.7%
지하도상가 5
 
1.4%
목욕장 5
 
1.4%
산후조리원 4
 
1.1%
영화상영관 3
 
0.8%
Other values (8) 12
 
3.3%
Distinct356
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-12T14:52:01.242521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length21
Mean length9.3306011
Min length3

Characters and Unicode

Total characters3415
Distinct characters389
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique346 ?
Unique (%)94.5%

Sample

1st row의료법인인봉의료재단 영등포병원
2nd row대림요양병원
3rd row초록나무요양병원
4th row명지성모병원
5th row의료법인 춘혜의료재단 명지춘혜병원 본관
ValueCountFrequency (%)
영등포점 10
 
1.8%
어린이집 10
 
1.8%
공영주차장 7
 
1.2%
여의도 6
 
1.1%
영등포 6
 
1.1%
오피스텔 5
 
0.9%
9호선 5
 
0.9%
문래 4
 
0.7%
5호선 4
 
0.7%
의료법인 4
 
0.7%
Other values (451) 499
89.1%
2023-12-12T14:52:01.723738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
222
 
6.5%
92
 
2.7%
66
 
1.9%
64
 
1.9%
64
 
1.9%
59
 
1.7%
( 54
 
1.6%
) 54
 
1.6%
53
 
1.6%
51
 
1.5%
Other values (379) 2636
77.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2817
82.5%
Space Separator 222
 
6.5%
Uppercase Letter 153
 
4.5%
Decimal Number 71
 
2.1%
Open Punctuation 55
 
1.6%
Close Punctuation 55
 
1.6%
Lowercase Letter 22
 
0.6%
Other Punctuation 12
 
0.4%
Other Symbol 7
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
92
 
3.3%
66
 
2.3%
64
 
2.3%
64
 
2.3%
59
 
2.1%
53
 
1.9%
51
 
1.8%
50
 
1.8%
49
 
1.7%
48
 
1.7%
Other values (324) 2221
78.8%
Uppercase Letter
ValueCountFrequency (%)
K 18
11.8%
P 17
11.1%
C 13
 
8.5%
B 13
 
8.5%
S 13
 
8.5%
G 11
 
7.2%
T 11
 
7.2%
V 6
 
3.9%
N 6
 
3.9%
I 6
 
3.9%
Other values (14) 39
25.5%
Lowercase Letter
ValueCountFrequency (%)
e 5
22.7%
o 3
13.6%
r 3
13.6%
w 2
 
9.1%
p 2
 
9.1%
l 2
 
9.1%
h 1
 
4.5%
t 1
 
4.5%
n 1
 
4.5%
c 1
 
4.5%
Decimal Number
ValueCountFrequency (%)
2 17
23.9%
1 15
21.1%
3 13
18.3%
5 9
12.7%
9 6
 
8.5%
8 3
 
4.2%
7 3
 
4.2%
4 3
 
4.2%
6 2
 
2.8%
Other Punctuation
ValueCountFrequency (%)
: 4
33.3%
/ 4
33.3%
& 2
16.7%
, 2
16.7%
Open Punctuation
ValueCountFrequency (%)
( 54
98.2%
[ 1
 
1.8%
Close Punctuation
ValueCountFrequency (%)
) 54
98.2%
] 1
 
1.8%
Space Separator
ValueCountFrequency (%)
222
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2824
82.7%
Common 416
 
12.2%
Latin 175
 
5.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
92
 
3.3%
66
 
2.3%
64
 
2.3%
64
 
2.3%
59
 
2.1%
53
 
1.9%
51
 
1.8%
50
 
1.8%
49
 
1.7%
48
 
1.7%
Other values (325) 2228
78.9%
Latin
ValueCountFrequency (%)
K 18
 
10.3%
P 17
 
9.7%
C 13
 
7.4%
B 13
 
7.4%
S 13
 
7.4%
G 11
 
6.3%
T 11
 
6.3%
V 6
 
3.4%
N 6
 
3.4%
I 6
 
3.4%
Other values (25) 61
34.9%
Common
ValueCountFrequency (%)
222
53.4%
( 54
 
13.0%
) 54
 
13.0%
2 17
 
4.1%
1 15
 
3.6%
3 13
 
3.1%
5 9
 
2.2%
9 6
 
1.4%
: 4
 
1.0%
/ 4
 
1.0%
Other values (9) 18
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2817
82.5%
ASCII 591
 
17.3%
None 7
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
222
37.6%
( 54
 
9.1%
) 54
 
9.1%
K 18
 
3.0%
P 17
 
2.9%
2 17
 
2.9%
1 15
 
2.5%
C 13
 
2.2%
3 13
 
2.2%
B 13
 
2.2%
Other values (44) 155
26.2%
Hangul
ValueCountFrequency (%)
92
 
3.3%
66
 
2.3%
64
 
2.3%
64
 
2.3%
59
 
2.1%
53
 
1.9%
51
 
1.8%
50
 
1.8%
49
 
1.7%
48
 
1.7%
Other values (324) 2221
78.8%
None
ValueCountFrequency (%)
7
100.0%

주소
Text

Distinct340
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-12T14:52:01.977252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length49
Mean length28.010929
Min length19

Characters and Unicode

Total characters10252
Distinct characters153
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique322 ?
Unique (%)88.0%

Sample

1st row서울특별시 영등포구 당산로31길 10 (당산동3가)
2nd row서울특별시 영등포구 가마산로 381 (대림동)
3rd row서울특별시 영등포구 가마산로 368(대림동)
4th row서울특별시 영등포구 도림로 154, 1층/ 도림로 156/ 도림로 158, 3-4층/ 대림로23길 8, 1층(대림동)
5th row서울특별시 영등포구 대림로 223 (대림동)
ValueCountFrequency (%)
영등포구 388
21.5%
서울특별시 378
21.0%
여의도동 86
 
4.8%
의사당대로 25
 
1.4%
영중로 23
 
1.3%
신길동 21
 
1.2%
선유로 17
 
0.9%
영등포로 16
 
0.9%
여의대로 16
 
0.9%
10 14
 
0.8%
Other values (462) 819
45.4%
2023-12-12T14:52:02.373928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1450
 
14.1%
501
 
4.9%
468
 
4.6%
468
 
4.6%
388
 
3.8%
388
 
3.8%
386
 
3.8%
385
 
3.8%
378
 
3.7%
378
 
3.7%
Other values (143) 5062
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6691
65.3%
Space Separator 1450
 
14.1%
Decimal Number 1285
 
12.5%
Close Punctuation 360
 
3.5%
Open Punctuation 360
 
3.5%
Other Punctuation 57
 
0.6%
Dash Punctuation 23
 
0.2%
Uppercase Letter 22
 
0.2%
Math Symbol 3
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
501
 
7.5%
468
 
7.0%
468
 
7.0%
388
 
5.8%
388
 
5.8%
386
 
5.8%
385
 
5.8%
378
 
5.6%
378
 
5.6%
375
 
5.6%
Other values (112) 2576
38.5%
Uppercase Letter
ValueCountFrequency (%)
B 6
27.3%
S 4
18.2%
L 2
 
9.1%
A 2
 
9.1%
R 1
 
4.5%
P 1
 
4.5%
E 1
 
4.5%
H 1
 
4.5%
T 1
 
4.5%
C 1
 
4.5%
Other values (2) 2
 
9.1%
Decimal Number
ValueCountFrequency (%)
1 261
20.3%
3 181
14.1%
2 161
12.5%
6 126
9.8%
4 122
9.5%
5 122
9.5%
8 97
 
7.5%
7 84
 
6.5%
0 83
 
6.5%
9 48
 
3.7%
Other Punctuation
ValueCountFrequency (%)
, 50
87.7%
/ 6
 
10.5%
· 1
 
1.8%
Space Separator
ValueCountFrequency (%)
1450
100.0%
Close Punctuation
ValueCountFrequency (%)
) 360
100.0%
Open Punctuation
ValueCountFrequency (%)
( 360
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 23
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6691
65.3%
Common 3538
34.5%
Latin 23
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
501
 
7.5%
468
 
7.0%
468
 
7.0%
388
 
5.8%
388
 
5.8%
386
 
5.8%
385
 
5.8%
378
 
5.6%
378
 
5.6%
375
 
5.6%
Other values (112) 2576
38.5%
Common
ValueCountFrequency (%)
1450
41.0%
) 360
 
10.2%
( 360
 
10.2%
1 261
 
7.4%
3 181
 
5.1%
2 161
 
4.6%
6 126
 
3.6%
4 122
 
3.4%
5 122
 
3.4%
8 97
 
2.7%
Other values (8) 298
 
8.4%
Latin
ValueCountFrequency (%)
B 6
26.1%
S 4
17.4%
L 2
 
8.7%
A 2
 
8.7%
R 1
 
4.3%
P 1
 
4.3%
E 1
 
4.3%
H 1
 
4.3%
T 1
 
4.3%
b 1
 
4.3%
Other values (3) 3
13.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6691
65.3%
ASCII 3560
34.7%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1450
40.7%
) 360
 
10.1%
( 360
 
10.1%
1 261
 
7.3%
3 181
 
5.1%
2 161
 
4.5%
6 126
 
3.5%
4 122
 
3.4%
5 122
 
3.4%
8 97
 
2.7%
Other values (20) 320
 
9.0%
Hangul
ValueCountFrequency (%)
501
 
7.5%
468
 
7.0%
468
 
7.0%
388
 
5.8%
388
 
5.8%
386
 
5.8%
385
 
5.8%
378
 
5.6%
378
 
5.6%
375
 
5.6%
Other values (112) 2576
38.5%
None
ValueCountFrequency (%)
· 1
100.0%

전화번호
Text

MISSING 

Distinct307
Distinct (%)89.0%
Missing21
Missing (%)5.7%
Memory size3.0 KiB
2023-12-12T14:52:02.603191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length11.713043
Min length11

Characters and Unicode

Total characters4041
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique283 ?
Unique (%)82.0%

Sample

1st row02-2632-0013
2nd row02-836-7575
3rd row02-2135-7578
4th row02-829-7876
5th row02-3284-7797
ValueCountFrequency (%)
02-2650-1454 9
 
2.6%
02-6788-2119 5
 
1.4%
02-6265-7789 3
 
0.9%
02-6181-5644 3
 
0.9%
02-3150-7991 3
 
0.9%
02-2290-6561 3
 
0.9%
02-2081-8160 2
 
0.6%
02-2634-7654 2
 
0.6%
02-2631-3671 2
 
0.6%
02-6446-8105 2
 
0.6%
Other values (297) 311
90.1%
2023-12-12T14:52:02.985926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 693
17.1%
2 682
16.9%
0 646
16.0%
6 333
8.2%
1 331
8.2%
7 289
7.2%
3 268
 
6.6%
8 249
 
6.2%
5 207
 
5.1%
9 172
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3348
82.9%
Dash Punctuation 693
 
17.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 682
20.4%
0 646
19.3%
6 333
9.9%
1 331
9.9%
7 289
8.6%
3 268
 
8.0%
8 249
 
7.4%
5 207
 
6.2%
9 172
 
5.1%
4 171
 
5.1%
Dash Punctuation
ValueCountFrequency (%)
- 693
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4041
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 693
17.1%
2 682
16.9%
0 646
16.0%
6 333
8.2%
1 331
8.2%
7 289
7.2%
3 268
 
6.6%
8 249
 
6.2%
5 207
 
5.1%
9 172
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4041
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 693
17.1%
2 682
16.9%
0 646
16.0%
6 333
8.2%
1 331
8.2%
7 289
7.2%
3 268
 
6.6%
8 249
 
6.2%
5 207
 
5.1%
9 172
 
4.3%

Interactions

2023-12-12T14:52:00.007071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:52:03.081219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설군
연번1.0000.835
시설군0.8351.000
2023-12-12T14:52:03.163324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설군
연번1.0000.511
시설군0.5111.000

Missing values

2023-12-12T14:52:00.192211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:52:00.295960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시설군시설명주소전화번호
01의료기관의료법인인봉의료재단 영등포병원서울특별시 영등포구 당산로31길 10 (당산동3가)02-2632-0013
12의료기관대림요양병원서울특별시 영등포구 가마산로 381 (대림동)02-836-7575
23의료기관초록나무요양병원서울특별시 영등포구 가마산로 368(대림동)02-2135-7578
34의료기관명지성모병원서울특별시 영등포구 도림로 154, 1층/ 도림로 156/ 도림로 158, 3-4층/ 대림로23길 8, 1층(대림동)02-829-7876
45의료기관의료법인 춘혜의료재단 명지춘혜병원 본관서울특별시 영등포구 대림로 223 (대림동)02-3284-7797
56의료기관의료법인 춘혜의료재단 명지춘혜병원 엘드림센터서울특별시 영등포구 대림로39길 3(대림동)02-3284-7797
67의료기관한림대학교 강남성심병원(본관, 신관)서울특별시 영등포구 신길로 1 (대림동)02-829-5045
78의료기관한림대학교 강남성심병원(별관)서울특별시 영등포구 신길로 19-202-829-5045
89의료기관대림성모병원서울특별시 영등포구 시흥대로 657 / 시흥대로183길 7-8(대림동)02-829-9213
910의료기관로하스참사랑요양병원서울특별시 영등포구 경인로77길 6 (문래동2가)02-2634-7500
연번시설군시설명주소전화번호
356357대규모점포포레나당산동역세권청년주택서울특별시 영등포구 당산로 83(당산동2가)02-585-0580
357358PC방THE 1975서울특별시 영등포구 영등포로 247(영등포동2가), B01호<NA>
358359PC방NEOX PC 도림사거리점서울특별시 영등포구 도림로 331(도림동), 2층<NA>
359360목욕장업씨랄라서울특별시 영등포구 문래로 164(문래동3가)02-1522-9661
360361목욕장업여의도메리어트호텔수피트니스 앤 스파서울특별시 영등포구 여의대로 8(여의도동) 여의도파크센터 B2층02-2090-8023
361362목욕장업더파크 엠 사우나서울특별시 영등포구 선유동1로 50 THE PARK 365 지하1층02-2069-2229
362363학원밀당피티(PT)원격학원서울특별시 영등포구 여의대로 108(여의도동), 38층, 39층02-743-8838
363364실내주차장생각공장 당산서울특별시 영등포구 영등포로 150(당산동1가)02-2068-1123
364365실내주차장포레나당산동역세권청년주택서울특별시 영등포구 당산동2가 45-502-585-0580
365366실내주차장씨에스로직스 복합시설서울특별시 영등포구 당산로37길 16(당산동4가)02-3459-9363