Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows41
Duplicate rows (%)0.4%
Total size in memory468.8 KiB
Average record size in memory48.0 B

Variable types

Categorical2
Text3

Dataset

Description한국환경공단에서 운영하는 실내공기질 관리 종합정보망에 등록된 다중이용시설 연도별 대상 목록 정보를 제공합니다.
Author한국환경공단
URLhttps://www.data.go.kr/data/15093402/fileData.do

Alerts

Dataset has 41 (0.4%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-23 04:18:47.439427
Analysis finished2024-03-23 04:18:52.088438
Duration4.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도
Categorical

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도
2729 
서울특별시
2137 
인천광역시
611 
부산광역시
608 
경상남도
592 
Other values (12)
3323 

Length

Max length7
Median length5
Mean length4.235
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row경상남도
3rd row경기도
4th row경기도
5th row서울특별시

Common Values

ValueCountFrequency (%)
경기도 2729
27.3%
서울특별시 2137
21.4%
인천광역시 611
 
6.1%
부산광역시 608
 
6.1%
경상남도 592
 
5.9%
대구광역시 386
 
3.9%
경상북도 374
 
3.7%
전라남도 349
 
3.5%
충청남도 345
 
3.5%
전라북도 326
 
3.3%
Other values (7) 1543
15.4%

Length

2024-03-23T04:18:52.481043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 2729
27.3%
서울특별시 2137
21.4%
인천광역시 611
 
6.1%
부산광역시 608
 
6.1%
경상남도 592
 
5.9%
대구광역시 386
 
3.9%
경상북도 374
 
3.7%
전라남도 349
 
3.5%
충청남도 345
 
3.5%
전라북도 326
 
3.3%
Other values (7) 1543
15.4%
Distinct212
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-23T04:18:53.320903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length3
Mean length3.0737
Min length2

Characters and Unicode

Total characters30737
Distinct characters139
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row마포구
2nd row거제시
3rd row고양시
4th row성남시
5th row강남구
ValueCountFrequency (%)
성남시 309
 
3.1%
중구 301
 
3.0%
서구 287
 
2.9%
고양시 279
 
2.8%
강남구 258
 
2.6%
북구 237
 
2.4%
수원시 227
 
2.3%
용인시 224
 
2.2%
창원시 209
 
2.1%
부천시 176
 
1.8%
Other values (202) 7493
74.9%
2024-03-23T04:18:54.706436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5023
 
16.3%
4594
 
14.9%
1095
 
3.6%
1067
 
3.5%
899
 
2.9%
860
 
2.8%
789
 
2.6%
755
 
2.5%
743
 
2.4%
710
 
2.3%
Other values (129) 14202
46.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30737
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5023
 
16.3%
4594
 
14.9%
1095
 
3.6%
1067
 
3.5%
899
 
2.9%
860
 
2.8%
789
 
2.6%
755
 
2.5%
743
 
2.4%
710
 
2.3%
Other values (129) 14202
46.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 30737
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5023
 
16.3%
4594
 
14.9%
1095
 
3.6%
1067
 
3.5%
899
 
2.9%
860
 
2.8%
789
 
2.6%
755
 
2.5%
743
 
2.4%
710
 
2.3%
Other values (129) 14202
46.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 30737
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5023
 
16.3%
4594
 
14.9%
1095
 
3.6%
1067
 
3.5%
899
 
2.9%
860
 
2.8%
789
 
2.6%
755
 
2.5%
743
 
2.4%
710
 
2.3%
Other values (129) 14202
46.2%
Distinct9245
Distinct (%)92.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-23T04:18:55.565284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length29
Mean length8.0088
Min length2

Characters and Unicode

Total characters80088
Distinct characters846
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8697 ?
Unique (%)87.0%

Sample

1st row홈플러스월드컵몰점
2nd row롯데마트거제점
3rd row현대프라자
4th row성남시노인보건센터의원
5th row와이비엠어학원
ValueCountFrequency (%)
pc 99
 
0.8%
pc방 87
 
0.7%
어린이집 87
 
0.7%
롯데시네마 54
 
0.4%
이마트 51
 
0.4%
홈플러스 50
 
0.4%
의료법인 43
 
0.4%
롯데마트 40
 
0.3%
cgv 33
 
0.3%
공영주차장 32
 
0.3%
Other values (9901) 11693
95.3%
2024-03-23T04:18:57.056957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3395
 
4.2%
2848
 
3.6%
2596
 
3.2%
2447
 
3.1%
2408
 
3.0%
2304
 
2.9%
1468
 
1.8%
1393
 
1.7%
1374
 
1.7%
1127
 
1.4%
Other values (836) 58728
73.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 71775
89.6%
Uppercase Letter 3037
 
3.8%
Space Separator 2304
 
2.9%
Decimal Number 791
 
1.0%
Open Punctuation 733
 
0.9%
Close Punctuation 732
 
0.9%
Lowercase Letter 434
 
0.5%
Other Symbol 131
 
0.2%
Other Punctuation 85
 
0.1%
Dash Punctuation 48
 
0.1%
Other values (3) 18
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3395
 
4.7%
2848
 
4.0%
2596
 
3.6%
2447
 
3.4%
2408
 
3.4%
1468
 
2.0%
1393
 
1.9%
1374
 
1.9%
1127
 
1.6%
962
 
1.3%
Other values (754) 51757
72.1%
Uppercase Letter
ValueCountFrequency (%)
C 770
25.4%
P 643
21.2%
G 139
 
4.6%
S 132
 
4.3%
A 127
 
4.2%
E 115
 
3.8%
K 114
 
3.8%
T 106
 
3.5%
O 102
 
3.4%
V 96
 
3.2%
Other values (16) 693
22.8%
Lowercase Letter
ValueCountFrequency (%)
p 70
16.1%
c 66
15.2%
e 59
13.6%
r 30
 
6.9%
a 27
 
6.2%
o 26
 
6.0%
s 21
 
4.8%
n 18
 
4.1%
t 17
 
3.9%
u 12
 
2.8%
Other values (15) 88
20.3%
Decimal Number
ValueCountFrequency (%)
2 230
29.1%
1 198
25.0%
3 104
13.1%
4 56
 
7.1%
5 56
 
7.1%
9 38
 
4.8%
6 34
 
4.3%
7 28
 
3.5%
8 24
 
3.0%
0 23
 
2.9%
Other Punctuation
ValueCountFrequency (%)
& 34
40.0%
. 30
35.3%
, 11
 
12.9%
: 3
 
3.5%
' 3
 
3.5%
1
 
1.2%
/ 1
 
1.2%
% 1
 
1.2%
! 1
 
1.2%
Open Punctuation
ValueCountFrequency (%)
( 729
99.5%
[ 4
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 728
99.5%
] 4
 
0.5%
Letter Number
ValueCountFrequency (%)
9
81.8%
2
 
18.2%
Math Symbol
ValueCountFrequency (%)
~ 4
66.7%
+ 2
33.3%
Space Separator
ValueCountFrequency (%)
2304
100.0%
Other Symbol
ValueCountFrequency (%)
131
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 48
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 71906
89.8%
Common 4700
 
5.9%
Latin 3482
 
4.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3395
 
4.7%
2848
 
4.0%
2596
 
3.6%
2447
 
3.4%
2408
 
3.3%
1468
 
2.0%
1393
 
1.9%
1374
 
1.9%
1127
 
1.6%
962
 
1.3%
Other values (755) 51888
72.2%
Latin
ValueCountFrequency (%)
C 770
22.1%
P 643
18.5%
G 139
 
4.0%
S 132
 
3.8%
A 127
 
3.6%
E 115
 
3.3%
K 114
 
3.3%
T 106
 
3.0%
O 102
 
2.9%
V 96
 
2.8%
Other values (43) 1138
32.7%
Common
ValueCountFrequency (%)
2304
49.0%
( 729
 
15.5%
) 728
 
15.5%
2 230
 
4.9%
1 198
 
4.2%
3 104
 
2.2%
4 56
 
1.2%
5 56
 
1.2%
- 48
 
1.0%
9 38
 
0.8%
Other values (18) 209
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 71775
89.6%
ASCII 8170
 
10.2%
None 132
 
0.2%
Number Forms 11
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3395
 
4.7%
2848
 
4.0%
2596
 
3.6%
2447
 
3.4%
2408
 
3.4%
1468
 
2.0%
1393
 
1.9%
1374
 
1.9%
1127
 
1.6%
962
 
1.3%
Other values (754) 51757
72.1%
ASCII
ValueCountFrequency (%)
2304
28.2%
C 770
 
9.4%
( 729
 
8.9%
) 728
 
8.9%
P 643
 
7.9%
2 230
 
2.8%
1 198
 
2.4%
G 139
 
1.7%
S 132
 
1.6%
A 127
 
1.6%
Other values (68) 2170
26.6%
None
ValueCountFrequency (%)
131
99.2%
1
 
0.8%
Number Forms
ValueCountFrequency (%)
9
81.8%
2
 
18.2%
Distinct8716
Distinct (%)87.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-23T04:18:58.060174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters100000
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8250 ?
Unique (%)82.5%

Sample

1st row2208160348
2nd row6128516490
3rd row1288020927
4th row1298283458
5th row2149086331
ValueCountFrequency (%)
2068650913 86
 
0.9%
2208160348 64
 
0.6%
2188200999 55
 
0.5%
1048145690 46
 
0.5%
6058209189 45
 
0.4%
3148210024 41
 
0.4%
1111111111 32
 
0.3%
2158605260 30
 
0.3%
1148201319 26
 
0.3%
5148206760 24
 
0.2%
Other values (8706) 9551
95.5%
2024-03-23T04:18:59.619021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 17567
17.6%
1 15038
15.0%
8 13135
13.1%
2 12324
12.3%
3 8055
8.1%
6 7703
7.7%
4 7267
7.3%
5 7064
7.1%
9 6011
 
6.0%
7 5827
 
5.8%
Other values (2) 9
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 99991
> 99.9%
Dash Punctuation 8
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 17567
17.6%
1 15038
15.0%
8 13135
13.1%
2 12324
12.3%
3 8055
8.1%
6 7703
7.7%
4 7267
7.3%
5 7064
7.1%
9 6011
 
6.0%
7 5827
 
5.8%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 17567
17.6%
1 15038
15.0%
8 13135
13.1%
2 12324
12.3%
3 8055
8.1%
6 7703
7.7%
4 7267
7.3%
5 7064
7.1%
9 6011
 
6.0%
7 5827
 
5.8%
Other values (2) 9
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 17567
17.6%
1 15038
15.0%
8 13135
13.1%
2 12324
12.3%
3 8055
8.1%
6 7703
7.7%
4 7267
7.3%
5 7064
7.1%
9 6011
 
6.0%
7 5827
 
5.8%
Other values (2) 9
 
< 0.1%

시설군
Categorical

Distinct27
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
실내주차장
2541 
어린이집
2380 
의료기관
1365 
노인요양시설
792 
PC영업시설
653 
Other values (22)
2269 

Length

Max length14
Median length9
Mean length4.6143
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row대규모점포
2nd row대규모점포
3rd row실내주차장
4th row노인요양시설
5th row학원

Common Values

ValueCountFrequency (%)
실내주차장 2541
25.4%
어린이집 2380
23.8%
의료기관 1365
13.7%
노인요양시설 792
 
7.9%
PC영업시설 653
 
6.5%
대규모점포 547
 
5.5%
목욕장 365
 
3.6%
지하역사 277
 
2.8%
영화상영관 205
 
2.1%
산후조리원 169
 
1.7%
Other values (17) 706
 
7.1%

Length

2024-03-23T04:19:00.357688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
실내주차장 2541
25.4%
어린이집 2380
23.8%
의료기관 1365
13.7%
노인요양시설 792
 
7.9%
pc영업시설 653
 
6.5%
대규모점포 547
 
5.5%
목욕장 365
 
3.6%
지하역사 277
 
2.8%
영화상영관 205
 
2.1%
산후조리원 169
 
1.7%
Other values (17) 706
 
7.1%

Correlations

2024-03-23T04:19:00.743880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도시설군
시도1.0000.376
시설군0.3761.000
2024-03-23T04:19:01.070091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설군시도
시설군1.0000.116
시도0.1161.000
2024-03-23T04:19:01.360133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도시설군
시도1.0000.116
시설군0.1161.000

Missing values

2024-03-23T04:18:50.821039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T04:18:51.484627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도시군구시설명사업자등록번호시설군
15850서울특별시마포구홈플러스월드컵몰점2208160348대규모점포
7170경상남도거제시롯데마트거제점6128516490대규모점포
1259경기도고양시현대프라자1288020927실내주차장
3078경기도성남시성남시노인보건센터의원1298283458노인요양시설
13624서울특별시강남구와이비엠어학원2149086331학원
11624대전광역시유성구플랜트치과의원3142766193의료기관
15757서울특별시마포구스카이어린이집1018011082어린이집
14103서울특별시강서구NC강서점1098536893대규모점포
23004충청남도천안시롯데쇼핑(주)롯데마트 성정점3128526370실내주차장
13435서울특별시강남구바나나pc카페6561601138PC영업시설
시도시군구시설명사업자등록번호시설군
13218서울특별시강남구PMK빌딩2200761371실내주차장
18261서울특별시중구혜양엘리시움2038148728대규모점포
8987경상북도안동시홈플러스 안동점2208160348실내주차장
4543경기도안산시의료법인호원의료재단 호원요양병원1348209654의료기관
21067전라남도순천시현대제철순천어린이집4828002124어린이집
19050울산광역시중구롯데시네마 울산성남점6208520622영화상영관
18011서울특별시중구농협중앙회 본관주차장1048207072실내주차장
21362전라북도고창군고창효자노인병원4048205713의료기관
590강원도횡성군중앙어린이집2248002168어린이집
7454경상남도김해시한림어린이집6158027481어린이집

Duplicate rows

Most frequently occurring

시도시군구시설명사업자등록번호시설군# duplicates
3경기도고양시라페스타1288270246대규모점포3
4경기도고양시라페스타1288270246실내주차장3
12경기도안산시서해선 지하역사(소사원시)7328100969지하역사3
25인천광역시남동구길의료재단1398200376실내주차장3
31충청북도진천군덕산하나어린이집3018006253어린이집3
0강원도속초시기쁜어린이집1368260453어린이집2
1경기도고양시대방트리플라온1288114567실내주차장2
2경기도고양시동문굿모닝오피스텔2차2058260399실내주차장2
5경기도과천시과천역3148210024지하역사2
6경기도과천시시립부림어린이집1388902268어린이집2