Overview

Dataset statistics

Number of variables: 9
Number of observations: 291
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 20.6 KiB
Average record size in memory: 72.5 B
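
The average record size follows from the two figures above: total size in memory divided by the number of observations. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes (the function name is illustrative, not part of the profiler):

```python
KIB = 1024  # bytes per kibibyte (assumed unit for "KiB")

def average_record_size(total_kib: float, n_rows: int) -> float:
    """Return the mean in-memory size of one row, in bytes."""
    return total_kib * KIB / n_rows

# Values taken from the dataset statistics above.
avg = average_record_size(20.6, 291)
print(f"{avg:.1f} B")
```

The result agrees with the reported 72.5 B to one decimal place.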

Variable types

Text: 2
Categorical: 6
DateTime: 1

Dataset

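Description: CSV file data provided by the Korea Authority of Land and Infrastructure Safety (국토안전관리원) on safety-vulnerable facilities (grade D or E) among public facilities (excluding multi-unit housing). It includes the most recent D/E grade determination date, the next safety inspection date, whether use is restricted, and whether residents have been notified.
Author: 공공데이터포털 (Public Data Portal)
URL: https://www.data.go.kr/data/15084221/fileData.do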

Alerts

High correlation: 시설물구분 is highly overall correlated with 시설물종류 and 1 other field
High correlation: 시설물종류 is highly overall correlated with 시설물구분 and 1 other field
High correlation: 종별 is highly overall correlated with 시설물구분 and 2 other fields
High correlation: 차기안전점검일 is highly overall correlated with 종별
Imbalance: 시설물구분 is highly imbalanced (52.5%)
Imbalance: 시설물종류 is highly imbalanced (55.5%)
Imbalance: 종별 is highly imbalanced (62.7%)
Imbalance: 차기안전점검일 is highly imbalanced (55.8%)
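
The imbalance percentages are consistent with a score of one minus the normalized Shannon entropy of the class counts. The sketch below is an assumption about how the figure can be derived, not a quotation of the profiler's code. It uses the value counts reported for 종별 (255, 33, 3) and 시설물구분 (192, 72, 9, 8, 4, 2, 2, 2) in the Variables section of this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H(p) / log(k) for class counts.

    The score is 0 for a uniform distribution and approaches 1 as a
    single class dominates. With fewer than two classes it returns 0.0
    (a convention chosen for this sketch).
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

jongbyeol_score = imbalance_score([255, 33, 3])                  # 종별
gubun_score = imbalance_score([192, 72, 9, 8, 4, 2, 2, 2])       # 시설물구분
print(f"종별: {jongbyeol_score:.1%}")
print(f"시설물구분: {gubun_score:.1%}")
```

Both values round to the percentages shown in the alerts (62.7% and 52.5%).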

Reproduction

Analysis started: 2024-04-18 04:18:34.629655
Analysis finished: 2024-04-18 04:18:36.700442
Duration: 2.07 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시설물명
Text

Distinct: 290
Distinct (%): 99.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 25
Median length: 22
Mean length: 6.0756014
Min length: 3

Characters and Unicode

Total characters: 1768
Distinct characters: 263
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 289
Unique (%): 99.3%

Sample

1st row: 화목양배수장
2nd row: 왕신저수지
3rd row: 금량교
4th row: 대곡교
5th row: 상라교

Value | Count | Frequency (%)
본관 | 6 | 1.6%
창고 | 6 | 1.6%
서계동 | 3 | 0.8%
옹벽 | 3 | 0.8%
본관동 | 3 | 0.8%
 | 2 | 0.5%
고덕중학교 | 2 | 0.5%
교사1동 | 2 | 0.5%
개운교 | 2 | 0.5%
본관교사동 | 2 | 0.5%
Other values (345) | 346 | 91.8%

Most occurring characters

ValueCountFrequency (%)
246
 
13.9%
86
 
4.9%
64
 
3.6%
( 42
 
2.4%
) 41
 
2.3%
1 40
 
2.3%
2 39
 
2.2%
35
 
2.0%
3 27
 
1.5%
26
 
1.5%
Other values (253) 1122
63.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1406 | 79.5%
Decimal Number | 156 | 8.8%
Space Separator | 86 | 4.9%
Open Punctuation | 44 | 2.5%
Close Punctuation | 43 | 2.4%
Uppercase Letter | 16 | 0.9%
Dash Punctuation | 11 | 0.6%
Other Punctuation | 3 | 0.2%
Math Symbol | 2 | 0.1%
Letter Number | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
246
 
17.5%
64
 
4.6%
35
 
2.5%
26
 
1.8%
24
 
1.7%
22
 
1.6%
22
 
1.6%
22
 
1.6%
21
 
1.5%
21
 
1.5%
Other values (226) 903
64.2%
Decimal Number
ValueCountFrequency (%)
1 40
25.6%
2 39
25.0%
3 27
17.3%
0 19
12.2%
4 9
 
5.8%
6 8
 
5.1%
8 5
 
3.2%
9 3
 
1.9%
5 3
 
1.9%
7 3
 
1.9%
Uppercase Letter
ValueCountFrequency (%)
B 5
31.2%
C 4
25.0%
D 2
 
12.5%
A 2
 
12.5%
L 1
 
6.2%
P 1
 
6.2%
U 1
 
6.2%
Open Punctuation
ValueCountFrequency (%)
( 42
95.5%
[ 2
 
4.5%
Close Punctuation
ValueCountFrequency (%)
) 41
95.3%
] 2
 
4.7%
Other Punctuation
ValueCountFrequency (%)
, 2
66.7%
. 1
33.3%
Space Separator
ValueCountFrequency (%)
86
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1406 | 79.5%
Common | 345 | 19.5%
Latin | 17 | 1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
246
 
17.5%
64
 
4.6%
35
 
2.5%
26
 
1.8%
24
 
1.7%
22
 
1.6%
22
 
1.6%
22
 
1.6%
21
 
1.5%
21
 
1.5%
Other values (226) 903
64.2%
Common
ValueCountFrequency (%)
86
24.9%
( 42
12.2%
) 41
11.9%
1 40
11.6%
2 39
11.3%
3 27
 
7.8%
0 19
 
5.5%
- 11
 
3.2%
4 9
 
2.6%
6 8
 
2.3%
Other values (9) 23
 
6.7%
Latin
ValueCountFrequency (%)
B 5
29.4%
C 4
23.5%
D 2
 
11.8%
A 2
 
11.8%
L 1
 
5.9%
P 1
 
5.9%
U 1
 
5.9%
1
 
5.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1406 | 79.5%
ASCII | 361 | 20.4%
Number Forms | 1 | 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
246
 
17.5%
64
 
4.6%
35
 
2.5%
26
 
1.8%
24
 
1.7%
22
 
1.6%
22
 
1.6%
22
 
1.6%
21
 
1.5%
21
 
1.5%
Other values (226) 903
64.2%
ASCII
ValueCountFrequency (%)
86
23.8%
( 42
11.6%
) 41
11.4%
1 40
11.1%
2 39
10.8%
3 27
 
7.5%
0 19
 
5.3%
- 11
 
3.0%
4 9
 
2.5%
6 8
 
2.2%
Other values (16) 39
10.8%
Number Forms
ValueCountFrequency (%)
1
100.0%

시설물구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 8
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 4
Median length: 2
Mean length: 2.2955326
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 하천
2nd row:
3rd row: 교량
4th row: 교량
5th row: 교량

Common Values

Value | Count | Frequency (%)
교량 | 192 | 66.0%
건축물 | 72 | 24.7%
하천 | 9 | 3.1%
절토사면 | 8 | 2.7%
기타 | 4 | 1.4%
 | 2 | 0.7%
항만 | 2 | 0.7%
옹벽 | 2 | 0.7%
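
Each frequency under Common Values is the value's count divided by the 291 observations. A minimal sketch of that arithmetic, using the 시설물구분 counts listed above. The entry whose label did not survive in this report is left out, so the counts here sum to 289 rather than 291; the helper name is illustrative.

```python
N_OBSERVATIONS = 291  # "Number of observations" from the overview

# Counts copied from the Common Values listing for 시설물구분.
counts = {
    "교량": 192,
    "건축물": 72,
    "하천": 9,
    "절토사면": 8,
    "기타": 4,
    "항만": 2,
    "옹벽": 2,
}

def frequency_pct(count, total=N_OBSERVATIONS):
    """Return the share of `count` in `total`, as a percentage with one decimal."""
    return round(100 * count / total, 1)

frequencies = {value: frequency_pct(count) for value, count in counts.items()}
for value, pct in frequencies.items():
    print(f"{value}: {pct}%")
```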

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
교량 | 192 | 66.0%
건축물 | 72 | 24.7%
하천 | 9 | 3.1%
절토사면 | 8 | 2.7%
기타 | 4 | 1.4%
 | 2 | 0.7%
항만 | 2 | 0.7%
옹벽 | 2 | 0.7%

시설물종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 17
Distinct (%): 5.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 12
Median length: 4
Mean length: 4.395189
Min length: 2

Unique

Unique: 8
Unique (%): 2.7%

Sample

1st row: 배수펌프장
2nd row: 용수전용댐
3rd row: 도로교량
4th row: 도로교량
5th row: 도로교량

Common Values

Value | Count | Frequency (%)
도로교량 | 190 | 65.3%
다중이용건축물 | 47 | 16.2%
기타 | 23 | 7.9%
도로사면 | 7 | 2.4%
배수펌프장 | 5 | 1.7%
대형건축물 | 5 | 1.7%
수문 및 통문 | 2 | 0.7%
용수전용댐 | 2 | 0.7%
육교 | 2 | 0.7%
하구둑 | 1 | 0.3%
Other values (7) | 7 | 2.4%

Length

[Figure: histogram of lengths of the category]

Value | Count | Frequency (%)
도로교량 | 190 | 64.0%
다중이용건축물 | 47 | 15.8%
기타 | 23 | 7.7%
도로사면 | 7 | 2.4%
배수펌프장 | 5 | 1.7%
대형건축물 | 5 | 1.7%
 | 3 | 1.0%
용수전용댐 | 2 | 0.7%
육교 | 2 | 0.7%
통문 | 2 | 0.7%
Other values (10) | 11 | 3.7%

종별
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2종
2nd row: 2종
3rd row: 3종
4th row: 3종
5th row: 3종

Common Values

Value | Count | Frequency (%)
3종 | 255 | 87.6%
2종 | 33 | 11.3%
1종 | 3 | 1.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
3종 | 255 | 87.6%
2종 | 33 | 11.3%
1종 | 3 | 1.0%

주소
Text

Distinct: 122
Distinct (%): 41.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 14
Median length: 8
Mean length: 8.9003436
Min length: 7

Characters and Unicode

Total characters: 2590
Distinct characters: 103
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 56
Unique (%): 19.2%

Sample

1st row: 경상남도 김해시
2nd row: 경상북도 경주시
3rd row: 경상북도 경주시
4th row: 경상북도 경주시
5th row: 경상북도 경주시

Value | Count | Frequency (%)
강원특별자치도 | 56 | 9.3%
경상북도 | 49 | 8.1%
전라남도 | 42 | 7.0%
충청남도 | 31 | 5.1%
홍천군 | 28 | 4.6%
경기도 | 26 | 4.3%
전라북도 | 21 | 3.5%
서울특별시 | 18 | 3.0%
경상남도 | 16 | 2.6%
충청북도 | 14 | 2.3%
Other values (127) | 303 | 50.2%

Most occurring characters

ValueCountFrequency (%)
313
 
12.1%
263
 
10.2%
166
 
6.4%
133
 
5.1%
103
 
4.0%
101
 
3.9%
88
 
3.4%
75
 
2.9%
75
 
2.9%
68
 
2.6%
Other values (93) 1205
46.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2277
87.9%
Space Separator 313
 
12.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
263
 
11.6%
166
 
7.3%
133
 
5.8%
103
 
4.5%
101
 
4.4%
88
 
3.9%
75
 
3.3%
75
 
3.3%
68
 
3.0%
68
 
3.0%
Other values (92) 1137
49.9%
Space Separator
ValueCountFrequency (%)
313
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2277
87.9%
Common 313
 
12.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
263
 
11.6%
166
 
7.3%
133
 
5.8%
103
 
4.5%
101
 
4.4%
88
 
3.9%
75
 
3.3%
75
 
3.3%
68
 
3.0%
68
 
3.0%
Other values (92) 1137
49.9%
Common
ValueCountFrequency (%)
313
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2277
87.9%
ASCII 313
 
12.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
313
100.0%
Hangul
ValueCountFrequency (%)
263
 
11.6%
166
 
7.3%
133
 
5.8%
103
 
4.5%
101
 
4.4%
88
 
3.9%
75
 
3.3%
75
 
3.3%
68
 
3.0%
68
 
3.0%
Other values (92) 1137
49.9%

DE등급판정일
DateTime

Distinct: 98
Distinct (%): 33.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB
Minimum: 2019-08-05 00:00:00
Maximum: 2023-03-27 00:00:00

[Figure: histogram with fixed size bins (bins=50)]

차기안전점검일
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 10
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 4
Unique (%): 1.4%

Sample

1st row: 2023-12-31
2nd row: 2023-12-31
3rd row: 2023-12-31
4th row: 2022-12-31
5th row: 2023-12-31

Common Values

Value | Count | Frequency (%)
2023-03-31 | 207 | 71.1%
2023-12-31 | 38 | 13.1%
2022-12-31 | 21 | 7.2%
2023-06-30 | 16 | 5.5%
2022-03-31 | 3 | 1.0%
2024-06-30 | 2 | 0.7%
2024-12-31 | 1 | 0.3%
2021-12-31 | 1 | 0.3%
2020-03-31 | 1 | 0.3%
2020-12-31 | 1 | 0.3%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
2023-03-31 | 207 | 71.1%
2023-12-31 | 38 | 13.1%
2022-12-31 | 21 | 7.2%
2023-06-30 | 16 | 5.5%
2022-03-31 | 3 | 1.0%
2024-06-30 | 2 | 0.7%
2024-12-31 | 1 | 0.3%
2021-12-31 | 1 | 0.3%
2020-03-31 | 1 | 0.3%
2020-12-31 | 1 | 0.3%

사용제한여부
Categorical

Distinct: 4
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 7
Median length: 7
Mean length: 6.4123711
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일부 사용제한
2nd row: 일부 사용제한
3rd row: 일부 사용제한
4th row: 일부 사용제한
5th row: 일부 사용제한

Common Values

Value | Count | Frequency (%)
사용제한 없음 | 143 | 49.1%
일부 사용제한 | 68 | 23.4%
<NA> | 57 | 19.6%
전면 사용금지 | 23 | 7.9%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
사용제한 | 211 | 40.2%
없음 | 143 | 27.2%
일부 | 68 | 13.0%
na | 57 | 10.9%
전면 | 23 | 4.4%
사용금지 | 23 | 4.4%
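
The word-level counts above are consistent with splitting each 사용제한여부 value on whitespace and tallying the resulting words, weighted by how often each value occurs. The sketch below reproduces them from the Common Values counts. The tokenizer (lowercasing and keeping only word characters, which turns `<NA>` into `na`) is an assumption made for this illustration, not a description of the profiler's own tokenizer.

```python
import re
from collections import Counter

# Value counts copied from the Common Values listing for 사용제한여부.
value_counts = {
    "사용제한 없음": 143,
    "일부 사용제한": 68,
    "<NA>": 57,
    "전면 사용금지": 23,
}

def word_counts(value_counts):
    """Tally words across values, weighting each word by its value's count."""
    words = Counter()
    for value, count in value_counts.items():
        # Assumed tokenizer: lowercase, then keep runs of word characters.
        for word in re.findall(r"\w+", value.lower()):
            words[word] += count
    return words

words = word_counts(value_counts)
total = sum(words.values())
for word, count in words.most_common():
    print(f"{word} | {count} | {100 * count / total:.1f}%")
```

Here 사용제한 appears in two different values (143 + 68 = 211), which is why its count exceeds that of any single category.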

주민공지여부
Categorical

Distinct: 4
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 21
Median length: 21
Mean length: 14.233677
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 위험표지
2nd row: 위험표지, 방송·인터넷을 통한 주민공지
3rd row: 위험표지, 방송·인터넷을 통한 주민공지
4th row: 위험표지, 방송·인터넷을 통한 주민공지
5th row: 위험표지, 방송·인터넷을 통한 주민공지

Common Values

Value | Count | Frequency (%)
위험표지, 방송·인터넷을 통한 주민공지 | 159 | 54.6%
<NA> | 71 | 24.4%
위험표지 | 36 | 12.4%
방송·인터넷을 통한 주민공지 | 25 | 8.6%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
위험표지 | 195 | 23.8%
방송·인터넷을 | 184 | 22.5%
통한 | 184 | 22.5%
주민공지 | 184 | 22.5%
na | 71 | 8.7%

Correlations

 | 시설물구분 | 시설물종류 | 종별 | DE등급판정일 | 차기안전점검일 | 사용제한여부 | 주민공지여부
시설물구분 | 1.000 | 0.986 | 0.834 | 0.931 | 0.488 | 0.189 | 0.401
시설물종류 | 0.986 | 1.000 | 0.899 | 0.970 | 0.668 | 0.468 | 0.546
종별 | 0.834 | 0.899 | 1.000 | 0.916 | 0.692 | 0.427 | 0.000
DE등급판정일 | 0.931 | 0.970 | 0.916 | 1.000 | 0.997 | 0.770 | 0.860
차기안전점검일 | 0.488 | 0.668 | 0.692 | 0.997 | 1.000 | 0.189 | 0.220
사용제한여부 | 0.189 | 0.468 | 0.427 | 0.770 | 0.189 | 1.000 | 0.434
주민공지여부 | 0.401 | 0.546 | 0.000 | 0.860 | 0.220 | 0.434 | 1.000

 | 시설물구분 | 종별 | 시설물종류 | 차기안전점검일 | 주민공지여부 | 사용제한여부
시설물구분 | 1.000 | 0.770 | 0.917 | 0.257 | 0.275 | 0.119
종별 | 0.770 | 1.000 | 0.766 | 0.541 | 0.000 | 0.161
시설물종류 | 0.917 | 0.766 | 1.000 | 0.329 | 0.354 | 0.236
차기안전점검일 | 0.257 | 0.541 | 0.329 | 1.000 | 0.131 | 0.112
주민공지여부 | 0.275 | 0.000 | 0.354 | 0.131 | 1.000 | 0.165
사용제한여부 | 0.119 | 0.161 | 0.236 | 0.112 | 0.165 | 1.000

 | 시설물구분 | 시설물종류 | 종별 | 차기안전점검일 | 사용제한여부 | 주민공지여부
시설물구분 | 1.000 | 0.917 | 0.770 | 0.257 | 0.119 | 0.275
시설물종류 | 0.917 | 1.000 | 0.766 | 0.329 | 0.236 | 0.354
종별 | 0.770 | 0.766 | 1.000 | 0.541 | 0.161 | 0.000
차기안전점검일 | 0.257 | 0.329 | 0.541 | 1.000 | 0.112 | 0.131
사용제한여부 | 0.119 | 0.236 | 0.161 | 0.112 | 1.000 | 0.165
주민공지여부 | 0.275 | 0.354 | 0.000 | 0.131 | 0.165 | 1.000
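
The four "High correlation" alerts in the overview can be recovered from the last six-variable matrix above by listing, for each variable, the other variables whose coefficient exceeds a cutoff. The sketch below assumes a cutoff of 0.5; that value is an assumption chosen because it reproduces the alerts, not a setting read from this report's configuration.

```python
# Coefficients copied from the last six-variable correlation matrix above.
columns = ["시설물구분", "시설물종류", "종별", "차기안전점검일", "사용제한여부", "주민공지여부"]
matrix = [
    [1.000, 0.917, 0.770, 0.257, 0.119, 0.275],
    [0.917, 1.000, 0.766, 0.329, 0.236, 0.354],
    [0.770, 0.766, 1.000, 0.541, 0.161, 0.000],
    [0.257, 0.329, 0.541, 1.000, 0.112, 0.131],
    [0.119, 0.236, 0.161, 0.112, 1.000, 0.165],
    [0.275, 0.354, 0.000, 0.131, 0.165, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, not taken from the report

def highly_correlated(columns, matrix, threshold=THRESHOLD):
    """Map each variable to the other variables correlated above the threshold."""
    result = {}
    for i, name in enumerate(columns):
        partners = [
            columns[j]
            for j, value in enumerate(matrix[i])
            if j != i and value > threshold
        ]
        if partners:
            result[name] = partners
    return result

flagged = highly_correlated(columns, matrix)
for name, partners in flagged.items():
    print(f"{name} -> {', '.join(partners)}")
```

With this cutoff, 종별 is paired with three variables and 차기안전점검일 only with 종별, matching the wording of the alerts.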

Missing values

[Figure: nullity matrix]
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

 | 시설물명 | 시설물구분 | 시설물종류 | 종별 | 주소 | DE등급판정일 | 차기안전점검일 | 사용제한여부 | 주민공지여부
0 | 화목양배수장 | 하천 | 배수펌프장 | 2종 | 경상남도 김해시 | 2022-10-29 | 2023-12-31 | 일부 사용제한 | 위험표지
1 | 왕신저수지 |  | 용수전용댐 | 2종 | 경상북도 경주시 | 2022-10-31 | 2023-12-31 | 일부 사용제한 | 위험표지, 방송·인터넷을 통한 주민공지
2 | 금량교 | 교량 | 도로교량 | 3종 | 경상북도 경주시 | 2023-03-20 | 2023-12-31 | 일부 사용제한 | 위험표지, 방송·인터넷을 통한 주민공지
3 | 대곡교 | 교량 | 도로교량 | 3종 | 경상북도 경주시 | 2022-06-30 | 2022-12-31 | 일부 사용제한 | 위험표지, 방송·인터넷을 통한 주민공지
4 | 상라교 | 교량 | 도로교량 | 3종 | 경상북도 경주시 | 2023-03-20 | 2023-12-31 | 일부 사용제한 | 위험표지, 방송·인터넷을 통한 주민공지
5 | 마능교 | 교량 | 도로교량 | 3종 | 경상북도 경주시 | 2023-03-20 | 2023-12-31 | 사용제한 없음 | <NA>
6 | 동방교 | 교량 | 도로교량 | 2종 | 경상북도 경주시 | 2023-01-12 | 2023-12-31 | <NA> | <NA>
7 | 가마골2교 | 교량 | 도로교량 | 3종 | 강원특별자치도 고성군 | 2022-12-17 | 2023-03-31 | <NA> | <NA>
8 | 구국도 삼포교 | 교량 | 도로교량 | 3종 | 강원특별자치도 고성군 | 2022-12-17 | 2023-03-31 | <NA> | <NA>
9 | 탑평교 | 교량 | 도로교량 | 3종 | 강원특별자치도 고성군 | 2022-12-17 | 2023-03-31 | <NA> | <NA>

 | 시설물명 | 시설물구분 | 시설물종류 | 종별 | 주소 | DE등급판정일 | 차기안전점검일 | 사용제한여부 | 주민공지여부
281 | 서계동 33-144 | 건축물 | 기타 | 3종 | 서울특별시 용산구 | 2022-06-13 | 2022-12-31 | 사용제한 없음 | 방송·인터넷을 통한 주민공지
282 | 서계동 262-3 | 건축물 | 기타 | 3종 | 서울특별시 용산구 | 2022-06-13 | 2022-12-31 | 사용제한 없음 | 방송·인터넷을 통한 주민공지
283 | 서계동 262-2 | 건축물 | 기타 | 3종 | 서울특별시 용산구 | 2022-06-13 | 2022-12-31 | 사용제한 없음 | 방송·인터넷을 통한 주민공지
284 | 한남동 504-1(노후건물) | 건축물 | 기타 | 3종 | 서울특별시 용산구 | 2022-06-13 | 2022-12-31 | 전면 사용금지 | 위험표지, 방송·인터넷을 통한 주민공지
285 | 무지개빌라1~2동 옹벽 및 담장 | 기타 | 기타 | 3종 | 인천광역시 부평구 | 2023-02-13 | 2023-12-31 | 사용제한 없음 | 위험표지, 방송·인터넷을 통한 주민공지
286 | L2연구동(서울 한국과학기술연구원) | 건축물 | 대형건축물 | 3종 | 서울특별시 성북구 | 2023-03-27 | 2023-12-31 | 사용제한 없음 | <NA>
287 | 테스트 교량 | 교량 | 도로교량 | 2종 | 경상남도 진주시 | 2022-12-09 | 2023-03-31 | <NA> | <NA>
288 | 원곡교 | 교량 | 도로교량 | 3종 | 경상북도 울진군 | 2022-12-14 | 2023-03-31 | 사용제한 없음 | 위험표지, 방송·인터넷을 통한 주민공지
289 | 군도12호 - 다천1 | 절토사면 | 도로사면 | 2종 | 경상북도 울진군 | 2022-09-21 | 2023-03-31 | 사용제한 없음 | 위험표지, 방송·인터넷을 통한 주민공지
290 | 숭의평화창작공간 2동밖 | 건축물 | 기타 | 3종 | 인천광역시 미추홀구 | 2022-12-30 | 2023-03-31 | <NA> | <NA>