Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells65
Missing cells (%)0.1%
Duplicate rows8
Duplicate rows (%)0.1%
Total size in memory468.8 KiB
Average record size in memory48.0 B

Variable types

Text3
Boolean1
DateTime1

Dataset

Description승강기 중대고장 내역 등 정보 제공 ※ 승강기 안전관리법 제48조 및 동법 시행령 제37조제2항에 따른 중대고장 - 엘리베이터 출입문이 열린 상태로 운행되는 경우, 운행중 사람이 갇히는 경우 등
URLhttps://www.data.go.kr/data/15048080/fileData.do

Alerts

고장 has constant value ""Constant
Dataset has 8 (0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 10:12:21.661398
Analysis finished2023-12-12 10:12:23.043614
Duration1.38 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct7863
Distinct (%)78.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T19:12:23.292212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length25
Mean length8.2253
Min length2

Characters and Unicode

Total characters82253
Distinct characters739
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6530 ?
Unique (%)65.3%

Sample

1st row은아아파트3단지
2nd row청솔우성아파트
3rd rowLG전자평택디지털파크
4th row효창맨숀아파트
5th row나우하이빌
ValueCountFrequency (%)
아파트 44
 
0.4%
잠실5단지아파트 30
 
0.3%
e편한세상 25
 
0.2%
한국철도공사 23
 
0.2%
sk 22
 
0.2%
힐스테이트 21
 
0.2%
오피스텔 19
 
0.2%
보도육교 15
 
0.1%
view 14
 
0.1%
13
 
0.1%
Other values (8321) 10797
97.9%
2023-12-12T19:12:24.078620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4361
 
5.3%
4225
 
5.1%
4026
 
4.9%
1789
 
2.2%
1507
 
1.8%
1462
 
1.8%
1339
 
1.6%
1245
 
1.5%
1220
 
1.5%
1095
 
1.3%
Other values (729) 59984
72.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 74363
90.4%
Decimal Number 3524
 
4.3%
Uppercase Letter 1764
 
2.1%
Space Separator 1033
 
1.3%
Close Punctuation 427
 
0.5%
Open Punctuation 422
 
0.5%
Dash Punctuation 287
 
0.3%
Lowercase Letter 263
 
0.3%
Other Punctuation 150
 
0.2%
Connector Punctuation 10
 
< 0.1%
Other values (2) 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4361
 
5.9%
4225
 
5.7%
4026
 
5.4%
1789
 
2.4%
1507
 
2.0%
1462
 
2.0%
1339
 
1.8%
1245
 
1.7%
1220
 
1.6%
1095
 
1.5%
Other values (658) 52094
70.1%
Uppercase Letter
ValueCountFrequency (%)
S 172
 
9.8%
L 170
 
9.6%
A 135
 
7.7%
K 135
 
7.7%
H 108
 
6.1%
E 100
 
5.7%
T 99
 
5.6%
I 90
 
5.1%
C 88
 
5.0%
M 88
 
5.0%
Other values (16) 579
32.8%
Lowercase Letter
ValueCountFrequency (%)
e 146
55.5%
l 14
 
5.3%
a 11
 
4.2%
t 10
 
3.8%
r 10
 
3.8%
i 9
 
3.4%
o 8
 
3.0%
h 7
 
2.7%
c 7
 
2.7%
u 6
 
2.3%
Other values (10) 35
 
13.3%
Decimal Number
ValueCountFrequency (%)
1 1031
29.3%
2 922
26.2%
3 449
12.7%
5 271
 
7.7%
4 229
 
6.5%
0 170
 
4.8%
6 156
 
4.4%
8 105
 
3.0%
7 102
 
2.9%
9 89
 
2.5%
Other Punctuation
ValueCountFrequency (%)
, 55
36.7%
/ 42
28.0%
. 42
28.0%
& 8
 
5.3%
' 2
 
1.3%
# 1
 
0.7%
Letter Number
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Space Separator
ValueCountFrequency (%)
1033
100.0%
Close Punctuation
ValueCountFrequency (%)
) 427
100.0%
Open Punctuation
ValueCountFrequency (%)
( 422
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 287
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 10
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 74363
90.4%
Common 5858
 
7.1%
Latin 2032
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4361
 
5.9%
4225
 
5.7%
4026
 
5.4%
1789
 
2.4%
1507
 
2.0%
1462
 
2.0%
1339
 
1.8%
1245
 
1.7%
1220
 
1.6%
1095
 
1.5%
Other values (658) 52094
70.1%
Latin
ValueCountFrequency (%)
S 172
 
8.5%
L 170
 
8.4%
e 146
 
7.2%
A 135
 
6.6%
K 135
 
6.6%
H 108
 
5.3%
E 100
 
4.9%
T 99
 
4.9%
I 90
 
4.4%
C 88
 
4.3%
Other values (39) 789
38.8%
Common
ValueCountFrequency (%)
1033
17.6%
1 1031
17.6%
2 922
15.7%
3 449
7.7%
) 427
7.3%
( 422
7.2%
- 287
 
4.9%
5 271
 
4.6%
4 229
 
3.9%
0 170
 
2.9%
Other values (12) 617
10.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 74363
90.4%
ASCII 7885
 
9.6%
Number Forms 5
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4361
 
5.9%
4225
 
5.7%
4026
 
5.4%
1789
 
2.4%
1507
 
2.0%
1462
 
2.0%
1339
 
1.8%
1245
 
1.7%
1220
 
1.6%
1095
 
1.5%
Other values (658) 52094
70.1%
ASCII
ValueCountFrequency (%)
1033
 
13.1%
1 1031
 
13.1%
2 922
 
11.7%
3 449
 
5.7%
) 427
 
5.4%
( 422
 
5.4%
- 287
 
3.6%
5 271
 
3.4%
4 229
 
2.9%
S 172
 
2.2%
Other values (58) 2642
33.5%
Number Forms
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Distinct9585
Distinct (%)96.5%
Missing65
Missing (%)0.7%
Memory size156.2 KiB
2023-12-12T19:12:24.488328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters79480
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9265 ?
Unique (%)93.3%

Sample

1st row5005-646
2nd row0093-380
3rd row2174-692
4th row0039-946
5th row5021-532
ValueCountFrequency (%)
2115-698 6
 
0.1%
0034-232 4
 
< 0.1%
6006-219 4
 
< 0.1%
6018-076 3
 
< 0.1%
0100-676 3
 
< 0.1%
8076-005 3
 
< 0.1%
2152-095 3
 
< 0.1%
0050-360 3
 
< 0.1%
4003-392 3
 
< 0.1%
8082-281 3
 
< 0.1%
Other values (9575) 9900
99.6%
2023-12-12T19:12:25.040676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 15727
19.8%
- 9935
12.5%
2 8107
10.2%
1 7045
8.9%
5 6559
8.3%
7 5751
 
7.2%
8 5630
 
7.1%
6 5602
 
7.0%
4 5410
 
6.8%
3 5222
 
6.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 69545
87.5%
Dash Punctuation 9935
 
12.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 15727
22.6%
2 8107
11.7%
1 7045
10.1%
5 6559
9.4%
7 5751
 
8.3%
8 5630
 
8.1%
6 5602
 
8.1%
4 5410
 
7.8%
3 5222
 
7.5%
9 4492
 
6.5%
Dash Punctuation
ValueCountFrequency (%)
- 9935
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 79480
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 15727
19.8%
- 9935
12.5%
2 8107
10.2%
1 7045
8.9%
5 6559
8.3%
7 5751
 
7.2%
8 5630
 
7.1%
6 5602
 
7.0%
4 5410
 
6.8%
3 5222
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 79480
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 15727
19.8%
- 9935
12.5%
2 8107
10.2%
1 7045
8.9%
5 6559
8.3%
7 5751
 
7.2%
8 5630
 
7.1%
6 5602
 
7.0%
4 5410
 
6.8%
3 5222
 
6.6%

주소
Text

Distinct8086
Distinct (%)80.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T19:12:25.443117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length48
Mean length26.8455
Min length11

Characters and Unicode

Total characters268455
Distinct characters655
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6876 ?
Unique (%)68.8%

Sample

1st row대전광역시 서구 가수원로 106 (가수원동)
2nd row서울특별시 동대문구 전농로10길 20 (답십리동)
3rd row경기도 평택시 진위면 엘지로 222
4th row서울특별시 용산구 효창원로 157 (효창동)
5th row충청북도 진천군 덕산면 이덕로 757
ValueCountFrequency (%)
경기도 2063
 
3.8%
서울특별시 1868
 
3.4%
대전광역시 689
 
1.3%
부산광역시 582
 
1.1%
충청남도 528
 
1.0%
경상북도 511
 
0.9%
대구광역시 510
 
0.9%
광주광역시 498
 
0.9%
서구 480
 
0.9%
경상남도 455
 
0.8%
Other values (11310) 46156
84.9%
2023-12-12T19:12:25.960196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
46707
 
17.4%
10601
 
3.9%
10016
 
3.7%
9036
 
3.4%
) 8627
 
3.2%
( 8623
 
3.2%
7815
 
2.9%
1 7368
 
2.7%
5727
 
2.1%
2 5072
 
1.9%
Other values (645) 148863
55.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 166189
61.9%
Space Separator 46707
 
17.4%
Decimal Number 34250
 
12.8%
Close Punctuation 8636
 
3.2%
Open Punctuation 8632
 
3.2%
Other Punctuation 2221
 
0.8%
Dash Punctuation 1431
 
0.5%
Uppercase Letter 306
 
0.1%
Lowercase Letter 78
 
< 0.1%
Letter Number 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10601
 
6.4%
10016
 
6.0%
9036
 
5.4%
7815
 
4.7%
5727
 
3.4%
3948
 
2.4%
3867
 
2.3%
3692
 
2.2%
3504
 
2.1%
3400
 
2.0%
Other values (586) 104583
62.9%
Uppercase Letter
ValueCountFrequency (%)
L 52
17.0%
H 35
11.4%
C 31
10.1%
S 28
9.2%
A 19
 
6.2%
G 18
 
5.9%
K 17
 
5.6%
B 16
 
5.2%
E 14
 
4.6%
I 12
 
3.9%
Other values (11) 64
20.9%
Lowercase Letter
ValueCountFrequency (%)
e 43
55.1%
t 6
 
7.7%
a 5
 
6.4%
h 5
 
6.4%
l 3
 
3.8%
c 3
 
3.8%
w 3
 
3.8%
m 2
 
2.6%
r 2
 
2.6%
i 2
 
2.6%
Other values (3) 4
 
5.1%
Decimal Number
ValueCountFrequency (%)
1 7368
21.5%
2 5072
14.8%
3 3878
11.3%
5 3094
9.0%
4 2892
 
8.4%
6 2733
 
8.0%
0 2546
 
7.4%
7 2472
 
7.2%
8 2193
 
6.4%
9 2002
 
5.8%
Other Punctuation
ValueCountFrequency (%)
, 1630
73.4%
561
 
25.3%
. 10
 
0.5%
· 7
 
0.3%
* 7
 
0.3%
: 5
 
0.2%
& 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 8627
99.9%
] 9
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 8623
99.9%
[ 9
 
0.1%
Space Separator
ValueCountFrequency (%)
46707
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1431
100.0%
Letter Number
ValueCountFrequency (%)
4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 166187
61.9%
Common 101878
37.9%
Latin 388
 
0.1%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10601
 
6.4%
10016
 
6.0%
9036
 
5.4%
7815
 
4.7%
5727
 
3.4%
3948
 
2.4%
3867
 
2.3%
3692
 
2.2%
3504
 
2.1%
3400
 
2.0%
Other values (584) 104581
62.9%
Latin
ValueCountFrequency (%)
L 52
13.4%
e 43
 
11.1%
H 35
 
9.0%
C 31
 
8.0%
S 28
 
7.2%
A 19
 
4.9%
G 18
 
4.6%
K 17
 
4.4%
B 16
 
4.1%
E 14
 
3.6%
Other values (25) 115
29.6%
Common
ValueCountFrequency (%)
46707
45.8%
) 8627
 
8.5%
( 8623
 
8.5%
1 7368
 
7.2%
2 5072
 
5.0%
3 3878
 
3.8%
5 3094
 
3.0%
4 2892
 
2.8%
6 2733
 
2.7%
0 2546
 
2.5%
Other values (14) 10338
 
10.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 166187
61.9%
ASCII 101694
37.9%
None 568
 
0.2%
Number Forms 4
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
46707
45.9%
) 8627
 
8.5%
( 8623
 
8.5%
1 7368
 
7.2%
2 5072
 
5.0%
3 3878
 
3.8%
5 3094
 
3.0%
4 2892
 
2.8%
6 2733
 
2.7%
0 2546
 
2.5%
Other values (46) 10154
 
10.0%
Hangul
ValueCountFrequency (%)
10601
 
6.4%
10016
 
6.0%
9036
 
5.4%
7815
 
4.7%
5727
 
3.4%
3948
 
2.4%
3867
 
2.3%
3692
 
2.2%
3504
 
2.1%
3400
 
2.0%
Other values (584) 104581
62.9%
None
ValueCountFrequency (%)
561
98.8%
· 7
 
1.2%
Number Forms
ValueCountFrequency (%)
4
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

고장
Boolean

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
True
10000 
ValueCountFrequency (%)
True 10000
100.0%
2023-12-12T19:12:26.071565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct3072
Distinct (%)30.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2007-04-14 00:00:00
Maximum2023-01-31 00:00:00
2023-12-12T19:12:26.167432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:12:26.296485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Missing values

2023-12-12T19:12:22.823300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:12:22.958883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

건물명승강기고유번호주소고장고장일자
44418은아아파트3단지5005-646대전광역시 서구 가수원로 106 (가수원동)Y2020-08-22
47682청솔우성아파트0093-380서울특별시 동대문구 전농로10길 20 (답십리동)Y2020-11-03
34462LG전자평택디지털파크2174-692경기도 평택시 진위면 엘지로 222Y2020-01-21
13744효창맨숀아파트0039-946서울특별시 용산구 효창원로 157 (효창동)Y2012-10-17
79847나우하이빌5021-532충청북도 진천군 덕산면 이덕로 757Y2022-04-11
69085의정부롯데캐슬골드파크2단지2196-550경기도 의정부시 범골로63번길 31 (의정부동, 의정부 롯데캐슬 골드파크 2단지)Y2021-10-17
61419제주드림타워9012-941제주특별자치도 제주시 노연로 12 (노형동)Y2021-06-27
38564동신아름마을아파트2057-201경기도 평택시 안중읍 안현로서8길 14Y2020-05-10
9750개신뜨란채아파트5007-764충청북도 청주시 서원구 경신로 68 (개신동)Y2011-10-03
12600송강청솔아파트5004-112대전광역시 유성구 송강로42번길 61 (송강동)Y2012-07-03
건물명승강기고유번호주소고장고장일자
56161백마2단지극동삼환아파트2068-565경기도 고양시 일산동구 일산로 205Y2021-04-09
17142K-water낙동강보관리단6043-767대구광역시 달성군 다사읍 강정본길 57 K-water낙동강중부보관리단Y2014-03-30
72485천안레이크타운1차푸르지오5066-739충청남도 천안시 서북구 성성6로 111 (성성동, 천안레이크타운푸르지오)Y2021-12-08
95402꿈마을한신아파트2027-645경기도 안양시 동안구 관평로 68 (평촌동,꿈마을한신아파트)Y2022-11-29
14259매호동서아파트6028-134대구광역시 수성구 천을로 180 (매호동)Y2012-12-31
55715동천마을6단지주공아파트7035-732광주광역시 서구 하남대로710번길 5 (동천동, 동천마을 6단지아파트)Y2021-04-02
22808청림빌딩0026-331서울특별시 강서구 공항대로45길 44 (등촌동)Y2018-01-12
53047동남빌딩2197-332경기도 부천시 소사로 119 (소사본동)Y2021-02-07
15336대우4차아파트5010-557충청남도 천안시 동남구 봉명1길 18 (봉명동)Y2013-06-15
46799구의현대2단지아파트0105-949서울특별시 광진구 광나루로56길 32 (구의동)Y2020-10-13

Duplicate rows

Most frequently occurring

건물명승강기고유번호주소고장고장일자# duplicates
0DMC파크뷰자이0108-604서울특별시 서대문구 가재울미래로 2 (남가좌동, DMC파크뷰자이)Y2018-05-102
1괴정엔스타8004-775부산광역시 사하구 낙동대로 203 (괴정동)Y2020-08-302
2당진한성필하우스아파트5055-750충청남도 당진시 남부로 200 (대덕동, 한성 필하우스)Y2021-07-312
3무궁화태영아파트2142-927경기도 안양시 동안구 경수대로610번길 37 (호계동, 무궁화태영아파트)Y2022-04-182
4상계주공4단지아파트0056-387서울특별시 노원구 동일로214길 21 (상계동)Y2021-02-212
5잠실롯데캐슬골드아파트0100-676서울특별시 송파구 올림픽로 269 (신천동)Y2020-04-242
6장암동신현대아파트2064-232경기도 의정부시 장곡로 280-25 (신곡동)Y2022-02-062
7한신그린코아아파트8054-152부산광역시 기장군 기장읍 차성로344번길 13Y2022-12-142