Overview

Dataset statistics

Number of variables: 11
Number of observations: 141
Missing cells: 709
Missing cells (%): 45.7%
Duplicate rows: 5
Duplicate rows (%): 3.5%
Total size in memory: 12.2 KiB
Average record size in memory: 88.9 B
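
The two percentages above follow from the counts: missing cells are measured against all 11 × 141 cells, and duplicate rows against the 141 observations. A minimal check in Python, where the variable names are illustrative and not part of the report:

```python
# Counts copied from the dataset statistics above.
n_variables = 11
n_observations = 141
missing_cells = 709
duplicate_rows = 5

# Share of missing cells over every cell in the table.
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Share of duplicate rows over all rows.
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```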

Variable types

Text: 1
Unsupported: 10

Dataset

Description: File download
Author: 서울교통공사 (Seoul Metro)
URL: https://data.seoul.go.kr/dataList/OA-11573/S/1/datasetView.do

Alerts

Dataset has 5 (3.5%) duplicate rows [Duplicates]
□ 노약자․장애인 편의시설 ("Convenience facilities for the elderly and disabled") has 127 (90.1%) missing values [Missing]
Unnamed: 1 has 7 (5.0%) missing values [Missing]
Unnamed: 2 has 11 (7.8%) missing values [Missing]
Unnamed: 3 has 25 (17.7%) missing values [Missing]
Unnamed: 4 has 17 (12.1%) missing values [Missing]
Unnamed: 5 has 35 (24.8%) missing values [Missing]
Unnamed: 6 has 60 (42.6%) missing values [Missing]
Unnamed: 7 has 87 (61.7%) missing values [Missing]
Unnamed: 8 has 100 (70.9%) missing values [Missing]
Unnamed: 9 has 127 (90.1%) missing values [Missing]
Unnamed: 10 has 113 (80.1%) missing values [Missing]
Unnamed: 1 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 2 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 3 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 4 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 5 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 6 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 7 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 8 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 9 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
Unnamed: 10 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
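
The per-column alerts can be cross-checked against the dataset total. The sketch below copies the missing-value counts from the alerts, recomputes each percentage over 141 observations, and sums the counts; the dictionary and constant names are illustrative only.

```python
N_OBS = 141  # number of observations reported in the overview

# Missing-value counts per column, copied from the alerts above.
missing = {
    "□ 노약자․장애인 편의시설": 127,
    "Unnamed: 1": 7,
    "Unnamed: 2": 11,
    "Unnamed: 3": 25,
    "Unnamed: 4": 17,
    "Unnamed: 5": 35,
    "Unnamed: 6": 60,
    "Unnamed: 7": 87,
    "Unnamed: 8": 100,
    "Unnamed: 9": 127,
    "Unnamed: 10": 113,
}

# Recompute the percentage shown next to each alert.
for name, count in missing.items():
    print(f"{name}: {count} ({100 * count / N_OBS:.1f}%)")

# The per-column counts add up to the "Missing cells" figure.
total_missing = sum(missing.values())
print(total_missing)  # 709
```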

Reproduction

Analysis started: 2023-12-11 05:46:52.544217
Analysis finished: 2023-12-11 05:46:53.043229
Duration: 0.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

□ 노약자․장애인 편의시설
Text

Distinct: 10
Distinct (%): 71.4%
Missing: 127
Missing (%): 90.1%
Memory size: 1.2 KiB

Length

Max length: 9
Median length: 3
Mean length: 3.4285714
Min length: 1

Characters and Unicode

Total characters: 48
Distinct characters: 23
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
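
As an illustration of those character properties, the sketch below uses Python's standard `unicodedata` module to count general categories over the values that are still legible in this report. The function name `category_counts` is an assumption made for this example; the category codes map to the names used in the report (`Lo` = Other Letter, `Nd` = Decimal Number, `Zs` = Space Separator, `So` = Other Symbol, `Cc` = Control).

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in `values`."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Values that are legible in the Sample and frequency listings of this report.
# This is only a subset of the column, so the totals differ from the report's.
sample = ["구 분", "1호선", "2호선", "3호선", "4호선", "역당", "평균"]

print(category_counts(sample))
```

Because only part of the column is reproduced here, the counts are smaller than the 48 characters profiled in the report.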

Unique

Unique: 6
Unique (%): 42.9%

Sample

1st row: 구 분
2nd row:
3rd row: 1호선
4th row: 2호선
5th row: 3호선
Value | Count | Frequency (%)
1호선 | 2 | 10.5%
2호선 | 2 | 10.5%
3호선 | 2 | 10.5%
4호선 | 2 | 10.5%
 | 2 | 10.5%
 | 1 | 5.3%
 | 1 | 5.3%
역당 | 1 | 5.3%
평균 | 1 | 5.3%
 | 1 | 5.3%
Other values (4) | 4 | 21.1%

Most occurring characters

Value | Count | Frequency (%)
 | 9 | 18.8%
 | 9 | 18.8%
 | 4 | 8.3%
1 | 2 | 4.2%
 | 2 | 4.2%
 | 2 | 4.2%
4 | 2 | 4.2%
3 | 2 | 4.2%
2 | 2 | 4.2%
 | 1 | 2.1%
Other values (13) | 13 | 27.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 33 | 68.8%
Decimal Number | 8 | 16.7%
Space Separator | 5 | 10.4%
Other Symbol | 1 | 2.1%
Control | 1 | 2.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 9 | 27.3%
 | 9 | 27.3%
 | 2 | 6.1%
 | 2 | 6.1%
 | 1 | 3.0%
 | 1 | 3.0%
 | 1 | 3.0%
 | 1 | 3.0%
 | 1 | 3.0%
 | 1 | 3.0%
Other values (5) | 5 | 15.2%
Decimal Number
Value | Count | Frequency (%)
1 | 2 | 25.0%
4 | 2 | 25.0%
3 | 2 | 25.0%
2 | 2 | 25.0%
Space Separator
Value | Count | Frequency (%)
 | 4 | 80.0%
 | 1 | 20.0%
Other Symbol
Value | Count | Frequency (%)
 | 1 | 100.0%
Control
Value | Count | Frequency (%)
 | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 33 | 68.8%
Common | 15 | 31.2%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 9 | 27.3%
 | 9 | 27.3%
 | 2 | 6.1%
 | 2 | 6.1%
 | 1 | 3.0%
 | 1 | 3.0%
 | 1 | 3.0%
 | 1 | 3.0%
 | 1 | 3.0%
 | 1 | 3.0%
Other values (5) | 5 | 15.2%
Common
Value | Count | Frequency (%)
 | 4 | 26.7%
1 | 2 | 13.3%
4 | 2 | 13.3%
3 | 2 | 13.3%
2 | 2 | 13.3%
 | 1 | 6.7%
 | 1 | 6.7%
 | 1 | 6.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 33 | 68.8%
ASCII | 13 | 27.1%
None | 1 | 2.1%
Geometric Shapes | 1 | 2.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 9 | 27.3%
 | 9 | 27.3%
 | 2 | 6.1%
 | 2 | 6.1%
 | 1 | 3.0%
 | 1 | 3.0%
 | 1 | 3.0%
 | 1 | 3.0%
 | 1 | 3.0%
 | 1 | 3.0%
Other values (5) | 5 | 15.2%
ASCII
Value | Count | Frequency (%)
 | 4 | 30.8%
1 | 2 | 15.4%
4 | 2 | 15.4%
3 | 2 | 15.4%
2 | 2 | 15.4%
 | 1 | 7.7%
None
Value | Count | Frequency (%)
 | 1 | 100.0%
Geometric Shapes
Value | Count | Frequency (%)
 | 1 | 100.0%

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 7
Missing (%): 5.0%
Memory size: 1.2 KiB

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 11
Missing (%): 7.8%
Memory size: 1.2 KiB

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 25
Missing (%): 17.7%
Memory size: 1.2 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 17
Missing (%): 12.1%
Memory size: 1.2 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 35
Missing (%): 24.8%
Memory size: 1.2 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 60
Missing (%): 42.6%
Memory size: 1.2 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 87
Missing (%): 61.7%
Memory size: 1.2 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 100
Missing (%): 70.9%
Memory size: 1.2 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 127
Missing (%): 90.1%
Memory size: 1.2 KiB

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 113
Missing (%): 80.1%
Memory size: 1.2 KiB

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
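
One common way to compute a nullity correlation is the Pearson correlation between the present/missing indicators of two columns. The sketch below assumes that definition; the report does not state its exact formula. The function name and the two toy columns are hypothetical and are not taken from the dataset.

```python
from math import sqrt


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns.

    A value counts as missing when it is None. Returns None when either
    column is entirely present or entirely missing, because the
    correlation is undefined without variation.
    """
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)


# Hypothetical columns: both are missing in exactly the same rows.
stations = [10, None, 34, None]
units = [65, None, 252, None]
print(nullity_correlation(stations, units))
```

A value near 1 means the two columns tend to be present together, a value near -1 means one is present where the other is missing, and a value near 0 means no linear relationship between their missingness.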

Sample

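First rows

 | □ 노약자․장애인 편의시설 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10
0 | 구 분 | 설치현황 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN
1 | <NA> |  | NaN | 엘리베이터 | NaN | 에스컬레이터 | NaN | 휠체어리프트 | NaN | 수평자동보도 | NaN
2 | <NA> |  | NaN | (E/V) | NaN | (E/S) | NaN | (W/L) | NaN | (M/W) | NaN
3 | <NA> | 역수 | 대수 | 역수 | 대수 | 역수 | 대수 | 역수 | 대수 | 역수 | 대수
4 |  | 120 | 818 | 116 | 317 | 89 | 445 | 28 | 54 | 1 | 2
5 | 1호선 | 10 | 65 | 10 | 34 | 7 | 22 | 4 | 9 | - | -
6 | 2호선 | 50 | 326 | 50 | 136 | 37 | 170 | 12 | 20 | - | -
7 | 3호선 | 34 | 252 | 31 | 77 | 25 | 164 | 4 | 9 | 1 | 2
8 | 4호선 | 26 | 175 | 25 | 70 | 20 | 89 | 8 | 16 | - | -
9 | 역당 평균 | 6.8(전체) | NaN | 2.7(설치역당) | NaN | 5.0(설치역당) | NaN | 1.9(설치역당) | NaN | 2(설치역당) | NaN

Korean terms in the data: 구 분 = category; 설치현황 = installation status; 엘리베이터 = elevator (E/V); 에스컬레이터 = escalator (E/S); 휠체어리프트 = wheelchair lift (W/L); 수평자동보도 = moving walkway (M/W); 역수 = number of stations; 대수 = number of units; N호선 = Line N; 역당 평균 = average per station; 전체 = all stations; 설치역당 = per equipped station.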
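
Last rows

 | □ 노약자․장애인 편의시설 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10
131 | <NA> | 회 현 | 1 | NaN | 1 | 6 | NaN | 6 | 1 | NaN | NaN
132 | <NA> | 서울④ | 2 | 1 | 1 | 4 | 2 | 2 | 2 | NaN | NaN
133 | <NA> | 숙대입구 | 3 | 1 | 2 | NaN | NaN | NaN | NaN | NaN | NaN
134 | <NA> | 삼각지 | 4 | 2 | 2 | 4 | 4 | NaN | NaN | NaN | NaN
135 | <NA> | 신용산 | 2 | NaN | 2 | 2 | 2 | NaN | 1 | NaN | NaN
136 | <NA> | 이 촌 | 4 | 2 | 2 | 4 | 4 | NaN | 1 | NaN | NaN
137 | <NA> | 동 작 | 2 | NaN | 2 | NaN | NaN | NaN | NaN | NaN | NaN
138 | <NA> | 총신대입구 | 4 | 2 | 2 | NaN | NaN | NaN | NaN | NaN | NaN
139 | <NA> | 사당④ | 2 | 1 | 1 | 7 | 6 | 1 | 1 | NaN | NaN
140 | <NA> | 남태령 | 3 | 1 | 2 | 2 | NaN | 2 | NaN | NaN | 내부경사형EV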

Duplicate rows

Most frequently occurring

 | □ 노약자․장애인 편의시설 | # duplicates
4 | <NA> | 127
0 | 1호선 | 2
1 | 2호선 | 2
2 | 3호선 | 2
3 | 4호선 | 2
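
The table lists only the one supported column, which suggests that duplicates were counted on that column alone, with missing values treated as equal to each other. That reading is an assumption; the report does not state it. Under that assumption, the sketch below rebuilds the column from the counts given in this report and recovers the five duplicated values. The six `unique_*` strings are hypothetical stand-ins for the values that occur once.

```python
from collections import Counter

# Rebuild the supported column from the reported counts: 127 missing values,
# four line names that each appear twice, and six values that appear once.
column = (
    [None] * 127
    + ["1호선", "2호선", "3호선", "4호선"] * 2
    + [f"unique_{i}" for i in range(6)]  # hypothetical placeholders
)

# Keep only values that occur more than once.
counts = Counter(column)
duplicates = {value: n for value, n in counts.items() if n > 1}

print(len(column))      # 141 rows
print(len(duplicates))  # 5 duplicated values
print(f"{100 * len(duplicates) / len(column):.1f}%")
```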