Overview

Dataset statistics

Number of variables8
Number of observations201
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.1 KiB
Average record size in memory66.7 B

Variable types

Numeric2
Categorical4
Text2

Dataset

Description서울교통공사 9호선 2,3단계의 역사 에스컬레이터 설치 현황 데이터 입니다. 해당 데이터는 연번, 호선, 역명, 장비종류, 호기, 승강기번호, 운행구간, 설치위치를 포함하고 있습니다. 2023년 10월 기준입니다.
Author서울교통공사
URLhttps://www.data.go.kr/data/15118711/fileData.do

Alerts

장비종류 has constant value ""Constant
역명 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
호선 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 is highly overall correlated with 호선 and 1 other fieldsHigh correlation
연번 has unique valuesUnique
승강기번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:10:42.565406
Analysis finished2023-12-12 23:10:43.503306
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct201
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean101
Minimum1
Maximum201
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-13T08:10:43.579103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile11
Q151
median101
Q3151
95-th percentile191
Maximum201
Range200
Interquartile range (IQR)100

Descriptive statistics

Standard deviation58.167861
Coefficient of variation (CV)0.57591941
Kurtosis-1.2
Mean101
Median Absolute Deviation (MAD)50
Skewness0
Sum20301
Variance3383.5
MonotonicityStrictly increasing
2023-12-13T08:10:43.723739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
139 1
 
0.5%
129 1
 
0.5%
130 1
 
0.5%
131 1
 
0.5%
132 1
 
0.5%
133 1
 
0.5%
134 1
 
0.5%
135 1
 
0.5%
136 1
 
0.5%
Other values (191) 191
95.0%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
201 1
0.5%
200 1
0.5%
199 1
0.5%
198 1
0.5%
197 1
0.5%
196 1
0.5%
195 1
0.5%
194 1
0.5%
193 1
0.5%
192 1
0.5%

호선
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
9(2단계)
104 
9(3단계)
97 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row9(2단계)
2nd row9(2단계)
3rd row9(2단계)
4th row9(2단계)
5th row9(2단계)

Common Values

ValueCountFrequency (%)
9(2단계) 104
51.7%
9(3단계) 97
48.3%

Length

2023-12-13T08:10:43.848644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:10:43.939775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
9(2단계 104
51.7%
9(3단계 97
48.3%

역명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
언주
28 
석촌(9)
25 
봉은사
24 
삼성중앙
22 
종합운동장(9)
18 
Other values (8)
84 

Length

Max length8
Median length5
Mean length4.4228856
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row언주
2nd row언주
3rd row언주
4th row언주
5th row언주

Common Values

ValueCountFrequency (%)
언주 28
13.9%
석촌(9) 25
12.4%
봉은사 24
11.9%
삼성중앙 22
10.9%
종합운동장(9) 18
9.0%
삼전 16
8.0%
올림픽공원(9) 14
7.0%
선정릉(9) 12
6.0%
한성백제 12
6.0%
중앙보훈병원 10
 
5.0%
Other values (3) 20
10.0%

Length

2023-12-13T08:10:44.074355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
언주 28
13.9%
석촌(9 25
12.4%
봉은사 24
11.9%
삼성중앙 22
10.9%
종합운동장(9 18
9.0%
삼전 16
8.0%
올림픽공원(9 14
7.0%
선정릉(9 12
6.0%
한성백제 12
6.0%
중앙보훈병원 10
 
5.0%
Other values (3) 20
10.0%

장비종류
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
E/S
201 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowE/S
2nd rowE/S
3rd rowE/S
4th rowE/S
5th rowE/S

Common Values

ValueCountFrequency (%)
E/S 201
100.0%

Length

2023-12-13T08:10:44.205361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:10:44.297456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
e/s 201
100.0%

호기
Real number (ℝ)

Distinct28
Distinct (%)13.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.8756219
Minimum1
Maximum28
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-13T08:10:44.401297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median9
Q314
95-th percentile23
Maximum28
Range27
Interquartile range (IQR)10

Descriptive statistics

Standard deviation6.7163571
Coefficient of variation (CV)0.6800946
Kurtosis-0.44125362
Mean9.8756219
Median Absolute Deviation (MAD)5
Skewness0.66209195
Sum1985
Variance45.109453
MonotonicityNot monotonic
2023-12-13T08:10:44.526220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
1 13
 
6.5%
3 13
 
6.5%
4 13
 
6.5%
5 13
 
6.5%
6 13
 
6.5%
2 13
 
6.5%
7 11
 
5.5%
8 11
 
5.5%
9 10
 
5.0%
10 10
 
5.0%
Other values (18) 81
40.3%
ValueCountFrequency (%)
1 13
6.5%
2 13
6.5%
3 13
6.5%
4 13
6.5%
5 13
6.5%
6 13
6.5%
7 11
5.5%
8 11
5.5%
9 10
5.0%
10 10
5.0%
ValueCountFrequency (%)
28 1
 
0.5%
27 1
 
0.5%
26 1
 
0.5%
25 2
1.0%
24 3
1.5%
23 3
1.5%
22 4
2.0%
21 4
2.0%
20 4
2.0%
19 4
2.0%

승강기번호
Text

UNIQUE 

Distinct201
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T08:10:44.823523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters1608
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique201 ?
Unique (%)100.0%

Sample

1st row1809-530
2nd row1809-529
3rd row1809-536
4th row1809-535
5th row1809-531
ValueCountFrequency (%)
1809-530 1
 
0.5%
1812-727 1
 
0.5%
1812-768 1
 
0.5%
1812-723 1
 
0.5%
1812-721 1
 
0.5%
1812-718 1
 
0.5%
1812-717 1
 
0.5%
1812-722 1
 
0.5%
1812-719 1
 
0.5%
1812-720 1
 
0.5%
Other values (191) 191
95.0%
2023-12-13T08:10:45.230863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 368
22.9%
8 243
15.1%
- 201
12.5%
5 147
 
9.1%
7 138
 
8.6%
0 132
 
8.2%
9 130
 
8.1%
2 114
 
7.1%
6 52
 
3.2%
4 43
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1407
87.5%
Dash Punctuation 201
 
12.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 368
26.2%
8 243
17.3%
5 147
 
10.4%
7 138
 
9.8%
0 132
 
9.4%
9 130
 
9.2%
2 114
 
8.1%
6 52
 
3.7%
4 43
 
3.1%
3 40
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 201
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1608
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 368
22.9%
8 243
15.1%
- 201
12.5%
5 147
 
9.1%
7 138
 
8.6%
0 132
 
8.2%
9 130
 
8.1%
2 114
 
7.1%
6 52
 
3.2%
4 43
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1608
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 368
22.9%
8 243
15.1%
- 201
12.5%
5 147
 
9.1%
7 138
 
8.6%
0 132
 
8.2%
9 130
 
8.1%
2 114
 
7.1%
6 52
 
3.2%
4 43
 
2.7%

운행구간
Categorical

Distinct28
Distinct (%)13.9%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
1F-B1
33 
B1-1F
33 
B2-B1
25 
B1-B2
23 
BM-1F
Other values (23)
79 

Length

Max length6
Median length5
Mean length4.9452736
Min length4

Unique

Unique3 ?
Unique (%)1.5%

Sample

1st rowB4-B5
2nd rowB5-B4
3rd rowB4-B5
4th rowB5-B4
5th rowB5-B4

Common Values

ValueCountFrequency (%)
1F-B1 33
16.4%
B1-1F 33
16.4%
B2-B1 25
12.4%
B1-B2 23
11.4%
BM-1F 8
 
4.0%
BM-B1 6
 
3.0%
B1-B3 5
 
2.5%
B3-B1 5
 
2.5%
B2-B4 5
 
2.5%
M-1F 4
 
2.0%
Other values (18) 54
26.9%

Length

2023-12-13T08:10:45.373387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1f-b1 33
16.4%
b1-1f 33
16.4%
b2-b1 25
12.4%
b1-b2 23
11.4%
bm-1f 8
 
4.0%
bm-b1 6
 
3.0%
b1-b3 5
 
2.5%
b3-b1 5
 
2.5%
b2-b4 5
 
2.5%
b4-b5 4
 
2.0%
Other values (18) 54
26.9%
Distinct131
Distinct (%)65.2%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T08:10:45.549513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length14.562189
Min length10

Characters and Unicode

Total characters2927
Distinct characters37
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)44.8%

Sample

1st row(중층 → 하선 3-1)
2nd row(하선 3-1 → 중층)
3rd row(중층 → 상선 4-4)
4th row(상선 4-4 → 중층)
5th row(하선 5-3 → 중층)
ValueCountFrequency (%)
199
23.7%
대합실 162
19.3%
출구 110
13.1%
상선 40
 
4.8%
3번 34
 
4.1%
하선 32
 
3.8%
중층 30
 
3.6%
2번 24
 
2.9%
1번 20
 
2.4%
4번 18
 
2.1%
Other values (37) 169
20.2%
2023-12-13T08:10:45.909918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
638
21.8%
( 224
 
7.7%
) 224
 
7.7%
201
 
6.9%
190
 
6.5%
184
 
6.3%
184
 
6.3%
142
 
4.9%
127
 
4.3%
127
 
4.3%
Other values (27) 686
23.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1276
43.6%
Space Separator 638
21.8%
Decimal Number 292
 
10.0%
Open Punctuation 224
 
7.7%
Close Punctuation 224
 
7.7%
Math Symbol 201
 
6.9%
Dash Punctuation 72
 
2.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
190
14.9%
184
14.4%
184
14.4%
142
11.1%
127
10.0%
127
10.0%
78
6.1%
46
 
3.6%
46
 
3.6%
40
 
3.1%
Other values (13) 112
8.8%
Decimal Number
ValueCountFrequency (%)
3 66
22.6%
2 60
20.5%
1 49
16.8%
4 43
14.7%
5 31
10.6%
6 20
 
6.8%
7 11
 
3.8%
8 10
 
3.4%
9 2
 
0.7%
Space Separator
ValueCountFrequency (%)
638
100.0%
Open Punctuation
ValueCountFrequency (%)
( 224
100.0%
Close Punctuation
ValueCountFrequency (%)
) 224
100.0%
Math Symbol
ValueCountFrequency (%)
201
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 72
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1651
56.4%
Hangul 1276
43.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
190
14.9%
184
14.4%
184
14.4%
142
11.1%
127
10.0%
127
10.0%
78
6.1%
46
 
3.6%
46
 
3.6%
40
 
3.1%
Other values (13) 112
8.8%
Common
ValueCountFrequency (%)
638
38.6%
( 224
 
13.6%
) 224
 
13.6%
201
 
12.2%
- 72
 
4.4%
3 66
 
4.0%
2 60
 
3.6%
1 49
 
3.0%
4 43
 
2.6%
5 31
 
1.9%
Other values (4) 43
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1450
49.5%
Hangul 1276
43.6%
Arrows 201
 
6.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
638
44.0%
( 224
 
15.4%
) 224
 
15.4%
- 72
 
5.0%
3 66
 
4.6%
2 60
 
4.1%
1 49
 
3.4%
4 43
 
3.0%
5 31
 
2.1%
6 20
 
1.4%
Other values (3) 23
 
1.6%
Arrows
ValueCountFrequency (%)
201
100.0%
Hangul
ValueCountFrequency (%)
190
14.9%
184
14.4%
184
14.4%
142
11.1%
127
10.0%
127
10.0%
78
6.1%
46
 
3.6%
46
 
3.6%
40
 
3.1%
Other values (13) 112
8.8%

Interactions

2023-12-13T08:10:43.069716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:10:42.873285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:10:43.178519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:10:42.981541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:10:46.067013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번호선역명호기운행구간
연번1.0000.9990.9580.3720.803
호선0.9991.0001.0000.2600.547
역명0.9581.0001.0000.0000.777
호기0.3720.2600.0001.0000.423
운행구간0.8030.5470.7770.4231.000
2023-12-13T08:10:46.193176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
운행구간역명호선
운행구간1.0000.3580.407
역명0.3581.0000.972
호선0.4070.9721.000
2023-12-13T08:10:46.284196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번호기호선역명운행구간
연번1.000-0.2170.9530.8350.429
호기-0.2171.0000.1950.0000.155
호선0.9530.1951.0000.9720.407
역명0.8350.0000.9721.0000.358
운행구간0.4290.1550.4070.3581.000

Missing values

2023-12-13T08:10:43.313358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:10:43.459193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번호선역명장비종류호기승강기번호운행구간설치위치
019(2단계)언주E/S11809-530B4-B5(중층 → 하선 3-1)
129(2단계)언주E/S21809-529B5-B4(하선 3-1 → 중층)
239(2단계)언주E/S31809-536B4-B5(중층 → 상선 4-4)
349(2단계)언주E/S41809-535B5-B4(상선 4-4 → 중층)
459(2단계)언주E/S51809-531B5-B4(하선 5-3 → 중층)
569(2단계)언주E/S61809-532B4-B5(중층 → 하선 5-3)
679(2단계)언주E/S71809-533B5-B4(상선 2-2 → 중층)
789(2단계)언주E/S81809-534B4-B5(중층 → 상선 2-2)
899(2단계)언주E/S91809-537B4-B1(중층 → 대합실)
9109(2단계)언주E/S101809-538B1-B4(대합실 → 중층)
연번호선역명장비종류호기승강기번호운행구간설치위치
1911929(3단계)중앙보훈병원E/S11811-576B2-B1(하선 5-3 → 대합실)
1921939(3단계)중앙보훈병원E/S21811-577B1-B2(대합실 → 하선 5-3)
1931949(3단계)중앙보훈병원E/S31811-578B2-B1(상선 2-2 → 대합실)
1941959(3단계)중앙보훈병원E/S41811-579B1-B2(대합실 → 상선 2-2)
1951969(3단계)중앙보훈병원E/S51811-5811F-B1(1번 출구 → 대합실)
1961979(3단계)중앙보훈병원E/S61811-580B1-1F(대합실 → 1번 출구)
1971989(3단계)중앙보훈병원E/S71811-5831F-B1(2번 출구 → 대합실)
1981999(3단계)중앙보훈병원E/S81811-582B1-1F(대합실 → 2번 출구)
1992009(3단계)중앙보훈병원E/S91811-5851F-B1(3번 출구 → 대합실)
2002019(3단계)중앙보훈병원E/S101811-584B1-1F(대합실 → 3번 출구)