Overview

Dataset statistics

Number of variables7
Number of observations113
Missing cells67
Missing cells (%)8.5%
Duplicate rows19
Duplicate rows (%)16.8%
Total size in memory6.6 KiB
Average record size in memory60.2 B

Variable types

Categorical6
Numeric1

Dataset

Description수도권6호선에 포함된 도시광역철도역들의 엘리베이터 데이터로 철도운영기관명, 선명, 역명, 출입구번호, 상세위치, 정원인원, 정원중량의데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041394/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
선명 has constant value ""Constant
Dataset has 19 (16.8%) duplicate rowsDuplicates
정원_인원 is highly overall correlated with 상세위치 and 1 other fieldsHigh correlation
정원_중량(kg) is highly overall correlated with 정원_인원High correlation
출입구번호 is highly overall correlated with 상세위치High correlation
상세위치 is highly overall correlated with 출입구번호 and 1 other fieldsHigh correlation
정원_인원 is highly imbalanced (61.0%)Imbalance
정원_중량(kg) is highly imbalanced (58.3%)Imbalance
출입구번호 has 67 (59.3%) missing valuesMissing

Reproduction

Analysis started2023-12-12 18:57:51.738336
Analysis finished2023-12-12 18:57:52.537680
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
서울교통공사
113 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울교통공사
2nd row서울교통공사
3rd row서울교통공사
4th row서울교통공사
5th row서울교통공사

Common Values

ValueCountFrequency (%)
서울교통공사 113
100.0%

Length

2023-12-13T03:57:52.631246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:57:52.776567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울교통공사 113
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
6호선
113 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row6호선
2nd row6호선
3rd row6호선
4th row6호선
5th row6호선

Common Values

ValueCountFrequency (%)
6호선 113
100.0%

Length

2023-12-13T03:57:52.900959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:57:53.029162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6호선 113
100.0%

역명
Categorical

Distinct39
Distinct (%)34.5%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
합정
 
5
청구
 
5
버티고개
 
5
삼각지
 
5
공덕
 
5
Other values (34)
88 

Length

Max length14
Median length11
Mean length4.5663717
Min length2

Unique

Unique4 ?
Unique (%)3.5%

Sample

1st row고려대(종암)
2nd row고려대(종암)
3rd row고려대(종암)
4th row고려대(종암)
5th row공덕

Common Values

ValueCountFrequency (%)
합정 5
 
4.4%
청구 5
 
4.4%
버티고개 5
 
4.4%
삼각지 5
 
4.4%
공덕 5
 
4.4%
고려대(종암) 4
 
3.5%
약수 4
 
3.5%
화랑대(서울여대입구) 4
 
3.5%
석계 4
 
3.5%
태릉입구 4
 
3.5%
Other values (29) 68
60.2%

Length

2023-12-13T03:57:53.168755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
합정 5
 
4.4%
버티고개 5
 
4.4%
삼각지 5
 
4.4%
공덕 5
 
4.4%
청구 5
 
4.4%
고려대(종암 4
 
3.5%
약수 4
 
3.5%
화랑대(서울여대입구 4
 
3.5%
석계 4
 
3.5%
태릉입구 4
 
3.5%
Other values (29) 68
60.2%

출입구번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct11
Distinct (%)23.9%
Missing67
Missing (%)59.3%
Infinite0
Infinite (%)0.0%
Mean4.0652174
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-13T03:57:53.333667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3.5
Q36
95-th percentile8.75
Maximum12
Range11
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.8859982
Coefficient of variation (CV)0.70992469
Kurtosis0.1743709
Mean4.0652174
Median Absolute Deviation (MAD)2.5
Skewness0.92364706
Sum187
Variance8.3289855
MonotonicityNot monotonic
2023-12-13T03:57:53.488350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
1 10
 
8.8%
4 8
 
7.1%
2 8
 
7.1%
3 5
 
4.4%
6 5
 
4.4%
8 5
 
4.4%
5 1
 
0.9%
12 1
 
0.9%
11 1
 
0.9%
9 1
 
0.9%
(Missing) 67
59.3%
ValueCountFrequency (%)
1 10
8.8%
2 8
7.1%
3 5
4.4%
4 8
7.1%
5 1
 
0.9%
6 5
4.4%
7 1
 
0.9%
8 5
4.4%
9 1
 
0.9%
11 1
 
0.9%
ValueCountFrequency (%)
12 1
 
0.9%
11 1
 
0.9%
9 1
 
0.9%
8 5
4.4%
7 1
 
0.9%
6 5
4.4%
5 1
 
0.9%
4 8
7.1%
3 5
4.4%
2 8
7.1%

상세위치
Categorical

HIGH CORRELATION 

Distinct45
Distinct (%)39.8%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
(B1-B2) 승강장
14 
(B2-B3) 승강장
12 
(B2-B3)승강장
(B1-B4) 승강장
 
6
(F1-B1)1번 출입구
 
5
Other values (40)
69 

Length

Max length18
Median length15
Mean length12.097345
Min length10

Unique

Unique24 ?
Unique (%)21.2%

Sample

1st row(B1-B4) 승강장
2nd row(B1-B4) 승강장
3rd row(F1-B1)1번 출입구
4th row(F1-B1)3/4번 출입구
5th row(B1-B2) 승강장

Common Values

ValueCountFrequency (%)
(B1-B2) 승강장 14
 
12.4%
(B2-B3) 승강장 12
 
10.6%
(B2-B3)승강장 7
 
6.2%
(B1-B4) 승강장 6
 
5.3%
(F1-B1)1번 출입구 5
 
4.4%
(F1-B1)2번 출입구 5
 
4.4%
(F1-B1)4번 출입구 4
 
3.5%
(B3-B4) 승강장 4
 
3.5%
(F1-B2)1번 출입구 4
 
3.5%
(F1-B1)3번 출입구 3
 
2.7%
Other values (35) 49
43.4%

Length

2023-12-13T03:57:53.709499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
출입구 41
19.9%
승강장 39
18.9%
b1-b2 14
 
6.8%
b2-b3 12
 
5.8%
대합실 11
 
5.3%
b2-b3)승강장 7
 
3.4%
b1-b4 6
 
2.9%
f1-b1)1번 5
 
2.4%
f1-b1)2번 5
 
2.4%
f1-b2)1번 4
 
1.9%
Other values (40) 62
30.1%

정원_인원
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
15
95 
11
 
8
24
 
6
17
 
3
13
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)0.9%

Sample

1st row15
2nd row15
3rd row15
4th row15
5th row15

Common Values

ValueCountFrequency (%)
15 95
84.1%
11 8
 
7.1%
24 6
 
5.3%
17 3
 
2.7%
13 1
 
0.9%

Length

2023-12-13T03:57:53.898391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:57:54.049615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
15 95
84.1%
11 8
 
7.1%
24 6
 
5.3%
17 3
 
2.7%
13 1
 
0.9%

정원_중량(kg)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
1000
96 
750
 
8
1600
 
6
1150
 
3

Length

Max length4
Median length4
Mean length3.9292035
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1000
2nd row1000
3rd row1000
4th row1000
5th row1000

Common Values

ValueCountFrequency (%)
1000 96
85.0%
750 8
 
7.1%
1600 6
 
5.3%
1150 3
 
2.7%

Length

2023-12-13T03:57:54.264368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:57:54.449649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1000 96
85.0%
750 8
 
7.1%
1600 6
 
5.3%
1150 3
 
2.7%

Interactions

2023-12-13T03:57:52.153749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:57:54.595412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명출입구번호상세위치정원_인원정원_중량(kg)
역명1.0000.0000.8040.7640.808
출입구번호0.0001.0001.0000.2450.245
상세위치0.8041.0001.0000.9270.804
정원_인원0.7640.2450.9271.0001.000
정원_중량(kg)0.8080.2450.8041.0001.000
2023-12-13T03:57:54.789274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상세위치정원_인원역명정원_중량(kg)
상세위치1.0000.5450.2340.430
정원_인원0.5451.0000.3930.995
역명0.2340.3931.0000.457
정원_중량(kg)0.4300.9950.4571.000
2023-12-13T03:57:54.975031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출입구번호역명상세위치정원_인원정원_중량(kg)
출입구번호1.0000.0000.7880.1350.135
역명0.0001.0000.2340.3930.457
상세위치0.7880.2341.0000.5450.430
정원_인원0.1350.3930.5451.0000.995
정원_중량(kg)0.1350.4570.4300.9951.000

Missing values

2023-12-13T03:57:52.304135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:57:52.470242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명출입구번호상세위치정원_인원정원_중량(kg)
0서울교통공사6호선고려대(종암)<NA>(B1-B4) 승강장151000
1서울교통공사6호선고려대(종암)<NA>(B1-B4) 승강장151000
2서울교통공사6호선고려대(종암)1(F1-B1)1번 출입구151000
3서울교통공사6호선고려대(종암)3(F1-B1)3/4번 출입구151000
4서울교통공사6호선공덕<NA>(B1-B2) 승강장151000
5서울교통공사6호선공덕<NA>(B1-B2) 승강장151000
6서울교통공사6호선공덕5(F1-B1)5번 출입구151000
7서울교통공사6호선공덕6(B2-B3)환승통로151000
8서울교통공사6호선공덕6(B2-B3)환승통로151000
9서울교통공사6호선광흥창(서강)1(F1-B2)1번 출입구151000
철도운영기관명선명역명출입구번호상세위치정원_인원정원_중량(kg)
103서울교통공사6호선합정<NA>(B3-B4) 승강장151000
104서울교통공사6호선합정6(F1-B1)6번 출입구151000
105서울교통공사6호선합정9(F1-B1)9번 출입구151000
106서울교통공사6호선합정8(F1-B1)8-1번 출입구151000
107서울교통공사6호선화랑대(서울여대입구)<NA>(B1-B2) 승강장151000
108서울교통공사6호선화랑대(서울여대입구)<NA>(B1-B2) 승강장151000
109서울교통공사6호선화랑대(서울여대입구)1(F1-B1)1번 출입구151000
110서울교통공사6호선화랑대(서울여대입구)7(F1-B1)7번 출입구241600
111서울교통공사6호선효창공원앞<NA>(B2-B3)승강장151000
112서울교통공사6호선효창공원앞<NA>(B1-B2)지하2층 대합실151000

Duplicate rows

Most frequently occurring

철도운영기관명선명역명출입구번호상세위치정원_인원정원_중량(kg)# duplicates
0서울교통공사6호선고려대(종암)<NA>(B1-B4) 승강장1510002
1서울교통공사6호선공덕6(B2-B3)환승통로1510002
2서울교통공사6호선공덕<NA>(B1-B2) 승강장1510002
3서울교통공사6호선돌곶이<NA>(B1-B2) 승강장1510002
4서울교통공사6호선동묘앞<NA>(B2-B3)승강장1510002
5서울교통공사6호선디지털미디어시티<NA>(B1-B2) 승강장117502
6서울교통공사6호선마포구청<NA>(B2-B3) 승강장1510002
7서울교통공사6호선버티고개<NA>(B3-B4) 승강장117502
8서울교통공사6호선불광<NA>(B1-B4) 승강장1510002
9서울교통공사6호선삼각지<NA>(B3-B4)승강장1510002