Overview

Dataset statistics

Number of variables6
Number of observations630
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory32.1 KiB
Average record size in memory52.2 B

Variable types

Numeric4
Text2

Dataset

Description한국철도공사가 운영하고 있는 철도역(간선철도)과 광역철도(지하철, 도시철도)역에 대한 승강장 이동 편의 시설인 엘리베이터, 에스컬레이터, 장애인 리프트 현황입니다.
Author한국철도공사
URLhttps://www.data.go.kr/data/15120911/fileData.do

Alerts

엘리베이터 is highly overall correlated with 에스컬레이터High correlation
에스컬레이터 is highly overall correlated with 엘리베이터High correlation
연번 has unique valuesUnique
역명 has unique valuesUnique
엘리베이터 has 244 (38.7%) zerosZeros
에스컬레이터 has 295 (46.8%) zerosZeros
휠체어리프트 has 580 (92.1%) zerosZeros

Reproduction

Analysis started2023-12-12 08:50:17.302907
Analysis finished2023-12-12 08:50:19.842436
Duration2.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct630
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean315.5
Minimum1
Maximum630
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.7 KiB
2023-12-12T17:50:19.929507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile32.45
Q1158.25
median315.5
Q3472.75
95-th percentile598.55
Maximum630
Range629
Interquartile range (IQR)314.5

Descriptive statistics

Standard deviation182.00962
Coefficient of variation (CV)0.5768926
Kurtosis-1.2
Mean315.5
Median Absolute Deviation (MAD)157.5
Skewness0
Sum198765
Variance33127.5
MonotonicityStrictly increasing
2023-12-12T17:50:20.150965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
425 1
 
0.2%
418 1
 
0.2%
419 1
 
0.2%
420 1
 
0.2%
421 1
 
0.2%
422 1
 
0.2%
423 1
 
0.2%
424 1
 
0.2%
426 1
 
0.2%
Other values (620) 620
98.4%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
630 1
0.2%
629 1
0.2%
628 1
0.2%
627 1
0.2%
626 1
0.2%
625 1
0.2%
624 1
0.2%
623 1
0.2%
622 1
0.2%
621 1
0.2%
Distinct63
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
2023-12-12T17:50:20.354071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.1
Min length3

Characters and Unicode

Total characters1953
Distinct characters74
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)4.1%

Sample

1st row경부선
2nd row경부선
3rd row경부선
4th row경원선
5th row경원선
ValueCountFrequency (%)
경부선 83
 
13.2%
중앙선 50
 
7.9%
분당선 35
 
5.6%
경원선 35
 
5.6%
호남선 33
 
5.2%
동해선 32
 
5.1%
경전선 30
 
4.8%
전라선 27
 
4.3%
영동선 26
 
4.1%
장항선 24
 
3.8%
Other values (53) 255
40.5%
2023-12-12T17:50:20.727227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
622
31.8%
248
 
12.7%
100
 
5.1%
61
 
3.1%
58
 
3.0%
53
 
2.7%
50
 
2.6%
50
 
2.6%
39
 
2.0%
39
 
2.0%
Other values (64) 633
32.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1953
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
622
31.8%
248
 
12.7%
100
 
5.1%
61
 
3.1%
58
 
3.0%
53
 
2.7%
50
 
2.6%
50
 
2.6%
39
 
2.0%
39
 
2.0%
Other values (64) 633
32.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1953
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
622
31.8%
248
 
12.7%
100
 
5.1%
61
 
3.1%
58
 
3.0%
53
 
2.7%
50
 
2.6%
50
 
2.6%
39
 
2.0%
39
 
2.0%
Other values (64) 633
32.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1953
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
622
31.8%
248
 
12.7%
100
 
5.1%
61
 
3.1%
58
 
3.0%
53
 
2.7%
50
 
2.6%
50
 
2.6%
39
 
2.0%
39
 
2.0%
Other values (64) 633
32.4%

역명
Text

UNIQUE 

Distinct630
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
2023-12-12T17:50:21.050772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length4
Mean length4.4095238
Min length4

Characters and Unicode

Total characters2778
Distinct characters265
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique630 ?
Unique (%)100.0%

Sample

1st row남영역사
2nd row서울역사
3rd row용산역사
4th row광운대역사
5th row녹천역사
ValueCountFrequency (%)
남영역사 1
 
0.2%
오수역사 1
 
0.2%
전주역사 1
 
0.2%
남원역사 1
 
0.2%
봉천역사 1
 
0.2%
산성역사 1
 
0.2%
삼례역사 1
 
0.2%
서도역사 1
 
0.2%
신리역사 1
 
0.2%
동산역사 1
 
0.2%
Other values (620) 620
98.4%
2023-12-12T17:50:21.515330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
646
23.3%
631
22.7%
57
 
2.1%
53
 
1.9%
36
 
1.3%
36
 
1.3%
34
 
1.2%
33
 
1.2%
28
 
1.0%
26
 
0.9%
Other values (255) 1198
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2774
99.9%
Close Punctuation 2
 
0.1%
Open Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
646
23.3%
631
22.7%
57
 
2.1%
53
 
1.9%
36
 
1.3%
36
 
1.3%
34
 
1.2%
33
 
1.2%
28
 
1.0%
26
 
0.9%
Other values (253) 1194
43.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2774
99.9%
Common 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
646
23.3%
631
22.7%
57
 
2.1%
53
 
1.9%
36
 
1.3%
36
 
1.3%
34
 
1.2%
33
 
1.2%
28
 
1.0%
26
 
0.9%
Other values (253) 1194
43.0%
Common
ValueCountFrequency (%)
) 2
50.0%
( 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2774
99.9%
ASCII 4
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
646
23.3%
631
22.7%
57
 
2.1%
53
 
1.9%
36
 
1.3%
36
 
1.3%
34
 
1.2%
33
 
1.2%
28
 
1.0%
26
 
0.9%
Other values (253) 1194
43.0%
ASCII
ValueCountFrequency (%)
) 2
50.0%
( 2
50.0%

엘리베이터
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct17
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.2460317
Minimum0
Maximum18
Zeros244
Zeros (%)38.7%
Negative0
Negative (%)0.0%
Memory size5.7 KiB
2023-12-12T17:50:21.646282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q34
95-th percentile6
Maximum18
Range18
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.5049601
Coefficient of variation (CV)1.1152826
Kurtosis6.56422
Mean2.2460317
Median Absolute Deviation (MAD)2
Skewness1.8999274
Sum1415
Variance6.2748252
MonotonicityNot monotonic
2023-12-12T17:50:21.766152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
0 244
38.7%
2 115
18.3%
4 108
17.1%
3 83
 
13.2%
5 23
 
3.7%
6 20
 
3.2%
1 12
 
1.9%
7 6
 
1.0%
8 4
 
0.6%
10 4
 
0.6%
Other values (7) 11
 
1.7%
ValueCountFrequency (%)
0 244
38.7%
1 12
 
1.9%
2 115
18.3%
3 83
 
13.2%
4 108
17.1%
5 23
 
3.7%
6 20
 
3.2%
7 6
 
1.0%
8 4
 
0.6%
9 2
 
0.3%
ValueCountFrequency (%)
18 1
 
0.2%
16 1
 
0.2%
14 3
0.5%
13 2
 
0.3%
12 1
 
0.2%
11 1
 
0.2%
10 4
0.6%
9 2
 
0.3%
8 4
0.6%
7 6
1.0%

에스컬레이터
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct27
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.3269841
Minimum0
Maximum31
Zeros295
Zeros (%)46.8%
Negative0
Negative (%)0.0%
Memory size5.7 KiB
2023-12-12T17:50:21.885758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q36
95-th percentile17.1
Maximum31
Range31
Interquartile range (IQR)6

Descriptive statistics

Standard deviation5.814384
Coefficient of variation (CV)1.3437498
Kurtosis3.1232763
Mean4.3269841
Median Absolute Deviation (MAD)2
Skewness1.7372281
Sum2726
Variance33.807061
MonotonicityNot monotonic
2023-12-12T17:50:22.019653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
0 295
46.8%
4 96
 
15.2%
8 51
 
8.1%
6 51
 
8.1%
2 20
 
3.2%
10 17
 
2.7%
12 17
 
2.7%
16 11
 
1.7%
20 9
 
1.4%
7 8
 
1.3%
Other values (17) 55
 
8.7%
ValueCountFrequency (%)
0 295
46.8%
1 4
 
0.6%
2 20
 
3.2%
3 4
 
0.6%
4 96
 
15.2%
5 5
 
0.8%
6 51
 
8.1%
7 8
 
1.3%
8 51
 
8.1%
9 2
 
0.3%
ValueCountFrequency (%)
31 1
 
0.2%
30 1
 
0.2%
28 2
 
0.3%
26 1
 
0.2%
24 2
 
0.3%
23 4
0.6%
22 4
0.6%
21 1
 
0.2%
20 9
1.4%
19 2
 
0.3%

휠체어리프트
Real number (ℝ)

ZEROS 

Distinct6
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.17619048
Minimum0
Maximum5
Zeros580
Zeros (%)92.1%
Negative0
Negative (%)0.0%
Memory size5.7 KiB
2023-12-12T17:50:22.123480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum5
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.67835354
Coefficient of variation (CV)3.8501147
Kurtosis22.054915
Mean0.17619048
Median Absolute Deviation (MAD)0
Skewness4.4888478
Sum111
Variance0.46016352
MonotonicityNot monotonic
2023-12-12T17:50:22.230969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 580
92.1%
2 20
 
3.2%
1 14
 
2.2%
3 11
 
1.7%
5 4
 
0.6%
4 1
 
0.2%
ValueCountFrequency (%)
0 580
92.1%
1 14
 
2.2%
2 20
 
3.2%
3 11
 
1.7%
4 1
 
0.2%
5 4
 
0.6%
ValueCountFrequency (%)
5 4
 
0.6%
4 1
 
0.2%
3 11
 
1.7%
2 20
 
3.2%
1 14
 
2.2%
0 580
92.1%

Interactions

2023-12-12T17:50:19.209123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:17.785205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:18.328175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:18.787512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:19.307014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:17.927713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:18.466047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:18.887378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:19.447600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:18.071946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:18.569566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:18.994704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:19.549495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:18.205019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:18.682475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:50:19.115344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:50:22.317989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번노선명엘리베이터에스컬레이터휠체어리프트
연번1.0000.9640.5380.5000.267
노선명0.9641.0000.4520.5540.000
엘리베이터0.5380.4521.0000.8680.263
에스컬레이터0.5000.5540.8681.0000.198
휠체어리프트0.2670.0000.2630.1981.000
2023-12-12T17:50:22.422702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번엘리베이터에스컬레이터휠체어리프트
연번1.000-0.384-0.362-0.127
엘리베이터-0.3841.0000.8360.065
에스컬레이터-0.3620.8361.0000.088
휠체어리프트-0.1270.0650.0881.000

Missing values

2023-12-12T17:50:19.686006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:50:19.802499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번노선명역명엘리베이터에스컬레이터휠체어리프트
01경부선남영역사001
12경부선서울역사18231
23경부선용산역사14230
34경원선광운대역사142
45경원선녹천역사800
56경원선도봉역사300
67경원선도봉산역사5100
78경원선망월사역사442
89경원선방학역사400
910경원선석계역사110
연번노선명역명엘리베이터에스컬레이터휠체어리프트
620621온산선온산역사000
621622우암선신선대역사000
622623우암선우암역사000
623624울산신항선용암역사000
624625울산신항선울산신항역사000
625626울산항선울산항역사000
626627진해선신창원역사000
627628진해선남창원역사000
628629진해선성주사역사000
629630진해선진해역사000