Overview

Dataset statistics

Number of variables7
Number of observations3192
Missing cells3
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory187.2 KiB
Average record size in memory60.0 B

Variable types

Categorical4
Text1
Numeric2

Dataset

Description대구교통공사에서 운영하는 노선의 승강강 이격거리에 대한 데이터로 철도운영기관명, 선명, 역명, 승강장이격거리에 대한 승강장번호, 차량순서, 차량출입문번호, 안전거리의 데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041498/fileData.do

Alerts

철도운영기관명 has constant value ""Constant

Reproduction

Analysis started2023-12-12 07:07:27.133601
Analysis finished2023-12-12 07:07:28.165453
Duration1.03 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size25.1 KiB
대구교통공사
3192 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구교통공사
2nd row대구교통공사
3rd row대구교통공사
4th row대구교통공사
5th row대구교통공사

Common Values

ValueCountFrequency (%)
대구교통공사 3192
100.0%

Length

2023-12-12T16:07:28.227355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:07:28.320714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구교통공사 3192
100.0%

선명
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size25.1 KiB
1호선
1440 
2호선
1392 
3호선
360 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1호선
2nd row1호선
3rd row1호선
4th row1호선
5th row1호선

Common Values

ValueCountFrequency (%)
1호선 1440
45.1%
2호선 1392
43.6%
3호선 360
 
11.3%

Length

2023-12-12T16:07:28.430253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:07:28.572183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1호선 1440
45.1%
2호선 1392
43.6%
3호선 360
 
11.3%

역명
Text

Distinct86
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size25.1 KiB
2023-12-12T16:07:28.872467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length2
Mean length3.6578947
Min length2

Characters and Unicode

Total characters11676
Distinct characters135
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row각산
2nd row각산
3rd row각산
4th row각산
5th row각산
ValueCountFrequency (%)
반월당 96
 
3.0%
명덕(2.28민주운동기념회관 60
 
1.9%
신남 60
 
1.9%
만촌 48
 
1.5%
교대 48
 
1.5%
죽전 48
 
1.5%
각산 48
 
1.5%
반고개 48
 
1.5%
계명대 48
 
1.5%
고산 48
 
1.5%
Other values (76) 2640
82.7%
2023-12-12T16:07:29.306179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
804
 
6.9%
( 384
 
3.3%
) 384
 
3.3%
372
 
3.2%
288
 
2.5%
276
 
2.4%
252
 
2.2%
252
 
2.2%
240
 
2.1%
240
 
2.1%
Other values (125) 8184
70.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10380
88.9%
Open Punctuation 384
 
3.3%
Close Punctuation 384
 
3.3%
Decimal Number 180
 
1.5%
Uppercase Letter 180
 
1.5%
Other Punctuation 168
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
804
 
7.7%
372
 
3.6%
288
 
2.8%
276
 
2.7%
252
 
2.4%
252
 
2.4%
240
 
2.3%
240
 
2.3%
192
 
1.8%
192
 
1.8%
Other values (113) 7272
70.1%
Uppercase Letter
ValueCountFrequency (%)
B 60
33.3%
S 48
26.7%
K 48
26.7%
C 12
 
6.7%
T 12
 
6.7%
Other Punctuation
ValueCountFrequency (%)
. 60
35.7%
/ 60
35.7%
· 48
28.6%
Decimal Number
ValueCountFrequency (%)
2 120
66.7%
8 60
33.3%
Open Punctuation
ValueCountFrequency (%)
( 384
100.0%
Close Punctuation
ValueCountFrequency (%)
) 384
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10380
88.9%
Common 1116
 
9.6%
Latin 180
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
804
 
7.7%
372
 
3.6%
288
 
2.8%
276
 
2.7%
252
 
2.4%
252
 
2.4%
240
 
2.3%
240
 
2.3%
192
 
1.8%
192
 
1.8%
Other values (113) 7272
70.1%
Common
ValueCountFrequency (%)
( 384
34.4%
) 384
34.4%
2 120
 
10.8%
8 60
 
5.4%
. 60
 
5.4%
/ 60
 
5.4%
· 48
 
4.3%
Latin
ValueCountFrequency (%)
B 60
33.3%
S 48
26.7%
K 48
26.7%
C 12
 
6.7%
T 12
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10332
88.5%
ASCII 1248
 
10.7%
Compat Jamo 48
 
0.4%
None 48
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
804
 
7.8%
372
 
3.6%
288
 
2.8%
276
 
2.7%
252
 
2.4%
252
 
2.4%
240
 
2.3%
240
 
2.3%
192
 
1.9%
192
 
1.9%
Other values (112) 7224
69.9%
ASCII
ValueCountFrequency (%)
( 384
30.8%
) 384
30.8%
2 120
 
9.6%
8 60
 
4.8%
. 60
 
4.8%
B 60
 
4.8%
/ 60
 
4.8%
S 48
 
3.8%
K 48
 
3.8%
C 12
 
1.0%
Compat Jamo
ValueCountFrequency (%)
48
100.0%
None
ValueCountFrequency (%)
· 48
100.0%

승강장번호
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size25.1 KiB
1
1596 
2
1596 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 1596
50.0%
2 1596
50.0%

Length

2023-12-12T16:07:29.442586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:07:29.542215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 1596
50.0%
2 1596
50.0%

차량순서
Real number (ℝ)

Distinct6
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.3308271
Minimum1
Maximum6
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size28.2 KiB
2023-12-12T16:07:29.635888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile6
Maximum6
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.6996918
Coefficient of variation (CV)0.51029122
Kurtosis-1.2327157
Mean3.3308271
Median Absolute Deviation (MAD)1
Skewness0.15320804
Sum10632
Variance2.8889522
MonotonicityNot monotonic
2023-12-12T16:07:29.747982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
1 592
18.5%
2 592
18.5%
3 592
18.5%
4 472
14.8%
5 472
14.8%
6 472
14.8%
ValueCountFrequency (%)
1 592
18.5%
2 592
18.5%
3 592
18.5%
4 472
14.8%
5 472
14.8%
6 472
14.8%
ValueCountFrequency (%)
6 472
14.8%
5 472
14.8%
4 472
14.8%
3 592
18.5%
2 592
18.5%
1 592
18.5%
Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size25.1 KiB
1
888 
2
888 
4
708 
3
708 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row4
3rd row3
4th row2
5th row3

Common Values

ValueCountFrequency (%)
1 888
27.8%
2 888
27.8%
4 708
22.2%
3 708
22.2%

Length

2023-12-12T16:07:29.869175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:07:29.977626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 888
27.8%
2 888
27.8%
4 708
22.2%
3 708
22.2%

안전거리
Real number (ℝ)

Distinct79
Distinct (%)2.5%
Missing3
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean6.0501098
Minimum0.3
Maximum12.4
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size28.2 KiB
2023-12-12T16:07:30.183039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.3
5-th percentile3.5
Q15
median6
Q37
95-th percentile8
Maximum12.4
Range12.1
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.3874248
Coefficient of variation (CV)0.22932225
Kurtosis1.5544847
Mean6.0501098
Median Absolute Deviation (MAD)1
Skewness-0.22787015
Sum19293.8
Variance1.9249474
MonotonicityNot monotonic
2023-12-12T16:07:30.361737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6.5 538
16.9%
6.0 473
14.8%
7.0 472
14.8%
7.5 210
 
6.6%
5.0 203
 
6.4%
5.5 197
 
6.2%
4.5 101
 
3.2%
8.0 94
 
2.9%
4.0 69
 
2.2%
3.0 61
 
1.9%
Other values (69) 771
24.2%
ValueCountFrequency (%)
0.3 1
 
< 0.1%
0.5 2
 
0.1%
1.0 1
 
< 0.1%
1.2 1
 
< 0.1%
1.5 6
 
0.2%
1.9 1
 
< 0.1%
2.0 17
0.5%
2.4 1
 
< 0.1%
2.5 31
1.0%
2.7 3
 
0.1%
ValueCountFrequency (%)
12.4 1
 
< 0.1%
12.0 2
 
0.1%
11.8 2
 
0.1%
11.5 5
0.2%
11.2 2
 
0.1%
11.0 6
0.2%
10.8 2
 
0.1%
10.5 1
 
< 0.1%
10.0 1
 
< 0.1%
9.6 1
 
< 0.1%

Interactions

2023-12-12T16:07:27.735102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:07:27.515125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:07:27.836869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:07:27.615027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:07:30.476173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
선명역명승강장번호차량순서차량출입문번호안전거리
선명1.0000.9970.0000.4730.2340.592
역명0.9971.0000.0000.0000.1470.768
승강장번호0.0000.0001.0000.0000.0000.070
차량순서0.4730.0000.0001.0000.0670.171
차량출입문번호0.2340.1470.0000.0671.0000.238
안전거리0.5920.7680.0700.1710.2381.000
2023-12-12T16:07:30.603623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
선명승강장번호차량출입문번호
선명1.0000.0000.223
승강장번호0.0001.0000.000
차량출입문번호0.2230.0001.000
2023-12-12T16:07:30.711947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차량순서안전거리선명승강장번호차량출입문번호
차량순서1.0000.0970.2220.0000.043
안전거리0.0971.0000.4350.0540.144
선명0.2220.4351.0000.0000.223
승강장번호0.0000.0540.0001.0000.000
차량출입문번호0.0430.1440.2230.0001.000

Missing values

2023-12-12T16:07:27.969398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:07:28.119617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명승강장번호차량순서차량출입문번호안전거리
0대구교통공사1호선각산1116.0
1대구교통공사1호선각산1145.2
2대구교통공사1호선각산1137.5
3대구교통공사1호선각산1127.0
4대구교통공사1호선각산1237.5
5대구교통공사1호선각산1227.0
6대구교통공사1호선각산1216.0
7대구교통공사1호선각산1246.5
8대구교통공사1호선각산1316.0
9대구교통공사1호선각산1326.0
철도운영기관명선명역명승강장번호차량순서차량출입문번호안전거리
3182대구교통공사3호선황금1224.4
3183대구교통공사3호선황금1214.5
3184대구교통공사3호선황금1324.9
3185대구교통공사3호선황금1314.9
3186대구교통공사3호선황금2114.5
3187대구교통공사3호선황금2124.4
3188대구교통공사3호선황금2224.7
3189대구교통공사3호선황금2215.5
3190대구교통공사3호선황금2314.2
3191대구교통공사3호선황금2324.7