Overview

Dataset statistics

Number of variables7
Number of observations4528
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory265.4 KiB
Average record size in memory60.0 B

Variable types

Categorical4
Text1
Numeric2

Dataset

Description부산교통공사에서 운영하는 노선의 승강장 이격거리에 대한 데이터로 철도운영기관명,선명,역명,승강장번호,차량순서,차량출입문번호,안전거리 등이 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041493/fileData.do

Alerts

철도운영기관명 has constant value ""Constant

Reproduction

Analysis started2023-12-12 20:06:40.977695
Analysis finished2023-12-12 20:06:42.351001
Duration1.37 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size35.5 KiB
부산교통공사
4528 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산교통공사
2nd row부산교통공사
3rd row부산교통공사
4th row부산교통공사
5th row부산교통공사

Common Values

ValueCountFrequency (%)
부산교통공사 4528
100.0%

Length

2023-12-13T05:06:42.426178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:06:42.535802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산교통공사 4528
100.0%

선명
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size35.5 KiB
2호선
2016 
1호선
1632 
3호선
544 
4호선
336 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1호선
2nd row1호선
3rd row1호선
4th row1호선
5th row1호선

Common Values

ValueCountFrequency (%)
2호선 2016
44.5%
1호선 1632
36.0%
3호선 544
 
12.0%
4호선 336
 
7.4%

Length

2023-12-13T05:06:42.645753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:06:42.746163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2호선 2016
44.5%
1호선 1632
36.0%
3호선 544
 
12.0%
4호선 336
 
7.4%

역명
Text

Distinct101
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size35.5 KiB
2023-12-13T05:06:42.982618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length2
Mean length4.6696113
Min length2

Characters and Unicode

Total characters21144
Distinct characters163
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row괴정
2nd row괴정
3rd row괴정
4th row괴정
5th row괴정
ValueCountFrequency (%)
서면 96
 
2.1%
연산 80
 
1.8%
수영 80
 
1.8%
덕천(부산과기대 80
 
1.8%
동래 72
 
1.6%
미남 56
 
1.2%
동원 48
 
1.1%
문현 48
 
1.1%
못골(남구청 48
 
1.1%
모라 48
 
1.1%
Other values (91) 3872
85.5%
2023-12-13T05:06:43.394730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 1312
 
6.2%
) 1312
 
6.2%
1096
 
5.2%
1032
 
4.9%
792
 
3.7%
536
 
2.5%
432
 
2.0%
400
 
1.9%
384
 
1.8%
376
 
1.8%
Other values (153) 13472
63.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17816
84.3%
Open Punctuation 1312
 
6.2%
Close Punctuation 1312
 
6.2%
Uppercase Letter 384
 
1.8%
Other Punctuation 320
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1096
 
6.2%
1032
 
5.8%
792
 
4.4%
536
 
3.0%
432
 
2.4%
400
 
2.2%
384
 
2.2%
376
 
2.1%
344
 
1.9%
320
 
1.8%
Other values (143) 12104
67.9%
Uppercase Letter
ValueCountFrequency (%)
B 96
25.0%
E 48
12.5%
X 48
12.5%
C 48
12.5%
O 48
12.5%
S 48
12.5%
K 48
12.5%
Open Punctuation
ValueCountFrequency (%)
( 1312
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1312
100.0%
Other Punctuation
ValueCountFrequency (%)
· 320
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17816
84.3%
Common 2944
 
13.9%
Latin 384
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1096
 
6.2%
1032
 
5.8%
792
 
4.4%
536
 
3.0%
432
 
2.4%
400
 
2.2%
384
 
2.2%
376
 
2.1%
344
 
1.9%
320
 
1.8%
Other values (143) 12104
67.9%
Latin
ValueCountFrequency (%)
B 96
25.0%
E 48
12.5%
X 48
12.5%
C 48
12.5%
O 48
12.5%
S 48
12.5%
K 48
12.5%
Common
ValueCountFrequency (%)
( 1312
44.6%
) 1312
44.6%
· 320
 
10.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 17816
84.3%
ASCII 3008
 
14.2%
None 320
 
1.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 1312
43.6%
) 1312
43.6%
B 96
 
3.2%
E 48
 
1.6%
X 48
 
1.6%
C 48
 
1.6%
O 48
 
1.6%
S 48
 
1.6%
K 48
 
1.6%
Hangul
ValueCountFrequency (%)
1096
 
6.2%
1032
 
5.8%
792
 
4.4%
536
 
3.0%
432
 
2.4%
400
 
2.2%
384
 
2.2%
376
 
2.1%
344
 
1.9%
320
 
1.8%
Other values (143) 12104
67.9%
None
ValueCountFrequency (%)
· 320
100.0%

승강장번호
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size35.5 KiB
1
2264 
2
2264 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 2264
50.0%
2 2264
50.0%

Length

2023-12-13T05:06:43.568874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:06:43.694323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 2264
50.0%
2 2264
50.0%

차량순서
Real number (ℝ)

Distinct8
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.7402827
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size39.9 KiB
2023-12-13T05:06:43.880878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q35
95-th percentile7
Maximum8
Range7
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.9952768
Coefficient of variation (CV)0.53345614
Kurtosis-0.83251994
Mean3.7402827
Median Absolute Deviation (MAD)2
Skewness0.32543965
Sum16936
Variance3.9811293
MonotonicityNot monotonic
2023-12-13T05:06:44.062363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1 732
16.2%
2 732
16.2%
3 732
16.2%
4 732
16.2%
5 596
13.2%
6 596
13.2%
7 204
 
4.5%
8 204
 
4.5%
ValueCountFrequency (%)
1 732
16.2%
2 732
16.2%
3 732
16.2%
4 732
16.2%
5 596
13.2%
6 596
13.2%
7 204
 
4.5%
8 204
 
4.5%
ValueCountFrequency (%)
8 204
 
4.5%
7 204
 
4.5%
6 596
13.2%
5 596
13.2%
4 732
16.2%
3 732
16.2%
2 732
16.2%
1 732
16.2%
Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size35.5 KiB
1
1352 
2
1352 
3
1184 
4
640 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row3
3rd row2
4th row3
5th row1

Common Values

ValueCountFrequency (%)
1 1352
29.9%
2 1352
29.9%
3 1184
26.1%
4 640
14.1%

Length

2023-12-13T05:06:44.234114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:06:44.341244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 1352
29.9%
2 1352
29.9%
3 1184
26.1%
4 640
14.1%

안전거리
Real number (ℝ)

Distinct99
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.9501546
Minimum2.6
Maximum18.5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size39.9 KiB
2023-12-13T05:06:44.487329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.6
5-th percentile4.5
Q15.2
median6.2
Q39
95-th percentile10.5
Maximum18.5
Range15.9
Interquartile range (IQR)3.8

Descriptive statistics

Standard deviation2.2882939
Coefficient of variation (CV)0.3292436
Kurtosis1.8985937
Mean6.9501546
Median Absolute Deviation (MAD)1.2
Skewness1.182079
Sum31470.3
Variance5.2362889
MonotonicityNot monotonic
2023-12-13T05:06:44.678463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5.0 516
 
11.4%
5.5 476
 
10.5%
6.0 283
 
6.2%
4.5 267
 
5.9%
7.0 192
 
4.2%
6.5 191
 
4.2%
9.0 166
 
3.7%
9.5 165
 
3.6%
10.0 124
 
2.7%
4.0 106
 
2.3%
Other values (89) 2042
45.1%
ValueCountFrequency (%)
2.6 1
 
< 0.1%
3.0 7
 
0.2%
3.2 2
 
< 0.1%
3.3 1
 
< 0.1%
3.5 17
 
0.4%
3.6 3
 
0.1%
3.7 6
 
0.1%
3.8 10
 
0.2%
3.9 2
 
< 0.1%
4.0 106
2.3%
ValueCountFrequency (%)
18.5 2
 
< 0.1%
18.0 4
 
0.1%
17.5 5
0.1%
17.0 5
0.1%
16.5 4
 
0.1%
16.0 11
0.2%
15.5 5
0.1%
15.0 3
 
0.1%
14.5 5
0.1%
14.0 11
0.2%

Interactions

2023-12-13T05:06:41.915259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:06:41.290438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:06:42.038242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:06:41.437492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:06:44.820636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
선명승강장번호차량순서차량출입문번호안전거리
선명1.0000.0000.5620.5400.672
승강장번호0.0001.0000.0000.0000.109
차량순서0.5620.0001.0000.1440.155
차량출입문번호0.5400.0000.1441.0000.169
안전거리0.6720.1090.1550.1691.000
2023-12-13T05:06:44.960290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
승강장번호차량출입문번호선명
승강장번호1.0000.0000.000
차량출입문번호0.0001.0000.235
선명0.0000.2351.000
2023-12-13T05:06:45.087204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차량순서안전거리선명승강장번호차량출입문번호
차량순서1.000-0.0950.2780.0000.065
안전거리-0.0951.0000.4720.0830.102
선명0.2780.4721.0000.0000.235
승강장번호0.0000.0830.0001.0000.000
차량출입문번호0.0650.1020.2350.0001.000

Missing values

2023-12-13T05:06:42.179971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:06:42.300399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명승강장번호차량순서차량출입문번호안전거리
0부산교통공사1호선괴정1119.0
1부산교통공사1호선괴정1137.5
2부산교통공사1호선괴정1128.0
3부산교통공사1호선괴정1235.5
4부산교통공사1호선괴정1217.0
5부산교통공사1호선괴정1226.5
6부산교통공사1호선괴정1315.0
7부산교통공사1호선괴정1324.5
8부산교통공사1호선괴정1335.0
9부산교통공사1호선괴정1435.5
철도운영기관명선명역명승강장번호차량순서차량출입문번호안전거리
4518부산교통공사4호선충렬사(안락)2219.1
4519부산교통공사4호선충렬사(안락)2229.0
4520부산교통공사4호선충렬사(안락)2329.3
4521부산교통공사4호선충렬사(안락)2319.5
4522부산교통공사4호선충렬사(안락)2419.0
4523부산교통공사4호선충렬사(안락)24210.0
4524부산교통공사4호선충렬사(안락)2529.4
4525부산교통공사4호선충렬사(안락)2518.9
4526부산교통공사4호선충렬사(안락)2619.0
4527부산교통공사4호선충렬사(안락)2629.3