Overview

Dataset statistics

Number of variables9
Number of observations194
Missing cells0
Missing cells (%)0.0%
Duplicate rows45
Duplicate rows (%)23.2%
Total size in memory14.1 KiB
Average record size in memory74.7 B

Variable types

Categorical8
Numeric1

Dataset

Description부산교통공사에서 운영하는 역사들의 환승정보 데이터로 철도운영기관명, 선명, 역명, 환승철도운영기관, 환승선명, 환승이후역명, 환승기점역명, 차량순서, 차량출입문번호의 데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041091/fileData.do

Alerts

Dataset has 45 (23.2%) duplicate rowsDuplicates
철도운영기관명 is highly overall correlated with 선명 and 2 other fieldsHigh correlation
환승철도운영기관 is highly overall correlated with 역명 and 3 other fieldsHigh correlation
환승기점역명 is highly overall correlated with 환승철도운영기관 and 2 other fieldsHigh correlation
선명 is highly overall correlated with 철도운영기관명 and 2 other fieldsHigh correlation
환승선명 is highly overall correlated with 역명 and 3 other fieldsHigh correlation
역명 is highly overall correlated with 철도운영기관명 and 4 other fieldsHigh correlation
환승이후역명 is highly overall correlated with 철도운영기관명 and 5 other fieldsHigh correlation
철도운영기관명 is highly imbalanced (53.5%)Imbalance
환승철도운영기관 is highly imbalanced (69.9%)Imbalance

Reproduction

Analysis started2023-12-12 19:31:31.401375
Analysis finished2023-12-12 19:31:32.564015
Duration1.16 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
부산교통공사
166 
부산김해경전철
 
16
코레일
 
12

Length

Max length7
Median length6
Mean length5.8969072
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산교통공사
2nd row부산교통공사
3rd row부산교통공사
4th row부산교통공사
5th row부산교통공사

Common Values

ValueCountFrequency (%)
부산교통공사 166
85.6%
부산김해경전철 16
 
8.2%
코레일 12
 
6.2%

Length

2023-12-13T04:31:32.677759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:31:32.812645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산교통공사 166
85.6%
부산김해경전철 16
 
8.2%
코레일 12
 
6.2%

선명
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
3호선
80 
4호선
54 
1호선
16 
2호선
16 
부산김해경전철
16 

Length

Max length7
Median length3
Mean length3.2680412
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1호선
2nd row1호선
3rd row1호선
4th row1호선
5th row1호선

Common Values

ValueCountFrequency (%)
3호선 80
41.2%
4호선 54
27.8%
1호선 16
 
8.2%
2호선 16
 
8.2%
부산김해경전철 16
 
8.2%
동해 12
 
6.2%

Length

2023-12-13T04:31:32.953197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:31:33.071913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3호선 80
41.2%
4호선 54
27.8%
1호선 16
 
8.2%
2호선 16
 
8.2%
부산김해경전철 16
 
8.2%
동해 12
 
6.2%

역명
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)6.2%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
수영
66 
미남
52 
사상(서부터미널)
10 
대저
10 
서면
Other values (7)
48 

Length

Max length10
Median length2
Mean length3.1649485
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서면
2nd row서면
3rd row서면
4th row서면
5th row연산

Common Values

ValueCountFrequency (%)
수영 66
34.0%
미남 52
26.8%
사상(서부터미널) 10
 
5.2%
대저 10
 
5.2%
서면 8
 
4.1%
연산 8
 
4.1%
교대 8
 
4.1%
동래 8
 
4.1%
덕천(부산과기대) 8
 
4.1%
거제(법원·검찰청) 8
 
4.1%
Other values (2) 8
 
4.1%

Length

2023-12-13T04:31:33.207913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수영 66
34.0%
미남 52
26.8%
사상(서부터미널 10
 
5.2%
대저 10
 
5.2%
서면 8
 
4.1%
연산 8
 
4.1%
교대 8
 
4.1%
동래 8
 
4.1%
덕천(부산과기대 8
 
4.1%
거제(법원·검찰청 8
 
4.1%
Other values (2) 8
 
4.1%

환승철도운영기관
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
부산교통공사
178 
코레일
 
12
부산김해경전철
 
4

Length

Max length7
Median length6
Mean length5.8350515
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산교통공사
2nd row부산교통공사
3rd row부산교통공사
4th row부산교통공사
5th row부산교통공사

Common Values

ValueCountFrequency (%)
부산교통공사 178
91.8%
코레일 12
 
6.2%
부산김해경전철 4
 
2.1%

Length

2023-12-13T04:31:33.357443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:31:33.479887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산교통공사 178
91.8%
코레일 12
 
6.2%
부산김해경전철 4
 
2.1%

환승선명
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
2호선
84 
3호선
72 
1호선
16 
동해
12 
4호선
 
6

Length

Max length7
Median length3
Mean length3.0206186
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2호선
2nd row2호선
3rd row2호선
4th row2호선
5th row3호선

Common Values

ValueCountFrequency (%)
2호선 84
43.3%
3호선 72
37.1%
1호선 16
 
8.2%
동해 12
 
6.2%
4호선 6
 
3.1%
부산김해경전철 4
 
2.1%

Length

2023-12-13T04:31:33.589386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:31:33.714974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2호선 84
43.3%
3호선 72
37.1%
1호선 16
 
8.2%
동해 12
 
6.2%
4호선 6
 
3.1%
부산김해경전철 4
 
2.1%

환승이후역명
Categorical

HIGH CORRELATION 

Distinct35
Distinct (%)18.0%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
광안
32 
민락
32 
만덕
25 
사직
25 
체육공원
Other values (30)
72 

Length

Max length16
Median length2
Mean length3.0721649
Min length2

Unique

Unique2 ?
Unique (%)1.0%

Sample

1st row부암(온종합병원)
2nd row전포
3rd row부암(온종합병원)
4th row전포
5th row물만골

Common Values

ValueCountFrequency (%)
광안 32
16.5%
민락 32
16.5%
만덕 25
12.9%
사직 25
12.9%
체육공원 8
 
4.1%
동래 6
 
3.1%
교대 6
 
3.1%
연산 4
 
2.1%
덕포 4
 
2.1%
감전(사상구청) 4
 
2.1%
Other values (25) 48
24.7%

Length

2023-12-13T04:31:33.905501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
광안 32
16.5%
민락 32
16.5%
만덕 25
12.9%
사직 25
12.9%
체육공원 8
 
4.1%
동래 6
 
3.1%
교대 6
 
3.1%
연산 4
 
2.1%
덕포 4
 
2.1%
감전(사상구청 4
 
2.1%
Other values (25) 48
24.7%

환승기점역명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
양산(시청·동원과학기술대학교)
42 
장산(해운대백병원)
42 
수영
39 
대저
33 
노포(종합버스터미널)
Other values (8)
30 

Length

Max length16
Median length11
Mean length7.3865979
Min length2

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row양산(시청·동원과학기술대학교)
2nd row장산(해운대백병원)
3rd row양산(시청·동원과학기술대학교)
4th row장산(해운대백병원)
5th row수영

Common Values

ValueCountFrequency (%)
양산(시청·동원과학기술대학교) 42
21.6%
장산(해운대백병원) 42
21.6%
수영 39
20.1%
대저 33
17.0%
노포(종합버스터미널) 8
 
4.1%
부전 6
 
3.1%
일광 6
 
3.1%
신평 6
 
3.1%
안평(고촌주택단지) 4
 
2.1%
대사 3
 
1.5%
Other values (3) 5
 
2.6%

Length

2023-12-13T04:31:34.086171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
양산(시청·동원과학기술대학교 42
21.6%
장산(해운대백병원 42
21.6%
수영 39
20.1%
대저 33
17.0%
노포(종합버스터미널 8
 
4.1%
부전 6
 
3.1%
일광 6
 
3.1%
신평 6
 
3.1%
안평(고촌주택단지 4
 
2.1%
대사 3
 
1.5%
Other values (3) 5
 
2.6%

차량순서
Real number (ℝ)

Distinct8
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0154639
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.8 KiB
2023-12-13T04:31:34.233428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile6
Maximum8
Range7
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.7319814
Coefficient of variation (CV)0.57436649
Kurtosis-0.50153074
Mean3.0154639
Median Absolute Deviation (MAD)1
Skewness0.59264507
Sum585
Variance2.9997596
MonotonicityNot monotonic
2023-12-13T04:31:34.362367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1 47
24.2%
2 44
22.7%
4 35
18.0%
3 29
14.9%
6 18
 
9.3%
5 17
 
8.8%
7 2
 
1.0%
8 2
 
1.0%
ValueCountFrequency (%)
1 47
24.2%
2 44
22.7%
3 29
14.9%
4 35
18.0%
5 17
 
8.8%
6 18
 
9.3%
7 2
 
1.0%
8 2
 
1.0%
ValueCountFrequency (%)
8 2
 
1.0%
7 2
 
1.0%
6 18
 
9.3%
5 17
 
8.8%
4 35
18.0%
3 29
14.9%
2 44
22.7%
1 47
24.2%
Distinct4
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
1
64 
2
50 
4
45 
3
35 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 64
33.0%
2 50
25.8%
4 45
23.2%
3 35
18.0%

Length

2023-12-13T04:31:34.528442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:31:34.663775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 64
33.0%
2 50
25.8%
4 45
23.2%
3 35
18.0%

Interactions

2023-12-13T04:31:32.162791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:31:34.781035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
철도운영기관명선명역명환승철도운영기관환승선명환승이후역명환승기점역명차량순서차량출입문번호
철도운영기관명1.0001.0000.9800.0000.2830.9900.4360.2930.203
선명1.0001.0000.9810.5440.8380.9950.7350.5240.296
역명0.9800.9811.0000.8740.9480.9950.8020.6380.467
환승철도운영기관0.0000.5440.8741.0001.0000.9801.0000.4250.128
환승선명0.2830.8380.9481.0001.0000.9861.0000.4170.196
환승이후역명0.9900.9950.9950.9800.9861.0000.9870.7000.316
환승기점역명0.4360.7350.8021.0001.0000.9871.0000.3090.000
차량순서0.2930.5240.6380.4250.4170.7000.3091.0000.247
차량출입문번호0.2030.2960.4670.1280.1960.3160.0000.2471.000
2023-12-13T04:31:34.951643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차량출입문번호철도운영기관명환승철도운영기관역명환승기점역명선명환승선명환승이후역명
차량출입문번호1.0000.1920.1200.2260.0000.1930.1260.150
철도운영기관명0.1921.0000.0000.8130.2670.9920.1210.879
환승철도운영기관0.1200.0001.0000.6000.9730.2630.9920.844
역명0.2260.8130.6001.0000.4630.7840.6690.885
환승기점역명0.0000.2670.9730.4631.0000.4690.9810.827
선명0.1930.9920.2630.7840.4691.0000.4550.891
환승선명0.1260.1210.9920.6690.9810.4551.0000.839
환승이후역명0.1500.8790.8440.8850.8270.8910.8391.000
2023-12-13T04:31:35.438326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차량순서철도운영기관명선명역명환승철도운영기관환승선명환승이후역명환승기점역명차량출입문번호
차량순서1.0000.1910.3220.3280.2950.2450.3310.1420.111
철도운영기관명0.1911.0000.9920.8130.0000.1210.8790.2670.192
선명0.3220.9921.0000.7840.2630.4550.8910.4690.193
역명0.3280.8130.7841.0000.6000.6690.8850.4630.226
환승철도운영기관0.2950.0000.2630.6001.0000.9920.8440.9730.120
환승선명0.2450.1210.4550.6690.9921.0000.8390.9810.126
환승이후역명0.3310.8790.8910.8850.8440.8391.0000.8270.150
환승기점역명0.1420.2670.4690.4630.9730.9810.8271.0000.000
차량출입문번호0.1110.1920.1930.2260.1200.1260.1500.0001.000

Missing values

2023-12-13T04:31:32.307093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:31:32.482501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명환승철도운영기관환승선명환승이후역명환승기점역명차량순서차량출입문번호
0부산교통공사1호선서면부산교통공사2호선부암(온종합병원)양산(시청·동원과학기술대학교)61
1부산교통공사1호선서면부산교통공사2호선전포장산(해운대백병원)61
2부산교통공사1호선서면부산교통공사2호선부암(온종합병원)양산(시청·동원과학기술대학교)71
3부산교통공사1호선서면부산교통공사2호선전포장산(해운대백병원)71
4부산교통공사1호선연산부산교통공사3호선물만골수영41
5부산교통공사1호선연산부산교통공사3호선거제대저41
6부산교통공사1호선연산부산교통공사3호선거제대저51
7부산교통공사1호선연산부산교통공사3호선물만골수영51
8부산교통공사1호선교대코레일동해거제(법원·검찰청)부전62
9부산교통공사1호선교대코레일동해동래일광62
철도운영기관명선명역명환승철도운영기관환승선명환승이후역명환승기점역명차량순서차량출입문번호
184코레일동해거제(법원·검찰청)부산교통공사3호선연산수영22
185코레일동해거제(법원·검찰청)부산교통공사3호선종합운동장대저22
186코레일동해교대부산교통공사1호선연산다대포해수욕장44
187코레일동해교대부산교통공사1호선동래노포(종합버스터미널)44
188코레일동해교대부산교통공사1호선동래노포(종합버스터미널)11
189코레일동해교대부산교통공사1호선연산다대포해수욕장11
190코레일동해벡스코부산교통공사2호선동백장산(해운대백병원)32
191코레일동해벡스코부산교통공사2호선동백장산(해운대백병원)24
192코레일동해벡스코부산교통공사2호선센텀시티(BEXCO·신세계)양산(시청·동원과학기술대학교)32
193코레일동해벡스코부산교통공사2호선센텀시티(BEXCO·신세계)양산(시청·동원과학기술대학교)24

Duplicate rows

Most frequently occurring

철도운영기관명선명역명환승철도운영기관환승선명환승이후역명환승기점역명차량순서차량출입문번호# duplicates
0부산교통공사2호선사상(서부터미널)부산김해경전철부산김해경전철괘법르네시떼(강변공원)대사332
1부산교통공사2호선서면부산교통공사1호선범내골신평542
2부산교통공사2호선서면부산교통공사1호선부전(부산시민공원·송상현광장)노포(종합버스터미널)642
3부산교통공사3호선덕천(부산과기대)부산교통공사2호선구명장산(해운대백병원)112
4부산교통공사3호선덕천(부산과기대)부산교통공사2호선수정(방송통신대)양산(시청·동원과학기술대학교)242
5부산교통공사3호선수영부산교통공사2호선광안양산(시청·동원과학기술대학교)112
6부산교통공사3호선수영부산교통공사2호선광안양산(시청·동원과학기술대학교)122
7부산교통공사3호선수영부산교통공사2호선광안양산(시청·동원과학기술대학교)132
8부산교통공사3호선수영부산교통공사2호선광안양산(시청·동원과학기술대학교)142
9부산교통공사3호선수영부산교통공사2호선광안양산(시청·동원과학기술대학교)212