Overview

Dataset statistics

Number of variables8
Number of observations139
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.7%
Total size in memory8.8 KiB
Average record size in memory64.9 B

Variable types

Categorical7
Text1

Dataset

Description부산교통공사에서 관리하는 도시광역철도역들의 철도운영기관명, 선명, 역명, 상하행구분, 출입구번호, 상세위치, 시작층, 종료층의 데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041352/fileData.do

Alerts

철도운영기관 has constant value ""Constant
선명 has constant value ""Constant
Dataset has 1 (0.7%) duplicate rowsDuplicates
역명 is highly overall correlated with 시작층 and 1 other fieldsHigh correlation
시작층 is highly overall correlated with 역명 and 1 other fieldsHigh correlation
종료층 is highly overall correlated with 역명 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 20:44:58.498905
Analysis finished2023-12-12 20:44:59.571159
Duration1.07 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
부산교통공사
139 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산교통공사
2nd row부산교통공사
3rd row부산교통공사
4th row부산교통공사
5th row부산교통공사

Common Values

ValueCountFrequency (%)
부산교통공사 139
100.0%

Length

2023-12-13T05:44:59.659048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:44:59.776633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산교통공사 139
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
1호선
139 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1호선
2nd row1호선
3rd row1호선
4th row1호선
5th row1호선

Common Values

ValueCountFrequency (%)
1호선 139
100.0%

Length

2023-12-13T05:44:59.897240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:45:00.007889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1호선 139
100.0%

역명
Categorical

HIGH CORRELATION 

Distinct20
Distinct (%)14.4%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
다대포항
21 
장림
14 
다대포해수욕장
13 
낫개
13 
신장림
12 
Other values (15)
66 

Length

Max length11
Median length2
Mean length3.3093525
Min length2

Unique

Unique3 ?
Unique (%)2.2%

Sample

1st row교대
2nd row남포
3rd row남포
4th row남포
5th row남포

Common Values

ValueCountFrequency (%)
다대포항 21
15.1%
장림 14
10.1%
다대포해수욕장 13
9.4%
낫개 13
9.4%
신장림 12
8.6%
동매 11
7.9%
서면 10
 
7.2%
서대신 7
 
5.0%
남포 7
 
5.0%
부산 6
 
4.3%
Other values (10) 25
18.0%

Length

2023-12-13T05:45:00.149291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
다대포항 21
15.1%
장림 14
10.1%
다대포해수욕장 13
9.4%
낫개 13
9.4%
신장림 12
8.6%
동매 11
7.9%
서면 10
 
7.2%
서대신 7
 
5.0%
남포 7
 
5.0%
자갈치 6
 
4.3%
Other values (10) 25
18.0%

상하행구분
Categorical

Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
상행
98 
하행
41 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상행
2nd row상행
3rd row하행
4th row상행
5th row상행

Common Values

ValueCountFrequency (%)
상행 98
70.5%
하행 41
29.5%

Length

2023-12-13T05:45:00.308946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:45:00.447786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상행 98
70.5%
하행 41
29.5%

출입구번호
Categorical

Distinct12
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
<NA>
64 
3
17 
4
13 
1
11 
2
Other values (7)
25 

Length

Max length4
Median length3
Mean length2.5251799
Min length1

Unique

Unique2 ?
Unique (%)1.4%

Sample

1st row4
2nd row2
3rd row6
4th row6
5th row1/3

Common Values

ValueCountFrequency (%)
<NA> 64
46.0%
3 17
 
12.2%
4 13
 
9.4%
1 11
 
7.9%
2 9
 
6.5%
6 7
 
5.0%
5 6
 
4.3%
1/3 5
 
3.6%
10 3
 
2.2%
2/4 2
 
1.4%
Other values (2) 2
 
1.4%

Length

2023-12-13T05:45:00.591487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 64
46.0%
3 17
 
12.2%
4 13
 
9.4%
1 11
 
7.9%
2 9
 
6.5%
6 7
 
5.0%
5 6
 
4.3%
1/3 5
 
3.6%
10 3
 
2.2%
2/4 2
 
1.4%
Other values (2) 2
 
1.4%
Distinct125
Distinct (%)89.9%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T05:45:00.949926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length26
Mean length18.683453
Min length6

Characters and Unicode

Total characters2597
Distinct characters124
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique117 ?
Unique (%)84.2%

Sample

1st row(B1) 4번출입구
2nd row(B1) 2번 출입구 방향
3rd row(1F) 6번 출입구 앞
4th row(B1) 6번 출입구 방향
5th row(B2) 자갈치역 방향 승강장 6-2 출입문 앞
ValueCountFrequency (%)
방향 87
 
12.5%
출입구 60
 
8.6%
42
 
6.0%
b2 38
 
5.4%
출입문 37
 
5.3%
근처 37
 
5.3%
승강장 33
 
4.7%
b1 32
 
4.6%
개찰구 14
 
2.0%
13
 
1.9%
Other values (113) 305
43.7%
2023-12-13T05:45:01.506624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
569
21.9%
) 115
 
4.4%
( 115
 
4.4%
107
 
4.1%
106
 
4.1%
1 106
 
4.1%
B 96
 
3.7%
88
 
3.4%
87
 
3.4%
2 84
 
3.2%
Other values (114) 1124
43.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1294
49.8%
Space Separator 569
21.9%
Decimal Number 330
 
12.7%
Close Punctuation 115
 
4.4%
Open Punctuation 115
 
4.4%
Uppercase Letter 114
 
4.4%
Dash Punctuation 52
 
2.0%
Other Punctuation 6
 
0.2%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
107
 
8.3%
106
 
8.2%
88
 
6.8%
87
 
6.7%
78
 
6.0%
75
 
5.8%
53
 
4.1%
53
 
4.1%
48
 
3.7%
46
 
3.6%
Other values (95) 553
42.7%
Decimal Number
ValueCountFrequency (%)
1 106
32.1%
2 84
25.5%
3 58
17.6%
4 26
 
7.9%
6 22
 
6.7%
0 15
 
4.5%
5 13
 
3.9%
8 3
 
0.9%
7 2
 
0.6%
9 1
 
0.3%
Uppercase Letter
ValueCountFrequency (%)
B 96
84.2%
F 18
 
15.8%
Other Punctuation
ValueCountFrequency (%)
. 4
66.7%
/ 2
33.3%
Space Separator
ValueCountFrequency (%)
569
100.0%
Close Punctuation
ValueCountFrequency (%)
) 115
100.0%
Open Punctuation
ValueCountFrequency (%)
( 115
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Math Symbol
ValueCountFrequency (%)
> 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1294
49.8%
Common 1189
45.8%
Latin 114
 
4.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
107
 
8.3%
106
 
8.2%
88
 
6.8%
87
 
6.7%
78
 
6.0%
75
 
5.8%
53
 
4.1%
53
 
4.1%
48
 
3.7%
46
 
3.6%
Other values (95) 553
42.7%
Common
ValueCountFrequency (%)
569
47.9%
) 115
 
9.7%
( 115
 
9.7%
1 106
 
8.9%
2 84
 
7.1%
3 58
 
4.9%
- 52
 
4.4%
4 26
 
2.2%
6 22
 
1.9%
0 15
 
1.3%
Other values (7) 27
 
2.3%
Latin
ValueCountFrequency (%)
B 96
84.2%
F 18
 
15.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1303
50.2%
Hangul 1294
49.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
569
43.7%
) 115
 
8.8%
( 115
 
8.8%
1 106
 
8.1%
B 96
 
7.4%
2 84
 
6.4%
3 58
 
4.5%
- 52
 
4.0%
4 26
 
2.0%
6 22
 
1.7%
Other values (9) 60
 
4.6%
Hangul
ValueCountFrequency (%)
107
 
8.3%
106
 
8.2%
88
 
6.8%
87
 
6.7%
78
 
6.0%
75
 
5.8%
53
 
4.1%
53
 
4.1%
48
 
3.7%
46
 
3.6%
Other values (95) 553
42.7%

시작층
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
지하1
56 
지하2
50 
지상1
21 
지하3
지하5
 
2
Other values (2)
 
4

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지하1
2nd row지하1
3rd row지상1
4th row지하1
5th row지하2

Common Values

ValueCountFrequency (%)
지하1 56
40.3%
지하2 50
36.0%
지상1 21
 
15.1%
지하3 6
 
4.3%
지하5 2
 
1.4%
지상2 2
 
1.4%
지하4 2
 
1.4%

Length

2023-12-13T05:45:01.680511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:45:01.817371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지하1 56
40.3%
지하2 50
36.0%
지상1 21
 
15.1%
지하3 6
 
4.3%
지하5 2
 
1.4%
지상2 2
 
1.4%
지하4 2
 
1.4%

종료층
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
지하1
83 
지상1
31 
지하2
14 
지하3
 
4
지상2
 
3
Other values (2)
 
4

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지상1
2nd row지상1
3rd row지하1
4th row지상1
5th row지하1

Common Values

ValueCountFrequency (%)
지하1 83
59.7%
지상1 31
 
22.3%
지하2 14
 
10.1%
지하3 4
 
2.9%
지상2 3
 
2.2%
지하5 2
 
1.4%
지하4 2
 
1.4%

Length

2023-12-13T05:45:01.990973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:45:02.121881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지하1 83
59.7%
지상1 31
 
22.3%
지하2 14
 
10.1%
지하3 4
 
2.9%
지상2 3
 
2.2%
지하5 2
 
1.4%
지하4 2
 
1.4%

Correlations

2023-12-13T05:45:02.242545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명상하행구분출입구번호시작층종료층
역명1.0000.0000.7850.8450.827
상하행구분0.0001.0000.0000.4160.272
출입구번호0.7850.0001.0000.2330.000
시작층0.8450.4160.2331.0000.910
종료층0.8270.2720.0000.9101.000
2023-12-13T05:45:02.374059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시작층역명상하행구분출입구번호종료층
시작층1.0000.5420.4380.1240.558
역명0.5421.0000.0000.4410.515
상하행구분0.4380.0001.0000.0000.285
출입구번호0.1240.4410.0001.0000.000
종료층0.5580.5150.2850.0001.000
2023-12-13T05:45:02.513695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명상하행구분출입구번호시작층종료층
역명1.0000.0000.4410.5420.515
상하행구분0.0001.0000.0000.4380.285
출입구번호0.4410.0001.0000.1240.000
시작층0.5420.4380.1241.0000.558
종료층0.5150.2850.0000.5581.000

Missing values

2023-12-13T05:44:58.962870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:44:59.502087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관선명역명상하행구분출입구번호상세위치시작층종료층
0부산교통공사1호선교대상행4(B1) 4번출입구지하1지상1
1부산교통공사1호선남포상행2(B1) 2번 출입구 방향지하1지상1
2부산교통공사1호선남포하행6(1F) 6번 출입구 앞지상1지하1
3부산교통공사1호선남포상행6(B1) 6번 출입구 방향지하1지상1
4부산교통공사1호선남포상행1/3(B2) 자갈치역 방향 승강장 6-2 출입문 앞지하2지하1
5부산교통공사1호선남포상행2(B2) 중앙역 방향 승강장 5-2 출입문 앞지하2지하1
6부산교통공사1호선남포상행5(B2) 자갈치역 방향 승강장 3-3 출입문 앞지하2지하1
7부산교통공사1호선남포상행4/6(B2) 중앙역 방향 승강장 2-3 출입문 앞지하2지하1
8부산교통공사1호선노포(종합버스터미널)상행3(1F) 노포행 승강장 4-3 출입문 앞지상1지상2
9부산교통공사1호선대티(동주대학)하행<NA>(B3) 개찰구 내 4번 계단 옆지하3지하5
철도운영기관선명역명상하행구분출입구번호상세위치시작층종료층
129부산교통공사1호선장림하행5(1F)하이마트 맞은편지상1지하1
130부산교통공사1호선장림상행5(B1)5번 출입구 근처지하1지상1
131부산교통공사1호선장림하행6(1F)홈플러스 맞은편지상1지하1
132부산교통공사1호선장림상행6(B1)6번 출입구 근처지하1지상1
133부산교통공사1호선장림상행3(B2)신장림역 방향 6-3 근처지하2지하1
134부산교통공사1호선장림상행3(B2)동매역 방향 6-3 근처지하2지하1
135부산교통공사1호선장림상행3(B1)신장림역 방향 표 내는 곳 근처지하1지하2
136부산교통공사1호선장림하행3(B2)신장림역 방향 승강장 3-3 근처지하2지하1
137부산교통공사1호선장림상행3(B2)동매역 방향 승강장 3-2 근처지하2지하1
138부산교통공사1호선장림하행3(B1)동매역 방향 표 내는 곳 근처지하1지하2

Duplicate rows

Most frequently occurring

철도운영기관선명역명상하행구분출입구번호상세위치시작층종료층# duplicates
0부산교통공사1호선대티(동주대학)하행<NA>(B3) 개찰구 내 4번 계단 옆지하3지하52