Overview

Dataset statistics

Number of variables8
Number of observations99
Missing cells0
Missing cells (%)0.0%
Duplicate rows3
Duplicate rows (%)3.0%
Total size in memory6.3 KiB
Average record size in memory65.3 B

Variable types

Categorical7
Text1

Dataset

Description광주도시철도공사에서 관리하는 도시광역철도역들의 철도운영기관명, 선명, 역명, 상하행구분, 출입구번호, 상세위치, 시작층, 종료층의 데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041362/fileData.do

Alerts

철도운영기관 has constant value ""Constant
선명 has constant value ""Constant
Dataset has 3 (3.0%) duplicate rowsDuplicates
상하행구분 is highly overall correlated with 종료층High correlation
종료층 is highly overall correlated with 상하행구분High correlation

Reproduction

Analysis started2023-12-12 15:15:58.853109
Analysis finished2023-12-12 15:15:59.560089
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
광주도시철도공사
99 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광주도시철도공사
2nd row광주도시철도공사
3rd row광주도시철도공사
4th row광주도시철도공사
5th row광주도시철도공사

Common Values

ValueCountFrequency (%)
광주도시철도공사 99
100.0%

Length

2023-12-13T00:15:59.626746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:15:59.743047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
광주도시철도공사 99
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
1호선
99 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1호선
2nd row1호선
3rd row1호선
4th row1호선
5th row1호선

Common Values

ValueCountFrequency (%)
1호선 99
100.0%

Length

2023-12-13T00:15:59.869030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:15:59.956051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1호선 99
100.0%

역명
Categorical

Distinct17
Distinct (%)17.2%
Missing0
Missing (%)0.0%
Memory size924.0 B
문화전당
10 
양동시장
10 
광주송정역
금남로4가
김대중컨벤션센터
Other values (12)
54 

Length

Max length8
Median length7
Mean length3.8787879
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공항
2nd row공항
3rd row공항
4th row공항
5th row공항

Common Values

ValueCountFrequency (%)
문화전당 10
10.1%
양동시장 10
10.1%
광주송정역 9
9.1%
금남로4가 8
 
8.1%
김대중컨벤션센터 8
 
8.1%
상무 8
 
8.1%
공항 7
 
7.1%
송정공원 6
 
6.1%
소태 6
 
6.1%
남광주 6
 
6.1%
Other values (7) 21
21.2%

Length

2023-12-13T00:16:00.096477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
문화전당 10
10.1%
양동시장 10
10.1%
광주송정역 9
9.1%
금남로4가 8
 
8.1%
김대중컨벤션센터 8
 
8.1%
상무 8
 
8.1%
공항 7
 
7.1%
남광주 6
 
6.1%
소태 6
 
6.1%
송정공원 6
 
6.1%
Other values (7) 21
21.2%

상하행구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
상행
62 
하행
37 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row하행
2nd row상행
3rd row상행
4th row하행
5th row하행

Common Values

ValueCountFrequency (%)
상행 62
62.6%
하행 37
37.4%

Length

2023-12-13T00:16:00.256182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:16:00.347736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상행 62
62.6%
하행 37
37.4%

출입구번호
Categorical

Distinct7
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Memory size924.0 B
<NA>
71 
5
2
 
7
4
 
6
1
 
4
Other values (2)
 
3

Length

Max length5
Median length4
Mean length3.2323232
Min length1

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row2
2nd row2
3rd row5
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 71
71.7%
5 8
 
8.1%
2 7
 
7.1%
4 6
 
6.1%
1 4
 
4.0%
1/2/3 2
 
2.0%
3 1
 
1.0%

Length

2023-12-13T00:16:00.476866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:16:00.605065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 71
71.7%
5 8
 
8.1%
2 7
 
7.1%
4 6
 
6.1%
1 4
 
4.0%
1/2/3 2
 
2.0%
3 1
 
1.0%
Distinct92
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Memory size924.0 B
2023-12-13T00:16:00.870311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length29
Mean length23.313131
Min length13

Characters and Unicode

Total characters2308
Distinct characters176
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85 ?
Unique (%)85.9%

Sample

1st row(1F) 2번 출입구 광주공항 방향
2nd row(B1) 2번 출입구 광주공항 방향
3rd row(B1) 5번 출입구 송정동초등학고 방향
4th row(B1) 개찰구 내 표사는 곳 옆
5th row(B1) 개찰구 내 표사는 곳 옆
ValueCountFrequency (%)
방향 71
 
11.8%
51
 
8.5%
출입구 43
 
7.1%
승강장 39
 
6.5%
b1 37
 
6.1%
b2 34
 
5.6%
23
 
3.8%
출입문 21
 
3.5%
b3 14
 
2.3%
역무실 10
 
1.7%
Other values (116) 260
43.1%
2023-12-13T00:16:01.298483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
508
22.0%
( 102
 
4.4%
) 102
 
4.4%
B 89
 
3.9%
1 80
 
3.5%
80
 
3.5%
79
 
3.4%
74
 
3.2%
71
 
3.1%
66
 
2.9%
Other values (166) 1057
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1191
51.6%
Space Separator 508
22.0%
Decimal Number 236
 
10.2%
Uppercase Letter 130
 
5.6%
Open Punctuation 102
 
4.4%
Close Punctuation 102
 
4.4%
Dash Punctuation 29
 
1.3%
Other Punctuation 10
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
80
 
6.7%
79
 
6.6%
74
 
6.2%
71
 
6.0%
66
 
5.5%
52
 
4.4%
51
 
4.3%
49
 
4.1%
45
 
3.8%
39
 
3.3%
Other values (134) 585
49.1%
Uppercase Letter
ValueCountFrequency (%)
B 89
68.5%
F 12
 
9.2%
T 5
 
3.8%
E 3
 
2.3%
R 3
 
2.3%
S 3
 
2.3%
O 2
 
1.5%
C 2
 
1.5%
K 2
 
1.5%
X 2
 
1.5%
Other values (7) 7
 
5.4%
Decimal Number
ValueCountFrequency (%)
1 80
33.9%
2 56
23.7%
4 43
18.2%
3 34
14.4%
5 19
 
8.1%
6 3
 
1.3%
8 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
' 6
60.0%
· 2
 
20.0%
& 1
 
10.0%
? 1
 
10.0%
Space Separator
ValueCountFrequency (%)
508
100.0%
Open Punctuation
ValueCountFrequency (%)
( 102
100.0%
Close Punctuation
ValueCountFrequency (%)
) 102
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 29
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1191
51.6%
Common 987
42.8%
Latin 130
 
5.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
80
 
6.7%
79
 
6.6%
74
 
6.2%
71
 
6.0%
66
 
5.5%
52
 
4.4%
51
 
4.3%
49
 
4.1%
45
 
3.8%
39
 
3.3%
Other values (134) 585
49.1%
Latin
ValueCountFrequency (%)
B 89
68.5%
F 12
 
9.2%
T 5
 
3.8%
E 3
 
2.3%
R 3
 
2.3%
S 3
 
2.3%
O 2
 
1.5%
C 2
 
1.5%
K 2
 
1.5%
X 2
 
1.5%
Other values (7) 7
 
5.4%
Common
ValueCountFrequency (%)
508
51.5%
( 102
 
10.3%
) 102
 
10.3%
1 80
 
8.1%
2 56
 
5.7%
4 43
 
4.4%
3 34
 
3.4%
- 29
 
2.9%
5 19
 
1.9%
' 6
 
0.6%
Other values (5) 8
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1191
51.6%
ASCII 1115
48.3%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
508
45.6%
( 102
 
9.1%
) 102
 
9.1%
B 89
 
8.0%
1 80
 
7.2%
2 56
 
5.0%
4 43
 
3.9%
3 34
 
3.0%
- 29
 
2.6%
5 19
 
1.7%
Other values (21) 53
 
4.8%
Hangul
ValueCountFrequency (%)
80
 
6.7%
79
 
6.6%
74
 
6.2%
71
 
6.0%
66
 
5.5%
52
 
4.4%
51
 
4.3%
49
 
4.1%
45
 
3.8%
39
 
3.3%
Other values (134) 585
49.1%
None
ValueCountFrequency (%)
· 2
100.0%

시작층
Categorical

Distinct6
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size924.0 B
지하1
37 
지하2
34 
지하3
14 
지상1
지하4

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row지상1
2nd row지하1
3rd row지하1
4th row지하1
5th row지하1

Common Values

ValueCountFrequency (%)
지하1 37
37.4%
지하2 34
34.3%
지하3 14
 
14.1%
지상1 9
 
9.1%
지하4 4
 
4.0%
지상2 1
 
1.0%

Length

2023-12-13T00:16:01.493646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:16:01.630668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지하1 37
37.4%
지하2 34
34.3%
지하3 14
 
14.1%
지상1 9
 
9.1%
지하4 4
 
4.0%
지상2 1
 
1.0%

종료층
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size924.0 B
지하1
38 
지하2
27 
지상1
19 
지하3
10 
지하4

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row지하1
2nd row지상1
3rd row지상1
4th row지하2
5th row지하2

Common Values

ValueCountFrequency (%)
지하1 38
38.4%
지하2 27
27.3%
지상1 19
19.2%
지하3 10
 
10.1%
지하4 4
 
4.0%
지상2 1
 
1.0%

Length

2023-12-13T00:16:01.770376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:16:01.903014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지하1 38
38.4%
지하2 27
27.3%
지상1 19
19.2%
지하3 10
 
10.1%
지하4 4
 
4.0%
지상2 1
 
1.0%

Correlations

2023-12-13T00:16:02.027706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명상하행구분출입구번호상세위치시작층종료층
역명1.0000.0000.7650.9830.6720.652
상하행구분0.0001.0000.0000.8180.6260.726
출입구번호0.7650.0001.0000.9010.6480.611
상세위치0.9830.8180.9011.0001.0000.973
시작층0.6720.6260.6481.0001.0000.866
종료층0.6520.7260.6110.9730.8661.000
2023-12-13T00:16:02.146180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상하행구분출입구번호종료층역명시작층
상하행구분1.0000.0000.5250.0000.446
출입구번호0.0001.0000.2810.4610.309
종료층0.5250.2811.0000.3520.495
역명0.0000.4610.3521.0000.369
시작층0.4460.3090.4950.3691.000
2023-12-13T00:16:02.280167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명상하행구분출입구번호시작층종료층
역명1.0000.0000.4610.3690.352
상하행구분0.0001.0000.0000.4460.525
출입구번호0.4610.0001.0000.3090.281
시작층0.3690.4460.3091.0000.495
종료층0.3520.5250.2810.4951.000

Missing values

2023-12-13T00:15:59.385752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:15:59.510432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관선명역명상하행구분출입구번호상세위치시작층종료층
0광주도시철도공사1호선공항하행2(1F) 2번 출입구 광주공항 방향지상1지하1
1광주도시철도공사1호선공항상행2(B1) 2번 출입구 광주공항 방향지하1지상1
2광주도시철도공사1호선공항상행5(B1) 5번 출입구 송정동초등학고 방향지하1지상1
3광주도시철도공사1호선공항하행<NA>(B1) 개찰구 내 표사는 곳 옆지하1지하2
4광주도시철도공사1호선공항하행<NA>(B1) 개찰구 내 표사는 곳 옆지하1지하2
5광주도시철도공사1호선공항상행<NA>(B2) 송정공원 방향 승강장 4-2 출입구 앞지하2지하1
6광주도시철도공사1호선공항상행<NA>(B2) 송정공원 방향 승강장 1-3 출입구 앞지하2지하1
7광주도시철도공사1호선광주송정역하행5(1F) 5번출입구 헌혈의 집 근처지상1지하1
8광주도시철도공사1호선광주송정역상행5(B1) 5번출입구 헌혈의 집 방향지하1지상1
9광주도시철도공사1호선광주송정역하행4(1F) 4번출입구 광주송정역 KTX·SRT 역사 앞지상1지하1
철도운영기관선명역명상하행구분출입구번호상세위치시작층종료층
89광주도시철도공사1호선양동시장상행<NA>(B3) 금남로5가역 방향 승강장 1-3 출입문 앞지하3지하2
90광주도시철도공사1호선양동시장하행<NA>(B2) 돌고개역 방향 승강장 내려가는 계단지하2지하3
91광주도시철도공사1호선양동시장상행<NA>(B3) 돌고개역 방향 승강장 4-2 출입문 앞지하3지하2
92광주도시철도공사1호선평동상행1/2/3(1F) 개찰구 내 계단 맞은편지상1지상2
93광주도시철도공사1호선평동하행1/2/3(2F) 옥동기지 방향 승강장 1-2 출입문 앞지상2지상1
94광주도시철도공사1호선학동증심사입구상행<NA>(B2) 소태역 방향 승강장 1-3 출입문 앞지하2지하1
95광주도시철도공사1호선학동증심사입구하행<NA>(B1) 4번 출입구 자전거 보관소 옆지하1지하2
96광주도시철도공사1호선학동증심사입구하행<NA>(B1) 4번 출입구 역무실 옆지하1지하2
97광주도시철도공사1호선화정상행<NA>(B2) 대합실 표 사는 곳 앞지하2지하1
98광주도시철도공사1호선화정하행<NA>(B1) 2번 출입구 방향 물품보관소 앞지하1지하2

Duplicate rows

Most frequently occurring

철도운영기관선명역명상하행구분출입구번호상세위치시작층종료층# duplicates
0광주도시철도공사1호선공항하행<NA>(B1) 개찰구 내 표사는 곳 옆지하1지하22
1광주도시철도공사1호선광주송정역하행<NA>(B1) 대합실 내 '국창 임방울 선생 전시관' 옆지하1지하22
2광주도시철도공사1호선송정공원하행<NA>(B1) 개찰구 내 역무실 옆지하1지하22