Overview

Dataset statistics

Number of variables8
Number of observations172
Missing cells0
Missing cells (%)0.0%
Duplicate rows7
Duplicate rows (%)4.1%
Total size in memory11.0 KiB
Average record size in memory65.8 B

Variable types

Categorical7
Text1

Dataset

Description대구3호선에 포함된 도시광역철도역들의 철도운영기관명, 선명, 역명, 상하행구분, 출입구번호, 상세위치, 시작층, 종료층의 데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041360/fileData.do

Alerts

철도운영기관 has constant value ""Constant
선명 has constant value ""Constant
Dataset has 7 (4.1%) duplicate rowsDuplicates
상하행구분 is highly overall correlated with 시작층 and 1 other fieldsHigh correlation
시작층 is highly overall correlated with 상하행구분 and 1 other fieldsHigh correlation
종료층 is highly overall correlated with 상하행구분 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 14:24:55.529303
Analysis finished2023-12-12 14:24:56.086178
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
대구교통공사
172 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구교통공사
2nd row대구교통공사
3rd row대구교통공사
4th row대구교통공사
5th row대구교통공사

Common Values

ValueCountFrequency (%)
대구교통공사 172
100.0%

Length

2023-12-12T23:24:56.166017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:24:56.283125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구교통공사 172
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
3호선
172 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3호선
2nd row3호선
3rd row3호선
4th row3호선
5th row3호선

Common Values

ValueCountFrequency (%)
3호선 172
100.0%

Length

2023-12-12T23:24:56.403661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:24:56.499836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3호선 172
100.0%

역명
Categorical

Distinct30
Distinct (%)17.4%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
명덕(2.28민주운동기념회관)
 
10
청라언덕
 
8
만평
 
8
어린이회관
 
6
구암
 
6
Other values (25)
134 

Length

Max length16
Median length13
Mean length4.4651163
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건들바위
2nd row건들바위
3rd row건들바위
4th row건들바위
5th row공단

Common Values

ValueCountFrequency (%)
명덕(2.28민주운동기념회관) 10
 
5.8%
청라언덕 8
 
4.7%
만평 8
 
4.7%
어린이회관 6
 
3.5%
구암 6
 
3.5%
남산 6
 
3.5%
대봉교 6
 
3.5%
동천 6
 
3.5%
매천 6
 
3.5%
매천시장 6
 
3.5%
Other values (20) 104
60.5%

Length

2023-12-12T23:24:56.620301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
명덕(2.28민주운동기념회관 10
 
5.8%
만평 8
 
4.7%
청라언덕 8
 
4.7%
범물 6
 
3.5%
칠곡운암 6
 
3.5%
태전 6
 
3.5%
팔거(국립농관원·통계청 6
 
3.5%
수성못(tbc 6
 
3.5%
팔달시장 6
 
3.5%
수성구민운동장 6
 
3.5%
Other values (20) 104
60.5%

상하행구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
상행
111 
하행
61 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상행
2nd row상행
3rd row상행
4th row상행
5th row상행

Common Values

ValueCountFrequency (%)
상행 111
64.5%
하행 61
35.5%

Length

2023-12-12T23:24:56.775161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:24:56.888597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상행 111
64.5%
하행 61
35.5%

출입구번호
Categorical

Distinct6
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
<NA>
106 
1
34 
4
12 
2
11 
3
 
8

Length

Max length4
Median length4
Mean length2.8488372
Min length1

Unique

Unique1 ?
Unique (%)0.6%

Sample

1st row4
2nd row2
3rd row<NA>
4th row<NA>
5th row1

Common Values

ValueCountFrequency (%)
<NA> 106
61.6%
1 34
 
19.8%
4 12
 
7.0%
2 11
 
6.4%
3 8
 
4.7%
6 1
 
0.6%

Length

2023-12-12T23:24:57.006863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:24:57.509261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 106
61.6%
1 34
 
19.8%
4 12
 
7.0%
2 11
 
6.4%
3 8
 
4.7%
6 1
 
0.6%
Distinct125
Distinct (%)72.7%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-12T23:24:57.859515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length24
Mean length17.726744
Min length7

Characters and Unicode

Total characters3049
Distinct characters163
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique102 ?
Unique (%)59.3%

Sample

1st row(1F) 4번 출입구
2nd row(1F) 2번 출입구
3rd row(2F) 개찰구 개표 후 명덕 방향
4th row(2F) 개찰구 개표 후 대봉교 방향
5th row(1F) 1번 출입구 근처
ValueCountFrequency (%)
출입구 82
 
10.5%
방향 61
 
7.8%
2f 60
 
7.7%
43
 
5.5%
1번 39
 
5.0%
33
 
4.2%
1f 28
 
3.6%
출입문 24
 
3.1%
3f 21
 
2.7%
대합실 20
 
2.6%
Other values (135) 372
47.5%
2023-12-12T23:24:58.356971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
615
20.2%
( 172
 
5.6%
F 172
 
5.6%
) 172
 
5.6%
2 125
 
4.1%
1 122
 
4.0%
111
 
3.6%
109
 
3.6%
98
 
3.2%
83
 
2.7%
Other values (153) 1270
41.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1519
49.8%
Space Separator 615
20.2%
Decimal Number 330
 
10.8%
Uppercase Letter 200
 
6.6%
Open Punctuation 172
 
5.6%
Close Punctuation 172
 
5.6%
Dash Punctuation 31
 
1.0%
Lowercase Letter 8
 
0.3%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
111
 
7.3%
109
 
7.2%
98
 
6.5%
83
 
5.5%
81
 
5.3%
64
 
4.2%
59
 
3.9%
46
 
3.0%
42
 
2.8%
39
 
2.6%
Other values (131) 787
51.8%
Uppercase Letter
ValueCountFrequency (%)
F 172
86.0%
E 6
 
3.0%
L 6
 
3.0%
M 4
 
2.0%
B 4
 
2.0%
S 3
 
1.5%
P 2
 
1.0%
D 2
 
1.0%
G 1
 
0.5%
Decimal Number
ValueCountFrequency (%)
2 125
37.9%
1 122
37.0%
3 64
19.4%
4 13
 
3.9%
5 3
 
0.9%
6 2
 
0.6%
7 1
 
0.3%
Space Separator
ValueCountFrequency (%)
615
100.0%
Open Punctuation
ValueCountFrequency (%)
( 172
100.0%
Close Punctuation
ValueCountFrequency (%)
) 172
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%
Lowercase Letter
ValueCountFrequency (%)
m 8
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1519
49.8%
Common 1322
43.4%
Latin 208
 
6.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
111
 
7.3%
109
 
7.2%
98
 
6.5%
83
 
5.5%
81
 
5.3%
64
 
4.2%
59
 
3.9%
46
 
3.0%
42
 
2.8%
39
 
2.6%
Other values (131) 787
51.8%
Common
ValueCountFrequency (%)
615
46.5%
( 172
 
13.0%
) 172
 
13.0%
2 125
 
9.5%
1 122
 
9.2%
3 64
 
4.8%
- 31
 
2.3%
4 13
 
1.0%
5 3
 
0.2%
/ 2
 
0.2%
Other values (2) 3
 
0.2%
Latin
ValueCountFrequency (%)
F 172
82.7%
m 8
 
3.8%
E 6
 
2.9%
L 6
 
2.9%
M 4
 
1.9%
B 4
 
1.9%
S 3
 
1.4%
P 2
 
1.0%
D 2
 
1.0%
G 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1530
50.2%
Hangul 1519
49.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
615
40.2%
( 172
 
11.2%
F 172
 
11.2%
) 172
 
11.2%
2 125
 
8.2%
1 122
 
8.0%
3 64
 
4.2%
- 31
 
2.0%
4 13
 
0.8%
m 8
 
0.5%
Other values (12) 36
 
2.4%
Hangul
ValueCountFrequency (%)
111
 
7.3%
109
 
7.2%
98
 
6.5%
83
 
5.5%
81
 
5.3%
64
 
4.2%
59
 
3.9%
46
 
3.0%
42
 
2.8%
39
 
2.6%
Other values (131) 787
51.8%

시작층
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
지상2
79 
지상1
53 
지상3
36 
지하2
 
2
지하1
 
1

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique2 ?
Unique (%)1.2%

Sample

1st row지상1
2nd row지상1
3rd row지상2
4th row지상2
5th row지상1

Common Values

ValueCountFrequency (%)
지상2 79
45.9%
지상1 53
30.8%
지상3 36
20.9%
지하2 2
 
1.2%
지하1 1
 
0.6%
지하3 1
 
0.6%

Length

2023-12-12T23:24:58.506438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:24:58.627346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지상2 79
45.9%
지상1 53
30.8%
지상3 36
20.9%
지하2 2
 
1.2%
지하1 1
 
0.6%
지하3 1
 
0.6%

종료층
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
지상2
82 
지상3
54 
지상1
32 
지하1
 
3
지하3
 
1

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)0.6%

Sample

1st row지상2
2nd row지상2
3rd row지상3
4th row지상3
5th row지상1

Common Values

ValueCountFrequency (%)
지상2 82
47.7%
지상3 54
31.4%
지상1 32
 
18.6%
지하1 3
 
1.7%
지하3 1
 
0.6%

Length

2023-12-12T23:24:58.757885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:24:58.871629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지상2 82
47.7%
지상3 54
31.4%
지상1 32
 
18.6%
지하1 3
 
1.7%
지하3 1
 
0.6%

Correlations

2023-12-12T23:24:58.960533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명상하행구분출입구번호시작층종료층
역명1.0000.1430.5610.0000.000
상하행구분0.1431.0000.0000.9170.442
출입구번호0.5610.0001.0000.0000.085
시작층0.0000.9170.0001.0000.725
종료층0.0000.4420.0850.7251.000
2023-12-12T23:24:59.065702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상하행구분출입구번호종료층역명시작층
상하행구분1.0000.0000.5330.0990.735
출입구번호0.0001.0000.0970.2240.000
종료층0.5330.0971.0000.0000.591
역명0.0990.2240.0001.0000.000
시작층0.7350.0000.5910.0001.000
2023-12-12T23:24:59.192513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명상하행구분출입구번호시작층종료층
역명1.0000.0990.2240.0000.000
상하행구분0.0991.0000.0000.7350.533
출입구번호0.2240.0001.0000.0000.097
시작층0.0000.7350.0001.0000.591
종료층0.0000.5330.0970.5911.000

Missing values

2023-12-12T23:24:55.922581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:24:56.037886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관선명역명상하행구분출입구번호상세위치시작층종료층
0대구교통공사3호선건들바위상행4(1F) 4번 출입구지상1지상2
1대구교통공사3호선건들바위상행2(1F) 2번 출입구지상1지상2
2대구교통공사3호선건들바위상행<NA>(2F) 개찰구 개표 후 명덕 방향지상2지상3
3대구교통공사3호선건들바위상행<NA>(2F) 개찰구 개표 후 대봉교 방향지상2지상3
4대구교통공사3호선공단상행1(1F) 1번 출입구 근처지상1지상1
5대구교통공사3호선공단상행1(MF) 1번 출입구 화장실 방향지상1지상2
6대구교통공사3호선공단상행<NA>(2F) 1번 출입구 한국건강관리협회 방향지상2지상1
7대구교통공사3호선공단상행<NA>(2F) 1번 출입구 할리스커피 방향지상2지상1
8대구교통공사3호선공단상행<NA>(MF) 1번 출입구 한국건강관리협회 방향지상1지상3
9대구교통공사3호선공단상행<NA>(MF) 1번 출입구 할리스커피 방향지상1지상3
철도운영기관선명역명상하행구분출입구번호상세위치시작층종료층
162대구교통공사3호선학정상행4(1F)칠곡경대병원역 방향지상1지상2
163대구교통공사3호선학정상행3(1F)팔거역 방향지상1지상2
164대구교통공사3호선학정하행<NA>(3F)칠곡경대병원역 방향 승강장 3-2 출입문 앞지상3지상2
165대구교통공사3호선학정상행<NA>(2F)표내는 곳 근처지상2지상3
166대구교통공사3호선황금하행1(2F)대합실지상2지상1
167대구교통공사3호선황금상행1(1F)1번 출입구 옆지상1지상2
168대구교통공사3호선황금하행<NA>(3F)어린이회관역 방향 3-2 출입문지상3지상2
169대구교통공사3호선황금상행<NA>(2F)대합실지상2지상3
170대구교통공사3호선황금상행<NA>(2F)대합실지상2지상3
171대구교통공사3호선황금하행<NA>(3F)수성못역 방향 1-2 출입문지상3지상2

Duplicate rows

Most frequently occurring

철도운영기관선명역명상하행구분출입구번호상세위치시작층종료층# duplicates
0대구교통공사3호선구암상행<NA>(2F)대합실 창고 앞지상2지상32
1대구교통공사3호선남산하행<NA>(3F) 승강장 청라언덕역 방향 끝지점지상3지상22
2대구교통공사3호선수성못(TBC)상행<NA>(2F)대합실지상2지상32
3대구교통공사3호선어린이회관상행<NA>(2F)대합실지상2지상32
4대구교통공사3호선원대상행<NA>(2F) 1번 출입구 대합실 표 내는 곳 옆지상2지상12
5대구교통공사3호선팔달시장상행<NA>(2F) 1번 출입구 대합실 표 내는 곳 옆지상2지상12
6대구교통공사3호선황금상행<NA>(2F)대합실지상2지상32