Overview

Dataset statistics

Number of variables9
Number of observations91
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.8 KiB
Average record size in memory76.5 B

Variable types

Categorical7
Text2

Dataset

Description대구교통공사에서 관리하는 도시광역철도역들의 철도운영기관명, 선명, 역명, 지상지하구분, 역층, 상세위치, 충전설비수, 이용요금, 전화번호의 데이터가 포함되어 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041281/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
이용요금 has constant value ""Constant
지상지하구분 is highly overall correlated with 선명 and 1 other fieldsHigh correlation
선명 is highly overall correlated with 지상지하구분 and 1 other fieldsHigh correlation
역층 is highly overall correlated with 상세위치High correlation
상세위치 is highly overall correlated with 선명 and 3 other fieldsHigh correlation
충전설비수 is highly overall correlated with 상세위치High correlation
충전설비수 is highly imbalanced (91.3%)Imbalance

Reproduction

Analysis started2023-12-12 14:51:22.915727
Analysis finished2023-12-12 14:51:23.865936
Duration0.95 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size860.0 B
대구교통공사
91 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구교통공사
2nd row대구교통공사
3rd row대구교통공사
4th row대구교통공사
5th row대구교통공사

Common Values

ValueCountFrequency (%)
대구교통공사 91
100.0%

Length

2023-12-12T23:51:23.931599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:51:24.017767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구교통공사 91
100.0%

선명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size860.0 B
1호선
32 
3호선
30 
2호선
29 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1호선
2nd row1호선
3rd row1호선
4th row1호선
5th row1호선

Common Values

ValueCountFrequency (%)
1호선 32
35.2%
3호선 30
33.0%
2호선 29
31.9%

Length

2023-12-12T23:51:24.104712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:51:24.220325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1호선 32
35.2%
3호선 30
33.0%
2호선 29
31.9%

역명
Text

Distinct88
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size860.0 B
2023-12-12T23:51:24.509413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length2
Mean length3.956044
Min length2

Characters and Unicode

Total characters360
Distinct characters137
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85 ?
Unique (%)93.4%

Sample

1st row각산
2nd row교대
3rd row대곡(정부대구청사)
4th row대구역
5th row대명
ValueCountFrequency (%)
청라언덕 2
 
2.2%
명덕(2.28민주운동기념회관 2
 
2.2%
반월당 2
 
2.2%
원대 1
 
1.1%
이곡 1
 
1.1%
달성공원 1
 
1.1%
남산 1
 
1.1%
구암 1
 
1.1%
공단 1
 
1.1%
건들바위 1
 
1.1%
Other values (78) 78
85.7%
2023-12-12T23:51:25.051518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
5.3%
( 13
 
3.6%
) 13
 
3.6%
11
 
3.1%
10
 
2.8%
8
 
2.2%
8
 
2.2%
8
 
2.2%
8
 
2.2%
7
 
1.9%
Other values (127) 255
70.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 316
87.8%
Open Punctuation 13
 
3.6%
Close Punctuation 13
 
3.6%
Other Punctuation 6
 
1.7%
Decimal Number 6
 
1.7%
Uppercase Letter 6
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
6.0%
11
 
3.5%
10
 
3.2%
8
 
2.5%
8
 
2.5%
8
 
2.5%
8
 
2.5%
7
 
2.2%
7
 
2.2%
6
 
1.9%
Other values (116) 224
70.9%
Uppercase Letter
ValueCountFrequency (%)
B 2
33.3%
S 1
16.7%
T 1
16.7%
C 1
16.7%
K 1
16.7%
Other Punctuation
ValueCountFrequency (%)
· 4
66.7%
. 2
33.3%
Decimal Number
ValueCountFrequency (%)
2 4
66.7%
8 2
33.3%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 316
87.8%
Common 38
 
10.6%
Latin 6
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
6.0%
11
 
3.5%
10
 
3.2%
8
 
2.5%
8
 
2.5%
8
 
2.5%
8
 
2.5%
7
 
2.2%
7
 
2.2%
6
 
1.9%
Other values (116) 224
70.9%
Common
ValueCountFrequency (%)
( 13
34.2%
) 13
34.2%
· 4
 
10.5%
2 4
 
10.5%
. 2
 
5.3%
8 2
 
5.3%
Latin
ValueCountFrequency (%)
B 2
33.3%
S 1
16.7%
T 1
16.7%
C 1
16.7%
K 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 316
87.8%
ASCII 40
 
11.1%
None 4
 
1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
19
 
6.0%
11
 
3.5%
10
 
3.2%
8
 
2.5%
8
 
2.5%
8
 
2.5%
8
 
2.5%
7
 
2.2%
7
 
2.2%
6
 
1.9%
Other values (116) 224
70.9%
ASCII
ValueCountFrequency (%)
( 13
32.5%
) 13
32.5%
2 4
 
10.0%
B 2
 
5.0%
. 2
 
5.0%
8 2
 
5.0%
S 1
 
2.5%
T 1
 
2.5%
C 1
 
2.5%
K 1
 
2.5%
None
ValueCountFrequency (%)
· 4
100.0%

지상지하구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size860.0 B
지하
60 
지상
31 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지하
2nd row지하
3rd row지하
4th row지하
5th row지하

Common Values

ValueCountFrequency (%)
지하 60
65.9%
지상 31
34.1%

Length

2023-12-12T23:51:25.239108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:51:25.360100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지하 60
65.9%
지상 31
34.1%

역층
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size860.0 B
1
45 
2
40 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row2
4th row2
5th row1

Common Values

ValueCountFrequency (%)
1 45
49.5%
2 40
44.0%
3 6
 
6.6%

Length

2023-12-12T23:51:25.511695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:51:25.637903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 45
49.5%
2 40
44.0%
3 6
 
6.6%

상세위치
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)31.9%
Missing0
Missing (%)0.0%
Memory size860.0 B
지하1층 대합실
23 
지상2층 대합실
20 
지상1층 대합실
지하2층 대합실
지하3층 대합실
 
3
Other values (24)
30 

Length

Max length15
Median length8
Mean length9.2637363
Min length8

Unique

Unique19 ?
Unique (%)20.9%

Sample

1st row지하1층 대합실
2nd row지하2층 2발매기 옆
3rd row지하2층 화장실 앞
4th row지하2층 역무실 앞
5th row지하1층 대합실

Common Values

ValueCountFrequency (%)
지하1층 대합실 23
25.3%
지상2층 대합실 20
22.0%
지상1층 대합실 8
 
8.8%
지하2층 대합실 7
 
7.7%
지하3층 대합실 3
 
3.3%
지하2층 화장실 앞 3
 
3.3%
지상2층 대합실 2
 
2.2%
지하1층 대합실 쉼터 내 2
 
2.2%
지하1층 역무실 앞 2
 
2.2%
지하2층 대합실 화장실 앞 2
 
2.2%
Other values (19) 19
20.9%

Length

2023-12-12T23:51:25.790118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
대합실 69
31.5%
지하1층 37
16.9%
지상2층 22
 
10.0%
지하2층 18
 
8.2%
10
 
4.6%
8
 
3.7%
지상1층 8
 
3.7%
화장실 6
 
2.7%
지하3층 5
 
2.3%
역무실 5
 
2.3%
Other values (22) 31
14.2%

충전설비수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size860.0 B
1
90 
2
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)1.1%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 90
98.9%
2 1
 
1.1%

Length

2023-12-12T23:51:25.965251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:51:26.098092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 90
98.9%
2 1
 
1.1%

이용요금
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size860.0 B
0
91 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 91
100.0%

Length

2023-12-12T23:51:26.211037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:51:26.314876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 91
100.0%
Distinct66
Distinct (%)72.5%
Missing0
Missing (%)0.0%
Memory size860.0 B
2023-12-12T23:51:26.592266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1092
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique61 ?
Unique (%)67.0%

Sample

1st row053-963-7754
2nd row053-473-7702
3rd row053-644-7723
4th row053-426-7797
5th row053-627-7746
ValueCountFrequency (%)
053-640-7611 6
 
6.6%
053-640-7431 6
 
6.6%
053-640-7581 6
 
6.6%
053-640-7521 6
 
6.6%
053-640-7381 6
 
6.6%
053-752-0206 1
 
1.1%
053-791-0858 1
 
1.1%
053-656-6038 1
 
1.1%
053-252-0689 1
 
1.1%
053-752-0959 1
 
1.1%
Other values (56) 56
61.5%
2023-12-12T23:51:27.053475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 182
16.7%
5 153
14.0%
0 146
13.4%
3 126
11.5%
7 123
11.3%
6 95
8.7%
4 64
 
5.9%
1 62
 
5.7%
2 60
 
5.5%
8 48
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 910
83.3%
Dash Punctuation 182
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 153
16.8%
0 146
16.0%
3 126
13.8%
7 123
13.5%
6 95
10.4%
4 64
7.0%
1 62
6.8%
2 60
 
6.6%
8 48
 
5.3%
9 33
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 182
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1092
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 182
16.7%
5 153
14.0%
0 146
13.4%
3 126
11.5%
7 123
11.3%
6 95
8.7%
4 64
 
5.9%
1 62
 
5.7%
2 60
 
5.5%
8 48
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1092
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 182
16.7%
5 153
14.0%
0 146
13.4%
3 126
11.5%
7 123
11.3%
6 95
8.7%
4 64
 
5.9%
1 62
 
5.7%
2 60
 
5.5%
8 48
 
4.4%

Correlations

2023-12-12T23:51:27.165521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
선명역명지상지하구분역층상세위치충전설비수전화번호
선명1.0000.0000.7540.6230.9080.0261.000
역명0.0001.0000.0000.0000.9570.0000.982
지상지하구분0.7540.0001.0000.2241.0000.0001.000
역층0.6230.0000.2241.0001.0000.2280.970
상세위치0.9080.9571.0001.0001.0001.0000.997
충전설비수0.0260.0000.0000.2281.0001.0001.000
전화번호1.0000.9821.0000.9700.9971.0001.000
2023-12-12T23:51:27.288617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상세위치충전설비수지상지하구분역층선명
상세위치1.0000.8350.8350.8390.640
충전설비수0.8351.0000.0000.3700.039
지상지하구분0.8350.0001.0000.3640.970
역층0.8390.3700.3641.0000.289
선명0.6400.0390.9700.2891.000
2023-12-12T23:51:27.427475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
선명지상지하구분역층상세위치충전설비수
선명1.0000.9700.2890.6400.039
지상지하구분0.9701.0000.3640.8350.000
역층0.2890.3641.0000.8390.370
상세위치0.6400.8350.8391.0000.835
충전설비수0.0390.0000.3700.8351.000

Missing values

2023-12-12T23:51:23.656410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:51:23.810738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명지상지하구분역층상세위치충전설비수이용요금전화번호
0대구교통공사1호선각산지하1지하1층 대합실10053-963-7754
1대구교통공사1호선교대지하2지하2층 2발매기 옆10053-473-7702
2대구교통공사1호선대곡(정부대구청사)지하2지하2층 화장실 앞10053-644-7723
3대구교통공사1호선대구역지하2지하2층 역무실 앞10053-426-7797
4대구교통공사1호선대명지하1지하1층 대합실10053-627-7746
5대구교통공사1호선동구청(큰고개)지하1지하1층 대합실10053-942-7721
6대구교통공사1호선동대구역지하1지하1층 1번출구 앞10053-742-7787
7대구교통공사1호선동촌지하3지하3층 대합실10053-981-7731
8대구교통공사1호선명덕(2.28민주운동기념회관)지하1지하1층 역무실 앞10053-255-7723
9대구교통공사1호선반야월지하1지하1층 대합실10053-962-7798
철도운영기관명선명역명지상지하구분역층상세위치충전설비수이용요금전화번호
81대구교통공사3호선지산지상2지상2층 대합실10053-640-7611
82대구교통공사3호선청라언덕지상2지상2층 대합실10053-640-7521
83대구교통공사3호선칠곡경대병원지상2지상2층 대합실10053-640-7381
84대구교통공사3호선칠곡운암지상1지상1층 대합실10053-640-7381
85대구교통공사3호선태전지상1지상1층 대합실10053-640-7431
86대구교통공사3호선팔거(국립농관원·통계청)지상2지상2층 대합실10053-640-7381
87대구교통공사3호선팔달지상1지상1층 대합실10053-640-7431
88대구교통공사3호선팔달시장지상2지상2층 대합실10053-640-7521
89대구교통공사3호선학정지상2지상2층 대합실10053-640-7381
90대구교통공사3호선황금지상2지상2층 대합실10053-640-7611