Overview

Dataset statistics

Number of variables: 8
Number of observations: 129
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 0.8%
Total size in memory: 8.2 KiB
Average record size in memory: 65.0 B
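
The derived figures in this list follow from the raw counts. A minimal sketch of that arithmetic, assuming a percentage is a count divided by the number of observations and that 1 KiB is 1024 bytes (the variable names are illustrative and not taken from the report). Because the 8.2 KiB total is itself rounded, the recomputed record size only approximates the reported 65.0 B.

```python
# Counts as listed under "Dataset statistics".
n_observations = 129
n_duplicate_rows = 1
total_size_kib = 8.2  # already rounded in the report

# Share of duplicate rows, rounded to one decimal place.
duplicate_pct = round(100 * n_duplicate_rows / n_observations, 1)

# Average record size in bytes, assuming 1 KiB = 1024 B.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Duplicate rows (%): {duplicate_pct}%")
print(f"Average record size: about {avg_record_bytes:.0f} B")
```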

Variable types

Categorical: 7
Text: 1

Dataset

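Description: Contains the railway operator name (철도운영기관), line name (선명), station name (역명), up/down direction (상하행구분), entrance number (출입구번호), detailed location (상세위치), start floor (시작층), and end floor (종료층) for the urban and metropolitan railway stations on Busan Line 4.
Author: 국가철도공단 (Korea National Railway)
URL: https://www.data.go.kr/data/15041356/fileData.do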

Alerts

철도운영기관 has constant value "부산교통공사" (Constant)
선명 has constant value "4호선" (Constant)
Dataset has 1 (0.8%) duplicate rows (Duplicates)
시작층 is highly overall correlated with 종료층 (High correlation)
종료층 is highly overall correlated with 시작층 (High correlation)
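
The Constant alerts flag columns that hold a single value in every row. A minimal sketch of such a check, assuming rows are plain tuples; the helper name `constant_columns` is hypothetical and this is not how ydata-profiling implements the alert. The three rows are taken from the Sample section of this report, restricted to four columns.

```python
def constant_columns(rows, columns):
    """Return {column: value} for columns whose values are all identical."""
    constants = {}
    for position, name in enumerate(columns):
        distinct = {row[position] for row in rows}
        if len(distinct) == 1:
            constants[name] = next(iter(distinct))
    return constants


columns = ["철도운영기관", "선명", "역명", "상하행구분"]
rows = [
    ("부산교통공사", "4호선", "고촌", "상행"),
    ("부산교통공사", "4호선", "고촌", "하행"),
    ("부산교통공사", "4호선", "금사", "상행"),
]
print(constant_columns(rows, columns))
```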

Reproduction

Analysis started: 2023-12-12 09:33:17.505584
Analysis finished: 2023-12-12 09:33:18.204252
Duration: 0.7 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
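
A report of this kind can be regenerated along the following lines. This is a sketch under stated assumptions, not the configuration that produced this report: the CSV path, the `cp949` encoding, and the helper name `build_report` are all hypothetical, and the original `config.json` is not reproduced here. It needs `pandas` and `ydata-profiling` installed.

```python
def build_report(csv_path, output_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (paths are placeholders)."""
    # Imported lazily so that defining the helper does not require the packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Busan Line 4 profiling report")
    report.to_file(output_path)
    return output_path


# Example call (not executed here; the file name is hypothetical):
# build_report("busan_line4.csv")
```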

Variables

철도운영기관
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
부산교통공사: 129

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 부산교통공사
2nd row: 부산교통공사
3rd row: 부산교통공사
4th row: 부산교통공사
5th row: 부산교통공사

Common Values

Value | Count | Frequency (%)
부산교통공사 | 129 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure omitted]

선명
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
4호선: 129

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 4호선
2nd row: 4호선
3rd row: 4호선
4th row: 4호선
5th row: 4호선

Common Values

Value | Count | Frequency (%)
4호선 | 129 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure omitted]

역명
Categorical

Distinct: 14
Distinct (%): 10.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
동래: 22
수안: 16
명장: 12
서동: 12
충렬사(안락): 12
Other values (9): 55

Length

Max length: 10
Median length: 2
Mean length: 3.4496124
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 고촌
2nd row: 고촌
3rd row: 고촌
4th row: 고촌
5th row: 고촌

Common Values

Value | Count | Frequency (%)
동래 | 22 | 17.1%
수안 | 16 | 12.4%
명장 | 12 | 9.3%
서동 | 12 | 9.3%
충렬사(안락) | 12 | 9.3%
금사 | 9 | 7.0%
낙민 | 7 | 5.4%
윗반송 | 7 | 5.4%
고촌 | 6 | 4.7%
미남 | 6 | 4.7%
Other values (4) | 20 | 15.5%

Length

[Figure: Histogram of lengths of the category]

상하행구분
Categorical

Distinct: 2
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
상행: 76
하행: 53

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 상행
2nd row: 하행
3rd row: 하행
4th row: 상행
5th row: 상행

Common Values

Value | Count | Frequency (%)
상행 | 76 | 58.9%
하행 | 53 | 41.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure omitted]

출입구번호
Categorical

Distinct: 13
Distinct (%): 10.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
<NA>: 69
4: 16
3: 10
1: 8
7: 6
Other values (8): 20

Length

Max length: 5
Median length: 4
Mean length: 2.7364341
Min length: 1

Unique

Unique: 4
Unique (%): 3.1%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 69 | 53.5%
4 | 16 | 12.4%
3 | 10 | 7.8%
1 | 8 | 6.2%
7 | 6 | 4.7%
8 | 6 | 4.7%
2 | 6 | 4.7%
1/3 | 2 | 1.6%
6/8 | 2 | 1.6%
9 | 1 | 0.8%
Other values (3) | 3 | 2.3%

Length

[Figure: Histogram of lengths of the category]

상세위치
Text

Distinct: 115
Distinct (%): 89.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 30
Median length: 24
Mean length: 16.48062
Min length: 7

Characters and Unicode

Total characters: 2126
Distinct characters: 133
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
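
As an illustration of those character properties, the sketch below counts the Unicode general category of each character in one 상세위치 value from this dataset, using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical. The two-letter codes map to the category names used in this report: `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator, `Lu` is Uppercase Letter, `Ps` is Open Punctuation, and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters in `text` by Unicode general category (e.g. 'Lo', 'Nd')."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "(B1) 3번 출입구"
counts = category_counts(sample)
for category, count in counts.most_common():
    print(category, count)
```

The standard library exposes the general category but not the script or block of a character, so the script and block breakdowns would need another source of Unicode data.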

Unique

Unique: 105
Unique (%): 81.4%

Sample

1st row: 상선 측 실로암묘원에서 버스종점 엘리베트 전방
2nd row: 2층 대합실 출입구앞(상선 측)
3rd row: 하선 측 운봉교에서 안평역방향
4th row: 2층 대합실 출입구앞(하선 측)
5th row: 대합실 매표소 표내는 곳

Value | Count | Frequency (%)
| 45 | 7.8%
출입구 | 45 | 7.8%
방향 | 35 | 6.1%
b1 | 28 | 4.9%
출입문 | 26 | 4.5%
| 20 | 3.5%
근처 | 20 | 3.5%
b3 | 15 | 2.6%
승강장 | 14 | 2.4%
4번 | 14 | 2.4%
Other values (121) | 313 | 54.4%

Most occurring characters

Value | Count | Frequency (%)
(space) | 454 | 21.4%
( | 120 | 5.6%
) | 120 | 5.6%
1 | 111 | 5.2%
| 79 | 3.7%
B | 78 | 3.7%
| 75 | 3.5%
| 62 | 2.9%
2 | 61 | 2.9%
| 55 | 2.6%
Other values (123) | 911 | 42.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 981 | 46.1%
Space Separator | 454 | 21.4%
Decimal Number | 268 | 12.6%
Uppercase Letter | 123 | 5.8%
Open Punctuation | 120 | 5.6%
Close Punctuation | 120 | 5.6%
Dash Punctuation | 35 | 1.6%
Math Symbol | 13 | 0.6%
Other Punctuation | 12 | 0.6%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
| 79 | 8.1%
| 75 | 7.6%
| 62 | 6.3%
| 55 | 5.6%
| 47 | 4.8%
| 42 | 4.3%
| 38 | 3.9%
| 29 | 3.0%
| 26 | 2.7%
| 24 | 2.4%
Other values (101) | 504 | 51.4%

Decimal Number
Value | Count | Frequency (%)
1 | 111 | 41.4%
2 | 61 | 22.8%
3 | 41 | 15.3%
4 | 22 | 8.2%
6 | 11 | 4.1%
8 | 8 | 3.0%
5 | 6 | 2.2%
7 | 6 | 2.2%
9 | 1 | 0.4%
0 | 1 | 0.4%

Uppercase Letter
Value | Count | Frequency (%)
B | 78 | 63.4%
F | 29 | 23.6%
E | 8 | 6.5%
L | 4 | 3.3%
S | 2 | 1.6%
V | 2 | 1.6%

Space Separator
Value | Count | Frequency (%)
(space) | 454 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 120 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 120 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 35 | 100.0%

Math Symbol
Value | Count | Frequency (%)
> | 13 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
/ | 12 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1022 | 48.1%
Hangul | 981 | 46.1%
Latin | 123 | 5.8%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
| 79 | 8.1%
| 75 | 7.6%
| 62 | 6.3%
| 55 | 5.6%
| 47 | 4.8%
| 42 | 4.3%
| 38 | 3.9%
| 29 | 3.0%
| 26 | 2.7%
| 24 | 2.4%
Other values (101) | 504 | 51.4%

Common
Value | Count | Frequency (%)
(space) | 454 | 44.4%
( | 120 | 11.7%
) | 120 | 11.7%
1 | 111 | 10.9%
2 | 61 | 6.0%
3 | 41 | 4.0%
- | 35 | 3.4%
4 | 22 | 2.2%
> | 13 | 1.3%
/ | 12 | 1.2%
Other values (6) | 33 | 3.2%

Latin
Value | Count | Frequency (%)
B | 78 | 63.4%
F | 29 | 23.6%
E | 8 | 6.5%
L | 4 | 3.3%
S | 2 | 1.6%
V | 2 | 1.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1145 | 53.9%
Hangul | 981 | 46.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 454 | 39.7%
( | 120 | 10.5%
) | 120 | 10.5%
1 | 111 | 9.7%
B | 78 | 6.8%
2 | 61 | 5.3%
3 | 41 | 3.6%
- | 35 | 3.1%
F | 29 | 2.5%
4 | 22 | 1.9%
Other values (12) | 74 | 6.5%

Hangul
Value | Count | Frequency (%)
| 79 | 8.1%
| 75 | 7.6%
| 62 | 6.3%
| 55 | 5.6%
| 47 | 4.8%
| 42 | 4.3%
| 38 | 3.9%
| 29 | 3.0%
| 26 | 2.7%
| 24 | 2.4%
Other values (101) | 504 | 51.4%

시작층
Categorical

HIGH CORRELATION

Distinct: 7
Distinct (%): 5.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
지하1: 37
지상1: 31
지하3: 23
지하2: 18
지상2: 13
Other values (2): 7

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지상1
2nd row: 지상2
3rd row: 지상1
4th row: 지상2
5th row: 지상2

Common Values

Value | Count | Frequency (%)
지하1 | 37 | 28.7%
지상1 | 31 | 24.0%
지하3 | 23 | 17.8%
지하2 | 18 | 14.0%
지상2 | 13 | 10.1%
지하4 | 4 | 3.1%
지상3 | 3 | 2.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure omitted]

종료층
Categorical

HIGH CORRELATION

Distinct: 7
Distinct (%): 5.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
지하1: 37
지상1: 29
지하2: 26
지상2: 14
지하3: 14
Other values (2): 9

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지상2
2nd row: 지상1
3rd row: 지상2
4th row: 지상1
5th row: 지상3

Common Values

Value | Count | Frequency (%)
지하1 | 37 | 28.7%
지상1 | 29 | 22.5%
지하2 | 26 | 20.2%
지상2 | 14 | 10.9%
지하3 | 14 | 10.9%
지상3 | 7 | 5.4%
지하4 | 2 | 1.6%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure omitted]

Correlations

Variable | 역명 | 상하행구분 | 출입구번호 | 시작층 | 종료층
역명 | 1.000 | 0.000 | 0.690 | 0.663 | 0.671
상하행구분 | 0.000 | 1.000 | 0.000 | 0.283 | 0.224
출입구번호 | 0.690 | 0.000 | 1.000 | 0.000 | 0.287
시작층 | 0.663 | 0.283 | 0.000 | 1.000 | 0.907
종료층 | 0.671 | 0.224 | 0.287 | 0.907 | 1.000

Variable | 역명 | 출입구번호 | 시작층 | 상하행구분 | 종료층
역명 | 1.000 | 0.337 | 0.297 | 0.000 | 0.303
출입구번호 | 0.337 | 1.000 | 0.000 | 0.000 | 0.110
시작층 | 0.297 | 0.000 | 1.000 | 0.296 | 0.553
상하행구분 | 0.000 | 0.000 | 0.296 | 1.000 | 0.234
종료층 | 0.303 | 0.110 | 0.553 | 0.234 | 1.000

Variable | 역명 | 상하행구분 | 출입구번호 | 시작층 | 종료층
역명 | 1.000 | 0.000 | 0.337 | 0.297 | 0.303
상하행구분 | 0.000 | 1.000 | 0.000 | 0.296 | 0.234
출입구번호 | 0.337 | 0.000 | 1.000 | 0.000 | 0.110
시작층 | 0.297 | 0.296 | 0.000 | 1.000 | 0.553
종료층 | 0.303 | 0.234 | 0.110 | 0.553 | 1.000
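
The report does not state which association measure produced each of these matrices. As an illustration only, the sketch below computes the plain (uncorrected) Cramér's V, one common measure of association between two categorical variables, using the standard library. The function name `cramers_v` is hypothetical, and the four value pairs are the 시작층 and 종료층 values of the first four sample rows, so the result does not reproduce any coefficient shown here.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    if len(xs) != len(ys):
        raise ValueError("sequences must have the same length")
    n = len(xs)
    x_totals = Counter(xs)
    y_totals = Counter(ys)
    observed = Counter(zip(xs, ys))
    k = min(len(x_totals), len(y_totals)) - 1
    if n == 0 or k == 0:
        # Undefined for empty or constant input; 0.0 is a convention chosen here.
        return 0.0
    chi2 = 0.0
    for x, x_count in x_totals.items():
        for y, y_count in y_totals.items():
            expected = x_count * y_count / n
            chi2 += (observed[(x, y)] - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * k))


start_floor = ["지상1", "지상2", "지상1", "지상2"]
end_floor = ["지상2", "지상1", "지상2", "지상1"]
print(cramers_v(start_floor, end_floor))
```

In this toy input each start floor maps to exactly one end floor, so the association is perfect; real columns with more categories and fewer repeated pairs give values between 0 and 1.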

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
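
The report counts 0 missing cells, yet the 출입구번호 summary lists `<NA>` as its most common value (69 of 129 rows, with a median length of 4). This suggests the placeholder was profiled as an ordinary string. A minimal sketch of counting such placeholders as missing, assuming `<NA>` is the only sentinel; the helper name `missing_share` is hypothetical and the list below is synthetic, shaped only to match the reported counts.

```python
def missing_share(values, sentinels=("<NA>",)):
    """Return (count, percent) of entries that are None or a sentinel string."""
    missing = sum(1 for v in values if v is None or v in sentinels)
    percent = 100 * missing / len(values) if values else 0.0
    return missing, round(percent, 1)


# Synthetic column: 69 placeholders and 60 real values, 129 rows in total.
entrance_numbers = ["<NA>"] * 69 + ["4"] * 60
print(missing_share(entrance_numbers))
```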

Sample

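Index | 철도운영기관 | 선명 | 역명 | 상하행구분 | 출입구번호 | 상세위치 | 시작층 | 종료층
0 | 부산교통공사 | 4호선 | 고촌 | 상행 | <NA> | 상선 측 실로암묘원에서 버스종점 엘리베트 전방 | 지상1 | 지상2
1 | 부산교통공사 | 4호선 | 고촌 | 하행 | <NA> | 2층 대합실 출입구앞(상선 측) | 지상2 | 지상1
2 | 부산교통공사 | 4호선 | 고촌 | 하행 | <NA> | 하선 측 운봉교에서 안평역방향 | 지상1 | 지상2
3 | 부산교통공사 | 4호선 | 고촌 | 상행 | <NA> | 2층 대합실 출입구앞(하선 측) | 지상2 | 지상1
4 | 부산교통공사 | 4호선 | 고촌 | 상행 | <NA> | 대합실 매표소 표내는 곳 | 지상2 | 지상3
5 | 부산교통공사 | 4호선 | 고촌 | 하행 | <NA> | 안평역 방향 대합실 | 지상3 | 지상2
6 | 부산교통공사 | 4호선 | 금사 | 상행 | 3 | (1F) 3번 출입구 | 지상1 | 지하1
7 | 부산교통공사 | 4호선 | 금사 | 하행 | 3 | (B1) 3번 출입구 | 지하1 | 지상1
8 | 부산교통공사 | 4호선 | 금사 | 하행 | 4 | (1F) 4번 출입구 | 지상1 | 지하1
9 | 부산교통공사 | 4호선 | 금사 | 상행 | 4 | (B1) 4번 출입구 | 지하1 | 지상1

Index | 철도운영기관 | 선명 | 역명 | 상하행구분 | 출입구번호 | 상세위치 | 시작층 | 종료층
119 | 부산교통공사 | 4호선 | 충렬사(안락) | 하행 | 4 | 4번 출입구 근처 | 지상1 | 지하1
120 | 부산교통공사 | 4호선 | 충렬사(안락) | 상행 | 4 | (B1)만남의 장소 > 서원시장 방면 4번 출입구 | 지하1 | 지상1
121 | 부산교통공사 | 4호선 | 충렬사(안락) | 상행 | <NA> | (B2)대합실 > (B1)표 내는곳 | 지하2 | 지하1
122 | 부산교통공사 | 4호선 | 충렬사(안락) | 하행 | <NA> | (B1)표 내는곳 > (B2)대합실 | 지하1 | 지하2
123 | 부산교통공사 | 4호선 | 충렬사(안락) | 상행 | <NA> | (B3)낙민역 방향 승강장 6-2 앞 > (B2)대합실 | 지하3 | 지하2
124 | 부산교통공사 | 4호선 | 충렬사(안락) | 상행 | <NA> | (B3)명장역 방향 승강장 6-2 앞 > (B2)대합실 | 지하3 | 지하2
125 | 부산교통공사 | 4호선 | 충렬사(안락) | 상행 | <NA> | (B3)낙민역 방향 승강장 1-1 앞 > (B2)대합실 | 지하3 | 지하2
126 | 부산교통공사 | 4호선 | 충렬사(안락) | 하행 | <NA> | (B2)대합실 > (B3)낙민역 방향 승강장 1-1 앞 | 지하2 | 지하3
127 | 부산교통공사 | 4호선 | 충렬사(안락) | 상행 | <NA> | (B3)명장역 방향 승강장 1-1 앞 > (B2)대합실 | 지하3 | 지하2
128 | 부산교통공사 | 4호선 | 충렬사(안락) | 하행 | <NA> | (B2)대합실 > (B3)명장역 방향 승강장 1-1 앞 | 지하2 | 지하3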

Duplicate rows

Most frequently occurring

Index | 철도운영기관 | 선명 | 역명 | 상하행구분 | 출입구번호 | 상세위치 | 시작층 | 종료층 | # duplicates
0 | 부산교통공사 | 4호선 | 동래 | 하행 | <NA> | (B3) 누가타커피옆 | 지하3 | 지하4 | 2
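
A row listed with 2 occurrences corresponds to the single duplicate row counted in the overview: one original plus one repeat. A minimal sketch of finding such rows with the standard library, assuming rows are tuples; the helper name `duplicated_rows` is hypothetical, and the tuples omit the two constant columns for brevity.

```python
from collections import Counter


def duplicated_rows(rows):
    """Return {row: occurrences} for rows that appear more than once."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}


rows = [
    ("동래", "하행", "<NA>", "(B3) 누가타커피옆", "지하3", "지하4"),
    ("동래", "하행", "<NA>", "(B3) 누가타커피옆", "지하3", "지하4"),
    ("고촌", "상행", "<NA>", "대합실 매표소 표내는 곳", "지상2", "지상3"),
]
duplicates = duplicated_rows(rows)
for row, occurrences in duplicates.items():
    print(occurrences, row)
```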