Overview

Dataset statistics

Number of variables5
Number of observations114
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.6 KiB
Average record size in memory41.2 B

Variable types

Categorical2
Text3

Dataset

Description부산교통공사에서 관리하는 도시광역철도역들의 철도운영기관명, 선명, 역명, 지번주소, 도로명주소의 데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041101/fileData.do

Alerts

철도운영기관명 has constant value ""Constant

Reproduction

Analysis started2023-12-12 06:08:39.916101
Analysis finished2023-12-12 06:08:40.362119
Duration0.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
부산교통공사
114 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산교통공사
2nd row부산교통공사
3rd row부산교통공사
4th row부산교통공사
5th row부산교통공사

Common Values

ValueCountFrequency (%)
부산교통공사 114
100.0%

Length

2023-12-12T15:08:40.430461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:08:40.532692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산교통공사 114
100.0%

선명
Categorical

Distinct4
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2호선
43 
1호선
40 
3호선
17 
4호선
14 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1호선
2nd row1호선
3rd row1호선
4th row1호선
5th row1호선

Common Values

ValueCountFrequency (%)
2호선 43
37.7%
1호선 40
35.1%
3호선 17
 
14.9%
4호선 14
 
12.3%

Length

2023-12-12T15:08:40.632716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:08:40.771105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2호선 43
37.7%
1호선 40
35.1%
3호선 17
 
14.9%
4호선 14
 
12.3%

역명
Text

Distinct108
Distinct (%)94.7%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T15:08:41.066198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length2
Mean length4.5
Min length2

Characters and Unicode

Total characters513
Distinct characters169
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique102 ?
Unique (%)89.5%

Sample

1st row괴정
2nd row교대
3rd row구서
4th row남산(부산외국대학교)
5th row남포
ValueCountFrequency (%)
덕천(부산과기대 2
 
1.8%
미남 2
 
1.8%
수영 2
 
1.8%
연산 2
 
1.8%
동래 2
 
1.8%
서면 2
 
1.8%
지게골 1
 
0.9%
전포 1
 
0.9%
주례 1
 
0.9%
중동 1
 
0.9%
Other values (98) 98
86.0%
2023-12-12T15:08:41.551571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 30
 
5.8%
) 30
 
5.8%
26
 
5.1%
25
 
4.9%
17
 
3.3%
13
 
2.5%
10
 
1.9%
10
 
1.9%
9
 
1.8%
9
 
1.8%
Other values (159) 334
65.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 438
85.4%
Open Punctuation 30
 
5.8%
Close Punctuation 30
 
5.8%
Uppercase Letter 8
 
1.6%
Other Punctuation 7
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
5.9%
25
 
5.7%
17
 
3.9%
13
 
3.0%
10
 
2.3%
10
 
2.3%
9
 
2.1%
9
 
2.1%
9
 
2.1%
8
 
1.8%
Other values (149) 302
68.9%
Uppercase Letter
ValueCountFrequency (%)
B 2
25.0%
C 1
12.5%
O 1
12.5%
X 1
12.5%
E 1
12.5%
S 1
12.5%
K 1
12.5%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Other Punctuation
ValueCountFrequency (%)
· 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 438
85.4%
Common 67
 
13.1%
Latin 8
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
5.9%
25
 
5.7%
17
 
3.9%
13
 
3.0%
10
 
2.3%
10
 
2.3%
9
 
2.1%
9
 
2.1%
9
 
2.1%
8
 
1.8%
Other values (149) 302
68.9%
Latin
ValueCountFrequency (%)
B 2
25.0%
C 1
12.5%
O 1
12.5%
X 1
12.5%
E 1
12.5%
S 1
12.5%
K 1
12.5%
Common
ValueCountFrequency (%)
( 30
44.8%
) 30
44.8%
· 7
 
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 438
85.4%
ASCII 68
 
13.3%
None 7
 
1.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 30
44.1%
) 30
44.1%
B 2
 
2.9%
C 1
 
1.5%
O 1
 
1.5%
X 1
 
1.5%
E 1
 
1.5%
S 1
 
1.5%
K 1
 
1.5%
Hangul
ValueCountFrequency (%)
26
 
5.9%
25
 
5.7%
17
 
3.9%
13
 
3.0%
10
 
2.3%
10
 
2.3%
9
 
2.1%
9
 
2.1%
9
 
2.1%
8
 
1.8%
Other values (149) 302
68.9%
None
ValueCountFrequency (%)
· 7
100.0%
Distinct105
Distinct (%)92.1%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T15:08:41.895818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length18.675439
Min length15

Characters and Unicode

Total characters2129
Distinct characters106
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique97 ?
Unique (%)85.1%

Sample

1st row부산광역시 사하구 괴정동 934-3
2nd row부산광역시 연제구 거제동 72-25
3rd row부산광역시 금정구 구서동 475
4th row부산광역시 금정구 남산동 136-1
5th row부산광역시 중구 남포동1가 11
ValueCountFrequency (%)
부산광역시 103
 
22.2%
북구 13
 
2.8%
사하구 12
 
2.6%
동래구 11
 
2.4%
부산진구 10
 
2.2%
해운대구 10
 
2.2%
금정구 9
 
1.9%
연제구 8
 
1.7%
수영구 7
 
1.5%
사상구 7
 
1.5%
Other values (183) 273
59.0%
2023-12-12T15:08:42.310578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
349
16.4%
134
 
6.3%
127
 
6.0%
124
 
5.8%
114
 
5.4%
112
 
5.3%
1 109
 
5.1%
106
 
5.0%
103
 
4.8%
- 74
 
3.5%
Other values (96) 777
36.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1245
58.5%
Decimal Number 461
 
21.7%
Space Separator 349
 
16.4%
Dash Punctuation 74
 
3.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
134
 
10.8%
127
 
10.2%
124
 
10.0%
114
 
9.2%
112
 
9.0%
106
 
8.5%
103
 
8.3%
23
 
1.8%
21
 
1.7%
16
 
1.3%
Other values (84) 365
29.3%
Decimal Number
ValueCountFrequency (%)
1 109
23.6%
2 53
11.5%
4 45
9.8%
5 45
9.8%
3 44
9.5%
7 43
 
9.3%
6 38
 
8.2%
0 30
 
6.5%
8 28
 
6.1%
9 26
 
5.6%
Space Separator
ValueCountFrequency (%)
349
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 74
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1245
58.5%
Common 884
41.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
134
 
10.8%
127
 
10.2%
124
 
10.0%
114
 
9.2%
112
 
9.0%
106
 
8.5%
103
 
8.3%
23
 
1.8%
21
 
1.7%
16
 
1.3%
Other values (84) 365
29.3%
Common
ValueCountFrequency (%)
349
39.5%
1 109
 
12.3%
- 74
 
8.4%
2 53
 
6.0%
4 45
 
5.1%
5 45
 
5.1%
3 44
 
5.0%
7 43
 
4.9%
6 38
 
4.3%
0 30
 
3.4%
Other values (2) 54
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1245
58.5%
ASCII 884
41.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
349
39.5%
1 109
 
12.3%
- 74
 
8.4%
2 53
 
6.0%
4 45
 
5.1%
5 45
 
5.1%
3 44
 
5.0%
7 43
 
4.9%
6 38
 
4.3%
0 30
 
3.4%
Other values (2) 54
 
6.1%
Hangul
ValueCountFrequency (%)
134
 
10.8%
127
 
10.2%
124
 
10.0%
114
 
9.2%
112
 
9.0%
106
 
8.5%
103
 
8.3%
23
 
1.8%
21
 
1.7%
16
 
1.3%
Other values (84) 365
29.3%
Distinct109
Distinct (%)95.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T15:08:42.627697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length25
Mean length21.052632
Min length15

Characters and Unicode

Total characters2400
Distinct characters100
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique104 ?
Unique (%)91.2%

Sample

1st row부산광역시 사하구 낙동대로 지하 221(괴정동)
2nd row부산광역시 연제구 중앙대로 지하 1217
3rd row부산광역시 금정구 금정로 233번길 46(구서동)
4th row부산광역시 금정구 중앙대로 2019-1
5th row부산광역시 중구 구덕로 지하 12
ValueCountFrequency (%)
부산광역시 103
 
19.0%
지하 74
 
13.7%
중앙대로 21
 
3.9%
북구 13
 
2.4%
사하구 12
 
2.2%
동래구 11
 
2.0%
부산진구 10
 
1.8%
수영로 10
 
1.8%
반송로 9
 
1.7%
금정구 9
 
1.7%
Other values (157) 270
49.8%
2023-12-12T15:08:43.104830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
428
17.8%
126
 
5.2%
119
 
5.0%
118
 
4.9%
114
 
4.8%
112
 
4.7%
109
 
4.5%
106
 
4.4%
98
 
4.1%
84
 
3.5%
Other values (90) 986
41.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1579
65.8%
Space Separator 428
 
17.8%
Decimal Number 347
 
14.5%
Open Punctuation 21
 
0.9%
Close Punctuation 21
 
0.9%
Dash Punctuation 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
126
 
8.0%
119
 
7.5%
118
 
7.5%
114
 
7.2%
112
 
7.1%
109
 
6.9%
106
 
6.7%
98
 
6.2%
84
 
5.3%
81
 
5.1%
Other values (76) 512
32.4%
Decimal Number
ValueCountFrequency (%)
1 63
18.2%
2 53
15.3%
0 42
12.1%
4 35
10.1%
7 32
9.2%
3 32
9.2%
9 25
 
7.2%
6 24
 
6.9%
5 21
 
6.1%
8 20
 
5.8%
Space Separator
ValueCountFrequency (%)
428
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1579
65.8%
Common 821
34.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
126
 
8.0%
119
 
7.5%
118
 
7.5%
114
 
7.2%
112
 
7.1%
109
 
6.9%
106
 
6.7%
98
 
6.2%
84
 
5.3%
81
 
5.1%
Other values (76) 512
32.4%
Common
ValueCountFrequency (%)
428
52.1%
1 63
 
7.7%
2 53
 
6.5%
0 42
 
5.1%
4 35
 
4.3%
7 32
 
3.9%
3 32
 
3.9%
9 25
 
3.0%
6 24
 
2.9%
5 21
 
2.6%
Other values (4) 66
 
8.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1579
65.8%
ASCII 821
34.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
428
52.1%
1 63
 
7.7%
2 53
 
6.5%
0 42
 
5.1%
4 35
 
4.3%
7 32
 
3.9%
3 32
 
3.9%
9 25
 
3.0%
6 24
 
2.9%
5 21
 
2.6%
Other values (4) 66
 
8.0%
Hangul
ValueCountFrequency (%)
126
 
8.0%
119
 
7.5%
118
 
7.5%
114
 
7.2%
112
 
7.1%
109
 
6.9%
106
 
6.7%
98
 
6.2%
84
 
5.3%
81
 
5.1%
Other values (76) 512
32.4%

Missing values

2023-12-12T15:08:40.196614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:08:40.322130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명지번주소도로명주소
0부산교통공사1호선괴정부산광역시 사하구 괴정동 934-3부산광역시 사하구 낙동대로 지하 221(괴정동)
1부산교통공사1호선교대부산광역시 연제구 거제동 72-25부산광역시 연제구 중앙대로 지하 1217
2부산교통공사1호선구서부산광역시 금정구 구서동 475부산광역시 금정구 금정로 233번길 46(구서동)
3부산교통공사1호선남산(부산외국대학교)부산광역시 금정구 남산동 136-1부산광역시 금정구 중앙대로 2019-1
4부산교통공사1호선남포부산광역시 중구 남포동1가 11부산광역시 중구 구덕로 지하 12
5부산교통공사1호선낫개부산시 사하구 다대동 63-13부산시 사하구 다대로 지하 422 낫개역
6부산교통공사1호선노포(종합버스터미널)부산광역시 금정구 노포동 133부산광역시 금정구 중앙대로 2238(노포동)
7부산교통공사1호선다대포항부산시 사하구 다대동 909부산시 사하구 다대로 지하 548 다대포항역
8부산교통공사1호선다대포해수욕장부산시 사하구 다대동 1552-16부산시 사하구 다대로 지하 692 다대포해수욕장역
9부산교통공사1호선당리(사하구청)부산광역시 사하구 당리동 323부산광역시 사하구 낙동대로 지하 405(당리동)
철도운영기관명선명역명지번주소도로명주소
104부산교통공사4호선명장부산광역시 동래구 명장동 184부산광역시 동래구 반송로 지하 281
105부산교통공사4호선미남부산광역시 동래구 온천동 1470부산광역시 동래구 아시아드대로 지하 232
106부산교통공사4호선반여농산물시장부산광역시 해운대구 석대동 507-1부산광역시 해운대구 반송로 550
107부산교통공사4호선서동부산광역시 금정구 서동 241부산광역시 금정구 반송로 387(서동)
108부산교통공사4호선석대부산광역시 해운대구 석대동 574-14부산광역시 해운대구 석대천로 121
109부산교통공사4호선수안부산광역시 동래구 수안동 204-2부산광역시 동래구 충렬대로 지하 223
110부산교통공사4호선안평(고촌주택단지)부산광역시 기장군 철마면 안평리 543-23부산광역시 기장군 철마면 반송로 1001
111부산교통공사4호선영산대(아랫반송)부산광역시 해운대구 반송동 698-12부산광역시 해운대구 반송로 803
112부산교통공사4호선윗반송부산광역시 해운대구 반송동 385-56부산광역시 해운대구 반송로 917
113부산교통공사4호선충렬사(안락)부산광역시 동래구 안락동 473부산광역시 동래구 반송로 지하 205