Overview

Dataset statistics

Number of variables5
Number of observations43
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory43.1 B

Variable types

Categorical2
Text3

Dataset

Description부산2호선에 포함된 도시광역철도역들의 철도운영기관명, 선명, 역명, 지번주소, 도로명주소의 데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041098/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
선명 has constant value ""Constant
역명 has unique valuesUnique
도로명주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:13:58.305672
Analysis finished2023-12-12 19:13:58.690380
Duration0.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
부산교통공사
43 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산교통공사
2nd row부산교통공사
3rd row부산교통공사
4th row부산교통공사
5th row부산교통공사

Common Values

ValueCountFrequency (%)
부산교통공사 43
100.0%

Length

2023-12-13T04:13:58.746645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:13:58.850201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산교통공사 43
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
2호선
43 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2호선
2nd row2호선
3rd row2호선
4th row2호선
5th row2호선

Common Values

ValueCountFrequency (%)
2호선 43
100.0%

Length

2023-12-13T04:13:58.946851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:13:59.040445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2호선 43
100.0%

역명
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-13T04:13:59.254573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length2
Mean length5.1162791
Min length2

Characters and Unicode

Total characters220
Distinct characters106
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row가야
2nd row감전(사상구청)
3rd row개금
4th row경성대·부경대(동명대학교)
5th row광안
ValueCountFrequency (%)
가야 1
 
2.3%
문현 1
 
2.3%
벡스코(시립미술관 1
 
2.3%
부산대양산캠퍼스 1
 
2.3%
부암(온종합병원 1
 
2.3%
사상(서부터미널 1
 
2.3%
서면 1
 
2.3%
센텀시티(bexco·신세계 1
 
2.3%
수영 1
 
2.3%
수정(방송통신대 1
 
2.3%
Other values (33) 33
76.7%
2023-12-13T04:13:59.720389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 14
 
6.4%
) 14
 
6.4%
11
 
5.0%
9
 
4.1%
6
 
2.7%
6
 
2.7%
· 5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.8%
Other values (96) 141
64.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 179
81.4%
Open Punctuation 14
 
6.4%
Close Punctuation 14
 
6.4%
Uppercase Letter 8
 
3.6%
Other Punctuation 5
 
2.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
6.1%
9
 
5.0%
6
 
3.4%
6
 
3.4%
5
 
2.8%
5
 
2.8%
4
 
2.2%
4
 
2.2%
4
 
2.2%
3
 
1.7%
Other values (86) 122
68.2%
Uppercase Letter
ValueCountFrequency (%)
B 2
25.0%
X 1
12.5%
E 1
12.5%
C 1
12.5%
O 1
12.5%
K 1
12.5%
S 1
12.5%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Other Punctuation
ValueCountFrequency (%)
· 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 179
81.4%
Common 33
 
15.0%
Latin 8
 
3.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
6.1%
9
 
5.0%
6
 
3.4%
6
 
3.4%
5
 
2.8%
5
 
2.8%
4
 
2.2%
4
 
2.2%
4
 
2.2%
3
 
1.7%
Other values (86) 122
68.2%
Latin
ValueCountFrequency (%)
B 2
25.0%
X 1
12.5%
E 1
12.5%
C 1
12.5%
O 1
12.5%
K 1
12.5%
S 1
12.5%
Common
ValueCountFrequency (%)
( 14
42.4%
) 14
42.4%
· 5
 
15.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 179
81.4%
ASCII 36
 
16.4%
None 5
 
2.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 14
38.9%
) 14
38.9%
B 2
 
5.6%
X 1
 
2.8%
E 1
 
2.8%
C 1
 
2.8%
O 1
 
2.8%
K 1
 
2.8%
S 1
 
2.8%
Hangul
ValueCountFrequency (%)
11
 
6.1%
9
 
5.0%
6
 
3.4%
6
 
3.4%
5
 
2.8%
5
 
2.8%
4
 
2.2%
4
 
2.2%
4
 
2.2%
3
 
1.7%
Other values (86) 122
68.2%
None
ValueCountFrequency (%)
· 5
100.0%
Distinct41
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-13T04:14:00.008066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length20
Mean length18.767442
Min length15

Characters and Unicode

Total characters807
Distinct characters67
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)90.7%

Sample

1st row부산광역시 부산진구 가야동 15-13
2nd row부산광역시 사상구 감전동 174
3rd row부산광역시 부산진구 개금동 487-2
4th row부산광역시 남구 대연동 77
5th row부산광역시 수영구 광안동 205
ValueCountFrequency (%)
부산광역시 38
21.5%
북구 8
 
4.5%
사상구 7
 
4.0%
부산진구 6
 
3.4%
해운대구 6
 
3.4%
남구 6
 
3.4%
경상남도 5
 
2.8%
양산시 5
 
2.8%
수영구 5
 
2.8%
우동 4
 
2.3%
Other values (69) 87
49.2%
2023-12-13T04:14:00.499246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
134
16.6%
53
 
6.6%
47
 
5.8%
43
 
5.3%
41
 
5.1%
40
 
5.0%
40
 
5.0%
1 39
 
4.8%
38
 
4.7%
- 31
 
3.8%
Other values (57) 301
37.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 471
58.4%
Decimal Number 171
 
21.2%
Space Separator 134
 
16.6%
Dash Punctuation 31
 
3.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
11.3%
47
 
10.0%
43
 
9.1%
41
 
8.7%
40
 
8.5%
40
 
8.5%
38
 
8.1%
13
 
2.8%
12
 
2.5%
9
 
1.9%
Other values (45) 135
28.7%
Decimal Number
ValueCountFrequency (%)
1 39
22.8%
7 21
12.3%
2 20
11.7%
5 18
10.5%
6 16
9.4%
8 14
 
8.2%
4 14
 
8.2%
3 13
 
7.6%
9 11
 
6.4%
0 5
 
2.9%
Space Separator
ValueCountFrequency (%)
134
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 471
58.4%
Common 336
41.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
11.3%
47
 
10.0%
43
 
9.1%
41
 
8.7%
40
 
8.5%
40
 
8.5%
38
 
8.1%
13
 
2.8%
12
 
2.5%
9
 
1.9%
Other values (45) 135
28.7%
Common
ValueCountFrequency (%)
134
39.9%
1 39
 
11.6%
- 31
 
9.2%
7 21
 
6.2%
2 20
 
6.0%
5 18
 
5.4%
6 16
 
4.8%
8 14
 
4.2%
4 14
 
4.2%
3 13
 
3.9%
Other values (2) 16
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 471
58.4%
ASCII 336
41.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
134
39.9%
1 39
 
11.6%
- 31
 
9.2%
7 21
 
6.2%
2 20
 
6.0%
5 18
 
5.4%
6 16
 
4.8%
8 14
 
4.2%
4 14
 
4.2%
3 13
 
3.9%
Other values (2) 16
 
4.8%
Hangul
ValueCountFrequency (%)
53
 
11.3%
47
 
10.0%
43
 
9.1%
41
 
8.7%
40
 
8.5%
40
 
8.5%
38
 
8.1%
13
 
2.8%
12
 
2.5%
9
 
1.9%
Other values (45) 135
28.7%

도로명주소
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-13T04:14:00.799811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length20.511628
Min length15

Characters and Unicode

Total characters882
Distinct characters57
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row부산광역시 부산진구 가야대로 지하 650
2nd row부산광역시 사상구 사상로 지하 93-1
3rd row부산광역시 부산진구 가야대로 지하 442
4th row부산광역시 남구 수영로 지하 324
5th row부산광역시 수영구 수영로 지하 576(광안동)
ValueCountFrequency (%)
부산광역시 38
18.5%
지하 30
 
14.6%
수영로 9
 
4.4%
북구 8
 
3.9%
사상구 7
 
3.4%
남구 6
 
2.9%
부산진구 6
 
2.9%
가야대로 6
 
2.9%
금곡대로 6
 
2.9%
수영구 5
 
2.4%
Other values (56) 84
41.0%
2023-12-13T04:14:01.344535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
162
18.4%
50
 
5.7%
44
 
5.0%
43
 
4.9%
43
 
4.9%
40
 
4.5%
38
 
4.3%
37
 
4.2%
36
 
4.1%
36
 
4.1%
Other values (47) 353
40.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 580
65.8%
Space Separator 162
 
18.4%
Decimal Number 127
 
14.4%
Close Punctuation 5
 
0.6%
Open Punctuation 5
 
0.6%
Dash Punctuation 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
 
8.6%
44
 
7.6%
43
 
7.4%
43
 
7.4%
40
 
6.9%
38
 
6.6%
37
 
6.4%
36
 
6.2%
36
 
6.2%
30
 
5.2%
Other values (33) 183
31.6%
Decimal Number
ValueCountFrequency (%)
2 17
13.4%
1 17
13.4%
0 16
12.6%
4 15
11.8%
7 14
11.0%
6 14
11.0%
3 10
7.9%
9 9
7.1%
5 8
6.3%
8 7
5.5%
Space Separator
ValueCountFrequency (%)
162
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 580
65.8%
Common 302
34.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
 
8.6%
44
 
7.6%
43
 
7.4%
43
 
7.4%
40
 
6.9%
38
 
6.6%
37
 
6.4%
36
 
6.2%
36
 
6.2%
30
 
5.2%
Other values (33) 183
31.6%
Common
ValueCountFrequency (%)
162
53.6%
2 17
 
5.6%
1 17
 
5.6%
0 16
 
5.3%
4 15
 
5.0%
7 14
 
4.6%
6 14
 
4.6%
3 10
 
3.3%
9 9
 
3.0%
5 8
 
2.6%
Other values (4) 20
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 580
65.8%
ASCII 302
34.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
162
53.6%
2 17
 
5.6%
1 17
 
5.6%
0 16
 
5.3%
4 15
 
5.0%
7 14
 
4.6%
6 14
 
4.6%
3 10
 
3.3%
9 9
 
3.0%
5 8
 
2.6%
Other values (4) 20
 
6.6%
Hangul
ValueCountFrequency (%)
50
 
8.6%
44
 
7.6%
43
 
7.4%
43
 
7.4%
40
 
6.9%
38
 
6.6%
37
 
6.4%
36
 
6.2%
36
 
6.2%
30
 
5.2%
Other values (33) 183
31.6%

Correlations

2023-12-13T04:14:01.478470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명지번주소도로명주소
역명1.0001.0001.000
지번주소1.0001.0001.000
도로명주소1.0001.0001.000

Missing values

2023-12-13T04:13:58.560472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:13:58.656216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명지번주소도로명주소
0부산교통공사2호선가야부산광역시 부산진구 가야동 15-13부산광역시 부산진구 가야대로 지하 650
1부산교통공사2호선감전(사상구청)부산광역시 사상구 감전동 174부산광역시 사상구 사상로 지하 93-1
2부산교통공사2호선개금부산광역시 부산진구 개금동 487-2부산광역시 부산진구 가야대로 지하 442
3부산교통공사2호선경성대·부경대(동명대학교)부산광역시 남구 대연동 77부산광역시 남구 수영로 지하 324
4부산교통공사2호선광안부산광역시 수영구 광안동 205부산광역시 수영구 수영로 지하 576(광안동)
5부산교통공사2호선구남부산광역시 북구 구포동 897-21부산광역시 북구 백양대로 지하1026
6부산교통공사2호선구명부산광역시 북구 구포동 969-1부산광역시 북구 백양대로 지하1094
7부산교통공사2호선국제금융센터·부산은행부산광역시 남구 문현동 728-3부산광역시 남구 전포대로 지하 97
8부산교통공사2호선금곡부산광역시 북구 금곡동 581-1부산광역시 북구 금곡대로 685
9부산교통공사2호선금련산부산광역시 수영구 남천동 78부산광역시 수영구 수영로 지하 482(남천동)
철도운영기관명선명역명지번주소도로명주소
33부산교통공사2호선율리부산광역시 북구 금곡동 1241부산광역시 북구 금곡대로 지하440
34부산교통공사2호선장산(해운대백병원)부산광역시 해운대구 좌동 1491부산광역시 해운대구 해운대로 지하 820
35부산교통공사2호선전포부산광역시 부산진구 전포동 870-1부산광역시 부산진구 전포대로 지하 181
36부산교통공사2호선주례부산광역시 사상구 주례동 1162-16부산광역시 사상구 가야대로 지하 292
37부산교통공사2호선중동부산광역시 해운대구 중동 1783-1부산광역시 해운대구 해운대로 지하 730
38부산교통공사2호선증산경상남도 양산시 물금읍 증산리 35-6경남 양산시 물금읍 메기로 168번
39부산교통공사2호선지게골부산광역시 남구 문현동 254-2부산광역시 남구 수영로 지하 70
40부산교통공사2호선해운대부산광역시 해운대구 우동 656부산광역시 해운대구 해운대로 지하 626
41부산교통공사2호선호포경상남도 양산시 동면 가산리 산 91-5경남 양산시 동면 양산대로 62번
42부산교통공사2호선화명부산광역시 북구 화명동 1458-9부산광역시 북구 금곡대로 지하311