Overview

Dataset statistics

Number of variables5
Number of observations276
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.9 KiB
Average record size in memory40.5 B

Variable types

Categorical2
Text3

Dataset

Description서울교통공사에서 관리하는 도시광역철도역들의 철도운영기관명, 선명, 역명, 지번주소, 도로명주소의 데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041124/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
지번주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 16:00:38.581841
Analysis finished2023-12-12 16:00:39.181858
Duration0.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
서울교통공사
276 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울교통공사
2nd row서울교통공사
3rd row서울교통공사
4th row서울교통공사
5th row서울교통공사

Common Values

ValueCountFrequency (%)
서울교통공사 276
100.0%

Length

2023-12-13T01:00:39.258515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:00:39.388499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울교통공사 276
100.0%

선명
Categorical

Distinct8
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
5호선
56 
2호선
51 
7호선
42 
6호선
39 
3호선
34 
Other values (3)
54 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1호선
2nd row1호선
3rd row1호선
4th row1호선
5th row1호선

Common Values

ValueCountFrequency (%)
5호선 56
20.3%
2호선 51
18.5%
7호선 42
15.2%
6호선 39
14.1%
3호선 34
12.3%
4호선 26
9.4%
8호선 18
 
6.5%
1호선 10
 
3.6%

Length

2023-12-13T01:00:39.517476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:00:39.675917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5호선 56
20.3%
2호선 51
18.5%
7호선 42
15.2%
6호선 39
14.1%
3호선 34
12.3%
4호선 26
9.4%
8호선 18
 
6.5%
1호선 10
 
3.6%

역명
Text

Distinct240
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-13T01:00:39.970296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length4.2898551
Min length2

Characters and Unicode

Total characters1184
Distinct characters235
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique205 ?
Unique (%)74.3%

Sample

1st row동대문
2nd row동묘앞
3rd row서울역
4th row시청
5th row신설동
ValueCountFrequency (%)
동대문역사문화공원 3
 
1.1%
까치산 2
 
0.7%
충정로(경기대입구 2
 
0.7%
사당 2
 
0.7%
충무로 2
 
0.7%
영등포구청 2
 
0.7%
왕십리 2
 
0.7%
을지로4가 2
 
0.7%
잠실(송파구청 2
 
0.7%
을지로3가 2
 
0.7%
Other values (230) 255
92.4%
2023-12-13T01:00:40.753896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 58
 
4.9%
( 58
 
4.9%
49
 
4.1%
48
 
4.1%
30
 
2.5%
28
 
2.4%
28
 
2.4%
23
 
1.9%
22
 
1.9%
20
 
1.7%
Other values (225) 820
69.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1056
89.2%
Close Punctuation 58
 
4.9%
Open Punctuation 58
 
4.9%
Decimal Number 8
 
0.7%
Other Punctuation 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
4.6%
48
 
4.5%
30
 
2.8%
28
 
2.7%
28
 
2.7%
23
 
2.2%
22
 
2.1%
20
 
1.9%
20
 
1.9%
17
 
1.6%
Other values (219) 771
73.0%
Decimal Number
ValueCountFrequency (%)
3 5
62.5%
4 2
 
25.0%
5 1
 
12.5%
Close Punctuation
ValueCountFrequency (%)
) 58
100.0%
Open Punctuation
ValueCountFrequency (%)
( 58
100.0%
Other Punctuation
ValueCountFrequency (%)
· 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1056
89.2%
Common 128
 
10.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
4.6%
48
 
4.5%
30
 
2.8%
28
 
2.7%
28
 
2.7%
23
 
2.2%
22
 
2.1%
20
 
1.9%
20
 
1.9%
17
 
1.6%
Other values (219) 771
73.0%
Common
ValueCountFrequency (%)
) 58
45.3%
( 58
45.3%
3 5
 
3.9%
· 4
 
3.1%
4 2
 
1.6%
5 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1056
89.2%
ASCII 124
 
10.5%
None 4
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 58
46.8%
( 58
46.8%
3 5
 
4.0%
4 2
 
1.6%
5 1
 
0.8%
Hangul
ValueCountFrequency (%)
49
 
4.6%
48
 
4.5%
30
 
2.8%
28
 
2.7%
28
 
2.7%
23
 
2.2%
22
 
2.1%
20
 
1.9%
20
 
1.9%
17
 
1.6%
Other values (219) 771
73.0%
None
ValueCountFrequency (%)
· 4
100.0%

지번주소
Text

UNIQUE 

Distinct276
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-13T01:00:41.228928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length35
Mean length28.753623
Min length19

Characters and Unicode

Total characters7936
Distinct characters240
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique276 ?
Unique (%)100.0%

Sample

1st row서울특별시 종로구 창신동 492-1 동대문역(1호선)
2nd row서울특별시 종로구 숭인동 117 동묘앞역(1호선)
3rd row서울특별시 중구 남대문로5가 73-6 서울역(1호선)
4th row서울특별시 중구 정동 5-5 시청역(1호선)
5th row서울특별시 동대문구 신설동 76-5 신설동역(1호선)
ValueCountFrequency (%)
서울특별시 260
 
18.6%
중구 23
 
1.6%
송파구 21
 
1.5%
강남구 17
 
1.2%
마포구 16
 
1.1%
경기도 15
 
1.1%
종로구 15
 
1.1%
성동구 14
 
1.0%
은평구 13
 
0.9%
노원구 13
 
0.9%
Other values (729) 988
70.8%
2023-12-13T01:00:41.831768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1119
 
14.1%
330
 
4.2%
311
 
3.9%
302
 
3.8%
289
 
3.6%
284
 
3.6%
283
 
3.6%
283
 
3.6%
) 280
 
3.5%
( 280
 
3.5%
Other values (230) 4175
52.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4711
59.4%
Decimal Number 1355
 
17.1%
Space Separator 1119
 
14.1%
Close Punctuation 280
 
3.5%
Open Punctuation 280
 
3.5%
Dash Punctuation 191
 
2.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
330
 
7.0%
311
 
6.6%
302
 
6.4%
289
 
6.1%
284
 
6.0%
283
 
6.0%
283
 
6.0%
265
 
5.6%
261
 
5.5%
260
 
5.5%
Other values (216) 1843
39.1%
Decimal Number
ValueCountFrequency (%)
1 229
16.9%
2 192
14.2%
5 151
11.1%
3 149
11.0%
6 145
10.7%
4 133
9.8%
7 128
9.4%
9 82
 
6.1%
8 80
 
5.9%
0 66
 
4.9%
Space Separator
ValueCountFrequency (%)
1119
100.0%
Close Punctuation
ValueCountFrequency (%)
) 280
100.0%
Open Punctuation
ValueCountFrequency (%)
( 280
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 191
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4711
59.4%
Common 3225
40.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
330
 
7.0%
311
 
6.6%
302
 
6.4%
289
 
6.1%
284
 
6.0%
283
 
6.0%
283
 
6.0%
265
 
5.6%
261
 
5.5%
260
 
5.5%
Other values (216) 1843
39.1%
Common
ValueCountFrequency (%)
1119
34.7%
) 280
 
8.7%
( 280
 
8.7%
1 229
 
7.1%
2 192
 
6.0%
- 191
 
5.9%
5 151
 
4.7%
3 149
 
4.6%
6 145
 
4.5%
4 133
 
4.1%
Other values (4) 356
 
11.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4711
59.4%
ASCII 3225
40.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1119
34.7%
) 280
 
8.7%
( 280
 
8.7%
1 229
 
7.1%
2 192
 
6.0%
- 191
 
5.9%
5 151
 
4.7%
3 149
 
4.6%
6 145
 
4.5%
4 133
 
4.1%
Other values (4) 356
 
11.0%
Hangul
ValueCountFrequency (%)
330
 
7.0%
311
 
6.6%
302
 
6.4%
289
 
6.1%
284
 
6.0%
283
 
6.0%
283
 
6.0%
265
 
5.6%
261
 
5.5%
260
 
5.5%
Other values (216) 1843
39.1%
Distinct264
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-13T01:00:42.224594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length29
Mean length24.583333
Min length19

Characters and Unicode

Total characters6785
Distinct characters208
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique252 ?
Unique (%)91.3%

Sample

1st row서울특별시 종로구 종로 지하302(창신동)
2nd row서울특별시 종로구 종로 359(숭인동)
3rd row서울특별시 중구 세종대로 지하2(남대문로 5가)
4th row서울특별시 중구 세종대로 지하101(정동)
5th row서울특별시 동대문구 왕산로 지하1(신설동)
ValueCountFrequency (%)
서울특별시 261
 
23.2%
중구 23
 
2.0%
송파구 21
 
1.9%
강남구 17
 
1.5%
마포구 16
 
1.4%
종로구 15
 
1.3%
경기도 15
 
1.3%
성동구 14
 
1.2%
노원구 13
 
1.2%
은평구 13
 
1.2%
Other values (411) 715
63.7%
2023-12-13T01:00:42.754818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
849
 
12.5%
333
 
4.9%
316
 
4.7%
310
 
4.6%
285
 
4.2%
276
 
4.1%
) 274
 
4.0%
( 274
 
4.0%
261
 
3.8%
261
 
3.8%
Other values (198) 3346
49.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4519
66.6%
Decimal Number 850
 
12.5%
Space Separator 849
 
12.5%
Close Punctuation 274
 
4.0%
Open Punctuation 274
 
4.0%
Dash Punctuation 19
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
333
 
7.4%
316
 
7.0%
310
 
6.9%
285
 
6.3%
276
 
6.1%
261
 
5.8%
261
 
5.8%
261
 
5.8%
261
 
5.8%
254
 
5.6%
Other values (184) 1701
37.6%
Decimal Number
ValueCountFrequency (%)
1 169
19.9%
2 131
15.4%
0 93
10.9%
3 93
10.9%
7 71
8.4%
4 68
8.0%
5 66
 
7.8%
6 55
 
6.5%
9 53
 
6.2%
8 51
 
6.0%
Space Separator
ValueCountFrequency (%)
849
100.0%
Close Punctuation
ValueCountFrequency (%)
) 274
100.0%
Open Punctuation
ValueCountFrequency (%)
( 274
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4519
66.6%
Common 2266
33.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
333
 
7.4%
316
 
7.0%
310
 
6.9%
285
 
6.3%
276
 
6.1%
261
 
5.8%
261
 
5.8%
261
 
5.8%
261
 
5.8%
254
 
5.6%
Other values (184) 1701
37.6%
Common
ValueCountFrequency (%)
849
37.5%
) 274
 
12.1%
( 274
 
12.1%
1 169
 
7.5%
2 131
 
5.8%
0 93
 
4.1%
3 93
 
4.1%
7 71
 
3.1%
4 68
 
3.0%
5 66
 
2.9%
Other values (4) 178
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4519
66.6%
ASCII 2266
33.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
849
37.5%
) 274
 
12.1%
( 274
 
12.1%
1 169
 
7.5%
2 131
 
5.8%
0 93
 
4.1%
3 93
 
4.1%
7 71
 
3.1%
4 68
 
3.0%
5 66
 
2.9%
Other values (4) 178
 
7.9%
Hangul
ValueCountFrequency (%)
333
 
7.4%
316
 
7.0%
310
 
6.9%
285
 
6.3%
276
 
6.1%
261
 
5.8%
261
 
5.8%
261
 
5.8%
261
 
5.8%
254
 
5.6%
Other values (184) 1701
37.6%

Missing values

2023-12-13T01:00:39.006149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:00:39.142417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명지번주소도로명주소
0서울교통공사1호선동대문서울특별시 종로구 창신동 492-1 동대문역(1호선)서울특별시 종로구 종로 지하302(창신동)
1서울교통공사1호선동묘앞서울특별시 종로구 숭인동 117 동묘앞역(1호선)서울특별시 종로구 종로 359(숭인동)
2서울교통공사1호선서울역서울특별시 중구 남대문로5가 73-6 서울역(1호선)서울특별시 중구 세종대로 지하2(남대문로 5가)
3서울교통공사1호선시청서울특별시 중구 정동 5-5 시청역(1호선)서울특별시 중구 세종대로 지하101(정동)
4서울교통공사1호선신설동서울특별시 동대문구 신설동 76-5 신설동역(1호선)서울특별시 동대문구 왕산로 지하1(신설동)
5서울교통공사1호선제기동서울특별시 동대문구 제기동 65 제기동역(1호선)서울특별시 동대문구 왕산로 지하93(제기동)
6서울교통공사1호선종각서울특별시 종로구 종로1가 54 종각역(1호선)서울특별시 종로구 종로 지하55(종로1가)
7서울교통공사1호선종로3가서울특별시 종로구 종로3가 10-5 종로3가역(1호선)서울특별시 종로구 종로 지하129(종로3가)
8서울교통공사1호선종로5가서울특별시 종로구 종로5가 82-1 종로5가역(1호선)서울특별시 종로구 종로 지하216(종로5가)
9서울교통공사1호선청량리(서울시립대입구)서울특별시 동대문구 전농동 620-69 청량리역(1호선)서울특별시 동대문구 왕산로 지하205(전농동)
철도운영기관명선명역명지번주소도로명주소
266서울교통공사8호선복정서울특별시 송파구 장지동 596-21 복정역(8호선)서울특별시 송파구 송파대로 지하6(장지동)
267서울교통공사8호선산성경기도 성남시 수정구 신흥동 7 산성역(8호선)경기도 성남시 수정구 수정로 지하365(신흥동)
268서울교통공사8호선석촌서울특별시 송파구 석촌동 209 석촌역(8호선)서울특별시 송파구 송파대로 지하439(석촌동)
269서울교통공사8호선송파서울특별시 송파구 가락동 459-4 송파역(8호선)서울특별시 송파구 송파대로 지하354(가락동)
270서울교통공사8호선수진경기도 성남시 수정구 수진동 2205-1 수진역(8호선)경기도 성남시 수정구 산성대로 지하200(수진동)
271서울교통공사8호선신흥경기도 성남시 수정구 신흥동 2467 신흥역(8호선)경기도 성남시 수정구 산성대로 지하280(신흥동)
272서울교통공사8호선암사서울특별시 강동구 암사동 501 암사역(8호선)서울특별시 강동구 올림픽로 지하776(암사동)
273서울교통공사8호선잠실(송파구청)서울특별시 송파구 신천동 7-4 잠실역(8호선)서울특별시 송파구 올림픽로 305(신천동)
274서울교통공사8호선장지서울특별시 송파구 장지동 201-5 장지역(8호선)서울특별시 송파구 송파대로 지하82(장지동)
275서울교통공사8호선천호(풍납토성)서울특별시 강동구 천호동 455 천호역(8호선)서울특별시 강동구 천호대로 지하997(천호동)