Overview

Dataset statistics

Number of variables: 5
Number of observations: 25
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.1 KiB
Average record size in memory: 45.3 B

Variable types

Categorical: 2
Text: 3

Dataset

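Description: Railway operator name (철도운영기관명), line name (선명), station name (역명), lot-number address (지번주소), and road-name address (도로명주소) for the urban and metropolitan railway stations on the Gyeongchun Line (경춘선).
Author: 국가철도공단 (Korea National Railway)
URL: https://www.data.go.kr/data/15041115/fileData.do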

Alerts

철도운영기관명 has constant value "코레일" (Constant)
선명 has constant value "경춘" (Constant)
역명 has unique values (Unique)
지번주소 has unique values (Unique)
도로명주소 has unique values (Unique)
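
As an illustration of how alerts like these could arise, the following minimal Python sketch flags a column as Constant when it has exactly one distinct value and as Unique when every value is distinct. The function name `column_alerts` and both rules are assumptions made for illustration; this is not necessarily how ydata-profiling decides. The sample values are the first five rows shown in this report.

```python
def column_alerts(values):
    """Return alert labels for one column under the assumed rules."""
    n = len(values)
    distinct = len(set(values))
    alerts = []
    # Assumed rule: a single distinct value means the column is constant.
    if n > 0 and distinct == 1:
        alerts.append("Constant")
    # Assumed rule: as many distinct values as rows means the column is unique.
    if n > 1 and distinct == n:
        alerts.append("Unique")
    return alerts


# First five rows of two columns, copied from the report's samples.
operator = ["코레일"] * 5
station = ["가평(자라섬·남이섬)", "갈매", "강촌", "광운대", "굴봉산(제이드가든)"]

print(column_alerts(operator))
print(column_alerts(station))
```

Under these assumed rules the operator column is reported as Constant and the station column as Unique, which agrees with the alerts listed for 철도운영기관명 and 역명.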

Reproduction

Analysis started: 2023-12-12 03:59:09.899767
Analysis finished: 2023-12-12 03:59:10.192627
Duration: 0.29 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 332.0 B
코레일: 25

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 코레일
2nd row: 코레일
3rd row: 코레일
4th row: 코레일
5th row: 코레일

Common Values

Value | Count | Frequency (%)
코레일 | 25 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
코레일 | 25 | 100.0%

선명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 332.0 B
경춘: 25

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경춘
2nd row: 경춘
3rd row: 경춘
4th row: 경춘
5th row: 경춘

Common Values

Value | Count | Frequency (%)
경춘 | 25 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
경춘 | 25 | 100.0%

역명
Text

UNIQUE 

Distinct: 25
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 332.0 B

Length

Max length: 11
Median length: 10
Mean length: 4.32
Min length: 2

Characters and Unicode

Total characters: 108
Distinct characters: 65
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
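
To make the character properties concrete, here is a small sketch that uses Python's standard `unicodedata` module to count the general categories in one station name. The helper name `category_counts` and the label mapping are illustrative choices, and the middle dot in the name is assumed to be U+00B7; the report itself does not say how its counts were produced.

```python
import unicodedata
from collections import Counter

# Report-style labels for the general-category codes that occur in this example.
CATEGORY_LABELS = {
    "Lo": "Other Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
}


def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# First sample row of 역명; the middle dot is assumed to be U+00B7.
name = "가평(자라섬\u00b7남이섬)"
counts = category_counts(name)

for code, n in counts.most_common():
    print(CATEGORY_LABELS.get(code, code), n)
```

For this one name the Hangul syllables fall under Other Letter and the parentheses and middle dot under the three punctuation categories, the same four categories the report lists for the whole column.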

Unique

Unique: 25
Unique (%): 100.0%

Sample

1st row: 가평(자라섬·남이섬)
2nd row: 갈매
3rd row: 강촌
4th row: 광운대
5th row: 굴봉산(제이드가든)

Value | Count | Frequency (%)
가평(자라섬·남이섬 | 1 | 4.0%
사릉 | 1 | 4.0%
평내호평 | 1 | 4.0%
퇴계원 | 1 | 4.0%
춘천(한림대 | 1 | 4.0%
청평 | 1 | 4.0%
청량리 | 1 | 4.0%
천마산 | 1 | 4.0%
중랑 | 1 | 4.0%
신내 | 1 | 4.0%
Other values (15) | 15 | 60.0%

Most occurring characters

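Value | Count | Frequency (%)
( | 7 | 6.5%
) | 7 | 6.5%
(not shown) | 5 | 4.6%
(not shown) | 4 | 3.7%
(not shown) | 4 | 3.7%
(not shown) | 4 | 3.7%
(not shown) | 3 | 2.8%
(not shown) | 3 | 2.8%
(not shown) | 3 | 2.8%
(not shown) | 2 | 1.9%
Other values (55) | 66 | 61.1%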

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 93 | 86.1%
Open Punctuation | 7 | 6.5%
Close Punctuation | 7 | 6.5%
Other Punctuation | 1 | 0.9%

Most frequent character per category

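Other Letter

Value | Count | Frequency (%)
(not shown) | 5 | 5.4%
(not shown) | 4 | 4.3%
(not shown) | 4 | 4.3%
(not shown) | 4 | 4.3%
(not shown) | 3 | 3.2%
(not shown) | 3 | 3.2%
(not shown) | 3 | 3.2%
(not shown) | 2 | 2.2%
(not shown) | 2 | 2.2%
(not shown) | 2 | 2.2%
Other values (52) | 61 | 65.6%

Open Punctuation

Value | Count | Frequency (%)
( | 7 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 7 | 100.0%

Other Punctuation

Value | Count | Frequency (%)
· | 1 | 100.0%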

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 93 | 86.1%
Common | 15 | 13.9%

Most frequent character per script

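Hangul

Value | Count | Frequency (%)
(not shown) | 5 | 5.4%
(not shown) | 4 | 4.3%
(not shown) | 4 | 4.3%
(not shown) | 4 | 4.3%
(not shown) | 3 | 3.2%
(not shown) | 3 | 3.2%
(not shown) | 3 | 3.2%
(not shown) | 2 | 2.2%
(not shown) | 2 | 2.2%
(not shown) | 2 | 2.2%
Other values (52) | 61 | 65.6%

Common

Value | Count | Frequency (%)
( | 7 | 46.7%
) | 7 | 46.7%
· | 1 | 6.7%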

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 93 | 86.1%
ASCII | 14 | 13.0%
None | 1 | 0.9%

Most frequent character per block

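ASCII

Value | Count | Frequency (%)
( | 7 | 50.0%
) | 7 | 50.0%

Hangul

Value | Count | Frequency (%)
(not shown) | 5 | 5.4%
(not shown) | 4 | 4.3%
(not shown) | 4 | 4.3%
(not shown) | 4 | 4.3%
(not shown) | 3 | 3.2%
(not shown) | 3 | 3.2%
(not shown) | 3 | 3.2%
(not shown) | 2 | 2.2%
(not shown) | 2 | 2.2%
(not shown) | 2 | 2.2%
Other values (52) | 61 | 65.6%

None

Value | Count | Frequency (%)
· | 1 | 100.0%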

지번주소
Text

UNIQUE 

Distinct: 25
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 332.0 B

Length

Max length: 26
Median length: 22
Mean length: 20.28
Min length: 13

Characters and Unicode

Total characters: 507
Distinct characters: 76
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 25
Unique (%): 100.0%

Sample

1st row: 경기도 가평군 가평읍 달전리 567
2nd row: 구리시 갈매동 502-39
3rd row: 강원특별자치도 춘천시 남산면 방곡리 409
4th row: 서울특별시 노원구 월계동 85
5th row: 강원특별자치도 춘천시 남산면 백양리 588-30

Value | Count | Frequency (%)
경기도 | 11 | 10.0%
남양주시 | 7 | 6.4%
서울특별시 | 7 | 6.4%
춘천시 | 6 | 5.5%
강원특별자치도 | 5 | 4.5%
가평군 | 4 | 3.6%
중랑구 | 4 | 3.6%
남산면 | 3 | 2.7%
청평면 | 3 | 2.7%
동대문구 | 2 | 1.8%
Other values (56) | 58 | 52.7%
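
The frequencies in this table are consistent with counting whitespace-separated tokens: 경기도 appears 11 times at 10.0%, which implies about 110 tokens, and 25 addresses containing 85 spaces in total also give 110 tokens. The sketch below reproduces that style of count on the five sample rows only. The function name `token_counts` and the plain `str.split` tokenization are assumptions for illustration, not a description of the profiling tool's internals.

```python
from collections import Counter


def token_counts(addresses):
    """Count whitespace-separated tokens across all addresses."""
    return Counter(tok for addr in addresses for tok in addr.split())


# The five 지번주소 sample rows shown in this report.
sample = [
    "경기도 가평군 가평읍 달전리 567",
    "구리시 갈매동 502-39",
    "강원특별자치도 춘천시 남산면 방곡리 409",
    "서울특별시 노원구 월계동 85",
    "강원특별자치도 춘천시 남산면 백양리 588-30",
]

counts = token_counts(sample)
total = sum(counts.values())

# Print each of the three most common tokens with its share of all tokens.
for token, n in counts.most_common(3):
    print(token, n, f"{n / total:.1%}")
```

Because only five of the 25 rows are used here, the shares differ from the full-column table above.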

Most occurring characters

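Value | Count | Frequency (%)
(space) | 85 | 16.8%
(not shown) | 21 | 4.1%
- | 19 | 3.7%
(not shown) | 18 | 3.6%
2 | 17 | 3.4%
1 | 16 | 3.2%
(not shown) | 16 | 3.2%
0 | 14 | 2.8%
(not shown) | 13 | 2.6%
(not shown) | 13 | 2.6%
Other values (66) | 275 | 54.2%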

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 300 | 59.2%
Decimal Number | 103 | 20.3%
Space Separator | 85 | 16.8%
Dash Punctuation | 19 | 3.7%

Most frequent character per category

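Other Letter

Value | Count | Frequency (%)
(not shown) | 21 | 7.0%
(not shown) | 18 | 6.0%
(not shown) | 16 | 5.3%
(not shown) | 13 | 4.3%
(not shown) | 13 | 4.3%
(not shown) | 12 | 4.0%
(not shown) | 12 | 4.0%
(not shown) | 11 | 3.7%
(not shown) | 10 | 3.3%
(not shown) | 10 | 3.3%
Other values (54) | 164 | 54.7%

Decimal Number

Value | Count | Frequency (%)
2 | 17 | 16.5%
1 | 16 | 15.5%
0 | 14 | 13.6%
3 | 12 | 11.7%
6 | 9 | 8.7%
5 | 9 | 8.7%
8 | 8 | 7.8%
7 | 7 | 6.8%
4 | 6 | 5.8%
9 | 5 | 4.9%

Space Separator

Value | Count | Frequency (%)
(space) | 85 | 100.0%

Dash Punctuation

Value | Count | Frequency (%)
- | 19 | 100.0%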

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 300 | 59.2%
Common | 207 | 40.8%

Most frequent character per script

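Hangul

Value | Count | Frequency (%)
(not shown) | 21 | 7.0%
(not shown) | 18 | 6.0%
(not shown) | 16 | 5.3%
(not shown) | 13 | 4.3%
(not shown) | 13 | 4.3%
(not shown) | 12 | 4.0%
(not shown) | 12 | 4.0%
(not shown) | 11 | 3.7%
(not shown) | 10 | 3.3%
(not shown) | 10 | 3.3%
Other values (54) | 164 | 54.7%

Common

Value | Count | Frequency (%)
(space) | 85 | 41.1%
- | 19 | 9.2%
2 | 17 | 8.2%
1 | 16 | 7.7%
0 | 14 | 6.8%
3 | 12 | 5.8%
6 | 9 | 4.3%
5 | 9 | 4.3%
8 | 8 | 3.9%
7 | 7 | 3.4%
Other values (2) | 11 | 5.3%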

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 300 | 59.2%
ASCII | 207 | 40.8%

Most frequent character per block

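ASCII

Value | Count | Frequency (%)
(space) | 85 | 41.1%
- | 19 | 9.2%
2 | 17 | 8.2%
1 | 16 | 7.7%
0 | 14 | 6.8%
3 | 12 | 5.8%
6 | 9 | 4.3%
5 | 9 | 4.3%
8 | 8 | 3.9%
7 | 7 | 3.4%
Other values (2) | 11 | 5.3%

Hangul

Value | Count | Frequency (%)
(not shown) | 21 | 7.0%
(not shown) | 18 | 6.0%
(not shown) | 16 | 5.3%
(not shown) | 13 | 4.3%
(not shown) | 13 | 4.3%
(not shown) | 12 | 4.0%
(not shown) | 12 | 4.0%
(not shown) | 11 | 3.7%
(not shown) | 10 | 3.3%
(not shown) | 10 | 3.3%
Other values (54) | 164 | 54.7%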

도로명주소
Text

UNIQUE 

Distinct: 25
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 332.0 B

Length

Max length: 29
Median length: 25
Mean length: 20.64
Min length: 12

Characters and Unicode

Total characters: 516
Distinct characters: 82
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 25
Unique (%): 100.0%

Sample

1st row: 경기도 가평군 가평읍 문화로 13-42
2nd row: 구리시 경춘북로 229
3rd row: 강원특별자치도 춘천시 남산면 강촌로 150
4th row: 서울특별시 노원구 석계로 98-2
5th row: 강원특별자치도 춘천시 남산면 서백길 192

Value | Count | Frequency (%)
경기도 | 11 | 9.9%
남양주시 | 7 | 6.3%
서울특별시 | 7 | 6.3%
춘천시 | 6 | 5.4%
강원특별자치도 | 5 | 4.5%
가평군 | 4 | 3.6%
중랑구 | 4 | 3.6%
경춘북로 | 3 | 2.7%
남산면 | 3 | 2.7%
청평면 | 3 | 2.7%
Other values (55) | 58 | 52.3%

Most occurring characters

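Value | Count | Frequency (%)
(space) | 87 | 16.9%
(not shown) | 23 | 4.5%
(not shown) | 21 | 4.1%
(not shown) | 18 | 3.5%
(not shown) | 16 | 3.1%
1 | 14 | 2.7%
2 | 13 | 2.5%
(not shown) | 13 | 2.5%
(not shown) | 12 | 2.3%
(not shown) | 12 | 2.3%
Other values (72) | 287 | 55.6%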

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 333 | 64.5%
Space Separator | 87 | 16.9%
Decimal Number | 83 | 16.1%
Dash Punctuation | 5 | 1.0%
Close Punctuation | 4 | 0.8%
Open Punctuation | 4 | 0.8%

Most frequent character per category

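Other Letter

Value | Count | Frequency (%)
(not shown) | 23 | 6.9%
(not shown) | 21 | 6.3%
(not shown) | 18 | 5.4%
(not shown) | 16 | 4.8%
(not shown) | 13 | 3.9%
(not shown) | 12 | 3.6%
(not shown) | 12 | 3.6%
(not shown) | 11 | 3.3%
(not shown) | 11 | 3.3%
(not shown) | 10 | 3.0%
Other values (58) | 186 | 55.9%

Decimal Number

Value | Count | Frequency (%)
1 | 14 | 16.9%
2 | 13 | 15.7%
9 | 11 | 13.3%
5 | 11 | 13.3%
3 | 8 | 9.6%
7 | 7 | 8.4%
0 | 6 | 7.2%
4 | 5 | 6.0%
6 | 4 | 4.8%
8 | 4 | 4.8%

Space Separator

Value | Count | Frequency (%)
(space) | 87 | 100.0%

Dash Punctuation

Value | Count | Frequency (%)
- | 5 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 4 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 4 | 100.0%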

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 333 | 64.5%
Common | 183 | 35.5%

Most frequent character per script

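Hangul

Value | Count | Frequency (%)
(not shown) | 23 | 6.9%
(not shown) | 21 | 6.3%
(not shown) | 18 | 5.4%
(not shown) | 16 | 4.8%
(not shown) | 13 | 3.9%
(not shown) | 12 | 3.6%
(not shown) | 12 | 3.6%
(not shown) | 11 | 3.3%
(not shown) | 11 | 3.3%
(not shown) | 10 | 3.0%
Other values (58) | 186 | 55.9%

Common

Value | Count | Frequency (%)
(space) | 87 | 47.5%
1 | 14 | 7.7%
2 | 13 | 7.1%
9 | 11 | 6.0%
5 | 11 | 6.0%
3 | 8 | 4.4%
7 | 7 | 3.8%
0 | 6 | 3.3%
4 | 5 | 2.7%
- | 5 | 2.7%
Other values (4) | 16 | 8.7%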

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 333 | 64.5%
ASCII | 183 | 35.5%

Most frequent character per block

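ASCII

Value | Count | Frequency (%)
(space) | 87 | 47.5%
1 | 14 | 7.7%
2 | 13 | 7.1%
9 | 11 | 6.0%
5 | 11 | 6.0%
3 | 8 | 4.4%
7 | 7 | 3.8%
0 | 6 | 3.3%
4 | 5 | 2.7%
- | 5 | 2.7%
Other values (4) | 16 | 8.7%

Hangul

Value | Count | Frequency (%)
(not shown) | 23 | 6.9%
(not shown) | 21 | 6.3%
(not shown) | 18 | 5.4%
(not shown) | 16 | 4.8%
(not shown) | 13 | 3.9%
(not shown) | 12 | 3.6%
(not shown) | 12 | 3.6%
(not shown) | 11 | 3.3%
(not shown) | 11 | 3.3%
(not shown) | 10 | 3.0%
Other values (58) | 186 | 55.9%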

Correlations

Variable | 역명 | 지번주소 | 도로명주소
역명 | 1.000 | 1.000 | 1.000
지번주소 | 1.000 | 1.000 | 1.000
도로명주소 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
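
For readers who want to check nullity without the plots, a per-column count of missing cells can be sketched in plain Python as below. The function name `nullity_by_column`, the choice to treat both `None` and the empty string as missing, and the three-column excerpt are assumptions for illustration only.

```python
def nullity_by_column(rows, columns):
    """Return {column: number of missing cells}, treating None and "" as missing."""
    return {
        col: sum(1 for row in rows if row.get(col) in (None, ""))
        for col in columns
    }


# A three-column excerpt of rows taken from this report's sample.
columns = ["철도운영기관명", "선명", "역명"]
rows = [
    {"철도운영기관명": "코레일", "선명": "경춘", "역명": "갈매"},
    {"철도운영기관명": "코레일", "선명": "경춘", "역명": "강촌"},
    {"철도운영기관명": "코레일", "선명": "경춘", "역명": "광운대"},
]

missing = nullity_by_column(rows, columns)
print(missing)
```

Every count is zero for this excerpt, in line with the report's figure of 0 missing cells.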

Sample

Row | 철도운영기관명 | 선명 | 역명 | 지번주소 | 도로명주소
0 | 코레일 | 경춘 | 가평(자라섬·남이섬) | 경기도 가평군 가평읍 달전리 567 | 경기도 가평군 가평읍 문화로 13-42
1 | 코레일 | 경춘 | 갈매 | 구리시 갈매동 502-39 | 구리시 경춘북로 229
2 | 코레일 | 경춘 | 강촌 | 강원특별자치도 춘천시 남산면 방곡리 409 | 강원특별자치도 춘천시 남산면 강촌로 150
3 | 코레일 | 경춘 | 광운대 | 서울특별시 노원구 월계동 85 | 서울특별시 노원구 석계로 98-2
4 | 코레일 | 경춘 | 굴봉산(제이드가든) | 강원특별자치도 춘천시 남산면 백양리 588-30 | 강원특별자치도 춘천시 남산면 서백길 192
5 | 코레일 | 경춘 | 금곡 | 경기도 남양주시 금곡동 404-276 | 경기도 남양주시 금곡로19번길 47(금곡동)
6 | 코레일 | 경춘 | 김유정 | 강원특별자치도 춘천시 신동면 증리 945-2 | 강원특별자치도 춘천시 신동면 김유정로 1435
7 | 코레일 | 경춘 | 남춘천(강원대) | 춘천시 퇴계동 633-2 | 춘천시 영서로 2260
8 | 코레일 | 경춘 | 대성리 | 경기도 가평군 청평면 대성리 393-3 | 경기도 가평군 청평면 경춘로88
9 | 코레일 | 경춘 | 마석 | 경기도 남양주시 화도읍 마석우리 222-2 | 경기도 남양주시 화도읍 마석중앙로 107

Row | 철도운영기관명 | 선명 | 역명 | 지번주소 | 도로명주소
15 | 코레일 | 경춘 | 상천(호명호수) | 경기도 가평군 청평면 상천리 1260-1 | 경기도 가평군 청평면 상천역로29
16 | 코레일 | 경춘 | 신내 | 서울특별시 중랑구 망우동 320-2 | 서울특별시 중랑구 신내역로 20
17 | 코레일 | 경춘 | 중랑 | 서울특별시 중랑구 중화동 73-7 | 서울특별시 중랑구 중랑역로 9
18 | 코레일 | 경춘 | 천마산 | 경기도 남양주시 화도읍 묵현리 320-11 | 경기도 남양주시 화도읍 묵현로 25번길 37(묵현리)
19 | 코레일 | 경춘 | 청량리 | 서울특별시 동대문구 전농동 588-1 | 서울특별시 동대문구 왕산로 214
20 | 코레일 | 경춘 | 청평 | 경기도 가평군 청평면 청평리 125-1 | 경기도 가평군 청평면 청평리 청평역로 97-33
21 | 코레일 | 경춘 | 춘천(한림대) | 강원특별자치도 춘천시 근화동 190 | 강원특별자치도 춘천시 공지로 591
22 | 코레일 | 경춘 | 퇴계원 | 경기도 남양주시 퇴계원면 퇴계원리 218-142 | 경기도 남양주시 퇴계원면 경춘북로 545
23 | 코레일 | 경춘 | 평내호평 | 경기도 남양주시 평내동 660 | 경기도 남양주시 경춘로 1375(평내동)
24 | 코레일 | 경춘 | 회기 | 서울특별시 동대문구 휘경동 317-101 | 서울특별시 동대문구 회기로 196