Overview

Dataset statistics

Number of variables5
Number of observations63
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory42.1 B

Variable types

Categorical2
Text3

Dataset

Description수인분당선에 포함 된 도시광역철도역들의 철도운영기관명, 선명, 역명, 지번주소, 도로명주소의 데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041116/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
선명 has constant value ""Constant
역명 has unique valuesUnique
지번주소 has unique valuesUnique
도로명주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:31:00.609797
Analysis finished2023-12-12 06:31:01.136072
Duration0.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size636.0 B
코레일
63 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row코레일
2nd row코레일
3rd row코레일
4th row코레일
5th row코레일

Common Values

ValueCountFrequency (%)
코레일 63
100.0%

Length

2023-12-12T15:31:01.243210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:31:01.397287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
코레일 63
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size636.0 B
수인분당
63 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수인분당
2nd row수인분당
3rd row수인분당
4th row수인분당
5th row수인분당

Common Values

ValueCountFrequency (%)
수인분당 63
100.0%

Length

2023-12-12T15:31:01.567978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:31:01.703172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수인분당 63
100.0%

역명
Text

UNIQUE 

Distinct63
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size636.0 B
2023-12-12T15:31:01.966412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length2
Mean length3.9047619
Min length2

Characters and Unicode

Total characters246
Distinct characters119
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique63 ?
Unique (%)100.0%

Sample

1st row가천대
2nd row강남구청
3rd row개포동
4th row고색
5th row고잔
ValueCountFrequency (%)
가천대 1
 
1.6%
신갈 1
 
1.6%
신포 1
 
1.6%
안산 1
 
1.6%
압구정로데오 1
 
1.6%
야목 1
 
1.6%
야탑 1
 
1.6%
어천 1
 
1.6%
연수 1
 
1.6%
영통(경희대 1
 
1.6%
Other values (53) 53
84.1%
2023-12-12T15:31:02.491596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 11
 
4.5%
( 11
 
4.5%
10
 
4.1%
7
 
2.8%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (109) 174
70.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 224
91.1%
Close Punctuation 11
 
4.5%
Open Punctuation 11
 
4.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
4.5%
7
 
3.1%
6
 
2.7%
6
 
2.7%
6
 
2.7%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.8%
Other values (107) 165
73.7%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 224
91.1%
Common 22
 
8.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
4.5%
7
 
3.1%
6
 
2.7%
6
 
2.7%
6
 
2.7%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.8%
Other values (107) 165
73.7%
Common
ValueCountFrequency (%)
) 11
50.0%
( 11
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 224
91.1%
ASCII 22
 
8.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 11
50.0%
( 11
50.0%
Hangul
ValueCountFrequency (%)
10
 
4.5%
7
 
3.1%
6
 
2.7%
6
 
2.7%
6
 
2.7%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.8%
Other values (107) 165
73.7%

지번주소
Text

UNIQUE 

Distinct63
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size636.0 B
2023-12-12T15:31:02.905372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length23
Mean length19.460317
Min length7

Characters and Unicode

Total characters1226
Distinct characters110
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique63 ?
Unique (%)100.0%

Sample

1st row경기도 성남시 수정구 태평동 7131
2nd row서울 강남구 삼성동 77-94
3rd row서울시 강남구 개포동 181-2
4th row경기도 수원시 권선구 고색동 377-2
5th row경기도 안산시 단원구 고잔동 453-67
ValueCountFrequency (%)
경기도 36
 
12.6%
강남구 10
 
3.5%
성남시 10
 
3.5%
서울특별시 10
 
3.5%
수원시 9
 
3.2%
안산시 7
 
2.5%
인천광역시 7
 
2.5%
분당구 7
 
2.5%
용인시 6
 
2.1%
단원구 5
 
1.8%
Other values (136) 178
62.5%
2023-12-12T15:31:03.457102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
222
 
18.1%
67
 
5.5%
65
 
5.3%
59
 
4.8%
1 58
 
4.7%
- 43
 
3.5%
41
 
3.3%
37
 
3.0%
2 37
 
3.0%
36
 
2.9%
Other values (100) 561
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 707
57.7%
Decimal Number 254
 
20.7%
Space Separator 222
 
18.1%
Dash Punctuation 43
 
3.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
67
 
9.5%
65
 
9.2%
59
 
8.3%
41
 
5.8%
37
 
5.2%
36
 
5.1%
25
 
3.5%
21
 
3.0%
19
 
2.7%
17
 
2.4%
Other values (88) 320
45.3%
Decimal Number
ValueCountFrequency (%)
1 58
22.8%
2 37
14.6%
7 30
11.8%
5 26
10.2%
6 24
9.4%
8 22
 
8.7%
3 19
 
7.5%
9 14
 
5.5%
4 14
 
5.5%
0 10
 
3.9%
Space Separator
ValueCountFrequency (%)
222
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 43
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 707
57.7%
Common 519
42.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
67
 
9.5%
65
 
9.2%
59
 
8.3%
41
 
5.8%
37
 
5.2%
36
 
5.1%
25
 
3.5%
21
 
3.0%
19
 
2.7%
17
 
2.4%
Other values (88) 320
45.3%
Common
ValueCountFrequency (%)
222
42.8%
1 58
 
11.2%
- 43
 
8.3%
2 37
 
7.1%
7 30
 
5.8%
5 26
 
5.0%
6 24
 
4.6%
8 22
 
4.2%
3 19
 
3.7%
9 14
 
2.7%
Other values (2) 24
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 707
57.7%
ASCII 519
42.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
222
42.8%
1 58
 
11.2%
- 43
 
8.3%
2 37
 
7.1%
7 30
 
5.8%
5 26
 
5.0%
6 24
 
4.6%
8 22
 
4.2%
3 19
 
3.7%
9 14
 
2.7%
Other values (2) 24
 
4.6%
Hangul
ValueCountFrequency (%)
67
 
9.5%
65
 
9.2%
59
 
8.3%
41
 
5.8%
37
 
5.2%
36
 
5.1%
25
 
3.5%
21
 
3.0%
19
 
2.7%
17
 
2.4%
Other values (88) 320
45.3%

도로명주소
Text

UNIQUE 

Distinct63
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size636.0 B
2023-12-12T15:31:03.802559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length29
Mean length20.349206
Min length15

Characters and Unicode

Total characters1282
Distinct characters128
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique63 ?
Unique (%)100.0%

Sample

1st row경기도 성남시 수정구 성남대로 1332
2nd row서울특별시 강남구 학동로 346
3rd row서울시 강남구 개포로 지하420
4th row경기도 수원시 권선구 매송고색로 지하 690
5th row경기도 안산시 단원구 중앙대로 784
ValueCountFrequency (%)
경기도 37
 
12.3%
성남대로 10
 
3.3%
서울특별시 10
 
3.3%
강남구 10
 
3.3%
성남시 10
 
3.3%
수원시 9
 
3.0%
지하 8
 
2.6%
분당구 7
 
2.3%
안산시 7
 
2.3%
인천광역시 6
 
2.0%
Other values (134) 188
62.3%
2023-12-12T15:31:04.342002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
249
 
19.4%
67
 
5.2%
62
 
4.8%
61
 
4.8%
43
 
3.4%
38
 
3.0%
38
 
3.0%
1 36
 
2.8%
35
 
2.7%
25
 
2.0%
Other values (118) 628
49.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 816
63.7%
Space Separator 249
 
19.4%
Decimal Number 203
 
15.8%
Open Punctuation 6
 
0.5%
Close Punctuation 6
 
0.5%
Dash Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
67
 
8.2%
62
 
7.6%
61
 
7.5%
43
 
5.3%
38
 
4.7%
38
 
4.7%
35
 
4.3%
25
 
3.1%
25
 
3.1%
19
 
2.3%
Other values (104) 403
49.4%
Decimal Number
ValueCountFrequency (%)
1 36
17.7%
2 24
11.8%
0 24
11.8%
4 23
11.3%
3 22
10.8%
5 20
9.9%
6 20
9.9%
7 14
 
6.9%
8 11
 
5.4%
9 9
 
4.4%
Space Separator
ValueCountFrequency (%)
249
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 816
63.7%
Common 466
36.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
67
 
8.2%
62
 
7.6%
61
 
7.5%
43
 
5.3%
38
 
4.7%
38
 
4.7%
35
 
4.3%
25
 
3.1%
25
 
3.1%
19
 
2.3%
Other values (104) 403
49.4%
Common
ValueCountFrequency (%)
249
53.4%
1 36
 
7.7%
2 24
 
5.2%
0 24
 
5.2%
4 23
 
4.9%
3 22
 
4.7%
5 20
 
4.3%
6 20
 
4.3%
7 14
 
3.0%
8 11
 
2.4%
Other values (4) 23
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 816
63.7%
ASCII 466
36.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
249
53.4%
1 36
 
7.7%
2 24
 
5.2%
0 24
 
5.2%
4 23
 
4.9%
3 22
 
4.7%
5 20
 
4.3%
6 20
 
4.3%
7 14
 
3.0%
8 11
 
2.4%
Other values (4) 23
 
4.9%
Hangul
ValueCountFrequency (%)
67
 
8.2%
62
 
7.6%
61
 
7.5%
43
 
5.3%
38
 
4.7%
38
 
4.7%
35
 
4.3%
25
 
3.1%
25
 
3.1%
19
 
2.3%
Other values (104) 403
49.4%

Correlations

2023-12-12T15:31:04.442796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명지번주소도로명주소
역명1.0001.0001.000
지번주소1.0001.0001.000
도로명주소1.0001.0001.000

Missing values

2023-12-12T15:31:00.943392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:31:01.066980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명지번주소도로명주소
0코레일수인분당가천대경기도 성남시 수정구 태평동 7131경기도 성남시 수정구 성남대로 1332
1코레일수인분당강남구청서울 강남구 삼성동 77-94서울특별시 강남구 학동로 346
2코레일수인분당개포동서울시 강남구 개포동 181-2서울시 강남구 개포로 지하420
3코레일수인분당고색경기도 수원시 권선구 고색동 377-2경기도 수원시 권선구 매송고색로 지하 690
4코레일수인분당고잔경기도 안산시 단원구 고잔동 453-67경기도 안산시 단원구 중앙대로 784
5코레일수인분당구룡서울특별시 강남구 개포동 175-3서울특별시 강남구 개포로 지하 403
6코레일수인분당구성경기도 용인시 기흥구 마북동460-3경기도 용인시 기흥구 용구대로 2403(마북동460-3)
7코레일수인분당기흥(백남준아트센터)경기도 용인시 기흥구 구갈동 227-25경기도 용인시 기흥구 중부대로 460
8코레일수인분당남동인더스파크인천광역시 남동구 고잔동 970-8인천광역시 남동구 은청로 17
9코레일수인분당달월경기도 시흥시 월곶동 662-3경기도 시흥시 서해안로 736번길 55
철도운영기관명선명역명지번주소도로명주소
53코레일수인분당정자경기도 성남시 분당구 정자동 95-1경기도 성남시 분당구 성남대로 333
54코레일수인분당죽전(단국대)경기도 용인시 수지구 죽전동 1286경기도 용인시 수지구 포은대로 530
55코레일수인분당중앙경기도 안산시 단원구 고잔동 167-378 중앙역경기도 안산시 단원구 중앙대로 918
56코레일수인분당청량리서울특별시 동대문구 전농동 588-1서울특별시 동대문구 왕산로 214
57코레일수인분당청명수원시 영통구 영통동 1055번지수원시 영통구 봉영로 1670
58코레일수인분당초지경기도 안산시 단원구 초지동 25-1경기도 안산시 단원구 중앙대로 620
59코레일수인분당태평경기도 성남시 수정구 수진동 4808경기도 성남시 수정구 성남대로 1229
60코레일수인분당한대앞경기도 안산시 상록구 이동 211-5경기도 안산시 상록구 충장로 337
61코레일수인분당한티서울특별시 강남구 대치동 1011-28서울특별시 강남구 선릉로 228 지하1층
62코레일수인분당호구포인천시 남동구 논현동 731-1인천광역시 남동구 호구포로 205