Overview

Dataset statistics

Number of variables: 5
Number of observations: 546
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 0.2%
Total size in memory: 22.0 KiB
Average record size in memory: 41.2 B

Variable types

Categorical: 3
Numeric: 1
Text: 1

Dataset

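Description: Data on the urban and metropolitan railway stations on Seoul Metropolitan Area Line 5, including the railway operator name (철도운영기관명), line name (선명), station name (역명), exit number (출구번호), major facilities by exit (출구별 주요시설명), and other data such as addresses.
Author: 국가철도공단 (Korea National Railway)
URL: https://www.data.go.kr/data/15073459/fileData.do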

Alerts

철도운영기관명 has constant value "서울교통공사" (Constant)
선명 has constant value "5호선" (Constant)
Dataset has 1 (0.2%) duplicate rows (Duplicates)
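
A minimal sketch of how alerts like the ones above could be derived, assuming a column counts as constant when it has exactly one distinct value and a row counts as duplicated when it occurs more than once. The rows are a small illustrative subset copied from this report's Sample and Duplicate rows sections, with the duplicated row listed twice. The helper names `constant_columns` and `duplicate_rows` are hypothetical and are not ydata-profiling functions.

```python
from collections import Counter

# Column names as they appear in the report's Sample section.
COLUMNS = ("철도운영기관명", "선명", "역명", "출구번호", "출구별 주요시설명")

# Illustrative subset of rows (not the full 546-row dataset); the last row
# is listed twice to mimic the duplicate the report flags.
rows = [
    ("서울교통공사", "5호선", "방화", 1, "치현초등학교"),
    ("서울교통공사", "5호선", "방화", 1, "방화우체국"),
    ("서울교통공사", "5호선", "상일동", 4, "고덕우체국"),
    ("서울교통공사", "5호선", "신정(은행정)", 3, "양목초등학교"),
    ("서울교통공사", "5호선", "신정(은행정)", 3, "양목초등학교"),
]


def constant_columns(rows, columns):
    """Return {column: value} for columns holding exactly one distinct value."""
    result = {}
    for i, name in enumerate(columns):
        distinct = {row[i] for row in rows}
        if len(distinct) == 1:
            result[name] = next(iter(distinct))
    return result


def duplicate_rows(rows):
    """Return {row: occurrences} for rows that occur more than once."""
    return {row: n for row, n in Counter(rows).items() if n > 1}


constants = constant_columns(rows, COLUMNS)
duplicates = duplicate_rows(rows)
print(constants)
print(duplicates)
```

On this subset the two operator and line columns come out as constant, and the 신정(은행정) row is reported with two occurrences, which is consistent with the alerts listed in the report.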

Reproduction

Analysis started: 2023-12-12 05:33:30.678078
Analysis finished: 2023-12-12 05:33:31.270133
Duration: 0.59 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

철도운영기관명
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB

서울교통공사  546

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서울교통공사
2nd row: 서울교통공사
3rd row: 서울교통공사
4th row: 서울교통공사
5th row: 서울교통공사

Common Values

Value  Count  Frequency (%)
서울교통공사  546  100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
서울교통공사  546  100.0%

선명
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB

5호선  546

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5호선
2nd row: 5호선
3rd row: 5호선
4th row: 5호선
5th row: 5호선

Common Values

Value  Count  Frequency (%)
5호선  546  100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
5호선  546  100.0%

역명
Categorical

Distinct: 36
Distinct (%): 6.6%
Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB

광화문(세종문화회관)  31
서대문  30
공덕  29
방화  25
화곡  22
Other values (31)  409

Length

Max length: 13
Median length: 11
Mean length: 4.3003663
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 방화
2nd row: 방화
3rd row: 방화
4th row: 방화
5th row: 방화

Common Values

Value  Count  Frequency (%)
광화문(세종문화회관)  31  5.7%
서대문  30  5.5%
공덕  29  5.3%
방화  25  4.6%
화곡  22  4.0%
천호(풍납토성)  21  3.8%
광나루(장신대)  20  3.7%
오목교(목동운동장앞)  20  3.7%
애오개  19  3.5%
여의나루  17  3.1%
Other values (26)  312  57.1%
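
As a rough illustration of how the frequency column and the "Other values" row of such a table can be computed, the sketch below takes the ten counts reported for 역명 together with the report's 546 observations. The function name `frequency_table` is hypothetical, and rounding to one decimal place is an assumption chosen to match the report's display.

```python
# Top-10 counts copied from the 역명 Common Values table in this report.
N_ROWS = 546  # "Number of observations" from the Overview
top_counts = {
    "광화문(세종문화회관)": 31,
    "서대문": 30,
    "공덕": 29,
    "방화": 25,
    "화곡": 22,
    "천호(풍납토성)": 21,
    "광나루(장신대)": 20,
    "오목교(목동운동장앞)": 20,
    "애오개": 19,
    "여의나루": 17,
}


def frequency_table(counts, n_rows):
    """Return (value, count, percent) rows plus a trailing 'Other values' row."""
    ordered = sorted(counts.items(), key=lambda kv: -kv[1])  # stable for ties
    table = [(value, n, round(100 * n / n_rows, 1)) for value, n in ordered]
    other = n_rows - sum(counts.values())  # rows not covered by the listed values
    table.append(("Other values", other, round(100 * other / n_rows, 1)))
    return table


table = frequency_table(top_counts, N_ROWS)
for value, n, pct in table:
    print(f"{value}  {n}  {pct}%")
```

The remainder row comes out as 312 observations, the same figure the report shows for the 26 stations outside the top ten.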

Length

Histogram of lengths of the category

출구번호
Real number (ℝ)

Distinct: 10
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.4450549
Minimum: 1
Maximum: 10
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
median: 3
Q3: 4
95-th percentile: 8
Maximum: 10
Range: 9
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 2.1223371
Coefficient of variation (CV): 0.61605321
Kurtosis: 0.13569001
Mean: 3.4450549
Median Absolute Deviation (MAD): 1
Skewness: 0.90874443
Sum: 1881
Variance: 4.504315
Monotonicity: Not monotonic
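
Several of these figures can be cross-checked from the value counts that the report lists for 출구번호. The sketch below is a standard-library reconstruction, not the profiler's own code. It assumes the variance uses the sample (n - 1) denominator and that quantiles use linear interpolation between neighbouring ranks, which is the default in NumPy and pandas. The `quantile` helper is hypothetical. Skewness and kurtosis are not reproduced here.

```python
import math
import statistics

# Value counts copied from the 출구번호 frequency table in this report.
exit_counts = {1: 104, 2: 110, 3: 115, 4: 82, 5: 44,
               6: 23, 7: 32, 8: 25, 9: 8, 10: 3}

# Expand the counts back into one sorted list of observations.
values = sorted(v for v, n in exit_counts.items() for _ in range(n))


def quantile(sorted_values, q):
    """Quantile with linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_values) - 1)
    lo, hi = math.floor(pos), math.ceil(pos)
    frac = pos - lo
    return sorted_values[lo] * (1 - frac) + sorted_values[hi] * frac


n = len(values)
total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                          # coefficient of variation
q1, median, q3 = (quantile(values, q) for q in (0.25, 0.5, 0.75))
iqr = q3 - q1                            # interquartile range

print(n, total, round(mean, 7), round(variance, 6), round(std, 7), round(cv, 8))
print(q1, median, q3, iqr)
```

Under these assumptions the recomputed sum, mean, variance, standard deviation, CV, quartiles, and IQR agree with the values reported above to the precision shown.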
Histogram with fixed size bins (bins=10)

Common Values

Value  Count  Frequency (%)
3  115  21.1%
2  110  20.1%
1  104  19.0%
4  82  15.0%
5  44  8.1%
7  32  5.9%
8  25  4.6%
6  23  4.2%
9  8  1.5%
10  3  0.5%

Minimum 10 values

Value  Count  Frequency (%)
1  104  19.0%
2  110  20.1%
3  115  21.1%
4  82  15.0%
5  44  8.1%
6  23  4.2%
7  32  5.9%
8  25  4.6%
9  8  1.5%
10  3  0.5%

Maximum 10 values

Value  Count  Frequency (%)
10  3  0.5%
9  8  1.5%
8  25  4.6%
7  32  5.9%
6  23  4.2%
5  44  8.1%
4  82  15.0%
3  115  21.1%
2  110  20.1%
1  104  19.0%

출구별 주요시설명
Text

Distinct: 506
Distinct (%): 92.7%
Missing: 0
Missing (%): 0.0%
Memory size: 4.4 KiB

Length

Max length: 16
Median length: 14
Mean length: 6.7930403
Min length: 2

Characters and Unicode

Total characters: 3709
Distinct characters: 302
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
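
To make the idea of per-character Unicode properties concrete, here is a small sketch that counts general categories with Python's standard `unicodedata` module. The three facility names are taken from this report's sample rows. The helper name `category_counts` is hypothetical, and normalising to NFC first is an assumption so that each Hangul syllable is counted as one character.

```python
import unicodedata
from collections import Counter

# Three facility names that appear in this report's sample rows.
samples = ["방화3파출소", "한국구화학교(우성원)", "국립국어연구원s"]


def category_counts(texts):
    """Count characters by Unicode general category (e.g. 'Lo', 'Nd')."""
    counts = Counter()
    for text in texts:
        # NFC keeps each Hangul syllable as a single precomposed code point.
        for ch in unicodedata.normalize("NFC", text):
            counts[unicodedata.category(ch)] += 1
    return counts


counts = category_counts(samples)
print(counts)
```

The two-letter codes map onto the category names used in the report: `Lo` is Other Letter, `Nd` is Decimal Number, `Ps` and `Pe` are Open and Close Punctuation, and `Ll` is Lowercase Letter.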

Unique

Unique: 470
Unique (%): 86.1%

Sample

1st row: 치현초등학교
2nd row: 방화3파출소
3rd row: 방화우체국
4th row: 방화소방파출소
5th row: 강서공업고등학교

Words

Value  Count  Frequency (%)
방면  11  1.8%
고등학교  5  0.8%
고교  5  0.8%
서울지방검찰청남부지청  3  0.5%
여자고등학교  3  0.5%
명덕외국어고등학교  3  0.5%
양목초등학교  3  0.5%
명덕고등학교  3  0.5%
마포대교  2  0.3%
명원초등학교  2  0.3%
Other values (528)  561  93.3%

Most occurring characters

Value  Count  Frequency (%)
157  4.2%
144  3.9%
133  3.6%
100  2.7%
89  2.4%
84  2.3%
78  2.1%
70  1.9%
67  1.8%
62  1.7%
Other values (292)  2725  73.5%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  3516  94.8%
Decimal Number  67  1.8%
Space Separator  55  1.5%
Other Punctuation  25  0.7%
Open Punctuation  15  0.4%
Close Punctuation  15  0.4%
Uppercase Letter  10  0.3%
Other Symbol  5  0.1%
Lowercase Letter  1  < 0.1%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
157  4.5%
144  4.1%
133  3.8%
100  2.8%
89  2.5%
84  2.4%
78  2.2%
70  2.0%
67  1.9%
62  1.8%
Other values (267)  2532  72.0%

Decimal Number
Value  Count  Frequency (%)
1  21  31.3%
2  16  23.9%
3  12  17.9%
5  6  9.0%
4  4  6.0%
8  3  4.5%
6  2  3.0%
7  2  3.0%
9  1  1.5%

Uppercase Letter
Value  Count  Frequency (%)
S  3  30.0%
B  2  20.0%
G  1  10.0%
L  1  10.0%
C  1  10.0%
I  1  10.0%
K  1  10.0%

Other Punctuation
Value  Count  Frequency (%)
/  22  88.0%
·  1  4.0%
.  1  4.0%
&  1  4.0%

Space Separator
Value  Count  Frequency (%)
55  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  15  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  15  100.0%

Other Symbol
Value  Count  Frequency (%)
5  100.0%

Lowercase Letter
Value  Count  Frequency (%)
s  1  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  3521  94.9%
Common  177  4.8%
Latin  11  0.3%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
157  4.5%
144  4.1%
133  3.8%
100  2.8%
89  2.5%
84  2.4%
78  2.2%
70  2.0%
67  1.9%
62  1.8%
Other values (268)  2537  72.1%

Common
Value  Count  Frequency (%)
55  31.1%
/  22  12.4%
1  21  11.9%
2  16  9.0%
(  15  8.5%
)  15  8.5%
3  12  6.8%
5  6  3.4%
4  4  2.3%
8  3  1.7%
Other values (6)  8  4.5%

Latin
Value  Count  Frequency (%)
S  3  27.3%
B  2  18.2%
G  1  9.1%
L  1  9.1%
C  1  9.1%
I  1  9.1%
K  1  9.1%
s  1  9.1%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  3516  94.8%
ASCII  187  5.0%
None  6  0.2%

Most frequent character per block

Hangul
Value  Count  Frequency (%)
157  4.5%
144  4.1%
133  3.8%
100  2.8%
89  2.5%
84  2.4%
78  2.2%
70  2.0%
67  1.9%
62  1.8%
Other values (267)  2532  72.0%

ASCII
Value  Count  Frequency (%)
55  29.4%
/  22  11.8%
1  21  11.2%
2  16  8.6%
(  15  8.0%
)  15  8.0%
3  12  6.4%
5  6  3.2%
4  4  2.1%
8  3  1.6%
Other values (13)  18  9.6%

None
Value  Count  Frequency (%)
5  83.3%
·  1  16.7%

Interactions

Correlations

          역명     출구번호
역명      1.000  0.513
출구번호  0.513  1.000

          출구번호  역명
출구번호  1.000     0.200
역명      0.200     1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

     철도운영기관명  선명  역명  출구번호  출구별 주요시설명
0  서울교통공사  5호선  방화  1  치현초등학교
1  서울교통공사  5호선  방화  1  방화3파출소
2  서울교통공사  5호선  방화  1  방화우체국
3  서울교통공사  5호선  방화  1  방화소방파출소
4  서울교통공사  5호선  방화  1  강서공업고등학교
5  서울교통공사  5호선  방화  1  국립국어연구원s
6  서울교통공사  5호선  방화  1  방화3동사무소
7  서울교통공사  5호선  방화  1  삼익아파트
8  서울교통공사  5호선  방화  1  삼환아파트
9  서울교통공사  5호선  방화  1  신안아파트

Last rows

     철도운영기관명  선명  역명  출구번호  출구별 주요시설명
536  서울교통공사  5호선  상일동  1  고덕2동사무소
537  서울교통공사  5호선  상일동  1  고덕초등학교
538  서울교통공사  5호선  상일동  1  고덕평생학습관
539  서울교통공사  5호선  상일동  1  강덕초등학교
540  서울교통공사  5호선  상일동  1  한국구화학교(우성원)
541  서울교통공사  5호선  상일동  1  고덕중학교
542  서울교통공사  5호선  상일동  2  광주축협(고덕지점)
543  서울교통공사  5호선  상일동  3  고일초등학교
544  서울교통공사  5호선  상일동  4  고덕우체국
545  서울교통공사  5호선  상일동  4  상일동사무소

Duplicate rows

Most frequently occurring

     철도운영기관명  선명  역명  출구번호  출구별 주요시설명  # duplicates
0  서울교통공사  5호선  신정(은행정)  3  양목초등학교  2