Overview

Dataset statistics

Number of variables5
Number of observations965
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows60
Duplicate rows (%)6.2%
Total size in memory38.8 KiB
Average record size in memory41.1 B

Variable types

Categorical3
Numeric1
Text1

Dataset

Description부산1호선에 포함된 도시광역철도역들의 철도운영기관명,선명,역명,출구번호,출구별 주요시설명 등의 데이터 입니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15068949/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
선명 has constant value ""Constant
Dataset has 60 (6.2%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 16:48:44.619599
Analysis finished2023-12-12 16:48:45.292495
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.7 KiB
부산교통공사
965 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산교통공사
2nd row부산교통공사
3rd row부산교통공사
4th row부산교통공사
5th row부산교통공사

Common Values

ValueCountFrequency (%)
부산교통공사 965
100.0%

Length

2023-12-13T01:48:45.370732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:48:45.470923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산교통공사 965
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.7 KiB
1호선
965 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1호선
2nd row1호선
3rd row1호선
4th row1호선
5th row1호선

Common Values

ValueCountFrequency (%)
1호선 965
100.0%

Length

2023-12-13T01:48:45.572633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:48:45.673787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1호선 965
100.0%

역명
Categorical

Distinct39
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size7.7 KiB
연산
178 
중앙
62 
하단(부산본병원)
 
43
자갈치
 
39
서면
 
38
Other values (34)
605 

Length

Max length16
Median length2
Mean length3.5606218
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row괴정
2nd row괴정
3rd row괴정
4th row괴정
5th row괴정

Common Values

ValueCountFrequency (%)
연산 178
 
18.4%
중앙 62
 
6.4%
하단(부산본병원) 43
 
4.5%
자갈치 39
 
4.0%
서면 38
 
3.9%
범일 35
 
3.6%
토성 33
 
3.4%
부전(부산시민공원·송상현광장) 33
 
3.4%
범내골 29
 
3.0%
시청(연제) 27
 
2.8%
Other values (29) 448
46.4%

Length

2023-12-13T01:48:45.815723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
연산 178
 
18.4%
중앙 62
 
6.4%
하단(부산본병원 43
 
4.5%
자갈치 39
 
4.0%
서면 38
 
3.9%
범일 35
 
3.6%
토성 33
 
3.4%
부전(부산시민공원·송상현광장 33
 
3.4%
범내골 29
 
3.0%
양정 27
 
2.8%
Other values (29) 448
46.4%

출구번호
Real number (ℝ)

Distinct17
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.4663212
Minimum1
Maximum17
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.6 KiB
2023-12-13T01:48:45.970996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median5
Q38
95-th percentile12.8
Maximum17
Range16
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.8290974
Coefficient of variation (CV)0.70048891
Kurtosis0.48412448
Mean5.4663212
Median Absolute Deviation (MAD)3
Skewness0.9648592
Sum5275
Variance14.661987
MonotonicityNot monotonic
2023-12-13T01:48:46.144771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
2 142
14.7%
1 128
13.3%
4 106
11.0%
3 95
9.8%
5 82
8.5%
6 80
8.3%
8 78
8.1%
7 77
8.0%
10 51
 
5.3%
12 40
 
4.1%
Other values (7) 86
8.9%
ValueCountFrequency (%)
1 128
13.3%
2 142
14.7%
3 95
9.8%
4 106
11.0%
5 82
8.5%
6 80
8.3%
7 77
8.0%
8 78
8.1%
9 24
 
2.5%
10 51
 
5.3%
ValueCountFrequency (%)
17 18
 
1.9%
16 5
 
0.5%
15 9
 
0.9%
14 10
 
1.0%
13 7
 
0.7%
12 40
4.1%
11 13
 
1.3%
10 51
5.3%
9 24
 
2.5%
8 78
8.1%
Distinct634
Distinct (%)65.8%
Missing1
Missing (%)0.1%
Memory size7.7 KiB
2023-12-13T01:48:46.430800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length17
Mean length6.8340249
Min length2

Characters and Unicode

Total characters6588
Distinct characters351
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique457 ?
Unique (%)47.4%

Sample

1st row괴정119안전센터
2nd row괴정1동주민센터
3rd row괴정1치안센터
4th row괴정학문외과
5th row사하초등학교
ValueCountFrequency (%)
부산은행 25
 
2.3%
외환은행 14
 
1.3%
국민건강보험공단 14
 
1.3%
연산동지점 13
 
1.2%
연산4동주민센터 8
 
0.7%
벨지움영사관 7
 
0.6%
대한웰니스병원 7
 
0.6%
부산경상대학교 7
 
0.6%
연일시장 7
 
0.6%
미소아동여성병원 7
 
0.6%
Other values (676) 994
90.1%
2023-12-13T01:48:46.910014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
295
 
4.5%
273
 
4.1%
244
 
3.7%
182
 
2.8%
169
 
2.6%
145
 
2.2%
145
 
2.2%
121
 
1.8%
121
 
1.8%
106
 
1.6%
Other values (341) 4787
72.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6194
94.0%
Space Separator 169
 
2.6%
Decimal Number 90
 
1.4%
Uppercase Letter 52
 
0.8%
Other Punctuation 26
 
0.4%
Lowercase Letter 23
 
0.3%
Close Punctuation 14
 
0.2%
Open Punctuation 14
 
0.2%
Other Symbol 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
295
 
4.8%
273
 
4.4%
244
 
3.9%
182
 
2.9%
145
 
2.3%
145
 
2.3%
121
 
2.0%
121
 
2.0%
106
 
1.7%
91
 
1.5%
Other values (302) 4471
72.2%
Uppercase Letter
ValueCountFrequency (%)
C 13
25.0%
S 9
17.3%
G 5
 
9.6%
V 4
 
7.7%
K 4
 
7.7%
F 4
 
7.7%
P 2
 
3.8%
B 2
 
3.8%
I 2
 
3.8%
D 1
 
1.9%
Other values (6) 6
11.5%
Lowercase Letter
ValueCountFrequency (%)
s 8
34.8%
m 4
17.4%
b 2
 
8.7%
o 2
 
8.7%
k 2
 
8.7%
e 1
 
4.3%
h 1
 
4.3%
t 1
 
4.3%
x 1
 
4.3%
c 1
 
4.3%
Decimal Number
ValueCountFrequency (%)
1 30
33.3%
2 19
21.1%
4 15
16.7%
3 10
 
11.1%
5 7
 
7.8%
9 6
 
6.7%
0 3
 
3.3%
Other Punctuation
ValueCountFrequency (%)
/ 24
92.3%
& 2
 
7.7%
Space Separator
ValueCountFrequency (%)
169
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6200
94.1%
Common 313
 
4.8%
Latin 75
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
295
 
4.8%
273
 
4.4%
244
 
3.9%
182
 
2.9%
145
 
2.3%
145
 
2.3%
121
 
2.0%
121
 
2.0%
106
 
1.7%
91
 
1.5%
Other values (303) 4477
72.2%
Latin
ValueCountFrequency (%)
C 13
17.3%
S 9
12.0%
s 8
 
10.7%
G 5
 
6.7%
V 4
 
5.3%
m 4
 
5.3%
K 4
 
5.3%
F 4
 
5.3%
b 2
 
2.7%
o 2
 
2.7%
Other values (16) 20
26.7%
Common
ValueCountFrequency (%)
169
54.0%
1 30
 
9.6%
/ 24
 
7.7%
2 19
 
6.1%
4 15
 
4.8%
) 14
 
4.5%
( 14
 
4.5%
3 10
 
3.2%
5 7
 
2.2%
9 6
 
1.9%
Other values (2) 5
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6194
94.0%
ASCII 388
 
5.9%
None 6
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
295
 
4.8%
273
 
4.4%
244
 
3.9%
182
 
2.9%
145
 
2.3%
145
 
2.3%
121
 
2.0%
121
 
2.0%
106
 
1.7%
91
 
1.5%
Other values (302) 4471
72.2%
ASCII
ValueCountFrequency (%)
169
43.6%
1 30
 
7.7%
/ 24
 
6.2%
2 19
 
4.9%
4 15
 
3.9%
) 14
 
3.6%
( 14
 
3.6%
C 13
 
3.4%
3 10
 
2.6%
S 9
 
2.3%
Other values (28) 71
18.3%
None
ValueCountFrequency (%)
6
100.0%

Interactions

2023-12-13T01:48:44.946837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:48:47.013693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명출구번호
역명1.0000.550
출구번호0.5501.000
2023-12-13T01:48:47.107806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출구번호역명
출구번호1.0000.219
역명0.2191.000

Missing values

2023-12-13T01:48:45.128901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:48:45.237906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명출구번호출구별 주요시설명
0부산교통공사1호선괴정1괴정119안전센터
1부산교통공사1호선괴정1괴정1동주민센터
2부산교통공사1호선괴정1괴정1치안센터
3부산교통공사1호선괴정1괴정학문외과
4부산교통공사1호선괴정1사하초등학교
5부산교통공사1호선괴정10괴정3동 주민센터
6부산교통공사1호선괴정10괴정3동주민센터
7부산교통공사1호선괴정10괴정시장
8부산교통공사1호선괴정10배주한피부과의원
9부산교통공사1호선괴정10하이투자증권 사하지점
철도운영기관명선명역명출구번호출구별 주요시설명
955부산교통공사1호선하단(부산본병원)7에덴공원
956부산교통공사1호선하단(부산본병원)7하단교차로
957부산교통공사1호선하단(부산본병원)7하단우체국
958부산교통공사1호선하단(부산본병원)8대신증권 사하지점
959부산교통공사1호선하단(부산본병원)8부산신용보증재단 서부산지점
960부산교통공사1호선하단(부산본병원)8하단교차로
961부산교통공사1호선하단(부산본병원)9건국중/고등학교
962부산교통공사1호선하단(부산본병원)9동아대학교
963부산교통공사1호선하단(부산본병원)9부산여자고등학교
964부산교통공사1호선하단(부산본병원)9적십자 하단 헌혈의집

Duplicate rows

Most frequently occurring

철도운영기관명선명역명출구번호출구별 주요시설명# duplicates
24부산교통공사1호선연산4서울ms치과3
34부산교통공사1호선연산8미소아동여성병원3
35부산교통공사1호선연산8부산경상대학교3
37부산교통공사1호선연산8연산중학교3
38부산교통공사1호선연산8연일시장3
43부산교통공사1호선연산10연산4동주민센터3
45부산교통공사1호선연산12CS연합치과3
46부산교통공사1호선연산12대한웰니스병원3
47부산교통공사1호선연산12동래봉생병원3
48부산교통공사1호선연산12벨지움영사관3