Overview

Dataset statistics

Number of variables3
Number of observations282
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.0 KiB
Average record size in memory25.5 B

Variable types

Categorical1
Numeric1
Text1

Dataset

Description대한민국 수도권 도시철도 구간의 급행열차 정보로 급행선명이나 급행구성의 순서를 담고 있으며 급행선에 포함되는 역명의 정보를 담고 있습니다.
Author국토교통부
URLhttps://www.data.go.kr/data/15122969/fileData.do

Reproduction

Analysis started2023-12-12 14:15:58.198834
Analysis finished2023-12-12 14:15:58.661457
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

급행명
Categorical

Distinct12
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
문산-지평
53 
소요산-인천
52 
당고개-오이도
42 
왕십리-수원
28 
용산-신창
23 
Other values (7)
84 

Length

Max length12
Median length10
Mean length6.4680851
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row용산-동인천(급행)
2nd row용산-동인천(급행)
3rd row용산-동인천(급행)
4th row용산-동인천(급행)
5th row용산-동인천(급행)

Common Values

ValueCountFrequency (%)
문산-지평 53
18.8%
소요산-인천 52
18.4%
당고개-오이도 42
14.9%
왕십리-수원 28
9.9%
용산-신창 23
8.2%
서울역-신창 21
 
7.4%
용산-동인천(급행) 16
 
5.7%
종합운동장-김포공항 12
 
4.3%
청량리-춘천 12
 
4.3%
문산-서울역 11
 
3.9%
Other values (2) 12
 
4.3%

Length

2023-12-12T23:15:58.739421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
문산-지평 53
18.8%
소요산-인천 52
18.4%
당고개-오이도 42
14.9%
왕십리-수원 28
9.9%
용산-신창 23
8.2%
서울역-신창 21
 
7.4%
용산-동인천(급행 16
 
5.7%
종합운동장-김포공항 12
 
4.3%
청량리-춘천 12
 
4.3%
문산-서울역 11
 
3.9%
Other values (2) 12
 
4.3%

급행구성순서
Real number (ℝ)

Distinct53
Distinct (%)18.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.851064
Minimum1
Maximum53
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2023-12-12T23:15:58.908496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q17
median14
Q326.75
95-th percentile45.95
Maximum53
Range52
Interquartile range (IQR)19.75

Descriptive statistics

Standard deviation13.855089
Coefficient of variation (CV)0.77614923
Kurtosis-0.37198675
Mean17.851064
Median Absolute Deviation (MAD)9
Skewness0.80938566
Sum5034
Variance191.9635
MonotonicityNot monotonic
2023-12-12T23:15:59.091888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 12
 
4.3%
3 12
 
4.3%
2 12
 
4.3%
4 11
 
3.9%
5 11
 
3.9%
6 11
 
3.9%
7 11
 
3.9%
8 11
 
3.9%
9 11
 
3.9%
10 10
 
3.5%
Other values (43) 170
60.3%
ValueCountFrequency (%)
1 12
4.3%
2 12
4.3%
3 12
4.3%
4 11
3.9%
5 11
3.9%
6 11
3.9%
7 11
3.9%
8 11
3.9%
9 11
3.9%
10 10
3.5%
ValueCountFrequency (%)
53 1
0.4%
52 2
0.7%
51 2
0.7%
50 2
0.7%
49 2
0.7%
48 2
0.7%
47 2
0.7%
46 2
0.7%
45 2
0.7%
44 2
0.7%

역명
Text

Distinct210
Distinct (%)74.5%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-12T23:15:59.375949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length2
Mean length3.3333333
Min length2

Characters and Unicode

Total characters940
Distinct characters207
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique162 ?
Unique (%)57.4%

Sample

1st row용산
2nd row노량진
3rd row대방
4th row신길
5th row영등포
ValueCountFrequency (%)
용산 5
 
1.8%
노량진 5
 
1.8%
서울역 5
 
1.8%
신도림 4
 
1.4%
구로 4
 
1.4%
영등포 4
 
1.4%
송내 3
 
1.1%
신길 3
 
1.1%
주안 3
 
1.1%
회기 3
 
1.1%
Other values (200) 243
86.2%
2023-12-12T23:15:59.906559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35
 
3.7%
( 31
 
3.3%
) 31
 
3.3%
25
 
2.7%
24
 
2.6%
23
 
2.4%
22
 
2.3%
21
 
2.2%
18
 
1.9%
17
 
1.8%
Other values (197) 693
73.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 873
92.9%
Open Punctuation 31
 
3.3%
Close Punctuation 31
 
3.3%
Decimal Number 4
 
0.4%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
4.0%
25
 
2.9%
24
 
2.7%
23
 
2.6%
22
 
2.5%
21
 
2.4%
18
 
2.1%
17
 
1.9%
17
 
1.9%
14
 
1.6%
Other values (190) 657
75.3%
Decimal Number
ValueCountFrequency (%)
3 1
25.0%
2 1
25.0%
5 1
25.0%
1 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Other Punctuation
ValueCountFrequency (%)
· 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 873
92.9%
Common 67
 
7.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
4.0%
25
 
2.9%
24
 
2.7%
23
 
2.6%
22
 
2.5%
21
 
2.4%
18
 
2.1%
17
 
1.9%
17
 
1.9%
14
 
1.6%
Other values (190) 657
75.3%
Common
ValueCountFrequency (%)
( 31
46.3%
) 31
46.3%
· 1
 
1.5%
3 1
 
1.5%
2 1
 
1.5%
5 1
 
1.5%
1 1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 873
92.9%
ASCII 66
 
7.0%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
35
 
4.0%
25
 
2.9%
24
 
2.7%
23
 
2.6%
22
 
2.5%
21
 
2.4%
18
 
2.1%
17
 
1.9%
17
 
1.9%
14
 
1.6%
Other values (190) 657
75.3%
ASCII
ValueCountFrequency (%)
( 31
47.0%
) 31
47.0%
3 1
 
1.5%
2 1
 
1.5%
5 1
 
1.5%
1 1
 
1.5%
None
ValueCountFrequency (%)
· 1
100.0%

Interactions

2023-12-12T23:15:58.391621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:16:00.023479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
급행명급행구성순서
급행명1.0000.381
급행구성순서0.3811.000
2023-12-12T23:16:00.490487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
급행구성순서급행명
급행구성순서1.0000.170
급행명0.1701.000

Missing values

2023-12-12T23:15:58.534098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:15:58.626056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

급행명급행구성순서역명
0용산-동인천(급행)1용산
1용산-동인천(급행)2노량진
2용산-동인천(급행)3대방
3용산-동인천(급행)4신길
4용산-동인천(급행)5영등포
5용산-동인천(급행)6신도림
6용산-동인천(급행)7구로
7용산-동인천(급행)8개봉
8용산-동인천(급행)9역곡
9용산-동인천(급행)10부천
급행명급행구성순서역명
272문산-서울역2금촌
273문산-서울역3운정
274문산-서울역4일산
275문산-서울역5백마
276문산-서울역6대곡
277문산-서울역7행신
278문산-서울역8디지털미디어시티
279문산-서울역9가좌
280문산-서울역10신촌
281문산-서울역11서울역