Overview

Dataset statistics

Number of variables: 6
Number of observations: 91
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.7 KiB
Average record size in memory: 52.5 B

Variable types

Categorical: 2
Text: 1
Numeric: 3

Dataset

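Description: Data on the urban and metropolitan railway stations managed by Daegu Transportation Corporation (대구교통공사): railway operator name (철도운영기관명), line name (선명), station name (역명), station order within the line (역구성순서), section distance in km (구간키로), and distance from the line's origin in km (기점키로).
Author: Korea National Railway (국가철도공단)
URL: https://www.data.go.kr/data/15041441/fileData.do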

Alerts

철도운영기관명 has constant value "대구교통공사" (Constant)
역구성순서 is highly overall correlated with 기점키로 (High correlation)
기점키로 is highly overall correlated with 역구성순서 (High correlation)
구간키로 has 3 (3.3%) zeros (Zeros)
기점키로 has 3 (3.3%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-12 17:51:22.631310
Analysis finished: 2023-12-12 17:51:24.319689
Duration: 1.69 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 860.0 B
Category counts: 대구교통공사 (91)

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대구교통공사
2nd row: 대구교통공사
3rd row: 대구교통공사
4th row: 대구교통공사
5th row: 대구교통공사

Common Values

Value | Count | Frequency (%)
대구교통공사 | 91 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

선명
Categorical

Distinct: 3
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 860.0 B
Category counts: 1호선 (32), 3호선 (30), 2호선 (29)

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1호선
2nd row: 1호선
3rd row: 1호선
4th row: 1호선
5th row: 1호선

Common Values

Value | Count | Frequency (%)
1호선 | 32 | 35.2%
3호선 | 30 | 33.0%
2호선 | 29 | 31.9%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

역명
Text

Distinct: 88
Distinct (%): 96.7%
Missing: 0
Missing (%): 0.0%
Memory size: 860.0 B

Length

Max length: 16
Median length: 2
Mean length: 3.956044
Min length: 2

Characters and Unicode

Total characters: 360
Distinct characters: 137
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
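
As a rough illustration of how such character properties can be read, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The four station names are copied from the sample rows of this report; the helper name `category_counts` is made up for this example, and this is not necessarily how the profiling tool computes its tables. The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Lu` is Uppercase Letter, `Ps` is Open Punctuation, and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter

# Station names copied from the sample rows shown in this report.
names = ["설화명곡", "화원", "대곡(정부대구청사)", "수성못(TBC)"]


def category_counts(values):
    """Count Unicode general categories (e.g. 'Lo', 'Ps') over all characters."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


counts = category_counts(names)
for code, n in counts.most_common():
    print(code, n)
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the category table for this column.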

Unique

Unique: 85
Unique (%): 93.4%

Sample

1st row: 설화명곡
2nd row: 화원
3rd row: 대곡(정부대구청사)
4th row: 진천
5th row: 월배

Value | Count | Frequency (%)
청라언덕 | 2 | 2.2%
반월당 | 2 | 2.2%
명덕(2.28민주운동기념회관 | 2 | 2.2%
매천 | 1 | 1.1%
사월 | 1 | 1.1%
칠곡운암 | 1 | 1.1%
동천 | 1 | 1.1%
팔거(국립농관원·통계청 | 1 | 1.1%
학정 | 1 | 1.1%
칠곡경대병원 | 1 | 1.1%
Other values (78) | 78 | 85.7%

Most occurring characters

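Value | Count | Frequency (%)
[missing] | 19 | 5.3%
( | 13 | 3.6%
) | 13 | 3.6%
[missing] | 11 | 3.1%
[missing] | 10 | 2.8%
[missing] | 8 | 2.2%
[missing] | 8 | 2.2%
[missing] | 8 | 2.2%
[missing] | 8 | 2.2%
[missing] | 7 | 1.9%
Other values (127) | 255 | 70.8%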

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 316 | 87.8%
Open Punctuation | 13 | 3.6%
Close Punctuation | 13 | 3.6%
Decimal Number | 6 | 1.7%
Other Punctuation | 6 | 1.7%
Uppercase Letter | 6 | 1.7%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[missing] | 19 | 6.0%
[missing] | 11 | 3.5%
[missing] | 10 | 3.2%
[missing] | 8 | 2.5%
[missing] | 8 | 2.5%
[missing] | 8 | 2.5%
[missing] | 8 | 2.5%
[missing] | 7 | 2.2%
[missing] | 7 | 2.2%
[missing] | 6 | 1.9%
Other values (116) | 224 | 70.9%

Uppercase Letter
Value | Count | Frequency (%)
B | 2 | 33.3%
C | 1 | 16.7%
T | 1 | 16.7%
K | 1 | 16.7%
S | 1 | 16.7%

Decimal Number
Value | Count | Frequency (%)
2 | 4 | 66.7%
8 | 2 | 33.3%

Other Punctuation
Value | Count | Frequency (%)
· | 4 | 66.7%
. | 2 | 33.3%

Open Punctuation
Value | Count | Frequency (%)
( | 13 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 13 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 316 | 87.8%
Common | 38 | 10.6%
Latin | 6 | 1.7%

Most frequent character per script

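Hangul
Value | Count | Frequency (%)
[missing] | 19 | 6.0%
[missing] | 11 | 3.5%
[missing] | 10 | 3.2%
[missing] | 8 | 2.5%
[missing] | 8 | 2.5%
[missing] | 8 | 2.5%
[missing] | 8 | 2.5%
[missing] | 7 | 2.2%
[missing] | 7 | 2.2%
[missing] | 6 | 1.9%
Other values (116) | 224 | 70.9%

Common
Value | Count | Frequency (%)
( | 13 | 34.2%
) | 13 | 34.2%
2 | 4 | 10.5%
· | 4 | 10.5%
8 | 2 | 5.3%
. | 2 | 5.3%

Latin
Value | Count | Frequency (%)
B | 2 | 33.3%
C | 1 | 16.7%
T | 1 | 16.7%
K | 1 | 16.7%
S | 1 | 16.7%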

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 316 | 87.8%
ASCII | 40 | 11.1%
None | 4 | 1.1%

Most frequent character per block

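Hangul
Value | Count | Frequency (%)
[missing] | 19 | 6.0%
[missing] | 11 | 3.5%
[missing] | 10 | 3.2%
[missing] | 8 | 2.5%
[missing] | 8 | 2.5%
[missing] | 8 | 2.5%
[missing] | 8 | 2.5%
[missing] | 7 | 2.2%
[missing] | 7 | 2.2%
[missing] | 6 | 1.9%
Other values (116) | 224 | 70.9%

ASCII
Value | Count | Frequency (%)
( | 13 | 32.5%
) | 13 | 32.5%
2 | 4 | 10.0%
B | 2 | 5.0%
8 | 2 | 5.0%
. | 2 | 5.0%
C | 1 | 2.5%
T | 1 | 2.5%
K | 1 | 2.5%
S | 1 | 2.5%

None
Value | Count | Frequency (%)
· | 4 | 100.0%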

역구성순서
Real number (ℝ)

HIGH CORRELATION 

Distinct: 32
Distinct (%): 35.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15.692308
Minimum: 1
Maximum: 32
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 951.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 8
Median: 16
Q3: 23
95-th percentile: 29
Maximum: 32
Range: 31
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 8.8452025
Coefficient of variation (CV): 0.56366486
Kurtosis: -1.1756804
Mean: 15.692308
Median Absolute Deviation (MAD): 8
Skewness: 0.018130435
Sum: 1428
Variance: 78.237607
Monotonicity: Not monotonic
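
Several of these figures can be cross-checked by hand. The sketch below rebuilds the 역구성순서 column from the per-line station counts reported for 선명 (32, 30 and 29), under the assumption that each line numbers its stations 1, 2, ..., n without gaps. That assumption is not stated in the report, but it is consistent with the value tables for this variable. The variance and standard deviation use the sample (n - 1) denominator, which is what the reported values match.

```python
import statistics

# Stations per line, from the 선명 value counts: 1호선 32, 3호선 30, 2호선 29.
stations_per_line = [32, 30, 29]

# Assumption: within each line the order runs 1, 2, ..., n with no gaps.
orders = [k for n in stations_per_line for k in range(1, n + 1)]

mean = statistics.mean(orders)
variance = statistics.variance(orders)  # sample variance (n - 1 denominator)
std = statistics.stdev(orders)          # sample standard deviation
median = statistics.median(orders)
mad = statistics.median(abs(x - median) for x in orders)  # median absolute deviation
cv = std / mean                         # coefficient of variation

print(len(orders), sum(orders))  # 91 1428
print(mean, variance, std, median, mad, cv)
```

Under that assumption the count, sum, mean, median, MAD, variance, standard deviation and CV all agree with the values listed above to the precision shown.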
[Figure: histogram with fixed size bins (bins=32)]

Common values

Value | Count | Frequency (%)
1 | 3 | 3.3%
16 | 3 | 3.3%
29 | 3 | 3.3%
28 | 3 | 3.3%
27 | 3 | 3.3%
26 | 3 | 3.3%
25 | 3 | 3.3%
24 | 3 | 3.3%
23 | 3 | 3.3%
22 | 3 | 3.3%
Other values (22) | 61 | 67.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 3 | 3.3%
2 | 3 | 3.3%
3 | 3 | 3.3%
4 | 3 | 3.3%
5 | 3 | 3.3%
6 | 3 | 3.3%
7 | 3 | 3.3%
8 | 3 | 3.3%
9 | 3 | 3.3%
10 | 3 | 3.3%

Maximum 10 values

Value | Count | Frequency (%)
32 | 1 | 1.1%
31 | 1 | 1.1%
30 | 2 | 2.2%
29 | 3 | 3.3%
28 | 3 | 3.3%
27 | 3 | 3.3%
26 | 3 | 3.3%
25 | 3 | 3.3%
24 | 3 | 3.3%
23 | 3 | 3.3%

구간키로
Real number (ℝ)

ZEROS 

Distinct: 15
Distinct (%): 16.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.91136264
Minimum: 0
Maximum: 2.9
Zeros: 3
Zeros (%): 3.3%
Negative: 0
Negative (%): 0.0%
Memory size: 951.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0.6
Q1: 0.8
Median: 0.9
Q3: 1
95-th percentile: 1.3
Maximum: 2.9
Range: 2.9
Interquartile range (IQR): 0.2

Descriptive statistics

Standard deviation: 0.3360021
Coefficient of variation (CV): 0.36868101
Kurtosis: 14.228711
Mean: 0.91136264
Median Absolute Deviation (MAD): 0.1
Skewness: 1.946132
Sum: 82.934
Variance: 0.11289741
Monotonicity: Not monotonic
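
Because 구간키로 has only 15 distinct values, the whole column can be rebuilt from the value tables for this variable: the ten smallest and ten largest values together cover all 15. The sketch below does that and recomputes the sum, mean, sample variance and share of zeros as a sanity check. The dictionary is transcribed from those tables, and the variable names are illustrative only.

```python
import math
import statistics

# (value: count) pairs merged from the report's value tables for 구간키로.
value_counts = {
    0.0: 3, 0.6: 6, 0.7: 12, 0.8: 14, 0.84: 1, 0.9: 22, 0.99: 1, 1.0: 11,
    1.004: 1, 1.1: 7, 1.2: 7, 1.3: 3, 1.4: 1, 1.8: 1, 2.9: 1,
}

# Expand the frequency table back into one value per observation.
values = [v for v, count in value_counts.items() for _ in range(count)]

n = len(values)
total = math.fsum(values)               # accurate floating-point sum
mean = total / n
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
zero_share = value_counts[0.0] / n

print(n, round(total, 3), round(zero_share * 100, 1))
print(mean, variance)
```

The three zeros are a small share of the column, and the single 2.9 km section is the most extreme value in the tables, which fits the strongly positive skewness and kurtosis reported above.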
[Figure: histogram with fixed size bins (bins=15)]

Common values

Value | Count | Frequency (%)
0.9 | 22 | 24.2%
0.8 | 14 | 15.4%
0.7 | 12 | 13.2%
1.0 | 11 | 12.1%
1.2 | 7 | 7.7%
1.1 | 7 | 7.7%
0.6 | 6 | 6.6%
0.0 | 3 | 3.3%
1.3 | 3 | 3.3%
0.99 | 1 | 1.1%
Other values (5) | 5 | 5.5%

Minimum 10 values

Value | Count | Frequency (%)
0.0 | 3 | 3.3%
0.6 | 6 | 6.6%
0.7 | 12 | 13.2%
0.8 | 14 | 15.4%
0.84 | 1 | 1.1%
0.9 | 22 | 24.2%
0.99 | 1 | 1.1%
1.0 | 11 | 12.1%
1.004 | 1 | 1.1%
1.1 | 7 | 7.7%

Maximum 10 values

Value | Count | Frequency (%)
2.9 | 1 | 1.1%
1.8 | 1 | 1.1%
1.4 | 1 | 1.1%
1.3 | 3 | 3.3%
1.2 | 7 | 7.7%
1.1 | 7 | 7.7%
1.004 | 1 | 1.1%
1.0 | 11 | 12.1%
0.99 | 1 | 1.1%
0.9 | 22 | 24.2%

기점키로
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 83
Distinct (%): 91.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.926154
Minimum: 0
Maximum: 30.9
Zeros: 3
Zeros (%): 3.3%
Negative: 0
Negative (%): 0.0%
Memory size: 951.0 B

Quantile statistics

Minimum: 0
5-th percentile: 1.4
Q1: 7.45
Median: 13.4
Q3: 20.1
95-th percentile: 27.945
Maximum: 30.9
Range: 30.9
Interquartile range (IQR): 12.65

Descriptive statistics

Standard deviation: 8.2708645
Coefficient of variation (CV): 0.59390874
Kurtosis: -0.9204211
Mean: 13.926154
Median Absolute Deviation (MAD): 6.5
Skewness: 0.17188553
Sum: 1267.28
Variance: 68.407199
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
0.0 | 3 | 3.3%
10.8 | 2 | 2.2%
15.9 | 2 | 2.2%
12.5 | 2 | 2.2%
10.0 | 2 | 2.2%
5.3 | 2 | 2.2%
11.7 | 2 | 2.2%
21.8 | 1 | 1.1%
2.3 | 1 | 1.1%
1.6 | 1 | 1.1%
Other values (73) | 73 | 80.2%

Minimum 10 values

Value | Count | Frequency (%)
0.0 | 3 | 3.3%
0.8 | 1 | 1.1%
1.2 | 1 | 1.1%
1.6 | 1 | 1.1%
2.3 | 1 | 1.1%
2.5 | 1 | 1.1%
2.9 | 1 | 1.1%
3.0 | 1 | 1.1%
3.5 | 1 | 1.1%
3.7 | 1 | 1.1%

Maximum 10 values

Value | Count | Frequency (%)
30.9 | 1 | 1.1%
29.8 | 1 | 1.1%
29.09 | 1 | 1.1%
28.7 | 1 | 1.1%
28.19 | 1 | 1.1%
27.7 | 1 | 1.1%
27.2 | 1 | 1.1%
26.9 | 1 | 1.1%
26.2 | 1 | 1.1%
25.8 | 1 | 1.1%

Interactions

[Figures: nine pairwise interaction plots of the numeric variables]

Correlations

[Figure: correlation heatmap]

 | 선명 | 역명 | 역구성순서 | 구간키로 | 기점키로
선명 | 1.000 | 0.000 | 0.000 | 0.506 | 0.000
역명 | 0.000 | 1.000 | 0.937 | 0.982 | 0.987
역구성순서 | 0.000 | 0.937 | 1.000 | 0.330 | 0.879
구간키로 | 0.506 | 0.982 | 0.330 | 1.000 | 0.446
기점키로 | 0.000 | 0.987 | 0.879 | 0.446 | 1.000

[Figure: correlation heatmap]

 | 역구성순서 | 구간키로 | 기점키로 | 선명
역구성순서 | 1.000 | 0.219 | 0.962 | 0.000
구간키로 | 0.219 | 1.000 | 0.332 | 0.236
기점키로 | 0.962 | 0.332 | 1.000 | 0.000
선명 | 0.000 | 0.236 | 0.000 | 1.000
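
To give a feel for why 역구성순서 and 기점키로 are flagged as highly correlated, the sketch below computes a plain Pearson coefficient on the first ten sample rows of the dataset (all on 1호선). It does not reproduce either matrix above: the report does not say here which correlation measure each matrix uses, and ten rows of a single line are only a small subset of the 91 observations. The function name `pearson` is illustrative.

```python
import math

# First ten sample rows of the dataset (all on 1호선).
order = list(range(1, 11))                                      # 역구성순서
origin_km = [0.0, 1.2, 2.5, 3.5, 4.3, 5.0, 5.9, 6.9, 7.7, 8.5]  # 기점키로


def pearson(xs, ys):
    """Plain Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


r = pearson(order, origin_km)
print(round(r, 3))
```

Within a single line the distance from the origin grows with every station, so the coefficient on such a subset is close to 1. Pooling three lines of different lengths is a plausible reason the dataset-wide values are somewhat lower.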

Missing values

[Figure: nullity by column]
A simple visualization of nullity by column.
[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

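First rows

 | 철도운영기관명 | 선명 | 역명 | 역구성순서 | 구간키로 | 기점키로
0 | 대구교통공사 | 1호선 | 설화명곡 | 1 | 0.0 | 0.0
1 | 대구교통공사 | 1호선 | 화원 | 2 | 1.2 | 1.2
2 | 대구교통공사 | 1호선 | 대곡(정부대구청사) | 3 | 1.3 | 2.5
3 | 대구교통공사 | 1호선 | 진천 | 4 | 1.0 | 3.5
4 | 대구교통공사 | 1호선 | 월배 | 5 | 0.8 | 4.3
5 | 대구교통공사 | 1호선 | 상인 | 6 | 0.7 | 5.0
6 | 대구교통공사 | 1호선 | 월촌 | 7 | 0.9 | 5.9
7 | 대구교통공사 | 1호선 | 송현 | 8 | 1.0 | 6.9
8 | 대구교통공사 | 1호선 | 서부정류장(관문시장) | 9 | 0.8 | 7.7
9 | 대구교통공사 | 1호선 | 대명 | 10 | 0.8 | 8.5

Last rows

 | 철도운영기관명 | 선명 | 역명 | 역구성순서 | 구간키로 | 기점키로
81 | 대구교통공사 | 3호선 | 건들바위 | 21 | 0.9 | 15.3
82 | 대구교통공사 | 3호선 | 대봉교 | 22 | 0.6 | 15.9
83 | 대구교통공사 | 3호선 | 수성시장 | 23 | 0.9 | 16.8
84 | 대구교통공사 | 3호선 | 수성구민운동장 | 24 | 1.0 | 17.8
85 | 대구교통공사 | 3호선 | 어린이회관 | 25 | 0.8 | 18.6
86 | 대구교통공사 | 3호선 | 황금 | 26 | 0.7 | 19.3
87 | 대구교통공사 | 3호선 | 수성못(TBC) | 27 | 0.9 | 20.2
88 | 대구교통공사 | 3호선 | 지산 | 28 | 1.1 | 21.3
89 | 대구교통공사 | 3호선 | 범물 | 29 | 0.9 | 22.2
90 | 대구교통공사 | 3호선 | 용지 | 30 | 0.7 | 22.9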
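
In the rows shown here, 기점키로 behaves like a running total of 구간키로 along each line: each station's distance from the origin equals the previous station's distance plus the current section length. The sketch below checks that relationship on the twenty sample rows. Whether it holds for all 91 rows cannot be confirmed from this report, and the helper name `steps_match_sections` is made up for illustration.

```python
# 구간키로 and 기점키로 for the first ten sample rows (1호선).
head_section = [0.0, 1.2, 1.3, 1.0, 0.8, 0.7, 0.9, 1.0, 0.8, 0.8]
head_origin = [0.0, 1.2, 2.5, 3.5, 4.3, 5.0, 5.9, 6.9, 7.7, 8.5]

# The same two columns for the last ten sample rows (3호선).
tail_section = [0.9, 0.6, 0.9, 1.0, 0.8, 0.7, 0.9, 1.1, 0.9, 0.7]
tail_origin = [15.3, 15.9, 16.8, 17.8, 18.6, 19.3, 20.2, 21.3, 22.2, 22.9]


def steps_match_sections(sections, origins, tol=1e-9):
    """True if every step in 기점키로 equals that row's 구간키로 (within tol)."""
    return all(
        abs((curr - prev) - section) <= tol
        for prev, curr, section in zip(origins, origins[1:], sections[1:])
    )


print(steps_match_sections(head_section, head_origin))  # True
print(steps_match_sections(tail_section, tail_origin))  # True
```

A tolerance is used instead of exact equality because differences of decimal values such as 15.9 - 15.3 are not exactly representable in binary floating point.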