Overview

Dataset statistics

Number of variables9
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory820.3 KiB
Average record size in memory84.0 B

Variable types

Categorical7
DateTime1
Numeric1

Dataset

Description광주교통공사의 열차 시간표 데이터로, 요일, 종착역 코드, 방향(상/하행), 도착시간, 역사코드, 기준일자, 호선, 종착역명, 역사명을 포함하는 데이터
Author광주교통공사
URLhttps://www.data.go.kr/data/15111497/fileData.do

Alerts

기준일자 has constant value ""Constant
호선 has constant value ""Constant
종착역명 is highly overall correlated with 종착역 코드 and 1 other fieldsHigh correlation
종착역 코드 is highly overall correlated with 방향(상_하행) and 1 other fieldsHigh correlation
방향(상_하행) is highly overall correlated with 종착역 코드 and 1 other fieldsHigh correlation
역사코드 is highly overall correlated with 역사명High correlation
역사명 is highly overall correlated with 역사코드High correlation

Reproduction

Analysis started2023-12-12 04:32:17.978930
Analysis finished2023-12-12 04:32:19.472181
Duration1.49 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

요일
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
평일
2900 
토요일
2522 
휴일
2511 
명절
1983 
평일
 
84

Length

Max length3
Median length2
Mean length2.2606
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row평일
2nd row명절
3rd row토요일
4th row토요일
5th row토요일

Common Values

ValueCountFrequency (%)
평일 2900
29.0%
토요일 2522
25.2%
휴일 2511
25.1%
명절 1983
19.8%
평일 84
 
0.8%

Length

2023-12-12T13:32:19.543217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:32:19.644533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
평일 2984
29.8%
토요일 2522
25.2%
휴일 2511
25.1%
명절 1983
19.8%

종착역 코드
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
119
4954 
101
4038 
100
1008 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row119
2nd row119
3rd row101
4th row100
5th row100

Common Values

ValueCountFrequency (%)
119 4954
49.5%
101 4038
40.4%
100 1008
 
10.1%

Length

2023-12-12T13:32:19.785270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:32:19.933013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
119 4954
49.5%
101 4038
40.4%
100 1008
 
10.1%

방향(상_하행)
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2
5046 
1
4954 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 5046
50.5%
1 4954
49.5%

Length

2023-12-12T13:32:20.067904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:32:20.187706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 5046
50.5%
1 4954
49.5%
Distinct1106
Distinct (%)11.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-12-12 05:25:00
Maximum2023-12-12 23:58:00
2023-12-12T13:32:20.346559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:32:20.514753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

역사코드
Real number (ℝ)

HIGH CORRELATION 

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean109.9617
Minimum100
Maximum119
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T13:32:20.679593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum100
5-th percentile102
Q1105
median110
Q3115
95-th percentile118
Maximum119
Range19
Interquartile range (IQR)10

Descriptive statistics

Standard deviation5.28377
Coefficient of variation (CV)0.048051003
Kurtosis-1.1775411
Mean109.9617
Median Absolute Deviation (MAD)5
Skewness-0.015875587
Sum1099617
Variance27.918225
MonotonicityNot monotonic
2023-12-12T13:32:20.828392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
117 570
 
5.7%
118 568
 
5.7%
109 562
 
5.6%
105 560
 
5.6%
113 557
 
5.6%
103 557
 
5.6%
114 555
 
5.5%
116 550
 
5.5%
112 548
 
5.5%
110 548
 
5.5%
Other values (10) 4425
44.2%
ValueCountFrequency (%)
100 57
 
0.6%
101 323
3.2%
102 521
5.2%
103 557
5.6%
104 542
5.4%
105 560
5.6%
106 523
5.2%
107 546
5.5%
108 547
5.5%
109 562
5.6%
ValueCountFrequency (%)
119 285
2.9%
118 568
5.7%
117 570
5.7%
116 550
5.5%
115 533
5.3%
114 555
5.5%
113 557
5.6%
112 548
5.5%
111 548
5.5%
110 548
5.5%

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022-12-30
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-12-30
2nd row2022-12-30
3rd row2022-12-30
4th row2022-12-30
5th row2022-12-30

Common Values

ValueCountFrequency (%)
2022-12-30 10000
100.0%

Length

2023-12-12T13:32:20.975514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:32:21.078942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-12-30 10000
100.0%

호선
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 10000
100.0%

Length

2023-12-12T13:32:21.196548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:32:21.305987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 10000
100.0%

종착역명
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
평동역
4954 
소태역
4038 
녹동역
1008 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row평동역
2nd row평동역
3rd row소태역
4th row녹동역
5th row녹동역

Common Values

ValueCountFrequency (%)
평동역 4954
49.5%
소태역 4038
40.4%
녹동역 1008
 
10.1%

Length

2023-12-12T13:32:21.398405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:32:21.509886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
평동역 4954
49.5%
소태역 4038
40.4%
녹동역 1008
 
10.1%

역사명
Categorical

HIGH CORRELATION 

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
광주송정역
 
571
도산역
 
565
농성역
 
559
금남로4가역
 
557
상무역
 
557
Other values (15)
7191 

Length

Max length9
Median length8
Mean length4.4793
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row문화전당역
2nd row금남로4가역
3rd row도산역
4th row문화전당역
5th row공항역

Common Values

ValueCountFrequency (%)
광주송정역 571
 
5.7%
도산역 565
 
5.7%
농성역 559
 
5.6%
금남로4가역 557
 
5.6%
상무역 557
 
5.6%
김대중컨벤션센터역 555
 
5.5%
남광주역 555
 
5.5%
운천역 550
 
5.5%
양동시장역 549
 
5.5%
송정공원역 549
 
5.5%
Other values (10) 4433
44.3%

Length

2023-12-12T13:32:21.666389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
광주송정역 571
 
5.7%
도산역 565
 
5.7%
농성역 559
 
5.6%
금남로4가역 557
 
5.6%
상무역 557
 
5.6%
김대중컨벤션센터역 555
 
5.5%
남광주역 555
 
5.5%
운천역 550
 
5.5%
양동시장역 549
 
5.5%
송정공원역 549
 
5.5%
Other values (10) 4433
44.3%

Interactions

2023-12-12T13:32:19.054038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:32:21.785798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
요일종착역 코드방향(상_하행)역사코드종착역명역사명
요일1.0000.0900.0750.3020.0900.415
종착역 코드0.0901.0001.0000.2091.0000.296
방향(상_하행)0.0751.0001.0000.2141.0000.275
역사코드0.3020.2090.2141.0000.2091.000
종착역명0.0901.0001.0000.2091.0000.296
역사명0.4150.2960.2751.0000.2961.000
2023-12-12T13:32:21.945014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종착역명종착역 코드역사명요일방향(상_하행)
종착역명1.0001.0000.1630.0671.000
종착역 코드1.0001.0000.1630.0671.000
역사명0.1630.1631.0000.1910.217
요일0.0670.0670.1911.0000.092
방향(상_하행)1.0001.0000.2170.0921.000
2023-12-12T13:32:22.110611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역사코드요일종착역 코드방향(상_하행)종착역명역사명
역사코드1.0000.1300.1270.1640.1270.989
요일0.1301.0000.0670.0920.0670.191
종착역 코드0.1270.0671.0001.0001.0000.163
방향(상_하행)0.1640.0921.0001.0001.0000.217
종착역명0.1270.0671.0001.0001.0000.163
역사명0.9890.1910.1630.2170.1631.000

Missing values

2023-12-12T13:32:19.243912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:32:19.412538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

요일종착역 코드방향(상_하행)도착시간역사코드기준일자호선종착역명역사명
7632평일119108:251042022-12-301평동역문화전당역
14385명절119122:331052022-12-301평동역금남로4가역
105토요일101206:031182022-12-301소태역도산역
1578토요일100212:001042022-12-301녹동역문화전당역
457토요일100213:401152022-12-301녹동역공항역
1940토요일119117:501182022-12-301평동역도산역
13654명절119122:171142022-12-301평동역김대중컨벤션센터역
2527토요일119112:281122022-12-301평동역운천역
245토요일100212:361172022-12-301녹동역광주송정역
7856평일119107:001022022-12-301평동역학동증심사입구역
요일종착역 코드방향(상_하행)도착시간역사코드기준일자호선종착역명역사명
8379휴일100217:061172022-12-301녹동역광주송정역
1241토요일101207:051072022-12-301소태역양동시장역
1431토요일101221:471062022-12-301소태역금남로5가역
9083휴일100216:501102022-12-301녹동역화정역
11044휴일119120:411082022-12-301평동역돌고개역
9157휴일101212:211092022-12-301소태역농성역
5786평일101206:411022022-12-301소태역학동증심사입구역
12037명절101208:051162022-12-301소태역송정공원역
4827평일100206:381102022-12-301녹동역화정역
14115명절119116:361082022-12-301평동역돌고개역