Overview

Dataset statistics

Number of variables9
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory810.5 KiB
Average record size in memory83.0 B

Variable types

Numeric3
Text2
Categorical2
DateTime2

Dataset

Description대구광역시_시내버스 정류소별_노선별_평균배차간격_20221108
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15060936&dataSetDetailId=150609361a8078f73b266&provdMethod=FILE

Alerts

운행회수 is highly overall correlated with 평균배차시간High correlation
평균배차시간 is highly overall correlated with 운행회수High correlation

Reproduction

Analysis started2024-04-21 08:04:09.129591
Analysis finished2024-04-21 08:04:13.489853
Duration4.36 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

정류소 아이디
Real number (ℝ)

Distinct2924
Distinct (%)29.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.0171958 × 109
Minimum3.5700011 × 108
Maximum7.701001 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T17:04:13.695924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.5700011 × 108
5-th percentile7.0010081 × 109
Q17.0210117 × 109
median7.041008 × 109
Q37.0610173 × 109
95-th percentile7.1210032 × 109
Maximum7.701001 × 109
Range7.3440009 × 109
Interquartile range (IQR)40005625

Descriptive statistics

Standard deviation3.3479881 × 108
Coefficient of variation (CV)0.047711197
Kurtosis153.20346
Mean7.0171958 × 109
Median Absolute Deviation (MAD)20003000
Skewness-11.805803
Sum7.0171958 × 1013
Variance1.1209024 × 1017
MonotonicityNot monotonic
2024-04-21T17:04:14.152296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7001005800 20
 
0.2%
7001001800 18
 
0.2%
7111069100 17
 
0.2%
7021025000 16
 
0.2%
7021018100 16
 
0.2%
7021056300 16
 
0.2%
7011010700 15
 
0.1%
7061022600 15
 
0.1%
7061001300 15
 
0.1%
7061026000 15
 
0.1%
Other values (2914) 9837
98.4%
ValueCountFrequency (%)
357000111 1
 
< 0.1%
357000114 1
 
< 0.1%
357000115 2
 
< 0.1%
1570000400 2
 
< 0.1%
3600015400 3
< 0.1%
3600015500 1
 
< 0.1%
3600035800 7
0.1%
3600052500 2
 
< 0.1%
3600052600 2
 
< 0.1%
3600055800 1
 
< 0.1%
ValueCountFrequency (%)
7701001000 1
< 0.1%
7701000900 1
< 0.1%
7701000800 1
< 0.1%
7701000600 1
< 0.1%
7701000400 1
< 0.1%
7701000100 1
< 0.1%
7301001404 1
< 0.1%
7301001403 1
< 0.1%
7301001200 1
< 0.1%
7301000900 1
< 0.1%
Distinct2924
Distinct (%)29.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T17:04:14.896229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length20
Mean length7.769
Min length2

Characters and Unicode

Total characters77690
Distinct characters476
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique867 ?
Unique (%)8.7%

Sample

1st row영남대 앞
2nd row유가읍사무소건너
3rd row고산정수사업소건너
4th row서한이다음1차앞
5th row팔레스호텔앞1
ValueCountFrequency (%)
건너 179
 
1.7%
31
 
0.3%
섬유회관앞 20
 
0.2%
남문시장건너2 18
 
0.2%
북부동행정복지센터 18
 
0.2%
영남대 17
 
0.2%
방천리공영차고지 17
 
0.2%
강남약국건너 16
 
0.2%
대백인터빌앞 16
 
0.2%
태전역2 16
 
0.2%
Other values (2893) 9955
96.6%
2024-04-21T17:04:15.778748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3463
 
4.5%
3367
 
4.3%
3365
 
4.3%
2320
 
3.0%
2283
 
2.9%
1843
 
2.4%
1455
 
1.9%
1420
 
1.8%
1350
 
1.7%
1 1317
 
1.7%
Other values (466) 55507
71.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 71113
91.5%
Decimal Number 3278
 
4.2%
Close Punctuation 1136
 
1.5%
Open Punctuation 1127
 
1.5%
Uppercase Letter 553
 
0.7%
Space Separator 303
 
0.4%
Other Punctuation 97
 
0.1%
Dash Punctuation 40
 
0.1%
Lowercase Letter 39
 
0.1%
Other Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3463
 
4.9%
3367
 
4.7%
3365
 
4.7%
2320
 
3.3%
2283
 
3.2%
1843
 
2.6%
1455
 
2.0%
1420
 
2.0%
1350
 
1.9%
1280
 
1.8%
Other values (431) 48967
68.9%
Uppercase Letter
ValueCountFrequency (%)
T 88
15.9%
K 59
10.7%
L 58
10.5%
C 55
9.9%
G 51
9.2%
A 37
6.7%
V 37
6.7%
H 36
6.5%
B 30
 
5.4%
I 28
 
5.1%
Other values (6) 74
13.4%
Decimal Number
ValueCountFrequency (%)
1 1317
40.2%
2 1263
38.5%
3 256
 
7.8%
4 191
 
5.8%
5 85
 
2.6%
6 38
 
1.2%
7 36
 
1.1%
8 36
 
1.1%
9 32
 
1.0%
0 24
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 69
71.1%
/ 17
 
17.5%
· 11
 
11.3%
Close Punctuation
ValueCountFrequency (%)
) 1136
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1127
100.0%
Space Separator
ValueCountFrequency (%)
303
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 39
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 71117
91.5%
Common 5981
 
7.7%
Latin 592
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3463
 
4.9%
3367
 
4.7%
3365
 
4.7%
2320
 
3.3%
2283
 
3.2%
1843
 
2.6%
1455
 
2.0%
1420
 
2.0%
1350
 
1.9%
1280
 
1.8%
Other values (432) 48971
68.9%
Common
ValueCountFrequency (%)
1 1317
22.0%
2 1263
21.1%
) 1136
19.0%
( 1127
18.8%
303
 
5.1%
3 256
 
4.3%
4 191
 
3.2%
5 85
 
1.4%
. 69
 
1.2%
- 40
 
0.7%
Other values (7) 194
 
3.2%
Latin
ValueCountFrequency (%)
T 88
14.9%
K 59
10.0%
L 58
9.8%
C 55
9.3%
G 51
8.6%
e 39
 
6.6%
A 37
 
6.2%
V 37
 
6.2%
H 36
 
6.1%
B 30
 
5.1%
Other values (7) 102
17.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 71113
91.5%
ASCII 6562
 
8.4%
None 15
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3463
 
4.9%
3367
 
4.7%
3365
 
4.7%
2320
 
3.3%
2283
 
3.2%
1843
 
2.6%
1455
 
2.0%
1420
 
2.0%
1350
 
1.9%
1280
 
1.8%
Other values (431) 48967
68.9%
ASCII
ValueCountFrequency (%)
1 1317
20.1%
2 1263
19.2%
) 1136
17.3%
( 1127
17.2%
303
 
4.6%
3 256
 
3.9%
4 191
 
2.9%
T 88
 
1.3%
5 85
 
1.3%
. 69
 
1.1%
Other values (23) 727
11.1%
None
ValueCountFrequency (%)
· 11
73.3%
4
 
26.7%

노선
Text

Distinct123
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T17:04:16.913093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.1761
Min length3

Characters and Unicode

Total characters31761
Distinct characters29
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row840
2nd row급행8-1
3rd row403
4th row동구4-1
5th row349
ValueCountFrequency (%)
509 198
 
2.0%
708 196
 
2.0%
524 187
 
1.9%
405 184
 
1.8%
655 180
 
1.8%
808 180
 
1.8%
814 168
 
1.7%
840 166
 
1.7%
623 165
 
1.7%
503 163
 
1.6%
Other values (113) 8213
82.1%
2024-04-21T17:04:18.291056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3550
11.2%
1 3253
10.2%
3 3055
9.6%
4 2991
9.4%
5 2806
8.8%
2 2247
 
7.1%
6 2232
 
7.0%
9 1675
 
5.3%
8 1650
 
5.2%
7 1338
 
4.2%
Other values (19) 6964
21.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24797
78.1%
Other Letter 6098
 
19.2%
Dash Punctuation 866
 
2.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
931
15.3%
864
14.2%
648
10.6%
599
9.8%
378
 
6.2%
378
 
6.2%
378
 
6.2%
293
 
4.8%
293
 
4.8%
274
 
4.5%
Other values (8) 1062
17.4%
Decimal Number
ValueCountFrequency (%)
0 3550
14.3%
1 3253
13.1%
3 3055
12.3%
4 2991
12.1%
5 2806
11.3%
2 2247
9.1%
6 2232
9.0%
9 1675
6.8%
8 1650
6.7%
7 1338
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 866
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 25663
80.8%
Hangul 6098
 
19.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
931
15.3%
864
14.2%
648
10.6%
599
9.8%
378
 
6.2%
378
 
6.2%
378
 
6.2%
293
 
4.8%
293
 
4.8%
274
 
4.5%
Other values (8) 1062
17.4%
Common
ValueCountFrequency (%)
0 3550
13.8%
1 3253
12.7%
3 3055
11.9%
4 2991
11.7%
5 2806
10.9%
2 2247
8.8%
6 2232
8.7%
9 1675
6.5%
8 1650
6.4%
7 1338
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25663
80.8%
Hangul 6098
 
19.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3550
13.8%
1 3253
12.7%
3 3055
11.9%
4 2991
11.7%
5 2806
10.9%
2 2247
8.8%
6 2232
8.7%
9 1675
6.5%
8 1650
6.4%
7 1338
 
5.2%
Hangul
ValueCountFrequency (%)
931
15.3%
864
14.2%
648
10.6%
599
9.8%
378
 
6.2%
378
 
6.2%
378
 
6.2%
293
 
4.8%
293
 
4.8%
274
 
4.5%
Other values (8) 1062
17.4%

진행방향
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
정방향
6252 
역방향
3748 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row역방향
2nd row정방향
3rd row정방향
4th row정방향
5th row역방향

Common Values

ValueCountFrequency (%)
정방향 6252
62.5%
역방향 3748
37.5%

Length

2024-04-21T17:04:18.515445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T17:04:18.674810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정방향 6252
62.5%
역방향 3748
37.5%

시간표유형
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
휴일
3553 
평일
3518 
토요일(감차)
2424 
공통
505 

Length

Max length7
Median length2
Mean length3.212
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row평일
2nd row공통
3rd row휴일
4th row휴일
5th row평일

Common Values

ValueCountFrequency (%)
휴일 3553
35.5%
평일 3518
35.2%
토요일(감차) 2424
24.2%
공통 505
 
5.1%

Length

2024-04-21T17:04:18.883277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T17:04:19.209177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
휴일 3553
35.5%
평일 3518
35.2%
토요일(감차 2424
24.2%
공통 505
 
5.1%

첫차
Date

Distinct222
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2024-04-21 05:30:00
Maximum2024-04-21 16:13:00
2024-04-21T17:04:19.555562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T17:04:19.986700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

막차
Date

Distinct280
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2024-04-21 07:17:00
Maximum2024-04-21 23:30:00
2024-04-21T17:04:20.379816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T17:04:20.783613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

운행회수
Real number (ℝ)

HIGH CORRELATION 

Distinct133
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68.4857
Minimum2
Maximum176
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T17:04:21.188708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile30
Q157
median68
Q382
95-th percentile105
Maximum176
Range174
Interquartile range (IQR)25

Descriptive statistics

Standard deviation22.268246
Coefficient of variation (CV)0.32515176
Kurtosis1.0935859
Mean68.4857
Median Absolute Deviation (MAD)12
Skewness-0.30493875
Sum684857
Variance495.87478
MonotonicityNot monotonic
2024-04-21T17:04:21.829304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
62 294
 
2.9%
63 282
 
2.8%
68 272
 
2.7%
59 254
 
2.5%
73 253
 
2.5%
71 245
 
2.5%
60 238
 
2.4%
70 235
 
2.4%
74 223
 
2.2%
72 222
 
2.2%
Other values (123) 7482
74.8%
ValueCountFrequency (%)
2 20
0.2%
3 31
0.3%
4 20
0.2%
5 26
0.3%
6 20
0.2%
7 26
0.3%
8 41
0.4%
9 47
0.5%
10 47
0.5%
11 12
 
0.1%
ValueCountFrequency (%)
176 1
 
< 0.1%
175 1
 
< 0.1%
161 1
 
< 0.1%
157 1
 
< 0.1%
141 1
 
< 0.1%
140 1
 
< 0.1%
133 1
 
< 0.1%
130 3
 
< 0.1%
128 1
 
< 0.1%
127 10
0.1%

평균배차시간
Real number (ℝ)

HIGH CORRELATION 

Distinct136
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.6237
Minimum6
Maximum922
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T17:04:22.246492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile9
Q112
median15
Q318
95-th percentile31
Maximum922
Range916
Interquartile range (IQR)6

Descriptive statistics

Standard deviation28.714738
Coefficient of variation (CV)1.4632683
Kurtosis335.52837
Mean19.6237
Median Absolute Deviation (MAD)3
Skewness14.065357
Sum196237
Variance824.53615
MonotonicityNot monotonic
2024-04-21T17:04:22.659783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
14 1183
11.8%
13 969
9.7%
15 906
9.1%
16 871
 
8.7%
11 812
 
8.1%
12 760
 
7.6%
17 748
 
7.5%
18 598
 
6.0%
10 478
 
4.8%
19 478
 
4.8%
Other values (126) 2197
22.0%
ValueCountFrequency (%)
6 4
 
< 0.1%
7 2
 
< 0.1%
8 246
 
2.5%
9 287
 
2.9%
10 478
4.8%
11 812
8.1%
12 760
7.6%
13 969
9.7%
14 1183
11.8%
15 906
9.1%
ValueCountFrequency (%)
922 1
< 0.1%
920 2
< 0.1%
474 1
< 0.1%
472 1
< 0.1%
436 2
< 0.1%
376 1
< 0.1%
342 1
< 0.1%
333 2
< 0.1%
327 2
< 0.1%
326 2
< 0.1%

Interactions

2024-04-21T17:04:11.952963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T17:04:10.285892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T17:04:11.117869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T17:04:12.230298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T17:04:10.572233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T17:04:11.410551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T17:04:12.499373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T17:04:10.856913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T17:04:11.693493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T17:04:22.912282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정류소 아이디진행방향시간표유형운행회수평균배차시간
정류소 아이디1.0000.0430.0000.0760.087
진행방향0.0431.0000.1440.1610.064
시간표유형0.0000.1441.0000.5250.288
운행회수0.0760.1610.5251.0000.499
평균배차시간0.0870.0640.2880.4991.000
2024-04-21T17:04:23.171000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시간표유형진행방향
시간표유형1.0000.095
진행방향0.0951.000
2024-04-21T17:04:23.412219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정류소 아이디운행회수평균배차시간진행방향시간표유형
정류소 아이디1.000-0.0870.0750.0240.000
운행회수-0.0871.000-0.9870.1260.340
평균배차시간0.075-0.9871.0000.0680.201
진행방향0.0240.1260.0681.0000.095
시간표유형0.0000.3400.2010.0951.000

Missing values

2024-04-21T17:04:12.841370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T17:04:13.295580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

정류소 아이디정류소노선진행방향시간표유형첫차막차운행회수평균배차시간
189457121005700영남대 앞840역방향평일05:3523:261149
218547111067100유가읍사무소건너급행8-1정방향공통05:4023:057414
47157061036000고산정수사업소건너403정방향휴일05:4422:345718
247407011052600서한이다음1차앞동구4-1정방향휴일05:5823:126316
35617051011800팔레스호텔앞1349역방향평일05:3123:139111
85627031007000진달래아파트건너523정방향평일05:4123:218312
181267061020600지산한라타운앞814정방향토요일(감차)06:1923:211049
204877121000300경산네거리(대구방향)980정방향평일05:4022:578911
184827021011400침산네거리2836정방향평일05:4922:558911
161027041064400상인역(5번출구)726역방향휴일05:4322:459810
정류소 아이디정류소노선진행방향시간표유형첫차막차운행회수평균배차시간
221557041024000월촌초등학교앞달서1역방향평일06:0323:036915
68167121011900압량읍사무소 건너449정방향토요일(감차)06:2223:257314
224067041001400STX메탈앞달서3역방향토요일(감차)06:2723:275419
132967111014400명곡미래빌3단지건너653역방향평일06:1123:186715
75337061001400경신고등학교건너509역방향토요일(감차)05:3722:487813
279707021015800대산초등학교건너순환2-1정방향토요일(감차)05:5023:258213
2060670610095002차대자연맨션앞가창2정방향공통05:4822:465518
176287021029600칠성시장역(서문프라자건너)808정방향평일05:4623:309611
222567041052900유천포스코앞달서1역방향토요일(감차)05:4123:216416
296017011026300공산댐건너팔공2역방향휴일06:4021:561376