Overview

Dataset statistics

Number of variables8
Number of observations1386
Missing cells95
Missing cells (%)0.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory88.1 KiB
Average record size in memory65.1 B

Variable types

Numeric1
Categorical3
DateTime2
Text2

Dataset

Description통영시 관내 버스 운행에 대하여 노선,기점,출발시간,경유1,무전동,경유2,종점,버스회사 정보를 제공합니다.
URLhttps://www.data.go.kr/data/3067173/fileData.do

Alerts

노선 is highly overall correlated with 기점 and 1 other fieldsHigh correlation
기점 is highly overall correlated with 노선 and 1 other fieldsHigh correlation
종점 is highly overall correlated with 노선 and 1 other fieldsHigh correlation
버스회사 is highly overall correlated with 기점 and 1 other fieldsHigh correlation
경유1 has 28 (2.0%) missing valuesMissing
경유2 has 67 (4.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 06:14:29.067583
Analysis finished2023-12-12 06:14:30.503261
Duration1.44 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

노선
Real number (ℝ)

HIGH CORRELATION 

Distinct105
Distinct (%)7.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean397.48701
Minimum100
Maximum978
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.3 KiB
2023-12-12T15:14:30.630615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum100
5-th percentile100
Q1200
median400
Q3541
95-th percentile750
Maximum978
Range878
Interquartile range (IQR)341

Descriptive statistics

Standard deviation229.72693
Coefficient of variation (CV)0.57794827
Kurtosis-0.55869627
Mean397.48701
Median Absolute Deviation (MAD)200
Skewness0.40398649
Sum550917
Variance52774.462
MonotonicityIncreasing
2023-12-12T15:14:30.829253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100 90
 
6.5%
101 67
 
4.8%
400 54
 
3.9%
531 52
 
3.8%
141 51
 
3.7%
530 48
 
3.5%
200 44
 
3.2%
143 41
 
3.0%
401 39
 
2.8%
301 37
 
2.7%
Other values (95) 863
62.3%
ValueCountFrequency (%)
100 90
6.5%
101 67
4.8%
104 32
 
2.3%
105 10
 
0.7%
113 15
 
1.1%
121 24
 
1.7%
128 1
 
0.1%
140 2
 
0.1%
141 51
3.7%
143 41
3.0%
ValueCountFrequency (%)
978 1
 
0.1%
976 3
 
0.2%
974 1
 
0.1%
973 1
 
0.1%
972 1
 
0.1%
971 5
0.4%
965 3
 
0.2%
964 6
0.4%
963 10
0.7%
961 1
 
0.1%

기점
Categorical

HIGH CORRELATION 

Distinct38
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
시외터미널
366 
도남동
150 
서호시장
121 
거제대교
92 
신우희가로
90 
Other values (33)
567 

Length

Max length5
Median length4
Mean length3.8333333
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row거제대교
2nd row거제대교
3rd row거제대교
4th row거제대교
5th row거제대교

Common Values

ValueCountFrequency (%)
시외터미널 366
26.4%
도남동 150
10.8%
서호시장 121
 
8.7%
거제대교 92
 
6.6%
신우희가로 90
 
6.5%
용화사 74
 
5.3%
해양과학대 73
 
5.3%
미수동 62
 
4.5%
척포 50
 
3.6%
인평동 34
 
2.5%
Other values (28) 274
19.8%

Length

2023-12-12T15:14:31.011415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
시외터미널 366
26.4%
도남동 150
10.8%
서호시장 121
 
8.7%
거제대교 92
 
6.6%
신우희가로 90
 
6.5%
용화사 74
 
5.3%
해양과학대 73
 
5.3%
미수동 62
 
4.5%
척포 50
 
3.6%
인평동 34
 
2.5%
Other values (28) 274
19.8%
Distinct472
Distinct (%)34.1%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
Minimum2023-12-12 04:35:00
Maximum2023-12-12 23:20:00
2023-12-12T15:14:31.192510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:14:31.381179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

경유1
Text

MISSING 

Distinct90
Distinct (%)6.6%
Missing28
Missing (%)2.0%
Memory size11.0 KiB
2023-12-12T15:14:32.038216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length5
Mean length4.865243
Min length2

Characters and Unicode

Total characters6607
Distinct characters122
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)1.7%

Sample

1st row보건-무전
2nd row보건-무전
3rd row보건-무전
4th row보건-무전
5th row보건-무전
ValueCountFrequency (%)
한선-고려 198
 
14.6%
토성-무전 189
 
13.9%
토성-고려 108
 
8.0%
토성 63
 
4.6%
한선-무전 62
 
4.6%
정량동 62
 
4.6%
향교-고려 47
 
3.5%
보건-무전 46
 
3.4%
향교-무전 41
 
3.0%
토성고개 35
 
2.6%
Other values (79) 507
37.3%
2023-12-12T15:14:32.526122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1112
16.8%
481
 
7.3%
470
 
7.1%
470
 
7.1%
410
 
6.2%
366
 
5.5%
364
 
5.5%
364
 
5.5%
360
 
5.4%
150
 
2.3%
Other values (112) 2060
31.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5454
82.5%
Dash Punctuation 1112
 
16.8%
Lowercase Letter 28
 
0.4%
Decimal Number 9
 
0.1%
Space Separator 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
481
 
8.8%
470
 
8.6%
470
 
8.6%
410
 
7.5%
366
 
6.7%
364
 
6.7%
364
 
6.7%
360
 
6.6%
150
 
2.8%
146
 
2.7%
Other values (108) 1873
34.3%
Dash Punctuation
ValueCountFrequency (%)
- 1112
100.0%
Lowercase Letter
ValueCountFrequency (%)
a 28
100.0%
Decimal Number
ValueCountFrequency (%)
2 9
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5454
82.5%
Common 1125
 
17.0%
Latin 28
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
481
 
8.8%
470
 
8.6%
470
 
8.6%
410
 
7.5%
366
 
6.7%
364
 
6.7%
364
 
6.7%
360
 
6.6%
150
 
2.8%
146
 
2.7%
Other values (108) 1873
34.3%
Common
ValueCountFrequency (%)
- 1112
98.8%
2 9
 
0.8%
4
 
0.4%
Latin
ValueCountFrequency (%)
a 28
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5454
82.5%
ASCII 1153
 
17.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1112
96.4%
a 28
 
2.4%
2 9
 
0.8%
4
 
0.3%
Hangul
ValueCountFrequency (%)
481
 
8.8%
470
 
8.6%
470
 
8.6%
410
 
7.5%
366
 
6.7%
364
 
6.7%
364
 
6.7%
360
 
6.6%
150
 
2.8%
146
 
2.7%
Other values (108) 1873
34.3%
Distinct409
Distinct (%)29.5%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
Minimum2023-12-12 04:45:00
Maximum2023-12-12 23:40:00
2023-12-12T15:14:32.710376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:14:32.919486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

경유2
Text

MISSING 

Distinct73
Distinct (%)5.5%
Missing67
Missing (%)4.8%
Memory size11.0 KiB
2023-12-12T15:14:33.149273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length11
Mean length3.9643669
Min length2

Characters and Unicode

Total characters5229
Distinct characters108
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)1.1%

Sample

1st row토성
2nd row토성
3rd row토성
4th row토성
5th row토성
ValueCountFrequency (%)
토성 234
17.7%
고려-한선 127
 
9.6%
향교 82
 
6.2%
한선 74
 
5.6%
정량동 72
 
5.5%
무전-토성 60
 
4.5%
한선-조암 58
 
4.4%
토성고개 52
 
3.9%
무전-향교 42
 
3.2%
무전-한선 38
 
2.9%
Other values (64) 481
36.4%
2023-12-12T15:14:33.566831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 699
 
13.4%
470
 
9.0%
469
 
9.0%
348
 
6.7%
342
 
6.5%
298
 
5.7%
218
 
4.2%
190
 
3.6%
182
 
3.5%
159
 
3.0%
Other values (98) 1854
35.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4502
86.1%
Dash Punctuation 699
 
13.4%
Lowercase Letter 25
 
0.5%
Space Separator 2
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
470
 
10.4%
469
 
10.4%
348
 
7.7%
342
 
7.6%
298
 
6.6%
218
 
4.8%
190
 
4.2%
182
 
4.0%
159
 
3.5%
155
 
3.4%
Other values (94) 1671
37.1%
Dash Punctuation
ValueCountFrequency (%)
- 699
100.0%
Lowercase Letter
ValueCountFrequency (%)
a 25
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4502
86.1%
Common 702
 
13.4%
Latin 25
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
470
 
10.4%
469
 
10.4%
348
 
7.7%
342
 
7.6%
298
 
6.6%
218
 
4.8%
190
 
4.2%
182
 
4.0%
159
 
3.5%
155
 
3.4%
Other values (94) 1671
37.1%
Common
ValueCountFrequency (%)
- 699
99.6%
2
 
0.3%
2 1
 
0.1%
Latin
ValueCountFrequency (%)
a 25
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4502
86.1%
ASCII 727
 
13.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 699
96.1%
a 25
 
3.4%
2
 
0.3%
2 1
 
0.1%
Hangul
ValueCountFrequency (%)
470
 
10.4%
469
 
10.4%
348
 
7.7%
342
 
7.6%
298
 
6.6%
218
 
4.8%
190
 
4.2%
182
 
4.0%
159
 
3.5%
155
 
3.4%
Other values (94) 1671
37.1%

종점
Categorical

HIGH CORRELATION 

Distinct38
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
시외터미널
365 
도남동
150 
서호시장
121 
거제대교
92 
신우희가로
91 
Other values (33)
567 

Length

Max length5
Median length4
Mean length3.8333333
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row도남동
2nd row도남동
3rd row도남동
4th row도남동
5th row도남동

Common Values

ValueCountFrequency (%)
시외터미널 365
26.3%
도남동 150
10.8%
서호시장 121
 
8.7%
거제대교 92
 
6.6%
신우희가로 91
 
6.6%
용화사 74
 
5.3%
해양과학대 73
 
5.3%
미수동 62
 
4.5%
척포 50
 
3.6%
인평동 34
 
2.5%
Other values (28) 274
19.8%

Length

2023-12-12T15:14:33.726320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
시외터미널 365
26.3%
도남동 150
10.8%
서호시장 121
 
8.7%
거제대교 92
 
6.6%
신우희가로 91
 
6.6%
용화사 74
 
5.3%
해양과학대 73
 
5.3%
미수동 62
 
4.5%
척포 50
 
3.6%
인평동 34
 
2.5%
Other values (28) 274
19.8%

버스회사
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.0 KiB
통영부산교통
1129 
신흥여객
257 

Length

Max length6
Median length6
Mean length5.6291486
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row통영부산교통
2nd row통영부산교통
3rd row통영부산교통
4th row통영부산교통
5th row통영부산교통

Common Values

ValueCountFrequency (%)
통영부산교통 1129
81.5%
신흥여객 257
 
18.5%

Length

2023-12-12T15:14:33.867694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:14:33.979105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
통영부산교통 1129
81.5%
신흥여객 257
 
18.5%

Interactions

2023-12-12T15:14:30.078616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:14:34.059346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
노선기점경유1경유2종점버스회사
노선1.0000.9190.9470.9230.9120.372
기점0.9191.0000.9920.9660.9230.768
경유10.9470.9921.0000.9940.9510.936
경유20.9230.9660.9941.0000.9900.891
종점0.9120.9230.9510.9901.0000.767
버스회사0.3720.7680.9360.8910.7671.000
2023-12-12T15:14:34.182650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기점버스회사종점
기점1.0000.6230.344
버스회사0.6231.0000.622
종점0.3440.6221.000
2023-12-12T15:14:34.287020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
노선기점종점버스회사
노선1.0000.6500.6320.371
기점0.6501.0000.3440.623
종점0.6320.3441.0000.622
버스회사0.3710.6230.6221.000

Missing values

2023-12-12T15:14:30.201074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:14:30.332896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T15:14:30.436640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

노선기점출발시간경유1무전동경유2종점버스회사
0100거제대교05:50:00보건-무전06:05:00토성도남동통영부산교통
1100거제대교06:10:00보건-무전06:25:00토성도남동통영부산교통
2100거제대교06:40:00보건-무전07:00:00토성도남동통영부산교통
3100거제대교07:45:00보건-무전08:05:00토성도남동통영부산교통
4100거제대교08:15:00보건-무전08:35:00토성도남동통영부산교통
5100거제대교08:25:00보건-무전08:45:00토성도남동통영부산교통
6100거제대교08:40:00보건-무전09:00:00토성도남동통영부산교통
7100거제대교09:50:00보건-무전10:10:00토성도남동통영부산교통
8100거제대교10:10:00보건-무전10:30:00토성도남동통영부산교통
9100거제대교10:40:00보건-무전11:00:00토성도남동통영부산교통
노선기점출발시간경유1무전동경유2종점버스회사
1376971분지포18:35:00원문고개18:55:00향교시외터미널신흥여객
1377971분지포21:55:00원문고개22:15:00향교시외터미널신흥여객
1378971시외터미널06:00:00향교06:10:00원문고개분지포신흥여객
1379972가오치18:50:00도산19:10:00노산삼거리시외터미널통영부산교통
1380973저산21:10:00수월21:35:00노산삼거리시외터미널통영부산교통
1381974수월20:10:00저산20:35:00노산삼거리시외터미널통영부산교통
1382976시외터미널05:35:00소방서05:40:00시외터미널원산리통영부산교통
1383976원산리19:40:00<NA>20:20:00노산삼거리시외터미널통영부산교통
1384976원산리21:50:00도산22:10:00노산삼거리시외터미널통영부산교통
1385978월평리16:25:00<NA>16:40:00<NA>시외터미널통영부산교통