Overview

Dataset statistics

Number of variables: 12
Number of observations: 164
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 16.0 KiB
Average record size in memory: 99.8 B

Variable types

Numeric: 2
Text: 5
Categorical: 4
DateTime: 1

Dataset

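Description: Bus timetable data for Yangpyeong-gun (양평군), providing the route number, origin, waypoints, terminus, distance, number of runs, number of vehicles, departure times from the origin, and related information.
URL: https://www.data.go.kr/data/3066397/fileData.do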

Alerts

기준일자 has constant value "" (Constant)
순번 is highly overall correlated with 비고(변경사항 등) (High correlation)
거리 is highly overall correlated with 비고(변경사항 등) (High correlation)
회수 is highly overall correlated with 대수 (High correlation)
대수 is highly overall correlated with 회수 (High correlation)
비고(변경사항 등) is highly overall correlated with 순번 and 1 other field (High correlation)
비고(변경사항 등) is highly imbalanced (77.5%) (Imbalance)
순번 has unique values (Unique)
노선번호 has unique values (Unique)
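
The 77.5% imbalance figure for 비고(변경사항 등) can be reproduced from the class counts listed under that variable's Common Values (151, 8, 2, 2, 1). The sketch below assumes the score is one minus the Shannon entropy normalised by its maximum; the function name `imbalance_score` is hypothetical, and the formula is an assumption that happens to match the reported value rather than a confirmed description of the profiler's internals.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class counts."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    if len(probs) < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(probs))

# Class counts of 비고(변경사항 등), taken from its Common Values table.
counts = [151, 8, 2, 2, 1]
score = imbalance_score(counts)
print(f"{score:.1%}")  # prints 77.5%
```

A score of 0 would mean perfectly balanced classes; values close to 1 mean a single class dominates, as the `<NA>` placeholder does here.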

Reproduction

Analysis started: 2023-12-12 13:23:11.990451
Analysis finished: 2023-12-12 13:23:13.505843
Duration: 1.52 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
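
As a quick consistency check, the reported duration follows from the two timestamps above. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-12 13:23:11.990451")
finished = datetime.fromisoformat("2023-12-12 13:23:13.505843")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # prints 1.52 seconds
```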

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 164
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 82.5
Minimum: 1
Maximum: 164
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 9.15
Q1: 41.75
Median: 82.5
Q3: 123.25
95-th percentile: 155.85
Maximum: 164
Range: 163
Interquartile range (IQR): 81.5

Descriptive statistics

Standard deviation: 47.48684
Coefficient of variation (CV): 0.57559806
Kurtosis: -1.2
Mean: 82.5
Median Absolute Deviation (MAD): 41
Skewness: 0
Sum: 13530
Variance: 2255
Monotonicity: Strictly increasing
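
Because 순번 is unique, strictly increasing, and runs from 1 to 164, its statistics can be checked by hand. The sketch below assumes the column is exactly the integers 1..164 and that percentiles use linear interpolation between order statistics; the helper `percentile` is a hypothetical name, not part of any library.

```python
import statistics

def percentile(sorted_values, p):
    """Linear-interpolation percentile (p in [0, 1]) on pre-sorted data."""
    pos = p * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])

values = list(range(1, 165))  # assumption: 순번 is exactly 1..164

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std = statistics.stdev(values)
cv = std / mean
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std, 5), round(cv, 8))
print(percentile(values, 0.05), q1, q3, q3 - q1, mad)
```

The results agree with the report when the variance is the sample variance (denominator n - 1), which is the convention the figures above appear to follow.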
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
105 1
 
0.6%
107 1
 
0.6%
108 1
 
0.6%
109 1
 
0.6%
110 1
 
0.6%
111 1
 
0.6%
112 1
 
0.6%
113 1
 
0.6%
114 1
 
0.6%
Other values (154) 154
93.9%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
164 1
0.6%
163 1
0.6%
162 1
0.6%
161 1
0.6%
160 1
0.6%
159 1
0.6%
158 1
0.6%
157 1
0.6%
156 1
0.6%
155 1
0.6%

노선번호
Text

UNIQUE 

Distinct: 164
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length12
Median length11
Mean length6.2926829
Min length3

Characters and Unicode

Total characters: 1032
Distinct characters: 21
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
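
To illustrate how such category counts arise, the following sketch tallies Unicode general categories with the standard `unicodedata` module. The four 노선번호 values are taken from the sample rows of this report; the function name `category_counts` is hypothetical, and the profiler's own implementation may differ.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over all characters in the values."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

# Route numbers that appear in the report's sample rows.
samples = ["[1]", "[1-1]", "[9-1 (벽)]", "[G9311]"]
counts = category_counts(samples)

for code, n in counts.most_common():
    print(code, n)
```

The two-letter codes map to the names used in this report: `Nd` is Decimal Number, `Ps` and `Pe` are Open and Close Punctuation, `Pd` is Dash Punctuation, `Lo` is Other Letter, `Zs` is Space Separator, and `Lu` is Uppercase Letter. These four samples already cover all seven categories reported for the column.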

Unique

Unique: 164
Unique (%): 100.0%

Sample

1st row[1]
2nd row[1-1]
3rd row[1-10]
4th row[1-11]
5th row[1-12]
ValueCountFrequency (%)
25
 
13.2%
1 1
 
0.5%
6-2 1
 
0.5%
5-88 1
 
0.5%
50-2 1
 
0.5%
6 1
 
0.5%
6-1 1
 
0.5%
6-10 1
 
0.5%
6-12 1
 
0.5%
군공영 1
 
0.5%
Other values (156) 156
82.1%

Most occurring characters

ValueCountFrequency (%)
[ 164
15.9%
] 164
15.9%
- 153
14.8%
1 82
 
7.9%
2 65
 
6.3%
3 48
 
4.7%
7 44
 
4.3%
4 38
 
3.7%
0 34
 
3.3%
8 34
 
3.3%
Other values (11) 206
20.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 421
40.8%
Open Punctuation 195
18.9%
Close Punctuation 195
18.9%
Dash Punctuation 153
 
14.8%
Other Letter 41
 
4.0%
Space Separator 26
 
2.5%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 82
19.5%
2 65
15.4%
3 48
11.4%
7 44
10.5%
4 38
9.0%
0 34
8.1%
8 34
8.1%
5 34
8.1%
6 23
 
5.5%
9 19
 
4.5%
Other Letter
ValueCountFrequency (%)
26
63.4%
5
 
12.2%
5
 
12.2%
5
 
12.2%
Open Punctuation
ValueCountFrequency (%)
[ 164
84.1%
( 31
 
15.9%
Close Punctuation
ValueCountFrequency (%)
] 164
84.1%
) 31
 
15.9%
Dash Punctuation
ValueCountFrequency (%)
- 153
100.0%
Space Separator
ValueCountFrequency (%)
26
100.0%
Uppercase Letter
ValueCountFrequency (%)
G 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 990
95.9%
Hangul 41
 
4.0%
Latin 1
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
[ 164
16.6%
] 164
16.6%
- 153
15.5%
1 82
8.3%
2 65
 
6.6%
3 48
 
4.8%
7 44
 
4.4%
4 38
 
3.8%
0 34
 
3.4%
8 34
 
3.4%
Other values (6) 164
16.6%
Hangul
ValueCountFrequency (%)
26
63.4%
5
 
12.2%
5
 
12.2%
5
 
12.2%
Latin
ValueCountFrequency (%)
G 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 991
96.0%
Hangul 41
 
4.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
[ 164
16.5%
] 164
16.5%
- 153
15.4%
1 82
8.3%
2 65
 
6.6%
3 48
 
4.8%
7 44
 
4.4%
4 38
 
3.8%
0 34
 
3.4%
8 34
 
3.4%
Other values (7) 165
16.6%
Hangul
ValueCountFrequency (%)
26
63.4%
5
 
12.2%
5
 
12.2%
5
 
12.2%

기점
Categorical

Distinct: 13
Distinct (%): 7.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
Value | Count
양평터미널 | 85
문호리종점 | 23
용문터미널 | 22
여주터미널 | 11
대신터미널 | 7
Other values (8) | 16

Length

Max length18
Median length5
Mean length5.0914634
Min length3

Unique

Unique6 ?
Unique (%)3.7%

Sample

1st row양평터미널
2nd row양평터미널
3rd row여주터미널
4th row여주터미널
5th row여주터미널

Common Values

Value | Count | Frequency (%)
양평터미널 | 85 | 51.8%
문호리종점 | 23 | 14.0%
용문터미널 | 22 | 13.4%
여주터미널 | 11 | 6.7%
대신터미널 | 7 | 4.3%
양동역 | 6 | 3.7%
용두리터미널 | 4 | 2.4%
용문터미널 | 1 | 0.6%
양평역 | 1 | 0.6%
양평터미널(서후2) | 1 | 0.6%
Other values (3) | 3 | 1.8%
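
The Distinct and Unique figures for 기점 follow from these counts: Distinct is the number of different values, while Unique is the number of values that occur exactly once. The sketch below assumes the three "Other values" each occur once, which is implied by 3 values covering 3 rows. The second 용문터미널 entry with a count of 1 presumably differs from the first by whitespace or a similar invisible detail; that is a guess, not something the report states.

```python
# Counts per distinct 기점 value, from the Common Values table.
# The last three entries are the "Other values (3)" rows, assumed to occur once each.
counts = [85, 23, 22, 11, 7, 6, 4, 1, 1, 1] + [1, 1, 1]

n_rows = sum(counts)
distinct = len(counts)
unique = sum(1 for c in counts if c == 1)

print(n_rows)                              # prints 164
print(distinct, f"{distinct / n_rows:.1%}")  # prints 13 7.9%
print(unique, f"{unique / n_rows:.1%}")      # prints 6 3.7%
```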

Length

Histogram of lengths of the category
ValueCountFrequency (%)
양평터미널 85
51.8%
문호리종점 23
 
14.0%
용문터미널 23
 
14.0%
여주터미널 11
 
6.7%
대신터미널 7
 
4.3%
양동역 6
 
3.7%
용두리터미널 4
 
2.4%
양평역 1
 
0.6%
양평터미널(서후2 1
 
0.6%
양수역 1
 
0.6%
Other values (2) 2
 
1.2%

경유지
Text

Distinct: 135
Distinct (%): 82.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length82
Median length49
Mean length28.420732
Min length3

Characters and Unicode

Total characters4661
Distinct characters197
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)67.7%

Sample

1st row회현,개군,천서리,보통리,대신,후포,오학
2nd row회현,개군,천서리,곡수삼거리,옥촌,대신,오학
3rd row오학,후포,대신,보통리,천서리,개군,회현
4th row오학,대신,곡수삼거리,송촌리,천서리,회현리,양평시장
5th row오학,대신,천서리,이포,개군,회현
ValueCountFrequency (%)
양수역 5
 
2.8%
양평군청,오빈리,아신리,아세아신학대후문,국수리,양수리,골용진,노적사 4
 
2.3%
서후2리종점 2
 
1.1%
종점 2
 
1.1%
양평시장,양평역 2
 
1.1%
양평터미널 2
 
1.1%
목왕1리,벗고개 2
 
1.1%
양평시장,양평고교,회현리,석장리,앙덕리,구미리 2
 
1.1%
양평시장,덕평리,신애리,오빈리,아신역,옥천면사무소 2
 
1.1%
양평군청,오빈리,아신리,아세아신학대후문,복포리,국수역,정자리,청계리중말,진개울,대하초교,중등리 2
 
1.1%
Other values (135) 152
85.9%

Most occurring characters

ValueCountFrequency (%)
, 867
 
18.6%
486
 
10.4%
181
 
3.9%
178
 
3.8%
92
 
2.0%
82
 
1.8%
74
 
1.6%
72
 
1.5%
71
 
1.5%
69
 
1.5%
Other values (187) 2489
53.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3632
77.9%
Other Punctuation 871
 
18.7%
Decimal Number 91
 
2.0%
Close Punctuation 20
 
0.4%
Open Punctuation 20
 
0.4%
Uppercase Letter 14
 
0.3%
Space Separator 13
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
486
 
13.4%
181
 
5.0%
178
 
4.9%
92
 
2.5%
82
 
2.3%
74
 
2.0%
72
 
2.0%
71
 
2.0%
69
 
1.9%
68
 
1.9%
Other values (175) 2259
62.2%
Decimal Number
ValueCountFrequency (%)
1 36
39.6%
2 32
35.2%
3 22
24.2%
4 1
 
1.1%
Other Punctuation
ValueCountFrequency (%)
, 867
99.5%
. 4
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 18
90.0%
] 2
 
10.0%
Open Punctuation
ValueCountFrequency (%)
( 18
90.0%
[ 2
 
10.0%
Uppercase Letter
ValueCountFrequency (%)
A 14
100.0%
Space Separator
ValueCountFrequency (%)
13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3632
77.9%
Common 1015
 
21.8%
Latin 14
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
486
 
13.4%
181
 
5.0%
178
 
4.9%
92
 
2.5%
82
 
2.3%
74
 
2.0%
72
 
2.0%
71
 
2.0%
69
 
1.9%
68
 
1.9%
Other values (175) 2259
62.2%
Common
ValueCountFrequency (%)
, 867
85.4%
1 36
 
3.5%
2 32
 
3.2%
3 22
 
2.2%
) 18
 
1.8%
( 18
 
1.8%
13
 
1.3%
. 4
 
0.4%
[ 2
 
0.2%
] 2
 
0.2%
Latin
ValueCountFrequency (%)
A 14
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3632
77.9%
ASCII 1029
 
22.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 867
84.3%
1 36
 
3.5%
2 32
 
3.1%
3 22
 
2.1%
) 18
 
1.7%
( 18
 
1.7%
A 14
 
1.4%
13
 
1.3%
. 4
 
0.4%
[ 2
 
0.2%
Other values (2) 3
 
0.3%
Hangul
ValueCountFrequency (%)
486
 
13.4%
181
 
5.0%
178
 
4.9%
92
 
2.5%
82
 
2.3%
74
 
2.0%
72
 
2.0%
71
 
2.0%
69
 
1.9%
68
 
1.9%
Other values (175) 2259
62.2%

종점
Text

Distinct: 99
Distinct (%): 60.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length17
Median length14
Mean length4.8658537
Min length2

Characters and Unicode

Total characters798
Distinct characters127
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)34.8%

Sample

1st row여주
2nd row(곡수)여주
3rd row양평
4th row양평
5th row양평
ValueCountFrequency (%)
양평 13
 
7.2%
양수역 7
 
3.9%
대신 7
 
3.9%
여주 6
 
3.3%
용문사 5
 
2.8%
항금리 4
 
2.2%
지평 4
 
2.2%
양평터미널 4
 
2.2%
곡수 4
 
2.2%
문호리 4
 
2.2%
Other values (84) 122
67.8%

Most occurring characters

ValueCountFrequency (%)
94
 
11.8%
59
 
7.4%
35
 
4.4%
( 33
 
4.1%
) 33
 
4.1%
26
 
3.3%
25
 
3.1%
23
 
2.9%
20
 
2.5%
15
 
1.9%
Other values (117) 435
54.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 636
79.7%
Space Separator 59
 
7.4%
Open Punctuation 33
 
4.1%
Close Punctuation 33
 
4.1%
Decimal Number 23
 
2.9%
Control 10
 
1.3%
Other Punctuation 4
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
94
 
14.8%
35
 
5.5%
26
 
4.1%
25
 
3.9%
23
 
3.6%
20
 
3.1%
15
 
2.4%
14
 
2.2%
14
 
2.2%
13
 
2.0%
Other values (109) 357
56.1%
Decimal Number
ValueCountFrequency (%)
3 12
52.2%
2 7
30.4%
1 4
 
17.4%
Space Separator
ValueCountFrequency (%)
59
100.0%
Open Punctuation
ValueCountFrequency (%)
( 33
100.0%
Close Punctuation
ValueCountFrequency (%)
) 33
100.0%
Control
ValueCountFrequency (%)
10
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 636
79.7%
Common 162
 
20.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
94
 
14.8%
35
 
5.5%
26
 
4.1%
25
 
3.9%
23
 
3.6%
20
 
3.1%
15
 
2.4%
14
 
2.2%
14
 
2.2%
13
 
2.0%
Other values (109) 357
56.1%
Common
ValueCountFrequency (%)
59
36.4%
( 33
20.4%
) 33
20.4%
3 12
 
7.4%
10
 
6.2%
2 7
 
4.3%
, 4
 
2.5%
1 4
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 636
79.7%
ASCII 162
 
20.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
94
 
14.8%
35
 
5.5%
26
 
4.1%
25
 
3.9%
23
 
3.6%
20
 
3.1%
15
 
2.4%
14
 
2.2%
14
 
2.2%
13
 
2.0%
Other values (109) 357
56.1%
ASCII
ValueCountFrequency (%)
59
36.4%
( 33
20.4%
) 33
20.4%
3 12
 
7.4%
10
 
6.2%
2 7
 
4.3%
, 4
 
2.5%
1 4
 
2.5%

거리
Real number (ℝ)

HIGH CORRELATION 

Distinct: 115
Distinct (%): 70.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22.442683
Minimum: 3.3
Maximum: 59.5
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.6 KiB

Quantile statistics

Minimum: 3.3
5-th percentile: 5.615
Q1: 11.5
Median: 18.8
Q3: 30.15
95-th percentile: 48.785
Maximum: 59.5
Range: 56.2
Interquartile range (IQR): 18.65

Descriptive statistics

Standard deviation: 13.533068
Coefficient of variation (CV): 0.60300581
Kurtosis: -0.37005982
Mean: 22.442683
Median Absolute Deviation (MAD): 8.9
Skewness: 0.72765046
Sum: 3680.6
Variance: 183.14393
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
28.1 5
 
3.0%
11.5 4
 
2.4%
11.7 3
 
1.8%
8.1 3
 
1.8%
10.4 3
 
1.8%
27.7 3
 
1.8%
7.2 3
 
1.8%
19.3 3
 
1.8%
34.6 2
 
1.2%
17.9 2
 
1.2%
Other values (105) 133
81.1%
ValueCountFrequency (%)
3.3 2
1.2%
4.0 2
1.2%
4.7 2
1.2%
5.5 1
 
0.6%
5.6 2
1.2%
5.7 1
 
0.6%
6.3 1
 
0.6%
6.4 2
1.2%
6.9 2
1.2%
7.2 3
1.8%
ValueCountFrequency (%)
59.5 1
0.6%
54.6 1
0.6%
53.3 2
1.2%
52.4 2
1.2%
52.1 1
0.6%
49.5 1
0.6%
48.8 1
0.6%
48.7 1
0.6%
48.5 1
0.6%
48.0 1
0.6%

회수
Categorical

HIGH CORRELATION 

Distinct: 18
Distinct (%): 11.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
Value | Count
1 | 66
2 | 28
3 | 21
4 | 13
1(편도) | 11
Other values (13) | 25

Length

Max length24
Median length1
Mean length1.6402439
Min length1

Unique

Unique8 ?
Unique (%)4.9%

Sample

1st row6
2nd row2
3rd row4
4th row2
5th row1

Common Values

Value | Count | Frequency (%)
1 | 66 | 40.2%
2 | 28 | 17.1%
3 | 21 | 12.8%
4 | 13 | 7.9%
1(편도) | 11 | 6.7%
6 | 6 | 3.7%
5 | 5 | 3.0%
6(편도) | 2 | 1.2%
4(편도) | 2 | 1.2%
3(편도) | 2 | 1.2%
Other values (8) | 8 | 4.9%
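
회수 (number of runs) is typed as Categorical because plain integers are mixed with annotated forms such as `1(편도)`, where 편도 means one-way. A minimal sketch of splitting such values into a number and an optional annotation is shown below; `parse_runs` and `RUNS_PATTERN` are hypothetical names, and longer free-text values (the maximum length is 24 characters) are deliberately left unparsed.

```python
import re

# A leading integer, optionally followed by a parenthesised annotation.
RUNS_PATTERN = re.compile(r"^\s*(\d+)\s*(?:\((.+?)\))?\s*$")

def parse_runs(raw):
    """Return (runs, annotation) or None when the value has another shape."""
    match = RUNS_PATTERN.match(raw)
    if match is None:
        return None
    return int(match.group(1)), match.group(2)

for raw in ["6", "1(편도)", "8(6)", "평일 8회"]:
    print(repr(raw), "->", parse_runs(raw))
```

Returning `None` for unrecognised shapes keeps irregular entries visible for manual review instead of silently coercing them.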

Length

Histogram of lengths of the category
ValueCountFrequency (%)
1 66
39.5%
2 28
16.8%
3 21
 
12.6%
4 14
 
8.4%
1(편도 11
 
6.6%
6 7
 
4.2%
5 5
 
3.0%
4(편도 2
 
1.2%
3(편도 2
 
1.2%
6(편도 2
 
1.2%
Other values (9) 9
 
5.4%

대수
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
Value | Count
<NA> | 120
1 | 38
2 | 6

Length

Max length4
Median length4
Mean length3.195122
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

Value | Count | Frequency (%)
<NA> | 120 | 73.2%
1 | 38 | 23.2%
2 | 6 | 3.7%
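
Note that 대수 reports 0 missing values while `<NA>` is its most common value, and the median length of 4 matches the four characters of `<NA>`. This suggests the placeholder was stored as a literal string. The sketch below shows one way to count such placeholders as missing; the token set and the function name `missing_share` are assumptions for illustration.

```python
# Tokens treated as missing; "<NA>" is the placeholder seen in this report.
NA_TOKENS = {"<NA>", ""}

def missing_share(values, na_tokens=NA_TOKENS):
    """Replace placeholder strings with None and return the missing fraction."""
    cleaned = [None if v is None or v.strip() in na_tokens else v for v in values]
    n_missing = sum(v is None for v in cleaned)
    return cleaned, n_missing / len(values)

# Reconstructed from the Common Values counts: 120 x "<NA>", 38 x "1", 6 x "2".
values = ["<NA>"] * 120 + ["1"] * 38 + ["2"] * 6
cleaned, share = missing_share(values)
print(f"{share:.1%}")  # prints 73.2%
```

With the placeholder treated as missing, only two real categories remain (1 and 2), which changes how the correlation with 회수 should be read.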

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 120
73.2%
1 38
 
23.2%
2 6
 
3.7%

기점 출발시간
Text

Distinct: 140
Distinct (%): 85.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length96
Median length54
Mean length15.871951
Min length5

Characters and Unicode

Total characters2603
Distinct characters54
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique122 ?
Unique (%)74.4%

Sample

1st row06:10, 06:30, 08:00, 12:20, 17:00, 20:10
2nd row11:40, 16:10
3rd row07:50, 14:00, 18:30, 21:30
4th row11:30, 19:10
5th row09:30
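
The 기점 출발시간 (departure times from the origin) values are comma-separated `HH:MM` lists, sometimes with free-text notes mixed in. The sketch below extracts the clock times with a regular expression; `parse_departures` and `TIME_PATTERN` are hypothetical names, and tokens that are not valid clock times are skipped rather than guessed at.

```python
import re
from datetime import time

# One or two hour digits, a colon, two minute digits, not embedded in a longer number.
TIME_PATTERN = re.compile(r"(?<!\d)(\d{1,2}):(\d{2})(?!\d)")

def parse_departures(raw):
    """Extract every valid HH:MM token from a free-text departure field."""
    times = []
    for hour, minute in TIME_PATTERN.findall(raw):
        h, m = int(hour), int(minute)
        if h < 24 and m < 60:  # skip tokens that are not valid clock times
            times.append(time(h, m))
    return times

first_row = "06:10, 06:30, 08:00, 12:20, 17:00, 20:10"
print(len(parse_departures(first_row)))  # prints 6
print(parse_departures("07:00 양평出 / 07:30 양현마을-정배-문호리"))
print(parse_departures("편도운행"))  # prints []
```

For the first sample row the six parsed times agree with its 회수 value of 6, which makes the count of parsed times a cheap cross-check against the number of runs.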
ValueCountFrequency (%)
14:00 10
 
2.4%
18:00 10
 
2.4%
13:00 8
 
2.0%
07:50 8
 
2.0%
11:00 8
 
2.0%
07:00 8
 
2.0%
17:00 8
 
2.0%
18:30 8
 
2.0%
16:00 8
 
2.0%
15:00 7
 
1.7%
Other values (127) 326
79.7%

Most occurring characters

ValueCountFrequency (%)
0 629
24.2%
: 398
15.3%
1 330
12.7%
246
 
9.5%
, 243
 
9.3%
5 111
 
4.3%
3 108
 
4.1%
2 96
 
3.7%
4 86
 
3.3%
7 73
 
2.8%
Other values (44) 283
10.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1593
61.2%
Other Punctuation 645
24.8%
Space Separator 246
 
9.5%
Other Letter 106
 
4.1%
Dash Punctuation 13
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
13.2%
12
 
11.3%
12
 
11.3%
12
 
11.3%
6
 
5.7%
4
 
3.8%
4
 
3.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
Other values (28) 35
33.0%
Decimal Number
ValueCountFrequency (%)
0 629
39.5%
1 330
20.7%
5 111
 
7.0%
3 108
 
6.8%
2 96
 
6.0%
4 86
 
5.4%
7 73
 
4.6%
6 56
 
3.5%
8 55
 
3.5%
9 49
 
3.1%
Other Punctuation
ValueCountFrequency (%)
: 398
61.7%
, 243
37.7%
. 3
 
0.5%
/ 1
 
0.2%
Space Separator
ValueCountFrequency (%)
246
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2497
95.9%
Hangul 105
 
4.0%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
13.3%
12
 
11.4%
12
 
11.4%
12
 
11.4%
6
 
5.7%
4
 
3.8%
4
 
3.8%
3
 
2.9%
2
 
1.9%
2
 
1.9%
Other values (27) 34
32.4%
Common
ValueCountFrequency (%)
0 629
25.2%
: 398
15.9%
1 330
13.2%
246
 
9.9%
, 243
 
9.7%
5 111
 
4.4%
3 108
 
4.3%
2 96
 
3.8%
4 86
 
3.4%
7 73
 
2.9%
Other values (6) 177
 
7.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2497
95.9%
Hangul 105
 
4.0%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 629
25.2%
: 398
15.9%
1 330
13.2%
246
 
9.9%
, 243
 
9.7%
5 111
 
4.4%
3 108
 
4.3%
2 96
 
3.8%
4 86
 
3.4%
7 73
 
2.9%
Other values (6) 177
 
7.1%
Hangul
ValueCountFrequency (%)
14
13.3%
12
 
11.4%
12
 
11.4%
12
 
11.4%
6
 
5.7%
4
 
3.8%
4
 
3.8%
3
 
2.9%
2
 
1.9%
2
 
1.9%
Other values (27) 34
32.4%
CJK
ValueCountFrequency (%)
1
100.0%

종점 출발시간
Text

Distinct: 121
Distinct (%): 73.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length96
Median length56
Mean length13.323171
Min length4

Characters and Unicode

Total characters2185
Distinct characters41
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique110 ?
Unique (%)67.1%

Sample

1st row편도운행
2nd row편도운행
3rd row편도운행
4th row편도운행
5th row편도운행
ValueCountFrequency (%)
편도운행 35
 
9.7%
07:30 7
 
1.9%
11:30 6
 
1.7%
18:50 5
 
1.4%
07:00 5
 
1.4%
08:10 5
 
1.4%
19:00 5
 
1.4%
08:00 5
 
1.4%
18:20 5
 
1.4%
16:20 5
 
1.4%
Other values (146) 276
76.9%

Most occurring characters

ValueCountFrequency (%)
0 386
17.7%
: 318
14.6%
1 286
13.1%
196
9.0%
, 192
8.8%
5 171
7.8%
2 105
 
4.8%
3 84
 
3.8%
4 59
 
2.7%
7 57
 
2.6%
Other values (31) 331
15.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1267
58.0%
Other Punctuation 510
23.3%
Space Separator 196
 
9.0%
Other Letter 187
 
8.6%
Close Punctuation 11
 
0.5%
Open Punctuation 11
 
0.5%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
19.3%
35
18.7%
35
18.7%
35
18.7%
6
 
3.2%
5
 
2.7%
5
 
2.7%
5
 
2.7%
3
 
1.6%
3
 
1.6%
Other values (15) 19
10.2%
Decimal Number
ValueCountFrequency (%)
0 386
30.5%
1 286
22.6%
5 171
13.5%
2 105
 
8.3%
3 84
 
6.6%
4 59
 
4.7%
7 57
 
4.5%
8 57
 
4.5%
9 43
 
3.4%
6 19
 
1.5%
Other Punctuation
ValueCountFrequency (%)
: 318
62.4%
, 192
37.6%
Space Separator
ValueCountFrequency (%)
196
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1998
91.4%
Hangul 187
 
8.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
19.3%
35
18.7%
35
18.7%
35
18.7%
6
 
3.2%
5
 
2.7%
5
 
2.7%
5
 
2.7%
3
 
1.6%
3
 
1.6%
Other values (15) 19
10.2%
Common
ValueCountFrequency (%)
0 386
19.3%
: 318
15.9%
1 286
14.3%
196
9.8%
, 192
9.6%
5 171
8.6%
2 105
 
5.3%
3 84
 
4.2%
4 59
 
3.0%
7 57
 
2.9%
Other values (6) 144
 
7.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1998
91.4%
Hangul 187
 
8.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 386
19.3%
: 318
15.9%
1 286
14.3%
196
9.8%
, 192
9.6%
5 171
8.6%
2 105
 
5.3%
3 84
 
4.2%
4 59
 
3.0%
7 57
 
2.9%
Other values (6) 144
 
7.2%
Hangul
ValueCountFrequency (%)
36
19.3%
35
18.7%
35
18.7%
35
18.7%
6
 
3.2%
5
 
2.7%
5
 
2.7%
5
 
2.7%
3
 
1.6%
3
 
1.6%
Other values (15) 19
10.2%

비고(변경사항 등)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 5
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
Value | Count
<NA> | 151
(토,일,공휴일 미운행) | 8
(토,일,공휴일만 운행) | 2
(토,일,공휴일,방학 중 미운행) | 2
평일 8회, 토,일,공 6회 | 1

Length

Max length18
Median length4
Mean length4.7865854
Min length4

Unique

Unique1 ?
Unique (%)0.6%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

Value | Count | Frequency (%)
<NA> | 151 | 92.1%
(토,일,공휴일 미운행) | 8 | 4.9%
(토,일,공휴일만 운행) | 2 | 1.2%
(토,일,공휴일,방학 중 미운행) | 2 | 1.2%
평일 8회, 토,일,공 6회 | 1 | 0.6%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 151
83.4%
미운행 10
 
5.5%
토,일,공휴일 8
 
4.4%
토,일,공휴일만 2
 
1.1%
운행 2
 
1.1%
토,일,공휴일,방학 2
 
1.1%
2
 
1.1%
평일 1
 
0.6%
8회 1
 
0.6%
토,일,공 1
 
0.6%

기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB
Minimum: 2023-06-15 00:00:00
Maximum: 2023-06-15 00:00:00
Histogram with fixed size bins (bins=1)

Interactions

[Figures: four pairwise interaction plots, presumably for the two numeric variables 순번 and 거리]

Correlations

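 | 순번 | 기점 | 종점 | 거리 | 회수 | 대수 | 비고(변경사항 등)
순번 | 1.000 | 0.747 | 0.946 | 0.621 | 0.000 | 0.617 | 0.767
기점 | 0.747 | 1.000 | 0.920 | 0.420 | 0.000 | 0.595 | 0.855
종점 | 0.946 | 0.920 | 1.000 | 0.923 | 0.868 | 1.000 | 1.000
거리 | 0.621 | 0.420 | 0.923 | 1.000 | 0.585 | 0.482 | 0.967
회수 | 0.000 | 0.000 | 0.868 | 0.585 | 1.000 | 0.816 | 0.743
대수 | 0.617 | 0.595 | 1.000 | 0.482 | 0.816 | 1.000 | 0.000
비고(변경사항 등) | 0.767 | 0.855 | 1.000 | 0.967 | 0.743 | 0.000 | 1.000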
 | 비고(변경사항 등) | 대수 | 기점 | 회수
비고(변경사항 등) | 1.000 | 0.000 | 0.493 | 0.211
대수 | 0.000 | 1.000 | 0.413 | 0.558
기점 | 0.493 | 0.413 | 1.000 | 0.000
회수 | 0.211 | 0.558 | 0.000 | 1.000
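 | 순번 | 거리 | 기점 | 회수 | 대수 | 비고(변경사항 등)
순번 | 1.000 | -0.281 | 0.425 | 0.000 | 0.425 | 0.775
거리 | -0.281 | 1.000 | 0.185 | 0.260 | 0.437 | 0.770
기점 | 0.425 | 0.185 | 1.000 | 0.000 | 0.413 | 0.493
회수 | 0.000 | 0.260 | 0.000 | 1.000 | 0.558 | 0.211
대수 | 0.425 | 0.437 | 0.413 | 0.558 | 1.000 | 0.000
비고(변경사항 등) | 0.775 | 0.770 | 0.493 | 0.211 | 0.000 | 1.000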
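
The High correlation alerts at the top of the report are consistent with applying a cut-off of 0.5 to the absolute values in the last matrix above. The sketch below assumes that threshold (it is not stated anywhere in this report) and lists the variable pairs that exceed it; `high_pairs` is a hypothetical helper.

```python
from itertools import combinations

labels = ["순번", "거리", "기점", "회수", "대수", "비고(변경사항 등)"]
# Values copied from the last correlation matrix in this section.
matrix = [
    [1.000, -0.281, 0.425, 0.000, 0.425, 0.775],
    [-0.281, 1.000, 0.185, 0.260, 0.437, 0.770],
    [0.425, 0.185, 1.000, 0.000, 0.413, 0.493],
    [0.000, 0.260, 0.000, 1.000, 0.558, 0.211],
    [0.425, 0.437, 0.413, 0.558, 1.000, 0.000],
    [0.775, 0.770, 0.493, 0.211, 0.000, 1.000],
]

def high_pairs(labels, matrix, threshold=0.5):
    """Return (a, b, value) for each pair whose |correlation| meets the threshold."""
    return [
        (labels[i], labels[j], matrix[i][j])
        for i, j in combinations(range(len(labels)), 2)
        if abs(matrix[i][j]) >= threshold
    ]

for a, b, value in high_pairs(labels, matrix):
    print(f"{a} ~ {b}: {value:.3f}")
```

With the assumed threshold this yields exactly the three pairs named in the alerts: 순번 with 비고(변경사항 등), 거리 with 비고(변경사항 등), and 회수 with 대수.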

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 순번 | 노선번호 | 기점 | 경유지 | 종점 | 거리 | 회수 | 대수 | 기점 출발시간 | 종점 출발시간 | 비고(변경사항 등) | 기준일자
0 | 1 | [1] | 양평터미널 | 회현,개군,천서리,보통리,대신,후포,오학 | 여주 | 34.6 | 6 | 2 | 06:10, 06:30, 08:00, 12:20, 17:00, 20:10 | 편도운행 | <NA> | 2023-06-15
1 | 2 | [1-1] | 양평터미널 | 회현,개군,천서리,곡수삼거리,옥촌,대신,오학 | (곡수)여주 | 42.4 | 2 | 2 | 11:40, 16:10 | 편도운행 | <NA> | 2023-06-15
2 | 3 | [1-10] | 여주터미널 | 오학,후포,대신,보통리,천서리,개군,회현 | 양평 | 34.3 | 4 | <NA> | 07:50, 14:00, 18:30, 21:30 | 편도운행 | <NA> | 2023-06-15
3 | 4 | [1-11] | 여주터미널 | 오학,대신,곡수삼거리,송촌리,천서리,회현리,양평시장 | 양평 | 42.1 | 2 | <NA> | 11:30, 19:10 | 편도운행 | <NA> | 2023-06-15
4 | 5 | [1-12] | 여주터미널 | 오학,대신,천서리,이포,개군,회현 | 양평 | 38.7 | 1 | <NA> | 09:30 | 편도운행 | <NA> | 2023-06-15
5 | 6 | [1-13] | 여주터미널 | 오학,대신,곡수삼거리,옥현,지평,광탄,용문,백안리,양평병원 | 양평 | 53.3 | 3 | <NA> | 07:30, 13:30, 18:00 | 편도운행 | <NA> | 2023-06-15
6 | 7 | [1-14] | 여주터미널 | 오학,후포,대신,보통리,천서리,개군,회현,포레나A,군민회관 | 양평 | 34.3 | 3 | <NA> | 07:50, 14:00, 21:30 | 편도운행 | <NA> | 2023-06-15
7 | 8 | [1-18] | 대신터미널 | 율촌리 | 도롱리 | 3.3 | 1 | 1 | 07:00 | 07:05 | <NA> | 2023-06-15
8 | 9 | [1-19] | 대신터미널 | 보통2리 | 양촌리 | 4.0 | 1 | <NA> | 07:10 | 07:15 | <NA> | 2023-06-15
9 | 10 | [1-2] | 양평터미널 | 회현,개군,이포,천서리,대신,오학 | (이포) 여주 | 39.0 | 1 | 1 | 09:50 | 편도운행 | <NA> | 2023-06-15
순번노선번호기점경유지종점거리회수대수기점 출발시간종점 출발시간비고(변경사항 등)기준일자
154155[9-1 (벽)]문호리종점명달리 종점명달리19.15106:00, 09:20, 12:05, 14:05, 17:05, 19:0510:00, 12:50, 15:00, 17:50, 19:40<NA>2023-06-15
155156[9-2]문호리종점서후2리 종점서후리(문호리)11.52<NA>16:00, 19:0516:20, 19:25<NA>2023-06-15
156157[9-20]문호리종점서후2리종점서후리(문호리)11.52110:00, 13:0010:20, 13:20<NA>2023-06-15
157158[9-22]문호리종점서후2리종점서후리(문호리)11.51<NA>08:0008:20<NA>2023-06-15
158159[9-3]문호리종점정배리종점정배리(문호리)11.61<NA>17:4017:55<NA>2023-06-15
159160[9-30 (벽)]문호리종점정배리종점정배리(문호리)11.53<NA>08:35, 12:05, 14:4008:50, 12:20, 14:55<NA>2023-06-15
160161[9-6]명달리종점양평터미널양평48.81(편도)106:35명달-문호,양수역-양평편도운행<NA>2023-06-15
161162[9-7]문호리종점양평터미널(고현)양평(고현)38.01(편도)<NA>19:45 문호-양수두물,국수역,청계,정자골,고현-양평편도운행<NA>2023-06-15
162163[9-8]양평터미널(중미산양현마을)문호리종점문호리25.31(편도)<NA>07:00 양평出 / 07:30 양현마을-정배-문호리편도운행<NA>2023-06-15
163164[G9311]용문터미널용문구터미널, 양평터미널, 양평시장, 양평군청, 아신역입구, 아신대, 양평전자고앞, 양수리지석묘(토,일,공휴일 경유하지않을수 있음)잠실환승센터59.58(6)206:00, 07:00, 10:00, 11:00, 14:00, 15:00, 18:00, 19:0007:30, 08:30, 11:30, 12:30, 15:30, 16:30, 19:30, (20:30)평일 8회, 토,일,공 6회2023-06-15