Overview

Dataset statistics

Number of variables4
Number of observations27
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory996.0 B
Average record size in memory36.9 B

Variable types

Text4

Dataset

Description대구광역시에서 운행 중인 시내버스의 버스회사 목록 파일입니다. 회사명, 주소, 전화번호, 운행노선 등을 제공하고 있습니다.
Author대구광역시
URLhttps://www.data.go.kr/data/15060933/fileData.do

Alerts

버스회사명 has unique valuesUnique
전화번호 has unique valuesUnique
운행노선 has unique valuesUnique

Reproduction

Analysis started2023-12-13 00:59:24.959004
Analysis finished2023-12-13 00:59:25.275350
Duration0.32 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

버스회사명
Text

UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-13T09:59:25.388258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length7.0740741
Min length5

Characters and Unicode

Total characters191
Distinct characters47
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row(주)세진교통
2nd row경북교통(주)
3rd row동명교통(주)
4th row신흥버스(주)
5th row광남자동차(주)
ValueCountFrequency (%)
주)세진교통 1
 
3.7%
주)달구벌버스 1
 
3.7%
세운버스(주 1
 
3.7%
주)신일여객 1
 
3.7%
경상버스(주 1
 
3.7%
남도버스(주 1
 
3.7%
대덕교통(주 1
 
3.7%
신진자동차(주 1
 
3.7%
현대교통(주 1
 
3.7%
주)관음교통 1
 
3.7%
Other values (17) 17
63.0%
2023-12-13T09:59:25.640808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27
14.1%
( 26
13.6%
) 26
13.6%
14
 
7.3%
14
 
7.3%
7
 
3.7%
7
 
3.7%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (37) 58
30.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 138
72.3%
Open Punctuation 26
 
13.6%
Close Punctuation 26
 
13.6%
Other Symbol 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
27
19.6%
14
 
10.1%
14
 
10.1%
7
 
5.1%
7
 
5.1%
4
 
2.9%
4
 
2.9%
4
 
2.9%
4
 
2.9%
3
 
2.2%
Other values (34) 50
36.2%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 139
72.8%
Common 52
 
27.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
19.4%
14
 
10.1%
14
 
10.1%
7
 
5.0%
7
 
5.0%
4
 
2.9%
4
 
2.9%
4
 
2.9%
4
 
2.9%
3
 
2.2%
Other values (35) 51
36.7%
Common
ValueCountFrequency (%)
( 26
50.0%
) 26
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 138
72.3%
ASCII 52
 
27.2%
None 1
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
27
19.6%
14
 
10.1%
14
 
10.1%
7
 
5.1%
7
 
5.1%
4
 
2.9%
4
 
2.9%
4
 
2.9%
4
 
2.9%
3
 
2.2%
Other values (34) 50
36.2%
ASCII
ValueCountFrequency (%)
( 26
50.0%
) 26
50.0%
None
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct24
Distinct (%)88.9%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-13T09:59:25.833642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length22
Mean length19.925926
Min length16

Characters and Unicode

Total characters538
Distinct characters77
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)77.8%

Sample

1st row대구광역시 수성구 만촌동 794-1
2nd row대구광역시 동구 동호동 105-4
3rd row대구광역시 북구 읍내동 514
4th row대구광역시 달서구 갈산동 358-27
5th row대구시 동구 봉무동 126-2
ValueCountFrequency (%)
대구광역시 26
22.0%
수성구 6
 
5.1%
동구 6
 
5.1%
달성군 5
 
4.2%
달서구 4
 
3.4%
북구 4
 
3.4%
2길 2
 
1.7%
신매동 2
 
1.7%
45 2
 
1.7%
동호동 2
 
1.7%
Other values (53) 59
50.0%
2023-12-13T09:59:26.315459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
91
16.9%
49
 
9.1%
34
 
6.3%
29
 
5.4%
27
 
5.0%
26
 
4.8%
26
 
4.8%
1 23
 
4.3%
2 14
 
2.6%
4 14
 
2.6%
Other values (67) 205
38.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 332
61.7%
Decimal Number 96
 
17.8%
Space Separator 91
 
16.9%
Dash Punctuation 13
 
2.4%
Open Punctuation 3
 
0.6%
Close Punctuation 3
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
14.8%
34
 
10.2%
29
 
8.7%
27
 
8.1%
26
 
7.8%
26
 
7.8%
12
 
3.6%
11
 
3.3%
8
 
2.4%
8
 
2.4%
Other values (53) 102
30.7%
Decimal Number
ValueCountFrequency (%)
1 23
24.0%
2 14
14.6%
4 14
14.6%
5 9
 
9.4%
7 7
 
7.3%
9 7
 
7.3%
0 7
 
7.3%
3 6
 
6.2%
6 6
 
6.2%
8 3
 
3.1%
Space Separator
ValueCountFrequency (%)
91
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 332
61.7%
Common 206
38.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
14.8%
34
 
10.2%
29
 
8.7%
27
 
8.1%
26
 
7.8%
26
 
7.8%
12
 
3.6%
11
 
3.3%
8
 
2.4%
8
 
2.4%
Other values (53) 102
30.7%
Common
ValueCountFrequency (%)
91
44.2%
1 23
 
11.2%
2 14
 
6.8%
4 14
 
6.8%
- 13
 
6.3%
5 9
 
4.4%
7 7
 
3.4%
9 7
 
3.4%
0 7
 
3.4%
3 6
 
2.9%
Other values (4) 15
 
7.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 332
61.7%
ASCII 206
38.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
91
44.2%
1 23
 
11.2%
2 14
 
6.8%
4 14
 
6.8%
- 13
 
6.3%
5 9
 
4.4%
7 7
 
3.4%
9 7
 
3.4%
0 7
 
3.4%
3 6
 
2.9%
Other values (4) 15
 
7.3%
Hangul
ValueCountFrequency (%)
49
14.8%
34
 
10.2%
29
 
8.7%
27
 
8.1%
26
 
7.8%
26
 
7.8%
12
 
3.6%
11
 
3.3%
8
 
2.4%
8
 
2.4%
Other values (53) 102
30.7%

전화번호
Text

UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-13T09:59:26.481915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.037037
Min length12

Characters and Unicode

Total characters325
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row053-812-2812
2nd row053-962-4714
3rd row053-353-1374
4th row053-583-1101
5th row053-985-5006
ValueCountFrequency (%)
053-812-2812 1
 
3.6%
053-962-4714 1
 
3.6%
053-586-5540 1
 
3.6%
053-963-3341 1
 
3.6%
053-552-5831 1
 
3.6%
053-633-1224 1
 
3.6%
961-7454 1
 
3.6%
053 1
 
3.6%
053-784-2902 1
 
3.6%
053-781-7005 1
 
3.6%
Other values (18) 18
64.3%
2023-12-13T09:59:26.736989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 54
16.6%
- 54
16.6%
0 48
14.8%
5 47
14.5%
1 29
8.9%
2 21
 
6.5%
8 19
 
5.8%
9 14
 
4.3%
7 14
 
4.3%
6 13
 
4.0%
Other values (2) 12
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 270
83.1%
Dash Punctuation 54
 
16.6%
Space Separator 1
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 54
20.0%
0 48
17.8%
5 47
17.4%
1 29
10.7%
2 21
 
7.8%
8 19
 
7.0%
9 14
 
5.2%
7 14
 
5.2%
6 13
 
4.8%
4 11
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 54
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 325
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 54
16.6%
- 54
16.6%
0 48
14.8%
5 47
14.5%
1 29
8.9%
2 21
 
6.5%
8 19
 
5.8%
9 14
 
4.3%
7 14
 
4.3%
6 13
 
4.0%
Other values (2) 12
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 325
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 54
16.6%
- 54
16.6%
0 48
14.8%
5 47
14.5%
1 29
8.9%
2 21
 
6.5%
8 19
 
5.8%
9 14
 
4.3%
7 14
 
4.3%
6 13
 
4.0%
Other values (2) 12
 
3.7%

운행노선
Text

UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-13T09:59:26.979739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length101
Median length49
Mean length46.62963
Min length15

Characters and Unicode

Total characters1259
Distinct characters32
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row403+[공동]564+[공동]618+[공동]724+[공동]814+[공동]8140+[공동]939+[공동]급행1+[공동]급행5+[공동]순환3-1+수성3-1
2nd row805+[공동]651+[공동]814+[공동]937+[공동]급행5+[공동]동구2+[공동]순환3+수성3
3rd row730+[공동]724+[공동]7250+[공동]726+[공동]급행3+[공동]급행7+[공동]북구4+[공동]팔공3+급행9+급행9-1+칠곡2+칠곡3+칠곡4+칠곡5
4th row[공동]425+[공동]503+[공동]518+[공동]7250+[공동]급행5+[공동]급행7+[공동]급행8+[공동]달서1+서구1-1
5th row101+101-1+[공동]401+[공동]8140+[공동]급행6+[공동]팔공1+북구2+팔공2
ValueCountFrequency (%)
403+[공동]564+[공동]618+[공동]724+[공동]814+[공동]8140+[공동]939+[공동]급행1+[공동]급행5+[공동]순환3-1+수성3-1 1
 
3.7%
300+518-1+[공동]156+[공동]518+[공동]8140+동구4+동구7+북구3 1
 
3.7%
공동]156+[공동]618+[공동]급행6+[공동]급행8+[공동]급행8-1+[공동]달서5+달성1 1
 
3.7%
808+[공동]동구2+동구6 1
 
3.7%
323+323-1+[공동]649+[공동]719+[공동]840+동구1 1
 
3.7%
공동]653+[공동]706+[공동]달서5+달서4+달서4-1 1
 
3.7%
공동]650+[공동]708+[공동]836+동구1-1 1
 
3.7%
410+[공동]401+[공동]급행3+[공동]순환3+[공동]순환3-1+수성1+수성1-1 1
 
3.7%
410-1+600+[공동]204+[공동]401+[공동]524+[공동]급행4+[공동]급행8+[공동]급행8-1+[공동]달서1+남구1+남구1-1+달성2+달성3+달성5+달성6+달성7+수성4 1
 
3.7%
306+[공동]425+[공동]719+동구3 1
 
3.7%
Other values (17) 17
63.0%
2023-12-13T09:59:27.340408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
+ 173
13.7%
126
 
10.0%
118
 
9.4%
[ 114
 
9.1%
] 114
 
9.1%
1 75
 
6.0%
4 52
 
4.1%
0 50
 
4.0%
5 49
 
3.9%
3 45
 
3.6%
Other values (22) 343
27.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 435
34.6%
Other Letter 402
31.9%
Math Symbol 173
 
13.7%
Open Punctuation 114
 
9.1%
Close Punctuation 114
 
9.1%
Dash Punctuation 21
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
126
31.3%
118
29.4%
25
 
6.2%
25
 
6.2%
21
 
5.2%
16
 
4.0%
13
 
3.2%
13
 
3.2%
7
 
1.7%
7
 
1.7%
Other values (8) 31
 
7.7%
Decimal Number
ValueCountFrequency (%)
1 75
17.2%
4 52
12.0%
0 50
11.5%
5 49
11.3%
3 45
10.3%
2 41
9.4%
6 38
8.7%
8 34
7.8%
7 26
 
6.0%
9 25
 
5.7%
Math Symbol
ValueCountFrequency (%)
+ 173
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 114
100.0%
Close Punctuation
ValueCountFrequency (%)
] 114
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 857
68.1%
Hangul 402
31.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
126
31.3%
118
29.4%
25
 
6.2%
25
 
6.2%
21
 
5.2%
16
 
4.0%
13
 
3.2%
13
 
3.2%
7
 
1.7%
7
 
1.7%
Other values (8) 31
 
7.7%
Common
ValueCountFrequency (%)
+ 173
20.2%
[ 114
13.3%
] 114
13.3%
1 75
8.8%
4 52
 
6.1%
0 50
 
5.8%
5 49
 
5.7%
3 45
 
5.3%
2 41
 
4.8%
6 38
 
4.4%
Other values (4) 106
12.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 857
68.1%
Hangul 402
31.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
+ 173
20.2%
[ 114
13.3%
] 114
13.3%
1 75
8.8%
4 52
 
6.1%
0 50
 
5.8%
5 49
 
5.7%
3 45
 
5.3%
2 41
 
4.8%
6 38
 
4.4%
Other values (4) 106
12.4%
Hangul
ValueCountFrequency (%)
126
31.3%
118
29.4%
25
 
6.2%
25
 
6.2%
21
 
5.2%
16
 
4.0%
13
 
3.2%
13
 
3.2%
7
 
1.7%
7
 
1.7%
Other values (8) 31
 
7.7%

Correlations

2023-12-13T09:59:27.425826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
버스회사명주소전화번호운행노선
버스회사명1.0001.0001.0001.000
주소1.0001.0001.0001.000
전화번호1.0001.0001.0001.000
운행노선1.0001.0001.0001.000

Missing values

2023-12-13T09:59:25.163103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T09:59:25.243905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

버스회사명주소전화번호운행노선
0(주)세진교통대구광역시 수성구 만촌동 794-1053-812-2812403+[공동]564+[공동]618+[공동]724+[공동]814+[공동]8140+[공동]939+[공동]급행1+[공동]급행5+[공동]순환3-1+수성3-1
1경북교통(주)대구광역시 동구 동호동 105-4053-962-4714805+[공동]651+[공동]814+[공동]937+[공동]급행5+[공동]동구2+[공동]순환3+수성3
2동명교통(주)대구광역시 북구 읍내동 514053-353-1374730+[공동]724+[공동]7250+[공동]726+[공동]급행3+[공동]급행7+[공동]북구4+[공동]팔공3+급행9+급행9-1+칠곡2+칠곡3+칠곡4+칠곡5
3신흥버스(주)대구광역시 달서구 갈산동 358-27053-583-1101[공동]425+[공동]503+[공동]518+[공동]7250+[공동]급행5+[공동]급행7+[공동]급행8+[공동]달서1+서구1-1
4광남자동차(주)대구시 동구 봉무동 126-2053-985-5006101+101-1+[공동]401+[공동]8140+[공동]급행6+[공동]팔공1+북구2+팔공2
5(주)세왕교통대구광역시 달성군 가창면 삼산리 360053-763-2201304+405+413+[공동]240+[공동]449+[공동]8140
6성보교통(주)대구광역시 북구 검단동 1393-46053-381-5961[공동]356+[공동]623+[공동]8140+[공동]급행2+가창2+북구1+순환2+순환2-1
7한일운수(주)대구광역시 수성구 신매동 10 - 13053-801-0941909+[공동]449+[공동]609+[공동]840+동구4-1
8우진교통(주)대구광역시 달성군 하빈면 하빈남로 414053-522-9301[공동]524+[공동]527+[공동]564+[공동]655+[공동]급행1+[공동]급행7+성서1+성서1-1+성서3
9삼천리버스(주)대구광역시 수성구 신매동 79-2053-818-3071309+349+[공동]509
버스회사명주소전화번호운행노선
17세한여객(주)대구광역시 달성군 유가읍 테크노중앙대로 2길 45053-553-1301[공동]356+[공동]623+[공동]655+[공동]급행4+[공동]급행8+[공동]급행8-1
18(주)관음교통대구광역시 동구 방촌동 1114-2053-982-9001306+[공동]425+[공동]719+동구3
19현대교통(주)대구광역시 달성군 유가읍 테크노중앙대로 2길 45053-781-7005410-1+600+[공동]204+[공동]401+[공동]524+[공동]급행4+[공동]급행8+[공동]급행8-1+[공동]달서1+남구1+남구1-1+달성2+달성3+달성5+달성6+달성7+수성4
20신진자동차(주)대구광역시 수성구 범물동 1290053-784-2902410+[공동]401+[공동]급행3+[공동]순환3+[공동]순환3-1+수성1+수성1-1
21대덕교통(주)대구광역시 동구 경안로700(동호동)053- 961-7454[공동]650+[공동]708+[공동]836+동구1-1
22남도버스(주)대구광역시 달서구 대곡동 411053-633-1224[공동]653+[공동]706+[공동]달서5+달서4+달서4-1
23경상버스(주)대구광역시 서구 이현동 45-52053-552-5831323+323-1+[공동]649+[공동]719+[공동]840+동구1
24(주)신일여객대구광역시 동구 대림로2길 8 (괴전동)053-963-3341808+[공동]동구2+동구6
25세운버스(주)대구광역시 달서구 달서대로 117053-586-5540[공동]156+[공동]618+[공동]급행6+[공동]급행8+[공동]급행8-1+[공동]달서5+달성1
26군위교통㈜대구광역시 군위군 군위읍 중앙길 12054-383-00231+2+3+4+5+6+7+8+9+10+11+12