Overview

Dataset statistics

Number of variables5
Number of observations149
Missing cells1
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.9 KiB
Average record size in memory40.9 B

Variable types

Categorical2
Text3

Dataset

Description경상남도 전세버스 운송사업 업체 현황으로, 전세버스 운송사업의 관할관청, 업체명, 주소, 연락처, 구분에 관한 정보를 제공합니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3083295

Alerts

구분 is highly imbalanced (59.6%)Imbalance

Reproduction

Analysis started2023-12-11 00:14:36.524285
Analysis finished2023-12-11 00:14:36.928465
Duration0.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

관할관청
Categorical

Distinct19
Distinct (%)12.8%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
창원시
30 
김해시
19 
거제시
17 
양산시
15 
진주시
14 
Other values (14)
54 

Length

Max length4
Median length3
Mean length3.0469799
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row창원시
2nd row창원시
3rd row창원시
4th row창원시
5th row창원시

Common Values

ValueCountFrequency (%)
창원시 30
20.1%
김해시 19
12.8%
거제시 17
11.4%
양산시 15
10.1%
진주시 14
9.4%
함안군 9
 
6.0%
사천시 6
 
4.0%
남해군 5
 
3.4%
통영시 5
 
3.4%
거창군 4
 
2.7%
Other values (9) 25
16.8%

Length

2023-12-11T09:14:37.007956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
창원시 30
20.1%
김해시 19
12.8%
거제시 17
11.4%
진주시 16
10.7%
양산시 15
10.1%
함안군 9
 
6.0%
사천시 6
 
4.0%
남해군 5
 
3.4%
통영시 5
 
3.4%
밀양시 4
 
2.7%
Other values (8) 23
15.4%
Distinct141
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-11T09:14:37.261029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length7.1812081
Min length3

Characters and Unicode

Total characters1070
Distinct characters147
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique134 ?
Unique (%)89.9%

Sample

1st row성운고속
2nd row(주)동원고속관광
3rd row(주)하나로고속관광
4th row(주)성산고속관광
5th row(주)현대고속관광
ValueCountFrequency (%)
경남전세버스협동조합 3
 
2.0%
명신고속관광㈜ 2
 
1.3%
㈜남양관광 2
 
1.3%
㈜명신관광 2
 
1.3%
그린고속관광㈜ 2
 
1.3%
합)양산코리아고속관광 2
 
1.3%
㈜영진고속관광 2
 
1.3%
삼성고속관광㈜ 1
 
0.7%
대금투어 1
 
0.7%
㈜싱싱고속관광 1
 
0.7%
Other values (132) 132
88.0%
2023-12-11T09:14:37.659235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
108
 
10.1%
105
 
9.8%
91
 
8.5%
72
 
6.7%
72
 
6.7%
27
 
2.5%
27
 
2.5%
( 26
 
2.4%
) 26
 
2.4%
22
 
2.1%
Other values (137) 494
46.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 922
86.2%
Other Symbol 91
 
8.5%
Open Punctuation 26
 
2.4%
Close Punctuation 26
 
2.4%
Uppercase Letter 3
 
0.3%
Space Separator 1
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
108
 
11.7%
105
 
11.4%
72
 
7.8%
72
 
7.8%
27
 
2.9%
27
 
2.9%
22
 
2.4%
22
 
2.4%
21
 
2.3%
20
 
2.2%
Other values (129) 426
46.2%
Uppercase Letter
ValueCountFrequency (%)
V 1
33.3%
I 1
33.3%
P 1
33.3%
Other Symbol
ValueCountFrequency (%)
91
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1013
94.7%
Common 54
 
5.0%
Latin 3
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
108
 
10.7%
105
 
10.4%
91
 
9.0%
72
 
7.1%
72
 
7.1%
27
 
2.7%
27
 
2.7%
22
 
2.2%
22
 
2.2%
21
 
2.1%
Other values (130) 446
44.0%
Common
ValueCountFrequency (%)
( 26
48.1%
) 26
48.1%
1
 
1.9%
- 1
 
1.9%
Latin
ValueCountFrequency (%)
V 1
33.3%
I 1
33.3%
P 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 922
86.2%
None 91
 
8.5%
ASCII 57
 
5.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
108
 
11.7%
105
 
11.4%
72
 
7.8%
72
 
7.8%
27
 
2.9%
27
 
2.9%
22
 
2.4%
22
 
2.4%
21
 
2.3%
20
 
2.2%
Other values (129) 426
46.2%
None
ValueCountFrequency (%)
91
100.0%
ASCII
ValueCountFrequency (%)
( 26
45.6%
) 26
45.6%
1
 
1.8%
- 1
 
1.8%
V 1
 
1.8%
I 1
 
1.8%
P 1
 
1.8%
Distinct139
Distinct (%)93.3%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-11T09:14:37.940634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length34
Mean length18.818792
Min length10

Characters and Unicode

Total characters2804
Distinct characters195
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique132 ?
Unique (%)88.6%

Sample

1st row창원시 의창구 우곡로217번길 14-1
2nd row창원시 의창구 의창대로 317번길 9
3rd row창원시 의창구 용지로 151
4th row창원시 의창구 우록로217번길 24
5th row창원시 의창구 도계로 13
ValueCountFrequency (%)
창원시 29
 
5.0%
김해시 19
 
3.2%
거제시 17
 
2.9%
진주시 16
 
2.7%
양산시 15
 
2.6%
의창구 9
 
1.5%
성산구 9
 
1.5%
함안군 9
 
1.5%
마산회원구 8
 
1.4%
중앙로 8
 
1.4%
Other values (337) 446
76.2%
2023-12-11T09:14:38.361961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
440
 
15.7%
1 140
 
5.0%
132
 
4.7%
112
 
4.0%
2 76
 
2.7%
3 69
 
2.5%
63
 
2.2%
57
 
2.0%
57
 
2.0%
4 54
 
1.9%
Other values (185) 1604
57.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1625
58.0%
Decimal Number 577
 
20.6%
Space Separator 440
 
15.7%
Other Punctuation 54
 
1.9%
Open Punctuation 44
 
1.6%
Close Punctuation 43
 
1.5%
Dash Punctuation 16
 
0.6%
Uppercase Letter 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
132
 
8.1%
112
 
6.9%
63
 
3.9%
57
 
3.5%
57
 
3.5%
48
 
3.0%
41
 
2.5%
39
 
2.4%
38
 
2.3%
37
 
2.3%
Other values (166) 1001
61.6%
Decimal Number
ValueCountFrequency (%)
1 140
24.3%
2 76
13.2%
3 69
12.0%
4 54
 
9.4%
0 53
 
9.2%
5 43
 
7.5%
9 42
 
7.3%
7 35
 
6.1%
6 34
 
5.9%
8 31
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
K 2
40.0%
T 1
20.0%
B 1
20.0%
S 1
20.0%
Space Separator
ValueCountFrequency (%)
440
100.0%
Other Punctuation
ValueCountFrequency (%)
, 54
100.0%
Open Punctuation
ValueCountFrequency (%)
( 44
100.0%
Close Punctuation
ValueCountFrequency (%)
) 43
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1625
58.0%
Common 1174
41.9%
Latin 5
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
132
 
8.1%
112
 
6.9%
63
 
3.9%
57
 
3.5%
57
 
3.5%
48
 
3.0%
41
 
2.5%
39
 
2.4%
38
 
2.3%
37
 
2.3%
Other values (166) 1001
61.6%
Common
ValueCountFrequency (%)
440
37.5%
1 140
 
11.9%
2 76
 
6.5%
3 69
 
5.9%
4 54
 
4.6%
, 54
 
4.6%
0 53
 
4.5%
( 44
 
3.7%
5 43
 
3.7%
) 43
 
3.7%
Other values (5) 158
 
13.5%
Latin
ValueCountFrequency (%)
K 2
40.0%
T 1
20.0%
B 1
20.0%
S 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1625
58.0%
ASCII 1179
42.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
440
37.3%
1 140
 
11.9%
2 76
 
6.4%
3 69
 
5.9%
4 54
 
4.6%
, 54
 
4.6%
0 53
 
4.5%
( 44
 
3.7%
5 43
 
3.6%
) 43
 
3.6%
Other values (9) 163
 
13.8%
Hangul
ValueCountFrequency (%)
132
 
8.1%
112
 
6.9%
63
 
3.9%
57
 
3.5%
57
 
3.5%
48
 
3.0%
41
 
2.5%
39
 
2.4%
38
 
2.3%
37
 
2.3%
Other values (166) 1001
61.6%
Distinct141
Distinct (%)95.3%
Missing1
Missing (%)0.7%
Memory size1.3 KiB
2023-12-11T09:14:38.627131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1776
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)91.2%

Sample

1st row055-237-3050
2nd row055-296-2445
3rd row055-263-3111
4th row055-288-8207
5th row055-265-4500
ValueCountFrequency (%)
055-688-1231 3
 
2.0%
055-742-5555 2
 
1.4%
055-299-1234 2
 
1.4%
055-388-0300 2
 
1.4%
055-237-3050 2
 
1.4%
055-688-6188 2
 
1.4%
055-374-1100 1
 
0.7%
055-586-4513 1
 
0.7%
055-385-6700 1
 
0.7%
055-380-1048 1
 
0.7%
Other values (131) 131
88.5%
2023-12-11T09:14:38.972031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 400
22.5%
- 296
16.7%
0 257
14.5%
3 154
 
8.7%
2 131
 
7.4%
6 115
 
6.5%
1 112
 
6.3%
4 94
 
5.3%
8 84
 
4.7%
7 81
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1480
83.3%
Dash Punctuation 296
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 400
27.0%
0 257
17.4%
3 154
 
10.4%
2 131
 
8.9%
6 115
 
7.8%
1 112
 
7.6%
4 94
 
6.4%
8 84
 
5.7%
7 81
 
5.5%
9 52
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 296
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1776
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 400
22.5%
- 296
16.7%
0 257
14.5%
3 154
 
8.7%
2 131
 
7.4%
6 115
 
6.5%
1 112
 
6.3%
4 94
 
5.3%
8 84
 
4.7%
7 81
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1776
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 400
22.5%
- 296
16.7%
0 257
14.5%
3 154
 
8.7%
2 131
 
7.4%
6 115
 
6.5%
1 112
 
6.3%
4 94
 
5.3%
8 84
 
4.7%
7 81
 
4.6%

구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
주사무소
137 
영업소
 
12

Length

Max length4
Median length4
Mean length3.9194631
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주사무소
2nd row주사무소
3rd row주사무소
4th row주사무소
5th row주사무소

Common Values

ValueCountFrequency (%)
주사무소 137
91.9%
영업소 12
 
8.1%

Length

2023-12-11T09:14:39.115935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:14:39.252789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주사무소 137
91.9%
영업소 12
 
8.1%

Correlations

2023-12-11T09:14:39.328784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관할관청구분
관할관청1.0000.475
구분0.4751.000
2023-12-11T09:14:39.419134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관할관청구분
관할관청1.0000.397
구분0.3971.000
2023-12-11T09:14:39.495165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관할관청구분
관할관청1.0000.397
구분0.3971.000

Missing values

2023-12-11T09:14:36.798882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:14:36.887063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

관할관청업체명주 소연락처구분
0창원시성운고속창원시 의창구 우곡로217번길 14-1055-237-3050주사무소
1창원시(주)동원고속관광창원시 의창구 의창대로 317번길 9055-296-2445주사무소
2창원시(주)하나로고속관광창원시 의창구 용지로 151055-263-3111주사무소
3창원시(주)성산고속관광창원시 의창구 우록로217번길 24055-288-8207주사무소
4창원시(주)현대고속관광창원시 의창구 도계로 13055-265-4500주사무소
5창원시㈜동창원투어창원시 의창구 동읍 동읍로 112055-292-6662주사무소
6창원시(주)다모아투어창원시 의창구 용지로 161055-266-1155주사무소
7창원시(주)경청고속투어창원시 의창구 신사로 58 105호055-267-2525주사무소
8창원시(주)성운고속관광창원시 의창구 우곡로217번길 14-1055-237-3050주사무소
9창원시㈜명신관광창원시 성산구 상남로 37, B 101호(상남동, 덕산베스트텔)055-238-5040주사무소
관할관청업체명주 소연락처구분
139산청군신동아관광(협)산청군 신안면 지리산대로 3490055-973-6376주사무소
140함양군(주)뉴-신흥관광함양군 함양읍 용평3길 1, 2층055-963-9276주사무소
141함양군명신고속관광㈜함양군 함양읍 용평중앙길 32055-962-3377영업소
142거창군(주)거창관광거창군 거창읍 중앙로1길 62055-944-7170주사무소
143거창군누리고속관광㈜거창군거창읍강남로236055-945-0630주사무소
144거창군거창시민관광㈜거창군거창읍중앙로100-1055-945-1310주사무소
145거창군명신고속관광㈜거창군 거창읍 강변로 155055-944-7111주사무소
146합천군(주)해인고속관광합천군 합천읍 대야로 883055-931-4477주사무소
147합천군금화고속관광합천군 삼가면 삼가로 347055-934-2006주사무소
148합천군합천새천년관광㈜합천군 옥산로 102 2층(노블빌딩)055-931-1212주사무소