Overview

Dataset statistics

Number of variables5
Number of observations141
Missing cells4
Missing cells (%)0.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.6 KiB
Average record size in memory40.9 B

Variable types

Categorical2
Text3

Dataset

Description경상남도 전세버스 운송사업 업체 현황으로, 전세버스 운송사업의 관할관청, 업체명, 주소, 연락처, 구분에 관한 정보를 제공합니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3083295

Alerts

구분 is highly imbalanced (53.3%)Imbalance
연락처 has 4 (2.8%) missing valuesMissing

Reproduction

Analysis started2023-12-11 00:14:31.678326
Analysis finished2023-12-11 00:14:32.055994
Duration0.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

관할관청
Categorical

Distinct20
Distinct (%)14.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
창원시
31 
김해시
16 
양산시
15 
거제시
15 
진주시
13 
Other values (15)
51 

Length

Max length4
Median length3
Mean length3.0425532
Min length3

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row창원시
2nd row창원시
3rd row창원시
4th row창원시
5th row창원시

Common Values

ValueCountFrequency (%)
창원시 31
22.0%
김해시 16
11.3%
양산시 15
10.6%
거제시 15
10.6%
진주시 13
9.2%
함안군 8
 
5.7%
통영시 5
 
3.5%
밀양시 4
 
2.8%
사천시 4
 
2.8%
거창군 4
 
2.8%
Other values (10) 26
18.4%

Length

2023-12-11T09:14:32.136947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
창원시 31
22.0%
김해시 16
11.3%
진주시 16
11.3%
양산시 15
10.6%
거제시 15
10.6%
함안군 8
 
5.7%
통영시 5
 
3.5%
남해군 4
 
2.8%
거창군 4
 
2.8%
사천시 4
 
2.8%
Other values (8) 23
16.3%
Distinct137
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-11T09:14:32.312286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length16
Mean length7.893617
Min length3

Characters and Unicode

Total characters1113
Distinct characters143
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique133 ?
Unique (%)94.3%

Sample

1st row㈜성산고속관광
2nd row㈜현대고속관광
3rd row㈜동창원투어
4th row성운고속관광㈜
5th row㈜하나로고속관광
ValueCountFrequency (%)
㈜영진고속관광 3
 
2.0%
남해영업소 3
 
2.0%
금화고속관광 2
 
1.4%
㈜무학항공여행사 2
 
1.4%
㈜세원고속관광 2
 
1.4%
㈜남양관광 2
 
1.4%
주)싱싱고속관광 1
 
0.7%
진성관광버스(협 1
 
0.7%
주)한솔고속관광 1
 
0.7%
만수산고속관광(주 1
 
0.7%
Other values (130) 130
87.8%
2023-12-11T09:14:32.673688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
102
 
9.2%
101
 
9.1%
98
 
8.8%
70
 
6.3%
69
 
6.2%
( 28
 
2.5%
) 28
 
2.5%
26
 
2.3%
26
 
2.3%
25
 
2.2%
Other values (133) 540
48.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 948
85.2%
Other Symbol 101
 
9.1%
Open Punctuation 28
 
2.5%
Close Punctuation 28
 
2.5%
Space Separator 7
 
0.6%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
102
 
10.8%
98
 
10.3%
70
 
7.4%
69
 
7.3%
26
 
2.7%
26
 
2.7%
25
 
2.6%
23
 
2.4%
22
 
2.3%
20
 
2.1%
Other values (128) 467
49.3%
Other Symbol
ValueCountFrequency (%)
101
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1049
94.2%
Common 64
 
5.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
102
 
9.7%
101
 
9.6%
98
 
9.3%
70
 
6.7%
69
 
6.6%
26
 
2.5%
26
 
2.5%
25
 
2.4%
23
 
2.2%
22
 
2.1%
Other values (129) 487
46.4%
Common
ValueCountFrequency (%)
( 28
43.8%
) 28
43.8%
7
 
10.9%
- 1
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 948
85.2%
None 101
 
9.1%
ASCII 64
 
5.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
102
 
10.8%
98
 
10.3%
70
 
7.4%
69
 
7.3%
26
 
2.7%
26
 
2.7%
25
 
2.6%
23
 
2.4%
22
 
2.3%
20
 
2.1%
Other values (128) 467
49.3%
None
ValueCountFrequency (%)
101
100.0%
ASCII
ValueCountFrequency (%)
( 28
43.8%
) 28
43.8%
7
 
10.9%
- 1
 
1.6%

주소
Text

Distinct133
Distinct (%)94.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-11T09:14:32.997164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length35
Mean length21.283688
Min length11

Characters and Unicode

Total characters3001
Distinct characters201
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique127 ?
Unique (%)90.1%

Sample

1st row창원시 의창구 우곡로217번길 24, 모비딕빌딩 401호(명서동)
2nd row창원시 의창구 평산로159번길 5,2층(중동)
3rd row창원시 의창구 동읍 동읍로 112
4th row창원시 성산구 용지로169번길 15, 레이크펄스 502호
5th row창원시 성산구 원이대로 581, 3층 (용호동,창원시노동복지회관)
ValueCountFrequency (%)
창원시 31
 
5.1%
성산구 16
 
2.6%
진주시 16
 
2.6%
양산시 15
 
2.5%
김해시 15
 
2.5%
거제시 15
 
2.5%
마산회원구 9
 
1.5%
2층 9
 
1.5%
함안군 8
 
1.3%
1층 6
 
1.0%
Other values (348) 472
77.1%
2023-12-11T09:14:33.490837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
471
 
15.7%
1 154
 
5.1%
123
 
4.1%
107
 
3.6%
2 99
 
3.3%
92
 
3.1%
, 70
 
2.3%
( 69
 
2.3%
) 68
 
2.3%
67
 
2.2%
Other values (191) 1681
56.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1689
56.3%
Decimal Number 607
 
20.2%
Space Separator 471
 
15.7%
Other Punctuation 70
 
2.3%
Open Punctuation 69
 
2.3%
Close Punctuation 68
 
2.3%
Dash Punctuation 25
 
0.8%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
123
 
7.3%
107
 
6.3%
92
 
5.4%
67
 
4.0%
51
 
3.0%
51
 
3.0%
40
 
2.4%
40
 
2.4%
39
 
2.3%
37
 
2.2%
Other values (174) 1042
61.7%
Decimal Number
ValueCountFrequency (%)
1 154
25.4%
2 99
16.3%
3 66
10.9%
0 54
 
8.9%
5 50
 
8.2%
7 42
 
6.9%
4 41
 
6.8%
6 38
 
6.3%
9 38
 
6.3%
8 25
 
4.1%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
K 1
50.0%
Space Separator
ValueCountFrequency (%)
471
100.0%
Other Punctuation
ValueCountFrequency (%)
, 70
100.0%
Open Punctuation
ValueCountFrequency (%)
( 69
100.0%
Close Punctuation
ValueCountFrequency (%)
) 68
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 25
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1689
56.3%
Common 1310
43.7%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
123
 
7.3%
107
 
6.3%
92
 
5.4%
67
 
4.0%
51
 
3.0%
51
 
3.0%
40
 
2.4%
40
 
2.4%
39
 
2.3%
37
 
2.2%
Other values (174) 1042
61.7%
Common
ValueCountFrequency (%)
471
36.0%
1 154
 
11.8%
2 99
 
7.6%
, 70
 
5.3%
( 69
 
5.3%
) 68
 
5.2%
3 66
 
5.0%
0 54
 
4.1%
5 50
 
3.8%
7 42
 
3.2%
Other values (5) 167
 
12.7%
Latin
ValueCountFrequency (%)
S 1
50.0%
K 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1689
56.3%
ASCII 1312
43.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
471
35.9%
1 154
 
11.7%
2 99
 
7.5%
, 70
 
5.3%
( 69
 
5.3%
) 68
 
5.2%
3 66
 
5.0%
0 54
 
4.1%
5 50
 
3.8%
7 42
 
3.2%
Other values (7) 169
 
12.9%
Hangul
ValueCountFrequency (%)
123
 
7.3%
107
 
6.3%
92
 
5.4%
67
 
4.0%
51
 
3.0%
51
 
3.0%
40
 
2.4%
40
 
2.4%
39
 
2.3%
37
 
2.2%
Other values (174) 1042
61.7%

연락처
Text

MISSING 

Distinct128
Distinct (%)93.4%
Missing4
Missing (%)2.8%
Memory size1.2 KiB
2023-12-11T09:14:33.750530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.021898
Min length12

Characters and Unicode

Total characters1647
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique120 ?
Unique (%)87.6%

Sample

1st row055-288-8207
2nd row055-265-4500
3rd row055-292-6662
4th row055-237-3050
5th row055-263-3111
ValueCountFrequency (%)
070-7618-8268 3
 
2.2%
055-833-9797 2
 
1.5%
055-222-8433 2
 
1.5%
055-934-0040 2
 
1.5%
055-754-6698 2
 
1.5%
055-747-3636 2
 
1.5%
055-834-2200 2
 
1.5%
055-688-6188 2
 
1.5%
055-372-4411 1
 
0.7%
055-688-1231 1
 
0.7%
Other values (118) 118
86.1%
2023-12-11T09:14:34.458332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 358
21.7%
- 274
16.6%
0 243
14.8%
3 128
 
7.8%
2 122
 
7.4%
1 112
 
6.8%
6 105
 
6.4%
8 95
 
5.8%
7 82
 
5.0%
4 77
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1373
83.4%
Dash Punctuation 274
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 358
26.1%
0 243
17.7%
3 128
 
9.3%
2 122
 
8.9%
1 112
 
8.2%
6 105
 
7.6%
8 95
 
6.9%
7 82
 
6.0%
4 77
 
5.6%
9 51
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 274
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1647
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 358
21.7%
- 274
16.6%
0 243
14.8%
3 128
 
7.8%
2 122
 
7.4%
1 112
 
6.8%
6 105
 
6.4%
8 95
 
5.8%
7 82
 
5.0%
4 77
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1647
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 358
21.7%
- 274
16.6%
0 243
14.8%
3 128
 
7.8%
2 122
 
7.4%
1 112
 
6.8%
6 105
 
6.4%
8 95
 
5.8%
7 82
 
5.0%
4 77
 
4.7%

구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
주사무소
127 
영업소
14 

Length

Max length4
Median length4
Mean length3.9007092
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주사무소
2nd row주사무소
3rd row주사무소
4th row주사무소
5th row주사무소

Common Values

ValueCountFrequency (%)
주사무소 127
90.1%
영업소 14
 
9.9%

Length

2023-12-11T09:14:34.600974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:14:34.741185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주사무소 127
90.1%
영업소 14
 
9.9%

Correlations

2023-12-11T09:14:34.837283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관할관청구분
관할관청1.0000.633
구분0.6331.000
2023-12-11T09:14:34.940521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관할관청구분
관할관청1.0000.472
구분0.4721.000
2023-12-11T09:14:35.038411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관할관청구분
관할관청1.0000.472
구분0.4721.000

Missing values

2023-12-11T09:14:31.916112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:14:32.012253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

관할관청업체명주소연락처구분
0창원시㈜성산고속관광창원시 의창구 우곡로217번길 24, 모비딕빌딩 401호(명서동)055-288-8207주사무소
1창원시㈜현대고속관광창원시 의창구 평산로159번길 5,2층(중동)055-265-4500주사무소
2창원시㈜동창원투어창원시 의창구 동읍 동읍로 112055-292-6662주사무소
3창원시성운고속관광㈜창원시 성산구 용지로169번길 15, 레이크펄스 502호055-237-3050주사무소
4창원시㈜하나로고속관광창원시 성산구 원이대로 581, 3층 (용호동,창원시노동복지회관)055-263-3111주사무소
5창원시㈜다모아투어창원시 성산구 용지로 161,201호(용호동,경남빌딩)055-266-1155주사무소
6창원시㈜경청고속투어창원시 성산구 신사로 58, 105호 (신월동,학원상가)055-267-2525주사무소
7창원시㈜명신관광창원시 성산구 상남로 37, 지하101호(상남동, 덕산베스트빌)055-238-5040주사무소
8창원시창원고속관광㈜창원시 성산구 중앙대로 111, 1007호(중앙동, 평화오피스텔)055-267-0056주사무소
9창원시㈜대성고속관광창원시 성산구 완암로 50, 11층 1111호(성산동, SK테크노파크테크동)055-283-1212주사무소
관할관청업체명주소연락처구분
131산청군신동아관광협동조합산청군 신안면 지리산대로3490055-973-6377주사무소
132함양군(주)뉴-신흥관광함양군 함양읍 용평3길 1, 2층055-963-2323주사무소
133함양군명신고속관광(주)함양군 함양읍 용평중앙길 32055-962-3377영업소
134거창군㈜거창관광거창군 거창읍 중앙로1길 62055-944-7170주사무소
135거창군누리고속관광㈜거창군 거창읍 강남로 236055-945-0630주사무소
136거창군거창시민관광㈜거창군 거창읍 거열로 173055-945-1310주사무소
137거창군명신고속관광㈜거창군 거창읍 거열로 131-1055-944-7111주사무소
138합천군합천새천년관광㈜합천군 합천읍 옥산로 102055-931-1212주사무소
139합천군해인고속관광㈜합천군 합천읍 동서로 63<NA>주사무소
140합천군금화고속관광합천군 삼가면 삼가로 347055-934-0040주사무소