Overview

Dataset statistics

Number of variables7
Number of observations160
Missing cells1
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.2 KiB
Average record size in memory58.8 B

Variable types

Numeric2
Text3
DateTime1
Categorical1

Dataset

Description경상남도 전세버스 운송사업 업체 현황으로, 전세버스 운송사업의 관할관청, 업체명, 주소, 연락처, 구분에 관한 정보를 제공합니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3083295

Alerts

등록대수 is highly overall correlated with 비고High correlation
비고 is highly overall correlated with 등록대수High correlation
비고 is highly imbalanced (55.1%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:14:40.541630
Analysis finished2023-12-11 00:14:41.343465
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct160
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean80.5
Minimum1
Maximum160
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-11T09:14:41.412812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.95
Q140.75
median80.5
Q3120.25
95-th percentile152.05
Maximum160
Range159
Interquartile range (IQR)79.5

Descriptive statistics

Standard deviation46.332134
Coefficient of variation (CV)0.57555446
Kurtosis-1.2
Mean80.5
Median Absolute Deviation (MAD)40
Skewness0
Sum12880
Variance2146.6667
MonotonicityStrictly increasing
2023-12-11T09:14:41.545056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
82 1
 
0.6%
104 1
 
0.6%
105 1
 
0.6%
106 1
 
0.6%
107 1
 
0.6%
108 1
 
0.6%
109 1
 
0.6%
110 1
 
0.6%
111 1
 
0.6%
Other values (150) 150
93.8%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
160 1
0.6%
159 1
0.6%
158 1
0.6%
157 1
0.6%
156 1
0.6%
155 1
0.6%
154 1
0.6%
153 1
0.6%
152 1
0.6%
151 1
0.6%
Distinct152
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-11T09:14:41.806091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length7.18125
Min length3

Characters and Unicode

Total characters1149
Distinct characters153
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique145 ?
Unique (%)90.6%

Sample

1st row성운고속투어(주)
2nd row(주)동원고속관광
3rd row(주)하나로고속관광
4th row(주)성산고속관광
5th row(주)현대고속관광
ValueCountFrequency (%)
㈜영진고속관광 3
 
1.9%
㈜블루투어 2
 
1.2%
합)양산코리아고속관광 2
 
1.2%
㈜태산항공 2
 
1.2%
㈜미진고속관광 2
 
1.2%
뉴명신관광㈜ 2
 
1.2%
그린고속관광㈜ 2
 
1.2%
주)삼양고속관광 1
 
0.6%
우리관광버스(협 1
 
0.6%
주)삼성애니투어 1
 
0.6%
Other values (142) 142
88.8%
2023-12-11T09:14:42.191951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
119
 
10.4%
114
 
9.9%
108
 
9.4%
80
 
7.0%
79
 
6.9%
) 29
 
2.5%
( 29
 
2.5%
25
 
2.2%
25
 
2.2%
24
 
2.1%
Other values (143) 517
45.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 978
85.1%
Other Symbol 108
 
9.4%
Close Punctuation 29
 
2.5%
Open Punctuation 29
 
2.5%
Uppercase Letter 3
 
0.3%
Space Separator 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
119
 
12.2%
114
 
11.7%
80
 
8.2%
79
 
8.1%
25
 
2.6%
25
 
2.6%
24
 
2.5%
23
 
2.4%
22
 
2.2%
20
 
2.0%
Other values (135) 447
45.7%
Uppercase Letter
ValueCountFrequency (%)
P 1
33.3%
V 1
33.3%
I 1
33.3%
Other Symbol
ValueCountFrequency (%)
108
100.0%
Close Punctuation
ValueCountFrequency (%)
) 29
100.0%
Open Punctuation
ValueCountFrequency (%)
( 29
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1086
94.5%
Common 60
 
5.2%
Latin 3
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
119
 
11.0%
114
 
10.5%
108
 
9.9%
80
 
7.4%
79
 
7.3%
25
 
2.3%
25
 
2.3%
24
 
2.2%
23
 
2.1%
22
 
2.0%
Other values (136) 467
43.0%
Common
ValueCountFrequency (%)
) 29
48.3%
( 29
48.3%
1
 
1.7%
, 1
 
1.7%
Latin
ValueCountFrequency (%)
P 1
33.3%
V 1
33.3%
I 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 978
85.1%
None 108
 
9.4%
ASCII 63
 
5.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
119
 
12.2%
114
 
11.7%
80
 
8.2%
79
 
8.1%
25
 
2.6%
25
 
2.6%
24
 
2.5%
23
 
2.4%
22
 
2.2%
20
 
2.0%
Other values (135) 447
45.7%
None
ValueCountFrequency (%)
108
100.0%
ASCII
ValueCountFrequency (%)
) 29
46.0%
( 29
46.0%
P 1
 
1.6%
V 1
 
1.6%
I 1
 
1.6%
1
 
1.6%
, 1
 
1.6%
Distinct149
Distinct (%)93.7%
Missing1
Missing (%)0.6%
Memory size1.4 KiB
2023-12-11T09:14:42.439233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1908
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique140 ?
Unique (%)88.1%

Sample

1st row055-237-3050
2nd row055-296-2445
3rd row055-263-3111
4th row055-288-8207
5th row055-265-4500
ValueCountFrequency (%)
055-332-5123 3
 
1.9%
055-833-9797 2
 
1.3%
055-643-4011 2
 
1.3%
055-834-2266 2
 
1.3%
055-311-3900 2
 
1.3%
055-237-3050 2
 
1.3%
055-296-2445 2
 
1.3%
055-331-5300 2
 
1.3%
055-742-5555 2
 
1.3%
055-388-0300 1
 
0.6%
Other values (139) 139
87.4%
2023-12-11T09:14:42.879615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 441
23.1%
- 318
16.7%
0 278
14.6%
3 175
 
9.2%
2 124
 
6.5%
1 117
 
6.1%
6 114
 
6.0%
4 111
 
5.8%
8 92
 
4.8%
7 88
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1590
83.3%
Dash Punctuation 318
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 441
27.7%
0 278
17.5%
3 175
 
11.0%
2 124
 
7.8%
1 117
 
7.4%
6 114
 
7.2%
4 111
 
7.0%
8 92
 
5.8%
7 88
 
5.5%
9 50
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 318
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1908
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 441
23.1%
- 318
16.7%
0 278
14.6%
3 175
 
9.2%
2 124
 
6.5%
1 117
 
6.1%
6 114
 
6.0%
4 111
 
5.8%
8 92
 
4.8%
7 88
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1908
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 441
23.1%
- 318
16.7%
0 278
14.6%
3 175
 
9.2%
2 124
 
6.5%
1 117
 
6.1%
6 114
 
6.0%
4 111
 
5.8%
8 92
 
4.8%
7 88
 
4.6%

주소
Text

Distinct151
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-11T09:14:43.262625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length34
Mean length18.475
Min length10

Characters and Unicode

Total characters2956
Distinct characters194
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique142 ?
Unique (%)88.8%

Sample

1st row창원시 의창구 대원로 70
2nd row창원시 의창구 의창대로 317번길 9
3rd row창원시 의창구 용지로 151
4th row창원시 의창구 우록로217번길 24
5th row창원시 의창구 도계로 13
ValueCountFrequency (%)
창원시 31
 
5.1%
김해시 23
 
3.8%
양산시 19
 
3.1%
진주시 17
 
2.8%
거제시 14
 
2.3%
의창구 9
 
1.5%
함안군 9
 
1.5%
성산구 8
 
1.3%
중앙로 8
 
1.3%
사천시 8
 
1.3%
Other values (349) 466
76.1%
2023-12-11T09:14:43.758715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
457
 
15.5%
136
 
4.6%
1 134
 
4.5%
124
 
4.2%
94
 
3.2%
2 82
 
2.8%
( 66
 
2.2%
) 66
 
2.2%
3 64
 
2.2%
4 58
 
2.0%
Other values (184) 1675
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1707
57.7%
Decimal Number 589
 
19.9%
Space Separator 457
 
15.5%
Open Punctuation 66
 
2.2%
Close Punctuation 66
 
2.2%
Other Punctuation 46
 
1.6%
Dash Punctuation 23
 
0.8%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
136
 
8.0%
124
 
7.3%
94
 
5.5%
57
 
3.3%
57
 
3.3%
52
 
3.0%
47
 
2.8%
43
 
2.5%
42
 
2.5%
37
 
2.2%
Other values (167) 1018
59.6%
Decimal Number
ValueCountFrequency (%)
1 134
22.8%
2 82
13.9%
3 64
10.9%
4 58
9.8%
5 51
 
8.7%
0 49
 
8.3%
8 43
 
7.3%
9 40
 
6.8%
7 35
 
5.9%
6 33
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
K 1
50.0%
Space Separator
ValueCountFrequency (%)
457
100.0%
Open Punctuation
ValueCountFrequency (%)
( 66
100.0%
Close Punctuation
ValueCountFrequency (%)
) 66
100.0%
Other Punctuation
ValueCountFrequency (%)
, 46
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1707
57.7%
Common 1247
42.2%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
136
 
8.0%
124
 
7.3%
94
 
5.5%
57
 
3.3%
57
 
3.3%
52
 
3.0%
47
 
2.8%
43
 
2.5%
42
 
2.5%
37
 
2.2%
Other values (167) 1018
59.6%
Common
ValueCountFrequency (%)
457
36.6%
1 134
 
10.7%
2 82
 
6.6%
( 66
 
5.3%
) 66
 
5.3%
3 64
 
5.1%
4 58
 
4.7%
5 51
 
4.1%
0 49
 
3.9%
, 46
 
3.7%
Other values (5) 174
 
14.0%
Latin
ValueCountFrequency (%)
S 1
50.0%
K 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1707
57.7%
ASCII 1249
42.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
457
36.6%
1 134
 
10.7%
2 82
 
6.6%
( 66
 
5.3%
) 66
 
5.3%
3 64
 
5.1%
4 58
 
4.6%
5 51
 
4.1%
0 49
 
3.9%
, 46
 
3.7%
Other values (7) 176
 
14.1%
Hangul
ValueCountFrequency (%)
136
 
8.0%
124
 
7.3%
94
 
5.5%
57
 
3.3%
57
 
3.3%
52
 
3.0%
47
 
2.8%
43
 
2.5%
42
 
2.5%
37
 
2.2%
Other values (167) 1018
59.6%
Distinct154
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
Minimum1982-09-23 00:00:00
Maximum2015-11-24 00:00:00
2023-12-11T09:14:43.902937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:14:44.117108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

등록대수
Real number (ℝ)

HIGH CORRELATION 

Distinct48
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.35
Minimum1
Maximum85
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-11T09:14:44.290424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5
Q111
median16
Q326
95-th percentile45.15
Maximum85
Range84
Interquartile range (IQR)15

Descriptive statistics

Standard deviation13.716904
Coefficient of variation (CV)0.67404934
Kurtosis3.5665094
Mean20.35
Median Absolute Deviation (MAD)6
Skewness1.6064297
Sum3256
Variance188.15346
MonotonicityNot monotonic
2023-12-11T09:14:44.412282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
10 13
 
8.1%
11 11
 
6.9%
15 10
 
6.2%
16 9
 
5.6%
12 7
 
4.4%
5 7
 
4.4%
22 5
 
3.1%
24 5
 
3.1%
20 5
 
3.1%
18 5
 
3.1%
Other values (38) 83
51.9%
ValueCountFrequency (%)
1 2
 
1.2%
2 2
 
1.2%
3 1
 
0.6%
5 7
4.4%
6 1
 
0.6%
7 4
 
2.5%
8 3
 
1.9%
9 2
 
1.2%
10 13
8.1%
11 11
6.9%
ValueCountFrequency (%)
85 1
0.6%
70 1
0.6%
61 1
0.6%
58 1
0.6%
51 2
1.2%
50 1
0.6%
48 1
0.6%
45 2
1.2%
44 2
1.2%
43 1
0.6%

비고
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
주사무소
145 
영업소
15 

Length

Max length4
Median length4
Mean length3.90625
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주사무소
2nd row주사무소
3rd row주사무소
4th row주사무소
5th row주사무소

Common Values

ValueCountFrequency (%)
주사무소 145
90.6%
영업소 15
 
9.4%

Length

2023-12-11T09:14:44.537647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:14:44.644854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주사무소 145
90.6%
영업소 15
 
9.4%

Interactions

2023-12-11T09:14:41.004335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:14:40.832061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:14:41.090642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:14:40.929117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:14:44.748897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등록대수비고
연번1.0000.3560.409
등록대수0.3561.0000.826
비고0.4090.8261.000
2023-12-11T09:14:45.110777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등록대수비고
연번1.000-0.2480.306
등록대수-0.2481.0000.642
비고0.3060.6421.000

Missing values

2023-12-11T09:14:41.192729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:14:41.302078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명전화번호주소등록일자등록대수비고
01성운고속투어(주)055-237-3050창원시 의창구 대원로 701995-09-1950주사무소
12(주)동원고속관광055-296-2445창원시 의창구 의창대로 317번길 92001-06-0126주사무소
23(주)하나로고속관광055-263-3111창원시 의창구 용지로 1512001-12-0512주사무소
34(주)성산고속관광055-288-8207창원시 의창구 우록로217번길 242002-12-1485주사무소
45(주)현대고속관광055-265-4500창원시 의창구 도계로 132003-03-0661주사무소
56(주)대정고속관광055-293-7044창원시 의창구 동읍 의창대로8492007-08-1010주사무소
67(주)다모아투어055-266-1155창원시 의창구 용지로 1612008-02-2833주사무소
78(주)경청고속투어055-267-2525창원시 의창구 신사로 58 105호2004-06-0813주사무소
89(주)성운고속관광055-237-3050창원시 의창구 대원로 702004-11-2414주사무소
910㈜명신관광055-238-5040창원시 성산구 대정로 110, 221호(대방동, 성원3차상가)1994-05-1334주사무소
연번업체명전화번호주소등록일자등록대수비고
150151경호관광㈜055-972-8844산청읍 친환경로26742001-01-0310주사무소
151152(뉴)신흥관광055-963-2323함양군 함양읍 용평3길12007-01-0715주사무소
152153명신관광㈜055-963-3377함양군 함양읍 용평중앙길322001-06-045영업소
153154(주)거창관광055-944-7170거창군 거창읍 중앙로1길 621994-03-0225주사무소
154155거창고속㈜055-945-0630거창군 거창읍 강남로 2362004-07-2815주사무소
155156거창시민관광㈜055-945-1310거창군 거창읍 중앙로 100-12004-07-2610주사무소
156157명신고속관광㈜055-944-7111거창군 거창읍 강변로 1552001-09-1010주사무소
157158합천새천년관광㈜055-931-1212합천군 대야로 880-41986-03-2516주사무소
158159해인고속관광㈜055-931-4477합천군 대야로 8831983-03-0811주사무소
159160금화고속관광055-934-0040합천군 삼가로 3471995-12-2315주사무소