Overview

Dataset statistics

Number of variables5
Number of observations38
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory44.5 B

Variable types

Numeric1
Text3
DateTime1

Dataset

Description경상남도 내의 시외버스 업체 현황을 제공합니다. 시외버스 업체의 회사명, 회사주소, 전화번호, 면허일자등의 데이터를 포함하고있습니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3083989

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:50:03.284541
Analysis finished2023-12-11 00:50:03.807485
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct38
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.5
Minimum1
Maximum38
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size474.0 B
2023-12-11T09:50:03.878497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.85
Q110.25
median19.5
Q328.75
95-th percentile36.15
Maximum38
Range37
Interquartile range (IQR)18.5

Descriptive statistics

Standard deviation11.113055
Coefficient of variation (CV)0.56990028
Kurtosis-1.2
Mean19.5
Median Absolute Deviation (MAD)9.5
Skewness0
Sum741
Variance123.5
MonotonicityStrictly increasing
2023-12-11T09:50:04.036823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=38)
ValueCountFrequency (%)
1 1
 
2.6%
30 1
 
2.6%
23 1
 
2.6%
24 1
 
2.6%
25 1
 
2.6%
26 1
 
2.6%
27 1
 
2.6%
28 1
 
2.6%
29 1
 
2.6%
31 1
 
2.6%
Other values (28) 28
73.7%
ValueCountFrequency (%)
1 1
2.6%
2 1
2.6%
3 1
2.6%
4 1
2.6%
5 1
2.6%
6 1
2.6%
7 1
2.6%
8 1
2.6%
9 1
2.6%
10 1
2.6%
ValueCountFrequency (%)
38 1
2.6%
37 1
2.6%
36 1
2.6%
35 1
2.6%
34 1
2.6%
33 1
2.6%
32 1
2.6%
31 1
2.6%
30 1
2.6%
29 1
2.6%
Distinct21
Distinct (%)55.3%
Missing0
Missing (%)0.0%
Memory size436.0 B
2023-12-11T09:50:04.204859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length5
Mean length5.1315789
Min length3

Characters and Unicode

Total characters195
Distinct characters47
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)36.8%

Sample

1st row경남고속㈜
2nd row부산교통㈜
3rd row부산교통㈜
4th row대한여객㈜
5th row대한여객㈜
ValueCountFrequency (%)
창원고속㈜ 5
13.2%
거창고속㈜ 4
 
10.5%
대한여객㈜ 4
 
10.5%
㈜세원 4
 
10.5%
영화여객㈜ 3
 
7.9%
함양지리산고속㈜ 2
 
5.3%
부산교통㈜ 2
 
5.3%
김해여객㈜ 1
 
2.6%
밀성여객㈜ 1
 
2.6%
해운대고속(주 1
 
2.6%
Other values (11) 11
28.9%
2023-12-11T09:50:04.515440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
36
18.5%
16
 
8.2%
16
 
8.2%
15
 
7.7%
14
 
7.2%
10
 
5.1%
9
 
4.6%
6
 
3.1%
5
 
2.6%
4
 
2.1%
Other values (37) 64
32.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 157
80.5%
Other Symbol 36
 
18.5%
Close Punctuation 1
 
0.5%
Open Punctuation 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
10.2%
16
 
10.2%
15
 
9.6%
14
 
8.9%
10
 
6.4%
9
 
5.7%
6
 
3.8%
5
 
3.2%
4
 
2.5%
4
 
2.5%
Other values (34) 58
36.9%
Other Symbol
ValueCountFrequency (%)
36
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 193
99.0%
Common 2
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
18.7%
16
 
8.3%
16
 
8.3%
15
 
7.8%
14
 
7.3%
10
 
5.2%
9
 
4.7%
6
 
3.1%
5
 
2.6%
4
 
2.1%
Other values (35) 62
32.1%
Common
ValueCountFrequency (%)
) 1
50.0%
( 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 157
80.5%
None 36
 
18.5%
ASCII 2
 
1.0%

Most frequent character per block

None
ValueCountFrequency (%)
36
100.0%
Hangul
ValueCountFrequency (%)
16
 
10.2%
16
 
10.2%
15
 
9.6%
14
 
8.9%
10
 
6.4%
9
 
5.7%
6
 
3.8%
5
 
3.2%
4
 
2.5%
4
 
2.5%
Other values (34) 58
36.9%
ASCII
ValueCountFrequency (%)
) 1
50.0%
( 1
50.0%

주소
Text

Distinct31
Distinct (%)81.6%
Missing0
Missing (%)0.0%
Memory size436.0 B
2023-12-11T09:50:04.754233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length34
Mean length29.947368
Min length21

Characters and Unicode

Total characters1138
Distinct characters85
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)76.3%

Sample

1st row부산광역시 금정구 중앙대로 2008(남산동 118-8)
2nd row경상남도 진주시 진양호로 438 (인사동 157)
3rd row경상남도 진주시 진양호로 438 (인사동 157)
4th row경상남도 진주시 진양호로 438 (인사동 157)
5th row경상남도 진주시 진양호로 438 (인사동 157)
ValueCountFrequency (%)
경상남도 34
 
14.7%
진주시 11
 
4.7%
진양호로 9
 
3.9%
438 9
 
3.9%
인사동 9
 
3.9%
창원시 8
 
3.4%
157 7
 
3.0%
마산합포구 6
 
2.6%
양산시 5
 
2.2%
750(합성동 5
 
2.2%
Other values (88) 129
55.6%
2023-12-11T09:50:05.123522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
194
 
17.0%
1 61
 
5.4%
46
 
4.0%
) 42
 
3.7%
( 37
 
3.3%
36
 
3.2%
34
 
3.0%
34
 
3.0%
34
 
3.0%
33
 
2.9%
Other values (75) 587
51.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 588
51.7%
Decimal Number 265
23.3%
Space Separator 194
 
17.0%
Close Punctuation 42
 
3.7%
Open Punctuation 37
 
3.3%
Dash Punctuation 12
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
 
7.8%
36
 
6.1%
34
 
5.8%
34
 
5.8%
34
 
5.8%
33
 
5.6%
30
 
5.1%
24
 
4.1%
21
 
3.6%
19
 
3.2%
Other values (61) 277
47.1%
Decimal Number
ValueCountFrequency (%)
1 61
23.0%
2 27
10.2%
5 26
9.8%
3 26
9.8%
6 25
9.4%
0 24
 
9.1%
7 23
 
8.7%
4 18
 
6.8%
8 18
 
6.8%
9 17
 
6.4%
Space Separator
ValueCountFrequency (%)
194
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 588
51.7%
Common 550
48.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
 
7.8%
36
 
6.1%
34
 
5.8%
34
 
5.8%
34
 
5.8%
33
 
5.6%
30
 
5.1%
24
 
4.1%
21
 
3.6%
19
 
3.2%
Other values (61) 277
47.1%
Common
ValueCountFrequency (%)
194
35.3%
1 61
 
11.1%
) 42
 
7.6%
( 37
 
6.7%
2 27
 
4.9%
5 26
 
4.7%
3 26
 
4.7%
6 25
 
4.5%
0 24
 
4.4%
7 23
 
4.2%
Other values (4) 65
 
11.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 588
51.7%
ASCII 550
48.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
194
35.3%
1 61
 
11.1%
) 42
 
7.6%
( 37
 
6.7%
2 27
 
4.9%
5 26
 
4.7%
3 26
 
4.7%
6 25
 
4.5%
0 24
 
4.4%
7 23
 
4.2%
Other values (4) 65
 
11.8%
Hangul
ValueCountFrequency (%)
46
 
7.8%
36
 
6.1%
34
 
5.8%
34
 
5.8%
34
 
5.8%
33
 
5.6%
30
 
5.1%
24
 
4.1%
21
 
3.6%
19
 
3.2%
Other values (61) 277
47.1%
Distinct37
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size436.0 B
2023-12-11T09:50:05.357694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters456
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)94.7%

Sample

1st row051-583-7441
2nd row055-741-8802
3rd row055-741-8803
4th row055-742-3300
5th row055-742-3301
ValueCountFrequency (%)
051-559-1101 2
 
5.3%
051-583-7441 1
 
2.6%
055-354-6107 1
 
2.6%
055-863-3501 1
 
2.6%
055-945-0631 1
 
2.6%
055-945-0632 1
 
2.6%
055-945-0633 1
 
2.6%
055-945-0634 1
 
2.6%
055-224-3308 1
 
2.6%
055-547-8423 1
 
2.6%
Other values (27) 27
71.1%
2023-12-11T09:50:05.730305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 108
23.7%
- 76
16.7%
0 57
12.5%
4 40
 
8.8%
3 40
 
8.8%
2 28
 
6.1%
1 27
 
5.9%
6 24
 
5.3%
7 23
 
5.0%
8 22
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 380
83.3%
Dash Punctuation 76
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 108
28.4%
0 57
15.0%
4 40
 
10.5%
3 40
 
10.5%
2 28
 
7.4%
1 27
 
7.1%
6 24
 
6.3%
7 23
 
6.1%
8 22
 
5.8%
9 11
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 76
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 456
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 108
23.7%
- 76
16.7%
0 57
12.5%
4 40
 
8.8%
3 40
 
8.8%
2 28
 
6.1%
1 27
 
5.9%
6 24
 
5.3%
7 23
 
5.0%
8 22
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 456
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 108
23.7%
- 76
16.7%
0 57
12.5%
4 40
 
8.8%
3 40
 
8.8%
2 28
 
6.1%
1 27
 
5.9%
6 24
 
5.3%
7 23
 
5.0%
8 22
 
4.8%
Distinct21
Distinct (%)55.3%
Missing0
Missing (%)0.0%
Memory size436.0 B
Minimum1946-12-31 00:00:00
Maximum2015-11-06 00:00:00
2023-12-11T09:50:05.843851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:50:05.935510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)

Interactions

2023-12-11T09:50:03.551172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:50:06.000074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업체명주소전화번호면허일자
연번1.0000.9470.9251.0000.930
업체명0.9471.0000.9590.6690.998
주소0.9250.9591.0001.0000.926
전화번호1.0000.6691.0001.0000.669
면허일자0.9300.9980.9260.6691.000

Missing values

2023-12-11T09:50:03.666764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:50:03.767189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명주소전화번호면허일자
01경남고속㈜부산광역시 금정구 중앙대로 2008(남산동 118-8)051-583-74411966-02-01
12부산교통㈜경상남도 진주시 진양호로 438 (인사동 157)055-741-88021963-08-01
23부산교통㈜경상남도 진주시 진양호로 438 (인사동 157)055-741-88031963-08-01
34대한여객㈜경상남도 진주시 진양호로 438 (인사동 157)055-742-33001966-03-04
45대한여객㈜경상남도 진주시 진양호로 438 (인사동 157)055-742-33011966-03-04
56대한여객㈜경상남도 진주시 진양호로 438 (인사동 157)055-742-33021966-03-04
67대한여객㈜경상남도 진주시 진양호로 438 (인사동 157)055-742-33031966-03-04
78영화여객㈜경상남도 진주시 진양호로 438 (인사동 155)055-745-48811980-03-27
89영화여객㈜경상남도 진주시 진양호로 438 (인사동 156)055-745-48821980-03-28
910영화여객㈜경상남도 진주시 진양호로 438 (인사동 157)055-745-48831980-03-29
연번업체명주소전화번호면허일자
2829거제현대고속㈜경상남도 창원시 마산합포구 월영동서로 10 (해운동 5-56)055-224-33081966-01-14
2930동아여객㈜경상남도 창원시 진해구 태평로 34번길 17 (인의동 24-3)055-547-84232001-04-01
3031㈜세원경상남도 양산시 산막동단남 11길 129(북정동 94)055-384-66122003-08-01
3132㈜세원경상남도 양산시 산막동단남 11길 129(북정동 95)055-384-66132003-08-01
3233㈜세원경상남도 양산시 산막동단남 11길 129(북정동 96)055-384-66142003-08-01
3334㈜세원경상남도 양산시 산막동단남 11길 129(북정동 97)055-384-66152003-08-01
3435함양지리산고속㈜경상남도 함양군 함양읍 고운로 102 (용평리 679-6)055-963-37451980-04-01
3536함양지리산고속㈜경상남도 함양군 함양읍 고운로 102 (용평리 679-7)055-963-37461980-04-01
3637해운대고속(주)부산광역시 해운대구 해운대로 641(우동)051-743-16891980-06-24
3738동일익스프레스경상남도 함안군 군북면 함마대로 772055-583-79752015-11-06