Overview

Dataset statistics

Number of variables4
Number of observations370
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.7 KiB
Average record size in memory32.4 B

Variable types

Categorical1
Text3

Dataset

Description해당 데이터는 한국도로공사 본부 내 지사별로 현재 운영중인 영업소 현황을 보여준다. 영업소 현황을 보다 쉽게 정리하여 보여주는 자료임.
URLhttps://www.data.go.kr/data/15085270/fileData.do

Alerts

영업소 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:56:10.413937
Analysis finished2023-12-12 05:56:10.924262
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

본부
Categorical

Distinct8
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
부산경남
69 
대구경북
57 
광주전남
52 
강원
48 
수도권
41 
Other values (3)
103 

Length

Max length4
Median length4
Mean length3.2675676
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수도권
2nd row수도권
3rd row수도권
4th row수도권
5th row수도권

Common Values

ValueCountFrequency (%)
부산경남 69
18.6%
대구경북 57
15.4%
광주전남 52
14.1%
강원 48
13.0%
수도권 41
11.1%
대전충남 36
9.7%
전북 34
9.2%
충북 33
8.9%

Length

2023-12-12T14:56:11.042297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:56:11.231198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산경남 69
18.6%
대구경북 57
15.4%
광주전남 52
14.1%
강원 48
13.0%
수도권 41
11.1%
대전충남 36
9.7%
전북 34
9.2%
충북 33
8.9%

지사
Text

Distinct56
Distinct (%)15.1%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-12T14:56:11.513809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length2
Mean length2.0702703
Min length2

Characters and Unicode

Total characters766
Distinct characters58
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인천
2nd row인천
3rd row시흥
4th row시흥
5th row시흥
ValueCountFrequency (%)
대구 15
 
4.1%
진주 12
 
3.2%
함평 11
 
3.0%
울산 10
 
2.7%
화성 10
 
2.7%
이천 9
 
2.4%
순천 9
 
2.4%
춘천 9
 
2.4%
강릉 9
 
2.4%
군포 9
 
2.4%
Other values (46) 267
72.2%
2023-12-12T14:56:11.957929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
72
 
9.4%
56
 
7.3%
36
 
4.7%
31
 
4.0%
29
 
3.8%
28
 
3.7%
27
 
3.5%
25
 
3.3%
25
 
3.3%
21
 
2.7%
Other values (48) 416
54.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 766
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
72
 
9.4%
56
 
7.3%
36
 
4.7%
31
 
4.0%
29
 
3.8%
28
 
3.7%
27
 
3.5%
25
 
3.3%
25
 
3.3%
21
 
2.7%
Other values (48) 416
54.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 766
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
72
 
9.4%
56
 
7.3%
36
 
4.7%
31
 
4.0%
29
 
3.8%
28
 
3.7%
27
 
3.5%
25
 
3.3%
25
 
3.3%
21
 
2.7%
Other values (48) 416
54.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 766
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
72
 
9.4%
56
 
7.3%
36
 
4.7%
31
 
4.0%
29
 
3.8%
28
 
3.7%
27
 
3.5%
25
 
3.3%
25
 
3.3%
21
 
2.7%
Other values (48) 416
54.3%

영업소
Text

UNIQUE 

Distinct370
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-12T14:56:12.391544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length2
Mean length2.5324324
Min length2

Characters and Unicode

Total characters937
Distinct characters187
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique370 ?
Unique (%)100.0%

Sample

1st row인천
2nd row김포
3rd row서서울
4th row시흥
5th row남인천
ValueCountFrequency (%)
인천 1
 
0.3%
서포항 1
 
0.3%
경주 1
 
0.3%
동김천 1
 
0.3%
칠곡물류 1
 
0.3%
추풍령 1
 
0.3%
김천 1
 
0.3%
구미 1
 
0.3%
남구미 1
 
0.3%
왜관 1
 
0.3%
Other values (360) 360
97.3%
2023-12-12T14:56:12.985007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
4.5%
40
 
4.3%
39
 
4.2%
37
 
3.9%
37
 
3.9%
36
 
3.8%
29
 
3.1%
25
 
2.7%
24
 
2.6%
20
 
2.1%
Other values (177) 608
64.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 930
99.3%
Open Punctuation 3
 
0.3%
Close Punctuation 3
 
0.3%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
4.5%
40
 
4.3%
39
 
4.2%
37
 
4.0%
37
 
4.0%
36
 
3.9%
29
 
3.1%
25
 
2.7%
24
 
2.6%
20
 
2.2%
Other values (174) 601
64.6%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 930
99.3%
Common 7
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
4.5%
40
 
4.3%
39
 
4.2%
37
 
4.0%
37
 
4.0%
36
 
3.9%
29
 
3.1%
25
 
2.7%
24
 
2.6%
20
 
2.2%
Other values (174) 601
64.6%
Common
ValueCountFrequency (%)
( 3
42.9%
) 3
42.9%
2 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 930
99.3%
ASCII 7
 
0.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
4.5%
40
 
4.3%
39
 
4.2%
37
 
4.0%
37
 
4.0%
36
 
3.9%
29
 
3.1%
25
 
2.7%
24
 
2.6%
20
 
2.2%
Other values (174) 601
64.6%
ASCII
ValueCountFrequency (%)
( 3
42.9%
) 3
42.9%
2 1
 
14.3%

주소
Text

UNIQUE 

Distinct370
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-12T14:56:13.322701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length22.794595
Min length14

Characters and Unicode

Total characters8434
Distinct characters267
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique370 ?
Unique (%)100.0%

Sample

1st row인천광역시 계양구 경인고속도로 17
2nd row경기도 김포시 김포대로 319번길 209-23
3rd row경기도 안산시 상록구 장하로 141-2
4th row경기도 시흥시 서해안로 1780번길94
5th row인천광역시 남동구 음실로 117번길 32
ValueCountFrequency (%)
경기도 57
 
3.2%
경상남도 50
 
2.8%
경상북도 49
 
2.7%
전라남도 39
 
2.2%
강원도 35
 
1.9%
전라북도 33
 
1.8%
충청남도 29
 
1.6%
충청북도 29
 
1.6%
대구광역시 15
 
0.8%
남해고속도로 12
 
0.7%
Other values (991) 1457
80.7%
2023-12-12T14:56:13.882001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1448
 
17.2%
468
 
5.5%
326
 
3.9%
1 246
 
2.9%
241
 
2.9%
207
 
2.5%
196
 
2.3%
189
 
2.2%
2 183
 
2.2%
162
 
1.9%
Other values (257) 4768
56.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5463
64.8%
Space Separator 1448
 
17.2%
Decimal Number 1322
 
15.7%
Dash Punctuation 124
 
1.5%
Close Punctuation 37
 
0.4%
Open Punctuation 37
 
0.4%
Uppercase Letter 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
468
 
8.6%
326
 
6.0%
241
 
4.4%
207
 
3.8%
196
 
3.6%
189
 
3.5%
162
 
3.0%
161
 
2.9%
139
 
2.5%
135
 
2.5%
Other values (240) 3239
59.3%
Decimal Number
ValueCountFrequency (%)
1 246
18.6%
2 183
13.8%
3 154
11.6%
5 141
10.7%
4 132
10.0%
7 106
8.0%
6 97
 
7.3%
0 93
 
7.0%
9 87
 
6.6%
8 83
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
I 1
50.0%
C 1
50.0%
Space Separator
ValueCountFrequency (%)
1448
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 124
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5463
64.8%
Common 2969
35.2%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
468
 
8.6%
326
 
6.0%
241
 
4.4%
207
 
3.8%
196
 
3.6%
189
 
3.5%
162
 
3.0%
161
 
2.9%
139
 
2.5%
135
 
2.5%
Other values (240) 3239
59.3%
Common
ValueCountFrequency (%)
1448
48.8%
1 246
 
8.3%
2 183
 
6.2%
3 154
 
5.2%
5 141
 
4.7%
4 132
 
4.4%
- 124
 
4.2%
7 106
 
3.6%
6 97
 
3.3%
0 93
 
3.1%
Other values (5) 245
 
8.3%
Latin
ValueCountFrequency (%)
I 1
50.0%
C 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5463
64.8%
ASCII 2971
35.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1448
48.7%
1 246
 
8.3%
2 183
 
6.2%
3 154
 
5.2%
5 141
 
4.7%
4 132
 
4.4%
- 124
 
4.2%
7 106
 
3.6%
6 97
 
3.3%
0 93
 
3.1%
Other values (7) 247
 
8.3%
Hangul
ValueCountFrequency (%)
468
 
8.6%
326
 
6.0%
241
 
4.4%
207
 
3.8%
196
 
3.6%
189
 
3.5%
162
 
3.0%
161
 
2.9%
139
 
2.5%
135
 
2.5%
Other values (240) 3239
59.3%

Correlations

2023-12-12T14:56:14.027028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
본부지사
본부1.0001.000
지사1.0001.000

Missing values

2023-12-12T14:56:10.795712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:56:10.884912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

본부지사영업소주소
0수도권인천인천인천광역시 계양구 경인고속도로 17
1수도권인천김포경기도 김포시 김포대로 319번길 209-23
2수도권시흥서서울경기도 안산시 상록구 장하로 141-2
3수도권시흥시흥경기도 시흥시 서해안로 1780번길94
4수도권시흥남인천인천광역시 남동구 음실로 117번길 32
5수도권군포군자경기도 시흥시 군자로335번길 36-29
6수도권군포서안산경기도 안산시 단원구 시흥대로 19-30
7수도권군포안산경기도 안산시 오리골길 15-1
8수도권군포군포경기도 군포시 영동고속도로 26
9수도권군포동군포경기도 군포시 영동고속도로 25
본부지사영업소주소
360부산경남고성동고성경상남도 고성군 거류면 송산로 421
361부산경남고성북통영경상남도 통영시 광도면 남해안대로 1227
362부산경남고성통영경상남도 통영시 용남면 기호로 25-52
363부산경남서울산부산부산광역시 금정구 고분로 148
364부산경남서울산노포부산광역시 금정구 고분로93번길 51
365부산경남서울산양산경상남도 양산시 상북면 와곡2길 12-1
366부산경남서울산통도사울산광역시 울주군 삼남면 반구대로 80
367부산경남서울산서울산울산광역시 울주군 삼남면 반구대로 760
368부산경남서울산활천울산광역시 울주군 두서면 활천내와로 30-25
369부산경남서울산배내골경상남도 양산시 원동면 배내로 915