Overview

Dataset statistics

Number of variables3
Number of observations370
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.8 KiB
Average record size in memory24.4 B

Variable types

Categorical1
Text2

Dataset

Description해당 데이터는 고속도로 노선별로 현재 운영중인 영업소 현황을 보여준다. 영업소 현황을 보다 쉽게 정리하여 보여주는 자료임.
URLhttps://www.data.go.kr/data/15085269/fileData.do

Alerts

영업소 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:49:41.225061
Analysis finished2023-12-12 22:49:41.578826
Duration0.35 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

노선명
Categorical

Distinct33
Distinct (%)8.9%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
경부선
38 
남해선
32 
통영대전 중부선
29 
서해안선
27 
영동선
23 
Other values (28)
221 

Length

Max length12
Median length8
Mean length4.9459459
Min length3

Unique

Unique4 ?
Unique (%)1.1%

Sample

1st row경인선
2nd row수도권제1순환선
3rd row서해안선
4th row수도권제1순환선
5th row제2경인선

Common Values

ValueCountFrequency (%)
경부선 38
 
10.3%
남해선 32
 
8.6%
통영대전 중부선 29
 
7.8%
서해안선 27
 
7.3%
영동선 23
 
6.2%
중부내륙선 23
 
6.2%
당진영덕선 22
 
5.9%
중앙선 21
 
5.7%
호남선 19
 
5.1%
동해선 17
 
4.6%
Other values (23) 119
32.2%

Length

2023-12-13T07:49:41.644558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경부선 38
 
9.0%
남해선 32
 
7.6%
통영대전 29
 
6.9%
중부선 29
 
6.9%
서해안선 27
 
6.4%
영동선 23
 
5.5%
중부내륙선 23
 
5.5%
당진영덕선 22
 
5.2%
중앙선 21
 
5.0%
호남선 19
 
4.5%
Other values (26) 158
37.5%

영업소
Text

UNIQUE 

Distinct370
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-13T07:49:42.055575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length2
Mean length2.5324324
Min length2

Characters and Unicode

Total characters937
Distinct characters187
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique370 ?
Unique (%)100.0%

Sample

1st row인천
2nd row김포
3rd row서서울
4th row시흥
5th row남인천
ValueCountFrequency (%)
인천 1
 
0.3%
서포항 1
 
0.3%
경주 1
 
0.3%
동김천 1
 
0.3%
칠곡물류 1
 
0.3%
추풍령 1
 
0.3%
김천 1
 
0.3%
구미 1
 
0.3%
남구미 1
 
0.3%
왜관 1
 
0.3%
Other values (360) 360
97.3%
2023-12-13T07:49:42.566220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
4.5%
40
 
4.3%
39
 
4.2%
37
 
3.9%
37
 
3.9%
36
 
3.8%
29
 
3.1%
25
 
2.7%
24
 
2.6%
20
 
2.1%
Other values (177) 608
64.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 930
99.3%
Open Punctuation 3
 
0.3%
Close Punctuation 3
 
0.3%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
4.5%
40
 
4.3%
39
 
4.2%
37
 
4.0%
37
 
4.0%
36
 
3.9%
29
 
3.1%
25
 
2.7%
24
 
2.6%
20
 
2.2%
Other values (174) 601
64.6%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 930
99.3%
Common 7
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
4.5%
40
 
4.3%
39
 
4.2%
37
 
4.0%
37
 
4.0%
36
 
3.9%
29
 
3.1%
25
 
2.7%
24
 
2.6%
20
 
2.2%
Other values (174) 601
64.6%
Common
ValueCountFrequency (%)
( 3
42.9%
) 3
42.9%
2 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 930
99.3%
ASCII 7
 
0.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
4.5%
40
 
4.3%
39
 
4.2%
37
 
4.0%
37
 
4.0%
36
 
3.9%
29
 
3.1%
25
 
2.7%
24
 
2.6%
20
 
2.2%
Other values (174) 601
64.6%
ASCII
ValueCountFrequency (%)
( 3
42.9%
) 3
42.9%
2 1
 
14.3%

주소
Text

UNIQUE 

Distinct370
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-13T07:49:42.879645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length22.797297
Min length14

Characters and Unicode

Total characters8435
Distinct characters268
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique370 ?
Unique (%)100.0%

Sample

1st row인천광역시 계양구 경인고속도로 17
2nd row경기도 김포시 김포대로 319번길 209-23
3rd row경기도 안산시 상록구 장하로 141-2
4th row경기도 시흥시 서해안로 1780번길94
5th row인천광역시 남동구 음실로 117번길 32
ValueCountFrequency (%)
경기도 57
 
3.2%
경상남도 50
 
2.8%
경상북도 49
 
2.7%
전라남도 39
 
2.2%
강원도 35
 
1.9%
전라북도 33
 
1.8%
충청남도 29
 
1.6%
충청북도 29
 
1.6%
대구광역시 15
 
0.8%
남해고속도로 12
 
0.7%
Other values (991) 1457
80.7%
2023-12-13T07:49:43.289655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1448
 
17.2%
468
 
5.5%
326
 
3.9%
1 246
 
2.9%
241
 
2.9%
207
 
2.5%
196
 
2.3%
189
 
2.2%
2 183
 
2.2%
162
 
1.9%
Other values (258) 4769
56.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5463
64.8%
Space Separator 1448
 
17.2%
Decimal Number 1322
 
15.7%
Dash Punctuation 124
 
1.5%
Close Punctuation 37
 
0.4%
Open Punctuation 37
 
0.4%
Other Punctuation 2
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
468
 
8.6%
326
 
6.0%
241
 
4.4%
207
 
3.8%
196
 
3.6%
189
 
3.5%
162
 
3.0%
161
 
2.9%
139
 
2.5%
135
 
2.5%
Other values (240) 3239
59.3%
Decimal Number
ValueCountFrequency (%)
1 246
18.6%
2 183
13.8%
3 154
11.6%
5 141
10.7%
4 132
10.0%
7 106
8.0%
6 97
 
7.3%
0 93
 
7.0%
9 87
 
6.6%
8 83
 
6.3%
Other Punctuation
ValueCountFrequency (%)
? 1
50.0%
. 1
50.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
50.0%
C 1
50.0%
Space Separator
ValueCountFrequency (%)
1448
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 124
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5463
64.8%
Common 2970
35.2%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
468
 
8.6%
326
 
6.0%
241
 
4.4%
207
 
3.8%
196
 
3.6%
189
 
3.5%
162
 
3.0%
161
 
2.9%
139
 
2.5%
135
 
2.5%
Other values (240) 3239
59.3%
Common
ValueCountFrequency (%)
1448
48.8%
1 246
 
8.3%
2 183
 
6.2%
3 154
 
5.2%
5 141
 
4.7%
4 132
 
4.4%
- 124
 
4.2%
7 106
 
3.6%
6 97
 
3.3%
0 93
 
3.1%
Other values (6) 246
 
8.3%
Latin
ValueCountFrequency (%)
I 1
50.0%
C 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5463
64.8%
ASCII 2972
35.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1448
48.7%
1 246
 
8.3%
2 183
 
6.2%
3 154
 
5.2%
5 141
 
4.7%
4 132
 
4.4%
- 124
 
4.2%
7 106
 
3.6%
6 97
 
3.3%
0 93
 
3.1%
Other values (8) 248
 
8.3%
Hangul
ValueCountFrequency (%)
468
 
8.6%
326
 
6.0%
241
 
4.4%
207
 
3.8%
196
 
3.6%
189
 
3.5%
162
 
3.0%
161
 
2.9%
139
 
2.5%
135
 
2.5%
Other values (240) 3239
59.3%

Missing values

2023-12-13T07:49:41.457264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:49:41.545786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

노선명영업소주소
0경인선인천인천광역시 계양구 경인고속도로 17
1수도권제1순환선김포경기도 김포시 김포대로 319번길 209-23
2서해안선서서울경기도 안산시 상록구 장하로 141-2
3수도권제1순환선시흥경기도 시흥시 서해안로 1780번길94
4제2경인선남인천인천광역시 남동구 음실로 117번길 32
5영동선군자경기도 시흥시 군자로335번길 36-29
6영동선서안산경기도 안산시 단원구 시흥대로 19-30
7영동선안산경기도 안산시 오리골길 15-1
8영동선군포경기도 군포시 영동고속도로 26
9영동선동군포경기도 군포시 영동고속도로 25
노선명영업소주소
360통영대전 중부선동고성경상남도 고성군 거류면 송산로 421
361통영대전 중부선북통영경상남도 통영시 광도면 남해안대로 1227
362통영대전 중부선통영경상남도 통영시 용남면 기호로 25-52
363경부선부산부산광역시 금정구 고분로 148
364경부선노포부산광역시 금정구 고분로93번길 51
365경부선양산경상남도 양산시 상북면 와곡2길 12-1
366경부선통도사울산광역시 울주군 삼남면 반구대로 80
367경부선서울산울산광역시 울주군 삼남면 반구대로 760
368경부선활천울산광역시 울주군 두서면 활천내와로 30-25
369함양울산선배내골경상남도 양산시 원동면 배내로 915