Overview

Dataset statistics

Number of variables3
Number of observations370
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.8 KiB
Average record size in memory24.4 B

Variable types

Categorical1
Text2

Dataset

Description해당 데이터는 고속도로 운영형태별(폐쇄식/개방식)로 현재 운영중인 영업소 현황을 보여준다. 영업소 현황을 보다 쉽게 정리하여 보여주는 자료임.
URLhttps://www.data.go.kr/data/15085271/fileData.do

Alerts

운영형태 is highly imbalanced (71.9%)Imbalance
영업소 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:22:08.689684
Analysis finished2023-12-12 10:22:09.081523
Duration0.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

운영형태
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
폐쇄식
352 
개방식
 
18

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개방식
2nd row개방식
3rd row폐쇄식
4th row개방식
5th row개방식

Common Values

ValueCountFrequency (%)
폐쇄식 352
95.1%
개방식 18
 
4.9%

Length

2023-12-12T19:22:09.155401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:22:09.259770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐쇄식 352
95.1%
개방식 18
 
4.9%

영업소
Text

UNIQUE 

Distinct370
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-12T19:22:09.649431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length2
Mean length2.5324324
Min length2

Characters and Unicode

Total characters937
Distinct characters187
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique370 ?
Unique (%)100.0%

Sample

1st row인천
2nd row김포
3rd row서서울
4th row시흥
5th row남인천
ValueCountFrequency (%)
인천 1
 
0.3%
서포항 1
 
0.3%
경주 1
 
0.3%
동김천 1
 
0.3%
칠곡물류 1
 
0.3%
추풍령 1
 
0.3%
김천 1
 
0.3%
구미 1
 
0.3%
남구미 1
 
0.3%
왜관 1
 
0.3%
Other values (360) 360
97.3%
2023-12-12T19:22:10.294810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
4.5%
40
 
4.3%
39
 
4.2%
37
 
3.9%
37
 
3.9%
36
 
3.8%
29
 
3.1%
25
 
2.7%
24
 
2.6%
20
 
2.1%
Other values (177) 608
64.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 930
99.3%
Open Punctuation 3
 
0.3%
Close Punctuation 3
 
0.3%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
4.5%
40
 
4.3%
39
 
4.2%
37
 
4.0%
37
 
4.0%
36
 
3.9%
29
 
3.1%
25
 
2.7%
24
 
2.6%
20
 
2.2%
Other values (174) 601
64.6%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 930
99.3%
Common 7
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
4.5%
40
 
4.3%
39
 
4.2%
37
 
4.0%
37
 
4.0%
36
 
3.9%
29
 
3.1%
25
 
2.7%
24
 
2.6%
20
 
2.2%
Other values (174) 601
64.6%
Common
ValueCountFrequency (%)
( 3
42.9%
) 3
42.9%
2 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 930
99.3%
ASCII 7
 
0.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
4.5%
40
 
4.3%
39
 
4.2%
37
 
4.0%
37
 
4.0%
36
 
3.9%
29
 
3.1%
25
 
2.7%
24
 
2.6%
20
 
2.2%
Other values (174) 601
64.6%
ASCII
ValueCountFrequency (%)
( 3
42.9%
) 3
42.9%
2 1
 
14.3%

주소
Text

UNIQUE 

Distinct370
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-12T19:22:10.638213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length32
Mean length22.794595
Min length14

Characters and Unicode

Total characters8434
Distinct characters267
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique370 ?
Unique (%)100.0%

Sample

1st row인천광역시 계양구 경인고속도로 17
2nd row경기도 김포시 김포대로 319번길 209-23
3rd row경기도 안산시 상록구 장하로 141-2
4th row경기도 시흥시 서해안로 1780번길94
5th row인천광역시 남동구 음실로 117번길 32
ValueCountFrequency (%)
경기도 57
 
3.2%
경상남도 50
 
2.8%
경상북도 49
 
2.7%
전라남도 39
 
2.2%
강원도 35
 
1.9%
전라북도 33
 
1.8%
충청남도 29
 
1.6%
충청북도 29
 
1.6%
대구광역시 15
 
0.8%
남해고속도로 12
 
0.7%
Other values (991) 1457
80.7%
2023-12-12T19:22:11.145707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1448
 
17.2%
468
 
5.5%
326
 
3.9%
1 246
 
2.9%
241
 
2.9%
207
 
2.5%
196
 
2.3%
189
 
2.2%
2 183
 
2.2%
162
 
1.9%
Other values (257) 4768
56.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5463
64.8%
Space Separator 1448
 
17.2%
Decimal Number 1322
 
15.7%
Dash Punctuation 124
 
1.5%
Close Punctuation 37
 
0.4%
Open Punctuation 37
 
0.4%
Uppercase Letter 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
468
 
8.6%
326
 
6.0%
241
 
4.4%
207
 
3.8%
196
 
3.6%
189
 
3.5%
162
 
3.0%
161
 
2.9%
139
 
2.5%
135
 
2.5%
Other values (240) 3239
59.3%
Decimal Number
ValueCountFrequency (%)
1 246
18.6%
2 183
13.8%
3 154
11.6%
5 141
10.7%
4 132
10.0%
7 106
8.0%
6 97
 
7.3%
0 93
 
7.0%
9 87
 
6.6%
8 83
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
I 1
50.0%
C 1
50.0%
Space Separator
ValueCountFrequency (%)
1448
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 124
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5463
64.8%
Common 2969
35.2%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
468
 
8.6%
326
 
6.0%
241
 
4.4%
207
 
3.8%
196
 
3.6%
189
 
3.5%
162
 
3.0%
161
 
2.9%
139
 
2.5%
135
 
2.5%
Other values (240) 3239
59.3%
Common
ValueCountFrequency (%)
1448
48.8%
1 246
 
8.3%
2 183
 
6.2%
3 154
 
5.2%
5 141
 
4.7%
4 132
 
4.4%
- 124
 
4.2%
7 106
 
3.6%
6 97
 
3.3%
0 93
 
3.1%
Other values (5) 245
 
8.3%
Latin
ValueCountFrequency (%)
I 1
50.0%
C 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5463
64.8%
ASCII 2971
35.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1448
48.7%
1 246
 
8.3%
2 183
 
6.2%
3 154
 
5.2%
5 141
 
4.7%
4 132
 
4.4%
- 124
 
4.2%
7 106
 
3.6%
6 97
 
3.3%
0 93
 
3.1%
Other values (7) 247
 
8.3%
Hangul
ValueCountFrequency (%)
468
 
8.6%
326
 
6.0%
241
 
4.4%
207
 
3.8%
196
 
3.6%
189
 
3.5%
162
 
3.0%
161
 
2.9%
139
 
2.5%
135
 
2.5%
Other values (240) 3239
59.3%

Missing values

2023-12-12T19:22:08.970838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:22:09.045877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

운영형태영업소주소
0개방식인천인천광역시 계양구 경인고속도로 17
1개방식김포경기도 김포시 김포대로 319번길 209-23
2폐쇄식서서울경기도 안산시 상록구 장하로 141-2
3개방식시흥경기도 시흥시 서해안로 1780번길94
4개방식남인천인천광역시 남동구 음실로 117번길 32
5폐쇄식군자경기도 시흥시 군자로335번길 36-29
6폐쇄식서안산경기도 안산시 단원구 시흥대로 19-30
7폐쇄식안산경기도 안산시 오리골길 15-1
8폐쇄식군포경기도 군포시 영동고속도로 26
9폐쇄식동군포경기도 군포시 영동고속도로 25
운영형태영업소주소
360폐쇄식동고성경상남도 고성군 거류면 송산로 421
361폐쇄식북통영경상남도 통영시 광도면 남해안대로 1227
362폐쇄식통영경상남도 통영시 용남면 기호로 25-52
363폐쇄식부산부산광역시 금정구 고분로 148
364폐쇄식노포부산광역시 금정구 고분로93번길 51
365폐쇄식양산경상남도 양산시 상북면 와곡2길 12-1
366폐쇄식통도사울산광역시 울주군 삼남면 반구대로 80
367폐쇄식서울산울산광역시 울주군 삼남면 반구대로 760
368폐쇄식활천울산광역시 울주군 두서면 활천내와로 30-25
369폐쇄식배내골경상남도 양산시 원동면 배내로 915