Overview

Dataset statistics

Number of variables15
Number of observations355
Missing cells4
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory41.7 KiB
Average record size in memory120.4 B

Variable types

Categorical2
Text1
Unsupported12

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15015/S/1/datasetView.do

Alerts

관리기관 is highly overall correlated with 도로종별High correlation
도로종별 is highly overall correlated with 관리기관High correlation
관리기관 is highly imbalanced (97.2%)Imbalance
도로명 has 4 (1.1%) missing valuesMissing
도로표지 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
관광지표지 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
사설표지 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
전체표지 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-11 04:20:53.562902
Analysis finished2023-12-11 04:20:54.177962
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

관리기관
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
서울특별시
354 
<NA>
 
1

Length

Max length5
Median length5
Mean length4.9971831
Min length4

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row<NA>
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 354
99.7%
<NA> 1
 
0.3%

Length

2023-12-11T13:20:54.275290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T13:20:54.399747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 354
99.7%
na 1
 
0.3%

도로종별
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)8.7%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
구도
186 
시도
53 
시도61
 
18
일반국도1
 
17
시도70
 
15
Other values (26)
66 

Length

Max length6
Median length2
Mean length2.7802817
Min length2

Unique

Unique16 ?
Unique (%)4.5%

Sample

1st row<NA>
2nd row고속국도15
3rd row고속국도15
4th row군도
5th row일반국도1

Common Values

ValueCountFrequency (%)
구도 186
52.4%
시도 53
 
14.9%
시도61 18
 
5.1%
일반국도1 17
 
4.8%
시도70 15
 
4.2%
시도88 15
 
4.2%
시도30 14
 
3.9%
시도20 5
 
1.4%
일반국도47 3
 
0.8%
시도47 3
 
0.8%
Other values (21) 26
 
7.3%

Length

2023-12-11T13:20:54.537700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
구도 186
52.4%
시도 53
 
14.9%
시도61 18
 
5.1%
일반국도1 17
 
4.8%
시도70 15
 
4.2%
시도88 15
 
4.2%
시도30 14
 
3.9%
시도20 5
 
1.4%
일반국도47 3
 
0.8%
시도47 3
 
0.8%
Other values (21) 26
 
7.3%

도로명
Text

MISSING 

Distinct272
Distinct (%)77.5%
Missing4
Missing (%)1.1%
Memory size2.9 KiB
2023-12-11T13:20:54.964362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length4.4216524
Min length3

Characters and Unicode

Total characters1552
Distinct characters209
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique220 ?
Unique (%)62.7%

Sample

1st row서부샛길
2nd row서해안고속도로
3rd row서오릉로
4th row가좌로
5th row경부고속도로
ValueCountFrequency (%)
강변북로 5
 
1.4%
천호대로 5
 
1.4%
화곡로 4
 
1.1%
양재대로 4
 
1.1%
동부간선도로 4
 
1.1%
화랑로 4
 
1.1%
가좌로 3
 
0.9%
양녕로 3
 
0.9%
강동대로 3
 
0.9%
강남대로 3
 
0.9%
Other values (262) 313
89.2%
2023-12-11T13:20:55.579762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
331
 
21.3%
100
 
6.4%
55
 
3.5%
32
 
2.1%
31
 
2.0%
1 30
 
1.9%
28
 
1.8%
2 25
 
1.6%
23
 
1.5%
23
 
1.5%
Other values (199) 874
56.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1369
88.2%
Decimal Number 168
 
10.8%
Uppercase Letter 12
 
0.8%
Connector Punctuation 2
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
331
24.2%
100
 
7.3%
55
 
4.0%
32
 
2.3%
31
 
2.3%
28
 
2.0%
23
 
1.7%
23
 
1.7%
20
 
1.5%
16
 
1.2%
Other values (182) 710
51.9%
Decimal Number
ValueCountFrequency (%)
1 30
17.9%
2 25
14.9%
3 23
13.7%
5 20
11.9%
6 15
8.9%
4 15
8.9%
0 12
 
7.1%
8 10
 
6.0%
9 9
 
5.4%
7 9
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
N 4
33.3%
O 2
16.7%
E 2
16.7%
M 2
16.7%
A 2
16.7%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1369
88.2%
Common 171
 
11.0%
Latin 12
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
331
24.2%
100
 
7.3%
55
 
4.0%
32
 
2.3%
31
 
2.3%
28
 
2.0%
23
 
1.7%
23
 
1.7%
20
 
1.5%
16
 
1.2%
Other values (182) 710
51.9%
Common
ValueCountFrequency (%)
1 30
17.5%
2 25
14.6%
3 23
13.5%
5 20
11.7%
6 15
8.8%
4 15
8.8%
0 12
 
7.0%
8 10
 
5.8%
9 9
 
5.3%
7 9
 
5.3%
Other values (2) 3
 
1.8%
Latin
ValueCountFrequency (%)
N 4
33.3%
O 2
16.7%
E 2
16.7%
M 2
16.7%
A 2
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1369
88.2%
ASCII 183
 
11.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
331
24.2%
100
 
7.3%
55
 
4.0%
32
 
2.3%
31
 
2.3%
28
 
2.0%
23
 
1.7%
23
 
1.7%
20
 
1.5%
16
 
1.2%
Other values (182) 710
51.9%
ASCII
ValueCountFrequency (%)
1 30
16.4%
2 25
13.7%
3 23
12.6%
5 20
10.9%
6 15
8.2%
4 15
8.2%
0 12
 
6.6%
8 10
 
5.5%
9 9
 
4.9%
7 9
 
4.9%
Other values (7) 15
8.2%

도로표지
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.9 KiB

Unnamed: 4
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.9 KiB

Unnamed: 5
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.9 KiB

관광지표지
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.9 KiB

Unnamed: 7
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.9 KiB

Unnamed: 8
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.9 KiB

사설표지
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.9 KiB

Unnamed: 10
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.9 KiB

Unnamed: 11
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.9 KiB

전체표지
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.9 KiB

Unnamed: 13
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.9 KiB

Unnamed: 14
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.9 KiB

Correlations

2023-12-11T13:20:55.705005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도로종별
도로종별1.000
2023-12-11T13:20:55.816729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리기관도로종별
관리기관1.0001.000
도로종별1.0001.000
2023-12-11T13:20:55.916822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리기관도로종별
관리기관1.0001.000
도로종별1.0001.000

Missing values

2023-12-11T13:20:53.831565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T13:20:54.083619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

관리기관도로종별도로명도로표지Unnamed: 4Unnamed: 5관광지표지Unnamed: 7Unnamed: 8사설표지Unnamed: 10Unnamed: 11전체표지Unnamed: 13Unnamed: 14
0<NA><NA><NA>입력보유비율입력보유비율입력보유비율입력보유비율
1서울특별시고속국도15서부샛길200000000200
2서울특별시고속국도15서해안고속도로200000000200
3서울특별시군도서오릉로100000000100
4서울특별시일반국도1<NA>400000000400
5서울특별시일반국도1가좌로100000000100
6서울특별시일반국도1경부고속도로100000000100
7서울특별시일반국도1국회대로100000000100
8서울특별시일반국도1마포나루길100000000100
9서울특별시일반국도1서부간선도로13000000001300
관리기관도로종별도로명도로표지Unnamed: 4Unnamed: 5관광지표지Unnamed: 7Unnamed: 8사설표지Unnamed: 10Unnamed: 11전체표지Unnamed: 13Unnamed: 14
345서울특별시구도현충로100000000100
346서울특별시구도호암로100000000100
347서울특별시구도홍제천로100000000100
348서울특별시구도화곡로300000000300
349서울특별시구도화곡로27길100000000100
350서울특별시구도화랑로100000000100
351서울특별시구도회기로100000000100
352서울특별시구도효령로200000000200
353서울특별시구도70양녕로100000000100
354서울특별시구도111증가로6길100000000100