Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 556.6 KiB
Average record size in memory: 57.0 B

Variable types

Numeric: 1
Text: 1
Categorical: 4

Dataset

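Description: 순번 (serial number), 노선명(도로명) (route name (road name)), 도로종류 (road type), 도로기능 (road function), 도로규모 (road scale), 도로폭 (road width), 시도구도구분 (city road / district road classification)
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-15496/S/1/datasetView.do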

Alerts

도로종류 is highly overall correlated with 도로기능 [High correlation]
도로규모 is highly overall correlated with 도로폭 [High correlation]
도로기능 is highly overall correlated with 도로종류 [High correlation]
도로폭 is highly overall correlated with 도로규모 [High correlation]
도로종류 is highly imbalanced (99.7%) [Imbalance]
도로기능 is highly imbalanced (58.2%) [Imbalance]
도로규모 is highly imbalanced (64.2%) [Imbalance]
도로폭 is highly imbalanced (64.2%) [Imbalance]
순번 has unique values [Unique]
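
The imbalance percentages in the alerts are consistent with an entropy-based score, 1 - H / log(k), where H is the entropy of the class frequencies and k is the number of classes. The sketch below is an assumption about how the score is derived, not the profiler's own code; the function name `imbalance_score` is hypothetical. It uses the value counts reported for 도로종류 and 도로기능 in this report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k) for a list of class counts.

    H is the Shannon entropy of the class frequencies and k the number of
    classes. 0 means perfectly balanced; values near 1 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # convention assumed here for a single class
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Value counts taken from the Common Values tables of this report.
road_type_counts = [9996, 3, 1]                # 도로종류
road_function_counts = [6653, 3283, 50, 11, 3]  # 도로기능

print(f"도로종류: {imbalance_score(road_type_counts):.1%}")
print(f"도로기능: {imbalance_score(road_function_counts):.1%}")
```

Under this assumption the two scores round to the 99.7% and 58.2% shown in the alerts. The scores for 도로규모 and 도로폭 cannot be checked the same way, because the report only lists the ten most common of their 13 values.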

Reproduction

Analysis started: 2024-05-11 04:17:52.812577
Analysis finished: 2024-05-11 04:17:55.490086
Duration: 2.68 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
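
A report of this kind can be regenerated roughly as follows. This is a minimal sketch, assuming the dataset has been downloaded as a CSV file; the function name `build_report`, the file paths, the report title, and the encoding default are all hypothetical and not taken from the original run.

```python
def build_report(csv_path, html_path="report.html", encoding="utf-8"):
    """Profile a CSV file and write the result as an HTML report (sketch)."""
    # Imported lazily so the function can be defined without the
    # third-party packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)

    report = ProfileReport(df, title="Road dataset profile")
    report.to_file(html_path)
    return html_path
```

Calling `build_report("roads.csv")` with a real file path would write `report.html` next to the script. The exact settings of the original run are in the `config.json` referenced above and are not reproduced here.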

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14080.245
Minimum: 1
Maximum: 28153
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1425.85
Q1: 7038.75
Median: 14084.5
Q3: 21162
95-th percentile: 26775.15
Maximum: 28153
Range: 28152
Interquartile range (IQR): 14123.25
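
The fractional quantiles (for example 1425.85) suggest that values between observations are interpolated. The sketch below assumes linear interpolation, which is the pandas default; the helper `quantile_linear` is hypothetical and shown on a toy list, since the raw data is not part of this report. The last lines recompute the range and IQR from the figures above.

```python
import math


def quantile_linear(sorted_values, q):
    """Quantile of an already sorted list using linear interpolation."""
    pos = (len(sorted_values) - 1) * q  # fractional index into the list
    lo = math.floor(pos)
    hi = math.ceil(pos)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


toy = [1, 2, 3, 4]
print(quantile_linear(toy, 0.25))  # 1.75
print(quantile_linear(toy, 0.5))   # 2.5

# Derived statistics recomputed from the values reported above.
q1, q3 = 7038.75, 21162
minimum, maximum = 1, 28153
iqr = q3 - q1                  # interquartile range
value_range = maximum - minimum
print(iqr, value_range)        # 14123.25 28152
```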

Descriptive statistics

Standard deviation: 8142.2864
Coefficient of variation (CV): 0.57827731
Kurtosis: -1.2045182
Mean: 14080.245
Median Absolute Deviation (MAD): 7070
Skewness: 0.00055325065
Sum: 1.4080245 × 10^8
Variance: 66296827
Monotonicity: Not monotonic
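
Several of these figures follow from one another: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. A short check using only the rounded values printed in this report:

```python
# Values as printed in the report (already rounded for display).
n = 10000
mean = 14080.245
std = 8142.2864

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance from the standard deviation
total = mean * n       # sum of all values

print(f"CV       = {cv:.6f}")
print(f"variance = {variance:.1f}")
print(f"sum      = {total:.7e}")
```

The results agree with the reported CV, variance, and sum up to the rounding of the displayed inputs.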
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
8963 | 1 | < 0.1%
1362 | 1 | < 0.1%
21554 | 1 | < 0.1%
19936 | 1 | < 0.1%
17112 | 1 | < 0.1%
3208 | 1 | < 0.1%
19628 | 1 | < 0.1%
27197 | 1 | < 0.1%
24290 | 1 | < 0.1%
17634 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
11 | 1 | < 0.1%
12 | 1 | < 0.1%
13 | 1 | < 0.1%
15 | 1 | < 0.1%
17 | 1 | < 0.1%
19 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
28153 | 1 | < 0.1%
28149 | 1 | < 0.1%
28144 | 1 | < 0.1%
28142 | 1 | < 0.1%
28141 | 1 | < 0.1%
28136 | 1 | < 0.1%
28135 | 1 | < 0.1%
28132 | 1 | < 0.1%
28131 | 1 | < 0.1%
28129 | 1 | < 0.1%

노선명(도로명)
Text

Distinct: 9997
Distinct (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 13
Median length: 12
Mean length: 8.2355
Min length: 3

Characters and Unicode

Total characters: 82355
Distinct characters: 310
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
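
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It uses only the five sample rows shown in this report, so its counts describe that sample and not the full column.

```python
import unicodedata
from collections import Counter

# The five sample rows of 노선명(도로명) shown in this report.
sample_rows = ["무명도로-25139", "무명도로-44267", "연세로13길", "무명도로-24209", "오패산로35길"]

# Normalize to NFC so each Hangul syllable is a single code point.
rows = [unicodedata.normalize("NFC", row) for row in sample_rows]

# General category per character: Lo = Other Letter, Nd = Decimal Number,
# Pd = Dash Punctuation, Sm = Math Symbol, Po = Other Punctuation.
category_counts = Counter(unicodedata.category(ch) for row in rows for ch in row)

total_characters = sum(len(row) for row in rows)
mean_length = total_characters / len(rows)

print(category_counts)
print(total_characters, mean_length)
```

The same division explains the mean length of the full column: 82355 total characters over 10000 rows gives 8.2355.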

Unique

Unique: 9994
Unique (%): 99.9%

Sample

1st row: 무명도로-25139
2nd row: 무명도로-44267
3rd row: 연세로13길
4th row: 무명도로-24209
5th row: 오패산로35길

Common values

Value | Count | Frequency (%)
대사관로5길 | 2 | < 0.1%
홍지문길 | 2 | < 0.1%
양산로7길 | 2 | < 0.1%
시루봉로15길 | 1 | < 0.1%
도봉로167길 | 1 | < 0.1%
다산로36길 | 1 | < 0.1%
전농로4길 | 1 | < 0.1%
무명도로-24751 | 1 | < 0.1%
왕십리로16길 | 1 | < 0.1%
선잠로5다길 | 1 | < 0.1%
Other values (9987) | 9987 | 99.9%

Most occurring characters

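Value | Count | Frequency (%)
[missing] | 9568 | 11.6%
[missing] | 5193 | 6.3%
[missing] | 4970 | 6.0%
[missing] | 4963 | 6.0%
- | 4938 | 6.0%
[missing] | 4820 | 5.9%
1 | 4785 | 5.8%
2 | 4137 | 5.0%
4 | 3841 | 4.7%
0 | 3774 | 4.6%
Other values (300) | 31366 | 38.1%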

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 43168 | 52.4%
Decimal Number | 34186 | 41.5%
Dash Punctuation | 4938 | 6.0%
Math Symbol | 56 | 0.1%
Other Punctuation | 7 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[missing] | 9568 | 22.2%
[missing] | 5193 | 12.0%
[missing] | 4970 | 11.5%
[missing] | 4963 | 11.5%
[missing] | 4820 | 11.2%
[missing] | 637 | 1.5%
[missing] | 567 | 1.3%
[missing] | 399 | 0.9%
[missing] | 384 | 0.9%
[missing] | 297 | 0.7%
Other values (287) | 11370 | 26.3%

Decimal Number
Value | Count | Frequency (%)
1 | 4785 | 14.0%
2 | 4137 | 12.1%
4 | 3841 | 11.2%
0 | 3774 | 11.0%
3 | 3443 | 10.1%
6 | 3193 | 9.3%
5 | 3055 | 8.9%
9 | 2739 | 8.0%
7 | 2735 | 8.0%
8 | 2484 | 7.3%

Dash Punctuation
Value | Count | Frequency (%)
- | 4938 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 56 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 7 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 43168 | 52.4%
Common | 39187 | 47.6%

Most frequent character per script

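Hangul
Value | Count | Frequency (%)
[missing] | 9568 | 22.2%
[missing] | 5193 | 12.0%
[missing] | 4970 | 11.5%
[missing] | 4963 | 11.5%
[missing] | 4820 | 11.2%
[missing] | 637 | 1.5%
[missing] | 567 | 1.3%
[missing] | 399 | 0.9%
[missing] | 384 | 0.9%
[missing] | 297 | 0.7%
Other values (287) | 11370 | 26.3%

Common
Value | Count | Frequency (%)
- | 4938 | 12.6%
1 | 4785 | 12.2%
2 | 4137 | 10.6%
4 | 3841 | 9.8%
0 | 3774 | 9.6%
3 | 3443 | 8.8%
6 | 3193 | 8.1%
5 | 3055 | 7.8%
9 | 2739 | 7.0%
7 | 2735 | 7.0%
Other values (3) | 2547 | 6.5%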

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 43168 | 52.4%
ASCII | 39187 | 47.6%

Most frequent character per block

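Hangul
Value | Count | Frequency (%)
[missing] | 9568 | 22.2%
[missing] | 5193 | 12.0%
[missing] | 4970 | 11.5%
[missing] | 4963 | 11.5%
[missing] | 4820 | 11.2%
[missing] | 637 | 1.5%
[missing] | 567 | 1.3%
[missing] | 399 | 0.9%
[missing] | 384 | 0.9%
[missing] | 297 | 0.7%
Other values (287) | 11370 | 26.3%

ASCII
Value | Count | Frequency (%)
- | 4938 | 12.6%
1 | 4785 | 12.2%
2 | 4137 | 10.6%
4 | 3841 | 9.8%
0 | 3774 | 9.6%
3 | 3443 | 8.8%
6 | 3193 | 8.1%
5 | 3055 | 7.8%
9 | 2739 | 7.0%
7 | 2735 | 7.0%
Other values (3) | 2547 | 6.5%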

도로종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

일반도로: 9996
자동차전용도로: 3
<NA>: 1

Length

Max length: 7
Median length: 4
Mean length: 4.0009
Min length: 4

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 일반도로
2nd row: 일반도로
3rd row: 일반도로
4th row: 일반도로
5th row: 일반도로

Common Values

Value | Count | Frequency (%)
일반도로 (general road) | 9996 | > 99.9%
자동차전용도로 (motor-vehicle-only road) | 3 | < 0.1%
<NA> | 1 | < 0.1%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

도로기능
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

국지도로: 6653
기타: 3283
보조간선도로: 50
주간선도로: 11
도시고속도로: 3

Length

Max length: 6
Median length: 4
Mean length: 3.3551
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 기타
2nd row: 기타
3rd row: 국지도로
4th row: 기타
5th row: 국지도로

Common Values

Value | Count | Frequency (%)
국지도로 (local road) | 6653 | 66.5%
기타 (other) | 3283 | 32.8%
보조간선도로 (minor arterial road) | 50 | 0.5%
주간선도로 (major arterial road) | 11 | 0.1%
도시고속도로 (urban expressway) | 3 | < 0.1%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

도로규모
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 13
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

소로: 7635
소로3류: 1149
소로2류: 438
소로1류: 210
중로1류: 205
Other values (8): 363

Length

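Max length: 4
Median length: 2
Mean length: 2.473
Min length: 2

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 소로
2nd row: 소로
3rd row: 소로
4th row: 소로
5th row: 소로

Common Values

Value | Count | Frequency (%)
소로 (minor road) | 7635 | 76.3%
소로3류 (minor road, class 3) | 1149 | 11.5%
소로2류 (minor road, class 2) | 438 | 4.4%
소로1류 (minor road, class 1) | 210 | 2.1%
중로1류 (medium road, class 1) | 205 | 2.1%
중로2류 (medium road, class 2) | 180 | 1.8%
확인불가 (cannot be confirmed) | 118 | 1.2%
대로3류 (major road, class 3) | 28 | 0.3%
대로2류 (major road, class 2) | 20 | 0.2%
대로1류 (major road, class 1) | 9 | 0.1%
Other values (3) | 8 | 0.1%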

Length

[Figure: histogram of lengths of the category]

도로폭
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 13
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

6m미만: 7635
폭6-8m: 1149
폭8-10m: 438
폭10-12m: 210
폭20-25m: 205
Other values (8): 363

Length

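Max length: 7
Median length: 4
Mean length: 4.4005
Min length: 4

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 6m미만
2nd row: 6m미만
3rd row: 6m미만
4th row: 6m미만
5th row: 6m미만

Common Values

Value | Count | Frequency (%)
6m미만 (under 6 m) | 7635 | 76.3%
폭6-8m (width 6-8 m) | 1149 | 11.5%
폭8-10m (width 8-10 m) | 438 | 4.4%
폭10-12m (width 10-12 m) | 210 | 2.1%
폭20-25m (width 20-25 m) | 205 | 2.1%
폭15-20m (width 15-20 m) | 180 | 1.8%
<NA> | 118 | 1.2%
폭25-30m (width 25-30 m) | 28 | 0.3%
폭30-35m (width 30-35 m) | 20 | 0.2%
폭35-40m (width 35-40 m) | 9 | 0.1%
Other values (3) | 8 | 0.1%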

Length

[Figure: histogram of lengths of the category]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap 1 of 3]

Variable | 순번 | 도로종류 | 도로기능 | 도로규모 | 도로폭
순번 | 1.000 | 0.027 | 0.738 | 0.165 | 0.149
도로종류 | 0.027 | 1.000 | 1.000 | 0.123 | 0.148
도로기능 | 0.738 | 1.000 | 1.000 | 0.626 | 0.617
도로규모 | 0.165 | 0.123 | 0.626 | 1.000 | 1.000
도로폭 | 0.149 | 0.148 | 0.617 | 1.000 | 1.000

[Figure: correlation heatmap 2 of 3]

Variable | 도로종류 | 도로규모 | 도로기능 | 도로폭
도로종류 | 1.000 | 0.114 | 1.000 | 0.115
도로규모 | 0.114 | 1.000 | 0.402 | 1.000
도로기능 | 1.000 | 0.402 | 1.000 | 0.400
도로폭 | 0.115 | 1.000 | 0.400 | 1.000

[Figure: correlation heatmap 3 of 3]

Variable | 순번 | 도로종류 | 도로기능 | 도로규모 | 도로폭
순번 | 1.000 | 0.021 | 0.396 | 0.069 | 0.063
도로종류 | 0.021 | 1.000 | 1.000 | 0.114 | 0.115
도로기능 | 0.396 | 1.000 | 1.000 | 0.402 | 0.400
도로규모 | 0.069 | 0.114 | 0.402 | 1.000 | 1.000
도로폭 | 0.063 | 0.115 | 0.400 | 1.000 | 1.000
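
A correlation of 1.000 between two categorical variables, as between 도로규모 and 도로폭, is what an association measure such as Cramér's V returns when each value of one variable maps to exactly one value of the other. The sketch below is a plain, uncorrected Cramér's V written for illustration; the report does not state which measure or correction the profiler applied, and the function name `cramers_v` is hypothetical. The one-to-one pairing of 소로 with 6m미만, 소로3류 with 폭6-8m, and 소로2류 with 폭8-10m is an assumption inferred from their identical counts.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table (list of rows of counts)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected

    k = min(len(table), len(table[0])) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 and n > 0 else 0.0


# Assumed pairing: rows are 도로규모 values, columns are 도로폭 values.
one_to_one = [
    [7635, 0, 0],
    [0, 1149, 0],
    [0, 0, 438],
]
# Toy table in which the two variables are independent.
independent = [[10, 20], [20, 40]]

print(cramers_v(one_to_one))   # approximately 1.0
print(cramers_v(independent))  # 0.0
```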

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

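First rows

Index | 순번 | 노선명(도로명) | 도로종류 | 도로기능 | 도로규모 | 도로폭
8962 | 8963 | 무명도로-25139 | 일반도로 | 기타 | 소로 | 6m미만
12651 | 12652 | 무명도로-44267 | 일반도로 | 기타 | 소로 | 6m미만
23706 | 23706 | 연세로13길 | 일반도로 | 국지도로 | 소로 | 6m미만
8586 | 8587 | 무명도로-24209 | 일반도로 | 기타 | 소로 | 6m미만
24087 | 24088 | 오패산로35길 | 일반도로 | 국지도로 | 소로 | 6m미만
16505 | 16506 | 무명도로-68141 | 일반도로 | 국지도로 | 소로 | 6m미만
20317 | 20316 | 사근동10가길 | 일반도로 | 국지도로 | 소로 | 6m미만
11044 | 11045 | 무명도로-34905 | 일반도로 | 기타 | 소로 | 6m미만
18017 | 18018 | 무명도로-90563 | 일반도로 | 국지도로 | 소로 | 6m미만
5607 | 5608 | 무명도로-10786 | 일반도로 | 기타 | 소로 | 6m미만

Last rows

Index | 순번 | 노선명(도로명) | 도로종류 | 도로기능 | 도로규모 | 도로폭
26063 | 26064 | 중랑천로50길 | 일반도로 | 국지도로 | 소로 | 6m미만
6364 | 6365 | 무명도로-14864 | 일반도로 | 기타 | 소로 | 6m미만
13682 | 13683 | 무명도로-47291 | 일반도로 | 기타 | 소로 | 6m미만
15085 | 15086 | 무명도로-5133 | 일반도로 | 기타 | 소로3류 | 폭6-8m
17838 | 17839 | 무명도로-90288 | 일반도로 | 국지도로 | 소로 | 6m미만
17931 | 17932 | 무명도로-90444 | 일반도로 | 국지도로 | 소로 | 6m미만
22245 | 22246 | 시루봉로15길 | 일반도로 | 국지도로 | 소로 | 6m미만
13599 | 13600 | 무명도로-47136 | 일반도로 | 기타 | 소로 | 6m미만
2200 | 2201 | 노해로35길 | 일반도로 | 국지도로 | 소로 | 6m미만
5106 | 5107 | 목동중앙로13나길 | 일반도로 | 국지도로 | 소로 | 6m미만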