Overview

Dataset statistics

Number of variables8
Number of observations1105
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory70.3 KiB
Average record size in memory65.1 B

Variable types

Numeric1
Categorical3
Text2
DateTime2

Dataset

Description인천광역시 서구에 위치한 도로의 도로명 부여 현황(도로 위계, 도로명, 도로 기점, 도로 종점, 부여사유 등)입니다.
URLhttps://www.data.go.kr/data/15063597/fileData.do

Alerts

종속구분 has constant value ""Constant
데이터 기준일 has constant value ""Constant
도로위계 is highly imbalanced (59.9%)Imbalance
연번 has unique valuesUnique
도로명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 16:23:17.443699
Analysis finished2023-12-12 16:23:18.433735
Duration0.99 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1105
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean553
Minimum1
Maximum1105
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.8 KiB
2023-12-13T01:23:18.526342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile56.2
Q1277
median553
Q3829
95-th percentile1049.8
Maximum1105
Range1104
Interquartile range (IQR)552

Descriptive statistics

Standard deviation319.13033
Coefficient of variation (CV)0.5770892
Kurtosis-1.2
Mean553
Median Absolute Deviation (MAD)276
Skewness0
Sum611065
Variance101844.17
MonotonicityStrictly increasing
2023-12-13T01:23:18.681881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
736 1
 
0.1%
742 1
 
0.1%
741 1
 
0.1%
740 1
 
0.1%
739 1
 
0.1%
738 1
 
0.1%
737 1
 
0.1%
735 1
 
0.1%
727 1
 
0.1%
Other values (1095) 1095
99.1%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1105 1
0.1%
1104 1
0.1%
1103 1
0.1%
1102 1
0.1%
1101 1
0.1%
1100 1
0.1%
1099 1
0.1%
1098 1
0.1%
1097 1
0.1%
1096 1
0.1%

도로위계
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size8.8 KiB
875 
219 
대로
 
9
고속도로
 
2

Length

Max length4
Median length1
Mean length1.0135747
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
875
79.2%
219
 
19.8%
대로 9
 
0.8%
고속도로 2
 
0.2%

Length

2023-12-13T01:23:18.828153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:23:18.952326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
875
79.2%
219
 
19.8%
대로 9
 
0.8%
고속도로 2
 
0.2%

도로명
Text

UNIQUE 

Distinct1105
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size8.8 KiB
2023-12-13T01:23:19.273995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length7.0832579
Min length3

Characters and Unicode

Total characters7827
Distinct characters195
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1105 ?
Unique (%)100.0%

Sample

1st row가경주로
2nd row가경주로10번길
3rd row가경주로20번길
4th row가경주로24번길
5th row가경주로40번길
ValueCountFrequency (%)
가경주로 1
 
0.1%
여우재로86번길 1
 
0.1%
여우재로 1
 
0.1%
여우재로111번길 1
 
0.1%
여우재로112번길 1
 
0.1%
여우재로75번길 1
 
0.1%
여우재로82번길 1
 
0.1%
어울로136번안길 1
 
0.1%
연희로 1
 
0.1%
어울로136번길 1
 
0.1%
Other values (1095) 1095
99.1%
2023-12-13T01:23:19.751072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1094
 
14.0%
888
 
11.3%
851
 
10.9%
1 427
 
5.5%
2 314
 
4.0%
3 269
 
3.4%
4 212
 
2.7%
8 198
 
2.5%
6 192
 
2.5%
5 182
 
2.3%
Other values (185) 3200
40.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5577
71.3%
Decimal Number 2250
28.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1094
19.6%
888
15.9%
851
15.3%
126
 
2.3%
99
 
1.8%
95
 
1.7%
83
 
1.5%
76
 
1.4%
70
 
1.3%
69
 
1.2%
Other values (175) 2126
38.1%
Decimal Number
ValueCountFrequency (%)
1 427
19.0%
2 314
14.0%
3 269
12.0%
4 212
9.4%
8 198
8.8%
6 192
8.5%
5 182
8.1%
7 179
8.0%
0 140
 
6.2%
9 137
 
6.1%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5577
71.3%
Common 2250
28.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1094
19.6%
888
15.9%
851
15.3%
126
 
2.3%
99
 
1.8%
95
 
1.7%
83
 
1.5%
76
 
1.4%
70
 
1.3%
69
 
1.2%
Other values (175) 2126
38.1%
Common
ValueCountFrequency (%)
1 427
19.0%
2 314
14.0%
3 269
12.0%
4 212
9.4%
8 198
8.8%
6 192
8.5%
5 182
8.1%
7 179
8.0%
0 140
 
6.2%
9 137
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5577
71.3%
ASCII 2250
28.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1094
19.6%
888
15.9%
851
15.3%
126
 
2.3%
99
 
1.8%
95
 
1.7%
83
 
1.5%
76
 
1.4%
70
 
1.3%
69
 
1.2%
Other values (175) 2126
38.1%
ASCII
ValueCountFrequency (%)
1 427
19.0%
2 314
14.0%
3 269
12.0%
4 212
9.4%
8 198
8.8%
6 192
8.5%
5 182
8.1%
7 179
8.0%
0 140
 
6.2%
9 137
 
6.1%

종속구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.8 KiB
주도로
1105 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주도로
2nd row주도로
3rd row주도로
4th row주도로
5th row주도로

Common Values

ValueCountFrequency (%)
주도로 1105
100.0%

Length

2023-12-13T01:23:19.921515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:23:20.018525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주도로 1105
100.0%
Distinct54
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size8.8 KiB
Minimum2008-12-13 00:00:00
Maximum2022-05-30 00:00:00
2023-12-13T01:23:20.109847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:23:20.214860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct54
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size8.8 KiB
Minimum2007-06-30 00:00:00
Maximum2022-05-30 00:00:00
2023-12-13T01:23:20.309806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:23:20.420938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1079
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size8.8 KiB
2023-12-13T01:23:20.701836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length100
Median length61
Mean length33.916742
Min length7

Characters and Unicode

Total characters37478
Distinct characters417
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1071 ?
Unique (%)96.9%

Sample

1st row가정동의 옛 이름인 "가경주"에서 차명
2nd row가경주로의 시작지점에서부터 약 100m지점에서 오른쪽으로 분기되는 도로
3rd row가경주로의 시작지점에서부터 약 200m지점에서 오른쪽으로 분기되는 도로
4th row가경주로의 시작지점에서부터 약 240m지점에서 오른쪽으로 분기되는 도로
5th row가경주로의 시작지점에서부터 약 400m지점에서 오른쪽으로 분기되는 도로
ValueCountFrequency (%)
도로 902
 
12.5%
분기되는 812
 
11.3%
651
 
9.0%
시작지점에서부터 591
 
8.2%
오른쪽으로 376
 
5.2%
왼쪽으로 348
 
4.8%
안쪽으로 56
 
0.8%
시작지점부터 55
 
0.8%
명명 50
 
0.7%
가정로의 47
 
0.7%
Other values (1410) 3325
46.1%
2023-12-13T01:23:21.116272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6116
 
16.3%
2869
 
7.7%
1632
 
4.4%
1574
 
4.2%
1548
 
4.1%
1488
 
4.0%
1065
 
2.8%
957
 
2.6%
929
 
2.5%
0 883
 
2.4%
Other values (407) 18417
49.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27200
72.6%
Space Separator 6116
 
16.3%
Decimal Number 2855
 
7.6%
Lowercase Letter 755
 
2.0%
Other Punctuation 511
 
1.4%
Open Punctuation 16
 
< 0.1%
Close Punctuation 16
 
< 0.1%
Uppercase Letter 8
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2869
 
10.5%
1632
 
6.0%
1574
 
5.8%
1548
 
5.7%
1488
 
5.5%
1065
 
3.9%
957
 
3.5%
929
 
3.4%
877
 
3.2%
853
 
3.1%
Other values (370) 13408
49.3%
Lowercase Letter
ValueCountFrequency (%)
m 735
97.4%
l 4
 
0.5%
e 3
 
0.4%
t 2
 
0.3%
g 2
 
0.3%
n 2
 
0.3%
a 2
 
0.3%
y 1
 
0.1%
o 1
 
0.1%
w 1
 
0.1%
Other values (2) 2
 
0.3%
Decimal Number
ValueCountFrequency (%)
0 883
30.9%
1 389
13.6%
2 288
 
10.1%
3 253
 
8.9%
4 195
 
6.8%
8 187
 
6.5%
6 179
 
6.3%
7 174
 
6.1%
5 173
 
6.1%
9 134
 
4.7%
Other Punctuation
ValueCountFrequency (%)
, 429
84.0%
" 36
 
7.0%
' 25
 
4.9%
. 16
 
3.1%
; 2
 
0.4%
& 2
 
0.4%
/ 1
 
0.2%
Uppercase Letter
ValueCountFrequency (%)
M 5
62.5%
I 1
 
12.5%
P 1
 
12.5%
H 1
 
12.5%
Space Separator
ValueCountFrequency (%)
6116
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27198
72.6%
Common 9515
 
25.4%
Latin 763
 
2.0%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2869
 
10.5%
1632
 
6.0%
1574
 
5.8%
1548
 
5.7%
1488
 
5.5%
1065
 
3.9%
957
 
3.5%
929
 
3.4%
877
 
3.2%
853
 
3.1%
Other values (368) 13406
49.3%
Common
ValueCountFrequency (%)
6116
64.3%
0 883
 
9.3%
, 429
 
4.5%
1 389
 
4.1%
2 288
 
3.0%
3 253
 
2.7%
4 195
 
2.0%
8 187
 
2.0%
6 179
 
1.9%
7 174
 
1.8%
Other values (11) 422
 
4.4%
Latin
ValueCountFrequency (%)
m 735
96.3%
M 5
 
0.7%
l 4
 
0.5%
e 3
 
0.4%
t 2
 
0.3%
g 2
 
0.3%
n 2
 
0.3%
a 2
 
0.3%
I 1
 
0.1%
y 1
 
0.1%
Other values (6) 6
 
0.8%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27198
72.6%
ASCII 10278
 
27.4%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6116
59.5%
0 883
 
8.6%
m 735
 
7.2%
, 429
 
4.2%
1 389
 
3.8%
2 288
 
2.8%
3 253
 
2.5%
4 195
 
1.9%
8 187
 
1.8%
6 179
 
1.7%
Other values (27) 624
 
6.1%
Hangul
ValueCountFrequency (%)
2869
 
10.5%
1632
 
6.0%
1574
 
5.8%
1548
 
5.7%
1488
 
5.5%
1065
 
3.9%
957
 
3.5%
929
 
3.4%
877
 
3.2%
853
 
3.1%
Other values (368) 13406
49.3%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

데이터 기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.8 KiB
2023-07-10
1105 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-10
2nd row2023-07-10
3rd row2023-07-10
4th row2023-07-10
5th row2023-07-10

Common Values

ValueCountFrequency (%)
2023-07-10 1105
100.0%

Length

2023-12-13T01:23:21.227875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:23:21.296786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-10 1105
100.0%

Interactions

2023-12-13T01:23:18.080852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:23:21.342788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번도로위계고시일자부여일자
연번1.0000.2240.7600.762
도로위계0.2241.0000.9060.893
고시일자0.7600.9061.0000.999
부여일자0.7620.8930.9991.000
2023-12-13T01:23:21.418137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번도로위계
연번1.0000.135
도로위계0.1351.000

Missing values

2023-12-13T01:23:18.219435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:23:18.367548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번도로위계도로명종속구분고시일자부여일자부여사유데이터 기준일
01가경주로주도로2009-09-222009-08-28가정동의 옛 이름인 "가경주"에서 차명2023-07-10
12가경주로10번길주도로2009-09-222009-08-28가경주로의 시작지점에서부터 약 100m지점에서 오른쪽으로 분기되는 도로2023-07-10
23가경주로20번길주도로2009-09-222009-08-28가경주로의 시작지점에서부터 약 200m지점에서 오른쪽으로 분기되는 도로2023-07-10
34가경주로24번길주도로2009-09-222009-08-28가경주로의 시작지점에서부터 약 240m지점에서 오른쪽으로 분기되는 도로2023-07-10
45가경주로40번길주도로2009-09-222009-08-28가경주로의 시작지점에서부터 약 400m지점에서 오른쪽으로 분기되는 도로2023-07-10
56가남로주도로2009-09-222009-08-28도로구간 내에 있는 가정동, 가좌동의 머릿글자와 석남동의 뒷글자를 따서 명명2023-07-10
67가남로291번길주도로2009-09-222009-08-28가남로의 시작지점에서부터 약 2,910m지점에서 왼쪽으로 분기되는 도로2023-07-10
78가람로주도로2012-10-252012-10-19"강" 을 뜻하는 순우리말에서 착안2023-07-10
89가석로주도로2009-09-222009-08-28도로구간 내에 있는 가정동, 가좌동과 석남동의 머릿글자를 따서 명명2023-07-10
910가석로126번길주도로2009-09-222009-08-28가석로의 시작지점에서부터 약 1,260m지점에서 오른쪽으로 분기되는 도로2023-07-10
연번도로위계도로명종속구분고시일자부여일자부여사유데이터 기준일
10951096향동1길주도로2009-09-222009-08-28향동로의 시작지점에서부터 첫번째로 분기된 도로2023-07-10
10961097향동고개길주도로2009-09-222009-08-28옛 자연부락의 명칭을 사용하여 향동고개로라 명명2023-07-10
10971098향동로주도로2009-07-102009-12-17지역명칭 인용2023-07-10
10981099허암길주도로2009-09-222009-08-28허암 정희량 선생의 호를 따 허암길이라 명명함2023-07-10
10991100호두산로주도로2009-09-222009-08-28도로진행방향에 호두산이 위치하고 있어 호두산로라 명명함2023-07-10
11001101호두산로10번길주도로2009-09-222008-08-28호두산로의 시작지점에서부터 약 100m지점에서 오른쪽으로 분기되는 도로2023-07-10
11011102호두산로58번길주도로2009-09-222008-08-28호두산로의 시작지점에서부터 약 580m지점에서 오른쪽으로 분기되는 도로2023-07-10
11021103호두산로94번길주도로2009-09-222008-08-28호두산로의 시작지점에서부터 약 940m지점에서 오른쪽으로 분기되는 도로2023-07-10
11031104환경로주도로2012-03-022009-08-28종합환경연구단지가 입지한 지역적특성 반영2023-07-10
11041105환경로28번길주도로2012-03-022009-08-28종합환경연구단지가 입지한 지역적 특성 반영2023-07-10