Overview

Dataset statistics

Number of variables4
Number of observations31
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 KiB
Average record size in memory36.1 B

Variable types

Text2
DateTime1
Categorical1

Dataset

Description경기도 이천시 도로개발계획으로 사업명, 연장, 개통연도, 추진단계(공사완료, 보상중, 시공중) 등을 알수 있습니다.
Author경기도 이천시
URLhttps://www.data.go.kr/data/3065098/fileData.do

Alerts

추진단계 is highly imbalanced (52.4%)Imbalance
사업명 has unique valuesUnique
연장 has unique valuesUnique

Reproduction

Analysis started2024-03-14 09:54:26.345720
Analysis finished2024-03-14 09:54:27.041040
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업명
Text

UNIQUE 

Distinct31
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size376.0 B
2024-03-14T18:54:27.691930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length23
Mean length18.419355
Min length8

Characters and Unicode

Total characters571
Distinct characters98
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)100.0%

Sample

1st row도자특구 진입도로
2nd row동산리 농어촌도로(2공구)
3rd row수광-마교간 농어촌도로
4th row현방-우곡간 도로확포장공사(2공구)
5th row대흥-초지간 도로확포장공사
ValueCountFrequency (%)
도시계획도로 11
 
14.5%
농어촌도로 8
 
10.5%
도로확포장공사 2
 
2.6%
확포장공사 2
 
2.6%
도자특구 1
 
1.3%
갈산동 1
 
1.3%
개설(중로1-10 1
 
1.3%
관고동 1
 
1.3%
개설(소로2-321,323 1
 
1.3%
송정동 1
 
1.3%
Other values (47) 47
61.8%
2024-03-14T18:54:28.821016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
49
 
8.6%
47
 
8.2%
45
 
7.9%
- 29
 
5.1%
( 19
 
3.3%
) 19
 
3.3%
1 16
 
2.8%
15
 
2.6%
2 15
 
2.6%
14
 
2.5%
Other values (88) 303
53.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 395
69.2%
Decimal Number 61
 
10.7%
Space Separator 45
 
7.9%
Dash Punctuation 29
 
5.1%
Open Punctuation 19
 
3.3%
Close Punctuation 19
 
3.3%
Other Punctuation 3
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
12.4%
47
 
11.9%
15
 
3.8%
14
 
3.5%
14
 
3.5%
13
 
3.3%
13
 
3.3%
12
 
3.0%
12
 
3.0%
12
 
3.0%
Other values (74) 194
49.1%
Decimal Number
ValueCountFrequency (%)
1 16
26.2%
2 15
24.6%
3 11
18.0%
0 8
13.1%
6 3
 
4.9%
9 3
 
4.9%
5 2
 
3.3%
4 2
 
3.3%
8 1
 
1.6%
Space Separator
ValueCountFrequency (%)
45
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 29
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 395
69.2%
Common 176
30.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
12.4%
47
 
11.9%
15
 
3.8%
14
 
3.5%
14
 
3.5%
13
 
3.3%
13
 
3.3%
12
 
3.0%
12
 
3.0%
12
 
3.0%
Other values (74) 194
49.1%
Common
ValueCountFrequency (%)
45
25.6%
- 29
16.5%
( 19
10.8%
) 19
10.8%
1 16
 
9.1%
2 15
 
8.5%
3 11
 
6.2%
0 8
 
4.5%
, 3
 
1.7%
6 3
 
1.7%
Other values (4) 8
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 395
69.2%
ASCII 176
30.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
49
 
12.4%
47
 
11.9%
15
 
3.8%
14
 
3.5%
14
 
3.5%
13
 
3.3%
13
 
3.3%
12
 
3.0%
12
 
3.0%
12
 
3.0%
Other values (74) 194
49.1%
ASCII
ValueCountFrequency (%)
45
25.6%
- 29
16.5%
( 19
10.8%
) 19
10.8%
1 16
 
9.1%
2 15
 
8.5%
3 11
 
6.2%
0 8
 
4.5%
, 3
 
1.7%
6 3
 
1.7%
Other values (4) 8
 
4.5%

연장
Text

UNIQUE 

Distinct31
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size376.0 B
2024-03-14T18:54:29.570738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length15.129032
Min length13

Characters and Unicode

Total characters469
Distinct characters20
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)100.0%

Sample

1st rowL=1.7km, B=20m
2nd rowL=0.9km, B=10m
3rd rowL=1.92km, B=10m
4th rowL=1.7km, B=10m
5th rowL=1.9km, B=8.5m
ValueCountFrequency (%)
b=10m 8
 
15.7%
b=10.5m 3
 
5.9%
l=1.7km 2
 
3.9%
l=0.8km 1
 
2.0%
l=0.37km,b=15m 1
 
2.0%
l=0.56km,b=20m 1
 
2.0%
l=1.213km,b=8m 1
 
2.0%
l=0.684km,b=10m 1
 
2.0%
l=1.3km,b=20m 1
 
2.0%
l=0.19km,b=10.5m 1
 
2.0%
Other values (31) 31
60.8%
2024-03-14T18:54:30.762954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
= 62
13.2%
m 59
12.6%
1 40
8.5%
0 40
8.5%
. 37
7.9%
, 32
 
6.8%
L 31
 
6.6%
B 31
 
6.6%
k 26
 
5.5%
25
 
5.3%
Other values (10) 86
18.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 161
34.3%
Lowercase Letter 85
18.1%
Other Punctuation 69
14.7%
Math Symbol 62
 
13.2%
Uppercase Letter 62
 
13.2%
Space Separator 25
 
5.3%
Other Symbol 3
 
0.6%
Dash Punctuation 2
 
0.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 40
24.8%
0 40
24.8%
5 15
 
9.3%
8 12
 
7.5%
4 11
 
6.8%
2 11
 
6.8%
3 9
 
5.6%
9 8
 
5.0%
7 8
 
5.0%
6 7
 
4.3%
Lowercase Letter
ValueCountFrequency (%)
m 59
69.4%
k 26
30.6%
Other Punctuation
ValueCountFrequency (%)
. 37
53.6%
, 32
46.4%
Uppercase Letter
ValueCountFrequency (%)
L 31
50.0%
B 31
50.0%
Math Symbol
ValueCountFrequency (%)
= 62
100.0%
Space Separator
ValueCountFrequency (%)
25
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 322
68.7%
Latin 147
31.3%

Most frequent character per script

Common
ValueCountFrequency (%)
= 62
19.3%
1 40
12.4%
0 40
12.4%
. 37
11.5%
, 32
9.9%
25
7.8%
5 15
 
4.7%
8 12
 
3.7%
4 11
 
3.4%
2 11
 
3.4%
Other values (6) 37
11.5%
Latin
ValueCountFrequency (%)
m 59
40.1%
L 31
21.1%
B 31
21.1%
k 26
17.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 466
99.4%
CJK Compat 3
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
= 62
13.3%
m 59
12.7%
1 40
8.6%
0 40
8.6%
. 37
7.9%
, 32
 
6.9%
L 31
 
6.7%
B 31
 
6.7%
k 26
 
5.6%
25
 
5.4%
Other values (9) 83
17.8%
CJK Compat
ValueCountFrequency (%)
3
100.0%
Distinct21
Distinct (%)67.7%
Missing0
Missing (%)0.0%
Memory size376.0 B
Minimum2017-11-01 00:00:00
Maximum2022-04-01 00:00:00
2024-03-14T18:54:31.114488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T18:54:31.472248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)

추진단계
Categorical

IMBALANCE 

Distinct3
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Memory size376.0 B
공사완료
26 
시공중
보상중
 
1

Length

Max length4
Median length4
Mean length3.8387097
Min length3

Unique

Unique1 ?
Unique (%)3.2%

Sample

1st row공사완료
2nd row공사완료
3rd row공사완료
4th row공사완료
5th row공사완료

Common Values

ValueCountFrequency (%)
공사완료 26
83.9%
시공중 4
 
12.9%
보상중 1
 
3.2%

Length

2024-03-14T18:54:31.878711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T18:54:32.199708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사완료 26
83.9%
시공중 4
 
12.9%
보상중 1
 
3.2%

Correlations

2024-03-14T18:54:32.401950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업명연장개통연도추진단계
사업명1.0001.0001.0001.000
연장1.0001.0001.0001.000
개통연도1.0001.0001.0000.322
추진단계1.0001.0000.3221.000

Missing values

2024-03-14T18:54:26.640368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T18:54:26.929476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업명연장개통연도추진단계
0도자특구 진입도로L=1.7km, B=20m2017-12-01공사완료
1동산리 농어촌도로(2공구)L=0.9km, B=10m2018-12-01공사완료
2수광-마교간 농어촌도로L=1.92km, B=10m2019-07-01공사완료
3현방-우곡간 도로확포장공사(2공구)L=1.7km, B=10m2019-04-01공사완료
4대흥-초지간 도로확포장공사L=1.9km, B=8.5m2019-05-01공사완료
5원두-소사간 도로확포장공사L=1.5km, B=11m2019-04-01공사완료
6송계리 농어촌도로 확포장공사L=0.66km, B=8m2018-02-01공사완료
7총곡리 농어촌도로L=0.37km, B=10m2017-11-01공사완료
8도암-장동 농어촌도로L=0.574km, B=10m2018-11-01공사완료
9백사생활체육공원-연당간 농어촌도로L=0.45km, B=10m2018-07-01공사완료
사업명연장개통연도추진단계
21유산-고담간 도시계획도로 개설(중로1-36)L=1.3km,B=20m2019-12-01공사완료
22사음동 도시계획도로 개설(소로1-100)L=0.19km,B=10.5m2018-06-01공사완료
23수정교차로-부발역사간 농어촌도로 확포장공사L=1.87km, B=10.5m2020-12-01공사완료
24초지-장평간 도로 확포장공사(시도20호선)L=0.8km, B=11.9m2020-12-01공사완료
25죽당천 제방도로L=4.8km, B=11.0m2021-02-01공사완료
26도봉-장동간 농어촌도로L=2.5km, B=10m2022-04-01시공중
27와현-풍계간 도로(시도6호선)L=3.37㎞, B=10.5m2022-04-01시공중
28작촌-해월간 도로확포장공사(시도19호선)L=1,84㎞, B=10.5m2022-04-01시공중
29송말리 농어촌도로L=0.96㎞, B=10.0m2022-01-01시공중
30설성체육공원 진입도로(2공구)L=358m, B=10m2021-06-01공사완료