Overview

Dataset statistics

Number of variables: 14
Number of observations: 130
Missing cells: 1138
Missing cells (%): 62.5%
Duplicate rows: 19
Duplicate rows (%): 14.6%
Total size in memory: 14.3 KiB
Average record size in memory: 113.0 B
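The two percentages above follow from the raw counts. A minimal sketch, assuming the missing-cell share is taken over all cells (variables × observations) and the duplicate share over all rows; the variable names are illustrative:

```python
# Counts copied from the dataset statistics above.
n_variables = 14
n_observations = 130
missing_cells = 1138
duplicate_rows = 19

# Assumption: shares are computed over all cells and all rows respectively.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
print(f"{duplicate_pct:.1f}% duplicate rows")
```

Both results round to the reported 62.5% and 14.6%.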

Variable types

Text: 2
Categorical: 1
Unsupported: 11

Dataset

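Description: 부산광역시서구_도로현황데이터_20230704 (Seo-gu, Busan Metropolitan City, road status data, 2023-07-04)
Author: 부산광역시 서구 (Seo-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15116265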

Alerts

Unnamed: 13 has constant value "비고" [Constant]
Dataset has 19 (14.6%) duplicate rows [Duplicates]
차로별 도로현황(총괄) has 55 (42.3%) missing values [Missing]
Unnamed: 2 has 33 (25.4%) missing values [Missing]
Unnamed: 3 has 99 (76.2%) missing values [Missing]
Unnamed: 4 has 24 (18.5%) missing values [Missing]
Unnamed: 5 has 32 (24.6%) missing values [Missing]
Unnamed: 6 has 32 (24.6%) missing values [Missing]
Unnamed: 7 has 114 (87.7%) missing values [Missing]
Unnamed: 8 has 124 (95.4%) missing values [Missing]
Unnamed: 9 has 124 (95.4%) missing values [Missing]
Unnamed: 10 has 124 (95.4%) missing values [Missing]
Unnamed: 11 has 126 (96.9%) missing values [Missing]
Unnamed: 12 has 122 (93.8%) missing values [Missing]
Unnamed: 13 has 129 (99.2%) missing values [Missing]
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
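The per-column missing counts in these alerts can be reconciled with the dataset-level total of 1138 missing cells. A minimal sketch, assuming each percentage is the column's missing count divided by the 130 observations; the 90% cut-off used to flag mostly empty columns is an arbitrary illustration, not a setting taken from the report:

```python
# Missing-value counts per column, copied from the alerts above.
# "Unnamed: 1" has no missing-value alert; its variable section reports 0 missing.
missing_by_column = {
    "차로별 도로현황(총괄)": 55,
    "Unnamed: 1": 0,
    "Unnamed: 2": 33,
    "Unnamed: 3": 99,
    "Unnamed: 4": 24,
    "Unnamed: 5": 32,
    "Unnamed: 6": 32,
    "Unnamed: 7": 114,
    "Unnamed: 8": 124,
    "Unnamed: 9": 124,
    "Unnamed: 10": 124,
    "Unnamed: 11": 126,
    "Unnamed: 12": 122,
    "Unnamed: 13": 129,
}
n_observations = 130

# The column counts should add up to the dataset-level "Missing cells" figure.
total_missing = sum(missing_by_column.values())

# Share of missing rows per column, rounded the way the report prints it.
missing_share = {
    name: round(100 * count / n_observations, 1)
    for name, count in missing_by_column.items()
}

# Hypothetical threshold: flag columns missing in more than 90% of rows.
mostly_empty = [
    name for name, count in missing_by_column.items()
    if count / n_observations > 0.9
]

print(total_missing)
print(mostly_empty)
```

Columns that are almost entirely empty and named "Unnamed: N" are typical of a spreadsheet with merged or multi-row headers that was read as a flat table, which is worth checking before any further analysis.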

Reproduction

Analysis started: 2023-12-10 16:37:03.529793
Analysis finished: 2023-12-10 16:37:04.616035
Duration: 1.09 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
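The reported duration is the difference between the two timestamps. A short sketch using only the standard library, assuming both timestamps share the same time zone:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section above.
started = datetime.fromisoformat("2023-12-10 16:37:03.529793")
finished = datetime.fromisoformat("2023-12-10 16:37:04.616035")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```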

Variables

차로별 도로현황(총괄)
Text

Distinct: 51
Distinct (%): 68.0%
Missing: 55
Missing (%): 42.3%
Memory size: 1.1 KiB

Length

Max length: 11
Median length: 9
Mean length: 6.0933333
Min length: 2
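These length statistics are computed over the non-missing string values of the column. A minimal sketch, using the first five values of this variable (listed under Sample) as input and assuming length is counted in Unicode code points, which is what Python's len() returns:

```python
from statistics import mean, median

# The first five values of this variable, as listed under Sample.
values = ["구 분", "총 계", "고속국도", "일반국도", "광역시도"]

# Length in code points, including the space inside "구 분" and "총 계".
lengths = [len(value) for value in values]

summary = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(summary)

# Cross-check of the reported mean: 457 total characters over the
# 130 - 55 = 75 non-missing values of this column.
reported_mean = 457 / (130 - 55)
print(round(reported_mean, 7))
```

The cross-check reproduces the reported mean length of 6.0933333.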

Characters and Unicode

Total characters: 457
Distinct characters: 48
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
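The category counts reported in this section can be approximated with Python's unicodedata module. A minimal sketch over the five sample values of this variable. The standard library exposes the general category but not the script property, so the script is approximated here from the character name; that proxy, and labelling everything else "Common", are assumptions that only hold for text like this sample:

```python
import unicodedata
from collections import Counter

# The first five values of this variable, as listed under Sample.
values = ["구 분", "총 계", "고속국도", "일반국도", "광역시도"]
characters = [ch for value in values for ch in value]

# General category per character: "Lo" = Other Letter, "Zs" = Space Separator.
category_counts = Counter(unicodedata.category(ch) for ch in characters)

# Rough script proxy (assumption): Hangul characters have names that start
# with "HANGUL"; anything else in this sample is treated as "Common".
script_counts = Counter(
    "Hangul" if unicodedata.name(ch, "").startswith("HANGUL") else "Common"
    for ch in characters
)

print(category_counts)
print(script_counts)
```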

Unique

Unique: 39
Unique (%): 52.0%
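"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. A minimal sketch of the difference, assuming both percentages are taken over the 75 non-missing values; the small input list is illustrative and not the actual column:

```python
from collections import Counter

# Illustrative values only: two repeated values and two that occur once.
values = ["소로 3-17"] * 5 + ["대로 1-8"] * 3 + ["구 분", "총 계"]
counts = Counter(values)

n_distinct = len(counts)                              # different values
n_unique = sum(1 for n in counts.values() if n == 1)  # values seen exactly once
print(n_distinct, n_unique)

# Cross-check of the reported percentages for this column.
non_missing = 130 - 55
distinct_pct = round(100 * 51 / non_missing, 1)
unique_pct = round(100 * 39 / non_missing, 1)
print(distinct_pct, unique_pct)
```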

Sample

1st row: 구 분 (category)
2nd row: 총 계 (total)
3rd row: 고속국도 (expressway)
4th row: 일반국도 (national highway)
5th row: 광역시도 (metropolitan city road)
Value  Count  Frequency (%)
소로 (minor road)  32  24.6%
중로 (medium road)  10  7.7%
서구 (Seo-gu)  6  4.6%
노선번호 (route number)  6  4.6%
도로현황 (road status)  6  4.6%
3-17  5  3.8%
1-8  3  2.3%
[not captured]  3  2.3%
대로 (major road)  3  2.3%
3-1  2  1.5%
Other values (45)  54  41.5%

Most occurring characters

Value  Count  Frequency (%)
(space)  73  16.0%
[not captured]  54  11.8%
-  44  9.6%
3  41  9.0%
[not captured]  32  7.0%
1  32  7.0%
[not captured]  13  2.8%
[not captured]  13  2.8%
8  12  2.6%
2  12  2.6%
Other values (38)  131  28.7%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  202  44.2%
Decimal Number  137  30.0%
Space Separator  73  16.0%
Dash Punctuation  44  9.6%
Control  1  0.2%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
[not captured]  54  26.7%
[not captured]  32  15.8%
[not captured]  13  6.4%
[not captured]  13  6.4%
[not captured]  10  5.0%
[not captured]  8  4.0%
[not captured]  7  3.5%
[not captured]  6  3.0%
[not captured]  6  3.0%
[not captured]  6  3.0%
Other values (25)  47  23.3%

Decimal Number
Value  Count  Frequency (%)
3  41  29.9%
1  32  23.4%
8  12  8.8%
2  12  8.8%
7  11  8.0%
0  9  6.6%
4  7  5.1%
5  5  3.6%
6  5  3.6%
9  3  2.2%

Space Separator
Value  Count  Frequency (%)
(space)  73  100.0%

Dash Punctuation
Value  Count  Frequency (%)
-  44  100.0%

Control
Value  Count  Frequency (%)
(control character)  1  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Common  255  55.8%
Hangul  202  44.2%

Most frequent character per script

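Hangul
Value  Count  Frequency (%)
[not captured]  54  26.7%
[not captured]  32  15.8%
[not captured]  13  6.4%
[not captured]  13  6.4%
[not captured]  10  5.0%
[not captured]  8  4.0%
[not captured]  7  3.5%
[not captured]  6  3.0%
[not captured]  6  3.0%
[not captured]  6  3.0%
Other values (25)  47  23.3%

Common
Value  Count  Frequency (%)
(space)  73  28.6%
-  44  17.3%
3  41  16.1%
1  32  12.5%
8  12  4.7%
2  12  4.7%
7  11  4.3%
0  9  3.5%
4  7  2.7%
5  5  2.0%
Other values (3)  9  3.5%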

Most occurring blocks

Value  Count  Frequency (%)
ASCII  255  55.8%
Hangul  202  44.2%

Most frequent character per block

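ASCII
Value  Count  Frequency (%)
(space)  73  28.6%
-  44  17.3%
3  41  16.1%
1  32  12.5%
8  12  4.7%
2  12  4.7%
7  11  4.3%
0  9  3.5%
4  7  2.7%
5  5  2.0%
Other values (3)  9  3.5%

Hangul
Value  Count  Frequency (%)
[not captured]  54  26.7%
[not captured]  32  15.8%
[not captured]  13  6.4%
[not captured]  13  6.4%
[not captured]  10  5.0%
[not captured]  8  4.0%
[not captured]  7  3.5%
[not captured]  6  3.0%
[not captured]  6  3.0%
[not captured]  6  3.0%
Other values (25)  47  23.3%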

Unnamed: 1
Categorical

Distinct: 31
Distinct (%): 23.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
Category frequency plot (top categories): <NA> (57), (중용연장), [전용연장], 전체연장, 보수로; other values (26): 43

Length

Max length: 8
Median length: 4
Mean length: 4.3076923
Min length: 2

Unique

Unique: 20
Unique (%): 15.4%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: (중용연장) (overlapping length)
4th row: [전용연장] (exclusive length)
5th row: 전체연장 (total length)

Common Values

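Value  Count  Frequency (%)
<NA>  57  43.8%
(중용연장) (overlapping length)  8  6.2%
[전용연장] (exclusive length)  8  6.2%
전체연장 (total length)  8  6.2%
보수로 (Bosu-ro)  6  4.6%
노 선 명 (route name)  6  4.6%
충무로 (Chungmu-ro)  5  3.8%
충무대로 (Chungmu-daero)  5  3.8%
구덕로 (Gudeok-ro)  3  2.3%
암남공원로 (Amnamgongwon-ro)  2  1.5%
Other values (21)  22  16.9%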

Length

Histogram of lengths of the category
Value  Count  Frequency (%)
na  57  39.3%
전용연장  8  5.5%
전체연장  8  5.5%
중용연장  8  5.5%
보수로  6  4.1%
[not captured]  6  4.1%
[not captured]  6  4.1%
[not captured]  6  4.1%
충무로  5  3.4%
충무대로  5  3.4%
Other values (24)  30  20.7%

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 33
Missing (%): 25.4%
Memory size: 1.1 KiB

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 99
Missing (%): 76.2%
Memory size: 1.1 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 24
Missing (%): 18.5%
Memory size: 1.1 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 32
Missing (%): 24.6%
Memory size: 1.1 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 32
Missing (%): 24.6%
Memory size: 1.1 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 114
Missing (%): 87.7%
Memory size: 1.1 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 124
Missing (%): 95.4%
Memory size: 1.1 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 124
Missing (%): 95.4%
Memory size: 1.1 KiB

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 124
Missing (%): 95.4%
Memory size: 1.1 KiB

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 126
Missing (%): 96.9%
Memory size: 1.1 KiB

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 122
Missing (%): 93.8%
Memory size: 1.1 KiB

Unnamed: 13
Text

CONSTANT  MISSING 

Distinct: 1
Distinct (%): 100.0%
Missing: 129
Missing (%): 99.2%
Memory size: 1.1 KiB

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Characters and Unicode

Total characters: 2
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): 100.0%

Sample

1st row: 비고 (remarks)

Value  Count  Frequency (%)
비고 (remarks)  1  100.0%

Most occurring characters

Value  Count  Frequency (%)
비  1  50.0%
고  1  50.0%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  2  100.0%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
비  1  50.0%
고  1  50.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  2  100.0%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
비  1  50.0%
고  1  50.0%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  2  100.0%

Most frequent character per block

Hangul
Value  Count  Frequency (%)
비  1  50.0%
고  1  50.0%

Correlations

(variable)  차로별 도로현황(총괄)  Unnamed: 1
차로별 도로현황(총괄)  1.000  1.000
Unnamed: 1  1.000  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
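One common way to compute a nullity correlation is the Pearson correlation between the presence indicators of two columns. A minimal sketch under that assumption; the function name and the toy columns are hypothetical and are not taken from the report or from any library:

```python
from math import sqrt

def nullity_correlation(col_a, col_b):
    """Pearson correlation between presence indicators (1 = present, 0 = missing)."""
    a = [0 if value is None else 1 for value in col_a]
    b = [0 if value is None else 1 for value in col_b]
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    covariance = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # Undefined when a column is entirely present or entirely missing.
        return None
    return covariance / sqrt(var_a * var_b)

# Toy columns: x and y are missing in the same rows, z in the opposite rows.
col_x = [1, None, 3, None]
col_y = [5, None, 7, None]
col_z = [None, 2, None, 4]

print(nullity_correlation(col_x, col_y))  # columns that are missing together
print(nullity_correlation(col_x, col_z))  # columns that are never missing together
```

A value near 1 means two columns tend to be filled or empty in the same rows, a value near -1 means one is filled where the other is empty, and a column with no variation in missingness has no defined value.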

Sample

차로별 도로현황(총괄)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13
0구 분<NA>노선수연장(m)포 장 도 (m)NaNNaNNaNNaNNaNNaN미포장도\n(m)미개설도\n(m)비고
1<NA><NA>NaNNaN1 차선2 차선4 차선6 차선8 차선10차이상NaNNaN<NA>
2총 계(중용연장)NaN0000000000<NA>
3<NA>[전용연장]7611714795284200512634436554375665512028021863<NA>
4<NA>전체연장7611714795284200512634436554375665512028021863<NA>
5고속국도(중용연장)NaN00NaNNaNNaNNaNNaNNaNNaNNaN<NA>
6<NA>[전용연장]NaN00NaNNaNNaNNaNNaNNaNNaNNaN<NA>
7<NA>전체연장NaN00NaNNaNNaNNaNNaNNaNNaNNaN<NA>
8일반국도(중용연장)NaN00NaNNaNNaNNaNNaNNaNNaNNaN<NA>
9<NA>[전용연장]229802980NaNNaN2980NaNNaNNaNNaNNaN<NA>
차로별 도로현황(총괄)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13
120대로1-8충무대로송도아랫길NaNNaN25~353218NaNNaNNaNNaNNaNNaN<NA>
121대로 1-8충무대로충무동 1가 91-1NaNNaN35220NaNNaNNaNNaNNaNNaN<NA>
122대로1-8충무대로암남동사무소 앞NaN송도교차로35152NaNNaNNaNNaNNaNNaN<NA>
123대로 1-8충무대로충무동1가 14-119NaN공동어시장냉동창고 일원35770NaNNaNNaNNaNNaNNaN<NA>
12410차선 도로현황<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN<NA>
125노선번호노 선 명구 간NaNNaN재 원NaN비 고NaNNaNNaNNaNNaN<NA>
126<NA><NA>시 점경 유종 점폭(m)길이(m)NaNNaNNaNNaNNaNNaN<NA>
127서구<NA>NaNNaNNaNNaN2028NaNNaNNaNNaNNaNNaN<NA>
128<NA>보수로구덕운동장 앞 육교NaN동아대부속병원 진입로70235NaNNaNNaNNaNNaNNaN<NA>
129<NA>광로부평교차로NaN구덕운동장 육교40~701793NaNNaNNaNNaNNaNNaN<NA>

Duplicate rows

Most frequently occurring

(index)  차로별 도로현황(총괄)  Unnamed: 1  Unnamed: 13  # duplicates
11  <NA>  [전용연장]  <NA>  8
16  <NA>  전체연장  <NA>  8
18  <NA>  <NA>  <NA>  7
0  노선번호  노 선 명  <NA>  6
3  서구  <NA>  <NA>  6
14  <NA>  보수로  <NA>  6
7  소로 3-17  <NA>  <NA>  5
17  <NA>  충무로  <NA>  5
1  대로 1-8  충무대로  <NA>  3
12  <NA>  구덕로  <NA>  3
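The table lists only the three supported columns, which suggests (an assumption, since the report does not say so) that duplicates are detected on those columns alone, with the unsupported columns ignored. A minimal sketch of counting repeated rows over such a column subset; the rows below are illustrative, with None standing in for <NA>:

```python
from collections import Counter

# Illustrative rows restricted to (차로별 도로현황(총괄), Unnamed: 1, Unnamed: 13).
rows = (
    [(None, "[전용연장]", None)] * 3
    + [("소로 3-17", None, None)] * 2
    + [("총 계", "(중용연장)", None)]
)

# Count identical rows and keep only those that occur more than once.
row_counts = Counter(rows)
duplicate_counts = {row: n for row, n in row_counts.items() if n > 1}

# Print the repeated rows, most frequent first.
for row, n in sorted(duplicate_counts.items(), key=lambda item: -item[1]):
    print(row, n)
```

Because most columns are excluded from the comparison, rows flagged here as duplicates may still differ in the unsupported columns, so they should be inspected before being dropped.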