Overview

Dataset statistics

Number of variables: 7
Number of observations: 33
Missing cells: 54
Missing cells (%): 23.4%
Duplicate rows: 1
Duplicate rows (%): 3.0%
Total size in memory: 1.9 KiB
Average record size in memory: 60.0 B
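
The percentages in this block follow directly from the raw counts. The short check below is only a sketch of that arithmetic, using the figures reported above; the variable names are illustrative and are not part of ydata-profiling.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 7
n_observations = 33
missing_cells = 54
duplicate_rows = 1
avg_record_bytes = 60.0

# Every variable has one cell per observation.
total_cells = n_variables * n_observations

missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumes the reported average record size is exact.
total_kib = avg_record_bytes * n_observations / 1024

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Total size in memory: {total_kib:.1f} KiB")
```

The denominator for the missing-cell percentage is the full grid of 7 x 33 = 231 cells, not the number of rows.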

Variable types

Unsupported: 2
Text: 5

Dataset

Description: Status of the Safe Pedestrian Environment Development Project (안전한보행환경조성사업추진현황20156)
Author: Jeollabuk-do (전라북도)
URL: https://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=202596

Alerts

Unnamed: 6 has constant value "" (Constant)
Dataset has 1 (3.0%) duplicate rows (Duplicates)
2010~2015년 안전한 보행환경 조성사업 추진 현황 has 2 (6.1%) missing values (Missing)
Unnamed: 1 has 8 (24.2%) missing values (Missing)
Unnamed: 2 has 2 (6.1%) missing values (Missing)
Unnamed: 3 has 2 (6.1%) missing values (Missing)
Unnamed: 4 has 8 (24.2%) missing values (Missing)
Unnamed: 6 has 32 (97.0%) missing values (Missing)
2010~2015년 안전한 보행환경 조성사업 추진 현황 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)

Reproduction

Analysis started: 2024-03-14 03:00:34.936365
Analysis finished: 2024-03-14 03:00:35.410388
Duration: 0.47 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
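
The reported duration is the difference between the two timestamps. As a minimal sketch, assuming both timestamps are in the same time zone, it can be recomputed with the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2024-03-14 03:00:34.936365")
finished = datetime.fromisoformat("2024-03-14 03:00:35.410388")

duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```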

Variables

2010~2015년 안전한 보행환경 조성사업 추진 현황
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 2
Missing (%): 6.1%
Memory size: 396.0 B

Unnamed: 1
Text

MISSING 

Distinct: 24
Distinct (%): 96.0%
Missing: 8
Missing (%): 24.2%
Memory size: 396.0 B

Length

Max length: 25
Median length: 21
Mean length: 20.6
Min length: 3

Characters and Unicode

Total characters: 515
Distinct characters: 75
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
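
As an illustration of how such per-character properties can be tallied, the sketch below counts Unicode general categories for one value from this variable's Sample list, using Python's standard `unicodedata` module. It is not the code ydata-profiling runs. The report's labels correspond to the two-letter category codes: "Other Letter" is `Lo`, "Space Separator" is `Zs`, "Open Punctuation" is `Ps`, and "Close Punctuation" is `Pe`.

```python
import unicodedata
from collections import Counter

# Value shown as the 2nd row in the Sample list of Unnamed: 1,
# typed here exactly as the report renders it.
sample = "안전한 보행환경 조성사업 (순창 농암)"

# unicodedata.category returns the two-letter general category of a character.
category_counts = Counter(unicodedata.category(ch) for ch in sample)

print(len(sample))
print(dict(category_counts))
```

The report also counts 24 "Control" characters for this variable, which suggests the stored values may contain a line break that the rendered sample shows as a space. That is an inference from the counts, not something the report states.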

Unique

Unique: 23
Unique (%): 92.0%
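
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts values that occur exactly once. Both percentages are taken over the non-missing values (33 - 8 = 25 here). The sketch below uses a hypothetical column built to match those counts; the values themselves are made up.

```python
from collections import Counter

# Hypothetical stand-in for the 25 non-missing values of Unnamed: 1:
# 23 values occur once and one value occurs twice.
values = [f"project {i}" for i in range(23)] + ["repeated", "repeated"]

counts = Counter(values)
n_values = len(values)
distinct = len(counts)                                # different values
unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once

print(f"Distinct: {distinct} ({100 * distinct / n_values:.1f}%)")
print(f"Unique: {unique} ({100 * unique / n_values:.1f}%)")
```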

Sample

1st row: 사업명
2nd row: 안전한 보행환경 조성사업 (순창 농암)
3rd row: 안전한 보행환경 조성사업 (부안 모산)
4th row: 안전한 보행환경 조성사업 (고창 선운)
5th row: 안전한 보행환경 조성사업 (정읍 운학)

Value | Count | Frequency (%)
안전한 | 24 | 20.0%
조성사업 | 24 | 20.0%
보행환경 | 24 | 20.0%
익산 | 5 | 4.2%
완주 | 4 | 3.3%
정읍 | 2 | 1.7%
무주 | 2 | 1.7%
김제 | 2 | 1.7%
순창 | 2 | 1.7%
남원 | 2 | 1.7%
Other values (28) | 29 | 24.2%

Most occurring characters

ValueCountFrequency (%)
71
 
13.8%
26
 
5.0%
25
 
4.9%
25
 
4.9%
24
 
4.7%
24
 
4.7%
24
 
4.7%
24
 
4.7%
24
 
4.7%
24
 
4.7%
Other values (65) 224
43.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 368
71.5%
Space Separator 71
 
13.8%
Control 24
 
4.7%
Open Punctuation 24
 
4.7%
Close Punctuation 24
 
4.7%
Dash Punctuation 2
 
0.4%
Decimal Number 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
7.1%
25
 
6.8%
25
 
6.8%
24
 
6.5%
24
 
6.5%
24
 
6.5%
24
 
6.5%
24
 
6.5%
24
 
6.5%
24
 
6.5%
Other values (58) 124
33.7%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
2 1
50.0%
Space Separator
ValueCountFrequency (%)
71
100.0%
Control
ValueCountFrequency (%)
24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 368
71.5%
Common 147
 
28.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
7.1%
25
 
6.8%
25
 
6.8%
24
 
6.5%
24
 
6.5%
24
 
6.5%
24
 
6.5%
24
 
6.5%
24
 
6.5%
24
 
6.5%
Other values (58) 124
33.7%
Common
ValueCountFrequency (%)
71
48.3%
24
 
16.3%
( 24
 
16.3%
) 24
 
16.3%
- 2
 
1.4%
1 1
 
0.7%
2 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 368
71.5%
ASCII 147
 
28.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
71
48.3%
24
 
16.3%
( 24
 
16.3%
) 24
 
16.3%
- 2
 
1.4%
1 1
 
0.7%
2 1
 
0.7%
Hangul
ValueCountFrequency (%)
26
 
7.1%
25
 
6.8%
25
 
6.8%
24
 
6.5%
24
 
6.5%
24
 
6.5%
24
 
6.5%
24
 
6.5%
24
 
6.5%
24
 
6.5%
Other values (58) 124
33.7%

Unnamed: 2
Text

MISSING 

Distinct: 28
Distinct (%): 90.3%
Missing: 2
Missing (%): 6.1%
Memory size: 396.0 B

Length

Max length: 12
Median length: 11
Mean length: 8.2903226
Min length: 3

Characters and Unicode

Total characters: 257
Distinct characters: 63
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26
Unique (%): 83.9%

Sample

1st row: 위 치
2nd row: 25개소
3rd row: 7개소
4th row: 순창 복흥
5th row: 부안 부안

Value | Count | Frequency (%)
익산 | 7 | 12.1%
완주 | 4 | 6.9%
부안 | 3 | 5.2%
금마(지722 | 3 | 5.2%
김제 | 2 | 3.4%
정읍 | 2 | 3.4%
남원 | 2 | 3.4%
순창 | 2 | 3.4%
7개소 | 2 | 3.4%
3개소 | 1 | 1.7%
Other values (30) | 30 | 51.7%

Most occurring characters

ValueCountFrequency (%)
27
 
10.5%
19
 
7.4%
) 18
 
7.0%
( 18
 
7.0%
7 18
 
7.0%
15
 
5.8%
2 10
 
3.9%
4 7
 
2.7%
7
 
2.7%
7
 
2.7%
Other values (53) 111
43.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 133
51.8%
Decimal Number 60
23.3%
Space Separator 27
 
10.5%
Close Punctuation 18
 
7.0%
Open Punctuation 18
 
7.0%
Other Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
14.3%
15
 
11.3%
7
 
5.3%
7
 
5.3%
6
 
4.5%
6
 
4.5%
4
 
3.0%
4
 
3.0%
3
 
2.3%
3
 
2.3%
Other values (39) 59
44.4%
Decimal Number
ValueCountFrequency (%)
7 18
30.0%
2 10
16.7%
4 7
 
11.7%
1 6
 
10.0%
6 5
 
8.3%
5 4
 
6.7%
3 4
 
6.7%
0 2
 
3.3%
9 2
 
3.3%
8 2
 
3.3%
Space Separator
ValueCountFrequency (%)
27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 133
51.8%
Common 124
48.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
14.3%
15
 
11.3%
7
 
5.3%
7
 
5.3%
6
 
4.5%
6
 
4.5%
4
 
3.0%
4
 
3.0%
3
 
2.3%
3
 
2.3%
Other values (39) 59
44.4%
Common
ValueCountFrequency (%)
27
21.8%
) 18
14.5%
( 18
14.5%
7 18
14.5%
2 10
 
8.1%
4 7
 
5.6%
1 6
 
4.8%
6 5
 
4.0%
5 4
 
3.2%
3 4
 
3.2%
Other values (4) 7
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 133
51.8%
ASCII 124
48.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
27
21.8%
) 18
14.5%
( 18
14.5%
7 18
14.5%
2 10
 
8.1%
4 7
 
5.6%
1 6
 
4.8%
6 5
 
4.0%
5 4
 
3.2%
3 4
 
3.2%
Other values (4) 7
 
5.6%
Hangul
ValueCountFrequency (%)
19
 
14.3%
15
 
11.3%
7
 
5.3%
7
 
5.3%
6
 
4.5%
6
 
4.5%
4
 
3.0%
4
 
3.0%
3
 
2.3%
3
 
2.3%
Other values (39) 59
44.4%

Unnamed: 3
Text

MISSING 

Distinct: 22
Distinct (%): 71.0%
Missing: 2
Missing (%): 6.1%
Memory size: 396.0 B

Length

Max length: 10
Median length: 5
Mean length: 5.4193548
Min length: 5

Characters and Unicode

Total characters: 168
Distinct characters: 22
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 18
Unique (%): 58.1%

Sample

1st row: 사 업 량 (km)
2nd row: L=13.4
3rd row: L=4.3
4th row: L=0.73
5th row: L=0.78

Value | Count | Frequency (%)
l=0.4 | 5 | 14.7%
l=0.5 | 4 | 11.8%
l=1.0 | 2 | 5.9%
l=0.3 | 2 | 5.9%
l=0.73 | 1 | 2.9%
l=0.78 | 1 | 2.9%
l=0.51 | 1 | 2.9%
l=0.6 | 1 | 2.9%
l=0.2 | 1 | 2.9%
l=0.8 | 1 | 2.9%
Other values (15) | 15 | 44.1%
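
This variable is profiled as Text because each length is stored with an `L=` prefix. Below is a minimal sketch of one way to recover numeric values, assuming the unit is km as the header cell `사 업 량 (km)` suggests. The helper name `parse_length_km` is hypothetical.

```python
import re

# Matches strings such as "L=0.73"; case-insensitive because the word
# frequency table lists the same values in lowercase.
LENGTH_RE = re.compile(r"^\s*L\s*=\s*(\d+(?:\.\d+)?)\s*$", re.IGNORECASE)

def parse_length_km(text):
    """Return the length as a float, or None if the cell is not 'L=<number>'."""
    if text is None:
        return None
    match = LENGTH_RE.match(text)
    return float(match.group(1)) if match else None

# First five sample values of Unnamed: 3, plus a missing cell.
samples = ["사 업 량 (km)", "L=13.4", "L=4.3", "L=0.73", "L=0.78", None]
parsed = [parse_length_km(s) for s in samples]
print(parsed)
```

Header and missing cells map to `None` rather than raising, so the function can be applied to the whole column before the header row is removed.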

Most occurring characters

ValueCountFrequency (%)
L 30
17.9%
. 30
17.9%
= 30
17.9%
0 25
14.9%
4 11
 
6.5%
1 9
 
5.4%
3 8
 
4.8%
5 5
 
3.0%
7 3
 
1.8%
8 3
 
1.8%
Other values (12) 14
8.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 68
40.5%
Uppercase Letter 30
17.9%
Other Punctuation 30
17.9%
Math Symbol 30
17.9%
Other Letter 3
 
1.8%
Space Separator 2
 
1.2%
Lowercase Letter 2
 
1.2%
Close Punctuation 1
 
0.6%
Open Punctuation 1
 
0.6%
Control 1
 
0.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 25
36.8%
4 11
16.2%
1 9
 
13.2%
3 8
 
11.8%
5 5
 
7.4%
7 3
 
4.4%
8 3
 
4.4%
2 2
 
2.9%
9 1
 
1.5%
6 1
 
1.5%
Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Lowercase Letter
ValueCountFrequency (%)
m 1
50.0%
k 1
50.0%
Uppercase Letter
ValueCountFrequency (%)
L 30
100.0%
Other Punctuation
ValueCountFrequency (%)
. 30
100.0%
Math Symbol
ValueCountFrequency (%)
= 30
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 133
79.2%
Latin 32
 
19.0%
Hangul 3
 
1.8%

Most frequent character per script

Common
ValueCountFrequency (%)
. 30
22.6%
= 30
22.6%
0 25
18.8%
4 11
 
8.3%
1 9
 
6.8%
3 8
 
6.0%
5 5
 
3.8%
7 3
 
2.3%
8 3
 
2.3%
2
 
1.5%
Other values (6) 7
 
5.3%
Latin
ValueCountFrequency (%)
L 30
93.8%
m 1
 
3.1%
k 1
 
3.1%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 165
98.2%
Hangul 3
 
1.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
L 30
18.2%
. 30
18.2%
= 30
18.2%
0 25
15.2%
4 11
 
6.7%
1 9
 
5.5%
3 8
 
4.8%
5 5
 
3.0%
7 3
 
1.8%
8 3
 
1.8%
Other values (9) 11
 
6.7%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Unnamed: 4
Text

MISSING 

Distinct: 18
Distinct (%): 72.0%
Missing: 8
Missing (%): 24.2%
Memory size: 396.0 B

Length

Max length: 15
Median length: 15
Mean length: 14.56
Min length: 4

Characters and Unicode

Total characters: 364
Distinct characters: 16
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 14
Unique (%): 56.0%

Sample

1st row: 사업기간
2nd row: 2010.05~2010.12
3rd row: 2010.05~2010.11
4th row: 2010.05~2010.12
5th row: 2010.05~2010.12

Value | Count | Frequency (%)
2011.06~2012.02 | 3 | 12.0%
2010.05~2010.12 | 3 | 12.0%
2012.05~2013.01 | 3 | 12.0%
2012.10~2013.01 | 2 | 8.0%
2011.09~2012.07 | 1 | 4.0%
2012.03~2012.07 | 1 | 4.0%
2013.01~2014.06 | 1 | 4.0%
2013.10~2013.12 | 1 | 4.0%
2013.01~2013.08 | 1 | 4.0%
2011.10~2012.02 | 1 | 4.0%
Other values (8) | 8 | 32.0%
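
The period strings follow a `YYYY.MM~YYYY.MM` pattern. The sketch below shows one way to split them into start and end months. The helper names are hypothetical, and pinning each month to its first day is an assumption made only so that `date` objects can be built.

```python
from datetime import date

def parse_period(text):
    """Split 'YYYY.MM~YYYY.MM' into (start, end) dates on the 1st of each month."""
    def to_date(part):
        year, month = part.strip().split(".")
        return date(int(year), int(month), 1)

    start_text, end_text = text.split("~")
    return to_date(start_text), to_date(end_text)

def months_between(start, end):
    """Whole calendar months from the start month to the end month."""
    return (end.year - start.year) * 12 + (end.month - start.month)

start, end = parse_period("2011.06~2012.02")
print(start, end, months_between(start, end))
```

The values are best kept as strings until they are parsed: read as a number, `2012.10` would collapse to `2012.1` and lose the distinction between October and January.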

Most occurring characters

ValueCountFrequency (%)
0 103
28.3%
1 77
21.2%
2 70
19.2%
. 48
13.2%
~ 24
 
6.6%
3 13
 
3.6%
6 8
 
2.2%
5 7
 
1.9%
7 4
 
1.1%
4 3
 
0.8%
Other values (6) 7
 
1.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 288
79.1%
Other Punctuation 48
 
13.2%
Math Symbol 24
 
6.6%
Other Letter 4
 
1.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 103
35.8%
1 77
26.7%
2 70
24.3%
3 13
 
4.5%
6 8
 
2.8%
5 7
 
2.4%
7 4
 
1.4%
4 3
 
1.0%
9 2
 
0.7%
8 1
 
0.3%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 48
100.0%
Math Symbol
ValueCountFrequency (%)
~ 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 360
98.9%
Hangul 4
 
1.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 103
28.6%
1 77
21.4%
2 70
19.4%
. 48
13.3%
~ 24
 
6.7%
3 13
 
3.6%
6 8
 
2.2%
5 7
 
1.9%
7 4
 
1.1%
4 3
 
0.8%
Other values (2) 3
 
0.8%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 360
98.9%
Hangul 4
 
1.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 103
28.6%
1 77
21.4%
2 70
19.4%
. 48
13.3%
~ 24
 
6.7%
3 13
 
3.6%
6 8
 
2.2%
5 7
 
1.9%
7 4
 
1.1%
4 3
 
0.8%
Other values (2) 3
 
0.8%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 5
Unsupported

REJECTED  UNSUPPORTED 

Missing: 0
Missing (%): 0.0%
Memory size: 396.0 B

Unnamed: 6
Text

CONSTANT  MISSING 

Distinct: 1
Distinct (%): 100.0%
Missing: 32
Missing (%): 97.0%
Memory size: 396.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Characters and Unicode

Total characters: 2
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): 100.0%

Sample

1st row: 비고

Value | Count | Frequency (%)
비고 | 1 | 100.0%

Most occurring characters

ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Correlations

 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4
Unnamed: 1 | 1.000 | 0.959 | 0.991 | 0.985
Unnamed: 2 | 0.959 | 1.000 | 0.893 | 0.436
Unnamed: 3 | 0.991 | 0.893 | 1.000 | 0.000
Unnamed: 4 | 0.985 | 0.436 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Index | 2010~2015년 안전한 보행환경 조성사업 추진 현황 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6
0 | NaN | <NA> | <NA> | <NA> | <NA> | (2015.6.30일 기준) | <NA>
1 | 년도별 | 사업명 | 위 치 | 사 업 량 (km) | 사업기간 | 사업비 | 비고
2 | NaN | <NA> | <NA> | <NA> | <NA> | (백만원) | <NA>
3 |  | <NA> | 25개소 | L=13.4 | <NA> | 8790 | <NA>
4 | 2010 | <NA> | 7개소 | L=4.3 | <NA> | 2800 | <NA>
5 | 2010 | 안전한 보행환경 조성사업 (순창 농암) | 순창 복흥 | L=0.73 | 2010.05~2010.12 | 190 | <NA>
6 | 2010 | 안전한 보행환경 조성사업 (부안 모산) | 부안 부안 | L=0.78 | 2010.05~2010.11 | 430 | <NA>
7 | 2010 | 안전한 보행환경 조성사업 (고창 선운) | 고창 부안 | L=0.51 | 2010.05~2010.12 | 340 | <NA>
8 | 2010 | 안전한 보행환경 조성사업 (정읍 운학) | 정읍 영원 | L=0.31 | 2010.05~2010.12 | 170 | <NA>
9 | 2010 | 안전한 보행환경 조성사업 (익산 동촌) | 익산 여산, 익산 낭산 | L=1.41 | 2010.06~2010.12 | 920 | <NA>

Index | 2010~2015년 안전한 보행환경 조성사업 추진 현황 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6
23 | 2012 | 안전한 보행환경 조성사업 (김제 청하) | 김제 청하(지711) | L=0.8 | 2012.05~2013.01 | 430 | <NA>
24 | 2012 | 안전한 보행환경 조성사업 (순창 복흥) | 순창 복흥(지897) | L=0.5 | 2012.10~2013.01 | 280 | <NA>
25 | 2012 | 안전한 보행환경 조성사업 (완주 화산) | 완주 화산(지643) | L=0.4 | 2012.10~2013.01 | 290 | <NA>
26 | 2013 | <NA> | 3개소 | L=1.0 | <NA> | 510 | <NA>
27 | 2013 | 안전한 보행환경 조성사업 (익산 석천) | 익산 낭산(지718) | L=0.4 | 2013.01~2013.01 | 340 | <NA>
28 | 2013 | 안전한 보행환경 조성사업 (정읍 두지) | 정읍 두지(지736) | L=0.2 | 2013.01~2013.08 | 80 | <NA>
29 | 2013 | 안전한 보행환경 조성사업 (익산 금마-1공구) | 익산 금마(지722) | L=0.4 | 2013.10~2013.12 | 90 | <NA>
30 | 2014 | <NA> | 2개소 | L=0.6 | <NA> | 373 | <NA>
31 | 2014 | 안전한 보행환경 조성사업 (완주 어우) | 완주 고산(지741) | L=0.3 | 2013.01~2014.06 | 283 | <NA>
32 | 2014 | 안전한 보행환경 조성사업 (익산 금마-2공구) | 익산 금마(지722) | L=0.3 | 2014.01~2014.07 | 90 | <NA>
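
The sample rows suggest why the profiler reports `Unnamed: N` column names and unsupported types: the file's first line is a title, the real column labels sit in row 1, and row 2 carries only a unit note. The sketch below, in plain Python with no pandas dependency, promotes row 1 to the header and keeps only rows that name a project. The cell types (for example, whether the year and cost are integers) are guesses based on how the report renders them, and `HEADER_ROW` is an assumed position.

```python
NA = None  # stand-in for the <NA>/NaN cells shown in the sample

# A few rows transcribed from the sample (rows 0, 1, 2, 5 and 6).
raw_rows = [
    [NA, NA, NA, NA, NA, "(2015.6.30일 기준)", NA],
    ["년도별", "사업명", "위 치", "사 업 량 (km)", "사업기간", "사업비", "비고"],
    [NA, NA, NA, NA, NA, "(백만원)", NA],
    [2010, "안전한 보행환경 조성사업 (순창 농암)", "순창 복흥", "L=0.73", "2010.05~2010.12", 190, NA],
    [2010, "안전한 보행환경 조성사업 (부안 모산)", "부안 부안", "L=0.78", "2010.05~2010.11", 430, NA],
]

HEADER_ROW = 1  # assumption: the label row observed in the sample
header = raw_rows[HEADER_ROW]

# Rows without a project name (unit notes, yearly subtotals) are skipped.
records = [
    dict(zip(header, row))
    for row in raw_rows[HEADER_ROW + 1:]
    if row[1] is not None
]

for record in records:
    print(record["년도별"], record["위 치"], record["사업비"])
```

Filtering on the project-name column also drops the per-year subtotal rows (for example row 4, `7개소`), which would otherwise be double counted when summing costs.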

Duplicate rows

Most frequently occurring

Index | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 6 | # duplicates
0 | <NA> | <NA> | <NA> | <NA> | <NA> | 2