Overview

Dataset statistics

Number of variables5
Number of observations22
Missing cells22
Missing cells (%)20.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1012.0 B
Average record size in memory46.0 B

Variable types

Categorical2
Text3

Dataset

Description이 데이터는 과천시 관내 주요도로 연평균 일교통량에 관한 건으로 도로명, 지점, 방향, 연평균 일교통량 등의 항목을 포함하고 있습니다.
Author경기도 과천시
URLhttps://www.data.go.kr/data/15087046/fileData.do

Alerts

연평균 일교통량(대/일) has 4 (18.2%) missing valuesMissing
비고(공사로 인한 운영중지 기간 등) has 18 (81.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 22:30:48.444392
Analysis finished2023-12-12 22:30:49.002364
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

도로명
Categorical

Distinct10
Distinct (%)45.5%
Missing0
Missing (%)0.0%
Memory size308.0 B
양재대로
과천대로
선암로
우면산로
대공원로
Other values (5)

Length

Max length14
Median length4
Mean length4.9090909
Min length3

Unique

Unique2 ?
Unique (%)9.1%

Sample

1st row선암로
2nd row선암로
3rd row 남태령로
4th row남태령로
5th row양재대로

Common Values

ValueCountFrequency (%)
양재대로 4
18.2%
과천대로 4
18.2%
선암로 2
9.1%
우면산로 2
9.1%
대공원로 2
9.1%
경마공원로 2
9.1%
중앙로 2
9.1%
과천-봉담간 도시고속화도로 2
9.1%
남태령로 1
 
4.5%
남태령로 1
 
4.5%

Length

2023-12-13T07:30:49.082702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:30:49.216630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
양재대로 4
16.7%
과천대로 4
16.7%
선암로 2
8.3%
우면산로 2
8.3%
대공원로 2
8.3%
경마공원로 2
8.3%
중앙로 2
8.3%
과천-봉담간 2
8.3%
도시고속화도로 2
8.3%
남태령로 2
8.3%

지점
Text

Distinct11
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size308.0 B
2023-12-13T07:30:49.433022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length4.6363636
Min length3

Characters and Unicode

Total characters102
Distinct characters39
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row선암주유소
2nd row선암주유소
3rd row남태령전철역
4th row남태령전철역
5th row주암교
ValueCountFrequency (%)
선암주유소 2
9.1%
남태령전철역 2
9.1%
주암교 2
9.1%
플라워마트 2
9.1%
찬우물 2
9.1%
광명주유소 2
9.1%
우면산로 2
9.1%
대공원로 2
9.1%
과천소방파출소 2
9.1%
청사사거리 2
9.1%
2023-12-13T07:30:49.748307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8
 
7.8%
6
 
5.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
2
 
2.0%
Other values (29) 58
56.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 102
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
7.8%
6
 
5.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
2
 
2.0%
Other values (29) 58
56.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 102
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
7.8%
6
 
5.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
2
 
2.0%
Other values (29) 58
56.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 102
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8
 
7.8%
6
 
5.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
2
 
2.0%
Other values (29) 58
56.9%

방향
Categorical

Distinct2
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Memory size308.0 B
상행
11 
하행
11 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상행
2nd row하행
3rd row상행
4th row하행
5th row상행

Common Values

ValueCountFrequency (%)
상행 11
50.0%
하행 11
50.0%

Length

2023-12-13T07:30:49.877696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:30:49.971887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상행 11
50.0%
하행 11
50.0%
Distinct18
Distinct (%)100.0%
Missing4
Missing (%)18.2%
Memory size308.0 B
2023-12-13T07:30:50.127398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length5.6111111
Min length5

Characters and Unicode

Total characters101
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)100.0%

Sample

1st row21,117
2nd row7,348
3rd row46,177
4th row42,459
5th row17,477
ValueCountFrequency (%)
85,005 1
 
5.6%
46,177 1
 
5.6%
88,070 1
 
5.6%
3,917 1
 
5.6%
4,522 1
 
5.6%
7,646 1
 
5.6%
3,656 1
 
5.6%
9,943 1
 
5.6%
47,127 1
 
5.6%
21,117 1
 
5.6%
Other values (8) 8
44.4%
2023-12-13T07:30:50.432685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 18
17.8%
4 13
12.9%
7 13
12.9%
1 11
10.9%
2 9
8.9%
5 8
7.9%
0 7
 
6.9%
8 6
 
5.9%
6 6
 
5.9%
9 6
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 83
82.2%
Other Punctuation 18
 
17.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 13
15.7%
7 13
15.7%
1 11
13.3%
2 9
10.8%
5 8
9.6%
0 7
8.4%
8 6
7.2%
6 6
7.2%
9 6
7.2%
3 4
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 101
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
, 18
17.8%
4 13
12.9%
7 13
12.9%
1 11
10.9%
2 9
8.9%
5 8
7.9%
0 7
 
6.9%
8 6
 
5.9%
6 6
 
5.9%
9 6
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 101
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 18
17.8%
4 13
12.9%
7 13
12.9%
1 11
10.9%
2 9
8.9%
5 8
7.9%
0 7
 
6.9%
8 6
 
5.9%
6 6
 
5.9%
9 6
 
5.9%
Distinct2
Distinct (%)50.0%
Missing18
Missing (%)81.8%
Memory size308.0 B
2023-12-13T07:30:50.614580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length37
Mean length37
Min length35

Characters and Unicode

Total characters148
Distinct characters39
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강남순환고속도로 매헌지하차도 공사로 인해 운영중지(2016년 ~ 현재)
2nd row강남순환고속도로 매헌지하차도 공사로 인해 운영중지(2016년 ~ 현재)
3rd row국도47호선 우회도로 공사로 인해 운영중지(2019년 ~ 현재)
4th row국도47호선 우회도로 공사로 인해 운영중지(2019년 ~ 현재)
ValueCountFrequency (%)
공사로 4
14.3%
인해 4
14.3%
4
14.3%
현재 4
14.3%
강남순환고속도로 2
7.1%
매헌지하차도 2
7.1%
운영중지(2016년 2
7.1%
국도47호선 2
7.1%
우회도로 2
7.1%
운영중지(2019년 2
7.1%
2023-12-13T07:30:50.925855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
16.2%
8
 
5.4%
8
 
5.4%
6
 
4.1%
4
 
2.7%
4
 
2.7%
4
 
2.7%
1 4
 
2.7%
0 4
 
2.7%
2 4
 
2.7%
Other values (29) 78
52.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 92
62.2%
Space Separator 24
 
16.2%
Decimal Number 20
 
13.5%
Open Punctuation 4
 
2.7%
Close Punctuation 4
 
2.7%
Math Symbol 4
 
2.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
8.7%
8
 
8.7%
6
 
6.5%
4
 
4.3%
4
 
4.3%
4
 
4.3%
4
 
4.3%
4
 
4.3%
4
 
4.3%
4
 
4.3%
Other values (18) 42
45.7%
Decimal Number
ValueCountFrequency (%)
1 4
20.0%
0 4
20.0%
2 4
20.0%
4 2
10.0%
7 2
10.0%
6 2
10.0%
9 2
10.0%
Space Separator
ValueCountFrequency (%)
24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 92
62.2%
Common 56
37.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
8.7%
8
 
8.7%
6
 
6.5%
4
 
4.3%
4
 
4.3%
4
 
4.3%
4
 
4.3%
4
 
4.3%
4
 
4.3%
4
 
4.3%
Other values (18) 42
45.7%
Common
ValueCountFrequency (%)
24
42.9%
1 4
 
7.1%
0 4
 
7.1%
2 4
 
7.1%
( 4
 
7.1%
) 4
 
7.1%
~ 4
 
7.1%
4 2
 
3.6%
7 2
 
3.6%
6 2
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 92
62.2%
ASCII 56
37.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24
42.9%
1 4
 
7.1%
0 4
 
7.1%
2 4
 
7.1%
( 4
 
7.1%
) 4
 
7.1%
~ 4
 
7.1%
4 2
 
3.6%
7 2
 
3.6%
6 2
 
3.6%
Hangul
ValueCountFrequency (%)
8
 
8.7%
8
 
8.7%
6
 
6.5%
4
 
4.3%
4
 
4.3%
4
 
4.3%
4
 
4.3%
4
 
4.3%
4
 
4.3%
4
 
4.3%
Other values (18) 42
45.7%

Correlations

2023-12-13T07:30:51.014518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도로명지점방향연평균 일교통량(대/일)비고(공사로 인한 운영중지 기간 등)
도로명1.0000.9740.0001.0000.000
지점0.9741.0000.0001.0000.000
방향0.0000.0001.0001.0000.000
연평균 일교통량(대/일)1.0001.0001.0001.000NaN
비고(공사로 인한 운영중지 기간 등)0.0000.0000.000NaN1.000
2023-12-13T07:30:51.401891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
방향도로명
방향1.0000.000
도로명0.0001.000
2023-12-13T07:30:51.494612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도로명방향
도로명1.0000.000
방향0.0001.000

Missing values

2023-12-13T07:30:48.765988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:30:48.864029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T07:30:48.950919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

도로명지점방향연평균 일교통량(대/일)비고(공사로 인한 운영중지 기간 등)
0선암로선암주유소상행21,117<NA>
1선암로선암주유소하행7,348<NA>
2남태령로남태령전철역상행46,177<NA>
3남태령로남태령전철역하행42,459<NA>
4양재대로주암교상행17,477<NA>
5양재대로주암교하행20,511<NA>
6양재대로플라워마트상행<NA>강남순환고속도로 매헌지하차도 공사로 인해 운영중지(2016년 ~ 현재)
7양재대로플라워마트하행<NA>강남순환고속도로 매헌지하차도 공사로 인해 운영중지(2016년 ~ 현재)
8과천대로찬우물상행42,817<NA>
9과천대로찬우물하행41,529<NA>
도로명지점방향연평균 일교통량(대/일)비고(공사로 인한 운영중지 기간 등)
12우면산로우면산로상행44,098<NA>
13우면산로우면산로하행47,127<NA>
14대공원로대공원로상행9,943<NA>
15대공원로대공원로하행3,656<NA>
16경마공원로과천소방파출소상행7,646<NA>
17경마공원로과천소방파출소하행4,522<NA>
18중앙로청사사거리상행3,917<NA>
19중앙로청사사거리하행5,062<NA>
20과천-봉담간 도시고속화도로과천터널상행<NA>국도47호선 우회도로 공사로 인해 운영중지(2019년 ~ 현재)
21과천-봉담간 도시고속화도로과천터널하행<NA>국도47호선 우회도로 공사로 인해 운영중지(2019년 ~ 현재)