Overview

Dataset statistics

Number of variables9
Number of observations287
Missing cells287
Missing cells (%)11.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory21.1 KiB
Average record size in memory75.5 B

Variable types

Categorical6
Text1
Numeric1
Unsupported1

Dataset

Description해당 자료는 도로관리사업소 진주지소에서 관리하는 지방도, 위임국도의 도로시설물인 교량 및 터널의 기본사항, 현황을 설명하는 자료입니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15102843

Alerts

집계년도 has constant value ""Constant
시군명 is highly overall correlated with 시설물위치 and 1 other fieldsHigh correlation
시설물위치 is highly overall correlated with 시군명 and 1 other fieldsHigh correlation
노선명 is highly overall correlated with 시군명 and 1 other fieldsHigh correlation
시설물구분명 is highly imbalanced (87.3%)Imbalance
Unnamed: 8 has 287 (100.0%) missing valuesMissing
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 23:58:14.803431
Analysis finished2023-12-10 23:58:15.605287
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

집계년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
2023
287 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2023 287
100.0%

Length

2023-12-11T08:58:15.720314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:58:15.885899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 287
100.0%

시군명
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
진주시
56 
하동군
54 
거창군
54 
산청군
39 
함양군
36 
Other values (2)
48 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row진주시
2nd row진주시
3rd row진주시
4th row진주시
5th row진주시

Common Values

ValueCountFrequency (%)
진주시 56
19.5%
하동군 54
18.8%
거창군 54
18.8%
산청군 39
13.6%
함양군 36
12.5%
사천시 30
10.5%
남해군 18
 
6.3%

Length

2023-12-11T08:58:16.048309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:58:16.198133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
진주시 56
19.5%
하동군 54
18.8%
거창군 54
18.8%
산청군 39
13.6%
함양군 36
12.5%
사천시 30
10.5%
남해군 18
 
6.3%
Distinct273
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
2023-12-11T08:58:16.557837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.3135889
Min length3

Characters and Unicode

Total characters951
Distinct characters171
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique261 ?
Unique (%)90.9%

Sample

1st row가봉교
2nd row가산1교
3rd row갈전1교
4th row갈전2교
5th row갈촌과선교
ValueCountFrequency (%)
월아교 3
 
1.0%
신기교 3
 
1.0%
가천교 2
 
0.7%
신촌교 2
 
0.7%
평촌교 2
 
0.7%
하평교 2
 
0.7%
상평교 2
 
0.7%
대현교 2
 
0.7%
청룡교 2
 
0.7%
대사교 2
 
0.7%
Other values (263) 265
92.3%
2023-12-11T08:58:17.076164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
285
30.0%
25
 
2.6%
24
 
2.5%
23
 
2.4%
2 22
 
2.3%
1 21
 
2.2%
13
 
1.4%
12
 
1.3%
12
 
1.3%
11
 
1.2%
Other values (161) 503
52.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 890
93.6%
Decimal Number 51
 
5.4%
Close Punctuation 4
 
0.4%
Open Punctuation 4
 
0.4%
Uppercase Letter 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
285
32.0%
25
 
2.8%
24
 
2.7%
23
 
2.6%
13
 
1.5%
12
 
1.3%
12
 
1.3%
11
 
1.2%
11
 
1.2%
11
 
1.2%
Other values (152) 463
52.0%
Decimal Number
ValueCountFrequency (%)
2 22
43.1%
1 21
41.2%
3 6
 
11.8%
5 1
 
2.0%
6 1
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
C 1
50.0%
I 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 890
93.6%
Common 59
 
6.2%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
285
32.0%
25
 
2.8%
24
 
2.7%
23
 
2.6%
13
 
1.5%
12
 
1.3%
12
 
1.3%
11
 
1.2%
11
 
1.2%
11
 
1.2%
Other values (152) 463
52.0%
Common
ValueCountFrequency (%)
2 22
37.3%
1 21
35.6%
3 6
 
10.2%
) 4
 
6.8%
( 4
 
6.8%
5 1
 
1.7%
6 1
 
1.7%
Latin
ValueCountFrequency (%)
C 1
50.0%
I 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 890
93.6%
ASCII 61
 
6.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
285
32.0%
25
 
2.8%
24
 
2.7%
23
 
2.6%
13
 
1.5%
12
 
1.3%
12
 
1.3%
11
 
1.2%
11
 
1.2%
11
 
1.2%
Other values (152) 463
52.0%
ASCII
ValueCountFrequency (%)
2 22
36.1%
1 21
34.4%
3 6
 
9.8%
) 4
 
6.6%
( 4
 
6.6%
C 1
 
1.6%
I 1
 
1.6%
5 1
 
1.6%
6 1
 
1.6%

시설물구분명
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
교량
282 
터널
 
5

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교량
2nd row교량
3rd row교량
4th row교량
5th row교량

Common Values

ValueCountFrequency (%)
교량 282
98.3%
터널 5
 
1.7%

Length

2023-12-11T08:58:17.234018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:58:17.328040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
교량 282
98.3%
터널 5
 
1.7%

시설물위치
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
진주
56 
하동
54 
거창
54 
산청
39 
함양
36 
Other values (2)
48 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row진주
2nd row진주
3rd row진주
4th row진주
5th row진주

Common Values

ValueCountFrequency (%)
진주 56
19.5%
하동 54
18.8%
거창 54
18.8%
산청 39
13.6%
함양 36
12.5%
사천 30
10.5%
남해 18
 
6.3%

Length

2023-12-11T08:58:17.424601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:58:17.548098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
진주 56
19.5%
하동 54
18.8%
거창 54
18.8%
산청 39
13.6%
함양 36
12.5%
사천 30
10.5%
남해 18
 
6.3%

시설물준공년도
Real number (ℝ)

Distinct53
Distinct (%)18.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1999.5819
Minimum1965
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.7 KiB
2023-12-11T08:58:17.730751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1965
5-th percentile1981.3
Q11992
median1999
Q32008
95-th percentile2019
Maximum2023
Range58
Interquartile range (IQR)16

Descriptive statistics

Standard deviation11.69242
Coefficient of variation (CV)0.0058474323
Kurtosis-0.043947558
Mean1999.5819
Median Absolute Deviation (MAD)8
Skewness-0.17983814
Sum573880
Variance136.71268
MonotonicityNot monotonic
2023-12-11T08:58:17.902661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1992 14
 
4.9%
1999 14
 
4.9%
1995 13
 
4.5%
1994 13
 
4.5%
1997 13
 
4.5%
2000 11
 
3.8%
1988 10
 
3.5%
2005 10
 
3.5%
1996 10
 
3.5%
2009 10
 
3.5%
Other values (43) 169
58.9%
ValueCountFrequency (%)
1965 1
 
0.3%
1966 1
 
0.3%
1968 1
 
0.3%
1969 1
 
0.3%
1970 1
 
0.3%
1972 2
0.7%
1974 1
 
0.3%
1977 3
1.0%
1978 1
 
0.3%
1980 2
0.7%
ValueCountFrequency (%)
2023 3
 
1.0%
2022 3
 
1.0%
2021 5
1.7%
2020 2
 
0.7%
2019 3
 
1.0%
2018 8
2.8%
2017 2
 
0.7%
2016 5
1.7%
2015 3
 
1.0%
2014 2
 
0.7%

종별구분명
Categorical

Distinct4
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
3종
128 
기타
110 
2종
39 
1종
 
10

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3종
2nd row기타
3rd row기타
4th row기타
5th row2종

Common Values

ValueCountFrequency (%)
3종 128
44.6%
기타 110
38.3%
2종 39
 
13.6%
1종 10
 
3.5%

Length

2023-12-11T08:58:18.044039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:58:18.149990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3종 128
44.6%
기타 110
38.3%
2종 39
 
13.6%
1종 10
 
3.5%

노선명
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)9.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
1001
30 
37
19 
1003
19 
1024
 
18
국59
 
17
Other values (22)
184 

Length

Max length4
Median length4
Mean length3.5993031
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1007
2nd row30
3rd row1009
4th row1009
5th row1007

Common Values

ValueCountFrequency (%)
1001 30
 
10.5%
37 19
 
6.6%
1003 19
 
6.6%
1024 18
 
6.3%
국59 17
 
5.9%
1084 16
 
5.6%
1002 16
 
5.6%
1005 13
 
4.5%
1089 13
 
4.5%
1099 12
 
4.2%
Other values (17) 114
39.7%

Length

2023-12-11T08:58:18.316558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1001 30
 
10.5%
1003 19
 
6.6%
37 19
 
6.6%
1024 18
 
6.3%
국59 17
 
5.9%
1084 16
 
5.6%
1002 16
 
5.6%
1005 13
 
4.5%
1089 13
 
4.5%
1099 12
 
4.2%
Other values (17) 114
39.7%

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing287
Missing (%)100.0%
Memory size2.7 KiB

Interactions

2023-12-11T08:58:15.262993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:58:18.429501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명시설물구분명시설물위치시설물준공년도종별구분명노선명
시군명1.0000.0601.0000.2380.2390.943
시설물구분명0.0601.0000.0600.0000.4710.419
시설물위치1.0000.0601.0000.2380.2390.943
시설물준공년도0.2380.0000.2381.0000.4320.507
종별구분명0.2390.4710.2390.4321.0000.484
노선명0.9430.4190.9430.5070.4841.000
2023-12-11T08:58:18.541430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설물구분명시군명시설물위치종별구분명노선명
시설물구분명1.0000.0640.0640.3170.344
시군명0.0641.0001.0000.1650.734
시설물위치0.0641.0001.0000.1650.734
종별구분명0.3170.1650.1651.0000.262
노선명0.3440.7340.7340.2621.000
2023-12-11T08:58:18.653440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설물준공년도시군명시설물구분명시설물위치종별구분명노선명
시설물준공년도1.0000.1170.0000.1170.2670.199
시군명0.1171.0000.0641.0000.1650.734
시설물구분명0.0000.0641.0000.0640.3170.344
시설물위치0.1171.0000.0641.0000.1650.734
종별구분명0.2670.1650.3170.1651.0000.262
노선명0.1990.7340.3440.7340.2621.000

Missing values

2023-12-11T08:58:15.389834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:58:15.533748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

집계년도시군명시설물명시설물구분명시설물위치시설물준공년도종별구분명노선명Unnamed: 8
02023진주시가봉교교량진주19963종1007<NA>
12023진주시가산1교교량진주2013기타30<NA>
22023진주시갈전1교교량진주1987기타1009<NA>
32023진주시갈전2교교량진주1987기타1009<NA>
42023진주시갈촌과선교교량진주19972종1007<NA>
52023진주시계원교교량진주1992기타1006<NA>
62023진주시관지5교교량진주2015기타1006<NA>
72023진주시광석교교량진주20043종1013<NA>
82023진주시금호교교량진주1987기타1009<NA>
92023진주시금호2교교량진주2023기타1009<NA>
집계년도시군명시설물명시설물구분명시설물위치시설물준공년도종별구분명노선명Unnamed: 8
2772023거창군과정1교교량거창19883종국59<NA>
2782023거창군과정2교교량거창20073종국59<NA>
2792023거창군진산교(상)교량거창19853종국37<NA>
2802023거창군진산교(하)교량거창20093종국37<NA>
2812023거창군거열교(구)교량거창19962종국37<NA>
2822023거창군거열교(신)교량거창20052종국37<NA>
2832023거창군장풍교교량거창19862종국37<NA>
2842023거창군금계교교량거창20023종국37<NA>
2852023거창군완대교교량거창19903종국37<NA>
2862023거창군율리교교량거창2016기타국37<NA>