Overview

Dataset statistics

Number of variables13
Number of observations586
Missing cells2584
Missing cells (%)33.9%
Duplicate rows52
Duplicate rows (%)8.9%
Total size in memory64.8 KiB
Average record size in memory113.2 B

Variable types

Numeric5
Categorical3
DateTime1
Unsupported4

Dataset

Description경기도_도로대장 전산화 시스템_방음시설
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=XSF6W96PB3DM82JPWM7D33902771&infSeq=1

Alerts

Dataset has 52 (8.9%) duplicate rowsDuplicates
노선번호 is highly overall correlated with 비고High correlation
위치 is highly overall correlated with 위치_종점High correlation
위치_종점 is highly overall correlated with 위치High correlation
높이 is highly overall correlated with 비고High correlation
종류 is highly overall correlated with 비고High correlation
비고 is highly overall correlated with 노선번호 and 2 other fieldsHigh correlation
비고 is highly imbalanced (62.3%)Imbalance
설치일자 has 240 (41.0%) missing valuesMissing
공간경도시점 has 586 (100.0%) missing valuesMissing
공간위도시점 has 586 (100.0%) missing valuesMissing
공간경도종점 has 586 (100.0%) missing valuesMissing
공간위도종점 has 586 (100.0%) missing valuesMissing
공간경도시점 is an unsupported type, check if it needs cleaning or further analysisUnsupported
공간위도시점 is an unsupported type, check if it needs cleaning or further analysisUnsupported
공간경도종점 is an unsupported type, check if it needs cleaning or further analysisUnsupported
공간위도종점 is an unsupported type, check if it needs cleaning or further analysisUnsupported
높이 has 56 (9.6%) zerosZeros

Reproduction

Analysis started2023-12-10 22:04:07.385494
Analysis finished2023-12-10 22:04:10.622864
Duration3.24 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

노선번호
Real number (ℝ)

HIGH CORRELATION 

Distinct36
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean252.06826
Minimum23
Maximum387
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2023-12-11T07:04:10.678129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum23
5-th percentile23
Q198
median309
Q3330
95-th percentile375
Maximum387
Range364
Interquartile range (IQR)232

Descriptive statistics

Standard deviation122.31172
Coefficient of variation (CV)0.48523252
Kurtosis-1.0633791
Mean252.06826
Median Absolute Deviation (MAD)24
Skewness-0.83582405
Sum147712
Variance14960.156
MonotonicityNot monotonic
2023-12-11T07:04:10.786463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
309 151
25.8%
330 66
11.3%
98 41
 
7.0%
23 36
 
6.1%
78 29
 
4.9%
318 28
 
4.8%
364 27
 
4.6%
82 22
 
3.8%
387 20
 
3.4%
56 19
 
3.2%
Other values (26) 147
25.1%
ValueCountFrequency (%)
23 36
6.1%
39 2
 
0.3%
56 19
3.2%
57 8
 
1.4%
70 6
 
1.0%
78 29
4.9%
82 22
3.8%
86 11
 
1.9%
98 41
7.0%
301 1
 
0.2%
ValueCountFrequency (%)
387 20
3.4%
383 7
 
1.2%
375 13
2.2%
372 1
 
0.2%
371 6
 
1.0%
368 2
 
0.3%
367 2
 
0.3%
364 27
4.6%
360 14
2.4%
356 2
 
0.3%

구간번호
Real number (ℝ)

Distinct19
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.2423208
Minimum1
Maximum99
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2023-12-11T07:04:10.891553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median6
Q38
95-th percentile13
Maximum99
Range98
Interquartile range (IQR)6

Descriptive statistics

Standard deviation7.0963564
Coefficient of variation (CV)1.1368138
Kurtosis99.025624
Mean6.2423208
Median Absolute Deviation (MAD)3
Skewness8.0262435
Sum3658
Variance50.358274
MonotonicityNot monotonic
2023-12-11T07:04:10.983596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
1 135
23.0%
8 98
16.7%
7 64
10.9%
4 43
 
7.3%
3 40
 
6.8%
6 38
 
6.5%
5 35
 
6.0%
2 27
 
4.6%
10 24
 
4.1%
9 21
 
3.6%
Other values (9) 61
10.4%
ValueCountFrequency (%)
1 135
23.0%
2 27
 
4.6%
3 40
 
6.8%
4 43
 
7.3%
5 35
 
6.0%
6 38
 
6.5%
7 64
10.9%
8 98
16.7%
9 21
 
3.6%
10 24
 
4.1%
ValueCountFrequency (%)
99 2
 
0.3%
26 7
 
1.2%
24 3
 
0.5%
23 2
 
0.3%
18 2
 
0.3%
14 12
2.0%
13 17
2.9%
12 14
2.4%
11 2
 
0.3%
10 24
4.1%

위치
Real number (ℝ)

HIGH CORRELATION 

Distinct494
Distinct (%)84.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.2168097
Minimum0.005
Maximum15.75
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2023-12-11T07:04:11.087720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.005
5-th percentile0.46625
Q12.2855
median4.7625
Q37.907
95-th percentile10.93025
Maximum15.75
Range15.745
Interquartile range (IQR)5.6215

Descriptive statistics

Standard deviation3.4370121
Coefficient of variation (CV)0.65883409
Kurtosis-0.65070831
Mean5.2168097
Median Absolute Deviation (MAD)2.6415
Skewness0.44725637
Sum3057.0505
Variance11.813052
MonotonicityNot monotonic
2023-12-11T07:04:11.211079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9.3 4
 
0.7%
10.1 4
 
0.7%
9.1 3
 
0.5%
7.84 3
 
0.5%
2.216 3
 
0.5%
0.913 3
 
0.5%
2.54 3
 
0.5%
2.98 3
 
0.5%
1.68 3
 
0.5%
2.56 3
 
0.5%
Other values (484) 554
94.5%
ValueCountFrequency (%)
0.005 1
0.2%
0.064 1
0.2%
0.087 1
0.2%
0.094 1
0.2%
0.16 1
0.2%
0.17 1
0.2%
0.192 1
0.2%
0.2 1
0.2%
0.22 2
0.3%
0.23 1
0.2%
ValueCountFrequency (%)
15.75 1
0.2%
15.547 1
0.2%
15.21 1
0.2%
13.125 2
0.3%
12.87 2
0.3%
12.815 1
0.2%
12.74 1
0.2%
12.695 1
0.2%
12.681 1
0.2%
12.635 1
0.2%

위치_종점
Real number (ℝ)

HIGH CORRELATION 

Distinct497
Distinct (%)84.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.3417739
Minimum0.06
Maximum15.827
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2023-12-11T07:04:11.358813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.06
5-th percentile0.5165
Q12.46
median4.84
Q38.084
95-th percentile11.00825
Maximum15.827
Range15.767
Interquartile range (IQR)5.624

Descriptive statistics

Standard deviation3.4429379
Coefficient of variation (CV)0.64453082
Kurtosis-0.63889348
Mean5.3417739
Median Absolute Deviation (MAD)2.622
Skewness0.446249
Sum3130.2795
Variance11.853822
MonotonicityNot monotonic
2023-12-11T07:04:11.505110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5.28 4
 
0.7%
8.87 4
 
0.7%
1.0 3
 
0.5%
2.57 3
 
0.5%
0.46 3
 
0.5%
1.9 3
 
0.5%
7.93 3
 
0.5%
1.79 3
 
0.5%
1.472 3
 
0.5%
4.38 3
 
0.5%
Other values (487) 554
94.5%
ValueCountFrequency (%)
0.06 1
0.2%
0.153 1
0.2%
0.244 1
0.2%
0.252 1
0.2%
0.254 1
0.2%
0.26 2
0.3%
0.276 1
0.2%
0.28 2
0.3%
0.281 1
0.2%
0.287 1
0.2%
ValueCountFrequency (%)
15.827 1
0.2%
15.746 1
0.2%
15.315 1
0.2%
13.445 2
0.3%
13.125 2
0.3%
12.92 1
0.2%
12.823 1
0.2%
12.815 1
0.2%
12.805 1
0.2%
12.695 1
0.2%

위치?방향
Categorical

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
하행
294 
상행
292 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row하행
2nd row하행
3rd row상행
4th row상행
5th row하행

Common Values

ValueCountFrequency (%)
하행 294
50.2%
상행 292
49.8%

Length

2023-12-11T07:04:11.640805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:04:11.716990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
하행 294
50.2%
상행 292
49.8%

종류
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
기타
180 
흡음형
171 
반사형
129 
혼합형
79 
<NA>
27 

Length

Max length4
Median length3
Mean length2.7389078
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row반사형
2nd row반사형
3rd row반사형
4th row반사형
5th row반사형

Common Values

ValueCountFrequency (%)
기타 180
30.7%
흡음형 171
29.2%
반사형 129
22.0%
혼합형 79
13.5%
<NA> 27
 
4.6%

Length

2023-12-11T07:04:11.808049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:04:11.900525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타 180
30.7%
흡음형 171
29.2%
반사형 129
22.0%
혼합형 79
13.5%
na 27
 
4.6%

높이
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct25
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0334471
Minimum0
Maximum60
Zeros56
Zeros (%)9.6%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2023-12-11T07:04:11.991123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median2.5
Q33.5
95-th percentile6
Maximum60
Range60
Interquartile range (IQR)1.5

Descriptive statistics

Standard deviation3.6444137
Coefficient of variation (CV)1.20141
Kurtosis148.40992
Mean3.0334471
Median Absolute Deviation (MAD)0.5
Skewness10.454239
Sum1777.6
Variance13.281751
MonotonicityNot monotonic
2023-12-11T07:04:12.093770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
2.0 175
29.9%
3.0 80
13.7%
0.0 56
 
9.6%
2.5 49
 
8.4%
4.0 42
 
7.2%
3.5 30
 
5.1%
1.0 30
 
5.1%
4.5 27
 
4.6%
5.0 20
 
3.4%
1.5 18
 
3.1%
Other values (15) 59
 
10.1%
ValueCountFrequency (%)
0.0 56
 
9.6%
1.0 30
 
5.1%
1.5 18
 
3.1%
2.0 175
29.9%
2.3 2
 
0.3%
2.5 49
 
8.4%
3.0 80
13.7%
3.5 30
 
5.1%
4.0 42
 
7.2%
4.5 27
 
4.6%
ValueCountFrequency (%)
60.0 1
 
0.2%
50.0 1
 
0.2%
14.5 1
 
0.2%
12.0 1
 
0.2%
11.5 1
 
0.2%
11.0 3
0.5%
10.5 1
 
0.2%
10.0 2
 
0.3%
9.5 6
1.0%
8.5 2
 
0.3%

설치일자
Date

MISSING 

Distinct33
Distinct (%)9.5%
Missing240
Missing (%)41.0%
Memory size4.7 KiB
Minimum2005-01-01 00:00:00
Maximum2020-12-28 00:00:00
2023-12-11T07:04:12.200967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:12.302523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)

비고
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct50
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
<NA>
399 
투명형
70 
0
 
28
분리(좌측)
 
13
투명형(H=4.0사용)
 
6
Other values (45)
70 

Length

Max length15
Median length4
Mean length4.109215
Min length1

Unique

Unique33 ?
Unique (%)5.6%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 399
68.1%
투명형 70
 
11.9%
0 28
 
4.8%
분리(좌측) 13
 
2.2%
투명형(H=4.0사용) 6
 
1.0%
분리(우측) 6
 
1.0%
목재혼합형 5
 
0.9%
투명형(H=3.0사용) 4
 
0.7%
RAMP-A 4
 
0.7%
교량구간 ( 가유1가도교 ) 4
 
0.7%
Other values (40) 47
 
8.0%

Length

2023-12-11T07:04:12.427934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 399
66.4%
투명형 71
 
11.8%
0 28
 
4.7%
분리(좌측 13
 
2.2%
8
 
1.3%
투명형(h=4.0사용 6
 
1.0%
분리(우측 6
 
1.0%
목재혼합형 5
 
0.8%
투명형(h=3.0사용 4
 
0.7%
ramp-a 4
 
0.7%
Other values (43) 57
 
9.5%

공간경도시점
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing586
Missing (%)100.0%
Memory size5.3 KiB

공간위도시점
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing586
Missing (%)100.0%
Memory size5.3 KiB

공간경도종점
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing586
Missing (%)100.0%
Memory size5.3 KiB

공간위도종점
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing586
Missing (%)100.0%
Memory size5.3 KiB

Interactions

2023-12-11T07:04:09.920377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:07.851121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:08.352124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:08.790594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:09.220062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:10.006277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:07.957924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:08.435380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:08.884848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:09.318371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:10.076549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:08.066877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:08.515289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:08.960731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:09.395879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:10.159672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:08.174903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:08.603469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:09.044610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:09.477139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:10.261335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:08.271231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:08.710538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:09.130701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:04:09.829327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:04:12.503332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
노선번호구간번호위치위치_종점위치?방향종류높이설치일자비고
노선번호1.0000.5120.3840.3810.0000.3450.1420.9850.867
구간번호0.5121.0000.2450.2390.0000.3120.0000.9960.000
위치0.3840.2451.0001.0000.0000.3120.2740.7990.783
위치_종점0.3810.2391.0001.0000.0000.3040.2630.7990.780
위치?방향0.0000.0000.0000.0001.0000.0600.0050.3490.513
종류0.3450.3120.3120.3040.0601.0000.1910.9050.996
높이0.1420.0000.2740.2630.0050.1911.0000.7530.917
설치일자0.9850.9960.7990.7990.3490.9050.7531.0000.977
비고0.8670.0000.7830.7800.5130.9960.9170.9771.000
2023-12-11T07:04:12.620582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종류위치?방향비고
종류1.0000.0390.846
위치?방향0.0391.0000.369
비고0.8460.3691.000
2023-12-11T07:04:12.704513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
노선번호구간번호위치위치_종점높이위치?방향종류비고
노선번호1.000-0.0300.0150.0080.2130.0000.2280.516
구간번호-0.0301.0000.0350.037-0.0110.0000.1260.000
위치0.0150.0351.0000.9990.2250.0000.1900.366
위치_종점0.0080.0370.9991.0000.2230.0000.1850.363
높이0.213-0.0110.2250.2231.0000.0000.1620.776
위치?방향0.0000.0000.0000.0000.0001.0000.0390.369
종류0.2280.1260.1900.1850.1620.0391.0000.846
비고0.5160.0000.3660.3630.7760.3690.8461.000

Missing values

2023-12-11T07:04:10.385543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:04:10.566852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

노선번호구간번호위치위치_종점위치?방향종류높이설치일자비고공간경도시점공간위도시점공간경도종점공간위도종점
0387130.5850.763하행반사형2.520190319<NA><NA><NA><NA><NA>
1387130.8011.008하행반사형2.520190319<NA><NA><NA><NA><NA>
2387131.5611.635상행반사형2.020190319<NA><NA><NA><NA><NA>
3387131.6651.738상행반사형2.020190319<NA><NA><NA><NA><NA>
4387133.5893.666하행반사형2.020190319<NA><NA><NA><NA><NA>
5387133.7013.871상행반사형1.020190319<NA><NA><NA><NA><NA>
6387133.7513.849하행반사형1.020190319<NA><NA><NA><NA><NA>
7387134.3564.437상행반사형2.020190319<NA><NA><NA><NA><NA>
8387134.5014.592하행반사형2.020190319<NA><NA><NA><NA><NA>
9387135.5915.63하행반사형3.020190319<NA><NA><NA><NA><NA>
노선번호구간번호위치위치_종점위치?방향종류높이설치일자비고공간경도시점공간위도시점공간경도종점공간위도종점
57631852.7552.853하행흡음형1.0<NA><NA><NA><NA><NA><NA>
57731852.8553.091상행반사형2.5<NA><NA><NA><NA><NA><NA>
5783181415.54715.746하행흡음형2.0<NA><NA><NA><NA><NA><NA>
57931810.780.88하행흡음형2.0<NA>구름내<NA><NA><NA><NA>
58031811.61.72상행흡음형3.5<NA>은쟁이<NA><NA><NA><NA>
58131812.422.58상행흡음형4.0<NA>돌팍재<NA><NA><NA><NA>
58231812.5832.619하행기타1.0<NA>상안교 탄도방향<NA><NA><NA><NA>
58331812.62.72하행흡음형2.0<NA>지촌말<NA><NA><NA><NA>
58431812.6192.685하행기타2.0<NA>상안교 탄도방향<NA><NA><NA><NA>
58531851.5151.675상행흡음형2.5<NA>가설방음벽<NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

노선번호구간번호위치위치_종점위치?방향종류높이설치일자비고# duplicates
07021.241.472상행흡음형2.02005<NA>2
17022.983.1하행흡음형2.02005<NA>2
28240.240.28하행반사형2.32005<NA>2
38241.661.78상행흡음형2.02005<NA>2
48241.681.84하행흡음형2.02005<NA>2
58242.112.18상행흡음형2.02005<NA>2
68242.242.3하행흡음형2.02005<NA>2
78242.282.401하행흡음형2.02005<NA>2
88242.542.655하행흡음형2.02005<NA>2
98244.284.38하행흡음형2.02005<NA>2