Overview

Dataset statistics

Number of variables9
Number of observations773
Missing cells764
Missing cells (%)11.0%
Duplicate rows22
Duplicate rows (%)2.8%
Total size in memory56.7 KiB
Average record size in memory75.2 B

Variable types

Categorical5
DateTime1
Numeric3

Dataset

Description경상남도 사천시 공간정보시스템 데이터베이스 테이블 중 포장 테이블 자료입니다.(보수종류 , 보수공종, 종료일, 연장, 면적, 폭원 등)
Author경상남도 사천시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15091550

Alerts

Dataset has 22 (2.8%) duplicate rowsDuplicates
보수공종 is highly overall correlated with 보수종류 and 3 other fieldsHigh correlation
보수종류 is highly overall correlated with 보수공종 and 1 other fieldsHigh correlation
연장 is highly overall correlated with 면적High correlation
면적 is highly overall correlated with 연장 and 1 other fieldsHigh correlation
폭원 is highly overall correlated with 면적 and 1 other fieldsHigh correlation
차도포장재질 is highly overall correlated with 폭원 and 3 other fieldsHigh correlation
보도포장재질 is highly overall correlated with 보수공종 and 2 other fieldsHigh correlation
이전차도포장재질 is highly overall correlated with 보수종류 and 3 other fieldsHigh correlation
종료일 has 764 (98.8%) missing valuesMissing

Reproduction

Analysis started2023-12-11 00:12:45.480382
Analysis finished2023-12-11 00:12:46.876887
Duration1.4 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

보수종류
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
미분류
540 
신설
116 
기타
108 
전면개수
 
9

Length

Max length4
Median length3
Mean length2.7218629
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미분류
2nd row미분류
3rd row미분류
4th row미분류
5th row미분류

Common Values

ValueCountFrequency (%)
미분류 540
69.9%
신설 116
 
15.0%
기타 108
 
14.0%
전면개수 9
 
1.2%

Length

2023-12-11T09:12:46.951077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:12:47.068518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미분류 540
69.9%
신설 116
 
15.0%
기타 108
 
14.0%
전면개수 9
 
1.2%

보수공종
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
미분류
540 
기타
233 

Length

Max length3
Median length3
Mean length2.698577
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미분류
2nd row미분류
3rd row미분류
4th row미분류
5th row미분류

Common Values

ValueCountFrequency (%)
미분류 540
69.9%
기타 233
30.1%

Length

2023-12-11T09:12:47.204236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:12:47.319938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미분류 540
69.9%
기타 233
30.1%

종료일
Date

MISSING 

Distinct8
Distinct (%)88.9%
Missing764
Missing (%)98.8%
Memory size6.2 KiB
Minimum2001-04-30 00:00:00
Maximum2006-06-02 00:00:00
2023-12-11T09:12:47.406334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:12:47.511563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)

연장
Real number (ℝ)

HIGH CORRELATION 

Distinct667
Distinct (%)86.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean129.04017
Minimum0.01
Maximum1457.33
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.9 KiB
2023-12-11T09:12:47.635464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.01
5-th percentile4.948
Q123.19
median61.03
Q3152.5
95-th percentile513.476
Maximum1457.33
Range1457.32
Interquartile range (IQR)129.31

Descriptive statistics

Standard deviation182.65233
Coefficient of variation (CV)1.4154687
Kurtosis11.536862
Mean129.04017
Median Absolute Deviation (MAD)48.12
Skewness2.9753635
Sum99748.05
Variance33361.872
MonotonicityNot monotonic
2023-12-11T09:12:47.775663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
359.34 13
 
1.7%
521.93 7
 
0.9%
379.75 6
 
0.8%
50.13 5
 
0.6%
9.6 5
 
0.6%
217.85 5
 
0.6%
507.84 4
 
0.5%
40.1 4
 
0.5%
198.05 4
 
0.5%
10.62 4
 
0.5%
Other values (657) 716
92.6%
ValueCountFrequency (%)
0.01 1
0.1%
0.08 1
0.1%
1.04 1
0.1%
1.32 1
0.1%
1.92 1
0.1%
2.0 1
0.1%
2.01 1
0.1%
2.02 1
0.1%
2.28 1
0.1%
2.38 1
0.1%
ValueCountFrequency (%)
1457.33 1
0.1%
1236.0 1
0.1%
1220.69 1
0.1%
1212.43 1
0.1%
1071.38 1
0.1%
930.44 1
0.1%
871.5 1
0.1%
863.7 1
0.1%
859.18 1
0.1%
851.81 1
0.1%

면적
Real number (ℝ)

HIGH CORRELATION 

Distinct686
Distinct (%)88.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1282.2714
Minimum0.04
Maximum37049.35
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.9 KiB
2023-12-11T09:12:47.907087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.04
5-th percentile15.256
Q199.44
median400.6
Q31162.69
95-th percentile5629.032
Maximum37049.35
Range37049.31
Interquartile range (IQR)1063.25

Descriptive statistics

Standard deviation2885.3404
Coefficient of variation (CV)2.2501792
Kurtosis49.240503
Mean1282.2714
Median Absolute Deviation (MAD)352.02
Skewness5.9192294
Sum991195.79
Variance8325189.1
MonotonicityNot monotonic
2023-12-11T09:12:48.040750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
48.58 13
 
1.7%
491.2 7
 
0.9%
919.48 6
 
0.8%
8.54 5
 
0.6%
522.28 5
 
0.6%
63.33 4
 
0.5%
214.06 4
 
0.5%
6.08 4
 
0.5%
1248.07 4
 
0.5%
41.64 4
 
0.5%
Other values (676) 717
92.8%
ValueCountFrequency (%)
0.04 1
 
0.1%
3.07 1
 
0.1%
3.58 1
 
0.1%
4.02 1
 
0.1%
4.91 1
 
0.1%
5.16 1
 
0.1%
5.3 1
 
0.1%
5.88 1
 
0.1%
6.08 4
0.5%
6.3 1
 
0.1%
ValueCountFrequency (%)
37049.35 1
0.1%
26822.85 1
0.1%
20513.88 1
0.1%
20177.2 1
0.1%
19064.16 1
0.1%
18820.62 1
0.1%
18679.53 1
0.1%
14116.19 1
0.1%
13866.09 1
0.1%
12339.98 1
0.1%

폭원
Real number (ℝ)

HIGH CORRELATION 

Distinct392
Distinct (%)50.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.385317
Minimum1.3
Maximum45.3
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.9 KiB
2023-12-11T09:12:48.174350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.3
5-th percentile1.818
Q13.3
median9.1
Q317
95-th percentile28.808
Maximum45.3
Range44
Interquartile range (IQR)13.7

Descriptive statistics

Standard deviation8.7913079
Coefficient of variation (CV)0.7721619
Kurtosis-0.00028751488
Mean11.385317
Median Absolute Deviation (MAD)6.14
Skewness0.87398193
Sum8800.85
Variance77.287095
MonotonicityNot monotonic
2023-12-11T09:12:48.310082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2.5 36
 
4.7%
1.5 25
 
3.2%
8.0 25
 
3.2%
10.0 20
 
2.6%
15.2 14
 
1.8%
2.2 14
 
1.8%
19.9 13
 
1.7%
2.1 13
 
1.7%
2.4 11
 
1.4%
15.0 10
 
1.3%
Other values (382) 592
76.6%
ValueCountFrequency (%)
1.3 1
 
0.1%
1.4 2
 
0.3%
1.44 1
 
0.1%
1.5 25
3.2%
1.57 1
 
0.1%
1.62 1
 
0.1%
1.69 1
 
0.1%
1.7 4
 
0.5%
1.8 3
 
0.4%
1.83 1
 
0.1%
ValueCountFrequency (%)
45.3 1
0.1%
44.7 1
0.1%
38.46 1
0.1%
38.37 1
0.1%
34.47 1
0.1%
34.0 1
0.1%
33.86 1
0.1%
33.37 1
0.1%
33.3 1
0.1%
32.37 1
0.1%

차도포장재질
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
아스팔트콘크리트
440 
미분류
233 
속성나중입력
89 
콘크리트
 
11

Length

Max length8
Median length8
Mean length6.2056921
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row아스팔트콘크리트
2nd row아스팔트콘크리트
3rd row아스팔트콘크리트
4th row아스팔트콘크리트
5th row아스팔트콘크리트

Common Values

ValueCountFrequency (%)
아스팔트콘크리트 440
56.9%
미분류 233
30.1%
속성나중입력 89
 
11.5%
콘크리트 11
 
1.4%

Length

2023-12-11T09:12:48.442034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:12:48.559778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
아스팔트콘크리트 440
56.9%
미분류 233
30.1%
속성나중입력 89
 
11.5%
콘크리트 11
 
1.4%

보도포장재질
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
소형고압블록
339 
미분류
332 
속성나중입력
72 
투수성아스콘
 
15
콘크리트
 
11

Length

Max length6
Median length6
Mean length4.6623545
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미분류
2nd row미분류
3rd row미분류
4th row미분류
5th row미분류

Common Values

ValueCountFrequency (%)
소형고압블록 339
43.9%
미분류 332
42.9%
속성나중입력 72
 
9.3%
투수성아스콘 15
 
1.9%
콘크리트 11
 
1.4%
기타 4
 
0.5%

Length

2023-12-11T09:12:49.011005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:12:49.141948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소형고압블록 339
43.9%
미분류 332
42.9%
속성나중입력 72
 
9.3%
투수성아스콘 15
 
1.9%
콘크리트 11
 
1.4%
기타 4
 
0.5%

이전차도포장재질
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size6.2 KiB
미분류
574 
기타
98 
속성나중입력
90 
아스팔트콘크리트
 
11

Length

Max length8
Median length3
Mean length3.2936611
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미분류
2nd row미분류
3rd row미분류
4th row미분류
5th row미분류

Common Values

ValueCountFrequency (%)
미분류 574
74.3%
기타 98
 
12.7%
속성나중입력 90
 
11.6%
아스팔트콘크리트 11
 
1.4%

Length

2023-12-11T09:12:49.297371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:12:49.450374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미분류 574
74.3%
기타 98
 
12.7%
속성나중입력 90
 
11.6%
아스팔트콘크리트 11
 
1.4%

Interactions

2023-12-11T09:12:46.437736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:12:45.883529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:12:46.171328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:12:46.521606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:12:45.980558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:12:46.270719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:12:46.605456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:12:46.090055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:12:46.361913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:12:49.555819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
보수종류보수공종종료일연장면적폭원차도포장재질보도포장재질이전차도포장재질
보수종류1.0001.000NaN0.3720.0000.4520.6740.5070.847
보수공종1.0001.000NaN0.3350.0000.5120.7580.7760.977
종료일NaNNaN1.0001.0001.0001.000NaNNaNNaN
연장0.3720.3351.0001.0000.7810.3390.3110.0000.360
면적0.0000.0001.0000.7811.0000.2470.1690.1210.182
폭원0.4520.5121.0000.3390.2471.0000.7810.5360.523
차도포장재질0.6740.758NaN0.3110.1690.7811.0000.6900.908
보도포장재질0.5070.776NaN0.0000.1210.5360.6901.0000.699
이전차도포장재질0.8470.977NaN0.3600.1820.5230.9080.6991.000
2023-12-11T09:12:49.701341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차도포장재질보도포장재질보수공종보수종류이전차도포장재질
차도포장재질1.0000.5200.5500.3220.602
보도포장재질0.5201.0000.5800.3500.529
보수공종0.5500.5801.0000.9990.862
보수종류0.3220.3500.9991.0000.501
이전차도포장재질0.6020.5290.8620.5011.000
2023-12-11T09:12:49.801387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연장면적폭원보수종류보수공종차도포장재질보도포장재질이전차도포장재질
연장1.0000.6800.1240.2300.2560.1900.0000.222
면적0.6801.0000.5030.0000.0000.1160.0720.125
폭원0.1240.5031.0000.2810.3820.5940.3160.334
보수종류0.2300.0000.2811.0000.9990.3220.3500.501
보수공종0.2560.0000.3820.9991.0000.5500.5800.862
차도포장재질0.1900.1160.5940.3220.5501.0000.5200.602
보도포장재질0.0000.0720.3160.3500.5800.5201.0000.529
이전차도포장재질0.2220.1250.3340.5010.8620.6020.5291.000

Missing values

2023-12-11T09:12:46.704689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:12:46.825393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

보수종류보수공종종료일연장면적폭원차도포장재질보도포장재질이전차도포장재질
0미분류미분류<NA>462.410112.0420.17아스팔트콘크리트미분류미분류
1미분류미분류<NA>38.991048.0526.42아스팔트콘크리트미분류미분류
2미분류미분류<NA>40.83939.223.52아스팔트콘크리트미분류미분류
3미분류미분류<NA>170.483479.9920.38아스팔트콘크리트미분류미분류
4미분류미분류<NA>35.47746.5921.05아스팔트콘크리트미분류미분류
5미분류미분류<NA>589.3212275.4120.34아스팔트콘크리트미분류미분류
6미분류미분류<NA>55.181482.3126.88아스팔트콘크리트미분류미분류
7미분류미분류<NA>473.8110637.2520.57아스팔트콘크리트미분류미분류
8미분류미분류<NA>22.62564.1325.02아스팔트콘크리트미분류미분류
9미분류미분류<NA>42.12124.882.95콘크리트미분류미분류
보수종류보수공종종료일연장면적폭원차도포장재질보도포장재질이전차도포장재질
763미분류미분류<NA>282.972567.848.97아스팔트콘크리트미분류미분류
764미분류미분류<NA>396.363642.869.0아스팔트콘크리트미분류미분류
765미분류미분류<NA>32.01351.8511.94아스팔트콘크리트미분류미분류
766미분류미분류<NA>471.884814.659.0아스팔트콘크리트미분류미분류
767미분류미분류<NA>147.01301.161.96미분류소형고압블록미분류
768미분류미분류<NA>147.32306.41.97미분류소형고압블록미분류
769미분류미분류<NA>145.481176.258.05아스팔트콘크리트미분류미분류
770미분류미분류<NA>33.666.361.97미분류소형고압블록미분류
771미분류미분류<NA>42.0383.141.98미분류소형고압블록미분류
772미분류미분류<NA>36.54867.7130.0아스팔트콘크리트미분류미분류

Duplicate rows

Most frequently occurring

보수종류보수공종종료일연장면적폭원차도포장재질보도포장재질이전차도포장재질# duplicates
8기타기타<NA>359.3448.5815.2속성나중입력소형고압블록속성나중입력12
20신설기타<NA>521.93491.215.0속성나중입력소형고압블록속성나중입력6
9기타기타<NA>379.75919.4819.7속성나중입력소형고압블록속성나중입력5
0기타기타<NA>9.68.5420.8속성나중입력소형고압블록속성나중입력4
7기타기타<NA>217.85522.2819.9속성나중입력소형고압블록속성나중입력4
2기타기타<NA>40.141.6415.6속성나중입력소형고압블록속성나중입력3
3기타기타<NA>50.1363.3321.9속성나중입력소형고압블록속성나중입력3
5기타기타<NA>174.05303.6412.0속성나중입력소형고압블록속성나중입력3
6기타기타<NA>198.05214.0619.9속성나중입력소형고압블록속성나중입력3
14신설기타<NA>10.626.0820.5속성나중입력소형고압블록속성나중입력3