Overview

Dataset statistics

Number of variables6
Number of observations2178
Missing cells4356
Missing cells (%)33.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory112.9 KiB
Average record size in memory53.1 B

Variable types

Numeric2
Categorical2
Unsupported2

Dataset

Description경상남도 공사대장시스템의 비고데이터입니다. 공사대장시스템의 공사년도, 공사구분, 비고1, 비고2 등의 데이터를 포함하고 있습니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15049531

Alerts

부서코드 has constant value ""Constant
비고1 has 2178 (100.0%) missing valuesMissing
비고2 has 2178 (100.0%) missing valuesMissing
비고1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
비고2 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 23:03:10.892996
Analysis finished2023-12-10 23:03:11.515651
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공사년도
Real number (ℝ)

Distinct23
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2009.3067
Minimum1991
Maximum2019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size19.3 KiB
2023-12-11T08:03:11.566337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1991
5-th percentile2003
Q12008
median2009
Q32011
95-th percentile2016
Maximum2019
Range28
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.4427613
Coefficient of variation (CV)0.0017134076
Kurtosis1.8395157
Mean2009.3067
Median Absolute Deviation (MAD)2
Skewness-0.26190332
Sum4376270
Variance11.852605
MonotonicityNot monotonic
2023-12-11T08:03:11.672376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
2008 428
19.7%
2009 344
15.8%
2010 275
12.6%
2012 213
9.8%
2007 179
8.2%
2011 166
 
7.6%
2013 129
 
5.9%
2006 107
 
4.9%
2016 63
 
2.9%
2015 47
 
2.2%
Other values (13) 227
10.4%
ValueCountFrequency (%)
1991 1
 
< 0.1%
1995 1
 
< 0.1%
1999 25
 
1.1%
2000 31
 
1.4%
2001 33
 
1.5%
2002 13
 
0.6%
2003 8
 
0.4%
2004 17
 
0.8%
2005 41
 
1.9%
2006 107
4.9%
ValueCountFrequency (%)
2019 23
 
1.1%
2018 16
 
0.7%
2017 13
 
0.6%
2016 63
 
2.9%
2015 47
 
2.2%
2014 5
 
0.2%
2013 129
5.9%
2012 213
9.8%
2011 166
7.6%
2010 275
12.6%

공사구분
Categorical

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size17.1 KiB
공사
997 
용역
987 
기타
148 
구매
 
46

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공사
2nd row공사
3rd row공사
4th row용역
5th row용역

Common Values

ValueCountFrequency (%)
공사 997
45.8%
용역 987
45.3%
기타 148
 
6.8%
구매 46
 
2.1%

Length

2023-12-11T08:03:11.783046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:03:11.897616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사 997
45.8%
용역 987
45.3%
기타 148
 
6.8%
구매 46
 
2.1%

공사번호
Real number (ℝ)

Distinct489
Distinct (%)22.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean151.04132
Minimum1
Maximum619
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size19.3 KiB
2023-12-11T08:03:12.020305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile11
Q152
median114
Q3222
95-th percentile425.45
Maximum619
Range618
Interquartile range (IQR)170

Descriptive statistics

Standard deviation128.22945
Coefficient of variation (CV)0.84896933
Kurtosis1.1570067
Mean151.04132
Median Absolute Deviation (MAD)75
Skewness1.2124481
Sum328968
Variance16442.792
MonotonicityNot monotonic
2023-12-11T08:03:12.173824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11 16
 
0.7%
49 16
 
0.7%
14 15
 
0.7%
39 14
 
0.6%
10 14
 
0.6%
12 14
 
0.6%
43 14
 
0.6%
33 14
 
0.6%
64 14
 
0.6%
9 13
 
0.6%
Other values (479) 2034
93.4%
ValueCountFrequency (%)
1 13
0.6%
2 10
0.5%
3 9
0.4%
4 4
 
0.2%
5 13
0.6%
6 9
0.4%
7 10
0.5%
8 5
 
0.2%
9 13
0.6%
10 14
0.6%
ValueCountFrequency (%)
619 1
< 0.1%
618 1
< 0.1%
616 1
< 0.1%
615 1
< 0.1%
614 1
< 0.1%
607 1
< 0.1%
604 1
< 0.1%
601 1
< 0.1%
595 1
< 0.1%
594 1
< 0.1%

부서코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size17.1 KiB
1
2178 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 2178
100.0%

Length

2023-12-11T08:03:12.313734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:03:12.399093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 2178
100.0%

비고1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2178
Missing (%)100.0%
Memory size19.3 KiB

비고2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2178
Missing (%)100.0%
Memory size19.3 KiB

Interactions

2023-12-11T08:03:11.188374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:03:11.019295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:03:11.281594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:03:11.105039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:03:12.452299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공사년도공사구분공사번호
공사년도1.0000.3960.315
공사구분0.3961.0000.499
공사번호0.3150.4991.000
2023-12-11T08:03:12.528814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공사년도공사번호공사구분
공사년도1.0000.1040.262
공사번호0.1041.0000.321
공사구분0.2620.3211.000

Missing values

2023-12-11T08:03:11.390547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:03:11.479287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공사년도공사구분공사번호부서코드비고1비고2
01991공사711<NA><NA>
11995공사751<NA><NA>
21999공사11<NA><NA>
31999용역51<NA><NA>
41999용역91<NA><NA>
51999공사221<NA><NA>
61999공사321<NA><NA>
71999공사391<NA><NA>
81999공사401<NA><NA>
91999공사441<NA><NA>
공사년도공사구분공사번호부서코드비고1비고2
21682019공사1081<NA><NA>
21692019공사1101<NA><NA>
21702019공사1121<NA><NA>
21712000용역561<NA><NA>
21722012공사1811<NA><NA>
21732011공사2701<NA><NA>
21742019공사1021<NA><NA>
21752011공사2781<NA><NA>
21762017공사721<NA><NA>
21772011공사2731<NA><NA>