Overview

Dataset statistics

Number of variables7
Number of observations68
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory60.9 B

Variable types

Numeric2
Categorical2
DateTime2
Text1

Dataset

Description경상남도_당해공기연기 데이터입니다. (공사년도, 공사구분, 공사번호, 당해연기시작일, 당해연기마감일, 연기사유 등의 데이터를 포함하고있습니다.)
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15049525

Alerts

부서코드 has constant value ""Constant
공사년도 is highly overall correlated with 공사구분High correlation
공사번호 is highly overall correlated with 공사구분High correlation
공사구분 is highly overall correlated with 공사년도 and 1 other fieldsHigh correlation
공사구분 is highly imbalanced (56.9%)Imbalance

Reproduction

Analysis started2023-12-11 00:58:01.577325
Analysis finished2023-12-11 00:58:02.701114
Duration1.12 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공사년도
Real number (ℝ)

HIGH CORRELATION 

Distinct9
Distinct (%)13.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2011.9265
Minimum2005
Maximum2014
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size744.0 B
2023-12-11T09:58:02.766951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2005
5-th percentile2006
Q12012
median2013
Q32013
95-th percentile2014
Maximum2014
Range9
Interquartile range (IQR)1

Descriptive statistics

Standard deviation2.2679873
Coefficient of variation (CV)0.0011272715
Kurtosis2.9668793
Mean2011.9265
Median Absolute Deviation (MAD)0
Skewness-1.9943125
Sum136811
Variance5.1437665
MonotonicityNot monotonic
2023-12-11T09:58:02.911412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
2013 40
58.8%
2012 8
 
11.8%
2011 5
 
7.4%
2014 5
 
7.4%
2006 3
 
4.4%
2005 2
 
2.9%
2007 2
 
2.9%
2010 2
 
2.9%
2008 1
 
1.5%
ValueCountFrequency (%)
2005 2
 
2.9%
2006 3
 
4.4%
2007 2
 
2.9%
2008 1
 
1.5%
2010 2
 
2.9%
2011 5
 
7.4%
2012 8
 
11.8%
2013 40
58.8%
2014 5
 
7.4%
ValueCountFrequency (%)
2014 5
 
7.4%
2013 40
58.8%
2012 8
 
11.8%
2011 5
 
7.4%
2010 2
 
2.9%
2008 1
 
1.5%
2007 2
 
2.9%
2006 3
 
4.4%
2005 2
 
2.9%

공사구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size676.0 B
공사
62 
용역
 
6

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공사
2nd row공사
3rd row공사
4th row공사
5th row공사

Common Values

ValueCountFrequency (%)
공사 62
91.2%
용역 6
 
8.8%

Length

2023-12-11T09:58:03.021726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:58:03.111208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사 62
91.2%
용역 6
 
8.8%

공사번호
Real number (ℝ)

HIGH CORRELATION 

Distinct47
Distinct (%)69.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean83.985294
Minimum4
Maximum459
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size744.0 B
2023-12-11T09:58:03.231701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile13.35
Q132
median63.5
Q3103.25
95-th percentile230.5
Maximum459
Range455
Interquartile range (IQR)71.25

Descriptive statistics

Standard deviation85.575923
Coefficient of variation (CV)1.0189394
Kurtosis9.7073428
Mean83.985294
Median Absolute Deviation (MAD)34.5
Skewness2.8176564
Sum5711
Variance7323.2386
MonotonicityNot monotonic
2023-12-11T09:58:03.384681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
49 4
 
5.9%
84 3
 
4.4%
119 3
 
4.4%
32 3
 
4.4%
20 3
 
4.4%
18 3
 
4.4%
80 2
 
2.9%
66 2
 
2.9%
88 2
 
2.9%
144 2
 
2.9%
Other values (37) 41
60.3%
ValueCountFrequency (%)
4 1
 
1.5%
12 1
 
1.5%
13 2
2.9%
14 1
 
1.5%
16 1
 
1.5%
18 3
4.4%
19 1
 
1.5%
20 3
4.4%
26 1
 
1.5%
28 1
 
1.5%
ValueCountFrequency (%)
459 1
 
1.5%
456 1
 
1.5%
294 1
 
1.5%
234 1
 
1.5%
224 1
 
1.5%
148 1
 
1.5%
144 2
2.9%
129 2
2.9%
123 2
2.9%
119 3
4.4%

부서코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size676.0 B
1
68 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 68
100.0%

Length

2023-12-11T09:58:03.512279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:58:03.619890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 68
100.0%
Distinct60
Distinct (%)88.2%
Missing0
Missing (%)0.0%
Memory size676.0 B
Minimum2006-09-24 00:00:00
Maximum2015-07-17 00:00:00
2023-12-11T09:58:03.761045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:58:03.895958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct58
Distinct (%)85.3%
Missing0
Missing (%)0.0%
Memory size676.0 B
Minimum2006-11-24 00:00:00
Maximum2016-01-17 00:00:00
2023-12-11T09:58:04.031553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:58:04.180132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct53
Distinct (%)77.9%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-12-11T09:58:04.491180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length23.5
Mean length14.470588
Min length2

Characters and Unicode

Total characters984
Distinct characters140
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)67.6%

Sample

1st row국비지원 지연으로 공기연장
2nd row동절기 시공중지
3rd row퇴직공제 부금 납부 사석투하 추가(증 27㎥)
4th row건의사항 수렴으로 절대공기부족
5th row사업물량 증가
ValueCountFrequency (%)
동절기 25
 
9.3%
연장 22
 
8.1%
따른 22
 
8.1%
공기 15
 
5.6%
공사 11
 
4.1%
연기 7
 
2.6%
부족 6
 
2.2%
공기연장 5
 
1.9%
인한 5
 
1.9%
중지에 5
 
1.9%
Other values (110) 147
54.4%
2023-12-11T09:58:05.013741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
204
20.7%
80
 
8.1%
58
 
5.9%
49
 
5.0%
35
 
3.6%
32
 
3.3%
31
 
3.2%
28
 
2.8%
25
 
2.5%
24
 
2.4%
Other values (130) 418
42.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 761
77.3%
Space Separator 204
 
20.7%
Decimal Number 12
 
1.2%
Close Punctuation 2
 
0.2%
Open Punctuation 2
 
0.2%
Uppercase Letter 2
 
0.2%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
80
 
10.5%
58
 
7.6%
49
 
6.4%
35
 
4.6%
32
 
4.2%
31
 
4.1%
28
 
3.7%
25
 
3.3%
24
 
3.2%
23
 
3.0%
Other values (117) 376
49.4%
Decimal Number
ValueCountFrequency (%)
6 4
33.3%
9 2
16.7%
2 2
16.7%
8 1
 
8.3%
5 1
 
8.3%
4 1
 
8.3%
7 1
 
8.3%
Uppercase Letter
ValueCountFrequency (%)
I 1
50.0%
C 1
50.0%
Space Separator
ValueCountFrequency (%)
204
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 761
77.3%
Common 221
 
22.5%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
80
 
10.5%
58
 
7.6%
49
 
6.4%
35
 
4.6%
32
 
4.2%
31
 
4.1%
28
 
3.7%
25
 
3.3%
24
 
3.2%
23
 
3.0%
Other values (117) 376
49.4%
Common
ValueCountFrequency (%)
204
92.3%
6 4
 
1.8%
9 2
 
0.9%
) 2
 
0.9%
( 2
 
0.9%
2 2
 
0.9%
8 1
 
0.5%
5 1
 
0.5%
4 1
 
0.5%
1
 
0.5%
Latin
ValueCountFrequency (%)
I 1
50.0%
C 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 761
77.3%
ASCII 222
 
22.6%
CJK Compat 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
204
91.9%
6 4
 
1.8%
9 2
 
0.9%
) 2
 
0.9%
( 2
 
0.9%
2 2
 
0.9%
8 1
 
0.5%
5 1
 
0.5%
4 1
 
0.5%
I 1
 
0.5%
Other values (2) 2
 
0.9%
Hangul
ValueCountFrequency (%)
80
 
10.5%
58
 
7.6%
49
 
6.4%
35
 
4.6%
32
 
4.2%
31
 
4.1%
28
 
3.7%
25
 
3.3%
24
 
3.2%
23
 
3.0%
Other values (117) 376
49.4%
CJK Compat
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-11T09:58:02.274903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:58:02.067179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:58:02.385627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:58:02.169107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:58:05.105904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공사년도공사구분공사번호당해연기시작일당해연기마감일연기사유
공사년도1.0000.9100.8900.9880.9360.996
공사구분0.9101.0000.9181.0000.7961.000
공사번호0.8900.9181.0000.7810.0000.959
당해연기시작일0.9881.0000.7811.0000.9870.733
당해연기마감일0.9360.7960.0000.9871.0000.929
연기사유0.9961.0000.9590.7330.9291.000
2023-12-11T09:58:05.208339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공사년도공사번호공사구분
공사년도1.000-0.1960.697
공사번호-0.1961.0000.716
공사구분0.6970.7161.000

Missing values

2023-12-11T09:58:02.522857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:58:02.643751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공사년도공사구분공사번호부서코드당해연기시작일당해연기마감일연기사유
02005공사2612006-09-242006-12-30국비지원 지연으로 공기연장
12005공사3312006-10-262006-11-24동절기 시공중지
22006공사1612007-03-102007-10-15퇴직공제 부금 납부 사석투하 추가(증 27㎥)
32006공사1412007-01-012007-03-12건의사항 수렴으로 절대공기부족
42006공사7412006-10-182007-08-21사업물량 증가
52007용역4912007-12-052008-01-16공직선거법 제86조 제2항 제5호에 의거 공청회 개최 불가에 따른
62007공사9212008-06-232008-10-31토지보상 지연 민원에 따른 조정
72008용역22412008-08-122008-09-11과업량 추가에 따른 용역기간 변경
82010용역23412010-08-172010-06-14감 64일 공사준공기한 도래에 따른 감리용역 과업기간 조정
92011용역45912010-02-082012-01-16예산범위내 조사항목 조정에 따른 기간연장 변경
공사년도공사구분공사번호부서코드당해연기시작일당해연기마감일연기사유
582013공사11912014-08-302014-10-29방재게이트 안전성 검토에 따른 연장
592013공사11912014-07-072014-08-30현장 여건 반영
602013공사12912013-09-032014-07-15민원제기에 따른 공기연장
612013공사12912013-09-032014-05-16동절기로 인한 공기연장
622013공사14812013-01-152015-01-20동절기 공사 중지에 따른 공기 연장
632014공사4612014-12-312015-03-27설계변경에 따른 연기
642014공사4512014-12-312015-03-27총체분 내역변경에 따른 공정계획 반영
652014공사5412014-12-312015-04-06사업량 조정 등
662014공사5712014-10-282015-04-14구제역 방역 및 연결로 협의 기간 과다 소요
672014공사11612015-07-172016-01-17현장여건 반영