Overview

Dataset statistics

Number of variables9
Number of observations3038
Missing cells3727
Missing cells (%)13.6%
Duplicate rows6
Duplicate rows (%)0.2%
Total size in memory225.6 KiB
Average record size in memory76.0 B

Variable types

Numeric3
Categorical3
Text3

Dataset

Description경상남도 공사계약시스템의 공동도급자 데이터입니다. 공사년도, 공사구분, 업체명, 금액, 비율, 계약분야 등의 데이터를 포함하고있습니다.
Author경상남도
URLhttps://www.data.go.kr/data/15049527/fileData.do

Alerts

부서코드 has constant value ""Constant
Dataset has 6 (0.2%) duplicate rowsDuplicates
분담방식 is highly imbalanced (54.5%)Imbalance
금액 has 1229 (40.5%) missing valuesMissing
비율 has 195 (6.4%) missing valuesMissing
계약분야 has 2302 (75.8%) missing valuesMissing
금액 is highly skewed (γ1 = 21.3874501)Skewed
금액 has 77 (2.5%) zerosZeros

Reproduction

Analysis started2023-12-12 12:52:10.505403
Analysis finished2023-12-12 12:52:12.795301
Duration2.29 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공사년도
Real number (ℝ)

Distinct30
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2006.7633
Minimum1990
Maximum2019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.8 KiB
2023-12-12T21:52:12.867793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1990
5-th percentile1996
Q12003
median2007
Q32011
95-th percentile2017
Maximum2019
Range29
Interquartile range (IQR)8

Descriptive statistics

Standard deviation6.0505182
Coefficient of variation (CV)0.0030150632
Kurtosis-0.14898723
Mean2006.7633
Median Absolute Deviation (MAD)4
Skewness-0.318829
Sum6096547
Variance36.60877
MonotonicityNot monotonic
2023-12-12T21:52:13.070673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
2010 318
 
10.5%
2009 197
 
6.5%
2007 196
 
6.5%
2008 189
 
6.2%
2011 172
 
5.7%
2006 169
 
5.6%
2005 165
 
5.4%
2012 161
 
5.3%
2003 161
 
5.3%
2004 149
 
4.9%
Other values (20) 1161
38.2%
ValueCountFrequency (%)
1990 22
 
0.7%
1991 23
 
0.8%
1992 12
 
0.4%
1993 10
 
0.3%
1994 14
 
0.5%
1995 38
1.3%
1996 51
1.7%
1997 66
2.2%
1998 61
2.0%
1999 70
2.3%
ValueCountFrequency (%)
2019 47
 
1.5%
2018 64
 
2.1%
2017 68
 
2.2%
2016 52
 
1.7%
2015 56
 
1.8%
2014 57
 
1.9%
2013 114
 
3.8%
2012 161
5.3%
2011 172
5.7%
2010 318
10.5%

공사구분
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
공사
2048 
용역
986 
구매
 
4

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공사
2nd row공사
3rd row공사
4th row공사
5th row공사

Common Values

ValueCountFrequency (%)
공사 2048
67.4%
용역 986
32.5%
구매 4
 
0.1%

Length

2023-12-12T21:52:13.250811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:52:13.369478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사 2048
67.4%
용역 986
32.5%
구매 4
 
0.1%

공사번호
Real number (ℝ)

Distinct361
Distinct (%)11.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean120.35253
Minimum1
Maximum566
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.8 KiB
2023-12-12T21:52:13.490951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9
Q143
median73
Q3153
95-th percentile420
Maximum566
Range565
Interquartile range (IQR)110

Descriptive statistics

Standard deviation119.69481
Coefficient of variation (CV)0.99453503
Kurtosis1.7646911
Mean120.35253
Median Absolute Deviation (MAD)41
Skewness1.5972811
Sum365631
Variance14326.848
MonotonicityNot monotonic
2023-12-12T21:52:13.661853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
45 42
 
1.4%
21 39
 
1.3%
46 38
 
1.3%
63 35
 
1.2%
41 34
 
1.1%
61 34
 
1.1%
73 33
 
1.1%
72 33
 
1.1%
51 31
 
1.0%
58 31
 
1.0%
Other values (351) 2688
88.5%
ValueCountFrequency (%)
1 26
0.9%
2 21
0.7%
3 12
0.4%
4 19
0.6%
5 24
0.8%
6 12
0.4%
7 26
0.9%
8 3
 
0.1%
9 10
 
0.3%
10 13
0.4%
ValueCountFrequency (%)
566 2
 
0.1%
524 4
0.1%
508 2
 
0.1%
505 3
0.1%
496 2
 
0.1%
495 3
0.1%
488 2
 
0.1%
487 5
0.2%
486 2
 
0.1%
485 4
0.1%

부서코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
1
3038 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 3038
100.0%

Length

2023-12-12T21:52:13.831895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:52:13.952860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 3038
100.0%
Distinct717
Distinct (%)23.6%
Missing1
Missing (%)< 0.1%
Memory size23.9 KiB
2023-12-12T21:52:14.211090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length21
Mean length7.9199868
Min length1

Characters and Unicode

Total characters24053
Distinct characters315
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique351 ?
Unique (%)11.6%

Sample

1st row대창건설(주)
2nd row(주)대백종합건설
3rd row일신진흥건설(주)
4th row대창건설(주)
5th row롯데건설(주)
ValueCountFrequency (%)
주)대아건설 63
 
2.0%
현대건설(주 51
 
1.7%
주)덕성 49
 
1.6%
주)천진엔지니어링 45
 
1.5%
흥한건설(주 42
 
1.4%
삼성물산(주 42
 
1.4%
삼부토건(주 41
 
1.3%
주)한성개발공사 37
 
1.2%
주)다산컨설턴트 37
 
1.2%
주)청암엔지니어링 35
 
1.1%
Other values (714) 2639
85.7%
2023-12-12T21:52:14.658598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 2895
 
12.0%
) 2894
 
12.0%
2885
 
12.0%
1620
 
6.7%
1397
 
5.8%
448
 
1.9%
415
 
1.7%
404
 
1.7%
403
 
1.7%
339
 
1.4%
Other values (305) 10353
43.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18115
75.3%
Open Punctuation 2895
 
12.0%
Close Punctuation 2894
 
12.0%
Uppercase Letter 80
 
0.3%
Space Separator 44
 
0.2%
Decimal Number 13
 
0.1%
Math Symbol 8
 
< 0.1%
Other Punctuation 3
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2885
 
15.9%
1620
 
8.9%
1397
 
7.7%
448
 
2.5%
415
 
2.3%
404
 
2.2%
403
 
2.2%
339
 
1.9%
338
 
1.9%
338
 
1.9%
Other values (275) 9528
52.6%
Uppercase Letter
ValueCountFrequency (%)
S 21
26.2%
K 14
17.5%
G 9
11.2%
I 7
 
8.8%
N 6
 
7.5%
L 6
 
7.5%
E 3
 
3.8%
T 2
 
2.5%
A 2
 
2.5%
O 2
 
2.5%
Other values (7) 8
 
10.0%
Decimal Number
ValueCountFrequency (%)
0 3
23.1%
4 3
23.1%
1 2
15.4%
3 2
15.4%
2 2
15.4%
6 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
: 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 2895
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2894
100.0%
Space Separator
ValueCountFrequency (%)
44
100.0%
Math Symbol
ValueCountFrequency (%)
8
100.0%
Lowercase Letter
ValueCountFrequency (%)
i 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18115
75.3%
Common 5857
 
24.4%
Latin 81
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2885
 
15.9%
1620
 
8.9%
1397
 
7.7%
448
 
2.5%
415
 
2.3%
404
 
2.2%
403
 
2.2%
339
 
1.9%
338
 
1.9%
338
 
1.9%
Other values (275) 9528
52.6%
Latin
ValueCountFrequency (%)
S 21
25.9%
K 14
17.3%
G 9
11.1%
I 7
 
8.6%
N 6
 
7.4%
L 6
 
7.4%
E 3
 
3.7%
T 2
 
2.5%
A 2
 
2.5%
O 2
 
2.5%
Other values (8) 9
11.1%
Common
ValueCountFrequency (%)
( 2895
49.4%
) 2894
49.4%
44
 
0.8%
8
 
0.1%
0 3
 
0.1%
4 3
 
0.1%
1 2
 
< 0.1%
3 2
 
< 0.1%
. 2
 
< 0.1%
2 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18115
75.3%
ASCII 5930
 
24.7%
Arrows 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 2895
48.8%
) 2894
48.8%
44
 
0.7%
S 21
 
0.4%
K 14
 
0.2%
G 9
 
0.2%
I 7
 
0.1%
N 6
 
0.1%
L 6
 
0.1%
0 3
 
0.1%
Other values (19) 31
 
0.5%
Hangul
ValueCountFrequency (%)
2885
 
15.9%
1620
 
8.9%
1397
 
7.7%
448
 
2.5%
415
 
2.3%
404
 
2.2%
403
 
2.2%
339
 
1.9%
338
 
1.9%
338
 
1.9%
Other values (275) 9528
52.6%
Arrows
ValueCountFrequency (%)
8
100.0%

금액
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct1351
Distinct (%)74.7%
Missing1229
Missing (%)40.5%
Infinite0
Infinite (%)0.0%
Mean1.0589634 × 109
Minimum0
Maximum1.26763 × 1011
Zeros77
Zeros (%)2.5%
Negative0
Negative (%)0.0%
Memory size26.8 KiB
2023-12-12T21:52:14.811352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile565219.6
Q11.14666 × 108
median3.3 × 108
Q38.92272 × 108
95-th percentile3.678438 × 109
Maximum1.26763 × 1011
Range1.26763 × 1011
Interquartile range (IQR)7.77606 × 108

Descriptive statistics

Standard deviation4.2641494 × 109
Coefficient of variation (CV)4.0267201
Kurtosis567.20952
Mean1.0589634 × 109
Median Absolute Deviation (MAD)2.79719 × 108
Skewness21.38745
Sum1.9156649 × 1012
Variance1.818297 × 1019
MonotonicityNot monotonic
2023-12-12T21:52:14.957590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 77
 
2.5%
124725250 6
 
0.2%
1943200000 6
 
0.2%
116348750 6
 
0.2%
391881000 6
 
0.2%
173151000 5
 
0.2%
40140000 5
 
0.2%
88320000 5
 
0.2%
150000000 5
 
0.2%
120090000 4
 
0.1%
Other values (1341) 1684
55.4%
(Missing) 1229
40.5%
ValueCountFrequency (%)
0 77
2.5%
65 1
 
< 0.1%
108550 3
 
0.1%
187740 1
 
< 0.1%
329918 1
 
< 0.1%
348660 1
 
< 0.1%
440000 1
 
< 0.1%
473110 1
 
< 0.1%
548446 1
 
< 0.1%
550000 4
 
0.1%
ValueCountFrequency (%)
126763000000 1
 
< 0.1%
97953320000 1
 
< 0.1%
28157677500 1
 
< 0.1%
22893700340 2
0.1%
22526142000 1
 
< 0.1%
20957090000 1
 
< 0.1%
20166860000 1
 
< 0.1%
19530500000 1
 
< 0.1%
15520560000 1
 
< 0.1%
14404900000 3
0.1%

비율
Text

MISSING 

Distinct251
Distinct (%)8.8%
Missing195
Missing (%)6.4%
Memory size23.9 KiB
2023-12-12T21:52:15.220334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length2
Mean length2.2099894
Min length1

Characters and Unicode

Total characters6283
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique139 ?
Unique (%)4.9%

Sample

1st row42.64
2nd row57.36
3rd row50
4th row50
5th row50
ValueCountFrequency (%)
50 266
 
9.4%
30 233
 
8.2%
10 222
 
7.8%
20 183
 
6.4%
40 156
 
5.5%
70 154
 
5.4%
25 148
 
5.2%
60 109
 
3.8%
15 102
 
3.6%
5 69
 
2.4%
Other values (241) 1201
42.2%
2023-12-12T21:52:15.630072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1519
24.2%
5 1030
16.4%
1 658
10.5%
2 552
 
8.8%
3 549
 
8.7%
4 433
 
6.9%
7 408
 
6.5%
. 358
 
5.7%
6 304
 
4.8%
9 248
 
3.9%
Other values (2) 224
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5921
94.2%
Other Punctuation 358
 
5.7%
Dash Punctuation 4
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1519
25.7%
5 1030
17.4%
1 658
11.1%
2 552
 
9.3%
3 549
 
9.3%
4 433
 
7.3%
7 408
 
6.9%
6 304
 
5.1%
9 248
 
4.2%
8 220
 
3.7%
Other Punctuation
ValueCountFrequency (%)
. 358
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6283
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1519
24.2%
5 1030
16.4%
1 658
10.5%
2 552
 
8.8%
3 549
 
8.7%
4 433
 
6.9%
7 408
 
6.5%
. 358
 
5.7%
6 304
 
4.8%
9 248
 
3.9%
Other values (2) 224
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6283
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1519
24.2%
5 1030
16.4%
1 658
10.5%
2 552
 
8.8%
3 549
 
8.7%
4 433
 
6.9%
7 408
 
6.5%
. 358
 
5.7%
6 304
 
4.8%
9 248
 
3.9%
Other values (2) 224
 
3.6%

계약분야
Text

MISSING 

Distinct124
Distinct (%)16.8%
Missing2302
Missing (%)75.8%
Memory size23.9 KiB
2023-12-12T21:52:15.853850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length3.3505435
Min length1

Characters and Unicode

Total characters2466
Distinct characters140
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)9.5%

Sample

1st row-
2nd row-
3rd row-
4th row공동이행방식
5th row공동이행방식
ValueCountFrequency (%)
167
22.0%
토목 104
 
13.7%
공동이행 71
 
9.4%
전분야 59
 
7.8%
공동이행방식 25
 
3.3%
책임감리 24
 
3.2%
전기 14
 
1.8%
건축 14
 
1.8%
토목공사업 11
 
1.4%
하천대장작성 10
 
1.3%
Other values (116) 260
34.3%
2023-12-12T21:52:16.247571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 166
 
6.7%
157
 
6.4%
130
 
5.3%
125
 
5.1%
112
 
4.5%
103
 
4.2%
103
 
4.2%
101
 
4.1%
74
 
3.0%
72
 
2.9%
Other values (130) 1323
53.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2222
90.1%
Dash Punctuation 166
 
6.7%
Decimal Number 31
 
1.3%
Space Separator 23
 
0.9%
Other Punctuation 14
 
0.6%
Close Punctuation 3
 
0.1%
Open Punctuation 3
 
0.1%
Uppercase Letter 3
 
0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
157
 
7.1%
130
 
5.9%
125
 
5.6%
112
 
5.0%
103
 
4.6%
103
 
4.6%
101
 
4.5%
74
 
3.3%
72
 
3.2%
67
 
3.0%
Other values (107) 1178
53.0%
Decimal Number
ValueCountFrequency (%)
0 8
25.8%
4 5
16.1%
8 4
12.9%
1 3
 
9.7%
5 2
 
6.5%
7 2
 
6.5%
9 2
 
6.5%
3 2
 
6.5%
2 2
 
6.5%
6 1
 
3.2%
Other Punctuation
ValueCountFrequency (%)
. 8
57.1%
% 3
 
21.4%
? 1
 
7.1%
; 1
 
7.1%
/ 1
 
7.1%
Uppercase Letter
ValueCountFrequency (%)
Q 1
33.3%
B 1
33.3%
D 1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 166
100.0%
Space Separator
ValueCountFrequency (%)
23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2222
90.1%
Common 241
 
9.8%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
157
 
7.1%
130
 
5.9%
125
 
5.6%
112
 
5.0%
103
 
4.6%
103
 
4.6%
101
 
4.5%
74
 
3.3%
72
 
3.2%
67
 
3.0%
Other values (107) 1178
53.0%
Common
ValueCountFrequency (%)
- 166
68.9%
23
 
9.5%
. 8
 
3.3%
0 8
 
3.3%
4 5
 
2.1%
8 4
 
1.7%
% 3
 
1.2%
) 3
 
1.2%
( 3
 
1.2%
1 3
 
1.2%
Other values (10) 15
 
6.2%
Latin
ValueCountFrequency (%)
Q 1
33.3%
B 1
33.3%
D 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2222
90.1%
ASCII 244
 
9.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 166
68.0%
23
 
9.4%
. 8
 
3.3%
0 8
 
3.3%
4 5
 
2.0%
8 4
 
1.6%
% 3
 
1.2%
) 3
 
1.2%
( 3
 
1.2%
1 3
 
1.2%
Other values (13) 18
 
7.4%
Hangul
ValueCountFrequency (%)
157
 
7.1%
130
 
5.9%
125
 
5.6%
112
 
5.0%
103
 
4.6%
103
 
4.6%
101
 
4.5%
74
 
3.3%
72
 
3.2%
67
 
3.0%
Other values (107) 1178
53.0%

분담방식
Categorical

IMBALANCE 

Distinct10
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
<NA>
1542 
공동이행방식
1185 
분담이행방식
213 
주계약자관리방식
 
48
공동이행
 
29
Other values (5)
 
21

Length

Max length8
Median length4
Mean length4.9664253
Min length1

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row-
2nd row-
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 1542
50.8%
공동이행방식 1185
39.0%
분담이행방식 213
 
7.0%
주계약자관리방식 48
 
1.6%
공동이행 29
 
1.0%
- 16
 
0.5%
토공 2
 
0.1%
전기분야 1
 
< 0.1%
소방분야 1
 
< 0.1%
건축분야 1
 
< 0.1%

Length

2023-12-12T21:52:16.386036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:52:16.500701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 1542
50.8%
공동이행방식 1185
39.0%
분담이행방식 213
 
7.0%
주계약자관리방식 48
 
1.6%
공동이행 29
 
1.0%
16
 
0.5%
토공 2
 
0.1%
전기분야 1
 
< 0.1%
소방분야 1
 
< 0.1%
건축분야 1
 
< 0.1%

Interactions

2023-12-12T21:52:11.768607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:52:11.110596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:52:11.441749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:52:11.875938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:52:11.219241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:52:11.549916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:52:11.971773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:52:11.339153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:52:11.663841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:52:16.594708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공사년도공사구분공사번호금액분담방식
공사년도1.0000.4700.6170.0000.603
공사구분0.4701.0000.5900.0000.366
공사번호0.6170.5901.0000.0000.315
금액0.0000.0000.0001.0000.000
분담방식0.6030.3660.3150.0001.000
2023-12-12T21:52:16.691844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분담방식공사구분
분담방식1.0000.365
공사구분0.3651.000
2023-12-12T21:52:16.771941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공사년도공사번호금액공사구분분담방식
공사년도1.0000.354-0.0240.3220.369
공사번호0.3541.000-0.1830.4270.148
금액-0.024-0.1831.0000.0000.000
공사구분0.3220.4270.0001.0000.365
분담방식0.3690.1480.0000.3651.000

Missing values

2023-12-12T21:52:12.123205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:52:12.271638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T21:52:12.381296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

공사년도공사구분공사번호부서코드업체명금액비율계약분야분담방식
01990공사21대창건설(주)33579980742.64--
11990공사21(주)대백종합건설45172319357.36--
21990공사171일신진흥건설(주)36203750050-<NA>
31990공사171대창건설(주)36203750050<NA><NA>
41990공사211롯데건설(주)22090000050<NA><NA>
51990공사211서진산업(주)22090000050<NA><NA>
61990공사231은하산업(주)9255900030<NA><NA>
71990공사231대능건설(주)21597100070<NA><NA>
81990공사581명신산업(주)34535000050<NA><NA>
91990공사581유원건설(주)34535000050<NA><NA>
공사년도공사구분공사번호부서코드업체명금액비율계약분야분담방식
30282019공사1111(주)삼호337217100051토목공동이행방식
30292018공사731(주)토지종합건설133002818550토목공동이행방식
30302018공사731(주)영원종합건설133002818550토목공동이행방식
30312018공사361(주)태영건설134705500041-공동이행방식
30322018공사361용원씨에스(주)128134500039-공동이행방식
30332017공사1321일광이앤씨 주식회사318040145049.9-공동이행방식
30342017공사1321청산종합건설(주)319314855050.1-공동이행방식
30352017공사721(주)토지종합건설365526500050-공동이행방식
30362017공사721(주)영원종합건설365526500050-공동이행방식
30372018공사361(주)대아건설65710000020-공동이행방식

Duplicate rows

Most frequently occurring

공사년도공사구분공사번호부서코드업체명금액비율계약분야분담방식# duplicates
01995공사931(주)동명전기7704048295.8<NA><NA>2
11996공사451(주)신일건업90216490070<NA><NA>2
22002공사431동아건설(주)<NA><NA><NA><NA>2
32005용역501(주)한맥기술<NA>40<NA>공동이행방식2
42005용역761(주)큰길엔지니어링<NA>20<NA>공동이행방식2
52009용역2741(주)청암엔지니어링<NA>30<NA>공동이행방식2