Overview

Dataset statistics

Number of variables6
Number of observations603
Missing cells4
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory29.0 KiB
Average record size in memory49.2 B

Variable types

Text2
Numeric1
DateTime2
Categorical1

Dataset

Description서울주택도시공사(SH공사)의 공사계약의 계약명, 입찰방법, 계약대상, 계약시작일, 계약종료일 등을 포함하는 정보입니다
URLhttps://www.data.go.kr/data/3045249/fileData.do

Alerts

입찰방법 is highly imbalanced (87.3%)Imbalance

Reproduction

Analysis started2023-12-12 13:09:21.947144
Analysis finished2023-12-12 13:09:22.809733
Duration0.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct585
Distinct (%)97.2%
Missing1
Missing (%)0.2%
Memory size4.8 KiB
2023-12-12T22:09:23.121664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length39
Mean length24.041528
Min length7

Characters and Unicode

Total characters14473
Distinct characters299
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique568 ?
Unique (%)94.4%

Sample

1st row빈집활용사업 4차 철거공사
2nd row은평 문화공원3 문화재 보수정비공사
3rd row중계4단지 수도꼭지 교체공사
4th row중계3단지 수도꼭지 교체공사
5th row신월동 460-11 공동체주택 건설공사
ValueCountFrequency (%)
임대아파트 173
 
6.3%
시설물 155
 
5.6%
아파트 133
 
4.8%
113
 
4.1%
유지보수공사 108
 
3.9%
건설공사 95
 
3.4%
전기공사 76
 
2.7%
정보통신공사 64
 
2.3%
보수공사 47
 
1.7%
교체공사 45
 
1.6%
Other values (620) 1757
63.5%
2023-12-12T22:09:23.560764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2171
 
15.0%
816
 
5.6%
668
 
4.6%
629
 
4.3%
325
 
2.2%
324
 
2.2%
321
 
2.2%
2 319
 
2.2%
315
 
2.2%
295
 
2.0%
Other values (289) 8290
57.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10927
75.5%
Space Separator 2171
 
15.0%
Decimal Number 1022
 
7.1%
Dash Punctuation 107
 
0.7%
Uppercase Letter 98
 
0.7%
Open Punctuation 58
 
0.4%
Close Punctuation 58
 
0.4%
Other Punctuation 25
 
0.2%
Lowercase Letter 6
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
816
 
7.5%
668
 
6.1%
629
 
5.8%
325
 
3.0%
324
 
3.0%
321
 
2.9%
315
 
2.9%
295
 
2.7%
265
 
2.4%
261
 
2.4%
Other values (260) 6708
61.4%
Decimal Number
ValueCountFrequency (%)
2 319
31.2%
1 262
25.6%
0 134
13.1%
3 97
 
9.5%
4 65
 
6.4%
5 37
 
3.6%
6 35
 
3.4%
8 29
 
2.8%
7 28
 
2.7%
9 16
 
1.6%
Uppercase Letter
ValueCountFrequency (%)
L 30
30.6%
B 18
18.4%
A 16
16.3%
E 13
13.3%
D 13
13.3%
U 2
 
2.0%
C 2
 
2.0%
S 2
 
2.0%
H 2
 
2.0%
Lowercase Letter
ValueCountFrequency (%)
i 2
33.3%
t 2
33.3%
y 2
33.3%
Other Punctuation
ValueCountFrequency (%)
/ 22
88.0%
, 3
 
12.0%
Space Separator
ValueCountFrequency (%)
2171
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 107
100.0%
Open Punctuation
ValueCountFrequency (%)
( 58
100.0%
Close Punctuation
ValueCountFrequency (%)
) 58
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10927
75.5%
Common 3442
 
23.8%
Latin 104
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
816
 
7.5%
668
 
6.1%
629
 
5.8%
325
 
3.0%
324
 
3.0%
321
 
2.9%
315
 
2.9%
295
 
2.7%
265
 
2.4%
261
 
2.4%
Other values (260) 6708
61.4%
Common
ValueCountFrequency (%)
2171
63.1%
2 319
 
9.3%
1 262
 
7.6%
0 134
 
3.9%
- 107
 
3.1%
3 97
 
2.8%
4 65
 
1.9%
( 58
 
1.7%
) 58
 
1.7%
5 37
 
1.1%
Other values (7) 134
 
3.9%
Latin
ValueCountFrequency (%)
L 30
28.8%
B 18
17.3%
A 16
15.4%
E 13
12.5%
D 13
12.5%
U 2
 
1.9%
C 2
 
1.9%
i 2
 
1.9%
t 2
 
1.9%
y 2
 
1.9%
Other values (2) 4
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10927
75.5%
ASCII 3546
 
24.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2171
61.2%
2 319
 
9.0%
1 262
 
7.4%
0 134
 
3.8%
- 107
 
3.0%
3 97
 
2.7%
4 65
 
1.8%
( 58
 
1.6%
) 58
 
1.6%
5 37
 
1.0%
Other values (19) 238
 
6.7%
Hangul
ValueCountFrequency (%)
816
 
7.5%
668
 
6.1%
629
 
5.8%
325
 
3.0%
324
 
3.0%
321
 
2.9%
315
 
2.9%
295
 
2.7%
265
 
2.4%
261
 
2.4%
Other values (260) 6708
61.4%

계약금액(원)
Real number (ℝ)

Distinct599
Distinct (%)99.5%
Missing1
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean1.6730692 × 1010
Minimum1.001189 × 109
Maximum7.27 × 1011
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.4 KiB
2023-12-12T22:09:23.691128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.001189 × 109
5-th percentile1.1138247 × 109
Q11.6250082 × 109
median2.9745972 × 109
Q37.769076 × 109
95-th percentile9.3919071 × 1010
Maximum7.27 × 1011
Range7.2599881 × 1011
Interquartile range (IQR)6.1440678 × 109

Descriptive statistics

Standard deviation5.1985744 × 1010
Coefficient of variation (CV)3.1072082
Kurtosis108.57806
Mean1.6730692 × 1010
Median Absolute Deviation (MAD)1.6446916 × 109
Skewness8.9533447
Sum1.0071877 × 1013
Variance2.7025176 × 1021
MonotonicityNot monotonic
2023-12-12T22:09:23.826960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
173000000000 2
 
0.3%
138000000000 2
 
0.3%
131000000000 2
 
0.3%
7180122000 1
 
0.2%
50069103000 1
 
0.2%
3211756317 1
 
0.2%
5673770000 1
 
0.2%
1869446377 1
 
0.2%
1903723000 1
 
0.2%
1741569940 1
 
0.2%
Other values (589) 589
97.7%
ValueCountFrequency (%)
1001188980 1
0.2%
1004857000 1
0.2%
1012679600 1
0.2%
1013042070 1
0.2%
1014826570 1
0.2%
1015156200 1
0.2%
1016749810 1
0.2%
1017212680 1
0.2%
1024131000 1
0.2%
1024279000 1
0.2%
ValueCountFrequency (%)
727000000000 1
0.2%
700000000000 1
0.2%
261000000000 1
0.2%
225000000000 1
0.2%
189000000000 1
0.2%
180000000000 1
0.2%
173000000000 2
0.3%
170000000000 1
0.2%
169000000000 1
0.2%
159000000000 1
0.2%
Distinct289
Distinct (%)48.0%
Missing1
Missing (%)0.2%
Memory size4.8 KiB
Minimum2006-06-29 00:00:00
Maximum2023-06-29 00:00:00
2023-12-12T22:09:23.956745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:09:24.087168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct259
Distinct (%)43.0%
Missing1
Missing (%)0.2%
Memory size4.8 KiB
Minimum2013-03-31 00:00:00
Maximum2026-11-06 00:00:00
2023-12-12T22:09:24.211871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:09:24.335225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

입찰방법
Categorical

IMBALANCE 

Distinct6
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
제한경쟁
576 
일반경쟁
 
17
수의계약
 
7
조달청계약
 
1
조달계약
 
1

Length

Max length5
Median length4
Mean length4.0016584
Min length4

Unique

Unique3 ?
Unique (%)0.5%

Sample

1st row일반경쟁
2nd row제한경쟁
3rd row제한경쟁
4th row제한경쟁
5th row제한경쟁

Common Values

ValueCountFrequency (%)
제한경쟁 576
95.5%
일반경쟁 17
 
2.8%
수의계약 7
 
1.2%
조달청계약 1
 
0.2%
조달계약 1
 
0.2%
<NA> 1
 
0.2%

Length

2023-12-12T22:09:24.451860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:09:24.551558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제한경쟁 576
95.5%
일반경쟁 17
 
2.8%
수의계약 7
 
1.2%
조달청계약 1
 
0.2%
조달계약 1
 
0.2%
na 1
 
0.2%
Distinct521
Distinct (%)86.4%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2023-12-12T22:09:24.772675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length20
Mean length8.7446103
Min length4

Characters and Unicode

Total characters5273
Distinct characters308
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique458 ?
Unique (%)76.0%

Sample

1st row주식회사 청하건설
2nd row삼부토건 주식회사
3rd row상지이앤씨 (주)
4th row주식회사 엑타
5th row주식회사 디엔아이컨스트럭션
ValueCountFrequency (%)
주식회사 215
 
25.8%
한신공영(주 5
 
0.6%
5
 
0.6%
금호산업(주 4
 
0.5%
대흥토건 4
 
0.5%
계룡건설산업(주 4
 
0.5%
진흥기업(주 4
 
0.5%
두산건설 3
 
0.4%
株式會社 3
 
0.4%
주식회사재왕건설 3
 
0.4%
Other values (520) 583
70.0%
2023-12-12T22:09:25.192595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
589
 
11.2%
( 330
 
6.3%
) 329
 
6.2%
287
 
5.4%
259
 
4.9%
256
 
4.9%
234
 
4.4%
232
 
4.4%
201
 
3.8%
114
 
2.2%
Other values (298) 2442
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4367
82.8%
Open Punctuation 330
 
6.3%
Close Punctuation 329
 
6.2%
Space Separator 232
 
4.4%
Uppercase Letter 9
 
0.2%
Lowercase Letter 2
 
< 0.1%
Decimal Number 2
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
589
 
13.5%
287
 
6.6%
259
 
5.9%
256
 
5.9%
234
 
5.4%
201
 
4.6%
114
 
2.6%
69
 
1.6%
67
 
1.5%
67
 
1.5%
Other values (280) 2224
50.9%
Uppercase Letter
ValueCountFrequency (%)
S 1
11.1%
Y 1
11.1%
E 1
11.1%
N 1
11.1%
C 1
11.1%
L 1
11.1%
T 1
11.1%
D 1
11.1%
A 1
11.1%
Lowercase Letter
ValueCountFrequency (%)
d 1
50.0%
t 1
50.0%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
2 1
50.0%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
, 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 330
100.0%
Close Punctuation
ValueCountFrequency (%)
) 329
100.0%
Space Separator
ValueCountFrequency (%)
232
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4341
82.3%
Common 895
 
17.0%
Han 26
 
0.5%
Latin 11
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
589
 
13.6%
287
 
6.6%
259
 
6.0%
256
 
5.9%
234
 
5.4%
201
 
4.6%
114
 
2.6%
69
 
1.6%
67
 
1.5%
67
 
1.5%
Other values (266) 2198
50.6%
Han
ValueCountFrequency (%)
3
11.5%
3
11.5%
3
11.5%
3
11.5%
3
11.5%
3
11.5%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (4) 4
15.4%
Latin
ValueCountFrequency (%)
d 1
9.1%
t 1
9.1%
S 1
9.1%
Y 1
9.1%
E 1
9.1%
N 1
9.1%
C 1
9.1%
L 1
9.1%
T 1
9.1%
D 1
9.1%
Common
ValueCountFrequency (%)
( 330
36.9%
) 329
36.8%
232
25.9%
1 1
 
0.1%
2 1
 
0.1%
. 1
 
0.1%
, 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4341
82.3%
ASCII 906
 
17.2%
CJK 25
 
0.5%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
589
 
13.6%
287
 
6.6%
259
 
6.0%
256
 
5.9%
234
 
5.4%
201
 
4.6%
114
 
2.6%
69
 
1.6%
67
 
1.5%
67
 
1.5%
Other values (266) 2198
50.6%
ASCII
ValueCountFrequency (%)
( 330
36.4%
) 329
36.3%
232
25.6%
d 1
 
0.1%
t 1
 
0.1%
S 1
 
0.1%
1 1
 
0.1%
2 1
 
0.1%
Y 1
 
0.1%
E 1
 
0.1%
Other values (8) 8
 
0.9%
CJK
ValueCountFrequency (%)
3
12.0%
3
12.0%
3
12.0%
3
12.0%
3
12.0%
3
12.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (3) 3
12.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-12T22:09:22.362498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:09:25.295218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계약금액(원)입찰방법
계약금액(원)1.0000.426
입찰방법0.4261.000
2023-12-12T22:09:25.399615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계약금액(원)입찰방법
계약금액(원)1.0000.170
입찰방법0.1701.000

Missing values

2023-12-12T22:09:22.486975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:09:22.601302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T22:09:22.722573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

계약명계약금액(원)계약시작일계약종료일입찰방법계약대상
0빈집활용사업 4차 철거공사17415699402022-08-012023-10-05일반경쟁주식회사 청하건설
1은평 문화공원3 문화재 보수정비공사15105710002022-07-282025-07-26제한경쟁삼부토건 주식회사
2중계4단지 수도꼭지 교체공사10434370302022-07-222022-12-16제한경쟁상지이앤씨 (주)
3중계3단지 수도꼭지 교체공사12089106902022-07-222022-12-16제한경쟁주식회사 엑타
4신월동 460-11 공동체주택 건설공사10172126802022-06-272023-05-13제한경쟁주식회사 디엔아이컨스트럭션
5고덕강일공공주택지구 2BL 제로에너지아파트 소방(전기)공사19959511902022-06-242024-10-02제한경쟁주식회사 창승전력
6고덕강일공공주택지구 2BL 제로에너지아파트 정보통신공사57035839402022-06-242024-10-02제한경쟁주식회사 온리정보통신
7고덕강일공공주택지구 2BL 제로에너지아파트 전기공사96801380002022-06-242024-10-02제한경쟁대일전기 주식회사
8답십리 제17구역 주택재개발정비사업 아파트 건설공사581493500002022-05-232022-12-19수의계약주식회사 삼호
9방학동 313-738번지 공공리모델링 공사20245549702022-04-042023-02-16제한경쟁(주)에스앤비건설
계약명계약금액(원)계약시작일계약종료일입찰방법계약대상
5932023년 임대아파트 인터폰 교체공사(1권역)14066077602023-04-062023-12-01제한경쟁삼부건설 주식회사
594고덕강일 공공주택지구 3단지 아파트 건설공사2610000000002023-05-122026-11-06제한경쟁(주)합동전자산업
5952023년 임대아파트 도로 및 보도교체공사19115338002023-05-122023-11-25제한경쟁화성산업주식회사
596신내10단지 화장실 바닥타일, 위생기구 악세사리 교체공사14389600002023-05-222023-11-17제한경쟁동도건설 주식회사
597강일육교 및 동부간선도로(좌안) 진출램프교 보수보강공사12526667002023-05-232023-08-25제한경쟁주식회사 삼영건설
5982023년도 구로두산 외 9개단지 세대분전반 내 차단기 교체공사11213816802023-06-282023-11-30제한경쟁하웅종합건설(주)
5992023년도 옥수삼성 외 4개단지 세대분전반 내 차단기 교체공사10776892002023-06-282023-11-30제한경쟁주식회사 재연이엔씨
6002023년도 돈암풍림 외 8개단지 세대분전반 내 차단기 교체공사12548159002023-06-292023-11-30제한경쟁주식회사 남일기업
6012022,2023년 취약계층 에너지복지사업 LED등기구 교체공사(면목,가양4)12911629002023-06-292023-11-30제한경쟁주식회사 제이에스파워텍
602<NA><NA><NA><NA><NA>주식회사 나원이엔씨