Overview

Dataset statistics

Number of variables8
Number of observations63
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.2 KiB
Average record size in memory68.1 B

Variable types

Numeric2
Categorical2
Text2
DateTime2

Dataset

Description경기도 하남시에서 제공하는 계약명, 계약금액, 계약일, 업체명, 데이터기준일자 등 공사 하도급 계약현황을 보여주는 정보입니다.
URLhttps://www.data.go.kr/data/15050142/fileData.do

Alerts

구분 has constant value ""Constant
데이터기준일자 has constant value ""Constant
부서명 is highly imbalanced (79.8%)Imbalance
연번 has unique valuesUnique
계약금액 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:43:34.616448
Analysis finished2023-12-12 01:43:36.100296
Duration1.48 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct63
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32
Minimum1
Maximum63
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size699.0 B
2023-12-12T10:43:36.210557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.1
Q116.5
median32
Q347.5
95-th percentile59.9
Maximum63
Range62
Interquartile range (IQR)31

Descriptive statistics

Standard deviation18.330303
Coefficient of variation (CV)0.57282196
Kurtosis-1.2
Mean32
Median Absolute Deviation (MAD)16
Skewness0
Sum2016
Variance336
MonotonicityStrictly increasing
2023-12-12T10:43:36.398919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.6%
2 1
 
1.6%
35 1
 
1.6%
36 1
 
1.6%
37 1
 
1.6%
38 1
 
1.6%
39 1
 
1.6%
40 1
 
1.6%
41 1
 
1.6%
42 1
 
1.6%
Other values (53) 53
84.1%
ValueCountFrequency (%)
1 1
1.6%
2 1
1.6%
3 1
1.6%
4 1
1.6%
5 1
1.6%
6 1
1.6%
7 1
1.6%
8 1
1.6%
9 1
1.6%
10 1
1.6%
ValueCountFrequency (%)
63 1
1.6%
62 1
1.6%
61 1
1.6%
60 1
1.6%
59 1
1.6%
58 1
1.6%
57 1
1.6%
56 1
1.6%
55 1
1.6%
54 1
1.6%

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size636.0 B
공사
63 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공사
2nd row공사
3rd row공사
4th row공사
5th row공사

Common Values

ValueCountFrequency (%)
공사 63
100.0%

Length

2023-12-12T10:43:36.575152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:43:36.706043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사 63
100.0%

부서명
Categorical

IMBALANCE 

Distinct5
Distinct (%)7.9%
Missing0
Missing (%)0.0%
Memory size636.0 B
본청
59 
감북동
 
1
친환경사업소
 
1
보건소
 
1
평생교육원
 
1

Length

Max length6
Median length2
Mean length2.1428571
Min length2

Unique

Unique4 ?
Unique (%)6.3%

Sample

1st row본청
2nd row본청
3rd row본청
4th row본청
5th row본청

Common Values

ValueCountFrequency (%)
본청 59
93.7%
감북동 1
 
1.6%
친환경사업소 1
 
1.6%
보건소 1
 
1.6%
평생교육원 1
 
1.6%

Length

2023-12-12T10:43:36.846119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:43:37.002070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
본청 59
93.7%
감북동 1
 
1.6%
친환경사업소 1
 
1.6%
보건소 1
 
1.6%
평생교육원 1
 
1.6%
Distinct62
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size636.0 B
2023-12-12T10:43:37.287568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length27
Mean length19.888889
Min length8

Characters and Unicode

Total characters1253
Distinct characters186
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique61 ?
Unique (%)96.8%

Sample

1st row감일~초이간 광역도로 개설공사
2nd row천현동 주민센터~국일에너지간 도로개설공사
3rd row하남 이성산성 주변 경관광장 조성공사
4th row팔당대교 내진성능 보강공사
5th row덕풍동 도서관 리모델링 건축 공사
ValueCountFrequency (%)
하남시 12
 
5.0%
건축공사 12
 
5.0%
공사 10
 
4.2%
보수공사 6
 
2.5%
조성공사 5
 
2.1%
기계 5
 
2.1%
증축 4
 
1.7%
공원녹지대 4
 
1.7%
개설공사 4
 
1.7%
조경 4
 
1.7%
Other values (137) 172
72.3%
2023-12-12T10:43:37.735203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
175
 
14.0%
84
 
6.7%
78
 
6.2%
35
 
2.8%
32
 
2.6%
23
 
1.8%
, 23
 
1.8%
22
 
1.8%
) 21
 
1.7%
( 21
 
1.7%
Other values (176) 739
59.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 961
76.7%
Space Separator 175
 
14.0%
Decimal Number 41
 
3.3%
Other Punctuation 27
 
2.2%
Close Punctuation 21
 
1.7%
Open Punctuation 21
 
1.7%
Dash Punctuation 3
 
0.2%
Math Symbol 2
 
0.2%
Uppercase Letter 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
84
 
8.7%
78
 
8.1%
35
 
3.6%
32
 
3.3%
23
 
2.4%
22
 
2.3%
21
 
2.2%
20
 
2.1%
18
 
1.9%
18
 
1.9%
Other values (160) 610
63.5%
Decimal Number
ValueCountFrequency (%)
1 12
29.3%
2 10
24.4%
0 7
17.1%
8 4
 
9.8%
3 4
 
9.8%
5 3
 
7.3%
9 1
 
2.4%
Other Punctuation
ValueCountFrequency (%)
, 23
85.2%
· 4
 
14.8%
Uppercase Letter
ValueCountFrequency (%)
C 1
50.0%
I 1
50.0%
Space Separator
ValueCountFrequency (%)
175
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 961
76.7%
Common 290
 
23.1%
Latin 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
84
 
8.7%
78
 
8.1%
35
 
3.6%
32
 
3.3%
23
 
2.4%
22
 
2.3%
21
 
2.2%
20
 
2.1%
18
 
1.9%
18
 
1.9%
Other values (160) 610
63.5%
Common
ValueCountFrequency (%)
175
60.3%
, 23
 
7.9%
) 21
 
7.2%
( 21
 
7.2%
1 12
 
4.1%
2 10
 
3.4%
0 7
 
2.4%
8 4
 
1.4%
3 4
 
1.4%
· 4
 
1.4%
Other values (4) 9
 
3.1%
Latin
ValueCountFrequency (%)
C 1
50.0%
I 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 961
76.7%
ASCII 288
 
23.0%
None 4
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
175
60.8%
, 23
 
8.0%
) 21
 
7.3%
( 21
 
7.3%
1 12
 
4.2%
2 10
 
3.5%
0 7
 
2.4%
8 4
 
1.4%
3 4
 
1.4%
- 3
 
1.0%
Other values (5) 8
 
2.8%
Hangul
ValueCountFrequency (%)
84
 
8.7%
78
 
8.1%
35
 
3.6%
32
 
3.3%
23
 
2.4%
22
 
2.3%
21
 
2.2%
20
 
2.1%
18
 
1.9%
18
 
1.9%
Other values (160) 610
63.5%
None
ValueCountFrequency (%)
· 4
100.0%

계약금액
Real number (ℝ)

UNIQUE 

Distinct63
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.9941331 × 109
Minimum24062000
Maximum2.0208155 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size699.0 B
2023-12-12T10:43:37.906497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum24062000
5-th percentile67068300
Q12.613461 × 108
median9.9809388 × 108
Q33.7274695 × 109
95-th percentile1.1902981 × 1010
Maximum2.0208155 × 1010
Range2.0184093 × 1010
Interquartile range (IQR)3.4661234 × 109

Descriptive statistics

Standard deviation4.2877914 × 109
Coefficient of variation (CV)1.4320644
Kurtosis3.8397472
Mean2.9941331 × 109
Median Absolute Deviation (MAD)8.7168408 × 108
Skewness1.9663808
Sum1.8863038 × 1011
Variance1.8385155 × 1019
MonotonicityNot monotonic
2023-12-12T10:43:38.111506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12835042000 1
 
1.6%
2780452440 1
 
1.6%
1998260000 1
 
1.6%
753842770 1
 
1.6%
925976400 1
 
1.6%
3913661370 1
 
1.6%
8781030000 1
 
1.6%
190809300 1
 
1.6%
1597540000 1
 
1.6%
66957000 1
 
1.6%
Other values (53) 53
84.1%
ValueCountFrequency (%)
24062000 1
1.6%
53379990 1
1.6%
65208000 1
1.6%
66957000 1
1.6%
68070000 1
1.6%
71643000 1
1.6%
81130000 1
1.6%
99407000 1
1.6%
126409800 1
1.6%
134453620 1
1.6%
ValueCountFrequency (%)
20208155000 1
1.6%
14404500000 1
1.6%
12835042000 1
1.6%
12123402300 1
1.6%
9919189000 1
1.6%
9851251300 1
1.6%
9352496000 1
1.6%
8901035000 1
1.6%
8781030000 1
1.6%
8213287100 1
1.6%
Distinct59
Distinct (%)93.7%
Missing0
Missing (%)0.0%
Memory size636.0 B
Minimum2012-01-11 00:00:00
Maximum2023-02-22 00:00:00
2023-12-12T10:43:38.268677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:43:38.429476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct60
Distinct (%)95.2%
Missing0
Missing (%)0.0%
Memory size636.0 B
2023-12-12T10:43:38.719861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length8.4920635
Min length4

Characters and Unicode

Total characters535
Distinct characters104
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)90.5%

Sample

1st row한라산업개발(주)
2nd row(주)영신
3rd row(주)성토건설
4th row(주)벽산제일건설
5th row(주)통합건설
ValueCountFrequency (%)
주식회사 12
 
16.0%
서진티씨주식회사 2
 
2.7%
하나건설 2
 
2.7%
스페이스대건종합건설(주 2
 
2.7%
도성종합건설(주 1
 
1.3%
광성에스씨건설 1
 
1.3%
동명기산(주 1
 
1.3%
유일건설 1
 
1.3%
주)건양이엔지 1
 
1.3%
주)대지건설 1
 
1.3%
Other values (51) 51
68.0%
2023-12-12T10:43:39.533406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
62
 
11.6%
43
 
8.0%
) 41
 
7.7%
( 41
 
7.7%
38
 
7.1%
20
 
3.7%
20
 
3.7%
20
 
3.7%
18
 
3.4%
17
 
3.2%
Other values (94) 215
40.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 441
82.4%
Close Punctuation 41
 
7.7%
Open Punctuation 41
 
7.7%
Space Separator 12
 
2.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62
 
14.1%
43
 
9.8%
38
 
8.6%
20
 
4.5%
20
 
4.5%
20
 
4.5%
18
 
4.1%
17
 
3.9%
10
 
2.3%
9
 
2.0%
Other values (91) 184
41.7%
Close Punctuation
ValueCountFrequency (%)
) 41
100.0%
Open Punctuation
ValueCountFrequency (%)
( 41
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 441
82.4%
Common 94
 
17.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62
 
14.1%
43
 
9.8%
38
 
8.6%
20
 
4.5%
20
 
4.5%
20
 
4.5%
18
 
4.1%
17
 
3.9%
10
 
2.3%
9
 
2.0%
Other values (91) 184
41.7%
Common
ValueCountFrequency (%)
) 41
43.6%
( 41
43.6%
12
 
12.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 441
82.4%
ASCII 94
 
17.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
62
 
14.1%
43
 
9.8%
38
 
8.6%
20
 
4.5%
20
 
4.5%
20
 
4.5%
18
 
4.1%
17
 
3.9%
10
 
2.3%
9
 
2.0%
Other values (91) 184
41.7%
ASCII
ValueCountFrequency (%)
) 41
43.6%
( 41
43.6%
12
 
12.8%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size636.0 B
Minimum2023-08-17 00:00:00
Maximum2023-08-17 00:00:00
2023-12-12T10:43:39.699039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:43:39.816864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T10:43:35.499312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:43:35.242569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:43:35.631434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:43:35.354843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:43:39.898349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번부서명계약명계약금액계약일업체명
연번1.0000.2671.0000.0000.9880.945
부서명0.2671.0001.0000.0001.0000.000
계약명1.0001.0001.0000.0001.0001.000
계약금액0.0000.0000.0001.0000.9470.818
계약일0.9881.0001.0000.9471.0000.992
업체명0.9450.0001.0000.8180.9921.000
2023-12-12T10:43:40.033709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번계약금액부서명
연번1.0000.1370.097
계약금액0.1371.0000.000
부서명0.0970.0001.000

Missing values

2023-12-12T10:43:35.855944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:43:36.031421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구분부서명계약명계약금액계약일업체명데이터기준일자
01공사본청감일~초이간 광역도로 개설공사128350420002012-01-11한라산업개발(주)2023-08-17
12공사본청천현동 주민센터~국일에너지간 도로개설공사27804524402014-01-22(주)영신2023-08-17
23공사본청하남 이성산성 주변 경관광장 조성공사5894460002014-11-27(주)성토건설2023-08-17
34공사본청팔당대교 내진성능 보강공사35729170002015-05-12(주)벽산제일건설2023-08-17
45공사본청덕풍동 도서관 리모델링 건축 공사6204294902016-03-02(주)통합건설2023-08-17
56공사본청하남시 장애인복지관 건축(토목, 기계, 조경 포함) 공사89010350002016-03-31원하종합건설(주)2023-08-17
67공사본청미사동 자전거 경사로 및 소로개설공사(토목공사)3261000002016-04-05(주)경복개발2023-08-17
78공사본청미사1통 경로당 신축공사(건축)4578789302016-04-05삼립건설(주)2023-08-17
89공사본청미사1동 주민센터 신축 건축공사35033000002016-08-26성지종합건설(주)2023-08-17
910공사본청다목적복지회관 증축공사(건축, 기계)4312000002016-09-22석진건설(주)2023-08-17
연번구분부서명계약명계약금액계약일업체명데이터기준일자
5354공사본청하남시 종합복지타운 건립 건축공사121234023002021-12-27주식회사오렌지이앤씨2023-08-17
5455공사본청당정근린공원 조성공사(조경, 토목, 건축)34482910002022-01-26비엠에스건설 주식회사2023-08-17
5556공사본청하남시 감일공공복합청사 건립 건축공사82132871002022-03-02주식회사 제아씨앤씨2023-08-17
5657공사본청천현동 꿈나무공원 공영주차장(지하) 조성공사56985967202022-03-24건웅종합건설 주식회사2023-08-17
5758공사본청하남시 시민행복센터 건립 건축공사144045000002022-04-11거송종합건설(주)2023-08-17
5859공사본청선동IC 확장·개선 공사21747890002022-04-13해오름종합건설주식회사2023-08-17
5960공사본청감북동 가무나리 마을진입로 개설공사11519150002022-05-02(주)부원종합건설2023-08-17
6061공사본청신장동 주민참여형 가로환경개선사업 공사9980938802022-06-07청도건설 주식회사2023-08-17
6162공사본청덕풍근린공원 제2공영주차장 보수·보강 공사10516970002022-11-09삼대종합개발(주)2023-08-17
6263공사본청하남시 산곡3교 보수공사4973320002023-02-22드림네트웍스(주)2023-08-17