Overview

Dataset statistics

Number of variables7
Number of observations176
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.1 KiB
Average record size in memory58.8 B

Variable types

Numeric2
Categorical2
Text2
DateTime1

Dataset

Description경기도 하남시 2023년 공사 관련 계약현황, 계약금, 계약금액, 계약일, 업체명, 공사당시 기준 단가, 관련정보에 대한 내용을 제공합니다.
URLhttps://www.data.go.kr/data/15050143/fileData.do

Alerts

구분 has constant value ""Constant
부서명 is highly imbalanced (93.6%)Imbalance
번호 has unique valuesUnique
계약명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:31:45.337522
Analysis finished2023-12-12 19:31:46.262309
Duration0.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct176
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean88.5
Minimum1
Maximum176
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-13T04:31:46.343972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile9.75
Q144.75
median88.5
Q3132.25
95-th percentile167.25
Maximum176
Range175
Interquartile range (IQR)87.5

Descriptive statistics

Standard deviation50.950957
Coefficient of variation (CV)0.57571703
Kurtosis-1.2
Mean88.5
Median Absolute Deviation (MAD)44
Skewness0
Sum15576
Variance2596
MonotonicityStrictly increasing
2023-12-13T04:31:46.488670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
90 1
 
0.6%
114 1
 
0.6%
115 1
 
0.6%
116 1
 
0.6%
117 1
 
0.6%
118 1
 
0.6%
119 1
 
0.6%
120 1
 
0.6%
121 1
 
0.6%
Other values (166) 166
94.3%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
176 1
0.6%
175 1
0.6%
174 1
0.6%
173 1
0.6%
172 1
0.6%
171 1
0.6%
170 1
0.6%
169 1
0.6%
168 1
0.6%
167 1
0.6%

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
공사
176 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공사
2nd row공사
3rd row공사
4th row공사
5th row공사

Common Values

ValueCountFrequency (%)
공사 176
100.0%

Length

2023-12-13T04:31:46.623733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:31:46.729342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사 176
100.0%

부서명
Categorical

IMBALANCE 

Distinct3
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
본청
174 
친환경사업소
 
1
신장2동
 
1

Length

Max length6
Median length2
Mean length2.0340909
Min length2

Unique

Unique2 ?
Unique (%)1.1%

Sample

1st row본청
2nd row본청
3rd row본청
4th row본청
5th row본청

Common Values

ValueCountFrequency (%)
본청 174
98.9%
친환경사업소 1
 
0.6%
신장2동 1
 
0.6%

Length

2023-12-13T04:31:46.841186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:31:46.961078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
본청 174
98.9%
친환경사업소 1
 
0.6%
신장2동 1
 
0.6%

계약명
Text

UNIQUE 

Distinct176
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-13T04:31:47.187835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length31
Mean length24.488636
Min length9

Characters and Unicode

Total characters4310
Distinct characters271
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique176 ?
Unique (%)100.0%

Sample

1st row덕풍다목적체육관 리모델링 및 방수공사(건축,기계)
2nd row2023년 하남초 어린이보호구역 개선사업(하남형 스쿨존) 공사
3rd row시립빛나는하남어린이집 그린리모델링 공사
4th row빛으로 행복한 벚꽃길 명소개발 사업(3차)
5th row저단형 현수막 게시대 상판보수 공사 계약 건의
ValueCountFrequency (%)
2023년 106
 
13.4%
공사 39
 
4.9%
수목관리공사(단가계약 25
 
3.2%
19
 
2.4%
17
 
2.2%
공사(단가계약 13
 
1.6%
관내 11
 
1.4%
보수공사 9
 
1.1%
구역 9
 
1.1%
가로수관리공사(단가계약 8
 
1.0%
Other values (315) 533
67.6%
2023-12-13T04:31:47.605774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
613
 
14.2%
2 242
 
5.6%
221
 
5.1%
212
 
4.9%
3 121
 
2.8%
0 114
 
2.6%
108
 
2.5%
101
 
2.3%
) 100
 
2.3%
( 100
 
2.3%
Other values (261) 2378
55.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2939
68.2%
Space Separator 613
 
14.2%
Decimal Number 515
 
11.9%
Close Punctuation 100
 
2.3%
Open Punctuation 100
 
2.3%
Uppercase Letter 22
 
0.5%
Other Punctuation 11
 
0.3%
Dash Punctuation 5
 
0.1%
Lowercase Letter 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
221
 
7.5%
212
 
7.2%
108
 
3.7%
101
 
3.4%
93
 
3.2%
91
 
3.1%
84
 
2.9%
78
 
2.7%
75
 
2.6%
67
 
2.3%
Other values (235) 1809
61.6%
Decimal Number
ValueCountFrequency (%)
2 242
47.0%
3 121
23.5%
0 114
22.1%
1 19
 
3.7%
6 7
 
1.4%
5 4
 
0.8%
9 3
 
0.6%
4 3
 
0.6%
8 2
 
0.4%
Uppercase Letter
ValueCountFrequency (%)
C 8
36.4%
V 4
18.2%
T 4
18.2%
B 2
 
9.1%
L 2
 
9.1%
R 1
 
4.5%
A 1
 
4.5%
Lowercase Letter
ValueCountFrequency (%)
c 2
40.0%
v 1
20.0%
a 1
20.0%
t 1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 9
81.8%
· 2
 
18.2%
Space Separator
ValueCountFrequency (%)
613
100.0%
Close Punctuation
ValueCountFrequency (%)
) 100
100.0%
Open Punctuation
ValueCountFrequency (%)
( 100
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2939
68.2%
Common 1344
31.2%
Latin 27
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
221
 
7.5%
212
 
7.2%
108
 
3.7%
101
 
3.4%
93
 
3.2%
91
 
3.1%
84
 
2.9%
78
 
2.7%
75
 
2.6%
67
 
2.3%
Other values (235) 1809
61.6%
Common
ValueCountFrequency (%)
613
45.6%
2 242
 
18.0%
3 121
 
9.0%
0 114
 
8.5%
) 100
 
7.4%
( 100
 
7.4%
1 19
 
1.4%
, 9
 
0.7%
6 7
 
0.5%
- 5
 
0.4%
Other values (5) 14
 
1.0%
Latin
ValueCountFrequency (%)
C 8
29.6%
V 4
14.8%
T 4
14.8%
c 2
 
7.4%
B 2
 
7.4%
L 2
 
7.4%
v 1
 
3.7%
R 1
 
3.7%
a 1
 
3.7%
A 1
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2939
68.2%
ASCII 1369
31.8%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
613
44.8%
2 242
 
17.7%
3 121
 
8.8%
0 114
 
8.3%
) 100
 
7.3%
( 100
 
7.3%
1 19
 
1.4%
, 9
 
0.7%
C 8
 
0.6%
6 7
 
0.5%
Other values (15) 36
 
2.6%
Hangul
ValueCountFrequency (%)
221
 
7.5%
212
 
7.2%
108
 
3.7%
101
 
3.4%
93
 
3.2%
91
 
3.1%
84
 
2.9%
78
 
2.7%
75
 
2.6%
67
 
2.3%
Other values (235) 1809
61.6%
None
ValueCountFrequency (%)
· 2
100.0%

계약금액
Real number (ℝ)

Distinct175
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.4865426 × 108
Minimum5280000
Maximum1.2183882 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2023-12-13T04:31:47.749123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5280000
5-th percentile34875335
Q176077435
median96197190
Q31.781075 × 108
95-th percentile6.775142 × 108
Maximum1.2183882 × 1010
Range1.2178602 × 1010
Interquartile range (IQR)1.0203007 × 108

Descriptive statistics

Standard deviation9.3693794 × 108
Coefficient of variation (CV)3.7680349
Kurtosis152.74813
Mean2.4865426 × 108
Median Absolute Deviation (MAD)38448500
Skewness12.003109
Sum4.376315 × 1010
Variance8.778527 × 1017
MonotonicityNot monotonic
2023-12-13T04:31:47.911500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
87153160 2
 
1.1%
54190660 1
 
0.6%
92720450 1
 
0.6%
82702600 1
 
0.6%
260780370 1
 
0.6%
84994820 1
 
0.6%
50853600 1
 
0.6%
83121420 1
 
0.6%
65604700 1
 
0.6%
172246930 1
 
0.6%
Other values (165) 165
93.8%
ValueCountFrequency (%)
5280000 1
0.6%
10280000 1
0.6%
20900000 1
0.6%
23675920 1
0.6%
27884000 1
0.6%
31207600 1
0.6%
31207630 1
0.6%
31367300 1
0.6%
32251340 1
0.6%
35750000 1
0.6%
ValueCountFrequency (%)
12183882400 1
0.6%
1764295550 1
0.6%
1596565050 1
0.6%
1407123150 1
0.6%
932538500 1
0.6%
826255640 1
0.6%
774605990 1
0.6%
682088910 1
0.6%
680912100 1
0.6%
676381560 1
0.6%
Distinct81
Distinct (%)46.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
Minimum2022-12-01 00:00:00
Maximum2023-08-07 00:00:00
2023-12-13T04:31:48.349801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:31:48.476907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct114
Distinct (%)64.8%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2023-12-13T04:31:48.702674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length13
Mean length7.9943182
Min length3

Characters and Unicode

Total characters1407
Distinct characters157
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)47.2%

Sample

1st row(주)다인이엔씨
2nd row흥진통신(주)
3rd row(주)퍼팩트건축
4th row서우전기
5th row하남광고공사(주)
ValueCountFrequency (%)
주식회사 60
25.3%
주)규림조경 6
 
2.5%
한서엘앤디 5
 
2.1%
한아름조경 4
 
1.7%
하남광고공사(주 4
 
1.7%
주)유니드건설 4
 
1.7%
흥진통신(주 4
 
1.7%
우리조경 4
 
1.7%
주)더봄 4
 
1.7%
비경 3
 
1.3%
Other values (106) 139
58.6%
2023-12-13T04:31:49.120279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
159
 
11.3%
( 99
 
7.0%
) 99
 
7.0%
71
 
5.0%
61
 
4.3%
61
 
4.3%
61
 
4.3%
48
 
3.4%
42
 
3.0%
36
 
2.6%
Other values (147) 670
47.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1148
81.6%
Open Punctuation 99
 
7.0%
Close Punctuation 99
 
7.0%
Space Separator 61
 
4.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
159
 
13.9%
71
 
6.2%
61
 
5.3%
61
 
5.3%
48
 
4.2%
42
 
3.7%
36
 
3.1%
30
 
2.6%
20
 
1.7%
20
 
1.7%
Other values (144) 600
52.3%
Open Punctuation
ValueCountFrequency (%)
( 99
100.0%
Close Punctuation
ValueCountFrequency (%)
) 99
100.0%
Space Separator
ValueCountFrequency (%)
61
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1148
81.6%
Common 259
 
18.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
159
 
13.9%
71
 
6.2%
61
 
5.3%
61
 
5.3%
48
 
4.2%
42
 
3.7%
36
 
3.1%
30
 
2.6%
20
 
1.7%
20
 
1.7%
Other values (144) 600
52.3%
Common
ValueCountFrequency (%)
( 99
38.2%
) 99
38.2%
61
23.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1148
81.6%
ASCII 259
 
18.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
159
 
13.9%
71
 
6.2%
61
 
5.3%
61
 
5.3%
48
 
4.2%
42
 
3.7%
36
 
3.1%
30
 
2.6%
20
 
1.7%
20
 
1.7%
Other values (144) 600
52.3%
ASCII
ValueCountFrequency (%)
( 99
38.2%
) 99
38.2%
61
23.6%

Interactions

2023-12-13T04:31:45.856217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:31:45.679750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:31:45.961013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:31:45.767141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:31:49.255121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호부서명계약금액계약일
번호1.0000.0830.1900.997
부서명0.0831.0000.0001.000
계약금액0.1900.0001.0000.945
계약일0.9971.0000.9451.000
2023-12-13T04:31:49.393071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호계약금액부서명
번호1.0000.0600.019
계약금액0.0601.0000.000
부서명0.0190.0001.000

Missing values

2023-12-13T04:31:46.099975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:31:46.215785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호구분부서명계약명계약금액계약일업체명
01공사본청덕풍다목적체육관 리모델링 및 방수공사(건축,기계)541906602023-08-07(주)다인이엔씨
12공사본청2023년 하남초 어린이보호구역 개선사업(하남형 스쿨존) 공사1780301002023-08-03흥진통신(주)
23공사본청시립빛나는하남어린이집 그린리모델링 공사1289666602023-07-27(주)퍼팩트건축
34공사본청빛으로 행복한 벚꽃길 명소개발 사업(3차)966753402023-07-27서우전기
45공사본청저단형 현수막 게시대 상판보수 공사 계약 건의102800002023-07-25하남광고공사(주)
56공사본청감일복합커뮤니티센터 건립 통신공사5182921802023-07-25청석전기(주)
67공사본청감일복합커뮤니티센터 건립 전기공사15965650502023-07-25주식회사 플랭클린전력
78공사본청2023년 주민불편사항 보도정비공사(2구역)4675207202023-07-24성지건설(주)
89공사본청2023년 동부초 어린이보호구역 개선사업(하남형 스쿨존) 공사1783397202023-07-20성현산업건설 주식회사
910공사본청시민 이동편의 증진 교통 개선사업 공사1788593202023-07-19성현산업건설 주식회사
번호구분부서명계약명계약금액계약일업체명
166167공사본청2023년 버스정류소 전기설비 유지보수공사881379202022-12-06주식회사 라온텍
167168공사본청감일지구 공영주차장(주6부지) 조성공사(통신)236759202022-12-05진아이티(주)
168169공사본청덕풍초등학교 통학로 확장 및 덕풍동360-2번지 경사로 설치공사1112130002022-12-05(주)영신디엔씨
169170공사본청2023년 원도심 등 공원녹지대 전기시설 유지관리공사(연간단가)636747802022-12-05현대전기
170171공사본청산곡천 산책로·자전거도로 조성공사4446310002022-12-05홍성종합건설(주)
171172공사본청어린이보호구역 종합 정비사업 공사2080608002022-12-05동아차선(주)
172173공사본청감일지구 공영주차장(주6부지) 조성공사(토목)2404610002022-12-02(주)성건
173174공사본청2023년 미사지구 공원녹지대 전기시설 유지관리공사(연간단가)763345802022-12-02(주)청한
174175공사본청2022년 가로수 등 수목월동관리 공사312076302022-12-01정원팩토리(주)
175176공사본청하남종합운동장 국민체육센터 교통영향평가 토목 공사416130002022-12-01다울건설 주식회사