Overview

Dataset statistics

Number of variables: 7
Number of observations: 145
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.3 KiB
Average record size in memory: 58.9 B

Variable types

Numeric: 2
Categorical: 2
Text: 2
DateTime: 1

Dataset

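Description: Status of construction subcontracts awarded by Boryeong City (보령시), covering office name (관서명), contract method (계약방법), contract name (계약명), contract amount (계약금액), contract date (계약일), and contract counterparty (계약상대자).
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=327&beforeMenuCd=DOM_000000201001001000&publicdatapk=15090095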

Alerts

관서명 is highly imbalanced (92.5%) [Imbalance]
번호 has unique values [Unique]
계약명 has unique values [Unique]
계약금액 has unique values [Unique]
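
The 92.5% imbalance figure for 관서명 is consistent with a normalized-entropy score, 1 - H / log(k), where H is the Shannon entropy of the class frequencies and k is the number of classes. The sketch below assumes that definition (the report itself does not state the formula) and uses the class counts listed for 관서명 in this report; the function name imbalance_score is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/log(k): 0 for perfectly balanced classes, near 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumed convention: a single class is treated as not imbalanced
    # Shannon entropy of the observed class frequencies (natural log).
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    # Dividing by log(k) normalizes the entropy to the range [0, 1].
    return 1.0 - entropy / math.log(k)

# Class counts of 관서명 taken from this report: 본청 143, 보건소 1, 농업기술센터 1.
score = imbalance_score([143, 1, 1])
print(f"{score:.1%}")
```

Under this assumed definition the three counts give a score that rounds to the 92.5% shown in the alert.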

Reproduction

Analysis started: 2024-01-09 22:28:36.196828
Analysis finished: 2024-01-09 22:28:36.931397
Duration: 0.73 seconds
Software version: ydata-profiling v4.5.1
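
As a quick check, the reported duration can be recomputed from the two timestamps. This is only an arithmetic sketch using the values printed in this section; it does not rerun the analysis.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of this report.
started = datetime.fromisoformat("2024-01-09 22:28:36.196828")
finished = datetime.fromisoformat("2024-01-09 22:28:36.931397")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```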

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 145
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 73
Minimum: 1
Maximum: 145
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 8.2
Q1: 37
Median: 73
Q3: 109
95-th percentile: 137.8
Maximum: 145
Range: 144
Interquartile range (IQR): 72
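
The quantiles above are what linear interpolation between order statistics gives for a column holding each integer from 1 to 145 once. That the column is exactly this sequence is an assumption, though it is consistent with the reported count, minimum, maximum, and uniqueness. A minimal sketch with the standard library:

```python
import statistics

# Assumption: 번호 is the sequence 1, 2, ..., 145.
values = list(range(1, 146))

# method="inclusive" treats the data minimum and maximum as the 0th and
# 100th percentiles and interpolates linearly between sorted values.
ventiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

print(p5, q1, median, q3, p95, iqr)
```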

Descriptive statistics

Standard deviation: 42.001984
Coefficient of variation (CV): 0.57536964
Kurtosis: -1.2
Mean: 73
Median Absolute Deviation (MAD): 36
Skewness: 0
Sum: 10585
Variance: 1764.1667
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
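
The descriptive statistics for 번호 can be recomputed under the assumption that the column is the sequence 1 to 145 (an assumption, not stated in the report). The sketch uses the sample variance (n - 1 denominator) and defines the coefficient of variation as standard deviation divided by mean, which matches the reported figures.

```python
import statistics

# Assumption: 번호 is the sequence 1, 2, ..., 145.
values = list(range(1, 146))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the absolute distances to the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, round(variance, 4), round(std, 6), round(cv, 8), mad)
```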
Value | Count | Frequency (%)
1 | 1 | 0.7%
110 | 1 | 0.7%
94 | 1 | 0.7%
95 | 1 | 0.7%
96 | 1 | 0.7%
97 | 1 | 0.7%
98 | 1 | 0.7%
99 | 1 | 0.7%
100 | 1 | 0.7%
101 | 1 | 0.7%
Other values (135) | 135 | 93.1%

Value | Count | Frequency (%)
1 | 1 | 0.7%
2 | 1 | 0.7%
3 | 1 | 0.7%
4 | 1 | 0.7%
5 | 1 | 0.7%
6 | 1 | 0.7%
7 | 1 | 0.7%
8 | 1 | 0.7%
9 | 1 | 0.7%
10 | 1 | 0.7%

Value | Count | Frequency (%)
145 | 1 | 0.7%
144 | 1 | 0.7%
143 | 1 | 0.7%
142 | 1 | 0.7%
141 | 1 | 0.7%
140 | 1 | 0.7%
139 | 1 | 0.7%
138 | 1 | 0.7%
137 | 1 | 0.7%
136 | 1 | 0.7%

관서명
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB
본청: 143
보건소: 1
농업기술센터: 1

Length

Max length: 6
Median length: 2
Mean length: 2.0344828
Min length: 2

Unique

Unique: 2
Unique (%): 1.4%

Sample

1st row: 본청
2nd row: 본청
3rd row: 본청
4th row: 본청
5th row: 본청

Common Values

Value | Count | Frequency (%)
본청 | 143 | 98.6%
보건소 | 1 | 0.7%
농업기술센터 | 1 | 0.7%

Length

Histogram of lengths of the category


계약방법
Categorical

Distinct: 3
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB
제한경쟁: 98
수의2인이상견적: 35
일반경쟁: 12

Length

Max length: 8
Median length: 4
Mean length: 4.9655172
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 제한경쟁
2nd row: 수의2인이상견적
3rd row: 제한경쟁
4th row: 수의2인이상견적
5th row: 제한경쟁

Common Values

Value | Count | Frequency (%)
제한경쟁 | 98 | 67.6%
수의2인이상견적 | 35 | 24.1%
일반경쟁 | 12 | 8.3%

Length

Histogram of lengths of the category


계약명
Text

UNIQUE 

Distinct: 145
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 38
Median length: 29
Mean length: 20.324138
Min length: 9

Characters and Unicode

Total characters: 2947
Distinct characters: 264
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
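
To illustrate how such character properties can be tallied, the sketch below counts Unicode general categories for one sample value of 계약명 using Python's standard unicodedata module. It is an illustration of the idea only, not the profiler's own implementation; the helper name category_counts is hypothetical.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count Unicode general categories (e.g. 'Lo', 'Zs') over the characters of text."""
    # Normalize to NFC so each Hangul syllable is a single code point.
    normalized = unicodedata.normalize("NFC", text)
    return Counter(unicodedata.category(ch) for ch in normalized)

# First sample row of 계약명 from this report.
sample = "보령시 건설기계 공영주기장 조성(2차분)"
counts = category_counts(sample)

# 'Lo' = Other Letter, 'Zs' = Space Separator, 'Nd' = Decimal Number,
# 'Ps' = Open Punctuation, 'Pe' = Close Punctuation.
for category, count in counts.most_common():
    print(category, count)
```

Summing such counters over every row would give per-category totals of the kind listed under "Most occurring categories".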

Unique

Unique: 145
Unique (%): 100.0%

Sample

1st row: 보령시 건설기계 공영주기장 조성(2차분)
2nd row: 신설중앙공원 보완사업
3rd row: 시청사 민원동 건립공사(건축)(2차)
4th row: 박람회 임시주차장(제4주차장) 조성공사
5th row: 대천항 주차타워 조성사업(건축)
Value | Count | Frequency (%)
농어촌도로 | 13 | 2.8%
조성공사 | 13 | 2.8%
개설공사 | 11 | 2.4%
도시계획도로 | 10 | 2.1%
확포장공사 | 9 | 1.9%
정비공사 | 7 | 1.5%
 | 6 | 1.3%
증축공사(건축 | 5 | 1.1%
신축공사(건축 | 5 | 1.1%
정비사업 | 5 | 1.1%
Other values (298) | 383 | 82.0%

Most occurring characters

ValueCountFrequency (%)
322
 
10.9%
158
 
5.4%
134
 
4.5%
) 98
 
3.3%
( 98
 
3.3%
94
 
3.2%
60
 
2.0%
2 52
 
1.8%
51
 
1.7%
51
 
1.7%
Other values (254) 1829
62.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2240 | 76.0%
Space Separator | 322 | 10.9%
Decimal Number | 157 | 5.3%
Close Punctuation | 98 | 3.3%
Open Punctuation | 98 | 3.3%
Dash Punctuation | 16 | 0.5%
Other Punctuation | 9 | 0.3%
Math Symbol | 5 | 0.2%
Uppercase Letter | 2 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
158
 
7.1%
134
 
6.0%
94
 
4.2%
60
 
2.7%
51
 
2.3%
51
 
2.3%
50
 
2.2%
50
 
2.2%
48
 
2.1%
45
 
2.0%
Other values (234) 1499
66.9%
Decimal Number
Value | Count | Frequency (%)
2 | 52 | 33.1%
1 | 32 | 20.4%
3 | 20 | 12.7%
0 | 17 | 10.8%
6 | 11 | 7.0%
8 | 6 | 3.8%
4 | 6 | 3.8%
9 | 6 | 3.8%
5 | 4 | 2.5%
7 | 3 | 1.9%
Other Punctuation
Value | Count | Frequency (%)
, | 8 | 88.9%
/ | 1 | 11.1%
Math Symbol
Value | Count | Frequency (%)
~ | 4 | 80.0%
+ | 1 | 20.0%
Uppercase Letter
Value | Count | Frequency (%)
I | 1 | 50.0%
C | 1 | 50.0%
Space Separator
Value | Count | Frequency (%)
(space) | 322 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 98 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 98 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 16 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2240 | 76.0%
Common | 705 | 23.9%
Latin | 2 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
158
 
7.1%
134
 
6.0%
94
 
4.2%
60
 
2.7%
51
 
2.3%
51
 
2.3%
50
 
2.2%
50
 
2.2%
48
 
2.1%
45
 
2.0%
Other values (234) 1499
66.9%
Common
ValueCountFrequency (%)
322
45.7%
) 98
 
13.9%
( 98
 
13.9%
2 52
 
7.4%
1 32
 
4.5%
3 20
 
2.8%
0 17
 
2.4%
- 16
 
2.3%
6 11
 
1.6%
, 8
 
1.1%
Other values (8) 31
 
4.4%
Latin
ValueCountFrequency (%)
I 1
50.0%
C 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2240 | 76.0%
ASCII | 707 | 24.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
322
45.5%
) 98
 
13.9%
( 98
 
13.9%
2 52
 
7.4%
1 32
 
4.5%
3 20
 
2.8%
0 17
 
2.4%
- 16
 
2.3%
6 11
 
1.6%
, 8
 
1.1%
Other values (10) 33
 
4.7%
Hangul
ValueCountFrequency (%)
158
 
7.1%
134
 
6.0%
94
 
4.2%
60
 
2.7%
51
 
2.3%
51
 
2.3%
50
 
2.2%
50
 
2.2%
48
 
2.1%
45
 
2.0%
Other values (234) 1499
66.9%

계약금액
Real number (ℝ)

UNIQUE 

Distinct: 145
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.2084377 × 10^9
Minimum: 34490000
Maximum: 1.3025797 × 10^10
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.4 KiB

Quantile statistics

Minimum: 34490000
5-th percentile: 1.0266274 × 10^8
Q1: 2.80939 × 10^8
Median: 5.4588368 × 10^8
Q3: 1.274125 × 10^9
95-th percentile: 5.0795314 × 10^9
Maximum: 1.3025797 × 10^10
Range: 1.2991307 × 10^10
Interquartile range (IQR): 9.93186 × 10^8

Descriptive statistics

Standard deviation: 2.021315 × 10^9
Coefficient of variation (CV): 1.6726679
Kurtosis: 15.508485
Mean: 1.2084377 × 10^9
Median Absolute Deviation (MAD): 3.3856068 × 10^8
Skewness: 3.7577044
Sum: 1.7522347 × 10^11
Variance: 4.0857142 × 10^18
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
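
The summary figures for 계약금액 are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below uses only the rounded values printed in this report, so small differences in the last digits are expected; it does not have access to the underlying rows.

```python
# Rounded values copied from the 계약금액 statistics in this report.
n = 145
total = 1.7522347e11
std = 2.021315e9
q1, q3 = 2.80939e8, 1.274125e9
minimum, maximum = 34_490_000, 13_025_797_000

mean = total / n               # mean = sum / number of observations
cv = std / mean                # coefficient of variation = std / mean
variance = std ** 2            # variance = std squared
iqr = q3 - q1                  # interquartile range = Q3 - Q1
value_range = maximum - minimum

print(f"mean={mean:.7e} cv={cv:.7f} variance={variance:.7e}")
print(f"iqr={iqr:.5e} range={value_range:.7e}")
```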
Value | Count | Frequency (%)
545883680 | 1 | 0.7%
342680000 | 1 | 0.7%
290708000 | 1 | 0.7%
183027940 | 1 | 0.7%
1077279000 | 1 | 0.7%
11240400000 | 1 | 0.7%
150552000 | 1 | 0.7%
150226000 | 1 | 0.7%
81272000 | 1 | 0.7%
1924677000 | 1 | 0.7%
Other values (135) | 135 | 93.1%

Value | Count | Frequency (%)
34490000 | 1 | 0.7%
73964000 | 1 | 0.7%
81272000 | 1 | 0.7%
85200000 | 1 | 0.7%
87082530 | 1 | 0.7%
99753000 | 1 | 0.7%
102358000 | 1 | 0.7%
102424000 | 1 | 0.7%
103617700 | 1 | 0.7%
112624000 | 1 | 0.7%

Value | Count | Frequency (%)
13025797000 | 1 | 0.7%
11240400000 | 1 | 0.7%
9816380000 | 1 | 0.7%
9179100000 | 1 | 0.7%
8063556000 | 1 | 0.7%
6012500000 | 1 | 0.7%
5573179000 | 1 | 0.7%
5263478000 | 1 | 0.7%
4343745000 | 1 | 0.7%
4291994000 | 1 | 0.7%

계약일
DateTime

Distinct: 134
Distinct (%): 92.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB
Minimum: 2014-01-16 00:00:00
Maximum: 2022-07-05 00:00:00
Histogram with fixed size bins (bins=50)

계약상대자
Text

Distinct: 111
Distinct (%): 76.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 16
Median length: 12
Mean length: 8.5103448
Min length: 4

Characters and Unicode

Total characters: 1234
Distinct characters: 110
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 86
Unique (%): 59.3%

Sample

1st row: 지엠건설 주식회사
2nd row: (주)태광
3rd row: 길림개발 주식회사
4th row: 세원건설 주식회사
5th row: 대성건설주식회사
Value | Count | Frequency (%)
주식회사 | 43 | 22.8%
케이티씨건설 | 6 | 3.2%
서림종합건설(주 | 4 | 2.1%
대운건설(주 | 3 | 1.6%
영화종합건설 | 3 | 1.6%
지수종합건설 | 3 | 1.6%
청암건설(주 | 2 | 1.1%
지엠건설 | 2 | 1.1%
태성공영 | 2 | 1.1%
삼성건설 | 2 | 1.1%
Other values (103) | 119 | 63.0%

Most occurring characters

ValueCountFrequency (%)
135
 
10.9%
123
 
10.0%
113
 
9.2%
) 76
 
6.2%
( 76
 
6.2%
59
 
4.8%
59
 
4.8%
58
 
4.7%
50
 
4.1%
49
 
4.0%
Other values (100) 436
35.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1035 | 83.9%
Close Punctuation | 76 | 6.2%
Open Punctuation | 76 | 6.2%
Space Separator | 44 | 3.6%
Lowercase Letter | 3 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
135
 
13.0%
123
 
11.9%
113
 
10.9%
59
 
5.7%
59
 
5.7%
58
 
5.6%
50
 
4.8%
49
 
4.7%
32
 
3.1%
18
 
1.7%
Other values (94) 339
32.8%
Lowercase Letter
Value | Count | Frequency (%)
e | 1 | 33.3%
o | 1 | 33.3%
n | 1 | 33.3%
Close Punctuation
Value | Count | Frequency (%)
) | 76 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 76 | 100.0%
Space Separator
Value | Count | Frequency (%)
(space) | 44 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1035 | 83.9%
Common | 196 | 15.9%
Latin | 3 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
135
 
13.0%
123
 
11.9%
113
 
10.9%
59
 
5.7%
59
 
5.7%
58
 
5.6%
50
 
4.8%
49
 
4.7%
32
 
3.1%
18
 
1.7%
Other values (94) 339
32.8%
Common
ValueCountFrequency (%)
) 76
38.8%
( 76
38.8%
44
22.4%
Latin
ValueCountFrequency (%)
e 1
33.3%
o 1
33.3%
n 1
33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1035 | 83.9%
ASCII | 199 | 16.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
135
 
13.0%
123
 
11.9%
113
 
10.9%
59
 
5.7%
59
 
5.7%
58
 
5.6%
50
 
4.8%
49
 
4.7%
32
 
3.1%
18
 
1.7%
Other values (94) 339
32.8%
ASCII
ValueCountFrequency (%)
) 76
38.2%
( 76
38.2%
44
22.1%
e 1
 
0.5%
o 1
 
0.5%
n 1
 
0.5%

Interactions


Correlations

 | 번호 | 관서명 | 계약방법 | 계약금액
번호 | 1.000 | 0.038 | 0.468 | 0.274
관서명 | 0.038 | 1.000 | 0.431 | 0.000
계약방법 | 0.468 | 0.431 | 1.000 | 0.748
계약금액 | 0.274 | 0.000 | 0.748 | 1.000

 | 계약방법 | 관서명
계약방법 | 1.000 | 0.163
관서명 | 0.163 | 1.000

 | 번호 | 계약금액 | 관서명 | 계약방법
번호 | 1.000 | 0.007 | 0.006 | 0.311
계약금액 | 0.007 | 1.000 | 0.000 | 0.444
관서명 | 0.006 | 0.000 | 1.000 | 0.163
계약방법 | 0.311 | 0.444 | 0.163 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First 10 rows

index | 번호 | 관서명 | 계약방법 | 계약명 | 계약금액 | 계약일 | 계약상대자
0 | 1 | 본청 | 제한경쟁 | 보령시 건설기계 공영주기장 조성(2차분) | 545883680 | 2022-07-05 | 지엠건설 주식회사
1 | 2 | 본청 | 수의2인이상견적 | 신설중앙공원 보완사업 | 174569000 | 2022-05-06 | (주)태광
2 | 3 | 본청 | 제한경쟁 | 시청사 민원동 건립공사(건축)(2차) | 4291994000 | 2022-05-01 | 길림개발 주식회사
3 | 4 | 본청 | 수의2인이상견적 | 박람회 임시주차장(제4주차장) 조성공사 | 265722150 | 2022-04-07 | 세원건설 주식회사
4 | 5 | 본청 | 제한경쟁 | 대천항 주차타워 조성사업(건축) | 1950000000 | 2022-03-31 | 대성건설주식회사
5 | 6 | 본청 | 제한경쟁 | 남포210호(양매선) 농어촌도로 확포장공사(1차분) | 408167000 | 2022-03-15 | 주식회사 한진건설산업
6 | 7 | 본청 | 수의2인이상견적 | 주교202호(팔대선) 농어촌도로 확포장공사 | 179561000 | 2022-02-24 | 케이티씨건설 주식회사
7 | 8 | 본청 | 제한경쟁 | 평라1 급경사지 붕괴위험지역 정비사업 | 1352406000 | 2022-01-27 | 미성건설(주)
8 | 9 | 본청 | 제한경쟁 | 장고도2지구 연안정비사업(2차) | 477719000 | 2022-01-26 | (주)우석건설
9 | 10 | 본청 | 제한경쟁 | 주교면 생활문화플랫폼 신축공사(건축) | 1000000000 | 2021-12-21 | 주식회사태정씨앤디

Last 10 rows

index | 번호 | 관서명 | 계약방법 | 계약명 | 계약금액 | 계약일 | 계약상대자
135 | 136 | 본청 | 제한경쟁 | 보령족구장 조성공사 | 447214000 | 2017-02-23 | 에스지종합건설
136 | 137 | 본청 | 일반경쟁 | 장고도 농어촌마을하수도 정비사업(1차분) | 1302041000 | 2017-02-06 | 서림종합건설(주)
137 | 138 | 본청 | 일반경쟁 | 고대도 농어촌마을하수도 정비사업(1차분) | 1179420000 | 2017-02-02 | 극동건설(주)
138 | 139 | 본청 | 일반경쟁 | 한내여중길~국도36호선 도시계획도로개설공사 | 5573179000 | 2016-11-25 | 명헌건설(주)
139 | 140 | 농업기술센터 | 일반경쟁 | 미생물 배양시설 신축공사(건축) | 320040000 | 2016-07-05 | (주)명성종합건설
140 | 141 | 본청 | 제한경쟁 | 시도17호(성연~죽림) 도로확포장공사 | 1307760000 | 2016-06-17 | (주)지안스건설
141 | 142 | 본청 | 일반경쟁 | 신구소하천 정비공사 | 1284300000 | 2016-05-03 | 동광건설(주)
142 | 143 | 본청 | 제한경쟁 | 간치 자연재해위험개선지구 정비사업(1차,2차) | 6012500000 | 2016-04-29 | 네오서진건설 주식회사(neo)
143 | 144 | 본청 | 제한경쟁 | 신흑동 이주단지 재해위험시설 정비공사 | 706460000 | 2015-06-23 | 새한건설주식회사
144 | 145 | 본청 | 일반경쟁 | 대천1지구 우수저류시설 설치사업 | 13025797000 | 2014-01-16 | (주)라온토건