Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells20
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory644.5 KiB
Average record size in memory66.0 B

Variable types

Numeric2
Categorical2
Text2
DateTime1

Dataset

Description보령시에서 공사를 수의계약한 정보(관서명, 계약방법 ,계약명, 계약금액, 계약일, 계약상대자)에 관한 현황입니다.
Author충청남도 보령시
URLhttps://www.data.go.kr/data/15090098/fileData.do

Alerts

계약방법 has constant value ""Constant
계약금액 is highly skewed (γ1 = 37.34202606)Skewed
번호 has unique valuesUnique

Reproduction

Analysis started2024-03-14 16:09:02.777048
Analysis finished2024-03-14 16:09:05.167128
Duration2.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6997.2982
Minimum1
Maximum13990
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T01:09:05.373973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile707.95
Q13506.75
median6997
Q310495.5
95-th percentile13293.05
Maximum13990
Range13989
Interquartile range (IQR)6988.75

Descriptive statistics

Standard deviation4041.6201
Coefficient of variation (CV)0.57759723
Kurtosis-1.2007147
Mean6997.2982
Median Absolute Deviation (MAD)3494
Skewness-4.7092157 × 10-5
Sum69972982
Variance16334693
MonotonicityNot monotonic
2024-03-15T01:09:05.813967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8490 1
 
< 0.1%
8217 1
 
< 0.1%
1862 1
 
< 0.1%
7188 1
 
< 0.1%
912 1
 
< 0.1%
8338 1
 
< 0.1%
5514 1
 
< 0.1%
1579 1
 
< 0.1%
5205 1
 
< 0.1%
8599 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
ValueCountFrequency (%)
13990 1
< 0.1%
13989 1
< 0.1%
13988 1
< 0.1%
13986 1
< 0.1%
13985 1
< 0.1%
13981 1
< 0.1%
13980 1
< 0.1%
13977 1
< 0.1%
13976 1
< 0.1%
13975 1
< 0.1%

관서명
Categorical

Distinct23
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
본청
3288 
웅천읍
634 
천북면
575 
주교면
552 
청소면
547 
Other values (18)
4404 

Length

Max length9
Median length3
Mean length2.8776
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row오천면
2nd row오천면
3rd row청라면
4th row본청
5th row원산출장소

Common Values

ValueCountFrequency (%)
본청 3288
32.9%
웅천읍 634
 
6.3%
천북면 575
 
5.8%
주교면 552
 
5.5%
청소면 547
 
5.5%
주산면 542
 
5.4%
청라면 539
 
5.4%
남포면 485
 
4.9%
미산면 445
 
4.5%
오천면 440
 
4.4%
Other values (13) 1953
19.5%

Length

2024-03-15T01:09:06.213814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
본청 3288
32.9%
웅천읍 634
 
6.3%
천북면 575
 
5.8%
주교면 552
 
5.5%
청소면 547
 
5.5%
주산면 542
 
5.4%
청라면 539
 
5.4%
남포면 485
 
4.9%
미산면 445
 
4.5%
오천면 440
 
4.4%
Other values (13) 1953
19.5%

계약방법
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
수의1인견적
10000 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수의1인견적
2nd row수의1인견적
3rd row수의1인견적
4th row수의1인견적
5th row수의1인견적

Common Values

ValueCountFrequency (%)
수의1인견적 10000
100.0%

Length

2024-03-15T01:09:06.526112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T01:09:06.818841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수의1인견적 10000
100.0%
Distinct9411
Distinct (%)94.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T01:09:08.733267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length39
Mean length18.239
Min length6

Characters and Unicode

Total characters182390
Distinct characters703
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9038 ?
Unique (%)90.4%

Sample

1st row「오포2리 농로개설공사」
2nd row갈현리 마을안길 아스콘 덧씌우기공사
3rd row「2020년 보관 및 방치슬레이트 처리사업 지정폐기물수거공사」 시행결의
4th row시청사 입구 지장 가로등 이설공사
5th row야영장(소록도, 원산도) 전기 승압 및 조명공사
ValueCountFrequency (%)
배수로 1857
 
5.1%
정비공사 1637
 
4.5%
공사 1121
 
3.0%
설치공사 1079
 
2.9%
마을안길 917
 
2.5%
868
 
2.4%
시행결의 644
 
1.8%
보수공사 566
 
1.5%
포장공사 560
 
1.5%
설치 363
 
1.0%
Other values (7771) 27149
73.9%
2024-03-15T01:09:10.876724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26761
 
14.7%
10618
 
5.8%
9551
 
5.2%
5630
 
3.1%
4922
 
2.7%
4124
 
2.3%
3562
 
2.0%
3173
 
1.7%
3090
 
1.7%
1 3068
 
1.7%
Other values (693) 107891
59.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 136713
75.0%
Space Separator 26761
 
14.7%
Decimal Number 11162
 
6.1%
Close Punctuation 3108
 
1.7%
Open Punctuation 3103
 
1.7%
Dash Punctuation 749
 
0.4%
Uppercase Letter 479
 
0.3%
Other Punctuation 218
 
0.1%
Connector Punctuation 36
 
< 0.1%
Math Symbol 34
 
< 0.1%
Other values (2) 27
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10618
 
7.8%
9551
 
7.0%
5630
 
4.1%
4922
 
3.6%
4124
 
3.0%
3562
 
2.6%
3173
 
2.3%
3090
 
2.3%
2769
 
2.0%
2427
 
1.8%
Other values (635) 86847
63.5%
Uppercase Letter
ValueCountFrequency (%)
C 129
26.9%
T 64
13.4%
V 63
13.2%
E 57
11.9%
L 55
11.5%
D 52
10.9%
A 12
 
2.5%
I 10
 
2.1%
P 10
 
2.1%
S 6
 
1.3%
Other values (7) 21
 
4.4%
Decimal Number
ValueCountFrequency (%)
1 3068
27.5%
2 2968
26.6%
3 1349
12.1%
4 814
 
7.3%
0 713
 
6.4%
5 562
 
5.0%
6 475
 
4.3%
8 451
 
4.0%
7 419
 
3.8%
9 343
 
3.1%
Other Punctuation
ValueCountFrequency (%)
, 139
63.8%
. 30
 
13.8%
· 24
 
11.0%
/ 15
 
6.9%
" 4
 
1.8%
; 3
 
1.4%
: 2
 
0.9%
? 1
 
0.5%
Lowercase Letter
ValueCountFrequency (%)
c 6
23.1%
z 4
15.4%
i 4
15.4%
p 4
15.4%
v 3
11.5%
t 3
11.5%
e 2
 
7.7%
Close Punctuation
ValueCountFrequency (%)
) 2882
92.7%
] 216
 
6.9%
7
 
0.2%
2
 
0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 2877
92.7%
[ 216
 
7.0%
7
 
0.2%
2
 
0.1%
1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 33
97.1%
+ 1
 
2.9%
Space Separator
ValueCountFrequency (%)
26761
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 749
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 36
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 136713
75.0%
Common 45171
 
24.8%
Latin 506
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10618
 
7.8%
9551
 
7.0%
5630
 
4.1%
4922
 
3.6%
4124
 
3.0%
3562
 
2.6%
3173
 
2.3%
3090
 
2.3%
2769
 
2.0%
2427
 
1.8%
Other values (635) 86847
63.5%
Common
ValueCountFrequency (%)
26761
59.2%
1 3068
 
6.8%
2 2968
 
6.6%
) 2882
 
6.4%
( 2877
 
6.4%
3 1349
 
3.0%
4 814
 
1.8%
- 749
 
1.7%
0 713
 
1.6%
5 562
 
1.2%
Other values (23) 2428
 
5.4%
Latin
ValueCountFrequency (%)
C 129
25.5%
T 64
12.6%
V 63
12.5%
E 57
11.3%
L 55
10.9%
D 52
10.3%
A 12
 
2.4%
I 10
 
2.0%
P 10
 
2.0%
S 6
 
1.2%
Other values (15) 48
 
9.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 136713
75.0%
ASCII 45632
 
25.0%
None 44
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
26761
58.6%
1 3068
 
6.7%
2 2968
 
6.5%
) 2882
 
6.3%
( 2877
 
6.3%
3 1349
 
3.0%
4 814
 
1.8%
- 749
 
1.6%
0 713
 
1.6%
5 562
 
1.2%
Other values (40) 2889
 
6.3%
Hangul
ValueCountFrequency (%)
10618
 
7.8%
9551
 
7.0%
5630
 
4.1%
4922
 
3.6%
4124
 
3.0%
3562
 
2.6%
3173
 
2.3%
3090
 
2.3%
2769
 
2.0%
2427
 
1.8%
Other values (635) 86847
63.5%
None
ValueCountFrequency (%)
· 24
54.5%
7
 
15.9%
7
 
15.9%
2
 
4.5%
2
 
4.5%
1
 
2.3%
1
 
2.3%
Number Forms
ValueCountFrequency (%)
1
100.0%

계약금액
Real number (ℝ)

SKEWED 

Distinct5479
Distinct (%)54.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10974420
Minimum110000
Maximum1.84759 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T01:09:11.378074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum110000
5-th percentile1600000
Q14438000
median8348500
Q313801500
95-th percentile19782100
Maximum1.84759 × 109
Range1.84748 × 109
Interquartile range (IQR)9363500

Descriptive statistics

Standard deviation30480326
Coefficient of variation (CV)2.7773974
Kurtosis1863.3007
Mean10974420
Median Absolute Deviation (MAD)4486000
Skewness37.342026
Sum1.097442 × 1011
Variance9.290503 × 1014
MonotonicityNot monotonic
2024-03-15T01:09:11.642849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4750000 85
 
0.9%
9500000 54
 
0.5%
2850000 50
 
0.5%
7600000 49
 
0.5%
18000000 43
 
0.4%
3800000 43
 
0.4%
3000000 41
 
0.4%
4700000 39
 
0.4%
9000000 37
 
0.4%
4900000 37
 
0.4%
Other values (5469) 9522
95.2%
ValueCountFrequency (%)
110000 1
 
< 0.1%
165000 1
 
< 0.1%
250000 1
 
< 0.1%
280000 1
 
< 0.1%
296290 1
 
< 0.1%
300000 2
< 0.1%
310000 1
 
< 0.1%
330000 4
< 0.1%
378010 1
 
< 0.1%
380000 1
 
< 0.1%
ValueCountFrequency (%)
1847590000 1
< 0.1%
1315460000 1
< 0.1%
1118856000 1
< 0.1%
620542000 1
< 0.1%
482079190 1
< 0.1%
400000640 1
< 0.1%
393316680 1
< 0.1%
352686000 1
< 0.1%
350330000 1
< 0.1%
326880000 1
< 0.1%
Distinct2218
Distinct (%)22.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2014-01-13 00:00:00
Maximum2024-01-26 00:00:00
2024-03-15T01:09:11.915020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:09:12.218137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1143
Distinct (%)11.5%
Missing20
Missing (%)0.2%
Memory size156.2 KiB
2024-03-15T01:09:13.215275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length17
Mean length7.3742485
Min length2

Characters and Unicode

Total characters73595
Distinct characters368
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique433 ?
Unique (%)4.3%

Sample

1st row신천건설(주)
2nd row보령도시에너지건설
3rd row(주)동진건설
4th row(주)건영
5th row대림전기
ValueCountFrequency (%)
주식회사 1641
 
14.0%
신천건설(주 193
 
1.6%
유)네오건설 189
 
1.6%
우리토건(주 183
 
1.6%
주)럭키건설 177
 
1.5%
주)씨제이 159
 
1.4%
대한건설(주 146
 
1.2%
주)거산 140
 
1.2%
주)천마건설엔지니어링 139
 
1.2%
보령시산림조합 126
 
1.1%
Other values (1116) 8627
73.6%
2024-03-15T01:09:14.405782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7940
 
10.8%
( 5868
 
8.0%
) 5853
 
8.0%
5306
 
7.2%
4705
 
6.4%
2450
 
3.3%
2277
 
3.1%
2221
 
3.0%
1740
 
2.4%
1687
 
2.3%
Other values (358) 33548
45.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 60060
81.6%
Open Punctuation 5868
 
8.0%
Close Punctuation 5853
 
8.0%
Space Separator 1740
 
2.4%
Other Punctuation 37
 
0.1%
Uppercase Letter 22
 
< 0.1%
Decimal Number 10
 
< 0.1%
Other Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7940
 
13.2%
5306
 
8.8%
4705
 
7.8%
2450
 
4.1%
2277
 
3.8%
2221
 
3.7%
1687
 
2.8%
1273
 
2.1%
1201
 
2.0%
1119
 
1.9%
Other values (337) 29881
49.8%
Uppercase Letter
ValueCountFrequency (%)
E 6
27.3%
N 6
27.3%
C 4
18.2%
M 1
 
4.5%
V 1
 
4.5%
T 1
 
4.5%
S 1
 
4.5%
G 1
 
4.5%
H 1
 
4.5%
Decimal Number
ValueCountFrequency (%)
8 3
30.0%
3 2
20.0%
1 2
20.0%
9 1
 
10.0%
5 1
 
10.0%
2 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
. 35
94.6%
, 2
 
5.4%
Open Punctuation
ValueCountFrequency (%)
( 5868
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5853
100.0%
Space Separator
ValueCountFrequency (%)
1740
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 60065
81.6%
Common 13508
 
18.4%
Latin 22
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7940
 
13.2%
5306
 
8.8%
4705
 
7.8%
2450
 
4.1%
2277
 
3.8%
2221
 
3.7%
1687
 
2.8%
1273
 
2.1%
1201
 
2.0%
1119
 
1.9%
Other values (338) 29886
49.8%
Common
ValueCountFrequency (%)
( 5868
43.4%
) 5853
43.3%
1740
 
12.9%
. 35
 
0.3%
8 3
 
< 0.1%
3 2
 
< 0.1%
1 2
 
< 0.1%
, 2
 
< 0.1%
9 1
 
< 0.1%
5 1
 
< 0.1%
Latin
ValueCountFrequency (%)
E 6
27.3%
N 6
27.3%
C 4
18.2%
M 1
 
4.5%
V 1
 
4.5%
T 1
 
4.5%
S 1
 
4.5%
G 1
 
4.5%
H 1
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 60057
81.6%
ASCII 13530
 
18.4%
None 5
 
< 0.1%
Compat Jamo 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7940
 
13.2%
5306
 
8.8%
4705
 
7.8%
2450
 
4.1%
2277
 
3.8%
2221
 
3.7%
1687
 
2.8%
1273
 
2.1%
1201
 
2.0%
1119
 
1.9%
Other values (334) 29878
49.7%
ASCII
ValueCountFrequency (%)
( 5868
43.4%
) 5853
43.3%
1740
 
12.9%
. 35
 
0.3%
E 6
 
< 0.1%
N 6
 
< 0.1%
C 4
 
< 0.1%
8 3
 
< 0.1%
3 2
 
< 0.1%
1 2
 
< 0.1%
Other values (10) 11
 
0.1%
None
ValueCountFrequency (%)
5
100.0%
Compat Jamo
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Interactions

2024-03-15T01:09:04.409709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:09:03.962222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:09:04.602316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:09:04.225633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T01:09:14.673519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호관서명계약금액
번호1.0000.2460.000
관서명0.2461.0000.000
계약금액0.0000.0001.000
2024-03-15T01:09:14.936349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호계약금액관서명
번호1.0000.0320.093
계약금액0.0321.0000.000
관서명0.0930.0001.000

Missing values

2024-03-15T01:09:04.858712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T01:09:05.069541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호관서명계약방법계약명계약금액계약일계약상대자
84898490오천면수의1인견적「오포2리 농로개설공사」193600002020-04-28신천건설(주)
62676268오천면수의1인견적갈현리 마을안길 아스콘 덧씌우기공사160760002018-10-25보령도시에너지건설
91209121청라면수의1인견적「2020년 보관 및 방치슬레이트 처리사업 지정폐기물수거공사」 시행결의56300002020-11-10(주)동진건설
1244812449본청수의1인견적시청사 입구 지장 가로등 이설공사78700002023-01-30(주)건영
1331813319원산출장소수의1인견적야영장(소록도, 원산도) 전기 승압 및 조명공사45000002023-06-20대림전기
53445345주교면수의1인견적관창2리 마을안길 정비공사36000002018-03-12(주)일동건설
73967397본청수의1인견적2019년 미산면 남부의용소방대 청사 외1건 철거사업(건설구조물 해체)171435102019-06-12(주)이코
1208112082대천5동수의1인견적내항2통 마을안길 정비221500002022-10-21대한건설
1303213033청라면수의1인견적장산1리(390) 경계석 설치 공사 시행결의57500002023-05-01혜성건설
1188811889본청수의1인견적교통신호기 유지보수 단가계약(2차)211960002022-09-08(주)한빛전기
번호관서명계약방법계약명계약금액계약일계약상대자
79837984청소면수의1인견적장곡1리 농로 포장 공사169700002020-02-14길운건설(주)
14651466보건소수의1인견적도서 보건진료소 CCTV 설치공사135000002015-03-13(주)에너지코리아
79017902주산면수의1인견적주야2리 소류지 준설공사47120002019-12-10주식회사 오성건설중기
44064407주포면수의1인견적연지리 연정동 마을쉼터 안전휀스 설치공사15700002017-06-21대천샷시부속
113114대천3동수의1인견적동대3통 농로포장 공사93500002014-02-27보람건설(주)
47214722청라면수의1인견적나원2리 마을안길 개설 및 포장공사134500002017-09-25(주)동진건설
1260212603청라면수의1인견적내현1리 배수로 설치공사 시행결의 (본예산)104500002023-02-24신천건설(주)
62346235본청수의1인견적보령종합체육관 탁구장 마루 설치공사179000002018-10-11주래건설 주식회사
555556본청수의1인견적세원사앞 마을안길 덧씌우기 공사34392102014-04-22경부포장산업(주)
1068710688천북면수의1인견적낙동4리 호우피해복구공사134400002022-02-16(주)홍광산업