Overview

Dataset statistics

Number of variables7
Number of observations4979
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory282.1 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Categorical2
Text2
DateTime1

Dataset

Description보령시에서 물품을 수의 계약한 정보(관서명, 계약방법 ,계약명, 계약금액, 계약일, 계약상대자)에 관한 현황입니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=327&beforeMenuCd=DOM_000000201001001000&publicdatapk=15090077

Alerts

계약방법 has constant value ""Constant
관서명 is highly imbalanced (56.7%)Imbalance
계약금액 is highly skewed (γ1 = 68.24003123)Skewed
번호 has unique valuesUnique

Reproduction

Analysis started2024-01-09 20:41:45.832958
Analysis finished2024-01-09 20:41:47.222486
Duration1.39 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct4979
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2490
Minimum1
Maximum4979
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size43.9 KiB
2024-01-10T05:41:47.317639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile249.9
Q11245.5
median2490
Q33734.5
95-th percentile4730.1
Maximum4979
Range4978
Interquartile range (IQR)2489

Descriptive statistics

Standard deviation1437.4578
Coefficient of variation (CV)0.5772923
Kurtosis-1.2
Mean2490
Median Absolute Deviation (MAD)1245
Skewness0
Sum12397710
Variance2066285
MonotonicityStrictly increasing
2024-01-10T05:41:47.496423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
3319 1
 
< 0.1%
3326 1
 
< 0.1%
3325 1
 
< 0.1%
3324 1
 
< 0.1%
3323 1
 
< 0.1%
3322 1
 
< 0.1%
3321 1
 
< 0.1%
3320 1
 
< 0.1%
3318 1
 
< 0.1%
Other values (4969) 4969
99.8%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
4979 1
< 0.1%
4978 1
< 0.1%
4977 1
< 0.1%
4976 1
< 0.1%
4975 1
< 0.1%
4974 1
< 0.1%
4973 1
< 0.1%
4972 1
< 0.1%
4971 1
< 0.1%
4970 1
< 0.1%

관서명
Categorical

IMBALANCE 

Distinct22
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size39.0 KiB
본청
3322 
보건소
810 
농업기술센터
 
127
문화체육관리사업소
 
101
남포면
 
73
Other values (17)
546 

Length

Max length9
Median length2
Mean length2.5726049
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row본청
2nd row본청
3rd row본청
4th row본청
5th row본청

Common Values

ValueCountFrequency (%)
본청 3322
66.7%
보건소 810
 
16.3%
농업기술센터 127
 
2.6%
문화체육관리사업소 101
 
2.0%
남포면 73
 
1.5%
주산면 54
 
1.1%
오천면 52
 
1.0%
대천2동 47
 
0.9%
성주면 46
 
0.9%
청라면 45
 
0.9%
Other values (12) 302
 
6.1%

Length

2024-01-10T05:41:47.663600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
본청 3322
66.7%
보건소 810
 
16.3%
농업기술센터 127
 
2.6%
문화체육관리사업소 101
 
2.0%
남포면 73
 
1.5%
주산면 54
 
1.1%
오천면 52
 
1.0%
대천2동 47
 
0.9%
성주면 46
 
0.9%
청라면 45
 
0.9%
Other values (12) 302
 
6.1%

계약방법
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.0 KiB
수의1인견적
4979 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수의1인견적
2nd row수의1인견적
3rd row수의1인견적
4th row수의1인견적
5th row수의1인견적

Common Values

ValueCountFrequency (%)
수의1인견적 4979
100.0%

Length

2024-01-10T05:41:47.796990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:41:47.902040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수의1인견적 4979
100.0%
Distinct4566
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Memory size39.0 KiB
2024-01-10T05:41:48.125425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length42
Mean length19.241816
Min length3

Characters and Unicode

Total characters95805
Distinct characters799
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4288 ?
Unique (%)86.1%

Sample

1st row2014년 방과후공부방 아동 석식 식자재 구입
2nd row2014년 재가복지 밑반찬 재료비
3rd row세외수입콜센터운영위한 컴퓨터 구입
4th row장고도 위생매립장 음식물 쓰레기 처리기 구입
5th row종량제 규격봉투(일반-100ℓ) 제작
ValueCountFrequency (%)
구입 2187
 
11.2%
제작 357
 
1.8%
351
 
1.8%
설치 199
 
1.0%
177
 
0.9%
관급자재 127
 
0.6%
물품 116
 
0.6%
운영 94
 
0.5%
인쇄 91
 
0.5%
홍보물품 84
 
0.4%
Other values (6023) 15795
80.7%
2024-01-10T05:41:48.755687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14599
 
15.2%
3741
 
3.9%
3181
 
3.3%
( 1603
 
1.7%
) 1600
 
1.7%
1595
 
1.7%
1469
 
1.5%
1342
 
1.4%
1301
 
1.4%
1211
 
1.3%
Other values (789) 64163
67.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 73185
76.4%
Space Separator 14599
 
15.2%
Decimal Number 2707
 
2.8%
Open Punctuation 1707
 
1.8%
Close Punctuation 1702
 
1.8%
Uppercase Letter 995
 
1.0%
Other Punctuation 343
 
0.4%
Connector Punctuation 266
 
0.3%
Dash Punctuation 173
 
0.2%
Lowercase Letter 115
 
0.1%
Other values (2) 13
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3741
 
5.1%
3181
 
4.3%
1595
 
2.2%
1469
 
2.0%
1342
 
1.8%
1301
 
1.8%
1211
 
1.7%
1182
 
1.6%
1096
 
1.5%
1087
 
1.5%
Other values (714) 55980
76.5%
Uppercase Letter
ValueCountFrequency (%)
C 148
14.9%
D 131
13.2%
V 99
9.9%
T 93
9.3%
B 66
 
6.6%
E 63
 
6.3%
P 59
 
5.9%
I 56
 
5.6%
L 49
 
4.9%
A 40
 
4.0%
Other values (13) 191
19.2%
Lowercase Letter
ValueCountFrequency (%)
23
20.0%
e 14
12.2%
c 9
 
7.8%
l 7
 
6.1%
a 6
 
5.2%
f 6
 
5.2%
v 5
 
4.3%
m 5
 
4.3%
i 5
 
4.3%
d 4
 
3.5%
Other values (12) 31
27.0%
Decimal Number
ValueCountFrequency (%)
2 761
28.1%
1 612
22.6%
0 517
19.1%
3 236
 
8.7%
4 124
 
4.6%
9 113
 
4.2%
5 113
 
4.2%
6 90
 
3.3%
7 83
 
3.1%
8 58
 
2.1%
Other Punctuation
ValueCountFrequency (%)
, 257
74.9%
. 40
 
11.7%
· 24
 
7.0%
" 10
 
2.9%
/ 8
 
2.3%
* 4
 
1.2%
Open Punctuation
ValueCountFrequency (%)
( 1603
93.9%
[ 90
 
5.3%
12
 
0.7%
2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1600
94.0%
] 88
 
5.2%
12
 
0.7%
2
 
0.1%
Math Symbol
ValueCountFrequency (%)
+ 8
66.7%
~ 4
33.3%
Space Separator
ValueCountFrequency (%)
14599
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 266
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 173
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 73182
76.4%
Common 21533
 
22.5%
Latin 1087
 
1.1%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3741
 
5.1%
3181
 
4.3%
1595
 
2.2%
1469
 
2.0%
1342
 
1.8%
1301
 
1.8%
1211
 
1.7%
1182
 
1.6%
1096
 
1.5%
1087
 
1.5%
Other values (713) 55977
76.5%
Latin
ValueCountFrequency (%)
C 148
13.6%
D 131
12.1%
V 99
 
9.1%
T 93
 
8.6%
B 66
 
6.1%
E 63
 
5.8%
P 59
 
5.4%
I 56
 
5.2%
L 49
 
4.5%
A 40
 
3.7%
Other values (34) 283
26.0%
Common
ValueCountFrequency (%)
14599
67.8%
( 1603
 
7.4%
) 1600
 
7.4%
2 761
 
3.5%
1 612
 
2.8%
0 517
 
2.4%
_ 266
 
1.2%
, 257
 
1.2%
3 236
 
1.1%
- 173
 
0.8%
Other values (21) 909
 
4.2%
Han
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 73182
76.4%
ASCII 22544
 
23.5%
None 52
 
0.1%
Letterlike Symbols 23
 
< 0.1%
CJK 3
 
< 0.1%
CJK Compat 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
14599
64.8%
( 1603
 
7.1%
) 1600
 
7.1%
2 761
 
3.4%
1 612
 
2.7%
0 517
 
2.3%
_ 266
 
1.2%
, 257
 
1.1%
3 236
 
1.0%
- 173
 
0.8%
Other values (58) 1920
 
8.5%
Hangul
ValueCountFrequency (%)
3741
 
5.1%
3181
 
4.3%
1595
 
2.2%
1469
 
2.0%
1342
 
1.8%
1301
 
1.8%
1211
 
1.7%
1182
 
1.6%
1096
 
1.5%
1087
 
1.5%
Other values (713) 55977
76.5%
None
ValueCountFrequency (%)
· 24
46.2%
12
23.1%
12
23.1%
2
 
3.8%
2
 
3.8%
Letterlike Symbols
ValueCountFrequency (%)
23
100.0%
CJK
ValueCountFrequency (%)
3
100.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%

계약금액
Real number (ℝ)

SKEWED 

Distinct3068
Distinct (%)61.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13926321
Minimum29880
Maximum1.256075 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size43.9 KiB
2024-01-10T05:41:49.192637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum29880
5-th percentile659800
Q12750000
median5735000
Q311250000
95-th percentile33670850
Maximum1.256075 × 1010
Range1.256072 × 1010
Interquartile range (IQR)8500000

Descriptive statistics

Standard deviation1.7988265 × 108
Coefficient of variation (CV)12.916739
Kurtosis4757.5954
Mean13926321
Median Absolute Deviation (MAD)3635000
Skewness68.240031
Sum6.9339152 × 1010
Variance3.2357768 × 1016
MonotonicityNot monotonic
2024-01-10T05:41:49.639926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3200000 41
 
0.8%
3000000 36
 
0.7%
2000000 34
 
0.7%
4500000 25
 
0.5%
5000000 25
 
0.5%
6000000 23
 
0.5%
4000000 23
 
0.5%
5700000 22
 
0.4%
9000000 22
 
0.4%
18000000 22
 
0.4%
Other values (3058) 4706
94.5%
ValueCountFrequency (%)
29880 1
< 0.1%
36370 1
< 0.1%
50000 1
< 0.1%
55000 1
< 0.1%
58000 1
< 0.1%
74000 1
< 0.1%
77000 1
< 0.1%
80000 2
< 0.1%
98000 2
< 0.1%
99000 1
< 0.1%
ValueCountFrequency (%)
12560750000 1
< 0.1%
923571350 1
< 0.1%
418446000 1
< 0.1%
411642000 1
< 0.1%
405554330 1
< 0.1%
375435000 1
< 0.1%
330640000 1
< 0.1%
324038000 1
< 0.1%
321750000 1
< 0.1%
259529600 1
< 0.1%
Distinct1664
Distinct (%)33.4%
Missing0
Missing (%)0.0%
Memory size39.0 KiB
Minimum2014-01-01 00:00:00
Maximum2021-09-14 00:00:00
2024-01-10T05:41:49.896188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:41:50.086698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1182
Distinct (%)23.7%
Missing0
Missing (%)0.0%
Memory size39.0 KiB
2024-01-10T05:41:50.355227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length24
Mean length6.6224141
Min length2

Characters and Unicode

Total characters32973
Distinct characters516
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique681 ?
Unique (%)13.7%

Sample

1st row대천농업협동조합
2nd row대천농업협동조합
3rd row조달청
4th row조달청
5th row조달청
ValueCountFrequency (%)
대전지방조달청 796
 
14.7%
주식회사 164
 
3.0%
안진팜 98
 
1.8%
주)거산 83
 
1.5%
동일사 79
 
1.5%
자)제중약품 78
 
1.4%
리바트 74
 
1.4%
주)광명프라자 71
 
1.3%
보령사무용가구 70
 
1.3%
대전지방조달청장 65
 
1.2%
Other values (1214) 3820
70.8%
2024-01-10T05:41:50.779793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1332
 
4.0%
1256
 
3.8%
( 1199
 
3.6%
) 1196
 
3.6%
1088
 
3.3%
1076
 
3.3%
1031
 
3.1%
1003
 
3.0%
924
 
2.8%
915
 
2.8%
Other values (506) 21953
66.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29854
90.5%
Open Punctuation 1199
 
3.6%
Close Punctuation 1196
 
3.6%
Space Separator 419
 
1.3%
Uppercase Letter 248
 
0.8%
Other Punctuation 38
 
0.1%
Decimal Number 10
 
< 0.1%
Lowercase Letter 6
 
< 0.1%
Other Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1332
 
4.5%
1256
 
4.2%
1088
 
3.6%
1076
 
3.6%
1031
 
3.5%
1003
 
3.4%
924
 
3.1%
915
 
3.1%
834
 
2.8%
494
 
1.7%
Other values (473) 19901
66.7%
Uppercase Letter
ValueCountFrequency (%)
N 35
14.1%
E 27
10.9%
S 25
10.1%
G 21
8.5%
B 20
8.1%
A 17
 
6.9%
C 15
 
6.0%
H 15
 
6.0%
W 13
 
5.2%
K 12
 
4.8%
Other values (10) 48
19.4%
Other Punctuation
ValueCountFrequency (%)
. 22
57.9%
, 11
28.9%
/ 4
 
10.5%
& 1
 
2.6%
Decimal Number
ValueCountFrequency (%)
2 8
80.0%
4 1
 
10.0%
5 1
 
10.0%
Lowercase Letter
ValueCountFrequency (%)
w 3
50.0%
s 3
50.0%
Open Punctuation
ValueCountFrequency (%)
( 1199
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1196
100.0%
Space Separator
ValueCountFrequency (%)
419
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29857
90.5%
Common 2862
 
8.7%
Latin 254
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1332
 
4.5%
1256
 
4.2%
1088
 
3.6%
1076
 
3.6%
1031
 
3.5%
1003
 
3.4%
924
 
3.1%
915
 
3.1%
834
 
2.8%
494
 
1.7%
Other values (474) 19904
66.7%
Latin
ValueCountFrequency (%)
N 35
13.8%
E 27
10.6%
S 25
9.8%
G 21
 
8.3%
B 20
 
7.9%
A 17
 
6.7%
C 15
 
5.9%
H 15
 
5.9%
W 13
 
5.1%
K 12
 
4.7%
Other values (12) 54
21.3%
Common
ValueCountFrequency (%)
( 1199
41.9%
) 1196
41.8%
419
 
14.6%
. 22
 
0.8%
, 11
 
0.4%
2 8
 
0.3%
/ 4
 
0.1%
& 1
 
< 0.1%
4 1
 
< 0.1%
5 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29854
90.5%
ASCII 3116
 
9.5%
None 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1332
 
4.5%
1256
 
4.2%
1088
 
3.6%
1076
 
3.6%
1031
 
3.5%
1003
 
3.4%
924
 
3.1%
915
 
3.1%
834
 
2.8%
494
 
1.7%
Other values (473) 19901
66.7%
ASCII
ValueCountFrequency (%)
( 1199
38.5%
) 1196
38.4%
419
 
13.4%
N 35
 
1.1%
E 27
 
0.9%
S 25
 
0.8%
. 22
 
0.7%
G 21
 
0.7%
B 20
 
0.6%
A 17
 
0.5%
Other values (22) 135
 
4.3%
None
ValueCountFrequency (%)
3
100.0%

Interactions

2024-01-10T05:41:46.826444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:41:46.637738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:41:46.919982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:41:46.724830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:41:50.871322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호관서명계약금액
번호1.0000.4230.002
관서명0.4231.0000.000
계약금액0.0020.0001.000
2024-01-10T05:41:50.961229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호계약금액관서명
번호1.0000.1090.170
계약금액0.1091.0000.000
관서명0.1700.0001.000

Missing values

2024-01-10T05:41:47.048707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:41:47.163377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호관서명계약방법계약명계약금액계약일계약상대자
01본청수의1인견적2014년 방과후공부방 아동 석식 식자재 구입374400002014-01-01대천농업협동조합
12본청수의1인견적2014년 재가복지 밑반찬 재료비201600002014-01-01대천농업협동조합
23본청수의1인견적세외수입콜센터운영위한 컴퓨터 구입27145802014-01-03조달청
34본청수의1인견적장고도 위생매립장 음식물 쓰레기 처리기 구입139713002014-01-03조달청
45본청수의1인견적종량제 규격봉투(일반-100ℓ) 제작74922402014-01-03조달청
56본청수의1인견적종량제 규격봉투(음식물-20,재사용20)제작117511102014-01-06조달청
67본청수의1인견적전자회의용 L2스위치 구입20658002014-01-07(주)케이티
78대천3동수의1인견적주민자치센터 집기비품 구입91900002014-01-07대전지방조달청
89본청수의1인견적세외수입 납부지원 콜센터 운영위한 사무용품 구입(책상1,의자1)5050002014-01-08동일사
910본청수의1인견적행정업무용 소프트웨어 구입496011002014-01-08조달청
번호관서명계약방법계약명계약금액계약일계약상대자
49694970본청수의1인견적보령 복싱체육관 건립사업 간판제작 지급결의84600002021-09-03머드안전산업
49704971보건소수의1인견적심뇌혈관질환 예방관리사업 운영 홍보 물품 구입57000002021-09-06시시각각
49714972보건소수의1인견적임상검사실 검사물품구입26980002021-09-07메디팜
49724973본청수의1인견적오천항 경사부두 여객선 입·출항시 주의사항 안내판 구입14210002021-09-07미스터플래카드
49734974보건소수의1인견적코로나19 예방접종용 희석액 구입32100002021-09-13안진팜
49744975본청수의1인견적해수욕장 비개장기간 물놀이 안전관리요원 근무복41695502021-09-13주식회사 보령에스
49754976본청수의1인견적2022년 시책보고회 자료 인쇄80750002021-09-13도서출판 종합인쇄
49764977본청수의1인견적사무용 가구 구입(2회 추경)76470002021-09-14보령가구아울렛
49774978보건소수의1인견적금연 홍보 표지판(부착용) 제작49500002021-09-14종합광고사
49784979보건소수의1인견적구강보건예방 리플렛 제작21450002021-09-14(주)악어미디어