Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows8
Duplicate rows (%)0.1%
Total size in memory556.6 KiB
Average record size in memory57.0 B

Variable types

Categorical2
Text2
Numeric1
DateTime1

Dataset

Description부산광역시_중구_계약정보공개시스템_계약대장_20211126
Author부산광역시 중구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15039586

Alerts

Dataset has 8 (0.1%) duplicate rowsDuplicates
관서명 is highly imbalanced (70.8%)Imbalance
계약금액 is highly skewed (γ1 = 22.00683959)Skewed
계약금액 has 103 (1.0%) zerosZeros

Reproduction

Analysis started2023-12-10 16:43:48.569471
Analysis finished2023-12-10 16:43:49.867123
Duration1.3 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
물품
6586 
용역
2207 
공사
1207 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row용역
2nd row공사
3rd row물품
4th row용역
5th row물품

Common Values

ValueCountFrequency (%)
물품 6586
65.9%
용역 2207
 
22.1%
공사 1207
 
12.1%

Length

2023-12-11T01:43:49.923300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:43:50.018792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
물품 6586
65.9%
용역 2207
 
22.1%
공사 1207
 
12.1%

관서명
Categorical

IMBALANCE 

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
본청
8170 
보건소
1148 
시설관리사업소
 
174
영주2동
 
79
의회
 
60
Other values (8)
 
369

Length

Max length7
Median length2
Mean length2.2593
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row본청
2nd row본청
3rd row본청
4th row본청
5th row본청

Common Values

ValueCountFrequency (%)
본청 8170
81.7%
보건소 1148
 
11.5%
시설관리사업소 174
 
1.7%
영주2동 79
 
0.8%
의회 60
 
0.6%
광복동 58
 
0.6%
대청동 56
 
0.6%
부평동 54
 
0.5%
동광동 52
 
0.5%
남포동 50
 
0.5%
Other values (3) 99
 
1.0%

Length

2023-12-11T01:43:50.193083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
본청 8170
81.7%
보건소 1148
 
11.5%
시설관리사업소 174
 
1.7%
영주2동 79
 
0.8%
의회 60
 
0.6%
광복동 58
 
0.6%
대청동 56
 
0.6%
부평동 54
 
0.5%
동광동 52
 
0.5%
남포동 50
 
0.5%
Other values (3) 99
 
1.0%
Distinct8768
Distinct (%)87.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T01:43:50.873825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length37
Mean length21.4115
Min length4

Characters and Unicode

Total characters214115
Distinct characters851
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8224 ?
Unique (%)82.2%

Sample

1st row신천지시장 천정 및 LED교체.방송통신공사 실시설계 용역
2nd row보수종합시장 LED조명 교체공사
3rd row협동조합 활성화 사업 홍보물 제작
4th row지역관광브랜드 구축 및 체류관광활성화 연구용역
5th row청경실 냉난방기 구입
ValueCountFrequency (%)
구입 2289
 
5.3%
제작 1007
 
2.3%
893
 
2.1%
일원 516
 
1.2%
구매 480
 
1.1%
422
 
1.0%
용역 339
 
0.8%
설치 338
 
0.8%
교체 281
 
0.7%
정비 279
 
0.6%
Other values (9016) 36271
84.1%
2023-12-11T01:43:51.492856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33201
 
15.5%
5521
 
2.6%
5123
 
2.4%
( 3737
 
1.7%
3705
 
1.7%
) 3668
 
1.7%
3111
 
1.5%
2872
 
1.3%
2573
 
1.2%
2 2483
 
1.2%
Other values (841) 148121
69.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 158393
74.0%
Space Separator 33201
 
15.5%
Decimal Number 9760
 
4.6%
Open Punctuation 3879
 
1.8%
Close Punctuation 3810
 
1.8%
Uppercase Letter 2678
 
1.3%
Other Punctuation 1054
 
0.5%
Dash Punctuation 816
 
0.4%
Lowercase Letter 337
 
0.2%
Connector Punctuation 108
 
0.1%
Other values (3) 79
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5521
 
3.5%
5123
 
3.2%
3705
 
2.3%
3111
 
2.0%
2872
 
1.8%
2573
 
1.6%
2393
 
1.5%
2352
 
1.5%
2347
 
1.5%
2219
 
1.4%
Other values (757) 126177
79.7%
Uppercase Letter
ValueCountFrequency (%)
C 295
11.0%
D 280
10.5%
E 277
10.3%
L 234
8.7%
V 216
8.1%
F 201
7.5%
P 200
7.5%
T 193
 
7.2%
B 161
 
6.0%
I 129
 
4.8%
Other values (14) 492
18.4%
Lowercase Letter
ValueCountFrequency (%)
a 41
12.2%
t 40
11.9%
s 40
11.9%
i 37
11.0%
r 29
8.6%
o 26
7.7%
y 24
7.1%
e 18
 
5.3%
b 12
 
3.6%
p 12
 
3.6%
Other values (11) 58
17.2%
Other Punctuation
ValueCountFrequency (%)
, 444
42.1%
. 287
27.2%
· 156
 
14.8%
/ 86
 
8.2%
' 35
 
3.3%
" 23
 
2.2%
: 13
 
1.2%
& 4
 
0.4%
* 2
 
0.2%
! 2
 
0.2%
Other values (2) 2
 
0.2%
Decimal Number
ValueCountFrequency (%)
2 2483
25.4%
1 2210
22.6%
0 1938
19.9%
4 685
 
7.0%
5 468
 
4.8%
9 441
 
4.5%
3 419
 
4.3%
6 387
 
4.0%
8 374
 
3.8%
7 355
 
3.6%
Open Punctuation
ValueCountFrequency (%)
( 3737
96.3%
95
 
2.4%
26
 
0.7%
[ 21
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 3668
96.3%
95
 
2.5%
26
 
0.7%
] 21
 
0.6%
Math Symbol
ValueCountFrequency (%)
~ 61
89.7%
< 3
 
4.4%
> 3
 
4.4%
+ 1
 
1.5%
Space Separator
ValueCountFrequency (%)
33201
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 816
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 108
100.0%
Modifier Symbol
ValueCountFrequency (%)
˙ 8
100.0%
Letter Number
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 158390
74.0%
Common 52706
 
24.6%
Latin 3016
 
1.4%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5521
 
3.5%
5123
 
3.2%
3705
 
2.3%
3111
 
2.0%
2872
 
1.8%
2573
 
1.6%
2393
 
1.5%
2352
 
1.5%
2347
 
1.5%
2219
 
1.4%
Other values (754) 126174
79.7%
Latin
ValueCountFrequency (%)
C 295
 
9.8%
D 280
 
9.3%
E 277
 
9.2%
L 234
 
7.8%
V 216
 
7.2%
F 201
 
6.7%
P 200
 
6.6%
T 193
 
6.4%
B 161
 
5.3%
I 129
 
4.3%
Other values (35) 830
27.5%
Common
ValueCountFrequency (%)
33201
63.0%
( 3737
 
7.1%
) 3668
 
7.0%
2 2483
 
4.7%
1 2210
 
4.2%
0 1938
 
3.7%
- 816
 
1.5%
4 685
 
1.3%
5 468
 
0.9%
, 444
 
0.8%
Other values (29) 3056
 
5.8%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 158387
74.0%
ASCII 55310
 
25.8%
None 399
 
0.2%
Modifier Letters 8
 
< 0.1%
Compat Jamo 3
 
< 0.1%
Number Forms 3
 
< 0.1%
CJK 3
 
< 0.1%
Letterlike Symbols 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
33201
60.0%
( 3737
 
6.8%
) 3668
 
6.6%
2 2483
 
4.5%
1 2210
 
4.0%
0 1938
 
3.5%
- 816
 
1.5%
4 685
 
1.2%
5 468
 
0.8%
, 444
 
0.8%
Other values (65) 5660
 
10.2%
Hangul
ValueCountFrequency (%)
5521
 
3.5%
5123
 
3.2%
3705
 
2.3%
3111
 
2.0%
2872
 
1.8%
2573
 
1.6%
2393
 
1.5%
2352
 
1.5%
2347
 
1.5%
2219
 
1.4%
Other values (753) 126171
79.7%
None
ValueCountFrequency (%)
· 156
39.1%
95
23.8%
95
23.8%
26
 
6.5%
26
 
6.5%
1
 
0.3%
Modifier Letters
ValueCountFrequency (%)
˙ 8
100.0%
Compat Jamo
ValueCountFrequency (%)
3
100.0%
Number Forms
ValueCountFrequency (%)
3
100.0%
Letterlike Symbols
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

계약금액
Real number (ℝ)

SKEWED  ZEROS 

Distinct5418
Distinct (%)54.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15736939
Minimum0
Maximum3.474702 × 109
Zeros103
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:43:51.655895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile111528.5
Q11000000
median3171050
Q39501000
95-th percentile49282450
Maximum3.474702 × 109
Range3.474702 × 109
Interquartile range (IQR)8501000

Descriptive statistics

Standard deviation81725939
Coefficient of variation (CV)5.1932553
Kurtosis699.17212
Mean15736939
Median Absolute Deviation (MAD)2676050
Skewness22.00684
Sum1.5736939 × 1011
Variance6.6791291 × 1015
MonotonicityNot monotonic
2023-12-11T01:43:51.796294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1800000 138
 
1.4%
0 103
 
1.0%
990000 47
 
0.5%
3000000 46
 
0.5%
550000 44
 
0.4%
1000000 43
 
0.4%
1980000 43
 
0.4%
440000 42
 
0.4%
1500000 42
 
0.4%
880000 41
 
0.4%
Other values (5408) 9411
94.1%
ValueCountFrequency (%)
0 103
1.0%
8000 3
 
< 0.1%
9200 1
 
< 0.1%
9600 6
 
0.1%
10400 3
 
< 0.1%
11000 1
 
< 0.1%
11200 3
 
< 0.1%
12000 1
 
< 0.1%
13830 1
 
< 0.1%
16500 4
 
< 0.1%
ValueCountFrequency (%)
3474702000 1
< 0.1%
2992475330 1
< 0.1%
2742102000 1
< 0.1%
1649276000 1
< 0.1%
1573107980 1
< 0.1%
1500523000 1
< 0.1%
1257652000 1
< 0.1%
1128642000 1
< 0.1%
1117107000 1
< 0.1%
1110000000 1
< 0.1%
Distinct2051
Distinct (%)20.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2013-09-05 00:00:00
Maximum2021-12-03 00:00:00
2023-12-11T01:43:51.975739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:43:52.105031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2894
Distinct (%)28.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T01:43:52.342316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length18
Mean length7.3712
Min length1

Characters and Unicode

Total characters73712
Distinct characters641
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1610 ?
Unique (%)16.1%

Sample

1st row희건종합건축사사무소
2nd row(주)경화전기
3rd row(주)엠아이비
4th row사단법인한국경제개발연구원
5th row(주)경상비투비
ValueCountFrequency (%)
주식회사 994
 
8.6%
동성인쇄사 128
 
1.1%
한일인쇄사 125
 
1.1%
주)삼창에스씨 106
 
0.9%
인쇄출판태산 93
 
0.8%
부산우유보수보급소 91
 
0.8%
주)새론테크 85
 
0.7%
엘지전자 78
 
0.7%
너울광고기획 76
 
0.7%
부경아스콘사업협동조합 71
 
0.6%
Other values (2976) 9709
84.0%
2023-12-11T01:43:52.778765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5286
 
7.2%
) 3987
 
5.4%
( 3977
 
5.4%
2784
 
3.8%
1629
 
2.2%
1580
 
2.1%
1578
 
2.1%
1559
 
2.1%
1480
 
2.0%
1457
 
2.0%
Other values (631) 48395
65.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 63254
85.8%
Close Punctuation 3988
 
5.4%
Open Punctuation 3978
 
5.4%
Space Separator 1559
 
2.1%
Uppercase Letter 688
 
0.9%
Lowercase Letter 109
 
0.1%
Other Punctuation 62
 
0.1%
Decimal Number 50
 
0.1%
Dash Punctuation 16
 
< 0.1%
Connector Punctuation 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5286
 
8.4%
2784
 
4.4%
1629
 
2.6%
1580
 
2.5%
1578
 
2.5%
1480
 
2.3%
1457
 
2.3%
1058
 
1.7%
1030
 
1.6%
1011
 
1.6%
Other values (572) 44361
70.1%
Uppercase Letter
ValueCountFrequency (%)
S 129
18.8%
M 83
12.1%
E 65
9.4%
K 43
 
6.2%
N 39
 
5.7%
O 38
 
5.5%
A 38
 
5.5%
J 36
 
5.2%
C 33
 
4.8%
T 31
 
4.5%
Other values (13) 153
22.2%
Lowercase Letter
ValueCountFrequency (%)
e 23
21.1%
n 14
12.8%
g 9
 
8.3%
t 9
 
8.3%
r 7
 
6.4%
a 7
 
6.4%
o 6
 
5.5%
s 6
 
5.5%
m 5
 
4.6%
b 5
 
4.6%
Other values (6) 18
16.5%
Decimal Number
ValueCountFrequency (%)
6 10
20.0%
3 10
20.0%
5 10
20.0%
2 9
18.0%
1 5
10.0%
9 2
 
4.0%
0 2
 
4.0%
8 1
 
2.0%
4 1
 
2.0%
Other Punctuation
ValueCountFrequency (%)
. 37
59.7%
& 24
38.7%
/ 1
 
1.6%
Close Punctuation
ValueCountFrequency (%)
) 3987
> 99.9%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 3977
> 99.9%
1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
1559
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 63256
85.8%
Common 9659
 
13.1%
Latin 797
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5286
 
8.4%
2784
 
4.4%
1629
 
2.6%
1580
 
2.5%
1578
 
2.5%
1480
 
2.3%
1457
 
2.3%
1058
 
1.7%
1030
 
1.6%
1011
 
1.6%
Other values (573) 44363
70.1%
Latin
ValueCountFrequency (%)
S 129
16.2%
M 83
 
10.4%
E 65
 
8.2%
K 43
 
5.4%
N 39
 
4.9%
O 38
 
4.8%
A 38
 
4.8%
J 36
 
4.5%
C 33
 
4.1%
T 31
 
3.9%
Other values (29) 262
32.9%
Common
ValueCountFrequency (%)
) 3987
41.3%
( 3977
41.2%
1559
 
16.1%
. 37
 
0.4%
& 24
 
0.2%
- 16
 
0.2%
6 10
 
0.1%
3 10
 
0.1%
5 10
 
0.1%
2 9
 
0.1%
Other values (9) 20
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 63254
85.8%
ASCII 10454
 
14.2%
None 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5286
 
8.4%
2784
 
4.4%
1629
 
2.6%
1580
 
2.5%
1578
 
2.5%
1480
 
2.3%
1457
 
2.3%
1058
 
1.7%
1030
 
1.6%
1011
 
1.6%
Other values (572) 44361
70.1%
ASCII
ValueCountFrequency (%)
) 3987
38.1%
( 3977
38.0%
1559
 
14.9%
S 129
 
1.2%
M 83
 
0.8%
E 65
 
0.6%
K 43
 
0.4%
N 39
 
0.4%
O 38
 
0.4%
A 38
 
0.4%
Other values (46) 496
 
4.7%
None
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

Interactions

2023-12-11T01:43:49.518954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:43:52.902206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분관서명계약금액
구분1.0000.2790.166
관서명0.2791.0000.000
계약금액0.1660.0001.000
2023-12-11T01:43:53.008881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분관서명
구분1.0000.163
관서명0.1631.000
2023-12-11T01:43:53.096885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계약금액구분관서명
계약금액1.0000.1060.000
구분0.1061.0000.163
관서명0.0000.1631.000

Missing values

2023-12-11T01:43:49.710165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:43:49.817701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분관서명계약명계약금액계약일계약대상자
9223용역본청신천지시장 천정 및 LED교체.방송통신공사 실시설계 용역147000002015-04-29희건종합건축사사무소
7239공사본청보수종합시장 LED조명 교체공사137410002016-07-04(주)경화전기
6705물품본청협동조합 활성화 사업 홍보물 제작25000002016-12-01(주)엠아이비
3260용역본청지역관광브랜드 구축 및 체류관광활성화 연구용역362290002019-06-17사단법인한국경제개발연구원
3432물품본청청경실 냉난방기 구입19000002019-04-24(주)경상비투비
7797용역본청2016년 신천지시장 외관정비공사 실시설계 용역248000002016-03-21미건종합건축사사무소
8940물품본청전통시장 메르스 확산 예방대책 추진물품 구입(배너)4680002015-06-25너울광고기획
812용역광복동2021년도 상반기 제3종시설물 정기점검 실시8800002021-03-09(주)연우엔지니어링
3370공사본청영주 Hi-story 육아나눔터 조성사업 정보통신공사92015002019-05-20엑사정보기술 주식회사
3063물품본청버스승강장 냉방기(에어커튼) 구매·설치(조달)18000002019-08-07(주) 세기시스템
구분관서명계약명계약금액계약일계약대상자
3589공사본청저소득층 LED조명 교체공사34650002019-03-25동서전력
6546물품본청Nice 중구 갤러리 작품 구입13000002017-01-11여목
889용역본청「2021년 5년차 이상 민방위대원 사이버교육」위탁용역 계약의뢰66000002021-02-22(주)국안에듀
6972용역본청북항 재개발지역 관할구역 경계설정 용역110000002016-10-04부산대학교 산학협력단
11305공사본청중구 대청동 주거환경정비공사1824900002014-01-10혜도종합토건(주)
1219물품본청2020년 드림스타트 홍보달력 및 쇼핑백 제작39600002020-12-01부성카렌다사
3071물품본청평생학습관 물품 구입(수강용의자 등)185200002019-08-06주식회사 기영포맥스
10256물품의회의정활동 사진 인화료 지급3505002014-08-21서화사진관 김숙희
10476물품본청구청사 현관 민선6기 현판 등 제작27500002014-07-03중앙광고기업
1094물품본청중구 지사협 사무실 컴퓨터 구입19665602020-12-16(주)주연테크

Duplicate rows

Most frequently occurring

구분관서명계약명계약금액계약일계약대상자# duplicates
0물품보건소2018년 방역약품 구입154000002018-02-21부산지방조달청2
1물품본청불법주정차 단속원 근무복 구입14400002014-02-14로체아웃도어2
2물품본청업무용 명함 제작440002016-07-06인쇄출판태산2
3물품본청영도대교 일원 수변 산책로 난간조명 설치 공사 관급자재(SMPS함, 인입전주)20790002021-06-11남선전기(주)2
4물품본청영도대교 일원 수변 산책로 난간조명 설치 공사 관급자재(난간)1036800002021-06-17주식회사 국제에스티2
5물품본청영도대교 일원 수변 산책로 난간조명 설치 공사 관급자재(제어반, 가로등점89400002021-06-11부국전자주식회사2
6물품본청재난대비 주민행동요령 책자 제작15000002017-05-16해양문화사2
7물품시설관리사업소닑다작은도서관 리모델링 전기공사 조명기구 관급자재 구입105140002019-12-19주식회사 유환2