Overview

Dataset statistics

Number of variables6
Number of observations546
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory26.3 KiB
Average record size in memory49.2 B

Variable types

Text2
Numeric1
DateTime1
Categorical2

Dataset

Description사립학교교직원연금공단 구매현황리스트와 관련된 데이터로 계약명, 업체명, 금액(천원), 계약시작일자, 계약방법, 담당부서 항목의 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/15064938/fileData.do

Alerts

금액(천원) is highly overall correlated with 계약방법High correlation
계약방법 is highly overall correlated with 금액(천원)High correlation
계약방법 is highly imbalanced (57.6%)Imbalance

Reproduction

Analysis started2023-12-12 05:23:24.687971
Analysis finished2023-12-12 05:23:25.690709
Duration1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct514
Distinct (%)94.1%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2023-12-12T14:23:25.906168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length30.5
Mean length17.540293
Min length4

Characters and Unicode

Total characters9577
Distinct characters431
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique492 ?
Unique (%)90.1%

Sample

1st row업무용 소프트웨어 한글 2020 구매
2nd row2022년도 연금인상내역 안내문 제작
3rd rowTP 교육센터 구축 효율성 제고를 위한 종합컨설팅 용역 변경 계약
4th row2021회계연도 기금운용실적보고서 제작
5th row2022년도 상반기 연금수급자 안내책자 행복든든 길라잡이 제작
ValueCountFrequency (%)
구매 34
 
3.8%
제작 25
 
2.8%
22
 
2.5%
업무용 11
 
1.2%
디지털 9
 
1.0%
재구축 9
 
1.0%
2022년도 9
 
1.0%
홈페이지 8
 
0.9%
관련보안장비 8
 
0.9%
sw 8
 
0.9%
Other values (621) 747
83.9%
2023-12-12T14:23:26.370043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
344
 
3.6%
286
 
3.0%
242
 
2.5%
240
 
2.5%
196
 
2.0%
194
 
2.0%
2 184
 
1.9%
158
 
1.6%
151
 
1.6%
139
 
1.5%
Other values (421) 7443
77.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8131
84.9%
Decimal Number 531
 
5.5%
Space Separator 344
 
3.6%
Uppercase Letter 245
 
2.6%
Lowercase Letter 101
 
1.1%
Open Punctuation 83
 
0.9%
Close Punctuation 83
 
0.9%
Other Punctuation 40
 
0.4%
Dash Punctuation 12
 
0.1%
Math Symbol 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
286
 
3.5%
242
 
3.0%
240
 
3.0%
196
 
2.4%
194
 
2.4%
158
 
1.9%
151
 
1.9%
139
 
1.7%
139
 
1.7%
126
 
1.5%
Other values (357) 6260
77.0%
Uppercase Letter
ValueCountFrequency (%)
C 44
18.0%
P 36
14.7%
S 34
13.9%
T 20
8.2%
I 14
 
5.7%
B 12
 
4.9%
W 11
 
4.5%
E 10
 
4.1%
N 9
 
3.7%
V 8
 
3.3%
Other values (12) 47
19.2%
Lowercase Letter
ValueCountFrequency (%)
e 16
15.8%
n 11
10.9%
o 9
8.9%
s 8
 
7.9%
t 7
 
6.9%
r 7
 
6.9%
a 7
 
6.9%
m 6
 
5.9%
i 5
 
5.0%
l 4
 
4.0%
Other values (11) 21
20.8%
Decimal Number
ValueCountFrequency (%)
2 184
34.7%
0 121
22.8%
1 107
20.2%
8 29
 
5.5%
3 26
 
4.9%
9 17
 
3.2%
6 17
 
3.2%
4 16
 
3.0%
5 7
 
1.3%
7 7
 
1.3%
Other Punctuation
ValueCountFrequency (%)
, 10
25.0%
· 7
17.5%
; 7
17.5%
& 7
17.5%
# 7
17.5%
/ 2
 
5.0%
Space Separator
ValueCountFrequency (%)
344
100.0%
Open Punctuation
ValueCountFrequency (%)
( 83
100.0%
Close Punctuation
ValueCountFrequency (%)
) 83
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8131
84.9%
Common 1100
 
11.5%
Latin 346
 
3.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
286
 
3.5%
242
 
3.0%
240
 
3.0%
196
 
2.4%
194
 
2.4%
158
 
1.9%
151
 
1.9%
139
 
1.7%
139
 
1.7%
126
 
1.5%
Other values (357) 6260
77.0%
Latin
ValueCountFrequency (%)
C 44
 
12.7%
P 36
 
10.4%
S 34
 
9.8%
T 20
 
5.8%
e 16
 
4.6%
I 14
 
4.0%
B 12
 
3.5%
W 11
 
3.2%
n 11
 
3.2%
E 10
 
2.9%
Other values (33) 138
39.9%
Common
ValueCountFrequency (%)
344
31.3%
2 184
16.7%
0 121
 
11.0%
1 107
 
9.7%
( 83
 
7.5%
) 83
 
7.5%
8 29
 
2.6%
3 26
 
2.4%
9 17
 
1.5%
6 17
 
1.5%
Other values (11) 89
 
8.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8126
84.8%
ASCII 1439
 
15.0%
None 7
 
0.1%
Compat Jamo 5
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
344
23.9%
2 184
12.8%
0 121
 
8.4%
1 107
 
7.4%
( 83
 
5.8%
) 83
 
5.8%
C 44
 
3.1%
P 36
 
2.5%
S 34
 
2.4%
8 29
 
2.0%
Other values (53) 374
26.0%
Hangul
ValueCountFrequency (%)
286
 
3.5%
242
 
3.0%
240
 
3.0%
196
 
2.4%
194
 
2.4%
158
 
1.9%
151
 
1.9%
139
 
1.7%
139
 
1.7%
126
 
1.6%
Other values (356) 6255
77.0%
None
ValueCountFrequency (%)
· 7
100.0%
Compat Jamo
ValueCountFrequency (%)
5
100.0%
Distinct348
Distinct (%)63.7%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2023-12-12T14:23:26.643500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length26
Mean length8.6739927
Min length3

Characters and Unicode

Total characters4736
Distinct characters322
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique263 ?
Unique (%)48.2%

Sample

1st row(주)블루포트
2nd row(주)우연시스템
3rd row(주)엠씨미디어솔루션
4th row(사)남북장애인교류협회 인쇄사업부
5th row(사)장애인생산품판매지원협회인쇄사업
ValueCountFrequency (%)
주식회사 24
 
4.1%
디지털oa센터 11
 
1.9%
정이디자인 10
 
1.7%
오티스엘리베이터(유 8
 
1.4%
주)제니엘 8
 
1.4%
주)진두아이에스 8
 
1.4%
주)에이텍 8
 
1.4%
사)장애인생산품판매지원협회인쇄사업 7
 
1.2%
나주사무용가구 7
 
1.2%
주)비츠코리아 7
 
1.2%
Other values (343) 482
83.1%
2023-12-12T14:23:27.061339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
386
 
8.2%
) 358
 
7.6%
( 351
 
7.4%
181
 
3.8%
154
 
3.3%
126
 
2.7%
117
 
2.5%
108
 
2.3%
104
 
2.2%
68
 
1.4%
Other values (312) 2783
58.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3873
81.8%
Close Punctuation 358
 
7.6%
Open Punctuation 351
 
7.4%
Uppercase Letter 69
 
1.5%
Space Separator 34
 
0.7%
Other Symbol 16
 
0.3%
Decimal Number 16
 
0.3%
Other Punctuation 14
 
0.3%
Dash Punctuation 2
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
386
 
10.0%
181
 
4.7%
154
 
4.0%
126
 
3.3%
117
 
3.0%
108
 
2.8%
104
 
2.7%
68
 
1.8%
64
 
1.7%
63
 
1.6%
Other values (284) 2502
64.6%
Uppercase Letter
ValueCountFrequency (%)
A 14
20.3%
O 12
17.4%
C 10
14.5%
S 10
14.5%
K 7
10.1%
L 6
8.7%
N 3
 
4.3%
G 2
 
2.9%
E 2
 
2.9%
H 2
 
2.9%
Decimal Number
ValueCountFrequency (%)
4 4
25.0%
5 3
18.8%
1 2
12.5%
0 2
12.5%
3 2
12.5%
6 2
12.5%
2 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 12
85.7%
& 2
 
14.3%
Lowercase Letter
ValueCountFrequency (%)
o 1
50.0%
a 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 358
100.0%
Open Punctuation
ValueCountFrequency (%)
( 351
100.0%
Space Separator
ValueCountFrequency (%)
34
100.0%
Other Symbol
ValueCountFrequency (%)
16
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3889
82.1%
Common 776
 
16.4%
Latin 71
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
386
 
9.9%
181
 
4.7%
154
 
4.0%
126
 
3.2%
117
 
3.0%
108
 
2.8%
104
 
2.7%
68
 
1.7%
64
 
1.6%
63
 
1.6%
Other values (285) 2518
64.7%
Common
ValueCountFrequency (%)
) 358
46.1%
( 351
45.2%
34
 
4.4%
, 12
 
1.5%
4 4
 
0.5%
5 3
 
0.4%
& 2
 
0.3%
1 2
 
0.3%
- 2
 
0.3%
0 2
 
0.3%
Other values (4) 6
 
0.8%
Latin
ValueCountFrequency (%)
A 14
19.7%
O 12
16.9%
C 10
14.1%
S 10
14.1%
K 7
9.9%
L 6
8.5%
N 3
 
4.2%
G 2
 
2.8%
E 2
 
2.8%
H 2
 
2.8%
Other values (3) 3
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3873
81.8%
ASCII 847
 
17.9%
None 16
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
386
 
10.0%
181
 
4.7%
154
 
4.0%
126
 
3.3%
117
 
3.0%
108
 
2.8%
104
 
2.7%
68
 
1.8%
64
 
1.7%
63
 
1.6%
Other values (284) 2502
64.6%
ASCII
ValueCountFrequency (%)
) 358
42.3%
( 351
41.4%
34
 
4.0%
A 14
 
1.7%
O 12
 
1.4%
, 12
 
1.4%
C 10
 
1.2%
S 10
 
1.2%
K 7
 
0.8%
L 6
 
0.7%
Other values (17) 33
 
3.9%
None
ValueCountFrequency (%)
16
100.0%

금액(천원)
Real number (ℝ)

HIGH CORRELATION 

Distinct445
Distinct (%)81.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean137090.48
Minimum0
Maximum11704459
Zeros2
Zeros (%)0.4%
Negative0
Negative (%)0.0%
Memory size4.9 KiB
2023-12-12T14:23:27.206145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile10886.5
Q117254.25
median22400
Q344650
95-th percentile463273
Maximum11704459
Range11704459
Interquartile range (IQR)27395.75

Descriptive statistics

Standard deviation688637.69
Coefficient of variation (CV)5.0232349
Kurtosis178.12045
Mean137090.48
Median Absolute Deviation (MAD)7450
Skewness12.2383
Sum74851403
Variance4.7422187 × 1011
MonotonicityNot monotonic
2023-12-12T14:23:27.387799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
19500 10
 
1.8%
19800 7
 
1.3%
29000 6
 
1.1%
20000 5
 
0.9%
22000 5
 
0.9%
11000 4
 
0.7%
28500 4
 
0.7%
26000 4
 
0.7%
15000 4
 
0.7%
18000 4
 
0.7%
Other values (435) 493
90.3%
ValueCountFrequency (%)
0 2
0.4%
1295 1
0.2%
3637 1
0.2%
7097 1
0.2%
9800 1
0.2%
10000 1
0.2%
10188 1
0.2%
10296 1
0.2%
10318 1
0.2%
10362 1
0.2%
ValueCountFrequency (%)
11704459 1
0.2%
7854000 1
0.2%
3899227 1
0.2%
3200367 2
0.4%
3160335 1
0.2%
2490207 1
0.2%
2042916 1
0.2%
1162920 1
0.2%
1070167 1
0.2%
1018142 1
0.2%
Distinct424
Distinct (%)77.7%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
Minimum2007-03-01 00:00:00
Maximum2022-12-16 00:00:00
2023-12-12T14:23:27.549344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:23:27.714104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

계약방법
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct38
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
수의계약
370 
조달구매
 
35
제3자단가계약(조달구매)
 
27
제한경쟁
 
18
수의
 
17
Other values (33)
79 

Length

Max length22
Median length4
Mean length4.8498168
Min length2

Unique

Unique16 ?
Unique (%)2.9%

Sample

1st row조달구매
2nd row수의계약
3rd row수의계약
4th row수의계약
5th row수의계약

Common Values

ValueCountFrequency (%)
수의계약 370
67.8%
조달구매 35
 
6.4%
제3자단가계약(조달구매) 27
 
4.9%
제한경쟁 18
 
3.3%
수의 17
 
3.1%
재계약 15
 
2.7%
수의계약(재계약) 8
 
1.5%
조달청 4
 
0.7%
수의계약(지속적) 4
 
0.7%
수의게약 3
 
0.5%
Other values (28) 45
 
8.2%

Length

2023-12-12T14:23:27.961539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수의계약 370
65.3%
조달구매 35
 
6.2%
제3자단가계약(조달구매 27
 
4.8%
제한경쟁 18
 
3.2%
수의 17
 
3.0%
재계약 15
 
2.6%
수의계약(재계약 8
 
1.4%
협상에 5
 
0.9%
의한 5
 
0.9%
수의계약(지속적 4
 
0.7%
Other values (37) 63
 
11.1%

담당부서
Categorical

Distinct24
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
행정지원팀
226 
총무팀
114 
총무부
75 
서울지부
55 
사업개발팀
 
16
Other values (19)
60 

Length

Max length6
Median length5
Mean length4.1501832
Min length3

Unique

Unique8 ?
Unique (%)1.5%

Sample

1st row행정지원팀
2nd row행정지원팀
3rd row행정지원팀
4th row행정지원팀
5th row행정지원팀

Common Values

ValueCountFrequency (%)
행정지원팀 226
41.4%
총무팀 114
20.9%
총무부 75
 
13.7%
서울지부 55
 
10.1%
사업개발팀 16
 
2.9%
호남지부 9
 
1.6%
서울회관 9
 
1.6%
정보시스템부 7
 
1.3%
정보지원실 6
 
1.1%
고객센터 4
 
0.7%
Other values (14) 25
 
4.6%

Length

2023-12-12T14:23:28.107126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
행정지원팀 226
41.4%
총무팀 114
20.9%
총무부 75
 
13.7%
서울지부 55
 
10.1%
사업개발팀 16
 
2.9%
호남지부 9
 
1.6%
서울회관 9
 
1.6%
정보시스템부 7
 
1.3%
정보지원실 6
 
1.1%
고객센터 4
 
0.7%
Other values (14) 25
 
4.6%

Interactions

2023-12-12T14:23:25.263087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:23:28.201565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
금액(천원)계약방법담당부서
금액(천원)1.0000.9090.769
계약방법0.9091.0000.835
담당부서0.7690.8351.000
2023-12-12T14:23:28.305326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
담당부서계약방법
담당부서1.0000.338
계약방법0.3381.000
2023-12-12T14:23:28.433502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
금액(천원)계약방법담당부서
금액(천원)1.0000.6610.421
계약방법0.6611.0000.338
담당부서0.4210.3381.000

Missing values

2023-12-12T14:23:25.457956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:23:25.626439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

계약명업체명금액(천원)계약시작일자계약방법담당부서
0업무용 소프트웨어 한글 2020 구매(주)블루포트202292022-01-19조달구매행정지원팀
12022년도 연금인상내역 안내문 제작(주)우연시스템132002022-01-28수의계약행정지원팀
2TP 교육센터 구축 효율성 제고를 위한 종합컨설팅 용역 변경 계약(주)엠씨미디어솔루션135002022-02-21수의계약행정지원팀
32021회계연도 기금운용실적보고서 제작(사)남북장애인교류협회 인쇄사업부104502022-02-28수의계약행정지원팀
42022년도 상반기 연금수급자 안내책자 행복든든 길라잡이 제작(사)장애인생산품판매지원협회인쇄사업191952022-03-11수의계약행정지원팀
52021년도 경영실적보고서 제작정이디자인192172022-03-11수의계약행정지원팀
62022 TP 홍보 브로슈어 리뉴얼 제작큐라인134552022-04-11수의계약행정지원팀
7ESG 인권 경영을 위한 업무용 녹취 시스템 구축 용역(주)두루스코485902022-04-15수의계약행정지원팀
82022년도 노후화 PC 교체에 따른 구매(주)에이텍460562022-04-19조달구매행정지원팀
9탄성포장재 구매주식회사 한국공원체육산업258402022-06-20조달구매행정지원팀
계약명업체명금액(천원)계약시작일자계약방법담당부서
536기계경비용역계약(주)캡스211202010-03-01수의영남지부
537인경비주차용역계약(주)유니에스2731562010-03-01수의영남지부
538구내교환기유지보수용역대신통신기술(주)238922010-03-01수의계약(지속적)서울지부
539승강기유지보수용역오티스엘리베이터(유)267172010-03-01수의계약(지속적)서울지부
540기계경비용역(주)에스원(4년차)2767512007-03-01경쟁입찰서울지부
541경비·주차용역(주)한덕엔지니어링(4년차)2767512010-03-01경쟁입찰서울지부
542승강기설비일부부품교체작업오티스엘리베이터(유)432962010-02-25수의호남지부
543고객센터상담실장비구입및증설공사(주)케이티네트웍스728002010-02-25수의호남지부
544시설ㆍ청소ㆍ주차관리용역(주)태성공사5613702010-01-01수의호남지부
545시설및청소용역계약(주)삼우통상6395002010-01-01수의영남지부