Overview

Dataset statistics

Number of variables5
Number of observations54
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory43.4 B

Variable types

Numeric1
Categorical1
Text2
DateTime1

Dataset

Description한국보건복지인재원의 발주계획 데이터로 순번, 사업구분, 사업예산, 사업명, 발주예정 연월 내용의 데이터를 제공하고 있습니다.
Author한국보건복지인재원
URLhttps://www.data.go.kr/data/15105292/fileData.do

Alerts

사업구분 is highly imbalanced (59.3%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:06:45.830278
Analysis finished2023-12-12 17:06:46.500418
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct54
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.5
Minimum1
Maximum54
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size618.0 B
2023-12-13T02:06:46.597797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.65
Q114.25
median27.5
Q340.75
95-th percentile51.35
Maximum54
Range53
Interquartile range (IQR)26.5

Descriptive statistics

Standard deviation15.732133
Coefficient of variation (CV)0.57207755
Kurtosis-1.2
Mean27.5
Median Absolute Deviation (MAD)13.5
Skewness0
Sum1485
Variance247.5
MonotonicityStrictly increasing
2023-12-13T02:06:46.790078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.9%
42 1
 
1.9%
31 1
 
1.9%
32 1
 
1.9%
33 1
 
1.9%
34 1
 
1.9%
35 1
 
1.9%
36 1
 
1.9%
37 1
 
1.9%
38 1
 
1.9%
Other values (44) 44
81.5%
ValueCountFrequency (%)
1 1
1.9%
2 1
1.9%
3 1
1.9%
4 1
1.9%
5 1
1.9%
6 1
1.9%
7 1
1.9%
8 1
1.9%
9 1
1.9%
10 1
1.9%
ValueCountFrequency (%)
54 1
1.9%
53 1
1.9%
52 1
1.9%
51 1
1.9%
50 1
1.9%
49 1
1.9%
48 1
1.9%
47 1
1.9%
46 1
1.9%
45 1
1.9%

사업구분
Categorical

IMBALANCE 

Distinct4
Distinct (%)7.4%
Missing0
Missing (%)0.0%
Memory size564.0 B
용역
45 
물품
임차
 
1
공사
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique2 ?
Unique (%)3.7%

Sample

1st row용역
2nd row용역
3rd row용역
4th row용역
5th row임차

Common Values

ValueCountFrequency (%)
용역 45
83.3%
물품 7
 
13.0%
임차 1
 
1.9%
공사 1
 
1.9%

Length

2023-12-13T02:06:46.932508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:06:47.057228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
용역 45
83.3%
물품 7
 
13.0%
임차 1
 
1.9%
공사 1
 
1.9%
Distinct35
Distinct (%)64.8%
Missing0
Missing (%)0.0%
Memory size564.0 B
2023-12-13T02:06:47.260383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length7.8333333
Min length2

Characters and Unicode

Total characters423
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)44.4%

Sample

1st row10000000
2nd row12378300
3rd row504300000
4th row437590000
5th row12000000
ValueCountFrequency (%)
12000000 4
 
7.4%
20000000 4
 
7.4%
10000000 3
 
5.6%
15000000 3
 
5.6%
미정 3
 
5.6%
5000000 3
 
5.6%
150000000 2
 
3.7%
70000000 2
 
3.7%
100000000 2
 
3.7%
19250000 2
 
3.7%
Other values (25) 26
48.1%
2023-12-13T02:06:47.636841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 304
71.9%
1 28
 
6.6%
2 20
 
4.7%
5 17
 
4.0%
7 11
 
2.6%
3 11
 
2.6%
4 8
 
1.9%
9 7
 
1.7%
6 6
 
1.4%
8 5
 
1.2%
Other values (2) 6
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 417
98.6%
Other Letter 6
 
1.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 304
72.9%
1 28
 
6.7%
2 20
 
4.8%
5 17
 
4.1%
7 11
 
2.6%
3 11
 
2.6%
4 8
 
1.9%
9 7
 
1.7%
6 6
 
1.4%
8 5
 
1.2%
Other Letter
ValueCountFrequency (%)
3
50.0%
3
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 417
98.6%
Hangul 6
 
1.4%

Most frequent character per script

Common
ValueCountFrequency (%)
0 304
72.9%
1 28
 
6.7%
2 20
 
4.8%
5 17
 
4.1%
7 11
 
2.6%
3 11
 
2.6%
4 8
 
1.9%
9 7
 
1.7%
6 6
 
1.4%
8 5
 
1.2%
Hangul
ValueCountFrequency (%)
3
50.0%
3
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 417
98.6%
Hangul 6
 
1.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 304
72.9%
1 28
 
6.7%
2 20
 
4.8%
5 17
 
4.1%
7 11
 
2.6%
3 11
 
2.6%
4 8
 
1.9%
9 7
 
1.7%
6 6
 
1.4%
8 5
 
1.2%
Hangul
ValueCountFrequency (%)
3
50.0%
3
50.0%
Distinct52
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Memory size564.0 B
2023-12-13T02:06:47.937420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length28
Mean length20.240741
Min length5

Characters and Unicode

Total characters1093
Distinct characters250
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)92.6%

Sample

1st row2022년도 공공기관 고객만족도 조사
2nd row2023년 보건산업교육실 기자재 유지·관리
3rd row2023~2024년 이러닝 콘텐츠 확충 및 운영지원 사업
4th row2023년 이러닝 콘텐츠 개발 및 수정사업
5th row웹메일클라우드
ValueCountFrequency (%)
2023년 14
 
6.0%
개발 12
 
5.2%
7
 
3.0%
콘텐츠 6
 
2.6%
용역 5
 
2.2%
운영 4
 
1.7%
사업 4
 
1.7%
프로그램 3
 
1.3%
구매 3
 
1.3%
이러닝콘텐츠 3
 
1.3%
Other values (144) 171
73.7%
2023-12-13T02:06:48.420365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
180
 
16.5%
2 36
 
3.3%
0 20
 
1.8%
20
 
1.8%
19
 
1.7%
3 18
 
1.6%
17
 
1.6%
17
 
1.6%
16
 
1.5%
16
 
1.5%
Other values (240) 734
67.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 742
67.9%
Space Separator 180
 
16.5%
Decimal Number 82
 
7.5%
Uppercase Letter 48
 
4.4%
Lowercase Letter 13
 
1.2%
Open Punctuation 10
 
0.9%
Close Punctuation 10
 
0.9%
Other Punctuation 5
 
0.5%
Dash Punctuation 2
 
0.2%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
2.7%
19
 
2.6%
17
 
2.3%
17
 
2.3%
16
 
2.2%
16
 
2.2%
15
 
2.0%
14
 
1.9%
14
 
1.9%
13
 
1.8%
Other values (195) 581
78.3%
Uppercase Letter
ValueCountFrequency (%)
I 6
12.5%
H 6
12.5%
D 5
10.4%
V 5
10.4%
O 5
10.4%
R 4
8.3%
K 3
6.2%
S 3
6.2%
C 2
 
4.2%
P 2
 
4.2%
Other values (7) 7
14.6%
Lowercase Letter
ValueCountFrequency (%)
o 2
15.4%
l 1
7.7%
a 1
7.7%
t 1
7.7%
f 1
7.7%
m 1
7.7%
r 1
7.7%
s 1
7.7%
w 1
7.7%
d 1
7.7%
Other values (2) 2
15.4%
Decimal Number
ValueCountFrequency (%)
2 36
43.9%
0 20
24.4%
3 18
22.0%
1 4
 
4.9%
5 2
 
2.4%
9 1
 
1.2%
4 1
 
1.2%
Other Punctuation
ValueCountFrequency (%)
? 2
40.0%
/ 1
20.0%
· 1
20.0%
, 1
20.0%
Space Separator
ValueCountFrequency (%)
180
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 742
67.9%
Common 290
 
26.5%
Latin 61
 
5.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
2.7%
19
 
2.6%
17
 
2.3%
17
 
2.3%
16
 
2.2%
16
 
2.2%
15
 
2.0%
14
 
1.9%
14
 
1.9%
13
 
1.8%
Other values (195) 581
78.3%
Latin
ValueCountFrequency (%)
I 6
 
9.8%
H 6
 
9.8%
D 5
 
8.2%
V 5
 
8.2%
O 5
 
8.2%
R 4
 
6.6%
K 3
 
4.9%
S 3
 
4.9%
C 2
 
3.3%
o 2
 
3.3%
Other values (19) 20
32.8%
Common
ValueCountFrequency (%)
180
62.1%
2 36
 
12.4%
0 20
 
6.9%
3 18
 
6.2%
( 10
 
3.4%
) 10
 
3.4%
1 4
 
1.4%
- 2
 
0.7%
? 2
 
0.7%
5 2
 
0.7%
Other values (6) 6
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 742
67.9%
ASCII 350
32.0%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
180
51.4%
2 36
 
10.3%
0 20
 
5.7%
3 18
 
5.1%
( 10
 
2.9%
) 10
 
2.9%
I 6
 
1.7%
H 6
 
1.7%
D 5
 
1.4%
V 5
 
1.4%
Other values (34) 54
 
15.4%
Hangul
ValueCountFrequency (%)
20
 
2.7%
19
 
2.6%
17
 
2.3%
17
 
2.3%
16
 
2.2%
16
 
2.2%
15
 
2.0%
14
 
1.9%
14
 
1.9%
13
 
1.8%
Other values (195) 581
78.3%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct3
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size564.0 B
Minimum2023-01-01 00:00:00
Maximum2023-03-01 00:00:00
2023-12-13T02:06:48.553454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:06:48.663406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=3)

Interactions

2023-12-13T02:06:46.220967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:06:48.754229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업구분사업예산사업명발주예정연월
순번1.0000.5550.5010.8460.895
사업구분0.5551.0000.7201.0000.000
사업예산0.5010.7201.0000.9920.815
사업명0.8461.0000.9921.0001.000
발주예정연월0.8950.0000.8151.0001.000
2023-12-13T02:06:48.876278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업구분
순번1.0000.339
사업구분0.3391.000

Missing values

2023-12-13T02:06:46.345605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:06:46.456309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번사업구분사업예산사업명발주예정연월
01용역100000002022년도 공공기관 고객만족도 조사2023-01-01
12용역123783002023년 보건산업교육실 기자재 유지·관리2023-01-01
23용역5043000002023~2024년 이러닝 콘텐츠 확충 및 운영지원 사업2023-01-01
34용역4375900002023년 이러닝 콘텐츠 개발 및 수정사업2023-01-01
45임차12000000웹메일클라우드2023-02-01
56용역20000000공공데이터 사업 컨설팅2023-02-01
67물품7260000VDI용 Windows 10 1년 라이선스 갱신2023-02-01
78물품26237740V3라이선스 갱신2023-02-01
89물품33178200PMS(패치관리시스템) 구매2023-02-01
910물품16000000L3 구매2023-02-01
순번사업구분사업예산사업명발주예정연월
4445용역40000000아동안전교육 교구 제작2023-03-01
4546물품미정2023년 기관 홍보물품 통합 구매2023-03-01
4647용역1100000002023년 KOHI 교육컨설팅 사업2023-03-01
4748용역650000002023년 KOHI 현업적용도 및 성과기여도 사업2023-03-01
4849용역19000000긴급복지지원소진예방교육 비대면 소진예방 프로그램 위탁 운영2023-03-01
4950용역9000000긴급복지지원소진예방교육 교보재 발송 용역2023-03-01
5051용역198000002023년 긴급복지지원교육사업 이러닝콘텐츠 개발2023-03-01
5152용역120000002023년 기초연금교육사업 이러닝콘텐츠 개발2023-03-01
5253용역149000000VR 콘텐츠 개발2023-03-01
5354용역19250000제2기 역학조사관 기본교육 강의장 임차2023-03-01