Overview

Dataset statistics

Number of variables: 10
Number of observations: 10000
Missing cells: 174
Missing cells (%): 0.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 878.9 KiB
Average record size in memory: 90.0 B
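
The two derived figures above (missing-cell percentage and average record size) can be checked by hand. The following is a minimal sketch that assumes the percentage is taken over all cells (variables × observations) and that 1 KiB = 1024 bytes; the variable names are illustrative and are not part of the report.

```python
# Values copied from the dataset statistics above.
n_variables = 10
n_observations = 10_000
missing_cells = 174
total_size_kib = 878.9

# Assumption: the missing percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = missing_cells / total_cells * 100

# Assumption: KiB means 1024 bytes, averaged over all observations.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the script reproduces the rounded values 0.2% and 90.0 B.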

Variable types

Categorical: 3
Text: 2
Numeric: 1
DateTime: 3
Boolean: 1

Dataset

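Description: This dataset provides fields such as review status, project type, and review request information for the senior employment programs operated by the Korea Labor Force Development Institute for the Aged (한국노인인력개발원).
Author: 한국노인인력개발원 (Korea Labor Force Development Institute for the Aged)
URL: https://www.data.go.kr/data/15050145/fileData.do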

Alerts

사업년도 has constant value "2023" (Constant)
삭제여부 is highly overall correlated with 심사상태코드 (High correlation)
심사상태코드 is highly overall correlated with 삭제여부 (High correlation)
심사상태코드 is highly imbalanced (85.7%) (Imbalance)
삭제여부 is highly imbalanced (82.2%) (Imbalance)
심의종료일 has 174 (1.7%) missing values (Missing)
사업번호 has unique values (Unique)
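
The imbalance percentages in the alerts can be reproduced from the value counts reported for 심사상태코드 and 삭제여부. The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by the maximum possible entropy for that number of classes. This definition matches the reported 85.7% and 82.2%, but treating it as the exact formula used by the profiling tool is an assumption; `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(n_classes)

# Counts copied from the variable sections of this report.
status_counts = [9504, 221, 169, 88, 11, 7]  # 심사상태코드
deleted_counts = [9732, 268]                 # 삭제여부

print(f"심사상태코드: {imbalance_score(status_counts):.1%}")
print(f"삭제여부: {imbalance_score(deleted_counts):.1%}")
```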

Reproduction

Analysis started: 2023-12-12 00:52:18.696648
Analysis finished: 2023-12-12 00:52:20.626352
Duration: 1.93 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
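
The reported duration is simply the difference between the two timestamps. A small sketch using only the standard library, with the timestamps copied from the lines above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-12 00:52:18.696648")
finished = datetime.fromisoformat("2023-12-12 00:52:20.626352")

duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```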

Variables

사업유형
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
공익활동형: 5677
사회서비스형: 2445
시장형: 1878

Length

Max length: 6
Median length: 5
Mean length: 4.8689
Min length: 3
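
These length statistics follow from the three category labels and their counts. The sketch below recomputes them, assuming length is measured in Unicode code points (Python's `len`) and that the labels are stored as precomposed Hangul syllables; both points are assumptions about how the report measures length.

```python
import statistics

# Category labels and counts copied from the 사업유형 section.
counts = {"공익활동형": 5677, "사회서비스형": 2445, "시장형": 1878}

# Expand to one length per observation (10,000 integers in total).
lengths = [len(label) for label, c in counts.items() for _ in range(c)]

mean_length = sum(lengths) / len(lengths)
median_length = statistics.median(lengths)

print(max(lengths), median_length, mean_length, min(lengths))
```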

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사회서비스형
2nd row: 시장형
3rd row: 공익활동형
4th row: 시장형
5th row: 사회서비스형

Common Values

Value | Count | Frequency (%)
공익활동형 | 5677 | 56.8%
사회서비스형 | 2445 | 24.4%
시장형 | 1878 | 18.8%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the value counts listed under Common Values]

사업번호
Text

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 100000
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
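
As an illustration of these character properties, the sketch below classifies the characters of the five sample 사업번호 values shown in this report using Python's standard `unicodedata` module. It covers only those five sample rows, not the full column, so the counts are not the report's totals.

```python
import unicodedata
from collections import Counter

# The five sample rows listed for 사업번호 in this report.
sample_ids = ["2023-09583", "2023-05582", "2023-06177", "2023-00555", "2023-07556"]

# General category per character: "Nd" = Decimal Number, "Pd" = Dash Punctuation.
category_counts = Counter(
    unicodedata.category(ch) for project_id in sample_ids for ch in project_id
)

for category, count in category_counts.items():
    print(category, count)
```

Each identifier contributes nine decimal digits and one dash, which is consistent with the 90% / 10% split between Decimal Number and Dash Punctuation reported for the whole column.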

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: 2023-09583
2nd row: 2023-05582
3rd row: 2023-06177
4th row: 2023-00555
5th row: 2023-07556

Value | Count | Frequency (%)
2023-09583 | 1 | < 0.1%
2023-01089 | 1 | < 0.1%
2023-06637 | 1 | < 0.1%
2023-07912 | 1 | < 0.1%
2023-03776 | 1 | < 0.1%
2023-10937 | 1 | < 0.1%
2023-00051 | 1 | < 0.1%
2023-04698 | 1 | < 0.1%
2023-06805 | 1 | < 0.1%
2023-07411 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Most occurring characters

Value | Count | Frequency (%)
2 | 23928 | 23.9%
0 | 23071 | 23.1%
3 | 13972 | 14.0%
- | 10000 | 10.0%
1 | 5472 | 5.5%
7 | 3986 | 4.0%
6 | 3955 | 4.0%
4 | 3921 | 3.9%
8 | 3915 | 3.9%
9 | 3891 | 3.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 90000 | 90.0%
Dash Punctuation | 10000 | 10.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 23928 | 26.6%
0 | 23071 | 25.6%
3 | 13972 | 15.5%
1 | 5472 | 6.1%
7 | 3986 | 4.4%
6 | 3955 | 4.4%
4 | 3921 | 4.4%
8 | 3915 | 4.3%
9 | 3891 | 4.3%
5 | 3889 | 4.3%

Dash Punctuation
Value | Count | Frequency (%)
- | 10000 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 100000 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
2 | 23928 | 23.9%
0 | 23071 | 23.1%
3 | 13972 | 14.0%
- | 10000 | 10.0%
1 | 5472 | 5.5%
7 | 3986 | 4.0%
6 | 3955 | 4.0%
4 | 3921 | 3.9%
8 | 3915 | 3.9%
9 | 3891 | 3.9%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 100000 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
2 | 23928 | 23.9%
0 | 23071 | 23.1%
3 | 13972 | 14.0%
- | 10000 | 10.0%
1 | 5472 | 5.5%
7 | 3986 | 4.0%
6 | 3955 | 4.0%
4 | 3921 | 3.9%
8 | 3915 | 3.9%
9 | 3891 | 3.9%

사업계획변경순번
Real number (ℝ)

Distinct: 12
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.2302
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
median: 2
Q3: 3
95-th percentile: 5
Maximum: 12
Range: 11
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.2723089
Coefficient of variation (CV): 0.57049092
Kurtosis: 3.1236224
Mean: 2.2302
Median Absolute Deviation (MAD): 1
Skewness: 1.4349936
Sum: 22302
Variance: 1.6187698
Monotonicity: Not monotonic
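
Because 사업계획변경순번 takes only twelve distinct integer values, several of these statistics can be recomputed from the value counts reported for this variable. The sketch below assumes the variance is the sample variance (divisor n - 1), which is what reproduces the reported figure; the dictionary `freq` is transcribed from the frequency tables in this section.

```python
import math

# value -> count, transcribed from the frequency tables for 사업계획변경순번.
freq = {1: 3305, 2: 3450, 3: 1823, 4: 850, 5: 345, 6: 139,
        7: 58, 8: 17, 9: 7, 10: 3, 11: 2, 12: 1}

n = sum(freq.values())
total = sum(value * count for value, count in freq.items())
mean = total / n

# Assumption: sample variance with Bessel's correction (divide by n - 1).
variance = sum(count * (value - mean) ** 2 for value, count in freq.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean

print(f"n={n} sum={total} mean={mean:.4f}")
print(f"variance={variance:.7f} std={std:.7f} cv={cv:.8f}")
```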
[Figure: Histogram with fixed size bins (bins=12)]

Value | Count | Frequency (%)
2 | 3450 | 34.5%
1 | 3305 | 33.1%
3 | 1823 | 18.2%
4 | 850 | 8.5%
5 | 345 | 3.5%
6 | 139 | 1.4%
7 | 58 | 0.6%
8 | 17 | 0.2%
9 | 7 | 0.1%
10 | 3 | < 0.1%
Other values (2) | 3 | < 0.1%

Value | Count | Frequency (%)
1 | 3305 | 33.1%
2 | 3450 | 34.5%
3 | 1823 | 18.2%
4 | 850 | 8.5%
5 | 345 | 3.5%
6 | 139 | 1.4%
7 | 58 | 0.6%
8 | 17 | 0.2%
9 | 7 | 0.1%
10 | 3 | < 0.1%

Value | Count | Frequency (%)
12 | 1 | < 0.1%
11 | 2 | < 0.1%
10 | 3 | < 0.1%
9 | 7 | 0.1%
8 | 17 | 0.2%
7 | 58 | 0.6%
6 | 139 | 1.4%
5 | 345 | 3.5%
4 | 850 | 8.5%
3 | 1823 | 18.2%

사업년도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
2023: 10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023
2nd row: 2023
3rd row: 2023
4th row: 2023
5th row: 2023

Common Values

Value | Count | Frequency (%)
2023 | 10000 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the value counts listed under Common Values]

사업명
Text

Distinct: 7707
Distinct (%): 77.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 40
Median length: 36
Mean length: 9.4875
Min length: 2

Characters and Unicode

Total characters: 94875
Distinct characters: 854
Distinct categories: 16
Distinct scripts: 4
Distinct blocks: 10
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6962
Unique (%): 69.6%

Sample

1st row: 상담지원서비스
2nd row: 백년청춘문화예술단
3rd row: 지역환경지킴이
4th row: 카페 향
5th row: 시니어금융업무지원

Value | Count | Frequency (%)
노노케어 | 249 | 1.8%
 | 187 | 1.3%
시니어 | 173 | 1.2%
경로당 | 132 | 0.9%
지원 | 112 | 0.8%
도우미 | 105 | 0.7%
사업 | 103 | 0.7%
사업단 | 69 | 0.5%
공공시설 | 63 | 0.4%
노인일자리 | 63 | 0.4%
Other values (7971) | 12845 | 91.1%

Most occurring characters

Value | Count | Frequency (%)
 | 4309 | 4.5%
(space) | 4209 | 4.4%
 | 4179 | 4.4%
 | 2790 | 2.9%
 | 2688 | 2.8%
 | 2399 | 2.5%
 | 1874 | 2.0%
 | 1672 | 1.8%
 | 1658 | 1.7%
 | 1647 | 1.7%
Other values (844) | 67450 | 71.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 85790 | 90.4%
Space Separator | 4209 | 4.4%
Open Punctuation | 1334 | 1.4%
Close Punctuation | 1333 | 1.4%
Decimal Number | 984 | 1.0%
Uppercase Letter | 364 | 0.4%
Other Punctuation | 358 | 0.4%
Dash Punctuation | 206 | 0.2%
Lowercase Letter | 198 | 0.2%
Initial Punctuation | 24 | < 0.1%
Other values (6) | 75 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 4309 | 5.0%
 | 4179 | 4.9%
 | 2790 | 3.3%
 | 2688 | 3.1%
 | 2399 | 2.8%
 | 1874 | 2.2%
 | 1672 | 1.9%
 | 1658 | 1.9%
 | 1647 | 1.9%
 | 1557 | 1.8%
Other values (756) | 61017 | 71.1%

Uppercase Letter
Value | Count | Frequency (%)
C | 45 | 12.4%
E | 43 | 11.8%
S | 43 | 11.8%
G | 35 | 9.6%
M | 29 | 8.0%
A | 27 | 7.4%
O | 20 | 5.5%
T | 20 | 5.5%
K | 18 | 4.9%
I | 15 | 4.1%
Other values (14) | 69 | 19.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 50 | 25.3%
a | 25 | 12.6%
f | 25 | 12.6%
i | 19 | 9.6%
n | 15 | 7.6%
c | 11 | 5.6%
o | 9 | 4.5%
r | 8 | 4.0%
m | 6 | 3.0%
l | 5 | 2.5%
Other values (12) | 25 | 12.6%

Other Punctuation
Value | Count | Frequency (%)
' | 93 | 26.0%
, | 93 | 26.0%
" | 53 | 14.8%
& | 33 | 9.2%
. | 28 | 7.8%
· | 21 | 5.9%
! | 19 | 5.3%
: | 9 | 2.5%
# | 5 | 1.4%
/ | 3 | 0.8%

Decimal Number
Value | Count | Frequency (%)
2 | 286 | 29.1%
1 | 186 | 18.9%
3 | 129 | 13.1%
0 | 126 | 12.8%
9 | 65 | 6.6%
8 | 58 | 5.9%
6 | 48 | 4.9%
5 | 44 | 4.5%
4 | 22 | 2.2%
7 | 20 | 2.0%

Math Symbol
Value | Count | Frequency (%)
+ | 4 | 40.0%
~ | 3 | 30.0%
 | 1 | 10.0%
< | 1 | 10.0%
> | 1 | 10.0%

Letter Number
Value | Count | Frequency (%)
 | 10 | 45.5%
 | 9 | 40.9%
 | 3 | 13.6%

Open Punctuation
Value | Count | Frequency (%)
( | 1312 | 98.4%
[ | 22 | 1.6%

Close Punctuation
Value | Count | Frequency (%)
) | 1311 | 98.3%
] | 22 | 1.7%

Initial Punctuation
Value | Count | Frequency (%)
 | 13 | 54.2%
 | 11 | 45.8%

Final Punctuation
Value | Count | Frequency (%)
 | 13 | 56.5%
 | 10 | 43.5%

Space Separator
Value | Count | Frequency (%)
(space) | 4209 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 206 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 17 | 100.0%

Modifier Symbol
Value | Count | Frequency (%)
` | 2 | 100.0%

Other Symbol
Value | Count | Frequency (%)
 | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 85728 | 90.4%
Common | 8501 | 9.0%
Latin | 584 | 0.6%
Han | 62 | 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 4309 | 5.0%
 | 4179 | 4.9%
 | 2790 | 3.3%
 | 2688 | 3.1%
 | 2399 | 2.8%
 | 1874 | 2.2%
 | 1672 | 2.0%
 | 1658 | 1.9%
 | 1647 | 1.9%
 | 1557 | 1.8%
Other values (740) | 60955 | 71.1%

Latin
Value | Count | Frequency (%)
e | 50 | 8.6%
C | 45 | 7.7%
E | 43 | 7.4%
S | 43 | 7.4%
G | 35 | 6.0%
M | 29 | 5.0%
A | 27 | 4.6%
a | 25 | 4.3%
f | 25 | 4.3%
O | 20 | 3.4%
Other values (39) | 242 | 41.4%

Common
Value | Count | Frequency (%)
(space) | 4209 | 49.5%
( | 1312 | 15.4%
) | 1311 | 15.4%
2 | 286 | 3.4%
- | 206 | 2.4%
1 | 186 | 2.2%
3 | 129 | 1.5%
0 | 126 | 1.5%
' | 93 | 1.1%
, | 93 | 1.1%
Other values (29) | 550 | 6.5%

Han
Value | Count | Frequency (%)
 | 34 | 54.8%
 | 10 | 16.1%
 | 3 | 4.8%
 | 2 | 3.2%
 | 2 | 3.2%
 | 1 | 1.6%
 | 1 | 1.6%
 | 1 | 1.6%
 | 1 | 1.6%
 | 1 | 1.6%
Other values (6) | 6 | 9.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 85727 | 90.4%
ASCII | 8993 | 9.5%
CJK | 52 | 0.1%
Punctuation | 47 | < 0.1%
Number Forms | 22 | < 0.1%
None | 21 | < 0.1%
CJK Compat Ideographs | 10 | < 0.1%
Compat Jamo | 1 | < 0.1%
Arrows | 1 | < 0.1%
Letterlike Symbols | 1 | < 0.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 4309 | 5.0%
 | 4179 | 4.9%
 | 2790 | 3.3%
 | 2688 | 3.1%
 | 2399 | 2.8%
 | 1874 | 2.2%
 | 1672 | 2.0%
 | 1658 | 1.9%
 | 1647 | 1.9%
 | 1557 | 1.8%
Other values (739) | 60954 | 71.1%

ASCII
Value | Count | Frequency (%)
(space) | 4209 | 46.8%
( | 1312 | 14.6%
) | 1311 | 14.6%
2 | 286 | 3.2%
- | 206 | 2.3%
1 | 186 | 2.1%
3 | 129 | 1.4%
0 | 126 | 1.4%
' | 93 | 1.0%
, | 93 | 1.0%
Other values (68) | 1042 | 11.6%

CJK
Value | Count | Frequency (%)
 | 34 | 65.4%
 | 3 | 5.8%
 | 2 | 3.8%
 | 2 | 3.8%
 | 1 | 1.9%
 | 1 | 1.9%
 | 1 | 1.9%
 | 1 | 1.9%
 | 1 | 1.9%
 | 1 | 1.9%
Other values (5) | 5 | 9.6%

None
Value | Count | Frequency (%)
· | 21 | 100.0%

Punctuation
Value | Count | Frequency (%)
 | 13 | 27.7%
 | 13 | 27.7%
 | 11 | 23.4%
 | 10 | 21.3%

Number Forms
Value | Count | Frequency (%)
 | 10 | 45.5%
 | 9 | 40.9%
 | 3 | 13.6%

CJK Compat Ideographs
Value | Count | Frequency (%)
 | 10 | 100.0%

Compat Jamo
Value | Count | Frequency (%)
 | 1 | 100.0%

Arrows
Value | Count | Frequency (%)
 | 1 | 100.0%

Letterlike Symbols
Value | Count | Frequency (%)
 | 1 | 100.0%

심사상태코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
승인완료: 9504
변경심사요청: 221
반려: 169
삭제요청: 88
조건부승인: 11

Length

Max length: 6
Median length: 4
Mean length: 4.0115
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 승인완료
2nd row: 승인완료
3rd row: 승인완료
4th row: 승인완료
5th row: 승인완료

Common Values

Value | Count | Frequency (%)
승인완료 | 9504 | 95.0%
변경심사요청 | 221 | 2.2%
반려 | 169 | 1.7%
삭제요청 | 88 | 0.9%
조건부승인 | 11 | 0.1%
심사요청 | 7 | 0.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the value counts listed under Common Values]

최초심의일자
Date

Distinct: 233
Distinct (%): 2.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2022-11-29 00:00:00
Maximum: 2023-10-06 00:00:00
[Figure: Histogram with fixed size bins (bins=50)]

심의요청일
Date

Distinct: 231
Distinct (%): 2.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2022-11-29 00:00:00
Maximum: 2023-10-06 00:00:00
[Figure: Histogram with fixed size bins (bins=50)]

심의종료일
Date

MISSING 

Distinct: 157
Distinct (%): 1.6%
Missing: 174
Missing (%): 1.7%
Memory size: 156.2 KiB
Minimum: 2022-11-29 00:00:00
Maximum: 2023-10-05 00:00:00
[Figure: Histogram with fixed size bins (bins=50)]

삭제여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 87.9 KiB
False: 9732
True: 268

Value | Count | Frequency (%)
False | 9732 | 97.3%
True | 268 | 2.7%

Interactions

[Figure: interactions plot]

Correlations

(variable) | 사업유형 | 사업계획변경순번 | 심사상태코드 | 삭제여부
사업유형 | 1.000 | 0.134 | 0.188 | 0.033
사업계획변경순번 | 0.134 | 1.000 | 0.065 | 0.061
심사상태코드 | 0.188 | 0.065 | 1.000 | 0.998
삭제여부 | 0.033 | 0.061 | 0.998 | 1.000

(variable) | 삭제여부 | 사업유형 | 심사상태코드
삭제여부 | 1.000 | 0.055 | 0.964
사업유형 | 0.055 | 1.000 | 0.079
심사상태코드 | 0.964 | 0.079 | 1.000

(variable) | 사업계획변경순번 | 사업유형 | 심사상태코드 | 삭제여부
사업계획변경순번 | 1.000 | 0.080 | 0.034 | 0.047
사업유형 | 0.080 | 1.000 | 0.079 | 0.055
심사상태코드 | 0.034 | 0.079 | 1.000 | 0.964
삭제여부 | 0.047 | 0.055 | 0.964 | 1.000
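
The strongest association in these matrices is between two non-numeric columns, 심사상태코드 and 삭제여부. One common measure of association between categorical variables is Cramér's V. The sketch below implements the plain, uncorrected form of that statistic; whether any of the matrices above was computed with exactly this formula (for example, with or without bias correction) is an assumption, and the two contingency tables are toy data rather than values from this dataset.

```python
import math

def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Toy tables (hypothetical, for illustration only).
perfect = [[50, 0], [0, 50]]         # each row value determines the column value
independent = [[25, 25], [25, 25]]   # no association at all

print(cramers_v(perfect), cramers_v(independent))
```

A value close to 1, such as the 0.964 and 0.998 reported for 심사상태코드 and 삭제여부, indicates that one column is almost fully determined by the other.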

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 사업유형 | 사업번호 | 사업계획변경순번 | 사업년도 | 사업명 | 심사상태코드 | 최초심의일자 | 심의요청일 | 심의종료일 | 삭제여부
1794 | 사회서비스형 | 2023-09583 | 3 | 2023 | 상담지원서비스 | 승인완료 | 2022-12-16 00:00 | 2022-12-16 00:00 | 2022-12-16 00:00 | N
5964 | 시장형 | 2023-05582 | 3 | 2023 | 백년청춘문화예술단 | 승인완료 | 2023-03-10 00:00 | 2022-12-22 00:00 | 2022-12-23 00:00 | N
5064 | 공익활동형 | 2023-06177 | 3 | 2023 | 지역환경지킴이 | 승인완료 | 2023-01-05 00:00 | 2022-12-13 00:00 | 2022-12-13 00:00 | N
8776 | 시장형 | 2023-00555 | 1 | 2023 | 카페 향 | 승인완료 | 2022-12-09 00:00 | 2022-12-09 00:00 | 2022-12-19 00:00 | N
3172 | 사회서비스형 | 2023-07556 | 2 | 2023 | 시니어금융업무지원 | 승인완료 | 2022-12-13 00:00 | 2022-12-13 00:00 | 2022-12-14 00:00 | N
7945 | 공익활동형 | 2023-00378 | 2 | 2023 | 학교지킴이 | 승인완료 | 2023-07-13 00:00 | 2022-12-08 00:00 | 2022-12-19 00:00 | N
1951 | 공익활동형 | 2023-10256 | 1 | 2023 | 노노케어 | 승인완료 | 2022-12-13 00:00 | 2022-12-13 00:00 | 2022-12-14 00:00 | N
8775 | 사회서비스형 | 2023-01379 | 2 | 2023 | 두루두루 도서관 지원단 | 승인완료 | 2022-12-20 00:00 | 2022-12-20 00:00 | 2022-12-20 00:00 | N
5085 | 사회서비스형 | 2023-07026 | 1 | 2023 | 함께돌봄사업 | 승인완료 | 2022-12-09 00:00 | 2022-12-09 00:00 | 2022-12-13 00:00 | N
6657 | 공익활동형 | 2023-06697 | 2 | 2023 | 경륜전수 | 승인완료 | 2022-12-14 00:00 | 2022-12-14 00:00 | 2022-12-16 00:00 | N

(index) | 사업유형 | 사업번호 | 사업계획변경순번 | 사업년도 | 사업명 | 심사상태코드 | 최초심의일자 | 심의요청일 | 심의종료일 | 삭제여부
6905 | 시장형 | 2023-04146 | 2 | 2023 | 행복한보리밥카페2호점 | 승인완료 | 2022-12-19 00:00 | 2022-12-19 00:00 | 2022-12-20 00:00 | N
5463 | 공익활동형 | 2023-04584 | 2 | 2023 | 급식도우미 | 승인완료 | 2022-12-30 00:00 | 2022-12-08 00:00 | 2022-12-15 00:00 | N
7003 | 공익활동형 | 2023-04072 | 4 | 2023 | 향토문화기초조사 | 승인완료 | 2023-02-21 00:00 | 2023-02-20 00:00 | 2022-12-15 00:00 | N
341 | 사회서비스형 | 2023-10591 | 1 | 2023 | 초등학교 방역행정업무(초.코.파.이) | 승인완료 | 2022-12-16 00:00 | 2022-12-16 00:00 | 2022-12-21 00:00 | N
2124 | 사회서비스형 | 2023-08466 | 4 | 2023 | 노인관련시설지원 | 승인완료 | 2023-09-12 00:00 | 2023-07-04 00:00 | 2022-12-16 00:00 | N
8893 | 공익활동형 | 2023-01604 | 2 | 2023 | 스마트실버 | 승인완료 | 2023-01-03 00:00 | 2022-12-08 00:00 | 2022-12-08 00:00 | N
9749 | 공익활동형 | 2023-03658 | 2 | 2023 | 환경지킴이(퇴계동,신동면) | 승인완료 | 2023-01-05 00:00 | 2022-12-07 00:00 | 2022-12-14 00:00 | N
8537 | 공익활동형 | 2023-01097 | 1 | 2023 | 늘 푸른 공원가꾸미 | 승인완료 | 2022-12-09 00:00 | 2022-12-09 00:00 | 2022-12-16 00:00 | N
953 | 사회서비스형 | 2023-76214 | 1 | 2023 | 스마트 기억 e음 | 승인완료 | 2023-06-15 00:00 | 2023-06-15 00:00 | 2023-06-15 00:00 | N
5405 | 공익활동형 | 2023-10862 | 7 | 2023 | 노노케어 | 승인완료 | 2023-05-04 00:00 | 2023-04-28 00:00 | 2022-12-21 00:00 | N