Overview

Dataset statistics

Number of variables8
Number of observations91
Missing cells12
Missing cells (%)1.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.0 KiB
Average record size in memory67.5 B

Variable types

Numeric2
Categorical4
Text1
DateTime1

Dataset

Description대전광역시 서구 노인일자리 및 사회활동지원사업현황(기관구분, 기관명, 활동구분명, 사업명, 일자리 수 등)정보를 제공합니다.
Author대전광역시 서구
URLhttps://www.data.go.kr/data/15113070/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
기관명 is highly overall correlated with 순번 and 2 other fieldsHigh correlation
기관구분명 is highly overall correlated with 순번 and 3 other fieldsHigh correlation
일자리수단위 is highly overall correlated with 순번 and 4 other fieldsHigh correlation
활동구분명 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
순번 is highly overall correlated with 기관구분명 and 3 other fieldsHigh correlation
일자리수 is highly overall correlated with 기관구분명 and 1 other fieldsHigh correlation
기관구분명 is highly imbalanced (74.0%)Imbalance
일자리수단위 is highly imbalanced (69.3%)Imbalance
사업명 has 12 (13.2%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2024-04-21 02:16:27.249680
Analysis finished2024-04-21 02:16:30.581074
Duration3.33 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct91
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean46
Minimum1
Maximum91
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size951.0 B
2024-04-21T11:16:30.668515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.5
Q123.5
median46
Q368.5
95-th percentile86.5
Maximum91
Range90
Interquartile range (IQR)45

Descriptive statistics

Standard deviation26.41338
Coefficient of variation (CV)0.57420392
Kurtosis-1.2
Mean46
Median Absolute Deviation (MAD)23
Skewness0
Sum4186
Variance697.66667
MonotonicityStrictly increasing
2024-04-21T11:16:30.839955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
59 1
 
1.1%
68 1
 
1.1%
67 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
62 1
 
1.1%
61 1
 
1.1%
Other values (81) 81
89.0%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
91 1
1.1%
90 1
1.1%
89 1
1.1%
88 1
1.1%
87 1
1.1%
86 1
1.1%
85 1
1.1%
84 1
1.1%
83 1
1.1%
82 1
1.1%

기관구분명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size860.0 B
민간기관
87 
공공기관
 
4

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공기관
2nd row공공기관
3rd row공공기관
4th row공공기관
5th row민간기관

Common Values

ValueCountFrequency (%)
민간기관 87
95.6%
공공기관 4
 
4.4%

Length

2024-04-21T11:16:30.994623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T11:16:31.094511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민간기관 87
95.6%
공공기관 4
 
4.4%

기관명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size860.0 B
서구시니어클럽
24 
서구노인복지관
12 
유등노인복지관
10 
서구노인지회
관저종합사회복지관
Other values (8)
31 

Length

Max length12
Median length7
Mean length7.5164835
Min length2

Unique

Unique1 ?
Unique (%)1.1%

Sample

1st row서구
2nd row서구
3rd row서구
4th row서구
5th row서구시니어클럽

Common Values

ValueCountFrequency (%)
서구시니어클럽 24
26.4%
서구노인복지관 12
13.2%
유등노인복지관 10
11.0%
서구노인지회 7
 
7.7%
관저종합사회복지관 7
 
7.7%
정림종합사회복지관 6
 
6.6%
월평종합사회복지관 5
 
5.5%
서구 4
 
4.4%
둔산종합사회복지관 4
 
4.4%
한밭종합사회복지관 4
 
4.4%
Other values (3) 8
 
8.8%

Length

2024-04-21T11:16:31.204623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서구시니어클럽 24
26.4%
서구노인복지관 12
13.2%
유등노인복지관 10
11.0%
서구노인지회 7
 
7.7%
관저종합사회복지관 7
 
7.7%
정림종합사회복지관 6
 
6.6%
월평종합사회복지관 5
 
5.5%
서구 4
 
4.4%
둔산종합사회복지관 4
 
4.4%
한밭종합사회복지관 4
 
4.4%
Other values (3) 8
 
8.8%

활동구분명
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Memory size860.0 B
공익활동
50 
사회서비스형
21 
전담인력
12 
시장형
인력파견형
 
1

Length

Max length6
Median length4
Mean length4.3956044
Min length3

Unique

Unique1 ?
Unique (%)1.1%

Sample

1st row공익활동
2nd row공익활동
3rd row공익활동
4th row전담인력
5th row공익활동

Common Values

ValueCountFrequency (%)
공익활동 50
54.9%
사회서비스형 21
23.1%
전담인력 12
 
13.2%
시장형 7
 
7.7%
인력파견형 1
 
1.1%

Length

2024-04-21T11:16:31.343107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T11:16:31.465022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공익활동 50
54.9%
사회서비스형 21
23.1%
전담인력 12
 
13.2%
시장형 7
 
7.7%
인력파견형 1
 
1.1%

사업명
Text

MISSING 

Distinct78
Distinct (%)98.7%
Missing12
Missing (%)13.2%
Memory size860.0 B
2024-04-21T11:16:31.709270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length8.1518987
Min length4

Characters and Unicode

Total characters644
Distinct characters184
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)97.5%

Sample

1st row환경지킴이사업
2nd row스콜존교통안전지원
3rd row실버순찰대
4th row공공시설보조도우미
5th row실버급식도우미
ValueCountFrequency (%)
커피찌꺼기새활용사업 2
 
2.4%
환경지킴이사업 1
 
1.2%
복지시설봉사 1
 
1.2%
월평사랑지킴이사업 1
 
1.2%
사랑의도시락배달 1
 
1.2%
황토길관리 1
 
1.2%
1
 
1.2%
공공시설 1
 
1.2%
함께하는희망세상 1
 
1.2%
수밋들행복나누미 1
 
1.2%
Other values (73) 73
86.9%
2024-04-21T11:16:32.166663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30
 
4.7%
30
 
4.7%
20
 
3.1%
19
 
3.0%
17
 
2.6%
15
 
2.3%
15
 
2.3%
14
 
2.2%
14
 
2.2%
14
 
2.2%
Other values (174) 456
70.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 624
96.9%
Close Punctuation 7
 
1.1%
Open Punctuation 7
 
1.1%
Space Separator 5
 
0.8%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
4.8%
30
 
4.8%
20
 
3.2%
19
 
3.0%
17
 
2.7%
15
 
2.4%
15
 
2.4%
14
 
2.2%
14
 
2.2%
14
 
2.2%
Other values (170) 436
69.9%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 624
96.9%
Common 20
 
3.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
4.8%
30
 
4.8%
20
 
3.2%
19
 
3.0%
17
 
2.7%
15
 
2.4%
15
 
2.4%
14
 
2.2%
14
 
2.2%
14
 
2.2%
Other values (170) 436
69.9%
Common
ValueCountFrequency (%)
) 7
35.0%
( 7
35.0%
5
25.0%
/ 1
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 624
96.9%
ASCII 20
 
3.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
30
 
4.8%
30
 
4.8%
20
 
3.2%
19
 
3.0%
17
 
2.7%
15
 
2.4%
15
 
2.4%
14
 
2.2%
14
 
2.2%
14
 
2.2%
Other values (170) 436
69.9%
ASCII
ValueCountFrequency (%)
) 7
35.0%
( 7
35.0%
5
25.0%
/ 1
 
5.0%

일자리수
Real number (ℝ)

HIGH CORRELATION 

Distinct51
Distinct (%)56.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean60.934066
Minimum1
Maximum988
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size951.0 B
2024-04-21T11:16:32.314412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q120
median40
Q369.5
95-th percentile160
Maximum988
Range987
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation108.50938
Coefficient of variation (CV)1.780767
Kurtosis60.343396
Mean60.934066
Median Absolute Deviation (MAD)25
Skewness7.1583713
Sum5545
Variance11774.284
MonotonicityNot monotonic
2024-04-21T11:16:32.466522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
40 10
 
11.0%
20 9
 
9.9%
60 6
 
6.6%
1 4
 
4.4%
70 3
 
3.3%
30 3
 
3.3%
12 3
 
3.3%
78 2
 
2.2%
15 2
 
2.2%
50 2
 
2.2%
Other values (41) 47
51.6%
ValueCountFrequency (%)
1 4
4.4%
2 2
2.2%
3 2
2.2%
4 1
 
1.1%
5 2
2.2%
8 1
 
1.1%
11 1
 
1.1%
12 3
3.3%
13 1
 
1.1%
14 1
 
1.1%
ValueCountFrequency (%)
988 1
1.1%
224 1
1.1%
200 1
1.1%
190 1
1.1%
184 1
1.1%
136 1
1.1%
124 1
1.1%
118 1
1.1%
116 1
1.1%
112 1
1.1%

일자리수단위
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size860.0 B
86 
<NA>
 
5

Length

Max length4
Median length1
Mean length1.1648352
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
86
94.5%
<NA> 5
 
5.5%

Length

2024-04-21T11:16:32.602488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T11:16:32.716928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
86
94.5%
na 5
 
5.5%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size860.0 B
Minimum2024-01-01 00:00:00
Maximum2024-01-01 00:00:00
2024-04-21T11:16:32.801808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:16:32.902917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-04-21T11:16:30.201768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:16:29.963740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:16:30.280754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:16:30.128517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T11:16:32.991978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번기관구분명기관명활동구분명사업명일자리수
순번1.0000.7580.9150.8560.8980.334
기관구분명0.7581.0001.0000.0001.0000.801
기관명0.9151.0001.0000.3850.0000.356
활동구분명0.8560.0000.3851.0001.0000.144
사업명0.8981.0000.0001.0001.0001.000
일자리수0.3340.8010.3560.1441.0001.000
2024-04-21T11:16:33.126462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기관명기관구분명일자리수단위활동구분명
기관명1.0000.9361.0000.206
기관구분명0.9361.0001.0000.000
일자리수단위1.0001.0001.0001.000
활동구분명0.2060.0001.0001.000
2024-04-21T11:16:33.238858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번일자리수기관구분명기관명활동구분명일자리수단위
순번1.000-0.2710.5250.7000.5131.000
일자리수-0.2711.0000.5860.2000.1151.000
기관구분명0.5250.5861.0000.9360.0001.000
기관명0.7000.2000.9361.0000.2061.000
활동구분명0.5130.1150.0000.2061.0001.000
일자리수단위1.0001.0001.0001.0001.0001.000

Missing values

2024-04-21T11:16:30.393299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T11:16:30.526563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번기관구분명기관명활동구분명사업명일자리수일자리수단위데이터기준일자
01공공기관서구공익활동환경지킴이사업9882024-01-01
12공공기관서구공익활동스콜존교통안전지원2242024-01-01
23공공기관서구공익활동실버순찰대1842024-01-01
34공공기관서구전담인력<NA>12024-01-01
45민간기관서구시니어클럽공익활동공공시설보조도우미702024-01-01
56민간기관서구시니어클럽공익활동실버급식도우미2002024-01-01
67민간기관서구시니어클럽공익활동종이팩 재활용 활성화222024-01-01
78민간기관서구시니어클럽사회서비스형소비자안전모니터요원802024-01-01
89민간기관서구시니어클럽사회서비스형공익방송모니터요원492024-01-01
910민간기관서구시니어클럽사회서비스형함께누리302024-01-01
순번기관구분명기관명활동구분명사업명일자리수일자리수단위데이터기준일자
8182민간기관한밭종합사회복지관공익활동커피찌꺼기새활용사업402024-01-01
8283민간기관한밭종합사회복지관전담인력<NA>12024-01-01
8384민간기관용문종합사회복지관공익활동커피찌꺼기새활용사업202024-01-01
8485민간기관용문종합사회복지관공익활동독거노인친구만들기402024-01-01
8586민간기관용문종합사회복지관공익활동유등천지킴이402024-01-01
8687민간기관용문종합사회복지관전담인력<NA>12024-01-01
8788민간기관시설관리공단공익활동장묘문화개선사업402024-01-01
8889민간기관퇴직공무원재능나눔봉사단사회서비스형사전연명의료의향서홍보상담사업402024-01-01
8990민간기관퇴직공무원재능나눔봉사단사회서비스형취약계층교육지원사업202024-01-01
9091민간기관퇴직공무원재능나눔봉사단전담인력<NA>12024-01-01