Overview

Dataset statistics

Number of variables4
Number of observations35
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory36.8 B

Variable types

Categorical2
Text2

Dataset

Description예금보험공사의 주요 사회공헌사업. 한 개의 부서당 한 개의 사회복지시설과 각각 지속적인 관계를 맺으며 연간 봉사활동
URLhttps://www.data.go.kr/data/15104351/fileData.do

Alerts

주요 활동내역 is highly imbalanced (76.5%)Imbalance

Reproduction

Analysis started2023-12-12 20:38:18.826516
Analysis finished2023-12-12 20:38:19.294067
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Categorical

Distinct2
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size412.0 B
2021
21 
2022
14 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2021 21
60.0%
2022 14
40.0%

Length

2023-12-13T05:38:19.382983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:38:19.817864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 21
60.0%
2022 14
40.0%

부서
Text

Distinct24
Distinct (%)68.6%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-13T05:38:20.041370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length5
Mean length5.2857143
Min length3

Characters and Unicode

Total characters185
Distinct characters53
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)37.1%

Sample

1st rowIT전략운영부
2nd row정보보호실
3rd row감사실
4th row기금관리실
5th row기금운용실
ValueCountFrequency (%)
it전략운영부 2
 
5.7%
정보보호실 2
 
5.7%
기획조정부 2
 
5.7%
감사실 2
 
5.7%
기금관리실 2
 
5.7%
저축은행관리부 2
 
5.7%
기금운용실 2
 
5.7%
홍보실 2
 
5.7%
인재개발실 2
 
5.7%
해외재산조사부 2
 
5.7%
Other values (14) 15
42.9%
2023-12-13T05:38:20.413720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20
 
10.8%
15
 
8.1%
11
 
5.9%
10
 
5.4%
9
 
4.9%
8
 
4.3%
7
 
3.8%
7
 
3.8%
6
 
3.2%
5
 
2.7%
Other values (43) 87
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 180
97.3%
Uppercase Letter 4
 
2.2%
Decimal Number 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
11.1%
15
 
8.3%
11
 
6.1%
10
 
5.6%
9
 
5.0%
8
 
4.4%
7
 
3.9%
7
 
3.9%
6
 
3.3%
5
 
2.8%
Other values (40) 82
45.6%
Uppercase Letter
ValueCountFrequency (%)
I 2
50.0%
T 2
50.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 180
97.3%
Latin 4
 
2.2%
Common 1
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
11.1%
15
 
8.3%
11
 
6.1%
10
 
5.6%
9
 
5.0%
8
 
4.4%
7
 
3.9%
7
 
3.9%
6
 
3.3%
5
 
2.8%
Other values (40) 82
45.6%
Latin
ValueCountFrequency (%)
I 2
50.0%
T 2
50.0%
Common
ValueCountFrequency (%)
1 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 180
97.3%
ASCII 5
 
2.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
20
 
11.1%
15
 
8.3%
11
 
6.1%
10
 
5.6%
9
 
5.0%
8
 
4.4%
7
 
3.9%
7
 
3.9%
6
 
3.3%
5
 
2.8%
Other values (40) 82
45.6%
ASCII
ValueCountFrequency (%)
I 2
40.0%
T 2
40.0%
1 1
20.0%
Distinct20
Distinct (%)57.1%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-13T05:38:20.643018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length8.8571429
Min length3

Characters and Unicode

Total characters310
Distinct characters64
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)28.6%

Sample

1st row등대지역아동센터
2nd row등대지역아동센터
3rd row송죽원
4th row구로구 다문화가족 지원센터
5th row신당종합사회복지관
ValueCountFrequency (%)
등대지역아동센터 4
 
8.5%
동대문노인종합복지관 4
 
8.5%
유락종합사회복지관 3
 
6.4%
협동조합 2
 
4.3%
갈월종합사회복지관 2
 
4.3%
모두 2
 
4.3%
다문화도서관 2
 
4.3%
종로지역아동센터 2
 
4.3%
송죽원 2
 
4.3%
사회적 2
 
4.3%
Other values (18) 22
46.8%
2023-12-13T05:38:20.984077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
7.7%
16
 
5.2%
14
 
4.5%
14
 
4.5%
14
 
4.5%
14
 
4.5%
12
 
3.9%
12
 
3.9%
12
 
3.9%
10
 
3.2%
Other values (54) 168
54.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 298
96.1%
Space Separator 12
 
3.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
8.1%
16
 
5.4%
14
 
4.7%
14
 
4.7%
14
 
4.7%
14
 
4.7%
12
 
4.0%
12
 
4.0%
10
 
3.4%
10
 
3.4%
Other values (53) 158
53.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 298
96.1%
Common 12
 
3.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
8.1%
16
 
5.4%
14
 
4.7%
14
 
4.7%
14
 
4.7%
14
 
4.7%
12
 
4.0%
12
 
4.0%
10
 
3.4%
10
 
3.4%
Other values (53) 158
53.0%
Common
ValueCountFrequency (%)
12
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 298
96.1%
ASCII 12
 
3.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
 
8.1%
16
 
5.4%
14
 
4.7%
14
 
4.7%
14
 
4.7%
14
 
4.7%
12
 
4.0%
12
 
4.0%
10
 
3.4%
10
 
3.4%
Other values (53) 158
53.0%
ASCII
ValueCountFrequency (%)
12
100.0%

주요 활동내역
Categorical

IMBALANCE 

Distinct3
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Memory size412.0 B
물품지원 및 후원
33 
도시락 배달
 
1
물품지원 및 후원·도시락 배달
 
1

Length

Max length16
Median length9
Mean length9.1142857
Min length6

Unique

Unique2 ?
Unique (%)5.7%

Sample

1st row물품지원 및 후원
2nd row물품지원 및 후원
3rd row물품지원 및 후원
4th row물품지원 및 후원
5th row도시락 배달

Common Values

ValueCountFrequency (%)
물품지원 및 후원 33
94.3%
도시락 배달 1
 
2.9%
물품지원 및 후원·도시락 배달 1
 
2.9%

Length

2023-12-13T05:38:21.120096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:38:21.243405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
물품지원 34
32.4%
34
32.4%
후원 33
31.4%
배달 2
 
1.9%
도시락 1
 
1.0%
후원·도시락 1
 
1.0%

Correlations

2023-12-13T05:38:21.335719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도부서결연기관주요 활동내역
연도1.0000.0000.0000.043
부서0.0001.0000.9850.000
결연기관0.0000.9851.0000.000
주요 활동내역0.0430.0000.0001.000
2023-12-13T05:38:21.456975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주요 활동내역연도
주요 활동내역1.0000.058
연도0.0581.000
2023-12-13T05:38:21.568982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도주요 활동내역
연도1.0000.058
주요 활동내역0.0581.000

Missing values

2023-12-13T05:38:19.124764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:38:19.249918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도부서결연기관주요 활동내역
02021IT전략운영부등대지역아동센터물품지원 및 후원
12021정보보호실등대지역아동센터물품지원 및 후원
22021감사실송죽원물품지원 및 후원
32021기금관리실구로구 다문화가족 지원센터물품지원 및 후원
42021기금운용실신당종합사회복지관도시락 배달
52021기금정책부서울노인복지센터물품지원 및 후원
62021저축은행관리부동대문노인종합복지관물품지원 및 후원
72021기획조정부동대문노인종합복지관물품지원 및 후원
82021리스크총괄부약수노인종합복지관물품지원 및 후원
92021대형금융회사관리부청소년 숲 사회적 협동조합물품지원 및 후원
연도부서결연기관주요 활동내역
252022기금관리실구로구가족센터물품지원 및 후원
262022기금운용실신당종합사회복지관물품지원 및 후원·도시락 배달
272022기획조정부동대문노인종합복지관물품지원 및 후원
282022법무실아동복지실천회 세움물품지원 및 후원
292022저축은행관리부동대문노인종합복지관물품지원 및 후원
302022조사기획부나비훨훨지역아동센터물품지원 및 후원
312022채권관리부생명나눔실천본부물품지원 및 후원
322022해외재산조사부갈월종합사회복지관물품지원 및 후원
332022홍보실종로지역아동센터물품지원 및 후원
342022회수기획부다문화도서관 모두물품지원 및 후원