Overview

Dataset statistics

Number of variables4
Number of observations112
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.7 KiB
Average record size in memory34.2 B

Variable types

Categorical3
Numeric1

Dataset

Description개인정보보호위원회에서 관리하는 2023년 자율규제단체들의 개인정보보호 교육 및 컨설팅 진행 정보에 대한 데이터입니다.
Author공공데이터포털
URLhttps://www.data.go.kr/data/15119747/fileData.do

Alerts

분류 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 분류High correlation
진행 수 has 59 (52.7%) zerosZeros

Reproduction

Analysis started2024-04-17 18:26:19.712573
Analysis finished2024-04-17 18:26:20.033054
Duration0.32 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

단체명
Categorical

Distinct28
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
개인정보보호협회
 
4
대한병원협회
 
4
대한약사회
 
4
대한의사협회
 
4
대한치과의사협회
 
4
Other values (23)
92 

Length

Max length23
Median length12
Mean length9.7142857
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인정보보호협회
2nd row대한병원협회
3rd row대한약사회
4th row대한의사협회
5th row대한치과의사협회

Common Values

ValueCountFrequency (%)
개인정보보호협회 4
 
3.6%
대한병원협회 4
 
3.6%
대한약사회 4
 
3.6%
대한의사협회 4
 
3.6%
대한치과의사협회 4
 
3.6%
대한한방병원협회 4
 
3.6%
대한한약사회 4
 
3.6%
대한한의사협회 4
 
3.6%
전국이동통신유통협회 4
 
3.6%
코리아스타트업포럼 4
 
3.6%
Other values (18) 72
64.3%

Length

2024-04-18T03:26:20.102515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
교육 8
 
5.7%
처리방침 4
 
2.9%
한국장애인복지관협회 4
 
2.9%
한국케이블tv방송협회 4
 
2.9%
한국학원총연합회 4
 
2.9%
한국학점은행평생교육협의회 4
 
2.9%
한국호텔업협회 4
 
2.9%
공통(개인정보 4
 
2.9%
개인정보보호협회 4
 
2.9%
대한병원협회 4
 
2.9%
Other values (24) 96
68.6%

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
교육
56 
컨설팅
56 

Length

Max length3
Median length2.5
Mean length2.5
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교육
2nd row교육
3rd row교육
4th row교육
5th row교육

Common Values

ValueCountFrequency (%)
교육 56
50.0%
컨설팅 56
50.0%

Length

2024-04-18T03:26:20.193981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:26:20.264334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
교육 56
50.0%
컨설팅 56
50.0%

분류
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
전문기관
28 
자체교육
28 
현장
28 
부스
28 

Length

Max length4
Median length3
Mean length3
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전문기관
2nd row전문기관
3rd row전문기관
4th row전문기관
5th row전문기관

Common Values

ValueCountFrequency (%)
전문기관 28
25.0%
자체교육 28
25.0%
현장 28
25.0%
부스 28
25.0%

Length

2024-04-18T03:26:20.345181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T03:26:20.426230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전문기관 28
25.0%
자체교육 28
25.0%
현장 28
25.0%
부스 28
25.0%

진행 수
Real number (ℝ)

ZEROS 

Distinct16
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.6517857
Minimum0
Maximum36
Zeros59
Zeros (%)52.7%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-04-18T03:26:20.498213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q33
95-th percentile13.9
Maximum36
Range36
Interquartile range (IQR)3

Descriptive statistics

Standard deviation5.2207311
Coefficient of variation (CV)1.9687605
Kurtosis16.813418
Mean2.6517857
Median Absolute Deviation (MAD)0
Skewness3.6179174
Sum297
Variance27.256033
MonotonicityNot monotonic
2024-04-18T03:26:20.599666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
0 59
52.7%
2 11
 
9.8%
3 10
 
8.9%
1 8
 
7.1%
6 5
 
4.5%
4 5
 
4.5%
7 3
 
2.7%
5 2
 
1.8%
15 2
 
1.8%
9 1
 
0.9%
Other values (6) 6
 
5.4%
ValueCountFrequency (%)
0 59
52.7%
1 8
 
7.1%
2 11
 
9.8%
3 10
 
8.9%
4 5
 
4.5%
5 2
 
1.8%
6 5
 
4.5%
7 3
 
2.7%
9 1
 
0.9%
12 1
 
0.9%
ValueCountFrequency (%)
36 1
 
0.9%
23 1
 
0.9%
17 1
 
0.9%
16 1
 
0.9%
15 2
 
1.8%
13 1
 
0.9%
12 1
 
0.9%
9 1
 
0.9%
7 3
2.7%
6 5
4.5%

Interactions

2024-04-18T03:26:19.838268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-18T03:26:20.662568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단체명구분분류진행 수
단체명1.0000.0000.0000.016
구분0.0001.0001.0000.000
분류0.0001.0001.0000.252
진행 수0.0160.0000.2521.000
2024-04-18T03:26:20.731680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단체명분류구분
단체명1.0000.0000.000
분류0.0001.0000.991
구분0.0000.9911.000
2024-04-18T03:26:20.795507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
진행 수단체명구분분류
진행 수1.0000.0000.0000.171
단체명0.0001.0000.0000.000
구분0.0000.0001.0000.991
분류0.1710.0000.9911.000

Missing values

2024-04-18T03:26:19.927222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-18T03:26:20.000767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

단체명구분분류진행 수
0개인정보보호협회교육전문기관0
1대한병원협회교육전문기관1
2대한약사회교육전문기관0
3대한의사협회교육전문기관0
4대한치과의사협회교육전문기관0
5대한한방병원협회교육전문기관0
6대한한약사회교육전문기관0
7대한한의사협회교육전문기관0
8전국이동통신유통협회교육전문기관3
9코리아스타트업포럼교육전문기관2
단체명구분분류진행 수
102한국온라인쇼핑협회컨설팅부스0
103한국장애인복지관협회컨설팅부스0
104한국케이블TV방송협회컨설팅부스0
105한국학원총연합회컨설팅부스0
106한국학점은행평생교육협의회컨설팅부스0
107한국호텔업협회컨설팅부스0
108공통(개인정보 처리방침 교육)컨설팅부스0
109공통(단체 담당자 교육)컨설팅부스0
110공통(전문가 양성교육)컨설팅부스0
111공통(PIS FAIR, 신규단체)부스컨설팅컨설팅부스3