Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 26 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.1 KiB |
Average record size in memory | 82.1 B |
Variable types
Numeric | 2 |
---|---|
Text | 1 |
Boolean | 2 |
Categorical | 3 |
DateTime | 1 |
Dataset
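Description | Data on online educational content and course information from the online personal information protection portal, providing details such as course name, registration date and time, and session number. |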
---|---|
Author | 한국인터넷진흥원 (Korea Internet & Security Agency, KISA) |
URL | https://www.data.go.kr/data/15070607/fileData.do |
조회수 has constant value "0" | Constant |
선택이수챕터수 is highly overall correlated with 노출여부 and 2 other fields | High correlation |
노출여부 is highly overall correlated with 선택이수챕터수 | High correlation |
필수시험여부 is highly overall correlated with 선택이수챕터수 | High correlation |
시험과락갯수 is highly overall correlated with 선택이수챕터수 | High correlation |
노출여부 is highly imbalanced (60.9%) | Imbalance |
선택이수챕터수 is highly imbalanced (60.8%) | Imbalance |
필수시험여부 is highly imbalanced (76.5%) | Imbalance |
시험과락갯수 is highly imbalanced (76.5%) | Imbalance |
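The imbalance percentages above are consistent with a score of one minus the normalized Shannon entropy of each variable's class counts. The sketch below illustrates that reading; it is an assumption checked against the reported figures, not a quote of the profiler's source. The function name `imbalance_score` is hypothetical, and the class counts are taken from the per-variable value tables in this report.

```python
from math import log2

def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts.

    0 means perfectly balanced classes, 1 means a single dominant class.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 1.0  # assumption: a single class counts as fully imbalanced
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / log2(k)

# Class counts as listed in the value tables of this report.
scores = {
    "노출여부": imbalance_score([24, 2]),
    "선택이수챕터수": imbalance_score([23, 2, 1]),
    "필수시험여부": imbalance_score([25, 1]),
    "시험과락갯수": imbalance_score([25, 1]),
}
for name, score in scores.items():
    print(f"{name}: {score:.1%}")
# 노출여부: 60.9%
# 선택이수챕터수: 60.8%
# 필수시험여부: 76.5%
# 시험과락갯수: 76.5%
```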
인덱스 has unique values | Unique |
강의명 has unique values | Unique |
Reproduction
Analysis started | 2023-12-13 00:43:48.866254 |
---|---|
Analysis finished | 2023-12-13 00:43:49.570390 |
Duration | 0.7 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
인덱스
Real number (ℝ)
UNIQUE
Distinct | 26 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14.192308 |
Minimum | 1 |
---|---|
Maximum | 27 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 366.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2.25 |
Q1 | 7.25 |
median | 14.5 |
Q3 | 20.75 |
95-th percentile | 25.75 |
Maximum | 27 |
Range | 26 |
Interquartile range (IQR) | 13.5 |
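The quantiles above match linear interpolation between order statistics. The sketch below reproduces them under one assumption: that 인덱스 holds the integers 1 to 27 with 9 absent, which is inferred from the reported sum (369), the 26 observations, and the sample rows skipping from 8 to 10. The helper `quantile` is a hypothetical name.

```python
def quantile(sorted_values, q):
    """Linearly interpolate the q-th quantile (0 <= q <= 1) of pre-sorted data."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

# Assumption: 인덱스 holds 1..27 with 9 missing (inferred, see lead-in).
index_values = [v for v in range(1, 28) if v != 9]

q05 = quantile(index_values, 0.05)
q1 = quantile(index_values, 0.25)
median = quantile(index_values, 0.50)
q3 = quantile(index_values, 0.75)
q95 = quantile(index_values, 0.95)
iqr = q3 - q1

print([round(x, 2) for x in (q05, q1, median, q3, q95, iqr)])
# [2.25, 7.25, 14.5, 20.75, 25.75, 13.5]
```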
Descriptive statistics
Standard deviation | 8.0300398 |
---|---|
Coefficient of variation (CV) | 0.56580226 |
Kurtosis | -1.2168073 |
Mean | 14.192308 |
Median Absolute Deviation (MAD) | 7 |
Skewness | -0.067387075 |
Sum | 369 |
Variance | 64.481538 |
Monotonicity | Strictly increasing |
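As a cross-check, the descriptive statistics above can be recomputed with Python's standard library. This is a sketch under the assumption that 인덱스 holds the integers 1 to 27 with 9 absent (inferred from the sum of 369 over 26 observations and from the sample rows); variance and standard deviation use the sample (n - 1) form.

```python
import statistics

# Assumption: 인덱스 holds 1..27 with 9 missing (inferred from Sum = 369, n = 26).
index_values = [v for v in range(1, 28) if v != 9]

total = sum(index_values)
mean = statistics.mean(index_values)
variance = statistics.variance(index_values)  # sample variance, divides by n - 1
std_dev = statistics.stdev(index_values)      # square root of the sample variance
cv = std_dev / mean                           # coefficient of variation
median = statistics.median(index_values)
# Median absolute deviation: median of |x - median(x)|.
mad = statistics.median(abs(v - median) for v in index_values)

print(total, round(mean, 6), round(variance, 6), round(std_dev, 7), round(cv, 6), mad)
```

The rounded results should agree with the table to the precision shown there (sum 369, mean 14.192308, variance 64.481538, MAD 7).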
Common Values
Value | Count | Frequency (%) |
1 | 1 | 3.8% |
16 | 1 | 3.8% |
27 | 1 | 3.8% |
26 | 1 | 3.8% |
25 | 1 | 3.8% |
24 | 1 | 3.8% |
23 | 1 | 3.8% |
22 | 1 | 3.8% |
21 | 1 | 3.8% |
20 | 1 | 3.8% |
Other values (16) | 16 | 61.5% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1 | 3.8% |
2 | 1 | 3.8% |
3 | 1 | 3.8% |
4 | 1 | 3.8% |
5 | 1 | 3.8% |
6 | 1 | 3.8% |
7 | 1 | 3.8% |
8 | 1 | 3.8% |
10 | 1 | 3.8% |
11 | 1 | 3.8% |
Maximum 10 values
Value | Count | Frequency (%) |
27 | 1 | 3.8% |
26 | 1 | 3.8% |
25 | 1 | 3.8% |
24 | 1 | 3.8% |
23 | 1 | 3.8% |
22 | 1 | 3.8% |
21 | 1 | 3.8% |
20 | 1 | 3.8% |
19 | 1 | 3.8% |
18 | 1 | 3.8% |
강의명
Text
UNIQUE
Distinct | 26 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 340.0 B |
Length
Max length | 31 |
---|---|
Median length | 21 |
Mean length | 16.192308 |
Min length | 7 |
Characters and Unicode
Total characters | 421 |
---|---|
Distinct characters | 81 |
Distinct categories | 9 |
Distinct scripts | 3 |
Distinct blocks | 3 |
Unique
Unique | 26 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 개인정보보호 교육과정1 |
---|---|
2nd row | 개인정보보호 교육과정2 |
3rd row | 정보보호 실무과정 |
4th row | 정보보호 기초과정 |
5th row | 개인정보보호교육 - 업종별 교육과정 |
Value | Count | Frequency (%) |
(empty) | 12 | 14.1% |
개인정보보호교육 | 10 | 11.8% |
교육과정 | 5 | 5.9% |
사업자 | 3 | 3.5% |
school | 3 | 3.5% |
student | 3 | 3.5% |
ceo·cpo | 3 | 3.5% |
교육 | 2 | 2.4% |
위치정보보호 | 2 | 2.4% |
개인정보보호 | 2 | 2.4% |
Other values (39) | 40 | 47.1% |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 60 | 14.3% |
보 | 33 | 7.8% |
정 | 27 | 6.4% |
교 | 24 | 5.7% |
육 | 23 | 5.5% |
호 | 16 | 3.8% |
인 | 13 | 3.1% |
- | 12 | 2.9% |
개 | 12 | 2.9% |
n | 10 | 2.4% |
Other values (71) | 191 | 45.4% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 219 | 52.0% |
Lowercase Letter | 90 | 21.4% |
Space Separator | 60 | 14.3% |
Uppercase Letter | 26 | 6.2% |
Dash Punctuation | 12 | 2.9% |
Decimal Number | 9 | 2.1% |
Other Punctuation | 3 | 0.7% |
Open Punctuation | 1 | 0.2% |
Close Punctuation | 1 | 0.2% |
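The category names in the table above correspond to Unicode general categories (for example `Lo` for Other Letter, `Zs` for Space Separator). As a minimal sketch of how such a breakdown can be obtained, the code below classifies the characters of one course title from the sample rows with Python's `unicodedata` module. The mapping `CATEGORY_NAMES` and the helper `category_counts` are illustrative names, not part of the profiler.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general-category codes mapped to the names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

def category_counts(text):
    """Count characters per Unicode general category, using readable names."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)

title = "위치정보보호 교육과정(신)"  # last course title in the sample rows
counts = category_counts(title)
print(dict(counts))
# {'Other Letter': 11, 'Space Separator': 1, 'Open Punctuation': 1, 'Close Punctuation': 1}
```

Summing such counters over all 26 titles would give the per-category totals; that step is not shown here because only 20 of the 26 titles appear in this report.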
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
보 | 33 | 15.1% |
정 | 27 | 12.3% |
교 | 24 | 11.0% |
육 | 23 | 10.5% |
호 | 16 | 7.3% |
인 | 13 | 5.9% |
개 | 12 | 5.5% |
과 | 10 | 4.6% |
업 | 4 | 1.8% |
제 | 4 | 1.8% |
Other values (33) | 53 | 24.2% |
Lowercase Letter
Value | Count | Frequency (%) |
n | 10 | 11.1% |
e | 8 | 8.9% |
t | 7 | 7.8% |
s | 7 | 7.8% |
c | 7 | 7.8% |
h | 7 | 7.8% |
o | 7 | 7.8% |
l | 7 | 7.8% |
i | 6 | 6.7% |
d | 6 | 6.7% |
Other values (9) | 18 | 20.0% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 6 | 23.1% |
O | 6 | 23.1% |
E | 4 | 15.4% |
P | 4 | 15.4% |
M | 2 | 7.7% |
G | 1 | 3.8% |
H | 1 | 3.8% |
S | 1 | 3.8% |
I | 1 | 3.8% |
Decimal Number
Value | Count | Frequency (%) |
2 | 3 | 33.3% |
1 | 3 | 33.3% |
0 | 1 | 11.1% |
4 | 1 | 11.1% |
3 | 1 | 11.1% |
Space Separator
Value | Count | Frequency (%) |
(space) | 60 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 12 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
· | 3 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 219 | 52.0% |
Latin | 116 | 27.6% |
Common | 86 | 20.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
보 | 33 | 15.1% |
정 | 27 | 12.3% |
교 | 24 | 11.0% |
육 | 23 | 10.5% |
호 | 16 | 7.3% |
인 | 13 | 5.9% |
개 | 12 | 5.5% |
과 | 10 | 4.6% |
업 | 4 | 1.8% |
제 | 4 | 1.8% |
Other values (33) | 53 | 24.2% |
Latin
Value | Count | Frequency (%) |
n | 10 | 8.6% |
e | 8 | 6.9% |
t | 7 | 6.0% |
s | 7 | 6.0% |
c | 7 | 6.0% |
h | 7 | 6.0% |
o | 7 | 6.0% |
l | 7 | 6.0% |
C | 6 | 5.2% |
O | 6 | 5.2% |
Other values (18) | 44 | 37.9% |
Common
Value | Count | Frequency (%) |
(space) | 60 | 69.8% |
- | 12 | 14.0% |
2 | 3 | 3.5% |
· | 3 | 3.5% |
1 | 3 | 3.5% |
( | 1 | 1.2% |
0 | 1 | 1.2% |
4 | 1 | 1.2% |
3 | 1 | 1.2% |
) | 1 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 219 | 52.0% |
ASCII | 199 | 47.3% |
None | 3 | 0.7% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 60 | 30.2% |
- | 12 | 6.0% |
n | 10 | 5.0% |
e | 8 | 4.0% |
t | 7 | 3.5% |
s | 7 | 3.5% |
c | 7 | 3.5% |
h | 7 | 3.5% |
o | 7 | 3.5% |
l | 7 | 3.5% |
Other values (27) | 67 | 33.7% |
Hangul
Value | Count | Frequency (%) |
보 | 33 | 15.1% |
정 | 27 | 12.3% |
교 | 24 | 11.0% |
육 | 23 | 10.5% |
호 | 16 | 7.3% |
인 | 13 | 5.9% |
개 | 12 | 5.5% |
과 | 10 | 4.6% |
업 | 4 | 1.8% |
제 | 4 | 1.8% |
Other values (33) | 53 | 24.2% |
None
Value | Count | Frequency (%) |
· | 3 | 100.0% |
노출여부
Boolean
HIGH CORRELATION
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 7.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 158.0 B |
True | 24 |
---|---|
False | 2 |
Value | Count | Frequency (%) |
True | 24 | 92.3% |
False | 2 | 7.7% |
조회수
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 3.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 340.0 B |
0 | 26 |
---|---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 26 | 100.0% |
등록일자
Date
Distinct | 18 |
---|---|
Distinct (%) | 69.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 340.0 B |
Minimum | 2010-11-17 11:52:00 |
---|---|
Maximum | 2020-02-18 11:08:00 |
이수챕터수
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 23.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.2692308 |
Minimum | 1 |
---|---|
Maximum | 10 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 366.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 3 |
Q3 | 4.75 |
95-th percentile | 5 |
Maximum | 10 |
Range | 9 |
Interquartile range (IQR) | 2.75 |
Descriptive statistics
Standard deviation | 1.9911342 |
---|---|
Coefficient of variation (CV) | 0.60905281 |
Kurtosis | 3.9534447 |
Mean | 3.2692308 |
Median Absolute Deviation (MAD) | 1.5 |
Skewness | 1.4406119 |
Sum | 85 |
Variance | 3.9646154 |
Monotonicity | Not monotonic |
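Because 이수챕터수 has only six distinct values, the statistics above can be rebuilt from its frequency table (1×6, 2×2, 3×9, 4×2, 5×6, 10×1). The sketch below assumes the bias-corrected sample estimators for skewness and excess kurtosis, which is the convention that reproduces the reported figures; it is an illustration, not the profiler's own code.

```python
from math import sqrt

# Frequency table of 이수챕터수 as reported: value -> count.
frequencies = {1: 6, 2: 2, 3: 9, 4: 2, 5: 6, 10: 1}
values = [v for v, count in frequencies.items() for _ in range(count)]

n = len(values)
mean = sum(values) / n
# Sums of powers of deviations from the mean.
m2 = sum((v - mean) ** 2 for v in values)
m3 = sum((v - mean) ** 3 for v in values)
m4 = sum((v - mean) ** 4 for v in values)

variance = m2 / (n - 1)  # sample variance
std_dev = sqrt(variance)
# Bias-corrected sample skewness.
skewness = n / ((n - 1) * (n - 2)) * m3 / std_dev ** 3
# Bias-corrected excess kurtosis (0 for a normal distribution).
kurtosis = (
    n * (n + 1) / ((n - 1) * (n - 2) * (n - 3)) * m4 / variance ** 2
    - 3 * (n - 1) ** 2 / ((n - 2) * (n - 3))
)

print(round(mean, 7), round(variance, 7), round(skewness, 4), round(kurtosis, 4))
```

The long right tail behind the positive skewness comes from the single course with 10 chapters.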
Common Values
Value | Count | Frequency (%) |
3 | 9 | 34.6% |
5 | 6 | 23.1% |
1 | 6 | 23.1% |
2 | 2 | 7.7% |
4 | 2 | 7.7% |
10 | 1 | 3.8% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 6 | 23.1% |
2 | 2 | 7.7% |
3 | 9 | 34.6% |
4 | 2 | 7.7% |
5 | 6 | 23.1% |
10 | 1 | 3.8% |
Maximum 10 values
Value | Count | Frequency (%) |
10 | 1 | 3.8% |
5 | 6 | 23.1% |
4 | 2 | 7.7% |
3 | 9 | 34.6% |
2 | 2 | 7.7% |
1 | 6 | 23.1% |
선택이수챕터수
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 3 |
---|---|
Distinct (%) | 11.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 340.0 B |
0 | 23 |
---|---|
1 | 2 |
3 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | 3.8% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 1 |
4th row | 1 |
5th row | 3 |
Common Values
Value | Count | Frequency (%) |
0 | 23 | 88.5% |
1 | 2 | 7.7% |
3 | 1 | 3.8% |
필수시험여부
Boolean
HIGH CORRELATION
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 7.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 158.0 B |
False | 25 |
---|---|
True | 1 |
Value | Count | Frequency (%) |
False | 25 | 96.2% |
True | 1 | 3.8% |
시험과락갯수
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 7.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 340.0 B |
0 | 25 |
---|---|
5 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | 3.8% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 5 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 25 | 96.2% |
5 | 1 | 3.8% |
Variable | 인덱스 | 강의명 | 노출여부 | 등록일자 | 이수챕터수 | 선택이수챕터수 | 필수시험여부 | 시험과락갯수 |
---|---|---|---|---|---|---|---|---|
인덱스 | 1.000 | 1.000 | 0.000 | 0.954 | 0.691 | 0.000 | 0.000 | 0.000 |
강의명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
노출여부 | 0.000 | 1.000 | 1.000 | 1.000 | 0.451 | 1.000 | 0.389 | 0.389 |
등록일자 | 0.954 | 1.000 | 1.000 | 1.000 | 0.990 | 1.000 | 0.000 | 0.000 |
이수챕터수 | 0.691 | 1.000 | 0.451 | 0.990 | 1.000 | 0.391 | 0.000 | 0.000 |
선택이수챕터수 | 0.000 | 1.000 | 1.000 | 1.000 | 0.391 | 1.000 | 0.422 | 0.422 |
필수시험여부 | 0.000 | 1.000 | 0.389 | 0.000 | 0.000 | 0.422 | 1.000 | 0.000 |
시험과락갯수 | 0.000 | 1.000 | 0.389 | 0.000 | 0.000 | 0.422 | 0.000 | 1.000 |
Variable | 필수시험여부 | 선택이수챕터수 | 노출여부 | 시험과락갯수 |
---|---|---|---|---|
필수시험여부 | 1.000 | 0.645 | 0.252 | 0.000 |
선택이수챕터수 | 0.645 | 1.000 | 0.979 | 0.645 |
노출여부 | 0.252 | 0.979 | 1.000 | 0.252 |
시험과락갯수 | 0.000 | 0.645 | 0.252 | 1.000 |
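The four-variable matrix above is numerically consistent with a bias-corrected Cramér's V, with Yates' continuity correction applied to 2×2 tables. The sketch below reproduces its entries under that assumption. The column contents are reconstructed rather than read from the file: the value counts and the sample rows show that every minority value falls in rows 2 to 4, so all other rows are assumed to hold the majority value. The function name `cramers_v` and the variable names are illustrative.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Bias-corrected Cramér's V; Yates' correction is used for 2x2 tables."""
    n = len(xs)
    row_totals, col_totals = Counter(xs), Counter(ys)
    cells = Counter(zip(xs, ys))
    r, k = len(row_totals), len(col_totals)
    use_yates = (r - 1) * (k - 1) == 1  # one degree of freedom
    chi2 = 0.0
    for a, row_total in row_totals.items():
        for b, col_total in col_totals.items():
            expected = row_total * col_total / n
            diff = abs(cells[(a, b)] - expected)
            if use_yates:
                diff -= min(0.5, diff)  # shrink each deviation by at most 0.5
            chi2 += diff * diff / expected
    phi2 = chi2 / n
    # Bias correction of phi^2 and of the table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    return sqrt(phi2_corr / denom) if denom > 0 else 0.0

# Reconstructed columns (26 rows, 0-based positions; see the assumption above).
exposed = ["Y"] * 26
exposed[2] = exposed[3] = "N"        # 노출여부
optional = [0] * 26
optional[2] = optional[3] = 1
optional[4] = 3                      # 선택이수챕터수
exam = ["N"] * 26
exam[2] = "Y"                        # 필수시험여부
fail_count = [0] * 26
fail_count[3] = 5                    # 시험과락갯수

print(round(cramers_v(exposed, optional), 3))   # 0.979
print(round(cramers_v(exam, optional), 3))      # 0.645
print(round(cramers_v(exposed, exam), 3))       # 0.252
print(round(cramers_v(exam, fail_count), 3))    # 0.0
```

With only 26 rows and one or two minority cases per variable, these coefficients are driven by a handful of rows, so the "High correlation" alerts should be read with caution.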
Variable | 인덱스 | 이수챕터수 | 노출여부 | 선택이수챕터수 | 필수시험여부 | 시험과락갯수 |
---|---|---|---|---|---|---|
인덱스 | 1.000 | 0.413 | 0.000 | 0.000 | 0.000 | 0.000 |
이수챕터수 | 0.413 | 1.000 | 0.285 | 0.137 | 0.000 | 0.000 |
노출여부 | 0.000 | 0.285 | 1.000 | 0.979 | 0.252 | 0.252 |
선택이수챕터수 | 0.000 | 0.137 | 0.979 | 1.000 | 0.645 | 0.645 |
필수시험여부 | 0.000 | 0.000 | 0.252 | 0.645 | 1.000 | 0.000 |
시험과락갯수 | 0.000 | 0.000 | 0.252 | 0.645 | 0.000 | 1.000 |
First rows
Row | 인덱스 | 강의명 | 노출여부 | 조회수 | 등록일자 | 이수챕터수 | 선택이수챕터수 | 필수시험여부 | 시험과락갯수 |
---|---|---|---|---|---|---|---|---|---|
0 | 1 | 개인정보보호 교육과정1 | Y | 0 | 2010-11-17 11:52 | 5 | 0 | N | 0 |
1 | 2 | 개인정보보호 교육과정2 | Y | 0 | 2010-11-17 11:52 | 5 | 0 | N | 0 |
2 | 3 | 정보보호 실무과정 | N | 0 | 2010-11-18 15:40 | 1 | 1 | Y | 0 |
3 | 4 | 정보보호 기초과정 | N | 0 | 2010-11-18 15:40 | 1 | 1 | N | 5 |
4 | 5 | 개인정보보호교육 - 업종별 교육과정 | Y | 0 | 2012-03-26 15:21 | 1 | 3 | N | 0 |
5 | 6 | 위치정보보호 교육과정 | Y | 0 | 2012-07-31 16:21 | 2 | 0 | N | 0 |
6 | 7 | 정보통신망법 신규제도 교육 | Y | 0 | 2012-12-24 13:40 | 3 | 0 | N | 0 |
7 | 8 | 2014 개인정보보호교육 | Y | 0 | 2014-09-11 13:13 | 2 | 0 | N | 0 |
8 | 10 | PIMS 교육 | Y | 0 | 2015-04-01 15:18 | 4 | 0 | N | 0 |
9 | 11 | CEO·CPO 교육과정 - 제1편 통찰 | Y | 0 | 2016-02-24 16:48 | 1 | 0 | N | 0 |
Last rows
Row | 인덱스 | 강의명 | 노출여부 | 조회수 | 등록일자 | 이수챕터수 | 선택이수챕터수 | 필수시험여부 | 시험과락갯수 |
---|---|---|---|---|---|---|---|---|---|
16 | 18 | 개인정보보호교육 - 사업자 기본교육 | Y | 0 | 2019-02-22 16:45 | 5 | 0 | N | 0 |
17 | 19 | 개인정보보호교육 - 사업자 실무교육 | Y | 0 | 2019-02-22 16:45 | 4 | 0 | N | 0 |
18 | 20 | 개인정보보호교육 - 사업자 전문교육 | Y | 0 | 2019-02-22 16:46 | 5 | 0 | N | 0 |
19 | 21 | 개인정보보호교육 - 교원 | Y | 0 | 2019-03-06 10:35 | 10 | 0 | N | 0 |
20 | 22 | Elementary school student | Y | 0 | 2020-02-11 16:13 | 3 | 0 | N | 0 |
21 | 23 | Middle school student | Y | 0 | 2020-02-11 16:13 | 3 | 0 | N | 0 |
22 | 24 | High school student | Y | 0 | 2020-02-11 16:13 | 3 | 0 | N | 0 |
23 | 25 | General public | Y | 0 | 2020-02-11 16:14 | 3 | 0 | N | 0 |
24 | 26 | ngi nc ngoi nhp c v du hc sinh | Y | 0 | 2020-02-11 16:14 | 5 | 0 | N | 0 |
25 | 27 | 위치정보보호 교육과정(신) | Y | 0 | 2020-02-18 11:08 | 5 | 0 | N | 0 |