Overview

Dataset statistics

Number of variables: 3
Number of observations: 60
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.7 KiB
Average record size in memory: 28.2 B

Variable types

Numeric: 1
Categorical: 2

Dataset

Description: Data on lecture exam details from the online personal information protection portal.
Author: Korea Internet & Security Agency (한국인터넷진흥원)
URL: https://www.data.go.kr/data/15070605/fileData.do

Alerts

강의시험상세번호 is highly overall correlated with 상세내용 (High correlation)
상세내용 is highly overall correlated with 강의시험상세번호 (High correlation)
강의 인덱스 has 4 (6.7%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-12 23:55:02.687906
Analysis finished: 2023-12-12 23:55:02.969044
Duration: 0.28 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
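
A report like this one can be regenerated from the source CSV. The following is a minimal sketch, not the configuration actually used here: the function name `build_profile`, the file name `lecture_exam_details.csv`, the report title, and the `cp949` encoding are all assumptions for illustration. The libraries are imported inside the function so the module still loads where they are not installed.

```python
from pathlib import Path


def build_profile(csv_path: str, html_path: str = "report.html",
                  encoding: str = "utf-8") -> Path:
    """Profile a CSV file and write the HTML report to html_path."""
    # Lazy imports: pandas and ydata-profiling are only needed when called.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    # The title is a placeholder, not the title of the original report.
    profile = ProfileReport(df, title="Lecture exam details")
    profile.to_file(html_path)
    return Path(html_path)


# Hypothetical usage; the file name stands in for the downloaded CSV:
# build_profile("lecture_exam_details.csv", encoding="cp949")
```

The encoding is a parameter because files from public data portals are not always UTF-8; check the actual file before relying on either value.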

Variables

강의 인덱스 (lecture index)
Real number (ℝ)

ZEROS 

Distinct: 15
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7
Minimum: 0
Maximum: 14
Zeros: 4
Zeros (%): 6.7%
Negative: 0
Negative (%): 0.0%
Memory size: 672.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 3
Median: 7
Q3: 11
95-th percentile: 14
Maximum: 14
Range: 14
Interquartile range (IQR): 8
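
The quantile figures can be checked by hand, because the value tables for this variable show that each of the 15 values 0 to 14 occurs exactly 4 times. The sketch below rebuilds that column and uses linear interpolation between order statistics (`method="inclusive"`). That interpolation rule is an assumption about how the report computes quantiles, although it reproduces the reported numbers.

```python
import statistics

# Reconstructed column: values 0..14, four rows each (60 rows in total).
values = [v for v in range(15) for _ in range(4)]

# Quartiles with linear interpolation over the observed range.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

# 5th and 95th percentiles are the first and last of the 19 cut points for n=20.
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

iqr = q3 - q1
value_range = max(values) - min(values)

print(q1, median, q3, p5, p95, iqr, value_range)
```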

Descriptive statistics

Standard deviation: 4.3569543
Coefficient of variation (CV): 0.62242204
Kurtosis: -1.2109379
Mean: 7
Median Absolute Deviation (MAD): 4
Skewness: 0
Sum: 420
Variance: 18.983051
Monotonicity: Increasing
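
These descriptive statistics follow from the same distribution (values 0 to 14, four rows each). The sketch below recomputes them with the standard library. The variance uses the n-1 denominator, and the kurtosis uses the bias-corrected sample excess kurtosis formula. Both are assumptions about the report's conventions, chosen because they match the reported figures.

```python
import math
import statistics

# Reconstructed column: values 0..14, four rows each (60 rows in total).
values = [v for v in range(15) for _ in range(4)]
n = len(values)

total = sum(values)
mean = total / n
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = math.sqrt(variance)
cv = std / mean                         # coefficient of variation

# Median absolute deviation around the median.
med = statistics.median(values)
mad = statistics.median([abs(v - med) for v in values])

# Central moments with the n denominator, used for skewness and kurtosis.
m2 = sum((v - mean) ** 2 for v in values) / n
m3 = sum((v - mean) ** 3 for v in values) / n
m4 = sum((v - mean) ** 4 for v in values) / n

skewness = math.sqrt(n * (n - 1)) / (n - 2) * m3 / m2 ** 1.5
excess = m4 / m2 ** 2 - 3
# Bias-corrected sample excess kurtosis.
kurtosis = (n - 1) / ((n - 2) * (n - 3)) * ((n + 1) * excess + 6)

print(total, mean, variance, std, cv, mad, skewness, kurtosis)
```

The skewness is zero because the distribution is symmetric around 7, and the negative kurtosis reflects the flat, uniform shape of the counts.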
Histogram with fixed size bins (bins=15)

Common values

Value               Count   Frequency (%)
0                   4       6.7%
1                   4       6.7%
2                   4       6.7%
3                   4       6.7%
4                   4       6.7%
5                   4       6.7%
6                   4       6.7%
7                   4       6.7%
8                   4       6.7%
9                   4       6.7%
Other values (5)    20      33.3%

Minimum 10 values

Value   Count   Frequency (%)
0       4       6.7%
1       4       6.7%
2       4       6.7%
3       4       6.7%
4       4       6.7%
5       4       6.7%
6       4       6.7%
7       4       6.7%
8       4       6.7%
9       4       6.7%

Maximum 10 values

Value   Count   Frequency (%)
14      4       6.7%
13      4       6.7%
12      4       6.7%
11      4       6.7%
10      4       6.7%
9       4       6.7%
8       4       6.7%
7       4       6.7%
6       4       6.7%
5       4       6.7%

강의시험상세번호 (lecture exam detail number)
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Memory size: 612.0 B

Value   Count
1       15
3       15
4       15
2       14
0       1

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 1
Unique (%): 1.7%

Sample

1st row: 1
2nd row: 2
3rd row: 3
4th row: 4
5th row: 1

Common Values

Value   Count   Frequency (%)
1       15      25.0%
3       15      25.0%
4       15      25.0%
2       14      23.3%
0       1       1.7%
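
The percentages for this variable are plain shares of the 60 rows. As a small check, the sketch below derives the frequency column, the distinct count, and the count of values that occur exactly once from the reported counts. The dictionary is typed in from the table above and is not read from the source file.

```python
# Counts per category as reported for 강의시험상세번호.
counts = {"1": 15, "3": 15, "4": 15, "2": 14, "0": 1}

n_rows = sum(counts.values())

# Share of rows per category, formatted to one decimal place like the report.
freq = {value: f"{100 * c / n_rows:.1f}%" for value, c in counts.items()}

distinct = len(counts)                                # number of different values
unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once

distinct_pct = f"{100 * distinct / n_rows:.1f}%"
unique_pct = f"{100 * unique / n_rows:.1f}%"

print(n_rows, freq, distinct, distinct_pct, unique, unique_pct)
```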

Length

Histogram of lengths of the category

Common Values (Plot)

Value   Count   Frequency (%)
1       15      25.0%
3       15      25.0%
4       15      25.0%
2       14      23.3%
0       1       1.7%

상세내용 (detail content)
Categorical

HIGH CORRELATION 

Distinct: 22
Distinct (%): 36.7%
Missing: 0
Missing (%): 0.0%
Memory size: 612.0 B

Value                 Count
어쭈구리              11
얼씨구                11
오마이갓              11
이런                  9
지구야 우리가 간다    1
Other values (17)     17

Length

Max length: 10
Median length: 9
Mean length: 3.7333333
Min length: 2

Unique

Unique: 18
Unique (%): 30.0%

Sample

1st row: 항목 1
2nd row: 항목 2
3rd row: 항목 3
4th row: 항목 4
5th row: 어쭈구리

Common Values

Value                 Count   Frequency (%)
어쭈구리              11      18.3%
얼씨구                11      18.3%
오마이갓              11      18.3%
이런                  9       15.0%
지구야 우리가 간다    1       1.7%
항목 3                1       1.7%
항목 4                1       1.7%
지구는 살아있다       1       1.7%
아무런 이유없이       1       1.7%
폼생폼사              1       1.7%
Other values (12)     12      20.0%

Length

Histogram of lengths of the category

Value                Count   Frequency (%)
어쭈구리             11      15.5%
오마이갓             11      15.5%
얼씨구               11      15.5%
이런                 9       12.7%
항목                 4       5.6%
칸초베리             1       1.4%
사이다               1       1.4%
오란씨               1       1.4%
코카콜라             1       1.4%
붕어빵               1       1.4%
Other values (20)    20      28.2%

Interactions


Correlations

Variable            강의 인덱스   강의시험상세번호   상세내용
강의 인덱스         1.000         0.000              0.000
강의시험상세번호    0.000         1.000              0.935
상세내용            0.000         0.935              1.000

Variable            강의시험상세번호   상세내용
강의시험상세번호    1.000              0.660
상세내용            0.660              1.000

Variable            강의 인덱스   강의시험상세번호   상세내용
강의 인덱스         1.000         0.000              0.000
강의시험상세번호    0.000         1.000              0.660
상세내용            0.000         0.660              1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First 10 rows

Row   강의 인덱스   강의시험상세번호   상세내용
0     0             1                  항목 1
1     0             2                  항목 2
2     0             3                  항목 3
3     0             4                  항목 4
4     1             1                  어쭈구리
5     1             2                  이런
6     1             3                  지구는 살아있다
7     1             4                  오마이갓
8     2             1                  어쭈구리
9     2             2                  아무런 이유없이

Last 10 rows

Row   강의 인덱스   강의시험상세번호   상세내용
50    12            3                  얼씨구
51    12            4                  칸초베리
52    13            1                  어쭈구리
53    13            2                  공룡
54    13            3                  얼씨구
55    13            4                  오마이갓
56    14            1                  마구
57    14            2                  이런
58    14            3                  얼씨구
59    14            4                  오마이갓