Overview

Dataset statistics

Number of variables8
Number of observations92
Missing cells49
Missing cells (%)6.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.1 KiB
Average record size in memory68.4 B

Variable types

Numeric2
Categorical5
DateTime1

Dataset

Description온라인 교육프로그램에 대한 교육과정 유형, 과제, 진도, 수강자 이용현황 등과 같은 정보입니다.
Author국가평생교육진흥원
URLhttps://www.data.go.kr/data/15072260/fileData.do

Alerts

파일테이블IDX is highly overall correlated with 메뉴구분코드 and 2 other fieldsHigh correlation
대분류코드 is highly overall correlated with 하위정보명High correlation
메뉴구분코드 is highly overall correlated with 파일테이블IDX and 2 other fieldsHigh correlation
하위정보명 is highly overall correlated with 대분류코드 and 1 other fieldsHigh correlation
중분류명 is highly overall correlated with 파일테이블IDX and 2 other fieldsHigh correlation
등록자 is highly overall correlated with 파일테이블IDX and 2 other fieldsHigh correlation
수정자 is highly overall correlated with 하위정보명High correlation
수정일 has 49 (53.3%) missing valuesMissing
파일테이블IDX has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:47:30.062165
Analysis finished2023-12-12 22:47:31.135598
Duration1.07 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

파일테이블IDX
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct92
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean109.3913
Minimum60
Maximum158
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size960.0 B
2023-12-13T07:47:31.238183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum60
5-th percentile64.55
Q185.5
median109.5
Q3134.25
95-th percentile153.45
Maximum158
Range98
Interquartile range (IQR)48.75

Descriptive statistics

Standard deviation28.680489
Coefficient of variation (CV)0.26218253
Kurtosis-1.159998
Mean109.3913
Median Absolute Deviation (MAD)25
Skewness-0.015202294
Sum10064
Variance822.57047
MonotonicityNot monotonic
2023-12-13T07:47:31.710598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
65 1
 
1.1%
158 1
 
1.1%
83 1
 
1.1%
82 1
 
1.1%
81 1
 
1.1%
79 1
 
1.1%
78 1
 
1.1%
80 1
 
1.1%
76 1
 
1.1%
75 1
 
1.1%
Other values (82) 82
89.1%
ValueCountFrequency (%)
60 1
1.1%
61 1
1.1%
62 1
1.1%
63 1
1.1%
64 1
1.1%
65 1
1.1%
66 1
1.1%
68 1
1.1%
69 1
1.1%
70 1
1.1%
ValueCountFrequency (%)
158 1
1.1%
157 1
1.1%
156 1
1.1%
155 1
1.1%
154 1
1.1%
153 1
1.1%
152 1
1.1%
151 1
1.1%
150 1
1.1%
149 1
1.1%

메뉴구분코드
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size868.0 B
3
26 
4
26 
2
23 
1
17 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row2
4th row3
5th row3

Common Values

ValueCountFrequency (%)
3 26
28.3%
4 26
28.3%
2 23
25.0%
1 17
18.5%

Length

2023-12-13T07:47:31.860239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:47:31.967723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 26
28.3%
4 26
28.3%
2 23
25.0%
1 17
18.5%

대분류코드
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.3478261
Minimum1
Maximum11
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size960.0 B
2023-12-13T07:47:32.066862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q37
95-th percentile11
Maximum11
Range10
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.0685744
Coefficient of variation (CV)0.57379847
Kurtosis-0.75620207
Mean5.3478261
Median Absolute Deviation (MAD)2
Skewness0.34911789
Sum492
Variance9.4161491
MonotonicityNot monotonic
2023-12-13T07:47:32.174898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
6 20
21.7%
4 15
16.3%
1 12
13.0%
11 9
9.8%
2 9
9.8%
7 8
 
8.7%
3 5
 
5.4%
10 5
 
5.4%
5 4
 
4.3%
8 3
 
3.3%
ValueCountFrequency (%)
1 12
13.0%
2 9
9.8%
3 5
 
5.4%
4 15
16.3%
5 4
 
4.3%
6 20
21.7%
7 8
 
8.7%
8 3
 
3.3%
9 2
 
2.2%
10 5
 
5.4%
ValueCountFrequency (%)
11 9
9.8%
10 5
 
5.4%
9 2
 
2.2%
8 3
 
3.3%
7 8
 
8.7%
6 20
21.7%
5 4
 
4.3%
4 15
16.3%
3 5
 
5.4%
2 9
9.8%

하위정보명
Categorical

HIGH CORRELATION 

Distinct36
Distinct (%)39.1%
Missing0
Missing (%)0.0%
Memory size868.0 B
장애학생 교육
 
5
성교육
 
4
아동학대
 
4
자녀양육법
 
4
평생학습계좌제
 
4
Other values (31)
71 

Length

Max length13
Median length8.5
Mean length5.4565217
Min length3

Unique

Unique11 ?
Unique (%)12.0%

Sample

1st row자녀돌봄서비스
2nd row평생학습계좌제
3rd row교육평가
4th row성교육
5th row아동학대

Common Values

ValueCountFrequency (%)
장애학생 교육 5
 
5.4%
성교육 4
 
4.3%
아동학대 4
 
4.3%
자녀양육법 4
 
4.3%
평생학습계좌제 4
 
4.3%
정책과제 4
 
4.3%
학생봉사활동 4
 
4.3%
자녀학교생활 4
 
4.3%
다문화 교육 4
 
4.3%
창의적 체험활동 3
 
3.3%
Other values (26) 52
56.5%

Length

2023-12-13T07:47:32.308821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
교육 11
 
9.6%
장애학생 5
 
4.4%
정책과제 4
 
3.5%
다문화 4
 
3.5%
학생봉사활동 4
 
3.5%
자녀학교생활 4
 
3.5%
평생학습계좌제 4
 
3.5%
자녀양육법 4
 
3.5%
아동학대 4
 
3.5%
성교육 4
 
3.5%
Other values (32) 66
57.9%

중분류명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size868.0 B
2017-02-15
52 
2017-02-16
40 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017-02-15
2nd row2017-02-15
3rd row2017-02-15
4th row2017-02-15
5th row2017-02-15

Common Values

ValueCountFrequency (%)
2017-02-15 52
56.5%
2017-02-16 40
43.5%

Length

2023-12-13T07:47:32.430567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:47:32.527763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017-02-15 52
56.5%
2017-02-16 40
43.5%

등록자
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size868.0 B
TEST1
46 
test002
46 

Length

Max length7
Median length6
Mean length6
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTEST1
2nd rowtest002
3rd rowtest002
4th rowtest002
5th rowtest002

Common Values

ValueCountFrequency (%)
TEST1 46
50.0%
test002 46
50.0%

Length

2023-12-13T07:47:32.644396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:47:32.763184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
test1 46
50.0%
test002 46
50.0%

수정일
Date

MISSING 

Distinct36
Distinct (%)83.7%
Missing49
Missing (%)53.3%
Memory size868.0 B
Minimum2017-02-15 00:00:00
Maximum2017-04-28 09:24:00
2023-12-13T07:47:32.877601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:47:33.026024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)

수정자
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size868.0 B
<NA>
49 
karam9940
39 
test002
 
4

Length

Max length9
Median length4
Mean length6.25
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowkaram9940
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 49
53.3%
karam9940 39
42.4%
test002 4
 
4.3%

Length

2023-12-13T07:47:33.150141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:47:33.239119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 49
53.3%
karam9940 39
42.4%
test002 4
 
4.3%

Interactions

2023-12-13T07:47:30.649789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:47:30.471492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:47:30.750565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:47:30.568812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:47:33.302374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
파일테이블IDX메뉴구분코드대분류코드하위정보명중분류명등록자수정일수정자
파일테이블IDX1.0000.9480.6990.0000.9970.9830.9290.191
메뉴구분코드0.9481.0000.0000.0000.9440.9050.9920.202
대분류코드0.6990.0001.0001.0000.2850.0000.4810.509
하위정보명0.0000.0001.0001.0000.0000.0000.9011.000
중분류명0.9970.9440.2850.0001.0000.9550.9300.000
등록자0.9830.9050.0000.0000.9551.0000.7820.000
수정일0.9290.9920.4810.9010.9300.7821.0001.000
수정자0.1910.2020.5091.0000.0000.0001.0001.000
2023-12-13T07:47:33.403831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록자하위정보명중분류명메뉴구분코드수정자
등록자1.0000.0000.8090.7120.000
하위정보명0.0001.0000.0000.0000.698
중분류명0.8090.0001.0000.7770.000
메뉴구분코드0.7120.0000.7771.0000.124
수정자0.0000.6980.0000.1241.000
2023-12-13T07:47:33.499326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
파일테이블IDX대분류코드메뉴구분코드하위정보명중분류명등록자수정자
파일테이블IDX1.0000.1560.8400.0000.9060.8460.111
대분류코드0.1561.0000.0000.8260.1960.0000.462
메뉴구분코드0.8400.0001.0000.0000.7770.7120.124
하위정보명0.0000.8260.0001.0000.0000.0000.698
중분류명0.9060.1960.7770.0001.0000.8090.000
등록자0.8460.0000.7120.0000.8091.0000.000
수정자0.1110.4620.1240.6980.0000.0001.000

Missing values

2023-12-13T07:47:30.877519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:47:31.071881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

파일테이블IDX메뉴구분코드대분류코드하위정보명중분류명등록자수정일수정자
06516자녀돌봄서비스2017-02-15TEST12017-04-06 12:07karam9940
19727평생학습계좌제2017-02-15test002<NA><NA>
29821교육평가2017-02-15test002<NA><NA>
311136성교육2017-02-15test002<NA><NA>
411236아동학대2017-02-15test002<NA><NA>
511336청소년활동2017-02-15test0022017-04-06 1:13karam9940
611839고교(입시)정보2017-02-16TEST12017-04-06 1:08karam9940
7119311인성교육2017-02-16TEST1<NA><NA>
8120311창의교육2017-02-16TEST1<NA><NA>
9121311창의적 체험활동2017-02-16TEST12017-02-16test002
파일테이블IDX메뉴구분코드대분류코드하위정보명중분류명등록자수정일수정자
8215247인문, 교양2017-02-16TEST12017-04-06 1:23karam9940
8315347평생학습계좌제2017-02-16TEST1<NA><NA>
8415441교육평가2017-02-16TEST1<NA><NA>
8515541학교정보2017-02-16TEST12017-04-03 6:33karam9940
8615641자녀학교생활2017-02-16TEST1<NA><NA>
877217평생학습계좌제2017-02-15test002<NA><NA>
88116310자유학기제2017-02-16TEST12017-04-06 1:09karam9940
897117인문교양2017-02-15test0022017-04-06 12:00karam9940
907311교육과정2017-02-15test0022017-04-28 9:24karam9940
91117310진로교육2017-02-16TEST12017-04-06 1:08karam9940