Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory478.5 KiB
Average record size in memory49.0 B

Variable types

Numeric1
Categorical3
Boolean1

Dataset

Description한국기술교육대학교 온라인평생교육원 스마트 직업훈련 플랫폼 (STEP)에 대한 게시판 내 게시글들의 아이디 및 카테고리 내용을 제공합니다.
Author한국기술교육대학교
URLhttps://www.data.go.kr/data/15091108/fileData.do

Alerts

게시판명 is highly overall correlated with 타입 코드 and 1 other fieldsHigh correlation
편집 권한 is highly overall correlated with 아이디 and 2 other fieldsHigh correlation
타입 코드 is highly overall correlated with 게시판명 and 1 other fieldsHigh correlation
아이디 is highly overall correlated with 편집 권한High correlation
수정가능 여부 is highly imbalanced (99.9%)Imbalance
아이디 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:09:31.023573
Analysis finished2023-12-12 18:09:31.869263
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

아이디
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean121208.28
Minimum3
Maximum277412
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T03:09:31.963735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile4898.45
Q138339.75
median102188.5
Q3203420.75
95-th percentile263672.8
Maximum277412
Range277409
Interquartile range (IQR)165081

Descriptive statistics

Standard deviation89071.843
Coefficient of variation (CV)0.73486598
Kurtosis-1.3978111
Mean121208.28
Median Absolute Deviation (MAD)78290
Skewness0.25365688
Sum1.2120828 × 109
Variance7.9337932 × 109
MonotonicityNot monotonic
2023-12-13T03:09:32.141076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
205138 1
 
< 0.1%
6147 1
 
< 0.1%
9021 1
 
< 0.1%
3063 1
 
< 0.1%
41689 1
 
< 0.1%
200697 1
 
< 0.1%
177118 1
 
< 0.1%
20390 1
 
< 0.1%
52982 1
 
< 0.1%
8385 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
3 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
31 1
< 0.1%
36 1
< 0.1%
43 1
< 0.1%
51 1
< 0.1%
60 1
< 0.1%
62 1
< 0.1%
74 1
< 0.1%
ValueCountFrequency (%)
277412 1
< 0.1%
277359 1
< 0.1%
277336 1
< 0.1%
277302 1
< 0.1%
277296 1
< 0.1%
277286 1
< 0.1%
277274 1
< 0.1%
277264 1
< 0.1%
277238 1
< 0.1%
277232 1
< 0.1%

타입 코드
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공지사항
2416 
자료실
2398 
메모
2392 
질의응답
2385 
콘텐츠 개발공정
 
175
Other values (4)
 
234

Length

Max length9
Median length8
Mean length3.4606
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공지사항
2nd row메모
3rd row공지사항
4th row자료실
5th row공지사항

Common Values

ValueCountFrequency (%)
공지사항 2416
24.2%
자료실 2398
24.0%
메모 2392
23.9%
질의응답 2385
23.8%
콘텐츠 개발공정 175
 
1.8%
콘텐츠 오류 신고 124
 
1.2%
콘텐츠 오류신고 73
 
0.7%
콘텐츠 개발 공정 34
 
0.3%
자주묻는질문 3
 
< 0.1%

Length

2023-12-13T03:09:32.699191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:09:32.899534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공지사항 2416
22.9%
자료실 2398
22.7%
메모 2392
22.6%
질의응답 2385
22.6%
콘텐츠 406
 
3.8%
개발공정 175
 
1.7%
오류 124
 
1.2%
신고 124
 
1.2%
오류신고 73
 
0.7%
개발 34
 
0.3%
Other values (2) 37
 
0.4%

게시판명
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공지사항
2416 
자료실
2398 
메모
2392 
질의응답
2385 
콘텐츠 개발공정
 
175
Other values (4)
 
234

Length

Max length9
Median length8
Mean length3.4606
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공지사항
2nd row메모
3rd row공지사항
4th row자료실
5th row공지사항

Common Values

ValueCountFrequency (%)
공지사항 2416
24.2%
자료실 2398
24.0%
메모 2392
23.9%
질의응답 2385
23.8%
콘텐츠 개발공정 175
 
1.8%
콘텐츠 오류 신고 124
 
1.2%
콘텐츠 오류신고 73
 
0.7%
콘텐츠 개발 공정 34
 
0.3%
자주묻는질문 3
 
< 0.1%

Length

2023-12-13T03:09:33.106165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:09:33.269840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공지사항 2416
22.9%
자료실 2398
22.7%
메모 2392
22.6%
질의응답 2385
22.6%
콘텐츠 406
 
3.8%
개발공정 175
 
1.7%
오류 124
 
1.2%
신고 124
 
1.2%
오류신고 73
 
0.7%
개발 34
 
0.3%
Other values (2) 37
 
0.4%

편집 권한
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반회원
5191 
운영자
4809 

Length

Max length4
Median length4
Mean length3.5191
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row운영자
2nd row일반회원
3rd row운영자
4th row운영자
5th row운영자

Common Values

ValueCountFrequency (%)
일반회원 5191
51.9%
운영자 4809
48.1%

Length

2023-12-13T03:09:33.460798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:09:33.608830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반회원 5191
51.9%
운영자 4809
48.1%

수정가능 여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
True
9999 
False
 
1
ValueCountFrequency (%)
True 9999
> 99.9%
False 1
 
< 0.1%
2023-12-13T03:09:33.732949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-13T03:09:31.420077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:09:33.822974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아이디타입 코드게시판명편집 권한수정가능 여부
아이디1.0000.5320.5320.7480.000
타입 코드0.5321.0001.0000.9760.000
게시판명0.5321.0001.0000.9760.000
편집 권한0.7480.9760.9761.0000.000
수정가능 여부0.0000.0000.0000.0001.000
2023-12-13T03:09:33.968670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
게시판명수정가능 여부편집 권한타입 코드
게시판명1.0000.0000.9991.000
수정가능 여부0.0001.0000.0000.000
편집 권한0.9990.0001.0000.999
타입 코드1.0000.0000.9991.000
2023-12-13T03:09:34.101498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
아이디타입 코드게시판명편집 권한수정가능 여부
아이디1.0000.2760.2760.5870.000
타입 코드0.2761.0001.0000.9990.000
게시판명0.2761.0001.0000.9990.000
편집 권한0.5870.9990.9991.0000.000
수정가능 여부0.0000.0000.0000.0001.000

Missing values

2023-12-13T03:09:31.654747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:09:31.800850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

아이디타입 코드게시판명편집 권한수정가능 여부
71782205138공지사항공지사항운영자Y
2671949806메모메모일반회원Y
64218180608공지사항공지사항운영자Y
1262820653자료실자료실운영자Y
66562187696공지사항공지사항운영자Y
71945205635질의응답질의응답일반회원Y
83038734공지사항공지사항운영자Y
1436422408자료실자료실운영자Y
2726050461메모메모일반회원Y
1197019986자료실자료실운영자Y
아이디타입 코드게시판명편집 권한수정가능 여부
85510251325질의응답질의응답일반회원Y
2356839255질의응답질의응답일반회원Y
4621599280공지사항공지사항운영자Y
86151253268공지사항공지사항운영자Y
1548223538자료실자료실운영자Y
52627118727메모메모일반회원Y
1446422511자료실자료실운영자Y
64089180221질의응답질의응답일반회원Y
259295콘텐츠 오류 신고콘텐츠 오류 신고일반회원Y
68730194633메모메모일반회원Y