Overview

Dataset statistics

Number of variables4
Number of observations878
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory29.3 KiB
Average record size in memory34.2 B

Variable types

Numeric2
Categorical1
DateTime1

Dataset

Description한국기술교육대학교 온라인평생교육원 스마트 직업훈련 플랫폼 (STEP)에 대한 라이브세미나 시청회수 관련 된 정보를 제공합니다.
Author한국기술교육대학교
URLhttps://www.data.go.kr/data/15090840/fileData.do

Alerts

Dataset has 1 (0.1%) duplicate rowsDuplicates
등록 국가 is highly imbalanced (61.7%)Imbalance
총 시청 시간 has 103 (11.7%) zerosZeros

Reproduction

Analysis started2023-12-12 05:18:41.253206
Analysis finished2023-12-12 05:18:42.080994
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct27
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean943.29385
Minimum274
Maximum1039
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.8 KiB
2023-12-12T14:18:42.149085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum274
5-th percentile782
Q1962
median962
Q3989
95-th percentile1027
Maximum1039
Range765
Interquartile range (IQR)27

Descriptive statistics

Standard deviation92.871303
Coefficient of variation (CV)0.098454265
Kurtosis12.238428
Mean943.29385
Median Absolute Deviation (MAD)12
Skewness-2.8769644
Sum828212
Variance8625.0789
MonotonicityNot monotonic
2023-12-12T14:18:42.283489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
962 262
29.8%
974 92
 
10.5%
973 89
 
10.1%
782 67
 
7.6%
1023 63
 
7.2%
1000 56
 
6.4%
795 53
 
6.0%
956 51
 
5.8%
997 45
 
5.1%
1027 28
 
3.2%
Other values (17) 72
 
8.2%
ValueCountFrequency (%)
274 1
 
0.1%
298 2
 
0.2%
481 2
 
0.2%
484 6
 
0.7%
782 67
7.6%
795 53
6.0%
807 2
 
0.2%
811 7
 
0.8%
824 1
 
0.1%
825 1
 
0.1%
ValueCountFrequency (%)
1039 10
 
1.1%
1028 13
 
1.5%
1027 28
3.2%
1023 63
7.2%
1012 2
 
0.2%
1000 56
6.4%
997 45
5.1%
989 6
 
0.7%
983 1
 
0.1%
982 5
 
0.6%

총 시청 시간
Real number (ℝ)

ZEROS 

Distinct356
Distinct (%)40.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2141.5661
Minimum0
Maximum150760
Zeros103
Zeros (%)11.7%
Negative0
Negative (%)0.0%
Memory size7.8 KiB
2023-12-12T14:18:42.426832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q110
median70
Q31378.75
95-th percentile7343.75
Maximum150760
Range150760
Interquartile range (IQR)1368.75

Descriptive statistics

Standard deviation8246.7173
Coefficient of variation (CV)3.8507882
Kurtosis142.87024
Mean2141.5661
Median Absolute Deviation (MAD)70
Skewness10.162996
Sum1880295
Variance68008346
MonotonicityNot monotonic
2023-12-12T14:18:42.574213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 103
 
11.7%
5 79
 
9.0%
10 49
 
5.6%
15 36
 
4.1%
25 33
 
3.8%
20 30
 
3.4%
35 20
 
2.3%
30 18
 
2.1%
85 12
 
1.4%
60 12
 
1.4%
Other values (346) 486
55.4%
ValueCountFrequency (%)
0 103
11.7%
1 1
 
0.1%
5 79
9.0%
6 1
 
0.1%
10 49
5.6%
11 2
 
0.2%
12 1
 
0.1%
15 36
 
4.1%
17 1
 
0.1%
20 30
 
3.4%
ValueCountFrequency (%)
150760 1
0.1%
76929 1
0.1%
71177 1
0.1%
70709 1
0.1%
46315 1
0.1%
45230 1
0.1%
43115 1
0.1%
42867 1
0.1%
42724 1
0.1%
42140 1
0.1%

등록 국가
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
KR
762 
UNKNOWN
109 
US
 
7

Length

Max length7
Median length2
Mean length2.6207289
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowUNKNOWN
2nd rowKR
3rd rowKR
4th rowKR
5th rowKR

Common Values

ValueCountFrequency (%)
KR 762
86.8%
UNKNOWN 109
 
12.4%
US 7
 
0.8%

Length

2023-12-12T14:18:42.707527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:18:42.845459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kr 762
86.8%
unknown 109
 
12.4%
us 7
 
0.8%
Distinct877
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size7.0 KiB
Minimum2016-10-10 14:38:32
Maximum2018-12-05 20:08:50
2023-12-12T14:18:42.962746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:18:43.129013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T14:18:41.605445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:18:41.379601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:18:41.742792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:18:41.492262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:18:43.241791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
라이브 세미나 아이디총 시청 시간등록 국가
라이브 세미나 아이디1.0000.0000.154
총 시청 시간0.0001.0000.000
등록 국가0.1540.0001.000
2023-12-12T14:18:43.654384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
라이브 세미나 아이디총 시청 시간등록 국가
라이브 세미나 아이디1.0000.0140.065
총 시청 시간0.0141.0000.000
등록 국가0.0650.0001.000

Missing values

2023-12-12T14:18:41.913225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:18:42.028641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

라이브 세미나 아이디총 시청 시간등록 국가등록 일시
08250UNKNOWN2017-01-18 18:47:14
12980KR2017-06-13 09:43:08
22980KR2017-06-13 09:43:11
382420KR2017-06-13 09:44:20
479535KR2016-10-28 10:59:21
579520KR2016-10-28 11:00:01
678210KR2016-10-28 11:00:27
797435KR2018-05-31 09:46:05
89741980KR2018-05-31 09:46:46
99890UNKNOWN2018-07-03 16:24:53
라이브 세미나 아이디총 시청 시간등록 국가등록 일시
86810271660KR2018-10-17 22:38:19
869102740KR2018-10-17 23:07:11
870102785KR2018-10-17 23:07:55
871102765KR2018-10-17 21:22:04
8721027330KR2018-10-20 16:29:23
87310273240KR2018-10-26 12:18:19
87410282350KR2018-11-16 19:24:47
875102720KR2018-11-16 20:04:28
87610285KR2018-11-16 20:05:10
877102765KR2018-11-21 16:55:45

Duplicate rows

Most frequently occurring

라이브 세미나 아이디총 시청 시간등록 국가등록 일시# duplicates
096255KR2017-12-07 21:38:282