Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory488.3 KiB
Average record size in memory50.0 B

Variable types

Numeric2
Boolean1
Categorical1
DateTime1

Dataset

Description한국기술교육대학교 온라인평생교육원 스마트 직업훈련 플랫폼 (STEP)에 대한 사용자 약관서비스 관련 내용을 제공합니다.
Author한국기술교육대학교
URLhttps://www.data.go.kr/data/15090970/fileData.do

Alerts

동의 여부 is highly imbalanced (99.7%)Imbalance
등록 국가 is highly imbalanced (86.0%)Imbalance

Reproduction

Analysis started2023-12-12 09:43:03.685470
Analysis finished2023-12-12 09:43:04.610252
Duration0.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사용자 인덱스
Real number (ℝ)

Distinct9581
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean327522.55
Minimum402
Maximum766062
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:43:04.694299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum402
5-th percentile58295
Q1157357.75
median280796
Q3485179.5
95-th percentile700780.95
Maximum766062
Range765660
Interquartile range (IQR)327821.75

Descriptive statistics

Standard deviation207227.44
Coefficient of variation (CV)0.63271197
Kurtosis-0.94119931
Mean327522.55
Median Absolute Deviation (MAD)149849.5
Skewness0.5102777
Sum3.2752255 × 109
Variance4.294321 × 1010
MonotonicityNot monotonic
2023-12-12T18:43:04.837600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
296538 3
 
< 0.1%
549869 3
 
< 0.1%
523345 3
 
< 0.1%
196119 3
 
< 0.1%
314987 3
 
< 0.1%
629242 3
 
< 0.1%
618410 3
 
< 0.1%
91580 3
 
< 0.1%
602855 3
 
< 0.1%
262153 3
 
< 0.1%
Other values (9571) 9970
99.7%
ValueCountFrequency (%)
402 1
< 0.1%
470 1
< 0.1%
488 1
< 0.1%
521 1
< 0.1%
524 1
< 0.1%
554 1
< 0.1%
565 1
< 0.1%
594 1
< 0.1%
1468 1
< 0.1%
1493 1
< 0.1%
ValueCountFrequency (%)
766062 1
< 0.1%
766034 1
< 0.1%
766027 2
< 0.1%
765942 1
< 0.1%
765758 1
< 0.1%
765227 1
< 0.1%
764887 1
< 0.1%
764618 1
< 0.1%
764441 1
< 0.1%
764371 1
< 0.1%
Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.7486
Minimum1
Maximum109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:43:04.958257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median4
Q34
95-th percentile46
Maximum109
Range108
Interquartile range (IQR)0

Descriptive statistics

Standard deviation12.085794
Coefficient of variation (CV)1.559739
Kurtosis13.852382
Mean7.7486
Median Absolute Deviation (MAD)0
Skewness3.6343489
Sum77486
Variance146.0664
MonotonicityNot monotonic
2023-12-12T18:43:05.087850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
4 7018
70.2%
7 1509
 
15.1%
46 582
 
5.8%
1 537
 
5.4%
30 53
 
0.5%
5 53
 
0.5%
23 45
 
0.4%
55 38
 
0.4%
8 30
 
0.3%
87 29
 
0.3%
Other values (14) 106
 
1.1%
ValueCountFrequency (%)
1 537
 
5.4%
4 7018
70.2%
5 53
 
0.5%
7 1509
 
15.1%
8 30
 
0.3%
9 19
 
0.2%
10 19
 
0.2%
12 9
 
0.1%
16 5
 
0.1%
19 2
 
< 0.1%
ValueCountFrequency (%)
109 1
 
< 0.1%
95 7
 
0.1%
91 4
 
< 0.1%
87 29
 
0.3%
83 5
 
0.1%
64 6
 
0.1%
59 2
 
< 0.1%
55 38
 
0.4%
49 20
 
0.2%
46 582
5.8%

동의 여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
True
9998 
False
 
2
ValueCountFrequency (%)
True 9998
> 99.9%
False 2
 
< 0.1%
2023-12-12T18:43:05.185927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

등록 국가
Categorical

IMBALANCE 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
KR
9299 
US
 
415
UNKNOWN
 
277
GB
 
3
JP
 
2
Other values (4)
 
4

Length

Max length7
Median length2
Mean length2.1385
Min length2

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st rowKR
2nd rowKR
3rd rowKR
4th rowKR
5th rowKR

Common Values

ValueCountFrequency (%)
KR 9299
93.0%
US 415
 
4.2%
UNKNOWN 277
 
2.8%
GB 3
 
< 0.1%
JP 2
 
< 0.1%
CN 1
 
< 0.1%
HK 1
 
< 0.1%
CA 1
 
< 0.1%
KP 1
 
< 0.1%

Length

2023-12-12T18:43:05.275664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:43:05.376881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
kr 9299
93.0%
us 415
 
4.2%
unknown 277
 
2.8%
gb 3
 
< 0.1%
jp 2
 
< 0.1%
cn 1
 
< 0.1%
hk 1
 
< 0.1%
ca 1
 
< 0.1%
kp 1
 
< 0.1%
Distinct9142
Distinct (%)91.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2014-09-11 17:16:35
Maximum2023-09-18 14:36:58
2023-12-12T18:43:05.499455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:43:05.629893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T18:43:04.222971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:43:04.012075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:43:04.327744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:43:04.122622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:43:05.737747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용자 인덱스약관 서비스 아이디동의 여부등록 국가
사용자 인덱스1.0000.0350.0250.160
약관 서비스 아이디0.0351.0000.0000.275
동의 여부0.0250.0001.0000.000
등록 국가0.1600.2750.0001.000
2023-12-12T18:43:05.858465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동의 여부등록 국가
동의 여부1.0000.000
등록 국가0.0001.000
2023-12-12T18:43:05.941313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용자 인덱스약관 서비스 아이디동의 여부등록 국가
사용자 인덱스1.0000.0210.0190.073
약관 서비스 아이디0.0211.0000.0000.138
동의 여부0.0190.0001.0000.000
등록 국가0.0730.1380.0001.000

Missing values

2023-12-12T18:43:04.451823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:43:04.554105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사용자 인덱스약관 서비스 아이디동의 여부등록 국가등록 일시
526373088084YKR2016-07-27 08:40:20
593273470284YKR2016-10-07 08:56:40
710184860914YKR2017-03-15 15:02:03
498942948044YKR2016-06-19 15:14:16
745795482214YKR2017-04-14 13:14:23
653864297384YKR2017-01-02 18:01:06
154421139044YKR2015-10-28 13:23:42
446742685494YKR2016-04-23 09:34:39
943627552317YKR2017-09-12 16:29:16
773335863904YKR2017-05-16 18:40:42
사용자 인덱스약관 서비스 아이디동의 여부등록 국가등록 일시
732225295644YKR2017-03-31 10:49:05
307651951314YKR2016-01-11 09:19:40
543583176454YKR2016-08-22 19:58:09
317571999754YKR2016-01-19 10:03:55
7646751154YKR2015-08-24 09:25:58
240421592994YKR2015-11-29 18:33:28
153401134994YKR2015-10-27 14:48:54
6829716374YKR2015-08-04 13:35:25
922707205974YKR2017-08-26 11:39:12
904607017244YKR2017-08-06 22:28:20