Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 566.4 KiB |
Average record size in memory | 58.0 B |
Variable types
Numeric | 2 |
---|---|
Text | 1 |
Categorical | 2 |
DateTime | 1 |
Dataset
Description | 한국기술교육대학교 온라인평생교육원 스마트 직업훈련 플랫폼 (STEP)에 대한 학습자 등록 관련된 내용을 제공합니다. |
---|---|
Author | 한국기술교육대학교 |
URL | https://www.data.go.kr/data/15091074/fileData.do |
Reproduction
Analysis started | 2023-12-12 00:53:39.664180 |
---|---|
Analysis finished | 2023-12-12 00:53:41.021885 |
Duration | 1.36 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
아이디
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 135906.33 |
Minimum | 17 |
---|---|
Maximum | 276489 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 17 |
---|---|
5-th percentile | 13611.7 |
Q1 | 63843.25 |
median | 132472 |
Q3 | 206362 |
95-th percentile | 265776.4 |
Maximum | 276489 |
Range | 276472 |
Interquartile range (IQR) | 142518.75 |
Descriptive statistics
Standard deviation | 80814.735 |
---|---|
Coefficient of variation (CV) | 0.59463553 |
Kurtosis | -1.216497 |
Mean | 135906.33 |
Median Absolute Deviation (MAD) | 71289 |
Skewness | 0.072401255 |
Sum | 1.3590633 × 109 |
Variance | 6.5310214 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
211795 | 1 | < 0.1% |
115462 | 1 | < 0.1% |
131710 | 1 | < 0.1% |
226045 | 1 | < 0.1% |
103009 | 1 | < 0.1% |
132778 | 1 | < 0.1% |
47475 | 1 | < 0.1% |
269 | 1 | < 0.1% |
10423 | 1 | < 0.1% |
178849 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
17 | 1 | |
47 | 1 | |
54 | 1 | |
62 | 1 | |
79 | 1 | |
82 | 1 | |
91 | 1 | |
112 | 1 | |
124 | 1 | |
130 | 1 |
Value | Count | Frequency (%) |
276489 | 1 | |
276483 | 1 | |
276449 | 1 | |
276441 | 1 | |
276429 | 1 | |
276423 | 1 | |
276403 | 1 | |
276389 | 1 | |
276361 | 1 | |
276341 | 1 |
코드
Text
Distinct | 9566 |
---|---|
Distinct (%) | 95.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 25 |
---|---|
Median length | 24 |
Mean length | 23.2277 |
Min length | 13 |
Characters and Unicode
Total characters | 232277 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 9174 ? |
---|---|
Unique (%) | 91.7% |
Sample
1st row | A200000140343-2020211795 |
---|---|
2nd row | A200000090166-2020139420 |
3rd row | A201361010018-2020130507 |
4th row | A200000040159-202094042 |
5th row | A200000060370-202097378 |
Value | Count | Frequency (%) |
a200000060324-2020213640 | 7 | 0.1% |
a200000040430-202079582 | 4 | < 0.1% |
a200000020071-202064561 | 4 | < 0.1% |
a201271020025-2020127774 | 4 | < 0.1% |
a200000000172-2020103180 | 4 | < 0.1% |
a201271010012-2020122857 | 4 | < 0.1% |
a190000200268-201951619 | 4 | < 0.1% |
a201361010016-2020138970 | 4 | < 0.1% |
a200000000171-2020254137 | 3 | < 0.1% |
a201271010009-2020104386 | 3 | < 0.1% |
Other values (9556) | 9959 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 83971 | |
2 | 40165 | |
1 | 27289 | 11.7% |
3 | 10283 | 4.4% |
9 | 10242 | 4.4% |
- | 10000 | 4.3% |
5 | 9945 | 4.3% |
A | 9549 | 4.1% |
7 | 8484 | 3.7% |
4 | 8160 | 3.5% |
Other values (2) | 14189 | 6.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 212728 | |
Dash Punctuation | 10000 | 4.3% |
Uppercase Letter | 9549 | 4.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 83971 | |
2 | 40165 | |
1 | 27289 | 12.8% |
3 | 10283 | 4.8% |
9 | 10242 | 4.8% |
5 | 9945 | 4.7% |
7 | 8484 | 4.0% |
4 | 8160 | 3.8% |
6 | 7451 | 3.5% |
8 | 6738 | 3.2% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 9549 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 222728 | |
Latin | 9549 | 4.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 83971 | |
2 | 40165 | |
1 | 27289 | 12.3% |
3 | 10283 | 4.6% |
9 | 10242 | 4.6% |
- | 10000 | 4.5% |
5 | 9945 | 4.5% |
7 | 8484 | 3.8% |
4 | 8160 | 3.7% |
6 | 7451 | 3.3% |
Latin
Value | Count | Frequency (%) |
A | 9549 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 232277 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 83971 | |
2 | 40165 | |
1 | 27289 | 11.7% |
3 | 10283 | 4.4% |
9 | 10242 | 4.4% |
- | 10000 | 4.3% |
5 | 9945 | 4.3% |
A | 9549 | 4.1% |
7 | 8484 | 3.7% |
4 | 8160 | 3.5% |
Other values (2) | 14189 | 6.1% |
과정 아이디
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 4086 |
---|---|
Distinct (%) | 40.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 114306.65 |
Minimum | 388 |
---|---|
Maximum | 164129 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 388 |
---|---|
5-th percentile | 21365.95 |
Q1 | 98379.25 |
median | 123533.5 |
Q3 | 139264.75 |
95-th percentile | 154177 |
Maximum | 164129 |
Range | 163741 |
Interquartile range (IQR) | 40885.5 |
Descriptive statistics
Standard deviation | 34290.341 |
---|---|
Coefficient of variation (CV) | 0.29998554 |
Kurtosis | 2.1065562 |
Mean | 114306.65 |
Median Absolute Deviation (MAD) | 18106.5 |
Skewness | -1.4117844 |
Sum | 1.1430665 × 109 |
Variance | 1.1758275 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
132448 | 310 | 3.1% |
132451 | 277 | 2.8% |
142699 | 266 | 2.7% |
90337 | 215 | 2.1% |
87487 | 171 | 1.7% |
10212 | 122 | 1.2% |
127546 | 119 | 1.2% |
105427 | 116 | 1.2% |
105425 | 115 | 1.1% |
127555 | 94 | 0.9% |
Other values (4076) | 8195 |
Value | Count | Frequency (%) |
388 | 1 | |
2598 | 1 | |
3062 | 1 | |
3279 | 1 | |
3483 | 1 | |
3690 | 1 | |
3823 | 1 | |
3870 | 1 | |
4275 | 1 | |
4408 | 1 |
Value | Count | Frequency (%) |
164129 | 3 | < 0.1% |
164127 | 2 | < 0.1% |
163858 | 1 | < 0.1% |
163813 | 47 | |
163810 | 36 | |
163807 | 3 | < 0.1% |
162679 | 16 | 0.2% |
162676 | 5 | 0.1% |
162673 | 5 | 0.1% |
162670 | 24 |
등록 국가
Categorical
IMBALANCE
 
Distinct | 10 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
KR | |
---|---|
UNKNOWN | |
US | 145 |
RU | 85 |
CN | 41 |
Other values (5) | 11 |
Length
Max length | 7 |
---|---|
Median length | 2 |
Mean length | 3.091 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | KR |
---|---|
2nd row | KR |
3rd row | KR |
4th row | UNKNOWN |
5th row | UNKNOWN |
Common Values
Value | Count | Frequency (%) |
KR | 7536 | |
UNKNOWN | 2182 | 21.8% |
US | 145 | 1.5% |
RU | 85 | 0.9% |
CN | 41 | 0.4% |
JP | 3 | < 0.1% |
ES | 3 | < 0.1% |
GB | 3 | < 0.1% |
VU | 1 | < 0.1% |
TW | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
kr | 7536 | |
unknown | 2182 | 21.8% |
us | 145 | 1.5% |
ru | 85 | 0.9% |
cn | 41 | 0.4% |
jp | 3 | < 0.1% |
es | 3 | < 0.1% |
gb | 3 | < 0.1% |
vu | 1 | < 0.1% |
tw | 1 | < 0.1% |
등록 기기 타입
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
PC | |
---|---|
모바일 | 470 |
Length
Max length | 3 |
---|---|
Median length | 2 |
Mean length | 2.047 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | PC |
---|---|
2nd row | PC |
3rd row | PC |
4th row | PC |
5th row | PC |
Common Values
Value | Count | Frequency (%) |
PC | 9530 | |
모바일 | 470 | 4.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
pc | 9530 | |
모바일 | 470 | 4.7% |
코드 등록 일시
Date
Distinct | 9991 |
---|---|
Distinct (%) | 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2019-09-14 15:07:48 |
---|---|
Maximum | 2021-01-03 00:09:43 |
아이디 | 과정 아이디 | 등록 국가 | 등록 기기 타입 | |
---|---|---|---|---|
아이디 | 1.000 | 0.856 | 0.257 | 0.176 |
과정 아이디 | 0.856 | 1.000 | 0.208 | 0.159 |
등록 국가 | 0.257 | 0.208 | 1.000 | 0.492 |
등록 기기 타입 | 0.176 | 0.159 | 0.492 | 1.000 |
등록 기기 타입 | 등록 국가 | |
---|---|---|
등록 기기 타입 | 1.000 | 0.378 |
등록 국가 | 0.378 | 1.000 |
아이디 | 과정 아이디 | 등록 국가 | 등록 기기 타입 | |
---|---|---|---|---|
아이디 | 1.000 | 0.812 | 0.081 | 0.135 |
과정 아이디 | 0.812 | 1.000 | 0.065 | 0.122 |
등록 국가 | 0.081 | 0.065 | 1.000 | 0.378 |
등록 기기 타입 | 0.135 | 0.122 | 0.378 | 1.000 |
아이디 | 코드 | 과정 아이디 | 등록 국가 | 등록 기기 타입 | 코드 등록 일시 | |
---|---|---|---|---|---|---|
72408 | 211795 | A200000140343-2020211795 | 144871 | KR | PC | 2020-10-15 12:56:12 |
48490 | 139420 | A200000090166-2020139420 | 128863 | KR | PC | 2020-08-03 15:34:28 |
45519 | 130507 | A201361010018-2020130507 | 124888 | KR | PC | 2020-07-22 17:56:16 |
33366 | 94042 | A200000040159-202094042 | 111592 | UNKNOWN | PC | 2020-06-14 14:07:28 |
34478 | 97378 | A200000060370-202097378 | 120847 | UNKNOWN | PC | 2020-06-16 07:48:58 |
28347 | 78985 | A200000040156-202078985 | 111583 | KR | PC | 2020-05-15 10:20:57 |
93895 | 273681 | A200000170213-2020248284 | 153163 | UNKNOWN | PC | 2020-12-28 13:03:18 |
17676 | 48859 | A200000000071-202048857 | 105425 | KR | PC | 2020-02-29 19:17:21 |
66243 | 193300 | A200000120461-2020193300 | 139798 | UNKNOWN | PC | 2020-09-16 10:09:35 |
19265 | 52037 | 10245-202050863 | 10245 | KR | PC | 2020-03-10 10:25:06 |
아이디 | 코드 | 과정 아이디 | 등록 국가 | 등록 기기 타입 | 코드 등록 일시 | |
---|---|---|---|---|---|---|
69675 | 203596 | A200000130337-2020203596 | 141943 | UNKNOWN | PC | 2020-10-05 11:44:40 |
71999 | 210568 | A201415010001-2020210568 | 146689 | UNKNOWN | PC | 2020-10-15 08:51:50 |
13394 | 39109 | A191015100019-201939109 | 95197 | KR | PC | 2020-01-16 17:22:17 |
37744 | 107182 | A200000000172-2020101071 | 113686 | KR | 모바일 | 2020-06-26 13:38:13 |
8580 | 24658 | A191015090092-201924658 | 91921 | KR | PC | 2019-12-02 16:35:46 |
14544 | 42562 | A171037060002-201742559 | 20842 | UNKNOWN | PC | 2020-02-09 15:51:07 |
9236 | 26632 | A191015090012-201926626 | 91678 | KR | PC | 2019-12-05 18:49:37 |
83581 | 245314 | A200000170192-2020245314 | 153100 | KR | PC | 2020-12-01 08:28:02 |
5956 | 16786 | A190000140366-201916786 | 88092 | KR | PC | 2019-11-15 11:09:23 |
90474 | 265993 | A201355030043-2020265993 | 154093 | KR | PC | 2020-12-18 12:36:03 |