Dataset statistics
Number of variables | 2 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 244.1 KiB |
Average record size in memory | 25.0 B |
Variable types
Numeric | 1 |
---|---|
Text | 1 |
Dataset
Description | 한국기술교육대학교 온라인평생교육원 스마트 직업훈련 플랫폼 (STEP)에 대한 과목 키워드와 관련된 내용을 제공합니다. |
---|---|
Author | 한국기술교육대학교 |
URL | https://www.data.go.kr/data/15091098/fileData.do |
Reproduction
Analysis started | 2023-12-12 07:31:16.364960 |
---|---|
Analysis finished | 2023-12-12 07:31:16.833897 |
Duration | 0.47 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
과정 아이디
Real number (ℝ)
Distinct | 7797 |
---|---|
Distinct (%) | 78.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 63607.498 |
Minimum | 34 |
---|---|
Maximum | 413848 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 34 |
---|---|
5-th percentile | 3055 |
Q1 | 5649.75 |
median | 9275 |
Q3 | 95059.75 |
95-th percentile | 296641.4 |
Maximum | 413848 |
Range | 413814 |
Interquartile range (IQR) | 89410 |
Descriptive statistics
Standard deviation | 99830.891 |
---|---|
Coefficient of variation (CV) | 1.5694831 |
Kurtosis | 1.9175251 |
Mean | 63607.498 |
Median Absolute Deviation (MAD) | 4796.5 |
Skewness | 1.7147149 |
Sum | 6.3607498 × 108 |
Variance | 9.9662069 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11918 | 5 | 0.1% |
10125 | 4 | < 0.1% |
9319 | 4 | < 0.1% |
2829 | 4 | < 0.1% |
9604 | 4 | < 0.1% |
9337 | 4 | < 0.1% |
8869 | 4 | < 0.1% |
3678 | 4 | < 0.1% |
4810 | 4 | < 0.1% |
5705 | 4 | < 0.1% |
Other values (7787) | 9959 |
Value | Count | Frequency (%) |
34 | 1 | < 0.1% |
122 | 1 | < 0.1% |
261 | 1 | < 0.1% |
262 | 1 | < 0.1% |
308 | 1 | < 0.1% |
312 | 1 | < 0.1% |
327 | 2 | |
332 | 1 | < 0.1% |
333 | 2 | |
334 | 3 |
Value | Count | Frequency (%) |
413848 | 1 | |
413839 | 1 | |
413836 | 1 | |
413806 | 1 | |
413797 | 1 | |
413794 | 1 | |
413785 | 1 | |
413746 | 1 | |
413725 | 1 | |
413662 | 1 |
키워드
Text
Distinct | 763 |
---|---|
Distinct (%) | 7.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
신규 | 2304 | 20.4% |
모바일 | 605 | 5.4% |
plc | 165 | 1.5% |
프로그래밍 | 135 | 1.2% |
도면 | 124 | 1.1% |
자동제어 | 117 | 1.0% |
제어 | 111 | 1.0% |
설계 | 108 | 1.0% |
네트워크 | 101 | 0.9% |
13기 | 94 | 0.8% |
Other values (764) | 7418 |
Most occurring characters
Value | Count | Frequency (%) |
신 | 2412 | 6.5% |
규 | 2312 | 6.3% |
1309 | 3.6% | |
기 | 920 | 2.5% |
모 | 854 | 2.3% |
바 | 675 | 1.8% |
C | 642 | 1.7% |
계 | 637 | 1.7% |
일 | 625 | 1.7% |
스 | 620 | 1.7% |
Other values (440) | 25824 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 29388 | |
Uppercase Letter | 3281 | 8.9% |
Lowercase Letter | 1954 | 5.3% |
Space Separator | 1309 | 3.6% |
Decimal Number | 623 | 1.7% |
Other Punctuation | 86 | 0.2% |
Dash Punctuation | 52 | 0.1% |
Open Punctuation | 47 | 0.1% |
Close Punctuation | 47 | 0.1% |
Math Symbol | 42 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
신 | 2412 | 8.2% |
규 | 2312 | 7.9% |
기 | 920 | 3.1% |
모 | 854 | 2.9% |
바 | 675 | 2.3% |
계 | 637 | 2.2% |
일 | 625 | 2.1% |
스 | 620 | 2.1% |
이 | 592 | 2.0% |
어 | 577 | 2.0% |
Other values (372) | 19164 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 642 | |
D | 343 | |
P | 322 | |
L | 272 | |
M | 271 | |
A | 224 | 6.8% |
S | 187 | 5.7% |
I | 126 | 3.8% |
T | 125 | 3.8% |
H | 118 | 3.6% |
Other values (14) | 651 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 239 | |
o | 217 | |
a | 180 | |
t | 166 | 8.5% |
l | 144 | 7.4% |
c | 137 | 7.0% |
r | 132 | 6.8% |
i | 118 | 6.0% |
s | 113 | 5.8% |
n | 84 | 4.3% |
Other values (14) | 424 |
Decimal Number
Value | Count | Frequency (%) |
1 | 223 | |
3 | 190 | |
7 | 86 | 13.8% |
2 | 39 | 6.3% |
5 | 33 | 5.3% |
0 | 22 | 3.5% |
8 | 21 | 3.4% |
6 | 8 | 1.3% |
4 | 1 | 0.2% |
Other Punctuation
Value | Count | Frequency (%) |
, | 38 | |
# | 28 | |
/ | 18 | |
. | 1 | 1.2% |
& | 1 | 1.2% |
Space Separator
Value | Count | Frequency (%) |
1309 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 52 |
Open Punctuation
Value | Count | Frequency (%) |
( | 47 |
Close Punctuation
Value | Count | Frequency (%) |
) | 47 |
Math Symbol
Value | Count | Frequency (%) |
+ | 42 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 29388 | |
Latin | 5235 | 14.2% |
Common | 2207 | 6.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
신 | 2412 | 8.2% |
규 | 2312 | 7.9% |
기 | 920 | 3.1% |
모 | 854 | 2.9% |
바 | 675 | 2.3% |
계 | 637 | 2.2% |
일 | 625 | 2.1% |
스 | 620 | 2.1% |
이 | 592 | 2.0% |
어 | 577 | 2.0% |
Other values (372) | 19164 |
Latin
Value | Count | Frequency (%) |
C | 642 | 12.3% |
D | 343 | 6.6% |
P | 322 | 6.2% |
L | 272 | 5.2% |
M | 271 | 5.2% |
e | 239 | 4.6% |
A | 224 | 4.3% |
o | 217 | 4.1% |
S | 187 | 3.6% |
a | 180 | 3.4% |
Other values (38) | 2338 |
Common
Value | Count | Frequency (%) |
1309 | ||
1 | 223 | 10.1% |
3 | 190 | 8.6% |
7 | 86 | 3.9% |
- | 52 | 2.4% |
( | 47 | 2.1% |
) | 47 | 2.1% |
+ | 42 | 1.9% |
2 | 39 | 1.8% |
, | 38 | 1.7% |
Other values (10) | 134 | 6.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 29388 | |
ASCII | 7442 | 20.2% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
신 | 2412 | 8.2% |
규 | 2312 | 7.9% |
기 | 920 | 3.1% |
모 | 854 | 2.9% |
바 | 675 | 2.3% |
계 | 637 | 2.2% |
일 | 625 | 2.1% |
스 | 620 | 2.1% |
이 | 592 | 2.0% |
어 | 577 | 2.0% |
Other values (372) | 19164 |
ASCII
Value | Count | Frequency (%) |
1309 | 17.6% | |
C | 642 | 8.6% |
D | 343 | 4.6% |
P | 322 | 4.3% |
L | 272 | 3.7% |
M | 271 | 3.6% |
e | 239 | 3.2% |
A | 224 | 3.0% |
1 | 223 | 3.0% |
o | 217 | 2.9% |
Other values (58) | 3380 |
과정 아이디 | 키워드 | |
---|---|---|
16645 | 7612 | 네트워크 보안 |
34287 | 128821 | 모바일 |
21096 | 9038 | PH-Lab |
541 | 2620 | 인적자원관리 |
30047 | 30110 | 추천 |
15974 | 7349 | CAM |
14533 | 6898 | 산업설비 |
38290 | 200517 | 신규 |
16476 | 7565 | 도면 |
8857 | 5126 | 솔리드웍스 |
과정 아이디 | 키워드 | |
---|---|---|
41758 | 309349 | 신규 |
37055 | 179643 | 신규 |
13446 | 6416 | 공압 |
9458 | 5306 | 박막증착 |
23485 | 9772 | 재무제표 |
34681 | 140392 | 신규 |
25652 | 11258 | 자바 |
26882 | 12254 | 진단장비 |
28136 | 13520 | 양중기 |
42261 | 326881 | 신규 |