Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 10000 |
Missing cells | 242 |
Missing cells (%) | 0.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 390.6 KiB |
Average record size in memory | 40.0 B |
Variable types
Text | 2 |
---|---|
DateTime | 1 |
Categorical | 1 |
Dataset
Description | 한국기술교육대학교 온라인평생교육원 스마트 직업훈련 플랫폼 (STEP)에 대한 사용자 기업명 내용을 제공합니다. |
---|---|
Author | 한국기술교육대학교 |
URL | https://www.data.go.kr/data/15091047/fileData.do |
Reproduction
Analysis started | 2024-04-17 09:54:08.630972 |
---|---|
Analysis finished | 2024-04-17 09:54:09.192588 |
Duration | 0.56 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
회사명
Text
MISSING
 
Distinct | 5116 |
---|---|
Distinct (%) | 52.4% |
Missing | 242 |
Missing (%) | 2.4% |
Memory size | 156.2 KiB |
Length
Max length | 41 |
---|---|
Median length | 28 |
Mean length | 6.9821685 |
Min length | 1 |
Characters and Unicode
Total characters | 68132 |
---|---|
Distinct characters | 748 |
Distinct categories | 11 ? |
Distinct scripts | 4 ? |
Distinct blocks | 4 ? |
Unique
Unique | 4137 ? |
---|---|
Unique (%) | 42.4% |
Sample
1st row | (주)덕산코트랜 |
---|---|
2nd row | 어보브반도체 |
3rd row | 제일교육학원제일요양보호사교육원 |
4th row | 마두간호학원 |
5th row | (주)진화이앤씨 |
Value | Count | Frequency (%) |
㈜삼성디스플레이 | 291 | 2.7% |
인천교통공사 | 208 | 2.0% |
주식회사 | 180 | 1.7% |
주)미래컴퍼니 | 113 | 1.1% |
세메스 | 88 | 0.8% |
유라코퍼레이션 | 84 | 0.8% |
주)테라세미콘 | 80 | 0.8% |
에스에프에이 | 73 | 0.7% |
케이씨텍 | 68 | 0.6% |
주)캐스트이즈 | 62 | 0.6% |
Other values (5237) | 9413 |
Most occurring characters
Value | Count | Frequency (%) |
이 | 2740 | 4.0% |
스 | 2411 | 3.5% |
주 | 2201 | 3.2% |
) | 1938 | 2.8% |
( | 1898 | 2.8% |
학 | 1304 | 1.9% |
에 | 1249 | 1.8% |
원 | 1239 | 1.8% |
교 | 1238 | 1.8% |
전 | 1021 | 1.5% |
Other values (738) | 50893 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 60121 | |
Close Punctuation | 1938 | 2.8% |
Open Punctuation | 1898 | 2.8% |
Uppercase Letter | 1809 | 2.7% |
Space Separator | 965 | 1.4% |
Lowercase Letter | 821 | 1.2% |
Other Symbol | 393 | 0.6% |
Decimal Number | 122 | 0.2% |
Other Punctuation | 47 | 0.1% |
Dash Punctuation | 15 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 2740 | 4.6% |
스 | 2411 | 4.0% |
주 | 2201 | 3.7% |
학 | 1304 | 2.2% |
에 | 1249 | 2.1% |
원 | 1239 | 2.1% |
교 | 1238 | 2.1% |
전 | 1021 | 1.7% |
아 | 1019 | 1.7% |
사 | 987 | 1.6% |
Other values (665) | 44712 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 211 | |
T | 165 | 9.1% |
C | 163 | 9.0% |
K | 151 | 8.3% |
E | 116 | 6.4% |
H | 105 | 5.8% |
M | 104 | 5.7% |
I | 100 | 5.5% |
A | 94 | 5.2% |
B | 91 | 5.0% |
Other values (16) | 509 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 116 | |
s | 98 | |
t | 61 | 7.4% |
n | 58 | 7.1% |
o | 58 | 7.1% |
c | 52 | 6.3% |
m | 48 | 5.8% |
r | 43 | 5.2% |
i | 42 | 5.1% |
a | 41 | 5.0% |
Other values (15) | 204 |
Decimal Number
Value | Count | Frequency (%) |
1 | 45 | |
2 | 25 | |
0 | 15 | 12.3% |
3 | 13 | 10.7% |
7 | 7 | 5.7% |
5 | 6 | 4.9% |
9 | 4 | 3.3% |
6 | 4 | 3.3% |
4 | 2 | 1.6% |
8 | 1 | 0.8% |
Other Punctuation
Value | Count | Frequency (%) |
. | 19 | |
& | 10 | |
/ | 10 | |
, | 4 | 8.5% |
: | 2 | 4.3% |
' | 2 | 4.3% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1938 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1898 |
Space Separator
Value | Count | Frequency (%) |
965 |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 393 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 15 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 60513 | |
Common | 4988 | 7.3% |
Latin | 2630 | 3.9% |
Han | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 2740 | 4.5% |
스 | 2411 | 4.0% |
주 | 2201 | 3.6% |
학 | 1304 | 2.2% |
에 | 1249 | 2.1% |
원 | 1239 | 2.0% |
교 | 1238 | 2.0% |
전 | 1021 | 1.7% |
아 | 1019 | 1.7% |
사 | 987 | 1.6% |
Other values (665) | 45104 |
Latin
Value | Count | Frequency (%) |
S | 211 | 8.0% |
T | 165 | 6.3% |
C | 163 | 6.2% |
K | 151 | 5.7% |
e | 116 | 4.4% |
E | 116 | 4.4% |
H | 105 | 4.0% |
M | 104 | 4.0% |
I | 100 | 3.8% |
s | 98 | 3.7% |
Other values (41) | 1301 |
Common
Value | Count | Frequency (%) |
) | 1938 | |
( | 1898 | |
965 | ||
1 | 45 | 0.9% |
2 | 25 | 0.5% |
. | 19 | 0.4% |
0 | 15 | 0.3% |
- | 15 | 0.3% |
3 | 13 | 0.3% |
& | 10 | 0.2% |
Other values (11) | 45 | 0.9% |
Han
Value | Count | Frequency (%) |
器 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 60120 | |
ASCII | 7618 | 11.2% |
None | 393 | 0.6% |
CJK | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
이 | 2740 | 4.6% |
스 | 2411 | 4.0% |
주 | 2201 | 3.7% |
학 | 1304 | 2.2% |
에 | 1249 | 2.1% |
원 | 1239 | 2.1% |
교 | 1238 | 2.1% |
전 | 1021 | 1.7% |
아 | 1019 | 1.7% |
사 | 987 | 1.6% |
Other values (664) | 44711 |
ASCII
Value | Count | Frequency (%) |
) | 1938 | |
( | 1898 | |
965 | ||
S | 211 | 2.8% |
T | 165 | 2.2% |
C | 163 | 2.1% |
K | 151 | 2.0% |
e | 116 | 1.5% |
E | 116 | 1.5% |
H | 105 | 1.4% |
Other values (62) | 1790 |
None
Value | Count | Frequency (%) |
㈜ | 393 |
CJK
Value | Count | Frequency (%) |
器 | 1 |
사업자 등록 번호
Text
Distinct | 5478 |
---|---|
Distinct (%) | 54.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 12 |
---|---|
Median length | 10 |
Mean length | 10.4519 |
Min length | 1 |
Characters and Unicode
Total characters | 104519 |
---|---|
Distinct characters | 17 |
Distinct categories | 6 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 4470 ? |
---|---|
Unique (%) | 44.7% |
Sample
1st row | 5048135674 |
---|---|
2nd row | 1208526960 |
3rd row | 6169206236 |
4th row | 1289260072 |
5th row | 114-81-31024 |
Value | Count | Frequency (%) |
1398202409 | 141 | 1.4% |
1248194031 | 131 | 1.3% |
2118196221 | 107 | 1.1% |
6098135227 | 69 | 0.7% |
139-82-02409 | 67 | 0.7% |
2208193209 | 62 | 0.6% |
312-81-13969 | 62 | 0.6% |
1428145237 | 59 | 0.6% |
1248198532 | 59 | 0.6% |
3148117170 | 53 | 0.5% |
Other values (5484) | 9218 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 19560 | |
8 | 13240 | |
2 | 12356 | |
0 | 11894 | |
3 | 8879 | |
4 | 7943 | |
5 | 7087 | 6.8% |
6 | 7031 | 6.7% |
9 | 6870 | 6.6% |
7 | 4991 | 4.8% |
Other values (7) | 4668 | 4.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 99851 | |
Dash Punctuation | 4613 | 4.4% |
Space Separator | 30 | < 0.1% |
Other Punctuation | 13 | < 0.1% |
Uppercase Letter | 11 | < 0.1% |
Other Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 19560 | |
8 | 13240 | |
2 | 12356 | |
0 | 11894 | |
3 | 8879 | |
4 | 7943 | |
5 | 7087 | 7.1% |
6 | 7031 | 7.0% |
9 | 6870 | 6.9% |
7 | 4991 | 5.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 10 | |
* | 3 | 23.1% |
Uppercase Letter
Value | Count | Frequency (%) |
E | 10 | |
S | 1 | 9.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4613 |
Space Separator
Value | Count | Frequency (%) |
30 |
Other Letter
Value | Count | Frequency (%) |
ㅂ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 104507 | |
Latin | 11 | < 0.1% |
Hangul | 1 | < 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 19560 | |
8 | 13240 | |
2 | 12356 | |
0 | 11894 | |
3 | 8879 | |
4 | 7943 | |
5 | 7087 | 6.8% |
6 | 7031 | 6.7% |
9 | 6870 | 6.6% |
7 | 4991 | 4.8% |
Other values (4) | 4656 | 4.5% |
Latin
Value | Count | Frequency (%) |
E | 10 | |
S | 1 | 9.1% |
Hangul
Value | Count | Frequency (%) |
ㅂ | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 104518 | |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 19560 | |
8 | 13240 | |
2 | 12356 | |
0 | 11894 | |
3 | 8879 | |
4 | 7943 | |
5 | 7087 | 6.8% |
6 | 7031 | 6.7% |
9 | 6870 | 6.6% |
7 | 4991 | 4.8% |
Other values (6) | 4667 | 4.5% |
Compat Jamo
Value | Count | Frequency (%) |
ㅂ | 1 |
등록 일시
Date
Distinct | 8104 |
---|---|
Distinct (%) | 81.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2014-09-15 09:59:20 |
---|---|
Maximum | 2019-09-11 21:39:54 |
등록 국가
Categorical
IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
KR | |
---|---|
US | 414 |
UNKNOWN | 12 |
CN | 2 |
GB | 1 |
Length
Max length | 7 |
---|---|
Median length | 2 |
Mean length | 2.006 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | KR |
---|---|
2nd row | KR |
3rd row | KR |
4th row | KR |
5th row | KR |
Common Values
Value | Count | Frequency (%) |
KR | 9571 | |
US | 414 | 4.1% |
UNKNOWN | 12 | 0.1% |
CN | 2 | < 0.1% |
GB | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
kr | 9571 | |
us | 414 | 4.1% |
unknown | 12 | 0.1% |
cn | 2 | < 0.1% |
gb | 1 | < 0.1% |
회사명 | 사업자 등록 번호 | 등록 일시 | 등록 국가 | |
---|---|---|---|---|
4860 | (주)덕산코트랜 | 5048135674 | 2016-07-12 10:00:09 | KR |
6075 | 어보브반도체 | 1208526960 | 2017-01-19 17:27:59 | KR |
7521 | 제일교육학원제일요양보호사교육원 | 6169206236 | 2017-05-25 18:19:07 | KR |
5743 | 마두간호학원 | 1289260072 | 2016-11-17 14:14:39 | KR |
14129 | (주)진화이앤씨 | 114-81-31024 | 2019-08-21 10:00:12 | KR |
595 | 에이치에스엘 일렉트로닉스 | 5048135320 | 2015-06-12 08:37:02 | KR |
3064 | (주)진넷시스템 | 1138647173 | 2015-12-30 08:58:21 | KR |
3872 | 한국산업경영자문 | 4098184021 | 2016-03-08 20:18:35 | KR |
1633 | 대한문화 | 2012967774 | 2015-10-05 11:43:44 | KR |
2284 | ㈜삼성디스플레이 | 1428145449 | 2015-11-05 11:27:52 | KR |
회사명 | 사업자 등록 번호 | 등록 일시 | 등록 국가 | |
---|---|---|---|---|
9678 | 엠오에스충청 | 3148171058 | 2018-01-30 13:39:05 | KR |
7933 | 이너비즈 | 3968100711 | 2017-06-15 10:41:15 | KR |
7136 | KT | 1028142945 | 2017-05-12 10:51:35 | KR |
2455 | ABB코리아 | 1208104589 | 2015-11-05 17:29:23 | KR |
6432 | 주식회사 테라세미콘 | 124-81-94031 | 2017-02-28 13:26:13 | KR |
3943 | 이시다매뉴팩쳐링코리아 | 1308167198 | 2016-03-14 09:52:27 | KR |
7909 | 월정(주) 나무소리 | 409-86-32547 | 2017-06-13 15:12:10 | KR |
2683 | 한국다우코닝 | 1068121414 | 2015-11-25 20:28:13 | KR |
9341 | 이오시스템 | 1378109108 | 2017-12-11 11:12:57 | KR |
480 | 피에스케이 | 1258106879 | 2015-05-22 11:26:14 | KR |