Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 307 |
Missing cells | 3 |
Missing cells (%) | 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 17.5 KiB |
Average record size in memory | 58.4 B |
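The two derived figures above follow from the raw counts. A minimal sketch that recomputes them, assuming the missing-cell percentage is taken over all cells (variables times observations) and that KiB means 1024 bytes; the variable names are illustrative only:

```python
# Raw figures copied from the "Dataset statistics" table.
n_variables = 7
n_observations = 307
missing_cells = 3
total_size_kib = 17.5

# Assumption: the percentage is taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```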
Variable types
Text | 3 |
---|---|
Categorical | 3 |
Numeric | 1 |
Dataset
Description | Technology-related information as of February 2021. |
---|---|
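Author | 한국연구재단 정보통신기획평가원 (National Research Foundation of Korea; Institute of Information & Communications Technology Planning & Evaluation) |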
URL | https://www.data.go.kr/data/15077418/fileData.do |
생성자 has constant value "000000000000000UU0" | Constant |
생성일시 is highly overall correlated with 클러스터분류레벨 | High correlation |
클러스터분류레벨 is highly overall correlated with 생성일시 | High correlation |
클러스터분류 has unique values | Unique |
클러스터분류이름타이틀 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 22:07:14.177306 |
---|---|
Analysis finished | 2023-12-12 22:07:14.630848 |
Duration | 0.45 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
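The reported duration is the difference between the two timestamps. A small sketch of that arithmetic, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the "Reproduction" table.
started = datetime.strptime("2023-12-12 22:07:14.177306", FMT)
finished = datetime.strptime("2023-12-12 22:07:14.630848", FMT)

duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```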
클러스터분류
Text
UNIQUE
Distinct | 307 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
Value | Count | Frequency (%) |
b2e12 | 2 | 0.7% |
a | 1 | 0.3% |
b2e05d | 1 | 0.3% |
b2b01a | 1 | 0.3% |
b2a05a | 1 | 0.3% |
b2a04c | 1 | 0.3% |
b2a04b | 1 | 0.3% |
b2a04a | 1 | 0.3% |
b2a03h | 1 | 0.3% |
b2a03f | 1 | 0.3% |
Other values (296) | 296 | 96.4% |
Most occurring characters
Value | Count | Frequency (%) |
A | 284 | 17.1% |
0 | 262 | 15.8% |
B | 259 | 15.6% |
2 | 247 | 14.9% |
1 | 218 | 13.1% |
C | 158 | 9.5% |
E | 66 | 4.0% |
3 | 47 | 2.8% |
D | 29 | 1.7% |
4 | 24 | 1.4% |
Other values (10) | 67 | 4.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 856 | 51.5% |
Uppercase Letter | 804 | 48.4% |
Close Punctuation | 1 | 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 262 | 30.6% |
2 | 247 | 28.9% |
1 | 218 | 25.5% |
3 | 47 | 5.5% |
4 | 24 | 2.8% |
5 | 15 | 1.8% |
7 | 15 | 1.8% |
6 | 11 | 1.3% |
9 | 9 | 1.1% |
8 | 8 | 0.9% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 284 | 35.3% |
B | 259 | 32.2% |
C | 158 | 19.7% |
E | 66 | 8.2% |
D | 29 | 3.6% |
F | 3 | 0.4% |
G | 2 | 0.2% |
H | 2 | 0.2% |
I | 1 | 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 857 | 51.6% |
Latin | 804 | 48.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 262 | 30.6% |
2 | 247 | 28.8% |
1 | 218 | 25.4% |
3 | 47 | 5.5% |
4 | 24 | 2.8% |
5 | 15 | 1.8% |
7 | 15 | 1.8% |
6 | 11 | 1.3% |
9 | 9 | 1.1% |
8 | 8 | 0.9% |
Latin
Value | Count | Frequency (%) |
A | 284 | 35.3% |
B | 259 | 32.2% |
C | 158 | 19.7% |
E | 66 | 8.2% |
D | 29 | 3.6% |
F | 3 | 0.4% |
G | 2 | 0.2% |
H | 2 | 0.2% |
I | 1 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1661 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
A | 284 | 17.1% |
0 | 262 | 15.8% |
B | 259 | 15.6% |
2 | 247 | 14.9% |
1 | 218 | 13.1% |
C | 158 | 9.5% |
E | 66 | 4.0% |
3 | 47 | 2.8% |
D | 29 | 1.7% |
4 | 24 | 1.4% |
Other values (10) | 67 | 4.0% |
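The category tables above group characters by Unicode general category. A minimal sketch of that kind of tally using Python's standard `unicodedata` module; `sample_ids` holds only the ten IDs visible in the report's first sample rows, so the counts are for that subset and not for the full column, and the `CATEGORY_NAMES` mapping is an illustrative helper rather than anything taken from the profiling tool:

```python
import unicodedata
from collections import Counter

# First ten 클러스터분류 values shown in the report's sample rows (a subset).
sample_ids = ["A", "B", "C", "A1", "A2", "B1", "B2", "C1", "C2", "C3"]

# Readable names for the general categories that appear in this report.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Pe": "Close Punctuation",
}

# Count individual characters and their Unicode general categories.
char_counts = Counter(ch for value in sample_ids for ch in value)
category_counts = Counter(
    unicodedata.category(ch) for value in sample_ids for ch in value
)

for code, count in category_counts.most_common():
    print(CATEGORY_NAMES.get(code, code), count)
```

The single ")" counted under Close Punctuation in the tables belongs to category `Pe`, which suggests that one ID in the full column contains a stray closing parenthesis.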
생성자
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
000000000000000UU0 | 307 |
---|---|
Length
Max length | 18 |
---|---|
Median length | 18 |
Mean length | 18 |
Min length | 18 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 000000000000000UU0 |
---|---|
2nd row | 000000000000000UU0 |
3rd row | 000000000000000UU0 |
4th row | 000000000000000UU0 |
5th row | 000000000000000UU0 |
Common Values
Value | Count | Frequency (%) |
000000000000000UU0 | 307 | 100.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
000000000000000uu0 | 307 | 100.0% |
생성일시
Categorical
HIGH CORRELATION
Distinct | 6 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
2015-03-18 오전 9:46:51 | 135 |
---|---|
2015-03-18 오전 9:46:18 | 75 |
2015-03-18 오전 9:46:50 | 66 |
2015-03-18 오전 9:45:47 | 21 |
2015-03-18 오전 9:44:14 | 7 |
Length
Max length | 21 |
---|---|
Median length | 21 |
Mean length | 21 |
Min length | 21 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2015-03-18 오전 9:43:32 |
---|---|
2nd row | 2015-03-18 오전 9:43:32 |
3rd row | 2015-03-18 오전 9:43:32 |
4th row | 2015-03-18 오전 9:44:14 |
5th row | 2015-03-18 오전 9:44:14 |
Common Values
Value | Count | Frequency (%) |
2015-03-18 오전 9:46:51 | 135 | 44.0% |
2015-03-18 오전 9:46:18 | 75 | 24.4% |
2015-03-18 오전 9:46:50 | 66 | 21.5% |
2015-03-18 오전 9:45:47 | 21 | 6.8% |
2015-03-18 오전 9:44:14 | 7 | 2.3% |
2015-03-18 오전 9:43:32 | 3 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2015-03-18 | 307 | 33.3% |
오전 | 307 | 33.3% |
9:46:51 | 135 | 14.7% |
9:46:18 | 75 | 8.1% |
9:46:50 | 66 | 7.2% |
9:45:47 | 21 | 2.3% |
9:44:14 | 7 | 0.8% |
9:43:32 | 3 | 0.3% |
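생성일시 is profiled as Categorical because its values use a Korean AM/PM marker (오전 = AM, 오후 = PM) that is not parsed as a datetime. A minimal sketch of converting such a string, assuming every value follows the `YYYY-MM-DD 오전|오후 H:MM:SS` layout seen above; `parse_korean_timestamp` is a hypothetical helper, not part of any library:

```python
from datetime import datetime


def parse_korean_timestamp(text: str) -> datetime:
    """Parse e.g. '2015-03-18 오전 9:43:32' into a datetime (12-hour clock)."""
    date_part, meridiem, time_part = text.split()
    if meridiem not in ("오전", "오후"):
        raise ValueError(f"unexpected AM/PM marker: {meridiem!r}")

    year, month, day = (int(p) for p in date_part.split("-"))
    hour, minute, second = (int(p) for p in time_part.split(":"))

    # 12-hour to 24-hour: 12 AM becomes 0, 12 PM stays 12, other PM hours add 12.
    hour = hour % 12 + (12 if meridiem == "오후" else 0)
    return datetime(year, month, day, hour, minute, second)


parsed = parse_korean_timestamp("2015-03-18 오전 9:43:32")
print(parsed.isoformat())
```

With the column converted this way, the six distinct values span roughly three minutes on 2015-03-18, which is consistent with a bulk load rather than independent record creation.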
상위클러스터분류ID
Text
Distinct | 106 |
---|---|
Distinct (%) | 34.9% |
Missing | 3 |
Missing (%) | 1.0% |
Memory size | 2.5 KiB |
Value | Count | Frequency (%) |
b2e | 13 | 4.3% |
a1a | 10 | 3.3% |
b1a02 | 9 | 3.0% |
c2b | 8 | 2.6% |
b2a03 | 8 | 2.6% |
b2e07 | 6 | 2.0% |
c2c | 5 | 1.6% |
b2e09 | 5 | 1.6% |
b2e12 | 5 | 1.6% |
a1b01 | 5 | 1.6% |
Other values (96) | 230 | 75.7% |
Most occurring characters
Value | Count | Frequency (%) |
2 | 226 | 17.7% |
A | 201 | 15.7% |
B | 193 | 15.1% |
0 | 190 | 14.9% |
1 | 188 | 14.7% |
C | 120 | 9.4% |
E | 58 | 4.5% |
3 | 36 | 2.8% |
4 | 17 | 1.3% |
7 | 12 | 0.9% |
Other values (5) | 38 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 699 | 54.7% |
Uppercase Letter | 580 | 45.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 226 | 32.3% |
0 | 190 | 27.2% |
1 | 188 | 26.9% |
3 | 36 | 5.2% |
4 | 17 | 2.4% |
7 | 12 | 1.7% |
5 | 10 | 1.4% |
6 | 8 | 1.1% |
9 | 7 | 1.0% |
8 | 5 | 0.7% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 201 | 34.7% |
B | 193 | 33.3% |
C | 120 | 20.7% |
E | 58 | 10.0% |
D | 8 | 1.4% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 699 | 54.7% |
Latin | 580 | 45.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 226 | 32.3% |
0 | 190 | 27.2% |
1 | 188 | 26.9% |
3 | 36 | 5.2% |
4 | 17 | 2.4% |
7 | 12 | 1.7% |
5 | 10 | 1.4% |
6 | 8 | 1.1% |
9 | 7 | 1.0% |
8 | 5 | 0.7% |
Latin
Value | Count | Frequency (%) |
A | 201 | 34.7% |
B | 193 | 33.3% |
C | 120 | 20.7% |
E | 58 | 10.0% |
D | 8 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1279 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 226 | 17.7% |
A | 201 | 15.7% |
B | 193 | 15.1% |
0 | 190 | 14.9% |
1 | 188 | 14.7% |
C | 120 | 9.4% |
E | 58 | 4.5% |
3 | 36 | 2.8% |
4 | 17 | 1.3% |
7 | 12 | 0.9% |
Other values (5) | 38 | 3.0% |
클러스터분류레벨
Categorical
HIGH CORRELATION
Distinct | 5 |
---|---|
Distinct (%) | 1.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
5 | 201 |
---|---|
4 | 75 |
3 | 21 |
2 | 7 |
1 | 3 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
5 | 201 | 65.5% |
4 | 75 | 24.4% |
3 | 21 | 6.8% |
2 | 7 | 2.3% |
1 | 3 | 1.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
5 | 201 | 65.5% |
4 | 75 | 24.4% |
3 | 21 | 6.8% |
2 | 7 | 2.3% |
1 | 3 | 1.0% |
클러스터분류이름타이틀
Text
UNIQUE
Distinct | 307 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
Length
Max length | 17 |
---|---|
Median length | 17 |
Mean length | 16.410423 |
Min length | 12 |
Characters and Unicode
Total characters | 5038 |
---|---|
Distinct characters | 25 |
Distinct categories | 5 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 307 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | TCD_CLS_ID.A |
---|---|
2nd row | TCD_CLS_ID.B |
3rd row | TCD_CLS_ID.C |
4th row | TCD_CLS_ID.A1 |
5th row | TCD_CLS_ID.A2 |
Value | Count | Frequency (%) |
tcd_cls_id.b2e12 | 2 | 0.7% |
tcd_cls_id.a | 1 | 0.3% |
tcd_cls_id.b2e05d | 1 | 0.3% |
tcd_cls_id.b2b01a | 1 | 0.3% |
tcd_cls_id.b2a05a | 1 | 0.3% |
tcd_cls_id.b2a04c | 1 | 0.3% |
tcd_cls_id.b2a04b | 1 | 0.3% |
tcd_cls_id.b2a04a | 1 | 0.3% |
tcd_cls_id.b2a03h | 1 | 0.3% |
tcd_cls_id.b2a03f | 1 | 0.3% |
Other values (296) | 296 | 96.4% |
Most occurring characters
Value | Count | Frequency (%) |
C | 772 | 15.3% |
D | 643 | 12.8% |
_ | 614 | 12.2% |
I | 308 | 6.1% |
T | 307 | 6.1% |
L | 307 | 6.1% |
S | 307 | 6.1% |
. | 307 | 6.1% |
A | 284 | 5.6% |
0 | 262 | 5.2% |
Other values (15) | 927 | 18.4% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 3260 | 64.7% |
Decimal Number | 856 | 17.0% |
Connector Punctuation | 614 | 12.2% |
Other Punctuation | 307 | 6.1% |
Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
C | 772 | 23.7% |
D | 643 | 19.7% |
I | 308 | 9.4% |
T | 307 | 9.4% |
L | 307 | 9.4% |
S | 307 | 9.4% |
A | 284 | 8.7% |
B | 259 | 7.9% |
E | 66 | 2.0% |
F | 3 | 0.1% |
Other values (2) | 4 | 0.1% |
Decimal Number
Value | Count | Frequency (%) |
0 | 262 | 30.6% |
2 | 247 | 28.9% |
1 | 218 | 25.5% |
3 | 47 | 5.5% |
4 | 24 | 2.8% |
7 | 15 | 1.8% |
5 | 15 | 1.8% |
6 | 11 | 1.3% |
9 | 9 | 1.1% |
8 | 8 | 0.9% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 614 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 307 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 3260 | 64.7% |
Common | 1778 | 35.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
_ | 614 | 34.5% |
. | 307 | 17.3% |
0 | 262 | 14.7% |
2 | 247 | 13.9% |
1 | 218 | 12.3% |
3 | 47 | 2.6% |
4 | 24 | 1.3% |
7 | 15 | 0.8% |
5 | 15 | 0.8% |
6 | 11 | 0.6% |
Other values (3) | 18 | 1.0% |
Latin
Value | Count | Frequency (%) |
C | 772 | 23.7% |
D | 643 | 19.7% |
I | 308 | 9.4% |
T | 307 | 9.4% |
L | 307 | 9.4% |
S | 307 | 9.4% |
A | 284 | 8.7% |
B | 259 | 7.9% |
E | 66 | 2.0% |
F | 3 | 0.1% |
Other values (2) | 4 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 5038 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
C | 772 | 15.3% |
D | 643 | 12.8% |
_ | 614 | 12.2% |
I | 308 | 6.1% |
T | 307 | 6.1% |
L | 307 | 6.1% |
S | 307 | 6.1% |
. | 307 | 6.1% |
A | 284 | 5.6% |
0 | 262 | 5.2% |
Other values (15) | 927 | 18.4% |
표시순서
Real number (ℝ)
Distinct | 11 |
---|---|
Distinct (%) | 3.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.4820847 |
Minimum | 0 |
---|---|
Maximum | 10 |
Zeros | 1 |
Zeros (%) | 0.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.8 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 3 |
95-th percentile | 6 |
Maximum | 10 |
Range | 10 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.7514247 |
---|---|
Coefficient of variation (CV) | 0.70562648 |
Kurtosis | 3.1739993 |
Mean | 2.4820847 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.6997066 |
Sum | 762 |
Variance | 3.0674885 |
Monotonicity | Not monotonic |
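The descriptive statistics above can be recomputed from the value counts listed for 표시순서 in this report. A minimal sketch using Python's `statistics` module, assuming the reported variance and standard deviation are sample statistics (divisor n - 1), which is what the figures are consistent with:

```python
import statistics

# Value counts for 표시순서, copied from the frequency tables in this report.
value_counts = {0: 1, 1: 106, 2: 88, 3: 50, 4: 28, 5: 14,
                6: 6, 7: 5, 8: 5, 9: 3, 10: 1}

# Expand the counts back into one entry per observation.
values = [value for value, count in value_counts.items() for _ in range(count)]

n = len(values)
total = sum(values)
mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance, divisor n - 1
std_dev = statistics.stdev(values)      # sample standard deviation
cv = std_dev / mean                     # coefficient of variation

print(f"n={n} sum={total} mean={mean:.7f} median={median}")
print(f"variance={variance:.7f} std={std_dev:.7f} cv={cv:.8f}")
```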
Value | Count | Frequency (%) |
1 | 106 | 34.5% |
2 | 88 | 28.7% |
3 | 50 | 16.3% |
4 | 28 | 9.1% |
5 | 14 | 4.6% |
6 | 6 | 2.0% |
7 | 5 | 1.6% |
8 | 5 | 1.6% |
9 | 3 | 1.0% |
10 | 1 | 0.3% |
Value | Count | Frequency (%) |
0 | 1 | 0.3% |
1 | 106 | 34.5% |
2 | 88 | 28.7% |
3 | 50 | 16.3% |
4 | 28 | 9.1% |
5 | 14 | 4.6% |
6 | 6 | 2.0% |
7 | 5 | 1.6% |
8 | 5 | 1.6% |
9 | 3 | 1.0% |
Value | Count | Frequency (%) |
10 | 1 | 0.3% |
9 | 3 | 1.0% |
8 | 5 | 1.6% |
7 | 5 | 1.6% |
6 | 6 | 2.0% |
5 | 14 | 4.6% |
4 | 28 | 9.1% |
3 | 50 | 16.3% |
2 | 88 | 28.7% |
1 | 106 | 34.5% |
Variable | 생성일시 | 클러스터분류레벨 | 표시순서 |
---|---|---|---|
생성일시 | 1.000 | 1.000 | 0.000 |
클러스터분류레벨 | 1.000 | 1.000 | 0.000 |
표시순서 | 0.000 | 0.000 | 1.000 |
Variable | 생성일시 | 클러스터분류레벨 |
---|---|---|
생성일시 | 1.000 | 0.998 |
클러스터분류레벨 | 0.998 | 1.000 |
Variable | 표시순서 | 생성일시 | 클러스터분류레벨 |
---|---|---|---|
표시순서 | 1.000 | 0.000 | 0.000 |
생성일시 | 0.000 | 1.000 | 0.998 |
클러스터분류레벨 | 0.000 | 0.998 | 1.000 |
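The report does not name the measure behind the 0.998 value. One measure that reproduces it is bias-corrected Cramér's V, sketched below under two loudly stated assumptions: that this is the statistic used, and that each 생성일시 value maps to exactly one 클러스터분류레벨 (levels 1 to 4 each have one timestamp, level 5 has two). The sample rows confirm that mapping only for levels 1, 2 and 5; the rest is inferred from matching counts. `cramers_v_corrected` is an illustrative function, not the profiling tool's code.

```python
import math

# Assumed contingency table: rows are 클러스터분류레벨 1..5, columns are the six
# 생성일시 values in chronological order (9:43:32 ... 9:46:51).
table = [
    [3, 0, 0, 0, 0, 0],
    [0, 7, 0, 0, 0, 0],
    [0, 0, 21, 0, 0, 0],
    [0, 0, 0, 75, 0, 0],
    [0, 0, 0, 0, 66, 135],
]


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table (list of rows)."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    r, k = len(table), len(table[0])
    phi2 = chi2 / n
    # Bias correction of phi^2 and of the table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


v = cramers_v_corrected(table)
print(f"{v:.3f}")
```

Under these assumptions the high correlation is an artifact of loading each hierarchy level in its own batch, not an informative relationship between the two columns.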
Row | 클러스터분류 | 생성자 | 생성일시 | 상위클러스터분류ID | 클러스터분류레벨 | 클러스터분류이름타이틀 | 표시순서 |
---|---|---|---|---|---|---|---|
0 | A | 000000000000000UU0 | 2015-03-18 오전 9:43:32 | <NA> | 1 | TCD_CLS_ID.A | 1 |
1 | B | 000000000000000UU0 | 2015-03-18 오전 9:43:32 | <NA> | 1 | TCD_CLS_ID.B | 2 |
2 | C | 000000000000000UU0 | 2015-03-18 오전 9:43:32 | <NA> | 1 | TCD_CLS_ID.C | 3 |
3 | A1 | 000000000000000UU0 | 2015-03-18 오전 9:44:14 | A | 2 | TCD_CLS_ID.A1 | 1 |
4 | A2 | 000000000000000UU0 | 2015-03-18 오전 9:44:14 | A | 2 | TCD_CLS_ID.A2 | 2 |
5 | B1 | 000000000000000UU0 | 2015-03-18 오전 9:44:14 | B | 2 | TCD_CLS_ID.B1 | 1 |
6 | B2 | 000000000000000UU0 | 2015-03-18 오전 9:44:14 | B | 2 | TCD_CLS_ID.B2 | 2 |
7 | C1 | 000000000000000UU0 | 2015-03-18 오전 9:44:14 | C | 2 | TCD_CLS_ID.C1 | 1 |
8 | C2 | 000000000000000UU0 | 2015-03-18 오전 9:44:14 | C | 2 | TCD_CLS_ID.C2 | 2 |
9 | C3 | 000000000000000UU0 | 2015-03-18 오전 9:44:14 | C | 2 | TCD_CLS_ID.C3 | 3 |
Row | 클러스터분류 | 생성자 | 생성일시 | 상위클러스터분류ID | 클러스터분류레벨 | 클러스터분류이름타이틀 | 표시순서 |
---|---|---|---|---|---|---|---|
297 | C2C04A | 000000000000000UU0 | 2015-03-18 오전 9:46:51 | C2C04 | 5 | TCD_CLS_ID.C2C04A | 1 |
298 | C2C04B | 000000000000000UU0 | 2015-03-18 오전 9:46:51 | C2C04 | 5 | TCD_CLS_ID.C2C04B | 2 |
299 | C2C04C | 000000000000000UU0 | 2015-03-18 오전 9:46:51 | C2C04 | 5 | TCD_CLS_ID.C2C04C | 3 |
300 | C2C04D | 000000000000000UU0 | 2015-03-18 오전 9:46:51 | C2C04 | 5 | TCD_CLS_ID.C2C04D | 4 |
301 | C2C05A | 000000000000000UU0 | 2015-03-18 오전 9:46:51 | C2C05 | 5 | TCD_CLS_ID.C2C05A | 1 |
302 | C3A01A | 000000000000000UU0 | 2015-03-18 오전 9:46:51 | C3A01 | 5 | TCD_CLS_ID.C3A01A | 1 |
303 | C3A01B | 000000000000000UU0 | 2015-03-18 오전 9:46:51 | C3A01 | 5 | TCD_CLS_ID.C3A01B | 2 |
304 | C3A01C | 000000000000000UU0 | 2015-03-18 오전 9:46:51 | C3A01 | 5 | TCD_CLS_ID.C3A01C | 3 |
305 | C3B01A | 000000000000000UU0 | 2015-03-18 오전 9:46:51 | C3B01 | 5 | TCD_CLS_ID.C3B01A | 1 |
306 | C3B01B | 000000000000000UU0 | 2015-03-18 오전 9:46:51 | C3B01 | 5 | TCD_CLS_ID.C3B01B | 2 |
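The sample rows suggest that 상위클러스터분류ID points at the parent's 클러스터분류 value and that 클러스터분류레벨 is the depth in that tree. A minimal sketch that rebuilds the hierarchy from the first ten sample rows only and checks this reading; the tuple layout and the `depth` helper are illustrative, and `None` stands in for the `<NA>` parent of the three root rows:

```python
from collections import defaultdict

# (클러스터분류, 상위클러스터분류ID, 클러스터분류레벨) from the first ten sample rows.
rows = [
    ("A", None, 1), ("B", None, 1), ("C", None, 1),
    ("A1", "A", 2), ("A2", "A", 2),
    ("B1", "B", 2), ("B2", "B", 2),
    ("C1", "C", 2), ("C2", "C", 2), ("C3", "C", 2),
]

parents = {cluster_id: parent_id for cluster_id, parent_id, _ in rows}

# Group children under their parent; roots are grouped under None.
children = defaultdict(list)
for cluster_id, parent_id, _ in rows:
    children[parent_id].append(cluster_id)


def depth(cluster_id):
    """Count nodes on the path from cluster_id up to its root (root depth is 1)."""
    steps = 1
    while parents[cluster_id] is not None:
        cluster_id = parents[cluster_id]
        steps += 1
    return steps


# Rows whose stored level disagrees with the depth implied by the parent links.
mismatches = [cid for cid, _, level in rows if depth(cid) != level]
print(dict(children))
print("level mismatches:", mismatches)
```

Running the same check over all 307 rows would also explain the three missing 상위클러스터분류ID cells, which in this sample belong to the level 1 roots A, B and C.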