Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 10000 |
Missing cells | 29926 |
Missing cells (%) | 33.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 810.5 KiB |
Average record size in memory | 83.0 B |
Variable types
Text | 4 |
---|---|
Categorical | 2 |
Numeric | 2 |
DateTime | 1 |
Dataset
Description | 한국세라믹기술원 세라믹소재정보은행의 첨부파일 정보입니다. |
---|---|
Author | 한국세라믹기술원 |
URL | https://www.data.go.kr/data/15072093/fileData.do |
파일개수 is highly overall correlated with 파일크기 and 2 other fields | High correlation |
파일타입 is highly overall correlated with 파일크기 and 2 other fields | High correlation |
파일크기 is highly overall correlated with 파일타입 and 1 other fields | High correlation |
파일색인명 is highly overall correlated with 파일타입 and 1 other fields | High correlation |
파일타입 is highly imbalanced (51.1%) | Imbalance |
파일개수 is highly imbalanced (97.5%) | Imbalance |
파일크기 has 9976 (99.8%) missing values | Missing |
등록일 has 9975 (99.8%) missing values | Missing |
파일색인명 has 9975 (99.8%) missing values | Missing |
파일시퀀스 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 03:39:21.969624 |
---|---|
Analysis finished | 2023-12-12 03:39:23.661054 |
Duration | 1.69 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
파일시퀀스
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 14 |
---|---|
Median length | 14 |
Mean length | 14 |
Min length | 14 |
Characters and Unicode
Total characters | 140000 |
---|---|
Distinct characters | 17 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | FIL-1000048553 |
---|---|
2nd row | FIL-1000020380 |
3rd row | FIL-1000025674 |
4th row | FIL-1000017977 |
5th row | FIL-1000032838 |
Value | Count | Frequency (%) |
fil-1000048553 | 1 | < 0.1% |
fil-1000021482 | 1 | < 0.1% |
fil-1000082389 | 1 | < 0.1% |
fil-1000036105 | 1 | < 0.1% |
fil-1000094817 | 1 | < 0.1% |
fil-1000094680 | 1 | < 0.1% |
fil-1000044734 | 1 | < 0.1% |
fil-1000018397 | 1 | < 0.1% |
fil-1000064422 | 1 | < 0.1% |
fil-1000024640 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 43218 | |
1 | 17012 | 12.2% |
- | 10000 | 7.1% |
F | 9989 | 7.1% |
I | 9989 | 7.1% |
L | 9989 | 7.1% |
4 | 5407 | 3.9% |
5 | 5357 | 3.8% |
3 | 5208 | 3.7% |
8 | 5040 | 3.6% |
Other values (7) | 18791 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 100000 | |
Uppercase Letter | 30000 | 21.4% |
Dash Punctuation | 10000 | 7.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 43218 | |
1 | 17012 | 17.0% |
4 | 5407 | 5.4% |
5 | 5357 | 5.4% |
3 | 5208 | 5.2% |
8 | 5040 | 5.0% |
2 | 5006 | 5.0% |
6 | 4796 | 4.8% |
9 | 4591 | 4.6% |
7 | 4365 | 4.4% |
Uppercase Letter
Value | Count | Frequency (%) |
F | 9989 | |
I | 9989 | |
L | 9989 | |
B | 11 | < 0.1% |
A | 11 | < 0.1% |
K | 11 | < 0.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 110000 | |
Latin | 30000 | 21.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 43218 | |
1 | 17012 | 15.5% |
- | 10000 | 9.1% |
4 | 5407 | 4.9% |
5 | 5357 | 4.9% |
3 | 5208 | 4.7% |
8 | 5040 | 4.6% |
2 | 5006 | 4.6% |
6 | 4796 | 4.4% |
9 | 4591 | 4.2% |
Latin
Value | Count | Frequency (%) |
F | 9989 | |
I | 9989 | |
L | 9989 | |
B | 11 | < 0.1% |
A | 11 | < 0.1% |
K | 11 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 140000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 43218 | |
1 | 17012 | 12.2% |
- | 10000 | 7.1% |
F | 9989 | 7.1% |
I | 9989 | 7.1% |
L | 9989 | 7.1% |
4 | 5407 | 3.9% |
5 | 5357 | 3.8% |
3 | 5208 | 3.7% |
8 | 5040 | 3.6% |
Other values (7) | 18791 |
파일타입
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
F02001 | |
---|---|
F02002 | |
F02003 | |
F_MAMO | 37 |
F_NANO | 9 |
Length
Max length | 10 |
---|---|
Median length | 6 |
Mean length | 6.0016 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | F02001 |
---|---|
2nd row | F02001 |
3rd row | F02001 |
4th row | F02001 |
5th row | F02001 |
Common Values
Value | Count | Frequency (%) |
F02001 | 6571 | |
F02002 | 2393 | 23.9% |
F02003 | 986 | 9.9% |
F_MAMO | 37 | 0.4% |
F_NANO | 9 | 0.1% |
F_NANO_ETC | 4 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
f02001 | 6571 | |
f02002 | 2393 | 23.9% |
f02003 | 986 | 9.9% |
f_mamo | 37 | 0.4% |
f_nano | 9 | 0.1% |
f_nano_etc | 4 | < 0.1% |
원본명
Text
Distinct | 3442 |
---|---|
Distinct (%) | 34.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
1.04e+13 | 2072 | |
1.05e+13 | 1160 | 11.6% |
1.04e+14 | 1126 | 11.3% |
1.05e+12 | 771 | 7.7% |
1.02e+13 | 566 | 5.7% |
1.04e+12 | 383 | 3.8% |
1.02e+12 | 200 | 2.0% |
1.05e+11 | 142 | 1.4% |
1.04e+11 | 55 | 0.5% |
1.02e+11 | 22 | 0.2% |
Other values (3432) | 3503 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 21731 | |
1 | 20143 | |
E | 6584 | 6.7% |
+ | 6545 | 6.7% |
. | 6545 | 6.7% |
4 | 6176 | 6.3% |
3 | 5394 | 5.5% |
2 | 4016 | 4.1% |
5 | 3485 | 3.6% |
P | 2743 | 2.8% |
Other values (8) | 14454 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 67209 | |
Uppercase Letter | 14774 | 15.1% |
Math Symbol | 6545 | 6.7% |
Other Punctuation | 6545 | 6.7% |
Dash Punctuation | 2743 | 2.8% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 21731 | |
1 | 20143 | |
4 | 6176 | 9.2% |
3 | 5394 | 8.0% |
2 | 4016 | 6.0% |
5 | 3485 | 5.2% |
9 | 1729 | 2.6% |
8 | 1567 | 2.3% |
6 | 1539 | 2.3% |
7 | 1429 | 2.1% |
Uppercase Letter
Value | Count | Frequency (%) |
E | 6584 | |
P | 2743 | |
R | 2704 | |
O | 2704 | |
Q | 39 | 0.3% |
Math Symbol
Value | Count | Frequency (%) |
+ | 6545 |
Other Punctuation
Value | Count | Frequency (%) |
. | 6545 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2743 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 83042 | |
Latin | 14774 | 15.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 21731 | |
1 | 20143 | |
+ | 6545 | 7.9% |
. | 6545 | 7.9% |
4 | 6176 | 7.4% |
3 | 5394 | 6.5% |
2 | 4016 | 4.8% |
5 | 3485 | 4.2% |
- | 2743 | 3.3% |
9 | 1729 | 2.1% |
Other values (3) | 4535 | 5.5% |
Latin
Value | Count | Frequency (%) |
E | 6584 | |
P | 2743 | |
R | 2704 | |
O | 2704 | |
Q | 39 | 0.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 97816 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 21731 | |
1 | 20143 | |
E | 6584 | 6.7% |
+ | 6545 | 6.7% |
. | 6545 | 6.7% |
4 | 6176 | 6.3% |
3 | 5394 | 5.5% |
2 | 4016 | 4.1% |
5 | 3485 | 3.6% |
P | 2743 | 2.8% |
Other values (8) | 14454 |
실제파일명
Text
Distinct | 2466 |
---|---|
Distinct (%) | 24.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 82 |
---|---|
Median length | 80 |
Mean length | 17.6879 |
Min length | 5 |
Characters and Unicode
Total characters | 176879 |
---|---|
Distinct characters | 161 |
Distinct categories | 11 ? |
Distinct scripts | 4 ? |
Distinct blocks | 4 ? |
Unique
Unique | 1864 ? |
---|---|
Unique (%) | 18.6% |
Sample
1st row | 1050101022pic3.jpg |
---|---|
2nd row | 1040301010pic1.jpg |
3rd row | 1040301010pic1.jpg |
4th row | 1040301010pic1.jpg |
5th row | 1050101040pic1.jpg |
Value | Count | Frequency (%) |
1040301010pic1.jpg | 2107 | 16.7% |
1050101040pic1.jpg | 497 | 3.9% |
1050101050pic5.jpg | 244 | 1.9% |
1040301020pic45.jpg | 162 | 1.3% |
10202060pic90.jpg | 119 | 0.9% |
1040301020pic43.jpg | 118 | 0.9% |
1040301020pic19.jpg | 115 | 0.9% |
1050101050pic1.jpg | 113 | 0.9% |
1050101060pic6.jpg | 106 | 0.8% |
102010pic6.jpg | 101 | 0.8% |
Other values (2658) | 8930 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 33324 | |
1 | 24118 | |
p | 16518 | 9.3% |
. | 10333 | 5.8% |
g | 8871 | 5.0% |
i | 8579 | 4.9% |
c | 7798 | 4.4% |
j | 7640 | 4.3% |
3 | 6166 | 3.5% |
4 | 5858 | 3.3% |
Other values (151) | 47674 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 84153 | |
Lowercase Letter | 68728 | |
Other Punctuation | 10364 | 5.9% |
Uppercase Letter | 6644 | 3.8% |
Space Separator | 2612 | 1.5% |
Dash Punctuation | 2056 | 1.2% |
Connector Punctuation | 1024 | 0.6% |
Other Letter | 637 | 0.4% |
Open Punctuation | 326 | 0.2% |
Close Punctuation | 326 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
자 | 63 | 9.9% |
료 | 58 | 9.1% |
그 | 39 | 6.1% |
림 | 35 | 5.5% |
이 | 19 | 3.0% |
미 | 18 | 2.8% |
없 | 17 | 2.7% |
다 | 16 | 2.5% |
니 | 16 | 2.5% |
일 | 16 | 2.5% |
Other values (76) | 340 |
Lowercase Letter
Value | Count | Frequency (%) |
p | 16518 | |
g | 8871 | |
i | 8579 | |
c | 7798 | |
j | 7640 | |
e | 2179 | 3.2% |
r | 1865 | 2.7% |
t | 1862 | 2.7% |
a | 1716 | 2.5% |
n | 1612 | 2.3% |
Other values (16) | 10088 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 763 | |
J | 662 | 10.0% |
C | 643 | 9.7% |
S | 624 | 9.4% |
G | 584 | 8.8% |
D | 424 | 6.4% |
M | 335 | 5.0% |
E | 330 | 5.0% |
A | 311 | 4.7% |
F | 227 | 3.4% |
Other values (16) | 1741 |
Decimal Number
Value | Count | Frequency (%) |
0 | 33324 | |
1 | 24118 | |
3 | 6166 | 7.3% |
4 | 5858 | 7.0% |
2 | 5346 | 6.4% |
5 | 4195 | 5.0% |
6 | 1982 | 2.4% |
9 | 1314 | 1.6% |
7 | 1068 | 1.3% |
8 | 782 | 0.9% |
Other Punctuation
Value | Count | Frequency (%) |
. | 10333 | |
, | 14 | 0.1% |
% | 9 | 0.1% |
& | 6 | 0.1% |
' | 2 | < 0.1% |
Math Symbol
Value | Count | Frequency (%) |
= | 7 | |
~ | 1 | 11.1% |
+ | 1 | 11.1% |
Space Separator
Value | Count | Frequency (%) |
2612 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2056 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1024 |
Open Punctuation
Value | Count | Frequency (%) |
( | 326 |
Close Punctuation
Value | Count | Frequency (%) |
) | 326 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 100870 | |
Latin | 75372 | |
Hangul | 617 | 0.3% |
Katakana | 20 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
자 | 63 | 10.2% |
료 | 58 | 9.4% |
그 | 39 | 6.3% |
림 | 35 | 5.7% |
이 | 19 | 3.1% |
미 | 18 | 2.9% |
없 | 17 | 2.8% |
다 | 16 | 2.6% |
니 | 16 | 2.6% |
일 | 16 | 2.6% |
Other values (72) | 320 |
Latin
Value | Count | Frequency (%) |
p | 16518 | |
g | 8871 | |
i | 8579 | |
c | 7798 | |
j | 7640 | |
e | 2179 | 2.9% |
r | 1865 | 2.5% |
t | 1862 | 2.5% |
a | 1716 | 2.3% |
n | 1612 | 2.1% |
Other values (42) | 16732 |
Common
Value | Count | Frequency (%) |
0 | 33324 | |
1 | 24118 | |
. | 10333 | 10.2% |
3 | 6166 | 6.1% |
4 | 5858 | 5.8% |
2 | 5346 | 5.3% |
5 | 4195 | 4.2% |
2612 | 2.6% | |
- | 2056 | 2.0% |
6 | 1982 | 2.0% |
Other values (13) | 4880 | 4.8% |
Katakana
Value | Count | Frequency (%) |
カ | 5 | |
タ | 5 | |
グ | 5 | |
ロ | 5 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 176242 | |
Hangul | 610 | 0.3% |
Katakana | 20 | < 0.1% |
Compat Jamo | 7 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 33324 | |
1 | 24118 | |
p | 16518 | 9.4% |
. | 10333 | 5.9% |
g | 8871 | 5.0% |
i | 8579 | 4.9% |
c | 7798 | 4.4% |
j | 7640 | 4.3% |
3 | 6166 | 3.5% |
4 | 5858 | 3.3% |
Other values (65) | 47037 |
Hangul
Value | Count | Frequency (%) |
자 | 63 | 10.3% |
료 | 58 | 9.5% |
그 | 39 | 6.4% |
림 | 35 | 5.7% |
이 | 19 | 3.1% |
미 | 18 | 3.0% |
없 | 17 | 2.8% |
다 | 16 | 2.6% |
니 | 16 | 2.6% |
일 | 16 | 2.6% |
Other values (71) | 313 |
Compat Jamo
Value | Count | Frequency (%) |
ㄴ | 7 |
Katakana
Value | Count | Frequency (%) |
カ | 5 | |
タ | 5 | |
グ | 5 | |
ロ | 5 |
내부파일명
Text
Distinct | 2943 |
---|---|
Distinct (%) | 29.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 83 |
---|---|
Median length | 80 |
Mean length | 18.0193 |
Min length | 2 |
Characters and Unicode
Total characters | 180193 |
---|---|
Distinct characters | 153 |
Distinct categories | 11 ? |
Distinct scripts | 4 ? |
Distinct blocks | 4 ? |
Unique
Unique | 2448 ? |
---|---|
Unique (%) | 24.5% |
Sample
1st row | 1050101022pic3.jpg |
---|---|
2nd row | 1040301010pic1.jpg |
3rd row | 1040301010pic1.jpg |
4th row | 1040301010pic1.jpg |
5th row | 1050101040pic1.jpg |
Value | Count | Frequency (%) |
1040301010pic1.jpg | 2107 | 16.7% |
1050101040pic1.jpg | 497 | 4.0% |
1050101050pic5.jpg | 244 | 1.9% |
1040301020pic45.jpg | 162 | 1.3% |
10202060pic90.jpg | 119 | 0.9% |
1040301020pic43.jpg | 118 | 0.9% |
1040301020pic19.jpg | 115 | 0.9% |
1050101050pic1.jpg | 113 | 0.9% |
1050101060pic6.jpg | 106 | 0.8% |
102010pic6.jpg | 101 | 0.8% |
Other values (3153) | 8898 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 35492 | |
1 | 24966 | |
p | 16033 | 8.9% |
. | 9797 | 5.4% |
g | 8831 | 4.9% |
i | 8551 | 4.7% |
c | 7795 | 4.3% |
j | 7634 | 4.2% |
3 | 6457 | 3.6% |
4 | 6105 | 3.4% |
Other values (143) | 48532 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 89623 | |
Lowercase Letter | 67160 | |
Other Punctuation | 9828 | 5.5% |
Uppercase Letter | 6774 | 3.8% |
Space Separator | 2580 | 1.4% |
Dash Punctuation | 2056 | 1.1% |
Connector Punctuation | 1034 | 0.6% |
Other Letter | 477 | 0.3% |
Close Punctuation | 326 | 0.2% |
Open Punctuation | 326 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
자 | 63 | 13.2% |
료 | 58 | 12.2% |
그 | 39 | 8.2% |
림 | 35 | 7.3% |
미 | 18 | 3.8% |
대 | 13 | 2.7% |
학 | 12 | 2.5% |
회 | 11 | 2.3% |
도 | 11 | 2.3% |
전 | 9 | 1.9% |
Other values (68) | 208 |
Lowercase Letter
Value | Count | Frequency (%) |
p | 16033 | |
g | 8831 | |
i | 8551 | |
c | 7795 | |
j | 7634 | |
e | 2179 | 3.2% |
r | 1865 | 2.8% |
t | 1861 | 2.8% |
a | 1715 | 2.6% |
n | 1611 | 2.4% |
Other values (16) | 9085 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 764 | |
J | 663 | 9.8% |
C | 638 | 9.4% |
S | 624 | 9.2% |
G | 584 | 8.6% |
D | 424 | 6.3% |
M | 351 | 5.2% |
E | 346 | 5.1% |
A | 311 | 4.6% |
F | 242 | 3.6% |
Other values (16) | 1827 |
Decimal Number
Value | Count | Frequency (%) |
0 | 35492 | |
1 | 24966 | |
3 | 6457 | 7.2% |
4 | 6105 | 6.8% |
2 | 5671 | 6.3% |
5 | 4502 | 5.0% |
6 | 2223 | 2.5% |
9 | 1560 | 1.7% |
8 | 1386 | 1.5% |
7 | 1261 | 1.4% |
Other Punctuation
Value | Count | Frequency (%) |
. | 9797 | |
, | 14 | 0.1% |
% | 9 | 0.1% |
& | 6 | 0.1% |
' | 2 | < 0.1% |
Math Symbol
Value | Count | Frequency (%) |
= | 7 | |
~ | 1 | 11.1% |
+ | 1 | 11.1% |
Space Separator
Value | Count | Frequency (%) |
2580 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2056 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1034 |
Close Punctuation
Value | Count | Frequency (%) |
) | 326 |
Open Punctuation
Value | Count | Frequency (%) |
( | 326 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 105782 | |
Latin | 73934 | |
Hangul | 457 | 0.3% |
Katakana | 20 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
자 | 63 | 13.8% |
료 | 58 | 12.7% |
그 | 39 | 8.5% |
림 | 35 | 7.7% |
미 | 18 | 3.9% |
대 | 13 | 2.8% |
학 | 12 | 2.6% |
회 | 11 | 2.4% |
도 | 11 | 2.4% |
전 | 9 | 2.0% |
Other values (64) | 188 |
Latin
Value | Count | Frequency (%) |
p | 16033 | |
g | 8831 | |
i | 8551 | |
c | 7795 | |
j | 7634 | |
e | 2179 | 2.9% |
r | 1865 | 2.5% |
t | 1861 | 2.5% |
a | 1715 | 2.3% |
n | 1611 | 2.2% |
Other values (42) | 15859 |
Common
Value | Count | Frequency (%) |
0 | 35492 | |
1 | 24966 | |
. | 9797 | 9.3% |
3 | 6457 | 6.1% |
4 | 6105 | 5.8% |
2 | 5671 | 5.4% |
5 | 4502 | 4.3% |
2580 | 2.4% | |
6 | 2223 | 2.1% |
- | 2056 | 1.9% |
Other values (13) | 5933 | 5.6% |
Katakana
Value | Count | Frequency (%) |
ロ | 5 | |
グ | 5 | |
タ | 5 | |
カ | 5 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 179716 | |
Hangul | 450 | 0.2% |
Katakana | 20 | < 0.1% |
Compat Jamo | 7 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 35492 | |
1 | 24966 | |
p | 16033 | 8.9% |
. | 9797 | 5.5% |
g | 8831 | 4.9% |
i | 8551 | 4.8% |
c | 7795 | 4.3% |
j | 7634 | 4.2% |
3 | 6457 | 3.6% |
4 | 6105 | 3.4% |
Other values (65) | 48055 |
Hangul
Value | Count | Frequency (%) |
자 | 63 | 14.0% |
료 | 58 | 12.9% |
그 | 39 | 8.7% |
림 | 35 | 7.8% |
미 | 18 | 4.0% |
대 | 13 | 2.9% |
학 | 12 | 2.7% |
회 | 11 | 2.4% |
도 | 11 | 2.4% |
전 | 9 | 2.0% |
Other values (63) | 181 |
Compat Jamo
Value | Count | Frequency (%) |
ㄴ | 7 |
Katakana
Value | Count | Frequency (%) |
ロ | 5 | |
グ | 5 | |
タ | 5 | |
カ | 5 |
파일크기
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 23 |
---|---|
Distinct (%) | 95.8% |
Missing | 9976 |
Missing (%) | 99.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 669227.33 |
Minimum | 4190 |
---|---|
Maximum | 5095819 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 4190 |
---|---|
5-th percentile | 9824.45 |
Q1 | 46716.75 |
median | 84977.5 |
Q3 | 404990.75 |
95-th percentile | 2326219.4 |
Maximum | 5095819 |
Range | 5091629 |
Interquartile range (IQR) | 358274 |
Descriptive statistics
Standard deviation | 1240283.8 |
---|---|
Coefficient of variation (CV) | 1.8533071 |
Kurtosis | 6.3354369 |
Mean | 669227.33 |
Median Absolute Deviation (MAD) | 70225 |
Skewness | 2.4321328 |
Sum | 16061456 |
Variance | 1.5383038 × 1012 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
230285 | 2 | < 0.1% |
4190 | 1 | < 0.1% |
367135 | 1 | < 0.1% |
90323 | 1 | < 0.1% |
73075 | 1 | < 0.1% |
23901 | 1 | < 0.1% |
518558 | 1 | < 0.1% |
2232620 | 1 | < 0.1% |
104329 | 1 | < 0.1% |
84271 | 1 | < 0.1% |
Other values (13) | 13 | 0.1% |
(Missing) | 9976 |
Value | Count | Frequency (%) |
4190 | 1 | |
9521 | 1 | |
11544 | 1 | |
17961 | 1 | |
19068 | 1 | |
23901 | 1 | |
54322 | 1 | |
57696 | 1 | |
66263 | 1 | |
73075 | 1 |
Value | Count | Frequency (%) |
5095819 | 1 | |
2342737 | 1 | |
2232620 | 1 | |
2219188 | 1 | |
2049000 | 1 | |
518558 | 1 | |
367135 | 1 | |
230285 | 2 | |
104329 | 1 | |
90323 | 1 |
등록일
Date
MISSING
 
Distinct | 13 |
---|---|
Distinct (%) | 52.0% |
Missing | 9975 |
Missing (%) | 99.8% |
Memory size | 156.2 KiB |
Minimum | 2008-08-05 00:00:00 |
---|---|
Maximum | 2010-12-15 00:00:00 |
파일개수
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
1 | 25 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9925 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9975 | |
1 | 25 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9975 | |
1 | 25 | 0.2% |
파일색인명
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 25 |
---|---|
Distinct (%) | 100.0% |
Missing | 9975 |
Missing (%) | 99.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 752.16 |
Minimum | 226 |
---|---|
Maximum | 872 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 226 |
---|---|
5-th percentile | 694.8 |
Q1 | 733 |
median | 765 |
Q3 | 802 |
95-th percentile | 855.6 |
Maximum | 872 |
Range | 646 |
Interquartile range (IQR) | 69 |
Descriptive statistics
Standard deviation | 119.92209 |
---|---|
Coefficient of variation (CV) | 0.15943694 |
Kurtosis | 16.557659 |
Mean | 752.16 |
Median Absolute Deviation (MAD) | 37 |
Skewness | -3.7022422 |
Sum | 18804 |
Variance | 14381.307 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
226 | 1 | < 0.1% |
789 | 1 | < 0.1% |
692 | 1 | < 0.1% |
733 | 1 | < 0.1% |
762 | 1 | < 0.1% |
806 | 1 | < 0.1% |
802 | 1 | < 0.1% |
736 | 1 | < 0.1% |
765 | 1 | < 0.1% |
706 | 1 | < 0.1% |
Other values (15) | 15 | 0.1% |
(Missing) | 9975 |
Value | Count | Frequency (%) |
226 | 1 | |
692 | 1 | |
706 | 1 | |
707 | 1 | |
716 | 1 | |
723 | 1 | |
733 | 1 | |
736 | 1 | |
751 | 1 | |
761 | 1 |
Value | Count | Frequency (%) |
872 | 1 | |
857 | 1 | |
850 | 1 | |
831 | 1 | |
826 | 1 | |
806 | 1 | |
802 | 1 | |
796 | 1 | |
789 | 1 | |
780 | 1 |
파일타입 | 파일크기 | 등록일 | 파일색인명 | |
---|---|---|---|---|
파일타입 | 1.000 | NaN | NaN | NaN |
파일크기 | NaN | 1.000 | 0.285 | 0.000 |
등록일 | NaN | 0.285 | 1.000 | 1.000 |
파일색인명 | NaN | 0.000 | 1.000 | 1.000 |
파일개수 | 파일타입 | |
---|---|---|
파일개수 | 1.000 | 1.000 |
파일타입 | 1.000 | 1.000 |
파일크기 | 파일색인명 | 파일타입 | 파일개수 | |
---|---|---|---|---|
파일크기 | 1.000 | -0.141 | 1.000 | 1.000 |
파일색인명 | -0.141 | 1.000 | 1.000 | 1.000 |
파일타입 | 1.000 | 1.000 | 1.000 | 1.000 |
파일개수 | 1.000 | 1.000 | 1.000 | 1.000 |
파일시퀀스 | 파일타입 | 원본명 | 실제파일명 | 내부파일명 | 파일크기 | 등록일 | 파일개수 | 파일색인명 | |
---|---|---|---|---|---|---|---|---|---|
12401 | FIL-1000048553 | F02001 | 1.05E+13 | 1050101022pic3.jpg | 1050101022pic3.jpg | <NA> | <NA> | <NA> | <NA> |
51309 | FIL-1000020380 | F02001 | 1.04E+13 | 1040301010pic1.jpg | 1040301010pic1.jpg | <NA> | <NA> | <NA> | <NA> |
44877 | FIL-1000025674 | F02001 | 1.04E+14 | 1040301010pic1.jpg | 1040301010pic1.jpg | <NA> | <NA> | <NA> | <NA> |
50239 | FIL-1000017977 | F02001 | 1.04E+13 | 1040301010pic1.jpg | 1040301010pic1.jpg | <NA> | <NA> | <NA> | <NA> |
55423 | FIL-1000032838 | F02001 | 1.05E+13 | 1050101040pic1.jpg | 1050101040pic1.jpg | <NA> | <NA> | <NA> | <NA> |
75991 | FIL-1000093488 | F02003 | PRO-1000062694 | Fig_9.jpg | Fig_923.jpg | <NA> | <NA> | <NA> | <NA> |
30298 | FIL-1000002381 | F02001 | 1.02E+13 | 102010pic19.jpg | 102010pic19.jpg | <NA> | <NA> | <NA> | <NA> |
38535 | FIL-1000013851 | F02001 | 1.04E+13 | 1040301020pic21.jpg | 1040301020pic21.jpg | <NA> | <NA> | <NA> | <NA> |
34385 | FIL-1000008017 | F02001 | 1.04E+12 | 1040301010pic1.jpg | 1040301010pic1.jpg | <NA> | <NA> | <NA> | <NA> |
98598 | FIL-1000179037 | F02002 | PRO-1000124481 | fig.2 Thermal expansion.jpg | fig.2 Thermal expansion.jpg | <NA> | <NA> | <NA> | <NA> |
파일시퀀스 | 파일타입 | 원본명 | 실제파일명 | 내부파일명 | 파일크기 | 등록일 | 파일개수 | 파일색인명 | |
---|---|---|---|---|---|---|---|---|---|
13895 | FIL-1000039381 | F02001 | 1.04E+13 | 1040301030pic19.jpg | 1040301030pic19.jpg | <NA> | <NA> | <NA> | <NA> |
28235 | FIL-1000004735 | F02001 | 1.02E+13 | 102010pic10.jpg | 102010pic10.jpg | <NA> | <NA> | <NA> | <NA> |
78228 | FIL-1000095711 | F_MAMO | EQP-0000000052 | wearloss.jpg | wearloss1000095711.jpg | <NA> | <NA> | <NA> | <NA> |
6430 | FIL-1000041067 | F02001 | 1.04E+13 | 1040301020pic47.jpg | 1040301020pic47.jpg | <NA> | <NA> | <NA> | <NA> |
23218 | FIL-1000055857 | F02001 | 2.01E+11 | 20105110pic13.jpg | 20105110pic13.jpg | <NA> | <NA> | <NA> | <NA> |
59819 | FIL-1000049348 | F02001 | 1.04E+13 | 1040301060pic7.jpg | 1040301060pic7.jpg | <NA> | <NA> | <NA> | <NA> |
6509 | FIL-1000061950 | F02001 | 1.05E+12 | 1050101060pic3.jpg | 1050101060pic3.jpg | <NA> | <NA> | <NA> | <NA> |
46043 | FIL-1000025940 | F02001 | 1.04E+14 | 1040301010pic1.jpg | 1040301010pic1.jpg | <NA> | <NA> | <NA> | <NA> |
37001 | FIL-1000017756 | F02001 | 1.04E+13 | 1040301010pic1.jpg | 1040301010pic1.jpg | <NA> | <NA> | <NA> | <NA> |
28397 | FIL-1000003935 | F02001 | 1.02E+13 | 102010pic11.jpg | 102010pic11.jpg | <NA> | <NA> | <NA> | <NA> |