Dataset statistics
Number of variables | 15 |
---|---|
Number of observations | 2260 |
Missing cells | 10 |
Missing cells (%) | < 0.1% |
Duplicate rows | 158 |
Duplicate rows (%) | 7.0% |
Total size in memory | 289.2 KiB |
Average record size in memory | 131.1 B |
Variable types
Categorical | 5 |
---|---|
Text | 2 |
Numeric | 8 |
Dataset
Description | Sample |
---|---|
Author | ㈜한국금융솔루션 |
URL | https://www.bigdata-finance.kr/dataset/datasetView.do?datastId=SET1300048 |
Dataset has 158 (7.0%) duplicate rows | Duplicates |
중립문서개수 is highly overall correlated with 감성점수값 and 11 other fields | High correlation |
분석일시 is highly overall correlated with 감성점수값 and 11 other fields | High correlation |
기준일자 is highly overall correlated with 감성점수값 and 11 other fields | High correlation |
주식시장명 is highly overall correlated with 전체문서개수 and 8 other fields | High correlation |
이전중립문서개수 is highly overall correlated with 감성점수값 and 11 other fields | High correlation |
감성점수값 is highly overall correlated with 감성레벨값 and 8 other fields | High correlation |
감성레벨값 is highly overall correlated with 감성점수값 and 8 other fields | High correlation |
전체문서개수 is highly overall correlated with 감성점수값 and 11 other fields | High correlation |
긍정문서개수 is highly overall correlated with 감성점수값 and 9 other fields | High correlation |
부정문서개수 is highly overall correlated with 전체문서개수 and 6 other fields | High correlation |
이전전체문서개수 is highly overall correlated with 감성점수값 and 11 other fields | High correlation |
이전긍정문서개수 is highly overall correlated with 감성점수값 and 9 other fields | High correlation |
이전부정문서개수 is highly overall correlated with 전체문서개수 and 7 other fields | High correlation |
기준일자 is highly imbalanced (99.4%) | Imbalance |
주식시장명 is highly imbalanced (62.7%) | Imbalance |
분석일시 is highly imbalanced (99.4%) | Imbalance |
중립문서개수 is highly imbalanced (99.4%) | Imbalance |
이전중립문서개수 is highly imbalanced (99.4%) | Imbalance |
감성점수값 has 1444 (63.9%) zeros | Zeros |
감성레벨값 has 1404 (62.1%) zeros | Zeros |
전체문서개수 has 190 (8.4%) zeros | Zeros |
긍정문서개수 has 1508 (66.7%) zeros | Zeros |
이전전체문서개수 has 172 (7.6%) zeros | Zeros |
이전긍정문서개수 has 1460 (64.6%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 13:06:01.352749 |
---|---|
Analysis finished | 2023-12-10 13:06:11.098258 |
Duration | 9.75 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
기준일자
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 17.8 KiB |
20211022 | |
---|---|
2259 217857 20211023024014 F_BBP20_00044 | 1 |
Length
Max length | 40 |
---|---|
Median length | 8 |
Mean length | 8.0141593 |
Min length | 8 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 20211022 |
---|---|
2nd row | 20211022 |
3rd row | 20211022 |
4th row | 20211022 |
5th row | 20211022 |
Common Values
Value | Count | Frequency (%) |
20211022 | 2259 | |
2259 217857 20211023024014 F_BBP20_00044 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20211022 | 2259 | |
2259 | 1 | < 0.1% |
217857 | 1 | < 0.1% |
20211023024014 | 1 | < 0.1% |
f_bbp20_00044 | 1 | < 0.1% |
주식종목값
Text
Distinct | 251 |
---|---|
Distinct (%) | 11.1% |
Missing | 1 |
Missing (%) | < 0.1% |
Memory size | 17.8 KiB |
Value | Count | Frequency (%) |
total | 9 | 0.4% |
kr7071050009 | 9 | 0.4% |
kr7069620003 | 9 | 0.4% |
kr7103140000 | 9 | 0.4% |
kr7138930003 | 9 | 0.4% |
kr7139480008 | 9 | 0.4% |
kr7145990008 | 9 | 0.4% |
kr7161390000 | 9 | 0.4% |
kr7161890009 | 9 | 0.4% |
kr7170900005 | 9 | 0.4% |
Other values (241) | 2169 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 10674 | |
7 | 3123 | 11.6% |
K | 2250 | 8.4% |
R | 2232 | 8.3% |
1 | 1332 | 4.9% |
3 | 1242 | 4.6% |
2 | 1080 | 4.0% |
4 | 1062 | 3.9% |
5 | 981 | 3.6% |
6 | 972 | 3.6% |
Other values (11) | 1980 | 7.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 22320 | |
Uppercase Letter | 4608 | 17.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
K | 2250 | |
R | 2232 | |
O | 27 | 0.6% |
T | 18 | 0.4% |
S | 18 | 0.4% |
A | 18 | 0.4% |
Q | 9 | 0.2% |
D | 9 | 0.2% |
L | 9 | 0.2% |
P | 9 | 0.2% |
Decimal Number
Value | Count | Frequency (%) |
0 | 10674 | |
7 | 3123 | 14.0% |
1 | 1332 | 6.0% |
3 | 1242 | 5.6% |
2 | 1080 | 4.8% |
4 | 1062 | 4.8% |
5 | 981 | 4.4% |
6 | 972 | 4.4% |
8 | 963 | 4.3% |
9 | 891 | 4.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 22320 | |
Latin | 4608 | 17.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
K | 2250 | |
R | 2232 | |
O | 27 | 0.6% |
T | 18 | 0.4% |
S | 18 | 0.4% |
A | 18 | 0.4% |
Q | 9 | 0.2% |
D | 9 | 0.2% |
L | 9 | 0.2% |
P | 9 | 0.2% |
Common
Value | Count | Frequency (%) |
0 | 10674 | |
7 | 3123 | 14.0% |
1 | 1332 | 6.0% |
3 | 1242 | 5.6% |
2 | 1080 | 4.8% |
4 | 1062 | 4.8% |
5 | 981 | 4.4% |
6 | 972 | 4.4% |
8 | 963 | 4.3% |
9 | 891 | 4.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 26928 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 10674 | |
7 | 3123 | 11.6% |
K | 2250 | 8.4% |
R | 2232 | 8.3% |
1 | 1332 | 4.9% |
3 | 1242 | 4.6% |
2 | 1080 | 4.0% |
4 | 1062 | 3.9% |
5 | 981 | 3.6% |
6 | 972 | 3.6% |
Other values (11) | 1980 | 7.4% |
주식종목명
Text
Distinct | 251 |
---|---|
Distinct (%) | 11.1% |
Missing | 1 |
Missing (%) | < 0.1% |
Memory size | 17.8 KiB |
Value | Count | Frequency (%) |
total | 9 | 0.4% |
금호타이어 | 9 | 0.4% |
풍산 | 9 | 0.4% |
bnk금융지주 | 9 | 0.4% |
이마트 | 9 | 0.4% |
삼양사 | 9 | 0.4% |
한국타이어 | 9 | 0.4% |
한국콜마 | 9 | 0.4% |
동아에스티 | 9 | 0.4% |
종근당 | 9 | 0.4% |
Other values (242) | 2178 |
Most occurring characters
Value | Count | Frequency (%) |
스 | 396 | 4.0% |
한 | 315 | 3.2% |
이 | 288 | 2.9% |
대 | 234 | 2.4% |
S | 216 | 2.2% |
K | 153 | 1.5% |
성 | 144 | 1.4% |
에 | 135 | 1.4% |
G | 135 | 1.4% |
삼 | 135 | 1.4% |
Other values (250) | 7785 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 8586 | |
Uppercase Letter | 1260 | 12.7% |
Other Punctuation | 54 | 0.5% |
Lowercase Letter | 18 | 0.2% |
Space Separator | 9 | 0.1% |
Dash Punctuation | 9 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
스 | 396 | 4.6% |
한 | 315 | 3.7% |
이 | 288 | 3.4% |
대 | 234 | 2.7% |
성 | 144 | 1.7% |
에 | 135 | 1.6% |
삼 | 135 | 1.6% |
트 | 126 | 1.5% |
국 | 126 | 1.5% |
화 | 126 | 1.5% |
Other values (225) | 6561 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 216 | |
K | 153 | |
G | 135 | |
L | 126 | |
C | 117 | |
T | 81 | 6.4% |
O | 63 | 5.0% |
P | 54 | 4.3% |
B | 54 | 4.3% |
J | 45 | 3.6% |
Other values (10) | 216 |
Lowercase Letter
Value | Count | Frequency (%) |
l | 9 | |
i | 9 |
Other Punctuation
Value | Count | Frequency (%) |
& | 54 |
Space Separator
Value | Count | Frequency (%) |
9 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 8586 | |
Latin | 1278 | 12.9% |
Common | 72 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
스 | 396 | 4.6% |
한 | 315 | 3.7% |
이 | 288 | 3.4% |
대 | 234 | 2.7% |
성 | 144 | 1.7% |
에 | 135 | 1.6% |
삼 | 135 | 1.6% |
트 | 126 | 1.5% |
국 | 126 | 1.5% |
화 | 126 | 1.5% |
Other values (225) | 6561 |
Latin
Value | Count | Frequency (%) |
S | 216 | |
K | 153 | |
G | 135 | |
L | 126 | |
C | 117 | |
T | 81 | 6.3% |
O | 63 | 4.9% |
P | 54 | 4.2% |
B | 54 | 4.2% |
J | 45 | 3.5% |
Other values (12) | 234 |
Common
Value | Count | Frequency (%) |
& | 54 | |
9 | 12.5% | |
- | 9 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 8586 | |
ASCII | 1350 | 13.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
스 | 396 | 4.6% |
한 | 315 | 3.7% |
이 | 288 | 3.4% |
대 | 234 | 2.7% |
성 | 144 | 1.7% |
에 | 135 | 1.6% |
삼 | 135 | 1.6% |
트 | 126 | 1.5% |
국 | 126 | 1.5% |
화 | 126 | 1.5% |
Other values (225) | 6561 |
ASCII
Value | Count | Frequency (%) |
S | 216 | |
K | 153 | |
G | 135 | |
L | 126 | |
C | 117 | 8.7% |
T | 81 | 6.0% |
O | 63 | 4.7% |
P | 54 | 4.0% |
& | 54 | 4.0% |
B | 54 | 4.0% |
Other values (15) | 297 |
주식시장명
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 17.8 KiB |
KOSPI | |
---|---|
KOSDAQ | |
TOTAL | 9 |
<NA> | 1 |
Length
Max length | 6 |
---|---|
Median length | 5 |
Mean length | 5.190708 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | TOTAL |
---|---|
2nd row | KOSPI |
3rd row | KOSDAQ |
4th row | KOSPI |
5th row | KOSPI |
Common Values
Value | Count | Frequency (%) |
KOSPI | 1818 | |
KOSDAQ | 432 | 19.1% |
TOTAL | 9 | 0.4% |
<NA> | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
kospi | 1818 | |
kosdaq | 432 | 19.1% |
total | 9 | 0.4% |
na | 1 | < 0.1% |
분석일시
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 17.8 KiB |
202110 | |
---|---|
<NA> | 1 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 5.999115 |
Min length | 4 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 202110 |
---|---|
2nd row | 202110 |
3rd row | 202110 |
4th row | 202110 |
5th row | 202110 |
Common Values
Value | Count | Frequency (%) |
202110 | 2259 | |
<NA> | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
202110 | 2259 | |
na | 1 | < 0.1% |
감성점수값
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 737 |
---|---|
Distinct (%) | 32.6% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20.678477 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 1444 |
Zeros (%) | 63.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 42.15 |
95-th percentile | 92.714 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 42.15 |
Descriptive statistics
Standard deviation | 32.981819 |
---|---|
Coefficient of variation (CV) | 1.594983 |
Kurtosis | -0.073564863 |
Mean | 20.678477 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.24116 |
Sum | 46712.68 |
Variance | 1087.8004 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 1444 | |
100.0 | 22 | 1.0% |
99.98 | 7 | 0.3% |
9.04 | 5 | 0.2% |
99.09 | 4 | 0.2% |
16.56 | 4 | 0.2% |
0.3 | 3 | 0.1% |
0.15 | 3 | 0.1% |
99.99 | 3 | 0.1% |
87.46 | 2 | 0.1% |
Other values (727) | 762 |
Value | Count | Frequency (%) |
0.0 | 1444 | |
0.07 | 1 | < 0.1% |
0.15 | 3 | 0.1% |
0.22 | 2 | 0.1% |
0.27 | 2 | 0.1% |
0.3 | 3 | 0.1% |
0.31 | 1 | < 0.1% |
0.33 | 2 | 0.1% |
0.38 | 1 | < 0.1% |
0.39 | 1 | < 0.1% |
Value | Count | Frequency (%) |
100.0 | 22 | |
99.99 | 3 | 0.1% |
99.98 | 7 | 0.3% |
99.97 | 2 | 0.1% |
99.96 | 2 | 0.1% |
99.95 | 2 | 0.1% |
99.85 | 1 | < 0.1% |
99.74 | 1 | < 0.1% |
99.41 | 1 | < 0.1% |
99.37 | 1 | < 0.1% |
감성레벨값
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 800 |
---|---|
Distinct (%) | 35.4% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17.542625 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 1404 |
Zeros (%) | 62.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 27.28 |
95-th percentile | 89.621 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 27.28 |
Descriptive statistics
Standard deviation | 29.413018 |
---|---|
Coefficient of variation (CV) | 1.67666 |
Kurtosis | 0.97581699 |
Mean | 17.542625 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.5324688 |
Sum | 39628.79 |
Variance | 865.12562 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 1404 | |
99.99 | 4 | 0.2% |
100.0 | 4 | 0.2% |
0.08 | 4 | 0.2% |
7.91 | 3 | 0.1% |
44.62 | 3 | 0.1% |
95.98 | 3 | 0.1% |
0.01 | 3 | 0.1% |
83.41 | 3 | 0.1% |
27.28 | 2 | 0.1% |
Other values (790) | 826 |
Value | Count | Frequency (%) |
0.0 | 1404 | |
0.01 | 3 | 0.1% |
0.02 | 1 | < 0.1% |
0.03 | 2 | 0.1% |
0.04 | 1 | < 0.1% |
0.05 | 1 | < 0.1% |
0.08 | 4 | 0.2% |
0.09 | 1 | < 0.1% |
0.1 | 1 | < 0.1% |
0.12 | 2 | 0.1% |
Value | Count | Frequency (%) |
100.0 | 4 | |
99.99 | 4 | |
99.98 | 1 | < 0.1% |
99.93 | 1 | < 0.1% |
99.92 | 1 | < 0.1% |
99.91 | 1 | < 0.1% |
99.83 | 1 | < 0.1% |
99.76 | 1 | < 0.1% |
99.58 | 1 | < 0.1% |
99.52 | 2 |
전체문서개수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 409 |
---|---|
Distinct (%) | 18.1% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 311.67596 |
Minimum | 0 |
---|---|
Maximum | 35443 |
Zeros | 190 |
Zeros (%) | 8.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 4 |
median | 18 |
Q3 | 70 |
95-th percentile | 546.2 |
Maximum | 35443 |
Range | 35443 |
Interquartile range (IQR) | 66 |
Descriptive statistics
Standard deviation | 2234.0305 |
---|---|
Coefficient of variation (CV) | 7.1677984 |
Kurtosis | 136.6604 |
Mean | 311.67596 |
Median Absolute Deviation (MAD) | 17 |
Skewness | 11.340446 |
Sum | 704076 |
Variance | 4990892.1 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 190 | 8.4% |
1 | 149 | 6.6% |
2 | 97 | 4.3% |
3 | 94 | 4.2% |
4 | 76 | 3.4% |
6 | 49 | 2.2% |
12 | 47 | 2.1% |
5 | 47 | 2.1% |
9 | 44 | 1.9% |
8 | 44 | 1.9% |
Other values (399) | 1422 |
Value | Count | Frequency (%) |
0 | 190 | |
1 | 149 | |
2 | 97 | |
3 | 94 | |
4 | 76 | 3.4% |
5 | 47 | 2.1% |
6 | 49 | 2.2% |
7 | 43 | 1.9% |
8 | 44 | 1.9% |
9 | 44 | 1.9% |
Value | Count | Frequency (%) |
35443 | 1 | |
32782 | 1 | |
30681 | 1 | |
29241 | 1 | |
28672 | 1 | |
27284 | 1 | |
26305 | 1 | |
25692 | 1 | |
24081 | 1 | |
23294 | 1 |
긍정문서개수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 287 |
---|---|
Distinct (%) | 12.7% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 235.68659 |
Minimum | 0 |
---|---|
Maximum | 35134 |
Zeros | 1508 |
Zeros (%) | 66.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 14 |
95-th percentile | 270.1 |
Maximum | 35134 |
Range | 35134 |
Interquartile range (IQR) | 14 |
Descriptive statistics
Standard deviation | 2184.817 |
---|---|
Coefficient of variation (CV) | 9.27001 |
Kurtosis | 145.7591 |
Mean | 235.68659 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 11.844217 |
Sum | 532416 |
Variance | 4773425.4 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1508 | |
1 | 41 | 1.8% |
4 | 19 | 0.8% |
5 | 18 | 0.8% |
2 | 17 | 0.8% |
11 | 16 | 0.7% |
3 | 15 | 0.7% |
9 | 13 | 0.6% |
19 | 13 | 0.6% |
6 | 11 | 0.5% |
Other values (277) | 588 | 26.0% |
Value | Count | Frequency (%) |
0 | 1508 | |
1 | 41 | 1.8% |
2 | 17 | 0.8% |
3 | 15 | 0.7% |
4 | 19 | 0.8% |
5 | 18 | 0.8% |
6 | 11 | 0.5% |
7 | 8 | 0.4% |
8 | 4 | 0.2% |
9 | 13 | 0.6% |
Value | Count | Frequency (%) |
35134 | 1 | |
32532 | 1 | |
30401 | 1 | |
28992 | 1 | |
28391 | 1 | |
27076 | 1 | |
26049 | 1 | |
25458 | 1 | |
23861 | 1 | |
23075 | 1 |
중립문서개수
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 17.8 KiB |
0 | |
---|---|
<NA> | 1 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0013274 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 2259 | |
<NA> | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 2259 | |
na | 1 | < 0.1% |
부정문서개수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 278 |
---|---|
Distinct (%) | 12.3% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 76.638336 |
Minimum | 1 |
---|---|
Maximum | 4684 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 4 |
median | 12 |
Q3 | 40 |
95-th percentile | 244.3 |
Maximum | 4684 |
Range | 4683 |
Interquartile range (IQR) | 36 |
Descriptive statistics
Standard deviation | 347.86786 |
---|---|
Coefficient of variation (CV) | 4.5390843 |
Kurtosis | 116.69697 |
Mean | 76.638336 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 10.270848 |
Sum | 173126 |
Variance | 121012.05 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 250 | 11.1% |
2 | 169 | 7.5% |
3 | 125 | 5.5% |
4 | 95 | 4.2% |
7 | 78 | 3.5% |
6 | 75 | 3.3% |
5 | 72 | 3.2% |
8 | 65 | 2.9% |
10 | 58 | 2.6% |
11 | 55 | 2.4% |
Other values (268) | 1217 |
Value | Count | Frequency (%) |
1 | 250 | |
2 | 169 | |
3 | 125 | |
4 | 95 | 4.2% |
5 | 72 | 3.2% |
6 | 75 | 3.3% |
7 | 78 | 3.5% |
8 | 65 | 2.9% |
9 | 54 | 2.4% |
10 | 58 | 2.6% |
Value | Count | Frequency (%) |
4684 | 1 | |
4659 | 1 | |
4625 | 1 | |
4586 | 1 | |
4501 | 1 | |
4388 | 1 | |
4224 | 1 | |
4015 | 1 | |
3839 | 1 | |
3718 | 1 |
이전전체문서개수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 403 |
---|---|
Distinct (%) | 17.8% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 247.14343 |
Minimum | 0 |
---|---|
Maximum | 31250 |
Zeros | 172 |
Zeros (%) | 7.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 5 |
median | 17 |
Q3 | 69 |
95-th percentile | 518.2 |
Maximum | 31250 |
Range | 31250 |
Interquartile range (IQR) | 64 |
Descriptive statistics
Standard deviation | 1774.8713 |
---|---|
Coefficient of variation (CV) | 7.1815435 |
Kurtosis | 162.94576 |
Mean | 247.14343 |
Median Absolute Deviation (MAD) | 16 |
Skewness | 12.253493 |
Sum | 558297 |
Variance | 3150168 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 172 | 7.6% |
1 | 133 | 5.9% |
2 | 110 | 4.9% |
3 | 75 | 3.3% |
7 | 74 | 3.3% |
4 | 68 | 3.0% |
6 | 62 | 2.7% |
5 | 58 | 2.6% |
9 | 56 | 2.5% |
11 | 50 | 2.2% |
Other values (393) | 1401 |
Value | Count | Frequency (%) |
0 | 172 | |
1 | 133 | |
2 | 110 | |
3 | 75 | |
4 | 68 | 3.0% |
5 | 58 | 2.6% |
6 | 62 | 2.7% |
7 | 74 | |
8 | 50 | 2.2% |
9 | 56 | 2.5% |
Value | Count | Frequency (%) |
31250 | 1 | |
28614 | 1 | |
25904 | 1 | |
24919 | 1 | |
23415 | 1 | |
22895 | 1 | |
20842 | 1 | |
20425 | 1 | |
18858 | 1 | |
17448 | 1 |
이전긍정문서개수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 266 |
---|---|
Distinct (%) | 11.8% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 109.6857 |
Minimum | 0 |
---|---|
Maximum | 21896 |
Zeros | 1460 |
Zeros (%) | 64.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 11 |
95-th percentile | 248.8 |
Maximum | 21896 |
Range | 21896 |
Interquartile range (IQR) | 11 |
Descriptive statistics
Standard deviation | 1011.47 |
---|---|
Coefficient of variation (CV) | 9.2215298 |
Kurtosis | 262.84606 |
Mean | 109.6857 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 15.476554 |
Sum | 247780 |
Variance | 1023071.5 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1460 | |
1 | 42 | 1.9% |
5 | 32 | 1.4% |
3 | 29 | 1.3% |
2 | 28 | 1.2% |
6 | 18 | 0.8% |
4 | 18 | 0.8% |
10 | 17 | 0.8% |
8 | 17 | 0.8% |
30 | 15 | 0.7% |
Other values (256) | 583 | 25.8% |
Value | Count | Frequency (%) |
0 | 1460 | |
1 | 42 | 1.9% |
2 | 28 | 1.2% |
3 | 29 | 1.3% |
4 | 18 | 0.8% |
5 | 32 | 1.4% |
6 | 18 | 0.8% |
7 | 13 | 0.6% |
8 | 17 | 0.8% |
9 | 12 | 0.5% |
Value | Count | Frequency (%) |
21896 | 1 | |
18607 | 1 | |
17864 | 1 | |
15653 | 1 | |
15280 | 1 | |
12943 | 1 | |
12909 | 1 | |
10790 | 1 | |
7320 | 1 | |
6151 | 1 |
이전중립문서개수
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 17.8 KiB |
0 | |
---|---|
<NA> | 1 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0013274 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 2259 | |
<NA> | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 2259 | |
na | 1 | < 0.1% |
이전부정문서개수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 311 |
---|---|
Distinct (%) | 13.8% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 138.08101 |
Minimum | 1 |
---|---|
Maximum | 13105 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 20.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 4 |
median | 12 |
Q3 | 48 |
95-th percentile | 272.3 |
Maximum | 13105 |
Range | 13104 |
Interquartile range (IQR) | 44 |
Descriptive statistics
Standard deviation | 861.79658 |
---|---|
Coefficient of variation (CV) | 6.241239 |
Kurtosis | 115.46243 |
Mean | 138.08101 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 10.424376 |
Sum | 311925 |
Variance | 742693.35 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 206 | 9.1% |
2 | 162 | 7.2% |
3 | 126 | 5.6% |
4 | 114 | 5.0% |
5 | 90 | 4.0% |
8 | 78 | 3.5% |
7 | 76 | 3.4% |
6 | 62 | 2.7% |
10 | 61 | 2.7% |
12 | 57 | 2.5% |
Other values (301) | 1227 |
Value | Count | Frequency (%) |
1 | 206 | |
2 | 162 | |
3 | 126 | |
4 | 114 | |
5 | 90 | |
6 | 62 | 2.7% |
7 | 76 | 3.4% |
8 | 78 | 3.5% |
9 | 53 | 2.3% |
10 | 61 | 2.7% |
Value | Count | Frequency (%) |
13105 | 1 | |
11904 | 1 | |
10472 | 1 | |
10418 | 1 | |
10251 | 1 | |
10200 | 1 | |
10007 | 1 | |
9387 | 1 | |
9354 | 1 | |
8496 | 1 |
기준일자 | 주식시장명 | 감성점수값 | 감성레벨값 | 전체문서개수 | 긍정문서개수 | 부정문서개수 | 이전전체문서개수 | 이전긍정문서개수 | 이전부정문서개수 | |
---|---|---|---|---|---|---|---|---|---|---|
기준일자 | 1.000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
주식시장명 | NaN | 1.000 | 0.348 | 0.323 | 0.684 | 0.661 | 0.276 | 0.708 | 0.708 | 0.867 |
감성점수값 | NaN | 0.348 | 1.000 | 0.617 | 0.263 | 0.252 | 0.172 | 0.402 | 0.311 | 0.313 |
감성레벨값 | NaN | 0.323 | 0.617 | 1.000 | 0.244 | 0.249 | 0.179 | 0.315 | 0.375 | 0.197 |
전체문서개수 | NaN | 0.684 | 0.263 | 0.244 | 1.000 | 1.000 | 0.631 | 0.940 | 0.912 | 0.792 |
긍정문서개수 | NaN | 0.661 | 0.252 | 0.249 | 1.000 | 1.000 | 0.000 | 0.945 | 0.927 | 0.770 |
부정문서개수 | NaN | 0.276 | 0.172 | 0.179 | 0.631 | 0.000 | 1.000 | 0.592 | 0.000 | 0.921 |
이전전체문서개수 | NaN | 0.708 | 0.402 | 0.315 | 0.940 | 0.945 | 0.592 | 1.000 | 0.978 | 0.880 |
이전긍정문서개수 | NaN | 0.708 | 0.311 | 0.375 | 0.912 | 0.927 | 0.000 | 0.978 | 1.000 | 0.815 |
이전부정문서개수 | NaN | 0.867 | 0.313 | 0.197 | 0.792 | 0.770 | 0.921 | 0.880 | 0.815 | 1.000 |
중립문서개수 | 분석일시 | 기준일자 | 주식시장명 | 이전중립문서개수 | |
---|---|---|---|---|---|
중립문서개수 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
분석일시 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
기준일자 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
주식시장명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
이전중립문서개수 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
감성점수값 | 감성레벨값 | 전체문서개수 | 긍정문서개수 | 부정문서개수 | 이전전체문서개수 | 이전긍정문서개수 | 이전부정문서개수 | 기준일자 | 주식시장명 | 분석일시 | 중립문서개수 | 이전중립문서개수 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
감성점수값 | 1.000 | 0.617 | 0.556 | 0.946 | 0.177 | 0.524 | 0.646 | 0.404 | 1.000 | 0.223 | 1.000 | 1.000 | 1.000 |
감성레벨값 | 0.617 | 1.000 | 0.522 | 0.640 | 0.319 | 0.587 | 0.946 | 0.263 | 1.000 | 0.204 | 1.000 | 1.000 | 1.000 |
전체문서개수 | 0.556 | 0.522 | 1.000 | 0.636 | 0.863 | 0.913 | 0.601 | 0.814 | 1.000 | 0.565 | 1.000 | 1.000 | 1.000 |
긍정문서개수 | 0.946 | 0.640 | 0.636 | 1.000 | 0.305 | 0.598 | 0.708 | 0.469 | 1.000 | 0.565 | 1.000 | 1.000 | 1.000 |
부정문서개수 | 0.177 | 0.319 | 0.863 | 0.305 | 1.000 | 0.775 | 0.386 | 0.721 | 1.000 | 0.126 | 1.000 | 1.000 | 1.000 |
이전전체문서개수 | 0.524 | 0.587 | 0.913 | 0.598 | 0.775 | 1.000 | 0.668 | 0.887 | 1.000 | 0.567 | 1.000 | 1.000 | 1.000 |
이전긍정문서개수 | 0.646 | 0.946 | 0.601 | 0.708 | 0.386 | 0.668 | 1.000 | 0.392 | 1.000 | 0.566 | 1.000 | 1.000 | 1.000 |
이전부정문서개수 | 0.404 | 0.263 | 0.814 | 0.469 | 0.721 | 0.887 | 0.392 | 1.000 | 1.000 | 0.587 | 1.000 | 1.000 | 1.000 |
기준일자 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
주식시장명 | 0.223 | 0.204 | 0.565 | 0.565 | 0.126 | 0.567 | 0.566 | 0.587 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
분석일시 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
중립문서개수 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
이전중립문서개수 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
기준일자 | 주식종목값 | 주식종목명 | 주식시장명 | 분석일시 | 감성점수값 | 감성레벨값 | 전체문서개수 | 긍정문서개수 | 중립문서개수 | 부정문서개수 | 이전전체문서개수 | 이전긍정문서개수 | 이전중립문서개수 | 이전부정문서개수 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 20211022 | TOTAL | TOTAL | TOTAL | 202110 | 98.63 | 35.09 | 18097 | 17849 | 0 | 248 | 11843 | 4155 | 0 | 7688 |
1 | 20211022 | KOSPI | KOSPI | KOSPI | 202110 | 99.41 | 36.36 | 16041 | 15946 | 0 | 95 | 9816 | 3569 | 0 | 6247 |
2 | 20211022 | KOSDAQ | KOSDAQ | KOSDAQ | 202110 | 28.78 | 18.84 | 2056 | 591 | 0 | 1465 | 2027 | 381 | 0 | 1646 |
3 | 20211022 | KR7034830000 | 한국토지신탁 | KOSPI | 202110 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 |
4 | 20211022 | KR7026960005 | 동서 | KOSPI | 202110 | 63.26 | 54.64 | 15 | 9 | 0 | 6 | 25 | 13 | 0 | 12 |
5 | 20211022 | KR7035720002 | 카카오 | KOSPI | 202110 | 100.0 | 94.98 | 51 | 51 | 0 | 1 | 228 | 216 | 0 | 12 |
6 | 20211022 | KR7000080002 | 하이트진로 | KOSPI | 202110 | 0.0 | 46.78 | 5 | 0 | 0 | 6 | 12 | 5 | 0 | 7 |
7 | 20211022 | KR7000100008 | 유한양행 | KOSPI | 202110 | 0.0 | 0.0 | 10 | 0 | 0 | 11 | 8 | 0 | 0 | 9 |
8 | 20211022 | KR7000120006 | CJ대한통운 | KOSPI | 202110 | 0.0 | 0.0 | 120 | 0 | 0 | 121 | 17 | 0 | 0 | 18 |
9 | 20211022 | KR7000140004 | 하이트진로홀딩스 | KOSPI | 202110 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 |
기준일자 | 주식종목값 | 주식종목명 | 주식시장명 | 분석일시 | 감성점수값 | 감성레벨값 | 전체문서개수 | 긍정문서개수 | 중립문서개수 | 부정문서개수 | 이전전체문서개수 | 이전긍정문서개수 | 이전중립문서개수 | 이전부정문서개수 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2250 | 20211022 | KR7091700005 | 파트론 | KOSDAQ | 202110 | 0.0 | 0.0 | 21 | 0 | 0 | 22 | 17 | 0 | 0 | 18 |
2251 | 20211022 | KR7096530001 | 씨젠 | KOSDAQ | 202110 | 52.88 | 81.95 | 546 | 288 | 0 | 258 | 397 | 325 | 0 | 72 |
2252 | 20211022 | KR7102940004 | 코오롱생명과학 | KOSDAQ | 202110 | 0.0 | 0.0 | 21 | 0 | 0 | 22 | 9 | 0 | 0 | 10 |
2253 | 20211022 | KR7108790007 | 인터파크 | KOSDAQ | 202110 | 0.0 | 0.0 | 79 | 0 | 0 | 80 | 9 | 0 | 0 | 10 |
2254 | 20211022 | KR7112040001 | 위메이드 | KOSDAQ | 202110 | 0.0 | 24.05 | 1226 | 0 | 0 | 1227 | 1234 | 296 | 0 | 938 |
2255 | 20211022 | KR7122870009 | 와이지엔터테인먼트 | KOSDAQ | 202110 | 69.11 | 0.0 | 100 | 69 | 0 | 31 | 97 | 0 | 0 | 98 |
2256 | 20211022 | KR7007390008 | 네이처셀 | KOSDAQ | 202110 | 0.0 | 0.0 | 307 | 0 | 0 | 308 | 317 | 0 | 0 | 318 |
2257 | 20211022 | KR7108230004 | 톱텍 | KOSDAQ | 202110 | 0.0 | 0.0 | 51 | 0 | 0 | 52 | 30 | 0 | 0 | 31 |
2258 | 20211022 | KR7215600008 | 신라젠 | KOSDAQ | 202110 | 0.0 | 0.0 | 25 | 0 | 0 | 26 | 29 | 0 | 0 | 30 |
2259 | 2259 217857 20211023024014 F_BBP20_00044 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
기준일자 | 주식종목값 | 주식종목명 | 주식시장명 | 분석일시 | 감성점수값 | 감성레벨값 | 전체문서개수 | 긍정문서개수 | 중립문서개수 | 부정문서개수 | 이전전체문서개수 | 이전긍정문서개수 | 이전중립문서개수 | 이전부정문서개수 | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
6 | 20211022 | KR7000140004 | 하이트진로홀딩스 | KOSPI | 202110 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 9 |
10 | 20211022 | KR7000480004 | 조선내화 | KOSPI | 202110 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 2 | 9 |
26 | 20211022 | KR7003240009 | 태광산업 | KOSPI | 202110 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 2 | 9 |
27 | 20211022 | KR7003300001 | 한일시멘트 | KOSPI | 202110 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 9 |
39 | 20211022 | KR7004700001 | 조광피혁 | KOSPI | 202110 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 9 |
46 | 20211022 | KR7005830005 | DB손해보험 | KOSPI | 202110 | 0.0 | 0.0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 9 |
80 | 20211022 | KR7016100000 | 산성앨엔에스 | KOSDAQ | 202110 | 0.0 | 0.0 | 1 | 0 | 0 | 2 | 1 | 0 | 0 | 2 | 9 |
134 | 20211022 | KR7084870005 | TBH글로벌 | KOSPI | 202110 | 0.0 | 0.0 | 1 | 0 | 0 | 2 | 0 | 0 | 0 | 1 | 9 |
141 | 20211022 | KR7108790007 | 인터파크 | KOSDAQ | 202110 | 0.0 | 0.0 | 53 | 0 | 0 | 54 | 9 | 0 | 0 | 10 | 8 |
12 | 20211022 | KR7000670000 | 영풍 | KOSPI | 202110 | 0.0 | 0.0 | 1 | 0 | 0 | 2 | 0 | 0 | 0 | 1 | 7 |