Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 400 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 16.9 KiB |
Average record size in memory | 43.3 B |
Variable types
Numeric | 3 |
---|---|
DateTime | 1 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 코난테크놀로지 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TOPICINSTT |
Reproduction
Analysis started | 2023-12-10 06:35:39.382445 |
---|---|
Analysis finished | 2023-12-10 06:35:41.651034 |
Duration | 2.27 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
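The reported duration can be cross-checked against the two timestamps in the Reproduction table. The following is a minimal sketch using only Python's standard library; the variable names are illustrative and do not come from the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2023-12-10 06:35:39.382445")
finished = datetime.fromisoformat("2023-12-10 06:35:41.651034")

# Elapsed wall-clock time in seconds, printed with two decimals as in the report.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```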
"기본키값"
Real number (ℝ)
UNIQUE
 
Distinct | 400 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 65020.5 |
Minimum | 64821 |
---|---|
Maximum | 65220 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 64821 |
---|---|
5-th percentile | 64840.95 |
Q1 | 64920.75 |
median | 65020.5 |
Q3 | 65120.25 |
95-th percentile | 65200.05 |
Maximum | 65220 |
Range | 399 |
Interquartile range (IQR) | 199.5 |
Descriptive statistics
Standard deviation | 115.6143 |
---|---|
Coefficient of variation (CV) | 0.0017781208 |
Kurtosis | -1.2 |
Mean | 65020.5 |
Median Absolute Deviation (MAD) | 100 |
Skewness | 0 |
Sum | 26008200 |
Variance | 13366.667 |
Monotonicity | Strictly increasing |
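The figures for "기본키값" are consistent with a column holding the 400 consecutive integers from 64821 to 65220 (400 distinct values, range 399, strictly increasing). Under that assumption, which the report itself does not state, the statistics can be re-derived with the standard library. The sketch also assumes the sample (n - 1) variance and linear quantile interpolation, since those choices match the reported numbers.

```python
import statistics

# Assumption: every integer from the reported minimum to the reported
# maximum occurs exactly once (400 distinct, strictly increasing values).
values = [float(v) for v in range(64821, 65221)]

total = sum(values)
mean = statistics.fmean(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std_dev = statistics.stdev(values)      # square root of the sample variance
cv = std_dev / mean                     # coefficient of variation

# method="inclusive" interpolates linearly between the minimum and maximum.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

# Median absolute deviation, measured around the median.
mad = statistics.median(abs(v - median) for v in values)

print(mean, round(variance, 3), round(std_dev, 4), q1, median, q3, iqr, mad)
```

If the column contained gaps or repeated keys, the reported range of 399 with 400 distinct values could not hold, which is why the consecutive-integer assumption is a reasonable reading of the table.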
Common values
Value | Count | Frequency (%) |
64821 | 1 | 0.2% |
65085 | 1 | 0.2% |
65095 | 1 | 0.2% |
65094 | 1 | 0.2% |
65093 | 1 | 0.2% |
65092 | 1 | 0.2% |
65091 | 1 | 0.2% |
65090 | 1 | 0.2% |
65089 | 1 | 0.2% |
65088 | 1 | 0.2% |
Other values (390) | 390 | 97.5% |
Minimum 10 values
Value | Count | Frequency (%) |
64821 | 1 | 0.2% |
64822 | 1 | 0.2% |
64823 | 1 | 0.2% |
64824 | 1 | 0.2% |
64825 | 1 | 0.2% |
64826 | 1 | 0.2% |
64827 | 1 | 0.2% |
64828 | 1 | 0.2% |
64829 | 1 | 0.2% |
64830 | 1 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
65220 | 1 | 0.2% |
65219 | 1 | 0.2% |
65218 | 1 | 0.2% |
65217 | 1 | 0.2% |
65216 | 1 | 0.2% |
65215 | 1 | 0.2% |
65214 | 1 | 0.2% |
65213 | 1 | 0.2% |
65212 | 1 | 0.2% |
65211 | 1 | 0.2% |
"해당일시"
Date
Distinct | 40 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Minimum | 2020-10-01 00:00:00 |
---|---|
Maximum | 2020-10-02 15:00:00 |
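Forty distinct timestamps across a 39-hour span are consistent with one timestamp per hour, although the report does not state the sampling interval. The sketch below enumerates the hours between the reported minimum and maximum under that assumption and derives how many rows each timestamp would then carry.

```python
from datetime import datetime, timedelta

# Bounds copied from the "해당일시" summary.
first = datetime(2020, 10, 1, 0, 0)
last = datetime(2020, 10, 2, 15, 0)

# Assumption: observations fall on whole hours between the two bounds.
step = timedelta(hours=1)
timestamps = []
current = first
while current <= last:
    timestamps.append(current)
    current += step

# 400 observations spread evenly over the hourly timestamps.
rows_per_timestamp = 400 / len(timestamps)
print(len(timestamps), rows_per_timestamp)
```

Ten rows per timestamp would agree with "차례값" taking the values 1 to 10 exactly 40 times each.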
"차례값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 10 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.5 |
Minimum | 1 |
---|---|
Maximum | 10 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5.5 |
Q3 | 8 |
95-th percentile | 10 |
Maximum | 10 |
Range | 9 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 2.8758784 |
---|---|
Coefficient of variation (CV) | 0.52288699 |
Kurtosis | -1.224533 |
Mean | 5.5 |
Median Absolute Deviation (MAD) | 2.5 |
Skewness | 0 |
Sum | 2200 |
Variance | 8.2706767 |
Monotonicity | Not monotonic |
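The variance of 8.2706767 reported for "차례값" is slightly above 8.25, the population variance of the values 1 to 10. The gap is what a sample (n - 1) denominator produces. The sketch below rebuilds the column from the value counts in this report (each value from 1 to 10 occurring 40 times) and compares both definitions; treating the report's figure as the sample variance is an inference from the numbers, not something the report states.

```python
import statistics

# Each value 1..10 occurs 40 times, per the value counts in this report.
ranks = [float(r) for r in range(1, 11)] * 40

population_variance = statistics.pvariance(ranks)  # divides by n
sample_variance = statistics.variance(ranks)       # divides by n - 1
sample_std = statistics.stdev(ranks)
cv = sample_std / statistics.fmean(ranks)          # coefficient of variation

print(population_variance, round(sample_variance, 7), round(sample_std, 7))
```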
Common values
Value | Count | Frequency (%) |
1 | 40 | 10.0% |
2 | 40 | 10.0% |
3 | 40 | 10.0% |
4 | 40 | 10.0% |
5 | 40 | 10.0% |
6 | 40 | 10.0% |
7 | 40 | 10.0% |
8 | 40 | 10.0% |
9 | 40 | 10.0% |
10 | 40 | 10.0% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 40 | 10.0% |
2 | 40 | 10.0% |
3 | 40 | 10.0% |
4 | 40 | 10.0% |
5 | 40 | 10.0% |
6 | 40 | 10.0% |
7 | 40 | 10.0% |
8 | 40 | 10.0% |
9 | 40 | 10.0% |
10 | 40 | 10.0% |
Maximum 10 values
Value | Count | Frequency (%) |
10 | 40 | 10.0% |
9 | 40 | 10.0% |
8 | 40 | 10.0% |
7 | 40 | 10.0% |
6 | 40 | 10.0% |
5 | 40 | 10.0% |
4 | 40 | 10.0% |
3 | 40 | 10.0% |
2 | 40 | 10.0% |
1 | 40 | 10.0% |
"이슈어값"
Text
Distinct | 165 |
---|---|
Distinct (%) | 41.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Value | Count | Frequency (%) |
네이버 | 13 | 3.2% |
ytn | 12 | 3.0% |
청와대 | 12 | 3.0% |
국회 | 11 | 2.8% |
애플 | 10 | 2.5% |
sbs | 9 | 2.2% |
kbs | 9 | 2.2% |
롯데 | 8 | 2.0% |
중앙일보 | 8 | 2.0% |
더불어민주당 | 7 | 1.8% |
Other values (155) | 301 |
Most occurring characters
Value | Count | Frequency (%) |
이 | 47 | 3.1% |
일 | 45 | 3.0% |
스 | 43 | 2.9% |
국 | 37 | 2.5% |
보 | 37 | 2.5% |
s | 33 | 2.2% |
부 | 29 | 1.9% |
b | 29 | 1.9% |
원 | 25 | 1.7% |
뉴 | 24 | 1.6% |
Other values (226) | 1159 | 76.9% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1341 | 88.9% |
Lowercase Letter | 157 | 10.4% |
Decimal Number | 10 | 0.7% |
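The frequency column is each count divided by the total number of characters. As a small worked example, the sketch below recomputes the shares for the three Unicode categories from their counts; the dictionary and variable names are illustrative only.

```python
# Character counts per Unicode category, copied from the table above.
category_counts = {
    "Other Letter": 1341,
    "Lowercase Letter": 157,
    "Decimal Number": 10,
}

total = sum(category_counts.values())

# Share of each category, formatted to one decimal place like the report.
shares = {
    name: f"{count / total * 100:.1f}%"
    for name, count in category_counts.items()
}

for name, share in shares.items():
    print(f"{name} | {category_counts[name]} | {share}")
```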
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 47 | 3.5% |
일 | 45 | 3.4% |
스 | 43 | 3.2% |
국 | 37 | 2.8% |
보 | 37 | 2.8% |
부 | 29 | 2.2% |
원 | 25 | 1.9% |
뉴 | 24 | 1.8% |
청 | 24 | 1.8% |
데 | 24 | 1.8% |
Other values (210) | 1006 | 75.0% |
Lowercase Letter
Value | Count | Frequency (%) |
s | 33 | 21.0% |
b | 29 | 18.5% |
t | 23 | 14.6% |
k | 17 | 10.8% |
n | 13 | 8.3% |
y | 12 | 7.6% |
c | 9 | 5.7% |
g | 5 | 3.2% |
m | 5 | 3.2% |
v | 5 | 3.2% |
Other values (3) | 6 | 3.8% |
Decimal Number
Value | Count | Frequency (%) |
1 | 8 | 80.0% |
2 | 1 | 10.0% |
4 | 1 | 10.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1341 | 88.9% |
Latin | 157 | 10.4% |
Common | 10 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 47 | 3.5% |
일 | 45 | 3.4% |
스 | 43 | 3.2% |
국 | 37 | 2.8% |
보 | 37 | 2.8% |
부 | 29 | 2.2% |
원 | 25 | 1.9% |
뉴 | 24 | 1.8% |
청 | 24 | 1.8% |
데 | 24 | 1.8% |
Other values (210) | 1006 | 75.0% |
Latin
Value | Count | Frequency (%) |
s | 33 | 21.0% |
b | 29 | 18.5% |
t | 23 | 14.6% |
k | 17 | 10.8% |
n | 13 | 8.3% |
y | 12 | 7.6% |
c | 9 | 5.7% |
g | 5 | 3.2% |
m | 5 | 3.2% |
v | 5 | 3.2% |
Other values (3) | 6 | 3.8% |
Common
Value | Count | Frequency (%) |
1 | 8 | 80.0% |
2 | 1 | 10.0% |
4 | 1 | 10.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1341 | 88.9% |
ASCII | 167 | 11.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
이 | 47 | 3.5% |
일 | 45 | 3.4% |
스 | 43 | 3.2% |
국 | 37 | 2.8% |
보 | 37 | 2.8% |
부 | 29 | 2.2% |
원 | 25 | 1.9% |
뉴 | 24 | 1.8% |
청 | 24 | 1.8% |
데 | 24 | 1.8% |
Other values (210) | 1006 | 75.0% |
ASCII
Value | Count | Frequency (%) |
s | 33 | 19.8% |
b | 29 | 17.4% |
t | 23 | 13.8% |
k | 17 | 10.2% |
n | 13 | 7.8% |
y | 12 | 7.2% |
c | 9 | 5.4% |
1 | 8 | 4.8% |
g | 5 | 3.0% |
m | 5 | 3.0% |
Other values (6) | 13 | 7.8% |
"건수값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 32 |
---|---|
Distinct (%) | 8.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.905 |
Minimum | 2 |
---|---|
Maximum | 85 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 8 |
95-th percentile | 17.05 |
Maximum | 85 |
Range | 83 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 7.6335872 |
---|---|
Coefficient of variation (CV) | 1.1055159 |
Kurtosis | 41.152899 |
Mean | 6.905 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 5.3769407 |
Sum | 2762 |
Variance | 58.271654 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
3 | 69 | |
4 | 63 | |
5 | 51 | |
2 | 47 | |
6 | 38 | |
7 | 30 | |
8 | 20 | 5.0% |
9 | 20 | 5.0% |
10 | 9 | 2.2% |
11 | 8 | 2.0% |
Other values (22) | 45 |
Minimum 10 values
Value | Count | Frequency (%) |
2 | 47 | |
3 | 69 | |
4 | 63 | |
5 | 51 | |
6 | 38 | |
7 | 30 | |
8 | 20 | 5.0% |
9 | 20 | 5.0% |
10 | 9 | 2.2% |
11 | 8 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
85 | 1 | 0.2% |
68 | 1 | 0.2% |
49 | 1 | 0.2% |
42 | 1 | 0.2% |
36 | 1 | 0.2% |
35 | 1 | 0.2% |
34 | 1 | 0.2% |
33 | 1 | 0.2% |
28 | 1 | 0.2% |
25 | 1 | 0.2% |
Correlations
 | "기본키값" | "해당일시" | "차례값" | "건수값" |
---|---|---|---|---|
"기본키값" | 1.000 | 1.000 | 0.000 | 0.124 |
"해당일시" | 1.000 | 1.000 | 0.000 | 0.252 |
"차례값" | 0.000 | 0.000 | 1.000 | 0.448 |
"건수값" | 0.124 | 0.252 | 0.448 | 1.000 |
 | "기본키값" | "차례값" | "건수값" |
---|---|---|---|
"기본키값" | 1.000 | 0.025 | 0.141 |
"차례값" | 0.025 | 1.000 | -0.674 |
"건수값" | 0.141 | -0.674 | 1.000 |
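The matrices above do not name the coefficient behind each one. As a hedged illustration of how a single pairwise entry can be obtained, the sketch below computes a Pearson coefficient between "차례값" and "건수값" on the ten first rows listed in this report's sample. Choosing Pearson is an assumption, and because only ten of the 400 rows are used, the result is not expected to reproduce the -0.674 in the table; it only shows the same negative direction.

```python
import math

# Ten rows copied from the first-rows sample of this report
# ("차례값" and "건수값" at 2020-10-01 00:00:00).
ranks = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
counts = [16, 7, 7, 6, 6, 5, 4, 4, 4, 3]


def pearson(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of cross-products and squared deviations; the common 1/n factor cancels.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


r = pearson(ranks, counts)
print(f"{r:.3f}")
```

A rank-based coefficient such as Spearman would be computed the same way after replacing each value with its rank, which tends to be less sensitive to a single large count like the 16 in the first row.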
First rows
 | "기본키값" | "해당일시" | "차례값" | "이슈어값" | "건수값" |
---|---|---|---|---|---|
0 | 64821 | 2020-10-01 00:00:00 | 1 | 기호일보 | 16 |
1 | 64822 | 2020-10-01 00:00:00 | 2 | 청와대 | 7 |
2 | 64823 | 2020-10-01 00:00:00 | 3 | 기초과학연구원 | 7 |
3 | 64824 | 2020-10-01 00:00:00 | 4 | 디즈니 | 6 |
4 | 64825 | 2020-10-01 00:00:00 | 5 | ytn | 6 |
5 | 64826 | 2020-10-01 00:00:00 | 6 | 울산과학기술원 | 5 |
6 | 64827 | 2020-10-01 00:00:00 | 7 | 국방부 | 4 |
7 | 64828 | 2020-10-01 00:00:00 | 8 | lg전자 | 4 |
8 | 64829 | 2020-10-01 00:00:00 | 9 | 롯데제과 | 4 |
9 | 64830 | 2020-10-01 00:00:00 | 10 | 두산베어스 | 3 |
Last rows
 | "기본키값" | "해당일시" | "차례값" | "이슈어값" | "건수값" |
---|---|---|---|---|---|
390 | 65211 | 2020-10-02 15:00:00 | 1 | 청와대 | 22 |
391 | 65212 | 2020-10-02 15:00:00 | 2 | 페르소나 | 6 |
392 | 65213 | 2020-10-02 15:00:00 | 3 | 서울신문 | 5 |
393 | 65214 | 2020-10-02 15:00:00 | 4 | jtbc | 5 |
394 | 65215 | 2020-10-02 15:00:00 | 5 | 중앙일보 | 4 |
395 | 65216 | 2020-10-02 15:00:00 | 6 | 구글 | 4 |
396 | 65217 | 2020-10-02 15:00:00 | 7 | 굿스마일 | 4 |
397 | 65218 | 2020-10-02 15:00:00 | 8 | 천지일보 | 4 |
398 | 65219 | 2020-10-02 15:00:00 | 9 | 헤럴드경제 | 3 |
399 | 65220 | 2020-10-02 15:00:00 | 10 | 산업통상자원부 | 3 |