Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 400 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 20.1 KiB |
Average record size in memory | 51.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 1 |
DateTime | 1 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 코난테크놀로지 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TPONATION |
Reproduction
Analysis started | 2023-12-10 06:22:27.865864 |
---|---|
Analysis finished | 2023-12-10 06:22:30.518602 |
Duration | 2.65 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
"기본키값"
Real number (ℝ)
UNIQUE
 
Distinct | 400 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26846.86 |
Minimum | 15922 |
---|---|
Maximum | 47861 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 15922 |
---|---|
5-th percentile | 15941.95 |
Q1 | 16021.75 |
median | 31895.5 |
Q3 | 31995.25 |
95-th percentile | 47841.05 |
Maximum | 47861 |
Range | 31939 |
Interquartile range (IQR) | 15973.5 |
Descriptive statistics
Standard deviation | 10786.31 |
---|---|
Coefficient of variation (CV) | 0.40177174 |
Kurtosis | -0.79218373 |
Mean | 26846.86 |
Median Absolute Deviation (MAD) | 15822 |
Skewness | 0.48860499 |
Sum | 10738744 |
Variance | 1.1634447 × 108 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
15922 | 1 | 0.2% |
31960 | 1 | 0.2% |
31970 | 1 | 0.2% |
31969 | 1 | 0.2% |
31968 | 1 | 0.2% |
31967 | 1 | 0.2% |
31966 | 1 | 0.2% |
31965 | 1 | 0.2% |
31964 | 1 | 0.2% |
31963 | 1 | 0.2% |
Other values (390) | 390 |
Value | Count | Frequency (%) |
15922 | 1 | |
15923 | 1 | |
15924 | 1 | |
15925 | 1 | |
15926 | 1 | |
15927 | 1 | |
15928 | 1 | |
15929 | 1 | |
15930 | 1 | |
15931 | 1 |
Value | Count | Frequency (%) |
47861 | 1 | |
47860 | 1 | |
47859 | 1 | |
47858 | 1 | |
47857 | 1 | |
47856 | 1 | |
47855 | 1 | |
47854 | 1 | |
47853 | 1 | |
47852 | 1 |
"채널값"
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
"블로그" |
---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | "블로그" |
---|---|
2nd row | "블로그" |
3rd row | "블로그" |
4th row | "블로그" |
5th row | "블로그" |
Common Values
Value | Count | Frequency (%) |
"블로그" | 400 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
블로그 | 400 |
"해당일자"
Date
Distinct | 3 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Minimum | 2020-05-01 00:00:00 |
---|---|
Maximum | 2020-05-03 00:00:00 |
"차례값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 176 |
---|---|
Distinct (%) | 44.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 80.82 |
Minimum | 1 |
---|---|
Maximum | 176 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 7 |
Q1 | 34 |
median | 76.5 |
Q3 | 126.25 |
95-th percentile | 166.05 |
Maximum | 176 |
Range | 175 |
Interquartile range (IQR) | 92.25 |
Descriptive statistics
Standard deviation | 52.286732 |
---|---|
Coefficient of variation (CV) | 0.64695288 |
Kurtosis | -1.2506889 |
Mean | 80.82 |
Median Absolute Deviation (MAD) | 45.5 |
Skewness | 0.19316505 |
Sum | 32328 |
Variance | 2733.9024 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3 | 0.8% |
26 | 3 | 0.8% |
28 | 3 | 0.8% |
29 | 3 | 0.8% |
30 | 3 | 0.8% |
31 | 3 | 0.8% |
32 | 3 | 0.8% |
33 | 3 | 0.8% |
34 | 3 | 0.8% |
35 | 3 | 0.8% |
Other values (166) | 370 |
Value | Count | Frequency (%) |
1 | 3 | |
2 | 3 | |
3 | 3 | |
4 | 3 | |
5 | 3 | |
6 | 3 | |
7 | 3 | |
8 | 3 | |
9 | 3 | |
10 | 3 |
Value | Count | Frequency (%) |
176 | 2 | |
175 | 2 | |
174 | 2 | |
173 | 2 | |
172 | 2 | |
171 | 2 | |
170 | 2 | |
169 | 2 | |
168 | 2 | |
167 | 2 |
"이슈어값"
Text
Distinct | 176 |
---|---|
Distinct (%) | 44.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Value | Count | Frequency (%) |
미국 | 3 | 0.8% |
파키스탄 | 3 | 0.8% |
스웨덴 | 3 | 0.8% |
멕시코 | 3 | 0.8% |
덴마크 | 3 | 0.8% |
이집트 | 3 | 0.8% |
노르웨이 | 3 | 0.8% |
오스트리아 | 3 | 0.8% |
벨기에 | 3 | 0.8% |
포르투갈 | 3 | 0.8% |
Other values (166) | 370 |
Most occurring characters
Value | Count | Frequency (%) |
" | 800 | |
아 | 113 | 4.8% |
스 | 63 | 2.7% |
리 | 59 | 2.5% |
이 | 45 | 1.9% |
르 | 44 | 1.9% |
라 | 44 | 1.9% |
니 | 42 | 1.8% |
나 | 30 | 1.3% |
바 | 30 | 1.3% |
Other values (160) | 1093 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1563 | |
Other Punctuation | 800 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 113 | 7.2% |
스 | 63 | 4.0% |
리 | 59 | 3.8% |
이 | 45 | 2.9% |
르 | 44 | 2.8% |
라 | 44 | 2.8% |
니 | 42 | 2.7% |
나 | 30 | 1.9% |
바 | 30 | 1.9% |
트 | 29 | 1.9% |
Other values (159) | 1064 |
Other Punctuation
Value | Count | Frequency (%) |
" | 800 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1563 | |
Common | 800 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 113 | 7.2% |
스 | 63 | 4.0% |
리 | 59 | 3.8% |
이 | 45 | 2.9% |
르 | 44 | 2.8% |
라 | 44 | 2.8% |
니 | 42 | 2.7% |
나 | 30 | 1.9% |
바 | 30 | 1.9% |
트 | 29 | 1.9% |
Other values (159) | 1064 |
Common
Value | Count | Frequency (%) |
" | 800 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1563 | |
ASCII | 800 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
" | 800 |
Hangul
Value | Count | Frequency (%) |
아 | 113 | 7.2% |
스 | 63 | 4.0% |
리 | 59 | 3.8% |
이 | 45 | 2.9% |
르 | 44 | 2.8% |
라 | 44 | 2.8% |
니 | 42 | 2.7% |
나 | 30 | 1.9% |
바 | 30 | 1.9% |
트 | 29 | 1.9% |
Other values (159) | 1064 |
"건수값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 219 |
---|---|
Distinct (%) | 54.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 483.0675 |
Minimum | 1 |
---|---|
Maximum | 12675 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 4 |
Q1 | 15.75 |
median | 52 |
Q3 | 227.25 |
95-th percentile | 1853.5 |
Maximum | 12675 |
Range | 12674 |
Interquartile range (IQR) | 211.5 |
Descriptive statistics
Standard deviation | 1620.5171 |
---|---|
Coefficient of variation (CV) | 3.3546391 |
Kurtosis | 37.615775 |
Mean | 483.0675 |
Median Absolute Deviation (MAD) | 44 |
Skewness | 5.9215354 |
Sum | 193227 |
Variance | 2626075.8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
15 | 12 | 3.0% |
16 | 11 | 2.8% |
3 | 10 | 2.5% |
7 | 9 | 2.2% |
9 | 8 | 2.0% |
10 | 8 | 2.0% |
25 | 8 | 2.0% |
6 | 8 | 2.0% |
4 | 6 | 1.5% |
8 | 6 | 1.5% |
Other values (209) | 314 |
Value | Count | Frequency (%) |
1 | 2 | 0.5% |
2 | 6 | |
3 | 10 | |
4 | 6 | |
5 | 4 | 1.0% |
6 | 8 | |
7 | 9 | |
8 | 6 | |
9 | 8 | |
10 | 8 |
Value | Count | Frequency (%) |
12675 | 1 | |
12092 | 1 | |
11986 | 1 | |
11898 | 1 | |
11753 | 1 | |
11454 | 1 | |
6919 | 1 | |
6389 | 1 | |
6333 | 1 | |
4244 | 1 |
"기본키값" | "해당일자" | "차례값" | "건수값" | |
---|---|---|---|---|
"기본키값" | 1.000 | 1.000 | 0.599 | 0.367 |
"해당일자" | 1.000 | 1.000 | 0.447 | 0.135 |
"차례값" | 0.599 | 0.447 | 1.000 | 0.472 |
"건수값" | 0.367 | 0.135 | 0.472 | 1.000 |
"기본키값" | "차례값" | "건수값" | |
---|---|---|---|
"기본키값" | 1.000 | 0.147 | -0.127 |
"차례값" | 0.147 | 1.000 | -0.999 |
"건수값" | -0.127 | -0.999 | 1.000 |
"기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값" | |
---|---|---|---|---|---|---|
0 | 15922 | "블로그" | 2020-05-01 | 1 | "미국" | 12675 |
1 | 15923 | "블로그" | 2020-05-01 | 2 | "중국" | 12092 |
2 | 15924 | "블로그" | 2020-05-01 | 3 | "일본" | 6919 |
3 | 15925 | "블로그" | 2020-05-01 | 4 | "영국" | 4142 |
4 | 15926 | "블로그" | 2020-05-01 | 5 | "독일" | 2686 |
5 | 15927 | "블로그" | 2020-05-01 | 6 | "프랑스" | 2268 |
6 | 15928 | "블로그" | 2020-05-01 | 7 | "이탈리아" | 2010 |
7 | 15929 | "블로그" | 2020-05-01 | 8 | "베트남" | 1831 |
8 | 15930 | "블로그" | 2020-05-01 | 9 | "캐나다" | 1762 |
9 | 15931 | "블로그" | 2020-05-01 | 10 | "북한" | 1550 |
"기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값" | |
---|---|---|---|---|---|---|
390 | 47852 | "블로그" | 2020-05-03 | 39 | "아르헨티나" | 166 |
391 | 47853 | "블로그" | 2020-05-03 | 40 | "핀란드" | 162 |
392 | 47854 | "블로그" | 2020-05-03 | 41 | "미얀마" | 147 |
393 | 47855 | "블로그" | 2020-05-03 | 42 | "방글라데시" | 136 |
394 | 47856 | "블로그" | 2020-05-03 | 43 | "칠레" | 135 |
395 | 47857 | "블로그" | 2020-05-03 | 44 | "사우디아라비아" | 126 |
396 | 47858 | "블로그" | 2020-05-03 | 45 | "헝가리" | 122 |
397 | 47859 | "블로그" | 2020-05-03 | 46 | "캄보디아" | 120 |
398 | 47860 | "블로그" | 2020-05-03 | 47 | "에티오피아" | 107 |
399 | 47861 | "블로그" | 2020-05-03 | 48 | "파키스탄" | 102 |