Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 400 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 20.1 KiB |
Average record size in memory | 51.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 1 |
DateTime | 1 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 코난테크놀로지 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TPOOCCASION |
Reproduction
Analysis started | 2023-12-10 06:22:46.887983 |
---|---|
Analysis finished | 2023-12-10 06:22:49.019567 |
Duration | 2.13 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
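The reported duration can be checked from the two timestamps above. A minimal sketch using only the standard library; the variable names are illustrative and not part of the report:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-10 06:22:46.887983", FMT)
finished = datetime.strptime("2023-12-10 06:22:49.019567", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```

Rounded to two decimals, the result agrees with the Duration row.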
"기본키값"
Real number (ℝ)
UNIQUE
 
Distinct | 400 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20254.5 |
Minimum | 16467 |
---|---|
Maximum | 32466 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 16467 |
---|---|
5-th percentile | 16486.95 |
Q1 | 16566.75 |
median | 16666.5 |
Q3 | 16766.25 |
95-th percentile | 32446.05 |
Maximum | 32466 |
Range | 15999 |
Interquartile range (IQR) | 199.5 |
Descriptive statistics
Standard deviation | 6657.9512 |
---|---|
Coefficient of variation (CV) | 0.32871467 |
Kurtosis | -0.3432128 |
Mean | 20254.5 |
Median Absolute Deviation (MAD) | 100 |
Skewness | 1.2875265 |
Sum | 8101800 |
Variance | 44328314 |
Monotonicity | Strictly increasing |
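The statistics above are consistent with two runs of consecutive keys: 308 keys starting at 16467 and 92 keys ending at 32466. That layout is an inference from the reported numbers, not something the report states. Under that assumption, the following sketch recomputes the quantiles with linear interpolation (the default method in NumPy and pandas) along with the sum, mean and sample variance:

```python
# Assumed layout (inferred, not stated in the report): two runs of consecutive keys.
keys = list(range(16467, 16467 + 308)) + list(range(32466 - 91, 32466 + 1))

def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest order statistics."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])

n = len(keys)
total = sum(keys)
mean = total / n
variance = sum((k - mean) ** 2 for k in keys) / (n - 1)  # sample variance (n - 1)
q1, median, q3 = (quantile(keys, q) for q in (0.25, 0.5, 0.75))
p95 = quantile(keys, 0.95)
iqr = q3 - q1
value_range = max(keys) - min(keys)

print(n, total, mean, q1, median, q3, iqr, value_range)
```

The large gap between the two runs explains why the mean (20254.5) sits far above the median (16666.5) and why the range is about 80 times the interquartile range.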
Common values
Value | Count | Frequency (%) |
16467 | 1 | 0.2% |
16731 | 1 | 0.2% |
16741 | 1 | 0.2% |
16740 | 1 | 0.2% |
16739 | 1 | 0.2% |
16738 | 1 | 0.2% |
16737 | 1 | 0.2% |
16736 | 1 | 0.2% |
16735 | 1 | 0.2% |
16734 | 1 | 0.2% |
Other values (390) | 390 | 97.5% |
Minimum 10 values
Value | Count | Frequency (%) |
16467 | 1 | 0.2% |
16468 | 1 | 0.2% |
16469 | 1 | 0.2% |
16470 | 1 | 0.2% |
16471 | 1 | 0.2% |
16472 | 1 | 0.2% |
16473 | 1 | 0.2% |
16474 | 1 | 0.2% |
16475 | 1 | 0.2% |
16476 | 1 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
32466 | 1 | 0.2% |
32465 | 1 | 0.2% |
32464 | 1 | 0.2% |
32463 | 1 | 0.2% |
32462 | 1 | 0.2% |
32461 | 1 | 0.2% |
32460 | 1 | 0.2% |
32459 | 1 | 0.2% |
32458 | 1 | 0.2% |
32457 | 1 | 0.2% |
"채널값"
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
"블로그" |
---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | "블로그" |
---|---|
2nd row | "블로그" |
3rd row | "블로그" |
4th row | "블로그" |
5th row | "블로그" |
Common Values
Value | Count | Frequency (%) |
"블로그" | 400 | 100.0% |
"해당일자"
Date
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Minimum | 2020-05-01 00:00:00 |
---|---|
Maximum | 2020-05-02 00:00:00 |
"차례값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 308 |
---|---|
Distinct (%) | 77.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 129.66 |
Minimum | 1 |
---|---|
Maximum | 308 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 10.95 |
Q1 | 50.75 |
median | 108.5 |
Q3 | 208.25 |
95-th percentile | 288.05 |
Maximum | 308 |
Range | 307 |
Interquartile range (IQR) | 157.5 |
Descriptive statistics
Standard deviation | 91.300514 |
---|---|
Coefficient of variation (CV) | 0.70415328 |
Kurtosis | -1.1570759 |
Mean | 129.66 |
Median Absolute Deviation (MAD) | 72 |
Skewness | 0.38745295 |
Sum | 51864 |
Variance | 8335.7839 |
Monotonicity | Not monotonic |
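The distinct count, the sum and the frequency rows for this variable all fit a simple layout: ranks 1 to 308 on one date and ranks 1 to 92 on the other. That layout is an assumption inferred from the figures, not stated in the report. A sketch that rebuilds the value counts and the "Other values" row under that assumption:

```python
from collections import Counter

# Assumed layout (inferred): ranks 1..308 on the first date, 1..92 on the second.
ranks = list(range(1, 309)) + list(range(1, 93))

counts = Counter(ranks)
n = len(ranks)
distinct = len(counts)

# Ten most frequent values; every rank from 1 to 92 occurs twice.
top10 = counts.most_common(10)

# Everything not listed in the top ten collapses into one summary row.
other_distinct = distinct - len(top10)
other_rows = n - sum(count for _, count in top10)
other_pct = 100 * other_rows / n

print(distinct, sum(ranks), sum(ranks) / n)
print(f"Other values ({other_distinct}) | {other_rows} | {other_pct:.1f}%")
```

Which ten tied values appear in the top-ten list depends on tie ordering, so the listed ranks can differ from the report while the counts and the summary row stay the same.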
Common values
Value | Count | Frequency (%) |
1 | 2 | 0.5% |
60 | 2 | 0.5% |
69 | 2 | 0.5% |
68 | 2 | 0.5% |
67 | 2 | 0.5% |
66 | 2 | 0.5% |
65 | 2 | 0.5% |
64 | 2 | 0.5% |
63 | 2 | 0.5% |
62 | 2 | 0.5% |
Other values (298) | 380 | 95.0% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 2 | 0.5% |
2 | 2 | 0.5% |
3 | 2 | 0.5% |
4 | 2 | 0.5% |
5 | 2 | 0.5% |
6 | 2 | 0.5% |
7 | 2 | 0.5% |
8 | 2 | 0.5% |
9 | 2 | 0.5% |
10 | 2 | 0.5% |
Maximum 10 values
Value | Count | Frequency (%) |
308 | 1 | 0.2% |
307 | 1 | 0.2% |
306 | 1 | 0.2% |
305 | 1 | 0.2% |
304 | 1 | 0.2% |
303 | 1 | 0.2% |
302 | 1 | 0.2% |
301 | 1 | 0.2% |
300 | 1 | 0.2% |
299 | 1 | 0.2% |
"이슈어값"
Text
Distinct | 308 |
---|---|
Distinct (%) | 77.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Value | Count | Frequency (%) |
여행 | 2 | 0.5% |
이별 | 2 | 0.5% |
야식 | 2 | 0.5% |
낚시 | 2 | 0.5% |
해외여행 | 2 | 0.5% |
선거 | 2 | 0.5% |
설거지 | 2 | 0.5% |
투표 | 2 | 0.5% |
고백 | 2 | 0.5% |
개학 | 2 | 0.5% |
Other values (298) | 380 | 95.0% |
Most occurring characters
Value | Count | Frequency (%) |
" | 800 | 40.5% |
년 | 65 | 3.3% |
주 | 63 | 3.2% |
회 | 26 | 1.3% |
여 | 19 | 1.0% |
기 | 19 | 1.0% |
시 | 18 | 0.9% |
1 | 18 | 0.9% |
2 | 17 | 0.9% |
학 | 16 | 0.8% |
Other values (270) | 916 | 46.3% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1059 | 53.6% |
Other Punctuation | 800 | 40.5% |
Decimal Number | 118 | 6.0% |
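The category names in this table are Unicode general categories. The count of 800 for Other Punctuation equals two quote characters per row across 400 rows, which suggests the text values were profiled with their surrounding quotes; that reading is an interpretation, not something the report states. A small sketch using the standard `unicodedata` module on two values taken from the report's sample rows:

```python
import unicodedata
from collections import Counter

# Two values copied from the report's sample rows, quotes included.
values = ['"여행"', '"쇼핑"']

# General category codes: 'Lo' = Other Letter, 'Po' = Other Punctuation,
# 'Nd' = Decimal Number.
category_counts = Counter(
    unicodedata.category(ch) for value in values for ch in value
)

# Two quote characters per value across 400 rows.
quote_chars_for_400_rows = 2 * 400

print(category_counts, quote_chars_for_400_rows)
```

Hangul syllables fall under Other Letter because they are letters without an upper or lower case.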
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
년 | 65 | 6.1% |
주 | 63 | 5.9% |
회 | 26 | 2.5% |
여 | 19 | 1.8% |
기 | 19 | 1.8% |
시 | 18 | 1.7% |
학 | 16 | 1.5% |
행 | 16 | 1.5% |
가 | 16 | 1.5% |
휴 | 14 | 1.3% |
Other values (259) | 787 | 74.3% |
Decimal Number
Value | Count | Frequency (%) |
1 | 18 | 15.3% |
2 | 17 | 14.4% |
3 | 13 | 11.0% |
5 | 12 | 10.2% |
6 | 12 | 10.2% |
0 | 11 | 9.3% |
9 | 9 | 7.6% |
4 | 9 | 7.6% |
8 | 9 | 7.6% |
7 | 8 | 6.8% |
Other Punctuation
Value | Count | Frequency (%) |
" | 800 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1059 | 53.6% |
Common | 918 | 46.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
년 | 65 | 6.1% |
주 | 63 | 5.9% |
회 | 26 | 2.5% |
여 | 19 | 1.8% |
기 | 19 | 1.8% |
시 | 18 | 1.7% |
학 | 16 | 1.5% |
행 | 16 | 1.5% |
가 | 16 | 1.5% |
휴 | 14 | 1.3% |
Other values (259) | 787 | 74.3% |
Common
Value | Count | Frequency (%) |
" | 800 | 87.1% |
1 | 18 | 2.0% |
2 | 17 | 1.9% |
3 | 13 | 1.4% |
5 | 12 | 1.3% |
6 | 12 | 1.3% |
0 | 11 | 1.2% |
9 | 9 | 1.0% |
4 | 9 | 1.0% |
8 | 9 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1059 | 53.6% |
ASCII | 918 | 46.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
" | 800 | 87.1% |
1 | 18 | 2.0% |
2 | 17 | 1.9% |
3 | 13 | 1.4% |
5 | 12 | 1.3% |
6 | 12 | 1.3% |
0 | 11 | 1.2% |
9 | 9 | 1.0% |
4 | 9 | 1.0% |
8 | 9 | 1.0% |
Hangul
Value | Count | Frequency (%) |
년 | 65 | 6.1% |
주 | 63 | 5.9% |
회 | 26 | 2.5% |
여 | 19 | 1.8% |
기 | 19 | 1.8% |
시 | 18 | 1.7% |
학 | 16 | 1.5% |
행 | 16 | 1.5% |
가 | 16 | 1.5% |
휴 | 14 | 1.3% |
Other values (259) | 787 | 74.3% |
"건수값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 315 |
---|---|
Distinct (%) | 78.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1672.3375 |
Minimum | 1 |
---|---|
Maximum | 21147 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 46.75 |
median | 489 |
Q3 | 1686.5 |
95-th percentile | 7014.65 |
Maximum | 21147 |
Range | 21146 |
Interquartile range (IQR) | 1639.75 |
Descriptive statistics
Standard deviation | 3184.7658 |
---|---|
Coefficient of variation (CV) | 1.9043798 |
Kurtosis | 12.22317 |
Mean | 1672.3375 |
Median Absolute Deviation (MAD) | 476.5 |
Skewness | 3.3124698 |
Sum | 668935 |
Variance | 10142733 |
Monotonicity | Not monotonic |
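Several of the figures above follow directly from others in the same tables. A short sketch of the arithmetic, using only numbers copied from the report (the variable names are illustrative):

```python
# Figures copied from the report for this variable.
n = 400
total = 668935
std = 3184.7658
q1, q3 = 46.75, 1686.5
minimum, maximum = 1, 21147

mean = total / n               # Mean = Sum / number of observations
cv = std / mean                # Coefficient of variation = std / mean
iqr = q3 - q1                  # Interquartile range = Q3 - Q1
value_range = maximum - minimum
variance = std ** 2            # Variance = std squared (up to rounding)

print(mean, round(cv, 7), iqr, value_range, round(variance))
```

A coefficient of variation near 1.9, together with a mean (1672.3375) well above the median (489), matches the strong right skew reported for this column.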
Common values
Value | Count | Frequency (%) |
1 | 16 | 4.0% |
4 | 11 | 2.8% |
2 | 7 | 1.8% |
6 | 6 | 1.5% |
17 | 4 | 1.0% |
8 | 4 | 1.0% |
14 | 4 | 1.0% |
47 | 3 | 0.8% |
5 | 3 | 0.8% |
59 | 3 | 0.8% |
Other values (305) | 339 | 84.8% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 16 | 4.0% |
2 | 7 | 1.8% |
3 | 3 | 0.8% |
4 | 11 | 2.8% |
5 | 3 | 0.8% |
6 | 6 | 1.5% |
7 | 2 | 0.5% |
8 | 4 | 1.0% |
9 | 3 | 0.8% |
10 | 2 | 0.5% |
Maximum 10 values
Value | Count | Frequency (%) |
21147 | 1 | 0.2% |
19147 | 1 | 0.2% |
18027 | 1 | 0.2% |
15816 | 1 | 0.2% |
15312 | 1 | 0.2% |
15066 | 1 | 0.2% |
15063 | 1 | 0.2% |
14708 | 1 | 0.2% |
14083 | 1 | 0.2% |
13706 | 1 | 0.2% |
Correlations
Variable | "기본키값" | "해당일자" | "차례값" | "건수값" |
---|---|---|---|---|
"기본키값" | 1.000 | 1.000 | 0.738 | 0.407 |
"해당일자" | 1.000 | 1.000 | 0.735 | 0.394 |
"차례값" | 0.738 | 0.735 | 1.000 | 0.728 |
"건수값" | 0.407 | 0.394 | 0.728 | 1.000 |
Variable | "기본키값" | "차례값" | "건수값" |
---|---|---|---|
"기본키값" | 1.000 | 0.206 | -0.231 |
"차례값" | 0.206 | 1.000 | -1.000 |
"건수값" | -0.231 | -1.000 | 1.000 |
First rows
Index | "기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값" |
---|---|---|---|---|---|---|
0 | 16467 | "블로그" | 2020-05-01 | 1 | "여행" | 21147 |
1 | 16468 | "블로그" | 2020-05-01 | 2 | "쇼핑" | 18027 |
2 | 16469 | "블로그" | 2020-05-01 | 3 | "공부" | 15816 |
3 | 16470 | "블로그" | 2020-05-01 | 4 | "약속" | 15312 |
4 | 16471 | "블로그" | 2020-05-01 | 5 | "운전" | 15066 |
5 | 16472 | "블로그" | 2020-05-01 | 6 | "식사" | 14708 |
6 | 16473 | "블로그" | 2020-05-01 | 7 | "운동" | 13588 |
7 | 16474 | "블로그" | 2020-05-01 | 8 | "요리" | 11842 |
8 | 16475 | "블로그" | 2020-05-01 | 9 | "치료" | 9838 |
9 | 16476 | "블로그" | 2020-05-01 | 10 | "결혼" | 8072 |
Last rows
Index | "기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값" |
---|---|---|---|---|---|---|
390 | 32457 | "블로그" | 2020-05-02 | 83 | "직장생활" | 690 |
391 | 32458 | "블로그" | 2020-05-02 | 84 | "토론" | 654 |
392 | 32459 | "블로그" | 2020-05-02 | 85 | "가족모임" | 639 |
393 | 32460 | "블로그" | 2020-05-02 | 86 | "승진" | 631 |
394 | 32461 | "블로그" | 2020-05-02 | 87 | "후원" | 602 |
395 | 32462 | "블로그" | 2020-05-02 | 88 | "스터디" | 589 |
396 | 32463 | "블로그" | 2020-05-02 | 89 | "호캉스" | 555 |
397 | 32464 | "블로그" | 2020-05-02 | 90 | "여가" | 545 |
398 | 32465 | "블로그" | 2020-05-02 | 91 | "신혼" | 536 |
399 | 32466 | "블로그" | 2020-05-02 | 92 | "돌잔치" | 533 |
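The report does not say which coefficient each correlation matrix uses. A value of -1.000 between "차례값" and "건수값" is what a rank-based coefficient such as Spearman's gives when one variable strictly decreases as the other increases. The sketch below illustrates this on the first ten sample rows only, with hand-written helpers that assume no ties; it is not a reproduction of the report's calculation.

```python
def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

def rank_positions(values):
    """Rank of each value, starting at 1; assumes there are no ties."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0] * len(values)
    for position, index in enumerate(order, start=1):
        result[index] = position
    return result

def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(rank_positions(xs), rank_positions(ys))

# "차례값" and "건수값" from the first ten sample rows.
rank_values = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
count_values = [21147, 18027, 15816, 15312, 15066, 14708, 13588, 11842, 9838, 8072]

rho = spearman(rank_values, count_values)
r = pearson(rank_values, count_values)
print(f"Spearman: {rho:.3f}, Pearson: {r:.3f}")
```

On these rows the counts fall steadily but not linearly, so the rank-based coefficient reaches -1 while the linear one stays slightly above it.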