Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 400 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 20.1 KiB |
Average record size in memory | 51.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 1 |
DateTime | 1 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 코난테크놀로지 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TPORELATION |
Reproduction
Analysis started | 2023-12-10 06:17:07.137498 |
---|---|
Analysis finished | 2023-12-10 06:17:11.400065 |
Duration | 4.26 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
"기본키값"
Real number (ℝ)
UNIQUE
 
Distinct | 400 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 31751.19 |
Minimum | 12357 |
---|---|
Maximum | 61564 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 12357 |
---|---|
5-th percentile | 12376.95 |
Q1 | 12456.75 |
median | 28334.5 |
Q3 | 44328.25 |
95-th percentile | 61544.05 |
Maximum | 61564 |
Range | 49207 |
Interquartile range (IQR) | 31871.5 |
Descriptive statistics
Standard deviation | 15834.912 |
---|---|
Coefficient of variation (CV) | 0.49871869 |
Kurtosis | -0.95766719 |
Mean | 31751.19 |
Median Absolute Deviation (MAD) | 15936 |
Skewness | 0.2782427 |
Sum | 12700476 |
Variance | 2.5074444 × 108 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
12357 | 1 | 0.2% |
44293 | 1 | 0.2% |
44303 | 1 | 0.2% |
44302 | 1 | 0.2% |
44301 | 1 | 0.2% |
44300 | 1 | 0.2% |
44299 | 1 | 0.2% |
44298 | 1 | 0.2% |
44297 | 1 | 0.2% |
44296 | 1 | 0.2% |
Other values (390) | 390 |
Value | Count | Frequency (%) |
12357 | 1 | |
12358 | 1 | |
12359 | 1 | |
12360 | 1 | |
12361 | 1 | |
12362 | 1 | |
12363 | 1 | |
12364 | 1 | |
12365 | 1 | |
12366 | 1 |
Value | Count | Frequency (%) |
61564 | 1 | |
61563 | 1 | |
61562 | 1 | |
61561 | 1 | |
61560 | 1 | |
61559 | 1 | |
61558 | 1 | |
61557 | 1 | |
61556 | 1 | |
61555 | 1 |
"채널값"
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
"블로그" |
---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | "블로그" |
---|---|
2nd row | "블로그" |
3rd row | "블로그" |
4th row | "블로그" |
5th row | "블로그" |
Common Values
Value | Count | Frequency (%) |
"블로그" | 400 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
블로그 | 400 |
"해당일자"
Date
Distinct | 4 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Minimum | 2020-05-01 00:00:00 |
---|---|
Maximum | 2020-05-04 00:00:00 |
"차례값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 122 |
---|---|
Distinct (%) | 30.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 56.51 |
Minimum | 1 |
---|---|
Maximum | 122 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5.95 |
Q1 | 25.75 |
median | 54 |
Q3 | 87 |
95-th percentile | 114 |
Maximum | 122 |
Range | 121 |
Interquartile range (IQR) | 61.25 |
Descriptive statistics
Standard deviation | 35.236827 |
---|---|
Coefficient of variation (CV) | 0.62355029 |
Kurtosis | -1.2263758 |
Mean | 56.51 |
Median Absolute Deviation (MAD) | 30.5 |
Skewness | 0.16115045 |
Sum | 22604 |
Variance | 1241.634 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 4 | 1.0% |
22 | 4 | 1.0% |
24 | 4 | 1.0% |
25 | 4 | 1.0% |
26 | 4 | 1.0% |
27 | 4 | 1.0% |
28 | 4 | 1.0% |
29 | 4 | 1.0% |
30 | 4 | 1.0% |
31 | 4 | 1.0% |
Other values (112) | 360 |
Value | Count | Frequency (%) |
1 | 4 | |
2 | 4 | |
3 | 4 | |
4 | 4 | |
5 | 4 | |
6 | 4 | |
7 | 4 | |
8 | 4 | |
9 | 4 | |
10 | 4 |
Value | Count | Frequency (%) |
122 | 1 | 0.2% |
121 | 1 | 0.2% |
120 | 2 | |
119 | 2 | |
118 | 3 | |
117 | 3 | |
116 | 3 | |
115 | 3 | |
114 | 3 | |
113 | 3 |
"이슈어값"
Text
Distinct | 122 |
---|---|
Distinct (%) | 30.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Value | Count | Frequency (%) |
친구 | 4 | 1.0% |
삼촌 | 4 | 1.0% |
여자친구 | 4 | 1.0% |
형제 | 4 | 1.0% |
누나 | 4 | 1.0% |
선배 | 4 | 1.0% |
할아버지 | 4 | 1.0% |
딸(女 | 4 | 1.0% |
막내 | 4 | 1.0% |
반려견 | 4 | 1.0% |
Other values (112) | 360 |
Most occurring characters
Value | Count | Frequency (%) |
" | 800 | |
아 | 49 | 2.5% |
친 | 47 | 2.4% |
동 | 44 | 2.3% |
니 | 42 | 2.1% |
남 | 42 | 2.1% |
사 | 38 | 1.9% |
자 | 30 | 1.5% |
부 | 27 | 1.4% |
구 | 27 | 1.4% |
Other values (99) | 808 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1146 | |
Other Punctuation | 800 | |
Close Punctuation | 4 | 0.2% |
Open Punctuation | 4 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 49 | 4.3% |
친 | 47 | 4.1% |
동 | 44 | 3.8% |
니 | 42 | 3.7% |
남 | 42 | 3.7% |
사 | 38 | 3.3% |
자 | 30 | 2.6% |
부 | 27 | 2.4% |
구 | 27 | 2.4% |
할 | 26 | 2.3% |
Other values (96) | 774 |
Other Punctuation
Value | Count | Frequency (%) |
" | 800 |
Close Punctuation
Value | Count | Frequency (%) |
) | 4 |
Open Punctuation
Value | Count | Frequency (%) |
( | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1142 | |
Common | 808 | |
Han | 4 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 49 | 4.3% |
친 | 47 | 4.1% |
동 | 44 | 3.9% |
니 | 42 | 3.7% |
남 | 42 | 3.7% |
사 | 38 | 3.3% |
자 | 30 | 2.6% |
부 | 27 | 2.4% |
구 | 27 | 2.4% |
할 | 26 | 2.3% |
Other values (95) | 770 |
Common
Value | Count | Frequency (%) |
" | 800 | |
) | 4 | 0.5% |
( | 4 | 0.5% |
Han
Value | Count | Frequency (%) |
女 | 4 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1142 | |
ASCII | 808 | |
CJK Compat Ideographs | 4 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
" | 800 | |
) | 4 | 0.5% |
( | 4 | 0.5% |
Hangul
Value | Count | Frequency (%) |
아 | 49 | 4.3% |
친 | 47 | 4.1% |
동 | 44 | 3.9% |
니 | 42 | 3.7% |
남 | 42 | 3.7% |
사 | 38 | 3.3% |
자 | 30 | 2.6% |
부 | 27 | 2.4% |
구 | 27 | 2.4% |
할 | 26 | 2.3% |
Other values (95) | 770 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
女 | 4 |
"건수값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 282 |
---|---|
Distinct (%) | 70.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2212.52 |
Minimum | 1 |
---|---|
Maximum | 28557 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 4.95 |
Q1 | 31 |
median | 184 |
Q3 | 1435.75 |
95-th percentile | 12925.1 |
Maximum | 28557 |
Range | 28556 |
Interquartile range (IQR) | 1404.75 |
Descriptive statistics
Standard deviation | 4767.2909 |
---|---|
Coefficient of variation (CV) | 2.1546883 |
Kurtosis | 10.669678 |
Mean | 2212.52 |
Median Absolute Deviation (MAD) | 176 |
Skewness | 3.1502129 |
Sum | 885008 |
Variance | 22727063 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11 | 7 | 1.8% |
1 | 7 | 1.8% |
30 | 6 | 1.5% |
24 | 6 | 1.5% |
8 | 5 | 1.2% |
17 | 5 | 1.2% |
4 | 5 | 1.2% |
2 | 4 | 1.0% |
37 | 4 | 1.0% |
56 | 4 | 1.0% |
Other values (272) | 347 |
Value | Count | Frequency (%) |
1 | 7 | |
2 | 4 | |
3 | 4 | |
4 | 5 | |
5 | 4 | |
6 | 1 | 0.2% |
7 | 3 | |
8 | 5 | |
9 | 4 | |
10 | 1 | 0.2% |
Value | Count | Frequency (%) |
28557 | 1 | |
27997 | 1 | |
26419 | 1 | |
24182 | 1 | |
23957 | 1 | |
23398 | 1 | |
22753 | 1 | |
21880 | 1 | |
19244 | 1 | |
18120 | 1 |
"기본키값" | "해당일자" | "차례값" | "건수값" | |
---|---|---|---|---|
"기본키값" | 1.000 | 1.000 | 0.284 | 0.222 |
"해당일자" | 1.000 | 1.000 | 0.282 | 0.185 |
"차례값" | 0.284 | 0.282 | 1.000 | 0.813 |
"건수값" | 0.222 | 0.185 | 0.813 | 1.000 |
"기본키값" | "차례값" | "건수값" | |
---|---|---|---|
"기본키값" | 1.000 | 0.096 | -0.094 |
"차례값" | 0.096 | 1.000 | -1.000 |
"건수값" | -0.094 | -1.000 | 1.000 |
"기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값" | |
---|---|---|---|---|---|---|
0 | 12357 | "블로그" | 2020-05-01 | 1 | "친구" | 26419 |
1 | 12358 | "블로그" | 2020-05-01 | 2 | "가족" | 24182 |
2 | 12359 | "블로그" | 2020-05-01 | 3 | "엄마" | 18120 |
3 | 12360 | "블로그" | 2020-05-01 | 4 | "혼자" | 15575 |
4 | 12361 | "블로그" | 2020-05-01 | 5 | "아기" | 14965 |
5 | 12362 | "블로그" | 2020-05-01 | 6 | "부모" | 11697 |
6 | 12363 | "블로그" | 2020-05-01 | 7 | "남편" | 11489 |
7 | 12364 | "블로그" | 2020-05-01 | 8 | "아빠" | 10859 |
8 | 12365 | "블로그" | 2020-05-01 | 9 | "지인" | 7804 |
9 | 12366 | "블로그" | 2020-05-01 | 10 | "자녀" | 7621 |
"기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값" | |
---|---|---|---|---|---|---|
390 | 61555 | "블로그" | 2020-05-04 | 31 | "막내" | 1182 |
391 | 61556 | "블로그" | 2020-05-04 | 32 | "친정" | 1151 |
392 | 61557 | "블로그" | 2020-05-04 | 33 | "직장동료" | 1070 |
393 | 61558 | "블로그" | 2020-05-04 | 34 | "자매" | 947 |
394 | 61559 | "블로그" | 2020-05-04 | 35 | "동창" | 841 |
395 | 61560 | "블로그" | 2020-05-04 | 36 | "시댁" | 818 |
396 | 61561 | "블로그" | 2020-05-04 | 37 | "후배" | 811 |
397 | 61562 | "블로그" | 2020-05-04 | 38 | "친정어머니" | 592 |
398 | 61563 | "블로그" | 2020-05-04 | 39 | "삼촌" | 570 |
399 | 61564 | "블로그" | 2020-05-04 | 40 | "여동생" | 559 |