Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 400 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 20.1 KiB |
Average record size in memory | 51.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 2 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 코난테크놀로지 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TPOGROUP |
"채널값" has constant value "" | Constant |
"기본키값" is highly overall correlated with "해당일자" | High correlation |
"차례값" is highly overall correlated with "건수값" | High correlation |
"건수값" is highly overall correlated with "차례값" | High correlation |
"해당일자" is highly overall correlated with "기본키값" | High correlation |
"기본키값" has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 06:40:05.416490 |
---|---|
Analysis finished | 2023-12-10 06:40:07.867004 |
Duration | 2.45 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
"기본키값"
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 400 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20364.97 |
Minimum | 12156 |
---|---|
Maximum | 44075 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 12156 |
---|---|
5-th percentile | 12175.95 |
Q1 | 12255.75 |
median | 12355.5 |
Q3 | 28157.25 |
95-th percentile | 28237.05 |
Maximum | 44075 |
Range | 31919 |
Interquartile range (IQR) | 15901.5 |
Descriptive statistics
Standard deviation | 8347.5253 |
---|---|
Coefficient of variation (CV) | 0.40989627 |
Kurtosis | -1.3648771 |
Mean | 20364.97 |
Median Absolute Deviation (MAD) | 199 |
Skewness | 0.2216268 |
Sum | 8145988 |
Variance | 69681179 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
12156 | 1 | 0.2% |
28122 | 1 | 0.2% |
28132 | 1 | 0.2% |
28131 | 1 | 0.2% |
28130 | 1 | 0.2% |
28129 | 1 | 0.2% |
28128 | 1 | 0.2% |
28127 | 1 | 0.2% |
28126 | 1 | 0.2% |
28125 | 1 | 0.2% |
Other values (390) | 390 |
Value | Count | Frequency (%) |
12156 | 1 | |
12157 | 1 | |
12158 | 1 | |
12159 | 1 | |
12160 | 1 | |
12161 | 1 | |
12162 | 1 | |
12163 | 1 | |
12164 | 1 | |
12165 | 1 |
Value | Count | Frequency (%) |
44075 | 1 | |
44074 | 1 | |
44073 | 1 | |
44072 | 1 | |
44071 | 1 | |
28252 | 1 | |
28251 | 1 | |
28250 | 1 | |
28249 | 1 | |
28248 | 1 |
"채널값"
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
"블로그" |
---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | "블로그" |
---|---|
2nd row | "블로그" |
3rd row | "블로그" |
4th row | "블로그" |
5th row | "블로그" |
Common Values
Value | Count | Frequency (%) |
"블로그" | 400 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
블로그 | 400 |
"해당일자"
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
2020-05-01 | |
---|---|
2020-05-02 | |
2020-05-03 | 5 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-05-01 |
---|---|
2nd row | 2020-05-01 |
3rd row | 2020-05-01 |
4th row | 2020-05-01 |
5th row | 2020-05-01 |
Common Values
Value | Count | Frequency (%) |
2020-05-01 | 201 | |
2020-05-02 | 194 | |
2020-05-03 | 5 | 1.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-05-01 | 201 | |
2020-05-02 | 194 | |
2020-05-03 | 5 | 1.2% |
"차례값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 201 |
---|---|
Distinct (%) | 50.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 98.0775 |
Minimum | 1 |
---|---|
Maximum | 201 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 8 |
Q1 | 48 |
median | 98 |
Q3 | 148 |
95-th percentile | 188 |
Maximum | 201 |
Range | 200 |
Interquartile range (IQR) | 100 |
Descriptive statistics
Standard deviation | 57.781079 |
---|---|
Coefficient of variation (CV) | 0.58913695 |
Kurtosis | -1.2011587 |
Mean | 98.0775 |
Median Absolute Deviation (MAD) | 50 |
Skewness | 0.0076202642 |
Sum | 39231 |
Variance | 3338.6531 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3 | 0.8% |
3 | 3 | 0.8% |
4 | 3 | 0.8% |
5 | 3 | 0.8% |
2 | 3 | 0.8% |
136 | 2 | 0.5% |
127 | 2 | 0.5% |
128 | 2 | 0.5% |
129 | 2 | 0.5% |
130 | 2 | 0.5% |
Other values (191) | 375 |
Value | Count | Frequency (%) |
1 | 3 | |
2 | 3 | |
3 | 3 | |
4 | 3 | |
5 | 3 | |
6 | 2 | |
7 | 2 | |
8 | 2 | |
9 | 2 | |
10 | 2 |
Value | Count | Frequency (%) |
201 | 1 | |
200 | 1 | |
199 | 1 | |
198 | 1 | |
197 | 1 | |
196 | 1 | |
195 | 1 | |
194 | 2 | |
193 | 2 | |
192 | 2 |
"이슈어값"
Text
Distinct | 209 |
---|---|
Distinct (%) | 52.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Value | Count | Frequency (%) |
고객 | 3 | 0.8% |
어린이 | 3 | 0.8% |
아기 | 3 | 0.8% |
학생 | 3 | 0.8% |
회사원 | 3 | 0.8% |
복학생 | 2 | 0.5% |
예비역 | 2 | 0.5% |
하위층 | 2 | 0.5% |
5인가구 | 2 | 0.5% |
은퇴자 | 2 | 0.5% |
Other values (199) | 375 |
Most occurring characters
Value | Count | Frequency (%) |
" | 800 | |
자 | 50 | 2.3% |
인 | 44 | 2.0% |
대 | 39 | 1.8% |
생 | 35 | 1.6% |
세 | 33 | 1.5% |
족 | 31 | 1.4% |
아 | 26 | 1.2% |
가 | 25 | 1.2% |
이 | 25 | 1.2% |
Other values (227) | 1055 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1286 | |
Other Punctuation | 800 | |
Decimal Number | 63 | 2.9% |
Lowercase Letter | 14 | 0.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
자 | 50 | 3.9% |
인 | 44 | 3.4% |
대 | 39 | 3.0% |
생 | 35 | 2.7% |
세 | 33 | 2.6% |
족 | 31 | 2.4% |
아 | 26 | 2.0% |
가 | 25 | 1.9% |
이 | 25 | 1.9% |
부 | 24 | 1.9% |
Other values (210) | 954 |
Decimal Number
Value | Count | Frequency (%) |
0 | 22 | |
3 | 9 | |
8 | 6 | 9.5% |
4 | 6 | 9.5% |
5 | 6 | 9.5% |
6 | 5 | 7.9% |
2 | 4 | 6.3% |
7 | 3 | 4.8% |
1 | 2 | 3.2% |
Lowercase Letter
Value | Count | Frequency (%) |
p | 2 | |
y | 2 | |
n | 2 | |
i | 2 | |
v | 2 | |
x | 2 | |
z | 2 |
Other Punctuation
Value | Count | Frequency (%) |
" | 800 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1286 | |
Common | 863 | |
Latin | 14 | 0.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
자 | 50 | 3.9% |
인 | 44 | 3.4% |
대 | 39 | 3.0% |
생 | 35 | 2.7% |
세 | 33 | 2.6% |
족 | 31 | 2.4% |
아 | 26 | 2.0% |
가 | 25 | 1.9% |
이 | 25 | 1.9% |
부 | 24 | 1.9% |
Other values (210) | 954 |
Common
Value | Count | Frequency (%) |
" | 800 | |
0 | 22 | 2.5% |
3 | 9 | 1.0% |
8 | 6 | 0.7% |
4 | 6 | 0.7% |
5 | 6 | 0.7% |
6 | 5 | 0.6% |
2 | 4 | 0.5% |
7 | 3 | 0.3% |
1 | 2 | 0.2% |
Latin
Value | Count | Frequency (%) |
p | 2 | |
y | 2 | |
n | 2 | |
i | 2 | |
v | 2 | |
x | 2 | |
z | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1286 | |
ASCII | 877 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
" | 800 | |
0 | 22 | 2.5% |
3 | 9 | 1.0% |
8 | 6 | 0.7% |
4 | 6 | 0.7% |
5 | 6 | 0.7% |
6 | 5 | 0.6% |
2 | 4 | 0.5% |
7 | 3 | 0.3% |
p | 2 | 0.2% |
Other values (7) | 14 | 1.6% |
Hangul
Value | Count | Frequency (%) |
자 | 50 | 3.9% |
인 | 44 | 3.4% |
대 | 39 | 3.0% |
생 | 35 | 2.7% |
세 | 33 | 2.6% |
족 | 31 | 2.4% |
아 | 26 | 2.0% |
가 | 25 | 1.9% |
이 | 25 | 1.9% |
부 | 24 | 1.9% |
Other values (210) | 954 |
"건수값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 234 |
---|---|
Distinct (%) | 58.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 866.3475 |
Minimum | 1 |
---|---|
Maximum | 41399 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 9 |
median | 50 |
Q3 | 362.5 |
95-th percentile | 3271.85 |
Maximum | 41399 |
Range | 41398 |
Interquartile range (IQR) | 353.5 |
Descriptive statistics
Standard deviation | 3510.8186 |
---|---|
Coefficient of variation (CV) | 4.0524369 |
Kurtosis | 82.50825 |
Mean | 866.3475 |
Median Absolute Deviation (MAD) | 48 |
Skewness | 8.4880796 |
Sum | 346539 |
Variance | 12325847 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 22 | 5.5% |
2 | 21 | 5.2% |
1 | 17 | 4.2% |
5 | 11 | 2.8% |
4 | 8 | 2.0% |
8 | 8 | 2.0% |
18 | 7 | 1.8% |
6 | 7 | 1.8% |
17 | 6 | 1.5% |
11 | 5 | 1.2% |
Other values (224) | 288 |
Value | Count | Frequency (%) |
1 | 17 | |
2 | 21 | |
3 | 22 | |
4 | 8 | 2.0% |
5 | 11 | |
6 | 7 | 1.8% |
7 | 3 | 0.8% |
8 | 8 | 2.0% |
9 | 4 | 1.0% |
10 | 3 | 0.8% |
Value | Count | Frequency (%) |
41399 | 1 | |
35165 | 1 | |
31707 | 1 | |
15236 | 1 | |
14965 | 1 | |
13789 | 1 | |
8255 | 1 | |
6665 | 1 | |
6222 | 1 | |
6172 | 1 |
"기본키값" | "해당일자" | "차례값" | "건수값" | |
---|---|---|---|---|
"기본키값" | 1.000 | 1.000 | 0.606 | 0.447 |
"해당일자" | 1.000 | 1.000 | 0.278 | 0.766 |
"차례값" | 0.606 | 0.278 | 1.000 | 0.411 |
"건수값" | 0.447 | 0.766 | 0.411 | 1.000 |
"기본키값" | "차례값" | "건수값" | "해당일자" | |
---|---|---|---|---|
"기본키값" | 1.000 | 0.421 | -0.456 | 0.999 |
"차례값" | 0.421 | 1.000 | -0.999 | 0.164 |
"건수값" | -0.456 | -0.999 | 1.000 | 0.442 |
"해당일자" | 0.999 | 0.164 | 0.442 | 1.000 |
"기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값" | |
---|---|---|---|---|---|---|
0 | 12156 | "블로그" | 2020-05-01 | 1 | "고객" | 41399 |
1 | 12157 | "블로그" | 2020-05-01 | 2 | "아기" | 14965 |
2 | 12158 | "블로그" | 2020-05-01 | 3 | "학생" | 8255 |
3 | 12159 | "블로그" | 2020-05-01 | 4 | "회사원" | 6172 |
4 | 12160 | "블로그" | 2020-05-01 | 5 | "근로자" | 5823 |
5 | 12161 | "블로그" | 2020-05-01 | 6 | "어린이" | 5704 |
6 | 12162 | "블로그" | 2020-05-01 | 7 | "주부" | 4096 |
7 | 12163 | "블로그" | 2020-05-01 | 8 | "연예인" | 3421 |
8 | 12164 | "블로그" | 2020-05-01 | 9 | "교수" | 3050 |
9 | 12165 | "블로그" | 2020-05-01 | 10 | "생산자" | 2647 |
"기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값" | |
---|---|---|---|---|---|---|
390 | 28248 | "블로그" | 2020-05-02 | 190 | "미혼부" | 1 |
391 | 28249 | "블로그" | 2020-05-02 | 191 | "다이아수저" | 1 |
392 | 28250 | "블로그" | 2020-05-02 | 192 | "차상위층" | 1 |
393 | 28251 | "블로그" | 2020-05-02 | 193 | "다세대가족" | 1 |
394 | 28252 | "블로그" | 2020-05-02 | 194 | "워킹푸어" | 1 |
395 | 44071 | "블로그" | 2020-05-03 | 1 | "고객" | 31707 |
396 | 44072 | "블로그" | 2020-05-03 | 2 | "아기" | 13789 |
397 | 44073 | "블로그" | 2020-05-03 | 3 | "학생" | 6222 |
398 | 44074 | "블로그" | 2020-05-03 | 4 | "회사원" | 5723 |
399 | 44075 | "블로그" | 2020-05-03 | 5 | "어린이" | 5241 |