Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 400 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 20.1 KiB |
Average record size in memory | 51.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 1 |
DateTime | 1 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 코난테크놀로지 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TPOBRAND |
"채널값" has constant value "" | Constant |
"해당일자" has constant value "" | Constant |
"기본키값" is highly overall correlated with "차례값" and 1 other fields | High correlation |
"차례값" is highly overall correlated with "기본키값" and 1 other fields | High correlation |
"건수값" is highly overall correlated with "기본키값" and 1 other fields | High correlation |
"기본키값" has unique values | Unique |
"차례값" has unique values | Unique |
"이슈어값" has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 06:16:16.403603 |
---|---|
Analysis finished | 2023-12-10 06:16:18.537404 |
Duration | 2.13 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
"기본키값"
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 400 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14707.5 |
Minimum | 14508 |
---|---|
Maximum | 14907 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 14508 |
---|---|
5-th percentile | 14527.95 |
Q1 | 14607.75 |
median | 14707.5 |
Q3 | 14807.25 |
95-th percentile | 14887.05 |
Maximum | 14907 |
Range | 399 |
Interquartile range (IQR) | 199.5 |
Descriptive statistics
Standard deviation | 115.6143 |
---|---|
Coefficient of variation (CV) | 0.0078609078 |
Kurtosis | -1.2 |
Mean | 14707.5 |
Median Absolute Deviation (MAD) | 100 |
Skewness | 0 |
Sum | 5883000 |
Variance | 13366.667 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
14508 | 1 | 0.2% |
14772 | 1 | 0.2% |
14782 | 1 | 0.2% |
14781 | 1 | 0.2% |
14780 | 1 | 0.2% |
14779 | 1 | 0.2% |
14778 | 1 | 0.2% |
14777 | 1 | 0.2% |
14776 | 1 | 0.2% |
14775 | 1 | 0.2% |
Other values (390) | 390 |
Value | Count | Frequency (%) |
14508 | 1 | |
14509 | 1 | |
14510 | 1 | |
14511 | 1 | |
14512 | 1 | |
14513 | 1 | |
14514 | 1 | |
14515 | 1 | |
14516 | 1 | |
14517 | 1 |
Value | Count | Frequency (%) |
14907 | 1 | |
14906 | 1 | |
14905 | 1 | |
14904 | 1 | |
14903 | 1 | |
14902 | 1 | |
14901 | 1 | |
14900 | 1 | |
14899 | 1 | |
14898 | 1 |
"채널값"
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
"블로그" |
---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | "블로그" |
---|---|
2nd row | "블로그" |
3rd row | "블로그" |
4th row | "블로그" |
5th row | "블로그" |
Common Values
Value | Count | Frequency (%) |
"블로그" | 400 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
블로그 | 400 |
"해당일자"
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Minimum | 2020-05-01 00:00:00 |
---|---|
Maximum | 2020-05-01 00:00:00 |
"차례값"
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 400 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 200.5 |
Minimum | 1 |
---|---|
Maximum | 400 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 20.95 |
Q1 | 100.75 |
median | 200.5 |
Q3 | 300.25 |
95-th percentile | 380.05 |
Maximum | 400 |
Range | 399 |
Interquartile range (IQR) | 199.5 |
Descriptive statistics
Standard deviation | 115.6143 |
---|---|
Coefficient of variation (CV) | 0.57662993 |
Kurtosis | -1.2 |
Mean | 200.5 |
Median Absolute Deviation (MAD) | 100 |
Skewness | 0 |
Sum | 80200 |
Variance | 13366.667 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.2% |
265 | 1 | 0.2% |
275 | 1 | 0.2% |
274 | 1 | 0.2% |
273 | 1 | 0.2% |
272 | 1 | 0.2% |
271 | 1 | 0.2% |
270 | 1 | 0.2% |
269 | 1 | 0.2% |
268 | 1 | 0.2% |
Other values (390) | 390 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
400 | 1 | |
399 | 1 | |
398 | 1 | |
397 | 1 | |
396 | 1 | |
395 | 1 | |
394 | 1 | |
393 | 1 | |
392 | 1 | |
391 | 1 |
"이슈어값"
Text
UNIQUE
 
Distinct | 400 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Value | Count | Frequency (%) |
쿠팡 | 1 | 0.2% |
다시다 | 1 | 0.2% |
뉴코아 | 1 | 0.2% |
커피빈 | 1 | 0.2% |
넥서스 | 1 | 0.2% |
롯데호텔 | 1 | 0.2% |
로젠택배 | 1 | 0.2% |
불스원 | 1 | 0.2% |
바디프랜드 | 1 | 0.2% |
녹십자 | 1 | 0.2% |
Other values (390) | 390 |
Most occurring characters
Value | Count | Frequency (%) |
" | 800 | |
스 | 85 | 3.8% |
이 | 48 | 2.1% |
리 | 38 | 1.7% |
아 | 26 | 1.2% |
라 | 23 | 1.0% |
로 | 22 | 1.0% |
레 | 20 | 0.9% |
트 | 18 | 0.8% |
카 | 18 | 0.8% |
Other values (387) | 1157 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1359 | |
Other Punctuation | 801 | |
Lowercase Letter | 86 | 3.8% |
Decimal Number | 9 | 0.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
스 | 85 | 6.3% |
이 | 48 | 3.5% |
리 | 38 | 2.8% |
아 | 26 | 1.9% |
라 | 23 | 1.7% |
로 | 22 | 1.6% |
레 | 20 | 1.5% |
트 | 18 | 1.3% |
카 | 18 | 1.3% |
오 | 17 | 1.3% |
Other values (359) | 1044 |
Lowercase Letter
Value | Count | Frequency (%) |
k | 10 | |
g | 9 | |
b | 8 | 9.3% |
c | 7 | 8.1% |
e | 7 | 8.1% |
l | 6 | 7.0% |
s | 6 | 7.0% |
m | 5 | 5.8% |
h | 4 | 4.7% |
t | 4 | 4.7% |
Other values (12) | 20 |
Decimal Number
Value | Count | Frequency (%) |
2 | 4 | |
1 | 2 | |
4 | 2 | |
5 | 1 | 11.1% |
Other Punctuation
Value | Count | Frequency (%) |
" | 800 | |
& | 1 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1359 | |
Common | 810 | |
Latin | 86 | 3.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
스 | 85 | 6.3% |
이 | 48 | 3.5% |
리 | 38 | 2.8% |
아 | 26 | 1.9% |
라 | 23 | 1.7% |
로 | 22 | 1.6% |
레 | 20 | 1.5% |
트 | 18 | 1.3% |
카 | 18 | 1.3% |
오 | 17 | 1.3% |
Other values (359) | 1044 |
Latin
Value | Count | Frequency (%) |
k | 10 | |
g | 9 | |
b | 8 | 9.3% |
c | 7 | 8.1% |
e | 7 | 8.1% |
l | 6 | 7.0% |
s | 6 | 7.0% |
m | 5 | 5.8% |
h | 4 | 4.7% |
t | 4 | 4.7% |
Other values (12) | 20 |
Common
Value | Count | Frequency (%) |
" | 800 | |
2 | 4 | 0.5% |
1 | 2 | 0.2% |
4 | 2 | 0.2% |
& | 1 | 0.1% |
5 | 1 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1359 | |
ASCII | 896 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
" | 800 | |
k | 10 | 1.1% |
g | 9 | 1.0% |
b | 8 | 0.9% |
c | 7 | 0.8% |
e | 7 | 0.8% |
l | 6 | 0.7% |
s | 6 | 0.7% |
m | 5 | 0.6% |
h | 4 | 0.4% |
Other values (18) | 34 | 3.8% |
Hangul
Value | Count | Frequency (%) |
스 | 85 | 6.3% |
이 | 48 | 3.5% |
리 | 38 | 2.8% |
아 | 26 | 1.9% |
라 | 23 | 1.7% |
로 | 22 | 1.6% |
레 | 20 | 1.5% |
트 | 18 | 1.3% |
카 | 18 | 1.3% |
오 | 17 | 1.3% |
Other values (359) | 1044 |
"건수값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 258 |
---|---|
Distinct (%) | 64.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 666.5975 |
Minimum | 49 |
---|---|
Maximum | 51100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 49 |
---|---|
5-th percentile | 53 |
Q1 | 79 |
median | 149.5 |
Q3 | 364.5 |
95-th percentile | 2188 |
Maximum | 51100 |
Range | 51051 |
Interquartile range (IQR) | 285.5 |
Descriptive statistics
Standard deviation | 2941.1778 |
---|---|
Coefficient of variation (CV) | 4.4122245 |
Kurtosis | 221.69218 |
Mean | 666.5975 |
Median Absolute Deviation (MAD) | 86.5 |
Skewness | 13.684745 |
Sum | 266639 |
Variance | 8650526.8 |
Monotonicity | Decreasing |
Value | Count | Frequency (%) |
49 | 8 | 2.0% |
94 | 7 | 1.8% |
78 | 6 | 1.5% |
66 | 6 | 1.5% |
71 | 5 | 1.2% |
70 | 5 | 1.2% |
133 | 5 | 1.2% |
89 | 4 | 1.0% |
62 | 4 | 1.0% |
77 | 4 | 1.0% |
Other values (248) | 346 |
Value | Count | Frequency (%) |
49 | 8 | |
50 | 2 | 0.5% |
51 | 4 | |
52 | 4 | |
53 | 4 | |
54 | 4 | |
55 | 3 | 0.8% |
56 | 4 | |
57 | 2 | 0.5% |
58 | 4 |
Value | Count | Frequency (%) |
51100 | 1 | |
16559 | 1 | |
15636 | 1 | |
12546 | 1 | |
8008 | 1 | |
7739 | 1 | |
3647 | 1 | |
3394 | 1 | |
3381 | 1 | |
3350 | 1 |
"기본키값" | "차례값" | "건수값" | |
---|---|---|---|
"기본키값" | 1.000 | 1.000 | 0.215 |
"차례값" | 1.000 | 1.000 | 0.257 |
"건수값" | 0.215 | 0.257 | 1.000 |
"기본키값" | "차례값" | "건수값" | |
---|---|---|---|
"기본키값" | 1.000 | 1.000 | -1.000 |
"차례값" | 1.000 | 1.000 | -1.000 |
"건수값" | -1.000 | -1.000 | 1.000 |
"기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값" | |
---|---|---|---|---|---|---|
0 | 14508 | "블로그" | 2020-05-01 | 1 | "쿠팡" | 51100 |
1 | 14509 | "블로그" | 2020-05-01 | 2 | "지방시" | 16559 |
2 | 14510 | "블로그" | 2020-05-01 | 3 | "카카오" | 15636 |
3 | 14511 | "블로그" | 2020-05-01 | 4 | "푸마" | 12546 |
4 | 14512 | "블로그" | 2020-05-01 | 5 | "퓨마" | 8008 |
5 | 14513 | "블로그" | 2020-05-01 | 6 | "엘르" | 7739 |
6 | 14514 | "블로그" | 2020-05-01 | 7 | "아이폰" | 3647 |
7 | 14515 | "블로그" | 2020-05-01 | 8 | "까르띠에" | 3394 |
8 | 14516 | "블로그" | 2020-05-01 | 9 | "펩시콜라" | 3381 |
9 | 14517 | "블로그" | 2020-05-01 | 10 | "애플" | 3350 |
"기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값" | |
---|---|---|---|---|---|---|
390 | 14898 | "블로그" | 2020-05-01 | 391 | "일룸" | 50 |
391 | 14899 | "블로그" | 2020-05-01 | 392 | "레모나" | 50 |
392 | 14900 | "블로그" | 2020-05-01 | 393 | "일동후디스" | 49 |
393 | 14901 | "블로그" | 2020-05-01 | 394 | "tg삼보" | 49 |
394 | 14902 | "블로그" | 2020-05-01 | 395 | "에스티로더" | 49 |
395 | 14903 | "블로그" | 2020-05-01 | 396 | "sk와이번스" | 49 |
396 | 14904 | "블로그" | 2020-05-01 | 397 | "성심당" | 49 |
397 | 14905 | "블로그" | 2020-05-01 | 398 | "persil" | 49 |
398 | 14906 | "블로그" | 2020-05-01 | 399 | "모닝글로리" | 49 |
399 | 14907 | "블로그" | 2020-05-01 | 400 | "롯데자이언츠" | 49 |