Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 400 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 20.1 KiB |
Average record size in memory | 51.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 2 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 코난테크놀로지 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TPOCOLOR |
"채널값" has constant value "" | Constant |
"기본키값" is highly overall correlated with "해당일자" | High correlation |
"차례값" is highly overall correlated with "건수값" | High correlation |
"건수값" is highly overall correlated with "차례값" | High correlation |
"해당일자" is highly overall correlated with "기본키값" | High correlation |
"기본키값" has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 06:39:16.993835 |
---|---|
Analysis finished | 2023-12-10 06:39:19.116367 |
Duration | 2.12 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
"기본키값"
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 400 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40601.59 |
Minimum | 15802 |
---|---|
Maximum | 65155 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 15802 |
---|---|
5-th percentile | 15821.95 |
Q1 | 31711.75 |
median | 47693.5 |
Q3 | 65055.25 |
95-th percentile | 65135.05 |
Maximum | 65155 |
Range | 49353 |
Interquartile range (IQR) | 33343.5 |
Descriptive statistics
Standard deviation | 18298.247 |
---|---|
Coefficient of variation (CV) | 0.45067808 |
Kurtosis | -1.3425882 |
Mean | 40601.59 |
Median Absolute Deviation (MAD) | 15985 |
Skewness | 0.011495093 |
Sum | 16240636 |
Variance | 3.3482583 × 108 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
15802 | 1 | 0.2% |
47758 | 1 | 0.2% |
47768 | 1 | 0.2% |
47767 | 1 | 0.2% |
47766 | 1 | 0.2% |
47765 | 1 | 0.2% |
47764 | 1 | 0.2% |
47763 | 1 | 0.2% |
47762 | 1 | 0.2% |
47761 | 1 | 0.2% |
Other values (390) | 390 |
Value | Count | Frequency (%) |
15802 | 1 | |
15803 | 1 | |
15804 | 1 | |
15805 | 1 | |
15806 | 1 | |
15807 | 1 | |
15808 | 1 | |
15809 | 1 | |
15810 | 1 | |
15811 | 1 |
Value | Count | Frequency (%) |
65155 | 1 | |
65154 | 1 | |
65153 | 1 | |
65152 | 1 | |
65151 | 1 | |
65150 | 1 | |
65149 | 1 | |
65148 | 1 | |
65147 | 1 | |
65146 | 1 |
"채널값"
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
"블로그" |
---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | "블로그" |
---|---|
2nd row | "블로그" |
3rd row | "블로그" |
4th row | "블로그" |
5th row | "블로그" |
Common Values
Value | Count | Frequency (%) |
"블로그" | 400 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
블로그 | 400 |
"해당일자"
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
2020-05-04 | |
---|---|
2020-05-03 | |
2020-05-02 | |
2020-05-01 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-05-01 |
---|---|
2nd row | 2020-05-01 |
3rd row | 2020-05-01 |
4th row | 2020-05-01 |
5th row | 2020-05-01 |
Common Values
Value | Count | Frequency (%) |
2020-05-04 | 103 | |
2020-05-03 | 102 | |
2020-05-02 | 99 | |
2020-05-01 | 96 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-05-04 | 103 | |
2020-05-03 | 102 | |
2020-05-02 | 99 | |
2020-05-01 | 96 |
"차례값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 103 |
---|---|
Distinct (%) | 25.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50.5375 |
Minimum | 1 |
---|---|
Maximum | 103 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5.95 |
Q1 | 25.75 |
median | 50.5 |
Q3 | 75.25 |
95-th percentile | 95.05 |
Maximum | 103 |
Range | 102 |
Interquartile range (IQR) | 49.5 |
Descriptive statistics
Standard deviation | 28.966727 |
---|---|
Coefficient of variation (CV) | 0.57317293 |
Kurtosis | -1.1898357 |
Mean | 50.5375 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 0.0076089644 |
Sum | 20215 |
Variance | 839.07127 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 4 | 1.0% |
50 | 4 | 1.0% |
72 | 4 | 1.0% |
71 | 4 | 1.0% |
70 | 4 | 1.0% |
69 | 4 | 1.0% |
68 | 4 | 1.0% |
67 | 4 | 1.0% |
66 | 4 | 1.0% |
65 | 4 | 1.0% |
Other values (93) | 360 |
Value | Count | Frequency (%) |
1 | 4 | |
2 | 4 | |
3 | 4 | |
4 | 4 | |
5 | 4 | |
6 | 4 | |
7 | 4 | |
8 | 4 | |
9 | 4 | |
10 | 4 |
Value | Count | Frequency (%) |
103 | 1 | 0.2% |
102 | 2 | |
101 | 2 | |
100 | 2 | |
99 | 3 | |
98 | 3 | |
97 | 3 | |
96 | 4 | |
95 | 4 | |
94 | 4 |
"이슈어값"
Text
Distinct | 108 |
---|---|
Distinct (%) | 27.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Value | Count | Frequency (%) |
검정색 | 4 | 1.0% |
울트라바이올렛 | 4 | 1.0% |
쑥색 | 4 | 1.0% |
에머랄드색 | 4 | 1.0% |
스틸그레이 | 4 | 1.0% |
초콜렛색 | 4 | 1.0% |
체리토마토 | 4 | 1.0% |
군청색 | 4 | 1.0% |
회갈색 | 4 | 1.0% |
와인레드 | 4 | 1.0% |
Other values (98) | 360 |
Most occurring characters
Value | Count | Frequency (%) |
" | 800 | |
색 | 296 | 13.8% |
이 | 52 | 2.4% |
레 | 35 | 1.6% |
갈 | 27 | 1.3% |
라 | 22 | 1.0% |
크 | 20 | 0.9% |
리 | 19 | 0.9% |
드 | 19 | 0.9% |
코 | 19 | 0.9% |
Other values (132) | 833 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1342 | |
Other Punctuation | 800 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
색 | 296 | 22.1% |
이 | 52 | 3.9% |
레 | 35 | 2.6% |
갈 | 27 | 2.0% |
라 | 22 | 1.6% |
크 | 20 | 1.5% |
리 | 19 | 1.4% |
드 | 19 | 1.4% |
코 | 19 | 1.4% |
청 | 19 | 1.4% |
Other values (131) | 814 |
Other Punctuation
Value | Count | Frequency (%) |
" | 800 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1342 | |
Common | 800 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
색 | 296 | 22.1% |
이 | 52 | 3.9% |
레 | 35 | 2.6% |
갈 | 27 | 2.0% |
라 | 22 | 1.6% |
크 | 20 | 1.5% |
리 | 19 | 1.4% |
드 | 19 | 1.4% |
코 | 19 | 1.4% |
청 | 19 | 1.4% |
Other values (131) | 814 |
Common
Value | Count | Frequency (%) |
" | 800 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1342 | |
ASCII | 800 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
" | 800 |
Hangul
Value | Count | Frequency (%) |
색 | 296 | 22.1% |
이 | 52 | 3.9% |
레 | 35 | 2.6% |
갈 | 27 | 2.0% |
라 | 22 | 1.6% |
크 | 20 | 1.5% |
리 | 19 | 1.4% |
드 | 19 | 1.4% |
코 | 19 | 1.4% |
청 | 19 | 1.4% |
Other values (131) | 814 |
"건수값"
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 167 |
---|---|
Distinct (%) | 41.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 683.6925 |
Minimum | 1 |
---|---|
Maximum | 23742 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 5 |
median | 21 |
Q3 | 94.75 |
95-th percentile | 2583.2 |
Maximum | 23742 |
Range | 23741 |
Interquartile range (IQR) | 89.75 |
Descriptive statistics
Standard deviation | 2595.2291 |
---|---|
Coefficient of variation (CV) | 3.7959011 |
Kurtosis | 44.072311 |
Mean | 683.6925 |
Median Absolute Deviation (MAD) | 19 |
Skewness | 6.2117494 |
Sum | 273477 |
Variance | 6735214 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 31 | 7.8% |
3 | 23 | 5.8% |
2 | 21 | 5.2% |
5 | 17 | 4.2% |
4 | 15 | 3.8% |
6 | 10 | 2.5% |
18 | 9 | 2.2% |
7 | 9 | 2.2% |
8 | 9 | 2.2% |
12 | 7 | 1.8% |
Other values (157) | 249 |
Value | Count | Frequency (%) |
1 | 31 | |
2 | 21 | |
3 | 23 | |
4 | 15 | |
5 | 17 | |
6 | 10 | 2.5% |
7 | 9 | 2.2% |
8 | 9 | 2.2% |
9 | 5 | 1.2% |
10 | 6 | 1.5% |
Value | Count | Frequency (%) |
23742 | 1 | |
22558 | 1 | |
20511 | 1 | |
19361 | 1 | |
11702 | 1 | |
9945 | 1 | |
9147 | 1 | |
8943 | 1 | |
8785 | 1 | |
8686 | 1 |
"기본키값" | "해당일자" | "차례값" | "건수값" | |
---|---|---|---|---|
"기본키값" | 1.000 | 1.000 | 0.000 | 0.000 |
"해당일자" | 1.000 | 1.000 | 0.000 | 0.000 |
"차례값" | 0.000 | 0.000 | 1.000 | 0.438 |
"건수값" | 0.000 | 0.000 | 0.438 | 1.000 |
"기본키값" | "차례값" | "건수값" | "해당일자" | |
---|---|---|---|---|
"기본키값" | 1.000 | 0.294 | -0.292 | 1.000 |
"차례값" | 0.294 | 1.000 | -0.998 | 0.000 |
"건수값" | -0.292 | -0.998 | 1.000 | 0.000 |
"해당일자" | 1.000 | 0.000 | 0.000 | 1.000 |
"기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값" | |
---|---|---|---|---|---|---|
0 | 15802 | "블로그" | 2020-05-01 | 1 | "검정색" | 23742 |
1 | 15803 | "블로그" | 2020-05-01 | 2 | "파랑" | 11702 |
2 | 15804 | "블로그" | 2020-05-01 | 3 | "빨강" | 8943 |
3 | 15805 | "블로그" | 2020-05-01 | 4 | "흰색" | 7341 |
4 | 15806 | "블로그" | 2020-05-01 | 5 | "초록" | 4025 |
5 | 15807 | "블로그" | 2020-05-01 | 6 | "베이지" | 2533 |
6 | 15808 | "블로그" | 2020-05-01 | 7 | "노랑색" | 2216 |
7 | 15809 | "블로그" | 2020-05-01 | 8 | "분홍색" | 1688 |
8 | 15810 | "블로그" | 2020-05-01 | 9 | "회색" | 1687 |
9 | 15811 | "블로그" | 2020-05-01 | 10 | "갈색" | 1660 |
"기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값" | |
---|---|---|---|---|---|---|
390 | 65146 | "블로그" | 2020-05-04 | 94 | "딸기색" | 2 |
391 | 65147 | "블로그" | 2020-05-04 | 95 | "슬레이트그레이" | 2 |
392 | 65148 | "블로그" | 2020-05-04 | 96 | "라피스블루" | 1 |
393 | 65149 | "블로그" | 2020-05-04 | 97 | "포도색" | 1 |
394 | 65150 | "블로그" | 2020-05-04 | 98 | "헤이즐넛색" | 1 |
395 | 65151 | "블로그" | 2020-05-04 | 99 | "금갈색" | 1 |
396 | 65152 | "블로그" | 2020-05-04 | 100 | "코르크색" | 1 |
397 | 65153 | "블로그" | 2020-05-04 | 101 | "스노화이트" | 1 |
398 | 65154 | "블로그" | 2020-05-04 | 102 | "샐먼핑크" | 1 |
399 | 65155 | "블로그" | 2020-05-04 | 103 | "진남색" | 1 |