Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 199 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 11.4 KiB |
Average record size in memory | 58.7 B |
Variable types
DateTime | 1 |
---|---|
Text | 1 |
Categorical | 3 |
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | 두잉랩 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=DLADATA202008 |
2020-01-01 has constant value "" | Constant |
1 is highly overall correlated with 5.797101 | High correlation |
5.797101 is highly overall correlated with 1 and 3 other fields | High correlation |
4 is highly overall correlated with 5.797101 and 2 other fields | High correlation |
[40-49] is highly overall correlated with 5.797101 and 2 other fields | High correlation |
[18-20] is highly overall correlated with 5.797101 and 2 other fields | High correlation |
4 is highly imbalanced (64.8%) | Imbalance |
[40-49] is highly imbalanced (72.5%) | Imbalance |
Reproduction
Analysis started | 2023-12-10 06:25:18.973308 |
---|---|
Analysis finished | 2023-12-10 06:25:20.741424 |
Duration | 1.77 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
2020-01-01
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Minimum | 2020-01-01 00:00:00 |
---|---|
Maximum | 2020-01-01 00:00:00 |
배추김치
Text
Distinct | 160 |
---|---|
Distinct (%) | 80.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Value | Count | Frequency (%) |
아메리카노 | 6 | 2.5% |
고구마(찐것 | 4 | 1.7% |
배추김치 | 4 | 1.7% |
커피믹스 | 3 | 1.3% |
멸치볶음 | 3 | 1.3% |
떡만둣국 | 3 | 1.3% |
귤 | 3 | 1.3% |
인스턴트 | 3 | 1.3% |
채소샐러드 | 3 | 1.3% |
쌀밥 | 3 | 1.3% |
Other values (183) | 203 |
Most occurring characters
Value | Count | Frequency (%) |
238 | 19.6% | |
치 | 28 | 2.3% |
( | 23 | 1.9% |
) | 22 | 1.8% |
이 | 20 | 1.6% |
김 | 20 | 1.6% |
스 | 17 | 1.4% |
드 | 17 | 1.4% |
카 | 15 | 1.2% |
아 | 15 | 1.2% |
Other values (270) | 800 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 905 | |
Space Separator | 238 | 19.6% |
Open Punctuation | 23 | 1.9% |
Close Punctuation | 22 | 1.8% |
Decimal Number | 10 | 0.8% |
Uppercase Letter | 9 | 0.7% |
Lowercase Letter | 6 | 0.5% |
Other Punctuation | 2 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
치 | 28 | 3.1% |
이 | 20 | 2.2% |
김 | 20 | 2.2% |
스 | 17 | 1.9% |
드 | 17 | 1.9% |
카 | 15 | 1.7% |
아 | 15 | 1.7% |
구 | 15 | 1.7% |
리 | 14 | 1.5% |
밥 | 13 | 1.4% |
Other values (247) | 731 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 2 | |
T | 1 | |
A | 1 | |
P | 1 | |
I | 1 | |
G | 1 | |
J | 1 | |
C | 1 |
Decimal Number
Value | Count | Frequency (%) |
0 | 3 | |
5 | 3 | |
1 | 1 | 10.0% |
2 | 1 | 10.0% |
7 | 1 | 10.0% |
3 | 1 | 10.0% |
Lowercase Letter
Value | Count | Frequency (%) |
o | 2 | |
l | 1 | |
m | 1 | |
e | 1 | |
s | 1 |
Space Separator
Value | Count | Frequency (%) |
238 |
Open Punctuation
Value | Count | Frequency (%) |
( | 23 |
Close Punctuation
Value | Count | Frequency (%) |
) | 22 |
Other Punctuation
Value | Count | Frequency (%) |
% | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 905 | |
Common | 295 | 24.3% |
Latin | 15 | 1.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
치 | 28 | 3.1% |
이 | 20 | 2.2% |
김 | 20 | 2.2% |
스 | 17 | 1.9% |
드 | 17 | 1.9% |
카 | 15 | 1.7% |
아 | 15 | 1.7% |
구 | 15 | 1.7% |
리 | 14 | 1.5% |
밥 | 13 | 1.4% |
Other values (247) | 731 |
Latin
Value | Count | Frequency (%) |
o | 2 | |
S | 2 | |
l | 1 | 6.7% |
m | 1 | 6.7% |
T | 1 | 6.7% |
A | 1 | 6.7% |
P | 1 | 6.7% |
I | 1 | 6.7% |
e | 1 | 6.7% |
s | 1 | 6.7% |
Other values (3) | 3 |
Common
Value | Count | Frequency (%) |
238 | ||
( | 23 | 7.8% |
) | 22 | 7.5% |
0 | 3 | 1.0% |
5 | 3 | 1.0% |
% | 2 | 0.7% |
1 | 1 | 0.3% |
2 | 1 | 0.3% |
7 | 1 | 0.3% |
3 | 1 | 0.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 905 | |
ASCII | 310 | 25.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
238 | ||
( | 23 | 7.4% |
) | 22 | 7.1% |
0 | 3 | 1.0% |
5 | 3 | 1.0% |
o | 2 | 0.6% |
S | 2 | 0.6% |
% | 2 | 0.6% |
l | 1 | 0.3% |
1 | 1 | 0.3% |
Other values (13) | 13 | 4.2% |
Hangul
Value | Count | Frequency (%) |
치 | 28 | 3.1% |
이 | 20 | 2.2% |
김 | 20 | 2.2% |
스 | 17 | 1.9% |
드 | 17 | 1.9% |
카 | 15 | 1.7% |
아 | 15 | 1.7% |
구 | 15 | 1.7% |
리 | 14 | 1.5% |
밥 | 13 | 1.4% |
Other values (247) | 731 |
4
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 12 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
1 | |
---|---|
2 | |
3 | 7 |
5 | 2 |
4 | 2 |
Other values (7) | 7 |
Length
Max length | 5 |
---|---|
Median length | 2 |
Mean length | 2.0452261 |
Min length | 2 |
Unique
Unique | 7 ? |
---|---|
Unique (%) | 3.5% |
Sample
1st row | 튀김) |
---|---|
2nd row | 3 |
3rd row | 3 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
1 | 153 | |
2 | 28 | 14.1% |
3 | 7 | 3.5% |
5 | 2 | 1.0% |
4 | 2 | 1.0% |
튀김) | 1 | 0.5% |
냉동 | 1 | 0.5% |
6 | 1 | 0.5% |
9 | 1 | 0.5% |
바닐라맛 | 1 | 0.5% |
Other values (2) | 2 | 1.0% |
Length
Value | Count | Frequency (%) |
1 | 153 | |
2 | 28 | 14.1% |
3 | 7 | 3.5% |
5 | 2 | 1.0% |
4 | 2 | 1.0% |
튀김 | 1 | 0.5% |
냉동 | 1 | 0.5% |
6 | 1 | 0.5% |
9 | 1 | 0.5% |
바닐라맛 | 1 | 0.5% |
Other values (2) | 2 | 1.0% |
1
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 67 |
---|---|
Distinct (%) | 33.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28.422111 |
Minimum | 1 |
---|---|
Maximum | 67 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 10.5 |
median | 28 |
Q3 | 44.5 |
95-th percentile | 59 |
Maximum | 67 |
Range | 66 |
Interquartile range (IQR) | 34 |
Descriptive statistics
Standard deviation | 18.934835 |
---|---|
Coefficient of variation (CV) | 0.66620089 |
Kurtosis | -1.2083884 |
Mean | 28.422111 |
Median Absolute Deviation (MAD) | 17 |
Skewness | 0.17603012 |
Sum | 5656 |
Variance | 358.52799 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 8 | 4.0% |
3 | 6 | 3.0% |
5 | 5 | 2.5% |
6 | 5 | 2.5% |
4 | 5 | 2.5% |
7 | 5 | 2.5% |
8 | 4 | 2.0% |
9 | 4 | 2.0% |
10 | 4 | 2.0% |
11 | 4 | 2.0% |
Other values (57) | 149 |
Value | Count | Frequency (%) |
1 | 8 | |
2 | 4 | |
3 | 6 | |
4 | 5 | |
5 | 5 | |
6 | 5 | |
7 | 5 | |
8 | 4 | |
9 | 4 | |
10 | 4 |
Value | Count | Frequency (%) |
67 | 1 | |
66 | 1 | |
65 | 1 | |
64 | 1 | |
63 | 1 | |
62 | 1 | |
61 | 1 | |
60 | 2 | |
59 | 2 | |
58 | 2 |
5.797101
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 23 |
---|---|
Distinct (%) | 11.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.5752467 |
Minimum | 1.075269 |
---|---|
Maximum | 33 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 1.075269 |
---|---|
5-th percentile | 1.075269 |
Q1 | 1.075269 |
median | 1.449275 |
Q3 | 2.150538 |
95-th percentile | 9.090909 |
Maximum | 33 |
Range | 31.924731 |
Interquartile range (IQR) | 1.075269 |
Descriptive statistics
Standard deviation | 3.6663051 |
---|---|
Coefficient of variation (CV) | 1.4236714 |
Kurtosis | 30.604725 |
Mean | 2.5752467 |
Median Absolute Deviation (MAD) | 0.374006 |
Skewness | 4.8526068 |
Sum | 512.4741 |
Variance | 13.441793 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1.075269 | 51 | |
1.204819 | 46 | |
1.449275 | 45 | |
9.090909 | 11 | 5.5% |
2.409639 | 10 | 5.0% |
2.150538 | 9 | 4.5% |
2.898551 | 5 | 2.5% |
3.921569 | 4 | 2.0% |
5.882353 | 2 | 1.0% |
3.614458 | 2 | 1.0% |
Other values (13) | 14 | 7.0% |
Value | Count | Frequency (%) |
1.075269 | 51 | |
1.204819 | 46 | |
1.449275 | 45 | |
2.0 | 1 | 0.5% |
2.150538 | 9 | 4.5% |
2.409639 | 10 | 5.0% |
2.898551 | 5 | 2.5% |
3.225806 | 1 | 0.5% |
3.614458 | 2 | 1.0% |
3.921569 | 4 | 2.0% |
Value | Count | Frequency (%) |
33.0 | 1 | 0.5% |
24.0 | 1 | 0.5% |
17.0 | 1 | 0.5% |
16.0 | 1 | 0.5% |
9.677419 | 1 | 0.5% |
9.090909 | 11 | |
7.843137 | 1 | 0.5% |
7.228916 | 1 | 0.5% |
6.024096 | 1 | 0.5% |
5.882353 | 2 | 1.0% |
[40-49]
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
[40-49] | |
---|---|
[50-59] | |
1.075269 | 3 |
4.347826 | 1 |
1.449275 | 1 |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 8.0251256 |
Min length | 8 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | 4.347826 |
---|---|
2nd row | [40-49] |
3rd row | [40-49] |
4th row | [40-49] |
5th row | [40-49] |
Common Values
Value | Count | Frequency (%) |
[40-49] | 176 | |
[50-59] | 18 | 9.0% |
1.075269 | 3 | 1.5% |
4.347826 | 1 | 0.5% |
1.449275 | 1 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
40-49 | 176 | |
50-59 | 18 | 9.0% |
1.075269 | 3 | 1.5% |
4.347826 | 1 | 0.5% |
1.449275 | 1 | 0.5% |
[18-20]
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
[22-24] | |
---|---|
[20-22] | |
[18-20] | |
[40-49] | 5 |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | [40-49] |
---|---|
2nd row | [18-20] |
3rd row | [18-20] |
4th row | [18-20] |
5th row | [18-20] |
Common Values
Value | Count | Frequency (%) |
[22-24] | 67 | |
[20-22] | 64 | |
[18-20] | 63 | |
[40-49] | 5 | 2.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
22-24 | 67 | |
20-22 | 64 | |
18-20 | 63 | |
40-49 | 5 | 2.5% |
4 | 1 | 5.797101 | [40-49] | [18-20] | |
---|---|---|---|---|---|
4 | 1.000 | 0.458 | 0.992 | 0.939 | 0.861 |
1 | 0.458 | 1.000 | 0.393 | 0.497 | 0.279 |
5.797101 | 0.992 | 0.393 | 1.000 | 0.877 | 0.696 |
[40-49] | 0.939 | 0.497 | 0.877 | 1.000 | 0.653 |
[18-20] | 0.861 | 0.279 | 0.696 | 0.653 | 1.000 |
[18-20] | [40-49] | 4 | |
---|---|---|---|
[18-20] | 1.000 | 0.582 | 0.552 |
[40-49] | 0.582 | 1.000 | 0.849 |
4 | 0.552 | 0.849 | 1.000 |
1 | 5.797101 | 4 | [40-49] | [18-20] | |
---|---|---|---|---|---|
1 | 1.000 | -0.722 | 0.210 | 0.225 | 0.167 |
5.797101 | -0.722 | 1.000 | 0.854 | 0.801 | 0.525 |
4 | 0.210 | 0.854 | 1.000 | 0.849 | 0.552 |
[40-49] | 0.225 | 0.801 | 0.849 | 1.000 | 0.582 |
[18-20] | 0.167 | 0.525 | 0.552 | 0.582 | 1.000 |
2020-01-01 | 배추김치 | 4 | 1 | 5.797101 | [40-49] | [18-20] | |
---|---|---|---|---|---|---|---|
0 | 2020-01-01 | 닭고기(통닭 | 튀김) | 3 | 2.0 | 4.347826 | [40-49] |
1 | 2020-01-01 | 닭튀김(양념소스) | 3 | 3 | 4.347826 | [40-49] | [18-20] |
2 | 2020-01-01 | 감 | 3 | 4 | 4.347826 | [40-49] | [18-20] |
3 | 2020-01-01 | 잡곡밥 | 2 | 5 | 2.898551 | [40-49] | [18-20] |
4 | 2020-01-01 | 다래 | 2 | 6 | 2.898551 | [40-49] | [18-20] |
5 | 2020-01-01 | 떡국 | 2 | 7 | 2.898551 | [40-49] | [18-20] |
6 | 2020-01-01 | 계란과자 | 2 | 8 | 2.898551 | [40-49] | [18-20] |
7 | 2020-01-01 | 치킨무 | 2 | 9 | 2.898551 | [40-49] | [18-20] |
8 | 2020-01-01 | 맛동산 | 1 | 10 | 1.449275 | [40-49] | [18-20] |
9 | 2020-01-01 | 두부 | 1 | 11 | 1.449275 | [40-49] | [18-20] |
2020-01-01 | 배추김치 | 4 | 1 | 5.797101 | [40-49] | [18-20] | |
---|---|---|---|---|---|---|---|
189 | 2020-01-01 | 라떼 | 1 | 9 | 9.090909 | [50-59] | [18-20] |
190 | 2020-01-01 | 동지팥죽 | 1 | 10 | 9.090909 | [50-59] | [18-20] |
191 | 2020-01-01 | 배추김치 | 1 | 11 | 9.090909 | [50-59] | [18-20] |
192 | 2020-01-01 | 배추김치 | 4 | 1 | 7.843137 | [50-59] | [22-24] |
193 | 2020-01-01 | 아메리카노 | 3 | 2 | 5.882353 | [50-59] | [22-24] |
194 | 2020-01-01 | 고구마(찐것) | 3 | 3 | 5.882353 | [50-59] | [22-24] |
195 | 2020-01-01 | 채소샐러드 | 2 | 4 | 3.921569 | [50-59] | [22-24] |
196 | 2020-01-01 | 계란찜 | 2 | 5 | 3.921569 | [50-59] | [22-24] |
197 | 2020-01-01 | 골드키위주스 | 2 | 6 | 3.921569 | [50-59] | [22-24] |
198 | 2020-01-01 | 멸치볶음 | 2 | 7 | 3.921569 | [50-59] | [22-24] |