Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 199 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 11.4 KiB |
Average record size in memory | 58.7 B |
Variable types
Categorical | 4 |
---|---|
Text | 1 |
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | 두잉랩 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=DLADATA202006 |
2020-01-01 has constant value "" | Constant |
1 is highly overall correlated with 9.090909 | High correlation |
9.090909 is highly overall correlated with 1 and 2 other fields | High correlation |
3 is highly overall correlated with 9.090909 and 2 other fields | High correlation |
[20-29] is highly overall correlated with 9.090909 and 2 other fields | High correlation |
[22-24] is highly overall correlated with 3 and 1 other fields | High correlation |
3 is highly imbalanced (67.9%) | Imbalance |
Reproduction
Analysis started | 2023-12-10 06:32:07.372680 |
---|---|
Analysis finished | 2023-12-10 06:32:08.952698 |
Duration | 1.58 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
2020-01-01
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
2020-01-01 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-01-01 |
---|---|
2nd row | 2020-01-01 |
3rd row | 2020-01-01 |
4th row | 2020-01-01 |
5th row | 2020-01-01 |
Common Values
Value | Count | Frequency (%) |
2020-01-01 | 199 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-01-01 | 199 |
떡국
Text
Distinct | 148 |
---|---|
Distinct (%) | 74.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Value | Count | Frequency (%) |
배추김치 | 8 | 3.8% |
쌀밥 | 6 | 2.8% |
떡국 | 5 | 2.3% |
흑미밥 | 4 | 1.9% |
소등심(구운것 | 4 | 1.9% |
라면 | 4 | 1.9% |
떡만둣국 | 4 | 1.9% |
귤 | 4 | 1.9% |
계란후라이 | 3 | 1.4% |
갈비탕 | 2 | 0.9% |
Other values (149) | 169 |
Most occurring characters
Value | Count | Frequency (%) |
213 | 20.1% | |
치 | 29 | 2.7% |
김 | 23 | 2.2% |
이 | 20 | 1.9% |
( | 20 | 1.9% |
밥 | 19 | 1.8% |
) | 18 | 1.7% |
라 | 15 | 1.4% |
구 | 15 | 1.4% |
떡 | 14 | 1.3% |
Other values (216) | 673 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 801 | |
Space Separator | 213 | 20.1% |
Open Punctuation | 20 | 1.9% |
Close Punctuation | 18 | 1.7% |
Decimal Number | 5 | 0.5% |
Lowercase Letter | 2 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
치 | 29 | 3.6% |
김 | 23 | 2.9% |
이 | 20 | 2.5% |
밥 | 19 | 2.4% |
라 | 15 | 1.9% |
구 | 15 | 1.9% |
떡 | 14 | 1.7% |
국 | 13 | 1.6% |
것 | 12 | 1.5% |
지 | 12 | 1.5% |
Other values (208) | 629 |
Decimal Number
Value | Count | Frequency (%) |
0 | 3 | |
5 | 1 | 20.0% |
1 | 1 | 20.0% |
Lowercase Letter
Value | Count | Frequency (%) |
m | 1 | |
l | 1 |
Space Separator
Value | Count | Frequency (%) |
213 |
Open Punctuation
Value | Count | Frequency (%) |
( | 20 |
Close Punctuation
Value | Count | Frequency (%) |
) | 18 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 801 | |
Common | 256 | 24.2% |
Latin | 2 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
치 | 29 | 3.6% |
김 | 23 | 2.9% |
이 | 20 | 2.5% |
밥 | 19 | 2.4% |
라 | 15 | 1.9% |
구 | 15 | 1.9% |
떡 | 14 | 1.7% |
국 | 13 | 1.6% |
것 | 12 | 1.5% |
지 | 12 | 1.5% |
Other values (208) | 629 |
Common
Value | Count | Frequency (%) |
213 | ||
( | 20 | 7.8% |
) | 18 | 7.0% |
0 | 3 | 1.2% |
5 | 1 | 0.4% |
1 | 1 | 0.4% |
Latin
Value | Count | Frequency (%) |
m | 1 | |
l | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 801 | |
ASCII | 258 | 24.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
213 | ||
( | 20 | 7.8% |
) | 18 | 7.0% |
0 | 3 | 1.2% |
5 | 1 | 0.4% |
m | 1 | 0.4% |
l | 1 | 0.4% |
1 | 1 | 0.4% |
Hangul
Value | Count | Frequency (%) |
치 | 29 | 3.6% |
김 | 23 | 2.9% |
이 | 20 | 2.5% |
밥 | 19 | 2.4% |
라 | 15 | 1.9% |
구 | 15 | 1.9% |
떡 | 14 | 1.7% |
국 | 13 | 1.6% |
것 | 12 | 1.5% |
지 | 12 | 1.5% |
Other values (208) | 629 |
3
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 9 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
1 | |
---|---|
2 | |
3 | 5 |
튀김) | 2 |
4 | 2 |
Other values (4) | 4 |
Length
Max length | 5 |
---|---|
Median length | 2 |
Mean length | 2.0603015 |
Min length | 2 |
Unique
Unique | 4 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 3 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 163 | |
2 | 23 | 11.6% |
3 | 5 | 2.5% |
튀김) | 2 | 1.0% |
4 | 2 | 1.0% |
양념장 | 1 | 0.5% |
삶은것 | 1 | 0.5% |
도토리묵 | 1 | 0.5% |
액상 | 1 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 163 | |
2 | 23 | 11.6% |
3 | 5 | 2.5% |
튀김 | 2 | 1.0% |
4 | 2 | 1.0% |
양념장 | 1 | 0.5% |
삶은것 | 1 | 0.5% |
도토리묵 | 1 | 0.5% |
액상 | 1 | 0.5% |
1
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 44 |
---|---|
Distinct (%) | 22.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12.889447 |
Minimum | 1 |
---|---|
Maximum | 44 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 5 |
median | 11 |
Q3 | 18 |
95-th percentile | 34.1 |
Maximum | 44 |
Range | 43 |
Interquartile range (IQR) | 13 |
Descriptive statistics
Standard deviation | 10.018561 |
---|---|
Coefficient of variation (CV) | 0.77726844 |
Kurtosis | 0.71498838 |
Mean | 12.889447 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 1.0531295 |
Sum | 2565 |
Variance | 100.37155 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 14 | 7.0% |
2 | 12 | 6.0% |
3 | 11 | 5.5% |
6 | 10 | 5.0% |
4 | 9 | 4.5% |
5 | 9 | 4.5% |
7 | 8 | 4.0% |
8 | 8 | 4.0% |
9 | 8 | 4.0% |
10 | 8 | 4.0% |
Other values (34) | 102 |
Value | Count | Frequency (%) |
1 | 14 | |
2 | 12 | |
3 | 11 | |
4 | 9 | |
5 | 9 | |
6 | 10 | |
7 | 8 | |
8 | 8 | |
9 | 8 | |
10 | 8 |
Value | Count | Frequency (%) |
44 | 1 | |
43 | 1 | |
42 | 1 | |
41 | 1 | |
40 | 1 | |
39 | 1 | |
38 | 1 | |
37 | 1 | |
36 | 1 | |
35 | 1 |
9.090909
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 27 |
---|---|
Distinct (%) | 13.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.4480969 |
Minimum | 1 |
---|---|
Maximum | 33.333333 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1.724138 |
Q1 | 3.030303 |
median | 4.166667 |
Q3 | 4.545455 |
95-th percentile | 16.666667 |
Maximum | 33.333333 |
Range | 32.333333 |
Interquartile range (IQR) | 1.515152 |
Descriptive statistics
Standard deviation | 5.4311226 |
---|---|
Coefficient of variation (CV) | 0.99688437 |
Kurtosis | 10.707151 |
Mean | 5.4480969 |
Median Absolute Deviation (MAD) | 1.005747 |
Skewness | 3.0492192 |
Sum | 1084.1713 |
Variance | 29.497093 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1.724138 | 35 | |
4.545455 | 32 | |
3.225806 | 24 | |
3.030303 | 22 | |
4.166667 | 15 | |
4.347826 | 14 | 7.0% |
16.666667 | 12 | 6.0% |
9.090909 | 6 | 3.0% |
3.333333 | 6 | 3.0% |
3.448276 | 6 | 3.0% |
Other values (17) | 27 |
Value | Count | Frequency (%) |
1.0 | 1 | 0.5% |
1.724138 | 35 | |
3.030303 | 22 | |
3.225806 | 24 | |
3.333333 | 6 | 3.0% |
3.448276 | 6 | 3.0% |
4.0 | 1 | 0.5% |
4.166667 | 15 | |
4.347826 | 14 | 7.0% |
4.545455 | 32 |
Value | Count | Frequency (%) |
33.333333 | 3 | 1.5% |
27.0 | 1 | 0.5% |
18.0 | 1 | 0.5% |
17.391304 | 1 | 0.5% |
17.0 | 1 | 0.5% |
16.666667 | 12 | |
13.043478 | 1 | 0.5% |
12.5 | 1 | 0.5% |
9.090909 | 6 | |
8.695652 | 1 | 0.5% |
[20-29]
Categorical
HIGH CORRELATION
 
Distinct | 11 |
---|---|
Distinct (%) | 5.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
[0-19] | |
---|---|
[20-29] | |
[40-49] | |
[30-39] | |
[50-59] | 6 |
Other values (6) |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 7.6231156 |
Min length | 7 |
Unique
Unique | 4 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | [20-29] |
---|---|
2nd row | [20-29] |
3rd row | [20-29] |
4th row | [20-29] |
5th row | [20-29] |
Common Values
Value | Count | Frequency (%) |
[0-19] | 81 | |
[20-29] | 52 | |
[40-49] | 35 | |
[30-39] | 13 | 6.5% |
[50-59] | 6 | 3.0% |
[60-99] | 6 | 3.0% |
4.545455 | 2 | 1.0% |
3.030303 | 1 | 0.5% |
3.225806 | 1 | 0.5% |
4.166667 | 1 | 0.5% |
Length
Value | Count | Frequency (%) |
0-19 | 81 | |
20-29 | 52 | |
40-49 | 35 | |
30-39 | 13 | 6.5% |
50-59 | 6 | 3.0% |
60-99 | 6 | 3.0% |
4.545455 | 2 | 1.0% |
3.030303 | 1 | 0.5% |
3.225806 | 1 | 0.5% |
4.166667 | 1 | 0.5% |
[22-24]
Categorical
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
[22-24] | |
---|---|
[20-22] | |
[18-20] | |
[20-29] | 2 |
[0-19] | 2 |
Other values (2) | 2 |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 7.9899497 |
Min length | 7 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | [22-24] |
---|---|
2nd row | [22-24] |
3rd row | [22-24] |
4th row | [22-24] |
5th row | [22-24] |
Common Values
Value | Count | Frequency (%) |
[22-24] | 109 | |
[20-22] | 46 | |
[18-20] | 38 | 19.1% |
[20-29] | 2 | 1.0% |
[0-19] | 2 | 1.0% |
[40-49] | 1 | 0.5% |
[30-39] | 1 | 0.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
22-24 | 109 | |
20-22 | 46 | |
18-20 | 38 | 19.1% |
20-29 | 2 | 1.0% |
0-19 | 2 | 1.0% |
40-49 | 1 | 0.5% |
30-39 | 1 | 0.5% |
3 | 1 | 9.090909 | [20-29] | [22-24] | |
---|---|---|---|---|---|
3 | 1.000 | 0.217 | 0.817 | 0.893 | 0.893 |
1 | 0.217 | 1.000 | 0.380 | 0.000 | 0.000 |
9.090909 | 0.817 | 0.380 | 1.000 | 0.843 | 0.602 |
[20-29] | 0.893 | 0.000 | 0.843 | 1.000 | 0.909 |
[22-24] | 0.893 | 0.000 | 0.602 | 0.909 | 1.000 |
[22-24] | 3 | [20-29] | |
---|---|---|---|
[22-24] | 1.000 | 0.751 | 0.751 |
3 | 0.751 | 1.000 | 0.689 |
[20-29] | 0.751 | 0.689 | 1.000 |
1 | 9.090909 | 3 | [20-29] | [22-24] | |
---|---|---|---|---|---|
1 | 1.000 | -0.745 | 0.099 | 0.000 | 0.000 |
9.090909 | -0.745 | 1.000 | 0.581 | 0.609 | 0.376 |
3 | 0.099 | 0.581 | 1.000 | 0.689 | 0.751 |
[20-29] | 0.000 | 0.609 | 0.689 | 1.000 | 0.751 |
[22-24] | 0.000 | 0.376 | 0.751 | 0.751 | 1.000 |
2020-01-01 | 떡국 | 3 | 1 | 9.090909 | [20-29] | [22-24] | |
---|---|---|---|---|---|---|---|
0 | 2020-01-01 | 배추김치 | 3 | 2 | 9.090909 | [20-29] | [22-24] |
1 | 2020-01-01 | 된장찌개 | 2 | 3 | 6.060606 | [20-29] | [22-24] |
2 | 2020-01-01 | 간장 | 2 | 4 | 6.060606 | [20-29] | [22-24] |
3 | 2020-01-01 | 옥수수샐러드 | 1 | 5 | 3.030303 | [20-29] | [22-24] |
4 | 2020-01-01 | 후르츠칵테일(통조림) | 1 | 6 | 3.030303 | [20-29] | [22-24] |
5 | 2020-01-01 | 잡곡밥 | 1 | 7 | 3.030303 | [20-29] | [22-24] |
6 | 2020-01-01 | 크림빵 | 1 | 8 | 3.030303 | [20-29] | [22-24] |
7 | 2020-01-01 | 검정콩밥 | 1 | 9 | 3.030303 | [20-29] | [22-24] |
8 | 2020-01-01 | 라면 | 1 | 10 | 3.030303 | [20-29] | [22-24] |
9 | 2020-01-01 | 계란국 | 1 | 11 | 3.030303 | [20-29] | [22-24] |
2020-01-01 | 떡국 | 3 | 1 | 9.090909 | [20-29] | [22-24] | |
---|---|---|---|---|---|---|---|
189 | 2020-01-01 | 떡만둣국 | 2 | 2 | 6.666667 | [30-39] | [22-24] |
190 | 2020-01-01 | 배추김치 | 2 | 3 | 6.666667 | [30-39] | [22-24] |
191 | 2020-01-01 | 쌀밥 | 2 | 4 | 6.666667 | [30-39] | [22-24] |
192 | 2020-01-01 | 깍두기 | 2 | 5 | 6.666667 | [30-39] | [22-24] |
193 | 2020-01-01 | 쌈장 | 1 | 6 | 3.333333 | [30-39] | [22-24] |
194 | 2020-01-01 | 채소샐러드 | 1 | 7 | 3.333333 | [30-39] | [22-24] |
195 | 2020-01-01 | 계란찜 | 1 | 8 | 3.333333 | [30-39] | [22-24] |
196 | 2020-01-01 | 고구마(구운것) | 1 | 9 | 3.333333 | [30-39] | [22-24] |
197 | 2020-01-01 | 고구마형 과자 | 1 | 10 | 3.333333 | [30-39] | [22-24] |
198 | 2020-01-01 | 짜장밥 | 1 | 11 | 3.333333 | [30-39] | [22-24] |