Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 149 |
Missing cells | 109 |
Missing cells (%) | 12.2% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 7.8 KiB |
Average record size in memory | 53.9 B |
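The missing-cell percentage above follows from the counts in this table. A minimal sketch of the arithmetic, assuming the share is taken over all cells (observations × variables); the variable names are illustrative:

```python
# Counts copied from the "Dataset statistics" table above.
n_variables = 6
n_observations = 149
missing_cells = 109

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(total_cells)            # 894
print(f"{missing_pct:.1f}%")  # 12.2%
```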
Variable types
Text | 1 |
---|---|
Numeric | 5 |
Dataset
Description | Sample |
---|---|
Author | 제타럭스시스템 |
URL | https://bigdata-geo.kr/user/dataset/view.do?data_sn=134 |
pop_grade is highly overall correlated with comp_grade | High correlation |
life_grade is highly overall correlated with total_grad | High correlation |
comp_grade is highly overall correlated with pop_grade | High correlation |
total_grad is highly overall correlated with life_grade | High correlation |
sales_grad has 109 (73.2%) missing values | Missing |
gid has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 13:25:20.544893 |
---|---|
Analysis finished | 2023-12-10 13:25:28.239521 |
Duration | 7.69 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
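A report of this kind can typically be regenerated with the `ydata-profiling` package named above. The sketch below assumes the dataset has been saved locally as a CSV file; the function name, the file names, and the report title are hypothetical, and the configuration referenced in `config.json` is not reproduced here.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file and write the HTML report to out_path."""
    # Imports are local so the function can be defined even when
    # pandas / ydata-profiling are not installed yet.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title="Dataset profile")  # title is illustrative
    profile.to_file(out_path)
```

Calling `build_report("sample.csv")` (hypothetical path) would write `report.html` next to the script. Timings and memory figures will differ from the ones listed above.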
gid
Text
UNIQUE
 
Distinct | 149 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.3 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 1490 |
---|---|
Distinct characters | 14 |
Distinct categories | 3 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 149 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 다사67ab47aa |
---|---|
2nd row | 다사68bb42ba |
3rd row | 다사67aa43ab |
4th row | 다사66aa46ba |
5th row | 다사64bb48bb |
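The sampled `gid` values all appear to share one fixed layout. The sketch below encodes that layout as a regular expression; the pattern is an assumption inferred from the sample rows only, not a documented specification of the identifier, and `looks_like_gid` is a hypothetical helper name.

```python
import re

# Assumed layout, inferred from the sampled values only:
# 2 Hangul syllables, 2 digits, 2 letters from {a, b}, 2 digits, 2 letters from {a, b}.
GID_PATTERN = re.compile(r"[가-힣]{2}[0-9]{2}[ab]{2}[0-9]{2}[ab]{2}")

def looks_like_gid(value: str) -> bool:
    """Return True if the whole string matches the assumed gid layout."""
    return GID_PATTERN.fullmatch(value) is not None

samples = ["다사67ab47aa", "다사68bb42ba", "다사67aa43ab"]
print([looks_like_gid(s) for s in samples])
```

A check like this could flag malformed identifiers before joining the table with other grid-based datasets.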
Value | Count | Frequency (%) |
다사67ab47aa | 1 | 0.7% |
다사67bb45bb | 1 | 0.7% |
다사68ab45bb | 1 | 0.7% |
다사66ba44bb | 1 | 0.7% |
다사69aa42bb | 1 | 0.7% |
다사66aa46bb | 1 | 0.7% |
다사64ab48ab | 1 | 0.7% |
다사65aa48ab | 1 | 0.7% |
다사68ab46ab | 1 | 0.7% |
다사65aa48ba | 1 | 0.7% |
Other values (139) | 139 | 93.3% |
Most occurring characters
Value | Count | Frequency (%) |
b | 309 | 20.7% |
a | 287 | 19.3% |
6 | 203 | 13.6% |
4 | 198 | 13.3% |
다 | 149 | 10.0% |
사 | 149 | 10.0% |
5 | 45 | 3.0% |
7 | 37 | 2.5% |
3 | 34 | 2.3% |
8 | 34 | 2.3% |
Other values (4) | 45 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 596 | 40.0% |
Decimal Number | 596 | 40.0% |
Other Letter | 298 | 20.0% |
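The category names in this table correspond to the Unicode general categories `Ll` (Lowercase Letter), `Nd` (Decimal Number), and `Lo` (Other Letter). A minimal sketch of how such counts can be obtained with the Python standard library, shown on the first sample value; the report's own implementation may differ, and `category_counts` is a hypothetical helper name.

```python
import unicodedata
from collections import Counter

def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category (e.g. 'Ll', 'Nd', 'Lo')."""
    return Counter(unicodedata.category(ch) for ch in text)

counts = category_counts("다사67ab47aa")
print(counts["Ll"], counts["Nd"], counts["Lo"])  # 4 4 2
```

With 4 lowercase letters, 4 digits, and 2 Hangul syllables per value, 149 values of this shape would give the 596 / 596 / 298 split listed above.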
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
6 | 203 | 34.1% |
4 | 198 | 33.2% |
5 | 45 | 7.6% |
7 | 37 | 6.2% |
3 | 34 | 5.7% |
8 | 34 | 5.7% |
2 | 20 | 3.4% |
9 | 13 | 2.2% |
1 | 11 | 1.8% |
0 | 1 | 0.2% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 309 | 51.8% |
a | 287 | 48.2% |
Other Letter
Value | Count | Frequency (%) |
다 | 149 | 50.0% |
사 | 149 | 50.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 596 | 40.0% |
Common | 596 | 40.0% |
Hangul | 298 | 20.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
6 | 203 | 34.1% |
4 | 198 | 33.2% |
5 | 45 | 7.6% |
7 | 37 | 6.2% |
3 | 34 | 5.7% |
8 | 34 | 5.7% |
2 | 20 | 3.4% |
9 | 13 | 2.2% |
1 | 11 | 1.8% |
0 | 1 | 0.2% |
Latin
Value | Count | Frequency (%) |
b | 309 | 51.8% |
a | 287 | 48.2% |
Hangul
Value | Count | Frequency (%) |
다 | 149 | 50.0% |
사 | 149 | 50.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1192 | 80.0% |
Hangul | 298 | 20.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
b | 309 | 25.9% |
a | 287 | 24.1% |
6 | 203 | 17.0% |
4 | 198 | 16.6% |
5 | 45 | 3.8% |
7 | 37 | 3.1% |
3 | 34 | 2.9% |
8 | 34 | 2.9% |
2 | 20 | 1.7% |
9 | 13 | 1.1% |
Other values (2) | 12 | 1.0% |
Hangul
Value | Count | Frequency (%) |
다 | 149 | 50.0% |
사 | 149 | 50.0% |
pop_grade
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 13 |
---|---|
Distinct (%) | 8.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.3959732 |
Minimum | 1 |
---|---|
Maximum | 39 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.4 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 1 |
95-th percentile | 8 |
Maximum | 39 |
Range | 38 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 5.7290379 |
---|---|
Coefficient of variation (CV) | 2.3911111 |
Kurtosis | 27.631369 |
Mean | 2.3959732 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.1738443 |
Sum | 357 |
Variance | 32.821876 |
Monotonicity | Not monotonic |
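Several of the `pop_grade` statistics above can be derived from one another. The following sketch recomputes them from the reported sum, count, standard deviation, and quantiles; it is a consistency check under the usual textbook definitions, not a reproduction of the profiler's internals.

```python
import math

# Values copied from the pop_grade tables above.
n = 149
total = 357
std = 5.7290379
q1, q3 = 1, 1
minimum, maximum = 1, 39

mean = total / n                 # sum divided by number of observations
variance = std ** 2              # variance is the squared standard deviation
cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(round(mean, 7), round(variance, 4), round(cv, 5), iqr, value_range)
```

An IQR of 0 alongside a range of 38 reflects how most observations share the value 1 while a few large values stretch the tail.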
Value | Count | Frequency (%) |
1 | 127 | 85.2% |
2 | 10 | 6.7% |
8 | 2 | 1.3% |
7 | 1 | 0.7% |
27 | 1 | 0.7% |
38 | 1 | 0.7% |
10 | 1 | 0.7% |
13 | 1 | 0.7% |
5 | 1 | 0.7% |
33 | 1 | 0.7% |
Other values (3) | 3 | 2.0% |
Value | Count | Frequency (%) |
1 | 127 | 85.2% |
2 | 10 | 6.7% |
5 | 1 | 0.7% |
6 | 1 | 0.7% |
7 | 1 | 0.7% |
8 | 2 | 1.3% |
10 | 1 | 0.7% |
13 | 1 | 0.7% |
16 | 1 | 0.7% |
27 | 1 | 0.7% |
Value | Count | Frequency (%) |
39 | 1 | 0.7% |
38 | 1 | 0.7% |
33 | 1 | 0.7% |
27 | 1 | 0.7% |
16 | 1 | 0.7% |
13 | 1 | 0.7% |
10 | 1 | 0.7% |
8 | 2 | 1.3% |
7 | 1 | 0.7% |
6 | 1 | 0.7% |
life_grade
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 27 |
---|---|
Distinct (%) | 18.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 18.348993 |
Minimum | 4 |
---|---|
Maximum | 70 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.4 KiB |
Quantile statistics
Minimum | 4 |
---|---|
5-th percentile | 13 |
Q1 | 15 |
median | 15 |
Q3 | 15 |
95-th percentile | 41.6 |
Maximum | 70 |
Range | 66 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 10.190699 |
---|---|
Coefficient of variation (CV) | 0.55538193 |
Kurtosis | 10.184438 |
Mean | 18.348993 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.115943 |
Sum | 2734 |
Variance | 103.85035 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
15 | 112 | 75.2% |
12 | 4 | 2.7% |
64 | 2 | 1.3% |
13 | 2 | 1.3% |
34 | 2 | 1.3% |
19 | 2 | 1.3% |
18 | 2 | 1.3% |
23 | 2 | 1.3% |
31 | 2 | 1.3% |
14 | 2 | 1.3% |
Other values (17) | 17 | 11.4% |
Value | Count | Frequency (%) |
4 | 1 | 0.7% |
8 | 1 | 0.7% |
10 | 1 | 0.7% |
12 | 4 | 2.7% |
13 | 2 | 1.3% |
14 | 2 | 1.3% |
15 | 112 | 75.2% |
18 | 2 | 1.3% |
19 | 2 | 1.3% |
20 | 1 | 0.7% |
Value | Count | Frequency (%) |
70 | 1 | 0.7% |
64 | 2 | 1.3% |
54 | 1 | 0.7% |
47 | 1 | 0.7% |
46 | 1 | 0.7% |
44 | 1 | 0.7% |
42 | 1 | 0.7% |
41 | 1 | 0.7% |
40 | 1 | 0.7% |
35 | 1 | 0.7% |
comp_grade
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 11 |
---|---|
Distinct (%) | 7.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 73.939597 |
Minimum | 41 |
---|---|
Maximum | 76 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.4 KiB |
Quantile statistics
Minimum | 41 |
---|---|
5-th percentile | 66 |
Q1 | 76 |
median | 76 |
Q3 | 76 |
95-th percentile | 76 |
Maximum | 76 |
Range | 35 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 4.8452097 |
---|---|
Coefficient of variation (CV) | 0.065529295 |
Kurtosis | 20.953212 |
Mean | 73.939597 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -3.982741 |
Sum | 11017 |
Variance | 23.476057 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
76 | 112 | 75.2% |
71 | 22 | 14.8% |
66 | 3 | 2.0% |
64 | 3 | 2.0% |
68 | 2 | 1.3% |
67 | 2 | 1.3% |
41 | 1 | 0.7% |
46 | 1 | 0.7% |
72 | 1 | 0.7% |
63 | 1 | 0.7% |
Value | Count | Frequency (%) |
41 | 1 | 0.7% |
46 | 1 | 0.7% |
61 | 1 | 0.7% |
63 | 1 | 0.7% |
64 | 3 | 2.0% |
66 | 3 | 2.0% |
67 | 2 | 1.3% |
68 | 2 | 1.3% |
71 | 22 | 14.8% |
72 | 1 | 0.7% |
Value | Count | Frequency (%) |
76 | 112 | 75.2% |
72 | 1 | 0.7% |
71 | 22 | 14.8% |
68 | 2 | 1.3% |
67 | 2 | 1.3% |
66 | 3 | 2.0% |
64 | 3 | 2.0% |
63 | 1 | 0.7% |
61 | 1 | 0.7% |
46 | 1 | 0.7% |
sales_grad
Real number (ℝ)
MISSING
 
Distinct | 16 |
---|---|
Distinct (%) | 40.0% |
Missing | 109 |
Missing (%) | 73.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.2 |
Minimum | 1 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.4 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1.95 |
Q1 | 5 |
median | 7.5 |
Q3 | 10 |
95-th percentile | 24.05 |
Maximum | 100 |
Range | 99 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 15.46576 |
---|---|
Coefficient of variation (CV) | 1.516251 |
Kurtosis | 30.816629 |
Mean | 10.2 |
Median Absolute Deviation (MAD) | 2.5 |
Skewness | 5.2895895 |
Sum | 408 |
Variance | 239.18974 |
Monotonicity | Not monotonic |
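The `sales_grad` figures above are consistent with statistics computed over the non-missing values only, while the missing share is taken over all observations. A short sketch of that arithmetic, assuming exactly this convention; the variable names are illustrative.

```python
# Values copied from the sales_grad tables above.
n_observations = 149
n_missing = 109
n_distinct = 16
value_sum = 408

n_present = n_observations - n_missing          # observations with a value
missing_pct = 100 * n_missing / n_observations  # share of all observations
distinct_pct = 100 * n_distinct / n_present     # share of non-missing values
mean = value_sum / n_present                    # mean over non-missing values

print(n_present, f"{missing_pct:.1f}%", f"{distinct_pct:.1f}%", mean)
# 40 73.2% 40.0% 10.2
```

With only 40 usable values, the quantiles and correlations involving `sales_grad` rest on a much smaller sample than those of the other variables.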
Value | Count | Frequency (%) |
9 | 5 | 3.4% |
6 | 4 | 2.7% |
7 | 4 | 2.7% |
8 | 4 | 2.7% |
10 | 4 | 2.7% |
2 | 3 | 2.0% |
5 | 3 | 2.0% |
3 | 3 | 2.0% |
11 | 2 | 1.3% |
1 | 2 | 1.3% |
Other values (6) | 6 | 4.0% |
(Missing) | 109 | 73.2% |
Value | Count | Frequency (%) |
1 | 2 | 1.3% |
2 | 3 | 2.0% |
3 | 3 | 2.0% |
4 | 1 | 0.7% |
5 | 3 | 2.0% |
6 | 4 | 2.7% |
7 | 4 | 2.7% |
8 | 4 | 2.7% |
9 | 5 | 3.4% |
10 | 4 | 2.7% |
Value | Count | Frequency (%) |
100 | 1 | 0.7% |
25 | 1 | 0.7% |
24 | 1 | 0.7% |
17 | 1 | 0.7% |
15 | 1 | 0.7% |
11 | 2 | 1.3% |
10 | 4 | 2.7% |
9 | 5 | 3.4% |
8 | 4 | 2.7% |
7 | 4 | 2.7% |
total_grad
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 22 |
---|---|
Distinct (%) | 14.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 42.342282 |
Minimum | 20 |
---|---|
Maximum | 73 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.4 KiB |
Quantile statistics
Minimum | 20 |
---|---|
5-th percentile | 38 |
Q1 | 41 |
median | 41 |
Q3 | 41 |
95-th percentile | 53 |
Maximum | 73 |
Range | 53 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 5.662551 |
---|---|
Coefficient of variation (CV) | 0.13373278 |
Kurtosis | 11.659739 |
Mean | 42.342282 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.4859726 |
Sum | 6309 |
Variance | 32.064484 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
41 | 113 | 75.8% |
51 | 5 | 3.4% |
38 | 4 | 2.7% |
37 | 3 | 2.0% |
40 | 2 | 1.3% |
44 | 2 | 1.3% |
45 | 2 | 1.3% |
53 | 2 | 1.3% |
55 | 2 | 1.3% |
47 | 2 | 1.3% |
Other values (12) | 12 | 8.1% |
Value | Count | Frequency (%) |
20 | 1 | 0.7% |
35 | 1 | 0.7% |
36 | 1 | 0.7% |
37 | 3 | 2.0% |
38 | 4 | 2.7% |
39 | 1 | 0.7% |
40 | 2 | 1.3% |
41 | 113 | 75.8% |
42 | 1 | 0.7% |
43 | 1 | 0.7% |
Value | Count | Frequency (%) |
73 | 1 | 0.7% |
67 | 1 | 0.7% |
66 | 1 | 0.7% |
63 | 1 | 0.7% |
56 | 1 | 0.7% |
55 | 2 | 1.3% |
53 | 2 | 1.3% |
51 | 5 | 3.4% |
50 | 1 | 0.7% |
47 | 2 | 1.3% |
Variable | pop_grade | life_grade | comp_grade | sales_grad | total_grad |
---|---|---|---|---|---|
pop_grade | 1.000 | 0.641 | 0.656 | 0.277 | 0.740 |
life_grade | 0.641 | 1.000 | 0.759 | 0.000 | 0.809 |
comp_grade | 0.656 | 0.759 | 1.000 | 0.000 | 0.769 |
sales_grad | 0.277 | 0.000 | 0.000 | 1.000 | 0.000 |
total_grad | 0.740 | 0.809 | 0.769 | 0.000 | 1.000 |
Variable | pop_grade | life_grade | comp_grade | sales_grad | total_grad |
---|---|---|---|---|---|
pop_grade | 1.000 | 0.416 | -0.723 | -0.023 | 0.496 |
life_grade | 0.416 | 1.000 | -0.422 | 0.330 | 0.702 |
comp_grade | -0.723 | -0.422 | 1.000 | -0.158 | -0.266 |
sales_grad | -0.023 | 0.330 | -0.158 | 1.000 | 0.090 |
total_grad | 0.496 | 0.702 | -0.266 | 0.090 | 1.000 |
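A correlation matrix like the one directly above can be scanned for strongly related pairs. The sketch below copies that matrix's coefficients and lists the pairs whose absolute coefficient reaches a cutoff. The cutoff of 0.7 is an illustrative assumption, not necessarily the setting the report used for its alerts, and `strong_pairs` is a hypothetical helper name.

```python
names = ["pop_grade", "life_grade", "comp_grade", "sales_grad", "total_grad"]

# Coefficients copied from the matrix directly above.
matrix = [
    [1.000, 0.416, -0.723, -0.023, 0.496],
    [0.416, 1.000, -0.422, 0.330, 0.702],
    [-0.723, -0.422, 1.000, -0.158, -0.266],
    [-0.023, 0.330, -0.158, 1.000, 0.090],
    [0.496, 0.702, -0.266, 0.090, 1.000],
]

def strong_pairs(names, matrix, cutoff):
    """Return (a, b, r) for each pair above the diagonal with |r| >= cutoff."""
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            if abs(matrix[i][j]) >= cutoff:
                pairs.append((names[i], names[j], matrix[i][j]))
    return pairs

# Sanity check: a correlation matrix should be symmetric.
is_symmetric = all(
    matrix[i][j] == matrix[j][i]
    for i in range(len(names))
    for j in range(len(names))
)
pairs = strong_pairs(names, matrix, cutoff=0.7)  # 0.7 is an assumed cutoff

print(is_symmetric)
print(pairs)
```

With this cutoff the scan returns `pop_grade`/`comp_grade` and `life_grade`/`total_grad`, the same two pairs named in the high-correlation alerts near the top of the report. Whether the report derived its alerts this way is not stated.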
Index | gid | pop_grade | life_grade | comp_grade | sales_grad | total_grad |
---|---|---|---|---|---|---|
0 | 다사67ab47aa | 1 | 15 | 76 | <NA> | 41 |
1 | 다사68bb42ba | 1 | 15 | 76 | <NA> | 41 |
2 | 다사67aa43ab | 1 | 15 | 76 | 3 | 41 |
3 | 다사66aa46ba | 1 | 15 | 76 | <NA> | 41 |
4 | 다사64bb48bb | 1 | 15 | 76 | <NA> | 41 |
5 | 다사64bb46bb | 1 | 15 | 76 | <NA> | 41 |
6 | 다사64bb45bb | 1 | 15 | 76 | 7 | 41 |
7 | 다사67ab44ba | 1 | 14 | 71 | <NA> | 38 |
8 | 다사63ba44bb | 1 | 15 | 76 | <NA> | 41 |
9 | 다사63ab45bb | 7 | 64 | 41 | 7 | 51 |
Index | gid | pop_grade | life_grade | comp_grade | sales_grad | total_grad |
---|---|---|---|---|---|---|
139 | 다사66ba40bb | 1 | 15 | 76 | <NA> | 41 |
140 | 다사63aa45bb | 6 | 34 | 64 | 8 | 47 |
141 | 다사68ab44ab | 1 | 15 | 76 | <NA> | 41 |
142 | 다사67ab41ab | 1 | 15 | 76 | <NA> | 41 |
143 | 다사62bb45ba | 8 | 19 | 71 | 2 | 44 |
144 | 다사64bb44aa | 1 | 15 | 76 | <NA> | 41 |
145 | 다사62ba47aa | 1 | 15 | 76 | <NA> | 41 |
146 | 다사66ba44ba | 1 | 15 | 76 | <NA> | 41 |
147 | 다사63ba45ba | 1 | 15 | 76 | 6 | 41 |
148 | 다사69ba44ba | 1 | 15 | 76 | <NA> | 41 |