Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 55 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.5 KiB |
Average record size in memory | 46.4 B |
Variable types
Text | 1 |
---|---|
Categorical | 1 |
Numeric | 3 |
Dataset
Description | Sample |
---|---|
Author | 제타럭스시스템 |
URL | https://bigdata-geo.kr/user/dataset/view.do?data_sn=499 |
LIFE_INFRA is highly overall correlated with TOTL_GRAD | High correlation |
CMPTT_GRAD is highly overall correlated with TOTL_GRAD and 1 other fields | High correlation |
TOTL_GRAD is highly overall correlated with LIFE_INFRA and 2 other fields | High correlation |
PUL_GRAD is highly overall correlated with CMPTT_GRAD and 1 other fields | High correlation |
PUL_GRAD is highly imbalanced (86.9%) | Imbalance |
GRID_NO has unique values | Unique |
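The 86.9% imbalance reported for PUL_GRAD can be reproduced from its two class counts (54 and 1, as listed under Common Values). The sketch below assumes the score is one minus the Shannon entropy normalised by the log of the number of classes. That definition matches the reported figure, but this report does not itself state it. The function name `imbalance_score` is illustrative.

```python
import math


def imbalance_score(counts):
    """Return 1 - (Shannon entropy / log2(number of classes)).

    0.0 means perfectly balanced classes; values near 1.0 mean one class
    dominates. Assumed definition, not taken from this report.
    """
    n_classes = len(counts)
    if n_classes < 2:
        # Convention chosen for this sketch: a single class scores 0.
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(n_classes)


# Class counts for PUL_GRAD as listed in this report.
pul_grad_counts = {"1": 54, "2": 1}
score = imbalance_score(list(pul_grad_counts.values()))
print(f"{score:.1%}")
```

A perfectly balanced two-class column (for example counts of 5 and 5) gives a score of 0 under the same definition.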
Reproduction
Analysis started | 2023-12-10 13:21:32.855896 |
---|---|
Analysis finished | 2023-12-10 13:21:35.235605 |
Duration | 2.38 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
GRID_NO
Text
UNIQUE
 
Distinct | 55 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 572.0 B |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 550 |
---|---|
Distinct characters | 9 |
Distinct categories | 3 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 55 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 다바96bb47bb |
---|---|
2nd row | 다바96bb48aa |
3rd row | 다바97aa47bb |
4th row | 다바97aa48aa |
5th row | 다바97aa48ab |
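Every sampled GRID_NO is 10 characters long and appears to follow the layout of two Hangul syllables, two digits, two letters from `a`/`b`, two digits, and two letters from `a`/`b`. The check below is a minimal sketch of that inferred layout. The pattern and the helper name `looks_like_grid_no` are assumptions drawn from the sample rows, not a documented specification of the grid identifier.

```python
import re

# Layout inferred from the sample rows only (assumption):
# 2 Hangul syllables + 2 digits + 2 of [ab] + 2 digits + 2 of [ab].
GRID_NO_PATTERN = re.compile(r"[가-힣]{2}[0-9]{2}[ab]{2}[0-9]{2}[ab]{2}")


def looks_like_grid_no(value: str) -> bool:
    """Return True if the whole string matches the inferred layout."""
    return GRID_NO_PATTERN.fullmatch(value) is not None


samples = ["다바96bb47bb", "다바96bb48aa", "다바97aa47bb"]
print(all(looks_like_grid_no(s) for s in samples))
```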
Value | Count | Frequency (%) |
다바96bb47bb | 1 | 1.8% |
다바97bb49ab | 1 | 1.8% |
다바98aa47bb | 1 | 1.8% |
다바98aa48aa | 1 | 1.8% |
다바98aa48ab | 1 | 1.8% |
다바98aa48ba | 1 | 1.8% |
다바98aa48bb | 1 | 1.8% |
다바98aa49aa | 1 | 1.8% |
다바98aa49ab | 1 | 1.8% |
다바98aa49ba | 1 | 1.8% |
Other values (45) | 45 | 81.8% |
Most occurring characters
Value | Count | Frequency (%) |
a | 115 | 20.9% |
b | 105 | 19.1% |
9 | 75 | 13.6% |
다 | 55 | 10.0% |
바 | 55 | 10.0% |
4 | 55 | 10.0% |
8 | 53 | 9.6% |
7 | 35 | 6.4% |
6 | 2 | 0.4% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 220 | 40.0% |
Decimal Number | 220 | 40.0% |
Other Letter | 110 | 20.0% |
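The category breakdown above corresponds to Unicode general categories (`Ll`, `Nd`, `Lo`). As a rough illustration, the following sketch counts characters and categories with Python's standard library over the five sample GRID_NO values shown in this report. It covers only those five rows, so its totals are a subset of the figures in the tables.

```python
from collections import Counter
import unicodedata

# The five sample rows listed for GRID_NO in this report.
sample_ids = [
    "다바96bb47bb",
    "다바96bb48aa",
    "다바97aa47bb",
    "다바97aa48aa",
    "다바97aa48ab",
]

# Long names for the general-category codes that occur in these values.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Lo": "Other Letter",
}

all_chars = "".join(sample_ids)
char_counts = Counter(all_chars)
category_counts = Counter(unicodedata.category(ch) for ch in all_chars)

for code, count in category_counts.most_common():
    share = count / len(all_chars)
    print(f"{CATEGORY_NAMES.get(code, code)} | {count} | {share:.1%}")
```

Each identifier contributes two Hangul syllables, four digits, and four lowercase letters, which is why the full-column totals split 110 / 220 / 220.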
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
9 | 75 | 34.1% |
4 | 55 | 25.0% |
8 | 53 | 24.1% |
7 | 35 | 15.9% |
6 | 2 | 0.9% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 115 | 52.3% |
b | 105 | 47.7% |
Other Letter
Value | Count | Frequency (%) |
다 | 55 | 50.0% |
바 | 55 | 50.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 220 | 40.0% |
Common | 220 | 40.0% |
Hangul | 110 | 20.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
9 | 75 | 34.1% |
4 | 55 | 25.0% |
8 | 53 | 24.1% |
7 | 35 | 15.9% |
6 | 2 | 0.9% |
Latin
Value | Count | Frequency (%) |
a | 115 | 52.3% |
b | 105 | 47.7% |
Hangul
Value | Count | Frequency (%) |
다 | 55 | 50.0% |
바 | 55 | 50.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 440 | 80.0% |
Hangul | 110 | 20.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
a | 115 | 26.1% |
b | 105 | 23.9% |
9 | 75 | 17.0% |
4 | 55 | 12.5% |
8 | 53 | 12.0% |
7 | 35 | 8.0% |
6 | 2 | 0.5% |
Hangul
Value | Count | Frequency (%) |
다 | 55 | 50.0% |
바 | 55 | 50.0% |
PUL_GRAD
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 3.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 572.0 B |
1 | 54 |
---|---|
2 | 1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | 1.8% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 54 | 98.2% |
2 | 1 | 1.8% |
LIFE_INFRA
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 14 |
---|---|
Distinct (%) | 25.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.909091 |
Minimum | 7 |
---|---|
Maximum | 37 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 627.0 B |
Quantile statistics
Minimum | 7 |
---|---|
5-th percentile | 9.7 |
Q1 | 17 |
median | 17 |
Q3 | 17 |
95-th percentile | 20.6 |
Maximum | 37 |
Range | 30 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 4.5593726 |
---|---|
Coefficient of variation (CV) | 0.26964032 |
Kurtosis | 10.909339 |
Mean | 16.909091 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.3153311 |
Sum | 930 |
Variance | 20.787879 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
17 | 41 | 74.5% |
13 | 2 | 3.6% |
35 | 1 | 1.8% |
20 | 1 | 1.8% |
37 | 1 | 1.8% |
12 | 1 | 1.8% |
22 | 1 | 1.8% |
8 | 1 | 1.8% |
9 | 1 | 1.8% |
7 | 1 | 1.8% |
Other values (4) | 4 | 7.3% |
Value | Count | Frequency (%) |
7 | 1 | 1.8% |
8 | 1 | 1.8% |
9 | 1 | 1.8% |
10 | 1 | 1.8% |
12 | 1 | 1.8% |
13 | 2 | 3.6% |
14 | 1 | 1.8% |
15 | 1 | 1.8% |
17 | 41 | 74.5% |
18 | 1 | 1.8% |
Value | Count | Frequency (%) |
37 | 1 | 1.8% |
35 | 1 | 1.8% |
22 | 1 | 1.8% |
20 | 1 | 1.8% |
18 | 1 | 1.8% |
17 | 41 | 74.5% |
15 | 1 | 1.8% |
14 | 1 | 1.8% |
13 | 2 | 3.6% |
12 | 1 | 1.8% |
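The LIFE_INFRA summary figures can be cross-checked against the value tables above. Taken together, the lowest-values and highest-values tables list all 14 distinct values with their counts, so the full column can be rebuilt. The sketch below does that with the standard library, assuming the reported variance and standard deviation are the sample (n − 1) versions, which is what the numbers agree with.

```python
import statistics

# Value -> count for LIFE_INFRA, combined from the tables in this report.
life_infra_counts = {
    7: 1, 8: 1, 9: 1, 10: 1, 12: 1, 13: 2, 14: 1, 15: 1,
    17: 41, 18: 1, 20: 1, 22: 1, 35: 1, 37: 1,
}

# Expand the counts back into the 55 individual observations.
values = [v for v, n in life_infra_counts.items() for _ in range(n)]

total = sum(values)
mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
stdev = statistics.stdev(values)        # sample standard deviation
cv = stdev / mean                       # coefficient of variation

print(f"n={len(values)} sum={total} mean={mean:.6f} median={median}")
print(f"variance={variance:.6f} stdev={stdev:.7f} cv={cv:.8f}")
```

Because 41 of the 55 observations equal 17, the quartiles Q1, median, and Q3 all coincide at 17, which explains the reported IQR and MAD of 0.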
CMPTT_GRAD
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 12.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 71.363636 |
Minimum | 66 |
---|---|
Maximum | 72 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 627.0 B |
Quantile statistics
Minimum | 66 |
---|---|
5-th percentile | 67.7 |
Q1 | 71 |
median | 72 |
Q3 | 72 |
95-th percentile | 72 |
Maximum | 72 |
Range | 6 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.3792595 |
---|---|
Coefficient of variation (CV) | 0.019327204 |
Kurtosis | 6.6734925 |
Mean | 71.363636 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -2.6711829 |
Sum | 3925 |
Variance | 1.9023569 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
72 | 39 | 70.9% |
71 | 10 | 18.2% |
67 | 2 | 3.6% |
69 | 1 | 1.8% |
70 | 1 | 1.8% |
68 | 1 | 1.8% |
66 | 1 | 1.8% |
Value | Count | Frequency (%) |
66 | 1 | 1.8% |
67 | 2 | 3.6% |
68 | 1 | 1.8% |
69 | 1 | 1.8% |
70 | 1 | 1.8% |
71 | 10 | 18.2% |
72 | 39 | 70.9% |
Value | Count | Frequency (%) |
72 | 39 | 70.9% |
71 | 10 | 18.2% |
70 | 1 | 1.8% |
69 | 1 | 1.8% |
68 | 1 | 1.8% |
67 | 2 | 3.6% |
66 | 1 | 1.8% |
TOTL_GRAD
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 11 |
---|---|
Distinct (%) | 20.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 33.436364 |
Minimum | 27 |
---|---|
Maximum | 44 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 627.0 B |
Quantile statistics
Minimum | 27 |
---|---|
5-th percentile | 28.7 |
Q1 | 34 |
median | 34 |
Q3 | 34 |
95-th percentile | 34.6 |
Maximum | 44 |
Range | 17 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2.8203207 |
---|---|
Coefficient of variation (CV) | 0.084348906 |
Kurtosis | 5.5081023 |
Mean | 33.436364 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.88160079 |
Sum | 1839 |
Variance | 7.9542088 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
34 | 40 | 72.7% |
30 | 4 | 7.3% |
27 | 2 | 3.6% |
29 | 2 | 3.6% |
44 | 1 | 1.8% |
43 | 1 | 1.8% |
36 | 1 | 1.8% |
33 | 1 | 1.8% |
28 | 1 | 1.8% |
31 | 1 | 1.8% |
Value | Count | Frequency (%) |
27 | 2 | 3.6% |
28 | 1 | 1.8% |
29 | 2 | 3.6% |
30 | 4 | 7.3% |
31 | 1 | 1.8% |
32 | 1 | 1.8% |
33 | 1 | 1.8% |
34 | 40 | 72.7% |
36 | 1 | 1.8% |
43 | 1 | 1.8% |
Value | Count | Frequency (%) |
44 | 1 | 1.8% |
43 | 1 | 1.8% |
36 | 1 | 1.8% |
34 | 40 | 72.7% |
33 | 1 | 1.8% |
32 | 1 | 1.8% |
31 | 1 | 1.8% |
30 | 4 | 7.3% |
29 | 2 | 3.6% |
28 | 1 | 1.8% |
Variable | GRID_NO | PUL_GRAD | LIFE_INFRA | CMPTT_GRAD | TOTL_GRAD |
---|---|---|---|---|---|
GRID_NO | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
PUL_GRAD | 1.000 | 1.000 | 0.000 | 0.608 | 0.608 |
LIFE_INFRA | 1.000 | 0.000 | 1.000 | 0.924 | 0.978 |
CMPTT_GRAD | 1.000 | 0.608 | 0.924 | 1.000 | 0.863 |
TOTL_GRAD | 1.000 | 0.608 | 0.978 | 0.863 | 1.000 |
Variable | LIFE_INFRA | CMPTT_GRAD | TOTL_GRAD | PUL_GRAD |
---|---|---|---|---|
LIFE_INFRA | 1.000 | 0.182 | 0.804 | 0.000 |
CMPTT_GRAD | 0.182 | 1.000 | 0.546 | 0.622 |
TOTL_GRAD | 0.804 | 0.546 | 1.000 | 0.622 |
PUL_GRAD | 0.000 | 0.622 | 0.622 | 1.000 |
Index | GRID_NO | PUL_GRAD | LIFE_INFRA | CMPTT_GRAD | TOTL_GRAD |
---|---|---|---|---|---|
0 | 다바96bb47bb | 1 | 35 | 71 | 44 |
1 | 다바96bb48aa | 1 | 20 | 69 | 34 |
2 | 다바97aa47bb | 1 | 37 | 67 | 43 |
3 | 다바97aa48aa | 1 | 13 | 71 | 30 |
4 | 다바97aa48ab | 1 | 12 | 71 | 30 |
5 | 다바97aa48ba | 1 | 22 | 71 | 36 |
6 | 다바97aa48bb | 1 | 17 | 72 | 34 |
7 | 다바97aa49aa | 1 | 17 | 72 | 34 |
8 | 다바97aa49ab | 1 | 17 | 72 | 34 |
9 | 다바97ab47bb | 1 | 17 | 72 | 34 |
Index | GRID_NO | PUL_GRAD | LIFE_INFRA | CMPTT_GRAD | TOTL_GRAD |
---|---|---|---|---|---|
45 | 다바98ab49ab | 1 | 17 | 72 | 34 |
46 | 다바98ab49ba | 1 | 17 | 72 | 34 |
47 | 다바98ab49bb | 1 | 17 | 72 | 34 |
48 | 다바98ba47bb | 1 | 17 | 72 | 34 |
49 | 다바98ba48ab | 1 | 18 | 66 | 30 |
50 | 다바98ba48ba | 1 | 17 | 72 | 34 |
51 | 다바98ba48bb | 1 | 17 | 72 | 34 |
52 | 다바98ba49aa | 1 | 17 | 72 | 34 |
53 | 다바98ba49ab | 1 | 17 | 72 | 34 |
54 | 다바98ba49ba | 1 | 17 | 72 | 34 |