Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 139 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.1 KiB |
Average record size in memory | 44.9 B |
Variable types
Text | 1 |
---|---|
Categorical | 3 |
Numeric | 1 |
Dataset
Description | Sample |
---|---|
Author | 제타럭스시스템 |
URL | https://bigdata-geo.kr/user/dataset/view.do?data_sn=496 |
PUL_GRAD has constant value "" | Constant |
LIFE_INFRA is highly overall correlated with CMPTT_GRAD and 1 other fields | High correlation |
CMPTT_GRAD is highly overall correlated with LIFE_INFRA and 1 other fields | High correlation |
TOTL_GRAD is highly overall correlated with LIFE_INFRA and 1 other fields | High correlation |
CMPTT_GRAD is highly imbalanced (82.0%) | Imbalance |
TOTL_GRAD is highly imbalanced (82.0%) | Imbalance |
GIRD_NO has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 13:21:10.600496 |
---|---|
Analysis finished | 2023-12-10 13:21:11.433612 |
Duration | 0.83 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
GIRD_NO
Text
UNIQUE
 
Distinct | 139 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.2 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 1390 |
---|---|
Distinct characters | 11 |
Distinct categories | 3 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 139 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 마마72ba30ab |
---|---|
2nd row | 마마73aa30ba |
3rd row | 마마73aa30bb |
4th row | 마마73ab30bb |
5th row | 마마73aa31aa |
Value | Count | Frequency (%) |
마마72ba30ab | 1 | 0.7% |
마마71ba31ba | 1 | 0.7% |
마마71ba31aa | 1 | 0.7% |
마마71ba30bb | 1 | 0.7% |
마마71ba30ba | 1 | 0.7% |
마마71ba30ab | 1 | 0.7% |
마마71ba30aa | 1 | 0.7% |
마마71ba29bb | 1 | 0.7% |
마마71ba29ba | 1 | 0.7% |
마마71aa29ab | 1 | 0.7% |
Other values (129) | 129 |
Most occurring characters
Value | Count | Frequency (%) |
b | 283 | |
마 | 278 | |
a | 273 | |
7 | 129 | |
0 | 103 | 7.4% |
3 | 89 | 6.4% |
1 | 82 | 5.9% |
2 | 77 | 5.5% |
9 | 45 | 3.2% |
8 | 20 | 1.4% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 556 | |
Decimal Number | 556 | |
Other Letter | 278 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
7 | 129 | |
0 | 103 | |
3 | 89 | |
1 | 82 | |
2 | 77 | |
9 | 45 | 8.1% |
8 | 20 | 3.6% |
6 | 11 | 2.0% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 283 | |
a | 273 |
Other Letter
Value | Count | Frequency (%) |
마 | 278 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 556 | |
Common | 556 | |
Hangul | 278 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
7 | 129 | |
0 | 103 | |
3 | 89 | |
1 | 82 | |
2 | 77 | |
9 | 45 | 8.1% |
8 | 20 | 3.6% |
6 | 11 | 2.0% |
Latin
Value | Count | Frequency (%) |
b | 283 | |
a | 273 |
Hangul
Value | Count | Frequency (%) |
마 | 278 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1112 | |
Hangul | 278 | 20.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
b | 283 | |
a | 273 | |
7 | 129 | |
0 | 103 | 9.3% |
3 | 89 | 8.0% |
1 | 82 | 7.4% |
2 | 77 | 6.9% |
9 | 45 | 4.0% |
8 | 20 | 1.8% |
6 | 11 | 1.0% |
Hangul
Value | Count | Frequency (%) |
마 | 278 |
PUL_GRAD
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.2 KiB |
1 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 139 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 139 |
LIFE_INFRA
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.784173 |
Minimum | 10 |
---|---|
Maximum | 24 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.4 KiB |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 16.7 |
Q1 | 17 |
median | 17 |
Q3 | 17 |
95-th percentile | 17 |
Maximum | 24 |
Range | 14 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1.344676 |
---|---|
Coefficient of variation (CV) | 0.080115714 |
Kurtosis | 17.713999 |
Mean | 16.784173 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -2.1198479 |
Sum | 2333 |
Variance | 1.8081535 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
17 | 131 | |
11 | 3 | 2.2% |
14 | 1 | 0.7% |
10 | 1 | 0.7% |
12 | 1 | 0.7% |
24 | 1 | 0.7% |
13 | 1 | 0.7% |
Value | Count | Frequency (%) |
10 | 1 | 0.7% |
11 | 3 | 2.2% |
12 | 1 | 0.7% |
13 | 1 | 0.7% |
14 | 1 | 0.7% |
17 | 131 | |
24 | 1 | 0.7% |
Value | Count | Frequency (%) |
24 | 1 | 0.7% |
17 | 131 | |
14 | 1 | 0.7% |
13 | 1 | 0.7% |
12 | 1 | 0.7% |
11 | 3 | 2.2% |
10 | 1 | 0.7% |
CMPTT_GRAD
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 3.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.2 KiB |
72 | |
---|---|
71 | 4 |
67 | 2 |
68 | 1 |
69 | 1 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 1.4% |
Sample
1st row | 72 |
---|---|
2nd row | 72 |
3rd row | 72 |
4th row | 72 |
5th row | 72 |
Common Values
Value | Count | Frequency (%) |
72 | 131 | |
71 | 4 | 2.9% |
67 | 2 | 1.4% |
68 | 1 | 0.7% |
69 | 1 | 0.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
72 | 131 | |
71 | 4 | 2.9% |
67 | 2 | 1.4% |
68 | 1 | 0.7% |
69 | 1 | 0.7% |
TOTL_GRAD
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 3.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.2 KiB |
34 | |
---|---|
29 | 4 |
27 | 2 |
31 | 1 |
35 | 1 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 1.4% |
Sample
1st row | 34 |
---|---|
2nd row | 34 |
3rd row | 34 |
4th row | 34 |
5th row | 34 |
Common Values
Value | Count | Frequency (%) |
34 | 131 | |
29 | 4 | 2.9% |
27 | 2 | 1.4% |
31 | 1 | 0.7% |
35 | 1 | 0.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
34 | 131 | |
29 | 4 | 2.9% |
27 | 2 | 1.4% |
31 | 1 | 0.7% |
35 | 1 | 0.7% |
LIFE_INFRA | CMPTT_GRAD | TOTL_GRAD | |
---|---|---|---|
LIFE_INFRA | 1.000 | 0.975 | 0.987 |
CMPTT_GRAD | 0.975 | 1.000 | 0.970 |
TOTL_GRAD | 0.987 | 0.970 | 1.000 |
TOTL_GRAD | CMPTT_GRAD | |
---|---|---|
TOTL_GRAD | 1.000 | 0.752 |
CMPTT_GRAD | 0.752 | 1.000 |
LIFE_INFRA | CMPTT_GRAD | TOTL_GRAD | |
---|---|---|---|
LIFE_INFRA | 1.000 | 0.773 | 0.833 |
CMPTT_GRAD | 0.773 | 1.000 | 0.752 |
TOTL_GRAD | 0.833 | 0.752 | 1.000 |
GIRD_NO | PUL_GRAD | LIFE_INFRA | CMPTT_GRAD | TOTL_GRAD | |
---|---|---|---|---|---|
0 | 마마72ba30ab | 1 | 17 | 72 | 34 |
1 | 마마73aa30ba | 1 | 17 | 72 | 34 |
2 | 마마73aa30bb | 1 | 17 | 72 | 34 |
3 | 마마73ab30bb | 1 | 17 | 72 | 34 |
4 | 마마73aa31aa | 1 | 17 | 72 | 34 |
5 | 마마73ab31aa | 1 | 17 | 72 | 34 |
6 | 마마69ba29bb | 1 | 17 | 72 | 34 |
7 | 마마69ba30aa | 1 | 17 | 72 | 34 |
8 | 마마69ba30ab | 1 | 17 | 72 | 34 |
9 | 마마69ba30ba | 1 | 17 | 72 | 34 |
GIRD_NO | PUL_GRAD | LIFE_INFRA | CMPTT_GRAD | TOTL_GRAD | |
---|---|---|---|---|---|
129 | 마마72ab31aa | 1 | 17 | 72 | 34 |
130 | 마마72ab31ab | 1 | 17 | 72 | 34 |
131 | 마마72ab31ba | 1 | 17 | 72 | 34 |
132 | 마마72ab31bb | 1 | 17 | 72 | 34 |
133 | 마마72ba30ba | 1 | 17 | 72 | 34 |
134 | 마마72ba30bb | 1 | 17 | 72 | 34 |
135 | 마마72ba31aa | 1 | 17 | 72 | 34 |
136 | 마마72bb30ba | 1 | 17 | 72 | 34 |
137 | 마마72bb30bb | 1 | 17 | 72 | 34 |
138 | 마마72bb31aa | 1 | 17 | 72 | 34 |