Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 3225 |
Missing cells | 3225 |
Missing cells (%) | 10.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 261.5 KiB |
Average record size in memory | 83.0 B |
Variable types
Text | 2 |
---|---|
Categorical | 6 |
Unsupported | 1 |
Numeric | 1 |
Dataset
Description | 충청남도 예산군_충청남도 예산군_도시계획정보시스템 개발행위허가필지도 데이터 베이스로 개발행위허가지역에 관련하여 도면, 면적, 시군 구 코드등이 담겨있음. |
---|---|
Author | 충청남도 예산군 |
URL | https://www.data.go.kr/data/15123962/fileData.do |
LCLAS_CL has constant value "" | Constant |
SIGNGU_SE has constant value "" | Constant |
CREATE_DAT has constant value "" | Constant |
ATRB_SE is highly overall correlated with MLSFC_CL and 1 other fields | High correlation |
MLSFC_CL is highly overall correlated with ATRB_SE and 1 other fields | High correlation |
DGM_NM is highly overall correlated with MLSFC_CL and 1 other fields | High correlation |
SCLAS_CL has 3225 (100.0%) missing values | Missing |
DGM_AR is highly skewed (γ1 = 26.47834231) | Skewed |
PRESENT_SN has unique values | Unique |
SCLAS_CL is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-12 04:45:13.135732 |
---|---|
Analysis finished | 2023-12-12 04:45:13.806732 |
Duration | 0.67 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
PRESENT_SN
Text
UNIQUE
 
Distinct | 3225 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.3 KiB |
Length
Max length | 24 |
---|---|
Median length | 24 |
Mean length | 24 |
Min length | 24 |
Characters and Unicode
Total characters | 77400 |
---|---|
Distinct characters | 14 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 3225 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 44810UQ174PS201004301796 |
---|---|
2nd row | 44810UQ174PS201004301506 |
3rd row | 44810UQ174PS201101013264 |
4th row | 44810UQ174PS201004302548 |
5th row | 44810UQ174PS201301013669 |
Value | Count | Frequency (%) |
44810uq174ps201004301796 | 1 | < 0.1% |
44810uq174ps201004301605 | 1 | < 0.1% |
44810uq174ps201004301414 | 1 | < 0.1% |
44810uq174ps201004300484 | 1 | < 0.1% |
44810uq174ps201101013085 | 1 | < 0.1% |
44810uq174ps201004301716 | 1 | < 0.1% |
44810uq174ps201004300704 | 1 | < 0.1% |
44810uq174ps201004300752 | 1 | < 0.1% |
44810uq174ps201004300461 | 1 | < 0.1% |
44810uq174ps201004301427 | 1 | < 0.1% |
Other values (3215) | 3215 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 16928 | |
1 | 13693 | |
4 | 13284 | |
2 | 5105 | 6.6% |
3 | 4233 | 5.5% |
7 | 4194 | 5.4% |
8 | 4163 | 5.4% |
U | 3225 | 4.2% |
Q | 3225 | 4.2% |
P | 3225 | 4.2% |
Other values (4) | 6125 | 7.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 64500 | |
Uppercase Letter | 12900 | 16.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 16928 | |
1 | 13693 | |
4 | 13284 | |
2 | 5105 | 7.9% |
3 | 4233 | 6.6% |
7 | 4194 | 6.5% |
8 | 4163 | 6.5% |
9 | 974 | 1.5% |
5 | 969 | 1.5% |
6 | 957 | 1.5% |
Uppercase Letter
Value | Count | Frequency (%) |
U | 3225 | |
Q | 3225 | |
P | 3225 | |
S | 3225 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 64500 | |
Latin | 12900 | 16.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 16928 | |
1 | 13693 | |
4 | 13284 | |
2 | 5105 | 7.9% |
3 | 4233 | 6.6% |
7 | 4194 | 6.5% |
8 | 4163 | 6.5% |
9 | 974 | 1.5% |
5 | 969 | 1.5% |
6 | 957 | 1.5% |
Latin
Value | Count | Frequency (%) |
U | 3225 | |
Q | 3225 | |
P | 3225 | |
S | 3225 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 77400 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 16928 | |
1 | 13693 | |
4 | 13284 | |
2 | 5105 | 6.6% |
3 | 4233 | 5.5% |
7 | 4194 | 5.4% |
8 | 4163 | 5.4% |
U | 3225 | 4.2% |
Q | 3225 | 4.2% |
P | 3225 | 4.2% |
Other values (4) | 6125 | 7.9% |
LCLAS_CL
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.3 KiB |
UQQA00 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | UQQA00 |
---|---|
2nd row | UQQA00 |
3rd row | UQQA00 |
4th row | UQQA00 |
5th row | UQQA00 |
Common Values
Value | Count | Frequency (%) |
UQQA00 | 3225 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
uqqa00 | 3225 |
MLSFC_CL
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.3 KiB |
UQQA20 | |
---|---|
UQQA40 | |
UQQA30 | 181 |
UQQA10 | 65 |
UQQA50 | 42 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | UQQA20 |
---|---|
2nd row | UQQA20 |
3rd row | UQQA40 |
4th row | UQQA20 |
5th row | UQQA20 |
Common Values
Value | Count | Frequency (%) |
UQQA20 | 2273 | |
UQQA40 | 664 | 20.6% |
UQQA30 | 181 | 5.6% |
UQQA10 | 65 | 2.0% |
UQQA50 | 42 | 1.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
uqqa20 | 2273 | |
uqqa40 | 664 | 20.6% |
uqqa30 | 181 | 5.6% |
uqqa10 | 65 | 2.0% |
uqqa50 | 42 | 1.3% |
SCLAS_CL
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 3225 |
---|---|
Missing (%) | 100.0% |
Memory size | 28.5 KiB |
ATRB_SE
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.3 KiB |
UQQA20 | |
---|---|
UQQA40 | |
UQQA30 | 181 |
UQQA10 | 65 |
UQQA50 | 42 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | UQQA20 |
---|---|
2nd row | UQQA20 |
3rd row | UQQA40 |
4th row | UQQA20 |
5th row | UQQA20 |
Common Values
Value | Count | Frequency (%) |
UQQA20 | 2273 | |
UQQA40 | 664 | 20.6% |
UQQA30 | 181 | 5.6% |
UQQA10 | 65 | 2.0% |
UQQA50 | 42 | 1.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
uqqa20 | 2273 | |
uqqa40 | 664 | 20.6% |
uqqa30 | 181 | 5.6% |
uqqa10 | 65 | 2.0% |
uqqa50 | 42 | 1.3% |
PERM_SE
Text
Distinct | 2308 |
---|---|
Distinct (%) | 71.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.3 KiB |
Length
Max length | 20 |
---|---|
Median length | 20 |
Mean length | 20 |
Min length | 20 |
Characters and Unicode
Total characters | 64500 |
---|---|
Distinct characters | 12 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1799 ? |
---|---|
Unique (%) | 55.8% |
Sample
1st row | 44810PPR201004301783 |
---|---|
2nd row | 44810PPR201004301567 |
3rd row | 44810PPR201101012044 |
4th row | 44810PPR201004304012 |
5th row | 44810PPR201301012305 |
Value | Count | Frequency (%) |
44810ppr201004302801 | 24 | 0.7% |
44810ppr201004301139 | 19 | 0.6% |
44810ppr201004301702 | 11 | 0.3% |
44810ppr201201012274 | 11 | 0.3% |
44810ppr201004301585 | 11 | 0.3% |
44810ppr201004301479 | 10 | 0.3% |
44810ppr201301012352 | 10 | 0.3% |
44810ppr201004301375 | 10 | 0.3% |
44810ppr201004301277 | 9 | 0.3% |
44810ppr201004301931 | 8 | 0.2% |
Other values (2298) | 3102 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 16793 | |
1 | 10491 | |
4 | 10030 | |
P | 6450 | 10.0% |
2 | 6063 | 9.4% |
8 | 4049 | 6.3% |
3 | 3540 | 5.5% |
R | 3225 | 5.0% |
9 | 1024 | 1.6% |
5 | 1010 | 1.6% |
Other values (2) | 1825 | 2.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 54825 | |
Uppercase Letter | 9675 | 15.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 16793 | |
1 | 10491 | |
4 | 10030 | |
2 | 6063 | 11.1% |
8 | 4049 | 7.4% |
3 | 3540 | 6.5% |
9 | 1024 | 1.9% |
5 | 1010 | 1.8% |
6 | 913 | 1.7% |
7 | 912 | 1.7% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 6450 | |
R | 3225 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 54825 | |
Latin | 9675 | 15.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 16793 | |
1 | 10491 | |
4 | 10030 | |
2 | 6063 | 11.1% |
8 | 4049 | 7.4% |
3 | 3540 | 6.5% |
9 | 1024 | 1.9% |
5 | 1010 | 1.8% |
6 | 913 | 1.7% |
7 | 912 | 1.7% |
Latin
Value | Count | Frequency (%) |
P | 6450 | |
R | 3225 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 64500 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 16793 | |
1 | 10491 | |
4 | 10030 | |
P | 6450 | 10.0% |
2 | 6063 | 9.4% |
8 | 4049 | 6.3% |
3 | 3540 | 5.5% |
R | 3225 | 5.0% |
9 | 1024 | 1.6% |
5 | 1010 | 1.6% |
Other values (2) | 1825 | 2.8% |
DGM_NM
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.3 KiB |
토지형질변경 | |
---|---|
토지분할 | |
토석채취 | 181 |
공작물설치 | 65 |
물건적치 | 42 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 5.4297674 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 토지형질변경 |
---|---|
2nd row | 토지형질변경 |
3rd row | 토지분할 |
4th row | 토지형질변경 |
5th row | 토지형질변경 |
Common Values
Value | Count | Frequency (%) |
토지형질변경 | 2273 | |
토지분할 | 664 | 20.6% |
토석채취 | 181 | 5.6% |
공작물설치 | 65 | 2.0% |
물건적치 | 42 | 1.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
토지형질변경 | 2273 | |
토지분할 | 664 | 20.6% |
토석채취 | 181 | 5.6% |
공작물설치 | 65 | 2.0% |
물건적치 | 42 | 1.3% |
DGM_AR
Real number (ℝ)
SKEWED
 
Distinct | 3213 |
---|---|
Distinct (%) | 99.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3745.5149 |
Minimum | 0.18 |
---|---|
Maximum | 1063463.9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 28.5 KiB |
Quantile statistics
Minimum | 0.18 |
---|---|
5-th percentile | 129.604 |
Q1 | 435.55 |
median | 881.3 |
Q3 | 2008.03 |
95-th percentile | 8275.514 |
Maximum | 1063463.9 |
Range | 1063463.7 |
Interquartile range (IQR) | 1572.48 |
Descriptive statistics
Standard deviation | 31190.576 |
---|---|
Coefficient of variation (CV) | 8.3274467 |
Kurtosis | 814.5134 |
Mean | 3745.5149 |
Median Absolute Deviation (MAD) | 563.08 |
Skewness | 26.478342 |
Sum | 12079286 |
Variance | 9.7285203 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
504.71 | 3 | 0.1% |
835.31 | 2 | 0.1% |
373.63 | 2 | 0.1% |
542.35 | 2 | 0.1% |
408.65 | 2 | 0.1% |
636.71 | 2 | 0.1% |
569.52 | 2 | 0.1% |
725.1 | 2 | 0.1% |
659.86 | 2 | 0.1% |
437.25 | 2 | 0.1% |
Other values (3203) | 3204 |
Value | Count | Frequency (%) |
0.18 | 1 | |
2.34 | 1 | |
3.65 | 1 | |
5.59 | 1 | |
5.95 | 1 | |
7.26 | 1 | |
7.48 | 1 | |
10.02 | 1 | |
11.04 | 1 | |
11.93 | 1 |
Value | Count | Frequency (%) |
1063463.88 | 1 | |
1023527.48 | 1 | |
550428.51 | 1 | |
435297.66 | 1 | |
297905.03 | 1 | |
282455.12 | 1 | |
255842.02 | 1 | |
254479.01 | 1 | |
152276.79 | 1 | |
147756.68 | 1 |
SIGNGU_SE
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.3 KiB |
44810 |
---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 44810 |
---|---|
2nd row | 44810 |
3rd row | 44810 |
4th row | 44810 |
5th row | 44810 |
Common Values
Value | Count | Frequency (%) |
44810 | 3225 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
44810 | 3225 |
CREATE_DAT
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 25.3 KiB |
2015-01-31 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2015-01-31 |
---|---|
2nd row | 2015-01-31 |
3rd row | 2015-01-31 |
4th row | 2015-01-31 |
5th row | 2015-01-31 |
Common Values
Value | Count | Frequency (%) |
2015-01-31 | 3225 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2015-01-31 | 3225 |
MLSFC_CL | ATRB_SE | DGM_NM | DGM_AR | |
---|---|---|---|---|
MLSFC_CL | 1.000 | 1.000 | 1.000 | 0.052 |
ATRB_SE | 1.000 | 1.000 | 1.000 | 0.052 |
DGM_NM | 1.000 | 1.000 | 1.000 | 0.052 |
DGM_AR | 0.052 | 0.052 | 0.052 | 1.000 |
ATRB_SE | MLSFC_CL | DGM_NM | |
---|---|---|---|
ATRB_SE | 1.000 | 1.000 | 1.000 |
MLSFC_CL | 1.000 | 1.000 | 1.000 |
DGM_NM | 1.000 | 1.000 | 1.000 |
DGM_AR | MLSFC_CL | ATRB_SE | DGM_NM | |
---|---|---|---|---|
DGM_AR | 1.000 | 0.035 | 0.035 | 0.035 |
MLSFC_CL | 0.035 | 1.000 | 1.000 | 1.000 |
ATRB_SE | 0.035 | 1.000 | 1.000 | 1.000 |
DGM_NM | 0.035 | 1.000 | 1.000 | 1.000 |
PRESENT_SN | LCLAS_CL | MLSFC_CL | SCLAS_CL | ATRB_SE | PERM_SE | DGM_NM | DGM_AR | SIGNGU_SE | CREATE_DAT | |
---|---|---|---|---|---|---|---|---|---|---|
0 | 44810UQ174PS201004301796 | UQQA00 | UQQA20 | <NA> | UQQA20 | 44810PPR201004301783 | 토지형질변경 | 1348.34 | 44810 | 2015-01-31 |
1 | 44810UQ174PS201004301506 | UQQA00 | UQQA20 | <NA> | UQQA20 | 44810PPR201004301567 | 토지형질변경 | 3653.39 | 44810 | 2015-01-31 |
2 | 44810UQ174PS201101013264 | UQQA00 | UQQA40 | <NA> | UQQA40 | 44810PPR201101012044 | 토지분할 | 911.52 | 44810 | 2015-01-31 |
3 | 44810UQ174PS201004302548 | UQQA00 | UQQA20 | <NA> | UQQA20 | 44810PPR201004304012 | 토지형질변경 | 537.62 | 44810 | 2015-01-31 |
4 | 44810UQ174PS201301013669 | UQQA00 | UQQA20 | <NA> | UQQA20 | 44810PPR201301012305 | 토지형질변경 | 2159.4 | 44810 | 2015-01-31 |
5 | 44810UQ174PS201301013670 | UQQA00 | UQQA20 | <NA> | UQQA20 | 44810PPR201301012305 | 토지형질변경 | 2214.76 | 44810 | 2015-01-31 |
6 | 44810UQ174PS201004302607 | UQQA00 | UQQA20 | <NA> | UQQA20 | 44810PPR201004302679 | 토지형질변경 | 114.66 | 44810 | 2015-01-31 |
7 | 44810UQ174PS201101013268 | UQQA00 | UQQA40 | <NA> | UQQA40 | 44810PPR201101012046 | 토지분할 | 255.14 | 44810 | 2015-01-31 |
8 | 44810UQ174PS201101013257 | UQQA00 | UQQA40 | <NA> | UQQA40 | 44810PPR201101012040 | 토지분할 | 1254.7 | 44810 | 2015-01-31 |
9 | 44810UQ174PS201004300193 | UQQA00 | UQQA20 | <NA> | UQQA20 | 44810PPR201004301535 | 토지형질변경 | 3096.18 | 44810 | 2015-01-31 |
PRESENT_SN | LCLAS_CL | MLSFC_CL | SCLAS_CL | ATRB_SE | PERM_SE | DGM_NM | DGM_AR | SIGNGU_SE | CREATE_DAT | |
---|---|---|---|---|---|---|---|---|---|---|
3215 | 44810UQ174PS201004303044 | UQQA00 | UQQA20 | <NA> | UQQA20 | 44810PPR201004302051 | 토지형질변경 | 651.6 | 44810 | 2015-01-31 |
3216 | 44810UQ174PS201004302667 | UQQA00 | UQQA20 | <NA> | UQQA20 | 44810PPR201004302739 | 토지형질변경 | 130.68 | 44810 | 2015-01-31 |
3217 | 44810UQ174PS201004300165 | UQQA00 | UQQA10 | <NA> | UQQA10 | 44810PPR201004301507 | 공작물설치 | 355.76 | 44810 | 2015-01-31 |
3218 | 44810UQ174PS201201013406 | UQQA00 | UQQA40 | <NA> | UQQA40 | 44810PPR201201012145 | 토지분할 | 2428.71 | 44810 | 2015-01-31 |
3219 | 44810UQ174PS201004302119 | UQQA00 | UQQA20 | <NA> | UQQA20 | 44810PPR201004302176 | 토지형질변경 | 1316.9 | 44810 | 2015-01-31 |
3220 | 44810UQ174PS201004301125 | UQQA00 | UQQA20 | <NA> | UQQA20 | 44810PPR201004301185 | 토지형질변경 | 578.51 | 44810 | 2015-01-31 |
3221 | 44810UQ174PS201004302490 | UQQA00 | UQQA40 | <NA> | UQQA40 | 44810PPR201004303059 | 토지분할 | 421.41 | 44810 | 2015-01-31 |
3222 | 44810UQ174PS201004301996 | UQQA00 | UQQA20 | <NA> | UQQA20 | 44810PPR201004300983 | 토지형질변경 | 210.32 | 44810 | 2015-01-31 |
3223 | 44810UQ174PS201004303030 | UQQA00 | UQQA40 | <NA> | UQQA40 | 44810PPR201004303030 | 토지분할 | 630.35 | 44810 | 2015-01-31 |
3224 | 44810UQ174PS201004302058 | UQQA00 | UQQA50 | <NA> | UQQA50 | 44810PPR201004302117 | 물건적치 | 2805.13 | 44810 | 2015-01-31 |