Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 1948 |
Missing cells | 1948 |
Missing cells (%) | 10.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 158.0 KiB |
Average record size in memory | 83.1 B |
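The derived figures in this table follow from the raw counts. The sketch below recomputes them from the values reported above, assuming that "cells" means variables times observations and that 1 KiB is 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the Dataset statistics table.
n_variables = 10
n_observations = 1948
missing_cells = 1948
total_size_kib = 158.0

# Assumption: the cell count is simply variables x observations.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The 1948 missing cells equal the number of observations, which matches the alert that a single column (도형 소분류코드) is entirely missing.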
Variable types
Text | 2 |
---|---|
Categorical | 5 |
Unsupported | 1 |
Numeric | 1 |
DateTime | 1 |
Dataset
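Description | Status of distribution and supply facilities in Hadong-gun as registered in the urban planning information system, with the following fields: status shape management number (현황도형 관리번호), shape major category code (도형 대분류코드), shape middle category code (도형 중분류코드), shape minor category code (도형 소분류코드), shape attribute code (도형 속성코드), (permit) plan management number ((허가)계획관리번호), label name (라벨명), area of shape (면적(도형)), city/county/district code (시군구코드), and status shape creation timestamp (현황도형 생성일시) |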
---|---|
Author | Hadong-gun, Gyeongsangnam-do (경상남도 하동군) |
URL | https://www.data.go.kr/data/15123809/fileData.do |
도형 대분류코드 has constant value "UQQA00" | Constant |
시군구코드 has constant value "48850" | Constant |
도형 속성코드 is highly overall correlated with 도형 중분류코드 and 1 other field | High correlation |
라벨명 is highly overall correlated with 도형 중분류코드 and 1 other field | High correlation |
도형 중분류코드 is highly overall correlated with 도형 속성코드 and 1 other field | High correlation |
도형 중분류코드 is highly imbalanced (69.4%) | Imbalance |
도형 속성코드 is highly imbalanced (66.2%) | Imbalance |
라벨명 is highly imbalanced (66.2%) | Imbalance |
도형 소분류코드 has 1948 (100.0%) missing values | Missing |
면적(도형) is highly skewed (γ1 = 22.18563398) | Skewed |
현황도형 관리번호 has unique values | Unique |
도형 소분류코드 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-12 15:03:52.513262 |
---|---|
Analysis finished | 2023-12-12 15:03:54.256013 |
Duration | 1.74 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
현황도형 관리번호
Text
UNIQUE
Distinct | 1948 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
Length
Max length | 20 |
---|---|
Median length | 20 |
Mean length | 20 |
Min length | 20 |
Characters and Unicode
Total characters | 38960 |
---|---|
Distinct characters | 12 |
Distinct categories | 2 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 1948 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 48850PPR200712181703 |
---|---|
2nd row | 48850PPR200711201704 |
3rd row | 48850PPR200707161696 |
4th row | 48850PPR200712031705 |
5th row | 48850PPR200708101699 |
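The sample values all share a fixed 20-character layout. The sketch below splits one into parts under an assumed layout inferred only from these samples: a 5-digit prefix matching 시군구코드, the literal tag `PPR`, an 8-digit `YYYYMMDD` date, and a 4-digit serial. The report does not document this layout, and `ParsedId` and `parse_management_id` are hypothetical names.

```python
from datetime import date, datetime
from typing import NamedTuple


class ParsedId(NamedTuple):
    sigungu_code: str    # assumed: first 5 characters
    type_tag: str        # assumed: next 3 characters ("PPR" in all samples)
    embedded_date: date  # assumed: next 8 characters read as YYYYMMDD
    serial: str          # assumed: last 4 characters


def parse_management_id(value: str) -> ParsedId:
    """Split a 20-character ID under the assumed layout."""
    if len(value) != 20:
        raise ValueError(f"expected 20 characters, got {len(value)}")
    return ParsedId(
        sigungu_code=value[:5],
        type_tag=value[5:8],
        embedded_date=datetime.strptime(value[8:16], "%Y%m%d").date(),
        serial=value[16:],
    )


parsed = parse_management_id("48850PPR200712181703")
print(parsed)
```

If the middle block is not actually a date, `strptime` raises `ValueError` for values that do not form a valid calendar day, which makes the assumption easy to check against the full column.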
Value | Count | Frequency (%) |
48850ppr200712181703 | 1 | 0.1% |
48850ppr201008100447 | 1 | 0.1% |
48850ppr201111150671 | 1 | 0.1% |
48850ppr201107250474 | 1 | 0.1% |
48850ppr201108170470 | 1 | 0.1% |
48850ppr201204100672 | 1 | 0.1% |
48850ppr201111300669 | 1 | 0.1% |
48850ppr201104280468 | 1 | 0.1% |
48850ppr201007140466 | 1 | 0.1% |
48850ppr201104280467 | 1 | 0.1% |
Other values (1938) | 1938 | 99.5% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8986 | 23.1% |
1 | 4982 | 12.8% |
8 | 4846 | 12.4% |
2 | 4137 | 10.6% |
P | 3896 | 10.0% |
5 | 3003 | 7.7% |
4 | 2885 | 7.4% |
R | 1948 | 5.0% |
7 | 1298 | 3.3% |
6 | 1074 | 2.8% |
Other values (2) | 1905 | 4.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 33116 | 85.0% |
Uppercase Letter | 5844 | 15.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 8986 | 27.1% |
1 | 4982 | 15.0% |
8 | 4846 | 14.6% |
2 | 4137 | 12.5% |
5 | 3003 | 9.1% |
4 | 2885 | 8.7% |
7 | 1298 | 3.9% |
6 | 1074 | 3.2% |
3 | 1058 | 3.2% |
9 | 847 | 2.6% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 3896 | 66.7% |
R | 1948 | 33.3% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 33116 | 85.0% |
Latin | 5844 | 15.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 8986 | 27.1% |
1 | 4982 | 15.0% |
8 | 4846 | 14.6% |
2 | 4137 | 12.5% |
5 | 3003 | 9.1% |
4 | 2885 | 8.7% |
7 | 1298 | 3.9% |
6 | 1074 | 3.2% |
3 | 1058 | 3.2% |
9 | 847 | 2.6% |
Latin
Value | Count | Frequency (%) |
P | 3896 | 66.7% |
R | 1948 | 33.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 38960 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8986 | 23.1% |
1 | 4982 | 12.8% |
8 | 4846 | 12.4% |
2 | 4137 | 10.6% |
P | 3896 | 10.0% |
5 | 3003 | 7.7% |
4 | 2885 | 7.4% |
R | 1948 | 5.0% |
7 | 1298 | 3.3% |
6 | 1074 | 2.8% |
Other values (2) | 1905 | 4.9% |
도형 대분류코드
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
UQQA00 | 1948 |
---|---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | UQQA00 |
---|---|
2nd row | UQQA00 |
3rd row | UQQA00 |
4th row | UQQA00 |
5th row | UQQA00 |
Common Values
Value | Count | Frequency (%) |
UQQA00 | 1948 | 100.0% |
Common Values (Plot)
Value | Count | Frequency (%) |
uqqa00 | 1948 | 100.0% |
도형 중분류코드
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 6 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
UQQA20 | 1593 |
---|---|
UQQA40 | 320 |
UQQA50 | 25 |
UQQA10 | 5 |
UQQA30 | 4 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | UQQA40 |
---|---|
2nd row | UQQA40 |
3rd row | UQQA20 |
4th row | UQQA40 |
5th row | UQQA40 |
Common Values
Value | Count | Frequency (%) |
UQQA20 | 1593 | 81.8% |
UQQA40 | 320 | 16.4% |
UQQA50 | 25 | 1.3% |
UQQA10 | 5 | 0.3% |
UQQA30 | 4 | 0.2% |
UQQA00 | 1 | 0.1% |
Common Values (Plot)
Value | Count | Frequency (%) |
uqqa20 | 1593 | 81.8% |
uqqa40 | 320 | 16.4% |
uqqa50 | 25 | 1.3% |
uqqa10 | 5 | 0.3% |
uqqa30 | 4 | 0.2% |
uqqa00 | 1 | 0.1% |
도형 소분류코드
Unsupported
MISSING, REJECTED, UNSUPPORTED
Missing | 1948 |
---|---|
Missing (%) | 100.0% |
Memory size | 17.2 KiB |
도형 속성코드
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 5 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
UQQA20 | 1594 |
---|---|
UQQA40 | 320 |
UQQA50 | 25 |
UQQA10 | 5 |
UQQA30 | 4 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | UQQA40 |
---|---|
2nd row | UQQA40 |
3rd row | UQQA20 |
4th row | UQQA40 |
5th row | UQQA40 |
Common Values
Value | Count | Frequency (%) |
UQQA20 | 1594 | 81.8% |
UQQA40 | 320 | 16.4% |
UQQA50 | 25 | 1.3% |
UQQA10 | 5 | 0.3% |
UQQA30 | 4 | 0.2% |
Common Values (Plot)
Value | Count | Frequency (%) |
uqqa20 | 1594 | 81.8% |
uqqa40 | 320 | 16.4% |
uqqa50 | 25 | 1.3% |
uqqa10 | 5 | 0.3% |
uqqa30 | 4 | 0.2% |
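The imbalance percentages in the alerts (66.2% for 도형 속성코드, 69.4% for 도형 중분류코드) are consistent with a score of one minus the Shannon entropy of the class frequencies, normalized by the maximum entropy for that number of classes. The sketch below assumes that definition; it is not taken from the report itself, and `imbalance_score` is an illustrative name. The counts come from the Common Values tables.

```python
import math


def imbalance_score(counts):
    """Return 1 - H(p) / log2(k) for k classes with the given counts.

    0.0 means perfectly balanced classes; values near 1.0 mean a single
    class dominates. This formula is an assumption about how the reported
    figure is defined.
    """
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)


# Counts from the Common Values tables.
attr_score = imbalance_score([1594, 320, 25, 5, 4])     # 도형 속성코드
mid_score = imbalance_score([1593, 320, 25, 5, 4, 1])   # 도형 중분류코드

print(f"도형 속성코드: {attr_score:.1%}")
print(f"도형 중분류코드: {mid_score:.1%}")
```

The same counts apply to 라벨명, which explains why it shares the 66.2% figure with 도형 속성코드.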
(허가)계획관리번호
Text
Distinct | 1676 |
---|---|
Distinct (%) | 86.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
Length
Max length | 20 |
---|---|
Median length | 20 |
Mean length | 20 |
Min length | 20 |
Characters and Unicode
Total characters | 38960 |
---|---|
Distinct characters | 12 |
Distinct categories | 2 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 1494 |
---|---|
Unique (%) | 76.7% |
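"Distinct" and "Unique" measure different things here: 1676 different values occur in the column, but only 1494 of them occur exactly once. The remaining 182 values (1676 − 1494) are shared across 454 rows (1948 − 1494). The sketch below illustrates the distinction on a small made-up list; the toy values are hypothetical and not drawn from the dataset.

```python
from collections import Counter

# Hypothetical toy column, not real data.
values = ["A001", "A002", "A002", "A003", "A003", "A003", "A004"]

counts = Counter(values)

# Distinct: how many different values appear at all.
distinct = len(counts)

# Unique: how many values appear exactly once.
unique = sum(1 for c in counts.values() if c == 1)

print(f"rows={len(values)} distinct={distinct} unique={unique}")
```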
Sample
1st row | 48850PPR200712180295 |
---|---|
2nd row | 48850PPR200711200315 |
3rd row | 48850PPR200707160182 |
4th row | 48850PPR200712030312 |
5th row | 48850PPR200708100419 |
Value | Count | Frequency (%) |
48850ppr201206250970 | 10 | 0.5% |
48850ppr201304110729 | 7 | 0.4% |
48850ppr201303281235 | 6 | 0.3% |
48850ppr201110080834 | 5 | 0.3% |
48850ppr201112190566 | 5 | 0.3% |
48850ppr201012100473 | 5 | 0.3% |
48850ppr201005280229 | 5 | 0.3% |
48850ppr201210041282 | 5 | 0.3% |
48850ppr201210191080 | 5 | 0.3% |
48850ppr201302261398 | 5 | 0.3% |
Other values (1666) | 1890 | 97.0% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 9613 | 24.7% |
8 | 4858 | 12.5% |
1 | 4377 | 11.2% |
2 | 4214 | 10.8% |
P | 3896 | 10.0% |
5 | 2983 | 7.7% |
4 | 2841 | 7.3% |
R | 1948 | 5.0% |
7 | 1241 | 3.2% |
3 | 1116 | 2.9% |
Other values (2) | 1873 | 4.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 33116 | 85.0% |
Uppercase Letter | 5844 | 15.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 9613 | 29.0% |
8 | 4858 | 14.7% |
1 | 4377 | 13.2% |
2 | 4214 | 12.7% |
5 | 2983 | 9.0% |
4 | 2841 | 8.6% |
7 | 1241 | 3.7% |
3 | 1116 | 3.4% |
6 | 1070 | 3.2% |
9 | 803 | 2.4% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 3896 | 66.7% |
R | 1948 | 33.3% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 33116 | 85.0% |
Latin | 5844 | 15.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 9613 | 29.0% |
8 | 4858 | 14.7% |
1 | 4377 | 13.2% |
2 | 4214 | 12.7% |
5 | 2983 | 9.0% |
4 | 2841 | 8.6% |
7 | 1241 | 3.7% |
3 | 1116 | 3.4% |
6 | 1070 | 3.2% |
9 | 803 | 2.4% |
Latin
Value | Count | Frequency (%) |
P | 3896 | 66.7% |
R | 1948 | 33.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 38960 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 9613 | 24.7% |
8 | 4858 | 12.5% |
1 | 4377 | 11.2% |
2 | 4214 | 10.8% |
P | 3896 | 10.0% |
5 | 2983 | 7.7% |
4 | 2841 | 7.3% |
R | 1948 | 5.0% |
7 | 1241 | 3.2% |
3 | 1116 | 2.9% |
Other values (2) | 1873 | 4.8% |
라벨명
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 5 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
토지형질변경 | 1594 |
---|---|
토지분할 | 320 |
물건적치 | 25 |
공작물설치 | 5 |
토석채취 | 4 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 5.639117 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 토지분할 |
---|---|
2nd row | 토지분할 |
3rd row | 토지형질변경 |
4th row | 토지분할 |
5th row | 토지분할 |
Common Values
Value | Count | Frequency (%) |
토지형질변경 | 1594 | 81.8% |
토지분할 | 320 | 16.4% |
물건적치 | 25 | 1.3% |
공작물설치 | 5 | 0.3% |
토석채취 | 4 | 0.2% |
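The mean length of 5.639117 reported for this variable can be recovered from the counts in the table above, since it is a count-weighted average of the label lengths. The sketch below assumes length is measured in Unicode code points on NFC-normalized text, so each Hangul syllable counts as one character; the variable names are illustrative.

```python
import unicodedata

# Label counts from the Common Values table.
label_counts = {
    "토지형질변경": 1594,
    "토지분할": 320,
    "물건적치": 25,
    "공작물설치": 5,
    "토석채취": 4,
}


def nfc_length(text: str) -> int:
    # Normalize so that each Hangul syllable is a single code point.
    return len(unicodedata.normalize("NFC", text))


total_rows = sum(label_counts.values())
total_chars = sum(nfc_length(label) * n for label, n in label_counts.items())
mean_length = total_chars / total_rows

print(f"rows={total_rows} chars={total_chars} mean={mean_length:.6f}")
```

Because more than half of the rows carry the 6-character label, the median and maximum lengths are both 6 while the minimum is 4.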
Common Values (Plot)
Value | Count | Frequency (%) |
토지형질변경 | 1594 | 81.8% |
토지분할 | 320 | 16.4% |
물건적치 | 25 | 1.3% |
공작물설치 | 5 | 0.3% |
토석채취 | 4 | 0.2% |
면적(도형)
Real number (ℝ)
SKEWED
Distinct | 1940 |
---|---|
Distinct (%) | 99.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5370.7096 |
Minimum | 12.06 |
---|---|
Maximum | 1358825.7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.2 KiB |
Quantile statistics
Minimum | 12.06 |
---|---|
5-th percentile | 111.957 |
Q1 | 400.11 |
median | 728.8 |
Q3 | 1695.615 |
95-th percentile | 18959.396 |
Maximum | 1358825.7 |
Range | 1358813.6 |
Interquartile range (IQR) | 1295.505 |
Descriptive statistics
Standard deviation | 46503.525 |
---|---|
Coefficient of variation (CV) | 8.6587301 |
Kurtosis | 550.02077 |
Mean | 5370.7096 |
Median Absolute Deviation (MAD) | 443.83 |
Skewness | 22.185634 |
Sum | 10462142 |
Variance | 2.1625778 × 10⁹ |
Monotonicity | Not monotonic |
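Several of the statistics above are simple functions of the others, which makes them easy to sanity-check. The sketch below recomputes the coefficient of variation, interquartile range, range, and variance from the reported values, using the standard definitions (CV = standard deviation / mean, IQR = Q3 − Q1, variance = standard deviation squared). Small differences from the table come from rounding in the reported inputs.

```python
# Values copied from the Quantile and Descriptive statistics tables.
mean = 5370.7096
median = 728.8
std = 46503.525
q1, q3 = 400.11, 1695.615
minimum, maximum = 12.06, 1358825.7

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
variance = std ** 2              # variance

print(f"CV       = {cv:.7f}")
print(f"IQR      = {iqr:.3f}")
print(f"Range    = {value_range:.1f}")
print(f"Variance = {variance:.7e}")
print(f"Mean / median = {mean / median:.2f}")
```

The mean is roughly seven times the median, which is what a strongly right-skewed distribution (skewness 22.19) would be expected to produce: a handful of very large areas pull the mean far above the typical value.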
Value | Count | Frequency (%) |
603.73 | 2 | 0.1% |
660.95 | 2 | 0.1% |
652.38 | 2 | 0.1% |
646.34 | 2 | 0.1% |
994.46 | 2 | 0.1% |
656.75 | 2 | 0.1% |
2059.63 | 2 | 0.1% |
803.75 | 2 | 0.1% |
2782.5 | 1 | 0.1% |
482.42 | 1 | 0.1% |
Other values (1930) | 1930 | 99.1% |
Value | Count | Frequency (%) |
12.06 | 1 | 0.1% |
12.44 | 1 | 0.1% |
13.37 | 1 | 0.1% |
14.22 | 1 | 0.1% |
14.25 | 1 | 0.1% |
15.1 | 1 | 0.1% |
15.56 | 1 | 0.1% |
17.59 | 1 | 0.1% |
19.99 | 1 | 0.1% |
21.67 | 1 | 0.1% |
Value | Count | Frequency (%) |
1358825.67 | 1 | 0.1% |
1049907.56 | 1 | 0.1% |
749147.03 | 1 | 0.1% |
610443.38 | 1 | 0.1% |
432037.44 | 1 | 0.1% |
137917.8 | 1 | 0.1% |
118140.12 | 1 | 0.1% |
101130.72 | 1 | 0.1% |
94740.34 | 1 | 0.1% |
89059.86 | 1 | 0.1% |
시군구코드
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
48850 | 1948 |
---|---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 48850 |
---|---|
2nd row | 48850 |
3rd row | 48850 |
4th row | 48850 |
5th row | 48850 |
Common Values
Value | Count | Frequency (%) |
48850 | 1948 | 100.0% |
Common Values (Plot)
Value | Count | Frequency (%) |
48850 | 1948 | 100.0% |
현황도형 생성일시
Date
Distinct | 3 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.3 KiB |
Minimum | 2012-05-13 00:00:00 |
---|---|
Maximum | 2013-05-30 00:00:00 |
Variable | 도형 중분류코드 | 도형 속성코드 | 라벨명 | 면적(도형) | 현황도형 생성일시 |
---|---|---|---|---|---|
도형 중분류코드 | 1.000 | 1.000 | 1.000 | 0.000 | 0.977 |
도형 속성코드 | 1.000 | 1.000 | 1.000 | 0.000 | 0.473 |
라벨명 | 1.000 | 1.000 | 1.000 | 0.000 | 0.473 |
면적(도형) | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
현황도형 생성일시 | 0.977 | 0.473 | 0.473 | 0.000 | 1.000 |
Variable | 도형 속성코드 | 라벨명 | 도형 중분류코드 |
---|---|---|---|
도형 속성코드 | 1.000 | 1.000 | 1.000 |
라벨명 | 1.000 | 1.000 | 1.000 |
도형 중분류코드 | 1.000 | 1.000 | 1.000 |
Variable | 면적(도형) | 도형 중분류코드 | 도형 속성코드 | 라벨명 |
---|---|---|---|---|
면적(도형) | 1.000 | 0.000 | 0.000 | 0.000 |
도형 중분류코드 | 0.000 | 1.000 | 1.000 | 1.000 |
도형 속성코드 | 0.000 | 1.000 | 1.000 | 1.000 |
라벨명 | 0.000 | 1.000 | 1.000 | 1.000 |
Row index | 현황도형 관리번호 | 도형 대분류코드 | 도형 중분류코드 | 도형 소분류코드 | 도형 속성코드 | (허가)계획관리번호 | 라벨명 | 면적(도형) | 시군구코드 | 현황도형 생성일시 |
---|---|---|---|---|---|---|---|---|---|---|
0 | 48850PPR200712181703 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200712180295 | 토지분할 | 814.0 | 48850 | 2012-05-13 |
1 | 48850PPR200711201704 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200711200315 | 토지분할 | 1966.46 | 48850 | 2012-05-13 |
2 | 48850PPR200707161696 | UQQA00 | UQQA20 | <NA> | UQQA20 | 48850PPR200707160182 | 토지형질변경 | 4374.48 | 48850 | 2012-05-13 |
3 | 48850PPR200712031705 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200712030312 | 토지분할 | 183.81 | 48850 | 2012-05-13 |
4 | 48850PPR200708101699 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200708100419 | 토지분할 | 1069.39 | 48850 | 2012-05-13 |
5 | 48850PPR200710231702 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200710230520 | 토지분할 | 37130.65 | 48850 | 2012-05-13 |
6 | 48850PPR200708271701 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200708270409 | 토지분할 | 15895.82 | 48850 | 2012-05-13 |
7 | 48850PPR200712281698 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200712280276 | 토지분할 | 14183.77 | 48850 | 2012-05-13 |
8 | 48850PPR200707181706 | UQQA00 | UQQA20 | <NA> | UQQA20 | 48850PPR200707180179 | 토지형질변경 | 784.94 | 48850 | 2012-05-13 |
9 | 48850PPR200702051707 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200702050515 | 토지분할 | 537.29 | 48850 | 2012-05-13 |
Row index | 현황도형 관리번호 | 도형 대분류코드 | 도형 중분류코드 | 도형 소분류코드 | 도형 속성코드 | (허가)계획관리번호 | 라벨명 | 면적(도형) | 시군구코드 | 현황도형 생성일시 |
---|---|---|---|---|---|---|---|---|---|---|
1938 | 48850PPR200704241931 | UQQA00 | UQQA20 | <NA> | UQQA20 | 48850PPR200704240087 | 토지형질변경 | 247.99 | 48850 | 2012-05-13 |
1939 | 48850PPR200611151937 | UQQA00 | UQQA20 | <NA> | UQQA20 | 48850PPR200611150546 | 토지형질변경 | 658.44 | 48850 | 2012-05-13 |
1940 | 48850PPR200707301938 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200707300204 | 토지분할 | 2390.3 | 48850 | 2012-05-13 |
1941 | 48850PPR200709031939 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200709030402 | 토지분할 | 2456.36 | 48850 | 2012-05-13 |
1942 | 48850PPR200511101941 | UQQA00 | UQQA20 | <NA> | UQQA20 | 48850PPR200511100486 | 토지형질변경 | 780.14 | 48850 | 2012-05-13 |
1943 | 48850PPR200507261947 | UQQA00 | UQQA20 | <NA> | UQQA20 | 48850PPR200507260839 | 토지형질변경 | 664.16 | 48850 | 2012-05-13 |
1944 | 48850PPR200703121940 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200703120395 | 토지분할 | 788.39 | 48850 | 2012-05-13 |
1945 | 48850PPR200708081946 | UQQA00 | UQQA20 | <NA> | UQQA20 | 48850PPR200708080152 | 토지형질변경 | 665.84 | 48850 | 2012-05-13 |
1946 | 48850PPR200711131944 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200711130321 | 토지분할 | 9715.92 | 48850 | 2012-05-13 |
1947 | 48850PPR200708101873 | UQQA00 | UQQA40 | <NA> | UQQA40 | 48850PPR200708100422 | 토지분할 | 3218.98 | 48850 | 2012-05-13 |