Dataset statistics
Number of variables | 14 |
---|---|
Number of observations | 7975 |
Missing cells | 1 |
Missing cells (%) | < 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 903.5 KiB |
Average record size in memory | 116.0 B |
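The memory figures above are consistent with each other: dividing the total size by the number of observations gives the average record size. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes; the variable names are illustrative and not part of the report:

```python
# Values copied from the "Dataset statistics" table above.
n_variables = 14
n_observations = 7975
missing_cells = 1
total_size_kib = 903.5

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: the report uses binary units (1 KiB = 1024 B).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

A single missing cell out of all cells is far below 0.1%, which is why the report shows "< 0.1%" rather than a rounded zero.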
Variable types
Numeric | 4 |
---|---|
Text | 4 |
Categorical | 5 |
DateTime | 1 |
Dataset
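Description | 객체id (object ID), 현황도형 관리번호 (current-status shape management number), 도형 대분류코드 (shape major category code), 도형 중분류코드 (shape middle category code), 도형 소분류코드 (shape minor category code), 도형 속성코드 (shape attribute code), 도형 조서관리 코드 (shape record management code), 결정고시관리코드 (decision notice management code), 라벨명 (label name), 시군구코드 (city/county/district code), 도면번호 (drawing number), 현황도형 생성일시 (current-status shape creation datetime), 면적(도형) (shape area), 길이(도형) (shape length) |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |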
URL | https://data.seoul.go.kr/dataList/OA-21136/S/1/datasetView.do |
도형 속성코드 is highly overall correlated with 도형 대분류코드 and 2 other fields | High correlation |
도형 대분류코드 is highly overall correlated with 도형 중분류코드 and 1 other field | High correlation |
도형 소분류코드 is highly overall correlated with 도형 중분류코드 and 1 other field | High correlation |
도형 중분류코드 is highly overall correlated with 도형 대분류코드 and 2 other fields | High correlation |
면적(도형) is highly overall correlated with 길이(도형) | High correlation |
길이(도형) is highly overall correlated with 면적(도형) | High correlation |
도형 대분류코드 is highly imbalanced (67.4%) | Imbalance |
도형 중분류코드 is highly imbalanced (64.1%) | Imbalance |
도면번호 is highly imbalanced (99.4%) | Imbalance |
시군구코드 is highly skewed (γ1 = 62.7665626) | Skewed |
면적(도형) is highly skewed (γ1 = 21.24487173) | Skewed |
객체id has unique values | Unique |
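The imbalance percentages in the alerts above can be reproduced from the value counts listed later in this report. The sketch below assumes the score is one minus the Shannon entropy of the class distribution, normalized by the maximum entropy for that number of classes; that definition is an assumption here, and `imbalance_score` is an illustrative helper, not a ydata-profiling function.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy in bits of the counts."""
    k = len(counts)
    if k <= 1:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Value counts copied from the "Common Values" tables of this report.
major_code_counts = [6547, 607, 540, 227, 51, 1, 1, 1]  # 도형 대분류코드
drawing_no_counts = [7964, 4, 2, 1, 1, 1, 1, 1]         # 도면번호

print(f"{imbalance_score(major_code_counts):.1%}")
print(f"{imbalance_score(drawing_no_counts):.1%}")
```

Under this assumption the two scores round to the 67.4% and 99.4% shown in the alerts: a score near 100% means almost every row falls into a single category.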
Reproduction
Analysis started | 2024-05-11 01:46:11.735739 |
---|---|
Analysis finished | 2024-05-11 01:46:22.337446 |
Duration | 10.6 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
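The reported duration is simply the difference between the two timestamps above. A short sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table above.
started = datetime.fromisoformat("2024-05-11 01:46:11.735739")
finished = datetime.fromisoformat("2024-05-11 01:46:22.337446")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.1f} seconds")
```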
객체id
Real number (ℝ)
UNIQUE
 
Distinct | 7975 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 298377.94 |
Minimum | 294343 |
---|---|
Maximum | 302365 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 70.2 KiB |
Quantile statistics
Minimum | 294343 |
---|---|
5-th percentile | 294789.7 |
Q1 | 296384.5 |
median | 298378 |
Q3 | 300371.5 |
95-th percentile | 301966.3 |
Maximum | 302365 |
Range | 8022 |
Interquartile range (IQR) | 3987 |
Descriptive statistics
Standard deviation | 2302.4333 |
---|---|
Coefficient of variation (CV) | 0.0077164997 |
Kurtosis | -1.1997763 |
Mean | 298377.94 |
Median Absolute Deviation (MAD) | 1994 |
Skewness | -0.00015911514 |
Sum | 2.3795641 × 10^9 |
Variance | 5301199 |
Monotonicity | Strictly increasing |
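Several of the statistics above are derived from the others. The sketch below recomputes the range, interquartile range, coefficient of variation, and variance from the reported minimum, maximum, quartiles, mean, and standard deviation; small differences in the last digits are expected because the inputs are already rounded.

```python
# Values copied from the 객체id statistics above.
minimum, maximum = 294343, 302365
q1, q3 = 296384.5, 300371.5
mean, std = 298377.94, 2302.4333

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range
cv = std / mean                  # Coefficient of variation
variance = std ** 2              # Variance

print(value_range, iqr, round(cv, 10), round(variance))
```

The same relationships hold for the other numeric variables in this report, for example CV = 571030.2 / 85201.138 for 면적(도형).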
Value | Count | Frequency (%) |
294343 | 1 | < 0.1% |
299703 | 1 | < 0.1% |
299716 | 1 | < 0.1% |
299715 | 1 | < 0.1% |
299714 | 1 | < 0.1% |
299713 | 1 | < 0.1% |
299712 | 1 | < 0.1% |
299711 | 1 | < 0.1% |
299710 | 1 | < 0.1% |
299709 | 1 | < 0.1% |
Other values (7965) | 7965 |
Value | Count | Frequency (%) |
294343 | 1 | |
294344 | 1 | |
294345 | 1 | |
294346 | 1 | |
294347 | 1 | |
294348 | 1 | |
294349 | 1 | |
294350 | 1 | |
294351 | 1 | |
294352 | 1 |
Value | Count | Frequency (%) |
302365 | 1 | |
302364 | 1 | |
302363 | 1 | |
302362 | 1 | |
302361 | 1 | |
302360 | 1 | |
302359 | 1 | |
302358 | 1 | |
302357 | 1 | |
302356 | 1 |
현황도형 관리번호
Text
Distinct | 7885 |
---|---|
Distinct (%) | 98.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 62.4 KiB |
Length
Max length | 24 |
---|---|
Median length | 24 |
Mean length | 24 |
Min length | 24 |
Characters and Unicode
Total characters | 191400 |
---|---|
Distinct characters | 14 |
Distinct categories | 2 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 7845 |
---|---|
Unique (%) | 98.4% |
Sample
1st row | 11000UQ111PS201912151888 |
---|---|
2nd row | 11000UQ111PS201912152144 |
3rd row | 11000UQ111PS201912152145 |
4th row | 11000UQ111PS201912151968 |
5th row | 11000UQ111PS201912152024 |
Value | Count | Frequency (%) |
11000uq111ps201912153295 | 13 | 0.2% |
11000uq111ps201912150878 | 10 | 0.1% |
11000uq111ps201910160083 | 7 | 0.1% |
11000uq111ps201910160084 | 7 | 0.1% |
11000uq111ps201912153498 | 6 | 0.1% |
11000uq111ps201912155654 | 5 | 0.1% |
11000uq111ps201912154890 | 5 | 0.1% |
11000uq111ps201912155790 | 4 | 0.1% |
11000uq111ps201912154675 | 4 | 0.1% |
11000uq111ps201912155655 | 4 | 0.1% |
Other values (7875) | 7910 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 63763 | |
0 | 40633 | |
2 | 20930 | 10.9% |
5 | 9247 | 4.8% |
9 | 8659 | 4.5% |
U | 7975 | 4.2% |
Q | 7975 | 4.2% |
P | 7975 | 4.2% |
S | 7975 | 4.2% |
3 | 3752 | 2.0% |
Other values (4) | 12516 | 6.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 159500 | |
Uppercase Letter | 31900 | 16.7% |
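The category split above follows from the fixed 24-character layout of each identifier. As a sketch, Python's standard `unicodedata` module (used here as a stand-in; which Unicode library the profiler itself uses is not stated in this report) classifies the first sample row into 20 decimal digits (`Nd`) and 4 uppercase letters (`Lu`):

```python
import unicodedata
from collections import Counter

sample_id = "11000UQ111PS201912151888"  # 1st sample row of 현황도형 관리번호

# Count characters per Unicode general category.
category_counts = Counter(unicodedata.category(ch) for ch in sample_id)
print(dict(category_counts))

# Scaling the per-row counts by the 7975 observations gives the table totals.
n_observations = 7975
digits_total = category_counts["Nd"] * n_observations
letters_total = category_counts["Lu"] * n_observations
print(digits_total, letters_total)
```

The totals match the table only because every value has the same length and layout (min, median, and max length are all 24).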
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 63763 | |
0 | 40633 | |
2 | 20930 | 13.1% |
5 | 9247 | 5.8% |
9 | 8659 | 5.4% |
3 | 3752 | 2.4% |
7 | 3551 | 2.2% |
4 | 3419 | 2.1% |
6 | 3196 | 2.0% |
8 | 2350 | 1.5% |
Uppercase Letter
Value | Count | Frequency (%) |
U | 7975 | |
Q | 7975 | |
P | 7975 | |
S | 7975 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 159500 | |
Latin | 31900 | 16.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 63763 | |
0 | 40633 | |
2 | 20930 | 13.1% |
5 | 9247 | 5.8% |
9 | 8659 | 5.4% |
3 | 3752 | 2.4% |
7 | 3551 | 2.2% |
4 | 3419 | 2.1% |
6 | 3196 | 2.0% |
8 | 2350 | 1.5% |
Latin
Value | Count | Frequency (%) |
U | 7975 | |
Q | 7975 | |
P | 7975 | |
S | 7975 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 191400 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 63763 | |
0 | 40633 | |
2 | 20930 | 10.9% |
5 | 9247 | 4.8% |
9 | 8659 | 4.5% |
U | 7975 | 4.2% |
Q | 7975 | 4.2% |
P | 7975 | 4.2% |
S | 7975 | 4.2% |
3 | 3752 | 2.0% |
Other values (4) | 12516 | 6.5% |
도형 대분류코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 62.4 KiB |
UQA100 | 6547 |
---|---|
UQA400 | 607 |
UQA200 | 540 |
UQA999 | 227 |
UQA300 | 51 |
Other values (3) | 3 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 5.999373 |
Min length | 1 |
Unique
Unique | 3 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | UQA100 |
---|---|
2nd row | UQA100 |
3rd row | UQA100 |
4th row | UQA100 |
5th row | UQA100 |
Common Values
Value | Count | Frequency (%) |
UQA100 | 6547 | 82.1% |
UQA400 | 607 | 7.6% |
UQA200 | 540 | 6.8% |
UQA999 | 227 | 2.8% |
UQA300 | 51 | 0.6% |
(blank) | 1 | < 0.1% |
UQ1200 | 1 | < 0.1% |
UQA120 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
uqa100 | 6547 | |
uqa400 | 607 | 7.6% |
uqa200 | 540 | 6.8% |
uqa999 | 227 | 2.8% |
uqa300 | 51 | 0.6% |
uq1200 | 1 | < 0.1% |
uqa120 | 1 | < 0.1% |
도형 중분류코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 62.4 KiB |
UQA120 | 5950 |
---|---|
UQA430 | 589 |
UQA130 | 539 |
UQA220 | 476 |
(blank) | 228 |
Other values (11) | 193 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 5.8570533 |
Min length | 1 |
Unique
Unique | 4 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | UQA120 |
---|---|
2nd row | UQA120 |
3rd row | UQA120 |
4th row | UQA120 |
5th row | UQA120 |
Common Values
Value | Count | Frequency (%) |
UQA120 | 5950 | 74.6% |
UQA430 | 589 | 7.4% |
UQA130 | 539 | 6.8% |
UQA220 | 476 | 6.0% |
(blank) | 228 | 2.9% |
UQA330 | 51 | 0.6% |
UQA230 | 49 | 0.6% |
UQA110 | 48 | 0.6% |
UQA420 | 17 | 0.2% |
UQA240 | 15 | 0.2% |
Other values (6) | 13 | 0.2% |
Length
Value | Count | Frequency (%) |
uqa120 | 5950 | |
uqa430 | 589 | 7.6% |
uqa130 | 539 | 7.0% |
uqa220 | 476 | 6.1% |
uqa330 | 51 | 0.7% |
uqa230 | 49 | 0.6% |
uqa110 | 48 | 0.6% |
uqa420 | 17 | 0.2% |
uqa240 | 15 | 0.2% |
uqa190 | 6 | 0.1% |
Other values (5) | 7 | 0.1% |
도형 소분류코드
Categorical
HIGH CORRELATION
 
Distinct | 13 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 62.4 KiB |
(blank) | 1971 |
---|---|
UQA122 | 1898 |
UQA121 | 1484 |
UQA124 | 1361 |
UQA123 | 1203 |
Other values (8) | 58 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 4.7640125 |
Min length | 1 |
Unique
Unique | 3 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | UQA124 |
---|---|
2nd row | UQA124 |
3rd row | UQA124 |
4th row | UQA124 |
5th row | UQA124 |
Common Values
Value | Count | Frequency (%) |
(blank) | 1971 | 24.7% |
UQA122 | 1898 | 23.8% |
UQA121 | 1484 | 18.6% |
UQA124 | 1361 | 17.1% |
UQA123 | 1203 | 15.1% |
UQA111 | 41 | 0.5% |
UQA129 | 5 | 0.1% |
UQA112 | 4 | 0.1% |
UQA119 | 3 | < 0.1% |
UQA220 | 2 | < 0.1% |
Other values (3) | 3 | < 0.1% |
Length
Value | Count | Frequency (%) |
uqa122 | 1898 | |
uqa121 | 1484 | |
uqa124 | 1361 | |
uqa123 | 1203 | |
uqa111 | 41 | 0.7% |
uqa129 | 5 | 0.1% |
uqa112 | 4 | 0.1% |
uqa119 | 3 | < 0.1% |
uqa220 | 2 | < 0.1% |
na | 1 | < 0.1% |
Other values (2) | 2 | < 0.1% |
도형 속성코드
Categorical
HIGH CORRELATION
 
Distinct | 21 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 62.4 KiB |
UQA122 | 1897 |
---|---|
UQA121 | 1485 |
UQA124 | 1361 |
UQA123 | 1204 |
UQA430 | 589 |
Other values (16) | 1439 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 5.999373 |
Min length | 1 |
Unique
Unique | 3 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | UQA124 |
---|---|
2nd row | UQA124 |
3rd row | UQA124 |
4th row | UQA124 |
5th row | UQA124 |
Common Values
Value | Count | Frequency (%) |
UQA122 | 1897 | 23.8% |
UQA121 | 1485 | 18.6% |
UQA124 | 1361 | 17.1% |
UQA123 | 1204 | 15.1% |
UQA430 | 589 | 7.4% |
UQA130 | 539 | 6.8% |
UQA220 | 476 | 6.0% |
UQA999 | 227 | 2.8% |
UQA330 | 51 | 0.6% |
UQA230 | 49 | 0.6% |
Other values (11) | 97 | 1.2% |
Length
Value | Count | Frequency (%) |
uqa122 | 1897 | |
uqa121 | 1485 | |
uqa124 | 1361 | |
uqa123 | 1204 | |
uqa430 | 589 | 7.4% |
uqa130 | 539 | 6.8% |
uqa220 | 476 | 6.0% |
uqa999 | 227 | 2.8% |
uqa330 | 51 | 0.6% |
uqa230 | 49 | 0.6% |
Other values (10) | 96 | 1.2% |
도형 조서관리 코드
Text
Distinct | 2257 |
---|---|
Distinct (%) | 28.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 62.4 KiB |
Length
Max length | 20 |
---|---|
Median length | 20 |
Mean length | 20 |
Min length | 20 |
Characters and Unicode
Total characters | 159500 |
---|---|
Distinct characters | 14 |
Distinct categories | 2 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 2173 |
---|---|
Unique (%) | 27.2% |
Sample
1st row | 11000ARZ000000001111 |
---|---|
2nd row | 11000ARZ000000001111 |
3rd row | 11000ARZ000000001111 |
4th row | 11000ARZ000000001111 |
5th row | 11000ARZ000000001111 |
Value | Count | Frequency (%) |
11000arz000000001111 | 5557 | |
11650arz202105210001 | 10 | 0.1% |
11000arz201306242004 | 10 | 0.1% |
11410arz202203020003 | 8 | 0.1% |
11560arz201908050002 | 7 | 0.1% |
11560arz201908050006 | 7 | 0.1% |
11000arz201807051156 | 5 | 0.1% |
11000arz202203290001 | 5 | 0.1% |
11000arz200209301219 | 4 | 0.1% |
11000arz202001280002 | 4 | 0.1% |
Other values (2247) | 2358 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 76747 | |
1 | 43322 | |
A | 7975 | 5.0% |
Z | 7975 | 5.0% |
R | 7974 | 5.0% |
2 | 6201 | 3.9% |
3 | 1701 | 1.1% |
5 | 1489 | 0.9% |
9 | 1402 | 0.9% |
4 | 1391 | 0.9% |
Other values (4) | 3323 | 2.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 135575 | |
Uppercase Letter | 23925 | 15.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 76747 | |
1 | 43322 | |
2 | 6201 | 4.6% |
3 | 1701 | 1.3% |
5 | 1489 | 1.1% |
9 | 1402 | 1.0% |
4 | 1391 | 1.0% |
8 | 1152 | 0.8% |
6 | 1146 | 0.8% |
7 | 1024 | 0.8% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 7975 | |
Z | 7975 | |
R | 7974 | |
G | 1 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 135575 | |
Latin | 23925 | 15.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 76747 | |
1 | 43322 | |
2 | 6201 | 4.6% |
3 | 1701 | 1.3% |
5 | 1489 | 1.1% |
9 | 1402 | 1.0% |
4 | 1391 | 1.0% |
8 | 1152 | 0.8% |
6 | 1146 | 0.8% |
7 | 1024 | 0.8% |
Latin
Value | Count | Frequency (%) |
A | 7975 | |
Z | 7975 | |
R | 7974 | |
G | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 159500 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 76747 | |
1 | 43322 | |
A | 7975 | 5.0% |
Z | 7975 | 5.0% |
R | 7974 | 5.0% |
2 | 6201 | 3.9% |
3 | 1701 | 1.1% |
5 | 1489 | 0.9% |
9 | 1402 | 0.9% |
4 | 1391 | 0.9% |
Other values (4) | 3323 | 2.1% |
결정고시관리코드
Text
Distinct | 1342 |
---|---|
Distinct (%) | 16.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 62.4 KiB |
Value | Count | Frequency (%) |
11000ntc200101270199 | 23 | 1.0% |
11560ntc201908050001 | 18 | 0.8% |
11260ntc202112160004 | 16 | 0.7% |
11000ntc201304256817 | 11 | 0.5% |
11000ntc202204270002 | 11 | 0.5% |
11000ntc202106250006 | 10 | 0.4% |
11110ntc201908130003 | 10 | 0.4% |
11000ntc201004224815 | 10 | 0.4% |
11000ntc202203290001 | 9 | 0.4% |
11530ntc202205180003 | 9 | 0.4% |
Other values (1331) | 2270 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 15801 | 29.5% |
1 | 9236 | 17.3% |
2 | 6177 | 11.5% |
(space) | 5578 | 10.4% |
T | 2401 | 4.5% |
C | 2397 | 4.5% |
N | 2393 | 4.5% |
3 | 1654 | 3.1% |
9 | 1392 | 2.6% |
6 | 1386 | 2.6% |
Other values (4) | 5103 | 9.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 40749 | |
Uppercase Letter | 7191 | 13.4% |
Space Separator | 5578 | 10.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 15801 | |
1 | 9236 | |
2 | 6177 | 15.2% |
3 | 1654 | 4.1% |
9 | 1392 | 3.4% |
6 | 1386 | 3.4% |
7 | 1319 | 3.2% |
8 | 1307 | 3.2% |
5 | 1277 | 3.1% |
4 | 1200 | 2.9% |
Uppercase Letter
Value | Count | Frequency (%) |
T | 2401 | |
C | 2397 | |
N | 2393 |
Space Separator
Value | Count | Frequency (%) |
(space) | 5578 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 46327 | |
Latin | 7191 | 13.4% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 15801 | 34.1% |
1 | 9236 | 19.9% |
2 | 6177 | 13.3% |
(space) | 5578 | 12.0% |
3 | 1654 | 3.6% |
9 | 1392 | 3.0% |
6 | 1386 | 3.0% |
7 | 1319 | 2.8% |
8 | 1307 | 2.8% |
5 | 1277 | 2.8% |
Latin
Value | Count | Frequency (%) |
T | 2401 | |
C | 2397 | |
N | 2393 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 53518 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 15801 | 29.5% |
1 | 9236 | 17.3% |
2 | 6177 | 11.5% |
(space) | 5578 | 10.4% |
T | 2401 | 4.5% |
C | 2397 | 4.5% |
N | 2393 | 4.5% |
3 | 1654 | 3.1% |
9 | 1392 | 2.6% |
6 | 1386 | 2.6% |
Other values (4) | 5103 | 9.5% |
라벨명
Text
Distinct | 74 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 62.4 KiB |
Length
Max length | 19 |
---|---|
Median length | 9 |
Mean length | 9.2599373 |
Min length | 1 |
Characters and Unicode
Total characters | 73848 |
---|---|
Distinct characters | 80 |
Distinct categories | 8 |
Distinct scripts | 3 |
Distinct blocks | 5 |
Unique
Unique | 47 |
---|---|
Unique (%) | 0.6% |
Sample
1st row | 제2종일반주거지역(7층이하) |
---|---|
2nd row | 제2종일반주거지역(7층이하) |
3rd row | 제2종일반주거지역(7층이하) |
4th row | 제2종일반주거지역(7층이하) |
5th row | 제2종일반주거지역(7층이하) |
Value | Count | Frequency (%) |
제2종일반주거지역 | 1841 | |
제1종일반주거지역 | 1477 | |
제2종일반주거지역(7층이하 | 1261 | |
제3종일반주거지역 | 1202 | |
자연녹지지역 | 586 | 7.1% |
준주거지역 | 544 | 6.6% |
일반상업지역 | 470 | 5.7% |
기타 | 227 | 2.8% |
도시지역 | 227 | 2.8% |
제2종일반주거지역(7층 | 76 | 0.9% |
Other values (63) | 310 | 3.8% |
Most occurring characters
Value | Count | Frequency (%) |
지 | 8575 | |
역 | 7972 | |
주 | 6542 | |
거 | 6541 | |
일 | 6420 | |
반 | 6420 | |
종 | 5987 | |
제 | 5977 | |
2 | 3320 | 4.5% |
1 | 1590 | 2.2% |
Other values (70) | 14504 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 63270 | |
Decimal Number | 7475 | 10.1% |
Open Punctuation | 1408 | 1.9% |
Close Punctuation | 1408 | 1.9% |
Space Separator | 248 | 0.3% |
Other Punctuation | 28 | < 0.1% |
Dash Punctuation | 10 | < 0.1% |
Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
지 | 8575 | |
역 | 7972 | |
주 | 6542 | |
거 | 6541 | |
일 | 6420 | |
반 | 6420 | |
종 | 5987 | |
제 | 5977 | |
층 | 1408 | 2.2% |
이 | 1294 | 2.0% |
Other values (59) | 6134 |
Decimal Number
Value | Count | Frequency (%) |
2 | 3320 | |
1 | 1590 | |
7 | 1356 | |
3 | 1208 | 16.2% |
5 | 1 | < 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1408 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1408 |
Space Separator
Value | Count | Frequency (%) |
(space) | 248 |
Other Punctuation
Value | Count | Frequency (%) |
? | 28 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 63258 | |
Common | 10578 | 14.3% |
Han | 12 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
지 | 8575 | |
역 | 7972 | |
주 | 6542 | |
거 | 6541 | |
일 | 6420 | |
반 | 6420 | |
종 | 5987 | |
제 | 5977 | |
층 | 1408 | 2.2% |
이 | 1294 | 2.0% |
Other values (55) | 6122 |
Common
Value | Count | Frequency (%) |
2 | 3320 | 31.4% |
1 | 1590 | 15.0% |
( | 1408 | 13.3% |
) | 1408 | 13.3% |
7 | 1356 | 12.8% |
3 | 1208 | 11.4% |
(space) | 248 | 2.3% |
? | 28 | 0.3% |
- | 10 | 0.1% |
5 | 1 | < 0.1% |
Han
Value | Count | Frequency (%) |
二 | 4 | |
吏 | 4 | |
醫 | 3 | |
以 | 1 | 8.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 63258 | |
ASCII | 10550 | 14.3% |
None | 28 | < 0.1% |
CJK | 8 | < 0.1% |
CJK Compat Ideographs | 4 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
지 | 8575 | |
역 | 7972 | |
주 | 6542 | |
거 | 6541 | |
일 | 6420 | |
반 | 6420 | |
종 | 5987 | |
제 | 5977 | |
층 | 1408 | 2.2% |
이 | 1294 | 2.0% |
Other values (55) | 6122 |
ASCII
Value | Count | Frequency (%) |
2 | 3320 | 31.5% |
1 | 1590 | 15.1% |
( | 1408 | 13.3% |
) | 1408 | 13.3% |
7 | 1356 | 12.9% |
3 | 1208 | 11.5% |
(space) | 248 | 2.4% |
- | 10 | 0.1% |
5 | 1 | < 0.1% |
_ | 1 | < 0.1% |
None
Value | Count | Frequency (%) |
? | 28 |
CJK
Value | Count | Frequency (%) |
二 | 4 | |
醫 | 3 | |
以 | 1 | 12.5% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
吏 | 4 |
시군구코드
Real number (ℝ)
SKEWED
 
Distinct | 27 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11039.242 |
Minimum | 11000 |
---|---|
Maximum | 99999 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 70.2 KiB |
Quantile statistics
Minimum | 11000 |
---|---|
5-th percentile | 11000 |
Q1 | 11000 |
median | 11000 |
Q3 | 11000 |
95-th percentile | 11000 |
Maximum | 99999 |
Range | 88999 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1411.8042 |
---|---|
Coefficient of variation (CV) | 0.12788959 |
Kurtosis | 3953.9175 |
Mean | 11039.242 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 62.766563 |
Sum | 88037958 |
Variance | 1993191 |
Monotonicity | Not monotonic |
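The SKEWED alert on this column comes from a small number of large codes (two rows hold 99999) in a column that is otherwise almost constant at 11000. The sketch below illustrates the effect on a toy sample, using the adjusted Fisher-Pearson sample skewness; it is assumed, not confirmed by this report, that the profiler's γ1 is computed this way. `sample_skewness` and the toy data are illustrative only and do not reproduce the reported value of 62.77.

```python
import math

def sample_skewness(values):
    """Adjusted Fisher-Pearson skewness G1 = sqrt(n(n-1)) / (n-2) * m3 / m2**1.5."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three values")
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    if m2 == 0:
        return 0.0
    return math.sqrt(n * (n - 1)) / (n - 2) * m3 / m2 ** 1.5

# Toy sample: 98 rows at the dominant code plus 2 rows at the sentinel-like 99999.
toy_codes = [11000] * 98 + [99999] * 2
symmetric = [1, 2, 3, 4, 5]

print(sample_skewness(toy_codes))   # strongly positive
print(sample_skewness(symmetric))   # zero for a symmetric sample
```

The rarer the extreme value is relative to the sample size, the larger the skewness becomes, which is consistent with the much higher figure reported for the full 7975-row column.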
Value | Count | Frequency (%) |
11000 | 7627 | 95.6% |
11590 | 37 | 0.5% |
11560 | 36 | 0.5% |
11170 | 28 | 0.4% |
11380 | 25 | 0.3% |
11230 | 22 | 0.3% |
11110 | 22 | 0.3% |
11440 | 20 | 0.3% |
11140 | 16 | 0.2% |
11620 | 16 | 0.2% |
Other values (17) | 126 | 1.6% |
Value | Count | Frequency (%) |
11000 | 7627 | 95.6% |
11110 | 22 | 0.3% |
11140 | 16 | 0.2% |
11170 | 28 | 0.4% |
11200 | 12 | 0.2% |
11215 | 15 | 0.2% |
11230 | 22 | 0.3% |
11260 | 5 | 0.1% |
11290 | 15 | 0.2% |
11305 | 10 | 0.1% |
Value | Count | Frequency (%) |
99999 | 2 | < 0.1% |
11740 | 1 | < 0.1% |
11710 | 6 | 0.1% |
11680 | 7 | 0.1% |
11650 | 9 | 0.1% |
11620 | 16 | 0.2% |
11590 | 37 | 0.5% |
11560 | 36 | 0.5% |
11545 | 7 | 0.1% |
11530 | 12 | 0.2% |
도면번호
Categorical
IMBALANCE
 
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 62.4 KiB |
(blank) | 7964 |
---|---|
2 | 4 |
6 | 2 |
① | 1 |
<NA> | 1 |
Other values (3) | 3 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0003762 |
Min length | 1 |
Unique
Unique | 5 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | (blank) |
---|---|
2nd row | (blank) |
3rd row | (blank) |
4th row | (blank) |
5th row | (blank) |
Common Values
Value | Count | Frequency (%) |
(blank) | 7964 | 99.9% |
2 | 4 | 0.1% |
6 | 2 | < 0.1% |
① | 1 | < 0.1% |
<NA> | 1 | < 0.1% |
1 | 1 | < 0.1% |
3 | 1 | < 0.1% |
5 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 4 | |
6 | 2 | |
① | 1 | 9.1% |
na | 1 | 9.1% |
1 | 1 | 9.1% |
3 | 1 | 9.1% |
5 | 1 | 9.1% |
현황도형 생성일시
Date
Distinct | 381 |
---|---|
Distinct (%) | 4.8% |
Missing | 1 |
Missing (%) | < 0.1% |
Memory size | 62.4 KiB |
Minimum | 1899-12-29 23:27:52 |
---|---|
Maximum | 2024-04-24 00:00:00 |
면적(도형)
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 7852 |
---|---|
Distinct (%) | 98.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 85201.138 |
Minimum | 0 |
---|---|
Maximum | 20968960 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 70.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 6.4207582 |
Q1 | 1395.2277 |
median | 10182.636 |
Q3 | 42292.295 |
95-th percentile | 262986.43 |
Maximum | 20968960 |
Range | 20968960 |
Interquartile range (IQR) | 40897.068 |
Descriptive statistics
Standard deviation | 571030.2 |
---|---|
Coefficient of variation (CV) | 6.7021429 |
Kurtosis | 565.53651 |
Mean | 85201.138 |
Median Absolute Deviation (MAD) | 10063.157 |
Skewness | 21.244872 |
Sum | 6.7947907 × 10^8 |
Variance | 3.2607549 × 10^11 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
108192.33280801 | 10 | 0.1% |
40462.70964938 | 8 | 0.1% |
80.41972918 | 5 | 0.1% |
1343.79765862 | 4 | 0.1% |
286.864463 | 4 | 0.1% |
127452.38791959 | 4 | 0.1% |
0.0006585 | 4 | 0.1% |
7585.56642794 | 4 | 0.1% |
249.18700122 | 3 | < 0.1% |
4066.47418999 | 3 | < 0.1% |
Other values (7842) | 7926 |
Value | Count | Frequency (%) |
0.0 | 1 | < 0.1% |
8.794e-05 | 1 | < 0.1% |
0.0006585 | 4 | |
0.00341802 | 1 | < 0.1% |
0.00359357 | 1 | < 0.1% |
0.00364012 | 1 | < 0.1% |
0.0044337 | 1 | < 0.1% |
0.004558 | 1 | < 0.1% |
0.0046161 | 1 | < 0.1% |
0.0047279 | 1 | < 0.1% |
Value | Count | Frequency (%) |
20968960.184733 | 1 | |
17523067.9555952 | 1 | |
16535936.1506327 | 1 | |
13697999.1170577 | 1 | |
12284179.1735757 | 1 | |
10890628.6113886 | 1 | |
10510314.5286172 | 1 | |
9697471.7484725 | 1 | |
9169537.04875847 | 1 | |
8790953.12064867 | 1 |
길이(도형)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 7846 |
---|---|
Distinct (%) | 98.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1464.8789 |
Minimum | 0 |
---|---|
Maximum | 136718.04 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 70.2 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 37.851105 |
Q1 | 216.19085 |
median | 573.73641 |
Q3 | 1367.192 |
95-th percentile | 5472.948 |
Maximum | 136718.04 |
Range | 136718.04 |
Interquartile range (IQR) | 1151.0012 |
Descriptive statistics
Standard deviation | 3715.7273 |
---|---|
Coefficient of variation (CV) | 2.5365423 |
Kurtosis | 327.79919 |
Mean | 1464.8789 |
Median Absolute Deviation (MAD) | 431.50292 |
Skewness | 13.584859 |
Sum | 11682409 |
Variance | 13806629 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1609.99212575 | 10 | 0.1% |
1740.40594112 | 8 | 0.1% |
81.65300163 | 5 | 0.1% |
145.24008958 | 4 | 0.1% |
5594.93074492 | 4 | 0.1% |
0.14327141 | 4 | 0.1% |
80.7661672 | 4 | 0.1% |
380.09925477 | 4 | 0.1% |
173.42904194 | 3 | < 0.1% |
136.02299212 | 3 | < 0.1% |
Other values (7836) | 7926 |
Value | Count | Frequency (%) |
0.0 | 1 | < 0.1% |
0.116897 | 1 | < 0.1% |
0.14327141 | 4 | |
1.26213596 | 1 | < 0.1% |
1.32463171 | 1 | < 0.1% |
1.44799569 | 1 | < 0.1% |
1.49449806 | 1 | < 0.1% |
1.53601185 | 1 | < 0.1% |
1.59970724 | 1 | < 0.1% |
1.81426135 | 1 | < 0.1% |
Value | Count | Frequency (%) |
136718.03627726 | 1 | |
83567.21487144 | 1 | |
81593.90324953 | 1 | |
60910.45569796 | 1 | |
60702.46359381 | 1 | |
55507.04157824 | 1 | |
52250.07519019 | 1 | |
50456.9303414 | 1 | |
45702.14415603 | 1 | |
45046.82776143 | 1 |
Variable | 객체id | 도형 대분류코드 | 도형 중분류코드 | 도형 소분류코드 | 도형 속성코드 | 라벨명 | 시군구코드 | 도면번호 | 면적(도형) | 길이(도형) |
---|---|---|---|---|---|---|---|---|---|---|
객체id | 1.000 | 0.085 | 0.102 | 0.116 | 0.157 | 0.169 | 0.274 | 0.031 | 0.000 | 0.000 |
도형 대분류코드 | 0.085 | 1.000 | 0.998 | 0.794 | 0.984 | 0.993 | 0.000 | 0.000 | 0.119 | 0.144 |
도형 중분류코드 | 0.102 | 0.998 | 1.000 | 0.858 | 0.992 | 0.993 | 0.000 | 0.000 | 0.103 | 0.127 |
도형 소분류코드 | 0.116 | 0.794 | 0.858 | 1.000 | 0.987 | 0.992 | 0.000 | 0.000 | 0.000 | 0.125 |
도형 속성코드 | 0.157 | 0.984 | 0.992 | 0.987 | 1.000 | 0.999 | 0.000 | 0.000 | 0.090 | 0.160 |
라벨명 | 0.169 | 0.993 | 0.993 | 0.992 | 0.999 | 1.000 | 0.000 | 0.223 | 0.714 | 0.288 |
시군구코드 | 0.274 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | NaN | NaN |
도면번호 | 0.031 | 0.000 | 0.000 | 0.000 | 0.000 | 0.223 | 0.000 | 1.000 | 0.000 | 0.000 |
면적(도형) | 0.000 | 0.119 | 0.103 | 0.000 | 0.090 | 0.714 | NaN | 0.000 | 1.000 | 0.756 |
길이(도형) | 0.000 | 0.144 | 0.127 | 0.125 | 0.160 | 0.288 | NaN | 0.000 | 0.756 | 1.000 |
Variable | 도형 속성코드 | 도면번호 | 도형 대분류코드 | 도형 소분류코드 | 도형 중분류코드 |
---|---|---|---|---|---|
도형 속성코드 | 1.000 | 0.000 | 0.924 | 0.904 | 0.931 |
도면번호 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
도형 대분류코드 | 0.924 | 0.000 | 1.000 | 0.486 | 0.925 |
도형 소분류코드 | 0.904 | 0.000 | 0.486 | 1.000 | 0.521 |
도형 중분류코드 | 0.931 | 0.000 | 0.925 | 0.521 | 1.000 |
Variable | 객체id | 시군구코드 | 면적(도형) | 길이(도형) | 도형 대분류코드 | 도형 중분류코드 | 도형 소분류코드 | 도형 속성코드 | 도면번호 |
---|---|---|---|---|---|---|---|---|---|
객체id | 1.000 | -0.019 | -0.003 | -0.005 | 0.040 | 0.040 | 0.049 | 0.058 | 0.016 |
시군구코드 | -0.019 | 1.000 | 0.049 | 0.062 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
면적(도형) | -0.003 | 0.049 | 1.000 | 0.939 | 0.057 | 0.040 | 0.000 | 0.033 | 0.000 |
길이(도형) | -0.005 | 0.062 | 0.939 | 1.000 | 0.048 | 0.045 | 0.053 | 0.066 | 0.000 |
도형 대분류코드 | 0.040 | 0.000 | 0.057 | 0.048 | 1.000 | 0.925 | 0.486 | 0.924 | 0.000 |
도형 중분류코드 | 0.040 | 0.000 | 0.040 | 0.045 | 0.925 | 1.000 | 0.521 | 0.931 | 0.000 |
도형 소분류코드 | 0.049 | 0.000 | 0.000 | 0.053 | 0.486 | 0.521 | 1.000 | 0.904 | 0.000 |
도형 속성코드 | 0.058 | 0.000 | 0.033 | 0.066 | 0.924 | 0.931 | 0.904 | 1.000 | 0.000 |
도면번호 | 0.016 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
Index | 객체id | 현황도형 관리번호 | 도형 대분류코드 | 도형 중분류코드 | 도형 소분류코드 | 도형 속성코드 | 도형 조서관리 코드 | 결정고시관리코드 | 라벨명 | 시군구코드 | 도면번호 | 현황도형 생성일시 | 면적(도형) | 길이(도형) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 294343 | 11000UQ111PS201912151888 | UQA100 | UQA120 | UQA124 | UQA124 | 11000ARZ000000001111 | | 제2종일반주거지역(7층이하) | 11000 | | 2019-12-15 00:00:00.0 | 152954.900644 | 4227.108135 |
1 | 294344 | 11000UQ111PS201912152144 | UQA100 | UQA120 | UQA124 | UQA124 | 11000ARZ000000001111 | | 제2종일반주거지역(7층이하) | 11000 | | 2019-12-15 00:00:00.0 | 108067.862098 | 1749.209302 |
2 | 294345 | 11000UQ111PS201912152145 | UQA100 | UQA120 | UQA124 | UQA124 | 11000ARZ000000001111 | | 제2종일반주거지역(7층이하) | 11000 | | 2019-12-15 00:00:00.0 | 14.684567 | 110.004502 |
3 | 294346 | 11000UQ111PS201912151968 | UQA100 | UQA120 | UQA124 | UQA124 | 11000ARZ000000001111 | | 제2종일반주거지역(7층이하) | 11000 | | 2019-12-15 00:00:00.0 | 6165.861378 | 322.816516 |
4 | 294347 | 11000UQ111PS201912152024 | UQA100 | UQA120 | UQA124 | UQA124 | 11000ARZ000000001111 | | 제2종일반주거지역(7층이하) | 11000 | | 2019-12-15 00:00:00.0 | 457830.786808 | 7774.463035 |
5 | 294348 | 11000UQ111PS201912153565 | UQA100 | UQA120 | UQA122 | UQA122 | 11000ARZ000000001111 | | 제2종일반주거지역 | 11000 | | 2019-12-15 00:00:00.0 | 27587.712678 | 954.538723 |
6 | 294349 | 11000UQ111PS201912152311 | UQA100 | UQA120 | UQA121 | UQA121 | 11000ARZ000000001111 | | 제1종일반주거지역 | 11000 | | 2019-12-15 00:00:00.0 | 100240.286727 | 4227.558148 |
7 | 294350 | 11000UQ111PS201912154250 | UQA100 | UQA120 | UQA123 | UQA123 | 11000ARZ000000001111 | | 제3종일반주거지역 | 11000 | | 2019-12-15 00:00:00.0 | 141849.276107 | 5919.309664 |
8 | 294351 | 11000UQ111PS201912155548 | UQA100 | UQA120 | UQA122 | UQA122 | 11000ARZ201808130031 | 11000NTC201806143176 | 제2종일반주거지역 | 11000 | | 2019-12-15 00:00:00.0 | 200542.056634 | 4472.758837 |
9 | 294352 | 11000UQ111PS201912155555 | UQA100 | UQA120 | UQA122 | UQA122 | 11000ARZ000000001111 | | 제2종일반주거지역 | 11000 | | 2019-12-15 00:00:00.0 | 271545.69064 | 2899.692749 |
Index | 객체id | 현황도형 관리번호 | 도형 대분류코드 | 도형 중분류코드 | 도형 소분류코드 | 도형 속성코드 | 도형 조서관리 코드 | 결정고시관리코드 | 라벨명 | 시군구코드 | 도면번호 | 현황도형 생성일시 | 면적(도형) | 길이(도형) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
7965 | 302356 | 11000UQ111PS201912154510 | UQA100 | UQA130 | | UQA130 | 11000ARZ000000001111 | | 준주거지역 | 11000 | | 2019-12-15 00:00:00.0 | 35557.721619 | 1038.024727 |
7966 | 302357 | 11000UQ111PS201912153145 | UQA100 | UQA120 | UQA122 | UQA122 | 11000ARZ000000001111 | | 제2종일반주거지역 | 11000 | | 2019-12-15 00:00:00.0 | 157.062678 | 112.002287 |
7967 | 302358 | 11000UQ111PS201912154532 | UQA100 | UQA120 | UQA122 | UQA122 | 11000ARZ000000001111 | | 제2종일반주거지역 | 11000 | | 2019-12-15 00:00:00.0 | 3.067403 | 8.385077 |
7968 | 302359 | 11000UQ111PS201912153749 | UQA100 | UQA120 | UQA122 | UQA122 | 11000ARZ000000001111 | | 제2종일반주거지역 | 11000 | | 2019-12-15 00:00:00.0 | 2951.799732 | 251.339698 |
7969 | 302360 | 11000UQ111PS201912153497 | UQA100 | UQA120 | UQA121 | UQA121 | 11000ARZ200912171829 | 11000NTC200604069910 | 제1종일반주거지역 | 11000 | | 2019-12-15 00:00:00.0 | 7523.740706 | 1158.548256 |
7970 | 302361 | 11000UQ111PS201912154164 | UQA100 | UQA120 | UQA121 | UQA121 | 11000ARZ000000001111 | | 제1종일반주거지역 | 11000 | | 2019-12-15 00:00:00.0 | 2276.668912 | 242.072846 |
7971 | 302362 | 11000UQ111PS201912156418 | UQA400 | UQA430 | | UQA430 | 11000ARZ200912015718 | 11000NTC200412157362 | 자연녹지지역 | 11000 | | 2019-12-15 00:00:00.0 | 1076200.772099 | 8987.346539 |
7972 | 302363 | 11000UQ111PS202007126789 | UQA400 | UQA430 | | UQA430 | 11000ARZ000000001111 | | 자연녹지지역 | 11000 | | 2020-07-12 00:00:00.0 | 1856.137365 | 3044.039554 |
7973 | 302364 | 11000UQ111PS202007126791 | UQA400 | UQA430 | | UQA430 | 11000ARZ000000001111 | | 자연녹지지역 | 11000 | | 2020-07-12 00:00:00.0 | 30.644845 | 390.9741 |
7974 | 302365 | 11000UQ111PS202007126793 | UQA400 | UQA430 | | UQA430 | 11000ARZ000000001111 | | 자연녹지지역 | 11000 | | 2020-07-12 00:00:00.0 | 166.579261 | 1622.877766 |