Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 52 |
Missing cells (%) | 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 742.2 KiB |
Average record size in memory | 76.0 B |
Variable types
Text | 4 |
---|---|
Categorical | 3 |
Numeric | 1 |
Dataset
Description | 관리_지역지구구역_pk,관리_주택대장_pk,지역지구구역_구분_코드,지역지구구역_코드,대표_여부,동_구분_코드,지역지구구역_명,작업_일자 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15676/S/1/datasetView.do |
지역지구구역_구분_코드 is highly overall correlated with 동_구분_코드 | High correlation |
대표_여부 is highly overall correlated with 동_구분_코드 | High correlation |
동_구분_코드 is highly overall correlated with 작업_일자 and 2 other fields | High correlation |
작업_일자 is highly overall correlated with 동_구분_코드 | High correlation |
관리_지역지구구역_pk has unique values | Unique |
Reproduction
Analysis started | 2024-05-04 01:56:05.506756 |
---|---|
Analysis finished | 2024-05-04 01:56:07.504561 |
Duration | 2 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
관리_지역지구구역_pk
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 9 |
Mean length | 11.2395 |
Min length | 7 |
Characters and Unicode
Total characters | 112395 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11500-444 |
---|---|
2nd row | 11440-465 |
3rd row | 11740-10 |
4th row | 11530-745 |
5th row | 11530-1461 |
Value | Count | Frequency (%) |
11500-444 | 1 | < 0.1% |
11320-107 | 1 | < 0.1% |
11740-100003345 | 1 | < 0.1% |
11140-100001421 | 1 | < 0.1% |
11530-1192 | 1 | < 0.1% |
11590-33 | 1 | < 0.1% |
11170-226 | 1 | < 0.1% |
11500-792 | 1 | < 0.1% |
11000-100001361 | 1 | < 0.1% |
11110-1000000000000000107301 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 31305 | |
1 | 29544 | |
- | 10000 | 8.9% |
2 | 7534 | 6.7% |
5 | 7471 | 6.6% |
3 | 6590 | 5.9% |
4 | 5492 | 4.9% |
6 | 4787 | 4.3% |
7 | 3616 | 3.2% |
8 | 3165 | 2.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 102395 | |
Dash Punctuation | 10000 | 8.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 31305 | |
1 | 29544 | |
2 | 7534 | 7.4% |
5 | 7471 | 7.3% |
3 | 6590 | 6.4% |
4 | 5492 | 5.4% |
6 | 4787 | 4.7% |
7 | 3616 | 3.5% |
8 | 3165 | 3.1% |
9 | 2891 | 2.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 112395 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 31305 | |
1 | 29544 | |
- | 10000 | 8.9% |
2 | 7534 | 6.7% |
5 | 7471 | 6.6% |
3 | 6590 | 5.9% |
4 | 5492 | 4.9% |
6 | 4787 | 4.3% |
7 | 3616 | 3.2% |
8 | 3165 | 2.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 112395 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 31305 | |
1 | 29544 | |
- | 10000 | 8.9% |
2 | 7534 | 6.7% |
5 | 7471 | 6.6% |
3 | 6590 | 5.9% |
4 | 5492 | 4.9% |
6 | 4787 | 4.3% |
7 | 3616 | 3.2% |
8 | 3165 | 2.8% |
관리_주택대장_pk
Text
Distinct | 2953 |
---|---|
Distinct (%) | 29.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
11530-17 | 692 | 6.9% |
11305-8 | 456 | 4.6% |
11500-18 | 120 | 1.2% |
11620-11 | 103 | 1.0% |
11545-2 | 95 | 0.9% |
11710-27 | 79 | 0.8% |
11560-33 | 79 | 0.8% |
11200-38 | 69 | 0.7% |
11620-1 | 68 | 0.7% |
11620-15 | 61 | 0.6% |
Other values (2943) | 8178 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 29275 | |
0 | 26842 | |
- | 10000 | 9.6% |
5 | 7256 | 6.9% |
2 | 6063 | 5.8% |
3 | 6046 | 5.8% |
4 | 5005 | 4.8% |
6 | 4531 | 4.3% |
7 | 4014 | 3.8% |
8 | 3262 | 3.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 94708 | |
Dash Punctuation | 10000 | 9.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 29275 | |
0 | 26842 | |
5 | 7256 | 7.7% |
2 | 6063 | 6.4% |
3 | 6046 | 6.4% |
4 | 5005 | 5.3% |
6 | 4531 | 4.8% |
7 | 4014 | 4.2% |
8 | 3262 | 3.4% |
9 | 2414 | 2.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 104708 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 29275 | |
0 | 26842 | |
- | 10000 | 9.6% |
5 | 7256 | 6.9% |
2 | 6063 | 5.8% |
3 | 6046 | 5.8% |
4 | 5005 | 4.8% |
6 | 4531 | 4.3% |
7 | 4014 | 3.8% |
8 | 3262 | 3.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 104708 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 29275 | |
0 | 26842 | |
- | 10000 | 9.6% |
5 | 7256 | 6.9% |
2 | 6063 | 5.8% |
3 | 6046 | 5.8% |
4 | 5005 | 4.8% |
6 | 4531 | 4.3% |
7 | 4014 | 3.8% |
8 | 3262 | 3.1% |
지역지구구역_구분_코드
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | |
---|---|
2 | |
3 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 3 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
1 | 6587 | |
2 | 2451 | 24.5% |
3 | 962 | 9.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 6587 | |
2 | 2451 | 24.5% |
3 | 962 | 9.6% |
지역지구구역_코드
Text
Distinct | 148 |
---|---|
Distinct (%) | 1.5% |
Missing | 26 |
Missing (%) | 0.3% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
1020 | 3546 | |
uqa122 | 495 | 5.0% |
112 | 482 | 4.8% |
1022 | 414 | 4.2% |
uqa123 | 373 | 3.7% |
103 | 320 | 3.2% |
uqa001 | 309 | 3.1% |
1230 | 301 | 3.0% |
1023 | 284 | 2.8% |
0100 | 277 | 2.8% |
Other values (138) | 3173 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 13574 | |
1 | 9563 | |
2 | 8262 | |
U | 2525 | 5.9% |
Q | 2262 | 5.3% |
3 | 2162 | 5.1% |
A | 1725 | 4.0% |
8 | 414 | 1.0% |
6 | 357 | 0.8% |
9 | 301 | 0.7% |
Other values (25) | 1459 | 3.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 34922 | |
Uppercase Letter | 7682 | 18.0% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
U | 2525 | |
Q | 2262 | |
A | 1725 | |
G | 194 | 2.5% |
D | 167 | 2.2% |
Z | 120 | 1.6% |
O | 110 | 1.4% |
N | 98 | 1.3% |
M | 86 | 1.1% |
H | 77 | 1.0% |
Other values (15) | 318 | 4.1% |
Decimal Number
Value | Count | Frequency (%) |
0 | 13574 | |
1 | 9563 | |
2 | 8262 | |
3 | 2162 | 6.2% |
8 | 414 | 1.2% |
6 | 357 | 1.0% |
9 | 301 | 0.9% |
5 | 187 | 0.5% |
4 | 86 | 0.2% |
7 | 16 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 34922 | |
Latin | 7682 | 18.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
U | 2525 | |
Q | 2262 | |
A | 1725 | |
G | 194 | 2.5% |
D | 167 | 2.2% |
Z | 120 | 1.6% |
O | 110 | 1.4% |
N | 98 | 1.3% |
M | 86 | 1.1% |
H | 77 | 1.0% |
Other values (15) | 318 | 4.1% |
Common
Value | Count | Frequency (%) |
0 | 13574 | |
1 | 9563 | |
2 | 8262 | |
3 | 2162 | 6.2% |
8 | 414 | 1.2% |
6 | 357 | 1.0% |
9 | 301 | 0.9% |
5 | 187 | 0.5% |
4 | 86 | 0.2% |
7 | 16 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 42604 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 13574 | |
1 | 9563 | |
2 | 8262 | |
U | 2525 | 5.9% |
Q | 2262 | 5.3% |
3 | 2162 | 5.1% |
A | 1725 | 4.0% |
8 | 414 | 1.0% |
6 | 357 | 0.8% |
9 | 301 | 0.7% |
Other values (25) | 1459 | 3.4% |
대표_여부
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | |
---|---|
0 | |
<NA> | 6 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0018 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 0 |
3rd row | 1 |
4th row | 0 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 7345 | |
0 | 2649 | 26.5% |
<NA> | 6 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 7345 | |
0 | 2649 | 26.5% |
na | 6 | 0.1% |
동_구분_코드
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 2.779 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | <NA> |
4th row | <NA> |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
<NA> | 5930 | |
1 | 4070 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 5930 | |
1 | 4070 |
지역지구구역_명
Text
Distinct | 128 |
---|---|
Distinct (%) | 1.3% |
Missing | 26 |
Missing (%) | 0.3% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
일반주거지역 | 3556 | |
제2종일반주거지역 | 909 | 9.0% |
제3종일반주거지역 | 657 | 6.5% |
도시지역 | 604 | 6.0% |
최고고도지구 | 526 | 5.2% |
일반미관지구 | 404 | 4.0% |
준공업지역 | 354 | 3.5% |
주차장정비지구 | 235 | 2.3% |
공항지구 | 218 | 2.2% |
재개발구역 | 207 | 2.1% |
Other values (128) | 2399 |
Most occurring characters
Value | Count | Frequency (%) |
지 | 9571 | |
역 | 7739 | |
일 | 5741 | 8.7% |
반 | 5741 | 8.7% |
주 | 5632 | 8.5% |
거 | 5374 | 8.2% |
구 | 3708 | 5.6% |
제 | 2001 | 3.0% |
종 | 1856 | 2.8% |
도 | 1316 | 2.0% |
Other values (134) | 17239 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 63914 | |
Decimal Number | 1863 | 2.8% |
Space Separator | 95 | 0.1% |
Close Punctuation | 23 | < 0.1% |
Open Punctuation | 23 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
지 | 9571 | |
역 | 7739 | |
일 | 5741 | 9.0% |
반 | 5741 | 9.0% |
주 | 5632 | 8.8% |
거 | 5374 | 8.4% |
구 | 3708 | 5.8% |
제 | 2001 | 3.1% |
종 | 1856 | 2.9% |
도 | 1316 | 2.1% |
Other values (125) | 15235 |
Decimal Number
Value | Count | Frequency (%) |
2 | 958 | |
3 | 661 | |
1 | 227 | 12.2% |
4 | 10 | 0.5% |
5 | 6 | 0.3% |
6 | 1 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
95 |
Close Punctuation
Value | Count | Frequency (%) |
) | 23 |
Open Punctuation
Value | Count | Frequency (%) |
( | 23 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 63914 | |
Common | 2004 | 3.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
지 | 9571 | |
역 | 7739 | |
일 | 5741 | 9.0% |
반 | 5741 | 9.0% |
주 | 5632 | 8.8% |
거 | 5374 | 8.4% |
구 | 3708 | 5.8% |
제 | 2001 | 3.1% |
종 | 1856 | 2.9% |
도 | 1316 | 2.1% |
Other values (125) | 15235 |
Common
Value | Count | Frequency (%) |
2 | 958 | |
3 | 661 | |
1 | 227 | 11.3% |
95 | 4.7% | |
) | 23 | 1.1% |
( | 23 | 1.1% |
4 | 10 | 0.5% |
5 | 6 | 0.3% |
6 | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 63914 | |
ASCII | 2004 | 3.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
지 | 9571 | |
역 | 7739 | |
일 | 5741 | 9.0% |
반 | 5741 | 9.0% |
주 | 5632 | 8.8% |
거 | 5374 | 8.4% |
구 | 3708 | 5.8% |
제 | 2001 | 3.1% |
종 | 1856 | 2.9% |
도 | 1316 | 2.1% |
Other values (125) | 15235 |
ASCII
Value | Count | Frequency (%) |
2 | 958 | |
3 | 661 | |
1 | 227 | 11.3% |
95 | 4.7% | |
) | 23 | 1.1% |
( | 23 | 1.1% |
4 | 10 | 0.5% |
5 | 6 | 0.3% |
6 | 1 | < 0.1% |
작업_일자
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 485 |
---|---|
Distinct (%) | 4.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20141060 |
Minimum | 20111227 |
---|---|
Maximum | 20240503 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20111227 |
---|---|
5-th percentile | 20111227 |
Q1 | 20111227 |
median | 20111227 |
Q3 | 20180927 |
95-th percentile | 20240208 |
Maximum | 20240503 |
Range | 129276 |
Interquartile range (IQR) | 69700 |
Descriptive statistics
Standard deviation | 46221.045 |
---|---|
Coefficient of variation (CV) | 0.0022948666 |
Kurtosis | -0.40895471 |
Mean | 20141060 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.1389643 |
Sum | 2.014106 × 1011 |
Variance | 2.136385 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20111227 | 6181 | |
20120207 | 481 | 4.8% |
20191203 | 337 | 3.4% |
20240503 | 330 | 3.3% |
20180927 | 300 | 3.0% |
20120208 | 136 | 1.4% |
20211029 | 123 | 1.2% |
20240208 | 99 | 1.0% |
20240102 | 64 | 0.6% |
20120605 | 31 | 0.3% |
Other values (475) | 1918 | 19.2% |
Value | Count | Frequency (%) |
20111227 | 6181 | |
20120112 | 1 | < 0.1% |
20120113 | 1 | < 0.1% |
20120207 | 481 | 4.8% |
20120208 | 136 | 1.4% |
20120222 | 15 | 0.1% |
20120223 | 4 | < 0.1% |
20120229 | 2 | < 0.1% |
20120306 | 1 | < 0.1% |
20120309 | 3 | < 0.1% |
Value | Count | Frequency (%) |
20240503 | 330 | |
20240425 | 3 | < 0.1% |
20240420 | 20 | 0.2% |
20240417 | 1 | < 0.1% |
20240416 | 4 | < 0.1% |
20240411 | 3 | < 0.1% |
20240406 | 4 | < 0.1% |
20240402 | 6 | 0.1% |
20240330 | 6 | 0.1% |
20240327 | 19 | 0.2% |
지역지구구역_구분_코드 | 대표_여부 | 작업_일자 | |
---|---|---|---|
지역지구구역_구분_코드 | 1.000 | 0.069 | 0.212 |
대표_여부 | 0.069 | 1.000 | 0.118 |
작업_일자 | 0.212 | 0.118 | 1.000 |
지역지구구역_구분_코드 | 대표_여부 | 동_구분_코드 | |
---|---|---|---|
지역지구구역_구분_코드 | 1.000 | 0.115 | 1.000 |
대표_여부 | 0.115 | 1.000 | 1.000 |
동_구분_코드 | 1.000 | 1.000 | 1.000 |
작업_일자 | 지역지구구역_구분_코드 | 대표_여부 | 동_구분_코드 | |
---|---|---|---|---|
작업_일자 | 1.000 | 0.137 | 0.048 | 1.000 |
지역지구구역_구분_코드 | 0.137 | 1.000 | 0.115 | 1.000 |
대표_여부 | 0.048 | 0.115 | 1.000 | 1.000 |
동_구분_코드 | 1.000 | 1.000 | 1.000 | 1.000 |
관리_지역지구구역_pk | 관리_주택대장_pk | 지역지구구역_구분_코드 | 지역지구구역_코드 | 대표_여부 | 동_구분_코드 | 지역지구구역_명 | 작업_일자 | |
---|---|---|---|---|---|---|---|---|
8952 | 11500-444 | 11500-18 | 3 | 980 | 1 | 1 | 용도구역미지정 | 20111227 |
7131 | 11440-465 | 11440-89 | 1 | 1023 | 0 | 1 | 제3종일반주거지역 | 20111227 |
16497 | 11740-10 | 11740-5 | 1 | 1020 | 1 | <NA> | 일반주거지역 | 20120207 |
11298 | 11530-745 | 11530-17 | 1 | 1020 | 0 | <NA> | 일반주거지역 | 20111227 |
10345 | 11530-1461 | 11530-38 | 2 | 260 | 1 | 1 | 주차장정비지구 | 20111227 |
1438 | 11200-124 | 11200-7 | 1 | 1020 | 1 | 1 | 일반주거지역 | 20111227 |
10573 | 11530-1667 | 11530-61 | 2 | 990 | 1 | 1 | 기타지구 | 20111227 |
6427 | 11410-100002373 | 11410-100010284 | 2 | UQG130 | 1 | <NA> | 일반미관지구 | 20190112 |
3591 | 11290-354 | 11290-33 | 3 | 280 | 1 | 1 | 재개발구역 | 20111227 |
4911 | 11305-853 | 11305-10 | 1 | 1020 | 1 | 1 | 일반주거지역 | 20111227 |
관리_지역지구구역_pk | 관리_주택대장_pk | 지역지구구역_구분_코드 | 지역지구구역_코드 | 대표_여부 | 동_구분_코드 | 지역지구구역_명 | 작업_일자 | |
---|---|---|---|---|---|---|---|---|
3666 | 11290-421 | 11290-36 | 1 | 1020 | 1 | 1 | 일반주거지역 | 20111227 |
5298 | 11320-31 | 11320-24 | 3 | 050 | 1 | <NA> | 상세계획구역 | 20111227 |
12989 | 11590-330 | 11590-27 | 1 | 1020 | 1 | 1 | 일반주거지역 | 20111227 |
11631 | 11545-100001643 | 11545-100004362 | 1 | UQA001 | 0 | <NA> | 도시지역 | 20230929 |
7983 | 11470-86 | 11470-47 | 1 | 1020 | 1 | <NA> | 일반주거지역 | 20191203 |
16474 | 11710-8 | 11710-5 | 1 | 1020 | 1 | <NA> | 일반주거지역 | 20111227 |
1020 | 11170-100001622 | 11170-100005441 | 1 | 1022 | 1 | <NA> | 제2종일반주거지역 | 20161001 |
5605 | 11350-128 | 11350-308 | 2 | 160 | 1 | 1 | 택지개발지구 | 20111227 |
493 | 11000-100004342 | 11000-100006546 | 1 | UQA130 | 1 | <NA> | 준주거지역 | 20240503 |
5490 | 11350-1 | 11350-10 | 1 | 1020 | 1 | <NA> | 일반주거지역 | 20111227 |