Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 1848 |
Missing cells (%) | 2.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 654.3 KiB |
Average record size in memory | 67.0 B |
Variable types
Text | 4 |
---|---|
Categorical | 2 |
Numeric | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15655/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-04 02:56:47.097427 |
---|---|
Analysis finished | 2024-05-04 02:56:50.057525 |
Duration | 2.96 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
관리_지역지구구역
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 15 |
---|---|
Median length | 11 |
Mean length | 11.2789 |
Min length | 8 |
Characters and Unicode
Total characters | 112789 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11620-46961 |
---|---|
2nd row | 11620-35514 |
3rd row | 11140-21174 |
4th row | 11380-7844 |
5th row | 11530-13240 |
Value | Count | Frequency (%) |
11620-46961 | 1 | < 0.1% |
11620-33799 | 1 | < 0.1% |
11545-25718 | 1 | < 0.1% |
11320-13690 | 1 | < 0.1% |
11620-30413 | 1 | < 0.1% |
11140-12675 | 1 | < 0.1% |
11260-10786 | 1 | < 0.1% |
11545-11552 | 1 | < 0.1% |
11530-6175 | 1 | < 0.1% |
11290-18424 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 31002 | |
0 | 16413 | |
5 | 10157 | 9.0% |
- | 10000 | 8.9% |
2 | 9564 | 8.5% |
4 | 7640 | 6.8% |
3 | 7205 | 6.4% |
6 | 6876 | 6.1% |
9 | 4808 | 4.3% |
7 | 4703 | 4.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 102789 | |
Dash Punctuation | 10000 | 8.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 31002 | |
0 | 16413 | |
5 | 10157 | 9.9% |
2 | 9564 | 9.3% |
4 | 7640 | 7.4% |
3 | 7205 | 7.0% |
6 | 6876 | 6.7% |
9 | 4808 | 4.7% |
7 | 4703 | 4.6% |
8 | 4421 | 4.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 112789 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 31002 | |
0 | 16413 | |
5 | 10157 | 9.0% |
- | 10000 | 8.9% |
2 | 9564 | 8.5% |
4 | 7640 | 6.8% |
3 | 7205 | 6.4% |
6 | 6876 | 6.1% |
9 | 4808 | 4.3% |
7 | 4703 | 4.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 112789 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 31002 | |
0 | 16413 | |
5 | 10157 | 9.0% |
- | 10000 | 8.9% |
2 | 9564 | 8.5% |
4 | 7640 | 6.8% |
3 | 7205 | 6.4% |
6 | 6876 | 6.1% |
9 | 4808 | 4.3% |
7 | 4703 | 4.2% |
관리_건축물대장
Text
Distinct | 8469 |
---|---|
Distinct (%) | 84.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 15 |
---|---|
Median length | 11 |
Mean length | 10.7347 |
Min length | 7 |
Characters and Unicode
Total characters | 107347 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 7032 ? |
---|---|
Unique (%) | 70.3% |
Sample
1st row | 11620-30925 |
---|---|
2nd row | 11620-23885 |
3rd row | 11140-412 |
4th row | 11380-23504 |
5th row | 11530-589 |
Value | Count | Frequency (%) |
11110-660 | 6 | 0.1% |
11500-855 | 6 | 0.1% |
11500-583 | 5 | < 0.1% |
11545-100184705 | 4 | < 0.1% |
11500-100205108 | 4 | < 0.1% |
11500-493 | 4 | < 0.1% |
11500-100214647 | 4 | < 0.1% |
11170-100200885 | 4 | < 0.1% |
11170-6854 | 4 | < 0.1% |
11500-100259112 | 4 | < 0.1% |
Other values (8459) | 9955 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 29894 | |
0 | 13457 | |
2 | 10616 | 9.9% |
- | 10000 | 9.3% |
5 | 9128 | 8.5% |
4 | 6813 | 6.3% |
6 | 6792 | 6.3% |
3 | 6486 | 6.0% |
9 | 4840 | 4.5% |
7 | 4753 | 4.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 97347 | |
Dash Punctuation | 10000 | 9.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 29894 | |
0 | 13457 | |
2 | 10616 | 10.9% |
5 | 9128 | 9.4% |
4 | 6813 | 7.0% |
6 | 6792 | 7.0% |
3 | 6486 | 6.7% |
9 | 4840 | 5.0% |
7 | 4753 | 4.9% |
8 | 4568 | 4.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 107347 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 29894 | |
0 | 13457 | |
2 | 10616 | 9.9% |
- | 10000 | 9.3% |
5 | 9128 | 8.5% |
4 | 6813 | 6.3% |
6 | 6792 | 6.3% |
3 | 6486 | 6.0% |
9 | 4840 | 4.5% |
7 | 4753 | 4.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 107347 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 29894 | |
0 | 13457 | |
2 | 10616 | 9.9% |
- | 10000 | 9.3% |
5 | 9128 | 8.5% |
4 | 6813 | 6.3% |
6 | 6792 | 6.3% |
3 | 6486 | 6.0% |
9 | 4840 | 4.5% |
7 | 4753 | 4.4% |
지역지구구역_구분_코드
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | |
---|---|
2 | |
3 | 314 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 2 |
3rd row | 1 |
4th row | 1 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
1 | 6208 | |
2 | 3478 | |
3 | 314 | 3.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 6208 | |
2 | 3478 | |
3 | 314 | 3.1% |
지역지구구역_코드
Text
MISSING
 
Distinct | 110 |
---|---|
Distinct (%) | 1.2% |
Missing | 629 |
Missing (%) | 6.3% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
1020 | 2079 | |
1022 | 2011 | |
260 | 1033 | |
990 | 938 | |
1021 | 498 | 5.3% |
112 | 432 | 4.6% |
111 | 303 | 3.2% |
1023 | 239 | 2.6% |
uqa001 | 233 | 2.5% |
uqa122 | 170 | 1.8% |
Other values (100) | 1435 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 10987 | |
2 | 9229 | |
1 | 9048 | |
9 | 1917 | 5.4% |
6 | 1047 | 2.9% |
3 | 884 | 2.5% |
U | 732 | 2.1% |
Q | 610 | 1.7% |
A | 564 | 1.6% |
7 | 110 | 0.3% |
Other values (21) | 454 | 1.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 33358 | |
Uppercase Letter | 2224 | 6.3% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
U | 732 | |
Q | 610 | |
A | 564 | |
N | 52 | 2.3% |
H | 45 | 2.0% |
O | 41 | 1.8% |
G | 37 | 1.7% |
E | 26 | 1.2% |
L | 25 | 1.1% |
M | 18 | 0.8% |
Other values (11) | 74 | 3.3% |
Decimal Number
Value | Count | Frequency (%) |
0 | 10987 | |
2 | 9229 | |
1 | 9048 | |
9 | 1917 | 5.7% |
6 | 1047 | 3.1% |
3 | 884 | 2.7% |
7 | 110 | 0.3% |
4 | 82 | 0.2% |
5 | 41 | 0.1% |
8 | 13 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 33358 | |
Latin | 2224 | 6.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
U | 732 | |
Q | 610 | |
A | 564 | |
N | 52 | 2.3% |
H | 45 | 2.0% |
O | 41 | 1.8% |
G | 37 | 1.7% |
E | 26 | 1.2% |
L | 25 | 1.1% |
M | 18 | 0.8% |
Other values (11) | 74 | 3.3% |
Common
Value | Count | Frequency (%) |
0 | 10987 | |
2 | 9229 | |
1 | 9048 | |
9 | 1917 | 5.7% |
6 | 1047 | 3.1% |
3 | 884 | 2.7% |
7 | 110 | 0.3% |
4 | 82 | 0.2% |
5 | 41 | 0.1% |
8 | 13 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35582 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 10987 | |
2 | 9229 | |
1 | 9048 | |
9 | 1917 | 5.4% |
6 | 1047 | 2.9% |
3 | 884 | 2.5% |
U | 732 | 2.1% |
Q | 610 | 1.7% |
A | 564 | 1.6% |
7 | 110 | 0.3% |
Other values (21) | 454 | 1.3% |
대표_여부
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | |
---|---|
0 | 598 |
<NA> | 19 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0057 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 9383 | |
0 | 598 | 6.0% |
<NA> | 19 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 9383 | |
0 | 598 | 6.0% |
na | 19 | 0.2% |
기타_지역지구구역
Text
MISSING
 
Distinct | 211 |
---|---|
Distinct (%) | 2.4% |
Missing | 1219 |
Missing (%) | 12.2% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
일반주거지역 | 1756 | |
2종일반주거지역 | 1051 | |
고도지구기타(공항고도지구 | 823 | |
일반주거 | 697 | 7.9% |
제2종일반주거지역 | 612 | 6.9% |
주차장정비지구 | 607 | 6.8% |
주차장정비 | 471 | 5.3% |
고도지구기타 | 427 | 4.8% |
도시지역 | 325 | 3.7% |
1종일반주거지역 | 240 | 2.7% |
Other values (204) | 1853 |
Most occurring characters
Value | Count | Frequency (%) |
지 | 8309 | |
주 | 6061 | 9.5% |
역 | 5039 | 7.9% |
거 | 4955 | 7.7% |
일 | 4755 | 7.4% |
반 | 4749 | 7.4% |
구 | 3727 | 5.8% |
도 | 2658 | 4.1% |
고 | 2437 | 3.8% |
종 | 2306 | 3.6% |
Other values (149) | 19060 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 59742 | |
Decimal Number | 2331 | 3.6% |
Close Punctuation | 940 | 1.5% |
Open Punctuation | 939 | 1.5% |
Space Separator | 81 | 0.1% |
Other Punctuation | 19 | < 0.1% |
Math Symbol | 2 | < 0.1% |
Lowercase Letter | 1 | < 0.1% |
Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
지 | 8309 | |
주 | 6061 | |
역 | 5039 | 8.4% |
거 | 4955 | 8.3% |
일 | 4755 | 8.0% |
반 | 4749 | 7.9% |
구 | 3727 | 6.2% |
도 | 2658 | 4.4% |
고 | 2437 | 4.1% |
종 | 2306 | 3.9% |
Other values (129) | 14746 |
Decimal Number
Value | Count | Frequency (%) |
2 | 1703 | |
1 | 416 | 17.8% |
3 | 192 | 8.2% |
4 | 12 | 0.5% |
7 | 5 | 0.2% |
6 | 1 | < 0.1% |
5 | 1 | < 0.1% |
0 | 1 | < 0.1% |
Other Punctuation
Value | Count | Frequency (%) |
, | 13 | |
/ | 5 | 26.3% |
? | 1 | 5.3% |
Close Punctuation
Value | Count | Frequency (%) |
) | 939 | |
] | 1 | 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 938 | |
[ | 1 | 0.1% |
Math Symbol
Value | Count | Frequency (%) |
< | 1 | |
> | 1 |
Space Separator
Value | Count | Frequency (%) |
81 |
Lowercase Letter
Value | Count | Frequency (%) |
m | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
M | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 59738 | |
Common | 4312 | 6.7% |
Han | 4 | < 0.1% |
Latin | 2 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
지 | 8309 | |
주 | 6061 | |
역 | 5039 | 8.4% |
거 | 4955 | 8.3% |
일 | 4755 | 8.0% |
반 | 4749 | 7.9% |
구 | 3727 | 6.2% |
도 | 2658 | 4.4% |
고 | 2437 | 4.1% |
종 | 2306 | 3.9% |
Other values (125) | 14742 |
Common
Value | Count | Frequency (%) |
2 | 1703 | |
) | 939 | |
( | 938 | |
1 | 416 | 9.6% |
3 | 192 | 4.5% |
81 | 1.9% | |
, | 13 | 0.3% |
4 | 12 | 0.3% |
/ | 5 | 0.1% |
7 | 5 | 0.1% |
Other values (8) | 8 | 0.2% |
Han
Value | Count | Frequency (%) |
衫 | 1 | |
北 | 1 | |
斂 | 1 | |
熾 | 1 |
Latin
Value | Count | Frequency (%) |
m | 1 | |
M | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 59738 | |
ASCII | 4314 | 6.7% |
CJK | 4 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
지 | 8309 | |
주 | 6061 | |
역 | 5039 | 8.4% |
거 | 4955 | 8.3% |
일 | 4755 | 8.0% |
반 | 4749 | 7.9% |
구 | 3727 | 6.2% |
도 | 2658 | 4.4% |
고 | 2437 | 4.1% |
종 | 2306 | 3.9% |
Other values (125) | 14742 |
ASCII
Value | Count | Frequency (%) |
2 | 1703 | |
) | 939 | |
( | 938 | |
1 | 416 | 9.6% |
3 | 192 | 4.5% |
81 | 1.9% | |
, | 13 | 0.3% |
4 | 12 | 0.3% |
/ | 5 | 0.1% |
7 | 5 | 0.1% |
Other values (10) | 10 | 0.2% |
CJK
Value | Count | Frequency (%) |
衫 | 1 | |
北 | 1 | |
斂 | 1 | |
熾 | 1 |
작업_일자
Real number (ℝ)
Distinct | 458 |
---|---|
Distinct (%) | 4.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20124887 |
Minimum | 20111227 |
---|---|
Maximum | 20160730 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20111227 |
---|---|
5-th percentile | 20111227 |
Q1 | 20111227 |
median | 20120530 |
Q3 | 20140724 |
95-th percentile | 20151202 |
Maximum | 20160730 |
Range | 49503 |
Interquartile range (IQR) | 29497 |
Descriptive statistics
Standard deviation | 16439.994 |
---|---|
Coefficient of variation (CV) | 0.00081689873 |
Kurtosis | -0.91818257 |
Mean | 20124887 |
Median Absolute Deviation (MAD) | 9303 |
Skewness | 0.79037545 |
Sum | 2.0124887 × 1011 |
Variance | 2.7027342 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20111227 | 4808 | |
20120825 | 807 | 8.1% |
20120530 | 277 | 2.8% |
20150116 | 214 | 2.1% |
20121222 | 211 | 2.1% |
20140724 | 197 | 2.0% |
20150107 | 139 | 1.4% |
20141115 | 109 | 1.1% |
20141217 | 80 | 0.8% |
20120920 | 80 | 0.8% |
Other values (448) | 3078 |
Value | Count | Frequency (%) |
20111227 | 4808 | |
20120102 | 6 | 0.1% |
20120104 | 1 | < 0.1% |
20120110 | 4 | < 0.1% |
20120111 | 1 | < 0.1% |
20120112 | 7 | 0.1% |
20120113 | 2 | < 0.1% |
20120117 | 3 | < 0.1% |
20120119 | 1 | < 0.1% |
20120120 | 1 | < 0.1% |
Value | Count | Frequency (%) |
20160730 | 9 | |
20160727 | 1 | < 0.1% |
20160726 | 5 | 0.1% |
20160723 | 15 | |
20160720 | 10 | |
20160716 | 1 | < 0.1% |
20160712 | 1 | < 0.1% |
20160709 | 11 | |
20160706 | 7 | |
20160702 | 3 | < 0.1% |
지역지구구역_구분_코드 | 대표_여부 | 작업_일자 | |
---|---|---|---|
지역지구구역_구분_코드 | 1.000 | 0.079 | 0.193 |
대표_여부 | 0.079 | 1.000 | 0.106 |
작업_일자 | 0.193 | 0.106 | 1.000 |
지역지구구역_구분_코드 | 대표_여부 | |
---|---|---|
지역지구구역_구분_코드 | 1.000 | 0.132 |
대표_여부 | 0.132 | 1.000 |
작업_일자 | 지역지구구역_구분_코드 | 대표_여부 | |
---|---|---|---|
작업_일자 | 1.000 | 0.087 | 0.142 |
지역지구구역_구분_코드 | 0.087 | 1.000 | 0.132 |
대표_여부 | 0.142 | 0.132 | 1.000 |
관리_지역지구구역 | 관리_건축물대장 | 지역지구구역_구분_코드 | 지역지구구역_코드 | 대표_여부 | 기타_지역지구구역 | 작업_일자 | |
---|---|---|---|---|---|---|---|
18753 | 11620-46961 | 11620-30925 | 1 | 1022 | 1 | 2종일반주거지역 | 20121222 |
27652 | 11620-35514 | 11620-23885 | 2 | 111 | 1 | 고도지구기타 | 20151202 |
15685 | 11140-21174 | 11140-412 | 1 | 1120 | 1 | <NA> | 20120825 |
10093 | 11380-7844 | 11380-23504 | 1 | 1020 | 1 | 일반주거지역 | 20111227 |
21326 | 11530-13240 | 11530-589 | 2 | 150 | 1 | 공항고도지구<진입표면> | 20140521 |
18538 | 11620-30456 | 11620-20734 | 2 | 111 | 1 | 고도지구기타 | 20121222 |
25613 | 11110-10277 | 11110-20350 | 1 | 1022 | 1 | 2종일반주거지역 | 20150403 |
6165 | 11140-14590 | 11140-19227 | 1 | <NA> | 1 | 일반주거 | 20111227 |
16578 | 11380-2898 | 11380-10274 | 1 | 1020 | 1 | 일반주거지역 | 20120825 |
27740 | 11530-4490 | 11530-10875 | 1 | 1020 | 1 | 일반주거지역 | 20151216 |
관리_지역지구구역 | 관리_건축물대장 | 지역지구구역_구분_코드 | 지역지구구역_코드 | 대표_여부 | 기타_지역지구구역 | 작업_일자 | |
---|---|---|---|---|---|---|---|
8152 | 11545-22766 | 11545-12370 | 2 | 990 | 1 | 고도지구기타(공항고도지구) | 20111227 |
20794 | 11170-100058814 | 11170-19118 | 1 | 1022 | 1 | <NA> | 20140131 |
12187 | 11545-28382 | 11545-15097 | 1 | 1022 | 1 | 2종일반주거지역 | 20111227 |
21429 | 11140-18042 | 11140-22905 | 1 | 1120 | 1 | <NA> | 20140618 |
5234 | 11620-6874 | 11620-6100 | 1 | 1023 | 1 | 3종일반주거지역 | 20111227 |
19936 | 11380-2127 | 11380-6819 | 1 | 1020 | 1 | 일반주거지역 | 20131008 |
7775 | 11620-30936 | 11620-21029 | 2 | 112 | 1 | 고도지구기타 | 20111227 |
14268 | 11260-833 | 11260-2250 | 1 | 1020 | 1 | 일반주거 | 20120427 |
3581 | 11620-48947 | 11620-32129 | 3 | 301 | 1 | 제1종지구단위계획구역 | 20111227 |
26800 | 11260-9054 | 11260-16401 | 1 | 1020 | 1 | 일반주거지역 | 20150801 |