Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 50 |
Missing cells (%) | 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 654.3 KiB |
Average record size in memory | 67.0 B |
Variable types
Text | 4 |
---|---|
Categorical | 3 |
Dataset
Description | 관리_지역지구구역_pk,관리_허가대장_pk,지역지구구역_구분_코드,지역지구구역_코드,대표_여부,주_동_구분_코드,지역지구구역_명 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15667/S/1/datasetView.do |
지역지구구역_구분_코드 is highly overall correlated with 주_동_구분_코드 | High correlation |
대표_여부 is highly overall correlated with 주_동_구분_코드 | High correlation |
주_동_구분_코드 is highly overall correlated with 지역지구구역_구분_코드 and 1 other fields | High correlation |
주_동_구분_코드 is highly imbalanced (78.9%) | Imbalance |
관리_지역지구구역_pk has unique values | Unique |
Reproduction
Analysis started | 2024-05-03 23:34:47.123177 |
---|---|
Analysis finished | 2024-05-03 23:34:49.793589 |
Duration | 2.67 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
관리_지역지구구역_pk
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 16.3506 |
Min length | 7 |
Characters and Unicode
Total characters | 163506 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11170-100020409 |
---|---|
2nd row | 11110-100069912 |
3rd row | 11110-1000000000000000090558 |
4th row | 11140-694 |
5th row | 11110-100099204 |
Value | Count | Frequency (%) |
11170-100020409 | 1 | < 0.1% |
11140-100093241 | 1 | < 0.1% |
11110-100049929 | 1 | < 0.1% |
11110-100093095 | 1 | < 0.1% |
11110-4440 | 1 | < 0.1% |
11170-1000000000000000655589 | 1 | < 0.1% |
11140-1000000000000000496664 | 1 | < 0.1% |
11110-100099223 | 1 | < 0.1% |
11110-100039518 | 1 | < 0.1% |
11110-100112428 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 59173 | |
1 | 50983 | |
- | 10000 | 6.1% |
4 | 8178 | 5.0% |
7 | 5748 | 3.5% |
2 | 5275 | 3.2% |
3 | 4948 | 3.0% |
9 | 4871 | 3.0% |
8 | 4834 | 3.0% |
5 | 4763 | 2.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 153506 | |
Dash Punctuation | 10000 | 6.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 59173 | |
1 | 50983 | |
4 | 8178 | 5.3% |
7 | 5748 | 3.7% |
2 | 5275 | 3.4% |
3 | 4948 | 3.2% |
9 | 4871 | 3.2% |
8 | 4834 | 3.1% |
5 | 4763 | 3.1% |
6 | 4733 | 3.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 163506 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 59173 | |
1 | 50983 | |
- | 10000 | 6.1% |
4 | 8178 | 5.0% |
7 | 5748 | 3.5% |
2 | 5275 | 3.2% |
3 | 4948 | 3.0% |
9 | 4871 | 3.0% |
8 | 4834 | 3.0% |
5 | 4763 | 2.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 163506 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 59173 | |
1 | 50983 | |
- | 10000 | 6.1% |
4 | 8178 | 5.0% |
7 | 5748 | 3.5% |
2 | 5275 | 3.2% |
3 | 4948 | 3.0% |
9 | 4871 | 3.0% |
8 | 4834 | 3.0% |
5 | 4763 | 2.9% |
관리_허가대장_pk
Text
Distinct | 7800 |
---|---|
Distinct (%) | 78.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 15.6822 |
Min length | 7 |
Characters and Unicode
Total characters | 156822 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 6253 ? |
---|---|
Unique (%) | 62.5% |
Sample
1st row | 11170-100022091 |
---|---|
2nd row | 11110-100029060 |
3rd row | 11110-100056172 |
4th row | 11140-476 |
5th row | 11110-100046752 |
Value | Count | Frequency (%) |
11110-100061913 | 8 | 0.1% |
11110-100035151 | 8 | 0.1% |
11140-100079198 | 8 | 0.1% |
11140-1000000000000000319010 | 8 | 0.1% |
11110-100025597 | 8 | 0.1% |
11000-1000000000000000236251 | 8 | 0.1% |
11140-1000000000000000262122 | 7 | 0.1% |
11110-100053933 | 7 | 0.1% |
11110-100061514 | 7 | 0.1% |
11140-100057454 | 6 | 0.1% |
Other values (7790) | 9925 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 53893 | |
1 | 49980 | |
- | 10000 | 6.4% |
4 | 8449 | 5.4% |
2 | 6097 | 3.9% |
3 | 5789 | 3.7% |
7 | 5466 | 3.5% |
5 | 5276 | 3.4% |
9 | 4336 | 2.8% |
8 | 3876 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 146822 | |
Dash Punctuation | 10000 | 6.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 53893 | |
1 | 49980 | |
4 | 8449 | 5.8% |
2 | 6097 | 4.2% |
3 | 5789 | 3.9% |
7 | 5466 | 3.7% |
5 | 5276 | 3.6% |
9 | 4336 | 3.0% |
8 | 3876 | 2.6% |
6 | 3660 | 2.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 156822 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 53893 | |
1 | 49980 | |
- | 10000 | 6.4% |
4 | 8449 | 5.4% |
2 | 6097 | 3.9% |
3 | 5789 | 3.7% |
7 | 5466 | 3.5% |
5 | 5276 | 3.4% |
9 | 4336 | 2.8% |
8 | 3876 | 2.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 156822 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 53893 | |
1 | 49980 | |
- | 10000 | 6.4% |
4 | 8449 | 5.4% |
2 | 6097 | 3.9% |
3 | 5789 | 3.7% |
7 | 5466 | 3.5% |
5 | 5276 | 3.4% |
9 | 4336 | 2.8% |
8 | 3876 | 2.5% |
지역지구구역_구분_코드
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | |
---|---|
3 | |
2 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 4781 | |
3 | 2719 | |
2 | 2500 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 4781 | |
3 | 2719 | |
2 | 2500 |
지역지구구역_코드
Text
Distinct | 143 |
---|---|
Distinct (%) | 1.4% |
Missing | 25 |
Missing (%) | 0.2% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
uqa220 | 722 | 7.2% |
uqa001 | 544 | 5.5% |
uqq300 | 501 | 5.0% |
zq0001 | 494 | 5.0% |
uqa122 | 479 | 4.8% |
uqi100 | 467 | 4.7% |
1120 | 457 | 4.6% |
uoa120 | 401 | 4.0% |
uqq310 | 386 | 3.9% |
uqa121 | 316 | 3.2% |
Other values (133) | 5208 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 13452 | |
1 | 10122 | |
Q | 6456 | |
U | 6343 | |
2 | 5733 | |
A | 3774 | 7.1% |
3 | 1873 | 3.5% |
Z | 1124 | 2.1% |
O | 552 | 1.0% |
I | 479 | 0.9% |
Other values (24) | 3452 | 6.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 32142 | |
Uppercase Letter | 21218 |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
Q | 6456 | |
U | 6343 | |
A | 3774 | |
Z | 1124 | 5.3% |
O | 552 | 2.6% |
I | 479 | 2.3% |
H | 375 | 1.8% |
D | 372 | 1.8% |
G | 367 | 1.7% |
X | 315 | 1.5% |
Other values (14) | 1061 | 5.0% |
Decimal Number
Value | Count | Frequency (%) |
0 | 13452 | |
1 | 10122 | |
2 | 5733 | |
3 | 1873 | 5.8% |
4 | 308 | 1.0% |
6 | 259 | 0.8% |
9 | 148 | 0.5% |
7 | 121 | 0.4% |
8 | 83 | 0.3% |
5 | 43 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 32142 | |
Latin | 21218 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
Q | 6456 | |
U | 6343 | |
A | 3774 | |
Z | 1124 | 5.3% |
O | 552 | 2.6% |
I | 479 | 2.3% |
H | 375 | 1.8% |
D | 372 | 1.8% |
G | 367 | 1.7% |
X | 315 | 1.5% |
Other values (14) | 1061 | 5.0% |
Common
Value | Count | Frequency (%) |
0 | 13452 | |
1 | 10122 | |
2 | 5733 | |
3 | 1873 | 5.8% |
4 | 308 | 1.0% |
6 | 259 | 0.8% |
9 | 148 | 0.5% |
7 | 121 | 0.4% |
8 | 83 | 0.3% |
5 | 43 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 53360 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 13452 | |
1 | 10122 | |
Q | 6456 | |
U | 6343 | |
2 | 5733 | |
A | 3774 | 7.1% |
3 | 1873 | 3.5% |
Z | 1124 | 2.1% |
O | 552 | 1.0% |
I | 479 | 0.9% |
Other values (24) | 3452 | 6.5% |
대표_여부
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | |
---|---|
0 | |
<NA> | 334 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.1002 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 1 |
4th row | 1 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
1 | 6614 | |
0 | 3052 | |
<NA> | 334 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 6614 | |
0 | 3052 | |
na | 334 | 3.3% |
주_동_구분_코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
<NA> | 334 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.1002 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9666 | |
<NA> | 334 | 3.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9666 | |
na | 334 | 3.3% |
지역지구구역_명
Text
Distinct | 120 |
---|---|
Distinct (%) | 1.2% |
Missing | 25 |
Missing (%) | 0.2% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
일반상업지역 | 1179 | 11.4% |
도시지역 | 912 | 8.8% |
제2종일반주거지역 | 720 | 6.9% |
방화지구 | 691 | 6.7% |
지구단위계획구역 | 528 | 5.1% |
제1종지구단위계획구역 | 511 | 4.9% |
중점경관관리구역 | 494 | 4.8% |
중심지미관지구 | 442 | 4.3% |
제1종일반주거지역 | 438 | 4.2% |
최고고도지구 | 302 | 2.9% |
Other values (116) | 4144 |
Most occurring characters
Value | Count | Frequency (%) |
지 | 8396 | 12.2% |
역 | 7734 | 11.2% |
구 | 6324 | 9.2% |
일 | 2951 | 4.3% |
반 | 2951 | 4.3% |
제 | 2447 | 3.6% |
종 | 2128 | 3.1% |
주 | 2081 | 3.0% |
거 | 2071 | 3.0% |
관 | 1894 | 2.8% |
Other values (129) | 29880 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 66086 | |
Decimal Number | 2254 | 3.3% |
Space Separator | 386 | 0.6% |
Close Punctuation | 55 | 0.1% |
Open Punctuation | 55 | 0.1% |
Other Punctuation | 21 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
지 | 8396 | 12.7% |
역 | 7734 | 11.7% |
구 | 6324 | 9.6% |
일 | 2951 | 4.5% |
반 | 2951 | 4.5% |
제 | 2447 | 3.7% |
종 | 2128 | 3.2% |
주 | 2081 | 3.1% |
거 | 2071 | 3.1% |
관 | 1894 | 2.9% |
Other values (121) | 27109 |
Decimal Number
Value | Count | Frequency (%) |
1 | 1097 | |
2 | 748 | |
3 | 282 | 12.5% |
4 | 127 | 5.6% |
Space Separator
Value | Count | Frequency (%) |
386 |
Close Punctuation
Value | Count | Frequency (%) |
) | 55 |
Open Punctuation
Value | Count | Frequency (%) |
( | 55 |
Other Punctuation
Value | Count | Frequency (%) |
? | 21 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 66086 | |
Common | 2771 | 4.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
지 | 8396 | 12.7% |
역 | 7734 | 11.7% |
구 | 6324 | 9.6% |
일 | 2951 | 4.5% |
반 | 2951 | 4.5% |
제 | 2447 | 3.7% |
종 | 2128 | 3.2% |
주 | 2081 | 3.1% |
거 | 2071 | 3.1% |
관 | 1894 | 2.9% |
Other values (121) | 27109 |
Common
Value | Count | Frequency (%) |
1 | 1097 | |
2 | 748 | |
386 | 13.9% | |
3 | 282 | 10.2% |
4 | 127 | 4.6% |
) | 55 | 2.0% |
( | 55 | 2.0% |
? | 21 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 66086 | |
ASCII | 2771 | 4.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
지 | 8396 | 12.7% |
역 | 7734 | 11.7% |
구 | 6324 | 9.6% |
일 | 2951 | 4.5% |
반 | 2951 | 4.5% |
제 | 2447 | 3.7% |
종 | 2128 | 3.2% |
주 | 2081 | 3.1% |
거 | 2071 | 3.1% |
관 | 1894 | 2.9% |
Other values (121) | 27109 |
ASCII
Value | Count | Frequency (%) |
1 | 1097 | |
2 | 748 | |
386 | 13.9% | |
3 | 282 | 10.2% |
4 | 127 | 4.6% |
) | 55 | 2.0% |
( | 55 | 2.0% |
? | 21 | 0.8% |
지역지구구역_구분_코드 | 대표_여부 | |
---|---|---|
지역지구구역_구분_코드 | 1.000 | 0.047 |
대표_여부 | 0.047 | 1.000 |
지역지구구역_구분_코드 | 대표_여부 | 주_동_구분_코드 | |
---|---|---|---|
지역지구구역_구분_코드 | 1.000 | 0.077 | 1.000 |
대표_여부 | 0.077 | 1.000 | 1.000 |
주_동_구분_코드 | 1.000 | 1.000 | 1.000 |
지역지구구역_구분_코드 | 대표_여부 | 주_동_구분_코드 | |
---|---|---|---|
지역지구구역_구분_코드 | 1.000 | 0.077 | 1.000 |
대표_여부 | 0.077 | 1.000 | 1.000 |
주_동_구분_코드 | 1.000 | 1.000 | 1.000 |
관리_지역지구구역_pk | 관리_허가대장_pk | 지역지구구역_구분_코드 | 지역지구구역_코드 | 대표_여부 | 주_동_구분_코드 | 지역지구구역_명 | |
---|---|---|---|---|---|---|---|
99711 | 11170-100020409 | 11170-100022091 | 1 | UQA220 | 0 | 0 | 일반상업지역 |
27491 | 11110-100069912 | 11110-100029060 | 2 | UQG110 | 0 | 0 | 중심지미관지구 |
1513 | 11110-1000000000000000090558 | 11110-100056172 | 2 | UQH100 | 1 | 0 | 고도지구 |
88434 | 11140-694 | 11140-476 | 1 | 1120 | 1 | 0 | 일반상업지역 |
36610 | 11110-100099204 | 11110-100046752 | 1 | ZA0014 | 0 | 0 | 역사도심 |
21121 | 11110-100043918 | 11110-100022049 | 3 | UOA110 | <NA> | <NA> | 절대정화구역 |
25315 | 11110-100060895 | 11110-100027389 | 1 | UQA220 | 1 | 0 | 일반상업지역 |
69205 | 11140-100042851 | 11140-100044752 | 2 | UQH110 | 1 | 0 | 최고고도지구 |
54587 | 11110-9130 | 11110-4622 | 1 | 1022 | 1 | 0 | 제2종일반주거지역 |
22005 | 11110-100047657 | 11110-100016192 | 2 | UQI100 | 0 | 0 | 방화지구 |
관리_지역지구구역_pk | 관리_허가대장_pk | 지역지구구역_구분_코드 | 지역지구구역_코드 | 대표_여부 | 주_동_구분_코드 | 지역지구구역_명 | |
---|---|---|---|---|---|---|---|
42442 | 11110-100113826 | 11110-100054272 | 1 | UQA430 | 1 | 0 | 자연녹지지역 |
51685 | 11110-6004 | 11110-3042 | 1 | 1021 | 1 | 0 | 제1종일반주거지역 |
29605 | 11110-100077923 | 11110-100026919 | 2 | UQI100 | 1 | 0 | 방화지구 |
96705 | 11170-100003911 | 11170-100005980 | 1 | 1022 | 0 | 0 | 제2종일반주거지역 |
78036 | 11140-100084485 | 11140-100077218 | 1 | ZA0014 | 0 | 0 | 역사도심 |
48781 | 11110-3193 | 11110-1440 | 1 | 1021 | 1 | 0 | 제1종일반주거지역 |
82841 | 11140-1315 | 11140-1061 | 1 | 1020 | 0 | 0 | 일반주거지역 |
9754 | 11110-100007695 | 11110-100005514 | 1 | 1011 | 1 | 0 | 제1종전용주거지역 |
77780 | 11140-100083502 | 11140-100078119 | 1 | UQA220 | 1 | 0 | 일반상업지역 |
45419 | 11110-100121209 | 11110-100059732 | 1 | UQA111 | 1 | 0 | 제1종전용주거지역 |