Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 8 |
Missing cells (%) | < 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 742.2 KiB |
Average record size in memory | 76.0 B |
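The derived figures in this table can be cross-checked from the raw counts. The sketch below assumes the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is total memory divided by the number of observations. The variable names are illustrative and are not taken from the report.

```python
# Raw figures copied from the "Dataset statistics" table.
n_variables = 8
n_observations = 10_000
missing_cells = 8
total_size_kib = 742.2

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: average record size = total bytes / number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.2f}% of {total_cells} cells")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the missing share is well below 0.1%, which matches the "< 0.1%" shown, and the record size rounds to the 76.0 B shown.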
Variable types
Text | 4 |
---|---|
Categorical | 4 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15667/S/1/datasetView.do |
Reproduction
Analysis started | 2024-04-20 21:19:03.147797 |
---|---|
Analysis finished | 2024-04-20 21:19:05.041256 |
Duration | 1.89 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
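The reported duration is the difference between the two timestamps above. A minimal check, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2024-04-20 21:19:03.147797")
finished = datetime.fromisoformat("2024-04-20 21:19:05.041256")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```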
관리_지역지구구역
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 15 |
---|---|
Median length | 10 |
Mean length | 10.5604 |
Min length | 7 |
Characters and Unicode
Total characters | 105604 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 10000 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11380-5952 |
---|---|
2nd row | 11380-14346 |
3rd row | 11305-2360 |
4th row | 11320-2938 |
5th row | 11410-5113 |
Value | Count | Frequency (%) |
11380-5952 | 1 | < 0.1% |
11380-14856 | 1 | < 0.1% |
11380-7668 | 1 | < 0.1% |
11305-100000888 | 1 | < 0.1% |
11305-6174 | 1 | < 0.1% |
11410-2917 | 1 | < 0.1% |
11410-128 | 1 | < 0.1% |
11380-13225 | 1 | < 0.1% |
11440-1803 | 1 | < 0.1% |
11350-2264 | 1 | < 0.1% |
Other values (9990) | 9990 | 99.9% |
Most occurring characters
Value | Count | Frequency (%) |
1 | 28797 | 27.3% |
0 | 16832 | 15.9% |
3 | 12721 | 12.0% |
- | 10000 | 9.5% |
8 | 7147 | 6.8% |
2 | 6764 | 6.4% |
5 | 6438 | 6.1% |
4 | 6269 | 5.9% |
6 | 4010 | 3.8% |
9 | 3387 | 3.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 95604 | 90.5% |
Dash Punctuation | 10000 | 9.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 28797 | 30.1% |
0 | 16832 | 17.6% |
3 | 12721 | 13.3% |
8 | 7147 | 7.5% |
2 | 6764 | 7.1% |
5 | 6438 | 6.7% |
4 | 6269 | 6.6% |
6 | 4010 | 4.2% |
9 | 3387 | 3.5% |
7 | 3239 | 3.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 105604 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 28797 | 27.3% |
0 | 16832 | 15.9% |
3 | 12721 | 12.0% |
- | 10000 | 9.5% |
8 | 7147 | 6.8% |
2 | 6764 | 6.4% |
5 | 6438 | 6.1% |
4 | 6269 | 5.9% |
6 | 4010 | 3.8% |
9 | 3387 | 3.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 105604 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 28797 | 27.3% |
0 | 16832 | 15.9% |
3 | 12721 | 12.0% |
- | 10000 | 9.5% |
8 | 7147 | 6.8% |
2 | 6764 | 6.4% |
5 | 6438 | 6.1% |
4 | 6269 | 5.9% |
6 | 4010 | 3.8% |
9 | 3387 | 3.2% |
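The character, category, and frequency tables above can be approximated with the Python standard library. This is a sketch rather than the profiler's own implementation: it assumes each character is classified by its Unicode general category (`Nd` corresponds to "Decimal Number", `Pd` to "Dash Punctuation") and uses only the five sample values listed for this variable, so the counts differ from the full-column figures.

```python
import unicodedata
from collections import Counter

# The five sample values listed for this variable.
samples = ["11380-5952", "11380-14346", "11305-2360", "11320-2938", "11410-5113"]

char_counts = Counter()
category_counts = Counter()
for value in samples:
    for ch in value:
        char_counts[ch] += 1
        # Two-letter Unicode general category, e.g. "Nd" or "Pd".
        category_counts[unicodedata.category(ch)] += 1

total = sum(char_counts.values())
for ch, n in char_counts.most_common(3):
    print(f"{ch!r}: {n} ({100 * n / total:.1f}%)")
print(dict(category_counts))
```

Running the same loop over the whole column would give the totals that the tables report per character, category, script, and block.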
관리_허가대장
Text
Distinct | 8717 |
---|---|
Distinct (%) | 87.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 15 |
---|---|
Median length | 10 |
Mean length | 10.2658 |
Min length | 7 |
Characters and Unicode
Total characters | 102658 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 7537 |
---|---|
Unique (%) | 75.4% |
Sample
1st row | 11380-3828 |
---|---|
2nd row | 11380-9878 |
3rd row | 11305-1523 |
4th row | 11320-2380 |
5th row | 11410-3899 |
Value | Count | Frequency (%) |
11320-2423 | 4 | < 0.1% |
11320-2601 | 4 | < 0.1% |
11320-2398 | 4 | < 0.1% |
11305-2914 | 4 | < 0.1% |
11320-2067 | 4 | < 0.1% |
11320-3001 | 4 | < 0.1% |
11305-2054 | 4 | < 0.1% |
11380-12445 | 4 | < 0.1% |
11380-9813 | 4 | < 0.1% |
11305-2847 | 3 | < 0.1% |
Other values (8707) | 9961 | 99.6% |
Most occurring characters
Value | Count | Frequency (%) |
1 | 27553 | 26.8% |
0 | 15741 | 15.3% |
3 | 12158 | 11.8% |
- | 10000 | 9.7% |
8 | 7168 | 7.0% |
2 | 7044 | 6.9% |
5 | 6357 | 6.2% |
4 | 5830 | 5.7% |
6 | 3791 | 3.7% |
9 | 3633 | 3.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 92658 | 90.3% |
Dash Punctuation | 10000 | 9.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 27553 | 29.7% |
0 | 15741 | 17.0% |
3 | 12158 | 13.1% |
8 | 7168 | 7.7% |
2 | 7044 | 7.6% |
5 | 6357 | 6.9% |
4 | 5830 | 6.3% |
6 | 3791 | 4.1% |
9 | 3633 | 3.9% |
7 | 3383 | 3.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 102658 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 27553 | 26.8% |
0 | 15741 | 15.3% |
3 | 12158 | 11.8% |
- | 10000 | 9.7% |
8 | 7168 | 7.0% |
2 | 7044 | 6.9% |
5 | 6357 | 6.2% |
4 | 5830 | 5.7% |
6 | 3791 | 3.7% |
9 | 3633 | 3.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 102658 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 27553 | 26.8% |
0 | 15741 | 15.3% |
3 | 12158 | 11.8% |
- | 10000 | 9.7% |
8 | 7168 | 7.0% |
2 | 7044 | 6.9% |
5 | 6357 | 6.2% |
4 | 5830 | 5.7% |
6 | 3791 | 3.7% |
9 | 3633 | 3.5% |
지역지구구역_구분_코드
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | 7851 |
---|---|
2 | 1511 |
3 | 638 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 3 |
3rd row | 2 |
4th row | 2 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 7851 | 78.5% |
2 | 1511 | 15.1% |
3 | 638 | 6.4% |
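The percentages in this table are consistent with each count divided by the total number of observations. A minimal sketch of that calculation, using the counts from the table (the dictionary name is illustrative):

```python
# Counts copied from the "Common Values" table for this variable.
counts = {"1": 7851, "2": 1511, "3": 638}

total = sum(counts.values())

# Frequency of each value as a percentage, rounded to one decimal place.
freq = {value: round(100 * n / total, 1) for value, n in counts.items()}
print(total, freq)
```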
지역지구구역_코드
Text
Distinct | 65 |
---|---|
Distinct (%) | 0.7% |
Missing | 4 |
Missing (%) | < 0.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
1020 | 4208 | 42.1% |
1022 | 1260 | 12.6% |
0100 | 804 | 8.0% |
260 | 421 | 4.2% |
1030 | 419 | 4.2% |
1023 | 380 | 3.8% |
112 | 283 | 2.8% |
1330 | 222 | 2.2% |
1120 | 186 | 1.9% |
103 | 184 | 1.8% |
Other values (55) | 1629 | 16.3% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 16100 | 42.5% |
1 | 9800 | 25.9% |
2 | 8791 | 23.2% |
3 | 1948 | 5.1% |
6 | 431 | 1.1% |
9 | 347 | 0.9% |
7 | 222 | 0.6% |
8 | 71 | 0.2% |
4 | 52 | 0.1% |
Z | 49 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 37790 | 99.9% |
Uppercase Letter | 49 | 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 16100 | 42.6% |
1 | 9800 | 25.9% |
2 | 8791 | 23.3% |
3 | 1948 | 5.2% |
6 | 431 | 1.1% |
9 | 347 | 0.9% |
7 | 222 | 0.6% |
8 | 71 | 0.2% |
4 | 52 | 0.1% |
5 | 28 | 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
Z | 49 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 37790 | 99.9% |
Latin | 49 | 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 16100 | 42.6% |
1 | 9800 | 25.9% |
2 | 8791 | 23.3% |
3 | 1948 | 5.2% |
6 | 431 | 1.1% |
9 | 347 | 0.9% |
7 | 222 | 0.6% |
8 | 71 | 0.2% |
4 | 52 | 0.1% |
5 | 28 | 0.1% |
Latin
Value | Count | Frequency (%) |
Z | 49 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 37839 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 16100 | 42.5% |
1 | 9800 | 25.9% |
2 | 8791 | 23.2% |
3 | 1948 | 5.1% |
6 | 431 | 1.1% |
9 | 347 | 0.9% |
7 | 222 | 0.6% |
8 | 71 | 0.2% |
4 | 52 | 0.1% |
Z | 49 | 0.1% |
대표_여부
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 | 8931 |
---|---|
0 | 1064 |
<NA> | 5 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0015 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
1 | 8931 | 89.3% |
0 | 1064 | 10.6% |
<NA> | 5 | 0.1% |
주_동_구분_코드
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 10000 |
---|---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 10000 | 100.0% |
지역지구구역_명
Text
Distinct | 72 |
---|---|
Distinct (%) | 0.7% |
Missing | 4 |
Missing (%) | < 0.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
일반주거지역 | 4208 | 41.8% |
제2종일반주거지역 | 1260 | 12.5% |
도시지역 | 804 | 8.0% |
주차장정비지구 | 421 | 4.2% |
준주거지역 | 419 | 4.2% |
제3종일반주거지역 | 380 | 3.8% |
최고고도지구 | 283 | 2.8% |
자연녹지지역 | 222 | 2.2% |
일반상업지역 | 186 | 1.8% |
일반미관지구 | 184 | 1.8% |
Other values (62) | 1699 | 16.9% |
Most occurring characters
Value | Count | Frequency (%) |
지 | 9965 | 15.3% |
역 | 8635 | 13.3% |
주 | 6975 | 10.7% |
거 | 6554 | 10.1% |
일 | 6380 | 9.8% |
반 | 6380 | 9.8% |
구 | 2373 | 3.6% |
제 | 2192 | 3.4% |
종 | 2043 | 3.1% |
2 | 1332 | 2.0% |
Other values (71) | 12229 | 18.8% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 62940 | 96.7% |
Decimal Number | 2048 | 3.1% |
Space Separator | 70 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
지 | 9965 | 15.8% |
역 | 8635 | 13.7% |
주 | 6975 | 11.1% |
거 | 6554 | 10.4% |
일 | 6380 | 10.1% |
반 | 6380 | 10.1% |
구 | 2373 | 3.8% |
제 | 2192 | 3.5% |
종 | 2043 | 3.2% |
도 | 1181 | 1.9% |
Other values (64) | 10262 | 16.3% |
Decimal Number
Value | Count | Frequency (%) |
2 | 1332 | 65.0% |
3 | 380 | 18.6% |
1 | 306 | 14.9% |
4 | 25 | 1.2% |
5 | 4 | 0.2% |
6 | 1 | < 0.1% |
Space Separator
Value | Count | Frequency (%) |
(space) | 70 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 62940 | 96.7% |
Common | 2118 | 3.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
지 | 9965 | 15.8% |
역 | 8635 | 13.7% |
주 | 6975 | 11.1% |
거 | 6554 | 10.4% |
일 | 6380 | 10.1% |
반 | 6380 | 10.1% |
구 | 2373 | 3.8% |
제 | 2192 | 3.5% |
종 | 2043 | 3.2% |
도 | 1181 | 1.9% |
Other values (64) | 10262 | 16.3% |
Common
Value | Count | Frequency (%) |
2 | 1332 | 62.9% |
3 | 380 | 17.9% |
1 | 306 | 14.4% |
(space) | 70 | 3.3% |
4 | 25 | 1.2% |
5 | 4 | 0.2% |
6 | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 62940 | 96.7% |
ASCII | 2118 | 3.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
지 | 9965 | 15.8% |
역 | 8635 | 13.7% |
주 | 6975 | 11.1% |
거 | 6554 | 10.4% |
일 | 6380 | 10.1% |
반 | 6380 | 10.1% |
구 | 2373 | 3.8% |
제 | 2192 | 3.5% |
종 | 2043 | 3.2% |
도 | 1181 | 1.9% |
Other values (64) | 10262 | 16.3% |
ASCII
Value | Count | Frequency (%) |
2 | 1332 | 62.9% |
3 | 380 | 17.9% |
1 | 306 | 14.4% |
(space) | 70 | 3.3% |
4 | 25 | 1.2% |
5 | 4 | 0.2% |
6 | 1 | < 0.1% |
작업_일자
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
20111227 | 10000 |
---|---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20111227 |
---|---|
2nd row | 20111227 |
3rd row | 20111227 |
4th row | 20111227 |
5th row | 20111227 |
Common Values
Value | Count | Frequency (%) |
20111227 | 10000 | 100.0% |
Variable | 지역지구구역_구분_코드 | 지역지구구역_코드 | 대표_여부 | 지역지구구역_명 |
---|---|---|---|---|
지역지구구역_구분_코드 | 1.000 | 0.995 | 0.038 | 1.000 |
지역지구구역_코드 | 0.995 | 1.000 | 0.518 | 1.000 |
대표_여부 | 0.038 | 0.518 | 1.000 | 0.544 |
지역지구구역_명 | 1.000 | 1.000 | 0.544 | 1.000 |
Variable | 지역지구구역_구분_코드 | 대표_여부 |
---|---|---|
지역지구구역_구분_코드 | 1.000 | 0.063 |
대표_여부 | 0.063 | 1.000 |
Row index | 관리_지역지구구역 | 관리_허가대장 | 지역지구구역_구분_코드 | 지역지구구역_코드 | 대표_여부 | 주_동_구분_코드 | 지역지구구역_명 | 작업_일자 |
---|---|---|---|---|---|---|---|---|
26570 | 11380-5952 | 11380-3828 | 1 | 1020 | 1 | 0 | 일반주거지역 | 20111227 |
19707 | 11380-14346 | 11380-9878 | 3 | 300 | 1 | 0 | 지구단위계획구역 | 20111227 |
6609 | 11305-2360 | 11305-1523 | 2 | 112 | 1 | 0 | 최고고도지구 | 20111227 |
2539 | 11320-2938 | 11320-2380 | 2 | 102 | 1 | 0 | 역사문화미관지구 | 20111227 |
24492 | 11410-5113 | 11410-3899 | 1 | 1021 | 0 | 0 | 제1종일반주거지역 | 20111227 |
5856 | 11305-100003449 | 11305-100007980 | 1 | 1022 | 1 | 0 | 제2종일반주거지역 | 20111227 |
15841 | 11380-13101 | 11380-8941 | 1 | 1020 | 1 | 0 | 일반주거지역 | 20111227 |
12914 | 11380-17202 | 11380-11981 | 2 | 103 | 1 | 0 | 일반미관지구 | 20111227 |
24312 | 11410-2043 | 11410-1525 | 1 | 0100 | 1 | 0 | 도시지역 | 20111227 |
3594 | 11350-2149 | 11350-1847 | 1 | 1020 | 1 | 0 | 일반주거지역 | 20111227 |
Row index | 관리_지역지구구역 | 관리_허가대장 | 지역지구구역_구분_코드 | 지역지구구역_코드 | 대표_여부 | 주_동_구분_코드 | 지역지구구역_명 | 작업_일자 |
---|---|---|---|---|---|---|---|---|
8274 | 11320-100002806 | 11320-100005286 | 1 | 1022 | 1 | 0 | 제2종일반주거지역 | 20111227 |
24027 | 11440-1071 | 11440-1818 | 1 | 1020 | 1 | 0 | 일반주거지역 | 20111227 |
18858 | 11380-100013123 | 11380-12446 | 1 | 1030 | 1 | 0 | 준주거지역 | 20111227 |
3937 | 11350-2828 | 11350-2559 | 1 | 1330 | 1 | 0 | 자연녹지지역 | 20111227 |
13707 | 11380-6392 | 11380-4047 | 1 | 1020 | 1 | 0 | 일반주거지역 | 20111227 |
23471 | 11440-100003865 | 11440-100005536 | 2 | 370 | 1 | 0 | 주거환경개선지구 | 20111227 |
6073 | 11320-3476 | 11320-2809 | 1 | 1120 | 1 | 0 | 일반상업지역 | 20111227 |
3998 | 11320-2659 | 11320-2138 | 1 | 1020 | 1 | 0 | 일반주거지역 | 20111227 |
19919 | 11380-11280 | 11380-7457 | 1 | 1020 | 1 | 0 | 일반주거지역 | 20111227 |
17126 | 11380-10302 | 11380-6662 | 1 | 1030 | 1 | 0 | 준주거지역 | 20111227 |