Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 121 |
Missing cells | 242 |
Missing cells (%) | 20.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 9.9 KiB |
Average record size in memory | 84.1 B |
Variable types
Numeric | 1 |
---|---|
Text | 2 |
Categorical | 5 |
Unsupported | 2 |
Dataset
Description | 순번,ID,도시계획코드,분류명,조서ID,고시ID,라벨명,고시일자,X좌표,Y좌표 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15527/S/1/datasetView.do |
조서ID has constant value "" | Constant |
고시ID has constant value "" | Constant |
고시일자 has constant value "" | Constant |
도시계획코드 is highly overall correlated with 분류명 | High correlation |
분류명 is highly overall correlated with 도시계획코드 | High correlation |
도시계획코드 is highly imbalanced (75.2%) | Imbalance |
분류명 is highly imbalanced (75.2%) | Imbalance |
X좌표 has 121 (100.0%) missing values | Missing |
Y좌표 has 121 (100.0%) missing values | Missing |
순번 has unique values | Unique |
ID has unique values | Unique |
라벨명 has unique values | Unique |
X좌표 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Y좌표 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-05-11 09:51:53.318129 |
---|---|
Analysis finished | 2024-05-11 09:51:55.107466 |
Duration | 1.79 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
순번
Real number (ℝ)
UNIQUE
 
Distinct | 121 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10904.785 |
Minimum | 10675 |
---|---|
Maximum | 10987 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.2 KiB |
Quantile statistics
Minimum | 10675 |
---|---|
5-th percentile | 10681 |
Q1 | 10897 |
median | 10927 |
Q3 | 10957 |
95-th percentile | 10981 |
Maximum | 10987 |
Range | 312 |
Interquartile range (IQR) | 60 |
Descriptive statistics
Standard deviation | 86.198338 |
---|---|
Coefficient of variation (CV) | 0.0079046342 |
Kurtosis | 2.5795798 |
Mean | 10904.785 |
Median Absolute Deviation (MAD) | 30 |
Skewness | -1.9102555 |
Sum | 1319479 |
Variance | 7430.1534 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10682 | 1 | 0.8% |
10987 | 1 | 0.8% |
10985 | 1 | 0.8% |
10984 | 1 | 0.8% |
10983 | 1 | 0.8% |
10982 | 1 | 0.8% |
10952 | 1 | 0.8% |
10951 | 1 | 0.8% |
10950 | 1 | 0.8% |
10949 | 1 | 0.8% |
Other values (111) | 111 |
Value | Count | Frequency (%) |
10675 | 1 | |
10676 | 1 | |
10677 | 1 | |
10678 | 1 | |
10679 | 1 | |
10680 | 1 | |
10681 | 1 | |
10682 | 1 | |
10683 | 1 | |
10684 | 1 |
Value | Count | Frequency (%) |
10987 | 1 | |
10986 | 1 | |
10985 | 1 | |
10984 | 1 | |
10983 | 1 | |
10982 | 1 | |
10981 | 1 | |
10980 | 1 | |
10979 | 1 | |
10978 | 1 |
ID
Text
UNIQUE
 
Distinct | 121 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
Value | Count | Frequency (%) |
생활권경계_109 | 1 | 0.8% |
생활권경계_006 | 1 | 0.8% |
생활권경계_046 | 1 | 0.8% |
생활권경계_045 | 1 | 0.8% |
생활권경계_040 | 1 | 0.8% |
생활권경계_020 | 1 | 0.8% |
생활권경계_075 | 1 | 0.8% |
생활권경계_069 | 1 | 0.8% |
생활권경계_068 | 1 | 0.8% |
생활권경계_067 | 1 | 0.8% |
Other values (111) | 111 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 130 | |
생 | 121 | |
활 | 121 | |
권 | 121 | |
경 | 121 | |
계 | 121 | |
_ | 121 | |
1 | 55 | |
2 | 24 | 2.2% |
9 | 22 | 2.0% |
Other values (6) | 132 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 605 | |
Decimal Number | 363 | |
Connector Punctuation | 121 | 11.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 130 | |
1 | 55 | |
2 | 24 | 6.6% |
9 | 22 | 6.1% |
3 | 22 | 6.1% |
4 | 22 | 6.1% |
5 | 22 | 6.1% |
7 | 22 | 6.1% |
8 | 22 | 6.1% |
6 | 22 | 6.1% |
Other Letter
Value | Count | Frequency (%) |
생 | 121 | |
활 | 121 | |
권 | 121 | |
경 | 121 | |
계 | 121 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 121 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 605 | |
Common | 484 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 130 | |
_ | 121 | |
1 | 55 | |
2 | 24 | 5.0% |
9 | 22 | 4.5% |
3 | 22 | 4.5% |
4 | 22 | 4.5% |
5 | 22 | 4.5% |
7 | 22 | 4.5% |
8 | 22 | 4.5% |
Hangul
Value | Count | Frequency (%) |
생 | 121 | |
활 | 121 | |
권 | 121 | |
경 | 121 | |
계 | 121 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 605 | |
ASCII | 484 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 130 | |
_ | 121 | |
1 | 55 | |
2 | 24 | 5.0% |
9 | 22 | 4.5% |
3 | 22 | 4.5% |
4 | 22 | 4.5% |
5 | 22 | 4.5% |
7 | 22 | 4.5% |
8 | 22 | 4.5% |
Hangul
Value | Count | Frequency (%) |
생 | 121 | |
활 | 121 | |
권 | 121 | |
경 | 121 | |
계 | 121 |
도시계획코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 1.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
ZON125 | |
---|---|
ZON121 | 5 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | ZON125 |
---|---|
2nd row | ZON125 |
3rd row | ZON125 |
4th row | ZON125 |
5th row | ZON125 |
Common Values
Value | Count | Frequency (%) |
ZON125 | 116 | |
ZON121 | 5 | 4.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
zon125 | 116 | |
zon121 | 5 | 4.1% |
분류명
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 1.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
지역생활권 | |
---|---|
권역생활권 | 5 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 지역생활권 |
---|---|
2nd row | 지역생활권 |
3rd row | 지역생활권 |
4th row | 지역생활권 |
5th row | 지역생활권 |
Common Values
Value | Count | Frequency (%) |
지역생활권 | 116 | |
권역생활권 | 5 | 4.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
지역생활권 | 116 | |
권역생활권 | 5 | 4.1% |
조서ID
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | |
---|---|
2nd row | |
3rd row | |
4th row | |
5th row |
Common Values
Value | Count | Frequency (%) |
121 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
No values found. |
고시ID
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | |
---|---|
2nd row | |
3rd row | |
4th row | |
5th row |
Common Values
Value | Count | Frequency (%) |
121 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
No values found. |
라벨명
Text
UNIQUE
 
Distinct | 121 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
Value | Count | Frequency (%) |
강동구_길동둔촌 | 1 | 0.8% |
종로구_청운효자 | 1 | 0.8% |
강북구_번동 | 1 | 0.8% |
강북구_미아 | 1 | 0.8% |
동대문구_장안 | 1 | 0.8% |
노원구_상계 | 1 | 0.8% |
영등포구_신길 | 1 | 0.8% |
구로구_고척개봉 | 1 | 0.8% |
강서구_공항방화 | 1 | 0.8% |
양천구_신월1 | 1 | 0.8% |
Other values (111) | 111 |
Most occurring characters
Value | Count | Frequency (%) |
구 | 124 | 14.6% |
_ | 116 | 13.6% |
동 | 29 | 3.4% |
강 | 24 | 2.8% |
서 | 19 | 2.2% |
권 | 17 | 2.0% |
성 | 15 | 1.8% |
활 | 13 | 1.5% |
생 | 13 | 1.5% |
포 | 13 | 1.5% |
Other values (150) | 468 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 728 | |
Connector Punctuation | 116 | 13.6% |
Decimal Number | 6 | 0.7% |
Other Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
구 | 124 | 17.0% |
동 | 29 | 4.0% |
강 | 24 | 3.3% |
서 | 19 | 2.6% |
권 | 17 | 2.3% |
성 | 15 | 2.1% |
활 | 13 | 1.8% |
생 | 13 | 1.8% |
포 | 13 | 1.8% |
대 | 13 | 1.8% |
Other values (146) | 448 |
Decimal Number
Value | Count | Frequency (%) |
1 | 3 | |
2 | 3 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 116 |
Other Punctuation
Value | Count | Frequency (%) |
? | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 728 | |
Common | 123 | 14.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
구 | 124 | 17.0% |
동 | 29 | 4.0% |
강 | 24 | 3.3% |
서 | 19 | 2.6% |
권 | 17 | 2.3% |
성 | 15 | 2.1% |
활 | 13 | 1.8% |
생 | 13 | 1.8% |
포 | 13 | 1.8% |
대 | 13 | 1.8% |
Other values (146) | 448 |
Common
Value | Count | Frequency (%) |
_ | 116 | |
1 | 3 | 2.4% |
2 | 3 | 2.4% |
? | 1 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 728 | |
ASCII | 123 | 14.5% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
구 | 124 | 17.0% |
동 | 29 | 4.0% |
강 | 24 | 3.3% |
서 | 19 | 2.6% |
권 | 17 | 2.3% |
성 | 15 | 2.1% |
활 | 13 | 1.8% |
생 | 13 | 1.8% |
포 | 13 | 1.8% |
대 | 13 | 1.8% |
Other values (146) | 448 |
ASCII
Value | Count | Frequency (%) |
_ | 116 | |
1 | 3 | 2.4% |
2 | 3 | 2.4% |
? | 1 | 0.8% |
고시일자
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | |
---|---|
2nd row | |
3rd row | |
4th row | |
5th row |
Common Values
Value | Count | Frequency (%) |
121 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
No values found. |
X좌표
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 121 |
---|---|
Missing (%) | 100.0% |
Memory size | 1.2 KiB |
Y좌표
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 121 |
---|---|
Missing (%) | 100.0% |
Memory size | 1.2 KiB |
순번 | 도시계획코드 | 분류명 | |
---|---|---|---|
순번 | 1.000 | 0.000 | 0.000 |
도시계획코드 | 0.000 | 1.000 | 0.986 |
분류명 | 0.000 | 0.986 | 1.000 |
분류명 | 도시계획코드 | |
---|---|---|
분류명 | 1.000 | 0.895 |
도시계획코드 | 0.895 | 1.000 |
순번 | 도시계획코드 | 분류명 | |
---|---|---|---|
순번 | 1.000 | 0.000 | 0.000 |
도시계획코드 | 0.000 | 1.000 | 0.895 |
분류명 | 0.000 | 0.895 | 1.000 |
순번 | ID | 도시계획코드 | 분류명 | 조서ID | 고시ID | 라벨명 | 고시일자 | X좌표 | Y좌표 | |
---|---|---|---|---|---|---|---|---|---|---|
0 | 10682 | 생활권경계_109 | ZON125 | 지역생활권 | 강동구_길동둔촌 | <NA> | <NA> | |||
1 | 10683 | 생활권경계_110 | ZON125 | 지역생활권 | 강동구_암사 | <NA> | <NA> | |||
2 | 10684 | 생활권경계_111 | ZON125 | 지역생활권 | 송파구_석촌 | <NA> | <NA> | |||
3 | 10685 | 생활권경계_112 | ZON125 | 지역생활권 | 송파구_잠실1 | <NA> | <NA> | |||
4 | 10686 | 생활권경계_113 | ZON125 | 지역생활권 | 서초구_방배 | <NA> | <NA> | |||
5 | 10687 | 생활권경계_114 | ZON125 | 지역생활권 | 송파구_가락 | <NA> | <NA> | |||
6 | 10688 | 생활권경계_115 | ZON125 | 지역생활권 | 강남구_역삼논현 | <NA> | <NA> | |||
7 | 10881 | 생활권경계_054 | ZON125 | 지역생활권 | 서대문구_홍제생활권 | <NA> | <NA> | |||
8 | 10882 | 생활권경계_055 | ZON125 | 지역생활권 | 서대문구_가좌생활권 | <NA> | <NA> | |||
9 | 10883 | 생활권경계_057 | ZON125 | 지역생활권 | 은평구_응암생활권 | <NA> | <NA> |
순번 | ID | 도시계획코드 | 분류명 | 조서ID | 고시ID | 라벨명 | 고시일자 | X좌표 | Y좌표 | |
---|---|---|---|---|---|---|---|---|---|---|
111 | 10972 | 생활권경계_036 | ZON125 | 지역생활권 | 성동구_마장용답 | <NA> | <NA> | |||
112 | 10973 | 생활권경계_037 | ZON125 | 지역생활권 | 성동구_왕십리행당 | <NA> | <NA> | |||
113 | 10974 | 생활권경계_038 | ZON125 | 지역생활권 | 성동구_금호옥수 | <NA> | <NA> | |||
114 | 10975 | 생활권경계_001 | ZON121 | 권역생활권 | 도심권 | <NA> | <NA> | |||
115 | 10976 | 생활권경계_019 | ZON125 | 지역생활권 | 노원구_마들 | <NA> | <NA> | |||
116 | 10977 | 생활권경계_021 | ZON125 | 지역생활권 | 노원구_중계 | <NA> | <NA> | |||
117 | 10978 | 생활권경계_022 | ZON125 | 지역생활권 | 노원구_공릉 | <NA> | <NA> | |||
118 | 10979 | 생활권경계_024 | ZON125 | 지역생활권 | 노원구_하계 | <NA> | <NA> | |||
119 | 10980 | 생활권경계_025 | ZON125 | 지역생활권 | 노원구_월계 | <NA> | <NA> | |||
120 | 10981 | 생활권경계_026 | ZON125 | 지역생활권 | 광진구_구의 | <NA> | <NA> |