Dataset statistics
Number of variables | 15 |
---|---|
Number of observations | 10000 |
Missing cells | 5339 |
Missing cells (%) | 3.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.3 MiB |
Average record size in memory | 137.0 B |
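The derived figures in this table can be cross-checked against the raw counts. The sketch below recomputes the missing-cell percentage and the total memory size. It assumes that the percentage is missing cells divided by rows × variables and that the average record size is total memory divided by the number of rows; the per-variable missing counts are the ones listed in the alerts of this report.

```python
# Cross-check of the summary figures, using only numbers quoted in this report.
n_rows = 10_000
n_vars = 15

# Per-variable missing counts as listed in the "Missing" alerts.
missing_by_variable = {
    "개발위치지역코드": 260,
    "개발위치번지": 278,
    "개발위치호": 4801,
}

missing_cells = sum(missing_by_variable.values())
missing_pct = 100 * missing_cells / (n_rows * n_vars)

# Assumption: average record size = total memory / number of rows.
avg_record_bytes = 137.0
total_mib = avg_record_bytes * n_rows / 2**20

print(missing_cells)            # 5339
print(f"{missing_pct:.1f}%")    # 3.6%
print(f"{total_mib:.1f} MiB")   # 1.3 MiB
```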
Variable types
Numeric | 8 |
---|---|
Categorical | 4 |
Text | 3 |
Dataset
Description | 허가신고번호,허가신고양식,팀명,개발위치지역코드,개발위치산,개발위치번지,개발위치호,경도도,경도분,경도초,위도도,위도분,위도초,기준년도,구명 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-22149/S/1/datasetView.do |
구명 is highly overall correlated with 개발위치지역코드 and 1 other fields | High correlation |
팀명 is highly overall correlated with 개발위치지역코드 and 1 other fields | High correlation |
허가신고번호 is highly overall correlated with 허가신고양식 | High correlation |
개발위치지역코드 is highly overall correlated with 팀명 and 1 other fields | High correlation |
경도도 is highly overall correlated with 경도분 and 4 other fields | High correlation |
경도분 is highly overall correlated with 경도도 and 4 other fields | High correlation |
경도초 is highly overall correlated with 경도도 and 4 other fields | High correlation |
위도도 is highly overall correlated with 경도도 and 4 other fields | High correlation |
위도분 is highly overall correlated with 경도도 and 4 other fields | High correlation |
위도초 is highly overall correlated with 경도도 and 4 other fields | High correlation |
허가신고양식 is highly overall correlated with 허가신고번호 | High correlation |
허가신고양식 is highly imbalanced (74.5%) | Imbalance |
개발위치지역코드 has 260 (2.6%) missing values | Missing |
개발위치번지 has 278 (2.8%) missing values | Missing |
개발위치호 has 4801 (48.0%) missing values | Missing |
경도도 has 7042 (70.4%) zeros | Zeros |
경도분 has 7199 (72.0%) zeros | Zeros |
경도초 has 7146 (71.5%) zeros | Zeros |
위도도 has 7043 (70.4%) zeros | Zeros |
위도분 has 7084 (70.8%) zeros | Zeros |
위도초 has 7144 (71.4%) zeros | Zeros |
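Roughly 70% of the six coordinate fields are zero. A longitude and latitude of zero would not fall inside Seoul, so these zeros plausibly mean "not recorded" rather than a real position. The following is a minimal sketch under that assumption; `dms_to_decimal` is a hypothetical helper, not part of the dataset or of ydata-profiling.

```python
from typing import Optional


def dms_to_decimal(degrees: float, minutes: float, seconds: float) -> Optional[float]:
    """Convert degrees/minutes/seconds to decimal degrees.

    Assumption: a triple of all zeros means "not recorded" and maps to None.
    """
    if degrees == 0 and minutes == 0 and seconds == 0:
        return None
    return degrees + minutes / 60 + seconds / 3600


# Values copied from one of the sample rows in this report (index 7238).
longitude = dms_to_decimal(127, 3, 25.0)
latitude = dms_to_decimal(37, 40, 15.0)
print(longitude, latitude)

# A row with all-zero components is treated as missing.
print(dms_to_decimal(0, 0, 0.0))  # None
```

Returning `None` keeps unrecorded positions out of any later averaging or plotting instead of letting them pull results toward (0, 0).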
Reproduction
Analysis started | 2024-05-17 22:07:43.874224 |
---|---|
Analysis finished | 2024-05-17 22:08:14.603401 |
Duration | 30.73 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
허가신고번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 4191 |
---|---|
Distinct (%) | 41.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0766617 × 10⁹ |
Minimum | 1.9010012 × 10⁸ |
---|---|
Maximum | 3.2002 × 10⁹ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.9010012 × 10⁸ |
---|---|
5-th percentile | 1.1901001 × 10⁹ |
Q1 | 2.1901002 × 10⁹ |
median | 2.1901008 × 10⁹ |
Q3 | 2.2 × 10⁹ |
95-th percentile | 2.2008 × 10⁹ |
Maximum | 3.2002 × 10⁹ |
Range | 3.0100999 × 10⁹ |
Interquartile range (IQR) | 9899812 |
Descriptive statistics
Standard deviation | 3.2867212 × 10⁸ |
---|---|
Coefficient of variation (CV) | 0.15826945 |
Kurtosis | 4.5217884 |
Mean | 2.0766617 × 10⁹ |
Median Absolute Deviation (MAD) | 8799200.5 |
Skewness | -2.4172782 |
Sum | 2.0766617 × 10¹³ |
Variance | 1.0802536 × 10¹⁷ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2190100226 | 13 | 0.1% |
2190100102 | 13 | 0.1% |
2201000002 | 12 | 0.1% |
2200600003 | 12 | 0.1% |
2200100001 | 11 | 0.1% |
2200000014 | 11 | 0.1% |
2200300003 | 11 | 0.1% |
2200500006 | 11 | 0.1% |
2200100005 | 11 | 0.1% |
2190100151 | 11 | 0.1% |
Other values (4181) | 9884 | 98.8% |
Value | Count | Frequency (%) |
190100123 | 1 | < 0.1% |
198400002 | 1 | < 0.1% |
198400003 | 1 | < 0.1% |
198400005 | 1 | < 0.1% |
198400007 | 1 | < 0.1% |
198400008 | 1 | < 0.1% |
198400012 | 1 | < 0.1% |
198400020 | 1 | < 0.1% |
198600002 | 1 | < 0.1% |
198900003 | 1 | < 0.1% |
Value | Count | Frequency (%) |
3200200013 | 1 | < 0.1% |
3200200009 | 1 | < 0.1% |
3200200005 | 1 | < 0.1% |
3200200002 | 1 | < 0.1% |
3200200001 | 2 | < 0.1% |
3200000002 | 1 | < 0.1% |
2201800010 | 1 | < 0.1% |
2201800008 | 1 | < 0.1% |
2201800003 | 2 | < 0.1% |
2201800001 | 1 | < 0.1% |
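The descriptive statistics for 허가신고번호 are internally consistent, which can be checked with a few lines of arithmetic. The sketch below uses the rounded figures shown in this report and assumes the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, range = maximum − minimum); the tolerance only absorbs display rounding.

```python
import math

# Figures quoted above for 허가신고번호 (rounded as displayed in the report).
mean = 2.0766617e9
std = 3.2867212e8
minimum = 1.9010012e8
maximum = 3.2002e9

cv = std / mean              # coefficient of variation
variance = std ** 2
value_range = maximum - minimum

# Compare with the reported values, allowing for display rounding.
cv_ok = math.isclose(cv, 0.15826945, rel_tol=1e-6)
variance_ok = math.isclose(variance, 1.0802536e17, rel_tol=1e-6)
range_ok = math.isclose(value_range, 3.0100999e9, rel_tol=1e-6)
print(cv_ok, variance_ok, range_ok)  # True True True
```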
허가신고양식
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
신고시설 | 8678 |
---|---|
허가시설 | 1093 |
경미시설 | 216 |
유출지하수 | 7 |
기타시설 | 5 |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 4.0007 |
Min length | 4 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 신고시설 |
---|---|
2nd row | 신고시설 |
3rd row | 신고시설 |
4th row | 신고시설 |
5th row | 신고시설 |
Common Values
Value | Count | Frequency (%) |
신고시설 | 8678 | 86.8% |
허가시설 | 1093 | 10.9% |
경미시설 | 216 | 2.2% |
유출지하수 | 7 | 0.1% |
기타시설 | 5 | 0.1% |
온천시설 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
신고시설 | 8678 | 86.8% |
허가시설 | 1093 | 10.9% |
경미시설 | 216 | 2.2% |
유출지하수 | 7 | 0.1% |
기타시설 | 5 | < 0.1% |
온천시설 | 1 | < 0.1% |
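The 74.5% imbalance figure reported for 허가신고양식 is consistent with a normalized-entropy score, 1 − H / log(k), where H is the Shannon entropy of the category frequencies and k is the number of categories. The sketch below assumes that definition and recomputes the score from the counts in the Common Values table; `imbalance_score` is an illustrative helper, not the library's own function.

```python
import math

# Category counts from the Common Values table for 허가신고양식.
counts = {
    "신고시설": 8678,
    "허가시설": 1093,
    "경미시설": 216,
    "유출지하수": 7,
    "기타시설": 5,
    "온천시설": 1,
}


def imbalance_score(category_counts):
    """Return 1 - H / log(k): 0 for a uniform split, near 1 when one class dominates."""
    total = sum(category_counts.values())
    k = len(category_counts)
    if k < 2:
        # Choice made for this sketch: a single class is scored as 0.
        return 0.0
    entropy = -sum(
        (c / total) * math.log(c / total)
        for c in category_counts.values()
        if c > 0
    )
    return 1 - entropy / math.log(k)


score = imbalance_score(counts)
print(f"{score:.1%}")  # 74.5%
```

Because the score is a ratio of entropies, the base of the logarithm does not matter as long as the same base is used for H and for log(k).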
팀명
Categorical
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
서울특별시 서초구 | 1120 |
---|---|
서울특별시 강남구 | 924 |
서울특별시 강서구 | 753 |
서울특별시 강동구 | 646 |
서울특별시 노원구 | 607 |
Other values (20) | 5950 |
Length
Max length | 10 |
---|---|
Median length | 9 |
Mean length | 9.0903 |
Min length | 8 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서울특별시 금천구 |
---|---|
2nd row | 서울특별시 노원구 |
3rd row | 서울특별시 노원구 |
4th row | 서울특별시 강서구 |
5th row | 서울특별시 강서구 |
Common Values
Value | Count | Frequency (%) |
서울특별시 서초구 | 1120 | 11.2% |
서울특별시 강남구 | 924 | 9.2% |
서울특별시 강서구 | 753 | 7.5% |
서울특별시 강동구 | 646 | 6.5% |
서울특별시 노원구 | 607 | 6.1% |
서울특별시 송파구 | 587 | 5.9% |
서울특별시 구로구 | 516 | 5.2% |
서울특별시 도봉구 | 457 | 4.6% |
서울특별시 동대문구 | 415 | 4.2% |
서울특별시 영등포구 | 412 | 4.1% |
Other values (15) | 3563 | 35.6% |
Length
Value | Count | Frequency (%) |
서울특별시 | 10000 | 50.0% |
서초구 | 1120 | 5.6% |
강남구 | 924 | 4.6% |
강서구 | 753 | 3.8% |
강동구 | 646 | 3.2% |
노원구 | 607 | 3.0% |
송파구 | 587 | 2.9% |
구로구 | 516 | 2.6% |
도봉구 | 457 | 2.3% |
동대문구 | 415 | 2.1% |
Other values (16) | 3975 | 19.9% |
개발위치지역코드
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 394 |
---|---|
Distinct (%) | 4.0% |
Missing | 260 |
Missing (%) | 2.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1488352 × 10⁹ |
Minimum | 1.1110101 × 10⁹ |
---|---|
Maximum | 1.174011 × 10⁹ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.1110101 × 10⁹ |
---|---|
5-th percentile | 1.1170127 × 10⁹ |
Q1 | 1.1320108 × 10⁹ |
median | 1.1500113 × 10⁹ |
Q3 | 1.1650109 × 10⁹ |
95-th percentile | 1.1740105 × 10⁹ |
Maximum | 1.174011 × 10⁹ |
Range | 63000900 |
Interquartile range (IQR) | 33000100 |
Descriptive statistics
Standard deviation | 18305331 |
---|---|
Coefficient of variation (CV) | 0.015933818 |
Kurtosis | -1.1342068 |
Mean | 1.1488352 × 10⁹ |
Median Absolute Deviation (MAD) | 15001000 |
Skewness | -0.33285225 |
Sum | 1.1189655 × 10¹³ |
Variance | 3.3508513 × 10¹⁴ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1135010500 | 297 | 3.0% |
1130510300 | 236 | 2.4% |
1132010800 | 232 | 2.3% |
1165010300 | 219 | 2.2% |
1165010900 | 218 | 2.2% |
1171010800 | 213 | 2.1% |
1174011000 | 205 | 2.1% |
1168011200 | 191 | 1.9% |
1165010800 | 177 | 1.8% |
1153010200 | 176 | 1.8% |
Other values (384) | 7576 | 75.8% |
(Missing) | 260 | 2.6% |
Value | Count | Frequency (%) |
1111010100 | 4 | < 0.1% |
1111010500 | 2 | < 0.1% |
1111010600 | 3 | < 0.1% |
1111010700 | 2 | < 0.1% |
1111010800 | 7 | 0.1% |
1111011000 | 2 | < 0.1% |
1111011100 | 1 | < 0.1% |
1111011400 | 3 | < 0.1% |
1111011600 | 2 | < 0.1% |
1111011700 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1174011000 | 205 | 2.1% |
1174010900 | 66 | 0.7% |
1174010800 | 41 | 0.4% |
1174010700 | 73 | 0.7% |
1174010600 | 48 | 0.5% |
1174010500 | 66 | 0.7% |
1174010300 | 60 | 0.6% |
1174010200 | 53 | 0.5% |
1174010100 | 34 | 0.3% |
1171011400 | 36 | 0.4% |
개발위치산
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | 5111 |
---|---|
1 | 4701 |
2 | 188 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 2.5333 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 5111 | 51.1% |
1 | 4701 | 47.0% |
2 | 188 | 1.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 5111 | 51.1% |
1 | 4701 | 47.0% |
2 | 188 | 1.9% |
개발위치번지
Text
MISSING
 
Distinct | 3342 |
---|---|
Distinct (%) | 34.4% |
Missing | 278 |
Missing (%) | 2.8% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
1 | 210 | 2.1% |
8 | 48 | 0.5% |
2 | 44 | 0.5% |
12 | 37 | 0.4% |
14 | 34 | 0.3% |
618 | 32 | 0.3% |
6 | 31 | 0.3% |
13 | 30 | 0.3% |
304 | 28 | 0.3% |
9 | 28 | 0.3% |
Other values (3321) | 9251 | 94.7% |
Most occurring characters
Value | Count | Frequency (%) |
1 | 5337 | |
2 | 3791 | |
3 | 3502 | |
4 | 3154 | |
5 | 2824 | |
6 | 2679 | |
7 | 2391 | |
- | 2349 | |
0 | 2238 | |
8 | 2230 | |
Other values (36) | 2585 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 30369 | |
Dash Punctuation | 2349 | 7.1% |
Other Letter | 263 | 0.8% |
Space Separator | 55 | 0.2% |
Other Punctuation | 29 | 0.1% |
Uppercase Letter | 7 | < 0.1% |
Open Punctuation | 4 | < 0.1% |
Close Punctuation | 4 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
가 | 113 | |
산 | 41 | 15.6% |
지 | 33 | 12.5% |
번 | 22 | 8.4% |
럭 | 10 | 3.8% |
블 | 8 | 3.0% |
택 | 6 | 2.3% |
단 | 5 | 1.9% |
공 | 2 | 0.8% |
호 | 2 | 0.8% |
Other values (16) | 21 | 8.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 5337 | |
2 | 3791 | |
3 | 3502 | |
4 | 3154 | |
5 | 2824 | |
6 | 2679 | |
7 | 2391 | |
0 | 2238 | |
8 | 2230 | |
9 | 2223 |
Other Punctuation
Value | Count | Frequency (%) |
, | 11 | |
/ | 11 | |
. | 7 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 3 | |
L | 3 | |
W | 1 | 14.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2349 |
Space Separator
Value | Count | Frequency (%) |
55 |
Open Punctuation
Value | Count | Frequency (%) |
( | 4 |
Close Punctuation
Value | Count | Frequency (%) |
) | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 32810 | |
Hangul | 263 | 0.8% |
Latin | 7 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
가 | 113 | |
산 | 41 | 15.6% |
지 | 33 | 12.5% |
번 | 22 | 8.4% |
럭 | 10 | 3.8% |
블 | 8 | 3.0% |
택 | 6 | 2.3% |
단 | 5 | 1.9% |
공 | 2 | 0.8% |
호 | 2 | 0.8% |
Other values (16) | 21 | 8.0% |
Common
Value | Count | Frequency (%) |
1 | 5337 | |
2 | 3791 | |
3 | 3502 | |
4 | 3154 | |
5 | 2824 | |
6 | 2679 | |
7 | 2391 | |
- | 2349 | |
0 | 2238 | |
8 | 2230 | |
Other values (7) | 2315 |
Latin
Value | Count | Frequency (%) |
B | 3 | |
L | 3 | |
W | 1 | 14.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 32817 | |
Hangul | 263 | 0.8% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 5337 | |
2 | 3791 | |
3 | 3502 | |
4 | 3154 | |
5 | 2824 | |
6 | 2679 | |
7 | 2391 | |
- | 2349 | |
0 | 2238 | |
8 | 2230 | |
Other values (10) | 2322 |
Hangul
Value | Count | Frequency (%) |
가 | 113 | |
산 | 41 | 15.6% |
지 | 33 | 12.5% |
번 | 22 | 8.4% |
럭 | 10 | 3.8% |
블 | 8 | 3.0% |
택 | 6 | 2.3% |
단 | 5 | 1.9% |
공 | 2 | 0.8% |
호 | 2 | 0.8% |
Other values (16) | 21 | 8.0% |
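The value and character tables above suggest that most 개발위치번지 entries are a main lot number, optionally followed by a hyphen and a sub-number, and occasionally prefixed with 산 (as in 산51-10 in the sample rows). Here is a minimal parsing sketch under that assumed pattern. `parse_lot_number` and `LotNumber` are hypothetical names, and entries that do not match (for example those containing other Hangul text, commas, or slashes) come back as `None` rather than being guessed at.

```python
import re
from typing import NamedTuple, Optional


class LotNumber(NamedTuple):
    san_prefix: bool        # True when the entry starts with 산
    main: int
    sub: Optional[int]


# Assumed pattern: optional 산, main number, optional "-sub".
_LOT_PATTERN = re.compile(r"(산)?\s*(\d+)(?:-(\d+))?")


def parse_lot_number(text: str) -> Optional[LotNumber]:
    match = _LOT_PATTERN.fullmatch(text.strip())
    if match is None:
        return None
    prefix, main, sub = match.groups()
    return LotNumber(prefix is not None, int(main), int(sub) if sub else None)


# Values taken from the sample rows of this report.
print(parse_lot_number("966-10"))   # LotNumber(san_prefix=False, main=966, sub=10)
print(parse_lot_number("산51-10"))  # LotNumber(san_prefix=True, main=51, sub=10)
print(parse_lot_number("906"))      # LotNumber(san_prefix=False, main=906, sub=None)
```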
개발위치호
Text
MISSING
 
Distinct | 451 |
---|---|
Distinct (%) | 8.7% |
Missing | 4801 |
Missing (%) | 48.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
1 | 860 | 16.5% |
2 | 473 | 9.1% |
3 | 351 | 6.8% |
4 | 292 | 5.6% |
5 | 255 | 4.9% |
6 | 202 | 3.9% |
7 | 162 | 3.1% |
10 | 145 | 2.8% |
8 | 145 | 2.8% |
9 | 121 | 2.3% |
Other values (398) | 2193 | 42.2% |
Most occurring characters
Value | Count | Frequency (%) |
1 | 2349 | |
2 | 1309 | |
3 | 888 | 10.6% |
4 | 741 | 8.8% |
5 | 609 | 7.2% |
6 | 560 | 6.7% |
7 | 495 | 5.9% |
8 | 399 | 4.7% |
9 | 386 | 4.6% |
0 | 382 | 4.5% |
Other values (4) | 286 | 3.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 8118 | |
Space Separator | 282 | 3.4% |
Other Punctuation | 3 | < 0.1% |
Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 2349 | |
2 | 1309 | |
3 | 888 | 10.9% |
4 | 741 | 9.1% |
5 | 609 | 7.5% |
6 | 560 | 6.9% |
7 | 495 | 6.1% |
8 | 399 | 4.9% |
9 | 386 | 4.8% |
0 | 382 | 4.7% |
Other Punctuation
Value | Count | Frequency (%) |
, | 2 | |
. | 1 |
Space Separator
Value | Count | Frequency (%) |
282 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 8404 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 2349 | |
2 | 1309 | |
3 | 888 | 10.6% |
4 | 741 | 8.8% |
5 | 609 | 7.2% |
6 | 560 | 6.7% |
7 | 495 | 5.9% |
8 | 399 | 4.7% |
9 | 386 | 4.6% |
0 | 382 | 4.5% |
Other values (4) | 286 | 3.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8404 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 2349 | |
2 | 1309 | |
3 | 888 | 10.6% |
4 | 741 | 8.8% |
5 | 609 | 7.2% |
6 | 560 | 6.7% |
7 | 495 | 5.9% |
8 | 399 | 4.7% |
9 | 386 | 4.6% |
0 | 382 | 4.5% |
Other values (4) | 286 | 3.4% |
경도도
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 10 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.425 |
Minimum | 0 |
---|---|
Maximum | 204 |
Zeros | 7042 |
Zeros (%) | 70.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 126 |
95-th percentile | 127 |
Maximum | 204 |
Range | 204 |
Interquartile range (IQR) | 126 |
Descriptive statistics
Standard deviation | 57.776959 |
---|---|
Coefficient of variation (CV) | 1.5438065 |
Kurtosis | -1.1927284 |
Mean | 37.425 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.89717637 |
Sum | 374250 |
Variance | 3338.177 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 7042 | 70.4% |
127 | 1739 | 17.4% |
126 | 1198 | 12.0% |
125 | 9 | 0.1% |
128 | 7 | 0.1% |
16 | 1 | < 0.1% |
37 | 1 | < 0.1% |
129 | 1 | < 0.1% |
42 | 1 | < 0.1% |
204 | 1 | < 0.1% |
Value | Count | Frequency (%) |
0 | 7042 | 70.4% |
16 | 1 | < 0.1% |
37 | 1 | < 0.1% |
42 | 1 | < 0.1% |
125 | 9 | 0.1% |
126 | 1198 | 12.0% |
127 | 1739 | 17.4% |
128 | 7 | 0.1% |
129 | 1 | < 0.1% |
204 | 1 | < 0.1% |
Value | Count | Frequency (%) |
204 | 1 | < 0.1% |
129 | 1 | < 0.1% |
128 | 7 | 0.1% |
127 | 1739 | 17.4% |
126 | 1198 | 12.0% |
125 | 9 | 0.1% |
42 | 1 | < 0.1% |
37 | 1 | < 0.1% |
16 | 1 | < 0.1% |
0 | 7042 | 70.4% |
경도분
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 58 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.3114 |
Minimum | 0 |
---|---|
Maximum | 64 |
Zeros | 7199 |
Zeros (%) | 72.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 2 |
95-th percentile | 55 |
Maximum | 64 |
Range | 64 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 17.385156 |
---|---|
Coefficient of variation (CV) | 2.3778149 |
Kurtosis | 3.2079007 |
Mean | 7.3114 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.2462993 |
Sum | 73114 |
Variance | 302.24365 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 7199 | 72.0% |
3 | 318 | 3.2% |
1 | 227 | 2.3% |
4 | 199 | 2.0% |
2 | 198 | 2.0% |
50 | 138 | 1.4% |
55 | 129 | 1.3% |
6 | 128 | 1.3% |
5 | 119 | 1.2% |
49 | 113 | 1.1% |
Other values (48) | 1232 | 12.3% |
Value | Count | Frequency (%) |
0 | 7199 | 72.0% |
1 | 227 | 2.3% |
2 | 198 | 2.0% |
3 | 318 | 3.2% |
4 | 199 | 2.0% |
5 | 119 | 1.2% |
6 | 128 | 1.3% |
7 | 103 | 1.0% |
8 | 61 | 0.6% |
9 | 73 | 0.7% |
Value | Count | Frequency (%) |
64 | 1 | < 0.1% |
60 | 4 | < 0.1% |
59 | 84 | 0.8% |
58 | 101 | 1.0% |
57 | 98 | 1.0% |
56 | 96 | 1.0% |
55 | 129 | 1.3% |
54 | 106 | 1.1% |
53 | 91 | 0.9% |
52 | 51 | 0.5% |
경도초
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 821 |
---|---|
Distinct (%) | 8.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8.3126581 |
Minimum | 0 |
---|---|
Maximum | 99.6 |
Zeros | 7146 |
Zeros (%) | 71.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 8 |
95-th percentile | 49 |
Maximum | 99.6 |
Range | 99.6 |
Interquartile range (IQR) | 8 |
Descriptive statistics
Standard deviation | 16.019996 |
---|---|
Coefficient of variation (CV) | 1.9271809 |
Kurtosis | 2.2760046 |
Mean | 8.3126581 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.8581633 |
Sum | 83126.581 |
Variance | 256.64028 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 7146 | 71.5% |
30.0 | 73 | 0.7% |
10.0 | 73 | 0.7% |
20.0 | 58 | 0.6% |
9.0 | 54 | 0.5% |
15.0 | 48 | 0.5% |
57.0 | 45 | 0.4% |
8.0 | 44 | 0.4% |
21.0 | 41 | 0.4% |
31.0 | 40 | 0.4% |
Other values (811) | 2378 | 23.8% |
Value | Count | Frequency (%) |
0.0 | 7146 | 71.5% |
0.38 | 1 | < 0.1% |
0.4 | 1 | < 0.1% |
0.45 | 1 | < 0.1% |
0.47 | 1 | < 0.1% |
0.5 | 1 | < 0.1% |
0.53 | 1 | < 0.1% |
0.61 | 1 | < 0.1% |
0.64 | 1 | < 0.1% |
0.65 | 1 | < 0.1% |
Value | Count | Frequency (%) |
99.6 | 1 | < 0.1% |
99.0 | 1 | < 0.1% |
97.84 | 1 | < 0.1% |
93.0 | 1 | < 0.1% |
69.0 | 1 | < 0.1% |
61.0 | 1 | < 0.1% |
59.88 | 1 | < 0.1% |
59.8 | 1 | < 0.1% |
59.65 | 1 | < 0.1% |
59.58 | 1 | < 0.1% |
위도도
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 9 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.9348 |
Minimum | 0 |
---|---|
Maximum | 44 |
Zeros | 7043 |
Zeros (%) | 70.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 37 |
95-th percentile | 37 |
Maximum | 44 |
Range | 44 |
Interquartile range (IQR) | 37 |
Descriptive statistics
Standard deviation | 16.879113 |
---|---|
Coefficient of variation (CV) | 1.5436142 |
Kurtosis | -1.1967369 |
Mean | 10.9348 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.89612556 |
Sum | 109348 |
Variance | 284.90444 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 7043 | 70.4% |
37 | 2925 | |
38 | 9 | 0.1% |
36 | 6 | 0.1% |
34 | 6 | 0.1% |
33 | 5 | 0.1% |
35 | 4 | < 0.1% |
12 | 1 | < 0.1% |
44 | 1 | < 0.1% |
Value | Count | Frequency (%) |
0 | 7043 | 70.4% |
12 | 1 | < 0.1% |
33 | 5 | 0.1% |
34 | 6 | 0.1% |
35 | 4 | < 0.1% |
36 | 6 | 0.1% |
37 | 2925 | |
38 | 9 | 0.1% |
44 | 1 | < 0.1% |
Value | Count | Frequency (%) |
44 | 1 | < 0.1% |
38 | 9 | 0.1% |
37 | 2925 | |
36 | 6 | 0.1% |
35 | 4 | < 0.1% |
34 | 6 | 0.1% |
33 | 5 | 0.1% |
12 | 1 | < 0.1% |
0 | 7043 | 70.4% |
위도분
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 55 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9.2767 |
Minimum | 0 |
---|---|
Maximum | 72 |
Zeros | 7084 |
Zeros (%) | 70.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 27 |
95-th percentile | 36 |
Maximum | 72 |
Range | 72 |
Interquartile range (IQR) | 27 |
Descriptive statistics
Standard deviation | 14.726677 |
---|---|
Coefficient of variation (CV) | 1.587491 |
Kurtosis | -0.71215687 |
Mean | 9.2767 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.0372748 |
Sum | 92767 |
Variance | 216.87502 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 7084 | 70.8% |
33 | 340 | 3.4% |
32 | 278 | 2.8% |
27 | 273 | 2.7% |
34 | 253 | 2.5% |
30 | 250 | 2.5% |
28 | 247 | 2.5% |
31 | 243 | 2.4% |
29 | 223 | 2.2% |
35 | 129 | 1.3% |
Other values (45) | 680 | 6.8% |
Value | Count | Frequency (%) |
0 | 7084 | 70.8% |
2 | 2 | < 0.1% |
6 | 1 | < 0.1% |
8 | 6 | 0.1% |
9 | 3 | < 0.1% |
10 | 2 | < 0.1% |
11 | 6 | 0.1% |
12 | 5 | 0.1% |
13 | 1 | < 0.1% |
14 | 4 | < 0.1% |
Value | Count | Frequency (%) |
72 | 1 | < 0.1% |
60 | 4 | < 0.1% |
59 | 2 | < 0.1% |
58 | 2 | < 0.1% |
57 | 3 | < 0.1% |
56 | 1 | < 0.1% |
55 | 1 | < 0.1% |
53 | 2 | < 0.1% |
52 | 8 | 0.1% |
50 | 3 | < 0.1% |
위도초
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 805 |
---|---|
Distinct (%) | 8.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8.3781339 |
Minimum | 0 |
---|---|
Maximum | 81 |
Zeros | 7144 |
Zeros (%) | 71.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 8 |
95-th percentile | 49 |
Maximum | 81 |
Range | 81 |
Interquartile range (IQR) | 8 |
Descriptive statistics
Standard deviation | 16.12269 |
---|---|
Coefficient of variation (CV) | 1.9243772 |
Kurtosis | 2.0504211 |
Mean | 8.3781339 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.8328855 |
Sum | 83781.339 |
Variance | 259.94114 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 7144 | 71.4% |
30.0 | 73 | 0.7% |
15.0 | 70 | 0.7% |
10.0 | 64 | 0.6% |
20.0 | 63 | 0.6% |
50.0 | 50 | 0.5% |
8.0 | 45 | 0.4% |
25.0 | 44 | 0.4% |
40.0 | 43 | 0.4% |
5.0 | 42 | 0.4% |
Other values (795) | 2362 | 23.6% |
Value | Count | Frequency (%) |
0.0 | 7144 | 71.4% |
0.02 | 1 | < 0.1% |
0.19 | 1 | < 0.1% |
0.38 | 1 | < 0.1% |
0.4 | 1 | < 0.1% |
0.47 | 1 | < 0.1% |
0.5 | 1 | < 0.1% |
0.52 | 1 | < 0.1% |
0.6 | 2 | < 0.1% |
0.64 | 1 | < 0.1% |
Value | Count | Frequency (%) |
81.0 | 1 | < 0.1% |
79.0 | 1 | < 0.1% |
71.0 | 1 | < 0.1% |
63.8 | 1 | < 0.1% |
62.13 | 1 | < 0.1% |
61.72 | 1 | < 0.1% |
60.48 | 1 | < 0.1% |
59.95 | 1 | < 0.1% |
59.94 | 1 | < 0.1% |
59.93 | 1 | < 0.1% |
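Several of the maxima reported for the coordinate fields fall outside what a degrees/minutes/seconds encoding allows: minutes and seconds should stay below 60, and degrees should not exceed 180 for longitude or 90 for latitude. The sketch below applies those generic range checks; `dms_problems` is a hypothetical helper, and the column maxima it is run on need not come from the same row.

```python
def dms_problems(degrees, minutes, seconds, max_degrees):
    """List range violations for one degrees/minutes/seconds triple."""
    problems = []
    if not 0 <= degrees <= max_degrees:
        problems.append(f"degrees {degrees} outside 0..{max_degrees}")
    if not 0 <= minutes < 60:
        problems.append(f"minutes {minutes} outside [0, 60)")
    if not 0 <= seconds < 60:
        problems.append(f"seconds {seconds} outside [0, 60)")
    return problems


# Maxima reported above for the longitude fields (경도도, 경도분, 경도초).
print(dms_problems(204, 64, 99.6, max_degrees=180))
# Maxima reported above for the latitude fields (위도도, 위도분, 위도초).
print(dms_problems(44, 72, 81, max_degrees=90))
# A triple taken from a sample row of this report passes.
print(dms_problems(127, 3, 25.0, max_degrees=180))  # []
```

Rows flagged this way are candidates for manual review; the check says nothing about whether an in-range value is actually located in Seoul.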
기준년도
Text
Distinct | 53 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
2017 | 5651 | 56.5% |
2018 | 3870 | 38.7% |
2005 | 48 | 0.5% |
2011 | 43 | 0.4% |
84-0 | 37 | 0.4% |
85-0 | 35 | 0.4% |
2003 | 35 | 0.4% |
86-0 | 32 | 0.3% |
87-0 | 27 | 0.3% |
89-0 | 24 | 0.2% |
Other values (43) | 198 | 2.0% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 10049 | |
2 | 9695 | |
1 | 9686 | |
7 | 5692 | |
8 | 4161 | |
- | 330 | 0.8% |
5 | 105 | 0.3% |
9 | 103 | 0.3% |
3 | 63 | 0.2% |
6 | 59 | 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 39670 | |
Dash Punctuation | 330 | 0.8% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 10049 | |
2 | 9695 | |
1 | 9686 | |
7 | 5692 | |
8 | 4161 | |
5 | 105 | 0.3% |
9 | 103 | 0.3% |
3 | 63 | 0.2% |
6 | 59 | 0.1% |
4 | 57 | 0.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 330 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 40000 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 10049 | |
2 | 9695 | |
1 | 9686 | |
7 | 5692 | |
8 | 4161 | |
- | 330 | 0.8% |
5 | 105 | 0.3% |
9 | 103 | 0.3% |
3 | 63 | 0.2% |
6 | 59 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 40000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 10049 | |
2 | 9695 | |
1 | 9686 | |
7 | 5692 | |
8 | 4161 | |
- | 330 | 0.8% |
5 | 105 | 0.3% |
9 | 103 | 0.3% |
3 | 63 | 0.2% |
6 | 59 | 0.1% |
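The 기준년도 field mixes four-digit years such as 2017 with hyphenated tokens such as 84-0, which is presumably why it was profiled as text rather than as a number. The sketch below only separates the two formats and tallies the top values listed above. `year_format` is a hypothetical helper, and no meaning is assigned to the hyphenated tokens because the report does not explain them.

```python
import re
from collections import Counter

FOUR_DIGIT_YEAR = re.compile(r"\d{4}")


def year_format(value: str) -> str:
    """Label a 기준년도 entry as 'four_digit_year' or 'other'."""
    return "four_digit_year" if FOUR_DIGIT_YEAR.fullmatch(value) else "other"


# Top values and counts from the value table for 기준년도.
top_values = {
    "2017": 5651, "2018": 3870, "2005": 48, "2011": 43, "84-0": 37,
    "85-0": 35, "2003": 35, "86-0": 32, "87-0": 27, "89-0": 24,
}

tally = Counter()
for value, count in top_values.items():
    tally[year_format(value)] += count

print(dict(tally))  # {'four_digit_year': 9647, 'other': 155}
```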
구명
Categorical
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
서초구 | 1120 |
---|---|
강남구 | 924 |
강서구 | 753 |
강동구 | 646 |
노원구 | 607 |
Other values (20) | 5950 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0903 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 금천구 |
---|---|
2nd row | 노원구 |
3rd row | 노원구 |
4th row | 강서구 |
5th row | 강서구 |
Common Values
Value | Count | Frequency (%) |
서초구 | 1120 | 11.2% |
강남구 | 924 | 9.2% |
강서구 | 753 | 7.5% |
강동구 | 646 | 6.5% |
노원구 | 607 | 6.1% |
송파구 | 587 | 5.9% |
구로구 | 516 | 5.2% |
도봉구 | 457 | 4.6% |
동대문구 | 415 | 4.2% |
영등포구 | 412 | 4.1% |
Other values (15) | 3563 | 35.6% |
Length
Value | Count | Frequency (%) |
서초구 | 1120 | 11.2% |
강남구 | 924 | 9.2% |
강서구 | 753 | 7.5% |
강동구 | 646 | 6.5% |
노원구 | 607 | 6.1% |
송파구 | 587 | 5.9% |
구로구 | 516 | 5.2% |
도봉구 | 457 | 4.6% |
동대문구 | 415 | 4.2% |
영등포구 | 412 | 4.1% |
Other values (15) | 3563 | 35.6% |
Variable | 허가신고번호 | 허가신고양식 | 팀명 | 개발위치지역코드 | 개발위치산 | 경도도 | 경도분 | 경도초 | 위도도 | 위도분 | 위도초 | 기준년도 | 구명 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
허가신고번호 | 1.000 | 0.907 | 0.313 | 0.227 | 0.037 | 0.073 | 0.156 | 0.103 | 0.071 | 0.143 | 0.090 | 0.806 | 0.313 |
허가신고양식 | 0.907 | 1.000 | 0.446 | 0.342 | 0.048 | 0.123 | 0.158 | 0.121 | 0.124 | 0.306 | 0.147 | 0.763 | 0.446 |
팀명 | 0.313 | 0.446 | 1.000 | 1.000 | 0.334 | 0.216 | 0.645 | 0.170 | 0.221 | 0.537 | 0.207 | 0.779 | 1.000 |
개발위치지역코드 | 0.227 | 0.342 | 1.000 | 1.000 | 0.219 | 0.163 | 0.632 | 0.121 | 0.165 | 0.541 | 0.167 | 0.667 | 1.000 |
개발위치산 | 0.037 | 0.048 | 0.334 | 0.219 | 1.000 | 0.000 | 0.042 | 0.011 | 0.000 | 0.132 | 0.019 | 0.000 | 0.334 |
경도도 | 0.073 | 0.123 | 0.216 | 0.163 | 0.000 | 1.000 | 0.696 | 0.610 | 0.991 | 0.840 | 0.786 | 0.000 | 0.216 |
경도분 | 0.156 | 0.158 | 0.645 | 0.632 | 0.042 | 0.696 | 1.000 | 0.480 | 0.727 | 0.735 | 0.590 | 0.251 | 0.645 |
경도초 | 0.103 | 0.121 | 0.170 | 0.121 | 0.011 | 0.610 | 0.480 | 1.000 | 0.610 | 0.621 | 0.602 | 0.119 | 0.170 |
위도도 | 0.071 | 0.124 | 0.221 | 0.165 | 0.000 | 0.991 | 0.727 | 0.610 | 1.000 | 0.865 | 0.788 | 0.171 | 0.221 |
위도분 | 0.143 | 0.306 | 0.537 | 0.541 | 0.132 | 0.840 | 0.735 | 0.621 | 0.865 | 1.000 | 0.737 | 0.191 | 0.537 |
위도초 | 0.090 | 0.147 | 0.207 | 0.167 | 0.019 | 0.786 | 0.590 | 0.602 | 0.788 | 0.737 | 1.000 | 0.000 | 0.207 |
기준년도 | 0.806 | 0.763 | 0.779 | 0.667 | 0.000 | 0.000 | 0.251 | 0.119 | 0.171 | 0.191 | 0.000 | 1.000 | 0.779 |
구명 | 0.313 | 0.446 | 1.000 | 1.000 | 0.334 | 0.216 | 0.645 | 0.170 | 0.221 | 0.537 | 0.207 | 0.779 | 1.000 |
Variable | 구명 | 개발위치산 | 허가신고양식 | 팀명 |
---|---|---|---|---|
구명 | 1.000 | 0.288 | 0.218 | 1.000 |
개발위치산 | 0.288 | 1.000 | 0.034 | 0.288 |
허가신고양식 | 0.218 | 0.034 | 1.000 | 0.218 |
팀명 | 1.000 | 0.288 | 0.218 | 1.000 |
Variable | 허가신고번호 | 개발위치지역코드 | 경도도 | 경도분 | 경도초 | 위도도 | 위도분 | 위도초 | 허가신고양식 | 팀명 | 개발위치산 | 구명 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
허가신고번호 | 1.000 | 0.037 | 0.415 | 0.371 | 0.400 | 0.406 | 0.397 | 0.396 | 0.791 | 0.173 | 0.025 | 0.173 |
개발위치지역코드 | 0.037 | 1.000 | 0.020 | 0.005 | 0.004 | 0.006 | -0.063 | 0.005 | 0.187 | 0.999 | 0.168 | 0.999 |
경도도 | 0.415 | 0.020 | 1.000 | 0.899 | 0.939 | 0.984 | 0.956 | 0.941 | 0.083 | 0.094 | 0.000 | 0.094 |
경도분 | 0.371 | 0.005 | 0.899 | 1.000 | 0.922 | 0.944 | 0.934 | 0.924 | 0.084 | 0.287 | 0.032 | 0.287 |
경도초 | 0.400 | 0.004 | 0.939 | 0.922 | 1.000 | 0.955 | 0.945 | 0.950 | 0.067 | 0.068 | 0.008 | 0.068 |
위도도 | 0.406 | 0.006 | 0.984 | 0.944 | 0.955 | 1.000 | 0.970 | 0.957 | 0.084 | 0.097 | 0.000 | 0.097 |
위도분 | 0.397 | -0.063 | 0.956 | 0.934 | 0.945 | 0.970 | 1.000 | 0.943 | 0.166 | 0.220 | 0.131 | 0.220 |
위도초 | 0.396 | 0.005 | 0.941 | 0.924 | 0.950 | 0.957 | 0.943 | 1.000 | 0.078 | 0.074 | 0.014 | 0.074 |
허가신고양식 | 0.791 | 0.187 | 0.083 | 0.084 | 0.067 | 0.084 | 0.166 | 0.078 | 1.000 | 0.218 | 0.034 | 0.218 |
팀명 | 0.173 | 0.999 | 0.094 | 0.287 | 0.068 | 0.097 | 0.220 | 0.074 | 0.218 | 1.000 | 0.288 | 1.000 |
개발위치산 | 0.025 | 0.168 | 0.000 | 0.032 | 0.008 | 0.000 | 0.131 | 0.014 | 0.034 | 0.288 | 1.000 | 0.288 |
구명 | 0.173 | 0.999 | 0.094 | 0.287 | 0.068 | 0.097 | 0.220 | 0.074 | 0.218 | 1.000 | 0.288 | 1.000 |
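The "High correlation" alerts near the top of this report can be reproduced from the matrix above by keeping every pair whose coefficient reaches a cut-off. The sketch below uses an excerpt of that matrix and a cut-off of 0.5. The cut-off is an assumption (the report does not state one), chosen because it yields the same partner counts as the alerts for these variables.

```python
# Excerpt of the correlation matrix above (each pair listed once).
pairs = {
    ("허가신고번호", "개발위치지역코드"): 0.037,
    ("허가신고번호", "허가신고양식"): 0.791,
    ("허가신고번호", "팀명"): 0.173,
    ("허가신고번호", "구명"): 0.173,
    ("개발위치지역코드", "허가신고양식"): 0.187,
    ("개발위치지역코드", "팀명"): 0.999,
    ("개발위치지역코드", "구명"): 0.999,
    ("허가신고양식", "팀명"): 0.218,
    ("허가신고양식", "구명"): 0.218,
    ("팀명", "구명"): 1.000,
}

THRESHOLD = 0.5  # assumption, not stated in the report


def high_correlations(pair_values, threshold):
    """Map each variable to the sorted list of partners at or above threshold."""
    result = {}
    for (a, b), value in pair_values.items():
        if abs(value) >= threshold:
            result.setdefault(a, []).append(b)
            result.setdefault(b, []).append(a)
    return {name: sorted(partners) for name, partners in result.items()}


flagged = high_correlations(pairs, THRESHOLD)
for name, partners in flagged.items():
    print(name, "->", ", ".join(partners))
```

With this excerpt, 팀명, 구명, and 개발위치지역코드 each end up with two partners and 허가신고번호 pairs only with 허가신고양식, matching the wording of the alerts.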
Index | 허가신고번호 | 허가신고양식 | 팀명 | 개발위치지역코드 | 개발위치산 | 개발위치번지 | 개발위치호 | 경도도 | 경도분 | 경도초 | 위도도 | 위도분 | 위도초 | 기준년도 | 구명 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
23693 | 2190100321 | 신고시설 | 서울특별시 금천구 | 1154510200 | <NA> | 906 | 12 | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 2018 | 금천구 |
7238 | 2190100118 | 신고시설 | 서울특별시 노원구 | 1135010500 | <NA> | 966-10 | <NA> | 127 | 3 | 25.0 | 37 | 40 | 15.0 | 2017 | 노원구 |
22396 | 2190100128 | 신고시설 | 서울특별시 노원구 | 1135010500 | <NA> | 1205-430 | <NA> | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 2017 | 노원구 |
20419 | 2199400597 | 신고시설 | 서울특별시 강서구 | 1150010300 | <NA> | 908-20 | <NA> | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 2018 | 강서구 |
13018 | 2190100031 | 신고시설 | 서울특별시 강서구 | 1150010700 | <NA> | 산51-10 | <NA> | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 2018 | 강서구 |
15868 | 2190100879 | 신고시설 | 서울특별시 강동구 | 1174010900 | 1 | 334 | 2 | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 2017 | 강동구 |
8985 | 2190100409 | 신고시설 | 서울특별시 송파구 | 1171011100 | <NA> | 126-4 | <NA> | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 2017 | 송파구 |
15864 | 2190100263 | 신고시설 | 서울특별시 양천구 | <NA> | <NA> | <NA> | <NA> | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 91-0 | 양천구 |
14638 | 2190100839 | 신고시설 | 서울특별시 강동구 | 1174010500 | 2 | 49 | <NA> | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 2017 | 강동구 |
4187 | 1190100128 | 신고시설 | 서울특별시 강남구 | 1168010500 | <NA> | 128-17 | <NA> | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 2017 | 강남구 |
Index | 허가신고번호 | 허가신고양식 | 팀명 | 개발위치지역코드 | 개발위치산 | 개발위치번지 | 개발위치호 | 경도도 | 경도분 | 경도초 | 위도도 | 위도분 | 위도초 | 기준년도 | 구명 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
27504 | 2200400001 | 신고시설 | 서울특별시 동작구 | 1159010800 | 1 | 343-1 | <NA> | 126 | 30 | 41.0 | 37 | 30 | 32.0 | 2017 | 동작구 |
6126 | 2190101647 | 신고시설 | 서울특별시 강남구 | 1168010600 | <NA> | 997-4 | <NA> | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 2017 | 강남구 |
23618 | 2190100171 | 신고시설 | 서울특별시 양천구 | <NA> | <NA> | <NA> | <NA> | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 84-1 | 양천구 |
3990 | 2200900003 | 신고시설 | 서울특별시 강동구 | 1174010700 | 1 | 105 | <NA> | 127 | 15 | 28.0 | 37 | 27 | 42.0 | 2017 | 강동구 |
23385 | 2200900004 | 신고시설 | 서울특별시 금천구 | 1154510200 | 1 | 900 | 4 | 126 | 54 | 26.93 | 37 | 28 | 48.81 | 2018 | 금천구 |
18011 | 2200900007 | 신고시설 | 서울특별시 강서구 | 1150010700 | 1 | 280 | 1 | 126 | 49 | 20.1 | 37 | 32 | 43.0 | 2018 | 강서구 |
7964 | 2190100298 | 신고시설 | 서울특별시 마포구 | 1144012400 | 1 | 480 | 22 | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 2017 | 마포구 |
6449 | 2200100149 | 경미시설 | 서울특별시 서초구 | 1165010300 | 1 | 576 | 4 | 127 | 1 | 32.0 | 37 | 27 | 22.0 | 2017 | 서초구 |
26737 | 2200300003 | 신고시설 | 서울특별시 노원구 | 1135010500 | 1 | 1205 | <NA> | 127 | 3 | 2.0 | 37 | 41 | 1.0 | 2017 | 노원구 |
7563 | 2200000338 | 신고시설 | 서울특별시 도봉구 | 1132010600 | <NA> | 412-4 | <NA> | 0 | 0 | 0.0 | 0 | 0 | 0.0 | 2017 | 도봉구 |