Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 10000 |
Missing cells | 7285 |
Missing cells (%) | 6.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.1 MiB |
Average record size in memory | 114.0 B |
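The percentages in this table follow directly from the counts. The sketch below recomputes two of them from the reported figures; the function name is illustrative and not part of ydata-profiling.

```python
def missing_cell_share(missing_cells: int, n_rows: int, n_columns: int) -> float:
    """Return missing cells as a percentage of all cells (rows x columns)."""
    return 100.0 * missing_cells / (n_rows * n_columns)


# Figures copied from the "Dataset statistics" table.
share = missing_cell_share(missing_cells=7285, n_rows=10000, n_columns=12)
print(f"{share:.1f}%")  # 6.1%

# Average record size times the row count should round to the reported total.
total_mib = 114.0 * 10000 / 2**20
print(f"{total_mib:.1f} MiB")  # 1.1 MiB
```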
Variable types
Text | 2 |
---|---|
Numeric | 10 |
Dataset
Description | 관리_주택대장_PK,승인번호_년,승인번호_기관_코드,승인번호_구분_코드,승인번호_일련번호,승인_일,대지_면적,건폐_율,연면적,용적_율,기타_용도,작업_일자 |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15404/S/1/datasetView.do |
Alerts
승인번호_년 is highly overall correlated with 승인_일 and 1 other field | High correlation |
승인_일 is highly overall correlated with 승인번호_년 and 1 other field | High correlation |
대지_면적 is highly overall correlated with 건폐_율 and 2 other fields | High correlation |
건폐_율 is highly overall correlated with 대지_면적 and 2 other fields | High correlation |
연면적 is highly overall correlated with 대지_면적 and 2 other fields | High correlation |
용적_율 is highly overall correlated with 대지_면적 and 2 other fields | High correlation |
작업_일자 is highly overall correlated with 승인번호_년 and 1 other field | High correlation |
기타_용도 has 7285 (72.9%) missing values | Missing |
승인번호_년 is highly skewed (γ1 = -83.65294151) | Skewed |
건폐_율 is highly skewed (γ1 = 94.37027062) | Skewed |
연면적 is highly skewed (γ1 = 46.11278155) | Skewed |
용적_율 is highly skewed (γ1 = 99.9999391) | Skewed |
관리_주택대장_PK has unique values | Unique |
대지_면적 has 9379 (93.8%) zeros | Zeros |
건폐_율 has 9380 (93.8%) zeros | Zeros |
연면적 has 8212 (82.1%) zeros | Zeros |
용적_율 has 9398 (94.0%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-11 05:39:57.001763 |
---|---|
Analysis finished | 2024-05-11 05:40:21.846635 |
Duration | 24.84 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
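The reported duration can be checked against the two timestamps. This is a minimal sketch using only the standard library; the format string is an assumption based on how the timestamps are printed here.

```python
from datetime import datetime

# Assumed layout of the timestamps shown in the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-05-11 05:39:57.001763", FMT)
finished = datetime.strptime("2024-05-11 05:40:21.846635", FMT)

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")  # 24.84 seconds
```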
관리_주택대장_PK
Text
UNIQUE
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 16.2167 |
Min length | 7 |
Characters and Unicode
Total characters | 162167 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 10000 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11650-100028992 |
---|---|
2nd row | 11545-1000000000000000125803 |
3rd row | 11470-100023711 |
4th row | 11710-100037789 |
5th row | 11290-100025183 |
Value | Count | Frequency (%) |
11650-100028992 | 1 | < 0.1% |
11200-100029794 | 1 | < 0.1% |
11470-2284 | 1 | < 0.1% |
11650-100030637 | 1 | < 0.1% |
11680-1000000000000000083055 | 1 | < 0.1% |
11470-100027115 | 1 | < 0.1% |
11500-1000000000000000120058 | 1 | < 0.1% |
11680-100046865 | 1 | < 0.1% |
11350-100041831 | 1 | < 0.1% |
11710-9617 | 1 | < 0.1% |
Other values (9990) | 9990 |
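Every sampled key has the shape "five digits, dash, serial", and the serial part runs up to 22 digits, which is too long for a 64-bit integer. The sketch below splits a key while keeping the serial as text. `LedgerKey` and `parse_ledger_key` are hypothetical names, and the idea that the prefix is a district code is an assumption the report does not confirm.

```python
from typing import NamedTuple


class LedgerKey(NamedTuple):
    prefix: str  # the five digits before the dash (meaning assumed, not documented here)
    serial: str  # digits after the dash, kept as text because they can exceed int64


def parse_ledger_key(raw: str) -> LedgerKey:
    """Split a 관리_주택대장_PK value into its two digit groups."""
    prefix, dash, serial = raw.partition("-")
    digits_only = prefix.isascii() and prefix.isdigit() and serial.isascii() and serial.isdigit()
    if dash != "-" or not digits_only:
        raise ValueError(f"unexpected key format: {raw!r}")
    return LedgerKey(prefix, serial)


key = parse_ledger_key("11545-1000000000000000125803")
print(key.prefix, len(key.serial))  # 11545 22
```

Keeping the column as text, as the profiler already does, avoids silent precision loss on the long serials.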
Most occurring characters
Value | Count | Frequency (%) |
0 | 60071 | 37.0% |
1 | 39024 | 24.1% |
- | 10000 | 6.2% |
7 | 8219 | 5.1% |
2 | 7539 | 4.6% |
5 | 7496 | 4.6% |
4 | 7279 | 4.5% |
6 | 6803 | 4.2% |
3 | 6352 | 3.9% |
8 | 5417 | 3.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 152167 | 93.8% |
Dash Punctuation | 10000 | 6.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 60071 | 39.5% |
1 | 39024 | 25.6% |
7 | 8219 | 5.4% |
2 | 7539 | 5.0% |
5 | 7496 | 4.9% |
4 | 7279 | 4.8% |
6 | 6803 | 4.5% |
3 | 6352 | 4.2% |
8 | 5417 | 3.6% |
9 | 3967 | 2.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 162167 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 60071 | 37.0% |
1 | 39024 | 24.1% |
- | 10000 | 6.2% |
7 | 8219 | 5.1% |
2 | 7539 | 4.6% |
5 | 7496 | 4.6% |
4 | 7279 | 4.5% |
6 | 6803 | 4.2% |
3 | 6352 | 3.9% |
8 | 5417 | 3.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 162167 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 60071 | 37.0% |
1 | 39024 | 24.1% |
- | 10000 | 6.2% |
7 | 8219 | 5.1% |
2 | 7539 | 4.6% |
5 | 7496 | 4.6% |
4 | 7279 | 4.5% |
6 | 6803 | 4.2% |
3 | 6352 | 3.9% |
8 | 5417 | 3.3% |
승인번호_년
Real number (ℝ)
HIGH CORRELATION, SKEWED
Distinct | 29 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2016.0063 |
Minimum | 199 |
---|---|
Maximum | 2024 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 199 |
---|---|
5-th percentile | 2003 |
Q1 | 2012 |
median | 2019 |
Q3 | 2021 |
95-th percentile | 2023 |
Maximum | 2024 |
Range | 1825 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 19.288096 |
---|---|
Coefficient of variation (CV) | 0.0095674782 |
Kurtosis | 7877.8663 |
Mean | 2016.0063 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -83.652942 |
Sum | 20160063 |
Variance | 372.03066 |
Monotonicity | Not monotonic |
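Several of the figures above are derived from the others, which makes the table easy to sanity-check. The sketch below recomputes them from the reported values; small differences in the last digits are expected because the inputs are already rounded.

```python
# Figures copied from the 승인번호_년 tables above.
minimum, maximum = 199, 2024
q1, q3 = 2012, 2021
mean, std_dev, n_rows = 2016.0063, 19.288096, 10000

value_range = maximum - minimum  # 1825
iqr = q3 - q1                    # 9
cv = std_dev / mean              # coefficient of variation
variance = std_dev ** 2          # variance is the squared standard deviation
total = mean * n_rows            # sum of all values

print(value_range, iqr, round(cv, 6), round(variance, 2), round(total))
```

The minimum of 199 is what stretches the range to 1825 while the IQR stays at 9, so the range here reflects one suspicious value rather than the spread of the data.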
Value | Count | Frequency (%) |
2021 | 1289 | 12.9% |
2022 | 989 | 9.9% |
2020 | 977 | 9.8% |
2023 | 937 | 9.4% |
2007 | 692 | 6.9% |
2008 | 530 | 5.3% |
2019 | 499 | 5.0% |
2018 | 486 | 4.9% |
2017 | 447 | 4.5% |
2016 | 405 | 4.0% |
Other values (19) | 2749 |
Value | Count | Frequency (%) |
199 | 1 | < 0.1% |
1995 | 1 | < 0.1% |
1997 | 1 | < 0.1% |
1999 | 2 | < 0.1% |
2000 | 75 | 0.8% |
2001 | 199 | |
2002 | 145 | |
2003 | 170 | |
2004 | 86 | |
2005 | 80 |
Value | Count | Frequency (%) |
2024 | 312 | 3.1% |
2023 | 937 | 9.4% |
2022 | 989 | 9.9% |
2021 | 1289 | 12.9% |
2020 | 977 | 9.8% |
2019 | 499 | 5.0% |
2018 | 486 | 4.9% |
2017 | 447 | 4.5% |
2016 | 405 | 4.0% |
2015 | 381 | 3.8% |
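The single value 199 sits far outside the 1995 to 2024 span of the other years and is probably a data-entry error, although the report cannot confirm that. One way to flag such rows is to compare the year with the year part of 승인_일, as sketched below. `year_problems` is a hypothetical helper, the bounds are assumptions, and the pairing of 199 with 19960625 is invented for illustration.

```python
def year_problems(approval_year: int, approval_date: int,
                  min_year: int = 1900, max_year: int = 2024) -> list[str]:
    """Return review flags for one row; an empty list means nothing looked off.

    approval_date is an integer in YYYYMMDD form, as stored in 승인_일.
    """
    problems = []
    if not (min_year <= approval_year <= max_year):
        problems.append("year out of range")
    if approval_year != approval_date // 10000:
        problems.append("year differs from approval date")
    return problems


print(year_problems(2015, 20151030))  # [] (values from the sample rows)
print(year_problems(199, 19960625))   # hypothetical pairing, both flags raised
```

A mismatch is only a prompt for manual review, since an approval number issued in one year could in principle be dated in another.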
승인번호_기관_코드
Real number (ℝ)
Distinct | 147 |
---|---|
Distinct (%) | 1.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3175588.9 |
Minimum | 3000080 |
---|---|
Maximum | 6114262 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 3000080 |
---|---|
5-th percentile | 3030158 |
Q1 | 3100164 |
median | 3180173 |
Q3 | 3230139 |
95-th percentile | 3230263 |
Maximum | 6114262 |
Range | 3114182 |
Interquartile range (IQR) | 129975 |
Descriptive statistics
Standard deviation | 232134.53 |
---|---|
Coefficient of variation (CV) | 0.07309968 |
Kurtosis | 140.78388 |
Mean | 3175588.9 |
Median Absolute Deviation (MAD) | 49990 |
Skewness | 11.331603 |
Sum | 3.1755889 × 10^10 |
Variance | 5.3886439 × 10^10 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3100164 | 715 | 7.1% |
3230139 | 664 | 6.6% |
3230163 | 593 | 5.9% |
3230089 | 496 | 5.0% |
3230263 | 395 | 4.0% |
3230211 | 379 | 3.8% |
3220226 | 372 | 3.7% |
3030158 | 337 | 3.4% |
3220173 | 316 | 3.2% |
3210163 | 298 | 3.0% |
Other values (137) | 5435 |
Value | Count | Frequency (%) |
3000080 | 13 | 0.1% |
3000082 | 1 | < 0.1% |
3000148 | 22 | 0.2% |
3000185 | 2 | < 0.1% |
3000219 | 14 | 0.1% |
3010000 | 1 | < 0.1% |
3010107 | 25 | 0.2% |
3010179 | 68 | |
3010217 | 24 | 0.2% |
3020076 | 161 |
Value | Count | Frequency (%) |
6114262 | 1 | < 0.1% |
6114261 | 1 | < 0.1% |
6114031 | 5 | |
6113933 | 4 | |
6113930 | 6 | |
6113929 | 1 | < 0.1% |
6113486 | 9 | |
6113485 | 2 | < 0.1% |
6112999 | 7 | |
6112779 | 4 |
승인번호_구분_코드
Real number (ℝ)
Distinct | 29 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2257.5468 |
Minimum | 1501 |
---|---|
Maximum | 5812 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1501 |
---|---|
5-th percentile | 2229 |
Q1 | 2230 |
median | 2230 |
Q3 | 2232 |
95-th percentile | 2250 |
Maximum | 5812 |
Range | 4311 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 318.38198 |
---|---|
Coefficient of variation (CV) | 0.14103007 |
Kurtosis | 119.50764 |
Mean | 2257.5468 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 10.954889 |
Sum | 22575468 |
Variance | 101367.08 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2230 | 6250 | 62.5% |
2232 | 1486 | 14.9% |
2229 | 733 | 7.3% |
2241 | 516 | 5.2% |
2251 | 399 | 4.0% |
2101 | 133 | 1.3% |
2233 | 131 | 1.3% |
2226 | 82 | 0.8% |
2248 | 56 | 0.6% |
5809 | 42 | 0.4% |
Other values (19) | 172 | 1.7% |
Value | Count | Frequency (%) |
1501 | 8 | 0.1% |
1502 | 4 | < 0.1% |
2101 | 133 | 1.3% |
2221 | 1 | < 0.1% |
2222 | 22 | 0.2% |
2223 | 5 | 0.1% |
2225 | 14 | 0.1% |
2226 | 82 | 0.8% |
2227 | 9 | 0.1% |
2228 | 17 | 0.2% |
Value | Count | Frequency (%) |
5812 | 1 | < 0.1% |
5811 | 4 | < 0.1% |
5810 | 32 | 0.3% |
5809 | 42 | 0.4% |
2301 | 10 | 0.1% |
2260 | 1 | < 0.1% |
2251 | 399 | 4.0% |
2250 | 31 | 0.3% |
2249 | 5 | 0.1% |
2248 | 56 | 0.6% |
승인번호_일련번호
Real number (ℝ)
Distinct | 1601 |
---|---|
Distinct (%) | 16.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 351.0173 |
Minimum | 1 |
---|---|
Maximum | 5353 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 22 |
median | 85 |
Q3 | 243 |
95-th percentile | 2160.4 |
Maximum | 5353 |
Range | 5352 |
Interquartile range (IQR) | 221 |
Descriptive statistics
Standard deviation | 799.07205 |
---|---|
Coefficient of variation (CV) | 2.2764463 |
Kurtosis | 15.233275 |
Mean | 351.0173 |
Median Absolute Deviation (MAD) | 75 |
Skewness | 3.8043065 |
Sum | 3510173 |
Variance | 638516.14 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 338 | 3.4% |
2 | 219 | 2.2% |
3 | 205 | 2.1% |
5 | 136 | 1.4% |
4 | 134 | 1.3% |
6 | 120 | 1.2% |
10 | 107 | 1.1% |
8 | 106 | 1.1% |
7 | 100 | 1.0% |
15 | 97 | 1.0% |
Other values (1591) | 8438 |
Value | Count | Frequency (%) |
1 | 338 | 3.4% |
2 | 219 | 2.2% |
3 | 205 | 2.1% |
4 | 134 | 1.3% |
5 | 136 | 1.4% |
6 | 120 | 1.2% |
7 | 100 | 1.0% |
8 | 106 | 1.1% |
9 | 94 | 0.9% |
10 | 107 | 1.1% |
Value | Count | Frequency (%) |
5353 | 1 | |
5338 | 1 | |
5319 | 1 | |
5292 | 1 | |
5276 | 1 | |
5275 | 1 | |
5255 | 1 | |
5243 | 1 | |
5226 | 1 | |
5223 | 1 |
승인_일
Real number (ℝ)
HIGH CORRELATION
Distinct | 3622 |
---|---|
Distinct (%) | 36.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20162401 |
Minimum | 19950426 |
---|---|
Maximum | 20240508 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 19950426 |
---|---|
5-th percentile | 20030620 |
Q1 | 20121030 |
median | 20181229 |
Q3 | 20211021 |
95-th percentile | 20231024 |
Maximum | 20240508 |
Range | 290082 |
Interquartile range (IQR) | 89991 |
Descriptive statistics
Standard deviation | 64707.31 |
---|---|
Coefficient of variation (CV) | 0.0032093058 |
Kurtosis | -0.45150573 |
Mean | 20162401 |
Median Absolute Deviation (MAD) | 39393.5 |
Skewness | -0.84651418 |
Sum | 2.0162401 × 10^11 |
Variance | 4.187036 × 10^9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20070828 | 46 | 0.5% |
20070903 | 34 | 0.3% |
20070827 | 26 | 0.3% |
20070831 | 23 | 0.2% |
20071005 | 22 | 0.2% |
20071116 | 21 | 0.2% |
20070830 | 21 | 0.2% |
20070920 | 20 | 0.2% |
20070824 | 20 | 0.2% |
20070905 | 19 | 0.2% |
Other values (3612) | 9748 |
Value | Count | Frequency (%) |
19950426 | 1 | |
19960625 | 1 | |
19970402 | 1 | |
19990528 | 1 | |
19991230 | 1 | |
20000223 | 1 | |
20000303 | 1 | |
20000308 | 1 | |
20000314 | 1 | |
20000328 | 1 |
Value | Count | Frequency (%) |
20240508 | 2 | < 0.1% |
20240507 | 8 | |
20240503 | 6 | |
20240502 | 8 | |
20240501 | 1 | < 0.1% |
20240430 | 2 | < 0.1% |
20240429 | 3 | < 0.1% |
20240426 | 5 | |
20240425 | 2 | < 0.1% |
20240424 | 4 |
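승인_일 is profiled as a real number, but its values are dates written as YYYYMMDD integers, so figures such as the range of 290082 or the mean are not calendar quantities. The sketch below converts the reported minimum and maximum to dates and measures the span in days. `parse_yyyymmdd` is a hypothetical helper.

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Convert an integer such as 20240508 into a date object."""
    return datetime.strptime(str(value), "%Y%m%d").date()


first = parse_yyyymmdd(19950426)  # reported minimum
last = parse_yyyymmdd(20240508)   # reported maximum

span_days = (last - first).days
print(first, last, span_days)
```

Converting the column to a date type before profiling would likely give more useful summaries, since integer arithmetic treats the step from 20231231 to 20240101 as 8870 rather than one day.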
대지_면적
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 469 |
---|---|
Distinct (%) | 4.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3288.64 |
Minimum | 0 |
---|---|
Maximum | 311810 |
Zeros | 9379 |
Zeros (%) | 93.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 4032.9 |
Maximum | 311810 |
Range | 311810 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 20112.276 |
---|---|
Coefficient of variation (CV) | 6.1156818 |
Kurtosis | 67.23311 |
Mean | 3288.64 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 7.7865051 |
Sum | 32886400 |
Variance | 4.0450363 × 10^8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 9379 | 93.8% |
155041.0 | 20 | 0.2% |
175403.0 | 14 | 0.1% |
144209.0 | 12 | 0.1% |
163197.0 | 10 | 0.1% |
96634.0 | 10 | 0.1% |
21067.4 | 9 | 0.1% |
161345.0 | 8 | 0.1% |
146538.0 | 7 | 0.1% |
33018.0 | 7 | 0.1% |
Other values (459) | 524 | 5.2% |
Value | Count | Frequency (%) |
0.0 | 9379 | 93.8% |
110.0 | 1 | < 0.1% |
112.87 | 1 | < 0.1% |
144.2 | 1 | < 0.1% |
152.0 | 1 | < 0.1% |
175.0 | 1 | < 0.1% |
193.73 | 1 | < 0.1% |
283.97 | 1 | < 0.1% |
290.76 | 1 | < 0.1% |
385.3 | 1 | < 0.1% |
Value | Count | Frequency (%) |
311810.0 | 1 | < 0.1% |
304375.3 | 2 | < 0.1% |
237830.7 | 1 | < 0.1% |
219217.9 | 2 | < 0.1% |
215214.0 | 5 | |
195080.4 | 1 | < 0.1% |
183281.0 | 2 | < 0.1% |
181627.0 | 7 | |
180149.9 | 1 | < 0.1% |
179339.0 | 5 |
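With 93.8% of the values equal to zero, the reported mean of 3288.64 describes mostly empty entries rather than typical site areas. If the zeros stand for "not recorded", which is an assumption the report does not confirm, the mean over the remaining rows can be recovered from the reported sum. `nonzero_mean` is a hypothetical helper.

```python
def nonzero_mean(total_sum: float, n_rows: int, n_zeros: int) -> float:
    """Mean over the rows whose value is not zero, from summary figures only."""
    n_nonzero = n_rows - n_zeros
    if n_nonzero <= 0:
        raise ValueError("no nonzero rows to average")
    return total_sum / n_nonzero


# Sum, row count and zero count copied from the 대지_면적 tables.
area_mean = nonzero_mean(total_sum=32886400, n_rows=10000, n_zeros=9379)
print(round(area_mean, 1))  # 52957.2
```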
건폐_율
Real number (ℝ)
HIGH CORRELATION, SKEWED, ZEROS
Distinct | 453 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0272807 |
Minimum | 0 |
---|---|
Maximum | 3838.41 |
Zeros | 9380 |
Zeros (%) | 93.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 14.4205 |
Maximum | 3838.41 |
Range | 3838.41 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 39.121949 |
---|---|
Coefficient of variation (CV) | 19.297746 |
Kurtosis | 9250.6456 |
Mean | 2.0272807 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 94.370271 |
Sum | 20272.807 |
Variance | 1530.5269 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 9380 | 93.8% |
17.54 | 19 | 0.2% |
12.35 | 14 | 0.1% |
13.21 | 12 | 0.1% |
12.67 | 10 | 0.1% |
16.83 | 10 | 0.1% |
10.28 | 9 | 0.1% |
16.0 | 8 | 0.1% |
18.13 | 7 | 0.1% |
12.41 | 7 | 0.1% |
Other values (443) | 524 | 5.2% |
Value | Count | Frequency (%) |
0.0 | 9380 | 93.8% |
0.02 | 3 | < 0.1% |
0.3 | 1 | < 0.1% |
0.34 | 1 | < 0.1% |
0.39 | 1 | < 0.1% |
0.61 | 1 | < 0.1% |
0.71 | 1 | < 0.1% |
0.78 | 1 | < 0.1% |
0.86 | 3 | < 0.1% |
0.89 | 1 | < 0.1% |
Value | Count | Frequency (%) |
3838.41 | 1 | < 0.1% |
100.0 | 6 | |
87.91 | 1 | < 0.1% |
74.64 | 1 | < 0.1% |
68.92 | 1 | < 0.1% |
66.32 | 1 | < 0.1% |
61.05 | 1 | < 0.1% |
59.99 | 1 | < 0.1% |
59.96 | 1 | < 0.1% |
59.93 | 2 | < 0.1% |
연면적
Real number (ℝ)
HIGH CORRELATION, SKEWED, ZEROS
Distinct | 1334 |
---|---|
Distinct (%) | 13.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 123377.42 |
Minimum | 0 |
---|---|
Maximum | 2.376202 × 10^8 |
Zeros | 8212 |
Zeros (%) | 82.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 137014.9 |
Maximum | 2.376202 × 10^8 |
Range | 2.376202 × 10^8 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 4491028 |
---|---|
Coefficient of variation (CV) | 36.40073 |
Kurtosis | 2174.0685 |
Mean | 123377.42 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 46.112782 |
Sum | 1.2337742 × 10^9 |
Variance | 2.0169333 × 10^13 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 8212 | 82.1% |
208958.0 | 20 | 0.2% |
565619.28 | 17 | 0.2% |
241842.0 | 14 | 0.1% |
405203.72 | 11 | 0.1% |
218969.0 | 10 | 0.1% |
142544.0 | 10 | 0.1% |
6222.3586 | 9 | 0.1% |
137014.9 | 9 | 0.1% |
117492.28 | 9 | 0.1% |
Other values (1324) | 1679 | 16.8% |
Value | Count | Frequency (%) |
0.0 | 8212 | 82.1% |
8.0 | 1 | < 0.1% |
11.89 | 1 | < 0.1% |
16.989 | 1 | < 0.1% |
22.897 | 1 | < 0.1% |
36.88 | 1 | < 0.1% |
49.94 | 1 | < 0.1% |
53.59 | 1 | < 0.1% |
53.78 | 1 | < 0.1% |
58.65 | 1 | < 0.1% |
Value | Count | Frequency (%) |
237620205.0 | 1 | |
222238584.0 | 1 | |
208026305.0 | 1 | |
170725215.0 | 1 | |
151064192.0 | 1 | |
22412776.0 | 1 | |
8624786.0 | 1 | |
8200562.0 | 1 | |
3700892.598 | 1 | |
2733103.0 | 1 |
용적_율
Real number (ℝ)
HIGH CORRELATION, SKEWED, ZEROS
Distinct | 444 |
---|---|
Distinct (%) | 4.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 108316.77 |
Minimum | 0 |
---|---|
Maximum | 1.0820891 × 10^9 |
Zeros | 9398 |
Zeros (%) | 94.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 124.16 |
Maximum | 1.0820891 × 10^9 |
Range | 1.0820891 × 10^9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 10820892 |
---|---|
Coefficient of variation (CV) | 99.900437 |
Kurtosis | 9999.9919 |
Mean | 108316.77 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 99.999939 |
Sum | 1.0831677 × 10^9 |
Variance | 1.1709171 × 10^14 |
Monotonicity | Not monotonic |
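A skewness of 99.999939 is almost exactly the square root of the row count (10000), which is the value the bias-adjusted sample skewness takes when a single observation dominates everything else. The sketch below shows this on synthetic data: 9999 zeros plus the reported maximum. It assumes the profiler uses the adjusted Fisher-Pearson estimator, as pandas' `Series.skew` does to my knowledge; `adjusted_skewness` is a hypothetical stand-in, not the library's code.

```python
import math


def adjusted_skewness(values: list[float]) -> float:
    """Bias-adjusted Fisher-Pearson sample skewness (often written G1)."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three values")
    mean = math.fsum(values) / n
    m2 = math.fsum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = math.fsum((v - mean) ** 3 for v in values) / n  # third central moment
    if m2 == 0:
        raise ValueError("skewness is undefined for constant data")
    g1 = m3 / m2 ** 1.5
    return g1 * math.sqrt(n * (n - 1)) / (n - 2)


# Synthetic stand-in for the column: one extreme value among zeros.
synthetic = [0.0] * 9999 + [1082089127.0]
print(adjusted_skewness(synthetic))  # very close to sqrt(10000) = 100
```

Read this way, the alert says less about the shape of 용적_율 than about one value of roughly 1.08 × 10^9 in a field where the other large entries are below 600000.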
Value | Count | Frequency (%) |
0.0 | 9398 | 94.0% |
124.16 | 20 | 0.2% |
127.35 | 14 | 0.1% |
276.95 | 12 | 0.1% |
128.9 | 10 | 0.1% |
139.08 | 10 | 0.1% |
20.31 | 9 | 0.1% |
161.25 | 8 | 0.1% |
122.05 | 7 | 0.1% |
124.76 | 7 | 0.1% |
Other values (434) | 505 | 5.1% |
Value | Count | Frequency (%) |
0.0 | 9398 | 94.0% |
0.5 | 1 | < 0.1% |
0.64 | 1 | < 0.1% |
1.22 | 1 | < 0.1% |
1.44 | 1 | < 0.1% |
5.86 | 1 | < 0.1% |
6.0 | 1 | < 0.1% |
6.71 | 1 | < 0.1% |
7.05 | 1 | < 0.1% |
7.14 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1082089127.0 | 1 | |
599885.96 | 1 | |
340072.12 | 1 | |
999.34 | 1 | |
991.65 | 1 | |
959.94 | 1 | |
959.16 | 2 | |
921.44 | 1 | |
899.76 | 1 | |
799.64 | 1 |
기타_용도
Text
MISSING
Distinct | 189 |
---|---|
Distinct (%) | 7.0% |
Missing | 7285 |
Missing (%) | 72.9% |
Memory size | 156.2 KiB |
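The two percentages in this table use different denominators: the missing share is taken over all rows, while the distinct share of 7.0% only makes sense over the rows that have a value. The sketch below recomputes both from the reported counts. Reading the distinct share this way is an inference from the numbers, not something the report states.

```python
n_rows = 10000
n_missing = 7285
n_distinct = 189

n_present = n_rows - n_missing  # rows that actually carry a value

missing_pct = 100.0 * n_missing / n_rows
distinct_pct = 100.0 * n_distinct / n_present

print(n_present)                # 2715
print(f"{distinct_pct:.1f}%")   # 7.0%
```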
Value | Count | Frequency (%) |
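아파트 [apartment] | 1808 | 62.2% |
비내력벽철거 [non-load-bearing wall removal] | 391 | 13.5% |
발코니확장 [balcony extension] | 153 | 5.3% |
공동주택(아파트 [multi-unit housing (apartment] | 47 | 1.6% |
철거 [removal] | 46 | 1.6% |
비내력벽 [non-load-bearing wall] | 37 | 1.3% |
승강기 [elevator] | 24 | 0.8% |
공동주택 [multi-unit housing] | 22 | 0.8% |
및 [and] | 16 | 0.6% |
부대시설(승강기 [ancillary facilities (elevator] | 13 | 0.4% |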
Other values (234) | 348 | 12.0% |
Most occurring characters
Value | Count | Frequency (%) |
트 | 1889 | |
아 | 1888 | |
파 | 1888 | |
거 | 452 | 3.8% |
철 | 445 | 3.7% |
비 | 433 | 3.6% |
내 | 433 | 3.6% |
력 | 430 | 3.6% |
벽 | 430 | 3.6% |
(space) | 191 | 1.6% |
Other values (211) | 3486 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 11138 | |
Decimal Number | 275 | 2.3% |
Space Separator | 191 | 1.6% |
Open Punctuation | 128 | 1.1% |
Close Punctuation | 128 | 1.1% |
Other Punctuation | 55 | 0.5% |
Uppercase Letter | 33 | 0.3% |
Dash Punctuation | 7 | 0.1% |
Lowercase Letter | 6 | 0.1% |
Other Symbol | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
트 | 1889 | |
아 | 1888 | |
파 | 1888 | |
거 | 452 | 4.1% |
철 | 445 | 4.0% |
비 | 433 | 3.9% |
내 | 433 | 3.9% |
력 | 430 | 3.9% |
벽 | 430 | 3.9% |
주 | 179 | 1.6% |
Other values (178) | 2671 |
Decimal Number
Value | Count | Frequency (%) |
1 | 76 | |
0 | 65 | |
2 | 34 | |
5 | 25 | 9.1% |
4 | 24 | 8.7% |
3 | 19 | 6.9% |
8 | 9 | 3.3% |
7 | 8 | 2.9% |
6 | 8 | 2.9% |
9 | 7 | 2.5% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 12 | |
V | 7 | |
T | 7 | |
K | 2 | 6.1% |
D | 1 | 3.0% |
X | 1 | 3.0% |
M | 1 | 3.0% |
P | 1 | 3.0% |
A | 1 | 3.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 47 | |
/ | 4 | 7.3% |
. | 3 | 5.5% |
? | 1 | 1.8% |
Lowercase Letter
Value | Count | Frequency (%) |
c | 2 | |
m | 2 | |
t | 1 | |
v | 1 |
Space Separator
Value | Count | Frequency (%) |
(space) | 191 |
Open Punctuation
Value | Count | Frequency (%) |
( | 128 |
Close Punctuation
Value | Count | Frequency (%) |
) | 128 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 7 |
Other Symbol
Value | Count | Frequency (%) |
㎡ | 2 |
Math Symbol
Value | Count | Frequency (%) |
~ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 11138 | |
Common | 788 | 6.6% |
Latin | 39 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
트 | 1889 | |
아 | 1888 | |
파 | 1888 | |
거 | 452 | 4.1% |
철 | 445 | 4.0% |
비 | 433 | 3.9% |
내 | 433 | 3.9% |
력 | 430 | 3.9% |
벽 | 430 | 3.9% |
주 | 179 | 1.6% |
Other values (178) | 2671 |
Common
Value | Count | Frequency (%) |
(space) | 191 | 24.2% |
( | 128 | 16.2% |
) | 128 | 16.2% |
1 | 76 | 9.6% |
0 | 65 | 8.2% |
, | 47 | 6.0% |
2 | 34 | 4.3% |
5 | 25 | 3.2% |
4 | 24 | 3.0% |
3 | 19 | 2.4% |
Other values (10) | 51 | 6.5% |
Latin
Value | Count | Frequency (%) |
C | 12 | |
V | 7 | |
T | 7 | |
K | 2 | 5.1% |
c | 2 | 5.1% |
m | 2 | 5.1% |
D | 1 | 2.6% |
X | 1 | 2.6% |
M | 1 | 2.6% |
t | 1 | 2.6% |
Other values (3) | 3 | 7.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 11138 | |
ASCII | 825 | 6.9% |
CJK Compat | 2 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
트 | 1889 | |
아 | 1888 | |
파 | 1888 | |
거 | 452 | 4.1% |
철 | 445 | 4.0% |
비 | 433 | 3.9% |
내 | 433 | 3.9% |
력 | 430 | 3.9% |
벽 | 430 | 3.9% |
주 | 179 | 1.6% |
Other values (178) | 2671 |
ASCII
Value | Count | Frequency (%) |
(space) | 191 | 23.2% |
( | 128 | 15.5% |
) | 128 | 15.5% |
1 | 76 | 9.2% |
0 | 65 | 7.9% |
, | 47 | 5.7% |
2 | 34 | 4.1% |
5 | 25 | 3.0% |
4 | 24 | 2.9% |
3 | 19 | 2.3% |
Other values (22) | 88 |
CJK Compat
Value | Count | Frequency (%) |
㎡ | 2 |
작업_일자
Real number (ℝ)
HIGH CORRELATION
Distinct | 904 |
---|---|
Distinct (%) | 9.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20181148 |
Minimum | 20120222 |
---|---|
Maximum | 20240510 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20120222 |
---|---|
5-th percentile | 20120222 |
Q1 | 20140521 |
median | 20191203 |
Q3 | 20211204 |
95-th percentile | 20240102 |
Maximum | 20240510 |
Range | 120288 |
Interquartile range (IQR) | 70683 |
Descriptive statistics
Standard deviation | 40810.238 |
---|---|
Coefficient of variation (CV) | 0.002022196 |
Kurtosis | -1.3188226 |
Mean | 20181148 |
Median Absolute Deviation (MAD) | 29915 |
Skewness | -0.32482683 |
Sum | 2.0181148 × 10^11 |
Variance | 1.6654756 × 10^9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20120222 | 1762 | 17.6% |
20211029 | 937 | 9.4% |
20191203 | 437 | 4.4% |
20240102 | 117 | 1.2% |
20240208 | 107 | 1.1% |
20170406 | 89 | 0.9% |
20170928 | 74 | 0.7% |
20141115 | 65 | 0.7% |
20200304 | 62 | 0.6% |
20240510 | 61 | 0.6% |
Other values (894) | 6289 |
Value | Count | Frequency (%) |
20120222 | 1762 | 17.6% |
20120223 | 1 | < 0.1% |
20120229 | 5 | 0.1% |
20120301 | 2 | < 0.1% |
20120303 | 1 | < 0.1% |
20120306 | 5 | 0.1% |
20120309 | 2 | < 0.1% |
20120315 | 2 | < 0.1% |
20120317 | 2 | < 0.1% |
20120320 | 1 | < 0.1% |
Value | Count | Frequency (%) |
20240510 | 61 | 0.6% |
20240507 | 28 | 0.3% |
20240425 | 11 | 0.1% |
20240420 | 22 | 0.2% |
20240417 | 16 | 0.2% |
20240411 | 4 | < 0.1% |
20240406 | 11 | 0.1% |
20240402 | 12 | 0.1% |
20240330 | 20 | 0.2% |
20240327 | 57 |
Correlations
Variable | 승인번호_년 | 승인번호_기관_코드 | 승인번호_구분_코드 | 승인번호_일련번호 | 승인_일 | 대지_면적 | 건폐_율 | 연면적 | 용적_율 | 작업_일자 |
---|---|---|---|---|---|---|---|---|---|---|
승인번호_년 | 1.000 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
승인번호_기관_코드 | NaN | 1.000 | 0.000 | 0.000 | 0.150 | 0.116 | 0.000 | 0.000 | 0.000 | 0.151 |
승인번호_구분_코드 | NaN | 0.000 | 1.000 | 0.000 | 0.089 | 0.301 | 0.000 | 0.000 | 0.000 | 0.049 |
승인번호_일련번호 | NaN | 0.000 | 0.000 | 1.000 | 0.680 | 0.000 | 0.000 | 0.000 | 0.000 | 0.272 |
승인_일 | NaN | 0.150 | 0.089 | 0.680 | 1.000 | 0.239 | 0.000 | 0.000 | 0.000 | 0.938 |
대지_면적 | NaN | 0.116 | 0.301 | 0.000 | 0.239 | 1.000 | 0.000 | 0.044 | 0.000 | 0.172 |
건폐_율 | NaN | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
연면적 | NaN | 0.000 | 0.000 | 0.000 | 0.000 | 0.044 | 0.000 | 1.000 | 0.000 | 0.016 |
용적_율 | NaN | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
작업_일자 | NaN | 0.151 | 0.049 | 0.272 | 0.938 | 0.172 | 0.000 | 0.016 | 0.000 | 1.000 |
Variable | 승인번호_년 | 승인번호_기관_코드 | 승인번호_구분_코드 | 승인번호_일련번호 | 승인_일 | 대지_면적 | 건폐_율 | 연면적 | 용적_율 | 작업_일자 |
---|---|---|---|---|---|---|---|---|---|---|
승인번호_년 | 1.000 | -0.269 | -0.047 | -0.304 | 0.996 | -0.048 | -0.044 | 0.138 | -0.043 | 0.918 |
승인번호_기관_코드 | -0.269 | 1.000 | 0.005 | 0.347 | -0.266 | -0.028 | -0.026 | -0.113 | -0.024 | -0.301 |
승인번호_구분_코드 | -0.047 | 0.005 | 1.000 | -0.167 | -0.047 | -0.003 | -0.008 | -0.082 | -0.010 | -0.139 |
승인번호_일련번호 | -0.304 | 0.347 | -0.167 | 1.000 | -0.273 | -0.215 | -0.224 | -0.179 | -0.213 | -0.282 |
승인_일 | 0.996 | -0.266 | -0.047 | -0.273 | 1.000 | -0.053 | -0.049 | 0.133 | -0.049 | 0.918 |
대지_면적 | -0.048 | -0.028 | -0.003 | -0.215 | -0.053 | 1.000 | 0.988 | 0.547 | 0.966 | 0.100 |
건폐_율 | -0.044 | -0.026 | -0.008 | -0.224 | -0.049 | 0.988 | 1.000 | 0.541 | 0.971 | 0.102 |
연면적 | 0.138 | -0.113 | -0.082 | -0.179 | 0.133 | 0.547 | 0.541 | 1.000 | 0.534 | 0.208 |
용적_율 | -0.043 | -0.024 | -0.010 | -0.213 | -0.049 | 0.966 | 0.971 | 0.534 | 1.000 | 0.103 |
작업_일자 | 0.918 | -0.301 | -0.139 | -0.282 | 0.918 | 0.100 | 0.102 | 0.208 | 0.103 | 1.000 |
First rows
Row index | 관리_주택대장_PK | 승인번호_년 | 승인번호_기관_코드 | 승인번호_구분_코드 | 승인번호_일련번호 | 승인_일 | 대지_면적 | 건폐_율 | 연면적 | 용적_율 | 기타_용도 | 작업_일자 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
61623 | 11650-100028992 | 2015 | 3210163 | 2230 | 153 | 20151030 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 20151121 |
8474 | 11545-1000000000000000125803 | 2023 | 3170261 | 2229 | 4 | 20230703 | 0.0 | 0.0 | 0.0 | 0.0 | 아파트 | 20230708 |
59012 | 11470-100023711 | 2016 | 3140174 | 2230 | 153 | 20160822 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 20160827 |
63897 | 11710-100037789 | 2015 | 3230211 | 2232 | 58 | 20150325 | 0.0 | 0.0 | 0.0 | 0.0 | 아파트 | 20150402 |
15772 | 11290-100025183 | 2022 | 3070267 | 2229 | 35 | 20220726 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 20220728 |
74982 | 11710-3364 | 2003 | 3230089 | 2230 | 698 | 20030825 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 20120222 |
82332 | 11710-100013658 | 2009 | 3230163 | 2232 | 425 | 20090506 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 20120222 |
48489 | 11230-100015705 | 2019 | 3050000 | 5810 | 1 | 20190307 | 29850.3 | 30.8295 | 116772.6617 | 249.9711 | 아파트 | 20190309 |
43472 | 11470-451 | 2001 | 3140079 | 2229 | 50 | 20010312 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 20191203 |
42939 | 11470-2138 | 2005 | 3140104 | 2230 | 370 | 20050324 | 175403.0 | 12.35 | 241842.0 | 127.35 | <NA> | 20191203 |
Last rows
Row index | 관리_주택대장_PK | 승인번호_년 | 승인번호_기관_코드 | 승인번호_구분_코드 | 승인번호_일련번호 | 승인_일 | 대지_면적 | 건폐_율 | 연면적 | 용적_율 | 기타_용도 | 작업_일자 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
63512 | 11650-100027570 | 2015 | 3210163 | 2241 | 7 | 20150507 | 4799.5 | 49.96 | 21924.13 | 243.91 | <NA> | 20150512 |
46538 | 11320-100016326 | 2019 | 3090075 | 2232 | 6 | 20190729 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 20190907 |
22246 | 11590-100019044 | 2021 | 3190226 | 2241 | 6 | 20211105 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 20211204 |
29261 | 11320-100022866 | 2021 | 3090075 | 2230 | 166 | 20210923 | 0.0 | 0.0 | 8880.0 | 0.0 | 아파트 | 20211029 |
85984 | 11710-100013691 | 2009 | 3230163 | 2230 | 82 | 20090507 | 0.0 | 0.0 | 0.0 | 0.0 | 아파트 | 20120222 |
83106 | 11710-100007848 | 2008 | 3230163 | 2232 | 1131 | 20080918 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 20120222 |
48388 | 11200-100033914 | 2019 | 3030158 | 2230 | 41 | 20190315 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 20190321 |
24276 | 11710-100079158 | 2021 | 3230263 | 2229 | 91 | 20210602 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 20211029 |
84755 | 11710-4698 | 2007 | 3230139 | 2226 | 172 | 20071113 | 0.0 | 0.0 | 0.0 | 0.0 | <NA> | 20120222 |
12294 | 11740-1000000000000000076304 | 2023 | 3240172 | 2229 | 1 | 20230102 | 0.0 | 0.0 | 23616.54 | 0.0 | <NA> | 20230110 |