Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 1025 |
Missing cells (%) | 1.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 761.7 KiB |
Average record size in memory | 78.0 B |
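The percentage and per-record figures above can be recomputed from the raw counts. The following is a minimal sketch of that arithmetic. It assumes that the missing-cell share is taken over all cells (observations × variables) and that 1 KiB is 1024 bytes; both assumptions are consistent with the reported figures but are not stated in the report itself.

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 8
n_observations = 10_000
missing_cells = 1_025
total_size_kib = 761.7

# Share of missing cells over every cell in the table (assumed denominator).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average bytes per record, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```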
Variable types
Text | 2 |
---|---|
Categorical | 1 |
Numeric | 5 |
Dataset
Description | 관리_대지_위치_PK,관리_허가대장_PK,대표_여부,시군구_코드,법정동_코드,대지_구분_코드,번,지 |
---|---|
Author | 서울특별시 (Seoul Metropolitan Government) |
URL | https://data.seoul.go.kr/dataList/OA-15403/S/1/datasetView.do |
Alerts
관리_허가대장_PK has 775 (7.8%) missing values | Missing |
대지_구분_코드 has 247 (2.5%) missing values | Missing |
관리_대지_위치_PK has unique values | Unique |
대지_구분_코드 has 9629 (96.3%) zeros | Zeros |
번 has 311 (3.1%) zeros | Zeros |
지 has 1187 (11.9%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-18 04:39:56.563173 |
---|---|
Analysis finished | 2024-05-18 04:40:09.850462 |
Duration | 13.29 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
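The reported duration can be checked against the two timestamps. This is a small sketch using only the Python standard library, not something taken from the profiling tool:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table above.
started = datetime.fromisoformat("2024-05-18 04:39:56.563173")
finished = datetime.fromisoformat("2024-05-18 04:40:09.850462")

# Elapsed wall-clock time between the two timestamps, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```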
관리_대지_위치_PK
Text
UNIQUE
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 20.7762 |
Min length | 8 |
Characters and Unicode
Total characters | 207762 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 10000 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11740-100062677 |
---|---|
2nd row | 11170-100093867 |
3rd row | 11320-1000000000000000046122 |
4th row | 11000-1367 |
5th row | 11170-100098583 |
Value | Count | Frequency (%) |
11740-100062677 | 1 | < 0.1% |
11560-1000000000000004450593 | 1 | < 0.1% |
11740-1000000000000004051460 | 1 | < 0.1% |
11740-1000000000000004768298 | 1 | < 0.1% |
11170-100109803 | 1 | < 0.1% |
11170-1000000000000000961300 | 1 | < 0.1% |
11320-100082588 | 1 | < 0.1% |
11320-100071591 | 1 | < 0.1% |
11170-1000000000000004824889 | 1 | < 0.1% |
11170-100007541 | 1 | < 0.1% |
Other values (9990) | 9990 | 99.9% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 96361 | 46.4% |
1 | 40954 | 19.7% |
7 | 10296 | 5.0% |
4 | 10072 | 4.8% |
- | 10000 | 4.8% |
5 | 7134 | 3.4% |
3 | 6871 | 3.3% |
2 | 6723 | 3.2% |
8 | 6647 | 3.2% |
9 | 6492 | 3.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 197762 | 95.2% |
Dash Punctuation | 10000 | 4.8% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 96361 | 48.7% |
1 | 40954 | 20.7% |
7 | 10296 | 5.2% |
4 | 10072 | 5.1% |
5 | 7134 | 3.6% |
3 | 6871 | 3.5% |
2 | 6723 | 3.4% |
8 | 6647 | 3.4% |
9 | 6492 | 3.3% |
6 | 6212 | 3.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 207762 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 96361 | 46.4% |
1 | 40954 | 19.7% |
7 | 10296 | 5.0% |
4 | 10072 | 4.8% |
- | 10000 | 4.8% |
5 | 7134 | 3.4% |
3 | 6871 | 3.3% |
2 | 6723 | 3.2% |
8 | 6647 | 3.2% |
9 | 6492 | 3.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 207762 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 96361 | 46.4% |
1 | 40954 | 19.7% |
7 | 10296 | 5.0% |
4 | 10072 | 4.8% |
- | 10000 | 4.8% |
5 | 7134 | 3.4% |
3 | 6871 | 3.3% |
2 | 6723 | 3.2% |
8 | 6647 | 3.2% |
9 | 6492 | 3.1% |
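The character, category, and length figures for this column can be approximated with the Python standard library. The sketch below runs on the five sample keys only, so its counts will not match the full-column tables above. The mapping from two-letter Unicode categories to the long names used in this report is an assumption made for illustration; it is not ydata-profiling's own code.

```python
import unicodedata
from collections import Counter

# The five sample keys listed under "Sample" above.
samples = [
    "11740-100062677",
    "11170-100093867",
    "11320-1000000000000000046122",
    "11000-1367",
    "11170-100098583",
]

# Two-letter Unicode general categories mapped to the long names shown in the report.
CATEGORY_NAMES = {"Nd": "Decimal Number", "Pd": "Dash Punctuation"}

text = "".join(samples)
char_counts = Counter(text)
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for ch in text
)
lengths = sorted(len(s) for s in samples)

print("distinct characters:", len(char_counts))
print("categories:", dict(category_counts))
print("min/max length:", lengths[0], lengths[-1])
```

Each key contains exactly one dash, so the "Dash Punctuation" count equals the number of keys, which mirrors the 10000 dashes reported for the 10000 rows.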
관리_허가대장_PK
Text
MISSING
Distinct | 6115 |
---|---|
Distinct (%) | 66.3% |
Missing | 775 |
Missing (%) | 7.8% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 19.53897 |
Min length | 7 |
Characters and Unicode
Total characters | 180247 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 4967 |
---|---|
Unique (%) | 53.8% |
Sample
1st row | 11740-100081951 |
---|---|
2nd row | 11170-100068812 |
3rd row | 11320-100068905 |
4th row | 11000-135 |
5th row | 11170-100073052 |
Value | Count | Frequency (%) |
11650-1000000000000000499102 | 105 | 1.1% |
11740-1000000000000000062646 | 101 | 1.1% |
11170-100063630 | 95 | 1.0% |
11740-100059186 | 93 | 1.0% |
11170-100020545 | 54 | 0.6% |
11170-100023536 | 49 | 0.5% |
11000-100004105 | 46 | 0.5% |
11140-100047972 | 45 | 0.5% |
11320-100069125 | 36 | 0.4% |
11140-100077178 | 32 | 0.3% |
Other values (6105) | 8569 | 92.9% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 81668 | 45.3% |
1 | 36380 | 20.2% |
7 | 9730 | 5.4% |
- | 9225 | 5.1% |
4 | 7733 | 4.3% |
2 | 6867 | 3.8% |
3 | 6606 | 3.7% |
5 | 6553 | 3.6% |
6 | 5734 | 3.2% |
8 | 5120 | 2.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 171022 | 94.9% |
Dash Punctuation | 9225 | 5.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 81668 | 47.8% |
1 | 36380 | 21.3% |
7 | 9730 | 5.7% |
4 | 7733 | 4.5% |
2 | 6867 | 4.0% |
3 | 6606 | 3.9% |
5 | 6553 | 3.8% |
6 | 5734 | 3.4% |
8 | 5120 | 3.0% |
9 | 4631 | 2.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9225 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 180247 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 81668 | 45.3% |
1 | 36380 | 20.2% |
7 | 9730 | 5.4% |
- | 9225 | 5.1% |
4 | 7733 | 4.3% |
2 | 6867 | 3.8% |
3 | 6606 | 3.7% |
5 | 6553 | 3.6% |
6 | 5734 | 3.2% |
8 | 5120 | 2.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 180247 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 81668 | 45.3% |
1 | 36380 | 20.2% |
7 | 9730 | 5.4% |
- | 9225 | 5.1% |
4 | 7733 | 4.3% |
2 | 6867 | 3.8% |
3 | 6606 | 3.7% |
5 | 6553 | 3.6% |
6 | 5734 | 3.2% |
8 | 5120 | 2.8% |
대표_여부
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 0 |
3rd row | 1 |
4th row | 0 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 5745 | 57.45% |
0 | 4255 | 42.55% |
시군구_코드
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 0.3% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11400.573 |
Minimum | 11110 |
---|---|
Maximum | 11740 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 11110 |
---|---|
5-th percentile | 11170 |
Q1 | 11170 |
median | 11320 |
Q3 | 11680 |
95-th percentile | 11740 |
Maximum | 11740 |
Range | 630 |
Interquartile range (IQR) | 510 |
Descriptive statistics
Standard deviation | 237.26905 |
---|---|
Coefficient of variation (CV) | 0.020812028 |
Kurtosis | -1.5562453 |
Mean | 11400.573 |
Median Absolute Deviation (MAD) | 150 |
Skewness | 0.37770878 |
Sum | 1.1399433 × 10^8 |
Variance | 56296.601 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11170 | 3210 | 32.1% |
11740 | 2050 | 20.5% |
11320 | 1053 | 10.5% |
11680 | 369 | 3.7% |
11650 | 352 | 3.5% |
11140 | 261 | 2.6% |
11710 | 202 | 2.0% |
11260 | 202 | 2.0% |
11500 | 198 | 2.0% |
11110 | 190 | 1.9% |
Other values (15) | 1912 | 19.1% |
Value | Count | Frequency (%) |
11110 | 190 | 1.9% |
11140 | 261 | 2.6% |
11170 | 3210 | 32.1% |
11200 | 168 | 1.7% |
11215 | 179 | 1.8% |
11230 | 114 | 1.1% |
11260 | 202 | 2.0% |
11290 | 150 | 1.5% |
11305 | 82 | 0.8% |
11320 | 1053 | 10.5% |
Value | Count | Frequency (%) |
11740 | 2050 | 20.5% |
11710 | 202 | 2.0% |
11680 | 369 | 3.7% |
11650 | 352 | 3.5% |
11620 | 163 | 1.6% |
11590 | 106 | 1.1% |
11560 | 135 | 1.4% |
11545 | 103 | 1.0% |
11530 | 152 | 1.5% |
11500 | 198 | 2.0% |
법정동_코드
Real number (ℝ)
Distinct | 77 |
---|---|
Distinct (%) | 0.8% |
Missing | 2 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11336.907 |
Minimum | 0 |
---|---|
Maximum | 18700 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 10100 |
Q1 | 10500 |
median | 10800 |
Q3 | 11900 |
95-th percentile | 13200 |
Maximum | 18700 |
Range | 18700 |
Interquartile range (IQR) | 1400 |
Descriptive statistics
Standard deviation | 1342.2476 |
---|---|
Coefficient of variation (CV) | 0.11839627 |
Kurtosis | 5.3811342 |
Mean | 11336.907 |
Median Absolute Deviation (MAD) | 500 |
Skewness | 1.8932233 |
Sum | 1.133464 × 10^8 |
Variance | 1801628.6 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10900 | 1048 | 10.5% |
10100 | 878 | 8.8% |
10700 | 845 | 8.5% |
10800 | 781 | 7.8% |
10500 | 751 | 7.5% |
10200 | 750 | 7.5% |
13100 | 535 | 5.3% |
10600 | 532 | 5.3% |
13000 | 505 | 5.1% |
10300 | 296 | 3.0% |
Other values (67) | 3077 | 30.8% |
Value | Count | Frequency (%) |
0 | 1 | < 0.1% |
10100 | 878 | 8.8% |
10200 | 750 | 7.5% |
10300 | 296 | 3.0% |
10400 | 258 | 2.6% |
10500 | 751 | 7.5% |
10600 | 532 | 5.3% |
10700 | 845 | 8.5% |
10800 | 781 | 7.8% |
10900 | 1048 | 10.5% |
Value | Count | Frequency (%) |
18700 | 1 | < 0.1% |
18600 | 1 | < 0.1% |
18500 | 1 | < 0.1% |
18400 | 4 | < 0.1% |
18300 | 13 | 0.1% |
18200 | 5 | 0.1% |
18100 | 1 | < 0.1% |
17500 | 17 | 0.2% |
17400 | 8 | 0.1% |
17200 | 2 | < 0.1% |
대지_구분_코드
Real number (ℝ)
MISSING, ZEROS
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 247 |
Missing (%) | 2.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.021839434 |
Minimum | 0 |
---|---|
Maximum | 9 |
Zeros | 9629 |
Zeros (%) | 96.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 9 |
Range | 9 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.22831466 |
---|---|
Coefficient of variation (CV) | 10.454239 |
Kurtosis | 470.90973 |
Mean | 0.021839434 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 17.473287 |
Sum | 213 |
Variance | 0.052127584 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 9629 | 96.3% |
2 | 66 | 0.7% |
1 | 54 | 0.5% |
7 | 2 | < 0.1% |
4 | 1 | < 0.1% |
9 | 1 | < 0.1% |
(Missing) | 247 | 2.5% |
Value | Count | Frequency (%) |
0 | 9629 | 96.3% |
1 | 54 | 0.5% |
2 | 66 | 0.7% |
4 | 1 | < 0.1% |
7 | 2 | < 0.1% |
9 | 1 | < 0.1% |
Value | Count | Frequency (%) |
9 | 1 | < 0.1% |
7 | 2 | < 0.1% |
4 | 1 | < 0.1% |
2 | 66 | 0.7% |
1 | 54 | 0.5% |
0 | 9629 | 96.3% |
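Because 대지_구분_코드 has only six distinct values, its descriptive statistics can be rebuilt from the frequency table alone. The sketch below does that with the standard library. It assumes the 247 missing rows are excluded and that the variance is the sample variance (dividing by n - 1); under those assumptions the results agree with the reported mean, variance, standard deviation, and coefficient of variation.

```python
import math

# Value counts copied from the 대지_구분_코드 frequency table above
# (the 247 missing rows are excluded from the statistics).
value_counts = {0: 9629, 1: 54, 2: 66, 4: 1, 7: 2, 9: 1}

n = sum(value_counts.values())
total = sum(value * count for value, count in value_counts.items())
mean = total / n

# Sample variance (divides by n - 1), assumed here because it matches the report.
squared_dev = sum(count * (value - mean) ** 2 for value, count in value_counts.items())
variance = squared_dev / (n - 1)
std = math.sqrt(variance)

# Coefficient of variation: standard deviation relative to the mean.
cv = std / mean

print(n, total)
print(f"mean={mean:.9f} variance={variance:.9f} std={std:.8f} cv={cv:.6f}")
```

A coefficient of variation above 10 here mostly reflects a mean close to zero: 96.3% of the rows hold the value 0.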
번
Real number (ℝ)
ZEROS
Distinct | 1015 |
---|---|
Distinct (%) | 10.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 304.5908 |
Minimum | 0 |
---|---|
Maximum | 4677 |
Zeros | 311 |
Zeros (%) | 3.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 59 |
median | 229 |
Q3 | 450 |
95-th percentile | 797 |
Maximum | 4677 |
Range | 4677 |
Interquartile range (IQR) | 391 |
Descriptive statistics
Standard deviation | 309.84837 |
---|---|
Coefficient of variation (CV) | 1.0172611 |
Kurtosis | 8.8075937 |
Mean | 304.5908 |
Median Absolute Deviation (MAD) | 189 |
Skewness | 1.9611255 |
Sum | 3045908 |
Variance | 96006.013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 366 | 3.7% |
0 | 311 | 3.1% |
423 | 124 | 1.2% |
5 | 120 | 1.2% |
40 | 107 | 1.1% |
2 | 74 | 0.7% |
11 | 55 | 0.5% |
98 | 49 | 0.5% |
63 | 48 | 0.5% |
315 | 47 | 0.5% |
Other values (1005) | 8699 | 87.0% |
Value | Count | Frequency (%) |
0 | 311 | 3.1% |
1 | 366 | 3.7% |
2 | 74 | 0.7% |
3 | 45 | 0.4% |
4 | 36 | 0.4% |
5 | 120 | 1.2% |
6 | 24 | 0.2% |
7 | 31 | 0.3% |
8 | 37 | 0.4% |
9 | 29 | 0.3% |
Value | Count | Frequency (%) |
4677 | 1 | < 0.1% |
3581 | 1 | < 0.1% |
2533 | 1 | < 0.1% |
2473 | 1 | < 0.1% |
2252 | 1 | < 0.1% |
2092 | 1 | < 0.1% |
1762 | 1 | < 0.1% |
1736 | 23 | 0.2% |
1732 | 1 | < 0.1% |
1720 | 2 | < 0.1% |
지
Real number (ℝ)
ZEROS
Distinct | 553 |
---|---|
Distinct (%) | 5.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 62.6278 |
Minimum | 0 |
---|---|
Maximum | 3249 |
Zeros | 1187 |
Zeros (%) | 11.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 3 |
median | 12 |
Q3 | 37 |
95-th percentile | 253 |
Maximum | 3249 |
Range | 3249 |
Interquartile range (IQR) | 34 |
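The quantiles above show how long the right tail of 지 is. One common way to quantify that is Tukey's 1.5 × IQR rule, sketched below. The rule is brought in here purely as an illustration and is not part of the profiling report.

```python
# Quantiles copied from the "Quantile statistics" table for 지.
q1, q3 = 3, 37
p95, maximum = 253, 3249

iqr = q3 - q1

# Tukey's rule of thumb: values beyond 1.5 * IQR from the quartiles are flagged.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(iqr, lower_fence, upper_fence)
print("95th percentile beyond upper fence:", p95 > upper_fence)
```

With these inputs the upper fence is 88, so the 95th percentile (253) already lies beyond it and more than 5% of the rows would be flagged under this rule.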
Descriptive statistics
Standard deviation | 209.49787 |
---|---|
Coefficient of variation (CV) | 3.3451259 |
Kurtosis | 90.503155 |
Mean | 62.6278 |
Median Absolute Deviation (MAD) | 11 |
Skewness | 8.2652721 |
Sum | 626278 |
Variance | 43889.36 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1187 | 11.9% |
1 | 674 | 6.7% |
2 | 561 | 5.6% |
3 | 467 | 4.7% |
5 | 386 | 3.9% |
4 | 350 | 3.5% |
6 | 264 | 2.6% |
7 | 252 | 2.5% |
8 | 223 | 2.2% |
9 | 212 | 2.1% |
Other values (543) | 5424 | 54.2% |
Value | Count | Frequency (%) |
0 | 1187 | 11.9% |
1 | 674 | 6.7% |
2 | 561 | 5.6% |
3 | 467 | 4.7% |
4 | 350 | 3.5% |
5 | 386 | 3.9% |
6 | 264 | 2.6% |
7 | 252 | 2.5% |
8 | 223 | 2.2% |
9 | 212 | 2.1% |
Value | Count | Frequency (%) |
3249 | 1 | < 0.1% |
3196 | 1 | < 0.1% |
3195 | 1 | < 0.1% |
3194 | 1 | < 0.1% |
3187 | 1 | < 0.1% |
3185 | 1 | < 0.1% |
3146 | 1 | < 0.1% |
3136 | 1 | < 0.1% |
3128 | 1 | < 0.1% |
3118 | 1 | < 0.1% |
Variable | 대표_여부 | 시군구_코드 | 법정동_코드 | 대지_구분_코드 | 번 | 지 |
---|---|---|---|---|---|---|
대표_여부 | 1.000 | 0.101 | 0.156 | 0.082 | 0.143 | 0.080 |
시군구_코드 | 0.101 | 1.000 | 0.543 | 0.114 | 0.434 | 0.206 |
법정동_코드 | 0.156 | 0.543 | 1.000 | 0.037 | 0.256 | 0.115 |
대지_구분_코드 | 0.082 | 0.114 | 0.037 | 1.000 | 0.015 | 0.065 |
번 | 0.143 | 0.434 | 0.256 | 0.015 | 1.000 | 0.000 |
지 | 0.080 | 0.206 | 0.115 | 0.065 | 0.000 | 1.000 |
Variable | 시군구_코드 | 법정동_코드 | 대지_구분_코드 | 번 | 지 | 대표_여부 |
---|---|---|---|---|---|---|
시군구_코드 | 1.000 | -0.497 | 0.080 | 0.304 | -0.053 | 0.077 |
법정동_코드 | -0.497 | 1.000 | -0.088 | -0.145 | 0.064 | 0.112 |
대지_구분_코드 | 0.080 | -0.088 | 1.000 | -0.153 | -0.113 | 0.059 |
번 | 0.304 | -0.145 | -0.153 | 1.000 | -0.004 | 0.107 |
지 | -0.053 | 0.064 | -0.113 | -0.004 | 1.000 | 0.080 |
대표_여부 | 0.077 | 0.112 | 0.059 | 0.107 | 0.080 | 1.000 |
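A correlation matrix of this size is easier to read as a ranked list of variable pairs. The sketch below copies the second (signed) matrix above and ranks the off-diagonal pairs by absolute value. The report does not say which correlation coefficient each matrix holds, so the code treats the numbers as given and makes no assumption about how they were computed.

```python
from itertools import combinations

# Second correlation matrix above, copied row by row.
names = ["시군구_코드", "법정동_코드", "대지_구분_코드", "번", "지", "대표_여부"]
matrix = [
    [1.000, -0.497, 0.080, 0.304, -0.053, 0.077],
    [-0.497, 1.000, -0.088, -0.145, 0.064, 0.112],
    [0.080, -0.088, 1.000, -0.153, -0.113, 0.059],
    [0.304, -0.145, -0.153, 1.000, -0.004, 0.107],
    [-0.053, 0.064, -0.113, -0.004, 1.000, 0.080],
    [0.077, 0.112, 0.059, 0.107, 0.080, 1.000],
]

size = len(names)

# Sanity check on the transcription: a correlation matrix should be symmetric.
is_symmetric = all(matrix[i][j] == matrix[j][i] for i in range(size) for j in range(size))

# Each unordered pair once, ranked by absolute strength.
pairs = [(names[i], names[j], matrix[i][j]) for i, j in combinations(range(size), 2)]
ranked = sorted(pairs, key=lambda pair: abs(pair[2]), reverse=True)

print("symmetric:", is_symmetric)
for first, second, value in ranked[:3]:
    print(f"{first} ~ {second}: {value:+.3f}")
```

The strongest pair is 시군구_코드 and 법정동_코드 at -0.497, followed by 시군구_코드 and 번 at 0.304; every other pair is below 0.16 in absolute value.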
First rows
Index | 관리_대지_위치_PK | 관리_허가대장_PK | 대표_여부 | 시군구_코드 | 법정동_코드 | 대지_구분_코드 | 번 | 지 |
---|---|---|---|---|---|---|---|---|
9991 | 11740-100062677 | 11740-100081951 | 1 | 11740 | 10500 | 0 | 252 | 11 |
8408 | 11170-100093867 | 11170-100068812 | 0 | 11170 | 10800 | 0 | 33 | 239 |
2862 | 11320-1000000000000000046122 | 11320-100068905 | 1 | 11320 | 10500 | 0 | 532 | 1 |
363 | 11000-1367 | 11000-135 | 0 | 11530 | 10200 | <NA> | 0 | 0 |
16039 | 11170-100098583 | 11170-100073052 | 1 | 11170 | 12800 | 0 | 65 | 375 |
5716 | 11170-1000000000000004417248 | 11170-1000000000000000420677 | 1 | 11170 | 13100 | 0 | 11 | 49 |
10858 | 11320-1000000000000000048287 | 11320-100071245 | 0 | 11320 | 10700 | 0 | 662 | 69 |
5349 | 11680-1000000000000004967662 | 11680-1000000000000000502153 | 0 | 11680 | 10700 | 0 | 582 | 13 |
5046 | 11545-100005682 | 11545-100004676 | 1 | 11545 | 10100 | 0 | 60 | 28 |
13210 | 11740-100062693 | 11740-100088391 | 1 | 11740 | 10900 | 0 | 425 | 5 |
Last rows
Index | 관리_대지_위치_PK | 관리_허가대장_PK | 대표_여부 | 시군구_코드 | 법정동_코드 | 대지_구분_코드 | 번 | 지 |
---|---|---|---|---|---|---|---|---|
15258 | 11305-1000000000000002233787 | 11305-100087941 | 0 | 11305 | 10100 | 0 | 233 | 15 |
2399 | 11000-100006657 | 11000-100004105 | 0 | 11215 | 10500 | 0 | 65 | 9 |
543 | 11320-1000000000000004944223 | 11320-1000000000000000225418 | 1 | 11320 | 10800 | 0 | 351 | 2 |
1248 | 11320-100069591 | 11320-100060065 | 0 | 11320 | 10500 | 0 | 707 | 18 |
10739 | 11170-1000000000000003256955 | 11170-100085292 | 1 | 11170 | 12100 | 0 | 1 | 2 |
3926 | 11680-1000000000000004936829 | 11680-1000000000000000498231 | 0 | 11680 | 10500 | 0 | 128 | 22 |
8586 | 11305-100096436 | 11305-100090941 | 0 | 11305 | 10300 | 0 | 47 | 56 |
7966 | 11170-1000000000000000732600 | 11170-100090633 | 1 | 11170 | 13100 | 0 | 267 | 2 |
12237 | 11170-3375 | 11170-3263 | 1 | 11170 | 11200 | 0 | 85 | 0 |
6040 | 11650-1000000000000003668677 | 11650-1000000000000000282952 | 1 | 11650 | 10300 | 0 | 493 | 2 |
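In the sample rows, the part of 관리_대지_위치_PK before the dash usually equals 시군구_코드, with a few exceptions where the prefix is 11000. The sketch below splits the key and lists those exceptions for a handful of rows copied from the tables above. The interpretation of the key as "prefix-serial" is an observation from these samples rather than a documented format, and `split_key` is a hypothetical helper name.

```python
from typing import List, Tuple

# (관리_대지_위치_PK, 시군구_코드) pairs copied from the sample rows above.
rows: List[Tuple[str, int]] = [
    ("11740-100062677", 11740),
    ("11000-1367", 11530),
    ("11545-100005682", 11545),
    ("11000-100006657", 11215),
    ("11170-3375", 11170),
]


def split_key(key: str) -> Tuple[int, str]:
    """Split a key into its numeric prefix and the serial part after the first dash."""
    prefix, serial = key.split("-", 1)
    return int(prefix), serial


# Rows whose key prefix differs from the district code stored in the same row.
mismatches = [(key, code) for key, code in rows if split_key(key)[0] != code]
print(mismatches)
```

On the full dataset the same comparison could be used to count how many keys carry a prefix that differs from the row's 시군구_코드.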