Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 10000 |
Missing cells | 2278 |
Missing cells (%) | 7.6% |
Duplicate rows | 599 |
Duplicate rows (%) | 6.0% |
Total size in memory | 332.0 KiB |
Average record size in memory | 34.0 B |
Variable types
Text | 1 |
---|---|
Numeric | 2 |
Dataset
Description | 경기도 광주시 도시계획정보시스템의 건축물주제도 용적율 현황에 관한 데이터로 지번코드, 건물군관리번호, 용적률에 대한 항목을 제공합니다. |
---|---|
Author | 경기도 광주시 |
URL | https://www.data.go.kr/data/15122653/fileData.do |
Dataset has 599 (6.0%) duplicate rows | Duplicates |
건물군관리번호 has 2278 (22.8%) missing values | Missing |
용적률_심볼 has 3741 (37.4%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 08:22:20.268833 |
---|---|
Analysis finished | 2023-12-12 08:22:21.391517 |
Duration | 1.12 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
지번코드
Text
Distinct | 9330 |
---|---|
Distinct (%) | 93.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 64 |
---|---|
Median length | 19 |
Mean length | 19.0135 |
Min length | 19 |
Characters and Unicode
Total characters | 190135 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 8731 ? |
---|---|
Unique (%) | 87.3% |
Sample
1st row | 4161025324100170002 |
---|---|
2nd row | 4161035025102830002 |
3rd row | 4161010300100930040 |
4th row | 4161025027100770029 |
5th row | 4161025023101450029 |
Value | Count | Frequency (%) |
4161025027105020009 | 6 | 0.1% |
4161025934101490000 | 5 | 0.1% |
4161035021101160000 | 5 | 0.1% |
4161025936100720001 | 4 | < 0.1% |
4161034021101880000 | 4 | < 0.1% |
4161011100107570000 | 4 | < 0.1% |
4161034028100890000 | 4 | < 0.1% |
4161010800102110008 | 4 | < 0.1% |
4161010500100400000 | 4 | < 0.1% |
4161011300100910004 | 3 | < 0.1% |
Other values (9319) | 9954 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 65951 | |
1 | 41935 | |
2 | 18927 | 10.0% |
4 | 15785 | 8.3% |
6 | 14243 | 7.5% |
3 | 11796 | 6.2% |
5 | 9965 | 5.2% |
7 | 4158 | 2.2% |
9 | 4157 | 2.2% |
8 | 3026 | 1.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 189943 | |
Space Separator | 192 | 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 65951 | |
1 | 41935 | |
2 | 18927 | 10.0% |
4 | 15785 | 8.3% |
6 | 14243 | 7.5% |
3 | 11796 | 6.2% |
5 | 9965 | 5.2% |
7 | 4158 | 2.2% |
9 | 4157 | 2.2% |
8 | 3026 | 1.6% |
Space Separator
Value | Count | Frequency (%) |
192 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 190135 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 65951 | |
1 | 41935 | |
2 | 18927 | 10.0% |
4 | 15785 | 8.3% |
6 | 14243 | 7.5% |
3 | 11796 | 6.2% |
5 | 9965 | 5.2% |
7 | 4158 | 2.2% |
9 | 4157 | 2.2% |
8 | 3026 | 1.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 190135 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 65951 | |
1 | 41935 | |
2 | 18927 | 10.0% |
4 | 15785 | 8.3% |
6 | 14243 | 7.5% |
3 | 11796 | 6.2% |
5 | 9965 | 5.2% |
7 | 4158 | 2.2% |
9 | 4157 | 2.2% |
8 | 3026 | 1.6% |
건물군관리번호
Real number (ℝ)
MISSING
 
Distinct | 7223 |
---|---|
Distinct (%) | 93.5% |
Missing | 2278 |
Missing (%) | 22.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 32356917 |
Minimum | 1 |
---|---|
Maximum | 1.0030381 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 983.05 |
Q1 | 13790.5 |
median | 25582.5 |
Q3 | 1.0019085 × 108 |
95-th percentile | 1.0027774 × 108 |
Maximum | 1.0030381 × 108 |
Range | 1.0030381 × 108 |
Interquartile range (IQR) | 1.0017706 × 108 |
Descriptive statistics
Standard deviation | 46852877 |
---|---|
Coefficient of variation (CV) | 1.4480019 |
Kurtosis | -1.4249457 |
Mean | 32356917 |
Median Absolute Deviation (MAD) | 15011.5 |
Skewness | 0.75856592 |
Sum | 2.4986011 × 1011 |
Variance | 2.1951921 × 1015 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
17060 | 6 | 0.1% |
28619 | 5 | 0.1% |
13494 | 5 | 0.1% |
16930 | 4 | < 0.1% |
100199797 | 4 | < 0.1% |
146 | 4 | < 0.1% |
7807 | 4 | < 0.1% |
18537 | 4 | < 0.1% |
25899 | 3 | < 0.1% |
29784 | 3 | < 0.1% |
Other values (7213) | 7680 | |
(Missing) | 2278 | 22.8% |
Value | Count | Frequency (%) |
1 | 2 | |
2 | 1 | |
3 | 1 | |
5 | 1 | |
6 | 1 | |
8 | 1 | |
9 | 1 | |
12 | 1 | |
17 | 2 | |
18 | 1 |
Value | Count | Frequency (%) |
100303807 | 1 | |
100303803 | 1 | |
100303722 | 1 | |
100303703 | 1 | |
100303702 | 1 | |
100303543 | 1 | |
100303482 | 1 | |
100303448 | 1 | |
100303378 | 2 | |
100303358 | 1 |
용적률_심볼
Real number (ℝ)
ZEROS
 
Distinct | 4082 |
---|---|
Distinct (%) | 40.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39.11274 |
Minimum | 0 |
---|---|
Maximum | 523.85 |
Zeros | 3741 |
Zeros (%) | 37.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 29.92 |
Q3 | 53.7975 |
95-th percentile | 144.261 |
Maximum | 523.85 |
Range | 523.85 |
Interquartile range (IQR) | 53.7975 |
Descriptive statistics
Standard deviation | 46.909809 |
---|---|
Coefficient of variation (CV) | 1.1993486 |
Kurtosis | 4.4484797 |
Mean | 39.11274 |
Median Absolute Deviation (MAD) | 29.92 |
Skewness | 1.7140783 |
Sum | 391127.4 |
Variance | 2200.5302 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 3741 | |
50.0 | 14 | 0.1% |
99.62 | 11 | 0.1% |
99.75 | 11 | 0.1% |
30.0 | 11 | 0.1% |
99.77 | 11 | 0.1% |
99.83 | 11 | 0.1% |
99.97 | 11 | 0.1% |
39.9 | 10 | 0.1% |
99.92 | 10 | 0.1% |
Other values (4072) | 6159 |
Value | Count | Frequency (%) |
0.0 | 3741 | |
0.03 | 1 | < 0.1% |
0.49 | 1 | < 0.1% |
0.61 | 2 | < 0.1% |
0.64 | 2 | < 0.1% |
0.7 | 1 | < 0.1% |
1.34 | 1 | < 0.1% |
1.59 | 1 | < 0.1% |
1.67 | 1 | < 0.1% |
1.73 | 3 | < 0.1% |
Value | Count | Frequency (%) |
523.85 | 1 | |
468.98 | 1 | |
365.63 | 1 | |
314.29 | 1 | |
298.25 | 1 | |
296.19 | 1 | |
296.11 | 1 | |
288.82 | 1 | |
287.53 | 2 | |
282.42 | 1 |
건물군관리번호 | 용적률_심볼 | |
---|---|---|
건물군관리번호 | 1.000 | 0.297 |
용적률_심볼 | 0.297 | 1.000 |
건물군관리번호 | 용적률_심볼 | |
---|---|---|
건물군관리번호 | 1.000 | 0.213 |
용적률_심볼 | 0.213 | 1.000 |
지번코드 | 건물군관리번호 | 용적률_심볼 | |
---|---|---|---|
34454 | 4161025324100170002 | <NA> | 0.0 |
22772 | 4161035025102830002 | 10022 | 0.0 |
41756 | 4161010300100930040 | 28321 | 111.55 |
29232 | 4161025027100770029 | 12141 | 34.54 |
15447 | 4161025023101450029 | 8545 | 148.36 |
39771 | 4161025924100750012 | 11637 | 21.97 |
6588 | 4161025921104470001 | 6879 | 0.0 |
17333 | 4161011200100230030 | 16827 | 0.0 |
29408 | 4161025027103650003 | 27526 | 71.9 |
17112 | 4161010300101100016 | 32120 | 0.0 |
지번코드 | 건물군관리번호 | 용적률_심볼 | |
---|---|---|---|
30460 | 4161025327103970000 | 19660 | 0.0 |
16650 | 4161025933100300000 | 27189 | 28.85 |
24743 | 4161025023103680011 | 100250340 | 96.07 |
13676 | 4161010100101480065 | <NA> | 0.0 |
3252 | 4161025933101130000 | 30650 | 0.0 |
35975 | 4161025022105220015 | <NA> | 0.0 |
4368 | 4161025324100770002 | 100207180 | 22.95 |
34436 | 4161025324101170001 | <NA> | 0.0 |
11839 | 4161025329102760000 | 100202729 | 98.61 |
15193 | 4161025022103660006 | 100286839 | 146.54 |
Most frequently occurring
지번코드 | 건물군관리번호 | 용적률_심볼 | # duplicates | |
---|---|---|---|---|
246 | 4161025027105020009 | 17060 | 0.0 | 6 |
437 | 4161025934101490000 | 28619 | 0.0 | 5 |
555 | 4161035021101160000 | 13494 | 17.36 | 5 |
52 | 4161010500100400000 | 16930 | 0.0 | 4 |
83 | 4161010800102110008 | <NA> | 0.0 | 4 |
122 | 4161011100107570000 | 100199797 | 199.3 | 4 |
452 | 4161025936100720001 | 146 | 5.88 | 4 |
518 | 4161034021101880000 | 7807 | 0.0 | 4 |
548 | 4161034028100890000 | 18537 | 0.0 | 4 |
0 | <NA> | 0.0 | 3 |