Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 918 |
Missing cells | 108 |
Missing cells (%) | 2.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 37.8 KiB |
Average record size in memory | 42.1 B |
Variable types
Numeric | 2 |
---|---|
Text | 1 |
DateTime | 2 |
Dataset
Description | 한국토지주택공사에서 시행한 보상공고 현황(사업지구코드, 사업지구명, 보상시작일자, 보상종료일자 등)의 자료를 제공합니다. |
---|---|
Author | 한국토지주택공사 |
URL | https://www.data.go.kr/data/15122809/fileData.do |
Reproduction
Analysis started | 2023-12-12 05:49:33.809998 |
---|---|
Analysis finished | 2023-12-12 05:49:35.135343 |
Duration | 1.33 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
사업지구코드
Real number (ℝ)
Distinct | 461 |
---|---|
Distinct (%) | 50.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 156234.66 |
Minimum | 100187 |
---|---|
Maximum | 901660 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.2 KiB |
Quantile statistics
Minimum | 100187 |
---|---|
5-th percentile | 100261.85 |
Q1 | 100339.5 |
median | 101403 |
Q3 | 104877.5 |
95-th percentile | 300192.45 |
Maximum | 901660 |
Range | 801473 |
Interquartile range (IQR) | 4538 |
Descriptive statistics
Standard deviation | 174136.75 |
---|---|
Coefficient of variation (CV) | 1.1145846 |
Kurtosis | 12.77313 |
Mean | 156234.66 |
Median Absolute Deviation (MAD) | 1114 |
Skewness | 3.6882318 |
Sum | 1.4342342 × 108 |
Variance | 3.0323609 × 1010 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
100289 | 39 | 4.2% |
100266 | 30 | 3.3% |
100322 | 29 | 3.2% |
100276 | 17 | 1.9% |
101578 | 10 | 1.1% |
101706 | 10 | 1.1% |
100428 | 9 | 1.0% |
101458 | 9 | 1.0% |
100770 | 8 | 0.9% |
100576 | 8 | 0.9% |
Other values (451) | 749 |
Value | Count | Frequency (%) |
100187 | 1 | 0.1% |
100198 | 1 | 0.1% |
100221 | 1 | 0.1% |
100223 | 4 | |
100224 | 1 | 0.1% |
100232 | 2 | |
100233 | 1 | 0.1% |
100235 | 3 | |
100236 | 1 | 0.1% |
100237 | 2 |
Value | Count | Frequency (%) |
901660 | 1 | |
901557 | 1 | |
901164 | 1 | |
901114 | 1 | |
900922 | 1 | |
900907 | 1 | |
900906 | 1 | |
900717 | 1 | |
900447 | 1 | |
900169 | 1 |
사업지구명
Text
Distinct | 461 |
---|---|
Distinct (%) | 50.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 7.3 KiB |
Value | Count | Frequency (%) |
용인죽전 | 39 | 4.2% |
용인동백 | 30 | 3.2% |
화성동탄 | 29 | 3.1% |
원주무실2 | 17 | 1.8% |
수원고등(05주환3 | 10 | 1.1% |
하남미사(09보금3 | 10 | 1.1% |
광명소하(02gb | 9 | 1.0% |
남양뉴타운 | 9 | 1.0% |
군부대매봉산 | 8 | 0.9% |
화성동탄2 | 8 | 0.9% |
Other values (464) | 765 |
Most occurring characters
Value | Count | Frequency (%) |
) | 333 | 5.3% |
( | 333 | 5.3% |
주 | 231 | 3.7% |
0 | 184 | 2.9% |
산 | 162 | 2.6% |
2 | 136 | 2.2% |
동 | 126 | 2.0% |
천 | 123 | 1.9% |
인 | 116 | 1.8% |
성 | 109 | 1.7% |
Other values (294) | 4465 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 4740 | |
Decimal Number | 613 | 9.7% |
Close Punctuation | 333 | 5.3% |
Open Punctuation | 333 | 5.3% |
Uppercase Letter | 235 | 3.7% |
Dash Punctuation | 39 | 0.6% |
Space Separator | 16 | 0.3% |
Lowercase Letter | 4 | 0.1% |
Other Punctuation | 3 | < 0.1% |
Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 231 | 4.9% |
산 | 162 | 3.4% |
동 | 126 | 2.7% |
천 | 123 | 2.6% |
인 | 116 | 2.4% |
성 | 109 | 2.3% |
양 | 105 | 2.2% |
대 | 101 | 2.1% |
용 | 99 | 2.1% |
전 | 97 | 2.0% |
Other values (265) | 3471 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 108 | |
L | 79 | |
G | 29 | 12.3% |
R | 5 | 2.1% |
D | 5 | 2.1% |
T | 2 | 0.9% |
K | 2 | 0.9% |
X | 2 | 0.9% |
A | 1 | 0.4% |
M | 1 | 0.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 184 | |
2 | 136 | |
1 | 66 | 10.8% |
5 | 54 | 8.8% |
3 | 50 | 8.2% |
6 | 33 | 5.4% |
9 | 30 | 4.9% |
8 | 26 | 4.2% |
7 | 23 | 3.8% |
4 | 11 | 1.8% |
Other Punctuation
Value | Count | Frequency (%) |
. | 2 | |
& | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 333 |
Open Punctuation
Value | Count | Frequency (%) |
( | 333 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 39 |
Space Separator
Value | Count | Frequency (%) |
16 |
Lowercase Letter
Value | Count | Frequency (%) |
n | 4 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 4740 | |
Common | 1339 | 21.2% |
Latin | 239 | 3.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 231 | 4.9% |
산 | 162 | 3.4% |
동 | 126 | 2.7% |
천 | 123 | 2.6% |
인 | 116 | 2.4% |
성 | 109 | 2.3% |
양 | 105 | 2.2% |
대 | 101 | 2.1% |
용 | 99 | 2.1% |
전 | 97 | 2.0% |
Other values (265) | 3471 |
Common
Value | Count | Frequency (%) |
) | 333 | |
( | 333 | |
0 | 184 | |
2 | 136 | |
1 | 66 | 4.9% |
5 | 54 | 4.0% |
3 | 50 | 3.7% |
- | 39 | 2.9% |
6 | 33 | 2.5% |
9 | 30 | 2.2% |
Other values (7) | 81 | 6.0% |
Latin
Value | Count | Frequency (%) |
B | 108 | |
L | 79 | |
G | 29 | 12.1% |
R | 5 | 2.1% |
D | 5 | 2.1% |
n | 4 | 1.7% |
T | 2 | 0.8% |
K | 2 | 0.8% |
X | 2 | 0.8% |
A | 1 | 0.4% |
Other values (2) | 2 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 4740 | |
ASCII | 1578 | 25.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
) | 333 | |
( | 333 | |
0 | 184 | |
2 | 136 | |
B | 108 | 6.8% |
L | 79 | 5.0% |
1 | 66 | 4.2% |
5 | 54 | 3.4% |
3 | 50 | 3.2% |
- | 39 | 2.5% |
Other values (19) | 196 |
Hangul
Value | Count | Frequency (%) |
주 | 231 | 4.9% |
산 | 162 | 3.4% |
동 | 126 | 2.7% |
천 | 123 | 2.6% |
인 | 116 | 2.4% |
성 | 109 | 2.3% |
양 | 105 | 2.2% |
대 | 101 | 2.1% |
용 | 99 | 2.1% |
전 | 97 | 2.0% |
Other values (265) | 3471 |
보상공고일련번호
Real number (ℝ)
Distinct | 46 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.0501089 |
Minimum | 1 |
---|---|
Maximum | 46 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.2 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 3 |
95-th percentile | 20 |
Maximum | 46 |
Range | 45 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 6.8811134 |
---|---|
Coefficient of variation (CV) | 1.6989947 |
Kurtosis | 12.858883 |
Mean | 4.0501089 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 3.4687766 |
Sum | 3718 |
Variance | 47.349722 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 455 | |
2 | 162 | 17.6% |
3 | 84 | 9.2% |
4 | 45 | 4.9% |
5 | 31 | 3.4% |
6 | 20 | 2.2% |
7 | 14 | 1.5% |
8 | 11 | 1.2% |
9 | 8 | 0.9% |
10 | 6 | 0.7% |
Other values (36) | 82 | 8.9% |
Value | Count | Frequency (%) |
1 | 455 | |
2 | 162 | 17.6% |
3 | 84 | 9.2% |
4 | 45 | 4.9% |
5 | 31 | 3.4% |
6 | 20 | 2.2% |
7 | 14 | 1.5% |
8 | 11 | 1.2% |
9 | 8 | 0.9% |
10 | 6 | 0.7% |
Value | Count | Frequency (%) |
46 | 1 | |
45 | 1 | |
44 | 1 | |
43 | 1 | |
42 | 1 | |
41 | 1 | |
40 | 1 | |
39 | 1 | |
38 | 1 | |
37 | 1 |
보상시작일자
Date
MISSING
 
Distinct | 695 |
---|---|
Distinct (%) | 78.7% |
Missing | 35 |
Missing (%) | 3.8% |
Memory size | 7.3 KiB |
Minimum | 1997-02-17 00:00:00 |
---|---|
Maximum | 2023-08-07 00:00:00 |
보상종료일자
Date
MISSING
 
Distinct | 656 |
---|---|
Distinct (%) | 77.6% |
Missing | 73 |
Missing (%) | 8.0% |
Memory size | 7.3 KiB |
Minimum | 1997-12-23 00:00:00 |
---|---|
Maximum | 2023-08-21 00:00:00 |
사업지구코드 | 보상공고일련번호 | |
---|---|---|
사업지구코드 | 1.000 | 0.091 |
보상공고일련번호 | 0.091 | 1.000 |
사업지구코드 | 보상공고일련번호 | |
---|---|---|
사업지구코드 | 1.000 | -0.452 |
보상공고일련번호 | -0.452 | 1.000 |
사업지구코드 | 사업지구명 | 보상공고일련번호 | 보상시작일자 | 보상종료일자 | |
---|---|---|---|---|---|
0 | 100187 | 수원영통 | 1 | 2000-06-21 | 2000-08-20 |
1 | 100198 | 순천연향2 | 1 | 1997-02-17 | 1997-12-23 |
2 | 100221 | 횡성읍마 | 1 | 1999-03-03 | 1999-03-16 |
3 | 100223 | 광주신창 | 1 | 2001-09-17 | 2001-11-16 |
4 | 100223 | 광주신창 | 2 | 2002-12-02 | 2003-02-01 |
5 | 100223 | 광주신창 | 3 | 2003-04-04 | 2003-05-03 |
6 | 100223 | 광주신창 | 4 | 2004-05-31 | 2004-06-12 |
7 | 100224 | 대구칠곡3 | 1 | 1999-12-20 | 2000-02-20 |
8 | 100232 | 대전노은1 | 1 | 1999-01-14 | 1999-02-13 |
9 | 100232 | 대전노은1 | 2 | 1999-06-07 | 1999-07-31 |
사업지구코드 | 사업지구명 | 보상공고일련번호 | 보상시작일자 | 보상종료일자 | |
---|---|---|---|---|---|
908 | 900169 | 낙동강지구2 | 1 | 2009-09-26 | 2009-09-26 |
909 | 900447 | 현내-송현간도로공사(02수탁) | 1 | 2012-05-15 | 2012-06-29 |
910 | 900717 | 창원동읍우회도로 | 1 | 2011-07-04 | 2011-08-05 |
911 | 900906 | 영월북쌍 | 1 | 2016-08-22 | 2016-09-05 |
912 | 900907 | 남원주역세권 | 1 | 2017-09-04 | 2017-09-18 |
913 | 900922 | 전남담양 | 1 | 2015-05-26 | 2015-05-26 |
914 | 901114 | 밀양나노융합센터 | 1 | 2016-03-18 | 2016-04-04 |
915 | 901164 | 괴산미니복합타운 | 1 | 2020-03-25 | 2020-04-08 |
916 | 901557 | 광명너부대(수탁보상) | 1 | 2020-10-15 | 2020-10-30 |
917 | 901660 | 무주(반디나래지원센터) | 1 | 2021-07-27 | 2021-08-11 |