Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.0 KiB |
Average record size in memory | 61.3 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 2 |
Text | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 노바코스 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=c34c45c0-2e4c-11eb-8f72-932712f5aa3c |
지점 has constant value "" | Constant |
주소 has constant value "" | Constant |
기본키 is highly overall correlated with 전년대비 증감율(%) | High correlation |
2019년 개별공시지가 is highly overall correlated with 2020년 개별공시지가 | High correlation |
2020년 개별공시지가 is highly overall correlated with 2019년 개별공시지가 | High correlation |
전년대비 증감율(%) is highly overall correlated with 기본키 | High correlation |
기본키 has unique values | Unique |
지번 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 13:02:59.148972 |
---|---|
Analysis finished | 2023-12-10 13:03:01.527345 |
Duration | 2.38 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
기본키
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50.5 |
Minimum | 1 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5.95 |
Q1 | 25.75 |
median | 50.5 |
Q3 | 75.25 |
95-th percentile | 95.05 |
Maximum | 100 |
Range | 99 |
Interquartile range (IQR) | 49.5 |
Descriptive statistics
Standard deviation | 29.011492 |
---|---|
Coefficient of variation (CV) | 0.57448499 |
Kurtosis | -1.2 |
Mean | 50.5 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 0 |
Sum | 5050 |
Variance | 841.66667 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
65 | 1 | 1.0% |
75 | 1 | 1.0% |
74 | 1 | 1.0% |
73 | 1 | 1.0% |
72 | 1 | 1.0% |
71 | 1 | 1.0% |
70 | 1 | 1.0% |
69 | 1 | 1.0% |
68 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
100 | 1 | |
99 | 1 | |
98 | 1 | |
97 | 1 | |
96 | 1 | |
95 | 1 | |
94 | 1 | |
93 | 1 | |
92 | 1 | |
91 | 1 |
지점
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
A-1000-0239S-10 |
---|
Length
Max length | 15 |
---|---|
Median length | 15 |
Mean length | 15 |
Min length | 15 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | A-1000-0239S-10 |
---|---|
2nd row | A-1000-0239S-10 |
3rd row | A-1000-0239S-10 |
4th row | A-1000-0239S-10 |
5th row | A-1000-0239S-10 |
Common Values
Value | Count | Frequency (%) |
A-1000-0239S-10 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
a-1000-0239s-10 | 100 |
주소
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
서울 강동구 상일동 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서울 강동구 상일동 |
---|---|
2nd row | 서울 강동구 상일동 |
3rd row | 서울 강동구 상일동 |
4th row | 서울 강동구 상일동 |
5th row | 서울 강동구 상일동 |
Common Values
Value | Count | Frequency (%) |
서울 강동구 상일동 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
서울 | 100 | |
강동구 | 100 | |
상일동 | 100 |
지번
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
25-1 | 1 | 1.0% |
30-1 | 1 | 1.0% |
30 | 1 | 1.0% |
29-4 | 1 | 1.0% |
29-3 | 1 | 1.0% |
29-2 | 1 | 1.0% |
29-1 | 1 | 1.0% |
28 | 1 | 1.0% |
27 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
[ | 100 | |
] | 100 | |
- | 82 | |
2 | 72 | |
1 | 56 | |
3 | 44 | |
4 | 23 | 4.1% |
0 | 21 | 3.8% |
6 | 15 | 2.7% |
5 | 13 | 2.3% |
Other values (3) | 29 | 5.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 273 | |
Open Punctuation | 100 | 18.0% |
Close Punctuation | 100 | 18.0% |
Dash Punctuation | 82 | 14.8% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 72 | |
1 | 56 | |
3 | 44 | |
4 | 23 | 8.4% |
0 | 21 | 7.7% |
6 | 15 | 5.5% |
5 | 13 | 4.8% |
9 | 12 | 4.4% |
8 | 11 | 4.0% |
7 | 6 | 2.2% |
Open Punctuation
Value | Count | Frequency (%) |
[ | 100 |
Close Punctuation
Value | Count | Frequency (%) |
] | 100 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 82 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 555 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
[ | 100 | |
] | 100 | |
- | 82 | |
2 | 72 | |
1 | 56 | |
3 | 44 | |
4 | 23 | 4.1% |
0 | 21 | 3.8% |
6 | 15 | 2.7% |
5 | 13 | 2.3% |
Other values (3) | 29 | 5.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 555 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
[ | 100 | |
] | 100 | |
- | 82 | |
2 | 72 | |
1 | 56 | |
3 | 44 | |
4 | 23 | 4.1% |
0 | 21 | 3.8% |
6 | 15 | 2.7% |
5 | 13 | 2.3% |
Other values (3) | 29 | 5.2% |
2019년 개별공시지가
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 56 |
---|---|
Distinct (%) | 56.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 895085 |
Minimum | 184800 |
---|---|
Maximum | 3930000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 184800 |
---|---|
5-th percentile | 324700 |
Q1 | 509000 |
median | 521700 |
Q3 | 868250 |
95-th percentile | 2762050 |
Maximum | 3930000 |
Range | 3745200 |
Interquartile range (IQR) | 359250 |
Descriptive statistics
Standard deviation | 853133.64 |
---|---|
Coefficient of variation (CV) | 0.95313142 |
Kurtosis | 4.677294 |
Mean | 895085 |
Median Absolute Deviation (MAD) | 49550 |
Skewness | 2.2843648 |
Sum | 89508500 |
Variance | 7.27837 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
509600 | 14 | 14.0% |
324700 | 9 | 9.0% |
521700 | 9 | 9.0% |
893000 | 3 | 3.0% |
508000 | 3 | 3.0% |
509000 | 3 | 3.0% |
534000 | 3 | 3.0% |
471900 | 2 | 2.0% |
184800 | 2 | 2.0% |
536000 | 2 | 2.0% |
Other values (46) | 50 |
Value | Count | Frequency (%) |
184800 | 2 | 2.0% |
203200 | 1 | 1.0% |
324700 | 9 | |
363000 | 1 | 1.0% |
377000 | 1 | 1.0% |
467300 | 1 | 1.0% |
471900 | 2 | 2.0% |
476000 | 1 | 1.0% |
492000 | 1 | 1.0% |
493900 | 1 | 1.0% |
Value | Count | Frequency (%) |
3930000 | 1 | |
3885000 | 1 | |
3810000 | 2 | |
3276000 | 1 | |
2735000 | 1 | |
2566000 | 1 | |
2530000 | 1 | |
2523000 | 1 | |
2010000 | 1 | |
1866000 | 1 |
2020년 개별공시지가
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 58 |
---|---|
Distinct (%) | 58.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 964280 |
Minimum | 200600 |
---|---|
Maximum | 4190000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 200600 |
---|---|
5-th percentile | 349800 |
Q1 | 535900 |
median | 564200 |
Q3 | 900250 |
95-th percentile | 3030000 |
Maximum | 4190000 |
Range | 3989400 |
Interquartile range (IQR) | 364350 |
Descriptive statistics
Standard deviation | 919175.85 |
---|---|
Coefficient of variation (CV) | 0.95322505 |
Kurtosis | 4.5355787 |
Mean | 964280 |
Median Absolute Deviation (MAD) | 51350 |
Skewness | 2.270026 |
Sum | 96428000 |
Variance | 8.4488425 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
553200 | 12 | 12.0% |
349800 | 9 | 9.0% |
528600 | 6 | 6.0% |
587000 | 3 | 3.0% |
559000 | 3 | 3.0% |
535000 | 3 | 3.0% |
919000 | 3 | 3.0% |
539500 | 3 | 3.0% |
4070000 | 2 | 2.0% |
200600 | 2 | 2.0% |
Other values (48) | 54 |
Value | Count | Frequency (%) |
200600 | 2 | 2.0% |
211200 | 1 | 1.0% |
349800 | 9 | |
389400 | 1 | 1.0% |
434000 | 1 | 1.0% |
517300 | 1 | 1.0% |
523000 | 1 | 1.0% |
528600 | 6 | |
535000 | 3 | 3.0% |
536200 | 1 | 1.0% |
Value | Count | Frequency (%) |
4190000 | 1 | |
4150000 | 1 | |
4070000 | 2 | |
3600000 | 1 | |
3000000 | 1 | |
2820000 | 1 | |
2770000 | 1 | |
2760000 | 1 | |
2110000 | 1 | |
2019000 | 1 |
전년대비 증감율(%)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 12 |
---|---|
Distinct (%) | 12.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.0789 |
Minimum | 1.01 |
---|---|
Maximum | 1.15 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1.01 |
---|---|
5-th percentile | 1.01 |
Q1 | 1.07 |
median | 1.085 |
Q3 | 1.1 |
95-th percentile | 1.15 |
Maximum | 1.15 |
Range | 0.14 |
Interquartile range (IQR) | 0.03 |
Descriptive statistics
Standard deviation | 0.033901074 |
---|---|
Coefficient of variation (CV) | 0.031421887 |
Kurtosis | 0.077234507 |
Mean | 1.0789 |
Median Absolute Deviation (MAD) | 0.015 |
Skewness | -0.24163373 |
Sum | 107.89 |
Variance | 0.0011492828 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1.09 | 22 | |
1.08 | 19 | |
1.1 | 16 | |
1.03 | 11 | |
1.07 | 7 | 7.0% |
1.01 | 6 | 6.0% |
1.15 | 6 | 6.0% |
1.05 | 5 | 5.0% |
1.11 | 4 | 4.0% |
1.13 | 2 | 2.0% |
Other values (2) | 2 | 2.0% |
Value | Count | Frequency (%) |
1.01 | 6 | 6.0% |
1.02 | 1 | 1.0% |
1.03 | 11 | |
1.04 | 1 | 1.0% |
1.05 | 5 | 5.0% |
1.07 | 7 | 7.0% |
1.08 | 19 | |
1.09 | 22 | |
1.1 | 16 | |
1.11 | 4 | 4.0% |
Value | Count | Frequency (%) |
1.15 | 6 | 6.0% |
1.13 | 2 | 2.0% |
1.11 | 4 | 4.0% |
1.1 | 16 | |
1.09 | 22 | |
1.08 | 19 | |
1.07 | 7 | 7.0% |
1.05 | 5 | 5.0% |
1.04 | 1 | 1.0% |
1.03 | 11 |
기본키 | 지번 | 2019년 개별공시지가 | 2020년 개별공시지가 | 전년대비 증감율(%) | |
---|---|---|---|---|---|
기본키 | 1.000 | 1.000 | 0.523 | 0.484 | 0.550 |
지번 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
2019년 개별공시지가 | 0.523 | 1.000 | 1.000 | 0.992 | 0.587 |
2020년 개별공시지가 | 0.484 | 1.000 | 0.992 | 1.000 | 0.687 |
전년대비 증감율(%) | 0.550 | 1.000 | 0.587 | 0.687 | 1.000 |
기본키 | 2019년 개별공시지가 | 2020년 개별공시지가 | 전년대비 증감율(%) | |
---|---|---|---|---|
기본키 | 1.000 | -0.214 | -0.042 | 0.536 |
2019년 개별공시지가 | -0.214 | 1.000 | 0.930 | -0.155 |
2020년 개별공시지가 | -0.042 | 0.930 | 1.000 | 0.094 |
전년대비 증감율(%) | 0.536 | -0.155 | 0.094 | 1.000 |
기본키 | 지점 | 주소 | 지번 | 2019년 개별공시지가 | 2020년 개별공시지가 | 전년대비 증감율(%) | |
---|---|---|---|---|---|---|---|
0 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | [1] | 1738000 | 1881000 | 1.08 |
1 | 2 | A-1000-0239S-10 | 서울 강동구 상일동 | [2] | 844000 | 886000 | 1.05 |
2 | 3 | A-1000-0239S-10 | 서울 강동구 상일동 | [2-1] | 1866000 | 2019000 | 1.08 |
3 | 4 | A-1000-0239S-10 | 서울 강동구 상일동 | [2-2] | 1738000 | 1881000 | 1.08 |
4 | 5 | A-1000-0239S-10 | 서울 강동구 상일동 | [2-3] | 1830000 | 1980000 | 1.08 |
5 | 6 | A-1000-0239S-10 | 서울 강동구 상일동 | [2-4] | 577500 | 653400 | 1.13 |
6 | 7 | A-1000-0239S-10 | 서울 강동구 상일동 | [2-6] | 900000 | 927000 | 1.03 |
7 | 8 | A-1000-0239S-10 | 서울 강동구 상일동 | [2-7] | 577500 | 653400 | 1.13 |
8 | 9 | A-1000-0239S-10 | 서울 강동구 상일동 | [2-8] | 860000 | 886000 | 1.03 |
9 | 10 | A-1000-0239S-10 | 서울 강동구 상일동 | [3] | 816000 | 840000 | 1.03 |
기본키 | 지점 | 주소 | 지번 | 2019년 개별공시지가 | 2020년 개별공시지가 | 전년대비 증감율(%) | |
---|---|---|---|---|---|---|---|
90 | 91 | A-1000-0239S-10 | 서울 강동구 상일동 | [36-5] | 813000 | 894000 | 1.1 |
91 | 92 | A-1000-0239S-10 | 서울 강동구 상일동 | [36-6] | 1320000 | 1430000 | 1.08 |
92 | 93 | A-1000-0239S-10 | 서울 강동구 상일동 | [37] | 518000 | 535000 | 1.03 |
93 | 94 | A-1000-0239S-10 | 서울 강동구 상일동 | [38] | 518000 | 570000 | 1.1 |
94 | 95 | A-1000-0239S-10 | 서울 강동구 상일동 | [39-1] | 543200 | 601300 | 1.11 |
95 | 96 | A-1000-0239S-10 | 서울 강동구 상일동 | [39-2] | 508000 | 559000 | 1.1 |
96 | 97 | A-1000-0239S-10 | 서울 강동구 상일동 | [39-3] | 509600 | 564200 | 1.11 |
97 | 98 | A-1000-0239S-10 | 서울 강동구 상일동 | [39-4] | 508000 | 559000 | 1.1 |
98 | 99 | A-1000-0239S-10 | 서울 강동구 상일동 | [39-5] | 508000 | 559000 | 1.1 |
99 | 100 | A-1000-0239S-10 | 서울 강동구 상일동 | [39-7] | 549000 | 632000 | 1.15 |