Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 10.0 KiB |
Average record size in memory | 102.3 B |
Variable types
Numeric | 2 |
---|---|
Categorical | 6 |
Text | 1 |
Boolean | 1 |
DateTime | 2 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 노바코스 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=6b2ca190-150c-11eb-a877-a5b67dc5814b |
기준년도 has constant value "" | Constant |
기준월 has constant value "" | Constant |
지점 has constant value "" | Constant |
법정동명 has constant value "" | Constant |
특수지구분코드 has constant value "" | Constant |
특수지구분명 has constant value "" | Constant |
공시일자 has constant value "" | Constant |
데이터기준일자 has constant value "" | Constant |
표준지여부 is highly imbalanced (75.8%) | Imbalance |
기본키 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 10:15:59.717641 |
---|---|
Analysis finished | 2023-12-10 10:16:01.488874 |
Duration | 1.77 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
기본키
Real number (ℝ)
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50.5 |
Minimum | 1 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5.95 |
Q1 | 25.75 |
median | 50.5 |
Q3 | 75.25 |
95-th percentile | 95.05 |
Maximum | 100 |
Range | 99 |
Interquartile range (IQR) | 49.5 |
Descriptive statistics
Standard deviation | 29.011492 |
---|---|
Coefficient of variation (CV) | 0.57448499 |
Kurtosis | -1.2 |
Mean | 50.5 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 0 |
Sum | 5050 |
Variance | 841.66667 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
65 | 1 | 1.0% |
75 | 1 | 1.0% |
74 | 1 | 1.0% |
73 | 1 | 1.0% |
72 | 1 | 1.0% |
71 | 1 | 1.0% |
70 | 1 | 1.0% |
69 | 1 | 1.0% |
68 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
100 | 1 | |
99 | 1 | |
98 | 1 | |
97 | 1 | |
96 | 1 | |
95 | 1 | |
94 | 1 | |
93 | 1 | |
92 | 1 | |
91 | 1 |
기준년도
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2021 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021 |
---|---|
2nd row | 2021 |
3rd row | 2021 |
4th row | 2021 |
5th row | 2021 |
Common Values
Value | Count | Frequency (%) |
2021 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2021 | 100 |
기준월
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
1 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 100 |
지점
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
A-1000-0239S-10 |
---|
Length
Max length | 15 |
---|---|
Median length | 15 |
Mean length | 15 |
Min length | 15 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | A-1000-0239S-10 |
---|---|
2nd row | A-1000-0239S-10 |
3rd row | A-1000-0239S-10 |
4th row | A-1000-0239S-10 |
5th row | A-1000-0239S-10 |
Common Values
Value | Count | Frequency (%) |
A-1000-0239S-10 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
a-1000-0239s-10 | 100 |
법정동명
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
서울 강동구 상일동 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서울 강동구 상일동 |
---|---|
2nd row | 서울 강동구 상일동 |
3rd row | 서울 강동구 상일동 |
4th row | 서울 강동구 상일동 |
5th row | 서울 강동구 상일동 |
Common Values
Value | Count | Frequency (%) |
서울 강동구 상일동 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
서울 | 100 | |
강동구 | 100 | |
상일동 | 100 |
지번
Text
Distinct | 50 |
---|---|
Distinct (%) | 50.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
1 | 2 | 2.0% |
12-12 | 2 | 2.0% |
120 | 2 | 2.0% |
12 | 2 | 2.0% |
12-2 | 2 | 2.0% |
12-3 | 2 | 2.0% |
12-4 | 2 | 2.0% |
12-6 | 2 | 2.0% |
12-8 | 2 | 2.0% |
12-9 | 2 | 2.0% |
Other values (40) | 80 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 86 | |
- | 84 | |
2 | 66 | |
4 | 38 | |
3 | 20 | 5.7% |
8 | 20 | 5.7% |
6 | 12 | 3.4% |
0 | 10 | 2.9% |
7 | 6 | 1.7% |
5 | 4 | 1.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 266 | |
Dash Punctuation | 84 | 24.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 86 | |
2 | 66 | |
4 | 38 | |
3 | 20 | 7.5% |
8 | 20 | 7.5% |
6 | 12 | 4.5% |
0 | 10 | 3.8% |
7 | 6 | 2.3% |
5 | 4 | 1.5% |
9 | 4 | 1.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 84 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 350 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 86 | |
- | 84 | |
2 | 66 | |
4 | 38 | |
3 | 20 | 5.7% |
8 | 20 | 5.7% |
6 | 12 | 3.4% |
0 | 10 | 2.9% |
7 | 6 | 1.7% |
5 | 4 | 1.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 350 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 86 | |
- | 84 | |
2 | 66 | |
4 | 38 | |
3 | 20 | 5.7% |
8 | 20 | 5.7% |
6 | 12 | 3.4% |
0 | 10 | 2.9% |
7 | 6 | 1.7% |
5 | 4 | 1.1% |
개별공시지가(원)
Real number (ℝ)
Distinct | 20 |
---|---|
Distinct (%) | 20.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 759214 |
Minimum | 230100 |
---|---|
Maximum | 2235000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 230100 |
---|---|
5-th percentile | 230100 |
Q1 | 412500 |
median | 587500 |
Q3 | 625000 |
95-th percentile | 2105000 |
Maximum | 2235000 |
Range | 2004900 |
Interquartile range (IQR) | 212500 |
Descriptive statistics
Standard deviation | 589542.53 |
---|---|
Coefficient of variation (CV) | 0.77651694 |
Kurtosis | 1.1569006 |
Mean | 759214 |
Median Absolute Deviation (MAD) | 175000 |
Skewness | 1.6167187 |
Sum | 75921400 |
Variance | 3.475604 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
412500 | 24 | |
587500 | 18 | |
230100 | 12 | |
618700 | 6 | 6.0% |
596900 | 4 | 4.0% |
558100 | 4 | 4.0% |
1999000 | 4 | 4.0% |
694600 | 4 | 4.0% |
2102000 | 2 | 2.0% |
534300 | 2 | 2.0% |
Other values (10) | 20 |
Value | Count | Frequency (%) |
230100 | 12 | |
412500 | 24 | |
534300 | 2 | 2.0% |
558100 | 4 | 4.0% |
587500 | 18 | |
593700 | 2 | 2.0% |
596900 | 4 | 4.0% |
597300 | 2 | 2.0% |
618700 | 6 | 6.0% |
625000 | 2 | 2.0% |
Value | Count | Frequency (%) |
2235000 | 2 | |
2126000 | 2 | |
2105000 | 2 | |
2102000 | 2 | |
2083000 | 2 | |
1999000 | 4 | |
1667000 | 2 | |
1080000 | 2 | |
1041000 | 2 | |
694600 | 4 |
표준지여부
Boolean
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 232.0 B |
False | |
---|---|
True | 4 |
Value | Count | Frequency (%) |
False | 96 | |
True | 4 | 4.0% |
특수지구분코드
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
1 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 100 |
특수지구분명
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
일반 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 일반 |
---|---|
2nd row | 일반 |
3rd row | 일반 |
4th row | 일반 |
5th row | 일반 |
Common Values
Value | Count | Frequency (%) |
일반 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
일반 | 100 |
공시일자
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2021-05-31 00:00:00 |
---|---|
Maximum | 2021-05-31 00:00:00 |
데이터기준일자
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Minimum | 2021-08-03 00:00:00 |
---|---|
Maximum | 2021-08-03 00:00:00 |
기본키 | 지번 | 개별공시지가(원) | 표준지여부 | |
---|---|---|---|---|
기본키 | 1.000 | 1.000 | 0.694 | 0.378 |
지번 | 1.000 | 1.000 | 1.000 | 1.000 |
개별공시지가(원) | 0.694 | 1.000 | 1.000 | 0.119 |
표준지여부 | 0.378 | 1.000 | 0.119 | 1.000 |
기본키 | 개별공시지가(원) | 표준지여부 | |
---|---|---|---|
기본키 | 1.000 | -0.494 | 0.277 |
개별공시지가(원) | -0.494 | 1.000 | 0.168 |
표준지여부 | 0.277 | 0.168 | 1.000 |
기본키 | 기준년도 | 기준월 | 지점 | 법정동명 | 지번 | 개별공시지가(원) | 표준지여부 | 특수지구분코드 | 특수지구분명 | 공시일자 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 1 | 1999000 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
1 | 2 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 1 | 1999000 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
2 | 3 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 2 | 558100 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
3 | 4 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 2 | 558100 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
4 | 5 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 2-1 | 2126000 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
5 | 6 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 2-1 | 2126000 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
6 | 7 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 2-2 | 1999000 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
7 | 8 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 2-2 | 1999000 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
8 | 9 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 2-3 | 2105000 | Y | 1 | 일반 | 2021-05-31 | 2021-08-03 |
9 | 10 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 2-3 | 2105000 | Y | 1 | 일반 | 2021-05-31 | 2021-08-03 |
기본키 | 기준년도 | 기준월 | 지점 | 법정동명 | 지번 | 개별공시지가(원) | 표준지여부 | 특수지구분코드 | 특수지구분명 | 공시일자 | 데이터기준일자 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
90 | 91 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 76-4 | 230100 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
91 | 92 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 76-4 | 230100 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
92 | 93 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 82-2 | 230100 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
93 | 94 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 82-2 | 230100 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
94 | 95 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 112 | 596900 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
95 | 96 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 112 | 596900 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
96 | 97 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 120 | 596900 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
97 | 98 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 120 | 596900 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
98 | 99 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 123 | 2235000 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |
99 | 100 | 2021 | 1 | A-1000-0239S-10 | 서울 강동구 상일동 | 123 | 2235000 | N | 1 | 일반 | 2021-05-31 | 2021-08-03 |