Dataset statistics
Number of variables | 15 |
---|---|
Number of observations | 9693 |
Missing cells | 4312 |
Missing cells (%) | 3.0% |
Duplicate rows | 86 |
Duplicate rows (%) | 0.9% |
Total size in memory | 1.2 MiB |
Average record size in memory | 130.0 B |
Variable types
Categorical | 7 |
---|---|
Numeric | 6 |
Text | 2 |
Dataset
Description | 충청북도 증평군_지방세에 대한 자료입니다. 지방세에는 취득세, 재산세, 자동차세, 지방소득세, 등록면허세 등 다양한 자료가 있습니다. |
---|---|
URL | https://www.data.go.kr/data/15080373/fileData.do |
Dataset has 86 (0.9%) duplicate rows | Duplicates |
특수지 is highly overall correlated with 시도명 and 4 other fields | High correlation |
시군구명 is highly overall correlated with 법정리 and 11 other fields | High correlation |
법정동 is highly overall correlated with 법정리 and 5 other fields | High correlation |
과세년도 is highly overall correlated with 법정리 and 11 other fields | High correlation |
자치단체코드 is highly overall correlated with 법정리 and 11 other fields | High correlation |
시도명 is highly overall correlated with 법정리 and 11 other fields | High correlation |
기준일자 is highly overall correlated with 법정리 and 11 other fields | High correlation |
법정리 is highly overall correlated with 시도명 and 5 other fields | High correlation |
본번 is highly overall correlated with 시도명 and 4 other fields | High correlation |
부번 is highly overall correlated with 시도명 and 4 other fields | High correlation |
동 is highly overall correlated with 시도명 and 4 other fields | High correlation |
시가표준액 is highly overall correlated with 연면적 and 5 other fields | High correlation |
연면적 is highly overall correlated with 시가표준액 and 5 other fields | High correlation |
시도명 is highly imbalanced (69.0%) | Imbalance |
시군구명 is highly imbalanced (69.0%) | Imbalance |
자치단체코드 is highly imbalanced (69.0%) | Imbalance |
과세년도 is highly imbalanced (69.0%) | Imbalance |
특수지 is highly imbalanced (73.2%) | Imbalance |
기준일자 is highly imbalanced (69.0%) | Imbalance |
법정리 has 539 (5.6%) missing values | Missing |
본번 has 539 (5.6%) missing values | Missing |
부번 has 539 (5.6%) missing values | Missing |
동 has 539 (5.6%) missing values | Missing |
호 has 539 (5.6%) missing values | Missing |
물건지 has 539 (5.6%) missing values | Missing |
시가표준액 has 539 (5.6%) missing values | Missing |
연면적 has 539 (5.6%) missing values | Missing |
시가표준액 is highly skewed (γ1 = 26.54853073) | Skewed |
연면적 is highly skewed (γ1 = 20.48913313) | Skewed |
부번 has 4614 (47.6%) zeros | Zeros |
동 has 3042 (31.4%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 18:39:20.562382 |
---|---|
Analysis finished | 2023-12-12 18:39:32.661953 |
Duration | 12.1 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
시도명
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 75.9 KiB |
충청북도 | |
---|---|
<NA> | 539 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 충청북도 |
---|---|
2nd row | 충청북도 |
3rd row | 충청북도 |
4th row | 충청북도 |
5th row | 충청북도 |
Common Values
Value | Count | Frequency (%) |
충청북도 | 9154 | |
<NA> | 539 | 5.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
충청북도 | 9154 | |
na | 539 | 5.6% |
시군구명
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 75.9 KiB |
증평군 | |
---|---|
<NA> | 539 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0556071 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 증평군 |
---|---|
2nd row | 증평군 |
3rd row | 증평군 |
4th row | 증평군 |
5th row | 증평군 |
Common Values
Value | Count | Frequency (%) |
증평군 | 9154 | |
<NA> | 539 | 5.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
증평군 | 9154 | |
na | 539 | 5.6% |
자치단체코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 75.9 KiB |
43745 | |
---|---|
<NA> | 539 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.9443929 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 43745 |
---|---|
2nd row | 43745 |
3rd row | 43745 |
4th row | 43745 |
5th row | 43745 |
Common Values
Value | Count | Frequency (%) |
43745 | 9154 | |
<NA> | 539 | 5.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
43745 | 9154 | |
na | 539 | 5.6% |
과세년도
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 75.9 KiB |
2022 | |
---|---|
<NA> | 539 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2022 |
---|---|
2nd row | 2022 |
3rd row | 2022 |
4th row | 2022 |
5th row | 2022 |
Common Values
Value | Count | Frequency (%) |
2022 | 9154 | |
<NA> | 539 | 5.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2022 | 9154 | |
na | 539 | 5.6% |
법정동
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 75.9 KiB |
250 | |
---|---|
310 | |
<NA> | 539 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0556071 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 250 |
---|---|
2nd row | 250 |
3rd row | 250 |
4th row | 250 |
5th row | 250 |
Common Values
Value | Count | Frequency (%) |
250 | 7169 | |
310 | 1985 | 20.5% |
<NA> | 539 | 5.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
250 | 7169 | |
310 | 1985 | 20.5% |
na | 539 | 5.6% |
법정리
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 20 |
---|---|
Distinct (%) | 0.2% |
Missing | 539 |
Missing (%) | 5.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28.422984 |
Minimum | 21 |
---|---|
Maximum | 40 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 85.3 KiB |
Quantile statistics
Minimum | 21 |
---|---|
5-th percentile | 22 |
Q1 | 24 |
median | 28 |
Q3 | 32 |
95-th percentile | 37 |
Maximum | 40 |
Range | 19 |
Interquartile range (IQR) | 8 |
Descriptive statistics
Standard deviation | 4.9726248 |
---|---|
Coefficient of variation (CV) | 0.17495083 |
Kurtosis | -0.79413802 |
Mean | 28.422984 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 0.31726731 |
Sum | 260184 |
Variance | 24.726997 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
30 | 1060 | |
22 | 965 | |
31 | 804 | 8.3% |
27 | 669 | 6.9% |
26 | 666 | 6.9% |
23 | 648 | 6.7% |
25 | 611 | 6.3% |
33 | 481 | 5.0% |
32 | 474 | 4.9% |
21 | 448 | 4.6% |
Other values (10) | 2328 | |
(Missing) | 539 | 5.6% |
Value | Count | Frequency (%) |
21 | 448 | |
22 | 965 | |
23 | 648 | |
24 | 362 | 3.7% |
25 | 611 | |
26 | 666 | |
27 | 669 | |
28 | 351 | 3.6% |
29 | 154 | 1.6% |
30 | 1060 |
Value | Count | Frequency (%) |
40 | 145 | 1.5% |
39 | 108 | 1.1% |
38 | 158 | 1.6% |
37 | 250 | 2.6% |
36 | 168 | 1.7% |
35 | 439 | |
34 | 193 | 2.0% |
33 | 481 | |
32 | 474 | |
31 | 804 |
특수지
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 75.9 KiB |
1 | |
---|---|
<NA> | 539 |
2 | 152 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.1668214 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 9002 | |
<NA> | 539 | 5.6% |
2 | 152 | 1.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 9002 | |
na | 539 | 5.6% |
2 | 152 | 1.6% |
본번
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 936 |
---|---|
Distinct (%) | 10.2% |
Missing | 539 |
Missing (%) | 5.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 393.57352 |
Minimum | 1 |
---|---|
Maximum | 1630 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 85.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 20 |
Q1 | 108 |
median | 342 |
Q3 | 600 |
95-th percentile | 936.7 |
Maximum | 1630 |
Range | 1629 |
Interquartile range (IQR) | 492 |
Descriptive statistics
Standard deviation | 311.27786 |
---|---|
Coefficient of variation (CV) | 0.79090142 |
Kurtosis | -0.154116 |
Mean | 393.57352 |
Median Absolute Deviation (MAD) | 243 |
Skewness | 0.68408688 |
Sum | 3602772 |
Variance | 96893.905 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1071 | 103 | 1.1% |
602 | 97 | 1.0% |
11 | 90 | 0.9% |
24 | 83 | 0.9% |
61 | 79 | 0.8% |
582 | 67 | 0.7% |
77 | 62 | 0.6% |
84 | 55 | 0.6% |
673 | 55 | 0.6% |
532 | 54 | 0.6% |
Other values (926) | 8409 | |
(Missing) | 539 | 5.6% |
Value | Count | Frequency (%) |
1 | 18 | |
2 | 32 | |
3 | 27 | |
4 | 8 | 0.1% |
5 | 36 | |
6 | 15 | |
7 | 34 | |
8 | 20 | |
9 | 16 | |
10 | 15 |
Value | Count | Frequency (%) |
1630 | 16 | |
1629 | 1 | < 0.1% |
1515 | 1 | < 0.1% |
1482 | 2 | < 0.1% |
1473 | 1 | < 0.1% |
1420 | 12 | |
1362 | 1 | < 0.1% |
1359 | 1 | < 0.1% |
1357 | 2 | < 0.1% |
1353 | 3 | < 0.1% |
부번
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 67 |
---|---|
Distinct (%) | 0.7% |
Missing | 539 |
Missing (%) | 5.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.466135 |
Minimum | 0 |
---|---|
Maximum | 148 |
Zeros | 4614 |
Zeros (%) | 47.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 85.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 2 |
95-th percentile | 10 |
Maximum | 148 |
Range | 148 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 7.3591958 |
---|---|
Coefficient of variation (CV) | 2.9841009 |
Kurtosis | 119.80673 |
Mean | 2.466135 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 9.3902828 |
Sum | 22575 |
Variance | 54.157763 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 4614 | |
1 | 1633 | 16.8% |
2 | 777 | 8.0% |
3 | 470 | 4.8% |
4 | 360 | 3.7% |
5 | 270 | 2.8% |
6 | 167 | 1.7% |
8 | 159 | 1.6% |
7 | 148 | 1.5% |
10 | 71 | 0.7% |
Other values (57) | 485 | 5.0% |
(Missing) | 539 | 5.6% |
Value | Count | Frequency (%) |
0 | 4614 | |
1 | 1633 | 16.8% |
2 | 777 | 8.0% |
3 | 470 | 4.8% |
4 | 360 | 3.7% |
5 | 270 | 2.8% |
6 | 167 | 1.7% |
7 | 148 | 1.5% |
8 | 159 | 1.6% |
9 | 68 | 0.7% |
Value | Count | Frequency (%) |
148 | 1 | < 0.1% |
135 | 1 | < 0.1% |
134 | 1 | < 0.1% |
132 | 1 | < 0.1% |
116 | 1 | < 0.1% |
115 | 1 | < 0.1% |
113 | 1 | < 0.1% |
103 | 1 | < 0.1% |
99 | 1 | < 0.1% |
98 | 3 |
동
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 51 |
---|---|
Distinct (%) | 0.6% |
Missing | 539 |
Missing (%) | 5.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 832.26535 |
Minimum | 0 |
---|---|
Maximum | 9999 |
Zeros | 3042 |
Zeros (%) | 31.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 85.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 10 |
95-th percentile | 9001 |
Maximum | 9999 |
Range | 9999 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 2581.7825 |
---|---|
Coefficient of variation (CV) | 3.1021147 |
Kurtosis | 6.0343593 |
Mean | 832.26535 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 2.8293902 |
Sum | 7618557 |
Variance | 6665601.1 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3155 | |
0 | 3042 | |
10 | 1452 | |
9001 | 701 | 7.2% |
20 | 154 | 1.6% |
101 | 129 | 1.3% |
2 | 111 | 1.1% |
9002 | 56 | 0.6% |
3 | 53 | 0.5% |
30 | 29 | 0.3% |
Other values (41) | 272 | 2.8% |
(Missing) | 539 | 5.6% |
Value | Count | Frequency (%) |
0 | 3042 | |
1 | 3155 | |
2 | 111 | 1.1% |
3 | 53 | 0.5% |
4 | 25 | 0.3% |
5 | 17 | 0.2% |
6 | 9 | 0.1% |
7 | 9 | 0.1% |
8 | 5 | 0.1% |
9 | 8 | 0.1% |
Value | Count | Frequency (%) |
9999 | 7 | 0.1% |
9012 | 1 | < 0.1% |
9011 | 1 | < 0.1% |
9008 | 2 | < 0.1% |
9007 | 3 | < 0.1% |
9006 | 5 | 0.1% |
9005 | 4 | < 0.1% |
9004 | 9 | 0.1% |
9003 | 13 | 0.1% |
9002 | 56 |
호
Text
MISSING
 
Distinct | 223 |
---|---|
Distinct (%) | 2.4% |
Missing | 539 |
Missing (%) | 5.6% |
Memory size | 75.9 KiB |
Value | Count | Frequency (%) |
101 | 4071 | |
102 | 1272 | 13.9% |
201 | 906 | 9.9% |
103 | 539 | 5.9% |
301 | 306 | 3.3% |
104 | 273 | 3.0% |
0 | 260 | 2.8% |
8101 | 239 | 2.6% |
202 | 153 | 1.7% |
105 | 153 | 1.7% |
Other values (213) | 982 | 10.7% |
Most occurring characters
Value | Count | Frequency (%) |
1 | 12813 | |
0 | 8887 | |
2 | 2804 | 10.4% |
3 | 1051 | 3.9% |
4 | 499 | 1.8% |
8 | 377 | 1.4% |
5 | 269 | 1.0% |
6 | 139 | 0.5% |
7 | 91 | 0.3% |
9 | 67 | 0.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 26997 | |
Other Letter | 46 | 0.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 12813 | |
0 | 8887 | |
2 | 2804 | 10.4% |
3 | 1051 | 3.9% |
4 | 499 | 1.8% |
8 | 377 | 1.4% |
5 | 269 | 1.0% |
6 | 139 | 0.5% |
7 | 91 | 0.3% |
9 | 67 | 0.2% |
Other Letter
Value | Count | Frequency (%) |
부 | 46 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 26997 | |
Hangul | 46 | 0.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 12813 | |
0 | 8887 | |
2 | 2804 | 10.4% |
3 | 1051 | 3.9% |
4 | 499 | 1.8% |
8 | 377 | 1.4% |
5 | 269 | 1.0% |
6 | 139 | 0.5% |
7 | 91 | 0.3% |
9 | 67 | 0.2% |
Hangul
Value | Count | Frequency (%) |
부 | 46 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 26997 | |
Hangul | 46 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 12813 | |
0 | 8887 | |
2 | 2804 | 10.4% |
3 | 1051 | 3.9% |
4 | 499 | 1.8% |
8 | 377 | 1.4% |
5 | 269 | 1.0% |
6 | 139 | 0.5% |
7 | 91 | 0.3% |
9 | 67 | 0.2% |
Hangul
Value | Count | Frequency (%) |
부 | 46 |
물건지
Text
MISSING
 
Distinct | 7910 |
---|---|
Distinct (%) | 86.4% |
Missing | 539 |
Missing (%) | 5.6% |
Memory size | 75.9 KiB |
Length
Max length | 35 |
---|---|
Median length | 31 |
Mean length | 26.461438 |
Min length | 18 |
Characters and Unicode
Total characters | 242228 |
---|---|
Distinct characters | 167 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 7337 ? |
---|---|
Unique (%) | 80.2% |
Sample
1st row | 충청북도 증평군 증평읍 신동리 651 10동 202호 |
---|---|
2nd row | [ 광장로 110-1 ] 0001동 0101호 |
3rd row | [ 광장로 110-1 ] 0001동 0201호 |
4th row | [ 광장로 110-1 ] 0001동 0301호 |
5th row | [ 광장로 110-1 ] 0001동 8101호 |
Value | Count | Frequency (%) |
7504 | 12.7% | |
충청북도 | 5402 | 9.2% |
증평군 | 5402 | 9.2% |
증평읍 | 3888 | 6.6% |
101호 | 2349 | 4.0% |
1동 | 2201 | 3.7% |
0101호 | 1722 | 2.9% |
0000동 | 1632 | 2.8% |
도안면 | 1514 | 2.6% |
0001동 | 954 | 1.6% |
Other values (2433) | 26297 |
Most occurring characters
Value | Count | Frequency (%) |
49711 | ||
0 | 29291 | 12.1% |
1 | 24680 | 10.2% |
증 | 9783 | 4.0% |
평 | 9579 | 4.0% |
호 | 8978 | 3.7% |
동 | 8517 | 3.5% |
도 | 7148 | 3.0% |
2 | 7121 | 2.9% |
청 | 5511 | 2.3% |
Other values (157) | 81909 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 99780 | |
Decimal Number | 81731 | |
Space Separator | 49711 | |
Close Punctuation | 3752 | 1.5% |
Open Punctuation | 3752 | 1.5% |
Dash Punctuation | 3502 | 1.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
증 | 9783 | 9.8% |
평 | 9579 | 9.6% |
호 | 8978 | 9.0% |
동 | 8517 | 8.5% |
도 | 7148 | 7.2% |
청 | 5511 | 5.5% |
충 | 5509 | 5.5% |
리 | 5434 | 5.4% |
북 | 5434 | 5.4% |
군 | 5403 | 5.4% |
Other values (143) | 28484 |
Decimal Number
Value | Count | Frequency (%) |
0 | 29291 | |
1 | 24680 | |
2 | 7121 | 8.7% |
3 | 4180 | 5.1% |
4 | 3210 | 3.9% |
5 | 3042 | 3.7% |
9 | 2752 | 3.4% |
6 | 2607 | 3.2% |
7 | 2459 | 3.0% |
8 | 2389 | 2.9% |
Space Separator
Value | Count | Frequency (%) |
49711 |
Close Punctuation
Value | Count | Frequency (%) |
] | 3752 |
Open Punctuation
Value | Count | Frequency (%) |
[ | 3752 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3502 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 142448 | |
Hangul | 99780 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
증 | 9783 | 9.8% |
평 | 9579 | 9.6% |
호 | 8978 | 9.0% |
동 | 8517 | 8.5% |
도 | 7148 | 7.2% |
청 | 5511 | 5.5% |
충 | 5509 | 5.5% |
리 | 5434 | 5.4% |
북 | 5434 | 5.4% |
군 | 5403 | 5.4% |
Other values (143) | 28484 |
Common
Value | Count | Frequency (%) |
49711 | ||
0 | 29291 | |
1 | 24680 | |
2 | 7121 | 5.0% |
3 | 4180 | 2.9% |
] | 3752 | 2.6% |
[ | 3752 | 2.6% |
- | 3502 | 2.5% |
4 | 3210 | 2.3% |
5 | 3042 | 2.1% |
Other values (4) | 10207 | 7.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 142448 | |
Hangul | 99780 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
49711 | ||
0 | 29291 | |
1 | 24680 | |
2 | 7121 | 5.0% |
3 | 4180 | 2.9% |
] | 3752 | 2.6% |
[ | 3752 | 2.6% |
- | 3502 | 2.5% |
4 | 3210 | 2.3% |
5 | 3042 | 2.1% |
Other values (4) | 10207 | 7.2% |
Hangul
Value | Count | Frequency (%) |
증 | 9783 | 9.8% |
평 | 9579 | 9.6% |
호 | 8978 | 9.0% |
동 | 8517 | 8.5% |
도 | 7148 | 7.2% |
청 | 5511 | 5.5% |
충 | 5509 | 5.5% |
리 | 5434 | 5.4% |
북 | 5434 | 5.4% |
군 | 5403 | 5.4% |
Other values (143) | 28484 |
시가표준액
Real number (ℝ)
HIGH CORRELATION
  MISSING
  SKEWED
 
Distinct | 7327 |
---|---|
Distinct (%) | 80.0% |
Missing | 539 |
Missing (%) | 5.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 80851036 |
Minimum | 17280 |
---|---|
Maximum | 1.8388938 × 1010 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 85.3 KiB |
Quantile statistics
Minimum | 17280 |
---|---|
5-th percentile | 570000 |
Q1 | 3168000 |
median | 17174800 |
Q3 | 63241178 |
95-th percentile | 2.950748 × 108 |
Maximum | 1.8388938 × 1010 |
Range | 1.838892 × 1010 |
Interquartile range (IQR) | 60073178 |
Descriptive statistics
Standard deviation | 3.7027533 × 108 |
---|---|
Coefficient of variation (CV) | 4.5797228 |
Kurtosis | 1042.7788 |
Mean | 80851036 |
Median Absolute Deviation (MAD) | 16022800 |
Skewness | 26.548531 |
Sum | 7.4011039 × 1011 |
Variance | 1.3710382 × 1017 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5631240 | 34 | 0.4% |
936000 | 24 | 0.2% |
3968760 | 22 | 0.2% |
846000 | 20 | 0.2% |
756000 | 19 | 0.2% |
6795000 | 19 | 0.2% |
1026000 | 16 | 0.2% |
2628000 | 14 | 0.1% |
38734920 | 14 | 0.1% |
1509820 | 14 | 0.1% |
Other values (7317) | 8958 | |
(Missing) | 539 | 5.6% |
Value | Count | Frequency (%) |
17280 | 1 | |
23400 | 1 | |
34560 | 1 | |
45600 | 1 | |
46000 | 1 | |
47520 | 1 | |
51300 | 1 | |
51840 | 1 | |
52500 | 1 | |
55000 | 1 |
Value | Count | Frequency (%) |
18388937670 | 1 | |
15144452890 | 1 | |
7697112800 | 1 | |
7529236780 | 2 | |
6337757250 | 1 | |
5685796440 | 1 | |
5262517590 | 1 | |
4982998330 | 1 | |
4860211020 | 1 | |
4685683860 | 1 |
연면적
Real number (ℝ)
HIGH CORRELATION
  MISSING
  SKEWED
 
Distinct | 5049 |
---|---|
Distinct (%) | 55.2% |
Missing | 539 |
Missing (%) | 5.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 226.49302 |
Minimum | 0.54 |
---|---|
Maximum | 33501.435 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 85.3 KiB |
Quantile statistics
Minimum | 0.54 |
---|---|
5-th percentile | 12.872 |
Q1 | 40 |
median | 95.5 |
Q3 | 195 |
95-th percentile | 744.272 |
Maximum | 33501.435 |
Range | 33500.895 |
Interquartile range (IQR) | 155 |
Descriptive statistics
Standard deviation | 782.50292 |
---|---|
Coefficient of variation (CV) | 3.4548655 |
Kurtosis | 632.57823 |
Mean | 226.49302 |
Median Absolute Deviation (MAD) | 67.5 |
Skewness | 20.489133 |
Sum | 2073317.1 |
Variance | 612310.82 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
18.0 | 526 | 5.4% |
27.0 | 82 | 0.8% |
198.0 | 40 | 0.4% |
12.0 | 39 | 0.4% |
33.72 | 35 | 0.4% |
96.0 | 33 | 0.3% |
36.0 | 32 | 0.3% |
9.0 | 31 | 0.3% |
15.0 | 31 | 0.3% |
66.0 | 30 | 0.3% |
Other values (5039) | 8275 | |
(Missing) | 539 | 5.6% |
Value | Count | Frequency (%) |
0.54 | 1 | < 0.1% |
1.0 | 5 | |
1.2 | 3 | |
1.32 | 1 | < 0.1% |
1.4 | 1 | < 0.1% |
1.44 | 5 | |
1.56 | 1 | < 0.1% |
1.6 | 1 | < 0.1% |
1.8472 | 1 | < 0.1% |
1.95 | 1 | < 0.1% |
Value | Count | Frequency (%) |
33501.435 | 1 | |
27590.55 | 1 | |
16633.56 | 1 | |
15581.2 | 1 | |
15241.37 | 2 | |
13390.63 | 1 | |
13346.94 | 1 | |
12183.37 | 1 | |
11297.25 | 1 | |
10539.1 | 1 |
기준일자
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 75.9 KiB |
2022-06-01 | |
---|---|
<NA> | 539 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.6663572 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2022-06-01 |
---|---|
2nd row | 2022-06-01 |
3rd row | 2022-06-01 |
4th row | 2022-06-01 |
5th row | 2022-06-01 |
Common Values
Value | Count | Frequency (%) |
2022-06-01 | 9154 | |
<NA> | 539 | 5.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2022-06-01 | 9154 | |
na | 539 | 5.6% |
법정동 | 법정리 | 특수지 | 본번 | 부번 | 동 | 시가표준액 | 연면적 | |
---|---|---|---|---|---|---|---|---|
법정동 | 1.000 | 0.693 | 0.182 | 0.358 | 0.084 | 0.055 | 0.036 | 0.035 |
법정리 | 0.693 | 1.000 | 0.340 | 0.670 | 0.222 | 0.204 | 0.095 | 0.106 |
특수지 | 0.182 | 0.340 | 1.000 | 0.236 | 0.300 | 0.099 | 0.000 | 0.000 |
본번 | 0.358 | 0.670 | 0.236 | 1.000 | 0.144 | 0.206 | 0.159 | 0.172 |
부번 | 0.084 | 0.222 | 0.300 | 0.144 | 1.000 | 0.065 | 0.000 | 0.000 |
동 | 0.055 | 0.204 | 0.099 | 0.206 | 0.065 | 1.000 | 0.000 | 0.000 |
시가표준액 | 0.036 | 0.095 | 0.000 | 0.159 | 0.000 | 0.000 | 1.000 | 0.988 |
연면적 | 0.035 | 0.106 | 0.000 | 0.172 | 0.000 | 0.000 | 0.988 | 1.000 |
특수지 | 시군구명 | 법정동 | 과세년도 | 자치단체코드 | 시도명 | 기준일자 | |
---|---|---|---|---|---|---|---|
특수지 | 1.000 | 1.000 | 0.117 | 1.000 | 1.000 | 1.000 | 1.000 |
시군구명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
법정동 | 0.117 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
과세년도 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
자치단체코드 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
시도명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
기준일자 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
법정리 | 본번 | 부번 | 동 | 시가표준액 | 연면적 | 시도명 | 시군구명 | 자치단체코드 | 과세년도 | 법정동 | 특수지 | 기준일자 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
법정리 | 1.000 | 0.260 | 0.108 | -0.070 | -0.039 | 0.003 | 1.000 | 1.000 | 1.000 | 1.000 | 0.540 | 0.261 | 1.000 |
본번 | 0.260 | 1.000 | -0.112 | -0.072 | 0.034 | 0.029 | 1.000 | 1.000 | 1.000 | 1.000 | 0.274 | 0.181 | 1.000 |
부번 | 0.108 | -0.112 | 1.000 | -0.055 | -0.048 | -0.088 | 1.000 | 1.000 | 1.000 | 1.000 | 0.065 | 0.230 | 1.000 |
동 | -0.070 | -0.072 | -0.055 | 1.000 | -0.138 | -0.210 | 1.000 | 1.000 | 1.000 | 1.000 | 0.039 | 0.071 | 1.000 |
시가표준액 | -0.039 | 0.034 | -0.048 | -0.138 | 1.000 | 0.702 | 1.000 | 1.000 | 1.000 | 1.000 | 0.038 | 0.000 | 1.000 |
연면적 | 0.003 | 0.029 | -0.088 | -0.210 | 0.702 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.038 | 0.000 | 1.000 |
시도명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
시군구명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
자치단체코드 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
과세년도 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
법정동 | 0.540 | 0.274 | 0.065 | 0.039 | 0.038 | 0.038 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.117 | 1.000 |
특수지 | 0.261 | 0.181 | 0.230 | 0.071 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.117 | 1.000 | 1.000 |
기준일자 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
시도명 | 시군구명 | 자치단체코드 | 과세년도 | 법정동 | 법정리 | 특수지 | 본번 | 부번 | 동 | 호 | 물건지 | 시가표준액 | 연면적 | 기준일자 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 25 | 1 | 651 | 0 | 10 | 202 | 충청북도 증평군 증평읍 신동리 651 10동 202호 | 13892000 | 173.65 | 2022-06-01 |
1 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 23 | 1 | 83 | 0 | 1 | 101 | [ 광장로 110-1 ] 0001동 0101호 | 30802150 | 81.66 | 2022-06-01 |
2 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 23 | 1 | 83 | 0 | 1 | 201 | [ 광장로 110-1 ] 0001동 0201호 | 26784480 | 81.66 | 2022-06-01 |
3 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 23 | 1 | 83 | 0 | 1 | 301 | [ 광장로 110-1 ] 0001동 0301호 | 26784480 | 81.66 | 2022-06-01 |
4 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 23 | 1 | 83 | 0 | 1 | 8101 | [ 광장로 110-1 ] 0001동 8101호 | 25111680 | 95.7 | 2022-06-01 |
5 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 30 | 1 | 572 | 6 | 0 | 101 | [ 초중8길 25-2 ] 0000동 0101호 | 41778000 | 99.0 | 2022-06-01 |
6 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 33 | 1 | 854 | 2 | 1 | 105 | 충청북도 증평군 증평읍 미암리 854-2 1동 105호 | 3489600 | 43.62 | 2022-06-01 |
7 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 33 | 1 | 854 | 2 | 1 | 102 | 충청북도 증평군 증평읍 미암리 854-2 1동 102호 | 5529600 | 69.12 | 2022-06-01 |
8 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 33 | 1 | 854 | 2 | 1 | 103 | 충청북도 증평군 증평읍 미암리 854-2 1동 103호 | 8548800 | 106.86 | 2022-06-01 |
9 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 33 | 1 | 854 | 2 | 1 | 104 | 충청북도 증평군 증평읍 미암리 854-2 1동 104호 | 230400 | 2.88 | 2022-06-01 |
시도명 | 시군구명 | 자치단체코드 | 과세년도 | 법정동 | 법정리 | 특수지 | 본번 | 부번 | 동 | 호 | 물건지 | 시가표준액 | 연면적 | 기준일자 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
9683 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9684 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9685 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9686 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9687 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9688 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9689 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9690 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9691 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9692 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
시도명 | 시군구명 | 자치단체코드 | 과세년도 | 법정동 | 법정리 | 특수지 | 본번 | 부번 | 동 | 호 | 물건지 | 시가표준액 | 연면적 | 기준일자 | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
85 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 539 |
19 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 31 | 1 | 673 | 0 | 1 | 101 | 충청북도 증평군 증평읍 연탄리 673 1동 101호 | 6795000 | 90.6 | 2022-06-01 | 14 |
51 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 37 | 1 | 131 | 0 | 1 | 101 | 충청북도 증평군 증평읍 남차리 131 1동 101호 | 33264000 | 264.0 | 2022-06-01 | 7 |
52 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 37 | 1 | 131 | 1 | 1 | 101 | 충청북도 증평군 증평읍 남차리 131-1 1동 101호 | 33264000 | 264.0 | 2022-06-01 | 7 |
72 | 충청북도 | 증평군 | 43745 | 2022 | 310 | 26 | 1 | 222 | 0 | 1 | 101 | [ 석곡길 94-20 ] 0001동 0101호 | 18371220 | 592.62 | 2022-06-01 | 6 |
16 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 31 | 1 | 602 | 1 | 1 | 101 | 충청북도 증평군 증평읍 연탄리 602-1 1동 101호 | 6480000 | 86.4 | 2022-06-01 | 4 |
37 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 34 | 1 | 37 | 1 | 1 | 101 | 충청북도 증평군 증평읍 사곡리 37-1 1동 101호 | 9408000 | 224.0 | 2022-06-01 | 4 |
55 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 39 | 1 | 9 | 0 | 1 | 101 | 충청북도 증평군 증평읍 죽리 9 1동 101호 | 33473250 | 230.85 | 2022-06-01 | 4 |
1 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 28 | 1 | 1297 | 0 | 1 | 0 | 충청북도 증평군 증평읍 증천리 1297 1동 | 24038910 | 116.13 | 2022-06-01 | 3 |
2 | 충청북도 | 증평군 | 43745 | 2022 | 250 | 28 | 1 | 1297 | 0 | 1 | 0 | 충청북도 증평군 증평읍 증천리 1297 1동 | 27829080 | 134.44 | 2022-06-01 | 3 |