Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 10000 |
Missing cells | 1727 |
Missing cells (%) | 1.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 996.1 KiB |
Average record size in memory | 102.0 B |
Variable types
Text | 5 |
---|---|
Categorical | 2 |
Numeric | 4 |
Dataset
Description | 관리_전유_공용_면적_pk,호별명세_pk,평형_구분_명,전유_공용_구분_코드,주_부속_구분_코드,층_구분_코드,층_번호,구조_코드,주_용도_코드,기타_용도,면적 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15665/S/1/datasetView.do |
층_구분_코드 is highly overall correlated with 층_번호 and 1 other fields | High correlation |
층_번호 is highly overall correlated with 층_구분_코드 | High correlation |
전유_공용_구분_코드 is highly overall correlated with 층_구분_코드 | High correlation |
주_부속_구분_코드 is highly imbalanced (93.8%) | Imbalance |
층_구분_코드 has 320 (3.2%) missing values | Missing |
기타_용도 has 1336 (13.4%) missing values | Missing |
층_번호 is highly skewed (γ1 = 33.6697486) | Skewed |
면적 is highly skewed (γ1 = 35.78791434) | Skewed |
관리_전유_공용_면적_pk has unique values | Unique |
층_번호 has 5521 (55.2%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-10 23:26:40.228379 |
---|---|
Analysis finished | 2024-05-10 23:26:50.727843 |
Duration | 10.5 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
관리_전유_공용_면적_pk
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 13.9095 |
Min length | 7 |
Characters and Unicode
Total characters | 139095 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11000-100026311 |
---|---|
2nd row | 11000-100001480 |
3rd row | 11000-100008544 |
4th row | 11110-19912 |
5th row | 11000-1713 |
Value | Count | Frequency (%) |
11000-100026311 | 1 | < 0.1% |
11110-100016724 | 1 | < 0.1% |
11140-1000000000000000751645 | 1 | < 0.1% |
11110-2013 | 1 | < 0.1% |
11000-100011668 | 1 | < 0.1% |
11000-100006933 | 1 | < 0.1% |
11000-4317 | 1 | < 0.1% |
11110-100022324 | 1 | < 0.1% |
11110-9953 | 1 | < 0.1% |
11000-100002429 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 51056 | |
1 | 41824 | |
- | 10000 | 7.2% |
2 | 6866 | 4.9% |
6 | 4582 | 3.3% |
8 | 4232 | 3.0% |
9 | 4208 | 3.0% |
4 | 4167 | 3.0% |
7 | 4118 | 3.0% |
5 | 4085 | 2.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 129095 | |
Dash Punctuation | 10000 | 7.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 51056 | |
1 | 41824 | |
2 | 6866 | 5.3% |
6 | 4582 | 3.5% |
8 | 4232 | 3.3% |
9 | 4208 | 3.3% |
4 | 4167 | 3.2% |
7 | 4118 | 3.2% |
5 | 4085 | 3.2% |
3 | 3957 | 3.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 139095 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 51056 | |
1 | 41824 | |
- | 10000 | 7.2% |
2 | 6866 | 4.9% |
6 | 4582 | 3.3% |
8 | 4232 | 3.0% |
9 | 4208 | 3.0% |
4 | 4167 | 3.0% |
7 | 4118 | 3.0% |
5 | 4085 | 2.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 139095 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 51056 | |
1 | 41824 | |
- | 10000 | 7.2% |
2 | 6866 | 4.9% |
6 | 4582 | 3.3% |
8 | 4232 | 3.0% |
9 | 4208 | 3.0% |
4 | 4167 | 3.0% |
7 | 4118 | 3.0% |
5 | 4085 | 2.9% |
호별명세_pk
Text
Distinct | 1056 |
---|---|
Distinct (%) | 10.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 11.4847 |
Min length | 7 |
Characters and Unicode
Total characters | 114847 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 350 ? |
---|---|
Unique (%) | 3.5% |
Sample
1st row | 11000-139 |
---|---|
2nd row | 11000-106 |
3rd row | 11000-100004025 |
4th row | 11110-3477 |
5th row | 11000-18 |
Value | Count | Frequency (%) |
11000-100004025 | 400 | 4.0% |
11000-65 | 273 | 2.7% |
11000-131 | 267 | 2.7% |
11000-72 | 251 | 2.5% |
11110-100017332 | 194 | 1.9% |
11000-92 | 173 | 1.7% |
11000-56 | 162 | 1.6% |
11110-2502 | 151 | 1.5% |
11000-100004246 | 141 | 1.4% |
11000-33 | 136 | 1.4% |
Other values (1046) | 7852 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 38351 | |
0 | 38073 | |
- | 10000 | 8.7% |
2 | 4958 | 4.3% |
3 | 3991 | 3.5% |
4 | 3785 | 3.3% |
5 | 3630 | 3.2% |
7 | 3405 | 3.0% |
6 | 3008 | 2.6% |
9 | 3002 | 2.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 104847 | |
Dash Punctuation | 10000 | 8.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 38351 | |
0 | 38073 | |
2 | 4958 | 4.7% |
3 | 3991 | 3.8% |
4 | 3785 | 3.6% |
5 | 3630 | 3.5% |
7 | 3405 | 3.2% |
6 | 3008 | 2.9% |
9 | 3002 | 2.9% |
8 | 2644 | 2.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 114847 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 38351 | |
0 | 38073 | |
- | 10000 | 8.7% |
2 | 4958 | 4.3% |
3 | 3991 | 3.5% |
4 | 3785 | 3.3% |
5 | 3630 | 3.2% |
7 | 3405 | 3.0% |
6 | 3008 | 2.6% |
9 | 3002 | 2.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 114847 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 38351 | |
0 | 38073 | |
- | 10000 | 8.7% |
2 | 4958 | 4.3% |
3 | 3991 | 3.5% |
4 | 3785 | 3.3% |
5 | 3630 | 3.2% |
7 | 3405 | 3.0% |
6 | 3008 | 2.6% |
9 | 3002 | 2.6% |
평형_구분_명
Text
Distinct | 5174 |
---|---|
Distinct (%) | 51.8% |
Missing | 6 |
Missing (%) | 0.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
a | 100 | 1.0% |
b | 72 | 0.7% |
c | 49 | 0.5% |
201 | 48 | 0.5% |
d | 48 | 0.5% |
a동 | 44 | 0.4% |
101 | 38 | 0.4% |
301 | 38 | 0.4% |
203 | 36 | 0.4% |
402 | 34 | 0.3% |
Other values (4982) | 9654 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 5718 | |
0 | 3874 | 10.1% |
2 | 3789 | 9.9% |
3 | 2745 | 7.2% |
4 | 2278 | 5.9% |
. | 2049 | 5.3% |
5 | 2020 | 5.3% |
6 | 1955 | 5.1% |
7 | 1729 | 4.5% |
8 | 1569 | 4.1% |
Other values (157) | 10575 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 27030 | |
Uppercase Letter | 5483 | 14.3% |
Other Punctuation | 2075 | 5.4% |
Other Letter | 1678 | 4.4% |
Dash Punctuation | 947 | 2.5% |
Lowercase Letter | 798 | 2.1% |
Space Separator | 167 | 0.4% |
Close Punctuation | 58 | 0.2% |
Open Punctuation | 58 | 0.2% |
Math Symbol | 4 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
호 | 402 | |
동 | 136 | 8.1% |
층 | 120 | 7.2% |
시 | 67 | 4.0% |
설 | 57 | 3.4% |
타 | 53 | 3.2% |
업 | 44 | 2.6% |
워 | 43 | 2.6% |
형 | 43 | 2.6% |
무 | 42 | 2.5% |
Other values (87) | 671 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 1304 | |
A | 1281 | |
S | 623 | |
C | 460 | 8.4% |
D | 281 | 5.1% |
O | 234 | 4.3% |
E | 202 | 3.7% |
F | 170 | 3.1% |
P | 121 | 2.2% |
T | 103 | 1.9% |
Other values (16) | 704 |
Lowercase Letter
Value | Count | Frequency (%) |
b | 162 | |
s | 144 | |
a | 144 | |
f | 116 | |
c | 36 | 4.5% |
o | 33 | 4.1% |
p | 33 | 4.1% |
e | 31 | 3.9% |
y | 22 | 2.8% |
d | 20 | 2.5% |
Other values (14) | 57 | 7.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 5718 | |
0 | 3874 | |
2 | 3789 | |
3 | 2745 | |
4 | 2278 | 8.4% |
5 | 2020 | 7.5% |
6 | 1955 | 7.2% |
7 | 1729 | 6.4% |
8 | 1569 | 5.8% |
9 | 1353 | 5.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 2049 | |
* | 13 | 0.6% |
, | 12 | 0.6% |
/ | 1 | < 0.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 947 |
Space Separator
Value | Count | Frequency (%) |
167 |
Close Punctuation
Value | Count | Frequency (%) |
) | 58 |
Open Punctuation
Value | Count | Frequency (%) |
( | 58 |
Math Symbol
Value | Count | Frequency (%) |
~ | 4 |
Other Symbol
Value | Count | Frequency (%) |
㎡ | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 30342 | |
Latin | 6281 | 16.4% |
Hangul | 1678 | 4.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
호 | 402 | |
동 | 136 | 8.1% |
층 | 120 | 7.2% |
시 | 67 | 4.0% |
설 | 57 | 3.4% |
타 | 53 | 3.2% |
업 | 44 | 2.6% |
워 | 43 | 2.6% |
형 | 43 | 2.6% |
무 | 42 | 2.5% |
Other values (87) | 671 |
Latin
Value | Count | Frequency (%) |
B | 1304 | |
A | 1281 | |
S | 623 | |
C | 460 | 7.3% |
D | 281 | 4.5% |
O | 234 | 3.7% |
E | 202 | 3.2% |
F | 170 | 2.7% |
b | 162 | 2.6% |
s | 144 | 2.3% |
Other values (40) | 1420 |
Common
Value | Count | Frequency (%) |
1 | 5718 | |
0 | 3874 | |
2 | 3789 | |
3 | 2745 | |
4 | 2278 | 7.5% |
. | 2049 | 6.8% |
5 | 2020 | 6.7% |
6 | 1955 | 6.4% |
7 | 1729 | 5.7% |
8 | 1569 | 5.2% |
Other values (10) | 2616 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 36620 | |
Hangul | 1677 | 4.4% |
CJK Compat | 3 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 5718 | |
0 | 3874 | |
2 | 3789 | |
3 | 2745 | 7.5% |
4 | 2278 | 6.2% |
. | 2049 | 5.6% |
5 | 2020 | 5.5% |
6 | 1955 | 5.3% |
7 | 1729 | 4.7% |
8 | 1569 | 4.3% |
Other values (59) | 8894 |
Hangul
Value | Count | Frequency (%) |
호 | 402 | |
동 | 136 | 8.1% |
층 | 120 | 7.2% |
시 | 67 | 4.0% |
설 | 57 | 3.4% |
타 | 53 | 3.2% |
업 | 44 | 2.6% |
워 | 43 | 2.6% |
형 | 43 | 2.6% |
무 | 42 | 2.5% |
Other values (86) | 670 |
CJK Compat
Value | Count | Frequency (%) |
㎡ | 3 |
Compat Jamo
Value | Count | Frequency (%) |
ㄴ | 1 |
전유_공용_구분_코드
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2 | |
---|---|
1 | |
<NA> | 1 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0003 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 1 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 7598 | |
1 | 2401 | 24.0% |
<NA> | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 7598 | |
1 | 2401 | 24.0% |
na | 1 | < 0.1% |
주_부속_구분_코드
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 120 |
<NA> | 3 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0009 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9877 | |
1 | 120 | 1.2% |
<NA> | 3 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9877 | |
1 | 120 | 1.2% |
na | 3 | < 0.1% |
층_구분_코드
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 320 |
Missing (%) | 3.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 25.37531 |
Minimum | 0 |
---|---|
Maximum | 60 |
Zeros | 11 |
Zeros (%) | 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 10 |
Q1 | 20 |
median | 20 |
Q3 | 40 |
95-th percentile | 40 |
Maximum | 60 |
Range | 60 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 12.402426 |
---|---|
Coefficient of variation (CV) | 0.4887596 |
Kurtosis | -1.6152213 |
Mean | 25.37531 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 0.13527137 |
Sum | 245633 |
Variance | 153.82018 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
40 | 3804 | |
20 | 3382 | |
10 | 2403 | |
22 | 48 | 0.5% |
21 | 27 | 0.3% |
0 | 11 | 0.1% |
30 | 4 | < 0.1% |
60 | 1 | < 0.1% |
(Missing) | 320 | 3.2% |
Value | Count | Frequency (%) |
0 | 11 | 0.1% |
10 | 2403 | |
20 | 3382 | |
21 | 27 | 0.3% |
22 | 48 | 0.5% |
30 | 4 | < 0.1% |
40 | 3804 | |
60 | 1 | < 0.1% |
Value | Count | Frequency (%) |
60 | 1 | < 0.1% |
40 | 3804 | |
30 | 4 | < 0.1% |
22 | 48 | 0.5% |
21 | 27 | 0.3% |
20 | 3382 | |
10 | 2403 | |
0 | 11 | 0.1% |
층_번호
Real number (ℝ)
HIGH CORRELATION
  SKEWED
  ZEROS
 
Distinct | 53 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0788 |
Minimum | 0 |
---|---|
Maximum | 902 |
Zeros | 5521 |
Zeros (%) | 55.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 2 |
95-th percentile | 7 |
Maximum | 902 |
Range | 902 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 16.052344 |
---|---|
Coefficient of variation (CV) | 7.7219282 |
Kurtosis | 1403.8883 |
Mean | 2.0788 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 33.669749 |
Sum | 20788 |
Variance | 257.67776 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5521 | |
1 | 1641 | 16.4% |
2 | 744 | 7.4% |
3 | 601 | 6.0% |
4 | 500 | 5.0% |
5 | 296 | 3.0% |
6 | 190 | 1.9% |
7 | 129 | 1.3% |
10 | 107 | 1.1% |
8 | 69 | 0.7% |
Other values (43) | 202 | 2.0% |
Value | Count | Frequency (%) |
0 | 5521 | |
1 | 1641 | 16.4% |
2 | 744 | 7.4% |
3 | 601 | 6.0% |
4 | 500 | 5.0% |
5 | 296 | 3.0% |
6 | 190 | 1.9% |
7 | 129 | 1.3% |
8 | 69 | 0.7% |
9 | 45 | 0.4% |
Value | Count | Frequency (%) |
902 | 1 | < 0.1% |
501 | 2 | |
402 | 1 | < 0.1% |
401 | 4 | |
304 | 1 | < 0.1% |
303 | 1 | < 0.1% |
302 | 1 | < 0.1% |
201 | 1 | < 0.1% |
121 | 1 | < 0.1% |
102 | 1 | < 0.1% |
구조_코드
Real number (ℝ)
Distinct | 10 |
---|---|
Distinct (%) | 0.1% |
Missing | 5 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 27.342371 |
Minimum | 11 |
---|---|
Maximum | 42 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 11 |
---|---|
5-th percentile | 21 |
Q1 | 21 |
median | 21 |
Q3 | 42 |
95-th percentile | 42 |
Maximum | 42 |
Range | 31 |
Interquartile range (IQR) | 21 |
Descriptive statistics
Standard deviation | 9.5286787 |
---|---|
Coefficient of variation (CV) | 0.34849497 |
Kurtosis | -1.2288961 |
Mean | 27.342371 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.85557761 |
Sum | 273287 |
Variance | 90.795718 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
21 | 6624 | |
42 | 2793 | |
22 | 254 | 2.5% |
31 | 145 | 1.5% |
41 | 124 | 1.2% |
40 | 30 | 0.3% |
11 | 13 | 0.1% |
32 | 7 | 0.1% |
26 | 4 | < 0.1% |
39 | 1 | < 0.1% |
(Missing) | 5 | 0.1% |
Value | Count | Frequency (%) |
11 | 13 | 0.1% |
21 | 6624 | |
22 | 254 | 2.5% |
26 | 4 | < 0.1% |
31 | 145 | 1.5% |
32 | 7 | 0.1% |
39 | 1 | < 0.1% |
40 | 30 | 0.3% |
41 | 124 | 1.2% |
42 | 2793 |
Value | Count | Frequency (%) |
42 | 2793 | |
41 | 124 | 1.2% |
40 | 30 | 0.3% |
39 | 1 | < 0.1% |
32 | 7 | 0.1% |
31 | 145 | 1.5% |
26 | 4 | < 0.1% |
22 | 254 | 2.5% |
21 | 6624 | |
11 | 13 | 0.1% |
주_용도_코드
Text
Distinct | 119 |
---|---|
Distinct (%) | 1.2% |
Missing | 60 |
Missing (%) | 0.6% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
14202 | 1671 | |
02001 | 1205 | |
02003 | 1096 | |
07999 | 1036 | |
07201 | 897 | 9.0% |
04001 | 437 | 4.4% |
z6999 | 316 | 3.2% |
15101 | 306 | 3.1% |
14204 | 296 | 3.0% |
07001 | 295 | 3.0% |
Other values (109) | 2385 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 18986 | |
2 | 7843 | |
1 | 7118 | 14.3% |
9 | 5985 | 12.0% |
4 | 3881 | 7.8% |
7 | 2310 | 4.6% |
3 | 1940 | 3.9% |
5 | 862 | 1.7% |
6 | 368 | 0.7% |
Z | 322 | 0.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 49370 | |
Uppercase Letter | 322 | 0.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 18986 | |
2 | 7843 | |
1 | 7118 | 14.4% |
9 | 5985 | 12.1% |
4 | 3881 | 7.9% |
7 | 2310 | 4.7% |
3 | 1940 | 3.9% |
5 | 862 | 1.7% |
6 | 368 | 0.7% |
8 | 77 | 0.2% |
Uppercase Letter
Value | Count | Frequency (%) |
Z | 322 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 49370 | |
Latin | 322 | 0.6% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 18986 | |
2 | 7843 | |
1 | 7118 | 14.4% |
9 | 5985 | 12.1% |
4 | 3881 | 7.9% |
7 | 2310 | 4.7% |
3 | 1940 | 3.9% |
5 | 862 | 1.7% |
6 | 368 | 0.7% |
8 | 77 | 0.2% |
Latin
Value | Count | Frequency (%) |
Z | 322 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 49692 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 18986 | |
2 | 7843 | |
1 | 7118 | 14.3% |
9 | 5985 | 12.0% |
4 | 3881 | 7.8% |
7 | 2310 | 4.6% |
3 | 1940 | 3.9% |
5 | 862 | 1.7% |
6 | 368 | 0.7% |
Z | 322 | 0.6% |
기타_용도
Text
MISSING
 
Distinct | 938 |
---|---|
Distinct (%) | 10.8% |
Missing | 1336 |
Missing (%) | 13.4% |
Memory size | 156.2 KiB |
Length
Max length | 72 |
---|---|
Median length | 54 |
Mean length | 10.918744 |
Min length | 1 |
Characters and Unicode
Total characters | 94600 |
---|---|
Distinct characters | 277 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 342 ? |
---|---|
Unique (%) | 3.9% |
Sample
1st row | 주차장(지6-지1) |
---|---|
2nd row | 화장실,계단실 |
3rd row | 판매시설(상점) |
4th row | 방재센타,복도,엠디에프실(지1,2층) |
5th row | 주차장 |
Value | Count | Frequency (%) |
주차장 | 993 | 10.9% |
계단실 | 428 | 4.7% |
지하주차장 | 262 | 2.9% |
판매시설 | 245 | 2.7% |
기계실,전기실 | 211 | 2.3% |
복도 | 135 | 1.5% |
계단실,elev | 132 | 1.5% |
기계실,전기실,창고,재활용창고,용역원실,휴게실,오락실,주차관제실,체력단련실,검수실,방재센터,경비실,유아실,사무실 | 117 | 1.3% |
기계실 | 116 | 1.3% |
계단실,복도,로비,화장실,공조실 | 112 | 1.2% |
Other values (901) | 6351 |
Most occurring characters
Value | Count | Frequency (%) |
, | 12085 | 12.8% |
실 | 9762 | 10.3% |
기 | 4704 | 5.0% |
계 | 3569 | 3.8% |
장 | 2951 | 3.1% |
주 | 2432 | 2.6% |
단 | 2228 | 2.4% |
전 | 2188 | 2.3% |
지 | 2078 | 2.2% |
차 | 2038 | 2.2% |
Other values (267) | 50565 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 70870 | |
Other Punctuation | 12570 | 13.3% |
Decimal Number | 3351 | 3.5% |
Uppercase Letter | 2918 | 3.1% |
Close Punctuation | 1692 | 1.8% |
Open Punctuation | 1691 | 1.8% |
Math Symbol | 505 | 0.5% |
Dash Punctuation | 504 | 0.5% |
Space Separator | 492 | 0.5% |
Lowercase Letter | 7 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
실 | 9762 | 13.8% |
기 | 4704 | 6.6% |
계 | 3569 | 5.0% |
장 | 2951 | 4.2% |
주 | 2432 | 3.4% |
단 | 2228 | 3.1% |
전 | 2188 | 3.1% |
지 | 2078 | 2.9% |
차 | 2038 | 2.9% |
시 | 1612 | 2.3% |
Other values (228) | 37308 |
Uppercase Letter
Value | Count | Frequency (%) |
E | 792 | |
V | 420 | |
D | 407 | |
F | 406 | |
M | 400 | |
L | 383 | |
C | 29 | 1.0% |
O | 28 | 1.0% |
P | 19 | 0.7% |
I | 15 | 0.5% |
Other values (5) | 19 | 0.7% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1267 | |
2 | 684 | |
4 | 370 | 11.0% |
3 | 320 | 9.5% |
6 | 180 | 5.4% |
5 | 155 | 4.6% |
7 | 151 | 4.5% |
0 | 107 | 3.2% |
8 | 101 | 3.0% |
9 | 16 | 0.5% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 2 | |
l | 1 | |
v | 1 | |
f | 1 | |
d | 1 | |
m | 1 |
Other Punctuation
Value | Count | Frequency (%) |
, | 12085 | |
. | 290 | 2.3% |
/ | 195 | 1.6% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1692 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1691 |
Math Symbol
Value | Count | Frequency (%) |
~ | 505 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 504 |
Space Separator
Value | Count | Frequency (%) |
492 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 70870 | |
Common | 20805 | 22.0% |
Latin | 2925 | 3.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
실 | 9762 | 13.8% |
기 | 4704 | 6.6% |
계 | 3569 | 5.0% |
장 | 2951 | 4.2% |
주 | 2432 | 3.4% |
단 | 2228 | 3.1% |
전 | 2188 | 3.1% |
지 | 2078 | 2.9% |
차 | 2038 | 2.9% |
시 | 1612 | 2.3% |
Other values (228) | 37308 |
Latin
Value | Count | Frequency (%) |
E | 792 | |
V | 420 | |
D | 407 | |
F | 406 | |
M | 400 | |
L | 383 | |
C | 29 | 1.0% |
O | 28 | 1.0% |
P | 19 | 0.6% |
I | 15 | 0.5% |
Other values (11) | 26 | 0.9% |
Common
Value | Count | Frequency (%) |
, | 12085 | |
) | 1692 | 8.1% |
( | 1691 | 8.1% |
1 | 1267 | 6.1% |
2 | 684 | 3.3% |
~ | 505 | 2.4% |
- | 504 | 2.4% |
492 | 2.4% | |
4 | 370 | 1.8% |
3 | 320 | 1.5% |
Other values (8) | 1195 | 5.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 70870 | |
ASCII | 23730 | 25.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
, | 12085 | |
) | 1692 | 7.1% |
( | 1691 | 7.1% |
1 | 1267 | 5.3% |
E | 792 | 3.3% |
2 | 684 | 2.9% |
~ | 505 | 2.1% |
- | 504 | 2.1% |
492 | 2.1% | |
V | 420 | 1.8% |
Other values (29) | 3598 | 15.2% |
Hangul
Value | Count | Frequency (%) |
실 | 9762 | 13.8% |
기 | 4704 | 6.6% |
계 | 3569 | 5.0% |
장 | 2951 | 4.2% |
주 | 2432 | 3.4% |
단 | 2228 | 3.1% |
전 | 2188 | 3.1% |
지 | 2078 | 2.9% |
차 | 2038 | 2.9% |
시 | 1612 | 2.3% |
Other values (228) | 37308 |
면적
Real number (ℝ)
SKEWED
 
Distinct | 5608 |
---|---|
Distinct (%) | 56.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 81.635859 |
Minimum | 0 |
---|---|
Maximum | 31603.83 |
Zeros | 9 |
Zeros (%) | 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.37 |
Q1 | 3.4975 |
median | 15.195 |
Q3 | 41.41 |
95-th percentile | 201.4825 |
Maximum | 31603.83 |
Range | 31603.83 |
Interquartile range (IQR) | 37.9125 |
Descriptive statistics
Standard deviation | 596.49359 |
---|---|
Coefficient of variation (CV) | 7.3067595 |
Kurtosis | 1645.1698 |
Mean | 81.635859 |
Median Absolute Deviation (MAD) | 13.311 |
Skewness | 35.787914 |
Sum | 816358.59 |
Variance | 355804.61 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2.41 | 37 | 0.4% |
1.03 | 33 | 0.3% |
0.86 | 25 | 0.2% |
0.11 | 23 | 0.2% |
0.94 | 21 | 0.2% |
2.43 | 20 | 0.2% |
0.1 | 20 | 0.2% |
22.04 | 19 | 0.2% |
0.06 | 18 | 0.2% |
2.44 | 17 | 0.2% |
Other values (5598) | 9767 |
Value | Count | Frequency (%) |
0.0 | 9 | |
0.003 | 1 | < 0.1% |
0.008 | 1 | < 0.1% |
0.009 | 2 | < 0.1% |
0.01 | 3 | < 0.1% |
0.013 | 1 | < 0.1% |
0.017 | 1 | < 0.1% |
0.019 | 1 | < 0.1% |
0.02 | 9 | |
0.022 | 1 | < 0.1% |
Value | Count | Frequency (%) |
31603.83 | 1 | |
29264.13 | 1 | |
23805.23 | 1 | |
12823.71 | 1 | |
10358.53 | 1 | |
8664.59 | 1 | |
8037.58 | 1 | |
6832.05 | 1 | |
6241.222 | 1 | |
5982.8 | 1 |
전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 층_번호 | 구조_코드 | 면적 | |
---|---|---|---|---|---|---|
전유_공용_구분_코드 | 1.000 | 0.094 | 0.729 | 0.021 | 0.084 | 0.029 |
주_부속_구분_코드 | 0.094 | 1.000 | 0.073 | 0.000 | 0.041 | 0.000 |
층_구분_코드 | 0.729 | 0.073 | 1.000 | 0.000 | 0.145 | 0.000 |
층_번호 | 0.021 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
구조_코드 | 0.084 | 0.041 | 0.145 | 0.000 | 1.000 | 0.000 |
면적 | 0.029 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
주_부속_구분_코드 | 전유_공용_구분_코드 | |
---|---|---|
주_부속_구분_코드 | 1.000 | 0.060 |
전유_공용_구분_코드 | 0.060 | 1.000 |
층_구분_코드 | 층_번호 | 구조_코드 | 면적 | 전유_공용_구분_코드 | 주_부속_구분_코드 | |
---|---|---|---|---|---|---|
층_구분_코드 | 1.000 | -0.650 | 0.093 | 0.054 | 0.539 | 0.052 |
층_번호 | -0.650 | 1.000 | -0.066 | -0.134 | 0.022 | 0.000 |
구조_코드 | 0.093 | -0.066 | 1.000 | 0.035 | 0.103 | 0.051 |
면적 | 0.054 | -0.134 | 0.035 | 1.000 | 0.031 | 0.000 |
전유_공용_구분_코드 | 0.539 | 0.022 | 0.103 | 0.031 | 1.000 | 0.060 |
주_부속_구분_코드 | 0.052 | 0.000 | 0.051 | 0.000 | 0.060 | 1.000 |
관리_전유_공용_면적_pk | 호별명세_pk | 평형_구분_명 | 전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 층_번호 | 구조_코드 | 주_용도_코드 | 기타_용도 | 면적 | |
---|---|---|---|---|---|---|---|---|---|---|---|
15404 | 11000-100026311 | 11000-139 | C-2N | 2 | 0 | 40 | 0 | 42 | 02001 | 주차장(지6-지1) | 92.49 |
3945 | 11000-100001480 | 11000-106 | 2115 | 2 | 0 | 40 | 0 | 21 | 03001 | 화장실,계단실 | 21.64 |
7706 | 11000-100008544 | 11000-100004025 | 370.92 | 1 | 0 | 20 | 0 | 21 | 07201 | 판매시설(상점) | 99.64 |
83716 | 11110-19912 | 11110-3477 | SB-24 | 2 | 0 | 40 | 0 | 21 | 04001 | 방재센타,복도,엠디에프실(지1,2층) | 0.47 |
26417 | 11000-1713 | 11000-18 | 3C | 2 | 0 | 10 | 8 | 42 | 14202 | 주차장 | 43.61 |
53153 | 11110-100001656 | 11110-3863 | 3D | 2 | 0 | 20 | 1 | 21 | 14202 | 관리실 | 0.55 |
79323 | 11110-15830 | 11110-2296 | 1602 | 2 | 0 | 10 | 6 | 42 | 15101 | 기계,전기실 | 8.42 |
71629 | 11110-100028450 | 11110-100059672 | 19-4 | 2 | 0 | 20 | 10 | 21 | 14202 | 홀 복도(1,2층,6~20층) | 15.12 |
57161 | 11110-100008671 | 11110-100014318 | A | 2 | 0 | 40 | 0 | 21 | 02001 | 관리사무소,주민공동시설(지2,2층) | 0.89 |
68810 | 11110-100024701 | 11110-100030551 | 20A | 2 | 0 | 40 | 0 | 21 | 02003 | 주차장 | 6.07 |
관리_전유_공용_면적_pk | 호별명세_pk | 평형_구분_명 | 전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 층_번호 | 구조_코드 | 주_용도_코드 | 기타_용도 | 면적 | |
---|---|---|---|---|---|---|---|---|---|---|---|
31163 | 11000-21400 | 11000-80 | AA | 2 | 0 | 10 | 1 | 21 | 02001 | 주민공동시설 | 4.23 |
5259 | 11000-100006097 | 11000-131 | 79.07 | 1 | 0 | 20 | 0 | 22 | 18001 | 창고 | 24.5 |
86182 | 11110-22130 | 11110-4797 | 9 | 1 | 0 | <NA> | 0 | 21 | 02003 | <NA> | 29.97 |
63802 | 11110-100017430 | 11110-100023046 | 57.54(a) | 2 | 0 | 10 | 1 | 21 | 02003 | 도시형생황주택(단지형다세대주택)주차장 | 30.06 |
11180 | 11000-100012017 | 11000-100004025 | 71.44 | 2 | 0 | 40 | 0 | 21 | 07001 | 기계실,전기실,창고,재활용창고,용역원실,휴게실,오락실,주차관제실,체력단련실,검수실,방재센터,경비실,유아실,사무실 | 2.53 |
36244 | 11000-25975 | 11000-92 | 4.91 | 2 | 0 | 20 | 0 | 42 | 07999 | 내부통로,화장실,방풍실,공조실 | 3.66 |
17338 | 11000-100028749 | 11000-100004306 | 1-601 | 2 | 0 | 40 | 0 | 21 | 10004 | 지하주차장(지3-지1) | 2022.08 |
2526 | 11000-1000000000000000878279 | 11000-100004485 | 40층 | 1 | 0 | 20 | 0 | 42 | 14204 | 업무시설(사무소) | 1891.78 |
58951 | 11110-100010592 | 11110-100017332 | 1807 | 2 | 0 | 40 | 1 | 42 | 15101 | 로비,계단실,승강기,홀,린넨실,복도 | 36.92 |
35245 | 11000-25075 | 11000-91 | ss030 | 2 | 0 | 10 | 8 | 42 | 07201 | 기계실,전기실,발전기실 | 30.798 |