Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 10000 |
Missing cells | 1707 |
Missing cells (%) | 1.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.1 MiB |
Average record size in memory | 111.0 B |
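The missing-cell figures can be cross-checked against the per-variable "Missing" counts reported in the variable sections of this report. The following is a minimal sketch in plain Python; the counts are copied by hand from this report, the variable names in the code are illustrative, and variables with zero missing values are omitted.

```python
# Per-variable missing counts, copied from the variable sections of this report.
# Variables that report 0 missing values are left out.
missing_per_variable = {
    "평형_구분_명": 3,
    "층_구분_코드": 49,
    "구조_코드": 9,
    "주_용도_코드": 208,
    "기타_용도": 1438,
}

n_variables = 12
n_observations = 10_000

# Total missing cells and their share of all cells (variables x observations).
missing_cells = sum(missing_per_variable.values())
missing_pct = 100 * missing_cells / (n_variables * n_observations)

print(missing_cells)
print(f"{missing_pct:.1f}%")
```

Under these assumptions the sum and the percentage agree with the "Missing cells" rows above.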
Variable types
Text | 5 |
---|---|
Categorical | 2 |
Numeric | 5 |
Dataset
Description | 관리_전유_공용_면적_pk,호별명세_pk,평형_구분_명,전유_공용_구분_코드,주_부속_구분_코드,층_구분_코드,층_번호,구조_코드,주_용도_코드,기타_용도,면적 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15391/S/1/datasetView.do |
층_구분_코드 is highly overall correlated with 층_번호 and 1 other field | High correlation |
층_번호 is highly overall correlated with 층_구분_코드 | High correlation |
전유_공용_구분_코드 is highly overall correlated with 층_구분_코드 | High correlation |
주_부속_구분_코드 is highly imbalanced (91.6%) | Imbalance |
주_용도_코드 has 208 (2.1%) missing values | Missing |
기타_용도 has 1438 (14.4%) missing values | Missing |
층_번호 is highly skewed (γ1 = 23.37351032) | Skewed |
면적 is highly skewed (γ1 = 50.62897015) | Skewed |
관리_전유_공용_면적_pk has unique values | Unique |
층_번호 has 5348 (53.5%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-03 20:06:07.682373 |
---|---|
Analysis finished | 2024-05-03 20:06:23.235780 |
Duration | 15.55 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
관리_전유_공용_면적_pk
Text
UNIQUE
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 14.9564 |
Min length | 9 |
Characters and Unicode
Total characters | 149564 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 10000 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11500-100216589 |
---|---|
2nd row | 11545-100124144 |
3rd row | 11260-100087687 |
4th row | 11710-100209980 |
5th row | 11200-100048533 |
Value | Count | Frequency (%) |
11500-100216589 | 1 | < 0.1% |
11545-100123450 | 1 | < 0.1% |
11530-100084922 | 1 | < 0.1% |
11545-100118252 | 1 | < 0.1% |
11545-100185188 | 1 | < 0.1% |
11500-100219002 | 1 | < 0.1% |
11500-100221207 | 1 | < 0.1% |
11290-100057760 | 1 | < 0.1% |
11230-100070407 | 1 | < 0.1% |
11545-100162783 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 40111 | |
0 | 39557 | |
5 | 12113 | 8.1% |
- | 10000 | 6.7% |
2 | 9078 | 6.1% |
4 | 7765 | 5.2% |
6 | 7250 | 4.8% |
7 | 6029 | 4.0% |
8 | 5987 | 4.0% |
3 | 5891 | 3.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 139564 | |
Dash Punctuation | 10000 | 6.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 40111 | |
0 | 39557 | |
5 | 12113 | 8.7% |
2 | 9078 | 6.5% |
4 | 7765 | 5.6% |
6 | 7250 | 5.2% |
7 | 6029 | 4.3% |
8 | 5987 | 4.3% |
3 | 5891 | 4.2% |
9 | 5783 | 4.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 149564 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 40111 | |
0 | 39557 | |
5 | 12113 | 8.1% |
- | 10000 | 6.7% |
2 | 9078 | 6.1% |
4 | 7765 | 5.2% |
6 | 7250 | 4.8% |
7 | 6029 | 4.0% |
8 | 5987 | 4.0% |
3 | 5891 | 3.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 149564 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 40111 | |
0 | 39557 | |
5 | 12113 | 8.1% |
- | 10000 | 6.7% |
2 | 9078 | 6.1% |
4 | 7765 | 5.2% |
6 | 7250 | 4.8% |
7 | 6029 | 4.0% |
8 | 5987 | 4.0% |
3 | 5891 | 3.9% |
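The "Decimal Number" and "Dash Punctuation" labels above look like Unicode general categories (`Nd` and `Pd`). As a rough illustration, and assuming that mapping holds, the standard-library `unicodedata` module can classify the five sample values shown for this variable. How the profiling tool computes these counts internally is not shown in the report, so this is only a sketch.

```python
import unicodedata
from collections import Counter

# The five sample values listed for 관리_전유_공용_면적_pk.
samples = [
    "11500-100216589",
    "11545-100124144",
    "11260-100087687",
    "11710-100209980",
    "11200-100048533",
]

# Count characters by Unicode general category:
# "Nd" = Decimal Number, "Pd" = Dash Punctuation.
category_counts = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)

print(category_counts)
```

Each sample has 14 digits and one hyphen, which is consistent with the two categories reported for the full column.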
호별명세_pk
Text
Distinct | 1714 |
---|---|
Distinct (%) | 17.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 14.8228 |
Min length | 8 |
Characters and Unicode
Total characters | 148228 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 519 |
---|---|
Unique (%) | 5.2% |
Sample
1st row | 11500-100100041 |
---|---|
2nd row | 11545-100053518 |
3rd row | 11260-100038618 |
4th row | 11710-100067608 |
5th row | 11200-100052190 |
Value | Count | Frequency (%) |
11545-100053518 | 695 | 7.0% |
11545-100065917 | 264 | 2.6% |
11200-100035965 | 189 | 1.9% |
11545-100064797 | 184 | 1.8% |
11170-100061031 | 178 | 1.8% |
11530-100079506 | 174 | 1.7% |
11500-100080633 | 155 | 1.6% |
11560-100066989 | 117 | 1.2% |
11620-100079215 | 109 | 1.1% |
11545-100054977 | 104 | 1.0% |
Other values (1704) | 7831 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 41743 | |
1 | 37220 | |
5 | 12640 | 8.5% |
- | 10000 | 6.7% |
6 | 8354 | 5.6% |
4 | 6910 | 4.7% |
2 | 6838 | 4.6% |
7 | 6773 | 4.6% |
3 | 6381 | 4.3% |
8 | 5817 | 3.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 138228 | |
Dash Punctuation | 10000 | 6.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 41743 | |
1 | 37220 | |
5 | 12640 | 9.1% |
6 | 8354 | 6.0% |
4 | 6910 | 5.0% |
2 | 6838 | 4.9% |
7 | 6773 | 4.9% |
3 | 6381 | 4.6% |
8 | 5817 | 4.2% |
9 | 5552 | 4.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 148228 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 41743 | |
1 | 37220 | |
5 | 12640 | 8.5% |
- | 10000 | 6.7% |
6 | 8354 | 5.6% |
4 | 6910 | 4.7% |
2 | 6838 | 4.6% |
7 | 6773 | 4.6% |
3 | 6381 | 4.3% |
8 | 5817 | 3.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 148228 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 41743 | |
1 | 37220 | |
5 | 12640 | 8.5% |
- | 10000 | 6.7% |
6 | 8354 | 5.6% |
4 | 6910 | 4.7% |
2 | 6838 | 4.6% |
7 | 6773 | 4.6% |
3 | 6381 | 4.3% |
8 | 5817 | 3.9% |
평형_구분_명
Text
Distinct | 3998 |
---|---|
Distinct (%) | 40.0% |
Missing | 3 |
Missing (%) | < 0.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
b | 169 | 1.6% |
a | 159 | 1.5% |
c | 142 | 1.4% |
202 | 118 | 1.1% |
d | 116 | 1.1% |
302 | 116 | 1.1% |
201 | 114 | 1.1% |
301 | 107 | 1.0% |
e | 103 | 1.0% |
501 | 100 | 1.0% |
Other values (3801) | 9089 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 6268 | |
0 | 6021 | |
2 | 3980 | 10.2% |
3 | 2981 | 7.6% |
4 | 2333 | 6.0% |
5 | 1843 | 4.7% |
6 | 1632 | 4.2% |
7 | 1313 | 3.4% |
호 | 1184 | 3.0% |
. | 1169 | 3.0% |
Other values (154) | 10265 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 28619 | |
Other Letter | 3910 | 10.0% |
Uppercase Letter | 3018 | 7.7% |
Other Punctuation | 1183 | 3.0% |
Dash Punctuation | 936 | 2.4% |
Lowercase Letter | 599 | 1.5% |
Space Separator | 336 | 0.9% |
Open Punctuation | 167 | 0.4% |
Close Punctuation | 167 | 0.4% |
Math Symbol | 26 | 0.1% |
Other values (2) | 28 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
호 | 1184 | |
공 | 385 | 9.8% |
장 | 359 | 9.2% |
동 | 304 | 7.8% |
상 | 153 | 3.9% |
가 | 144 | 3.7% |
생 | 128 | 3.3% |
근 | 127 | 3.2% |
형 | 92 | 2.4% |
시 | 69 | 1.8% |
Other values (81) | 965 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 806 | |
A | 615 | |
C | 266 | 8.8% |
E | 209 | 6.9% |
F | 208 | 6.9% |
D | 161 | 5.3% |
T | 119 | 3.9% |
P | 98 | 3.2% |
O | 80 | 2.7% |
Y | 75 | 2.5% |
Other values (15) | 381 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 102 | |
e | 79 | |
p | 65 | |
y | 61 | |
b | 59 | |
t | 53 | |
d | 35 | 5.8% |
c | 30 | 5.0% |
f | 21 | 3.5% |
g | 13 | 2.2% |
Other values (14) | 81 |
Decimal Number
Value | Count | Frequency (%) |
1 | 6268 | |
0 | 6021 | |
2 | 3980 | |
3 | 2981 | |
4 | 2333 | 8.2% |
5 | 1843 | 6.4% |
6 | 1632 | 5.7% |
7 | 1313 | 4.6% |
8 | 1154 | 4.0% |
9 | 1094 | 3.8% |
Other Punctuation
Value | Count | Frequency (%) |
. | 1169 | |
, | 9 | 0.8% |
' | 4 | 0.3% |
/ | 1 | 0.1% |
Math Symbol
Value | Count | Frequency (%) |
~ | 19 | |
= | 6 | 23.1% |
+ | 1 | 3.8% |
Other Symbol
Value | Count | Frequency (%) |
ⓗ | 2 | |
㎡ | 2 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 936 |
Space Separator
Value | Count | Frequency (%) |
(space) | 336 |
Open Punctuation
Value | Count | Frequency (%) |
( | 167 |
Close Punctuation
Value | Count | Frequency (%) |
) | 167 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 24 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 31462 | |
Hangul | 3910 | 10.0% |
Latin | 3617 | 9.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
호 | 1184 | |
공 | 385 | 9.8% |
장 | 359 | 9.2% |
동 | 304 | 7.8% |
상 | 153 | 3.9% |
가 | 144 | 3.7% |
생 | 128 | 3.3% |
근 | 127 | 3.2% |
형 | 92 | 2.4% |
시 | 69 | 1.8% |
Other values (81) | 965 |
Latin
Value | Count | Frequency (%) |
B | 806 | |
A | 615 | |
C | 266 | 7.4% |
E | 209 | 5.8% |
F | 208 | 5.8% |
D | 161 | 4.5% |
T | 119 | 3.3% |
a | 102 | 2.8% |
P | 98 | 2.7% |
O | 80 | 2.2% |
Other values (39) | 953 |
Common
Value | Count | Frequency (%) |
1 | 6268 | |
0 | 6021 | |
2 | 3980 | |
3 | 2981 | |
4 | 2333 | 7.4% |
5 | 1843 | 5.9% |
6 | 1632 | 5.2% |
7 | 1313 | 4.2% |
. | 1169 | 3.7% |
8 | 1154 | 3.7% |
Other values (14) | 2768 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35075 | |
Hangul | 3879 | 9.9% |
Compat Jamo | 31 | 0.1% |
Enclosed Alphanum | 2 | < 0.1% |
CJK Compat | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 6268 | |
0 | 6021 | |
2 | 3980 | |
3 | 2981 | |
4 | 2333 | 6.7% |
5 | 1843 | 5.3% |
6 | 1632 | 4.7% |
7 | 1313 | 3.7% |
. | 1169 | 3.3% |
8 | 1154 | 3.3% |
Other values (61) | 6381 |
Hangul
Value | Count | Frequency (%) |
호 | 1184 | |
공 | 385 | 9.9% |
장 | 359 | 9.3% |
동 | 304 | 7.8% |
상 | 153 | 3.9% |
가 | 144 | 3.7% |
생 | 128 | 3.3% |
근 | 127 | 3.3% |
형 | 92 | 2.4% |
시 | 69 | 1.8% |
Other values (73) | 934 |
Compat Jamo
Value | Count | Frequency (%) |
ㄱ | 17 | |
ㄴ | 3 | 9.7% |
ㄷ | 3 | 9.7% |
ㅁ | 2 | 6.5% |
ㅊ | 2 | 6.5% |
ㄹ | 2 | 6.5% |
ㅅ | 1 | 3.2% |
ㅍ | 1 | 3.2% |
Enclosed Alphanum
Value | Count | Frequency (%) |
ⓗ | 2 |
CJK Compat
Value | Count | Frequency (%) |
㎡ | 2 |
전유_공용_구분_코드
Categorical
HIGH CORRELATION
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0003 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 1 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 1 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 6976 | 69.8% |
1 | 3023 | 30.2% |
<NA> | 1 | < 0.1% |
주_부속_구분_코드
Categorical
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9895 | |
1 | 105 | 1.1% |
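The 91.6% imbalance figure in the alert for 주_부속_구분_코드 can be reproduced from the two counts above if the score is taken to be one minus the normalised Shannon entropy of the class distribution. That definition is an assumption here (the report does not state the formula); the sketch below only shows that it matches the reported value.

```python
import math

# Class counts for 주_부속_구분_코드, copied from the Common Values table.
counts = {"0": 9895, "1": 105}
total = sum(counts.values())

# Shannon entropy in bits, then normalised by the maximum entropy
# for this number of classes (log2 of the class count).
entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())
imbalance = 1 - entropy / math.log2(len(counts))

print(f"{imbalance:.1%}")
```

A score near 100% means almost all rows fall into a single class, which is the case here: 9895 of 10000 rows carry the value 0.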
층_구분_코드
Real number (ℝ)
HIGH CORRELATION
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 49 |
Missing (%) | 0.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 26.648176 |
Minimum | 0 |
---|---|
Maximum | 40 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 10 |
Q1 | 20 |
median | 20 |
Q3 | 40 |
95-th percentile | 40 |
Maximum | 40 |
Range | 40 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 11.617943 |
---|---|
Coefficient of variation (CV) | 0.43597518 |
Kurtosis | -1.5856224 |
Mean | 26.648176 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 0.084141562 |
Sum | 265176 |
Variance | 134.97661 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20 | 4254 | 42.5% |
40 | 4071 | 40.7% |
10 | 1538 | 15.4% |
22 | 49 | 0.5% |
21 | 38 | 0.4% |
0 | 1 | < 0.1% |
(Missing) | 49 | 0.5% |
Value | Count | Frequency (%) |
0 | 1 | < 0.1% |
10 | 1538 | 15.4% |
20 | 4254 | |
21 | 38 | 0.4% |
22 | 49 | 0.5% |
40 | 4071 |
Value | Count | Frequency (%) |
40 | 4071 | |
22 | 49 | 0.5% |
21 | 38 | 0.4% |
20 | 4254 | |
10 | 1538 | 15.4% |
0 | 1 | < 0.1% |
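Because 층_구분_코드 takes only six distinct values, its descriptive statistics can be recomputed directly from the value counts above. The sketch below assumes the report uses the sample variance (denominator n - 1) over the non-missing rows; that choice is an assumption, made because it reproduces the reported Sum, Mean, Variance and Standard deviation.

```python
import math

# Value counts for 층_구분_코드 (missing rows excluded), copied from the table above.
value_counts = {20: 4254, 40: 4071, 10: 1538, 22: 49, 21: 38, 0: 1}

n = sum(value_counts.values())                        # non-missing observations
total = sum(v * c for v, c in value_counts.items())   # "Sum" in the report
mean = total / n

# Sample variance with an n - 1 denominator (assumed convention).
variance = sum(c * (v - mean) ** 2 for v, c in value_counts.items()) / (n - 1)
std = math.sqrt(variance)

print(n, total)
print(round(mean, 6), round(variance, 5), round(std, 6))
```

Note that the codes are category-like labels (10, 20, 21, 22, 40), so the mean and variance are arithmetic summaries of the codes rather than meaningful physical quantities.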
층_번호
Real number (ℝ)
HIGH CORRELATION
SKEWED
ZEROS
Distinct | 31 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.4351 |
Minimum | 0 |
---|---|
Maximum | 503 |
Zeros | 5348 |
Zeros (%) | 53.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 2 |
95-th percentile | 9 |
Maximum | 503 |
Range | 503 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 16.160702 |
---|---|
Coefficient of variation (CV) | 6.6365663 |
Kurtosis | 592.55881 |
Mean | 2.4351 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 23.37351 |
Sum | 24351 |
Variance | 261.1683 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5348 | 53.5% |
1 | 1543 | 15.4% |
2 | 723 | 7.2% |
3 | 693 | 6.9% |
4 | 414 | 4.1% |
5 | 331 | 3.3% |
6 | 232 | 2.3% |
7 | 120 | 1.2% |
8 | 91 | 0.9% |
9 | 80 | 0.8% |
Other values (21) | 425 | 4.2% |
Value | Count | Frequency (%) |
0 | 5348 | 53.5% |
1 | 1543 | 15.4% |
2 | 723 | 7.2% |
3 | 693 | 6.9% |
4 | 414 | 4.1% |
5 | 331 | 3.3% |
6 | 232 | 2.3% |
7 | 120 | 1.2% |
8 | 91 | 0.9% |
9 | 80 | 0.8% |
Value | Count | Frequency (%) |
503 | 1 | < 0.1% |
502 | 1 | < 0.1% |
501 | 1 | < 0.1% |
402 | 1 | < 0.1% |
401 | 5 | |
303 | 1 | < 0.1% |
302 | 4 | |
301 | 3 | |
201 | 2 | < 0.1% |
101 | 1 | < 0.1% |
구조_코드
Real number (ℝ)
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 9 |
Missing (%) | 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 22.235712 |
Minimum | 11 |
---|---|
Maximum | 43 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 11 |
---|---|
5-th percentile | 21 |
Q1 | 21 |
median | 21 |
Q3 | 21 |
95-th percentile | 41 |
Maximum | 43 |
Range | 32 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 4.8982019 |
---|---|
Coefficient of variation (CV) | 0.22028536 |
Kurtosis | 12.185301 |
Mean | 22.235712 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.7268482 |
Sum | 222157 |
Variance | 23.992382 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
21 | 9336 | |
42 | 307 | 3.1% |
43 | 146 | 1.5% |
41 | 98 | 1.0% |
31 | 67 | 0.7% |
22 | 23 | 0.2% |
11 | 8 | 0.1% |
40 | 6 | 0.1% |
(Missing) | 9 | 0.1% |
Value | Count | Frequency (%) |
11 | 8 | 0.1% |
21 | 9336 | |
22 | 23 | 0.2% |
31 | 67 | 0.7% |
40 | 6 | 0.1% |
41 | 98 | 1.0% |
42 | 307 | 3.1% |
43 | 146 | 1.5% |
Value | Count | Frequency (%) |
43 | 146 | 1.5% |
42 | 307 | 3.1% |
41 | 98 | 1.0% |
40 | 6 | 0.1% |
31 | 67 | 0.7% |
22 | 23 | 0.2% |
21 | 9336 | |
11 | 8 | 0.1% |
주_용도_코드
Text
MISSING
Distinct | 73 |
---|---|
Distinct (%) | 0.7% |
Missing | 208 |
Missing (%) | 2.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
02003 | 3167 | |
14202 | 2167 | |
17999 | 1816 | |
02001 | 488 | 5.0% |
03001 | 292 | 3.0% |
04402 | 242 | 2.5% |
04001 | 233 | 2.4% |
02007 | 193 | 2.0% |
02002 | 142 | 1.5% |
03999 | 119 | 1.2% |
Other values (63) | 933 | 9.5% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 18046 | |
2 | 9203 | |
9 | 6487 | 13.2% |
1 | 5515 | 11.3% |
3 | 3869 | 7.9% |
4 | 3446 | 7.0% |
7 | 2183 | 4.5% |
5 | 177 | 0.4% |
6 | 26 | 0.1% |
8 | 6 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 48958 | |
Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 18046 | |
2 | 9203 | |
9 | 6487 | 13.3% |
1 | 5515 | 11.3% |
3 | 3869 | 7.9% |
4 | 3446 | 7.0% |
7 | 2183 | 4.5% |
5 | 177 | 0.4% |
6 | 26 | 0.1% |
8 | 6 | < 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
Z | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 48958 | |
Latin | 2 | < 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 18046 | |
2 | 9203 | |
9 | 6487 | 13.3% |
1 | 5515 | 11.3% |
3 | 3869 | 7.9% |
4 | 3446 | 7.0% |
7 | 2183 | 4.5% |
5 | 177 | 0.4% |
6 | 26 | 0.1% |
8 | 6 | < 0.1% |
Latin
Value | Count | Frequency (%) |
Z | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 48960 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 18046 | |
2 | 9203 | |
9 | 6487 | 13.2% |
1 | 5515 | 11.3% |
3 | 3869 | 7.9% |
4 | 3446 | 7.0% |
7 | 2183 | 4.5% |
5 | 177 | 0.4% |
6 | 26 | 0.1% |
8 | 6 | < 0.1% |
기타_용도
Text
MISSING
Distinct | 989 |
---|---|
Distinct (%) | 11.6% |
Missing | 1438 |
Missing (%) | 14.4% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
계단실 | 1195 | 12.5% |
주차장 | 1071 | 11.2% |
기계실 | 382 | 4.0% |
외 | 354 | 3.7% |
공장(지식산업센터 | 219 | 2.3% |
벽체공용 | 199 | 2.1% |
복도,elv | 159 | 1.7% |
지하주차장 | 159 | 1.7% |
기계식주차장 | 151 | 1.6% |
도시형생활주택(단지형다세대 | 139 | 1.5% |
Other values (898) | 5556 |
Most occurring characters
Value | Count | Frequency (%) |
실 | 6345 | 9.4% |
, | 4602 | 6.8% |
계 | 3924 | 5.8% |
주 | 3019 | 4.5% |
기 | 2943 | 4.4% |
단 | 2906 | 4.3% |
장 | 2512 | 3.7% |
( | 1825 | 2.7% |
) | 1825 | 2.7% |
지 | 1758 | 2.6% |
Other values (252) | 35669 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 53820 | |
Other Punctuation | 5040 | 7.5% |
Uppercase Letter | 2165 | 3.2% |
Open Punctuation | 1833 | 2.7% |
Close Punctuation | 1833 | 2.7% |
Decimal Number | 1173 | 1.7% |
Space Separator | 1022 | 1.5% |
Math Symbol | 220 | 0.3% |
Dash Punctuation | 186 | 0.3% |
Lowercase Letter | 36 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
실 | 6345 | 11.8% |
계 | 3924 | 7.3% |
주 | 3019 | 5.6% |
기 | 2943 | 5.5% |
단 | 2906 | 5.4% |
장 | 2512 | 4.7% |
지 | 1758 | 3.3% |
차 | 1733 | 3.2% |
도 | 1718 | 3.2% |
시 | 1409 | 2.6% |
Other values (201) | 25553 |
Uppercase Letter
Value | Count | Frequency (%) |
E | 568 | |
V | 371 | |
L | 363 | |
D | 267 | |
F | 267 | |
M | 267 | |
A | 12 | 0.6% |
S | 11 | 0.5% |
C | 11 | 0.5% |
H | 10 | 0.5% |
Other values (10) | 18 | 0.8% |
Decimal Number
Value | Count | Frequency (%) |
1 | 448 | |
2 | 187 | |
3 | 183 | |
4 | 170 | 14.5% |
5 | 81 | 6.9% |
7 | 44 | 3.8% |
6 | 32 | 2.7% |
0 | 25 | 2.1% |
8 | 2 | 0.2% |
9 | 1 | 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
f | 8 | |
m | 8 | |
d | 8 | |
e | 6 | |
v | 3 | 8.3% |
l | 3 | 8.3% |
Other Punctuation
Value | Count | Frequency (%) |
, | 4602 | |
/ | 277 | 5.5% |
. | 145 | 2.9% |
; | 8 | 0.2% |
: | 8 | 0.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1825 | |
[ | 7 | 0.4% |
{ | 1 | 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1825 | |
] | 7 | 0.4% |
} | 1 | 0.1% |
Math Symbol
Value | Count | Frequency (%) |
~ | 215 | |
= | 5 | 2.3% |
Space Separator
Value | Count | Frequency (%) |
(space) | 1022 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 186 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 53820 | |
Common | 11307 | 16.8% |
Latin | 2201 | 3.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
실 | 6345 | 11.8% |
계 | 3924 | 7.3% |
주 | 3019 | 5.6% |
기 | 2943 | 5.5% |
단 | 2906 | 5.4% |
장 | 2512 | 4.7% |
지 | 1758 | 3.3% |
차 | 1733 | 3.2% |
도 | 1718 | 3.2% |
시 | 1409 | 2.6% |
Other values (201) | 25553 |
Latin
Value | Count | Frequency (%) |
E | 568 | |
V | 371 | |
L | 363 | |
D | 267 | |
F | 267 | |
M | 267 | |
A | 12 | 0.5% |
S | 11 | 0.5% |
C | 11 | 0.5% |
H | 10 | 0.5% |
Other values (16) | 54 | 2.5% |
Common
Value | Count | Frequency (%) |
, | 4602 | |
( | 1825 | 16.1% |
) | 1825 | 16.1% |
(space) | 1022 | 9.0% |
1 | 448 | 4.0% |
/ | 277 | 2.4% |
~ | 215 | 1.9% |
2 | 187 | 1.7% |
- | 186 | 1.6% |
3 | 183 | 1.6% |
Other values (15) | 537 | 4.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 53820 | |
ASCII | 13508 | 20.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
실 | 6345 | 11.8% |
계 | 3924 | 7.3% |
주 | 3019 | 5.6% |
기 | 2943 | 5.5% |
단 | 2906 | 5.4% |
장 | 2512 | 4.7% |
지 | 1758 | 3.3% |
차 | 1733 | 3.2% |
도 | 1718 | 3.2% |
시 | 1409 | 2.6% |
Other values (201) | 25553 |
ASCII
Value | Count | Frequency (%) |
, | 4602 | |
( | 1825 | 13.5% |
) | 1825 | 13.5% |
(space) | 1022 | 7.6% |
E | 568 | 4.2% |
1 | 448 | 3.3% |
V | 371 | 2.7% |
L | 363 | 2.7% |
/ | 277 | 2.1% |
D | 267 | 2.0% |
Other values (41) | 1940 |
면적
Real number (ℝ)
SKEWED
Distinct | 4420 |
---|---|
Distinct (%) | 44.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 31.632823 |
Minimum | 0 |
---|---|
Maximum | 17686.89 |
Zeros | 2 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.52 |
Q1 | 3.29 |
median | 11.59 |
Q3 | 28.8325 |
95-th percentile | 75.64 |
Maximum | 17686.89 |
Range | 17686.89 |
Interquartile range (IQR) | 25.5425 |
Descriptive statistics
Standard deviation | 270.19672 |
---|---|
Coefficient of variation (CV) | 8.541657 |
Kurtosis | 2979.1702 |
Mean | 31.632823 |
Median Absolute Deviation (MAD) | 9.97 |
Skewness | 50.62897 |
Sum | 316328.23 |
Variance | 73006.269 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20.6213 | 59 | 0.6% |
0.66 | 57 | 0.6% |
1.56 | 48 | 0.5% |
11.59 | 44 | 0.4% |
7.1233 | 41 | 0.4% |
2.4424 | 37 | 0.4% |
27.32 | 35 | 0.4% |
16.78 | 35 | 0.4% |
48.6 | 34 | 0.3% |
14.63 | 34 | 0.3% |
Other values (4410) | 9576 |
Value | Count | Frequency (%) |
0.0 | 2 | < 0.1% |
0.002 | 1 | < 0.1% |
0.01 | 1 | < 0.1% |
0.013 | 2 | < 0.1% |
0.02 | 7 | |
0.03 | 3 | |
0.04 | 6 | |
0.048 | 1 | < 0.1% |
0.049 | 1 | < 0.1% |
0.05 | 3 |
Value | Count | Frequency (%) |
17686.89 | 1 | |
15415.76 | 1 | |
8202.8 | 1 | |
5664.41 | 1 | |
2452.13 | 1 | |
1962.182 | 1 | |
1838.94 | 2 | |
1796.11 | 1 | |
1762.63 | 2 | |
1759.44 | 1 |
작업_일자
Real number (ℝ)
Distinct | 284 |
---|---|
Distinct (%) | 2.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20208653 |
Minimum | 20200101 |
---|---|
Maximum | 20240503 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20200101 |
---|---|
5-th percentile | 20200218 |
Q1 | 20200804 |
median | 20210930 |
Q3 | 20211124 |
95-th percentile | 20220405 |
Maximum | 20240503 |
Range | 40402 |
Interquartile range (IQR) | 10320 |
Descriptive statistics
Standard deviation | 7979.3605 |
---|---|
Coefficient of variation (CV) | 0.00039484871 |
Kurtosis | 0.53159523 |
Mean | 20208653 |
Median Absolute Deviation (MAD) | 9393 |
Skewness | 0.73495431 |
Sum | 2.0208653 × 10^11 |
Variance | 63670193 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20211029 | 1833 | 18.3% |
20201111 | 389 | 3.9% |
20200218 | 374 | 3.7% |
20200219 | 365 | 3.6% |
20220405 | 361 | 3.6% |
20211103 | 245 | 2.5% |
20220204 | 223 | 2.2% |
20201103 | 208 | 2.1% |
20211201 | 177 | 1.8% |
20200808 | 154 | 1.5% |
Other values (274) | 5671 |
Value | Count | Frequency (%) |
20200101 | 30 | |
20200107 | 13 | 0.1% |
20200108 | 45 | |
20200109 | 21 | |
20200110 | 18 | 0.2% |
20200114 | 9 | 0.1% |
20200115 | 9 | 0.1% |
20200116 | 10 | 0.1% |
20200117 | 11 | 0.1% |
20200121 | 1 | < 0.1% |
Value | Count | Frequency (%) |
20240503 | 74 | |
20240417 | 1 | < 0.1% |
20240330 | 3 | < 0.1% |
20240327 | 4 | < 0.1% |
20240227 | 1 | < 0.1% |
20240208 | 1 | < 0.1% |
20230929 | 2 | < 0.1% |
20230831 | 5 | 0.1% |
20230808 | 1 | < 0.1% |
20230719 | 3 | < 0.1% |
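The values of 작업_일자 look like calendar dates encoded as YYYYMMDD integers (the maximum, 20240503, matches the analysis date). If that reading is correct, statistics such as Range and IQR computed on the raw integers do not measure elapsed time. The sketch below, which rests on that assumption and uses a hypothetical helper name, contrasts the numeric range with the span in calendar days.

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Interpret an integer such as 20200101 as a calendar date (assumed encoding)."""
    return datetime.strptime(str(value), "%Y%m%d").date()


minimum, maximum = 20200101, 20240503  # Minimum and Maximum from the report

# Range as reported: plain integer subtraction.
numeric_range = maximum - minimum

# Span in days once the values are parsed as dates.
calendar_days = (parse_yyyymmdd(maximum) - parse_yyyymmdd(minimum)).days

print(numeric_range, calendar_days)
```

Parsing the column as a date type before profiling would let the quantile and range statistics be read as time spans.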
Variable | 전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 층_번호 | 구조_코드 | 주_용도_코드 | 면적 | 작업_일자 |
---|---|---|---|---|---|---|---|---|
전유_공용_구분_코드 | 1.000 | 0.103 | 0.815 | 0.030 | 0.125 | 0.222 | 0.048 | 0.057 |
주_부속_구분_코드 | 0.103 | 1.000 | 0.065 | 0.000 | 0.290 | 0.531 | 0.000 | 0.121 |
층_구분_코드 | 0.815 | 0.065 | 1.000 | 0.024 | 0.141 | 0.320 | 0.020 | 0.080 |
층_번호 | 0.030 | 0.000 | 0.024 | 1.000 | 0.000 | 0.439 | 0.000 | 0.000 |
구조_코드 | 0.125 | 0.290 | 0.141 | 0.000 | 1.000 | 0.508 | 0.098 | 0.171 |
주_용도_코드 | 0.222 | 0.531 | 0.320 | 0.439 | 0.508 | 1.000 | 0.829 | 0.391 |
면적 | 0.048 | 0.000 | 0.020 | 0.000 | 0.098 | 0.829 | 1.000 | 0.033 |
작업_일자 | 0.057 | 0.121 | 0.080 | 0.000 | 0.171 | 0.391 | 0.033 | 1.000 |
Variable | 전유_공용_구분_코드 | 주_부속_구분_코드 |
---|---|---|
전유_공용_구분_코드 | 1.000 | 0.066 |
주_부속_구분_코드 | 0.066 | 1.000 |
Variable | 층_구분_코드 | 층_번호 | 구조_코드 | 면적 | 작업_일자 | 전유_공용_구분_코드 | 주_부속_구분_코드 |
---|---|---|---|---|---|---|---|
층_구분_코드 | 1.000 | -0.629 | 0.098 | 0.028 | 0.099 | 0.608 | 0.042 |
층_번호 | -0.629 | 1.000 | -0.073 | 0.043 | -0.081 | 0.032 | 0.000 |
구조_코드 | 0.098 | -0.073 | 1.000 | 0.004 | 0.074 | 0.083 | 0.193 |
면적 | 0.028 | 0.043 | 0.004 | 1.000 | 0.044 | 0.035 | 0.000 |
작업_일자 | 0.099 | -0.081 | 0.074 | 0.044 | 1.000 | 0.041 | 0.087 |
전유_공용_구분_코드 | 0.608 | 0.032 | 0.083 | 0.035 | 0.041 | 1.000 | 0.066 |
주_부속_구분_코드 | 0.042 | 0.000 | 0.193 | 0.000 | 0.087 | 0.066 | 1.000 |
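The three "High correlation" alerts near the top of the report are consistent with the matrix directly above if an alert is raised whenever the absolute coefficient between two different variables reaches 0.5. That threshold is an assumption (the report does not state it); the sketch below simply filters the matrix with it.

```python
# Correlation matrix copied from the table directly above (same row/column order).
names = ["층_구분_코드", "층_번호", "구조_코드", "면적", "작업_일자",
         "전유_공용_구분_코드", "주_부속_구분_코드"]
matrix = [
    [1.000, -0.629, 0.098, 0.028, 0.099, 0.608, 0.042],
    [-0.629, 1.000, -0.073, 0.043, -0.081, 0.032, 0.000],
    [0.098, -0.073, 1.000, 0.004, 0.074, 0.083, 0.193],
    [0.028, 0.043, 0.004, 1.000, 0.044, 0.035, 0.000],
    [0.099, -0.081, 0.074, 0.044, 1.000, 0.041, 0.087],
    [0.608, 0.032, 0.083, 0.035, 0.041, 1.000, 0.066],
    [0.042, 0.000, 0.193, 0.000, 0.087, 0.066, 1.000],
]

THRESHOLD = 0.5  # assumed alert cut-off, not stated in the report

# Keep each unordered pair once (upper triangle) when |coefficient| >= threshold.
high_pairs = [
    (names[i], names[j], matrix[i][j])
    for i in range(len(names))
    for j in range(i + 1, len(names))
    if abs(matrix[i][j]) >= THRESHOLD
]

for left, right, coefficient in high_pairs:
    print(left, right, coefficient)
```

With this cut-off only the pairs involving 층_구분_코드 are selected, which matches the variables named in the alerts.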
Index | 관리_전유_공용_면적_pk | 호별명세_pk | 평형_구분_명 | 전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 층_번호 | 구조_코드 | 주_용도_코드 | 기타_용도 | 면적 | 작업_일자 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
46709 | 11500-100216589 | 11500-100100041 | 202 | 1 | 0 | 20 | 2 | 21 | 02003 | 도시형생활주택 | 26.64 | 20211204 |
74617 | 11545-100124144 | 11545-100053518 | 1429 | 2 | 0 | 40 | 0 | 21 | 17999 | 복도,ELV 외 | 16.78 | 20200218 |
40104 | 11260-100087687 | 11260-100038618 | 주7-17B | 2 | 0 | 10 | 0 | 21 | 02001 | 주차장 | 8.5754 | 20211029 |
45293 | 11710-100209980 | 11710-100067608 | 1804 | 1 | 0 | 20 | 18 | 21 | 14202 | <NA> | 15.98 | 20200215 |
84374 | 11200-100048533 | 11200-100052190 | 8I | 2 | 0 | 21 | 0 | 21 | 17999 | 주차장 | 59.6 | 20200808 |
30149 | 11620-100116815 | 11620-100078252 | 15.64형 | 2 | 0 | 20 | 1 | 21 | 14202 | 주차장 | 0.7752 | 20220304 |
40676 | 11710-100220301 | 11710-100083601 | 1201 | 2 | 0 | 20 | 3 | 21 | 14202 | 벽체면적 | 2.04 | 20210128 |
76628 | 11545-100123054 | 11545-100053518 | 1656 | 1 | 0 | 20 | 3 | 21 | 02007 | 기숙사 | 20.6213 | 20200218 |
5281 | 11530-100100973 | 11530-100079506 | 327 공장-27 | 2 | 0 | 40 | 0 | 21 | 17999 | 기계전기실,MDF등 | 5.418 | 20211201 |
30948 | 11680-100092616 | 11680-100119709 | 공공 17-3 | 2 | 0 | 20 | 1 | 21 | 02001 | 관리사무실(방재실) | 0.07 | 20220409 |
Index | 관리_전유_공용_면적_pk | 호별명세_pk | 평형_구분_명 | 전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 층_번호 | 구조_코드 | 주_용도_코드 | 기타_용도 | 면적 | 작업_일자 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
14364 | 11560-100086855 | 11560-100010630 | 오피스29 | 1 | 0 | 20 | 0 | 21 | 14204 | 업무시설(사무소) | 29.1974 | 20201103 |
54598 | 11380-100110287 | 11380-100080987 | 4lll | 2 | 0 | 10 | 1 | 21 | 02003 | 단지형(기계실,발전기실,통신실) | 0.48 | 20200319 |
65825 | 11620-100101214 | 11620-100057127 | 79.73 | 2 | 0 | 10 | 3 | 21 | 02001 | 기계실 | 1.19 | 20200221 |
24427 | 11470-9885 | 11470-2717 | 60 | 1 | 0 | 20 | 2 | 21 | 03003 | <NA> | 132.25 | 20200515 |
58908 | 11260-100075978 | 11260-100027732 | FB206 | 2 | 0 | 40 | 0 | 42 | 17999 | 주차장(지4~5층) | 55.78 | 20200610 |
12745 | 11680-100090880 | 11680-100169152 | 101 | 1 | 0 | 20 | 1 | 21 | 03999 | 근린생활시설 | 180.37 | 20220204 |
74962 | 11290-100055442 | 11290-100065542 | B1 | 2 | 0 | 40 | 0 | 21 | 02003 | 계단실,홀 | 7.11 | 20211029 |
15780 | 11545-100183707 | 11545-100065917 | 1120호 | 2 | 0 | 40 | 0 | 21 | 17999 | 복도,화장실 | 16.9567 | 20220405 |
43804 | 11620-100117846 | 11620-100079215 | 305 | 2 | 0 | 10 | 1 | 21 | 04402 | 기계전기통신 | 8.58 | 20220322 |
22071 | 11380-100110218 | 11380-100080987 | 6p | 1 | 0 | 20 | 5 | 21 | 14202 | <NA> | 21.78 | 20200319 |