Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 10000 |
Missing cells | 24481 |
Missing cells (%) | 22.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 996.1 KiB |
Average record size in memory | 102.0 B |
Variable types
Text | 5 |
---|---|
Numeric | 5 |
Categorical | 1 |
Dataset
Description | 관리_층별_개요_PK,관리_동별_개요_PK,층_번호,건축_구분_코드,주_용도_코드,기타_용도,구조_코드,기타_구조,층_면적,층_구분_코드,층_일련번호 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15402/S/1/datasetView.do |
층_번호 is highly overall correlated with 층_일련번호 | High correlation |
건축_구분_코드 is highly overall correlated with 층_일련번호 | High correlation |
층_면적 is highly overall correlated with 층_일련번호 | High correlation |
층_일련번호 is highly overall correlated with 층_번호 and 2 other fields | High correlation |
건축_구분_코드 has 318 (3.2%) missing values | Missing |
기타_용도 has 5384 (53.8%) missing values | Missing |
기타_구조 has 8642 (86.4%) missing values | Missing |
층_일련번호 has 9978 (99.8%) missing values | Missing |
층_면적 is highly skewed (γ1 = 99.26012424) | Skewed |
관리_층별_개요_PK has unique values | Unique |
층_면적 has 377 (3.8%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-04 04:30:55.182759 |
---|---|
Analysis finished | 2024-05-04 04:31:07.750101 |
Duration | 12.57 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
관리_층별_개요_PK
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 11.7903 |
Min length | 7 |
Characters and Unicode
Total characters | 117903 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11230-9851 |
---|---|
2nd row | 11215-14567 |
3rd row | 11260-5592 |
4th row | 11140-14946 |
5th row | 11110-9898 |
Value | Count | Frequency (%) |
11230-9851 | 1 | < 0.1% |
11140-19134 | 1 | < 0.1% |
11140-15770 | 1 | < 0.1% |
11215-22697 | 1 | < 0.1% |
11170-9367 | 1 | < 0.1% |
11215-20355 | 1 | < 0.1% |
11260-100060149 | 1 | < 0.1% |
11140-100015574 | 1 | < 0.1% |
11215-29700 | 1 | < 0.1% |
11200-2384 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 37962 | |
0 | 22991 | |
- | 10000 | 8.5% |
2 | 9491 | 8.0% |
4 | 6377 | 5.4% |
5 | 6346 | 5.4% |
3 | 5704 | 4.8% |
7 | 5671 | 4.8% |
6 | 4849 | 4.1% |
8 | 4452 | 3.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 107903 | |
Dash Punctuation | 10000 | 8.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 37962 | |
0 | 22991 | |
2 | 9491 | 8.8% |
4 | 6377 | 5.9% |
5 | 6346 | 5.9% |
3 | 5704 | 5.3% |
7 | 5671 | 5.3% |
6 | 4849 | 4.5% |
8 | 4452 | 4.1% |
9 | 4060 | 3.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 117903 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 37962 | |
0 | 22991 | |
- | 10000 | 8.5% |
2 | 9491 | 8.0% |
4 | 6377 | 5.4% |
5 | 6346 | 5.4% |
3 | 5704 | 4.8% |
7 | 5671 | 4.8% |
6 | 4849 | 4.1% |
8 | 4452 | 3.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 117903 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 37962 | |
0 | 22991 | |
- | 10000 | 8.5% |
2 | 9491 | 8.0% |
4 | 6377 | 5.4% |
5 | 6346 | 5.4% |
3 | 5704 | 4.8% |
7 | 5671 | 4.8% |
6 | 4849 | 4.1% |
8 | 4452 | 3.8% |
관리_동별_개요_PK
Text
Distinct | 7358 |
---|---|
Distinct (%) | 73.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 10 |
Mean length | 11.2709 |
Min length | 7 |
Characters and Unicode
Total characters | 112709 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 5632 ? |
---|---|
Unique (%) | 56.3% |
Sample
1st row | 11230-3170 |
---|---|
2nd row | 11215-3087 |
3rd row | 11260-1511 |
4th row | 11140-2689 |
5th row | 11110-2102 |
Value | Count | Frequency (%) |
11000-100006267 | 22 | 0.2% |
11000-100006268 | 14 | 0.1% |
11000-173 | 13 | 0.1% |
11260-100118116 | 12 | 0.1% |
11680-100196685 | 12 | 0.1% |
11140-100007179 | 11 | 0.1% |
11000-106 | 11 | 0.1% |
11140-3139 | 10 | 0.1% |
11110-776 | 10 | 0.1% |
11170-2373 | 10 | 0.1% |
Other values (7348) | 9875 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 35730 | |
0 | 22849 | |
- | 10000 | 8.9% |
2 | 9336 | 8.3% |
3 | 6245 | 5.5% |
4 | 6104 | 5.4% |
5 | 6052 | 5.4% |
7 | 4933 | 4.4% |
6 | 4434 | 3.9% |
8 | 3581 | 3.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 102709 | |
Dash Punctuation | 10000 | 8.9% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 35730 | |
0 | 22849 | |
2 | 9336 | 9.1% |
3 | 6245 | 6.1% |
4 | 6104 | 5.9% |
5 | 6052 | 5.9% |
7 | 4933 | 4.8% |
6 | 4434 | 4.3% |
8 | 3581 | 3.5% |
9 | 3445 | 3.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 112709 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 35730 | |
0 | 22849 | |
- | 10000 | 8.9% |
2 | 9336 | 8.3% |
3 | 6245 | 5.5% |
4 | 6104 | 5.4% |
5 | 6052 | 5.4% |
7 | 4933 | 4.4% |
6 | 4434 | 3.9% |
8 | 3581 | 3.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 112709 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 35730 | |
0 | 22849 | |
- | 10000 | 8.9% |
2 | 9336 | 8.3% |
3 | 6245 | 5.5% |
4 | 6104 | 5.4% |
5 | 6052 | 5.4% |
7 | 4933 | 4.4% |
6 | 4434 | 3.9% |
8 | 3581 | 3.2% |
층_번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 50 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.2736 |
Minimum | 0 |
---|---|
Maximum | 58 |
Zeros | 20 |
Zeros (%) | 0.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 4 |
95-th percentile | 10 |
Maximum | 58 |
Range | 58 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 4.2712255 |
---|---|
Coefficient of variation (CV) | 1.3047488 |
Kurtosis | 32.365941 |
Mean | 3.2736 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 4.7896568 |
Sum | 32736 |
Variance | 18.243367 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3769 | |
2 | 1976 | |
3 | 1532 | |
4 | 1050 | 10.5% |
5 | 570 | 5.7% |
6 | 230 | 2.3% |
7 | 137 | 1.4% |
8 | 98 | 1.0% |
9 | 88 | 0.9% |
10 | 71 | 0.7% |
Other values (40) | 479 | 4.8% |
Value | Count | Frequency (%) |
0 | 20 | 0.2% |
1 | 3769 | |
2 | 1976 | |
3 | 1532 | |
4 | 1050 | 10.5% |
5 | 570 | 5.7% |
6 | 230 | 2.3% |
7 | 137 | 1.4% |
8 | 98 | 1.0% |
9 | 88 | 0.9% |
Value | Count | Frequency (%) |
58 | 1 | < 0.1% |
56 | 1 | < 0.1% |
55 | 1 | < 0.1% |
52 | 1 | < 0.1% |
51 | 1 | < 0.1% |
50 | 1 | < 0.1% |
46 | 1 | < 0.1% |
45 | 3 | |
44 | 1 | < 0.1% |
42 | 1 | < 0.1% |
건축_구분_코드
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 12 |
---|---|
Distinct (%) | 0.1% |
Missing | 318 |
Missing (%) | 3.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 317.30138 |
Minimum | 100 |
---|---|
Maximum | 3000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 100 |
---|---|
5-th percentile | 100 |
Q1 | 100 |
median | 100 |
Q3 | 200 |
95-th percentile | 900 |
Maximum | 3000 |
Range | 2900 |
Interquartile range (IQR) | 100 |
Descriptive statistics
Standard deviation | 448.72273 |
---|---|
Coefficient of variation (CV) | 1.4141846 |
Kurtosis | 7.2269288 |
Mean | 317.30138 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.7062211 |
Sum | 3072112 |
Variance | 201352.09 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
100 | 6139 | |
200 | 1296 | 13.0% |
700 | 1008 | 10.1% |
2000 | 472 | 4.7% |
600 | 432 | 4.3% |
900 | 290 | 2.9% |
300 | 17 | 0.2% |
800 | 11 | 0.1% |
500 | 9 | 0.1% |
400 | 4 | < 0.1% |
Other values (2) | 4 | < 0.1% |
(Missing) | 318 | 3.2% |
Value | Count | Frequency (%) |
100 | 6139 | |
200 | 1296 | 13.0% |
212 | 1 | < 0.1% |
300 | 17 | 0.2% |
400 | 4 | < 0.1% |
500 | 9 | 0.1% |
600 | 432 | 4.3% |
700 | 1008 | 10.1% |
800 | 11 | 0.1% |
900 | 290 | 2.9% |
Value | Count | Frequency (%) |
3000 | 3 | < 0.1% |
2000 | 472 | |
900 | 290 | 2.9% |
800 | 11 | 0.1% |
700 | 1008 | |
600 | 432 | |
500 | 9 | 0.1% |
400 | 4 | < 0.1% |
300 | 17 | 0.2% |
212 | 1 | < 0.1% |
주_용도_코드
Text
Distinct | 220 |
---|---|
Distinct (%) | 2.2% |
Missing | 93 |
Missing (%) | 0.9% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
02003 | 2069 | |
01003 | 1544 | |
01001 | 675 | 6.8% |
02001 | 508 | 5.1% |
03001 | 463 | 4.7% |
04402 | 424 | 4.3% |
14202 | 348 | 3.5% |
04001 | 344 | 3.5% |
14204 | 267 | 2.7% |
04499 | 255 | 2.6% |
Other values (210) | 3010 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 24337 | |
1 | 7253 | 14.6% |
2 | 5245 | 10.6% |
3 | 4794 | 9.7% |
4 | 3716 | 7.5% |
9 | 2861 | 5.8% |
5 | 525 | 1.1% |
7 | 272 | 0.5% |
8 | 217 | 0.4% |
6 | 196 | 0.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 49416 | |
Uppercase Letter | 119 | 0.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 24337 | |
1 | 7253 | 14.7% |
2 | 5245 | 10.6% |
3 | 4794 | 9.7% |
4 | 3716 | 7.5% |
9 | 2861 | 5.8% |
5 | 525 | 1.1% |
7 | 272 | 0.6% |
8 | 217 | 0.4% |
6 | 196 | 0.4% |
Uppercase Letter
Value | Count | Frequency (%) |
Z | 119 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 49416 | |
Latin | 119 | 0.2% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 24337 | |
1 | 7253 | 14.7% |
2 | 5245 | 10.6% |
3 | 4794 | 9.7% |
4 | 3716 | 7.5% |
9 | 2861 | 5.8% |
5 | 525 | 1.1% |
7 | 272 | 0.6% |
8 | 217 | 0.4% |
6 | 196 | 0.4% |
Latin
Value | Count | Frequency (%) |
Z | 119 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 49535 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 24337 | |
1 | 7253 | 14.6% |
2 | 5245 | 10.6% |
3 | 4794 | 9.7% |
4 | 3716 | 7.5% |
9 | 2861 | 5.8% |
5 | 525 | 1.1% |
7 | 272 | 0.5% |
8 | 217 | 0.4% |
6 | 196 | 0.4% |
기타_용도
Text
MISSING
 
Distinct | 1261 |
---|---|
Distinct (%) | 27.3% |
Missing | 5384 |
Missing (%) | 53.8% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
1가구 | 395 | 8.1% |
2세대 | 342 | 7.0% |
주차장 | 331 | 6.8% |
계단실 | 327 | 6.7% |
1세대 | 186 | 3.8% |
2가구 | 163 | 3.3% |
계단실(연면적제외 | 137 | 2.8% |
사무소 | 82 | 1.7% |
사무실 | 71 | 1.5% |
3세대 | 70 | 1.4% |
Other values (1191) | 2775 |
Most occurring characters
Value | Count | Frequency (%) |
실 | 1476 | 5.3% |
( | 1277 | 4.5% |
) | 1273 | 4.5% |
주 | 1062 | 3.8% |
대 | 1045 | 3.7% |
세 | 1022 | 3.6% |
계 | 992 | 3.5% |
단 | 906 | 3.2% |
1 | 861 | 3.1% |
구 | 823 | 2.9% |
Other values (332) | 17373 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 21382 | |
Decimal Number | 2328 | 8.3% |
Open Punctuation | 1282 | 4.6% |
Close Punctuation | 1278 | 4.5% |
Other Punctuation | 967 | 3.4% |
Uppercase Letter | 449 | 1.6% |
Space Separator | 263 | 0.9% |
Dash Punctuation | 96 | 0.3% |
Lowercase Letter | 26 | 0.1% |
Math Symbol | 20 | 0.1% |
Other values (3) | 19 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
실 | 1476 | 6.9% |
주 | 1062 | 5.0% |
대 | 1045 | 4.9% |
세 | 1022 | 4.8% |
계 | 992 | 4.6% |
단 | 906 | 4.2% |
구 | 823 | 3.8% |
가 | 804 | 3.8% |
장 | 645 | 3.0% |
시 | 623 | 2.9% |
Other values (271) | 11984 |
Uppercase Letter
Value | Count | Frequency (%) |
E | 178 | |
V | 91 | |
L | 87 | |
F | 14 | 3.1% |
M | 14 | 3.1% |
D | 13 | 2.9% |
T | 12 | 2.7% |
P | 9 | 2.0% |
I | 8 | 1.8% |
A | 7 | 1.6% |
Other values (7) | 16 | 3.6% |
Lowercase Letter
Value | Count | Frequency (%) |
m | 8 | |
a | 3 | 11.5% |
f | 2 | 7.7% |
r | 2 | 7.7% |
e | 2 | 7.7% |
p | 1 | 3.8% |
g | 1 | 3.8% |
k | 1 | 3.8% |
c | 1 | 3.8% |
t | 1 | 3.8% |
Other values (4) | 4 |
Decimal Number
Value | Count | Frequency (%) |
1 | 861 | |
2 | 809 | |
3 | 224 | 9.6% |
4 | 152 | 6.5% |
5 | 55 | 2.4% |
0 | 53 | 2.3% |
6 | 52 | 2.2% |
8 | 47 | 2.0% |
7 | 39 | 1.7% |
9 | 36 | 1.5% |
Other Punctuation
Value | Count | Frequency (%) |
, | 679 | |
. | 148 | 15.3% |
/ | 81 | 8.4% |
: | 53 | 5.5% |
? | 2 | 0.2% |
& | 2 | 0.2% |
; | 1 | 0.1% |
' | 1 | 0.1% |
Math Symbol
Value | Count | Frequency (%) |
~ | 10 | |
= | 7 | |
+ | 3 | 15.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1277 | |
[ | 5 | 0.4% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1273 | |
] | 5 | 0.4% |
Space Separator
Value | Count | Frequency (%) |
263 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 96 |
Other Symbol
Value | Count | Frequency (%) |
㎡ | 16 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 21370 | |
Common | 6253 | 22.2% |
Latin | 475 | 1.7% |
Han | 12 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
실 | 1476 | 6.9% |
주 | 1062 | 5.0% |
대 | 1045 | 4.9% |
세 | 1022 | 4.8% |
계 | 992 | 4.6% |
단 | 906 | 4.2% |
구 | 823 | 3.9% |
가 | 804 | 3.8% |
장 | 645 | 3.0% |
시 | 623 | 2.9% |
Other values (267) | 11972 |
Latin
Value | Count | Frequency (%) |
E | 178 | |
V | 91 | |
L | 87 | |
F | 14 | 2.9% |
M | 14 | 2.9% |
D | 13 | 2.7% |
T | 12 | 2.5% |
P | 9 | 1.9% |
I | 8 | 1.7% |
m | 8 | 1.7% |
Other values (21) | 41 | 8.6% |
Common
Value | Count | Frequency (%) |
( | 1277 | |
) | 1273 | |
1 | 861 | |
2 | 809 | |
, | 679 | |
263 | 4.2% | |
3 | 224 | 3.6% |
4 | 152 | 2.4% |
. | 148 | 2.4% |
- | 96 | 1.5% |
Other values (20) | 471 | 7.5% |
Han
Value | Count | Frequency (%) |
病 | 3 | |
合 | 3 | |
院 | 3 | |
綜 | 3 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 21370 | |
ASCII | 6712 | 23.9% |
CJK Compat | 16 | 0.1% |
CJK | 12 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
실 | 1476 | 6.9% |
주 | 1062 | 5.0% |
대 | 1045 | 4.9% |
세 | 1022 | 4.8% |
계 | 992 | 4.6% |
단 | 906 | 4.2% |
구 | 823 | 3.9% |
가 | 804 | 3.8% |
장 | 645 | 3.0% |
시 | 623 | 2.9% |
Other values (267) | 11972 |
ASCII
Value | Count | Frequency (%) |
( | 1277 | |
) | 1273 | |
1 | 861 | |
2 | 809 | |
, | 679 | |
263 | 3.9% | |
3 | 224 | 3.3% |
E | 178 | 2.7% |
4 | 152 | 2.3% |
. | 148 | 2.2% |
Other values (50) | 848 |
CJK Compat
Value | Count | Frequency (%) |
㎡ | 16 |
CJK
Value | Count | Frequency (%) |
病 | 3 | |
合 | 3 | |
院 | 3 | |
綜 | 3 |
구조_코드
Real number (ℝ)
Distinct | 22 |
---|---|
Distinct (%) | 0.2% |
Missing | 66 |
Missing (%) | 0.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 22.642641 |
Minimum | 10 |
---|---|
Maximum | 99 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 11 |
Q1 | 21 |
median | 21 |
Q3 | 21 |
95-th percentile | 42 |
Maximum | 99 |
Range | 89 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 8.8756717 |
---|---|
Coefficient of variation (CV) | 0.39198923 |
Kurtosis | 19.163278 |
Mean | 22.642641 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.2510356 |
Sum | 224932 |
Variance | 78.777547 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
21 | 7345 | |
11 | 1017 | 10.2% |
42 | 563 | 5.6% |
31 | 313 | 3.1% |
32 | 286 | 2.9% |
51 | 103 | 1.0% |
19 | 83 | 0.8% |
41 | 66 | 0.7% |
12 | 33 | 0.3% |
99 | 29 | 0.3% |
Other values (12) | 96 | 1.0% |
(Missing) | 66 | 0.7% |
Value | Count | Frequency (%) |
10 | 6 | 0.1% |
11 | 1017 | 10.2% |
12 | 33 | 0.3% |
19 | 83 | 0.8% |
20 | 4 | < 0.1% |
21 | 7345 | |
22 | 15 | 0.1% |
27 | 1 | < 0.1% |
29 | 7 | 0.1% |
30 | 1 | < 0.1% |
Value | Count | Frequency (%) |
99 | 29 | 0.3% |
74 | 28 | 0.3% |
63 | 2 | < 0.1% |
51 | 103 | 1.0% |
50 | 2 | < 0.1% |
43 | 1 | < 0.1% |
42 | 563 | |
41 | 66 | 0.7% |
39 | 28 | 0.3% |
33 | 1 | < 0.1% |
기타_구조
Text
MISSING
 
Distinct | 166 |
---|---|
Distinct (%) | 12.2% |
Missing | 8642 |
Missing (%) | 86.4% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
연와조 | 314 | |
철근콘크리트구조 | 247 | |
철근콘크리트조 | 187 | |
철골철근콘크리트구조 | 94 | 6.8% |
철골콘크리트구조 | 44 | 3.2% |
철근콘크리트 | 39 | 2.8% |
철골철근콘크리트조 | 32 | 2.3% |
컨테이너 | 30 | 2.2% |
목조 | 30 | 2.2% |
벽식구조 | 24 | 1.7% |
Other values (139) | 337 |
Most occurring characters
Value | Count | Frequency (%) |
조 | 1318 | |
철 | 964 | |
트 | 736 | |
리 | 725 | |
크 | 724 | |
콘 | 723 | |
근 | 678 | |
구 | 493 | 5.7% |
연 | 346 | 4.0% |
와 | 346 | 4.0% |
Other values (99) | 1609 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 8382 | |
Other Punctuation | 89 | 1.0% |
Close Punctuation | 65 | 0.8% |
Open Punctuation | 65 | 0.8% |
Decimal Number | 27 | 0.3% |
Space Separator | 20 | 0.2% |
Uppercase Letter | 8 | 0.1% |
Other Symbol | 4 | < 0.1% |
Math Symbol | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
조 | 1318 | |
철 | 964 | |
트 | 736 | |
리 | 725 | |
크 | 724 | |
콘 | 723 | |
근 | 678 | |
구 | 493 | 5.9% |
연 | 346 | 4.1% |
와 | 346 | 4.1% |
Other values (75) | 1329 |
Decimal Number
Value | Count | Frequency (%) |
0 | 6 | |
2 | 4 | |
7 | 4 | |
1 | 4 | |
9 | 3 | |
5 | 2 | 7.4% |
6 | 2 | 7.4% |
3 | 1 | 3.7% |
4 | 1 | 3.7% |
Other Punctuation
Value | Count | Frequency (%) |
, | 60 | |
. | 16 | 18.0% |
/ | 10 | 11.2% |
: | 2 | 2.2% |
? | 1 | 1.1% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 3 | |
S | 2 | |
R | 1 | 12.5% |
A | 1 | 12.5% |
L | 1 | 12.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 65 |
Open Punctuation
Value | Count | Frequency (%) |
( | 65 |
Space Separator
Value | Count | Frequency (%) |
20 |
Other Symbol
Value | Count | Frequency (%) |
㎡ | 4 |
Math Symbol
Value | Count | Frequency (%) |
+ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 8382 | |
Common | 272 | 3.1% |
Latin | 8 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
조 | 1318 | |
철 | 964 | |
트 | 736 | |
리 | 725 | |
크 | 724 | |
콘 | 723 | |
근 | 678 | |
구 | 493 | 5.9% |
연 | 346 | 4.1% |
와 | 346 | 4.1% |
Other values (75) | 1329 |
Common
Value | Count | Frequency (%) |
) | 65 | |
( | 65 | |
, | 60 | |
20 | 7.4% | |
. | 16 | 5.9% |
/ | 10 | 3.7% |
0 | 6 | 2.2% |
2 | 4 | 1.5% |
7 | 4 | 1.5% |
1 | 4 | 1.5% |
Other values (9) | 18 | 6.6% |
Latin
Value | Count | Frequency (%) |
C | 3 | |
S | 2 | |
R | 1 | 12.5% |
A | 1 | 12.5% |
L | 1 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 8382 | |
ASCII | 276 | 3.2% |
CJK Compat | 4 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
조 | 1318 | |
철 | 964 | |
트 | 736 | |
리 | 725 | |
크 | 724 | |
콘 | 723 | |
근 | 678 | |
구 | 493 | 5.9% |
연 | 346 | 4.1% |
와 | 346 | 4.1% |
Other values (75) | 1329 |
ASCII
Value | Count | Frequency (%) |
) | 65 | |
( | 65 | |
, | 60 | |
20 | 7.2% | |
. | 16 | 5.8% |
/ | 10 | 3.6% |
0 | 6 | 2.2% |
2 | 4 | 1.4% |
7 | 4 | 1.4% |
1 | 4 | 1.4% |
Other values (13) | 22 | 8.0% |
CJK Compat
Value | Count | Frequency (%) |
㎡ | 4 |
층_면적
Real number (ℝ)
HIGH CORRELATION
  SKEWED
  ZEROS
 
Distinct | 7210 |
---|---|
Distinct (%) | 72.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 394.58854 |
Minimum | -422.99 |
---|---|
Maximum | 1217235 |
Zeros | 377 |
Zeros (%) | 3.8% |
Negative | 36 |
Negative (%) | 0.4% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -422.99 |
---|---|
5-th percentile | 4.8 |
Q1 | 39.97 |
median | 92.495 |
Q3 | 178.825 |
95-th percentile | 1191.5925 |
Maximum | 1217235 |
Range | 1217658 |
Interquartile range (IQR) | 138.855 |
Descriptive statistics
Standard deviation | 12199.989 |
---|---|
Coefficient of variation (CV) | 30.918256 |
Kurtosis | 9900.7816 |
Mean | 394.58854 |
Median Absolute Deviation (MAD) | 63.765 |
Skewness | 99.260124 |
Sum | 3945885.4 |
Variance | 1.4883974 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 377 | 3.8% |
18.0 | 38 | 0.4% |
10.08 | 29 | 0.3% |
10.8 | 21 | 0.2% |
27.0 | 21 | 0.2% |
9.36 | 19 | 0.2% |
10.92 | 17 | 0.2% |
12.48 | 14 | 0.1% |
9.0 | 13 | 0.1% |
11.7 | 10 | 0.1% |
Other values (7200) | 9441 |
Value | Count | Frequency (%) |
-422.99 | 1 | |
-334.23 | 1 | |
-311.01 | 1 | |
-109.38 | 1 | |
-106.15 | 1 | |
-91.81 | 1 | |
-72.5 | 1 | |
-71.4 | 1 | |
-40.77 | 1 | |
-28.8 | 1 |
Value | Count | Frequency (%) |
1217235.0 | 1 | |
26292.115 | 1 | |
24515.08 | 1 | |
22324.05 | 1 | |
22075.03 | 1 | |
20547.58 | 1 | |
18837.23 | 1 | |
16141.21 | 1 | |
12200.78 | 1 | |
12059.35 | 1 |
층_구분_코드
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
20 | |
---|---|
10 | |
30 | 646 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20 |
---|---|
2nd row | 20 |
3rd row | 20 |
4th row | 20 |
5th row | 20 |
Common Values
Value | Count | Frequency (%) |
20 | 7962 | |
10 | 1392 | 13.9% |
30 | 646 | 6.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20 | 7962 | |
10 | 1392 | 13.9% |
30 | 646 | 6.5% |
층_일련번호
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 17 |
---|---|
Distinct (%) | 77.3% |
Missing | 9978 |
Missing (%) | 99.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 21 |
Minimum | 1 |
---|---|
Maximum | 63 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 4 |
Q1 | 6 |
median | 18.5 |
Q3 | 34.5 |
95-th percentile | 58.9 |
Maximum | 63 |
Range | 62 |
Interquartile range (IQR) | 28.5 |
Descriptive statistics
Standard deviation | 18.026436 |
---|---|
Coefficient of variation (CV) | 0.8584017 |
Kurtosis | 0.31455485 |
Mean | 21 |
Median Absolute Deviation (MAD) | 13 |
Skewness | 0.99242193 |
Sum | 462 |
Variance | 324.95238 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 2 | < 0.1% |
9 | 2 | < 0.1% |
5 | 2 | < 0.1% |
6 | 2 | < 0.1% |
36 | 2 | < 0.1% |
20 | 1 | < 0.1% |
30 | 1 | < 0.1% |
1 | 1 | < 0.1% |
18 | 1 | < 0.1% |
38 | 1 | < 0.1% |
Other values (7) | 7 | 0.1% |
(Missing) | 9978 |
Value | Count | Frequency (%) |
1 | 1 | |
4 | 2 | |
5 | 2 | |
6 | 2 | |
7 | 1 | |
9 | 2 | |
18 | 1 | |
19 | 1 | |
20 | 1 | |
24 | 1 |
Value | Count | Frequency (%) |
63 | 1 | |
60 | 1 | |
38 | 1 | |
37 | 1 | |
36 | 2 | |
30 | 1 | |
25 | 1 | |
24 | 1 | |
20 | 1 | |
19 | 1 |
층_번호 | 건축_구분_코드 | 구조_코드 | 층_면적 | 층_구분_코드 | 층_일련번호 | |
---|---|---|---|---|---|---|
층_번호 | 1.000 | 0.000 | 0.241 | 0.263 | 0.175 | 0.801 |
건축_구분_코드 | 0.000 | 1.000 | 0.276 | 0.013 | 0.076 | 0.537 |
구조_코드 | 0.241 | 0.276 | 1.000 | 0.000 | 0.154 | 0.404 |
층_면적 | 0.263 | 0.013 | 0.000 | 1.000 | 0.000 | NaN |
층_구분_코드 | 0.175 | 0.076 | 0.154 | 0.000 | 1.000 | 0.000 |
층_일련번호 | 0.801 | 0.537 | 0.404 | NaN | 0.000 | 1.000 |
층_번호 | 건축_구분_코드 | 구조_코드 | 층_면적 | 층_일련번호 | 층_구분_코드 | |
---|---|---|---|---|---|---|
층_번호 | 1.000 | -0.053 | 0.184 | 0.392 | 0.661 | 0.105 |
건축_구분_코드 | -0.053 | 1.000 | -0.145 | -0.124 | 0.659 | 0.056 |
구조_코드 | 0.184 | -0.145 | 1.000 | 0.243 | 0.061 | 0.098 |
층_면적 | 0.392 | -0.124 | 0.243 | 1.000 | 0.503 | 0.000 |
층_일련번호 | 0.661 | 0.659 | 0.061 | 0.503 | 1.000 | 0.000 |
층_구분_코드 | 0.105 | 0.056 | 0.098 | 0.000 | 0.000 | 1.000 |
관리_층별_개요_PK | 관리_동별_개요_PK | 층_번호 | 건축_구분_코드 | 주_용도_코드 | 기타_용도 | 구조_코드 | 기타_구조 | 층_면적 | 층_구분_코드 | 층_일련번호 | |
---|---|---|---|---|---|---|---|---|---|---|---|
85232 | 11230-9851 | 11230-3170 | 3 | 200 | 01003 | <NA> | 32 | <NA> | 40.0 | 20 | <NA> |
80912 | 11215-14567 | 11215-3087 | 5 | 100 | 02003 | <NA> | 21 | <NA> | 160.36 | 20 | <NA> |
98494 | 11260-5592 | 11260-1511 | 5 | 100 | 02003 | 2세대 | 21 | <NA> | 118.41 | 20 | <NA> |
15900 | 11140-14946 | 11140-2689 | 3 | 100 | 04499 | <NA> | 31 | <NA> | 270.51 | 20 | <NA> |
6538 | 11110-9898 | 11110-2102 | 4 | 100 | 02003 | 4세대 | 21 | <NA> | 115.2 | 20 | <NA> |
22126 | 11140-21022 | 11140-3837 | 5 | 600 | 01001 | <NA> | 21 | <NA> | 0.0 | 20 | <NA> |
19266 | 11140-4695 | 11140-907 | 1 | 100 | 03000 | 소매점(37.98),주차장(12.60) | 21 | <NA> | 50.58 | 20 | <NA> |
249 | 11000-100022002 | 11000-100007368 | 30 | 100 | 14204 | 공동주택 | 21 | 철골철근콘크리트,철골조 | 2661.33 | 20 | <NA> |
87627 | 11215-23775 | 11215-5025 | 3 | 100 | 01003 | 1가구 | 21 | <NA> | 113.43 | 20 | <NA> |
694 | 11110-5263 | 11110-914 | 2 | 2000 | 02003 | 2세대 | 21 | <NA> | 124.62 | 20 | <NA> |
관리_층별_개요_PK | 관리_동별_개요_PK | 층_번호 | 건축_구분_코드 | 주_용도_코드 | 기타_용도 | 구조_코드 | 기타_구조 | 층_면적 | 층_구분_코드 | 층_일련번호 | |
---|---|---|---|---|---|---|---|---|---|---|---|
30150 | 11140-100007227 | 11140-100005265 | 1 | 700 | 17100 | 인쇄소 | 51 | 목조 | 55.54 | 20 | <NA> |
78656 | 11215-15989 | 11215-3346 | 3 | 100 | 02003 | <NA> | 21 | <NA> | 144.92 | 20 | <NA> |
55910 | 11170-6146 | 11170-1626 | 3 | 100 | 01003 | <NA> | 21 | <NA> | 160.96 | 20 | <NA> |
99630 | 11260-14703 | 11260-3460 | 1 | 700 | 03001 | 업무시설 | 21 | <NA> | 0.0 | 20 | <NA> |
57611 | 11200-887 | 11200-251 | 1 | 900 | 01003 | 1가구 | 99 | 연와조 | 34.19 | 20 | <NA> |
15169 | 11140-13919 | 11140-2537 | 5 | 2000 | 14202 | <NA> | 21 | <NA> | 345.61 | 20 | <NA> |
28813 | 11140-16966 | 11140-3069 | 4 | 100 | 20001 | 기계실,전기실 | 21 | <NA> | 910.27 | 10 | <NA> |
14342 | 11110-100011347 | 11110-100006962 | 4 | 100 | 04001 | 한정식집 | 21 | <NA> | 215.94 | 20 | <NA> |
79558 | 11110-100055650 | 11110-100028413 | 1 | 600 | 04001 | 대중음식점 | 21 | 철근콘크리트조 | 123.35 | 10 | <NA> |
55707 | 11200-1988 | 11200-520 | 1 | 100 | 20001 | <NA> | 39 | <NA> | 15.0 | 20 | <NA> |