Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 10000 |
Missing cells | 395 |
Missing cells (%) | 0.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.1 MiB |
Average record size in memory | 115.0 B |
Variable types
Text | 5 |
---|---|
Categorical | 5 |
Numeric | 3 |
Dataset
Description | 전유_공용_면적_PK,폐쇄말소대장_PK,전유_공용_구분_코드,주_부속_구분_코드,층_구분_코드,층_번호,층_번호_명,구조_코드,기타_구조,주_용도_코드,기타_용도,면적,작업_일자 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15398/S/1/datasetView.do |
기타_구조 is highly overall correlated with 구조_코드 | High correlation |
구조_코드 is highly overall correlated with 기타_구조 | High correlation |
전유_공용_구분_코드 is highly overall correlated with 층_구분_코드 | High correlation |
층_구분_코드 is highly overall correlated with 전유_공용_구분_코드 | High correlation |
구조_코드 is highly imbalanced (89.2%) | Imbalance |
기타_구조 is highly imbalanced (81.5%) | Imbalance |
층_번호_명 has 144 (1.4%) missing values | Missing |
주_용도_코드 has 123 (1.2%) missing values | Missing |
기타_용도 has 128 (1.3%) missing values | Missing |
전유_공용_면적_PK has unique values | Unique |
층_번호 has 4627 (46.3%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-18 05:42:24.734457 |
---|---|
Analysis finished | 2024-05-18 05:42:32.827670 |
Duration | 8.09 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
전유_공용_면적_PK
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 16.6652 |
Min length | 9 |
Characters and Unicode
Total characters | 166652 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11710-100286453 |
---|---|
2nd row | 11350-100783619 |
3rd row | 11260-100954144 |
4th row | 11740-102714129 |
5th row | 11710-100337118 |
Value | Count | Frequency (%) |
11710-100286453 | 1 | < 0.1% |
11710-100279341 | 1 | < 0.1% |
11260-100773526 | 1 | < 0.1% |
11710-100289211 | 1 | < 0.1% |
11545-100970995 | 1 | < 0.1% |
11710-100273581 | 1 | < 0.1% |
11530-105646997 | 1 | < 0.1% |
11710-100274473 | 1 | < 0.1% |
11680-105917068 | 1 | < 0.1% |
11710-100278421 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 48922 | |
1 | 39506 | |
7 | 11843 | 7.1% |
2 | 11624 | 7.0% |
- | 10000 | 6.0% |
5 | 8123 | 4.9% |
6 | 8044 | 4.8% |
9 | 7992 | 4.8% |
4 | 7686 | 4.6% |
8 | 6457 | 3.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 156652 | |
Dash Punctuation | 10000 | 6.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 48922 | |
1 | 39506 | |
7 | 11843 | 7.6% |
2 | 11624 | 7.4% |
5 | 8123 | 5.2% |
6 | 8044 | 5.1% |
9 | 7992 | 5.1% |
4 | 7686 | 4.9% |
8 | 6457 | 4.1% |
3 | 6455 | 4.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 166652 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 48922 | |
1 | 39506 | |
7 | 11843 | 7.1% |
2 | 11624 | 7.0% |
- | 10000 | 6.0% |
5 | 8123 | 4.9% |
6 | 8044 | 4.8% |
9 | 7992 | 4.8% |
4 | 7686 | 4.6% |
8 | 6457 | 3.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 166652 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 48922 | |
1 | 39506 | |
7 | 11843 | 7.1% |
2 | 11624 | 7.0% |
- | 10000 | 6.0% |
5 | 8123 | 4.9% |
6 | 8044 | 4.8% |
9 | 7992 | 4.8% |
4 | 7686 | 4.6% |
8 | 6457 | 3.9% |
폐쇄말소대장_PK
Text
Distinct | 7144 |
---|---|
Distinct (%) | 71.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 16.6272 |
Min length | 10 |
Characters and Unicode
Total characters | 166272 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 4905 ? |
---|---|
Unique (%) | 49.0% |
Sample
1st row | 11710-100160494 |
---|---|
2nd row | 11350-100216753 |
3rd row | 11260-100290878 |
4th row | 11740-100633340 |
5th row | 11710-100190839 |
Value | Count | Frequency (%) |
11260-100242930 | 6 | 0.1% |
11710-100191126 | 5 | < 0.1% |
11260-100290943 | 5 | < 0.1% |
11680-101077182 | 5 | < 0.1% |
11200-100120743 | 5 | < 0.1% |
11500-1000000000000003193950 | 5 | < 0.1% |
11260-100291032 | 5 | < 0.1% |
11440-1000000000000002635762 | 5 | < 0.1% |
11260-100194370 | 5 | < 0.1% |
11260-100290879 | 5 | < 0.1% |
Other values (7134) | 9949 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 50701 | |
1 | 44367 | |
2 | 10307 | 6.2% |
- | 10000 | 6.0% |
7 | 9548 | 5.7% |
6 | 9437 | 5.7% |
5 | 8324 | 5.0% |
9 | 6513 | 3.9% |
4 | 6395 | 3.8% |
3 | 5887 | 3.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 156272 | |
Dash Punctuation | 10000 | 6.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 50701 | |
1 | 44367 | |
2 | 10307 | 6.6% |
7 | 9548 | 6.1% |
6 | 9437 | 6.0% |
5 | 8324 | 5.3% |
9 | 6513 | 4.2% |
4 | 6395 | 4.1% |
3 | 5887 | 3.8% |
8 | 4793 | 3.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 166272 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 50701 | |
1 | 44367 | |
2 | 10307 | 6.2% |
- | 10000 | 6.0% |
7 | 9548 | 5.7% |
6 | 9437 | 5.7% |
5 | 8324 | 5.0% |
9 | 6513 | 3.9% |
4 | 6395 | 3.8% |
3 | 5887 | 3.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 166272 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 50701 | |
1 | 44367 | |
2 | 10307 | 6.2% |
- | 10000 | 6.0% |
7 | 9548 | 5.7% |
6 | 9437 | 5.7% |
5 | 8324 | 5.0% |
9 | 6513 | 3.9% |
4 | 6395 | 3.8% |
3 | 5887 | 3.5% |
전유_공용_구분_코드
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
공용 | |
---|---|
전유 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 공용 |
---|---|
2nd row | 공용 |
3rd row | 공용 |
4th row | 공용 |
5th row | 공용 |
Common Values
Value | Count | Frequency (%) |
공용 | 7780 | |
전유 | 2220 | 22.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
공용 | 7780 | |
전유 | 2220 | 22.2% |
주_부속_구분_코드
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
주건축물 | |
---|---|
부속건축물 |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 4.1478 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 주건축물 |
---|---|
2nd row | 부속건축물 |
3rd row | 부속건축물 |
4th row | 주건축물 |
5th row | 부속건축물 |
Common Values
Value | Count | Frequency (%) |
주건축물 | 8522 | |
부속건축물 | 1478 | 14.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
주건축물 | 8522 | |
부속건축물 | 1478 | 14.8% |
층_구분_코드
Categorical
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
지상 | |
---|---|
각층 | |
지하 | |
<NA> | 215 |
복수층(상층) | 46 |
Length
Max length | 7 |
---|---|
Median length | 2 |
Mean length | 2.066 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 각층 |
---|---|
2nd row | 지하 |
3rd row | 지상 |
4th row | <NA> |
5th row | 각층 |
Common Values
Value | Count | Frequency (%) |
지상 | 4090 | |
각층 | 3668 | |
지하 | 1963 | |
<NA> | 215 | 2.1% |
복수층(상층) | 46 | 0.5% |
옥탑 | 18 | 0.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
지상 | 4090 | |
각층 | 3668 | |
지하 | 1963 | |
na | 215 | 2.1% |
복수층(상층 | 46 | 0.5% |
옥탑 | 18 | 0.2% |
층_번호
Real number (ℝ)
ZEROS
 
Distinct | 29 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.3774 |
Minimum | 0 |
---|---|
Maximum | 32 |
Zeros | 4627 |
Zeros (%) | 46.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 3 |
95-th percentile | 10 |
Maximum | 32 |
Range | 32 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 3.8155504 |
---|---|
Coefficient of variation (CV) | 1.6049257 |
Kurtosis | 6.5695051 |
Mean | 2.3774 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 2.3451389 |
Sum | 23774 |
Variance | 14.558425 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 4627 | |
1 | 1785 | 17.8% |
2 | 772 | 7.7% |
3 | 436 | 4.4% |
4 | 416 | 4.2% |
6 | 349 | 3.5% |
7 | 319 | 3.2% |
5 | 262 | 2.6% |
8 | 259 | 2.6% |
9 | 243 | 2.4% |
Other values (19) | 532 | 5.3% |
Value | Count | Frequency (%) |
0 | 4627 | |
1 | 1785 | 17.8% |
2 | 772 | 7.7% |
3 | 436 | 4.4% |
4 | 416 | 4.2% |
5 | 262 | 2.6% |
6 | 349 | 3.5% |
7 | 319 | 3.2% |
8 | 259 | 2.6% |
9 | 243 | 2.4% |
Value | Count | Frequency (%) |
32 | 1 | < 0.1% |
28 | 1 | < 0.1% |
27 | 1 | < 0.1% |
25 | 5 | 0.1% |
24 | 8 | 0.1% |
23 | 9 | |
22 | 7 | 0.1% |
21 | 8 | 0.1% |
20 | 20 | |
19 | 21 |
층_번호_명
Text
MISSING
 
Distinct | 195 |
---|---|
Distinct (%) | 2.0% |
Missing | 144 |
Missing (%) | 1.4% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
각층 | 1533 | 15.5% |
지5-1층,3층-7층,10층,11층 | 862 | 8.7% |
지5층-지1층 | 808 | 8.2% |
1층 | 769 | 7.8% |
지1 | 444 | 4.5% |
지하1층 | 302 | 3.1% |
2층 | 235 | 2.4% |
지2 | 229 | 2.3% |
3 | 213 | 2.2% |
7 | 211 | 2.1% |
Other values (180) | 4278 |
Most occurring characters
Value | Count | Frequency (%) |
층 | 11488 | |
1 | 7819 | |
지 | 6160 | |
, | 2900 | 6.7% |
- | 2580 | 5.9% |
5 | 2091 | 4.8% |
3 | 1574 | 3.6% |
각 | 1549 | 3.6% |
2 | 1460 | 3.4% |
7 | 1225 | 2.8% |
Other values (25) | 4599 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 20014 | |
Decimal Number | 16921 | |
Other Punctuation | 2904 | 6.7% |
Dash Punctuation | 2580 | 5.9% |
Math Symbol | 932 | 2.1% |
Space Separator | 35 | 0.1% |
Open Punctuation | 22 | 0.1% |
Close Punctuation | 22 | 0.1% |
Uppercase Letter | 15 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
층 | 11488 | |
지 | 6160 | |
각 | 1549 | 7.7% |
하 | 550 | 2.7% |
상 | 159 | 0.8% |
옥 | 23 | 0.1% |
탑 | 23 | 0.1% |
수 | 20 | 0.1% |
복 | 20 | 0.1% |
동 | 12 | 0.1% |
Other values (6) | 10 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 7819 | |
5 | 2091 | 12.4% |
3 | 1574 | 9.3% |
2 | 1460 | 8.6% |
7 | 1225 | 7.2% |
0 | 990 | 5.9% |
4 | 826 | 4.9% |
6 | 373 | 2.2% |
8 | 294 | 1.7% |
9 | 269 | 1.6% |
Other Punctuation
Value | Count | Frequency (%) |
, | 2900 | |
. | 4 | 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 8 | |
A | 7 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2580 |
Math Symbol
Value | Count | Frequency (%) |
~ | 932 |
Space Separator
Value | Count | Frequency (%) |
35 |
Open Punctuation
Value | Count | Frequency (%) |
( | 22 |
Close Punctuation
Value | Count | Frequency (%) |
) | 22 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 23416 | |
Hangul | 20014 | |
Latin | 15 | < 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 7819 | |
, | 2900 | 12.4% |
- | 2580 | 11.0% |
5 | 2091 | 8.9% |
3 | 1574 | 6.7% |
2 | 1460 | 6.2% |
7 | 1225 | 5.2% |
0 | 990 | 4.2% |
~ | 932 | 4.0% |
4 | 826 | 3.5% |
Other values (7) | 1019 | 4.4% |
Hangul
Value | Count | Frequency (%) |
층 | 11488 | |
지 | 6160 | |
각 | 1549 | 7.7% |
하 | 550 | 2.7% |
상 | 159 | 0.8% |
옥 | 23 | 0.1% |
탑 | 23 | 0.1% |
수 | 20 | 0.1% |
복 | 20 | 0.1% |
동 | 12 | 0.1% |
Other values (6) | 10 | < 0.1% |
Latin
Value | Count | Frequency (%) |
B | 8 | |
A | 7 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 23431 | |
Hangul | 20014 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
층 | 11488 | |
지 | 6160 | |
각 | 1549 | 7.7% |
하 | 550 | 2.7% |
상 | 159 | 0.8% |
옥 | 23 | 0.1% |
탑 | 23 | 0.1% |
수 | 20 | 0.1% |
복 | 20 | 0.1% |
동 | 12 | 0.1% |
Other values (6) | 10 | < 0.1% |
ASCII
Value | Count | Frequency (%) |
1 | 7819 | |
, | 2900 | 12.4% |
- | 2580 | 11.0% |
5 | 2091 | 8.9% |
3 | 1574 | 6.7% |
2 | 1460 | 6.2% |
7 | 1225 | 5.2% |
0 | 990 | 4.2% |
~ | 932 | 4.0% |
4 | 826 | 3.5% |
Other values (9) | 1034 | 4.4% |
구조_코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 10 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
철근콘크리트구조 | |
---|---|
철골철근콘크리트구조 | 252 |
벽돌구조 | 124 |
철골콘크리트구조 | 34 |
프리케스트콘크리트구조 | 27 |
Other values (5) | 29 |
Length
Max length | 11 |
---|---|
Median length | 8 |
Mean length | 8.0019 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 철근콘크리트구조 |
---|---|
2nd row | 철근콘크리트구조 |
3rd row | 철근콘크리트구조 |
4th row | 철근콘크리트구조 |
5th row | 철근콘크리트구조 |
Common Values
Value | Count | Frequency (%) |
철근콘크리트구조 | 9534 | |
철골철근콘크리트구조 | 252 | 2.5% |
벽돌구조 | 124 | 1.2% |
철골콘크리트구조 | 34 | 0.3% |
프리케스트콘크리트구조 | 27 | 0.3% |
일반철골구조 | 13 | 0.1% |
기타조적구조 | 6 | 0.1% |
경량철골구조 | 4 | < 0.1% |
<NA> | 3 | < 0.1% |
블록구조 | 3 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
철근콘크리트구조 | 9534 | |
철골철근콘크리트구조 | 252 | 2.5% |
벽돌구조 | 124 | 1.2% |
철골콘크리트구조 | 34 | 0.3% |
프리케스트콘크리트구조 | 27 | 0.3% |
일반철골구조 | 13 | 0.1% |
기타조적구조 | 6 | 0.1% |
경량철골구조 | 4 | < 0.1% |
na | 3 | < 0.1% |
블록구조 | 3 | < 0.1% |
기타_구조
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 40 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
철근콘크리트구조 | |
---|---|
철근콘크리트조 | |
철골철근콘크리트구조 | 250 |
연와조 | 75 |
<NA> | 57 |
Other values (35) | 258 |
Length
Max length | 24 |
---|---|
Median length | 8 |
Mean length | 7.8949 |
Min length | 3 |
Unique
Unique | 8 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 철근콘크리트구조 |
---|---|
2nd row | 철근콘크리트구조 |
3rd row | 철근콘크리트구조 |
4th row | 철근콘크리트조 |
5th row | 철근콘크리트구조 |
Common Values
Value | Count | Frequency (%) |
철근콘크리트구조 | 8475 | |
철근콘크리트조 | 885 | 8.8% |
철골철근콘크리트구조 | 250 | 2.5% |
연와조 | 75 | 0.8% |
<NA> | 57 | 0.6% |
철근콘크리트 | 51 | 0.5% |
철골콘크리트구조 | 32 | 0.3% |
프리케스트콘크리트구조 | 26 | 0.3% |
시멘트벽돌조 | 26 | 0.3% |
조적조 | 16 | 0.2% |
Other values (30) | 107 | 1.1% |
Length
Value | Count | Frequency (%) |
철근콘크리트구조 | 8475 | |
철근콘크리트조 | 885 | 8.8% |
철골철근콘크리트구조 | 250 | 2.5% |
연와조 | 75 | 0.7% |
na | 57 | 0.6% |
철근콘크리트 | 55 | 0.5% |
철골콘크리트구조 | 32 | 0.3% |
프리케스트콘크리트구조 | 26 | 0.3% |
시멘트벽돌조 | 26 | 0.3% |
조적조 | 16 | 0.2% |
Other values (32) | 112 | 1.1% |
주_용도_코드
Text
MISSING
 
Distinct | 79 |
---|---|
Distinct (%) | 0.8% |
Missing | 123 |
Missing (%) | 1.2% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
도매시장 | 3001 | |
아파트 | 2094 | |
부대시설 | 954 | 9.7% |
오피스텔 | 671 | 6.8% |
기타공장 | 431 | 4.4% |
다세대주택 | 400 | 4.0% |
기타노유자시설 | 324 | 3.3% |
상점(소매점 | 292 | 3.0% |
기타제2종근린생활시설 | 204 | 2.1% |
복리시설 | 155 | 1.6% |
Other values (69) | 1351 |
Most occurring characters
Value | Count | Frequency (%) |
시 | 5207 | 12.4% |
장 | 3792 | 9.0% |
매 | 3565 | 8.5% |
도 | 3010 | 7.2% |
아 | 2094 | 5.0% |
파 | 2094 | 5.0% |
트 | 2094 | 5.0% |
설 | 2015 | 4.8% |
기 | 1373 | 3.3% |
타 | 1371 | 3.3% |
Other values (117) | 15476 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 41214 | |
Open Punctuation | 298 | 0.7% |
Close Punctuation | 298 | 0.7% |
Decimal Number | 278 | 0.7% |
Other Punctuation | 3 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 5207 | 12.6% |
장 | 3792 | 9.2% |
매 | 3565 | 8.6% |
도 | 3010 | 7.3% |
아 | 2094 | 5.1% |
파 | 2094 | 5.1% |
트 | 2094 | 5.1% |
설 | 2015 | 4.9% |
기 | 1373 | 3.3% |
타 | 1371 | 3.3% |
Other values (112) | 14599 |
Decimal Number
Value | Count | Frequency (%) |
2 | 204 | |
1 | 74 | 26.6% |
Open Punctuation
Value | Count | Frequency (%) |
( | 298 |
Close Punctuation
Value | Count | Frequency (%) |
) | 298 |
Other Punctuation
Value | Count | Frequency (%) |
. | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 41214 | |
Common | 877 | 2.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 5207 | 12.6% |
장 | 3792 | 9.2% |
매 | 3565 | 8.6% |
도 | 3010 | 7.3% |
아 | 2094 | 5.1% |
파 | 2094 | 5.1% |
트 | 2094 | 5.1% |
설 | 2015 | 4.9% |
기 | 1373 | 3.3% |
타 | 1371 | 3.3% |
Other values (112) | 14599 |
Common
Value | Count | Frequency (%) |
( | 298 | |
) | 298 | |
2 | 204 | |
1 | 74 | 8.4% |
. | 3 | 0.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 41214 | |
ASCII | 877 | 2.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
시 | 5207 | 12.6% |
장 | 3792 | 9.2% |
매 | 3565 | 8.6% |
도 | 3010 | 7.3% |
아 | 2094 | 5.1% |
파 | 2094 | 5.1% |
트 | 2094 | 5.1% |
설 | 2015 | 4.9% |
기 | 1373 | 3.3% |
타 | 1371 | 3.3% |
Other values (112) | 14599 |
ASCII
Value | Count | Frequency (%) |
( | 298 | |
) | 298 | |
2 | 204 | |
1 | 74 | 8.4% |
. | 3 | 0.3% |
기타_용도
Text
MISSING
 
Distinct | 477 |
---|---|
Distinct (%) | 4.8% |
Missing | 128 |
Missing (%) | 1.3% |
Memory size | 156.2 KiB |
Length
Max length | 84 |
---|---|
Median length | 63 |
Mean length | 13.663088 |
Min length | 1 |
Characters and Unicode
Total characters | 134882 |
---|---|
Distinct characters | 249 |
Distinct categories | 11 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 189 ? |
---|---|
Unique (%) | 1.9% |
Sample
1st row | 기계실,전기실,창고,재활용창고,용역원실,휴게실,오락실,주차관제실,체력단련실,검수실,방재센터,경비실,유아실,사무실 |
---|---|
2nd row | 체력단련장,기전실계단 |
3rd row | 경로당,맘스카페 |
4th row | 계단 |
5th row | 보육시설,주민공동시설,문고,노인정,독서실(지1~2층) |
Value | Count | Frequency (%) |
주차장 | 1333 | 13.3% |
기계실,전기실,창고,재활용창고,용역원실,휴게실,오락실,주차관제실,체력단련실,검수실,방재센터,경비실,유아실,사무실 | 862 | 8.6% |
계단실,복도,로비,화장실,공조실 | 801 | 8.0% |
판매시설(도매시장 | 709 | 7.1% |
계단실 | 402 | 4.0% |
지하주차장 | 326 | 3.3% |
벽체 | 322 | 3.2% |
아파트 | 168 | 1.7% |
경비실 | 163 | 1.6% |
공동주택(아파트 | 149 | 1.5% |
Other values (479) | 4792 |
Most occurring characters
Value | Count | Frequency (%) |
, | 20191 | 15.0% |
실 | 17531 | 13.0% |
기 | 4130 | 3.1% |
주 | 3810 | 2.8% |
장 | 3804 | 2.8% |
계 | 3671 | 2.7% |
단 | 2985 | 2.2% |
차 | 2724 | 2.0% |
시 | 2568 | 1.9% |
도 | 2375 | 1.8% |
Other values (239) | 71093 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 106822 | |
Other Punctuation | 20410 | 15.1% |
Uppercase Letter | 2205 | 1.6% |
Close Punctuation | 1942 | 1.4% |
Open Punctuation | 1941 | 1.4% |
Decimal Number | 1039 | 0.8% |
Math Symbol | 212 | 0.2% |
Space Separator | 164 | 0.1% |
Dash Punctuation | 141 | 0.1% |
Other Symbol | 5 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
실 | 17531 | 16.4% |
기 | 4130 | 3.9% |
주 | 3810 | 3.6% |
장 | 3804 | 3.6% |
계 | 3671 | 3.4% |
단 | 2985 | 2.8% |
차 | 2724 | 2.6% |
시 | 2568 | 2.4% |
도 | 2375 | 2.2% |
전 | 2207 | 2.1% |
Other values (203) | 61017 |
Uppercase Letter
Value | Count | Frequency (%) |
D | 432 | |
F | 429 | |
M | 428 | |
E | 350 | |
V | 183 | |
L | 154 | 7.0% |
S | 72 | 3.3% |
P | 65 | 2.9% |
T | 46 | 2.1% |
A | 18 | 0.8% |
Other values (5) | 28 | 1.3% |
Decimal Number
Value | Count | Frequency (%) |
1 | 481 | |
2 | 352 | |
3 | 92 | 8.9% |
6 | 32 | 3.1% |
4 | 31 | 3.0% |
5 | 26 | 2.5% |
7 | 11 | 1.1% |
0 | 10 | 1.0% |
9 | 4 | 0.4% |
Other Punctuation
Value | Count | Frequency (%) |
, | 20191 | |
/ | 143 | 0.7% |
. | 70 | 0.3% |
# | 5 | < 0.1% |
: | 1 | < 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1942 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1941 |
Math Symbol
Value | Count | Frequency (%) |
~ | 212 |
Space Separator
Value | Count | Frequency (%) |
164 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 141 |
Other Symbol
Value | Count | Frequency (%) |
㎡ | 5 |
Control
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 106822 | |
Common | 25855 | 19.2% |
Latin | 2205 | 1.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
실 | 17531 | 16.4% |
기 | 4130 | 3.9% |
주 | 3810 | 3.6% |
장 | 3804 | 3.6% |
계 | 3671 | 3.4% |
단 | 2985 | 2.8% |
차 | 2724 | 2.6% |
시 | 2568 | 2.4% |
도 | 2375 | 2.2% |
전 | 2207 | 2.1% |
Other values (203) | 61017 |
Common
Value | Count | Frequency (%) |
, | 20191 | |
) | 1942 | 7.5% |
( | 1941 | 7.5% |
1 | 481 | 1.9% |
2 | 352 | 1.4% |
~ | 212 | 0.8% |
164 | 0.6% | |
/ | 143 | 0.6% |
- | 141 | 0.5% |
3 | 92 | 0.4% |
Other values (11) | 196 | 0.8% |
Latin
Value | Count | Frequency (%) |
D | 432 | |
F | 429 | |
M | 428 | |
E | 350 | |
V | 183 | |
L | 154 | 7.0% |
S | 72 | 3.3% |
P | 65 | 2.9% |
T | 46 | 2.1% |
A | 18 | 0.8% |
Other values (5) | 28 | 1.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 106822 | |
ASCII | 28055 | 20.8% |
CJK Compat | 5 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
, | 20191 | |
) | 1942 | 6.9% |
( | 1941 | 6.9% |
1 | 481 | 1.7% |
D | 432 | 1.5% |
F | 429 | 1.5% |
M | 428 | 1.5% |
2 | 352 | 1.3% |
E | 350 | 1.2% |
~ | 212 | 0.8% |
Other values (25) | 1297 | 4.6% |
Hangul
Value | Count | Frequency (%) |
실 | 17531 | 16.4% |
기 | 4130 | 3.9% |
주 | 3810 | 3.6% |
장 | 3804 | 3.6% |
계 | 3671 | 3.4% |
단 | 2985 | 2.8% |
차 | 2724 | 2.6% |
시 | 2568 | 2.4% |
도 | 2375 | 2.2% |
전 | 2207 | 2.1% |
Other values (203) | 61017 |
CJK Compat
Value | Count | Frequency (%) |
㎡ | 5 |
면적
Real number (ℝ)
Distinct | 2518 |
---|---|
Distinct (%) | 25.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 23.500582 |
Minimum | 0 |
---|---|
Maximum | 1136.84 |
Zeros | 47 |
Zeros (%) | 0.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.30499 |
Q1 | 2.41 |
median | 13.96285 |
Q3 | 22.86 |
95-th percentile | 84.4675 |
Maximum | 1136.84 |
Range | 1136.84 |
Interquartile range (IQR) | 20.45 |
Descriptive statistics
Standard deviation | 54.224062 |
---|---|
Coefficient of variation (CV) | 2.3073498 |
Kurtosis | 149.42843 |
Mean | 23.500582 |
Median Absolute Deviation (MAD) | 11.51285 |
Skewness | 10.488118 |
Sum | 235005.82 |
Variance | 2940.2489 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2.41 | 505 | 5.1% |
21.08 | 465 | 4.7% |
22.68 | 417 | 4.2% |
20.78 | 119 | 1.2% |
20.66 | 101 | 1.0% |
20.62 | 66 | 0.7% |
20.58 | 51 | 0.5% |
2.43 | 49 | 0.5% |
16.3716 | 48 | 0.5% |
0.0 | 47 | 0.5% |
Other values (2508) | 8132 |
Value | Count | Frequency (%) |
0.0 | 47 | |
0.02 | 5 | 0.1% |
0.03 | 45 | |
0.04 | 20 | |
0.05 | 28 | |
0.053 | 1 | < 0.1% |
0.0558 | 1 | < 0.1% |
0.0559 | 1 | < 0.1% |
0.0595 | 31 | |
0.06 | 36 |
Value | Count | Frequency (%) |
1136.84 | 1 | |
1116.2 | 1 | |
1079.59 | 1 | |
1056.866 | 1 | |
928.52 | 1 | |
918.58 | 1 | |
907.5 | 1 | |
893.95 | 1 | |
874.16 | 1 | |
855.47 | 1 |
작업_일자
Real number (ℝ)
Distinct | 54 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20228855 |
Minimum | 20200108 |
---|---|
Maximum | 20240227 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20200108 |
---|---|
5-th percentile | 20211029 |
Q1 | 20231104 |
median | 20231104 |
Q3 | 20231110 |
95-th percentile | 20231124 |
Maximum | 20240227 |
Range | 40119 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 6877.7042 |
---|---|
Coefficient of variation (CV) | 0.00033999474 |
Kurtosis | 6.0941699 |
Mean | 20228855 |
Median Absolute Deviation (MAD) | 6 |
Skewness | -2.6720395 |
Sum | 2.0228855 × 1011 |
Variance | 47302815 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20231104 | 4191 | |
20231124 | 2059 | |
20231110 | 1983 | |
20211029 | 642 | 6.4% |
20230912 | 453 | 4.5% |
20231028 | 104 | 1.0% |
20240227 | 86 | 0.9% |
20211216 | 77 | 0.8% |
20211123 | 43 | 0.4% |
20220211 | 34 | 0.3% |
Other values (44) | 328 | 3.3% |
Value | Count | Frequency (%) |
20200108 | 2 | < 0.1% |
20200110 | 6 | 0.1% |
20200117 | 5 | 0.1% |
20200206 | 3 | < 0.1% |
20200213 | 4 | < 0.1% |
20200304 | 1 | < 0.1% |
20200306 | 9 | 0.1% |
20200324 | 3 | < 0.1% |
20200331 | 23 | |
20200407 | 1 | < 0.1% |
Value | Count | Frequency (%) |
20240227 | 86 | 0.9% |
20231124 | 2059 | |
20231110 | 1983 | |
20231104 | 4191 | |
20231028 | 104 | 1.0% |
20230929 | 2 | < 0.1% |
20230912 | 453 | 4.5% |
20230831 | 6 | 0.1% |
20230808 | 2 | < 0.1% |
20230607 | 3 | < 0.1% |
전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 층_번호 | 구조_코드 | 기타_구조 | 주_용도_코드 | 면적 | 작업_일자 | |
---|---|---|---|---|---|---|---|---|---|
전유_공용_구분_코드 | 1.000 | 0.342 | 0.433 | 0.527 | 0.100 | 0.203 | 0.339 | 0.188 | 0.156 |
주_부속_구분_코드 | 0.342 | 1.000 | 0.124 | 0.301 | 0.076 | 0.262 | 0.716 | 0.051 | 0.259 |
층_구분_코드 | 0.433 | 0.124 | 1.000 | 0.608 | 0.089 | 0.410 | 0.550 | 0.069 | 0.178 |
층_번호 | 0.527 | 0.301 | 0.608 | 1.000 | 0.059 | 0.000 | 0.411 | 0.395 | 0.147 |
구조_코드 | 0.100 | 0.076 | 0.089 | 0.059 | 1.000 | 0.990 | 0.799 | 0.206 | 0.344 |
기타_구조 | 0.203 | 0.262 | 0.410 | 0.000 | 0.990 | 1.000 | 0.863 | 0.364 | 0.779 |
주_용도_코드 | 0.339 | 0.716 | 0.550 | 0.411 | 0.799 | 0.863 | 1.000 | 0.687 | 0.609 |
면적 | 0.188 | 0.051 | 0.069 | 0.395 | 0.206 | 0.364 | 0.687 | 1.000 | 0.102 |
작업_일자 | 0.156 | 0.259 | 0.178 | 0.147 | 0.344 | 0.779 | 0.609 | 0.102 | 1.000 |
기타_구조 | 주_부속_구분_코드 | 전유_공용_구분_코드 | 구조_코드 | 층_구분_코드 | |
---|---|---|---|---|---|
기타_구조 | 1.000 | 0.220 | 0.170 | 0.917 | 0.190 |
주_부속_구분_코드 | 0.220 | 1.000 | 0.222 | 0.076 | 0.152 |
전유_공용_구분_코드 | 0.170 | 0.222 | 1.000 | 0.100 | 0.527 |
구조_코드 | 0.917 | 0.076 | 0.100 | 1.000 | 0.051 |
층_구분_코드 | 0.190 | 0.152 | 0.527 | 0.051 | 1.000 |
층_번호 | 면적 | 작업_일자 | 전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 구조_코드 | 기타_구조 | |
---|---|---|---|---|---|---|---|---|
층_번호 | 1.000 | 0.190 | 0.009 | 0.406 | 0.231 | 0.296 | 0.027 | 0.000 |
면적 | 0.190 | 1.000 | -0.061 | 0.144 | 0.039 | 0.029 | 0.095 | 0.134 |
작업_일자 | 0.009 | -0.061 | 1.000 | 0.112 | 0.186 | 0.121 | 0.178 | 0.468 |
전유_공용_구분_코드 | 0.406 | 0.144 | 0.112 | 1.000 | 0.222 | 0.527 | 0.100 | 0.170 |
주_부속_구분_코드 | 0.231 | 0.039 | 0.186 | 0.222 | 1.000 | 0.152 | 0.076 | 0.220 |
층_구분_코드 | 0.296 | 0.029 | 0.121 | 0.527 | 0.152 | 1.000 | 0.051 | 0.190 |
구조_코드 | 0.027 | 0.095 | 0.178 | 0.100 | 0.076 | 0.051 | 1.000 | 0.917 |
기타_구조 | 0.000 | 0.134 | 0.468 | 0.170 | 0.220 | 0.190 | 0.917 | 1.000 |
전유_공용_면적_PK | 폐쇄말소대장_PK | 전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 층_번호 | 층_번호_명 | 구조_코드 | 기타_구조 | 주_용도_코드 | 기타_용도 | 면적 | 작업_일자 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
22732 | 11710-100286453 | 11710-100160494 | 공용 | 주건축물 | 각층 | 0 | 지5-1층,3층-7층,10층,11층 | 철근콘크리트구조 | 철근콘크리트구조 | 도매시장 | 기계실,전기실,창고,재활용창고,용역원실,휴게실,오락실,주차관제실,체력단련실,검수실,방재센터,경비실,유아실,사무실 | 2.41 | 20231104 |
48215 | 11350-100783619 | 11350-100216753 | 공용 | 부속건축물 | 지하 | 1 | 지1 | 철근콘크리트구조 | 철근콘크리트구조 | 기타노유자시설 | 체력단련장,기전실계단 | 2.4936 | 20231124 |
55499 | 11260-100954144 | 11260-100290878 | 공용 | 부속건축물 | 지상 | 1 | 1층 | 철근콘크리트구조 | 철근콘크리트구조 | 아파트 | 경로당,맘스카페 | 0.6811 | 20231110 |
1877 | 11740-102714129 | 11740-100633340 | 공용 | 주건축물 | <NA> | 0 | <NA> | 철근콘크리트구조 | 철근콘크리트조 | 아파트 | 계단 | 0.79 | 20211127 |
36037 | 11710-100337118 | 11710-100190839 | 공용 | 부속건축물 | 각층 | 0 | 각층 | 철근콘크리트구조 | 철근콘크리트구조 | 복리시설 | 보육시설,주민공동시설,문고,노인정,독서실(지1~2층) | 2.83 | 20231104 |
41986 | 11680-105916827 | 11680-101076530 | 공용 | 부속건축물 | 지하 | 2 | 지하2층 | 철근콘크리트구조 | 철근콘크리트조 | 부대시설 | 보일러실 | 1.14 | 20211029 |
15930 | 11710-100291523 | 11710-100159514 | 공용 | 주건축물 | 지상 | 8 | 8 | 철근콘크리트구조 | 철근콘크리트구조 | 도매시장 | 계단실,복도,로비,화장실,공조실 | 20.78 | 20231104 |
1641 | 11440-1000000000000007907402 | 11440-1000000000000002613654 | 공용 | 주건축물 | 지하 | 0 | 지2~지1층 | 철근콘크리트구조 | 철근콘크리트구조 | 부대시설 | 폐기물보관실,용역원휴게실 | 0.41 | 20231124 |
50487 | 11350-100784225 | 11350-100216829 | 전유 | 주건축물 | 지상 | 9 | 9층 | 철근콘크리트구조 | 철근콘크리트구조 | 기타노유자시설 | 노인복지주택 | 84.7513 | 20231124 |
47632 | 11350-100782527 | 11350-100216616 | 공용 | 부속건축물 | 지상 | 1 | 1층 | 철근콘크리트구조 | 철근콘크리트구조 | 기타노유자시설 | 지하주차장 | 0.0766 | 20231124 |
전유_공용_면적_PK | 폐쇄말소대장_PK | 전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 층_번호 | 층_번호_명 | 구조_코드 | 기타_구조 | 주_용도_코드 | 기타_용도 | 면적 | 작업_일자 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
6637 | 11710-100279051 | 11710-100157175 | 전유 | 주건축물 | 지상 | 2 | 2 | 철근콘크리트구조 | 철근콘크리트구조 | 도매시장 | 판매시설(도매시장) | 22.68 | 20231104 |
2080 | 11710-100280583 | 11710-100157289 | 전유 | 주건축물 | 지상 | 2 | 2 | 철근콘크리트구조 | 철근콘크리트구조 | 도매시장 | 판매시설(도매시장) | 22.75 | 20231104 |
33328 | 11710-100276378 | 11710-100156423 | 공용 | 주건축물 | 각층 | 0 | 지5층-지1층 | 철근콘크리트구조 | 철근콘크리트구조 | 도매시장 | 주차장 | 21.08 | 20231104 |
52761 | 11140-100263671 | 11140-100090619 | 공용 | 주건축물 | 지하 | 2 | 지2 | 철근콘크리트구조 | 철근콘크리트구조 | 기타판매시설 | 계단실,ELEV.,복도,화장실 | 5.6903 | 20231124 |
33027 | 11260-100951458 | 11260-100290335 | 공용 | 주건축물 | 지상 | 0 | 각층 | 철근콘크리트구조 | 철근콘크리트구조 | 아파트 | 계단실,복도 | 16.3716 | 20231110 |
19496 | 11710-100279201 | 11710-100157782 | 공용 | 주건축물 | 각층 | 0 | 지5-1층,3층-7층,10층,11층 | 철근콘크리트구조 | 철근콘크리트구조 | 도매시장 | 기계실,전기실,창고,재활용창고,용역원실,휴게실,오락실,주차관제실,체력단련실,검수실,방재센터,경비실,유아실,사무실 | 1.2 | 20231104 |
51499 | 11260-1000000000000007522809 | 11260-90500 | 전유 | 주건축물 | 지상 | 2 | 2층 | 철근콘크리트구조 | <NA> | 의원 | 근린생활시설(의원) | 181.21 | 20231028 |
21016 | 11710-100270835 | 11710-100158912 | 전유 | 주건축물 | 지상 | 8 | 8 | 철근콘크리트구조 | 철근콘크리트구조 | 도매시장 | 판매시설(도매시장) | 22.68 | 20231104 |
25429 | 11260-100957792 | 11260-100291393 | 공용 | 주건축물 | 각층 | 0 | 지4~지1 | 철근콘크리트구조 | 철근콘크리트구조 | 오피스텔 | 지하주차장 | 21.4429 | 20231110 |
43804 | 11710-100290615 | 11710-100160199 | 공용 | 주건축물 | 지상 | 3 | 3 | 철근콘크리트구조 | 철근콘크리트구조 | 도매시장 | 계단실,복도,로비,화장실,공조실 | 21.2 | 20231104 |