Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 10000 |
Missing cells | 10243 |
Missing cells (%) | 7.9% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.1 MiB |
Average record size in memory | 120.0 B |
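The missing-cell figures above can be cross-checked against the per-variable "Missing" counts listed in the variable sections of this report. The sketch below is only an arithmetic check under that reading; the variable names in the code are illustrative, and variables that report 0 missing values are left out.

```python
# Figures copied from this report's "Dataset statistics" table.
n_variables = 13
n_observations = 10_000

# Per-variable "Missing" counts as printed in the variable sections.
# Variables that report 0 missing values are omitted.
missing_by_variable = {
    "관리_형별_개요_pk": 32,
    "관리_호별_명세_pk": 9968,
    "층_구분_코드": 11,
    "구조_코드": 3,
    "용도_코드": 3,
    "기타_용도": 226,
}

total_cells = n_variables * n_observations
missing_cells = sum(missing_by_variable.values())
missing_pct = 100 * missing_cells / total_cells

print(missing_cells)          # 10243, matching "Missing cells"
print(f"{missing_pct:.1f}%")  # 7.9%, matching "Missing cells (%)"
```

Note that the `<NA>` entries shown for 전유_공용_구분_코드, 주_부속_구분_코드 and 기타_구조 are not part of this total, since those variables report 0 missing values.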
Variable types
Text | 4 |
---|---|
Categorical | 3 |
Numeric | 6 |
Dataset
Description | 관리_전유_공용_pk,관리_형별_개요_pk,관리_호별_명세_pk,전유_공용_구분_코드,주_부속_구분_코드,층_구분_코드,층_번호,구조_코드,기타_구조,용도_코드,기타_용도,면적,작업_일자 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15673/S/1/datasetView.do |
층_구분_코드 is highly overall correlated with 층_번호 and 1 other field | High correlation |
층_번호 is highly overall correlated with 층_구분_코드 | High correlation |
구조_코드 is highly overall correlated with 기타_구조 | High correlation |
전유_공용_구분_코드 is highly overall correlated with 층_구분_코드 | High correlation |
기타_구조 is highly overall correlated with 구조_코드 | High correlation |
전유_공용_구분_코드 is highly imbalanced (53.7%) | Imbalance |
기타_구조 is highly imbalanced (74.0%) | Imbalance |
관리_호별_명세_pk has 9968 (99.7%) missing values | Missing |
기타_용도 has 226 (2.3%) missing values | Missing |
면적 is highly skewed (γ1 = 62.43615799) | Skewed |
관리_전유_공용_pk has unique values | Unique |
층_번호 has 7110 (71.1%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-11 03:38:05.311354 |
---|---|
Analysis finished | 2024-05-11 03:38:19.512190 |
Duration | 14.2 seconds |
Software version | ydata-profiling v4.5.1 |
관리_전유_공용_pk
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 19.7717 |
Min length | 9 |
Characters and Unicode
Total characters | 197717 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 10000 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11710-100010604 |
---|---|
2nd row | 11260-1000000000000000142658 |
3rd row | 11380-1000000000000000102908 |
4th row | 11410-100006970 |
5th row | 11440-100006644 |
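The sample keys all follow a "digits, dash, digits" layout, which agrees with the character statistics above (only decimal digits and one dash per key). A minimal sketch of validating and splitting such keys follows. The pattern and the names `KEY_PATTERN`, `split_key`, `prefix` and `serial` are assumptions inferred from the samples, not an official key specification.

```python
import re

# Hypothetical pattern inferred from the sample rows: digits, one dash, digits.
KEY_PATTERN = re.compile(r"^(\d+)-(\d+)$")


def split_key(key: str) -> tuple[str, str]:
    """Split a key into (prefix, serial); raise ValueError if it does not match."""
    match = KEY_PATTERN.match(key)
    if match is None:
        raise ValueError(f"unexpected key format: {key!r}")
    return match.group(1), match.group(2)


# The five sample rows shown in this report.
samples = [
    "11710-100010604",
    "11260-1000000000000000142658",
    "11380-1000000000000000102908",
    "11410-100006970",
    "11440-100006644",
]

for key in samples:
    prefix, serial = split_key(key)
    print(prefix, serial, len(key))
```

Splitting the key this way makes it easy to group rows by prefix or to flag keys whose length differs from the common ones.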
Value | Count | Frequency (%) |
11710-100010604 | 1 | < 0.1% |
11590-1000000000000000189925 | 1 | < 0.1% |
11000-100002209 | 1 | < 0.1% |
11590-1000000000000000189997 | 1 | < 0.1% |
11410-100007759 | 1 | < 0.1% |
11680-100008061 | 1 | < 0.1% |
11440-100006858 | 1 | < 0.1% |
11740-100006408 | 1 | < 0.1% |
11260-1000000000000000064518 | 1 | < 0.1% |
11545-100002289 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 98798 | |
1 | 39294 | 19.9% |
- | 10000 | 5.1% |
5 | 7772 | 3.9% |
6 | 7636 | 3.9% |
4 | 6670 | 3.4% |
2 | 5818 | 2.9% |
7 | 5730 | 2.9% |
3 | 5724 | 2.9% |
8 | 5550 | 2.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 187717 | |
Dash Punctuation | 10000 | 5.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 98798 | |
1 | 39294 | 20.9% |
5 | 7772 | 4.1% |
6 | 7636 | 4.1% |
4 | 6670 | 3.6% |
2 | 5818 | 3.1% |
7 | 5730 | 3.1% |
3 | 5724 | 3.0% |
8 | 5550 | 3.0% |
9 | 4725 | 2.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 197717 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 98798 | |
1 | 39294 | 19.9% |
- | 10000 | 5.1% |
5 | 7772 | 3.9% |
6 | 7636 | 3.9% |
4 | 6670 | 3.4% |
2 | 5818 | 2.9% |
7 | 5730 | 2.9% |
3 | 5724 | 2.9% |
8 | 5550 | 2.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 197717 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 98798 | |
1 | 39294 | 19.9% |
- | 10000 | 5.1% |
5 | 7772 | 3.9% |
6 | 7636 | 3.9% |
4 | 6670 | 3.4% |
2 | 5818 | 2.9% |
7 | 5730 | 2.9% |
3 | 5724 | 2.9% |
8 | 5550 | 2.8% |
관리_형별_개요_pk
Text
Distinct | 6706 |
---|---|
Distinct (%) | 67.3% |
Missing | 32 |
Missing (%) | 0.3% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 19.010734 |
Min length | 9 |
Characters and Unicode
Total characters | 189499 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 4283 |
---|---|
Unique (%) | 43.0% |
Sample
1st row | 11710-100004977 |
---|---|
2nd row | 11260-1000000000000000082107 |
3rd row | 11380-100003861 |
4th row | 11410-100004347 |
5th row | 11440-100004121 |
Value | Count | Frequency (%) |
11590-100002581 | 10 | 0.1% |
11590-100003867 | 8 | 0.1% |
11590-100002588 | 8 | 0.1% |
11590-100003866 | 7 | 0.1% |
11000-100008534 | 6 | 0.1% |
11500-100005922 | 6 | 0.1% |
11680-100003372 | 6 | 0.1% |
11590-100002591 | 6 | 0.1% |
11000-100008275 | 5 | 0.1% |
11230-1000000000000000034910 | 5 | 0.1% |
Other values (6696) | 9901 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 93246 | |
1 | 35802 | 18.9% |
- | 9968 | 5.3% |
5 | 8390 | 4.4% |
4 | 7575 | 4.0% |
3 | 7355 | 3.9% |
6 | 6880 | 3.6% |
2 | 6815 | 3.6% |
8 | 5347 | 2.8% |
9 | 4074 | 2.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 179531 | |
Dash Punctuation | 9968 | 5.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 93246 | |
1 | 35802 | 19.9% |
5 | 8390 | 4.7% |
4 | 7575 | 4.2% |
3 | 7355 | 4.1% |
6 | 6880 | 3.8% |
2 | 6815 | 3.8% |
8 | 5347 | 3.0% |
9 | 4074 | 2.3% |
7 | 4047 | 2.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9968 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 189499 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 93246 | |
1 | 35802 | 18.9% |
- | 9968 | 5.3% |
5 | 8390 | 4.4% |
4 | 7575 | 4.0% |
3 | 7355 | 3.9% |
6 | 6880 | 3.6% |
2 | 6815 | 3.6% |
8 | 5347 | 2.8% |
9 | 4074 | 2.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 189499 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 93246 | |
1 | 35802 | 18.9% |
- | 9968 | 5.3% |
5 | 8390 | 4.4% |
4 | 7575 | 4.0% |
3 | 7355 | 3.9% |
6 | 6880 | 3.6% |
2 | 6815 | 3.6% |
8 | 5347 | 2.8% |
9 | 4074 | 2.1% |
관리_호별_명세_pk
Text
MISSING
 
Distinct | 30 |
---|---|
Distinct (%) | 93.8% |
Missing | 9968 |
Missing (%) | 99.7% |
Memory size | 156.2 KiB |
Length
Max length | 15 |
---|---|
Median length | 15 |
Mean length | 15 |
Min length | 15 |
Characters and Unicode
Total characters | 480 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 28 |
---|---|
Unique (%) | 87.5% |
Sample
1st row | 11500-100054551 |
---|---|
2nd row | 11500-100054490 |
3rd row | 11500-100054561 |
4th row | 11500-100054504 |
5th row | 11500-100054514 |
Value | Count | Frequency (%) |
11500-100054496 | 2 | 6.2% |
11500-100054523 | 2 | 6.2% |
11500-100054491 | 1 | 3.1% |
11500-100054541 | 1 | 3.1% |
11500-100054513 | 1 | 3.1% |
11500-100054530 | 1 | 3.1% |
11500-100054554 | 1 | 3.1% |
11500-100054555 | 1 | 3.1% |
11500-100054522 | 1 | 3.1% |
11500-100054512 | 1 | 3.1% |
Other values (20) | 20 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 166 | |
1 | 105 | |
5 | 96 | |
4 | 46 | 9.6% |
- | 32 | 6.7% |
2 | 11 | 2.3% |
3 | 8 | 1.7% |
9 | 7 | 1.5% |
6 | 4 | 0.8% |
8 | 3 | 0.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 448 | |
Dash Punctuation | 32 | 6.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 166 | |
1 | 105 | |
5 | 96 | |
4 | 46 | 10.3% |
2 | 11 | 2.5% |
3 | 8 | 1.8% |
9 | 7 | 1.6% |
6 | 4 | 0.9% |
8 | 3 | 0.7% |
7 | 2 | 0.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 32 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 480 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 166 | |
1 | 105 | |
5 | 96 | |
4 | 46 | 9.6% |
- | 32 | 6.7% |
2 | 11 | 2.3% |
3 | 8 | 1.7% |
9 | 7 | 1.5% |
6 | 4 | 0.8% |
8 | 3 | 0.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 480 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 166 | |
1 | 105 | |
5 | 96 | |
4 | 46 | 9.6% |
- | 32 | 6.7% |
2 | 11 | 2.3% |
3 | 8 | 1.7% |
9 | 7 | 1.5% |
6 | 4 | 0.8% |
8 | 3 | 0.6% |
전유_공용_구분_코드
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2 | 7960 |
---|---|
1 | 2037 |
<NA> | 3 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0009 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 1 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 7960 | |
1 | 2037 | 20.4% |
<NA> | 3 | < 0.1% |
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 7960 | |
1 | 2037 | 20.4% |
na | 3 | < 0.1% |
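The 53.7% imbalance figure reported for 전유_공용_구분_코드 can be reproduced from the counts above as one minus the Shannon entropy of the class shares, normalized by the maximum entropy for that number of classes. The sketch below assumes that definition; the function name `imbalance_score` is illustrative and is not taken from the profiling library.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class shares and k is the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no defined spread; treat as balanced
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Counts for the values 2, 1 and <NA> from the Common Values table.
counts = [7960, 2037, 3]
score = imbalance_score(counts)
print(f"{100 * score:.1f}%")  # prints 53.7%
```

A score of 0% would mean all classes are equally frequent and 100% would mean a single class holds every row, so 53.7% reflects the dominance of value 2.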
주_부속_구분_코드
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 6769 |
---|---|
1 | 3228 |
<NA> | 3 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.0009 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 1 |
3rd row | 0 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
0 | 6769 | |
1 | 3228 | |
<NA> | 3 | < 0.1% |
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 6769 | |
1 | 3228 | |
na | 3 | < 0.1% |
층_구분_코드
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 11 |
Missing (%) | 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 27.309941 |
Minimum | 10 |
---|---|
Maximum | 40 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 10 |
Q1 | 20 |
median | 20 |
Q3 | 40 |
95-th percentile | 40 |
Maximum | 40 |
Range | 30 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 12.450416 |
---|---|
Coefficient of variation (CV) | 0.45589319 |
Kurtosis | -1.6796124 |
Mean | 27.309941 |
Median Absolute Deviation (MAD) | 10 |
Skewness | -0.12655079 |
Sum | 272799 |
Variance | 155.01286 |
Monotonicity | Not monotonic |
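Because 층_구분_코드 has only six distinct values, the descriptive statistics above can be recomputed from its value counts alone. The sketch below uses the counts printed in this variable's frequency table and assumes the sample variance (denominator n − 1), which is what matches the printed figures; the variable names are illustrative.

```python
import math

# Value counts copied from the 층_구분_코드 frequency table (missing rows excluded).
value_counts = {40: 4686, 20: 3215, 10: 2074, 22: 7, 21: 5, 30: 2}

n_valid = sum(value_counts.values())                 # non-missing observations
total = sum(v * c for v, c in value_counts.items())  # the "Sum" row
mean = total / n_valid

# Sample variance with an (n - 1) denominator, an assumption that
# reproduces the "Variance" row of the report.
squared_dev = sum(c * (v - mean) ** 2 for v, c in value_counts.items())
variance = squared_dev / (n_valid - 1)
std = math.sqrt(variance)
cv = std / mean                                      # coefficient of variation
value_range = max(value_counts) - min(value_counts)

print(n_valid, total)  # 9989 272799
print(round(mean, 6), round(variance, 5), round(std, 6), round(cv, 6), value_range)
```

The 9989 valid observations plus the 11 missing ones account for all 10000 rows.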
Value | Count | Frequency (%) |
40 | 4686 | |
20 | 3215 | |
10 | 2074 | |
22 | 7 | 0.1% |
21 | 5 | 0.1% |
30 | 2 | < 0.1% |
(Missing) | 11 | 0.1% |
Value | Count | Frequency (%) |
10 | 2074 | |
20 | 3215 | |
21 | 5 | 0.1% |
22 | 7 | 0.1% |
30 | 2 | < 0.1% |
40 | 4686 |
Value | Count | Frequency (%) |
40 | 4686 | |
30 | 2 | < 0.1% |
22 | 7 | 0.1% |
21 | 5 | 0.1% |
20 | 3215 | |
10 | 2074 |
층_번호
Real number (ℝ)
HIGH CORRELATION
ZEROS
Distinct | 15 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.5406 |
Minimum | 0 |
---|---|
Maximum | 29 |
Zeros | 7110 |
Zeros (%) | 71.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 2 |
Maximum | 29 |
Range | 29 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.425256 |
---|---|
Coefficient of variation (CV) | 2.6364337 |
Kurtosis | 132.75806 |
Mean | 0.5406 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 8.6702864 |
Sum | 5406 |
Variance | 2.0313548 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 7110 | |
1 | 1883 | 18.8% |
2 | 514 | 5.1% |
3 | 176 | 1.8% |
5 | 117 | 1.2% |
4 | 78 | 0.8% |
6 | 64 | 0.6% |
7 | 18 | 0.2% |
9 | 11 | 0.1% |
8 | 9 | 0.1% |
Other values (5) | 20 | 0.2% |
Value | Count | Frequency (%) |
0 | 7110 | |
1 | 1883 | 18.8% |
2 | 514 | 5.1% |
3 | 176 | 1.8% |
4 | 78 | 0.8% |
5 | 117 | 1.2% |
6 | 64 | 0.6% |
7 | 18 | 0.2% |
8 | 9 | 0.1% |
9 | 11 | 0.1% |
Value | Count | Frequency (%) |
29 | 3 | < 0.1% |
26 | 7 | 0.1% |
25 | 1 | < 0.1% |
11 | 5 | 0.1% |
10 | 4 | < 0.1% |
9 | 11 | 0.1% |
8 | 9 | 0.1% |
7 | 18 | 0.2% |
6 | 64 | |
5 | 117 |
구조_코드
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 0.1% |
Missing | 3 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 22.386216 |
Minimum | 21 |
---|---|
Maximum | 99 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 21 |
---|---|
5-th percentile | 21 |
Q1 | 21 |
median | 21 |
Q3 | 21 |
95-th percentile | 42 |
Maximum | 99 |
Range | 78 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 5.3763691 |
---|---|
Coefficient of variation (CV) | 0.24016426 |
Kurtosis | 24.90416 |
Mean | 22.386216 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.2644394 |
Sum | 223795 |
Variance | 28.905344 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
21 | 9345 | |
42 | 621 | 6.2% |
43 | 11 | 0.1% |
40 | 7 | 0.1% |
31 | 5 | 0.1% |
41 | 4 | < 0.1% |
99 | 4 | < 0.1% |
(Missing) | 3 | < 0.1% |
Value | Count | Frequency (%) |
21 | 9345 | |
31 | 5 | 0.1% |
40 | 7 | 0.1% |
41 | 4 | < 0.1% |
42 | 621 | 6.2% |
43 | 11 | 0.1% |
99 | 4 | < 0.1% |
Value | Count | Frequency (%) |
99 | 4 | < 0.1% |
43 | 11 | 0.1% |
42 | 621 | 6.2% |
41 | 4 | < 0.1% |
40 | 7 | 0.1% |
31 | 5 | 0.1% |
21 | 9345 |
기타_구조
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 17 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
철근콘크리트구조 | 7739 |
---|---|
<NA> | 1506 |
철골철근콘크리트구조 | 627 |
무량판구조 | 49 |
철근콘크리트조 | 27 |
Other values (12) | 52 |
Length
Max length | 21 |
---|---|
Median length | 8 |
Mean length | 7.5372 |
Min length | 3 |
Unique
Unique | 8 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 철근콘크리트구조 |
---|---|
2nd row | 철근콘크리트구조 |
3rd row | 철근콘크리트구조 |
4th row | 철근콘크리트구조 |
5th row | 철근콘크리트구조 |
Common Values
Value | Count | Frequency (%) |
철근콘크리트구조 | 7739 | |
<NA> | 1506 | 15.1% |
철골철근콘크리트구조 | 627 | 6.3% |
무량판구조 | 49 | 0.5% |
철근콘크리트조 | 27 | 0.3% |
철근콘크리트구조,철골철근콘크리트구조 | 25 | 0.2% |
철골철근콘크리트합성구조 | 10 | 0.1% |
(벽식구조) | 6 | 0.1% |
라멘구조 | 3 | < 0.1% |
철근콘크리트벽식구조 | 1 | < 0.1% |
Other values (7) | 7 | 0.1% |
Common Values (Plot)
Value | Count | Frequency (%) |
철근콘크리트구조 | 7739 | |
na | 1506 | 15.1% |
철골철근콘크리트구조 | 627 | 6.3% |
무량판구조 | 49 | 0.5% |
철근콘크리트조 | 27 | 0.3% |
철근콘크리트구조,철골철근콘크리트구조 | 25 | 0.2% |
철골철근콘크리트합성구조 | 10 | 0.1% |
벽식구조 | 6 | 0.1% |
라멘구조 | 3 | < 0.1% |
철근콘크리트벽식구조 | 1 | < 0.1% |
Other values (7) | 7 | 0.1% |
용도_코드
Real number (ℝ)
Distinct | 58 |
---|---|
Distinct (%) | 0.6% |
Missing | 3 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3900.5868 |
Minimum | 1003 |
---|---|
Maximum | 20001 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1003 |
---|---|
5-th percentile | 2001 |
Q1 | 2001 |
median | 2005 |
Q3 | 4999 |
95-th percentile | 14202 |
Maximum | 20001 |
Range | 18998 |
Interquartile range (IQR) | 2998 |
Descriptive statistics
Standard deviation | 3161.2254 |
---|---|
Coefficient of variation (CV) | 0.81044867 |
Kurtosis | 4.3443515 |
Mean | 3900.5868 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 2.180108 |
Sum | 38994166 |
Variance | 9993345.9 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2001 | 3544 | |
4999 | 1772 | |
2005 | 936 | 9.4% |
2003 | 586 | 5.9% |
7201 | 426 | 4.3% |
14204 | 386 | 3.9% |
3001 | 376 | 3.8% |
2006 | 344 | 3.4% |
3999 | 322 | 3.2% |
7999 | 307 | 3.1% |
Other values (48) | 998 | 10.0% |
Value | Count | Frequency (%) |
1003 | 14 | 0.1% |
2001 | 3544 | |
2002 | 234 | 2.3% |
2003 | 586 | 5.9% |
2004 | 17 | 0.2% |
2005 | 936 | 9.4% |
2006 | 344 | 3.4% |
3001 | 376 | 3.8% |
3002 | 22 | 0.2% |
3005 | 71 | 0.7% |
Value | Count | Frequency (%) |
20001 | 2 | < 0.1% |
15104 | 2 | < 0.1% |
14204 | 386 | |
14202 | 189 | |
14199 | 7 | 0.1% |
14102 | 3 | < 0.1% |
14101 | 1 | < 0.1% |
13999 | 21 | 0.2% |
13104 | 5 | 0.1% |
13011 | 7 | 0.1% |
기타_용도
Text
MISSING
 
Distinct | 983 |
---|---|
Distinct (%) | 10.1% |
Missing | 226 |
Missing (%) | 2.3% |
Memory size | 156.2 KiB |
Length
Max length | 84 |
---|---|
Median length | 72 |
Mean length | 11.049928 |
Min length | 2 |
Characters and Unicode
Total characters | 108002 |
---|---|
Distinct characters | 303 |
Distinct categories | 10 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 257 |
---|---|
Unique (%) | 2.6% |
Sample
1st row | 계단실,화장실,홀,경비실,방재실 |
---|---|
2nd row | 주차장 |
3rd row | 공동주택(아파트) |
4th row | 주민공동시설,외부ELEV |
5th row | 지하주차장 |
Value | Count | Frequency (%) |
지하주차장 | 802 | 7.7% |
주차장 | 599 | 5.7% |
벽체 | 546 | 5.2% |
근린생활시설 | 543 | 5.2% |
계단실 | 452 | 4.3% |
등 | 296 | 2.8% |
공동주택(아파트 | 283 | 2.7% |
아파트 | 227 | 2.2% |
기계실,전기실 | 217 | 2.1% |
경비실 | 161 | 1.5% |
Other values (945) | 6293 |
Most occurring characters
Value | Count | Frequency (%) |
, | 12548 | 11.6% |
실 | 9764 | 9.0% |
기 | 4551 | 4.2% |
계 | 3228 | 3.0% |
주 | 3084 | 2.9% |
장 | 2569 | 2.4% |
전 | 2421 | 2.2% |
시 | 2328 | 2.2% |
단 | 2029 | 1.9% |
설 | 1936 | 1.8% |
Other values (293) | 63544 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 87421 | |
Other Punctuation | 12828 | 11.9% |
Uppercase Letter | 3050 | 2.8% |
Close Punctuation | 1498 | 1.4% |
Open Punctuation | 1497 | 1.4% |
Decimal Number | 857 | 0.8% |
Space Separator | 645 | 0.6% |
Dash Punctuation | 128 | 0.1% |
Math Symbol | 76 | 0.1% |
Connector Punctuation | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
실 | 9764 | 11.2% |
기 | 4551 | 5.2% |
계 | 3228 | 3.7% |
주 | 3084 | 3.5% |
장 | 2569 | 2.9% |
전 | 2421 | 2.8% |
시 | 2328 | 2.7% |
단 | 2029 | 2.3% |
설 | 1936 | 2.2% |
관 | 1856 | 2.1% |
Other values (253) | 53655 |
Uppercase Letter
Value | Count | Frequency (%) |
M | 759 | |
F | 758 | |
D | 758 | |
E | 318 | |
V | 165 | 5.4% |
L | 156 | 5.1% |
X | 42 | 1.4% |
G | 41 | 1.3% |
B | 11 | 0.4% |
T | 8 | 0.3% |
Other values (8) | 34 | 1.1% |
Decimal Number
Value | Count | Frequency (%) |
1 | 359 | |
2 | 246 | |
3 | 116 | 13.5% |
5 | 38 | 4.4% |
6 | 37 | 4.3% |
8 | 30 | 3.5% |
4 | 30 | 3.5% |
9 | 1 | 0.1% |
Other Punctuation
Value | Count | Frequency (%) |
, | 12548 | |
/ | 186 | 1.4% |
. | 79 | 0.6% |
& | 13 | 0.1% |
' | 2 | < 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1483 | |
] | 15 | 1.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1482 | |
[ | 15 | 1.0% |
Math Symbol
Value | Count | Frequency (%) |
~ | 73 | |
+ | 3 | 3.9% |
Space Separator
Value | Count | Frequency (%) |
(space) | 645 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 128 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 87421 | |
Common | 17531 | 16.2% |
Latin | 3050 | 2.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
실 | 9764 | 11.2% |
기 | 4551 | 5.2% |
계 | 3228 | 3.7% |
주 | 3084 | 3.5% |
장 | 2569 | 2.9% |
전 | 2421 | 2.8% |
시 | 2328 | 2.7% |
단 | 2029 | 2.3% |
설 | 1936 | 2.2% |
관 | 1856 | 2.1% |
Other values (253) | 53655 |
Common
Value | Count | Frequency (%) |
, | 12548 | |
) | 1483 | 8.5% |
( | 1482 | 8.5% |
(space) | 645 | 3.7% |
1 | 359 | 2.0% |
2 | 246 | 1.4% |
/ | 186 | 1.1% |
- | 128 | 0.7% |
3 | 116 | 0.7% |
. | 79 | 0.5% |
Other values (12) | 259 | 1.5% |
Latin
Value | Count | Frequency (%) |
M | 759 | |
F | 758 | |
D | 758 | |
E | 318 | |
V | 165 | 5.4% |
L | 156 | 5.1% |
X | 42 | 1.4% |
G | 41 | 1.3% |
B | 11 | 0.4% |
T | 8 | 0.3% |
Other values (8) | 34 | 1.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 87421 | |
ASCII | 20581 | 19.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
, | 12548 | |
) | 1483 | 7.2% |
( | 1482 | 7.2% |
M | 759 | 3.7% |
F | 758 | 3.7% |
D | 758 | 3.7% |
(space) | 645 | 3.1% |
1 | 359 | 1.7% |
E | 318 | 1.5% |
2 | 246 | 1.2% |
Other values (30) | 1225 | 6.0% |
Hangul
Value | Count | Frequency (%) |
실 | 9764 | 11.2% |
기 | 4551 | 5.2% |
계 | 3228 | 3.7% |
주 | 3084 | 3.5% |
장 | 2569 | 2.9% |
전 | 2421 | 2.8% |
시 | 2328 | 2.7% |
단 | 2029 | 2.3% |
설 | 1936 | 2.2% |
관 | 1856 | 2.1% |
Other values (253) | 53655 |
면적
Real number (ℝ)
SKEWED
 
Distinct | 7187 |
---|---|
Distinct (%) | 71.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 30.086541 |
Minimum | 0 |
---|---|
Maximum | 18531.37 |
Zeros | 4 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.28 |
Q1 | 2.36315 |
median | 10.265 |
Q3 | 32.37625 |
95-th percentile | 84.821 |
Maximum | 18531.37 |
Range | 18531.37 |
Interquartile range (IQR) | 30.0131 |
Descriptive statistics
Standard deviation | 223.27341 |
---|---|
Coefficient of variation (CV) | 7.4210396 |
Kurtosis | 4837.0018 |
Mean | 30.086541 |
Median Absolute Deviation (MAD) | 9.3666 |
Skewness | 62.436158 |
Sum | 300865.41 |
Variance | 49851.017 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.03 | 24 | 0.2% |
0.07 | 23 | 0.2% |
0.04 | 20 | 0.2% |
0.05 | 20 | 0.2% |
0.54 | 17 | 0.2% |
0.37 | 16 | 0.2% |
0.77 | 16 | 0.2% |
0.4 | 14 | 0.1% |
0.02 | 13 | 0.1% |
0.78 | 13 | 0.1% |
Other values (7177) | 9824 |
Value | Count | Frequency (%) |
0.0 | 4 | < 0.1% |
0.0168 | 1 | < 0.1% |
0.0185 | 1 | < 0.1% |
0.02 | 13 | |
0.0203 | 1 | < 0.1% |
0.0211 | 2 | < 0.1% |
0.0223 | 2 | < 0.1% |
0.0237 | 2 | < 0.1% |
0.024 | 1 | < 0.1% |
0.0246 | 1 | < 0.1% |
Value | Count | Frequency (%) |
18531.37 | 1 | |
5862.676 | 1 | |
5551.77 | 1 | |
4898.77 | 1 | |
3628.059 | 1 | |
3389.464 | 1 | |
2551.6552 | 1 | |
1876.737 | 1 | |
1815.6 | 1 | |
1671.7518 | 1 |
작업_일자
Real number (ℝ)
Distinct | 146 |
---|---|
Distinct (%) | 1.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20227355 |
Minimum | 20201201 |
---|---|
Maximum | 20240510 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20201201 |
---|---|
5-th percentile | 20210507 |
Q1 | 20220304 |
median | 20230509 |
Q3 | 20240102 |
95-th percentile | 20240510 |
Maximum | 20240510 |
Range | 39309 |
Interquartile range (IQR) | 19798 |
Descriptive statistics
Standard deviation | 11413.458 |
---|---|
Coefficient of variation (CV) | 0.00056425858 |
Kurtosis | -1.147869 |
Mean | 20227355 |
Median Absolute Deviation (MAD) | 9699 |
Skewness | -0.37866414 |
Sum | 2.0227355 × 10^11 |
Variance | 1.3026703 × 10^8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20240510 | 1394 | 13.9% |
20211029 | 997 | 10.0% |
20240102 | 644 | 6.4% |
20240208 | 551 | 5.5% |
20230704 | 405 | 4.0% |
20230831 | 239 | 2.4% |
20230727 | 220 | 2.2% |
20230310 | 184 | 1.8% |
20210106 | 171 | 1.7% |
20240417 | 170 | 1.7% |
Other values (136) | 5025 |
Value | Count | Frequency (%) |
20201201 | 62 | 0.6% |
20201208 | 43 | 0.4% |
20201216 | 32 | 0.3% |
20210106 | 171 | |
20210115 | 9 | 0.1% |
20210119 | 65 | 0.7% |
20210126 | 13 | 0.1% |
20210203 | 7 | 0.1% |
20210224 | 7 | 0.1% |
20210415 | 8 | 0.1% |
Value | Count | Frequency (%) |
20240510 | 1394 | |
20240507 | 57 | 0.6% |
20240425 | 5 | 0.1% |
20240420 | 44 | 0.4% |
20240417 | 170 | 1.7% |
20240416 | 2 | < 0.1% |
20240402 | 29 | 0.3% |
20240327 | 11 | 0.1% |
20240309 | 44 | 0.4% |
20240302 | 9 | 0.1% |
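The 작업_일자 values look like calendar dates written as eight-digit YYYYMMDD integers, which would explain why the report treats the column as a real number and why statistics such as the mean and range are not meaningful as dates. The sketch below converts the reported minimum and maximum under that assumption; the helper name `parse_yyyymmdd` is illustrative.

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Interpret an integer such as 20240510 as the date 2024-05-10.
    Assumes the YYYYMMDD layout, which the report itself does not state."""
    return datetime.strptime(str(value), "%Y%m%d").date()


# Minimum and maximum copied from the 작업_일자 statistics.
earliest = parse_yyyymmdd(20201201)
latest = parse_yyyymmdd(20240510)

span_days = (latest - earliest).days
numeric_range = 20240510 - 20201201  # the "Range" row: plain integer subtraction

print(earliest, latest)
print(span_days, numeric_range)
```

Parsing the column as dates before profiling would let the interval be measured in days rather than as a difference between integers.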
Variable | 관리_호별_명세_pk | 전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 층_번호 | 구조_코드 | 기타_구조 | 용도_코드 | 면적 | 작업_일자 |
---|---|---|---|---|---|---|---|---|---|---|
관리_호별_명세_pk | 1.000 | 0.000 | NaN | NaN | 1.000 | NaN | NaN | NaN | NaN | NaN |
전유_공용_구분_코드 | 0.000 | 1.000 | 0.520 | 0.443 | 0.097 | 0.057 | 0.032 | 0.191 | 0.030 | 0.056 |
주_부속_구분_코드 | NaN | 0.520 | 1.000 | 0.215 | 0.063 | 0.099 | 0.160 | 0.322 | 0.000 | 0.302 |
층_구분_코드 | NaN | 0.443 | 0.215 | 1.000 | 0.212 | 0.235 | 0.604 | 0.229 | 0.000 | 0.062 |
층_번호 | 1.000 | 0.097 | 0.063 | 0.212 | 1.000 | 0.175 | 0.108 | 0.133 | 0.019 | 0.163 |
구조_코드 | NaN | 0.057 | 0.099 | 0.235 | 0.175 | 1.000 | 1.000 | 0.256 | 0.024 | 0.181 |
기타_구조 | NaN | 0.032 | 0.160 | 0.604 | 0.108 | 1.000 | 1.000 | 0.595 | 0.456 | 0.273 |
용도_코드 | NaN | 0.191 | 0.322 | 0.229 | 0.133 | 0.256 | 0.595 | 1.000 | 0.286 | 0.292 |
면적 | NaN | 0.030 | 0.000 | 0.000 | 0.019 | 0.024 | 0.456 | 0.286 | 1.000 | 0.000 |
작업_일자 | NaN | 0.056 | 0.302 | 0.062 | 0.163 | 0.181 | 0.273 | 0.292 | 0.000 | 1.000 |
Variable | 기타_구조 | 전유_공용_구분_코드 | 주_부속_구분_코드 |
---|---|---|---|
기타_구조 | 1.000 | 0.025 | 0.126 |
전유_공용_구분_코드 | 0.025 | 1.000 | 0.348 |
주_부속_구분_코드 | 0.126 | 0.348 | 1.000 |
Variable | 층_구분_코드 | 층_번호 | 구조_코드 | 용도_코드 | 면적 | 작업_일자 | 전유_공용_구분_코드 | 주_부속_구분_코드 | 기타_구조 |
---|---|---|---|---|---|---|---|---|---|
층_구분_코드 | 1.000 | -0.699 | 0.089 | -0.004 | -0.009 | 0.039 | 0.538 | 0.263 | 0.414 |
층_번호 | -0.699 | 1.000 | -0.072 | 0.007 | -0.234 | -0.041 | 0.070 | 0.045 | 0.052 |
구조_코드 | 0.089 | -0.072 | 1.000 | 0.135 | 0.053 | -0.003 | 0.038 | 0.065 | 0.999 |
용도_코드 | -0.004 | 0.007 | 0.135 | 1.000 | 0.163 | -0.157 | 0.143 | 0.241 | 0.319 |
면적 | -0.009 | -0.234 | 0.053 | 0.163 | 1.000 | -0.036 | 0.037 | 0.000 | 0.253 |
작업_일자 | 0.039 | -0.041 | -0.003 | -0.157 | -0.036 | 1.000 | 0.040 | 0.213 | 0.135 |
전유_공용_구분_코드 | 0.538 | 0.070 | 0.038 | 0.143 | 0.037 | 0.040 | 1.000 | 0.348 | 0.025 |
주_부속_구분_코드 | 0.263 | 0.045 | 0.065 | 0.241 | 0.000 | 0.213 | 0.348 | 1.000 | 0.126 |
기타_구조 | 0.414 | 0.052 | 0.999 | 0.319 | 0.253 | 0.135 | 0.025 | 0.126 | 1.000 |
Index | 관리_전유_공용_pk | 관리_형별_개요_pk | 관리_호별_명세_pk | 전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 층_번호 | 구조_코드 | 기타_구조 | 용도_코드 | 기타_용도 | 면적 | 작업_일자 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
57617 | 11710-100010604 | 11710-100004977 | <NA> | 2 | 0 | 10 | 1 | 21 | 철근콘크리트구조 | 4999 | 계단실,화장실,홀,경비실,방재실 | 5.64 | 20230104 |
18458 | 11260-1000000000000000142658 | 11260-1000000000000000082107 | <NA> | 2 | 1 | 40 | 0 | 21 | 철근콘크리트구조 | 7201 | 주차장 | 93.0928 | 20240102 |
24096 | 11380-1000000000000000102908 | 11380-100003861 | <NA> | 1 | 0 | 20 | 0 | 21 | 철근콘크리트구조 | 2001 | 공동주택(아파트) | 49.53 | 20230929 |
28065 | 11410-100006970 | 11410-100004347 | <NA> | 2 | 1 | 40 | 0 | 21 | 철근콘크리트구조 | 2005 | 주민공동시설,외부ELEV | 3.51 | 20240510 |
30289 | 11440-100006644 | 11440-100004121 | <NA> | 2 | 1 | 40 | 0 | 21 | 철근콘크리트구조 | 4999 | 지하주차장 | 11.97 | 20211029 |
40666 | 11545-100005853 | 11545-100003053 | <NA> | 2 | 1 | 40 | 0 | 21 | 철근콘크리트구조 | 2006 | 경로당,어린이집,아이키움 | 0.9035 | 20210515 |
46418 | 11590-100007371 | 11590-100004229 | <NA> | 1 | 0 | 20 | 0 | 21 | 철근콘크리트구조 | 7999 | 판매시설 | 63.523 | 20211103 |
3703 | 11000-100004734 | 11000-100002918 | <NA> | 2 | 1 | 40 | 0 | 21 | 철근콘크리트구조 | 2001 | 옥외승강기 | 0.02 | 20240510 |
37359 | 11530-1000000000000000086231 | 11530-1000000000000000053536 | <NA> | 2 | 0 | 40 | 0 | 21 | 철근콘크리트구조 | 2001 | 계단실,승강기,홀 | 16.5303 | 20230602 |
41214 | 11560-1000000000000000035305 | 11560-1000000000000000018214 | <NA> | 2 | 0 | 40 | 0 | 21 | 철근콘크리트구조 | 2001 | 계단실 | 13.17 | 20221115 |
Index | 관리_전유_공용_pk | 관리_형별_개요_pk | 관리_호별_명세_pk | 전유_공용_구분_코드 | 주_부속_구분_코드 | 층_구분_코드 | 층_번호 | 구조_코드 | 기타_구조 | 용도_코드 | 기타_용도 | 면적 | 작업_일자 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
45423 | 11590-100006300 | 11590-100003865 | <NA> | 2 | 0 | 40 | 0 | 21 | 철근콘크리트구조 | 2001 | 계단실 | 17.9725 | 20210421 |
37360 | 11530-1000000000000000086232 | 11530-1000000000000000053536 | <NA> | 2 | 1 | 20 | 2 | 21 | 철근콘크리트구조 | 2001 | 어린이집 | 0.5508 | 20230602 |
20755 | 11290-100010360 | 11290-100004716 | <NA> | 2 | 0 | 40 | 0 | 21 | 철근콘크리트구조 | 2001 | 계단실,승강기,홀,복도 | 24.17 | 20220929 |
49550 | 11650-100006252 | 11650-100002830 | <NA> | 2 | 0 | 20 | 0 | 21 | 철근콘크리트구조 | 2001 | 벽체 | 9.1541 | 20211029 |
46103 | 11590-100007024 | 11590-100004086 | <NA> | 2 | 0 | 40 | 0 | 21 | 철근콘크리트구조 | 2001 | 주차장 | 11.8572 | 20211013 |
10615 | 11140-100007629 | 11140-100003291 | <NA> | 2 | 0 | 40 | 0 | 42 | 철골철근콘크리트구조 | 4001 | 주차장 | 21.5312 | 20240102 |
13275 | 11215-100004909 | 11215-100003113 | <NA> | 2 | 1 | 40 | 0 | 21 | 철근콘크리트구조 | 4999 | 기계실,전기실,발전기실,정화조관리실,우수조관리실,방재실,재활용보관창고,쓰레기처리장 | 5.119 | 20220104 |
40339 | 11545-100005518 | 11545-100002971 | <NA> | 2 | 1 | 40 | 0 | 21 | 철근콘크리트구조 | 14204 | 관리사무소,경비실 | 0.9373 | 20210507 |
22058 | 11320-1000000000000000030024 | 11320-1000000000000000013707 | <NA> | 2 | 0 | 20 | 2 | 21 | <NA> | 1003 | <NA> | 21.06 | 20221019 |
36014 | 11500-100011861 | 11500-100005884 | <NA> | 2 | 0 | 40 | 0 | 21 | 철근콘크리트구조 | 2001 | 계단실 | 15.1786 | 20230831 |