Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 10000 |
Missing cells | 9997 |
Missing cells (%) | 10.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 908.2 KiB |
Average record size in memory | 93.0 B |
Variable types
Text | 5 |
---|---|
Numeric | 3 |
Categorical | 2 |
Dataset
Description | 관리_호별_명세_pk,관리_동별_개요_pk,호_번호,호_명칭,평형_구분_명,층_번호,층_구분_코드,관리_건축물대장_참조_pk,변경_구분_코드,작업_일자 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15674/S/1/datasetView.do |
변경_구분_코드 is highly overall correlated with 호_번호 and 3 other fields | High correlation |
층_구분_코드 is highly overall correlated with 변경_구분_코드 | High correlation |
호_번호 is highly overall correlated with 변경_구분_코드 | High correlation |
층_번호 is highly overall correlated with 변경_구분_코드 | High correlation |
작업_일자 is highly overall correlated with 변경_구분_코드 | High correlation |
층_구분_코드 is highly imbalanced (92.6%) | Imbalance |
변경_구분_코드 is highly imbalanced (92.5%) | Imbalance |
관리_건축물대장_참조_pk has 9911 (99.1%) missing values | Missing |
관리_호별_명세_pk has unique values | Unique |
호_번호 has 8076 (80.8%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-11 02:08:15.074658 |
---|---|
Analysis finished | 2024-05-11 02:08:20.576925 |
Duration | 5.5 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
관리_호별_명세_pk
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 18.3254 |
Min length | 15 |
Characters and Unicode
Total characters | 183254 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11230-1000000000000000054318 |
---|---|
2nd row | 11350-1000000000000000521372 |
3rd row | 11000-100017041 |
4th row | 11000-100006342 |
5th row | 11380-1000000000000000594147 |
Value | Count | Frequency (%) |
11230-1000000000000000054318 | 1 | < 0.1% |
11000-100036835 | 1 | < 0.1% |
11350-100040673 | 1 | < 0.1% |
11000-100035002 | 1 | < 0.1% |
11000-100043739 | 1 | < 0.1% |
11215-1000000000000000371670 | 1 | < 0.1% |
11350-100041472 | 1 | < 0.1% |
11000-100010812 | 1 | < 0.1% |
11000-100030430 | 1 | < 0.1% |
11380-1000000000000000593763 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 84338 | |
1 | 37133 | |
- | 10000 | 5.5% |
2 | 9118 | 5.0% |
3 | 9004 | 4.9% |
5 | 6543 | 3.6% |
4 | 5758 | 3.1% |
7 | 5705 | 3.1% |
9 | 5526 | 3.0% |
6 | 5117 | 2.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 173254 | |
Dash Punctuation | 10000 | 5.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 84338 | |
1 | 37133 | |
2 | 9118 | 5.3% |
3 | 9004 | 5.2% |
5 | 6543 | 3.8% |
4 | 5758 | 3.3% |
7 | 5705 | 3.3% |
9 | 5526 | 3.2% |
6 | 5117 | 3.0% |
8 | 5012 | 2.9% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 183254 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 84338 | |
1 | 37133 | |
- | 10000 | 5.5% |
2 | 9118 | 5.0% |
3 | 9004 | 4.9% |
5 | 6543 | 3.6% |
4 | 5758 | 3.1% |
7 | 5705 | 3.1% |
9 | 5526 | 3.0% |
6 | 5117 | 2.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 183254 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 84338 | |
1 | 37133 | |
- | 10000 | 5.5% |
2 | 9118 | 5.0% |
3 | 9004 | 4.9% |
5 | 6543 | 3.6% |
4 | 5758 | 3.1% |
7 | 5705 | 3.1% |
9 | 5526 | 3.0% |
6 | 5117 | 2.8% |
관리_동별_개요_pk
Text
Distinct | 1399 |
---|---|
Distinct (%) | 14.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 18.3254 |
Min length | 15 |
Characters and Unicode
Total characters | 183254 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 170 ? |
---|---|
Unique (%) | 1.7% |
Sample
1st row | 11230-1000000000000000007437 |
---|---|
2nd row | 11350-1000000000000000097236 |
3rd row | 11000-100004454 |
4th row | 11000-100003664 |
5th row | 11380-1000000000000000106283 |
Value | Count | Frequency (%) |
11230-1000000000000000101255 | 139 | 1.4% |
11305-100006084 | 91 | 0.9% |
11140-1000000000000000066327 | 81 | 0.8% |
11230-1000000000000000101254 | 78 | 0.8% |
11170-100006321 | 75 | 0.8% |
11140-1000000000000000053703 | 61 | 0.6% |
11170-100005821 | 58 | 0.6% |
11170-100005902 | 58 | 0.6% |
11170-100005942 | 57 | 0.6% |
11170-100005941 | 56 | 0.6% |
Other values (1389) | 9246 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 90490 | |
1 | 37478 | |
- | 10000 | 5.5% |
2 | 7872 | 4.3% |
3 | 7588 | 4.1% |
5 | 5711 | 3.1% |
6 | 5662 | 3.1% |
4 | 4943 | 2.7% |
7 | 4788 | 2.6% |
8 | 4571 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 173254 | |
Dash Punctuation | 10000 | 5.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 90490 | |
1 | 37478 | |
2 | 7872 | 4.5% |
3 | 7588 | 4.4% |
5 | 5711 | 3.3% |
6 | 5662 | 3.3% |
4 | 4943 | 2.9% |
7 | 4788 | 2.8% |
8 | 4571 | 2.6% |
9 | 4151 | 2.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 183254 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 90490 | |
1 | 37478 | |
- | 10000 | 5.5% |
2 | 7872 | 4.3% |
3 | 7588 | 4.1% |
5 | 5711 | 3.1% |
6 | 5662 | 3.1% |
4 | 4943 | 2.7% |
7 | 4788 | 2.6% |
8 | 4571 | 2.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 183254 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 90490 | |
1 | 37478 | |
- | 10000 | 5.5% |
2 | 7872 | 4.3% |
3 | 7588 | 4.1% |
5 | 5711 | 3.1% |
6 | 5662 | 3.1% |
4 | 4943 | 2.7% |
7 | 4788 | 2.6% |
8 | 4571 | 2.5% |
호_번호
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 452 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 34.9901 |
Minimum | 0 |
---|---|
Maximum | 752 |
Zeros | 8076 |
Zeros (%) | 80.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 309.05 |
Maximum | 752 |
Range | 752 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 102.33018 |
---|---|
Coefficient of variation (CV) | 2.9245465 |
Kurtosis | 11.63231 |
Mean | 34.9901 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.4276364 |
Sum | 349901 |
Variance | 10471.465 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 8076 | |
404 | 22 | 0.2% |
407 | 20 | 0.2% |
403 | 20 | 0.2% |
4 | 19 | 0.2% |
30 | 19 | 0.2% |
3 | 18 | 0.2% |
7 | 17 | 0.2% |
401 | 17 | 0.2% |
315 | 16 | 0.2% |
Other values (442) | 1756 | 17.6% |
Value | Count | Frequency (%) |
0 | 8076 | |
1 | 12 | 0.1% |
2 | 14 | 0.1% |
3 | 18 | 0.2% |
4 | 19 | 0.2% |
5 | 11 | 0.1% |
6 | 12 | 0.1% |
7 | 17 | 0.2% |
8 | 15 | 0.1% |
9 | 7 | 0.1% |
Value | Count | Frequency (%) |
752 | 1 | |
746 | 1 | |
741 | 1 | |
707 | 1 | |
705 | 1 | |
701 | 1 | |
697 | 1 | |
692 | 1 | |
687 | 1 | |
667 | 1 |
호_명칭
Text
Distinct | 1340 |
---|---|
Distinct (%) | 13.4% |
Missing | 3 |
Missing (%) | < 0.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
202 | 141 | 1.4% |
401 | 136 | 1.4% |
602 | 132 | 1.3% |
301 | 132 | 1.3% |
303 | 124 | 1.2% |
501 | 120 | 1.2% |
504 | 119 | 1.2% |
403 | 118 | 1.2% |
203 | 116 | 1.2% |
402 | 116 | 1.2% |
Other values (1320) | 8769 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 9947 | |
1 | 6854 | |
2 | 4467 | |
3 | 3345 | 9.5% |
4 | 2967 | 8.4% |
5 | 1984 | 5.6% |
6 | 1568 | 4.4% |
7 | 1248 | 3.5% |
8 | 1057 | 3.0% |
9 | 985 | 2.8% |
Other values (20) | 899 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 34422 | |
Other Letter | 586 | 1.7% |
Uppercase Letter | 266 | 0.8% |
Space Separator | 26 | 0.1% |
Dash Punctuation | 21 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
호 | 395 | |
동 | 156 | 26.6% |
층 | 6 | 1.0% |
근 | 5 | 0.9% |
생 | 5 | 0.9% |
판 | 3 | 0.5% |
매 | 3 | 0.5% |
시 | 3 | 0.5% |
설 | 3 | 0.5% |
제 | 3 | 0.5% |
Other values (2) | 4 | 0.7% |
Decimal Number
Value | Count | Frequency (%) |
0 | 9947 | |
1 | 6854 | |
2 | 4467 | |
3 | 3345 | 9.7% |
4 | 2967 | 8.6% |
5 | 1984 | 5.8% |
6 | 1568 | 4.6% |
7 | 1248 | 3.6% |
8 | 1057 | 3.1% |
9 | 985 | 2.9% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 115 | |
C | 59 | |
A | 59 | |
D | 30 | 11.3% |
T | 2 | 0.8% |
H | 1 | 0.4% |
Space Separator
Value | Count | Frequency (%) |
26 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 21 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 34469 | |
Hangul | 586 | 1.7% |
Latin | 266 | 0.8% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 9947 | |
1 | 6854 | |
2 | 4467 | |
3 | 3345 | 9.7% |
4 | 2967 | 8.6% |
5 | 1984 | 5.8% |
6 | 1568 | 4.5% |
7 | 1248 | 3.6% |
8 | 1057 | 3.1% |
9 | 985 | 2.9% |
Other values (2) | 47 | 0.1% |
Hangul
Value | Count | Frequency (%) |
호 | 395 | |
동 | 156 | 26.6% |
층 | 6 | 1.0% |
근 | 5 | 0.9% |
생 | 5 | 0.9% |
판 | 3 | 0.5% |
매 | 3 | 0.5% |
시 | 3 | 0.5% |
설 | 3 | 0.5% |
제 | 3 | 0.5% |
Other values (2) | 4 | 0.7% |
Latin
Value | Count | Frequency (%) |
B | 115 | |
C | 59 | |
A | 59 | |
D | 30 | 11.3% |
T | 2 | 0.8% |
H | 1 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 34735 | |
Hangul | 586 | 1.7% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 9947 | |
1 | 6854 | |
2 | 4467 | |
3 | 3345 | 9.6% |
4 | 2967 | 8.5% |
5 | 1984 | 5.7% |
6 | 1568 | 4.5% |
7 | 1248 | 3.6% |
8 | 1057 | 3.0% |
9 | 985 | 2.8% |
Other values (8) | 313 | 0.9% |
Hangul
Value | Count | Frequency (%) |
호 | 395 | |
동 | 156 | 26.6% |
층 | 6 | 1.0% |
근 | 5 | 0.9% |
생 | 5 | 0.9% |
판 | 3 | 0.5% |
매 | 3 | 0.5% |
시 | 3 | 0.5% |
설 | 3 | 0.5% |
제 | 3 | 0.5% |
Other values (2) | 4 | 0.7% |
평형_구분_명
Text
Distinct | 920 |
---|---|
Distinct (%) | 9.3% |
Missing | 83 |
Missing (%) | 0.8% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
84a | 1099 | 11.0% |
59a | 946 | 9.5% |
84b | 585 | 5.9% |
49a | 386 | 3.9% |
59b | 284 | 2.9% |
84c | 277 | 2.8% |
49b | 262 | 2.6% |
59 | 237 | 2.4% |
39a | 199 | 2.0% |
74a | 184 | 1.8% |
Other values (899) | 5497 |
Most occurring characters
Value | Count | Frequency (%) |
4 | 5255 | |
A | 4174 | |
9 | 3760 | |
8 | 3197 | |
1 | 2681 | |
5 | 2546 | 7.7% |
B | 2077 | 6.3% |
3 | 1152 | 3.5% |
2 | 1002 | 3.0% |
- | 954 | 2.9% |
Other values (81) | 6370 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 21415 | |
Uppercase Letter | 8803 | |
Other Letter | 1376 | 4.1% |
Dash Punctuation | 954 | 2.9% |
Lowercase Letter | 217 | 0.7% |
Other Punctuation | 120 | 0.4% |
Open Punctuation | 117 | 0.4% |
Close Punctuation | 117 | 0.4% |
Space Separator | 39 | 0.1% |
Other Symbol | 10 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
형 | 158 | 11.5% |
생 | 133 | 9.7% |
근 | 73 | 5.3% |
시 | 72 | 5.2% |
임 | 67 | 4.9% |
대 | 67 | 4.9% |
주 | 65 | 4.7% |
활 | 63 | 4.6% |
택 | 60 | 4.4% |
도 | 60 | 4.4% |
Other values (33) | 558 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 4174 | |
B | 2077 | |
C | 872 | 9.9% |
D | 475 | 5.4% |
E | 239 | 2.7% |
O | 185 | 2.1% |
S | 168 | 1.9% |
F | 152 | 1.7% |
T | 148 | 1.7% |
M | 76 | 0.9% |
Other values (12) | 237 | 2.7% |
Decimal Number
Value | Count | Frequency (%) |
4 | 5255 | |
9 | 3760 | |
8 | 3197 | |
1 | 2681 | |
5 | 2546 | |
3 | 1152 | 5.4% |
2 | 1002 | 4.7% |
0 | 771 | 3.6% |
7 | 647 | 3.0% |
6 | 404 | 1.9% |
Lowercase Letter
Value | Count | Frequency (%) |
a | 94 | |
b | 25 | 11.5% |
e | 21 | 9.7% |
p | 21 | 9.7% |
y | 21 | 9.7% |
t | 21 | 9.7% |
d | 9 | 4.1% |
c | 5 | 2.3% |
Other Punctuation
Value | Count | Frequency (%) |
. | 98 | |
' | 21 | 17.5% |
, | 1 | 0.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 954 |
Open Punctuation
Value | Count | Frequency (%) |
( | 117 |
Close Punctuation
Value | Count | Frequency (%) |
) | 117 |
Space Separator
Value | Count | Frequency (%) |
39 |
Other Symbol
Value | Count | Frequency (%) |
㎡ | 10 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 22772 | |
Latin | 9020 | 27.2% |
Hangul | 1376 | 4.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
형 | 158 | 11.5% |
생 | 133 | 9.7% |
근 | 73 | 5.3% |
시 | 72 | 5.2% |
임 | 67 | 4.9% |
대 | 67 | 4.9% |
주 | 65 | 4.7% |
활 | 63 | 4.6% |
택 | 60 | 4.4% |
도 | 60 | 4.4% |
Other values (33) | 558 |
Latin
Value | Count | Frequency (%) |
A | 4174 | |
B | 2077 | |
C | 872 | 9.7% |
D | 475 | 5.3% |
E | 239 | 2.6% |
O | 185 | 2.1% |
S | 168 | 1.9% |
F | 152 | 1.7% |
T | 148 | 1.6% |
a | 94 | 1.0% |
Other values (20) | 436 | 4.8% |
Common
Value | Count | Frequency (%) |
4 | 5255 | |
9 | 3760 | |
8 | 3197 | |
1 | 2681 | |
5 | 2546 | |
3 | 1152 | 5.1% |
2 | 1002 | 4.4% |
- | 954 | 4.2% |
0 | 771 | 3.4% |
7 | 647 | 2.8% |
Other values (8) | 807 | 3.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 31782 | |
Hangul | 1376 | 4.1% |
CJK Compat | 10 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
4 | 5255 | |
A | 4174 | |
9 | 3760 | |
8 | 3197 | |
1 | 2681 | |
5 | 2546 | |
B | 2077 | 6.5% |
3 | 1152 | 3.6% |
2 | 1002 | 3.2% |
- | 954 | 3.0% |
Other values (37) | 4984 |
Hangul
Value | Count | Frequency (%) |
형 | 158 | 11.5% |
생 | 133 | 9.7% |
근 | 73 | 5.3% |
시 | 72 | 5.2% |
임 | 67 | 4.9% |
대 | 67 | 4.9% |
주 | 65 | 4.7% |
활 | 63 | 4.6% |
택 | 60 | 4.4% |
도 | 60 | 4.4% |
Other values (33) | 558 |
CJK Compat
Value | Count | Frequency (%) |
㎡ | 10 |
층_번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 63 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10.2905 |
Minimum | 0 |
---|---|
Maximum | 65 |
Zeros | 2 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 4 |
median | 8 |
Q3 | 14 |
95-th percentile | 26 |
Maximum | 65 |
Range | 65 |
Interquartile range (IQR) | 10 |
Descriptive statistics
Standard deviation | 8.0813638 |
---|---|
Coefficient of variation (CV) | 0.78532275 |
Kurtosis | 3.9588667 |
Mean | 10.2905 |
Median Absolute Deviation (MAD) | 5 |
Skewness | 1.5757461 |
Sum | 102905 |
Variance | 65.308441 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 685 | 6.9% |
4 | 681 | 6.8% |
3 | 664 | 6.6% |
5 | 641 | 6.4% |
6 | 636 | 6.4% |
1 | 620 | 6.2% |
7 | 594 | 5.9% |
9 | 539 | 5.4% |
8 | 538 | 5.4% |
10 | 491 | 4.9% |
Other values (53) | 3911 |
Value | Count | Frequency (%) |
0 | 2 | < 0.1% |
1 | 620 | |
2 | 685 | |
3 | 664 | |
4 | 681 | |
5 | 641 | |
6 | 636 | |
7 | 594 | |
8 | 538 | |
9 | 539 |
Value | Count | Frequency (%) |
65 | 1 | < 0.1% |
63 | 2 | |
62 | 2 | |
61 | 2 | |
60 | 3 | |
58 | 2 | |
57 | 1 | < 0.1% |
56 | 2 | |
55 | 2 | |
54 | 3 |
층_구분_코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
20 | |
---|---|
10 | 90 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20 |
---|---|
2nd row | 20 |
3rd row | 20 |
4th row | 20 |
5th row | 20 |
Common Values
Value | Count | Frequency (%) |
20 | 9910 | |
10 | 90 | 0.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20 | 9910 | |
10 | 90 | 0.9% |
관리_건축물대장_참조_pk
Text
MISSING
 
Distinct | 89 |
---|---|
Distinct (%) | 100.0% |
Missing | 9911 |
Missing (%) | 99.1% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 13.685393 |
Min length | 11 |
Characters and Unicode
Total characters | 1218 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 89 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11305-52214 |
---|---|
2nd row | 11215-56321 |
3rd row | 11380-56951 |
4th row | 11320-107198 |
5th row | 11215-56280 |
Value | Count | Frequency (%) |
11215-56237 | 1 | 1.1% |
11215-56316 | 1 | 1.1% |
11350-100181860 | 1 | 1.1% |
11350-172291 | 1 | 1.1% |
11350-172194 | 1 | 1.1% |
11260-1000000000000002690854 | 1 | 1.1% |
11320-98292 | 1 | 1.1% |
11350-100181850 | 1 | 1.1% |
11320-84169 | 1 | 1.1% |
11215-56331 | 1 | 1.1% |
Other values (79) | 79 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 339 | |
0 | 245 | |
2 | 98 | 8.0% |
5 | 95 | 7.8% |
- | 89 | 7.3% |
3 | 85 | 7.0% |
8 | 76 | 6.2% |
6 | 62 | 5.1% |
9 | 57 | 4.7% |
7 | 37 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1129 | |
Dash Punctuation | 89 | 7.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 339 | |
0 | 245 | |
2 | 98 | 8.7% |
5 | 95 | 8.4% |
3 | 85 | 7.5% |
8 | 76 | 6.7% |
6 | 62 | 5.5% |
9 | 57 | 5.0% |
7 | 37 | 3.3% |
4 | 35 | 3.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 89 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1218 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 339 | |
0 | 245 | |
2 | 98 | 8.0% |
5 | 95 | 7.8% |
- | 89 | 7.3% |
3 | 85 | 7.0% |
8 | 76 | 6.2% |
6 | 62 | 5.1% |
9 | 57 | 4.7% |
7 | 37 | 3.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1218 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 339 | |
0 | 245 | |
2 | 98 | 8.0% |
5 | 95 | 7.8% |
- | 89 | 7.3% |
3 | 85 | 7.0% |
8 | 76 | 6.2% |
6 | 62 | 5.1% |
9 | 57 | 4.7% |
7 | 37 | 3.0% |
변경_구분_코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
1 | 92 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9724 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9908 | |
1 | 92 | 0.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9908 | |
1 | 92 | 0.9% |
작업_일자
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 94 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20229949 |
Minimum | 20201201 |
---|---|
Maximum | 20240510 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20201201 |
---|---|
5-th percentile | 20210806 |
Q1 | 20220416 |
median | 20230831 |
Q3 | 20240510 |
95-th percentile | 20240510 |
Maximum | 20240510 |
Range | 39309 |
Interquartile range (IQR) | 20094 |
Descriptive statistics
Standard deviation | 11750.572 |
---|---|
Coefficient of variation (CV) | 0.00058085033 |
Kurtosis | -0.91715369 |
Mean | 20229949 |
Median Absolute Deviation (MAD) | 9679 |
Skewness | -0.70663813 |
Sum | 2.0229949 × 1011 |
Variance | 1.3807595 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20240510 | 3978 | |
20211029 | 427 | 4.3% |
20230808 | 399 | 4.0% |
20211123 | 319 | 3.2% |
20240102 | 303 | 3.0% |
20230110 | 265 | 2.6% |
20230704 | 265 | 2.6% |
20220929 | 221 | 2.2% |
20231028 | 217 | 2.2% |
20220204 | 208 | 2.1% |
Other values (84) | 3398 |
Value | Count | Frequency (%) |
20201201 | 105 | |
20201208 | 1 | < 0.1% |
20201216 | 1 | < 0.1% |
20201230 | 26 | 0.3% |
20210106 | 26 | 0.3% |
20210216 | 118 | |
20210309 | 110 | |
20210601 | 94 | |
20210628 | 3 | < 0.1% |
20210806 | 150 |
Value | Count | Frequency (%) |
20240510 | 3978 | |
20240507 | 3 | < 0.1% |
20240425 | 9 | 0.1% |
20240402 | 6 | 0.1% |
20240327 | 6 | 0.1% |
20240302 | 95 | 0.9% |
20240227 | 5 | 0.1% |
20240223 | 1 | < 0.1% |
20240222 | 4 | < 0.1% |
20240221 | 1 | < 0.1% |
호_번호 | 층_번호 | 층_구분_코드 | 관리_건축물대장_참조_pk | 작업_일자 | |
---|---|---|---|---|---|
호_번호 | 1.000 | 0.454 | 0.023 | NaN | 0.216 |
층_번호 | 0.454 | 1.000 | 0.150 | 1.000 | 0.264 |
층_구분_코드 | 0.023 | 0.150 | 1.000 | 1.000 | 0.064 |
관리_건축물대장_참조_pk | NaN | 1.000 | 1.000 | 1.000 | 1.000 |
작업_일자 | 0.216 | 0.264 | 0.064 | 1.000 | 1.000 |
변경_구분_코드 | 층_구분_코드 | |
---|---|---|
변경_구분_코드 | 1.000 | 1.000 |
층_구분_코드 | 1.000 | 1.000 |
호_번호 | 층_번호 | 작업_일자 | 층_구분_코드 | 변경_구분_코드 | |
---|---|---|---|---|---|
호_번호 | 1.000 | 0.073 | -0.070 | 0.018 | 1.000 |
층_번호 | 0.073 | 1.000 | -0.214 | 0.115 | 1.000 |
작업_일자 | -0.070 | -0.214 | 1.000 | 0.044 | 1.000 |
층_구분_코드 | 0.018 | 0.115 | 0.044 | 1.000 | 1.000 |
변경_구분_코드 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
관리_호별_명세_pk | 관리_동별_개요_pk | 호_번호 | 호_명칭 | 평형_구분_명 | 층_번호 | 층_구분_코드 | 관리_건축물대장_참조_pk | 변경_구분_코드 | 작업_일자 | |
---|---|---|---|---|---|---|---|---|---|---|
60341 | 11230-1000000000000000054318 | 11230-1000000000000000007437 | 0 | 1003 | 84C | 10 | 20 | <NA> | <NA> | 20230523 |
84252 | 11350-1000000000000000521372 | 11350-1000000000000000097236 | 0 | 202 | 97A | 2 | 20 | <NA> | <NA> | 20230704 |
14501 | 11000-100017041 | 11000-100004454 | 401 | 902 | 49C | 9 | 20 | <NA> | <NA> | 20240510 |
4332 | 11000-100006342 | 11000-100003664 | 0 | 103 | 59B-1 | 1 | 20 | <NA> | <NA> | 20240510 |
96233 | 11380-1000000000000000594147 | 11380-1000000000000000106283 | 0 | 302 | 39 | 3 | 20 | <NA> | <NA> | 20230808 |
93797 | 11380-1000000000000000282565 | 11380-1000000000000000052678 | 0 | 1003 | 84A | 10 | 20 | <NA> | <NA> | 20230831 |
69914 | 11260-100036087 | 11260-100010226 | 0 | 1203 | 59 | 12 | 20 | <NA> | <NA> | 20211029 |
28628 | 11000-100031977 | 11000-100003327 | 0 | 302 | 49A | 3 | 20 | <NA> | <NA> | 20240510 |
89375 | 11350-100044361 | 11350-100017623 | 0 | 602 | 74B | 6 | 20 | <NA> | <NA> | 20220914 |
52804 | 11170-100035175 | 11170-100005943 | 30 | 102동 113호 | 102동 113호 | 1 | 20 | <NA> | <NA> | 20211029 |
관리_호별_명세_pk | 관리_동별_개요_pk | 호_번호 | 호_명칭 | 평형_구분_명 | 층_번호 | 층_구분_코드 | 관리_건축물대장_참조_pk | 변경_구분_코드 | 작업_일자 | |
---|---|---|---|---|---|---|---|---|---|---|
16190 | 11000-100018743 | 11000-100004948 | 322 | 801 | 49B1 | 8 | 20 | <NA> | <NA> | 20240510 |
56383 | 11215-100026158 | 11215-100005204 | 0 | 1202 | 40C | 12 | 20 | <NA> | <NA> | 20210901 |
21893 | 11000-100025097 | 11000-100003364 | 0 | 1201 | 114A | 12 | 20 | <NA> | <NA> | 20240510 |
39376 | 11000-100048790 | 11000-100011918 | 0 | 905 | 70A | 9 | 20 | <NA> | <NA> | 20210806 |
71450 | 11260-100038218 | 11260-100010927 | 63 | 304 | 84A | 3 | 20 | <NA> | <NA> | 20211204 |
86703 | 11350-100041374 | 11350-100016457 | 0 | 804 | <NA> | 8 | 20 | 11350-100181783 | 1 | 20211029 |
63592 | 11230-1000000000000000561591 | 11230-1000000000000000101255 | 0 | A동3106 | 84A | 31 | 20 | <NA> | <NA> | 20231028 |
91387 | 11380-1000000000000000267254 | 11380-1000000000000000047608 | 0 | 702 | 59B-1 | 7 | 20 | <NA> | <NA> | 20230110 |
57277 | 11215-100027052 | 11215-100005243 | 0 | 1102 | C | 11 | 20 | <NA> | <NA> | 20211112 |
76993 | 11290-100076773 | 11290-100007707 | 0 | 2701 | 84A | 27 | 20 | <NA> | <NA> | 20240207 |