Dataset statistics
Number of variables | 24 |
---|---|
Number of observations | 10000 |
Missing cells | 27320 |
Missing cells (%) | 11.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.0 MiB |
Average record size in memory | 209.0 B |
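The missing-cell percentage above follows from the other figures in this table. A minimal sketch of that arithmetic, using only the reported counts (the variable names are illustrative, not part of the report):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 24
n_observations = 10_000
missing_cells = 27_320

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"Missing cells: {missing_cells} of {total_cells} ({missing_pct:.1f}%)")
```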
Variable types
Text | 8 |
---|---|
Categorical | 7 |
Numeric | 9 |
Dataset
Description | 관리_건축물대장_PK,관리_상위_건축물대장_PK,대장_구분_코드,대장_종류_코드,시군구_코드,법정동_코드,대지_구분_코드,번,지,특수지_명,블록,로트,건물_명,위반_건축물_여부,대장_일련번호,총괄표제부_일련번호,표제부_일련번호,전유부_일련번호,새주소_도로_코드,새주소_법정동_코드,새주소_지상지하_코드,새주소_본_번,새주소_부_번,변동_일자 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15387/S/1/datasetView.do |
대장_구분_코드 is highly imbalanced (80.9%) | Imbalance |
대장_종류_코드 is highly imbalanced (83.8%) | Imbalance |
대지_구분_코드 is highly imbalanced (99.5%) | Imbalance |
블록 is highly imbalanced (99.5%) | Imbalance |
위반_건축물_여부 is highly imbalanced (90.4%) | Imbalance |
새주소_지상지하_코드 is highly imbalanced (99.0%) | Imbalance |
관리_상위_건축물대장_PK has 304 (3.0%) missing values | Missing |
특수지_명 has 9995 (> 99.9%) missing values | Missing |
로트 has 9996 (> 99.9%) missing values | Missing |
건물_명 has 699 (7.0%) missing values | Missing |
새주소_부_번 has 6219 (62.2%) missing values | Missing |
지 is highly skewed (γ1 = 22.24380138) | Skewed |
표제부_일련번호 is highly skewed (γ1 = 50.42207266) | Skewed |
관리_건축물대장_PK has unique values | Unique |
지 has 6472 (64.7%) zeros | Zeros |
전유부_일련번호 has 470 (4.7%) zeros | Zeros |
새주소_부_번 has 2719 (27.2%) zeros | Zeros |
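The imbalance percentages in the alerts above are consistent with a normalized-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the class frequencies and k is the number of classes. This is an inference from the category counts listed later in this report, not a statement about the profiling tool's internals. The helper name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/log2(k): 0 for a uniform column, close to 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Shannon entropy in bits; zero counts contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Category counts taken from the per-variable sections of this report.
scores = {
    "대장_구분_코드": imbalance_score([9706, 294]),
    "대장_종류_코드": imbalance_score([9530, 292, 169, 9]),
    "위반_건축물_여부": imbalance_score([9876, 124]),
}
for name, score in scores.items():
    print(f"{name}: {100 * score:.1f}%")
```

Under this assumption the three scores round to the 80.9%, 83.8% and 90.4% shown in the alerts.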
Reproduction
Analysis started | 2024-04-20 20:55:54.852199 |
---|---|
Analysis finished | 2024-04-20 20:55:55.979036 |
Duration | 1.13 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
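The reported duration can be checked against the two timestamps in this table. A small sketch using only the Python standard library (the timestamp format string is an assumption based on how the values are printed):

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-04-20 20:55:54.852199", FMT)
finished = datetime.strptime("2024-04-20 20:55:55.979036", FMT)

duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```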
관리_건축물대장_PK
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 15.3597 |
Min length | 8 |
Characters and Unicode
Total characters | 153597 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 10000 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11740-100219604 |
---|---|
2nd row | 11590-101344 |
3rd row | 11440-1000000000000002992925 |
4th row | 11680-100227773 |
5th row | 11650-100231979 |
Value | Count | Frequency (%) |
11740-100219604 | 1 | < 0.1% |
11650-100289459 | 1 | < 0.1% |
11500-100348758 | 1 | < 0.1% |
11380-27108 | 1 | < 0.1% |
11530-1000000000000002438062 | 1 | < 0.1% |
11650-127728 | 1 | < 0.1% |
11740-100248628 | 1 | < 0.1% |
11500-100244257 | 1 | < 0.1% |
11710-100512505 | 1 | < 0.1% |
11500-100253978 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 44431 | |
1 | 35584 | |
2 | 12117 | 7.9% |
5 | 10849 | 7.1% |
- | 10000 | 6.5% |
4 | 8228 | 5.4% |
3 | 7909 | 5.1% |
6 | 7427 | 4.8% |
7 | 6540 | 4.3% |
8 | 5518 | 3.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 143597 | |
Dash Punctuation | 10000 | 6.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 44431 | |
1 | 35584 | |
2 | 12117 | 8.4% |
5 | 10849 | 7.6% |
4 | 8228 | 5.7% |
3 | 7909 | 5.5% |
6 | 7427 | 5.2% |
7 | 6540 | 4.6% |
8 | 5518 | 3.8% |
9 | 4994 | 3.5% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 153597 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 44431 | |
1 | 35584 | |
2 | 12117 | 7.9% |
5 | 10849 | 7.1% |
- | 10000 | 6.5% |
4 | 8228 | 5.4% |
3 | 7909 | 5.1% |
6 | 7427 | 4.8% |
7 | 6540 | 4.3% |
8 | 5518 | 3.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 153597 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 44431 | |
1 | 35584 | |
2 | 12117 | 7.9% |
5 | 10849 | 7.1% |
- | 10000 | 6.5% |
4 | 8228 | 5.4% |
3 | 7909 | 5.1% |
6 | 7427 | 4.8% |
7 | 6540 | 4.3% |
8 | 5518 | 3.6% |
관리_상위_건축물대장_PK
Text
MISSING
 
Distinct | 1552 |
---|---|
Distinct (%) | 16.0% |
Missing | 304 |
Missing (%) | 3.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 15 |
Mean length | 15.185644 |
Min length | 7 |
Characters and Unicode
Total characters | 147240 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 746 |
---|---|
Unique (%) | 7.7% |
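For a column with missing values, the figures above appear to be computed over non-missing rows only: the mean length and the distinct and unique percentages all match a denominator of 10000 - 304 = 9696. A short sketch of that check, using only numbers reported for 관리_상위_건축물대장_PK (this is an inference from the figures, not a description of the tool's implementation):

```python
# Figures copied from the 관리_상위_건축물대장_PK section.
n_rows = 10_000
missing = 304
total_characters = 147_240
distinct = 1_552
unique = 746

non_missing = n_rows - missing

mean_length = total_characters / non_missing
distinct_pct = 100 * distinct / non_missing
unique_pct = 100 * unique / non_missing

print(f"Mean length:  {mean_length:.6f}")
print(f"Distinct (%): {distinct_pct:.1f}%")
print(f"Unique (%):   {unique_pct:.1f}%")
```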
Sample
1st row | 11740-100218881 |
---|---|
2nd row | 11590-3995 |
3rd row | 11440-1000000000000002992792 |
4th row | 11680-100227746 |
5th row | 11650-100231970 |
Value | Count | Frequency (%) |
11710-100206053 | 243 | 2.5% |
11710-100512412 | 176 | 1.8% |
11500-100341833 | 115 | 1.2% |
11530-2775 | 100 | 1.0% |
11500-100323653 | 89 | 0.9% |
11500-100282992 | 88 | 0.9% |
11500-100331092 | 83 | 0.9% |
11230-1000000000000002550560 | 75 | 0.8% |
11500-100243739 | 71 | 0.7% |
11590-2414 | 65 | 0.7% |
Other values (1542) | 8591 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 42792 | |
1 | 34203 | |
2 | 12069 | 8.2% |
5 | 10773 | 7.3% |
- | 9696 | 6.6% |
3 | 7695 | 5.2% |
4 | 7339 | 5.0% |
7 | 6685 | 4.5% |
6 | 6410 | 4.4% |
9 | 4926 | 3.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 137544 | |
Dash Punctuation | 9696 | 6.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 42792 | |
1 | 34203 | |
2 | 12069 | 8.8% |
5 | 10773 | 7.8% |
3 | 7695 | 5.6% |
4 | 7339 | 5.3% |
7 | 6685 | 4.9% |
6 | 6410 | 4.7% |
9 | 4926 | 3.6% |
8 | 4652 | 3.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9696 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 147240 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 42792 | |
1 | 34203 | |
2 | 12069 | 8.2% |
5 | 10773 | 7.3% |
- | 9696 | 6.6% |
3 | 7695 | 5.2% |
4 | 7339 | 5.0% |
7 | 6685 | 4.5% |
6 | 6410 | 4.4% |
9 | 4926 | 3.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 147240 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 42792 | |
1 | 34203 | |
2 | 12069 | 8.2% |
5 | 10773 | 7.3% |
- | 9696 | 6.6% |
3 | 7695 | 5.2% |
4 | 7339 | 5.0% |
7 | 6685 | 4.5% |
6 | 6410 | 4.4% |
9 | 4926 | 3.3% |
대장_구분_코드
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
집합 | 9706 |
---|---|
일반 | 294 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 집합 |
---|---|
2nd row | 집합 |
3rd row | 집합 |
4th row | 집합 |
5th row | 집합 |
Common Values
Value | Count | Frequency (%) |
집합 | 9706 | 97.1% |
일반 | 294 | 2.9% |
대장_종류_코드
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
전유부 | 9530 |
---|---|
일반건축물 | 292 |
표제부 | 169 |
총괄표제부 | 9 |
Length
Max length | 5 |
---|---|
Median length | 3 |
Mean length | 3.0602 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 전유부 |
---|---|
2nd row | 전유부 |
3rd row | 전유부 |
4th row | 전유부 |
5th row | 전유부 |
Common Values
Value | Count | Frequency (%) |
전유부 | 9530 | 95.3% |
일반건축물 | 292 | 2.9% |
표제부 | 169 | 1.7% |
총괄표제부 | 9 | 0.1% |
시군구_코드
Categorical
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
강서구 | 2879 |
---|---|
서초구 | 1166 |
송파구 | 996 |
강남구 | 990 |
강동구 | 826 |
Other values (20) | 3143 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0592 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 강동구 |
---|---|
2nd row | 동작구 |
3rd row | 마포구 |
4th row | 강남구 |
5th row | 서초구 |
Common Values
Value | Count | Frequency (%) |
강서구 | 2879 | 28.8% |
서초구 | 1166 | 11.7% |
송파구 | 996 | 10.0% |
강남구 | 990 | 9.9% |
강동구 | 826 | 8.3% |
동작구 | 441 | 4.4% |
영등포구 | 428 | 4.3% |
구로구 | 365 | 3.6% |
성동구 | 275 | 2.8% |
마포구 | 262 | 2.6% |
Other values (15) | 1372 | 13.7% |
법정동_코드
Text
Distinct | 259 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
마곡동 | 2215 | |
상일동 | 524 | 5.2% |
문정동 | 497 | 5.0% |
서초동 | 375 | 3.8% |
신원동 | 374 | 3.7% |
세곡동 | 315 | 3.1% |
내곡동 | 281 | 2.8% |
염창동 | 227 | 2.3% |
마장동 | 227 | 2.3% |
거여동 | 184 | 1.8% |
Other values (249) | 4781 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 9973 | |
곡 | 3298 | 10.6% |
마 | 2457 | 7.9% |
상 | 826 | 2.7% |
신 | 789 | 2.5% |
일 | 686 | 2.2% |
정 | 578 | 1.9% |
원 | 556 | 1.8% |
문 | 550 | 1.8% |
도 | 443 | 1.4% |
Other values (165) | 10864 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 30684 | |
Decimal Number | 336 | 1.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 9973 | |
곡 | 3298 | 10.7% |
마 | 2457 | 8.0% |
상 | 826 | 2.7% |
신 | 789 | 2.6% |
일 | 686 | 2.2% |
정 | 578 | 1.9% |
원 | 556 | 1.8% |
문 | 550 | 1.8% |
도 | 443 | 1.4% |
Other values (157) | 10528 |
Decimal Number
Value | Count | Frequency (%) |
5 | 78 | |
4 | 68 | |
1 | 58 | |
2 | 42 | |
6 | 40 | |
3 | 25 | 7.4% |
8 | 13 | 3.9% |
7 | 12 | 3.6% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 30684 | |
Common | 336 | 1.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 9973 | |
곡 | 3298 | 10.7% |
마 | 2457 | 8.0% |
상 | 826 | 2.7% |
신 | 789 | 2.6% |
일 | 686 | 2.2% |
정 | 578 | 1.9% |
원 | 556 | 1.8% |
문 | 550 | 1.8% |
도 | 443 | 1.4% |
Other values (157) | 10528 |
Common
Value | Count | Frequency (%) |
5 | 78 | |
4 | 68 | |
1 | 58 | |
2 | 42 | |
6 | 40 | |
3 | 25 | 7.4% |
8 | 13 | 3.9% |
7 | 12 | 3.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 30684 | |
ASCII | 336 | 1.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 9973 | |
곡 | 3298 | 10.7% |
마 | 2457 | 8.0% |
상 | 826 | 2.7% |
신 | 789 | 2.6% |
일 | 686 | 2.2% |
정 | 578 | 1.9% |
원 | 556 | 1.8% |
문 | 550 | 1.8% |
도 | 443 | 1.4% |
Other values (157) | 10528 |
ASCII
Value | Count | Frequency (%) |
5 | 78 | |
4 | 68 | |
1 | 58 | |
2 | 42 | |
6 | 40 | |
3 | 25 | 7.4% |
8 | 13 | 3.9% |
7 | 12 | 3.6% |
대지_구분_코드
Categorical
IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
대지 | 9994 |
---|---|
블록 | 5 |
산 | 1 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 1.9999 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 대지 |
---|---|
2nd row | 대지 |
3rd row | 대지 |
4th row | 대지 |
5th row | 대지 |
Common Values
Value | Count | Frequency (%) |
대지 | 9994 | 99.9% |
블록 | 5 | 0.1% |
산 | 1 | < 0.1% |
번
Real number (ℝ)
Distinct | 757 |
---|---|
Distinct (%) | 7.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 647.1388 |
Minimum | 0 |
---|---|
Maximum | 4972 |
Zeros | 5 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 39 |
Q1 | 413 |
median | 634 |
Q3 | 750 |
95-th percentile | 1454 |
Maximum | 4972 |
Range | 4972 |
Interquartile range (IQR) | 337 |
Descriptive statistics
Standard deviation | 487.08341 |
---|---|
Coefficient of variation (CV) | 0.75267224 |
Kurtosis | 31.448832 |
Mean | 647.1388 |
Median Absolute Deviation (MAD) | 139 |
Skewness | 4.1123575 |
Sum | 6471388 |
Variance | 237250.25 |
Monotonicity | Not monotonic |
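Several of the statistics above are derived from one another, which makes the table easy to sanity-check. A minimal sketch, using only the values reported for 번 (small differences in the last digits are expected because the inputs are already rounded):

```python
# Figures copied from the 번 statistics reported above.
n = 10_000                  # observations, none missing for this column
mean = 647.1388
std = 487.08341
q1, q3 = 413, 750
minimum, maximum = 0, 4972

cv = std / mean             # coefficient of variation
variance = std ** 2
iqr = q3 - q1               # interquartile range
value_range = maximum - minimum
total = mean * n            # sum of all values

print(f"CV       = {cv:.6f}")
print(f"Variance = {variance:.2f}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range}")
print(f"Sum      = {total:.0f}")
```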
Value | Count | Frequency (%) |
743 | 654 | 6.5% |
519 | 507 | 5.1% |
750 | 279 | 2.8% |
751 | 255 | 2.5% |
634 | 243 | 2.4% |
818 | 218 | 2.2% |
639 | 177 | 1.8% |
799 | 176 | 1.8% |
411 | 151 | 1.5% |
747 | 150 | 1.5% |
Other values (747) | 7190 | 71.9% |
Value | Count | Frequency (%) |
0 | 5 | 0.1% |
1 | 11 | 0.1% |
2 | 10 | 0.1% |
3 | 2 | < 0.1% |
4 | 1 | < 0.1% |
5 | 24 | 0.2% |
6 | 10 | 0.1% |
7 | 3 | < 0.1% |
8 | 10 | 0.1% |
9 | 24 | 0.2% |
Value | Count | Frequency (%) |
4972 | 1 | < 0.1% |
4969 | 2 | < 0.1% |
4958 | 33 | 0.3% |
4950 | 1 | < 0.1% |
4942 | 12 | 0.1% |
4780 | 1 | < 0.1% |
4518 | 1 | < 0.1% |
4234 | 3 | < 0.1% |
3483 | 1 | < 0.1% |
3282 | 8 | 0.1% |
지
Real number (ℝ)
SKEWED
  ZEROS
 
Distinct | 147 |
---|---|
Distinct (%) | 1.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8.5215 |
Minimum | 0 |
---|---|
Maximum | 2003 |
Zeros | 6472 |
Zeros (%) | 64.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 3 |
95-th percentile | 35 |
Maximum | 2003 |
Range | 2003 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 50.900081 |
---|---|
Coefficient of variation (CV) | 5.9731363 |
Kurtosis | 688.16909 |
Mean | 8.5215 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 22.243801 |
Sum | 85215 |
Variance | 2590.8182 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 6472 | 64.7% |
1 | 581 | 5.8% |
4 | 452 | 4.5% |
2 | 359 | 3.6% |
5 | 276 | 2.8% |
3 | 181 | 1.8% |
6 | 170 | 1.7% |
11 | 107 | 1.1% |
41 | 106 | 1.1% |
15 | 92 | 0.9% |
Other values (137) | 1204 | 12.0% |
Value | Count | Frequency (%) |
0 | 6472 | 64.7% |
1 | 581 | 5.8% |
2 | 359 | 3.6% |
3 | 181 | 1.8% |
4 | 452 | 4.5% |
5 | 276 | 2.8% |
6 | 170 | 1.7% |
7 | 69 | 0.7% |
8 | 29 | 0.3% |
9 | 68 | 0.7% |
Value | Count | Frequency (%) |
2003 | 1 | < 0.1% |
1843 | 1 | < 0.1% |
1661 | 1 | < 0.1% |
1629 | 1 | < 0.1% |
1119 | 1 | < 0.1% |
1007 | 2 | |
704 | 1 | < 0.1% |
599 | 1 | < 0.1% |
570 | 1 | < 0.1% |
561 | 3 |
특수지_명
Text
MISSING
 
Distinct | 3 |
---|---|
Distinct (%) | 60.0% |
Missing | 9995 |
Missing (%) | > 99.9% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
고덕강일공공주택지구 | 2 | |
공공주택지구 | 2 | |
1구역1블럭13롯트 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
공 | 8 | |
구 | 5 | |
주 | 4 | |
택 | 4 | |
지 | 4 | |
1 | 3 | 7.1% |
고 | 2 | 4.8% |
덕 | 2 | 4.8% |
강 | 2 | 4.8% |
일 | 2 | 4.8% |
Other values (6) | 6 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 38 | |
Decimal Number | 4 | 9.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
공 | 8 | |
구 | 5 | |
주 | 4 | |
택 | 4 | |
지 | 4 | |
고 | 2 | 5.3% |
덕 | 2 | 5.3% |
강 | 2 | 5.3% |
일 | 2 | 5.3% |
역 | 1 | 2.6% |
Other values (4) | 4 |
Decimal Number
Value | Count | Frequency (%) |
1 | 3 | |
3 | 1 | 25.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 38 | |
Common | 4 | 9.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
공 | 8 | |
구 | 5 | |
주 | 4 | |
택 | 4 | |
지 | 4 | |
고 | 2 | 5.3% |
덕 | 2 | 5.3% |
강 | 2 | 5.3% |
일 | 2 | 5.3% |
역 | 1 | 2.6% |
Other values (4) | 4 |
Common
Value | Count | Frequency (%) |
1 | 3 | |
3 | 1 | 25.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 38 | |
ASCII | 4 | 9.5% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
공 | 8 | |
구 | 5 | |
주 | 4 | |
택 | 4 | |
지 | 4 | |
고 | 2 | 5.3% |
덕 | 2 | 5.3% |
강 | 2 | 5.3% |
일 | 2 | 5.3% |
역 | 1 | 2.6% |
Other values (4) | 4 |
ASCII
Value | Count | Frequency (%) |
1 | 3 | |
3 | 1 | 25.0% |
블록
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | 9996 |
---|---|
근린생활용지 | 4 |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 4.0008 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9996 | > 99.9% |
근린생활용지 | 4 | < 0.1% |
로트
Text
MISSING
 
Distinct | 2 |
---|---|
Distinct (%) | 50.0% |
Missing | 9996 |
Missing (%) | > 99.9% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
2-1블럭 | 2 | |
3-3블럭 | 2 |
Most occurring characters
Value | Count | Frequency (%) |
- | 4 | |
블 | 4 | |
럭 | 4 | |
3 | 4 | |
2 | 2 | |
1 | 2 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 8 | |
Decimal Number | 8 | |
Dash Punctuation | 4 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
3 | 4 | |
2 | 2 | |
1 | 2 |
Other Letter
Value | Count | Frequency (%) |
블 | 4 | |
럭 | 4 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 12 | |
Hangul | 8 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 4 | |
3 | 4 | |
2 | 2 | |
1 | 2 |
Hangul
Value | Count | Frequency (%) |
블 | 4 | |
럭 | 4 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 12 | |
Hangul | 8 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 4 | |
3 | 4 | |
2 | 2 | |
1 | 2 |
Hangul
Value | Count | Frequency (%) |
블 | 4 | |
럭 | 4 |
건물_명
Text
MISSING
 
Distinct | 847 |
---|---|
Distinct (%) | 9.1% |
Missing | 699 |
Missing (%) | 7.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
고덕아르테온 | 506 | 4.1% |
힐스테이트 | 339 | 2.7% |
마곡엠밸리6단지 | 312 | 2.5% |
마곡엠밸리14단지 | 273 | 2.2% |
마곡엠밸리15단지 | 254 | 2.0% |
가든파이브라이프 | 243 | 2.0% |
마곡엠밸리7단지 | 226 | 1.8% |
청계현대아파트 | 217 | 1.7% |
래미안 | 183 | 1.5% |
힐스테이트에코송파 | 176 | 1.4% |
Other values (1006) | 9705 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 3133 | 4.2% |
지 | 2957 | 4.0% |
리 | 2860 | 3.9% |
파 | 2467 | 3.3% |
스 | 2434 | 3.3% |
단 | 2335 | 3.2% |
마 | 2216 | 3.0% |
트 | 2191 | 3.0% |
곡 | 2157 | 2.9% |
이 | 2129 | 2.9% |
Other values (461) | 49090 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 65314 | |
Decimal Number | 4018 | 5.4% |
Space Separator | 3133 | 4.2% |
Uppercase Letter | 824 | 1.1% |
Lowercase Letter | 246 | 0.3% |
Other Punctuation | 209 | 0.3% |
Letter Number | 135 | 0.2% |
Open Punctuation | 42 | 0.1% |
Close Punctuation | 42 | 0.1% |
Dash Punctuation | 6 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
지 | 2957 | 4.5% |
리 | 2860 | 4.4% |
파 | 2467 | 3.8% |
스 | 2434 | 3.7% |
단 | 2335 | 3.6% |
마 | 2216 | 3.4% |
트 | 2191 | 3.4% |
곡 | 2157 | 3.3% |
이 | 2129 | 3.3% |
아 | 2115 | 3.2% |
Other values (402) | 41453 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 136 | |
H | 86 | |
L | 84 | |
E | 69 | |
K | 67 | |
C | 66 | |
I | 66 | |
M | 59 | |
W | 35 | 4.2% |
V | 33 | 4.0% |
Other values (14) | 123 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 40 | |
o | 31 | |
u | 28 | |
s | 27 | |
t | 26 | |
a | 22 | |
y | 21 | |
n | 13 | 5.3% |
d | 13 | 5.3% |
k | 12 | 4.9% |
Other values (5) | 13 | 5.3% |
Decimal Number
Value | Count | Frequency (%) |
1 | 921 | |
2 | 562 | |
4 | 527 | |
3 | 463 | |
5 | 462 | |
6 | 378 | |
0 | 367 | 9.1% |
7 | 290 | 7.2% |
8 | 39 | 1.0% |
9 | 9 | 0.2% |
Other Punctuation
Value | Count | Frequency (%) |
. | 204 | |
& | 2 | 1.0% |
, | 2 | 1.0% |
/ | 1 | 0.5% |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 68 | |
Ⅱ | 67 |
Space Separator
Value | Count | Frequency (%) |
(space) | 3133 |
Open Punctuation
Value | Count | Frequency (%) |
( | 42 |
Close Punctuation
Value | Count | Frequency (%) |
) | 42 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 65311 | |
Common | 7450 | 10.1% |
Latin | 1205 | 1.6% |
Han | 3 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
지 | 2957 | 4.5% |
리 | 2860 | 4.4% |
파 | 2467 | 3.8% |
스 | 2434 | 3.7% |
단 | 2335 | 3.6% |
마 | 2216 | 3.4% |
트 | 2191 | 3.4% |
곡 | 2157 | 3.3% |
이 | 2129 | 3.3% |
아 | 2115 | 3.2% |
Other values (399) | 41450 |
Latin
Value | Count | Frequency (%) |
S | 136 | 11.3% |
H | 86 | 7.1% |
L | 84 | 7.0% |
E | 69 | 5.7% |
Ⅰ | 68 | 5.6% |
K | 67 | 5.6% |
Ⅱ | 67 | 5.6% |
C | 66 | 5.5% |
I | 66 | 5.5% |
M | 59 | 4.9% |
Other values (31) | 437 |
Common
Value | Count | Frequency (%) |
(space) | 3133 | |
1 | 921 | 12.4% |
2 | 562 | 7.5% |
4 | 527 | 7.1% |
3 | 463 | 6.2% |
5 | 462 | 6.2% |
6 | 378 | 5.1% |
0 | 367 | 4.9% |
7 | 290 | 3.9% |
. | 204 | 2.7% |
Other values (8) | 143 | 1.9% |
Han
Value | Count | Frequency (%) |
笑 | 1 | |
家 | 1 | |
美 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 65311 | |
ASCII | 8520 | 11.5% |
Number Forms | 135 | 0.2% |
CJK | 3 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 3133 | |
1 | 921 | 10.8% |
2 | 562 | 6.6% |
4 | 527 | 6.2% |
3 | 463 | 5.4% |
5 | 462 | 5.4% |
6 | 378 | 4.4% |
0 | 367 | 4.3% |
7 | 290 | 3.4% |
. | 204 | 2.4% |
Other values (47) | 1213 | 14.2% |
Hangul
Value | Count | Frequency (%) |
지 | 2957 | 4.5% |
리 | 2860 | 4.4% |
파 | 2467 | 3.8% |
스 | 2434 | 3.7% |
단 | 2335 | 3.6% |
마 | 2216 | 3.4% |
트 | 2191 | 3.4% |
곡 | 2157 | 3.3% |
이 | 2129 | 3.3% |
아 | 2115 | 3.2% |
Other values (399) | 41450 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 68 | |
Ⅱ | 67 |
CJK
Value | Count | Frequency (%) |
笑 | 1 | |
家 | 1 | |
美 | 1 |
위반_건축물_여부
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | 9876 |
---|---|
위반건축물 | 124 |
Length
Max length | 5 |
---|---|
Median length | 4 |
Mean length | 4.0124 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9876 | 98.8% |
위반건축물 | 124 | 1.2% |
대장_일련번호
Real number (ℝ)
Distinct | 1117 |
---|---|
Distinct (%) | 11.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 295.1831 |
Minimum | 1 |
---|---|
Maximum | 28850 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 18 |
median | 51 |
Q3 | 143.25 |
95-th percentile | 1230 |
Maximum | 28850 |
Range | 28849 |
Interquartile range (IQR) | 125.25 |
Descriptive statistics
Standard deviation | 1017.6025 |
---|---|
Coefficient of variation (CV) | 3.4473602 |
Kurtosis | 156.58114 |
Mean | 295.1831 |
Median Absolute Deviation (MAD) | 41 |
Skewness | 9.4780141 |
Sum | 2951831 |
Variance | 1035514.8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 301 | 3.0% |
10 | 262 | 2.6% |
2 | 153 | 1.5% |
3 | 148 | 1.5% |
5 | 146 | 1.5% |
7 | 141 | 1.4% |
8 | 136 | 1.4% |
4 | 133 | 1.3% |
20 | 132 | 1.3% |
6 | 128 | 1.3% |
Other values (1107) | 8320 |
Value | Count | Frequency (%) |
1 | 301 | |
2 | 153 | |
3 | 148 | |
4 | 133 | |
5 | 146 | |
6 | 128 | |
7 | 141 | |
8 | 136 | |
9 | 124 | |
10 | 262 |
Value | Count | Frequency (%) |
28850 | 1 | |
28190 | 1 | |
14251 | 1 | |
14000 | 1 | |
11690 | 1 | |
9900 | 1 | |
8860 | 1 | |
8853 | 1 | |
8850 | 1 | |
8847 | 1 |
총괄표제부_일련번호
Real number (ℝ)
Distinct | 34 |
---|---|
Distinct (%) | 0.3% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.454145 |
Minimum | 1 |
---|---|
Maximum | 69 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 23 |
95-th percentile | 69 |
Maximum | 69 |
Range | 68 |
Interquartile range (IQR) | 22 |
Descriptive statistics
Standard deviation | 18.146998 |
---|---|
Coefficient of variation (CV) | 1.3488035 |
Kurtosis | 1.9693821 |
Mean | 13.454145 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.5847241 |
Sum | 134528 |
Variance | 329.31354 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 5473 | |
69 | 506 | 5.1% |
35 | 466 | 4.7% |
40 | 380 | 3.8% |
16 | 331 | 3.3% |
24 | 258 | 2.6% |
39 | 244 | 2.4% |
30 | 227 | 2.3% |
9 | 216 | 2.2% |
11 | 211 | 2.1% |
Other values (24) | 1687 | 16.9% |
Value | Count | Frequency (%) |
1 | 5473 | |
2 | 2 | < 0.1% |
3 | 30 | 0.3% |
4 | 120 | 1.2% |
5 | 15 | 0.1% |
6 | 50 | 0.5% |
7 | 57 | 0.6% |
8 | 1 | < 0.1% |
9 | 216 | 2.2% |
10 | 79 | 0.8% |
Value | Count | Frequency (%) |
69 | 506 | |
40 | 380 | |
39 | 244 | |
35 | 466 | |
33 | 33 | 0.3% |
31 | 34 | 0.3% |
30 | 227 | |
29 | 108 | 1.1% |
28 | 60 | 0.6% |
26 | 45 | 0.4% |
표제부_일련번호
Real number (ℝ)
SKEWED
 
Distinct | 102 |
---|---|
Distinct (%) | 1.0% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 23.883188 |
Minimum | 0 |
---|---|
Maximum | 8230 |
Zeros | 9 |
Zeros (%) | 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 10 |
Q3 | 21 |
95-th percentile | 68 |
Maximum | 8230 |
Range | 8230 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 131.59686 |
---|---|
Coefficient of variation (CV) | 5.5100205 |
Kurtosis | 3052.0427 |
Mean | 23.883188 |
Median Absolute Deviation (MAD) | 9 |
Skewness | 50.422073 |
Sum | 238808 |
Variance | 17317.733 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3468 | |
10 | 765 | 7.6% |
9 | 246 | 2.5% |
8 | 244 | 2.4% |
12 | 237 | 2.4% |
3 | 237 | 2.4% |
20 | 221 | 2.2% |
13 | 196 | 2.0% |
11 | 189 | 1.9% |
18 | 187 | 1.9% |
Other values (92) | 4009 |
Value | Count | Frequency (%) |
0 | 9 | 0.1% |
1 | 3468 | |
2 | 186 | 1.9% |
3 | 237 | 2.4% |
4 | 58 | 0.6% |
5 | 128 | 1.3% |
6 | 171 | 1.7% |
7 | 108 | 1.1% |
8 | 244 | 2.4% |
9 | 246 | 2.5% |
Value | Count | Frequency (%) |
8230 | 1 | < 0.1% |
8220 | 1 | < 0.1% |
3100 | 1 | < 0.1% |
1720 | 1 | < 0.1% |
1160 | 5 | 0.1% |
680 | 1 | < 0.1% |
670 | 1 | < 0.1% |
580 | 2 | < 0.1% |
450 | 4 | < 0.1% |
330 | 27 |
전유부_일련번호
Real number (ℝ)
ZEROS
 
Distinct | 1118 |
---|---|
Distinct (%) | 11.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 294.7276 |
Minimum | 0 |
---|---|
Maximum | 28850 |
Zeros | 470 |
Zeros (%) | 4.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 18 |
median | 50 |
Q3 | 143 |
95-th percentile | 1230 |
Maximum | 28850 |
Range | 28850 |
Interquartile range (IQR) | 125 |
Descriptive statistics
Standard deviation | 1017.7203 |
---|---|
Coefficient of variation (CV) | 3.453088 |
Kurtosis | 156.52423 |
Mean | 294.7276 |
Median Absolute Deviation (MAD) | 40 |
Skewness | 9.4759702 |
Sum | 2947276 |
Variance | 1035754.7 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 470 | 4.7% |
5 | 138 | 1.4% |
3 | 137 | 1.4% |
2 | 136 | 1.4% |
7 | 134 | 1.3% |
10 | 133 | 1.3% |
8 | 128 | 1.3% |
4 | 126 | 1.3% |
20 | 124 | 1.2% |
9 | 122 | 1.2% |
Other values (1108) | 8352 |
Value | Count | Frequency (%) |
0 | 470 | |
1 | 112 | 1.1% |
2 | 136 | 1.4% |
3 | 137 | 1.4% |
4 | 126 | 1.3% |
5 | 138 | 1.4% |
6 | 119 | 1.2% |
7 | 134 | 1.3% |
8 | 128 | 1.3% |
9 | 122 | 1.2% |
Value | Count | Frequency (%) |
28850 | 1 | |
28190 | 1 | |
14251 | 1 | |
14000 | 1 | |
11690 | 1 | |
9900 | 1 | |
8860 | 1 | |
8853 | 1 | |
8850 | 1 | |
8847 | 1 |
새주소_도로_코드
Text
Distinct | 954 |
---|---|
Distinct (%) | 9.6% |
Missing | 36 |
Missing (%) | 0.4% |
Memory size | 156.2 KiB |
Length
Max length | 20 |
---|---|
Median length | 18 |
Mean length | 15.006022 |
Min length | 12 |
Characters and Unicode
Total characters | 149520 |
---|---|
Distinct characters | 256 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 553 |
---|---|
Unique (%) | 5.5% |
Sample
1st row | 서울특별시 강동구 구천면로 |
---|---|
2nd row | 서울특별시 동작구 여의대방로 |
3rd row | 서울특별시 마포구 모래내로1길 |
4th row | 서울특별시 강남구 밤고개로21길 |
5th row | 서울특별시 서초구 헌릉로8길 |
Value | Count | Frequency (%) |
서울특별시 | 9964 | |
강서구 | 2879 | 9.6% |
서초구 | 1166 | 3.9% |
송파구 | 996 | 3.3% |
강남구 | 990 | 3.3% |
강동구 | 826 | 2.8% |
마곡서1로 | 608 | 2.0% |
마곡중앙로 | 531 | 1.8% |
고덕로 | 525 | 1.8% |
동작구 | 440 | 1.5% |
Other values (946) | 10967 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 19928 | 13.3% |
서 | 15455 | 10.3% |
구 | 10446 | 7.0% |
로 | 10027 | 6.7% |
시 | 10022 | 6.7% |
울 | 9979 | 6.7% |
특 | 9964 | 6.7% |
별 | 9964 | 6.7% |
강 | 4890 | 3.3% |
길 | 4112 | 2.8% |
Other values (246) | 44733 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 121660 | |
Space Separator | 19928 | 13.3% |
Decimal Number | 7930 | 5.3% |
Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
서 | 15455 | |
구 | 10446 | 8.6% |
로 | 10027 | 8.2% |
시 | 10022 | 8.2% |
울 | 9979 | 8.2% |
특 | 9964 | 8.2% |
별 | 9964 | 8.2% |
강 | 4890 | 4.0% |
길 | 4112 | 3.4% |
마 | 2380 | 2.0% |
Other values (234) | 34421 |
Decimal Number
Value | Count | Frequency (%) |
1 | 2123 | |
3 | 937 | |
2 | 779 | 9.8% |
5 | 766 | 9.7% |
4 | 663 | 8.4% |
8 | 637 | 8.0% |
7 | 612 | 7.7% |
9 | 513 | 6.5% |
6 | 458 | 5.8% |
0 | 442 | 5.6% |
Space Separator
Value | Count | Frequency (%) |
(space) | 19928 |
Other Punctuation
Value | Count | Frequency (%) |
. | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 121660 | |
Common | 27860 | 18.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
서 | 15455 | |
구 | 10446 | 8.6% |
로 | 10027 | 8.2% |
시 | 10022 | 8.2% |
울 | 9979 | 8.2% |
특 | 9964 | 8.2% |
별 | 9964 | 8.2% |
강 | 4890 | 4.0% |
길 | 4112 | 3.4% |
마 | 2380 | 2.0% |
Other values (234) | 34421 |
Common
Value | Count | Frequency (%) |
(space) | 19928 | |
1 | 2123 | 7.6% |
3 | 937 | 3.4% |
2 | 779 | 2.8% |
5 | 766 | 2.7% |
4 | 663 | 2.4% |
8 | 637 | 2.3% |
7 | 612 | 2.2% |
9 | 513 | 1.8% |
6 | 458 | 1.6% |
Other values (2) | 444 | 1.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 121660 | |
ASCII | 27860 | 18.6% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 19928 | |
1 | 2123 | 7.6% |
3 | 937 | 3.4% |
2 | 779 | 2.8% |
5 | 766 | 2.7% |
4 | 663 | 2.4% |
8 | 637 | 2.3% |
7 | 612 | 2.2% |
9 | 513 | 1.8% |
6 | 458 | 1.6% |
Other values (2) | 444 | 1.6% |
Hangul
Value | Count | Frequency (%) |
서 | 15455 | |
구 | 10446 | 8.6% |
로 | 10027 | 8.2% |
시 | 10022 | 8.2% |
울 | 9979 | 8.2% |
특 | 9964 | 8.2% |
별 | 9964 | 8.2% |
강 | 4890 | 4.0% |
길 | 4112 | 3.4% |
마 | 2380 | 2.0% |
Other values (234) | 34421 |
새주소_법정동_코드
Text
Distinct | 259 |
---|---|
Distinct (%) | 2.6% |
Missing | 36 |
Missing (%) | 0.4% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
마곡동 | 2214 | |
상일동 | 524 | 5.3% |
문정동 | 497 | 5.0% |
서초동 | 375 | 3.8% |
신원동 | 374 | 3.8% |
세곡동 | 315 | 3.2% |
내곡동 | 281 | 2.8% |
마장동 | 227 | 2.3% |
염창동 | 227 | 2.3% |
거여동 | 184 | 1.8% |
Other values (249) | 4746 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 9938 | |
곡 | 3297 | 10.7% |
마 | 2456 | 7.9% |
상 | 826 | 2.7% |
신 | 771 | 2.5% |
일 | 686 | 2.2% |
정 | 571 | 1.8% |
원 | 556 | 1.8% |
문 | 550 | 1.8% |
도 | 443 | 1.4% |
Other values (165) | 10813 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 30574 | |
Decimal Number | 333 | 1.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 9938 | |
곡 | 3297 | 10.8% |
마 | 2456 | 8.0% |
상 | 826 | 2.7% |
신 | 771 | 2.5% |
일 | 686 | 2.2% |
정 | 571 | 1.9% |
원 | 556 | 1.8% |
문 | 550 | 1.8% |
도 | 443 | 1.4% |
Other values (157) | 10480 |
Decimal Number
Value | Count | Frequency (%) |
5 | 76 | |
4 | 68 | |
1 | 58 | |
2 | 42 | |
6 | 39 | |
3 | 25 | 7.5% |
8 | 13 | 3.9% |
7 | 12 | 3.6% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 30574 | 98.9% |
Common | 333 | 1.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 9938 | 32.5% |
곡 | 3297 | 10.8% |
마 | 2456 | 8.0% |
상 | 826 | 2.7% |
신 | 771 | 2.5% |
일 | 686 | 2.2% |
정 | 571 | 1.9% |
원 | 556 | 1.8% |
문 | 550 | 1.8% |
도 | 443 | 1.4% |
Other values (157) | 10480 | 34.3% |
Common
Value | Count | Frequency (%) |
5 | 76 | 22.8% |
4 | 68 | 20.4% |
1 | 58 | 17.4% |
2 | 42 | 12.6% |
6 | 39 | 11.7% |
3 | 25 | 7.5% |
8 | 13 | 3.9% |
7 | 12 | 3.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 30574 | 98.9% |
ASCII | 333 | 1.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 9938 | 32.5% |
곡 | 3297 | 10.8% |
마 | 2456 | 8.0% |
상 | 826 | 2.7% |
신 | 771 | 2.5% |
일 | 686 | 2.2% |
정 | 571 | 1.9% |
원 | 556 | 1.8% |
문 | 550 | 1.8% |
도 | 443 | 1.4% |
Other values (157) | 10480 | 34.3% |
ASCII
Value | Count | Frequency (%) |
5 | 76 | 22.8% |
4 | 68 | 20.4% |
1 | 58 | 17.4% |
2 | 42 | 12.6% |
6 | 39 | 11.7% |
3 | 25 | 7.5% |
8 | 13 | 3.9% |
7 | 12 | 3.6% |
새주소_지상지하_코드
Categorical
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
지상 | 9991 |
---|---|
<NA> | 9 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.0018 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 지상 |
---|---|
2nd row | 지상 |
3rd row | 지상 |
4th row | 지상 |
5th row | 지상 |
Common Values
Value | Count | Frequency (%) |
지상 | 9991 | 99.9% |
<NA> | 9 | 0.1% |
새주소_본_번
Real number (ℝ)
Distinct | 351 |
---|---|
Distinct (%) | 3.5% |
Missing | 33 |
Missing (%) | 0.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 126.52764 |
Minimum | 0 |
---|---|
Maximum | 3318 |
Zeros | 3 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 8 |
Q1 | 31 |
median | 66 |
Q3 | 157 |
95-th percentile | 427 |
Maximum | 3318 |
Range | 3318 |
Interquartile range (IQR) | 126 |
Descriptive statistics
Standard deviation | 161.73109 |
---|---|
Coefficient of variation (CV) | 1.2782273 |
Kurtosis | 36.692595 |
Mean | 126.52764 |
Median Absolute Deviation (MAD) | 46 |
Skewness | 3.8618745 |
Sum | 1261101 |
Variance | 26156.944 |
Monotonicity | Not monotonic |
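The derived figures in the 새주소_본_번 tables above follow from the primary ones by simple arithmetic. The sketch below recomputes them from the rounded values printed in this report, assuming the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 − Q1, range = maximum − minimum). Variable names are illustrative only, and small differences in the last digits are expected because the inputs are already rounded.

```python
# Values copied from the 새주소_본_번 statistics in this report (already rounded).
mean = 126.52764
std = 161.73109
q1, q3 = 31, 157
minimum, maximum = 0, 3318

# Derived statistics under the usual textbook definitions (an assumption
# about how the report computes them, checked only against its output).
cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance as the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"CV       = {cv:.7f}")
print(f"Variance = {variance:.3f}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range}")
```

A CV above 1 together with the large positive skewness is consistent with a long right tail: most main building numbers are small, with a few values in the thousands.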
Value | Count | Frequency (%) |
360 | 505 | 5.1% |
36 | 394 | 3.9% |
100 | 372 | 3.7% |
22 | 336 | 3.4% |
33 | 290 | 2.9% |
13 | 273 | 2.7% |
50 | 251 | 2.5% |
66 | 246 | 2.5% |
133 | 227 | 2.3% |
62 | 202 | 2.0% |
Other values (341) | 6871 | 68.7% |
Value | Count | Frequency (%) |
0 | 3 | < 0.1% |
1 | 89 | |
2 | 12 | 0.1% |
3 | 63 | |
4 | 15 | 0.1% |
5 | 105 | |
6 | 27 | 0.3% |
7 | 125 | |
8 | 80 | |
9 | 41 | 0.4% |
Value | Count | Frequency (%) |
3318 | 1 | < 0.1% |
2803 | 1 | < 0.1% |
2275 | 1 | < 0.1% |
1808 | 1 | < 0.1% |
1666 | 6 | |
1665 | 2 | < 0.1% |
1496 | 1 | < 0.1% |
1383 | 1 | < 0.1% |
1222 | 1 | < 0.1% |
1197 | 2 | < 0.1% |
새주소_부_번
Real number (ℝ)
MISSING
ZEROS
Distinct | 36 |
---|---|
Distinct (%) | 1.0% |
Missing | 6219 |
Missing (%) | 62.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.8257075 |
Minimum | 0 |
---|---|
Maximum | 83 |
Zeros | 2719 |
Zeros (%) | 27.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 2 |
95-th percentile | 13 |
Maximum | 83 |
Range | 83 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 6.3912585 |
---|---|
Coefficient of variation (CV) | 2.261826 |
Kurtosis | 31.104014 |
Mean | 2.8257075 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 4.1847807 |
Sum | 10684 |
Variance | 40.848186 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 2719 | 27.2% |
11 | 180 | 1.8% |
12 | 175 | 1.8% |
2 | 111 | 1.1% |
1 | 104 | 1.0% |
8 | 72 | 0.7% |
3 | 55 | 0.5% |
20 | 43 | 0.4% |
5 | 42 | 0.4% |
10 | 34 | 0.3% |
Other values (26) | 246 | 2.5% |
(Missing) | 6219 | 62.2% |
Value | Count | Frequency (%) |
0 | 2719 | 27.2% |
1 | 104 | 1.0% |
2 | 111 | 1.1% |
3 | 55 | 0.5% |
4 | 14 | 0.1% |
5 | 42 | 0.4% |
6 | 28 | 0.3% |
7 | 15 | 0.1% |
8 | 72 | 0.7% |
9 | 26 | 0.3% |
Value | Count | Frequency (%) |
83 | 3 | < 0.1% |
59 | 4 | |
51 | 1 | < 0.1% |
42 | 7 | |
38 | 2 | < 0.1% |
37 | 3 | < 0.1% |
33 | 8 | |
32 | 1 | < 0.1% |
30 | 2 | < 0.1% |
29 | 1 | < 0.1% |
변동_일자
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20240413 |
Minimum | 20240411 |
---|---|
Maximum | 20240419 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20240411 |
---|---|
5-th percentile | 20240411 |
Q1 | 20240411 |
median | 20240413 |
Q3 | 20240417 |
95-th percentile | 20240419 |
Maximum | 20240419 |
Range | 8 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 2.5938456 |
---|---|
Coefficient of variation (CV) | 1.2815181 × 10⁻⁷ |
Kurtosis | -0.72339092 |
Mean | 20240413 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.82530912 |
Sum | 2.0240413 × 10¹¹ |
Variance | 6.7280351 |
Monotonicity | Not monotonic |
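The 변동_일자 values look like dates encoded as YYYYMMDD integers, which would explain why the profiler treats the column as a real number and why figures such as the mean and sum carry no meaning as dates. The sketch below is written under that assumption, which the report itself does not state. It decodes the six distinct values listed for this column and measures the span in days; `parse_yyyymmdd` is a hypothetical helper, not part of any library.

```python
from datetime import date


def parse_yyyymmdd(value: int) -> date:
    """Decode an integer such as 20240411 into a date (assumed encoding)."""
    year, rest = divmod(value, 10_000)
    month, day = divmod(rest, 100)
    return date(year, month, day)


# The six distinct values reported for 변동_일자.
observed = [20240411, 20240413, 20240416, 20240417, 20240418, 20240419]
dates = [parse_yyyymmdd(v) for v in observed]

# Span between the earliest and latest change date, in days.
span_days = (max(dates) - min(dates)).days
print(min(dates), max(dates), span_days)
```

Here the numeric range matches the calendar span only because all values fall within one month; across a month boundary the integer difference would no longer equal the number of days.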
Value | Count | Frequency (%) |
20240411 | 3976 | 39.8% |
20240413 | 3400 | 34.0% |
20240417 | 2084 | 20.8% |
20240419 | 533 | 5.3% |
20240418 | 5 | 0.1% |
20240416 | 2 | < 0.1% |
Value | Count | Frequency (%) |
20240411 | 3976 | 39.8% |
20240413 | 3400 | 34.0% |
20240416 | 2 | < 0.1% |
20240417 | 2084 | 20.8% |
20240418 | 5 | 0.1% |
20240419 | 533 | 5.3% |
Value | Count | Frequency (%) |
20240419 | 533 | 5.3% |
20240418 | 5 | 0.1% |
20240417 | 2084 | 20.8% |
20240416 | 2 | < 0.1% |
20240413 | 3400 | 34.0% |
20240411 | 3976 | 39.8% |
(index) | 관리_건축물대장_PK | 관리_상위_건축물대장_PK | 대장_구분_코드 | 대장_종류_코드 | 시군구_코드 | 법정동_코드 | 대지_구분_코드 | 번 | 지 | 특수지_명 | 블록 | 로트 | 건물_명 | 위반_건축물_여부 | 대장_일련번호 | 총괄표제부_일련번호 | 표제부_일련번호 | 전유부_일련번호 | 새주소_도로_코드 | 새주소_법정동_코드 | 새주소_지상지하_코드 | 새주소_본_번 | 새주소_부_번 | 변동_일자 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
30186 | 11740-100219604 | 11740-100218881 | 집합 | 전유부 | 강동구 | 천호동 | 대지 | 425 | 3 | <NA> | <NA> | <NA> | 천호역 한강 푸르지오시티 | <NA> | 79 | 1 | 1 | 79 | 서울특별시 강동구 구천면로 | 천호동 | 지상 | 192 | <NA> | 20240411 |
26843 | 11590-101344 | 11590-3995 | 집합 | 전유부 | 동작구 | 대방동 | 대지 | 501 | 0 | <NA> | <NA> | <NA> | 대림아파트 | <NA> | 310 | 1 | 330 | 310 | 서울특별시 동작구 여의대방로 | 대방동 | 지상 | 250 | <NA> | 20240413 |
19853 | 11440-1000000000000002992925 | 11440-1000000000000002992792 | 집합 | 전유부 | 마포구 | 성산동 | 대지 | 593 | 6 | <NA> | <NA> | <NA> | 헤리티지 삼영 | <NA> | 48 | 1 | 1 | 48 | 서울특별시 마포구 모래내로1길 | 성산동 | 지상 | 8 | <NA> | 20240413 |
32507 | 11680-100227773 | 11680-100227746 | 집합 | 전유부 | 강남구 | 자곡동 | 대지 | 687 | 0 | <NA> | <NA> | <NA> | 래미안포레 | <NA> | 36 | 29 | 22 | 36 | 서울특별시 강남구 밤고개로21길 | 자곡동 | 지상 | 25 | 0 | 20240411 |
29897 | 11650-100231979 | 11650-100231970 | 집합 | 전유부 | 서초구 | 내곡동 | 대지 | 411 | 0 | <NA> | <NA> | <NA> | 서초더샵포레 | <NA> | 32 | 35 | 30 | 32 | 서울특별시 서초구 헌릉로8길 | 내곡동 | 지상 | 58 | 0 | 20240411 |
2784 | 11500-100240981 | 11500-100240961 | 집합 | 전유부 | 강서구 | 마곡동 | 대지 | 751 | 0 | <NA> | <NA> | <NA> | 마곡엠밸리15단지 | <NA> | 39 | 24 | 13 | 39 | 서울특별시 강서구 마곡중앙로 | 마곡동 | 지상 | 36 | <NA> | 20240417 |
25951 | 11590-38126 | 11590-2414 | 집합 | 전유부 | 동작구 | 신대방동 | 대지 | 719 | 0 | <NA> | <NA> | <NA> | 동작상떼빌 | <NA> | 149 | 1 | 6 | 149 | 서울특별시 동작구 신대방1가길 | 신대방동 | 지상 | 38 | <NA> | 20240413 |
35952 | 11680-100220691 | 11680-100220660 | 집합 | 전유부 | 강남구 | 세곡동 | 대지 | 579 | 0 | <NA> | <NA> | <NA> | 강남엘에이치1단지 | <NA> | 68 | 25 | 20 | 68 | 서울특별시 강남구 헌릉로571길 | 세곡동 | 지상 | 20 | <NA> | 20240411 |
17916 | 11410-1000000000000003164519 | 11410-1000000000000003063607 | 집합 | 전유부 | 서대문구 | 창천동 | 대지 | 20 | 81 | <NA> | <NA> | <NA> | CHIME 20 | <NA> | 119 | 1 | 1 | 119 | 서울특별시 서대문구 연세로2다길 | 창천동 | 지상 | 20 | 0 | 20240413 |
24159 | 11590-38186 | 11590-2414 | 집합 | 전유부 | 동작구 | 신대방동 | 대지 | 719 | 0 | <NA> | <NA> | <NA> | 동작상떼빌 | <NA> | 221 | 1 | 6 | 221 | 서울특별시 동작구 신대방1가길 | 신대방동 | 지상 | 38 | <NA> | 20240413 |
(index) | 관리_건축물대장_PK | 관리_상위_건축물대장_PK | 대장_구분_코드 | 대장_종류_코드 | 시군구_코드 | 법정동_코드 | 대지_구분_코드 | 번 | 지 | 특수지_명 | 블록 | 로트 | 건물_명 | 위반_건축물_여부 | 대장_일련번호 | 총괄표제부_일련번호 | 표제부_일련번호 | 전유부_일련번호 | 새주소_도로_코드 | 새주소_법정동_코드 | 새주소_지상지하_코드 | 새주소_본_번 | 새주소_부_번 | 변동_일자 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
47784 | 11710-100562022 | 11710-100561983 | 집합 | 전유부 | 송파구 | 거여동 | 대지 | 597 | 0 | <NA> | <NA> | <NA> | 송파 레이크파크 호반써밋Ⅰ | <NA> | 86 | 17 | 12 | 86 | 서울특별시 송파구 위례송파로 | 거여동 | 지상 | 40 | 0 | 20240411 |
16197 | 11500-100246501 | 11500-100246471 | 집합 | 전유부 | 강서구 | 마곡동 | 대지 | 740 | 0 | <NA> | <NA> | <NA> | 마곡엠밸리5단지 | <NA> | 3 | 16 | 12 | 3 | 서울특별시 강서구 마곡서1로 | 마곡동 | 지상 | 111 | 11 | 20240413 |
18075 | 11710-100219640 | 11710-100219527 | 집합 | 전유부 | 송파구 | 신천동 | 대지 | 17 | 0 | <NA> | <NA> | <NA> | 파크리오 | <NA> | 8801 | 1 | 230 | 8801 | 서울특별시 송파구 올림픽로 | 신천동 | 지상 | 435 | <NA> | 20240413 |
12399 | 11500-100241305 | 11500-100241219 | 집합 | 전유부 | 강서구 | 마곡동 | 대지 | 751 | 0 | <NA> | <NA> | <NA> | 마곡엠밸리15단지 | <NA> | 241 | 24 | 11 | 241 | 서울특별시 강서구 마곡중앙로 | 마곡동 | 지상 | 36 | <NA> | 20240417 |
3274 | 11500-100239254 | 11500-100239198 | 집합 | 전유부 | 강서구 | 마곡동 | 대지 | 750 | 0 | <NA> | <NA> | <NA> | 마곡엠밸리14단지 | <NA> | 42 | 40 | 35 | 42 | 서울특별시 강서구 마곡중앙로 | 마곡동 | 지상 | 33 | 0 | 20240417 |
24285 | 11530-1000000000000002438093 | 11530-1000000000000002438063 | 집합 | 전유부 | 구로구 | 구로동 | 대지 | 685 | 201 | <NA> | <NA> | <NA> | 구일 투웨니퍼스트 하이앤드 | <NA> | 31 | 1 | 3 | 31 | 서울특별시 구로구 구일로 | 구로동 | 지상 | 90 | 11 | 20240413 |
19537 | 11680-1000000000000002841769 | 11680-1000000000000002841669 | 집합 | 전유부 | 강남구 | 논현동 | 대지 | 242 | 31 | <NA> | <NA> | <NA> | 논현동 상지카일룸 M | <NA> | 51 | 1 | 1 | 51 | 서울특별시 강남구 선릉로 | 논현동 | 지상 | 663 | <NA> | 20240413 |
45601 | 11680-100305045 | 11680-100304949 | 집합 | 전유부 | 강남구 | 일원동 | 대지 | 743 | 0 | <NA> | <NA> | <NA> | 디에이치 자이 개포 | <NA> | 54 | 1 | 22 | 54 | 서울특별시 강남구 영동대로 | 일원동 | 지상 | 22 | 0 | 20240411 |
43094 | 11650-1000000000000001363206 | 11650-1000000000000001362528 | 집합 | 전유부 | 서초구 | 서초동 | 대지 | 1757 | 1 | <NA> | <NA> | <NA> | 서초그랑자이 그랑몰 | <NA> | 29 | 1 | 1 | 29 | 서울특별시 서초구 효령로 | 서초동 | 지상 | 403 | <NA> | 20240411 |
34318 | 11650-100288920 | 11650-100288845 | 집합 | 전유부 | 서초구 | 서초동 | 대지 | 1755 | 0 | <NA> | <NA> | <NA> | 래미안 리더스원 | <NA> | 28 | 22 | 12 | 28 | 서울특별시 서초구 서운로 | 서초동 | 지상 | 62 | 0 | 20240411 |