Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 10000 |
Missing cells | 3167 |
Missing cells (%) | 3.2% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 908.2 KiB |
Average record size in memory | 93.0 B |
Variable types
Numeric | 4 |
---|---|
Text | 5 |
Categorical | 1 |
Dataset
Description | 한국부동산원(구.한국감정원)에서 제공하는 공동주택 단지 식별정보 중 기본정보 데이터입니다. - (기본정보) 단지고유번호, 필지고유번호, 주소, 단지명, 단지종류, 동수, 세대수, 사용승인일 |
---|---|
URL | https://www.data.go.kr/data/15106861/fileData.do |
단지종류 has constant value "" | Constant |
단지고유번호 is highly overall correlated with 필지고유번호 | High correlation |
필지고유번호 is highly overall correlated with 단지고유번호 | High correlation |
동수 is highly overall correlated with 세대수 | High correlation |
세대수 is highly overall correlated with 동수 | High correlation |
단지명_건축물대장 has 1579 (15.8%) missing values | Missing |
단지명_도로명주소 has 1588 (15.9%) missing values | Missing |
단지고유번호 has unique values | Unique |
Reproduction
Analysis started | 2023-09-13 05:45:35.548384 |
---|---|
Analysis finished | 2023-09-13 05:45:44.391710 |
Duration | 8.84 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
단지고유번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.2623982 × 1013 |
Minimum | 1.11101 × 1013 |
---|---|
Maximum | 5.013012 × 1013 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.11101 × 1013 |
---|---|
5-th percentile | 1.13201 × 1013 |
Q1 | 2.626012 × 1013 |
median | 3.17101 × 1013 |
Q3 | 4.413312 × 1013 |
95-th percentile | 4.827012 × 1013 |
Maximum | 5.013012 × 1013 |
Range | 3.902002 × 1013 |
Interquartile range (IQR) | 1.7873 × 1013 |
Descriptive statistics
Standard deviation | 1.3347209 × 1013 |
---|---|
Coefficient of variation (CV) | 0.40912261 |
Kurtosis | -1.1885311 |
Mean | 3.2623982 × 1013 |
Median Absolute Deviation (MAD) | 1.140102 × 1013 |
Skewness | -0.45433073 |
Sum | 3.2623982 × 1017 |
Variance | 1.7814799 × 1026 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
41281100007869 | 1 | < 0.1% |
47170100014648 | 1 | < 0.1% |
11380120091516 | 1 | < 0.1% |
26170100448582 | 1 | < 0.1% |
27200100014071 | 1 | < 0.1% |
46870100013245 | 1 | < 0.1% |
48310120354539 | 1 | < 0.1% |
27200100014066 | 1 | < 0.1% |
11470100003224 | 1 | < 0.1% |
11545100003529 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
11110100000012 | 1 | |
11110100000016 | 1 | |
11110100000027 | 1 | |
11110100000030 | 1 | |
11110100000032 | 1 | |
11110100000033 | 1 | |
11110100000034 | 1 | |
11110100000039 | 1 | |
11110100000045 | 1 | |
11110100000047 | 1 |
Value | Count | Frequency (%) |
50130120433550 | 1 | |
50130120427628 | 1 | |
50130120426526 | 1 | |
50130120392764 | 1 | |
50130120384832 | 1 | |
50130120383793 | 1 | |
50130120366080 | 1 | |
50130120362681 | 1 | |
50130120360868 | 1 | |
50130120360555 | 1 |
필지고유번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9982 |
---|---|
Distinct (%) | 99.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.2624009 × 1018 |
Minimum | 1.1110115 × 1018 |
---|---|
Maximum | 5.013032 × 1018 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.1110115 × 1018 |
---|---|
5-th percentile | 1.1320107 × 1018 |
Q1 | 2.6260108 × 1018 |
median | 3.1710255 × 1018 |
Q3 | 4.4133144 × 1018 |
95-th percentile | 4.8270253 × 1018 |
Maximum | 5.013032 × 1018 |
Range | 3.9020205 × 1018 |
Interquartile range (IQR) | 1.7873036 × 1018 |
Descriptive statistics
Standard deviation | 1.3347231 × 1018 |
---|---|
Coefficient of variation (CV) | 0.40912294 |
Kurtosis | -1.1885327 |
Mean | 3.2624009 × 1018 |
Median Absolute Deviation (MAD) | 1.140087 × 1018 |
Skewness | -0.45432902 |
Sum | -8.2811784 × 1018 |
Variance | 1.7814857 × 1036 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4127310900110790000 | 2 | < 0.1% |
4711132000300310008 | 2 | < 0.1% |
4825010600103210000 | 2 | < 0.1% |
4159031021100920000 | 2 | < 0.1% |
4611015800107400000 | 2 | < 0.1% |
4128112300108700000 | 2 | < 0.1% |
4128510600111350000 | 2 | < 0.1% |
4127310900110850000 | 2 | < 0.1% |
4128510200115740000 | 2 | < 0.1% |
2826011500300000000 | 2 | < 0.1% |
Other values (9972) | 9980 |
Value | Count | Frequency (%) |
1111011500100090000 | 1 | |
1111011700101450000 | 1 | |
1111013300100300006 | 1 | |
1111013300100550000 | 1 | |
1111013400100220000 | 1 | |
1111013700100220001 | 1 | |
1111016200100650002 | 1 | |
1111016500100090001 | 1 | |
1111016700100600000 | 1 | |
1111016800100040157 | 1 |
Value | Count | Frequency (%) |
5013032021115910005 | 1 | |
5013032021115340001 | 1 | |
5013032021106810000 | 1 | |
5013025924111320021 | 1 | |
5013025321113020003 | 1 | |
5013025321112940003 | 1 | |
5013025022112210001 | 1 | |
5013025022111870002 | 1 | |
5013011600102000000 | 1 | |
5013011600101890000 | 1 |
주소
Text
Distinct | 9982 |
---|---|
Distinct (%) | 99.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 28 |
---|---|
Median length | 26 |
Mean length | 18.752 |
Min length | 12 |
Characters and Unicode
Total characters | 187520 |
---|---|
Distinct characters | 342 |
Distinct categories | 5 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 9964 ? |
---|---|
Unique (%) | 99.6% |
Sample
1st row | 경기도 고양덕양구 행신동 953 |
---|---|
2nd row | 경상남도 거제시 장승포동 367-4 |
3rd row | 서울특별시 강동구 명일동 352-18 |
4th row | 부산광역시 남구 대연동 876-11 |
5th row | 전라북도 임실군 임실읍 이도리 212 |
Value | Count | Frequency (%) |
서울특별시 | 2159 | 5.2% |
경기도 | 1739 | 4.2% |
부산광역시 | 1083 | 2.6% |
경상남도 | 801 | 1.9% |
경상북도 | 628 | 1.5% |
인천광역시 | 475 | 1.1% |
대구광역시 | 419 | 1.0% |
울산광역시 | 379 | 0.9% |
충청남도 | 324 | 0.8% |
전라남도 | 309 | 0.7% |
Other values (9532) | 33085 |
Most occurring characters
Value | Count | Frequency (%) |
31475 | 16.8% | |
동 | 9819 | 5.2% |
시 | 8081 | 4.3% |
1 | 7920 | 4.2% |
구 | 7380 | 3.9% |
- | 6286 | 3.4% |
도 | 5408 | 2.9% |
2 | 4560 | 2.4% |
3 | 4150 | 2.2% |
4 | 3738 | 2.0% |
Other values (332) | 98703 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 110644 | |
Decimal Number | 39079 | 20.8% |
Space Separator | 31475 | 16.8% |
Dash Punctuation | 6286 | 3.4% |
Uppercase Letter | 36 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 9819 | 8.9% |
시 | 8081 | 7.3% |
구 | 7380 | 6.7% |
도 | 5408 | 4.9% |
서 | 3473 | 3.1% |
광 | 3453 | 3.1% |
경 | 3352 | 3.0% |
산 | 3138 | 2.8% |
역 | 2941 | 2.7% |
울 | 2627 | 2.4% |
Other values (318) | 60972 |
Decimal Number
Value | Count | Frequency (%) |
1 | 7920 | |
2 | 4560 | |
3 | 4150 | |
4 | 3738 | |
5 | 3646 | |
6 | 3409 | |
7 | 3223 | |
8 | 2864 | 7.3% |
0 | 2841 | 7.3% |
9 | 2728 | 7.0% |
Uppercase Letter
Value | Count | Frequency (%) |
L | 18 | |
B | 18 |
Space Separator
Value | Count | Frequency (%) |
31475 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6286 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 110644 | |
Common | 76840 | |
Latin | 36 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 9819 | 8.9% |
시 | 8081 | 7.3% |
구 | 7380 | 6.7% |
도 | 5408 | 4.9% |
서 | 3473 | 3.1% |
광 | 3453 | 3.1% |
경 | 3352 | 3.0% |
산 | 3138 | 2.8% |
역 | 2941 | 2.7% |
울 | 2627 | 2.4% |
Other values (318) | 60972 |
Common
Value | Count | Frequency (%) |
31475 | ||
1 | 7920 | 10.3% |
- | 6286 | 8.2% |
2 | 4560 | 5.9% |
3 | 4150 | 5.4% |
4 | 3738 | 4.9% |
5 | 3646 | 4.7% |
6 | 3409 | 4.4% |
7 | 3223 | 4.2% |
8 | 2864 | 3.7% |
Other values (2) | 5569 | 7.2% |
Latin
Value | Count | Frequency (%) |
L | 18 | |
B | 18 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 110644 | |
ASCII | 76876 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
31475 | ||
1 | 7920 | 10.3% |
- | 6286 | 8.2% |
2 | 4560 | 5.9% |
3 | 4150 | 5.4% |
4 | 3738 | 4.9% |
5 | 3646 | 4.7% |
6 | 3409 | 4.4% |
7 | 3223 | 4.2% |
8 | 2864 | 3.7% |
Other values (4) | 5605 | 7.3% |
Hangul
Value | Count | Frequency (%) |
동 | 9819 | 8.9% |
시 | 8081 | 7.3% |
구 | 7380 | 6.7% |
도 | 5408 | 4.9% |
서 | 3473 | 3.1% |
광 | 3453 | 3.1% |
경 | 3352 | 3.0% |
산 | 3138 | 2.8% |
역 | 2941 | 2.7% |
울 | 2627 | 2.4% |
Other values (318) | 60972 |
단지명_공시가격
Text
Distinct | 8841 |
---|---|
Distinct (%) | 88.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
현대 | 49 | 0.5% |
삼성 | 15 | 0.1% |
주공 | 14 | 0.1% |
삼익 | 13 | 0.1% |
우성 | 12 | 0.1% |
신동아 | 12 | 0.1% |
벽산 | 11 | 0.1% |
한성 | 11 | 0.1% |
현대2 | 10 | 0.1% |
삼호 | 10 | 0.1% |
Other values (8833) | 9846 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 1529 | 2.5% |
동 | 1451 | 2.3% |
스 | 1439 | 2.3% |
빌 | 1403 | 2.3% |
아 | 1358 | 2.2% |
트 | 1230 | 2.0% |
지 | 1152 | 1.9% |
2 | 1145 | 1.9% |
이 | 1107 | 1.8% |
파 | 1074 | 1.7% |
Other values (680) | 49001 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 52350 | |
Decimal Number | 5568 | 9.0% |
Close Punctuation | 1031 | 1.7% |
Open Punctuation | 1031 | 1.7% |
Uppercase Letter | 978 | 1.6% |
Dash Punctuation | 522 | 0.8% |
Lowercase Letter | 279 | 0.5% |
Other Punctuation | 86 | 0.1% |
Letter Number | 21 | < 0.1% |
Math Symbol | 20 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 1451 | 2.8% |
스 | 1439 | 2.7% |
빌 | 1403 | 2.7% |
아 | 1358 | 2.6% |
트 | 1230 | 2.3% |
지 | 1152 | 2.2% |
이 | 1107 | 2.1% |
파 | 1074 | 2.1% |
대 | 959 | 1.8% |
리 | 934 | 1.8% |
Other values (607) | 40243 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 135 | |
B | 92 | 9.4% |
S | 78 | 8.0% |
C | 73 | 7.5% |
I | 69 | 7.1% |
L | 68 | 7.0% |
K | 54 | 5.5% |
H | 50 | 5.1% |
T | 43 | 4.4% |
E | 37 | 3.8% |
Other values (16) | 279 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 103 | |
i | 26 | 9.3% |
l | 24 | 8.6% |
t | 18 | 6.5% |
a | 17 | 6.1% |
r | 13 | 4.7% |
y | 11 | 3.9% |
u | 10 | 3.6% |
s | 9 | 3.2% |
k | 8 | 2.9% |
Other values (12) | 40 | 14.3% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1529 | |
2 | 1145 | |
3 | 581 | 10.4% |
0 | 570 | 10.2% |
4 | 359 | 6.4% |
5 | 350 | 6.3% |
7 | 331 | 5.9% |
6 | 295 | 5.3% |
9 | 216 | 3.9% |
8 | 192 | 3.4% |
Other Punctuation
Value | Count | Frequency (%) |
, | 69 | |
. | 9 | 10.5% |
& | 3 | 3.5% |
' | 3 | 3.5% |
: | 2 | 2.3% |
Math Symbol
Value | Count | Frequency (%) |
~ | 18 | |
> | 1 | 5.0% |
< | 1 | 5.0% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 14 | |
Ⅲ | 4 | 19.0% |
Ⅰ | 3 | 14.3% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1031 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1031 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 522 |
Space Separator
Value | Count | Frequency (%) |
3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 52345 | |
Common | 8261 | 13.3% |
Latin | 1278 | 2.1% |
Han | 5 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 1451 | 2.8% |
스 | 1439 | 2.7% |
빌 | 1403 | 2.7% |
아 | 1358 | 2.6% |
트 | 1230 | 2.3% |
지 | 1152 | 2.2% |
이 | 1107 | 2.1% |
파 | 1074 | 2.1% |
대 | 959 | 1.8% |
리 | 934 | 1.8% |
Other values (604) | 40238 |
Latin
Value | Count | Frequency (%) |
A | 135 | 10.6% |
e | 103 | 8.1% |
B | 92 | 7.2% |
S | 78 | 6.1% |
C | 73 | 5.7% |
I | 69 | 5.4% |
L | 68 | 5.3% |
K | 54 | 4.2% |
H | 50 | 3.9% |
T | 43 | 3.4% |
Other values (41) | 513 |
Common
Value | Count | Frequency (%) |
1 | 1529 | |
2 | 1145 | |
) | 1031 | |
( | 1031 | |
3 | 581 | 7.0% |
0 | 570 | 6.9% |
- | 522 | 6.3% |
4 | 359 | 4.3% |
5 | 350 | 4.2% |
7 | 331 | 4.0% |
Other values (12) | 812 |
Han
Value | Count | Frequency (%) |
家 | 3 | |
林 | 1 | 20.0% |
名 | 1 | 20.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 52344 | |
ASCII | 9518 | 15.4% |
Number Forms | 21 | < 0.1% |
CJK | 4 | < 0.1% |
CJK Compat Ideographs | 1 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 1529 | |
2 | 1145 | |
) | 1031 | |
( | 1031 | |
3 | 581 | 6.1% |
0 | 570 | 6.0% |
- | 522 | 5.5% |
4 | 359 | 3.8% |
5 | 350 | 3.7% |
7 | 331 | 3.5% |
Other values (60) | 2069 |
Hangul
Value | Count | Frequency (%) |
동 | 1451 | 2.8% |
스 | 1439 | 2.7% |
빌 | 1403 | 2.7% |
아 | 1358 | 2.6% |
트 | 1230 | 2.3% |
지 | 1152 | 2.2% |
이 | 1107 | 2.1% |
파 | 1074 | 2.1% |
대 | 959 | 1.8% |
리 | 934 | 1.8% |
Other values (603) | 40237 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 14 | |
Ⅲ | 4 | 19.0% |
Ⅰ | 3 | 14.3% |
CJK
Value | Count | Frequency (%) |
家 | 3 | |
名 | 1 | 25.0% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
林 | 1 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 1 |
단지명_건축물대장
Text
MISSING
 
Distinct | 7376 |
---|---|
Distinct (%) | 87.6% |
Missing | 1579 |
Missing (%) | 15.8% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
아파트 | 375 | 3.3% |
현대아파트 | 76 | 0.7% |
더 | 52 | 0.5% |
2차 | 51 | 0.4% |
푸르지오 | 46 | 0.4% |
2단지 | 45 | 0.4% |
주공아파트 | 44 | 0.4% |
e편한세상 | 40 | 0.3% |
1단지 | 38 | 0.3% |
롯데캐슬 | 31 | 0.3% |
Other values (8000) | 10686 |
Most occurring characters
Value | Count | Frequency (%) |
아 | 4573 | 7.2% |
트 | 4474 | 7.1% |
파 | 4329 | 6.9% |
3067 | 4.9% | |
스 | 1276 | 2.0% |
빌 | 1215 | 1.9% |
동 | 1137 | 1.8% |
이 | 1010 | 1.6% |
지 | 922 | 1.5% |
리 | 837 | 1.3% |
Other values (673) | 40315 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 56753 | |
Space Separator | 3067 | 4.9% |
Decimal Number | 1880 | 3.0% |
Uppercase Letter | 776 | 1.2% |
Lowercase Letter | 285 | 0.5% |
Dash Punctuation | 111 | 0.2% |
Other Punctuation | 98 | 0.2% |
Open Punctuation | 82 | 0.1% |
Close Punctuation | 82 | 0.1% |
Letter Number | 20 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 4573 | 8.1% |
트 | 4474 | 7.9% |
파 | 4329 | 7.6% |
스 | 1276 | 2.2% |
빌 | 1215 | 2.1% |
동 | 1137 | 2.0% |
이 | 1010 | 1.8% |
지 | 922 | 1.6% |
리 | 837 | 1.5% |
대 | 796 | 1.4% |
Other values (599) | 36184 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 83 | 10.7% |
S | 64 | 8.2% |
C | 62 | 8.0% |
I | 55 | 7.1% |
L | 50 | 6.4% |
K | 49 | 6.3% |
T | 42 | 5.4% |
H | 39 | 5.0% |
P | 38 | 4.9% |
B | 38 | 4.9% |
Other values (16) | 256 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 101 | |
i | 29 | 10.2% |
l | 24 | 8.4% |
t | 20 | 7.0% |
a | 14 | 4.9% |
s | 12 | 4.2% |
n | 11 | 3.9% |
r | 11 | 3.9% |
o | 11 | 3.9% |
y | 11 | 3.9% |
Other values (12) | 41 |
Decimal Number
Value | Count | Frequency (%) |
1 | 576 | |
2 | 556 | |
3 | 224 | 11.9% |
0 | 146 | 7.8% |
5 | 97 | 5.2% |
4 | 94 | 5.0% |
6 | 75 | 4.0% |
7 | 45 | 2.4% |
8 | 44 | 2.3% |
9 | 23 | 1.2% |
Other Punctuation
Value | Count | Frequency (%) |
, | 39 | |
. | 38 | |
& | 6 | 6.1% |
· | 4 | 4.1% |
/ | 4 | 4.1% |
' | 3 | 3.1% |
: | 3 | 3.1% |
# | 1 | 1.0% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 13 | |
Ⅰ | 4 | 20.0% |
Ⅲ | 3 | 15.0% |
Space Separator
Value | Count | Frequency (%) |
3067 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 111 |
Open Punctuation
Value | Count | Frequency (%) |
( | 82 |
Close Punctuation
Value | Count | Frequency (%) |
) | 82 |
Math Symbol
Value | Count | Frequency (%) |
~ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 56748 | |
Common | 5321 | 8.4% |
Latin | 1081 | 1.7% |
Han | 5 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 4573 | 8.1% |
트 | 4474 | 7.9% |
파 | 4329 | 7.6% |
스 | 1276 | 2.2% |
빌 | 1215 | 2.1% |
동 | 1137 | 2.0% |
이 | 1010 | 1.8% |
지 | 922 | 1.6% |
리 | 837 | 1.5% |
대 | 796 | 1.4% |
Other values (597) | 36179 |
Latin
Value | Count | Frequency (%) |
e | 101 | 9.3% |
A | 83 | 7.7% |
S | 64 | 5.9% |
C | 62 | 5.7% |
I | 55 | 5.1% |
L | 50 | 4.6% |
K | 49 | 4.5% |
T | 42 | 3.9% |
H | 39 | 3.6% |
P | 38 | 3.5% |
Other values (41) | 498 |
Common
Value | Count | Frequency (%) |
3067 | ||
1 | 576 | 10.8% |
2 | 556 | 10.4% |
3 | 224 | 4.2% |
0 | 146 | 2.7% |
- | 111 | 2.1% |
5 | 97 | 1.8% |
4 | 94 | 1.8% |
( | 82 | 1.5% |
) | 82 | 1.5% |
Other values (13) | 286 | 5.4% |
Han
Value | Count | Frequency (%) |
家 | 4 | |
名 | 1 | 20.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 56748 | |
ASCII | 6378 | 10.1% |
Number Forms | 20 | < 0.1% |
CJK | 5 | < 0.1% |
None | 4 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
아 | 4573 | 8.1% |
트 | 4474 | 7.9% |
파 | 4329 | 7.6% |
스 | 1276 | 2.2% |
빌 | 1215 | 2.1% |
동 | 1137 | 2.0% |
이 | 1010 | 1.8% |
지 | 922 | 1.6% |
리 | 837 | 1.5% |
대 | 796 | 1.4% |
Other values (597) | 36179 |
ASCII
Value | Count | Frequency (%) |
3067 | ||
1 | 576 | 9.0% |
2 | 556 | 8.7% |
3 | 224 | 3.5% |
0 | 146 | 2.3% |
- | 111 | 1.7% |
e | 101 | 1.6% |
5 | 97 | 1.5% |
4 | 94 | 1.5% |
A | 83 | 1.3% |
Other values (60) | 1323 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 13 | |
Ⅰ | 4 | 20.0% |
Ⅲ | 3 | 15.0% |
None
Value | Count | Frequency (%) |
· | 4 |
CJK
Value | Count | Frequency (%) |
家 | 4 | |
名 | 1 | 20.0% |
단지명_도로명주소
Text
MISSING
 
Distinct | 7227 |
---|---|
Distinct (%) | 85.9% |
Missing | 1588 |
Missing (%) | 15.9% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
아파트 | 328 | 2.9% |
현대아파트 | 97 | 0.9% |
주공아파트 | 53 | 0.5% |
2차 | 44 | 0.4% |
더 | 43 | 0.4% |
2단지 | 42 | 0.4% |
푸르지오 | 40 | 0.4% |
1단지 | 35 | 0.3% |
e편한세상 | 34 | 0.3% |
101동 | 30 | 0.3% |
Other values (7815) | 10398 |
Most occurring characters
Value | Count | Frequency (%) |
아 | 4721 | 7.6% |
트 | 4578 | 7.4% |
파 | 4469 | 7.2% |
2740 | 4.4% | |
동 | 1208 | 2.0% |
빌 | 1193 | 1.9% |
스 | 1168 | 1.9% |
이 | 936 | 1.5% |
지 | 862 | 1.4% |
대 | 821 | 1.3% |
Other values (664) | 39042 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 55739 | |
Space Separator | 2740 | 4.4% |
Decimal Number | 1931 | 3.1% |
Uppercase Letter | 755 | 1.2% |
Lowercase Letter | 242 | 0.4% |
Dash Punctuation | 96 | 0.2% |
Open Punctuation | 79 | 0.1% |
Close Punctuation | 79 | 0.1% |
Other Punctuation | 57 | 0.1% |
Letter Number | 20 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 4721 | 8.5% |
트 | 4578 | 8.2% |
파 | 4469 | 8.0% |
동 | 1208 | 2.2% |
빌 | 1193 | 2.1% |
스 | 1168 | 2.1% |
이 | 936 | 1.7% |
지 | 862 | 1.5% |
대 | 821 | 1.5% |
성 | 793 | 1.4% |
Other values (590) | 34990 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 89 | 11.8% |
S | 61 | 8.1% |
C | 55 | 7.3% |
I | 52 | 6.9% |
T | 47 | 6.2% |
P | 47 | 6.2% |
K | 45 | 6.0% |
L | 44 | 5.8% |
B | 40 | 5.3% |
E | 31 | 4.1% |
Other values (16) | 244 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 91 | |
l | 23 | 9.5% |
i | 22 | 9.1% |
t | 15 | 6.2% |
s | 11 | 4.5% |
o | 10 | 4.1% |
n | 9 | 3.7% |
a | 9 | 3.7% |
r | 8 | 3.3% |
u | 8 | 3.3% |
Other values (12) | 36 | 14.9% |
Decimal Number
Value | Count | Frequency (%) |
1 | 592 | |
2 | 538 | |
3 | 233 | 12.1% |
0 | 200 | 10.4% |
5 | 94 | 4.9% |
4 | 88 | 4.6% |
6 | 75 | 3.9% |
7 | 45 | 2.3% |
8 | 41 | 2.1% |
9 | 25 | 1.3% |
Other Punctuation
Value | Count | Frequency (%) |
. | 23 | |
, | 16 | |
& | 7 | 12.3% |
' | 3 | 5.3% |
· | 3 | 5.3% |
: | 2 | 3.5% |
/ | 1 | 1.8% |
@ | 1 | 1.8% |
# | 1 | 1.8% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 14 | |
Ⅲ | 3 | 15.0% |
Ⅰ | 3 | 15.0% |
Space Separator
Value | Count | Frequency (%) |
2740 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 96 |
Open Punctuation
Value | Count | Frequency (%) |
( | 79 |
Close Punctuation
Value | Count | Frequency (%) |
) | 79 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 55733 | |
Common | 4982 | 8.1% |
Latin | 1017 | 1.6% |
Han | 6 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 4721 | 8.5% |
트 | 4578 | 8.2% |
파 | 4469 | 8.0% |
동 | 1208 | 2.2% |
빌 | 1193 | 2.1% |
스 | 1168 | 2.1% |
이 | 936 | 1.7% |
지 | 862 | 1.5% |
대 | 821 | 1.5% |
성 | 793 | 1.4% |
Other values (587) | 34984 |
Latin
Value | Count | Frequency (%) |
e | 91 | 8.9% |
A | 89 | 8.8% |
S | 61 | 6.0% |
C | 55 | 5.4% |
I | 52 | 5.1% |
T | 47 | 4.6% |
P | 47 | 4.6% |
K | 45 | 4.4% |
L | 44 | 4.3% |
B | 40 | 3.9% |
Other values (41) | 446 |
Common
Value | Count | Frequency (%) |
2740 | ||
1 | 592 | 11.9% |
2 | 538 | 10.8% |
3 | 233 | 4.7% |
0 | 200 | 4.0% |
- | 96 | 1.9% |
5 | 94 | 1.9% |
4 | 88 | 1.8% |
( | 79 | 1.6% |
) | 79 | 1.6% |
Other values (13) | 243 | 4.9% |
Han
Value | Count | Frequency (%) |
家 | 4 | |
林 | 1 | 16.7% |
名 | 1 | 16.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 55733 | |
ASCII | 5976 | 9.7% |
Number Forms | 20 | < 0.1% |
CJK | 5 | < 0.1% |
None | 3 | < 0.1% |
CJK Compat Ideographs | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
아 | 4721 | 8.5% |
트 | 4578 | 8.2% |
파 | 4469 | 8.0% |
동 | 1208 | 2.2% |
빌 | 1193 | 2.1% |
스 | 1168 | 2.1% |
이 | 936 | 1.7% |
지 | 862 | 1.5% |
대 | 821 | 1.5% |
성 | 793 | 1.4% |
Other values (587) | 34984 |
ASCII
Value | Count | Frequency (%) |
2740 | ||
1 | 592 | 9.9% |
2 | 538 | 9.0% |
3 | 233 | 3.9% |
0 | 200 | 3.3% |
- | 96 | 1.6% |
5 | 94 | 1.6% |
e | 91 | 1.5% |
A | 89 | 1.5% |
4 | 88 | 1.5% |
Other values (60) | 1215 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 14 | |
Ⅲ | 3 | 15.0% |
Ⅰ | 3 | 15.0% |
CJK
Value | Count | Frequency (%) |
家 | 4 | |
名 | 1 | 20.0% |
None
Value | Count | Frequency (%) |
· | 3 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
林 | 1 |
단지종류
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 10000 |
동수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 48 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.6015 |
Minimum | 1 |
---|---|
Maximum | 72 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 5 |
95-th percentile | 12 |
Maximum | 72 |
Range | 71 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 4.6779575 |
---|---|
Coefficient of variation (CV) | 1.2988914 |
Kurtosis | 23.586366 |
Mean | 3.6015 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.5630939 |
Sum | 36015 |
Variance | 21.883286 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 5404 | |
2 | 958 | 9.6% |
3 | 554 | 5.5% |
4 | 485 | 4.9% |
5 | 420 | 4.2% |
6 | 418 | 4.2% |
7 | 308 | 3.1% |
8 | 279 | 2.8% |
9 | 224 | 2.2% |
10 | 187 | 1.9% |
Other values (38) | 763 | 7.6% |
Value | Count | Frequency (%) |
1 | 5404 | |
2 | 958 | 9.6% |
3 | 554 | 5.5% |
4 | 485 | 4.9% |
5 | 420 | 4.2% |
6 | 418 | 4.2% |
7 | 308 | 3.1% |
8 | 279 | 2.8% |
9 | 224 | 2.2% |
10 | 187 | 1.9% |
Value | Count | Frequency (%) |
72 | 1 | |
66 | 1 | |
65 | 1 | |
60 | 1 | |
51 | 1 | |
49 | 1 | |
46 | 1 | |
44 | 1 | |
41 | 1 | |
40 | 1 |
세대수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 1330 |
---|---|
Distinct (%) | 13.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 274.4745 |
Minimum | 4 |
---|---|
Maximum | 6864 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 4 |
---|---|
5-th percentile | 12 |
Q1 | 21 |
median | 95 |
Q3 | 378 |
95-th percentile | 1056.1 |
Maximum | 6864 |
Range | 6860 |
Interquartile range (IQR) | 357 |
Descriptive statistics
Standard deviation | 416.78995 |
---|---|
Coefficient of variation (CV) | 1.5185015 |
Kurtosis | 23.375225 |
Mean | 274.4745 |
Median Absolute Deviation (MAD) | 79 |
Skewness | 3.5227466 |
Sum | 2744745 |
Variance | 173713.86 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
19 | 716 | 7.2% |
18 | 402 | 4.0% |
14 | 209 | 2.1% |
12 | 204 | 2.0% |
16 | 165 | 1.7% |
10 | 163 | 1.6% |
40 | 132 | 1.3% |
15 | 132 | 1.3% |
29 | 111 | 1.1% |
28 | 111 | 1.1% |
Other values (1320) | 7655 |
Value | Count | Frequency (%) |
4 | 4 | < 0.1% |
5 | 71 | 0.7% |
6 | 16 | 0.2% |
7 | 16 | 0.2% |
8 | 29 | 0.3% |
9 | 40 | 0.4% |
10 | 163 | |
11 | 63 | 0.6% |
12 | 204 | |
13 | 68 | 0.7% |
Value | Count | Frequency (%) |
6864 | 1 | |
5678 | 1 | |
5563 | 1 | |
5076 | 1 | |
4089 | 1 | |
3853 | 1 | |
3850 | 1 | |
3806 | 1 | |
3728 | 1 | |
3696 | 1 |
사용승인일
Text
Distinct | 6142 |
---|---|
Distinct (%) | 61.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 100000 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 3669 ? |
---|---|
Unique (%) | 36.7% |
Sample
1st row | 1996-11-22 |
---|---|
2nd row | 2015-07-05 |
3rd row | 2004-10-30 |
4th row | 2004-06-07 |
5th row | 1993-07-16 |
Value | Count | Frequency (%) |
2008-10-31 | 9 | 0.1% |
2002-11-14 | 8 | 0.1% |
2002-11-01 | 8 | 0.1% |
2004-08-27 | 8 | 0.1% |
2004-07-30 | 8 | 0.1% |
2017-10-27 | 8 | 0.1% |
2004-09-24 | 8 | 0.1% |
2002-12-18 | 7 | 0.1% |
2003-01-22 | 7 | 0.1% |
2003-09-08 | 7 | 0.1% |
Other values (6132) | 9922 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 21799 | |
- | 20000 | |
1 | 16043 | |
2 | 14619 | |
9 | 9158 | |
8 | 3587 | 3.6% |
3 | 3553 | 3.6% |
7 | 2900 | 2.9% |
4 | 2887 | 2.9% |
6 | 2757 | 2.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 80000 | |
Dash Punctuation | 20000 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 21799 | |
1 | 16043 | |
2 | 14619 | |
9 | 9158 | |
8 | 3587 | 4.5% |
3 | 3553 | 4.4% |
7 | 2900 | 3.6% |
4 | 2887 | 3.6% |
6 | 2757 | 3.4% |
5 | 2697 | 3.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 20000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 100000 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 21799 | |
- | 20000 | |
1 | 16043 | |
2 | 14619 | |
9 | 9158 | |
8 | 3587 | 3.6% |
3 | 3553 | 3.6% |
7 | 2900 | 2.9% |
4 | 2887 | 2.9% |
6 | 2757 | 2.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 100000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 21799 | |
- | 20000 | |
1 | 16043 | |
2 | 14619 | |
9 | 9158 | |
8 | 3587 | 3.6% |
3 | 3553 | 3.6% |
7 | 2900 | 2.9% |
4 | 2887 | 2.9% |
6 | 2757 | 2.8% |
단지고유번호 | 필지고유번호 | 동수 | 세대수 | |
---|---|---|---|---|
단지고유번호 | 1.000 | 1.000 | 0.145 | 0.131 |
필지고유번호 | 1.000 | 1.000 | 0.145 | 0.131 |
동수 | 0.145 | 0.145 | 1.000 | 0.872 |
세대수 | 0.131 | 0.131 | 0.872 | 1.000 |
단지고유번호 | 필지고유번호 | 동수 | 세대수 | |
---|---|---|---|---|
단지고유번호 | 1.000 | 1.000 | 0.114 | 0.124 |
필지고유번호 | 1.000 | 1.000 | 0.115 | 0.125 |
동수 | 0.114 | 0.115 | 1.000 | 0.858 |
세대수 | 0.124 | 0.125 | 0.858 | 1.000 |
단지고유번호 | 필지고유번호 | 주소 | 단지명_공시가격 | 단지명_건축물대장 | 단지명_도로명주소 | 단지종류 | 동수 | 세대수 | 사용승인일 | |
---|---|---|---|---|---|---|---|---|---|---|
9130 | 41281100007869 | 4128112800109530000 | 경기도 고양덕양구 행신동 953 | 햇빛주공23 | <NA> | 햇빛마을 | 1 | 20 | 1813 | 1996-11-22 |
20967 | 48310120322607 | 4831010200103670004 | 경상남도 거제시 장승포동 367-4 | SG펠리체A동 | SG펠리체 | SG펠리체 | 1 | 1 | 16 | 2015-07-05 |
32957 | 11740120022739 | 1174010100103520018 | 서울특별시 강동구 명일동 352-18 | 태천해오름102동B | 태천해오름아파트 | 태천해오름아파트 | 1 | 1 | 6 | 2004-10-30 |
15859 | 26290100250098 | 2629010600108760011 | 부산광역시 남구 대연동 876-11 | 삼정그린타운 | 삼정그린타운 | 삼정그린타운 | 1 | 1 | 14 | 2004-06-07 |
41950 | 45750100012386 | 4575025022102120000 | 전라북도 임실군 임실읍 이도리 212 | 아도훼밀리 | 아도훼밀리아파트 | 아도훼밀리아파트 | 1 | 1 | 58 | 1993-07-16 |
4080 | 41287100008086 | 4128710500103820013 | 경기도 고양일산서구 덕이동 382-13 | 동양라파크 | 동양라파크 | 동양라파크 | 1 | 5 | 200 | 2002-02-28 |
35061 | 26230100003992 | 2623010400112560002 | 부산광역시 부산진구 범천동 1256-2 | 서면항도타워맨션 | 서면항도타워맨션 | 서면항도타워맨션 | 1 | 1 | 188 | 1994-05-16 |
19914 | 41210120131901 | 4121010100107840000 | 경기도 광명시 광명동 784 | 광명제일풍경채 | 제일풍경채아파트 | 제일풍경채아파트 | 1 | 5 | 195 | 2010-09-03 |
29382 | 48240120069051 | 4824025021103040001 | 경상남도 사천시 사천읍 선인리 304-1 | 진성아트빌 | 진성아트빌 | 진성아트빌 | 1 | 1 | 17 | 2007-04-25 |
12089 | 11590100001109 | 1159010500103270000 | 서울특별시 동작구 흑석동 327 | 청호 | 청호아파트 | 청호아파트 | 1 | 5 | 346 | 1997-10-14 |
단지고유번호 | 필지고유번호 | 주소 | 단지명_공시가격 | 단지명_건축물대장 | 단지명_도로명주소 | 단지종류 | 동수 | 세대수 | 사용승인일 | |
---|---|---|---|---|---|---|---|---|---|---|
27473 | 11560100001058 | 1156013200145180000 | 서울특별시 영등포구 신길동 4518 | 우성2 | <NA> | <NA> | 1 | 7 | 725 | 1986-09-25 |
6923 | 11500100001822 | 1150010300109130001 | 서울특별시 강서구 화곡동 913-1 | 삼도 | 삼도아파트 | 삼도아파트 | 1 | 2 | 64 | 1999-11-10 |
29771 | 42770120381657 | 4277025321104400000 | 강원도 정선군 고한읍 고한리 440 | 파인앤유아파트 | 파인앤유 아파트 | <NA> | 1 | 5 | 299 | 2018-11-29 |
25330 | 50110120404711 | 5011013700103060000 | 제주특별자치도 제주시 연동 306 | 제주연동중흥S-클래스 | 제주 연동 중흥S-클래스 | 제주 연동 중흥S-클래스 | 1 | 1 | 151 | 2020-05-15 |
3627 | 11110100248200 | 1111016800100040157 | 서울특별시 종로구 동숭동 4-157 | 동성아파트(2동) | 동성아파트 | 동성아파트 | 1 | 1 | 18 | 1999-08-21 |
24587 | 11500120316409 | 1150010300109170014 | 서울특별시 강서구 화곡동 917-14 | 삼성다빈치(917-14) | 삼성다빈치 | 삼성다빈치 | 1 | 1 | 104 | 2015-04-15 |
32425 | 26470100005311 | 2647010200111220001 | 부산광역시 연제구 연산동 1122-1 | 남일 | <NA> | <NA> | 1 | 3 | 118 | 1976-11-05 |
31706 | 44250120009955 | 4425010100101670001 | 충청남도 계룡시 금암동 167-1 | 우림루미아트 | 우림루미아트 | 우림루미아트 | 1 | 14 | 868 | 2005-06-16 |
1356 | 44770100011341 | 4477025026103030007 | 충청남도 서천군 장항읍 화천리 303-7 | 신흥 | 신흥아파트 | 신흥아파트 | 1 | 1 | 252 | 1995-09-23 |
11455 | 11545100050693 | 1154510200109870011 | 서울특별시 금천구 독산동 987-11 | 한아(987-11) | 한아아파트 | 한아아파트 | 1 | 1 | 12 | 2003-11-20 |