Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 10000 |
Missing cells | 10815 |
Missing cells (%) | 9.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.0 MiB |
Average record size in memory | 110.0 B |
Variable types
Text | 5 |
---|---|
Categorical | 2 |
Numeric | 5 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15401/S/1/datasetView.do |
6992.960000000 is highly overall correlated with 226180.460000000 | High correlation |
226180.460000000 is highly overall correlated with 6992.960000000 and 1 other fields | High correlation |
46 is highly overall correlated with 226180.460000000 and 1 other fields | High correlation |
3 is highly overall correlated with 46 | High correlation |
현대슈퍼빌 has 4657 (46.6%) missing values | Missing |
업무시설,운동시설,근린생활시설 has 2186 (21.9%) missing values | Missing |
Unnamed: 6 has 3913 (39.1%) missing values | Missing |
6992.960000000 is highly skewed (γ1 = 70.88393568) | Skewed |
226180.460000000 is highly skewed (γ1 = 22.44219467) | Skewed |
11000-1 has unique values | Unique |
6992.960000000 has 3016 (30.2%) zeros | Zeros |
226180.460000000 has 1039 (10.4%) zeros | Zeros |
46 has 232 (2.3%) zeros | Zeros |
3 has 4765 (47.6%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-03 21:58:33.535696 |
---|---|
Analysis finished | 2024-05-03 21:58:44.831794 |
Duration | 11.3 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
11000-1
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 15 |
---|---|
Median length | 15 |
Mean length | 13.8753 |
Min length | 7 |
Characters and Unicode
Total characters | 138753 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 11110-4111 |
---|---|
2nd row | 11110-100019540 |
3rd row | 11110-100013544 |
4th row | 11110-100016571 |
5th row | 11110-100017393 |
Value | Count | Frequency (%) |
11110-4111 | 1 | < 0.1% |
11110-100031435 | 1 | < 0.1% |
11110-4557 | 1 | < 0.1% |
11110-100006936 | 1 | < 0.1% |
11140-100036323 | 1 | < 0.1% |
11110-100022443 | 1 | < 0.1% |
11110-1648 | 1 | < 0.1% |
11110-100016992 | 1 | < 0.1% |
11110-100026502 | 1 | < 0.1% |
11140-100005592 | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 50727 | |
0 | 38824 | |
- | 10000 | 7.2% |
4 | 7757 | 5.6% |
2 | 5945 | 4.3% |
3 | 5244 | 3.8% |
5 | 4556 | 3.3% |
6 | 4541 | 3.3% |
7 | 3844 | 2.8% |
8 | 3825 | 2.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 128753 | |
Dash Punctuation | 10000 | 7.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 50727 | |
0 | 38824 | |
4 | 7757 | 6.0% |
2 | 5945 | 4.6% |
3 | 5244 | 4.1% |
5 | 4556 | 3.5% |
6 | 4541 | 3.5% |
7 | 3844 | 3.0% |
8 | 3825 | 3.0% |
9 | 3490 | 2.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 138753 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 50727 | |
0 | 38824 | |
- | 10000 | 7.2% |
4 | 7757 | 5.6% |
2 | 5945 | 4.3% |
3 | 5244 | 3.8% |
5 | 4556 | 3.3% |
6 | 4541 | 3.3% |
7 | 3844 | 2.8% |
8 | 3825 | 2.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 138753 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 50727 | |
0 | 38824 | |
- | 10000 | 7.2% |
4 | 7757 | 5.6% |
2 | 5945 | 4.3% |
3 | 5244 | 3.8% |
5 | 4556 | 3.3% |
6 | 4541 | 3.3% |
7 | 3844 | 2.8% |
8 | 3825 | 2.8% |
11000-1.1
Text
Distinct | 9102 |
---|---|
Distinct (%) | 91.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 15 |
---|---|
Median length | 15 |
Mean length | 13.7712 |
Min length | 7 |
Characters and Unicode
Total characters | 137712 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 8652 ? |
---|---|
Unique (%) | 86.5% |
Sample
1st row | 11110-3459 |
---|---|
2nd row | 11110-100020333 |
3rd row | 11110-100014710 |
4th row | 11110-100010627 |
5th row | 11110-100018477 |
Value | Count | Frequency (%) |
11140-100030872 | 108 | 1.1% |
11140-100031373 | 35 | 0.4% |
11110-100028220 | 16 | 0.2% |
11110-100025921 | 15 | 0.1% |
11110-100011141 | 12 | 0.1% |
11110-100016415 | 11 | 0.1% |
11110-100028263 | 11 | 0.1% |
11110-100030948 | 11 | 0.1% |
11000-100004285 | 9 | 0.1% |
11110-100034290 | 9 | 0.1% |
Other values (9092) | 9763 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 51336 | |
0 | 37687 | |
- | 10000 | 7.3% |
4 | 7518 | 5.5% |
2 | 6405 | 4.7% |
3 | 5655 | 4.1% |
5 | 4301 | 3.1% |
7 | 3934 | 2.9% |
8 | 3716 | 2.7% |
9 | 3714 | 2.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 127712 | |
Dash Punctuation | 10000 | 7.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 51336 | |
0 | 37687 | |
4 | 7518 | 5.9% |
2 | 6405 | 5.0% |
3 | 5655 | 4.4% |
5 | 4301 | 3.4% |
7 | 3934 | 3.1% |
8 | 3716 | 2.9% |
9 | 3714 | 2.9% |
6 | 3446 | 2.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 137712 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 51336 | |
0 | 37687 | |
- | 10000 | 7.3% |
4 | 7518 | 5.5% |
2 | 6405 | 4.7% |
3 | 5655 | 4.1% |
5 | 4301 | 3.1% |
7 | 3934 | 2.9% |
8 | 3716 | 2.7% |
9 | 3714 | 2.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 137712 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 51336 | |
0 | 37687 | |
- | 10000 | 7.3% |
4 | 7518 | 5.5% |
2 | 6405 | 4.7% |
3 | 5655 | 4.1% |
5 | 4301 | 3.1% |
7 | 3934 | 2.9% |
8 | 3716 | 2.7% |
9 | 3714 | 2.7% |
현대슈퍼빌
Text
MISSING
 
Distinct | 1944 |
---|---|
Distinct (%) | 36.4% |
Missing | 4657 |
Missing (%) | 46.6% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
주건축물제1동 | 1206 | 19.5% |
1 | 561 | 9.1% |
1동 | 146 | 2.4% |
132 | 2.1% | |
2 | 104 | 1.7% |
a동 | 72 | 1.2% |
주택 | 52 | 0.8% |
3 | 51 | 0.8% |
단독주택 | 49 | 0.8% |
2동 | 44 | 0.7% |
Other values (2023) | 3775 |
Most occurring characters
Value | Count | Frequency (%) |
동 | 2873 | 10.2% |
1 | 2427 | 8.6% |
주 | 1798 | 6.4% |
건 | 1404 | 5.0% |
축 | 1404 | 5.0% |
물 | 1401 | 5.0% |
제 | 1333 | 4.7% |
851 | 3.0% | |
택 | 425 | 1.5% |
2 | 424 | 1.5% |
Other values (531) | 13939 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 22044 | |
Decimal Number | 4002 | 14.2% |
Space Separator | 851 | 3.0% |
Uppercase Letter | 590 | 2.1% |
Dash Punctuation | 273 | 1.0% |
Other Punctuation | 194 | 0.7% |
Lowercase Letter | 109 | 0.4% |
Close Punctuation | 103 | 0.4% |
Open Punctuation | 103 | 0.4% |
Letter Number | 5 | < 0.1% |
Other values (3) | 5 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 2873 | 13.0% |
주 | 1798 | 8.2% |
건 | 1404 | 6.4% |
축 | 1404 | 6.4% |
물 | 1401 | 6.4% |
제 | 1333 | 6.0% |
택 | 425 | 1.9% |
빌 | 405 | 1.8% |
딩 | 276 | 1.3% |
가 | 268 | 1.2% |
Other values (458) | 10457 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 116 | |
B | 85 | |
C | 43 | 7.3% |
E | 42 | 7.1% |
S | 31 | 5.3% |
D | 31 | 5.3% |
T | 30 | 5.1% |
F | 25 | 4.2% |
I | 23 | 3.9% |
O | 22 | 3.7% |
Other values (14) | 142 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 16 | |
a | 15 | |
r | 7 | 6.4% |
l | 7 | 6.4% |
t | 7 | 6.4% |
i | 7 | 6.4% |
s | 6 | 5.5% |
n | 6 | 5.5% |
k | 5 | 4.6% |
m | 5 | 4.6% |
Other values (11) | 28 |
Decimal Number
Value | Count | Frequency (%) |
1 | 2427 | |
2 | 424 | 10.6% |
0 | 260 | 6.5% |
3 | 237 | 5.9% |
4 | 168 | 4.2% |
5 | 144 | 3.6% |
6 | 112 | 2.8% |
7 | 96 | 2.4% |
8 | 73 | 1.8% |
9 | 61 | 1.5% |
Other Punctuation
Value | Count | Frequency (%) |
. | 139 | |
, | 32 | 16.5% |
: | 8 | 4.1% |
/ | 7 | 3.6% |
' | 4 | 2.1% |
# | 2 | 1.0% |
& | 2 | 1.0% |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 2 | |
Ⅶ | 1 | |
Ⅲ | 1 | |
Ⅱ | 1 |
Space Separator
Value | Count | Frequency (%) |
851 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 273 |
Close Punctuation
Value | Count | Frequency (%) |
) | 103 |
Open Punctuation
Value | Count | Frequency (%) |
( | 103 |
Math Symbol
Value | Count | Frequency (%) |
+ | 3 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 22043 | |
Common | 5531 | 19.6% |
Latin | 704 | 2.5% |
Han | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 2873 | 13.0% |
주 | 1798 | 8.2% |
건 | 1404 | 6.4% |
축 | 1404 | 6.4% |
물 | 1401 | 6.4% |
제 | 1333 | 6.0% |
택 | 425 | 1.9% |
빌 | 405 | 1.8% |
딩 | 276 | 1.3% |
가 | 268 | 1.2% |
Other values (457) | 10456 |
Latin
Value | Count | Frequency (%) |
A | 116 | |
B | 85 | 12.1% |
C | 43 | 6.1% |
E | 42 | 6.0% |
S | 31 | 4.4% |
D | 31 | 4.4% |
T | 30 | 4.3% |
F | 25 | 3.6% |
I | 23 | 3.3% |
O | 22 | 3.1% |
Other values (39) | 256 |
Common
Value | Count | Frequency (%) |
1 | 2427 | |
851 | 15.4% | |
2 | 424 | 7.7% |
- | 273 | 4.9% |
0 | 260 | 4.7% |
3 | 237 | 4.3% |
4 | 168 | 3.0% |
5 | 144 | 2.6% |
. | 139 | 2.5% |
6 | 112 | 2.0% |
Other values (14) | 496 | 9.0% |
Han
Value | Count | Frequency (%) |
家 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 22042 | |
ASCII | 6230 | 22.0% |
Number Forms | 5 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
CJK | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
동 | 2873 | 13.0% |
주 | 1798 | 8.2% |
건 | 1404 | 6.4% |
축 | 1404 | 6.4% |
물 | 1401 | 6.4% |
제 | 1333 | 6.0% |
택 | 425 | 1.9% |
빌 | 405 | 1.8% |
딩 | 276 | 1.3% |
가 | 268 | 1.2% |
Other values (456) | 10455 |
ASCII
Value | Count | Frequency (%) |
1 | 2427 | |
851 | 13.7% | |
2 | 424 | 6.8% |
- | 273 | 4.4% |
0 | 260 | 4.2% |
3 | 237 | 3.8% |
4 | 168 | 2.7% |
5 | 144 | 2.3% |
. | 139 | 2.2% |
A | 116 | 1.9% |
Other values (59) | 1191 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 2 | |
Ⅶ | 1 | |
Ⅲ | 1 | |
Ⅱ | 1 |
Compat Jamo
Value | Count | Frequency (%) |
ㅁ | 1 |
CJK
Value | Count | Frequency (%) |
家 | 1 |
02000
Categorical
Distinct | 30 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
01000 | |
---|---|
04000 | |
03000 | |
14000 | |
02000 | |
Other values (25) |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.9879 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 04000 |
---|---|
2nd row | 01000 |
3rd row | 01000 |
4th row | 04000 |
5th row | 28000 |
Common Values
Value | Count | Frequency (%) |
01000 | 2840 | |
04000 | 2021 | |
03000 | 1012 | 10.1% |
14000 | 979 | 9.8% |
02000 | 911 | 9.1% |
28000 | 770 | 7.7% |
15000 | 224 | 2.2% |
07000 | 189 | 1.9% |
10000 | 160 | 1.6% |
18000 | 131 | 1.3% |
Other values (20) | 763 | 7.6% |
Length
Value | Count | Frequency (%) |
01000 | 2840 | |
04000 | 2021 | |
03000 | 1012 | 10.1% |
14000 | 979 | 9.8% |
02000 | 911 | 9.1% |
28000 | 770 | 7.7% |
15000 | 224 | 2.2% |
07000 | 189 | 1.9% |
10000 | 160 | 1.6% |
18000 | 131 | 1.3% |
Other values (20) | 763 | 7.6% |
업무시설,운동시설,근린생활시설
Text
MISSING
 
Distinct | 1775 |
---|---|
Distinct (%) | 22.7% |
Missing | 2186 |
Missing (%) | 21.9% |
Memory size | 156.2 KiB |
Length
Max length | 46 |
---|---|
Median length | 39 |
Mean length | 7.3527003 |
Min length | 1 |
Characters and Unicode
Total characters | 57454 |
---|---|
Distinct characters | 301 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 4 ? |
Unique
Unique | 1239 ? |
---|---|
Unique (%) | 15.9% |
Sample
1st row | 수리점 |
---|---|
2nd row | 주택 |
3rd row | 사무소 |
4th row | 임시창고(농산물 단순 가공 및 농자재 보관) |
5th row | 휴게음식점 |
Value | Count | Frequency (%) |
주택 | 1444 | 15.0% |
근린생활시설 | 995 | 10.3% |
업무시설 | 351 | 3.6% |
다세대주택 | 317 | 3.3% |
사무실 | 294 | 3.1% |
일반음식점 | 288 | 3.0% |
다가구주택 | 283 | 2.9% |
단독주택 | 237 | 2.5% |
사무소 | 232 | 2.4% |
제2종근린생활시설 | 173 | 1.8% |
Other values (1361) | 5024 |
Most occurring characters
Value | Count | Frequency (%) |
시 | 4470 | 7.8% |
설 | 4143 | 7.2% |
주 | 3172 | 5.5% |
택 | 3007 | 5.2% |
, | 2753 | 4.8% |
생 | 2150 | 3.7% |
근 | 2109 | 3.7% |
활 | 2046 | 3.6% |
린 | 2020 | 3.5% |
무 | 1843 | 3.2% |
Other values (291) | 29741 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 49767 | |
Other Punctuation | 2882 | 5.0% |
Space Separator | 1826 | 3.2% |
Decimal Number | 1049 | 1.8% |
Open Punctuation | 939 | 1.6% |
Close Punctuation | 937 | 1.6% |
Dash Punctuation | 20 | < 0.1% |
Math Symbol | 12 | < 0.1% |
Lowercase Letter | 12 | < 0.1% |
Uppercase Letter | 10 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 4470 | 9.0% |
설 | 4143 | 8.3% |
주 | 3172 | 6.4% |
택 | 3007 | 6.0% |
생 | 2150 | 4.3% |
근 | 2109 | 4.2% |
활 | 2046 | 4.1% |
린 | 2020 | 4.1% |
무 | 1843 | 3.7% |
사 | 1331 | 2.7% |
Other values (251) | 23476 |
Decimal Number
Value | Count | Frequency (%) |
2 | 524 | |
1 | 360 | |
3 | 52 | 5.0% |
4 | 28 | 2.7% |
5 | 28 | 2.7% |
6 | 17 | 1.6% |
7 | 15 | 1.4% |
8 | 10 | 1.0% |
9 | 8 | 0.8% |
0 | 7 | 0.7% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 2 | |
r | 2 | |
x | 1 | |
o | 1 | |
h | 1 | |
n | 1 | |
w | 1 | |
s | 1 | |
k | 1 | |
m | 1 |
Other Punctuation
Value | Count | Frequency (%) |
, | 2753 | |
/ | 83 | 2.9% |
. | 40 | 1.4% |
: | 4 | 0.1% |
# | 1 | < 0.1% |
· | 1 | < 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
D | 3 | |
F | 3 | |
M | 2 | |
G | 1 | 10.0% |
A | 1 | 10.0% |
Math Symbol
Value | Count | Frequency (%) |
+ | 10 | |
> | 1 | 8.3% |
< | 1 | 8.3% |
Open Punctuation
Value | Count | Frequency (%) |
( | 938 | |
[ | 1 | 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 936 | |
] | 1 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
1826 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 20 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 49767 | |
Common | 7665 | 13.3% |
Latin | 22 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 4470 | 9.0% |
설 | 4143 | 8.3% |
주 | 3172 | 6.4% |
택 | 3007 | 6.0% |
생 | 2150 | 4.3% |
근 | 2109 | 4.2% |
활 | 2046 | 4.1% |
린 | 2020 | 4.1% |
무 | 1843 | 3.7% |
사 | 1331 | 2.7% |
Other values (251) | 23476 |
Common
Value | Count | Frequency (%) |
, | 2753 | |
1826 | ||
( | 938 | 12.2% |
) | 936 | 12.2% |
2 | 524 | 6.8% |
1 | 360 | 4.7% |
/ | 83 | 1.1% |
3 | 52 | 0.7% |
. | 40 | 0.5% |
4 | 28 | 0.4% |
Other values (15) | 125 | 1.6% |
Latin
Value | Count | Frequency (%) |
D | 3 | |
F | 3 | |
e | 2 | 9.1% |
r | 2 | 9.1% |
M | 2 | 9.1% |
x | 1 | 4.5% |
o | 1 | 4.5% |
h | 1 | 4.5% |
n | 1 | 4.5% |
w | 1 | 4.5% |
Other values (5) | 5 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 49765 | |
ASCII | 7686 | 13.4% |
Compat Jamo | 2 | < 0.1% |
None | 1 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
시 | 4470 | 9.0% |
설 | 4143 | 8.3% |
주 | 3172 | 6.4% |
택 | 3007 | 6.0% |
생 | 2150 | 4.3% |
근 | 2109 | 4.2% |
활 | 2046 | 4.1% |
린 | 2020 | 4.1% |
무 | 1843 | 3.7% |
사 | 1331 | 2.7% |
Other values (249) | 23474 |
ASCII
Value | Count | Frequency (%) |
, | 2753 | |
1826 | ||
( | 938 | 12.2% |
) | 936 | 12.2% |
2 | 524 | 6.8% |
1 | 360 | 4.7% |
/ | 83 | 1.1% |
3 | 52 | 0.7% |
. | 40 | 0.5% |
4 | 28 | 0.4% |
Other values (29) | 146 | 1.9% |
Compat Jamo
Value | Count | Frequency (%) |
ㅈ | 1 | |
ㅁ | 1 |
None
Value | Count | Frequency (%) |
· | 1 |
42
Real number (ℝ)
Distinct | 26 |
---|---|
Distinct (%) | 0.3% |
Missing | 59 |
Missing (%) | 0.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28.05764 |
Minimum | 10 |
---|---|
Maximum | 99 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 11 |
Q1 | 21 |
median | 21 |
Q3 | 39 |
95-th percentile | 51 |
Maximum | 99 |
Range | 89 |
Interquartile range (IQR) | 18 |
Descriptive statistics
Standard deviation | 16.065835 |
---|---|
Coefficient of variation (CV) | 0.57260109 |
Kurtosis | 4.9349202 |
Mean | 28.05764 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 1.8537297 |
Sum | 278921 |
Variance | 258.11106 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
21 | 4685 | |
11 | 1459 | 14.6% |
51 | 1369 | 13.7% |
32 | 680 | 6.8% |
42 | 510 | 5.1% |
31 | 364 | 3.6% |
39 | 350 | 3.5% |
99 | 180 | 1.8% |
12 | 138 | 1.4% |
19 | 87 | 0.9% |
Other values (16) | 119 | 1.2% |
(Missing) | 59 | 0.6% |
Value | Count | Frequency (%) |
10 | 3 | < 0.1% |
11 | 1459 | 14.6% |
12 | 138 | 1.4% |
13 | 11 | 0.1% |
19 | 87 | 0.9% |
20 | 1 | < 0.1% |
21 | 4685 | |
22 | 1 | < 0.1% |
29 | 2 | < 0.1% |
30 | 1 | < 0.1% |
Value | Count | Frequency (%) |
99 | 180 | 1.8% |
74 | 30 | 0.3% |
63 | 3 | < 0.1% |
61 | 1 | < 0.1% |
52 | 2 | < 0.1% |
51 | 1369 | |
50 | 2 | < 0.1% |
49 | 10 | 0.1% |
43 | 3 | < 0.1% |
42 | 510 | 5.1% |
Unnamed: 6
Text
MISSING
 
Distinct | 616 |
---|---|
Distinct (%) | 10.1% |
Missing | 3913 |
Missing (%) | 39.1% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
목조 | 1141 | |
철근콘크리트조 | 1085 | |
연와조 | 950 | |
철근콘크리트구조 | 514 | 7.6% |
컨테이너 | 473 | 7.0% |
철근콘크리트 | 386 | 5.7% |
철골철근콘크리트구조 | 211 | 3.1% |
세멘벽돌조 | 203 | 3.0% |
철골철근콘크리트조 | 133 | 2.0% |
철골조 | 119 | 1.8% |
Other values (430) | 1566 |
Most occurring characters
Value | Count | Frequency (%) |
조 | 5947 | |
철 | 3526 | 9.7% |
트 | 2669 | 7.3% |
콘 | 2668 | 7.3% |
리 | 2605 | 7.2% |
크 | 2602 | 7.1% |
근 | 2595 | 7.1% |
목 | 1247 | 3.4% |
구 | 1127 | 3.1% |
연 | 1075 | 3.0% |
Other values (166) | 10359 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 34596 | |
Other Punctuation | 908 | 2.5% |
Space Separator | 694 | 1.9% |
Open Punctuation | 58 | 0.2% |
Close Punctuation | 58 | 0.2% |
Uppercase Letter | 35 | 0.1% |
Decimal Number | 31 | 0.1% |
Math Symbol | 19 | 0.1% |
Lowercase Letter | 16 | < 0.1% |
Dash Punctuation | 5 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
조 | 5947 | |
철 | 3526 | |
트 | 2669 | 7.7% |
콘 | 2668 | 7.7% |
리 | 2605 | 7.5% |
크 | 2602 | 7.5% |
근 | 2595 | 7.5% |
목 | 1247 | 3.6% |
구 | 1127 | 3.3% |
연 | 1075 | 3.1% |
Other values (128) | 8535 |
Uppercase Letter
Value | Count | Frequency (%) |
R | 9 | |
C | 5 | |
F | 4 | |
P | 4 | |
A | 3 | 8.6% |
S | 3 | 8.6% |
B | 3 | 8.6% |
E | 3 | 8.6% |
D | 1 | 2.9% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 3 | |
b | 2 | |
p | 2 | |
r | 2 | |
f | 2 | |
a | 2 | |
g | 1 | 6.2% |
i | 1 | 6.2% |
t | 1 | 6.2% |
Decimal Number
Value | Count | Frequency (%) |
1 | 9 | |
3 | 6 | |
2 | 5 | |
6 | 4 | |
4 | 3 | 9.7% |
8 | 2 | 6.5% |
0 | 1 | 3.2% |
5 | 1 | 3.2% |
Other Punctuation
Value | Count | Frequency (%) |
, | 831 | |
/ | 31 | 3.4% |
. | 28 | 3.1% |
: | 16 | 1.8% |
? | 1 | 0.1% |
· | 1 | 0.1% |
Math Symbol
Value | Count | Frequency (%) |
+ | 18 | |
~ | 1 | 5.3% |
Space Separator
Value | Count | Frequency (%) |
694 |
Open Punctuation
Value | Count | Frequency (%) |
( | 58 |
Close Punctuation
Value | Count | Frequency (%) |
) | 58 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 5 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 34596 | |
Common | 1773 | 4.9% |
Latin | 51 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
조 | 5947 | |
철 | 3526 | |
트 | 2669 | 7.7% |
콘 | 2668 | 7.7% |
리 | 2605 | 7.5% |
크 | 2602 | 7.5% |
근 | 2595 | 7.5% |
목 | 1247 | 3.6% |
구 | 1127 | 3.3% |
연 | 1075 | 3.1% |
Other values (128) | 8535 |
Common
Value | Count | Frequency (%) |
, | 831 | |
694 | ||
( | 58 | 3.3% |
) | 58 | 3.3% |
/ | 31 | 1.7% |
. | 28 | 1.6% |
+ | 18 | 1.0% |
: | 16 | 0.9% |
1 | 9 | 0.5% |
3 | 6 | 0.3% |
Other values (10) | 24 | 1.4% |
Latin
Value | Count | Frequency (%) |
R | 9 | |
C | 5 | |
F | 4 | 7.8% |
P | 4 | 7.8% |
A | 3 | 5.9% |
S | 3 | 5.9% |
e | 3 | 5.9% |
B | 3 | 5.9% |
E | 3 | 5.9% |
b | 2 | 3.9% |
Other values (8) | 12 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 34596 | |
ASCII | 1822 | 5.0% |
None | 2 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
조 | 5947 | |
철 | 3526 | |
트 | 2669 | 7.7% |
콘 | 2668 | 7.7% |
리 | 2605 | 7.5% |
크 | 2602 | 7.5% |
근 | 2595 | 7.5% |
목 | 1247 | 3.6% |
구 | 1127 | 3.3% |
연 | 1075 | 3.1% |
Other values (128) | 8535 |
ASCII
Value | Count | Frequency (%) |
, | 831 | |
694 | ||
( | 58 | 3.2% |
) | 58 | 3.2% |
/ | 31 | 1.7% |
. | 28 | 1.5% |
+ | 18 | 1.0% |
: | 16 | 0.9% |
1 | 9 | 0.5% |
R | 9 | 0.5% |
Other values (26) | 70 | 3.8% |
None
Value | Count | Frequency (%) |
? | 1 | |
· | 1 |
10
Categorical
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
10 | |
---|---|
20 | |
<NA> | |
90 | |
30 | 82 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.2354 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 90 |
---|---|
2nd row | 10 |
3rd row | 20 |
4th row | 10 |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
10 | 6077 | |
20 | 1639 | 16.4% |
<NA> | 1177 | 11.8% |
90 | 1024 | 10.2% |
30 | 82 | 0.8% |
39 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
10 | 6077 | |
20 | 1639 | 16.4% |
na | 1177 | 11.8% |
90 | 1024 | 10.2% |
30 | 82 | 0.8% |
39 | 1 | < 0.1% |
6992.960000000
Real number (ℝ)
HIGH CORRELATION
  SKEWED
  ZEROS
 
Distinct | 4989 |
---|---|
Distinct (%) | 49.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1114.7502 |
Minimum | -3244.74 |
---|---|
Maximum | 4125652 |
Zeros | 3016 |
Zeros (%) | 30.2% |
Negative | 30 |
Negative (%) | 0.3% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -3244.74 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 46.13 |
Q3 | 148.2125 |
95-th percentile | 1315.54 |
Maximum | 4125652 |
Range | 4128896.7 |
Interquartile range (IQR) | 148.2125 |
Descriptive statistics
Standard deviation | 55541.925 |
---|---|
Coefficient of variation (CV) | 49.824546 |
Kurtosis | 5041.9104 |
Mean | 1114.7502 |
Median Absolute Deviation (MAD) | 46.13 |
Skewness | 70.883936 |
Sum | 11147502 |
Variance | 3.0849054 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 3016 | |
18.0 | 328 | 3.3% |
27.0 | 130 | 1.3% |
9.0 | 70 | 0.7% |
5.76 | 32 | 0.3% |
36.0 | 25 | 0.2% |
4671.76 | 24 | 0.2% |
54.0 | 22 | 0.2% |
12.0 | 17 | 0.2% |
33.06 | 17 | 0.2% |
Other values (4979) | 6319 |
Value | Count | Frequency (%) |
-3244.74 | 1 | |
-239.99 | 2 | |
-216.66 | 1 | |
-164.1 | 1 | |
-95.97 | 1 | |
-56.2 | 1 | |
-37.11 | 1 | |
-30.56 | 2 | |
-27.4 | 1 | |
-25.74 | 1 |
Value | Count | Frequency (%) |
4125652.0 | 1 | |
3715601.0 | 1 | |
80409.6 | 1 | |
54944.55 | 2 | |
54936.68 | 1 | |
25500.54 | 1 | |
24913.76 | 1 | |
23742.37 | 1 | |
22793.79 | 1 | |
19556.91 | 1 |
226180.460000000
Real number (ℝ)
HIGH CORRELATION
  SKEWED
  ZEROS
 
Distinct | 6656 |
---|---|
Distinct (%) | 66.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4305.3519 |
Minimum | -13432.91 |
---|---|
Maximum | 1290999.3 |
Zeros | 1039 |
Zeros (%) | 10.4% |
Negative | 35 |
Negative (%) | 0.4% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -13432.91 |
---|---|
5-th percentile | 0 |
Q1 | 29.7225 |
median | 126.765 |
Q3 | 588.77 |
95-th percentile | 18370.068 |
Maximum | 1290999.3 |
Range | 1304432.2 |
Interquartile range (IQR) | 559.0475 |
Descriptive statistics
Standard deviation | 22763.23 |
---|---|
Coefficient of variation (CV) | 5.2871938 |
Kurtosis | 1059.0252 |
Mean | 4305.3519 |
Median Absolute Deviation (MAD) | 126.765 |
Skewness | 22.442195 |
Sum | 43053519 |
Variance | 5.1816464 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 1039 | 10.4% |
18.0 | 276 | 2.8% |
27.0 | 102 | 1.0% |
36.0 | 81 | 0.8% |
9.0 | 66 | 0.7% |
54.0 | 51 | 0.5% |
33.06 | 41 | 0.4% |
46.28 | 41 | 0.4% |
5.76 | 33 | 0.3% |
49.59 | 31 | 0.3% |
Other values (6646) | 8239 |
Value | Count | Frequency (%) |
-13432.91 | 1 | |
-3044.7 | 1 | |
-1333.66 | 1 | |
-1131.98 | 1 | |
-1004.36 | 2 | |
-809.24 | 1 | |
-196.68 | 1 | |
-178.48 | 1 | |
-126.12 | 1 | |
-82.34 | 1 |
Value | Count | Frequency (%) |
1290999.29 | 1 | |
426635.55 | 1 | |
274056.67 | 1 | |
267746.04 | 1 | |
265791.63 | 1 | |
263708.82 | 1 | |
262184.08 | 2 | |
262143.41 | 1 | |
249837.37 | 1 | |
246003.09 | 1 |
46
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 49 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.4488 |
Minimum | 0 |
---|---|
Maximum | 72 |
Zeros | 232 |
Zeros (%) | 2.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 3 |
Q3 | 5 |
95-th percentile | 16 |
Maximum | 72 |
Range | 72 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 5.5607617 |
---|---|
Coefficient of variation (CV) | 1.2499464 |
Kurtosis | 14.444624 |
Mean | 4.4488 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 3.2326779 |
Sum | 44488 |
Variance | 30.922071 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 2571 | |
2 | 1998 | |
3 | 1354 | |
4 | 1049 | |
5 | 883 | 8.8% |
6 | 334 | 3.3% |
0 | 232 | 2.3% |
7 | 190 | 1.9% |
10 | 180 | 1.8% |
8 | 133 | 1.3% |
Other values (39) | 1076 |
Value | Count | Frequency (%) |
0 | 232 | 2.3% |
1 | 2571 | |
2 | 1998 | |
3 | 1354 | |
4 | 1049 | |
5 | 883 | 8.8% |
6 | 334 | 3.3% |
7 | 190 | 1.9% |
8 | 133 | 1.3% |
9 | 114 | 1.1% |
Value | Count | Frequency (%) |
72 | 1 | < 0.1% |
69 | 1 | < 0.1% |
51 | 1 | < 0.1% |
49 | 1 | < 0.1% |
48 | 1 | < 0.1% |
46 | 1 | < 0.1% |
43 | 1 | < 0.1% |
41 | 2 | |
40 | 3 | |
39 | 2 |
3
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 12 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.9656 |
Minimum | 0 |
---|---|
Maximum | 19 |
Zeros | 4765 |
Zeros (%) | 47.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 1 |
95-th percentile | 4 |
Maximum | 19 |
Range | 19 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.5066 |
---|---|
Coefficient of variation (CV) | 1.5602734 |
Kurtosis | 8.5258396 |
Mean | 0.9656 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 2.6285462 |
Sum | 9656 |
Variance | 2.2698436 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 4765 | |
1 | 3636 | |
2 | 611 | 6.1% |
3 | 255 | 2.5% |
4 | 238 | 2.4% |
6 | 154 | 1.5% |
7 | 150 | 1.5% |
5 | 147 | 1.5% |
8 | 35 | 0.4% |
9 | 7 | 0.1% |
Other values (2) | 2 | < 0.1% |
Value | Count | Frequency (%) |
0 | 4765 | |
1 | 3636 | |
2 | 611 | 6.1% |
3 | 255 | 2.5% |
4 | 238 | 2.4% |
5 | 147 | 1.5% |
6 | 154 | 1.5% |
7 | 150 | 1.5% |
8 | 35 | 0.4% |
9 | 7 | 0.1% |
Value | Count | Frequency (%) |
19 | 1 | < 0.1% |
10 | 1 | < 0.1% |
9 | 7 | 0.1% |
8 | 35 | 0.4% |
7 | 150 | 1.5% |
6 | 154 | 1.5% |
5 | 147 | 1.5% |
4 | 238 | 2.4% |
3 | 255 | |
2 | 611 |
02000 | 42 | 10 | 6992.960000000 | 226180.460000000 | 46 | 3 | |
---|---|---|---|---|---|---|---|
02000 | 1.000 | 0.670 | 0.395 | 0.000 | 0.287 | 0.571 | 0.568 |
42 | 0.670 | 1.000 | 0.587 | 0.000 | 0.150 | 0.322 | 0.364 |
10 | 0.395 | 0.587 | 1.000 | 0.000 | 0.056 | 0.216 | 0.199 |
6992.960000000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.057 | 0.043 |
226180.460000000 | 0.287 | 0.150 | 0.056 | 0.000 | 1.000 | 0.431 | 0.245 |
46 | 0.571 | 0.322 | 0.216 | 0.057 | 0.431 | 1.000 | 0.566 |
3 | 0.568 | 0.364 | 0.199 | 0.043 | 0.245 | 0.566 | 1.000 |
02000 | 10 | |
---|---|---|
02000 | 1.000 | 0.202 |
10 | 0.202 | 1.000 |
42 | 6992.960000000 | 226180.460000000 | 46 | 3 | 02000 | 10 | |
---|---|---|---|---|---|---|---|
42 | 1.000 | -0.074 | -0.190 | -0.243 | -0.235 | 0.339 | 0.448 |
6992.960000000 | -0.074 | 1.000 | 0.778 | 0.409 | 0.326 | 0.000 | 0.000 |
226180.460000000 | -0.190 | 0.778 | 1.000 | 0.505 | 0.434 | 0.140 | 0.021 |
46 | -0.243 | 0.409 | 0.505 | 1.000 | 0.647 | 0.254 | 0.126 |
3 | -0.235 | 0.326 | 0.434 | 0.647 | 1.000 | 0.275 | 0.128 |
02000 | 0.339 | 0.000 | 0.140 | 0.254 | 0.275 | 1.000 | 0.202 |
10 | 0.448 | 0.000 | 0.021 | 0.126 | 0.128 | 0.202 | 1.000 |
11000-1 | 11000-1.1 | 현대슈퍼빌 | 02000 | 업무시설,운동시설,근린생활시설 | 42 | Unnamed: 6 | 10 | 6992.960000000 | 226180.460000000 | 46 | 3 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
15064 | 11110-4111 | 11110-3459 | <NA> | 04000 | 수리점 | 51 | <NA> | 90 | 59.5 | 59.5 | 1 | 0 |
6253 | 11110-100019540 | 11110-100020333 | <NA> | 01000 | 주택 | 21 | 철근콘크리트조 | 10 | 0.0 | 0.0 | 3 | 0 |
3590 | 11110-100013544 | 11110-100014710 | 주건축물제1동 | 01000 | <NA> | 51 | <NA> | 20 | 33.06 | 33.06 | 1 | 0 |
4736 | 11110-100016571 | 11110-100010627 | <NA> | 04000 | 사무소 | 21 | <NA> | 10 | 115.81 | 491.76 | 5 | 1 |
5175 | 11110-100017393 | 11110-100018477 | 1 | 28000 | 임시창고(농산물 단순 가공 및 농자재 보관) | 32 | <NA> | <NA> | 36.0 | 36.0 | 1 | 0 |
1192 | 11110-100007070 | 11110-100008507 | <NA> | 03000 | 휴게음식점 | 11 | 연와조 | 90 | 0.0 | 0.0 | 2 | 0 |
2677 | 11110-100010935 | 11110-100012880 | 백상빌딩 | 03000 | 제1종근린생활시설, 업무시설 | 21 | 철근콘크리트구조 | 10 | 615.17 | 2995.68 | 4 | 1 |
17342 | 11140-100007606 | 11140-100004792 | 리안빌리지 | 02000 | 다세대 | 21 | <NA> | 10 | 205.95 | 658.11 | 5 | 0 |
19785 | 11140-100028624 | 11140-100025332 | <NA> | 14000 | 업무시설 | 21 | 철근콘크리트 | 10 | 2496.05 | 39343.15 | 20 | 5 |
23066 | 11140-100060886 | 11140-100051152 | <NA> | 07000 | 판매시설, 업무시설 | 42 | 철골철근콘크리트조 | 10 | 2431.14 | 49938.92 | 18 | 7 |
11000-1 | 11000-1.1 | 현대슈퍼빌 | 02000 | 업무시설,운동시설,근린생활시설 | 42 | Unnamed: 6 | 10 | 6992.960000000 | 226180.460000000 | 46 | 3 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
6351 | 11110-100019783 | 11110-100019643 | <NA> | 04000 | 제1종근린생활시설,영업용, 위락시설(전자유기장) | 21 | 철근콘크리트조 | 10 | 161.16 | 943.35 | 6 | 0 |
18512 | 11140-100015359 | 11140-100013827 | <NA> | 14000 | 사무실, 근린생활시설 | 21 | 철근콘크리트 | 10 | 903.71 | 5533.6 | 6 | 1 |
22563 | 11140-100054068 | 11140-100046032 | A동 | 14000 | 업무시설, 근린생활시설, 판매시설, 노유자시설 | 42 | 철골철근콘크리트조, 철근콘크리트조 | 10 | 8013.45 | 132792.56 | 23 | 2 |
16269 | 11110-5363 | 11110-4628 | 창신동 다세대 | 02000 | 다세대주택 | 21 | 경량철골 | 10 | 50.67 | 22.38 | 5 | 0 |
11750 | 11110-100041385 | 11110-100038891 | 주건축물제1동 | 10000 | 문화및집회시설,교육연구시설,근린생활시설 | 41 | 철골콘크리트구조 | 10 | 602.55 | 6377.92 | 5 | 5 |
21126 | 11140-100040485 | 11140-100033992 | <NA> | 04000 | <NA> | 21 | <NA> | 10 | 261.16 | 1135.21 | 5 | 1 |
378 | 11110-100005144 | 11110-100004265 | 1 | 04000 | <NA> | 51 | 목조 | 20 | 0.0 | 0.0 | 1 | 0 |
516 | 11110-100005491 | 11110-100005054 | 가설-1 | 28000 | 공사용가설 | 39 | 콘테이너 | <NA> | 18.0 | 18.0 | 1 | 0 |
8333 | 11110-100025440 | 11110-100025793 | <NA> | 04000 | 점포 | 11 | 연와조 | 20 | 0.0 | 63.14 | 1 | 0 |
15269 | 11110-4317 | 11110-3632 | 종로6가 창고 | 18000 | <NA> | 31 | <NA> | 90 | 67.16 | 134.32 | 2 | 0 |