Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 488.3 KiB |
Average record size in memory | 50.0 B |
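The average record size is the total memory footprint divided by the number of observations. A minimal Python check, assuming 1 KiB = 1024 bytes and using the rounded figures from the table above as inputs (the variable names are illustrative only):

```python
# Figures copied from the dataset statistics table (already rounded by the report).
total_kib = 488.3
n_observations = 10_000

total_bytes = total_kib * 1024            # KiB -> bytes, assuming 1 KiB = 1024 B
avg_record_bytes = total_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")
```

Because the inputs are rounded, the result only needs to agree with the reported value to one decimal place.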
Variable types
Text | 3 |
---|---|
Categorical | 1 |
Numeric | 1 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15820/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-11 06:00:06.490163 |
---|---|
Analysis finished | 2024-05-11 06:00:07.349612 |
Duration | 0.86 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
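The reported duration is the difference between the two timestamps above. A small sketch of that arithmetic using only the Python standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2024-05-11 06:00:06.490163")
finished = datetime.fromisoformat("2024-05-11 06:00:07.349612")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```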
아파트명
Text
Distinct | 2197 |
---|---|
Distinct (%) | 22.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 22 |
---|---|
Median length | 20 |
Mean length | 7.2552 |
Min length | 2 |
Characters and Unicode
Total characters | 72552 |
---|---|
Distinct characters | 436 |
Distinct categories | 11 |
Distinct scripts | 3 |
Distinct blocks | 3 |
Unique
Unique | 116 |
---|---|
Unique (%) | 1.2% |
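"Distinct" and "Unique" are easy to confuse. The sketch below assumes the usual profiling convention: distinct counts the different values present, while unique counts the values that occur exactly once. The toy column is hypothetical and is not taken from the dataset; the last line simply re-derives the two percentages from the counts reported above.

```python
from collections import Counter

# Hypothetical toy column, used only to illustrate the two definitions.
names = ["래미안", "아이파크", "래미안", "힐스테이트", "아이파크", "신반포"]

counts = Counter(names)
n_distinct = len(counts)                              # number of different values
n_unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

print(n_distinct, n_unique)

# Percentages for the counts reported in this section (10,000 observations).
distinct_pct = f"{2197 / 10_000:.1%}"
unique_pct = f"{116 / 10_000:.1%}"
print(distinct_pct, unique_pct)
```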
Sample
1st row | 중계주공10단지 |
---|---|
2nd row | 상계한일유엔아이 |
3rd row | e편한세상마포리버파크 |
4th row | 목동진도1차 |
5th row | 약수하이츠아파트(임대) |
Value | Count | Frequency (%) |
아파트 | 137 | 1.3% |
래미안 | 33 | 0.3% |
아이파크 | 23 | 0.2% |
잠실레이크팰리스 | 14 | 0.1% |
래미안밤섬리베뉴 | 14 | 0.1% |
은평뉴타운상림마을6단지 | 14 | 0.1% |
e편한세상 | 13 | 0.1% |
힐스테이트 | 13 | 0.1% |
신반포 | 13 | 0.1% |
고덕 | 13 | 0.1% |
Other values (2261) | 10345 |
Most occurring characters
Value | Count | Frequency (%) |
아 | 2384 | 3.3% |
파 | 2342 | 3.2% |
트 | 2095 | 2.9% |
대 | 1876 | 2.6% |
동 | 1776 | 2.4% |
지 | 1769 | 2.4% |
차 | 1567 | 2.2% |
신 | 1529 | 2.1% |
단 | 1385 | 1.9% |
성 | 1333 | 1.8% |
Other values (426) | 54496 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 66423 | 91.6% |
Decimal Number | 3780 | 5.2% |
Uppercase Letter | 743 | 1.0% |
Space Separator | 699 | 1.0% |
Lowercase Letter | 312 | 0.4% |
Open Punctuation | 170 | 0.2% |
Close Punctuation | 170 | 0.2% |
Dash Punctuation | 131 | 0.2% |
Other Punctuation | 117 | 0.2% |
Letter Number | 4 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 2384 | 3.6% |
파 | 2342 | 3.5% |
트 | 2095 | 3.2% |
대 | 1876 | 2.8% |
동 | 1776 | 2.7% |
지 | 1769 | 2.7% |
차 | 1567 | 2.4% |
신 | 1529 | 2.3% |
단 | 1385 | 2.1% |
성 | 1333 | 2.0% |
Other values (380) | 48367 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 108 | 14.5% |
K | 105 | 14.1% |
S | 104 | 14.0% |
M | 62 | 8.3% |
D | 62 | 8.3% |
L | 61 | 8.2% |
I | 43 | 5.8% |
H | 41 | 5.5% |
G | 25 | 3.4% |
E | 23 | 3.1% |
Other values (7) | 109 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 173 | 55.4% |
i | 28 | 9.0% |
l | 26 | 8.3% |
s | 17 | 5.4% |
v | 16 | 5.1% |
k | 16 | 5.1% |
c | 8 | 2.6% |
g | 8 | 2.6% |
a | 8 | 2.6% |
w | 7 | 2.2% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1178 | 31.2% |
2 | 1120 | 29.6% |
3 | 470 | 12.4% |
4 | 255 | 6.7% |
5 | 219 | 5.8% |
6 | 162 | 4.3% |
7 | 108 | 2.9% |
9 | 97 | 2.6% |
8 | 86 | 2.3% |
0 | 85 | 2.2% |
Other Punctuation
Value | Count | Frequency (%) |
, | 96 | 82.1% |
. | 21 | 17.9% |
Space Separator
Value | Count | Frequency (%) |
(space) | 699 |
Open Punctuation
Value | Count | Frequency (%) |
( | 170 |
Close Punctuation
Value | Count | Frequency (%) |
) | 170 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 131 |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 4 |
Math Symbol
Value | Count | Frequency (%) |
~ | 3 |
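The category names above correspond to Unicode general categories. As a rough illustration, and not necessarily how the profiling tool computes them, Python's standard `unicodedata` module can tally the same categories for the five sample values listed for this column. It returns two-letter codes: `Lo` (Other Letter), `Nd` (Decimal Number), `Ll` (Lowercase Letter), `Ps` and `Pe` (Open and Close Punctuation).

```python
import unicodedata
from collections import Counter

# The five sample values shown in the Sample table of this column.
samples = [
    "중계주공10단지",
    "상계한일유엔아이",
    "e편한세상마포리버파크",
    "목동진도1차",
    "약수하이츠아파트(임대)",
]

# Normalize to NFC so each Hangul syllable is counted as a single character.
normalized = [unicodedata.normalize("NFC", name) for name in samples]

# Tally the general category code of every character.
category_counts = Counter(
    unicodedata.category(ch) for name in normalized for ch in name
)

for code, count in category_counts.most_common():
    print(code, count)
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the table for this column.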
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 66423 | 91.6% |
Common | 5070 | 7.0% |
Latin | 1059 | 1.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 2384 | 3.6% |
파 | 2342 | 3.5% |
트 | 2095 | 3.2% |
대 | 1876 | 2.8% |
동 | 1776 | 2.7% |
지 | 1769 | 2.7% |
차 | 1567 | 2.4% |
신 | 1529 | 2.3% |
단 | 1385 | 2.1% |
성 | 1333 | 2.0% |
Other values (380) | 48367 |
Latin
Value | Count | Frequency (%) |
e | 173 | 16.3% |
C | 108 | 10.2% |
K | 105 | 9.9% |
S | 104 | 9.8% |
M | 62 | 5.9% |
D | 62 | 5.9% |
L | 61 | 5.8% |
I | 43 | 4.1% |
H | 41 | 3.9% |
i | 28 | 2.6% |
Other values (19) | 272 |
Common
Value | Count | Frequency (%) |
1 | 1178 | 23.2% |
2 | 1120 | 22.1% |
(space) | 699 | 13.8% |
3 | 470 | 9.3% |
4 | 255 | 5.0% |
5 | 219 | 4.3% |
( | 170 | 3.4% |
) | 170 | 3.4% |
6 | 162 | 3.2% |
- | 131 | 2.6% |
Other values (7) | 496 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 66423 | 91.6% |
ASCII | 6125 | 8.4% |
Number Forms | 4 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
아 | 2384 | 3.6% |
파 | 2342 | 3.5% |
트 | 2095 | 3.2% |
대 | 1876 | 2.8% |
동 | 1776 | 2.7% |
지 | 1769 | 2.7% |
차 | 1567 | 2.4% |
신 | 1529 | 2.3% |
단 | 1385 | 2.1% |
성 | 1333 | 2.0% |
Other values (380) | 48367 |
ASCII
Value | Count | Frequency (%) |
1 | 1178 | 19.2% |
2 | 1120 | 18.3% |
(space) | 699 | 11.4% |
3 | 470 | 7.7% |
4 | 255 | 4.2% |
5 | 219 | 3.6% |
e | 173 | 2.8% |
( | 170 | 2.8% |
) | 170 | 2.8% |
6 | 162 | 2.6% |
Other values (35) | 1509 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 4 |
아파트코드
Text
Distinct | 2204 |
---|---|
Distinct (%) | 22.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
a13822001 | 14 | 0.1% |
a12179505 | 12 | 0.1% |
a15288611 | 11 | 0.1% |
a12013202 | 11 | 0.1% |
a15703304 | 11 | 0.1% |
a10045302 | 11 | 0.1% |
a13905105 | 11 | 0.1% |
a13821004 | 11 | 0.1% |
a14380414 | 11 | 0.1% |
a13987306 | 11 | 0.1% |
Other values (2194) | 9886 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 18456 | 20.5% |
1 | 17604 | 19.6% |
A | 9989 | 11.1% |
3 | 8885 | 9.9% |
2 | 8171 | 9.1% |
5 | 6260 | 7.0% |
8 | 5663 | 6.3% |
7 | 4760 | 5.3% |
4 | 3916 | 4.4% |
6 | 3344 | 3.7% |
Other values (2) | 2952 | 3.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 80000 | 88.9% |
Uppercase Letter | 10000 | 11.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 18456 | 23.1% |
1 | 17604 | 22.0% |
3 | 8885 | 11.1% |
2 | 8171 | 10.2% |
5 | 6260 | 7.8% |
8 | 5663 | 7.1% |
7 | 4760 | 5.9% |
4 | 3916 | 4.9% |
6 | 3344 | 4.2% |
9 | 2941 | 3.7% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 9989 | 99.9% |
B | 11 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 80000 | 88.9% |
Latin | 10000 | 11.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 18456 | 23.1% |
1 | 17604 | 22.0% |
3 | 8885 | 11.1% |
2 | 8171 | 10.2% |
5 | 6260 | 7.8% |
8 | 5663 | 7.1% |
7 | 4760 | 5.9% |
4 | 3916 | 4.9% |
6 | 3344 | 4.2% |
9 | 2941 | 3.7% |
Latin
Value | Count | Frequency (%) |
A | 9989 | 99.9% |
B | 11 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 90000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 18456 | 20.5% |
1 | 17604 | 19.6% |
A | 9989 | 11.1% |
3 | 8885 | 9.9% |
2 | 8171 | 9.1% |
5 | 6260 | 7.0% |
8 | 5663 | 6.3% |
7 | 4760 | 5.3% |
4 | 3916 | 4.4% |
6 | 3344 | 3.7% |
Other values (2) | 2952 | 3.3% |
비용명
Text
Distinct | 77 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
예금 | 333 | 3.3% |
공동주택적립금 | 319 | 3.2% |
퇴직급여충당부채 | 317 | 3.2% |
미처분이익잉여금 | 311 | 3.1% |
장기수선충당부채 | 309 | 3.1% |
당기순이익 | 309 | 3.1% |
선급비용 | 305 | 3.0% |
예수금 | 300 | 3.0% |
장기수선충당예금 | 299 | 3.0% |
연차수당충당부채 | 297 | 3.0% |
Other values (67) | 6901 |
Most occurring characters
Value | Count | Frequency (%) |
금 | 4715 | 7.9% |
당 | 3770 | 6.3% |
수 | 3181 | 5.3% |
충 | 3092 | 5.2% |
비 | 3007 | 5.0% |
부 | 2972 | 5.0% |
채 | 2685 | 4.5% |
기 | 2359 | 3.9% |
선 | 1915 | 3.2% |
예 | 1793 | 3.0% |
Other values (97) | 30480 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 59969 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
금 | 4715 | 7.9% |
당 | 3770 | 6.3% |
수 | 3181 | 5.3% |
충 | 3092 | 5.2% |
비 | 3007 | 5.0% |
부 | 2972 | 5.0% |
채 | 2685 | 4.5% |
기 | 2359 | 3.9% |
선 | 1915 | 3.2% |
예 | 1793 | 3.0% |
Other values (97) | 30480 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 59969 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
금 | 4715 | 7.9% |
당 | 3770 | 6.3% |
수 | 3181 | 5.3% |
충 | 3092 | 5.2% |
비 | 3007 | 5.0% |
부 | 2972 | 5.0% |
채 | 2685 | 4.5% |
기 | 2359 | 3.9% |
선 | 1915 | 3.2% |
예 | 1793 | 3.0% |
Other values (97) | 30480 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 59969 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
금 | 4715 | 7.9% |
당 | 3770 | 6.3% |
수 | 3181 | 5.3% |
충 | 3092 | 5.2% |
비 | 3007 | 5.0% |
부 | 2972 | 5.0% |
채 | 2685 | 4.5% |
기 | 2359 | 3.9% |
선 | 1915 | 3.2% |
예 | 1793 | 3.0% |
Other values (97) | 30480 |
년월일
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
202007 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 202007 |
---|---|
2nd row | 202007 |
3rd row | 202007 |
4th row | 202007 |
5th row | 202007 |
Common Values
Value | Count | Frequency (%) |
202007 | 10000 |
금액
Real number (ℝ)
ZEROS
Distinct | 7439 |
---|---|
Distinct (%) | 74.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 74471242 |
Minimum | -4.1436094 × 10^8 |
---|---|
Maximum | 8.0600576 × 10^9 |
Zeros | 2218 |
Zeros (%) | 22.2% |
Negative | 349 |
Negative (%) | 3.5% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -4.1436094 × 10^8 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 3276050 |
Q3 | 33037224 |
95-th percentile | 3.5670219 × 10^8 |
Maximum | 8.0600576 × 10^9 |
Range | 8.4744185 × 10^9 |
Interquartile range (IQR) | 33037224 |
Descriptive statistics
Standard deviation | 3.0636794 × 10^8 |
---|---|
Coefficient of variation (CV) | 4.1139094 |
Kurtosis | 203.89124 |
Mean | 74471242 |
Median Absolute Deviation (MAD) | 3276050 |
Skewness | 11.881101 |
Sum | 7.4471242 × 10^11 |
Variance | 9.3861314 × 10^16 |
Monotonicity | Not monotonic |
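Several of the figures above are derived from one another: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, the sum is the mean times the number of observations, the range is maximum minus minimum, and the IQR is Q3 minus Q1. The following sketch re-derives them from the rounded values in the report; the variable names are illustrative, and small differences in the last digits are expected from rounding.

```python
# Rounded figures copied from the report for the 금액 column.
n = 10_000
mean = 74_471_242
std = 3.0636794e8
minimum = -4.1436094e8
maximum = 8.0600576e9
q1, q3 = 0, 33_037_224

cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance is the squared standard deviation
total = mean * n                  # sum of all values
value_range = maximum - minimum   # range
iqr = q3 - q1                     # interquartile range

print(f"CV       = {cv:.4f}")
print(f"Variance = {variance:.4e}")
print(f"Sum      = {total:.4e}")
print(f"Range    = {value_range:.4e}")
print(f"IQR      = {iqr}")
```

A coefficient of variation above 4, together with the high skewness and kurtosis, is consistent with a heavily right-skewed column in which a few very large balances dominate.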
Value | Count | Frequency (%) |
0 | 2218 | 22.2% |
500000 | 32 | 0.3% |
250000 | 24 | 0.2% |
300000 | 18 | 0.2% |
20000000 | 15 | 0.1% |
484000 | 13 | 0.1% |
200000 | 11 | 0.1% |
242000 | 11 | 0.1% |
2000000 | 10 | 0.1% |
1000000 | 9 | 0.1% |
Other values (7429) | 7639 |
Value | Count | Frequency (%) |
-414360936 | 1 | < 0.1% |
-272693490 | 1 | < 0.1% |
-241847440 | 1 | < 0.1% |
-230356964 | 1 | < 0.1% |
-173982688 | 1 | < 0.1% |
-144739812 | 1 | < 0.1% |
-119335150 | 1 | < 0.1% |
-102315520 | 1 | < 0.1% |
-88521217 | 1 | < 0.1% |
-85222720 | 1 | < 0.1% |
Value | Count | Frequency (%) |
8060057555 | 1 | < 0.1% |
7252359073 | 1 | < 0.1% |
6577801845 | 2 | < 0.1% |
6262167228 | 1 | < 0.1% |
5264362142 | 1 | < 0.1% |
5204261155 | 1 | < 0.1% |
5184606532 | 1 | < 0.1% |
5143453658 | 1 | < 0.1% |
5097470478 | 1 | < 0.1% |
4461069091 | 1 | < 0.1% |
Variable | 비용명 | 금액 |
---|---|---|
비용명 | 1.000 | 0.509 |
금액 | 0.509 | 1.000 |
Row index | 아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 |
---|---|---|---|---|---|
44261 | 중계주공10단지 | A13986004 | 장기수선충당예금 | 202007 | 14098813 |
40180 | 상계한일유엔아이 | A13920205 | 퇴직급여충당부채 | 202007 | 0 |
5569 | e편한세상마포리버파크 | A10028006 | 장기수선충당예금 | 202007 | 350696170 |
67648 | 목동진도1차 | A15882104 | 관리비예치금 | 202007 | 16926000 |
6018 | 약수하이츠아파트(임대) | A10045402 | 가수금 | 202007 | 6033861 |
47237 | 한일유앤아이 | A14272303 | 주차장충당부채 | 202007 | 0 |
42915 | 상계신동아 | A13982003 | 단기보증금 | 202007 | 1556000 |
38088 | 포스코더샵스타리버 | A13824001 | 선수관리비 | 202007 | 0 |
14283 | 휘경동일스위트리버 | A13009206 | 미수금 | 202007 | 0 |
26498 | 삼성한솔 | A13509004 | 현금 | 202007 | 1496880 |
Row index | 아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 |
---|---|---|---|---|---|
9227 | 마포도화우성 | A12104007 | 비품감가상각누계액 | 202007 | -16447890 |
66888 | 목동12단지 | A15807706 | 공동체활성화단체지원적립금 | 202007 | 16830100 |
44633 | 중계성원1차 | A13986606 | 단기차입금 | 202007 | 0 |
62560 | 마곡엠밸리14단지 | A15721010 | 기타유동부채 | 202007 | 32962277 |
58433 | 벽산타운3단지 | A15384501 | 기타의비유동부채 | 202007 | 0 |
16068 | 묵동금호어울림 | A13114103 | 기타유형자산 | 202007 | 0 |
68578 | 은평뉴타운상림마을제3단지 | A41279908 | 선급비용 | 202007 | 11312040 |
14454 | 래미안장안2차 | A13010005 | 장기수선충당부채 | 202007 | 2270379583 |
8856 | 홍은유원 | A12084302 | 예수금 | 202007 | 1531788 |
27106 | 압구정 현대(10,13,14차) | A13511101 | 비품 | 202007 | 242000 |