Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 488.3 KiB |
Average record size in memory | 50.0 B |
Variable types
Text | 3 |
---|---|
Categorical | 1 |
Numeric | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15821/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-11 06:58:01.782218 |
---|---|
Analysis finished | 2024-05-11 06:58:04.645818 |
Duration | 2.86 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
아파트명
Text
Distinct | 2068 |
---|---|
Distinct (%) | 20.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
아파트 | 111 | 1.1% |
힐스테이트 | 22 | 0.2% |
래미안 | 19 | 0.2% |
장미3차 | 16 | 0.2% |
신도림현대 | 15 | 0.1% |
상도삼호 | 14 | 0.1% |
북한산 | 14 | 0.1% |
이촌강촌 | 14 | 0.1% |
신트리1단지 | 13 | 0.1% |
극동2차아파트입주자대표회의 | 13 | 0.1% |
Other values (2129) | 10291 |
Most occurring characters
Value | Count | Frequency (%) |
아 | 2285 | 3.2% |
파 | 2201 | 3.1% |
트 | 2057 | 2.9% |
대 | 1776 | 2.5% |
지 | 1733 | 2.4% |
동 | 1689 | 2.3% |
차 | 1612 | 2.2% |
신 | 1508 | 2.1% |
단 | 1397 | 1.9% |
성 | 1339 | 1.9% |
Other values (421) | 54355 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 65783 | |
Decimal Number | 3853 | 5.4% |
Uppercase Letter | 769 | 1.1% |
Space Separator | 585 | 0.8% |
Lowercase Letter | 349 | 0.5% |
Open Punctuation | 162 | 0.2% |
Close Punctuation | 162 | 0.2% |
Other Punctuation | 141 | 0.2% |
Dash Punctuation | 139 | 0.2% |
Math Symbol | 5 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 2285 | 3.5% |
파 | 2201 | 3.3% |
트 | 2057 | 3.1% |
대 | 1776 | 2.7% |
지 | 1733 | 2.6% |
동 | 1689 | 2.6% |
차 | 1612 | 2.5% |
신 | 1508 | 2.3% |
단 | 1397 | 2.1% |
성 | 1339 | 2.0% |
Other values (375) | 48186 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 131 | |
K | 97 | |
C | 90 | |
L | 63 | |
D | 59 | |
M | 59 | |
H | 45 | 5.9% |
E | 44 | 5.7% |
I | 41 | 5.3% |
G | 34 | 4.4% |
Other values (7) | 106 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 177 | |
l | 40 | 11.5% |
i | 36 | 10.3% |
v | 24 | 6.9% |
c | 20 | 5.7% |
k | 17 | 4.9% |
s | 9 | 2.6% |
g | 8 | 2.3% |
a | 8 | 2.3% |
w | 8 | 2.3% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1189 | |
2 | 1104 | |
3 | 520 | |
4 | 249 | 6.5% |
5 | 209 | 5.4% |
6 | 172 | 4.5% |
8 | 113 | 2.9% |
7 | 112 | 2.9% |
9 | 103 | 2.7% |
0 | 82 | 2.1% |
Other Punctuation
Value | Count | Frequency (%) |
, | 113 | |
. | 28 | 19.9% |
Space Separator
Value | Count | Frequency (%) |
585 |
Open Punctuation
Value | Count | Frequency (%) |
( | 162 |
Close Punctuation
Value | Count | Frequency (%) |
) | 162 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 139 |
Math Symbol
Value | Count | Frequency (%) |
~ | 5 |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 65783 | |
Common | 5047 | 7.0% |
Latin | 1122 | 1.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 2285 | 3.5% |
파 | 2201 | 3.3% |
트 | 2057 | 3.1% |
대 | 1776 | 2.7% |
지 | 1733 | 2.6% |
동 | 1689 | 2.6% |
차 | 1612 | 2.5% |
신 | 1508 | 2.3% |
단 | 1397 | 2.1% |
성 | 1339 | 2.0% |
Other values (375) | 48186 |
Latin
Value | Count | Frequency (%) |
e | 177 | |
S | 131 | |
K | 97 | 8.6% |
C | 90 | 8.0% |
L | 63 | 5.6% |
D | 59 | 5.3% |
M | 59 | 5.3% |
H | 45 | 4.0% |
E | 44 | 3.9% |
I | 41 | 3.7% |
Other values (19) | 316 |
Common
Value | Count | Frequency (%) |
1 | 1189 | |
2 | 1104 | |
585 | ||
3 | 520 | |
4 | 249 | 4.9% |
5 | 209 | 4.1% |
6 | 172 | 3.4% |
( | 162 | 3.2% |
) | 162 | 3.2% |
- | 139 | 2.8% |
Other values (7) | 556 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 65783 | |
ASCII | 6165 | 8.6% |
Number Forms | 4 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
아 | 2285 | 3.5% |
파 | 2201 | 3.3% |
트 | 2057 | 3.1% |
대 | 1776 | 2.7% |
지 | 1733 | 2.6% |
동 | 1689 | 2.6% |
차 | 1612 | 2.5% |
신 | 1508 | 2.3% |
단 | 1397 | 2.1% |
성 | 1339 | 2.0% |
Other values (375) | 48186 |
ASCII
Value | Count | Frequency (%) |
1 | 1189 | |
2 | 1104 | |
585 | 9.5% | |
3 | 520 | 8.4% |
4 | 249 | 4.0% |
5 | 209 | 3.4% |
e | 177 | 2.9% |
6 | 172 | 2.8% |
( | 162 | 2.6% |
) | 162 | 2.6% |
Other values (35) | 1636 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 4 |
아파트코드
Text
Distinct | 2073 |
---|---|
Distinct (%) | 20.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
a13872504 | 16 | 0.2% |
a15678102 | 14 | 0.1% |
a14003106 | 14 | 0.1% |
a15083701 | 13 | 0.1% |
a13010003 | 13 | 0.1% |
a14380414 | 13 | 0.1% |
a15807002 | 13 | 0.1% |
a13527011 | 12 | 0.1% |
a12201301 | 12 | 0.1% |
a10045001 | 12 | 0.1% |
Other values (2063) | 9868 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 18477 | |
1 | 17543 | |
A | 10000 | |
3 | 8751 | |
2 | 8049 | |
5 | 6434 | 7.1% |
8 | 5822 | 6.5% |
7 | 4870 | 5.4% |
4 | 3687 | 4.1% |
6 | 3499 | 3.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 80000 | |
Uppercase Letter | 10000 | 11.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 18477 | |
1 | 17543 | |
3 | 8751 | |
2 | 8049 | |
5 | 6434 | 8.0% |
8 | 5822 | 7.3% |
7 | 4870 | 6.1% |
4 | 3687 | 4.6% |
6 | 3499 | 4.4% |
9 | 2868 | 3.6% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 80000 | |
Latin | 10000 | 11.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 18477 | |
1 | 17543 | |
3 | 8751 | |
2 | 8049 | |
5 | 6434 | 8.0% |
8 | 5822 | 7.3% |
7 | 4870 | 6.1% |
4 | 3687 | 4.6% |
6 | 3499 | 4.4% |
9 | 2868 | 3.6% |
Latin
Value | Count | Frequency (%) |
A | 10000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 90000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 18477 | |
1 | 17543 | |
A | 10000 | |
3 | 8751 | |
2 | 8049 | |
5 | 6434 | 7.1% |
8 | 5822 | 6.5% |
7 | 4870 | 5.4% |
4 | 3687 | 4.1% |
6 | 3499 | 3.9% |
비용명
Text
Distinct | 87 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
급여 | 222 | 2.2% |
교육비 | 221 | 2.2% |
장기수선비 | 218 | 2.2% |
수선유지비 | 215 | 2.1% |
보험료 | 213 | 2.1% |
도서인쇄비 | 210 | 2.1% |
승강기유지비 | 210 | 2.1% |
소독비 | 209 | 2.1% |
사무용품비 | 208 | 2.1% |
청소비 | 203 | 2.0% |
Other values (77) | 7871 |
Most occurring characters
Value | Count | Frequency (%) |
비 | 5456 | 11.2% |
수 | 3571 | 7.3% |
료 | 2051 | 4.2% |
익 | 1977 | 4.1% |
용 | 1755 | 3.6% |
기 | 1307 | 2.7% |
대 | 1006 | 2.1% |
보 | 815 | 1.7% |
리 | 808 | 1.7% |
지 | 780 | 1.6% |
Other values (110) | 29280 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 48806 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
비 | 5456 | 11.2% |
수 | 3571 | 7.3% |
료 | 2051 | 4.2% |
익 | 1977 | 4.1% |
용 | 1755 | 3.6% |
기 | 1307 | 2.7% |
대 | 1006 | 2.1% |
보 | 815 | 1.7% |
리 | 808 | 1.7% |
지 | 780 | 1.6% |
Other values (110) | 29280 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 48806 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
비 | 5456 | 11.2% |
수 | 3571 | 7.3% |
료 | 2051 | 4.2% |
익 | 1977 | 4.1% |
용 | 1755 | 3.6% |
기 | 1307 | 2.7% |
대 | 1006 | 2.1% |
보 | 815 | 1.7% |
리 | 808 | 1.7% |
지 | 780 | 1.6% |
Other values (110) | 29280 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 48806 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
비 | 5456 | 11.2% |
수 | 3571 | 7.3% |
료 | 2051 | 4.2% |
익 | 1977 | 4.1% |
용 | 1755 | 3.6% |
기 | 1307 | 2.7% |
대 | 1006 | 2.1% |
보 | 815 | 1.7% |
리 | 808 | 1.7% |
지 | 780 | 1.6% |
Other values (110) | 29280 |
년월일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
201912 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 201912 |
---|---|
2nd row | 201912 |
3rd row | 201912 |
4th row | 201912 |
5th row | 201912 |
Common Values
Value | Count | Frequency (%) |
201912 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
201912 | 10000 |
금액
Real number (ℝ)
ZEROS
 
Distinct | 6892 |
---|---|
Distinct (%) | 68.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3371301.5 |
Minimum | -8119350 |
---|---|
Maximum | 3.865093 × 108 |
Zeros | 1426 |
Zeros (%) | 14.3% |
Negative | 19 |
Negative (%) | 0.2% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -8119350 |
---|---|
5-th percentile | 0 |
Q1 | 59965 |
median | 300000 |
Q3 | 1388875 |
95-th percentile | 16590569 |
Maximum | 3.865093 × 108 |
Range | 3.9462865 × 108 |
Interquartile range (IQR) | 1328910 |
Descriptive statistics
Standard deviation | 12103694 |
---|---|
Coefficient of variation (CV) | 3.5902142 |
Kurtosis | 204.81442 |
Mean | 3371301.5 |
Median Absolute Deviation (MAD) | 300000 |
Skewness | 10.955221 |
Sum | 3.3713015 × 1010 |
Variance | 1.4649942 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1426 | 14.3% |
200000 | 88 | 0.9% |
300000 | 68 | 0.7% |
150000 | 54 | 0.5% |
100000 | 54 | 0.5% |
400000 | 42 | 0.4% |
500000 | 32 | 0.3% |
350000 | 23 | 0.2% |
220000 | 23 | 0.2% |
450000 | 21 | 0.2% |
Other values (6882) | 8169 |
Value | Count | Frequency (%) |
-8119350 | 1 | |
-2343150 | 1 | |
-1855020 | 1 | |
-1199600 | 1 | |
-854640 | 1 | |
-786000 | 1 | |
-703710 | 1 | |
-700000 | 1 | |
-681528 | 1 | |
-312500 | 1 |
Value | Count | Frequency (%) |
386509300 | 1 | |
273862098 | 1 | |
235573430 | 1 | |
220791300 | 1 | |
219831620 | 1 | |
218731760 | 1 | |
206397300 | 1 | |
173615820 | 1 | |
158527760 | 1 | |
151255323 | 1 |
비용명 | 금액 | |
---|---|---|
비용명 | 1.000 | 0.527 |
금액 | 0.527 | 1.000 |
아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 | |
---|---|---|---|---|---|
22195 | 전농삼성 | A13085301 | 임대료수익 | 201912 | 1200000 |
41272 | 역삼개나리푸르지오 | A13579501 | 잡비용 | 201912 | 2419000 |
25048 | 신내건영2차아파트 | A13185607 | 회계감사비 | 201912 | 187500 |
70064 | 광장힐스테이트 | A14375301 | 수선유지비 | 201912 | 3722460 |
96486 | 목동14단지 | A15807606 | 소모품비 | 201912 | 288540 |
37768 | 삼성현대 | A13509001 | 건강보험료 | 201912 | 264250 |
55799 | 가락현대6차 | A13880201 | 건강보험료 | 201912 | 418080 |
64170 | 중계우성3차 | A13986201 | 선거관리위원회운영비 | 201912 | 0 |
9486 | 서대문천연뜨란채아파트 | A12004001 | 급여 | 201912 | 20962830 |
42356 | 수서동익 | A13588601 | 보험료 | 201912 | 280480 |
아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 | |
---|---|---|---|---|---|
9581 | 독립문삼호 | A12007001 | 기타사용료 | 201912 | 4173000 |
17701 | 신사현대2차 | A12208105 | 건강보험료 | 201912 | 314510 |
2695 | 롯데캐슬노블레스 | A10026180 | 임대료수익 | 201912 | 0 |
47899 | 정릉중앙하이츠빌1단지 | A13685104 | 급여 | 201912 | 11756110 |
14631 | 상암휴먼시아1단지 | A12179502 | 세금과공과 | 201912 | 0 |
57905 | 우림루미아트1.2단지 | A13920104 | 입주자대표회의운영비 | 201912 | 500000 |
88046 | 대방2차현대 | A15681104 | 급여 | 201912 | 8223600 |
6913 | 마포한강푸르지오 | A10027902 | 도서인쇄비 | 201912 | 205000 |
35110 | 강일리버파크2단지 | A13410003 | 검침수익 | 201912 | 190060 |
12537 | 마포도화우성 | A12104007 | 검침수익 | 201912 | 2480660 |