Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 488.3 KiB |
Average record size in memory | 50.0 B |
Variable types
Text | 3 |
---|---|
Categorical | 1 |
Numeric | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15821/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-11 06:49:30.459701 |
---|---|
Analysis finished | 2024-05-11 06:49:33.714102 |
Duration | 3.25 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
아파트명
Text
Distinct | 2178 |
---|---|
Distinct (%) | 21.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
아파트 | 216 | 2.0% |
래미안 | 51 | 0.5% |
e편한세상 | 31 | 0.3% |
힐스테이트 | 26 | 0.2% |
아이파크 | 19 | 0.2% |
sk뷰 | 18 | 0.2% |
신도림현대 | 17 | 0.2% |
푸르지오 | 16 | 0.1% |
송파 | 16 | 0.1% |
신반포 | 15 | 0.1% |
Other values (2262) | 10526 |
Most occurring characters
Value | Count | Frequency (%) |
아 | 2694 | 3.7% |
파 | 2574 | 3.5% |
트 | 2488 | 3.4% |
지 | 1734 | 2.4% |
대 | 1691 | 2.3% |
동 | 1640 | 2.2% |
이 | 1443 | 2.0% |
신 | 1405 | 1.9% |
차 | 1392 | 1.9% |
단 | 1335 | 1.8% |
Other values (420) | 54953 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 67261 | |
Decimal Number | 3377 | 4.6% |
Space Separator | 1053 | 1.4% |
Uppercase Letter | 817 | 1.1% |
Lowercase Letter | 334 | 0.5% |
Close Punctuation | 144 | 0.2% |
Open Punctuation | 144 | 0.2% |
Other Punctuation | 109 | 0.1% |
Dash Punctuation | 104 | 0.1% |
Letter Number | 6 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 2694 | 4.0% |
파 | 2574 | 3.8% |
트 | 2488 | 3.7% |
지 | 1734 | 2.6% |
대 | 1691 | 2.5% |
동 | 1640 | 2.4% |
이 | 1443 | 2.1% |
신 | 1405 | 2.1% |
차 | 1392 | 2.1% |
단 | 1335 | 2.0% |
Other values (375) | 48865 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 128 | |
C | 128 | |
K | 99 | |
D | 91 | |
M | 91 | |
L | 43 | 5.3% |
H | 40 | 4.9% |
E | 40 | 4.9% |
I | 36 | 4.4% |
V | 25 | 3.1% |
Other values (7) | 96 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 188 | |
l | 28 | 8.4% |
i | 25 | 7.5% |
k | 20 | 6.0% |
s | 20 | 6.0% |
v | 17 | 5.1% |
c | 14 | 4.2% |
h | 7 | 2.1% |
w | 7 | 2.1% |
g | 4 | 1.2% |
Decimal Number
Value | Count | Frequency (%) |
2 | 1026 | |
1 | 995 | |
3 | 456 | |
4 | 222 | 6.6% |
5 | 186 | 5.5% |
6 | 147 | 4.4% |
7 | 103 | 3.1% |
8 | 103 | 3.1% |
9 | 92 | 2.7% |
0 | 47 | 1.4% |
Other Punctuation
Value | Count | Frequency (%) |
, | 94 | |
. | 15 | 13.8% |
Space Separator
Value | Count | Frequency (%) |
1053 |
Close Punctuation
Value | Count | Frequency (%) |
) | 144 |
Open Punctuation
Value | Count | Frequency (%) |
( | 144 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 104 |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 6 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 67261 | |
Common | 4931 | 6.7% |
Latin | 1157 | 1.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 2694 | 4.0% |
파 | 2574 | 3.8% |
트 | 2488 | 3.7% |
지 | 1734 | 2.6% |
대 | 1691 | 2.5% |
동 | 1640 | 2.4% |
이 | 1443 | 2.1% |
신 | 1405 | 2.1% |
차 | 1392 | 2.1% |
단 | 1335 | 2.0% |
Other values (375) | 48865 |
Latin
Value | Count | Frequency (%) |
e | 188 | |
S | 128 | |
C | 128 | |
K | 99 | 8.6% |
D | 91 | 7.9% |
M | 91 | 7.9% |
L | 43 | 3.7% |
H | 40 | 3.5% |
E | 40 | 3.5% |
I | 36 | 3.1% |
Other values (19) | 273 |
Common
Value | Count | Frequency (%) |
1053 | ||
2 | 1026 | |
1 | 995 | |
3 | 456 | |
4 | 222 | 4.5% |
5 | 186 | 3.8% |
6 | 147 | 3.0% |
) | 144 | 2.9% |
( | 144 | 2.9% |
- | 104 | 2.1% |
Other values (6) | 454 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 67261 | |
ASCII | 6082 | 8.3% |
Number Forms | 6 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
아 | 2694 | 4.0% |
파 | 2574 | 3.8% |
트 | 2488 | 3.7% |
지 | 1734 | 2.6% |
대 | 1691 | 2.5% |
동 | 1640 | 2.4% |
이 | 1443 | 2.1% |
신 | 1405 | 2.1% |
차 | 1392 | 2.1% |
단 | 1335 | 2.0% |
Other values (375) | 48865 |
ASCII
Value | Count | Frequency (%) |
1053 | ||
2 | 1026 | |
1 | 995 | |
3 | 456 | 7.5% |
4 | 222 | 3.7% |
e | 188 | 3.1% |
5 | 186 | 3.1% |
6 | 147 | 2.4% |
) | 144 | 2.4% |
( | 144 | 2.4% |
Other values (34) | 1521 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 6 |
아파트코드
Text
Distinct | 2181 |
---|---|
Distinct (%) | 21.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
a13872504 | 12 | 0.1% |
a15805115 | 12 | 0.1% |
a15083701 | 12 | 0.1% |
a15875101 | 12 | 0.1% |
a15807311 | 12 | 0.1% |
a12013003 | 12 | 0.1% |
a13987306 | 11 | 0.1% |
a12013202 | 11 | 0.1% |
a14003105 | 11 | 0.1% |
a14381407 | 11 | 0.1% |
Other values (2171) | 9884 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 18859 | |
1 | 17457 | |
A | 10000 | |
3 | 9000 | |
2 | 8202 | |
5 | 6186 | 6.9% |
8 | 5499 | 6.1% |
7 | 4585 | 5.1% |
4 | 3937 | 4.4% |
6 | 3392 | 3.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 80000 | |
Uppercase Letter | 10000 | 11.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 18859 | |
1 | 17457 | |
3 | 9000 | |
2 | 8202 | |
5 | 6186 | 7.7% |
8 | 5499 | 6.9% |
7 | 4585 | 5.7% |
4 | 3937 | 4.9% |
6 | 3392 | 4.2% |
9 | 2883 | 3.6% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 80000 | |
Latin | 10000 | 11.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 18859 | |
1 | 17457 | |
3 | 9000 | |
2 | 8202 | |
5 | 6186 | 7.7% |
8 | 5499 | 6.9% |
7 | 4585 | 5.7% |
4 | 3937 | 4.9% |
6 | 3392 | 4.2% |
9 | 2883 | 3.6% |
Latin
Value | Count | Frequency (%) |
A | 10000 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 90000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 18859 | |
1 | 17457 | |
A | 10000 | |
3 | 9000 | |
2 | 8202 | |
5 | 6186 | 6.9% |
8 | 5499 | 6.1% |
7 | 4585 | 5.1% |
4 | 3937 | 4.4% |
6 | 3392 | 3.8% |
비용명
Text
Distinct | 86 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
소독비 | 238 | 2.4% |
교육비 | 227 | 2.3% |
통신비 | 225 | 2.2% |
경비비 | 216 | 2.2% |
세대전기료 | 215 | 2.1% |
청소비 | 214 | 2.1% |
제수당 | 211 | 2.1% |
연체료수익 | 209 | 2.1% |
승강기유지비 | 209 | 2.1% |
사무용품비 | 209 | 2.1% |
Other values (76) | 7827 |
Most occurring characters
Value | Count | Frequency (%) |
비 | 5438 | 11.3% |
수 | 3499 | 7.3% |
료 | 2140 | 4.4% |
익 | 1943 | 4.0% |
용 | 1389 | 2.9% |
기 | 1309 | 2.7% |
대 | 1029 | 2.1% |
리 | 892 | 1.9% |
보 | 831 | 1.7% |
험 | 797 | 1.7% |
Other values (110) | 28855 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 48122 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
비 | 5438 | 11.3% |
수 | 3499 | 7.3% |
료 | 2140 | 4.4% |
익 | 1943 | 4.0% |
용 | 1389 | 2.9% |
기 | 1309 | 2.7% |
대 | 1029 | 2.1% |
리 | 892 | 1.9% |
보 | 831 | 1.7% |
험 | 797 | 1.7% |
Other values (110) | 28855 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 48122 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
비 | 5438 | 11.3% |
수 | 3499 | 7.3% |
료 | 2140 | 4.4% |
익 | 1943 | 4.0% |
용 | 1389 | 2.9% |
기 | 1309 | 2.7% |
대 | 1029 | 2.1% |
리 | 892 | 1.9% |
보 | 831 | 1.7% |
험 | 797 | 1.7% |
Other values (110) | 28855 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 48122 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
비 | 5438 | 11.3% |
수 | 3499 | 7.3% |
료 | 2140 | 4.4% |
익 | 1943 | 4.0% |
용 | 1389 | 2.9% |
기 | 1309 | 2.7% |
대 | 1029 | 2.1% |
리 | 892 | 1.9% |
보 | 831 | 1.7% |
험 | 797 | 1.7% |
Other values (110) | 28855 |
년월일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
202310 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 202310 |
---|---|
2nd row | 202310 |
3rd row | 202310 |
4th row | 202310 |
5th row | 202310 |
Common Values
Value | Count | Frequency (%) |
202310 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
202310 | 10000 |
금액
Real number (ℝ)
ZEROS
 
Distinct | 6852 |
---|---|
Distinct (%) | 68.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3551884.4 |
Minimum | -47373150 |
---|---|
Maximum | 3.65629 × 108 |
Zeros | 1580 |
Zeros (%) | 15.8% |
Negative | 15 |
Negative (%) | 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -47373150 |
---|---|
5-th percentile | 0 |
Q1 | 50000 |
median | 285252.5 |
Q3 | 1416015 |
95-th percentile | 19026952 |
Maximum | 3.65629 × 108 |
Range | 4.1300215 × 108 |
Interquartile range (IQR) | 1366015 |
Descriptive statistics
Standard deviation | 12414757 |
---|---|
Coefficient of variation (CV) | 3.4952594 |
Kurtosis | 193.75661 |
Mean | 3551884.4 |
Median Absolute Deviation (MAD) | 285252.5 |
Skewness | 10.713289 |
Sum | 3.5518844 × 1010 |
Variance | 1.541262 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1580 | 15.8% |
200000 | 74 | 0.7% |
100000 | 58 | 0.6% |
300000 | 57 | 0.6% |
400000 | 42 | 0.4% |
150000 | 39 | 0.4% |
250000 | 38 | 0.4% |
30000 | 34 | 0.3% |
220000 | 32 | 0.3% |
120000 | 31 | 0.3% |
Other values (6842) | 8015 |
Value | Count | Frequency (%) |
-47373150 | 1 | |
-7791178 | 1 | |
-6060606 | 1 | |
-5500000 | 1 | |
-1778310 | 1 | |
-1018000 | 1 | |
-892140 | 1 | |
-401560 | 1 | |
-396666 | 1 | |
-118080 | 1 |
Value | Count | Frequency (%) |
365628998 | 1 | |
299211202 | 1 | |
282779662 | 1 | |
237491905 | 1 | |
225920170 | 1 | |
214890218 | 1 | |
206721922 | 1 | |
182875390 | 1 | |
168082438 | 1 | |
152575066 | 1 |
비용명 | 금액 | |
---|---|---|
비용명 | 1.000 | 0.309 |
금액 | 0.309 | 1.000 |
아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 | |
---|---|---|---|---|---|
35571 | 금호동롯데아파트 | A13309402 | 선거관리위원회운영비 | 202310 | 0 |
16241 | 홍제성원아파트 | A12009201 | 고용보험료 | 202310 | 42360 |
99584 | 목동금호1차 | A15882107 | 주차장수익 | 202310 | 1125200 |
55221 | 한강 | A13790620 | 건강보험료 | 202310 | 744000 |
32599 | 창동상아1차 | A13204507 | 고용보험료 | 202310 | 268980 |
69854 | 한남힐스테이트 | A14077901 | 국민연금 | 202310 | 624010 |
64441 | 공릉대동1차 | A13980801 | 잡수익 | 202310 | 135 |
38343 | 천호태영 | A13402002 | 사무용품비 | 202310 | 204720 |
63346 | 공릉태릉우성 | A13980009 | 재활용품비용 | 202310 | 140000 |
49870 | 장위참누리 | A13614302 | 수선유지비 | 202310 | 1293950 |
아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 | |
---|---|---|---|---|---|
2431 | 자양호반써밋아파트 | A10024132 | 건강보험료 | 202310 | 490500 |
55499 | 강변아파트 | A13790714 | 복리후생비 | 202310 | 324000 |
35358 | 응봉금호현대 | A13308004 | 회계감사비 | 202310 | 110000 |
90594 | 대방현대1차 | A15681106 | 이자수익 | 202310 | 0 |
69337 | 후암미주 | A14019001 | 세대전기료 | 202310 | 5954150 |
32317 | 창동동아청솔 | A13204409 | 수선유지비 | 202310 | 7721930 |
21888 | 신촌금호 | A12188201 | 세대전기료 | 202310 | 12062310 |
7595 | 래미안블레스티지 | A10025675 | 선거관리위원회운영비 | 202310 | 293600 |
82592 | 대상 | A15209303 | 소방안전관리비 | 202310 | 187000 |
83419 | 신도림디큐브시티 | A15277302 | 복리후생비 | 202310 | 143000 |