Dataset statistics
Statistic | Value |
---|---|
Number of variables | 5 |
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 488.3 KiB |
Average record size in memory | 50.0 B |
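The average record size can be cross-checked from the total memory size and the number of observations. A minimal sketch, assuming the report uses binary prefixes (1 KiB = 1024 bytes); the variable names are illustrative only:

```python
# Figures copied from the dataset statistics table.
total_size_kib = 488.3
n_observations = 10_000

# Assumption: 1 KiB = 1024 bytes (binary prefix).
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")
```

The result rounds to the 50.0 B shown in the table.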
Variable types
Type | Count |
---|---|
Text | 3 |
Categorical | 1 |
Numeric | 1 |
Dataset
Field | Value |
---|---|
Description | File download |
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15821/S/1/datasetView.do |
Reproduction
Field | Value |
---|---|
Analysis started | 2024-05-11 06:50:04.130102 |
Analysis finished | 2024-05-11 06:50:06.117101 |
Duration | 1.99 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
아파트명 (apartment name)
Text
Statistic | Value |
---|---|
Distinct | 2203 |
Distinct (%) | 22.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Statistic | Value |
---|---|
Max length | 28 |
Median length | 20 |
Mean length | 7.3867 |
Min length | 2 |
Characters and Unicode
Statistic | Value |
---|---|
Total characters | 73867 |
Distinct characters | 437 |
Distinct categories | 10 |
Distinct scripts | 3 |
Distinct blocks | 3 |
Unique
Statistic | Value |
---|---|
Unique | 126 |
Unique (%) | 1.3% |
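The report separates "Distinct" (how many different values occur) from "Unique" (how many values occur exactly once). A small sketch of that distinction, using a hypothetical toy column rather than the actual data; `distinct_and_unique` is an illustrative helper, not part of ydata-profiling:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique): different values, and values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Hypothetical toy column, not taken from the dataset.
names = ["A", "B", "A", "C", "B", "D"]
print(distinct_and_unique(names))
```

Here four different values occur, but only two of them ("C" and "D") appear a single time.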
Sample
Row | Value |
---|---|
1st row | 신도림대림5차e-편한세상 |
2nd row | 도봉동아에코빌 |
3rd row | 창동주공2단지 |
4th row | 이촌동부센트레빌 |
5th row | 남서울힐스테이트 |
Value | Count | Frequency (%) |
아파트 | 195 | 1.8% |
래미안 | 34 | 0.3% |
북한산 | 20 | 0.2% |
고덕 | 20 | 0.2% |
아이파크 | 20 | 0.2% |
e편한세상 | 19 | 0.2% |
송파 | 17 | 0.2% |
sk뷰 | 16 | 0.1% |
장미3차 | 15 | 0.1% |
신반포 | 15 | 0.1% |
Other values (2286) | 10495 |
Most occurring characters
Value | Count | Frequency (%) |
아 | 2617 | 3.5% |
파 | 2548 | 3.4% |
트 | 2346 | 3.2% |
지 | 1886 | 2.6% |
대 | 1788 | 2.4% |
동 | 1697 | 2.3% |
단 | 1522 | 2.1% |
차 | 1490 | 2.0% |
이 | 1463 | 2.0% |
신 | 1461 | 2.0% |
Other values (427) | 55049 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 67566 | 91.5% |
Decimal Number | 3803 | 5.1% |
Space Separator | 942 | 1.3% |
Uppercase Letter | 754 | 1.0% |
Lowercase Letter | 291 | 0.4% |
Close Punctuation | 139 | 0.2% |
Open Punctuation | 139 | 0.2% |
Dash Punctuation | 120 | 0.2% |
Other Punctuation | 108 | 0.1% |
Letter Number | 5 | < 0.1% |
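The category names in this table correspond to Unicode general categories. A minimal sketch of how such counts could be reproduced with Python's standard `unicodedata` module, applied to three of the sample values shown earlier; the `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative assumptions, not the profiler's own code:

```python
import unicodedata
from collections import Counter

# Long names as they appear in the report, keyed by the two-letter
# general category codes that unicodedata.category() returns.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Pe": "Close Punctuation",
    "Ps": "Open Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Nl": "Letter Number",
}

def category_counts(strings):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in strings:
        # NFC keeps each Hangul syllable as a single code point.
        for ch in unicodedata.normalize("NFC", text):
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

samples = ["신도림대림5차e-편한세상", "도봉동아에코빌", "창동주공2단지"]
counts = category_counts(samples)
for name, count in counts.most_common():
    print(name, count)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the column.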
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 2617 | 3.9% |
파 | 2548 | 3.8% |
트 | 2346 | 3.5% |
지 | 1886 | 2.8% |
대 | 1788 | 2.6% |
동 | 1697 | 2.5% |
단 | 1522 | 2.3% |
차 | 1490 | 2.2% |
이 | 1463 | 2.2% |
신 | 1461 | 2.2% |
Other values (382) | 48748 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 114 | 15.1% |
S | 113 | 15.0% |
K | 82 | 10.9% |
D | 73 | 9.7% |
M | 73 | 9.7% |
L | 69 | 9.2% |
H | 54 | 7.2% |
I | 40 | 5.3% |
G | 30 | 4.0% |
E | 26 | 3.4% |
Other values (7) | 80 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 163 | 56.0% |
l | 32 | 11.0% |
i | 24 | 8.2% |
v | 19 | 6.5% |
s | 17 | 5.8% |
k | 16 | 5.5% |
c | 6 | 2.1% |
w | 6 | 2.1% |
h | 4 | 1.4% |
a | 2 | 0.7% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1108 | 29.1% |
2 | 1099 | 28.9% |
3 | 507 | 13.3% |
4 | 303 | 8.0% |
5 | 228 | 6.0% |
6 | 168 | 4.4% |
7 | 139 | 3.7% |
9 | 92 | 2.4% |
0 | 81 | 2.1% |
8 | 78 | 2.1% |
Other Punctuation
Value | Count | Frequency (%) |
, | 92 | 85.2% |
. | 16 | 14.8% |
Space Separator
Value | Count | Frequency (%) |
(space) | 942 |
Close Punctuation
Value | Count | Frequency (%) |
) | 139 |
Open Punctuation
Value | Count | Frequency (%) |
( | 139 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 120 |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 5 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 67566 | 91.5% |
Common | 5251 | 7.1% |
Latin | 1050 | 1.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 2617 | 3.9% |
파 | 2548 | 3.8% |
트 | 2346 | 3.5% |
지 | 1886 | 2.8% |
대 | 1788 | 2.6% |
동 | 1697 | 2.5% |
단 | 1522 | 2.3% |
차 | 1490 | 2.2% |
이 | 1463 | 2.2% |
신 | 1461 | 2.2% |
Other values (382) | 48748 |
Latin
Value | Count | Frequency (%) |
e | 163 | 15.5% |
C | 114 | 10.9% |
S | 113 | 10.8% |
K | 82 | 7.8% |
D | 73 | 7.0% |
M | 73 | 7.0% |
L | 69 | 6.6% |
H | 54 | 5.1% |
I | 40 | 3.8% |
l | 32 | 3.0% |
Other values (19) | 237 |
Common
Value | Count | Frequency (%) |
1 | 1108 | 21.1% |
2 | 1099 | 20.9% |
(space) | 942 | 17.9% |
3 | 507 | 9.7% |
4 | 303 | 5.8% |
5 | 228 | 4.3% |
6 | 168 | 3.2% |
7 | 139 | 2.6% |
) | 139 | 2.6% |
( | 139 | 2.6% |
Other values (6) | 479 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 67566 | 91.5% |
ASCII | 6296 | 8.5% |
Number Forms | 5 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
아 | 2617 | 3.9% |
파 | 2548 | 3.8% |
트 | 2346 | 3.5% |
지 | 1886 | 2.8% |
대 | 1788 | 2.6% |
동 | 1697 | 2.5% |
단 | 1522 | 2.3% |
차 | 1490 | 2.2% |
이 | 1463 | 2.2% |
신 | 1461 | 2.2% |
Other values (382) | 48748 |
ASCII
Value | Count | Frequency (%) |
1 | 1108 | 17.6% |
2 | 1099 | 17.5% |
(space) | 942 | 15.0% |
3 | 507 | 8.1% |
4 | 303 | 4.8% |
5 | 228 | 3.6% |
6 | 168 | 2.7% |
e | 163 | 2.6% |
7 | 139 | 2.2% |
) | 139 | 2.2% |
Other values (34) | 1500 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 5 |
아파트코드 (apartment code)
Text
Statistic | Value |
---|---|
Distinct | 2210 |
Distinct (%) | 22.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
a13872504 | 15 | 0.1% |
a15086601 | 14 | 0.1% |
a12127006 | 13 | 0.1% |
a13985605 | 12 | 0.1% |
a13550502 | 12 | 0.1% |
a13204104 | 12 | 0.1% |
a15703204 | 11 | 0.1% |
a15786321 | 11 | 0.1% |
a15101508 | 11 | 0.1% |
a13879102 | 11 | 0.1% |
Other values (2200) | 9878 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 18504 | 20.6% |
1 | 17511 | 19.5% |
A | 9994 | 11.1% |
3 | 8658 | 9.6% |
2 | 8334 | 9.3% |
5 | 6310 | 7.0% |
8 | 5573 | 6.2% |
7 | 4687 | 5.2% |
4 | 3977 | 4.4% |
6 | 3449 | 3.8% |
Other values (2) | 3003 | 3.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 80000 | 88.9% |
Uppercase Letter | 10000 | 11.1% |
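With 10000 observations, 10000 uppercase letters and 80000 digits are consistent with every code being one uppercase letter followed by eight digits, as in the sample rows (for example `A15288805`). The report does not state this format explicitly, so the pattern below is an assumption, and `is_valid_code` is a hypothetical helper for illustration:

```python
import re

# Assumption: one uppercase Latin letter followed by eight ASCII digits.
CODE_PATTERN = re.compile(r"[A-Z][0-9]{8}")

def is_valid_code(code: str) -> bool:
    """Return True if the whole string matches the assumed code format."""
    return CODE_PATTERN.fullmatch(code) is not None

# Two codes from the sample rows, plus two deliberately malformed strings.
codes = ["A15288805", "A13201206", "a13872504", "A1234"]
print([is_valid_code(c) for c in codes])
```

A check like this could flag malformed codes before profiling; the lowercase variant is rejected because the pattern is case-sensitive.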
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 18504 | 23.1% |
1 | 17511 | 21.9% |
3 | 8658 | 10.8% |
2 | 8334 | 10.4% |
5 | 6310 | 7.9% |
8 | 5573 | 7.0% |
7 | 4687 | 5.9% |
4 | 3977 | 5.0% |
6 | 3449 | 4.3% |
9 | 2997 | 3.7% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 9994 | 99.9% |
B | 6 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 80000 | 88.9% |
Latin | 10000 | 11.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 18504 | 23.1% |
1 | 17511 | 21.9% |
3 | 8658 | 10.8% |
2 | 8334 | 10.4% |
5 | 6310 | 7.9% |
8 | 5573 | 7.0% |
7 | 4687 | 5.9% |
4 | 3977 | 5.0% |
6 | 3449 | 4.3% |
9 | 2997 | 3.7% |
Latin
Value | Count | Frequency (%) |
A | 9994 | 99.9% |
B | 6 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 90000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 18504 | 20.6% |
1 | 17511 | 19.5% |
A | 9994 | 11.1% |
3 | 8658 | 9.6% |
2 | 8334 | 9.3% |
5 | 6310 | 7.0% |
8 | 5573 | 6.2% |
7 | 4687 | 5.2% |
4 | 3977 | 4.4% |
6 | 3449 | 3.8% |
Other values (2) | 3003 | 3.3% |
비용명 (cost item name)
Text
Statistic | Value |
---|---|
Distinct | 87 |
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
퇴직급여 | 255 | 2.5% |
청소비 | 254 | 2.5% |
급여 | 247 | 2.5% |
소독비 | 245 | 2.5% |
세대전기료 | 242 | 2.4% |
수선유지비 | 242 | 2.4% |
사무용품비 | 242 | 2.4% |
연체료수익 | 242 | 2.4% |
통신비 | 242 | 2.4% |
산재보험료 | 233 | 2.3% |
Other values (77) | 7556 |
Most occurring characters
Value | Count | Frequency (%) |
비 | 5267 | 11.0% |
수 | 3597 | 7.5% |
료 | 2316 | 4.8% |
익 | 1875 | 3.9% |
용 | 1706 | 3.5% |
기 | 1335 | 2.8% |
대 | 1170 | 2.4% |
보 | 943 | 2.0% |
험 | 894 | 1.9% |
리 | 857 | 1.8% |
Other values (110) | 28125 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 48085 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
비 | 5267 | 11.0% |
수 | 3597 | 7.5% |
료 | 2316 | 4.8% |
익 | 1875 | 3.9% |
용 | 1706 | 3.5% |
기 | 1335 | 2.8% |
대 | 1170 | 2.4% |
보 | 943 | 2.0% |
험 | 894 | 1.9% |
리 | 857 | 1.8% |
Other values (110) | 28125 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 48085 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
비 | 5267 | 11.0% |
수 | 3597 | 7.5% |
료 | 2316 | 4.8% |
익 | 1875 | 3.9% |
용 | 1706 | 3.5% |
기 | 1335 | 2.8% |
대 | 1170 | 2.4% |
보 | 943 | 2.0% |
험 | 894 | 1.9% |
리 | 857 | 1.8% |
Other values (110) | 28125 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 48085 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
비 | 5267 | 11.0% |
수 | 3597 | 7.5% |
료 | 2316 | 4.8% |
익 | 1875 | 3.9% |
용 | 1706 | 3.5% |
기 | 1335 | 2.8% |
대 | 1170 | 2.4% |
보 | 943 | 2.0% |
험 | 894 | 1.9% |
리 | 857 | 1.8% |
Other values (110) | 28125 |
년월일 (date)
Categorical
CONSTANT
Statistic | Value |
---|---|
Distinct | 1 |
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Statistic | Value |
---|---|
Max length | 6 |
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Statistic | Value |
---|---|
Unique | 0 |
Unique (%) | 0.0% |
Sample
Row | Value |
---|---|
1st row | 202201 |
2nd row | 202201 |
3rd row | 202201 |
4th row | 202201 |
5th row | 202201 |
Common Values
Value | Count | Frequency (%) |
202201 | 10000 |
금액 (amount)
Real number (ℝ)
ZEROS
Statistic | Value |
---|---|
Distinct | 7769 |
Distinct (%) | 77.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4426582.4 |
Minimum | -2923446 |
Maximum | 6.619606 × 10⁸ |
Zeros | 194 |
Zeros (%) | 1.9% |
Negative | 3 |
Negative (%) | < 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Statistic | Value |
---|---|
Minimum | -2923446 |
5-th percentile | 5000 |
Q1 | 130410 |
median | 400000 |
Q3 | 1776527.5 |
95-th percentile | 20037743 |
Maximum | 6.619606 × 10⁸ |
Range | 6.6488405 × 10⁸ |
Interquartile range (IQR) | 1646117.5 |
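The range and interquartile range follow directly from the quantiles listed above. A short sketch that reproduces them, using the exact maximum from the extreme-values listing (661960600). The upper outlier fence at the end is one common rule of thumb (Q3 + 1.5 × IQR) added for illustration; it is not part of the report:

```python
# Quantile figures copied from the report.
minimum = -2_923_446
maximum = 661_960_600
q1 = 130_410
q3 = 1_776_527.5

value_range = maximum - minimum   # Range = Maximum - Minimum
iqr = q3 - q1                     # IQR = Q3 - Q1

# Illustration only: Tukey's upper fence, not reported by the profiler.
upper_fence = q3 + 1.5 * iqr

print(f"range = {value_range:.7e}")
print(f"IQR = {iqr}")
print(f"upper fence = {upper_fence}")
```

The 95th percentile (20037743) lies well above this fence, in line with the strong right skew reported for the column.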
Descriptive statistics
Statistic | Value |
---|---|
Standard deviation | 18394193 |
Coefficient of variation (CV) | 4.1553937 |
Kurtosis | 392.06827 |
Mean | 4426582.4 |
Median Absolute Deviation (MAD) | 346770 |
Skewness | 15.466789 |
Sum | 4.4265824 × 10¹⁰ |
Variance | 3.3834632 × 10¹⁴ |
Monotonicity | Not monotonic |
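Several of the descriptive statistics are tied together by simple identities: CV = standard deviation / mean, sum = mean × n, and variance = standard deviation². A quick consistency check using the rounded figures from the report, so only approximate agreement is expected:

```python
import math

# Rounded figures copied from the report.
n = 10_000
mean = 4_426_582.4
std = 18_394_193

cv = std / mean        # coefficient of variation
total = mean * n       # sum of all values
variance = std ** 2    # variance as the square of the standard deviation

print(f"CV = {cv:.7f}")
print(f"sum = {total:.7e}")
print(f"variance = {variance:.7e}")

# Compare with the reported values, allowing for rounding in the inputs.
print(math.isclose(cv, 4.1553937, rel_tol=1e-6))
print(math.isclose(total, 4.4265824e10, rel_tol=1e-6))
print(math.isclose(variance, 3.3834632e14, rel_tol=1e-6))
```

Small differences in the last digit come from the rounded standard deviation, not from any inconsistency in the report.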
Common values
Value | Count | Frequency (%) |
0 | 194 | 1.9% |
200000 | 105 | 1.1% |
100000 | 72 | 0.7% |
300000 | 70 | 0.7% |
150000 | 58 | 0.6% |
400000 | 49 | 0.5% |
250000 | 41 | 0.4% |
50000 | 39 | 0.4% |
500000 | 31 | 0.3% |
110000 | 28 | 0.3% |
Other values (7759) | 9313 |
Minimum 10 values
Value | Count | Frequency (%) |
-2923446 | 1 | < 0.1% |
-261800 | 1 | < 0.1% |
-63 | 1 | < 0.1% |
0 | 194 | 1.9% |
2 | 4 | < 0.1% |
3 | 3 | < 0.1% |
4 | 1 | < 0.1% |
6 | 2 | < 0.1% |
7 | 2 | < 0.1% |
8 | 2 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
661960600 | 1 | < 0.1% |
627197177 | 1 | < 0.1% |
439250910 | 1 | < 0.1% |
361395538 | 1 | < 0.1% |
357260398 | 1 | < 0.1% |
270274270 | 1 | < 0.1% |
255220298 | 1 | < 0.1% |
253187900 | 1 | < 0.1% |
249538280 | 1 | < 0.1% |
246923970 | 1 | < 0.1% |
Correlations
Variable | 비용명 | 금액 |
---|---|---|
비용명 | 1.000 | 0.398 |
금액 | 0.398 | 1.000 |
First rows
Index | 아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 |
---|---|---|---|---|---|
75628 | 신도림대림5차e-편한세상 | A15288805 | 감가상각비 | 202201 | 235210 |
26216 | 도봉동아에코빌 | A13201206 | 기타부대비 | 202201 | 323050 |
27922 | 창동주공2단지 | A13204508 | 청소비 | 202201 | 6072400 |
60755 | 이촌동부센트레빌 | A14003004 | 퇴직급여 | 202201 | 822960 |
76266 | 남서울힐스테이트 | A15370103 | 산재보험료 | 202201 | 329000 |
60751 | 이촌동부센트레빌 | A14003004 | 연차수당 | 202201 | 318540 |
14024 | DMC센트레빌 | A12072801 | 입주자대표회의운영비 | 202201 | 700000 |
45087 | 돈암현대 | A13681304 | 기타운영수익 | 202201 | 257400 |
65500 | 자양우성3차 | A14386110 | 세대전기료 | 202201 | 22937460 |
28221 | 방학청구 | A13276415 | 정화조관리비 | 202201 | 785480 |
Last rows
Index | 아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 |
---|---|---|---|---|---|
13016 | 홍은풍림2차 | A12010103 | 승강기유지비 | 202201 | 380000 |
87621 | 목동6단지 | A15875103 | 세대난방비 | 202201 | 177656190 |
69759 | 롯데캐슬아이비 | A15088915 | 소모품비 | 202201 | 1236350 |
65634 | 자양7차현대홈타운 | A14388204 | 소모품비 | 202201 | 53000 |
750 | 서초그랑자이 | A10024240 | 세대난방비 | 202201 | 107597360 |
52374 | 가락프라자 | A13881204 | 연차수당 | 202201 | 0 |
89288 | 은평뉴타운마고정3단지 | A41279912 | 잡비용 | 202201 | 551814 |
4366 | 래미안솔베뉴 | A10025415 | 시설보수비 | 202201 | 85400 |
24534 | 면목삼익 | A13183502 | 피복비 | 202201 | 40010 |
56508 | 화랑해링턴플레이스 | A13980413 | 제수당 | 202201 | 887620 |