Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 488.3 KiB |
Average record size in memory | 50.0 B |
Variable types
Text | 3 |
---|---|
Categorical | 1 |
Numeric | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15821/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-11 06:56:48.977942 |
---|---|
Analysis finished | 2024-05-11 06:56:51.308226 |
Duration | 2.33 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
아파트명
Text
Distinct | 2182 |
---|---|
Distinct (%) | 21.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
아파트 | 138 | 1.3% |
래미안 | 27 | 0.3% |
아이파크 | 25 | 0.2% |
신반포 | 19 | 0.2% |
e편한세상 | 15 | 0.1% |
은평뉴타운상림마을6단지 | 14 | 0.1% |
코오롱하늘채아파트 | 13 | 0.1% |
힐스테이트 | 13 | 0.1% |
sk뷰 | 13 | 0.1% |
홍은현대 | 13 | 0.1% |
Other values (2245) | 10304 |
Most occurring characters
Value | Count | Frequency (%) |
아 | 2482 | 3.4% |
파 | 2378 | 3.3% |
트 | 2116 | 2.9% |
대 | 1791 | 2.5% |
지 | 1777 | 2.5% |
동 | 1701 | 2.4% |
차 | 1533 | 2.1% |
신 | 1480 | 2.1% |
단 | 1398 | 1.9% |
이 | 1325 | 1.8% |
Other values (422) | 54124 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 66252 | |
Decimal Number | 3711 | 5.1% |
Uppercase Letter | 673 | 0.9% |
Space Separator | 651 | 0.9% |
Lowercase Letter | 313 | 0.4% |
Open Punctuation | 130 | 0.2% |
Close Punctuation | 130 | 0.2% |
Other Punctuation | 124 | 0.2% |
Dash Punctuation | 113 | 0.2% |
Letter Number | 5 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 2482 | 3.7% |
파 | 2378 | 3.6% |
트 | 2116 | 3.2% |
대 | 1791 | 2.7% |
지 | 1777 | 2.7% |
동 | 1701 | 2.6% |
차 | 1533 | 2.3% |
신 | 1480 | 2.2% |
단 | 1398 | 2.1% |
이 | 1325 | 2.0% |
Other values (376) | 48271 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 119 | |
K | 97 | |
C | 88 | |
D | 53 | |
M | 53 | |
L | 48 | |
I | 36 | 5.3% |
H | 35 | 5.2% |
G | 32 | 4.8% |
E | 29 | 4.3% |
Other values (7) | 83 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 169 | |
i | 28 | 8.9% |
l | 28 | 8.9% |
k | 20 | 6.4% |
v | 17 | 5.4% |
c | 16 | 5.1% |
s | 15 | 4.8% |
w | 11 | 3.5% |
h | 3 | 1.0% |
g | 3 | 1.0% |
Decimal Number
Value | Count | Frequency (%) |
2 | 1111 | |
1 | 1105 | |
3 | 503 | |
4 | 243 | 6.5% |
5 | 209 | 5.6% |
6 | 156 | 4.2% |
9 | 109 | 2.9% |
7 | 97 | 2.6% |
8 | 94 | 2.5% |
0 | 84 | 2.3% |
Other Punctuation
Value | Count | Frequency (%) |
, | 97 | |
. | 27 | 21.8% |
Space Separator
Value | Count | Frequency (%) |
651 |
Open Punctuation
Value | Count | Frequency (%) |
( | 130 |
Close Punctuation
Value | Count | Frequency (%) |
) | 130 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 113 |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 5 |
Math Symbol
Value | Count | Frequency (%) |
~ | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 66252 | |
Common | 4862 | 6.7% |
Latin | 991 | 1.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 2482 | 3.7% |
파 | 2378 | 3.6% |
트 | 2116 | 3.2% |
대 | 1791 | 2.7% |
지 | 1777 | 2.7% |
동 | 1701 | 2.6% |
차 | 1533 | 2.3% |
신 | 1480 | 2.2% |
단 | 1398 | 2.1% |
이 | 1325 | 2.0% |
Other values (376) | 48271 |
Latin
Value | Count | Frequency (%) |
e | 169 | |
S | 119 | |
K | 97 | 9.8% |
C | 88 | 8.9% |
D | 53 | 5.3% |
M | 53 | 5.3% |
L | 48 | 4.8% |
I | 36 | 3.6% |
H | 35 | 3.5% |
G | 32 | 3.2% |
Other values (19) | 261 |
Common
Value | Count | Frequency (%) |
2 | 1111 | |
1 | 1105 | |
651 | ||
3 | 503 | |
4 | 243 | 5.0% |
5 | 209 | 4.3% |
6 | 156 | 3.2% |
( | 130 | 2.7% |
) | 130 | 2.7% |
- | 113 | 2.3% |
Other values (7) | 511 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 66252 | |
ASCII | 5848 | 8.1% |
Number Forms | 5 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
아 | 2482 | 3.7% |
파 | 2378 | 3.6% |
트 | 2116 | 3.2% |
대 | 1791 | 2.7% |
지 | 1777 | 2.7% |
동 | 1701 | 2.6% |
차 | 1533 | 2.3% |
신 | 1480 | 2.2% |
단 | 1398 | 2.1% |
이 | 1325 | 2.0% |
Other values (376) | 48271 |
ASCII
Value | Count | Frequency (%) |
2 | 1111 | |
1 | 1105 | |
651 | ||
3 | 503 | 8.6% |
4 | 243 | 4.2% |
5 | 209 | 3.6% |
e | 169 | 2.9% |
6 | 156 | 2.7% |
( | 130 | 2.2% |
) | 130 | 2.2% |
Other values (35) | 1441 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 5 |
아파트코드
Text
Distinct | 2188 |
---|---|
Distinct (%) | 21.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
a12084504 | 13 | 0.1% |
a13881603 | 12 | 0.1% |
a14206202 | 12 | 0.1% |
a13671208 | 12 | 0.1% |
a12008003 | 12 | 0.1% |
a10027188 | 12 | 0.1% |
a15685702 | 11 | 0.1% |
a13707203 | 11 | 0.1% |
a15792602 | 11 | 0.1% |
a13872502 | 11 | 0.1% |
Other values (2178) | 9883 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 18416 | |
1 | 17603 | |
A | 9989 | |
3 | 8781 | |
2 | 8235 | |
5 | 6348 | 7.1% |
8 | 5713 | 6.3% |
7 | 4843 | 5.4% |
4 | 3710 | 4.1% |
6 | 3443 | 3.8% |
Other values (2) | 2919 | 3.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 80000 | |
Uppercase Letter | 10000 | 11.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 18416 | |
1 | 17603 | |
3 | 8781 | |
2 | 8235 | |
5 | 6348 | 7.9% |
8 | 5713 | 7.1% |
7 | 4843 | 6.1% |
4 | 3710 | 4.6% |
6 | 3443 | 4.3% |
9 | 2908 | 3.6% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 9989 | |
B | 11 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 80000 | |
Latin | 10000 | 11.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 18416 | |
1 | 17603 | |
3 | 8781 | |
2 | 8235 | |
5 | 6348 | 7.9% |
8 | 5713 | 7.1% |
7 | 4843 | 6.1% |
4 | 3710 | 4.6% |
6 | 3443 | 4.3% |
9 | 2908 | 3.6% |
Latin
Value | Count | Frequency (%) |
A | 9989 | |
B | 11 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 90000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 18416 | |
1 | 17603 | |
A | 9989 | |
3 | 8781 | |
2 | 8235 | |
5 | 6348 | 7.1% |
8 | 5713 | 6.3% |
7 | 4843 | 5.4% |
4 | 3710 | 4.1% |
6 | 3443 | 3.8% |
Other values (2) | 2919 | 3.2% |
비용명
Text
Distinct | 86 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
퇴직급여 | 240 | 2.4% |
수선유지비 | 235 | 2.4% |
경비비 | 234 | 2.3% |
세대전기료 | 224 | 2.2% |
장기수선비 | 221 | 2.2% |
입주자대표회의운영비 | 218 | 2.2% |
통신비 | 216 | 2.2% |
청소비 | 214 | 2.1% |
이자수익 | 211 | 2.1% |
소독비 | 211 | 2.1% |
Other values (76) | 7776 |
Most occurring characters
Value | Count | Frequency (%) |
비 | 5414 | 11.1% |
수 | 3570 | 7.3% |
료 | 2055 | 4.2% |
익 | 2012 | 4.1% |
용 | 1781 | 3.7% |
기 | 1355 | 2.8% |
대 | 1052 | 2.2% |
보 | 837 | 1.7% |
험 | 794 | 1.6% |
리 | 761 | 1.6% |
Other values (110) | 29076 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 48707 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
비 | 5414 | 11.1% |
수 | 3570 | 7.3% |
료 | 2055 | 4.2% |
익 | 2012 | 4.1% |
용 | 1781 | 3.7% |
기 | 1355 | 2.8% |
대 | 1052 | 2.2% |
보 | 837 | 1.7% |
험 | 794 | 1.6% |
리 | 761 | 1.6% |
Other values (110) | 29076 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 48707 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
비 | 5414 | 11.1% |
수 | 3570 | 7.3% |
료 | 2055 | 4.2% |
익 | 2012 | 4.1% |
용 | 1781 | 3.7% |
기 | 1355 | 2.8% |
대 | 1052 | 2.2% |
보 | 837 | 1.7% |
험 | 794 | 1.6% |
리 | 761 | 1.6% |
Other values (110) | 29076 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 48707 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
비 | 5414 | 11.1% |
수 | 3570 | 7.3% |
료 | 2055 | 4.2% |
익 | 2012 | 4.1% |
용 | 1781 | 3.7% |
기 | 1355 | 2.8% |
대 | 1052 | 2.2% |
보 | 837 | 1.7% |
험 | 794 | 1.6% |
리 | 761 | 1.6% |
Other values (110) | 29076 |
년월일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
202005 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 202005 |
---|---|
2nd row | 202005 |
3rd row | 202005 |
4th row | 202005 |
5th row | 202005 |
Common Values
Value | Count | Frequency (%) |
202005 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
202005 | 10000 |
금액
Real number (ℝ)
ZEROS
 
Distinct | 6935 |
---|---|
Distinct (%) | 69.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3058755.6 |
Minimum | -6000000 |
---|---|
Maximum | 5.6366563 × 108 |
Zeros | 1285 |
Zeros (%) | 12.8% |
Negative | 6 |
Negative (%) | 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -6000000 |
---|---|
5-th percentile | 0 |
Q1 | 71815 |
median | 330000 |
Q3 | 1490473 |
95-th percentile | 15499905 |
Maximum | 5.6366563 × 108 |
Range | 5.6966563 × 108 |
Interquartile range (IQR) | 1418658 |
Descriptive statistics
Standard deviation | 11473506 |
---|---|
Coefficient of variation (CV) | 3.7510372 |
Kurtosis | 691.9371 |
Mean | 3058755.6 |
Median Absolute Deviation (MAD) | 330000 |
Skewness | 19.019361 |
Sum | 3.0587556 × 1010 |
Variance | 1.3164134 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1285 | 12.8% |
200000 | 95 | 0.9% |
300000 | 62 | 0.6% |
100000 | 53 | 0.5% |
400000 | 43 | 0.4% |
150000 | 34 | 0.3% |
350000 | 33 | 0.3% |
120000 | 30 | 0.3% |
500000 | 27 | 0.3% |
600000 | 25 | 0.2% |
Other values (6925) | 8313 |
Value | Count | Frequency (%) |
-6000000 | 1 | < 0.1% |
-3410000 | 1 | < 0.1% |
-337050 | 1 | < 0.1% |
-173530 | 1 | < 0.1% |
-156000 | 1 | < 0.1% |
-10000 | 1 | < 0.1% |
0 | 1285 | |
5 | 1 | < 0.1% |
6 | 1 | < 0.1% |
8 | 1 | < 0.1% |
Value | Count | Frequency (%) |
563665630 | 1 | |
310456392 | 1 | |
240481126 | 1 | |
227826890 | 1 | |
218609374 | 1 | |
186211770 | 1 | |
160034000 | 1 | |
151082260 | 1 | |
136015380 | 1 | |
117273446 | 1 |
비용명 | 금액 | |
---|---|---|
비용명 | 1.000 | 0.192 |
금액 | 0.192 | 1.000 |
아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 | |
---|---|---|---|---|---|
77127 | 관악드림타운제2 | A15105503 | 광고선전비 | 202005 | 47500 |
81574 | 구로2차순영웰라이빌 | A15284101 | 시설보수비 | 202005 | 0 |
10933 | 독립문극동 | A12008003 | 자치활동비 | 202005 | 0 |
93090 | 염창극동 | A15786111 | 건강보험료 | 202005 | 316190 |
57033 | 풍납동아한가람 | A13887302 | 광고료수익 | 202005 | 0 |
76643 | 건영3차아파트 | A15101903 | 음식물처리비 | 202005 | 1658790 |
8022 | 목동센트럴푸르지오아파트 | A10027849 | 연체료수익 | 202005 | 23620 |
89665 | 마곡금호어울림 | A15721001 | 기타운영비용 | 202005 | 40000 |
26048 | 신내8단지두산화성 | A13187201 | 잡비용 | 202005 | 774280 |
35768 | 강일리버파크10단지 | A13410005 | 국민연금 | 202005 | 483380 |
아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 | |
---|---|---|---|---|---|
33127 | 금호1차푸르지오 | A13380602 | 제수당 | 202005 | 603220 |
13414 | 공덕한화꿈에그린 | A12102002 | 승강기수익 | 202005 | 0 |
9388 | 신당삼성(분양) | A10045403 | 지급수수료 | 202005 | 8800 |
54785 | 잠실우성4차 | A13822902 | 세대난방비 | 202005 | 5709220 |
9954 | 남산롯데캐슬아이리스 | A10088102 | 기타운영비용 | 202005 | 3320250 |
5184 | 래미안강동팰리스 | A10026852 | 승강기수익 | 202005 | 638000 |
80286 | 천왕이펜하우스1단지 | A15213006 | 입주자대표회의운영비 | 202005 | 300000 |
80054 | 오류푸르지오 | A15210209 | 급여 | 202005 | 11163520 |
18163 | 갈현미미 | A12205001 | 소방안전관리비 | 202005 | 220000 |
91136 | 화곡2차보람 | A15770101 | 수선유지비 | 202005 | 1211040 |