Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 488.3 KiB |
Average record size in memory | 50.0 B |
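The average record size can be checked against the two figures above it. A minimal sketch, assuming the report uses binary units (1 KiB = 1024 B); the variable names are illustrative:

```python
# Figures copied from the "Dataset statistics" table.
total_kib = 488.3        # total size in memory, in KiB
n_observations = 10_000  # number of rows

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```

Under that assumption the result rounds to the 50.0 B shown in the table.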
Variable types
Text | 2 |
---|---|
Categorical | 2 |
Numeric | 1 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-15822/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-11 05:47:31.848465 |
---|---|
Analysis finished | 2024-05-11 05:47:32.905863 |
Duration | 1.06 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
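The reported duration is the difference between the two timestamps. A short sketch using only the standard library (the format string is an assumption that matches the timestamps as printed above):

```python
from datetime import datetime

# Timestamp layout as printed in the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-05-11 05:47:31.848465", FMT)
finished = datetime.strptime("2024-05-11 05:47:32.905863", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```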
아파트명 (apartment name)
Text
Distinct | 2113 |
---|---|
Distinct (%) | 21.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 22 |
---|---|
Median length | 20 |
Mean length | 7.1724 |
Min length | 2 |
Characters and Unicode
Total characters | 71724 |
---|---|
Distinct characters | 432 |
Distinct categories | 11 |
Distinct scripts | 3 |
Distinct blocks | 3 |
Unique
Unique | 107 |
---|---|
Unique (%) | 1.1% |
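"Distinct" and "Unique" are easy to confuse. The sketch below assumes that Distinct counts the different values and Unique counts the values that occur exactly once, which is consistent with the figures in this report. The `names` list is a made-up illustration, not data from the profiled file:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique

# Hypothetical column: two names repeat, two appear once.
names = ["래미안", "래미안", "힐스테이트", "현대한강", "현대한강", "우이대우"]
n_distinct, n_unique = distinct_and_unique(names)

# Percentages in the report are relative to the number of observations.
distinct_pct = f"{2113 / 10_000:.1%}"
unique_pct = f"{107 / 10_000:.1%}"
```

With the report's own counts, 2113 and 107 out of 10000 observations give the 21.1% and 1.1% shown above.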
Sample
1st row | 서빙고금호베스트빌 |
---|---|
2nd row | 신내5단지대림두산 |
3rd row | 동부(돌타운)아파트 |
4th row | 광장현대8단지 |
5th row | 창전현대홈타운 |
Value | Count | Frequency (%) |
아파트 | 112 | 1.1% |
래미안 | 20 | 0.2% |
여의도진주 | 17 | 0.2% |
고덕현대 | 16 | 0.2% |
신도림현대 | 16 | 0.2% |
신동아파밀리에 | 14 | 0.1% |
은평뉴타운상림마을6단지 | 13 | 0.1% |
힐스테이트 | 13 | 0.1% |
입주자대표회의 | 13 | 0.1% |
월드컵참누리 | 13 | 0.1% |
Other values (2169) | 10263 | 97.6% |
Most occurring characters
Value | Count | Frequency (%) |
아 | 2265 | 3.2% |
파 | 2168 | 3.0% |
대 | 1949 | 2.7% |
트 | 1931 | 2.7% |
지 | 1852 | 2.6% |
동 | 1701 | 2.4% |
차 | 1592 | 2.2% |
단 | 1522 | 2.1% |
신 | 1516 | 2.1% |
성 | 1355 | 1.9% |
Other values (422) | 53873 | 75.1% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 65695 | 91.6% |
Decimal Number | 3994 | 5.6% |
Uppercase Letter | 600 | 0.8% |
Space Separator | 553 | 0.8% |
Lowercase Letter | 320 | 0.4% |
Dash Punctuation | 148 | 0.2% |
Close Punctuation | 140 | 0.2% |
Open Punctuation | 140 | 0.2% |
Other Punctuation | 127 | 0.2% |
Letter Number | 4 | < 0.1% |
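The category counts above are Unicode general categories. A minimal sketch of how such a tally can be reproduced with Python's standard `unicodedata` module, applied to the five sample rows shown earlier. The report's own implementation may differ, and the `LONG_NAMES` mapping is an illustrative subset:

```python
import unicodedata
from collections import Counter

# The five sample values listed under "Sample".
samples = [
    "서빙고금호베스트빌",
    "신내5단지대림두산",
    "동부(돌타운)아파트",
    "광장현대8단지",
    "창전현대홈타운",
]

# Two-letter general category codes mapped to the long names used in the report.
LONG_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

# Count the general category of every character across all samples.
category_counts = Counter(
    unicodedata.category(ch) for name in samples for ch in name
)

for code, count in category_counts.most_common():
    print(LONG_NAMES.get(code, code), count)
```

Hangul syllables fall under "Other Letter" (Lo), which is why that category dominates the column.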
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 2265 | 3.4% |
파 | 2168 | 3.3% |
대 | 1949 | 3.0% |
트 | 1931 | 2.9% |
지 | 1852 | 2.8% |
동 | 1701 | 2.6% |
차 | 1592 | 2.4% |
단 | 1522 | 2.3% |
신 | 1516 | 2.3% |
성 | 1355 | 2.1% |
Other values (376) | 47844 | 72.8% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 117 | 19.5% |
K | 81 | 13.5% |
C | 62 | 10.3% |
H | 48 | 8.0% |
L | 42 | 7.0% |
D | 37 | 6.2% |
M | 37 | 6.2% |
I | 33 | 5.5% |
E | 30 | 5.0% |
G | 30 | 5.0% |
Other values (7) | 83 | 13.8% |
Lowercase Letter
Value | Count | Frequency (%) |
e | 193 | 60.3% |
l | 36 | 11.2% |
i | 32 | 10.0% |
v | 23 | 7.2% |
w | 11 | 3.4% |
s | 8 | 2.5% |
k | 6 | 1.9% |
a | 3 | 0.9% |
h | 3 | 0.9% |
g | 3 | 0.9% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1231 | 30.8% |
2 | 1184 | 29.6% |
3 | 525 | 13.1% |
4 | 271 | 6.8% |
5 | 205 | 5.1% |
6 | 181 | 4.5% |
7 | 109 | 2.7% |
9 | 105 | 2.6% |
8 | 101 | 2.5% |
0 | 82 | 2.1% |
Other Punctuation
Value | Count | Frequency (%) |
, | 108 | 85.0% |
. | 19 | 15.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 553 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 148 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 140 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 140 | 100.0% |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 4 | 100.0% |
Math Symbol
Value | Count | Frequency (%) |
~ | 3 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 65695 | 91.6% |
Common | 5105 | 7.1% |
Latin | 924 | 1.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 2265 | 3.4% |
파 | 2168 | 3.3% |
대 | 1949 | 3.0% |
트 | 1931 | 2.9% |
지 | 1852 | 2.8% |
동 | 1701 | 2.6% |
차 | 1592 | 2.4% |
단 | 1522 | 2.3% |
신 | 1516 | 2.3% |
성 | 1355 | 2.1% |
Other values (376) | 47844 | 72.8% |
Latin
Value | Count | Frequency (%) |
e | 193 | 20.9% |
S | 117 | 12.7% |
K | 81 | 8.8% |
C | 62 | 6.7% |
H | 48 | 5.2% |
L | 42 | 4.5% |
D | 37 | 4.0% |
M | 37 | 4.0% |
l | 36 | 3.9% |
I | 33 | 3.6% |
Other values (19) | 238 | 25.8% |
Common
Value | Count | Frequency (%) |
1 | 1231 | 24.1% |
2 | 1184 | 23.2% |
(space) | 553 | 10.8% |
3 | 525 | 10.3% |
4 | 271 | 5.3% |
5 | 205 | 4.0% |
6 | 181 | 3.5% |
- | 148 | 2.9% |
) | 140 | 2.7% |
( | 140 | 2.7% |
Other values (7) | 527 | 10.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 65695 | 91.6% |
ASCII | 6025 | 8.4% |
Number Forms | 4 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
아 | 2265 | 3.4% |
파 | 2168 | 3.3% |
대 | 1949 | 3.0% |
트 | 1931 | 2.9% |
지 | 1852 | 2.8% |
동 | 1701 | 2.6% |
차 | 1592 | 2.4% |
단 | 1522 | 2.3% |
신 | 1516 | 2.3% |
성 | 1355 | 2.1% |
Other values (376) | 47844 | 72.8% |
ASCII
Value | Count | Frequency (%) |
1 | 1231 | 20.4% |
2 | 1184 | 19.7% |
(space) | 553 | 9.2% |
3 | 525 | 8.7% |
4 | 271 | 4.5% |
5 | 205 | 3.4% |
e | 193 | 3.2% |
6 | 181 | 3.0% |
- | 148 | 2.5% |
) | 140 | 2.3% |
Other values (35) | 1394 | 23.1% |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 4 | 100.0% |
아파트코드 (apartment code)
Text
Distinct | 2119 |
---|---|
Distinct (%) | 21.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
a15089513 | 17 | 0.2% |
a12187906 | 13 | 0.1% |
a13203302 | 13 | 0.1% |
a12114001 | 12 | 0.1% |
a15721001 | 12 | 0.1% |
a12009102 | 12 | 0.1% |
a15780703 | 12 | 0.1% |
a15102902 | 12 | 0.1% |
a13290809 | 11 | 0.1% |
a13922903 | 11 | 0.1% |
Other values (2109) | 9875 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 18299 | 20.3% |
1 | 17713 | 19.7% |
A | 9992 | 11.1% |
3 | 8897 | 9.9% |
2 | 7940 | 8.8% |
5 | 6211 | 6.9% |
8 | 5862 | 6.5% |
7 | 4776 | 5.3% |
4 | 3897 | 4.3% |
6 | 3352 | 3.7% |
Other values (2) | 3061 | 3.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 80000 | 88.9% |
Uppercase Letter | 10000 | 11.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 18299 | 22.9% |
1 | 17713 | 22.1% |
3 | 8897 | 11.1% |
2 | 7940 | 9.9% |
5 | 6211 | 7.8% |
8 | 5862 | 7.3% |
7 | 4776 | 6.0% |
4 | 3897 | 4.9% |
6 | 3352 | 4.2% |
9 | 3053 | 3.8% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 9992 | 99.9% |
B | 8 | 0.1% |
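The counts above (80000 digits and 10000 uppercase letters over 10000 rows) suggest that every code is one letter plus eight digits, and the sample rows show the letter in first position. A hedged sketch of a format check built on that reading; the pattern `[AB][0-9]{8}` is an assumption inferred from these statistics, not a documented specification:

```python
import re

# Assumed format: one uppercase letter (A or B) followed by exactly 8 ASCII digits.
CODE_PATTERN = re.compile(r"[AB][0-9]{8}")

def is_valid_code(code: str) -> bool:
    """Return True if the whole string matches the assumed code format."""
    return CODE_PATTERN.fullmatch(code) is not None

# Codes taken from the sample rows at the end of the report.
sample_codes = ["A14024001", "A13184610", "A15210103"]
results = [is_valid_code(c) for c in sample_codes]

# Average characters per code implied by the category counts.
chars_per_code = (80_000 + 10_000) / 10_000
```

The lowercase forms in the frequency table above (for example `a15089513`) would fail this check; they appear to be a side effect of how the report tokenizes text, since the sample rows show uppercase codes.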
Most occurring scripts
Value | Count | Frequency (%) |
Common | 80000 | 88.9% |
Latin | 10000 | 11.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 18299 | 22.9% |
1 | 17713 | 22.1% |
3 | 8897 | 11.1% |
2 | 7940 | 9.9% |
5 | 6211 | 7.8% |
8 | 5862 | 7.3% |
7 | 4776 | 6.0% |
4 | 3897 | 4.9% |
6 | 3352 | 4.2% |
9 | 3053 | 3.8% |
Latin
Value | Count | Frequency (%) |
A | 9992 | 99.9% |
B | 8 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 90000 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 18299 | 20.3% |
1 | 17713 | 19.7% |
A | 9992 | 11.1% |
3 | 8897 | 9.9% |
2 | 7940 | 8.8% |
5 | 6211 | 6.9% |
8 | 5862 | 6.5% |
7 | 4776 | 5.3% |
4 | 3897 | 4.3% |
6 | 3352 | 3.7% |
Other values (2) | 3061 | 3.4% |
비용명 (expense name)
Categorical
Distinct | 44 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
교육비 | 464 |
---|---|
도서인쇄비 | 456 |
급여 | 455 |
사무용품비 | 447 |
통신비 | 438 |
Other values (39) | 7740 |
Length
Max length | 7 |
---|---|
Median length | 5 |
Mean length | 4.3145 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 수도광열비 |
---|---|
2nd row | 국민연금 |
3rd row | 교육비 |
4th row | 수도광열비 |
5th row | 기타사용료 |
Common Values
Value | Count | Frequency (%) |
교육비 | 464 | 4.6% |
도서인쇄비 | 456 | 4.6% |
급여 | 455 | 4.5% |
사무용품비 | 447 | 4.5% |
통신비 | 438 | 4.4% |
세대전기료 | 435 | 4.3% |
세대수도료 | 429 | 4.3% |
퇴직급여 | 425 | 4.2% |
산재보험료 | 414 | 4.1% |
제수당 | 413 | 4.1% |
Other values (34) | 5624 |
년월일 (year-month-day)
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
201904 | 10000 |
---|---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 201904 |
---|---|
2nd row | 201904 |
3rd row | 201904 |
4th row | 201904 |
5th row | 201904 |
Common Values
Value | Count | Frequency (%) |
201904 | 10000 | 100.0% |
금액 (amount)
Real number (ℝ)
ZEROS
Distinct | 7390 |
---|---|
Distinct (%) | 73.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3349629.6 |
Minimum | -2627960 |
---|---|
Maximum | 3.3375535 × 10^8 |
Zeros | 1028 |
Zeros (%) | 10.3% |
Negative | 9 |
Negative (%) | 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -2627960 |
---|---|
5-th percentile | 0 |
Q1 | 70157.5 |
median | 250435 |
Q3 | 1113225 |
95-th percentile | 17781610 |
Maximum | 3.3375535 × 10^8 |
Range | 3.3638331 × 10^8 |
Interquartile range (IQR) | 1043067.5 |
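The range and the interquartile range follow directly from the quantiles listed above. A small check, with the maximum written out in full as it appears in the report's table of largest values:

```python
# Quantile figures copied from the table above.
minimum = -2_627_960
maximum = 333_755_350
q1 = 70_157.5
q3 = 1_113_225

# Range: distance between the largest and smallest value.
value_range = maximum - minimum

# Interquartile range: spread of the middle 50% of the values.
iqr = q3 - q1

print(value_range, iqr)
```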
Descriptive statistics
Standard deviation | 10902978 |
---|---|
Coefficient of variation (CV) | 3.2549803 |
Kurtosis | 157.24517 |
Mean | 3349629.6 |
Median Absolute Deviation (MAD) | 240590 |
Skewness | 9.3131543 |
Sum | 3.3496296 × 10^10 |
Variance | 1.1887494 × 10^14 |
Monotonicity | Not monotonic |
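Several of the descriptive statistics are derived from one another. The sketch below recomputes three of them from the rounded mean and standard deviation printed above, so the results agree with the table only to roughly the displayed precision:

```python
import math

# Rounded figures copied from the report.
mean = 3_349_629.6
std = 10_902_978
n = 10_000  # number of observations

# Coefficient of variation: standard deviation relative to the mean.
cv = std / mean

# Variance is the square of the standard deviation.
variance = std ** 2

# Sum is the mean multiplied by the number of observations.
total = mean * n

print(f"CV={cv:.7f}  variance={variance:.7e}  sum={total:.7e}")
```

A coefficient of variation above 3, together with the skewness of about 9.3, points to a heavily right-skewed column in which a few very large amounts dominate the mean.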
Common Values
Value | Count | Frequency (%) |
0 | 1028 | 10.3% |
200000 | 121 | 1.2% |
300000 | 57 | 0.6% |
110000 | 45 | 0.4% |
100000 | 43 | 0.4% |
150000 | 38 | 0.4% |
30000 | 34 | 0.3% |
10000 | 27 | 0.3% |
165000 | 25 | 0.2% |
400000 | 24 | 0.2% |
Other values (7380) | 8558 | 85.6% |
Minimum 10 values
Value | Count | Frequency (%) |
-2627960 | 1 | < 0.1% |
-1292560 | 1 | < 0.1% |
-757062 | 1 | < 0.1% |
-700000 | 1 | < 0.1% |
-535200 | 1 | < 0.1% |
-350000 | 1 | < 0.1% |
-201100 | 1 | < 0.1% |
-186247 | 1 | < 0.1% |
-172000 | 1 | < 0.1% |
0 | 1028 | 10.3% |
Maximum 10 values
Value | Count | Frequency (%) |
333755350 | 1 | < 0.1% |
205216900 | 1 | < 0.1% |
199690970 | 1 | < 0.1% |
189988400 | 1 | < 0.1% |
189299140 | 1 | < 0.1% |
171402010 | 1 | < 0.1% |
158456903 | 1 | < 0.1% |
143793110 | 1 | < 0.1% |
136235501 | 1 | < 0.1% |
133920000 | 1 | < 0.1% |
 | 비용명 | 금액 |
---|---|---|
비용명 | 1.000 | 0.407 |
금액 | 0.407 | 1.000 |
 | 금액 | 비용명 |
---|---|---|
금액 | 1.000 | 0.165 |
비용명 | 0.165 | 1.000 |
 | 아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 |
---|---|---|---|---|---|
29510 | 서빙고금호베스트빌 | A14024001 | 수도광열비 | 201904 | 168240 |
10284 | 신내5단지대림두산 | A13184610 | 국민연금 | 201904 | 1215880 |
35817 | 동부(돌타운)아파트 | A15210103 | 교육비 | 201904 | 0 |
31478 | 광장현대8단지 | A14381510 | 수도광열비 | 201904 | 190310 |
6639 | 창전현대홈타운 | A12188202 | 기타사용료 | 201904 | 941000 |
4662 | 연희한양아파트 | A12081703 | 제수당 | 201904 | 1829110 |
29837 | 현대한강 | A14085501 | 건강보험료 | 201904 | 585820 |
30248 | 우이대우 | A14209001 | 교육비 | 201904 | 0 |
25186 | 풍납 현대리버빌1차 | A13887405 | 복리후생비 | 201904 | 700000 |
44587 | 은평뉴타운폭포동4단지제2 | A41279924 | 건강보험료 | 201904 | 559340 |
 | 아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 |
---|---|---|---|---|---|
18010 | 개나리SKVIEW | A13579506 | 제수당 | 201904 | 40020 |
13876 | 마장중앙하이츠 | A13381601 | 교통비 | 201904 | 1750 |
14309 | 무학현대 | A13385802 | 교육비 | 201904 | 0 |
31298 | 광장극동1차 | A14380409 | 공동수도료 | 201904 | 1309110 |
37371 | 관악벽산타운5단지 | A15303205 | 퇴직급여 | 201904 | 3080000 |
19342 | 종암우림카이저팰리스 | A13609001 | 세대전기료 | 201904 | 7088620 |
20717 | 돈암동부센트레빌 | A13681303 | 국민연금 | 201904 | 466760 |
35685 | 개봉삼환 | A15209205 | 회계감사비 | 201904 | 132000 |
13928 | 사근중앙하이츠 | A13381701 | 고용보험료 | 201904 | 64830 |
32186 | 당산2차효성타운 | A15004503 | 세대수도료 | 201904 | 9309810 |