Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 488.3 KiB |
Average record size in memory | 50.0 B |
Variable types
Text | 2 |
---|---|
Categorical | 2 |
Numeric | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15822/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-11 05:47:25.311478 |
---|---|
Analysis finished | 2024-05-11 05:47:26.286602 |
Duration | 0.98 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
아파트명
Text
Distinct | 2098 |
---|---|
Distinct (%) | 21.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 22 |
---|---|
Median length | 20 |
Mean length | 7.1492 |
Min length | 2 |
Characters and Unicode
Total characters | 71492 |
---|---|
Distinct characters | 429 |
Distinct categories | 11 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 81 ? |
---|---|
Unique (%) | 0.8% |
Sample
1st row | 묵동신안3차 |
---|---|
2nd row | 등촌우성102동 |
3rd row | 은평뉴타운박석고개제12단지아파트 |
4th row | 신도림대림3차 |
5th row | 염창롯데캐슬 |
Value | Count | Frequency (%) |
아파트 | 94 | 0.9% |
래미안 | 31 | 0.3% |
송파 | 13 | 0.1% |
잠원신화 | 13 | 0.1% |
광장힐스테이트 | 12 | 0.1% |
신길우성2차 | 12 | 0.1% |
응암경남 | 12 | 0.1% |
dmc자이1단지 | 12 | 0.1% |
암사선사현대 | 12 | 0.1% |
신내 | 12 | 0.1% |
Other values (2152) | 10282 |
Most occurring characters
Value | Count | Frequency (%) |
아 | 2243 | 3.1% |
파 | 2131 | 3.0% |
트 | 1883 | 2.6% |
지 | 1842 | 2.6% |
대 | 1838 | 2.6% |
동 | 1730 | 2.4% |
차 | 1538 | 2.2% |
신 | 1529 | 2.1% |
단 | 1474 | 2.1% |
성 | 1353 | 1.9% |
Other values (419) | 53931 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 65497 | |
Decimal Number | 3876 | 5.4% |
Uppercase Letter | 697 | 1.0% |
Space Separator | 553 | 0.8% |
Lowercase Letter | 308 | 0.4% |
Open Punctuation | 148 | 0.2% |
Close Punctuation | 148 | 0.2% |
Dash Punctuation | 133 | 0.2% |
Other Punctuation | 120 | 0.2% |
Math Symbol | 6 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 2243 | 3.4% |
파 | 2131 | 3.3% |
트 | 1883 | 2.9% |
지 | 1842 | 2.8% |
대 | 1838 | 2.8% |
동 | 1730 | 2.6% |
차 | 1538 | 2.3% |
신 | 1529 | 2.3% |
단 | 1474 | 2.3% |
성 | 1353 | 2.1% |
Other values (373) | 47936 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 122 | |
K | 88 | |
C | 80 | |
L | 64 | |
H | 53 | |
M | 44 | 6.3% |
D | 44 | 6.3% |
G | 42 | 6.0% |
I | 38 | 5.5% |
E | 33 | 4.7% |
Other values (7) | 89 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 171 | |
l | 40 | 13.0% |
i | 30 | 9.7% |
v | 22 | 7.1% |
s | 10 | 3.2% |
c | 8 | 2.6% |
h | 8 | 2.6% |
w | 7 | 2.3% |
k | 6 | 1.9% |
a | 3 | 1.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1221 | |
2 | 1129 | |
3 | 499 | |
4 | 262 | 6.8% |
5 | 195 | 5.0% |
6 | 154 | 4.0% |
7 | 113 | 2.9% |
8 | 107 | 2.8% |
0 | 98 | 2.5% |
9 | 98 | 2.5% |
Other Punctuation
Value | Count | Frequency (%) |
, | 99 | |
. | 21 | 17.5% |
Space Separator
Value | Count | Frequency (%) |
553 |
Open Punctuation
Value | Count | Frequency (%) |
( | 148 |
Close Punctuation
Value | Count | Frequency (%) |
) | 148 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 133 |
Math Symbol
Value | Count | Frequency (%) |
~ | 6 |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 6 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 65497 | |
Common | 4984 | 7.0% |
Latin | 1011 | 1.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 2243 | 3.4% |
파 | 2131 | 3.3% |
트 | 1883 | 2.9% |
지 | 1842 | 2.8% |
대 | 1838 | 2.8% |
동 | 1730 | 2.6% |
차 | 1538 | 2.3% |
신 | 1529 | 2.3% |
단 | 1474 | 2.3% |
성 | 1353 | 2.1% |
Other values (373) | 47936 |
Latin
Value | Count | Frequency (%) |
e | 171 | |
S | 122 | |
K | 88 | 8.7% |
C | 80 | 7.9% |
L | 64 | 6.3% |
H | 53 | 5.2% |
M | 44 | 4.4% |
D | 44 | 4.4% |
G | 42 | 4.2% |
l | 40 | 4.0% |
Other values (19) | 263 |
Common
Value | Count | Frequency (%) |
1 | 1221 | |
2 | 1129 | |
553 | ||
3 | 499 | |
4 | 262 | 5.3% |
5 | 195 | 3.9% |
6 | 154 | 3.1% |
( | 148 | 3.0% |
) | 148 | 3.0% |
- | 133 | 2.7% |
Other values (7) | 542 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 65497 | |
ASCII | 5989 | 8.4% |
Number Forms | 6 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
아 | 2243 | 3.4% |
파 | 2131 | 3.3% |
트 | 1883 | 2.9% |
지 | 1842 | 2.8% |
대 | 1838 | 2.8% |
동 | 1730 | 2.6% |
차 | 1538 | 2.3% |
신 | 1529 | 2.3% |
단 | 1474 | 2.3% |
성 | 1353 | 2.1% |
Other values (373) | 47936 |
ASCII
Value | Count | Frequency (%) |
1 | 1221 | |
2 | 1129 | |
553 | 9.2% | |
3 | 499 | 8.3% |
4 | 262 | 4.4% |
5 | 195 | 3.3% |
e | 171 | 2.9% |
6 | 154 | 2.6% |
( | 148 | 2.5% |
) | 148 | 2.5% |
Other values (35) | 1509 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 6 |
아파트코드
Text
Distinct | 2105 |
---|---|
Distinct (%) | 21.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
a13790703 | 13 | 0.1% |
a15086007 | 12 | 0.1% |
a14005001 | 12 | 0.1% |
a12201301 | 12 | 0.1% |
a15010306 | 12 | 0.1% |
a13920205 | 12 | 0.1% |
a14375301 | 12 | 0.1% |
a13405201 | 12 | 0.1% |
a12275501 | 12 | 0.1% |
a15083701 | 12 | 0.1% |
Other values (2095) | 9879 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 18197 | |
1 | 17560 | |
A | 9988 | |
3 | 8883 | |
2 | 8178 | |
5 | 6270 | 7.0% |
8 | 5784 | 6.4% |
7 | 4839 | 5.4% |
4 | 3835 | 4.3% |
6 | 3422 | 3.8% |
Other values (2) | 3044 | 3.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 80000 | |
Uppercase Letter | 10000 | 11.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 18197 | |
1 | 17560 | |
3 | 8883 | |
2 | 8178 | |
5 | 6270 | 7.8% |
8 | 5784 | 7.2% |
7 | 4839 | 6.0% |
4 | 3835 | 4.8% |
6 | 3422 | 4.3% |
9 | 3032 | 3.8% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 9988 | |
B | 12 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 80000 | |
Latin | 10000 | 11.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 18197 | |
1 | 17560 | |
3 | 8883 | |
2 | 8178 | |
5 | 6270 | 7.8% |
8 | 5784 | 7.2% |
7 | 4839 | 6.0% |
4 | 3835 | 4.8% |
6 | 3422 | 4.3% |
9 | 3032 | 3.8% |
Latin
Value | Count | Frequency (%) |
A | 9988 | |
B | 12 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 90000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 18197 | |
1 | 17560 | |
A | 9988 | |
3 | 8883 | |
2 | 8178 | |
5 | 6270 | 7.0% |
8 | 5784 | 6.4% |
7 | 4839 | 5.4% |
4 | 3835 | 4.3% |
6 | 3422 | 3.8% |
Other values (2) | 3044 | 3.4% |
비용명
Categorical
Distinct | 44 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
통신비 | 470 |
---|---|
교육비 | 456 |
급여 | 455 |
제수당 | 441 |
퇴직급여 | 440 |
Other values (39) |
Length
Max length | 7 |
---|---|
Median length | 5 |
Mean length | 4.2933 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 업무추진비 |
---|---|
2nd row | 소모품비 |
3rd row | 식대 |
4th row | 세대수도료 |
5th row | 지급수수료 |
Common Values
Value | Count | Frequency (%) |
통신비 | 470 | 4.7% |
교육비 | 456 | 4.6% |
급여 | 455 | 4.5% |
제수당 | 441 | 4.4% |
퇴직급여 | 440 | 4.4% |
사무용품비 | 438 | 4.4% |
세대전기료 | 436 | 4.4% |
산재보험료 | 419 | 4.2% |
복리후생비 | 418 | 4.2% |
소모품비 | 408 | 4.1% |
Other values (34) | 5619 |
Length
Value | Count | Frequency (%) |
통신비 | 470 | 4.7% |
교육비 | 456 | 4.6% |
급여 | 455 | 4.5% |
제수당 | 441 | 4.4% |
퇴직급여 | 440 | 4.4% |
사무용품비 | 438 | 4.4% |
세대전기료 | 436 | 4.4% |
산재보험료 | 419 | 4.2% |
복리후생비 | 418 | 4.2% |
소모품비 | 408 | 4.1% |
Other values (34) | 5619 |
년월일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
201905 |
---|
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 201905 |
---|---|
2nd row | 201905 |
3rd row | 201905 |
4th row | 201905 |
5th row | 201905 |
Common Values
Value | Count | Frequency (%) |
201905 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
201905 | 10000 |
금액
Real number (ℝ)
ZEROS
 
Distinct | 7378 |
---|---|
Distinct (%) | 73.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2790094.2 |
Minimum | -1538510 |
---|---|
Maximum | 1.7469711 × 108 |
Zeros | 1094 |
Zeros (%) | 10.9% |
Negative | 8 |
Negative (%) | 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -1538510 |
---|---|
5-th percentile | 0 |
Q1 | 66790 |
median | 227430 |
Q3 | 1053542.5 |
95-th percentile | 15021018 |
Maximum | 1.7469711 × 108 |
Range | 1.7623562 × 108 |
Interquartile range (IQR) | 986752.5 |
Descriptive statistics
Standard deviation | 8620419.1 |
---|---|
Coefficient of variation (CV) | 3.0896516 |
Kurtosis | 100.65088 |
Mean | 2790094.2 |
Median Absolute Deviation (MAD) | 221055 |
Skewness | 7.9549078 |
Sum | 2.7900942 × 1010 |
Variance | 7.4311625 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 1094 | 10.9% |
200000 | 126 | 1.3% |
300000 | 59 | 0.6% |
100000 | 59 | 0.6% |
110000 | 44 | 0.4% |
30000 | 39 | 0.4% |
150000 | 28 | 0.3% |
400000 | 25 | 0.2% |
600000 | 25 | 0.2% |
500000 | 23 | 0.2% |
Other values (7368) | 8478 |
Value | Count | Frequency (%) |
-1538510 | 1 | < 0.1% |
-852400 | 1 | < 0.1% |
-229350 | 1 | < 0.1% |
-79270 | 1 | < 0.1% |
-49720 | 1 | < 0.1% |
-43670 | 2 | < 0.1% |
-5000 | 1 | < 0.1% |
0 | 1094 | |
380 | 1 | < 0.1% |
400 | 1 | < 0.1% |
Value | Count | Frequency (%) |
174697110 | 1 | |
168550584 | 1 | |
161978290 | 1 | |
161745530 | 1 | |
139898460 | 1 | |
136750370 | 1 | |
132069176 | 1 | |
131251920 | 1 | |
131159600 | 1 | |
113444460 | 1 |
비용명 | 금액 | |
---|---|---|
비용명 | 1.000 | 0.470 |
금액 | 0.470 | 1.000 |
금액 | 비용명 | |
---|---|---|
금액 | 1.000 | 0.181 |
비용명 | 0.181 | 1.000 |
아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 | |
---|---|---|---|---|---|
9743 | 묵동신안3차 | A13114106 | 업무추진비 | 201905 | 150000 |
41601 | 등촌우성102동 | A15772902 | 소모품비 | 201905 | 0 |
44969 | 은평뉴타운박석고개제12단지아파트 | A41279911 | 식대 | 201905 | 723300 |
37621 | 신도림대림3차 | A15288802 | 세대수도료 | 201905 | 4156800 |
40728 | 염창롯데캐슬 | A15704015 | 지급수수료 | 201905 | 0 |
10755 | 신내우남푸르미아 | A13186502 | 감가상각비 | 201905 | 121800 |
27481 | 불암현대 | A13981208 | 교통비 | 201905 | 6500 |
41111 | 방화한진로즈힐 | A15722005 | 사무용품비 | 201905 | 0 |
4733 | 디엠씨한양 | A12081703 | 통신비 | 201905 | 121340 |
37463 | 구로현대연예인 | A15286807 | 제수당 | 201905 | 3296410 |
아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 | |
---|---|---|---|---|---|
26164 | 진로유통조합대림 | A13922002 | 소모품비 | 201905 | 12500 |
24308 | 잠실아시아선수촌 | A13822701 | 기타부대비 | 201905 | 1020370 |
13640 | 옥수하이츠제2 | A13375904 | 세대수도료 | 201905 | 541410 |
38338 | 현대공무원 | A15384101 | 도서인쇄비 | 201905 | 77000 |
44365 | 양천롯데캐슬 | A15883202 | 사무용품비 | 201905 | 39810 |
38586 | 신대방신동아 | A15601201 | 통신비 | 201905 | 35770 |
10604 | 묵동브라운스톤태릉 | A13185508 | 사무용품비 | 201905 | 44000 |
21324 | 방배아크로리버 | A13706001 | 사무용품비 | 201905 | 64290 |
36515 | 오류금강수목원 | A15210211 | 급여 | 201905 | 16606530 |
21110 | 정릉푸른마을동아 | A13684605 | 기타사용료 | 201905 | 0 |