Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 488.3 KiB |
Average record size in memory | 50.0 B |
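The average record size above appears to be the total in-memory size divided by the number of observations. A minimal sketch of that arithmetic, using only the figures reported in this table (the variable names are illustrative and not part of the report):

```python
# Figures copied from the "Dataset statistics" table above.
KIB = 1024                 # bytes per KiB
total_size_kib = 488.3     # "Total size in memory"
n_observations = 10_000    # "Number of observations"

# Assumed relationship: average record size = total bytes / number of rows.
avg_record_bytes = total_size_kib * KIB / n_observations
print(f"{avg_record_bytes:.1f} B")
```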
Variable types
Text | 2 |
---|---|
Categorical | 2 |
Numeric | 1 |
Dataset
Description | File download |
---|---|
Author | 서울특별시 (Seoul Metropolitan Government) |
URL | https://data.seoul.go.kr/dataList/OA-15822/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-11 05:47:49.624976 |
---|---|
Analysis finished | 2024-05-11 05:47:50.463047 |
Duration | 0.84 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
아파트명 (apartment name)
Text
Distinct | 2098 |
---|---|
Distinct (%) | 21.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
아파트 | 100 | 1.0% |
래미안 | 32 | 0.3% |
암사선사현대 | 17 | 0.2% |
신내 | 14 | 0.1% |
래미안밤섬리베뉴 | 14 | 0.1% |
가양대림경동 | 13 | 0.1% |
브라운스톤 | 13 | 0.1% |
제기안암골벽산 | 12 | 0.1% |
반포리체 | 12 | 0.1% |
보문파크뷰자이아파트 | 12 | 0.1% |
Other values (2150) | 10228 |
Most occurring characters
Value | Count | Frequency (%) |
아 | 2190 | 3.1% |
파 | 2075 | 2.9% |
대 | 1899 | 2.7% |
지 | 1895 | 2.7% |
트 | 1828 | 2.6% |
동 | 1707 | 2.4% |
차 | 1568 | 2.2% |
신 | 1561 | 2.2% |
단 | 1510 | 2.1% |
성 | 1371 | 1.9% |
Other values (421) | 53660 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 65238 | 91.5% |
Decimal Number | 3981 | 5.6% |
Uppercase Letter | 604 | 0.8% |
Space Separator | 514 | 0.7% |
Lowercase Letter | 383 | 0.5% |
Dash Punctuation | 148 | 0.2% |
Other Punctuation | 130 | 0.2% |
Close Punctuation | 127 | 0.2% |
Open Punctuation | 127 | 0.2% |
Math Symbol | 7 | < 0.1% |
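The category labels in this table correspond to Unicode general categories. The sketch below shows one way such counts could be reproduced with Python's standard `unicodedata` module. It is not the profiler's own implementation: the `CATEGORY_LABELS` mapping, the `count_categories` helper, and the sample names are illustrative assumptions.

```python
import unicodedata
from collections import Counter

# Unicode general-category codes mapped to the labels used in the report.
# Only the categories that appear in this report are listed.
CATEGORY_LABELS = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Sm": "Math Symbol",
    "Nl": "Letter Number",
}

def count_categories(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unmapped categories.
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts

# Hypothetical apartment names, used only to exercise the helper.
sample_names = ["래미안", "SK뷰 2차", "현대(1-2단지)"]
category_counts = count_categories(sample_names)
print(category_counts.most_common())
```

Hangul syllables fall under "Other Letter" because Hangul has no upper or lower case, which is why that category dominates this column.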
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
아 | 2190 | 3.4% |
파 | 2075 | 3.2% |
대 | 1899 | 2.9% |
지 | 1895 | 2.9% |
트 | 1828 | 2.8% |
동 | 1707 | 2.6% |
차 | 1568 | 2.4% |
신 | 1561 | 2.4% |
단 | 1510 | 2.3% |
성 | 1371 | 2.1% |
Other values (375) | 47634 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 116 | |
K | 75 | |
C | 61 | |
L | 50 | |
H | 44 | 7.3% |
M | 42 | 7.0% |
D | 42 | 7.0% |
G | 41 | 6.8% |
I | 29 | 4.8% |
E | 24 | 4.0% |
Other values (7) | 80 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 183 | |
l | 54 | 14.1% |
i | 47 | 12.3% |
v | 36 | 9.4% |
s | 15 | 3.9% |
w | 13 | 3.4% |
k | 11 | 2.9% |
a | 7 | 1.8% |
g | 7 | 1.8% |
h | 6 | 1.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 1226 | |
2 | 1155 | |
3 | 506 | |
4 | 254 | 6.4% |
5 | 212 | 5.3% |
6 | 170 | 4.3% |
8 | 132 | 3.3% |
7 | 120 | 3.0% |
9 | 114 | 2.9% |
0 | 92 | 2.3% |
Other Punctuation
Value | Count | Frequency (%) |
, | 106 | |
. | 24 | 18.5% |
Space Separator
Value | Count | Frequency (%) |
(space) | 514 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 148 |
Close Punctuation
Value | Count | Frequency (%) |
) | 127 |
Open Punctuation
Value | Count | Frequency (%) |
( | 127 |
Math Symbol
Value | Count | Frequency (%) |
~ | 7 |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 5 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 65238 | 91.5% |
Common | 5034 | 7.1% |
Latin | 992 | 1.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
아 | 2190 | 3.4% |
파 | 2075 | 3.2% |
대 | 1899 | 2.9% |
지 | 1895 | 2.9% |
트 | 1828 | 2.8% |
동 | 1707 | 2.6% |
차 | 1568 | 2.4% |
신 | 1561 | 2.4% |
단 | 1510 | 2.3% |
성 | 1371 | 2.1% |
Other values (375) | 47634 |
Latin
Value | Count | Frequency (%) |
e | 183 | |
S | 116 | |
K | 75 | 7.6% |
C | 61 | 6.1% |
l | 54 | 5.4% |
L | 50 | 5.0% |
i | 47 | 4.7% |
H | 44 | 4.4% |
M | 42 | 4.2% |
D | 42 | 4.2% |
Other values (19) | 278 |
Common
Value | Count | Frequency (%) |
1 | 1226 | |
2 | 1155 | |
(space) | 514 | |
3 | 506 | |
4 | 254 | 5.0% |
5 | 212 | 4.2% |
6 | 170 | 3.4% |
- | 148 | 2.9% |
8 | 132 | 2.6% |
) | 127 | 2.5% |
Other values (7) | 590 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 65238 | 91.5% |
ASCII | 6021 | 8.4% |
Number Forms | 5 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
아 | 2190 | 3.4% |
파 | 2075 | 3.2% |
대 | 1899 | 2.9% |
지 | 1895 | 2.9% |
트 | 1828 | 2.8% |
동 | 1707 | 2.6% |
차 | 1568 | 2.4% |
신 | 1561 | 2.4% |
단 | 1510 | 2.3% |
성 | 1371 | 2.1% |
Other values (375) | 47634 |
ASCII
Value | Count | Frequency (%) |
1 | 1226 | |
2 | 1155 | |
(space) | 514 | 8.5% |
3 | 506 | 8.4% |
4 | 254 | 4.2% |
5 | 212 | 3.5% |
e | 183 | 3.0% |
6 | 170 | 2.8% |
- | 148 | 2.5% |
8 | 132 | 2.2% |
Other values (35) | 1521 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 5 |
아파트코드 (apartment code)
Text
Distinct | 2104 |
---|---|
Distinct (%) | 21.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
a13405201 | 17 | 0.2% |
a15780703 | 13 | 0.1% |
a15786104 | 12 | 0.1% |
a12208102 | 12 | 0.1% |
a15703301 | 12 | 0.1% |
a12114001 | 12 | 0.1% |
a12187906 | 12 | 0.1% |
a13086101 | 12 | 0.1% |
a10027189 | 12 | 0.1% |
a13776301 | 12 | 0.1% |
Other values (2094) | 9874 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 18272 | |
1 | 17577 | |
A | 9990 | |
3 | 9136 | |
2 | 7891 | |
5 | 6157 | 6.8% |
8 | 5741 | 6.4% |
7 | 4885 | 5.4% |
4 | 3806 | 4.2% |
6 | 3414 | 3.8% |
Other values (2) | 3131 | 3.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 80000 | 88.9% |
Uppercase Letter | 10000 | 11.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 18272 | |
1 | 17577 | |
3 | 9136 | |
2 | 7891 | |
5 | 6157 | 7.7% |
8 | 5741 | 7.2% |
7 | 4885 | 6.1% |
4 | 3806 | 4.8% |
6 | 3414 | 4.3% |
9 | 3121 | 3.9% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 9990 | |
B | 10 | 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 80000 | 88.9% |
Latin | 10000 | 11.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 18272 | |
1 | 17577 | |
3 | 9136 | |
2 | 7891 | |
5 | 6157 | 7.7% |
8 | 5741 | 7.2% |
7 | 4885 | 6.1% |
4 | 3806 | 4.8% |
6 | 3414 | 4.3% |
9 | 3121 | 3.9% |
Latin
Value | Count | Frequency (%) |
A | 9990 | |
B | 10 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 90000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 18272 | |
1 | 17577 | |
A | 9990 | |
3 | 9136 | |
2 | 7891 | |
5 | 6157 | 6.8% |
8 | 5741 | 6.4% |
7 | 4885 | 5.4% |
4 | 3806 | 4.2% |
6 | 3414 | 3.8% |
Other values (2) | 3131 | 3.5% |
비용명 (cost item name)
Categorical
Distinct | 44 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count |
---|---|
급여 | 514 |
통신비 | 487 |
산재보험료 | 467 |
세대전기료 | 455 |
도서인쇄비 | 454 |
Other values (39) | 7623 |
Length
Max length | 7 |
---|---|
Median length | 5 |
Mean length | 4.3326 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 산재보험료 |
---|---|
2nd row | 국민연금 |
3rd row | 퇴직급여 |
4th row | 차량유지비 |
5th row | 도서인쇄비 |
Common Values
Value | Count | Frequency (%) |
급여 | 514 | 5.1% |
통신비 | 487 | 4.9% |
산재보험료 | 467 | 4.7% |
세대전기료 | 455 | 4.5% |
도서인쇄비 | 454 | 4.5% |
퇴직급여 | 452 | 4.5% |
사무용품비 | 445 | 4.5% |
제수당 | 439 | 4.4% |
국민연금 | 430 | 4.3% |
기타부대비 | 418 | 4.2% |
Other values (34) | 5439 |
년월일 (date)
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count |
---|---|
201901 | 10000 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 201901 |
---|---|
2nd row | 201901 |
3rd row | 201901 |
4th row | 201901 |
5th row | 201901 |
Common Values
Value | Count | Frequency (%) |
201901 | 10000 |
금액 (amount)
Real number (ℝ)
ZEROS
Distinct | 7823 |
---|---|
Distinct (%) | 78.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4855792 |
Minimum | -2913440 |
---|---|
Maximum | 7.0520912 × 10⁸ |
Zeros | 310 |
Zeros (%) | 3.1% |
Negative | 7 |
Negative (%) | 0.1% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -2913440 |
---|---|
5-th percentile | 5335.5 |
Q1 | 100000 |
median | 281995 |
Q3 | 1350670 |
95-th percentile | 24951920 |
Maximum | 7.0520912 × 10⁸ |
Range | 7.0812256 × 10⁸ |
Interquartile range (IQR) | 1250670 |
Descriptive statistics
Standard deviation | 19265478 |
---|---|
Coefficient of variation (CV) | 3.9675255 |
Kurtosis | 337.99309 |
Mean | 4855792 |
Median Absolute Deviation (MAD) | 239710 |
Skewness | 13.929104 |
Sum | 4.855792 × 10¹⁰ |
Variance | 3.7115866 × 10¹⁴ |
Monotonicity | Not monotonic |
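Several of the statistics above can be derived from the others. The following sketch recomputes them from the reported figures, assuming the usual textbook definitions (CV = standard deviation / mean, IQR = Q3 − Q1, range = maximum − minimum, sum = mean × count, variance = standard deviation squared). The variable names are illustrative, and small differences in the last digits are expected because the inputs are already rounded.

```python
# Figures copied from the 금액 statistics tables above.
n = 10_000
mean = 4_855_792
std = 19_265_478
q1, q3 = 100_000, 1_350_670
minimum, maximum = -2_913_440, 705_209_120

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
total = mean * n                 # sum, given the (rounded) mean
variance = std ** 2              # variance, given the (rounded) std

print(f"CV       = {cv:.7f}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range:.7e}")
print(f"Sum      = {total:.6e}")
print(f"Variance = {variance:.7e}")
```

A CV of nearly 4 alongside a skewness of about 13.9 points to a heavily right-skewed amount distribution, so the median (281995) is likely a more representative "typical" value than the mean.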
Value | Count | Frequency (%) |
0 | 310 | 3.1% |
78000 | 122 | 1.2% |
200000 | 110 | 1.1% |
300000 | 66 | 0.7% |
100000 | 59 | 0.6% |
110000 | 44 | 0.4% |
150000 | 42 | 0.4% |
10000 | 23 | 0.2% |
50000 | 23 | 0.2% |
121000 | 23 | 0.2% |
Other values (7813) | 9178 |
Value | Count | Frequency (%) |
-2913440 | 1 | < 0.1% |
-822520 | 1 | < 0.1% |
-279620 | 1 | < 0.1% |
-132000 | 1 | < 0.1% |
-106990 | 1 | < 0.1% |
-60760 | 1 | < 0.1% |
-50000 | 1 | < 0.1% |
0 | 310 | |
7 | 1 | < 0.1% |
400 | 2 | < 0.1% |
Value | Count | Frequency (%) |
705209120 | 1 | |
568227968 | 1 | |
480668350 | 1 | |
390648530 | 1 | |
278879680 | 1 | |
265930050 | 1 | |
263861480 | 1 | |
256988102 | 1 | |
242252330 | 1 | |
230042320 | 1 |
Variable | 비용명 | 금액 |
---|---|---|
비용명 | 1.000 | 0.469 |
금액 | 0.469 | 1.000 |
Variable | 금액 | 비용명 |
---|---|---|
금액 | 1.000 | 0.195 |
비용명 | 0.195 | 1.000 |
Index | 아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 |
---|---|---|---|---|---|
17339 | 도곡현대 | A13586102 | 산재보험료 | 201901 | 145770 |
4069 | 북가좌삼호제2 | A12076601 | 국민연금 | 201901 | 181980 |
22822 | 잠실미성 | A13824004 | 퇴직급여 | 201901 | 3310320 |
13802 | 신성둔촌미소지움1차 | A13406205 | 차량유지비 | 201901 | 200000 |
3737 | 북가좌휴먼빌 | A12013001 | 도서인쇄비 | 201901 | 406500 |
8886 | 묵동신안3차 | A13114106 | 퇴직급여 | 201901 | 1098110 |
40063 | 목동부영그린타운3차 | A15805301 | 세대수도료 | 201901 | 3471040 |
28086 | 산천리버힐제2 | A14076401 | 고용보험료 | 201901 | 26900 |
3480 | 홍제현대그린 | A12009303 | 고용보험료 | 201901 | 40870 |
30139 | 여의도광장 | A15001019 | 교육비 | 201901 | 19000 |
Index | 아파트명 | 아파트코드 | 비용명 | 년월일 | 금액 |
---|---|---|---|---|---|
32333 | 신길자이 | A15096001 | 급여 | 201901 | 6441430 |
37168 | 대방대림 | A15681110 | 세대난방비 | 201901 | 149005360 |
41375 | 신정대림 | A15885303 | 퇴직급여 | 201901 | 750160 |
27098 | 하계현대우성 | A13987303 | 소모품비 | 201901 | 918200 |
12186 | 금호삼성래미안 | A13309102 | 산재보험료 | 201901 | 532630 |
15326 | 아이파크삼성동 | A13509009 | 기타인건비 | 201901 | 5709407 |
20153 | 반포미도2차 | A13770105 | 국민연금 | 201901 | 401420 |
38851 | 가양강나루현대 | A15780401 | 국민연금 | 201901 | 229960 |
1153 | 금천롯데캐슬골드파크1차아파트 | A10027188 | 교육비 | 201901 | -132000 |
26866 | 중계우성3차 | A13986201 | 건강보험료 | 201901 | 240890 |