Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 6232 |
Missing cells | 25696 |
Missing cells (%) | 31.7% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 663.5 KiB |
Average record size in memory | 109.0 B |
Variable types
Numeric | 4 |
---|---|
Text | 6 |
Categorical | 2 |
Unsupported | 1 |
Dataset
Description | 년도,제안번호,사업명,예산편성사업명,예산편성사업비,사업위치,예산편성계획서,지출금액,사업추진단계,사업추진집행률,집행기준일,비고,결과보고서 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-15413/S/1/datasetView.do |
년도 is highly overall correlated with 제안번호 | High correlation |
제안번호 is highly overall correlated with 년도 | High correlation |
예산편성사업비 is highly overall correlated with 지출금액 | High correlation |
지출금액 is highly overall correlated with 예산편성사업비 | High correlation |
사업추진단계 is highly imbalanced (53.0%) | Imbalance |
예산편성사업명 has 68 (1.1%) missing values | Missing |
예산편성계획서 has 2867 (46.0%) missing values | Missing |
지출금액 has 2924 (46.9%) missing values | Missing |
사업추진집행률 has 6232 (100.0%) missing values | Missing |
집행기준일 has 2922 (46.9%) missing values | Missing |
비고 has 6005 (96.4%) missing values | Missing |
결과보고서 has 4678 (75.1%) missing values | Missing |
사업추진집행률 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
예산편성사업비 has 130 (2.1%) zeros | Zeros |
지출금액 has 91 (1.5%) zeros | Zeros |
Reproduction
Analysis started | 2024-05-11 06:19:56.243862 |
---|---|
Analysis finished | 2024-05-11 06:20:03.920874 |
Duration | 7.68 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
년도
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 12 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2017.733 |
Minimum | 2012 |
---|---|
Maximum | 2023 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 54.9 KiB |
Quantile statistics
Minimum | 2012 |
---|---|
5-th percentile | 2013 |
Q1 | 2016 |
median | 2018 |
Q3 | 2020 |
95-th percentile | 2021 |
Maximum | 2023 |
Range | 11 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.4661306 |
---|---|
Coefficient of variation (CV) | 0.0012222284 |
Kurtosis | -0.75240643 |
Mean | 2017.733 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.34547107 |
Sum | 12574512 |
Variance | 6.0818002 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2021 | 934 | |
2020 | 873 | |
2019 | 852 | |
2016 | 804 | |
2017 | 766 | |
2018 | 730 | |
2015 | 524 | |
2014 | 352 | 5.6% |
2013 | 223 | 3.6% |
2012 | 132 | 2.1% |
Other values (2) | 42 | 0.7% |
Value | Count | Frequency (%) |
2012 | 132 | 2.1% |
2013 | 223 | 3.6% |
2014 | 352 | 5.6% |
2015 | 524 | |
2016 | 804 | |
2017 | 766 | |
2018 | 730 | |
2019 | 852 | |
2020 | 873 | |
2021 | 934 |
Value | Count | Frequency (%) |
2023 | 29 | 0.5% |
2022 | 13 | 0.2% |
2021 | 934 | |
2020 | 873 | |
2019 | 852 | |
2018 | 730 | |
2017 | 766 | |
2016 | 804 | |
2015 | 524 | |
2014 | 352 | 5.6% |
제안번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 3453 |
---|---|
Distinct (%) | 55.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4833.7357 |
Minimum | 1 |
---|---|
Maximum | 8241 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 54.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 306.55 |
Q1 | 1983.5 |
median | 5044.5 |
Q3 | 7361 |
95-th percentile | 8115.45 |
Maximum | 8241 |
Range | 8240 |
Interquartile range (IQR) | 5377.5 |
Descriptive statistics
Standard deviation | 2813.5912 |
---|---|
Coefficient of variation (CV) | 0.58207387 |
Kurtosis | -1.4739889 |
Mean | 4833.7357 |
Median Absolute Deviation (MAD) | 2395.5 |
Skewness | -0.33866376 |
Sum | 30123841 |
Variance | 7916295.7 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7129 | 4 | 0.1% |
7012 | 4 | 0.1% |
7527 | 4 | 0.1% |
7526 | 4 | 0.1% |
7525 | 4 | 0.1% |
7524 | 4 | 0.1% |
7523 | 4 | 0.1% |
7522 | 4 | 0.1% |
7521 | 4 | 0.1% |
7520 | 4 | 0.1% |
Other values (3443) | 6192 |
Value | Count | Frequency (%) |
1 | 2 | |
4 | 1 | |
7 | 1 | |
9 | 1 | |
13 | 2 | |
14 | 1 | |
19 | 1 | |
21 | 1 | |
29 | 1 | |
30 | 1 |
Value | Count | Frequency (%) |
8241 | 1 | |
8240 | 1 | |
8239 | 1 | |
8238 | 1 | |
8237 | 1 | |
8236 | 1 | |
8235 | 1 | |
8234 | 1 | |
8233 | 1 | |
8232 | 1 |
사업명
Text
Distinct | 5858 |
---|---|
Distinct (%) | 94.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 48.8 KiB |
Length
Max length | 79 |
---|---|
Median length | 58 |
Mean length | 18.179076 |
Min length | 3 |
Characters and Unicode
Total characters | 113292 |
---|---|
Distinct characters | 1139 |
Distinct categories | 15 ? |
Distinct scripts | 4 ? |
Distinct blocks | 9 ? |
Unique
Unique | 5770 ? |
---|---|
Unique (%) | 92.6% |
Sample
1st row | 의류리폼센터 운영(시범) |
---|---|
2nd row | 사회적 약자의 복지향상을 위한 해외연수 운영 |
3rd row | 오동근린공원(월곡산) 철쭉동산 만들기 |
4th row | 공공기관 내 스마트 수돗물 수질관리 시스템 도입 설치 |
5th row | 공원으로 찾아오는 어린이 물놀이터 |
Value | Count | Frequency (%) |
및 | 398 | 1.5% |
설치 | 398 | 1.5% |
만들기 | 339 | 1.3% |
위한 | 330 | 1.2% |
조성 | 257 | 1.0% |
마을 | 227 | 0.9% |
사업 | 211 | 0.8% |
운영 | 204 | 0.8% |
함께 | 198 | 0.7% |
안전한 | 166 | 0.6% |
Other values (10870) | 23831 |
Most occurring characters
Value | Count | Frequency (%) |
20829 | 18.4% | |
이 | 1621 | 1.4% |
을 | 1604 | 1.4% |
기 | 1492 | 1.3% |
로 | 1373 | 1.2% |
마 | 1324 | 1.2% |
동 | 1318 | 1.2% |
한 | 1267 | 1.1% |
사 | 1185 | 1.0% |
리 | 1177 | 1.0% |
Other values (1129) | 80102 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 85764 | |
Space Separator | 20829 | 18.4% |
Other Punctuation | 1861 | 1.6% |
Uppercase Letter | 1067 | 0.9% |
Lowercase Letter | 865 | 0.8% |
Decimal Number | 759 | 0.7% |
Close Punctuation | 609 | 0.5% |
Open Punctuation | 607 | 0.5% |
Initial Punctuation | 281 | 0.2% |
Final Punctuation | 262 | 0.2% |
Other values (5) | 388 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 1621 | 1.9% |
을 | 1604 | 1.9% |
기 | 1492 | 1.7% |
로 | 1373 | 1.6% |
마 | 1324 | 1.5% |
동 | 1318 | 1.5% |
한 | 1267 | 1.5% |
사 | 1185 | 1.4% |
리 | 1177 | 1.4% |
는 | 1060 | 1.2% |
Other values (1027) | 72343 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 212 | |
T | 125 | |
V | 104 | |
E | 91 | 8.5% |
D | 70 | 6.6% |
O | 63 | 5.9% |
L | 62 | 5.8% |
A | 34 | 3.2% |
S | 33 | 3.1% |
M | 32 | 3.0% |
Other values (15) | 241 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 101 | |
o | 87 | 10.1% |
a | 74 | 8.6% |
t | 61 | 7.1% |
i | 57 | 6.6% |
r | 56 | 6.5% |
n | 56 | 6.5% |
l | 47 | 5.4% |
u | 45 | 5.2% |
c | 40 | 4.6% |
Other values (15) | 241 |
Other Punctuation
Value | Count | Frequency (%) |
! | 696 | |
, | 476 | |
' | 225 | 12.1% |
. | 193 | 10.4% |
? | 179 | 9.6% |
& | 30 | 1.6% |
: | 29 | 1.6% |
/ | 18 | 1.0% |
? | 5 | 0.3% |
… | 3 | 0.2% |
Other values (7) | 7 | 0.4% |
Decimal Number
Value | Count | Frequency (%) |
2 | 182 | |
1 | 168 | |
0 | 113 | |
3 | 92 | |
5 | 51 | 6.7% |
4 | 49 | 6.5% |
9 | 34 | 4.5% |
8 | 26 | 3.4% |
6 | 25 | 3.3% |
7 | 19 | 2.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 553 | |
」 | 24 | 3.9% |
』 | 18 | 3.0% |
] | 12 | 2.0% |
〕 | 1 | 0.2% |
) | 1 | 0.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 553 | |
「 | 24 | 4.0% |
『 | 17 | 2.8% |
[ | 12 | 2.0% |
〔 | 1 | 0.2% |
Math Symbol
Value | Count | Frequency (%) |
~ | 155 | |
> | 17 | 8.3% |
< | 17 | 8.3% |
+ | 14 | 6.9% |
↔ | 1 | 0.5% |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 156 | |
“ | 125 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 149 | |
” | 113 |
Space Separator
Value | Count | Frequency (%) |
20829 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 172 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 5 |
Modifier Symbol
Value | Count | Frequency (%) |
^ | 4 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 85661 | |
Common | 25593 | 22.6% |
Latin | 1935 | 1.7% |
Han | 103 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 1621 | 1.9% |
을 | 1604 | 1.9% |
기 | 1492 | 1.7% |
로 | 1373 | 1.6% |
마 | 1324 | 1.5% |
동 | 1318 | 1.5% |
한 | 1267 | 1.5% |
사 | 1185 | 1.4% |
리 | 1177 | 1.4% |
는 | 1060 | 1.2% |
Other values (967) | 72240 |
Han
Value | Count | Frequency (%) |
多 | 7 | 6.8% |
動 | 4 | 3.9% |
人 | 4 | 3.9% |
樂 | 4 | 3.9% |
詩 | 4 | 3.9% |
愛 | 4 | 3.9% |
三 | 3 | 2.9% |
通 | 3 | 2.9% |
手 | 3 | 2.9% |
洞 | 3 | 2.9% |
Other values (50) | 64 |
Common
Value | Count | Frequency (%) |
20829 | ||
! | 696 | 2.7% |
) | 553 | 2.2% |
( | 553 | 2.2% |
, | 476 | 1.9% |
' | 225 | 0.9% |
. | 193 | 0.8% |
2 | 182 | 0.7% |
? | 179 | 0.7% |
- | 172 | 0.7% |
Other values (41) | 1535 | 6.0% |
Latin
Value | Count | Frequency (%) |
C | 212 | 11.0% |
T | 125 | 6.5% |
V | 104 | 5.4% |
e | 101 | 5.2% |
E | 91 | 4.7% |
o | 87 | 4.5% |
a | 74 | 3.8% |
D | 70 | 3.6% |
O | 63 | 3.3% |
L | 62 | 3.2% |
Other values (41) | 946 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 85654 | |
ASCII | 26883 | 23.7% |
Punctuation | 547 | 0.5% |
CJK | 97 | 0.1% |
None | 94 | 0.1% |
Compat Jamo | 7 | < 0.1% |
CJK Compat Ideographs | 6 | < 0.1% |
Number Forms | 3 | < 0.1% |
Arrows | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
20829 | ||
! | 696 | 2.6% |
) | 553 | 2.1% |
( | 553 | 2.1% |
, | 476 | 1.8% |
' | 225 | 0.8% |
C | 212 | 0.8% |
. | 193 | 0.7% |
2 | 182 | 0.7% |
? | 179 | 0.7% |
Other values (73) | 2785 | 10.4% |
Hangul
Value | Count | Frequency (%) |
이 | 1621 | 1.9% |
을 | 1604 | 1.9% |
기 | 1492 | 1.7% |
로 | 1373 | 1.6% |
마 | 1324 | 1.5% |
동 | 1318 | 1.5% |
한 | 1267 | 1.5% |
사 | 1185 | 1.4% |
리 | 1177 | 1.4% |
는 | 1060 | 1.2% |
Other values (965) | 72233 |
Punctuation
Value | Count | Frequency (%) |
‘ | 156 | |
’ | 149 | |
“ | 125 | |
” | 113 | |
… | 3 | 0.5% |
※ | 1 | 0.2% |
None
Value | Count | Frequency (%) |
」 | 24 | |
「 | 24 | |
』 | 18 | |
『 | 17 | |
? | 5 | 5.3% |
¡ | 1 | 1.1% |
〕 | 1 | 1.1% |
〔 | 1 | 1.1% |
! | 1 | 1.1% |
" | 1 | 1.1% |
CJK
Value | Count | Frequency (%) |
多 | 7 | 7.2% |
動 | 4 | 4.1% |
人 | 4 | 4.1% |
詩 | 4 | 4.1% |
愛 | 4 | 4.1% |
三 | 3 | 3.1% |
通 | 3 | 3.1% |
手 | 3 | 3.1% |
洞 | 3 | 3.1% |
情 | 3 | 3.1% |
Other values (47) | 59 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 5 | |
ㅜ | 2 | 28.6% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
樂 | 4 | |
樂 | 1 | 16.7% |
女 | 1 | 16.7% |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 3 |
Arrows
Value | Count | Frequency (%) |
↔ | 1 |
예산편성사업명
Text
MISSING
 
Distinct | 1866 |
---|---|
Distinct (%) | 30.3% |
Missing | 68 |
Missing (%) | 1.1% |
Memory size | 48.8 KiB |
Length
Max length | 64 |
---|---|
Median length | 58 |
Mean length | 19.146982 |
Min length | 6 |
Characters and Unicode
Total characters | 118022 |
---|---|
Distinct characters | 783 |
Distinct categories | 13 ? |
Distinct scripts | 4 ? |
Distinct blocks | 7 ? |
Unique
Unique | 1746 ? |
---|---|
Unique (%) | 28.3% |
Sample
1st row | 저소득 어르신 보청기 지원 시범사업 |
---|---|
2nd row | 북악하늘길 산책로 정비 |
3rd row | 야간 자전거 안전운행 유도디자인 고도화 |
4th row | 공원 환경보호를 위한 행동유도 안내사인 디자인 |
5th row | 응봉근린공원(금호산) 무장애 산책로 조성 |
Value | Count | Frequency (%) |
계획형 | 2691 | 12.1% |
시민참여예산 | 2051 | 9.3% |
동단위 | 1967 | 8.9% |
구단위 | 724 | 3.3% |
시민참여예산(시민참여 | 641 | 2.9% |
지원 | 574 | 2.6% |
지원사업 | 544 | 2.5% |
동단위계획형 | 542 | 2.4% |
시민참여 | 372 | 1.7% |
주민참여 | 330 | 1.5% |
Other values (4077) | 11727 |
Most occurring characters
Value | Count | Frequency (%) |
16057 | 13.6% | |
민 | 6196 | 5.2% |
여 | 6130 | 5.2% |
참 | 6050 | 5.1% |
시 | 5130 | 4.3% |
계 | 3792 | 3.2% |
( | 3675 | 3.1% |
) | 3675 | 3.1% |
획 | 3660 | 3.1% |
위 | 3604 | 3.1% |
Other values (773) | 60053 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 92604 | |
Space Separator | 16057 | 13.6% |
Open Punctuation | 3684 | 3.1% |
Close Punctuation | 3684 | 3.1% |
Decimal Number | 656 | 0.6% |
Other Punctuation | 484 | 0.4% |
Uppercase Letter | 454 | 0.4% |
Lowercase Letter | 216 | 0.2% |
Math Symbol | 77 | 0.1% |
Dash Punctuation | 65 | 0.1% |
Other values (3) | 41 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
민 | 6196 | 6.7% |
여 | 6130 | 6.6% |
참 | 6050 | 6.5% |
시 | 5130 | 5.5% |
계 | 3792 | 4.1% |
획 | 3660 | 4.0% |
위 | 3604 | 3.9% |
단 | 3579 | 3.9% |
형 | 3504 | 3.8% |
동 | 3314 | 3.6% |
Other values (687) | 47645 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 138 | |
T | 77 | |
V | 67 | |
D | 31 | 6.8% |
E | 29 | 6.4% |
L | 24 | 5.3% |
B | 11 | 2.4% |
O | 10 | 2.2% |
I | 9 | 2.0% |
A | 8 | 1.8% |
Other values (14) | 50 | 11.0% |
Lowercase Letter
Value | Count | Frequency (%) |
c | 54 | |
t | 36 | |
v | 27 | |
e | 18 | 8.3% |
o | 13 | 6.0% |
r | 13 | 6.0% |
a | 9 | 4.2% |
i | 7 | 3.2% |
s | 5 | 2.3% |
n | 5 | 2.3% |
Other values (12) | 29 |
Other Punctuation
Value | Count | Frequency (%) |
, | 319 | |
' | 71 | 14.7% |
! | 49 | 10.1% |
? | 24 | 5.0% |
. | 13 | 2.7% |
& | 3 | 0.6% |
* | 2 | 0.4% |
? | 1 | 0.2% |
※ | 1 | 0.2% |
: | 1 | 0.2% |
Decimal Number
Value | Count | Frequency (%) |
1 | 135 | |
2 | 113 | |
0 | 104 | |
5 | 79 | |
3 | 68 | |
7 | 47 | 7.2% |
4 | 38 | 5.8% |
6 | 34 | 5.2% |
8 | 20 | 3.0% |
9 | 18 | 2.7% |
Math Symbol
Value | Count | Frequency (%) |
+ | 40 | |
~ | 32 | |
= | 2 | 2.6% |
> | 2 | 2.6% |
↔ | 1 | 1.3% |
Open Punctuation
Value | Count | Frequency (%) |
( | 3675 | |
[ | 4 | 0.1% |
「 | 3 | 0.1% |
『 | 2 | 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 3675 | |
] | 4 | 0.1% |
」 | 3 | 0.1% |
』 | 2 | 0.1% |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 11 | |
“ | 10 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 11 | |
” | 7 |
Space Separator
Value | Count | Frequency (%) |
16057 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 65 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 92595 | |
Common | 24748 | 21.0% |
Latin | 670 | 0.6% |
Han | 9 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
민 | 6196 | 6.7% |
여 | 6130 | 6.6% |
참 | 6050 | 6.5% |
시 | 5130 | 5.5% |
계 | 3792 | 4.1% |
획 | 3660 | 4.0% |
위 | 3604 | 3.9% |
단 | 3579 | 3.9% |
형 | 3504 | 3.8% |
동 | 3314 | 3.6% |
Other values (679) | 47636 |
Latin
Value | Count | Frequency (%) |
C | 138 | |
T | 77 | |
V | 67 | 10.0% |
c | 54 | 8.1% |
t | 36 | 5.4% |
D | 31 | 4.6% |
E | 29 | 4.3% |
v | 27 | 4.0% |
L | 24 | 3.6% |
e | 18 | 2.7% |
Other values (36) | 169 |
Common
Value | Count | Frequency (%) |
16057 | ||
( | 3675 | 14.8% |
) | 3675 | 14.8% |
, | 319 | 1.3% |
1 | 135 | 0.5% |
2 | 113 | 0.5% |
0 | 104 | 0.4% |
5 | 79 | 0.3% |
' | 71 | 0.3% |
3 | 68 | 0.3% |
Other values (30) | 452 | 1.8% |
Han
Value | Count | Frequency (%) |
宿 | 2 | |
多 | 1 | |
樂 | 1 | |
人 | 1 | |
通 | 1 | |
溫 | 1 | |
夜 | 1 | |
號 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 92595 | |
ASCII | 25366 | 21.5% |
Punctuation | 40 | < 0.1% |
None | 11 | < 0.1% |
CJK | 8 | < 0.1% |
Arrows | 1 | < 0.1% |
CJK Compat Ideographs | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
16057 | ||
( | 3675 | 14.5% |
) | 3675 | 14.5% |
, | 319 | 1.3% |
C | 138 | 0.5% |
1 | 135 | 0.5% |
2 | 113 | 0.4% |
0 | 104 | 0.4% |
5 | 79 | 0.3% |
T | 77 | 0.3% |
Other values (65) | 994 | 3.9% |
Hangul
Value | Count | Frequency (%) |
민 | 6196 | 6.7% |
여 | 6130 | 6.6% |
참 | 6050 | 6.5% |
시 | 5130 | 5.5% |
계 | 3792 | 4.1% |
획 | 3660 | 4.0% |
위 | 3604 | 3.9% |
단 | 3579 | 3.9% |
형 | 3504 | 3.8% |
동 | 3314 | 3.6% |
Other values (679) | 47636 |
Punctuation
Value | Count | Frequency (%) |
‘ | 11 | |
’ | 11 | |
“ | 10 | |
” | 7 | |
※ | 1 | 2.5% |
None
Value | Count | Frequency (%) |
「 | 3 | |
」 | 3 | |
』 | 2 | |
『 | 2 | |
? | 1 | 9.1% |
CJK
Value | Count | Frequency (%) |
宿 | 2 | |
多 | 1 | |
人 | 1 | |
通 | 1 | |
溫 | 1 | |
夜 | 1 | |
號 | 1 |
Arrows
Value | Count | Frequency (%) |
↔ | 1 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
樂 | 1 |
예산편성사업비
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 1211 |
---|---|
Distinct (%) | 19.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 85437895 |
Minimum | 0 |
---|---|
Maximum | 3.75 × 109 |
Zeros | 130 |
Zeros (%) | 2.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 54.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 2000000 |
Q1 | 5000000 |
median | 20000000 |
Q3 | 85000000 |
95-th percentile | 3.4 × 108 |
Maximum | 3.75 × 109 |
Range | 3.75 × 109 |
Interquartile range (IQR) | 80000000 |
Descriptive statistics
Standard deviation | 2.0611189 × 108 |
---|---|
Coefficient of variation (CV) | 2.4124177 |
Kurtosis | 86.31088 |
Mean | 85437895 |
Median Absolute Deviation (MAD) | 17000000 |
Skewness | 7.6614333 |
Sum | 5.3244896 × 1011 |
Variance | 4.2482113 × 1016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5000000 | 389 | 6.2% |
10000000 | 311 | 5.0% |
50000000 | 274 | 4.4% |
100000000 | 247 | 4.0% |
3000000 | 243 | 3.9% |
20000000 | 239 | 3.8% |
30000000 | 215 | 3.4% |
200000000 | 172 | 2.8% |
4000000 | 165 | 2.6% |
0 | 130 | 2.1% |
Other values (1201) | 3847 |
Value | Count | Frequency (%) |
0 | 130 | |
500000 | 8 | 0.1% |
565000 | 1 | < 0.1% |
600000 | 2 | < 0.1% |
687000 | 1 | < 0.1% |
700000 | 3 | < 0.1% |
710000 | 1 | < 0.1% |
725000 | 1 | < 0.1% |
750000 | 1 | < 0.1% |
800000 | 1 | < 0.1% |
Value | Count | Frequency (%) |
3750000000 | 1 | |
3450000000 | 1 | |
3368000000 | 1 | |
2980000000 | 1 | |
2950000000 | 1 | |
2770000000 | 1 | |
2760000000 | 1 | |
2520000000 | 1 | |
2500000000 | 1 | |
2400000000 | 1 |
사업위치
Categorical
Distinct | 31 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 48.8 KiB |
도봉구 | 419 |
---|---|
동작구 | 412 |
성동구 | 402 |
노원구 | 310 |
금천구 | 301 |
Other values (26) |
Length
Max length | 5 |
---|---|
Median length | 3 |
Mean length | 3.137516 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
도봉구 | 419 | 6.7% |
동작구 | 412 | 6.6% |
성동구 | 402 | 6.5% |
노원구 | 310 | 5.0% |
금천구 | 301 | 4.8% |
성북구 | 298 | 4.8% |
강서구 | 293 | 4.7% |
동대문구 | 277 | 4.4% |
관악구 | 266 | 4.3% |
영등포구 | 254 | 4.1% |
Other values (21) | 3000 |
Length
Value | Count | Frequency (%) |
도봉구 | 419 | 6.7% |
동작구 | 415 | 6.7% |
성동구 | 402 | 6.5% |
노원구 | 310 | 5.0% |
금천구 | 301 | 4.8% |
성북구 | 298 | 4.8% |
강서구 | 293 | 4.7% |
동대문구 | 277 | 4.4% |
관악구 | 266 | 4.3% |
영등포구 | 254 | 4.1% |
Other values (19) | 2997 |
예산편성계획서
Text
MISSING
 
Distinct | 2224 |
---|---|
Distinct (%) | 66.1% |
Missing | 2867 |
Missing (%) | 46.0% |
Memory size | 48.8 KiB |
Length
Max length | 178 |
---|---|
Median length | 172 |
Mean length | 135.76256 |
Min length | 120 |
Characters and Unicode
Total characters | 456841 |
---|---|
Distinct characters | 734 |
Distinct categories | 13 ? |
Distinct scripts | 4 ? |
Distinct blocks | 5 ? |
Unique
Unique | 2219 ? |
---|---|
Unique (%) | 65.9% |
Sample
1st row | http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2023/SL_C00248_202307190128141830&n=시민건강국 보건의료정책과_저소득 어르신 보청기 지원 시범사업(101774).hwp |
---|---|
2nd row | http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2023/SL_C00153_202307210225370020&n=북악하늘길 산책로 정비(시민참여)(101776).hwp |
3rd row | http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2023/SL_C00493_202307280122300560&n=1. 사업계획서- 미래한강본부 시설관리과_야간 자전거 안전운행 유도디자인 고도화(101768).hwp |
4th row | http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2023/SL_C00492_202307210405155540&n=1. 예산편성 사업계획서.hwp |
5th row | http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2023/SL_C00471_202307210225527170&n=응봉근린공원(금호산) 무장애 산책로 조성(시민참여)(101777).hwp |
Value | Count | Frequency (%) |
계획형 | 982 | 7.3% |
시민참여예산(시민참여).pdf | 641 | 4.8% |
http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attfile/upload/2017/sl_c_dong_common&n=구단위 | 361 | 2.7% |
지원(시민참여).pdf | 360 | 2.7% |
http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attfile/upload/2017/sl_c03750_201710250626558500&n=동단위 | 341 | 2.5% |
시민참여예산 | 341 | 2.5% |
주민참여).pdf | 285 | 2.1% |
및 | 264 | 2.0% |
http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attfile/upload/2017/sl_c_gu_common&n=구단위 | 176 | 1.3% |
http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attfile/upload/2017/sl_c04509_201710260127090370&n=동 | 164 | 1.2% |
Other values (4935) | 9518 |
Most occurring characters
Value | Count | Frequency (%) |
/ | 30285 | 6.6% |
e | 23570 | 5.2% |
0 | 22113 | 4.8% |
t | 20222 | 4.4% |
a | 20198 | 4.4% |
s | 20195 | 4.4% |
n | 20195 | 4.4% |
p | 20191 | 4.4% |
. | 16833 | 3.7% |
o | 13473 | 2.9% |
Other values (724) | 249566 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 212193 | |
Decimal Number | 79188 | 17.3% |
Other Letter | 57967 | 12.7% |
Other Punctuation | 57634 | 12.6% |
Uppercase Letter | 18827 | 4.1% |
Space Separator | 10071 | 2.2% |
Connector Punctuation | 7278 | 1.6% |
Math Symbol | 6799 | 1.5% |
Open Punctuation | 3404 | 0.7% |
Close Punctuation | 3402 | 0.7% |
Other values (3) | 78 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
민 | 4446 | 7.7% |
여 | 4387 | 7.6% |
참 | 4315 | 7.4% |
시 | 3251 | 5.6% |
주 | 2109 | 3.6% |
구 | 1332 | 2.3% |
지 | 1270 | 2.2% |
산 | 1174 | 2.0% |
계 | 1114 | 1.9% |
위 | 1105 | 1.9% |
Other values (645) | 33464 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 23570 | |
t | 20222 | |
a | 20198 | |
s | 20195 | |
n | 20195 | |
p | 20191 | |
o | 13473 | 6.3% |
l | 13462 | 6.3% |
d | 10066 | 4.7% |
i | 6736 | 3.2% |
Other values (13) | 43885 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 4009 | |
L | 3382 | |
F | 3369 | |
S | 3368 | |
O | 1444 | 7.7% |
M | 1080 | 5.7% |
N | 900 | 4.8% |
G | 544 | 2.9% |
D | 384 | 2.0% |
U | 178 | 0.9% |
Other values (12) | 169 | 0.9% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 30285 | |
. | 16833 | |
? | 3387 | 5.9% |
& | 3366 | 5.8% |
: | 3365 | 5.8% |
, | 303 | 0.5% |
' | 50 | 0.1% |
! | 42 | 0.1% |
? | 2 | < 0.1% |
※ | 1 | < 0.1% |
Decimal Number
Value | Count | Frequency (%) |
0 | 22113 | |
1 | 13344 | |
2 | 12882 | |
7 | 8164 | 10.3% |
5 | 5236 | 6.6% |
3 | 4124 | 5.2% |
9 | 3786 | 4.8% |
4 | 3286 | 4.1% |
6 | 3191 | 4.0% |
8 | 3062 | 3.9% |
Math Symbol
Value | Count | Frequency (%) |
= | 6730 | |
+ | 40 | 0.6% |
~ | 29 | 0.4% |
Open Punctuation
Value | Count | Frequency (%) |
( | 3401 | |
「 | 3 | 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 3399 | |
」 | 3 | 0.1% |
Final Punctuation
Value | Count | Frequency (%) |
” | 2 | |
’ | 1 |
Initial Punctuation
Value | Count | Frequency (%) |
“ | 2 | |
‘ | 1 |
Space Separator
Value | Count | Frequency (%) |
10071 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 7278 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 72 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 231020 | |
Common | 167854 | |
Hangul | 57960 | 12.7% |
Han | 7 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
민 | 4446 | 7.7% |
여 | 4387 | 7.6% |
참 | 4315 | 7.4% |
시 | 3251 | 5.6% |
주 | 2109 | 3.6% |
구 | 1332 | 2.3% |
지 | 1270 | 2.2% |
산 | 1174 | 2.0% |
계 | 1114 | 1.9% |
위 | 1105 | 1.9% |
Other values (639) | 33457 |
Latin
Value | Count | Frequency (%) |
e | 23570 | 10.2% |
t | 20222 | 8.8% |
a | 20198 | 8.7% |
s | 20195 | 8.7% |
n | 20195 | 8.7% |
p | 20191 | 8.7% |
o | 13473 | 5.8% |
l | 13462 | 5.8% |
d | 10066 | 4.4% |
i | 6736 | 2.9% |
Other values (35) | 62712 |
Common
Value | Count | Frequency (%) |
/ | 30285 | |
0 | 22113 | |
. | 16833 | |
1 | 13344 | 7.9% |
2 | 12882 | 7.7% |
10071 | 6.0% | |
7 | 8164 | 4.9% |
_ | 7278 | 4.3% |
= | 6730 | 4.0% |
5 | 5236 | 3.1% |
Other values (24) | 34918 |
Han
Value | Count | Frequency (%) |
宿 | 2 | |
多 | 1 | |
溫 | 1 | |
人 | 1 | |
夜 | 1 | |
號 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 398859 | |
Hangul | 57960 | 12.7% |
None | 8 | < 0.1% |
Punctuation | 7 | < 0.1% |
CJK | 7 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
/ | 30285 | 7.6% |
e | 23570 | 5.9% |
0 | 22113 | 5.5% |
t | 20222 | 5.1% |
a | 20198 | 5.1% |
s | 20195 | 5.1% |
n | 20195 | 5.1% |
p | 20191 | 5.1% |
. | 16833 | 4.2% |
o | 13473 | 3.4% |
Other values (61) | 191584 |
Hangul
Value | Count | Frequency (%) |
민 | 4446 | 7.7% |
여 | 4387 | 7.6% |
참 | 4315 | 7.4% |
시 | 3251 | 5.6% |
주 | 2109 | 3.6% |
구 | 1332 | 2.3% |
지 | 1270 | 2.2% |
산 | 1174 | 2.0% |
계 | 1114 | 1.9% |
위 | 1105 | 1.9% |
Other values (639) | 33457 |
None
Value | Count | Frequency (%) |
「 | 3 | |
」 | 3 | |
? | 2 |
Punctuation
Value | Count | Frequency (%) |
” | 2 | |
“ | 2 | |
‘ | 1 | |
’ | 1 | |
※ | 1 |
CJK
Value | Count | Frequency (%) |
宿 | 2 | |
多 | 1 | |
溫 | 1 | |
人 | 1 | |
夜 | 1 | |
號 | 1 |
지출금액
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 966 |
---|---|
Distinct (%) | 29.2% |
Missing | 2924 |
Missing (%) | 46.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1471017 × 108 |
Minimum | 0 |
---|---|
Maximum | 2.537692 × 109 |
Zeros | 91 |
Zeros (%) | 1.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 54.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 2500000 |
Q1 | 16000000 |
median | 50000000 |
Q3 | 1.3262225 × 108 |
95-th percentile | 4.300945 × 108 |
Maximum | 2.537692 × 109 |
Range | 2.537692 × 109 |
Interquartile range (IQR) | 1.1662225 × 108 |
Descriptive statistics
Standard deviation | 2.0201637 × 108 |
---|---|
Coefficient of variation (CV) | 1.7611026 |
Kurtosis | 43.409583 |
Mean | 1.1471017 × 108 |
Median Absolute Deviation (MAD) | 42000000 |
Skewness | 5.4075519 |
Sum | 3.7946123 × 1011 |
Variance | 4.0810613 × 1016 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
50000000 | 204 | 3.3% |
100000000 | 165 | 2.6% |
20000000 | 163 | 2.6% |
30000000 | 146 | 2.3% |
10000000 | 143 | 2.3% |
5000000 | 131 | 2.1% |
200000000 | 105 | 1.7% |
0 | 91 | 1.5% |
300000000 | 65 | 1.0% |
40000000 | 65 | 1.0% |
Other values (956) | 2030 | |
(Missing) | 2924 |
Value | Count | Frequency (%) |
0 | 91 | |
200000 | 1 | < 0.1% |
400000 | 1 | < 0.1% |
500000 | 4 | 0.1% |
710000 | 1 | < 0.1% |
725000 | 1 | < 0.1% |
816000 | 1 | < 0.1% |
900000 | 3 | < 0.1% |
960000 | 1 | < 0.1% |
1000000 | 17 | 0.3% |
Value | Count | Frequency (%) |
2537691980 | 1 | |
2520000000 | 1 | |
2400000000 | 1 | |
2398088430 | 1 | |
2117417000 | 1 | |
2100000000 | 1 | |
2000000000 | 1 | |
1977156000 | 1 | |
1900000000 | 1 | |
1800000000 | 1 |
사업추진단계
Categorical
IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 48.8 KiB |
<NA> | |
---|---|
완료 | |
추진중 | 137 |
미집행 | 69 |
발주 | 3 |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.0198973 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 3073 | |
완료 | 2948 | |
추진중 | 137 | 2.2% |
미집행 | 69 | 1.1% |
발주 | 3 | < 0.1% |
계획수립 | 2 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 3073 | |
완료 | 2948 | |
추진중 | 137 | 2.2% |
미집행 | 69 | 1.1% |
발주 | 3 | < 0.1% |
계획수립 | 2 | < 0.1% |
사업추진집행률
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 6232 |
---|---|
Missing (%) | 100.0% |
Memory size | 54.9 KiB |
집행기준일
Text
MISSING
 
Distinct | 144 |
---|---|
Distinct (%) | 4.4% |
Missing | 2922 |
Missing (%) | 46.9% |
Memory size | 48.8 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.9978852 |
Min length | 3 |
Characters and Unicode
Total characters | 33093 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 73 ? |
---|---|
Unique (%) | 2.2% |
Sample
1st row | 2023-11-30 |
---|---|
2nd row | 2023-11-29 |
3rd row | 2023-07-28 |
4th row | 2023-07-21 |
5th row | 2023-11-29 |
Value | Count | Frequency (%) |
2017-12-31 | 799 | |
2018-01-15 | 730 | |
2016-12-31 | 521 | |
2015-12-31 | 357 | |
2014-12-31 | 218 | 6.6% |
2019-06-30 | 166 | 5.0% |
2013-12-31 | 132 | 4.0% |
2019-12-19 | 31 | 0.9% |
2019-10-04 | 18 | 0.5% |
2019-12-16 | 12 | 0.4% |
Other values (134) | 326 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 9199 | |
- | 6618 | |
2 | 5740 | |
0 | 4873 | |
3 | 2412 | 7.3% |
5 | 1122 | 3.4% |
7 | 833 | 2.5% |
8 | 783 | 2.4% |
6 | 777 | 2.3% |
9 | 478 | 1.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 26475 | |
Dash Punctuation | 6618 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 9199 | |
2 | 5740 | |
0 | 4873 | |
3 | 2412 | 9.1% |
5 | 1122 | 4.2% |
7 | 833 | 3.1% |
8 | 783 | 3.0% |
6 | 777 | 2.9% |
9 | 478 | 1.8% |
4 | 258 | 1.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 6618 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 33093 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 9199 | |
- | 6618 | |
2 | 5740 | |
0 | 4873 | |
3 | 2412 | 7.3% |
5 | 1122 | 3.4% |
7 | 833 | 2.5% |
8 | 783 | 2.4% |
6 | 777 | 2.3% |
9 | 478 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 33093 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 9199 | |
- | 6618 | |
2 | 5740 | |
0 | 4873 | |
3 | 2412 | 7.3% |
5 | 1122 | 3.4% |
7 | 833 | 2.5% |
8 | 783 | 2.4% |
6 | 777 | 2.3% |
9 | 478 | 1.4% |
비고
Text
MISSING
 
Distinct | 203 |
---|---|
Distinct (%) | 89.4% |
Missing | 6005 |
Missing (%) | 96.4% |
Memory size | 48.8 KiB |
Length
Max length | 371 |
---|---|
Median length | 107 |
Mean length | 52.85022 |
Min length | 2 |
Characters and Unicode
Total characters | 11997 |
---|---|
Distinct characters | 464 |
Distinct categories | 13 ? |
Distinct scripts | 3 ? |
Distinct blocks | 8 ? |
Unique
Unique | 191 ? |
---|---|
Unique (%) | 84.1% |
Sample
1st row | 대상자 선정-의원검진-보청기 구입-적합성 평가-보청기 지원 확정-지원금 지급 과정으로 1~2개월이상 소요됨 자치구 예산집행 완료(2023. 12월중 예정) |
---|---|
2nd row | 현재 공사 진행중으로 연도 내 준공 예정임 |
3rd row | 설계 완료하였으며, 세부 추진계획 수립후 사업 시행중에 있음. -한강공원 저지대 침수, 안정적인 전기 인입(사용), 물품 유지보수 용이성 등 검토로 사업 지연 -2023년 11월 계획수립 완료후 공사 및 관급자재에 대하여 계약부서에 계약 의뢰중에 있음. |
4th row | 특이사항 : '22년 디자인거버넌스 사업으로 개발된 ‘올바른 공원 이용을 유도하는 서비스디자인’ 확산 추진 향후집행계획 : 효과성이 검증된 디자인 결과물을 고도화하여 대상지 맞춤형 디자인 적용 예정 사업부진사유 : 장소 기반 증강현실(AR) 기술구현 및 대상지 부서 협의 등 면밀한 사전검토로 인한 사업추진 지연 |
5th row | 공사 진행 중으로 연내 준공 예정임 |
Value | Count | Frequency (%) |
78 | 3.0% | |
및 | 48 | 1.9% |
사업 | 38 | 1.5% |
사업으로 | 24 | 0.9% |
예정 | 24 | 0.9% |
등 | 22 | 0.9% |
추진 | 19 | 0.7% |
자치구 | 19 | 0.7% |
설치 | 18 | 0.7% |
따른 | 17 | 0.7% |
Other values (1439) | 2262 |
Most occurring characters
Value | Count | Frequency (%) |
2437 | 20.3% | |
0 | 373 | 3.1% |
사 | 320 | 2.7% |
1 | 255 | 2.1% |
업 | 213 | 1.8% |
, | 209 | 1.7% |
2 | 206 | 1.7% |
로 | 179 | 1.5% |
행 | 148 | 1.2% |
( | 134 | 1.1% |
Other values (454) | 7523 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 7460 | |
Space Separator | 2437 | 20.3% |
Decimal Number | 1239 | 10.3% |
Other Punctuation | 418 | 3.5% |
Open Punctuation | 135 | 1.1% |
Close Punctuation | 135 | 1.1% |
Uppercase Letter | 54 | 0.5% |
Dash Punctuation | 49 | 0.4% |
Math Symbol | 45 | 0.4% |
Final Punctuation | 7 | 0.1% |
Other values (3) | 18 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
사 | 320 | 4.3% |
업 | 213 | 2.9% |
로 | 179 | 2.4% |
행 | 148 | 2.0% |
정 | 131 | 1.8% |
구 | 128 | 1.7% |
이 | 117 | 1.6% |
으 | 116 | 1.6% |
시 | 116 | 1.6% |
지 | 110 | 1.5% |
Other values (402) | 5882 |
Uppercase Letter
Value | Count | Frequency (%) |
O | 10 | |
C | 8 | |
L | 6 | |
D | 5 | |
T | 5 | |
E | 5 | |
V | 4 | 7.4% |
I | 3 | 5.6% |
B | 2 | 3.7% |
M | 1 | 1.9% |
Other values (5) | 5 |
Decimal Number
Value | Count | Frequency (%) |
0 | 373 | |
1 | 255 | |
2 | 206 | |
5 | 80 | 6.5% |
9 | 74 | 6.0% |
3 | 59 | 4.8% |
6 | 49 | 4.0% |
4 | 49 | 4.0% |
7 | 47 | 3.8% |
8 | 47 | 3.8% |
Other Punctuation
Value | Count | Frequency (%) |
, | 209 | |
. | 120 | |
: | 48 | 11.5% |
' | 17 | 4.1% |
% | 12 | 2.9% |
※ | 4 | 1.0% |
/ | 4 | 1.0% |
? | 2 | 0.5% |
? | 1 | 0.2% |
* | 1 | 0.2% |
Math Symbol
Value | Count | Frequency (%) |
~ | 27 | |
> | 10 | 22.2% |
→ | 6 | 13.3% |
= | 2 | 4.4% |
Other Symbol
Value | Count | Frequency (%) |
○ | 4 | |
㎡ | 2 | |
□ | 1 | 14.3% |
Open Punctuation
Value | Count | Frequency (%) |
( | 134 | |
[ | 1 | 0.7% |
Close Punctuation
Value | Count | Frequency (%) |
) | 134 | |
] | 1 | 0.7% |
Lowercase Letter
Value | Count | Frequency (%) |
o | 3 | |
m | 3 |
Space Separator
Value | Count | Frequency (%) |
2437 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 49 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 7 |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 5 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 7459 | |
Common | 4477 | |
Latin | 61 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
사 | 320 | 4.3% |
업 | 213 | 2.9% |
로 | 179 | 2.4% |
행 | 148 | 2.0% |
정 | 131 | 1.8% |
구 | 128 | 1.7% |
이 | 117 | 1.6% |
으 | 116 | 1.6% |
시 | 116 | 1.6% |
지 | 110 | 1.5% |
Other values (401) | 5881 |
Common
Value | Count | Frequency (%) |
2437 | ||
0 | 373 | 8.3% |
1 | 255 | 5.7% |
, | 209 | 4.7% |
2 | 206 | 4.6% |
( | 134 | 3.0% |
) | 134 | 3.0% |
. | 120 | 2.7% |
5 | 80 | 1.8% |
9 | 74 | 1.7% |
Other values (25) | 455 | 10.2% |
Latin
Value | Count | Frequency (%) |
O | 10 | |
C | 8 | |
L | 6 | |
D | 5 | |
T | 5 | |
E | 5 | |
V | 4 | 6.6% |
I | 3 | 4.9% |
o | 3 | 4.9% |
m | 3 | 4.9% |
Other values (8) | 9 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 7458 | |
ASCII | 4507 | |
Punctuation | 16 | 0.1% |
Arrows | 6 | 0.1% |
Geometric Shapes | 5 | < 0.1% |
CJK Compat | 2 | < 0.1% |
None | 2 | < 0.1% |
Compat Jamo | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2437 | ||
0 | 373 | 8.3% |
1 | 255 | 5.7% |
, | 209 | 4.6% |
2 | 206 | 4.6% |
( | 134 | 3.0% |
) | 134 | 3.0% |
. | 120 | 2.7% |
5 | 80 | 1.8% |
9 | 74 | 1.6% |
Other values (34) | 485 | 10.8% |
Hangul
Value | Count | Frequency (%) |
사 | 320 | 4.3% |
업 | 213 | 2.9% |
로 | 179 | 2.4% |
행 | 148 | 2.0% |
정 | 131 | 1.8% |
구 | 128 | 1.7% |
이 | 117 | 1.6% |
으 | 116 | 1.6% |
시 | 116 | 1.6% |
지 | 110 | 1.5% |
Other values (400) | 5880 |
Punctuation
Value | Count | Frequency (%) |
’ | 7 | |
‘ | 5 | |
※ | 4 |
Arrows
Value | Count | Frequency (%) |
→ | 6 |
Geometric Shapes
Value | Count | Frequency (%) |
○ | 4 | |
□ | 1 | 20.0% |
CJK Compat
Value | Count | Frequency (%) |
㎡ | 2 |
None
Value | Count | Frequency (%) |
? | 1 | |
º | 1 |
Compat Jamo
Value | Count | Frequency (%) |
ㅇ | 1 |
결과보고서
Text
MISSING
 
Distinct | 1554 |
---|---|
Distinct (%) | 100.0% |
Missing | 4678 |
Missing (%) | 75.1% |
Memory size | 48.8 KiB |
Length
Max length | 176 |
---|---|
Median length | 164 |
Mean length | 135.35393 |
Min length | 118 |
Characters and Unicode
Total characters | 210340 |
---|---|
Distinct characters | 700 |
Distinct categories | 14 ? |
Distinct scripts | 4 ? |
Distinct blocks | 7 ? |
Unique
Unique | 1554 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2021/SL_R02096_202212060542588730&n=은행나무 그물망 설치사업 추진실적(도봉구).hwpx |
---|---|
2nd row | http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2023/SL_R00457_202303270709337040&n=공공임대주택 야외운동기구 설치 준공 보고.pdf |
3rd row | http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2021/SL_R00208_202211170133399750&n=2022년 전통시장 홍보 에코백 배포계획.pdf |
4th row | http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2021/SL_R01691_202212080956494550&n=현장 사진(가양나들목).zip |
5th row | http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2021/SL_R02110_202212060543308010&n=남부순환로 자전거도로 단절구간 연결사업 결과보고(남부순환로)_공사완료.hwp |
Value | Count | Frequency (%) |
및 | 181 | 2.8% |
설치.pdf | 91 | 1.4% |
정비사업.pdf | 69 | 1.1% |
조성.pdf | 49 | 0.8% |
설치 | 49 | 0.8% |
주민참여).pdf | 48 | 0.8% |
교통안전시설물 | 46 | 0.7% |
내 | 45 | 0.7% |
주변 | 42 | 0.7% |
정비.pdf | 41 | 0.6% |
Other values (3906) | 5730 |
Most occurring characters
Value | Count | Frequency (%) |
/ | 13986 | 6.6% |
0 | 12108 | 5.8% |
e | 10882 | 5.2% |
t | 9329 | 4.4% |
s | 9329 | 4.4% |
a | 9329 | 4.4% |
n | 9326 | 4.4% |
p | 9323 | 4.4% |
1 | 7986 | 3.8% |
. | 7849 | 3.7% |
Other values (690) | 110893 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 97960 | |
Decimal Number | 43187 | |
Other Punctuation | 26690 | 12.7% |
Other Letter | 23420 | 11.1% |
Uppercase Letter | 6482 | 3.1% |
Space Separator | 4839 | 2.3% |
Connector Punctuation | 3180 | 1.5% |
Math Symbol | 3135 | 1.5% |
Open Punctuation | 656 | 0.3% |
Close Punctuation | 654 | 0.3% |
Other values (4) | 137 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
보 | 595 | 2.5% |
구 | 587 | 2.5% |
사 | 533 | 2.3% |
설 | 513 | 2.2% |
정 | 482 | 2.1% |
시 | 473 | 2.0% |
공 | 419 | 1.8% |
고 | 412 | 1.8% |
로 | 410 | 1.8% |
업 | 393 | 1.7% |
Other values (607) | 18603 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 10882 | |
t | 9329 | |
s | 9329 | |
a | 9329 | |
n | 9326 | |
p | 9323 | |
o | 6223 | 6.4% |
l | 6221 | 6.4% |
d | 4374 | 4.5% |
i | 3117 | 3.2% |
Other values (15) | 20507 |
Uppercase Letter
Value | Count | Frequency (%) |
L | 1568 | |
F | 1558 | |
R | 1556 | |
S | 1556 | |
C | 87 | 1.3% |
T | 48 | 0.7% |
V | 43 | 0.7% |
D | 17 | 0.3% |
E | 15 | 0.2% |
I | 6 | 0.1% |
Other values (10) | 28 | 0.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 12108 | |
1 | 7986 | |
2 | 7487 | |
7 | 3343 | 7.7% |
8 | 2426 | 5.6% |
3 | 2290 | 5.3% |
5 | 2004 | 4.6% |
4 | 1933 | 4.5% |
6 | 1832 | 4.2% |
9 | 1778 | 4.1% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 13986 | |
. | 7849 | |
? | 1565 | 5.9% |
& | 1556 | 5.8% |
: | 1554 | 5.8% |
, | 126 | 0.5% |
' | 30 | 0.1% |
! | 23 | 0.1% |
% | 1 | < 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 623 | |
「 | 13 | 2.0% |
[ | 13 | 2.0% |
『 | 7 | 1.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 621 | |
」 | 13 | 2.0% |
] | 13 | 2.0% |
』 | 7 | 1.1% |
Math Symbol
Value | Count | Frequency (%) |
= | 3108 | |
~ | 21 | 0.7% |
+ | 6 | 0.2% |
Final Punctuation
Value | Count | Frequency (%) |
’ | 2 | |
” | 2 |
Initial Punctuation
Value | Count | Frequency (%) |
“ | 2 | |
‘ | 1 |
Space Separator
Value | Count | Frequency (%) |
4839 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 3180 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 129 |
Other Symbol
Value | Count | Frequency (%) |
★ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 104442 | |
Common | 82478 | |
Hangul | 23415 | 11.1% |
Han | 5 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
보 | 595 | 2.5% |
구 | 587 | 2.5% |
사 | 533 | 2.3% |
설 | 513 | 2.2% |
정 | 482 | 2.1% |
시 | 473 | 2.0% |
공 | 419 | 1.8% |
고 | 412 | 1.8% |
로 | 410 | 1.8% |
업 | 393 | 1.7% |
Other values (602) | 18598 |
Latin
Value | Count | Frequency (%) |
e | 10882 | |
t | 9329 | 8.9% |
s | 9329 | 8.9% |
a | 9329 | 8.9% |
n | 9326 | 8.9% |
p | 9323 | 8.9% |
o | 6223 | 6.0% |
l | 6221 | 6.0% |
d | 4374 | 4.2% |
i | 3117 | 3.0% |
Other values (35) | 26989 |
Common
Value | Count | Frequency (%) |
/ | 13986 | |
0 | 12108 | |
1 | 7986 | |
. | 7849 | |
2 | 7487 | |
4839 | 5.9% | |
7 | 3343 | 4.1% |
_ | 3180 | 3.9% |
= | 3108 | 3.8% |
8 | 2426 | 2.9% |
Other values (28) | 16166 |
Han
Value | Count | Frequency (%) |
多 | 1 | |
樂 | 1 | |
場 | 1 | |
署 | 1 | |
宿 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 186872 | |
Hangul | 23415 | 11.1% |
None | 40 | < 0.1% |
Punctuation | 7 | < 0.1% |
CJK | 4 | < 0.1% |
CJK Compat Ideographs | 1 | < 0.1% |
Misc Symbols | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
/ | 13986 | 7.5% |
0 | 12108 | 6.5% |
e | 10882 | 5.8% |
t | 9329 | 5.0% |
s | 9329 | 5.0% |
a | 9329 | 5.0% |
n | 9326 | 5.0% |
p | 9323 | 5.0% |
1 | 7986 | 4.3% |
. | 7849 | 4.2% |
Other values (64) | 87425 |
Hangul
Value | Count | Frequency (%) |
보 | 595 | 2.5% |
구 | 587 | 2.5% |
사 | 533 | 2.3% |
설 | 513 | 2.2% |
정 | 482 | 2.1% |
시 | 473 | 2.0% |
공 | 419 | 1.8% |
고 | 412 | 1.8% |
로 | 410 | 1.8% |
업 | 393 | 1.7% |
Other values (602) | 18598 |
None
Value | Count | Frequency (%) |
」 | 13 | |
「 | 13 | |
『 | 7 | |
』 | 7 |
Punctuation
Value | Count | Frequency (%) |
’ | 2 | |
” | 2 | |
“ | 2 | |
‘ | 1 |
CJK
Value | Count | Frequency (%) |
多 | 1 | |
場 | 1 | |
署 | 1 | |
宿 | 1 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
樂 | 1 |
Misc Symbols
Value | Count | Frequency (%) |
★ | 1 |
년도 | 제안번호 | 예산편성사업비 | 사업위치 | 지출금액 | 사업추진단계 | |
---|---|---|---|---|---|---|
년도 | 1.000 | 0.840 | 0.323 | 0.420 | 0.431 | 0.746 |
제안번호 | 0.840 | 1.000 | 0.297 | 0.574 | 0.286 | 0.345 |
예산편성사업비 | 0.323 | 0.297 | 1.000 | 0.295 | 0.956 | 0.375 |
사업위치 | 0.420 | 0.574 | 0.295 | 1.000 | 0.301 | 0.376 |
지출금액 | 0.431 | 0.286 | 0.956 | 0.301 | 1.000 | 0.343 |
사업추진단계 | 0.746 | 0.345 | 0.375 | 0.376 | 0.343 | 1.000 |
사업위치 | 사업추진단계 | |
---|---|---|
사업위치 | 1.000 | 0.191 |
사업추진단계 | 0.191 | 1.000 |
년도 | 제안번호 | 예산편성사업비 | 지출금액 | 사업위치 | 사업추진단계 | |
---|---|---|---|---|---|---|
년도 | 1.000 | 0.741 | -0.389 | -0.199 | 0.145 | 0.403 |
제안번호 | 0.741 | 1.000 | -0.362 | -0.399 | 0.219 | 0.150 |
예산편성사업비 | -0.389 | -0.362 | 1.000 | 0.964 | 0.113 | 0.164 |
지출금액 | -0.199 | -0.399 | 0.964 | 1.000 | 0.113 | 0.149 |
사업위치 | 0.145 | 0.219 | 0.113 | 0.113 | 1.000 | 0.191 |
사업추진단계 | 0.403 | 0.150 | 0.164 | 0.149 | 0.191 | 1.000 |
년도 | 제안번호 | 사업명 | 예산편성사업명 | 예산편성사업비 | 사업위치 | 예산편성계획서 | 지출금액 | 사업추진단계 | 사업추진집행률 | 집행기준일 | 비고 | 결과보고서 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2023 | 332 | 의류리폼센터 운영(시범) | <NA> | 219508000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
1 | 2023 | 624 | 사회적 약자의 복지향상을 위한 해외연수 운영 | <NA> | 250000000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2 | 2023 | 264 | 오동근린공원(월곡산) 철쭉동산 만들기 | <NA> | 35000000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
3 | 2023 | 260 | 공공기관 내 스마트 수돗물 수질관리 시스템 도입 설치 | <NA> | 450000000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
4 | 2023 | 249 | 공원으로 찾아오는 어린이 물놀이터 | <NA> | 297000000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
5 | 2023 | 671 | 가정용 소형감량기 설치 지원사업 지원 | <NA> | 1000000000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
6 | 2023 | 670 | 줍깅 주간 운영 지원 | <NA> | 250000000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
7 | 2023 | 669 | 경력단절여성 직업교육을 통한 플로리스트 인력 개발 사업 | <NA> | 150000000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
8 | 2023 | 652 | 스마트폴 설치로 안전한 등하굣길 만들기 | <NA> | 1130000000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9 | 2023 | 646 | 교통약자 보행권 확보를 위한 안전 보도 만들기 | <NA> | 2760000000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
년도 | 제안번호 | 사업명 | 예산편성사업명 | 예산편성사업비 | 사업위치 | 예산편성계획서 | 지출금액 | 사업추진단계 | 사업추진집행률 | 집행기준일 | 비고 | 결과보고서 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
6222 | 2012 | 64 | 어린이 숲속놀이 체험장 조성 | 어린이 숲속놀이 체험장 조성 | 700000000 | 종로구 | <NA> | 700000000 | <NA> | <NA> | 2013-12-31 | <NA> | <NA> |
6223 | 2012 | 65 | 공원우범화방지 CCTV 설치 | 공원우범화방지 CCTV 설치 | 440000000 | 중구 | <NA> | 440000000 | <NA> | <NA> | 2013-12-31 | <NA> | <NA> |
6224 | 2012 | 66 | 공원내 안심어린이 놀이시설 정비 | 공원내 안심어린이 놀이시설 정비 | 524000000 | 중구 | <NA> | 524000000 | <NA> | <NA> | 2013-12-31 | <NA> | <NA> |
6225 | 2012 | 319 | 한부모가정 이해교육 강사양성 및 교육실시 | 한부모가정 이해교육 강사양성 및 교육실시 | 58000000 | 서울시 | <NA> | 58000000 | <NA> | <NA> | 2013-12-31 | <NA> | <NA> |
6226 | 2012 | 320 | 결혼을 앞둔 예비부부 교육 | 결혼을 앞둔 예비부부 교육 | 130000000 | 서울시 | <NA> | 130000000 | <NA> | <NA> | 2013-12-31 | <NA> | <NA> |
6227 | 2012 | 321 | 한부모가정지원센터 설치 | 한부모가정지원센터 설치 | 200000000 | 송파구 | <NA> | 200000000 | <NA> | <NA> | 2013-12-31 | <NA> | <NA> |
6228 | 2012 | 322 | 왕따, 학교폭력 근절을 위한 지역공동체 사업 제안 | 왕따, 학교폭력 근절을 위한 지역공동체 사업 제안 | 185000000 | 금천구 | <NA> | 185000000 | <NA> | <NA> | 2013-12-31 | <NA> | <NA> |
6229 | 2012 | 324 | 결식아동 음식점 한눈에 보여요 | 결식아동 음식점 한눈에 보여요 | 0 | 영등포구 | <NA> | 0 | <NA> | <NA> | 2013-12-31 | <NA> | <NA> |
6230 | 2012 | 325 | 중랑패밀리 행복체험학습 | 중랑패밀리 행복체험학습 | 35000000 | 중랑구 | <NA> | 35000000 | <NA> | <NA> | 2013-12-31 | <NA> | <NA> |
6231 | 2012 | 327 | 아이들이 행복한 놀이마당 조성 | 아이들이 행복한 놀이마당 조성 | 200000000 | 은평구 | <NA> | 200000000 | <NA> | <NA> | 2013-12-31 | <NA> | <NA> |