Overview

Dataset statistics

Number of variables13
Number of observations6232
Missing cells25696
Missing cells (%)31.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory663.5 KiB
Average record size in memory109.0 B

Variable types

Numeric4
Text6
Categorical2
Unsupported1

Dataset

Description년도,제안번호,사업명,예산편성사업명,예산편성사업비,사업위치,예산편성계획서,지출금액,사업추진단계,사업추진집행률,집행기준일,비고,결과보고서
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15413/S/1/datasetView.do

Alerts

년도 is highly overall correlated with 제안번호High correlation
제안번호 is highly overall correlated with 년도High correlation
예산편성사업비 is highly overall correlated with 지출금액High correlation
지출금액 is highly overall correlated with 예산편성사업비High correlation
사업추진단계 is highly imbalanced (53.0%)Imbalance
예산편성사업명 has 68 (1.1%) missing valuesMissing
예산편성계획서 has 2867 (46.0%) missing valuesMissing
지출금액 has 2924 (46.9%) missing valuesMissing
사업추진집행률 has 6232 (100.0%) missing valuesMissing
집행기준일 has 2922 (46.9%) missing valuesMissing
비고 has 6005 (96.4%) missing valuesMissing
결과보고서 has 4678 (75.1%) missing valuesMissing
사업추진집행률 is an unsupported type, check if it needs cleaning or further analysisUnsupported
예산편성사업비 has 130 (2.1%) zerosZeros
지출금액 has 91 (1.5%) zerosZeros

Reproduction

Analysis started2024-05-11 06:19:56.243862
Analysis finished2024-05-11 06:20:03.920874
Duration7.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년도
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2017.733
Minimum2012
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size54.9 KiB
2024-05-11T15:20:04.040542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2012
5-th percentile2013
Q12016
median2018
Q32020
95-th percentile2021
Maximum2023
Range11
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.4661306
Coefficient of variation (CV)0.0012222284
Kurtosis-0.75240643
Mean2017.733
Median Absolute Deviation (MAD)2
Skewness-0.34547107
Sum12574512
Variance6.0818002
MonotonicityNot monotonic
2024-05-11T15:20:04.274207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
2021 934
15.0%
2020 873
14.0%
2019 852
13.7%
2016 804
12.9%
2017 766
12.3%
2018 730
11.7%
2015 524
8.4%
2014 352
 
5.6%
2013 223
 
3.6%
2012 132
 
2.1%
Other values (2) 42
 
0.7%
ValueCountFrequency (%)
2012 132
 
2.1%
2013 223
 
3.6%
2014 352
 
5.6%
2015 524
8.4%
2016 804
12.9%
2017 766
12.3%
2018 730
11.7%
2019 852
13.7%
2020 873
14.0%
2021 934
15.0%
ValueCountFrequency (%)
2023 29
 
0.5%
2022 13
 
0.2%
2021 934
15.0%
2020 873
14.0%
2019 852
13.7%
2018 730
11.7%
2017 766
12.3%
2016 804
12.9%
2015 524
8.4%
2014 352
 
5.6%

제안번호
Real number (ℝ)

HIGH CORRELATION 

Distinct3453
Distinct (%)55.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4833.7357
Minimum1
Maximum8241
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size54.9 KiB
2024-05-11T15:20:04.523126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile306.55
Q11983.5
median5044.5
Q37361
95-th percentile8115.45
Maximum8241
Range8240
Interquartile range (IQR)5377.5

Descriptive statistics

Standard deviation2813.5912
Coefficient of variation (CV)0.58207387
Kurtosis-1.4739889
Mean4833.7357
Median Absolute Deviation (MAD)2395.5
Skewness-0.33866376
Sum30123841
Variance7916295.7
MonotonicityNot monotonic
2024-05-11T15:20:04.838399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7129 4
 
0.1%
7012 4
 
0.1%
7527 4
 
0.1%
7526 4
 
0.1%
7525 4
 
0.1%
7524 4
 
0.1%
7523 4
 
0.1%
7522 4
 
0.1%
7521 4
 
0.1%
7520 4
 
0.1%
Other values (3443) 6192
99.4%
ValueCountFrequency (%)
1 2
< 0.1%
4 1
< 0.1%
7 1
< 0.1%
9 1
< 0.1%
13 2
< 0.1%
14 1
< 0.1%
19 1
< 0.1%
21 1
< 0.1%
29 1
< 0.1%
30 1
< 0.1%
ValueCountFrequency (%)
8241 1
< 0.1%
8240 1
< 0.1%
8239 1
< 0.1%
8238 1
< 0.1%
8237 1
< 0.1%
8236 1
< 0.1%
8235 1
< 0.1%
8234 1
< 0.1%
8233 1
< 0.1%
8232 1
< 0.1%
Distinct5858
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Memory size48.8 KiB
2024-05-11T15:20:05.485509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length79
Median length58
Mean length18.179076
Min length3

Characters and Unicode

Total characters113292
Distinct characters1139
Distinct categories15 ?
Distinct scripts4 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5770 ?
Unique (%)92.6%

Sample

1st row의류리폼센터 운영(시범)
2nd row사회적 약자의 복지향상을 위한 해외연수 운영
3rd row오동근린공원(월곡산) 철쭉동산 만들기
4th row공공기관 내 스마트 수돗물 수질관리 시스템 도입 설치
5th row공원으로 찾아오는 어린이 물놀이터
ValueCountFrequency (%)
398
 
1.5%
설치 398
 
1.5%
만들기 339
 
1.3%
위한 330
 
1.2%
조성 257
 
1.0%
마을 227
 
0.9%
사업 211
 
0.8%
운영 204
 
0.8%
함께 198
 
0.7%
안전한 166
 
0.6%
Other values (10870) 23831
89.7%
2024-05-11T15:20:06.508725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20829
 
18.4%
1621
 
1.4%
1604
 
1.4%
1492
 
1.3%
1373
 
1.2%
1324
 
1.2%
1318
 
1.2%
1267
 
1.1%
1185
 
1.0%
1177
 
1.0%
Other values (1129) 80102
70.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 85764
75.7%
Space Separator 20829
 
18.4%
Other Punctuation 1861
 
1.6%
Uppercase Letter 1067
 
0.9%
Lowercase Letter 865
 
0.8%
Decimal Number 759
 
0.7%
Close Punctuation 609
 
0.5%
Open Punctuation 607
 
0.5%
Initial Punctuation 281
 
0.2%
Final Punctuation 262
 
0.2%
Other values (5) 388
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1621
 
1.9%
1604
 
1.9%
1492
 
1.7%
1373
 
1.6%
1324
 
1.5%
1318
 
1.5%
1267
 
1.5%
1185
 
1.4%
1177
 
1.4%
1060
 
1.2%
Other values (1027) 72343
84.4%
Uppercase Letter
ValueCountFrequency (%)
C 212
19.9%
T 125
11.7%
V 104
9.7%
E 91
 
8.5%
D 70
 
6.6%
O 63
 
5.9%
L 62
 
5.8%
A 34
 
3.2%
S 33
 
3.1%
M 32
 
3.0%
Other values (15) 241
22.6%
Lowercase Letter
ValueCountFrequency (%)
e 101
11.7%
o 87
 
10.1%
a 74
 
8.6%
t 61
 
7.1%
i 57
 
6.6%
r 56
 
6.5%
n 56
 
6.5%
l 47
 
5.4%
u 45
 
5.2%
c 40
 
4.6%
Other values (15) 241
27.9%
Other Punctuation
ValueCountFrequency (%)
! 696
37.4%
, 476
25.6%
' 225
 
12.1%
. 193
 
10.4%
? 179
 
9.6%
& 30
 
1.6%
: 29
 
1.6%
/ 18
 
1.0%
5
 
0.3%
3
 
0.2%
Other values (7) 7
 
0.4%
Decimal Number
ValueCountFrequency (%)
2 182
24.0%
1 168
22.1%
0 113
14.9%
3 92
12.1%
5 51
 
6.7%
4 49
 
6.5%
9 34
 
4.5%
8 26
 
3.4%
6 25
 
3.3%
7 19
 
2.5%
Close Punctuation
ValueCountFrequency (%)
) 553
90.8%
24
 
3.9%
18
 
3.0%
] 12
 
2.0%
1
 
0.2%
1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 553
91.1%
24
 
4.0%
17
 
2.8%
[ 12
 
2.0%
1
 
0.2%
Math Symbol
ValueCountFrequency (%)
~ 155
76.0%
> 17
 
8.3%
< 17
 
8.3%
+ 14
 
6.9%
1
 
0.5%
Initial Punctuation
ValueCountFrequency (%)
156
55.5%
125
44.5%
Final Punctuation
ValueCountFrequency (%)
149
56.9%
113
43.1%
Space Separator
ValueCountFrequency (%)
20829
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 172
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%
Modifier Symbol
ValueCountFrequency (%)
^ 4
100.0%
Letter Number
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 85661
75.6%
Common 25593
 
22.6%
Latin 1935
 
1.7%
Han 103
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1621
 
1.9%
1604
 
1.9%
1492
 
1.7%
1373
 
1.6%
1324
 
1.5%
1318
 
1.5%
1267
 
1.5%
1185
 
1.4%
1177
 
1.4%
1060
 
1.2%
Other values (967) 72240
84.3%
Han
ValueCountFrequency (%)
7
 
6.8%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
Other values (50) 64
62.1%
Common
ValueCountFrequency (%)
20829
81.4%
! 696
 
2.7%
) 553
 
2.2%
( 553
 
2.2%
, 476
 
1.9%
' 225
 
0.9%
. 193
 
0.8%
2 182
 
0.7%
? 179
 
0.7%
- 172
 
0.7%
Other values (41) 1535
 
6.0%
Latin
ValueCountFrequency (%)
C 212
 
11.0%
T 125
 
6.5%
V 104
 
5.4%
e 101
 
5.2%
E 91
 
4.7%
o 87
 
4.5%
a 74
 
3.8%
D 70
 
3.6%
O 63
 
3.3%
L 62
 
3.2%
Other values (41) 946
48.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 85654
75.6%
ASCII 26883
 
23.7%
Punctuation 547
 
0.5%
CJK 97
 
0.1%
None 94
 
0.1%
Compat Jamo 7
 
< 0.1%
CJK Compat Ideographs 6
 
< 0.1%
Number Forms 3
 
< 0.1%
Arrows 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
20829
77.5%
! 696
 
2.6%
) 553
 
2.1%
( 553
 
2.1%
, 476
 
1.8%
' 225
 
0.8%
C 212
 
0.8%
. 193
 
0.7%
2 182
 
0.7%
? 179
 
0.7%
Other values (73) 2785
 
10.4%
Hangul
ValueCountFrequency (%)
1621
 
1.9%
1604
 
1.9%
1492
 
1.7%
1373
 
1.6%
1324
 
1.5%
1318
 
1.5%
1267
 
1.5%
1185
 
1.4%
1177
 
1.4%
1060
 
1.2%
Other values (965) 72233
84.3%
Punctuation
ValueCountFrequency (%)
156
28.5%
149
27.2%
125
22.9%
113
20.7%
3
 
0.5%
1
 
0.2%
None
ValueCountFrequency (%)
24
25.5%
24
25.5%
18
19.1%
17
18.1%
5
 
5.3%
¡ 1
 
1.1%
1
 
1.1%
1
 
1.1%
1
 
1.1%
1
 
1.1%
CJK
ValueCountFrequency (%)
7
 
7.2%
4
 
4.1%
4
 
4.1%
4
 
4.1%
4
 
4.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
3
 
3.1%
Other values (47) 59
60.8%
Compat Jamo
ValueCountFrequency (%)
5
71.4%
2
 
28.6%
CJK Compat Ideographs
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%
Number Forms
ValueCountFrequency (%)
3
100.0%
Arrows
ValueCountFrequency (%)
1
100.0%

예산편성사업명
Text

MISSING 

Distinct1866
Distinct (%)30.3%
Missing68
Missing (%)1.1%
Memory size48.8 KiB
2024-05-11T15:20:07.081484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length58
Mean length19.146982
Min length6

Characters and Unicode

Total characters118022
Distinct characters783
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1746 ?
Unique (%)28.3%

Sample

1st row저소득 어르신 보청기 지원 시범사업
2nd row북악하늘길 산책로 정비
3rd row야간 자전거 안전운행 유도디자인 고도화
4th row공원 환경보호를 위한 행동유도 안내사인 디자인
5th row응봉근린공원(금호산) 무장애 산책로 조성
ValueCountFrequency (%)
계획형 2691
 
12.1%
시민참여예산 2051
 
9.3%
동단위 1967
 
8.9%
구단위 724
 
3.3%
시민참여예산(시민참여 641
 
2.9%
지원 574
 
2.6%
지원사업 544
 
2.5%
동단위계획형 542
 
2.4%
시민참여 372
 
1.7%
주민참여 330
 
1.5%
Other values (4077) 11727
52.9%
2024-05-11T15:20:07.913822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16057
 
13.6%
6196
 
5.2%
6130
 
5.2%
6050
 
5.1%
5130
 
4.3%
3792
 
3.2%
( 3675
 
3.1%
) 3675
 
3.1%
3660
 
3.1%
3604
 
3.1%
Other values (773) 60053
50.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 92604
78.5%
Space Separator 16057
 
13.6%
Open Punctuation 3684
 
3.1%
Close Punctuation 3684
 
3.1%
Decimal Number 656
 
0.6%
Other Punctuation 484
 
0.4%
Uppercase Letter 454
 
0.4%
Lowercase Letter 216
 
0.2%
Math Symbol 77
 
0.1%
Dash Punctuation 65
 
0.1%
Other values (3) 41
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6196
 
6.7%
6130
 
6.6%
6050
 
6.5%
5130
 
5.5%
3792
 
4.1%
3660
 
4.0%
3604
 
3.9%
3579
 
3.9%
3504
 
3.8%
3314
 
3.6%
Other values (687) 47645
51.5%
Uppercase Letter
ValueCountFrequency (%)
C 138
30.4%
T 77
17.0%
V 67
14.8%
D 31
 
6.8%
E 29
 
6.4%
L 24
 
5.3%
B 11
 
2.4%
O 10
 
2.2%
I 9
 
2.0%
A 8
 
1.8%
Other values (14) 50
 
11.0%
Lowercase Letter
ValueCountFrequency (%)
c 54
25.0%
t 36
16.7%
v 27
12.5%
e 18
 
8.3%
o 13
 
6.0%
r 13
 
6.0%
a 9
 
4.2%
i 7
 
3.2%
s 5
 
2.3%
n 5
 
2.3%
Other values (12) 29
13.4%
Other Punctuation
ValueCountFrequency (%)
, 319
65.9%
' 71
 
14.7%
! 49
 
10.1%
? 24
 
5.0%
. 13
 
2.7%
& 3
 
0.6%
* 2
 
0.4%
1
 
0.2%
1
 
0.2%
: 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 135
20.6%
2 113
17.2%
0 104
15.9%
5 79
12.0%
3 68
10.4%
7 47
 
7.2%
4 38
 
5.8%
6 34
 
5.2%
8 20
 
3.0%
9 18
 
2.7%
Math Symbol
ValueCountFrequency (%)
+ 40
51.9%
~ 32
41.6%
= 2
 
2.6%
> 2
 
2.6%
1
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 3675
99.8%
[ 4
 
0.1%
3
 
0.1%
2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 3675
99.8%
] 4
 
0.1%
3
 
0.1%
2
 
0.1%
Initial Punctuation
ValueCountFrequency (%)
11
52.4%
10
47.6%
Final Punctuation
ValueCountFrequency (%)
11
61.1%
7
38.9%
Space Separator
ValueCountFrequency (%)
16057
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 65
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 92595
78.5%
Common 24748
 
21.0%
Latin 670
 
0.6%
Han 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6196
 
6.7%
6130
 
6.6%
6050
 
6.5%
5130
 
5.5%
3792
 
4.1%
3660
 
4.0%
3604
 
3.9%
3579
 
3.9%
3504
 
3.8%
3314
 
3.6%
Other values (679) 47636
51.4%
Latin
ValueCountFrequency (%)
C 138
20.6%
T 77
11.5%
V 67
 
10.0%
c 54
 
8.1%
t 36
 
5.4%
D 31
 
4.6%
E 29
 
4.3%
v 27
 
4.0%
L 24
 
3.6%
e 18
 
2.7%
Other values (36) 169
25.2%
Common
ValueCountFrequency (%)
16057
64.9%
( 3675
 
14.8%
) 3675
 
14.8%
, 319
 
1.3%
1 135
 
0.5%
2 113
 
0.5%
0 104
 
0.4%
5 79
 
0.3%
' 71
 
0.3%
3 68
 
0.3%
Other values (30) 452
 
1.8%
Han
ValueCountFrequency (%)
宿 2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 92595
78.5%
ASCII 25366
 
21.5%
Punctuation 40
 
< 0.1%
None 11
 
< 0.1%
CJK 8
 
< 0.1%
Arrows 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16057
63.3%
( 3675
 
14.5%
) 3675
 
14.5%
, 319
 
1.3%
C 138
 
0.5%
1 135
 
0.5%
2 113
 
0.4%
0 104
 
0.4%
5 79
 
0.3%
T 77
 
0.3%
Other values (65) 994
 
3.9%
Hangul
ValueCountFrequency (%)
6196
 
6.7%
6130
 
6.6%
6050
 
6.5%
5130
 
5.5%
3792
 
4.1%
3660
 
4.0%
3604
 
3.9%
3579
 
3.9%
3504
 
3.8%
3314
 
3.6%
Other values (679) 47636
51.4%
Punctuation
ValueCountFrequency (%)
11
27.5%
11
27.5%
10
25.0%
7
17.5%
1
 
2.5%
None
ValueCountFrequency (%)
3
27.3%
3
27.3%
2
18.2%
2
18.2%
1
 
9.1%
CJK
ValueCountFrequency (%)
宿 2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Arrows
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

예산편성사업비
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1211
Distinct (%)19.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean85437895
Minimum0
Maximum3.75 × 109
Zeros130
Zeros (%)2.1%
Negative0
Negative (%)0.0%
Memory size54.9 KiB
2024-05-11T15:20:08.537750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2000000
Q15000000
median20000000
Q385000000
95-th percentile3.4 × 108
Maximum3.75 × 109
Range3.75 × 109
Interquartile range (IQR)80000000

Descriptive statistics

Standard deviation2.0611189 × 108
Coefficient of variation (CV)2.4124177
Kurtosis86.31088
Mean85437895
Median Absolute Deviation (MAD)17000000
Skewness7.6614333
Sum5.3244896 × 1011
Variance4.2482113 × 1016
MonotonicityNot monotonic
2024-05-11T15:20:08.820820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5000000 389
 
6.2%
10000000 311
 
5.0%
50000000 274
 
4.4%
100000000 247
 
4.0%
3000000 243
 
3.9%
20000000 239
 
3.8%
30000000 215
 
3.4%
200000000 172
 
2.8%
4000000 165
 
2.6%
0 130
 
2.1%
Other values (1201) 3847
61.7%
ValueCountFrequency (%)
0 130
2.1%
500000 8
 
0.1%
565000 1
 
< 0.1%
600000 2
 
< 0.1%
687000 1
 
< 0.1%
700000 3
 
< 0.1%
710000 1
 
< 0.1%
725000 1
 
< 0.1%
750000 1
 
< 0.1%
800000 1
 
< 0.1%
ValueCountFrequency (%)
3750000000 1
< 0.1%
3450000000 1
< 0.1%
3368000000 1
< 0.1%
2980000000 1
< 0.1%
2950000000 1
< 0.1%
2770000000 1
< 0.1%
2760000000 1
< 0.1%
2520000000 1
< 0.1%
2500000000 1
< 0.1%
2400000000 1
< 0.1%

사업위치
Categorical

Distinct31
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size48.8 KiB
도봉구
 
419
동작구
 
412
성동구
 
402
노원구
 
310
금천구
 
301
Other values (26)
4388 

Length

Max length5
Median length3
Mean length3.137516
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
도봉구 419
 
6.7%
동작구 412
 
6.6%
성동구 402
 
6.5%
노원구 310
 
5.0%
금천구 301
 
4.8%
성북구 298
 
4.8%
강서구 293
 
4.7%
동대문구 277
 
4.4%
관악구 266
 
4.3%
영등포구 254
 
4.1%
Other values (21) 3000
48.1%

Length

2024-05-11T15:20:09.073311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
도봉구 419
 
6.7%
동작구 415
 
6.7%
성동구 402
 
6.5%
노원구 310
 
5.0%
금천구 301
 
4.8%
성북구 298
 
4.8%
강서구 293
 
4.7%
동대문구 277
 
4.4%
관악구 266
 
4.3%
영등포구 254
 
4.1%
Other values (19) 2997
48.1%

예산편성계획서
Text

MISSING 

Distinct2224
Distinct (%)66.1%
Missing2867
Missing (%)46.0%
Memory size48.8 KiB
2024-05-11T15:20:09.843700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length178
Median length172
Mean length135.76256
Min length120

Characters and Unicode

Total characters456841
Distinct characters734
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2219 ?
Unique (%)65.9%

Sample

1st rowhttp://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2023/SL_C00248_202307190128141830&n=시민건강국 보건의료정책과_저소득 어르신 보청기 지원 시범사업(101774).hwp
2nd rowhttp://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2023/SL_C00153_202307210225370020&n=북악하늘길 산책로 정비(시민참여)(101776).hwp
3rd rowhttp://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2023/SL_C00493_202307280122300560&n=1. 사업계획서- 미래한강본부 시설관리과_야간 자전거 안전운행 유도디자인 고도화(101768).hwp
4th rowhttp://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2023/SL_C00492_202307210405155540&n=1. 예산편성 사업계획서.hwp
5th rowhttp://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2023/SL_C00471_202307210225527170&n=응봉근린공원(금호산) 무장애 산책로 조성(시민참여)(101777).hwp
ValueCountFrequency (%)
계획형 982
 
7.3%
시민참여예산(시민참여).pdf 641
 
4.8%
http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attfile/upload/2017/sl_c_dong_common&n=구단위 361
 
2.7%
지원(시민참여).pdf 360
 
2.7%
http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attfile/upload/2017/sl_c03750_201710250626558500&n=동단위 341
 
2.5%
시민참여예산 341
 
2.5%
주민참여).pdf 285
 
2.1%
264
 
2.0%
http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attfile/upload/2017/sl_c_gu_common&n=구단위 176
 
1.3%
http://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attfile/upload/2017/sl_c04509_201710260127090370&n=동 164
 
1.2%
Other values (4935) 9518
70.9%
2024-05-11T15:20:10.883723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 30285
 
6.6%
e 23570
 
5.2%
0 22113
 
4.8%
t 20222
 
4.4%
a 20198
 
4.4%
s 20195
 
4.4%
n 20195
 
4.4%
p 20191
 
4.4%
. 16833
 
3.7%
o 13473
 
2.9%
Other values (724) 249566
54.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 212193
46.4%
Decimal Number 79188
 
17.3%
Other Letter 57967
 
12.7%
Other Punctuation 57634
 
12.6%
Uppercase Letter 18827
 
4.1%
Space Separator 10071
 
2.2%
Connector Punctuation 7278
 
1.6%
Math Symbol 6799
 
1.5%
Open Punctuation 3404
 
0.7%
Close Punctuation 3402
 
0.7%
Other values (3) 78
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4446
 
7.7%
4387
 
7.6%
4315
 
7.4%
3251
 
5.6%
2109
 
3.6%
1332
 
2.3%
1270
 
2.2%
1174
 
2.0%
1114
 
1.9%
1105
 
1.9%
Other values (645) 33464
57.7%
Lowercase Letter
ValueCountFrequency (%)
e 23570
11.1%
t 20222
9.5%
a 20198
9.5%
s 20195
9.5%
n 20195
9.5%
p 20191
9.5%
o 13473
 
6.3%
l 13462
 
6.3%
d 10066
 
4.7%
i 6736
 
3.2%
Other values (13) 43885
20.7%
Uppercase Letter
ValueCountFrequency (%)
C 4009
21.3%
L 3382
18.0%
F 3369
17.9%
S 3368
17.9%
O 1444
 
7.7%
M 1080
 
5.7%
N 900
 
4.8%
G 544
 
2.9%
D 384
 
2.0%
U 178
 
0.9%
Other values (12) 169
 
0.9%
Other Punctuation
ValueCountFrequency (%)
/ 30285
52.5%
. 16833
29.2%
? 3387
 
5.9%
& 3366
 
5.8%
: 3365
 
5.8%
, 303
 
0.5%
' 50
 
0.1%
! 42
 
0.1%
2
 
< 0.1%
1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 22113
27.9%
1 13344
16.9%
2 12882
16.3%
7 8164
 
10.3%
5 5236
 
6.6%
3 4124
 
5.2%
9 3786
 
4.8%
4 3286
 
4.1%
6 3191
 
4.0%
8 3062
 
3.9%
Math Symbol
ValueCountFrequency (%)
= 6730
99.0%
+ 40
 
0.6%
~ 29
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 3401
99.9%
3
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 3399
99.9%
3
 
0.1%
Final Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Initial Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
10071
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 7278
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 72
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 231020
50.6%
Common 167854
36.7%
Hangul 57960
 
12.7%
Han 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4446
 
7.7%
4387
 
7.6%
4315
 
7.4%
3251
 
5.6%
2109
 
3.6%
1332
 
2.3%
1270
 
2.2%
1174
 
2.0%
1114
 
1.9%
1105
 
1.9%
Other values (639) 33457
57.7%
Latin
ValueCountFrequency (%)
e 23570
 
10.2%
t 20222
 
8.8%
a 20198
 
8.7%
s 20195
 
8.7%
n 20195
 
8.7%
p 20191
 
8.7%
o 13473
 
5.8%
l 13462
 
5.8%
d 10066
 
4.4%
i 6736
 
2.9%
Other values (35) 62712
27.1%
Common
ValueCountFrequency (%)
/ 30285
18.0%
0 22113
13.2%
. 16833
10.0%
1 13344
 
7.9%
2 12882
 
7.7%
10071
 
6.0%
7 8164
 
4.9%
_ 7278
 
4.3%
= 6730
 
4.0%
5 5236
 
3.1%
Other values (24) 34918
20.8%
Han
ValueCountFrequency (%)
宿 2
28.6%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 398859
87.3%
Hangul 57960
 
12.7%
None 8
 
< 0.1%
Punctuation 7
 
< 0.1%
CJK 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 30285
 
7.6%
e 23570
 
5.9%
0 22113
 
5.5%
t 20222
 
5.1%
a 20198
 
5.1%
s 20195
 
5.1%
n 20195
 
5.1%
p 20191
 
5.1%
. 16833
 
4.2%
o 13473
 
3.4%
Other values (61) 191584
48.0%
Hangul
ValueCountFrequency (%)
4446
 
7.7%
4387
 
7.6%
4315
 
7.4%
3251
 
5.6%
2109
 
3.6%
1332
 
2.3%
1270
 
2.2%
1174
 
2.0%
1114
 
1.9%
1105
 
1.9%
Other values (639) 33457
57.7%
None
ValueCountFrequency (%)
3
37.5%
3
37.5%
2
25.0%
Punctuation
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%
CJK
ValueCountFrequency (%)
宿 2
28.6%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

지출금액
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct966
Distinct (%)29.2%
Missing2924
Missing (%)46.9%
Infinite0
Infinite (%)0.0%
Mean1.1471017 × 108
Minimum0
Maximum2.537692 × 109
Zeros91
Zeros (%)1.5%
Negative0
Negative (%)0.0%
Memory size54.9 KiB
2024-05-11T15:20:11.136246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2500000
Q116000000
median50000000
Q31.3262225 × 108
95-th percentile4.300945 × 108
Maximum2.537692 × 109
Range2.537692 × 109
Interquartile range (IQR)1.1662225 × 108

Descriptive statistics

Standard deviation2.0201637 × 108
Coefficient of variation (CV)1.7611026
Kurtosis43.409583
Mean1.1471017 × 108
Median Absolute Deviation (MAD)42000000
Skewness5.4075519
Sum3.7946123 × 1011
Variance4.0810613 × 1016
MonotonicityNot monotonic
2024-05-11T15:20:11.378965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50000000 204
 
3.3%
100000000 165
 
2.6%
20000000 163
 
2.6%
30000000 146
 
2.3%
10000000 143
 
2.3%
5000000 131
 
2.1%
200000000 105
 
1.7%
0 91
 
1.5%
300000000 65
 
1.0%
40000000 65
 
1.0%
Other values (956) 2030
32.6%
(Missing) 2924
46.9%
ValueCountFrequency (%)
0 91
1.5%
200000 1
 
< 0.1%
400000 1
 
< 0.1%
500000 4
 
0.1%
710000 1
 
< 0.1%
725000 1
 
< 0.1%
816000 1
 
< 0.1%
900000 3
 
< 0.1%
960000 1
 
< 0.1%
1000000 17
 
0.3%
ValueCountFrequency (%)
2537691980 1
< 0.1%
2520000000 1
< 0.1%
2400000000 1
< 0.1%
2398088430 1
< 0.1%
2117417000 1
< 0.1%
2100000000 1
< 0.1%
2000000000 1
< 0.1%
1977156000 1
< 0.1%
1900000000 1
< 0.1%
1800000000 1
< 0.1%

사업추진단계
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size48.8 KiB
<NA>
3073 
완료
2948 
추진중
 
137
미집행
 
69
발주
 
3

Length

Max length4
Median length3
Mean length3.0198973
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 3073
49.3%
완료 2948
47.3%
추진중 137
 
2.2%
미집행 69
 
1.1%
발주 3
 
< 0.1%
계획수립 2
 
< 0.1%

Length

2024-05-11T15:20:11.620411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T15:20:11.837345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 3073
49.3%
완료 2948
47.3%
추진중 137
 
2.2%
미집행 69
 
1.1%
발주 3
 
< 0.1%
계획수립 2
 
< 0.1%

사업추진집행률
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing6232
Missing (%)100.0%
Memory size54.9 KiB

집행기준일
Text

MISSING 

Distinct144
Distinct (%)4.4%
Missing2922
Missing (%)46.9%
Memory size48.8 KiB
2024-05-11T15:20:12.122120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.9978852
Min length3

Characters and Unicode

Total characters33093
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)2.2%

Sample

1st row2023-11-30
2nd row2023-11-29
3rd row2023-07-28
4th row2023-07-21
5th row2023-11-29
ValueCountFrequency (%)
2017-12-31 799
24.1%
2018-01-15 730
22.1%
2016-12-31 521
15.7%
2015-12-31 357
10.8%
2014-12-31 218
 
6.6%
2019-06-30 166
 
5.0%
2013-12-31 132
 
4.0%
2019-12-19 31
 
0.9%
2019-10-04 18
 
0.5%
2019-12-16 12
 
0.4%
Other values (134) 326
9.8%
2024-05-11T15:20:12.604694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 9199
27.8%
- 6618
20.0%
2 5740
17.3%
0 4873
14.7%
3 2412
 
7.3%
5 1122
 
3.4%
7 833
 
2.5%
8 783
 
2.4%
6 777
 
2.3%
9 478
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 26475
80.0%
Dash Punctuation 6618
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 9199
34.7%
2 5740
21.7%
0 4873
18.4%
3 2412
 
9.1%
5 1122
 
4.2%
7 833
 
3.1%
8 783
 
3.0%
6 777
 
2.9%
9 478
 
1.8%
4 258
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 6618
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 33093
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 9199
27.8%
- 6618
20.0%
2 5740
17.3%
0 4873
14.7%
3 2412
 
7.3%
5 1122
 
3.4%
7 833
 
2.5%
8 783
 
2.4%
6 777
 
2.3%
9 478
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 33093
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 9199
27.8%
- 6618
20.0%
2 5740
17.3%
0 4873
14.7%
3 2412
 
7.3%
5 1122
 
3.4%
7 833
 
2.5%
8 783
 
2.4%
6 777
 
2.3%
9 478
 
1.4%

비고
Text

MISSING 

Distinct203
Distinct (%)89.4%
Missing6005
Missing (%)96.4%
Memory size48.8 KiB
2024-05-11T15:20:12.922537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length371
Median length107
Mean length52.85022
Min length2

Characters and Unicode

Total characters11997
Distinct characters464
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique191 ?
Unique (%)84.1%

Sample

1st row대상자 선정-의원검진-보청기 구입-적합성 평가-보청기 지원 확정-지원금 지급 과정으로 1~2개월이상 소요됨 자치구 예산집행 완료(2023. 12월중 예정)
2nd row현재 공사 진행중으로 연도 내 준공 예정임
3rd row설계 완료하였으며, 세부 추진계획 수립후 사업 시행중에 있음. -한강공원 저지대 침수, 안정적인 전기 인입(사용), 물품 유지보수 용이성 등 검토로 사업 지연 -2023년 11월 계획수립 완료후 공사 및 관급자재에 대하여 계약부서에 계약 의뢰중에 있음.
4th row특이사항 : '22년 디자인거버넌스 사업으로 개발된 ‘올바른 공원 이용을 유도하는 서비스디자인’ 확산 추진 향후집행계획 : 효과성이 검증된 디자인 결과물을 고도화하여 대상지 맞춤형 디자인 적용 예정 사업부진사유 : 장소 기반 증강현실(AR) 기술구현 및 대상지 부서 협의 등 면밀한 사전검토로 인한 사업추진 지연
5th row공사 진행 중으로 연내 준공 예정임
ValueCountFrequency (%)
78
 
3.0%
48
 
1.9%
사업 38
 
1.5%
사업으로 24
 
0.9%
예정 24
 
0.9%
22
 
0.9%
추진 19
 
0.7%
자치구 19
 
0.7%
설치 18
 
0.7%
따른 17
 
0.7%
Other values (1439) 2262
88.0%
2024-05-11T15:20:13.541391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2437
 
20.3%
0 373
 
3.1%
320
 
2.7%
1 255
 
2.1%
213
 
1.8%
, 209
 
1.7%
2 206
 
1.7%
179
 
1.5%
148
 
1.2%
( 134
 
1.1%
Other values (454) 7523
62.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7460
62.2%
Space Separator 2437
 
20.3%
Decimal Number 1239
 
10.3%
Other Punctuation 418
 
3.5%
Open Punctuation 135
 
1.1%
Close Punctuation 135
 
1.1%
Uppercase Letter 54
 
0.5%
Dash Punctuation 49
 
0.4%
Math Symbol 45
 
0.4%
Final Punctuation 7
 
0.1%
Other values (3) 18
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
320
 
4.3%
213
 
2.9%
179
 
2.4%
148
 
2.0%
131
 
1.8%
128
 
1.7%
117
 
1.6%
116
 
1.6%
116
 
1.6%
110
 
1.5%
Other values (402) 5882
78.8%
Uppercase Letter
ValueCountFrequency (%)
O 10
18.5%
C 8
14.8%
L 6
11.1%
D 5
9.3%
T 5
9.3%
E 5
9.3%
V 4
 
7.4%
I 3
 
5.6%
B 2
 
3.7%
M 1
 
1.9%
Other values (5) 5
9.3%
Decimal Number
ValueCountFrequency (%)
0 373
30.1%
1 255
20.6%
2 206
16.6%
5 80
 
6.5%
9 74
 
6.0%
3 59
 
4.8%
6 49
 
4.0%
4 49
 
4.0%
7 47
 
3.8%
8 47
 
3.8%
Other Punctuation
ValueCountFrequency (%)
, 209
50.0%
. 120
28.7%
: 48
 
11.5%
' 17
 
4.1%
% 12
 
2.9%
4
 
1.0%
/ 4
 
1.0%
? 2
 
0.5%
1
 
0.2%
* 1
 
0.2%
Math Symbol
ValueCountFrequency (%)
~ 27
60.0%
> 10
 
22.2%
6
 
13.3%
= 2
 
4.4%
Other Symbol
ValueCountFrequency (%)
4
57.1%
2
28.6%
1
 
14.3%
Open Punctuation
ValueCountFrequency (%)
( 134
99.3%
[ 1
 
0.7%
Close Punctuation
ValueCountFrequency (%)
) 134
99.3%
] 1
 
0.7%
Lowercase Letter
ValueCountFrequency (%)
o 3
50.0%
m 3
50.0%
Space Separator
ValueCountFrequency (%)
2437
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 49
100.0%
Final Punctuation
ValueCountFrequency (%)
7
100.0%
Initial Punctuation
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7459
62.2%
Common 4477
37.3%
Latin 61
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
320
 
4.3%
213
 
2.9%
179
 
2.4%
148
 
2.0%
131
 
1.8%
128
 
1.7%
117
 
1.6%
116
 
1.6%
116
 
1.6%
110
 
1.5%
Other values (401) 5881
78.8%
Common
ValueCountFrequency (%)
2437
54.4%
0 373
 
8.3%
1 255
 
5.7%
, 209
 
4.7%
2 206
 
4.6%
( 134
 
3.0%
) 134
 
3.0%
. 120
 
2.7%
5 80
 
1.8%
9 74
 
1.7%
Other values (25) 455
 
10.2%
Latin
ValueCountFrequency (%)
O 10
16.4%
C 8
13.1%
L 6
9.8%
D 5
8.2%
T 5
8.2%
E 5
8.2%
V 4
 
6.6%
I 3
 
4.9%
o 3
 
4.9%
m 3
 
4.9%
Other values (8) 9
14.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7458
62.2%
ASCII 4507
37.6%
Punctuation 16
 
0.1%
Arrows 6
 
0.1%
Geometric Shapes 5
 
< 0.1%
CJK Compat 2
 
< 0.1%
None 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2437
54.1%
0 373
 
8.3%
1 255
 
5.7%
, 209
 
4.6%
2 206
 
4.6%
( 134
 
3.0%
) 134
 
3.0%
. 120
 
2.7%
5 80
 
1.8%
9 74
 
1.6%
Other values (34) 485
 
10.8%
Hangul
ValueCountFrequency (%)
320
 
4.3%
213
 
2.9%
179
 
2.4%
148
 
2.0%
131
 
1.8%
128
 
1.7%
117
 
1.6%
116
 
1.6%
116
 
1.6%
110
 
1.5%
Other values (400) 5880
78.8%
Punctuation
ValueCountFrequency (%)
7
43.8%
5
31.2%
4
25.0%
Arrows
ValueCountFrequency (%)
6
100.0%
Geometric Shapes
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
CJK Compat
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
1
50.0%
º 1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

결과보고서
Text

MISSING 

Distinct1554
Distinct (%)100.0%
Missing4678
Missing (%)75.1%
Memory size48.8 KiB
2024-05-11T15:20:13.926309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length176
Median length164
Mean length135.35393
Min length118

Characters and Unicode

Total characters210340
Distinct characters700
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1554 ?
Unique (%)100.0%

Sample

1st rowhttp://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2021/SL_R02096_202212060542588730&n=은행나무 그물망 설치사업 추진실적(도봉구).hwpx
2nd rowhttp://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2023/SL_R00457_202303270709337040&n=공공임대주택 야외운동기구 설치 준공 보고.pdf
3rd rowhttp://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2021/SL_R00208_202211170133399750&n=2022년 전통시장 홍보 에코백 배포계획.pdf
4th rowhttp://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2021/SL_R01691_202212080956494550&n=현장 사진(가양나들목).zip
5th rowhttp://yesan.seoul.go.kr/d.jsp?filename=/apps/yesancontents/attFile/upload/2021/SL_R02110_202212060543308010&n=남부순환로 자전거도로 단절구간 연결사업 결과보고(남부순환로)_공사완료.hwp
ValueCountFrequency (%)
181
 
2.8%
설치.pdf 91
 
1.4%
정비사업.pdf 69
 
1.1%
조성.pdf 49
 
0.8%
설치 49
 
0.8%
주민참여).pdf 48
 
0.8%
교통안전시설물 46
 
0.7%
45
 
0.7%
주변 42
 
0.7%
정비.pdf 41
 
0.6%
Other values (3906) 5730
89.7%
2024-05-11T15:20:14.563200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 13986
 
6.6%
0 12108
 
5.8%
e 10882
 
5.2%
t 9329
 
4.4%
s 9329
 
4.4%
a 9329
 
4.4%
n 9326
 
4.4%
p 9323
 
4.4%
1 7986
 
3.8%
. 7849
 
3.7%
Other values (690) 110893
52.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 97960
46.6%
Decimal Number 43187
20.5%
Other Punctuation 26690
 
12.7%
Other Letter 23420
 
11.1%
Uppercase Letter 6482
 
3.1%
Space Separator 4839
 
2.3%
Connector Punctuation 3180
 
1.5%
Math Symbol 3135
 
1.5%
Open Punctuation 656
 
0.3%
Close Punctuation 654
 
0.3%
Other values (4) 137
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
595
 
2.5%
587
 
2.5%
533
 
2.3%
513
 
2.2%
482
 
2.1%
473
 
2.0%
419
 
1.8%
412
 
1.8%
410
 
1.8%
393
 
1.7%
Other values (607) 18603
79.4%
Lowercase Letter
ValueCountFrequency (%)
e 10882
11.1%
t 9329
9.5%
s 9329
9.5%
a 9329
9.5%
n 9326
9.5%
p 9323
9.5%
o 6223
 
6.4%
l 6221
 
6.4%
d 4374
 
4.5%
i 3117
 
3.2%
Other values (15) 20507
20.9%
Uppercase Letter
ValueCountFrequency (%)
L 1568
24.2%
F 1558
24.0%
R 1556
24.0%
S 1556
24.0%
C 87
 
1.3%
T 48
 
0.7%
V 43
 
0.7%
D 17
 
0.3%
E 15
 
0.2%
I 6
 
0.1%
Other values (10) 28
 
0.4%
Decimal Number
ValueCountFrequency (%)
0 12108
28.0%
1 7986
18.5%
2 7487
17.3%
7 3343
 
7.7%
8 2426
 
5.6%
3 2290
 
5.3%
5 2004
 
4.6%
4 1933
 
4.5%
6 1832
 
4.2%
9 1778
 
4.1%
Other Punctuation
ValueCountFrequency (%)
/ 13986
52.4%
. 7849
29.4%
? 1565
 
5.9%
& 1556
 
5.8%
: 1554
 
5.8%
, 126
 
0.5%
' 30
 
0.1%
! 23
 
0.1%
% 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 623
95.0%
13
 
2.0%
[ 13
 
2.0%
7
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 621
95.0%
13
 
2.0%
] 13
 
2.0%
7
 
1.1%
Math Symbol
ValueCountFrequency (%)
= 3108
99.1%
~ 21
 
0.7%
+ 6
 
0.2%
Final Punctuation
ValueCountFrequency (%)
2
50.0%
2
50.0%
Initial Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
4839
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3180
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 129
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 104442
49.7%
Common 82478
39.2%
Hangul 23415
 
11.1%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
595
 
2.5%
587
 
2.5%
533
 
2.3%
513
 
2.2%
482
 
2.1%
473
 
2.0%
419
 
1.8%
412
 
1.8%
410
 
1.8%
393
 
1.7%
Other values (602) 18598
79.4%
Latin
ValueCountFrequency (%)
e 10882
10.4%
t 9329
 
8.9%
s 9329
 
8.9%
a 9329
 
8.9%
n 9326
 
8.9%
p 9323
 
8.9%
o 6223
 
6.0%
l 6221
 
6.0%
d 4374
 
4.2%
i 3117
 
3.0%
Other values (35) 26989
25.8%
Common
ValueCountFrequency (%)
/ 13986
17.0%
0 12108
14.7%
1 7986
9.7%
. 7849
9.5%
2 7487
9.1%
4839
 
5.9%
7 3343
 
4.1%
_ 3180
 
3.9%
= 3108
 
3.8%
8 2426
 
2.9%
Other values (28) 16166
19.6%
Han
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
宿 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 186872
88.8%
Hangul 23415
 
11.1%
None 40
 
< 0.1%
Punctuation 7
 
< 0.1%
CJK 4
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%
Misc Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 13986
 
7.5%
0 12108
 
6.5%
e 10882
 
5.8%
t 9329
 
5.0%
s 9329
 
5.0%
a 9329
 
5.0%
n 9326
 
5.0%
p 9323
 
5.0%
1 7986
 
4.3%
. 7849
 
4.2%
Other values (64) 87425
46.8%
Hangul
ValueCountFrequency (%)
595
 
2.5%
587
 
2.5%
533
 
2.3%
513
 
2.2%
482
 
2.1%
473
 
2.0%
419
 
1.8%
412
 
1.8%
410
 
1.8%
393
 
1.7%
Other values (602) 18598
79.4%
None
ValueCountFrequency (%)
13
32.5%
13
32.5%
7
17.5%
7
17.5%
Punctuation
ValueCountFrequency (%)
2
28.6%
2
28.6%
2
28.6%
1
14.3%
CJK
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
宿 1
25.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%

Interactions

2024-05-11T15:20:02.310781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:00.143027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:00.828397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:01.461192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:02.497698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:00.320292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:00.990167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:01.676761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:02.702222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:00.472832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:01.121253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:01.911632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:02.901640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:00.651356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:01.277864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:20:02.105431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T15:20:14.735203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도제안번호예산편성사업비사업위치지출금액사업추진단계
년도1.0000.8400.3230.4200.4310.746
제안번호0.8401.0000.2970.5740.2860.345
예산편성사업비0.3230.2971.0000.2950.9560.375
사업위치0.4200.5740.2951.0000.3010.376
지출금액0.4310.2860.9560.3011.0000.343
사업추진단계0.7460.3450.3750.3760.3431.000
2024-05-11T15:20:14.875411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업위치사업추진단계
사업위치1.0000.191
사업추진단계0.1911.000
2024-05-11T15:20:15.012220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도제안번호예산편성사업비지출금액사업위치사업추진단계
년도1.0000.741-0.389-0.1990.1450.403
제안번호0.7411.000-0.362-0.3990.2190.150
예산편성사업비-0.389-0.3621.0000.9640.1130.164
지출금액-0.199-0.3990.9641.0000.1130.149
사업위치0.1450.2190.1130.1131.0000.191
사업추진단계0.4030.1500.1640.1490.1911.000

Missing values

2024-05-11T15:20:03.106579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T15:20:03.415398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-05-11T15:20:03.729694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

년도제안번호사업명예산편성사업명예산편성사업비사업위치예산편성계획서지출금액사업추진단계사업추진집행률집행기준일비고결과보고서
02023332의류리폼센터 운영(시범)<NA>219508000<NA><NA><NA><NA><NA><NA><NA><NA>
12023624사회적 약자의 복지향상을 위한 해외연수 운영<NA>250000000<NA><NA><NA><NA><NA><NA><NA><NA>
22023264오동근린공원(월곡산) 철쭉동산 만들기<NA>35000000<NA><NA><NA><NA><NA><NA><NA><NA>
32023260공공기관 내 스마트 수돗물 수질관리 시스템 도입 설치<NA>450000000<NA><NA><NA><NA><NA><NA><NA><NA>
42023249공원으로 찾아오는 어린이 물놀이터<NA>297000000<NA><NA><NA><NA><NA><NA><NA><NA>
52023671가정용 소형감량기 설치 지원사업 지원<NA>1000000000<NA><NA><NA><NA><NA><NA><NA><NA>
62023670줍깅 주간 운영 지원<NA>250000000<NA><NA><NA><NA><NA><NA><NA><NA>
72023669경력단절여성 직업교육을 통한 플로리스트 인력 개발 사업<NA>150000000<NA><NA><NA><NA><NA><NA><NA><NA>
82023652스마트폴 설치로 안전한 등하굣길 만들기<NA>1130000000<NA><NA><NA><NA><NA><NA><NA><NA>
92023646교통약자 보행권 확보를 위한 안전 보도 만들기<NA>2760000000<NA><NA><NA><NA><NA><NA><NA><NA>
년도제안번호사업명예산편성사업명예산편성사업비사업위치예산편성계획서지출금액사업추진단계사업추진집행률집행기준일비고결과보고서
6222201264어린이 숲속놀이 체험장 조성어린이 숲속놀이 체험장 조성700000000종로구<NA>700000000<NA><NA>2013-12-31<NA><NA>
6223201265공원우범화방지 CCTV 설치공원우범화방지 CCTV 설치440000000중구<NA>440000000<NA><NA>2013-12-31<NA><NA>
6224201266공원내 안심어린이 놀이시설 정비공원내 안심어린이 놀이시설 정비524000000중구<NA>524000000<NA><NA>2013-12-31<NA><NA>
62252012319한부모가정 이해교육 강사양성 및 교육실시한부모가정 이해교육 강사양성 및 교육실시58000000서울시<NA>58000000<NA><NA>2013-12-31<NA><NA>
62262012320결혼을 앞둔 예비부부 교육결혼을 앞둔 예비부부 교육130000000서울시<NA>130000000<NA><NA>2013-12-31<NA><NA>
62272012321한부모가정지원센터 설치한부모가정지원센터 설치200000000송파구<NA>200000000<NA><NA>2013-12-31<NA><NA>
62282012322왕따, 학교폭력 근절을 위한 지역공동체 사업 제안왕따, 학교폭력 근절을 위한 지역공동체 사업 제안185000000금천구<NA>185000000<NA><NA>2013-12-31<NA><NA>
62292012324결식아동 음식점 한눈에 보여요결식아동 음식점 한눈에 보여요0영등포구<NA>0<NA><NA>2013-12-31<NA><NA>
62302012325중랑패밀리 행복체험학습중랑패밀리 행복체험학습35000000중랑구<NA>35000000<NA><NA>2013-12-31<NA><NA>
62312012327아이들이 행복한 놀이마당 조성아이들이 행복한 놀이마당 조성200000000은평구<NA>200000000<NA><NA>2013-12-31<NA><NA>