Overview

Dataset statistics

Number of variables7
Number of observations4475
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory258.0 KiB
Average record size in memory59.0 B

Variable types

Numeric3
Text2
Categorical1
DateTime1

Dataset

Description제안번호,자치단체코드,사업이름,사업위치_구,참여예산위원,참여예산위원코드,투표 참여 시간
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15723/S/1/datasetView.do

Alerts

제안번호 is highly overall correlated with 사업위치_구High correlation
자치단체코드 is highly overall correlated with 사업위치_구High correlation
사업위치_구 is highly overall correlated with 제안번호 and 1 other fieldsHigh correlation

Reproduction

Analysis started2024-05-04 01:08:54.345882
Analysis finished2024-05-04 01:08:59.567606
Duration5.22 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

제안번호
Real number (ℝ)

HIGH CORRELATION 

Distinct373
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3573.6114
Minimum45
Maximum4061
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size39.5 KiB
2024-05-04T01:08:59.845458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum45
5-th percentile3067
Q13397
median3684
Q33820
95-th percentile4043
Maximum4061
Range4016
Interquartile range (IQR)423

Descriptive statistics

Standard deviation489.03924
Coefficient of variation (CV)0.13684735
Kurtosis27.486752
Mean3573.6114
Median Absolute Deviation (MAD)185
Skewness-4.4138411
Sum15991911
Variance239159.37
MonotonicityDecreasing
2024-05-04T01:09:00.334750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3713 78
 
1.7%
3050 64
 
1.4%
3202 63
 
1.4%
3797 63
 
1.4%
3201 62
 
1.4%
3783 59
 
1.3%
4056 55
 
1.2%
3784 55
 
1.2%
4051 52
 
1.2%
3418 49
 
1.1%
Other values (363) 3875
86.6%
ValueCountFrequency (%)
45 33
0.7%
323 3
 
0.1%
575 28
0.6%
2191 5
 
0.1%
3050 64
1.4%
3052 8
 
0.2%
3053 16
 
0.4%
3054 8
 
0.2%
3056 9
 
0.2%
3057 16
 
0.4%
ValueCountFrequency (%)
4061 6
 
0.1%
4060 19
 
0.4%
4059 12
 
0.3%
4058 4
 
0.1%
4057 18
 
0.4%
4056 55
1.2%
4055 21
 
0.5%
4054 12
 
0.3%
4051 52
1.2%
4045 1
 
< 0.1%

자치단체코드
Real number (ℝ)

HIGH CORRELATION 

Distinct25
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3120000
Minimum3000000
Maximum3240000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size39.5 KiB
2024-05-04T01:09:00.763833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3000000
5-th percentile3010000
Q13060000
median3120000
Q33180000
95-th percentile3230000
Maximum3240000
Range240000
Interquartile range (IQR)120000

Descriptive statistics

Standard deviation72119.084
Coefficient of variation (CV)0.023115091
Kurtosis-1.2038503
Mean3120000
Median Absolute Deviation (MAD)60000
Skewness0
Sum1.3962 × 1010
Variance5.2011623 × 109
MonotonicityNot monotonic
2024-05-04T01:09:01.226728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
3200000 179
 
4.0%
3040000 179
 
4.0%
3220000 179
 
4.0%
3240000 179
 
4.0%
3080000 179
 
4.0%
3150000 179
 
4.0%
3050000 179
 
4.0%
3190000 179
 
4.0%
3130000 179
 
4.0%
3070000 179
 
4.0%
Other values (15) 2685
60.0%
ValueCountFrequency (%)
3000000 179
4.0%
3010000 179
4.0%
3020000 179
4.0%
3030000 179
4.0%
3040000 179
4.0%
3050000 179
4.0%
3060000 179
4.0%
3070000 179
4.0%
3080000 179
4.0%
3090000 179
4.0%
ValueCountFrequency (%)
3240000 179
4.0%
3230000 179
4.0%
3220000 179
4.0%
3210000 179
4.0%
3200000 179
4.0%
3190000 179
4.0%
3180000 179
4.0%
3170000 179
4.0%
3160000 179
4.0%
3150000 179
4.0%
Distinct373
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size35.1 KiB
2024-05-04T01:09:01.848374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length32
Mean length20.879777
Min length5

Characters and Unicode

Total characters93437
Distinct characters588
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)0.5%

Sample

1st row청소년 톡톡 놀이 문화기획단
2nd row청소년 톡톡 놀이 문화기획단
3rd row청소년 톡톡 놀이 문화기획단
4th row청소년 톡톡 놀이 문화기획단
5th row청소년 톡톡 놀이 문화기획단
ValueCountFrequency (%)
설치 819
 
3.8%
456
 
2.1%
조성 386
 
1.8%
주세요 350
 
1.6%
위한 288
 
1.3%
위험한 204
 
0.9%
cctv 197
 
0.9%
정비 196
 
0.9%
만들기 150
 
0.7%
산책로 146
 
0.7%
Other values (1162) 18640
85.4%
2024-05-04T01:09:02.841750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17386
 
18.6%
1437
 
1.5%
1368
 
1.5%
1363
 
1.5%
1287
 
1.4%
1212
 
1.3%
1203
 
1.3%
1040
 
1.1%
1028
 
1.1%
1005
 
1.1%
Other values (578) 65108
69.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 70966
76.0%
Space Separator 17386
 
18.6%
Uppercase Letter 1626
 
1.7%
Other Punctuation 1507
 
1.6%
Decimal Number 621
 
0.7%
Close Punctuation 389
 
0.4%
Open Punctuation 380
 
0.4%
Lowercase Letter 209
 
0.2%
Math Symbol 149
 
0.2%
Dash Punctuation 92
 
0.1%
Other values (2) 112
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1437
 
2.0%
1368
 
1.9%
1363
 
1.9%
1287
 
1.8%
1212
 
1.7%
1203
 
1.7%
1040
 
1.5%
1028
 
1.4%
1005
 
1.4%
929
 
1.3%
Other values (516) 59094
83.3%
Uppercase Letter
ValueCountFrequency (%)
C 487
30.0%
T 241
14.8%
V 215
13.2%
E 193
 
11.9%
D 163
 
10.0%
L 134
 
8.2%
P 51
 
3.1%
F 27
 
1.7%
S 27
 
1.7%
B 26
 
1.6%
Other values (7) 62
 
3.8%
Lowercase Letter
ValueCountFrequency (%)
i 34
16.3%
t 33
15.8%
e 19
9.1%
r 16
7.7%
a 16
7.7%
o 15
7.2%
c 12
 
5.7%
p 11
 
5.3%
n 11
 
5.3%
g 8
 
3.8%
Other values (6) 34
16.3%
Decimal Number
ValueCountFrequency (%)
1 254
40.9%
3 109
17.6%
0 101
 
16.3%
2 57
 
9.2%
6 30
 
4.8%
7 27
 
4.3%
5 23
 
3.7%
4 10
 
1.6%
8 10
 
1.6%
Other Punctuation
ValueCountFrequency (%)
! 653
43.3%
, 350
23.2%
. 232
 
15.4%
' 192
 
12.7%
? 69
 
4.6%
/ 7
 
0.5%
4
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 369
94.9%
10
 
2.6%
10
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 360
94.7%
10
 
2.6%
10
 
2.6%
Initial Punctuation
ValueCountFrequency (%)
46
82.1%
10
 
17.9%
Final Punctuation
ValueCountFrequency (%)
46
82.1%
10
 
17.9%
Space Separator
ValueCountFrequency (%)
17386
100.0%
Math Symbol
ValueCountFrequency (%)
~ 149
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 92
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 70942
75.9%
Common 20636
 
22.1%
Latin 1835
 
2.0%
Han 24
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1437
 
2.0%
1368
 
1.9%
1363
 
1.9%
1287
 
1.8%
1212
 
1.7%
1203
 
1.7%
1040
 
1.5%
1028
 
1.4%
1005
 
1.4%
929
 
1.3%
Other values (513) 59070
83.3%
Latin
ValueCountFrequency (%)
C 487
26.5%
T 241
13.1%
V 215
11.7%
E 193
 
10.5%
D 163
 
8.9%
L 134
 
7.3%
P 51
 
2.8%
i 34
 
1.9%
t 33
 
1.8%
F 27
 
1.5%
Other values (23) 257
14.0%
Common
ValueCountFrequency (%)
17386
84.3%
! 653
 
3.2%
) 369
 
1.8%
( 360
 
1.7%
, 350
 
1.7%
1 254
 
1.2%
. 232
 
1.1%
' 192
 
0.9%
~ 149
 
0.7%
3 109
 
0.5%
Other values (19) 582
 
2.8%
Han
ValueCountFrequency (%)
12
50.0%
6
25.0%
6
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 70942
75.9%
ASCII 22315
 
23.9%
Punctuation 116
 
0.1%
None 40
 
< 0.1%
CJK 24
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17386
77.9%
! 653
 
2.9%
C 487
 
2.2%
) 369
 
1.7%
( 360
 
1.6%
, 350
 
1.6%
1 254
 
1.1%
T 241
 
1.1%
. 232
 
1.0%
V 215
 
1.0%
Other values (43) 1768
 
7.9%
Hangul
ValueCountFrequency (%)
1437
 
2.0%
1368
 
1.9%
1363
 
1.9%
1287
 
1.8%
1212
 
1.7%
1203
 
1.7%
1040
 
1.5%
1028
 
1.4%
1005
 
1.4%
929
 
1.3%
Other values (513) 59070
83.3%
Punctuation
ValueCountFrequency (%)
46
39.7%
46
39.7%
10
 
8.6%
10
 
8.6%
4
 
3.4%
CJK
ValueCountFrequency (%)
12
50.0%
6
25.0%
6
25.0%
None
ValueCountFrequency (%)
10
25.0%
10
25.0%
10
25.0%
10
25.0%

사업위치_구
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size35.1 KiB
관악구
 
179
광진구
 
179
은평구
 
179
구로구
 
179
서초구
 
179
Other values (20)
3580 

Length

Max length4
Median length3
Mean length3.08
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row관악구
2nd row관악구
3rd row관악구
4th row관악구
5th row관악구

Common Values

ValueCountFrequency (%)
관악구 179
 
4.0%
광진구 179
 
4.0%
은평구 179
 
4.0%
구로구 179
 
4.0%
서초구 179
 
4.0%
성동구 179
 
4.0%
도봉구 179
 
4.0%
서대문구 179
 
4.0%
노원구 179
 
4.0%
금천구 179
 
4.0%
Other values (15) 2685
60.0%

Length

2024-05-04T01:09:03.299778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
관악구 179
 
4.0%
용산구 179
 
4.0%
강동구 179
 
4.0%
강북구 179
 
4.0%
강서구 179
 
4.0%
동대문구 179
 
4.0%
동작구 179
 
4.0%
마포구 179
 
4.0%
성북구 179
 
4.0%
송파구 179
 
4.0%
Other values (15) 2685
60.0%

참여예산위원
Real number (ℝ)

Distinct179
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean115.46927
Minimum1
Maximum240
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size39.5 KiB
2024-05-04T01:09:03.849620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10
Q152
median113
Q3174
95-th percentile228
Maximum240
Range239
Interquartile range (IQR)122

Descriptive statistics

Standard deviation70.231138
Coefficient of variation (CV)0.6082236
Kurtosis-1.2270783
Mean115.46927
Median Absolute Deviation (MAD)61
Skewness0.080059194
Sum516725
Variance4932.4127
MonotonicityNot monotonic
2024-05-04T01:09:04.324135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
225 25
 
0.6%
61 25
 
0.6%
231 25
 
0.6%
216 25
 
0.6%
206 25
 
0.6%
201 25
 
0.6%
170 25
 
0.6%
140 25
 
0.6%
107 25
 
0.6%
100 25
 
0.6%
Other values (169) 4225
94.4%
ValueCountFrequency (%)
1 25
0.6%
2 25
0.6%
3 25
0.6%
4 25
0.6%
5 25
0.6%
6 25
0.6%
8 25
0.6%
9 25
0.6%
10 25
0.6%
12 25
0.6%
ValueCountFrequency (%)
240 25
0.6%
238 25
0.6%
237 25
0.6%
236 25
0.6%
235 25
0.6%
234 25
0.6%
232 25
0.6%
231 25
0.6%
228 25
0.6%
226 25
0.6%
Distinct179
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size35.1 KiB
2024-05-04T01:09:05.025038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9888268
Min length2

Characters and Unicode

Total characters13375
Distinct characters130
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row이태영
2nd row최미영
3rd row신우리
4th row김종성
5th row황에녹
ValueCountFrequency (%)
이태영 25
 
0.6%
전은배 25
 
0.6%
김환동 25
 
0.6%
이해응 25
 
0.6%
최미경 25
 
0.6%
백현진 25
 
0.6%
황선주 25
 
0.6%
장혜림 25
 
0.6%
이병열 25
 
0.6%
정미호 25
 
0.6%
Other values (169) 4225
94.4%
2024-05-04T01:09:06.108196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1150
 
8.6%
775
 
5.8%
450
 
3.4%
450
 
3.4%
375
 
2.8%
250
 
1.9%
225
 
1.7%
225
 
1.7%
225
 
1.7%
225
 
1.7%
Other values (120) 9025
67.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13375
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1150
 
8.6%
775
 
5.8%
450
 
3.4%
450
 
3.4%
375
 
2.8%
250
 
1.9%
225
 
1.7%
225
 
1.7%
225
 
1.7%
225
 
1.7%
Other values (120) 9025
67.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13375
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1150
 
8.6%
775
 
5.8%
450
 
3.4%
450
 
3.4%
375
 
2.8%
250
 
1.9%
225
 
1.7%
225
 
1.7%
225
 
1.7%
225
 
1.7%
Other values (120) 9025
67.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13375
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1150
 
8.6%
775
 
5.8%
450
 
3.4%
450
 
3.4%
375
 
2.8%
250
 
1.9%
225
 
1.7%
225
 
1.7%
225
 
1.7%
225
 
1.7%
Other values (120) 9025
67.5%
Distinct185
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size35.1 KiB
Minimum2015-07-24 10:01:06
Maximum2015-07-25 15:56:52
2024-05-04T01:09:06.576667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T01:09:07.031631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-05-04T01:08:57.927696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T01:08:55.917544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T01:08:56.877339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T01:08:58.354946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T01:08:56.231439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T01:08:57.178824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T01:08:58.652007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T01:08:56.575413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T01:08:57.517511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-04T01:09:07.293277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
제안번호자치단체코드사업위치_구참여예산위원
제안번호1.0000.5970.8770.000
자치단체코드0.5971.0001.0000.000
사업위치_구0.8771.0001.0000.000
참여예산위원0.0000.0000.0001.000
2024-05-04T01:09:07.535555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
제안번호자치단체코드참여예산위원사업위치_구
제안번호1.000-0.3940.0140.634
자치단체코드-0.3941.0000.0000.998
참여예산위원0.0140.0001.0000.000
사업위치_구0.6340.9980.0001.000

Missing values

2024-05-04T01:08:59.002891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-04T01:08:59.401239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

제안번호자치단체코드사업이름사업위치_구참여예산위원참여예산위원코드투표 참여 시간
040613200000청소년 톡톡 놀이 문화기획단관악구225이태영2015-07-25 14:05:25.0
140613200000청소년 톡톡 놀이 문화기획단관악구203최미영2015-07-25 15:00:09.0
240613200000청소년 톡톡 놀이 문화기획단관악구167신우리2015-07-25 13:46:53.0
340613200000청소년 톡톡 놀이 문화기획단관악구160김종성2015-07-24 14:19:17.0
440613200000청소년 톡톡 놀이 문화기획단관악구146황에녹2015-07-24 13:30:45.0
540613200000청소년 톡톡 놀이 문화기획단관악구104윤은정2015-07-25 13:50:03.0
640603200000청소년 화장품 바르게 알고 쓰기 운동관악구212김희정2015-07-24 13:58:09.0
740603200000청소년 화장품 바르게 알고 쓰기 운동관악구184박동주2015-07-25 10:19:41.0
840603200000청소년 화장품 바르게 알고 쓰기 운동관악구180김재호2015-07-24 11:20:29.0
940603200000청소년 화장품 바르게 알고 쓰기 운동관악구171이성현2015-07-24 11:21:58.0
제안번호자치단체코드사업이름사업위치_구참여예산위원참여예산위원코드투표 참여 시간
4465453220000영유아 아동발달 촉진사업 및 양육환경 개선사업강남구100김환동2015-07-25 10:13:43.0
4466453220000영유아 아동발달 촉진사업 및 양육환경 개선사업강남구92김보민2015-07-24 10:56:13.0
4467453220000영유아 아동발달 촉진사업 및 양육환경 개선사업강남구68박점봉2015-07-24 13:32:25.0
4468453220000영유아 아동발달 촉진사업 및 양육환경 개선사업강남구62김보동2015-07-24 13:11:06.0
4469453220000영유아 아동발달 촉진사업 및 양육환경 개선사업강남구53전은배2015-07-25 13:43:24.0
4470453220000영유아 아동발달 촉진사업 및 양육환경 개선사업강남구46유검우2015-07-25 15:51:41.0
4471453220000영유아 아동발달 촉진사업 및 양육환경 개선사업강남구42송혜성2015-07-25 15:16:04.0
4472453220000영유아 아동발달 촉진사업 및 양육환경 개선사업강남구35김순식2015-07-24 11:38:03.0
4473453220000영유아 아동발달 촉진사업 및 양육환경 개선사업강남구31김경현2015-07-24 12:32:51.0
4474453220000영유아 아동발달 촉진사업 및 양육환경 개선사업강남구16배명숙2015-07-24 16:25:50.0