Overview

Dataset statistics

Number of variables4
Number of observations51
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory37.6 B

Variable types

Numeric2
Categorical1
Text1

Dataset

Description서울특별시 서대문구에서 실시한 사업 중 주민참여예산 승인 사업 정보 현황(편성년도, 사업명, 예산)에 관련한 데이터를 제공합니다.
Author서울특별시 서대문구
URLhttps://www.data.go.kr/data/15048767/fileData.do

Alerts

편성년도 has constant value ""Constant
연번 has unique valuesUnique
사업명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:37:39.875658
Analysis finished2023-12-12 12:37:40.545386
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26
Minimum1
Maximum51
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size591.0 B
2023-12-12T21:37:40.614205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.5
Q113.5
median26
Q338.5
95-th percentile48.5
Maximum51
Range50
Interquartile range (IQR)25

Descriptive statistics

Standard deviation14.866069
Coefficient of variation (CV)0.57177187
Kurtosis-1.2
Mean26
Median Absolute Deviation (MAD)13
Skewness0
Sum1326
Variance221
MonotonicityStrictly increasing
2023-12-12T21:37:40.786735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
2.0%
2 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
36 1
 
2.0%
Other values (41) 41
80.4%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
51 1
2.0%
50 1
2.0%
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%

편성년도
Categorical

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
2021
51 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2021 51
100.0%

Length

2023-12-12T21:37:40.917984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:37:41.019651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 51
100.0%

사업명
Text

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-12T21:37:41.279263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length29
Mean length19.294118
Min length11

Characters and Unicode

Total characters984
Distinct characters252
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)100.0%

Sample

1st row외지고 위험한 골목 내 CCTV 설치
2nd row쓰레기 무단투기 방지 스마트 경고판 설치
3rd row노후되어 굴곡지고 균열이 심한 이면도로 포장
4th row담배꽁초 쓰레기통 및 안내표지판 설치
5th row폐건전지 수거 활성화 및 인센티브 제공 사업
ValueCountFrequency (%)
설치 18
 
7.4%
6
 
2.5%
스마트 4
 
1.7%
사업 4
 
1.7%
조성 3
 
1.2%
쓰레기 3
 
1.2%
홍제천 3
 
1.2%
횡단보도 3
 
1.2%
무단투기 2
 
0.8%
설치사업 2
 
0.8%
Other values (177) 194
80.2%
2023-12-12T21:37:41.735139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
192
 
19.5%
25
 
2.5%
23
 
2.3%
20
 
2.0%
16
 
1.6%
16
 
1.6%
12
 
1.2%
11
 
1.1%
11
 
1.1%
11
 
1.1%
Other values (242) 647
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 740
75.2%
Space Separator 192
 
19.5%
Uppercase Letter 16
 
1.6%
Other Punctuation 12
 
1.2%
Decimal Number 7
 
0.7%
Close Punctuation 6
 
0.6%
Open Punctuation 6
 
0.6%
Final Punctuation 2
 
0.2%
Initial Punctuation 2
 
0.2%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
3.4%
23
 
3.1%
20
 
2.7%
16
 
2.2%
16
 
2.2%
12
 
1.6%
11
 
1.5%
11
 
1.5%
11
 
1.5%
10
 
1.4%
Other values (216) 585
79.1%
Uppercase Letter
ValueCountFrequency (%)
C 4
25.0%
V 2
12.5%
T 2
12.5%
E 2
12.5%
S 1
 
6.2%
Y 1
 
6.2%
O 1
 
6.2%
N 1
 
6.2%
D 1
 
6.2%
L 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
! 6
50.0%
· 2
 
16.7%
. 2
 
16.7%
, 2
 
16.7%
Decimal Number
ValueCountFrequency (%)
3 2
28.6%
2 2
28.6%
1 2
28.6%
0 1
14.3%
Final Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
Initial Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
192
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 740
75.2%
Common 228
 
23.2%
Latin 16
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
3.4%
23
 
3.1%
20
 
2.7%
16
 
2.2%
16
 
2.2%
12
 
1.6%
11
 
1.5%
11
 
1.5%
11
 
1.5%
10
 
1.4%
Other values (216) 585
79.1%
Common
ValueCountFrequency (%)
192
84.2%
) 6
 
2.6%
! 6
 
2.6%
( 6
 
2.6%
· 2
 
0.9%
3 2
 
0.9%
. 2
 
0.9%
2 2
 
0.9%
1 2
 
0.9%
, 2
 
0.9%
Other values (6) 6
 
2.6%
Latin
ValueCountFrequency (%)
C 4
25.0%
V 2
12.5%
T 2
12.5%
E 2
12.5%
S 1
 
6.2%
Y 1
 
6.2%
O 1
 
6.2%
N 1
 
6.2%
D 1
 
6.2%
L 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 740
75.2%
ASCII 238
 
24.2%
Punctuation 4
 
0.4%
None 2
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
192
80.7%
) 6
 
2.5%
! 6
 
2.5%
( 6
 
2.5%
C 4
 
1.7%
3 2
 
0.8%
V 2
 
0.8%
. 2
 
0.8%
T 2
 
0.8%
2 2
 
0.8%
Other values (11) 14
 
5.9%
Hangul
ValueCountFrequency (%)
25
 
3.4%
23
 
3.1%
20
 
2.7%
16
 
2.2%
16
 
2.2%
12
 
1.6%
11
 
1.5%
11
 
1.5%
11
 
1.5%
10
 
1.4%
Other values (216) 585
79.1%
None
ValueCountFrequency (%)
· 2
100.0%
Punctuation
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

사업비(단위: 천원)
Real number (ℝ)

Distinct34
Distinct (%)66.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32603.922
Minimum1200
Maximum158000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size591.0 B
2023-12-12T21:37:41.861993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1200
5-th percentile3720
Q19000
median16000
Q340250
95-th percentile92000
Maximum158000
Range156800
Interquartile range (IQR)31250

Descriptive statistics

Standard deviation36385.275
Coefficient of variation (CV)1.1159785
Kurtosis3.3007378
Mean32603.922
Median Absolute Deviation (MAD)10000
Skewness1.8580254
Sum1662800
Variance1.3238882 × 109
MonotonicityNot monotonic
2023-12-12T21:37:42.339182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
15000 4
 
7.8%
20000 3
 
5.9%
30000 3
 
5.9%
8000 3
 
5.9%
25000 2
 
3.9%
24000 2
 
3.9%
80000 2
 
3.9%
10000 2
 
3.9%
92000 2
 
3.9%
5000 2
 
3.9%
Other values (24) 26
51.0%
ValueCountFrequency (%)
1200 1
 
2.0%
3000 2
3.9%
4440 1
 
2.0%
4500 1
 
2.0%
5000 2
3.9%
6000 1
 
2.0%
7000 2
3.9%
8000 3
5.9%
10000 2
3.9%
10900 1
 
2.0%
ValueCountFrequency (%)
158000 1
2.0%
150000 1
2.0%
92000 2
3.9%
90800 1
2.0%
84000 1
2.0%
80000 2
3.9%
75200 1
2.0%
60000 1
2.0%
55200 1
2.0%
50000 1
2.0%

Interactions

2023-12-12T21:37:40.230479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:37:40.046044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:37:40.314701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:37:40.132505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:37:42.453923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업명사업비(단위: 천원)
연번1.0001.0000.322
사업명1.0001.0001.000
사업비(단위: 천원)0.3221.0001.000
2023-12-12T21:37:42.555081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업비(단위: 천원)
연번1.0000.457
사업비(단위: 천원)0.4571.000

Missing values

2023-12-12T21:37:40.436315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:37:40.513566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번편성년도사업명사업비(단위: 천원)
012021외지고 위험한 골목 내 CCTV 설치20000
122021쓰레기 무단투기 방지 스마트 경고판 설치8000
232021노후되어 굴곡지고 균열이 심한 이면도로 포장30000
342021담배꽁초 쓰레기통 및 안내표지판 설치1200
452021폐건전지 수거 활성화 및 인센티브 제공 사업4440
562021동그라미 쉼터 음수대 설치7000
672021자전거 거치대 설치 사업3000
782021무단투기 NO! 집중관리 YES! (이동형 무단투기단속기 설치)8000
892021경계없는 연희동(휠체어 경사판 설치)5000
9102021굴다리 보행환경 개선(안전펜스 설치)12800
연번편성년도사업명사업비(단위: 천원)
41422021횡단보도 바닥신호등 설치 사업60000
42432021서대문 청소년 1인 미디어 지원사업25000
43442021청소년 역사캠프 개설13525
44452021비대면 대학 진로 탐색11475
45462021서대문협치체계 고도화24000
46472021버전2.0 주민참여예산 활성화12000
47482021자원 선순환 홍제천 폭포장 운영55200
48492021홍제천 자전거 길 환경 개선 및 생태교육20000
49502021아이들의 꿈 담은 신기한 놀이터(신기한 놀이터 3호)90800
50512021서대문 먹거리 자치의 실천 공동체마을밥상75200