Overview

Dataset statistics

Number of variables: 5
Number of observations: 228
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.7 KiB
Average record size in memory: 43.6 B
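
The average record size follows from the two figures reported above (total size in memory and number of observations). A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes; the variable names are illustrative only:

```python
# Figures copied from the dataset statistics above.
total_size_kib = 9.7    # total size in memory, in KiB
n_observations = 228    # number of rows

# Convert KiB to bytes (1 KiB = 1024 bytes), then divide by the row count.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B per record")
```

Rounded to one decimal place, the result agrees with the reported average record size.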

Variable types

Categorical: 2
Numeric: 1
Boolean: 2

Dataset

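Description: 회계년도 (fiscal year), 참여예산위원코드 (participatory budget committee member code), 위원코드구분 (member code category), 구별 참여예산 제안사업 투표여부 (whether the member voted on district-level participatory budget proposals), 자치구별 참여예산 제안사업 투표여부 (whether the member voted on autonomous-district-level participatory budget proposals)
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15709/S/1/datasetView.do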

Alerts

회계년도 has constant value "2015" [Constant]
위원코드구분 is highly imbalanced (72.4%) [Imbalance]
참여예산위원코드 has unique values [Unique]
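
The 72.4% imbalance figure can be reproduced from the class counts of 위원코드구분 given in this report (212, 8 and 8). The sketch below assumes the score is defined as one minus the Shannon entropy of the class frequencies divided by the maximum possible entropy, log2(k) for k classes. That definition is an assumption about how the profiler computes the alert, and the function name is illustrative:

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class frequencies and k is the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption of this sketch: a single class scores 0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Class counts for 위원코드구분 as reported in this profile (values 1, 3, 2).
counts = [212, 8, 8]
score = imbalance_score(counts)
print(f"{score:.1%}")
```

A score of 0 means perfectly balanced classes; values near 1 mean that a single class dominates, as it does here with 212 of 228 rows in class 1.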

Reproduction

Analysis started: 2024-05-04 03:50:37.905737
Analysis finished: 2024-05-04 03:50:39.332525
Duration: 1.43 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
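
A comparable report could be regenerated along the following lines. This is a minimal sketch, not the configuration actually used: it assumes the dataset has been exported to a local CSV file, and the file paths, the report title and the function name `build_report` are all hypothetical. It requires the third-party packages pandas and ydata-profiling.

```python
def build_report(csv_path, output_path="report.html"):
    """Profile a CSV export of the dataset and write an HTML report."""
    # Imported lazily so this module loads even without the third-party packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)

    profile = ProfileReport(
        df,
        title="Profiling Report",  # hypothetical title
        dataset={
            # Metadata taken from the Dataset section of this report.
            "description": "회계년도,참여예산위원코드,위원코드구분,"
                           "구별 참여예산 제안사업 투표여부,"
                           "자치구별 참여예산 제안사업 투표여부",
            "author": "서울특별시",
            "url": "https://data.seoul.go.kr/dataList/OA-15709/S/1/datasetView.do",
        },
    )
    profile.to_file(output_path)
    return output_path
```

Inferred variable types depend on how the columns are read, so a rerun may classify 회계년도 or 위원코드구분 differently from the types shown in this report.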

Variables

회계년도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
Value counts: 2015 (228)

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2015
2nd row: 2015
3rd row: 2015
4th row: 2015
5th row: 2015

Common Values

Value | Count | Frequency (%)
2015 | 228 | 100.0%

[Figure: Length. Histogram of lengths of the category]

[Figure: Common Values (Plot)]

참여예산위원코드
Real number (ℝ)

UNIQUE 

Distinct: 228
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 119.45614
Minimum: 1
Maximum: 240
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 12.35
Q1: 57.75
Median: 119.5
Q3: 179.25
95-th percentile: 228.65
Maximum: 240
Range: 239
Interquartile range (IQR): 121.5

Descriptive statistics

Standard deviation: 69.993472
Coefficient of variation (CV): 0.58593449
Kurtosis: -1.2222918
Mean: 119.45614
Median Absolute Deviation (MAD): 61
Skewness: 0.0079898405
Sum: 27236
Variance: 4899.0862
Monotonicity: Strictly decreasing

[Figure: Histogram with fixed size bins (bins=50)]
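
Several of the statistics above are derived from others: the range from the minimum and maximum, the IQR from Q1 and Q3, the coefficient of variation from the standard deviation and mean, the variance from the standard deviation, and the sum from the mean and the row count. As a sanity check, the sketch below recomputes them from the reported values. The variable names and the tolerance are choices made for this example.

```python
import math

# Values copied from the quantile and descriptive statistics above.
n = 228
minimum, maximum = 1, 240
q1, q3 = 57.75, 179.25
mean, std = 119.45614, 69.993472

# Quantities recomputed from the primary statistics.
derived = {
    "range": maximum - minimum,
    "iqr": q3 - q1,
    "cv": std / mean,
    "variance": std ** 2,
    "sum": mean * n,
}

# The same quantities as printed in the report.
reported = {
    "range": 239,
    "iqr": 121.5,
    "cv": 0.58593449,
    "variance": 4899.0862,
    "sum": 27236,
}

for name, value in derived.items():
    # Allow for rounding in the report's displayed digits.
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.8g} reported={reported[name]} match={ok}")
```

All five derived quantities agree with the reported ones to within the rounding of the displayed digits.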

Common values

Value | Count | Frequency (%)
240 | 1 | 0.4%
87 | 1 | 0.4%
84 | 1 | 0.4%
83 | 1 | 0.4%
82 | 1 | 0.4%
81 | 1 | 0.4%
79 | 1 | 0.4%
78 | 1 | 0.4%
77 | 1 | 0.4%
76 | 1 | 0.4%
Other values (218) | 218 | 95.6%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.4%
2 | 1 | 0.4%
3 | 1 | 0.4%
4 | 1 | 0.4%
5 | 1 | 0.4%
6 | 1 | 0.4%
7 | 1 | 0.4%
8 | 1 | 0.4%
9 | 1 | 0.4%
10 | 1 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
240 | 1 | 0.4%
239 | 1 | 0.4%
238 | 1 | 0.4%
237 | 1 | 0.4%
236 | 1 | 0.4%
235 | 1 | 0.4%
234 | 1 | 0.4%
233 | 1 | 0.4%
232 | 1 | 0.4%
231 | 1 | 0.4%

위원코드구분
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
Value counts: 1 (212), 3 (8), 2 (8)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 212 | 93.0%
3 | 8 | 3.5%
2 | 8 | 3.5%

[Figure: Length. Histogram of lengths of the category]

[Figure: Common Values (Plot)]

구별 참여예산 제안사업 투표여부
Boolean

Distinct: 2
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 360.0 B
Value counts: False (161), True (67)

Value | Count | Frequency (%)
False | 161 | 70.6%
True | 67 | 29.4%

자치구별 참여예산 제안사업 투표여부
Boolean

Distinct: 2
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 360.0 B
Value counts: True (179), False (49)

Value | Count | Frequency (%)
True | 179 | 78.5%
False | 49 | 21.5%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

Variable | 참여예산위원코드 | 위원코드구분 | 구별 참여예산 제안사업 투표여부 | 자치구별 참여예산 제안사업 투표여부
참여예산위원코드 | 1.000 | 0.094 | 0.000 | 0.000
위원코드구분 | 0.094 | 1.000 | 0.050 | 0.031
구별 참여예산 제안사업 투표여부 | 0.000 | 0.050 | 1.000 | 0.230
자치구별 참여예산 제안사업 투표여부 | 0.000 | 0.031 | 0.230 | 1.000

[Figure: correlation heatmap]

Variable | 위원코드구분 | 자치구별 참여예산 제안사업 투표여부 | 구별 참여예산 제안사업 투표여부
위원코드구분 | 1.000 | 0.052 | 0.083
자치구별 참여예산 제안사업 투표여부 | 0.052 | 1.000 | 0.148
구별 참여예산 제안사업 투표여부 | 0.083 | 0.148 | 1.000

[Figure: correlation heatmap]

Variable | 참여예산위원코드 | 위원코드구분 | 구별 참여예산 제안사업 투표여부 | 자치구별 참여예산 제안사업 투표여부
참여예산위원코드 | 1.000 | 0.053 | 0.000 | 0.000
위원코드구분 | 0.053 | 1.000 | 0.083 | 0.052
구별 참여예산 제안사업 투표여부 | 0.000 | 0.083 | 1.000 | 0.148
자치구별 참여예산 제안사업 투표여부 | 0.000 | 0.052 | 0.148 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

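First rows

Index | 회계년도 | 참여예산위원코드 | 위원코드구분 | 구별 참여예산 제안사업 투표여부 | 자치구별 참여예산 제안사업 투표여부
0 | 2015 | 240 | 1 | N | Y
1 | 2015 | 239 | 1 | N | N
2 | 2015 | 238 | 1 | N | Y
3 | 2015 | 237 | 1 | N | Y
4 | 2015 | 236 | 1 | N | Y
5 | 2015 | 235 | 1 | N | Y
6 | 2015 | 234 | 1 | N | Y
7 | 2015 | 233 | 1 | N | N
8 | 2015 | 232 | 1 | N | Y
9 | 2015 | 231 | 1 | N | Y

Last rows

Index | 회계년도 | 참여예산위원코드 | 위원코드구분 | 구별 참여예산 제안사업 투표여부 | 자치구별 참여예산 제안사업 투표여부
218 | 2015 | 10 | 2 | N | Y
219 | 2015 | 9 | 1 | Y | Y
220 | 2015 | 8 | 1 | N | Y
221 | 2015 | 7 | 1 | N | N
222 | 2015 | 6 | 1 | Y | Y
223 | 2015 | 5 | 1 | N | Y
224 | 2015 | 4 | 1 | N | Y
225 | 2015 | 3 | 1 | N | Y
226 | 2015 | 2 | 1 | Y | Y
227 | 2015 | 1 | 1 | N | Y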
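
The sample rows show the raw encoding of the two vote columns as Y/N flags, while the variable summaries report them as True/False. The sketch below uses the first five sample rows to illustrate that mapping and to spot-check the "strictly decreasing" ordering of 참여예산위원코드. The Y-to-True mapping is an assumption consistent with the reported counts, and the helper `to_bool` is hypothetical.

```python
# First five sample rows from this report:
# (회계년도, 참여예산위원코드, 위원코드구분, 구별 vote flag, 자치구별 vote flag)
rows = [
    ("2015", 240, "1", "N", "Y"),
    ("2015", 239, "1", "N", "N"),
    ("2015", 238, "1", "N", "Y"),
    ("2015", 237, "1", "N", "Y"),
    ("2015", 236, "1", "N", "Y"),
]

def to_bool(flag):
    """Map a raw 'Y'/'N' flag to a bool; reject any other value."""
    if flag not in ("Y", "N"):
        raise ValueError(f"unexpected flag: {flag!r}")
    return flag == "Y"

# A sequence is strictly decreasing if each value exceeds its successor.
codes = [row[1] for row in rows]
strictly_decreasing = all(a > b for a, b in zip(codes, codes[1:]))

district_votes = [to_bool(row[3]) for row in rows]
autonomous_votes = [to_bool(row[4]) for row in rows]

print(strictly_decreasing, district_votes, autonomous_votes)
```

Rejecting unexpected flags rather than silently treating them as False keeps encoding problems in the source file from turning into wrong vote counts.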