Overview

Dataset statistics

Number of variables: 4
Number of observations: 24
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 948.0 B
Average record size in memory: 39.5 B
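
The derived figures in this block follow from the raw counts by simple arithmetic. The sketch below recomputes them from the numbers quoted above. It assumes that the missing-cell percentage is taken over rows times columns; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 4
n_observations = 24
total_size_bytes = 948.0
missing_cells = 0

# Assumption: every observation contributes one cell per variable.
n_cells = n_variables * n_observations

avg_record_size = total_size_bytes / n_observations
missing_pct = 100 * missing_cells / n_cells

print(f"cells: {n_cells}")
print(f"average record size: {avg_record_size:.1f} B")
print(f"missing cells: {missing_pct:.1f}%")
```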

Variable types

Numeric: 2
Categorical: 1
Text: 1

Dataset

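Description: List of project type, project name, and required budget taken from the resident participatory budgeting operation status data of Yeonsu-gu, Incheon Metropolitan City. Fields: project type (구분), project name (사업명), and required budget in thousand KRW (소요예산(천원)).
Author: Yeonsu-gu, Incheon Metropolitan City (인천광역시 연수구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15087402&srcSe=7661IVAWM27C61E190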

Alerts

연번 is highly overall correlated with 구분 (High correlation)
소요예산(천원) is highly overall correlated with 구분 (High correlation)
구분 is highly overall correlated with 연번 and 1 other field (High correlation)
연번 has unique values (Unique)
사업명 has unique values (Unique)
소요예산(천원) has 2 (8.3%) zeros (Zeros)

Reproduction

Analysis started: 2024-01-28 06:05:46.913848
Analysis finished: 2024-01-28 06:05:47.553785
Duration: 0.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
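
The reported duration is the difference between the two timestamps. A minimal check using only the standard library, with the timestamps copied from the lines above:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2024-01-28 06:05:46.913848")
finished = datetime.fromisoformat("2024-01-28 06:05:47.553785")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```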

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 24
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.5
Minimum: 1
Maximum: 24
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 348.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.15
Q1: 6.75
Median: 12.5
Q3: 18.25
95-th percentile: 22.85
Maximum: 24
Range: 23
Interquartile range (IQR): 11.5
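
Because 연번 is strictly increasing from 1 to 24 with no gaps, these quantiles can be reproduced by hand. The sketch below assumes the report uses linear interpolation between neighbouring order statistics; that assumption is consistent with every figure listed above, but it is not stated in the report itself. The helper name `quantile` is illustrative.

```python
def quantile(values, q):
    """Return the q-quantile (0 <= q <= 1) using linear interpolation."""
    ordered = sorted(values)
    position = q * (len(ordered) - 1)          # fractional index into the sorted data
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


serial_numbers = list(range(1, 25))            # 연번 takes the values 1..24

q05 = quantile(serial_numbers, 0.05)
q25 = quantile(serial_numbers, 0.25)
q50 = quantile(serial_numbers, 0.50)
q75 = quantile(serial_numbers, 0.75)
q95 = quantile(serial_numbers, 0.95)
iqr = q75 - q25
value_range = max(serial_numbers) - min(serial_numbers)

print(round(q05, 2), q25, q50, q75, round(q95, 2), iqr, value_range)
```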

Descriptive statistics

Standard deviation: 7.0710678
Coefficient of variation (CV): 0.56568542
Kurtosis: -1.2
Mean: 12.5
Median Absolute Deviation (MAD): 6
Skewness: 0
Sum: 300
Variance: 50
Monotonicity: Strictly increasing
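
The descriptive statistics can be checked the same way. This sketch assumes the variance is the sample variance (divisor n - 1) and that kurtosis and skewness are the bias-corrected sample estimators; both assumptions reproduce the values above, although the report does not name its estimators.

```python
import math
import statistics

serial_numbers = list(range(1, 25))            # 연번 takes the values 1..24
n = len(serial_numbers)

total = sum(serial_numbers)
mean = total / n
variance = statistics.variance(serial_numbers)  # sample variance, divisor n - 1
std = math.sqrt(variance)
cv = std / mean

median = statistics.median(serial_numbers)
mad = statistics.median([abs(x - median) for x in serial_numbers])

# Central moments with divisor n, used by the bias-corrected estimators below.
m2 = sum((x - mean) ** 2 for x in serial_numbers) / n
m3 = sum((x - mean) ** 3 for x in serial_numbers) / n
m4 = sum((x - mean) ** 4 for x in serial_numbers) / n

# Bias-corrected sample skewness and excess kurtosis (assumed estimators).
skewness = math.sqrt(n * (n - 1)) / (n - 2) * m3 / m2 ** 1.5
kurtosis = (n - 1) / ((n - 2) * (n - 3)) * ((n + 1) * (m4 / m2 ** 2 - 3) + 6)

print(total, mean, variance, round(std, 7), round(cv, 8), mad)
print(round(skewness, 6), round(kurtosis, 6))
```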
Histogram with fixed size bins (bins=24)

Common values

Value | Count | Frequency (%)
1 | 1 | 4.2%
14 | 1 | 4.2%
24 | 1 | 4.2%
23 | 1 | 4.2%
22 | 1 | 4.2%
21 | 1 | 4.2%
20 | 1 | 4.2%
19 | 1 | 4.2%
18 | 1 | 4.2%
17 | 1 | 4.2%
Other values (14) | 14 | 58.3%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 4.2%
2 | 1 | 4.2%
3 | 1 | 4.2%
4 | 1 | 4.2%
5 | 1 | 4.2%
6 | 1 | 4.2%
7 | 1 | 4.2%
8 | 1 | 4.2%
9 | 1 | 4.2%
10 | 1 | 4.2%

Maximum 10 values

Value | Count | Frequency (%)
24 | 1 | 4.2%
23 | 1 | 4.2%
22 | 1 | 4.2%
21 | 1 | 4.2%
20 | 1 | 4.2%
19 | 1 | 4.2%
18 | 1 | 4.2%
17 | 1 | 4.2%
16 | 1 | 4.2%
15 | 1 | 4.2%

구분
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 12.5%
Missing: 0
Missing (%): 0.0%
Memory size: 324.0 B

Length

Max length: 5
Median length: 4
Mean length: 4.1666667
Min length: 4

Unique

Unique: 1
Unique (%): 4.2%

Sample

1st row: 구정참여
2nd row: 구정참여
3rd row: 구정참여
4th row: 구정참여
5th row: 구정참여

Common Values

Value | Count | Frequency (%)
동정참여 | 19 | 79.2%
구정참여 | 4 | 16.7%
구정참여 | 1 | 4.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
동정참여 | 19 | 79.2%
구정참여 | 5 | 20.8%
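
The report counts three distinct values for 구분, yet two of them print identically as 구정참여. The length statistics explain the split: a mean length of 4.1666667 over 24 rows is 100 characters in total, which is 20 values of length 4 plus 4 values of length 5. Since all 19 동정참여 rows must share one length, the variant that occurs 4 times is the one carrying an extra character. The sketch below assumes that extra character is a trailing space (the report does not say which character it is) and shows how stripping whitespace merges the two variants into the 5 rows shown in the plot table. The row order in `raw` is hypothetical.

```python
from collections import Counter

# Hypothetical reconstruction of the 구분 column. The trailing space on four of
# the 구정참여 values is an assumption; only the lengths are known from the report.
raw = ["구정참여 "] * 4 + ["구정참여"] + ["동정참여"] * 19

raw_counts = Counter(raw)
mean_length = sum(len(value) for value in raw) / len(raw)

# Normalise by stripping leading and trailing whitespace before counting.
cleaned = [value.strip() for value in raw]
clean_counts = Counter(cleaned)

print(len(raw_counts), round(mean_length, 7))
print(dict(clean_counts))
```

If the extra character turns out to be something other than ordinary whitespace, `str.strip()` alone would not merge the variants, so inspecting the raw values with `repr()` first is a sensible step.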

사업명
Text

UNIQUE 

Distinct: 24
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 324.0 B

Length

Max length: 39
Median length: 25.5
Mean length: 19.416667
Min length: 8

Characters and Unicode

Total characters: 466
Distinct characters: 156
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 24
Unique (%): 100.0%

Sample

1st row: 독거노인 텃밭상자 지원사업
2nd row: 연수벚꽃길 경관개선 사업
3rd row: 연수구 내 공원에 꽃과 나무 명찰 설치(원도심)
4th row: 부수지 공원 환경개선
5th row: 부수지공원 운동 기구 및 차양설치

Value | Count | Frequency (%)
설치 | 5 | 4.4%
조성 | 4 | 3.5%
사업 | 3 | 2.6%
[not preserved] | 3 | 2.6%
위한 | 3 | 2.6%
새롭게 | 3 | 2.6%
안전하게 | 2 | 1.8%
꽃길 | 2 | 1.8%
안전한 | 2 | 1.8%
벚꽃로를 | 2 | 1.8%
Other values (83) | 85 | 74.6%

Most occurring characters

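Value | Count | Frequency (%)
(space) | 94 | 20.2%
[not preserved] | 9 | 1.9%
[not preserved] | 9 | 1.9%
[not preserved] | 9 | 1.9%
[not preserved] | 8 | 1.7%
[not preserved] | 8 | 1.7%
[not preserved] | 7 | 1.5%
[not preserved] | 7 | 1.5%
[not preserved] | 7 | 1.5%
[not preserved] | 7 | 1.5%
Other values (146) | 301 | 64.6%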

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 350 | 75.1%
Space Separator | 94 | 20.2%
Other Punctuation | 8 | 1.7%
Close Punctuation | 4 | 0.9%
Open Punctuation | 4 | 0.9%
Math Symbol | 3 | 0.6%
Decimal Number | 3 | 0.6%

Most frequent character per category

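Other Letter
Value | Count | Frequency (%)
[not preserved] | 9 | 2.6%
[not preserved] | 9 | 2.6%
[not preserved] | 9 | 2.6%
[not preserved] | 8 | 2.3%
[not preserved] | 8 | 2.3%
[not preserved] | 7 | 2.0%
[not preserved] | 7 | 2.0%
[not preserved] | 7 | 2.0%
[not preserved] | 7 | 2.0%
[not preserved] | 6 | 1.7%
Other values (137) | 273 | 78.0%

Decimal Number
Value | Count | Frequency (%)
5 | 1 | 33.3%
3 | 1 | 33.3%
1 | 1 | 33.3%

Other Punctuation
Value | Count | Frequency (%)
! | 6 | 75.0%
, | 2 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 94 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 4 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 4 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 3 | 100.0%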

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 350 | 75.1%
Common | 116 | 24.9%

Most frequent character per script

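Hangul
Value | Count | Frequency (%)
[not preserved] | 9 | 2.6%
[not preserved] | 9 | 2.6%
[not preserved] | 9 | 2.6%
[not preserved] | 8 | 2.3%
[not preserved] | 8 | 2.3%
[not preserved] | 7 | 2.0%
[not preserved] | 7 | 2.0%
[not preserved] | 7 | 2.0%
[not preserved] | 7 | 2.0%
[not preserved] | 6 | 1.7%
Other values (137) | 273 | 78.0%

Common
Value | Count | Frequency (%)
(space) | 94 | 81.0%
! | 6 | 5.2%
) | 4 | 3.4%
( | 4 | 3.4%
~ | 3 | 2.6%
, | 2 | 1.7%
5 | 1 | 0.9%
3 | 1 | 0.9%
1 | 1 | 0.9%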

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 350 | 75.1%
ASCII | 116 | 24.9%

Most frequent character per block

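ASCII
Value | Count | Frequency (%)
(space) | 94 | 81.0%
! | 6 | 5.2%
) | 4 | 3.4%
( | 4 | 3.4%
~ | 3 | 2.6%
, | 2 | 1.7%
5 | 1 | 0.9%
3 | 1 | 0.9%
1 | 1 | 0.9%

Hangul
Value | Count | Frequency (%)
[not preserved] | 9 | 2.6%
[not preserved] | 9 | 2.6%
[not preserved] | 9 | 2.6%
[not preserved] | 8 | 2.3%
[not preserved] | 8 | 2.3%
[not preserved] | 7 | 2.0%
[not preserved] | 7 | 2.0%
[not preserved] | 7 | 2.0%
[not preserved] | 7 | 2.0%
[not preserved] | 6 | 1.7%
Other values (137) | 273 | 78.0%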

소요예산(천원)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 20
Distinct (%): 83.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 54692.083
Minimum: 0
Maximum: 350000
Zeros: 2
Zeros (%): 8.3%
Negative: 0
Negative (%): 0.0%
Memory size: 348.0 B

Quantile statistics

Minimum: 0
5-th percentile: 375
Q1: 5000
Median: 35000
Q3: 72500
95-th percentile: 152800
Maximum: 350000
Range: 350000
Interquartile range (IQR): 67500

Descriptive statistics

Standard deviation: 75842.014
Coefficient of variation (CV): 1.3867092
Kurtosis: 9.8897496
Mean: 54692.083
Median Absolute Deviation (MAD): 32500
Skewness: 2.8133743
Sum: 1312610
Variance: 5.7520112 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=20)

Common values

Value | Count | Frequency (%)
2500 | 2 | 8.3%
5000 | 2 | 8.3%
0 | 2 | 8.3%
70000 | 2 | 8.3%
80000 | 1 | 4.2%
350000 | 1 | 4.2%
3000 | 1 | 4.2%
50000 | 1 | 4.2%
30000 | 1 | 4.2%
68000 | 1 | 4.2%
Other values (10) | 10 | 41.7%

Minimum 10 values

Value | Count | Frequency (%)
0 | 2 | 8.3%
2500 | 2 | 8.3%
3000 | 1 | 4.2%
5000 | 2 | 8.3%
8610 | 1 | 4.2%
10000 | 1 | 4.2%
14000 | 1 | 4.2%
16000 | 1 | 4.2%
30000 | 1 | 4.2%
40000 | 1 | 4.2%

Maximum 10 values

Value | Count | Frequency (%)
350000 | 1 | 4.2%
160000 | 1 | 4.2%
112000 | 1 | 4.2%
90000 | 1 | 4.2%
84000 | 1 | 4.2%
80000 | 1 | 4.2%
70000 | 2 | 8.3%
68000 | 1 | 4.2%
50000 | 1 | 4.2%
42000 | 1 | 4.2%
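
The ten smallest and ten largest distinct values listed for 소요예산(천원) account for 13 and 11 rows respectively, which is all 24 observations. The full set of budgets can therefore be rebuilt from the report alone and the summary statistics rechecked. The sketch below does that with the standard library. It assumes linear-interpolation quantiles and the sample standard deviation (divisor n - 1), which reproduce the reported figures; the helper name `quantile` is illustrative.

```python
import statistics

# (value, count) pairs copied from the smallest-values and largest-values tables,
# in thousand KRW.
smallest = [(0, 2), (2500, 2), (3000, 1), (5000, 2), (8610, 1),
            (10000, 1), (14000, 1), (16000, 1), (30000, 1), (40000, 1)]
largest = [(350000, 1), (160000, 1), (112000, 1), (90000, 1), (84000, 1),
           (80000, 1), (70000, 2), (68000, 1), (50000, 1), (42000, 1)]

# Expand the counts into one entry per observation and sort ascending.
budgets = sorted(v for v, count in smallest + largest for _ in range(count))


def quantile(ordered, q):
    """Linear-interpolation quantile of an already sorted list (0 <= q <= 1)."""
    position = q * (len(ordered) - 1)
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (position - lower) * (ordered[upper] - ordered[lower])


total = sum(budgets)
mean = total / len(budgets)
median = statistics.median(budgets)
std = statistics.stdev(budgets)                 # sample standard deviation
cv = std / mean
mad = statistics.median([abs(v - median) for v in budgets])
zeros = budgets.count(0)
distinct = len(set(budgets))
p05, q1, q3, p95 = (quantile(budgets, q) for q in (0.05, 0.25, 0.75, 0.95))

print(f"n={len(budgets)} sum={total} mean={mean:.3f} median={median}")
print(f"p05={p05:.0f} q1={q1:.0f} q3={q3:.0f} p95={p95:.0f} iqr={q3 - q1:.0f}")
print(f"std={std:.3f} cv={cv:.7f} mad={mad} zeros={zeros} distinct={distinct}")
```

The mean (about 54,692) sits well above the median (35,000) because of the single 350,000 entry, which matches the strong positive skewness reported above.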

Interactions


Correlations

Variable | 연번 | 구분 | 사업명 | 소요예산(천원)
연번 | 1.000 | 0.811 | 1.000 | 0.426
구분 | 0.811 | 1.000 | 1.000 | 0.910
사업명 | 1.000 | 1.000 | 1.000 | 1.000
소요예산(천원) | 0.426 | 0.910 | 1.000 | 1.000

Variable | 연번 | 소요예산(천원) | 구분
연번 | 1.000 | 0.097 | 0.570
소요예산(천원) | 0.097 | 1.000 | 0.588
구분 | 0.570 | 0.588 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 연번 | 구분 | 사업명 | 소요예산(천원)
0 | 1 | 구정참여 | 독거노인 텃밭상자 지원사업 | 2500
1 | 2 | 구정참여 | 연수벚꽃길 경관개선 사업 | 70000
2 | 3 | 구정참여 | 연수구 내 공원에 꽃과 나무 명찰 설치(원도심) | 5000
3 | 4 | 구정참여 | 부수지 공원 환경개선 | 112000
4 | 5 | 구정참여 | 부수지공원 운동 기구 및 차양설치 | 0
5 | 6 | 동정참여 | 쉼이 있는 휴식공간 만들기 | 40000
6 | 7 | 동정참여 | 라일락 나무길 조성 | 42000
7 | 8 | 동정참여 | 선학어린이공원 로고젝터 및 바닥돌 설치 | 16000
8 | 9 | 동정참여 | 선학별빛도서관 꽃길 조성 | 14000
9 | 10 | 동정참여 | 연수1동 골목 도로 재포장 | 160000

Last rows

Index | 연번 | 구분 | 사업명 | 소요예산(천원)
14 | 15 | 동정참여 | 녹지대 정비사업 | 70000
15 | 16 | 동정참여 | 동춘터널~송도파크레인동일하이빌 사이 봉재산 등산로 정비 | 80000
16 | 17 | 동정참여 | 인도 미끄럼 방지용 도포 | 90000
17 | 18 | 동정참여 | 행복길 개선 사업 | 10000
18 | 19 | 동정참여 | 꽃길 조성 및 조명등 설치 | 68000
19 | 20 | 동정참여 | 주민 이동이 많은 공원 주위로 공공 식수대 설치 | 30000
20 | 21 | 동정참여 | 어린이들의 안전한 통학을 위한 쉼터 조성 사업 | 50000
21 | 22 | 동정참여 | 자녀를 위한 조기 금융교육 | 3000
22 | 23 | 동정참여 | 보행약자(어린이, 노인)의 안전한 보행확보를 위한 스마트 횡단보도 설치 | 350000
23 | 24 | 동정참여 | 송도5동 관내 초등학교 앞 스마트횡단보도 설치 | 0