Overview

Dataset statistics

Number of variables: 8
Number of observations: 34
Missing cells: 23
Missing cells (%): 8.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.4 KiB
Average record size in memory: 70.9 B
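
The percentage and per-record figures above follow from the raw counts. A minimal Python sketch of the arithmetic, assuming the missing share is measured against all cells (variables × observations) and that the total size is the average record size times the number of records; the variable names are illustrative only:

```python
# Counts taken from the dataset statistics above.
n_variables = 8
n_observations = 34
missing_cells = 23

# Share of missing cells among all cells (assumed definition).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
print(f"{missing_pct:.1f}%")  # 8.5%

# An average record size of 70.9 B implies roughly this total size.
total_bytes = 70.9 * n_observations
print(f"{total_bytes / 1024:.1f} KiB")  # 2.4 KiB
```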

Variable types

Numeric: 2
Categorical: 5
Text: 1

Dataset

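Description: Data on how resident participatory budgeting projects in Namdong-gu, Incheon Metropolitan City were reflected in the budget. It provides the serial number, proposal year, project name, project department, review result, requested amount, reflected amount, and data reference date.
Author: Namdong-gu, Incheon Metropolitan City (인천광역시 남동구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15087648&srcSe=7661IVAWM27C61E190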

Alerts

데이터기준일자 (data reference date) has constant value "2023-05-15" [Constant]
비고 (remarks) is highly overall correlated with 연번 (serial number) and 2 other fields [High correlation]
검토결과 (review result) is highly overall correlated with 연번 and 2 other fields [High correlation]
연번 is highly overall correlated with 반영액(천원) (reflected amount, thousand KRW) and 2 other fields [High correlation]
반영액(천원) is highly overall correlated with 연번 and 2 other fields [High correlation]
제안 연도 (proposal year) is highly overall correlated with 사업부서 (project department) [High correlation]
사업부서 is highly overall correlated with 반영액(천원) and 2 other fields [High correlation]
제안 연도 is highly imbalanced (80.9%) [Imbalance]
반영액(천원) has 23 (67.6%) missing values [Missing]
연번 has unique values [Unique]
사업명 (project name) has unique values [Unique]

Reproduction

Analysis started: 2024-03-18 05:57:58.780271
Analysis finished: 2024-03-18 05:57:59.949835
Duration: 1.17 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
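
A report of this kind can typically be regenerated along the following lines. This is only a sketch: the CSV path, the `cp949` encoding, the report title, and the helper name `build_report` are assumptions, not details recorded in this report.

```python
def build_report(csv_path: str, output_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report (paths are hypothetical)."""
    # Imported lazily so this module loads even where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the published file is a CSV. Korean public datasets are often
    # encoded as cp949; change `encoding` if the file is UTF-8.
    df = pd.read_csv(csv_path, encoding="cp949")

    # The title is a placeholder chosen for this sketch.
    report = ProfileReport(df, title="Overview")
    report.to_file(output_path)
```

Calling `build_report("data.csv")` with a real file path would write `report.html` next to the script.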

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17.5
Minimum: 1
Maximum: 34
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 438.0 B

Quantile statistics

Minimum: 1
5th percentile: 2.65
Q1: 9.25
Median: 17.5
Q3: 25.75
95th percentile: 32.35
Maximum: 34
Range: 33
Interquartile range (IQR): 16.5

Descriptive statistics

Standard deviation: 9.9582462
Coefficient of variation (CV): 0.56904264
Kurtosis: -1.2
Mean: 17.5
Median Absolute Deviation (MAD): 8.5
Skewness: 0
Sum: 595
Variance: 99.166667
Monotonicity: Strictly increasing

Histogram with fixed size bins (bins=34)
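
Because 연번 is simply the sequence 1 to 34, the descriptive statistics above can be checked by hand. The sketch below uses only the Python standard library and assumes the report uses the sample (n − 1) variance, which is what reproduces the reported 99.166667:

```python
import statistics

values = list(range(1, 35))  # 연번 runs 1..34, strictly increasing

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median distance from the median.
mad = statistics.median(abs(v - median) for v in values)

print(sum(values), mean, median, mad)   # 595 17.5 17.5 8.5
print(round(variance, 6), round(std, 7), round(cv, 8))
```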
Value | Count | Frequency (%)
1 | 1 | 2.9%
27 | 1 | 2.9%
21 | 1 | 2.9%
22 | 1 | 2.9%
23 | 1 | 2.9%
24 | 1 | 2.9%
25 | 1 | 2.9%
26 | 1 | 2.9%
28 | 1 | 2.9%
19 | 1 | 2.9%
Other values (24) | 24 | 70.6%

Value | Count | Frequency (%)
1 | 1 | 2.9%
2 | 1 | 2.9%
3 | 1 | 2.9%
4 | 1 | 2.9%
5 | 1 | 2.9%
6 | 1 | 2.9%
7 | 1 | 2.9%
8 | 1 | 2.9%
9 | 1 | 2.9%
10 | 1 | 2.9%

Value | Count | Frequency (%)
34 | 1 | 2.9%
33 | 1 | 2.9%
32 | 1 | 2.9%
31 | 1 | 2.9%
30 | 1 | 2.9%
29 | 1 | 2.9%
28 | 1 | 2.9%
27 | 1 | 2.9%
26 | 1 | 2.9%
25 | 1 | 2.9%

제안 연도
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 5.9%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 1
Unique (%): 2.9%

Sample

1st row: 2019
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value | Count | Frequency (%)
2022 | 33 | 97.1%
2019 | 1 | 2.9%
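
The 80.9% imbalance flagged for this variable is consistent with a score of one minus the normalised Shannon entropy of the class frequencies. The sketch below assumes that definition (it is not stated in this report) and uses the counts from the Common Values table:

```python
import math

counts = {"2022": 33, "2019": 1}  # from the Common Values table

total = sum(counts.values())
probs = [c / total for c in counts.values()]

# Shannon entropy in bits, normalised by the maximum possible entropy
# for this number of classes (log2 of the class count).
entropy = -sum(p * math.log2(p) for p in probs)
max_entropy = math.log2(len(counts))

imbalance = 1 - entropy / max_entropy
print(f"{imbalance:.1%}")  # 80.9%
```

A perfectly balanced variable would score 0% under this definition, and a variable with a single dominant class approaches 100%.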

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2022 | 33 | 97.1%
2019 | 1 | 2.9%

사업명
Text

UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 43
Median length: 26
Mean length: 20.911765
Min length: 11

Characters and Unicode

Total characters: 711
Distinct characters: 196
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
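
As an illustration of those character properties, the general category of each code point can be read with Python's standard `unicodedata` module. The sketch below uses one 사업명 value from this report's sample rows; it shows the idea only and is not the profiler's own implementation:

```python
import unicodedata
from collections import Counter

text = "여성 1인 가구 등의 주거안심홈세트 지원"  # a 사업명 value from the sample

# General category of each code point: Lo = Other Letter,
# Zs = Space Separator, Nd = Decimal Number.
categories = Counter(unicodedata.category(ch) for ch in text)
print(dict(categories))  # {'Lo': 16, 'Zs': 5, 'Nd': 1}
```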

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: 세대통합 복합시설 건립
2nd row: 여성 1인 가구 등의 주거안심홈세트 지원
3rd row: 남촌동 안심마을 조성
4th row: 만수어린이공원 공원등 조도 개선 및 설치
5th row: 늘솔길근린공원 내 발지압 자갈길 조성

Value | Count | Frequency (%)
설치 | 14 | 8.2%
조성 | 6 | 3.5%
 | 5 | 2.9%
논현동 | 4 | 2.4%
만수6동 | 4 | 2.4%
 | 3 | 1.8%
어린이공원 | 2 | 1.2%
 | 2 | 1.2%
교체 | 2 | 1.2%
가로등 | 2 | 1.2%
Other values (119) | 126 | 74.1%

Most occurring characters

Value | Count | Frequency (%)
(space) | 137 | 19.3%
 | 23 | 3.2%
 | 18 | 2.5%
 | 17 | 2.4%
 | 15 | 2.1%
 | 14 | 2.0%
 | 13 | 1.8%
 | 11 | 1.5%
 | 11 | 1.5%
 | 10 | 1.4%
Other values (186) | 442 | 62.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 542 | 76.2%
Space Separator | 137 | 19.3%
Decimal Number | 20 | 2.8%
Close Punctuation | 4 | 0.6%
Open Punctuation | 4 | 0.6%
Uppercase Letter | 4 | 0.6%

Most frequent character per category

Other Letter

Value | Count | Frequency (%)
 | 23 | 4.2%
 | 18 | 3.3%
 | 17 | 3.1%
 | 15 | 2.8%
 | 14 | 2.6%
 | 13 | 2.4%
 | 11 | 2.0%
 | 11 | 2.0%
 | 10 | 1.8%
 | 10 | 1.8%
Other values (170) | 400 | 73.8%

Decimal Number

Value | Count | Frequency (%)
1 | 5 | 25.0%
6 | 4 | 20.0%
0 | 3 | 15.0%
5 | 2 | 10.0%
2 | 1 | 5.0%
7 | 1 | 5.0%
4 | 1 | 5.0%
9 | 1 | 5.0%
3 | 1 | 5.0%
8 | 1 | 5.0%

Uppercase Letter

Value | Count | Frequency (%)
C | 2 | 50.0%
V | 1 | 25.0%
T | 1 | 25.0%

Space Separator

Value | Count | Frequency (%)
(space) | 137 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 4 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 542 | 76.2%
Common | 165 | 23.2%
Latin | 4 | 0.6%

Most frequent character per script

Hangul

Value | Count | Frequency (%)
 | 23 | 4.2%
 | 18 | 3.3%
 | 17 | 3.1%
 | 15 | 2.8%
 | 14 | 2.6%
 | 13 | 2.4%
 | 11 | 2.0%
 | 11 | 2.0%
 | 10 | 1.8%
 | 10 | 1.8%
Other values (170) | 400 | 73.8%

Common

Value | Count | Frequency (%)
(space) | 137 | 83.0%
1 | 5 | 3.0%
) | 4 | 2.4%
( | 4 | 2.4%
6 | 4 | 2.4%
0 | 3 | 1.8%
5 | 2 | 1.2%
2 | 1 | 0.6%
7 | 1 | 0.6%
4 | 1 | 0.6%
Other values (3) | 3 | 1.8%

Latin

Value | Count | Frequency (%)
C | 2 | 50.0%
V | 1 | 25.0%
T | 1 | 25.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 542 | 76.2%
ASCII | 169 | 23.8%

Most frequent character per block

ASCII

Value | Count | Frequency (%)
(space) | 137 | 81.1%
1 | 5 | 3.0%
) | 4 | 2.4%
( | 4 | 2.4%
6 | 4 | 2.4%
0 | 3 | 1.8%
C | 2 | 1.2%
5 | 2 | 1.2%
V | 1 | 0.6%
T | 1 | 0.6%
Other values (6) | 6 | 3.6%

Hangul

Value | Count | Frequency (%)
 | 23 | 4.2%
 | 18 | 3.3%
 | 17 | 3.1%
 | 15 | 2.8%
 | 14 | 2.6%
 | 13 | 2.4%
 | 11 | 2.0%
 | 11 | 2.0%
 | 10 | 1.8%
 | 10 | 1.8%
Other values (170) | 400 | 73.8%

사업부서
Categorical

HIGH CORRELATION 

Distinct: 13
Distinct (%): 38.2%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 12
Median length: 5
Mean length: 5.3823529
Min length: 3

Unique

Unique: 8
Unique (%): 23.5%

Sample

1st row: 공영개발과
2nd row: 여성가족과
3rd row: 여성가족과
4th row: 공원녹지과
5th row: 공원녹지과

Common Values

Value | Count | Frequency (%)
공원녹지과 | 13 | 38.2%
교통행정과 | 4 | 11.8%
건설과+도로과 | 4 | 11.8%
여성가족과 | 3 | 8.8%
평생교육과 | 2 | 5.9%
공영개발과 | 1 | 2.9%
안전총괄과 | 1 | 2.9%
문화관광과 | 1 | 2.9%
체육진흥과 | 1 | 2.9%
자동차관리과 | 1 | 2.9%
Other values (3) | 3 | 8.8%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
공원녹지과 | 13 | 38.2%
교통행정과 | 4 | 11.8%
건설과+도로과 | 4 | 11.8%
여성가족과 | 3 | 8.8%
평생교육과 | 2 | 5.9%
공영개발과 | 1 | 2.9%
안전총괄과 | 1 | 2.9%
문화관광과 | 1 | 2.9%
체육진흥과 | 1 | 2.9%
자동차관리과 | 1 | 2.9%
Other values (3) | 3 | 8.8%

검토결과
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 11.8%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 4
Median length: 2
Mean length: 2.6470588
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 반영
2nd row: 반영
3rd row: 반영
4th row: 반영
5th row: 반영

Common Values

Value | Count | Frequency (%)
불가 (not feasible) | 12 | 35.3%
반영 (reflected) | 11 | 32.4%
추후반영 (to be reflected later) | 7 | 20.6%
장기과제 (long-term task) | 4 | 11.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
불가 | 12 | 35.3%
반영 | 11 | 32.4%
추후반영 | 7 | 20.6%
장기과제 | 4 | 11.8%

반영액(천원)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 10
Distinct (%): 90.9%
Missing: 23
Missing (%): 67.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 141423.82
Minimum: 5000
Maximum: 1346126
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 438.0 B

Quantile statistics

Minimum: 5000
5th percentile: 5250
Q1: 7018
Median: 12000
Q3: 40000
95th percentile: 708063
Maximum: 1346126
Range: 1341126
Interquartile range (IQR): 32982

Descriptive statistics

Standard deviation: 400089.76
Coefficient of variation (CV): 2.8290125
Kurtosis: 10.921314
Mean: 141423.82
Median Absolute Deviation (MAD): 6500
Skewness: 3.3006008
Sum: 1555662
Variance: 1.6007181 × 10^11
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=10)
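
The quantile figures for 반영액(천원) can be reproduced from the 11 non-missing values listed in this section. The sketch below assumes quantiles are computed by linear interpolation between neighbouring ranks, which matches the reported numbers; the helper `quantile` is written for this illustration only:

```python
import statistics

# The 11 non-missing values, taken from the value counts in this section.
amounts = sorted([5000, 5500, 5636, 8400, 10000, 12000, 13000,
                  40000, 40000, 70000, 1346126])

def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

q1, q3 = quantile(amounts, 0.25), quantile(amounts, 0.75)
median = statistics.median(amounts)
# Median absolute deviation: median distance from the median.
mad = statistics.median(abs(a - median) for a in amounts)

print(quantile(amounts, 0.05), q1, median, q3, quantile(amounts, 0.95))
print(q3 - q1, mad, sum(amounts))
```

The single 1,346,126 value pulls the mean (141423.82) far above the median (12000), which is why the median and MAD describe the typical amount better than the mean and standard deviation here.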
Value | Count | Frequency (%)
40000 | 2 | 5.9%
1346126 | 1 | 2.9%
10000 | 1 | 2.9%
70000 | 1 | 2.9%
12000 | 1 | 2.9%
13000 | 1 | 2.9%
5636 | 1 | 2.9%
8400 | 1 | 2.9%
5500 | 1 | 2.9%
5000 | 1 | 2.9%
(Missing) | 23 | 67.6%

Value | Count | Frequency (%)
5000 | 1 | 2.9%
5500 | 1 | 2.9%
5636 | 1 | 2.9%
8400 | 1 | 2.9%
10000 | 1 | 2.9%
12000 | 1 | 2.9%
13000 | 1 | 2.9%
40000 | 2 | 5.9%
70000 | 1 | 2.9%
1346126 | 1 | 2.9%

Value | Count | Frequency (%)
1346126 | 1 | 2.9%
70000 | 1 | 2.9%
40000 | 2 | 5.9%
13000 | 1 | 2.9%
12000 | 1 | 2.9%
10000 | 1 | 2.9%
8400 | 1 | 2.9%
5636 | 1 | 2.9%
5500 | 1 | 2.9%
5000 | 1 | 2.9%

비고
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 8.8%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 9
Median length: 4
Mean length: 5.5294118
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023년 본예산
2nd row: 2023년 본예산
3rd row: 2023년 본예산
4th row: 2023년 본예산
5th row: 2023년 본예산

Common Values

Value | Count | Frequency (%)
<NA> | 23 | 67.6%
2023년 본예산 | 8 | 23.5%
2022년 예산 | 3 | 8.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 23 | 51.1%
2023년 | 8 | 17.8%
본예산 | 8 | 17.8%
2022년 | 3 | 6.7%
예산 | 3 | 6.7%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-05-15
2nd row: 2023-05-15
3rd row: 2023-05-15
4th row: 2023-05-15
5th row: 2023-05-15

Common Values

Value | Count | Frequency (%)
2023-05-15 | 34 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-05-15 | 34 | 100.0%

Interactions


Correlations

 | 연번 | 제안 연도 | 사업명 | 사업부서 | 검토결과 | 반영액(천원) | 비고
연번 | 1.000 | 0.000 | 1.000 | 0.360 | 0.914 | 0.000 | 0.930
제안 연도 | 0.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.494 | 0.000
사업명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
사업부서 | 0.360 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000
검토결과 | 0.914 | 0.000 | 1.000 | 0.000 | 1.000 | NaN | NaN
반영액(천원) | 0.000 | 0.494 | 1.000 | 1.000 | NaN | 1.000 | 0.000
비고 | 0.930 | 0.000 | 1.000 | 1.000 | NaN | 0.000 | 1.000

 | 비고 | 제안 연도 | 검토결과 | 사업부서
비고 | 1.000 | 0.000 | 1.000 | 0.745
제안 연도 | 0.000 | 1.000 | 0.000 | 0.810
검토결과 | 1.000 | 0.000 | 1.000 | 0.000
사업부서 | 0.745 | 0.810 | 0.000 | 1.000

 | 연번 | 반영액(천원) | 제안 연도 | 사업부서 | 검토결과 | 비고
연번 | 1.000 | -0.670 | 0.000 | 0.086 | 0.719 | 0.662
반영액(천원) | -0.670 | 1.000 | 0.337 | 0.745 | 1.000 | 0.000
제안 연도 | 0.000 | 0.337 | 1.000 | 0.810 | 0.000 | 0.000
사업부서 | 0.086 | 0.745 | 0.810 | 1.000 | 0.000 | 0.745
검토결과 | 0.719 | 1.000 | 0.000 | 0.000 | 1.000 | 1.000
비고 | 0.662 | 0.000 | 0.000 | 0.745 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Index | 연번 | 제안 연도 | 사업명 | 사업부서 | 검토결과 | 반영액(천원) | 비고 | 데이터기준일자
0 | 1 | 2019 | 세대통합 복합시설 건립 | 공영개발과 | 반영 | 1346126 | 2023년 본예산 | 2023-05-15
1 | 2 | 2022 | 여성 1인 가구 등의 주거안심홈세트 지원 | 여성가족과 | 반영 | 10000 | 2023년 본예산 | 2023-05-15
2 | 3 | 2022 | 남촌동 안심마을 조성 | 여성가족과 | 반영 | 70000 | 2023년 본예산 | 2023-05-15
3 | 4 | 2022 | 만수어린이공원 공원등 조도 개선 및 설치 | 공원녹지과 | 반영 | 12000 | 2023년 본예산 | 2023-05-15
4 | 5 | 2022 | 늘솔길근린공원 내 발지압 자갈길 조성 | 공원녹지과 | 반영 | 13000 | 2023년 본예산 | 2023-05-15
5 | 6 | 2022 | 새남촌공영주차장 불량조명 교체 | 교통행정과 | 반영 | 5636 | 2023년 본예산 | 2023-05-15
6 | 7 | 2022 | 은봉로 실개천 화단 조성 | 공원녹지과 | 반영 | 40000 | 2023년 본예산 | 2023-05-15
7 | 8 | 2022 | 남동어린이공원 노후시설 정비 | 공원녹지과 | 반영 | 40000 | 2023년 본예산 | 2023-05-15
8 | 9 | 2022 | 신규 스마트 그늘막 설치 | 안전총괄과 | 반영 | 8400 | 2022년 예산 | 2023-05-15
9 | 10 | 2022 | 청능대로 주요사거리 볼라드 설치 건 | 건설과+도로과 | 반영 | 5500 | 2022년 예산 | 2023-05-15

Last rows

Index | 연번 | 제안 연도 | 사업명 | 사업부서 | 검토결과 | 반영액(천원) | 비고 | 데이터기준일자
24 | 25 | 2022 | 청소년 생태활동가 육성 프로그램 개설 | 평생교육과 | 불가 | <NA> | <NA> | 2023-05-15
25 | 26 | 2022 | 경력단절 여성 대상 사서 양성교육 | 여성가족과+일자리정책과 | 불가 | <NA> | <NA> | 2023-05-15
26 | 27 | 2022 | 평생학습관 앞 시계탑 설치 | 여성가족과 | 불가 | <NA> | <NA> | 2023-05-15
27 | 28 | 2022 | 관내 공공놀이터 그늘막 설치 | 공원녹지과 | 불가 | <NA> | <NA> | 2023-05-15
28 | 29 | 2022 | 만수산 무장애나눔길 길목 표지판 및 가로등 설치 | 공원녹지과 | 불가 | <NA> | <NA> | 2023-05-15
29 | 30 | 2022 | 거머리산 등산로 둘레길 교체 공사 실시 | 공원녹지과 | 불가 | <NA> | <NA> | 2023-05-15
30 | 31 | 2022 | 논현동 한화에코 10단지 앞 무단횡단 방지시설 설치 | 교통행정과 | 불가 | <NA> | <NA> | 2023-05-15
31 | 32 | 2022 | 만수동 1059번지 일원 무단횡단방지 시설물 설치 | 교통행정과 | 불가 | <NA> | <NA> | 2023-05-15
32 | 33 | 2022 | 논현동 이안아파트 공사현장 맞은편 보도블럭 설치 | 건설과+도로과 | 불가 | <NA> | <NA> | 2023-05-15
33 | 34 | 2022 | 만수6동 행정복지센터의 낮은 돌담장을 휴식벤치로 개조 | 만수6동 | 불가 | <NA> | <NA> | 2023-05-15