Overview

Dataset statistics

Number of variables6
Number of observations40
Missing cells26
Missing cells (%)10.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.1 KiB
Average record size in memory53.3 B

Variable types

Numeric2
Text3
Categorical1

Dataset

Description경기도 고양시 주민참여예산 반영사업 데이터로 연번, 제안구분, 담당부서, 사업명, 소요예산, 비고(제안명) 등의 항목을 제공합니다.
Author경기도 고양시
URLhttps://www.data.go.kr/data/15127194/fileData.do

Alerts

비고(제안명) has 26 (65.0%) missing valuesMissing
연번 has unique valuesUnique
사업명 has unique valuesUnique

Reproduction

Analysis started2024-03-23 05:34:58.227972
Analysis finished2024-03-23 05:34:59.729863
Duration1.5 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct40
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.5
Minimum1
Maximum40
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size492.0 B
2024-03-23T14:34:59.850059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.95
Q110.75
median20.5
Q330.25
95-th percentile38.05
Maximum40
Range39
Interquartile range (IQR)19.5

Descriptive statistics

Standard deviation11.690452
Coefficient of variation (CV)0.57026595
Kurtosis-1.2
Mean20.5
Median Absolute Deviation (MAD)10
Skewness0
Sum820
Variance136.66667
MonotonicityStrictly increasing
2024-03-23T14:35:00.192708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
1 1
 
2.5%
22 1
 
2.5%
24 1
 
2.5%
25 1
 
2.5%
26 1
 
2.5%
27 1
 
2.5%
28 1
 
2.5%
29 1
 
2.5%
30 1
 
2.5%
31 1
 
2.5%
Other values (30) 30
75.0%
ValueCountFrequency (%)
1 1
2.5%
2 1
2.5%
3 1
2.5%
4 1
2.5%
5 1
2.5%
6 1
2.5%
7 1
2.5%
8 1
2.5%
9 1
2.5%
10 1
2.5%
ValueCountFrequency (%)
40 1
2.5%
39 1
2.5%
38 1
2.5%
37 1
2.5%
36 1
2.5%
35 1
2.5%
34 1
2.5%
33 1
2.5%
32 1
2.5%
31 1
2.5%
Distinct21
Distinct (%)52.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
2024-03-23T14:35:00.632204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length9.4
Min length8

Characters and Unicode

Total characters376
Distinct characters47
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)27.5%

Sample

1st row지역회의(주교동)
2nd row지역회의(흥도동)
3rd row지역회의(능곡동)
4th row지역회의(행신3동)
5th row지역회의(행신3동)
ValueCountFrequency (%)
시민제안(인터넷 7
17.5%
지역회의(대덕동 3
 
7.5%
지역회의(주엽1동 3
 
7.5%
시민제안(방문 3
 
7.5%
지역회의(행신3동 3
 
7.5%
지역회의(중산1동 2
 
5.0%
지역회의(백석2동 2
 
5.0%
지역회의(성사1동 2
 
5.0%
지역회의(대화동 2
 
5.0%
지역회의(덕이동 2
 
5.0%
Other values (11) 11
27.5%
2024-03-23T14:35:01.200210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 40
 
10.6%
( 40
 
10.6%
30
 
8.0%
30
 
8.0%
30
 
8.0%
30
 
8.0%
30
 
8.0%
1 10
 
2.7%
10
 
2.7%
10
 
2.7%
Other values (37) 116
30.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 277
73.7%
Close Punctuation 40
 
10.6%
Open Punctuation 40
 
10.6%
Decimal Number 19
 
5.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
10.8%
30
 
10.8%
30
 
10.8%
30
 
10.8%
30
 
10.8%
10
 
3.6%
10
 
3.6%
10
 
3.6%
10
 
3.6%
7
 
2.5%
Other values (31) 80
28.9%
Decimal Number
ValueCountFrequency (%)
1 10
52.6%
2 5
26.3%
3 3
 
15.8%
4 1
 
5.3%
Close Punctuation
ValueCountFrequency (%)
) 40
100.0%
Open Punctuation
ValueCountFrequency (%)
( 40
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 277
73.7%
Common 99
 
26.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
10.8%
30
 
10.8%
30
 
10.8%
30
 
10.8%
30
 
10.8%
10
 
3.6%
10
 
3.6%
10
 
3.6%
10
 
3.6%
7
 
2.5%
Other values (31) 80
28.9%
Common
ValueCountFrequency (%)
) 40
40.4%
( 40
40.4%
1 10
 
10.1%
2 5
 
5.1%
3 3
 
3.0%
4 1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 277
73.7%
ASCII 99
 
26.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 40
40.4%
( 40
40.4%
1 10
 
10.1%
2 5
 
5.1%
3 3
 
3.0%
4 1
 
1.0%
Hangul
ValueCountFrequency (%)
30
 
10.8%
30
 
10.8%
30
 
10.8%
30
 
10.8%
30
 
10.8%
10
 
3.6%
10
 
3.6%
10
 
3.6%
10
 
3.6%
7
 
2.5%
Other values (31) 80
28.9%

담당부서
Categorical

Distinct15
Distinct (%)37.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
도로관리과
일산동구 환경녹지과
일산동구 안전건설과
일산서구 환경녹지과
덕양구 안전건설과
Other values (10)
18 

Length

Max length10
Median length9
Mean length7.85
Min length3

Unique

Unique4 ?
Unique (%)10.0%

Sample

1st row버스정책과
2nd row덕양구 안전건설과
3rd row재난대응담당관
4th row도로관리과
5th row도로관리과

Common Values

ValueCountFrequency (%)
도로관리과 6
15.0%
일산동구 환경녹지과 5
12.5%
일산동구 안전건설과 4
10.0%
일산서구 환경녹지과 4
10.0%
덕양구 안전건설과 3
7.5%
교통정책과 3
7.5%
덕양구 환경녹지과 3
7.5%
버스정책과 2
 
5.0%
일산서구 안전건설과 2
 
5.0%
덕양구 청소농정과 2
 
5.0%
Other values (5) 6
15.0%

Length

2024-03-23T14:35:01.470555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
환경녹지과 12
19.0%
일산동구 9
14.3%
안전건설과 9
14.3%
덕양구 8
12.7%
도로관리과 7
11.1%
일산서구 6
9.5%
교통정책과 3
 
4.8%
버스정책과 2
 
3.2%
청소농정과 2
 
3.2%
덕양공원관리과 2
 
3.2%
Other values (3) 3
 
4.8%

사업명
Text

UNIQUE 

Distinct40
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
2024-03-23T14:35:01.896276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length24
Mean length18.375
Min length7

Characters and Unicode

Total characters735
Distinct characters182
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)100.0%

Sample

1st row버스정류장 설치
2nd row마을안도로 재포장(성사동 232)
3rd row야외 그늘막 설치
4th row백양고등학교 인근 보행로 환경개선 사업
5th row성신초등학교 주변 환경정비
ValueCountFrequency (%)
설치 14
 
7.6%
교체 6
 
3.2%
정비 5
 
2.7%
무단투기 4
 
2.2%
4
 
2.2%
cctv 4
 
2.2%
쓰레기 3
 
1.6%
3
 
1.6%
조성 3
 
1.6%
환경개선 3
 
1.6%
Other values (121) 136
73.5%
2024-03-23T14:35:02.502072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
145
 
19.7%
24
 
3.3%
19
 
2.6%
18
 
2.4%
15
 
2.0%
14
 
1.9%
13
 
1.8%
12
 
1.6%
12
 
1.6%
11
 
1.5%
Other values (172) 452
61.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 542
73.7%
Space Separator 145
 
19.7%
Uppercase Letter 19
 
2.6%
Decimal Number 16
 
2.2%
Close Punctuation 5
 
0.7%
Open Punctuation 5
 
0.7%
Other Punctuation 2
 
0.3%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
4.4%
19
 
3.5%
18
 
3.3%
15
 
2.8%
14
 
2.6%
13
 
2.4%
12
 
2.2%
12
 
2.2%
11
 
2.0%
10
 
1.8%
Other values (152) 394
72.7%
Decimal Number
ValueCountFrequency (%)
3 4
25.0%
2 3
18.8%
8 2
12.5%
6 2
12.5%
1 2
12.5%
4 1
 
6.2%
7 1
 
6.2%
5 1
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
C 8
42.1%
V 4
21.1%
T 4
21.1%
L 1
 
5.3%
E 1
 
5.3%
D 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
, 1
50.0%
· 1
50.0%
Space Separator
ValueCountFrequency (%)
145
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 542
73.7%
Common 174
 
23.7%
Latin 19
 
2.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
4.4%
19
 
3.5%
18
 
3.3%
15
 
2.8%
14
 
2.6%
13
 
2.4%
12
 
2.2%
12
 
2.2%
11
 
2.0%
10
 
1.8%
Other values (152) 394
72.7%
Common
ValueCountFrequency (%)
145
83.3%
) 5
 
2.9%
( 5
 
2.9%
3 4
 
2.3%
2 3
 
1.7%
8 2
 
1.1%
6 2
 
1.1%
1 2
 
1.1%
, 1
 
0.6%
· 1
 
0.6%
Other values (4) 4
 
2.3%
Latin
ValueCountFrequency (%)
C 8
42.1%
V 4
21.1%
T 4
21.1%
L 1
 
5.3%
E 1
 
5.3%
D 1
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 542
73.7%
ASCII 192
 
26.1%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
145
75.5%
C 8
 
4.2%
) 5
 
2.6%
( 5
 
2.6%
V 4
 
2.1%
T 4
 
2.1%
3 4
 
2.1%
2 3
 
1.6%
8 2
 
1.0%
6 2
 
1.0%
Other values (9) 10
 
5.2%
Hangul
ValueCountFrequency (%)
24
 
4.4%
19
 
3.5%
18
 
3.3%
15
 
2.8%
14
 
2.6%
13
 
2.4%
12
 
2.2%
12
 
2.2%
11
 
2.0%
10
 
1.8%
Other values (152) 394
72.7%
None
ValueCountFrequency (%)
· 1
100.0%

소요예산(천원)
Real number (ℝ)

Distinct31
Distinct (%)77.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33092.5
Minimum1000
Maximum300000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size492.0 B
2024-03-23T14:35:02.712254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile1785
Q15750
median12750
Q330000
95-th percentile131000
Maximum300000
Range299000
Interquartile range (IQR)24250

Descriptive statistics

Standard deviation55742.063
Coefficient of variation (CV)1.6844319
Kurtosis13.56493
Mean33092.5
Median Absolute Deviation (MAD)10250
Skewness3.4110099
Sum1323700
Variance3.1071776 × 109
MonotonicityNot monotonic
2024-03-23T14:35:02.957417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
12000 3
 
7.5%
6000 3
 
7.5%
10000 2
 
5.0%
15000 2
 
5.0%
3000 2
 
5.0%
4000 2
 
5.0%
30000 2
 
5.0%
29500 1
 
2.5%
12500 1
 
2.5%
1500 1
 
2.5%
Other values (21) 21
52.5%
ValueCountFrequency (%)
1000 1
 
2.5%
1500 1
 
2.5%
1800 1
 
2.5%
2000 1
 
2.5%
3000 2
5.0%
4000 2
5.0%
4800 1
 
2.5%
5000 1
 
2.5%
6000 3
7.5%
8000 1
 
2.5%
ValueCountFrequency (%)
300000 1
2.5%
150000 1
2.5%
130000 1
2.5%
120000 1
2.5%
84800 1
2.5%
50000 1
2.5%
45000 1
2.5%
35000 1
2.5%
30200 1
2.5%
30000 2
5.0%

비고(제안명)
Text

MISSING 

Distinct12
Distinct (%)85.7%
Missing26
Missing (%)65.0%
Memory size452.0 B
2024-03-23T14:35:03.406241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length27
Mean length23.571429
Min length8

Characters and Unicode

Total characters330
Distinct characters113
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)71.4%

Sample

1st row성신초등학교 주변 횡단보도 바닥신호기 설치 등 환경정비 사업
2nd row쌍굴터널 및 대덕교 하부 청소를 통한 환경개선
3rd row쌍굴터널 및 대덕교 하부 청소를 통한 환경개선
4th row4단지 앞 보행로 노후 보도블록 교체
5th row일산로636번길 환경 개선 사업
ValueCountFrequency (%)
7
 
8.0%
설치 7
 
8.0%
사업 6
 
6.9%
교체 3
 
3.4%
개선 2
 
2.3%
운동기구 2
 
2.3%
정비 2
 
2.3%
2
 
2.3%
횡단보도 2
 
2.3%
쌍굴터널 2
 
2.3%
Other values (45) 52
59.8%
2024-03-23T14:35:04.059689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
73
 
22.1%
9
 
2.7%
7
 
2.1%
7
 
2.1%
7
 
2.1%
7
 
2.1%
7
 
2.1%
7
 
2.1%
6
 
1.8%
6
 
1.8%
Other values (103) 194
58.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 242
73.3%
Space Separator 73
 
22.1%
Decimal Number 10
 
3.0%
Close Punctuation 1
 
0.3%
Dash Punctuation 1
 
0.3%
Open Punctuation 1
 
0.3%
Uppercase Letter 1
 
0.3%
Math Symbol 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
3.7%
7
 
2.9%
7
 
2.9%
7
 
2.9%
7
 
2.9%
7
 
2.9%
7
 
2.9%
6
 
2.5%
6
 
2.5%
5
 
2.1%
Other values (93) 174
71.9%
Decimal Number
ValueCountFrequency (%)
6 4
40.0%
3 4
40.0%
4 1
 
10.0%
1 1
 
10.0%
Space Separator
ValueCountFrequency (%)
73
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 242
73.3%
Common 87
 
26.4%
Latin 1
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
3.7%
7
 
2.9%
7
 
2.9%
7
 
2.9%
7
 
2.9%
7
 
2.9%
7
 
2.9%
6
 
2.5%
6
 
2.5%
5
 
2.1%
Other values (93) 174
71.9%
Common
ValueCountFrequency (%)
73
83.9%
6 4
 
4.6%
3 4
 
4.6%
) 1
 
1.1%
- 1
 
1.1%
( 1
 
1.1%
~ 1
 
1.1%
4 1
 
1.1%
1 1
 
1.1%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 242
73.3%
ASCII 88
 
26.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
73
83.0%
6 4
 
4.5%
3 4
 
4.5%
) 1
 
1.1%
- 1
 
1.1%
( 1
 
1.1%
A 1
 
1.1%
~ 1
 
1.1%
4 1
 
1.1%
1 1
 
1.1%
Hangul
ValueCountFrequency (%)
9
 
3.7%
7
 
2.9%
7
 
2.9%
7
 
2.9%
7
 
2.9%
7
 
2.9%
7
 
2.9%
6
 
2.5%
6
 
2.5%
5
 
2.1%
Other values (93) 174
71.9%

Interactions

2024-03-23T14:34:59.107259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:34:58.707322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:34:59.271386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T14:34:58.882872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T14:35:04.223444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번제안구분담당부서사업명소요예산(천원)비고(제안명)
연번1.0000.7870.7451.0000.4100.974
제안구분0.7871.0000.7941.0000.0001.000
담당부서0.7450.7941.0001.0000.0000.681
사업명1.0001.0001.0001.0001.0001.000
소요예산(천원)0.4100.0000.0001.0001.0000.000
비고(제안명)0.9741.0000.6811.0000.0001.000
2024-03-23T14:35:04.397242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소요예산(천원)담당부서
연번1.000-0.0980.340
소요예산(천원)-0.0981.0000.000
담당부서0.3400.0001.000

Missing values

2024-03-23T14:34:59.450850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T14:34:59.648887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번제안구분담당부서사업명소요예산(천원)비고(제안명)
01지역회의(주교동)버스정책과버스정류장 설치30000<NA>
12지역회의(흥도동)덕양구 안전건설과마을안도로 재포장(성사동 232)12000<NA>
23지역회의(능곡동)재난대응담당관야외 그늘막 설치1800<NA>
34지역회의(행신3동)도로관리과백양고등학교 인근 보행로 환경개선 사업130000<NA>
45지역회의(행신3동)도로관리과성신초등학교 주변 환경정비45000성신초등학교 주변 횡단보도 바닥신호기 설치 등 환경정비 사업
56지역회의(행신4동)도로관리과서정1·3보도육교 바닥 정비120000<NA>
67지역회의(대덕동)덕양구 안전건설과쌍굴터널 LED터널등 교체4800쌍굴터널 및 대덕교 하부 청소를 통한 환경개선
78지역회의(대덕동)도로관리과대덕교 등 시설물 환경개선50000쌍굴터널 및 대덕교 하부 청소를 통한 환경개선
89지역회의(백석2동)일산동구 안전건설과백석동 오피스텔단지 주변 보도블록 교체2000<NA>
910지역회의(고봉동)일산동구 안전건설과사리현동 보행로 쉼터 조성1000<NA>
연번제안구분담당부서사업명소요예산(천원)비고(제안명)
3031지역회의(중산1동)일산동구 환경녹지과중산마을 8단지 사거리 녹지 화단 조성1500<NA>
3132지역회의(백석2동)일산공원관리과용천공원 휴게용 벤치 및 벤치형 그네 설치13000<NA>
3233지역회의(마두2동)일산동구 환경녹지과백마로 녹지대 환경정비(벤치, 파고라 등 정비)29500<NA>
3334지역회의(일산1동)일산서구 환경녹지과쓰레기 무단투기 감시용 이동식 CCTV 설치8000<NA>
3435지역회의(주엽1동)일산서구 환경녹지과고봉로 소재 학교 인근 쉼터 개보수 및 안전등 설치84800<NA>
3536지역회의(덕이동)일산서구 환경녹지과관내 무단투기 지역 CCTV 설치12000<NA>
3637시민제안(인터넷)덕양공원관리과소만공원 운동기구 정비3000소만공원 운동기구 교체 및 캐노피 설치 사업
3738시민제안(인터넷)일산동구 환경녹지과걷고 싶은길 가로조성(숲 같은 가로 환경 조성)25000<NA>
3839시민제안(인터넷)덕양구 환경녹지과화수로 사거리 심야시간 안전한 보행환경과 도시미관 개선35000<NA>
3940시민제안(방문)일산동구 환경녹지과안곡중 부근 도로변에서 단독주택쪽 진입로 평탄화 작업 및 초화식재 요청30200<NA>