Overview

Dataset statistics

Number of variables8
Number of observations93
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.1 KiB
Average record size in memory67.4 B

Variable types

Numeric2
Categorical5
Text1

Dataset

Description인천광역시 서구 주민참여예산위원회 현황에 대한 데이터로 의원구분, 이름, 연령대, 성별, 소속단체또는 직업, 분과 등을 제공합니다.
Author인천광역시 서구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15090922&srcSe=7661IVAWM27C61E190

Alerts

데이터기준일자 has constant value ""Constant
의원구분 is highly imbalanced (89.2%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-18 02:01:08.647403
Analysis finished2024-03-18 02:01:09.416714
Duration0.77 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct93
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47
Minimum1
Maximum93
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size969.0 B
2024-03-18T11:01:09.503165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.6
Q124
median47
Q370
95-th percentile88.4
Maximum93
Range92
Interquartile range (IQR)46

Descriptive statistics

Standard deviation26.990739
Coefficient of variation (CV)0.57427105
Kurtosis-1.2
Mean47
Median Absolute Deviation (MAD)23
Skewness0
Sum4371
Variance728.5
MonotonicityStrictly increasing
2024-03-18T11:01:09.613605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
60 1
 
1.1%
69 1
 
1.1%
68 1
 
1.1%
67 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
62 1
 
1.1%
Other values (83) 83
89.2%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
93 1
1.1%
92 1
1.1%
91 1
1.1%
90 1
1.1%
89 1
1.1%
88 1
1.1%
87 1
1.1%
86 1
1.1%
85 1
1.1%
84 1
1.1%

의원구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size876.0 B
위원
91 
부위원장
 
1
위원장
 
1

Length

Max length4
Median length2
Mean length2.0322581
Min length2

Unique

Unique2 ?
Unique (%)2.2%

Sample

1st row위원
2nd row위원
3rd row위원
4th row위원
5th row부위원장

Common Values

ValueCountFrequency (%)
위원 91
97.8%
부위원장 1
 
1.1%
위원장 1
 
1.1%

Length

2024-03-18T11:01:09.718629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T11:01:09.798895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
위원 91
97.8%
부위원장 1
 
1.1%
위원장 1
 
1.1%

이름
Text

Distinct92
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size876.0 B
2024-03-18T11:01:09.999219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9569892
Min length2

Characters and Unicode

Total characters275
Distinct characters95
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique91 ?
Unique (%)97.8%

Sample

1st row박은우
2nd row백영순
3rd row석지영
4th row임기현
5th row정성미
ValueCountFrequency (%)
이수민 2
 
2.2%
김재경 1
 
1.1%
남춘례 1
 
1.1%
이경주 1
 
1.1%
김민주 1
 
1.1%
김미영 1
 
1.1%
이지희 1
 
1.1%
이공순 1
 
1.1%
오승환 1
 
1.1%
심기보 1
 
1.1%
Other values (82) 82
88.2%
2024-03-18T11:01:10.321948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
5.8%
13
 
4.7%
12
 
4.4%
10
 
3.6%
9
 
3.3%
8
 
2.9%
7
 
2.5%
7
 
2.5%
7
 
2.5%
7
 
2.5%
Other values (85) 179
65.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 275
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
5.8%
13
 
4.7%
12
 
4.4%
10
 
3.6%
9
 
3.3%
8
 
2.9%
7
 
2.5%
7
 
2.5%
7
 
2.5%
7
 
2.5%
Other values (85) 179
65.1%

Most occurring scripts

ValueCountFrequency (%)
Hangul 275
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
5.8%
13
 
4.7%
12
 
4.4%
10
 
3.6%
9
 
3.3%
8
 
2.9%
7
 
2.5%
7
 
2.5%
7
 
2.5%
7
 
2.5%
Other values (85) 179
65.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 275
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
16
 
5.8%
13
 
4.7%
12
 
4.4%
10
 
3.6%
9
 
3.3%
8
 
2.9%
7
 
2.5%
7
 
2.5%
7
 
2.5%
7
 
2.5%
Other values (85) 179
65.1%

연령대
Real number (ℝ)

Distinct6
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49.139785
Minimum20
Maximum70
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size969.0 B
2024-03-18T11:01:10.429746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile30
Q140
median50
Q360
95-th percentile60
Maximum70
Range50
Interquartile range (IQR)20

Descriptive statistics

Standard deviation11.292159
Coefficient of variation (CV)0.22979667
Kurtosis-0.70496455
Mean49.139785
Median Absolute Deviation (MAD)10
Skewness-0.38316761
Sum4570
Variance127.51286
MonotonicityNot monotonic
2024-03-18T11:01:10.575206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
60 32
34.4%
50 25
26.9%
40 21
22.6%
30 11
 
11.8%
70 3
 
3.2%
20 1
 
1.1%
ValueCountFrequency (%)
20 1
 
1.1%
30 11
 
11.8%
40 21
22.6%
50 25
26.9%
60 32
34.4%
70 3
 
3.2%
ValueCountFrequency (%)
70 3
 
3.2%
60 32
34.4%
50 25
26.9%
40 21
22.6%
30 11
 
11.8%
20 1
 
1.1%

성별
Categorical

Distinct2
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size876.0 B
여성
47 
남성
46 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row여성
2nd row여성
3rd row여성
4th row남성
5th row여성

Common Values

ValueCountFrequency (%)
여성 47
50.5%
남성 46
49.5%

Length

2024-03-18T11:01:10.716967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T11:01:10.789094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
여성 47
50.5%
남성 46
49.5%
Distinct29
Distinct (%)31.2%
Missing0
Missing (%)0.0%
Memory size876.0 B
주민자치회
30 
<NA>
29 
청년참여단
 
3
아라동 주민자치회
 
3
오류왕길동 주민자치회
 
2
Other values (24)
26 

Length

Max length18
Median length17
Mean length6.2688172
Min length2

Unique

Unique22 ?
Unique (%)23.7%

Sample

1st row주민자치회
2nd row지역화폐 민관운영위원회
3rd row<NA>
4th row<NA>
5th row주민자치회

Common Values

ValueCountFrequency (%)
주민자치회 30
32.3%
<NA> 29
31.2%
청년참여단 3
 
3.2%
아라동 주민자치회 3
 
3.2%
오류왕길동 주민자치회 2
 
2.2%
원당동 통장자율회 2
 
2.2%
지역화폐 민관운영위원회 2
 
2.2%
정책자문소통위원회 1
 
1.1%
서구사회적기업협의회 1
 
1.1%
서구청정책자문위원, 보장협의체위원 1
 
1.1%
Other values (19) 19
20.4%

Length

2024-03-18T11:01:10.886536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
주민자치회 37
33.6%
na 29
26.4%
청년참여단 3
 
2.7%
아라동 3
 
2.7%
원당동 3
 
2.7%
통장자율회 3
 
2.7%
청라2동 2
 
1.8%
오류왕길동 2
 
1.8%
지역화폐 2
 
1.8%
민관운영위원회 2
 
1.8%
Other values (24) 24
21.8%

분과
Categorical

Distinct6
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size876.0 B
자치행정
20 
환경안전
18 
도시주택
16 
복지문화
14 
경제교통
13 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경제교통
2nd row환경안전
3rd row경제교통
4th row미래기획
5th row경제교통

Common Values

ValueCountFrequency (%)
자치행정 20
21.5%
환경안전 18
19.4%
도시주택 16
17.2%
복지문화 14
15.1%
경제교통 13
14.0%
미래기획 12
12.9%

Length

2024-03-18T11:01:10.987576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T11:01:11.068693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자치행정 20
21.5%
환경안전 18
19.4%
도시주택 16
17.2%
복지문화 14
15.1%
경제교통 13
14.0%
미래기획 12
12.9%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size876.0 B
2023-12-31
93 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-12-31
2nd row2023-12-31
3rd row2023-12-31
4th row2023-12-31
5th row2023-12-31

Common Values

ValueCountFrequency (%)
2023-12-31 93
100.0%

Length

2024-03-18T11:01:11.166215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T11:01:11.236779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-12-31 93
100.0%

Interactions

2024-03-18T11:01:09.098618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T11:01:08.978598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T11:01:09.160486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T11:01:09.039715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-18T11:01:11.291244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번의원구분이름연령대성별소속단체또는 직업분과
연번1.0000.0000.9340.1070.5030.6720.650
의원구분0.0001.0001.0000.0940.0010.0000.244
이름0.9341.0001.0000.9551.0000.9730.817
연령대0.1070.0940.9551.0000.2950.8540.307
성별0.5030.0011.0000.2951.0000.3580.439
소속단체또는 직업0.6720.0000.9730.8540.3581.0000.327
분과0.6500.2440.8170.3070.4390.3271.000
2024-03-18T11:01:11.378845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
의원구분분과성별소속단체또는 직업
의원구분1.0000.0990.0000.000
분과0.0991.0000.3090.085
성별0.0000.3091.0000.202
소속단체또는 직업0.0000.0850.2021.000
2024-03-18T11:01:11.463845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번연령대의원구분성별소속단체또는 직업분과
연번1.000-0.0610.0000.3690.2420.404
연령대-0.0611.0000.0290.2060.4800.113
의원구분0.0000.0291.0000.0000.0000.099
성별0.3690.2060.0001.0000.2020.309
소속단체또는 직업0.2420.4800.0000.2021.0000.085
분과0.4040.1130.0990.3090.0851.000

Missing values

2024-03-18T11:01:09.260957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T11:01:09.365657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번의원구분이름연령대성별소속단체또는 직업분과데이터기준일자
01위원박은우40여성주민자치회경제교통2023-12-31
12위원백영순40여성지역화폐 민관운영위원회환경안전2023-12-31
23위원석지영40여성<NA>경제교통2023-12-31
34위원임기현20남성<NA>미래기획2023-12-31
45부위원장정성미50여성주민자치회경제교통2023-12-31
56위원노미숙50여성주민자치회도시주택2023-12-31
67위원마성호70남성<NA>자치행정2023-12-31
78위원박진희50여성주민자치회환경안전2023-12-31
89위원이태호60남성주민자치회복지문화2023-12-31
910위원정은주60여성<NA>경제교통2023-12-31
연번의원구분이름연령대성별소속단체또는 직업분과데이터기준일자
8384위원이준호40남성서구체육회사회이사환경안전2023-12-31
8485위원주현석30남성오류왕길동 주민자치회환경안전2023-12-31
8586위원이태림60남성주민자치회환경안전2023-12-31
8687위원한우식30남성<NA>환경안전2023-12-31
8788위원함혜연40여성마전동주민자치회환경안전2023-12-31
8889위원강현선70남성<NA>환경안전2023-12-31
8990위원김태현40남성아라동 주민자치회환경안전2023-12-31
9091위원박영미50남성<NA>환경안전2023-12-31
9192위원신명종50남성아라동 주민자치회환경안전2023-12-31
9293위원이준혁40남성아라동 주민자치회환경안전2023-12-31