Overview

Dataset statistics

Number of variables5
Number of observations2500
Missing cells2500
Missing cells (%)20.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory107.6 KiB
Average record size in memory44.1 B

Variable types

Categorical1
Numeric2
Text1
Unsupported1

Dataset

Description회계년도,시민참여단 코드,시민참여단 이름,자치단체코드,투표 상태
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15721/S/1/datasetView.do

Alerts

회계년도 has constant value ""Constant
투표 상태 has 2500 (100.0%) missing valuesMissing
시민참여단 코드 has unique valuesUnique
투표 상태 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-11 06:51:30.065972
Analysis finished2024-05-11 06:51:31.718071
Duration1.65 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

회계년도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size19.7 KiB
2014
2500 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2014
2nd row2014
3rd row2014
4th row2014
5th row2014

Common Values

ValueCountFrequency (%)
2014 2500
100.0%

Length

2024-05-11T15:51:32.204071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T15:51:32.619148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2014 2500
100.0%

시민참여단 코드
Real number (ℝ)

UNIQUE 

Distinct2500
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1250.5
Minimum1
Maximum2500
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size22.1 KiB
2024-05-11T15:51:32.920500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile125.95
Q1625.75
median1250.5
Q31875.25
95-th percentile2375.05
Maximum2500
Range2499
Interquartile range (IQR)1249.5

Descriptive statistics

Standard deviation721.83216
Coefficient of variation (CV)0.57723483
Kurtosis-1.2
Mean1250.5
Median Absolute Deviation (MAD)625
Skewness0
Sum3126250
Variance521041.67
MonotonicityNot monotonic
2024-05-11T15:51:33.173031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
999 1
 
< 0.1%
1744 1
 
< 0.1%
1750 1
 
< 0.1%
175 1
 
< 0.1%
1749 1
 
< 0.1%
1748 1
 
< 0.1%
1747 1
 
< 0.1%
1746 1
 
< 0.1%
1745 1
 
< 0.1%
1743 1
 
< 0.1%
Other values (2490) 2490
99.6%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2500 1
< 0.1%
2499 1
< 0.1%
2498 1
< 0.1%
2497 1
< 0.1%
2496 1
< 0.1%
2495 1
< 0.1%
2494 1
< 0.1%
2493 1
< 0.1%
2492 1
< 0.1%
2491 1
< 0.1%
Distinct2340
Distinct (%)93.6%
Missing0
Missing (%)0.0%
Memory size19.7 KiB
2024-05-11T15:51:33.734879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.9944
Min length2

Characters and Unicode

Total characters7486
Distinct characters253
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2209 ?
Unique (%)88.4%

Sample

1st row황범준
2nd row홍나영
3rd row허예지
4th row한정일
5th row한은분
ValueCountFrequency (%)
김정희 5
 
0.2%
이수정 4
 
0.2%
이정희 3
 
0.1%
이상희 3
 
0.1%
박성현 3
 
0.1%
이현주 3
 
0.1%
이지은 3
 
0.1%
김영순 3
 
0.1%
이용희 3
 
0.1%
이선자 3
 
0.1%
Other values (2330) 2467
98.7%
2024-05-11T15:51:34.523915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
546
 
7.3%
382
 
5.1%
304
 
4.1%
219
 
2.9%
193
 
2.6%
172
 
2.3%
152
 
2.0%
144
 
1.9%
134
 
1.8%
122
 
1.6%
Other values (243) 5118
68.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7486
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
546
 
7.3%
382
 
5.1%
304
 
4.1%
219
 
2.9%
193
 
2.6%
172
 
2.3%
152
 
2.0%
144
 
1.9%
134
 
1.8%
122
 
1.6%
Other values (243) 5118
68.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7486
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
546
 
7.3%
382
 
5.1%
304
 
4.1%
219
 
2.9%
193
 
2.6%
172
 
2.3%
152
 
2.0%
144
 
1.9%
134
 
1.8%
122
 
1.6%
Other values (243) 5118
68.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7486
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
546
 
7.3%
382
 
5.1%
304
 
4.1%
219
 
2.9%
193
 
2.6%
172
 
2.3%
152
 
2.0%
144
 
1.9%
134
 
1.8%
122
 
1.6%
Other values (243) 5118
68.4%

자치단체코드
Real number (ℝ)

Distinct25
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3120000
Minimum3000000
Maximum3240000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size22.1 KiB
2024-05-11T15:51:34.791654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3000000
5-th percentile3010000
Q13060000
median3120000
Q33180000
95-th percentile3230000
Maximum3240000
Range240000
Interquartile range (IQR)120000

Descriptive statistics

Standard deviation72125.452
Coefficient of variation (CV)0.023117132
Kurtosis-1.2038535
Mean3120000
Median Absolute Deviation (MAD)60000
Skewness0
Sum7.8 × 109
Variance5.2020808 × 109
MonotonicityNot monotonic
2024-05-11T15:51:35.022820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
3090000 100
 
4.0%
3220000 100
 
4.0%
3050000 100
 
4.0%
3190000 100
 
4.0%
3130000 100
 
4.0%
3120000 100
 
4.0%
3210000 100
 
4.0%
3030000 100
 
4.0%
3070000 100
 
4.0%
3230000 100
 
4.0%
Other values (15) 1500
60.0%
ValueCountFrequency (%)
3000000 100
4.0%
3010000 100
4.0%
3020000 100
4.0%
3030000 100
4.0%
3040000 100
4.0%
3050000 100
4.0%
3060000 100
4.0%
3070000 100
4.0%
3080000 100
4.0%
3090000 100
4.0%
ValueCountFrequency (%)
3240000 100
4.0%
3230000 100
4.0%
3220000 100
4.0%
3210000 100
4.0%
3200000 100
4.0%
3190000 100
4.0%
3180000 100
4.0%
3170000 100
4.0%
3160000 100
4.0%
3150000 100
4.0%

투표 상태
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2500
Missing (%)100.0%
Memory size22.1 KiB

Interactions

2024-05-11T15:51:30.805488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:51:30.462611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:51:30.959650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:51:30.624965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T15:51:35.194493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시민참여단 코드자치단체코드
시민참여단 코드1.0000.938
자치단체코드0.9381.000
2024-05-11T15:51:35.336044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시민참여단 코드자치단체코드
시민참여단 코드1.000-0.457
자치단체코드-0.4571.000

Missing values

2024-05-11T15:51:31.305649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T15:51:31.599810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

회계년도시민참여단 코드시민참여단 이름자치단체코드투표 상태
02014999황범준3090000<NA>
12014998홍나영3090000<NA>
22014997허예지3090000<NA>
32014996한정일3090000<NA>
42014995한은분3090000<NA>
52014994하치원3090000<NA>
62014993최원종3090000<NA>
72014992최성곤3090000<NA>
82014991최낙훈3090000<NA>
92014990진용직3090000<NA>
회계년도시민참여단 코드시민참여단 이름자치단체코드투표 상태
249020141006권혁숙3050000<NA>
249120141005권지혜3050000<NA>
249220141004강현구3050000<NA>
249320141003강은경3050000<NA>
249420141002강병훈3050000<NA>
249520141001강민구3050000<NA>
249620141000황혜민3090000<NA>
24972014100황은영3220000<NA>
2498201410김기화3220000<NA>
249920141강동우3220000<NA>