Overview

Dataset statistics

Number of variables6
Number of observations432
Missing cells432
Missing cells (%)16.7%
Duplicate rows1
Duplicate rows (%)0.2%
Total size in memory21.2 KiB
Average record size in memory50.3 B

Variable types

Categorical4
Numeric1
Unsupported1

Dataset

Description부산광역시_해운대구_인구현황_20180331
Author부산광역시 해운대구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3039794

Alerts

행정기관명 has constant value ""Constant
Dataset has 1 (0.2%) duplicate rowsDuplicates
Unnamed: 5 has 432 (100.0%) missing valuesMissing
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
인구수 has 31 (7.2%) zerosZeros

Reproduction

Analysis started2024-04-21 12:05:16.441549
Analysis finished2024-04-21 12:05:17.175204
Duration0.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.5 KiB
부산광역시 해운대구
432 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시 해운대구
2nd row부산광역시 해운대구
3rd row부산광역시 해운대구
4th row부산광역시 해운대구
5th row부산광역시 해운대구

Common Values

ValueCountFrequency (%)
부산광역시 해운대구 432
100.0%

Length

2024-04-21T21:05:17.278865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T21:05:17.445707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 432
50.0%
해운대구 432
50.0%

동명
Categorical

Distinct18
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size3.5 KiB
반여제3동
 
26
우제1동
 
24
재송제2동
 
24
우제3동
 
24
중제1동
 
24
Other values (13)
310 

Length

Max length5
Median length4.5
Mean length4.3888889
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row우제1동
2nd row우제1동
3rd row우제1동
4th row우제1동
5th row우제1동

Common Values

ValueCountFrequency (%)
반여제3동 26
 
6.0%
우제1동 24
 
5.6%
재송제2동 24
 
5.6%
우제3동 24
 
5.6%
중제1동 24
 
5.6%
중제2동 24
 
5.6%
좌제1동 24
 
5.6%
좌제2동 24
 
5.6%
좌제3동 24
 
5.6%
좌제4동 24
 
5.6%
Other values (8) 190
44.0%

Length

2024-04-21T21:05:17.638270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
반여제3동 26
 
6.0%
우제1동 24
 
5.6%
우제2동 24
 
5.6%
재송제1동 24
 
5.6%
반송제2동 24
 
5.6%
반송제1동 24
 
5.6%
반여제4동 24
 
5.6%
반여제1동 24
 
5.6%
송정동 24
 
5.6%
좌제4동 24
 
5.6%
Other values (8) 190
44.0%

연령
Categorical

Distinct12
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size3.5 KiB
0세 - 9세
36 
10세 - 19세
36 
20세 - 29세
36 
30세 - 39세
36 
40세 - 49세
36 
Other values (7)
252 

Length

Max length11
Median length9
Mean length8.8333333
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0세 - 9세
2nd row0세 - 9세
3rd row10세 - 19세
4th row10세 - 19세
5th row20세 - 29세

Common Values

ValueCountFrequency (%)
0세 - 9세 36
8.3%
10세 - 19세 36
8.3%
20세 - 29세 36
8.3%
30세 - 39세 36
8.3%
40세 - 49세 36
8.3%
50세 - 59세 36
8.3%
60세 - 69세 36
8.3%
70세 - 79세 36
8.3%
80세 - 89세 36
8.3%
90세 - 99세 36
8.3%
Other values (2) 72
16.7%

Length

2024-04-21T21:05:17.878004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
396
31.4%
0세 36
 
2.9%
60세 36
 
2.9%
110세 36
 
2.9%
109세 36
 
2.9%
100세 36
 
2.9%
99세 36
 
2.9%
90세 36
 
2.9%
89세 36
 
2.9%
80세 36
 
2.9%
Other values (15) 540
42.9%

성별구분
Categorical

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.5 KiB
216 
216 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
216
50.0%
216
50.0%

Length

2024-04-21T21:05:18.092561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T21:05:18.264329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
216
50.0%
216
50.0%

인구수
Real number (ℝ)

ZEROS 

Distinct337
Distinct (%)78.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean953.79398
Minimum0
Maximum3910
Zeros31
Zeros (%)7.2%
Negative0
Negative (%)0.0%
Memory size3.9 KiB
2024-04-21T21:05:18.464705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1103.75
median830
Q31498.75
95-th percentile2569.8
Maximum3910
Range3910
Interquartile range (IQR)1395

Descriptive statistics

Standard deviation861.97431
Coefficient of variation (CV)0.90373217
Kurtosis0.03649947
Mean953.79398
Median Absolute Deviation (MAD)701
Skewness0.78331276
Sum412039
Variance742999.71
MonotonicityNot monotonic
2024-04-21T21:05:18.726184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 31
 
7.2%
1 13
 
3.0%
2 9
 
2.1%
3 7
 
1.6%
4 4
 
0.9%
362 3
 
0.7%
5 3
 
0.7%
1040 3
 
0.7%
1959 2
 
0.5%
1992 2
 
0.5%
Other values (327) 355
82.2%
ValueCountFrequency (%)
0 31
7.2%
1 13
3.0%
2 9
 
2.1%
3 7
 
1.6%
4 4
 
0.9%
5 3
 
0.7%
6 2
 
0.5%
7 2
 
0.5%
9 2
 
0.5%
10 1
 
0.2%
ValueCountFrequency (%)
3910 1
0.2%
3786 1
0.2%
3708 1
0.2%
3414 1
0.2%
3300 1
0.2%
3290 1
0.2%
3140 1
0.2%
3084 1
0.2%
3047 1
0.2%
2995 1
0.2%

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing432
Missing (%)100.0%
Memory size3.9 KiB

Interactions

2024-04-21T21:05:16.699958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T21:05:18.902029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동명연령성별구분인구수
동명1.0000.0000.0000.432
연령0.0001.0000.0000.667
성별구분0.0000.0001.0000.000
인구수0.4320.6670.0001.000
2024-04-21T21:05:19.056105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연령성별구분동명
연령1.0000.0000.000
성별구분0.0001.0000.000
동명0.0000.0001.000
2024-04-21T21:05:19.201564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인구수동명연령성별구분
인구수1.0000.1790.3550.000
동명0.1791.0000.0000.000
연령0.3550.0001.0000.000
성별구분0.0000.0000.0001.000

Missing values

2024-04-21T21:05:16.918849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T21:05:17.103953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정기관명동명연령성별구분인구수Unnamed: 5
0부산광역시 해운대구우제1동0세 - 9세677<NA>
1부산광역시 해운대구우제1동0세 - 9세637<NA>
2부산광역시 해운대구우제1동10세 - 19세845<NA>
3부산광역시 해운대구우제1동10세 - 19세834<NA>
4부산광역시 해운대구우제1동20세 - 29세1530<NA>
5부산광역시 해운대구우제1동20세 - 29세1371<NA>
6부산광역시 해운대구우제1동30세 - 39세1401<NA>
7부산광역시 해운대구우제1동30세 - 39세1463<NA>
8부산광역시 해운대구우제1동40세 - 49세1667<NA>
9부산광역시 해운대구우제1동40세 - 49세1695<NA>
행정기관명동명연령성별구분인구수Unnamed: 5
422부산광역시 해운대구재송제2동70세 - 79세750<NA>
423부산광역시 해운대구재송제2동70세 - 79세929<NA>
424부산광역시 해운대구재송제2동80세 - 89세181<NA>
425부산광역시 해운대구재송제2동80세 - 89세435<NA>
426부산광역시 해운대구재송제2동90세 - 99세12<NA>
427부산광역시 해운대구재송제2동90세 - 99세60<NA>
428부산광역시 해운대구재송제2동100세 - 109세1<NA>
429부산광역시 해운대구재송제2동100세 - 109세2<NA>
430부산광역시 해운대구재송제2동110세 이상0<NA>
431부산광역시 해운대구재송제2동110세 이상3<NA>

Duplicate rows

Most frequently occurring

행정기관명동명연령성별구분인구수# duplicates
0부산광역시 해운대구반여제3동110세 이상32