Overview

Dataset statistics

Number of variables3
Number of observations3639
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory96.1 KiB
Average record size in memory27.0 B

Variable types

Numeric3

Dataset

Description합계출산율(행정구역, 합계출산율) 정보를 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=96

Alerts

기준연도 is highly overall correlated with 합계출산율High correlation
합계출산율 is highly overall correlated with 기준연도High correlation

Reproduction

Analysis started2024-01-09 21:44:36.501881
Analysis finished2024-01-09 21:44:37.422390
Duration0.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준연도
Real number (ℝ)

HIGH CORRELATION 

Distinct13
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2015.9907
Minimum2010
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size32.1 KiB
2024-01-10T06:44:37.463941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2010
5-th percentile2010
Q12013
median2016
Q32019
95-th percentile2022
Maximum2022
Range12
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.7406171
Coefficient of variation (CV)0.0018554734
Kurtosis-1.2135076
Mean2015.9907
Median Absolute Deviation (MAD)3
Skewness0.0046073921
Sum7336190
Variance13.992216
MonotonicityIncreasing
2024-01-10T06:44:37.553898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
2014 282
 
7.7%
2015 282
 
7.7%
2012 281
 
7.7%
2013 281
 
7.7%
2010 280
 
7.7%
2011 280
 
7.7%
2016 279
 
7.7%
2017 279
 
7.7%
2018 279
 
7.7%
2019 279
 
7.7%
Other values (3) 837
23.0%
ValueCountFrequency (%)
2010 280
7.7%
2011 280
7.7%
2012 281
7.7%
2013 281
7.7%
2014 282
7.7%
2015 282
7.7%
2016 279
7.7%
2017 279
7.7%
2018 279
7.7%
2019 279
7.7%
ValueCountFrequency (%)
2022 279
7.7%
2021 279
7.7%
2020 279
7.7%
2019 279
7.7%
2018 279
7.7%
2017 279
7.7%
2016 279
7.7%
2015 282
7.7%
2014 282
7.7%
2013 281
7.7%

행정구역코드
Real number (ℝ)

Distinct290
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28096.864
Minimum0
Maximum39020
Zeros13
Zeros (%)0.4%
Negative0
Negative (%)0.0%
Memory size32.1 KiB
2024-01-10T06:44:37.655856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile36
Q123040
median31350
Q336025
95-th percentile38113.1
Maximum39020
Range39020
Interquartile range (IQR)12985

Descriptive statistics

Standard deviation10605.236
Coefficient of variation (CV)0.37745265
Kurtosis0.9118053
Mean28096.864
Median Absolute Deviation (MAD)5310
Skewness-1.3609769
Sum1.0224449 × 108
Variance1.1247102 × 108
MonotonicityNot monotonic
2024-01-10T06:44:37.765100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 13
 
0.4%
35 13
 
0.4%
35040 13
 
0.4%
35030 13
 
0.4%
35020 13
 
0.4%
35012 13
 
0.4%
35011 13
 
0.4%
35010 13
 
0.4%
34380 13
 
0.4%
36330 13
 
0.4%
Other values (280) 3509
96.4%
ValueCountFrequency (%)
0 13
0.4%
11 13
0.4%
21 13
0.4%
22 13
0.4%
23 13
0.4%
24 13
0.4%
25 13
0.4%
26 13
0.4%
29 11
0.3%
31 13
0.4%
ValueCountFrequency (%)
39020 13
0.4%
39010 13
0.4%
38400 13
0.4%
38390 13
0.4%
38380 13
0.4%
38370 13
0.4%
38360 13
0.4%
38350 13
0.4%
38340 13
0.4%
38330 13
0.4%

합계출산율
Real number (ℝ)

HIGH CORRELATION 

Distinct178
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.1720747
Minimum0.38
Maximum2.54
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size32.1 KiB
2024-01-10T06:44:37.878885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.38
5-th percentile0.71
Q10.95
median1.16
Q31.36
95-th percentile1.69
Maximum2.54
Range2.16
Interquartile range (IQR)0.41

Descriptive statistics

Standard deviation0.30368501
Coefficient of variation (CV)0.25910038
Kurtosis0.44863147
Mean1.1720747
Median Absolute Deviation (MAD)0.21
Skewness0.48206394
Sum4265.18
Variance0.092224584
MonotonicityNot monotonic
2024-01-10T06:44:37.983769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.01 65
 
1.8%
1.18 59
 
1.6%
1.0 56
 
1.5%
1.19 54
 
1.5%
1.08 54
 
1.5%
1.05 53
 
1.5%
1.23 53
 
1.5%
1.06 50
 
1.4%
0.91 50
 
1.4%
1.03 49
 
1.3%
Other values (168) 3096
85.1%
ValueCountFrequency (%)
0.38 1
 
< 0.1%
0.42 1
 
< 0.1%
0.44 1
 
< 0.1%
0.45 1
 
< 0.1%
0.46 3
0.1%
0.47 3
0.1%
0.48 1
 
< 0.1%
0.49 1
 
< 0.1%
0.5 3
0.1%
0.52 2
0.1%
ValueCountFrequency (%)
2.54 1
< 0.1%
2.47 1
< 0.1%
2.46 2
0.1%
2.43 1
< 0.1%
2.42 1
< 0.1%
2.41 1
< 0.1%
2.35 1
< 0.1%
2.34 1
< 0.1%
2.28 1
< 0.1%
2.19 1
< 0.1%

Interactions

2024-01-10T06:44:37.107496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:44:36.623962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:44:36.858888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:44:37.179615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:44:36.707765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:44:36.946763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:44:37.259063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:44:36.788675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:44:37.031326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:44:38.048129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연도행정구역코드합계출산율
기준연도1.0000.0000.597
행정구역코드0.0001.0000.401
합계출산율0.5970.4011.000
2024-01-10T06:44:38.124538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연도행정구역코드합계출산율
기준연도1.0000.000-0.589
행정구역코드0.0001.0000.392
합계출산율-0.5890.3921.000

Missing values

2024-01-10T06:44:37.340486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:44:37.397883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준연도행정구역코드합계출산율
0201001.23
12010111.02
22010110100.84
32010110201.06
42010110301.02
52010110401.03
62010110500.94
72010110600.94
82010110701.01
92010110801.05
기준연도행정구역코드합계출산율
36292022383400.66
36302022383500.94
36312022383600.88
36322022383700.69
36332022383800.68
36342022383900.84
36352022384001.0
36362022390.92
36372022390100.93
36382022390200.9