Overview

Dataset statistics

Number of variables3
Number of observations94
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory28.4 B

Variable types

Numeric3

Dataset

Description연령대,생년,회원수
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15474/S/1/datasetView.do

Alerts

연령대 is highly overall correlated with 생년High correlation
생년 is highly overall correlated with 연령대High correlation
생년 has unique valuesUnique
연령대 has 6 (6.4%) zerosZeros

Reproduction

Analysis started2024-05-11 06:41:43.154659
Analysis finished2024-05-11 06:41:46.619365
Duration3.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연령대
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct10
Distinct (%)10.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean45.957447
Minimum0
Maximum90
Zeros6
Zeros (%)6.4%
Negative0
Negative (%)0.0%
Memory size978.0 B
2024-05-11T15:41:46.743049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q120
median50
Q370
95-th percentile90
Maximum90
Range90
Interquartile range (IQR)50

Descriptive statistics

Standard deviation27.486816
Coefficient of variation (CV)0.59809275
Kurtosis-1.1714012
Mean45.957447
Median Absolute Deviation (MAD)20
Skewness-0.010397714
Sum4320
Variance755.52505
MonotonicityDecreasing
2024-05-11T15:41:46.984839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
80 10
10.6%
70 10
10.6%
60 10
10.6%
50 10
10.6%
40 10
10.6%
30 10
10.6%
20 10
10.6%
10 10
10.6%
90 8
8.5%
0 6
6.4%
ValueCountFrequency (%)
0 6
6.4%
10 10
10.6%
20 10
10.6%
30 10
10.6%
40 10
10.6%
50 10
10.6%
60 10
10.6%
70 10
10.6%
80 10
10.6%
90 8
8.5%
ValueCountFrequency (%)
90 8
8.5%
80 10
10.6%
70 10
10.6%
60 10
10.6%
50 10
10.6%
40 10
10.6%
30 10
10.6%
20 10
10.6%
10 10
10.6%
0 6
6.4%

생년
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct94
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1974.5
Minimum1928
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size978.0 B
2024-05-11T15:41:47.292701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1928
5-th percentile1932.65
Q11951.25
median1974.5
Q31997.75
95-th percentile2016.35
Maximum2021
Range93
Interquartile range (IQR)46.5

Descriptive statistics

Standard deviation27.279418
Coefficient of variation (CV)0.013815861
Kurtosis-1.2
Mean1974.5
Median Absolute Deviation (MAD)23.5
Skewness0
Sum185603
Variance744.16667
MonotonicityStrictly increasing
2024-05-11T15:41:47.665419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1928 1
 
1.1%
1988 1
 
1.1%
1997 1
 
1.1%
1996 1
 
1.1%
1995 1
 
1.1%
1994 1
 
1.1%
1993 1
 
1.1%
1992 1
 
1.1%
1991 1
 
1.1%
1990 1
 
1.1%
Other values (84) 84
89.4%
ValueCountFrequency (%)
1928 1
1.1%
1929 1
1.1%
1930 1
1.1%
1931 1
1.1%
1932 1
1.1%
1933 1
1.1%
1934 1
1.1%
1935 1
1.1%
1936 1
1.1%
1937 1
1.1%
ValueCountFrequency (%)
2021 1
1.1%
2020 1
1.1%
2019 1
1.1%
2018 1
1.1%
2017 1
1.1%
2016 1
1.1%
2015 1
1.1%
2014 1
1.1%
2013 1
1.1%
2012 1
1.1%

회원수
Real number (ℝ)

Distinct88
Distinct (%)93.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1777.4574
Minimum2
Maximum5537
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size978.0 B
2024-05-11T15:41:48.028103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile4.3
Q187
median867
Q33553.75
95-th percentile5133.35
Maximum5537
Range5535
Interquartile range (IQR)3466.75

Descriptive statistics

Standard deviation1933.2837
Coefficient of variation (CV)1.0876681
Kurtosis-1.1833416
Mean1777.4574
Median Absolute Deviation (MAD)851
Skewness0.67052914
Sum167081
Variance3737585.8
MonotonicityNot monotonic
2024-05-11T15:41:48.439497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 3
 
3.2%
5 2
 
2.1%
3 2
 
2.1%
19 2
 
2.1%
491 2
 
2.1%
4595 1
 
1.1%
4851 1
 
1.1%
5119 1
 
1.1%
5315 1
 
1.1%
5390 1
 
1.1%
Other values (78) 78
83.0%
ValueCountFrequency (%)
2 3
3.2%
3 2
2.1%
5 2
2.1%
9 1
 
1.1%
11 1
 
1.1%
12 1
 
1.1%
15 1
 
1.1%
17 1
 
1.1%
19 2
2.1%
33 1
 
1.1%
ValueCountFrequency (%)
5537 1
1.1%
5474 1
1.1%
5390 1
1.1%
5315 1
1.1%
5160 1
1.1%
5119 1
1.1%
4961 1
1.1%
4900 1
1.1%
4851 1
1.1%
4832 1
1.1%

Interactions

2024-05-11T15:41:45.682302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:41:43.711944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:41:44.541750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:41:45.867263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:41:44.028838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:41:45.313348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:41:46.078921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:41:44.237415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T15:41:45.525394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T15:41:48.863941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연령대생년회원수
연령대1.0000.9920.881
생년0.9921.0000.885
회원수0.8810.8851.000
2024-05-11T15:41:49.292819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연령대생년회원수
연령대1.000-0.995-0.420
생년-0.9951.0000.419
회원수-0.4200.4191.000

Missing values

2024-05-11T15:41:46.377292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T15:41:46.549455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연령대생년회원수
09019282
19019292
29019302
39019313
49019325
59019333
69019349
790193511
880193612
980193717
연령대생년회원수
84102012163
85102013123
86102014105
8710201596
880201684
890201756
900201840
910201919
920202015
93020215