Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells84
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Categorical3
DateTime1

Dataset

Description광산구 내 종합사회복지관 이용자 현황 정보(이용복지관, 출생년도, 성별, 이용일자, 데이터 기준일자 등)를 제공합니다.
URLhttps://www.data.go.kr/data/15048524/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 이용복지관High correlation
이용복지관 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:28:51.392117
Analysis finished2023-12-12 07:28:52.480764
Duration1.09 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6213.6065
Minimum1
Maximum12410
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T16:28:52.570113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile599.95
Q13131.75
median6200.5
Q39329.5
95-th percentile11785.15
Maximum12410
Range12409
Interquartile range (IQR)6197.75

Descriptive statistics

Standard deviation3587.5088
Coefficient of variation (CV)0.57736337
Kurtosis-1.1976959
Mean6213.6065
Median Absolute Deviation (MAD)3099.5
Skewness-0.0067915292
Sum62136065
Variance12870219
MonotonicityNot monotonic
2023-12-12T16:28:52.730236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11088 1
 
< 0.1%
811 1
 
< 0.1%
10769 1
 
< 0.1%
10163 1
 
< 0.1%
11637 1
 
< 0.1%
1468 1
 
< 0.1%
10722 1
 
< 0.1%
7600 1
 
< 0.1%
525 1
 
< 0.1%
317 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
ValueCountFrequency (%)
12410 1
< 0.1%
12409 1
< 0.1%
12408 1
< 0.1%
12407 1
< 0.1%
12406 1
< 0.1%
12405 1
< 0.1%
12404 1
< 0.1%
12403 1
< 0.1%
12402 1
< 0.1%
12401 1
< 0.1%

이용복지관
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
행복드림종합사회복지관
3997 
첨단종합사회복지관
3404 
송광종합사회복지관
1496 
하남종합사회복지관
1103 

Length

Max length11
Median length9
Mean length9.7994
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row행복드림종합사회복지관
2nd row송광종합사회복지관
3rd row송광종합사회복지관
4th row하남종합사회복지관
5th row행복드림종합사회복지관

Common Values

ValueCountFrequency (%)
행복드림종합사회복지관 3997
40.0%
첨단종합사회복지관 3404
34.0%
송광종합사회복지관 1496
 
15.0%
하남종합사회복지관 1103
 
11.0%

Length

2023-12-12T16:28:52.867302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:28:52.978881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
행복드림종합사회복지관 3997
40.0%
첨단종합사회복지관 3404
34.0%
송광종합사회복지관 1496
 
15.0%
하남종합사회복지관 1103
 
11.0%

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
6729 
3271 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
6729
67.3%
3271
32.7%

Length

2023-12-12T16:28:53.115001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:28:53.214478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6729
67.3%
3271
32.7%

출생연도
Real number (ℝ)

Distinct99
Distinct (%)1.0%
Missing84
Missing (%)0.8%
Infinite0
Infinite (%)0.0%
Mean1968.2308
Minimum1905
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T16:28:53.337480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1905
5-th percentile1939
Q11951
median1965
Q31984
95-th percentile2004
Maximum2021
Range116
Interquartile range (IQR)33

Descriptive statistics

Standard deviation21.413416
Coefficient of variation (CV)0.010879525
Kurtosis-0.90359288
Mean1968.2308
Median Absolute Deviation (MAD)16
Skewness0.34682981
Sum19516977
Variance458.5344
MonotonicityNot monotonic
2023-12-12T16:28:53.515542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1960 370
 
3.7%
1955 298
 
3.0%
1962 287
 
2.9%
1951 269
 
2.7%
1945 249
 
2.5%
1950 236
 
2.4%
1948 221
 
2.2%
1969 221
 
2.2%
1942 218
 
2.2%
1941 202
 
2.0%
Other values (89) 7345
73.5%
ValueCountFrequency (%)
1905 2
 
< 0.1%
1915 1
 
< 0.1%
1922 1
 
< 0.1%
1923 1
 
< 0.1%
1925 3
 
< 0.1%
1927 6
 
0.1%
1928 2
 
< 0.1%
1929 8
 
0.1%
1930 22
0.2%
1931 18
0.2%
ValueCountFrequency (%)
2021 6
 
0.1%
2020 1
 
< 0.1%
2018 1
 
< 0.1%
2017 4
 
< 0.1%
2016 3
 
< 0.1%
2015 15
 
0.1%
2014 46
0.5%
2013 64
0.6%
2012 58
0.6%
2011 72
0.7%
Distinct308
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-01-01 00:00:00
Maximum2022-12-31 00:00:00
2023-12-12T16:28:53.644024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:28:53.796207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022-12-31
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-12-31
2nd row2022-12-31
3rd row2022-12-31
4th row2022-12-31
5th row2022-12-31

Common Values

ValueCountFrequency (%)
2022-12-31 10000
100.0%

Length

2023-12-12T16:28:53.935606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:28:54.023072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-12-31 10000
100.0%

Interactions

2023-12-12T16:28:51.985056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:28:51.727892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:28:52.108080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:28:51.857972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:28:54.073053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번이용복지관성별출생연도
연번1.0000.9830.1510.589
이용복지관0.9831.0000.1670.472
성별0.1510.1671.0000.185
출생연도0.5890.4720.1851.000
2023-12-12T16:28:54.156672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별이용복지관
성별1.0000.111
이용복지관0.1111.000
2023-12-12T16:28:54.232510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번출생연도이용복지관성별
연번1.000-0.0300.9350.115
출생연도-0.0301.0000.3010.141
이용복지관0.9350.3011.0000.111
성별0.1150.1410.1111.000

Missing values

2023-12-12T16:28:52.258647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:28:52.425498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번이용복지관성별출생연도이용일자데이터기준일자
1108711088행복드림종합사회복지관19452022-10-252022-12-31
11741175송광종합사회복지관19442022-01-012022-12-31
17491750송광종합사회복지관19572022-07-062022-12-31
62286229하남종합사회복지관19512022-01-012022-12-31
1216212163행복드림종합사회복지관19812022-12-012022-12-31
86838684행복드림종합사회복지관19692022-04-012022-12-31
82498250행복드림종합사회복지관19712022-02-282022-12-31
25072508첨단종합사회복지관19492022-01-022022-12-31
22522253첨단종합사회복지관20052022-01-082022-12-31
384385송광종합사회복지관19552022-01-052022-12-31
연번이용복지관성별출생연도이용일자데이터기준일자
86498650행복드림종합사회복지관19982022-04-012022-12-31
12381239송광종합사회복지관19432022-01-102022-12-31
43184319첨단종합사회복지관19932022-01-172022-12-31
77577758행복드림종합사회복지관19452022-01-312022-12-31
363364송광종합사회복지관20182022-12-012022-12-31
17651766송광종합사회복지관20012022-01-012022-12-31
1201412015행복드림종합사회복지관19692022-12-012022-12-31
32723273첨단종합사회복지관19482022-01-022022-12-31
23242325첨단종합사회복지관20042022-11-212022-12-31
1060310604행복드림종합사회복지관19772022-09-302022-12-31