Overview

Dataset statistics

Number of variables7
Number of observations364
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory22.2 KiB
Average record size in memory62.4 B

Variable types

Numeric6
Categorical1

Dataset

Description연도,지역코드,지역이름,주민등록인구,추계중증정신질환자수,정신건강복지센터 등록 중증정신질환자수,추계중증정신질환자수 대비 정신건강복지센터 등록 중증정신질환자
Author서울시정신건강복지센터
URLhttps://data.seoul.go.kr/dataList/OA-20335/S/1/datasetView.do

Alerts

연도 is highly overall correlated with 추계중증정신질환자수 대비 정신건강복지센터 등록 중증정신질환자High correlation
지역코드 is highly overall correlated with 지역이름High correlation
주민등록인구 is highly overall correlated with 추계중증정신질환자수 and 1 other fieldsHigh correlation
추계중증정신질환자수 is highly overall correlated with 주민등록인구 and 1 other fieldsHigh correlation
정신건강복지센터 등록 중증정신질환자수 is highly overall correlated with 추계중증정신질환자수 대비 정신건강복지센터 등록 중증정신질환자High correlation
추계중증정신질환자수 대비 정신건강복지센터 등록 중증정신질환자 is highly overall correlated with 연도 and 1 other fieldsHigh correlation
지역이름 is highly overall correlated with 지역코드 and 2 other fieldsHigh correlation
주민등록인구 has unique valuesUnique

Reproduction

Analysis started2024-05-04 03:28:28.096056
Analysis finished2024-05-04 03:28:39.975204
Duration11.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct14
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2015.5
Minimum2009
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-05-04T03:28:40.162591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2009
5-th percentile2009
Q12012
median2015.5
Q32019
95-th percentile2022
Maximum2022
Range13
Interquartile range (IQR)7

Descriptive statistics

Standard deviation4.0366776
Coefficient of variation (CV)0.002002817
Kurtosis-1.2124601
Mean2015.5
Median Absolute Deviation (MAD)3.5
Skewness0
Sum733642
Variance16.294766
MonotonicityDecreasing
2024-05-04T03:28:40.647389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
2022 26
 
7.1%
2021 26
 
7.1%
2020 26
 
7.1%
2019 26
 
7.1%
2018 26
 
7.1%
2017 26
 
7.1%
2016 26
 
7.1%
2015 26
 
7.1%
2014 26
 
7.1%
2013 26
 
7.1%
Other values (4) 104
28.6%
ValueCountFrequency (%)
2009 26
7.1%
2010 26
7.1%
2011 26
7.1%
2012 26
7.1%
2013 26
7.1%
2014 26
7.1%
2015 26
7.1%
2016 26
7.1%
2017 26
7.1%
2018 26
7.1%
ValueCountFrequency (%)
2022 26
7.1%
2021 26
7.1%
2020 26
7.1%
2019 26
7.1%
2018 26
7.1%
2017 26
7.1%
2016 26
7.1%
2015 26
7.1%
2014 26
7.1%
2013 26
7.1%

지역코드
Real number (ℝ)

HIGH CORRELATION 

Distinct26
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean112.5
Minimum100
Maximum125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-05-04T03:28:41.052015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum100
5-th percentile101
Q1106
median112.5
Q3119
95-th percentile124
Maximum125
Range25
Interquartile range (IQR)13

Descriptive statistics

Standard deviation7.5103235
Coefficient of variation (CV)0.066758431
Kurtosis-1.2035865
Mean112.5
Median Absolute Deviation (MAD)6.5
Skewness0
Sum40950
Variance56.404959
MonotonicityNot monotonic
2024-05-04T03:28:41.436957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
100 14
 
3.8%
114 14
 
3.8%
125 14
 
3.8%
124 14
 
3.8%
123 14
 
3.8%
122 14
 
3.8%
121 14
 
3.8%
120 14
 
3.8%
119 14
 
3.8%
118 14
 
3.8%
Other values (16) 224
61.5%
ValueCountFrequency (%)
100 14
3.8%
101 14
3.8%
102 14
3.8%
103 14
3.8%
104 14
3.8%
105 14
3.8%
106 14
3.8%
107 14
3.8%
108 14
3.8%
109 14
3.8%
ValueCountFrequency (%)
125 14
3.8%
124 14
3.8%
123 14
3.8%
122 14
3.8%
121 14
3.8%
120 14
3.8%
119 14
3.8%
118 14
3.8%
117 14
3.8%
116 14
3.8%

지역이름
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
서울시
 
14
종로구
 
14
중구
 
14
용산구
 
14
성동구
 
14
Other values (21)
294 

Length

Max length11
Median length10
Mean length9.8076923
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울시
2nd row 종로구
3rd row 중구
4th row 용산구
5th row 성동구

Common Values

ValueCountFrequency (%)
서울시 14
 
3.8%
종로구 14
 
3.8%
중구 14
 
3.8%
용산구 14
 
3.8%
성동구 14
 
3.8%
광진구 14
 
3.8%
동대문구 14
 
3.8%
중랑구 14
 
3.8%
성북구 14
 
3.8%
강북구 14
 
3.8%
Other values (16) 224
61.5%

Length

2024-05-04T03:28:41.879051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울시 14
 
3.8%
종로구 14
 
3.8%
송파구 14
 
3.8%
강남구 14
 
3.8%
서초구 14
 
3.8%
관악구 14
 
3.8%
동작구 14
 
3.8%
영등포구 14
 
3.8%
금천구 14
 
3.8%
구로구 14
 
3.8%
Other values (16) 224
61.5%

주민등록인구
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct364
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean718926.99
Minimum106485
Maximum10143645
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-05-04T03:28:42.539392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum106485
5-th percentile141874.6
Q1304816.25
median378992
Q3462676.75
95-th percentile664701.7
Maximum10143645
Range10037160
Interquartile range (IQR)157860.5

Descriptive statistics

Standard deviation1738676.6
Coefficient of variation (CV)2.4184327
Kurtosis21.829772
Mean718926.99
Median Absolute Deviation (MAD)79449.5
Skewness4.8374186
Sum2.6168942 × 108
Variance3.0229962 × 1012
MonotonicityNot monotonic
2024-05-04T03:28:43.074730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9428372 1
 
0.3%
424964 1
 
0.3%
492528 1
 
0.3%
381856 1
 
0.3%
314110 1
 
0.3%
503660 1
 
0.3%
590479 1
 
0.3%
358582 1
 
0.3%
338707 1
 
0.3%
476589 1
 
0.3%
Other values (354) 354
97.3%
ValueCountFrequency (%)
106485 1
0.3%
110409 1
0.3%
111312 1
0.3%
112379 1
0.3%
120437 1
0.3%
122499 1
0.3%
125240 1
0.3%
125249 1
0.3%
125709 1
0.3%
125725 1
0.3%
ValueCountFrequency (%)
10143645 1
0.3%
10103233 1
0.3%
10022181 1
0.3%
9930616 1
0.3%
9857426 1
0.3%
9765623 1
0.3%
9729107 1
0.3%
9668465 1
0.3%
9509458 1
0.3%
9428372 1
0.3%

추계중증정신질환자수
Real number (ℝ)

HIGH CORRELATION 

Distinct352
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7189.2816
Minimum1065
Maximum101436
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-05-04T03:28:43.712706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1065
5-th percentile1418.95
Q13048
median3790
Q34627.0375
95-th percentile6646.7
Maximum101436
Range100371
Interquartile range (IQR)1579.0375

Descriptive statistics

Standard deviation17386.752
Coefficient of variation (CV)2.4184268
Kurtosis21.829768
Mean7189.2816
Median Absolute Deviation (MAD)794.61
Skewness4.8374179
Sum2616898.5
Variance3.0229913 × 108
MonotonicityNot monotonic
2024-05-04T03:28:44.232079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1257.0 3
 
0.8%
5037.0 2
 
0.5%
2993.0 2
 
0.5%
3962.0 2
 
0.5%
4868.0 2
 
0.5%
3048.0 2
 
0.5%
3689.0 2
 
0.5%
4667.0 2
 
0.5%
4766.0 2
 
0.5%
1252.0 2
 
0.5%
Other values (342) 343
94.2%
ValueCountFrequency (%)
1065.0 1
 
0.3%
1104.0 1
 
0.3%
1113.0 1
 
0.3%
1123.79 1
 
0.3%
1204.0 1
 
0.3%
1225.0 1
 
0.3%
1252.0 2
0.5%
1257.0 3
0.8%
1261.71 1
 
0.3%
1281.0 1
 
0.3%
ValueCountFrequency (%)
101436.0 1
0.3%
101032.0 1
0.3%
100222.0 1
0.3%
99306.0 1
0.3%
98574.0 1
0.3%
97656.0 1
0.3%
97291.0 1
0.3%
96685.0 1
0.3%
95095.0 1
0.3%
94284.0 1
0.3%
Distinct275
Distinct (%)75.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean859.70879
Minimum0
Maximum15019
Zeros1
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-05-04T03:28:44.767722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile192.15
Q1280
median466
Q3574.75
95-th percentile1107.2
Maximum15019
Range15019
Interquartile range (IQR)294.75

Descriptive statistics

Standard deviation2153.9193
Coefficient of variation (CV)2.5054057
Kurtosis26.102201
Mean859.70879
Median Absolute Deviation (MAD)154.5
Skewness5.1830032
Sum312934
Variance4639368.2
MonotonicityNot monotonic
2024-05-04T03:28:45.313716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
494 5
 
1.4%
238 4
 
1.1%
596 4
 
1.1%
566 3
 
0.8%
367 3
 
0.8%
246 3
 
0.8%
260 3
 
0.8%
210 3
 
0.8%
309 3
 
0.8%
230 3
 
0.8%
Other values (265) 330
90.7%
ValueCountFrequency (%)
0 1
0.3%
23 1
0.3%
107 1
0.3%
131 1
0.3%
136 1
0.3%
140 1
0.3%
142 1
0.3%
150 1
0.3%
152 2
0.5%
156 1
0.3%
ValueCountFrequency (%)
15019 1
0.3%
14178 1
0.3%
13710 1
0.3%
12427 1
0.3%
12369 1
0.3%
12224 1
0.3%
12092 1
0.3%
11936 1
0.3%
11871 1
0.3%
11870 1
0.3%
Distinct276
Distinct (%)75.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.890549
Minimum0
Maximum48.85
Zeros1
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2024-05-04T03:28:45.799253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile4.83
Q18.6475
median11.855
Q315.83
95-th percentile24.795
Maximum48.85
Range48.85
Interquartile range (IQR)7.1825

Descriptive statistics

Standard deviation6.5889321
Coefficient of variation (CV)0.51114439
Kurtosis5.6726043
Mean12.890549
Median Absolute Deviation (MAD)3.56
Skewness1.7007632
Sum4692.16
Variance43.414026
MonotonicityNot monotonic
2024-05-04T03:28:46.456809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10.7 6
 
1.6%
15.0 5
 
1.4%
5.5 4
 
1.1%
11.2 4
 
1.1%
16.6 3
 
0.8%
6.7 3
 
0.8%
29.3 3
 
0.8%
12.3 3
 
0.8%
11.5 3
 
0.8%
15.3 3
 
0.8%
Other values (266) 327
89.8%
ValueCountFrequency (%)
0.0 1
 
0.3%
1.1 1
 
0.3%
3.2 1
 
0.3%
3.3 2
0.5%
3.5 1
 
0.3%
3.8 1
 
0.3%
3.9 3
0.8%
4.04 1
 
0.3%
4.1 1
 
0.3%
4.22 1
 
0.3%
ValueCountFrequency (%)
48.85 1
 
0.3%
47.0 1
 
0.3%
45.3 1
 
0.3%
37.7 1
 
0.3%
31.5 1
 
0.3%
30.88 1
 
0.3%
29.3 3
0.8%
29.0 1
 
0.3%
28.8 2
0.5%
28.13 1
 
0.3%

Interactions

2024-05-04T03:28:36.999869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:28.537085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:29.847720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:31.291662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:33.213088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:35.011113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:37.343381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:28.772012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:30.109339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:31.505083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:33.474233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:35.314733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:37.676383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:29.004032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:30.357771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:32.164358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:33.742424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:35.630989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:37.996081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:29.195709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:30.616998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:32.418673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:34.001666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:35.958939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:38.381381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:29.397085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:30.870393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:32.670947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:34.256229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:36.395256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:38.696216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:29.619913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:31.104075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:32.939641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:34.663538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T03:28:36.727438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-04T03:28:46.841027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도지역코드지역이름주민등록인구추계중증정신질환자수정신건강복지센터 등록 중증정신질환자수추계중증정신질환자수 대비 정신건강복지센터 등록 중증정신질환자
연도1.0000.0000.0000.0000.0000.0840.543
지역코드0.0001.0001.0000.4530.4530.3520.401
지역이름0.0001.0001.0000.7770.7770.6680.589
주민등록인구0.0000.4530.7771.0001.0000.9480.051
추계중증정신질환자수0.0000.4530.7771.0001.0000.9480.051
정신건강복지센터 등록 중증정신질환자수0.0840.3520.6680.9480.9481.0000.000
추계중증정신질환자수 대비 정신건강복지센터 등록 중증정신질환자0.5430.4010.5890.0510.0510.0001.000
2024-05-04T03:28:47.224923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도지역코드주민등록인구추계중증정신질환자수정신건강복지센터 등록 중증정신질환자수추계중증정신질환자수 대비 정신건강복지센터 등록 중증정신질환자지역이름
연도1.0000.0000.1490.149-0.442-0.5500.000
지역코드0.0001.0000.4400.4400.012-0.3330.977
주민등록인구0.1490.4401.0001.0000.397-0.3930.516
추계중증정신질환자수0.1490.4401.0001.0000.397-0.3930.516
정신건강복지센터 등록 중증정신질환자수-0.4420.0120.3970.3971.0000.6050.365
추계중증정신질환자수 대비 정신건강복지센터 등록 중증정신질환자-0.550-0.333-0.393-0.3930.6051.0000.259
지역이름0.0000.9770.5160.5160.3650.2591.000

Missing values

2024-05-04T03:28:39.179144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-04T03:28:39.758367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도지역코드지역이름주민등록인구추계중증정신질환자수정신건강복지센터 등록 중증정신질환자수추계중증정신질환자수 대비 정신건강복지센터 등록 중증정신질환자
02022100서울시942837294284.054775.8
12022101종로구1413791414.021014.9
22022102중구1204371204.013110.9
32022103용산구2186502187.01406.4
42022104성동구2810002810.01796.4
52022105광진구3374163374.01594.7
62022106동대문구3366443366.02336.9
72022107중랑구3853183853.02707.0
82022108성북구4303974304.03077.1
92022109강북구2936602937.02368.0
연도지역코드지역이름주민등록인구추계중증정신질환자수정신건강복지센터 등록 중증정신질환자수추계중증정신질환자수 대비 정신건강복지센터 등록 중증정신질환자
3542009116강서구4466484466.094421.1
3552009117구로구3341933342.051215.3
3562009118금천구1942301942.01075.5
3572009119영등포구3246033246.055117.0
3582009120동작구3199173199.039912.5
3592009121관악구4360054360.02275.2
3602009122서초구3315853316.02607.8
3612009123강남구4399064399.070416.0
3622009124송파구5280565281.04378.3
3632009125강동구3779863780.048512.8