Overview

Dataset statistics

Number of variables8
Number of observations825
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory54.9 KiB
Average record size in memory68.2 B

Variable types

Categorical3
Text1
Numeric4

Dataset

Description경기도_양주시_인구 및 세대현황(읍면동, 법정동리, 인구(계), 인구(남), 인구(여), 세대수, 기준일자)
Author경기도 양주시
URLhttps://www.data.go.kr/data/3076101/fileData.do

Alerts

기준일자 has constant value ""Constant
인구(계) is highly overall correlated with 인구(남) and 3 other fieldsHigh correlation
인구(남) is highly overall correlated with 인구(계) and 3 other fieldsHigh correlation
인구(여) is highly overall correlated with 인구(계) and 3 other fieldsHigh correlation
세대수 is highly overall correlated with 인구(계) and 3 other fieldsHigh correlation
읍면동 is highly overall correlated with 인구(계) and 3 other fieldsHigh correlation

Reproduction

Analysis started2024-03-14 17:13:21.402225
Analysis finished2024-03-14 17:13:26.808008
Duration5.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연월일
Categorical

Distinct15
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2020-09-30
 
55
2020-10-31
 
55
2020-11-30
 
55
2020-12-31
 
55
2021-01-31
 
55
Other values (10)
550 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-09-30
2nd row2020-09-30
3rd row2020-09-30
4th row2020-09-30
5th row2020-09-30

Common Values

ValueCountFrequency (%)
2020-09-30 55
 
6.7%
2020-10-31 55
 
6.7%
2020-11-30 55
 
6.7%
2020-12-31 55
 
6.7%
2021-01-31 55
 
6.7%
2021-02-28 55
 
6.7%
2021-03-31 55
 
6.7%
2021-04-30 55
 
6.7%
2021-05-31 55
 
6.7%
2021-06-30 55
 
6.7%
Other values (5) 275
33.3%

Length

2024-03-15T02:13:27.007775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2020-09-30 55
 
6.7%
2020-10-31 55
 
6.7%
2020-11-30 55
 
6.7%
2020-12-31 55
 
6.7%
2021-01-31 55
 
6.7%
2021-02-28 55
 
6.7%
2021-03-31 55
 
6.7%
2021-04-30 55
 
6.7%
2021-05-31 55
 
6.7%
2021-06-30 55
 
6.7%
Other values (5) 275
33.3%

읍면동
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
남면
150 
백석읍
105 
광적면
105 
장흥면
105 
은현면
90 
Other values (6)
270 

Length

Max length4
Median length3
Mean length3.1454545
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row백석읍
2nd row백석읍
3rd row백석읍
4th row백석읍
5th row백석읍

Common Values

ValueCountFrequency (%)
남면 150
18.2%
백석읍 105
12.7%
광적면 105
12.7%
장흥면 105
12.7%
은현면 90
10.9%
양주1동 75
9.1%
양주2동 60
 
7.3%
회천4동 45
 
5.5%
회천1동 30
 
3.6%
회천2동 30
 
3.6%

Length

2024-03-15T02:13:27.481398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
남면 150
18.2%
백석읍 105
12.7%
광적면 105
12.7%
장흥면 105
12.7%
은현면 90
10.9%
양주1동 75
9.1%
양주2동 60
 
7.3%
회천4동 45
 
5.5%
회천1동 30
 
3.6%
회천2동 30
 
3.6%
Distinct54
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2024-03-15T02:13:28.457845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters2475
Distinct characters68
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row방성리
2nd row오산리
3rd row복지리
4th row가업리
5th row홍죽리
ValueCountFrequency (%)
덕정동 30
 
3.6%
어둔동 15
 
1.8%
율정동 15
 
1.8%
마전동 15
 
1.8%
효촌리 15
 
1.8%
교현리 15
 
1.8%
울대리 15
 
1.8%
부곡리 15
 
1.8%
석현리 15
 
1.8%
일영리 15
 
1.8%
Other values (44) 660
80.0%
2024-03-15T02:13:29.588680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
555
22.4%
270
 
10.9%
150
 
6.1%
75
 
3.0%
75
 
3.0%
60
 
2.4%
60
 
2.4%
45
 
1.8%
45
 
1.8%
45
 
1.8%
Other values (58) 1095
44.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2475
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
555
22.4%
270
 
10.9%
150
 
6.1%
75
 
3.0%
75
 
3.0%
60
 
2.4%
60
 
2.4%
45
 
1.8%
45
 
1.8%
45
 
1.8%
Other values (58) 1095
44.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2475
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
555
22.4%
270
 
10.9%
150
 
6.1%
75
 
3.0%
75
 
3.0%
60
 
2.4%
60
 
2.4%
45
 
1.8%
45
 
1.8%
45
 
1.8%
Other values (58) 1095
44.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2475
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
555
22.4%
270
 
10.9%
150
 
6.1%
75
 
3.0%
75
 
3.0%
60
 
2.4%
60
 
2.4%
45
 
1.8%
45
 
1.8%
45
 
1.8%
Other values (58) 1095
44.2%

인구(계)
Real number (ℝ)

HIGH CORRELATION 

Distinct601
Distinct (%)72.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4246.0206
Minimum143
Maximum53255
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.4 KiB
2024-03-15T02:13:29.853184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum143
5-th percentile325
Q1570
median1017
Q34854
95-th percentile20039.2
Maximum53255
Range53112
Interquartile range (IQR)4284

Descriptive statistics

Standard deviation7998.6122
Coefficient of variation (CV)1.8837902
Kurtosis18.775446
Mean4246.0206
Median Absolute Deviation (MAD)623
Skewness3.9558815
Sum3502967
Variance63977796
MonotonicityNot monotonic
2024-03-15T02:13:30.204721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
147 6
 
0.7%
575 6
 
0.7%
1003 5
 
0.6%
706 5
 
0.6%
330 5
 
0.6%
1108 5
 
0.6%
395 4
 
0.5%
763 4
 
0.5%
298 4
 
0.5%
333 4
 
0.5%
Other values (591) 777
94.2%
ValueCountFrequency (%)
143 1
 
0.1%
147 6
0.7%
148 2
 
0.2%
149 1
 
0.1%
152 2
 
0.2%
155 1
 
0.1%
158 1
 
0.1%
163 1
 
0.1%
291 1
 
0.1%
293 2
 
0.2%
ValueCountFrequency (%)
53255 1
0.1%
53230 1
0.1%
53120 1
0.1%
52974 1
0.1%
52861 1
0.1%
52782 1
0.1%
52717 1
0.1%
52513 1
0.1%
50205 2
0.2%
47640 1
0.1%

인구(남)
Real number (ℝ)

HIGH CORRELATION 

Distinct533
Distinct (%)64.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2153.8752
Minimum78
Maximum26631
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.4 KiB
2024-03-15T02:13:30.454268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum78
5-th percentile178.2
Q1315
median573
Q32403
95-th percentile9820.6
Maximum26631
Range26553
Interquartile range (IQR)2088

Descriptive statistics

Standard deviation3986.1649
Coefficient of variation (CV)1.8506945
Kurtosis19.088656
Mean2153.8752
Median Absolute Deviation (MAD)364
Skewness3.9917041
Sum1776947
Variance15889510
MonotonicityNot monotonic
2024-03-15T02:13:30.751173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
180 9
 
1.1%
185 7
 
0.8%
188 6
 
0.7%
182 6
 
0.7%
187 6
 
0.7%
183 6
 
0.7%
1327 5
 
0.6%
209 5
 
0.6%
573 5
 
0.6%
320 5
 
0.6%
Other values (523) 765
92.7%
ValueCountFrequency (%)
78 1
 
0.1%
79 2
0.2%
80 2
0.2%
81 3
0.4%
82 2
0.2%
83 1
 
0.1%
84 1
 
0.1%
87 1
 
0.1%
89 1
 
0.1%
92 1
 
0.1%
ValueCountFrequency (%)
26631 1
0.1%
26612 1
0.1%
26585 1
0.1%
26505 1
0.1%
26442 1
0.1%
26435 1
0.1%
26423 1
0.1%
26308 1
0.1%
25151 2
0.2%
23903 1
0.1%

인구(여)
Real number (ℝ)

HIGH CORRELATION 

Distinct508
Distinct (%)61.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2092.1455
Minimum65
Maximum26643
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.4 KiB
2024-03-15T02:13:31.005569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum65
5-th percentile149.2
Q1255
median436
Q32452
95-th percentile10216.2
Maximum26643
Range26578
Interquartile range (IQR)2197

Descriptive statistics

Standard deviation4014.0615
Coefficient of variation (CV)1.9186341
Kurtosis18.438176
Mean2092.1455
Median Absolute Deviation (MAD)261
Skewness3.9172694
Sum1726020
Variance16112690
MonotonicityNot monotonic
2024-03-15T02:13:31.487549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
151 8
 
1.0%
315 7
 
0.8%
309 6
 
0.7%
154 6
 
0.7%
307 6
 
0.7%
156 6
 
0.7%
318 5
 
0.6%
317 5
 
0.6%
314 5
 
0.6%
310 5
 
0.6%
Other values (498) 766
92.8%
ValueCountFrequency (%)
65 5
0.6%
66 1
 
0.1%
67 1
 
0.1%
68 4
0.5%
69 1
 
0.1%
71 3
0.4%
103 1
 
0.1%
104 2
 
0.2%
105 1
 
0.1%
106 2
 
0.2%
ValueCountFrequency (%)
26643 1
0.1%
26599 1
0.1%
26535 1
0.1%
26469 1
0.1%
26419 1
0.1%
26347 1
0.1%
26294 1
0.1%
26205 1
0.1%
25054 2
0.2%
23737 1
0.1%

세대수
Real number (ℝ)

HIGH CORRELATION 

Distinct519
Distinct (%)62.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1817.183
Minimum100
Maximum21267
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.4 KiB
2024-03-15T02:13:31.831064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum100
5-th percentile166.4
Q1324
median537
Q32042
95-th percentile7241
Maximum21267
Range21167
Interquartile range (IQR)1718

Descriptive statistics

Standard deviation3176.6642
Coefficient of variation (CV)1.7481256
Kurtosis18.680629
Mean1817.183
Median Absolute Deviation (MAD)314
Skewness3.9399795
Sum1499176
Variance10091196
MonotonicityNot monotonic
2024-03-15T02:13:32.123418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
249 6
 
0.7%
246 6
 
0.7%
251 6
 
0.7%
163 6
 
0.7%
529 6
 
0.7%
252 6
 
0.7%
302 5
 
0.6%
388 5
 
0.6%
184 5
 
0.6%
484 5
 
0.6%
Other values (509) 769
93.2%
ValueCountFrequency (%)
100 4
0.5%
101 4
0.5%
102 2
0.2%
106 2
0.2%
107 1
 
0.1%
109 1
 
0.1%
112 1
 
0.1%
149 1
 
0.1%
152 2
0.2%
153 1
 
0.1%
ValueCountFrequency (%)
21267 1
0.1%
21257 1
0.1%
21212 1
0.1%
21112 1
0.1%
21077 1
0.1%
21059 1
0.1%
21032 1
0.1%
20980 1
0.1%
19877 2
0.2%
18999 1
0.1%

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2022-12-20
825 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-12-20
2nd row2022-12-20
3rd row2022-12-20
4th row2022-12-20
5th row2022-12-20

Common Values

ValueCountFrequency (%)
2022-12-20 825
100.0%

Length

2024-03-15T02:13:32.457603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:13:32.754818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-12-20 825
100.0%

Interactions

2024-03-15T02:13:25.220222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:21.880050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:23.146163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:24.184379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:25.475756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:22.350668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:23.406927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:24.446409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:25.731963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:22.606174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:23.664153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:24.704133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:25.987007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:22.885435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:23.934283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:13:24.965878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T02:13:32.896840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연월일읍면동법정동리인구(계)인구(남)인구(여)세대수
연월일1.0000.0000.0000.0000.0000.0000.000
읍면동0.0001.0000.9980.7850.7770.7850.778
법정동리0.0000.9981.0000.9880.9880.9880.987
인구(계)0.0000.7850.9881.0001.0001.0001.000
인구(남)0.0000.7770.9881.0001.0001.0001.000
인구(여)0.0000.7850.9881.0001.0001.0001.000
세대수0.0000.7780.9871.0001.0001.0001.000
2024-03-15T02:13:33.202028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연월일읍면동
연월일1.0000.000
읍면동0.0001.000
2024-03-15T02:13:33.446996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인구(계)인구(남)인구(여)세대수연월일읍면동
인구(계)1.0000.9970.9950.9910.0000.541
인구(남)0.9971.0000.9870.9930.0000.531
인구(여)0.9950.9871.0000.9820.0000.541
세대수0.9910.9930.9821.0000.0000.532
연월일0.0000.0000.0000.0001.0000.000
읍면동0.5410.5310.5410.5320.0001.000

Missing values

2024-03-15T02:13:26.335575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T02:13:26.640105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연월일읍면동법정동리인구(계)인구(남)인구(여)세대수기준일자
02020-09-30백석읍방성리63953327306831172022-12-20
12020-09-30백석읍오산리35281788174015732022-12-20
22020-09-30백석읍복지리95824695488735242022-12-20
32020-09-30백석읍가업리48542397245720102022-12-20
42020-09-30백석읍홍죽리8794833964702022-12-20
52020-09-30백석읍연곡리7013883133562022-12-20
62020-09-30백석읍기산리4892632263462022-12-20
72020-09-30은현면용암리13247535716962022-12-20
82020-09-30은현면선암리12277085196502022-12-20
92020-09-30은현면운암리7664393274052022-12-20
연월일읍면동법정동리인구(계)인구(남)인구(여)세대수기준일자
8152021-11-30양주2동고읍동90294406462331612022-12-20
8162021-11-30회천1동덕정동84814341414039132022-12-20
8172021-11-30회천1동봉양동12597924678372022-12-20
8182021-11-30회천2동덕계동21981111021087994592022-12-20
8192021-11-30회천2동회정동53112715259622802022-12-20
8202021-11-30회천3동덕정동102355046518937232022-12-20
8212021-11-30회천3동고암동171198520859973212022-12-20
8222021-11-30회천4동회암동9275373905252022-12-20
8232021-11-30회천4동율정동5563372193542022-12-20
8242021-11-30회천4동옥정동532552661226643212672022-12-20