Overview

Dataset statistics

Number of variables8
Number of observations3690
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory245.2 KiB
Average record size in memory68.0 B

Variable types

DateTime2
Categorical2
Numeric4

Dataset

Description경기도 수원시의 외국인을 제외한 5세 계급별 월간 인구현황에 대한 데이터로 연령별 성별 수, 구성비 등의 항목을 제공합니다. 잠정자료이므로 변동될 수 있습니다.
Author경기도 수원시
URLhttps://www.data.go.kr/data/15051543/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
is highly overall correlated with 구성비(퍼센트) and 2 other fieldsHigh correlation
구성비(퍼센트) is highly overall correlated with and 2 other fieldsHigh correlation
성별_남 is highly overall correlated with and 2 other fieldsHigh correlation
성별_여 is highly overall correlated with and 2 other fieldsHigh correlation

Reproduction

Analysis started2024-03-14 14:25:58.555395
Analysis finished2024-03-14 14:26:04.180513
Duration5.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct41
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size29.0 KiB
Minimum2020-07-01 00:00:00
Maximum2023-11-01 00:00:00
2024-03-14T23:26:04.415648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:05.290880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)

구분
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size29.0 KiB
수원시
738 
장안구
738 
권선구
738 
팔달구
738 
영통구
738 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수원시
2nd row수원시
3rd row수원시
4th row수원시
5th row수원시

Common Values

ValueCountFrequency (%)
수원시 738
20.0%
장안구 738
20.0%
권선구 738
20.0%
팔달구 738
20.0%
영통구 738
20.0%

Length

2024-03-14T23:26:05.967620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T23:26:06.437858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수원시 738
20.0%
장안구 738
20.0%
권선구 738
20.0%
팔달구 738
20.0%
영통구 738
20.0%

연령별
Categorical

Distinct36
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size29.0 KiB
0 ∼ 4세
 
200
55 ∼ 59세
 
200
45 ∼ 49세
 
200
85세 이상
 
200
10 ∼ 14세
 
200
Other values (31)
2690 

Length

Max length8
Median length8
Mean length7.6192412
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0∼4세
2nd row5∼9세
3rd row10∼14세
4th row15∼19세
5th row20∼24세

Common Values

ValueCountFrequency (%)
0 ∼ 4세 200
 
5.4%
55 ∼ 59세 200
 
5.4%
45 ∼ 49세 200
 
5.4%
85세 이상 200
 
5.4%
10 ∼ 14세 200
 
5.4%
80 ∼ 84세 200
 
5.4%
75 ∼ 79세 200
 
5.4%
65 ∼ 69세 200
 
5.4%
60 ∼ 64세 200
 
5.4%
70 ∼ 74세 200
 
5.4%
Other values (26) 1690
45.8%

Length

2024-03-14T23:26:06.866212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
3400
31.8%
0 200
 
1.9%
39세 200
 
1.9%
5 200
 
1.9%
9세 200
 
1.9%
50 200
 
1.9%
54세 200
 
1.9%
40 200
 
1.9%
44세 200
 
1.9%
30 200
 
1.9%
Other values (45) 5490
51.4%


Real number (ℝ)

HIGH CORRELATION 

Distinct3525
Distinct (%)95.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26408.641
Minimum2306
Maximum107580
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size32.6 KiB
2024-03-14T23:26:07.286364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2306
5-th percentile4041.85
Q110861.5
median18087.5
Q330396.5
95-th percentile93828.55
Maximum107580
Range105274
Interquartile range (IQR)19535

Descriptive statistics

Standard deviation25379.624
Coefficient of variation (CV)0.96103483
Kurtosis2.5514473
Mean26408.641
Median Absolute Deviation (MAD)10507.5
Skewness1.8394771
Sum97447886
Variance6.4412532 × 108
MonotonicityNot monotonic
2024-03-14T23:26:07.753550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
31932 3
 
0.1%
6601 3
 
0.1%
13158 3
 
0.1%
17643 3
 
0.1%
12890 3
 
0.1%
12474 3
 
0.1%
6624 3
 
0.1%
7188 3
 
0.1%
32213 2
 
0.1%
12579 2
 
0.1%
Other values (3515) 3662
99.2%
ValueCountFrequency (%)
2306 1
< 0.1%
2330 1
< 0.1%
2346 1
< 0.1%
2353 1
< 0.1%
2368 1
< 0.1%
2384 1
< 0.1%
2400 1
< 0.1%
2407 1
< 0.1%
2422 1
< 0.1%
2437 1
< 0.1%
ValueCountFrequency (%)
107580 1
< 0.1%
107345 1
< 0.1%
107253 1
< 0.1%
107236 1
< 0.1%
107214 1
< 0.1%
107210 1
< 0.1%
107201 1
< 0.1%
107136 1
< 0.1%
107098 1
< 0.1%
107090 1
< 0.1%

구성비(퍼센트)
Real number (ℝ)

HIGH CORRELATION 

Distinct2797
Distinct (%)75.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.5769358
Minimum0.69059653
Maximum80.711643
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size32.6 KiB
2024-03-14T23:26:08.172912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.69059653
5-th percentile1.327927
Q13.1785701
median5.7041038
Q37.9975341
95-th percentile8.98
Maximum80.711643
Range80.021047
Interquartile range (IQR)4.8189641

Descriptive statistics

Standard deviation2.8918851
Coefficient of variation (CV)0.51854372
Kurtosis121.86185
Mean5.5769358
Median Absolute Deviation (MAD)2.3641038
Skewness4.5672399
Sum20578.893
Variance8.3629992
MonotonicityNot monotonic
2024-03-14T23:26:08.628358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8.41 12
 
0.3%
2.09 9
 
0.2%
8.14 9
 
0.2%
8.01 9
 
0.2%
2.11 9
 
0.2%
5.3 8
 
0.2%
7.98 8
 
0.2%
8.8 8
 
0.2%
5.82 7
 
0.2%
2.61 7
 
0.2%
Other values (2787) 3604
97.7%
ValueCountFrequency (%)
0.690596535 1
< 0.1%
0.697341015 1
< 0.1%
0.706316784 1
< 0.1%
0.707821249 1
< 0.1%
0.711100757 1
< 0.1%
0.714940273 1
< 0.1%
0.719957247 1
< 0.1%
0.724755833 1
< 0.1%
0.731248466 1
< 0.1%
0.739199415 1
< 0.1%
ValueCountFrequency (%)
80.71164347 1
< 0.1%
9.73 1
< 0.1%
9.724217336 1
< 0.1%
9.722961151 1
< 0.1%
9.72 1
< 0.1%
9.713581599 1
< 0.1%
9.712270198 1
< 0.1%
9.71 1
< 0.1%
9.70640457 1
< 0.1%
9.7 1
< 0.1%

성별_남
Real number (ℝ)

HIGH CORRELATION 

Distinct3318
Distinct (%)89.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13297.125
Minimum663
Maximum54132
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size32.6 KiB
2024-03-14T23:26:09.059405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum663
5-th percentile1315.25
Q14900.25
median9319.5
Q315574.25
95-th percentile48083
Maximum54132
Range53469
Interquartile range (IQR)10674

Descriptive statistics

Standard deviation13086.884
Coefficient of variation (CV)0.98418896
Kurtosis2.4989205
Mean13297.125
Median Absolute Deviation (MAD)5453
Skewness1.819957
Sum49066392
Variance1.7126653 × 108
MonotonicityNot monotonic
2024-03-14T23:26:09.501528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12141 5
 
0.1%
2260 5
 
0.1%
7694 4
 
0.1%
7649 4
 
0.1%
10126 4
 
0.1%
1172 3
 
0.1%
7555 3
 
0.1%
6933 3
 
0.1%
14491 3
 
0.1%
10603 3
 
0.1%
Other values (3308) 3653
99.0%
ValueCountFrequency (%)
663 1
< 0.1%
667 1
< 0.1%
669 1
< 0.1%
672 1
< 0.1%
675 1
< 0.1%
681 1
< 0.1%
684 1
< 0.1%
687 2
0.1%
690 1
< 0.1%
691 2
0.1%
ValueCountFrequency (%)
54132 1
< 0.1%
54122 1
< 0.1%
54083 1
< 0.1%
54069 1
< 0.1%
54032 1
< 0.1%
53989 1
< 0.1%
53962 1
< 0.1%
53948 1
< 0.1%
53933 2
0.1%
53905 1
< 0.1%

성별_여
Real number (ℝ)

HIGH CORRELATION 

Distinct3303
Distinct (%)89.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13111.516
Minimum1625
Maximum54028
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size32.6 KiB
2024-03-14T23:26:09.918368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1625
5-th percentile2497.8
Q15453.25
median9213
Q314762
95-th percentile45896.55
Maximum54028
Range52403
Interquartile range (IQR)9308.75

Descriptive statistics

Standard deviation12332.981
Coefficient of variation (CV)0.94062201
Kurtosis2.6398733
Mean13111.516
Median Absolute Deviation (MAD)4930
Skewness1.8570001
Sum48381494
Variance1.5210241 × 108
MonotonicityNot monotonic
2024-03-14T23:26:10.276397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3095 4
 
0.1%
9994 3
 
0.1%
2843 3
 
0.1%
14146 3
 
0.1%
8816 3
 
0.1%
4140 3
 
0.1%
14194 3
 
0.1%
16484 3
 
0.1%
3118 3
 
0.1%
5416 3
 
0.1%
Other values (3293) 3659
99.2%
ValueCountFrequency (%)
1625 1
< 0.1%
1639 1
< 0.1%
1655 1
< 0.1%
1663 1
< 0.1%
1667 1
< 0.1%
1678 1
< 0.1%
1683 1
< 0.1%
1687 1
< 0.1%
1702 1
< 0.1%
1717 1
< 0.1%
ValueCountFrequency (%)
54028 1
< 0.1%
53993 1
< 0.1%
53985 1
< 0.1%
53968 1
< 0.1%
53965 1
< 0.1%
53945 1
< 0.1%
53936 1
< 0.1%
53905 1
< 0.1%
53904 1
< 0.1%
53893 1
< 0.1%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size29.0 KiB
Minimum2023-12-29 00:00:00
Maximum2023-12-29 00:00:00
2024-03-14T23:26:10.466043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:10.629337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-14T23:26:02.306921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:25:59.085501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:00.141655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:01.251647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:02.563794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:25:59.344274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:00.415772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:01.508801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:02.846757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:25:59.624403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:00.708641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:01.789019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:03.118408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:25:59.881612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:00.977868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:26:02.051341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T23:26:10.758598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연월구분연령별구성비(퍼센트)성별_남성별_여
기준연월1.0000.0000.5510.0000.0000.0000.000
구분0.0001.0000.0000.8310.1450.8090.836
연령별0.5510.0001.0000.7640.7290.7780.756
0.0000.8310.7641.0000.4230.9930.987
구성비(퍼센트)0.0000.1450.7290.4231.0000.4280.452
성별_남0.0000.8090.7780.9930.4281.0000.972
성별_여0.0000.8360.7560.9870.4520.9721.000
2024-03-14T23:26:10.936533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분연령별
구분1.0000.000
연령별0.0001.000
2024-03-14T23:26:11.081326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구성비(퍼센트)성별_남성별_여구분연령별
1.0000.6860.9950.9940.4910.388
구성비(퍼센트)0.6861.0000.7130.6520.1090.464
성별_남0.9950.7131.0000.9790.4660.403
성별_여0.9940.6520.9791.0000.4970.379
구분0.4910.1090.4660.4971.0000.000
연령별0.3880.4640.4030.3790.0001.000

Missing values

2024-03-14T23:26:03.489764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T23:26:03.946353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준연월구분연령별구성비(퍼센트)성별_남성별_여데이터기준일자
02020-07수원시0∼4세433403.63821322315210252023-12-29
12020-07수원시5∼9세570994.79322529463276362023-12-29
22020-07수원시10∼14세570494.78902729455275942023-12-29
32020-07수원시15∼19세631125.29799132702304102023-12-29
42020-07수원시20∼24세868127.28750845006418062023-12-29
52020-07수원시25∼29세1002718.41733553525467462023-12-29
62020-07수원시30∼34세854647.17434945133403312023-12-29
72020-07수원시35∼39세962558.08020949129471262023-12-29
82020-07수원시40∼44세975368.18774349375481612023-12-29
92020-07수원시45∼49세1069928.98153553668533242023-12-29
기준연월구분연령별구성비(퍼센트)성별_남성별_여데이터기준일자
36802023-11영통구40 ∼ 44세343689.46758616906174622023-12-29
36812023-11영통구45 ∼ 49세315808.69955715543160372023-12-29
36822023-11영통구50 ∼ 54세320768.83619315921161552023-12-29
36832023-11영통구55 ∼ 59세248836.85468912208126752023-12-29
36842023-11영통구60 ∼ 64세207725.72220410603101692023-12-29
36852023-11영통구65 ∼ 69세127893.523073631664732023-12-29
36862023-11영통구70 ∼ 74세73022.011531340538972023-12-29
36872023-11영통구75 ∼ 79세52361.442396224929872023-12-29
36882023-11영통구80 ∼ 84세38451.059208148023652023-12-29
36892023-11영통구85세 이상30940.85232590121932023-12-29