Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory566.4 KiB
Average record size in memory58.0 B

Variable types

Numeric2
Categorical3
Text1

Dataset

Description경기도 국민기초생활 수급자 현황
Author가평군
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=7J2F4JSEC5TMGAWEHQSG32596011&infSeq=1

Alerts

인원수 has 595 (5.9%) zerosZeros

Reproduction

Analysis started2024-04-11 01:57:43.402175
Analysis finished2024-04-11 01:57:45.709233
Duration2.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct11
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019.1311
Minimum2014
Maximum2024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-11T10:57:45.757820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2014
5-th percentile2015
Q12017
median2019
Q32021
95-th percentile2023
Maximum2024
Range10
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.6378037
Coefficient of variation (CV)0.0013064054
Kurtosis-1.175982
Mean2019.1311
Median Absolute Deviation (MAD)2
Skewness-0.0073175149
Sum20191311
Variance6.9580086
MonotonicityNot monotonic
2024-04-11T10:57:45.848604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
2022 1177
11.8%
2020 1131
11.3%
2018 1126
11.3%
2016 1115
11.2%
2021 1102
11.0%
2017 1048
10.5%
2019 1047
10.5%
2023 1024
10.2%
2015 986
9.9%
2024 208
 
2.1%
ValueCountFrequency (%)
2014 36
 
0.4%
2015 986
9.9%
2016 1115
11.2%
2017 1048
10.5%
2018 1126
11.3%
2019 1047
10.5%
2020 1131
11.3%
2021 1102
11.0%
2022 1177
11.8%
2023 1024
10.2%
ValueCountFrequency (%)
2024 208
 
2.1%
2023 1024
10.2%
2022 1177
11.8%
2021 1102
11.0%
2020 1131
11.3%
2019 1047
10.5%
2018 1126
11.3%
2017 1048
10.5%
2016 1115
11.2%
2015 986
9.9%

시군구명
Categorical

Distinct39
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
수원시 영통구
 
689
안양시
 
578
용인시
 
502
평택시
 
494
화성시
 
461
Other values (34)
7276 

Length

Max length8
Median length3
Mean length4.5535
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남양주시
2nd row성남시 수정구
3rd row안산시 단원구
4th row광주시
5th row연천군

Common Values

ValueCountFrequency (%)
수원시 영통구 689
 
6.9%
안양시 578
 
5.8%
용인시 502
 
5.0%
평택시 494
 
4.9%
화성시 461
 
4.6%
성남시 분당구 447
 
4.5%
부천시 436
 
4.4%
파주시 402
 
4.0%
고양시 덕양구 365
 
3.6%
성남시 수정구 347
 
3.5%
Other values (29) 5279
52.8%

Length

2024-04-11T10:57:45.955928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
성남시 1017
 
7.5%
수원시 742
 
5.5%
고양시 741
 
5.4%
영통구 689
 
5.1%
안산시 615
 
4.5%
안양시 578
 
4.2%
용인시 502
 
3.7%
평택시 494
 
3.6%
경기도 489
 
3.6%
화성시 461
 
3.4%
Other values (34) 7276
53.5%
Distinct619
Distinct (%)6.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-11T10:57:46.193420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.3845
Min length2

Characters and Unicode

Total characters33845
Distinct characters210
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.1%

Sample

1st row오남읍
2nd row신흥3동
3rd row단원구
4th row신현동
5th row군남면
ValueCountFrequency (%)
중앙동 161
 
1.6%
금곡동 51
 
0.5%
풍산동 46
 
0.5%
반월동 43
 
0.4%
부곡동 42
 
0.4%
신장1동 41
 
0.4%
신촌동 41
 
0.4%
정자2동 39
 
0.4%
고등동 37
 
0.4%
부림동 37
 
0.4%
Other values (609) 9462
94.6%
2024-04-11T10:57:46.574206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7848
23.2%
1712
 
5.1%
1 1356
 
4.0%
2 1334
 
3.9%
694
 
2.1%
3 654
 
1.9%
632
 
1.9%
572
 
1.7%
553
 
1.6%
521
 
1.5%
Other values (200) 17969
53.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30095
88.9%
Decimal Number 3750
 
11.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7848
26.1%
1712
 
5.7%
694
 
2.3%
632
 
2.1%
572
 
1.9%
553
 
1.8%
521
 
1.7%
488
 
1.6%
387
 
1.3%
385
 
1.3%
Other values (191) 16303
54.2%
Decimal Number
ValueCountFrequency (%)
1 1356
36.2%
2 1334
35.6%
3 654
17.4%
4 194
 
5.2%
5 62
 
1.7%
6 58
 
1.5%
7 48
 
1.3%
8 27
 
0.7%
9 17
 
0.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 30095
88.9%
Common 3750
 
11.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7848
26.1%
1712
 
5.7%
694
 
2.3%
632
 
2.1%
572
 
1.9%
553
 
1.8%
521
 
1.7%
488
 
1.6%
387
 
1.3%
385
 
1.3%
Other values (191) 16303
54.2%
Common
ValueCountFrequency (%)
1 1356
36.2%
2 1334
35.6%
3 654
17.4%
4 194
 
5.2%
5 62
 
1.7%
6 58
 
1.5%
7 48
 
1.3%
8 27
 
0.7%
9 17
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 30095
88.9%
ASCII 3750
 
11.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7848
26.1%
1712
 
5.7%
694
 
2.3%
632
 
2.1%
572
 
1.9%
553
 
1.8%
521
 
1.7%
488
 
1.6%
387
 
1.3%
385
 
1.3%
Other values (191) 16303
54.2%
ASCII
ValueCountFrequency (%)
1 1356
36.2%
2 1334
35.6%
3 654
17.4%
4 194
 
5.2%
5 62
 
1.7%
6 58
 
1.5%
7 48
 
1.3%
8 27
 
0.7%
9 17
 
0.5%

자격
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
기초생계급여
2762 
기초의료급여
2487 
기초주거급여
2393 
기초교육급여
1891 
<NA>
440 
Other values (2)
 
27

Length

Max length6
Median length6
Mean length5.912
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기초주거급여
2nd row기초의료급여
3rd row기초주거급여
4th row기초주거급여
5th row기초의료급여

Common Values

ValueCountFrequency (%)
기초생계급여 2762
27.6%
기초의료급여 2487
24.9%
기초주거급여 2393
23.9%
기초교육급여 1891
18.9%
<NA> 440
 
4.4%
기초생활수급 15
 
0.1%
기초생활보장 12
 
0.1%

Length

2024-04-11T10:57:46.696884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-11T10:57:46.796257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기초생계급여 2762
27.6%
기초의료급여 2487
24.9%
기초주거급여 2393
23.9%
기초교육급여 1891
18.9%
na 440
 
4.4%
기초생활수급 15
 
0.1%
기초생활보장 12
 
0.1%

연령구간
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
18~64세
3605 
18세미만
3479 
65세이상
2916 

Length

Max length6
Median length5
Mean length5.3605
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row65세이상
2nd row65세이상
3rd row65세이상
4th row18~64세
5th row65세이상

Common Values

ValueCountFrequency (%)
18~64세 3605
36.0%
18세미만 3479
34.8%
65세이상 2916
29.2%

Length

2024-04-11T10:57:46.903240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-11T10:57:46.985185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
18~64세 3605
36.0%
18세미만 3479
34.8%
65세이상 2916
29.2%

인원수
Real number (ℝ)

ZEROS 

Distinct722
Distinct (%)7.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean100.2297
Minimum0
Maximum6309
Zeros595
Zeros (%)5.9%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-11T10:57:47.078384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q17
median29
Q397
95-th percentile385.05
Maximum6309
Range6309
Interquartile range (IQR)90

Descriptive statistics

Standard deviation269.85574
Coefficient of variation (CV)2.6923731
Kurtosis146.89163
Mean100.2297
Median Absolute Deviation (MAD)27
Skewness10.162491
Sum1002297
Variance72822.123
MonotonicityNot monotonic
2024-04-11T10:57:47.207718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 595
 
5.9%
1 408
 
4.1%
2 340
 
3.4%
3 300
 
3.0%
5 258
 
2.6%
4 252
 
2.5%
6 241
 
2.4%
9 186
 
1.9%
8 182
 
1.8%
7 181
 
1.8%
Other values (712) 7057
70.6%
ValueCountFrequency (%)
0 595
5.9%
1 408
4.1%
2 340
3.4%
3 300
3.0%
4 252
2.5%
5 258
2.6%
6 241
2.4%
7 181
 
1.8%
8 182
 
1.8%
9 186
 
1.9%
ValueCountFrequency (%)
6309 1
< 0.1%
5292 1
< 0.1%
5199 1
< 0.1%
5126 1
< 0.1%
4873 1
< 0.1%
4784 1
< 0.1%
4557 1
< 0.1%
4383 1
< 0.1%
4316 1
< 0.1%
3801 1
< 0.1%

Interactions

2024-04-11T10:57:45.362664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-11T10:57:45.153168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-11T10:57:45.452434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-11T10:57:45.283666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-11T10:57:47.283855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도시군구명자격연령구간인원수
연도1.0000.4730.0580.0000.000
시군구명0.4731.0000.2630.0320.274
자격0.0580.2631.0000.2860.071
연령구간0.0000.0320.2861.0000.084
인원수0.0000.2740.0710.0841.000
2024-04-11T10:57:47.381185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군구명자격연령구간
시군구명1.0000.1150.014
자격0.1151.0000.123
연령구간0.0140.1231.000
2024-04-11T10:57:47.467433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도인원수시군구명자격연령구간
연도1.0000.1070.1820.0720.000
인원수0.1071.0000.0980.0370.050
시군구명0.1820.0981.0000.1150.014
자격0.0720.0370.1151.0000.123
연령구간0.0000.0500.0140.1231.000

Missing values

2024-04-11T10:57:45.578930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-11T10:57:45.665950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도시군구명읍면동명자격연령구간인원수
353692018남양주시오남읍기초주거급여65세이상41
295962020성남시 수정구신흥3동기초의료급여65세이상179
185552018안산시 단원구단원구기초주거급여65세이상2571
86172022광주시신현동기초주거급여18~64세98
286242015연천군군남면기초의료급여65세이상78
112162023안양시호계1동기초의료급여65세이상20
146742015양평군청운면기초교육급여65세이상0
109342023안양시안양5동기초주거급여65세이상267
94882017안양시갈산동기초교육급여18세미만212
54782019용인시신갈동기초교육급여18~64세1
연도시군구명읍면동명자격연령구간인원수
236122015안성시안성3동기초의료급여18~64세29
382882020성남시 수정구시흥동기초교육급여65세이상0
444812022부천시대산동기초생계급여18세미만102
433172021화성시정남면기초주거급여65세이상47
251662017양주시회천1동기초의료급여18세미만10
303882020성남시 중원구상대원1동기초의료급여18세미만60
414232021광명시광명7동기초주거급여18~64세103
365862021가평군청평면기초의료급여18세미만1
119392020안양시석수1동기초의료급여18세미만10
66012020의정부시신곡2동기초의료급여65세이상33