Overview

Dataset statistics

Number of variables8
Number of observations31
Missing cells97
Missing cells (%)39.1%
Duplicate rows1
Duplicate rows (%)3.2%
Total size in memory2.1 KiB
Average record size in memory68.3 B

Variable types

Text3
Unsupported5

Dataset

Description서울특별시 강서구 기초생활수급자 동별 현황 자료
Author서울특별시 강서구
URLhttps://www.data.go.kr/data/15066242/fileData.do

Alerts

Dataset has 1 (3.2%) duplicate rowsDuplicates
기초생활보장 수급자구분별 현황 has 24 (77.4%) missing valuesMissing
Unnamed: 1 has 29 (93.5%) missing valuesMissing
Unnamed: 2 has 9 (29.0%) missing valuesMissing
Unnamed: 3 has 7 (22.6%) missing valuesMissing
Unnamed: 4 has 8 (25.8%) missing valuesMissing
Unnamed: 5 has 7 (22.6%) missing valuesMissing
Unnamed: 6 has 6 (19.4%) missing valuesMissing
Unnamed: 7 has 7 (22.6%) missing valuesMissing
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-13 00:21:00.315080
Analysis finished2023-12-13 00:21:00.761823
Duration0.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct7
Distinct (%)100.0%
Missing24
Missing (%)77.4%
Memory size380.0 B
2023-12-13T09:21:00.845763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length73
Median length11
Mean length16.428571
Min length2

Characters and Unicode

Total characters115
Distinct characters46
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)100.0%

Sample

1st row(2020년 05월)
2nd row서울특별시 강서구
3rd row자격 : 전체(중복제외)
4th row시도
5th row합계
ValueCountFrequency (%)
4
17.4%
서울특별시 3
13.0%
강서구 2
 
8.7%
2020년 1
 
4.3%
2020-6-30 1
 
4.3%
생활복지국 1
 
4.3%
출력부서 1
 
4.3%
육심석 1
 
4.3%
출력자 1
 
4.3%
11:37:31 1
 
4.3%
Other values (7) 7
30.4%
2023-12-13T09:21:01.084522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25
21.7%
6
 
5.2%
0 6
 
5.2%
: 6
 
5.2%
4
 
3.5%
2 4
 
3.5%
3
 
2.6%
3 3
 
2.6%
3
 
2.6%
3
 
2.6%
Other values (36) 52
45.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59
51.3%
Space Separator 25
21.7%
Decimal Number 19
 
16.5%
Other Punctuation 6
 
5.2%
Open Punctuation 2
 
1.7%
Close Punctuation 2
 
1.7%
Dash Punctuation 2
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
10.2%
4
 
6.8%
3
 
5.1%
3
 
5.1%
3
 
5.1%
3
 
5.1%
3
 
5.1%
3
 
5.1%
2
 
3.4%
2
 
3.4%
Other values (24) 27
45.8%
Decimal Number
ValueCountFrequency (%)
0 6
31.6%
2 4
21.1%
3 3
15.8%
1 3
15.8%
7 1
 
5.3%
6 1
 
5.3%
5 1
 
5.3%
Space Separator
ValueCountFrequency (%)
25
100.0%
Other Punctuation
ValueCountFrequency (%)
: 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59
51.3%
Common 56
48.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
 
10.2%
4
 
6.8%
3
 
5.1%
3
 
5.1%
3
 
5.1%
3
 
5.1%
3
 
5.1%
3
 
5.1%
2
 
3.4%
2
 
3.4%
Other values (24) 27
45.8%
Common
ValueCountFrequency (%)
25
44.6%
0 6
 
10.7%
: 6
 
10.7%
2 4
 
7.1%
3 3
 
5.4%
1 3
 
5.4%
( 2
 
3.6%
) 2
 
3.6%
- 2
 
3.6%
7 1
 
1.8%
Other values (2) 2
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59
51.3%
ASCII 56
48.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
25
44.6%
0 6
 
10.7%
: 6
 
10.7%
2 4
 
7.1%
3 3
 
5.4%
1 3
 
5.4%
( 2
 
3.6%
) 2
 
3.6%
- 2
 
3.6%
7 1
 
1.8%
Other values (2) 2
 
3.6%
Hangul
ValueCountFrequency (%)
6
 
10.2%
4
 
6.8%
3
 
5.1%
3
 
5.1%
3
 
5.1%
3
 
5.1%
3
 
5.1%
3
 
5.1%
2
 
3.4%
2
 
3.4%
Other values (24) 27
45.8%

Unnamed: 1
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing29
Missing (%)93.5%
Memory size380.0 B
2023-12-13T09:21:01.189082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters6
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row시군구
2nd row강서구
ValueCountFrequency (%)
시군구 1
50.0%
강서구 1
50.0%
2023-12-13T09:21:01.379765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Unnamed: 2
Text

MISSING 

Distinct22
Distinct (%)100.0%
Missing9
Missing (%)29.0%
Memory size380.0 B
2023-12-13T09:21:01.532974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length3.7727273
Min length2

Characters and Unicode

Total characters83
Distinct characters27
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st row읍면동
2nd row소계
3rd row염창동
4th row등촌1동
5th row등촌2동
ValueCountFrequency (%)
읍면동 1
 
4.5%
소계 1
 
4.5%
방화2동 1
 
4.5%
방화1동 1
 
4.5%
공항동 1
 
4.5%
우장산동 1
 
4.5%
발산1동 1
 
4.5%
가양3동 1
 
4.5%
가양2동 1
 
4.5%
가양1동 1
 
4.5%
Other values (12) 12
54.5%
2023-12-13T09:21:01.790279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21
25.3%
10
12.0%
7
 
8.4%
1 5
 
6.0%
3 4
 
4.8%
2 4
 
4.8%
3
 
3.6%
3
 
3.6%
3
 
3.6%
3
 
3.6%
Other values (17) 20
24.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67
80.7%
Decimal Number 16
 
19.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
31.3%
10
14.9%
7
 
10.4%
3
 
4.5%
3
 
4.5%
3
 
4.5%
3
 
4.5%
3
 
4.5%
2
 
3.0%
1
 
1.5%
Other values (11) 11
16.4%
Decimal Number
ValueCountFrequency (%)
1 5
31.2%
3 4
25.0%
2 4
25.0%
8 1
 
6.2%
6 1
 
6.2%
4 1
 
6.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67
80.7%
Common 16
 
19.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
31.3%
10
14.9%
7
 
10.4%
3
 
4.5%
3
 
4.5%
3
 
4.5%
3
 
4.5%
3
 
4.5%
2
 
3.0%
1
 
1.5%
Other values (11) 11
16.4%
Common
ValueCountFrequency (%)
1 5
31.2%
3 4
25.0%
2 4
25.0%
8 1
 
6.2%
6 1
 
6.2%
4 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67
80.7%
ASCII 16
 
19.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
21
31.3%
10
14.9%
7
 
10.4%
3
 
4.5%
3
 
4.5%
3
 
4.5%
3
 
4.5%
3
 
4.5%
2
 
3.0%
1
 
1.5%
Other values (11) 11
16.4%
ASCII
ValueCountFrequency (%)
1 5
31.2%
3 4
25.0%
2 4
25.0%
8 1
 
6.2%
6 1
 
6.2%
4 1
 
6.2%

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7
Missing (%)22.6%
Memory size380.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing8
Missing (%)25.8%
Memory size380.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7
Missing (%)22.6%
Memory size380.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing6
Missing (%)19.4%
Memory size380.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7
Missing (%)22.6%
Memory size380.0 B

Correlations

2023-12-13T09:21:01.857334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기초생활보장 수급자구분별 현황Unnamed: 1Unnamed: 2
기초생활보장 수급자구분별 현황1.0000.0000.000
Unnamed: 10.0001.0000.000
Unnamed: 20.0000.0001.000

Missing values

2023-12-13T09:21:00.478864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T09:21:00.590265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T09:21:00.683660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

기초생활보장 수급자구분별 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7
0(2020년 05월)<NA><NA>NaNNaNNaNNaNNaN
1<NA><NA><NA>NaNNaNNaNNaNNaN
2서울특별시 강서구<NA><NA>NaNNaNNaN페이지 : 1 / 1NaN
3자격 : 전체(중복제외)<NA><NA>NaNNaNNaN(단위: 가구, 명)NaN
4시도시군구읍면동합계NaN일반수급자NaN시설수급자
5<NA><NA><NA>가구수수급권자수가구수수급권자수수급권자수
6합계<NA><NA>18219251871807525043144
7서울특별시강서구소계18219251871807525043144
8<NA><NA>염창동1291791291790
9<NA><NA>등촌1동1632271632270
기초생활보장 수급자구분별 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7
21<NA><NA>가양3동15712130156821273
22<NA><NA>발산1동7101217698120512
23<NA><NA>우장산동3194303144255
24<NA><NA>공항동6418726398702
25<NA><NA>방화1동65492464391311
26<NA><NA>방화2동105313571041134512
27<NA><NA>방화3동14901949148819472
28<NA><NA><NA>NaNNaNNaNNaNNaN
29<NA><NA><NA>NaNNaNNaNNaNNaN
30출력일자 : 2020-6-30 11:37:31 출력자 : 육심석 출력부서 : 서울특별시 강서구 생활복지국 생활보장과<NA><NA>NaNNaNNaNNaNNaN

Duplicate rows

Most frequently occurring

기초생활보장 수급자구분별 현황Unnamed: 1Unnamed: 2# duplicates
0<NA><NA><NA>4