Overview

Dataset statistics

Number of variables13
Number of observations31
Missing cells49
Missing cells (%)12.2%
Duplicate rows1
Duplicate rows (%)3.2%
Total size in memory3.3 KiB
Average record size in memory108.3 B

Variable types

Unsupported11
Text2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-22240/F/1/datasetView.do

Alerts

Dataset has 1 (3.2%) duplicate rowsDuplicates
자치구별 보도 총괄현황 has 3 (9.7%) missing valuesMissing
Unnamed: 1 has 4 (12.9%) missing valuesMissing
Unnamed: 2 has 2 (6.5%) missing valuesMissing
Unnamed: 3 has 3 (9.7%) missing valuesMissing
Unnamed: 4 has 2 (6.5%) missing valuesMissing
Unnamed: 5 has 3 (9.7%) missing valuesMissing
Unnamed: 6 has 2 (6.5%) missing valuesMissing
Unnamed: 7 has 3 (9.7%) missing valuesMissing
Unnamed: 8 has 1 (3.2%) missing valuesMissing
Unnamed: 9 has 3 (9.7%) missing valuesMissing
Unnamed: 10 has 2 (6.5%) missing valuesMissing
Unnamed: 11 has 3 (9.7%) missing valuesMissing
Unnamed: 12 has 18 (58.1%) missing valuesMissing
자치구별 보도 총괄현황 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-01-06 04:56:34.682705
Analysis finished2024-01-06 04:56:37.316802
Duration2.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

자치구별 보도 총괄현황
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)9.7%
Memory size380.0 B

Unnamed: 1
Text

MISSING 

Distinct27
Distinct (%)100.0%
Missing4
Missing (%)12.9%
Memory size380.0 B
2024-01-06T04:56:37.656017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.2592593
Min length2

Characters and Unicode

Total characters88
Distinct characters44
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row관리 기관
2nd row종로구
3rd row중구
4th row용산구
5th row성동구
ValueCountFrequency (%)
관리 1
 
3.6%
기관 1
 
3.6%
강동구 1
 
3.6%
송파구 1
 
3.6%
강남구 1
 
3.6%
서초구 1
 
3.6%
관악구 1
 
3.6%
동작구 1
 
3.6%
영등포구 1
 
3.6%
금천구 1
 
3.6%
Other values (18) 18
64.3%
2024-01-06T04:56:38.817855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26
29.5%
4
 
4.5%
4
 
4.5%
4
 
4.5%
3
 
3.4%
2
 
2.3%
2
 
2.3%
2
 
2.3%
2
 
2.3%
2
 
2.3%
Other values (34) 37
42.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 87
98.9%
Space Separator 1
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
29.9%
4
 
4.6%
4
 
4.6%
4
 
4.6%
3
 
3.4%
2
 
2.3%
2
 
2.3%
2
 
2.3%
2
 
2.3%
2
 
2.3%
Other values (33) 36
41.4%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 87
98.9%
Common 1
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
29.9%
4
 
4.6%
4
 
4.6%
4
 
4.6%
3
 
3.4%
2
 
2.3%
2
 
2.3%
2
 
2.3%
2
 
2.3%
2
 
2.3%
Other values (33) 36
41.4%
Common
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 87
98.9%
ASCII 1
 
1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
26
29.9%
4
 
4.6%
4
 
4.6%
4
 
4.6%
3
 
3.4%
2
 
2.3%
2
 
2.3%
2
 
2.3%
2
 
2.3%
2
 
2.3%
Other values (33) 36
41.4%
ASCII
ValueCountFrequency (%)
1
100.0%

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.5%
Memory size380.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)9.7%
Memory size380.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.5%
Memory size380.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)9.7%
Memory size380.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.5%
Memory size380.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)9.7%
Memory size380.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)3.2%
Memory size380.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)9.7%
Memory size380.0 B

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.5%
Memory size380.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)9.7%
Memory size380.0 B

Unnamed: 12
Text

MISSING 

Distinct10
Distinct (%)76.9%
Missing18
Missing (%)58.1%
Memory size380.0 B
2024-01-06T04:56:39.260497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length13.076923
Min length10

Characters and Unicode

Total characters170
Distinct characters57
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)61.5%

Sample

1st row(2021. 12. 31.기준)
2nd row비 고 (기존 대비 증감사유)
3rd row일부 연장, 면적 정정
4th row보도 신설 및 확장
5th row노선 연장 오류 정정
ValueCountFrequency (%)
6
 
11.8%
보도 5
 
9.8%
확장 4
 
7.8%
신설 4
 
7.8%
연장 3
 
5.9%
정정 3
 
5.9%
일부 2
 
3.9%
면적 2
 
3.9%
정비 2
 
3.9%
위례중앙로 1
 
2.0%
Other values (19) 19
37.3%
2024-01-06T04:56:40.259895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
22.4%
11
 
6.5%
7
 
4.1%
7
 
4.1%
6
 
3.5%
6
 
3.5%
. 6
 
3.5%
6
 
3.5%
1 5
 
2.9%
2 5
 
2.9%
Other values (47) 73
42.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 103
60.6%
Space Separator 38
 
22.4%
Decimal Number 13
 
7.6%
Other Punctuation 9
 
5.3%
Open Punctuation 3
 
1.8%
Close Punctuation 3
 
1.8%
Control 1
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
10.7%
7
 
6.8%
7
 
6.8%
6
 
5.8%
6
 
5.8%
6
 
5.8%
4
 
3.9%
4
 
3.9%
4
 
3.9%
3
 
2.9%
Other values (36) 45
43.7%
Decimal Number
ValueCountFrequency (%)
1 5
38.5%
2 5
38.5%
3 2
 
15.4%
0 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
. 6
66.7%
, 2
 
22.2%
' 1
 
11.1%
Space Separator
ValueCountFrequency (%)
38
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 103
60.6%
Common 67
39.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
10.7%
7
 
6.8%
7
 
6.8%
6
 
5.8%
6
 
5.8%
6
 
5.8%
4
 
3.9%
4
 
3.9%
4
 
3.9%
3
 
2.9%
Other values (36) 45
43.7%
Common
ValueCountFrequency (%)
38
56.7%
. 6
 
9.0%
1 5
 
7.5%
2 5
 
7.5%
( 3
 
4.5%
) 3
 
4.5%
3 2
 
3.0%
, 2
 
3.0%
' 1
 
1.5%
1
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 103
60.6%
ASCII 67
39.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38
56.7%
. 6
 
9.0%
1 5
 
7.5%
2 5
 
7.5%
( 3
 
4.5%
) 3
 
4.5%
3 2
 
3.0%
, 2
 
3.0%
' 1
 
1.5%
1
 
1.5%
Hangul
ValueCountFrequency (%)
11
 
10.7%
7
 
6.8%
7
 
6.8%
6
 
5.8%
6
 
5.8%
6
 
5.8%
4
 
3.9%
4
 
3.9%
4
 
3.9%
3
 
2.9%
Other values (36) 45
43.7%

Correlations

2024-01-06T04:56:40.569885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 1Unnamed: 12
Unnamed: 11.0001.000
Unnamed: 121.0001.000

Missing values

2024-01-06T04:56:35.137188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-06T04:56:35.870474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-06T04:56:36.613569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

자치구별 보도 총괄현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12
0NaN<NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaN(2021. 12. 31.기준)
1연번관리 기관NaN특별시도NaN구 도NaN포장재 현황NaNNaNNaN비 고 (기존 대비 증감사유)
2NaN<NA>연  장(m)면  적(㎡)연  장(m)면  적(㎡)연  장(m)면  적(㎡)불투수면적(㎡)NaN투수면적(㎡)NaN<NA>
3NaN<NA>NaNNaNNaNNaNNaNNaN특별시도구도특별시도구도<NA>
4<NA>3094943.910770704.2751698889.66800239.2751396054.33970465.05993067.8033593081.8807171.472385154.1<NA>
51종로구805713037726580728026014764235122802602583200<NA>
62중구8487535051563822285226210536528923714864129480781160<NA>
73용산구69635234381.554665197298.51497037083194797.53343325013650<NA>
84성동구102311.5295053.6573381.5235705.652893059348227607.35555568098.33792<NA>
95광진구73245.5279100.751073.5212504.72217266596197055.766596154490<NA>
자치구별 보도 총괄현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12
2117구로구148334418906619522093398638220956719945720355098825460부광로 시도 인정('21.12.3.)
2218금천구90014206177359691194135404586764104744.58571014668.51054보도 신설 및 확장
2319영등포구162240.4582132.19405037772368190.4204409.136617320348111550503보도 신설 및 확장
2420동작구86007.5289281.244264.5185747.241743103534170218.296645155296889<NA>
2521관악구1215203426376717023916054350103477227790101247113702230보도 신설 및 정비
2622서초구18755078368295054551867924962318155308891982742097833541일부 연장, 면적 정정
2723강남구1472849231021167205919733056433112943822328061715375056945<NA>
2824송파구203629796362.275519445693.7128110350668.540631733179339376.718875.5위례중앙로 관리청 이관 등
2925강동구173135576567.366449280254.3106686296313151457.325954612879736767주택재건축 정비사업 및 보도정비
3026서울시설공단1486247763148624776300366980110650<NA>

Duplicate rows

Most frequently occurring

Unnamed: 1Unnamed: 12# duplicates
0<NA><NA>3