Overview

Dataset statistics

Number of variables10
Number of observations38
Missing cells233
Missing cells (%)61.3%
Duplicate rows1
Duplicate rows (%)2.6%
Total size in memory3.1 KiB
Average record size in memory83.5 B

Variable types

Text1
Unsupported9

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-1324/S/1/datasetView.do

Alerts

Dataset has 1 (2.6%) duplicate rowsDuplicates
속성 ID has 36 (94.7%) missing valuesMissing
ITM_BHT has 29 (76.3%) missing valuesMissing
ITM_ERY has 26 (68.4%) missing valuesMissing
ITM_TYPE has 27 (71.1%) missing valuesMissing
ITM_TRE_SO has 28 (73.7%) missing valuesMissing
ITM_LOCA has 31 (81.6%) missing valuesMissing
MGE_LVL has 1 (2.6%) missing valuesMissing
DME_STTN has 27 (71.1%) missing valuesMissing
VTN_ERY has 28 (73.7%) missing valuesMissing
ITM_BHT is an unsupported type, check if it needs cleaning or further analysisUnsupported
ITM_ERY is an unsupported type, check if it needs cleaning or further analysisUnsupported
ITM_TYPE is an unsupported type, check if it needs cleaning or further analysisUnsupported
ITM_TRE_SO is an unsupported type, check if it needs cleaning or further analysisUnsupported
ITM_LOCA is an unsupported type, check if it needs cleaning or further analysisUnsupported
MGE_LVL is an unsupported type, check if it needs cleaning or further analysisUnsupported
DME_STTN is an unsupported type, check if it needs cleaning or further analysisUnsupported
VTN_ERY is an unsupported type, check if it needs cleaning or further analysisUnsupported
ITM_LVL is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-11 04:38:28.524005
Analysis finished2023-12-11 04:38:29.152599
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

속성 ID
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing36
Missing (%)94.7%
Memory size436.0 B
2023-12-11T13:38:29.255858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3.5
Mean length3.5
Min length3

Characters and Unicode

Total characters7
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row속성명
2nd row속성유형
ValueCountFrequency (%)
속성명 1
50.0%
속성유형 1
50.0%
2023-12-11T13:38:29.593734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%

ITM_BHT
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing29
Missing (%)76.3%
Memory size436.0 B

ITM_ERY
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing26
Missing (%)68.4%
Memory size436.0 B

ITM_TYPE
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing27
Missing (%)71.1%
Memory size436.0 B

ITM_TRE_SO
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing28
Missing (%)73.7%
Memory size436.0 B

ITM_LOCA
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing31
Missing (%)81.6%
Memory size436.0 B

MGE_LVL
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)2.6%
Memory size436.0 B

DME_STTN
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing27
Missing (%)71.1%
Memory size436.0 B

VTN_ERY
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing28
Missing (%)73.7%
Memory size436.0 B

ITM_LVL
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size436.0 B

Missing values

2023-12-11T13:38:28.633443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T13:38:28.853146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T13:38:29.019734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

속성 IDITM_BHTITM_ERYITM_TYPEITM_TRE_SOITM_LOCAMGE_LVLDME_STTNVTN_ERYITM_LVL
0속성명품계흉고품계활력품계수형품계수종품계위치관리등급피해현황식생활력품계등급
1속성유형NUMBER(11)NUMBER(11)NUMBER(11)VARCHAR2(50)VARCHAR2(100)NUMBER(22)NUMBER(11)NUMBER(11)NUMBER(22)
2<NA>000000000
3<NA>1111211119
4<NA>2222322220
5<NA>3333633321
6<NA>5564944522
7<NA>10696NaN105623
8<NA>1591021NaN216724
9<NA>NaN101566NaN227925
속성 IDITM_BHTITM_ERYITM_TYPEITM_TRE_SOITM_LOCAMGE_LVLDME_STTNVTN_ERYITM_LVL
28<NA>NaNNaNNaNNaNNaN45NaNNaN44
29<NA>NaNNaNNaNNaNNaN46NaNNaN45
30<NA>NaNNaNNaNNaNNaN47NaNNaN46
31<NA>NaNNaNNaNNaNNaN48NaNNaN47
32<NA>NaNNaNNaNNaNNaN49NaNNaN48
33<NA>NaNNaNNaNNaNNaN50NaNNaN49
34<NA>NaNNaNNaNNaNNaN51NaNNaN50
35<NA>NaNNaNNaNNaNNaN52NaNNaN51
36<NA>NaNNaNNaNNaNNaN54NaNNaN52
37<NA>NaNNaNNaNNaNNaNNaNNaNNaN54

Duplicate rows

Most frequently occurring

속성 ID# duplicates
0<NA>36