Overview

Dataset statistics

Number of variables9
Number of observations548
Missing cells1069
Missing cells (%)21.7%
Duplicate rows24
Duplicate rows (%)4.4%
Total size in memory39.2 KiB
Average record size in memory73.2 B

Variable types

Unsupported6
Text1
Categorical2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15640/F/1/datasetView.do

Alerts

Dataset has 24 (4.4%) duplicate rowsDuplicates
Unnamed: 2 is highly overall correlated with Unnamed: 3High correlation
Unnamed: 3 is highly overall correlated with Unnamed: 2High correlation
Unnamed: 0 has 548 (100.0%) missing valuesMissing
서울시 자치구별 연료별 자동차등록 현황(기준일 : 2021년 1월말) has 521 (95.1%) missing valuesMissing
Unnamed: 0 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-06 11:20:24.542093
Analysis finished2024-04-06 11:20:25.278243
Duration0.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Unnamed: 0
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing548
Missing (%)100.0%
Memory size4.9 KiB
Distinct27
Distinct (%)100.0%
Missing521
Missing (%)95.1%
Memory size4.4 KiB
2024-04-06T20:20:25.546138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.1111111
Min length2

Characters and Unicode

Total characters84
Distinct characters42
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row시군구별
2nd row합 계
3rd row종로구
4th row중구
5th row용산구
ValueCountFrequency (%)
시군구별 1
 
3.6%
종로구 1
 
3.6%
서대문구 1
 
3.6%
송파구 1
 
3.6%
강남구 1
 
3.6%
서초구 1
 
3.6%
관악구 1
 
3.6%
동작구 1
 
3.6%
영등포구 1
 
3.6%
금천구 1
 
3.6%
Other values (18) 18
64.3%
2024-04-06T20:20:26.135955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27
32.1%
4
 
4.8%
4
 
4.8%
3
 
3.6%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
Other values (32) 34
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 83
98.8%
Space Separator 1
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
27
32.5%
4
 
4.8%
4
 
4.8%
3
 
3.6%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
Other values (31) 33
39.8%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 83
98.8%
Common 1
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
32.5%
4
 
4.8%
4
 
4.8%
3
 
3.6%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
Other values (31) 33
39.8%
Common
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 83
98.8%
ASCII 1
 
1.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
27
32.5%
4
 
4.8%
4
 
4.8%
3
 
3.6%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
Other values (31) 33
39.8%
ASCII
ValueCountFrequency (%)
1
100.0%

Unnamed: 2
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
CNG
50 
경유
50 
기타연료
50 
엘피지
50 
전기
50 
Other values (10)
298 

Length

Max length13
Median length12
Mean length5.709854
Min length1

Unique

Unique2 ?
Unique (%)0.4%

Sample

1st row연료별
2nd row
3rd rowCNG
4th rowCNG
5th row경유

Common Values

ValueCountFrequency (%)
CNG 50
9.1%
경유 50
9.1%
기타연료 50
9.1%
엘피지 50
9.1%
전기 50
9.1%
휘발유 50
9.1%
휘발유(무연) 50
9.1%
하이브리드(휘발유+전기) 49
8.9%
수소 38
6.9%
하이브리드(LPG+전기) 37
6.8%
Other values (5) 74
13.5%

Length

2024-04-06T20:20:26.399036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
cng 50
9.1%
경유 50
9.1%
기타연료 50
9.1%
엘피지 50
9.1%
전기 50
9.1%
휘발유 50
9.1%
휘발유(무연 50
9.1%
하이브리드(휘발유+전기 49
8.9%
수소 38
6.9%
하이브리드(lpg+전기 37
6.8%
Other values (5) 74
13.5%

Unnamed: 3
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
비사업용
300 
사업용
246 
용도별
 
1
 
1

Length

Max length4
Median length4
Mean length3.5437956
Min length1

Unique

Unique2 ?
Unique (%)0.4%

Sample

1st row용도별
2nd row
3rd row비사업용
4th row사업용
5th row비사업용

Common Values

ValueCountFrequency (%)
비사업용 300
54.7%
사업용 246
44.9%
용도별 1
 
0.2%
1
 
0.2%

Length

2024-04-06T20:20:26.620981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T20:20:26.854231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
비사업용 300
54.7%
사업용 246
44.9%
용도별 1
 
0.2%
1
 
0.2%

Unnamed: 4
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size4.4 KiB

Unnamed: 5
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size4.4 KiB

Unnamed: 6
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size4.4 KiB

Unnamed: 7
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size4.4 KiB

Unnamed: 8
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size4.4 KiB

Correlations

2024-04-06T20:20:27.008686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
서울시 자치구별 연료별 자동차등록 현황(기준일 : 2021년 1월말)Unnamed: 2Unnamed: 3
서울시 자치구별 연료별 자동차등록 현황(기준일 : 2021년 1월말)1.0001.0001.000
Unnamed: 21.0001.0000.929
Unnamed: 31.0000.9291.000
2024-04-06T20:20:27.178894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 2Unnamed: 3
Unnamed: 21.0000.815
Unnamed: 30.8151.000
2024-04-06T20:20:27.314950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 2Unnamed: 3
Unnamed: 21.0000.815
Unnamed: 30.8151.000

Missing values

2024-04-06T20:20:24.843217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T20:20:25.183116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Unnamed: 0서울시 자치구별 연료별 자동차등록 현황(기준일 : 2021년 1월말)Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8
0<NA>시군구별연료별용도별승 용승 합화 물특 수
1<NA>합 계271448310606533003889663159552
2<NA>종로구CNG비사업용9422035
3<NA><NA>CNG사업용0890089
4<NA><NA>경유비사업용104723068382512817493
5<NA><NA>경유사업용6021264452968
6<NA><NA>기타연료비사업용131541096
7<NA><NA>기타연료사업용0082082
8<NA><NA>수소비사업용4840052
9<NA><NA>엘피지비사업용155922128202062
Unnamed: 0서울시 자치구별 연료별 자동차등록 현황(기준일 : 2021년 1월말)Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8
538<NA><NA>하이브리드(CNG+전기)사업용02002
539<NA><NA>하이브리드(LPG+전기)비사업용7100071
540<NA><NA>하이브리드(경유+전기)비사업용4200042
541<NA><NA>하이브리드(휘발유+전기)비사업용54400005440
542<NA><NA>하이브리드(휘발유+전기)사업용5700057
543<NA><NA>휘발유비사업용299582277030057
544<NA><NA>휘발유사업용7600076
545<NA><NA>휘발유(무연)비사업용459541828046000
546<NA><NA>휘발유(무연)사업용249000249
547<NA><NA>휘발유(유연)비사업용6100061

Duplicate rows

Most frequently occurring

서울시 자치구별 연료별 자동차등록 현황(기준일 : 2021년 1월말)Unnamed: 2Unnamed: 3# duplicates
0<NA>CNG사업용25
1<NA>경유비사업용25
2<NA>경유사업용25
3<NA>기타연료비사업용25
4<NA>기타연료사업용25
5<NA>수소비사업용25
7<NA>엘피지비사업용25
8<NA>엘피지사업용25
9<NA>전기비사업용25
10<NA>전기사업용25