Overview

Dataset statistics

Number of variables10
Number of observations43
Missing cells45
Missing cells (%)10.5%
Duplicate rows20
Duplicate rows (%)46.5%
Total size in memory3.5 KiB
Average record size in memory83.1 B

Variable types

Text1
Unsupported9

Dataset

Description부산광역시연제구미분양현황_20201231
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15030310

Alerts

Dataset has 20 (46.5%) duplicate rowsDuplicates
□ 부산광역시 민간/분양 미분양주택 현황(총괄) has 4 (9.3%) missing valuesMissing
Unnamed: 1 has 5 (11.6%) missing valuesMissing
Unnamed: 2 has 3 (7.0%) missing valuesMissing
Unnamed: 3 has 5 (11.6%) missing valuesMissing
Unnamed: 4 has 5 (11.6%) missing valuesMissing
Unnamed: 5 has 5 (11.6%) missing valuesMissing
Unnamed: 6 has 3 (7.0%) missing valuesMissing
Unnamed: 7 has 5 (11.6%) missing valuesMissing
Unnamed: 8 has 5 (11.6%) missing valuesMissing
Unnamed: 9 has 5 (11.6%) missing valuesMissing
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 16:17:51.893968
Analysis finished2023-12-10 16:17:52.571362
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct20
Distinct (%)51.3%
Missing4
Missing (%)9.3%
Memory size476.0 B
2023-12-11T01:17:52.712652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length3
Mean length3.8461538
Min length2

Characters and Unicode

Total characters150
Distinct characters51
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)2.6%

Sample

1st row시군별
2nd row부산광역시
3rd row중구
4th row서구
5th row동구
ValueCountFrequency (%)
부산광역시 3
 
6.8%
북구 2
 
4.5%
시군별 2
 
4.5%
해운대구 2
 
4.5%
기장군 2
 
4.5%
사상구 2
 
4.5%
수영구 2
 
4.5%
연제구 2
 
4.5%
강서구(경자청 2
 
4.5%
금정구 2
 
4.5%
Other values (14) 23
52.3%
2023-12-11T01:17:53.145256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32
21.3%
6
 
4.0%
5
 
3.3%
5
 
3.3%
5
 
3.3%
5
 
3.3%
4
 
2.7%
4
 
2.7%
4
 
2.7%
4
 
2.7%
Other values (41) 76
50.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 137
91.3%
Space Separator 5
 
3.3%
Open Punctuation 3
 
2.0%
Close Punctuation 3
 
2.0%
Other Punctuation 1
 
0.7%
Other Symbol 1
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
23.4%
6
 
4.4%
5
 
3.6%
5
 
3.6%
5
 
3.6%
4
 
2.9%
4
 
2.9%
4
 
2.9%
4
 
2.9%
4
 
2.9%
Other values (36) 64
46.7%
Space Separator
ValueCountFrequency (%)
5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 137
91.3%
Common 13
 
8.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
23.4%
6
 
4.4%
5
 
3.6%
5
 
3.6%
5
 
3.6%
4
 
2.9%
4
 
2.9%
4
 
2.9%
4
 
2.9%
4
 
2.9%
Other values (36) 64
46.7%
Common
ValueCountFrequency (%)
5
38.5%
( 3
23.1%
) 3
23.1%
/ 1
 
7.7%
1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 137
91.3%
ASCII 12
 
8.0%
Geometric Shapes 1
 
0.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
32
23.4%
6
 
4.4%
5
 
3.6%
5
 
3.6%
5
 
3.6%
4
 
2.9%
4
 
2.9%
4
 
2.9%
4
 
2.9%
4
 
2.9%
Other values (36) 64
46.7%
ASCII
ValueCountFrequency (%)
5
41.7%
( 3
25.0%
) 3
25.0%
/ 1
 
8.3%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)11.6%
Memory size476.0 B

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)7.0%
Memory size476.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)11.6%
Memory size476.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)11.6%
Memory size476.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)11.6%
Memory size476.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)7.0%
Memory size476.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)11.6%
Memory size476.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)11.6%
Memory size476.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)11.6%
Memory size476.0 B

Missing values

2023-12-11T01:17:52.062873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:17:52.239818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T01:17:52.408896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

□ 부산광역시 민간/분양 미분양주택 현황(총괄)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
0시군별전월대비\n미분양\n증감현황민간분양 주택('20. 12월)NaNNaNNaN민간분양 주택('20. 11월)NaNNaNNaN
1<NA>NaN전용 60㎡이하전용 60-85㎡전용 85㎡초과전용 60㎡이하전용 60-85㎡전용 85㎡초과
2부산광역시0478213152593874478213152593874
3중구000000000
4서구02351911210423519112104
5동구03271401771032714017710
6영도구05880588058805880
7부산진구010972776951251097277695125
8동래구0891880891880
9남구063173886317388
□ 부산광역시 민간/분양 미분양주택 현황(총괄)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
33북구030111903011190
34해운대구080088008
35사하구-81131130012112100
36금정구6691761502510150
37강서구000000000
38강서구(경자청)014123161021412316102
39연제구010011001
40수영구-4323290363330
41사상구-234201403622140
42기장군0381370381370

Duplicate rows

Most frequently occurring

□ 부산광역시 민간/분양 미분양주택 현황(총괄)# duplicates
19<NA>4
0강서구2
1강서구(경자청)2
2금정구2
3기장군2
4남구2
5동구2
6동래구2
7부산광역시2
8부산진구2