Overview

Dataset statistics

Number of variables8
Number of observations342
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory22.5 KiB
Average record size in memory67.4 B

Variable types

Numeric2
Categorical5
Text1

Dataset

Description부산광역시_금정구_폐공가현황_20230320
Author부산광역시 금정구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15025833

Alerts

시군구 has constant value ""Constant
밀집구역 포함여부 has constant value ""Constant
주택유형 is highly imbalanced (74.1%)Imbalance
용도지역 is highly imbalanced (52.4%)Imbalance
연번 has unique valuesUnique
소재지 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:13:30.679728
Analysis finished2023-12-10 16:13:31.359094
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct342
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean171.5
Minimum1
Maximum342
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.1 KiB
2023-12-11T01:13:31.433115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile18.05
Q186.25
median171.5
Q3256.75
95-th percentile324.95
Maximum342
Range341
Interquartile range (IQR)170.5

Descriptive statistics

Standard deviation98.871128
Coefficient of variation (CV)0.57650804
Kurtosis-1.2
Mean171.5
Median Absolute Deviation (MAD)85.5
Skewness0
Sum58653
Variance9775.5
MonotonicityStrictly increasing
2023-12-11T01:13:31.560363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
227 1
 
0.3%
235 1
 
0.3%
234 1
 
0.3%
233 1
 
0.3%
232 1
 
0.3%
231 1
 
0.3%
230 1
 
0.3%
229 1
 
0.3%
228 1
 
0.3%
Other values (332) 332
97.1%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
342 1
0.3%
341 1
0.3%
340 1
0.3%
339 1
0.3%
338 1
0.3%
337 1
0.3%
336 1
0.3%
335 1
0.3%
334 1
0.3%
333 1
0.3%

시군구
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
부산광역시 금정구
342 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시 금정구
2nd row부산광역시 금정구
3rd row부산광역시 금정구
4th row부산광역시 금정구
5th row부산광역시 금정구

Common Values

ValueCountFrequency (%)
부산광역시 금정구 342
100.0%

Length

2023-12-11T01:13:31.675923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:13:31.754687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 342
50.0%
금정구 342
50.0%

소재지
Text

UNIQUE 

Distinct342
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2023-12-11T01:13:32.003937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length10.111111
Min length6

Characters and Unicode

Total characters3458
Distinct characters38
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique342 ?
Unique (%)100.0%

Sample

1st row구서동 420-45(101호)
2nd row구서동 816-6(지하)
3rd row금사동 26-1(307호)
4th row금사동 26-17
5th row금사동 358(201호)
ValueCountFrequency (%)
서동 215
31.4%
부곡동 70
 
10.2%
장전동 17
 
2.5%
금사동 9
 
1.3%
회동동 8
 
1.2%
청룡동 7
 
1.0%
남산동 5
 
0.7%
두구동 2
 
0.3%
노포동 2
 
0.3%
오륜동 2
 
0.3%
Other values (345) 347
50.7%
2023-12-11T01:13:32.392274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
350
10.1%
342
9.9%
- 324
 
9.4%
2 298
 
8.6%
3 271
 
7.8%
0 265
 
7.7%
1 245
 
7.1%
217
 
6.3%
5 168
 
4.9%
6 158
 
4.6%
Other values (28) 820
23.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1882
54.4%
Other Letter 844
24.4%
Space Separator 342
 
9.9%
Dash Punctuation 324
 
9.4%
Close Punctuation 33
 
1.0%
Open Punctuation 33
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
350
41.5%
217
25.7%
70
 
8.3%
70
 
8.3%
32
 
3.8%
17
 
2.0%
17
 
2.0%
10
 
1.2%
9
 
1.1%
8
 
0.9%
Other values (14) 44
 
5.2%
Decimal Number
ValueCountFrequency (%)
2 298
15.8%
3 271
14.4%
0 265
14.1%
1 245
13.0%
5 168
8.9%
6 158
8.4%
7 157
8.3%
4 135
7.2%
8 109
 
5.8%
9 76
 
4.0%
Space Separator
ValueCountFrequency (%)
342
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 324
100.0%
Close Punctuation
ValueCountFrequency (%)
) 33
100.0%
Open Punctuation
ValueCountFrequency (%)
( 33
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2614
75.6%
Hangul 844
 
24.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
350
41.5%
217
25.7%
70
 
8.3%
70
 
8.3%
32
 
3.8%
17
 
2.0%
17
 
2.0%
10
 
1.2%
9
 
1.1%
8
 
0.9%
Other values (14) 44
 
5.2%
Common
ValueCountFrequency (%)
342
13.1%
- 324
12.4%
2 298
11.4%
3 271
10.4%
0 265
10.1%
1 245
9.4%
5 168
6.4%
6 158
6.0%
7 157
6.0%
4 135
 
5.2%
Other values (4) 251
9.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2614
75.6%
Hangul 844
 
24.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
350
41.5%
217
25.7%
70
 
8.3%
70
 
8.3%
32
 
3.8%
17
 
2.0%
17
 
2.0%
10
 
1.2%
9
 
1.1%
8
 
0.9%
Other values (14) 44
 
5.2%
ASCII
ValueCountFrequency (%)
342
13.1%
- 324
12.4%
2 298
11.4%
3 271
10.4%
0 265
10.1%
1 245
9.4%
5 168
6.4%
6 158
6.0%
7 157
6.0%
4 135
 
5.2%
Other values (4) 251
9.6%

등급
Categorical

Distinct4
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
1
203 
2
103 
3
26 
4
 
10

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row2
5th row1

Common Values

ValueCountFrequency (%)
1 203
59.4%
2 103
30.1%
3 26
 
7.6%
4 10
 
2.9%

Length

2023-12-11T01:13:32.508088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:13:32.599824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 203
59.4%
2 103
30.1%
3 26
 
7.6%
4 10
 
2.9%

주택유형
Categorical

IMBALANCE 

Distinct5
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
단독
308 
다세대
 
22
연립
 
8
아파트
 
3
다가구
 
1

Length

Max length3
Median length2
Mean length2.0760234
Min length2

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row다세대
2nd row연립
3rd row연립
4th row단독
5th row다세대

Common Values

ValueCountFrequency (%)
단독 308
90.1%
다세대 22
 
6.4%
연립 8
 
2.3%
아파트 3
 
0.9%
다가구 1
 
0.3%

Length

2023-12-11T01:13:32.708162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:13:32.792510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
단독 308
90.1%
다세대 22
 
6.4%
연립 8
 
2.3%
아파트 3
 
0.9%
다가구 1
 
0.3%

용도지역
Categorical

IMBALANCE 

Distinct8
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
제3종일반주거지역
225 
제2종일반주거지역
79 
준주거지역
23 
제1종일반주거지역
 
8
일반상업지역
 
2
Other values (3)
 
5

Length

Max length9
Median length9
Mean length8.6637427
Min length5

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row일반상업지역
2nd row제2종일반주거지역
3rd row제2종일반주거지역
4th row제2종일반주거지역
5th row제2종일반주거지역

Common Values

ValueCountFrequency (%)
제3종일반주거지역 225
65.8%
제2종일반주거지역 79
 
23.1%
준주거지역 23
 
6.7%
제1종일반주거지역 8
 
2.3%
일반상업지역 2
 
0.6%
준공업지역 2
 
0.6%
개발제한구역 2
 
0.6%
자연녹지지역 1
 
0.3%

Length

2023-12-11T01:13:32.911041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:13:33.055457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제3종일반주거지역 225
65.8%
제2종일반주거지역 79
 
23.1%
준주거지역 23
 
6.7%
제1종일반주거지역 8
 
2.3%
일반상업지역 2
 
0.6%
준공업지역 2
 
0.6%
개발제한구역 2
 
0.6%
자연녹지지역 1
 
0.3%

면적
Real number (ℝ)

Distinct152
Distinct (%)44.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean84.440351
Minimum17
Maximum503
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.1 KiB
2023-12-11T01:13:33.213776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum17
5-th percentile30.15
Q143.525
median53
Q3103.5
95-th percentile232.9
Maximum503
Range486
Interquartile range (IQR)59.975

Descriptive statistics

Standard deviation74.721966
Coefficient of variation (CV)0.88490828
Kurtosis9.1578694
Mean84.440351
Median Absolute Deviation (MAD)10
Skewness2.7503338
Sum28878.6
Variance5583.3722
MonotonicityNot monotonic
2023-12-11T01:13:33.371577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
53.0 35
 
10.2%
43.0 31
 
9.1%
50.0 26
 
7.6%
46.0 17
 
5.0%
56.0 9
 
2.6%
36.0 9
 
2.6%
49.0 6
 
1.8%
40.0 6
 
1.8%
47.0 5
 
1.5%
60.0 5
 
1.5%
Other values (142) 193
56.4%
ValueCountFrequency (%)
17.0 1
 
0.3%
18.0 1
 
0.3%
22.0 1
 
0.3%
23.0 3
0.9%
25.0 1
 
0.3%
26.0 4
1.2%
26.5 3
0.9%
28.0 1
 
0.3%
29.0 1
 
0.3%
29.3 1
 
0.3%
ValueCountFrequency (%)
503.0 1
0.3%
480.0 1
0.3%
450.0 1
0.3%
436.0 1
0.3%
380.1 1
0.3%
380.0 1
0.3%
327.0 1
0.3%
324.0 1
0.3%
317.0 1
0.3%
304.0 1
0.3%

밀집구역 포함여부
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
미포함
342 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미포함
2nd row미포함
3rd row미포함
4th row미포함
5th row미포함

Common Values

ValueCountFrequency (%)
미포함 342
100.0%

Length

2023-12-11T01:13:33.516301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:13:33.600034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미포함 342
100.0%

Interactions

2023-12-11T01:13:31.058599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:13:30.922595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:13:31.128516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:13:30.994000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:13:33.694166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등급주택유형용도지역면적
연번1.0000.3120.4910.5620.547
등급0.3121.0000.1240.4610.465
주택유형0.4910.1241.0000.4620.000
용도지역0.5620.4610.4621.0000.732
면적0.5470.4650.0000.7321.000
2023-12-11T01:13:33.816862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용도지역주택유형등급
용도지역1.0000.3040.218
주택유형0.3041.0000.101
등급0.2180.1011.000
2023-12-11T01:13:33.930704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번면적등급주택유형용도지역
연번1.000-0.2060.1890.2230.309
면적-0.2061.0000.2930.0000.463
등급0.1890.2931.0000.1010.218
주택유형0.2230.0000.1011.0000.304
용도지역0.3090.4630.2180.3041.000

Missing values

2023-12-11T01:13:31.220061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:13:31.320165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시군구소재지등급주택유형용도지역면적밀집구역 포함여부
01부산광역시 금정구구서동 420-45(101호)1다세대일반상업지역60.5미포함
12부산광역시 금정구구서동 816-6(지하)1연립제2종일반주거지역198.9미포함
23부산광역시 금정구금사동 26-1(307호)1연립제2종일반주거지역43.5미포함
34부산광역시 금정구금사동 26-172단독제2종일반주거지역154.0미포함
45부산광역시 금정구금사동 358(201호)1다세대제2종일반주거지역80.6미포함
56부산광역시 금정구금사동 416-2(107호)1연립제2종일반주거지역53.1미포함
67부산광역시 금정구금사동 58-181단독제2종일반주거지역159.0미포함
78부산광역시 금정구금사동 58-62단독제2종일반주거지역176.8미포함
89부산광역시 금정구금사동 64-8(203호)1다세대제2종일반주거지역49.8미포함
910부산광역시 금정구금사동 68-14(207호)1연립준공업지역48.9미포함
연번시군구소재지등급주택유형용도지역면적밀집구역 포함여부
332333부산광역시 금정구청룡동 3444단독제2종일반주거지역231.0미포함
333334부산광역시 금정구청룡동 345-12단독제2종일반주거지역196.0미포함
334335부산광역시 금정구회동동 167-261단독제2종일반주거지역143.9미포함
335336부산광역시 금정구회동동 174-1(204호)1다세대제2종일반주거지역44.7미포함
336337부산광역시 금정구회동동 197-23(105호)1다세대제2종일반주거지역34.8미포함
337338부산광역시 금정구회동동 201-4(506호)1연립제2종일반주거지역46.0미포함
338339부산광역시 금정구회동동 299-3(201호)2다세대제2종일반주거지역26.5미포함
339340부산광역시 금정구회동동 299-3(203호)1다세대제2종일반주거지역26.5미포함
340341부산광역시 금정구회동동 332-2(502호)1다세대제2종일반주거지역66.8미포함
341342부산광역시 금정구회동동 361(107호)1연립제2종일반주거지역43.6미포함