Overview

Dataset statistics

Number of variables7
Number of observations2928
Missing cells21
Missing cells (%)0.1%
Duplicate rows433
Duplicate rows (%)14.8%
Total size in memory163.1 KiB
Average record size in memory57.0 B

Variable types

Categorical5
Text1
Numeric1

Dataset

Description부산광역시_기장군_임대사업자현황_20230320
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15023429

Alerts

Dataset has 433 (14.8%) duplicate rowsDuplicates
사업자구분 is highly overall correlated with 종류High correlation
종류 is highly overall correlated with 사업자구분High correlation
임대주택구분 is highly imbalanced (64.6%)Imbalance

Reproduction

Analysis started2023-12-10 16:24:00.257985
Analysis finished2023-12-10 16:24:01.347593
Duration1.09 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

영업구분
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.0 KiB
등록영업중
1898 
전입영업중
1030 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전입영업중
2nd row전입영업중
3rd row전입영업중
4th row전입영업중
5th row전입영업중

Common Values

ValueCountFrequency (%)
등록영업중 1898
64.8%
전입영업중 1030
35.2%

Length

2023-12-11T01:24:01.411210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:24:01.510415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
등록영업중 1898
64.8%
전입영업중 1030
35.2%

사업자구분
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size23.0 KiB
임대사업자
1745 
일반형임대사업자
722 
매입임대사업자
318 
허가건설임대사업자
 
141
주택건설업자
 
2

Length

Max length9
Median length5
Mean length6.1502732
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row임대사업자
2nd row임대사업자
3rd row임대사업자
4th row임대사업자
5th row임대사업자

Common Values

ValueCountFrequency (%)
임대사업자 1745
59.6%
일반형임대사업자 722
24.7%
매입임대사업자 318
 
10.9%
허가건설임대사업자 141
 
4.8%
주택건설업자 2
 
0.1%

Length

2023-12-11T01:24:01.625505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:24:01.743404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
임대사업자 1745
59.6%
일반형임대사업자 722
24.7%
매입임대사업자 318
 
10.9%
허가건설임대사업자 141
 
4.8%
주택건설업자 2
 
0.1%

임대주택구분
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.0 KiB
민간매입임대주택
2420 
민간건설임대주택
491 
<NA>
 
9
기타
 
8

Length

Max length8
Median length8
Mean length7.9713115
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row민간매입임대주택
2nd row민간매입임대주택
3rd row민간매입임대주택
4th row민간매입임대주택
5th row민간매입임대주택

Common Values

ValueCountFrequency (%)
민간매입임대주택 2420
82.7%
민간건설임대주택 491
 
16.8%
<NA> 9
 
0.3%
기타 8
 
0.3%

Length

2023-12-11T01:24:01.860394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:24:01.964311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민간매입임대주택 2420
82.7%
민간건설임대주택 491
 
16.8%
na 9
 
0.3%
기타 8
 
0.3%
Distinct635
Distinct (%)21.8%
Missing9
Missing (%)0.3%
Memory size23.0 KiB
2023-12-11T01:24:02.259457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length45
Mean length25.434395
Min length2

Characters and Unicode

Total characters74243
Distinct characters417
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique399 ?
Unique (%)13.7%

Sample

1st row서울특별시 중랑구 면목동 592-2 (면목동)
2nd row
3rd row부산광역시 금정구 부곡동 297-1 부산대역삼정그린코아
4th row부산광역시 남구 대연동 384-26 벽산e-솔렌스힐
5th row부산광역시 연제구 연산동 1366-1 부산 더샵 시티애비뉴2차
ValueCountFrequency (%)
부산광역시 2027
 
13.9%
기장군 877
 
6.0%
기장읍 475
 
3.3%
정관읍 342
 
2.3%
장안읍 271
 
1.9%
경상남도 245
 
1.7%
예림리 224
 
1.5%
광안동 220
 
1.5%
대연동 205
 
1.4%
해운대구 200
 
1.4%
Other values (1422) 9512
65.2%
2023-12-11T01:24:02.746526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13314
 
17.9%
2932
 
3.9%
2893
 
3.9%
2541
 
3.4%
2441
 
3.3%
2433
 
3.3%
- 2274
 
3.1%
1 2252
 
3.0%
2175
 
2.9%
1792
 
2.4%
Other values (407) 39196
52.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43645
58.8%
Space Separator 13314
 
17.9%
Decimal Number 11839
 
15.9%
Dash Punctuation 2274
 
3.1%
Open Punctuation 1116
 
1.5%
Close Punctuation 1116
 
1.5%
Other Punctuation 768
 
1.0%
Uppercase Letter 122
 
0.2%
Lowercase Letter 48
 
0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2932
 
6.7%
2893
 
6.6%
2541
 
5.8%
2441
 
5.6%
2433
 
5.6%
2175
 
5.0%
1792
 
4.1%
1449
 
3.3%
1401
 
3.2%
1232
 
2.8%
Other values (366) 22356
51.2%
Uppercase Letter
ValueCountFrequency (%)
E 15
12.3%
W 13
10.7%
I 12
9.8%
K 12
9.8%
S 12
9.8%
H 11
9.0%
V 11
9.0%
B 8
6.6%
A 7
5.7%
L 7
5.7%
Other values (8) 14
11.5%
Decimal Number
ValueCountFrequency (%)
1 2252
19.0%
2 1513
12.8%
4 1425
12.0%
5 1094
9.2%
7 1066
9.0%
6 1015
8.6%
9 947
8.0%
3 919
7.8%
8 876
 
7.4%
0 732
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 765
99.6%
& 1
 
0.1%
. 1
 
0.1%
@ 1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
l 18
37.5%
e 12
25.0%
i 9
18.8%
s 9
18.8%
Space Separator
ValueCountFrequency (%)
13314
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2274
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1116
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1116
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43645
58.8%
Common 30427
41.0%
Latin 171
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2932
 
6.7%
2893
 
6.6%
2541
 
5.8%
2441
 
5.6%
2433
 
5.6%
2175
 
5.0%
1792
 
4.1%
1449
 
3.3%
1401
 
3.2%
1232
 
2.8%
Other values (366) 22356
51.2%
Latin
ValueCountFrequency (%)
l 18
 
10.5%
E 15
 
8.8%
W 13
 
7.6%
I 12
 
7.0%
K 12
 
7.0%
S 12
 
7.0%
e 12
 
7.0%
H 11
 
6.4%
V 11
 
6.4%
i 9
 
5.3%
Other values (13) 46
26.9%
Common
ValueCountFrequency (%)
13314
43.8%
- 2274
 
7.5%
1 2252
 
7.4%
2 1513
 
5.0%
4 1425
 
4.7%
( 1116
 
3.7%
) 1116
 
3.7%
5 1094
 
3.6%
7 1066
 
3.5%
6 1015
 
3.3%
Other values (8) 4242
 
13.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43645
58.8%
ASCII 30597
41.2%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13314
43.5%
- 2274
 
7.4%
1 2252
 
7.4%
2 1513
 
4.9%
4 1425
 
4.7%
( 1116
 
3.6%
) 1116
 
3.6%
5 1094
 
3.6%
7 1066
 
3.5%
6 1015
 
3.3%
Other values (30) 4412
 
14.4%
Hangul
ValueCountFrequency (%)
2932
 
6.7%
2893
 
6.6%
2541
 
5.8%
2441
 
5.6%
2433
 
5.6%
2175
 
5.0%
1792
 
4.1%
1449
 
3.3%
1401
 
3.2%
1232
 
2.8%
Other values (366) 22356
51.2%
Number Forms
ValueCountFrequency (%)
1
100.0%

종류
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size23.0 KiB
장기일반민간임대주택(8년)
1172 
장기일반민간임대주택(10년)
599 
준공공임대
327 
단기민간임대주택
311 
단기임대
188 
Other values (7)
331 

Length

Max length15
Median length14
Mean length11.592896
Min length4

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row준공공임대
2nd row단기민간임대주택
3rd row장기일반민간임대주택(8년)
4th row장기일반민간임대주택(8년)
5th row장기일반민간임대주택(8년)

Common Values

ValueCountFrequency (%)
장기일반민간임대주택(8년) 1172
40.0%
장기일반민간임대주택(10년) 599
20.5%
준공공임대 327
 
11.2%
단기민간임대주택 311
 
10.6%
단기임대 188
 
6.4%
공공지원민간임대주택(8년) 85
 
2.9%
공공지원민간임대주택(10년) 80
 
2.7%
매입임대주택 79
 
2.7%
5년임대주택(민간) 76
 
2.6%
<NA> 9
 
0.3%
Other values (2) 2
 
0.1%

Length

2023-12-11T01:24:02.883521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
장기일반민간임대주택(8년 1172
40.0%
장기일반민간임대주택(10년 599
20.5%
준공공임대 327
 
11.2%
단기민간임대주택 311
 
10.6%
단기임대 188
 
6.4%
공공지원민간임대주택(8년 85
 
2.9%
공공지원민간임대주택(10년 80
 
2.7%
매입임대주택 79
 
2.7%
5년임대주택(민간 76
 
2.6%
na 9
 
0.3%
Other values (2) 2
 
0.1%

유형
Categorical

Distinct10
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size23.0 KiB
준주택(오피스텔)
921 
다세대주택
627 
아파트
600 
다가구주택
397 
도시형생활주택
211 
Other values (5)
172 

Length

Max length12
Median length9
Mean length5.9515027
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row다세대주택
2nd row준주택(오피스텔)
3rd row아파트
4th row아파트
5th row준주택(오피스텔)

Common Values

ValueCountFrequency (%)
준주택(오피스텔) 921
31.5%
다세대주택 627
21.4%
아파트 600
20.5%
다가구주택 397
13.6%
도시형생활주택 211
 
7.2%
연립주택 80
 
2.7%
단독주택 70
 
2.4%
<NA> 12
 
0.4%
준주택(기숙사) 7
 
0.2%
아파트(도시형생활주택) 3
 
0.1%

Length

2023-12-11T01:24:03.000259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:24:03.115469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
준주택(오피스텔 921
31.5%
다세대주택 627
21.4%
아파트 600
20.5%
다가구주택 397
13.6%
도시형생활주택 211
 
7.2%
연립주택 80
 
2.7%
단독주택 70
 
2.4%
na 12
 
0.4%
준주택(기숙사 7
 
0.2%
아파트(도시형생활주택 3
 
0.1%

면적
Real number (ℝ)

Distinct999
Distinct (%)34.3%
Missing12
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean41.943176
Minimum9.905
Maximum1045.07
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.9 KiB
2023-12-11T01:24:03.247644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9.905
5-th percentile14.91
Q121.4
median27.8369
Q343.5136
95-th percentile84.233675
Maximum1045.07
Range1035.165
Interquartile range (IQR)22.1136

Descriptive statistics

Standard deviation75.257122
Coefficient of variation (CV)1.7942638
Kurtosis113.33123
Mean41.943176
Median Absolute Deviation (MAD)9.0669
Skewness9.936482
Sum122306.3
Variance5663.6343
MonotonicityNot monotonic
2023-12-11T01:24:03.383141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
49.863 56
 
1.9%
18.1037 45
 
1.5%
43.5136 45
 
1.5%
24.0 41
 
1.4%
39.754 32
 
1.1%
13.6 28
 
1.0%
26.665 28
 
1.0%
40.3936 26
 
0.9%
27.0147 24
 
0.8%
28.7692 24
 
0.8%
Other values (989) 2567
87.7%
ValueCountFrequency (%)
9.905 3
 
0.1%
11.596 1
 
< 0.1%
11.597 2
 
0.1%
11.902 8
0.3%
12.49 4
0.1%
12.63 4
0.1%
12.65 7
0.2%
12.725 2
 
0.1%
13.01 1
 
< 0.1%
13.43 2
 
0.1%
ValueCountFrequency (%)
1045.07 9
0.3%
659.58 1
 
< 0.1%
659.12 1
 
< 0.1%
658.32 1
 
< 0.1%
657.88 1
 
< 0.1%
653.44 1
 
< 0.1%
549.37 14
0.5%
511.68 1
 
< 0.1%
433.84 1
 
< 0.1%
341.2 1
 
< 0.1%

Interactions

2023-12-11T01:24:00.873649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:24:03.473462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업구분사업자구분임대주택구분종류유형면적
영업구분1.0000.2510.0980.3640.1500.094
사업자구분0.2511.0000.3750.8300.3820.108
임대주택구분0.0980.3751.0000.5670.4650.056
종류0.3640.8300.5671.0000.5210.184
유형0.1500.3820.4650.5211.0000.262
면적0.0940.1080.0560.1840.2621.000
2023-12-11T01:24:03.573957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자구분임대주택구분영업구분유형종류
사업자구분1.0000.3060.3060.2320.647
임대주택구분0.3061.0000.1630.2320.399
영업구분0.3060.1631.0000.1500.349
유형0.2320.2320.1501.0000.267
종류0.6470.3990.3490.2671.000
2023-12-11T01:24:03.663640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적영업구분사업자구분임대주택구분종류유형
면적1.0000.0700.0660.0350.0870.131
영업구분0.0701.0000.3060.1630.3490.150
사업자구분0.0660.3061.0000.3060.6470.232
임대주택구분0.0350.1630.3061.0000.3990.232
종류0.0870.3490.6470.3991.0000.267
유형0.1310.1500.2320.2320.2671.000

Missing values

2023-12-11T01:24:00.999066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:24:01.127112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T01:24:01.276140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

영업구분사업자구분임대주택구분지번주소종류유형면적
0전입영업중임대사업자민간매입임대주택서울특별시 중랑구 면목동 592-2 (면목동)준공공임대다세대주택27.19
1전입영업중임대사업자민간매입임대주택단기민간임대주택준주택(오피스텔)29.2538
2전입영업중임대사업자민간매입임대주택부산광역시 금정구 부곡동 297-1 부산대역삼정그린코아장기일반민간임대주택(8년)아파트22.913
3전입영업중임대사업자민간매입임대주택부산광역시 남구 대연동 384-26 벽산e-솔렌스힐장기일반민간임대주택(8년)아파트16.6503
4전입영업중임대사업자민간매입임대주택부산광역시 연제구 연산동 1366-1 부산 더샵 시티애비뉴2차장기일반민간임대주택(8년)준주택(오피스텔)29.98
5전입영업중임대사업자민간매입임대주택부산광역시 남구 대연동 384-26 벽산e-솔렌스힐장기일반민간임대주택(8년)아파트16.6503
6전입영업중일반형임대사업자민간매입임대주택부산광역시 남구 대연동 913-14 (대연동, 푸른마을)준공공임대다세대주택22.47
7전입영업중일반형임대사업자민간매입임대주택부산광역시 남구 대연동 913-14 (대연동, 푸른마을)준공공임대다세대주택22.12
8전입영업중일반형임대사업자민간매입임대주택부산광역시 남구 대연동 913-14 (대연동, 푸른마을)준공공임대다세대주택21.895
9전입영업중일반형임대사업자민간매입임대주택부산광역시 남구 대연동 913-14 (대연동, 푸른마을)준공공임대다세대주택22.47
영업구분사업자구분임대주택구분지번주소종류유형면적
2918등록영업중매입임대사업자민간매입임대주택부산광역시 수영구 광안동 90-60 (광안동, 남강하우스)장기일반민간임대주택(8년)다세대주택19.32
2919등록영업중매입임대사업자민간매입임대주택부산광역시 수영구 광안동 88-33 (광안동, 칸느하우스)장기일반민간임대주택(8년)준주택(오피스텔)20.86
2920등록영업중매입임대사업자민간매입임대주택부산광역시 수영구 광안동 88-33 (광안동, 칸느하우스)장기일반민간임대주택(8년)준주택(오피스텔)21.65
2921등록영업중매입임대사업자민간매입임대주택부산광역시 수영구 광안동 88-33 (광안동, 칸느하우스)장기일반민간임대주택(8년)준주택(오피스텔)21.65
2922등록영업중매입임대사업자민간매입임대주택부산광역시 수영구 광안동 88-33 (광안동, 칸느하우스)장기일반민간임대주택(8년)준주택(오피스텔)20.86
2923등록영업중매입임대사업자민간매입임대주택부산광역시 수영구 광안동 88-33 (광안동, 칸느하우스)장기일반민간임대주택(8년)다세대주택16.33
2924등록영업중일반형임대사업자민간매입임대주택경기도 화성시 장지동 (장지동, 금호어울림레이크)준공공임대아파트59.93
2925전입영업중매입임대사업자민간매입임대주택부산광역시 해운대구 좌동 1478-1 (좌동, 해운대 좌동 SK 허브 올리브)장기일반민간임대주택(8년)준주택(오피스텔)28.4668
2926전입영업중매입임대사업자민간매입임대주택부산광역시 해운대구 반여동 1619 (반여동, 센텀롯데캐슬아파트)매입임대주택아파트59.76
2927전입영업중매입임대사업자민간매입임대주택부산광역시 해운대구 우동 1228 (우동, 센텀센시빌)매입임대주택아파트84.94

Duplicate rows

Most frequently occurring

영업구분사업자구분임대주택구분지번주소종류유형면적# duplicates
82등록영업중임대사업자민간건설임대주택부산광역시 기장군 정관읍 예림리 509 (정관읍 예림리, 에코펠리시아)장기일반민간임대주택(8년)아파트49.86356
26등록영업중일반형임대사업자민간건설임대주택경상남도 통영시 무전동 279-27 다우드림캐슬단기임대도시형생활주택43.513644
90등록영업중임대사업자민간매입임대주택장기일반민간임대주택(8년)아파트18.103739
103등록영업중임대사업자민간매입임대주택경상북도 경주시 석장동 867-1 뉴갤럭시 오피스텔장기일반민간임대주택(10년)준주택(오피스텔)24.035
74등록영업중임대사업자민간건설임대주택장기일반민간임대주택(8년)아파트39.75432
48등록영업중일반형임대사업자민간매입임대주택부산광역시 기장군 기장읍 대라리 91-6장기일반민간임대주택(8년)준주택(오피스텔)26.66528
187등록영업중임대사업자민간매입임대주택부산광역시 부산진구 가야동 644-25 마이크로하우스장기일반민간임대주택(10년)다세대주택13.628
25등록영업중일반형임대사업자민간건설임대주택경상남도 통영시 무전동 279-27 다우드림캐슬단기임대도시형생활주택40.393626
73등록영업중임대사업자민간건설임대주택장기일반민간임대주택(10년)준주택(오피스텔)28.769224
152등록영업중임대사업자민간매입임대주택부산광역시 기장군 장안읍 257-37장기일반민간임대주택(8년)준주택(오피스텔)42.32122