Overview

Dataset statistics

Number of variables6
Number of observations2879
Missing cells14
Missing cells (%)0.1%
Duplicate rows383
Duplicate rows (%)13.3%
Total size in memory137.9 KiB
Average record size in memory49.0 B

Variable types

Text2
Categorical3
Numeric1

Dataset

Description부산광역시 사상구 관내 임대사업자 현황에 대한 자료임(사상구에 주소를 두고 있는 임대사업자가 등록한 전국에 있는 임대물건 자료이며 사업자구분, 등록번호, 임대물건소재지, 유형 현황 등)
URLhttps://www.data.go.kr/data/3078759/fileData.do

Alerts

Dataset has 383 (13.3%) duplicate rowsDuplicates
임대주택구분 is highly imbalanced (50.3%)Imbalance

Reproduction

Analysis started2023-12-12 14:10:56.087781
Analysis finished2023-12-12 14:10:56.871169
Duration0.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct428
Distinct (%)14.9%
Missing0
Missing (%)0.0%
Memory size22.6 KiB
2023-12-12T23:10:57.078469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length18.207364
Min length16

Characters and Unicode

Total characters52419
Distinct characters18
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique251 ?
Unique (%)8.7%

Sample

1st row2010-사상구-임대사업자-10157
2nd row2011-사상구-임대사업자-10166
3rd row2011-사상구-임대사업자-10166
4th row2011-사상구-임대사업자-10166
5th row2011-사상구-임대사업자-10166
ValueCountFrequency (%)
2020-사상구-임대사업자-265 252
 
8.8%
2021-사상구-임대사업자-110 144
 
5.0%
2022-사상구-임대사업자-85 80
 
2.8%
2023-사상구-임대사업자-37 79
 
2.7%
2019-사상구-임대사업자-103 78
 
2.7%
2018-사상구-임대사업자-10880 77
 
2.7%
2016-사상구-임대사업자-10556 65
 
2.3%
2021-사상구-임대사업자-140 57
 
2.0%
2023-사상구-임대사업자-27 55
 
1.9%
2018-사상구-임대사업자-11074 50
 
1.7%
Other values (418) 1942
67.5%
2023-12-12T23:10:57.686542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 8637
16.5%
2 6195
11.8%
5758
11.0%
0 5013
9.6%
1 3886
7.4%
2879
 
5.5%
2879
 
5.5%
2879
 
5.5%
2879
 
5.5%
2879
 
5.5%
Other values (8) 8535
16.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 23032
43.9%
Decimal Number 20750
39.6%
Dash Punctuation 8637
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 6195
29.9%
0 5013
24.2%
1 3886
18.7%
8 1066
 
5.1%
3 983
 
4.7%
6 950
 
4.6%
5 882
 
4.3%
7 654
 
3.2%
4 583
 
2.8%
9 538
 
2.6%
Other Letter
ValueCountFrequency (%)
5758
25.0%
2879
12.5%
2879
12.5%
2879
12.5%
2879
12.5%
2879
12.5%
2879
12.5%
Dash Punctuation
ValueCountFrequency (%)
- 8637
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 29387
56.1%
Hangul 23032
43.9%

Most frequent character per script

Common
ValueCountFrequency (%)
- 8637
29.4%
2 6195
21.1%
0 5013
17.1%
1 3886
13.2%
8 1066
 
3.6%
3 983
 
3.3%
6 950
 
3.2%
5 882
 
3.0%
7 654
 
2.2%
4 583
 
2.0%
Hangul
ValueCountFrequency (%)
5758
25.0%
2879
12.5%
2879
12.5%
2879
12.5%
2879
12.5%
2879
12.5%
2879
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29387
56.1%
Hangul 23032
43.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 8637
29.4%
2 6195
21.1%
0 5013
17.1%
1 3886
13.2%
8 1066
 
3.6%
3 983
 
3.3%
6 950
 
3.2%
5 882
 
3.0%
7 654
 
2.2%
4 583
 
2.0%
Hangul
ValueCountFrequency (%)
5758
25.0%
2879
12.5%
2879
12.5%
2879
12.5%
2879
12.5%
2879
12.5%
2879
12.5%

사업자구분
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.6 KiB
임대사업자
2158 
일반형임대사업자
508 
매입임대사업자
 
107
허가건설임대사업자
 
106

Length

Max length9
Median length5
Mean length5.7509552
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row매입임대사업자
2nd row허가건설임대사업자
3rd row허가건설임대사업자
4th row허가건설임대사업자
5th row허가건설임대사업자

Common Values

ValueCountFrequency (%)
임대사업자 2158
75.0%
일반형임대사업자 508
 
17.6%
매입임대사업자 107
 
3.7%
허가건설임대사업자 106
 
3.7%

Length

2023-12-12T23:10:57.916165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:10:58.116226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
임대사업자 2158
75.0%
일반형임대사업자 508
 
17.6%
매입임대사업자 107
 
3.7%
허가건설임대사업자 106
 
3.7%

임대주택구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size22.6 KiB
민간매입임대주택
2232 
민간건설임대주택
640 
<NA>
 
7

Length

Max length8
Median length8
Mean length7.9902744
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row민간건설임대주택
3rd row민간건설임대주택
4th row민간건설임대주택
5th row민간건설임대주택

Common Values

ValueCountFrequency (%)
민간매입임대주택 2232
77.5%
민간건설임대주택 640
 
22.2%
<NA> 7
 
0.2%

Length

2023-12-12T23:10:58.292290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:10:58.454793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민간매입임대주택 2232
77.5%
민간건설임대주택 640
 
22.2%
na 7
 
0.2%
Distinct338
Distinct (%)11.8%
Missing7
Missing (%)0.2%
Memory size22.6 KiB
2023-12-12T23:10:58.857068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length45
Mean length25.456476
Min length1

Characters and Unicode

Total characters73111
Distinct characters357
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique169 ?
Unique (%)5.9%

Sample

1st row부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)
2nd row부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)
3rd row부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)
4th row부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)
5th row부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)
ValueCountFrequency (%)
부산광역시 2191
 
16.3%
사상구 1081
 
8.0%
괘법동 590
 
4.4%
부산진구 322
 
2.4%
주례동 306
 
2.3%
13 275
 
2.0%
사상로243번길 252
 
1.9%
냉정로 171
 
1.3%
광장로104번길 159
 
1.2%
사하구 148
 
1.1%
Other values (884) 7951
59.1%
2023-12-12T23:10:59.460094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11541
 
15.8%
2810
 
3.8%
2681
 
3.7%
2653
 
3.6%
2552
 
3.5%
2434
 
3.3%
2421
 
3.3%
2404
 
3.3%
( 2364
 
3.2%
) 2364
 
3.2%
Other values (347) 38887
53.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43924
60.1%
Space Separator 11541
 
15.8%
Decimal Number 11005
 
15.1%
Open Punctuation 2364
 
3.2%
Close Punctuation 2364
 
3.2%
Other Punctuation 1011
 
1.4%
Dash Punctuation 761
 
1.0%
Lowercase Letter 80
 
0.1%
Uppercase Letter 61
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2810
 
6.4%
2681
 
6.1%
2653
 
6.0%
2552
 
5.8%
2434
 
5.5%
2421
 
5.5%
2404
 
5.5%
2246
 
5.1%
1754
 
4.0%
1736
 
4.0%
Other values (310) 20233
46.1%
Uppercase Letter
ValueCountFrequency (%)
A 26
42.6%
M 15
24.6%
W 4
 
6.6%
C 3
 
4.9%
I 3
 
4.9%
S 2
 
3.3%
F 2
 
3.3%
L 1
 
1.6%
K 1
 
1.6%
B 1
 
1.6%
Other values (3) 3
 
4.9%
Decimal Number
ValueCountFrequency (%)
1 2184
19.8%
3 1675
15.2%
2 1618
14.7%
4 1163
10.6%
0 872
 
7.9%
5 739
 
6.7%
8 728
 
6.6%
6 709
 
6.4%
7 661
 
6.0%
9 656
 
6.0%
Lowercase Letter
ValueCountFrequency (%)
l 30
37.5%
i 15
18.8%
v 15
18.8%
y 15
18.8%
e 2
 
2.5%
k 1
 
1.2%
t 1
 
1.2%
x 1
 
1.2%
Other Punctuation
ValueCountFrequency (%)
, 1010
99.9%
@ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
11541
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2364
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2364
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 761
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43924
60.1%
Common 29046
39.7%
Latin 141
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2810
 
6.4%
2681
 
6.1%
2653
 
6.0%
2552
 
5.8%
2434
 
5.5%
2421
 
5.5%
2404
 
5.5%
2246
 
5.1%
1754
 
4.0%
1736
 
4.0%
Other values (310) 20233
46.1%
Latin
ValueCountFrequency (%)
l 30
21.3%
A 26
18.4%
i 15
10.6%
v 15
10.6%
y 15
10.6%
M 15
10.6%
W 4
 
2.8%
C 3
 
2.1%
I 3
 
2.1%
S 2
 
1.4%
Other values (11) 13
9.2%
Common
ValueCountFrequency (%)
11541
39.7%
( 2364
 
8.1%
) 2364
 
8.1%
1 2184
 
7.5%
3 1675
 
5.8%
2 1618
 
5.6%
4 1163
 
4.0%
, 1010
 
3.5%
0 872
 
3.0%
- 761
 
2.6%
Other values (6) 3494
 
12.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43924
60.1%
ASCII 29187
39.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11541
39.5%
( 2364
 
8.1%
) 2364
 
8.1%
1 2184
 
7.5%
3 1675
 
5.7%
2 1618
 
5.5%
4 1163
 
4.0%
, 1010
 
3.5%
0 872
 
3.0%
- 761
 
2.6%
Other values (27) 3635
 
12.5%
Hangul
ValueCountFrequency (%)
2810
 
6.4%
2681
 
6.1%
2653
 
6.0%
2552
 
5.8%
2434
 
5.5%
2421
 
5.5%
2404
 
5.5%
2246
 
5.1%
1754
 
4.0%
1736
 
4.0%
Other values (310) 20233
46.1%

유형
Categorical

Distinct9
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size22.6 KiB
준주택(오피스텔)
1518 
다세대주택
814 
아파트
242 
다가구주택
154 
도시형생활주택
 
73
Other values (4)
 
78

Length

Max length12
Median length9
Mean length6.9923585
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row다세대주택
3rd row다세대주택
4th row다세대주택
5th row다세대주택

Common Values

ValueCountFrequency (%)
준주택(오피스텔) 1518
52.7%
다세대주택 814
28.3%
아파트 242
 
8.4%
다가구주택 154
 
5.3%
도시형생활주택 73
 
2.5%
단독주택 57
 
2.0%
아파트(도시형생활주택) 10
 
0.3%
<NA> 7
 
0.2%
연립주택 4
 
0.1%

Length

2023-12-12T23:10:59.629798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:10:59.751727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
준주택(오피스텔 1518
52.7%
다세대주택 814
28.3%
아파트 242
 
8.4%
다가구주택 154
 
5.3%
도시형생활주택 73
 
2.5%
단독주택 57
 
2.0%
아파트(도시형생활주택 10
 
0.3%
na 7
 
0.2%
연립주택 4
 
0.1%

면적
Real number (ℝ)

Distinct741
Distinct (%)25.8%
Missing7
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean28.004766
Minimum3.8296
Maximum295.71
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size25.4 KiB
2023-12-12T23:10:59.912304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.8296
5-th percentile14.89
Q119.51
median23.3767
Q329.4163
95-th percentile49.929
Maximum295.71
Range291.8804
Interquartile range (IQR)9.9063

Descriptive statistics

Standard deviation18.525696
Coefficient of variation (CV)0.6615194
Kurtosis52.50157
Mean28.004766
Median Absolute Deviation (MAD)4.9367
Skewness5.9180273
Sum80429.689
Variance343.20142
MonotonicityNot monotonic
2023-12-12T23:11:00.047232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
39.8763 80
 
2.8%
28.61 54
 
1.9%
39.7243 50
 
1.7%
22.27 39
 
1.4%
23.2 38
 
1.3%
24.09 36
 
1.3%
27.95 36
 
1.3%
44.1 31
 
1.1%
18.1 29
 
1.0%
27.1094 27
 
0.9%
Other values (731) 2452
85.2%
ValueCountFrequency (%)
3.8296 1
 
< 0.1%
12.0 10
0.3%
12.08 2
 
0.1%
12.17 3
 
0.1%
12.24 3
 
0.1%
12.35 2
 
0.1%
12.44 4
 
0.1%
12.48 1
 
< 0.1%
12.49 4
 
0.1%
12.66 8
0.3%
ValueCountFrequency (%)
295.71 1
 
< 0.1%
254.76 1
 
< 0.1%
186.46 8
0.3%
184.8 8
0.3%
153.44 1
 
< 0.1%
138.69 1
 
< 0.1%
124.2555 1
 
< 0.1%
106.79 1
 
< 0.1%
106.07 1
 
< 0.1%
102.7 2
 
0.1%

Interactions

2023-12-12T23:10:56.486814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:11:00.154161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자구분임대주택구분유형면적
사업자구분1.0000.5670.6100.115
임대주택구분0.5671.0000.3550.103
유형0.6100.3551.0000.493
면적0.1150.1030.4931.000
2023-12-12T23:11:00.237268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업자구분임대주택구분유형
사업자구분1.0000.3890.309
임대주택구분0.3891.0000.266
유형0.3090.2661.000
2023-12-12T23:11:00.322028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적사업자구분임대주택구분유형
면적1.0000.0740.1020.268
사업자구분0.0741.0000.3890.309
임대주택구분0.1020.3891.0000.266
유형0.2680.3090.2661.000

Missing values

2023-12-12T23:10:56.607769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:10:56.711730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T23:10:56.806767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

등록번호사업자구분임대주택구분도로명주소유형면적
02010-사상구-임대사업자-10157매입임대사업자<NA><NA><NA><NA>
12011-사상구-임대사업자-10166허가건설임대사업자민간건설임대주택부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)다세대주택17.42
22011-사상구-임대사업자-10166허가건설임대사업자민간건설임대주택부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)다세대주택20.0
32011-사상구-임대사업자-10166허가건설임대사업자민간건설임대주택부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)다세대주택17.66
42011-사상구-임대사업자-10166허가건설임대사업자민간건설임대주택부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)다세대주택13.28
52011-사상구-임대사업자-10166허가건설임대사업자민간건설임대주택부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)다세대주택13.56
62011-사상구-임대사업자-10166허가건설임대사업자민간건설임대주택부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)다세대주택23.74
72011-사상구-임대사업자-10166허가건설임대사업자민간건설임대주택부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)다세대주택20.0
82011-사상구-임대사업자-10166허가건설임대사업자민간건설임대주택부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)다세대주택18.54
92011-사상구-임대사업자-10166허가건설임대사업자민간건설임대주택부산광역시 금정구 수림로81번길 31 (장전동, 풀하우스)다세대주택20.14
등록번호사업자구분임대주택구분도로명주소유형면적
28692023-사상구-임대사업자-43임대사업자민간매입임대주택부산광역시 수영구 무학로21번길 17 (광안동, 광안하늘채)다세대주택22.79
28702023-사상구-임대사업자-43임대사업자민간매입임대주택부산광역시 수영구 무학로21번길 17 (광안동, 광안하늘채)다세대주택22.15
28712023-사상구-임대사업자-43임대사업자민간매입임대주택부산광역시 수영구 무학로21번길 17 (광안동, 광안하늘채)다세대주택22.79
28722023-사상구-임대사업자-43임대사업자민간매입임대주택부산광역시 수영구 무학로21번길 17 (광안동, 광안하늘채)다세대주택22.79
28732023-사상구-임대사업자-43임대사업자민간매입임대주택부산광역시 수영구 무학로21번길 17 (광안동, 광안하늘채)다세대주택23.11
28742023-사상구-임대사업자-43임대사업자민간매입임대주택부산광역시 수영구 무학로21번길 17 (광안동, 광안하늘채)다세대주택18.04
28752023-사상구-임대사업자-43임대사업자민간매입임대주택부산광역시 수영구 무학로21번길 17 (광안동, 광안하늘채)다세대주택22.79
28762023-사상구-임대사업자-43임대사업자민간매입임대주택부산광역시 수영구 무학로21번길 17 (광안동, 광안하늘채)다세대주택18.46
28772023-사상구-임대사업자-43임대사업자민간매입임대주택부산광역시 수영구 무학로21번길 17 (광안동, 광안하늘채)다세대주택23.11
28782023-사상구-임대사업자-43임대사업자민간매입임대주택부산광역시 수영구 무학로21번길 17 (광안동, 광안하늘채)준주택(오피스텔)25.5

Duplicate rows

Most frequently occurring

등록번호사업자구분임대주택구분도로명주소유형면적# duplicates
3422022-사상구-임대사업자-85임대사업자민간건설임대주택아파트39.876380
1892020-사상구-임대사업자-265임대사업자민간매입임대주택부산광역시 사상구 사상로243번길 13 (괘법동)준주택(오피스텔)28.6154
1262018-사상구-임대사업자-11074임대사업자민간건설임대주택아파트39.724350
1022018-사상구-임대사업자-10880일반형임대사업자민간매입임대주택부산광역시 사상구 광장로104번길 25-3 (괘법동, 대동레미안 주현빌)준주택(오피스텔)22.2739
1822020-사상구-임대사업자-265임대사업자민간매입임대주택부산광역시 사상구 사상로243번길 13 (괘법동)준주택(오피스텔)23.236
1852020-사상구-임대사업자-265임대사업자민간매입임대주택부산광역시 사상구 사상로243번길 13 (괘법동)준주택(오피스텔)24.0936
1862020-사상구-임대사업자-265임대사업자민간매입임대주택부산광역시 사상구 사상로243번길 13 (괘법동)준주택(오피스텔)27.9536
3722023-사상구-임대사업자-37임대사업자민간건설임대주택준주택(오피스텔)27.109427
2952022-사상구-임대사업자-39임대사업자민간매입임대주택부산광역시 사하구 낙동대로 505 (하단동)준주택(오피스텔)20.8225
2962022-사상구-임대사업자-39임대사업자민간매입임대주택부산광역시 사하구 낙동대로 505 (하단동)준주택(오피스텔)26.6620