Overview

Dataset statistics

Number of variables17
Number of observations111
Missing cells357
Missing cells (%)18.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.3 KiB
Average record size in memory141.2 B

Variable types

Numeric4
Categorical4
Text4
DateTime5

Dataset

Description부산광역시_지역주택조합현황_20231231
Author부산광역시
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15114403

Alerts

담당기관 전화번호 is highly overall correlated with 연번 and 3 other fieldsHigh correlation
소재지 is highly overall correlated with 연번 and 3 other fieldsHigh correlation
담당기관 is highly overall correlated with 연번 and 3 other fieldsHigh correlation
연번 is highly overall correlated with 소재지 and 3 other fieldsHigh correlation
대지면적(제곱미터) is highly overall correlated with 연면적(제곱미터) and 1 other fieldsHigh correlation
연면적(제곱미터) is highly overall correlated with 대지면적(제곱미터) and 1 other fieldsHigh correlation
총세대수 is highly overall correlated with 대지면적(제곱미터) and 1 other fieldsHigh correlation
비고 is highly overall correlated with 연번 and 3 other fieldsHigh correlation
비고 is highly imbalanced (75.7%)Imbalance
조합사무실 주소 has 9 (8.1%) missing valuesMissing
조합원수 has 3 (2.7%) missing valuesMissing
조합원모집신고일 has 33 (29.7%) missing valuesMissing
조합설립인가일 has 58 (52.3%) missing valuesMissing
사업계획승인일 has 77 (69.4%) missing valuesMissing
착공신고일 has 86 (77.5%) missing valuesMissing
사용검사일(예정일) has 90 (81.1%) missing valuesMissing
연번 has unique valuesUnique
조합명 has unique valuesUnique
사업예정지 has unique valuesUnique
대지면적(제곱미터) has unique valuesUnique

Reproduction

Analysis started2024-03-13 13:18:00.510628
Analysis finished2024-03-13 13:18:03.394294
Duration2.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct111
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean56
Minimum1
Maximum111
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-03-13T22:18:03.469590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.5
Q128.5
median56
Q383.5
95-th percentile105.5
Maximum111
Range110
Interquartile range (IQR)55

Descriptive statistics

Standard deviation32.186954
Coefficient of variation (CV)0.57476703
Kurtosis-1.2
Mean56
Median Absolute Deviation (MAD)28
Skewness0
Sum6216
Variance1036
MonotonicityStrictly increasing
2024-03-13T22:18:03.599441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
2 1
 
0.9%
83 1
 
0.9%
82 1
 
0.9%
81 1
 
0.9%
80 1
 
0.9%
79 1
 
0.9%
78 1
 
0.9%
77 1
 
0.9%
76 1
 
0.9%
Other values (101) 101
91.0%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
111 1
0.9%
110 1
0.9%
109 1
0.9%
108 1
0.9%
107 1
0.9%
106 1
0.9%
105 1
0.9%
104 1
0.9%
103 1
0.9%
102 1
0.9%

소재지
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)10.8%
Missing0
Missing (%)0.0%
Memory size1020.0 B
부산광역시 부산진구
17 
부산광역시 동래구
14 
부산광역시 남구
13 
부산광역시 사하구
12 
부산광역시 연제구
10 
Other values (7)
45 

Length

Max length10
Median length9
Mean length8.9099099
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시 금정구
2nd row부산광역시 금정구
3rd row부산광역시 금정구
4th row부산광역시 금정구
5th row부산광역시 금정구

Common Values

ValueCountFrequency (%)
부산광역시 부산진구 17
15.3%
부산광역시 동래구 14
12.6%
부산광역시 남구 13
11.7%
부산광역시 사하구 12
10.8%
부산광역시 연제구 10
9.0%
부산광역시 서구 9
8.1%
부산광역시 금정구 8
7.2%
부산광역시 북구 8
7.2%
부산광역시 사상구 8
7.2%
부산광역시 수영구 5
 
4.5%
Other values (2) 7
6.3%

Length

2024-03-13T22:18:03.747097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
부산광역시 111
50.0%
부산진구 17
 
7.7%
동래구 14
 
6.3%
남구 13
 
5.9%
사하구 12
 
5.4%
연제구 10
 
4.5%
서구 9
 
4.1%
금정구 8
 
3.6%
북구 8
 
3.6%
사상구 8
 
3.6%
Other values (3) 12
 
5.4%

조합명
Text

UNIQUE 

Distinct111
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1020.0 B
2024-03-13T22:18:04.057246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length19
Mean length12.945946
Min length8

Characters and Unicode

Total characters1437
Distinct characters156
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)100.0%

Sample

1st row장전역 서희스타힐스 지역주택조합
2nd row금정더샵지역주택조합
3rd row휴먼파크장전지역주택조합
4th row남산역지역주택조합
5th row리버파크장전지역주택조합
ValueCountFrequency (%)
지역주택조합 28
 
18.4%
가칭 6
 
3.9%
추진위원회 5
 
3.3%
장전역 1
 
0.7%
가칭)송도오션파크지역주택조합 1
 
0.7%
부산송도지역주택조합 1
 
0.7%
암남지역주택조합 1
 
0.7%
괴정오작로지역주택조합 1
 
0.7%
가칭)감천3지역주택조합 1
 
0.7%
가칭)괴정동 1
 
0.7%
Other values (106) 106
69.7%
2024-03-13T22:18:04.490351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
122
 
8.5%
115
 
8.0%
111
 
7.7%
111
 
7.7%
111
 
7.7%
111
 
7.7%
62
 
4.3%
57
 
4.0%
) 57
 
4.0%
( 57
 
4.0%
Other values (146) 523
36.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1262
87.8%
Close Punctuation 57
 
4.0%
Open Punctuation 57
 
4.0%
Space Separator 42
 
2.9%
Decimal Number 17
 
1.2%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
122
 
9.7%
115
 
9.1%
111
 
8.8%
111
 
8.8%
111
 
8.8%
111
 
8.8%
62
 
4.9%
57
 
4.5%
30
 
2.4%
19
 
1.5%
Other values (134) 413
32.7%
Decimal Number
ValueCountFrequency (%)
1 5
29.4%
2 4
23.5%
3 3
17.6%
7 2
 
11.8%
8 1
 
5.9%
6 1
 
5.9%
5 1
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
K 1
50.0%
S 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Open Punctuation
ValueCountFrequency (%)
( 57
100.0%
Space Separator
ValueCountFrequency (%)
42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1262
87.8%
Common 173
 
12.0%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
122
 
9.7%
115
 
9.1%
111
 
8.8%
111
 
8.8%
111
 
8.8%
111
 
8.8%
62
 
4.9%
57
 
4.5%
30
 
2.4%
19
 
1.5%
Other values (134) 413
32.7%
Common
ValueCountFrequency (%)
) 57
32.9%
( 57
32.9%
42
24.3%
1 5
 
2.9%
2 4
 
2.3%
3 3
 
1.7%
7 2
 
1.2%
8 1
 
0.6%
6 1
 
0.6%
5 1
 
0.6%
Latin
ValueCountFrequency (%)
K 1
50.0%
S 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1262
87.8%
ASCII 175
 
12.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
122
 
9.7%
115
 
9.1%
111
 
8.8%
111
 
8.8%
111
 
8.8%
111
 
8.8%
62
 
4.9%
57
 
4.5%
30
 
2.4%
19
 
1.5%
Other values (134) 413
32.7%
ASCII
ValueCountFrequency (%)
) 57
32.6%
( 57
32.6%
42
24.0%
1 5
 
2.9%
2 4
 
2.3%
3 3
 
1.7%
7 2
 
1.1%
8 1
 
0.6%
6 1
 
0.6%
5 1
 
0.6%
Other values (2) 2
 
1.1%
Distinct102
Distinct (%)100.0%
Missing9
Missing (%)8.1%
Memory size1020.0 B
2024-03-13T22:18:04.816881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length35
Mean length25.745098
Min length16

Characters and Unicode

Total characters2626
Distinct characters177
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique102 ?
Unique (%)100.0%

Sample

1st row부산광역시 금정구 기찰로 59(부곡동)
2nd row부산광역시 동래구 시실로 14, 2층(명륜동)
3rd row부산광역시 금정구 중앙대로 1989, 3층(남산동)
4th row부산광역시 동래구 충렬대로 137번길 4, 901호(온천동, 상현빌딩)
5th row부산광역시 금정구 중앙대로1719번길 47, 1층(부곡동)
ValueCountFrequency (%)
부산광역시 102
 
19.7%
2층 17
 
3.3%
부산진구 14
 
2.7%
남구 12
 
2.3%
사하구 11
 
2.1%
3층 10
 
1.9%
동래구 10
 
1.9%
연제구 10
 
1.9%
북구 8
 
1.5%
사상구 8
 
1.5%
Other values (246) 316
61.0%
2024-03-13T22:18:05.269548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
417
 
15.9%
126
 
4.8%
122
 
4.6%
107
 
4.1%
107
 
4.1%
107
 
4.1%
102
 
3.9%
100
 
3.8%
1 89
 
3.4%
, 82
 
3.1%
Other values (167) 1267
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1550
59.0%
Decimal Number 478
 
18.2%
Space Separator 417
 
15.9%
Other Punctuation 82
 
3.1%
Open Punctuation 43
 
1.6%
Close Punctuation 43
 
1.6%
Dash Punctuation 11
 
0.4%
Lowercase Letter 1
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
126
 
8.1%
122
 
7.9%
107
 
6.9%
107
 
6.9%
107
 
6.9%
102
 
6.6%
100
 
6.5%
69
 
4.5%
59
 
3.8%
40
 
2.6%
Other values (150) 611
39.4%
Decimal Number
ValueCountFrequency (%)
1 89
18.6%
2 77
16.1%
3 66
13.8%
0 47
9.8%
4 41
8.6%
5 40
8.4%
7 34
 
7.1%
6 32
 
6.7%
8 26
 
5.4%
9 26
 
5.4%
Space Separator
ValueCountFrequency (%)
417
100.0%
Other Punctuation
ValueCountFrequency (%)
, 82
100.0%
Open Punctuation
ValueCountFrequency (%)
( 43
100.0%
Close Punctuation
ValueCountFrequency (%)
) 43
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Lowercase Letter
ValueCountFrequency (%)
a 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1550
59.0%
Common 1074
40.9%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
126
 
8.1%
122
 
7.9%
107
 
6.9%
107
 
6.9%
107
 
6.9%
102
 
6.6%
100
 
6.5%
69
 
4.5%
59
 
3.8%
40
 
2.6%
Other values (150) 611
39.4%
Common
ValueCountFrequency (%)
417
38.8%
1 89
 
8.3%
, 82
 
7.6%
2 77
 
7.2%
3 66
 
6.1%
0 47
 
4.4%
( 43
 
4.0%
) 43
 
4.0%
4 41
 
3.8%
5 40
 
3.7%
Other values (5) 129
 
12.0%
Latin
ValueCountFrequency (%)
a 1
50.0%
A 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1550
59.0%
ASCII 1076
41.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
417
38.8%
1 89
 
8.3%
, 82
 
7.6%
2 77
 
7.2%
3 66
 
6.1%
0 47
 
4.4%
( 43
 
4.0%
) 43
 
4.0%
4 41
 
3.8%
5 40
 
3.7%
Other values (7) 131
 
12.2%
Hangul
ValueCountFrequency (%)
126
 
8.1%
122
 
7.9%
107
 
6.9%
107
 
6.9%
107
 
6.9%
102
 
6.6%
100
 
6.5%
69
 
4.5%
59
 
3.8%
40
 
2.6%
Other values (150) 611
39.4%

조합원수
Text

MISSING 

Distinct51
Distinct (%)47.2%
Missing3
Missing (%)2.7%
Memory size1020.0 B
2024-03-13T22:18:05.465724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.0092593
Min length3

Characters and Unicode

Total characters325
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)43.5%

Sample

1st row252
2nd row686
3rd row484
4th row382
5th row268
ValueCountFrequency (%)
모집중 55
50.9%
401 2
 
1.9%
314 2
 
1.9%
740 2
 
1.9%
232 1
 
0.9%
266 1
 
0.9%
252 1
 
0.9%
811 1
 
0.9%
540 1
 
0.9%
263 1
 
0.9%
Other values (41) 41
38.0%
2024-03-13T22:18:05.771176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
55
16.9%
55
16.9%
55
16.9%
3 26
8.0%
1 22
 
6.8%
2 20
 
6.2%
4 18
 
5.5%
8 16
 
4.9%
0 15
 
4.6%
6 15
 
4.6%
Other values (3) 28
8.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 165
50.8%
Decimal Number 160
49.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 26
16.2%
1 22
13.8%
2 20
12.5%
4 18
11.2%
8 16
10.0%
0 15
9.4%
6 15
9.4%
5 12
7.5%
7 11
6.9%
9 5
 
3.1%
Other Letter
ValueCountFrequency (%)
55
33.3%
55
33.3%
55
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 165
50.8%
Common 160
49.2%

Most frequent character per script

Common
ValueCountFrequency (%)
3 26
16.2%
1 22
13.8%
2 20
12.5%
4 18
11.2%
8 16
10.0%
0 15
9.4%
6 15
9.4%
5 12
7.5%
7 11
6.9%
9 5
 
3.1%
Hangul
ValueCountFrequency (%)
55
33.3%
55
33.3%
55
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 165
50.8%
ASCII 160
49.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
55
33.3%
55
33.3%
55
33.3%
ASCII
ValueCountFrequency (%)
3 26
16.2%
1 22
13.8%
2 20
12.5%
4 18
11.2%
8 16
10.0%
0 15
9.4%
6 15
9.4%
5 12
7.5%
7 11
6.9%
9 5
 
3.1%

사업예정지
Text

UNIQUE 

Distinct111
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1020.0 B
2024-03-13T22:18:06.234908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length23
Mean length19.297297
Min length16

Characters and Unicode

Total characters2142
Distinct characters87
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)100.0%

Sample

1st row부산광역시 금정구 부곡동 974
2nd row부산광역시 금정구 부곡동 200-1
3rd row부산광역시 금정구 장전동 618-1
4th row부산광역시 금정구 남산동 51-4
5th row부산광역시 금정구 장전동 610-26
ValueCountFrequency (%)
부산광역시 111
24.4%
부산진구 17
 
3.7%
동래구 14
 
3.1%
남구 13
 
2.9%
사하구 12
 
2.6%
연제구 10
 
2.2%
문현동 9
 
2.0%
일원 9
 
2.0%
서구 9
 
2.0%
연산동 8
 
1.8%
Other values (162) 242
53.3%
2024-03-13T22:18:06.782918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
344
16.1%
139
 
6.5%
135
 
6.3%
126
 
5.9%
119
 
5.6%
113
 
5.3%
111
 
5.2%
111
 
5.2%
1 98
 
4.6%
- 89
 
4.2%
Other values (77) 757
35.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1235
57.7%
Decimal Number 473
 
22.1%
Space Separator 344
 
16.1%
Dash Punctuation 89
 
4.2%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
139
11.3%
135
10.9%
126
 
10.2%
119
 
9.6%
113
 
9.1%
111
 
9.0%
111
 
9.0%
23
 
1.9%
20
 
1.6%
18
 
1.5%
Other values (64) 320
25.9%
Decimal Number
ValueCountFrequency (%)
1 98
20.7%
2 63
13.3%
5 50
10.6%
3 49
10.4%
4 43
9.1%
7 42
8.9%
8 38
 
8.0%
0 34
 
7.2%
9 29
 
6.1%
6 27
 
5.7%
Space Separator
ValueCountFrequency (%)
344
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 89
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1235
57.7%
Common 907
42.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
139
11.3%
135
10.9%
126
 
10.2%
119
 
9.6%
113
 
9.1%
111
 
9.0%
111
 
9.0%
23
 
1.9%
20
 
1.6%
18
 
1.5%
Other values (64) 320
25.9%
Common
ValueCountFrequency (%)
344
37.9%
1 98
 
10.8%
- 89
 
9.8%
2 63
 
6.9%
5 50
 
5.5%
3 49
 
5.4%
4 43
 
4.7%
7 42
 
4.6%
8 38
 
4.2%
0 34
 
3.7%
Other values (3) 57
 
6.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1235
57.7%
ASCII 907
42.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
344
37.9%
1 98
 
10.8%
- 89
 
9.8%
2 63
 
6.9%
5 50
 
5.5%
3 49
 
5.4%
4 43
 
4.7%
7 42
 
4.6%
8 38
 
4.2%
0 34
 
3.7%
Other values (3) 57
 
6.3%
Hangul
ValueCountFrequency (%)
139
11.3%
135
10.9%
126
 
10.2%
119
 
9.6%
113
 
9.1%
111
 
9.0%
111
 
9.0%
23
 
1.9%
20
 
1.6%
18
 
1.5%
Other values (64) 320
25.9%

대지면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct111
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22783.616
Minimum4320
Maximum53556
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-03-13T22:18:07.263114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4320
5-th percentile5923
Q114947
median20832
Q328307
95-th percentile46075.5
Maximum53556
Range49236
Interquartile range (IQR)13360

Descriptive statistics

Standard deviation11721.738
Coefficient of variation (CV)0.51448102
Kurtosis0.012651243
Mean22783.616
Median Absolute Deviation (MAD)6362
Skewness0.73625153
Sum2528981.4
Variance1.3739915 × 108
MonotonicityNot monotonic
2024-03-13T22:18:07.418540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13063.0 1
 
0.9%
48508.6 1
 
0.9%
22433.0 1
 
0.9%
52798.0 1
 
0.9%
13127.0 1
 
0.9%
17732.0 1
 
0.9%
17889.0 1
 
0.9%
16793.0 1
 
0.9%
14987.0 1
 
0.9%
4320.0 1
 
0.9%
Other values (101) 101
91.0%
ValueCountFrequency (%)
4320.0 1
0.9%
4868.0 1
0.9%
5178.83 1
0.9%
5515.0 1
0.9%
5749.0 1
0.9%
5816.0 1
0.9%
6030.0 1
0.9%
6043.0 1
0.9%
6840.0 1
0.9%
7530.0 1
0.9%
ValueCountFrequency (%)
53556.0 1
0.9%
52798.0 1
0.9%
49994.0 1
0.9%
48508.6 1
0.9%
47802.0 1
0.9%
46376.0 1
0.9%
45775.0 1
0.9%
45123.0 1
0.9%
43269.0 1
0.9%
43090.0 1
0.9%

연면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct108
Distinct (%)98.2%
Missing1
Missing (%)0.9%
Infinite0
Infinite (%)0.0%
Mean85415.827
Minimum17659
Maximum201609
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-03-13T22:18:07.567942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum17659
5-th percentile35575.2
Q157269.75
median76555.5
Q3108330.25
95-th percentile153696.9
Maximum201609
Range183950
Interquartile range (IQR)51060.5

Descriptive statistics

Standard deviation38564.385
Coefficient of variation (CV)0.45148992
Kurtosis0.21275998
Mean85415.827
Median Absolute Deviation (MAD)23795
Skewness0.76765686
Sum9395741
Variance1.4872118 × 109
MonotonicityNot monotonic
2024-03-13T22:18:07.711466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
53769 2
 
1.8%
136286 2
 
1.8%
46265 1
 
0.9%
85432 1
 
0.9%
199579 1
 
0.9%
49682 1
 
0.9%
57783 1
 
0.9%
46609 1
 
0.9%
23045 1
 
0.9%
55677 1
 
0.9%
Other values (98) 98
88.3%
ValueCountFrequency (%)
17659 1
0.9%
21890 1
0.9%
23045 1
0.9%
27441 1
0.9%
27623 1
0.9%
33714 1
0.9%
37850 1
0.9%
42883 1
0.9%
45138 1
0.9%
45752 1
0.9%
ValueCountFrequency (%)
201609 1
0.9%
199579 1
0.9%
168977 1
0.9%
164200 1
0.9%
159705 1
0.9%
154011 1
0.9%
153313 1
0.9%
147396 1
0.9%
145638 1
0.9%
145613 1
0.9%

총세대수
Real number (ℝ)

HIGH CORRELATION 

Distinct99
Distinct (%)89.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean553.7027
Minimum134
Maximum1302
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-03-13T22:18:07.879451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum134
5-th percentile295.5
Q1368.5
median489
Q3696.5
95-th percentile996
Maximum1302
Range1168
Interquartile range (IQR)328

Descriptive statistics

Standard deviation248.35508
Coefficient of variation (CV)0.44853507
Kurtosis0.21127964
Mean553.7027
Median Absolute Deviation (MAD)142
Skewness0.87263749
Sum61461
Variance61680.247
MonotonicityNot monotonic
2024-03-13T22:18:08.018840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
324 2
 
1.8%
804 2
 
1.8%
316 2
 
1.8%
425 2
 
1.8%
482 2
 
1.8%
322 2
 
1.8%
522 2
 
1.8%
380 2
 
1.8%
690 2
 
1.8%
405 2
 
1.8%
Other values (89) 91
82.0%
ValueCountFrequency (%)
134 1
0.9%
150 1
0.9%
162 1
0.9%
219 1
0.9%
222 1
0.9%
295 1
0.9%
296 1
0.9%
300 1
0.9%
303 1
0.9%
305 1
0.9%
ValueCountFrequency (%)
1302 1
0.9%
1295 1
0.9%
1066 1
0.9%
1050 1
0.9%
999 1
0.9%
998 1
0.9%
994 1
0.9%
986 1
0.9%
975 1
0.9%
970 1
0.9%
Distinct76
Distinct (%)97.4%
Missing33
Missing (%)29.7%
Memory size1020.0 B
Minimum2017-06-01 00:00:00
Maximum2023-12-06 00:00:00
2024-03-13T22:18:08.168353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:08.378590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

조합설립인가일
Date

MISSING 

Distinct53
Distinct (%)100.0%
Missing58
Missing (%)52.3%
Memory size1020.0 B
Minimum2012-10-15 00:00:00
Maximum2023-03-14 00:00:00
2024-03-13T22:18:08.529804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:08.668767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업계획승인일
Date

MISSING 

Distinct34
Distinct (%)100.0%
Missing77
Missing (%)69.4%
Memory size1020.0 B
Minimum2013-07-22 00:00:00
Maximum2023-10-20 00:00:00
2024-03-13T22:18:08.803899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:08.940848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)

착공신고일
Date

MISSING 

Distinct25
Distinct (%)100.0%
Missing86
Missing (%)77.5%
Memory size1020.0 B
Minimum2013-11-01 00:00:00
Maximum2023-12-08 00:00:00
2024-03-13T22:18:09.067083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:09.175876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
Distinct21
Distinct (%)100.0%
Missing90
Missing (%)81.1%
Memory size1020.0 B
Minimum2017-03-29 00:00:00
Maximum2027-04-30 00:00:00
2024-03-13T22:18:09.303687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:09.427024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)

담당기관
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)9.9%
Missing0
Missing (%)0.0%
Memory size1020.0 B
부산진구 건축과
17 
동래구 건축과
14 
연제구 건축과
14 
남구 건축과
13 
사하구 건축과
12 
Other values (6)
41 

Length

Max length12
Median length7
Mean length7.018018
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row금정구 건축과
2nd row금정구 건축과
3rd row금정구 건축과
4th row금정구 건축과
5th row금정구 건축과

Common Values

ValueCountFrequency (%)
부산진구 건축과 17
15.3%
동래구 건축과 14
12.6%
연제구 건축과 14
12.6%
남구 건축과 13
11.7%
사하구 건축과 12
10.8%
서구 건축과 9
8.1%
금정구 건축과 8
7.2%
북구 건축과 8
7.2%
사상구 건축과 8
7.2%
수영구 건축과 5
 
4.5%

Length

2024-03-13T22:18:09.556315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
건축과 108
48.6%
부산진구 17
 
7.7%
동래구 14
 
6.3%
연제구 14
 
6.3%
남구 13
 
5.9%
사하구 12
 
5.4%
서구 9
 
4.1%
금정구 8
 
3.6%
북구 8
 
3.6%
사상구 8
 
3.6%
Other values (3) 11
 
5.0%

담당기관 전화번호
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)24.3%
Missing0
Missing (%)0.0%
Memory size1020.0 B
051-605-4605
10 
051-240-4582
051-550-4582
051-220-4602
051-607-4602
Other values (22)
68 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique3 ?
Unique (%)2.7%

Sample

1st row051-519-4602
2nd row051-519-4602
3rd row051-519-4605
4th row051-519-4605
5th row051-519-4605

Common Values

ValueCountFrequency (%)
051-605-4605 10
 
9.0%
051-240-4582 9
 
8.1%
051-550-4582 9
 
8.1%
051-220-4602 8
 
7.2%
051-607-4602 7
 
6.3%
051-605-4601 6
 
5.4%
051-310-4604 5
 
4.5%
051-665-4604 5
 
4.5%
051-665-4602 5
 
4.5%
051-550-4584 5
 
4.5%
Other values (17) 42
37.8%

Length

2024-03-13T22:18:09.671894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
051-605-4605 10
 
9.0%
051-550-4582 9
 
8.1%
051-240-4582 9
 
8.1%
051-220-4602 8
 
7.2%
051-607-4602 7
 
6.3%
051-605-4601 6
 
5.4%
051-310-4604 5
 
4.5%
051-665-4604 5
 
4.5%
051-665-4602 5
 
4.5%
051-550-4584 5
 
4.5%
Other values (17) 42
37.8%

비고
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size1020.0 B
<NA>
102 
조합해산
 
7
조합원 모집 취소
 
1
해산
 
1

Length

Max length9
Median length4
Mean length4.027027
Min length2

Unique

Unique2 ?
Unique (%)1.8%

Sample

1st row조합해산
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 102
91.9%
조합해산 7
 
6.3%
조합원 모집 취소 1
 
0.9%
해산 1
 
0.9%

Length

2024-03-13T22:18:09.790778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T22:18:09.912222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 102
90.3%
조합해산 7
 
6.2%
조합원 1
 
0.9%
모집 1
 
0.9%
취소 1
 
0.9%
해산 1
 
0.9%

Interactions

2024-03-13T22:18:02.422127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:01.268789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:01.647959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:02.029551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:02.517367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:01.340706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:01.737268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:02.135103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:02.612694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:01.423275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:01.822464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:02.234004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:02.719031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:01.536490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:01.945253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T22:18:02.328015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-13T22:18:09.996514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소재지조합원수대지면적(제곱미터)연면적(제곱미터)총세대수조합원모집신고일조합설립인가일사업계획승인일착공신고일사용검사일(예정일)담당기관담당기관 전화번호비고
연번1.0000.9510.5440.3410.2040.2361.0001.0001.0001.0001.0000.9490.9711.000
소재지0.9511.0000.5370.2930.2220.0001.0001.0001.0001.0001.0001.0001.0001.000
조합원수0.5440.5371.0000.6760.8430.8351.0001.0001.0001.0001.0000.0000.000NaN
대지면적(제곱미터)0.3410.2930.6761.0000.8780.7850.0001.0001.0001.0001.0000.3040.3270.491
연면적(제곱미터)0.2040.2220.8430.8781.0000.8890.8551.0001.0001.0001.0000.0950.0001.000
총세대수0.2360.0000.8350.7850.8891.0000.0001.0001.0001.0001.0000.0000.0000.000
조합원모집신고일1.0001.0001.0000.0000.8550.0001.0001.0001.0000.000NaN1.0001.0000.000
조합설립인가일1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000NaN
사업계획승인일1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000NaN
착공신고일1.0001.0001.0001.0001.0001.0000.0001.0001.0001.0001.0001.0001.000NaN
사용검사일(예정일)1.0001.0001.0001.0001.0001.000NaN1.0001.0001.0001.0001.0001.000NaN
담당기관0.9491.0000.0000.3040.0950.0001.0001.0001.0001.0001.0001.0001.0001.000
담당기관 전화번호0.9711.0000.0000.3270.0000.0001.0001.0001.0001.0001.0001.0001.0001.000
비고1.0001.000NaN0.4911.0000.0000.000NaNNaNNaNNaN1.0001.0001.000
2024-03-13T22:18:10.154137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비고담당기관 전화번호소재지담당기관
비고1.0000.5770.7070.707
담당기관 전화번호0.5771.0000.9210.917
소재지0.7070.9211.0000.995
담당기관0.7070.9170.9951.000
2024-03-13T22:18:10.251849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번대지면적(제곱미터)연면적(제곱미터)총세대수소재지담당기관담당기관 전화번호비고
연번1.0000.026-0.127-0.0750.7900.7820.7560.707
대지면적(제곱미터)0.0261.0000.6800.7800.1210.1290.1070.000
연면적(제곱미터)-0.1270.6801.0000.9450.0950.0430.0000.408
총세대수-0.0750.7800.9451.0000.0000.0000.0000.000
소재지0.7900.1210.0950.0001.0000.9950.9210.707
담당기관0.7820.1290.0430.0000.9951.0000.9170.707
담당기관 전화번호0.7560.1070.0000.0000.9210.9171.0000.577
비고0.7070.0000.4080.0000.7070.7070.5771.000

Missing values

2024-03-13T22:18:02.874358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T22:18:03.096709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-13T22:18:03.277561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번소재지조합명조합사무실 주소조합원수사업예정지대지면적(제곱미터)연면적(제곱미터)총세대수조합원모집신고일조합설립인가일사업계획승인일착공신고일사용검사일(예정일)담당기관담당기관 전화번호비고
01부산광역시 금정구장전역 서희스타힐스 지역주택조합<NA>252부산광역시 금정구 부곡동 97413063.046265324<NA>2013-10-082014-09-262015-03-312017-07-31금정구 건축과051-519-4602조합해산
12부산광역시 금정구금정더샵지역주택조합부산광역시 금정구 기찰로 59(부곡동)686부산광역시 금정구 부곡동 200-148508.6164200994<NA>2019-05-162022-04-292023-12-08<NA>금정구 건축과051-519-4602<NA>
23부산광역시 금정구휴먼파크장전지역주택조합부산광역시 동래구 시실로 14, 2층(명륜동)484부산광역시 금정구 장전동 618-110550.01339496692018-05-292019-05-312021-07-28<NA><NA>금정구 건축과051-519-4605<NA>
34부산광역시 금정구남산역지역주택조합부산광역시 금정구 중앙대로 1989, 3층(남산동)382부산광역시 금정구 남산동 51-421806.0740644912020-03-202022-05-30<NA><NA><NA>금정구 건축과051-519-4605<NA>
45부산광역시 금정구리버파크장전지역주택조합부산광역시 동래구 충렬대로 137번길 4, 901호(온천동, 상현빌딩)268부산광역시 금정구 장전동 610-265178.83789953962021-05-212023-03-14<NA><NA><NA>금정구 건축과051-519-4605<NA>
56부산광역시 금정구(가칭)두실지역주택조합부산광역시 금정구 중앙대로1719번길 47, 1층(부곡동)모집중부산광역시 금정구 구서동 15447802.01540119862020-01-22<NA><NA><NA><NA>금정구 건축과051-519-4601<NA>
67부산광역시 금정구(가칭)두실역금샘지역주택조합부산광역시 금정구 중앙대로1959번길 20, 2층(구서동)모집중부산광역시 금정구 구서동 164-727298.01008195402021-12-15<NA><NA><NA><NA>금정구 건축과051-519-4601<NA>
78부산광역시 금정구(가칭)서동지역주택조합부산광역시 금정구 반송로 356, 302호(서동)모집중부산광역시 금정구 서동 222-219522.0666664282022-06-02<NA><NA><NA><NA>금정구 건축과051-519-4602<NA>
89부산광역시 남구대연마루 지역주택조합부산광역시 남구 수영로 69번길 5 2층408부산광역시 남구 문현동 125023560.991823560<NA>2015-07-272017-12-152018-06-112022-12-06남구 건축과051-607-4602조합해산
910부산광역시 남구부산오션힐 지역주택조합부산광역시 남구 자성로 152번길, 1003호401부산광역시 남구 문현동 125526150.993983662<NA>2015-05-182018-06-182019-01-182023-03-30남구 건축과051-607-4602<NA>
연번소재지조합명조합사무실 주소조합원수사업예정지대지면적(제곱미터)연면적(제곱미터)총세대수조합원모집신고일조합설립인가일사업계획승인일착공신고일사용검사일(예정일)담당기관담당기관 전화번호비고
101102부산광역시 연제구(가칭)연산8동지역주택조합부산광역시 연제구 안연로 38, 4층(연산동)모집중부산광역시 연제구 연산동 384-3125335.0769195172022-03-15<NA><NA><NA><NA>연제구 건축과051-665-4604<NA>
102103부산광역시 연제구(가칭)연산6지역주택조합<NA><NA>부산광역시 연제구 연산동 665-445123.0<NA>1066<NA><NA><NA><NA><NA>연제구 건축과051-665-4604해산
103104부산광역시 연제구(가칭)연제지역주택조합부산광역시 연제구 월드컵대로243번길 19, 상가 802호(거제동)모집중부산광역시 연제구 연산동 105-113004.0972325222023-11-06<NA><NA><NA><NA>연제구 건축과051-665-4604<NA>
104105부산광역시 영도구청학1동지역주택조합부산광역시 영도구 청학로 73, 2층210부산광역시 영도구 청학동 279-615810.0474993502018-05-242019-05-28<NA><NA><NA>연제구 건축과051-419-4585<NA>
105106부산광역시 영도구청학지역주택조합부산광역시 영도구 태종로58, 302호233부산광역시 영도구 청학동 270-14020442.767675462<NA>2018-09-192023-10-20<NA><NA>연제구 건축과051-419-4585<NA>
106107부산광역시 영도구동삼지역주택조합부산광역시 영도구 태종로498, 3층 1호230부산광역시 영도구 동삼동 221-8716648.0499504002020-07-172022-08-22<NA><NA><NA>연제구 건축과051-419-4585<NA>
107108부산광역시 영도구(가칭)봉래지역주택조합부산광역시 영도구 태종로234, 2층모집중부산광역시 영도구 봉래동5가 80-342144.01473969702020-08-14<NA><NA><NA><NA>연제구 건축과051-419-4585<NA>
108109부산광역시 해운대구동부산지역주택조합부산광역시 해운대구 반송로 926, 3층191부산광역시 해운대구 반송동 293-115355.0483763232019-05-312022-03-16<NA><NA><NA>해운대구 공동주택관리과051-749-4604<NA>
109110부산광역시 해운대구(가칭)센텀우동지역주택조합부산광역시 해운대구 해운대로295번길 33모집중부산광역시 해운대구 우동 1201-115375.057267347<NA><NA><NA><NA><NA>해운대구 공동주택관리과051-749-4604<NA>
110111부산광역시 해운대구(가칭)해운대우동1지역주택조합부산광역시 해운대구 우동1로 53모집중부산광역시 해운대구 우동 397-5515801.061829380<NA><NA><NA><NA><NA>해운대구 공동주택관리과051-749-4604<NA>