Overview

Dataset statistics

Number of variables17
Number of observations105
Missing cells372
Missing cells (%)20.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.4 KiB
Average record size in memory140.3 B

Variable types

Numeric3
Categorical5
Text4
DateTime5

Dataset

Description인천광역시_소규모 주택정비 추진현황 데이터로 사업유형, 구역명, 성명, 직위, 위치, 면적, 추진단계, 조합원수, 주민합의처, 사업시행계획 등을 알 수 있는 자료입니다.
Author인천광역시
URLhttps://www.data.go.kr/data/15072776/fileData.do

Alerts

순번 is highly overall correlated with 구청High correlation
면적 is highly overall correlated with 토지등 소유자수High correlation
토지등 소유자수 is highly overall correlated with 면적High correlation
구청 is highly overall correlated with 순번High correlation
사업유형 is highly overall correlated with 대표자직위 and 1 other fieldsHigh correlation
대표자성명 is highly overall correlated with 대표자직위High correlation
대표자직위 is highly overall correlated with 사업유형 and 2 other fieldsHigh correlation
추진단계 is highly overall correlated with 사업유형 and 1 other fieldsHigh correlation
대표자직위 is highly imbalanced (82.9%)Imbalance
건축심의 has 53 (50.5%) missing valuesMissing
사업시행계획인가 has 84 (80.0%) missing valuesMissing
관리처분계획인가 has 94 (89.5%) missing valuesMissing
착공(시공중) has 98 (93.3%) missing valuesMissing
비고 has 42 (40.0%) missing valuesMissing
순번 has unique valuesUnique
위치 has unique valuesUnique
면적 has unique valuesUnique

Reproduction

Analysis started2024-04-06 08:28:41.467922
Analysis finished2024-04-06 08:28:48.299697
Duration6.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct105
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean53
Minimum1
Maximum105
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-04-06T17:28:48.470051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.2
Q127
median53
Q379
95-th percentile99.8
Maximum105
Range104
Interquartile range (IQR)52

Descriptive statistics

Standard deviation30.454885
Coefficient of variation (CV)0.57462047
Kurtosis-1.2
Mean53
Median Absolute Deviation (MAD)26
Skewness0
Sum5565
Variance927.5
MonotonicityStrictly increasing
2024-04-06T17:28:48.784374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
80 1
 
1.0%
78 1
 
1.0%
77 1
 
1.0%
76 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
Other values (95) 95
90.5%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
105 1
1.0%
104 1
1.0%
103 1
1.0%
102 1
1.0%
101 1
1.0%
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%

구청
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size972.0 B
미추홀구
30 
부평구
23 
서구
20 
계양구
15 
남동구
13 
Other values (2)

Length

Max length4
Median length3
Mean length3.0571429
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중구
2nd row중구
3rd row동구
4th row동구
5th row미추홀구

Common Values

ValueCountFrequency (%)
미추홀구 30
28.6%
부평구 23
21.9%
서구 20
19.0%
계양구 15
14.3%
남동구 13
12.4%
중구 2
 
1.9%
동구 2
 
1.9%

Length

2024-04-06T17:28:49.114259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:28:49.420812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미추홀구 30
28.6%
부평구 23
21.9%
서구 20
19.0%
계양구 15
14.3%
남동구 13
12.4%
중구 2
 
1.9%
동구 2
 
1.9%

사업유형
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size972.0 B
가로주택
64 
소규모재건축
38 
자율주택
 
3

Length

Max length6
Median length4
Mean length4.7238095
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가로주택
2nd row가로주택
3rd row가로주택
4th row가로주택
5th row가로주택

Common Values

ValueCountFrequency (%)
가로주택 64
61.0%
소규모재건축 38
36.2%
자율주택 3
 
2.9%

Length

2024-04-06T17:28:49.919093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:28:50.207757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가로주택 64
61.0%
소규모재건축 38
36.2%
자율주택 3
 
2.9%
Distinct104
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size972.0 B
2024-04-06T17:28:50.741142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length16
Mean length8.2
Min length2

Characters and Unicode

Total characters861
Distinct characters138
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)98.1%

Sample

1st row신흥삼익아파트 1단지 가로주택정비사업구역
2nd row신흥삼익아파트 2단지 가로주택정비사업구역
3rd row송현2동 72번지 일원
4th row송림동67-10번지 일원
5th row인천 숭의2 LH참여형
ValueCountFrequency (%)
일원 13
 
7.5%
석남동 4
 
2.3%
가로주택정비사업 4
 
2.3%
효성동 3
 
1.7%
인천 3
 
1.7%
lh참여형 3
 
1.7%
성신아파트 2
 
1.2%
가좌동 2
 
1.2%
신흥삼익아파트 2
 
1.2%
삼산동 2
 
1.2%
Other values (128) 135
78.0%
2024-04-06T17:28:51.636011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
73
 
8.5%
51
 
5.9%
50
 
5.8%
50
 
5.8%
48
 
5.6%
1 19
 
2.2%
17
 
2.0%
17
 
2.0%
17
 
2.0%
2 16
 
1.9%
Other values (128) 503
58.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 665
77.2%
Decimal Number 102
 
11.8%
Space Separator 73
 
8.5%
Dash Punctuation 12
 
1.4%
Uppercase Letter 7
 
0.8%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
7.7%
50
 
7.5%
50
 
7.5%
48
 
7.2%
17
 
2.6%
17
 
2.6%
17
 
2.6%
15
 
2.3%
14
 
2.1%
13
 
2.0%
Other values (112) 373
56.1%
Decimal Number
ValueCountFrequency (%)
1 19
18.6%
2 16
15.7%
3 13
12.7%
9 12
11.8%
7 11
10.8%
0 8
7.8%
5 7
 
6.9%
8 7
 
6.9%
4 6
 
5.9%
6 3
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
L 3
42.9%
H 3
42.9%
B 1
 
14.3%
Space Separator
ValueCountFrequency (%)
73
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 665
77.2%
Common 189
 
22.0%
Latin 7
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
7.7%
50
 
7.5%
50
 
7.5%
48
 
7.2%
17
 
2.6%
17
 
2.6%
17
 
2.6%
15
 
2.3%
14
 
2.1%
13
 
2.0%
Other values (112) 373
56.1%
Common
ValueCountFrequency (%)
73
38.6%
1 19
 
10.1%
2 16
 
8.5%
3 13
 
6.9%
- 12
 
6.3%
9 12
 
6.3%
7 11
 
5.8%
0 8
 
4.2%
5 7
 
3.7%
8 7
 
3.7%
Other values (3) 11
 
5.8%
Latin
ValueCountFrequency (%)
L 3
42.9%
H 3
42.9%
B 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 665
77.2%
ASCII 196
 
22.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
73
37.2%
1 19
 
9.7%
2 16
 
8.2%
3 13
 
6.6%
- 12
 
6.1%
9 12
 
6.1%
7 11
 
5.6%
0 8
 
4.1%
5 7
 
3.6%
8 7
 
3.6%
Other values (6) 18
 
9.2%
Hangul
ValueCountFrequency (%)
51
 
7.7%
50
 
7.5%
50
 
7.5%
48
 
7.2%
17
 
2.6%
17
 
2.6%
17
 
2.6%
15
 
2.3%
14
 
2.1%
13
 
2.0%
Other values (112) 373
56.1%

대표자성명
Categorical

HIGH CORRELATION 

Distinct36
Distinct (%)34.3%
Missing0
Missing (%)0.0%
Memory size972.0 B
이00
19 
김00
18 
박00
최00
강00
Other values (31)
48 

Length

Max length5
Median length3
Mean length3.0190476
Min length3

Unique

Unique18 ?
Unique (%)17.1%

Sample

1st row김00
2nd row김00
3rd row심00
4th row최00
5th row김00

Common Values

ValueCountFrequency (%)
이00 19
18.1%
김00 18
17.1%
박00 9
 
8.6%
최00 6
 
5.7%
강00 5
 
4.8%
황00 3
 
2.9%
송00 3
 
2.9%
정00 3
 
2.9%
임00 3
 
2.9%
백00 2
 
1.9%
Other values (26) 34
32.4%

Length

2024-04-06T17:28:52.037036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
이00 19
18.1%
김00 18
17.1%
박00 9
 
8.6%
최00 6
 
5.7%
강00 5
 
4.8%
황00 3
 
2.9%
송00 3
 
2.9%
정00 3
 
2.9%
임00 3
 
2.9%
윤00 2
 
1.9%
Other values (26) 34
32.4%

대표자직위
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size972.0 B
조합장
100 
주민대표
 
3
지정개발자
 
1
대표자
 
1

Length

Max length5
Median length3
Mean length3.047619
Min length3

Unique

Unique2 ?
Unique (%)1.9%

Sample

1st row조합장
2nd row조합장
3rd row조합장
4th row조합장
5th row조합장

Common Values

ValueCountFrequency (%)
조합장 100
95.2%
주민대표 3
 
2.9%
지정개발자 1
 
1.0%
대표자 1
 
1.0%

Length

2024-04-06T17:28:52.451627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:28:52.732564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
조합장 100
95.2%
주민대표 3
 
2.9%
지정개발자 1
 
1.0%
대표자 1
 
1.0%

위치
Text

UNIQUE 

Distinct105
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size972.0 B
2024-04-06T17:28:53.257455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length21
Mean length14.085714
Min length7

Characters and Unicode

Total characters1479
Distinct characters76
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique105 ?
Unique (%)100.0%

Sample

1st row인천광역시 중구 신흥동2가 54-5 1단지 일원
2nd row인천광역시 중구 신흥동2가 54-7 2단지 일원
3rd row송현동 72-185번지 일원
4th row송림동 67-10번지 일원
5th row숭의동 177번지 일원
ValueCountFrequency (%)
일원 38
 
11.7%
20
 
6.2%
서구 19
 
5.9%
석남동 17
 
5.2%
용현동 9
 
2.8%
미추홀구 8
 
2.5%
주안동 8
 
2.5%
만수동 7
 
2.2%
삼산동 6
 
1.9%
효성동 6
 
1.9%
Other values (149) 186
57.4%
2024-04-06T17:28:54.181396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
223
 
15.1%
105
 
7.1%
- 91
 
6.2%
1 88
 
5.9%
86
 
5.8%
2 72
 
4.9%
65
 
4.4%
5 62
 
4.2%
3 46
 
3.1%
9 42
 
2.8%
Other values (66) 599
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 680
46.0%
Decimal Number 476
32.2%
Space Separator 223
 
15.1%
Dash Punctuation 91
 
6.2%
Other Punctuation 9
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
105
15.4%
86
 
12.6%
65
 
9.6%
41
 
6.0%
41
 
6.0%
30
 
4.4%
22
 
3.2%
21
 
3.1%
19
 
2.8%
19
 
2.8%
Other values (52) 231
34.0%
Decimal Number
ValueCountFrequency (%)
1 88
18.5%
2 72
15.1%
5 62
13.0%
3 46
9.7%
9 42
8.8%
7 41
8.6%
8 33
 
6.9%
6 32
 
6.7%
4 31
 
6.5%
0 29
 
6.1%
Other Punctuation
ValueCountFrequency (%)
, 8
88.9%
@ 1
 
11.1%
Space Separator
ValueCountFrequency (%)
223
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 91
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 799
54.0%
Hangul 680
46.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
105
15.4%
86
 
12.6%
65
 
9.6%
41
 
6.0%
41
 
6.0%
30
 
4.4%
22
 
3.2%
21
 
3.1%
19
 
2.8%
19
 
2.8%
Other values (52) 231
34.0%
Common
ValueCountFrequency (%)
223
27.9%
- 91
11.4%
1 88
 
11.0%
2 72
 
9.0%
5 62
 
7.8%
3 46
 
5.8%
9 42
 
5.3%
7 41
 
5.1%
8 33
 
4.1%
6 32
 
4.0%
Other values (4) 69
 
8.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 799
54.0%
Hangul 680
46.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
223
27.9%
- 91
11.4%
1 88
 
11.0%
2 72
 
9.0%
5 62
 
7.8%
3 46
 
5.8%
9 42
 
5.3%
7 41
 
5.1%
8 33
 
4.1%
6 32
 
4.0%
Other values (4) 69
 
8.6%
Hangul
ValueCountFrequency (%)
105
15.4%
86
 
12.6%
65
 
9.6%
41
 
6.0%
41
 
6.0%
30
 
4.4%
22
 
3.2%
21
 
3.1%
19
 
2.8%
19
 
2.8%
Other values (52) 231
34.0%

면적
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct105
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4792.6061
Minimum131.8
Maximum9392
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-04-06T17:28:54.463465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum131.8
5-th percentile1278.22
Q13158.6
median5148.3
Q36360
95-th percentile7932.04
Maximum9392
Range9260.2
Interquartile range (IQR)3201.4

Descriptive statistics

Standard deviation2209.7279
Coefficient of variation (CV)0.4610702
Kurtosis-0.74665297
Mean4792.6061
Median Absolute Deviation (MAD)1642.7
Skewness-0.13294244
Sum503223.64
Variance4882897.3
MonotonicityNot monotonic
2024-04-06T17:28:54.830357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7347.9 1
 
1.0%
4640.1 1
 
1.0%
6600.2 1
 
1.0%
5821.0 1
 
1.0%
3150.0 1
 
1.0%
5290.0 1
 
1.0%
2529.1 1
 
1.0%
6357.0 1
 
1.0%
1885.0 1
 
1.0%
2291.0 1
 
1.0%
Other values (95) 95
90.5%
ValueCountFrequency (%)
131.8 1
1.0%
276.5 1
1.0%
423.4 1
1.0%
925.0 1
1.0%
991.8 1
1.0%
1271.1 1
1.0%
1306.7 1
1.0%
1412.2 1
1.0%
1481.0 1
1.0%
1482.4 1
1.0%
ValueCountFrequency (%)
9392.0 1
1.0%
9244.7 1
1.0%
9167.3 1
1.0%
8804.06 1
1.0%
8052.1 1
1.0%
7936.8 1
1.0%
7913.0 1
1.0%
7736.4 1
1.0%
7716.3 1
1.0%
7693.25 1
1.0%

추진단계
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size972.0 B
조합설립인가
48 
건축심의완료
31 
사업시행계획인가
14 
착공
주민합의체
 
3

Length

Max length8
Median length6
Mean length5.9904762
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row조합설립인가
2nd row조합설립인가
3rd row조합설립인가
4th row조합설립인가
5th row착공

Common Values

ValueCountFrequency (%)
조합설립인가 48
45.7%
건축심의완료 31
29.5%
사업시행계획인가 14
 
13.3%
착공 7
 
6.7%
주민합의체 3
 
2.9%
건축심의 접수 2
 
1.9%

Length

2024-04-06T17:28:55.139677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:28:55.468171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
조합설립인가 48
44.9%
건축심의완료 31
29.0%
사업시행계획인가 14
 
13.1%
착공 7
 
6.5%
주민합의체 3
 
2.8%
건축심의 2
 
1.9%
접수 2
 
1.9%
Distinct82
Distinct (%)78.1%
Missing0
Missing (%)0.0%
Memory size972.0 B
2024-04-06T17:28:55.981404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length2
Mean length2.4095238
Min length1

Characters and Unicode

Total characters253
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)61.9%

Sample

1st row246
2nd row196
3rd row40
4th row84
5th row22
ValueCountFrequency (%)
118 4
 
3.8%
45 4
 
3.8%
29 3
 
2.9%
2 3
 
2.9%
84 2
 
1.9%
136 2
 
1.9%
127 2
 
1.9%
65 2
 
1.9%
143 2
 
1.9%
162 2
 
1.9%
Other values (72) 79
75.2%
2024-04-06T17:28:56.748208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 59
23.3%
2 31
12.3%
6 29
11.5%
4 23
 
9.1%
5 23
 
9.1%
8 22
 
8.7%
3 21
 
8.3%
9 20
 
7.9%
0 13
 
5.1%
7 11
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 252
99.6%
Dash Punctuation 1
 
0.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 59
23.4%
2 31
12.3%
6 29
11.5%
4 23
 
9.1%
5 23
 
9.1%
8 22
 
8.7%
3 21
 
8.3%
9 20
 
7.9%
0 13
 
5.2%
7 11
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 253
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 59
23.3%
2 31
12.3%
6 29
11.5%
4 23
 
9.1%
5 23
 
9.1%
8 22
 
8.7%
3 21
 
8.3%
9 20
 
7.9%
0 13
 
5.1%
7 11
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 253
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 59
23.3%
2 31
12.3%
6 29
11.5%
4 23
 
9.1%
5 23
 
9.1%
8 22
 
8.7%
3 21
 
8.3%
9 20
 
7.9%
0 13
 
5.1%
7 11
 
4.3%

토지등 소유자수
Real number (ℝ)

HIGH CORRELATION 

Distinct82
Distinct (%)78.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean109.4381
Minimum2
Maximum311
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2024-04-06T17:28:57.404688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile27.2
Q155
median103
Q3151
95-th percentile211
Maximum311
Range309
Interquartile range (IQR)96

Descriptive statistics

Standard deviation62.312322
Coefficient of variation (CV)0.5693842
Kurtosis-0.084680343
Mean109.4381
Median Absolute Deviation (MAD)48
Skewness0.49336194
Sum11491
Variance3882.8255
MonotonicityNot monotonic
2024-04-06T17:28:57.682079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
49 3
 
2.9%
2 3
 
2.9%
149 3
 
2.9%
112 3
 
2.9%
29 2
 
1.9%
89 2
 
1.9%
35 2
 
1.9%
208 2
 
1.9%
173 2
 
1.9%
165 2
 
1.9%
Other values (72) 81
77.1%
ValueCountFrequency (%)
2 3
2.9%
17 1
 
1.0%
24 1
 
1.0%
27 1
 
1.0%
28 1
 
1.0%
29 2
1.9%
31 1
 
1.0%
35 2
1.9%
36 1
 
1.0%
39 1
 
1.0%
ValueCountFrequency (%)
311 1
1.0%
246 1
1.0%
236 1
1.0%
226 1
1.0%
220 1
1.0%
211 2
1.9%
208 2
1.9%
197 1
1.0%
196 1
1.0%
192 1
1.0%
Distinct99
Distinct (%)95.2%
Missing1
Missing (%)1.0%
Memory size972.0 B
Minimum2003-06-30 00:00:00
Maximum2024-03-04 00:00:00
2024-04-06T17:28:57.910646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:28:58.152550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

건축심의
Date

MISSING 

Distinct46
Distinct (%)88.5%
Missing53
Missing (%)50.5%
Memory size972.0 B
Minimum2003-09-26 00:00:00
Maximum2024-01-11 00:00:00
2024-04-06T17:28:58.421517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:28:58.705211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)
Distinct21
Distinct (%)100.0%
Missing84
Missing (%)80.0%
Memory size972.0 B
Minimum2003-11-29 00:00:00
Maximum2024-01-22 00:00:00
2024-04-06T17:28:59.077936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:28:59.285779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
Distinct11
Distinct (%)100.0%
Missing94
Missing (%)89.5%
Memory size972.0 B
Minimum2016-03-21 00:00:00
Maximum2024-01-22 00:00:00
2024-04-06T17:28:59.484602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:28:59.716262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)

착공(시공중)
Date

MISSING 

Distinct7
Distinct (%)100.0%
Missing98
Missing (%)93.3%
Memory size972.0 B
Minimum2021-06-01 00:00:00
Maximum2022-11-15 00:00:00
2024-04-06T17:29:00.007888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:29:00.294851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)

비고
Text

MISSING 

Distinct42
Distinct (%)66.7%
Missing42
Missing (%)40.0%
Memory size972.0 B
2024-04-06T17:29:00.746974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length85
Median length83
Mean length16.238095
Min length3

Characters and Unicode

Total characters1023
Distinct characters119
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)52.4%

Sample

1st row태영건설
2nd row성호건설
3rd row우탑건설
4th row극동건설
5th row관리처분계획변경인가 2020-03-23SM우방
ValueCountFrequency (%)
도시개발과(김진성)032-509-6933 10
 
11.1%
도시개발과(안혜인)032-509-6934 6
 
6.7%
건축심의 4
 
4.4%
관리지역 4
 
4.4%
검토구역도시개발과(반기훈)032-509-6932 3
 
3.3%
기성건설 2
 
2.2%
남광토건 2
 
2.2%
이수건설 2
 
2.2%
금호건설 2
 
2.2%
태영건설 2
 
2.2%
Other values (48) 53
58.9%
2024-04-06T17:29:01.550823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
159
 
15.5%
3 59
 
5.8%
0 54
 
5.3%
- 50
 
4.9%
9 46
 
4.5%
2 37
 
3.6%
34
 
3.3%
29
 
2.8%
26
 
2.5%
) 25
 
2.4%
Other values (109) 504
49.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 493
48.2%
Decimal Number 252
24.6%
Space Separator 159
 
15.5%
Dash Punctuation 50
 
4.9%
Close Punctuation 25
 
2.4%
Open Punctuation 25
 
2.4%
Uppercase Letter 8
 
0.8%
Other Symbol 6
 
0.6%
Other Punctuation 5
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
6.9%
29
 
5.9%
26
 
5.3%
25
 
5.1%
25
 
5.1%
25
 
5.1%
24
 
4.9%
16
 
3.2%
11
 
2.2%
11
 
2.2%
Other values (87) 267
54.2%
Decimal Number
ValueCountFrequency (%)
3 59
23.4%
0 54
21.4%
9 46
18.3%
2 37
14.7%
6 24
9.5%
5 23
 
9.1%
4 7
 
2.8%
8 1
 
0.4%
1 1
 
0.4%
Uppercase Letter
ValueCountFrequency (%)
S 2
25.0%
L 2
25.0%
M 1
12.5%
D 1
12.5%
K 1
12.5%
H 1
12.5%
Other Punctuation
ValueCountFrequency (%)
. 3
60.0%
/ 2
40.0%
Space Separator
ValueCountFrequency (%)
159
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 50
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Open Punctuation
ValueCountFrequency (%)
( 25
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 516
50.4%
Hangul 499
48.8%
Latin 8
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
6.8%
29
 
5.8%
26
 
5.2%
25
 
5.0%
25
 
5.0%
25
 
5.0%
24
 
4.8%
16
 
3.2%
11
 
2.2%
11
 
2.2%
Other values (88) 273
54.7%
Common
ValueCountFrequency (%)
159
30.8%
3 59
 
11.4%
0 54
 
10.5%
- 50
 
9.7%
9 46
 
8.9%
2 37
 
7.2%
) 25
 
4.8%
( 25
 
4.8%
6 24
 
4.7%
5 23
 
4.5%
Other values (5) 14
 
2.7%
Latin
ValueCountFrequency (%)
S 2
25.0%
L 2
25.0%
M 1
12.5%
D 1
12.5%
K 1
12.5%
H 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 524
51.2%
Hangul 493
48.2%
None 6
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
159
30.3%
3 59
 
11.3%
0 54
 
10.3%
- 50
 
9.5%
9 46
 
8.8%
2 37
 
7.1%
) 25
 
4.8%
( 25
 
4.8%
6 24
 
4.6%
5 23
 
4.4%
Other values (11) 22
 
4.2%
Hangul
ValueCountFrequency (%)
34
 
6.9%
29
 
5.9%
26
 
5.3%
25
 
5.1%
25
 
5.1%
25
 
5.1%
24
 
4.9%
16
 
3.2%
11
 
2.2%
11
 
2.2%
Other values (87) 267
54.2%
None
ValueCountFrequency (%)
6
100.0%

Interactions

2024-04-06T17:28:46.079604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:28:44.275636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:28:44.956729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:28:46.361450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:28:44.509828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:28:45.217555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:28:46.616716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:28:44.714420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:28:45.440714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:29:01.774923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구청사업유형대표자성명대표자직위면적추진단계조합원수토지등 소유자수주민합의체(조합설립)건축심의사업시행계획인가관리처분계획인가착공(시공중)비고
순번1.0000.8900.4840.0000.2390.0000.5710.7350.0000.8720.9591.0001.0001.0000.917
구청0.8901.0000.3040.0000.0000.1800.0750.0000.0000.9590.8981.0001.0001.0000.923
사업유형0.4840.3041.0000.4710.6750.6070.9460.9500.6590.9890.7911.0001.0001.0000.920
대표자성명0.0000.0000.4711.0000.9040.0000.6080.0000.0000.9850.7651.0001.0001.0000.733
대표자직위0.2390.0000.6750.9041.0000.5600.7351.0000.3441.0001.0001.000NaNNaN1.000
면적0.0000.1800.6070.0000.5601.0000.4790.8700.6510.9390.7551.0001.0001.0000.732
추진단계0.5710.0750.9460.6080.7350.4791.0000.9390.3780.9371.0001.0001.000NaN0.961
조합원수0.7350.0000.9500.0001.0000.8700.9391.0000.9910.9830.9161.0001.0001.0000.981
토지등 소유자수0.0000.0000.6590.0000.3440.6510.3780.9911.0000.0000.0001.0001.0001.0000.000
주민합의체(조합설립)0.8720.9590.9890.9851.0000.9390.9370.9830.0001.0000.9851.0001.0001.0001.000
건축심의0.9590.8980.7910.7651.0000.7551.0000.9160.0000.9851.0001.0001.0001.0000.910
사업시행계획인가1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
관리처분계획인가1.0001.0001.0001.000NaN1.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
착공(시공중)1.0001.0001.0001.000NaN1.000NaN1.0001.0001.0001.0001.0001.0001.0001.000
비고0.9170.9230.9200.7331.0000.7320.9610.9810.0001.0000.9101.0001.0001.0001.000
2024-04-06T17:29:02.171276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대표자성명구청사업유형대표자직위추진단계
대표자성명1.0000.0000.1980.5460.239
구청0.0001.0000.2090.0000.037
사업유형0.1980.2091.0000.7010.707
대표자직위0.5460.0000.7011.0000.565
추진단계0.2390.0370.7070.5651.000
2024-04-06T17:29:02.441569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번면적토지등 소유자수구청사업유형대표자성명대표자직위추진단계
순번1.000-0.103-0.0150.7180.3200.0000.1370.337
면적-0.1031.0000.8130.0710.4890.0000.4030.306
토지등 소유자수-0.0150.8131.0000.0000.3600.0000.2190.193
구청0.7180.0710.0001.0000.2090.0000.0000.037
사업유형0.3200.4890.3600.2091.0000.1980.7010.707
대표자성명0.0000.0000.0000.0000.1981.0000.5460.239
대표자직위0.1370.4030.2190.0000.7010.5461.0000.565
추진단계0.3370.3060.1930.0370.7070.2390.5651.000

Missing values

2024-04-06T17:28:47.129050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:28:47.701712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-06T17:28:48.108753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번구청사업유형구역명대표자성명대표자직위위치면적추진단계조합원수토지등 소유자수주민합의체(조합설립)건축심의사업시행계획인가관리처분계획인가착공(시공중)비고
01중구가로주택신흥삼익아파트 1단지 가로주택정비사업구역김00조합장인천광역시 중구 신흥동2가 54-5 1단지 일원7347.9조합설립인가2462462022-11-11<NA><NA><NA><NA><NA>
12중구가로주택신흥삼익아파트 2단지 가로주택정비사업구역김00조합장인천광역시 중구 신흥동2가 54-7 2단지 일원6735.5조합설립인가1961962022-11-11<NA><NA><NA><NA><NA>
23동구가로주택송현2동 72번지 일원심00조합장송현동 72-185번지 일원4670.81조합설립인가40492021-07-28<NA><NA><NA><NA>태영건설
34동구가로주택송림동67-10번지 일원최00조합장송림동 67-10번지 일원4236.15조합설립인가841022022-07-01<NA><NA><NA><NA><NA>
45미추홀구가로주택인천 숭의2 LH참여형김00조합장숭의동 177번지 일원3028.4착공22552017-11-302019-06-262020-11-162020-11-162021-06-01성호건설
56미추홀구가로주택인천 용현1 LH참여형민00조합장용현동 568-2 진달래@2958.3착공86902017-12-282020-03-272021-05-172021-05-172022-01-21우탑건설
67미추홀구소규모재건축로얄맨션가00조합장주안동882-16192.9착공1341492016-08-302019-03-122020-06-012020-06-012021-11-26극동건설
78미추홀구소규모재건축용현5동새한아파트조00조합장용현동627-805390.0착공531082012-09-192014-10-302015-06-032016-03-212022-11-07관리처분계획변경인가 2020-03-23SM우방
89미추홀구가로주택주안 상일연립이00조합장주안동 85-6925.0사업시행계획인가6172017-03-102018-07-262019-07-152019-07-15<NA>오성종합건설
910미추홀구가로주택숭의동 289-1강00조합장숭의동 289-1번지 일원7936.8사업시행계획인가841052019-05-012020-12-152022-11-01<NA><NA>㈜신일
순번구청사업유형구역명대표자성명대표자직위위치면적추진단계조합원수토지등 소유자수주민합의체(조합설립)건축심의사업시행계획인가관리처분계획인가착공(시공중)비고
9596서구가로주택은성빌라한00조합장서구 석남동 522번지 외 2필지991.8조합설립인가36362020-09-01<NA><NA><NA><NA>조합설립인가 취소 요청 반려
9697서구가로주택석남동 489번지 일원노00조합장서구 석남동 489번지 외 5필지6614.1조합설립인가2262262021-05-18<NA><NA><NA><NA><NA>
9798서구가로주택가좌동 197번지 일원박00조합장서구 가좌동 197번지 일원6580.6조합설립인가68682022-01-27<NA><NA><NA><NA><NA>
9899서구가로주택금강빌라주변민00조합장서구 석남동 515-8번지 외6필지3020.3조합설립인가29292022-03-11<NA><NA><NA><NA><NA>
99100서구가로주택태산아파트이00조합장서구 석남동 499번지3768.4조합설립인가1431432022-07-28<NA><NA><NA><NA><NA>
100101서구가로주택중앙1차아파트권00조합장서구 석남동 503번지1482.4조합설립인가35352022-08-26<NA><NA><NA><NA><NA>
101102서구가로주택인천 석남동 473번지일대박00조합장서구 석남동 473번지 일원5318.6조합설립인가1901902022-08-29<NA><NA><NA><NA><NA>
102103서구가로주택창대빌라 일대박00조합장서구 석남동 201-1번지 일원4451.0조합설립인가1031032022-09-26<NA><NA><NA><NA><NA>
103104서구가로주택석남동 529번지 일원이00조합장석남동 529번지 일원9167.3조합설립인가1731732023-01-02<NA><NA><NA><NA>2022.8.4. 이후 설립된 조합
104105서구소규모재건축가좌동 207번지 동남아파트이00조합장서구 가좌동 207번지6277.6조합설립인가1361592022-07-11<NA><NA><NA><NA><NA>