Overview

Dataset statistics

Number of variables18
Number of observations102
Missing cells469
Missing cells (%)25.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.0 KiB
Average record size in memory150.3 B

Variable types

Numeric4
Categorical5
Text3
DateTime5
Unsupported1

Dataset

Description인천광역시_소규모 주택정비 추진현황 데이터로 사업유형, 구역명, 성명, 직위, 위치, 면적, 추진단계, 조합원수, 주민합의처, 사업시행계획 등을 알 수 있는 자료입니다.
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15072776&srcSe=7661IVAWM27C61E190

Alerts

순번 is highly overall correlated with 구청High correlation
면적 is highly overall correlated with 조합원수 and 1 other fieldsHigh correlation
조합원수 is highly overall correlated with 면적 and 1 other fieldsHigh correlation
토지등 소유자수 is highly overall correlated with 면적 and 1 other fieldsHigh correlation
구청 is highly overall correlated with 순번High correlation
사업유형 is highly overall correlated with 대표자직위 and 1 other fieldsHigh correlation
대표자성명 is highly overall correlated with 대표자직위High correlation
대표자직위 is highly overall correlated with 사업유형 and 2 other fieldsHigh correlation
추진단계 is highly overall correlated with 사업유형 and 1 other fieldsHigh correlation
대표자직위 is highly imbalanced (82.9%)Imbalance
건축심의 has 56 (54.9%) missing valuesMissing
사업시행계획인가 has 83 (81.4%) missing valuesMissing
관리처분계획인가 has 92 (90.2%) missing valuesMissing
착공(시공중) has 95 (93.1%) missing valuesMissing
준공 has 102 (100.0%) missing valuesMissing
비고 has 40 (39.2%) missing valuesMissing
순번 has unique valuesUnique
위치 has unique valuesUnique
면적 has unique valuesUnique
준공 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-14 03:09:30.870690
Analysis finished2024-04-14 03:09:33.057296
Duration2.19 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct102
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean51.5
Minimum1
Maximum102
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-14T12:09:33.119327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.05
Q126.25
median51.5
Q376.75
95-th percentile96.95
Maximum102
Range101
Interquartile range (IQR)50.5

Descriptive statistics

Standard deviation29.588849
Coefficient of variation (CV)0.57454076
Kurtosis-1.2
Mean51.5
Median Absolute Deviation (MAD)25.5
Skewness0
Sum5253
Variance875.5
MonotonicityStrictly increasing
2024-04-14T12:09:33.231336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
66 1
 
1.0%
76 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
Other values (92) 92
90.2%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
102 1
1.0%
101 1
1.0%
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%

구청
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size948.0 B
미추홀구
29 
부평구
22 
서구
20 
계양구
15 
남동구
12 
Other values (2)

Length

Max length4
Median length3
Mean length3.0490196
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중구
2nd row중구
3rd row동구
4th row동구
5th row미추홀구

Common Values

ValueCountFrequency (%)
미추홀구 29
28.4%
부평구 22
21.6%
서구 20
19.6%
계양구 15
14.7%
남동구 12
11.8%
중구 2
 
2.0%
동구 2
 
2.0%

Length

2024-04-14T12:09:33.345382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-14T12:09:33.436111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미추홀구 29
28.4%
부평구 22
21.6%
서구 20
19.6%
계양구 15
14.7%
남동구 12
11.8%
중구 2
 
2.0%
동구 2
 
2.0%

사업유형
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size948.0 B
가로주택
62 
소규모재건축
37 
자율주택
 
3

Length

Max length6
Median length4
Mean length4.7254902
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가로주택
2nd row가로주택
3rd row가로주택
4th row가로주택
5th row가로주택

Common Values

ValueCountFrequency (%)
가로주택 62
60.8%
소규모재건축 37
36.3%
자율주택 3
 
2.9%

Length

2024-04-14T12:09:33.769331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-14T12:09:33.854479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가로주택 62
60.8%
소규모재건축 37
36.3%
자율주택 3
 
2.9%
Distinct101
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size948.0 B
2024-04-14T12:09:34.053864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length16
Mean length8.2156863
Min length2

Characters and Unicode

Total characters838
Distinct characters138
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)98.0%

Sample

1st row신흥삼익아파트 1단지 가로주택정비사업구역
2nd row신흥삼익아파트 2단지 가로주택정비사업구역
3rd row송현2동 72번지 일원
4th row송림동67-10번지 일원
5th row인천 숭의2 LH참여형
ValueCountFrequency (%)
일원 12
 
7.2%
석남동 4
 
2.4%
가로주택정비사업 4
 
2.4%
인천 3
 
1.8%
효성동 3
 
1.8%
lh참여형 3
 
1.8%
성신아파트 2
 
1.2%
삼산동 2
 
1.2%
계산동 2
 
1.2%
동남아파트 2
 
1.2%
Other values (125) 130
77.8%
2024-04-14T12:09:34.390281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
70
 
8.4%
50
 
6.0%
49
 
5.8%
49
 
5.8%
46
 
5.5%
1 19
 
2.3%
17
 
2.0%
16
 
1.9%
16
 
1.9%
2 14
 
1.7%
Other values (128) 492
58.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 648
77.3%
Decimal Number 99
 
11.8%
Space Separator 70
 
8.4%
Dash Punctuation 12
 
1.4%
Uppercase Letter 7
 
0.8%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
 
7.7%
49
 
7.6%
49
 
7.6%
46
 
7.1%
17
 
2.6%
16
 
2.5%
16
 
2.5%
14
 
2.2%
14
 
2.2%
12
 
1.9%
Other values (112) 365
56.3%
Decimal Number
ValueCountFrequency (%)
1 19
19.2%
2 14
14.1%
9 12
12.1%
3 12
12.1%
7 11
11.1%
0 8
8.1%
5 7
 
7.1%
8 7
 
7.1%
4 6
 
6.1%
6 3
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
H 3
42.9%
L 3
42.9%
B 1
 
14.3%
Space Separator
ValueCountFrequency (%)
70
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 648
77.3%
Common 183
 
21.8%
Latin 7
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
 
7.7%
49
 
7.6%
49
 
7.6%
46
 
7.1%
17
 
2.6%
16
 
2.5%
16
 
2.5%
14
 
2.2%
14
 
2.2%
12
 
1.9%
Other values (112) 365
56.3%
Common
ValueCountFrequency (%)
70
38.3%
1 19
 
10.4%
2 14
 
7.7%
9 12
 
6.6%
3 12
 
6.6%
- 12
 
6.6%
7 11
 
6.0%
0 8
 
4.4%
5 7
 
3.8%
8 7
 
3.8%
Other values (3) 11
 
6.0%
Latin
ValueCountFrequency (%)
H 3
42.9%
L 3
42.9%
B 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 648
77.3%
ASCII 190
 
22.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
70
36.8%
1 19
 
10.0%
2 14
 
7.4%
9 12
 
6.3%
3 12
 
6.3%
- 12
 
6.3%
7 11
 
5.8%
0 8
 
4.2%
5 7
 
3.7%
8 7
 
3.7%
Other values (6) 18
 
9.5%
Hangul
ValueCountFrequency (%)
50
 
7.7%
49
 
7.6%
49
 
7.6%
46
 
7.1%
17
 
2.6%
16
 
2.5%
16
 
2.5%
14
 
2.2%
14
 
2.2%
12
 
1.9%
Other values (112) 365
56.3%

대표자성명
Categorical

HIGH CORRELATION 

Distinct36
Distinct (%)35.3%
Missing0
Missing (%)0.0%
Memory size948.0 B
이00
19 
김00
17 
박00
최00
강00
Other values (31)
47 

Length

Max length5
Median length3
Mean length3.0196078
Min length3

Unique

Unique19 ?
Unique (%)18.6%

Sample

1st row김00
2nd row김00
3rd row심00
4th row최00
5th row김00

Common Values

ValueCountFrequency (%)
이00 19
18.6%
김00 17
16.7%
박00 9
 
8.8%
최00 5
 
4.9%
강00 5
 
4.9%
송00 3
 
2.9%
임00 3
 
2.9%
황00 3
 
2.9%
정00 3
 
2.9%
고00 2
 
2.0%
Other values (26) 33
32.4%

Length

2024-04-14T12:09:34.506336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
이00 19
18.6%
김00 17
16.7%
박00 9
 
8.8%
최00 5
 
4.9%
강00 5
 
4.9%
송00 3
 
2.9%
임00 3
 
2.9%
황00 3
 
2.9%
정00 3
 
2.9%
윤00 2
 
2.0%
Other values (26) 33
32.4%

대표자직위
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size948.0 B
조합장
98 
주민대표
 
3
지정개발자
 
1

Length

Max length5
Median length3
Mean length3.0490196
Min length3

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row조합장
2nd row조합장
3rd row조합장
4th row조합장
5th row조합장

Common Values

ValueCountFrequency (%)
조합장 98
96.1%
주민대표 3
 
2.9%
지정개발자 1
 
1.0%

Length

2024-04-14T12:09:34.613956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-14T12:09:34.700061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
조합장 98
96.1%
주민대표 3
 
2.9%
지정개발자 1
 
1.0%

위치
Text

UNIQUE 

Distinct102
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size948.0 B
2024-04-14T12:09:34.904164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length21
Mean length14.078431
Min length7

Characters and Unicode

Total characters1436
Distinct characters76
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique102 ?
Unique (%)100.0%

Sample

1st row인천광역시 중구 신흥동2가 54-5 1단지 일원
2nd row인천광역시 중구 신흥동2가 54-7 2단지 일원
3rd row송현동 72-185번지 일원
4th row송림동 67-10번지 일원
5th row숭의동 177번지 일원
ValueCountFrequency (%)
일원 36
 
11.5%
서구 19
 
6.1%
19
 
6.1%
석남동 17
 
5.4%
용현동 8
 
2.5%
미추홀구 8
 
2.5%
주안동 8
 
2.5%
효성동 6
 
1.9%
삼산동 6
 
1.9%
만수동 6
 
1.9%
Other values (146) 181
57.6%
2024-04-14T12:09:35.234484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
216
 
15.0%
102
 
7.1%
- 89
 
6.2%
1 85
 
5.9%
83
 
5.8%
2 69
 
4.8%
63
 
4.4%
5 61
 
4.2%
3 44
 
3.1%
9 41
 
2.9%
Other values (66) 583
40.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 660
46.0%
Decimal Number 462
32.2%
Space Separator 216
 
15.0%
Dash Punctuation 89
 
6.2%
Other Punctuation 9
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
102
15.5%
83
 
12.6%
63
 
9.5%
39
 
5.9%
39
 
5.9%
30
 
4.5%
21
 
3.2%
21
 
3.2%
19
 
2.9%
18
 
2.7%
Other values (52) 225
34.1%
Decimal Number
ValueCountFrequency (%)
1 85
18.4%
2 69
14.9%
5 61
13.2%
3 44
9.5%
9 41
8.9%
7 40
8.7%
8 32
 
6.9%
6 31
 
6.7%
4 30
 
6.5%
0 29
 
6.3%
Other Punctuation
ValueCountFrequency (%)
, 8
88.9%
@ 1
 
11.1%
Space Separator
ValueCountFrequency (%)
216
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 89
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 776
54.0%
Hangul 660
46.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
102
15.5%
83
 
12.6%
63
 
9.5%
39
 
5.9%
39
 
5.9%
30
 
4.5%
21
 
3.2%
21
 
3.2%
19
 
2.9%
18
 
2.7%
Other values (52) 225
34.1%
Common
ValueCountFrequency (%)
216
27.8%
- 89
11.5%
1 85
 
11.0%
2 69
 
8.9%
5 61
 
7.9%
3 44
 
5.7%
9 41
 
5.3%
7 40
 
5.2%
8 32
 
4.1%
6 31
 
4.0%
Other values (4) 68
 
8.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 776
54.0%
Hangul 660
46.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
216
27.8%
- 89
11.5%
1 85
 
11.0%
2 69
 
8.9%
5 61
 
7.9%
3 44
 
5.7%
9 41
 
5.3%
7 40
 
5.2%
8 32
 
4.1%
6 31
 
4.0%
Other values (4) 68
 
8.8%
Hangul
ValueCountFrequency (%)
102
15.5%
83
 
12.6%
63
 
9.5%
39
 
5.9%
39
 
5.9%
30
 
4.5%
21
 
3.2%
21
 
3.2%
19
 
2.9%
18
 
2.7%
Other values (52) 225
34.1%

면적
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct102
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4804.5327
Minimum131.8
Maximum9392
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-14T12:09:35.351604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum131.8
5-th percentile1278.155
Q13175.45
median5081.5
Q36525.45
95-th percentile7935.61
Maximum9392
Range9260.2
Interquartile range (IQR)3350

Descriptive statistics

Standard deviation2209.2527
Coefficient of variation (CV)0.45982676
Kurtosis-0.73806843
Mean4804.5327
Median Absolute Deviation (MAD)1687.5
Skewness-0.1170009
Sum490062.34
Variance4880797.5
MonotonicityNot monotonic
2024-04-14T12:09:35.461183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7347.9 1
 
1.0%
7254.1 1
 
1.0%
7240.0 1
 
1.0%
6620.2 1
 
1.0%
5821.0 1
 
1.0%
3150.0 1
 
1.0%
5290.0 1
 
1.0%
2529.1 1
 
1.0%
6357.0 1
 
1.0%
1885.0 1
 
1.0%
Other values (92) 92
90.2%
ValueCountFrequency (%)
131.8 1
1.0%
276.5 1
1.0%
423.4 1
1.0%
925.0 1
1.0%
991.8 1
1.0%
1271.1 1
1.0%
1412.2 1
1.0%
1481.0 1
1.0%
1482.4 1
1.0%
1775.0 1
1.0%
ValueCountFrequency (%)
9392.0 1
1.0%
9244.7 1
1.0%
9167.3 1
1.0%
8804.06 1
1.0%
8052.1 1
1.0%
7936.8 1
1.0%
7913.0 1
1.0%
7736.4 1
1.0%
7716.3 1
1.0%
7693.25 1
1.0%

추진단계
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size948.0 B
조합설립인가
51 
건축심의완료
26 
사업시행계획인가
12 
착공
주민합의체
 
3
Other values (2)
 
3

Length

Max length11
Median length6
Mean length6
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row조합설립인가
2nd row조합설립인가
3rd row조합설립인가
4th row조합설립인가
5th row착공

Common Values

ValueCountFrequency (%)
조합설립인가 51
50.0%
건축심의완료 26
25.5%
사업시행계획인가 12
 
11.8%
착공 7
 
6.9%
주민합의체 3
 
2.9%
건축심의 접수 2
 
2.0%
사업시행계획인가 접수 1
 
1.0%

Length

2024-04-14T12:09:35.559624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-14T12:09:35.643059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
조합설립인가 51
48.6%
건축심의완료 26
24.8%
사업시행계획인가 13
 
12.4%
착공 7
 
6.7%
주민합의체 3
 
2.9%
접수 3
 
2.9%
건축심의 2
 
1.9%

조합원수
Real number (ℝ)

HIGH CORRELATION 

Distinct81
Distinct (%)79.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean104.09804
Minimum2
Maximum311
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-14T12:09:35.746670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile23.05
Q153
median93.5
Q3142.25
95-th percentile210.95
Maximum311
Range309
Interquartile range (IQR)89.25

Descriptive statistics

Standard deviation62.625954
Coefficient of variation (CV)0.60160551
Kurtosis0.1418977
Mean104.09804
Median Absolute Deviation (MAD)44.5
Skewness0.63462047
Sum10618
Variance3922.0101
MonotonicityNot monotonic
2024-04-14T12:09:35.857722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
45 4
 
3.9%
2 3
 
2.9%
118 3
 
2.9%
29 3
 
2.9%
65 2
 
2.0%
127 2
 
2.0%
143 2
 
2.0%
136 2
 
2.0%
162 2
 
2.0%
119 2
 
2.0%
Other values (71) 77
75.5%
ValueCountFrequency (%)
2 3
2.9%
6 1
 
1.0%
22 1
 
1.0%
23 1
 
1.0%
24 1
 
1.0%
28 1
 
1.0%
29 3
2.9%
34 1
 
1.0%
35 1
 
1.0%
36 1
 
1.0%
ValueCountFrequency (%)
311 1
1.0%
246 1
1.0%
236 1
1.0%
226 1
1.0%
218 1
1.0%
211 1
1.0%
210 1
1.0%
208 1
1.0%
206 1
1.0%
196 1
1.0%

토지등 소유자수
Real number (ℝ)

HIGH CORRELATION 

Distinct82
Distinct (%)80.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean109.83333
Minimum2
Maximum311
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-14T12:09:35.969624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile27.05
Q155.25
median102.5
Q3151.75
95-th percentile211
Maximum311
Range309
Interquartile range (IQR)96.5

Descriptive statistics

Standard deviation62.785677
Coefficient of variation (CV)0.57164501
Kurtosis-0.11528403
Mean109.83333
Median Absolute Deviation (MAD)48
Skewness0.49119696
Sum11203
Variance3942.0413
MonotonicityNot monotonic
2024-04-14T12:09:36.076247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 3
 
2.9%
49 3
 
2.9%
149 3
 
2.9%
89 2
 
2.0%
140 2
 
2.0%
92 2
 
2.0%
208 2
 
2.0%
29 2
 
2.0%
35 2
 
2.0%
211 2
 
2.0%
Other values (72) 79
77.5%
ValueCountFrequency (%)
2 3
2.9%
17 1
 
1.0%
24 1
 
1.0%
27 1
 
1.0%
28 1
 
1.0%
29 2
2.0%
31 1
 
1.0%
35 2
2.0%
36 1
 
1.0%
40 1
 
1.0%
ValueCountFrequency (%)
311 1
1.0%
246 1
1.0%
236 1
1.0%
226 1
1.0%
220 1
1.0%
211 2
2.0%
208 2
2.0%
197 1
1.0%
196 1
1.0%
192 1
1.0%
Distinct96
Distinct (%)95.0%
Missing1
Missing (%)1.0%
Memory size948.0 B
Minimum2003-06-30 00:00:00
Maximum2023-09-05 00:00:00
2024-04-14T12:09:36.175945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:36.278207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

건축심의
Date

MISSING 

Distinct41
Distinct (%)89.1%
Missing56
Missing (%)54.9%
Memory size948.0 B
Minimum2003-09-26 00:00:00
Maximum2023-10-19 00:00:00
2024-04-14T12:09:36.379882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:36.474863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
Distinct19
Distinct (%)100.0%
Missing83
Missing (%)81.4%
Memory size948.0 B
Minimum2003-11-29 00:00:00
Maximum2023-07-03 00:00:00
2024-04-14T12:09:36.562760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:36.642005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
Distinct10
Distinct (%)100.0%
Missing92
Missing (%)90.2%
Memory size948.0 B
Minimum2016-03-21 00:00:00
Maximum2023-03-29 00:00:00
2024-04-14T12:09:36.728844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:36.809115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)

착공(시공중)
Date

MISSING 

Distinct7
Distinct (%)100.0%
Missing95
Missing (%)93.1%
Memory size948.0 B
Minimum2021-06-01 00:00:00
Maximum2022-11-15 00:00:00
2024-04-14T12:09:36.882131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:36.960315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)

준공
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing102
Missing (%)100.0%
Memory size1.0 KiB

비고
Text

MISSING 

Distinct46
Distinct (%)74.2%
Missing40
Missing (%)39.2%
Memory size948.0 B
2024-04-14T12:09:37.151152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length83
Median length63.5
Mean length16.854839
Min length3

Characters and Unicode

Total characters1045
Distinct characters127
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)56.5%

Sample

1st row태영건설
2nd row성호건설
3rd row우탑건설
4th row극동건설
5th row관리처분계획변경인가2020-03-23 SM우방
ValueCountFrequency (%)
도시개발과(노주희 5
 
4.6%
032-509-6906 5
 
4.6%
검토구역 4
 
3.7%
관리지역 4
 
3.7%
소규모주택정비 4
 
3.7%
건축심의 4
 
3.7%
도시개발과(최미영)032-509-6922 4
 
3.7%
도시개발과(김영진)032-509-6924 3
 
2.8%
호반건설 3
 
2.8%
도시개발과(안혜인)032-509-6934 3
 
2.8%
Other values (55) 69
63.9%
2024-04-14T12:09:37.487069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
199
 
19.0%
0 60
 
5.7%
- 48
 
4.6%
2 46
 
4.4%
9 44
 
4.2%
33
 
3.2%
3 31
 
3.0%
6 29
 
2.8%
28
 
2.7%
26
 
2.5%
Other values (117) 501
47.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 485
46.4%
Decimal Number 246
23.5%
Space Separator 199
19.0%
Dash Punctuation 48
 
4.6%
Open Punctuation 22
 
2.1%
Close Punctuation 22
 
2.1%
Other Symbol 8
 
0.8%
Uppercase Letter 8
 
0.8%
Other Punctuation 7
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
6.8%
28
 
5.8%
26
 
5.4%
25
 
5.2%
24
 
4.9%
24
 
4.9%
24
 
4.9%
11
 
2.3%
10
 
2.1%
9
 
1.9%
Other values (95) 271
55.9%
Decimal Number
ValueCountFrequency (%)
0 60
24.4%
2 46
18.7%
9 44
17.9%
3 31
12.6%
6 29
11.8%
5 24
 
9.8%
4 9
 
3.7%
8 2
 
0.8%
1 1
 
0.4%
Uppercase Letter
ValueCountFrequency (%)
S 2
25.0%
L 2
25.0%
D 1
12.5%
H 1
12.5%
M 1
12.5%
K 1
12.5%
Other Punctuation
ValueCountFrequency (%)
/ 4
57.1%
. 3
42.9%
Space Separator
ValueCountFrequency (%)
199
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 48
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 544
52.1%
Hangul 493
47.2%
Latin 8
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
6.7%
28
 
5.7%
26
 
5.3%
25
 
5.1%
24
 
4.9%
24
 
4.9%
24
 
4.9%
11
 
2.2%
10
 
2.0%
9
 
1.8%
Other values (96) 279
56.6%
Common
ValueCountFrequency (%)
199
36.6%
0 60
 
11.0%
- 48
 
8.8%
2 46
 
8.5%
9 44
 
8.1%
3 31
 
5.7%
6 29
 
5.3%
5 24
 
4.4%
( 22
 
4.0%
) 22
 
4.0%
Other values (5) 19
 
3.5%
Latin
ValueCountFrequency (%)
S 2
25.0%
L 2
25.0%
D 1
12.5%
H 1
12.5%
M 1
12.5%
K 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 552
52.8%
Hangul 485
46.4%
None 8
 
0.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
199
36.1%
0 60
 
10.9%
- 48
 
8.7%
2 46
 
8.3%
9 44
 
8.0%
3 31
 
5.6%
6 29
 
5.3%
5 24
 
4.3%
( 22
 
4.0%
) 22
 
4.0%
Other values (11) 27
 
4.9%
Hangul
ValueCountFrequency (%)
33
 
6.8%
28
 
5.8%
26
 
5.4%
25
 
5.2%
24
 
4.9%
24
 
4.9%
24
 
4.9%
11
 
2.3%
10
 
2.1%
9
 
1.9%
Other values (95) 271
55.9%
None
ValueCountFrequency (%)
8
100.0%

Interactions

2024-04-14T12:09:32.370545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:31.594467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:31.854626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:32.101945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:32.435529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:31.658442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:31.918242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:32.167586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:32.506789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:31.721280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:31.974775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:32.231954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:32.576615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:31.788145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:32.040197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-14T12:09:32.299254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-14T12:09:37.577658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구청사업유형대표자성명대표자직위면적추진단계조합원수토지등 소유자수주민합의체(조합설립)건축심의사업시행계획인가관리처분계획인가착공(시공중)비고
순번1.0000.8880.5380.1480.2800.0000.5200.0000.0000.8730.9341.0001.0001.0000.928
구청0.8881.0000.3000.0000.0000.0000.0000.0280.0000.9590.8671.0001.0001.0000.830
사업유형0.5380.3001.0000.4160.9420.5990.7710.6550.6700.9890.9021.0001.0001.0000.965
대표자성명0.1480.0000.4161.0000.9240.0000.7810.0000.0000.9850.4821.0001.0001.0000.000
대표자직위0.2800.0000.9420.9241.0000.6060.7730.5460.5821.0001.0001.000NaNNaN1.000
면적0.0000.0000.5990.0000.6061.0000.4510.6110.6320.9390.8091.0001.0001.0000.000
추진단계0.5200.0000.7710.7810.7730.4511.0000.2880.2890.9570.9851.0001.000NaN0.946
조합원수0.0000.0280.6550.0000.5460.6110.2881.0000.9940.0000.0001.0001.0001.0000.000
토지등 소유자수0.0000.0000.6700.0000.5820.6320.2890.9941.0000.0000.0001.0001.0001.0000.000
주민합의체(조합설립)0.8730.9590.9890.9851.0000.9390.9570.0000.0001.0000.9821.0001.0001.0001.000
건축심의0.9340.8670.9020.4821.0000.8090.9850.0000.0000.9821.0001.0001.0001.0000.901
사업시행계획인가1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
관리처분계획인가1.0001.0001.0001.000NaN1.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
착공(시공중)1.0001.0001.0001.000NaN1.000NaN1.0001.0001.0001.0001.0001.0001.0001.000
비고0.9280.8300.9650.0001.0000.0000.9460.0000.0001.0000.9011.0001.0001.0001.000
2024-04-14T12:09:37.704847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
추진단계대표자성명사업유형대표자직위구청
추진단계1.0000.3780.6960.6980.000
대표자성명0.3781.0000.1650.5930.000
사업유형0.6960.1651.0000.7060.206
대표자직위0.6980.5930.7061.0000.000
구청0.0000.0000.2060.0001.000
2024-04-14T12:09:37.788758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번면적조합원수토지등 소유자수구청사업유형대표자성명대표자직위추진단계
순번1.000-0.0960.022-0.0070.7130.3680.0000.1660.289
면적-0.0961.0000.7920.8100.0000.4810.0000.4880.276
조합원수0.0220.7921.0000.9810.0000.3570.0000.2760.152
토지등 소유자수-0.0070.8100.9811.0000.0000.3690.0000.3010.152
구청0.7130.0000.0000.0001.0000.2060.0000.0000.000
사업유형0.3680.4810.3570.3690.2061.0000.1650.7060.696
대표자성명0.0000.0000.0000.0000.0000.1651.0000.5930.378
대표자직위0.1660.4880.2760.3010.0000.7060.5931.0000.698
추진단계0.2890.2760.1520.1520.0000.6960.3780.6981.000

Missing values

2024-04-14T12:09:32.684140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-14T12:09:32.868416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-14T12:09:32.989222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번구청사업유형구역명대표자성명대표자직위위치면적추진단계조합원수토지등 소유자수주민합의체(조합설립)건축심의사업시행계획인가관리처분계획인가착공(시공중)준공비고
01중구가로주택신흥삼익아파트 1단지 가로주택정비사업구역김00조합장인천광역시 중구 신흥동2가 54-5 1단지 일원7347.9조합설립인가2462462022-11-11<NA><NA><NA><NA><NA><NA>
12중구가로주택신흥삼익아파트 2단지 가로주택정비사업구역김00조합장인천광역시 중구 신흥동2가 54-7 2단지 일원6735.5조합설립인가1961962022-11-11<NA><NA><NA><NA><NA><NA>
23동구가로주택송현2동 72번지 일원심00조합장송현동 72-185번지 일원4670.81조합설립인가40492021-07-28<NA><NA><NA><NA><NA>태영건설
34동구가로주택송림동67-10번지 일원최00조합장송림동 67-10번지 일원4236.15조합설립인가841022022-07-01<NA><NA><NA><NA><NA><NA>
45미추홀구가로주택인천 숭의2 LH참여형김00조합장숭의동 177번지 일원3028.4착공22552017-11-302019-06-262020-11-162020-11-162021-06-01<NA>성호건설
56미추홀구가로주택인천 용현1 LH참여형민00조합장용현동 568-2 진달래@2958.3착공86902017-12-282020-03-272021-05-172021-05-172022-01-21<NA>우탑건설
67미추홀구소규모재건축로얄맨션가00조합장주안동882-16192.9착공1341492016-08-302019-03-122020-06-012020-06-012021-11-26<NA>극동건설
78미추홀구소규모재건축용현5동새한아파트조00조합장용현동627-805390.0착공531082012-09-192014-10-302015-06-032016-03-212022-11-07<NA>관리처분계획변경인가2020-03-23 SM우방
89미추홀구가로주택주안 상일연립이00조합장주안동 85-6925.0사업시행계획인가6172017-03-102018-07-262019-07-152019-07-15<NA><NA>오성종합건설
910미추홀구가로주택숭의동 289-1강00조합장숭의동 289-1번지 일원7936.8사업시행계획인가841052019-05-012020-12-152022-11-01<NA><NA><NA>㈜신일
순번구청사업유형구역명대표자성명대표자직위위치면적추진단계조합원수토지등 소유자수주민합의체(조합설립)건축심의사업시행계획인가관리처분계획인가착공(시공중)준공비고
9293서구가로주택은성빌라한00조합장서구 석남동 522번지 외 2필지991.8조합설립인가36362020-09-01<NA><NA><NA><NA><NA>조합설립인가 취소 요청 반려
9394서구가로주택석남동 489번지 일원노00조합장서구 석남동 489번지 외 5필지6614.1조합설립인가2262262021-05-18<NA><NA><NA><NA><NA><NA>
9495서구가로주택가좌동 197번지 일원박00조합장서구 가좌동 197번지 일원6580.6조합설립인가68682022-01-27<NA><NA><NA><NA><NA><NA>
9596서구가로주택금강빌라주변민00조합장서구 석남동 515-8번지 외6필지3020.3조합설립인가29292022-03-11<NA><NA><NA><NA><NA><NA>
9697서구가로주택태산아파트이00조합장서구 석남동 499번지3768.4조합설립인가1431432022-07-28<NA><NA><NA><NA><NA><NA>
9798서구가로주택중앙1차아파트권00조합장서구 석남동 503번지1482.4조합설립인가35352022-08-26<NA><NA><NA><NA><NA><NA>
9899서구가로주택인천 석남동 473번지일대박00조합장서구 석남동 473번지 일원5318.6조합설립인가1901902022-08-29<NA><NA><NA><NA><NA><NA>
99100서구가로주택창대빌라 일대박00조합장서구 석남동 201-1번지 일원4451.0조합설립인가1031032022-09-26<NA><NA><NA><NA><NA><NA>
100101서구가로주택석남동 529번지 일원이00조합장석남동 529번지 일원9167.3조합설립인가1731732023-01-02<NA><NA><NA><NA><NA>2022.8.4. 이후 설립된 조합
101102서구소규모재건축가좌동 207번지 동남아파트이00조합장서구 가좌동 207번지6277.6조합설립인가1361592022-07-11<NA><NA><NA><NA><NA><NA>