Overview

Dataset statistics

Number of variables10
Number of observations29
Missing cells42
Missing cells (%)14.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory85.6 B

Variable types

Categorical4
Text2
Numeric1
DateTime3

Dataset

Description인천광역시에서 진행 중인 도시개발사업 현황에 관한 자료로 본 자료에서는 사업별 현재 진행단계, 구역명, 위치, 면적, 시행자, 사업방식 등의 자료를 제공하고 있습니다.
Author인천광역시
URLhttps://www.data.go.kr/data/15048913/fileData.do

Alerts

비고 is highly overall correlated with 단계 and 2 other fieldsHigh correlation
단계 is highly overall correlated with 비고High correlation
시 행 자(제안자) is highly overall correlated with 사업방식 and 1 other fieldsHigh correlation
사업방식 is highly overall correlated with 시 행 자(제안자) and 1 other fieldsHigh correlation
구역지정일 has 11 (37.9%) missing valuesMissing
실시계획인가일 has 15 (51.7%) missing valuesMissing
준공예정일 has 16 (55.2%) missing valuesMissing
구 역 명 has unique valuesUnique
면적(제곱미터) has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:51:31.463250
Analysis finished2023-12-12 17:51:32.689536
Duration1.23 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

단계
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)13.8%
Missing0
Missing (%)0.0%
Memory size364.0 B
사업시행단계
14 
구역지정단계
11 
실시계획
개발계획
 
1

Length

Max length6
Median length6
Mean length5.7241379
Min length4

Unique

Unique1 ?
Unique (%)3.4%

Sample

1st row사업시행단계
2nd row사업시행단계
3rd row사업시행단계
4th row사업시행단계
5th row사업시행단계

Common Values

ValueCountFrequency (%)
사업시행단계 14
48.3%
구역지정단계 11
37.9%
실시계획 3
 
10.3%
개발계획 1
 
3.4%

Length

2023-12-13T02:51:32.794331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:51:32.965832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업시행단계 14
48.3%
구역지정단계 11
37.9%
실시계획 3
 
10.3%
개발계획 1
 
3.4%

구 역 명
Text

UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-12-13T02:51:33.232329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length11
Mean length6.6206897
Min length4

Characters and Unicode

Total characters192
Distinct characters69
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)100.0%

Sample

1st row귤현구역
2nd row계산 종합의료단지
3rd row방축구역
4th row효성구역
5th row용현·학익 1블록
ValueCountFrequency (%)
용현·학익 4
 
10.5%
2-2블록 2
 
5.3%
왕길3구역 1
 
2.6%
왕길구역 1
 
2.6%
마전2구역 1
 
2.6%
1-4블록 1
 
2.6%
검단1구역 1
 
2.6%
검단5구역 1
 
2.6%
왕길1구역 1
 
2.6%
두밀)대곡3-1구역 1
 
2.6%
Other values (24) 24
63.2%
2023-12-13T02:51:33.680919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
11.5%
21
 
10.9%
9
 
4.7%
1 8
 
4.2%
2 8
 
4.2%
3 6
 
3.1%
6
 
3.1%
- 5
 
2.6%
5
 
2.6%
5
 
2.6%
Other values (59) 97
50.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 143
74.5%
Decimal Number 24
 
12.5%
Space Separator 9
 
4.7%
Dash Punctuation 5
 
2.6%
Other Punctuation 5
 
2.6%
Open Punctuation 3
 
1.6%
Close Punctuation 3
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
15.4%
21
 
14.7%
6
 
4.2%
5
 
3.5%
5
 
3.5%
4
 
2.8%
4
 
2.8%
4
 
2.8%
4
 
2.8%
4
 
2.8%
Other values (49) 64
44.8%
Decimal Number
ValueCountFrequency (%)
1 8
33.3%
2 8
33.3%
3 6
25.0%
4 1
 
4.2%
5 1
 
4.2%
Space Separator
ValueCountFrequency (%)
9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Other Punctuation
ValueCountFrequency (%)
· 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 143
74.5%
Common 49
 
25.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
15.4%
21
 
14.7%
6
 
4.2%
5
 
3.5%
5
 
3.5%
4
 
2.8%
4
 
2.8%
4
 
2.8%
4
 
2.8%
4
 
2.8%
Other values (49) 64
44.8%
Common
ValueCountFrequency (%)
9
18.4%
1 8
16.3%
2 8
16.3%
3 6
12.2%
- 5
10.2%
· 5
10.2%
( 3
 
6.1%
) 3
 
6.1%
4 1
 
2.0%
5 1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 143
74.5%
ASCII 44
 
22.9%
None 5
 
2.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
 
15.4%
21
 
14.7%
6
 
4.2%
5
 
3.5%
5
 
3.5%
4
 
2.8%
4
 
2.8%
4
 
2.8%
4
 
2.8%
4
 
2.8%
Other values (49) 64
44.8%
ASCII
ValueCountFrequency (%)
9
20.5%
1 8
18.2%
2 8
18.2%
3 6
13.6%
- 5
11.4%
( 3
 
6.8%
) 3
 
6.8%
4 1
 
2.3%
5 1
 
2.3%
None
ValueCountFrequency (%)
· 5
100.0%
Distinct28
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-12-13T02:51:33.993428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length19
Mean length16.689655
Min length13

Characters and Unicode

Total characters484
Distinct characters56
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)93.1%

Sample

1st row계양구 귤현동 306-1번지 일원
2nd row계양구 계산동 산 52-11번지 일원
3rd row계양구 방축동 27-1번지 일원
4th row계양구 효성동 100번지 일원
5th row미추홀구 학익동 587-1번지 일원
ValueCountFrequency (%)
일원 29
24.8%
서구 17
 
14.5%
미추홀구 5
 
4.3%
계양구 4
 
3.4%
대곡동 4
 
3.4%
오류동 3
 
2.6%
왕길동 3
 
2.6%
연수구 3
 
2.6%
용현동 2
 
1.7%
백석동 2
 
1.7%
Other values (42) 45
38.5%
2023-12-13T02:51:34.526195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
89
18.4%
31
 
6.4%
29
 
6.0%
29
 
6.0%
29
 
6.0%
1 26
 
5.4%
25
 
5.2%
25
 
5.2%
18
 
3.7%
- 18
 
3.7%
Other values (46) 165
34.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 273
56.4%
Decimal Number 104
 
21.5%
Space Separator 89
 
18.4%
Dash Punctuation 18
 
3.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
11.4%
29
10.6%
29
10.6%
29
10.6%
25
 
9.2%
25
 
9.2%
18
 
6.6%
5
 
1.8%
5
 
1.8%
5
 
1.8%
Other values (34) 72
26.4%
Decimal Number
ValueCountFrequency (%)
1 26
25.0%
2 14
13.5%
5 11
10.6%
0 11
10.6%
6 10
 
9.6%
4 10
 
9.6%
3 9
 
8.7%
7 8
 
7.7%
9 3
 
2.9%
8 2
 
1.9%
Space Separator
ValueCountFrequency (%)
89
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 273
56.4%
Common 211
43.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
11.4%
29
10.6%
29
10.6%
29
10.6%
25
 
9.2%
25
 
9.2%
18
 
6.6%
5
 
1.8%
5
 
1.8%
5
 
1.8%
Other values (34) 72
26.4%
Common
ValueCountFrequency (%)
89
42.2%
1 26
 
12.3%
- 18
 
8.5%
2 14
 
6.6%
5 11
 
5.2%
0 11
 
5.2%
6 10
 
4.7%
4 10
 
4.7%
3 9
 
4.3%
7 8
 
3.8%
Other values (2) 5
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 273
56.4%
ASCII 211
43.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
89
42.2%
1 26
 
12.3%
- 18
 
8.5%
2 14
 
6.6%
5 11
 
5.2%
0 11
 
5.2%
6 10
 
4.7%
4 10
 
4.7%
3 9
 
4.3%
7 8
 
3.8%
Other values (2) 5
 
2.4%
Hangul
ValueCountFrequency (%)
31
11.4%
29
10.6%
29
10.6%
29
10.6%
25
 
9.2%
25
 
9.2%
18
 
6.6%
5
 
1.8%
5
 
1.8%
5
 
1.8%
Other values (34) 72
26.4%

면적(제곱미터)
Real number (ℝ)

UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean433447.38
Minimum21926
Maximum1546747
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-13T02:51:34.700554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum21926
5-th percentile35310.2
Q1182176
median421148
Q3558315
95-th percentile950803.6
Maximum1546747
Range1524821
Interquartile range (IQR)376139

Descriptive statistics

Standard deviation339466.9
Coefficient of variation (CV)0.78317904
Kurtosis2.8153509
Mean433447.38
Median Absolute Deviation (MAD)216758
Skewness1.3071079
Sum12569974
Variance1.1523778 × 1011
MonotonicityNot monotonic
2023-12-13T02:51:34.889565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
182176 1
 
3.4%
21926 1
 
3.4%
980440 1
 
3.4%
458417 1
 
3.4%
708034 1
 
3.4%
579649 1
 
3.4%
816935 1
 
3.4%
558315 1
 
3.4%
502859 1
 
3.4%
506220 1
 
3.4%
Other values (19) 19
65.5%
ValueCountFrequency (%)
21926 1
3.4%
30253 1
3.4%
42896 1
3.4%
81250 1
3.4%
84144 1
3.4%
87612 1
3.4%
97932 1
3.4%
182176 1
3.4%
202930 1
3.4%
204390 1
3.4%
ValueCountFrequency (%)
1546747 1
3.4%
980440 1
3.4%
906349 1
3.4%
816935 1
3.4%
708034 1
3.4%
579649 1
3.4%
567567 1
3.4%
558315 1
3.4%
538600 1
3.4%
524510 1
3.4%

시 행 자(제안자)
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)37.9%
Missing0
Missing (%)0.0%
Memory size364.0 B
조합
19 
토지소유자
 
1
㈜제이케이
 
1
㈜DCRE
 
1
부영주택
 
1
Other values (6)

Length

Max length9
Median length2
Mean length3.2068966
Min length2

Unique

Unique10 ?
Unique (%)34.5%

Sample

1st row조합
2nd row토지소유자
3rd row조합
4th row㈜제이케이
5th row㈜DCRE

Common Values

ValueCountFrequency (%)
조합 19
65.5%
토지소유자 1
 
3.4%
㈜제이케이 1
 
3.4%
㈜DCRE 1
 
3.4%
부영주택 1
 
3.4%
삼성물산 1
 
3.4%
서구청장 1
 
3.4%
인천광역시, LH 1
 
3.4%
신검단개발사업㈜ 1
 
3.4%
아이월드(주) 1
 
3.4%

Length

2023-12-13T02:51:35.069469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
조합 19
63.3%
토지소유자 1
 
3.3%
㈜제이케이 1
 
3.3%
㈜dcre 1
 
3.3%
부영주택 1
 
3.3%
삼성물산 1
 
3.3%
서구청장 1
 
3.3%
인천광역시 1
 
3.3%
lh 1
 
3.3%
신검단개발사업㈜ 1
 
3.3%
Other values (2) 2
 
6.7%

사업방식
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)10.3%
Missing0
Missing (%)0.0%
Memory size364.0 B
환지
21 
수용
혼용
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)3.4%

Sample

1st row환지
2nd row수용
3rd row환지
4th row수용
5th row수용

Common Values

ValueCountFrequency (%)
환지 21
72.4%
수용 7
 
24.1%
혼용 1
 
3.4%

Length

2023-12-13T02:51:35.262305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:51:35.385931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
환지 21
72.4%
수용 7
 
24.1%
혼용 1
 
3.4%

구역지정일
Date

MISSING 

Distinct17
Distinct (%)94.4%
Missing11
Missing (%)37.9%
Memory size364.0 B
Minimum2006-05-29 00:00:00
Maximum2023-08-21 00:00:00
2023-12-13T02:51:35.534782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:51:35.705882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)

실시계획인가일
Date

MISSING 

Distinct13
Distinct (%)92.9%
Missing15
Missing (%)51.7%
Memory size364.0 B
Minimum2008-11-24 00:00:00
Maximum2023-08-28 00:00:00
2023-12-13T02:51:35.883889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:51:36.054896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)

준공예정일
Date

MISSING 

Distinct8
Distinct (%)61.5%
Missing16
Missing (%)55.2%
Memory size364.0 B
Minimum2023-03-31 00:00:00
Maximum2025-12-31 00:00:00
2023-12-13T02:51:36.187535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:51:36.315557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)

비고
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)20.7%
Missing0
Missing (%)0.0%
Memory size364.0 B
공사 중
11 
관계기관 협의 중
11 
실시계획 수립 중
착공 준비 중
개발계획 변경 수립 중
 
1

Length

Max length12
Median length9
Mean length7.0689655
Min length4

Unique

Unique2 ?
Unique (%)6.9%

Sample

1st row공사 중
2nd row공사 중
3rd row공사 중
4th row착공 준비 중
5th row공사 중

Common Values

ValueCountFrequency (%)
공사 중 11
37.9%
관계기관 협의 중 11
37.9%
실시계획 수립 중 3
 
10.3%
착공 준비 중 2
 
6.9%
개발계획 변경 수립 중 1
 
3.4%
개발계획 수립 중 1
 
3.4%

Length

2023-12-13T02:51:36.470197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:51:36.644781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
29
37.7%
공사 11
 
14.3%
관계기관 11
 
14.3%
협의 11
 
14.3%
수립 5
 
6.5%
실시계획 3
 
3.9%
착공 2
 
2.6%
준비 2
 
2.6%
개발계획 2
 
2.6%
변경 1
 
1.3%

Interactions

2023-12-13T02:51:32.002824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:51:36.771490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단계구 역 명위 치면적(제곱미터)시 행 자(제안자)사업방식구역지정일실시계획인가일준공예정일비고
단계1.0001.0000.3030.5140.0000.2651.000NaNNaN1.000
구 역 명1.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
위 치0.3031.0001.0000.9650.8440.7670.9771.0001.0000.870
면적(제곱미터)0.5141.0000.9651.0000.4740.0001.0000.9710.0000.000
시 행 자(제안자)0.0001.0000.8440.4741.0001.0000.9291.0000.7800.824
사업방식0.2651.0000.7670.0001.0001.0000.8301.0000.0000.869
구역지정일1.0001.0000.9771.0000.9290.8301.0001.0001.0001.000
실시계획인가일NaN1.0001.0000.9711.0001.0001.0001.0000.9041.000
준공예정일NaN1.0001.0000.0000.7800.0001.0000.9041.0001.000
비고1.0001.0000.8700.0000.8240.8691.0001.0001.0001.000
2023-12-13T02:51:36.957273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업방식시 행 자(제안자)비고단계
사업방식1.0000.8320.5340.241
시 행 자(제안자)0.8321.0000.5310.000
비고0.5340.5311.0000.959
단계0.2410.0000.9591.000
2023-12-13T02:51:37.072389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적(제곱미터)단계시 행 자(제안자)사업방식비고
면적(제곱미터)1.0000.1640.1510.0000.000
단계0.1641.0000.0000.2410.959
시 행 자(제안자)0.1510.0001.0000.8320.531
사업방식0.0000.2410.8321.0000.534
비고0.0000.9590.5310.5341.000

Missing values

2023-12-13T02:51:32.217245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:51:32.426958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T02:51:32.610191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

단계구 역 명위 치면적(제곱미터)시 행 자(제안자)사업방식구역지정일실시계획인가일준공예정일비고
0사업시행단계귤현구역계양구 귤현동 306-1번지 일원182176조합환지2007-08-272008-11-242023-09-30공사 중
1사업시행단계계산 종합의료단지계양구 계산동 산 52-11번지 일원21926토지소유자수용2016-02-012017-01-162023-12-31공사 중
2사업시행단계방축구역계양구 방축동 27-1번지 일원84144조합환지2013-09-162016-03-212023-12-31공사 중
3사업시행단계효성구역계양구 효성동 100번지 일원434922㈜제이케이수용2014-02-242020-05-252025-12-31착공 준비 중
4사업시행단계용현·학익 1블록미추홀구 학익동 587-1번지 일원1546747㈜DCRE수용2009-06-152013-05-132024-12-31공사 중
5사업시행단계송도 대우자판연수구 동춘동 907번지 일원538600부영주택수용2008-12-152010-02-162023-03-31개발계획 변경 수립 중
6사업시행단계송도역세권연수구 옥련동 104번지 일원291725삼성물산환지2008-12-012014-11-242025-06-30공사 중
7사업시행단계문학구역미추홀구 문학동 141-1번지 일원81250조합환지2006-05-292011-12-262023-10-31공사 중
8사업시행단계동춘1구역연수구 동춘동 752-4번지 일원407913조합환지2006-11-272008-11-242023-12-31공사 중
9사업시행단계경서3구역서구 경서동 124-66번지 일원368095서구청장환지2008-11-172010-08-232024-12-31공사 중
단계구 역 명위 치면적(제곱미터)시 행 자(제안자)사업방식구역지정일실시계획인가일준공예정일비고
19구역지정단계검단1구역서구 금곡동 235번지 일원416020조합환지<NA><NA><NA>관계기관 협의 중
20구역지정단계검단5구역서구 오류동 1번지 일원421148조합환지<NA><NA><NA>관계기관 협의 중
21구역지정단계왕길1구역서구 왕길동 64-46번지 일원506220조합환지<NA><NA><NA>관계기관 협의 중
22구역지정단계왕길3구역서구 오류동 558-4번지 일원502859조합환지<NA><NA><NA>관계기관 협의 중
23구역지정단계(두밀)대곡3-1구역서구 대곡동 214-1 일원558315조합환지<NA><NA><NA>관계기관 협의 중
24구역지정단계(가현)대곡3-2구역서구 대곡동 39-1 일원816935조합환지<NA><NA><NA>관계기관 협의 중
25구역지정단계(태정)대곡2구역서구 대곡동 519-1 일원579649조합환지<NA><NA><NA>관계기관 협의 중
26구역지정단계불로1구역서구 블로동 122-3번지 일원708034조합환지<NA><NA><NA>관계기관 협의 중
27구역지정단계불로3구역서구 대곡동 637번지 일원458417조합환지<NA><NA><NA>관계기관 협의 중
28구역지정단계사월마을 일원서구 백석동 212-2 일원980440조합환지<NA><NA><NA>관계기관 협의 중