Overview

Dataset statistics

Number of variables5
Number of observations26
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory46.1 B

Variable types

Categorical3
Text1
Numeric1

Dataset

Description인천광역시 남동구 개발제한구역 현황에 대한 데이터로 구분, 지정(해제)일자, 구역현황, 면적, 데이터기준일자 항목을 제공합니다.
Author인천광역시 남동구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=3080237&srcSe=7661IVAWM27C61E190

Alerts

기준일자 has constant value ""Constant
면적(제곱미터) is highly overall correlated with 지정(해제)일자High correlation
구분 is highly overall correlated with 지정(해제)일자High correlation
지정(해제)일자 is highly overall correlated with 면적(제곱미터) and 1 other fieldsHigh correlation
구분 is highly imbalanced (76.5%)Imbalance
면적(제곱미터) has unique valuesUnique

Reproduction

Analysis started2024-01-28 15:55:44.370981
Analysis finished2024-01-28 15:55:44.754629
Duration0.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size340.0 B
해제
25 
지정
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)3.8%

Sample

1st row지정
2nd row해제
3rd row해제
4th row해제
5th row해제

Common Values

ValueCountFrequency (%)
해제 25
96.2%
지정 1
 
3.8%

Length

2024-01-29T00:55:44.800582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-29T00:55:44.876528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해제 25
96.2%
지정 1
 
3.8%

지정(해제)일자
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)46.2%
Missing0
Missing (%)0.0%
Memory size340.0 B
2006-06-05
15 
1972-08-25
 
1
2006-11-01
 
1
2010-10-18
 
1
2010-11-15
 
1
Other values (7)

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique11 ?
Unique (%)42.3%

Sample

1st row1972-08-25
2nd row2006-06-05
3rd row2006-06-05
4th row2006-06-05
5th row2006-06-05

Common Values

ValueCountFrequency (%)
2006-06-05 15
57.7%
1972-08-25 1
 
3.8%
2006-11-01 1
 
3.8%
2010-10-18 1
 
3.8%
2010-11-15 1
 
3.8%
2013-05-20 1
 
3.8%
2015-02-02 1
 
3.8%
2015-03-02 1
 
3.8%
2017-05-08 1
 
3.8%
2017-12-29 1
 
3.8%
Other values (2) 2
 
7.7%

Length

2024-01-29T00:55:44.955428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2006-06-05 15
57.7%
1972-08-25 1
 
3.8%
2006-11-01 1
 
3.8%
2010-10-18 1
 
3.8%
2010-11-15 1
 
3.8%
2013-05-20 1
 
3.8%
2015-02-02 1
 
3.8%
2015-03-02 1
 
3.8%
2017-05-08 1
 
3.8%
2017-12-29 1
 
3.8%
Other values (2) 2
 
7.7%
Distinct25
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size340.0 B
2024-01-29T00:55:45.126532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length43
Mean length29.846154
Min length11

Characters and Unicode

Total characters776
Distinct characters99
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)92.3%

Sample

1st row건설부고시 제385호 개발제한구역 지정된 남동구 일대
2nd row집단취락지구 해제(인천광역시 남동구 구월동 760번지 일원)
3rd row집단취락지구 해제(인천광역시 남동구 구월동 589번지 일원)
4th row집단취락지구 해제(인천광역시 남동구 구월동 653 수산동 50번지 일원)
5th row집단취락지구 해제(인천광역시 남동구 수산동 150번지 일원)
ValueCountFrequency (%)
남동구 19
 
13.2%
일원 17
 
11.8%
해제(인천광역시 16
 
11.1%
집단취락지구 15
 
10.4%
해제 6
 
4.2%
수산동 5
 
3.5%
개발제한구역 3
 
2.1%
도림동 3
 
2.1%
구월동 3
 
2.1%
일부해제(소규모단절토지 2
 
1.4%
Other values (48) 55
38.2%
2024-01-29T00:55:45.418428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
118
 
15.2%
44
 
5.7%
43
 
5.5%
40
 
5.2%
29
 
3.7%
25
 
3.2%
22
 
2.8%
22
 
2.8%
21
 
2.7%
21
 
2.7%
Other values (89) 391
50.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 557
71.8%
Space Separator 118
 
15.2%
Decimal Number 60
 
7.7%
Close Punctuation 19
 
2.4%
Open Punctuation 19
 
2.4%
Dash Punctuation 2
 
0.3%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
7.9%
43
 
7.7%
40
 
7.2%
29
 
5.2%
25
 
4.5%
22
 
3.9%
22
 
3.9%
21
 
3.8%
21
 
3.8%
20
 
3.6%
Other values (74) 270
48.5%
Decimal Number
ValueCountFrequency (%)
3 12
20.0%
5 10
16.7%
6 9
15.0%
2 8
13.3%
0 6
10.0%
8 5
8.3%
1 4
 
6.7%
4 3
 
5.0%
9 2
 
3.3%
7 1
 
1.7%
Space Separator
ValueCountFrequency (%)
118
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 557
71.8%
Common 219
 
28.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
7.9%
43
 
7.7%
40
 
7.2%
29
 
5.2%
25
 
4.5%
22
 
3.9%
22
 
3.9%
21
 
3.8%
21
 
3.8%
20
 
3.6%
Other values (74) 270
48.5%
Common
ValueCountFrequency (%)
118
53.9%
) 19
 
8.7%
( 19
 
8.7%
3 12
 
5.5%
5 10
 
4.6%
6 9
 
4.1%
2 8
 
3.7%
0 6
 
2.7%
8 5
 
2.3%
1 4
 
1.8%
Other values (5) 9
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 557
71.8%
ASCII 218
 
28.1%
CJK Compat 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
118
54.1%
) 19
 
8.7%
( 19
 
8.7%
3 12
 
5.5%
5 10
 
4.6%
6 9
 
4.1%
2 8
 
3.7%
0 6
 
2.8%
8 5
 
2.3%
1 4
 
1.8%
Other values (4) 8
 
3.7%
Hangul
ValueCountFrequency (%)
44
 
7.9%
43
 
7.7%
40
 
7.2%
29
 
5.2%
25
 
4.5%
22
 
3.9%
22
 
3.9%
21
 
3.8%
21
 
3.8%
20
 
3.6%
Other values (74) 270
48.5%
CJK Compat
ValueCountFrequency (%)
1
100.0%

면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1235790.9
Minimum0.012979
Maximum27950000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2024-01-29T00:55:45.521214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.012979
5-th percentile3258.75
Q122508
median39942
Q389294.25
95-th percentile1758547.8
Maximum27950000
Range27950000
Interquartile range (IQR)66786.25

Descriptive statistics

Standard deviation5464817.5
Coefficient of variation (CV)4.4221215
Kurtosis25.654417
Mean1235790.9
Median Absolute Deviation (MAD)30339
Skewness5.0520964
Sum32130564
Variance2.986423 × 1013
MonotonicityNot monotonic
2024-01-29T00:55:45.608047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
27950000.0 1
 
3.8%
23522.0 1
 
3.8%
0.012979 1
 
3.8%
10478.0 1
 
3.8%
233141.0 1
 
3.8%
4611.0 1
 
3.8%
2808.0 1
 
3.8%
173188.0 1
 
3.8%
19184.5 1
 
3.8%
736741.0 1
 
3.8%
Other values (16) 16
61.5%
ValueCountFrequency (%)
0.012979 1
3.8%
2808.0 1
3.8%
4611.0 1
3.8%
10478.0 1
3.8%
19184.5 1
3.8%
20724.0 1
3.8%
22170.0 1
3.8%
23522.0 1
3.8%
27903.0 1
3.8%
30671.0 1
3.8%
ValueCountFrequency (%)
27950000.0 1
3.8%
2099150.0 1
3.8%
736741.0 1
3.8%
233141.0 1
3.8%
182184.0 1
3.8%
173188.0 1
3.8%
90357.0 1
3.8%
86106.0 1
3.8%
74222.0 1
3.8%
71156.0 1
3.8%

기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size340.0 B
2022-12-05
26 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-12-05
2nd row2022-12-05
3rd row2022-12-05
4th row2022-12-05
5th row2022-12-05

Common Values

ValueCountFrequency (%)
2022-12-05 26
100.0%

Length

2024-01-29T00:55:45.701720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-29T00:55:45.767196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-12-05 26
100.0%

Interactions

2024-01-29T00:55:44.541680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-29T00:55:45.809024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분지정(해제)일자구역현황면적(제곱미터)
구분1.0001.0001.0000.646
지정(해제)일자1.0001.0000.0001.000
구역현황1.0000.0001.0001.000
면적(제곱미터)0.6461.0001.0001.000
2024-01-29T00:55:45.881186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지정(해제)일자구분
지정(해제)일자1.0000.764
구분0.7641.000
2024-01-29T00:55:45.941749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적(제곱미터)구분지정(해제)일자
면적(제곱미터)1.0000.4450.764
구분0.4451.0000.764
지정(해제)일자0.7640.7641.000

Missing values

2024-01-29T00:55:44.641949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-29T00:55:44.724475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분지정(해제)일자구역현황면적(제곱미터)기준일자
0지정1972-08-25건설부고시 제385호 개발제한구역 지정된 남동구 일대27950000.02022-12-05
1해제2006-06-05집단취락지구 해제(인천광역시 남동구 구월동 760번지 일원)22170.02022-12-05
2해제2006-06-05집단취락지구 해제(인천광역시 남동구 구월동 589번지 일원)30671.02022-12-05
3해제2006-06-05집단취락지구 해제(인천광역시 남동구 구월동 653 수산동 50번지 일원)74222.02022-12-05
4해제2006-06-05집단취락지구 해제(인천광역시 남동구 수산동 150번지 일원)86106.02022-12-05
5해제2006-06-05집단취락지구 해제(인천광역시 남동구 수산동 365번지 일원)33313.02022-12-05
6해제2006-06-05집단취락지구 해제(인천광역시 남동구 수산동 538번지 일원)62628.02022-12-05
7해제2006-06-05집단취락지구 해제(인천광역시 남동구 도림동 220번지 일원)90357.02022-12-05
8해제2006-06-05집단취락지구 해제(인천광역시 남동구 도림동 348번지 일원)34328.02022-12-05
9해제2006-06-05집단취락지구 해제(인천광역시 남동구 논현동 50번지 일원)27903.02022-12-05
구분지정(해제)일자구역현황면적(제곱미터)기준일자
16해제2006-11-01논현2 서창2 택지지구 해제2099150.02022-12-05
17해제2010-10-182014남동경기장 부지 해제182184.02022-12-05
18해제2010-11-15구월보금자리지구 해제736741.02022-12-05
19해제2013-05-20개발제한구역 일부해제(소규모단절토지 경계선관통대지)19184.52022-12-05
20해제2015-02-02인천광역시 남동구 남촌동 농산물도매시장 이전 부지 해제173188.02022-12-05
21해제2015-03-02개발제한구역 일부해제(소규모단절토지 경계선관통대지)2808.02022-12-05
22해제2017-05-08소래어시장 현대화사업을 위한 해제4611.02022-12-05
23해제2017-12-29남동 도시첨단산업단지 해제233141.02022-12-05
24해제2020-12-28단절된 3만㎡ 미만의 토지를 해제(인천광역시 남동구 수산동 13-1번지 일원)10478.02022-12-05
25해제2021-12-13청소년복합문화센터 조성을 위한 일부해제(인천광역시 남동구 도림동 562-3번지 일원)0.0129792022-12-05