Overview

Dataset statistics

Number of variables10
Number of observations241
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory19.9 KiB
Average record size in memory84.5 B

Variable types

Numeric3
Categorical4
DateTime2
Text1

Dataset

Description인천광역시 중구에서 조사한 개발행위허가현황에 대한 데이터 입니다.파일명 인천광역시_중구_개발행위허가현황파일내용 해당년도, 허가일자, 위치, 용도지역 등
Author인천광역시 중구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15036869&srcSe=7661IVAWM27C61E190

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 해당년도High correlation
허가면적(미터제곱) is highly overall correlated with 협의면적(미터제곱)High correlation
협의면적(미터제곱) is highly overall correlated with 허가면적(미터제곱)High correlation
해당년도 is highly overall correlated with 연번High correlation
용도지역 is highly overall correlated with 지목명High correlation
지목명 is highly overall correlated with 용도지역High correlation
개발행위 목적 is highly imbalanced (54.3%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-28 10:44:59.327030
Analysis finished2024-01-28 10:45:00.521177
Duration1.19 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct241
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean121
Minimum1
Maximum241
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2024-01-28T19:45:00.582989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13
Q161
median121
Q3181
95-th percentile229
Maximum241
Range240
Interquartile range (IQR)120

Descriptive statistics

Standard deviation69.714896
Coefficient of variation (CV)0.57615616
Kurtosis-1.2
Mean121
Median Absolute Deviation (MAD)60
Skewness0
Sum29161
Variance4860.1667
MonotonicityStrictly increasing
2024-01-28T19:45:00.901292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
182 1
 
0.4%
154 1
 
0.4%
155 1
 
0.4%
156 1
 
0.4%
157 1
 
0.4%
158 1
 
0.4%
159 1
 
0.4%
160 1
 
0.4%
161 1
 
0.4%
Other values (231) 231
95.9%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
241 1
0.4%
240 1
0.4%
239 1
0.4%
238 1
0.4%
237 1
0.4%
236 1
0.4%
235 1
0.4%
234 1
0.4%
233 1
0.4%
232 1
0.4%

해당년도
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023
135 
2022
68 
2021
38 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2023 135
56.0%
2022 68
28.2%
2021 38
 
15.8%

Length

2024-01-28T19:45:00.993275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T19:45:01.061526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 135
56.0%
2022 68
28.2%
2021 38
 
15.8%
Distinct159
Distinct (%)66.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum2021-09-13 00:00:00
Maximum2023-12-06 00:00:00
2024-01-28T19:45:01.144916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T19:45:01.253456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct206
Distinct (%)85.5%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2024-01-28T19:45:01.447700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length12.609959
Min length4

Characters and Unicode

Total characters3039
Distinct characters43
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique187 ?
Unique (%)77.6%

Sample

1st row인천광역시 중구 항동7가112-10
2nd row인천광역시 중구 항동7가1-68
3rd row인천광역시 중구 항동7가112-10
4th row인천광역시 중구 항동7가1-31
5th row인천광역시 중구 북성동1가125-5
ValueCountFrequency (%)
인천광역시 106
22.6%
중구 106
22.6%
중산동 12
 
2.6%
을왕동896-13 8
 
1.7%
산135-25 7
 
1.5%
운북동35-3 4
 
0.9%
운남동 4
 
0.9%
을왕동896-22 3
 
0.6%
항동7가1-8 3
 
0.6%
운북동450-2 3
 
0.6%
Other values (197) 214
45.5%
2024-01-28T19:45:01.754635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
240
 
7.9%
229
 
7.5%
- 212
 
7.0%
1 204
 
6.7%
160
 
5.3%
5 142
 
4.7%
107
 
3.5%
106
 
3.5%
106
 
3.5%
106
 
3.5%
Other values (33) 1427
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1501
49.4%
Decimal Number 1061
34.9%
Space Separator 229
 
7.5%
Dash Punctuation 212
 
7.0%
Open Punctuation 12
 
0.4%
Close Punctuation 12
 
0.4%
Uppercase Letter 12
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
240
16.0%
160
10.7%
107
 
7.1%
106
 
7.1%
106
 
7.1%
106
 
7.1%
106
 
7.1%
106
 
7.1%
101
 
6.7%
80
 
5.3%
Other values (17) 283
18.9%
Decimal Number
ValueCountFrequency (%)
1 204
19.2%
5 142
13.4%
8 105
9.9%
3 105
9.9%
2 98
9.2%
7 90
8.5%
4 86
8.1%
6 85
8.0%
9 73
 
6.9%
0 73
 
6.9%
Uppercase Letter
ValueCountFrequency (%)
B 9
75.0%
A 3
 
25.0%
Space Separator
ValueCountFrequency (%)
229
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 212
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1526
50.2%
Hangul 1501
49.4%
Latin 12
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
240
16.0%
160
10.7%
107
 
7.1%
106
 
7.1%
106
 
7.1%
106
 
7.1%
106
 
7.1%
106
 
7.1%
101
 
6.7%
80
 
5.3%
Other values (17) 283
18.9%
Common
ValueCountFrequency (%)
229
15.0%
- 212
13.9%
1 204
13.4%
5 142
9.3%
8 105
6.9%
3 105
6.9%
2 98
6.4%
7 90
 
5.9%
4 86
 
5.6%
6 85
 
5.6%
Other values (4) 170
11.1%
Latin
ValueCountFrequency (%)
B 9
75.0%
A 3
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1538
50.6%
Hangul 1501
49.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
240
16.0%
160
10.7%
107
 
7.1%
106
 
7.1%
106
 
7.1%
106
 
7.1%
106
 
7.1%
106
 
7.1%
101
 
6.7%
80
 
5.3%
Other values (17) 283
18.9%
ASCII
ValueCountFrequency (%)
229
14.9%
- 212
13.8%
1 204
13.3%
5 142
9.2%
8 105
6.8%
3 105
6.8%
2 98
6.4%
7 90
 
5.9%
4 86
 
5.6%
6 85
 
5.5%
Other values (6) 182
11.8%

용도지역
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
자연녹지지역
155 
보전녹지
31 
생산녹지지역
30 
준공업지역
 
10
자연/보전녹지지역
 
6
Other values (4)
 
9

Length

Max length9
Median length6
Mean length5.7593361
Min length4

Unique

Unique2 ?
Unique (%)0.8%

Sample

1st row준공업지역
2nd row준공업지역
3rd row준공업지역
4th row준공업지역
5th row일반상업지역

Common Values

ValueCountFrequency (%)
자연녹지지역 155
64.3%
보전녹지 31
 
12.9%
생산녹지지역 30
 
12.4%
준공업지역 10
 
4.1%
자연/보전녹지지역 6
 
2.5%
일반상업지역 5
 
2.1%
일반공업지역 2
 
0.8%
자연녹지 1
 
0.4%
생산지역 1
 
0.4%

Length

2024-01-28T19:45:01.868136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T19:45:01.948394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자연녹지지역 155
64.3%
보전녹지 31
 
12.9%
생산녹지지역 30
 
12.4%
준공업지역 10
 
4.1%
자연/보전녹지지역 6
 
2.5%
일반상업지역 5
 
2.1%
일반공업지역 2
 
0.8%
자연녹지 1
 
0.4%
생산지역 1
 
0.4%

지목명
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
84 
60 
53 
15 
11 
Other values (7)
18 

Length

Max length3
Median length1
Mean length1.0207469
Min length1

Unique

Unique4 ?
Unique (%)1.7%

Sample

1st row
2nd row도로
3rd row
4th row
5th row공원

Common Values

ValueCountFrequency (%)
84
34.9%
60
24.9%
53
22.0%
15
 
6.2%
11
 
4.6%
8
 
3.3%
3
 
1.2%
3
 
1.2%
도로 1
 
0.4%
공원 1
 
0.4%
Other values (2) 2
 
0.8%

Length

2024-01-28T19:45:02.048459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
84
34.9%
60
24.9%
53
22.0%
15
 
6.2%
11
 
4.6%
8
 
3.3%
3
 
1.2%
3
 
1.2%
도로 1
 
0.4%
공원 1
 
0.4%
Other values (2) 2
 
0.8%

허가면적(미터제곱)
Real number (ℝ)

HIGH CORRELATION 

Distinct197
Distinct (%)81.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1317.0967
Minimum16
Maximum14052
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2024-01-28T19:45:02.140993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16
5-th percentile122
Q1374
median635
Q31322.3
95-th percentile4740
Maximum14052
Range14036
Interquartile range (IQR)948.3

Descriptive statistics

Standard deviation1870.075
Coefficient of variation (CV)1.4198464
Kurtosis13.6092
Mean1317.0967
Median Absolute Deviation (MAD)312
Skewness3.2976352
Sum317420.3
Variance3497180.5
MonotonicityNot monotonic
2024-01-28T19:45:02.239876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
659.0 9
 
3.7%
430.0 8
 
3.3%
400.0 5
 
2.1%
366.0 4
 
1.7%
329.0 3
 
1.2%
355.0 3
 
1.2%
337.0 3
 
1.2%
919.0 2
 
0.8%
381.0 2
 
0.8%
559.0 2
 
0.8%
Other values (187) 200
83.0%
ValueCountFrequency (%)
16.0 1
0.4%
18.0 1
0.4%
20.0 1
0.4%
27.0 2
0.8%
36.0 1
0.4%
37.6 1
0.4%
60.0 1
0.4%
63.0 1
0.4%
73.0 1
0.4%
77.1 1
0.4%
ValueCountFrequency (%)
14052.0 1
0.4%
10041.0 1
0.4%
8733.0 1
0.4%
8727.0 1
0.4%
8454.0 1
0.4%
8319.0 1
0.4%
7136.0 1
0.4%
6882.0 1
0.4%
6753.0 1
0.4%
4975.7 1
0.4%

협의면적(미터제곱)
Real number (ℝ)

HIGH CORRELATION 

Distinct196
Distinct (%)81.7%
Missing1
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean1306.305
Minimum16
Maximum14052
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2024-01-28T19:45:02.339727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16
5-th percentile120.6
Q1372
median634
Q31309.25
95-th percentile4746.2
Maximum14052
Range14036
Interquartile range (IQR)937.25

Descriptive statistics

Standard deviation1866.4481
Coefficient of variation (CV)1.4287997
Kurtosis13.873137
Mean1306.305
Median Absolute Deviation (MAD)311.5
Skewness3.3371082
Sum313513.2
Variance3483628.6
MonotonicityNot monotonic
2024-01-28T19:45:02.439307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
659.0 9
 
3.7%
430.0 8
 
3.3%
400.0 5
 
2.1%
366.0 4
 
1.7%
329.0 3
 
1.2%
355.0 3
 
1.2%
337.0 3
 
1.2%
27.0 2
 
0.8%
919.0 2
 
0.8%
559.0 2
 
0.8%
Other values (186) 199
82.6%
ValueCountFrequency (%)
16.0 1
0.4%
18.0 1
0.4%
20.0 1
0.4%
27.0 2
0.8%
36.0 1
0.4%
37.6 1
0.4%
60.0 1
0.4%
63.0 1
0.4%
73.0 1
0.4%
77.1 1
0.4%
ValueCountFrequency (%)
14052.0 1
0.4%
10041.0 1
0.4%
8733.0 1
0.4%
8727.0 1
0.4%
8454.0 1
0.4%
8319.0 1
0.4%
7136.0 1
0.4%
6882.0 1
0.4%
6753.0 1
0.4%
4975.7 1
0.4%

개발행위 목적
Categorical

IMBALANCE 

Distinct9
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
토지형질변경(신축)
178 
토지형질변경
20 
토지분할
19 
공작물설치
 
11
건축물의 건축
 
6
Other values (4)
 
7

Length

Max length20
Median length10
Mean length9.0041494
Min length4

Unique

Unique2 ?
Unique (%)0.8%

Sample

1st row건축물의 건축
2nd row건축물의 건축
3rd row건축물의 건축
4th row건축물의 건축
5th row건축물의 건축

Common Values

ValueCountFrequency (%)
토지형질변경(신축) 178
73.9%
토지형질변경 20
 
8.3%
토지분할 19
 
7.9%
공작물설치 11
 
4.6%
건축물의 건축 6
 
2.5%
토지분할(기반시설 공사완료 후 분할) 3
 
1.2%
공작물 설치 2
 
0.8%
토지분할(공유물분할) 1
 
0.4%
토지형질변경 및 공작물설치 1
 
0.4%

Length

2024-01-28T19:45:02.554779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T19:45:02.656731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
토지형질변경(신축 178
68.5%
토지형질변경 21
 
8.1%
토지분할 19
 
7.3%
공작물설치 12
 
4.6%
건축물의 6
 
2.3%
건축 6
 
2.3%
토지분할(기반시설 3
 
1.2%
공사완료 3
 
1.2%
3
 
1.2%
분할 3
 
1.2%
Other values (4) 6
 
2.3%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum2023-12-06 00:00:00
Maximum2023-12-06 00:00:00
2024-01-28T19:45:02.751274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T19:45:02.829395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-01-28T19:45:00.097513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T19:44:59.672551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T19:44:59.863786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T19:45:00.169407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T19:44:59.730625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T19:44:59.929527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T19:45:00.248470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T19:44:59.798791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T19:44:59.997473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T19:45:02.886781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번해당년도용도지역지목명허가면적(미터제곱)협의면적(미터제곱)개발행위 목적
연번1.0000.9130.6450.5120.0000.0000.475
해당년도0.9131.0000.5710.4680.0340.0390.587
용도지역0.6450.5711.0000.8180.0000.0000.707
지목명0.5120.4680.8181.0000.1280.1350.694
허가면적(미터제곱)0.0000.0340.0000.1281.0001.0000.273
협의면적(미터제곱)0.0000.0390.0000.1351.0001.0000.279
개발행위 목적0.4750.5870.7070.6940.2730.2791.000
2024-01-28T19:45:02.978213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지목명해당년도용도지역개발행위 목적
지목명1.0000.2350.5230.380
해당년도0.2351.0000.3000.311
용도지역0.5230.3001.0000.296
개발행위 목적0.3800.3110.2961.000
2024-01-28T19:45:03.054428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번허가면적(미터제곱)협의면적(미터제곱)해당년도용도지역지목명개발행위 목적
연번1.000-0.026-0.0180.8550.3620.2460.231
허가면적(미터제곱)-0.0261.0001.0000.0090.0000.0520.090
협의면적(미터제곱)-0.0181.0001.0000.0110.0000.0550.092
해당년도0.8550.0090.0111.0000.3000.2350.311
용도지역0.3620.0000.0000.3001.0000.5230.296
지목명0.2460.0520.0550.2350.5231.0000.380
개발행위 목적0.2310.0900.0920.3110.2960.3801.000

Missing values

2024-01-28T19:45:00.351404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T19:45:00.467234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번해당년도허가일자허가위치용도지역지목명허가면적(미터제곱)협의면적(미터제곱)개발행위 목적데이터기준일자
0120212021-09-27인천광역시 중구 항동7가112-10준공업지역27.027.0건축물의 건축2023-12-06
1220212021-10-07인천광역시 중구 항동7가1-68준공업지역도로36.036.0건축물의 건축2023-12-06
2320212021-10-08인천광역시 중구 항동7가112-10준공업지역27.027.0건축물의 건축2023-12-06
3420212021-10-25인천광역시 중구 항동7가1-31준공업지역20.020.0건축물의 건축2023-12-06
4520212021-10-27인천광역시 중구 북성동1가125-5일반상업지역공원318.24318.24건축물의 건축2023-12-06
5620212021-10-29인천광역시 중구 인현동1-336일반상업지역63.063.0공작물설치2023-12-06
6720212021-11-16인천광역시 중구 항동7가1-8준공업지역18.018.0건축물의 건축2023-12-06
7820212021-11-18인천광역시 중구 관동2가9일반상업지역122.0122.0공작물설치2023-12-06
8920212021-12-02인천광역시 중구 북성동1가6-85일반공업지역공장273.46273.46공작물설치2023-12-06
91020212021-12-10인천광역시 중구 신흥동3가47-1일반상업지역주유소1304.91304.9공작물설치2023-12-06
연번해당년도허가일자허가위치용도지역지목명허가면적(미터제곱)협의면적(미터제곱)개발행위 목적데이터기준일자
23123220232023-10-26무의동180-70자연녹지지역919.0919.0토지형질변경(신축)2023-12-06
23223320232023-10-30무의동1자연녹지지역2520.02520.0토지형질변경(신축)2023-12-06
23323420232023-10-30을왕동641-4자연녹지지역863.0863.0토지형질변경(신축)2023-12-06
23423520232023-10-30을왕동산54-14자연/보전녹지지역2046.02046.0토지형질변경(신축)2023-12-06
23523620232023-10-30무의동880자연녹지지역268.1268.1공작물설치2023-12-06
23623720232023-11-14남북동44-15자연녹지지역366.0366.0토지형질변경(신축)2023-12-06
23723820232023-11-15남북동44-15자연녹지지역602.0602.0토지형질변경(신축)2023-12-06
23823920232023-11-15남북동901-19자연녹지지역427.0427.0토지형질변경(신축)2023-12-06
23924020232023-12-05을왕동112자연녹지지역659.0659.0토지형질변경(신축)2023-12-06
24024120232023-12-06남북동695-5자연/보전녹지지역1012.01012.0토지형질변경(신축)2023-12-06