Overview

Dataset statistics

Number of variables6
Number of observations234
Missing cells1
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.6 KiB
Average record size in memory50.6 B

Variable types

Numeric2
Categorical2
Text1
DateTime1

Dataset

Description해당 데이터는 인천광역시 남동구의 개발행위 허가현황에 관련된 자료로서, 인천광역시 남동구 개발행위 허가현황의 연번, 소재지, 지번, 면적(제곱미터), 용도, 허가일의 정보를 확인할 수 있다.
Author인천광역시 남동구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15104544&srcSe=7661IVAWM27C61E190

Alerts

면적(제곱미터) is highly overall correlated with 용도High correlation
용도 is highly overall correlated with 면적(제곱미터)High correlation
용도 is highly imbalanced (58.6%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-17 18:42:20.934408
Analysis finished2024-04-17 18:42:21.490185
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct234
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean117.5
Minimum1
Maximum234
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2024-04-18T03:42:21.551340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.65
Q159.25
median117.5
Q3175.75
95-th percentile222.35
Maximum234
Range233
Interquartile range (IQR)116.5

Descriptive statistics

Standard deviation67.694165
Coefficient of variation (CV)0.57612055
Kurtosis-1.2
Mean117.5
Median Absolute Deviation (MAD)58.5
Skewness0
Sum27495
Variance4582.5
MonotonicityStrictly increasing
2024-04-18T03:42:21.654261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
162 1
 
0.4%
150 1
 
0.4%
151 1
 
0.4%
152 1
 
0.4%
153 1
 
0.4%
154 1
 
0.4%
155 1
 
0.4%
156 1
 
0.4%
157 1
 
0.4%
Other values (224) 224
95.7%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
234 1
0.4%
233 1
0.4%
232 1
0.4%
231 1
0.4%
230 1
0.4%
229 1
0.4%
228 1
0.4%
227 1
0.4%
226 1
0.4%
225 1
0.4%

소재지
Categorical

Distinct17
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
고잔동
68 
수산동
36 
운연동
29 
간석동
24 
논현동
14 
Other values (12)
63 

Length

Max length7
Median length3
Mean length3.0512821
Min length3

Unique

Unique5 ?
Unique (%)2.1%

Sample

1st row간석동
2nd row고잔동
3rd row논현동
4th row간석동
5th row고잔동

Common Values

ValueCountFrequency (%)
고잔동 68
29.1%
수산동 36
15.4%
운연동 29
12.4%
간석동 24
 
10.3%
논현동 14
 
6.0%
장수동 12
 
5.1%
도림동 10
 
4.3%
서창동 10
 
4.3%
만수동 9
 
3.8%
구월동 8
 
3.4%
Other values (7) 14
 
6.0%

Length

2024-04-18T03:42:21.768414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
고잔동 71
30.2%
수산동 37
15.7%
운연동 29
12.3%
간석동 24
 
10.2%
논현동 15
 
6.4%
장수동 12
 
5.1%
서창동 12
 
5.1%
도림동 10
 
4.3%
만수동 9
 
3.8%
구월동 8
 
3.4%
Other values (3) 8
 
3.4%

지번
Text

Distinct220
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2024-04-18T03:42:22.015398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length24
Mean length8.6452991
Min length2

Characters and Unicode

Total characters2023
Distinct characters28
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique206 ?
Unique (%)88.0%

Sample

1st row1-161
2nd row460-21 외3
3rd row738-6
4th row1-863
5th row382-16
ValueCountFrequency (%)
19
 
5.2%
2필지 9
 
2.5%
294-9 4
 
1.1%
343-2 4
 
1.1%
1필지 4
 
1.1%
269-2 3
 
0.8%
1 3
 
0.8%
294-10 3
 
0.8%
외2 3
 
0.8%
17533 2
 
0.6%
Other values (292) 309
85.1%
2024-04-18T03:42:22.368577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 275
13.6%
1 250
12.4%
2 193
9.5%
3 162
 
8.0%
5 152
 
7.5%
4 142
 
7.0%
129
 
6.4%
6 127
 
6.3%
, 100
 
4.9%
0 87
 
4.3%
Other values (18) 406
20.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1359
67.2%
Dash Punctuation 275
 
13.6%
Other Letter 138
 
6.8%
Space Separator 129
 
6.4%
Other Punctuation 100
 
4.9%
Uppercase Letter 8
 
0.4%
Open Punctuation 7
 
0.3%
Close Punctuation 7
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
33.3%
33
23.9%
23
16.7%
23
16.7%
4
 
2.9%
4
 
2.9%
1
 
0.7%
1
 
0.7%
1
 
0.7%
1
 
0.7%
Decimal Number
ValueCountFrequency (%)
1 250
18.4%
2 193
14.2%
3 162
11.9%
5 152
11.2%
4 142
10.4%
6 127
9.3%
0 87
 
6.4%
9 85
 
6.3%
8 81
 
6.0%
7 80
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
A 5
62.5%
B 3
37.5%
Dash Punctuation
ValueCountFrequency (%)
- 275
100.0%
Space Separator
ValueCountFrequency (%)
129
100.0%
Other Punctuation
ValueCountFrequency (%)
, 100
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1877
92.8%
Hangul 138
 
6.8%
Latin 8
 
0.4%

Most frequent character per script

Common
ValueCountFrequency (%)
- 275
14.7%
1 250
13.3%
2 193
10.3%
3 162
8.6%
5 152
8.1%
4 142
7.6%
129
6.9%
6 127
6.8%
, 100
 
5.3%
0 87
 
4.6%
Other values (5) 260
13.9%
Hangul
ValueCountFrequency (%)
46
33.3%
33
23.9%
23
16.7%
23
16.7%
4
 
2.9%
4
 
2.9%
1
 
0.7%
1
 
0.7%
1
 
0.7%
1
 
0.7%
Latin
ValueCountFrequency (%)
A 5
62.5%
B 3
37.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1885
93.2%
Hangul 138
 
6.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 275
14.6%
1 250
13.3%
2 193
10.2%
3 162
8.6%
5 152
8.1%
4 142
7.5%
129
6.8%
6 127
6.7%
, 100
 
5.3%
0 87
 
4.6%
Other values (7) 268
14.2%
Hangul
ValueCountFrequency (%)
46
33.3%
33
23.9%
23
16.7%
23
16.7%
4
 
2.9%
4
 
2.9%
1
 
0.7%
1
 
0.7%
1
 
0.7%
1
 
0.7%

면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct212
Distinct (%)91.0%
Missing1
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean2190.5997
Minimum11
Maximum272338
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2024-04-18T03:42:22.479398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11
5-th percentile44.4
Q1303
median568.16
Q3930.4
95-th percentile2974
Maximum272338
Range272327
Interquartile range (IQR)627.4

Descriptive statistics

Standard deviation18091.305
Coefficient of variation (CV)8.2586082
Kurtosis217.10278
Mean2190.5997
Median Absolute Deviation (MAD)294.86
Skewness14.556952
Sum510409.74
Variance3.2729532 × 108
MonotonicityNot monotonic
2024-04-18T03:42:22.581954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
527.0 3
 
1.3%
711.0 3
 
1.3%
576.0 2
 
0.9%
602.9 2
 
0.9%
400.0 2
 
0.9%
536.0 2
 
0.9%
300.0 2
 
0.9%
1009.0 2
 
0.9%
330.0 2
 
0.9%
726.0 2
 
0.9%
Other values (202) 211
90.2%
ValueCountFrequency (%)
11.0 1
0.4%
20.2 1
0.4%
23.61 1
0.4%
28.0 1
0.4%
30.1 1
0.4%
30.3 1
0.4%
30.32 1
0.4%
34.0 1
0.4%
37.0 1
0.4%
40.0 1
0.4%
ValueCountFrequency (%)
272338.0 1
0.4%
50256.0 1
0.4%
8218.0 1
0.4%
4514.0 1
0.4%
4151.56 1
0.4%
4006.0 1
0.4%
3732.0 1
0.4%
3372.0 1
0.4%
3214.0 1
0.4%
3195.0 1
0.4%

용도
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct11
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
토지형질변경
167 
공작물설치
39 
건축부지조성
 
12
태양광발전설비
 
7
토지형질변경, 건축물의건축
 
3
Other values (6)
 
6

Length

Max length14
Median length6
Mean length6.0299145
Min length4

Unique

Unique6 ?
Unique (%)2.6%

Sample

1st row토지형질변경
2nd row토지형질변경
3rd row공작물설치
4th row토지형질변경
5th row토지형질변경

Common Values

ValueCountFrequency (%)
토지형질변경 167
71.4%
공작물설치 39
 
16.7%
건축부지조성 12
 
5.1%
태양광발전설비 7
 
3.0%
토지형질변경, 건축물의건축 3
 
1.3%
건축물의 건축 1
 
0.4%
토지분할 1
 
0.4%
노외 주차장조성 1
 
0.4%
건축부지조성, 옹벽설치 1
 
0.4%
증축에 따른 건축부지조성 1
 
0.4%

Length

2024-04-18T03:42:22.678730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
토지형질변경 170
70.2%
공작물설치 39
 
16.1%
건축부지조성 14
 
5.8%
태양광발전설비 7
 
2.9%
건축물의건축 3
 
1.2%
건축물의 1
 
0.4%
건축 1
 
0.4%
토지분할 1
 
0.4%
노외 1
 
0.4%
주차장조성 1
 
0.4%
Other values (4) 4
 
1.7%
Distinct189
Distinct (%)80.8%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum2018-01-09 00:00:00
Maximum2023-07-10 00:00:00
2024-04-18T03:42:22.771214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:42:23.088710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-04-18T03:42:21.204763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:42:21.087924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:42:21.267371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:42:21.143658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-18T03:42:23.155962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소재지면적(제곱미터)용도
연번1.0000.2410.0000.431
소재지0.2411.0000.0000.667
면적(제곱미터)0.0000.0001.0000.811
용도0.4310.6670.8111.000
2024-04-18T03:42:23.223419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지용도
소재지1.0000.316
용도0.3161.000
2024-04-18T03:42:23.283895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번면적(제곱미터)소재지용도
연번1.000-0.1450.0930.198
면적(제곱미터)-0.1451.0000.0000.680
소재지0.0930.0001.0000.316
용도0.1980.6800.3161.000

Missing values

2024-04-18T03:42:21.367755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-18T03:42:21.458633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번소재지지번면적(제곱미터)용도허가일
01간석동1-161388.0토지형질변경2018-01-09
12고잔동460-21 외3859.0토지형질변경2018-01-25
23논현동738-6146.7공작물설치2018-01-29
34간석동1-863303.6토지형질변경2018-02-23
45고잔동382-16166.0토지형질변경2018-03-06
56고잔동460-22번지 외2603.0토지형질변경2018-03-06
67간석동1-862 외1403.1토지형질변경2018-03-06
78도림동102-2번지 외 31238.0토지형질변경2018-03-12
89고잔동735,735-7,735-,8,735-94151.56공작물설치2018-03-14
910서창동655-6281.8토지형질변경2018-03-15
연번소재지지번면적(제곱미터)용도허가일
224225고잔동665-15210.9공작물설치2022-06-30
225226간석동561221.0토지형질변경2022-07-01
226227간석동564154.0토지형질변경2022-07-01
227228서창동62-2, 62-1, 63674.6토지형질변경2022-07-05
228229장수동21551985.0토지형질변경2022-07-06
229230고잔동457-1217.0토지형질변경2022-07-12
230231서창동22828628.4토지형질변경2022-07-13
231232서창동63, 62-2, 62-1821.9토지형질변경2022-07-13
232233고잔동734527.0공작물설치2023-07-07
233234도림동220-3527.0토지형질변경2023-07-10