Overview

Dataset statistics

Number of variables6
Number of observations65
Missing cells164
Missing cells (%)42.1%
Duplicate rows1
Duplicate rows (%)1.5%
Total size in memory3.3 KiB
Average record size in memory52.0 B

Variable types

Numeric2
Categorical2
Text2

Dataset

Description인천광역시 연수구 구도심 녹지현황 데이터로서 연번, 지역구분, 녹지명, 소재지, 면적(제곱미터) 등의 항목으로 이루어져 있습니다.
Author인천광역시 연수구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15116387&srcSe=7661IVAWM27C61E190

Alerts

Dataset has 1 (1.5%) duplicate rowsDuplicates
지역구분 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
데이터 기준일자 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
연번 is highly overall correlated with 지역구분 and 1 other fieldsHigh correlation
면적(제곱미터) is highly overall correlated with 지역구분 and 1 other fieldsHigh correlation
연번 has 41 (63.1%) missing valuesMissing
녹지명 has 41 (63.1%) missing valuesMissing
소재지 has 41 (63.1%) missing valuesMissing
면적(제곱미터) has 41 (63.1%) missing valuesMissing

Reproduction

Analysis started2024-01-28 13:59:47.311606
Analysis finished2024-01-28 13:59:48.097707
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct24
Distinct (%)100.0%
Missing41
Missing (%)63.1%
Infinite0
Infinite (%)0.0%
Mean12.5
Minimum1
Maximum24
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size717.0 B
2024-01-28T22:59:48.148112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.15
Q16.75
median12.5
Q318.25
95-th percentile22.85
Maximum24
Range23
Interquartile range (IQR)11.5

Descriptive statistics

Standard deviation7.0710678
Coefficient of variation (CV)0.56568542
Kurtosis-1.2
Mean12.5
Median Absolute Deviation (MAD)6
Skewness0
Sum300
Variance50
MonotonicityStrictly increasing
2024-01-28T22:59:48.247087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
14 1
 
1.5%
24 1
 
1.5%
23 1
 
1.5%
22 1
 
1.5%
21 1
 
1.5%
20 1
 
1.5%
19 1
 
1.5%
18 1
 
1.5%
17 1
 
1.5%
16 1
 
1.5%
Other values (14) 14
 
21.5%
(Missing) 41
63.1%
ValueCountFrequency (%)
1 1
1.5%
2 1
1.5%
3 1
1.5%
4 1
1.5%
5 1
1.5%
6 1
1.5%
7 1
1.5%
8 1
1.5%
9 1
1.5%
10 1
1.5%
ValueCountFrequency (%)
24 1
1.5%
23 1
1.5%
22 1
1.5%
21 1
1.5%
20 1
1.5%
19 1
1.5%
18 1
1.5%
17 1
1.5%
16 1
1.5%
15 1
1.5%

지역구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size652.0 B
<NA>
41 
원도심
24 

Length

Max length4
Median length4
Mean length3.6307692
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row원도심
2nd row원도심
3rd row원도심
4th row원도심
5th row원도심

Common Values

ValueCountFrequency (%)
<NA> 41
63.1%
원도심 24
36.9%

Length

2024-01-28T22:59:48.347880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T22:59:48.440137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 41
63.1%
원도심 24
36.9%

녹지명
Text

MISSING 

Distinct24
Distinct (%)100.0%
Missing41
Missing (%)63.1%
Memory size652.0 B
2024-01-28T22:59:48.599634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length5
Mean length7.875
Min length5

Characters and Unicode

Total characters189
Distinct characters31
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)100.0%

Sample

1st row선학1녹지
2nd row선학2녹지
3rd row선학3녹지
4th row선학4녹지
5th row선학5녹지
ValueCountFrequency (%)
선학2녹지 1
 
3.8%
선학3녹지 1
 
3.8%
청량경관녹지 1
 
3.8%
동춘2구역(5)(동춘4녹지 1
 
3.8%
동춘2구역(4)(동춘4녹지 1
 
3.8%
동춘2구역(3)(동춘3녹지 1
 
3.8%
송도유원지녹지 1
 
3.8%
완충녹지 1
 
3.8%
청학1호 1
 
3.8%
녹지 1
 
3.8%
Other values (16) 16
61.5%
2024-01-28T22:59:48.880553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25
13.2%
24
12.7%
12
 
6.3%
) 12
 
6.3%
( 12
 
6.3%
12
 
6.3%
2 11
 
5.8%
9
 
4.8%
7
 
3.7%
7
 
3.7%
Other values (21) 58
30.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 129
68.3%
Decimal Number 34
 
18.0%
Close Punctuation 12
 
6.3%
Open Punctuation 12
 
6.3%
Space Separator 2
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
19.4%
24
18.6%
12
9.3%
12
9.3%
9
 
7.0%
7
 
5.4%
7
 
5.4%
6
 
4.7%
6
 
4.7%
5
 
3.9%
Other values (11) 16
12.4%
Decimal Number
ValueCountFrequency (%)
2 11
32.4%
3 6
17.6%
4 6
17.6%
1 6
17.6%
5 3
 
8.8%
7 1
 
2.9%
6 1
 
2.9%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 129
68.3%
Common 60
31.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
19.4%
24
18.6%
12
9.3%
12
9.3%
9
 
7.0%
7
 
5.4%
7
 
5.4%
6
 
4.7%
6
 
4.7%
5
 
3.9%
Other values (11) 16
12.4%
Common
ValueCountFrequency (%)
) 12
20.0%
( 12
20.0%
2 11
18.3%
3 6
10.0%
4 6
10.0%
1 6
10.0%
5 3
 
5.0%
2
 
3.3%
7 1
 
1.7%
6 1
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 129
68.3%
ASCII 60
31.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
25
19.4%
24
18.6%
12
9.3%
12
9.3%
9
 
7.0%
7
 
5.4%
7
 
5.4%
6
 
4.7%
6
 
4.7%
5
 
3.9%
Other values (11) 16
12.4%
ASCII
ValueCountFrequency (%)
) 12
20.0%
( 12
20.0%
2 11
18.3%
3 6
10.0%
4 6
10.0%
1 6
10.0%
5 3
 
5.0%
2
 
3.3%
7 1
 
1.7%
6 1
 
1.7%

소재지
Text

MISSING 

Distinct24
Distinct (%)100.0%
Missing41
Missing (%)63.1%
Memory size652.0 B
2024-01-28T22:59:49.038278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length9.625
Min length7

Characters and Unicode

Total characters231
Distinct characters27
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)100.0%

Sample

1st row선학동 401-4
2nd row선학동 412-9
3rd row선학동 350-1
4th row선학동 350-3,-6
5th row선학동 340-4
ValueCountFrequency (%)
동춘동 10
19.2%
선학동 5
 
9.6%
연수동 4
 
7.7%
일원 4
 
7.7%
청학동 4
 
7.7%
340-4 1
 
1.9%
287 1
 
1.9%
950번지 1
 
1.9%
산53-58일원 1
 
1.9%
산45-2일원 1
 
1.9%
Other values (20) 20
38.5%
2024-01-28T22:59:49.294687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
14.7%
29
12.6%
5 21
 
9.1%
- 19
 
8.2%
4 11
 
4.8%
0 10
 
4.3%
10
 
4.3%
3 10
 
4.3%
1 9
 
3.9%
9
 
3.9%
Other values (17) 69
29.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 91
39.4%
Decimal Number 91
39.4%
Space Separator 29
 
12.6%
Dash Punctuation 19
 
8.2%
Other Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
37.4%
10
 
11.0%
9
 
9.9%
7
 
7.7%
7
 
7.7%
5
 
5.5%
4
 
4.4%
4
 
4.4%
4
 
4.4%
3
 
3.3%
Other values (4) 4
 
4.4%
Decimal Number
ValueCountFrequency (%)
5 21
23.1%
4 11
12.1%
0 10
11.0%
3 10
11.0%
1 9
9.9%
7 8
 
8.8%
2 7
 
7.7%
8 6
 
6.6%
9 6
 
6.6%
6 3
 
3.3%
Space Separator
ValueCountFrequency (%)
29
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 140
60.6%
Hangul 91
39.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
37.4%
10
 
11.0%
9
 
9.9%
7
 
7.7%
7
 
7.7%
5
 
5.5%
4
 
4.4%
4
 
4.4%
4
 
4.4%
3
 
3.3%
Other values (4) 4
 
4.4%
Common
ValueCountFrequency (%)
29
20.7%
5 21
15.0%
- 19
13.6%
4 11
 
7.9%
0 10
 
7.1%
3 10
 
7.1%
1 9
 
6.4%
7 8
 
5.7%
2 7
 
5.0%
8 6
 
4.3%
Other values (3) 10
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 140
60.6%
Hangul 91
39.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
34
37.4%
10
 
11.0%
9
 
9.9%
7
 
7.7%
7
 
7.7%
5
 
5.5%
4
 
4.4%
4
 
4.4%
4
 
4.4%
3
 
3.3%
Other values (4) 4
 
4.4%
ASCII
ValueCountFrequency (%)
29
20.7%
5 21
15.0%
- 19
13.6%
4 11
 
7.9%
0 10
 
7.1%
3 10
 
7.1%
1 9
 
6.4%
7 8
 
5.7%
2 7
 
5.0%
8 6
 
4.3%
Other values (3) 10
 
7.1%

면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct24
Distinct (%)100.0%
Missing41
Missing (%)63.1%
Infinite0
Infinite (%)0.0%
Mean13352.492
Minimum76
Maximum176086
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size717.0 B
2024-01-28T22:59:49.408069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum76
5-th percentile877.85
Q11199.8
median3153.85
Q36562
95-th percentile32453.335
Maximum176086
Range176010
Interquartile range (IQR)5362.2

Descriptive statistics

Standard deviation35830.092
Coefficient of variation (CV)2.6834012
Kurtosis20.61051
Mean13352.492
Median Absolute Deviation (MAD)1974.75
Skewness4.4302621
Sum320459.8
Variance1.2837955 × 109
MonotonicityNot monotonic
2024-01-28T22:59:49.506884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
4043.1 1
 
1.5%
1977.6 1
 
1.5%
1105.0 1
 
1.5%
1346.0 1
 
1.5%
1159.0 1
 
1.5%
854.0 1
 
1.5%
6259.0 1
 
1.5%
4130.0 1
 
1.5%
1226.0 1
 
1.5%
1013.0 1
 
1.5%
Other values (14) 14
 
21.5%
(Missing) 41
63.1%
ValueCountFrequency (%)
76.0 1
1.5%
854.0 1
1.5%
1013.0 1
1.5%
1105.0 1
1.5%
1159.0 1
1.5%
1199.2 1
1.5%
1200.0 1
1.5%
1200.1 1
1.5%
1226.0 1
1.5%
1346.0 1
1.5%
ValueCountFrequency (%)
176086.0 1
1.5%
32720.2 1
1.5%
30941.1 1
1.5%
21159.8 1
1.5%
11475.6 1
1.5%
7471.0 1
1.5%
6259.0 1
1.5%
4130.0 1
1.5%
4043.1 1
1.5%
3800.0 1
1.5%

데이터 기준일자
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size652.0 B
<NA>
41 
2023-07-06
24 

Length

Max length10
Median length4
Mean length6.2153846
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-06
2nd row2023-07-06
3rd row2023-07-06
4th row2023-07-06
5th row2023-07-06

Common Values

ValueCountFrequency (%)
<NA> 41
63.1%
2023-07-06 24
36.9%

Length

2024-01-28T22:59:49.616365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T22:59:49.692663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 41
63.1%
2023-07-06 24
36.9%

Interactions

2024-01-28T22:59:47.642658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:59:47.497377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:59:47.719098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:59:47.569626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T22:59:49.751529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번녹지명소재지면적(제곱미터)
연번1.0001.0001.0000.481
녹지명1.0001.0001.0001.000
소재지1.0001.0001.0001.000
면적(제곱미터)0.4811.0001.0001.000
2024-01-28T22:59:49.831732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역구분데이터 기준일자
지역구분1.0001.000
데이터 기준일자1.0001.000
2024-01-28T22:59:49.902841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번면적(제곱미터)지역구분데이터 기준일자
연번1.000-0.3231.0001.000
면적(제곱미터)-0.3231.0001.0001.000
지역구분1.0001.0001.0001.000
데이터 기준일자1.0001.0001.0001.000

Missing values

2024-01-28T22:59:47.820631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T22:59:47.931500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-28T22:59:48.030066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번지역구분녹지명소재지면적(제곱미터)데이터 기준일자
01원도심선학1녹지선학동 401-43710.42023-07-06
12원도심선학2녹지선학동 412-976.02023-07-06
23원도심선학3녹지선학동 350-13469.82023-07-06
34원도심선학4녹지선학동 350-3,-62837.92023-07-06
45원도심선학5녹지선학동 340-47471.02023-07-06
56원도심연수1녹지연수동 58530941.12023-07-06
67원도심연수2녹지연수동 583176086.02023-07-06
78원도심연수3녹지동춘동 943-41199.22023-07-06
89원도심연수4녹지연수동 479-1021159.82023-07-06
910원도심연수5녹지연수동 631-711475.62023-07-06
연번지역구분녹지명소재지면적(제곱미터)데이터 기준일자
55<NA><NA><NA><NA><NA><NA>
56<NA><NA><NA><NA><NA><NA>
57<NA><NA><NA><NA><NA><NA>
58<NA><NA><NA><NA><NA><NA>
59<NA><NA><NA><NA><NA><NA>
60<NA><NA><NA><NA><NA><NA>
61<NA><NA><NA><NA><NA><NA>
62<NA><NA><NA><NA><NA><NA>
63<NA><NA><NA><NA><NA><NA>
64<NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

연번지역구분녹지명소재지면적(제곱미터)데이터 기준일자# duplicates
0<NA><NA><NA><NA><NA><NA>41