Overview

Dataset statistics

Number of variables6
Number of observations33
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory56.0 B

Variable types

Numeric4
Categorical1
Text1

Dataset

Description경상남도 하동군에 있는 도시숲 조성 현황 (연번, 연도, 종류, 소재지, 사업량(키로미터), 예산액(천원) 등)의 정보를 제공하고 있습니다
Author경상남도 하동군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15086123

Alerts

연번 is highly overall correlated with 연도 and 1 other fieldsHigh correlation
연도 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
사업량(키로미터) is highly overall correlated with 연번 and 2 other fieldsHigh correlation
예산액(천원) is highly overall correlated with 사업량(키로미터)High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 22:51:01.273849
Analysis finished2023-12-10 22:51:03.060166
Duration1.79 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17
Minimum1
Maximum33
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-11T07:51:03.126715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.6
Q19
median17
Q325
95-th percentile31.4
Maximum33
Range32
Interquartile range (IQR)16

Descriptive statistics

Standard deviation9.6695398
Coefficient of variation (CV)0.56879646
Kurtosis-1.2
Mean17
Median Absolute Deviation (MAD)8
Skewness0
Sum561
Variance93.5
MonotonicityStrictly increasing
2023-12-11T07:51:03.252649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
1 1
 
3.0%
26 1
 
3.0%
20 1
 
3.0%
21 1
 
3.0%
22 1
 
3.0%
23 1
 
3.0%
24 1
 
3.0%
25 1
 
3.0%
27 1
 
3.0%
2 1
 
3.0%
Other values (23) 23
69.7%
ValueCountFrequency (%)
1 1
3.0%
2 1
3.0%
3 1
3.0%
4 1
3.0%
5 1
3.0%
6 1
3.0%
7 1
3.0%
8 1
3.0%
9 1
3.0%
10 1
3.0%
ValueCountFrequency (%)
33 1
3.0%
32 1
3.0%
31 1
3.0%
30 1
3.0%
29 1
3.0%
28 1
3.0%
27 1
3.0%
26 1
3.0%
25 1
3.0%
24 1
3.0%

연도
Real number (ℝ)

HIGH CORRELATION 

Distinct13
Distinct (%)39.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2017.7576
Minimum2011
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-11T07:51:03.371032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2011
5-th percentile2012
Q12016
median2019
Q32020
95-th percentile2022.4
Maximum2023
Range12
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.3263184
Coefficient of variation (CV)0.0016485223
Kurtosis-0.70522017
Mean2017.7576
Median Absolute Deviation (MAD)2
Skewness-0.50620107
Sum66586
Variance11.064394
MonotonicityIncreasing
2023-12-11T07:51:03.484705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
2020 6
18.2%
2019 5
15.2%
2018 4
12.1%
2013 3
9.1%
2016 3
9.1%
2021 3
9.1%
2012 2
 
6.1%
2023 2
 
6.1%
2011 1
 
3.0%
2014 1
 
3.0%
Other values (3) 3
9.1%
ValueCountFrequency (%)
2011 1
 
3.0%
2012 2
 
6.1%
2013 3
9.1%
2014 1
 
3.0%
2015 1
 
3.0%
2016 3
9.1%
2017 1
 
3.0%
2018 4
12.1%
2019 5
15.2%
2020 6
18.2%
ValueCountFrequency (%)
2023 2
 
6.1%
2022 1
 
3.0%
2021 3
9.1%
2020 6
18.2%
2019 5
15.2%
2018 4
12.1%
2017 1
 
3.0%
2016 3
9.1%
2015 1
 
3.0%
2014 1
 
3.0%

종류
Categorical

Distinct5
Distinct (%)15.2%
Missing0
Missing (%)0.0%
Memory size396.0 B
가로수
23 
도시숲
도시녹지
학교숲
자녀안심숲
 
1

Length

Max length5
Median length3
Mean length3.1515152
Min length3

Unique

Unique1 ?
Unique (%)3.0%

Sample

1st row도시숲
2nd row도시녹지
3rd row학교숲
4th row도시녹지
5th row도시녹지

Common Values

ValueCountFrequency (%)
가로수 23
69.7%
도시숲 3
 
9.1%
도시녹지 3
 
9.1%
학교숲 3
 
9.1%
자녀안심숲 1
 
3.0%

Length

2023-12-11T07:51:03.615206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:51:03.755766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가로수 23
69.7%
도시숲 3
 
9.1%
도시녹지 3
 
9.1%
학교숲 3
 
9.1%
자녀안심숲 1
 
3.0%
Distinct28
Distinct (%)84.8%
Missing0
Missing (%)0.0%
Memory size396.0 B
2023-12-11T07:51:04.013417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length18
Mean length14.121212
Min length6

Characters and Unicode

Total characters466
Distinct characters89
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)75.8%

Sample

1st row금성면 가린길 32 궁항초등학교
2nd row금성해안도로등
3rd row하동고등학교
4th row지방도 1005호선변
5th row국도 59호선변
ValueCountFrequency (%)
지방도 8
 
12.7%
1002호선변(왕벚나무 4
 
6.3%
1002호선(왕벚나무 3
 
4.8%
국도 3
 
4.8%
국도2호선변(배롱나무 2
 
3.2%
국도19호선 2
 
3.2%
국도19호선변(왕벚나무 2
 
3.2%
하동읍 2
 
3.2%
국도19호선(배롱나무 1
 
1.6%
국도19호선(왕벚나무 1
 
1.6%
Other values (35) 35
55.6%
2023-12-11T07:51:04.435420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30
 
6.4%
25
 
5.4%
( 23
 
4.9%
) 23
 
4.9%
22
 
4.7%
22
 
4.7%
22
 
4.7%
20
 
4.3%
1 19
 
4.1%
0 17
 
3.6%
Other values (79) 243
52.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 319
68.5%
Decimal Number 64
 
13.7%
Space Separator 30
 
6.4%
Open Punctuation 23
 
4.9%
Close Punctuation 23
 
4.9%
Other Punctuation 6
 
1.3%
Math Symbol 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
7.8%
22
 
6.9%
22
 
6.9%
22
 
6.9%
20
 
6.3%
15
 
4.7%
15
 
4.7%
11
 
3.4%
11
 
3.4%
10
 
3.1%
Other values (67) 146
45.8%
Decimal Number
ValueCountFrequency (%)
1 19
29.7%
0 17
26.6%
2 11
17.2%
9 11
17.2%
3 2
 
3.1%
7 2
 
3.1%
5 2
 
3.1%
Space Separator
ValueCountFrequency (%)
30
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Other Punctuation
ValueCountFrequency (%)
, 6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 319
68.5%
Common 147
31.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
7.8%
22
 
6.9%
22
 
6.9%
22
 
6.9%
20
 
6.3%
15
 
4.7%
15
 
4.7%
11
 
3.4%
11
 
3.4%
10
 
3.1%
Other values (67) 146
45.8%
Common
ValueCountFrequency (%)
30
20.4%
( 23
15.6%
) 23
15.6%
1 19
12.9%
0 17
11.6%
2 11
 
7.5%
9 11
 
7.5%
, 6
 
4.1%
3 2
 
1.4%
7 2
 
1.4%
Other values (2) 3
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 319
68.5%
ASCII 147
31.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30
20.4%
( 23
15.6%
) 23
15.6%
1 19
12.9%
0 17
11.6%
2 11
 
7.5%
9 11
 
7.5%
, 6
 
4.1%
3 2
 
1.4%
7 2
 
1.4%
Other values (2) 3
 
2.0%
Hangul
ValueCountFrequency (%)
25
 
7.8%
22
 
6.9%
22
 
6.9%
22
 
6.9%
20
 
6.3%
15
 
4.7%
15
 
4.7%
11
 
3.4%
11
 
3.4%
10
 
3.1%
Other values (67) 146
45.8%

사업량(키로미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct23
Distinct (%)69.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.300015
Minimum0.013
Maximum293
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-11T07:51:04.555320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.013
5-th percentile0.0255
Q10.4
median0.8
Q31.4
95-th percentile8.2
Maximum293
Range292.987
Interquartile range (IQR)1

Descriptive statistics

Standard deviation50.794488
Coefficient of variation (CV)4.9314964
Kurtosis32.872151
Mean10.300015
Median Absolute Deviation (MAD)0.5
Skewness5.728583
Sum339.9005
Variance2580.08
MonotonicityNot monotonic
2023-12-11T07:51:04.665178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
0.5 3
 
9.1%
1.0 3
 
9.1%
0.4 3
 
9.1%
0.3 2
 
6.1%
0.6 2
 
6.1%
2.8 2
 
6.1%
1.4 2
 
6.1%
293.0 1
 
3.0%
0.05 1
 
3.0%
0.0325 1
 
3.0%
Other values (13) 13
39.4%
ValueCountFrequency (%)
0.013 1
 
3.0%
0.015 1
 
3.0%
0.0325 1
 
3.0%
0.05 1
 
3.0%
0.2 1
 
3.0%
0.3 2
6.1%
0.4 3
9.1%
0.5 3
9.1%
0.6 2
6.1%
0.7 1
 
3.0%
ValueCountFrequency (%)
293.0 1
 
3.0%
10.0 1
 
3.0%
7.0 1
 
3.0%
5.44 1
 
3.0%
2.8 2
6.1%
2.0 1
 
3.0%
1.7 1
 
3.0%
1.4 2
6.1%
1.2 1
 
3.0%
1.0 3
9.1%

예산액(천원)
Real number (ℝ)

HIGH CORRELATION 

Distinct32
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean118472.91
Minimum10220
Maximum1500000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-11T07:51:04.782339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10220
5-th percentile16332.2
Q127420
median54759
Q370442
95-th percentile425376.4
Maximum1500000
Range1489780
Interquartile range (IQR)43022

Descriptive statistics

Standard deviation270603.31
Coefficient of variation (CV)2.2840944
Kurtosis22.683236
Mean118472.91
Median Absolute Deviation (MAD)27339
Skewness4.5932376
Sum3909606
Variance7.3226153 × 1010
MonotonicityNot monotonic
2023-12-11T07:51:04.900414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
60000 2
 
6.1%
58205 1
 
3.0%
1500000 1
 
3.0%
63170 1
 
3.0%
88358 1
 
3.0%
33220 1
 
3.0%
19700 1
 
3.0%
19500 1
 
3.0%
18980 1
 
3.0%
44755 1
 
3.0%
Other values (22) 22
66.7%
ValueCountFrequency (%)
10220 1
3.0%
14249 1
3.0%
17721 1
3.0%
18980 1
3.0%
19098 1
3.0%
19500 1
3.0%
19700 1
3.0%
20460 1
3.0%
27420 1
3.0%
28043 1
3.0%
ValueCountFrequency (%)
1500000 1
3.0%
592000 1
3.0%
314294 1
3.0%
169590 1
3.0%
90184 1
3.0%
88358 1
3.0%
87315 1
3.0%
86139 1
3.0%
70442 1
3.0%
66059 1
3.0%

Interactions

2023-12-11T07:51:02.596047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:01.497756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:01.887662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:02.234681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:02.677464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:01.595126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:01.963486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:02.326673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:02.749577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:01.693204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:02.042655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:02.418687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:02.827068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:01.797457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:02.135148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:51:02.493694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:51:04.993702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번연도종류소재지사업량(키로미터)예산액(천원)
연번1.0000.9590.3220.8420.0000.228
연도0.9591.0000.6700.938NaN0.367
종류0.3220.6701.0001.0000.3830.841
소재지0.8420.9381.0001.0001.0001.000
사업량(키로미터)0.000NaN0.3831.0001.0000.000
예산액(천원)0.2280.3670.8411.0000.0001.000
2023-12-11T07:51:05.117144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번연도사업량(키로미터)예산액(천원)종류
연번1.0000.993-0.649-0.2230.080
연도0.9931.000-0.638-0.2090.267
사업량(키로미터)-0.649-0.6381.0000.5880.440
예산액(천원)-0.223-0.2090.5881.0000.474
종류0.0800.2670.4400.4741.000

Missing values

2023-12-11T07:51:02.934422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:51:03.024200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번연도종류소재지사업량(키로미터)예산액(천원)
012011도시숲금성면 가린길 32 궁항초등학교293.058205
122012도시녹지금성해안도로등5.44592000
232012학교숲하동고등학교0.560000
342013도시녹지지방도 1005호선변10.087315
452013도시녹지국도 59호선변7.0314294
562013학교숲하동읍 연화길 77 중앙중학교0.859917
672014가로수하동 두곡~화심 가로수 식재0.620460
782015가로수흥룡마을 가로변1.034932
892016학교숲하동여자고등학교0.560000
9102016가로수국도19호선변(왕벚나무)2.890184
연번연도종류소재지사업량(키로미터)예산액(천원)
23242020가로수지방도 1002호선변(왕벚나무)0.319700
24252020가로수국도19호선, 지방도 1002호선(왕벚나무)0.219500
25262020가로수국도19호선, 지방도 1002호선(왕벚나무)0.418980
26272020도시숲하동읍 미세먼지차단숲2.01500000
27282021가로수국도19호선변(왕벚나무, 배롱나무)1.044755
28292021가로수지방도 1002호선(왕벚나무)0.01314249
29302021가로수국도19호선(배롱나무)0.01510220
30312022가로수지방도1023호선(목수국)0.032566059
31322023도시숲진교메타세쿼이아길(애기동백, 목수국)0.4169590
32332023자녀안심숲진교초등학교(에메랄드그린, 산철쭉)0.0586139