Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows529
Duplicate rows (%)5.3%
Total size in memory722.7 KiB
Average record size in memory74.0 B

Variable types

Categorical7
Numeric1

Dataset

Description경상남도 사천시 공간정보시스템 가로수(DB) 자료입니다.(행정읍면동, 식재일자, 가로수 직경, 수목보호판 유무 등) 자료입니다.
Author경상남도 사천시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15091551

Alerts

대장초기화여부 has constant value ""Constant
Dataset has 529 (5.3%) duplicate rowsDuplicates
지주목유무 is highly overall correlated with 가로수직경 and 5 other fieldsHigh correlation
지주목재질 is highly overall correlated with 수목보호판유무 and 1 other fieldsHigh correlation
수목보호판재질 is highly overall correlated with 수목보호판유무 and 1 other fieldsHigh correlation
수목보호판유무 is highly overall correlated with 행정읍면동 and 4 other fieldsHigh correlation
가로수직경 is highly overall correlated with 지주목유무High correlation
행정읍면동 is highly overall correlated with 수목보호판유무 and 1 other fieldsHigh correlation
식재일자 is highly overall correlated with 수목보호판유무 and 1 other fieldsHigh correlation
식재일자 is highly imbalanced (64.6%)Imbalance
가로수직경 has 4819 (48.2%) zerosZeros

Reproduction

Analysis started2023-12-10 23:38:39.738004
Analysis finished2023-12-10 23:38:40.813399
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정읍면동
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
서포면
2321 
사남면
1393 
곤명면
1191 
용현면
1161 
곤양면
935 
Other values (11)
2999 

Length

Max length4
Median length3
Mean length3.0192
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row용현면
2nd row곤양면
3rd row용현면
4th row서포면
5th row용현면

Common Values

ValueCountFrequency (%)
서포면 2321
23.2%
사남면 1393
13.9%
곤명면 1191
11.9%
용현면 1161
11.6%
곤양면 935
9.3%
사천읍 675
 
6.8%
축동면 566
 
5.7%
남양동 350
 
3.5%
동서동 350
 
3.5%
향촌동 311
 
3.1%
Other values (6) 747
 
7.5%

Length

2023-12-11T08:38:40.876178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서포면 2321
23.2%
사남면 1393
13.9%
곤명면 1191
11.9%
용현면 1161
11.6%
곤양면 935
9.3%
사천읍 675
 
6.8%
축동면 566
 
5.7%
남양동 350
 
3.5%
동서동 350
 
3.5%
향촌동 311
 
3.1%
Other values (6) 747
 
7.5%

식재일자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct23
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1960-01-01
7658 
2000-12-31
 
453
2011-01-01
 
435
<NA>
 
335
2005-12-31
 
182
Other values (18)
937 

Length

Max length10
Median length10
Mean length9.799
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2011-01-01
2nd row1960-01-01
3rd row2007-01-01
4th row1960-01-01
5th row2011-01-01

Common Values

ValueCountFrequency (%)
1960-01-01 7658
76.6%
2000-12-31 453
 
4.5%
2011-01-01 435
 
4.3%
<NA> 335
 
3.4%
2005-12-31 182
 
1.8%
1900-01-01 170
 
1.7%
2010-01-01 139
 
1.4%
2002-02-07 115
 
1.1%
1992-04-30 113
 
1.1%
2007-01-01 110
 
1.1%
Other values (13) 290
 
2.9%

Length

2023-12-11T08:38:41.013578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1960-01-01 7658
76.6%
2000-12-31 453
 
4.5%
2011-01-01 435
 
4.3%
na 335
 
3.4%
2005-12-31 182
 
1.8%
1900-01-01 170
 
1.7%
2010-01-01 139
 
1.4%
2002-02-07 115
 
1.1%
1992-04-30 113
 
1.1%
2007-01-01 110
 
1.1%
Other values (13) 290
 
2.9%

가로수직경
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct67
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.489309
Minimum0
Maximum90
Zeros4819
Zeros (%)48.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T08:38:41.154710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0.1
Q311
95-th percentile28
Maximum90
Range90
Interquartile range (IQR)11

Descriptive statistics

Standard deviation13.074322
Coefficient of variation (CV)1.7457314
Kurtosis13.068059
Mean7.489309
Median Absolute Deviation (MAD)0.1
Skewness3.1684687
Sum74893.09
Variance170.93789
MonotonicityNot monotonic
2023-12-11T08:38:41.287259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 4819
48.2%
0.1 519
 
5.2%
10.0 376
 
3.8%
6.0 368
 
3.7%
7.0 355
 
3.5%
8.0 227
 
2.3%
20.0 223
 
2.2%
15.0 218
 
2.2%
5.0 195
 
1.9%
12.0 193
 
1.9%
Other values (57) 2507
25.1%
ValueCountFrequency (%)
0.0 4819
48.2%
0.07 46
 
0.5%
0.08 26
 
0.3%
0.1 519
 
5.2%
0.12 25
 
0.2%
0.13 48
 
0.5%
0.15 55
 
0.5%
0.3 14
 
0.1%
1.0 37
 
0.4%
1.4 1
 
< 0.1%
ValueCountFrequency (%)
90.0 31
0.3%
85.0 1
 
< 0.1%
80.0 56
0.6%
70.0 66
0.7%
60.0 33
0.3%
56.0 1
 
< 0.1%
55.0 1
 
< 0.1%
54.0 1
 
< 0.1%
52.0 2
 
< 0.1%
51.0 2
 
< 0.1%

수목보호판유무
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
없음
4847 
미분류
4626 
있음
527 

Length

Max length3
Median length2
Mean length2.4626
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row없음
2nd row없음
3rd row없음
4th row미분류
5th row없음

Common Values

ValueCountFrequency (%)
없음 4847
48.5%
미분류 4626
46.3%
있음 527
 
5.3%

Length

2023-12-11T08:38:41.414420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:38:41.507595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
없음 4847
48.5%
미분류 4626
46.3%
있음 527
 
5.3%

수목보호판재질
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
미분류
5382 
미설치
3761 
속성나중입력
 
285
주철
 
261
합성수지
 
205
Other values (2)
 
106

Length

Max length6
Median length3
Mean length3.0747
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미분류
2nd row미설치
3rd row미분류
4th row미분류
5th row미분류

Common Values

ValueCountFrequency (%)
미분류 5382
53.8%
미설치 3761
37.6%
속성나중입력 285
 
2.9%
주철 261
 
2.6%
합성수지 205
 
2.1%
인조석 54
 
0.5%
기타 52
 
0.5%

Length

2023-12-11T08:38:41.614432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:38:41.720636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미분류 5382
53.8%
미설치 3761
37.6%
속성나중입력 285
 
2.9%
주철 261
 
2.6%
합성수지 205
 
2.1%
인조석 54
 
0.5%
기타 52
 
0.5%

지주목유무
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
미분류
4626 
있음
2695 
없음
2679 

Length

Max length3
Median length2
Mean length2.4626
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row있음
2nd row있음
3rd row있음
4th row미분류
5th row있음

Common Values

ValueCountFrequency (%)
미분류 4626
46.3%
있음 2695
27.0%
없음 2679
26.8%

Length

2023-12-11T08:38:41.843356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:38:41.959342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미분류 4626
46.3%
있음 2695
27.0%
없음 2679
26.8%

지주목재질
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
미분류
5271 
미설치
2432 
목재
2211 
철재
 
65
속성나중입력
 
14

Length

Max length6
Median length3
Mean length2.7759
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미분류
2nd row목재
3rd row목재
4th row미분류
5th row미분류

Common Values

ValueCountFrequency (%)
미분류 5271
52.7%
미설치 2432
24.3%
목재 2211
22.1%
철재 65
 
0.7%
속성나중입력 14
 
0.1%
기타 7
 
0.1%

Length

2023-12-11T08:38:42.099120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:38:42.239350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미분류 5271
52.7%
미설치 2432
24.3%
목재 2211
22.1%
철재 65
 
0.7%
속성나중입력 14
 
0.1%
기타 7
 
0.1%

대장초기화여부
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 10000
100.0%

Length

2023-12-11T08:38:42.355402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:38:42.440768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 10000
100.0%

Interactions

2023-12-11T08:38:40.422807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:38:42.509127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정읍면동식재일자가로수직경수목보호판유무수목보호판재질지주목유무지주목재질
행정읍면동1.0000.8030.6390.8200.7250.8460.708
식재일자0.8031.0000.6300.7200.7110.7250.624
가로수직경0.6390.6301.0000.6150.5670.6910.602
수목보호판유무0.8200.7200.6151.0000.9150.9430.904
수목보호판재질0.7250.7110.5670.9151.0000.7260.655
지주목유무0.8460.7250.6910.9430.7261.0000.994
지주목재질0.7080.6240.6020.9040.6550.9941.000
2023-12-11T08:38:42.619909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지주목유무행정읍면동지주목재질수목보호판재질식재일자수목보호판유무
지주목유무1.0000.7030.9080.6490.5180.707
행정읍면동0.7031.0000.4370.4400.4060.664
지주목재질0.9080.4371.0000.4640.3440.630
수목보호판재질0.6490.4400.4641.0000.4230.924
식재일자0.5180.4060.3440.4231.0000.512
수목보호판유무0.7070.6640.6300.9240.5121.000
2023-12-11T08:38:42.748703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
가로수직경행정읍면동식재일자수목보호판유무수목보호판재질지주목유무지주목재질
가로수직경1.0000.3070.2830.4430.3210.5350.361
행정읍면동0.3071.0000.4060.6640.4400.7030.437
식재일자0.2830.4061.0000.5120.4230.5180.344
수목보호판유무0.4430.6640.5121.0000.9240.7070.630
수목보호판재질0.3210.4400.4230.9241.0000.6490.464
지주목유무0.5350.7030.5180.7070.6491.0000.908
지주목재질0.3610.4370.3440.6300.4640.9081.000

Missing values

2023-12-11T08:38:40.589966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:38:40.740304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정읍면동식재일자가로수직경수목보호판유무수목보호판재질지주목유무지주목재질대장초기화여부
34752용현면2011-01-010.1없음미분류있음미분류1
14082곤양면1960-01-016.0없음미설치있음목재1
511용현면2007-01-010.12없음미분류있음목재1
22486서포면1960-01-010.0미분류미분류미분류미분류1
35194용현면2011-01-010.1없음미분류있음미분류1
13467곤양면1960-01-017.0없음미설치있음목재1
23115서포면1960-01-010.0미분류미분류미분류미분류1
34267남양동2011-01-010.1없음미분류있음미분류1
30332축동면1960-01-010.0미분류미분류미분류미분류1
9955동서동1960-01-0121.0없음미설치없음미설치1
행정읍면동식재일자가로수직경수목보호판유무수목보호판재질지주목유무지주목재질대장초기화여부
36633곤양면<NA>17.0없음속성나중입력있음목재1
8913벌용동1960-01-0115.0있음인조석없음미설치1
31124축동면1960-01-010.0미분류미분류미분류미분류1
5482사천읍1960-01-016.0없음미설치있음목재1
10315선구동2003-12-0270.0없음미설치없음미설치1
17993곤명면1960-01-010.0미분류미분류미분류미분류1
24230서포면1960-01-010.0미분류미분류미분류미분류1
27594서포면1960-01-010.0미분류미분류미분류미분류1
18086곤명면1960-01-010.0미분류미분류미분류미분류1
8081용현면1960-01-0122.0없음미설치없음미설치1

Duplicate rows

Most frequently occurring

행정읍면동식재일자가로수직경수목보호판유무수목보호판재질지주목유무지주목재질대장초기화여부# duplicates
355서포면1960-01-010.0미분류미분류미분류미분류12264
0곤명면1960-01-010.0미분류미분류미분류미분류11172
478축동면1960-01-010.0미분류미분류미분류미분류1541
7곤양면1960-01-010.0미분류미분류미분류미분류1439
457용현면2011-01-010.1없음미분류있음미분류1282
305사천읍1960-01-010.0미분류미분류미분류미분류1163
6곤양면1900-01-010.0없음미분류없음미분류1162
228사남면1960-01-015.0없음미설치있음목재1154
230사남면1960-01-016.0없음미설치있음목재1132
455용현면2010-01-010.1없음미분류있음목재1104