Overview

Dataset statistics

Number of variables5
Number of observations220
Missing cells1
Missing cells (%)0.1%
Duplicate rows1
Duplicate rows (%)0.5%
Total size in memory9.2 KiB
Average record size in memory42.6 B

Variable types

Categorical4
Numeric1

Dataset

Description전라남도 곡성군 도시계획정보시스템(UPIS) DB 내 지역지구 현황 데이터를 제공합니다,(농림지역현황, 경관지구현황 등 포함)
Author전라남도 곡성군
URLhttps://www.data.go.kr/data/15123819/fileData.do

Alerts

Dataset has 1 (0.5%) duplicate rowsDuplicates
현황도형 생성일시 is highly overall correlated with 테이블명 and 2 other fieldsHigh correlation
라벨명 is highly overall correlated with 테이블명 and 1 other fieldsHigh correlation
길이 도형 is highly overall correlated with 현황도형 생성일시High correlation
테이블명 is highly overall correlated with 라벨명 and 1 other fieldsHigh correlation
테이블명 is highly imbalanced (82.3%)Imbalance
라벨명 is highly imbalanced (83.0%)Imbalance
길이 도형 is highly imbalanced (94.7%)Imbalance
현황도형 생성일시 is highly imbalanced (84.0%)Imbalance

Reproduction

Analysis started2023-12-12 11:55:02.111156
Analysis finished2023-12-12 11:55:03.024543
Duration0.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

테이블명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
농림지역현황
206 
개발진흥지구현황
 
11
경관지구현황
 
1
보존지구현황
 
1
기타용도지역현황
 
1

Length

Max length8
Median length6
Mean length6.1090909
Min length6

Unique

Unique3 ?
Unique (%)1.4%

Sample

1st row농림지역현황
2nd row농림지역현황
3rd row농림지역현황
4th row농림지역현황
5th row농림지역현황

Common Values

ValueCountFrequency (%)
농림지역현황 206
93.6%
개발진흥지구현황 11
 
5.0%
경관지구현황 1
 
0.5%
보존지구현황 1
 
0.5%
기타용도지역현황 1
 
0.5%

Length

2023-12-12T20:55:03.115803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:55:03.258716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
농림지역현황 206
93.6%
개발진흥지구현황 11
 
5.0%
경관지구현황 1
 
0.5%
보존지구현황 1
 
0.5%
기타용도지역현황 1
 
0.5%

라벨명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct7
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
농림지역
206 
관광.휴양개발진흥지구
 
6
산업개발진흥지구
 
4
자연경관지구
 
1
문화자원보존지구
 
1
Other values (2)
 
2

Length

Max length11
Median length4
Mean length4.3136364
Min length4

Unique

Unique4 ?
Unique (%)1.8%

Sample

1st row농림지역
2nd row농림지역
3rd row농림지역
4th row농림지역
5th row농림지역

Common Values

ValueCountFrequency (%)
농림지역 206
93.6%
관광.휴양개발진흥지구 6
 
2.7%
산업개발진흥지구 4
 
1.8%
자연경관지구 1
 
0.5%
문화자원보존지구 1
 
0.5%
특정형개발진흥지구 1
 
0.5%
<NA> 1
 
0.5%

Length

2023-12-12T20:55:03.408351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:55:03.540124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
농림지역 206
93.6%
관광.휴양개발진흥지구 6
 
2.7%
산업개발진흥지구 4
 
1.8%
자연경관지구 1
 
0.5%
문화자원보존지구 1
 
0.5%
특정형개발진흥지구 1
 
0.5%
na 1
 
0.5%

면적 도형
Real number (ℝ)

Distinct218
Distinct (%)99.5%
Missing1
Missing (%)0.5%
Infinite0
Infinite (%)0.0%
Mean382687.97
Minimum0.55
Maximum31147365
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-12T20:55:03.696295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.55
5-th percentile76.035
Q1782.985
median3273.88
Q373522.375
95-th percentile972583.3
Maximum31147365
Range31147364
Interquartile range (IQR)72739.39

Descriptive statistics

Standard deviation2444199.1
Coefficient of variation (CV)6.3869243
Kurtosis123.23522
Mean382687.97
Median Absolute Deviation (MAD)3124.37
Skewness10.575544
Sum83808665
Variance5.9741092 × 1012
MonotonicityNot monotonic
2023-12-12T20:55:03.863451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.55 2
 
0.9%
264351.22 1
 
0.5%
3273.88 1
 
0.5%
1458.05 1
 
0.5%
181458.87 1
 
0.5%
1433.67 1
 
0.5%
619.0 1
 
0.5%
1623.11 1
 
0.5%
13475.71 1
 
0.5%
10051.25 1
 
0.5%
Other values (208) 208
94.5%
ValueCountFrequency (%)
0.55 2
0.9%
0.96 1
0.5%
1.31 1
0.5%
1.85 1
0.5%
2.59 1
0.5%
2.79 1
0.5%
4.65 1
0.5%
22.59 1
0.5%
38.72 1
0.5%
73.2 1
0.5%
ValueCountFrequency (%)
31147364.63 1
0.5%
15741343.62 1
0.5%
7984448.25 1
0.5%
6045154.07 1
0.5%
1563643.93 1
0.5%
1205215.54 1
0.5%
1109529.02 1
0.5%
1107135.57 1
0.5%
1058079.1 1
0.5%
992288.62 1
0.5%

길이 도형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0.0
218 
1592.41
 
1
<NA>
 
1

Length

Max length7
Median length3
Mean length3.0227273
Min length3

Unique

Unique2 ?
Unique (%)0.9%

Sample

1st row0.0
2nd row0.0
3rd row0.0
4th row0.0
5th row0.0

Common Values

ValueCountFrequency (%)
0.0 218
99.1%
1592.41 1
 
0.5%
<NA> 1
 
0.5%

Length

2023-12-12T20:55:04.034014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:55:04.208542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0.0 218
99.1%
1592.41 1
 
0.5%
na 1
 
0.5%

현황도형 생성일시
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2016-12-19
207 
42723
 
9
2017-06-05
 
1
43942
 
1
43585
 
1

Length

Max length10
Median length10
Mean length9.7227273
Min length4

Unique

Unique4 ?
Unique (%)1.8%

Sample

1st row2016-12-19
2nd row2016-12-19
3rd row2016-12-19
4th row2016-12-19
5th row2016-12-19

Common Values

ValueCountFrequency (%)
2016-12-19 207
94.1%
42723 9
 
4.1%
2017-06-05 1
 
0.5%
43942 1
 
0.5%
43585 1
 
0.5%
<NA> 1
 
0.5%

Length

2023-12-12T20:55:04.340344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:55:04.460528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2016-12-19 207
94.1%
42723 9
 
4.1%
2017-06-05 1
 
0.5%
43942 1
 
0.5%
43585 1
 
0.5%
na 1
 
0.5%

Interactions

2023-12-12T20:55:02.739715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:55:04.558437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
테이블명라벨명면적 도형길이 도형현황도형 생성일시
테이블명1.0001.0000.0000.4060.637
라벨명1.0001.0000.0000.6550.680
면적 도형0.0000.0001.0000.0000.000
길이 도형0.4060.6550.0001.0001.000
현황도형 생성일시0.6370.6800.0001.0001.000
2023-12-12T20:55:04.672525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
현황도형 생성일시라벨명길이 도형테이블명
현황도형 생성일시1.0000.5410.9930.565
라벨명0.5411.0000.4740.995
길이 도형0.9930.4741.0000.271
테이블명0.5650.9950.2711.000
2023-12-12T20:55:04.777603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적 도형테이블명라벨명길이 도형현황도형 생성일시
면적 도형1.0000.0000.0000.0000.000
테이블명0.0001.0000.9950.2710.565
라벨명0.0000.9951.0000.4740.541
길이 도형0.0000.2710.4741.0000.993
현황도형 생성일시0.0000.5650.5410.9931.000

Missing values

2023-12-12T20:55:02.871315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:55:02.980392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

테이블명라벨명면적 도형길이 도형현황도형 생성일시
0농림지역현황농림지역264351.220.02016-12-19
1농림지역현황농림지역7854.980.02016-12-19
2농림지역현황농림지역5829.550.02016-12-19
3농림지역현황농림지역184.250.02016-12-19
4농림지역현황농림지역171.420.02016-12-19
5농림지역현황농림지역515.560.02016-12-19
6농림지역현황농림지역421.620.02016-12-19
7농림지역현황농림지역6398.250.02016-12-19
8농림지역현황농림지역22.590.02016-12-19
9농림지역현황농림지역445841.20.02016-12-19
테이블명라벨명면적 도형길이 도형현황도형 생성일시
210개발진흥지구현황특정형개발진흥지구398133.110.042723
211개발진흥지구현황산업개발진흥지구301210.530.042723
212개발진흥지구현황산업개발진흥지구502903.710.042723
213개발진흥지구현황산업개발진흥지구65455.950.042723
214개발진흥지구현황관광.휴양개발진흥지구1205215.540.042723
215개발진흥지구현황관광.휴양개발진흥지구58191.070.042723
216개발진흥지구현황관광.휴양개발진흥지구247256.160.042723
217개발진흥지구현황관광.휴양개발진흥지구77438.050.042723
218개발진흥지구현황관광.휴양개발진흥지구232942.210.042723
219기타용도지역현황<NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

테이블명라벨명면적 도형길이 도형현황도형 생성일시# duplicates
0농림지역현황농림지역0.550.02016-12-192