Overview

Dataset statistics

Number of variables: 5
Number of observations: 53
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.3 KiB
Average record size in memory: 44.5 B

Variable types

Text: 1
Categorical: 2
Numeric: 2

Dataset

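Description: Status of scenic districts (경관지구), aesthetic districts (미관지구), disaster-prevention districts (방재지구), height-restriction districts (고도지구), and settlement districts (취락지구) in the Urban Planning Information System (UPIS) of Yangju-si, Gyeonggi-do. The dataset provides fields such as 현황도형 관리번호 (status-shape management number), 라벨명 (label name), 면적(도형) (shape area), and 길이(도형) (shape length).
Author: Public Data Portal (공공데이터포털)
URL: https://www.data.go.kr/data/15115986/fileData.do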

Alerts

면적(도형) is highly overall correlated with 길이(도형) and 2 other fields (High correlation)
길이(도형) is highly overall correlated with 면적(도형) and 2 other fields (High correlation)
라벨명 is highly overall correlated with 면적(도형) and 2 other fields (High correlation)
현황도형 생성일 is highly overall correlated with 면적(도형) and 2 other fields (High correlation)
라벨명 is highly imbalanced (63.2%) (Imbalance)
현황도형 관리번호 has unique values (Unique)
면적(도형) has unique values (Unique)
길이(도형) has unique values (Unique)
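
The 63.2% imbalance figure for 라벨명 can be reproduced from the class counts listed under Common Values (47, 5, 1). The sketch below assumes the score is one minus the Shannon entropy of the class frequencies divided by the logarithm of the number of classes; that assumption matches the reported value, but the function name and the single-class handling are illustrative rather than taken from the report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k): 0 means perfectly balanced, 1 means one class dominates.

    H is the Shannon entropy of the class frequencies, k the number of classes.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is reported as not imbalanced
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

# Class counts of 라벨명 as listed in this report.
label_counts = [47, 5, 1]
score = imbalance_score(label_counts)
print(f"{score:.1%}")  # prints 63.2%
```

Because the entropy is divided by the log of the class count, the result does not depend on the logarithm base.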

Reproduction

Analysis started: 2024-04-17 15:09:39.744922
Analysis finished: 2024-04-17 15:09:40.314870
Duration: 0.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

현황도형 관리번호
Text

UNIQUE

Distinct: 53
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 556.0 B

Length

Max length: 24
Median length: 24
Mean length: 24
Min length: 24

Characters and Unicode

Total characters: 1272
Distinct characters: 14
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
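
As a minimal sketch of this kind of analysis, Python's standard `unicodedata` module can tally the general category of every character in a text column. The helper name `category_counts` is illustrative; the two identifiers are taken from the Sample rows of this report. The short codes `Nd` and `Lu` correspond to the "Decimal Number" and "Uppercase Letter" categories shown in the tables of this section.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Tally Unicode general categories (e.g. 'Nd', 'Lu') over all characters."""
    counts = Counter()
    for value in values:
        counts.update(unicodedata.category(ch) for ch in value)
    return counts

# Two identifiers copied from the Sample rows of 현황도형 관리번호.
sample_ids = ["41630UQ128PS201609260041", "41630UQ128PS201609260043"]
counts = category_counts(sample_ids)
print(dict(counts))
```

Each 24-character identifier contributes 20 decimal digits and 4 uppercase letters, which is consistent with the report's totals of 1060 and 212 over 53 rows.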

Unique

Unique: 53
Unique (%): 100.0%

Sample

1st row: 41630UQ128PS201609260041
2nd row: 41630UQ128PS201609260043
3rd row: 41630UQ128PS201609260044
4th row: 41630UQ128PS201609260009
5th row: 41630UQ128PS201609260010

Value                      Count  Frequency (%)
41630uq128ps201609260041   1      1.9%
41630uq128ps201609260051   1      1.9%
41630uq128ps201609260053   1      1.9%
41630uq128ps201609260054   1      1.9%
41630uq128ps201609260029   1      1.9%
41630uq128ps201609260030   1      1.9%
41630uq128ps201609260031   1      1.9%
41630uq128ps201609260032   1      1.9%
41630uq128ps201609260033   1      1.9%
41630uq128ps201609260034   1      1.9%
Other values (43)          43     81.1%

Most occurring characters

Value             Count  Frequency (%)
0                 277    21.8%
1                 178    14.0%
2                 176    13.8%
6                 162    12.7%
4                 69     5.4%
3                 69     5.4%
8                 57     4.5%
9                 57     4.5%
U                 53     4.2%
Q                 53     4.2%
Other values (4)  121    9.5%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    1060   83.3%
Uppercase Letter  212    16.7%

Most frequent character per category

Decimal Number

Value  Count  Frequency (%)
0      277    26.1%
1      178    16.8%
2      176    16.6%
6      162    15.3%
4      69     6.5%
3      69     6.5%
8      57     5.4%
9      57     5.4%
5      9      0.8%
7      6      0.6%

Uppercase Letter

Value  Count  Frequency (%)
U      53     25.0%
Q      53     25.0%
P      53     25.0%
S      53     25.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  1060   83.3%
Latin   212    16.7%

Most frequent character per script

Common

Value  Count  Frequency (%)
0      277    26.1%
1      178    16.8%
2      176    16.6%
6      162    15.3%
4      69     6.5%
3      69     6.5%
8      57     5.4%
9      57     5.4%
5      9      0.8%
7      6      0.6%

Latin

Value  Count  Frequency (%)
U      53     25.0%
Q      53     25.0%
P      53     25.0%
S      53     25.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  1272   100.0%

Most frequent character per block

ASCII

Value             Count  Frequency (%)
0                 277    21.8%
1                 178    14.0%
2                 176    13.8%
6                 162    12.7%
4                 69     5.4%
3                 69     5.4%
8                 57     4.5%
9                 57     4.5%
U                 53     4.2%
Q                 53     4.2%
Other values (4)  121    9.5%

라벨명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 5.7%
Missing: 0
Missing (%): 0.0%
Memory size: 556.0 B

Length

Max length: 8
Median length: 6
Mean length: 6.0377358
Min length: 6
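
These length statistics follow directly from the three label values and their counts (47, 5, 1) reported under Common Values. The sketch below recomputes them with the standard library. It assumes lengths are counted in Unicode code points and that the Hangul labels are stored in precomposed (NFC) form, so each syllable counts as one character.

```python
import statistics

# Label values and their counts as listed in this report.
label_counts = {"자연취락지구": 47, "집단취락지구": 5, "도락산불곡산지구": 1}

# One length per row: len() counts code points (assumes precomposed Hangul).
lengths = [len(label) for label, n in label_counts.items() for _ in range(n)]

mean_length = statistics.mean(lengths)      # (47*6 + 5*6 + 1*8) / 53 = 320 / 53
median_length = statistics.median(lengths)
print(min(lengths), median_length, round(mean_length, 7), max(lengths))
```

The single 8-character label is what lifts the mean slightly above the median of 6.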

Unique

Unique: 1
Unique (%): 1.9%

Sample

1st row: 자연취락지구
2nd row: 자연취락지구
3rd row: 자연취락지구
4th row: 자연취락지구
5th row: 자연취락지구

Common Values

Value                                              Count  Frequency (%)
자연취락지구 (natural settlement district)         47     88.7%
집단취락지구 (collective settlement district)      5      9.4%
도락산불곡산지구 (Doraksan-Bulgoksan district)     1      1.9%

Length

Histogram of lengths of the category

Common Values (Plot)

[Bar chart of value counts: 자연취락지구 47 (88.7%), 집단취락지구 5 (9.4%), 도락산불곡산지구 1 (1.9%)]

면적(도형)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 53
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 127769.04
Minimum: 3702.81
Maximum: 3604533.2
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 609.0 B

Quantile statistics

Minimum: 3702.81
5-th percentile: 14444.728
Q1: 24587.02
Median: 39566.92
Q3: 65763.79
95-th percentile: 207182.29
Maximum: 3604533.2
Range: 3600830.4
Interquartile range (IQR): 41176.77

Descriptive statistics

Standard deviation: 491093.85
Coefficient of variation (CV): 3.8436061
Kurtosis: 51.042922
Mean: 127769.04
Median Absolute Deviation (MAD): 17408.31
Skewness: 7.0882996
Sum: 6771758.9
Variance: 2.4117317 × 10^11
Monotonicity: Not monotonic
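
Several of these figures are derived from others in the same section, so they can be cross-checked by hand. The sketch below recomputes the coefficient of variation, variance, interquartile range, and range from the rounded values printed in this report. The variable names are illustrative, and the maximum uses the unrounded 3604533.21 from the extreme-value listing.

```python
# Rounded summary values copied from the 면적(도형) section of this report.
mean = 127769.04
std = 491093.85
q1, q3 = 24587.02, 65763.79
minimum, maximum = 3702.81, 3604533.21

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # full range of the data

print(f"CV={cv:.4f}  variance={variance:.4e}  IQR={iqr:.2f}  range={value_range:.2f}")
```

A CV near 3.84, together with skewness above 7, reflects the single very large polygon (3604533.21) that dominates the mean and standard deviation.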
Histogram with fixed size bins (bins=50)
Common values

Value              Count  Frequency (%)
228557.6           1      1.9%
34284.67           1      1.9%
53391.9            1      1.9%
17024.66           1      1.9%
32612.6            1      1.9%
62960.58           1      1.9%
24587.02           1      1.9%
51670.5            1      1.9%
18575.02           1      1.9%
10830.25           1      1.9%
Other values (43)  43     81.1%

Minimum 10 values

Value     Count  Frequency (%)
3702.81   1      1.9%
8855.38   1      1.9%
10830.25  1      1.9%
16854.38  1      1.9%
17024.66  1      1.9%
18575.02  1      1.9%
18817.5   1      1.9%
19541.6   1      1.9%
22073.43  1      1.9%
22158.61  1      1.9%

Maximum 10 values

Value       Count  Frequency (%)
3604533.21  1      1.9%
380523.36   1      1.9%
228557.6    1      1.9%
192932.08   1      1.9%
170385.14   1      1.9%
164577.93   1      1.9%
129179.8    1      1.9%
110592.57   1      1.9%
107086.78   1      1.9%
96572.24    1      1.9%

길이(도형)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 53
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1722.5972
Minimum: 282.15
Maximum: 9676.31
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 609.0 B

Quantile statistics

Minimum: 282.15
5-th percentile: 654.494
Q1: 890.67
Median: 1364.9
Q3: 1997.9
95-th percentile: 3509.498
Maximum: 9676.31
Range: 9394.16
Interquartile range (IQR): 1107.23

Descriptive statistics

Standard deviation: 1454.152
Coefficient of variation (CV): 0.84416257
Kurtosis: 17.214176
Mean: 1722.5972
Median Absolute Deviation (MAD): 480.6
Skewness: 3.6029791
Sum: 91297.65
Variance: 2114558.2
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value              Count  Frequency (%)
3461.56            1      1.9%
1034.52            1      1.9%
1260.98            1      1.9%
1219.05            1      1.9%
984.01             1      1.9%
1668.81            1      1.9%
832.34             1      1.9%
1135.97            1      1.9%
635.81             1      1.9%
742.72             1      1.9%
Other values (43)  43     81.1%

Minimum 10 values

Value   Count  Frequency (%)
282.15  1      1.9%
422.35  1      1.9%
635.81  1      1.9%
666.95  1      1.9%
668.22  1      1.9%
727.72  1      1.9%
742.72  1      1.9%
785.67  1      1.9%
811.23  1      1.9%
832.34  1      1.9%

Maximum 10 values

Value    Count  Frequency (%)
9676.31  1      1.9%
5324.62  1      1.9%
3575.51  1      1.9%
3465.49  1      1.9%
3461.56  1      1.9%
2967.01  1      1.9%
2959.86  1      1.9%
2751.65  1      1.9%
2355.8   1      1.9%
2178.36  1      1.9%

현황도형 생성일
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 7.5%
Missing: 0
Missing (%): 0.0%
Memory size: 556.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 2
Unique (%): 3.8%

Sample

1st row: 2016-09-12
2nd row: 2016-09-12
3rd row: 2016-09-12
4th row: 2010-01-01
5th row: 2010-01-01

Common Values

Value       Count  Frequency (%)
2016-09-12  37     69.8%
2010-01-01  14     26.4%
2015-01-11  1      1.9%
2017-12-25  1      1.9%

Length

Histogram of lengths of the category

Common Values (Plot)

[Bar chart of value counts: 2016-09-12 37 (69.8%), 2010-01-01 14 (26.4%), 2015-01-11 1 (1.9%), 2017-12-25 1 (1.9%)]

Interactions


Correlations

                   현황도형 관리번호  라벨명  면적(도형)  길이(도형)  현황도형 생성일
현황도형 관리번호  1.000              1.000   1.000       1.000       1.000
라벨명             1.000              1.000   0.937       0.931       0.742
면적(도형)         1.000              0.937   1.000       1.000       0.661
길이(도형)         1.000              0.931   1.000       1.000       0.720
현황도형 생성일    1.000              0.742   0.661       0.720       1.000

                 현황도형 생성일  라벨명
현황도형 생성일  1.000            0.781
라벨명           0.781            1.000

                 면적(도형)  길이(도형)  라벨명  현황도형 생성일
면적(도형)       1.000       0.898       0.694   0.681
길이(도형)       0.898       1.000       0.661   0.542
라벨명           0.694       0.661       1.000   0.781
현황도형 생성일  0.681       0.542       0.781   1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

    현황도형 관리번호         라벨명            면적(도형)  길이(도형)  현황도형 생성일
0   41630UQ128PS201609260041  자연취락지구      228557.6    3461.56     2016-09-12
1   41630UQ128PS201609260043  자연취락지구      51895.63    2064.67     2016-09-12
2   41630UQ128PS201609260044  자연취락지구      107086.78   2178.36     2016-09-12
3   41630UQ128PS201609260009  자연취락지구      41999.52    1626.92     2010-01-01
4   41630UQ128PS201609260010  자연취락지구      24375.97    668.22      2010-01-01
5   41630UQ128PS201609260045  자연취락지구      170385.14   2355.8      2016-09-12
6   41630UQ128PS201609260001  집단취락지구      22073.43    666.95      2010-01-01
7   41630UQ128PS201609260002  집단취락지구      26662.55    1440.7      2010-01-01
8   41630UQ128PS201609260003  자연취락지구      3702.81     282.15      2010-01-01
9   41630UQ128PS201609260004  집단취락지구      8855.38     422.35      2010-01-01

Last rows

    현황도형 관리번호         라벨명            면적(도형)  길이(도형)  현황도형 생성일
43  41630UQ128PS201609260040  자연취락지구      78575.25    1997.9      2016-09-12
44  41630UQ128PS201609260046  자연취락지구      31817.3     1176.35     2016-09-12
45  41630UQ128PS201609260011  집단취락지구      28105.94    1472.89     2010-01-01
46  41630UQ128PS201609260012  자연취락지구      18817.5     1108.56     2010-01-01
47  41630UQ128PS201609260013  집단취락지구      34213.17    1508.03     2010-01-01
48  41630UQ128PS201609260014  자연취락지구      37112.82    1544.21     2010-01-01
49  41630UQ128PS201609260015  자연취락지구      19541.6     811.23      2010-01-01
50  41630UQ128PS201609260016  자연취락지구      45194.95    1477.33     2010-01-01
51  41630UQ128PS201609260017  자연취락지구      39631.14    884.3       2015-01-11
52  41630UQ121PS201712260001  도락산불곡산지구  3604533.21  9676.31     2017-12-25