Overview

Dataset statistics

Number of variables6
Number of observations52
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.7 KiB
Average record size in memory53.5 B

Variable types

Text1
Categorical1
Numeric3
DateTime1

Dataset

Description경기도 양주시 도시계획정보시스템(UPIS) 취락지구 현황으로 현황도형 관리번호, 라벨명, 면적(도형), 면적(길이), 도면번호, 현황도현 생성일 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15115910/fileData.do

Alerts

면적(도형) is highly overall correlated with 면적(길이)High correlation
면적(길이) is highly overall correlated with 면적(도형)High correlation
라벨명 is highly imbalanced (54.3%)Imbalance
현황도형 관리번호 has unique valuesUnique
면적(도형) has unique valuesUnique
면적(길이) has unique valuesUnique

Reproduction

Analysis started2023-12-12 15:02:52.052940
Analysis finished2023-12-12 15:02:53.205572
Duration1.15 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-13T00:02:53.349200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length24
Mean length24
Min length24

Characters and Unicode

Total characters1248
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)100.0%

Sample

1st row41630UQ128PS201609260041
2nd row41630UQ128PS201609260043
3rd row41630UQ128PS201609260044
4th row41630UQ128PS201609260009
5th row41630UQ128PS201609260010
ValueCountFrequency (%)
41630uq128ps201609260041 1
 
1.9%
41630uq128ps201609260043 1
 
1.9%
41630uq128ps201609260036 1
 
1.9%
41630uq128ps201609260052 1
 
1.9%
41630uq128ps201609260053 1
 
1.9%
41630uq128ps201609260054 1
 
1.9%
41630uq128ps201609260029 1
 
1.9%
41630uq128ps201609260030 1
 
1.9%
41630uq128ps201609260031 1
 
1.9%
41630uq128ps201609260032 1
 
1.9%
Other values (42) 42
80.8%
2023-12-13T00:02:53.675325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 272
21.8%
1 172
13.8%
2 172
13.8%
6 160
12.8%
4 68
 
5.4%
3 68
 
5.4%
8 57
 
4.6%
9 57
 
4.6%
U 52
 
4.2%
Q 52
 
4.2%
Other values (4) 118
9.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1040
83.3%
Uppercase Letter 208
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 272
26.2%
1 172
16.5%
2 172
16.5%
6 160
15.4%
4 68
 
6.5%
3 68
 
6.5%
8 57
 
5.5%
9 57
 
5.5%
5 9
 
0.9%
7 5
 
0.5%
Uppercase Letter
ValueCountFrequency (%)
U 52
25.0%
Q 52
25.0%
P 52
25.0%
S 52
25.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1040
83.3%
Latin 208
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 272
26.2%
1 172
16.5%
2 172
16.5%
6 160
15.4%
4 68
 
6.5%
3 68
 
6.5%
8 57
 
5.5%
9 57
 
5.5%
5 9
 
0.9%
7 5
 
0.5%
Latin
ValueCountFrequency (%)
U 52
25.0%
Q 52
25.0%
P 52
25.0%
S 52
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1248
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 272
21.8%
1 172
13.8%
2 172
13.8%
6 160
12.8%
4 68
 
5.4%
3 68
 
5.4%
8 57
 
4.6%
9 57
 
4.6%
U 52
 
4.2%
Q 52
 
4.2%
Other values (4) 118
9.5%

라벨명
Categorical

IMBALANCE 

Distinct2
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size548.0 B
자연취락지구
47 
집단취락지구

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자연취락지구
2nd row자연취락지구
3rd row자연취락지구
4th row자연취락지구
5th row자연취락지구

Common Values

ValueCountFrequency (%)
자연취락지구 47
90.4%
집단취락지구 5
 
9.6%

Length

2023-12-13T00:02:53.812686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:02:53.915465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자연취락지구 47
90.4%
집단취락지구 5
 
9.6%

면적(도형)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean60908.187
Minimum3702.81
Maximum380523.36
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size600.0 B
2023-12-13T00:02:54.091206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3702.81
5-th percentile14143.522
Q124579.59
median38656.515
Q365760.355
95-th percentile180531.26
Maximum380523.36
Range376820.55
Interquartile range (IQR)41180.765

Descriptive statistics

Standard deviation65776.838
Coefficient of variation (CV)1.0799343
Kurtosis10.81996
Mean60908.187
Median Absolute Deviation (MAD)15789.325
Skewness2.9505952
Sum3167225.7
Variance4.3265925 × 109
MonotonicityNot monotonic
2023-12-13T00:02:54.514577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
228557.6 1
 
1.9%
164577.93 1
 
1.9%
53391.9 1
 
1.9%
17024.66 1
 
1.9%
32612.6 1
 
1.9%
62960.58 1
 
1.9%
24587.02 1
 
1.9%
51670.5 1
 
1.9%
18575.02 1
 
1.9%
10830.25 1
 
1.9%
Other values (42) 42
80.8%
ValueCountFrequency (%)
3702.81 1
1.9%
8855.38 1
1.9%
10830.25 1
1.9%
16854.38 1
1.9%
17024.66 1
1.9%
18575.02 1
1.9%
18817.5 1
1.9%
19541.6 1
1.9%
22073.43 1
1.9%
22158.61 1
1.9%
ValueCountFrequency (%)
380523.36 1
1.9%
228557.6 1
1.9%
192932.08 1
1.9%
170385.14 1
1.9%
164577.93 1
1.9%
129179.8 1
1.9%
110592.57 1
1.9%
107086.78 1
1.9%
96572.24 1
1.9%
82333.88 1
1.9%

면적(길이)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1569.6412
Minimum282.15
Maximum5324.62
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size600.0 B
2023-12-13T00:02:54.682220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum282.15
5-th percentile652.937
Q1890.395
median1342.275
Q31933.8275
95-th percentile3463.3285
Maximum5324.62
Range5042.47
Interquartile range (IQR)1043.4325

Descriptive statistics

Standard deviation944.32162
Coefficient of variation (CV)0.60161624
Kurtosis4.0528123
Mean1569.6412
Median Absolute Deviation (MAD)455.34
Skewness1.7495332
Sum81621.34
Variance891743.31
MonotonicityNot monotonic
2023-12-13T00:02:54.843388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3461.56 1
 
1.9%
2751.65 1
 
1.9%
1260.98 1
 
1.9%
1219.05 1
 
1.9%
984.01 1
 
1.9%
1668.81 1
 
1.9%
832.34 1
 
1.9%
1135.97 1
 
1.9%
635.81 1
 
1.9%
742.72 1
 
1.9%
Other values (42) 42
80.8%
ValueCountFrequency (%)
282.15 1
1.9%
422.35 1
1.9%
635.81 1
1.9%
666.95 1
1.9%
668.22 1
1.9%
727.72 1
1.9%
742.72 1
1.9%
785.67 1
1.9%
811.23 1
1.9%
832.34 1
1.9%
ValueCountFrequency (%)
5324.62 1
1.9%
3575.51 1
1.9%
3465.49 1
1.9%
3461.56 1
1.9%
2967.01 1
1.9%
2959.86 1
1.9%
2751.65 1
1.9%
2355.8 1
1.9%
2178.36 1
1.9%
2171.61 1
1.9%

도면번호
Real number (ℝ)

Distinct46
Distinct (%)88.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24.442308
Minimum1
Maximum55
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size600.0 B
2023-12-13T00:02:55.007260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1.55
Q17.75
median21.5
Q342.25
95-th percentile52.45
Maximum55
Range54
Interquartile range (IQR)34.5

Descriptive statistics

Standard deviation18.015694
Coefficient of variation (CV)0.7370701
Kurtosis-1.3790699
Mean24.442308
Median Absolute Deviation (MAD)17
Skewness0.27847488
Sum1271
Variance324.56523
MonotonicityNot monotonic
2023-12-13T00:02:55.153259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)
ValueCountFrequency (%)
1 3
 
5.8%
2 2
 
3.8%
5 2
 
3.8%
3 2
 
3.8%
4 2
 
3.8%
14 1
 
1.9%
34 1
 
1.9%
46 1
 
1.9%
45 1
 
1.9%
44 1
 
1.9%
Other values (36) 36
69.2%
ValueCountFrequency (%)
1 3
5.8%
2 2
3.8%
3 2
3.8%
4 2
3.8%
5 2
3.8%
6 1
 
1.9%
7 1
 
1.9%
8 1
 
1.9%
9 1
 
1.9%
10 1
 
1.9%
ValueCountFrequency (%)
55 1
1.9%
54 1
1.9%
53 1
1.9%
52 1
1.9%
51 1
1.9%
50 1
1.9%
49 1
1.9%
48 1
1.9%
47 1
1.9%
46 1
1.9%
Distinct3
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Memory size548.0 B
Minimum2010-01-01 00:00:00
Maximum2016-09-12 00:00:00
2023-12-13T00:02:55.297517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:02:55.412738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=3)

Interactions

2023-12-13T00:02:52.733040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:02:52.267399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:02:52.512230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:02:52.828184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:02:52.354556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:02:52.596614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:02:52.909325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:02:52.430729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:02:52.663672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:02:55.516400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
현황도형 관리번호라벨명면적(도형)면적(길이)도면번호현황도형 생성일
현황도형 관리번호1.0001.0001.0001.0001.0001.000
라벨명1.0001.0000.0000.2190.6050.317
면적(도형)1.0000.0001.0000.9060.5180.000
면적(길이)1.0000.2190.9061.0000.6770.000
도면번호1.0000.6050.5180.6771.0000.564
현황도형 생성일1.0000.3170.0000.0000.5641.000
2023-12-13T00:02:55.615789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적(도형)면적(길이)도면번호라벨명
면적(도형)1.0000.891-0.1130.000
면적(길이)0.8911.000-0.1730.097
도면번호-0.113-0.1731.0000.426
라벨명0.0000.0970.4261.000

Missing values

2023-12-13T00:02:53.023097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:02:53.156896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

현황도형 관리번호라벨명면적(도형)면적(길이)도면번호현황도형 생성일
041630UQ128PS201609260041자연취락지구228557.63461.56142016-09-12
141630UQ128PS201609260043자연취락지구51895.632064.67202016-09-12
241630UQ128PS201609260044자연취락지구107086.782178.36212016-09-12
341630UQ128PS201609260009자연취락지구41999.521626.92272010-01-01
441630UQ128PS201609260010자연취락지구24375.97668.22232010-01-01
541630UQ128PS201609260045자연취락지구170385.142355.852016-09-12
641630UQ128PS201609260001집단취락지구22073.43666.9532010-01-01
741630UQ128PS201609260002집단취락지구26662.551440.712010-01-01
841630UQ128PS201609260003자연취락지구3702.81282.1512010-01-01
941630UQ128PS201609260004집단취락지구8855.38422.3542010-01-01
현황도형 관리번호라벨명면적(도형)면적(길이)도면번호현황도형 생성일
4241630UQ128PS201609260039자연취락지구69557.062171.61122016-09-12
4341630UQ128PS201609260040자연취락지구78575.251997.9192016-09-12
4441630UQ128PS201609260046자연취락지구31817.31176.3562016-09-12
4541630UQ128PS201609260011집단취락지구28105.941472.8952010-01-01
4641630UQ128PS201609260012자연취락지구18817.51108.56262010-01-01
4741630UQ128PS201609260013집단취락지구34213.171508.0322010-01-01
4841630UQ128PS201609260014자연취락지구37112.821544.2112010-01-01
4941630UQ128PS201609260015자연취락지구19541.6811.23292010-01-01
5041630UQ128PS201609260016자연취락지구45194.951477.33222010-01-01
5141630UQ128PS201609260017자연취락지구39631.14884.3392015-01-11