Overview

Dataset statistics

Number of variables4
Number of observations86
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.0 KiB
Average record size in memory35.5 B

Variable types

Text1
Categorical1
Numeric2

Dataset

Description경기도 평택시 도시계획정보시스템(UPIS) 농림지역 현황으로 현황도형 관리번호, 라벨명, 면적(도형), 길이(도형) 의 항목을 제공합니다. ※문의 : 평택시 도시계획과(031-8024-3923)
URLhttps://www.data.go.kr/data/15116817/fileData.do

Alerts

라벨명 has constant value ""Constant
면적_도형 is highly overall correlated with 길이_도형High correlation
길이_도형 is highly overall correlated with 면적_도형High correlation
현황도형 관리번호 has unique valuesUnique
면적_도형 has unique valuesUnique
길이_도형 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:09:06.695384
Analysis finished2023-12-12 12:09:07.431806
Duration0.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct86
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size820.0 B
2023-12-12T21:09:07.612073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length24
Mean length24
Min length24

Characters and Unicode

Total characters2064
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique86 ?
Unique (%)100.0%

Sample

1st row41220UQ113PS202104020001
2nd row41220UQ113PS202104020002
3rd row41220UQ113PS202104020003
4th row41220UQ113PS202104020004
5th row41220UQ113PS202104020005
ValueCountFrequency (%)
41220uq113ps202104020001 1
 
1.2%
41220uq113ps202104020054 1
 
1.2%
41220uq113ps202104020062 1
 
1.2%
41220uq113ps202104020061 1
 
1.2%
41220uq113ps202104020060 1
 
1.2%
41220uq113ps202104020059 1
 
1.2%
41220uq113ps202104020058 1
 
1.2%
41220uq113ps202104020057 1
 
1.2%
41220uq113ps202104020056 1
 
1.2%
41220uq113ps202104020068 1
 
1.2%
Other values (76) 76
88.4%
2023-12-12T21:09:08.098158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 536
26.0%
2 447
21.7%
1 363
17.6%
4 188
 
9.1%
3 108
 
5.2%
U 86
 
4.2%
Q 86
 
4.2%
P 86
 
4.2%
S 86
 
4.2%
6 21
 
1.0%
Other values (4) 57
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1720
83.3%
Uppercase Letter 344
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 536
31.2%
2 447
26.0%
1 363
21.1%
4 188
 
10.9%
3 108
 
6.3%
6 21
 
1.2%
5 19
 
1.1%
7 18
 
1.0%
8 13
 
0.8%
9 7
 
0.4%
Uppercase Letter
ValueCountFrequency (%)
U 86
25.0%
Q 86
25.0%
P 86
25.0%
S 86
25.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1720
83.3%
Latin 344
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 536
31.2%
2 447
26.0%
1 363
21.1%
4 188
 
10.9%
3 108
 
6.3%
6 21
 
1.2%
5 19
 
1.1%
7 18
 
1.0%
8 13
 
0.8%
9 7
 
0.4%
Latin
ValueCountFrequency (%)
U 86
25.0%
Q 86
25.0%
P 86
25.0%
S 86
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2064
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 536
26.0%
2 447
21.7%
1 363
17.6%
4 188
 
9.1%
3 108
 
5.2%
U 86
 
4.2%
Q 86
 
4.2%
P 86
 
4.2%
S 86
 
4.2%
6 21
 
1.0%
Other values (4) 57
 
2.8%

라벨명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size820.0 B
농림지역
86 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row농림지역
2nd row농림지역
3rd row농림지역
4th row농림지역
5th row농림지역

Common Values

ValueCountFrequency (%)
농림지역 86
100.0%

Length

2023-12-12T21:09:08.252598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:09:08.661487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
농림지역 86
100.0%

면적_도형
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct86
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1639315.9
Minimum2.18
Maximum22386742
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size906.0 B
2023-12-12T21:09:08.817055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.18
5-th percentile241.905
Q119923.717
median148513.33
Q3803996.7
95-th percentile8138702.6
Maximum22386742
Range22386740
Interquartile range (IQR)784072.98

Descriptive statistics

Standard deviation4059285.6
Coefficient of variation (CV)2.4762071
Kurtosis12.926244
Mean1639315.9
Median Absolute Deviation (MAD)144237.65
Skewness3.5505998
Sum1.4098116 × 108
Variance1.6477799 × 1013
MonotonicityNot monotonic
2023-12-12T21:09:08.979008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18679.32 1
 
1.2%
10236.78 1
 
1.2%
26005.57 1
 
1.2%
4655.2 1
 
1.2%
15522.54 1
 
1.2%
253588.52 1
 
1.2%
66734.88 1
 
1.2%
189834.69 1
 
1.2%
11842.86 1
 
1.2%
30529.34 1
 
1.2%
Other values (76) 76
88.4%
ValueCountFrequency (%)
2.18 1
1.2%
11.92 1
1.2%
141.77 1
1.2%
142.04 1
1.2%
189.52 1
1.2%
399.06 1
1.2%
740.82 1
1.2%
835.24 1
1.2%
1091.2 1
1.2%
3896.15 1
1.2%
ValueCountFrequency (%)
22386742.44 1
1.2%
17553713.14 1
1.2%
16797791.86 1
1.2%
15569240.08 1
1.2%
8345659.88 1
1.2%
7517830.63 1
1.2%
7252758.7 1
1.2%
5879139.22 1
1.2%
4161559.65 1
1.2%
3810951.56 1
1.2%

길이_도형
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct86
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11207.524
Minimum23.15
Maximum132745.59
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size906.0 B
2023-12-12T21:09:09.153668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum23.15
5-th percentile107.61
Q1767.44
median2867.09
Q37467.5325
95-th percentile40641.1
Maximum132745.59
Range132722.44
Interquartile range (IQR)6700.0925

Descriptive statistics

Standard deviation24871.428
Coefficient of variation (CV)2.2191725
Kurtosis13.666665
Mean11207.524
Median Absolute Deviation (MAD)2517.295
Skewness3.6761614
Sum963847.05
Variance6.1858794 × 108
MonotonicityNot monotonic
2023-12-12T21:09:09.357889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
648.04 1
 
1.2%
459.33 1
 
1.2%
771.01 1
 
1.2%
337.96 1
 
1.2%
676.13 1
 
1.2%
3012.26 1
 
1.2%
1174.02 1
 
1.2%
3426.03 1
 
1.2%
562.76 1
 
1.2%
947.96 1
 
1.2%
Other values (76) 76
88.4%
ValueCountFrequency (%)
23.15 1
1.2%
52.66 1
1.2%
58.52 1
1.2%
85.25 1
1.2%
97.74 1
1.2%
137.22 1
1.2%
147.73 1
1.2%
172.61 1
1.2%
261.06 1
1.2%
337.05 1
1.2%
ValueCountFrequency (%)
132745.59 1
1.2%
116737.85 1
1.2%
114212.41 1
1.2%
95737.99 1
1.2%
41895.65 1
1.2%
36877.45 1
1.2%
35750.92 1
1.2%
30903.66 1
1.2%
30690.76 1
1.2%
26566.66 1
1.2%

Interactions

2023-12-12T21:09:07.010037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:09:06.809940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:09:07.127469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:09:06.908339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:09:09.472706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
현황도형 관리번호면적_도형길이_도형
현황도형 관리번호1.0001.0001.000
면적_도형1.0001.0000.965
길이_도형1.0000.9651.000
2023-12-12T21:09:09.588572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적_도형길이_도형
면적_도형1.0000.982
길이_도형0.9821.000

Missing values

2023-12-12T21:09:07.281782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:09:07.390218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

현황도형 관리번호라벨명면적_도형길이_도형
041220UQ113PS202104020001농림지역18679.32648.04
141220UQ113PS202104020002농림지역141.7752.66
241220UQ113PS202104020003농림지역1937147.99646.68
341220UQ113PS202104020004농림지역1450785.387472.61
441220UQ113PS202104020005농림지역740.82172.61
541220UQ113PS202104020006농림지역1484252.3610761.74
641220UQ113PS202104020007농림지역399.0685.25
741220UQ113PS202104020008농림지역5879139.2230690.76
841220UQ113PS202104020009농림지역97967.632109.58
941220UQ113PS202104020010농림지역77051.761229.76
현황도형 관리번호라벨명면적_도형길이_도형
7641220UQ113PS202104020077농림지역1740504.1321117.68
7741220UQ113PS202104020078농림지역741032.027142.74
7841220UQ113PS202104020079농림지역17553713.14132745.59
7941220UQ113PS202104020080농림지역12782.67766.25
8041220UQ113PS202104020081농림지역3896.15342.93
8141220UQ113PS202104020082농림지역2.1823.15
8241220UQ113PS202106300001농림지역2453166.6711071.67
8341220UQ113PS202104020084농림지역2665748.0941895.65
8441220UQ113PS202104020085농림지역105233.12144.51
8541220UQ113PS202106300003농림지역1875941.9920790.94