Overview

Dataset statistics

Number of variables4
Number of observations80
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.8 KiB
Average record size in memory35.6 B

Variable types

Text1
Categorical1
Numeric2

Dataset

Description충청남도 천안시 도시계획정보시스템(UPIS) 유통공급시설 현황으로 현황도형 관리번호, 라벨명 등의 항목을 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=15&beforeMenuCd=DOM_000000201001001000&publicdatapk=15123199

Alerts

면적_도형 is highly overall correlated with 길이_도형 and 1 other fieldsHigh correlation
길이_도형 is highly overall correlated with 면적_도형High correlation
라벨명 is highly overall correlated with 면적_도형High correlation
현황도형 관리번호 has unique valuesUnique
면적_도형 has unique valuesUnique
길이_도형 has unique valuesUnique

Reproduction

Analysis started2024-01-09 22:18:38.040571
Analysis finished2024-01-09 22:18:38.555219
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct80
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size772.0 B
2024-01-10T07:18:38.897979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length24
Mean length24
Min length24

Characters and Unicode

Total characters1920
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique80 ?
Unique (%)100.0%

Sample

1st row44130UQ154PS200707020015
2nd row44130UQ154PS200101180189
3rd row44130UQ154PS199710220003
4th row44130UQ154PS200812010679
5th row44130UQ154PS200406240013
ValueCountFrequency (%)
44130uq154ps200707020015 1
 
1.2%
44130uq154ps200101180189 1
 
1.2%
44130uq154ps201912110002 1
 
1.2%
44130uq154ps201908210025 1
 
1.2%
44130uq154ps201812210013 1
 
1.2%
44130uq154ps201812210012 1
 
1.2%
44130uq154ps201812210011 1
 
1.2%
44130uq154ps201812030002 1
 
1.2%
44130uq154ps201812030001 1
 
1.2%
44130uq154ps201912110005 1
 
1.2%
Other values (70) 70
87.5%
2024-01-10T07:18:39.152362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 447
23.3%
1 369
19.2%
4 249
13.0%
2 159
 
8.3%
5 105
 
5.5%
3 100
 
5.2%
U 80
 
4.2%
Q 80
 
4.2%
P 80
 
4.2%
S 80
 
4.2%
Other values (4) 171
 
8.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1600
83.3%
Uppercase Letter 320
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 447
27.9%
1 369
23.1%
4 249
15.6%
2 159
 
9.9%
5 105
 
6.6%
3 100
 
6.2%
8 59
 
3.7%
9 49
 
3.1%
6 37
 
2.3%
7 26
 
1.6%
Uppercase Letter
ValueCountFrequency (%)
U 80
25.0%
Q 80
25.0%
P 80
25.0%
S 80
25.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1600
83.3%
Latin 320
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 447
27.9%
1 369
23.1%
4 249
15.6%
2 159
 
9.9%
5 105
 
6.6%
3 100
 
6.2%
8 59
 
3.7%
9 49
 
3.1%
6 37
 
2.3%
7 26
 
1.6%
Latin
ValueCountFrequency (%)
U 80
25.0%
Q 80
25.0%
P 80
25.0%
S 80
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1920
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 447
23.3%
1 369
19.2%
4 249
13.0%
2 159
 
8.3%
5 105
 
5.5%
3 100
 
5.2%
U 80
 
4.2%
Q 80
 
4.2%
P 80
 
4.2%
S 80
 
4.2%
Other values (4) 171
 
8.9%

라벨명
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size772.0 B
배수시설
21 
가스공급시설
16 
기타수도시설
11 
변전시설
기타전기공급설비
Other values (11)
22 

Length

Max length10
Median length8
Mean length5.425
Min length3

Unique

Unique4 ?
Unique (%)5.0%

Sample

1st row배수시설
2nd row배수시설
3rd row기타수도시설
4th row정수시설
5th row배수시설

Common Values

ValueCountFrequency (%)
배수시설 21
26.2%
가스공급시설 16
20.0%
기타수도시설 11
13.8%
변전시설 6
 
7.5%
기타전기공급설비 4
 
5.0%
기타시장시설 3
 
3.8%
취수시설 3
 
3.8%
송유관 3
 
3.8%
기타가스공급설비 3
 
3.8%
정수시설 2
 
2.5%
Other values (6) 8
 
10.0%

Length

2024-01-10T07:18:39.267155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
배수시설 21
26.2%
가스공급시설 16
20.0%
기타수도시설 11
13.8%
변전시설 6
 
7.5%
기타전기공급설비 4
 
5.0%
기타시장시설 3
 
3.8%
취수시설 3
 
3.8%
송유관 3
 
3.8%
기타가스공급설비 3
 
3.8%
정수시설 2
 
2.5%
Other values (6) 8
 
10.0%

면적_도형
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct80
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21043.239
Minimum12
Maximum451189.78
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size852.0 B
2024-01-10T07:18:39.386823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12
5-th percentile93.150014
Q1933.2921
median3385.9025
Q312435.379
95-th percentile97910.346
Maximum451189.78
Range451177.78
Interquartile range (IQR)11502.087

Descriptive statistics

Standard deviation58891.715
Coefficient of variation (CV)2.798605
Kurtosis37.758555
Mean21043.239
Median Absolute Deviation (MAD)3135.1516
Skewness5.6722941
Sum1683459.2
Variance3.4682341 × 109
MonotonicityNot monotonic
2024-01-10T07:18:39.523089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2125.625592 1
 
1.2%
10570.45435 1
 
1.2%
536.13012 1
 
1.2%
39.000283 1
 
1.2%
96.0 1
 
1.2%
1572.95022 1
 
1.2%
219.2162925 1
 
1.2%
131.449614 1
 
1.2%
221.5086545 1
 
1.2%
396.458689 1
 
1.2%
Other values (70) 70
87.5%
ValueCountFrequency (%)
12.00000002 1
1.2%
18.21360933 1
1.2%
27.9642 1
1.2%
39.000283 1
1.2%
96.0 1
1.2%
131.449614 1
1.2%
161.2007761 1
1.2%
171.587455 1
1.2%
189.3967864 1
1.2%
219.2162925 1
1.2%
ValueCountFrequency (%)
451189.7844 1
1.2%
221190.0358 1
1.2%
111564.4038 1
1.2%
99464.24765 1
1.2%
97828.56213 1
1.2%
72038.28468 1
1.2%
65473.20958 1
1.2%
63580.96365 1
1.2%
50065.88303 1
1.2%
45930.06316 1
1.2%

길이_도형
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct80
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1859.6996
Minimum16
Maximum76033.057
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size852.0 B
2024-01-10T07:18:39.658339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16
5-th percentile43.050005
Q1153.59768
median300.48105
Q3763.55936
95-th percentile3441.9736
Maximum76033.057
Range76017.057
Interquartile range (IQR)609.96168

Descriptive statistics

Standard deviation8698.6403
Coefficient of variation (CV)4.6774439
Kurtosis69.198514
Mean1859.6996
Median Absolute Deviation (MAD)221.32953
Skewness8.1196811
Sum148775.96
Variance75666344
MonotonicityNot monotonic
2024-01-10T07:18:39.804261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
202.9773322 1
 
1.2%
3180.566564 1
 
1.2%
98.80001073 1
 
1.2%
25.00010859 1
 
1.2%
44.0 1
 
1.2%
158.6793954 1
 
1.2%
63.35923346 1
 
1.2%
47.22365162 1
 
1.2%
85.66358869 1
 
1.2%
99.65320512 1
 
1.2%
Other values (70) 70
87.5%
ValueCountFrequency (%)
16.00000001 1
1.2%
22.01805128 1
1.2%
23.22033969 1
1.2%
25.00010859 1
1.2%
44.0 1
1.2%
47.22365162 1
1.2%
52.69369612 1
1.2%
58.60394212 1
1.2%
63.35923346 1
1.2%
76.63737084 1
1.2%
ValueCountFrequency (%)
76033.05675 1
1.2%
14303.87941 1
1.2%
14221.40635 1
1.2%
3554.787973 1
1.2%
3436.036003 1
1.2%
3180.566564 1
1.2%
2222.815135 1
1.2%
1694.538931 1
1.2%
1637.055045 1
1.2%
1551.478925 1
1.2%

Interactions

2024-01-10T07:18:38.314815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:18:38.164717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:18:38.377633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:18:38.233205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:18:39.900800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
현황도형 관리번호라벨명면적_도형길이_도형
현황도형 관리번호1.0001.0001.0001.000
라벨명1.0001.0000.7830.485
면적_도형1.0000.7831.0000.220
길이_도형1.0000.4850.2201.000
2024-01-10T07:18:40.003135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적_도형길이_도형라벨명
면적_도형1.0000.8170.502
길이_도형0.8171.0000.273
라벨명0.5020.2731.000

Missing values

2024-01-10T07:18:38.466282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:18:38.528709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

현황도형 관리번호라벨명면적_도형길이_도형
044130UQ154PS200707020015배수시설2125.625592202.977332
144130UQ154PS200101180189배수시설1763.368818183.034935
244130UQ154PS199710220003기타수도시설34431.153311270.135297
344130UQ154PS200812010679정수시설12690.79816605.949244
444130UQ154PS200406240013배수시설5247.043521477.545098
544130UQ154PS200701250022변전시설10975.68236420.234752
644130UQ154PS200101180178변전시설10892.14211441.605742
744130UQ154PS200101180185기타수도시설13038.41388706.051998
844130UQ154PS199812280005가스공급시설869.153245125.212835
944130UQ154PS199811020001가스공급시설29874.67843736.392264
현황도형 관리번호라벨명면적_도형길이_도형
7044130UQ154PS202111260052기타전기공급설비399.75353479.856506
7144130UQ154PS202111260053기타전기공급설비408.10437378.446531
7244130UQ154PS199302230188송유관919.26424714221.40635
7344130UQ154PS199411250001송유관3802.01900576033.05675
7444130UQ154PS202011300045배수시설1224.406861166.647303
7544130UQ154PS201508120003가스공급시설974.6421126.239401
7644130UQ154PS201206070026가스공급시설27.964222.018051
7744130UQ154PS202207210092배수시설1788.377722179.225326
7844130UQ154PS201808210006가스공급시설1020.0212181.08
7944130UQ154PS201506260011배수시설3499.986651254.999786