Overview

Dataset statistics

Number of variables: 5
Number of observations: 32
Missing cells: 32
Missing cells (%): 20.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.5 KiB
Average record size in memory: 47.1 B
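The missing-cell percentage follows directly from the counts above. The short check below is a sketch that only uses the figures in this block; the variable names are illustrative and not part of the report.

```python
# Counts taken from the "Dataset statistics" block of this report.
n_variables = 5
n_observations = 32
missing_cells = 32

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations

# Share of cells that are empty, as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(f"{missing_cells} of {total_cells} cells missing ({missing_pct:.1f}%)")
```

Per the Alerts section, all 32 missing cells belong to a single column (면적), which is empty for every observation.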

Variable types

Numeric: 2
Categorical: 2
Unsupported: 1

Dataset

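Description: columns 공간id (spatial ID), 자치구명 (district name), 면적 (area), 기타 (other), 일련번호 (serial number)
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-21178/S/1/datasetView.do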

Alerts

기타 has constant value "SD913" (Constant)
공간id is highly overall correlated with 일련번호 (High correlation)
일련번호 is highly overall correlated with 공간id (High correlation)
자치구명 is highly imbalanced (53.4%) (Imbalance)
면적 has 32 (100.0%) missing values (Missing)
공간id has unique values (Unique)
일련번호 has unique values (Unique)
면적 is an unsupported type; check whether it needs cleaning or further analysis (Unsupported)

Reproduction

Analysis started: 2024-05-11 09:55:03.634375
Analysis finished: 2024-05-11 09:55:05.725071
Duration: 2.09 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
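The reported duration is the difference between the two timestamps above. A minimal check with the standard library, using only the values printed in this section:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2024-05-11 09:55:03.634375")
finished = datetime.fromisoformat("2024-05-11 09:55:05.725071")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()

print(f"Duration: {duration:.2f} seconds")
```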

Variables

공간id
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 32
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20176.5
Minimum: 20161
Maximum: 20192
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 420.0 B

Quantile statistics

Minimum: 20161
5-th percentile: 20162.55
Q1: 20168.75
Median: 20176.5
Q3: 20184.25
95-th percentile: 20190.45
Maximum: 20192
Range: 31
Interquartile range (IQR): 15.5

Descriptive statistics

Standard deviation: 9.3808315
Coefficient of variation (CV): 0.00046493849
Kurtosis: -1.2
Mean: 20176.5
Median Absolute Deviation (MAD): 8
Skewness: 0
Sum: 645648
Variance: 88
Monotonicity: Strictly increasing
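The figures for 공간id can be recomputed by hand. The sketch below assumes the 32 distinct values are the consecutive integers 20161 to 20192, which is consistent with the reported minimum, maximum, range, and distinct count but is not stated outright in the report. It also assumes the quartiles use linear interpolation between the minimum and maximum (Python's "inclusive" method) and that the variance is the sample variance with an n - 1 denominator. Both assumptions reproduce the reported numbers.

```python
import statistics

# Assumption: the column holds the consecutive integers 20161..20192.
values = list(range(20161, 20193))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation

# Quartiles with linear interpolation over the closed range [min, max].
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(f"sum={total} mean={mean} variance={variance} std={std:.7f}")
print(f"cv={cv:.11f} q1={q1} median={median} q3={q3} iqr={iqr} mad={mad}")
```

Because the mean is four orders of magnitude larger than the spread, the coefficient of variation is tiny. That is typical of an identifier column rather than a measurement.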
Histogram with fixed size bins (bins=32)
Common values

Value              Count  Frequency (%)
20161              1      3.1%
20178              1      3.1%
20192              1      3.1%
20191              1      3.1%
20190              1      3.1%
20189              1      3.1%
20188              1      3.1%
20187              1      3.1%
20186              1      3.1%
20185              1      3.1%
Other values (22)  22     68.8%

Smallest 10 values

Value  Count  Frequency (%)
20161  1      3.1%
20162  1      3.1%
20163  1      3.1%
20164  1      3.1%
20165  1      3.1%
20166  1      3.1%
20167  1      3.1%
20168  1      3.1%
20169  1      3.1%
20170  1      3.1%

Largest 10 values

Value  Count  Frequency (%)
20192  1      3.1%
20191  1      3.1%
20190  1      3.1%
20189  1      3.1%
20188  1      3.1%
20187  1      3.1%
20186  1      3.1%
20185  1      3.1%
20184  1      3.1%
20183  1      3.1%

자치구명
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 9.4%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 1
Unique (%): 3.1%

Sample

1st row: 용산구
2nd row: 성동구
3rd row: 성동구
4th row: 성동구
5th row: 성동구

Common Values

Value   Count  Frequency (%)
성동구  27     84.4%
강남구  4      12.5%
용산구  1      3.1%
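The 53.4% imbalance alert for 자치구명 can be reproduced from the counts in the table above. The sketch assumes the score is one minus the Shannon entropy of the category frequencies, normalized by the maximum entropy for three categories. The report does not state the formula, so treat this definition as an assumption that happens to match the reported value.

```python
import math

# Category counts from the Common Values table.
counts = {"성동구": 27, "강남구": 4, "용산구": 1}
total = sum(counts.values())

# Shannon entropy (in bits) of the observed category distribution.
entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())

# Entropy of a perfectly balanced distribution over the same categories.
max_entropy = math.log2(len(counts))

# Assumed score: 0 means perfectly balanced, 1 means a single category.
imbalance = 1 - entropy / max_entropy

print(f"Imbalance: {imbalance:.1%}")
```

A score near 0 would indicate evenly spread districts. Here one district (성동구) accounts for 27 of the 32 rows, which pushes the score above one half.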

Length

Histogram of lengths of the category

Common Values (Plot)

Value   Count  Frequency (%)
성동구  27     84.4%
강남구  4      12.5%
용산구  1      3.1%

면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 32
Missing (%): 100.0%
Memory size: 420.0 B

기타
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: SD913
2nd row: SD913
3rd row: SD913
4th row: SD913
5th row: SD913

Common Values

Value  Count  Frequency (%)
SD913  32     100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
sd913  32     100.0%

일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 32
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17.90625
Minimum: 1
Maximum: 35
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 420.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.55
Q1: 8.75
Median: 18.5
Q3: 27.25
95-th percentile: 33.45
Maximum: 35
Range: 34
Interquartile range (IQR): 18.5

Descriptive statistics

Standard deviation: 10.614535
Coefficient of variation (CV): 0.59278379
Kurtosis: -1.3071967
Mean: 17.90625
Median Absolute Deviation (MAD): 9.5
Skewness: 0.014539834
Sum: 573
Variance: 112.66835
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=32)
Common values

Value              Count  Frequency (%)
1                  1      3.1%
20                 1      3.1%
35                 1      3.1%
34                 1      3.1%
33                 1      3.1%
32                 1      3.1%
31                 1      3.1%
30                 1      3.1%
29                 1      3.1%
28                 1      3.1%
Other values (22)  22     68.8%

Smallest 10 values

Value  Count  Frequency (%)
1      1      3.1%
2      1      3.1%
3      1      3.1%
4      1      3.1%
5      1      3.1%
6      1      3.1%
7      1      3.1%
8      1      3.1%
9      1      3.1%
10     1      3.1%

Largest 10 values

Value  Count  Frequency (%)
35     1      3.1%
34     1      3.1%
33     1      3.1%
32     1      3.1%
31     1      3.1%
30     1      3.1%
29     1      3.1%
28     1      3.1%
27     1      3.1%
25     1      3.1%

Interactions

[Four interaction plots; images not preserved in this text export]

Correlations

          공간id  자치구명  일련번호
공간id    1.000   0.820     0.996
자치구명  0.820   1.000     0.576
일련번호  0.996   0.576     1.000
          공간id  일련번호  자치구명
공간id    1.000   1.000     0.354
일련번호  1.000   1.000     0.354
자치구명  0.354   0.354     1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index  공간id  자치구명  면적  기타   일련번호
0      20161   용산구    <NA>  SD913  1
1      20162   성동구    <NA>  SD913  2
2      20163   성동구    <NA>  SD913  3
3      20164   성동구    <NA>  SD913  4
4      20165   성동구    <NA>  SD913  5
5      20166   성동구    <NA>  SD913  6
6      20167   성동구    <NA>  SD913  7
7      20168   성동구    <NA>  SD913  8
8      20169   성동구    <NA>  SD913  9
9      20170   성동구    <NA>  SD913  10

Last rows

Index  공간id  자치구명  면적  기타   일련번호
22     20183   성동구    <NA>  SD913  25
23     20184   성동구    <NA>  SD913  27
24     20185   성동구    <NA>  SD913  28
25     20186   성동구    <NA>  SD913  29
26     20187   성동구    <NA>  SD913  30
27     20188   강남구    <NA>  SD913  31
28     20189   강남구    <NA>  SD913  32
29     20190   강남구    <NA>  SD913  33
30     20191   강남구    <NA>  SD913  34
31     20192   성동구    <NA>  SD913  35