Overview

Dataset statistics

Number of variables2
Number of observations163
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.8 KiB
Average record size in memory17.8 B

Variable types

Numeric1
Boolean1

Dataset

Description등록번호,삭제여부
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-21198/S/1/datasetView.do

Alerts

삭제여부 is highly imbalanced (66.7%)Imbalance
등록번호 has unique valuesUnique

Reproduction

Analysis started2024-05-11 09:53:10.499647
Analysis finished2024-05-11 09:53:14.089561
Duration3.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

등록번호
Real number (ℝ)

UNIQUE 

Distinct163
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean135.07362
Minimum18
Maximum220
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2024-05-11T09:53:14.343083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum18
5-th percentile26.1
Q198.5
median139
Q3179.5
95-th percentile211.9
Maximum220
Range202
Interquartile range (IQR)81

Descriptive statistics

Standard deviation54.322375
Coefficient of variation (CV)0.40216865
Kurtosis-0.54221319
Mean135.07362
Median Absolute Deviation (MAD)41
Skewness-0.44604242
Sum22017
Variance2950.9205
MonotonicityNot monotonic
2024-05-11T09:53:14.810538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
179 1
 
0.6%
138 1
 
0.6%
126 1
 
0.6%
128 1
 
0.6%
131 1
 
0.6%
132 1
 
0.6%
133 1
 
0.6%
134 1
 
0.6%
139 1
 
0.6%
140 1
 
0.6%
Other values (153) 153
93.9%
ValueCountFrequency (%)
18 1
0.6%
19 1
0.6%
20 1
0.6%
21 1
0.6%
22 1
0.6%
23 1
0.6%
24 1
0.6%
25 1
0.6%
26 1
0.6%
27 1
0.6%
ValueCountFrequency (%)
220 1
0.6%
219 1
0.6%
218 1
0.6%
217 1
0.6%
216 1
0.6%
215 1
0.6%
214 1
0.6%
213 1
0.6%
212 1
0.6%
211 1
0.6%

삭제여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size295.0 B
False
153 
True
 
10
ValueCountFrequency (%)
False 153
93.9%
True 10
 
6.1%
2024-05-11T09:53:15.219369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2024-05-11T09:53:13.455010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T09:53:15.421609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록번호삭제여부
등록번호1.0000.281
삭제여부0.2811.000
2024-05-11T09:53:15.676394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록번호삭제여부
등록번호1.0000.317
삭제여부0.3171.000

Missing values

2024-05-11T09:53:13.822724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T09:53:14.011024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

등록번호삭제여부
0179N
1154N
2155N
3156N
4157Y
5158N
6159N
7160N
8161N
9162N
등록번호삭제여부
153214N
154215N
155186N
156187N
157213N
158216N
159217N
160218N
161219N
162220Y