Overview

Dataset statistics

Number of variables: 2
Number of observations: 163
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.0 KiB
Average record size in memory: 18.8 B

Variable types

Numeric: 2

Dataset

Description: 등록번호 (registration number), 관리번호 (management number)
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-21196/S/1/datasetView.do

Alerts

등록번호 is highly overall correlated with 관리번호 (High correlation)
관리번호 is highly overall correlated with 등록번호 (High correlation)
관리번호 has unique values (Unique)

Reproduction

Analysis started: 2024-05-11 09:39:43.223081
Analysis finished: 2024-05-11 09:39:45.771069
Duration: 2.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
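The reported duration follows from the two timestamps above. A minimal sketch of that arithmetic, using only the values printed in this section:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of the report.
started = datetime.fromisoformat("2024-05-11 09:39:43.223081")
finished = datetime.fromisoformat("2024-05-11 09:39:45.771069")

# Elapsed wall-clock time in seconds, rounded to two decimals as in the report.
duration = (finished - started).total_seconds()
duration_label = f"{duration:.2f} seconds"
print(duration_label)
```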

Variables

등록번호 (registration number)
Real number (ℝ)

Alerts: High correlation

Distinct: 157
Distinct (%): 96.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1433.1656
Minimum: 45
Maximum: 1781
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.6 KiB

Quantile statistics

Minimum: 45
5-th percentile: 1290.1
Q1: 1372
Median: 1452
Q3: 1554.5
95-th percentile: 1698.5
Maximum: 1781
Range: 1736
Interquartile range (IQR): 182.5
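The range and IQR above are simple differences of the listed quantiles. The sketch below recomputes them and, as an illustration only, applies Tukey's 1.5 × IQR fences to the smallest values of 등록번호. The fence rule is an assumption added here and is not something the report computes.

```python
# Quantile values copied from the report for 등록번호.
q1, q3 = 1372.0, 1554.5
minimum, maximum = 45.0, 1781.0

value_range = maximum - minimum  # reported as 1736
iqr = q3 - q1                    # reported as 182.5

# Tukey's fences (assumed outlier rule, not part of the report).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# Five smallest distinct values listed in the report's extreme-value table.
smallest = [45, 51, 53, 57, 1287]
low_outliers = [v for v in smallest if v < lower_fence]

print(value_range, iqr, lower_fence, upper_fence, low_outliers)
```

Under this rule the values 45 through 57 fall below the lower fence, while the maximum 1781 stays inside the upper fence, which is consistent with the strongly negative skewness reported for this variable.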

Descriptive statistics

Standard deviation: 275.95308
Coefficient of variation (CV): 0.19254793
Kurtosis: 17.30817
Mean: 1433.1656
Median Absolute Deviation (MAD): 84
Skewness: -3.7892273
Sum: 233606
Variance: 76150.102
Monotonicity: Not monotonic
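Several of the descriptive statistics for 등록번호 are related by simple identities: mean = sum / n, variance = standard deviation squared, and CV = standard deviation / mean. A small sketch that checks the reported figures against each other, using only numbers printed in the report:

```python
import math

# Values copied from the report for 등록번호.
n = 163
total = 233606
mean = 1433.1656
std = 275.95308
variance = 76150.102
cv = 0.19254793

# Recompute each quantity from the others.
mean_from_sum = total / n
variance_from_std = std ** 2
cv_from_std = std / mean

# The tolerance allows for the rounding applied to the printed values.
checks = {
    "mean": math.isclose(mean_from_sum, mean, rel_tol=1e-6),
    "variance": math.isclose(variance_from_std, variance, rel_tol=1e-6),
    "cv": math.isclose(cv_from_std, cv, rel_tol=1e-6),
}
print(checks)
```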
Histogram with fixed size bins (bins=50)
Common values
Value               Count  Frequency (%)
1296                2      1.2%
1390                2      1.2%
1437                2      1.2%
1664                2      1.2%
45                  2      1.2%
1681                2      1.2%
1288                1      0.6%
1557                1      0.6%
1536                1      0.6%
1486                1      0.6%
Other values (147)  147    90.2%
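The frequency column is each count divided by the 163 observations, shown to one decimal place. A minimal sketch of that calculation; the helper name `pct` is hypothetical and introduced only for illustration:

```python
N_OBSERVATIONS = 163  # number of observations, from the Overview section


def pct(count, total=N_OBSERVATIONS):
    """Share of observations as a percentage string with one decimal place."""
    return f"{100 * count / total:.1f}%"


twice = pct(2)        # values that occur twice
once = pct(1)         # values that occur once
other = pct(147)      # the "Other values (147)" row
distinct = pct(157)   # 157 distinct values of 등록번호
print(twice, once, other, distinct)
```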
Minimum 10 values
Value  Count  Frequency (%)
45     2      1.2%
51     1      0.6%
53     1      0.6%
57     1      0.6%
1287   1      0.6%
1288   1      0.6%
1289   1      0.6%
1290   1      0.6%
1291   1      0.6%
1292   1      0.6%
Maximum 10 values
Value  Count  Frequency (%)
1781   1      0.6%
1755   1      0.6%
1737   1      0.6%
1714   1      0.6%
1713   1      0.6%
1710   1      0.6%
1709   1      0.6%
1708   1      0.6%
1699   1      0.6%
1694   1      0.6%

관리번호 (management number)
Real number (ℝ)

Alerts: High correlation, Unique

Distinct: 163
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1654.3436
Minimum: 36
Maximum: 2217
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.6 KiB

Quantile statistics

Minimum: 36
5-th percentile: 1297.3
Q1: 1475.5
Median: 1671
Q3: 1885.5
95-th percentile: 2146.7
Maximum: 2217
Range: 2181
Interquartile range (IQR): 410

Descriptive statistics

Standard deviation: 384.78425
Coefficient of variation (CV): 0.23259029
Kurtosis: 7.2873064
Mean: 1654.3436
Median Absolute Deviation (MAD): 202
Skewness: -2.02651
Sum: 269658
Variance: 148058.92
Monotonicity: Not monotonic
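The negative skewness reported for 관리번호 is what a handful of very small values produce when most of the data sits much higher. The sketch below illustrates the sign only. It uses a 15-value subset taken from the report (the five smallest values plus the first ten sample rows) and the plain moment-based estimator g1 = m3 / m2^1.5, which is an assumption: the report does not state which estimator it uses, and the result is not expected to match -2.02651.

```python
def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Subset of 관리번호 copied from the report, not the full 163 observations.
low_tail = [36, 39, 40, 46, 59]
first_sample_rows = [1291, 1295, 1297, 1300, 1303, 1306, 1323, 1324, 1328, 1340]

skew_with_tail = skewness(low_tail + first_sample_rows)
print(skew_with_tail)  # negative: the low tail pulls the distribution to the left
```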
Histogram with fixed size bins (bins=50)
Common values
Value               Count  Frequency (%)
1291                1      0.6%
1779                1      0.6%
1787                1      0.6%
1799                1      0.6%
1856                1      0.6%
1745                1      0.6%
1817                1      0.6%
1888                1      0.6%
1889                1      0.6%
1893                1      0.6%
Other values (153)  153    93.9%
Minimum 10 values
Value  Count  Frequency (%)
36     1      0.6%
39     1      0.6%
40     1      0.6%
46     1      0.6%
59     1      0.6%
1288   1      0.6%
1291   1      0.6%
1295   1      0.6%
1297   1      0.6%
1300   1      0.6%
Maximum 10 values
Value  Count  Frequency (%)
2217   1      0.6%
2185   1      0.6%
2179   1      0.6%
2167   1      0.6%
2166   1      0.6%
2159   1      0.6%
2155   1      0.6%
2154   1      0.6%
2147   1      0.6%
2144   1      0.6%

Interactions

[Figures: four interaction plots of 등록번호 and 관리번호; images not preserved]

Correlations

[Figure: correlation heatmap]
          등록번호  관리번호
등록번호  1.000     0.992
관리번호  0.992     1.000

[Figure: correlation heatmap]
          등록번호  관리번호
등록번호  1.000     1.000
관리번호  1.000     1.000
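To make the "High correlation" alert concrete, the sketch below computes a Pearson coefficient and a Spearman rank coefficient on the 20 rows shown in the Sample section. Both functions are written from the textbook definitions and are assumptions for illustration. They are not the report's own implementation, and the results on 20 rows are not expected to reproduce the matrices above, which are computed on all 163 observations.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman coefficient: Pearson correlation of the average ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


# First 10 and last 10 rows from the report's Sample section.
reg = [1288, 1289, 1290, 1291, 1292, 1293, 1299, 1297, 1300, 1305,
       1681, 1681, 1708, 1709, 1710, 1713, 1714, 1737, 1755, 1781]
mgmt = [1291, 1295, 1297, 1300, 1303, 1306, 1323, 1324, 1328, 1340,
        2122, 2123, 2154, 2155, 2159, 2166, 2167, 2179, 2185, 2217]

r_pearson = pearson(reg, mgmt)
r_spearman = spearman(reg, mgmt)
print(round(r_pearson, 3), round(r_spearman, 3))
```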

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows
     등록번호  관리번호
0    1288      1291
1    1289      1295
2    1290      1297
3    1291      1300
4    1292      1303
5    1293      1306
6    1299      1323
7    1297      1324
8    1300      1328
9    1305      1340
Last rows
     등록번호  관리번호
153  1681      2122
154  1681      2123
155  1708      2154
156  1709      2155
157  1710      2159
158  1713      2166
159  1714      2167
160  1737      2179
161  1755      2185
162  1781      2217
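Both variables are reported as "Not monotonic". A short sketch that checks this on the first ten sample rows; the helper `is_non_decreasing` is hypothetical and introduced only for illustration:

```python
def is_non_decreasing(values):
    """True if every value is greater than or equal to the one before it."""
    return all(a <= b for a, b in zip(values, values[1:]))


# First ten rows from the report's Sample section.
reg_first = [1288, 1289, 1290, 1291, 1292, 1293, 1299, 1297, 1300, 1305]
mgmt_first = [1291, 1295, 1297, 1300, 1303, 1306, 1323, 1324, 1328, 1340]

reg_ok = is_non_decreasing(reg_first)    # breaks at 1299 -> 1297
mgmt_ok = is_non_decreasing(mgmt_first)  # holds within these ten rows
print(reg_ok, mgmt_ok)
```

등록번호 already breaks monotonicity within the rows shown (1299 is followed by 1297). 관리번호 is increasing across every displayed row, so the break that makes it non-monotonic must lie in rows that are not shown, for example where the smallest values (36 to 59) occur.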