Overview

Dataset statistics

Number of variables2
Number of observations67
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory20.0 B

Variable types

Numeric2

Dataset

Description등록번호,관리번호
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-21194/S/1/datasetView.do

Alerts

등록번호 is highly overall correlated with 관리번호High correlation
관리번호 is highly overall correlated with 등록번호High correlation
관리번호 has unique valuesUnique

Reproduction

Analysis started2024-05-11 09:32:23.633131
Analysis finished2024-05-11 09:32:25.331022
Duration1.7 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

등록번호
Real number (ℝ)

HIGH CORRELATION 

Distinct63
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1410.7015
Minimum1
Maximum1764
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size735.0 B
2024-05-11T09:32:25.630172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile48
Q11398.5
median1505
Q31618
95-th percentile1683.9
Maximum1764
Range1763
Interquartile range (IQR)219.5

Descriptive statistics

Standard deviation415.86409
Coefficient of variation (CV)0.29479241
Kurtosis7.1534031
Mean1410.7015
Median Absolute Deviation (MAD)113
Skewness-2.7974886
Sum94517
Variance172942.94
MonotonicityNot monotonic
2024-05-11T09:32:26.129569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1618 2
 
3.0%
1617 2
 
3.0%
1616 2
 
3.0%
1404 2
 
3.0%
1514 1
 
1.5%
1570 1
 
1.5%
1563 1
 
1.5%
1561 1
 
1.5%
1515 1
 
1.5%
1459 1
 
1.5%
Other values (53) 53
79.1%
ValueCountFrequency (%)
1 1
1.5%
4 1
1.5%
22 1
1.5%
45 1
1.5%
55 1
1.5%
1285 1
1.5%
1299 1
1.5%
1306 1
1.5%
1312 1
1.5%
1316 1
1.5%
ValueCountFrequency (%)
1764 1
1.5%
1763 1
1.5%
1759 1
1.5%
1686 1
1.5%
1679 1
1.5%
1678 1
1.5%
1677 1
1.5%
1667 1
1.5%
1658 1
1.5%
1657 1
1.5%

관리번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct67
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1674.3433
Minimum2
Maximum2201
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size735.0 B
2024-05-11T09:32:26.558815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile49.3
Q11537.5
median1802
Q32011.5
95-th percentile2129.1
Maximum2201
Range2199
Interquartile range (IQR)474

Descriptive statistics

Standard deviation533.92774
Coefficient of variation (CV)0.31888786
Kurtosis4.4811687
Mean1674.3433
Median Absolute Deviation (MAD)220
Skewness-2.1391501
Sum112181
Variance285078.83
MonotonicityNot monotonic
2024-05-11T09:32:26.972041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2013 1
 
1.5%
1814 1
 
1.5%
1989 1
 
1.5%
1987 1
 
1.5%
1975 1
 
1.5%
1922 1
 
1.5%
1910 1
 
1.5%
1901 1
 
1.5%
1811 1
 
1.5%
2015 1
 
1.5%
Other values (57) 57
85.1%
ValueCountFrequency (%)
2 1
1.5%
6 1
1.5%
38 1
1.5%
43 1
1.5%
64 1
1.5%
1286 1
1.5%
1322 1
1.5%
1343 1
1.5%
1355 1
1.5%
1362 1
1.5%
ValueCountFrequency (%)
2201 1
1.5%
2197 1
1.5%
2191 1
1.5%
2133 1
1.5%
2120 1
1.5%
2118 1
1.5%
2115 1
1.5%
2093 1
1.5%
2069 1
1.5%
2065 1
1.5%

Interactions

2024-05-11T09:32:24.442144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T09:32:23.791424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T09:32:24.691216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T09:32:24.147389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T09:32:27.243371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록번호관리번호
등록번호1.0000.998
관리번호0.9981.000
2024-05-11T09:32:27.461799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록번호관리번호
등록번호1.0000.981
관리번호0.9811.000

Missing values

2024-05-11T09:32:25.039312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T09:32:25.248461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

등록번호관리번호
016182013
116232015
216672093
315051788
415111802
515211829
616462045
717592191
817632197
917642201
등록번호관리번호
5716582069
5816782118
5915781941
6016772115
6116792120
6216862133
6315901954
6416171998
6516162011
6616172012