Overview

Dataset statistics

Number of variables: 3
Number of observations: 5287
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 28
Duplicate rows (%): 0.5%
Total size in memory: 139.5 KiB
Average record size in memory: 27.0 B
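
The percentage and per-record figures above follow from the raw counts. As a minimal sketch (the variable names are illustrative, and treating 1 KiB as 1024 bytes is an assumption about how the report rounds), the arithmetic can be re-derived like this:

```python
# Counts copied from the dataset statistics above.
n_rows = 5287
n_duplicates = 28
total_kib = 139.5  # total size in memory, in KiB

# Share of duplicate rows, in percent.
duplicate_pct = 100 * n_duplicates / n_rows

# Average bytes per record, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_kib * 1024 / n_rows

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```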

Variable types

Numeric: 3

Dataset

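Description: Provides information on library facilities, including data on university library facilities and equipment (library building gross floor area; equipment: number of user computers).
Author: Korea Education and Research Information Service (한국교육학술정보원)
URL: https://www.data.go.kr/data/15071924/fileData.do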

Alerts

Dataset has 28 (0.5%) duplicate rows [Duplicates]
도서관 건물 연면적 (library building gross floor area) is highly overall correlated with 설비-이용자용 컴퓨터(PC) 수 (equipment: number of user PCs) [High correlation]
설비-이용자용 컴퓨터(PC) 수 is highly overall correlated with 도서관 건물 연면적 [High correlation]
도서관 건물 연면적 is highly skewed (γ1 = 43.08435307) [Skewed]
도서관 건물 연면적 has 121 (2.3%) zeros [Zeros]
설비-이용자용 컴퓨터(PC) 수 has 218 (4.1%) zeros [Zeros]
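
The skewness alert reports a γ1 value far above zero, which is what a few extremely large values in an otherwise modest distribution produce. The sketch below computes the adjusted Fisher-Pearson sample skewness with the standard library only. Whether the report uses exactly this estimator is an assumption, and the input lists are made-up illustrations, not values from the dataset.

```python
import math


def sample_skewness(values):
    """Adjusted Fisher-Pearson sample skewness (G1); needs at least 3 values."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least 3 values")
    mean = sum(values) / n
    # Second and third central moments (population form).
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    if m2 == 0:
        return 0.0  # constant data has no skew
    g1 = m3 / m2 ** 1.5
    # Small-sample correction factor.
    return math.sqrt(n * (n - 1)) / (n - 2) * g1


symmetric = [1, 2, 3, 4, 5]        # hypothetical, evenly spread
with_outlier = [1, 2, 3, 4, 500]   # hypothetical, one extreme value

print(sample_skewness(symmetric))     # close to zero
print(sample_skewness(with_outlier))  # clearly positive
```

A single extreme value is enough to push the statistic well above zero, which is consistent with a variable whose maximum is far larger than its 95th percentile.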

Reproduction

Analysis started: 2023-12-12 23:35:21.771709
Analysis finished: 2023-12-12 23:35:22.809981
Duration: 1.04 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
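
The reported duration can be checked against the two timestamps. This is a small sketch using only the standard library; the format string is an assumption based on how the timestamps are printed above.

```python
from datetime import datetime

# Format inferred from the timestamps shown in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 23:35:21.771709", FMT)
finished = datetime.strptime("2023-12-12 23:35:22.809981", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```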

Variables

조사년도 키 (survey year key)
Real number (ℝ)

Distinct: 12
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2013.6656
Minimum: 2008
Maximum: 2019
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 46.6 KiB

Quantile statistics

Minimum: 2008
5th percentile: 2008
Q1: 2011
Median: 2014
Q3: 2017
95th percentile: 2019
Maximum: 2019
Range: 11
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 3.4082419
Coefficient of variation (CV): 0.001692556
Kurtosis: -1.1880171
Mean: 2013.6656
Median Absolute Deviation (MAD): 3
Skewness: -0.053347407
Sum: 10646250
Variance: 11.616113
Monotonicity: Not monotonic
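
Several of the descriptive statistics are simple functions of one another. As a hedged sketch (names are illustrative, inputs are copied from the figures above), the mean, coefficient of variation, and variance can be re-derived from the sum, the row count, and the standard deviation:

```python
# Values copied from the report for the survey year variable.
n_rows = 5287
total = 10646250
std = 3.4082419

mean = total / n_rows   # mean = sum / count
cv = std / mean         # coefficient of variation = std / mean
variance = std ** 2     # variance = std squared

print(f"Mean: {mean:.4f}")
print(f"CV: {cv:.9f}")
print(f"Variance: {variance:.6f}")
```

The very small CV reflects that the standard deviation of a few years is tiny relative to year values around 2014, so it says little about spread for this kind of variable.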
[Figure: histogram with fixed-size bins (bins=12)]
Value             Count   Frequency (%)
2016              462     8.7%
2017              461     8.7%
2019              460     8.7%
2013              458     8.7%
2015              458     8.7%
2014              457     8.6%
2018              453     8.6%
2011              434     8.2%
2012              430     8.1%
2010              426     8.1%
Other values (2)  788     14.9%

Value             Count   Frequency (%)
2008              381     7.2%
2009              407     7.7%
2010              426     8.1%
2011              434     8.2%
2012              430     8.1%
2013              458     8.7%
2014              457     8.6%
2015              458     8.7%
2016              462     8.7%
2017              461     8.7%

Value             Count   Frequency (%)
2019              460     8.7%
2018              453     8.6%
2017              461     8.7%
2016              462     8.7%
2015              458     8.7%
2014              457     8.6%
2013              458     8.7%
2012              430     8.1%
2011              434     8.2%
2010              426     8.1%
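
The "Other values (2)" row is the remainder after the ten most frequent values are listed. A minimal sketch of that bookkeeping, using the per-year counts from the tables above and `collections.Counter` (the variable names are illustrative):

```python
from collections import Counter

# Per-year counts copied from the frequency tables above.
year_counts = Counter({
    2008: 381, 2009: 407, 2010: 426, 2011: 434,
    2012: 430, 2013: 458, 2014: 457, 2015: 458,
    2016: 462, 2017: 461, 2018: 453, 2019: 460,
})

n_rows = sum(year_counts.values())

# Ten most frequent values; everything else is folded into "Other values".
top10 = year_counts.most_common(10)
other_distinct = len(year_counts) - len(top10)
other_count = n_rows - sum(count for _, count in top10)
other_pct = 100 * other_count / n_rows

print(f"Other values ({other_distinct}): {other_count} ({other_pct:.1f}%)")
```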

도서관 건물 연면적 (library building gross floor area)
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct: 1619
Distinct (%): 30.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6476.3197
Minimum: 0
Maximum: 1050207
Zeros: 121
Zeros (%): 2.3%
Negative: 0
Negative (%): 0.0%
Memory size: 46.6 KiB

Quantile statistics

Minimum: 0
5th percentile: 135
Q1: 1033
Median: 3217
Q3: 8355
95th percentile: 21344
Maximum: 1050207
Range: 1050207
Interquartile range (IQR): 7322

Descriptive statistics

Standard deviation: 17379.349
Coefficient of variation (CV): 2.6835225
Kurtosis: 2497.6287
Mean: 6476.3197
Median Absolute Deviation (MAD): 2611
Skewness: 43.084353
Sum: 34240302
Variance: 3.0204179 × 10^8
Monotonicity: Not monotonic
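
For this variable the mean (6476.3197) is roughly twice the median (3217), and the standard deviation is far larger than the MAD, which is the usual signature of a few extreme values. The sketch below shows how the median absolute deviation is computed and why it stays stable under an outlier. The sample list is hypothetical and only borrows the reported maximum as its extreme value.

```python
from statistics import mean, median


def median_abs_deviation(values):
    """Median of absolute deviations from the median (unscaled MAD)."""
    med = median(values)
    return median([abs(x - med) for x in values])


# Hypothetical floor areas with one extreme value (not rows from the dataset).
sample = [1000, 3000, 3200, 8000, 1050207]

print("mean:  ", mean(sample))
print("median:", median(sample))
print("MAD:   ", median_abs_deviation(sample))
```

The mean is dragged toward the extreme value while the median and MAD barely notice it, so the quantile statistics are the safer summary for this column.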
[Figure: histogram with fixed-size bins (bins=50)]
Value                Count   Frequency (%)
0.0                  121     2.3%
379.0                22      0.4%
583.0                22      0.4%
210.0                21      0.4%
906.0                21      0.4%
1657.0               17      0.3%
20454.0              16      0.3%
750.0                15      0.3%
2883.0               14      0.3%
1195.0               14      0.3%
Other values (1609)  5004    94.6%

Value                Count   Frequency (%)
0.0                  121     2.3%
10.0                 7       0.1%
13.0                 4       0.1%
15.0                 2       < 0.1%
15.5                 1       < 0.1%
18.0                 3       0.1%
26.0                 1       < 0.1%
31.0                 2       < 0.1%
33.0                 12      0.2%
37.0                 3       0.1%

Value                Count   Frequency (%)
1050207.0            1       < 0.1%
368560.0             1       < 0.1%
106992.0             2       < 0.1%
78604.0              1       < 0.1%
78585.0              1       < 0.1%
78226.0              1       < 0.1%
76051.0              1       < 0.1%
75222.0              1       < 0.1%
68395.0              1       < 0.1%
66131.0              2       < 0.1%

설비-이용자용 컴퓨터(PC) 수 (equipment: number of user PCs)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 353
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 67.40609
Minimum: 0
Maximum: 989
Zeros: 218
Zeros (%): 4.1%
Negative: 0
Negative (%): 0.0%
Memory size: 46.6 KiB

Quantile statistics

Minimum: 0
5th percentile: 1
Q1: 10
Median: 39
Q3: 89
95th percentile: 237
Maximum: 989
Range: 989
Interquartile range (IQR): 79

Descriptive statistics

Standard deviation: 92.887844
Coefficient of variation (CV): 1.3780334
Kurtosis: 23.853233
Mean: 67.40609
Median Absolute Deviation (MAD): 33
Skewness: 3.8985924
Sum: 356376
Variance: 8628.1516
Monotonicity: Not monotonic
[Figure: histogram with fixed-size bins (bins=50)]
Value               Count   Frequency (%)
0                   218     4.1%
2                   217     4.1%
1                   134     2.5%
4                   134     2.5%
3                   133     2.5%
6                   108     2.0%
8                   106     2.0%
5                   98      1.9%
12                  83      1.6%
10                  81      1.5%
Other values (343)  3975    75.2%

Value               Count   Frequency (%)
0                   218     4.1%
1                   134     2.5%
2                   217     4.1%
3                   133     2.5%
4                   134     2.5%
5                   98      1.9%
6                   108     2.0%
7                   49      0.9%
8                   106     2.0%
9                   63      1.2%

Value               Count   Frequency (%)
989                 1       < 0.1%
949                 1       < 0.1%
944                 1       < 0.1%
932                 1       < 0.1%
903                 1       < 0.1%
826                 1       < 0.1%
820                 1       < 0.1%
818                 2       < 0.1%
810                 1       < 0.1%
792                 2       < 0.1%

Interactions

[Figure: interaction plots (9 panels)]

Correlations

[Figure: correlation heatmap]

                                    Survey year key   Library building gross floor area   Number of user PCs
Survey year key                     1.000             0.022                               0.000
Library building gross floor area   0.022             1.000                               0.000
Number of user PCs                  0.000             0.000                               1.000

[Figure: correlation heatmap]

                                    Survey year key   Library building gross floor area   Number of user PCs
Survey year key                     1.000             -0.015                              -0.013
Library building gross floor area   -0.015            1.000                               0.862
Number of user PCs                  -0.013            0.862                               1.000
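
The two matrices disagree sharply on the floor area and PC pair (0.000 versus 0.862), and the extracted text does not name the coefficient behind either one. As general background only, different correlation coefficients can react very differently to extreme values. The sketch below compares a Pearson coefficient with a rank-based (Spearman) coefficient on made-up data containing one outlier. It is an illustration of that effect under stated assumptions, not a reconstruction of how the report computed its matrices.

```python
import math


def pearson(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


# Hypothetical data: perfectly monotonic, with one extreme floor area.
area = [100, 200, 300, 400, 1_000_000]
pcs = [1, 2, 3, 4, 5]

print("Pearson: ", round(pearson(area, pcs), 3))
print("Spearman:", round(spearman(area, pcs), 3))
```

The rank-based value stays at its maximum because the ordering is preserved, while the Pearson value drops because one point dominates the variance.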

Missing values

[Figure: nullity by column] A simple visualization of nullity by column.
[Figure: nullity matrix] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Row    Survey year key   Library building gross floor area   Number of user PCs
0      2008              3823.0                              45
1      2008              4457.0                              9
2      2008              796.0                               2
3      2008              2292.0                              62
4      2008              14059.0                             142
5      2008              0.0                                 1
6      2008              3536.0                              49
7      2008              3604.0                              57
8      2008              984.0                               3
9      2008              1867.0                              21

Row    Survey year key   Library building gross floor area   Number of user PCs
5277   2009              14857.0                             184
5278   2009              3319.0                              118
5279   2009              2743.0                              5
5280   2009              1135.0                              31
5281   2009              1504.0                              8
5282   2009              10134.0                             17
5283   2009              924.0                               35
5284   2009              17762.0                             48
5285   2009              12541.0                             216
5286   2009              3340.0                              48

Duplicate rows

Most frequently occurring

Row   Survey year key   Library building gross floor area   Number of user PCs   # duplicates
8     2013              0.0                                 0                    8
10    2014              0.0                                 0                    8
12    2015              0.0                                 0                    8
5     2010              0.0                                 0                    7
6     2011              0.0                                 0                    7
16    2016              0.0                                 0                    7
21    2018              0.0                                 0                    7
25    2019              0.0                                 0                    7
7     2012              0.0                                 0                    6
18    2017              0.0                                 0                    6
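
A table like the one above can be produced by counting identical rows and keeping the groups that occur more than once. The following is a minimal standard-library sketch. The rows are a small hypothetical sample in the same (year, floor area, PCs) shape, not the full dataset, and the report's own implementation may differ.

```python
from collections import Counter

# Hypothetical rows shaped like (survey year, floor area, user PCs).
rows = [
    (2013, 0.0, 0),
    (2013, 0.0, 0),
    (2013, 0.0, 0),
    (2008, 3823.0, 45),
    (2008, 4457.0, 9),
    (2014, 0.0, 0),
    (2014, 0.0, 0),
]

# Count how often each distinct row occurs.
counts = Counter(rows)

# Keep only rows that appear more than once.
duplicate_groups = {row: n for row, n in counts.items() if n > 1}

# Most frequently occurring duplicates first.
most_frequent = sorted(duplicate_groups.items(), key=lambda kv: kv[1], reverse=True)

for row, n in most_frequent:
    print(row, n)
```

In the report's table, every listed duplicate has zero floor area and zero PCs, so the duplicates are likely placeholder or empty entries rather than repeated real measurements.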