Overview

Dataset statistics

Number of variables5
Number of observations5287
Missing cells0
Missing cells (%)0.0%
Duplicate rows17
Duplicate rows (%)0.3%
Total size in memory232.5 KiB
Average record size in memory45.0 B

Variable types

Numeric5

Dataset

Description대학도서관 전자 서비스에 관한 데이터 항목(MARC 데이터 구축 건수, 디지털 콘텐츠 구축 건수, 전자서비스 이용 현황 등) 정보를 제공합니다.
Author한국교육학술정보원
URLhttps://www.data.go.kr/data/15071922/fileData.do

Alerts

Dataset has 17 (0.3%) duplicate rowsDuplicates
MARC 데이터 구축 건수-합계 is highly overall correlated with 디지털 콘텐츠 구축 건수-합계 and 2 other fieldsHigh correlation
디지털 콘텐츠 구축 건수-합계 is highly overall correlated with MARC 데이터 구축 건수-합계 and 2 other fieldsHigh correlation
전자서비스 이용현황-홈페이지 접속건수 is highly overall correlated with MARC 데이터 구축 건수-합계 and 2 other fieldsHigh correlation
전자서비스 이용현황-OPAC 검색건수 is highly overall correlated with MARC 데이터 구축 건수-합계 and 2 other fieldsHigh correlation
디지털 콘텐츠 구축 건수-합계 is highly skewed (γ1 = 70.08883163)Skewed
전자서비스 이용현황-홈페이지 접속건수 is highly skewed (γ1 = 43.14243812)Skewed
전자서비스 이용현황-OPAC 검색건수 is highly skewed (γ1 = 33.0649954)Skewed
MARC 데이터 구축 건수-합계 has 569 (10.8%) zerosZeros
디지털 콘텐츠 구축 건수-합계 has 2870 (54.3%) zerosZeros
전자서비스 이용현황-홈페이지 접속건수 has 804 (15.2%) zerosZeros
전자서비스 이용현황-OPAC 검색건수 has 1517 (28.7%) zerosZeros

Reproduction

Analysis started2023-12-12 06:40:28.665020
Analysis finished2023-12-12 06:40:32.284843
Duration3.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

MARC 데이터 구축 건수-합계
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct4605
Distinct (%)87.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean267842.24
Minimum0
Maximum9814868
Zeros569
Zeros (%)10.8%
Negative0
Negative (%)0.0%
Memory size46.6 KiB
2023-12-12T15:40:32.411535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q130574.5
median103279
Q3340014
95-th percentile986986.8
Maximum9814868
Range9814868
Interquartile range (IQR)309439.5

Descriptive statistics

Standard deviation464016.51
Coefficient of variation (CV)1.7324247
Kurtosis80.04625
Mean267842.24
Median Absolute Deviation (MAD)98204
Skewness6.1343658
Sum1.4160819 × 109
Variance2.1531132 × 1011
MonotonicityNot monotonic
2023-12-12T15:40:32.621495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 569
 
10.8%
11000 8
 
0.2%
2203 7
 
0.1%
166500 6
 
0.1%
10 5
 
0.1%
5400 5
 
0.1%
10869 5
 
0.1%
75191 4
 
0.1%
250 3
 
0.1%
42822 3
 
0.1%
Other values (4595) 4672
88.4%
ValueCountFrequency (%)
0 569
10.8%
6 1
 
< 0.1%
8 1
 
< 0.1%
9 2
 
< 0.1%
10 5
 
0.1%
12 1
 
< 0.1%
13 1
 
< 0.1%
15 1
 
< 0.1%
60 1
 
< 0.1%
114 1
 
< 0.1%
ValueCountFrequency (%)
9814868 1
< 0.1%
9759519 1
< 0.1%
4578641 1
< 0.1%
4244292 1
< 0.1%
3908464 1
< 0.1%
3734600 1
< 0.1%
3694125 1
< 0.1%
3538894 1
< 0.1%
3482374 1
< 0.1%
3409107 1
< 0.1%

디지털 콘텐츠 구축 건수-합계
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct1812
Distinct (%)34.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16462.118
Minimum0
Maximum32412363
Zeros2870
Zeros (%)54.3%
Negative0
Negative (%)0.0%
Memory size46.6 KiB
2023-12-12T15:40:32.820330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31928
95-th percentile43746.9
Maximum32412363
Range32412363
Interquartile range (IQR)1928

Descriptive statistics

Standard deviation451258.53
Coefficient of variation (CV)27.411936
Kurtosis5027.8805
Mean16462.118
Median Absolute Deviation (MAD)0
Skewness70.088832
Sum87035218
Variance2.0363426 × 1011
MonotonicityNot monotonic
2023-12-12T15:40:32.993057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2870
54.3%
1 21
 
0.4%
22 17
 
0.3%
678 15
 
0.3%
6 13
 
0.2%
434 12
 
0.2%
146 12
 
0.2%
26 12
 
0.2%
34 11
 
0.2%
324 10
 
0.2%
Other values (1802) 2294
43.4%
ValueCountFrequency (%)
0 2870
54.3%
1 21
 
0.4%
2 8
 
0.2%
3 6
 
0.1%
4 3
 
0.1%
5 7
 
0.1%
6 13
 
0.2%
7 4
 
0.1%
8 4
 
0.1%
9 7
 
0.1%
ValueCountFrequency (%)
32412363 1
< 0.1%
2014989 1
< 0.1%
1795080 1
< 0.1%
1688593 1
< 0.1%
1573719 1
< 0.1%
1468459 1
< 0.1%
1213114 1
< 0.1%
915154 1
< 0.1%
912854 1
< 0.1%
794454 1
< 0.1%

전자서비스 이용현황-홈페이지 접속건수
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct4360
Distinct (%)82.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean659028.69
Minimum0
Maximum2.553604 × 108
Zeros804
Zeros (%)15.2%
Negative0
Negative (%)0.0%
Memory size46.6 KiB
2023-12-12T15:40:33.176976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15428.5
median55037
Q3417026.5
95-th percentile2615418.1
Maximum2.553604 × 108
Range2.553604 × 108
Interquartile range (IQR)411598

Descriptive statistics

Standard deviation4407051.7
Coefficient of variation (CV)6.6871924
Kurtosis2301.2967
Mean659028.69
Median Absolute Deviation (MAD)55037
Skewness43.142438
Sum3.4842847 × 109
Variance1.9422104 × 1013
MonotonicityNot monotonic
2023-12-12T15:40:33.354736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 804
 
15.2%
18250 4
 
0.1%
300 4
 
0.1%
867 4
 
0.1%
100 4
 
0.1%
1000 4
 
0.1%
700 4
 
0.1%
54750 3
 
0.1%
1800 3
 
0.1%
22221 3
 
0.1%
Other values (4350) 4450
84.2%
ValueCountFrequency (%)
0 804
15.2%
10 2
 
< 0.1%
21 1
 
< 0.1%
28 1
 
< 0.1%
30 1
 
< 0.1%
31 1
 
< 0.1%
35 2
 
< 0.1%
36 2
 
< 0.1%
42 1
 
< 0.1%
47 1
 
< 0.1%
ValueCountFrequency (%)
255360400 1
< 0.1%
139597712 1
< 0.1%
43355114 1
< 0.1%
39971971 1
< 0.1%
25365132 1
< 0.1%
22189404 1
< 0.1%
21727185 1
< 0.1%
20602212 1
< 0.1%
20161346 1
< 0.1%
19576055 1
< 0.1%

전자서비스 이용현황-OPAC 검색건수
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct3685
Distinct (%)69.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean503377.6
Minimum0
Maximum1.7020361 × 108
Zeros1517
Zeros (%)28.7%
Negative0
Negative (%)0.0%
Memory size46.6 KiB
2023-12-12T15:40:33.512659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median48000
Q3343647.5
95-th percentile1793042.9
Maximum1.7020361 × 108
Range1.7020361 × 108
Interquartile range (IQR)343647.5

Descriptive statistics

Standard deviation3549407.2
Coefficient of variation (CV)7.0511822
Kurtosis1333.9341
Mean503377.6
Median Absolute Deviation (MAD)48000
Skewness33.064995
Sum2.6613574 × 109
Variance1.2598291 × 1013
MonotonicityNot monotonic
2023-12-12T15:40:33.691879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1517
28.7%
300 5
 
0.1%
100000 4
 
0.1%
1152 3
 
0.1%
500 3
 
0.1%
3493 3
 
0.1%
1558 3
 
0.1%
12 3
 
0.1%
24 3
 
0.1%
50 3
 
0.1%
Other values (3675) 3740
70.7%
ValueCountFrequency (%)
0 1517
28.7%
10 1
 
< 0.1%
12 3
 
0.1%
20 1
 
< 0.1%
24 3
 
0.1%
31 1
 
< 0.1%
35 1
 
< 0.1%
50 3
 
0.1%
73 1
 
< 0.1%
74 1
 
< 0.1%
ValueCountFrequency (%)
170203612 1
< 0.1%
118377783 1
< 0.1%
93862037 1
< 0.1%
60326574 1
< 0.1%
48185059 1
< 0.1%
37137422 1
< 0.1%
29843049 1
< 0.1%
29261206 1
< 0.1%
26245478 1
< 0.1%
19387845 1
< 0.1%

조사년도 키
Real number (ℝ)

Distinct12
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2013.6656
Minimum2008
Maximum2019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size46.6 KiB
2023-12-12T15:40:33.875253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2008
5-th percentile2008
Q12011
median2014
Q32017
95-th percentile2019
Maximum2019
Range11
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.4082419
Coefficient of variation (CV)0.001692556
Kurtosis-1.1880171
Mean2013.6656
Median Absolute Deviation (MAD)3
Skewness-0.053347407
Sum10646250
Variance11.616113
MonotonicityNot monotonic
2023-12-12T15:40:34.008688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
2016 462
8.7%
2017 461
8.7%
2019 460
8.7%
2013 458
8.7%
2015 458
8.7%
2014 457
8.6%
2018 453
8.6%
2011 434
8.2%
2012 430
8.1%
2010 426
8.1%
Other values (2) 788
14.9%
ValueCountFrequency (%)
2008 381
7.2%
2009 407
7.7%
2010 426
8.1%
2011 434
8.2%
2012 430
8.1%
2013 458
8.7%
2014 457
8.6%
2015 458
8.7%
2016 462
8.7%
2017 461
8.7%
ValueCountFrequency (%)
2019 460
8.7%
2018 453
8.6%
2017 461
8.7%
2016 462
8.7%
2015 458
8.7%
2014 457
8.6%
2013 458
8.7%
2012 430
8.1%
2011 434
8.2%
2010 426
8.1%

Interactions

2023-12-12T15:40:31.471453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:29.108073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:29.690340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:30.329098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:30.898629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:31.565757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:29.218931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:29.805588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:30.474098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:31.009046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:31.699900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:29.324873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:29.969495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:30.589583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:31.156884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:31.808956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:29.471428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:30.110906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:30.684211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:31.265207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:31.945883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:29.579771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:30.226595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:30.781161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:40:31.374782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:40:34.106184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
MARC 데이터 구축 건수-합계디지털 콘텐츠 구축 건수-합계전자서비스 이용현황-홈페이지 접속건수전자서비스 이용현황-OPAC 검색건수조사년도 키
MARC 데이터 구축 건수-합계1.0000.0850.0390.0220.000
디지털 콘텐츠 구축 건수-합계0.0851.0000.0000.0000.015
전자서비스 이용현황-홈페이지 접속건수0.0390.0001.0000.0000.005
전자서비스 이용현황-OPAC 검색건수0.0220.0000.0001.0000.000
조사년도 키0.0000.0150.0050.0001.000
2023-12-12T15:40:34.239111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
MARC 데이터 구축 건수-합계디지털 콘텐츠 구축 건수-합계전자서비스 이용현황-홈페이지 접속건수전자서비스 이용현황-OPAC 검색건수조사년도 키
MARC 데이터 구축 건수-합계1.0000.6520.7280.7210.070
디지털 콘텐츠 구축 건수-합계0.6521.0000.6050.6050.213
전자서비스 이용현황-홈페이지 접속건수0.7280.6051.0000.7900.096
전자서비스 이용현황-OPAC 검색건수0.7210.6050.7901.0000.143
조사년도 키0.0700.2130.0960.1431.000

Missing values

2023-12-12T15:40:32.091724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:40:32.222170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

MARC 데이터 구축 건수-합계디지털 콘텐츠 구축 건수-합계전자서비스 이용현황-홈페이지 접속건수전자서비스 이용현황-OPAC 검색건수조사년도 키
01074675214595681810983882013
112201052212862013
259171025778824745262454782013
3474244507649099868206722013
4456016177102013
5312860002013
6127113440052153097622761962013
7565518137567041758364952013
820570210616700874926292013
9878952154515300766392572013
MARC 데이터 구축 건수-합계디지털 콘텐츠 구축 건수-합계전자서비스 이용현황-홈페이지 접속건수전자서비스 이용현황-OPAC 검색건수조사년도 키
527700622302012
527800293402012
5279745070002012
52804494601825002012
52816654836022797496322012
5282110000002012
52834628751497993132202012
52841160080154376794622012
528500002012
5286872083150185213647410382012

Duplicate rows

Most frequently occurring

MARC 데이터 구축 건수-합계디지털 콘텐츠 구축 건수-합계전자서비스 이용현황-홈페이지 접속건수전자서비스 이용현황-OPAC 검색건수조사년도 키# duplicates
00000200841
10000200941
50000201338
30000201132
20000201030
70000201528
60000201427
80000201627
40000201224
90000201724