Overview

Dataset statistics

Number of variables4
Number of observations164
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.7 KiB
Average record size in memory35.8 B

Variable types

Numeric3
Categorical1

Dataset

Description사립학교교직원연금공단 대여 현황과 관련된 데이터로 연도, 대여종류별(학자금대여, 의료대여, 주택대여, 가계자금대여) 건수, 금액정보 등의 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15045823/fileData.do

Alerts

건수(건) is highly overall correlated with 금액(천원) and 1 other fieldsHigh correlation
금액(천원) is highly overall correlated with 건수(건)High correlation
대여종류 is highly overall correlated with 건수(건)High correlation
건수(건) has 55 (33.5%) zerosZeros
금액(천원) has 55 (33.5%) zerosZeros

Reproduction

Analysis started2023-12-12 17:19:33.585939
Analysis finished2023-12-12 17:19:35.197140
Duration1.61 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준연도
Real number (ℝ)

Distinct41
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2002
Minimum1982
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-13T02:19:35.277778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1982
5-th percentile1984
Q11992
median2002
Q32012
95-th percentile2020
Maximum2022
Range40
Interquartile range (IQR)20

Descriptive statistics

Standard deviation11.868399
Coefficient of variation (CV)0.0059282712
Kurtosis-1.2013811
Mean2002
Median Absolute Deviation (MAD)10
Skewness0
Sum328328
Variance140.8589
MonotonicityIncreasing
2023-12-13T02:19:35.445693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
1982 4
 
2.4%
2013 4
 
2.4%
2005 4
 
2.4%
2006 4
 
2.4%
2007 4
 
2.4%
2008 4
 
2.4%
2009 4
 
2.4%
2010 4
 
2.4%
2011 4
 
2.4%
2012 4
 
2.4%
Other values (31) 124
75.6%
ValueCountFrequency (%)
1982 4
2.4%
1983 4
2.4%
1984 4
2.4%
1985 4
2.4%
1986 4
2.4%
1987 4
2.4%
1988 4
2.4%
1989 4
2.4%
1990 4
2.4%
1991 4
2.4%
ValueCountFrequency (%)
2022 4
2.4%
2021 4
2.4%
2020 4
2.4%
2019 4
2.4%
2018 4
2.4%
2017 4
2.4%
2016 4
2.4%
2015 4
2.4%
2014 4
2.4%
2013 4
2.4%

대여종류
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
학자금대여
41 
의료대여
41 
주택대여
41 
가계자금대여
41 

Length

Max length6
Median length5.5
Mean length4.75
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row학자금대여
2nd row의료대여
3rd row주택대여
4th row가계자금대여
5th row학자금대여

Common Values

ValueCountFrequency (%)
학자금대여 41
25.0%
의료대여 41
25.0%
주택대여 41
25.0%
가계자금대여 41
25.0%

Length

2023-12-13T02:19:35.613894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:19:35.795903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
학자금대여 41
25.0%
의료대여 41
25.0%
주택대여 41
25.0%
가계자금대여 41
25.0%

건수(건)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct109
Distinct (%)66.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16284.86
Minimum0
Maximum73357
Zeros55
Zeros (%)33.5%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-13T02:19:35.944513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median3371.5
Q324905
95-th percentile68443.9
Maximum73357
Range73357
Interquartile range (IQR)24905

Descriptive statistics

Standard deviation21019.186
Coefficient of variation (CV)1.2907195
Kurtosis0.96518749
Mean16284.86
Median Absolute Deviation (MAD)3371.5
Skewness1.3489288
Sum2670717
Variance4.4180617 × 108
MonotonicityNot monotonic
2023-12-13T02:19:36.120555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 55
33.5%
1265 2
 
1.2%
14738 1
 
0.6%
38793 1
 
0.6%
36432 1
 
0.6%
58111 1
 
0.6%
35333 1
 
0.6%
49897 1
 
0.6%
32685 1
 
0.6%
51959 1
 
0.6%
Other values (99) 99
60.4%
ValueCountFrequency (%)
0 55
33.5%
72 1
 
0.6%
140 1
 
0.6%
283 1
 
0.6%
305 1
 
0.6%
472 1
 
0.6%
474 1
 
0.6%
559 1
 
0.6%
572 1
 
0.6%
709 1
 
0.6%
ValueCountFrequency (%)
73357 1
0.6%
72933 1
0.6%
72321 1
0.6%
72048 1
0.6%
71877 1
0.6%
69891 1
0.6%
69789 1
0.6%
68536 1
0.6%
68482 1
0.6%
68228 1
0.6%

금액(천원)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct110
Distinct (%)67.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.7654565 × 108
Minimum0
Maximum1.4683967 × 109
Zeros55
Zeros (%)33.5%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-13T02:19:36.295264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median8172360
Q378585920
95-th percentile1.1872402 × 109
Maximum1.4683967 × 109
Range1.4683967 × 109
Interquartile range (IQR)78585920

Descriptive statistics

Standard deviation3.841511 × 108
Coefficient of variation (CV)2.1759308
Kurtosis4.075588
Mean1.7654565 × 108
Median Absolute Deviation (MAD)8172360
Skewness2.3542074
Sum2.8953486 × 1010
Variance1.4757207 × 1017
MonotonicityNot monotonic
2023-12-13T02:19:36.505692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 55
33.5%
4131150 1
 
0.6%
71012020 1
 
0.6%
1343205024 1
 
0.6%
123375740 1
 
0.6%
1182003600 1
 
0.6%
105041020 1
 
0.6%
1188146590 1
 
0.6%
90140215 1
 
0.6%
1018383700 1
 
0.6%
Other values (100) 100
61.0%
ValueCountFrequency (%)
0 55
33.5%
245300 1
 
0.6%
279900 1
 
0.6%
539200 1
 
0.6%
695900 1
 
0.6%
895400 1
 
0.6%
900800 1
 
0.6%
1017000 1
 
0.6%
1095800 1
 
0.6%
1186200 1
 
0.6%
ValueCountFrequency (%)
1468396700 1
0.6%
1447784500 1
0.6%
1402876926 1
0.6%
1399959600 1
0.6%
1343205024 1
0.6%
1342947000 1
0.6%
1340732446 1
0.6%
1328056200 1
0.6%
1188146590 1
0.6%
1182103700 1
0.6%

Interactions

2023-12-13T02:19:34.563815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:19:33.713317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:19:34.137985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:19:34.752012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:19:33.842535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:19:34.283857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:19:34.900054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:19:33.966914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:19:34.390520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:19:36.613781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연도대여종류건수(건)금액(천원)
기준연도1.0000.0000.5570.488
대여종류0.0001.0000.7930.581
건수(건)0.5570.7931.0000.873
금액(천원)0.4880.5810.8731.000
2023-12-13T02:19:37.049063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준연도건수(건)금액(천원)대여종류
기준연도1.0000.0450.0560.000
건수(건)0.0451.0000.9680.603
금액(천원)0.0560.9681.0000.380
대여종류0.0000.6030.3801.000

Missing values

2023-12-13T02:19:35.058331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:19:35.162094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준연도대여종류건수(건)금액(천원)
01982학자금대여147384131150
11982의료대여283245300
21982주택대여5591649700
31982가계자금대여00
41983학자금대여147034073390
51983의료대여474539200
61983주택대여10353244800
71983가계자금대여00
81984학자금대여136284074910
91984의료대여572695900
기준연도대여종류건수(건)금액(천원)
1542020주택대여00
1552020가계자금대여733571109648200
1562021학자금대여1786853100840
1572021의료대여00
1582021주택대여00
1592021가계자금대여729331103416400
1602022학자금대여1397051224600
1612022의료대여00
1622022주택대여00
1632022가계자금대여69789834495000