Overview

Dataset statistics

Number of variables4
Number of observations7792
Missing cells0
Missing cells (%)0.0%
Duplicate rows620
Duplicate rows (%)8.0%
Total size in memory258.8 KiB
Average record size in memory34.0 B

Variable types

Numeric2
DateTime1
Categorical1

Dataset

Description60세 이상 국민연금 수급자의 노후긴급자금 대부심사 결재의뢰 내역(연령별)(연령, 접수일, 대부용도, 대부금액)
URLhttps://www.data.go.kr/data/15044882/fileData.do

Alerts

Dataset has 620 (8.0%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 23:11:29.547679
Analysis finished2023-12-12 23:11:30.627236
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

신청연령
Real number (ℝ)

Distinct30
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean65.677233
Minimum60
Maximum89
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size68.6 KiB
2023-12-13T08:11:30.697794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum60
5-th percentile60
Q162
median64
Q368
95-th percentile75
Maximum89
Range29
Interquartile range (IQR)6

Descriptive statistics

Standard deviation4.6522952
Coefficient of variation (CV)0.070835737
Kurtosis1.3029931
Mean65.677233
Median Absolute Deviation (MAD)2
Skewness1.2102105
Sum511757
Variance21.643851
MonotonicityIncreasing
2023-12-13T08:11:30.831351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
62 1235
15.8%
64 953
12.2%
63 903
11.6%
65 551
 
7.1%
60 549
 
7.0%
66 517
 
6.6%
61 491
 
6.3%
67 418
 
5.4%
69 341
 
4.4%
68 328
 
4.2%
Other values (20) 1506
19.3%
ValueCountFrequency (%)
60 549
7.0%
61 491
 
6.3%
62 1235
15.8%
63 903
11.6%
64 953
12.2%
65 551
7.1%
66 517
6.6%
67 418
 
5.4%
68 328
 
4.2%
69 341
 
4.4%
ValueCountFrequency (%)
89 1
 
< 0.1%
88 2
 
< 0.1%
87 4
 
0.1%
86 1
 
< 0.1%
85 3
 
< 0.1%
84 4
 
0.1%
83 9
 
0.1%
82 18
0.2%
81 25
0.3%
80 33
0.4%
Distinct247
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size61.0 KiB
Minimum2021-07-01 00:00:00
Maximum2022-06-30 00:00:00
2023-12-13T08:11:30.959230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:11:31.098216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

대부용도
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size61.0 KiB
전월세보증금
4330 
의료비
3285 
배우자장제비
 
158
재해복구비
 
19

Length

Max length6
Median length6
Mean length4.7328029
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전월세보증금
2nd row의료비
3rd row전월세보증금
4th row전월세보증금
5th row의료비

Common Values

ValueCountFrequency (%)
전월세보증금 4330
55.6%
의료비 3285
42.2%
배우자장제비 158
 
2.0%
재해복구비 19
 
0.2%

Length

2023-12-13T08:11:31.248998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:11:31.366973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전월세보증금 4330
55.6%
의료비 3285
42.2%
배우자장제비 158
 
2.0%
재해복구비 19
 
0.2%

대부금액
Real number (ℝ)

Distinct100
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6454170.9
Minimum100000
Maximum10000000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size68.6 KiB
2023-12-13T08:11:31.503996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum100000
5-th percentile1100000
Q13800000
median6600000
Q310000000
95-th percentile10000000
Maximum10000000
Range9900000
Interquartile range (IQR)6200000

Descriptive statistics

Standard deviation3187236.4
Coefficient of variation (CV)0.49382584
Kurtosis-1.3119021
Mean6454170.9
Median Absolute Deviation (MAD)3400000
Skewness-0.28273107
Sum5.02909 × 1010
Variance1.0158476 × 1013
MonotonicityNot monotonic
2023-12-13T08:11:31.653200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10000000 2428
31.2%
5000000 613
 
7.9%
3000000 242
 
3.1%
2000000 164
 
2.1%
7000000 143
 
1.8%
6000000 136
 
1.7%
8000000 130
 
1.7%
4000000 118
 
1.5%
9000000 97
 
1.2%
1000000 87
 
1.1%
Other values (90) 3634
46.6%
ValueCountFrequency (%)
100000 9
 
0.1%
200000 23
 
0.3%
300000 16
 
0.2%
400000 33
 
0.4%
500000 41
0.5%
600000 37
0.5%
700000 42
0.5%
800000 45
0.6%
900000 42
0.5%
1000000 87
1.1%
ValueCountFrequency (%)
10000000 2428
31.2%
9900000 27
 
0.3%
9800000 32
 
0.4%
9700000 38
 
0.5%
9600000 40
 
0.5%
9500000 37
 
0.5%
9400000 33
 
0.4%
9300000 40
 
0.5%
9200000 22
 
0.3%
9100000 22
 
0.3%

Interactions

2023-12-13T08:11:30.245862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:11:29.761131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:11:30.357822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:11:29.868543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:11:31.734509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청연령대부용도대부금액
신청연령1.0000.1460.232
대부용도0.1461.0000.495
대부금액0.2320.4951.000
2023-12-13T08:11:31.821913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청연령대부금액대부용도
신청연령1.000-0.1670.087
대부금액-0.1671.0000.319
대부용도0.0870.3191.000

Missing values

2023-12-13T08:11:30.492414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:11:30.586363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

신청연령접수일대부용도대부금액
0602021-07-01전월세보증금10000000
1602021-07-01의료비5400000
2602021-07-01전월세보증금9500000
3602021-07-01전월세보증금10000000
4602021-07-01의료비2200000
5602021-07-02전월세보증금10000000
6602021-07-02전월세보증금5000000
7602021-07-05전월세보증금4000000
8602021-07-05의료비1700000
9602021-07-05의료비4100000
신청연령접수일대부용도대부금액
7782852022-03-04의료비800000
7783852022-04-19의료비200000
7784862022-02-04의료비2500000
7785872021-09-09의료비2700000
7786872021-09-15의료비3300000
7787872021-12-07의료비1800000
7788872022-02-15전월세보증금10000000
7789882021-08-27배우자장제비7000000
7790882021-09-09전월세보증금3600000
7791892021-12-15전월세보증금7300000

Duplicate rows

Most frequently occurring

신청연령접수일대부용도대부금액# duplicates
137622021-10-27전월세보증금1000000011
135622021-10-25전월세보증금100000006
87622021-07-02전월세보증금100000005
97622021-07-21전월세보증금100000005
140622021-11-03전월세보증금100000005
172622022-01-03전월세보증금100000005
199622022-03-02전월세보증금100000005
216622022-04-04전월세보증금100000005
217622022-04-05전월세보증금100000005
226622022-04-28전월세보증금100000005