Overview

Dataset statistics

Number of variables4
Number of observations7792
Missing cells0
Missing cells (%)0.0%
Duplicate rows229
Duplicate rows (%)2.9%
Total size in memory251.2 KiB
Average record size in memory33.0 B

Variable types

Text1
DateTime1
Categorical1
Numeric1

Dataset

Description60세 이상 국민연금 수급자의 노후긴급자금 대부심사 결재의뢰 내역(지사별)(지사명, 접수일, 대부용도, 대부금액)
URLhttps://www.data.go.kr/data/15044883/fileData.do

Alerts

Dataset has 229 (2.9%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 15:42:34.287150
Analysis finished2023-12-12 15:42:34.902062
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct112
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size61.0 KiB
2023-12-13T00:42:35.228186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length6
Mean length3.2772074
Min length2

Characters and Unicode

Total characters25536
Distinct characters106
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종로중구
2nd row종로중구
3rd row종로중구
4th row종로중구
5th row종로중구
ValueCountFrequency (%)
강동하남 163
 
2.1%
도봉노원 152
 
2.0%
의정부 152
 
2.0%
남인천 150
 
1.9%
화성오산 144
 
1.8%
동대문중랑 142
 
1.8%
부천 140
 
1.8%
창원 134
 
1.7%
강서 130
 
1.7%
남양주 128
 
1.6%
Other values (102) 6357
81.6%
2023-12-13T00:42:36.197259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1572
 
6.2%
1506
 
5.9%
1132
 
4.4%
1092
 
4.3%
1043
 
4.1%
976
 
3.8%
808
 
3.2%
735
 
2.9%
719
 
2.8%
631
 
2.5%
Other values (96) 15322
60.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25536
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1572
 
6.2%
1506
 
5.9%
1132
 
4.4%
1092
 
4.3%
1043
 
4.1%
976
 
3.8%
808
 
3.2%
735
 
2.9%
719
 
2.8%
631
 
2.5%
Other values (96) 15322
60.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25536
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1572
 
6.2%
1506
 
5.9%
1132
 
4.4%
1092
 
4.3%
1043
 
4.1%
976
 
3.8%
808
 
3.2%
735
 
2.9%
719
 
2.8%
631
 
2.5%
Other values (96) 15322
60.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25536
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1572
 
6.2%
1506
 
5.9%
1132
 
4.4%
1092
 
4.3%
1043
 
4.1%
976
 
3.8%
808
 
3.2%
735
 
2.9%
719
 
2.8%
631
 
2.5%
Other values (96) 15322
60.0%
Distinct247
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size61.0 KiB
Minimum2021-07-01 00:00:00
Maximum2022-06-30 00:00:00
2023-12-13T00:42:36.397731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:42:36.597393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

대부용도
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size61.0 KiB
전월세보증금
4330 
의료비
3285 
배우자장제비
 
158
재해복구비
 
19

Length

Max length6
Median length6
Mean length4.7328029
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전월세보증금
2nd row의료비
3rd row의료비
4th row전월세보증금
5th row의료비

Common Values

ValueCountFrequency (%)
전월세보증금 4330
55.6%
의료비 3285
42.2%
배우자장제비 158
 
2.0%
재해복구비 19
 
0.2%

Length

2023-12-13T00:42:36.793818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:42:36.974391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전월세보증금 4330
55.6%
의료비 3285
42.2%
배우자장제비 158
 
2.0%
재해복구비 19
 
0.2%

대부금액
Real number (ℝ)

Distinct100
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6454170.9
Minimum100000
Maximum10000000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size68.6 KiB
2023-12-13T00:42:37.141824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum100000
5-th percentile1100000
Q13800000
median6600000
Q310000000
95-th percentile10000000
Maximum10000000
Range9900000
Interquartile range (IQR)6200000

Descriptive statistics

Standard deviation3187236.4
Coefficient of variation (CV)0.49382584
Kurtosis-1.3119021
Mean6454170.9
Median Absolute Deviation (MAD)3400000
Skewness-0.28273107
Sum5.02909 × 1010
Variance1.0158476 × 1013
MonotonicityNot monotonic
2023-12-13T00:42:37.360791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10000000 2428
31.2%
5000000 613
 
7.9%
3000000 242
 
3.1%
2000000 164
 
2.1%
7000000 143
 
1.8%
6000000 136
 
1.7%
8000000 130
 
1.7%
4000000 118
 
1.5%
9000000 97
 
1.2%
1000000 87
 
1.1%
Other values (90) 3634
46.6%
ValueCountFrequency (%)
100000 9
 
0.1%
200000 23
 
0.3%
300000 16
 
0.2%
400000 33
 
0.4%
500000 41
0.5%
600000 37
0.5%
700000 42
0.5%
800000 45
0.6%
900000 42
0.5%
1000000 87
1.1%
ValueCountFrequency (%)
10000000 2428
31.2%
9900000 27
 
0.3%
9800000 32
 
0.4%
9700000 38
 
0.5%
9600000 40
 
0.5%
9500000 37
 
0.5%
9400000 33
 
0.4%
9300000 40
 
0.5%
9200000 22
 
0.3%
9100000 22
 
0.3%

Interactions

2023-12-13T00:42:34.534746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:42:37.468259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대부용도대부금액
대부용도1.0000.495
대부금액0.4951.000
2023-12-13T00:42:37.564546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대부금액대부용도
대부금액1.0000.319
대부용도0.3191.000

Missing values

2023-12-13T00:42:34.705003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:42:34.838069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지사명접수일대부용도대부금액
0종로중구2021-07-05전월세보증금10000000
1종로중구2021-07-07의료비2900000
2종로중구2021-07-20의료비2700000
3종로중구2021-07-30전월세보증금10000000
4종로중구2021-08-09의료비3100000
5종로중구2021-08-19의료비6100000
6종로중구2021-09-03전월세보증금10000000
7종로중구2021-09-10의료비1700000
8종로중구2021-09-13의료비8000000
9종로중구2021-09-24전월세보증금10000000
지사명접수일대부용도대부금액
7782서귀포2022-02-14전월세보증금10000000
7783서귀포2022-03-04전월세보증금6000000
7784서귀포2022-03-08전월세보증금10000000
7785서귀포2022-03-14전월세보증금4500000
7786서귀포2022-03-21전월세보증금5800000
7787서귀포2022-03-30전월세보증금6800000
7788서귀포2022-05-18의료비1700000
7789서귀포2022-05-18전월세보증금8300000
7790서귀포2022-05-30전월세보증금4000000
7791서귀포2022-06-02전월세보증금10000000

Duplicate rows

Most frequently occurring

지사명접수일대부용도대부금액# duplicates
41고양일산2021-12-17전월세보증금100000004
20경인지역본부2021-08-30전월세보증금100000003
52구로금천2021-10-25전월세보증금100000003
75남부산2022-01-24전월세보증금100000003
114동청주2022-01-03전월세보증금100000003
139서대구2021-11-26전월세보증금100000003
206처인기흥2021-11-29전월세보증금100000003
216파주2022-02-07전월세보증금100000003
224화성오산2021-09-27전월세보증금100000003
225화성오산2021-10-14전월세보증금100000003