Overview

Dataset statistics

Number of variables4
Number of observations3873
Missing cells0
Missing cells (%)0.0%
Duplicate rows358
Duplicate rows (%)9.2%
Total size in memory128.7 KiB
Average record size in memory34.0 B

Variable types

DateTime1
Categorical1
Numeric2

Dataset

Description한국자산관리공사의 국민행복기금 지역별 햇살론 인수정보를 제공합니다. 데이터는 인수일자, 지역, 인수채권금액. 채권잔액으로 이루어져 제공됩니다.
Author한국자산관리공사
URLhttps://www.data.go.kr/data/15087871/fileData.do

Alerts

Dataset has 358 (9.2%) duplicate rowsDuplicates
인수채권금액 is highly overall correlated with 채권잔액High correlation
채권잔액 is highly overall correlated with 인수채권금액High correlation
인수채권금액 has 1523 (39.3%) zerosZeros
채권잔액 has 1534 (39.6%) zerosZeros

Reproduction

Analysis started2023-12-12 04:57:07.288499
Analysis finished2023-12-12 04:57:08.187299
Duration0.9 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct145
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size30.4 KiB
Minimum2020-02-12 00:00:00
Maximum2020-12-31 00:00:00
2023-12-12T13:57:08.279278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:57:08.458727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

지역
Categorical

Distinct17
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size30.4 KiB
경기
1030 
서울
606 
인천
297 
경남
279 
부산
246 
Other values (12)
1415 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제주
2nd row제주
3rd row경남
4th row경남
5th row부산

Common Values

ValueCountFrequency (%)
경기 1030
26.6%
서울 606
15.6%
인천 297
 
7.7%
경남 279
 
7.2%
부산 246
 
6.4%
전북 192
 
5.0%
대구 180
 
4.6%
경북 169
 
4.4%
강원 151
 
3.9%
충북 126
 
3.3%
Other values (7) 597
15.4%

Length

2023-12-12T13:57:08.587348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기 1030
26.6%
서울 606
15.6%
인천 297
 
7.7%
경남 279
 
7.2%
부산 246
 
6.4%
전북 192
 
5.0%
대구 180
 
4.6%
경북 169
 
4.4%
강원 151
 
3.9%
충북 126
 
3.3%
Other values (7) 597
15.4%

인수채권금액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1083
Distinct (%)28.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4242703.6
Minimum0
Maximum14183606
Zeros1523
Zeros (%)39.3%
Negative0
Negative (%)0.0%
Memory size34.2 KiB
2023-12-12T13:57:08.709443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median6328054
Q36877860
95-th percentile10041363
Maximum14183606
Range14183606
Interquartile range (IQR)6877860

Descriptive statistics

Standard deviation3810196.5
Coefficient of variation (CV)0.89805862
Kurtosis-1.1066676
Mean4242703.6
Median Absolute Deviation (MAD)3380397
Skewness0.18128004
Sum1.6431991 × 1010
Variance1.4517597 × 1013
MonotonicityNot monotonic
2023-12-12T13:57:08.871664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1523
39.3%
6866737 100
 
2.6%
6877849 93
 
2.4%
6942873 72
 
1.9%
6800452 65
 
1.7%
6954107 62
 
1.6%
6789466 57
 
1.5%
7017889 55
 
1.4%
6721902 48
 
1.2%
6642180 41
 
1.1%
Other values (1073) 1757
45.4%
ValueCountFrequency (%)
0 1523
39.3%
250610 1
 
< 0.1%
992854 1
 
< 0.1%
1734317 1
 
< 0.1%
1758870 1
 
< 0.1%
1871616 1
 
< 0.1%
1889070 1
 
< 0.1%
1898585 1
 
< 0.1%
1965098 1
 
< 0.1%
1983302 1
 
< 0.1%
ValueCountFrequency (%)
14183606 1
 
< 0.1%
14129234 1
 
< 0.1%
14058491 2
0.1%
14038429 1
 
< 0.1%
14037615 1
 
< 0.1%
14016083 1
 
< 0.1%
13908216 3
0.1%
13908107 1
 
< 0.1%
13908099 1
 
< 0.1%
13905730 1
 
< 0.1%

채권잔액
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct1676
Distinct (%)43.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4071889.9
Minimum0
Maximum14183606
Zeros1534
Zeros (%)39.6%
Negative0
Negative (%)0.0%
Memory size34.2 KiB
2023-12-12T13:57:09.018124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median5497553
Q36800452
95-th percentile9934553.2
Maximum14183606
Range14183606
Interquartile range (IQR)6800452

Descriptive statistics

Standard deviation3701929.6
Coefficient of variation (CV)0.90914286
Kurtosis-1.0404362
Mean4071889.9
Median Absolute Deviation (MAD)2953722
Skewness0.2288337
Sum1.577043 × 1010
Variance1.3704283 × 1013
MonotonicityNot monotonic
2023-12-12T13:57:09.161674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1534
39.6%
6877849 58
 
1.5%
6866737 46
 
1.2%
6800452 39
 
1.0%
7017889 38
 
1.0%
6942873 36
 
0.9%
6954107 34
 
0.9%
6721902 31
 
0.8%
6789466 26
 
0.7%
6642180 23
 
0.6%
Other values (1666) 2008
51.8%
ValueCountFrequency (%)
0 1534
39.6%
187221 1
 
< 0.1%
977584 1
 
< 0.1%
1445768 1
 
< 0.1%
1734317 1
 
< 0.1%
1752856 1
 
< 0.1%
1758870 1
 
< 0.1%
1889070 1
 
< 0.1%
1898585 1
 
< 0.1%
1909146 1
 
< 0.1%
ValueCountFrequency (%)
14183606 1
< 0.1%
14129234 1
< 0.1%
14058491 2
0.1%
14038429 1
< 0.1%
14016083 1
< 0.1%
13908216 2
0.1%
13908107 1
< 0.1%
13905730 1
< 0.1%
13858543 1
< 0.1%
13763222 1
< 0.1%

Interactions

2023-12-12T13:57:07.781894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:57:07.485928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:57:07.914227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:57:07.631702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:57:09.545582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역인수채권금액채권잔액
지역1.0000.0600.052
인수채권금액0.0601.0000.991
채권잔액0.0520.9911.000
2023-12-12T13:57:09.633881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인수채권금액채권잔액지역
인수채권금액1.0000.9640.023
채권잔액0.9641.0000.020
지역0.0230.0201.000

Missing values

2023-12-12T13:57:08.042116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:57:08.142975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인수일자지역인수채권금액채권잔액
02020-02-12제주00
12020-02-12제주70294496703541
22020-02-18경남99697978519297
32020-02-18경남00
42020-03-10부산70164417016441
52020-03-10제주68361836836183
62020-03-10서울71034937103493
72020-03-10서울00
82020-03-10부산00
92020-03-10제주00
인수일자지역인수채권금액채권잔액
38632020-12-29서울27410592680169
38642020-12-29울산00
38652020-12-29경기68004526800452
38662020-12-29부산65390156539015
38672020-12-29서울1080805010808050
38682020-12-30인천94888309488830
38692020-12-30강원88085828808582
38702020-12-30경남1390821613621448
38712020-12-31제주68778496877849
38722020-12-31제주49127494912749

Duplicate rows

Most frequently occurring

인수일자지역인수채권금액채권잔액# duplicates
2172020-10-06경기0016
2352020-10-15경기0015
1312020-09-09경기0012
1822020-09-23경기0012
2262020-10-08경기0012
3282020-12-10경기0012
2602020-10-28경기0011
332020-06-17경기0010
372020-06-29경기0010
1592020-09-17경기0010