Overview

Dataset statistics

Number of variables3
Number of observations1000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory25.5 KiB
Average record size in memory26.1 B

Variable types

Numeric2
Text1

Dataset

Description한국주택금융공사 주택연금부 보증료수납대사미결 업무 관련 공개 공공데이터 (해당 부서의 업무와 관련된 데이터베이스에서 공개 가능한 원천 데이터)
Author한국주택금융공사
URLhttps://www.data.go.kr/data/15072862/fileData.do

Alerts

TRD_DY is highly overall correlated with BANK_CDHigh correlation
BANK_CD is highly overall correlated with TRD_DYHigh correlation

Reproduction

Analysis started2023-12-12 15:25:08.473614
Analysis finished2023-12-12 15:25:09.411274
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

TRD_DY
Real number (ℝ)

HIGH CORRELATION 

Distinct164
Distinct (%)16.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20193308
Minimum20160318
Maximum20201023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-13T00:25:09.485630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20160318
5-th percentile20170924
Q120180430
median20201023
Q320201023
95-th percentile20201023
Maximum20201023
Range40705
Interquartile range (IQR)20593

Descriptive statistics

Standard deviation11193.575
Coefficient of variation (CV)0.000554321
Kurtosis0.30577299
Mean20193308
Median Absolute Deviation (MAD)0
Skewness-1.1798751
Sum2.0193308 × 1010
Variance1.2529612 × 108
MonotonicityNot monotonic
2023-12-13T00:25:09.617733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20201023 584
58.4%
20180430 178
 
17.8%
20161031 5
 
0.5%
20190531 4
 
0.4%
20171030 3
 
0.3%
20200228 2
 
0.2%
20200729 2
 
0.2%
20190215 2
 
0.2%
20181218 2
 
0.2%
20190103 2
 
0.2%
Other values (154) 216
 
21.6%
ValueCountFrequency (%)
20160318 2
0.2%
20160321 2
0.2%
20160411 2
0.2%
20160427 2
0.2%
20160429 2
0.2%
20160513 1
0.1%
20160527 1
0.1%
20160712 1
0.1%
20160714 1
0.1%
20160808 1
0.1%
ValueCountFrequency (%)
20201023 584
58.4%
20201014 2
 
0.2%
20201008 1
 
0.1%
20200914 1
 
0.1%
20200911 1
 
0.1%
20200909 2
 
0.2%
20200831 2
 
0.2%
20200828 1
 
0.1%
20200824 1
 
0.1%
20200729 2
 
0.2%

BANK_CD
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.275
Minimum3
Maximum88
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2023-12-13T00:25:09.743718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile4
Q14
median4
Q320
95-th percentile39
Maximum88
Range85
Interquartile range (IQR)16

Descriptive statistics

Standard deviation14.655926
Coefficient of variation (CV)1.0266848
Kurtosis3.9839216
Mean14.275
Median Absolute Deviation (MAD)0
Skewness1.6439165
Sum14275
Variance214.79617
MonotonicityNot monotonic
2023-12-13T00:25:09.835646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
4 591
59.1%
20 178
 
17.8%
31 108
 
10.8%
39 102
 
10.2%
88 7
 
0.7%
3 6
 
0.6%
81 3
 
0.3%
34 2
 
0.2%
11 1
 
0.1%
37 1
 
0.1%
ValueCountFrequency (%)
3 6
 
0.6%
4 591
59.1%
11 1
 
0.1%
20 178
 
17.8%
31 108
 
10.8%
32 1
 
0.1%
34 2
 
0.2%
37 1
 
0.1%
39 102
 
10.2%
81 3
 
0.3%
ValueCountFrequency (%)
88 7
 
0.7%
81 3
 
0.3%
39 102
 
10.2%
37 1
 
0.1%
34 2
 
0.2%
32 1
 
0.1%
31 108
 
10.8%
20 178
 
17.8%
11 1
 
0.1%
4 591
59.1%
Distinct780
Distinct (%)78.0%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2023-12-13T00:25:10.061905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length11.574
Min length2

Characters and Unicode

Total characters11574
Distinct characters73
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique773 ?
Unique (%)77.3%

Sample

1st rowRTHO2017000277
2nd rowRTHO2017000276
3rd rowRTHO2017000274
4th row하나
5th rowRTHO2017000208
ValueCountFrequency (%)
하나 104
 
10.4%
실시간이체 101
 
10.1%
ibk개인여신부 6
 
0.6%
kb개인여신상품 6
 
0.6%
신한은행 5
 
0.5%
주)하나은행 3
 
0.3%
여신정책부 2
 
0.2%
rtha2020000402 1
 
0.1%
rtha2020000330 1
 
0.1%
rtha2020000333 1
 
0.1%
Other values (772) 772
77.0%
2023-12-13T00:25:10.419774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3419
29.5%
2 1152
 
10.0%
1 969
 
8.4%
R 760
 
6.6%
T 567
 
4.9%
H 501
 
4.3%
A 377
 
3.3%
6 357
 
3.1%
7 328
 
2.8%
4 298
 
2.6%
Other values (63) 2846
24.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7596
65.6%
Uppercase Letter 3072
26.5%
Other Letter 890
 
7.7%
Close Punctuation 7
 
0.1%
Open Punctuation 6
 
0.1%
Space Separator 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
107
12.0%
107
12.0%
102
11.5%
101
11.3%
101
11.3%
101
11.3%
101
11.3%
21
 
2.4%
16
 
1.8%
15
 
1.7%
Other values (32) 118
13.3%
Uppercase Letter
ValueCountFrequency (%)
R 760
24.7%
T 567
18.5%
H 501
16.3%
A 377
12.3%
B 266
 
8.7%
D 227
 
7.4%
Q 194
 
6.3%
O 144
 
4.7%
K 13
 
0.4%
I 6
 
0.2%
Other values (7) 17
 
0.6%
Decimal Number
ValueCountFrequency (%)
0 3419
45.0%
2 1152
 
15.2%
1 969
 
12.8%
6 357
 
4.7%
7 328
 
4.3%
4 298
 
3.9%
9 290
 
3.8%
5 278
 
3.7%
3 277
 
3.6%
8 228
 
3.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7612
65.8%
Latin 3072
26.5%
Hangul 890
 
7.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
107
12.0%
107
12.0%
102
11.5%
101
11.3%
101
11.3%
101
11.3%
101
11.3%
21
 
2.4%
16
 
1.8%
15
 
1.7%
Other values (32) 118
13.3%
Latin
ValueCountFrequency (%)
R 760
24.7%
T 567
18.5%
H 501
16.3%
A 377
12.3%
B 266
 
8.7%
D 227
 
7.4%
Q 194
 
6.3%
O 144
 
4.7%
K 13
 
0.4%
I 6
 
0.2%
Other values (7) 17
 
0.6%
Common
ValueCountFrequency (%)
0 3419
44.9%
2 1152
 
15.1%
1 969
 
12.7%
6 357
 
4.7%
7 328
 
4.3%
4 298
 
3.9%
9 290
 
3.8%
5 278
 
3.7%
3 277
 
3.6%
8 228
 
3.0%
Other values (4) 16
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10684
92.3%
Hangul 890
 
7.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3419
32.0%
2 1152
 
10.8%
1 969
 
9.1%
R 760
 
7.1%
T 567
 
5.3%
H 501
 
4.7%
A 377
 
3.5%
6 357
 
3.3%
7 328
 
3.1%
4 298
 
2.8%
Other values (21) 1956
18.3%
Hangul
ValueCountFrequency (%)
107
12.0%
107
12.0%
102
11.5%
101
11.3%
101
11.3%
101
11.3%
101
11.3%
21
 
2.4%
16
 
1.8%
15
 
1.7%
Other values (32) 118
13.3%

Interactions

2023-12-13T00:25:08.791877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:25:08.584445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:25:08.894372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:25:08.698347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:25:10.509960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
TRD_DYBANK_CD
TRD_DY1.0000.740
BANK_CD0.7401.000
2023-12-13T00:25:10.581740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
TRD_DYBANK_CD
TRD_DY1.000-0.864
BANK_CD-0.8641.000

Missing values

2023-12-13T00:25:08.982568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:25:09.385766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

TRD_DYBANK_CDGUARNT_NO
0202010234RTHO2017000277
1202010234RTHO2017000276
2202010234RTHO2017000274
32018041731하나
4202010234RTHO2017000208
5202010234RTHO2017000157
6202010234RTHO2017000122
7202010234RTHO2017000100
8202010234RTHO2017000097
9202010234RTHO2017000094
TRD_DYBANK_CDGUARNT_NO
9902019111139실시간이체
9912019110739실시간이체
9922019110539실시간이체
9932019110139실시간이체
9942017060239실시간이체
9952017060231하나
9962016110131하나
997202010234RTAB2018000020
998202010234RTAB2018000006
999202010234RTAB2017000994