Overview

Dataset statistics

Number of variables5
Number of observations372
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.8 KiB
Average record size in memory43.4 B

Variable types

Numeric3
Categorical1
Text1

Dataset

Description한국주택금융공사 주택연금부 업무 관련 공개 공공데이터 (해당 부서의 업무와 관련된 데이터베이스에서 공개 가능한 원천 데이터)
Author한국주택금융공사
URLhttps://www.data.go.kr/data/15073059/fileData.do

Alerts

STAR_DY is highly overall correlated with END_DY and 1 other fieldsHigh correlation
END_DY is highly overall correlated with STAR_DY and 1 other fieldsHigh correlation
LAWCO_DONG_CD is highly overall correlated with CTRL_BRCDHigh correlation
CTRL_BRCD is highly overall correlated with STAR_DY and 2 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 13:12:18.137527
Analysis finished2023-12-12 13:12:19.786376
Duration1.65 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

STAR_DY
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20134840
Minimum20110101
Maximum20200410
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.4 KiB
2023-12-12T22:12:19.850423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20110101
5-th percentile20110101
Q120120101
median20120305
Q320130304
95-th percentile20190101
Maximum20200410
Range90309
Interquartile range (IQR)10203

Descriptive statistics

Standard deviation28242.166
Coefficient of variation (CV)0.0014026516
Kurtosis-0.061362424
Mean20134840
Median Absolute Deviation (MAD)9920
Skewness1.2310988
Sum7.4901604 × 109
Variance7.9761993 × 108
MonotonicityNot monotonic
2023-12-12T22:12:19.967676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
20110101 73
19.6%
20120101 73
19.6%
20120305 73
19.6%
20180716 34
9.1%
20190101 28
 
7.5%
20130225 24
 
6.5%
20130304 22
 
5.9%
20130218 17
 
4.6%
20150316 13
 
3.5%
20200410 12
 
3.2%
Other values (2) 3
 
0.8%
ValueCountFrequency (%)
20110101 73
19.6%
20120101 73
19.6%
20120305 73
19.6%
20130218 17
 
4.6%
20130225 24
 
6.5%
20130304 22
 
5.9%
20150316 13
 
3.5%
20160106 1
 
0.3%
20180716 34
9.1%
20190101 28
 
7.5%
ValueCountFrequency (%)
20200410 12
 
3.2%
20190627 2
 
0.5%
20190101 28
 
7.5%
20180716 34
9.1%
20160106 1
 
0.3%
20150316 13
 
3.5%
20130304 22
 
5.9%
20130225 24
 
6.5%
20130218 17
 
4.6%
20120305 73
19.6%

END_DY
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean59201404
Minimum20111231
Maximum99991231
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.4 KiB
2023-12-12T22:12:20.097909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20111231
5-th percentile20111231
Q120120304
median20200409
Q399991231
95-th percentile99991231
Maximum99991231
Range79880000
Interquartile range (IQR)79870927

Descriptive statistics

Standard deviation39975631
Coefficient of variation (CV)0.67524801
Kurtosis-2.0089633
Mean59201404
Median Absolute Deviation (MAD)89178
Skewness0.043194367
Sum2.2022922 × 1010
Variance1.598051 × 1015
MonotonicityNot monotonic
2023-12-12T22:12:20.240534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
99991231 182
48.9%
20111231 73
19.6%
20120304 73
19.6%
20150316 13
 
3.5%
20200409 12
 
3.2%
20181231 11
 
3.0%
20190626 2
 
0.5%
20180715 2
 
0.5%
20130224 2
 
0.5%
20130217 1
 
0.3%
ValueCountFrequency (%)
20111231 73
19.6%
20120304 73
19.6%
20130217 1
 
0.3%
20130224 2
 
0.5%
20130303 1
 
0.3%
20150316 13
 
3.5%
20180715 2
 
0.5%
20181231 11
 
3.0%
20190626 2
 
0.5%
20200409 12
 
3.2%
ValueCountFrequency (%)
99991231 182
48.9%
20200409 12
 
3.2%
20190626 2
 
0.5%
20181231 11
 
3.0%
20180715 2
 
0.5%
20150316 13
 
3.5%
20130303 1
 
0.3%
20130224 2
 
0.5%
20130217 1
 
0.3%
20120304 73
19.6%

LAWCO_DONG_CD
Real number (ℝ)

HIGH CORRELATION 

Distinct188
Distinct (%)50.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13859.226
Minimum26
Maximum44825
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.4 KiB
2023-12-12T22:12:20.405593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum26
5-th percentile33.75
Q14272.75
median4832
Q311680
95-th percentile41719
Maximum44825
Range44799
Interquartile range (IQR)7407.25

Descriptive statistics

Standard deviation15492.818
Coefficient of variation (CV)1.1178704
Kurtosis-0.49922316
Mean13859.226
Median Absolute Deviation (MAD)4798.5
Skewness1.1237468
Sum5155632
Variance2.4002741 × 108
MonotonicityNot monotonic
2023-12-12T22:12:20.564304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
41830 4
 
1.1%
41500 4
 
1.1%
11215 4
 
1.1%
11740 4
 
1.1%
11290 4
 
1.1%
11230 4
 
1.1%
41730 4
 
1.1%
11200 4
 
1.1%
41820 4
 
1.1%
41310 4
 
1.1%
Other values (178) 332
89.2%
ValueCountFrequency (%)
26 3
0.8%
27 3
0.8%
28 3
0.8%
29 3
0.8%
30 3
0.8%
31 4
1.1%
36 2
0.5%
42 3
0.8%
43 3
0.8%
44 3
0.8%
ValueCountFrequency (%)
44825 1
 
0.3%
41830 4
1.1%
41820 4
1.1%
41810 3
0.8%
41800 3
0.8%
41730 4
1.1%
41710 3
0.8%
41650 3
0.8%
41630 3
0.8%
41610 3
0.8%

CTRL_BRCD
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
TAB
44 
TAA
44 
QAD
35 
TPA
27 
THA
25 
Other values (20)
197 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTQC
2nd rowTQC
3rd rowTQC
4th rowTQC
5th rowTQC

Common Values

ValueCountFrequency (%)
TAB 44
11.8%
TAA 44
11.8%
QAD 35
 
9.4%
TPA 27
 
7.3%
THA 25
 
6.7%
THB 21
 
5.6%
TOA 20
 
5.4%
TQA 20
 
5.4%
TBA 17
 
4.6%
TMA 15
 
4.0%
Other values (15) 104
28.0%

Length

2023-12-12T22:12:20.691440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
tab 44
11.8%
taa 44
11.8%
qad 35
 
9.4%
tpa 27
 
7.3%
tha 25
 
6.7%
thb 21
 
5.6%
toa 20
 
5.4%
tqa 20
 
5.4%
tba 17
 
4.6%
tma 15
 
4.0%
Other values (15) 104
28.0%

ADDR
Text

Distinct186
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-12T22:12:21.123186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length11
Mean length6.8360215
Min length3

Characters and Unicode

Total characters2543
Distinct characters127
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique91 ?
Unique (%)24.5%

Sample

1st row경상북도 울릉군
2nd row경상북도 울진군
3rd row경상북도 봉화군
4th row경상북도 예천군
5th row경상북도 영덕군
ValueCountFrequency (%)
경기 105
 
15.1%
서울 82
 
11.8%
경상북도 38
 
5.5%
경상남도 31
 
4.5%
전라남도 25
 
3.6%
충청남도 21
 
3.0%
강원도 21
 
3.0%
부산광역시 19
 
2.7%
양주군(양주시 6
 
0.9%
제주도 6
 
0.9%
Other values (174) 342
49.1%
2023-12-12T22:12:21.742009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
369
 
14.5%
178
 
7.0%
175
 
6.9%
155
 
6.1%
110
 
4.3%
108
 
4.2%
107
 
4.2%
95
 
3.7%
94
 
3.7%
90
 
3.5%
Other values (117) 1062
41.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2150
84.5%
Space Separator 369
 
14.5%
Close Punctuation 12
 
0.5%
Open Punctuation 12
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
178
 
8.3%
175
 
8.1%
155
 
7.2%
110
 
5.1%
108
 
5.0%
107
 
5.0%
95
 
4.4%
94
 
4.4%
90
 
4.2%
72
 
3.3%
Other values (114) 966
44.9%
Space Separator
ValueCountFrequency (%)
369
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2150
84.5%
Common 393
 
15.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
178
 
8.3%
175
 
8.1%
155
 
7.2%
110
 
5.1%
108
 
5.0%
107
 
5.0%
95
 
4.4%
94
 
4.4%
90
 
4.2%
72
 
3.3%
Other values (114) 966
44.9%
Common
ValueCountFrequency (%)
369
93.9%
) 12
 
3.1%
( 12
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2150
84.5%
ASCII 393
 
15.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
369
93.9%
) 12
 
3.1%
( 12
 
3.1%
Hangul
ValueCountFrequency (%)
178
 
8.3%
175
 
8.1%
155
 
7.2%
110
 
5.1%
108
 
5.0%
107
 
5.0%
95
 
4.4%
94
 
4.4%
90
 
4.2%
72
 
3.3%
Other values (114) 966
44.9%

Interactions

2023-12-12T22:12:19.174190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:12:18.373936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:12:18.817204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:12:19.315559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:12:18.517147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:12:18.920791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:12:19.455506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:12:18.661201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:12:19.064836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:12:21.871200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
STAR_DYEND_DYLAWCO_DONG_CDCTRL_BRCD
STAR_DY1.0000.4820.6420.961
END_DY0.4821.0000.3700.568
LAWCO_DONG_CD0.6420.3701.0000.920
CTRL_BRCD0.9610.5680.9201.000
2023-12-12T22:12:21.977073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
STAR_DYEND_DYLAWCO_DONG_CDCTRL_BRCD
STAR_DY1.0000.863-0.2700.741
END_DY0.8631.000-0.2030.544
LAWCO_DONG_CD-0.270-0.2031.0000.741
CTRL_BRCD0.7410.5440.7411.000

Missing values

2023-12-12T22:12:19.591928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:12:19.743845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

STAR_DYEND_DYLAWCO_DONG_CDCTRL_BRCDADDR
020200410999912314794TQC경상북도 울릉군
120200410999912314793TQC경상북도 울진군
220200410999912314792TQC경상북도 봉화군
320200410999912314790TQC경상북도 예천군
420200410999912314777TQC경상북도 영덕군
520200410999912314776TQC경상북도 영양군
620200410999912314775TQC경상북도 청송군
720200410999912314773TQC경상북도 의성군
820200410999912314728TQC경상북도 문경시
920200410999912314725TQC경상북도 상주시
STAR_DYEND_DYLAWCO_DONG_CDCTRL_BRCDADDR
362201203059999123141480QAD경기 파주시
36320120305999912314128QAD경기 고양시
364201203052015031611290QAD서울 성북구
365201203059999123111170QAD서울 용산구
366201203059999123111440QAD서울 마포구
367201203059999123111140QAD서울 중구
368201203059999123111380QAD서울 은평구
369201203052015031611230QAD서울 동대문구
370201203059999123111110QAD서울 종로구
371201203059999123111410QAD서울 서대문구