Overview

Dataset statistics

Number of variables: 4
Number of observations: 44
Missing cells: 2
Missing cells (%): 1.1%
Duplicate rows: 6
Duplicate rows (%): 13.6%
Total size in memory: 1.5 KiB
Average record size in memory: 36.0 B
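
The two percentages above follow from the counts. The sketch below is a minimal check, assuming (as the figures suggest) that the missing-cell share is taken over all cells (observations × variables) and the duplicate share over observations; the variable names are illustrative only.

```python
# Counts copied from the dataset statistics above.
n_variables = 4
n_observations = 44
missing_cells = 2
duplicate_rows = 6

# Assumption: missing cells are divided by the total number of cells,
# duplicate rows by the number of observations.
missing_pct = 100 * missing_cells / (n_variables * n_observations)
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```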

Variable types

Categorical: 2
Boolean: 1
Numeric: 1

Dataset

Description: TOP message reception status management (data preparation reference date, repeat provision round, reception date, etc.)
Author: 한국주택금융공사 (Korea Housing Finance Corporation)
URL: https://www.data.go.kr/data/15073112/fileData.do

Alerts

Dataset has 6 (13.6%) duplicate rows [Duplicates]
등록일시 is highly overall correlated with 실행여부 [High correlation]
실행여부 is highly overall correlated with 표준목록코드 and 1 other field [High correlation]
뷰순번 is highly overall correlated with 표준목록코드 [High correlation]
표준목록코드 is highly overall correlated with 뷰순번 and 1 other field [High correlation]
실행여부 is highly imbalanced (73.3%) [Imbalance]
뷰순번 has 2 (4.5%) missing values [Missing]
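
The 73.3% imbalance figure for 실행여부 can be reproduced from its class counts (42 True, 2 False). The sketch below assumes the score is one minus the Shannon entropy normalized by the maximum entropy for the number of classes. The report itself does not state the formula, so this is an assumption that happens to match the reported value, and `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 minus the normalized Shannon entropy of the class counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)


# Class counts of 실행여부 as listed in this report: 42 True, 2 False.
score = imbalance_score([42, 2])
print(f"{score:.1%}")
```

A perfectly balanced column, for example `imbalance_score([22, 22])`, scores 0 under this definition.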

Reproduction

Analysis started: 2023-12-12 10:35:46.462041
Analysis finished: 2023-12-12 10:35:47.059562
Duration: 0.6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
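
A report of this kind is typically generated with a few lines of Python. The sketch below is not the configuration actually used (that lives in config.json, which is not reproduced here). The input file name `data.csv`, the report title, and the output path are assumptions; the metadata values are copied from the Dataset section.

```python
# Hypothetical input file name; the report does not state which file was profiled.
CSV_PATH = "data.csv"

# Dataset metadata as shown in the "Dataset" section of this report.
DATASET_INFO = {
    "description": "TOP전문수신현황관리(자료작성기준일자,반복제공회차,수신일자 등)",
    "author": "한국주택금융공사",
    "url": "https://www.data.go.kr/data/15073112/fileData.do",
}


def build_report(csv_path=CSV_PATH, output_path="report.html"):
    """Profile a CSV file and write an HTML report (sketch, default settings)."""
    # Imported lazily so this module loads even without pandas / ydata-profiling.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Profiling report", dataset=DATASET_INFO)
    report.to_file(output_path)
    return report
```

Calling `build_report()` requires pandas and ydata-profiling to be installed and the CSV file to exist locally.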

Variables

표준목록코드 (standard list code)
Categorical

HIGH CORRELATION 

Distinct: 18
Distinct (%): 40.9%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B
Top categories (bar chart): DMPD0101, SMRM0101, SRDM0101, SMRM0701, DMCF0201; other values (13): 24

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 5
Unique (%): 11.4%

Sample

1st row: DMPD0501
2nd row: DMCF0201
3rd row: DMSD0201
4th row: DMCF0101
5th row: DMPD0301

Common Values

Value             Count  Frequency (%)
DMPD0101          6      13.6%
SMRM0101          4      9.1%
SRDM0101          4      9.1%
SMRM0701          3      6.8%
DMCF0201          3      6.8%
SLDM0201          3      6.8%
SRDM3162          3      6.8%
SMRD0101          3      6.8%
DMPD0301          2      4.5%
DMCF0101          2      4.5%
Other values (8)  11     25.0%

Length

[Figure: Histogram of lengths of the category]

Value             Count  Frequency (%)
dmpd0101          6      13.6%
srdm0101          4      9.1%
smrm0101          4      9.1%
smrm0701          3      6.8%
dmcf0201          3      6.8%
sldm0201          3      6.8%
srdm3162          3      6.8%
smrd0101          3      6.8%
dmpd0201          2      4.5%
smrm1001          2      4.5%
Other values (8)  11     25.0%

실행여부 (execution flag)
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 4.5%
Missing: 0
Missing (%): 0.0%
Memory size: 176.0 B

Value  Count  Frequency (%)
True   42     95.5%
False  2      4.5%

뷰순번 (view sequence number)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 9
Distinct (%): 21.4%
Missing: 2
Missing (%): 4.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.452381
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 528.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
Median: 4
Q3: 6
95-th percentile: 9
Maximum: 9
Range: 8
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 2.5394678
Coefficient of variation (CV): 0.57036175
Kurtosis: -0.98440468
Mean: 4.452381
Median Absolute Deviation (MAD): 2
Skewness: 0.30693467
Sum: 187
Variance: 6.4488966
Monotonicity: Not monotonic
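
Several of these figures can be recomputed from the value counts reported for 뷰순번. The sketch below uses only the Python standard library. It assumes the variance uses the n − 1 (sample) denominator and that quartiles use linear interpolation over the sorted values, both of which match the reported numbers; skewness and kurtosis are left out.

```python
import math
import statistics

# Value -> count, copied from the frequency table for 뷰순번 (the 2 missing values excluded).
value_counts = {1: 6, 2: 6, 3: 5, 4: 5, 5: 6, 6: 4, 7: 4, 8: 2, 9: 4}

# Expand the frequency table back into the 42 non-missing observations.
values = sorted(v for v, n in value_counts.items() for _ in range(n))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = math.sqrt(variance)
cv = std / mean                         # coefficient of variation
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
mad = statistics.median(abs(v - median) for v in values)

print(total, round(mean, 6), round(variance, 7), round(std, 7), round(cv, 8))
print(q1, median, q3, q3 - q1, mad)
```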
[Figure: Histogram with fixed size bins (bins=9)]

Value      Count  Frequency (%)
5          6      13.6%
1          6      13.6%
2          6      13.6%
3          5      11.4%
4          5      11.4%
6          4      9.1%
7          4      9.1%
9          4      9.1%
8          2      4.5%
(Missing)  2      4.5%

Value  Count  Frequency (%)
1      6      13.6%
2      6      13.6%
3      5      11.4%
4      5      11.4%
5      6      13.6%
6      4      9.1%
7      4      9.1%
8      2      4.5%
9      4      9.1%

Value  Count  Frequency (%)
9      4      9.1%
8      2      4.5%
7      4      9.1%
6      4      9.1%
5      6      13.6%
4      5      11.4%
3      5      11.4%
2      6      13.6%
1      6      13.6%

등록일시 (registration date and time)
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 13.6%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B
Top categories (bar chart): 2010-06-10 15:28 (21), 2010-09-29 13:09, 2010-09-29 13:06, 2011-11-22 11:39, 2010-09-01 7:34 (2)

Length

Max length: 16
Median length: 16
Mean length: 15.954545
Min length: 15

Unique

Unique: 1
Unique (%): 2.3%

Sample

1st row: 2010-06-10 15:28
2nd row: 2010-06-10 15:28
3rd row: 2010-06-10 15:28
4th row: 2010-06-10 15:28
5th row: 2010-06-10 15:28

Common Values

Value             Count  Frequency (%)
2010-06-10 15:28  21     47.7%
2010-09-29 13:09  8      18.2%
2010-09-29 13:06  6      13.6%
2011-11-22 11:39  6      13.6%
2010-09-01 7:34   2      4.5%
2010-09-29 13:05  1      2.3%
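
The minimum length of 15 and the mean length of 15.954545 come from the two `2010-09-01 7:34` entries, whose hour is not zero-padded. The sketch below recomputes the length statistics from the counts above. It also shows one way the strings could be parsed as timestamps, since the column is profiled as Categorical; the format string is an assumption based on the six values listed.

```python
from datetime import datetime

# Value -> count, copied from the Common Values table for 등록일시.
value_counts = {
    "2010-06-10 15:28": 21,
    "2010-09-29 13:09": 8,
    "2010-09-29 13:06": 6,
    "2011-11-22 11:39": 6,
    "2010-09-01 7:34": 2,
    "2010-09-29 13:05": 1,
}

total = sum(value_counts.values())
mean_length = sum(len(v) * n for v, n in value_counts.items()) / total
min_length = min(len(v) for v in value_counts)
max_length = max(len(v) for v in value_counts)

# strptime's %H accepts a one-digit hour, so "7:34" parses without padding.
parsed = {v: datetime.strptime(v, "%Y-%m-%d %H:%M") for v in value_counts}

print(total, min_length, max_length, round(mean_length, 6))
```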

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Value       Count  Frequency (%)
2010-06-10  21     23.9%
15:28       21     23.9%
2010-09-29  15     17.0%
13:09       8      9.1%
13:06       6      6.8%
2011-11-22  6      6.8%
11:39       6      6.8%
2010-09-01  2      2.3%
7:34        2      2.3%
13:05       1      1.1%

Interactions

[Figure: interactions plot]

Correlations

              표준목록코드  실행여부  뷰순번  등록일시
표준목록코드  1.000         0.802     0.953   0.000
실행여부      0.802         1.000     0.396   1.000
뷰순번        0.953         0.396     1.000   0.147
등록일시      0.000         1.000     0.147   1.000

              등록일시  표준목록코드  실행여부
등록일시      1.000     0.000         0.951
표준목록코드  0.000     1.000         0.511
실행여부      0.951     0.511         1.000

              뷰순번  표준목록코드  실행여부  등록일시
뷰순번        1.000   0.721         0.354     0.000
표준목록코드  0.721   1.000         0.511     0.000
실행여부      0.354   0.511         1.000     0.951
등록일시      0.000   0.000         0.951     1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

    표준목록코드  실행여부  뷰순번  등록일시
0   DMPD0501      Y         6       2010-06-10 15:28
1   DMCF0201      Y         6       2010-06-10 15:28
2   DMSD0201      Y         <NA>    2010-06-10 15:28
3   DMCF0101      Y         5       2010-06-10 15:28
4   DMPD0301      Y         3       2010-06-10 15:28
5   DMPD0101      Y         1       2010-06-10 15:28
6   DMPD0201      Y         2       2010-06-10 15:28
7   DMPD0401      Y         4       2010-06-10 15:28
8   DMPD0301      Y         3       2010-06-10 15:28
9   TMRM0801      Y         7       2010-06-10 15:28

    표준목록코드  실행여부  뷰순번  등록일시
34  DMPD0101      Y         7       2010-09-29 13:09
35  DMCF0201      Y         6       2010-09-29 13:09
36  SMRM0101      Y         1       2010-09-29 13:09
37  SMRM1001      Y         2       2010-09-29 13:09
38  SRDM0101      Y         5       2011-11-22 11:39
39  SMRM0101      Y         1       2011-11-22 11:39
40  SMRM1001      Y         2       2011-11-22 11:39
41  SLDM0201      Y         9       2011-11-22 11:39
42  DMPD0101      Y         7       2011-11-22 11:39
43  SRDM3162      Y         9       2011-11-22 11:39

Duplicate rows

Most frequently occurring

   표준목록코드  실행여부  뷰순번  등록일시          # duplicates
1  DMPD0101      Y         1       2010-06-10 15:28  3
0  DMCF0101      Y         5       2010-06-10 15:28  2
2  DMPD0201      Y         2       2010-06-10 15:28  2
3  DMPD0301      Y         3       2010-06-10 15:28  2
4  DMPD0401      Y         4       2010-06-10 15:28  2
5  SMRM0701      Y         3       2010-06-10 15:28  2
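
Duplicate groups like these can be found by counting identical rows. The sketch below applies that idea to rows 0 to 9 from the Sample section only, because the full 44-row dataset is not reproduced in this report. It therefore finds a single duplicated row, not all six groups. `None` stands in for the `<NA>` value.

```python
from collections import Counter

# Rows 0-9 from the Sample section: (표준목록코드, 실행여부, 뷰순번, 등록일시).
rows = [
    ("DMPD0501", "Y", 6, "2010-06-10 15:28"),
    ("DMCF0201", "Y", 6, "2010-06-10 15:28"),
    ("DMSD0201", "Y", None, "2010-06-10 15:28"),
    ("DMCF0101", "Y", 5, "2010-06-10 15:28"),
    ("DMPD0301", "Y", 3, "2010-06-10 15:28"),
    ("DMPD0101", "Y", 1, "2010-06-10 15:28"),
    ("DMPD0201", "Y", 2, "2010-06-10 15:28"),
    ("DMPD0401", "Y", 4, "2010-06-10 15:28"),
    ("DMPD0301", "Y", 3, "2010-06-10 15:28"),
    ("TMRM0801", "Y", 7, "2010-06-10 15:28"),
]

# Count how often each full row occurs and keep only rows seen more than once.
counts = Counter(rows)
duplicates = {row: n for row, n in counts.items() if n > 1}

for row, n in duplicates.items():
    print(row, n)
```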