Overview

Dataset statistics

Number of variables: 5
Number of observations: 500
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 19.7 KiB
Average record size in memory: 40.3 B

Variable types

Text: 1
Boolean: 4

Dataset

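Description: This file contains consent-status data for the CRC consulting program operated by the Korea Credit Guarantee Fund (신용보증기금). It provides fields such as the consent status for each consulting number.
URL: https://www.data.go.kr/data/15121484/fileData.do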

Alerts

선택식별정보동의여부 is highly overall correlated with 선택수집조회동의여부 and 2 other fields (High correlation)
선택수집조회동의여부 is highly overall correlated with 선택식별정보동의여부 and 2 other fields (High correlation)
선택식별동의여부 is highly overall correlated with 선택수집조회동의여부 and 2 other fields (High correlation)
선택제공동의여부 is highly overall correlated with 선택수집조회동의여부 and 2 other fields (High correlation)
선택수집조회동의여부 is highly imbalanced (83.7%) (Imbalance)
선택식별정보동의여부 is highly imbalanced (83.7%) (Imbalance)
선택제공동의여부 is highly imbalanced (83.7%) (Imbalance)
선택식별동의여부 is highly imbalanced (83.7%) (Imbalance)
CRC컨설팅번호 has unique values (Unique)
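
The 83.7% imbalance figure is consistent with an entropy-based score, 1 - H / log2(k), where H is the Shannon entropy of the class frequencies and k is the number of classes. The sketch below recomputes it from the 488 / 12 split reported for each consent flag. The function name imbalance_score is illustrative, and the formula is an assumption that happens to reproduce the reported value rather than a statement of how the profiler computes it.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts.

    0 means perfectly balanced; values near 1 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # The score is undefined for a single class; report 0 by convention here.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# 488 True and 12 False, as reported for each consent flag.
score = imbalance_score([488, 12])
print(f"{score:.1%}")
```

A balanced 250 / 250 split would score 0 under the same formula.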

Reproduction

Analysis started: 2023-12-12 10:01:01.746395
Analysis finished: 2023-12-12 10:01:02.176246
Duration: 0.43 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

CRC컨설팅번호
Text

UNIQUE 

Distinct: 500
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 6000
Distinct characters: 35
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
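
As a small illustration of these character-level statistics, the sketch below counts Unicode general categories with Python's standard unicodedata module for the five sample identifiers listed under Sample. The layout of three letters, a four-digit year, the letter R, and a four-digit sequence is inferred from those samples only and is an assumption, as is the helper name category_counts.

```python
import re
import unicodedata
from collections import Counter

# First five identifiers shown in the report's sample rows.
SAMPLE_IDS = ["THS2019R0007", "TAI2019R0007", "TAM2019R0020", "THJ2019R0012", "TIA2019R0012"]

# Assumed layout, inferred from the samples only.
ID_PATTERN = re.compile(r"[A-Z]{3}\d{4}R\d{4}")

def category_counts(values):
    """Count Unicode general categories (e.g. 'Lu', 'Nd') over all characters."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)

counts = category_counts(SAMPLE_IDS)
print(counts)  # 'Nd' = decimal digits, 'Lu' = uppercase letters
print(all(ID_PATTERN.fullmatch(v) for v in SAMPLE_IDS))
```

Each sampled identifier contributes 8 digits and 4 letters, the same 2:1 ratio as the 4000 / 2000 category split reported for all 500 identifiers.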

Unique

Unique: 500
Unique (%): 100.0%

Sample

1st row: THS2019R0007
2nd row: TAI2019R0007
3rd row: TAM2019R0020
4th row: THJ2019R0012
5th row: TIA2019R0012

Value               Count  Frequency (%)
ths2019r0007        1      0.2%
qac2019r0012        1      0.2%
tap2019r0015        1      0.2%
tbc2019r0002        1      0.2%
toa2019r0005        1      0.2%
tab2020r0007        1      0.2%
thy2019r0007        1      0.2%
tna2019r0003        1      0.2%
tht2019r0014        1      0.2%
tbi2019r0001        1      0.2%
Other values (490)  490    98.0%

Most occurring characters

Value              Count  Frequency (%)
0                  1856   30.9%
2                  701    11.7%
1                  662    11.0%
T                  510    8.5%
R                  507    8.5%
9                  439    7.3%
A                  227    3.8%
B                  127    2.1%
H                  89     1.5%
3                  75     1.2%
Other values (25)  807    13.5%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    4000   66.7%
Uppercase Letter  2000   33.3%

Most frequent character per category

Uppercase Letter
Value              Count  Frequency (%)
T                  510    25.5%
R                  507    25.4%
A                  227    11.3%
B                  127    6.3%
H                  89     4.5%
I                  56     2.8%
P                  53     2.6%
D                  46     2.3%
N                  44     2.2%
C                  44     2.2%
Other values (15)  297    14.8%
Decimal Number
Value  Count  Frequency (%)
0      1856   46.4%
2      701    17.5%
1      662    16.6%
9      439    11.0%
3      75     1.9%
5      62     1.6%
6      56     1.4%
4      55     1.4%
7      53     1.3%
8      41     1.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  4000   66.7%
Latin   2000   33.3%

Most frequent character per script

Latin
Value              Count  Frequency (%)
T                  510    25.5%
R                  507    25.4%
A                  227    11.3%
B                  127    6.3%
H                  89     4.5%
I                  56     2.8%
P                  53     2.6%
D                  46     2.3%
N                  44     2.2%
C                  44     2.2%
Other values (15)  297    14.8%
Common
Value  Count  Frequency (%)
0      1856   46.4%
2      701    17.5%
1      662    16.6%
9      439    11.0%
3      75     1.9%
5      62     1.6%
6      56     1.4%
4      55     1.4%
7      53     1.3%
8      41     1.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  6000   100.0%

Most frequent character per block

ASCII
Value              Count  Frequency (%)
0                  1856   30.9%
2                  701    11.7%
1                  662    11.0%
T                  510    8.5%
R                  507    8.5%
9                  439    7.3%
A                  227    3.8%
B                  127    2.1%
H                  89     1.5%
3                  75     1.2%
Other values (25)  807    13.5%

선택수집조회동의여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 632.0 B

Value  Count  Frequency (%)
True   488    97.6%
False  12     2.4%

선택식별정보동의여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 632.0 B

Value  Count  Frequency (%)
True   488    97.6%
False  12     2.4%

선택제공동의여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 632.0 B

Value  Count  Frequency (%)
True   488    97.6%
False  12     2.4%

선택식별동의여부
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 632.0 B

Value  Count  Frequency (%)
True   488    97.6%
False  12     2.4%

Correlations

선택수집조회동의여부  선택식별정보동의여부  선택제공동의여부  선택식별동의여부
선택수집조회동의여부  1.000  0.998  0.998  0.998
선택식별정보동의여부  0.998  1.000  0.998  0.998
선택제공동의여부  0.998  0.998  1.000  0.998
선택식별동의여부  0.998  0.998  0.998  1.000

선택식별정보동의여부  선택수집조회동의여부  선택식별동의여부  선택제공동의여부
선택식별정보동의여부  1.000  0.957  0.957  0.957
선택수집조회동의여부  0.957  1.000  0.957  0.957
선택식별동의여부  0.957  0.957  1.000  0.957
선택제공동의여부  0.957  0.957  0.957  1.000

선택수집조회동의여부  선택식별정보동의여부  선택제공동의여부  선택식별동의여부
선택수집조회동의여부  1.000  0.957  0.957  0.957
선택식별정보동의여부  0.957  1.000  0.957  0.957
선택제공동의여부  0.957  0.957  1.000  0.957
선택식별동의여부  0.957  0.957  0.957  1.000
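
The report text does not say which coefficient each matrix shows. One candidate that reproduces the 0.957 entries is a bias-corrected Cramér's V computed from a chi-squared statistic with Yates' continuity correction, under the further assumption that any two consent flags agree on every row (488 True, 12 False). Both the choice of coefficient and the row-level agreement are assumptions, and corrected_cramers_v is an illustrative name.

```python
import math

def corrected_cramers_v(table):
    """Bias-corrected Cramér's V for a 2x2 table, using Yates' continuity correction."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            # Yates: shrink each |observed - expected| by 0.5, never below zero.
            diff = max(abs(observed - expected) - 0.5, 0.0)
            chi2 += diff * diff / expected
    phi2 = chi2 / n
    r = k = 2  # number of rows and columns in the table
    # Bias correction for finite sample size.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))

# Hypothetical contingency table for two flags that agree on every row.
identical = [[488, 0], [0, 12]]
print(round(corrected_cramers_v(identical), 3))
```

With so few False rows, the continuity correction alone is enough to pull the coefficient below 1 even when two columns agree completely.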

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

     CRC컨설팅번호  선택수집조회동의여부  선택식별정보동의여부  선택제공동의여부  선택식별동의여부
0    THS2019R0007  Y  Y  Y  Y
1    TAI2019R0007  Y  Y  Y  Y
2    TAM2019R0020  Y  Y  Y  Y
3    THJ2019R0012  Y  Y  Y  Y
4    TIA2019R0012  Y  Y  Y  Y
5    TAB2020R0009  Y  Y  Y  Y
6    TMK2019R0003  Y  Y  Y  Y
7    TLA2019R0006  Y  Y  Y  Y
8    TPB2019R0005  Y  Y  Y  Y
9    TJD2019R0006  Y  Y  Y  Y

     CRC컨설팅번호  선택수집조회동의여부  선택식별정보동의여부  선택제공동의여부  선택식별동의여부
490  TIE2020R0005  Y  Y  Y  Y
491  TAB2021R0056  Y  Y  Y  Y
492  TAW2020R0001  Y  Y  Y  Y
493  TQJ2020R0007  Y  Y  Y  Y
494  TPB2021R0008  Y  Y  Y  Y
495  TAA2023R0006  Y  Y  Y  Y
496  TBC2021R0005  Y  Y  Y  Y
497  TQJ2020R0008  Y  Y  Y  Y
498  TAA2022R0017  Y  Y  Y  Y
499  TLE2023R0001  Y  Y  Y  Y
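
The sample rows show the consent flags as Y, while the variable summaries report them as Boolean True / False. A minimal sketch of that mapping with the standard csv module follows. The comma-separated layout, the treatment of any value other than Y as False, and the helper name parse_rows are assumptions, since the report does not show the raw file.

```python
import csv
import io

# Column names as they appear in the report; the CSV layout itself is an assumption.
COLUMNS = [
    "CRC컨설팅번호",
    "선택수집조회동의여부",
    "선택식별정보동의여부",
    "선택제공동의여부",
    "선택식별동의여부",
]

# Two rows copied from the sample, written as hypothetical CSV text.
RAW = "\n".join([
    ",".join(COLUMNS),
    "THS2019R0007,Y,Y,Y,Y",
    "TLE2023R0001,Y,Y,Y,Y",
])

def parse_rows(text):
    """Read CSV text and convert the Y/N consent flags to booleans."""
    rows = []
    for record in csv.DictReader(io.StringIO(text)):
        row = {COLUMNS[0]: record[COLUMNS[0]]}
        for name in COLUMNS[1:]:
            # Assumption: "Y" means consent given; anything else maps to False.
            row[name] = record[name].strip().upper() == "Y"
        rows.append(row)
    return rows

rows = parse_rows(RAW)
print(rows[0])
```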