Overview

Dataset statistics

Number of variables5
Number of observations59
Missing cells78
Missing cells (%)26.4%
Duplicate rows16
Duplicate rows (%)27.1%
Total size in memory2.4 KiB
Average record size in memory42.2 B

Variable types

Categorical1
Unsupported4

Dataset

Description보증사고원인별 보증공급 현황을 제공
Author울산신용보증재단
URLhttps://www.data.go.kr/data/3076314/fileData.do

Alerts

Dataset has 16 (27.1%) duplicate rowsDuplicates
Unnamed: 1 has 18 (30.5%) missing valuesMissing
Unnamed: 2 has 21 (35.6%) missing valuesMissing
Unnamed: 3 has 21 (35.6%) missing valuesMissing
Unnamed: 4 has 18 (30.5%) missing valuesMissing
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 08:37:16.067250
Analysis finished2023-12-12 08:37:16.445293
Duration0.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct20
Distinct (%)33.9%
Missing0
Missing (%)0.0%
Memory size604.0 B
<NA>
13 
이자연체
 
3
합계
 
3
재단명 : 울산신용보증재단
 
3
휴폐업
 
3
Other values (15)
34 

Length

Max length30
Median length14
Mean length6.6101695
Min length2

Unique

Unique4 ?
Unique (%)6.8%

Sample

1st row(기간 : 2011-01-01 ~ 2011-12-31)
2nd row<NA>
3rd row재단명 : 울산신용보증재단
4th row<NA>
5th row사고원인

Common Values

ValueCountFrequency (%)
<NA> 13
22.0%
이자연체 3
 
5.1%
합계 3
 
5.1%
재단명 : 울산신용보증재단 3
 
5.1%
휴폐업 3
 
5.1%
원리금연체 3
 
5.1%
원금연체 3
 
5.1%
사고원인 3
 
5.1%
신용관리정보등록 3
 
5.1%
기한이익상실 3
 
5.1%
Other values (10) 19
32.2%

Length

2023-12-12T17:37:16.530200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 13
 
16.5%
9
 
11.4%
사고원인 3
 
3.8%
기간 3
 
3.8%
회생및개인회생절차 3
 
3.8%
이자연체 3
 
3.8%
신용회복지원신청 3
 
3.8%
기한이익상실 3
 
3.8%
신용관리정보등록 3
 
3.8%
기타 3
 
3.8%
Other values (17) 33
41.8%

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing18
Missing (%)30.5%
Memory size604.0 B

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)35.6%
Memory size604.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing21
Missing (%)35.6%
Memory size604.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing18
Missing (%)30.5%
Memory size604.0 B

Missing values

2023-12-12T17:37:16.143577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:37:16.250662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T17:37:16.358045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

사고원인별 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4
0(기간 : 2011-01-01 ~ 2011-12-31)NaNNaNNaNNaN
1<NA>NaNNaNNaNNaN
2재단명 : 울산신용보증재단NaNNaNNaN(단위:건,백만원)
3<NA>NaNNaNNaNNaN
4사고원인사고현황NaNNaNNaN
5<NA>건수비율사고금액비율
6원리금연체5810.41543700.3521
7원금연체1820.1324560.1979
8이자연체4000.285734340.2766
9당좌부도30.0021670.0054
사고원인별 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4
49이자연체1710.153422500.211
50당좌부도10.0009180.0017
51신용관리정보등록220.01974050.038
52회생및개인회생절차940.08437380.0692
53휴폐업360.03231970.0185
54기한이익상실650.05837870.0738
55파산면책신청10.0009430.004
56신용회복지원신청160.01431850.0173
57기타60.00541140.0107
58합계11151.0106641

Duplicate rows

Most frequently occurring

사고원인별 현황# duplicates
15<NA>13
0기타3
1기한이익상실3
3사고원인3
5신용관리정보등록3
6신용회복지원신청3
7원금연체3
8원리금연체3
9이자연체3
10재단명 : 울산신용보증재단3