Overview

Dataset statistics

Number of variables4
Number of observations465
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.6 KiB
Average record size in memory34.3 B

Variable types

Categorical3
Numeric1

Dataset

Description경기신용보증재단 대위변제발생 내역
Author경기신용보증재단
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=H6QXMPISMEYLHKIP6V8430737958&infSeq=1

Alerts

금액(원) has 6 (1.3%) zerosZeros

Reproduction

Analysis started2024-04-20 18:23:24.433443
Analysis finished2024-04-20 18:23:25.804802
Duration1.37 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

Distinct31
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
가평군
 
15
고양시
 
15
과천시
 
15
광명시
 
15
광주시
 
15
Other values (26)
390 

Length

Max length4
Median length3
Mean length3.0967742
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
가평군 15
 
3.2%
고양시 15
 
3.2%
과천시 15
 
3.2%
광명시 15
 
3.2%
광주시 15
 
3.2%
구리시 15
 
3.2%
군포시 15
 
3.2%
김포시 15
 
3.2%
용인시 15
 
3.2%
의왕시 15
 
3.2%
Other values (21) 315
67.7%

Length

2024-04-21T03:23:25.860407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
가평군 15
 
3.2%
화성시 15
 
3.2%
연천군 15
 
3.2%
여주시 15
 
3.2%
양평군 15
 
3.2%
양주시 15
 
3.2%
안양시 15
 
3.2%
안성시 15
 
3.2%
안산시 15
 
3.2%
시흥시 15
 
3.2%
Other values (21) 315
67.7%

연도
Categorical

Distinct5
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2019
93 
2020
93 
2021
93 
2022
93 
2023
93 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2019 93
20.0%
2020 93
20.0%
2021 93
20.0%
2022 93
20.0%
2023 93
20.0%

Length

2024-04-21T03:23:25.947349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:23:26.026586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 93
20.0%
2020 93
20.0%
2021 93
20.0%
2022 93
20.0%
2023 93
20.0%

구분
Categorical

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
소상공
155 
일반
155 
합계
155 

Length

Max length3
Median length2
Mean length2.3333333
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row소상공
2nd row일반
3rd row합계
4th row소상공
5th row일반

Common Values

ValueCountFrequency (%)
소상공 155
33.3%
일반 155
33.3%
합계 155
33.3%

Length

2024-04-21T03:23:26.117835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:23:26.197386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소상공 155
33.3%
일반 155
33.3%
합계 155
33.3%

금액(원)
Real number (ℝ)

ZEROS 

Distinct454
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.2371589 × 109
Minimum0
Maximum3.1856025 × 1010
Zeros6
Zeros (%)1.3%
Negative0
Negative (%)0.0%
Memory size4.2 KiB
2024-04-21T03:23:26.289742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1.9562741 × 108
Q19.7822378 × 108
median2.4977912 × 109
Q35.9434191 × 109
95-th percentile1.3089145 × 1010
Maximum3.1856025 × 1010
Range3.1856025 × 1010
Interquartile range (IQR)4.9651954 × 109

Descriptive statistics

Standard deviation4.8297363 × 109
Coefficient of variation (CV)1.1398526
Kurtosis6.9959623
Mean4.2371589 × 109
Median Absolute Deviation (MAD)1.9451277 × 109
Skewness2.3351277
Sum1.9702789 × 1012
Variance2.3326353 × 1019
MonotonicityNot monotonic
2024-04-21T03:23:26.404266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 6
 
1.3%
299156221 2
 
0.4%
429105329 2
 
0.4%
140948065 2
 
0.4%
370377977 2
 
0.4%
2550386359 2
 
0.4%
293541375 2
 
0.4%
2059706273 1
 
0.2%
9148224037 1
 
0.2%
1382477997 1
 
0.2%
Other values (444) 444
95.5%
ValueCountFrequency (%)
0 6
1.3%
20136301 1
 
0.2%
23303053 1
 
0.2%
26130070 1
 
0.2%
27307438 1
 
0.2%
40617893 1
 
0.2%
68663606 1
 
0.2%
71239887 1
 
0.2%
78022978 1
 
0.2%
81032909 1
 
0.2%
ValueCountFrequency (%)
31856025089 1
0.2%
27510180594 1
0.2%
26765851445 1
0.2%
26363386933 1
0.2%
24666682558 1
0.2%
23877357370 1
0.2%
22145843766 1
0.2%
22039153221 1
0.2%
21660938070 1
0.2%
20508280266 1
0.2%

Interactions

2024-04-21T03:23:25.442064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T03:23:26.479067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명연도구분금액(원)
시군명1.0000.0000.0000.498
연도0.0001.0000.0000.454
구분0.0000.0001.0000.499
금액(원)0.4980.4540.4991.000
2024-04-21T03:23:26.556257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분시군명연도
구분1.0000.0000.000
시군명0.0001.0000.000
연도0.0000.0001.000
2024-04-21T03:23:26.626747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
금액(원)시군명연도구분
금액(원)1.0000.1940.2040.344
시군명0.1941.0000.0000.000
연도0.2040.0001.0000.000
구분0.3440.0000.0001.000

Missing values

2024-04-21T03:23:25.708579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T03:23:25.773171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명연도구분금액(원)
0가평군2019소상공299156221
1가평군2019일반0
2가평군2019합계299156221
3가평군2020소상공388592767
4가평군2020일반100183670
5가평군2020합계488776437
6가평군2021소상공293541375
7가평군2021일반0
8가평군2021합계293541375
9가평군2022소상공329875540
시군명연도구분금액(원)
455오산시2023소상공5411174634
456오산시2023일반1304975256
457오산시2023합계6716149890
458용인시2019소상공5391960084
459용인시2019일반3088128321
460용인시2019합계8480088405
461용인시2020소상공5561503213
462용인시2020일반2392059250
463용인시2020합계7953562463
464용인시2021소상공5233799886