Overview

Dataset statistics

Number of variables8
Number of observations21
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory75.1 B

Variable types

DateTime1
Categorical6
Numeric1

Dataset

DescriptionGH 판매 관련 제이자율 데이터로써 적용일, 구분, 선납할인율, 할부이자율, 지연손해금율 등의 정보를 포함하고 있습니다.
Author경기주택도시공사
URLhttps://www.data.go.kr/data/15112591/fileData.do

Alerts

기타 has constant value ""Constant
일반 지연 손해금율(퍼센트) is highly overall correlated with 할부이자율(퍼센트) and 1 other fieldsHigh correlation
선납할인율(퍼센트) is highly overall correlated with 할부이자율(퍼센트) and 3 other fieldsHigh correlation
30일 이상 지연손해금율(퍼센트) is highly overall correlated with 할부이자율(퍼센트) and 2 other fieldsHigh correlation
할부이자율(퍼센트) is highly overall correlated with 선납할인율(퍼센트) and 3 other fieldsHigh correlation
30일 미만 지연손해금율(퍼센트) is highly overall correlated with 할부이자율(퍼센트) and 2 other fieldsHigh correlation

Reproduction

Analysis started2024-03-14 15:39:57.161772
Analysis finished2024-03-14 15:39:58.616840
Duration1.46 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct16
Distinct (%)76.2%
Missing0
Missing (%)0.0%
Memory size296.0 B
Minimum2007-03-01 00:00:00
Maximum2024-01-31 00:00:00
2024-03-15T00:39:58.781059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T00:39:59.171562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)

구분
Categorical

Distinct5
Distinct (%)23.8%
Missing0
Missing (%)0.0%
Memory size296.0 B
LH 공동사업
자체사업
전체
산단
택지

Length

Max length7
Median length4
Mean length4.9047619
Min length2

Unique

Unique2 ?
Unique (%)9.5%

Sample

1st row산단
2nd row택지
3rd row전체
4th rowLH 공동사업
5th row자체사업

Common Values

ValueCountFrequency (%)
LH 공동사업 9
42.9%
자체사업 8
38.1%
전체 2
 
9.5%
산단 1
 
4.8%
택지 1
 
4.8%

Length

2024-03-15T00:39:59.570432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:39:59.898479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
lh 9
30.0%
공동사업 9
30.0%
자체사업 8
26.7%
전체 2
 
6.7%
산단 1
 
3.3%
택지 1
 
3.3%

선납할인율(퍼센트)
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)23.8%
Missing0
Missing (%)0.0%
Memory size296.0 B
2.5
10 
5.0
6.5
3.5
3.0

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row6.5
2nd row6.5
3rd row5.0
4th row3.5
5th row3.5

Common Values

ValueCountFrequency (%)
2.5 10
47.6%
5.0 5
23.8%
6.5 2
 
9.5%
3.5 2
 
9.5%
3.0 2
 
9.5%

Length

2024-03-15T00:40:00.137866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:40:00.408256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2.5 10
47.6%
5.0 5
23.8%
6.5 2
 
9.5%
3.5 2
 
9.5%
3.0 2
 
9.5%

할부이자율(퍼센트)
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)38.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.5142857
Minimum2.3
Maximum5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size317.0 B
2024-03-15T00:40:00.692117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.3
5-th percentile2.3
Q12.5
median3.5
Q34
95-th percentile5
Maximum5
Range2.7
Interquartile range (IQR)1.5

Descriptive statistics

Standard deviation0.93342687
Coefficient of variation (CV)0.26560927
Kurtosis-1.1436478
Mean3.5142857
Median Absolute Deviation (MAD)1
Skewness0.21935362
Sum73.8
Variance0.87128571
MonotonicityNot monotonic
2024-03-15T00:40:01.067733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
3.5 5
23.8%
5.0 3
14.3%
4.0 3
14.3%
2.3 3
14.3%
2.5 3
14.3%
4.5 2
 
9.5%
2.9 1
 
4.8%
3.0 1
 
4.8%
ValueCountFrequency (%)
2.3 3
14.3%
2.5 3
14.3%
2.9 1
 
4.8%
3.0 1
 
4.8%
3.5 5
23.8%
4.0 3
14.3%
4.5 2
 
9.5%
5.0 3
14.3%
ValueCountFrequency (%)
5.0 3
14.3%
4.5 2
 
9.5%
4.0 3
14.3%
3.5 5
23.8%
3.0 1
 
4.8%
2.9 1
 
4.8%
2.5 3
14.3%
2.3 3
14.3%

일반 지연 손해금율(퍼센트)
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)23.8%
Missing0
Missing (%)0.0%
Memory size296.0 B
<NA>
6.5
8.5
5.0
8.0

Length

Max length4
Median length3
Mean length3.3809524
Min length3

Unique

Unique1 ?
Unique (%)4.8%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8
38.1%
6.5 7
33.3%
8.5 3
 
14.3%
5.0 2
 
9.5%
8.0 1
 
4.8%

Length

2024-03-15T00:40:01.466523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:40:01.734194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8
38.1%
6.5 7
33.3%
8.5 3
 
14.3%
5.0 2
 
9.5%
8.0 1
 
4.8%

30일 미만 지연손해금율(퍼센트)
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Memory size296.0 B
<NA>
13 
8.5
7.5
8.7
 
1
11.2
 
1

Length

Max length4
Median length4
Mean length3.6666667
Min length3

Unique

Unique3 ?
Unique (%)14.3%

Sample

1st row8.7
2nd row11.2
3rd row8.5
4th row8.5
5th row8.5

Common Values

ValueCountFrequency (%)
<NA> 13
61.9%
8.5 3
 
14.3%
7.5 2
 
9.5%
8.7 1
 
4.8%
11.2 1
 
4.8%
7.0 1
 
4.8%

Length

2024-03-15T00:40:02.041746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:40:02.401035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 13
61.9%
8.5 3
 
14.3%
7.5 2
 
9.5%
8.7 1
 
4.8%
11.2 1
 
4.8%
7.0 1
 
4.8%

30일 이상 지연손해금율(퍼센트)
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)28.6%
Missing0
Missing (%)0.0%
Memory size296.0 B
<NA>
13 
11.2
11.5
11.0
12.0
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique2 ?
Unique (%)9.5%

Sample

1st row11.2
2nd row11.2
3rd row12.0
4th row11.5
5th row11.5

Common Values

ValueCountFrequency (%)
<NA> 13
61.9%
11.2 2
 
9.5%
11.5 2
 
9.5%
11.0 2
 
9.5%
12.0 1
 
4.8%
10.5 1
 
4.8%

Length

2024-03-15T00:40:02.712878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:40:03.003750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 13
61.9%
11.2 2
 
9.5%
11.5 2
 
9.5%
11.0 2
 
9.5%
12.0 1
 
4.8%
10.5 1
 
4.8%

기타
Categorical

CONSTANT 

Distinct1
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size296.0 B
공란은 해당값 없음
21 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공란은 해당값 없음
2nd row공란은 해당값 없음
3rd row공란은 해당값 없음
4th row공란은 해당값 없음
5th row공란은 해당값 없음

Common Values

ValueCountFrequency (%)
공란은 해당값 없음 21
100.0%

Length

2024-03-15T00:40:03.390119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:40:03.615412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공란은 21
33.3%
해당값 21
33.3%
없음 21
33.3%

Interactions

2024-03-15T00:39:57.550993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T00:40:03.732271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
적용일구분선납할인율(퍼센트)할부이자율(퍼센트)일반 지연 손해금율(퍼센트)30일 미만 지연손해금율(퍼센트)30일 이상 지연손해금율(퍼센트)
적용일1.0000.0001.0000.9710.7920.0001.000
구분0.0001.0000.6590.0000.0000.8010.643
선납할인율(퍼센트)1.0000.6591.0000.8491.0000.9571.000
할부이자율(퍼센트)0.9710.0000.8491.0000.9460.8251.000
일반 지연 손해금율(퍼센트)0.7920.0001.0000.9461.000NaNNaN
30일 미만 지연손해금율(퍼센트)0.0000.8010.9570.825NaN1.0000.957
30일 이상 지연손해금율(퍼센트)1.0000.6431.0001.000NaN0.9571.000
2024-03-15T00:40:04.052845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일반 지연 손해금율(퍼센트)선납할인율(퍼센트)30일 이상 지연손해금율(퍼센트)30일 미만 지연손해금율(퍼센트)구분
일반 지연 손해금율(퍼센트)1.0000.905NaNNaN0.000
선납할인율(퍼센트)0.9051.0001.0000.6450.281
30일 이상 지연손해금율(퍼센트)NaN1.0001.0000.6450.000
30일 미만 지연손해금율(퍼센트)NaN0.6450.6451.0000.210
구분0.0000.2810.0000.2101.000
2024-03-15T00:40:04.338151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
할부이자율(퍼센트)구분선납할인율(퍼센트)일반 지연 손해금율(퍼센트)30일 미만 지연손해금율(퍼센트)30일 이상 지연손해금율(퍼센트)
할부이자율(퍼센트)1.0000.0000.7300.6780.6530.866
구분0.0001.0000.2810.0000.2100.000
선납할인율(퍼센트)0.7300.2811.0000.9050.6451.000
일반 지연 손해금율(퍼센트)0.6780.0000.9051.0000.0000.000
30일 미만 지연손해금율(퍼센트)0.6530.2100.6450.0001.0000.645
30일 이상 지연손해금율(퍼센트)0.8660.0001.0000.0000.6451.000

Missing values

2024-03-15T00:39:57.902250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T00:39:58.443491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

적용일구분선납할인율(퍼센트)할부이자율(퍼센트)일반 지연 손해금율(퍼센트)30일 미만 지연손해금율(퍼센트)30일 이상 지연손해금율(퍼센트)기타
02007-03-01산단6.55.0<NA>8.711.2공란은 해당값 없음
12007-03-01택지6.55.0<NA>11.211.2공란은 해당값 없음
22013-12-02전체5.05.0<NA>8.512.0공란은 해당값 없음
32015-04-01LH 공동사업3.54.5<NA>8.511.5공란은 해당값 없음
42015-07-01자체사업3.54.5<NA>8.511.5공란은 해당값 없음
52015-11-01LH 공동사업3.04.0<NA>7.511.0공란은 해당값 없음
62016-01-01자체사업3.04.0<NA>7.511.0공란은 해당값 없음
72016-09-01LH 공동사업2.53.5<NA>7.010.5공란은 해당값 없음
82018-07-01전체2.53.56.5<NA><NA>공란은 해당값 없음
92019-11-18LH 공동사업2.52.96.5<NA><NA>공란은 해당값 없음
적용일구분선납할인율(퍼센트)할부이자율(퍼센트)일반 지연 손해금율(퍼센트)30일 미만 지연손해금율(퍼센트)30일 이상 지연손해금율(퍼센트)기타
112020-07-01LH 공동사업2.52.35.0<NA><NA>공란은 해당값 없음
122020-07-06자체사업2.52.55.0<NA><NA>공란은 해당값 없음
132021-01-01LH 공동사업2.52.36.5<NA><NA>공란은 해당값 없음
142021-01-01자체사업2.52.56.5<NA><NA>공란은 해당값 없음
152022-01-01LH 공동사업2.52.36.5<NA><NA>공란은 해당값 없음
162022-01-01자체사업2.52.56.5<NA><NA>공란은 해당값 없음
172023-01-01LH 공동사업5.03.58.5<NA><NA>공란은 해당값 없음
182023-01-01자체사업5.03.58.5<NA><NA>공란은 해당값 없음
192024-01-31LH 공동사업5.03.58.5<NA><NA>공란은 해당값 없음
202024-01-31자체사업5.04.08.0<NA><NA>공란은 해당값 없음