Overview

Dataset statistics

Number of variables4
Number of observations350
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.1 KiB
Average record size in memory32.4 B

Variable types

Text1
DateTime1
Categorical2

Dataset

Description제주특별자치도개발공사가 운영하는 매입임대주택 자동이체 정보로 신청가구, 신청일자, 신청결과 등 정보를 포함하고 있습니다.
Author제주특별자치도개발공사
URLhttps://www.data.go.kr/data/15112150/fileData.do

Alerts

등록결과 is highly imbalanced (95.0%)Imbalance

Reproduction

Analysis started2024-03-14 08:43:20.647371
Analysis finished2024-03-14 08:43:21.364907
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Text

Distinct313
Distinct (%)89.4%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2024-03-14T17:43:22.217408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length11.122857
Min length8

Characters and Unicode

Total characters3893
Distinct characters162
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique283 ?
Unique (%)80.9%

Sample

1st row정암주택 301호
2nd row정실뜨래별 102동 402호
3rd row효돈빌 305호
4th row이도네집 302호
5th row아뜨네오피스텔 304호
ValueCountFrequency (%)
302호 34
 
4.1%
201호 31
 
3.7%
202호 30
 
3.6%
b동 30
 
3.6%
301호 24
 
2.9%
203호 22
 
2.6%
402호 20
 
2.4%
303호 20
 
2.4%
정실뜨래별 20
 
2.4%
아뜨네오피스텔 20
 
2.4%
Other values (155) 585
70.0%
2024-03-14T17:43:23.664247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
486
 
12.5%
0 394
 
10.1%
351
 
9.0%
2 237
 
6.1%
3 191
 
4.9%
1 186
 
4.8%
126
 
3.2%
120
 
3.1%
4 100
 
2.6%
70
 
1.8%
Other values (152) 1632
41.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2099
53.9%
Decimal Number 1227
31.5%
Space Separator 486
 
12.5%
Uppercase Letter 69
 
1.8%
Lowercase Letter 12
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
351
 
16.7%
126
 
6.0%
120
 
5.7%
70
 
3.3%
53
 
2.5%
52
 
2.5%
49
 
2.3%
49
 
2.3%
45
 
2.1%
45
 
2.1%
Other values (133) 1139
54.3%
Decimal Number
ValueCountFrequency (%)
0 394
32.1%
2 237
19.3%
3 191
15.6%
1 186
15.2%
4 100
 
8.1%
5 68
 
5.5%
6 21
 
1.7%
8 14
 
1.1%
7 11
 
0.9%
9 5
 
0.4%
Uppercase Letter
ValueCountFrequency (%)
B 32
46.4%
A 17
24.6%
K 7
 
10.1%
S 7
 
10.1%
C 6
 
8.7%
Lowercase Letter
ValueCountFrequency (%)
l 6
50.0%
i 3
25.0%
v 3
25.0%
Space Separator
ValueCountFrequency (%)
486
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2099
53.9%
Common 1713
44.0%
Latin 81
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
351
 
16.7%
126
 
6.0%
120
 
5.7%
70
 
3.3%
53
 
2.5%
52
 
2.5%
49
 
2.3%
49
 
2.3%
45
 
2.1%
45
 
2.1%
Other values (133) 1139
54.3%
Common
ValueCountFrequency (%)
486
28.4%
0 394
23.0%
2 237
13.8%
3 191
 
11.2%
1 186
 
10.9%
4 100
 
5.8%
5 68
 
4.0%
6 21
 
1.2%
8 14
 
0.8%
7 11
 
0.6%
Latin
ValueCountFrequency (%)
B 32
39.5%
A 17
21.0%
K 7
 
8.6%
S 7
 
8.6%
C 6
 
7.4%
l 6
 
7.4%
i 3
 
3.7%
v 3
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2099
53.9%
ASCII 1794
46.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
486
27.1%
0 394
22.0%
2 237
13.2%
3 191
 
10.6%
1 186
 
10.4%
4 100
 
5.6%
5 68
 
3.8%
B 32
 
1.8%
6 21
 
1.2%
A 17
 
0.9%
Other values (9) 62
 
3.5%
Hangul
ValueCountFrequency (%)
351
 
16.7%
126
 
6.0%
120
 
5.7%
70
 
3.3%
53
 
2.5%
52
 
2.5%
49
 
2.3%
49
 
2.3%
45
 
2.1%
45
 
2.1%
Other values (133) 1139
54.3%
Distinct224
Distinct (%)64.0%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
Minimum2021-04-26 00:00:00
Maximum2023-12-26 00:00:00
2024-03-14T17:43:24.070426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T17:43:24.494755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

신청구분
Categorical

Distinct4
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
신규
213 
신청
76 
해지
60 
계좌변경
 
1

Length

Max length4
Median length2
Mean length2.0057143
Min length2

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row신규
2nd row신규
3rd row신규
4th row신규
5th row신규

Common Values

ValueCountFrequency (%)
신규 213
60.9%
신청 76
 
21.7%
해지 60
 
17.1%
계좌변경 1
 
0.3%

Length

2024-03-14T17:43:24.939114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T17:43:25.301906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신규 213
60.9%
신청 76
 
21.7%
해지 60
 
17.1%
계좌변경 1
 
0.3%

등록결과
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
정상
347 
납부자번호 오류
 
2
동일계좌이중신청 오류
 
1

Length

Max length11
Median length2
Mean length2.06
Min length2

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row정상
2nd row정상
3rd row정상
4th row정상
5th row정상

Common Values

ValueCountFrequency (%)
정상 347
99.1%
납부자번호 오류 2
 
0.6%
동일계좌이중신청 오류 1
 
0.3%

Length

2024-03-14T17:43:25.677282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T17:43:26.003170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상 347
98.3%
오류 3
 
0.8%
납부자번호 2
 
0.6%
동일계좌이중신청 1
 
0.3%

Correlations

2024-03-14T17:43:26.202740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청구분등록결과
신청구분1.0000.089
등록결과0.0891.000
2024-03-14T17:43:26.429770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청구분등록결과
신청구분1.0000.084
등록결과0.0841.000
2024-03-14T17:43:26.655599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
신청구분등록결과
신청구분1.0000.084
등록결과0.0841.000

Missing values

2024-03-14T17:43:20.968595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T17:43:21.253078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분신청일자신청구분등록결과
0정암주택 301호2021-04-26신규정상
1정실뜨래별 102동 402호2021-04-26신규정상
2효돈빌 305호2021-04-26신규정상
3이도네집 302호2021-04-27신규정상
4아뜨네오피스텔 304호2021-04-27신규정상
5아델리아빌 201호2021-04-27신규정상
6중문주택 A동 304호2021-04-27신규정상
7SK원 빌라 가동 501호2021-04-27신규정상
8이도스타빌리지 105동 202호2021-04-27신규정상
9진흥아트빌 305호2021-04-27신규정상
구분신청일자신청구분등록결과
340씨티빌C동 303호2023-11-23신청정상
341마음에온함덕 102동 404호2023-11-24해지정상
342마음에온함덕 102동 404호2023-11-24신청정상
343중문주택 A동 203호2023-11-30신청정상
344서운당주택 301호2023-12-04신청정상
345마음에온의귀리 303호2023-12-05신청정상
346희건주택 206호2023-12-07신청정상
347이디살젠 203호2023-12-26신청정상
348마음에온아라 411호2023-12-26신청정상
349이호다가구 303호2023-12-26신청정상