Overview

Dataset statistics

Number of variables10
Number of observations155
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.5 KiB
Average record size in memory82.9 B

Variable types

Categorical4
Text2
Boolean2
Numeric2

Dataset

Description현재 우체국에서 취급하는 보험상품에 관한 정보입니다. 해당데이터가 보유한 컬럼은 다음과 같습니다. 컬럼명 : 상품명, 개인단체구분, 금리연동구분, 선납할인여부, 보험기간, 온라인보험여부 사용자가 응답받게 되는 데이터는 다음과 같습니다. 상품명, 개인단체구분, 금리연동구분, 선납할인여부, 보험기간, 온라인보험여부
Author과학기술정보통신부 우정사업본부
URLhttps://www.data.go.kr/data/15083183/fileData.do

Alerts

보험기간최소 is highly overall correlated with 보험기간최대 and 1 other fieldsHigh correlation
보험기간최대 is highly overall correlated with 보험기간최소 and 1 other fieldsHigh correlation
상품분류 is highly overall correlated with 금리연동구분 and 1 other fieldsHigh correlation
금리연동구분 is highly overall correlated with 상품분류 and 1 other fieldsHigh correlation
선납할인여부 is highly overall correlated with 금리연동구분High correlation
단위코드 is highly overall correlated with 보험기간최소 and 2 other fieldsHigh correlation
상품분류 is highly imbalanced (69.5%)Imbalance
상품코드 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:34:02.938841
Analysis finished2023-12-12 20:34:04.218924
Duration1.28 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상품분류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
보장성보험
139 
저축성보험
 
8
연금보험
 
7
교육보험
 
1

Length

Max length5
Median length5
Mean length4.9483871
Min length4

Unique

Unique1 ?
Unique (%)0.6%

Sample

1st row교육보험
2nd row보장성보험
3rd row보장성보험
4th row보장성보험
5th row보장성보험

Common Values

ValueCountFrequency (%)
보장성보험 139
89.7%
저축성보험 8
 
5.2%
연금보험 7
 
4.5%
교육보험 1
 
0.6%

Length

2023-12-13T05:34:04.294530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:34:04.409967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보장성보험 139
89.7%
저축성보험 8
 
5.2%
연금보험 7
 
4.5%
교육보험 1
 
0.6%

상품코드
Text

UNIQUE 

Distinct155
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-13T05:34:04.794050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters775
Distinct characters17
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique155 ?
Unique (%)100.0%

Sample

1st row1003C
2nd row40074
3rd row40084
4th row40094
5th row40104
ValueCountFrequency (%)
1003c 1
 
0.6%
61220 1
 
0.6%
61342 1
 
0.6%
61170 1
 
0.6%
61174 1
 
0.6%
61182 1
 
0.6%
61192 1
 
0.6%
61201 1
 
0.6%
61211 1
 
0.6%
61264 1
 
0.6%
Other values (145) 145
93.5%
2023-12-13T05:34:05.394557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 167
21.5%
0 161
20.8%
6 100
12.9%
2 94
12.1%
4 69
8.9%
3 53
 
6.8%
5 44
 
5.7%
9 31
 
4.0%
7 23
 
3.0%
8 19
 
2.5%
Other values (7) 14
 
1.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 761
98.2%
Uppercase Letter 14
 
1.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 167
21.9%
0 161
21.2%
6 100
13.1%
2 94
12.4%
4 69
9.1%
3 53
 
7.0%
5 44
 
5.8%
9 31
 
4.1%
7 23
 
3.0%
8 19
 
2.5%
Uppercase Letter
ValueCountFrequency (%)
C 4
28.6%
A 3
21.4%
G 3
21.4%
D 1
 
7.1%
E 1
 
7.1%
H 1
 
7.1%
B 1
 
7.1%

Most occurring scripts

ValueCountFrequency (%)
Common 761
98.2%
Latin 14
 
1.8%

Most frequent character per script

Common
ValueCountFrequency (%)
1 167
21.9%
0 161
21.2%
6 100
13.1%
2 94
12.4%
4 69
9.1%
3 53
 
7.0%
5 44
 
5.8%
9 31
 
4.1%
7 23
 
3.0%
8 19
 
2.5%
Latin
ValueCountFrequency (%)
C 4
28.6%
A 3
21.4%
G 3
21.4%
D 1
 
7.1%
E 1
 
7.1%
H 1
 
7.1%
B 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 775
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 167
21.5%
0 161
20.8%
6 100
12.9%
2 94
12.1%
4 69
8.9%
3 53
 
6.8%
5 44
 
5.7%
9 31
 
4.0%
7 23
 
3.0%
8 19
 
2.5%
Other values (7) 14
 
1.8%
Distinct141
Distinct (%)91.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-13T05:34:05.672246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length9.2774194
Min length4

Characters and Unicode

Total characters1438
Distinct characters138
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique128 ?
Unique (%)82.6%

Sample

1st row(무)청소년꿈
2nd row(무)든든한종신1종
3rd row(무)든든한종신2종
4th row(무)실속정기1종순수
5th row(무)실속정기2종순수
ValueCountFrequency (%)
무)온라인희망 3
 
1.9%
무)노후실손종합 2
 
1.3%
무)간편실손상해 2
 
1.3%
무)와이드건강2종 2
 
1.3%
무)간편실손질병 2
 
1.3%
무)건강클리닉 2
 
1.3%
무)하나로ok 2
 
1.3%
무)노후실손상해 2
 
1.3%
무)치아보험 2
 
1.3%
무)간편실손종합 2
 
1.3%
Other values (131) 134
86.5%
2023-12-13T05:34:06.089666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 156
 
10.8%
) 156
 
10.8%
151
 
10.5%
75
 
5.2%
49
 
3.4%
42
 
2.9%
2 28
 
1.9%
25
 
1.7%
24
 
1.7%
24
 
1.7%
Other values (128) 708
49.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1033
71.8%
Open Punctuation 156
 
10.8%
Close Punctuation 156
 
10.8%
Decimal Number 62
 
4.3%
Uppercase Letter 22
 
1.5%
Lowercase Letter 6
 
0.4%
Other Punctuation 2
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
151
 
14.6%
75
 
7.3%
49
 
4.7%
42
 
4.1%
25
 
2.4%
24
 
2.3%
24
 
2.3%
22
 
2.1%
21
 
2.0%
19
 
1.8%
Other values (112) 581
56.2%
Decimal Number
ValueCountFrequency (%)
2 28
45.2%
1 24
38.7%
3 6
 
9.7%
0 2
 
3.2%
5 2
 
3.2%
Uppercase Letter
ValueCountFrequency (%)
I 9
40.9%
N 9
40.9%
K 2
 
9.1%
O 2
 
9.1%
Lowercase Letter
ValueCountFrequency (%)
w 2
33.3%
i 2
33.3%
n 2
33.3%
Open Punctuation
ValueCountFrequency (%)
( 156
100.0%
Close Punctuation
ValueCountFrequency (%)
) 156
100.0%
Other Punctuation
ValueCountFrequency (%)
% 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1033
71.8%
Common 377
 
26.2%
Latin 28
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
151
 
14.6%
75
 
7.3%
49
 
4.7%
42
 
4.1%
25
 
2.4%
24
 
2.3%
24
 
2.3%
22
 
2.1%
21
 
2.0%
19
 
1.8%
Other values (112) 581
56.2%
Common
ValueCountFrequency (%)
( 156
41.4%
) 156
41.4%
2 28
 
7.4%
1 24
 
6.4%
3 6
 
1.6%
% 2
 
0.5%
0 2
 
0.5%
5 2
 
0.5%
- 1
 
0.3%
Latin
ValueCountFrequency (%)
I 9
32.1%
N 9
32.1%
K 2
 
7.1%
O 2
 
7.1%
w 2
 
7.1%
i 2
 
7.1%
n 2
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1033
71.8%
ASCII 405
 
28.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 156
38.5%
) 156
38.5%
2 28
 
6.9%
1 24
 
5.9%
I 9
 
2.2%
N 9
 
2.2%
3 6
 
1.5%
K 2
 
0.5%
O 2
 
0.5%
w 2
 
0.5%
Other values (6) 11
 
2.7%
Hangul
ValueCountFrequency (%)
151
 
14.6%
75
 
7.3%
49
 
4.7%
42
 
4.1%
25
 
2.4%
24
 
2.3%
24
 
2.3%
22
 
2.1%
21
 
2.0%
19
 
1.8%
Other values (112) 581
56.2%
Distinct3
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
개인/단체
91 
개인
62 
단체
 
2

Length

Max length5
Median length5
Mean length3.7612903
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row개인/단체
3rd row개인/단체
4th row개인/단체
5th row개인/단체

Common Values

ValueCountFrequency (%)
개인/단체 91
58.7%
개인 62
40.0%
단체 2
 
1.3%

Length

2023-12-13T05:34:06.226253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:34:06.337629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인/단체 91
58.7%
개인 62
40.0%
단체 2
 
1.3%

금리연동구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
고정
115 
연동
40 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고정
2nd row연동
3rd row연동
4th row연동
5th row연동

Common Values

ValueCountFrequency (%)
고정 115
74.2%
연동 40
 
25.8%

Length

2023-12-13T05:34:06.455407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:34:06.568218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고정 115
74.2%
연동 40
 
25.8%

선납할인여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size287.0 B
True
104 
False
51 
ValueCountFrequency (%)
True 104
67.1%
False 51
32.9%
2023-12-13T05:34:06.691294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

보험기간최소
Real number (ℝ)

HIGH CORRELATION 

Distinct13
Distinct (%)8.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.283871
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-13T05:34:06.810576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median10
Q330
95-th percentile99
Maximum100
Range99
Interquartile range (IQR)29

Descriptive statistics

Standard deviation35.640477
Coefficient of variation (CV)1.3559828
Kurtosis-0.3377046
Mean26.283871
Median Absolute Deviation (MAD)9
Skewness1.202336
Sum4074
Variance1270.2436
MonotonicityNot monotonic
2023-12-13T05:34:06.991880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
1 49
31.6%
10 34
21.9%
5 11
 
7.1%
99 11
 
7.1%
80 11
 
7.1%
90 9
 
5.8%
20 7
 
4.5%
15 7
 
4.5%
3 7
 
4.5%
60 4
 
2.6%
Other values (3) 5
 
3.2%
ValueCountFrequency (%)
1 49
31.6%
3 7
 
4.5%
5 11
 
7.1%
10 34
21.9%
15 7
 
4.5%
20 7
 
4.5%
30 2
 
1.3%
60 4
 
2.6%
80 11
 
7.1%
85 1
 
0.6%
ValueCountFrequency (%)
100 2
 
1.3%
99 11
 
7.1%
90 9
 
5.8%
85 1
 
0.6%
80 11
 
7.1%
60 4
 
2.6%
30 2
 
1.3%
20 7
 
4.5%
15 7
 
4.5%
10 34
21.9%

보험기간최대
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.445161
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-13T05:34:07.141185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median10
Q330
95-th percentile100
Maximum100
Range99
Interquartile range (IQR)29

Descriptive statistics

Standard deviation38.836845
Coefficient of variation (CV)1.2350659
Kurtosis-0.72119264
Mean31.445161
Median Absolute Deviation (MAD)9
Skewness1.0307365
Sum4874
Variance1508.3005
MonotonicityNot monotonic
2023-12-13T05:34:07.304481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
1 47
30.3%
10 26
16.8%
30 20
12.9%
100 18
 
11.6%
5 11
 
7.1%
99 11
 
7.1%
90 6
 
3.9%
20 6
 
3.9%
15 6
 
3.9%
95 2
 
1.3%
Other values (2) 2
 
1.3%
ValueCountFrequency (%)
1 47
30.3%
3 1
 
0.6%
5 11
 
7.1%
10 26
16.8%
15 6
 
3.9%
20 6
 
3.9%
30 20
12.9%
80 1
 
0.6%
90 6
 
3.9%
95 2
 
1.3%
ValueCountFrequency (%)
100 18
11.6%
99 11
7.1%
95 2
 
1.3%
90 6
 
3.9%
80 1
 
0.6%
30 20
12.9%
20 6
 
3.9%
15 6
 
3.9%
10 26
16.8%
5 11
7.1%

단위코드
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
115 
29 
종신
 
11

Length

Max length2
Median length1
Mean length1.0709677
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row종신
3rd row종신
4th row
5th row

Common Values

ValueCountFrequency (%)
115
74.2%
29
 
18.7%
종신 11
 
7.1%

Length

2023-12-13T05:34:07.507442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:34:07.669295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
115
74.2%
29
 
18.7%
종신 11
 
7.1%
Distinct2
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size287.0 B
False
122 
True
33 
ValueCountFrequency (%)
False 122
78.7%
True 33
 
21.3%
2023-12-13T05:34:07.822373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-13T05:34:03.721804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:34:03.509489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:34:03.831255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:34:03.606746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:34:07.935098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상품분류개인단체구분금리연동구분선납할인여부보험기간최소보험기간최대단위코드온라인보험여부
상품분류1.0000.1130.7510.6670.5210.3300.5470.160
개인단체구분0.1131.0000.0780.0710.3630.5230.5540.000
금리연동구분0.7510.0781.0000.9630.4540.6480.2970.000
선납할인여부0.6670.0710.9631.0000.3650.5700.2230.000
보험기간최소0.5210.3630.4540.3651.0000.8330.9280.166
보험기간최대0.3300.5230.6480.5700.8331.0000.9410.523
단위코드0.5470.5540.2970.2230.9280.9411.0000.000
온라인보험여부0.1600.0000.0000.0000.1660.5230.0001.000
2023-12-13T05:34:08.131774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
개인단체구분금리연동구분선납할인여부상품분류온라인보험여부단위코드
개인단체구분1.0000.1280.1170.1060.0000.238
금리연동구분0.1281.0000.8250.5400.0000.478
선납할인여부0.1170.8251.0000.4650.0000.364
상품분류0.1060.5400.4651.0000.1050.550
온라인보험여부0.0000.0000.0000.1051.0000.000
단위코드0.2380.4780.3640.5500.0001.000
2023-12-13T05:34:08.282896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
보험기간최소보험기간최대상품분류개인단체구분금리연동구분선납할인여부단위코드온라인보험여부
보험기간최소1.0000.9530.3820.2580.4780.3840.9310.174
보험기간최대0.9531.0000.2160.2500.4670.4070.7000.373
상품분류0.3820.2161.0000.1060.5400.4650.5500.105
개인단체구분0.2580.2500.1061.0000.1280.1170.2380.000
금리연동구분0.4780.4670.5400.1281.0000.8250.4780.000
선납할인여부0.3840.4070.4650.1170.8251.0000.3640.000
단위코드0.9310.7000.5500.2380.4780.3641.0000.000
온라인보험여부0.1740.3730.1050.0000.0000.0000.0001.000

Missing values

2023-12-13T05:34:03.983697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:34:04.150473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상품분류상품코드상품명개인단체구분금리연동구분선납할인여부보험기간최소보험기간최대단위코드온라인보험여부
0교육보험1003C(무)청소년꿈개인고정N55N
1보장성보험40074(무)든든한종신1종개인/단체연동N9999종신N
2보장성보험40084(무)든든한종신2종개인/단체연동N9999종신N
3보장성보험40094(무)실속정기1종순수개인/단체연동N6090N
4보장성보험40104(무)실속정기2종순수개인/단체연동N6090N
5보장성보험40112(무)통합건강개인/단체연동N90100N
6보장성보험40121(무)요양보험개인/단체고정Y85100N
7보장성보험40131(무)온라인정기1종개인/단체고정Y2020Y
8보장성보험40141(무)온라인정기2종개인/단체고정Y2020Y
9보장성보험40151(무)당뇨안심개인/단체연동N80100N
상품분류상품코드상품명개인단체구분금리연동구분선납할인여부보험기간최소보험기간최대단위코드온라인보험여부
145연금보험21098(무)연금저축이전형개인연동N9999종신N
146연금보험21112(무)온라인연금저축개인연동N9999종신Y
147저축성보험3014A(무)그린(일반형)개인/단체연동N310N
148저축성보험3016B(무)그린(비과세)개인연동N310N
149저축성보험30177(무)파워적립1종개인연동N310N
150저축성보험30187(무)파워적립2종개인연동N1010N
151저축성보험30206(무)그린(일반형)IN개인/단체연동N310Y
152저축성보험30215(무)파워적립1종IN개인연동N310Y
153저축성보험30225(무)파워적립2종IN개인연동N1010Y
154저축성보험30232(무)온라인저축개인연동N110Y