Overview

Dataset statistics

Number of variables5
Number of observations2946
Missing cells2943
Missing cells (%)20.0%
Duplicate rows339
Duplicate rows (%)11.5%
Total size in memory115.2 KiB
Average record size in memory40.0 B

Variable types

DateTime3
Categorical2

Dataset

Descriptiono 합리적 의료이용지원 상담 현황 - 대상: 2022년도 합리적 의료이용지원 1차 상담 완료자 - 내용: 안내문 발송일자, 1차 상담일자, 1차 상담방법, 2차 상담일자, 2차 상담방법 o 1차 면담방법 - 유선(InBound), 유선(Outbound), 내방, 방문 o 2차 면담방법 - 안내문발송군, 상담강화군: 유선(InBound), 유선(Outbound), 내방, 방문 - 통합건강관리: 가정방문, 내방, 기타
URLhttps://www.data.go.kr/data/15120843/fileData.do

Alerts

Dataset has 339 (11.5%) duplicate rowsDuplicates
2차면담방법코드 is highly overall correlated with 1차면담방법코드High correlation
1차면담방법코드 is highly overall correlated with 2차면담방법코드High correlation
2차면담방법코드 is highly imbalanced (98.8%)Imbalance
2차면담일자 has 2943 (99.9%) missing valuesMissing

Reproduction

Analysis started2023-12-12 02:46:20.123012
Analysis finished2023-12-12 02:46:20.404120
Duration0.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size23.1 KiB
Minimum2022-03-04 00:00:00
Maximum2022-11-24 00:00:00
2023-12-12T11:46:20.444711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:46:20.538142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
Distinct134
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size23.1 KiB
Minimum2022-03-10 00:00:00
Maximum2022-12-19 00:00:00
2023-12-12T11:46:20.643743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:46:20.771832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

1차면담방법코드
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.1 KiB
유선(InBound)
1881 
유선(OutBound)
1037 
내방
 
28

Length

Max length12
Median length11
Mean length11.266463
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유선(InBound)
2nd row유선(OutBound)
3rd row유선(InBound)
4th row유선(OutBound)
5th row유선(InBound)

Common Values

ValueCountFrequency (%)
유선(InBound) 1881
63.8%
유선(OutBound) 1037
35.2%
내방 28
 
1.0%

Length

2023-12-12T11:46:20.871052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:46:20.952795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유선(inbound 1881
63.8%
유선(outbound 1037
35.2%
내방 28
 
1.0%

2차면담일자
Date

MISSING 

Distinct2
Distinct (%)66.7%
Missing2943
Missing (%)99.9%
Memory size23.1 KiB
Minimum2022-07-19 00:00:00
Maximum2022-09-05 00:00:00
2023-12-12T11:46:21.023577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:46:21.103798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=2)

2차면담방법코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.1 KiB
<NA>
2943 
유선(OutBound)
 
3

Length

Max length12
Median length4
Mean length4.0081466
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 2943
99.9%
유선(OutBound) 3
 
0.1%

Length

2023-12-12T11:46:21.274270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:46:21.388655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2943
99.9%
유선(outbound 3
 
0.1%

Correlations

2023-12-12T11:46:21.442747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
의료이용안내문발송일자1차면담방법코드2차면담일자
의료이용안내문발송일자1.0000.903NaN
1차면담방법코드0.9031.0000.000
2차면담일자NaN0.0001.000
2023-12-12T11:46:21.532451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2차면담방법코드1차면담방법코드
2차면담방법코드1.0001.000
1차면담방법코드1.0001.000
2023-12-12T11:46:21.613074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
1차면담방법코드2차면담방법코드
1차면담방법코드1.0001.000
2차면담방법코드1.0001.000

Missing values

2023-12-12T11:46:20.278500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:46:20.367167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

의료이용안내문발송일자1차면담일자1차면담방법코드2차면담일자2차면담방법코드
02022-04-192022-05-24유선(InBound)<NA><NA>
12022-04-252022-06-09유선(OutBound)<NA><NA>
22022-04-192022-05-02유선(InBound)<NA><NA>
32022-04-252022-05-19유선(OutBound)<NA><NA>
42022-04-192022-05-24유선(InBound)<NA><NA>
52022-04-192022-06-21유선(InBound)<NA><NA>
62022-04-252022-05-16유선(OutBound)<NA><NA>
72022-04-192022-05-04유선(InBound)<NA><NA>
82022-04-192022-05-02유선(InBound)<NA><NA>
92022-04-192022-05-27유선(InBound)<NA><NA>
의료이용안내문발송일자1차면담일자1차면담방법코드2차면담일자2차면담방법코드
29362022-06-202022-09-27유선(InBound)<NA><NA>
29372022-11-212022-12-01유선(InBound)<NA><NA>
29382022-11-212022-11-25유선(InBound)<NA><NA>
29392022-11-212022-12-02유선(InBound)<NA><NA>
29402022-08-182022-09-02유선(InBound)<NA><NA>
29412022-06-202022-08-02유선(InBound)<NA><NA>
29422022-08-242022-09-05유선(InBound)<NA><NA>
29432022-06-282022-07-05유선(InBound)2022-07-19유선(OutBound)
29442022-06-282022-08-09유선(OutBound)2022-09-05유선(OutBound)
29452022-06-282022-08-08유선(OutBound)2022-09-05유선(OutBound)

Duplicate rows

Most frequently occurring

의료이용안내문발송일자1차면담일자1차면담방법코드2차면담일자2차면담방법코드# duplicates
2642022-08-182022-09-01유선(InBound)<NA><NA>55
112022-04-192022-05-02유선(InBound)<NA><NA>53
92022-04-192022-04-29유선(InBound)<NA><NA>49
3202022-11-212022-12-01유선(InBound)<NA><NA>46
122022-04-192022-05-03유선(InBound)<NA><NA>44
2632022-08-182022-08-31유선(InBound)<NA><NA>39
3192022-11-212022-11-30유선(InBound)<NA><NA>36
1592022-06-202022-07-05유선(InBound)<NA><NA>32
1602022-06-202022-07-06유선(InBound)<NA><NA>31
1622022-06-202022-07-08유선(InBound)<NA><NA>31