Overview

Dataset statistics

Number of variables9
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.6 KiB
Average record size in memory77.3 B

Variable types

Numeric2
Categorical4
DateTime1
Text2

Dataset

Description국립암센터에서 19년도 9월까지 암환자의료비지원정보시스템의 운영 관리 관련 공지사항 정보. 날짜 제목 담당자 처리 날짜를 확인할 수 있습니다.
Author국립암센터
URLhttps://www.data.go.kr/data/15049642/fileData.do

Alerts

사용자 번호 has constant value ""Constant
처리번호 is highly overall correlated with 기기 번호High correlation
조회건수 is highly overall correlated with 담당자High correlation
담당자 is highly overall correlated with 조회건수High correlation
기기 번호 is highly overall correlated with 처리번호High correlation
담당자 is highly imbalanced (91.9%)Imbalance
처리번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 15:51:50.699362
Analysis finished2023-12-12 15:51:52.754956
Duration2.06 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

처리번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean699.5
Minimum650
Maximum749
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-13T00:51:52.844875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum650
5-th percentile654.95
Q1674.75
median699.5
Q3724.25
95-th percentile744.05
Maximum749
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.041474613
Kurtosis-1.2
Mean699.5
Median Absolute Deviation (MAD)25
Skewness0
Sum69950
Variance841.66667
MonotonicityNot monotonic
2023-12-13T00:51:53.010247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
695 1
 
1.0%
734 1
 
1.0%
665 1
 
1.0%
747 1
 
1.0%
733 1
 
1.0%
708 1
 
1.0%
687 1
 
1.0%
691 1
 
1.0%
676 1
 
1.0%
674 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
650 1
1.0%
651 1
1.0%
652 1
1.0%
653 1
1.0%
654 1
1.0%
655 1
1.0%
656 1
1.0%
657 1
1.0%
658 1
1.0%
659 1
1.0%
ValueCountFrequency (%)
749 1
1.0%
748 1
1.0%
747 1
1.0%
746 1
1.0%
745 1
1.0%
744 1
1.0%
743 1
1.0%
742 1
1.0%
741 1
1.0%
740 1
1.0%

담당자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
박서진
99 
양형국
 
1

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row박서진
2nd row박서진
3rd row박서진
4th row박서진
5th row박서진

Common Values

ValueCountFrequency (%)
박서진 99
99.0%
양형국 1
 
1.0%

Length

2023-12-13T00:51:53.171618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:51:53.275045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
박서진 99
99.0%
양형국 1
 
1.0%

날짜
Date

Distinct83
Distinct (%)83.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2019-01-03 00:00:00
Maximum2019-09-25 00:00:00
2023-12-13T00:51:53.398936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:51:53.564939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

제목
Text

Distinct76
Distinct (%)76.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-13T00:51:53.970931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length41
Mean length27.4
Min length11

Characters and Unicode

Total characters2740
Distinct characters182
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)70.0%

Sample

1st row[시스템] 4월 18일 오후, 19일 시스템 관련 전화상담 불가 안내
2nd row지급등록 건 수정 요청
3rd row지급등록 건 수정 요청
4th row[시스템] 복사된 2019년 지원신청서에서 [등록한 영수증 선택하여 입력]으로 입력한 2017년분 영수증이 있는 경우
5th row영수증 날짜 확인 및 수정 요청
ValueCountFrequency (%)
수정 52
 
7.6%
시스템 49
 
7.1%
안내 36
 
5.2%
지급등록 35
 
5.1%
35
 
5.1%
요청 31
 
4.5%
불가 28
 
4.1%
관련 20
 
2.9%
14
 
2.0%
전화상담 13
 
1.9%
Other values (224) 373
54.4%
2023-12-13T00:51:54.569474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
586
 
21.4%
94
 
3.4%
72
 
2.6%
66
 
2.4%
64
 
2.3%
64
 
2.3%
63
 
2.3%
1 62
 
2.3%
56
 
2.0%
56
 
2.0%
Other values (172) 1557
56.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1707
62.3%
Space Separator 586
 
21.4%
Decimal Number 219
 
8.0%
Close Punctuation 91
 
3.3%
Open Punctuation 91
 
3.3%
Other Punctuation 26
 
0.9%
Dash Punctuation 12
 
0.4%
Other Symbol 4
 
0.1%
Math Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
94
 
5.5%
72
 
4.2%
66
 
3.9%
64
 
3.7%
64
 
3.7%
63
 
3.7%
56
 
3.3%
56
 
3.3%
52
 
3.0%
50
 
2.9%
Other values (151) 1070
62.7%
Decimal Number
ValueCountFrequency (%)
1 62
28.3%
2 39
17.8%
5 21
 
9.6%
0 18
 
8.2%
6 14
 
6.4%
3 14
 
6.4%
4 13
 
5.9%
9 13
 
5.9%
7 13
 
5.9%
8 12
 
5.5%
Other Punctuation
ValueCountFrequency (%)
, 11
42.3%
. 8
30.8%
/ 7
26.9%
Close Punctuation
ValueCountFrequency (%)
) 48
52.7%
] 43
47.3%
Open Punctuation
ValueCountFrequency (%)
( 48
52.7%
[ 43
47.3%
Space Separator
ValueCountFrequency (%)
586
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1707
62.3%
Common 1033
37.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
94
 
5.5%
72
 
4.2%
66
 
3.9%
64
 
3.7%
64
 
3.7%
63
 
3.7%
56
 
3.3%
56
 
3.3%
52
 
3.0%
50
 
2.9%
Other values (151) 1070
62.7%
Common
ValueCountFrequency (%)
586
56.7%
1 62
 
6.0%
) 48
 
4.6%
( 48
 
4.6%
[ 43
 
4.2%
] 43
 
4.2%
2 39
 
3.8%
5 21
 
2.0%
0 18
 
1.7%
6 14
 
1.4%
Other values (11) 111
 
10.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1707
62.3%
ASCII 1029
37.6%
Misc Symbols 4
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
586
56.9%
1 62
 
6.0%
) 48
 
4.7%
( 48
 
4.7%
[ 43
 
4.2%
] 43
 
4.2%
2 39
 
3.8%
5 21
 
2.0%
0 18
 
1.7%
6 14
 
1.4%
Other values (10) 107
 
10.4%
Hangul
ValueCountFrequency (%)
94
 
5.5%
72
 
4.2%
66
 
3.9%
64
 
3.7%
64
 
3.7%
63
 
3.7%
56
 
3.3%
56
 
3.3%
52
 
3.0%
50
 
2.9%
Other values (151) 1070
62.7%
Misc Symbols
ValueCountFrequency (%)
4
100.0%

조회건수
Real number (ℝ)

HIGH CORRELATION 

Distinct79
Distinct (%)79.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean199.76
Minimum2
Maximum1929
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-13T00:51:54.766726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile13
Q151
median81.5
Q3156.25
95-th percentile868.15
Maximum1929
Range1927
Interquartile range (IQR)105.25

Descriptive statistics

Standard deviation352.09733
Coefficient of variation (CV)1.7626018
Kurtosis11.914087
Mean199.76
Median Absolute Deviation (MAD)47
Skewness3.4061389
Sum19976
Variance123972.53
MonotonicityNot monotonic
2023-12-13T00:51:54.974730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
27 3
 
3.0%
73 3
 
3.0%
2 3
 
3.0%
63 3
 
3.0%
55 3
 
3.0%
74 2
 
2.0%
13 2
 
2.0%
72 2
 
2.0%
40 2
 
2.0%
160 2
 
2.0%
Other values (69) 75
75.0%
ValueCountFrequency (%)
2 3
3.0%
4 1
 
1.0%
13 2
2.0%
14 1
 
1.0%
23 1
 
1.0%
25 1
 
1.0%
26 2
2.0%
27 3
3.0%
30 1
 
1.0%
32 2
2.0%
ValueCountFrequency (%)
1929 1
1.0%
1719 1
1.0%
1582 1
1.0%
1492 1
1.0%
966 1
1.0%
863 1
1.0%
818 1
1.0%
604 1
1.0%
597 1
1.0%
486 1
1.0%

기기 번호
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2
59 
1
41 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row1
4th row2
5th row1

Common Values

ValueCountFrequency (%)
2 59
59.0%
1 41
41.0%

Length

2023-12-13T00:51:55.158764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:51:55.278427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 59
59.0%
1 41
41.0%

사용자 번호
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
3908
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3908
2nd row3908
3rd row3908
4th row3908
5th row3908

Common Values

ValueCountFrequency (%)
3908 100
100.0%

Length

2023-12-13T00:51:55.409827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:51:55.517065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3908 100
100.0%
Distinct58
Distinct (%)58.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-13T00:51:55.756703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length8.3
Min length5

Characters and Unicode

Total characters830
Distinct characters16
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)48.0%

Sample

1st row2019-04-16
2nd row해당 없음
3rd row2019-03-04
4th row해당 없음
5th row2019-03-06
ValueCountFrequency (%)
해당 34
25.4%
없음 34
25.4%
2019-09-16 2
 
1.5%
2019-03-19 2
 
1.5%
2019-02-01 2
 
1.5%
2019-02-12 2
 
1.5%
2019-04-15 2
 
1.5%
2019-04-25 2
 
1.5%
2019-03-06 2
 
1.5%
2019-03-13 2
 
1.5%
Other values (49) 50
37.3%
2023-12-13T00:51:56.180084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 159
19.2%
- 132
15.9%
1 100
12.0%
2 99
11.9%
9 78
9.4%
34
 
4.1%
34
 
4.1%
34
 
4.1%
34
 
4.1%
34
 
4.1%
Other values (6) 92
11.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 528
63.6%
Other Letter 136
 
16.4%
Dash Punctuation 132
 
15.9%
Space Separator 34
 
4.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 159
30.1%
1 100
18.9%
2 99
18.8%
9 78
14.8%
4 26
 
4.9%
3 23
 
4.4%
5 14
 
2.7%
8 11
 
2.1%
6 9
 
1.7%
7 9
 
1.7%
Other Letter
ValueCountFrequency (%)
34
25.0%
34
25.0%
34
25.0%
34
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 132
100.0%
Space Separator
ValueCountFrequency (%)
34
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 694
83.6%
Hangul 136
 
16.4%

Most frequent character per script

Common
ValueCountFrequency (%)
0 159
22.9%
- 132
19.0%
1 100
14.4%
2 99
14.3%
9 78
11.2%
34
 
4.9%
4 26
 
3.7%
3 23
 
3.3%
5 14
 
2.0%
8 11
 
1.6%
Other values (2) 18
 
2.6%
Hangul
ValueCountFrequency (%)
34
25.0%
34
25.0%
34
25.0%
34
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 694
83.6%
Hangul 136
 
16.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 159
22.9%
- 132
19.0%
1 100
14.4%
2 99
14.3%
9 78
11.2%
34
 
4.9%
4 26
 
3.7%
3 23
 
3.3%
5 14
 
2.0%
8 11
 
1.6%
Other values (2) 18
 
2.6%
Hangul
ValueCountFrequency (%)
34
25.0%
34
25.0%
34
25.0%
34
25.0%

처리날짜
Categorical

Distinct46
Distinct (%)46.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
해당 없음
34 
2019-03-15
2019-09-20
 
4
2019-04-30
 
4
2019-01-31
 
2
Other values (41)
50 

Length

Max length10
Median length10
Mean length8.3
Min length5

Unique

Unique32 ?
Unique (%)32.0%

Sample

1st row2019-04-19
2nd row해당 없음
3rd row2019-03-08
4th row해당 없음
5th row2019-03-14

Common Values

ValueCountFrequency (%)
해당 없음 34
34.0%
2019-03-15 6
 
6.0%
2019-09-20 4
 
4.0%
2019-04-30 4
 
4.0%
2019-01-31 2
 
2.0%
2019-03-08 2
 
2.0%
2019-03-14 2
 
2.0%
2019-04-05 2
 
2.0%
2019-02-15 2
 
2.0%
2019-03-22 2
 
2.0%
Other values (36) 40
40.0%

Length

2023-12-13T00:51:56.346141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
해당 34
25.4%
없음 34
25.4%
2019-03-15 6
 
4.5%
2019-04-30 4
 
3.0%
2019-09-20 4
 
3.0%
2019-02-15 2
 
1.5%
2019-04-10 2
 
1.5%
2019-04-12 2
 
1.5%
2019-04-01 2
 
1.5%
2019-02-20 2
 
1.5%
Other values (37) 42
31.3%

Interactions

2023-12-13T00:51:52.220727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:51:52.021303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:51:52.332675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:51:52.110041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:51:56.461228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리번호담당자날짜제목조회건수기기 번호접수날짜처리날짜
처리번호1.0000.1440.9950.6490.3910.8280.8410.885
담당자0.1441.0001.0001.0000.6630.0000.0000.000
날짜0.9951.0001.0000.9700.7060.9560.9910.968
제목0.6491.0000.9701.0000.9950.5710.0000.000
조회건수0.3910.6630.7060.9951.0000.1340.0000.000
기기 번호0.8280.0000.9560.5710.1341.0000.5290.697
접수날짜0.8410.0000.9910.0000.0000.5291.0000.999
처리날짜0.8850.0000.9680.0000.0000.6970.9991.000
2023-12-13T00:51:56.609005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리날짜기기 번호담당자
처리날짜1.0000.4170.000
기기 번호0.4171.0000.000
담당자0.0000.0001.000
2023-12-13T00:51:56.712570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리번호조회건수담당자기기 번호처리날짜
처리번호1.000-0.2510.0000.6230.433
조회건수-0.2511.0000.6470.1260.000
담당자0.0000.6471.0000.0000.000
기기 번호0.6230.1260.0001.0000.417
처리날짜0.4330.0000.0000.4171.000

Missing values

2023-12-13T00:51:52.505584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:51:52.697305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

처리번호담당자날짜제목조회건수기기 번호사용자 번호접수날짜처리날짜
0695박서진2019-04-16[시스템] 4월 18일 오후, 19일 시스템 관련 전화상담 불가 안내96239082019-04-162019-04-19
1709박서진2019-05-17지급등록 건 수정 요청7323908해당 없음해당 없음
2666박서진2019-03-04지급등록 건 수정 요청208139082019-03-042019-03-08
3652박서진2019-01-08[시스템] 복사된 2019년 지원신청서에서 [등록한 영수증 선택하여 입력]으로 입력한 2017년분 영수증이 있는 경우86323908해당 없음해당 없음
4667박서진2019-03-06영수증 날짜 확인 및 수정 요청346139082019-03-062019-03-14
5719박서진2019-06-18[시스템] 6월 18일(화) 15시-17시 전화상담 불가 안내4823908해당 없음해당 없음
6738박서진2019-08-08지급등록 건 수정 요청(경북 문경시)6523908해당 없음해당 없음
7660박서진2019-02-18[시스템]금일(2/18) 10시 30분~13시 시스템 전화상담 불가 안내1313908해당 없음해당 없음
8720박서진2019-06-20[시스템] 6월 21일(금) 시스템 관련 전화 상담 불가 안내7223908해당 없음해당 없음
9728박서진2019-07-15[시스템] 인증서 로그인 기능 복구19213908해당 없음해당 없음
처리번호담당자날짜제목조회건수기기 번호사용자 번호접수날짜처리날짜
90726박서진2019-07-08영수증 날짜 및 지급일자 수정 요청155239082019-07-082019-07-17
91662박서진2019-02-212019년 예산으로 지급했지만 2018년 지원신청서에 지급등록한 건에 대한 수정 요청(목포시)305139082019-02-212019-02-28
92683박서진2019-04-01지급등록 건 수정 요청(인천 서구) 4월 5일까지104239082019-04-012019-04-05
93684박서진2019-04-02[시스템] 4월 2일(화) 16시~17시 시스템 관련 전화 상담 불가 안내3213908해당 없음해당 없음
94651양형국2019-01-072018년 미등록자 및 등록 신청 서류 접수 후 전산 미입력자 관련 안내158213908해당 없음해당 없음
95673박서진2019-03-13지급등록 건 수정 요청63139082019-03-132019-03-15
96692박서진2019-04-10지급등록건 수정 요청(서울 송파구) 4월 12일까지2323908해당 없음해당 없음
97681박서진2019-03-27지급등록 건 수정 요청84239082019-03-272019-04-01
98700박서진2019-04-25[안내] 사업안내서 문구 추가 및 수정 안내 (재안내)818239082019-04-252019-04-30
99711박서진2019-05-29[시스템] 5월 30일, 31일 오후 시스템 관련 전화 상담 불가 안내63239082019-05-292019-05-31