Overview

Dataset statistics

Number of variables11
Number of observations45
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.1 KiB
Average record size in memory92.9 B

Variable types

Numeric2
Categorical6
Text3

Dataset

Description췌장암 라이브러리 췌장암_환자_건강정보 메타정보( 제공 되어질 데이터 항목, 타입, 사이즈, 항목설명, 항목별건수, 표시형식 등)를 제공
Author국립암센터
URLhttps://www.data.go.kr/data/15091767/fileData.do

Alerts

분류아이디 has constant value ""Constant
분류명 has constant value ""Constant
테이블아이디 has constant value ""Constant
테이블명 has constant value ""Constant
데이터타입 is highly overall correlated with 표시형식High correlation
표시형식 is highly overall correlated with 데이터타입High correlation
순번 has unique valuesUnique
컬럼아이디 has unique valuesUnique
컬럼명 has unique valuesUnique
컬럼설명 has unique valuesUnique
컬럼데이터수 has 1 (2.2%) zerosZeros

Reproduction

Analysis started2023-12-12 00:49:11.122417
Analysis finished2023-12-12 00:49:12.307187
Duration1.18 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23
Minimum1
Maximum45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-12T09:49:12.380967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.2
Q112
median23
Q334
95-th percentile42.8
Maximum45
Range44
Interquartile range (IQR)22

Descriptive statistics

Standard deviation13.133926
Coefficient of variation (CV)0.57104024
Kurtosis-1.2
Mean23
Median Absolute Deviation (MAD)11
Skewness0
Sum1035
Variance172.5
MonotonicityStrictly increasing
2023-12-12T09:49:12.551747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
1 1
 
2.2%
35 1
 
2.2%
26 1
 
2.2%
27 1
 
2.2%
28 1
 
2.2%
29 1
 
2.2%
30 1
 
2.2%
31 1
 
2.2%
32 1
 
2.2%
33 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
1 1
2.2%
2 1
2.2%
3 1
2.2%
4 1
2.2%
5 1
2.2%
6 1
2.2%
7 1
2.2%
8 1
2.2%
9 1
2.2%
10 1
2.2%
ValueCountFrequency (%)
45 1
2.2%
44 1
2.2%
43 1
2.2%
42 1
2.2%
41 1
2.2%
40 1
2.2%
39 1
2.2%
38 1
2.2%
37 1
2.2%
36 1
2.2%

분류아이디
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
PT
45 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPT
2nd rowPT
3rd rowPT
4th rowPT
5th rowPT

Common Values

ValueCountFrequency (%)
PT 45
100.0%

Length

2023-12-12T09:49:12.715702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:49:12.842177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pt 45
100.0%

분류명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
환자
45 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row환자
2nd row환자
3rd row환자
4th row환자
5th row환자

Common Values

ValueCountFrequency (%)
환자 45
100.0%

Length

2023-12-12T09:49:12.964004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:49:13.098480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
환자 45
100.0%

테이블아이디
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
PNCT_PT_HLNF
45 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPNCT_PT_HLNF
2nd rowPNCT_PT_HLNF
3rd rowPNCT_PT_HLNF
4th rowPNCT_PT_HLNF
5th rowPNCT_PT_HLNF

Common Values

ValueCountFrequency (%)
PNCT_PT_HLNF 45
100.0%

Length

2023-12-12T09:49:13.220186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:49:13.367578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pnct_pt_hlnf 45
100.0%

테이블명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
췌장암_환자_건강정보
45 

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row췌장암_환자_건강정보
2nd row췌장암_환자_건강정보
3rd row췌장암_환자_건강정보
4th row췌장암_환자_건강정보
5th row췌장암_환자_건강정보

Common Values

ValueCountFrequency (%)
췌장암_환자_건강정보 45
100.0%

Length

2023-12-12T09:49:13.505065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:49:13.695986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
췌장암_환자_건강정보 45
100.0%

컬럼아이디
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-12T09:49:13.950528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length23
Mean length18.6
Min length7

Characters and Unicode

Total characters837
Distinct characters24
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st rowCENTER_CD
2nd rowIRB_APRV_NO
3rd rowPT_SBST_NO
4th rowHLPT_RCRD_YMD
5th rowHLPT_HLNF_SEQ
ValueCountFrequency (%)
center_cd 1
 
2.2%
hlpt_smok_strt_age 1
 
2.2%
hlpt_smok_dtrn_ycnt 1
 
2.2%
hlpt_nsmk_perd_ycnt 1
 
2.2%
hlpt_mhis_yn_clsf_cd 1
 
2.2%
hlpt_mhis_htn_yn_clsf_cd 1
 
2.2%
hlpt_mhis_dbt_yn_clsf_cd 1
 
2.2%
hlpt_mhis_tb_yn_clsf_cd 1
 
2.2%
hlpt_mhis_lvds_yn_clsf_cd 1
 
2.2%
hlpt_mhis_cncr_yn_clsf_cd 1
 
2.2%
Other values (35) 35
77.8%
2023-12-12T09:49:14.438630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 155
18.5%
T 77
9.2%
C 61
 
7.3%
H 61
 
7.3%
N 58
 
6.9%
D 58
 
6.9%
L 58
 
6.9%
S 48
 
5.7%
P 45
 
5.4%
M 31
 
3.7%
Other values (14) 185
22.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 682
81.5%
Connector Punctuation 155
 
18.5%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
T 77
11.3%
C 61
 
8.9%
H 61
 
8.9%
N 58
 
8.5%
D 58
 
8.5%
L 58
 
8.5%
S 48
 
7.0%
P 45
 
6.6%
M 31
 
4.5%
R 26
 
3.8%
Other values (13) 159
23.3%
Connector Punctuation
ValueCountFrequency (%)
_ 155
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 682
81.5%
Common 155
 
18.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
T 77
11.3%
C 61
 
8.9%
H 61
 
8.9%
N 58
 
8.5%
D 58
 
8.5%
L 58
 
8.5%
S 48
 
7.0%
P 45
 
6.6%
M 31
 
4.5%
R 26
 
3.8%
Other values (13) 159
23.3%
Common
ValueCountFrequency (%)
_ 155
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 837
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 155
18.5%
T 77
9.2%
C 61
 
7.3%
H 61
 
7.3%
N 58
 
6.9%
D 58
 
6.9%
L 58
 
6.9%
S 48
 
5.7%
P 45
 
5.4%
M 31
 
3.7%
Other values (14) 185
22.1%

컬럼명
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-12T09:49:14.737269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length10.977778
Min length4

Characters and Unicode

Total characters494
Distinct characters80
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row센터코드
2nd rowIRB승인번호
3rd row환자대체번호
4th row건강환자기록일자
5th row건강환자건강정보순번
ValueCountFrequency (%)
센터코드 1
 
2.2%
건강환자흡연시작연령 1
 
2.2%
건강환자흡연기간년수 1
 
2.2%
건강환자금연시작시기년수 1
 
2.2%
건강환자병력여부구분코드 1
 
2.2%
건강환자병력고혈압여부구분코드 1
 
2.2%
건강환자병력당뇨여부구분코드 1
 
2.2%
건강환자병력결핵여부구분코드 1
 
2.2%
건강환자병력간질환여부구분코드 1
 
2.2%
건강환자병력암여부구분코드 1
 
2.2%
Other values (35) 35
77.8%
2023-12-12T09:49:15.175624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45
 
9.1%
44
 
8.9%
42
 
8.5%
42
 
8.5%
22
 
4.5%
22
 
4.5%
15
 
3.0%
15
 
3.0%
15
 
3.0%
15
 
3.0%
Other values (70) 217
43.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 491
99.4%
Uppercase Letter 3
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
45
 
9.2%
44
 
9.0%
42
 
8.6%
42
 
8.6%
22
 
4.5%
22
 
4.5%
15
 
3.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
Other values (67) 214
43.6%
Uppercase Letter
ValueCountFrequency (%)
I 1
33.3%
R 1
33.3%
B 1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 491
99.4%
Latin 3
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
45
 
9.2%
44
 
9.0%
42
 
8.6%
42
 
8.6%
22
 
4.5%
22
 
4.5%
15
 
3.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
Other values (67) 214
43.6%
Latin
ValueCountFrequency (%)
I 1
33.3%
R 1
33.3%
B 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 491
99.4%
ASCII 3
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
45
 
9.2%
44
 
9.0%
42
 
8.6%
42
 
8.6%
22
 
4.5%
22
 
4.5%
15
 
3.1%
15
 
3.1%
15
 
3.1%
15
 
3.1%
Other values (67) 214
43.6%
ASCII
ValueCountFrequency (%)
I 1
33.3%
R 1
33.3%
B 1
33.3%

데이터타입
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)22.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
VARCHAR(20)
19 
VARCHAR(100)
VARCHAR(8000)
VARCHAR(8)
VARCHAR(200)
Other values (5)

Length

Max length13
Median length12
Mean length11.4
Min length8

Unique

Unique5 ?
Unique (%)11.1%

Sample

1st rowVARCHAR(20)
2nd rowVARCHAR(50)
3rd rowVARCHAR(10)
4th rowVARCHAR(8)
5th rowNUMBER(3)

Common Values

ValueCountFrequency (%)
VARCHAR(20) 19
42.2%
VARCHAR(100) 9
20.0%
VARCHAR(8000) 8
17.8%
VARCHAR(8) 2
 
4.4%
VARCHAR(200) 2
 
4.4%
VARCHAR(50) 1
 
2.2%
VARCHAR(10) 1
 
2.2%
NUMBER(3) 1
 
2.2%
NUMBER(4) 1
 
2.2%
DATETIME 1
 
2.2%

Length

2023-12-12T09:49:15.321790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:49:15.458384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
varchar(20 19
42.2%
varchar(100 9
20.0%
varchar(8000 8
17.8%
varchar(8 2
 
4.4%
varchar(200 2
 
4.4%
varchar(50 1
 
2.2%
varchar(10 1
 
2.2%
number(3 1
 
2.2%
number(4 1
 
2.2%
datetime 1
 
2.2%

컬럼설명
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-12T09:49:15.782166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length76
Median length37
Mean length31.288889
Min length12

Characters and Unicode

Total characters1408
Distinct characters163
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row센터코드 (5자리 : XXXXX) / 00030 : 국립암센터 예) 00030
2nd row센터별 기준에 따라 생성
3rd row개인고유번호(10자리) / 센터별 별도부여 예) RN12345678
4th row기타건강정보 기록일자 / YYYYMMDD
5th row건강환자기록일자별 순번
ValueCountFrequency (%)
86
20.3%
환자의 35
 
8.3%
무응답 18
 
4.2%
y 15
 
3.5%
n 15
 
3.5%
m 15
 
3.5%
14
 
3.3%
12
 
2.8%
12
 
2.8%
기타 9
 
2.1%
Other values (130) 193
45.5%
2023-12-12T09:49:16.286252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
380
27.0%
: 56
 
4.0%
44
 
3.1%
/ 42
 
3.0%
41
 
2.9%
35
 
2.5%
, 30
 
2.1%
29
 
2.1%
28
 
2.0%
e 26
 
1.8%
Other values (153) 697
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 640
45.5%
Space Separator 380
27.0%
Other Punctuation 129
 
9.2%
Decimal Number 83
 
5.9%
Lowercase Letter 80
 
5.7%
Uppercase Letter 75
 
5.3%
Close Punctuation 17
 
1.2%
Open Punctuation 3
 
0.2%
Connector Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
6.9%
41
 
6.4%
35
 
5.5%
29
 
4.5%
28
 
4.4%
22
 
3.4%
18
 
2.8%
18
 
2.8%
18
 
2.8%
16
 
2.5%
Other values (110) 371
58.0%
Lowercase Letter
ValueCountFrequency (%)
e 26
32.5%
t 19
23.8%
r 10
 
12.5%
f 8
 
10.0%
x 8
 
10.0%
m 2
 
2.5%
c 1
 
1.2%
u 1
 
1.2%
n 1
 
1.2%
i 1
 
1.2%
Other values (3) 3
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
Y 23
30.7%
M 19
25.3%
N 16
21.3%
D 5
 
6.7%
X 5
 
6.7%
E 1
 
1.3%
F 1
 
1.3%
A 1
 
1.3%
U 1
 
1.3%
L 1
 
1.3%
Other values (2) 2
 
2.7%
Decimal Number
ValueCountFrequency (%)
0 25
30.1%
9 13
15.7%
5 9
 
10.8%
1 8
 
9.6%
2 7
 
8.4%
3 6
 
7.2%
8 5
 
6.0%
4 4
 
4.8%
7 3
 
3.6%
6 3
 
3.6%
Other Punctuation
ValueCountFrequency (%)
: 56
43.4%
/ 42
32.6%
, 30
23.3%
. 1
 
0.8%
Space Separator
ValueCountFrequency (%)
380
100.0%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 640
45.5%
Common 613
43.5%
Latin 155
 
11.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
6.9%
41
 
6.4%
35
 
5.5%
29
 
4.5%
28
 
4.4%
22
 
3.4%
18
 
2.8%
18
 
2.8%
18
 
2.8%
16
 
2.5%
Other values (110) 371
58.0%
Latin
ValueCountFrequency (%)
e 26
16.8%
Y 23
14.8%
t 19
12.3%
M 19
12.3%
N 16
10.3%
r 10
 
6.5%
f 8
 
5.2%
x 8
 
5.2%
D 5
 
3.2%
X 5
 
3.2%
Other values (15) 16
10.3%
Common
ValueCountFrequency (%)
380
62.0%
: 56
 
9.1%
/ 42
 
6.9%
, 30
 
4.9%
0 25
 
4.1%
) 17
 
2.8%
9 13
 
2.1%
5 9
 
1.5%
1 8
 
1.3%
2 7
 
1.1%
Other values (8) 26
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 768
54.5%
Hangul 640
45.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
380
49.5%
: 56
 
7.3%
/ 42
 
5.5%
, 30
 
3.9%
e 26
 
3.4%
0 25
 
3.3%
Y 23
 
3.0%
t 19
 
2.5%
M 19
 
2.5%
) 17
 
2.2%
Other values (33) 131
 
17.1%
Hangul
ValueCountFrequency (%)
44
 
6.9%
41
 
6.4%
35
 
5.5%
29
 
4.5%
28
 
4.4%
22
 
3.4%
18
 
2.8%
18
 
2.8%
18
 
2.8%
16
 
2.5%
Other values (110) 371
58.0%

컬럼데이터수
Real number (ℝ)

ZEROS 

Distinct20
Distinct (%)44.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1624.0889
Minimum0
Maximum2177
Zeros1
Zeros (%)2.2%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-12T09:49:16.419111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile243
Q1923
median2177
Q32177
95-th percentile2177
Maximum2177
Range2177
Interquartile range (IQR)1254

Descriptive statistics

Standard deviation715.6248
Coefficient of variation (CV)0.44063155
Kurtosis-0.76687147
Mean1624.0889
Median Absolute Deviation (MAD)0
Skewness-0.80175789
Sum73084
Variance512118.86
MonotonicityNot monotonic
2023-12-12T09:49:16.559815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
2177 23
51.1%
2103 2
 
4.4%
923 2
 
4.4%
2167 2
 
4.4%
1100 1
 
2.2%
1322 1
 
2.2%
1020 1
 
2.2%
142 1
 
2.2%
842 1
 
2.2%
918 1
 
2.2%
Other values (10) 10
22.2%
ValueCountFrequency (%)
0 1
2.2%
20 1
2.2%
142 1
2.2%
647 1
2.2%
842 1
2.2%
858 1
2.2%
905 1
2.2%
907 1
2.2%
915 1
2.2%
918 1
2.2%
ValueCountFrequency (%)
2177 23
51.1%
2167 2
 
4.4%
2103 2
 
4.4%
1322 1
 
2.2%
1106 1
 
2.2%
1100 1
 
2.2%
1020 1
 
2.2%
999 1
 
2.2%
926 1
 
2.2%
923 2
 
4.4%

표시형식
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)24.4%
Missing0
Missing (%)0.0%
Memory size492.0 B
Y 여 | N 부 | M 무응답
15 
텍스트
12 
Free 텍스트
YYYYMMDD
숫자
Other values (6)

Length

Max length81
Median length71
Mean length13.688889
Min length2

Unique

Unique6 ?
Unique (%)13.3%

Sample

1st row문자(5) : XXXXX
2nd row텍스트
3rd row문자(10) : XXXXXXXXXX
4th rowYYYYMMDD
5th row숫자

Common Values

ValueCountFrequency (%)
Y 여 | N 부 | M 무응답 15
33.3%
텍스트 12
26.7%
Free 텍스트 8
17.8%
YYYYMMDD 2
 
4.4%
숫자 2
 
4.4%
문자(5) : XXXXX 1
 
2.2%
문자(10) : XXXXXXXXXX 1
 
2.2%
01 한글해독불가 | 02 초졸이하 | 03 중졸 | 04 고졸 | 05 대졸 | 06 대학원이상 | 98 무응답 | 99 기타 1
 
2.2%
01 회사원 | 02 전문직 | 03 주부 | 04 학생 | 05 군인 | 06 무직 | 07 자유업 | 08 교사 | 98 무응답 | 99 기타 1
 
2.2%
01 맥주 | 02 소주 | 03 양주 | 98 무응답 | 99 기타 1
 
2.2%

Length

2023-12-12T09:49:16.742583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
52
23.0%
텍스트 20
 
8.8%
무응답 18
 
8.0%
y 15
 
6.6%
m 15
 
6.6%
15
 
6.6%
15
 
6.6%
n 15
 
6.6%
free 8
 
3.5%
99 3
 
1.3%
Other values (35) 50
22.1%

Interactions

2023-12-12T09:49:11.926790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:49:11.495736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:49:11.993067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:49:11.573303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:49:16.840019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번컬럼아이디컬럼명데이터타입컬럼설명컬럼데이터수표시형식
순번1.0001.0001.0000.6101.0000.5150.570
컬럼아이디1.0001.0001.0001.0001.0001.0001.000
컬럼명1.0001.0001.0001.0001.0001.0001.000
데이터타입0.6101.0001.0001.0001.0000.4490.933
컬럼설명1.0001.0001.0001.0001.0001.0001.000
컬럼데이터수0.5151.0001.0000.4491.0001.0000.000
표시형식0.5701.0001.0000.9331.0000.0001.000
2023-12-12T09:49:16.958945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
데이터타입표시형식
데이터타입1.0000.743
표시형식0.7431.000
2023-12-12T09:49:17.041283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번컬럼데이터수데이터타입표시형식
순번1.0000.0180.2290.283
컬럼데이터수0.0181.0000.2260.000
데이터타입0.2290.2261.0000.743
표시형식0.2830.0000.7431.000

Missing values

2023-12-12T09:49:12.095567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:49:12.251825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번분류아이디분류명테이블아이디테이블명컬럼아이디컬럼명데이터타입컬럼설명컬럼데이터수표시형식
01PT환자PNCT_PT_HLNF췌장암_환자_건강정보CENTER_CD센터코드VARCHAR(20)센터코드 (5자리 : XXXXX) / 00030 : 국립암센터 예) 000302177문자(5) : XXXXX
12PT환자PNCT_PT_HLNF췌장암_환자_건강정보IRB_APRV_NOIRB승인번호VARCHAR(50)센터별 기준에 따라 생성2177텍스트
23PT환자PNCT_PT_HLNF췌장암_환자_건강정보PT_SBST_NO환자대체번호VARCHAR(10)개인고유번호(10자리) / 센터별 별도부여 예) RN123456782177문자(10) : XXXXXXXXXX
34PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_RCRD_YMD건강환자기록일자VARCHAR(8)기타건강정보 기록일자 / YYYYMMDD2177YYYYMMDD
45PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_HLNF_SEQ건강환자건강정보순번NUMBER(3)건강환자기록일자별 순번2177숫자
56PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_ADM_YMD건강환자입원일자VARCHAR(8)첫번째 간호 정보 작성시 입원한 일자 / YYYYMMDD2177YYYYMMDD
67PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_IADM_AGE건강환자입원시연령NUMBER(4)첫번째 수술 당시 환자 나이 / 정수 예) 452177숫자
78PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_EDU_DGRE_CD건강환자교육정도코드VARCHAR(20)환자의 교육정도코드 / 01 한글해독불가 02 초졸이하 03 중졸 04 고졸 05 대졸 06 대학원이상 98 무응답 99 기타216701 한글해독불가 | 02 초졸이하 | 03 중졸 | 04 고졸 | 05 대졸 | 06 대학원이상 | 98 무응답 | 99 기타
89PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_EDU_DGRE_NM건강환자교육정도명VARCHAR(100)환자의 교육정도코드명 / 예) 대졸2167텍스트
910PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_EDU_DGRE_CD_ETC_CONT건강환자교육정도코드기타내용VARCHAR(8000)환자교육정도코드가 기타 : 99 일 경우 환자의 기타 교육정도 상세내용 / free text20Free 텍스트
순번분류아이디분류명테이블아이디테이블명컬럼아이디컬럼명데이터타입컬럼설명컬럼데이터수표시형식
3536PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_MHIS_CADS_YN_CLSF_CD건강환자병력심장질환여부구분코드VARCHAR(20)환자의 심장질환여부 / Y : 여, N : 부, M : 무응답2177Y 여 | N 부 | M 무응답
3637PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_MHIS_ETC_YN_CLSF_CD건강환자병력기타여부구분코드VARCHAR(20)기타병력 여부 Y: 유, N: 무, M: 무응답2177Y 여 | N 부 | M 무응답
3738PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_MHIS_HTN_CONT건강환자병력고혈압내용VARCHAR(8000)환자의 고혈압 상세내용 / free text918Free 텍스트
3839PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_MHIS_DBT_CONT건강환자병력당뇨내용VARCHAR(8000)환자의 당뇨 상세내용 / free text842Free 텍스트
3940PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_MHIS_CADS_CONT건강환자병력심장질환내용VARCHAR(8000)환자의 심장질환 상세내용 / free text142Free 텍스트
4041PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_MHIS_ETC_CONT건강환자병력기타내용VARCHAR(8000)환자의 기타병력 상세내용 / free text1020Free 텍스트
4142PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_MAIN_SYM_YN_CLSF_CD건강환자주증상여부구분코드VARCHAR(20)주증상여부 Y: 유, N: 무, M: 무응답2177Y 여 | N 부 | M 무응답
4243PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_MAIN_SYM_CONT건강환자주증상내용VARCHAR(8000)환자의 입원 시 주증상 상세내용 / free text1322Free 텍스트
4344PT환자PNCT_PT_HLNF췌장암_환자_건강정보HLPT_OHAD_HSTR_YN_CLSF_CD건강환자타병원진단후전원여부구분코드VARCHAR(20)타병원진단후전원여부 Y: 유, N: 무, M: 무응답2177Y 여 | N 부 | M 무응답
4445PT환자PNCT_PT_HLNF췌장암_환자_건강정보CRTN_DT생성일시DATETIME생성일시 DEFAULT current_timestamp()2177YYYY-MM-DD HH:MI:SS