Overview

Dataset statistics

Number of variables11
Number of observations45
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.1 KiB
Average record size in memory92.9 B

Variable types

Numeric2
Categorical6
Text3

Dataset

DescriptionPRE위암_라이브러리_PRE_위암_병리_외과_메타정보( 제공 되어질 데이터 항목, 타입, 사이즈, 항목별건수등)를 제공
Author국립암센터
URLhttps://www.data.go.kr/data/15074152/fileData.do

Alerts

분류ID has constant value ""Constant
분류명 has constant value ""Constant
테이블ID has constant value ""Constant
테이블명 has constant value ""Constant
데이터타입 is highly overall correlated with 표시형식High correlation
표시형식 is highly overall correlated with 데이터타입High correlation
순번 has unique valuesUnique
컬럼ID has unique valuesUnique
컬럼명 has unique valuesUnique
컬럼설명 has unique valuesUnique
컬럼데이터수 has 2 (4.4%) zerosZeros

Reproduction

Analysis started2023-12-12 20:02:57.415783
Analysis finished2023-12-12 20:02:58.850373
Duration1.43 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23
Minimum1
Maximum45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-13T05:02:58.941583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.2
Q112
median23
Q334
95-th percentile42.8
Maximum45
Range44
Interquartile range (IQR)22

Descriptive statistics

Standard deviation13.133926
Coefficient of variation (CV)0.57104024
Kurtosis-1.2
Mean23
Median Absolute Deviation (MAD)11
Skewness0
Sum1035
Variance172.5
MonotonicityStrictly increasing
2023-12-13T05:02:59.087877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
1 1
 
2.2%
35 1
 
2.2%
26 1
 
2.2%
27 1
 
2.2%
28 1
 
2.2%
29 1
 
2.2%
30 1
 
2.2%
31 1
 
2.2%
32 1
 
2.2%
33 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
1 1
2.2%
2 1
2.2%
3 1
2.2%
4 1
2.2%
5 1
2.2%
6 1
2.2%
7 1
2.2%
8 1
2.2%
9 1
2.2%
10 1
2.2%
ValueCountFrequency (%)
45 1
2.2%
44 1
2.2%
43 1
2.2%
42 1
2.2%
41 1
2.2%
40 1
2.2%
39 1
2.2%
38 1
2.2%
37 1
2.2%
36 1
2.2%

분류ID
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
PTH
45 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPTH
2nd rowPTH
3rd rowPTH
4th rowPTH
5th rowPTH

Common Values

ValueCountFrequency (%)
PTH 45
100.0%

Length

2023-12-13T05:02:59.251913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:59.364391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pth 45
100.0%

분류명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
병리
45 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row병리
2nd row병리
3rd row병리
4th row병리
5th row병리

Common Values

ValueCountFrequency (%)
병리 45
100.0%

Length

2023-12-13T05:02:59.494492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:59.600068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
병리 45
100.0%

테이블ID
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
PRE_GSTR_PTH_SRGC
45 

Length

Max length17
Median length17
Mean length17
Min length17

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPRE_GSTR_PTH_SRGC
2nd rowPRE_GSTR_PTH_SRGC
3rd rowPRE_GSTR_PTH_SRGC
4th rowPRE_GSTR_PTH_SRGC
5th rowPRE_GSTR_PTH_SRGC

Common Values

ValueCountFrequency (%)
PRE_GSTR_PTH_SRGC 45
100.0%

Length

2023-12-13T05:02:59.712401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:59.814487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pre_gstr_pth_srgc 45
100.0%

테이블명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
PRE_위암_병리_외과
45 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPRE_위암_병리_외과
2nd rowPRE_위암_병리_외과
3rd rowPRE_위암_병리_외과
4th rowPRE_위암_병리_외과
5th rowPRE_위암_병리_외과

Common Values

ValueCountFrequency (%)
PRE_위암_병리_외과 45
100.0%

Length

2023-12-13T05:02:59.913118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:02:59.994660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pre_위암_병리_외과 45
100.0%

컬럼ID
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-13T05:03:00.209629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length26
Mean length16.511111
Min length7

Characters and Unicode

Total characters743
Distinct characters24
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st rowCENTER_CD
2nd rowIRB_APRV_NO
3rd rowPT_SBST_NO
4th rowSGPT_ACPT_YMD
5th rowSGPT_EXAM_CLSF_CD
ValueCountFrequency (%)
center_cd 1
 
2.2%
sgpt_oprt_rmrg_nm 1
 
2.2%
sgpt_srmg_dstl_cncr_txsz_vl 1
 
2.2%
sgpt_lymp_inva_cd 1
 
2.2%
sgpt_lymp_inva_nm 1
 
2.2%
sgpt_vnin_cd 1
 
2.2%
sgpt_vnin_nm 1
 
2.2%
sgpt_sumb_fibr_cd 1
 
2.2%
sgpt_sumb_fibr_nm 1
 
2.2%
sgpt_anin_cd 1
 
2.2%
Other values (35) 35
77.8%
2023-12-13T05:03:00.629497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 129
17.4%
T 85
11.4%
P 64
 
8.6%
S 64
 
8.6%
N 51
 
6.9%
C 49
 
6.6%
G 48
 
6.5%
R 30
 
4.0%
D 29
 
3.9%
M 28
 
3.8%
Other values (14) 166
22.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 614
82.6%
Connector Punctuation 129
 
17.4%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
T 85
13.8%
P 64
10.4%
S 64
10.4%
N 51
 
8.3%
C 49
 
8.0%
G 48
 
7.8%
R 30
 
4.9%
D 29
 
4.7%
M 28
 
4.6%
E 26
 
4.2%
Other values (13) 140
22.8%
Connector Punctuation
ValueCountFrequency (%)
_ 129
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 614
82.6%
Common 129
 
17.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
T 85
13.8%
P 64
10.4%
S 64
10.4%
N 51
 
8.3%
C 49
 
8.0%
G 48
 
7.8%
R 30
 
4.9%
D 29
 
4.7%
M 28
 
4.6%
E 26
 
4.2%
Other values (13) 140
22.8%
Common
ValueCountFrequency (%)
_ 129
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 743
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 129
17.4%
T 85
11.4%
P 64
 
8.6%
S 64
 
8.6%
N 51
 
6.9%
C 49
 
6.6%
G 48
 
6.5%
R 30
 
4.0%
D 29
 
3.9%
M 28
 
3.8%
Other values (14) 166
22.3%

컬럼명
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-13T05:03:00.924760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length13
Mean length10
Min length4

Characters and Unicode

Total characters450
Distinct characters82
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row센터코드
2nd rowIRB승인번호
3rd row환자대체번호
4th row외과병리접수일자
5th row외과병리검사구분코드
ValueCountFrequency (%)
센터코드 1
 
2.2%
외과병리수술절제면명 1
 
2.2%
외과병리수술절제면원위암조직크기값 1
 
2.2%
외과병리림프성침윤코드 1
 
2.2%
외과병리림프성침윤명 1
 
2.2%
외과병리정맥침윤코드 1
 
2.2%
외과병리정맥침윤명 1
 
2.2%
외과병리점막하섬유증코드 1
 
2.2%
외과병리점막하섬유증명 1
 
2.2%
외과병리혈관조영침윤코드 1
 
2.2%
Other values (35) 35
77.8%
2023-12-13T05:03:01.349416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
9.3%
41
 
9.1%
41
 
9.1%
41
 
9.1%
17
 
3.8%
16
 
3.6%
14
 
3.1%
14
 
3.1%
12
 
2.7%
9
 
2.0%
Other values (72) 203
45.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 447
99.3%
Uppercase Letter 3
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
9.4%
41
 
9.2%
41
 
9.2%
41
 
9.2%
17
 
3.8%
16
 
3.6%
14
 
3.1%
14
 
3.1%
12
 
2.7%
9
 
2.0%
Other values (69) 200
44.7%
Uppercase Letter
ValueCountFrequency (%)
B 1
33.3%
R 1
33.3%
I 1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 447
99.3%
Latin 3
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
9.4%
41
 
9.2%
41
 
9.2%
41
 
9.2%
17
 
3.8%
16
 
3.6%
14
 
3.1%
14
 
3.1%
12
 
2.7%
9
 
2.0%
Other values (69) 200
44.7%
Latin
ValueCountFrequency (%)
B 1
33.3%
R 1
33.3%
I 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 447
99.3%
ASCII 3
 
0.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
9.4%
41
 
9.2%
41
 
9.2%
41
 
9.2%
17
 
3.8%
16
 
3.6%
14
 
3.1%
14
 
3.1%
12
 
2.7%
9
 
2.0%
Other values (69) 200
44.7%
ASCII
ValueCountFrequency (%)
B 1
33.3%
R 1
33.3%
I 1
33.3%

데이터타입
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)22.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
VARCHAR(20)
15 
VARCHAR(100)
12 
CLOB
VARCHAR(8)
VARCHAR(200)
Other values (5)

Length

Max length13
Median length12
Mean length9.8
Min length4

Unique

Unique5 ?
Unique (%)11.1%

Sample

1st rowVARCHAR(20)
2nd rowVARCHAR(50)
3rd rowVARCHAR(10)
4th rowVARCHAR(8)
5th rowVARCHAR(20)

Common Values

ValueCountFrequency (%)
VARCHAR(20) 15
33.3%
VARCHAR(100) 12
26.7%
CLOB 9
20.0%
VARCHAR(8) 2
 
4.4%
VARCHAR(200) 2
 
4.4%
VARCHAR(50) 1
 
2.2%
VARCHAR(10) 1
 
2.2%
NUMBER(3) 1
 
2.2%
VARCHAR(1000) 1
 
2.2%
DATETIME 1
 
2.2%

Length

2023-12-13T05:03:01.486824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:03:01.597207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
varchar(20 15
33.3%
varchar(100 12
26.7%
clob 9
20.0%
varchar(8 2
 
4.4%
varchar(200 2
 
4.4%
varchar(50 1
 
2.2%
varchar(10 1
 
2.2%
number(3 1
 
2.2%
varchar(1000 1
 
2.2%
datetime 1
 
2.2%

컬럼설명
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-13T05:03:01.872097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length187
Median length53
Mean length44.888889
Min length13

Characters and Unicode

Total characters2020
Distinct characters165
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row센터코드 (5자리 : XXXXX) / 00030 : 국립암센터 예) 00030
2nd row센터별 기준에 따라 생성
3rd row개인고유번호(10자리) / 센터별 별도부여 예) RN12345678
4th row수술 후 외과병리검사의 접수일자 / YYYYMMDD 예)20200101
5th row외과병리검사 구분코드 / 1 EGC 2 AGC
ValueCountFrequency (%)
44
 
10.8%
25
 
6.1%
외과병리 20
 
4.9%
2 11
 
2.7%
1 11
 
2.7%
present 10
 
2.5%
free 9
 
2.2%
9 9
 
2.2%
text 9
 
2.2%
ln 8
 
2.0%
Other values (183) 251
61.7%
2023-12-13T05:03:02.299423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
362
 
17.9%
e 103
 
5.1%
t 66
 
3.3%
r 62
 
3.1%
a 53
 
2.6%
o 53
 
2.6%
n 52
 
2.6%
/ 51
 
2.5%
0 49
 
2.4%
s 45
 
2.2%
Other values (155) 1124
55.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 687
34.0%
Other Letter 566
28.0%
Space Separator 362
17.9%
Decimal Number 147
 
7.3%
Uppercase Letter 99
 
4.9%
Other Punctuation 97
 
4.8%
Close Punctuation 43
 
2.1%
Open Punctuation 15
 
0.7%
Dash Punctuation 3
 
0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
5.7%
30
 
5.3%
30
 
5.3%
28
 
4.9%
28
 
4.9%
27
 
4.8%
26
 
4.6%
14
 
2.5%
14
 
2.5%
13
 
2.3%
Other values (90) 324
57.2%
Lowercase Letter
ValueCountFrequency (%)
e 103
15.0%
t 66
9.6%
r 62
 
9.0%
a 53
 
7.7%
o 53
 
7.7%
n 52
 
7.6%
s 45
 
6.6%
i 36
 
5.2%
m 30
 
4.4%
c 29
 
4.2%
Other values (13) 158
23.0%
Uppercase Letter
ValueCountFrequency (%)
N 14
14.1%
L 10
 
10.1%
Y 8
 
8.1%
I 8
 
8.1%
D 7
 
7.1%
C 6
 
6.1%
A 6
 
6.1%
G 5
 
5.1%
E 5
 
5.1%
M 5
 
5.1%
Other values (11) 25
25.3%
Decimal Number
ValueCountFrequency (%)
0 49
33.3%
1 27
18.4%
2 22
15.0%
3 15
 
10.2%
9 11
 
7.5%
4 8
 
5.4%
5 6
 
4.1%
7 4
 
2.7%
6 3
 
2.0%
8 2
 
1.4%
Other Punctuation
ValueCountFrequency (%)
/ 51
52.6%
, 17
 
17.5%
: 13
 
13.4%
# 9
 
9.3%
. 6
 
6.2%
; 1
 
1.0%
Space Separator
ValueCountFrequency (%)
362
100.0%
Close Punctuation
ValueCountFrequency (%)
) 43
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 786
38.9%
Common 668
33.1%
Hangul 566
28.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
5.7%
30
 
5.3%
30
 
5.3%
28
 
4.9%
28
 
4.9%
27
 
4.8%
26
 
4.6%
14
 
2.5%
14
 
2.5%
13
 
2.3%
Other values (90) 324
57.2%
Latin
ValueCountFrequency (%)
e 103
13.1%
t 66
 
8.4%
r 62
 
7.9%
a 53
 
6.7%
o 53
 
6.7%
n 52
 
6.6%
s 45
 
5.7%
i 36
 
4.6%
m 30
 
3.8%
c 29
 
3.7%
Other values (34) 257
32.7%
Common
ValueCountFrequency (%)
362
54.2%
/ 51
 
7.6%
0 49
 
7.3%
) 43
 
6.4%
1 27
 
4.0%
2 22
 
3.3%
, 17
 
2.5%
( 15
 
2.2%
3 15
 
2.2%
: 13
 
1.9%
Other values (11) 54
 
8.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1454
72.0%
Hangul 566
 
28.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
362
24.9%
e 103
 
7.1%
t 66
 
4.5%
r 62
 
4.3%
a 53
 
3.6%
o 53
 
3.6%
n 52
 
3.6%
/ 51
 
3.5%
0 49
 
3.4%
s 45
 
3.1%
Other values (55) 558
38.4%
Hangul
ValueCountFrequency (%)
32
 
5.7%
30
 
5.3%
30
 
5.3%
28
 
4.9%
28
 
4.9%
27
 
4.8%
26
 
4.6%
14
 
2.5%
14
 
2.5%
13
 
2.3%
Other values (90) 324
57.2%

컬럼데이터수
Real number (ℝ)

ZEROS 

Distinct15
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3535.0444
Minimum0
Maximum4189
Zeros2
Zeros (%)4.4%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-13T05:03:02.416309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile62.6
Q14139
median4189
Q34189
95-th percentile4189
Maximum4189
Range4189
Interquartile range (IQR)50

Descriptive statistics

Standard deviation1428.4598
Coefficient of variation (CV)0.40408539
Kurtosis1.9345703
Mean3535.0444
Median Absolute Deviation (MAD)0
Skewness-1.9141028
Sum159077
Variance2040497.5
MonotonicityNot monotonic
2023-12-13T05:03:02.513801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
4189 29
64.4%
0 2
 
4.4%
4139 2
 
4.4%
4187 1
 
2.2%
53 1
 
2.2%
4151 1
 
2.2%
112 1
 
2.2%
4170 1
 
2.2%
4168 1
 
2.2%
4122 1
 
2.2%
Other values (5) 5
 
11.1%
ValueCountFrequency (%)
0 2
4.4%
53 1
2.2%
101 1
2.2%
112 1
2.2%
949 1
2.2%
1112 1
2.2%
2346 1
2.2%
3847 1
2.2%
4122 1
2.2%
4139 2
4.4%
ValueCountFrequency (%)
4189 29
64.4%
4187 1
 
2.2%
4170 1
 
2.2%
4168 1
 
2.2%
4151 1
 
2.2%
4139 2
 
4.4%
4122 1
 
2.2%
3847 1
 
2.2%
2346 1
 
2.2%
1112 1
 
2.2%

표시형식
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)35.6%
Missing0
Missing (%)0.0%
Memory size492.0 B
텍스트
18 
Free 텍스트
1 present | 2 absent | 3 no record | 9 other
YYYYMMDD
문자(5) : XXXXX
 
1
Other values (11)
11 

Length

Max length143
Median length57
Mean length16.711111
Min length2

Unique

Unique12 ?
Unique (%)26.7%

Sample

1st row문자(5) : XXXXX
2nd row텍스트
3rd row문자(10) : XXXXXXXXXX
4th rowYYYYMMDD
5th row1 EGC | 2 AGC

Common Values

ValueCountFrequency (%)
텍스트 18
40.0%
Free 텍스트 9
20.0%
1 present | 2 absent | 3 no record | 9 other 4
 
8.9%
YYYYMMDD 2
 
4.4%
문자(5) : XXXXX 1
 
2.2%
문자(10) : XXXXXXXXXX 1
 
2.2%
1 EGC | 2 AGC 1
 
2.2%
숫자 1
 
2.2%
센터내 수술코드 1
 
2.2%
1 Adenocarcinoma | 2 Signet ring cell carcinoma | 9 Other 1
 
2.2%
Other values (6) 6
 
13.3%

Length

2023-12-13T05:03:02.646872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
33
18.0%
텍스트 27
14.8%
1 10
 
5.5%
2 10
 
5.5%
free 9
 
4.9%
9 8
 
4.4%
other 8
 
4.4%
3 7
 
3.8%
present 5
 
2.7%
absent 5
 
2.7%
Other values (52) 61
33.3%

Interactions

2023-12-13T05:02:58.382988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:02:57.877371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:02:58.474028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:02:58.305758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:03:02.731854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번컬럼ID컬럼명데이터타입컬럼설명컬럼데이터수표시형식
순번1.0001.0001.0000.3811.0000.1550.383
컬럼ID1.0001.0001.0001.0001.0001.0001.000
컬럼명1.0001.0001.0001.0001.0001.0001.000
데이터타입0.3811.0001.0001.0001.0000.3130.914
컬럼설명1.0001.0001.0001.0001.0001.0001.000
컬럼데이터수0.1551.0001.0000.3131.0001.0000.000
표시형식0.3831.0001.0000.9141.0000.0001.000
2023-12-13T05:03:02.836825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
표시형식데이터타입
표시형식1.0000.614
데이터타입0.6141.000
2023-12-13T05:03:02.914858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번컬럼데이터수데이터타입표시형식
순번1.000-0.3250.1470.089
컬럼데이터수-0.3251.0000.1630.000
데이터타입0.1470.1631.0000.614
표시형식0.0890.0000.6141.000

Missing values

2023-12-13T05:02:58.596055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:02:58.782832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번분류ID분류명테이블ID테이블명컬럼ID컬럼명데이터타입컬럼설명컬럼데이터수표시형식
01PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과CENTER_CD센터코드VARCHAR(20)센터코드 (5자리 : XXXXX) / 00030 : 국립암센터 예) 000304189문자(5) : XXXXX
12PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과IRB_APRV_NOIRB승인번호VARCHAR(50)센터별 기준에 따라 생성4189텍스트
23PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과PT_SBST_NO환자대체번호VARCHAR(10)개인고유번호(10자리) / 센터별 별도부여 예) RN123456784189문자(10) : XXXXXXXXXX
34PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_ACPT_YMD외과병리접수일자VARCHAR(8)수술 후 외과병리검사의 접수일자 / YYYYMMDD 예)202001014189YYYYMMDD
45PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_EXAM_CLSF_CD외과병리검사구분코드VARCHAR(20)외과병리검사 구분코드 / 1 EGC 2 AGC41891 EGC | 2 AGC
56PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_SEQ외과병리순번NUMBER(3)외과병리검사구분코드별 순번4189숫자
67PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_EXAM_CLSF_NM외과병리검사구분명VARCHAR(100)외과병리검사구분코드명 / 예) EGC4189텍스트
78PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_READ_YMD외과병리판독일자VARCHAR(8)수술 후 외과병리검사의 판독일자 / YYYYMMDD 예)202001014189YYYYMMDD
89PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_OPRT_CD외과병리수술코드VARCHAR(20)외과병리보고서의 수술코드 / 예) H120004189센터내 수술코드
910PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_OPRT_NM외과병리수술명VARCHAR(1000)외과병리보고서의 수술명 / 예) 상부 내시경적 점막하 박리 절제술(ESD)4189텍스트
순번분류ID분류명테이블ID테이블명컬럼ID컬럼명데이터타입컬럼설명컬럼데이터수표시형식
3536PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_NERV_PREX_NM외과병리신경주위침윤명VARCHAR(100)외과병리 신경주위침윤코드명 / 예) present4189텍스트
3637PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_INVA_SHAP_CD외과병리침윤모양코드VARCHAR(20)종양의 침윤 모양분류코드 / 1 infiltrative 2 pushing 9 Other41891 infiltrative | 2 pushing | 9 Other
3738PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_INVA_SHAP_NM외과병리침윤모양명VARCHAR(100)종양의 침윤 모양분류명 / 예) infiltrative4189텍스트
3839PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_LYMP_INVA_DETL_CONT외과병리림프성침윤상세내용CLOB림프성침윤 상세내용 / free text 예) multifocal, mural, extramural and peritumoral1112Free 텍스트
3940PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_LN_MTST_CONT외과병리림프절전이내용CLOBlymph node 전이 절제내용 / free text 예) no metastasis in 21 lymph nodes (pN0)(LN #1: 0/5, LN #2: 0/3, LN #3: 0/12, LN #4sa: 0/0, LN #4sb: 0/0, LN #7: 0/0, LN #8a: 0/0, LN #9: 0/0, LN #11d: 0/1)3847Free 텍스트
4041PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_MTST_SITE_CONT외과병리전이부위내용CLOB전이부위내용 / free text 예) mesocolon (pP1)101Free 텍스트
4142PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_STAG_VL외과병리병기값VARCHAR(20)위암 수술 후 종양 병기정보 / 예) pT1bN0949텍스트
4243PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_RELT_SLID_CONT외과병리관련슬라이드내용CLOB관련 슬라이드 내용 / free text 예) S05-4376; Adenocarcinoma0Free 텍스트
4344PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과SGPT_ETC_CONT외과병리기타내용CLOB외과병리 기타내용 / free text 예) - Histologic mapping was done.2346Free 텍스트
4445PTH병리PRE_GSTR_PTH_SRGCPRE_위암_병리_외과CRTN_DT생성일시DATETIME생성일시 DEFAULT current_timestamp()4189YYYY-MM-DD HH:MI:SS