Overview

Dataset statistics

Number of variables11
Number of observations41
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.7 KiB
Average record size in memory93.2 B

Variable types

Numeric2
Categorical6
Text3

Dataset

DescriptionPRE위암_라이브러리_PRE_위암_검사_ESD_메타정보( 제공 되어질 데이터 항목, 타입, 사이즈, 항목별건수등)를 제공
Author국립암센터
URLhttps://www.data.go.kr/data/15074150/fileData.do

Alerts

분류ID has constant value ""Constant
분류명 has constant value ""Constant
테이블ID has constant value ""Constant
테이블명 has constant value ""Constant
데이터타입 is highly overall correlated with 표시형식High correlation
표시형식 is highly overall correlated with 데이터타입High correlation
순번 has unique valuesUnique
컬럼ID has unique valuesUnique
컬럼명 has unique valuesUnique
컬럼설명 has unique valuesUnique
컬럼데이터수 has 2 (4.9%) zerosZeros

Reproduction

Analysis started2023-12-12 12:54:30.688921
Analysis finished2023-12-12 12:54:31.776851
Duration1.09 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct41
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21
Minimum1
Maximum41
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size501.0 B
2023-12-12T21:54:31.860086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q111
median21
Q331
95-th percentile39
Maximum41
Range40
Interquartile range (IQR)20

Descriptive statistics

Standard deviation11.979149
Coefficient of variation (CV)0.57043565
Kurtosis-1.2
Mean21
Median Absolute Deviation (MAD)10
Skewness0
Sum861
Variance143.5
MonotonicityStrictly increasing
2023-12-12T21:54:31.988748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
1 1
 
2.4%
32 1
 
2.4%
24 1
 
2.4%
25 1
 
2.4%
26 1
 
2.4%
27 1
 
2.4%
28 1
 
2.4%
29 1
 
2.4%
30 1
 
2.4%
31 1
 
2.4%
Other values (31) 31
75.6%
ValueCountFrequency (%)
1 1
2.4%
2 1
2.4%
3 1
2.4%
4 1
2.4%
5 1
2.4%
6 1
2.4%
7 1
2.4%
8 1
2.4%
9 1
2.4%
10 1
2.4%
ValueCountFrequency (%)
41 1
2.4%
40 1
2.4%
39 1
2.4%
38 1
2.4%
37 1
2.4%
36 1
2.4%
35 1
2.4%
34 1
2.4%
33 1
2.4%
32 1
2.4%

분류ID
Categorical

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size460.0 B
EX
41 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEX
2nd rowEX
3rd rowEX
4th rowEX
5th rowEX

Common Values

ValueCountFrequency (%)
EX 41
100.0%

Length

2023-12-12T21:54:32.105400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:54:32.191128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
ex 41
100.0%

분류명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size460.0 B
검사
41 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row검사
2nd row검사
3rd row검사
4th row검사
5th row검사

Common Values

ValueCountFrequency (%)
검사 41
100.0%

Length

2023-12-12T21:54:32.276463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:54:32.374024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
검사 41
100.0%

테이블ID
Categorical

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size460.0 B
PRE_GSTR_EX_ESD
41 

Length

Max length15
Median length15
Mean length15
Min length15

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPRE_GSTR_EX_ESD
2nd rowPRE_GSTR_EX_ESD
3rd rowPRE_GSTR_EX_ESD
4th rowPRE_GSTR_EX_ESD
5th rowPRE_GSTR_EX_ESD

Common Values

ValueCountFrequency (%)
PRE_GSTR_EX_ESD 41
100.0%

Length

2023-12-12T21:54:32.463674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:54:32.556937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pre_gstr_ex_esd 41
100.0%

테이블명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size460.0 B
PRE_위암_검사_ ESD
41 

Length

Max length14
Median length14
Mean length14
Min length14

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPRE_위암_검사_ ESD
2nd rowPRE_위암_검사_ ESD
3rd rowPRE_위암_검사_ ESD
4th rowPRE_위암_검사_ ESD
5th rowPRE_위암_검사_ ESD

Common Values

ValueCountFrequency (%)
PRE_위암_검사_ ESD 41
100.0%

Length

2023-12-12T21:54:32.643363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:54:33.015663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pre_위암_검사 41
50.0%
esd 41
50.0%

컬럼ID
Text

UNIQUE 

Distinct41
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-12T21:54:33.216283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length26
Mean length16.878049
Min length7

Characters and Unicode

Total characters692
Distinct characters25
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)100.0%

Sample

1st rowCENTER_CD
2nd rowIRB_APRV_NO
3rd rowPT_SBST_NO
4th rowESDE_ACPT_YMD
5th rowESDE_SEQ
ValueCountFrequency (%)
center_cd 1
 
2.4%
esde_oprt_rmrg_nm 1
 
2.4%
esde_srmg_dstl_cncr_txsz_vl 1
 
2.4%
esde_antw_rmrg_size_vl 1
 
2.4%
esde_posw_rmrg_size_vl 1
 
2.4%
esde_grtr_rmrg_size_vl 1
 
2.4%
esde_dpmg_rmrg_size_vl 1
 
2.4%
esde_lymp_inva_cd 1
 
2.4%
esde_lymp_inva_nm 1
 
2.4%
esde_vnin_cd 1
 
2.4%
Other values (31) 31
75.6%
2023-12-12T21:54:33.574590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 121
17.5%
E 96
13.9%
D 62
9.0%
S 59
8.5%
C 46
 
6.6%
N 43
 
6.2%
T 39
 
5.6%
R 37
 
5.3%
M 25
 
3.6%
I 22
 
3.2%
Other values (15) 142
20.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 571
82.5%
Connector Punctuation 121
 
17.5%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E 96
16.8%
D 62
10.9%
S 59
10.3%
C 46
 
8.1%
N 43
 
7.5%
T 39
 
6.8%
R 37
 
6.5%
M 25
 
4.4%
I 22
 
3.9%
L 20
 
3.5%
Other values (14) 122
21.4%
Connector Punctuation
ValueCountFrequency (%)
_ 121
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 571
82.5%
Common 121
 
17.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 96
16.8%
D 62
10.9%
S 59
10.3%
C 46
 
8.1%
N 43
 
7.5%
T 39
 
6.8%
R 37
 
6.5%
M 25
 
4.4%
I 22
 
3.9%
L 20
 
3.5%
Other values (14) 122
21.4%
Common
ValueCountFrequency (%)
_ 121
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 692
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 121
17.5%
E 96
13.9%
D 62
9.0%
S 59
8.5%
C 46
 
6.6%
N 43
 
6.2%
T 39
 
5.6%
R 37
 
5.3%
M 25
 
3.6%
I 22
 
3.2%
Other values (15) 142
20.5%

컬럼명
Text

UNIQUE 

Distinct41
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-12T21:54:33.810235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13
Mean length11.02439
Min length4

Characters and Unicode

Total characters452
Distinct characters86
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)100.0%

Sample

1st row센터코드
2nd rowIRB승인번호
3rd row환자대체번호
4th rowESD검사접수일자
5th rowESD검사순번
ValueCountFrequency (%)
센터코드 1
 
2.4%
esd검사수술절제면명 1
 
2.4%
esd검사수술절제면원위암조직크기값 1
 
2.4%
esd검사전위절제면크기값 1
 
2.4%
esd검사후위절제면크기값 1
 
2.4%
esd검사큰만곡절제면크기값 1
 
2.4%
esd검사딥마진절제면크기값 1
 
2.4%
esd검사림프성침윤코드 1
 
2.4%
esd검사림프성침윤명 1
 
2.4%
esd검사정맥침윤코드 1
 
2.4%
Other values (31) 31
75.6%
2023-12-12T21:54:34.266382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
37
 
8.2%
37
 
8.2%
D 37
 
8.2%
S 37
 
8.2%
E 37
 
8.2%
13
 
2.9%
13
 
2.9%
12
 
2.7%
9
 
2.0%
9
 
2.0%
Other values (76) 211
46.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 338
74.8%
Uppercase Letter 114
 
25.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
 
10.9%
37
 
10.9%
13
 
3.8%
13
 
3.8%
12
 
3.6%
9
 
2.7%
9
 
2.7%
9
 
2.7%
9
 
2.7%
8
 
2.4%
Other values (70) 182
53.8%
Uppercase Letter
ValueCountFrequency (%)
D 37
32.5%
S 37
32.5%
E 37
32.5%
I 1
 
0.9%
R 1
 
0.9%
B 1
 
0.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 338
74.8%
Latin 114
 
25.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
 
10.9%
37
 
10.9%
13
 
3.8%
13
 
3.8%
12
 
3.6%
9
 
2.7%
9
 
2.7%
9
 
2.7%
9
 
2.7%
8
 
2.4%
Other values (70) 182
53.8%
Latin
ValueCountFrequency (%)
D 37
32.5%
S 37
32.5%
E 37
32.5%
I 1
 
0.9%
R 1
 
0.9%
B 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 338
74.8%
ASCII 114
 
25.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
37
 
10.9%
37
 
10.9%
13
 
3.8%
13
 
3.8%
12
 
3.6%
9
 
2.7%
9
 
2.7%
9
 
2.7%
9
 
2.7%
8
 
2.4%
Other values (70) 182
53.8%
ASCII
ValueCountFrequency (%)
D 37
32.5%
S 37
32.5%
E 37
32.5%
I 1
 
0.9%
R 1
 
0.9%
B 1
 
0.9%

데이터타입
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)24.4%
Missing0
Missing (%)0.0%
Memory size460.0 B
VARCHAR(20)
12 
VARCHAR(100)
CLOB
VARCHAR(200)
VARCHAR(8)
Other values (5)

Length

Max length13
Median length12
Mean length10.04878
Min length4

Unique

Unique5 ?
Unique (%)12.2%

Sample

1st rowVARCHAR(20)
2nd rowVARCHAR(50)
3rd rowVARCHAR(10)
4th rowVARCHAR(8)
5th rowNUMBER(3)

Common Values

ValueCountFrequency (%)
VARCHAR(20) 12
29.3%
VARCHAR(100) 9
22.0%
CLOB 7
17.1%
VARCHAR(200) 6
14.6%
VARCHAR(8) 2
 
4.9%
VARCHAR(50) 1
 
2.4%
VARCHAR(10) 1
 
2.4%
NUMBER(3) 1
 
2.4%
VARCHAR(1000) 1
 
2.4%
DATETIME 1
 
2.4%

Length

2023-12-12T21:54:34.423343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:54:34.580436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
varchar(20 12
29.3%
varchar(100 9
22.0%
clob 7
17.1%
varchar(200 6
14.6%
varchar(8 2
 
4.9%
varchar(50 1
 
2.4%
varchar(10 1
 
2.4%
number(3 1
 
2.4%
varchar(1000 1
 
2.4%
datetime 1
 
2.4%

컬럼설명
Text

UNIQUE 

Distinct41
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-12T21:54:34.883696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length144
Median length51
Mean length40.829268
Min length13

Characters and Unicode

Total characters1674
Distinct characters158
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)100.0%

Sample

1st row센터코드 (5자리 : XXXXX) / 00030 : 국립암센터 예) 00030
2nd row센터별 기준에 따라 생성
3rd row개인고유번호(10자리) / 센터별 별도부여 예) RN12345678
4th rowESD검사의 접수일자 / YYYYMMDD 예)20200101
5th rowESD검사접수일자별 순번
ValueCountFrequency (%)
41
 
12.6%
23
 
7.1%
esd검사 19
 
5.8%
present 8
 
2.5%
3 7
 
2.1%
free 7
 
2.1%
1 7
 
2.1%
2 7
 
2.1%
text 7
 
2.1%
other 6
 
1.8%
Other values (151) 194
59.5%
2023-12-12T21:54:35.330845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
285
 
17.0%
e 92
 
5.5%
r 56
 
3.3%
t 53
 
3.2%
a 51
 
3.0%
o 47
 
2.8%
n 47
 
2.8%
s 40
 
2.4%
) 38
 
2.3%
/ 38
 
2.3%
Other values (148) 927
55.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 608
36.3%
Other Letter 411
24.6%
Space Separator 285
17.0%
Uppercase Letter 146
 
8.7%
Decimal Number 109
 
6.5%
Other Punctuation 60
 
3.6%
Close Punctuation 38
 
2.3%
Open Punctuation 12
 
0.7%
Dash Punctuation 4
 
0.2%
Connector Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
6.3%
25
 
6.1%
25
 
6.1%
20
 
4.9%
20
 
4.9%
12
 
2.9%
10
 
2.4%
10
 
2.4%
9
 
2.2%
9
 
2.2%
Other values (84) 245
59.6%
Lowercase Letter
ValueCountFrequency (%)
e 92
15.1%
r 56
9.2%
t 53
8.7%
a 51
 
8.4%
o 47
 
7.7%
n 47
 
7.7%
s 40
 
6.6%
i 35
 
5.8%
c 31
 
5.1%
m 26
 
4.3%
Other values (13) 130
21.4%
Uppercase Letter
ValueCountFrequency (%)
D 32
21.9%
E 27
18.5%
S 27
18.5%
I 8
 
5.5%
Y 8
 
5.5%
X 5
 
3.4%
A 5
 
3.4%
N 5
 
3.4%
M 5
 
3.4%
P 4
 
2.7%
Other values (12) 20
13.7%
Decimal Number
ValueCountFrequency (%)
0 38
34.9%
1 20
18.3%
2 17
15.6%
3 12
 
11.0%
9 8
 
7.3%
4 5
 
4.6%
5 4
 
3.7%
6 2
 
1.8%
7 2
 
1.8%
8 1
 
0.9%
Other Punctuation
ValueCountFrequency (%)
/ 38
63.3%
. 11
 
18.3%
, 7
 
11.7%
: 4
 
6.7%
Space Separator
ValueCountFrequency (%)
285
100.0%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 754
45.0%
Common 509
30.4%
Hangul 411
24.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
6.3%
25
 
6.1%
25
 
6.1%
20
 
4.9%
20
 
4.9%
12
 
2.9%
10
 
2.4%
10
 
2.4%
9
 
2.2%
9
 
2.2%
Other values (84) 245
59.6%
Latin
ValueCountFrequency (%)
e 92
 
12.2%
r 56
 
7.4%
t 53
 
7.0%
a 51
 
6.8%
o 47
 
6.2%
n 47
 
6.2%
s 40
 
5.3%
i 35
 
4.6%
D 32
 
4.2%
c 31
 
4.1%
Other values (35) 270
35.8%
Common
ValueCountFrequency (%)
285
56.0%
) 38
 
7.5%
/ 38
 
7.5%
0 38
 
7.5%
1 20
 
3.9%
2 17
 
3.3%
( 12
 
2.4%
3 12
 
2.4%
. 11
 
2.2%
9 8
 
1.6%
Other values (9) 30
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1263
75.4%
Hangul 411
 
24.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
285
22.6%
e 92
 
7.3%
r 56
 
4.4%
t 53
 
4.2%
a 51
 
4.0%
o 47
 
3.7%
n 47
 
3.7%
s 40
 
3.2%
) 38
 
3.0%
/ 38
 
3.0%
Other values (54) 516
40.9%
Hangul
ValueCountFrequency (%)
26
 
6.3%
25
 
6.1%
25
 
6.1%
20
 
4.9%
20
 
4.9%
12
 
2.9%
10
 
2.4%
10
 
2.4%
9
 
2.2%
9
 
2.2%
Other values (84) 245
59.6%

컬럼데이터수
Real number (ℝ)

ZEROS 

Distinct16
Distinct (%)39.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean900.12195
Minimum0
Maximum1089
Zeros2
Zeros (%)4.9%
Negative0
Negative (%)0.0%
Memory size501.0 B
2023-12-12T21:54:35.491430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile94
Q1721
median1089
Q31089
95-th percentile1089
Maximum1089
Range1089
Interquartile range (IQR)368

Descriptive statistics

Standard deviation329.78798
Coefficient of variation (CV)0.36638144
Kurtosis1.746661
Mean900.12195
Median Absolute Deviation (MAD)0
Skewness-1.6992512
Sum36905
Variance108760.11
MonotonicityNot monotonic
2023-12-12T21:54:35.624492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
1089 24
58.5%
0 2
 
4.9%
647 2
 
4.9%
721 1
 
2.4%
1077 1
 
2.4%
1051 1
 
2.4%
94 1
 
2.4%
1062 1
 
2.4%
325 1
 
2.4%
920 1
 
2.4%
Other values (6) 6
 
14.6%
ValueCountFrequency (%)
0 2
4.9%
94 1
2.4%
325 1
2.4%
385 1
2.4%
389 1
2.4%
647 2
4.9%
670 1
2.4%
714 1
2.4%
721 1
2.4%
920 1
2.4%
ValueCountFrequency (%)
1089 24
58.5%
1077 1
 
2.4%
1069 1
 
2.4%
1062 1
 
2.4%
1051 1
 
2.4%
998 1
 
2.4%
920 1
 
2.4%
721 1
 
2.4%
714 1
 
2.4%
670 1
 
2.4%

표시형식
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)34.1%
Missing0
Missing (%)0.0%
Memory size460.0 B
텍스트
19 
Free 텍스트
1 present | 2 absent | 3 no record | 9 other
YYYYMMDD
문자(5) : XXXXX
 
1
Other values (9)

Length

Max length143
Median length57
Mean length15.756098
Min length2

Unique

Unique10 ?
Unique (%)24.4%

Sample

1st row문자(5) : XXXXX
2nd row텍스트
3rd row문자(10) : XXXXXXXXXX
4th rowYYYYMMDD
5th row숫자

Common Values

ValueCountFrequency (%)
텍스트 19
46.3%
Free 텍스트 7
 
17.1%
1 present | 2 absent | 3 no record | 9 other 3
 
7.3%
YYYYMMDD 2
 
4.9%
문자(5) : XXXXX 1
 
2.4%
문자(10) : XXXXXXXXXX 1
 
2.4%
숫자 1
 
2.4%
센터내 수술코드 1
 
2.4%
1 Adenocarcinoma | 2 Signet ring cell carcinoma | 9 Other 1
 
2.4%
1 Well | 2 Moderately | 3 Poorly | 9 other 1
 
2.4%
Other values (4) 4
 
9.8%

Length

2023-12-12T21:54:35.765521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
27
17.4%
텍스트 26
16.8%
1 7
 
4.5%
free 7
 
4.5%
2 7
 
4.5%
3 6
 
3.9%
9 6
 
3.9%
other 6
 
3.9%
absent 4
 
2.6%
present 4
 
2.6%
Other values (48) 55
35.5%

Interactions

2023-12-12T21:54:31.331050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:54:31.147504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:54:31.440013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:54:31.246201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:54:35.858757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번컬럼ID컬럼명데이터타입컬럼설명컬럼데이터수표시형식
순번1.0001.0001.0000.7471.0000.5200.528
컬럼ID1.0001.0001.0001.0001.0001.0001.000
컬럼명1.0001.0001.0001.0001.0001.0001.000
데이터타입0.7471.0001.0001.0001.0000.0000.911
컬럼설명1.0001.0001.0001.0001.0001.0001.000
컬럼데이터수0.5201.0001.0000.0001.0001.0000.000
표시형식0.5281.0001.0000.9111.0000.0001.000
2023-12-12T21:54:35.971777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
표시형식데이터타입
표시형식1.0000.636
데이터타입0.6361.000
2023-12-12T21:54:36.054135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번컬럼데이터수데이터타입표시형식
순번1.000-0.2690.2220.030
컬럼데이터수-0.2691.0000.0000.000
데이터타입0.2220.0001.0000.636
표시형식0.0300.0000.6361.000

Missing values

2023-12-12T21:54:31.557826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:54:31.712034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번분류ID분류명테이블ID테이블명컬럼ID컬럼명데이터타입컬럼설명컬럼데이터수표시형식
01EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDCENTER_CD센터코드VARCHAR(20)센터코드 (5자리 : XXXXX) / 00030 : 국립암센터 예) 000301089문자(5) : XXXXX
12EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDIRB_APRV_NOIRB승인번호VARCHAR(50)센터별 기준에 따라 생성1089텍스트
23EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDPT_SBST_NO환자대체번호VARCHAR(10)개인고유번호(10자리) / 센터별 별도부여 예) RN123456781089문자(10) : XXXXXXXXXX
34EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_ACPT_YMDESD검사접수일자VARCHAR(8)ESD검사의 접수일자 / YYYYMMDD 예)202001011089YYYYMMDD
45EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_SEQESD검사순번NUMBER(3)ESD검사접수일자별 순번1089숫자
56EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_READ_YMDESD검사판독일자VARCHAR(8)ESD검사의 판독일자 / YYYYMMDD 예)202001011089YYYYMMDD
67EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_OPRT_CDESD검사수술코드VARCHAR(20)외과병리보고서의 수술코드 / 예) H120001089센터내 수술코드
78EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_OPRT_NMESD검사수술명VARCHAR(1000)ESD검사의 수술명 / 예) 상부 내시경적 점막하 박리 절제술(ESD)1089텍스트
89EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_GSCN_OPRT_SITE_CONTESD검사위암수술부위내용CLOB외부 및 원내에서 시행한 ESD검사 부위명 / free text 예) Stomach and esophagus1089Free 텍스트
910EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_CRCN_TYPE_CDESD검사암종유형코드VARCHAR(20)ESD검사 암종유형코드 / 1 Adenocarcinoma 2 Signet ring cell carcinoma 9 Other10891 Adenocarcinoma | 2 Signet ring cell carcinoma | 9 Other
순번분류ID분류명테이블ID테이블명컬럼ID컬럼명데이터타입컬럼설명컬럼데이터수표시형식
3132EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_VNIN_NMESD검사정맥침윤명VARCHAR(100)ESD검사 정맥침윤코드명 / 예) present1089텍스트
3233EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_SUMB_FIBR_CDESD검사점막하섬유증코드VARCHAR(20)ESD검사 점막하섬유증코드 / 1 present 2 absent 3 not identified 9 other10891 present | 2 absent | 3 not identified | 9 other
3334EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_SUMB_FIBR_NMESD검사점막하섬유증명VARCHAR(100)ESD검사 점막하섬유증코드명 / 예) present1089텍스트
3435EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_ANIN_CDESD검사혈관조영침윤코드VARCHAR(20)ESD검사 혈관조영침윤코드 / 1 present 2 absent 3 no record 9 other10891 present | 2 absent | 3 no record | 9 other
3536EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_ANIN_NMESD검사혈관조영침윤명VARCHAR(100)ESD검사 혈관조영침윤코드명 / 예) present1089텍스트
3637EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_STAG_VLESD검사병기값VARCHAR(20)ESD검사 종양 병기정보 / 예) pT1bN094텍스트
3738EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_VIEN_CLS_CONTESD검사비엔나분류내용CLOBVienna classification 내용 / free text 예) Vienna 3. Non-invasive low-grade neoplasia1051Free 텍스트
3839EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_CNCR_TXUS_RLCT_CONTESD검사암조직잔존내용CLOB암조직 잔존내용 / free text0Free 텍스트
3940EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDESDE_ETC_CONTESD검사기타내용CLOB외과병리 기타내용 / free text 예) - Histologic mapping was done.1077Free 텍스트
4041EX검사PRE_GSTR_EX_ESDPRE_위암_검사_ ESDCRTN_DT생성일시DATETIME생성일시 DEFAULT current_timestamp()1089YYYY-MM-DD HH:MI:SS