Overview

Dataset statistics

Number of variables11
Number of observations21
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory96.3 B

Variable types

Numeric1
Categorical6
Text4

Dataset

DescriptionPRE위암_라이브러리_PRE_위암_치료_방사선_메타정보( 제공 되어질 데이터 항목, 타입, 사이즈, 항목별건수등)를 제공
Author국립암센터
URLhttps://www.data.go.kr/data/15074155/fileData.do

Alerts

분류ID has constant value ""Constant
분류명 has constant value ""Constant
테이블ID has constant value ""Constant
테이블명 has constant value ""Constant
순번 has unique valuesUnique
컬럼ID has unique valuesUnique
컬럼명 has unique valuesUnique
컬럼설명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:54:08.934738
Analysis finished2023-12-12 09:54:10.125818
Duration1.19 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11
Minimum1
Maximum21
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size321.0 B
2023-12-12T18:54:10.250064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q16
median11
Q316
95-th percentile20
Maximum21
Range20
Interquartile range (IQR)10

Descriptive statistics

Standard deviation6.2048368
Coefficient of variation (CV)0.56407607
Kurtosis-1.2
Mean11
Median Absolute Deviation (MAD)5
Skewness0
Sum231
Variance38.5
MonotonicityStrictly increasing
2023-12-12T18:54:10.448202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
1 1
 
4.8%
2 1
 
4.8%
21 1
 
4.8%
20 1
 
4.8%
19 1
 
4.8%
18 1
 
4.8%
17 1
 
4.8%
16 1
 
4.8%
15 1
 
4.8%
14 1
 
4.8%
Other values (11) 11
52.4%
ValueCountFrequency (%)
1 1
4.8%
2 1
4.8%
3 1
4.8%
4 1
4.8%
5 1
4.8%
6 1
4.8%
7 1
4.8%
8 1
4.8%
9 1
4.8%
10 1
4.8%
ValueCountFrequency (%)
21 1
4.8%
20 1
4.8%
19 1
4.8%
18 1
4.8%
17 1
4.8%
16 1
4.8%
15 1
4.8%
14 1
4.8%
13 1
4.8%
12 1
4.8%

분류ID
Categorical

CONSTANT 

Distinct1
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size300.0 B
TRTM
21 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTRTM
2nd rowTRTM
3rd rowTRTM
4th rowTRTM
5th rowTRTM

Common Values

ValueCountFrequency (%)
TRTM 21
100.0%

Length

2023-12-12T18:54:10.651559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:54:10.800848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
trtm 21
100.0%

분류명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size300.0 B
치료
21 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row치료
2nd row치료
3rd row치료
4th row치료
5th row치료

Common Values

ValueCountFrequency (%)
치료 21
100.0%

Length

2023-12-12T18:54:10.947724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:54:11.069928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
치료 21
100.0%

테이블ID
Categorical

CONSTANT 

Distinct1
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size300.0 B
PRE_GSTR_TRTM_RD
21 

Length

Max length16
Median length16
Mean length16
Min length16

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPRE_GSTR_TRTM_RD
2nd rowPRE_GSTR_TRTM_RD
3rd rowPRE_GSTR_TRTM_RD
4th rowPRE_GSTR_TRTM_RD
5th rowPRE_GSTR_TRTM_RD

Common Values

ValueCountFrequency (%)
PRE_GSTR_TRTM_RD 21
100.0%

Length

2023-12-12T18:54:11.218755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:54:11.345915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pre_gstr_trtm_rd 21
100.0%

테이블명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size300.0 B
PRE_위암_치료_방사선
21 

Length

Max length13
Median length13
Mean length13
Min length13

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPRE_위암_치료_방사선
2nd rowPRE_위암_치료_방사선
3rd rowPRE_위암_치료_방사선
4th rowPRE_위암_치료_방사선
5th rowPRE_위암_치료_방사선

Common Values

ValueCountFrequency (%)
PRE_위암_치료_방사선 21
100.0%

Length

2023-12-12T18:54:11.478318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:54:11.634280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pre_위암_치료_방사선 21
100.0%

컬럼ID
Text

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
2023-12-12T18:54:11.845480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length12.47619
Min length7

Characters and Unicode

Total characters262
Distinct characters21
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st rowCENTER_CD
2nd rowIRB_APRV_NO
3rd rowPT_SBST_NO
4th rowRDT_STRT_YMD
5th rowRDT_SEQ
ValueCountFrequency (%)
center_cd 1
 
4.8%
rdt_prsc_nm 1
 
4.8%
rdt_smnt_cont 1
 
4.8%
rdt_totl_trtm_nt 1
 
4.8%
rdt_totl_cgy 1
 
4.8%
rdt_tm1_cgy 1
 
4.8%
rdt_end_ymd 1
 
4.8%
rdt_prps_cd_etc_cont 1
 
4.8%
rdt_prps_nm 1
 
4.8%
rdt_prps_cd 1
 
4.8%
Other values (11) 11
52.4%
2023-12-12T18:54:12.264977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 47
17.9%
T 38
14.5%
R 31
11.8%
D 31
11.8%
C 21
8.0%
S 17
 
6.5%
N 17
 
6.5%
P 13
 
5.0%
M 9
 
3.4%
E 8
 
3.1%
Other values (11) 30
11.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 214
81.7%
Connector Punctuation 47
 
17.9%
Decimal Number 1
 
0.4%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
T 38
17.8%
R 31
14.5%
D 31
14.5%
C 21
9.8%
S 17
7.9%
N 17
7.9%
P 13
 
6.1%
M 9
 
4.2%
E 8
 
3.7%
O 6
 
2.8%
Other values (9) 23
10.7%
Connector Punctuation
ValueCountFrequency (%)
_ 47
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 214
81.7%
Common 48
 
18.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
T 38
17.8%
R 31
14.5%
D 31
14.5%
C 21
9.8%
S 17
7.9%
N 17
7.9%
P 13
 
6.1%
M 9
 
4.2%
E 8
 
3.7%
O 6
 
2.8%
Other values (9) 23
10.7%
Common
ValueCountFrequency (%)
_ 47
97.9%
1 1
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 262
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 47
17.9%
T 38
14.5%
R 31
11.8%
D 31
11.8%
C 21
8.0%
S 17
 
6.5%
N 17
 
6.5%
P 13
 
5.0%
M 9
 
3.4%
E 8
 
3.1%
Other values (11) 30
11.5%

컬럼명
Text

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
2023-12-12T18:54:12.533951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length8.7142857
Min length4

Characters and Unicode

Total characters183
Distinct characters49
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row센터코드
2nd rowIRB승인번호
3rd row환자대체번호
4th row방사선치료시작일자
5th row방사선치료순번
ValueCountFrequency (%)
센터코드 1
 
4.8%
방사선치료처방명 1
 
4.8%
방사선치료평가내용 1
 
4.8%
방사선치료총치료횟수 1
 
4.8%
방사선치료총선량 1
 
4.8%
방사선치료1회선량 1
 
4.8%
방사선치료종료일자 1
 
4.8%
방사선치료목적코드기타내용 1
 
4.8%
방사선치료목적명 1
 
4.8%
방사선치료목적코드 1
 
4.8%
Other values (11) 11
52.4%
2023-12-12T18:54:12.981353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
12.0%
19
 
10.4%
19
 
10.4%
18
 
9.8%
17
 
9.3%
7
 
3.8%
7
 
3.8%
5
 
2.7%
4
 
2.2%
4
 
2.2%
Other values (39) 61
33.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 176
96.2%
Uppercase Letter 6
 
3.3%
Decimal Number 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
12.5%
19
 
10.8%
19
 
10.8%
18
 
10.2%
17
 
9.7%
7
 
4.0%
7
 
4.0%
5
 
2.8%
4
 
2.3%
4
 
2.3%
Other values (33) 54
30.7%
Uppercase Letter
ValueCountFrequency (%)
I 2
33.3%
D 1
16.7%
E 1
16.7%
B 1
16.7%
R 1
16.7%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 176
96.2%
Latin 6
 
3.3%
Common 1
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
12.5%
19
 
10.8%
19
 
10.8%
18
 
10.2%
17
 
9.7%
7
 
4.0%
7
 
4.0%
5
 
2.8%
4
 
2.3%
4
 
2.3%
Other values (33) 54
30.7%
Latin
ValueCountFrequency (%)
I 2
33.3%
D 1
16.7%
E 1
16.7%
B 1
16.7%
R 1
16.7%
Common
ValueCountFrequency (%)
1 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 176
96.2%
ASCII 7
 
3.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
12.5%
19
 
10.8%
19
 
10.8%
18
 
10.2%
17
 
9.7%
7
 
4.0%
7
 
4.0%
5
 
2.8%
4
 
2.3%
4
 
2.3%
Other values (33) 54
30.7%
ASCII
ValueCountFrequency (%)
I 2
28.6%
1 1
14.3%
D 1
14.3%
E 1
14.3%
B 1
14.3%
R 1
14.3%

데이터타입
Categorical

Distinct10
Distinct (%)47.6%
Missing0
Missing (%)0.0%
Memory size300.0 B
VARCHAR(20)
VARCHAR(200)
NUMBER(5)
VARCHAR(8)
CLOB
Other values (5)

Length

Max length12
Median length11
Mean length9.9047619
Min length4

Unique

Unique5 ?
Unique (%)23.8%

Sample

1st rowVARCHAR(20)
2nd rowVARCHAR(50)
3rd rowVARCHAR(10)
4th rowVARCHAR(8)
5th rowNUMBER(3)

Common Values

ValueCountFrequency (%)
VARCHAR(20) 6
28.6%
VARCHAR(200) 3
14.3%
NUMBER(5) 3
14.3%
VARCHAR(8) 2
 
9.5%
CLOB 2
 
9.5%
VARCHAR(50) 1
 
4.8%
VARCHAR(10) 1
 
4.8%
NUMBER(3) 1
 
4.8%
VARCHAR(100) 1
 
4.8%
DATETIME 1
 
4.8%

Length

2023-12-12T18:54:13.189837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:54:13.395434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
varchar(20 6
28.6%
varchar(200 3
14.3%
number(5 3
14.3%
varchar(8 2
 
9.5%
clob 2
 
9.5%
varchar(50 1
 
4.8%
varchar(10 1
 
4.8%
number(3 1
 
4.8%
varchar(100 1
 
4.8%
datetime 1
 
4.8%

컬럼설명
Text

UNIQUE 

Distinct21
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size300.0 B
2023-12-12T18:54:13.690814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length183
Median length37
Mean length41.571429
Min length12

Characters and Unicode

Total characters873
Distinct characters138
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row센터코드 (5자리 : XXXXX) / 00030 : 국립암센터 예) 00030
2nd row센터별 기준에 따라 생성
3rd row개인고유번호(10자리) / 센터별 별도부여 예) RN12345678
4th row방사선치료 시작일자 / YYYYMMDD 예)20200101
5th row방사선치료 시작일자별 순번 / 예) 1
ValueCountFrequency (%)
30
 
15.4%
방사선치료 12
 
6.2%
11
 
5.6%
1회당 8
 
4.1%
방사선 4
 
2.1%
치료 3
 
1.5%
정위적 3
 
1.5%
정수값 3
 
1.5%
체외조사 2
 
1.0%
08 2
 
1.0%
Other values (94) 117
60.0%
2023-12-12T18:54:14.144398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
174
 
19.9%
0 40
 
4.6%
26
 
3.0%
1 25
 
2.9%
24
 
2.7%
24
 
2.7%
23
 
2.6%
22
 
2.5%
e 20
 
2.3%
/ 18
 
2.1%
Other values (128) 477
54.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 321
36.8%
Space Separator 174
19.9%
Lowercase Letter 133
15.2%
Decimal Number 108
 
12.4%
Uppercase Letter 60
 
6.9%
Other Punctuation 30
 
3.4%
Close Punctuation 24
 
2.7%
Open Punctuation 11
 
1.3%
Math Symbol 10
 
1.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
8.1%
24
 
7.5%
24
 
7.5%
23
 
7.2%
22
 
6.9%
13
 
4.0%
9
 
2.8%
8
 
2.5%
7
 
2.2%
7
 
2.2%
Other values (68) 158
49.2%
Uppercase Letter
ValueCountFrequency (%)
Y 9
15.0%
X 6
10.0%
D 6
10.0%
A 6
10.0%
M 5
8.3%
B 4
 
6.7%
C 4
 
6.7%
R 3
 
5.0%
P 3
 
5.0%
E 2
 
3.3%
Other values (11) 12
20.0%
Lowercase Letter
ValueCountFrequency (%)
e 20
15.0%
r 16
12.0%
t 16
12.0%
a 14
10.5%
i 11
8.3%
l 9
 
6.8%
n 8
 
6.0%
d 5
 
3.8%
o 5
 
3.8%
v 4
 
3.0%
Other values (9) 25
18.8%
Decimal Number
ValueCountFrequency (%)
0 40
37.0%
1 25
23.1%
2 10
 
9.3%
9 7
 
6.5%
5 6
 
5.6%
3 5
 
4.6%
6 5
 
4.6%
8 4
 
3.7%
4 3
 
2.8%
7 3
 
2.8%
Other Punctuation
ValueCountFrequency (%)
/ 18
60.0%
: 12
40.0%
Close Punctuation
ValueCountFrequency (%)
) 16
66.7%
] 8
33.3%
Open Punctuation
ValueCountFrequency (%)
[ 8
72.7%
( 3
 
27.3%
Space Separator
ValueCountFrequency (%)
174
100.0%
Math Symbol
ValueCountFrequency (%)
| 10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 359
41.1%
Hangul 321
36.8%
Latin 193
22.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
8.1%
24
 
7.5%
24
 
7.5%
23
 
7.2%
22
 
6.9%
13
 
4.0%
9
 
2.8%
8
 
2.5%
7
 
2.2%
7
 
2.2%
Other values (68) 158
49.2%
Latin
ValueCountFrequency (%)
e 20
 
10.4%
r 16
 
8.3%
t 16
 
8.3%
a 14
 
7.3%
i 11
 
5.7%
Y 9
 
4.7%
l 9
 
4.7%
n 8
 
4.1%
X 6
 
3.1%
D 6
 
3.1%
Other values (30) 78
40.4%
Common
ValueCountFrequency (%)
174
48.5%
0 40
 
11.1%
1 25
 
7.0%
/ 18
 
5.0%
) 16
 
4.5%
: 12
 
3.3%
2 10
 
2.8%
| 10
 
2.8%
[ 8
 
2.2%
] 8
 
2.2%
Other values (10) 38
 
10.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 552
63.2%
Hangul 321
36.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
174
31.5%
0 40
 
7.2%
1 25
 
4.5%
e 20
 
3.6%
/ 18
 
3.3%
r 16
 
2.9%
) 16
 
2.9%
t 16
 
2.9%
a 14
 
2.5%
: 12
 
2.2%
Other values (50) 201
36.4%
Hangul
ValueCountFrequency (%)
26
 
8.1%
24
 
7.5%
24
 
7.5%
23
 
7.2%
22
 
6.9%
13
 
4.0%
9
 
2.8%
8
 
2.5%
7
 
2.2%
7
 
2.2%
Other values (68) 158
49.2%
Distinct5
Distinct (%)23.8%
Missing0
Missing (%)0.0%
Memory size300.0 B
758
13 
0
754
757
 
1
561
 
1

Length

Max length3
Median length3
Mean length2.6190476
Min length1

Unique

Unique2 ?
Unique (%)9.5%

Sample

1st row758
2nd row758
3rd row758
4th row758
5th row758

Common Values

ValueCountFrequency (%)
758 13
61.9%
0 4
 
19.0%
754 2
 
9.5%
757 1
 
4.8%
561 1
 
4.8%

Length

2023-12-12T18:54:14.346652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:54:14.514904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
758 13
61.9%
0 4
 
19.0%
754 2
 
9.5%
757 1
 
4.8%
561 1
 
4.8%
Distinct12
Distinct (%)57.1%
Missing0
Missing (%)0.0%
Memory size300.0 B
2023-12-12T18:54:14.719975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length173
Median length35
Mean length23.095238
Min length2

Characters and Unicode

Total characters485
Distinct characters95
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)38.1%

Sample

1st row문자(5) : XXXXX
2nd row텍스트
3rd row문자(10) : XXXXXXXXXX
4th rowYYYYMMDD
5th row숫자
ValueCountFrequency (%)
23
 
20.2%
텍스트 7
 
6.1%
1회당 7
 
6.1%
숫자 4
 
3.5%
정위적 3
 
2.6%
03 2
 
1.8%
other 2
 
1.8%
99 2
 
1.8%
09 2
 
1.8%
08 2
 
1.8%
Other values (50) 60
52.6%
2023-12-12T18:54:15.157501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
93
 
19.2%
| 21
 
4.3%
0 20
 
4.1%
X 15
 
3.1%
e 14
 
2.9%
1 12
 
2.5%
Y 12
 
2.5%
r 11
 
2.3%
a 11
 
2.3%
8
 
1.6%
Other values (85) 268
55.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 137
28.2%
Space Separator 93
19.2%
Lowercase Letter 89
18.4%
Uppercase Letter 66
13.6%
Decimal Number 55
11.3%
Math Symbol 21
 
4.3%
Close Punctuation 9
 
1.9%
Open Punctuation 9
 
1.9%
Other Punctuation 4
 
0.8%
Dash Punctuation 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
5.8%
8
 
5.8%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
Other values (35) 65
47.4%
Lowercase Letter
ValueCountFrequency (%)
e 14
15.7%
r 11
12.4%
a 11
12.4%
i 8
9.0%
l 8
9.0%
t 7
7.9%
n 6
6.7%
d 5
 
5.6%
v 3
 
3.4%
o 3
 
3.4%
Other values (7) 13
14.6%
Uppercase Letter
ValueCountFrequency (%)
X 15
22.7%
Y 12
18.2%
D 7
10.6%
M 7
10.6%
A 4
 
6.1%
S 3
 
4.5%
C 3
 
4.5%
B 3
 
4.5%
F 2
 
3.0%
I 2
 
3.0%
Other values (5) 8
12.1%
Decimal Number
ValueCountFrequency (%)
0 20
36.4%
1 12
21.8%
9 7
 
12.7%
2 3
 
5.5%
5 3
 
5.5%
7 2
 
3.6%
8 2
 
3.6%
3 2
 
3.6%
6 2
 
3.6%
4 2
 
3.6%
Close Punctuation
ValueCountFrequency (%)
] 7
77.8%
) 2
 
22.2%
Open Punctuation
ValueCountFrequency (%)
[ 7
77.8%
( 2
 
22.2%
Space Separator
ValueCountFrequency (%)
93
100.0%
Math Symbol
ValueCountFrequency (%)
| 21
100.0%
Other Punctuation
ValueCountFrequency (%)
: 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 193
39.8%
Latin 155
32.0%
Hangul 137
28.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
5.8%
8
 
5.8%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
Other values (35) 65
47.4%
Latin
ValueCountFrequency (%)
X 15
 
9.7%
e 14
 
9.0%
Y 12
 
7.7%
r 11
 
7.1%
a 11
 
7.1%
i 8
 
5.2%
l 8
 
5.2%
t 7
 
4.5%
D 7
 
4.5%
M 7
 
4.5%
Other values (22) 55
35.5%
Common
ValueCountFrequency (%)
93
48.2%
| 21
 
10.9%
0 20
 
10.4%
1 12
 
6.2%
] 7
 
3.6%
[ 7
 
3.6%
9 7
 
3.6%
: 4
 
2.1%
2 3
 
1.6%
5 3
 
1.6%
Other values (8) 16
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 348
71.8%
Hangul 137
 
28.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
93
26.7%
| 21
 
6.0%
0 20
 
5.7%
X 15
 
4.3%
e 14
 
4.0%
1 12
 
3.4%
Y 12
 
3.4%
r 11
 
3.2%
a 11
 
3.2%
i 8
 
2.3%
Other values (40) 131
37.6%
Hangul
ValueCountFrequency (%)
8
 
5.8%
8
 
5.8%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
7
 
5.1%
Other values (35) 65
47.4%

Interactions

2023-12-12T18:54:09.589257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:54:15.285998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번컬럼ID컬럼명데이터타입컬럼설명컬럼데이터수표시형식
순번1.0001.0001.0000.5191.0000.6570.504
컬럼ID1.0001.0001.0001.0001.0001.0001.000
컬럼명1.0001.0001.0001.0001.0001.0001.000
데이터타입0.5191.0001.0001.0001.0000.0000.833
컬럼설명1.0001.0001.0001.0001.0001.0001.000
컬럼데이터수0.6571.0001.0000.0001.0001.0000.909
표시형식0.5041.0001.0000.8331.0000.9091.000
2023-12-12T18:54:15.409253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
컬럼데이터수데이터타입
컬럼데이터수1.0000.000
데이터타입0.0001.000
2023-12-12T18:54:15.510420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번데이터타입컬럼데이터수
순번1.0000.1340.499
데이터타입0.1341.0000.000
컬럼데이터수0.4990.0001.000

Missing values

2023-12-12T18:54:09.763768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:54:10.018428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번분류ID분류명테이블ID테이블명컬럼ID컬럼명데이터타입컬럼설명컬럼데이터수표시형식
01TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선CENTER_CD센터코드VARCHAR(20)센터코드 (5자리 : XXXXX) / 00030 : 국립암센터 예) 00030758문자(5) : XXXXX
12TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선IRB_APRV_NOIRB승인번호VARCHAR(50)센터별 기준에 따라 생성758텍스트
23TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선PT_SBST_NO환자대체번호VARCHAR(10)개인고유번호(10자리) / 센터별 별도부여 예) RN12345678758문자(10) : XXXXXXXXXX
34TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_STRT_YMD방사선치료시작일자VARCHAR(8)방사선치료 시작일자 / YYYYMMDD 예)20200101758YYYYMMDD
45TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_SEQ방사선치료순번NUMBER(3)방사선치료 시작일자별 순번 / 예) 1758숫자
56TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_SITE_CD방사선치료부위코드VARCHAR(20)방사선치료 부위코드 / 01 Abdomen Partial | 02 Adrenal Gland | 03 Anus | 04 Axilla | 05 Bladder | 06 boost | 07 Brain | 08 Breast | 09 C Spine | 10 Cervix | 99 Other75701 Abdomen Partial | 02 Adrenal Gland | 03 Anus | 04 Axilla | 05 Bladder | 06 boost | 07 Brain | 08 Breast | 09 C Spine | 10 Cervix | 99 Other
67TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_SITE_NM방사선치료부위명VARCHAR(200)방사선치료 부위명 / 예) Brain Whole758텍스트
78TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_GSCN_PRSC_KIND_CD방사선치료위암처방종류코드VARCHAR(20)위암 방사선치료처방 종류코드 / 01: 체외조사 [1회당] 02: 체부 정위적 방사선수술 [1회당] 03: 정위적 방사선 분할치료 [1회당] 04: 전신조사 [1회당] 05: 입체조형치료 [1회당] 06: 양성자 치료 [1회당] 07: 세기변조 방사선치료 [1회당] 08: 밀봉소선원치료 09: 뇌 정위적 방사선수술 99: 기타75401 체외조사 [1회당] | 02 체부 정위적 방사선수술 [1회당] | 03 정위적 방사선 분할치료 [1회당] | 04 전신조사 [1회당] | 05 입체조형치료 [1회당] | 06 양성자 치료 [1회당] | 07 세기변조 방사선치료 [1회당] | 08 밀봉소선원치료 | 09 뇌 정위적 방사선수술 | 99 기타
89TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_GSCN_PRSC_KIND_NM방사선치료위암처방종류명VARCHAR(100)위암 방사선치료처방 종류명 / 예) 체외조사 [1회당]754텍스트
910TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_PRSC_CD방사선치료처방코드VARCHAR(20)방사선치료 처방코드 / 예) R51101758센터내 방사선치료처방코드
순번분류ID분류명테이블ID테이블명컬럼ID컬럼명데이터타입컬럼설명컬럼데이터수표시형식
1112TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_PRSC_NM방사선치료처방명VARCHAR(200)방사선치료 처방명 / 예) 6MV X-RAY 1Port758텍스트
1213TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_PRPS_CD방사선치료목적코드VARCHAR(20)방사선치료 목적코드 / 1 Curative 2 Palliative 9 Other01 Curative | 2 Palliative | 9 Other
1314TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_PRPS_NM방사선치료목적명VARCHAR(200)방사선치료 목적명 / 예) Curative0텍스트
1415TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_PRPS_CD_ETC_CONT방사선치료목적코드기타내용CLOB방사선치료 목적코드가 기타일 경우 기타내용 / free text0Free 텍스트
1516TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_END_YMD방사선치료종료일자VARCHAR(8)방사선치료 종료일자 / YYYYMMDD 예)20200101758YYYYMMDD
1617TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_TM1_CGY방사선치료1회선량NUMBER(5)방사선 치료시 1회 선량 / 정수값 예) 220758숫자
1718TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_TOTL_CGY방사선치료총선량NUMBER(5)방사선 치료 시 총 누적선량 / 정수값 예) 6500758숫자
1819TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_TOTL_TRTM_NT방사선치료총치료횟수NUMBER(5)방사선 치료 총 실시 횟수 / 정수값 예) 18758숫자
1920TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선RDT_SMNT_CONT방사선치료평가내용CLOB방사선치료 평가내용 / free text0Free 텍스트
2021TRTM치료PRE_GSTR_TRTM_RDPRE_위암_치료_방사선CRTN_DT생성일시DATETIME생성일시 DEFAULT current_timestamp()758YYYY-MM-DD HH:MI:SS