Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 269 |
Missing cells | 269 |
Missing cells (%) | 9.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 23.8 KiB |
Average record size in memory | 90.5 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 6 |
Text | 3 |
Unsupported | 1 |
Dataset
Description | 구강암 레지스트리 메타정보( 제공 되어질 데이터 항목, 타입, 사이즈, 항목별건수, 샘플데이터 등)를 제공 |
---|---|
Author | 국립암센터 |
URL | https://www.data.go.kr/data/15048706/fileData.do |
gpNm is highly overall correlated with NUM and 3 other fields | High correlation |
tblNm is highly overall correlated with NUM and 3 other fields | High correlation |
gpId is highly overall correlated with NUM and 3 other fields | High correlation |
tblId is highly overall correlated with NUM and 3 other fields | High correlation |
NUM is highly overall correlated with gpId and 3 other fields | High correlation |
dataType is highly overall correlated with dispFormat | High correlation |
dispFormat is highly overall correlated with dataType | High correlation |
colCnt has 269 (100.0%) missing values | Missing |
NUM has unique values | Unique |
colCnt is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-12 12:53:10.596180 |
---|---|
Analysis finished | 2023-12-12 12:53:11.576081 |
Duration | 0.98 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
NUM
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 269 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 135 |
Minimum | 1 |
---|---|
Maximum | 269 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 14.4 |
Q1 | 68 |
median | 135 |
Q3 | 202 |
95-th percentile | 255.6 |
Maximum | 269 |
Range | 268 |
Interquartile range (IQR) | 134 |
Descriptive statistics
Standard deviation | 77.797815 |
---|---|
Coefficient of variation (CV) | 0.57628011 |
Kurtosis | -1.2 |
Mean | 135 |
Median Absolute Deviation (MAD) | 67 |
Skewness | 0 |
Sum | 36315 |
Variance | 6052.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
186 | 1 | 0.4% |
172 | 1 | 0.4% |
173 | 1 | 0.4% |
174 | 1 | 0.4% |
175 | 1 | 0.4% |
176 | 1 | 0.4% |
177 | 1 | 0.4% |
178 | 1 | 0.4% |
179 | 1 | 0.4% |
Other values (259) | 259 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
269 | 1 | |
268 | 1 | |
267 | 1 | |
266 | 1 | |
265 | 1 | |
264 | 1 | |
263 | 1 | |
262 | 1 | |
261 | 1 | |
260 | 1 |
gpId
Categorical
HIGH CORRELATION
 
Distinct | 17 |
---|---|
Distinct (%) | 6.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 KiB |
ORAL_HLTH | |
---|---|
ORAL_SPR | |
ORAL_CHMO_FLST | |
ORAL_OPRT | |
ORAL_TRGT | |
Other values (12) |
Length
Max length | 14 |
---|---|
Median length | 9 |
Mean length | 10.092937 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | ORAL_TRGT |
---|---|
2nd row | ORAL_TRGT |
3rd row | ORAL_TRGT |
4th row | ORAL_TRGT |
5th row | ORAL_TRGT |
Common Values
Value | Count | Frequency (%) |
ORAL_HLTH | 74 | |
ORAL_SPR | 37 | |
ORAL_CHMO_FLST | 26 | 9.7% |
ORAL_OPRT | 19 | 7.1% |
ORAL_TRGT | 18 | 6.7% |
ORAL_CNDX_BDMS | 12 | 4.5% |
ORAL_CHMO | 11 | 4.1% |
ORAL_RTX | 11 | 4.1% |
ORAL_EVAL_DEAD | 10 | 3.7% |
ORAL_BX | 9 | 3.3% |
Other values (7) | 42 |
Length
Value | Count | Frequency (%) |
oral_hlth | 74 | |
oral_spr | 37 | |
oral_chmo_flst | 26 | 9.7% |
oral_oprt | 19 | 7.1% |
oral_trgt | 18 | 6.7% |
oral_cndx_bdms | 12 | 4.5% |
oral_chmo | 11 | 4.1% |
oral_rtx | 11 | 4.1% |
oral_eval_dead | 10 | 3.7% |
oral_bx | 9 | 3.3% |
Other values (7) | 42 |
gpNm
Categorical
HIGH CORRELATION
 
Distinct | 17 |
---|---|
Distinct (%) | 6.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 KiB |
환자건강정보 | |
---|---|
외과병리 | |
항암 FlowSheet | |
수술 | |
Summary | |
Other values (12) |
Length
Max length | 12 |
---|---|
Median length | 11 |
Mean length | 6.4869888 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Summary |
---|---|
2nd row | Summary |
3rd row | Summary |
4th row | Summary |
5th row | Summary |
Common Values
Value | Count | Frequency (%) |
환자건강정보 | 74 | |
외과병리 | 37 | |
항암 FlowSheet | 26 | 9.7% |
수술 | 19 | 7.1% |
Summary | 18 | 6.7% |
진단 및 신체 | 12 | 4.5% |
항암치료 | 11 | 4.1% |
방사선치료 | 11 | 4.1% |
치료평가 및 사망정보 | 10 | 3.7% |
병리검사 | 9 | 3.3% |
Other values (7) | 42 |
Length
Value | Count | Frequency (%) |
환자건강정보 | 74 | |
외과병리 | 37 | 10.2% |
항암 | 26 | 7.2% |
flowsheet | 26 | 7.2% |
및 | 22 | 6.1% |
수술 | 19 | 5.2% |
summary | 18 | 5.0% |
영상검사 | 14 | 3.9% |
initial | 12 | 3.3% |
진단 | 12 | 3.3% |
Other values (11) | 103 |
tblId
Categorical
HIGH CORRELATION
 
Distinct | 30 |
---|---|
Distinct (%) | 11.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 KiB |
ORAL_PE_SPR | |
---|---|
ORAL_PE_CHMO_FLST | |
ORAL_PT_TRGT | |
ORAL_MR_HLTH_10 | 14 |
ORAL_PE_OPRT | 14 |
Other values (25) |
Length
Max length | 17 |
---|---|
Median length | 15 |
Mean length | 13.386617 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | ORAL_PT_TRGT |
---|---|
2nd row | ORAL_PT_TRGT |
3rd row | ORAL_PT_TRGT |
4th row | ORAL_PT_TRGT |
5th row | ORAL_PT_TRGT |
Common Values
Value | Count | Frequency (%) |
ORAL_PE_SPR | 37 | 13.8% |
ORAL_PE_CHMO_FLST | 26 | 9.7% |
ORAL_PT_TRGT | 18 | 6.7% |
ORAL_MR_HLTH_10 | 14 | 5.2% |
ORAL_PE_OPRT | 14 | 5.2% |
ORAL_PE_RTX | 11 | 4.1% |
ORAL_PE_CHMO | 11 | 4.1% |
ORAL_PE_BX | 9 | 3.3% |
ORAL_MR_HLTH_5 | 9 | 3.3% |
ORAL_MR_HLTH_8 | 9 | 3.3% |
Other values (20) | 111 |
Length
Value | Count | Frequency (%) |
oral_pe_spr | 37 | 13.8% |
oral_pe_chmo_flst | 26 | 9.7% |
oral_pt_trgt | 18 | 6.7% |
oral_mr_hlth_10 | 14 | 5.2% |
oral_pe_oprt | 14 | 5.2% |
oral_pe_rtx | 11 | 4.1% |
oral_pe_chmo | 11 | 4.1% |
oral_pe_bx | 9 | 3.3% |
oral_mr_hlth_5 | 9 | 3.3% |
oral_mr_hlth_8 | 9 | 3.3% |
Other values (20) | 111 |
tblNm
Categorical
HIGH CORRELATION
 
Distinct | 30 |
---|---|
Distinct (%) | 11.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 KiB |
수술 후 결과 | |
---|---|
Flow Sheet | |
기본정보 | |
과거력 | 14 |
수술정보 | 14 |
Other values (25) |
Length
Max length | 11 |
---|---|
Median length | 10 |
Mean length | 6.2342007 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 기본정보 |
---|---|
2nd row | 기본정보 |
3rd row | 기본정보 |
4th row | 기본정보 |
5th row | 기본정보 |
Common Values
Value | Count | Frequency (%) |
수술 후 결과 | 37 | 13.8% |
Flow Sheet | 26 | 9.7% |
기본정보 | 18 | 6.7% |
과거력 | 14 | 5.2% |
수술정보 | 14 | 5.2% |
방사선치료정보 | 11 | 4.1% |
항암치료정보 | 11 | 4.1% |
병리결과 | 9 | 3.3% |
가족력 (부) | 9 | 3.3% |
가족력(형제/자매) | 9 | 3.3% |
Other values (20) | 111 |
Length
Value | Count | Frequency (%) |
결과 | 43 | 10.5% |
수술 | 37 | 9.0% |
후 | 37 | 9.0% |
flow | 26 | 6.3% |
sheet | 26 | 6.3% |
기본정보 | 18 | 4.4% |
과거력 | 14 | 3.4% |
수술정보 | 14 | 3.4% |
영상 | 14 | 3.4% |
initial | 12 | 2.9% |
Other values (25) | 169 |
colId
Text
Distinct | 226 |
---|---|
Distinct (%) | 84.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 KiB |
Length
Max length | 24 |
---|---|
Median length | 17 |
Mean length | 11.933086 |
Min length | 5 |
Characters and Unicode
Total characters | 3210 |
---|---|
Distinct characters | 33 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 204 ? |
---|---|
Unique (%) | 75.8% |
Sample
1st row | PT_SBST_NO |
---|---|
2nd row | SEX_CD |
3rd row | BRTH_YMD |
4th row | FRST_DIAG_YMD |
5th row | FRST_DIAG_CD |
Value | Count | Frequency (%) |
pt_sbst_no | 20 | 7.4% |
oprt_ymd | 3 | 1.1% |
chmo_strt_ymd | 3 | 1.1% |
oprt_nm | 3 | 1.1% |
cexm_rslt_cmnt | 2 | 0.7% |
miex_ymd | 2 | 0.7% |
miex_rslt_cmnt | 2 | 0.7% |
chmo_prps_nm | 2 | 0.7% |
main_smpl_site_cmnt | 2 | 0.7% |
miex_opnn_cmnt | 2 | 0.7% |
Other values (216) | 228 |
Most occurring characters
Value | Count | Frequency (%) |
_ | 528 | |
T | 325 | 10.1% |
M | 285 | 8.9% |
N | 260 | 8.1% |
C | 202 | 6.3% |
S | 201 | 6.3% |
D | 154 | 4.8% |
R | 146 | 4.5% |
H | 124 | 3.9% |
Y | 119 | 3.7% |
Other values (23) | 866 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 2669 | |
Connector Punctuation | 528 | 16.4% |
Decimal Number | 13 | 0.4% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
T | 325 | |
M | 285 | 10.7% |
N | 260 | 9.7% |
C | 202 | 7.6% |
S | 201 | 7.5% |
D | 154 | 5.8% |
R | 146 | 5.5% |
H | 124 | 4.6% |
Y | 119 | 4.5% |
P | 106 | 4.0% |
Other values (16) | 747 |
Decimal Number
Value | Count | Frequency (%) |
3 | 4 | |
2 | 3 | |
1 | 2 | |
6 | 2 | |
5 | 1 | 7.7% |
7 | 1 | 7.7% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 528 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2669 | |
Common | 541 | 16.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
T | 325 | |
M | 285 | 10.7% |
N | 260 | 9.7% |
C | 202 | 7.6% |
S | 201 | 7.5% |
D | 154 | 5.8% |
R | 146 | 5.5% |
H | 124 | 4.6% |
Y | 119 | 4.5% |
P | 106 | 4.0% |
Other values (16) | 747 |
Common
Value | Count | Frequency (%) |
_ | 528 | |
3 | 4 | 0.7% |
2 | 3 | 0.6% |
1 | 2 | 0.4% |
6 | 2 | 0.4% |
5 | 1 | 0.2% |
7 | 1 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3210 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
_ | 528 | |
T | 325 | 10.1% |
M | 285 | 8.9% |
N | 260 | 8.1% |
C | 202 | 6.3% |
S | 201 | 6.3% |
D | 154 | 4.8% |
R | 146 | 4.5% |
H | 124 | 3.9% |
Y | 119 | 3.7% |
Other values (23) | 866 |
colNm
Text
Distinct | 207 |
---|---|
Distinct (%) | 77.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 KiB |
Value | Count | Frequency (%) |
내용 | 57 | 12.1% |
환자대체번호 | 20 | 4.2% |
검사 | 19 | 4.0% |
결과 | 8 | 1.7% |
여부 | 8 | 1.7% |
최초 | 7 | 1.5% |
기타 | 7 | 1.5% |
n:무 | 6 | 1.3% |
y:유 | 6 | 1.3% |
tumor | 6 | 1.3% |
Other values (225) | 329 |
Most occurring characters
Value | Count | Frequency (%) |
204 | 8.9% | |
내 | 71 | 3.1% |
용 | 67 | 2.9% |
부 | 65 | 2.8% |
( | 55 | 2.4% |
) | 55 | 2.4% |
력 | 53 | 2.3% |
병 | 53 | 2.3% |
일 | 42 | 1.8% |
여 | 41 | 1.8% |
Other values (181) | 1584 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1357 | |
Lowercase Letter | 401 | 17.5% |
Space Separator | 204 | 8.9% |
Uppercase Letter | 176 | 7.7% |
Open Punctuation | 55 | 2.4% |
Close Punctuation | 55 | 2.4% |
Other Punctuation | 27 | 1.2% |
Decimal Number | 14 | 0.6% |
Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
내 | 71 | 5.2% |
용 | 67 | 4.9% |
부 | 65 | 4.8% |
력 | 53 | 3.9% |
병 | 53 | 3.9% |
일 | 42 | 3.1% |
여 | 41 | 3.0% |
자 | 39 | 2.9% |
가 | 39 | 2.9% |
족 | 38 | 2.8% |
Other values (123) | 849 |
Lowercase Letter
Value | Count | Frequency (%) |
o | 41 | |
i | 38 | 9.5% |
e | 35 | 8.7% |
a | 34 | 8.5% |
n | 29 | 7.2% |
t | 28 | 7.0% |
r | 27 | 6.7% |
s | 25 | 6.2% |
l | 22 | 5.5% |
m | 20 | 5.0% |
Other values (13) | 102 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 21 | |
N | 19 | |
I | 16 | 9.1% |
C | 14 | 8.0% |
A | 13 | 7.4% |
S | 13 | 7.4% |
P | 10 | 5.7% |
R | 8 | 4.5% |
E | 8 | 4.5% |
H | 7 | 4.0% |
Other values (13) | 47 |
Decimal Number
Value | Count | Frequency (%) |
3 | 4 | |
2 | 3 | |
1 | 3 | |
6 | 2 | |
7 | 1 | 7.1% |
5 | 1 | 7.1% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 15 | |
: | 12 |
Space Separator
Value | Count | Frequency (%) |
204 |
Open Punctuation
Value | Count | Frequency (%) |
( | 55 |
Close Punctuation
Value | Count | Frequency (%) |
) | 55 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1357 | |
Latin | 577 | |
Common | 356 | 15.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
내 | 71 | 5.2% |
용 | 67 | 4.9% |
부 | 65 | 4.8% |
력 | 53 | 3.9% |
병 | 53 | 3.9% |
일 | 42 | 3.1% |
여 | 41 | 3.0% |
자 | 39 | 2.9% |
가 | 39 | 2.9% |
족 | 38 | 2.8% |
Other values (123) | 849 |
Latin
Value | Count | Frequency (%) |
o | 41 | 7.1% |
i | 38 | 6.6% |
e | 35 | 6.1% |
a | 34 | 5.9% |
n | 29 | 5.0% |
t | 28 | 4.9% |
r | 27 | 4.7% |
s | 25 | 4.3% |
l | 22 | 3.8% |
T | 21 | 3.6% |
Other values (36) | 277 |
Common
Value | Count | Frequency (%) |
204 | ||
( | 55 | 15.4% |
) | 55 | 15.4% |
/ | 15 | 4.2% |
: | 12 | 3.4% |
3 | 4 | 1.1% |
2 | 3 | 0.8% |
1 | 3 | 0.8% |
6 | 2 | 0.6% |
- | 1 | 0.3% |
Other values (2) | 2 | 0.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1357 | |
ASCII | 933 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
204 | ||
( | 55 | 5.9% |
) | 55 | 5.9% |
o | 41 | 4.4% |
i | 38 | 4.1% |
e | 35 | 3.8% |
a | 34 | 3.6% |
n | 29 | 3.1% |
t | 28 | 3.0% |
r | 27 | 2.9% |
Other values (48) | 387 |
Hangul
Value | Count | Frequency (%) |
내 | 71 | 5.2% |
용 | 67 | 4.9% |
부 | 65 | 4.8% |
력 | 53 | 3.9% |
병 | 53 | 3.9% |
일 | 42 | 3.1% |
여 | 41 | 3.0% |
자 | 39 | 2.9% |
가 | 39 | 2.9% |
족 | 38 | 2.8% |
Other values (123) | 849 |
dataType
Categorical
HIGH CORRELATION
 
Distinct | 27 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 KiB |
String(1) | |
---|---|
DATE | |
String(100) | |
String(10) | |
String(200) | |
Other values (22) |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 9.3085502 |
Min length | 4 |
Unique
Unique | 8 ? |
---|---|
Unique (%) | 3.0% |
Sample
1st row | String(10) |
---|---|
2nd row | String(code) |
3rd row | DATE |
4th row | DATE |
5th row | String(code) |
Common Values
Value | Count | Frequency (%) |
String(1) | 54 | |
DATE | 42 | |
String(100) | 33 | |
String(10) | 27 | |
String(200) | 20 | 7.4% |
String(50) | 17 | 6.3% |
String(256) | 10 | 3.7% |
Integer(code) | 9 | 3.3% |
String(4000) | 8 | 3.0% |
String(20) | 8 | 3.0% |
Other values (17) | 41 |
Length
Value | Count | Frequency (%) |
string(1 | 54 | |
date | 42 | |
string(100 | 33 | |
string(10 | 27 | |
string(200 | 20 | 7.4% |
string(50 | 17 | 6.3% |
string(256 | 10 | 3.7% |
integer(code | 9 | 3.3% |
integer(3 | 9 | 3.3% |
string(4000 | 8 | 3.0% |
Other values (16) | 40 |
colDesc
Text
Distinct | 225 |
---|---|
Distinct (%) | 83.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 KiB |
Value | Count | Frequency (%) |
환자대체번호 | 20 | 4.5% |
내용 | 15 | 3.4% |
일 | 7 | 1.6% |
명칭 | 6 | 1.4% |
항암화학요법치료 | 6 | 1.4% |
tumor | 6 | 1.4% |
기타 | 6 | 1.4% |
수술 | 5 | 1.1% |
방사선치료 | 5 | 1.1% |
y:유 | 4 | 0.9% |
Other values (264) | 361 |
Most occurring characters
Value | Count | Frequency (%) |
172 | 7.2% | |
부 | 65 | 2.7% |
병 | 64 | 2.7% |
자 | 56 | 2.3% |
력 | 53 | 2.2% |
( | 52 | 2.2% |
) | 52 | 2.2% |
i | 48 | 2.0% |
사 | 42 | 1.7% |
o | 42 | 1.7% |
Other values (190) | 1755 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1492 | |
Lowercase Letter | 414 | 17.2% |
Uppercase Letter | 179 | 7.5% |
Space Separator | 172 | 7.2% |
Open Punctuation | 52 | 2.2% |
Close Punctuation | 52 | 2.2% |
Other Punctuation | 24 | 1.0% |
Decimal Number | 14 | 0.6% |
Connector Punctuation | 1 | < 0.1% |
Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
부 | 65 | 4.4% |
병 | 64 | 4.3% |
자 | 56 | 3.8% |
력 | 53 | 3.6% |
사 | 42 | 2.8% |
일 | 42 | 2.8% |
가 | 41 | 2.7% |
여 | 40 | 2.7% |
내 | 39 | 2.6% |
족 | 38 | 2.5% |
Other values (132) | 1012 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 20 | 11.2% |
T | 18 | 10.1% |
C | 15 | 8.4% |
I | 13 | 7.3% |
A | 12 | 6.7% |
S | 12 | 6.7% |
L | 11 | 6.1% |
B | 9 | 5.0% |
P | 9 | 5.0% |
Y | 8 | 4.5% |
Other values (13) | 52 |
Lowercase Letter
Value | Count | Frequency (%) |
i | 48 | |
o | 42 | |
e | 36 | 8.7% |
a | 35 | 8.5% |
n | 31 | 7.5% |
t | 29 | 7.0% |
r | 28 | 6.8% |
s | 27 | 6.5% |
l | 21 | 5.1% |
c | 19 | 4.6% |
Other values (12) | 98 |
Decimal Number
Value | Count | Frequency (%) |
3 | 4 | |
2 | 3 | |
1 | 3 | |
6 | 2 | |
5 | 1 | 7.1% |
7 | 1 | 7.1% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 14 | |
: | 10 |
Space Separator
Value | Count | Frequency (%) |
172 |
Open Punctuation
Value | Count | Frequency (%) |
( | 52 |
Close Punctuation
Value | Count | Frequency (%) |
) | 52 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1492 | |
Latin | 593 | 24.7% |
Common | 316 | 13.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
부 | 65 | 4.4% |
병 | 64 | 4.3% |
자 | 56 | 3.8% |
력 | 53 | 3.6% |
사 | 42 | 2.8% |
일 | 42 | 2.8% |
가 | 41 | 2.7% |
여 | 40 | 2.7% |
내 | 39 | 2.6% |
족 | 38 | 2.5% |
Other values (132) | 1012 |
Latin
Value | Count | Frequency (%) |
i | 48 | 8.1% |
o | 42 | 7.1% |
e | 36 | 6.1% |
a | 35 | 5.9% |
n | 31 | 5.2% |
t | 29 | 4.9% |
r | 28 | 4.7% |
s | 27 | 4.6% |
l | 21 | 3.5% |
N | 20 | 3.4% |
Other values (35) | 276 |
Common
Value | Count | Frequency (%) |
172 | ||
( | 52 | 16.5% |
) | 52 | 16.5% |
/ | 14 | 4.4% |
: | 10 | 3.2% |
3 | 4 | 1.3% |
2 | 3 | 0.9% |
1 | 3 | 0.9% |
6 | 2 | 0.6% |
5 | 1 | 0.3% |
Other values (3) | 3 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1492 | |
ASCII | 909 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
172 | ||
( | 52 | 5.7% |
) | 52 | 5.7% |
i | 48 | 5.3% |
o | 42 | 4.6% |
e | 36 | 4.0% |
a | 35 | 3.9% |
n | 31 | 3.4% |
t | 29 | 3.2% |
r | 28 | 3.1% |
Other values (48) | 384 |
Hangul
Value | Count | Frequency (%) |
부 | 65 | 4.4% |
병 | 64 | 4.3% |
자 | 56 | 3.8% |
력 | 53 | 3.6% |
사 | 42 | 2.8% |
일 | 42 | 2.8% |
가 | 41 | 2.7% |
여 | 40 | 2.7% |
내 | 39 | 2.6% |
족 | 38 | 2.5% |
Other values (132) | 1012 |
colCnt
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 269 |
---|---|
Missing (%) | 100.0% |
Memory size | 2.5 KiB |
dispFormat
Categorical
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 KiB |
텍스트 | |
---|---|
Y : 유 / N : 무 | |
YYYY-MM-DD | |
숫자 | |
RN+비식별숫자(8) | |
Other values (4) |
Length
Max length | 15 |
---|---|
Median length | 13 |
Mean length | 6.802974 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.4% |
Sample
1st row | RN+비식별숫자(8) |
---|---|
2nd row | M 남 | F 여 |
3rd row | YYYY-MM-DD |
4th row | YYYY-MM-DD |
5th row | 원내검사 코드 |
Common Values
Value | Count | Frequency (%) |
텍스트 | 107 | |
Y : 유 / N : 무 | 51 | |
YYYY-MM-DD | 41 | 15.2% |
숫자 | 34 | 12.6% |
RN+비식별숫자(8) | 20 | 7.4% |
Free 텍스트 | 10 | 3.7% |
Y : 내부 / N : 외부 | 3 | 1.1% |
원내검사 코드 | 2 | 0.7% |
M 남 | F 여 | 1 | 0.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
163 | ||
텍스트 | 117 | |
y | 54 | 8.9% |
n | 54 | 8.9% |
유 | 51 | 8.4% |
무 | 51 | 8.4% |
yyyy-mm-dd | 41 | 6.7% |
숫자 | 34 | 5.6% |
rn+비식별숫자(8 | 20 | 3.3% |
free | 10 | 1.6% |
Other values (8) | 14 | 2.3% |
NUM | gpId | gpNm | tblId | tblNm | dataType | dispFormat | |
---|---|---|---|---|---|---|---|
NUM | 1.000 | 0.948 | 0.948 | 0.997 | 0.997 | 0.749 | 0.564 |
gpId | 0.948 | 1.000 | 1.000 | 1.000 | 1.000 | 0.752 | 0.610 |
gpNm | 0.948 | 1.000 | 1.000 | 1.000 | 1.000 | 0.752 | 0.610 |
tblId | 0.997 | 1.000 | 1.000 | 1.000 | 1.000 | 0.710 | 0.639 |
tblNm | 0.997 | 1.000 | 1.000 | 1.000 | 1.000 | 0.710 | 0.639 |
dataType | 0.749 | 0.752 | 0.752 | 0.710 | 0.710 | 1.000 | 0.985 |
dispFormat | 0.564 | 0.610 | 0.610 | 0.639 | 0.639 | 0.985 | 1.000 |
dispFormat | gpNm | tblNm | dataType | gpId | tblId | |
---|---|---|---|---|---|---|
dispFormat | 1.000 | 0.291 | 0.284 | 0.785 | 0.291 | 0.284 |
gpNm | 0.291 | 1.000 | 0.974 | 0.311 | 1.000 | 0.974 |
tblNm | 0.284 | 0.974 | 1.000 | 0.234 | 0.974 | 1.000 |
dataType | 0.785 | 0.311 | 0.234 | 1.000 | 0.311 | 0.234 |
gpId | 0.291 | 1.000 | 0.974 | 0.311 | 1.000 | 0.974 |
tblId | 0.284 | 0.974 | 1.000 | 0.234 | 0.974 | 1.000 |
NUM | gpId | gpNm | tblId | tblNm | dataType | dispFormat | |
---|---|---|---|---|---|---|---|
NUM | 1.000 | 0.767 | 0.767 | 0.875 | 0.875 | 0.371 | 0.297 |
gpId | 0.767 | 1.000 | 1.000 | 0.974 | 0.974 | 0.311 | 0.291 |
gpNm | 0.767 | 1.000 | 1.000 | 0.974 | 0.974 | 0.311 | 0.291 |
tblId | 0.875 | 0.974 | 0.974 | 1.000 | 1.000 | 0.234 | 0.284 |
tblNm | 0.875 | 0.974 | 0.974 | 1.000 | 1.000 | 0.234 | 0.284 |
dataType | 0.371 | 0.311 | 0.311 | 0.234 | 0.234 | 1.000 | 0.785 |
dispFormat | 0.297 | 0.291 | 0.291 | 0.284 | 0.284 | 0.785 | 1.000 |
NUM | gpId | gpNm | tblId | tblNm | colId | colNm | dataType | colDesc | colCnt | dispFormat | |
---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | ORAL_TRGT | Summary | ORAL_PT_TRGT | 기본정보 | PT_SBST_NO | 환자대체번호 | String(10) | 환자대체번호 | <NA> | RN+비식별숫자(8) |
1 | 2 | ORAL_TRGT | Summary | ORAL_PT_TRGT | 기본정보 | SEX_CD | 성별 코드 | String(code) | 성별코드 | <NA> | M 남 | F 여 |
2 | 3 | ORAL_TRGT | Summary | ORAL_PT_TRGT | 기본정보 | BRTH_YMD | 생년월일 | DATE | 생년월일 | <NA> | YYYY-MM-DD |
3 | 4 | ORAL_TRGT | Summary | ORAL_PT_TRGT | 기본정보 | FRST_DIAG_YMD | 최초 진단일 | DATE | 최초진단일자 | <NA> | YYYY-MM-DD |
4 | 5 | ORAL_TRGT | Summary | ORAL_PT_TRGT | 기본정보 | FRST_DIAG_CD | 최초 진단 코드 | String(code) | 최초진단코드 | <NA> | 원내검사 코드 |
5 | 6 | ORAL_TRGT | Summary | ORAL_PT_TRGT | 기본정보 | FRST_DIAG_NM | 최초 진단명 | String(256) | 최초진단명 | <NA> | 텍스트 |
6 | 7 | ORAL_TRGT | Summary | ORAL_PT_TRGT | 기본정보 | DIAG_ATT_AGE | 진단 시 나이 | Integer(3) | 진단 시 나이 | <NA> | 숫자 |
7 | 8 | ORAL_TRGT | Summary | ORAL_PT_TRGT | 기본정보 | FRMD_YMD | 초진일 | DATE | 초진일자 | <NA> | YYYY-MM-DD |
8 | 9 | ORAL_TRGT | Summary | ORAL_PT_TRGT | 기본정보 | FRMD_DEPT_NM | 초진 부서명 | String(20) | 초진 부서 | <NA> | 텍스트 |
9 | 10 | ORAL_TRGT | Summary | ORAL_PT_TRGT | 기본정보 | SPRA_FRMD_YMD | 희귀암클리닉 초진일 | DATE | 희귀암클리닉 초진일 | <NA> | YYYY-MM-DD |
NUM | gpId | gpNm | tblId | tblNm | colId | colNm | dataType | colDesc | colCnt | dispFormat | |
---|---|---|---|---|---|---|---|---|---|---|---|
259 | 260 | ORAL_CHMO_FLST | 항암 FlowSheet | ORAL_PE_CHMO_FLST | Flow Sheet | FATG_CMNT | FATIGUE 내용 | String(50) | FATIGUE | <NA> | 텍스트 |
260 | 261 | ORAL_CHMO_FLST | 항암 FlowSheet | ORAL_PE_CHMO_FLST | Flow Sheet | NV_CMNT | NV 내용 | String(50) | NV | <NA> | 텍스트 |
261 | 262 | ORAL_CHMO_FLST | 항암 FlowSheet | ORAL_PE_CHMO_FLST | Flow Sheet | CSTP_CMNT | CONSTIPATION 내용 | String(50) | CONSTIPATION | <NA> | 텍스트 |
262 | 263 | ORAL_CHMO_FLST | 항암 FlowSheet | ORAL_PE_CHMO_FLST | Flow Sheet | DIAR_CMNT | DIARRHEA 내용 | String(50) | DIARRHEA | <NA> | 텍스트 |
263 | 264 | ORAL_CHMO_FLST | 항암 FlowSheet | ORAL_PE_CHMO_FLST | Flow Sheet | SKIN_RASH_CMNT | SKINRASH 내용 | String(50) | SKINRASH | <NA> | 텍스트 |
264 | 265 | ORAL_CHMO_FLST | 항암 FlowSheet | ORAL_PE_CHMO_FLST | Flow Sheet | MCST_CMNT | MUCOSITIS 내용 | String(50) | MUCOSITIS | <NA> | 텍스트 |
265 | 266 | ORAL_CHMO_FLST | 항암 FlowSheet | ORAL_PE_CHMO_FLST | Flow Sheet | NURO_PTHY_CMNT | NEUROPATHY 내용 | String(50) | NEUROPATHY | <NA> | 텍스트 |
266 | 267 | ORAL_CHMO_FLST | 항암 FlowSheet | ORAL_PE_CHMO_FLST | Flow Sheet | ECOG_CD | ECOG 코드 | Integer(code) | ECOG 전신상태평가 | <NA> | 숫자 |
267 | 268 | ORAL_CHMO_FLST | 항암 FlowSheet | ORAL_PE_CHMO_FLST | Flow Sheet | WT_VL | 체중 (kg) | Float(5,2) | 체중 | <NA> | 숫자 |
268 | 269 | ORAL_CHMO_FLST | 항암 FlowSheet | ORAL_PE_CHMO_FLST | Flow Sheet | BSA_VL | BSA | Float(10,2) | 체표면적 | <NA> | 숫자 |