Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 249 |
Missing cells | 249 |
Missing cells (%) | 9.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 22.0 KiB |
Average record size in memory | 90.5 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 6 |
Text | 3 |
Unsupported | 1 |
Dataset
Description | 소아청소년암 레지스트리 메타정보( 제공 되어질 데이터 항목, 타입, 사이즈, 항목별건수, 샘플데이터 등)를 제공 |
---|---|
Author | 국립암센터 |
URL | https://www.data.go.kr/data/15048704/fileData.do |
tblId is highly overall correlated with NUM and 3 other fields | High correlation |
gpNm is highly overall correlated with NUM and 3 other fields | High correlation |
gpId is highly overall correlated with NUM and 3 other fields | High correlation |
tblNm is highly overall correlated with NUM and 3 other fields | High correlation |
NUM is highly overall correlated with gpId and 3 other fields | High correlation |
dataType is highly overall correlated with dispFormat | High correlation |
dispFormat is highly overall correlated with dataType | High correlation |
colCnt has 249 (100.0%) missing values | Missing |
NUM has unique values | Unique |
colCnt is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-12 06:02:37.720349 |
---|---|
Analysis finished | 2023-12-12 06:02:38.605613 |
Duration | 0.89 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
NUM
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 249 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 125 |
Minimum | 1 |
---|---|
Maximum | 249 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 13.4 |
Q1 | 63 |
median | 125 |
Q3 | 187 |
95-th percentile | 236.6 |
Maximum | 249 |
Range | 248 |
Interquartile range (IQR) | 124 |
Descriptive statistics
Standard deviation | 72.024301 |
---|---|
Coefficient of variation (CV) | 0.57619441 |
Kurtosis | -1.2 |
Mean | 125 |
Median Absolute Deviation (MAD) | 62 |
Skewness | 0 |
Sum | 31125 |
Variance | 5187.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
172 | 1 | 0.4% |
159 | 1 | 0.4% |
160 | 1 | 0.4% |
161 | 1 | 0.4% |
162 | 1 | 0.4% |
163 | 1 | 0.4% |
164 | 1 | 0.4% |
165 | 1 | 0.4% |
166 | 1 | 0.4% |
Other values (239) | 239 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
249 | 1 | |
248 | 1 | |
247 | 1 | |
246 | 1 | |
245 | 1 | |
244 | 1 | |
243 | 1 | |
242 | 1 | |
241 | 1 | |
240 | 1 |
gpId
Categorical
HIGH CORRELATION
 
Distinct | 22 |
---|---|
Distinct (%) | 8.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
PDTR_HLTH | |
---|---|
PDTR_CHMO_FLST | |
PDTR_CNDX_BDMS | |
PDTR_TRGT | |
PDTR_EVAL_DEAD | 10 |
Other values (17) |
Length
Max length | 14 |
---|---|
Median length | 9 |
Mean length | 10.64257 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | PDTR_TRGT |
---|---|
2nd row | PDTR_TRGT |
3rd row | PDTR_TRGT |
4th row | PDTR_TRGT |
5th row | PDTR_TRGT |
Common Values
Value | Count | Frequency (%) |
PDTR_HLTH | 76 | |
PDTR_CHMO_FLST | 26 | 10.4% |
PDTR_CNDX_BDMS | 16 | 6.4% |
PDTR_TRGT | 13 | 5.2% |
PDTR_EVAL_DEAD | 10 | 4.0% |
PDTR_MTST_RLPS | 10 | 4.0% |
PDTR_RTX | 10 | 4.0% |
PDTR_CHMO | 9 | 3.6% |
PDTR_BX | 8 | 3.2% |
PDTR_OPRT | 8 | 3.2% |
Other values (12) | 63 |
Length
Value | Count | Frequency (%) |
pdtr_hlth | 76 | |
pdtr_chmo_flst | 26 | 10.4% |
pdtr_cndx_bdms | 16 | 6.4% |
pdtr_trgt | 13 | 5.2% |
pdtr_eval_dead | 10 | 4.0% |
pdtr_mtst_rlps | 10 | 4.0% |
pdtr_rtx | 10 | 4.0% |
pdtr_chmo | 9 | 3.6% |
pdtr_bx | 8 | 3.2% |
pdtr_oprt | 8 | 3.2% |
Other values (12) | 63 |
gpNm
Categorical
HIGH CORRELATION
 
Distinct | 22 |
---|---|
Distinct (%) | 8.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
환자건강정보 | |
---|---|
항암 FlowSheet | |
진단 및 신체 | |
Summary | |
치료평가 및 사망정보 | 10 |
Other values (17) |
Length
Max length | 12 |
---|---|
Median length | 11 |
Mean length | 6.8915663 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Summary |
---|---|
2nd row | Summary |
3rd row | Summary |
4th row | Summary |
5th row | Summary |
Common Values
Value | Count | Frequency (%) |
환자건강정보 | 76 | |
항암 FlowSheet | 26 | 10.4% |
진단 및 신체 | 16 | 6.4% |
Summary | 13 | 5.2% |
치료평가 및 사망정보 | 10 | 4.0% |
전이 및 재발 | 10 | 4.0% |
방사선 치료 | 10 | 4.0% |
항암치료 | 9 | 3.6% |
병리검사 | 8 | 3.2% |
수술 | 8 | 3.2% |
Other values (12) | 63 |
Length
Value | Count | Frequency (%) |
환자건강정보 | 76 | |
및 | 36 | 9.8% |
flowsheet | 26 | 7.1% |
항암 | 26 | 7.1% |
진단 | 16 | 4.3% |
신체 | 16 | 4.3% |
summary | 13 | 3.5% |
initial | 11 | 3.0% |
치료평가 | 10 | 2.7% |
사망정보 | 10 | 2.7% |
Other values (19) | 128 |
tblId
Categorical
HIGH CORRELATION
 
Distinct | 35 |
---|---|
Distinct (%) | 14.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
PDTR_PE_CHMO_FLST | |
---|---|
PDTR_MR_HLTH_10 | 14 |
PDTR_PT_TRGT | 13 |
PDTR_PT_BDMS | 11 |
PDTR_PE_RTX | 10 |
Other values (30) |
Length
Max length | 17 |
---|---|
Median length | 15 |
Mean length | 13.594378 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | PDTR_PT_TRGT |
---|---|
2nd row | PDTR_PT_TRGT |
3rd row | PDTR_PT_TRGT |
4th row | PDTR_PT_TRGT |
5th row | PDTR_PT_TRGT |
Common Values
Value | Count | Frequency (%) |
PDTR_PE_CHMO_FLST | 26 | 10.4% |
PDTR_MR_HLTH_10 | 14 | 5.6% |
PDTR_PT_TRGT | 13 | 5.2% |
PDTR_PT_BDMS | 11 | 4.4% |
PDTR_PE_RTX | 10 | 4.0% |
PDTR_MR_HLTH_8 | 9 | 3.6% |
PDTR_MR_HLTH_7 | 9 | 3.6% |
PDTR_MR_HLTH_6 | 9 | 3.6% |
PDTR_MR_HLTH_5 | 9 | 3.6% |
PDTR_PE_CHMO | 9 | 3.6% |
Other values (25) | 130 |
Length
Value | Count | Frequency (%) |
pdtr_pe_chmo_flst | 26 | 10.4% |
pdtr_mr_hlth_10 | 14 | 5.6% |
pdtr_pt_trgt | 13 | 5.2% |
pdtr_pt_bdms | 11 | 4.4% |
pdtr_pe_rtx | 10 | 4.0% |
pdtr_mr_hlth_8 | 9 | 3.6% |
pdtr_mr_hlth_7 | 9 | 3.6% |
pdtr_mr_hlth_6 | 9 | 3.6% |
pdtr_mr_hlth_5 | 9 | 3.6% |
pdtr_pe_chmo | 9 | 3.6% |
Other values (25) | 130 |
tblNm
Categorical
HIGH CORRELATION
 
Distinct | 35 |
---|---|
Distinct (%) | 14.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
Flow Sheet | |
---|---|
과거력 | 14 |
기본정보 | 13 |
신체계측정보 | 11 |
방사선 치료 정보 | 10 |
Other values (30) |
Length
Max length | 11 |
---|---|
Median length | 9 |
Mean length | 6.3815261 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 기본정보 |
---|---|
2nd row | 기본정보 |
3rd row | 기본정보 |
4th row | 기본정보 |
5th row | 기본정보 |
Common Values
Value | Count | Frequency (%) |
Flow Sheet | 26 | 10.4% |
과거력 | 14 | 5.6% |
기본정보 | 13 | 5.2% |
신체계측정보 | 11 | 4.4% |
방사선 치료 정보 | 10 | 4.0% |
가족력(형제/자매) | 9 | 3.6% |
가족력(자녀) | 9 | 3.6% |
가족력(모) | 9 | 3.6% |
가족력(부) | 9 | 3.6% |
항암치료 정보 | 9 | 3.6% |
Other values (25) | 130 |
Length
Value | Count | Frequency (%) |
flow | 26 | 8.2% |
sheet | 26 | 8.2% |
정보 | 19 | 6.0% |
과거력 | 14 | 4.4% |
기본정보 | 13 | 4.1% |
initial | 11 | 3.5% |
신체계측정보 | 11 | 3.5% |
치료 | 10 | 3.2% |
방사선 | 10 | 3.2% |
가족력(형제/자매 | 9 | 2.8% |
Other values (30) | 168 |
colId
Text
Distinct | 217 |
---|---|
Distinct (%) | 87.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
Length
Max length | 24 |
---|---|
Median length | 18 |
Mean length | 12.040161 |
Min length | 5 |
Characters and Unicode
Total characters | 2998 |
---|---|
Distinct characters | 30 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 209 ? |
---|---|
Unique (%) | 83.9% |
Sample
1st row | PT_SBST_NO |
---|---|
2nd row | SEX_CD |
3rd row | BRTH_YMD |
4th row | FRST_DIAG_CD |
5th row | FRST_DIAG_YMD |
Value | Count | Frequency (%) |
pt_sbst_no | 25 | 10.0% |
chmo_strt_ymd | 3 | 1.2% |
wt_vl | 2 | 0.8% |
rtx_strt_ymd | 2 | 0.8% |
intr_pret_cmnt | 2 | 0.8% |
chmo_end_ymd | 2 | 0.8% |
ecog_cd | 2 | 0.8% |
dead_ymd | 2 | 0.8% |
fmhs_fath_yn | 1 | 0.4% |
fmhs_fath_cncr_yn | 1 | 0.4% |
Other values (207) | 207 |
Most occurring characters
Value | Count | Frequency (%) |
_ | 500 | |
T | 293 | 9.8% |
M | 268 | 8.9% |
N | 232 | 7.7% |
S | 197 | 6.6% |
C | 193 | 6.4% |
D | 155 | 5.2% |
R | 118 | 3.9% |
H | 116 | 3.9% |
Y | 115 | 3.8% |
Other values (20) | 811 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 2495 | |
Connector Punctuation | 500 | 16.7% |
Decimal Number | 3 | 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
T | 293 | |
M | 268 | 10.7% |
N | 232 | 9.3% |
S | 197 | 7.9% |
C | 193 | 7.7% |
D | 155 | 6.2% |
R | 118 | 4.7% |
H | 116 | 4.6% |
Y | 115 | 4.6% |
L | 101 | 4.0% |
Other values (16) | 707 |
Decimal Number
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 500 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2495 | |
Common | 503 | 16.8% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
T | 293 | |
M | 268 | 10.7% |
N | 232 | 9.3% |
S | 197 | 7.9% |
C | 193 | 7.7% |
D | 155 | 6.2% |
R | 118 | 4.7% |
H | 116 | 4.6% |
Y | 115 | 4.6% |
L | 101 | 4.0% |
Other values (16) | 707 |
Common
Value | Count | Frequency (%) |
_ | 500 | |
1 | 1 | 0.2% |
2 | 1 | 0.2% |
3 | 1 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2998 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
_ | 500 | |
T | 293 | 9.8% |
M | 268 | 8.9% |
N | 232 | 7.7% |
S | 197 | 6.6% |
C | 193 | 6.4% |
D | 155 | 5.2% |
R | 118 | 3.9% |
H | 116 | 3.9% |
Y | 115 | 3.8% |
Other values (20) | 811 |
colNm
Text
Distinct | 182 |
---|---|
Distinct (%) | 73.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
Value | Count | Frequency (%) |
내용 | 42 | 10.3% |
환자대체번호 | 25 | 6.1% |
검사 | 22 | 5.4% |
결과 | 11 | 2.7% |
검사명 | 10 | 2.4% |
접수일 | 10 | 2.4% |
여부 | 7 | 1.7% |
n:무 | 6 | 1.5% |
y:유 | 6 | 1.5% |
코드 | 5 | 1.2% |
Other values (186) | 265 |
Most occurring characters
Value | Count | Frequency (%) |
160 | 8.3% | |
내 | 58 | 3.0% |
부 | 56 | 2.9% |
용 | 54 | 2.8% |
력 | 53 | 2.7% |
병 | 53 | 2.7% |
) | 53 | 2.7% |
( | 53 | 2.7% |
일 | 47 | 2.4% |
사 | 46 | 2.4% |
Other values (176) | 1306 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1374 | |
Space Separator | 160 | 8.3% |
Uppercase Letter | 145 | 7.5% |
Lowercase Letter | 123 | 6.3% |
Close Punctuation | 53 | 2.7% |
Open Punctuation | 53 | 2.7% |
Other Punctuation | 27 | 1.4% |
Decimal Number | 4 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
내 | 58 | 4.2% |
부 | 56 | 4.1% |
용 | 54 | 3.9% |
력 | 53 | 3.9% |
병 | 53 | 3.9% |
일 | 47 | 3.4% |
사 | 46 | 3.3% |
자 | 44 | 3.2% |
여 | 40 | 2.9% |
가 | 39 | 2.8% |
Other values (125) | 884 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 17 | |
T | 14 | 9.7% |
I | 13 | 9.0% |
A | 11 | 7.6% |
C | 10 | 6.9% |
S | 9 | 6.2% |
O | 9 | 6.2% |
R | 8 | 5.5% |
E | 8 | 5.5% |
B | 8 | 5.5% |
Other values (12) | 38 |
Lowercase Letter
Value | Count | Frequency (%) |
n | 14 | 11.4% |
i | 12 | 9.8% |
e | 11 | 8.9% |
o | 10 | 8.1% |
s | 9 | 7.3% |
a | 8 | 6.5% |
r | 6 | 4.9% |
g | 6 | 4.9% |
m | 6 | 4.9% |
c | 6 | 4.9% |
Other values (11) | 35 |
Decimal Number
Value | Count | Frequency (%) |
1 | 2 | |
3 | 1 | |
2 | 1 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 15 | |
: | 12 |
Space Separator
Value | Count | Frequency (%) |
160 |
Close Punctuation
Value | Count | Frequency (%) |
) | 53 |
Open Punctuation
Value | Count | Frequency (%) |
( | 53 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1374 | |
Common | 297 | 15.3% |
Latin | 268 | 13.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
내 | 58 | 4.2% |
부 | 56 | 4.1% |
용 | 54 | 3.9% |
력 | 53 | 3.9% |
병 | 53 | 3.9% |
일 | 47 | 3.4% |
사 | 46 | 3.3% |
자 | 44 | 3.2% |
여 | 40 | 2.9% |
가 | 39 | 2.8% |
Other values (125) | 884 |
Latin
Value | Count | Frequency (%) |
N | 17 | 6.3% |
n | 14 | 5.2% |
T | 14 | 5.2% |
I | 13 | 4.9% |
i | 12 | 4.5% |
A | 11 | 4.1% |
e | 11 | 4.1% |
o | 10 | 3.7% |
C | 10 | 3.7% |
s | 9 | 3.4% |
Other values (33) | 147 |
Common
Value | Count | Frequency (%) |
160 | ||
) | 53 | 17.8% |
( | 53 | 17.8% |
/ | 15 | 5.1% |
: | 12 | 4.0% |
1 | 2 | 0.7% |
3 | 1 | 0.3% |
2 | 1 | 0.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1374 | |
ASCII | 565 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
160 | ||
) | 53 | 9.4% |
( | 53 | 9.4% |
N | 17 | 3.0% |
/ | 15 | 2.7% |
n | 14 | 2.5% |
T | 14 | 2.5% |
I | 13 | 2.3% |
i | 12 | 2.1% |
: | 12 | 2.1% |
Other values (41) | 202 |
Hangul
Value | Count | Frequency (%) |
내 | 58 | 4.2% |
부 | 56 | 4.1% |
용 | 54 | 3.9% |
력 | 53 | 3.9% |
병 | 53 | 3.9% |
일 | 47 | 3.4% |
사 | 46 | 3.3% |
자 | 44 | 3.2% |
여 | 40 | 2.9% |
가 | 39 | 2.8% |
Other values (125) | 884 |
dataType
Categorical
HIGH CORRELATION
 
Distinct | 25 |
---|---|
Distinct (%) | 10.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
String(1) | |
---|---|
DATE | |
String(10) | |
String(100) | |
String(50) | |
Other values (20) |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 9.1485944 |
Min length | 4 |
Unique
Unique | 5 ? |
---|---|
Unique (%) | 2.0% |
Sample
1st row | String(10) |
---|---|
2nd row | String(code) |
3rd row | DATE |
4th row | String(code) |
5th row | DATE |
Common Values
Value | Count | Frequency (%) |
String(1) | 52 | |
DATE | 46 | |
String(10) | 31 | |
String(100) | 23 | |
String(50) | 14 | 5.6% |
String(200) | 14 | 5.6% |
String(4000) | 12 | 4.8% |
Integer(code) | 7 | 2.8% |
String(code) | 6 | 2.4% |
Integer(3) | 6 | 2.4% |
Other values (15) | 38 |
Length
Value | Count | Frequency (%) |
string(1 | 52 | |
date | 46 | |
string(10 | 31 | |
string(100 | 23 | |
string(50 | 14 | 5.6% |
string(200 | 14 | 5.6% |
string(4000 | 12 | 4.8% |
integer(code | 7 | 2.8% |
string(code | 6 | 2.4% |
integer(3 | 6 | 2.4% |
Other values (15) | 38 |
colDesc
Text
Distinct | 216 |
---|---|
Distinct (%) | 86.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
Value | Count | Frequency (%) |
환자대체번호 | 25 | 6.4% |
접수일 | 8 | 2.1% |
일 | 7 | 1.8% |
검사명 | 7 | 1.8% |
내용 | 7 | 1.8% |
검사결과 | 5 | 1.3% |
biopsy | 5 | 1.3% |
항암화학요법치료 | 5 | 1.3% |
세포병리검사 | 5 | 1.3% |
원인 | 4 | 1.0% |
Other values (236) | 312 |
Most occurring characters
Value | Count | Frequency (%) |
141 | 6.6% | |
병 | 62 | 2.9% |
자 | 60 | 2.8% |
사 | 58 | 2.7% |
부 | 56 | 2.6% |
력 | 53 | 2.5% |
( | 52 | 2.4% |
) | 52 | 2.4% |
일 | 50 | 2.3% |
검 | 44 | 2.1% |
Other values (194) | 1506 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1556 | |
Uppercase Letter | 168 | 7.9% |
Space Separator | 141 | 6.6% |
Lowercase Letter | 135 | 6.3% |
Open Punctuation | 52 | 2.4% |
Close Punctuation | 52 | 2.4% |
Other Punctuation | 24 | 1.1% |
Decimal Number | 4 | 0.2% |
Connector Punctuation | 2 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
병 | 62 | 4.0% |
자 | 60 | 3.9% |
사 | 58 | 3.7% |
부 | 56 | 3.6% |
력 | 53 | 3.4% |
일 | 50 | 3.2% |
검 | 44 | 2.8% |
가 | 41 | 2.6% |
여 | 39 | 2.5% |
족 | 38 | 2.4% |
Other values (142) | 1055 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 19 | |
T | 16 | 9.5% |
I | 14 | 8.3% |
L | 13 | 7.7% |
C | 13 | 7.7% |
B | 12 | 7.1% |
A | 10 | 6.0% |
O | 10 | 6.0% |
Y | 9 | 5.4% |
E | 9 | 5.4% |
Other values (12) | 43 |
Lowercase Letter
Value | Count | Frequency (%) |
i | 18 | |
n | 15 | |
s | 13 | 9.6% |
o | 13 | 9.6% |
e | 10 | 7.4% |
y | 8 | 5.9% |
a | 8 | 5.9% |
p | 6 | 4.4% |
r | 5 | 3.7% |
d | 5 | 3.7% |
Other values (11) | 34 |
Decimal Number
Value | Count | Frequency (%) |
1 | 2 | |
3 | 1 | |
2 | 1 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 14 | |
: | 10 |
Space Separator
Value | Count | Frequency (%) |
141 |
Open Punctuation
Value | Count | Frequency (%) |
( | 52 |
Close Punctuation
Value | Count | Frequency (%) |
) | 52 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1556 | |
Latin | 303 | 14.2% |
Common | 275 | 12.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
병 | 62 | 4.0% |
자 | 60 | 3.9% |
사 | 58 | 3.7% |
부 | 56 | 3.6% |
력 | 53 | 3.4% |
일 | 50 | 3.2% |
검 | 44 | 2.8% |
가 | 41 | 2.6% |
여 | 39 | 2.5% |
족 | 38 | 2.4% |
Other values (142) | 1055 |
Latin
Value | Count | Frequency (%) |
N | 19 | 6.3% |
i | 18 | 5.9% |
T | 16 | 5.3% |
n | 15 | 5.0% |
I | 14 | 4.6% |
s | 13 | 4.3% |
L | 13 | 4.3% |
o | 13 | 4.3% |
C | 13 | 4.3% |
B | 12 | 4.0% |
Other values (33) | 157 |
Common
Value | Count | Frequency (%) |
141 | ||
( | 52 | 18.9% |
) | 52 | 18.9% |
/ | 14 | 5.1% |
: | 10 | 3.6% |
1 | 2 | 0.7% |
_ | 2 | 0.7% |
3 | 1 | 0.4% |
2 | 1 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1556 | |
ASCII | 578 | 27.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
141 | ||
( | 52 | 9.0% |
) | 52 | 9.0% |
N | 19 | 3.3% |
i | 18 | 3.1% |
T | 16 | 2.8% |
n | 15 | 2.6% |
I | 14 | 2.4% |
/ | 14 | 2.4% |
s | 13 | 2.2% |
Other values (42) | 224 |
Hangul
Value | Count | Frequency (%) |
병 | 62 | 4.0% |
자 | 60 | 3.9% |
사 | 58 | 3.7% |
부 | 56 | 3.6% |
력 | 53 | 3.4% |
일 | 50 | 3.2% |
검 | 44 | 2.8% |
가 | 41 | 2.6% |
여 | 39 | 2.5% |
족 | 38 | 2.4% |
Other values (142) | 1055 |
colCnt
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 249 |
---|---|
Missing (%) | 100.0% |
Memory size | 2.3 KiB |
dispFormat
Categorical
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 3.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.1 KiB |
텍스트 | |
---|---|
Y : 유 / N : 무 | |
YYYY-MM-DD | |
숫자 | |
RN+비식별숫자(8) | |
Other values (4) |
Length
Max length | 15 |
---|---|
Median length | 11 |
Mean length | 7.4417671 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.4% |
Sample
1st row | RN+비식별숫자(8) |
---|---|
2nd row | M 남 | F 여 |
3rd row | YYYY-MM-DD |
4th row | 원내검사 코드 |
5th row | YYYY-MM-DD |
Common Values
Value | Count | Frequency (%) |
텍스트 | 73 | |
Y : 유 / N : 무 | 50 | |
YYYY-MM-DD | 46 | |
숫자 | 34 | |
RN+비식별숫자(8) | 25 | 10.0% |
Free 텍스트 | 16 | 6.4% |
원내검사 코드 | 2 | 0.8% |
Y : 내부 / N : 외부 | 2 | 0.8% |
M 남 | F 여 | 1 | 0.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
157 | ||
텍스트 | 89 | |
y | 52 | 8.9% |
n | 52 | 8.9% |
유 | 50 | 8.6% |
무 | 50 | 8.6% |
yyyy-mm-dd | 46 | 7.9% |
숫자 | 34 | 5.8% |
rn+비식별숫자(8 | 25 | 4.3% |
free | 16 | 2.7% |
Other values (8) | 12 | 2.1% |
NUM | gpId | gpNm | tblId | tblNm | dataType | dispFormat | |
---|---|---|---|---|---|---|---|
NUM | 1.000 | 0.964 | 0.964 | 0.994 | 0.994 | 0.769 | 0.579 |
gpId | 0.964 | 1.000 | 1.000 | 1.000 | 1.000 | 0.760 | 0.579 |
gpNm | 0.964 | 1.000 | 1.000 | 1.000 | 1.000 | 0.760 | 0.579 |
tblId | 0.994 | 1.000 | 1.000 | 1.000 | 1.000 | 0.757 | 0.586 |
tblNm | 0.994 | 1.000 | 1.000 | 1.000 | 1.000 | 0.757 | 0.586 |
dataType | 0.769 | 0.760 | 0.760 | 0.757 | 0.757 | 1.000 | 0.949 |
dispFormat | 0.579 | 0.579 | 0.579 | 0.586 | 0.586 | 0.949 | 1.000 |
dataType | tblId | dispFormat | gpNm | gpId | tblNm | |
---|---|---|---|---|---|---|
dataType | 1.000 | 0.260 | 0.739 | 0.299 | 0.299 | 0.260 |
tblId | 0.260 | 1.000 | 0.241 | 0.971 | 0.971 | 1.000 |
dispFormat | 0.739 | 0.241 | 1.000 | 0.260 | 0.260 | 0.241 |
gpNm | 0.299 | 0.971 | 0.260 | 1.000 | 1.000 | 0.971 |
gpId | 0.299 | 0.971 | 0.260 | 1.000 | 1.000 | 0.971 |
tblNm | 0.260 | 1.000 | 0.241 | 0.971 | 0.971 | 1.000 |
NUM | gpId | gpNm | tblId | tblNm | dataType | dispFormat | |
---|---|---|---|---|---|---|---|
NUM | 1.000 | 0.795 | 0.795 | 0.881 | 0.881 | 0.375 | 0.307 |
gpId | 0.795 | 1.000 | 1.000 | 0.971 | 0.971 | 0.299 | 0.260 |
gpNm | 0.795 | 1.000 | 1.000 | 0.971 | 0.971 | 0.299 | 0.260 |
tblId | 0.881 | 0.971 | 0.971 | 1.000 | 1.000 | 0.260 | 0.241 |
tblNm | 0.881 | 0.971 | 0.971 | 1.000 | 1.000 | 0.260 | 0.241 |
dataType | 0.375 | 0.299 | 0.299 | 0.260 | 0.260 | 1.000 | 0.739 |
dispFormat | 0.307 | 0.260 | 0.260 | 0.241 | 0.241 | 0.739 | 1.000 |
NUM | gpId | gpNm | tblId | tblNm | colId | colNm | dataType | colDesc | colCnt | dispFormat | |
---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | PDTR_TRGT | Summary | PDTR_PT_TRGT | 기본정보 | PT_SBST_NO | 환자대체번호 | String(10) | 환자대체번호 | <NA> | RN+비식별숫자(8) |
1 | 2 | PDTR_TRGT | Summary | PDTR_PT_TRGT | 기본정보 | SEX_CD | 성별 코드 | String(code) | 성별코드 | <NA> | M 남 | F 여 |
2 | 3 | PDTR_TRGT | Summary | PDTR_PT_TRGT | 기본정보 | BRTH_YMD | 생년월일 | DATE | 생년월일 | <NA> | YYYY-MM-DD |
3 | 4 | PDTR_TRGT | Summary | PDTR_PT_TRGT | 기본정보 | FRST_DIAG_CD | 최초 진단 코드 | String(code) | 최초진단코드 | <NA> | 원내검사 코드 |
4 | 5 | PDTR_TRGT | Summary | PDTR_PT_TRGT | 기본정보 | FRST_DIAG_YMD | 최초 진단일 | DATE | 최초진단일자 | <NA> | YYYY-MM-DD |
5 | 6 | PDTR_TRGT | Summary | PDTR_PT_TRGT | 기본정보 | FRST_DIAG_NM | 최초 진단명 | String(256) | 최초진단명 | <NA> | 텍스트 |
6 | 7 | PDTR_TRGT | Summary | PDTR_PT_TRGT | 기본정보 | DIAG_ATT_AGE | 진단 시 나이 | Integer(3) | 진단 시 나이 | <NA> | 숫자 |
7 | 8 | PDTR_TRGT | Summary | PDTR_PT_TRGT | 기본정보 | FRMD_YMD | 초진일 | DATE | 초진일자 | <NA> | YYYY-MM-DD |
8 | 9 | PDTR_TRGT | Summary | PDTR_PT_TRGT | 기본정보 | FRST_OPRT_YMD | 최초 수술일 | DATE | 최초 수술일자 | <NA> | YYYY-MM-DD |
9 | 10 | PDTR_TRGT | Summary | PDTR_PT_TRGT | 기본정보 | FRST_OPRT_NM | 최초 수술명 | String(256) | 최초 수술명 | <NA> | 텍스트 |
NUM | gpId | gpNm | tblId | tblNm | colId | colNm | dataType | colDesc | colCnt | dispFormat | |
---|---|---|---|---|---|---|---|---|---|---|---|
239 | 240 | PDTR_CHMO_FLST | 항암 FlowSheet | PDTR_PE_CHMO_FLST | Flow Sheet | FATG_CMNT | FATIGUE 내용 | String(50) | FATIGUE | <NA> | 텍스트 |
240 | 241 | PDTR_CHMO_FLST | 항암 FlowSheet | PDTR_PE_CHMO_FLST | Flow Sheet | NV_CMNT | NV 내용 | String(50) | NV | <NA> | 텍스트 |
241 | 242 | PDTR_CHMO_FLST | 항암 FlowSheet | PDTR_PE_CHMO_FLST | Flow Sheet | CSTP_CMNT | CONSTIPATION 내용 | String(50) | CONSTIPATION | <NA> | 텍스트 |
242 | 243 | PDTR_CHMO_FLST | 항암 FlowSheet | PDTR_PE_CHMO_FLST | Flow Sheet | DIAR_CMNT | DIARRHEA 내용 | String(50) | DIARRHEA | <NA> | 텍스트 |
243 | 244 | PDTR_CHMO_FLST | 항암 FlowSheet | PDTR_PE_CHMO_FLST | Flow Sheet | SKIN_RASH_CMNT | SKINRASH 내용 | String(50) | SKINRASH | <NA> | 텍스트 |
244 | 245 | PDTR_CHMO_FLST | 항암 FlowSheet | PDTR_PE_CHMO_FLST | Flow Sheet | MCST_CMNT | MUCOSITIS 내용 | String(50) | MUCOSITIS | <NA> | 텍스트 |
245 | 246 | PDTR_CHMO_FLST | 항암 FlowSheet | PDTR_PE_CHMO_FLST | Flow Sheet | NURO_PTHY_CMNT | NEUROPATHY 내용 | String(50) | NEUROPATHY | <NA> | 텍스트 |
246 | 247 | PDTR_CHMO_FLST | 항암 FlowSheet | PDTR_PE_CHMO_FLST | Flow Sheet | ECOG_CD | ECOG 코드 | Integer(code) | ECOG 전신상태평가 | <NA> | 숫자 |
247 | 248 | PDTR_CHMO_FLST | 항암 FlowSheet | PDTR_PE_CHMO_FLST | Flow Sheet | WT_VL | 체중 (kg) | Float(5,2) | 체중 | <NA> | 숫자 |
248 | 249 | PDTR_CHMO_FLST | 항암 FlowSheet | PDTR_PE_CHMO_FLST | Flow Sheet | BSA_VL | BSA | Float(10,2) | 체표면적 | <NA> | 숫자 |