Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 308 |
Missing cells | 308 |
Missing cells (%) | 9.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 27.2 KiB |
Average record size in memory | 90.4 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 6 |
Text | 3 |
Unsupported | 1 |
Dataset
Description | 방광암 레지스트리 메타정보( 제공 되어질 데이터 항목, 타입, 사이즈, 항목별건수, 샘플데이터 등)를 제공 |
---|---|
Author | 국립암센터 |
URL | https://www.data.go.kr/data/15048705/fileData.do |
tblId is highly overall correlated with NUM and 3 other fields | High correlation |
gpNm is highly overall correlated with NUM and 3 other fields | High correlation |
gpId is highly overall correlated with NUM and 3 other fields | High correlation |
tblNm is highly overall correlated with NUM and 3 other fields | High correlation |
NUM is highly overall correlated with gpId and 3 other fields | High correlation |
dataType is highly overall correlated with dispFormat | High correlation |
dispFormat is highly overall correlated with dataType | High correlation |
colCnt has 308 (100.0%) missing values | Missing |
NUM has unique values | Unique |
colCnt is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-12 09:20:26.327044 |
---|---|
Analysis finished | 2023-12-12 09:20:27.682713 |
Duration | 1.36 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
NUM
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 308 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 154.5 |
Minimum | 1 |
---|---|
Maximum | 308 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.8 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 16.35 |
Q1 | 77.75 |
median | 154.5 |
Q3 | 231.25 |
95-th percentile | 292.65 |
Maximum | 308 |
Range | 307 |
Interquartile range (IQR) | 153.5 |
Descriptive statistics
Standard deviation | 89.056162 |
---|---|
Coefficient of variation (CV) | 0.57641529 |
Kurtosis | -1.2 |
Mean | 154.5 |
Median Absolute Deviation (MAD) | 77 |
Skewness | 0 |
Sum | 47586 |
Variance | 7931 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.3% |
205 | 1 | 0.3% |
212 | 1 | 0.3% |
211 | 1 | 0.3% |
210 | 1 | 0.3% |
209 | 1 | 0.3% |
208 | 1 | 0.3% |
207 | 1 | 0.3% |
206 | 1 | 0.3% |
204 | 1 | 0.3% |
Other values (298) | 298 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
308 | 1 | |
307 | 1 | |
306 | 1 | |
305 | 1 | |
304 | 1 | |
303 | 1 | |
302 | 1 | |
301 | 1 | |
300 | 1 | |
299 | 1 |
gpId
Categorical
HIGH CORRELATION
 
Distinct | 19 |
---|---|
Distinct (%) | 6.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
BLAD_HLTH | |
---|---|
BLAD_SPR | |
BLAD_CHMO_FLST | |
BLAD_OPRT | |
BLAD_IMNL | |
Other values (14) |
Length
Max length | 14 |
---|---|
Median length | 12 |
Mean length | 10.126623 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | BLAD_TRGT |
---|---|
2nd row | BLAD_TRGT |
3rd row | BLAD_TRGT |
4th row | BLAD_TRGT |
5th row | BLAD_TRGT |
Common Values
Value | Count | Frequency (%) |
BLAD_HLTH | 74 | |
BLAD_SPR | 60 | |
BLAD_CHMO_FLST | 26 | 8.4% |
BLAD_OPRT | 19 | 6.2% |
BLAD_IMNL | 13 | 4.2% |
BLAD_TRGT | 13 | 4.2% |
BLAD_CNDX_BDMS | 12 | 3.9% |
BLAD_CHMO | 12 | 3.9% |
BLAD_EVAL_DEAD | 10 | 3.2% |
BLAD_RTX | 9 | 2.9% |
Other values (9) | 60 |
Length
Value | Count | Frequency (%) |
blad_hlth | 74 | |
blad_spr | 60 | |
blad_chmo_flst | 26 | 8.4% |
blad_oprt | 19 | 6.2% |
blad_imnl | 13 | 4.2% |
blad_trgt | 13 | 4.2% |
blad_cndx_bdms | 12 | 3.9% |
blad_chmo | 12 | 3.9% |
blad_eval_dead | 10 | 3.2% |
blad_itrv_tx | 9 | 2.9% |
Other values (9) | 60 |
gpNm
Categorical
HIGH CORRELATION
 
Distinct | 19 |
---|---|
Distinct (%) | 6.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
환자건강정보 | |
---|---|
외과병리 | |
항암 FlowSheet | |
수술 | |
면역병리검사 | |
Other values (14) |
Length
Max length | 15 |
---|---|
Median length | 12 |
Mean length | 6.775974 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Summary |
---|---|
2nd row | Summary |
3rd row | Summary |
4th row | Summary |
5th row | Summary |
Common Values
Value | Count | Frequency (%) |
환자건강정보 | 74 | |
외과병리 | 60 | |
항암 FlowSheet | 26 | 8.4% |
수술 | 19 | 6.2% |
면역병리검사 | 13 | 4.2% |
Summary | 13 | 4.2% |
진단 및 신체 | 12 | 3.9% |
항암치료 | 12 | 3.9% |
치료평가 및 사망정보 | 10 | 3.2% |
방사선 치료 | 9 | 2.9% |
Other values (9) | 60 |
Length
Value | Count | Frequency (%) |
환자건강정보 | 74 | |
외과병리 | 60 | |
항암 | 26 | 6.1% |
flowsheet | 26 | 6.1% |
및 | 22 | 5.2% |
initial | 20 | 4.7% |
수술 | 19 | 4.4% |
면역병리검사 | 13 | 3.0% |
summary | 13 | 3.0% |
진단 | 12 | 2.8% |
Other values (15) | 142 |
tblId
Categorical
HIGH CORRELATION
 
Distinct | 34 |
---|---|
Distinct (%) | 11.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
BLAD_PE_SPR | |
---|---|
BLAD_PE_CHMO_FLST | |
BLAD_PE_SPR_TURB | |
BLAD_MR_HLTH_5 | 14 |
BLAD_PE_IMNL | 13 |
Other values (29) |
Length
Max length | 17 |
---|---|
Median length | 16 |
Mean length | 13.75 |
Min length | 11 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | BLAD_PT_TRGT |
---|---|
2nd row | BLAD_PT_TRGT |
3rd row | BLAD_PT_TRGT |
4th row | BLAD_PT_TRGT |
5th row | BLAD_PT_TRGT |
Common Values
Value | Count | Frequency (%) |
BLAD_PE_SPR | 36 | 11.7% |
BLAD_PE_CHMO_FLST | 26 | 8.4% |
BLAD_PE_SPR_TURB | 24 | 7.8% |
BLAD_MR_HLTH_5 | 14 | 4.5% |
BLAD_PE_IMNL | 13 | 4.2% |
BLAD_PT_TRGT | 13 | 4.2% |
BLAD_PE_CHMO | 12 | 3.9% |
BLAD_MR_HLTH_6 | 9 | 2.9% |
BLAD_PE_RTX | 9 | 2.9% |
BLAD_PE_BX_INIT | 9 | 2.9% |
Other values (24) | 143 |
Length
Value | Count | Frequency (%) |
blad_pe_spr | 36 | 11.7% |
blad_pe_chmo_flst | 26 | 8.4% |
blad_pe_spr_turb | 24 | 7.8% |
blad_mr_hlth_5 | 14 | 4.5% |
blad_pe_imnl | 13 | 4.2% |
blad_pt_trgt | 13 | 4.2% |
blad_pe_chmo | 12 | 3.9% |
blad_mr_hlth_7 | 9 | 2.9% |
blad_pe_itrv_tx | 9 | 2.9% |
blad_mr_hlth_8 | 9 | 2.9% |
Other values (24) | 143 |
tblNm
Categorical
HIGH CORRELATION
 
Distinct | 34 |
---|---|
Distinct (%) | 11.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
수술 후 결과 | |
---|---|
Flow Sheet | |
수술 후 결과(TURB) | |
과거력 | 14 |
면역병리결과 | 13 |
Other values (29) |
Length
Max length | 13 |
---|---|
Median length | 11 |
Mean length | 7.2564935 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 기본정보 |
---|---|
2nd row | 기본정보 |
3rd row | 기본정보 |
4th row | 기본정보 |
5th row | 기본정보 |
Common Values
Value | Count | Frequency (%) |
수술 후 결과 | 36 | 11.7% |
Flow Sheet | 26 | 8.4% |
수술 후 결과(TURB) | 24 | 7.8% |
과거력 | 14 | 4.5% |
면역병리결과 | 13 | 4.2% |
기본정보 | 13 | 4.2% |
항암치료정보 | 12 | 3.9% |
가족력(부) | 9 | 2.9% |
방사선치료정보 | 9 | 2.9% |
Initial 병리결과 | 9 | 2.9% |
Other values (24) | 143 |
Length
Value | Count | Frequency (%) |
수술 | 66 | 12.4% |
후 | 66 | 12.4% |
결과 | 43 | 8.1% |
flow | 26 | 4.9% |
sheet | 26 | 4.9% |
결과(turb | 24 | 4.5% |
initial | 20 | 3.8% |
과거력 | 14 | 2.6% |
기본정보 | 13 | 2.4% |
면역병리결과 | 13 | 2.4% |
Other values (30) | 220 |
colId
Text
Distinct | 240 |
---|---|
Distinct (%) | 77.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
Length
Max length | 20 |
---|---|
Median length | 16 |
Mean length | 11.896104 |
Min length | 5 |
Characters and Unicode
Total characters | 3664 |
---|---|
Distinct characters | 35 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 200 ? |
---|---|
Unique (%) | 64.9% |
Sample
1st row | PT_SBST_NO |
---|---|
2nd row | SEX_CD |
3rd row | BRTH_YMD |
4th row | FRST_DIAG_CD |
5th row | FRST_DIAG_YMD |
Value | Count | Frequency (%) |
pt_sbst_no | 24 | 7.8% |
chmo_strt_ymd | 4 | 1.3% |
spr_read_ymd | 3 | 1.0% |
spr_acpt_ymd | 3 | 1.0% |
oprt_ymd | 3 | 1.0% |
chmo_end_ymd | 3 | 1.0% |
mrph_type_cmnt | 2 | 0.6% |
ord_ymd | 2 | 0.6% |
cis_yn | 2 | 0.6% |
miex_clsf_nm | 2 | 0.6% |
Other values (230) | 260 |
Most occurring characters
Value | Count | Frequency (%) |
_ | 597 | |
T | 384 | 10.5% |
M | 319 | 8.7% |
N | 284 | 7.8% |
S | 244 | 6.7% |
C | 241 | 6.6% |
D | 177 | 4.8% |
R | 144 | 3.9% |
Y | 143 | 3.9% |
H | 137 | 3.7% |
Other values (25) | 994 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 3020 | |
Connector Punctuation | 597 | 16.3% |
Decimal Number | 47 | 1.3% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
T | 384 | |
M | 319 | 10.6% |
N | 284 | 9.4% |
S | 244 | 8.1% |
C | 241 | 8.0% |
D | 177 | 5.9% |
R | 144 | 4.8% |
Y | 143 | 4.7% |
H | 137 | 4.5% |
A | 112 | 3.7% |
Other values (16) | 835 |
Decimal Number
Value | Count | Frequency (%) |
0 | 13 | |
2 | 11 | |
1 | 10 | |
7 | 5 | 10.6% |
5 | 2 | 4.3% |
3 | 2 | 4.3% |
6 | 2 | 4.3% |
4 | 2 | 4.3% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 597 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 3020 | |
Common | 644 | 17.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
T | 384 | |
M | 319 | 10.6% |
N | 284 | 9.4% |
S | 244 | 8.1% |
C | 241 | 8.0% |
D | 177 | 5.9% |
R | 144 | 4.8% |
Y | 143 | 4.7% |
H | 137 | 4.5% |
A | 112 | 3.7% |
Other values (16) | 835 |
Common
Value | Count | Frequency (%) |
_ | 597 | |
0 | 13 | 2.0% |
2 | 11 | 1.7% |
1 | 10 | 1.6% |
7 | 5 | 0.8% |
5 | 2 | 0.3% |
3 | 2 | 0.3% |
6 | 2 | 0.3% |
4 | 2 | 0.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3664 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
_ | 597 | |
T | 384 | 10.5% |
M | 319 | 8.7% |
N | 284 | 7.8% |
S | 244 | 6.7% |
C | 241 | 6.6% |
D | 177 | 4.8% |
R | 144 | 3.9% |
Y | 143 | 3.9% |
H | 137 | 3.7% |
Other values (25) | 994 |
colNm
Text
Distinct | 225 |
---|---|
Distinct (%) | 73.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
Value | Count | Frequency (%) |
내용 | 72 | 12.6% |
환자대체번호 | 24 | 4.2% |
검사 | 17 | 3.0% |
stage | 14 | 2.5% |
여부 | 9 | 1.6% |
invasion | 9 | 1.6% |
부위 | 7 | 1.2% |
기타 | 7 | 1.2% |
y:유 | 6 | 1.1% |
n:무 | 6 | 1.1% |
Other values (238) | 399 |
Most occurring characters
Value | Count | Frequency (%) |
262 | 9.3% | |
내 | 92 | 3.3% |
용 | 85 | 3.0% |
( | 65 | 2.3% |
) | 65 | 2.3% |
부 | 65 | 2.3% |
a | 64 | 2.3% |
o | 62 | 2.2% |
e | 61 | 2.2% |
i | 56 | 2.0% |
Other values (184) | 1948 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1438 | |
Lowercase Letter | 629 | |
Uppercase Letter | 288 | 10.2% |
Space Separator | 262 | 9.3% |
Open Punctuation | 65 | 2.3% |
Close Punctuation | 65 | 2.3% |
Decimal Number | 48 | 1.7% |
Other Punctuation | 27 | 1.0% |
Dash Punctuation | 3 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
내 | 92 | 6.4% |
용 | 85 | 5.9% |
부 | 65 | 4.5% |
병 | 53 | 3.7% |
력 | 53 | 3.7% |
일 | 52 | 3.6% |
자 | 42 | 2.9% |
여 | 42 | 2.9% |
족 | 38 | 2.6% |
가 | 38 | 2.6% |
Other values (124) | 878 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 64 | 10.2% |
o | 62 | 9.9% |
e | 61 | 9.7% |
i | 56 | 8.9% |
t | 49 | 7.8% |
s | 47 | 7.5% |
n | 47 | 7.5% |
g | 31 | 4.9% |
r | 27 | 4.3% |
l | 25 | 4.0% |
Other values (13) | 160 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 39 | |
T | 28 | 9.7% |
A | 24 | 8.3% |
N | 23 | 8.0% |
I | 21 | 7.3% |
S | 17 | 5.9% |
P | 15 | 5.2% |
L | 14 | 4.9% |
M | 14 | 4.9% |
R | 12 | 4.2% |
Other values (13) | 81 |
Decimal Number
Value | Count | Frequency (%) |
0 | 13 | |
2 | 11 | |
1 | 11 | |
7 | 5 | 10.4% |
6 | 2 | 4.2% |
5 | 2 | 4.2% |
3 | 2 | 4.2% |
4 | 2 | 4.2% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 15 | |
: | 12 |
Space Separator
Value | Count | Frequency (%) |
262 |
Open Punctuation
Value | Count | Frequency (%) |
( | 65 |
Close Punctuation
Value | Count | Frequency (%) |
) | 65 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1438 | |
Latin | 917 | |
Common | 470 | 16.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
내 | 92 | 6.4% |
용 | 85 | 5.9% |
부 | 65 | 4.5% |
병 | 53 | 3.7% |
력 | 53 | 3.7% |
일 | 52 | 3.6% |
자 | 42 | 2.9% |
여 | 42 | 2.9% |
족 | 38 | 2.6% |
가 | 38 | 2.6% |
Other values (124) | 878 |
Latin
Value | Count | Frequency (%) |
a | 64 | 7.0% |
o | 62 | 6.8% |
e | 61 | 6.7% |
i | 56 | 6.1% |
t | 49 | 5.3% |
s | 47 | 5.1% |
n | 47 | 5.1% |
C | 39 | 4.3% |
g | 31 | 3.4% |
T | 28 | 3.1% |
Other values (36) | 433 |
Common
Value | Count | Frequency (%) |
262 | ||
( | 65 | 13.8% |
) | 65 | 13.8% |
/ | 15 | 3.2% |
0 | 13 | 2.8% |
: | 12 | 2.6% |
2 | 11 | 2.3% |
1 | 11 | 2.3% |
7 | 5 | 1.1% |
- | 3 | 0.6% |
Other values (4) | 8 | 1.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1438 | |
ASCII | 1387 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
262 | ||
( | 65 | 4.7% |
) | 65 | 4.7% |
a | 64 | 4.6% |
o | 62 | 4.5% |
e | 61 | 4.4% |
i | 56 | 4.0% |
t | 49 | 3.5% |
s | 47 | 3.4% |
n | 47 | 3.4% |
Other values (50) | 609 |
Hangul
Value | Count | Frequency (%) |
내 | 92 | 6.4% |
용 | 85 | 5.9% |
부 | 65 | 4.5% |
병 | 53 | 3.7% |
력 | 53 | 3.7% |
일 | 52 | 3.6% |
자 | 42 | 2.9% |
여 | 42 | 2.9% |
족 | 38 | 2.6% |
가 | 38 | 2.6% |
Other values (124) | 878 |
dataType
Categorical
HIGH CORRELATION
 
Distinct | 31 |
---|---|
Distinct (%) | 10.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
String(1) | |
---|---|
DATE | |
String(10) | |
String(100) | |
String(200) | |
Other values (26) |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 9.2305195 |
Min length | 4 |
Unique
Unique | 9 ? |
---|---|
Unique (%) | 2.9% |
Sample
1st row | String(10) |
---|---|
2nd row | String(code) |
3rd row | DATE |
4th row | String(code) |
5th row | DATE |
Common Values
Value | Count | Frequency (%) |
String(1) | 56 | |
DATE | 50 | |
String(10) | 37 | |
String(100) | 31 | |
String(200) | 19 | 6.2% |
String(50) | 17 | 5.5% |
String(20) | 12 | 3.9% |
String(400) | 10 | 3.2% |
Integer(code) | 9 | 2.9% |
Integer(4) | 8 | 2.6% |
Other values (21) | 59 |
Length
Value | Count | Frequency (%) |
string(1 | 56 | |
date | 51 | |
string(10 | 37 | |
string(100 | 31 | |
string(200 | 19 | 6.2% |
string(50 | 17 | 5.5% |
string(20 | 12 | 3.9% |
string(400 | 10 | 3.2% |
integer(code | 9 | 2.9% |
integer(4 | 8 | 2.6% |
Other values (20) | 58 |
colDesc
Text
Distinct | 239 |
---|---|
Distinct (%) | 77.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
Value | Count | Frequency (%) |
환자대체번호 | 24 | 4.5% |
내용 | 18 | 3.4% |
stage | 14 | 2.6% |
invasion | 9 | 1.7% |
검체결과 | 8 | 1.5% |
항암치료 | 6 | 1.1% |
기타 | 6 | 1.1% |
명칭 | 6 | 1.1% |
일 | 5 | 0.9% |
세포병리검사 | 5 | 0.9% |
Other values (273) | 429 |
Most occurring characters
Value | Count | Frequency (%) |
222 | 7.6% | |
i | 68 | 2.3% |
o | 66 | 2.3% |
병 | 66 | 2.3% |
) | 65 | 2.2% |
( | 65 | 2.2% |
a | 64 | 2.2% |
부 | 62 | 2.1% |
e | 61 | 2.1% |
자 | 59 | 2.0% |
Other values (194) | 2107 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1554 | |
Lowercase Letter | 660 | |
Uppercase Letter | 261 | 9.0% |
Space Separator | 222 | 7.6% |
Close Punctuation | 65 | 2.2% |
Open Punctuation | 65 | 2.2% |
Decimal Number | 48 | 1.7% |
Other Punctuation | 24 | 0.8% |
Connector Punctuation | 3 | 0.1% |
Dash Punctuation | 3 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
병 | 66 | 4.2% |
부 | 62 | 4.0% |
자 | 59 | 3.8% |
력 | 53 | 3.4% |
일 | 52 | 3.3% |
내 | 44 | 2.8% |
사 | 41 | 2.6% |
가 | 40 | 2.6% |
체 | 39 | 2.5% |
족 | 38 | 2.4% |
Other values (133) | 1060 |
Lowercase Letter
Value | Count | Frequency (%) |
i | 68 | |
o | 66 | 10.0% |
a | 64 | 9.7% |
e | 61 | 9.2% |
t | 52 | 7.9% |
s | 50 | 7.6% |
n | 48 | 7.3% |
r | 28 | 4.2% |
g | 28 | 4.2% |
y | 27 | 4.1% |
Other values (13) | 168 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 28 | 10.7% |
T | 24 | 9.2% |
N | 24 | 9.2% |
A | 18 | 6.9% |
S | 17 | 6.5% |
I | 15 | 5.7% |
L | 15 | 5.7% |
P | 14 | 5.4% |
B | 13 | 5.0% |
R | 12 | 4.6% |
Other values (13) | 81 |
Decimal Number
Value | Count | Frequency (%) |
0 | 13 | |
2 | 11 | |
1 | 11 | |
7 | 5 | 10.4% |
4 | 2 | 4.2% |
3 | 2 | 4.2% |
5 | 2 | 4.2% |
6 | 2 | 4.2% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 14 | |
: | 10 |
Space Separator
Value | Count | Frequency (%) |
222 |
Close Punctuation
Value | Count | Frequency (%) |
) | 65 |
Open Punctuation
Value | Count | Frequency (%) |
( | 65 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 3 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1554 | |
Latin | 921 | |
Common | 430 | 14.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
병 | 66 | 4.2% |
부 | 62 | 4.0% |
자 | 59 | 3.8% |
력 | 53 | 3.4% |
일 | 52 | 3.3% |
내 | 44 | 2.8% |
사 | 41 | 2.6% |
가 | 40 | 2.6% |
체 | 39 | 2.5% |
족 | 38 | 2.4% |
Other values (133) | 1060 |
Latin
Value | Count | Frequency (%) |
i | 68 | 7.4% |
o | 66 | 7.2% |
a | 64 | 6.9% |
e | 61 | 6.6% |
t | 52 | 5.6% |
s | 50 | 5.4% |
n | 48 | 5.2% |
r | 28 | 3.0% |
C | 28 | 3.0% |
g | 28 | 3.0% |
Other values (36) | 428 |
Common
Value | Count | Frequency (%) |
222 | ||
) | 65 | 15.1% |
( | 65 | 15.1% |
/ | 14 | 3.3% |
0 | 13 | 3.0% |
2 | 11 | 2.6% |
1 | 11 | 2.6% |
: | 10 | 2.3% |
7 | 5 | 1.2% |
_ | 3 | 0.7% |
Other values (5) | 11 | 2.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1554 | |
ASCII | 1351 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
222 | 16.4% | |
i | 68 | 5.0% |
o | 66 | 4.9% |
) | 65 | 4.8% |
( | 65 | 4.8% |
a | 64 | 4.7% |
e | 61 | 4.5% |
t | 52 | 3.8% |
s | 50 | 3.7% |
n | 48 | 3.6% |
Other values (51) | 590 |
Hangul
Value | Count | Frequency (%) |
병 | 66 | 4.2% |
부 | 62 | 4.0% |
자 | 59 | 3.8% |
력 | 53 | 3.4% |
일 | 52 | 3.3% |
내 | 44 | 2.8% |
사 | 41 | 2.6% |
가 | 40 | 2.6% |
체 | 39 | 2.5% |
족 | 38 | 2.4% |
Other values (133) | 1060 |
colCnt
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 308 |
---|---|
Missing (%) | 100.0% |
Memory size | 2.8 KiB |
dispFormat
Categorical
HIGH CORRELATION
 
Distinct | 10 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.5 KiB |
텍스트 | |
---|---|
YYYY-MM-DD | |
Y : 유 / N : 무 | |
숫자 | |
RN+비식별숫자(8) | |
Other values (5) |
Length
Max length | 18 |
---|---|
Median length | 15 |
Mean length | 6.6850649 |
Min length | 2 |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 0.6% |
Sample
1st row | RN+비식별숫자(8) |
---|---|
2nd row | M 남 | F 여 |
3rd row | YYYY-MM-DD |
4th row | 원내검사 코드 |
5th row | YYYY-MM-DD |
Common Values
Value | Count | Frequency (%) |
텍스트 | 126 | |
YYYY-MM-DD | 51 | |
Y : 유 / N : 무 | 51 | |
숫자 | 39 | 12.7% |
RN+비식별숫자(8) | 24 | 7.8% |
Free 텍스트 | 10 | 3.2% |
Y : 내부 / N : 외부 | 3 | 1.0% |
원내검사 코드 | 2 | 0.6% |
M 남 | F 여 | 1 | 0.3% |
Y : 유 / N : 무/알수없음 | 1 | 0.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
166 | ||
텍스트 | 136 | |
y | 55 | 8.4% |
n | 55 | 8.4% |
유 | 52 | 8.0% |
yyyy-mm-dd | 51 | 7.8% |
무 | 51 | 7.8% |
숫자 | 39 | 6.0% |
rn+비식별숫자(8 | 24 | 3.7% |
free | 10 | 1.5% |
Other values (9) | 15 | 2.3% |
NUM | gpId | gpNm | tblId | tblNm | dataType | dispFormat | |
---|---|---|---|---|---|---|---|
NUM | 1.000 | 0.960 | 0.960 | 0.992 | 0.992 | 0.748 | 0.704 |
gpId | 0.960 | 1.000 | 1.000 | 1.000 | 1.000 | 0.767 | 0.605 |
gpNm | 0.960 | 1.000 | 1.000 | 1.000 | 1.000 | 0.767 | 0.605 |
tblId | 0.992 | 1.000 | 1.000 | 1.000 | 1.000 | 0.746 | 0.609 |
tblNm | 0.992 | 1.000 | 1.000 | 1.000 | 1.000 | 0.746 | 0.609 |
dataType | 0.748 | 0.767 | 0.767 | 0.746 | 0.746 | 1.000 | 0.953 |
dispFormat | 0.704 | 0.605 | 0.605 | 0.609 | 0.609 | 0.953 | 1.000 |
dataType | tblId | dispFormat | gpNm | gpId | tblNm | |
---|---|---|---|---|---|---|
dataType | 1.000 | 0.242 | 0.718 | 0.304 | 0.304 | 0.242 |
tblId | 0.242 | 1.000 | 0.251 | 0.974 | 0.974 | 1.000 |
dispFormat | 0.718 | 0.251 | 1.000 | 0.272 | 0.272 | 0.251 |
gpNm | 0.304 | 0.974 | 0.272 | 1.000 | 1.000 | 0.974 |
gpId | 0.304 | 0.974 | 0.272 | 1.000 | 1.000 | 0.974 |
tblNm | 0.242 | 1.000 | 0.251 | 0.974 | 0.974 | 1.000 |
NUM | gpId | gpNm | tblId | tblNm | dataType | dispFormat | |
---|---|---|---|---|---|---|---|
NUM | 1.000 | 0.788 | 0.788 | 0.896 | 0.896 | 0.360 | 0.285 |
gpId | 0.788 | 1.000 | 1.000 | 0.974 | 0.974 | 0.304 | 0.272 |
gpNm | 0.788 | 1.000 | 1.000 | 0.974 | 0.974 | 0.304 | 0.272 |
tblId | 0.896 | 0.974 | 0.974 | 1.000 | 1.000 | 0.242 | 0.251 |
tblNm | 0.896 | 0.974 | 0.974 | 1.000 | 1.000 | 0.242 | 0.251 |
dataType | 0.360 | 0.304 | 0.304 | 0.242 | 0.242 | 1.000 | 0.718 |
dispFormat | 0.285 | 0.272 | 0.272 | 0.251 | 0.251 | 0.718 | 1.000 |
NUM | gpId | gpNm | tblId | tblNm | colId | colNm | dataType | colDesc | colCnt | dispFormat | |
---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | BLAD_TRGT | Summary | BLAD_PT_TRGT | 기본정보 | PT_SBST_NO | 환자대체번호 | String(10) | 환자대체번호 | <NA> | RN+비식별숫자(8) |
1 | 2 | BLAD_TRGT | Summary | BLAD_PT_TRGT | 기본정보 | SEX_CD | 성별 코드 | String(code) | 성별코드 | <NA> | M 남 | F 여 |
2 | 3 | BLAD_TRGT | Summary | BLAD_PT_TRGT | 기본정보 | BRTH_YMD | 생년월일 | DATE | 생년월일 | <NA> | YYYY-MM-DD |
3 | 4 | BLAD_TRGT | Summary | BLAD_PT_TRGT | 기본정보 | FRST_DIAG_CD | 최초 진단 코드 | String(code) | 최초진단코드 | <NA> | 원내검사 코드 |
4 | 5 | BLAD_TRGT | Summary | BLAD_PT_TRGT | 기본정보 | FRST_DIAG_YMD | 최초 진단일 | DATE | 최초진단일자 | <NA> | YYYY-MM-DD |
5 | 6 | BLAD_TRGT | Summary | BLAD_PT_TRGT | 기본정보 | FRST_DIAG_NM | 최초 진단명 | String(256) | 최초진단명 | <NA> | 텍스트 |
6 | 7 | BLAD_TRGT | Summary | BLAD_PT_TRGT | 기본정보 | DIAG_ATT_AGE | 진단 시 나이 | Integer(3) | 진단 시 나이 | <NA> | 숫자 |
7 | 8 | BLAD_TRGT | Summary | BLAD_PT_TRGT | 기본정보 | FRMD_YMD | 초진일 | DATE | 초진일자 | <NA> | YYYY-MM-DD |
8 | 9 | BLAD_TRGT | Summary | BLAD_PT_TRGT | 기본정보 | FRST_OPRT_YMD | 최초 수술일 | DATE | 최초 수술일자 | <NA> | YYYY-MM-DD |
9 | 10 | BLAD_TRGT | Summary | BLAD_PT_TRGT | 기본정보 | FRST_OPRT_NM | 최초 수술명 | String(256) | 최초 수술명 | <NA> | 텍스트 |
NUM | gpId | gpNm | tblId | tblNm | colId | colNm | dataType | colDesc | colCnt | dispFormat | |
---|---|---|---|---|---|---|---|---|---|---|---|
298 | 299 | BLAD_CHMO_FLST | 항암 FlowSheet | BLAD_PE_CHMO_FLST | Flow Sheet | FATG_CMNT | FATIGUE 내용 | String(50) | FATIGUE | <NA> | 텍스트 |
299 | 300 | BLAD_CHMO_FLST | 항암 FlowSheet | BLAD_PE_CHMO_FLST | Flow Sheet | NV_CMNT | NV 내용 | String(50) | NV | <NA> | 텍스트 |
300 | 301 | BLAD_CHMO_FLST | 항암 FlowSheet | BLAD_PE_CHMO_FLST | Flow Sheet | CSTP_CMNT | CONSTIPATION 내용 | String(50) | CONSTIPATION | <NA> | 텍스트 |
301 | 302 | BLAD_CHMO_FLST | 항암 FlowSheet | BLAD_PE_CHMO_FLST | Flow Sheet | DIAR_CMNT | DIARRHEA 내용 | String(50) | DIARRHEA | <NA> | 텍스트 |
302 | 303 | BLAD_CHMO_FLST | 항암 FlowSheet | BLAD_PE_CHMO_FLST | Flow Sheet | SKIN_RASH_CMNT | SKINRASH 내용 | String(50) | SKINRASH | <NA> | 텍스트 |
303 | 304 | BLAD_CHMO_FLST | 항암 FlowSheet | BLAD_PE_CHMO_FLST | Flow Sheet | MCST_CMNT | MUCOSITIS 내용 | String(50) | MUCOSITIS | <NA> | 텍스트 |
304 | 305 | BLAD_CHMO_FLST | 항암 FlowSheet | BLAD_PE_CHMO_FLST | Flow Sheet | NURO_PTHY_CMNT | NEUROPATHY 내용 | String(50) | NEUROPATHY | <NA> | 텍스트 |
305 | 306 | BLAD_CHMO_FLST | 항암 FlowSheet | BLAD_PE_CHMO_FLST | Flow Sheet | ECOG_CD | ECOG 코드 | Integer(code) | ECOG 전신상태평가 | <NA> | 숫자 |
306 | 307 | BLAD_CHMO_FLST | 항암 FlowSheet | BLAD_PE_CHMO_FLST | Flow Sheet | WT_VL | 체중 (kg) | Float(5,2) | 체중 | <NA> | 숫자 |
307 | 308 | BLAD_CHMO_FLST | 항암 FlowSheet | BLAD_PE_CHMO_FLST | Flow Sheet | BSA_VL | BSA | Float(10,2) | 체표면적 | <NA> | 숫자 |