Dataset statistics
Number of variables | 11 |
---|---|
Number of observations | 613 |
Missing cells | 613 |
Missing cells (%) | 9.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 54.0 KiB |
Average record size in memory | 90.2 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 4 |
Text | 5 |
Unsupported | 1 |
Dataset
Description | 자궁암 레지스트리 메타정보( 제공 되어질 데이터 항목, 타입, 사이즈, 항목별건수, 샘플데이터 등)를 제공 |
---|---|
Author | 국립암센터 |
URL | https://www.data.go.kr/data/15048702/fileData.do |
gpNm is highly overall correlated with NUM and 1 other fields | High correlation |
gpId is highly overall correlated with NUM and 1 other fields | High correlation |
NUM is highly overall correlated with gpId and 1 other fields | High correlation |
dataType is highly overall correlated with dispFormat | High correlation |
dispFormat is highly overall correlated with dataType | High correlation |
colCnt has 613 (100.0%) missing values | Missing |
NUM has unique values | Unique |
colCnt is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-04-19 06:23:20.886571 |
---|---|
Analysis finished | 2024-04-19 06:23:21.958002 |
Duration | 1.07 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
NUM
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 613 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 307 |
Minimum | 1 |
---|---|
Maximum | 613 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 5.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 31.6 |
Q1 | 154 |
median | 307 |
Q3 | 460 |
95-th percentile | 582.4 |
Maximum | 613 |
Range | 612 |
Interquartile range (IQR) | 306 |
Descriptive statistics
Standard deviation | 177.10214 |
---|---|
Coefficient of variation (CV) | 0.57687992 |
Kurtosis | -1.2 |
Mean | 307 |
Median Absolute Deviation (MAD) | 153 |
Skewness | 0 |
Sum | 188191 |
Variance | 31365.167 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.2% |
413 | 1 | 0.2% |
406 | 1 | 0.2% |
407 | 1 | 0.2% |
408 | 1 | 0.2% |
409 | 1 | 0.2% |
410 | 1 | 0.2% |
411 | 1 | 0.2% |
412 | 1 | 0.2% |
414 | 1 | 0.2% |
Other values (603) | 603 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
613 | 1 | |
612 | 1 | |
611 | 1 | |
610 | 1 | |
609 | 1 | |
608 | 1 | |
607 | 1 | |
606 | 1 | |
605 | 1 | |
604 | 1 |
gpId
Categorical
HIGH CORRELATION
 
Distinct | 24 |
---|---|
Distinct (%) | 3.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
UTRN_OPRT_STOC | |
---|---|
UTRN_HLTH | |
UTRN_OPRT | |
UTRN_SPR | |
UTRN_CHMO_FLST | |
Other values (19) |
Length
Max length | 14 |
---|---|
Median length | 10 |
Mean length | 11.231648 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | UTRN_TRGT |
---|---|
2nd row | UTRN_TRGT |
3rd row | UTRN_TRGT |
4th row | UTRN_TRGT |
5th row | UTRN_TRGT |
Common Values
Value | Count | Frequency (%) |
UTRN_OPRT_STOC | 210 | |
UTRN_HLTH | 93 | |
UTRN_OPRT | 85 | |
UTRN_SPR | 45 | 7.3% |
UTRN_CHMO_FLST | 26 | 4.2% |
UTRN_TRGT | 16 | 2.6% |
UTRN_IMNL | 13 | 2.1% |
UTRN_CNDX_BDMS | 12 | 2.0% |
UTRN_CHMO | 12 | 2.0% |
UTRN_RTX | 11 | 1.8% |
Other values (14) | 90 |
Length
Value | Count | Frequency (%) |
utrn_oprt_stoc | 210 | |
utrn_hlth | 93 | |
utrn_oprt | 85 | |
utrn_spr | 45 | 7.3% |
utrn_chmo_flst | 26 | 4.2% |
utrn_trgt | 16 | 2.6% |
utrn_imnl | 13 | 2.1% |
utrn_cndx_bdms | 12 | 2.0% |
utrn_chmo | 12 | 2.0% |
utrn_rtx | 11 | 1.8% |
Other values (14) | 90 |
gpNm
Categorical
HIGH CORRELATION
 
Distinct | 24 |
---|---|
Distinct (%) | 3.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
수술정보(SOTOC) | |
---|---|
환자건강정보 | |
수술 | |
외과병리 | |
항암 FlowSheet | |
Other values (19) |
Length
Max length | 12 |
---|---|
Median length | 11 |
Mean length | 7.4241436 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Summary |
---|---|
2nd row | Summary |
3rd row | Summary |
4th row | Summary |
5th row | Summary |
Common Values
Value | Count | Frequency (%) |
수술정보(SOTOC) | 210 | |
환자건강정보 | 93 | |
수술 | 85 | |
외과병리 | 45 | 7.3% |
항암 FlowSheet | 26 | 4.2% |
Summary | 16 | 2.6% |
면역병리검사 | 13 | 2.1% |
진단 및 신체 | 12 | 2.0% |
항암치료 | 12 | 2.0% |
방사선치료 | 11 | 1.8% |
Other values (14) | 90 |
Length
Value | Count | Frequency (%) |
수술정보(sotoc | 210 | |
환자건강정보 | 93 | |
수술 | 85 | |
외과병리 | 45 | 6.4% |
항암 | 26 | 3.7% |
flowsheet | 26 | 3.7% |
및 | 22 | 3.1% |
summary | 16 | 2.3% |
영상검사 | 14 | 2.0% |
면역병리검사 | 13 | 1.8% |
Other values (18) | 157 |
tblId
Text
Distinct | 62 |
---|---|
Distinct (%) | 10.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
Length
Max length | 20 |
---|---|
Median length | 18 |
Mean length | 15.513866 |
Min length | 10 |
Characters and Unicode
Total characters | 9510 |
---|---|
Distinct characters | 32 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | UTRN_PT_TRGT |
---|---|
2nd row | UTRN_PT_TRGT |
3rd row | UTRN_PT_TRGT |
4th row | UTRN_PT_TRGT |
5th row | UTRN_PT_TRGT |
Value | Count | Frequency (%) |
utrn_pe_oprt | 74 | 12.1% |
utrn_pe_spr | 45 | 7.3% |
utrn_pe_chmo_flst | 26 | 4.2% |
utrn_pe_oprt_stoc_5 | 24 | 3.9% |
utrn_pe_oprt_stoc_9 | 23 | 3.8% |
utrn_pe_oprt_stoc_6 | 23 | 3.8% |
utrn_mr_hlth_2 | 19 | 3.1% |
utrn_pe_oprt_stoc_3 | 17 | 2.8% |
utrn_pt_trgt | 16 | 2.6% |
utrn_pe_oprt_stoc_4 | 15 | 2.4% |
Other values (52) | 331 |
Most occurring characters
Value | Count | Frequency (%) |
_ | 1828 | |
T | 1334 | |
R | 1090 | |
P | 861 | |
N | 695 | 7.3% |
U | 625 | 6.6% |
E | 587 | 6.2% |
O | 537 | 5.6% |
S | 310 | 3.3% |
C | 287 | 3.0% |
Other values (22) | 1356 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 7238 | |
Connector Punctuation | 1828 | 19.2% |
Decimal Number | 444 | 4.7% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
T | 1334 | |
R | 1090 | |
P | 861 | |
N | 695 | |
U | 625 | |
E | 587 | |
O | 537 | |
S | 310 | 4.3% |
C | 287 | 4.0% |
H | 244 | 3.4% |
Other values (11) | 668 |
Decimal Number
Value | Count | Frequency (%) |
1 | 107 | |
2 | 91 | |
6 | 44 | |
9 | 36 | 8.1% |
5 | 36 | 8.1% |
3 | 34 | 7.7% |
4 | 34 | 7.7% |
8 | 28 | 6.3% |
7 | 23 | 5.2% |
0 | 11 | 2.5% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1828 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 7238 | |
Common | 2272 | 23.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
T | 1334 | |
R | 1090 | |
P | 861 | |
N | 695 | |
U | 625 | |
E | 587 | |
O | 537 | |
S | 310 | 4.3% |
C | 287 | 4.0% |
H | 244 | 3.4% |
Other values (11) | 668 |
Common
Value | Count | Frequency (%) |
_ | 1828 | |
1 | 107 | 4.7% |
2 | 91 | 4.0% |
6 | 44 | 1.9% |
9 | 36 | 1.6% |
5 | 36 | 1.6% |
3 | 34 | 1.5% |
4 | 34 | 1.5% |
8 | 28 | 1.2% |
7 | 23 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 9510 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
_ | 1828 | |
T | 1334 | |
R | 1090 | |
P | 861 | |
N | 695 | 7.3% |
U | 625 | 6.6% |
E | 587 | 6.2% |
O | 537 | 5.6% |
S | 310 | 3.3% |
C | 287 | 3.0% |
Other values (22) | 1356 |
tblNm
Text
Distinct | 62 |
---|---|
Distinct (%) | 10.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
Value | Count | Frequency (%) |
결과 | 79 | 7.7% |
수술정보 | 74 | 7.2% |
ln | 71 | 6.9% |
66 | 6.4% | |
수술 | 45 | 4.4% |
후 | 45 | 4.4% |
sheet | 26 | 2.5% |
flow | 26 | 2.5% |
ruq | 24 | 2.3% |
pelvis | 23 | 2.2% |
Other values (71) | 552 |
Most occurring characters
Value | Count | Frequency (%) |
466 | 9.4% | |
a | 213 | 4.3% |
L | 188 | 3.8% |
l | 180 | 3.6% |
i | 174 | 3.5% |
e | 173 | 3.5% |
정 | 158 | 3.2% |
r | 157 | 3.2% |
보 | 153 | 3.1% |
t | 148 | 3.0% |
Other values (105) | 2960 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1761 | |
Other Letter | 1569 | |
Uppercase Letter | 804 | |
Space Separator | 466 | 9.4% |
Dash Punctuation | 102 | 2.1% |
Decimal Number | 88 | 1.8% |
Other Punctuation | 68 | 1.4% |
Open Punctuation | 56 | 1.1% |
Close Punctuation | 56 | 1.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
정 | 158 | 10.1% |
보 | 153 | 9.8% |
과 | 147 | 9.4% |
수 | 130 | 8.3% |
술 | 124 | 7.9% |
결 | 114 | 7.3% |
력 | 65 | 4.1% |
후 | 45 | 2.9% |
가 | 41 | 2.6% |
족 | 38 | 2.4% |
Other values (52) | 554 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 213 | |
l | 180 | |
i | 174 | |
e | 173 | |
r | 157 | |
t | 148 | |
o | 131 | 7.4% |
n | 96 | 5.5% |
c | 73 | 4.1% |
s | 72 | 4.1% |
Other values (12) | 344 |
Uppercase Letter
Value | Count | Frequency (%) |
L | 188 | |
N | 125 | |
P | 66 | 8.2% |
S | 60 | 7.5% |
U | 53 | 6.6% |
C | 48 | 6.0% |
Q | 41 | 5.1% |
F | 38 | 4.7% |
R | 36 | 4.5% |
O | 33 | 4.1% |
Other values (8) | 116 |
Decimal Number
Value | Count | Frequency (%) |
2 | 29 | |
1 | 15 | |
6 | 12 | |
8 | 10 | 11.4% |
5 | 6 | 6.8% |
4 | 6 | 6.8% |
3 | 6 | 6.8% |
7 | 4 | 4.5% |
Space Separator
Value | Count | Frequency (%) |
466 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 102 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 68 |
Open Punctuation
Value | Count | Frequency (%) |
( | 56 |
Close Punctuation
Value | Count | Frequency (%) |
) | 56 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2565 | |
Hangul | 1569 | |
Common | 836 | 16.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
정 | 158 | 10.1% |
보 | 153 | 9.8% |
과 | 147 | 9.4% |
수 | 130 | 8.3% |
술 | 124 | 7.9% |
결 | 114 | 7.3% |
력 | 65 | 4.1% |
후 | 45 | 2.9% |
가 | 41 | 2.6% |
족 | 38 | 2.4% |
Other values (52) | 554 |
Latin
Value | Count | Frequency (%) |
a | 213 | 8.3% |
L | 188 | 7.3% |
l | 180 | 7.0% |
i | 174 | 6.8% |
e | 173 | 6.7% |
r | 157 | 6.1% |
t | 148 | 5.8% |
o | 131 | 5.1% |
N | 125 | 4.9% |
n | 96 | 3.7% |
Other values (30) | 980 |
Common
Value | Count | Frequency (%) |
466 | ||
- | 102 | 12.2% |
/ | 68 | 8.1% |
( | 56 | 6.7% |
) | 56 | 6.7% |
2 | 29 | 3.5% |
1 | 15 | 1.8% |
6 | 12 | 1.4% |
8 | 10 | 1.2% |
5 | 6 | 0.7% |
Other values (3) | 16 | 1.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3401 | |
Hangul | 1569 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
466 | 13.7% | |
a | 213 | 6.3% |
L | 188 | 5.5% |
l | 180 | 5.3% |
i | 174 | 5.1% |
e | 173 | 5.1% |
r | 157 | 4.6% |
t | 148 | 4.4% |
o | 131 | 3.9% |
N | 125 | 3.7% |
Other values (43) | 1446 |
Hangul
Value | Count | Frequency (%) |
정 | 158 | 10.1% |
보 | 153 | 9.8% |
과 | 147 | 9.4% |
수 | 130 | 8.3% |
술 | 124 | 7.9% |
결 | 114 | 7.3% |
력 | 65 | 4.1% |
후 | 45 | 2.9% |
가 | 41 | 2.6% |
족 | 38 | 2.4% |
Other values (52) | 554 |
colId
Text
Distinct | 539 |
---|---|
Distinct (%) | 87.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
Length
Max length | 31 |
---|---|
Median length | 22 |
Mean length | 14.176183 |
Min length | 5 |
Characters and Unicode
Total characters | 8690 |
---|---|
Distinct characters | 35 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 508 ? |
---|---|
Unique (%) | 82.9% |
Sample
1st row | PT_SBST_NO |
---|---|
2nd row | SEX_CD |
3rd row | BRTH_YMD |
4th row | FRST_DIAG_YMD |
5th row | FRST_DIAG_CD |
Value | Count | Frequency (%) |
pt_sbst_no | 28 | 4.6% |
exam_ymd | 6 | 1.0% |
exam_nm | 6 | 1.0% |
exam_yn | 6 | 1.0% |
oprt_ymd | 5 | 0.8% |
chmo_strt_ymd | 3 | 0.5% |
oprt_nm | 3 | 0.5% |
gene_muta_cd | 2 | 0.3% |
chmo_prps_nm | 2 | 0.3% |
cexm_nm | 2 | 0.3% |
Other values (529) | 550 |
Most occurring characters
Value | Count | Frequency (%) |
_ | 1562 | |
N | 761 | 8.8% |
T | 737 | 8.5% |
M | 558 | 6.4% |
L | 556 | 6.4% |
S | 481 | 5.5% |
C | 453 | 5.2% |
R | 431 | 5.0% |
P | 380 | 4.4% |
E | 370 | 4.3% |
Other values (25) | 2401 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 7067 | |
Connector Punctuation | 1562 | 18.0% |
Decimal Number | 61 | 0.7% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
N | 761 | 10.8% |
T | 737 | 10.4% |
M | 558 | 7.9% |
L | 556 | 7.9% |
S | 481 | 6.8% |
C | 453 | 6.4% |
R | 431 | 6.1% |
P | 380 | 5.4% |
E | 370 | 5.2% |
Y | 345 | 4.9% |
Other values (16) | 1995 |
Decimal Number
Value | Count | Frequency (%) |
2 | 14 | |
1 | 10 | |
3 | 9 | |
6 | 8 | |
4 | 6 | |
5 | 6 | |
7 | 4 | 6.6% |
8 | 4 | 6.6% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1562 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 7067 | |
Common | 1623 | 18.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
N | 761 | 10.8% |
T | 737 | 10.4% |
M | 558 | 7.9% |
L | 556 | 7.9% |
S | 481 | 6.8% |
C | 453 | 6.4% |
R | 431 | 6.1% |
P | 380 | 5.4% |
E | 370 | 5.2% |
Y | 345 | 4.9% |
Other values (16) | 1995 |
Common
Value | Count | Frequency (%) |
_ | 1562 | |
2 | 14 | 0.9% |
1 | 10 | 0.6% |
3 | 9 | 0.6% |
6 | 8 | 0.5% |
4 | 6 | 0.4% |
5 | 6 | 0.4% |
7 | 4 | 0.2% |
8 | 4 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8690 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
_ | 1562 | |
N | 761 | 8.8% |
T | 737 | 8.5% |
M | 558 | 6.4% |
L | 556 | 6.4% |
S | 481 | 5.5% |
C | 453 | 5.2% |
R | 431 | 5.0% |
P | 380 | 4.4% |
E | 370 | 4.3% |
Other values (25) | 2401 |
colNm
Text
Distinct | 519 |
---|---|
Distinct (%) | 84.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
Value | Count | Frequency (%) |
여부 | 190 | 10.1% |
내용 | 135 | 7.2% |
57 | 3.0% | |
size | 43 | 2.3% |
right | 36 | 1.9% |
left | 34 | 1.8% |
lnd | 33 | 1.8% |
other | 32 | 1.7% |
환자대체번호 | 28 | 1.5% |
lns | 27 | 1.4% |
Other values (444) | 1267 |
Most occurring characters
Value | Count | Frequency (%) |
1270 | 12.1% | |
e | 616 | 5.9% |
i | 510 | 4.8% |
t | 503 | 4.8% |
a | 453 | 4.3% |
o | 451 | 4.3% |
r | 448 | 4.3% |
c | 270 | 2.6% |
l | 265 | 2.5% |
s | 260 | 2.5% |
Other values (203) | 5470 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 5307 | |
Other Letter | 2111 | 20.1% |
Uppercase Letter | 1348 | 12.8% |
Space Separator | 1270 | 12.1% |
Close Punctuation | 114 | 1.1% |
Open Punctuation | 114 | 1.1% |
Dash Punctuation | 105 | 1.0% |
Other Punctuation | 83 | 0.8% |
Decimal Number | 61 | 0.6% |
Math Symbol | 3 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
부 | 248 | 11.7% |
여 | 226 | 10.7% |
내 | 150 | 7.1% |
용 | 146 | 6.9% |
사 | 58 | 2.7% |
일 | 54 | 2.6% |
병 | 53 | 2.5% |
력 | 53 | 2.5% |
자 | 50 | 2.4% |
검 | 49 | 2.3% |
Other values (139) | 1024 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 616 | |
i | 510 | 9.6% |
t | 503 | 9.5% |
a | 453 | 8.5% |
o | 451 | 8.5% |
r | 448 | 8.4% |
c | 270 | 5.1% |
l | 265 | 5.0% |
s | 260 | 4.9% |
n | 256 | 4.8% |
Other values (14) | 1275 |
Uppercase Letter
Value | Count | Frequency (%) |
L | 253 | |
P | 161 | |
N | 146 | |
S | 130 | |
R | 91 | 6.8% |
O | 89 | 6.6% |
C | 67 | 5.0% |
A | 54 | 4.0% |
D | 53 | 3.9% |
U | 48 | 3.6% |
Other values (14) | 256 |
Decimal Number
Value | Count | Frequency (%) |
2 | 13 | |
1 | 10 | |
3 | 9 | |
6 | 8 | |
5 | 7 | |
4 | 6 | |
7 | 4 | 6.6% |
8 | 4 | 6.6% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 58 | |
% | 13 | 15.7% |
: | 12 | 14.5% |
Space Separator
Value | Count | Frequency (%) |
1270 |
Close Punctuation
Value | Count | Frequency (%) |
) | 114 |
Open Punctuation
Value | Count | Frequency (%) |
( | 114 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 105 |
Math Symbol
Value | Count | Frequency (%) |
+ | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 6655 | |
Hangul | 2111 | 20.1% |
Common | 1750 | 16.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
부 | 248 | 11.7% |
여 | 226 | 10.7% |
내 | 150 | 7.1% |
용 | 146 | 6.9% |
사 | 58 | 2.7% |
일 | 54 | 2.6% |
병 | 53 | 2.5% |
력 | 53 | 2.5% |
자 | 50 | 2.4% |
검 | 49 | 2.3% |
Other values (139) | 1024 |
Latin
Value | Count | Frequency (%) |
e | 616 | 9.3% |
i | 510 | 7.7% |
t | 503 | 7.6% |
a | 453 | 6.8% |
o | 451 | 6.8% |
r | 448 | 6.7% |
c | 270 | 4.1% |
l | 265 | 4.0% |
s | 260 | 3.9% |
n | 256 | 3.8% |
Other values (38) | 2623 |
Common
Value | Count | Frequency (%) |
1270 | ||
) | 114 | 6.5% |
( | 114 | 6.5% |
- | 105 | 6.0% |
/ | 58 | 3.3% |
2 | 13 | 0.7% |
% | 13 | 0.7% |
: | 12 | 0.7% |
1 | 10 | 0.6% |
3 | 9 | 0.5% |
Other values (6) | 32 | 1.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8405 | |
Hangul | 2111 | 20.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1270 | 15.1% | |
e | 616 | 7.3% |
i | 510 | 6.1% |
t | 503 | 6.0% |
a | 453 | 5.4% |
o | 451 | 5.4% |
r | 448 | 5.3% |
c | 270 | 3.2% |
l | 265 | 3.2% |
s | 260 | 3.1% |
Other values (54) | 3359 |
Hangul
Value | Count | Frequency (%) |
부 | 248 | 11.7% |
여 | 226 | 10.7% |
내 | 150 | 7.1% |
용 | 146 | 6.9% |
사 | 58 | 2.7% |
일 | 54 | 2.6% |
병 | 53 | 2.5% |
력 | 53 | 2.5% |
자 | 50 | 2.4% |
검 | 49 | 2.3% |
Other values (139) | 1024 |
dataType
Categorical
HIGH CORRELATION
 
Distinct | 29 |
---|---|
Distinct (%) | 4.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
String(1) | |
---|---|
String(50) | |
String(100) | |
DATE | |
String(10) | |
Other values (24) |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 9.4159869 |
Min length | 4 |
Unique
Unique | 6 ? |
---|---|
Unique (%) | 1.0% |
Sample
1st row | String(10) |
---|---|
2nd row | String(code) |
3rd row | DATE |
4th row | DATE |
5th row | String(code) |
Common Values
Value | Count | Frequency (%) |
String(1) | 238 | |
String(50) | 76 | 12.4% |
String(100) | 62 | 10.1% |
DATE | 49 | 8.0% |
String(10) | 40 | 6.5% |
String(200) | 28 | 4.6% |
String(20) | 24 | 3.9% |
Integer(code) | 15 | 2.4% |
String(4000) | 11 | 1.8% |
String(256) | 10 | 1.6% |
Other values (19) | 60 | 9.8% |
Length
Value | Count | Frequency (%) |
string(1 | 238 | |
string(50 | 76 | 12.4% |
string(100 | 62 | 10.1% |
date | 49 | 8.0% |
string(10 | 40 | 6.5% |
string(200 | 28 | 4.6% |
string(20 | 24 | 3.9% |
integer(code | 15 | 2.4% |
string(4000 | 11 | 1.8% |
string(256 | 10 | 1.6% |
Other values (19) | 60 | 9.8% |
colDesc
Text
Distinct | 537 |
---|---|
Distinct (%) | 87.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
Value | Count | Frequency (%) |
여부 | 102 | 5.8% |
내용 | 90 | 5.1% |
59 | 3.3% | |
size | 43 | 2.4% |
right | 36 | 2.0% |
left | 34 | 1.9% |
lnd | 33 | 1.9% |
other | 32 | 1.8% |
환자대체번호 | 28 | 1.6% |
lns | 27 | 1.5% |
Other values (490) | 1278 |
Most occurring characters
Value | Count | Frequency (%) |
1151 | 11.1% | |
e | 611 | 5.9% |
i | 515 | 5.0% |
t | 499 | 4.8% |
o | 451 | 4.4% |
a | 449 | 4.3% |
r | 443 | 4.3% |
c | 276 | 2.7% |
L | 262 | 2.5% |
l | 260 | 2.5% |
Other values (212) | 5429 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 5283 | |
Other Letter | 2083 | 20.1% |
Uppercase Letter | 1375 | 13.3% |
Space Separator | 1151 | 11.1% |
Open Punctuation | 107 | 1.0% |
Close Punctuation | 107 | 1.0% |
Dash Punctuation | 104 | 1.0% |
Other Punctuation | 67 | 0.6% |
Decimal Number | 62 | 0.6% |
Connector Punctuation | 4 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
부 | 167 | 8.0% |
여 | 144 | 6.9% |
내 | 114 | 5.5% |
용 | 110 | 5.3% |
자 | 76 | 3.6% |
병 | 65 | 3.1% |
사 | 64 | 3.1% |
력 | 53 | 2.5% |
일 | 53 | 2.5% |
검 | 51 | 2.4% |
Other values (148) | 1186 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 611 | |
i | 515 | |
t | 499 | 9.4% |
o | 451 | 8.5% |
a | 449 | 8.5% |
r | 443 | 8.4% |
c | 276 | 5.2% |
l | 260 | 4.9% |
n | 259 | 4.9% |
s | 258 | 4.9% |
Other values (14) | 1262 |
Uppercase Letter
Value | Count | Frequency (%) |
L | 262 | |
P | 161 | |
N | 150 | |
S | 131 | |
O | 91 | 6.6% |
R | 89 | 6.5% |
C | 69 | 5.0% |
D | 54 | 3.9% |
A | 53 | 3.9% |
U | 48 | 3.5% |
Other values (14) | 267 |
Decimal Number
Value | Count | Frequency (%) |
2 | 13 | |
1 | 11 | |
3 | 9 | |
6 | 8 | |
5 | 7 | |
4 | 6 | |
8 | 4 | 6.5% |
7 | 4 | 6.5% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 57 | |
: | 10 | 14.9% |
Space Separator
Value | Count | Frequency (%) |
1151 |
Open Punctuation
Value | Count | Frequency (%) |
( | 107 |
Close Punctuation
Value | Count | Frequency (%) |
) | 107 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 104 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 4 |
Math Symbol
Value | Count | Frequency (%) |
+ | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 6658 | |
Hangul | 2083 | 20.1% |
Common | 1605 | 15.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
부 | 167 | 8.0% |
여 | 144 | 6.9% |
내 | 114 | 5.5% |
용 | 110 | 5.3% |
자 | 76 | 3.6% |
병 | 65 | 3.1% |
사 | 64 | 3.1% |
력 | 53 | 2.5% |
일 | 53 | 2.5% |
검 | 51 | 2.4% |
Other values (148) | 1186 |
Latin
Value | Count | Frequency (%) |
e | 611 | 9.2% |
i | 515 | 7.7% |
t | 499 | 7.5% |
o | 451 | 6.8% |
a | 449 | 6.7% |
r | 443 | 6.7% |
c | 276 | 4.1% |
L | 262 | 3.9% |
l | 260 | 3.9% |
n | 259 | 3.9% |
Other values (38) | 2633 |
Common
Value | Count | Frequency (%) |
1151 | ||
( | 107 | 6.7% |
) | 107 | 6.7% |
- | 104 | 6.5% |
/ | 57 | 3.6% |
2 | 13 | 0.8% |
1 | 11 | 0.7% |
: | 10 | 0.6% |
3 | 9 | 0.6% |
6 | 8 | 0.5% |
Other values (6) | 28 | 1.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8263 | |
Hangul | 2083 | 20.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1151 | 13.9% | |
e | 611 | 7.4% |
i | 515 | 6.2% |
t | 499 | 6.0% |
o | 451 | 5.5% |
a | 449 | 5.4% |
r | 443 | 5.4% |
c | 276 | 3.3% |
L | 262 | 3.2% |
l | 260 | 3.1% |
Other values (54) | 3346 |
Hangul
Value | Count | Frequency (%) |
부 | 167 | 8.0% |
여 | 144 | 6.9% |
내 | 114 | 5.5% |
용 | 110 | 5.3% |
자 | 76 | 3.6% |
병 | 65 | 3.1% |
사 | 64 | 3.1% |
력 | 53 | 2.5% |
일 | 53 | 2.5% |
검 | 51 | 2.4% |
Other values (148) | 1186 |
colCnt
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 613 |
---|---|
Missing (%) | 100.0% |
Memory size | 5.5 KiB |
dispFormat
Categorical
HIGH CORRELATION
 
Distinct | 11 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.9 KiB |
텍스트 | |
---|---|
Y : 유 / N : 무 | |
YYYY-MM-DD | |
숫자 | |
RN+비식별숫자(8) | |
Other values (6) | 19 |
Length
Max length | 15 |
---|---|
Median length | 13 |
Mean length | 7.8531811 |
Min length | 2 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 0.5% |
Sample
1st row | RN+비식별숫자(8) |
---|---|
2nd row | M 남 | F 여 |
3rd row | YYYY-MM-DD |
4th row | YYYY-MM-DD |
5th row | 원내검사 코드 |
Common Values
Value | Count | Frequency (%) |
텍스트 | 242 | |
Y : 유 / N : 무 | 234 | |
YYYY-MM-DD | 49 | 8.0% |
숫자 | 41 | 6.7% |
RN+비식별숫자(8) | 28 | 4.6% |
Free 텍스트 | 11 | 1.8% |
Y : 내부 / N : 외부 | 3 | 0.5% |
원내검사 코드 | 2 | 0.3% |
M 남 | F 여 | 1 | 0.2% |
FREE 텍스트 | 1 | 0.2% |
Length
Value | Count | Frequency (%) |
712 | ||
텍스트 | 254 | 12.4% |
y | 237 | 11.5% |
n | 237 | 11.5% |
유 | 234 | 11.4% |
무 | 234 | 11.4% |
yyyy-mm-dd | 49 | 2.4% |
숫자 | 41 | 2.0% |
rn+비식별숫자(8 | 28 | 1.4% |
free | 12 | 0.6% |
Other values (9) | 15 | 0.7% |
NUM | gpId | gpNm | tblId | tblNm | dataType | dispFormat | |
---|---|---|---|---|---|---|---|
NUM | 1.000 | 0.933 | 0.933 | 0.993 | 0.993 | 0.683 | 0.491 |
gpId | 0.933 | 1.000 | 1.000 | 1.000 | 1.000 | 0.773 | 0.664 |
gpNm | 0.933 | 1.000 | 1.000 | 1.000 | 1.000 | 0.773 | 0.664 |
tblId | 0.993 | 1.000 | 1.000 | 1.000 | 1.000 | 0.754 | 0.711 |
tblNm | 0.993 | 1.000 | 1.000 | 1.000 | 1.000 | 0.754 | 0.711 |
dataType | 0.683 | 0.773 | 0.773 | 0.754 | 0.754 | 1.000 | 0.956 |
dispFormat | 0.491 | 0.664 | 0.664 | 0.711 | 0.711 | 0.956 | 1.000 |
dataType | gpNm | gpId | dispFormat | |
---|---|---|---|---|
dataType | 1.000 | 0.295 | 0.295 | 0.743 |
gpNm | 0.295 | 1.000 | 1.000 | 0.299 |
gpId | 0.295 | 1.000 | 1.000 | 0.299 |
dispFormat | 0.743 | 0.299 | 0.299 | 1.000 |
NUM | gpId | gpNm | dataType | dispFormat | |
---|---|---|---|---|---|
NUM | 1.000 | 0.692 | 0.692 | 0.313 | 0.233 |
gpId | 0.692 | 1.000 | 1.000 | 0.295 | 0.299 |
gpNm | 0.692 | 1.000 | 1.000 | 0.295 | 0.299 |
dataType | 0.313 | 0.295 | 0.295 | 1.000 | 0.743 |
dispFormat | 0.233 | 0.299 | 0.299 | 0.743 | 1.000 |
NUM | gpId | gpNm | tblId | tblNm | colId | colNm | dataType | colDesc | colCnt | dispFormat | |
---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | UTRN_TRGT | Summary | UTRN_PT_TRGT | 기본정보 | PT_SBST_NO | 환자대체번호 | String(10) | 환자대체번호 | <NA> | RN+비식별숫자(8) |
1 | 2 | UTRN_TRGT | Summary | UTRN_PT_TRGT | 기본정보 | SEX_CD | 성별 코드 | String(code) | 성별코드 | <NA> | M 남 | F 여 |
2 | 3 | UTRN_TRGT | Summary | UTRN_PT_TRGT | 기본정보 | BRTH_YMD | 생년월일 | DATE | 생년월일 | <NA> | YYYY-MM-DD |
3 | 4 | UTRN_TRGT | Summary | UTRN_PT_TRGT | 기본정보 | FRST_DIAG_YMD | 최초 진단일 | DATE | 최초진단일자 | <NA> | YYYY-MM-DD |
4 | 5 | UTRN_TRGT | Summary | UTRN_PT_TRGT | 기본정보 | FRST_DIAG_CD | 최초 진단 코드 | String(code) | 최초진단코드 | <NA> | 원내검사 코드 |
5 | 6 | UTRN_TRGT | Summary | UTRN_PT_TRGT | 기본정보 | FRST_DIAG_NM | 최초 진단명 | String(256) | 최초진단명 | <NA> | 텍스트 |
6 | 7 | UTRN_TRGT | Summary | UTRN_PT_TRGT | 기본정보 | DIAG_ATT_AGE | 진단 시 나이 | Integer(3) | 진단 시 나이 | <NA> | 숫자 |
7 | 8 | UTRN_TRGT | Summary | UTRN_PT_TRGT | 기본정보 | FRMD_YMD | 초진일 | DATE | 초진일자 | <NA> | YYYY-MM-DD |
8 | 9 | UTRN_TRGT | Summary | UTRN_PT_TRGT | 기본정보 | FRMD_DEPT_NM | 초진 부서명 | String(20) | 초진 부서 | <NA> | 텍스트 |
9 | 10 | UTRN_TRGT | Summary | UTRN_PT_TRGT | 기본정보 | FRST_OPRT_YMD | 최초 수술일 | DATE | 최초 수술일자 | <NA> | YYYY-MM-DD |
NUM | gpId | gpNm | tblId | tblNm | colId | colNm | dataType | colDesc | colCnt | dispFormat | |
---|---|---|---|---|---|---|---|---|---|---|---|
603 | 604 | UTRN_CHMO_FLST | 항암 FlowSheet | UTRN_PE_CHMO_FLST | Flow Sheet | FATG_CMNT | FATIGUE 내용 | String(50) | FATIGUE | <NA> | 텍스트 |
604 | 605 | UTRN_CHMO_FLST | 항암 FlowSheet | UTRN_PE_CHMO_FLST | Flow Sheet | NV_CMNT | NV 내용 | String(50) | NV | <NA> | 텍스트 |
605 | 606 | UTRN_CHMO_FLST | 항암 FlowSheet | UTRN_PE_CHMO_FLST | Flow Sheet | CSTP_CMNT | CONSTIPATION 내용 | String(50) | CONSTIPATION | <NA> | 텍스트 |
606 | 607 | UTRN_CHMO_FLST | 항암 FlowSheet | UTRN_PE_CHMO_FLST | Flow Sheet | DIAR_CMNT | DIARRHEA 내용 | String(50) | DIARRHEA | <NA> | 텍스트 |
607 | 608 | UTRN_CHMO_FLST | 항암 FlowSheet | UTRN_PE_CHMO_FLST | Flow Sheet | SKIN_RASH_CMNT | SKINRASH 내용 | String(50) | SKINRASH | <NA> | 텍스트 |
608 | 609 | UTRN_CHMO_FLST | 항암 FlowSheet | UTRN_PE_CHMO_FLST | Flow Sheet | MCST_CMNT | MUCOSITIS 내용 | String(50) | MUCOSITIS | <NA> | 텍스트 |
609 | 610 | UTRN_CHMO_FLST | 항암 FlowSheet | UTRN_PE_CHMO_FLST | Flow Sheet | NURO_PTHY_CMNT | NEUROPATHY 내용 | String(50) | NEUROPATHY | <NA> | 텍스트 |
610 | 611 | UTRN_CHMO_FLST | 항암 FlowSheet | UTRN_PE_CHMO_FLST | Flow Sheet | ECOG_CD | ECOG 코드 | Integer(code) | ECOG 전신상태평가 | <NA> | 숫자 |
611 | 612 | UTRN_CHMO_FLST | 항암 FlowSheet | UTRN_PE_CHMO_FLST | Flow Sheet | WT_VL | 체중 (kg) | Float(5,2) | 체중 | <NA> | 숫자 |
612 | 613 | UTRN_CHMO_FLST | 항암 FlowSheet | UTRN_PE_CHMO_FLST | Flow Sheet | BSA_VL | BSA | Float(10,2) | 체표면적 | <NA> | 숫자 |