Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 10000 |
Missing cells | 27282 |
Missing cells (%) | 27.3% |
Duplicate rows | 1 |
Duplicate rows (%) | < 0.1% |
Total size in memory | 878.9 KiB |
Average record size in memory | 90.0 B |
Variable types
Text | 3 |
---|---|
Boolean | 2 |
Categorical | 4 |
Numeric | 1 |
Dataset
Description | 한국표준질병·사인분류(KCD)를 기본으로 요양급여비용 청구에 필요한 상병기호 및 상병과 관련한 각종 부가정보를 반영한 상병마스터 파일 / 주상병사용 구분, 완전코드 구분, 성별구분, 법정감염병 구분, 상·하한연령 등 |
---|---|
Author | 건강보험심사평가원 |
URL | https://www.data.go.kr/data/15067467/fileData.do |
완전코드구분 has constant value "" | Constant |
주상병사용구분 has constant value "" | Constant |
Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
성별구분 is highly overall correlated with 하한연령 and 3 other fields | High correlation |
상한연령 is highly overall correlated with 하한연령 and 2 other fields | High correlation |
양한방구분 is highly overall correlated with 하한연령 and 3 other fields | High correlation |
법정감염병구분 is highly overall correlated with 성별구분 and 1 other fields | High correlation |
하한연령 is highly overall correlated with 성별구분 and 2 other fields | High correlation |
법정감염병구분 is highly imbalanced (95.1%) | Imbalance |
성별구분 is highly imbalanced (79.1%) | Imbalance |
상한연령 is highly imbalanced (92.1%) | Imbalance |
양한방구분 is highly imbalanced (94.6%) | Imbalance |
완전코드구분 has 8902 (89.0%) missing values | Missing |
주상병사용구분 has 8762 (87.6%) missing values | Missing |
하한연령 has 9618 (96.2%) missing values | Missing |
Reproduction
Analysis started | 2023-12-12 23:33:58.266687 |
---|---|
Analysis finished | 2023-12-12 23:33:59.710821 |
Duration | 1.44 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
상병기호
Text
Distinct | 7333 |
---|---|
Distinct (%) | 73.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
e1140 | 12 | 0.1% |
m9025 | 11 | 0.1% |
m7797 | 11 | 0.1% |
m0534 | 11 | 0.1% |
m0537 | 11 | 0.1% |
e1142 | 11 | 0.1% |
e1132 | 10 | 0.1% |
e1133 | 10 | 0.1% |
m9021 | 9 | 0.1% |
e1121 | 9 | 0.1% |
Other values (7323) | 9895 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 4807 | |
0 | 4718 | |
8 | 4016 | |
2 | 3783 | |
M | 3592 | 8.1% |
4 | 3003 | 6.8% |
6 | 2981 | 6.7% |
3 | 2897 | 6.5% |
5 | 2771 | 6.2% |
9 | 2713 | 6.1% |
Other values (26) | 9086 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 34367 | |
Uppercase Letter | 10000 | 22.5% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 3592 | |
E | 549 | 5.5% |
S | 433 | 4.3% |
X | 428 | 4.3% |
K | 390 | 3.9% |
T | 331 | 3.3% |
Q | 311 | 3.1% |
Y | 305 | 3.0% |
H | 285 | 2.9% |
F | 255 | 2.5% |
Other values (16) | 3121 |
Decimal Number
Value | Count | Frequency (%) |
1 | 4807 | |
0 | 4718 | |
8 | 4016 | |
2 | 3783 | |
4 | 3003 | |
6 | 2981 | |
3 | 2897 | |
5 | 2771 | |
9 | 2713 | |
7 | 2678 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 34367 | |
Latin | 10000 | 22.5% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 3592 | |
E | 549 | 5.5% |
S | 433 | 4.3% |
X | 428 | 4.3% |
K | 390 | 3.9% |
T | 331 | 3.3% |
Q | 311 | 3.1% |
Y | 305 | 3.0% |
H | 285 | 2.9% |
F | 255 | 2.5% |
Other values (16) | 3121 |
Common
Value | Count | Frequency (%) |
1 | 4807 | |
0 | 4718 | |
8 | 4016 | |
2 | 3783 | |
4 | 3003 | |
6 | 2981 | |
3 | 2897 | |
5 | 2771 | |
9 | 2713 | |
7 | 2678 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 44367 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 4807 | |
0 | 4718 | |
8 | 4016 | |
2 | 3783 | |
M | 3592 | 8.1% |
4 | 3003 | 6.8% |
6 | 2981 | 6.7% |
3 | 2897 | 6.5% |
5 | 2771 | 6.2% |
9 | 2713 | 6.1% |
Other values (26) | 9086 |
한글명
Text
Distinct | 9976 |
---|---|
Distinct (%) | 99.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 156 |
---|---|
Median length | 70 |
Mean length | 18.8539 |
Min length | 2 |
Characters and Unicode
Total characters | 188539 |
---|---|
Distinct characters | 920 |
Distinct categories | 12 ? |
Distinct scripts | 4 ? |
Distinct blocks | 7 ? |
Unique
Unique | 9952 ? |
---|---|
Unique (%) | 99.5% |
Sample
1st row | 눈 및 눈부속기의 상세불명 부분의 부식 |
---|---|
2nd row | (착색) 융모결절성 윤활막염 상세불명 부분 |
3rd row | 사다리에서의 낙상 주택 |
4th row | 어깨-손증후군 손 |
5th row | 횡격막의 마비 |
Value | Count | Frequency (%) |
및 | 2219 | 5.0% |
기타 | 2046 | 4.6% |
상세불명의 | 966 | 2.2% |
의한 | 749 | 1.7% |
동반한 | 738 | 1.7% |
nos | 552 | 1.2% |
또는 | 502 | 1.1% |
상세불명 | 447 | 1.0% |
명시된 | 415 | 0.9% |
달리 | 410 | 0.9% |
Other values (6883) | 35387 |
Most occurring characters
Value | Count | Frequency (%) |
34449 | 18.3% | |
의 | 6663 | 3.5% |
성 | 4064 | 2.2% |
기 | 3112 | 1.7% |
골 | 2945 | 1.6% |
관 | 2830 | 1.5% |
상 | 2691 | 1.4% |
절 | 2573 | 1.4% |
증 | 2551 | 1.4% |
및 | 2221 | 1.2% |
Other values (910) | 124440 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 140915 | |
Space Separator | 34449 | 18.3% |
Decimal Number | 3609 | 1.9% |
Uppercase Letter | 3075 | 1.6% |
Close Punctuation | 2008 | 1.1% |
Open Punctuation | 2008 | 1.1% |
Other Punctuation | 1763 | 0.9% |
Dash Punctuation | 577 | 0.3% |
Math Symbol | 105 | 0.1% |
Lowercase Letter | 16 | < 0.1% |
Other values (2) | 14 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
의 | 6663 | 4.7% |
성 | 4064 | 2.9% |
기 | 3112 | 2.2% |
골 | 2945 | 2.1% |
관 | 2830 | 2.0% |
상 | 2691 | 1.9% |
절 | 2573 | 1.8% |
증 | 2551 | 1.8% |
및 | 2221 | 1.6% |
타 | 2218 | 1.6% |
Other values (850) | 109047 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 688 | |
S | 661 | |
O | 624 | |
A | 251 | 8.2% |
B | 126 | 4.1% |
G | 112 | 3.6% |
I | 102 | 3.3% |
H | 90 | 2.9% |
C | 81 | 2.6% |
T | 67 | 2.2% |
Other values (13) | 273 | 8.9% |
Decimal Number
Value | Count | Frequency (%) |
0 | 676 | |
3 | 489 | |
2 | 407 | |
1 | 361 | |
4 | 346 | |
5 | 339 | |
9 | 286 | |
8 | 273 | |
6 | 242 | 6.7% |
7 | 190 | 5.3% |
Other Punctuation
Value | Count | Frequency (%) |
. | 911 | |
† | 464 | |
* | 370 | |
/ | 10 | 0.6% |
: | 3 | 0.2% |
% | 3 | 0.2% |
? | 2 | 0.1% |
Lowercase Letter
Value | Count | Frequency (%) |
m | 6 | |
g | 4 | |
l | 3 | |
h | 1 | 6.2% |
b | 1 | 6.2% |
q | 1 | 6.2% |
Letter Number
Value | Count | Frequency (%) |
Ⅲ | 5 | |
Ⅱ | 3 | |
Ⅰ | 3 | |
Ⅵ | 1 | 7.7% |
Ⅳ | 1 | 7.7% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1859 | |
] | 149 | 7.4% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1859 | |
[ | 149 | 7.4% |
Math Symbol
Value | Count | Frequency (%) |
~ | 59 | |
+ | 46 |
Space Separator
Value | Count | Frequency (%) |
34449 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 577 |
Other Number
Value | Count | Frequency (%) |
₂ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 140540 | |
Common | 44520 | 23.6% |
Latin | 3104 | 1.6% |
Han | 375 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
의 | 6663 | 4.7% |
성 | 4064 | 2.9% |
기 | 3112 | 2.2% |
골 | 2945 | 2.1% |
관 | 2830 | 2.0% |
상 | 2691 | 1.9% |
절 | 2573 | 1.8% |
증 | 2551 | 1.8% |
및 | 2221 | 1.6% |
타 | 2218 | 1.6% |
Other values (713) | 108672 |
Han
Value | Count | Frequency (%) |
證 | 60 | 16.0% |
陽 | 16 | 4.3% |
虛 | 13 | 3.5% |
氣 | 11 | 2.9% |
病 | 11 | 2.9% |
太 | 9 | 2.4% |
陰 | 9 | 2.4% |
熱 | 8 | 2.1% |
症 | 8 | 2.1% |
人 | 7 | 1.9% |
Other values (127) | 223 |
Latin
Value | Count | Frequency (%) |
N | 688 | |
S | 661 | |
O | 624 | |
A | 251 | 8.1% |
B | 126 | 4.1% |
G | 112 | 3.6% |
I | 102 | 3.3% |
H | 90 | 2.9% |
C | 81 | 2.6% |
T | 67 | 2.2% |
Other values (24) | 302 |
Common
Value | Count | Frequency (%) |
34449 | ||
) | 1859 | 4.2% |
( | 1859 | 4.2% |
. | 911 | 2.0% |
0 | 676 | 1.5% |
- | 577 | 1.3% |
3 | 489 | 1.1% |
† | 464 | 1.0% |
2 | 407 | 0.9% |
* | 370 | 0.8% |
Other values (16) | 2459 | 5.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 140540 | |
ASCII | 47143 | 25.0% |
Punctuation | 464 | 0.2% |
CJK | 370 | 0.2% |
Number Forms | 13 | < 0.1% |
CJK Compat Ideographs | 5 | < 0.1% |
None | 4 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
34449 | ||
) | 1859 | 3.9% |
( | 1859 | 3.9% |
. | 911 | 1.9% |
N | 688 | 1.5% |
0 | 676 | 1.4% |
S | 661 | 1.4% |
O | 624 | 1.3% |
- | 577 | 1.2% |
3 | 489 | 1.0% |
Other values (42) | 4350 | 9.2% |
Hangul
Value | Count | Frequency (%) |
의 | 6663 | 4.7% |
성 | 4064 | 2.9% |
기 | 3112 | 2.2% |
골 | 2945 | 2.1% |
관 | 2830 | 2.0% |
상 | 2691 | 1.9% |
절 | 2573 | 1.8% |
증 | 2551 | 1.8% |
및 | 2221 | 1.6% |
타 | 2218 | 1.6% |
Other values (713) | 108672 |
Punctuation
Value | Count | Frequency (%) |
† | 464 |
CJK
Value | Count | Frequency (%) |
證 | 60 | 16.2% |
陽 | 16 | 4.3% |
虛 | 13 | 3.5% |
氣 | 11 | 3.0% |
病 | 11 | 3.0% |
太 | 9 | 2.4% |
陰 | 9 | 2.4% |
熱 | 8 | 2.2% |
症 | 8 | 2.2% |
人 | 7 | 1.9% |
Other values (122) | 218 |
Number Forms
Value | Count | Frequency (%) |
Ⅲ | 5 | |
Ⅱ | 3 | |
Ⅰ | 3 | |
Ⅵ | 1 | 7.7% |
Ⅳ | 1 | 7.7% |
None
Value | Count | Frequency (%) |
% | 3 | |
₂ | 1 | 25.0% |
CJK Compat Ideographs
Value | Count | Frequency (%) |
歷 | 1 | |
淋 | 1 | |
裏 | 1 | |
凉 | 1 | |
凌 | 1 |
영문명
Text
Distinct | 9993 |
---|---|
Distinct (%) | 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 218 |
---|---|
Median length | 146 |
Mean length | 47.2809 |
Min length | 3 |
Characters and Unicode
Total characters | 472809 |
---|---|
Distinct characters | 90 |
Distinct categories | 13 ? |
Distinct scripts | 3 ? |
Distinct blocks | 4 ? |
Unique
Unique | 9986 ? |
---|---|
Unique (%) | 99.9% |
Sample
1st row | Corrosion of eye and adnexa part unspecified |
---|---|
2nd row | Villonodular synovitis (pigmented) site unspecified |
3rd row | Fall on and from ladder home |
4th row | Shoulder-hand syndrome hand |
5th row | Paralysis of diaphragm |
Value | Count | Frequency (%) |
of | 3385 | 5.6% |
and | 2252 | 3.7% |
other | 2063 | 3.4% |
unspecified | 1386 | 2.3% |
in | 1243 | 2.1% |
with | 1225 | 2.0% |
to | 816 | 1.3% |
or | 714 | 1.2% |
nos | 607 | 1.0% |
by | 595 | 1.0% |
Other values (5365) | 46275 |
Most occurring characters
Value | Count | Frequency (%) |
50591 | 10.7% | |
e | 41964 | 8.9% |
i | 39356 | 8.3% |
o | 32904 | 7.0% |
t | 31590 | 6.7% |
a | 30253 | 6.4% |
s | 29336 | 6.2% |
n | 28714 | 6.1% |
r | 27994 | 5.9% |
l | 18067 | 3.8% |
Other values (80) | 142040 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 396842 | |
Space Separator | 50591 | 10.7% |
Uppercase Letter | 14363 | 3.0% |
Decimal Number | 3511 | 0.7% |
Close Punctuation | 2061 | 0.4% |
Open Punctuation | 2061 | 0.4% |
Other Punctuation | 1747 | 0.4% |
Dash Punctuation | 1304 | 0.3% |
Final Punctuation | 226 | < 0.1% |
Math Symbol | 85 | < 0.1% |
Other values (3) | 18 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 41964 | |
i | 39356 | 9.9% |
o | 32904 | 8.3% |
t | 31590 | 8.0% |
a | 30253 | 7.6% |
s | 29336 | 7.4% |
n | 28714 | 7.2% |
r | 27994 | 7.1% |
l | 18067 | 4.6% |
c | 17755 | 4.5% |
Other values (18) | 98909 |
Uppercase Letter
Value | Count | Frequency (%) |
O | 2051 | |
S | 1468 | |
A | 1322 | 9.2% |
C | 1183 | 8.2% |
N | 1173 | 8.2% |
P | 1029 | 7.2% |
M | 799 | 5.6% |
I | 678 | 4.7% |
E | 554 | 3.9% |
F | 528 | 3.7% |
Other values (16) | 3578 |
Decimal Number
Value | Count | Frequency (%) |
0 | 675 | |
3 | 463 | |
2 | 394 | |
5 | 336 | |
4 | 333 | |
1 | 331 | |
9 | 286 | |
8 | 261 | 7.4% |
6 | 242 | 6.9% |
7 | 190 | 5.4% |
Other Punctuation
Value | Count | Frequency (%) |
. | 904 | |
† | 463 | |
* | 361 | 20.7% |
/ | 10 | 0.6% |
' | 4 | 0.2% |
% | 3 | 0.2% |
: | 2 | 0.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅲ | 6 | |
Ⅰ | 3 | |
Ⅱ | 3 | |
Ⅳ | 1 | 6.2% |
Ⅷ | 1 | 6.2% |
Ⅵ | 1 | 6.2% |
Ⅹ | 1 | 6.2% |
Math Symbol
Value | Count | Frequency (%) |
+ | 46 | |
~ | 38 | |
> | 1 | 1.2% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1848 | |
] | 213 | 10.3% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1848 | |
[ | 213 | 10.3% |
Space Separator
Value | Count | Frequency (%) |
50591 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1304 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 226 |
Modifier Symbol
Value | Count | Frequency (%) |
´ | 1 |
Other Number
Value | Count | Frequency (%) |
₂ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 411213 | |
Common | 61588 | 13.0% |
Greek | 8 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 41964 | 10.2% |
i | 39356 | 9.6% |
o | 32904 | 8.0% |
t | 31590 | 7.7% |
a | 30253 | 7.4% |
s | 29336 | 7.1% |
n | 28714 | 7.0% |
r | 27994 | 6.8% |
l | 18067 | 4.4% |
c | 17755 | 4.3% |
Other values (49) | 113280 |
Common
Value | Count | Frequency (%) |
50591 | ||
) | 1848 | 3.0% |
( | 1848 | 3.0% |
- | 1304 | 2.1% |
. | 904 | 1.5% |
0 | 675 | 1.1% |
† | 463 | 0.8% |
3 | 463 | 0.8% |
2 | 394 | 0.6% |
* | 361 | 0.6% |
Other values (19) | 2737 | 4.4% |
Greek
Value | Count | Frequency (%) |
β | 6 | |
α | 2 | 25.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 472091 | |
Punctuation | 689 | 0.1% |
Number Forms | 16 | < 0.1% |
None | 13 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
50591 | 10.7% | |
e | 41964 | 8.9% |
i | 39356 | 8.3% |
o | 32904 | 7.0% |
t | 31590 | 6.7% |
a | 30253 | 6.4% |
s | 29336 | 6.2% |
n | 28714 | 6.1% |
r | 27994 | 5.9% |
l | 18067 | 3.8% |
Other values (66) | 141322 |
Punctuation
Value | Count | Frequency (%) |
† | 463 | |
’ | 226 |
Number Forms
Value | Count | Frequency (%) |
Ⅲ | 6 | |
Ⅰ | 3 | |
Ⅱ | 3 | |
Ⅳ | 1 | 6.2% |
Ⅷ | 1 | 6.2% |
Ⅵ | 1 | 6.2% |
Ⅹ | 1 | 6.2% |
None
Value | Count | Frequency (%) |
β | 6 | |
% | 3 | |
α | 2 | 15.4% |
´ | 1 | 7.7% |
₂ | 1 | 7.7% |
완전코드구분
Boolean
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 8902 |
Missing (%) | 89.0% |
Memory size | 97.7 KiB |
False | |
---|---|
(Missing) |
Value | Count | Frequency (%) |
False | 1098 | 11.0% |
(Missing) | 8902 |
주상병사용구분
Boolean
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 0.1% |
Missing | 8762 |
Missing (%) | 87.6% |
Memory size | 97.7 KiB |
False | |
---|---|
(Missing) |
Value | Count | Frequency (%) |
False | 1238 | 12.4% |
(Missing) | 8762 |
법정감염병구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
제2급 | 45 |
제4급 | 35 |
제3급 | 31 |
제1급 | 7 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9882 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9882 | |
제2급 | 45 | 0.4% |
제4급 | 35 | 0.4% |
제3급 | 31 | 0.3% |
제1급 | 7 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9882 | |
제2급 | 45 | 0.4% |
제4급 | 35 | 0.4% |
제3급 | 31 | 0.3% |
제1급 | 7 | 0.1% |
성별구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
X | 474 |
Y | 65 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8383 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9461 | |
X | 474 | 4.7% |
Y | 65 | 0.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9461 | |
x | 474 | 4.7% |
y | 65 | 0.7% |
상한연령
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
60 | 257 |
24 | 5 |
20 | 2 |
15 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.947 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9735 | |
60 | 257 | 2.6% |
24 | 5 | 0.1% |
20 | 2 | < 0.1% |
15 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9735 | |
60 | 257 | 2.6% |
24 | 5 | < 0.1% |
20 | 2 | < 0.1% |
15 | 1 | < 0.1% |
하한연령
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 6 |
---|---|
Distinct (%) | 1.6% |
Missing | 9618 |
Missing (%) | 96.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12.188482 |
Minimum | 8 |
---|---|
Maximum | 65 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 8 |
---|---|
5-th percentile | 8 |
Q1 | 8 |
median | 8 |
Q3 | 15 |
95-th percentile | 40 |
Maximum | 65 |
Range | 57 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 8.5601821 |
---|---|
Coefficient of variation (CV) | 0.70231734 |
Kurtosis | 8.053814 |
Mean | 12.188482 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.7629564 |
Sum | 4656 |
Variance | 73.276717 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
8 | 262 | 2.6% |
15 | 71 | 0.7% |
40 | 24 | 0.2% |
20 | 23 | 0.2% |
10 | 1 | < 0.1% |
65 | 1 | < 0.1% |
(Missing) | 9618 |
Value | Count | Frequency (%) |
8 | 262 | |
10 | 1 | < 0.1% |
15 | 71 | 0.7% |
20 | 23 | 0.2% |
40 | 24 | 0.2% |
65 | 1 | < 0.1% |
Value | Count | Frequency (%) |
65 | 1 | < 0.1% |
40 | 24 | 0.2% |
20 | 23 | 0.2% |
15 | 71 | 0.7% |
10 | 1 | < 0.1% |
8 | 262 |
양한방구분
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
양한방 공통 | |
---|---|
한방 | 61 |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 5.9756 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 양한방 공통 |
---|---|
2nd row | 양한방 공통 |
3rd row | 양한방 공통 |
4th row | 양한방 공통 |
5th row | 양한방 공통 |
Common Values
Value | Count | Frequency (%) |
양한방 공통 | 9939 | |
한방 | 61 | 0.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
양한방 | 9939 | |
공통 | 9939 | |
한방 | 61 | 0.3% |
법정감염병구분 | 성별구분 | 상한연령 | 하한연령 | 양한방구분 | |
---|---|---|---|---|---|
법정감염병구분 | 1.000 | 0.000 | NaN | NaN | NaN |
성별구분 | 0.000 | 1.000 | NaN | NaN | NaN |
상한연령 | NaN | NaN | 1.000 | NaN | NaN |
하한연령 | NaN | NaN | NaN | 1.000 | NaN |
양한방구분 | NaN | NaN | NaN | NaN | 1.000 |
성별구분 | 상한연령 | 양한방구분 | 법정감염병구분 | |
---|---|---|---|---|
성별구분 | 1.000 | 1.000 | 1.000 | 1.000 |
상한연령 | 1.000 | 1.000 | 1.000 | NaN |
양한방구분 | 1.000 | 1.000 | 1.000 | 1.000 |
법정감염병구분 | 1.000 | NaN | 1.000 | 1.000 |
하한연령 | 법정감염병구분 | 성별구분 | 상한연령 | 양한방구분 | |
---|---|---|---|---|---|
하한연령 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 |
법정감염병구분 | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 |
성별구분 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
상한연령 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 |
양한방구분 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
상병기호 | 한글명 | 영문명 | 완전코드구분 | 주상병사용구분 | 법정감염병구분 | 성별구분 | 상한연령 | 하한연령 | 양한방구분 | |
---|---|---|---|---|---|---|---|---|---|---|
39507 | T269 | 눈 및 눈부속기의 상세불명 부분의 부식 | Corrosion of eye and adnexa part unspecified | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
19062 | M1229 | (착색) 융모결절성 윤활막염 상세불명 부분 | Villonodular synovitis (pigmented) site unspecified | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
42012 | W110 | 사다리에서의 낙상 주택 | Fall on and from ladder home | <NA> | N | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
29744 | M8904 | 어깨-손증후군 손 | Shoulder-hand syndrome hand | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
11958 | J986 | 횡격막의 마비 | Paralysis of diaphragm | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
31390 | M9912 | (척추)부분탈구복합 흉요추 | Subluxation complex (vertebral) thoracolumbar | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
23406 | M6247 | 근육의 구축 발가락 | Contracture of muscle toes | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
12139 | K042 | 치수변성 | Pulp degeneration | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
42234 | W24 | 철사와의 접촉 | Contact with wire | N | N | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
32431 | N854 | 자궁의 후굴 | Retroflexion of uterus | <NA> | <NA> | <NA> | X | <NA> | <NA> | 양한방 공통 |
상병기호 | 한글명 | 영문명 | 완전코드구분 | 주상병사용구분 | 법정감염병구분 | 성별구분 | 상한연령 | 하한연령 | 양한방구분 | |
---|---|---|---|---|---|---|---|---|---|---|
5417 | E1161 | 수포(당뇨병성 수포증)를 동반한 성숙기발병당뇨병(진성 비비만성 비만성)(L14*) | Maturity-onset diabetes (mellitus nonobese obese) with bullae(bullosis diabeticorum)(L14*) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
9707 | H447 | 전방(~에서의) (비자기성)(오래된) 안구내이물 | Retained (nonmagnetic)(old) foreign body (in) anterior chamber | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
21175 | M2591 | 상세불명의 관절장애 견쇄관절 | Joint disorder unspecified acromioclavicular joints | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
30731 | M9035 | 잠함병에서의 골괴사(T70.3†) 고관절 | Osteonecrosis in caisson disease(T70.3†) hip (joint) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
17264 | M06879 | 상세불명 기타 명시된 류마티스관절염 족근골 | Unspecified other specified rheumatoid arthritis tarsus | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
12334 | K103 | 치조골염 | Alveolar osteitis | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
38233 | S62170 | 갈고리뼈의 골절 폐쇄성 | Fracture of hamate bone closed | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
34044 | P11 | 중추신경계통에 대한 기타 출산손상 | Other birth injuries to central nervous system | N | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
30835 | M9058 | 달리 분류된 기타 질환에서의 골괴사 두개골 | Osteonecrosis in other diseases classified elsewhere skull | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
43236 | X092 | 상세불명의 연기 불 및 불꽃에의 노출 학교 기타 시설 및 공공행정 구역 | Exposure to unspecified smoke fire and flames school other institution and public administrative area | <NA> | N | <NA> | <NA> | <NA> | <NA> | 양한방 공통 |
Most frequently occurring
상병기호 | 한글명 | 영문명 | 완전코드구분 | 주상병사용구분 | 법정감염병구분 | 성별구분 | 상한연령 | 하한연령 | 양한방구분 | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|---|
0 | G03 | 기타 및 상세불명의 원인에 의한 수막염 | Meningitis due to other and unspecified causes | N | <NA> | <NA> | <NA> | <NA> | <NA> | 양한방 공통 | 2 |