Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 400.4 KiB |
Average record size in memory | 41.0 B |
Variable types
Numeric | 1 |
---|---|
Text | 3 |
Dataset
Description | 국가유공자자격확인서비스의 질병정보를 관리하는 테이블로 환자들의 질병정보코드에 대한 데이터를 관리하는 테이블 입니다. |
---|---|
URL | https://www.data.go.kr/data/15116493/fileData.do |
순번 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 11:46:02.899494 |
---|---|
Analysis finished | 2023-12-12 11:46:04.208752 |
Duration | 1.31 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
순번
Real number (ℝ)
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 14691.62 |
Minimum | 1 |
---|---|
Maximum | 29279 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1448.95 |
Q1 | 7355 |
median | 14747 |
Q3 | 21982.75 |
95-th percentile | 27835.05 |
Maximum | 29279 |
Range | 29278 |
Interquartile range (IQR) | 14627.75 |
Descriptive statistics
Standard deviation | 8453.2642 |
---|---|
Coefficient of variation (CV) | 0.57538001 |
Kurtosis | -1.1949849 |
Mean | 14691.62 |
Median Absolute Deviation (MAD) | 7325.5 |
Skewness | -0.014831689 |
Sum | 1.469162 × 108 |
Variance | 71457676 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
24463 | 1 | < 0.1% |
19394 | 1 | < 0.1% |
15103 | 1 | < 0.1% |
3904 | 1 | < 0.1% |
28951 | 1 | < 0.1% |
14913 | 1 | < 0.1% |
3712 | 1 | < 0.1% |
23235 | 1 | < 0.1% |
5734 | 1 | < 0.1% |
13749 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
1 | 1 | |
11 | 1 | |
13 | 1 | |
15 | 1 | |
19 | 1 | |
20 | 1 | |
22 | 1 | |
24 | 1 | |
27 | 1 | |
31 | 1 |
Value | Count | Frequency (%) |
29279 | 1 | |
29278 | 1 | |
29266 | 1 | |
29264 | 1 | |
29261 | 1 | |
29256 | 1 | |
29253 | 1 | |
29250 | 1 | |
29249 | 1 | |
29248 | 1 |
질병코드
Text
Distinct | 7897 |
---|---|
Distinct (%) | 79.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
e11.4 | 18 | 0.2% |
e11.0 | 12 | 0.1% |
e10.2 | 11 | 0.1% |
a09.0 | 11 | 0.1% |
e11.1 | 10 | 0.1% |
e11.2 | 10 | 0.1% |
e11.6 | 9 | 0.1% |
e10.0 | 9 | 0.1% |
e12.0 | 8 | 0.1% |
t21 | 8 | 0.1% |
Other values (7887) | 9894 |
Most occurring characters
Value | Count | Frequency (%) |
. | 8654 | |
0 | 4293 | 8.6% |
1 | 3974 | 7.9% |
2 | 3335 | 6.6% |
8 | 3199 | 6.4% |
4 | 3037 | 6.1% |
3 | 2933 | 5.8% |
9 | 2693 | 5.4% |
5 | 2630 | 5.2% |
6 | 2399 | 4.8% |
Other values (31) | 13043 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 30657 | |
Uppercase Letter | 10127 | 20.2% |
Other Punctuation | 9122 | 18.2% |
Math Symbol | 156 | 0.3% |
Dash Punctuation | 127 | 0.3% |
Space Separator | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 1856 | |
S | 533 | 5.3% |
Q | 473 | 4.7% |
X | 466 | 4.6% |
T | 439 | 4.3% |
E | 436 | 4.3% |
W | 410 | 4.0% |
K | 392 | 3.9% |
F | 380 | 3.8% |
V | 367 | 3.6% |
Other values (16) | 4375 |
Decimal Number
Value | Count | Frequency (%) |
0 | 4293 | |
1 | 3974 | |
2 | 3335 | |
8 | 3199 | |
4 | 3037 | |
3 | 2933 | |
9 | 2693 | |
5 | 2630 | |
6 | 2399 | |
7 | 2164 |
Other Punctuation
Value | Count | Frequency (%) |
. | 8654 | |
* | 468 | 5.1% |
Math Symbol
Value | Count | Frequency (%) |
+ | 156 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 127 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 40063 | |
Latin | 10127 | 20.2% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 1856 | |
S | 533 | 5.3% |
Q | 473 | 4.7% |
X | 466 | 4.6% |
T | 439 | 4.3% |
E | 436 | 4.3% |
W | 410 | 4.0% |
K | 392 | 3.9% |
F | 380 | 3.8% |
V | 367 | 3.6% |
Other values (16) | 4375 |
Common
Value | Count | Frequency (%) |
. | 8654 | |
0 | 4293 | |
1 | 3974 | |
2 | 3335 | 8.3% |
8 | 3199 | 8.0% |
4 | 3037 | 7.6% |
3 | 2933 | 7.3% |
9 | 2693 | 6.7% |
5 | 2630 | 6.6% |
6 | 2399 | 6.0% |
Other values (5) | 2916 | 7.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 50190 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 8654 | |
0 | 4293 | 8.6% |
1 | 3974 | 7.9% |
2 | 3335 | 6.6% |
8 | 3199 | 6.4% |
4 | 3037 | 6.1% |
3 | 2933 | 5.8% |
9 | 2693 | 5.4% |
5 | 2630 | 5.2% |
6 | 2399 | 4.8% |
Other values (31) | 13043 |
한글명
Text
Distinct | 9958 |
---|---|
Distinct (%) | 99.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 99 |
---|---|
Median length | 52 |
Mean length | 18.0425 |
Min length | 2 |
Characters and Unicode
Total characters | 180425 |
---|---|
Distinct characters | 842 |
Distinct categories | 11 ? |
Distinct scripts | 3 ? |
Distinct blocks | 5 ? |
Unique
Unique | 9918 ? |
---|---|
Unique (%) | 99.2% |
Sample
1st row | 가스통 폭발 및 파열, 주거지 |
---|---|
2nd row | 임균성 윤활낭염,다발 부위 |
3rd row | 감옥에서 석방과 관련된 문제 |
4th row | 엉덩관절 및 넓적다리 부위에서의 넓적다리신경의 손상 |
5th row | 코카인 유발 기억상실성 장애 |
Value | Count | Frequency (%) |
및 | 2316 | 5.1% |
기타 | 1763 | 3.9% |
상세불명의 | 1505 | 3.3% |
부위 | 872 | 1.9% |
또는 | 727 | 1.6% |
의한 | 719 | 1.6% |
장애 | 430 | 0.9% |
동반한 | 406 | 0.9% |
명시된 | 351 | 0.8% |
노출 | 319 | 0.7% |
Other values (7912) | 35994 |
Most occurring characters
Value | Count | Frequency (%) |
35893 | 19.9% | |
의 | 6926 | 3.8% |
성 | 4180 | 2.3% |
상 | 3287 | 1.8% |
, | 3211 | 1.8% |
기 | 3183 | 1.8% |
증 | 2409 | 1.3% |
및 | 2322 | 1.3% |
에 | 2253 | 1.2% |
명 | 2250 | 1.2% |
Other values (832) | 114511 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 137290 | |
Space Separator | 35893 | 19.9% |
Other Punctuation | 3364 | 1.9% |
Open Punctuation | 1081 | 0.6% |
Close Punctuation | 1079 | 0.6% |
Decimal Number | 834 | 0.5% |
Uppercase Letter | 454 | 0.3% |
Dash Punctuation | 383 | 0.2% |
Lowercase Letter | 27 | < 0.1% |
Letter Number | 16 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
의 | 6926 | 5.0% |
성 | 4180 | 3.0% |
상 | 3287 | 2.4% |
기 | 3183 | 2.3% |
증 | 2409 | 1.8% |
및 | 2322 | 1.7% |
에 | 2253 | 1.6% |
명 | 2250 | 1.6% |
불 | 2156 | 1.6% |
세 | 1987 | 1.4% |
Other values (771) | 106337 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 67 | |
T | 63 | |
I | 59 | |
O | 48 | |
C | 38 | |
B | 24 | 5.3% |
H | 23 | 5.1% |
N | 21 | 4.6% |
A | 19 | 4.2% |
X | 16 | 3.5% |
Other values (13) | 76 |
Decimal Number
Value | Count | Frequency (%) |
0 | 172 | |
1 | 108 | |
2 | 105 | |
3 | 90 | |
9 | 80 | |
8 | 72 | |
4 | 68 | 8.2% |
5 | 54 | 6.5% |
6 | 43 | 5.2% |
7 | 42 | 5.0% |
Lowercase Letter
Value | Count | Frequency (%) |
m | 10 | |
g | 5 | |
l | 5 | |
o | 2 | 7.4% |
a | 2 | 7.4% |
i | 1 | 3.7% |
b | 1 | 3.7% |
s | 1 | 3.7% |
Other Punctuation
Value | Count | Frequency (%) |
, | 3211 | |
. | 135 | 4.0% |
/ | 8 | 0.2% |
% | 7 | 0.2% |
? | 2 | 0.1% |
? | 1 | < 0.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅰ | 12 | |
Ⅷ | 1 | 6.2% |
Ⅱ | 1 | 6.2% |
Ⅹ | 1 | 6.2% |
Ⅲ | 1 | 6.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 953 | |
[ | 127 | 11.7% |
[ | 1 | 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 951 | |
] | 127 | 11.8% |
] | 1 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
35893 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 383 |
Math Symbol
Value | Count | Frequency (%) |
+ | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 137290 | |
Common | 42638 | 23.6% |
Latin | 497 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
의 | 6926 | 5.0% |
성 | 4180 | 3.0% |
상 | 3287 | 2.4% |
기 | 3183 | 2.3% |
증 | 2409 | 1.8% |
및 | 2322 | 1.7% |
에 | 2253 | 1.6% |
명 | 2250 | 1.6% |
불 | 2156 | 1.6% |
세 | 1987 | 1.4% |
Other values (771) | 106337 |
Latin
Value | Count | Frequency (%) |
S | 67 | |
T | 63 | |
I | 59 | |
O | 48 | |
C | 38 | 7.6% |
B | 24 | 4.8% |
H | 23 | 4.6% |
N | 21 | 4.2% |
A | 19 | 3.8% |
X | 16 | 3.2% |
Other values (26) | 119 |
Common
Value | Count | Frequency (%) |
35893 | ||
, | 3211 | 7.5% |
( | 953 | 2.2% |
) | 951 | 2.2% |
- | 383 | 0.9% |
0 | 172 | 0.4% |
. | 135 | 0.3% |
] | 127 | 0.3% |
[ | 127 | 0.3% |
1 | 108 | 0.3% |
Other values (15) | 578 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 137288 | |
ASCII | 43116 | 23.9% |
Number Forms | 16 | < 0.1% |
None | 3 | < 0.1% |
Compat Jamo | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
35893 | ||
, | 3211 | 7.4% |
( | 953 | 2.2% |
) | 951 | 2.2% |
- | 383 | 0.9% |
0 | 172 | 0.4% |
. | 135 | 0.3% |
] | 127 | 0.3% |
[ | 127 | 0.3% |
1 | 108 | 0.3% |
Other values (43) | 1056 | 2.4% |
Hangul
Value | Count | Frequency (%) |
의 | 6926 | 5.0% |
성 | 4180 | 3.0% |
상 | 3287 | 2.4% |
기 | 3183 | 2.3% |
증 | 2409 | 1.8% |
및 | 2322 | 1.7% |
에 | 2253 | 1.6% |
명 | 2250 | 1.6% |
불 | 2156 | 1.6% |
세 | 1987 | 1.4% |
Other values (770) | 106335 |
Number Forms
Value | Count | Frequency (%) |
Ⅰ | 12 | |
Ⅷ | 1 | 6.2% |
Ⅱ | 1 | 6.2% |
Ⅹ | 1 | 6.2% |
Ⅲ | 1 | 6.2% |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 2 |
None
Value | Count | Frequency (%) |
? | 1 | |
] | 1 | |
[ | 1 |
영문명
Text
Distinct | 9992 |
---|---|
Distinct (%) | 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 187 |
---|---|
Median length | 138 |
Mean length | 49.5964 |
Min length | 4 |
Characters and Unicode
Total characters | 495964 |
---|---|
Distinct characters | 91 |
Distinct categories | 14 ? |
Distinct scripts | 4 ? |
Distinct blocks | 5 ? |
Unique
Unique | 9984 ? |
---|---|
Unique (%) | 99.8% |
Sample
1st row | Explosion and rupture of gas cylinder, home |
---|---|
2nd row | Gonococcal bursitis, multiple sites (A54.4+) |
3rd row | Problems related to release from prison |
4th row | Injury of femoral nerves at hip and thigh level |
5th row | Amnestic disorder, cocaine-induced |
Value | Count | Frequency (%) |
of | 3778 | 6.0% |
and | 2576 | 4.1% |
other | 1893 | 3.0% |
in | 1375 | 2.2% |
unspecified | 1220 | 1.9% |
with | 912 | 1.4% |
to | 889 | 1.4% |
or | 833 | 1.3% |
joints | 802 | 1.3% |
by | 575 | 0.9% |
Other values (6287) | 48393 |
Most occurring characters
Value | Count | Frequency (%) |
53326 | 10.8% | |
e | 42354 | 8.5% |
i | 40078 | 8.1% |
o | 33775 | 6.8% |
n | 32171 | 6.5% |
a | 32060 | 6.5% |
t | 30920 | 6.2% |
r | 29065 | 5.9% |
s | 28744 | 5.8% |
l | 19704 | 4.0% |
Other values (81) | 153767 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 410978 | |
Space Separator | 53326 | 10.8% |
Uppercase Letter | 14160 | 2.9% |
Other Punctuation | 8134 | 1.6% |
Decimal Number | 2740 | 0.6% |
Open Punctuation | 2339 | 0.5% |
Close Punctuation | 2338 | 0.5% |
Dash Punctuation | 1548 | 0.3% |
Math Symbol | 368 | 0.1% |
Letter Number | 21 | < 0.1% |
Other values (4) | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 42354 | |
i | 40078 | 9.8% |
o | 33775 | 8.2% |
n | 32171 | 7.8% |
a | 32060 | 7.8% |
t | 30920 | 7.5% |
r | 29065 | 7.1% |
s | 28744 | 7.0% |
l | 19704 | 4.8% |
c | 19411 | 4.7% |
Other values (18) | 102696 |
Uppercase Letter
Value | Count | Frequency (%) |
O | 1930 | |
S | 1398 | |
C | 1377 | |
A | 1157 | 8.2% |
N | 1110 | 7.8% |
P | 1067 | 7.5% |
I | 800 | 5.6% |
M | 722 | 5.1% |
E | 647 | 4.6% |
D | 579 | 4.1% |
Other values (16) | 3373 |
Decimal Number
Value | Count | Frequency (%) |
0 | 584 | |
9 | 279 | |
3 | 273 | |
5 | 270 | |
2 | 267 | |
1 | 261 | |
8 | 248 | |
4 | 236 | |
6 | 174 | 6.4% |
7 | 148 | 5.4% |
Other Punctuation
Value | Count | Frequency (%) |
, | 7134 | |
. | 582 | 7.2% |
' | 206 | 2.5% |
* | 188 | 2.3% |
% | 7 | 0.1% |
/ | 7 | 0.1% |
; | 5 | 0.1% |
& | 5 | 0.1% |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 6 | |
Ⅹ | 6 | |
Ⅰ | 3 | |
Ⅴ | 2 | 9.5% |
Ⅲ | 2 | 9.5% |
Ⅵ | 1 | 4.8% |
Ⅶ | 1 | 4.8% |
Open Punctuation
Value | Count | Frequency (%) |
( | 2140 | |
[ | 199 | 8.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 2139 | |
] | 199 | 8.5% |
Other Letter
Value | Count | Frequency (%) |
에 | 2 | |
서 | 2 |
Space Separator
Value | Count | Frequency (%) |
53326 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1548 |
Math Symbol
Value | Count | Frequency (%) |
+ | 368 |
Control
Value | Count | Frequency (%) |
4 |
Modifier Symbol
Value | Count | Frequency (%) |
` | 3 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 425152 | |
Common | 70801 | 14.3% |
Greek | 7 | < 0.1% |
Hangul | 4 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 42354 | 10.0% |
i | 40078 | 9.4% |
o | 33775 | 7.9% |
n | 32171 | 7.6% |
a | 32060 | 7.5% |
t | 30920 | 7.3% |
r | 29065 | 6.8% |
s | 28744 | 6.8% |
l | 19704 | 4.6% |
c | 19411 | 4.6% |
Other values (49) | 116870 |
Common
Value | Count | Frequency (%) |
53326 | ||
, | 7134 | 10.1% |
( | 2140 | 3.0% |
) | 2139 | 3.0% |
- | 1548 | 2.2% |
0 | 584 | 0.8% |
. | 582 | 0.8% |
+ | 368 | 0.5% |
9 | 279 | 0.4% |
3 | 273 | 0.4% |
Other values (18) | 2428 | 3.4% |
Greek
Value | Count | Frequency (%) |
β | 4 | |
α | 3 |
Hangul
Value | Count | Frequency (%) |
에 | 2 | |
서 | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 495931 | |
Number Forms | 21 | < 0.1% |
None | 7 | < 0.1% |
Hangul | 4 | < 0.1% |
Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
53326 | 10.8% | |
e | 42354 | 8.5% |
i | 40078 | 8.1% |
o | 33775 | 6.8% |
n | 32171 | 6.5% |
a | 32060 | 6.5% |
t | 30920 | 6.2% |
r | 29065 | 5.9% |
s | 28744 | 5.8% |
l | 19704 | 4.0% |
Other values (69) | 153734 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 6 | |
Ⅹ | 6 | |
Ⅰ | 3 | |
Ⅴ | 2 | 9.5% |
Ⅲ | 2 | 9.5% |
Ⅵ | 1 | 4.8% |
Ⅶ | 1 | 4.8% |
None
Value | Count | Frequency (%) |
β | 4 | |
α | 3 |
Hangul
Value | Count | Frequency (%) |
에 | 2 | |
서 | 2 |
Punctuation
Value | Count | Frequency (%) |
’ | 1 |
순번 | 질병코드 | 한글명 | 영문명 | |
---|---|---|---|---|
24462 | 24463 | W36.0 | 가스통 폭발 및 파열, 주거지 | Explosion and rupture of gas cylinder, home |
13635 | 13636 | M73.00* | 임균성 윤활낭염,다발 부위 | Gonococcal bursitis, multiple sites (A54.4+) |
26531 | 26532 | Z65.2 | 감옥에서 석방과 관련된 문제 | Problems related to release from prison |
21026 | 21027 | S74.1 | 엉덩관절 및 넓적다리 부위에서의 넓적다리신경의 손상 | Injury of femoral nerves at hip and thigh level |
5100 | 5101 | F14.6 | 코카인 유발 기억상실성 장애 | Amnestic disorder, cocaine-induced |
13963 | 13964 | N21.9 | 상세불명의 하부 요로 결석 | Calculus of lower urinary tract, unspecified |
9394 | 9395 | K04.6 | 굴이 있는 치아치조 고름집(농양) | Dentoalveolar abscess with sinus |
8261 | 8262 | I51.8 | 전층심장염(급성, 만성) | Pancarditis (acute, chronic) |
9060 | 9061 | J38.4 | 성문의 부종 | Edema of glottis |
23016 | 23017 | T52.0 | 석유 휘발유의 중독효과 | Toxic effect of Petroleum spirits |
순번 | 질병코드 | 한글명 | 영문명 | |
---|---|---|---|---|
3816 | 3817 | E11 | 청년의 인슐린-비의존 당뇨병 | Non-insulin-dependent diabetes of the young |
10872 | 10873 | L27.1 | 약물 및 약제에 의한 국소 피부발진 | Localized skin eruption due to drugs and medicaments |
18374 | 18375 | S62.00 | 손의 발배뼈의 폐쇄성 골절 | Closed fracture of navicular[scaphoid] bone of hand |
1736 | 1737 | B25.8 | 기타 거대세포바이러스 질환 | Other cytomegaloviral diseases |
8129 | 8130 | I42.9 | 상세불명의 심장근육병증 | Cardiomyopathy, unspecified |
11598 | 11599 | M47.89 | 척수병증 또는 신경뿌리병증이 없는 흉추강직, 상세불명의 부위 | Thoracic spondylosis without myelopathy or radiculopathy, site unspecified |
9135 | 9136 | J45.0 | 천식을 동반한 고초 열 | Hay fever with asthma |
19514 | 19515 | O62.2 | 수축 불량 | Poor contractions |
5361 | 5362 | F31 | 조울성 반응 | Manic-depressive reaction |
23114 | 23115 | V04 | 대형화물차 또는 버스와 충돌로 다친 보행자 | Pedestrian injured in collision with heavy transport vehicle or bus |