Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 10000 |
Missing cells | 28 |
Missing cells (%) | 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 332.0 KiB |
Average record size in memory | 34.0 B |
Variable types
Text | 1 |
---|---|
Numeric | 2 |
Dataset
Description | 공무원연금공단 종합재해보상 표준질환분류코드(질환분류코드, 질환분류순번, 질환분류기준코드 등 포함)에 관한 데이터입니다. |
---|---|
Author | 공무원연금공단 |
URL | https://www.data.go.kr/data/15123837/fileData.do |
Reproduction
Analysis started | 2023-12-12 22:07:35.002233 |
---|---|
Analysis finished | 2023-12-12 22:07:35.573228 |
Duration | 0.57 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
질환분류코드
Text
Distinct | 7247 |
---|---|
Distinct (%) | 72.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
e10-e14 | 25 | 0.2% |
e1040 | 10 | 0.1% |
e1042 | 10 | 0.1% |
e1142 | 10 | 0.1% |
m1001 | 9 | 0.1% |
e1140 | 9 | 0.1% |
r52 | 9 | 0.1% |
m7794 | 9 | 0.1% |
m7798 | 9 | 0.1% |
k529 | 9 | 0.1% |
Other values (7236) | 9891 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 4752 | |
1 | 4596 | |
8 | 3831 | 8.7% |
2 | 3608 | 8.2% |
M | 3245 | 7.4% |
4 | 2954 | 6.7% |
3 | 2854 | 6.5% |
6 | 2853 | 6.5% |
9 | 2742 | 6.2% |
5 | 2738 | 6.2% |
Other values (28) | 9750 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 33546 | |
Uppercase Letter | 10186 | 23.2% |
Dash Punctuation | 186 | 0.4% |
Space Separator | 5 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 3245 | |
E | 568 | 5.6% |
S | 508 | 5.0% |
K | 436 | 4.3% |
X | 401 | 3.9% |
T | 400 | 3.9% |
F | 318 | 3.1% |
Q | 312 | 3.1% |
Y | 280 | 2.7% |
O | 275 | 2.7% |
Other values (16) | 3443 |
Decimal Number
Value | Count | Frequency (%) |
0 | 4752 | |
1 | 4596 | |
8 | 3831 | |
2 | 3608 | |
4 | 2954 | |
3 | 2854 | |
6 | 2853 | |
9 | 2742 | |
5 | 2738 | |
7 | 2618 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 186 |
Space Separator
Value | Count | Frequency (%) |
5 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 33737 | |
Latin | 10186 | 23.2% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 3245 | |
E | 568 | 5.6% |
S | 508 | 5.0% |
K | 436 | 4.3% |
X | 401 | 3.9% |
T | 400 | 3.9% |
F | 318 | 3.1% |
Q | 312 | 3.1% |
Y | 280 | 2.7% |
O | 275 | 2.7% |
Other values (16) | 3443 |
Common
Value | Count | Frequency (%) |
0 | 4752 | |
1 | 4596 | |
8 | 3831 | |
2 | 3608 | |
4 | 2954 | |
3 | 2854 | |
6 | 2853 | |
9 | 2742 | |
5 | 2738 | |
7 | 2618 | |
Other values (2) | 191 | 0.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 43923 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 4752 | |
1 | 4596 | |
8 | 3831 | 8.7% |
2 | 3608 | 8.2% |
M | 3245 | 7.4% |
4 | 2954 | 6.7% |
3 | 2854 | 6.5% |
6 | 2853 | 6.5% |
9 | 2742 | 6.2% |
5 | 2738 | 6.2% |
Other values (28) | 9750 |
질환분류순번
Real number (ℝ)
Distinct | 60 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.5805 |
Minimum | 1 |
---|---|
Maximum | 110 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 4 |
95-th percentile | 11 |
Maximum | 110 |
Range | 109 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 5.1834551 |
---|---|
Coefficient of variation (CV) | 1.4476903 |
Kurtosis | 97.45821 |
Mean | 3.5805 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 7.4526681 |
Sum | 35805 |
Variance | 26.868207 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3983 | |
2 | 1739 | |
3 | 1156 | 11.6% |
4 | 882 | 8.8% |
5 | 595 | 5.9% |
6 | 463 | 4.6% |
7 | 247 | 2.5% |
8 | 161 | 1.6% |
9 | 122 | 1.2% |
10 | 114 | 1.1% |
Other values (50) | 538 | 5.4% |
Value | Count | Frequency (%) |
1 | 3983 | |
2 | 1739 | |
3 | 1156 | 11.6% |
4 | 882 | 8.8% |
5 | 595 | 5.9% |
6 | 463 | 4.6% |
7 | 247 | 2.5% |
8 | 161 | 1.6% |
9 | 122 | 1.2% |
10 | 114 | 1.1% |
Value | Count | Frequency (%) |
110 | 1 | |
109 | 1 | |
100 | 1 | |
89 | 1 | |
87 | 1 | |
83 | 1 | |
80 | 1 | |
74 | 1 | |
73 | 1 | |
70 | 1 |
질환분류기준코드
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 28 |
Missing (%) | 0.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.3036502 |
Minimum | 1 |
---|---|
Maximum | 6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 4 |
median | 4 |
Q3 | 5 |
95-th percentile | 5 |
Maximum | 6 |
Range | 5 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.78232321 |
---|---|
Coefficient of variation (CV) | 0.18178132 |
Kurtosis | 0.22617858 |
Mean | 4.3036502 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.48443563 |
Sum | 42916 |
Variance | 0.6120296 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 4289 | |
5 | 4036 | |
3 | 1214 | 12.1% |
6 | 270 | 2.7% |
2 | 155 | 1.6% |
1 | 8 | 0.1% |
(Missing) | 28 | 0.3% |
Value | Count | Frequency (%) |
1 | 8 | 0.1% |
2 | 155 | 1.6% |
3 | 1214 | 12.1% |
4 | 4289 | |
5 | 4036 | |
6 | 270 | 2.7% |
Value | Count | Frequency (%) |
6 | 270 | 2.7% |
5 | 4036 | |
4 | 4289 | |
3 | 1214 | 12.1% |
2 | 155 | 1.6% |
1 | 8 | 0.1% |
질환분류순번 | 질환분류기준코드 | |
---|---|---|
질환분류순번 | 1.000 | 0.266 |
질환분류기준코드 | 0.266 | 1.000 |
질환분류순번 | 질환분류기준코드 | |
---|---|---|
질환분류순번 | 1.000 | 0.134 |
질환분류기준코드 | 0.134 | 1.000 |
질환분류코드 | 질환분류순번 | 질환분류기준코드 | |
---|---|---|---|
31730 | E274 | 4 | 4 |
2624 | S8556 | 1 | 5 |
27642 | K861 | 2 | 4 |
45712 | N814 | 2 | 4 |
3117 | O3431 | 1 | 5 |
36600 | H185 | 7 | 4 |
43668 | M0748 | 1 | 5 |
7968 | O3462 | 3 | 5 |
12410 | Y365 | 2 | 4 |
41412 | M198 | 1 | 4 |
질환분류코드 | 질환분류순번 | 질환분류기준코드 | |
---|---|---|---|
22687 | E1003 | 4 | 5 |
42510 | M0685 | 3 | 5 |
54242 | M8657 | 1 | 5 |
12587 | Y1734 | 2 | 5 |
8531 | T677 | 1 | 4 |
33944 | M0327 | 10 | 5 |
5895 | R10-R19 | 3 | 2 |
17630 | B085 | 2 | 4 |
6350 | P94 | 1 | 3 |
6545 | O912 | 3 | 4 |