Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 839.8 KiB |
Average record size in memory | 86.0 B |
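The average record size follows from the two figures above it. A minimal sketch of that arithmetic, assuming the report uses binary units (1 KiB = 1024 bytes); the variable names are illustrative:

```python
# Values copied from the "Dataset statistics" table.
total_size_kib = 839.8
n_observations = 10_000

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```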
Variable types
Categorical | 3 |
---|---|
Text | 1 |
Numeric | 5 |
Dataset
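Description | Health insurance treatment cost statistics by 5-character disease code (5단상병) / Based on treatment date (reviewed claims cover each treatment year plus 4 months), e.g. treatment months: Jan-Dec 2022, review months: Jan 2022-Apr 2023 / Insurer: National Health Insurance / Provider type: pharmacies excluded / Korean-medicine diagnoses excluded |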
---|---|
URL | https://www.data.go.kr/data/15072876/fileData.do |
진료년도 has constant value "2022" | Constant |
환자수 is highly overall correlated with 명세서건수 and 3 other fields | High correlation |
명세서건수 is highly overall correlated with 환자수 and 3 other fields | High correlation |
입내원일수 is highly overall correlated with 환자수 and 3 other fields | High correlation |
요양급여비용총액 is highly overall correlated with 환자수 and 3 other fields | High correlation |
보험자부담금 is highly overall correlated with 환자수 and 3 other fields | High correlation |
입내원일수 is highly skewed (γ1 = 69.9307136) | Skewed |
요양급여비용총액 is highly skewed (γ1 = 51.77926953) | Skewed |
보험자부담금 is highly skewed (γ1 = 46.49536151) | Skewed |
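The skewness alerts above report γ1 for three columns. The report does not say which estimator was used, so the following is only a sketch of the plain moment-based form, γ1 = m3 / m2^1.5; the function name `skewness` is hypothetical and a bias-corrected estimator would give slightly different values.

```python
def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

# A long right tail gives a positive value; a symmetric sample gives zero.
right_tailed = [1, 1, 1, 1, 10]
symmetric = [1, 2, 3]
print(skewness(right_tailed), skewness(symmetric))
```

A single extreme row, such as the 입내원일수 maximum of 2983759 against a median of 70, is enough to push γ1 far above zero.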
Reproduction
Analysis started | 2023-12-12 13:41:21.671647 |
---|---|
Analysis finished | 2023-12-12 13:41:25.541498 |
Duration | 3.87 seconds |
Software version | ydata-profiling v4.5.1 |
진료년도
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2022 | 10000 |
---|---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2022 |
---|---|
2nd row | 2022 |
3rd row | 2022 |
4th row | 2022 |
5th row | 2022 |
Common Values
Value | Count | Frequency (%) |
2022 | 10000 | 100.0% |
주상병코드
Text
Distinct | 3303 |
---|---|
Distinct (%) | 33.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
h430 | 10 | 0.1% |
f431 | 10 | 0.1% |
h600 | 10 | 0.1% |
h0241 | 10 | 0.1% |
e876 | 10 | 0.1% |
c444 | 10 | 0.1% |
c716 | 10 | 0.1% |
d127 | 9 | 0.1% |
d383 | 9 | 0.1% |
h184 | 9 | 0.1% |
Other values (3293) | 9903 | 99.0% |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 8328 | 16.7% |
1 | 5070 | 10.1% |
0 | 5051 | 10.1% |
2 | 3220 | 6.4% |
3 | 3087 | 6.2% |
4 | 3040 | 6.1% |
8 | 2912 | 5.8% |
9 | 2649 | 5.3% |
5 | 2525 | 5.1% |
6 | 2361 | 4.7% |
Other values (9) | 11757 | 23.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 31672 | 63.3% |
Uppercase Letter | 10000 | 20.0% |
Space Separator | 8328 | 16.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 5070 | 16.0% |
0 | 5051 | 15.9% |
2 | 3220 | 10.2% |
3 | 3087 | 9.7% |
4 | 3040 | 9.6% |
8 | 2912 | 9.2% |
9 | 2649 | 8.4% |
5 | 2525 | 8.0% |
6 | 2361 | 7.5% |
7 | 1757 | 5.5% |
Uppercase Letter
Value | Count | Frequency (%) |
H | 1897 | |
E | 1435 | |
D | 1429 | |
C | 1384 | |
G | 1216 | |
F | 1094 | |
A | 817 | |
B | 728 | 7.3% |
Space Separator
Value | Count | Frequency (%) |
(space) | 8328 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 40000 | 80.0% |
Latin | 10000 | 20.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
(space) | 8328 | 20.8% |
1 | 5070 | |
0 | 5051 | |
2 | 3220 | 8.1% |
3 | 3087 | 7.7% |
4 | 3040 | 7.6% |
8 | 2912 | 7.3% |
9 | 2649 | 6.6% |
5 | 2525 | 6.3% |
6 | 2361 | 5.9% |
Latin
Value | Count | Frequency (%) |
H | 1897 | |
E | 1435 | |
D | 1429 | |
C | 1384 | |
G | 1216 | |
F | 1094 | |
A | 817 | |
B | 728 | 7.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 50000 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 8328 | 16.7% |
1 | 5070 | 10.1% |
0 | 5051 | 10.1% |
2 | 3220 | 6.4% |
3 | 3087 | 6.2% |
4 | 3040 | 6.1% |
8 | 2912 | 5.8% |
9 | 2649 | 5.3% |
5 | 2525 | 5.1% |
6 | 2361 | 4.7% |
Other values (9) | 11757 | 23.5% |
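The category tables above (Decimal Number, Uppercase Letter, Space Separator) correspond to Unicode general categories. A minimal sketch of such a count with the standard library follows. It assumes that shorter codes are padded with a trailing space to a fixed width of 5, which would explain the 8328 space characters among 50000 characters in 10000 rows; that padding is an inference, and `category_counts` is a hypothetical helper name.

```python
import unicodedata
from collections import Counter

def category_counts(codes):
    """Count characters by Unicode general category (Lu, Nd, Zs, ...)."""
    counts = Counter()
    for code in codes:
        for ch in code:
            counts[unicodedata.category(ch)] += 1
    return counts

# Codes taken from the sample rows; the trailing space is an assumed padding.
sample = ["C108 ", "F061 ", "A1651"]
counts = category_counts(sample)
print(counts)

# Stripping the padding before joining or grouping avoids mismatched keys.
cleaned = [code.strip() for code in sample]
```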
성별
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
여 | 5022 |
---|---|
남 | 4978 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 남 |
---|---|
2nd row | 여 |
3rd row | 여 |
4th row | 남 |
5th row | 남 |
Common Values
Value | Count | Frequency (%) |
여 | 5022 | 50.2% |
남 | 4978 | 49.8% |
연령군
Categorical
Distinct | 18 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
13_60~64세 | 662 |
---|---|
14_65~69세 | 640 |
12_55~59세 | 634 |
10_45~49세 | 615 |
16_75~79세 | 611 |
Other values (13) | 6838 |
Length
Max length | 9 |
---|---|
Median length | 9 |
Mean length | 8.86 |
Min length | 7 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 18_85세 이상 |
---|---|
2nd row | 15_70~74세 |
3rd row | 15_70~74세 |
4th row | 05_20~24세 |
5th row | 13_60~64세 |
Common Values
Value | Count | Frequency (%) |
13_60~64세 | 662 | 6.6% |
14_65~69세 | 640 | 6.4% |
12_55~59세 | 634 | 6.3% |
10_45~49세 | 615 | 6.2% |
16_75~79세 | 611 | 6.1% |
11_50~54세 | 606 | 6.1% |
06_25~29세 | 600 | 6.0% |
09_40~44세 | 597 | 6.0% |
15_70~74세 | 597 | 6.0% |
17_80~84세 | 593 | 5.9% |
Other values (8) | 3845 |
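The 연령군 (age group) labels combine a two-digit sort prefix with an age range, for example `13_60~64세` or `18_85세 이상` (85 and over). A minimal sketch of splitting them into numeric parts, assuming every label follows one of the two shapes seen in this report; `parse_age_group` is a hypothetical helper:

```python
import re

# "NN_low~high세" or "NN_low세 이상" (open-ended top group).
_AGE_GROUP = re.compile(r"^(\d{2})_(\d+)(?:~(\d+))?세(?:\s*이상)?$")

def parse_age_group(label):
    """Return (order, lower_age, upper_age); upper_age is None if open-ended."""
    match = _AGE_GROUP.match(label)
    if match is None:
        raise ValueError(f"unexpected age group label: {label!r}")
    order, low, high = match.groups()
    return int(order), int(low), (int(high) if high is not None else None)

print(parse_age_group("13_60~64세"))
print(parse_age_group("18_85세 이상"))
```

Sorting on the first element reproduces the order that the numeric prefix was presumably added to preserve.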
환자수
Real number (ℝ)
HIGH CORRELATION
Distinct | 1507 |
---|---|
Distinct (%) | 15.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 684.6554 |
Minimum | 1 |
---|---|
Maximum | 152812 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 17 |
Q3 | 119.25 |
95-th percentile | 2344.05 |
Maximum | 152812 |
Range | 152811 |
Interquartile range (IQR) | 116.25 |
Descriptive statistics
Standard deviation | 4185.2533 |
---|---|
Coefficient of variation (CV) | 6.112934 |
Kurtosis | 372.75056 |
Mean | 684.6554 |
Median Absolute Deviation (MAD) | 16 |
Skewness | 16.094784 |
Sum | 6846554 |
Variance | 17516345 |
Monotonicity | Not monotonic |
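Several of the 환자수 (patient count) statistics above are derived from others, which makes for a quick consistency check. The sketch below recomputes them from the reported figures; the dictionary name `derived` is illustrative, and small differences are expected because the inputs are rounded.

```python
# Reported values for 환자수, copied from the tables above.
mean, std = 684.6554, 4185.2533
q1, q3 = 3, 119.25
minimum, maximum = 1, 152812
total, n = 6846554, 10_000

derived = {
    "mean": total / n,              # Sum / number of observations
    "cv": std / mean,               # coefficient of variation
    "iqr": q3 - q1,                 # interquartile range
    "range": maximum - minimum,
    "variance": std ** 2,
}
for name, value in derived.items():
    print(f"{name}: {value:.6g}")
```

A coefficient of variation above 6 together with a median of 17 and a mean near 685 shows how strongly a few large diagnosis groups dominate the column.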
Value | Count | Frequency (%) |
1 | 1354 | 13.5% |
2 | 746 | 7.5% |
3 | 549 | 5.5% |
4 | 419 | 4.2% |
5 | 265 | 2.6% |
6 | 254 | 2.5% |
7 | 227 | 2.3% |
8 | 201 | 2.0% |
9 | 168 | 1.7% |
10 | 153 | 1.5% |
Other values (1497) | 5664 | 56.6% |
Value | Count | Frequency (%) |
1 | 1354 | 13.5% |
2 | 746 | 7.5% |
3 | 549 | 5.5% |
4 | 419 | 4.2% |
5 | 265 | 2.6% |
6 | 254 | 2.5% |
7 | 227 | 2.3% |
8 | 201 | 2.0% |
9 | 168 | 1.7% |
10 | 153 | 1.5% |
Value | Count | Frequency (%) |
152812 | 1 | < 0.1% |
112047 | 1 | < 0.1% |
95878 | 1 | < 0.1% |
92539 | 1 | < 0.1% |
88405 | 1 | < 0.1% |
81751 | 1 | < 0.1% |
81402 | 1 | < 0.1% |
68157 | 1 | < 0.1% |
66475 | 1 | < 0.1% |
66302 | 1 | < 0.1% |
명세서건수
Real number (ℝ)
HIGH CORRELATION
Distinct | 2209 |
---|---|
Distinct (%) | 22.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1884.9588 |
Minimum | 1 |
---|---|
Maximum | 454086 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 8 |
median | 53 |
Q3 | 355 |
95-th percentile | 6623 |
Maximum | 454086 |
Range | 454085 |
Interquartile range (IQR) | 347 |
Descriptive statistics
Standard deviation | 11665.888 |
---|---|
Coefficient of variation (CV) | 6.1889351 |
Kurtosis | 392.11683 |
Mean | 1884.9588 |
Median Absolute Deviation (MAD) | 51 |
Skewness | 16.326581 |
Sum | 18849588 |
Variance | 1.3609293 × 10⁸ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 738 | 7.4% |
2 | 464 | 4.6% |
3 | 306 | 3.1% |
4 | 297 | 3.0% |
5 | 227 | 2.3% |
8 | 183 | 1.8% |
6 | 181 | 1.8% |
7 | 174 | 1.7% |
9 | 138 | 1.4% |
10 | 125 | 1.2% |
Other values (2199) | 7167 | 71.7% |
Value | Count | Frequency (%) |
1 | 738 | 7.4% |
2 | 464 | 4.6% |
3 | 306 | 3.1% |
4 | 297 | 3.0% |
5 | 227 | 2.3% |
6 | 181 | 1.8% |
7 | 174 | 1.7% |
8 | 183 | 1.8% |
9 | 138 | 1.4% |
10 | 125 | 1.2% |
Value | Count | Frequency (%) |
454086 | 1 | < 0.1% |
282156 | 1 | < 0.1% |
249011 | 1 | < 0.1% |
238449 | 1 | < 0.1% |
230875 | 1 | < 0.1% |
214721 | 1 | < 0.1% |
201520 | 1 | < 0.1% |
194181 | 1 | < 0.1% |
182910 | 1 | < 0.1% |
179128 | 1 | < 0.1% |
입내원일수
Real number (ℝ)
HIGH CORRELATION
SKEWED
Distinct | 2401 |
---|---|
Distinct (%) | 24.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2593.951 |
Minimum | 0 |
---|---|
Maximum | 2983759 |
Zeros | 5 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1 |
Q1 | 10 |
median | 70 |
Q3 | 491 |
95-th percentile | 7792.8 |
Maximum | 2983759 |
Range | 2983759 |
Interquartile range (IQR) | 481 |
Descriptive statistics
Standard deviation | 34466.033 |
---|---|
Coefficient of variation (CV) | 13.287079 |
Kurtosis | 5769.4124 |
Mean | 2593.951 |
Median Absolute Deviation (MAD) | 68 |
Skewness | 69.930714 |
Sum | 25939510 |
Variance | 1.1879074 × 10⁹ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 686 | 6.9% |
2 | 402 | 4.0% |
3 | 261 | 2.6% |
4 | 260 | 2.6% |
5 | 204 | 2.0% |
8 | 164 | 1.6% |
6 | 163 | 1.6% |
7 | 159 | 1.6% |
9 | 136 | 1.4% |
10 | 116 | 1.2% |
Other values (2391) | 7449 | 74.5% |
Value | Count | Frequency (%) |
0 | 5 | 0.1% |
1 | 686 | 6.9% |
2 | 402 | 4.0% |
3 | 261 | 2.6% |
4 | 260 | 2.6% |
5 | 204 | 2.0% |
6 | 163 | 1.6% |
7 | 159 | 1.6% |
8 | 164 | 1.6% |
9 | 136 | 1.4% |
Value | Count | Frequency (%) |
2983759 | 1 | < 0.1% |
1243579 | 1 | < 0.1% |
279965 | 1 | < 0.1% |
275265 | 1 | < 0.1% |
248867 | 1 | < 0.1% |
238426 | 1 | < 0.1% |
230599 | 1 | < 0.1% |
226153 | 1 | < 0.1% |
222176 | 1 | < 0.1% |
201934 | 1 | < 0.1% |
요양급여비용총액
Real number (ℝ)
HIGH CORRELATION
SKEWED
Distinct | 9668 |
---|---|
Distinct (%) | 96.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.8088868 × 10⁸ |
Minimum | 0 |
---|---|
Maximum | 2.3582 × 10¹¹ |
Zeros | 2 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 33940 |
Q1 | 783325 |
median | 8417420 |
Q3 | 62386415 |
95-th percentile | 8.0167348 × 10⁸ |
Maximum | 2.3582 × 10¹¹ |
Range | 2.3582 × 10¹¹ |
Interquartile range (IQR) | 61603090 |
Descriptive statistics
Standard deviation | 3.0899179 × 10⁹ |
---|---|
Coefficient of variation (CV) | 11.000507 |
Kurtosis | 3561.4546 |
Mean | 2.8088868 × 10⁸ |
Median Absolute Deviation (MAD) | 8342370 |
Skewness | 51.77927 |
Sum | 2.8088868 × 10¹² |
Variance | 9.5475925 × 10¹⁸ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
16970 | 103 | 1.0% |
12130 | 39 | 0.4% |
21180 | 13 | 0.1% |
29100 | 12 | 0.1% |
14780 | 10 | 0.1% |
33940 | 7 | 0.1% |
24260 | 6 | 0.1% |
20110 | 6 | 0.1% |
11870 | 6 | 0.1% |
17950 | 6 | 0.1% |
Other values (9658) | 9792 | 97.9% |
Value | Count | Frequency (%) |
0 | 2 | |
2240 | 1 | < 0.1% |
3850 | 1 | < 0.1% |
4930 | 1 | < 0.1% |
5100 | 2 | |
5610 | 4 | |
5930 | 1 | < 0.1% |
6070 | 2 | |
6630 | 1 | < 0.1% |
8160 | 1 | < 0.1% |
Value | Count | Frequency (%) |
235820000000 | 1 | < 0.1% |
100629000000 | 1 | < 0.1% |
71987700400 | 1 | < 0.1% |
69607967620 | 1 | < 0.1% |
56942114570 | 1 | < 0.1% |
37271687190 | 1 | < 0.1% |
34664264260 | 1 | < 0.1% |
31136766910 | 1 | < 0.1% |
29182070240 | 1 | < 0.1% |
26560564630 | 1 | < 0.1% |
보험자부담금
Real number (ℝ)
HIGH CORRELATION
SKEWED
Distinct | 9660 |
---|---|
Distinct (%) | 96.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.248684 × 10⁸ |
Minimum | 0 |
---|---|
Maximum | 1.76072 × 10¹¹ |
Zeros | 2 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 22366 |
Q1 | 513152.5 |
median | 6058835 |
Q3 | 46265408 |
95-th percentile | 6.0557597 × 10⁸ |
Maximum | 1.76072 × 10¹¹ |
Range | 1.76072 × 10¹¹ |
Interquartile range (IQR) | 45752255 |
Descriptive statistics
Standard deviation | 2.4391541 × 10⁹ |
---|---|
Coefficient of variation (CV) | 10.847029 |
Kurtosis | 2925.5917 |
Mean | 2.248684 × 10⁸ |
Median Absolute Deviation (MAD) | 6013165 |
Skewness | 46.495362 |
Sum | 2.248684 × 10¹² |
Variance | 5.9494728 × 10¹⁸ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11970 | 75 | 0.8% |
8530 | 21 | 0.2% |
15370 | 21 | 0.2% |
10630 | 15 | 0.1% |
20500 | 10 | 0.1% |
14880 | 8 | 0.1% |
10380 | 7 | 0.1% |
17060 | 6 | 0.1% |
16170 | 6 | 0.1% |
14250 | 6 | 0.1% |
Other values (9650) | 9825 |
Value | Count | Frequency (%) |
0 | 2 | < 0.1% |
50 | 1 | < 0.1% |
1340 | 1 | < 0.1% |
2710 | 1 | < 0.1% |
2910 | 2 | < 0.1% |
2920 | 1 | < 0.1% |
3060 | 1 | < 0.1% |
3070 | 3 | |
3210 | 5 | |
3610 | 1 | < 0.1% |
Value | Count | Frequency (%) |
176072000000 | 1 | < 0.1% |
75534895280 | 1 | < 0.1% |
66960543610 | 1 | < 0.1% |
63875885710 | 1 | < 0.1% |
52070254350 | 1 | < 0.1% |
34308133310 | 1 | < 0.1% |
28837767100 | 1 | < 0.1% |
24504634880 | 1 | < 0.1% |
23052710960 | 1 | < 0.1% |
21779005100 | 1 | < 0.1% |
Variable | 성별 | 연령군 | 환자수 | 명세서건수 | 입내원일수 | 요양급여비용총액 | 보험자부담금 |
---|---|---|---|---|---|---|---|
성별 | 1.000 | 0.044 | 0.022 | 0.028 | 0.000 | 0.015 | 0.000 |
연령군 | 0.044 | 1.000 | 0.054 | 0.029 | 0.000 | 0.000 | 0.000 |
환자수 | 0.022 | 0.054 | 1.000 | 0.777 | 0.782 | 0.619 | 0.601 |
명세서건수 | 0.028 | 0.029 | 0.777 | 1.000 | 1.000 | 0.872 | 0.855 |
입내원일수 | 0.000 | 0.000 | 0.782 | 1.000 | 1.000 | 1.000 | 1.000 |
요양급여비용총액 | 0.015 | 0.000 | 0.619 | 0.872 | 1.000 | 1.000 | 0.994 |
보험자부담금 | 0.000 | 0.000 | 0.601 | 0.855 | 1.000 | 0.994 | 1.000 |
Variable | 성별 | 연령군 |
---|---|---|
성별 | 1.000 | 0.035 |
연령군 | 0.035 | 1.000 |
Variable | 환자수 | 명세서건수 | 입내원일수 | 요양급여비용총액 | 보험자부담금 | 성별 | 연령군 |
---|---|---|---|---|---|---|---|
환자수 | 1.000 | 0.958 | 0.929 | 0.827 | 0.810 | 0.022 | 0.018 |
명세서건수 | 0.958 | 1.000 | 0.979 | 0.895 | 0.883 | 0.021 | 0.012 |
입내원일수 | 0.929 | 0.979 | 1.000 | 0.943 | 0.935 | 0.000 | 0.000 |
요양급여비용총액 | 0.827 | 0.895 | 0.943 | 1.000 | 0.998 | 0.011 | 0.000 |
보험자부담금 | 0.810 | 0.883 | 0.935 | 0.998 | 1.000 | 0.000 | 0.000 |
성별 | 0.022 | 0.021 | 0.000 | 0.011 | 0.000 | 1.000 | 0.035 |
연령군 | 0.018 | 0.012 | 0.000 | 0.000 | 0.000 | 0.035 | 1.000 |
Index | 진료년도 | 주상병코드 | 성별 | 연령군 | 환자수 | 명세서건수 | 입내원일수 | 요양급여비용총액 | 보험자부담금 |
---|---|---|---|---|---|---|---|---|---|
15662 | 2022 | C108 | 남 | 18_85세 이상 | 4 | 11 | 22 | 7874570 | 6963100 |
55328 | 2022 | F061 | 여 | 15_70~74세 | 20 | 45 | 45 | 754710 | 647810 |
21720 | 2022 | C548 | 여 | 15_70~74세 | 40 | 130 | 143 | 24536330 | 21931350 |
63817 | 2022 | F810 | 남 | 05_20~24세 | 6 | 18 | 18 | 1254840 | 995440 |
71353 | 2022 | G547 | 남 | 13_60~64세 | 22 | 168 | 187 | 9501140 | 5749690 |
75161 | 2022 | G932 | 남 | 01_0~4세 | 6 | 11 | 13 | 1951790 | 1656290 |
88379 | 2022 | H472 | 여 | 11_50~54세 | 250 | 414 | 414 | 46964660 | 28331960 |
41884 | 2022 | E040 | 남 | 08_35~39세 | 59 | 121 | 121 | 10897170 | 7243470 |
81633 | 2022 | H170 | 여 | 16_75~79세 | 18 | 49 | 49 | 1835090 | 1265190 |
2577 | 2022 | A1651 | 남 | 12_55~59세 | 162 | 1034 | 1885 | 529237080 | 512615280 |
Index | 진료년도 | 주상병코드 | 성별 | 연령군 | 환자수 | 명세서건수 | 입내원일수 | 요양급여비용총액 | 보험자부담금 |
---|---|---|---|---|---|---|---|---|---|
73823 | 2022 | G803 | 남 | 05_20~24세 | 29 | 480 | 480 | 19198270 | 11762070 |
37312 | 2022 | D484 | 여 | 16_75~79세 | 14 | 65 | 85 | 14585720 | 12942930 |
3461 | 2022 | A198 | 남 | 14_65~69세 | 1 | 14 | 36 | 10863660 | 10666490 |
49884 | 2022 | E344 | 남 | 10_45~49세 | 1 | 2 | 2 | 33350 | 23450 |
45246 | 2022 | E118 | 남 | 02_5~9세 | 15 | 44 | 52 | 4866980 | 3336770 |
8181 | 2022 | B021 | 남 | 12_55~59세 | 15 | 37 | 116 | 35191670 | 24458350 |
37205 | 2022 | D481 | 여 | 03_10~14세 | 25 | 122 | 122 | 44812410 | 40636210 |
32821 | 2022 | D235 | 남 | 06_25~29세 | 1619 | 4085 | 4112 | 292189780 | 201041520 |
21379 | 2022 | C5069 | 여 | 10_45~49세 | 10 | 113 | 112 | 8273000 | 7406800 |
25552 | 2022 | C829 | 여 | 04_15~19세 | 1 | 1 | 1 | 152100 | 144500 |
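In the sample rows above, 보험자부담금 (the insurer-paid amount) is always smaller than 요양급여비용총액 (the total treatment cost), consistent with the near-1 correlation between the two columns. A minimal sketch of that check on three of the rows, using only the standard library; the comma-separated layout and the helper `insurer_share` are assumptions for illustration, not the documented format of the published file.

```python
import csv
import io

# Three rows copied from the sample tables; the CSV layout is assumed.
SAMPLE = """진료년도,주상병코드,성별,연령군,환자수,명세서건수,입내원일수,요양급여비용총액,보험자부담금
2022,C108,남,18_85세 이상,4,11,22,7874570,6963100
2022,F061,여,15_70~74세,20,45,45,754710,647810
2022,C829,여,04_15~19세,1,1,1,152100,144500
"""

def insurer_share(row):
    """Fraction of the total cost paid by the insurer; None if the total is 0."""
    total = int(row["요양급여비용총액"])
    paid = int(row["보험자부담금"])
    return paid / total if total else None

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
shares = [insurer_share(row) for row in rows]
print([round(share, 3) for share in shares])
```

The guard for a zero total matters because the report lists two rows where 요양급여비용총액 is 0.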