Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 839.8 KiB |
Average record size in memory | 86.0 B |
Variable types
Categorical | 3 |
---|---|
Text | 1 |
Numeric | 5 |
Dataset
Description | 5단상병별 건강보험 진료비 통계 / 진료일자 기준(심사분은 각 진료년+4개월) (예) 진료년월: 2022.1월~12월, 심사년월: 2022.1월~2023.4월 / 보험자: 건강보험 / 요양기관 종별: 약국 제외 / 한방상병 제외 |
---|---|
URL | https://www.data.go.kr/data/15072880/fileData.do |
진료년도 has constant value "" | Constant |
환자수 is highly overall correlated with 명세서건수 and 3 other fields | High correlation |
명세서건수 is highly overall correlated with 환자수 and 3 other fields | High correlation |
입내원일수 is highly overall correlated with 환자수 and 3 other fields | High correlation |
보험자부담금 is highly overall correlated with 환자수 and 3 other fields | High correlation |
요양급여비용총액 is highly overall correlated with 환자수 and 3 other fields | High correlation |
환자수 is highly skewed (γ1 = 71.74764149) | Skewed |
명세서건수 is highly skewed (γ1 = 70.87948436) | Skewed |
입내원일수 is highly skewed (γ1 = 67.90367968) | Skewed |
보험자부담금 is highly skewed (γ1 = 41.94012728) | Skewed |
요양급여비용총액 is highly skewed (γ1 = 45.15516829) | Skewed |
Reproduction
Analysis started | 2023-12-12 13:00:36.627872 |
---|---|
Analysis finished | 2023-12-12 13:00:40.949520 |
Duration | 4.32 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
진료년도
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2022 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2022 |
---|---|
2nd row | 2022 |
3rd row | 2022 |
4th row | 2022 |
5th row | 2022 |
Common Values
Value | Count | Frequency (%) |
2022 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2022 | 10000 |
주상병코드
Text
Distinct | 7709 |
---|---|
Distinct (%) | 77.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
a560 | 4 | < 0.1% |
e1464 | 4 | < 0.1% |
f072 | 4 | < 0.1% |
i879 | 4 | < 0.1% |
h681 | 4 | < 0.1% |
s3631 | 4 | < 0.1% |
c1641 | 4 | < 0.1% |
e834 | 4 | < 0.1% |
k5721 | 4 | < 0.1% |
e078 | 4 | < 0.1% |
Other values (7699) | 9960 |
Most occurring characters
Value | Count | Frequency (%) |
6643 | ||
0 | 4635 | 9.3% |
1 | 4270 | 8.5% |
2 | 3918 | 7.8% |
8 | 3475 | 7.0% |
3 | 3233 | 6.5% |
9 | 3137 | 6.3% |
4 | 3130 | 6.3% |
6 | 2647 | 5.3% |
5 | 2640 | 5.3% |
Other values (23) | 12272 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 33357 | |
Uppercase Letter | 10000 | 20.0% |
Space Separator | 6643 | 13.3% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
M | 2337 | |
S | 930 | 9.3% |
K | 574 | 5.7% |
T | 552 | 5.5% |
H | 478 | 4.8% |
Q | 434 | 4.3% |
C | 426 | 4.3% |
I | 417 | 4.2% |
D | 412 | 4.1% |
E | 381 | 3.8% |
Other values (12) | 3059 |
Decimal Number
Value | Count | Frequency (%) |
0 | 4635 | |
1 | 4270 | |
2 | 3918 | |
8 | 3475 | |
3 | 3233 | |
9 | 3137 | |
4 | 3130 | |
6 | 2647 | |
5 | 2640 | |
7 | 2272 |
Space Separator
Value | Count | Frequency (%) |
6643 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 40000 | |
Latin | 10000 | 20.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
M | 2337 | |
S | 930 | 9.3% |
K | 574 | 5.7% |
T | 552 | 5.5% |
H | 478 | 4.8% |
Q | 434 | 4.3% |
C | 426 | 4.3% |
I | 417 | 4.2% |
D | 412 | 4.1% |
E | 381 | 3.8% |
Other values (12) | 3059 |
Common
Value | Count | Frequency (%) |
6643 | ||
0 | 4635 | |
1 | 4270 | |
2 | 3918 | |
8 | 3475 | |
3 | 3233 | |
9 | 3137 | |
4 | 3130 | |
6 | 2647 | 6.6% |
5 | 2640 | 6.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 50000 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
6643 | ||
0 | 4635 | 9.3% |
1 | 4270 | 8.5% |
2 | 3918 | 7.8% |
8 | 3475 | 7.0% |
3 | 3233 | 6.5% |
9 | 3137 | 6.3% |
4 | 3130 | 6.3% |
6 | 2647 | 5.3% |
5 | 2640 | 5.3% |
Other values (23) | 12272 |
성별
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
여 | |
---|---|
남 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 여 |
---|---|
2nd row | 여 |
3rd row | 남 |
4th row | 여 |
5th row | 여 |
Common Values
Value | Count | Frequency (%) |
여 | 5146 | |
남 | 4854 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
여 | 5146 | |
남 | 4854 |
입원외래구분
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
외래 | |
---|---|
입원 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 외래 |
---|---|
2nd row | 입원 |
3rd row | 입원 |
4th row | 외래 |
5th row | 입원 |
Common Values
Value | Count | Frequency (%) |
외래 | 5721 | |
입원 | 4279 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
외래 | 5721 | |
입원 | 4279 |
환자수
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 2493 |
---|---|
Distinct (%) | 24.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6704.9307 |
Minimum | 1 |
---|---|
Maximum | 10867248 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 7 |
median | 50.5 |
Q3 | 440 |
95-th percentile | 13179.4 |
Maximum | 10867248 |
Range | 10867247 |
Interquartile range (IQR) | 433 |
Descriptive statistics
Standard deviation | 123340.74 |
---|---|
Coefficient of variation (CV) | 18.395528 |
Kurtosis | 6081.5416 |
Mean | 6704.9307 |
Median Absolute Deviation (MAD) | 49.5 |
Skewness | 71.747641 |
Sum | 67049307 |
Variance | 1.5212938 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 887 | 8.9% |
2 | 527 | 5.3% |
3 | 333 | 3.3% |
4 | 292 | 2.9% |
5 | 227 | 2.3% |
6 | 200 | 2.0% |
7 | 171 | 1.7% |
8 | 167 | 1.7% |
9 | 145 | 1.5% |
10 | 126 | 1.3% |
Other values (2483) | 6925 |
Value | Count | Frequency (%) |
1 | 887 | |
2 | 527 | |
3 | 333 | 3.3% |
4 | 292 | 2.9% |
5 | 227 | 2.3% |
6 | 200 | 2.0% |
7 | 171 | 1.7% |
8 | 167 | 1.7% |
9 | 145 | 1.5% |
10 | 126 | 1.3% |
Value | Count | Frequency (%) |
10867248 | 1 | |
2929112 | 1 | |
2768451 | 1 | |
2078845 | 1 | |
1305983 | 1 | |
1293145 | 1 | |
1047665 | 1 | |
931839 | 1 | |
920388 | 1 | |
765700 | 1 |
명세서건수
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 3154 |
---|---|
Distinct (%) | 31.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16007.252 |
Minimum | 1 |
---|---|
Maximum | 24432321 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 11 |
median | 101 |
Q3 | 1017.5 |
95-th percentile | 36147.25 |
Maximum | 24432321 |
Range | 24432320 |
Interquartile range (IQR) | 1006.5 |
Descriptive statistics
Standard deviation | 278019.5 |
---|---|
Coefficient of variation (CV) | 17.368347 |
Kurtosis | 6006.2069 |
Mean | 16007.252 |
Median Absolute Deviation (MAD) | 99 |
Skewness | 70.879484 |
Sum | 1.6007252 × 108 |
Variance | 7.7294845 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 662 | 6.6% |
2 | 419 | 4.2% |
3 | 284 | 2.8% |
4 | 219 | 2.2% |
5 | 189 | 1.9% |
6 | 166 | 1.7% |
7 | 157 | 1.6% |
8 | 142 | 1.4% |
10 | 115 | 1.1% |
11 | 104 | 1.0% |
Other values (3144) | 7543 |
Value | Count | Frequency (%) |
1 | 662 | |
2 | 419 | |
3 | 284 | |
4 | 219 | 2.2% |
5 | 189 | 1.9% |
6 | 166 | 1.7% |
7 | 157 | 1.6% |
8 | 142 | 1.4% |
9 | 102 | 1.0% |
10 | 115 | 1.1% |
Value | Count | Frequency (%) |
24432321 | 1 | |
6720782 | 1 | |
4578223 | 1 | |
4429749 | 1 | |
3916760 | 1 | |
3314644 | 1 | |
2635955 | 1 | |
2574913 | 1 | |
2229325 | 1 | |
2158557 | 1 |
입내원일수
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 3795 |
---|---|
Distinct (%) | 38.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 18474.109 |
Minimum | 0 |
---|---|
Maximum | 24418203 |
Zeros | 65 |
Zeros (%) | 0.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 2 |
Q1 | 29 |
median | 244 |
Q3 | 1997.25 |
95-th percentile | 47770.85 |
Maximum | 24418203 |
Range | 24418203 |
Interquartile range (IQR) | 1968.25 |
Descriptive statistics
Standard deviation | 282203.13 |
---|---|
Coefficient of variation (CV) | 15.275602 |
Kurtosis | 5644.6911 |
Mean | 18474.109 |
Median Absolute Deviation (MAD) | 240 |
Skewness | 67.90368 |
Sum | 1.8474109 × 108 |
Variance | 7.9638608 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 296 | 3.0% |
2 | 210 | 2.1% |
3 | 193 | 1.9% |
5 | 141 | 1.4% |
4 | 139 | 1.4% |
6 | 126 | 1.3% |
7 | 121 | 1.2% |
8 | 96 | 1.0% |
11 | 89 | 0.9% |
10 | 76 | 0.8% |
Other values (3785) | 8513 |
Value | Count | Frequency (%) |
0 | 65 | 0.7% |
1 | 296 | |
2 | 210 | |
3 | 193 | |
4 | 139 | |
5 | 141 | |
6 | 126 | |
7 | 121 | |
8 | 96 | 1.0% |
9 | 73 | 0.7% |
Value | Count | Frequency (%) |
24418203 | 1 | |
6673702 | 1 | |
4570342 | 1 | |
4428627 | 1 | |
3914253 | 1 | |
3312966 | 1 | |
3227507 | 1 | |
3169294 | 1 | |
2634336 | 1 | |
2574605 | 1 |
보험자부담금
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 9898 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.2740972 × 109 |
Minimum | 0 |
---|---|
Maximum | 8.22893 × 1011 |
Zeros | 5 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 83222 |
Q1 | 2274900 |
median | 23041045 |
Q3 | 2.1051079 × 108 |
95-th percentile | 3.6654327 × 109 |
Maximum | 8.22893 × 1011 |
Range | 8.22893 × 1011 |
Interquartile range (IQR) | 2.0823589 × 108 |
Descriptive statistics
Standard deviation | 1.3124072 × 1010 |
---|---|
Coefficient of variation (CV) | 10.300684 |
Kurtosis | 2309.5357 |
Mean | 1.2740972 × 109 |
Median Absolute Deviation (MAD) | 22846010 |
Skewness | 41.940127 |
Sum | 1.2740972 × 1013 |
Variance | 1.7224126 × 1020 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11970 | 21 | 0.2% |
8530 | 9 | 0.1% |
15370 | 6 | 0.1% |
7170 | 6 | 0.1% |
10630 | 5 | 0.1% |
0 | 5 | 0.1% |
14880 | 5 | 0.1% |
10380 | 5 | 0.1% |
3210 | 4 | < 0.1% |
10330 | 3 | < 0.1% |
Other values (9888) | 9931 |
Value | Count | Frequency (%) |
0 | 5 | |
160 | 1 | < 0.1% |
800 | 1 | < 0.1% |
2350 | 2 | < 0.1% |
2520 | 1 | < 0.1% |
2770 | 1 | < 0.1% |
2860 | 1 | < 0.1% |
3070 | 2 | < 0.1% |
3170 | 1 | < 0.1% |
3210 | 4 |
Value | Count | Frequency (%) |
822893000000 | 1 | |
683971000000 | 1 | |
235128000000 | 1 | |
215531000000 | 1 | |
206861000000 | 1 | |
182563000000 | 1 | |
180810000000 | 1 | |
177767000000 | 1 | |
157670000000 | 1 | |
139273000000 | 1 |
요양급여비용총액
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 9891 |
---|---|
Distinct (%) | 98.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.6464795 × 109 |
Minimum | 0 |
---|---|
Maximum | 1.16798 × 1012 |
Zeros | 5 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 125672.5 |
Q1 | 3352115 |
median | 32153150 |
Q3 | 2.8721504 × 108 |
95-th percentile | 4.977873 × 109 |
Maximum | 1.16798 × 1012 |
Range | 1.16798 × 1012 |
Interquartile range (IQR) | 2.8386292 × 108 |
Descriptive statistics
Standard deviation | 1.7025943 × 1010 |
---|---|
Coefficient of variation (CV) | 10.340817 |
Kurtosis | 2699.8623 |
Mean | 1.6464795 × 109 |
Median Absolute Deviation (MAD) | 31844690 |
Skewness | 45.155168 |
Sum | 1.6464795 × 1013 |
Variance | 2.8988274 × 1020 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
16970 | 30 | 0.3% |
12130 | 16 | 0.2% |
11870 | 7 | 0.1% |
14780 | 6 | 0.1% |
0 | 5 | 0.1% |
21180 | 5 | 0.1% |
23770 | 3 | < 0.1% |
33940 | 3 | < 0.1% |
29100 | 3 | < 0.1% |
102030 | 3 | < 0.1% |
Other values (9881) | 9919 |
Value | Count | Frequency (%) |
0 | 5 | |
190 | 1 | < 0.1% |
1000 | 1 | < 0.1% |
2790 | 1 | < 0.1% |
3850 | 2 | < 0.1% |
3960 | 1 | < 0.1% |
5530 | 1 | < 0.1% |
5610 | 1 | < 0.1% |
6120 | 1 | < 0.1% |
7460 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1167980000000 | 1 | |
789164000000 | 1 | |
336612000000 | 1 | |
273920000000 | 1 | |
241003000000 | 1 | |
230619000000 | 1 | |
225292000000 | 1 | |
207276000000 | 1 | |
188306000000 | 1 | |
159763000000 | 1 |
성별 | 입원외래구분 | 환자수 | 명세서건수 | 입내원일수 | 보험자부담금 | 요양급여비용총액 | |
---|---|---|---|---|---|---|---|
성별 | 1.000 | 0.000 | 0.009 | 0.000 | 0.000 | 0.000 | 0.011 |
입원외래구분 | 0.000 | 1.000 | 0.018 | 0.026 | 0.000 | 0.000 | 0.000 |
환자수 | 0.009 | 0.018 | 1.000 | 0.973 | 0.968 | 0.676 | 0.789 |
명세서건수 | 0.000 | 0.026 | 0.973 | 1.000 | 0.999 | 0.724 | 0.756 |
입내원일수 | 0.000 | 0.000 | 0.968 | 0.999 | 1.000 | 0.746 | 0.794 |
보험자부담금 | 0.000 | 0.000 | 0.676 | 0.724 | 0.746 | 1.000 | 0.990 |
요양급여비용총액 | 0.011 | 0.000 | 0.789 | 0.756 | 0.794 | 0.990 | 1.000 |
성별 | 입원외래구분 | |
---|---|---|
성별 | 1.000 | 0.000 |
입원외래구분 | 0.000 | 1.000 |
환자수 | 명세서건수 | 입내원일수 | 보험자부담금 | 요양급여비용총액 | 성별 | 입원외래구분 | |
---|---|---|---|---|---|---|---|
환자수 | 1.000 | 0.981 | 0.937 | 0.799 | 0.822 | 0.006 | 0.012 |
명세서건수 | 0.981 | 1.000 | 0.941 | 0.781 | 0.804 | 0.000 | 0.017 |
입내원일수 | 0.937 | 0.941 | 1.000 | 0.920 | 0.934 | 0.000 | 0.000 |
보험자부담금 | 0.799 | 0.781 | 0.920 | 1.000 | 0.998 | 0.000 | 0.000 |
요양급여비용총액 | 0.822 | 0.804 | 0.934 | 0.998 | 1.000 | 0.014 | 0.000 |
성별 | 0.006 | 0.000 | 0.000 | 0.000 | 0.014 | 1.000 | 0.000 |
입원외래구분 | 0.012 | 0.017 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
진료년도 | 주상병코드 | 성별 | 입원외래구분 | 환자수 | 명세서건수 | 입내원일수 | 보험자부담금 | 요양급여비용총액 | |
---|---|---|---|---|---|---|---|---|---|
30143 | 2022 | M8168 | 여 | 외래 | 196 | 423 | 423 | 16612150 | 27810250 |
18709 | 2022 | K5782 | 여 | 입원 | 132 | 145 | 1034 | 296867700 | 380516910 |
22567 | 2022 | M1090 | 남 | 입원 | 140 | 173 | 1177 | 215228420 | 277003900 |
29765 | 2022 | M7990 | 여 | 외래 | 283 | 640 | 639 | 11419840 | 16131840 |
2816 | 2022 | C181 | 여 | 입원 | 281 | 1221 | 9062 | 2993472090 | 3238765130 |
4670 | 2022 | D0511 | 남 | 입원 | 10 | 11 | 68 | 43867730 | 47508170 |
39907 | 2022 | S1314 | 여 | 외래 | 34 | 80 | 80 | 2659010 | 5226010 |
31200 | 2022 | M8688 | 남 | 입원 | 24 | 37 | 526 | 168711350 | 209019310 |
22602 | 2022 | M1099 | 남 | 외래 | 64346 | 179181 | 179089 | 4226007090 | 6831594010 |
5700 | 2022 | D464 | 남 | 외래 | 71 | 342 | 341 | 58625910 | 62965510 |
진료년도 | 주상병코드 | 성별 | 입원외래구분 | 환자수 | 명세서건수 | 입내원일수 | 보험자부담금 | 요양급여비용총액 | |
---|---|---|---|---|---|---|---|---|---|
1153 | 2022 | A830 | 남 | 입원 | 11 | 28 | 490 | 249082160 | 303213840 |
22606 | 2022 | M1100 | 남 | 외래 | 8 | 16 | 16 | 482660 | 699260 |
3702 | 2022 | C675 | 남 | 입원 | 111 | 206 | 1884 | 434946800 | 478232380 |
41650 | 2022 | S568 | 남 | 외래 | 3 | 10 | 10 | 128620 | 205420 |
19313 | 2022 | K761 | 남 | 외래 | 263 | 685 | 685 | 40343000 | 73259900 |
961 | 2022 | A563 | 남 | 외래 | 4 | 5 | 5 | 174930 | 253030 |
38493 | 2022 | R470 | 여 | 입원 | 172 | 199 | 853 | 187541540 | 268275750 |
26907 | 2022 | M5414 | 남 | 외래 | 1785 | 4567 | 4566 | 291236910 | 428507510 |
28721 | 2022 | M7103 | 남 | 입원 | 11 | 12 | 111 | 19400430 | 25365120 |
15761 | 2022 | J0381 | 여 | 입원 | 9 | 9 | 28 | 3834740 | 4442210 |