Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 626 |
Duplicate rows (%) | 6.3% |
Total size in memory | 859.4 KiB |
Average record size in memory | 88.0 B |
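The two derived figures in this table (duplicate share and average record size) follow from the raw counts. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes; the variable names are illustrative and not part of the report:

```python
# Raw figures copied from the "Dataset statistics" table.
n_rows = 10_000
n_duplicates = 626
total_kib = 859.4  # total size in memory, in KiB

# Share of duplicate rows as a percentage of all observations.
duplicate_pct = n_duplicates / n_rows * 100

# Average record size in bytes, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_kib * 1024 / n_rows

print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both values agree with the table after rounding to one decimal place.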
Variable types
Categorical | 3 |
---|---|
Numeric | 6 |
Dataset
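Description | o (Content) Number of persons eligible for, and number of persons who received, general (cancer) health screening, by province/city and age o (Population) National Health Insurance subscribers eligible for at least one type of general (cancer) screening in the given year o (Variable layout) 1 사업년도 (program year) 2 센터구분코드 (center code; 01: Seoul, 02: Busan, 03: Daegu, 04: Gwangju, 05: Daejeon, 06: Gyeongin) 3 관리지사코드 (managing branch code; the branch with jurisdiction over the address at the time of screening) 4 소속지사구분코드 (branch type code; 0: branch, 1: sub-office 1, 2: sub-office 2, 3: sub-office 3) 5 대상자연령 (age of eligible persons) 6 건강검진대상유형코드 (screening type code; A0: general, A5: life-transition screening, D1: stomach cancer, D2: colorectal cancer, D3: breast cancer, D4: liver cancer (first half of year), D5: cervical cancer, D6: liver cancer (second half of year), D7: lung cancer) 7 대상자인원수 (number of eligible persons) 8 수검자인원수 (number of persons screened) 9 수검율 (screening rate = 수검자인원수 / 대상자인원수 × 100 (%)) o (Scope of data provided) The most recent one month within the range for which data exist (2020-05-01 to 2020-05-31); 6 or more rows cannot be provided |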
---|---|
URL | https://www.data.go.kr/data/15121859/fileData.do |
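The variable layout defines 수검율 as 수검자인원수 / 대상자인원수 × 100. A short sketch of that formula, checked against four of the sample rows shown later in this report. The function name `screening_rate` and the rounding to two decimals are assumptions (the rounding is inferred from the sample values, not documented by the source):

```python
def screening_rate(eligible: int, screened: int) -> float:
    """Return the screening rate (수검율) in percent, rounded to two decimals."""
    if eligible <= 0:
        raise ValueError("eligible must be a positive count")
    return round(screened / eligible * 100, 2)


# (대상자인원수, 수검자인원수, reported 수검율) copied from the sample rows.
rows = [
    (8920, 392, 4.39),
    (2447, 148, 6.05),
    (1256, 33, 2.63),
    (9141, 843, 9.22),
]

for eligible, screened, reported in rows:
    # Print the recomputed rate next to the value stored in the dataset.
    print(eligible, screened, screening_rate(eligible, screened), reported)
```

For these rows the recomputed rate matches the stored column, which suggests 수검율 is fully determined by the two count columns.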
Reproduction
Analysis started | 2023-12-12 20:17:15.651246 |
---|---|
Analysis finished | 2023-12-12 20:17:21.224563 |
Duration | 5.57 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
사업년도
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2020 | 10000 |
---|---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020 |
---|---|
2nd row | 2020 |
3rd row | 2020 |
4th row | 2020 |
5th row | 2020 |
Common Values
Value | Count | Frequency (%) |
2020 | 10000 | 100.0% |
센터구분코드
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.4766 |
Minimum | 1 |
---|---|
Maximum | 6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 4 |
Q3 | 5 |
95-th percentile | 6 |
Maximum | 6 |
Range | 5 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 1.785153 |
---|---|
Coefficient of variation (CV) | 0.51347666 |
Kurtosis | -1.3426381 |
Mean | 3.4766 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.0086074088 |
Sum | 34766 |
Variance | 3.1867711 |
Monotonicity | Not monotonic |
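Common values
Value | Count | Frequency (%) |
1 | 2002 | 20.0% |
6 | 1927 | 19.3% |
4 | 1822 | 18.2% |
2 | 1543 | 15.4% |
5 | 1355 | 13.6% |
3 | 1351 | 13.5% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 2002 | 20.0% |
2 | 1543 | 15.4% |
3 | 1351 | 13.5% |
4 | 1822 | 18.2% |
5 | 1355 | 13.6% |
6 | 1927 | 19.3% |
Maximum 10 values
Value | Count | Frequency (%) |
6 | 1927 | 19.3% |
5 | 1355 | 13.6% |
4 | 1822 | 18.2% |
3 | 1351 | 13.5% |
2 | 1543 | 15.4% |
1 | 2002 | 20.0% |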
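Because 센터구분코드 takes only six values, its descriptive statistics can be recomputed from the frequency table alone. A minimal sketch, assuming the report uses the sample variance (division by n − 1); that convention is an assumption and is not stated in the report:

```python
import math

# Value -> count for 센터구분코드, copied from the frequency table.
counts = {1: 2002, 2: 1543, 3: 1351, 4: 1822, 5: 1355, 6: 1927}

n = sum(counts.values())
total = sum(value * count for value, count in counts.items())
mean = total / n

# Sample variance (n - 1 in the denominator), an assumed convention.
variance = sum(count * (value - mean) ** 2 for value, count in counts.items()) / (n - 1)
std_dev = math.sqrt(variance)
cv = std_dev / mean  # coefficient of variation

print(n, total, mean, round(variance, 7), round(std_dev, 6), round(cv, 8))
```

Under that assumption the results agree with the Sum, Mean, Variance, Standard deviation and CV rows reported for this variable.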
관리지사코드
Real number (ℝ)
Distinct | 178 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 444.5311 |
Minimum | 101 |
---|---|
Maximum | 802 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 113 |
Q1 | 242 |
median | 405 |
Q3 | 664 |
95-th percentile | 762 |
Maximum | 802 |
Range | 701 |
Interquartile range (IQR) | 422 |
Descriptive statistics
Standard deviation | 218.97221 |
---|---|
Coefficient of variation (CV) | 0.49259144 |
Kurtosis | -1.4349458 |
Mean | 444.5311 |
Median Absolute Deviation (MAD) | 198 |
Skewness | 0.023513377 |
Sum | 4445311 |
Variance | 47948.83 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
716 | 141 | 1.4% |
765 | 141 | 1.4% |
606 | 137 | 1.4% |
704 | 136 | 1.4% |
401 | 128 | 1.3% |
405 | 128 | 1.3% |
604 | 122 | 1.2% |
505 | 115 | 1.1% |
416 | 110 | 1.1% |
771 | 104 | 1.0% |
Other values (168) | 8738 | 87.4% |
Minimum 10 values
Value | Count | Frequency (%) |
101 | 41 | 0.4% |
103 | 35 | 0.4% |
104 | 38 | 0.4% |
105 | 44 | 0.4% |
106 | 44 | 0.4% |
107 | 36 | 0.4% |
108 | 46 | 0.5% |
109 | 51 | 0.5% |
110 | 43 | 0.4% |
111 | 46 | 0.5% |
Maximum 10 values
Value | Count | Frequency (%) |
802 | 53 | 0.5% |
801 | 38 | 0.4% |
771 | 104 | 1.0% |
769 | 53 | 0.5% |
767 | 43 | 0.4% |
765 | 141 | 1.4% |
762 | 72 | 0.7% |
759 | 72 | 0.7% |
757 | 36 | 0.4% |
756 | 38 | 0.4% |
소속지사구분코드
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | 7691 |
---|---|
1 | 1844 |
2 | 465 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 7691 | 76.9% |
1 | 1844 | 18.4% |
2 | 465 | 4.7% |
대상자연령
Real number (ℝ)
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 57.047 |
Minimum | 10 |
---|---|
Maximum | 80 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 20 |
Q1 | 50 |
median | 60 |
Q3 | 70 |
95-th percentile | 80 |
Maximum | 80 |
Range | 70 |
Interquartile range (IQR) | 20 |
Descriptive statistics
Standard deviation | 17.787676 |
---|---|
Coefficient of variation (CV) | 0.31180739 |
Kurtosis | -0.20970883 |
Mean | 57.047 |
Median Absolute Deviation (MAD) | 10 |
Skewness | -0.61574988 |
Sum | 570470 |
Variance | 316.40143 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
70 | 2048 | 20.5% |
60 | 1973 | 19.7% |
50 | 1803 | 18.0% |
80 | 1776 | 17.8% |
40 | 1336 | 13.4% |
20 | 454 | 4.5% |
30 | 394 | 3.9% |
10 | 216 | 2.2% |
Minimum 10 values
Value | Count | Frequency (%) |
10 | 216 | 2.2% |
20 | 454 | 4.5% |
30 | 394 | 3.9% |
40 | 1336 | 13.4% |
50 | 1803 | 18.0% |
60 | 1973 | 19.7% |
70 | 2048 | 20.5% |
80 | 1776 | 17.8% |
Maximum 10 values
Value | Count | Frequency (%) |
80 | 1776 | 17.8% |
70 | 2048 | 20.5% |
60 | 1973 | 19.7% |
50 | 1803 | 18.0% |
40 | 1336 | 13.4% |
30 | 394 | 3.9% |
20 | 454 | 4.5% |
10 | 216 | 2.2% |
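The median and the Median Absolute Deviation (MAD) reported for 대상자연령 can be reproduced from the frequency table. The sketch below expands the counts back into one value per observation and takes the MAD as the median of absolute deviations from the median; treating that as the report's definition is an assumption:

```python
import statistics

# Value -> count for 대상자연령, copied from the frequency table.
counts = {10: 216, 20: 454, 30: 394, 40: 1336, 50: 1803, 60: 1973, 70: 2048, 80: 1776}

# Expand the frequency table into one value per observation.
values = [age for age, count in counts.items() for _ in range(count)]

median = statistics.median(values)

# MAD: median of the absolute deviations from the median.
mad = statistics.median(abs(v - median) for v in values)

print(len(values), median, mad)
```

With these counts the sketch gives a median of 60 and a MAD of 10, in line with the quantile and descriptive statistics tables for this variable.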
건강검진대상유형코드
Categorical
Distinct | 9 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
A0 | 1688 |
---|---|
D5 | 1532 |
D6 | 1161 |
D1 | 1151 |
D3 | 1127 |
Other values (4) | 3341 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | D2 |
---|---|
2nd row | D3 |
3rd row | D7 |
4th row | D3 |
5th row | D6 |
Common Values
Value | Count | Frequency (%) |
A0 | 1688 | 16.9% |
D5 | 1532 | 15.3% |
D6 | 1161 | 11.6% |
D1 | 1151 | 11.5% |
D3 | 1127 | 11.3% |
D4 | 1105 | 11.1% |
D2 | 900 | 9.0% |
A5 | 685 | 6.9% |
D7 | 651 | 6.5% |
대상자인원수
Real number (ℝ)
HIGH CORRELATION
Distinct | 5907 |
---|---|
Distinct (%) | 59.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7584.2882 |
Minimum | 1 |
---|---|
Maximum | 142013 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 52 |
Q1 | 524 |
median | 2176.5 |
Q3 | 9103 |
95-th percentile | 33385.6 |
Maximum | 142013 |
Range | 142012 |
Interquartile range (IQR) | 8579 |
Descriptive statistics
Standard deviation | 12877.772 |
---|---|
Coefficient of variation (CV) | 1.6979539 |
Kurtosis | 17.701979 |
Mean | 7584.2882 |
Median Absolute Deviation (MAD) | 2029.5 |
Skewness | 3.4544721 |
Sum | 75842882 |
Variance | 1.6583701 × 10^8 |
Monotonicity | Not monotonic |
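The summary figures for 대상자인원수 are tied together by simple identities: the variance is the square of the standard deviation, the coefficient of variation is the standard deviation divided by the mean, and the IQR is Q3 minus Q1. A quick consistency check using only numbers from the tables for this variable (the variable names are illustrative):

```python
import math

# Figures copied from the 대상자인원수 tables.
mean = 7584.2882
std_dev = 12877.772
q1, q3 = 524, 9103

variance = std_dev ** 2  # on the order of 1.66e8, i.e. about 166 million
cv = std_dev / mean      # coefficient of variation
iqr = q3 - q1            # interquartile range

print(f"variance = {variance:.7e}")
print(f"CV       = {cv:.7f}")
print(f"IQR      = {iqr}")
```

The recomputed variance is roughly 1.66 × 10^8, which confirms that the reported variance is a power-of-ten figure rather than a value multiplied by 108.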
Common values
Value | Count | Frequency (%) |
1 | 53 | 0.5% |
96 | 18 | 0.2% |
9 | 17 | 0.2% |
66 | 17 | 0.2% |
82 | 17 | 0.2% |
105 | 16 | 0.2% |
13 | 15 | 0.1% |
103 | 14 | 0.1% |
39 | 14 | 0.1% |
4 | 14 | 0.1% |
Other values (5897) | 9805 | 98.1% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 53 | 0.5% |
2 | 8 | 0.1% |
3 | 12 | 0.1% |
4 | 14 | 0.1% |
5 | 5 | 0.1% |
6 | 9 | 0.1% |
7 | 13 | 0.1% |
8 | 8 | 0.1% |
9 | 17 | 0.2% |
10 | 5 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
142013 | 1 | < 0.1% |
141904 | 1 | < 0.1% |
141816 | 1 | < 0.1% |
141628 | 1 | < 0.1% |
128136 | 1 | < 0.1% |
128084 | 1 | < 0.1% |
127900 | 1 | < 0.1% |
118513 | 1 | < 0.1% |
118059 | 1 | < 0.1% |
115711 | 1 | < 0.1% |
수검자인원수
Real number (ℝ)
HIGH CORRELATION
ZEROS
Distinct | 1996 |
---|---|
Distinct (%) | 20.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 472.4716 |
Minimum | 0 |
---|---|
Maximum | 9774 |
Zeros | 264 |
Zeros (%) | 2.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 2 |
Q1 | 35 |
median | 170 |
Q3 | 614 |
95-th percentile | 1922 |
Maximum | 9774 |
Range | 9774 |
Interquartile range (IQR) | 579 |
Descriptive statistics
Standard deviation | 703.46407 |
---|---|
Coefficient of variation (CV) | 1.4889023 |
Kurtosis | 10.585367 |
Mean | 472.4716 |
Median Absolute Deviation (MAD) | 158 |
Skewness | 2.6485137 |
Sum | 4724716 |
Variance | 494861.7 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
0 | 264 | 2.6% |
1 | 180 | 1.8% |
2 | 134 | 1.3% |
4 | 102 | 1.0% |
3 | 99 | 1.0% |
5 | 96 | 1.0% |
11 | 93 | 0.9% |
7 | 91 | 0.9% |
6 | 91 | 0.9% |
8 | 89 | 0.9% |
Other values (1986) | 8761 | 87.6% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 264 | 2.6% |
1 | 180 | 1.8% |
2 | 134 | 1.3% |
3 | 99 | 1.0% |
4 | 102 | 1.0% |
5 | 96 | 1.0% |
6 | 91 | 0.9% |
7 | 91 | 0.9% |
8 | 89 | 0.9% |
9 | 68 | 0.7% |
Maximum 10 values
Value | Count | Frequency (%) |
9774 | 1 | < 0.1% |
6333 | 1 | < 0.1% |
5760 | 1 | < 0.1% |
5371 | 1 | < 0.1% |
5343 | 1 | < 0.1% |
5306 | 1 | < 0.1% |
5178 | 1 | < 0.1% |
5102 | 1 | < 0.1% |
4948 | 1 | < 0.1% |
4884 | 1 | < 0.1% |
수검율
Real number (ℝ)
ZEROS
Distinct | 1766 |
---|---|
Distinct (%) | 17.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.978185 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 264 |
Zeros (%) | 2.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1.6 |
Q1 | 4.57 |
median | 7.385 |
Q3 | 10.4325 |
95-th percentile | 15.92 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 5.8625 |
Descriptive statistics
Standard deviation | 5.9923127 |
---|---|
Coefficient of variation (CV) | 0.75108721 |
Kurtosis | 96.621141 |
Mean | 7.978185 |
Median Absolute Deviation (MAD) | 2.935 |
Skewness | 6.8195187 |
Sum | 79781.85 |
Variance | 35.907812 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
0.0 | 264 | 2.6% |
11.11 | 25 | 0.2% |
9.09 | 22 | 0.2% |
8.61 | 22 | 0.2% |
5.66 | 22 | 0.2% |
6.25 | 20 | 0.2% |
9.5 | 20 | 0.2% |
5.88 | 20 | 0.2% |
7.89 | 20 | 0.2% |
6.67 | 19 | 0.2% |
Other values (1756) | 9546 | 95.5% |
Minimum 10 values
Value | Count | Frequency (%) |
0.0 | 264 | 2.6% |
0.11 | 1 | < 0.1% |
0.13 | 1 | < 0.1% |
0.16 | 1 | < 0.1% |
0.43 | 2 | < 0.1% |
0.45 | 1 | < 0.1% |
0.54 | 2 | < 0.1% |
0.56 | 3 | < 0.1% |
0.61 | 2 | < 0.1% |
0.64 | 1 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
100.0 | 17 | 0.2% |
75.0 | 1 | < 0.1% |
66.67 | 2 | < 0.1% |
42.86 | 2 | < 0.1% |
33.33 | 7 | 0.1% |
32.0 | 2 | < 0.1% |
31.76 | 1 | < 0.1% |
31.42 | 1 | < 0.1% |
31.27 | 1 | < 0.1% |
28.57 | 1 | < 0.1% |
Variable | 센터구분코드 | 관리지사코드 | 소속지사구분코드 | 대상자연령 | 건강검진대상유형코드 | 대상자인원수 | 수검자인원수 | 수검율 |
---|---|---|---|---|---|---|---|---|
센터구분코드 | 1.000 | 0.935 | 0.388 | 0.000 | 0.000 | 0.191 | 0.159 | 0.153 |
관리지사코드 | 0.935 | 1.000 | 0.469 | 0.000 | 0.000 | 0.255 | 0.248 | 0.141 |
소속지사구분코드 | 0.388 | 0.469 | 1.000 | 0.000 | 0.046 | 0.348 | 0.249 | 0.166 |
대상자연령 | 0.000 | 0.000 | 0.000 | 1.000 | 0.455 | 0.204 | 0.293 | 0.375 |
건강검진대상유형코드 | 0.000 | 0.000 | 0.046 | 0.455 | 1.000 | 0.486 | 0.304 | 0.394 |
대상자인원수 | 0.191 | 0.255 | 0.348 | 0.204 | 0.486 | 1.000 | 0.634 | 0.153 |
수검자인원수 | 0.159 | 0.248 | 0.249 | 0.293 | 0.304 | 0.634 | 1.000 | 0.107 |
수검율 | 0.153 | 0.141 | 0.166 | 0.375 | 0.394 | 0.153 | 0.107 | 1.000 |
Variable | 건강검진대상유형코드 | 소속지사구분코드 |
---|---|---|
건강검진대상유형코드 | 1.000 | 0.020 |
소속지사구분코드 | 0.020 | 1.000 |
Variable | 센터구분코드 | 관리지사코드 | 대상자연령 | 대상자인원수 | 수검자인원수 | 수검율 | 소속지사구분코드 | 건강검진대상유형코드 |
---|---|---|---|---|---|---|---|---|
센터구분코드 | 1.000 | 0.099 | -0.008 | 0.004 | 0.009 | 0.022 | 0.174 | 0.000 |
관리지사코드 | 0.099 | 1.000 | -0.001 | -0.282 | -0.275 | 0.058 | 0.319 | 0.000 |
대상자연령 | -0.008 | -0.001 | 1.000 | -0.192 | -0.198 | 0.042 | 0.000 | 0.243 |
대상자인원수 | 0.004 | -0.282 | -0.192 | 1.000 | 0.954 | -0.127 | 0.164 | 0.174 |
수검자인원수 | 0.009 | -0.275 | -0.198 | 0.954 | 1.000 | 0.141 | 0.162 | 0.154 |
수검율 | 0.022 | 0.058 | 0.042 | -0.127 | 0.141 | 1.000 | 0.104 | 0.204 |
소속지사구분코드 | 0.174 | 0.319 | 0.000 | 0.164 | 0.162 | 0.104 | 1.000 | 0.020 |
건강검진대상유형코드 | 0.000 | 0.000 | 0.243 | 0.174 | 0.154 | 0.204 | 0.020 | 1.000 |
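The report does not say which coefficient each correlation matrix uses. As an illustration only, the sketch below computes a plain Pearson correlation between 대상자인원수 and 수검자인원수 on the first ten sample rows of this report. The helper `pearson` is hypothetical, and a ten-row result is not expected to reproduce any entry of the matrices:

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and the two sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(ss_x * ss_y)


# 대상자인원수 and 수검자인원수 from the first ten sample rows of the report.
eligible = [8920, 2447, 1256, 9141, 1010, 401, 1232, 542, 1714, 39501]
screened = [392, 148, 33, 843, 122, 58, 22, 84, 210, 913]

r = pearson(eligible, screened)
print(f"Pearson r on 10 sample rows: {r:.3f}")
```

A positive value here is consistent in sign with the HIGH CORRELATION alert raised for both count columns, but the full 10,000 rows are needed to compare against the reported matrices.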
First rows
Index | 사업년도 | 센터구분코드 | 관리지사코드 | 소속지사구분코드 | 대상자연령 | 건강검진대상유형코드 | 대상자인원수 | 수검자인원수 | 수검율 |
---|---|---|---|---|---|---|---|---|---|
51122 | 2020 | 6 | 311 | 0 | 70 | D2 | 8920 | 392 | 4.39 |
27299 | 2020 | 5 | 560 | 0 | 40 | D3 | 2447 | 148 | 6.05 |
89200 | 2020 | 1 | 134 | 0 | 60 | D7 | 1256 | 33 | 2.63 |
21052 | 2020 | 2 | 205 | 0 | 60 | D3 | 9141 | 843 | 9.22 |
65729 | 2020 | 6 | 309 | 0 | 70 | D6 | 1010 | 122 | 12.08 |
74496 | 2020 | 2 | 262 | 0 | 70 | D4 | 401 | 58 | 14.46 |
19911 | 2020 | 2 | 203 | 0 | 60 | D7 | 1232 | 22 | 1.79 |
31664 | 2020 | 5 | 252 | 0 | 70 | D4 | 542 | 84 | 15.5 |
88860 | 2020 | 6 | 328 | 0 | 60 | D6 | 1714 | 210 | 12.25 |
28565 | 2020 | 6 | 306 | 0 | 50 | D2 | 39501 | 913 | 2.31 |
Last rows
Index | 사업년도 | 센터구분코드 | 관리지사코드 | 소속지사구분코드 | 대상자연령 | 건강검진대상유형코드 | 대상자인원수 | 수검자인원수 | 수검율 |
---|---|---|---|---|---|---|---|---|---|
20920 | 2020 | 3 | 704 | 2 | 30 | D5 | 239 | 18 | 7.53 |
53937 | 2020 | 1 | 418 | 0 | 70 | D2 | 5564 | 266 | 4.78 |
61474 | 2020 | 2 | 753 | 0 | 80 | D3 | 1710 | 41 | 2.4 |
36477 | 2020 | 6 | 318 | 0 | 80 | D1 | 8155 | 568 | 6.97 |
66813 | 2020 | 6 | 232 | 0 | 60 | D1 | 22741 | 2435 | 10.71 |
55408 | 2020 | 1 | 141 | 0 | 80 | A0 | 10592 | 647 | 6.11 |
55681 | 2020 | 4 | 611 | 0 | 70 | D7 | 278 | 11 | 3.96 |
65344 | 2020 | 6 | 326 | 0 | 70 | D6 | 176 | 16 | 9.09 |
87061 | 2020 | 6 | 342 | 0 | 40 | D5 | 22843 | 1751 | 7.67 |
6460 | 2020 | 3 | 705 | 1 | 20 | A0 | 1125 | 8 | 0.71 |
Most frequently occurring
Index | 사업년도 | 센터구분코드 | 관리지사코드 | 소속지사구분코드 | 대상자연령 | 건강검진대상유형코드 | 대상자인원수 | 수검자인원수 | 수검율 | # duplicates |
---|---|---|---|---|---|---|---|---|---|---|
72 | 2020 | 1 | 404 | 1 | 70 | D7 | 43 | 2 | 4.65 | 4 |
97 | 2020 | 1 | 408 | 0 | 60 | A5 | 88 | 13 | 14.77 | 4 |
134 | 2020 | 2 | 209 | 0 | 60 | A5 | 306 | 34 | 11.11 | 4 |
270 | 2020 | 3 | 716 | 2 | 70 | D6 | 82 | 8 | 9.76 | 4 |
329 | 2020 | 4 | 606 | 0 | 10 | A0 | 1 | 0 | 0.0 | 4 |
412 | 2020 | 4 | 666 | 1 | 70 | D7 | 21 | 2 | 9.52 | 4 |
514 | 2020 | 5 | 555 | 1 | 40 | D1 | 1 | 0 | 0.0 | 4 |
574 | 2020 | 6 | 312 | 1 | 10 | A0 | 13 | 0 | 0.0 | 4 |
612 | 2020 | 6 | 333 | 0 | 80 | A5 | 466 | 16 | 3.43 | 4 |
9 | 2020 | 1 | 107 | 0 | 50 | D7 | 434 | 11 | 2.53 | 3 |
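A table like "Most frequently occurring" can be approximated by counting identical records. The sketch below uses a hypothetical six-record miniature dataset with only four of the nine columns; how the report itself defines and counts duplicate rows is not stated here, so both common conventions are computed:

```python
from collections import Counter

# Hypothetical miniature dataset; each tuple is one record:
# (센터구분코드, 관리지사코드, 대상자인원수, 수검자인원수).
records = [
    (1, 404, 43, 2),
    (1, 404, 43, 2),
    (4, 606, 1, 0),
    (1, 404, 43, 2),
    (4, 606, 1, 0),
    (6, 311, 8920, 392),
]

occurrences = Counter(records)

# Keep only records seen more than once, most frequent first.
duplicates = [(row, n) for row, n in occurrences.most_common() if n > 1]

# Convention 1: number of distinct record patterns that repeat.
n_duplicated_patterns = len(duplicates)

# Convention 2: number of rows that repeat an earlier row.
n_redundant_rows = sum(n - 1 for _, n in duplicates)

for row, n in duplicates:
    print(row, n)
print(n_duplicated_patterns, n_redundant_rows)
```

Tuples are hashable, so `Counter` can use whole records as keys without any extra encoding step.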