Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 55 |
Missing cells (%) | 0.1% |
Duplicate rows | 357 |
Duplicate rows (%) | 3.6% |
Total size in memory | 566.4 KiB |
Average record size in memory | 58.0 B |
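The percentages and the average record size above follow from the raw counts. A minimal Python sketch of that arithmetic, using only figures copied from the table (the variable names are illustrative, and 1 KiB is assumed to be 1024 bytes):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 6
n_observations = 10_000
missing_cells = 55
duplicate_rows = 357
total_size_kib = 566.4

# Share of missing cells over all cells (rows x columns).
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Share of duplicate rows over all rows.
duplicate_pct = 100 * duplicate_rows / n_observations

# Average bytes per record, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"{missing_pct:.1f}% missing, {duplicate_pct:.1f}% duplicates, "
      f"{avg_record_bytes:.1f} B per record")
```

Rounded to one decimal place, the three results agree with the reported 0.1%, 3.6%, and 58.0 B.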
Variable types
Text | 1 |
---|---|
Categorical | 3 |
Numeric | 1 |
Boolean | 1 |
Dataset
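Description | Dates on which counselors viewed checkup records and related information, per subject of post-checkup follow-up management. Fields: (1) 건강사후관리번호 (post-checkup management number); (2) view date (shown per screen viewed); (3) branch code; (4) 삭제여부 (deletion flag; Y: deleted, N: not deleted). Scope of data provided: the most recent 1 month by view date (July 28, 2023 to August 28, 2023). |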
---|---|
URL | https://www.data.go.kr/data/15120942/fileData.do |
발췌년도 has constant value "2023" | Constant |
발췌년월 has constant value "2023-08" | Constant |
삭제여부 has constant value "False" | Constant |
Dataset has 357 (3.6%) duplicate rows | Duplicates |
Reproduction
Analysis started | 2023-12-12 15:08:35.247650 |
---|---|
Analysis finished | 2023-12-12 15:08:35.836820 |
Duration | 0.59 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
건강사후관리번호
Text
Distinct | 3195 |
---|---|
Distinct (%) | 31.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 15 |
---|---|
Median length | 11 |
Mean length | 12.2772 |
Min length | 11 |
Characters and Unicode
Total characters | 122772 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 3193 |
---|---|
Unique (%) | 31.9% |
Sample
1st row | 5.02202E+14 |
---|---|
2nd row | 5.02202E+14 |
3rd row | A01202301306493 |
4th row | 5.02202E+14 |
5th row | A01202301316368 |
Value | Count | Frequency (%) |
5.02202e+14 | 3422 | |
5.01202e+14 | 3385 | |
a01202301312497 | 1 | < 0.1% |
a01202301332645 | 1 | < 0.1% |
a01202301304199 | 1 | < 0.1% |
a01202301304114 | 1 | < 0.1% |
a01202301325792 | 1 | < 0.1% |
a01202301308907 | 1 | < 0.1% |
a01202301319596 | 1 | < 0.1% |
a01202301309107 | 1 | < 0.1% |
Other values (3185) | 3185 |
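Roughly two thirds of the 건강사후관리번호 values (3422 + 3385 = 6807) appear in scientific notation such as 5.02202E+14. This suggests, though the report does not confirm it, that numeric identifiers were reformatted by a spreadsheet or a float conversion before export, collapsing many distinct IDs into two strings. The sketch below flags such values with a regular expression; the helper name `looks_like_sci_notation` is hypothetical and not part of any tooling used for the report.

```python
import re

# Matches strings such as "5.02202E+14": digits, a decimal point,
# more digits, then an exponent. IDs like "A01202301306493" do not match.
SCI_NOTATION = re.compile(r"\d+\.\d+[eE][+-]?\d+")

def looks_like_sci_notation(value: str) -> bool:
    """Return True if the whole string is a number in scientific notation."""
    return SCI_NOTATION.fullmatch(value.strip()) is not None

# Values taken from the Sample table of the report.
sample = ["5.02202E+14", "5.02202E+14", "A01202301306493",
          "5.02202E+14", "A01202301316368"]
flagged = [v for v in sample if looks_like_sci_notation(v)]
print(f"{len(flagged)} of {len(sample)} sample values look reformatted")

# Length check against the report: 6807 values of 11 characters plus
# 3193 values of 15 characters should match the reported total characters.
total_chars = 6807 * len("5.02202E+14") + 3193 * len("A01202301306493")
print(total_chars)
```

The computed total matches the 122772 characters listed under "Characters and Unicode". If the source file is read again, loading this column as text rather than as a number may avoid further conversion on the reader's side, although digits already rounded away in the file cannot be recovered.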
Most occurring characters
Value | Count | Frequency (%) |
2 | 25761 | |
0 | 25230 | |
1 | 18905 | |
5 | 8074 | 6.6% |
4 | 8053 | 6.6% |
3 | 7951 | 6.5% |
. | 6807 | 5.5% |
E | 6807 | 5.5% |
+ | 6807 | 5.5% |
A | 3193 | 2.6% |
Other values (4) | 5184 | 4.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 99158 | |
Uppercase Letter | 10000 | 8.1% |
Other Punctuation | 6807 | 5.5% |
Math Symbol | 6807 | 5.5% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 25761 | |
0 | 25230 | |
1 | 18905 | |
5 | 8074 | 8.1% |
4 | 8053 | 8.1% |
3 | 7951 | 8.0% |
6 | 1307 | 1.3% |
8 | 1294 | 1.3% |
7 | 1292 | 1.3% |
9 | 1291 | 1.3% |
Uppercase Letter
Value | Count | Frequency (%) |
E | 6807 | |
A | 3193 |
Other Punctuation
Value | Count | Frequency (%) |
. | 6807 |
Math Symbol
Value | Count | Frequency (%) |
+ | 6807 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 112772 | |
Latin | 10000 | 8.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 25761 | |
0 | 25230 | |
1 | 18905 | |
5 | 8074 | 7.2% |
4 | 8053 | 7.1% |
3 | 7951 | 7.1% |
. | 6807 | 6.0% |
+ | 6807 | 6.0% |
6 | 1307 | 1.2% |
8 | 1294 | 1.1% |
Other values (2) | 2583 | 2.3% |
Latin
Value | Count | Frequency (%) |
E | 6807 | |
A | 3193 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 122772 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 25761 | |
0 | 25230 | |
1 | 18905 | |
5 | 8074 | 6.6% |
4 | 8053 | 6.6% |
3 | 7951 | 6.5% |
. | 6807 | 5.5% |
E | 6807 | 5.5% |
+ | 6807 | 5.5% |
A | 3193 | 2.6% |
Other values (4) | 5184 | 4.2% |
건강사후업무구분코드
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 502 |
---|---|
2nd row | 502 |
3rd row | A01 |
4th row | 502 |
5th row | A01 |
Common Values
Value | Count | Frequency (%) |
502 | 3422 | |
501 | 3385 | |
A01 | 3193 |
수행지사코드
Real number (ℝ)
Distinct | 179 |
---|---|
Distinct (%) | 1.8% |
Missing | 55 |
Missing (%) | 0.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 387.14781 |
Minimum | 101 |
---|---|
Maximum | 9998 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 101 |
---|---|
5-th percentile | 109 |
Q1 | 211 |
median | 313 |
Q3 | 562 |
95-th percentile | 755 |
Maximum | 9998 |
Range | 9897 |
Interquartile range (IQR) | 351 |
Descriptive statistics
Standard deviation | 431.40286 |
---|---|
Coefficient of variation (CV) | 1.1143105 |
Kurtosis | 368.91229 |
Mean | 387.14781 |
Median Absolute Deviation (MAD) | 173 |
Skewness | 16.729079 |
Sum | 3850185 |
Variance | 186108.42 |
Monotonicity | Not monotonic |
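Several of these statistics are tied together by simple identities: range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared, and sum / mean gives the number of non-missing values. A short Python check using the rounded figures reported above (the tolerances are assumptions chosen to absorb that rounding):

```python
import math

# Figures copied from the report for 수행지사코드.
minimum, maximum = 101, 9998
q1, q3 = 211, 562
mean, std = 387.14781, 431.40286
total = 3850185

value_range = maximum - minimum        # reported: 9897
iqr = q3 - q1                          # reported: 351
cv = std / mean                        # reported: 1.1143105
variance = std ** 2                    # reported: 186108.42
n_non_missing = round(total / mean)    # 10000 rows minus 55 missing

assert value_range == 9897
assert iqr == 351
assert math.isclose(cv, 1.1143105, abs_tol=1e-6)
assert math.isclose(variance, 186108.42, abs_tol=0.02)
assert n_non_missing == 10000 - 55
print("reported statistics are mutually consistent")
```

Because 수행지사코드 is a branch code rather than a measurement, the mean, skewness, and kurtosis are probably driven largely by the single code 9998 and may be of limited use; treating the column as categorical is one option.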
Common Values
Value | Count | Frequency (%) |
112 | 168 | 1.7% |
321 | 142 | 1.4% |
232 | 141 | 1.4% |
312 | 132 | 1.3% |
235 | 119 | 1.2% |
551 | 113 | 1.1% |
251 | 113 | 1.1% |
767 | 112 | 1.1% |
342 | 111 | 1.1% |
131 | 108 | 1.1% |
Other values (169) | 8686 |
Minimum 10 values
Value | Count | Frequency (%) |
101 | 70 | |
103 | 48 | |
104 | 56 | |
105 | 77 | |
106 | 83 | |
107 | 54 | |
108 | 56 | |
109 | 65 | |
110 | 64 | |
111 | 103 |
Maximum 10 values
Value | Count | Frequency (%) |
9998 | 15 | 0.1% |
802 | 28 | 0.3% |
801 | 60 | |
771 | 51 | |
769 | 27 | 0.3% |
767 | 112 | |
765 | 35 | 0.4% |
762 | 22 | 0.2% |
759 | 25 | 0.2% |
757 | 57 |
발췌년도
Categorical
Constant
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2023 |
---|---|
2nd row | 2023 |
3rd row | 2023 |
4th row | 2023 |
5th row | 2023 |
Common Values
Value | Count | Frequency (%) |
2023 | 10000 |
발췌년월
Categorical
Constant
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2023-08 |
---|---|
2nd row | 2023-08 |
3rd row | 2023-08 |
4th row | 2023-08 |
5th row | 2023-08 |
Common Values
Value | Count | Frequency (%) |
2023-08 | 10000 |
삭제여부
Boolean
Constant
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 87.9 KiB |
Value | Count | Frequency (%) |
False | 10000 |
Variable | 건강사후업무구분코드 | 수행지사코드 |
---|---|---|
건강사후업무구분코드 | 1.000 | 0.016 |
수행지사코드 | 0.016 | 1.000 |
Variable | 수행지사코드 | 건강사후업무구분코드 |
---|---|---|
수행지사코드 | 1.000 | 0.027 |
건강사후업무구분코드 | 0.027 | 1.000 |
First rows
Index | 건강사후관리번호 | 건강사후업무구분코드 | 수행지사코드 | 발췌년도 | 발췌년월 | 삭제여부 |
---|---|---|---|---|---|---|
83493 | 5.02202E+14 | 502 | 129 | 2023 | 2023-08 | N |
61400 | 5.02202E+14 | 502 | 756 | 2023 | 2023-08 | N |
205 | A01202301306493 | A01 | 251 | 2023 | 2023-08 | N |
80691 | 5.02202E+14 | 502 | 251 | 2023 | 2023-08 | N |
9275 | A01202301316368 | A01 | 721 | 2023 | 2023-08 | N |
82527 | 5.02202E+14 | 502 | 652 | 2023 | 2023-08 | N |
84351 | 5.02202E+14 | 502 | 704 | 2023 | 2023-08 | N |
19291 | A01202301318210 | A01 | 716 | 2023 | 2023-08 | N |
49151 | 5.01202E+14 | 501 | 314 | 2023 | 2023-08 | N |
36004 | 5.02202E+14 | 502 | 108 | 2023 | 2023-08 | N |
Last rows
Index | 건강사후관리번호 | 건강사후업무구분코드 | 수행지사코드 | 발췌년도 | 발췌년월 | 삭제여부 |
---|---|---|---|---|---|---|
9995 | A01202301325681 | A01 | 305 | 2023 | 2023-08 | N |
72408 | 5.02202E+14 | 502 | 751 | 2023 | 2023-08 | N |
26270 | A01202301326909 | A01 | 237 | 2023 | 2023-08 | N |
24096 | A01202301312497 | A01 | 264 | 2023 | 2023-08 | N |
32492 | 5.02202E+14 | 502 | 321 | 2023 | 2023-08 | N |
28801 | 5.01202E+14 | 501 | 130 | 2023 | 2023-08 | N |
20156 | A01202301328538 | A01 | 254 | 2023 | 2023-08 | N |
10150 | A01202301304833 | A01 | 140 | 2023 | 2023-08 | N |
43069 | A01202301330182 | A01 | 318 | 2023 | 2023-08 | N |
78164 | 5.01202E+14 | 501 | 221 | 2023 | 2023-08 | N |
Most frequently occurring
Index | 건강사후관리번호 | 건강사후업무구분코드 | 수행지사코드 | 발췌년도 | 발췌년월 | 삭제여부 | # duplicates |
---|---|---|---|---|---|---|---|
9 | 5.01202E+14 | 501 | 111 | 2023 | 2023-08 | N | 66 |
188 | 5.02202E+14 | 502 | 112 | 2023 | 2023-08 | N | 65 |
81 | 5.01202E+14 | 501 | 321 | 2023 | 2023-08 | N | 61 |
10 | 5.01202E+14 | 501 | 112 | 2023 | 2023-08 | N | 60 |
172 | 5.01202E+14 | 501 | 767 | 2023 | 2023-08 | N | 59 |
227 | 5.02202E+14 | 502 | 232 | 2023 | 2023-08 | N | 52 |
229 | 5.02202E+14 | 502 | 235 | 2023 | 2023-08 | N | 52 |
56 | 5.01202E+14 | 501 | 251 | 2023 | 2023-08 | N | 51 |
48 | 5.01202E+14 | 501 | 232 | 2023 | 2023-08 | N | 50 |
65 | 5.01202E+14 | 501 | 302 | 2023 | 2023-08 | N | 50 |
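The "# duplicates" column sums to 566 for these ten rows alone, more than the 357 reported under "Duplicate rows". So 357 presumably counts distinct row patterns that occur more than once rather than redundant rows; this is an inference from the numbers, not something the report states. The distinction can be sketched with `collections.Counter`; the toy rows below are made up for illustration and are not taken from the dataset.

```python
from collections import Counter

# Toy rows (hypothetical): each tuple stands for one full record.
rows = [
    ("5.01202E+14", "501", 111),
    ("5.01202E+14", "501", 111),
    ("5.01202E+14", "501", 111),
    ("5.02202E+14", "502", 112),
    ("5.02202E+14", "502", 112),
    ("A01202301306493", "A01", 251),
]

counts = Counter(rows)

# Distinct row patterns that appear more than once.
duplicated_patterns = {row: n for row, n in counts.items() if n > 1}

# Rows that would be dropped if only the first copy of each pattern were kept.
redundant_rows = sum(n - 1 for n in duplicated_patterns.values())

print(len(duplicated_patterns), redundant_rows)

# Sum of the "# duplicates" column for the ten rows shown in the report.
top10 = [66, 65, 61, 60, 59, 52, 52, 51, 50, 50]
print(sum(top10))
```

In the toy data there are two duplicated patterns but three redundant rows, which shows how the two counts can differ. Many of the real duplicates involve 건강사후관리번호 values in scientific notation, so some may stem from identifiers that lost their distinguishing digits rather than from genuinely repeated records.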