Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 6783 |
Missing cells | 594 |
Missing cells (%) | 1.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 397.6 KiB |
Average record size in memory | 60.0 B |
Variable types
Categorical | 2 |
---|---|
Numeric | 3 |
DateTime | 1 |
Text | 1 |
Dataset
Description | o (내용) 사업장 건강검진 대상자 명단 발급 내역 o (대상) 당해연도 건강검진 대상자가 존재하는 건강보험 가입 사업장 o (변수 레이아웃) 1 작성차수 2 사업장기호 3 단위사업장기호 4 검진년도 5 EDI진행상태(1:미처리, 2: 처리완료, 3: 반송, 4: 삭제) 6 신청일시(EDI 신청한 일시(예: 20200612110446 = 2020년06월12일11시04분46초)) 7 송수신파일명(지사, 사업장기호, 신청발급일시가 포함된 파일명) o (자료제공범위) 조회일자 기준 최근 ‘1개월’ (2023년7월28일~2023년8월28일) |
---|---|
URL | https://www.data.go.kr/data/15121843/fileData.do |
Reproduction
Analysis started | 2023-12-12 23:21:21.588396 |
---|---|
Analysis finished | 2023-12-12 23:21:23.017391 |
Duration | 1.43 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
작성차수
Categorical
Distinct | 31 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 53.1 KiB |
20230822ZZ | 439 |
---|---|
20230817ZZ | 426 |
20230810ZZ | 411 |
20230821ZZ | 398 |
20230816ZZ | 398 |
Other values (26) |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 20230728ZZ |
---|---|
2nd row | 20230728ZZ |
3rd row | 20230728ZZ |
4th row | 20230728ZZ |
5th row | 20230728ZZ |
Common Values
Value | Count | Frequency (%) |
20230822ZZ | 439 | 6.5% |
20230817ZZ | 426 | 6.3% |
20230810ZZ | 411 | 6.1% |
20230821ZZ | 398 | 5.9% |
20230816ZZ | 398 | 5.9% |
20230824ZZ | 374 | 5.5% |
20230828ZZ | 372 | 5.5% |
20230823ZZ | 371 | 5.5% |
20230818ZZ | 344 | 5.1% |
20230825ZZ | 329 | 4.9% |
Other values (21) | 2921 |
Length
Value | Count | Frequency (%) |
20230822zz | 439 | 6.5% |
20230817zz | 426 | 6.3% |
20230810zz | 411 | 6.1% |
20230821zz | 398 | 5.9% |
20230816zz | 398 | 5.9% |
20230824zz | 374 | 5.5% |
20230828zz | 372 | 5.5% |
20230823zz | 371 | 5.5% |
20230818zz | 344 | 5.1% |
20230825zz | 329 | 4.9% |
Other values (21) | 2921 |
사업장기호
Real number (ℝ)
Distinct | 5411 |
---|---|
Distinct (%) | 79.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 72453903 |
Minimum | 10000021 |
---|---|
Maximum | 79720343 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 59.7 KiB |
Quantile statistics
Minimum | 10000021 |
---|---|
5-th percentile | 70065399 |
Q1 | 71043966 |
median | 74670090 |
Q3 | 77330700 |
95-th percentile | 79325956 |
Maximum | 79720343 |
Range | 69720322 |
Interquartile range (IQR) | 6286733.5 |
Descriptive statistics
Standard deviation | 11022652 |
---|---|
Coefficient of variation (CV) | 0.1521333 |
Kurtosis | 23.098609 |
Mean | 72453903 |
Median Absolute Deviation (MAD) | 3372853 |
Skewness | -4.7191784 |
Sum | 4.9145482 × 1011 |
Variance | 1.2149885 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
71108887 | 13 | 0.2% |
71106811 | 12 | 0.2% |
71114420 | 12 | 0.2% |
72296310 | 11 | 0.2% |
76384358 | 10 | 0.1% |
70123482 | 10 | 0.1% |
71098879 | 10 | 0.1% |
76755096 | 9 | 0.1% |
71103263 | 9 | 0.1% |
75605310 | 8 | 0.1% |
Other values (5401) | 6679 |
Value | Count | Frequency (%) |
10000021 | 1 | |
10000070 | 2 | |
10000158 | 1 | |
10000164 | 1 | |
10000186 | 1 | |
10000224 | 1 | |
10000229 | 2 | |
10000354 | 1 | |
10000583 | 1 | |
10000600 | 1 |
Value | Count | Frequency (%) |
79720343 | 1 | < 0.1% |
79719144 | 1 | < 0.1% |
79718714 | 1 | < 0.1% |
79718423 | 1 | < 0.1% |
79717774 | 1 | < 0.1% |
79717673 | 1 | < 0.1% |
79717043 | 1 | < 0.1% |
79715829 | 3 | |
79715777 | 1 | < 0.1% |
79715458 | 1 | < 0.1% |
단위사업장기호
Real number (ℝ)
SKEWED
  ZEROS
 
Distinct | 14 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.058823529 |
Minimum | 0 |
---|---|
Maximum | 117 |
Zeros | 6737 |
Zeros (%) | 99.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 59.7 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 117 |
Range | 117 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 2.0808464 |
---|---|
Coefficient of variation (CV) | 35.374389 |
Kurtosis | 2666.5059 |
Mean | 0.058823529 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 50.138892 |
Sum | 399 |
Variance | 4.3299218 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 6737 | |
1 | 33 | 0.5% |
5 | 2 | < 0.1% |
13 | 1 | < 0.1% |
18 | 1 | < 0.1% |
3 | 1 | < 0.1% |
117 | 1 | < 0.1% |
16 | 1 | < 0.1% |
17 | 1 | < 0.1% |
110 | 1 | < 0.1% |
Other values (4) | 4 | 0.1% |
Value | Count | Frequency (%) |
0 | 6737 | |
1 | 33 | 0.5% |
2 | 1 | < 0.1% |
3 | 1 | < 0.1% |
4 | 1 | < 0.1% |
5 | 2 | < 0.1% |
7 | 1 | < 0.1% |
13 | 1 | < 0.1% |
16 | 1 | < 0.1% |
17 | 1 | < 0.1% |
Value | Count | Frequency (%) |
117 | 1 | |
110 | 1 | |
49 | 1 | |
18 | 1 | |
17 | 1 | |
16 | 1 | |
13 | 1 | |
7 | 1 | |
5 | 2 | |
4 | 1 |
검진년도
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2022.8664 |
Minimum | 2018 |
---|---|
Maximum | 2023 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 59.7 KiB |
Quantile statistics
Minimum | 2018 |
---|---|
5-th percentile | 2022 |
Q1 | 2023 |
median | 2023 |
Q3 | 2023 |
95-th percentile | 2023 |
Maximum | 2023 |
Range | 5 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.55339843 |
---|---|
Coefficient of variation (CV) | 0.00027357141 |
Kurtosis | 27.009929 |
Mean | 2022.8664 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -5.0054495 |
Sum | 13721103 |
Variance | 0.30624982 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2023 | 6281 | |
2022 | 281 | 4.1% |
2021 | 95 | 1.4% |
2020 | 71 | 1.0% |
2019 | 53 | 0.8% |
2018 | 2 | < 0.1% |
Value | Count | Frequency (%) |
2018 | 2 | < 0.1% |
2019 | 53 | 0.8% |
2020 | 71 | 1.0% |
2021 | 95 | 1.4% |
2022 | 281 | 4.1% |
2023 | 6281 |
Value | Count | Frequency (%) |
2023 | 6281 | |
2022 | 281 | 4.1% |
2021 | 95 | 1.4% |
2020 | 71 | 1.0% |
2019 | 53 | 0.8% |
2018 | 2 | < 0.1% |
EDI진행상태
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 53.1 KiB |
2 | |
---|---|
3 | 477 |
4 | 77 |
1 | 44 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 6185 | |
3 | 477 | 7.0% |
4 | 77 | 1.1% |
1 | 44 | 0.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 6185 | |
3 | 477 | 7.0% |
4 | 77 | 1.1% |
1 | 44 | 0.6% |
신청일시
Date
Distinct | 6758 |
---|---|
Distinct (%) | 99.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 53.1 KiB |
Minimum | 2023-07-28 07:52:34 |
---|---|
Maximum | 2023-08-28 20:55:20 |
송수신파일명
Text
MISSING
 
Distinct | 5883 |
---|---|
Distinct (%) | 95.1% |
Missing | 594 |
Missing (%) | 8.8% |
Memory size | 53.1 KiB |
Length
Max length | 41 |
---|---|
Median length | 41 |
Mean length | 41 |
Min length | 41 |
Characters and Unicode
Total characters | 253749 |
---|---|
Distinct characters | 16 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 5620 ? |
---|---|
Unique (%) | 90.8% |
Sample
1st row | 000s2_00_0208_10001482_20230731095439.dat |
---|---|
2nd row | 000s2_00_0402_10003720_20230728174529.dat |
3rd row | 000s2_00_0226_10006082_20230728152602.dat |
4th row | 000s2_00_0701_10006277_20230728162344.dat |
5th row | 000s2_01_0762_10007554_20230801103140.dat |
Value | Count | Frequency (%) |
000s2_00_0132_70123482_20230828163750.dat | 10 | 0.2% |
000s2_00_0327_71449412_20230807144641.dat | 5 | 0.1% |
000s2_00_0301_72770264_20230811174530.dat | 5 | 0.1% |
000s2_00_0312_75690029_20230810153707.dat | 4 | 0.1% |
000s2_00_0332_70535795_20230822162249.dat | 4 | 0.1% |
000s2_00_0105_77031219_20230814085032.dat | 4 | 0.1% |
000s2_00_0316_78743271_20230811140342.dat | 4 | 0.1% |
000s2_00_0416_78594442_20230828105249.dat | 4 | 0.1% |
000s2_00_0320_76947695_20230821130726.dat | 3 | < 0.1% |
000s2_00_0302_78989756_20230822111137.dat | 3 | < 0.1% |
Other values (5873) | 6143 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 64129 | |
2 | 32333 | |
_ | 24756 | 9.8% |
1 | 21626 | 8.5% |
3 | 17707 | 7.0% |
7 | 14291 | 5.6% |
8 | 13071 | 5.2% |
5 | 10529 | 4.1% |
4 | 10283 | 4.1% |
6 | 7562 | 3.0% |
Other values (6) | 37462 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 198048 | |
Connector Punctuation | 24756 | 9.8% |
Lowercase Letter | 24756 | 9.8% |
Other Punctuation | 6189 | 2.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 64129 | |
2 | 32333 | |
1 | 21626 | 10.9% |
3 | 17707 | 8.9% |
7 | 14291 | 7.2% |
8 | 13071 | 6.6% |
5 | 10529 | 5.3% |
4 | 10283 | 5.2% |
6 | 7562 | 3.8% |
9 | 6517 | 3.3% |
Lowercase Letter
Value | Count | Frequency (%) |
s | 6189 | |
d | 6189 | |
a | 6189 | |
t | 6189 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 24756 |
Other Punctuation
Value | Count | Frequency (%) |
. | 6189 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 228993 | |
Latin | 24756 | 9.8% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 64129 | |
2 | 32333 | |
_ | 24756 | 10.8% |
1 | 21626 | 9.4% |
3 | 17707 | 7.7% |
7 | 14291 | 6.2% |
8 | 13071 | 5.7% |
5 | 10529 | 4.6% |
4 | 10283 | 4.5% |
6 | 7562 | 3.3% |
Other values (2) | 12706 | 5.5% |
Latin
Value | Count | Frequency (%) |
s | 6189 | |
d | 6189 | |
a | 6189 | |
t | 6189 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 253749 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 64129 | |
2 | 32333 | |
_ | 24756 | 9.8% |
1 | 21626 | 8.5% |
3 | 17707 | 7.0% |
7 | 14291 | 5.6% |
8 | 13071 | 5.2% |
5 | 10529 | 4.1% |
4 | 10283 | 4.1% |
6 | 7562 | 3.0% |
Other values (6) | 37462 |
작성차수 | 사업장기호 | 단위사업장기호 | 검진년도 | EDI진행상태 | |
---|---|---|---|---|---|
작성차수 | 1.000 | 0.144 | 0.000 | 0.412 | 0.191 |
사업장기호 | 0.144 | 1.000 | 0.000 | 0.063 | 0.071 |
단위사업장기호 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
검진년도 | 0.412 | 0.063 | 0.000 | 1.000 | 0.121 |
EDI진행상태 | 0.191 | 0.071 | 0.000 | 0.121 | 1.000 |
EDI진행상태 | 작성차수 | |
---|---|---|
EDI진행상태 | 1.000 | 0.100 |
작성차수 | 0.100 | 1.000 |
사업장기호 | 단위사업장기호 | 검진년도 | 작성차수 | EDI진행상태 | |
---|---|---|---|---|---|
사업장기호 | 1.000 | -0.091 | 0.063 | 0.064 | 0.046 |
단위사업장기호 | -0.091 | 1.000 | -0.061 | 0.000 | 0.000 |
검진년도 | 0.063 | -0.061 | 1.000 | 0.185 | 0.109 |
작성차수 | 0.064 | 0.000 | 0.185 | 1.000 | 0.100 |
EDI진행상태 | 0.046 | 0.000 | 0.109 | 0.100 | 1.000 |
작성차수 | 사업장기호 | 단위사업장기호 | 검진년도 | EDI진행상태 | 신청일시 | 송수신파일명 | |
---|---|---|---|---|---|---|---|
0 | 20230728ZZ | 10001482 | 0 | 2023 | 2 | 2023-07-28 11:54:02 | 000s2_00_0208_10001482_20230731095439.dat |
1 | 20230728ZZ | 10003720 | 0 | 2023 | 2 | 2023-07-28 15:35:07 | 000s2_00_0402_10003720_20230728174529.dat |
2 | 20230728ZZ | 10006082 | 0 | 2023 | 2 | 2023-07-28 15:18:10 | 000s2_00_0226_10006082_20230728152602.dat |
3 | 20230728ZZ | 10006277 | 0 | 2023 | 2 | 2023-07-28 16:20:00 | 000s2_00_0701_10006277_20230728162344.dat |
4 | 20230728ZZ | 10007554 | 0 | 2023 | 2 | 2023-07-28 09:40:33 | 000s2_01_0762_10007554_20230801103140.dat |
5 | 20230728ZZ | 10008193 | 0 | 2023 | 2 | 2023-07-28 13:51:22 | 000s2_00_0210_10008193_20230803080649.dat |
6 | 20230728ZZ | 70015432 | 0 | 2023 | 2 | 2023-07-28 14:39:37 | 000s2_00_0113_70015432_20230728154129.dat |
7 | 20230728ZZ | 70015692 | 0 | 2023 | 2 | 2023-07-28 13:02:24 | 000s2_00_0134_70015692_20230728130631.dat |
8 | 20230728ZZ | 70038740 | 0 | 2023 | 2 | 2023-07-28 11:29:07 | 000s2_00_0112_70038740_20230728130100.dat |
9 | 20230728ZZ | 70053235 | 0 | 2023 | 2 | 2023-07-28 15:17:03 | 000s2_00_0328_70053235_20230728155639.dat |
작성차수 | 사업장기호 | 단위사업장기호 | 검진년도 | EDI진행상태 | 신청일시 | 송수신파일명 | |
---|---|---|---|---|---|---|---|
6773 | 20230828ZZ | 71175550 | 0 | 2022 | 2 | 2023-08-28 14:21:35 | 000s2_00_0551_71175550_20230828144412.dat |
6774 | 20230828ZZ | 79318289 | 0 | 2023 | 2 | 2023-08-28 14:53:02 | 000s2_00_0769_79318289_20230828150135.dat |
6775 | 20230828ZZ | 70123482 | 0 | 2023 | 2 | 2023-08-28 14:34:45 | 000s2_00_0132_70123482_20230828163750.dat |
6776 | 20230828ZZ | 70563862 | 0 | 2023 | 2 | 2023-08-28 11:20:43 | 000s2_00_0111_70563862_20230828112746.dat |
6777 | 20230828ZZ | 71175550 | 0 | 2023 | 2 | 2023-08-28 14:21:39 | 000s2_00_0551_71175550_20230828144419.dat |
6778 | 20230828ZZ | 70123482 | 0 | 2023 | 2 | 2023-08-28 14:35:34 | 000s2_00_0132_70123482_20230828163750.dat |
6779 | 20230828ZZ | 70123482 | 0 | 2023 | 2 | 2023-08-28 14:36:14 | 000s2_00_0132_70123482_20230828163750.dat |
6780 | 20230828ZZ | 70123482 | 0 | 2023 | 2 | 2023-08-28 14:38:28 | 000s2_00_0132_70123482_20230828163750.dat |
6781 | 20230828ZZ | 70123482 | 0 | 2023 | 2 | 2023-08-28 15:39:47 | 000s2_00_0132_70123482_20230828163750.dat |
6782 | 20230828ZZ | 70123482 | 0 | 2023 | 2 | 2023-08-28 15:41:57 | 000s2_00_0132_70123482_20230828163750.dat |