Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 62 |
Missing cells | 78 |
Missing cells (%) | 21.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 3.3 KiB |
Average record size in memory | 54.1 B |
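The percentages and sizes above follow from the raw counts. A minimal sketch that re-derives them, assuming the cell count is variables × observations and 1 KiB = 1024 B (the variable names are illustrative and not part of the report):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 6
n_observations = 62
missing_cells = 78
avg_record_bytes = 54.1  # reported average record size in bytes

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size is roughly average record size times row count.
total_kib = avg_record_bytes * n_observations / 1024

print(f"Total cells: {total_cells}")
print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Total size in memory: {total_kib:.1f} KiB")
```

The 78 missing cells are the 39 missing values in each of the two count columns (건수 and 건수.1) reported further down.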
Variable types
Categorical | 3 |
---|---|
Text | 1 |
Numeric | 2 |
Dataset
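Description | Weekly occurrence of Class 1 to 3 legally designated infectious diseases in Gangwon Special Self-Governing Province (major diseases: Middle East respiratory syndrome, chickenpox, measles, cholera, typhoid fever, etc.) |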
---|---|
URL | https://www.data.go.kr/data/15064351/fileData.do |
Reproduction
Analysis started | 2023-12-12 04:25:35.706455 |
---|---|
Analysis finished | 2023-12-12 04:25:36.686698 |
Duration | 0.98 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
질병군
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 4.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 628.0 B |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1급 |
---|---|
2nd row | 1급 |
3rd row | 1급 |
4th row | 1급 |
5th row | 1급 |
Common Values
Value | Count | Frequency (%) |
3급 | 25 | 40.3% |
2급 | 22 | 35.5% |
1급 | 15 | 24.2% |
감염병명
Text
UNIQUE
Distinct | 62 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 628.0 B |
Value | Count | Frequency (%) |
에볼라바이러스 | 1 | 1.6% |
디프테리아 | 1 | 1.6% |
한센병 | 1 | 1.6% |
중증열성혈소판감소증후군 | 1 | 1.6% |
성홍열 | 1 | 1.6% |
vrsa | 1 | 1.6% |
cre | 1 | 1.6% |
e형간염 | 1 | 1.6% |
파상풍 | 1 | 1.6% |
b형간염 | 1 | 1.6% |
Other values (54) | 54 |
Most occurring characters
Value | Count | Frequency (%) |
증 | 16 | 4.8% |
열 | 13 | 3.9% |
염 | 12 | 3.6% |
라 | 9 | 2.7% |
스 | 9 | 2.7% |
성 | 8 | 2.4% |
리 | 8 | 2.4% |
진 | 6 | 1.8% |
감 | 6 | 1.8% |
형 | 5 | 1.5% |
Other values (139) | 238 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 298 | 90.3% |
Uppercase Letter | 17 | 5.2% |
Decimal Number | 4 | 1.2% |
Open Punctuation | 3 | 0.9% |
Close Punctuation | 3 | 0.9% |
Space Separator | 2 | 0.6% |
Lowercase Letter | 2 | 0.6% |
Other Punctuation | 1 | 0.3% |
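The category labels in this table correspond to Unicode general categories (for example, Hangul syllables fall under "Other Letter"). A minimal sketch of how such counts could be reproduced with Python's standard `unicodedata` module; the helper name `count_categories` and the sample strings are illustrative assumptions, not the profiler's own code or the full column:

```python
import unicodedata
from collections import Counter

# Subset of Unicode general-category codes mapped to the labels in the table.
CATEGORY_LABELS = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Zs": "Space Separator",
}

def count_categories(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not listed above.
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts

# Two names from the report plus one hypothetical mixed-script string.
sample = ["성홍열", "파상풍", "CRE (1/2)"]
counts = count_categories(sample)
for label, n in counts.most_common():
    print(label, n)
```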
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
증 | 16 | 5.4% |
열 | 13 | 4.4% |
염 | 12 | 4.0% |
라 | 9 | 3.0% |
스 | 9 | 3.0% |
성 | 8 | 2.7% |
리 | 8 | 2.7% |
진 | 6 | 2.0% |
감 | 6 | 2.0% |
형 | 5 | 1.7% |
Other values (120) | 206 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 4 | |
A | 2 | |
D | 2 | |
E | 2 | |
J | 2 | |
R | 2 | |
V | 1 | 5.9% |
B | 1 | 5.9% |
S | 1 | 5.9% |
Decimal Number
Value | Count | Frequency (%) |
8 | 1 | |
1 | 1 | |
0 | 1 | |
2 | 1 |
Lowercase Letter
Value | Count | Frequency (%) |
v | 1 | |
b | 1 |
Open Punctuation
Value | Count | Frequency (%) |
( | 3 |
Close Punctuation
Value | Count | Frequency (%) |
) | 3 |
Space Separator
Value | Count | Frequency (%) |
(space) | 2 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 298 | 90.3% |
Latin | 19 | 5.8% |
Common | 13 | 3.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
증 | 16 | 5.4% |
열 | 13 | 4.4% |
염 | 12 | 4.0% |
라 | 9 | 3.0% |
스 | 9 | 3.0% |
성 | 8 | 2.7% |
리 | 8 | 2.7% |
진 | 6 | 2.0% |
감 | 6 | 2.0% |
형 | 5 | 1.7% |
Other values (120) | 206 |
Latin
Value | Count | Frequency (%) |
C | 4 | |
A | 2 | |
D | 2 | |
E | 2 | |
J | 2 | |
R | 2 | |
V | 1 | 5.3% |
B | 1 | 5.3% |
S | 1 | 5.3% |
v | 1 | 5.3% |
Common
Value | Count | Frequency (%) |
( | 3 | |
) | 3 | |
(space) | 2 | |
/ | 1 | 7.7% |
8 | 1 | 7.7% |
1 | 1 | 7.7% |
0 | 1 | 7.7% |
2 | 1 | 7.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 298 | 90.3% |
ASCII | 32 | 9.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
증 | 16 | 5.4% |
열 | 13 | 4.4% |
염 | 12 | 4.0% |
라 | 9 | 3.0% |
스 | 9 | 3.0% |
성 | 8 | 2.7% |
리 | 8 | 2.7% |
진 | 6 | 2.0% |
감 | 6 | 2.0% |
형 | 5 | 1.7% |
Other values (120) | 206 |
ASCII
Value | Count | Frequency (%) |
C | 4 | |
( | 3 | 9.4% |
) | 3 | 9.4% |
A | 2 | 6.2% |
D | 2 | 6.2% |
E | 2 | 6.2% |
J | 2 | 6.2% |
R | 2 | 6.2% |
(space) | 2 | 6.2% |
V | 1 | 3.1% |
Other values (9) | 9 |
해당연도
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 628.0 B |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2022 |
---|---|
2nd row | 2022 |
3rd row | 2022 |
4th row | 2022 |
5th row | 2022 |
Common Values
Value | Count | Frequency (%) |
2022 | 62 |
건수
Real number (ℝ)
HIGH CORRELATION, MISSING
Distinct | 16 |
---|---|
Distinct (%) | 69.6% |
Missing | 39 |
Missing (%) | 62.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 76.521739 |
Minimum | 1 |
---|---|
Maximum | 706 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 690.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 11 |
Q3 | 33 |
95-th percentile | 459 |
Maximum | 706 |
Range | 705 |
Interquartile range (IQR) | 31 |
Descriptive statistics
Standard deviation | 174.32046 |
---|---|
Coefficient of variation (CV) | 2.2780515 |
Kurtosis | 8.5703034 |
Mean | 76.521739 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 2.962307 |
Sum | 1760 |
Variance | 30387.625 |
Monotonicity | Not monotonic |
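Several of the figures for 건수 can be cross-checked against each other. A minimal sketch using only numbers reported in this section, assuming the mean and the distinct percentage are taken over non-missing values and that the variance is the square of the standard deviation (variable names are illustrative):

```python
# Figures copied from the 건수 tables.
n_observations = 62
n_missing = 39
n_distinct = 16
total = 1760              # Sum
std_dev = 174.32046       # Standard deviation
q1, q3 = 2, 33
minimum, maximum = 1, 706

n_valid = n_observations - n_missing          # non-missing observations
mean = total / n_valid                        # reported as 76.521739
cv = std_dev / mean                           # coefficient of variation
variance = std_dev ** 2                       # close to the reported 30387.625
value_range = maximum - minimum
iqr = q3 - q1
missing_pct = 100 * n_missing / n_observations
distinct_pct = 100 * n_distinct / n_valid

print(f"n_valid={n_valid} mean={mean:.6f} cv={cv:.6f}")
print(f"variance={variance:.3f} range={value_range} iqr={iqr}")
print(f"missing={missing_pct:.1f}% distinct={distinct_pct:.1f}%")
```

Small differences in the last digits of the variance come from the standard deviation being rounded in the report.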
Common values
Value | Count | Frequency (%) |
1 | 5 | 8.1% |
6 | 2 | 3.2% |
13 | 2 | 3.2% |
2 | 2 | 3.2% |
15 | 1 | 1.6% |
28 | 1 | 1.6% |
4 | 1 | 1.6% |
25 | 1 | 1.6% |
10 | 1 | 1.6% |
486 | 1 | 1.6% |
Other values (6) | 6 | 9.7% |
(Missing) | 39 |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 5 | |
2 | 2 | 3.2% |
4 | 1 | 1.6% |
6 | 2 | 3.2% |
10 | 1 | 1.6% |
11 | 1 | 1.6% |
13 | 2 | 3.2% |
15 | 1 | 1.6% |
25 | 1 | 1.6% |
28 | 1 | 1.6% |
Maximum 10 values
Value | Count | Frequency (%) |
706 | 1 | |
486 | 1 | |
216 | 1 | |
126 | 1 | |
48 | 1 | |
38 | 1 | |
28 | 1 | |
25 | 1 | |
15 | 1 | |
13 | 2 |
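Taken together, the two extreme-value tables above (smallest and largest values) list every non-missing 건수 observation, so the quantile statistics can be recomputed. A minimal sketch with Python's `statistics` module, assuming quantiles are obtained by linear interpolation between order statistics (the `inclusive` method); that the profiler uses this exact method is an assumption, although the results agree with the reported figures:

```python
import statistics

# (value: count) pairs read off the smallest- and largest-value tables;
# together they cover all 23 non-missing observations.
value_counts = {
    1: 5, 2: 2, 4: 1, 6: 2, 10: 1, 11: 1, 13: 2, 15: 1,
    25: 1, 28: 1, 38: 1, 48: 1, 126: 1, 216: 1, 486: 1, 706: 1,
}
values = sorted(v for v, c in value_counts.items() for _ in range(c))

# Quartiles and the 95th percentile via linear interpolation.
quartiles = statistics.quantiles(values, n=4, method="inclusive")
p95 = statistics.quantiles(values, n=20, method="inclusive")[-1]

# Median absolute deviation: median of absolute distances to the median.
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# Sample variance (n - 1 in the denominator).
variance = statistics.variance(values)

print(len(values), sum(values))
print(quartiles, p95, mad)
print(round(variance, 3))
```

With 23 values the 95th percentile falls between the 21st and 22nd sorted values (216 and 486), which is why it is not itself an observed count.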
해당연도.1
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 628.0 B |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021 |
---|---|
2nd row | 2021 |
3rd row | 2021 |
4th row | 2021 |
5th row | 2021 |
Common Values
Value | Count | Frequency (%) |
2021 | 62 |
건수.1
Real number (ℝ)
HIGH CORRELATION, MISSING
Distinct | 17 |
---|---|
Distinct (%) | 73.9% |
Missing | 39 |
Missing (%) | 62.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 89.347826 |
Minimum | 1 |
---|---|
Maximum | 608 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 690.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 8 |
Q3 | 37 |
95-th percentile | 487.6 |
Maximum | 608 |
Range | 607 |
Interquartile range (IQR) | 35 |
Descriptive statistics
Standard deviation | 172.50444 |
---|---|
Coefficient of variation (CV) | 1.9307067 |
Kurtosis | 3.8705125 |
Mean | 89.347826 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 2.1867273 |
Sum | 2055 |
Variance | 29757.783 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
1 | 5 | 8.1% |
2 | 2 | 3.2% |
6 | 2 | 3.2% |
4 | 1 | 1.6% |
158 | 1 | 1.6% |
376 | 1 | 1.6% |
18 | 1 | 1.6% |
500 | 1 | 1.6% |
608 | 1 | 1.6% |
15 | 1 | 1.6% |
Other values (7) | 7 | 11.3% |
(Missing) | 39 |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 5 | |
2 | 2 | 3.2% |
4 | 1 | 1.6% |
6 | 2 | 3.2% |
7 | 1 | 1.6% |
8 | 1 | 1.6% |
15 | 1 | 1.6% |
18 | 1 | 1.6% |
19 | 1 | 1.6% |
29 | 1 | 1.6% |
Maximum 10 values
Value | Count | Frequency (%) |
608 | 1 | |
500 | 1 | |
376 | 1 | |
218 | 1 | |
158 | 1 | |
42 | 1 | |
32 | 1 | |
29 | 1 | |
19 | 1 | |
18 | 1 |
Correlations
Variable | 질병군 | 감염병명 | 건수 | 건수.1 |
---|---|---|---|---|
질병군 | 1.000 | 1.000 | 0.189 | 0.000 |
감염병명 | 1.000 | 1.000 | 1.000 | 1.000 |
건수 | 0.189 | 1.000 | 1.000 | 1.000 |
건수.1 | 0.000 | 1.000 | 1.000 | 1.000 |
Variable | 건수 | 건수.1 | 질병군 |
---|---|---|---|
건수 | 1.000 | 0.943 | 0.194 |
건수.1 | 0.943 | 1.000 | 0.000 |
질병군 | 0.194 | 0.000 | 1.000 |
First rows
Index | 질병군 | 감염병명 | 해당연도 | 건수 | 해당연도.1 | 건수.1 |
---|---|---|---|---|---|---|
0 | 1급 | 에볼라바이러스 | 2022 | <NA> | 2021 | <NA> |
1 | 1급 | 마버그열 | 2022 | <NA> | 2021 | <NA> |
2 | 1급 | 라싸열 | 2022 | <NA> | 2021 | <NA> |
3 | 1급 | 크리미안콩고출혈열 | 2022 | <NA> | 2021 | <NA> |
4 | 1급 | 리프트밸리열 | 2022 | <NA> | 2021 | <NA> |
5 | 1급 | 두창 | 2022 | <NA> | 2021 | <NA> |
6 | 1급 | 페스트 | 2022 | <NA> | 2021 | <NA> |
7 | 1급 | 탄저 | 2022 | <NA> | 2021 | <NA> |
8 | 1급 | 보툴리눔독소증 | 2022 | <NA> | 2021 | <NA> |
9 | 1급 | 야토병 | 2022 | <NA> | 2021 | <NA> |
Last rows
Index | 질병군 | 감염병명 | 해당연도 | 건수 | 해당연도.1 | 건수.1 |
---|---|---|---|---|---|---|
52 | 3급 | 황열 | 2022 | <NA> | 2021 | <NA> |
53 | 3급 | 뎅기열 | 2022 | 4 | 2021 | 1 |
54 | 3급 | 큐열 | 2022 | 1 | 2021 | <NA> |
55 | 3급 | 웨스트나일열 | 2022 | <NA> | 2021 | <NA> |
56 | 3급 | 라임병 | 2022 | <NA> | 2021 | <NA> |
57 | 3급 | 진드기매개뇌염 | 2022 | <NA> | 2021 | <NA> |
58 | 3급 | 유비저 | 2022 | <NA> | 2021 | <NA> |
59 | 3급 | 치쿤구니아열 | 2022 | <NA> | 2021 | <NA> |
60 | 3급 | 중증열성혈소판감소증후군 | 2022 | 28 | 2021 | 19 |
61 | 3급 | 지카바이러스감염증 | 2022 | <NA> | 2021 | <NA> |