Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 48 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.4 KiB |
Average record size in memory | 51.8 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 4 |
Text | 1 |
Dataset
Description | 인천광역시 서구 감염병 현황에 대한 데이터로 연번, 구분, 질병명, 발생기간, 신고건수 등의 정보가 포함되어 있습니다. |
---|---|
Author | 인천광역시 서구 |
URL | https://data.incheon.go.kr/findData/publicDataDetail?dataId=15090927&srcSe=7661IVAWM27C61E190 |
발생기간 has constant value "" | Constant |
데이터기준일자 has constant value "" | Constant |
연번 is highly overall correlated with 구분 | High correlation |
구분 is highly overall correlated with 연번 and 1 other fields | High correlation |
신고건수 is highly overall correlated with 구분 | High correlation |
연번 has unique values | Unique |
질병명 has unique values | Unique |
Reproduction
Analysis started | 2024-01-28 14:39:14.416514 |
---|---|
Analysis finished | 2024-01-28 14:39:14.828664 |
Duration | 0.41 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 48 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 24.5 |
Minimum | 1 |
---|---|
Maximum | 48 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 564.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3.35 |
Q1 | 12.75 |
median | 24.5 |
Q3 | 36.25 |
95-th percentile | 45.65 |
Maximum | 48 |
Range | 47 |
Interquartile range (IQR) | 23.5 |
Descriptive statistics
Standard deviation | 14 |
---|---|
Coefficient of variation (CV) | 0.57142857 |
Kurtosis | -1.2 |
Mean | 24.5 |
Median Absolute Deviation (MAD) | 12 |
Skewness | 0 |
Sum | 1176 |
Variance | 196 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 2.1% |
26 | 1 | 2.1% |
28 | 1 | 2.1% |
29 | 1 | 2.1% |
30 | 1 | 2.1% |
31 | 1 | 2.1% |
32 | 1 | 2.1% |
33 | 1 | 2.1% |
34 | 1 | 2.1% |
35 | 1 | 2.1% |
Other values (38) | 38 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
48 | 1 | |
47 | 1 | |
46 | 1 | |
45 | 1 | |
44 | 1 | |
43 | 1 | |
42 | 1 | |
41 | 1 | |
40 | 1 | |
39 | 1 |
구분
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 6.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 516.0 B |
3급 | |
---|---|
2급 | |
4급 | 1 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 2.1% |
Sample
1st row | 2급 |
---|---|
2nd row | 2급 |
3rd row | 2급 |
4th row | 2급 |
5th row | 2급 |
Common Values
Value | Count | Frequency (%) |
3급 | 25 | |
2급 | 22 | |
4급 | 1 | 2.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
3급 | 25 | |
2급 | 22 | |
4급 | 1 | 2.1% |
질병명
Text
UNIQUE
 
Distinct | 48 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 516.0 B |
Value | Count | Frequency (%) |
감염증 | 4 | 7.4% |
수두 | 1 | 1.9% |
c형간염 | 1 | 1.9% |
말라리아 | 1 | 1.9% |
레지오넬라증 | 1 | 1.9% |
비브리오패혈증 | 1 | 1.9% |
발진티푸스 | 1 | 1.9% |
발진열 | 1 | 1.9% |
쯔쯔가무시증 | 1 | 1.9% |
렙토스피라증 | 1 | 1.9% |
Other values (41) | 41 |
Most occurring characters
Value | Count | Frequency (%) |
증 | 14 | 4.3% |
염 | 13 | 4.0% |
성 | 9 | 2.8% |
열 | 9 | 2.8% |
( | 8 | 2.5% |
) | 8 | 2.5% |
이 | 7 | 2.1% |
균 | 7 | 2.1% |
스 | 7 | 2.1% |
라 | 7 | 2.1% |
Other values (134) | 237 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 267 | |
Uppercase Letter | 26 | 8.0% |
Open Punctuation | 8 | 2.5% |
Close Punctuation | 8 | 2.5% |
Space Separator | 6 | 1.8% |
Decimal Number | 6 | 1.8% |
Dash Punctuation | 3 | 0.9% |
Lowercase Letter | 2 | 0.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
증 | 14 | 5.2% |
염 | 13 | 4.9% |
성 | 9 | 3.4% |
열 | 9 | 3.4% |
이 | 7 | 2.6% |
균 | 7 | 2.6% |
스 | 7 | 2.6% |
라 | 7 | 2.6% |
감 | 7 | 2.6% |
진 | 6 | 2.2% |
Other values (110) | 181 |
Uppercase Letter
Value | Count | Frequency (%) |
C | 5 | |
S | 3 | |
D | 3 | |
V | 2 | 7.7% |
J | 2 | 7.7% |
R | 2 | 7.7% |
E | 2 | 7.7% |
A | 2 | 7.7% |
I | 1 | 3.8% |
O | 1 | 3.8% |
Other values (3) | 3 |
Decimal Number
Value | Count | Frequency (%) |
1 | 2 | |
2 | 1 | |
0 | 1 | |
8 | 1 | |
9 | 1 |
Lowercase Letter
Value | Count | Frequency (%) |
v | 1 | |
b | 1 |
Open Punctuation
Value | Count | Frequency (%) |
( | 8 |
Close Punctuation
Value | Count | Frequency (%) |
) | 8 |
Space Separator
Value | Count | Frequency (%) |
6 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 267 | |
Common | 31 | 9.5% |
Latin | 28 | 8.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
증 | 14 | 5.2% |
염 | 13 | 4.9% |
성 | 9 | 3.4% |
열 | 9 | 3.4% |
이 | 7 | 2.6% |
균 | 7 | 2.6% |
스 | 7 | 2.6% |
라 | 7 | 2.6% |
감 | 7 | 2.6% |
진 | 6 | 2.2% |
Other values (110) | 181 |
Latin
Value | Count | Frequency (%) |
C | 5 | |
S | 3 | |
D | 3 | |
V | 2 | 7.1% |
J | 2 | 7.1% |
R | 2 | 7.1% |
E | 2 | 7.1% |
A | 2 | 7.1% |
v | 1 | 3.6% |
I | 1 | 3.6% |
Other values (5) | 5 |
Common
Value | Count | Frequency (%) |
( | 8 | |
) | 8 | |
6 | ||
- | 3 | 9.7% |
1 | 2 | 6.5% |
2 | 1 | 3.2% |
0 | 1 | 3.2% |
8 | 1 | 3.2% |
9 | 1 | 3.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 267 | |
ASCII | 59 | 18.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
증 | 14 | 5.2% |
염 | 13 | 4.9% |
성 | 9 | 3.4% |
열 | 9 | 3.4% |
이 | 7 | 2.6% |
균 | 7 | 2.6% |
스 | 7 | 2.6% |
라 | 7 | 2.6% |
감 | 7 | 2.6% |
진 | 6 | 2.2% |
Other values (110) | 181 |
ASCII
Value | Count | Frequency (%) |
( | 8 | |
) | 8 | |
6 | 10.2% | |
C | 5 | 8.5% |
S | 3 | 5.1% |
D | 3 | 5.1% |
- | 3 | 5.1% |
V | 2 | 3.4% |
J | 2 | 3.4% |
R | 2 | 3.4% |
Other values (14) | 17 |
발생기간
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 2.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 516.0 B |
2023-01-01~2023-09-30 |
---|
Length
Max length | 21 |
---|---|
Median length | 21 |
Mean length | 21 |
Min length | 21 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2023-01-01~2023-09-30 |
---|---|
2nd row | 2023-01-01~2023-09-30 |
3rd row | 2023-01-01~2023-09-30 |
4th row | 2023-01-01~2023-09-30 |
5th row | 2023-01-01~2023-09-30 |
Common Values
Value | Count | Frequency (%) |
2023-01-01~2023-09-30 | 48 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2023-01-01~2023-09-30 | 48 |
신고건수
Categorical
HIGH CORRELATION
 
Distinct | 13 |
---|---|
Distinct (%) | 27.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 516.0 B |
0 | |
---|---|
5 | 3 |
1 | 2 |
4 | 2 |
207 | 1 |
Other values (8) |
Length
Max length | 7 |
---|---|
Median length | 1 |
Mean length | 1.2916667 |
Min length | 1 |
Unique
Unique | 9 ? |
---|---|
Unique (%) | 18.8% |
Sample
1st row | 207 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
0 | 32 | |
5 | 3 | 6.2% |
1 | 2 | 4.2% |
4 | 2 | 4.2% |
207 | 1 | 2.1% |
3 | 1 | 2.1% |
6 | 1 | 2.1% |
116 | 1 | 2.1% |
339 | 1 | 2.1% |
73 | 1 | 2.1% |
Other values (3) | 3 | 6.2% |
Length
Value | Count | Frequency (%) |
0 | 32 | |
5 | 3 | 6.2% |
1 | 2 | 4.2% |
4 | 2 | 4.2% |
207 | 1 | 2.1% |
3 | 1 | 2.1% |
6 | 1 | 2.1% |
116 | 1 | 2.1% |
339 | 1 | 2.1% |
73 | 1 | 2.1% |
Other values (3) | 3 | 6.2% |
데이터기준일자
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 2.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 516.0 B |
2023-09-30 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2023-09-30 |
---|---|
2nd row | 2023-09-30 |
3rd row | 2023-09-30 |
4th row | 2023-09-30 |
5th row | 2023-09-30 |
Common Values
Value | Count | Frequency (%) |
2023-09-30 | 48 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2023-09-30 | 48 |
연번 | 구분 | 질병명 | 신고건수 | |
---|---|---|---|---|
연번 | 1.000 | 0.773 | 1.000 | 0.351 |
구분 | 0.773 | 1.000 | 1.000 | 0.818 |
질병명 | 1.000 | 1.000 | 1.000 | 1.000 |
신고건수 | 0.351 | 0.818 | 1.000 | 1.000 |
신고건수 | 구분 | |
---|---|---|
신고건수 | 1.000 | 0.599 |
구분 | 0.599 | 1.000 |
연번 | 구분 | 신고건수 | |
---|---|---|---|
연번 | 1.000 | 0.597 | 0.123 |
구분 | 0.597 | 1.000 | 0.599 |
신고건수 | 0.123 | 0.599 | 1.000 |
연번 | 구분 | 질병명 | 발생기간 | 신고건수 | 데이터기준일자 | |
---|---|---|---|---|---|---|
0 | 1 | 2급 | 수두 | 2023-01-01~2023-09-30 | 207 | 2023-09-30 |
1 | 2 | 2급 | 홍역 | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
2 | 3 | 2급 | 콜레라 | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
3 | 4 | 2급 | 장티푸스 | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
4 | 5 | 2급 | 파라티푸스 | 2023-01-01~2023-09-30 | 1 | 2023-09-30 |
5 | 6 | 2급 | 세균성이질 | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
6 | 7 | 2급 | 장출혈성대장균감염증 | 2023-01-01~2023-09-30 | 3 | 2023-09-30 |
7 | 8 | 2급 | A형간염 | 2023-01-01~2023-09-30 | 6 | 2023-09-30 |
8 | 9 | 2급 | 백일해 | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
9 | 10 | 2급 | 유행성이하선염 | 2023-01-01~2023-09-30 | 116 | 2023-09-30 |
연번 | 구분 | 질병명 | 발생기간 | 신고건수 | 데이터기준일자 | |
---|---|---|---|---|---|---|
38 | 39 | 3급 | 뎅기열 | 2023-01-01~2023-09-30 | 4 | 2023-09-30 |
39 | 40 | 3급 | 큐열 | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
40 | 41 | 3급 | 웨스트나일열 | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
41 | 42 | 3급 | 라임병 | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
42 | 43 | 3급 | 진드기매개뇌염 | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
43 | 44 | 3급 | 유비저 | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
44 | 45 | 3급 | 치쿤구니야열 | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
45 | 46 | 3급 | 중증열성혈소판감소증후군(SFTS) | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
46 | 47 | 3급 | 지카바이러스감염증 | 2023-01-01~2023-09-30 | 0 | 2023-09-30 |
47 | 48 | 4급 | COVID-19 | 2023-01-01~2023-09-30 | 389,931 | 2023-09-30 |