Overview

Dataset statistics

Number of variables: 8
Number of observations: 32
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.1 KiB
Average record size in memory: 68.1 B

Variable types

Text: 2
Categorical: 6

Dataset

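Description: Data on the status of vaccinations administered in Yeonsu-gu, Incheon Metropolitan City, with fields such as vaccine name (접종명), first-dose target, second-dose target, third-dose target, fee, and vaccination period.
Author: Yeonsu-gu, Incheon Metropolitan City (인천광역시 연수구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15116717&srcSe=7661IVAWM27C61E190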

Alerts

High correlation: 2차 접종대상 is highly overall correlated with 3차 접종대상 and 4 other fields
High correlation: 접종장소 is highly overall correlated with 2차 접종대상 and 2 other fields
High correlation: 3차 접종대상 is highly overall correlated with 2차 접종대상 and 3 other fields
High correlation: 수수료(원) is highly overall correlated with 2차 접종대상 and 2 other fields
High correlation: 접종기간 is highly overall correlated with 2차 접종대상 and 2 other fields
High correlation: 4차 접종대상 is highly overall correlated with 2차 접종대상 and 3 other fields
Imbalance: 수수료(원) is highly imbalanced (79.9%)
Imbalance: 접종기간 is highly imbalanced (66.2%)
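
The imbalance percentages in these alerts are consistent with a score of one minus the normalized Shannon entropy of the category counts. The sketch below is a standalone reconstruction under that assumption, not code taken from ydata-profiling; the function name `imbalance_score` is hypothetical. The counts are those listed in the Common Values tables for 수수료(원) (31 and 1) and 접종기간 (29, 2 and 1).

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the category counts and k is the number of categories.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single category is treated as not imbalanced
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts taken from the report's Common Values tables.
fee = imbalance_score([31, 1])        # 수수료(원): 무료 vs 5,000원
period = imbalance_score([29, 2, 1])  # 접종기간: 연중, 10월 예정, 9월~10월
print(f"{fee:.1%} {period:.1%}")      # 79.9% 66.2%
```

Under this assumption both alert values are reproduced from the counts alone.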

Reproduction

Analysis started: 2024-01-28 12:47:20.344816
Analysis finished: 2024-01-28 12:47:20.877611
Duration: 0.53 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

접종명
Text

Distinct: 31
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 15
Median length: 11.5
Mean length: 8.46875
Min length: 2

Characters and Unicode

Total characters: 271
Distinct characters: 103
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
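
As a rough illustration of how such character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own code. The helper name `category_counts` and the abbreviation-to-name table are assumptions made for this example, and the table covers only the categories that appear in this report.

```python
import unicodedata
from collections import Counter

# Long names for the general-category abbreviations seen in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Zs": "Space Separator",
    "Sm": "Math Symbol",
}


def category_counts(text):
    """Count the Unicode general category of every character in text."""
    counts = Counter()
    for ch in text:
        abbreviation = unicodedata.category(ch)
        # Fall back to the raw abbreviation for categories not in the table.
        counts[CATEGORY_NAMES.get(abbreviation, abbreviation)] += 1
    return counts


# First sample value of the variable profiled here.
print(category_counts("결핵BCG(피내용)"))
```

For this value the tally is 5 Other Letter (the Hangul syllables), 3 Uppercase Letter, 1 Open Punctuation and 1 Close Punctuation.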

Unique

Unique: 30
Unique (%): 93.8%
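
The report distinguishes Distinct (how many different values occur) from Unique (how many values occur exactly once). A minimal sketch of that distinction follows; the helper name `distinct_and_unique` and the five-value toy column are illustrative assumptions.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique): distinct counts different values,
    unique counts values that occur exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Toy column: "B형간염" appears twice, as in the report's sample rows.
column = ["결핵BCG(피내용)", "B형간염", "B형간염", "DTaP", "Td/Tdap"]
print(distinct_and_unique(column))  # (4, 3)
```

With 32 observations, a single value that occurs twice gives 31 distinct and 30 unique values, which matches the figures reported for this variable.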

Sample

1st row: 결핵BCG(피내용)
2nd row: B형간염
3rd row: B형간염
4th row: DTaP
5th row: Td/Tdap
Value | Count | Frequency (%)
b형간염 | 2 | 5.9%
ba.4/5 | 2 | 5.9%
a형간염 | 1 | 2.9%
hpv(사람유두종바이러스 | 1 | 2.9%
화이자 | 1 | 2.9%
화이자(영유아용 | 1 | 2.9%
화이자(소아용 | 1 | 2.9%
스카이코비원 | 1 | 2.9%
노바백스(코로나19 | 1 | 2.9%
얀센(코로나19 | 1 | 2.9%
Other values (22) | 22 | 64.7%

Most occurring characters

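Value | Count | Frequency (%)
) | 18 | 6.6%
( | 18 | 6.6%
P | 10 | 3.7%
[character lost] | 8 | 3.0%
[character lost] | 7 | 2.6%
[character lost] | 7 | 2.6%
[character lost] | 6 | 2.2%
[character lost] | 6 | 2.2%
V | 6 | 2.2%
1 | 5 | 1.8%
Other values (93) | 180 | 66.4%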

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 149 | 55.0%
Uppercase Letter | 45 | 16.6%
Close Punctuation | 18 | 6.6%
Open Punctuation | 18 | 6.6%
Decimal Number | 16 | 5.9%
Lowercase Letter | 14 | 5.2%
Other Punctuation | 6 | 2.2%
Space Separator | 3 | 1.1%
Dash Punctuation | 2 | 0.7%

Most frequent character per category

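Other Letter
Value | Count | Frequency (%)
[character lost] | 8 | 5.4%
[character lost] | 7 | 4.7%
[character lost] | 7 | 4.7%
[character lost] | 6 | 4.0%
[character lost] | 6 | 4.0%
[character lost] | 4 | 2.7%
[character lost] | 4 | 2.7%
[character lost] | 4 | 2.7%
[character lost] | 4 | 2.7%
[character lost] | 4 | 2.7%
Other values (58) | 95 | 63.8%

Uppercase Letter
Value | Count | Frequency (%)
P | 10 | 22.2%
V | 6 | 13.3%
B | 5 | 11.1%
T | 5 | 11.1%
C | 3 | 6.7%
D | 3 | 6.7%
A | 3 | 6.7%
M | 2 | 4.4%
I | 2 | 4.4%
H | 2 | 4.4%
Other values (4) | 4 | 8.9%

Lowercase Letter
Value | Count | Frequency (%)
a | 4 | 28.6%
d | 2 | 14.3%
i | 2 | 14.3%
b | 2 | 14.3%
l | 1 | 7.1%
p | 1 | 7.1%
h | 1 | 7.1%
u | 1 | 7.1%

Decimal Number
Value | Count | Frequency (%)
1 | 5 | 31.2%
9 | 3 | 18.8%
5 | 2 | 12.5%
4 | 2 | 12.5%
3 | 2 | 12.5%
0 | 1 | 6.2%
2 | 1 | 6.2%

Other Punctuation
Value | Count | Frequency (%)
/ | 4 | 66.7%
. | 2 | 33.3%

Close Punctuation
Value | Count | Frequency (%)
) | 18 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 18 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 3 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%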

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 149 | 55.0%
Common | 63 | 23.2%
Latin | 59 | 21.8%

Most frequent character per script

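Hangul
Value | Count | Frequency (%)
[character lost] | 8 | 5.4%
[character lost] | 7 | 4.7%
[character lost] | 7 | 4.7%
[character lost] | 6 | 4.0%
[character lost] | 6 | 4.0%
[character lost] | 4 | 2.7%
[character lost] | 4 | 2.7%
[character lost] | 4 | 2.7%
[character lost] | 4 | 2.7%
[character lost] | 4 | 2.7%
Other values (58) | 95 | 63.8%

Latin
Value | Count | Frequency (%)
P | 10 | 16.9%
V | 6 | 10.2%
B | 5 | 8.5%
T | 5 | 8.5%
a | 4 | 6.8%
C | 3 | 5.1%
D | 3 | 5.1%
A | 3 | 5.1%
M | 2 | 3.4%
d | 2 | 3.4%
Other values (12) | 16 | 27.1%

Common
Value | Count | Frequency (%)
) | 18 | 28.6%
( | 18 | 28.6%
1 | 5 | 7.9%
/ | 4 | 6.3%
(space) | 3 | 4.8%
9 | 3 | 4.8%
. | 2 | 3.2%
5 | 2 | 3.2%
- | 2 | 3.2%
4 | 2 | 3.2%
Other values (3) | 4 | 6.3%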

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 149 | 55.0%
ASCII | 122 | 45.0%

Most frequent character per block

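ASCII
Value | Count | Frequency (%)
) | 18 | 14.8%
( | 18 | 14.8%
P | 10 | 8.2%
V | 6 | 4.9%
1 | 5 | 4.1%
B | 5 | 4.1%
T | 5 | 4.1%
/ | 4 | 3.3%
a | 4 | 3.3%
(space) | 3 | 2.5%
Other values (25) | 44 | 36.1%

Hangul
Value | Count | Frequency (%)
[character lost] | 8 | 5.4%
[character lost] | 7 | 4.7%
[character lost] | 7 | 4.7%
[character lost] | 6 | 4.0%
[character lost] | 6 | 4.0%
[character lost] | 4 | 2.7%
[character lost] | 4 | 2.7%
[character lost] | 4 | 2.7%
[character lost] | 4 | 2.7%
[character lost] | 4 | 2.7%
Other values (58) | 95 | 63.8%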

1차 접종대상
Text

Distinct: 20
Distinct (%): 62.5%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 47
Median length: 42
Mean length: 10.15625
Min length: 2

Characters and Unicode

Total characters: 325
Distinct characters: 78
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 15
Unique (%): 46.9%

Sample

1st row: 0개월
2nd row: 0개월
3rd row: 성인
4th row: 2개월
5th row: 만11~12세
Value | Count | Frequency (%)
2개월 | 10 | 13.7%
[value lost] | 9 | 12.3%
18세이상 | 3 | 4.1%
12세이상 | 3 | 4.1%
장티푸스 | 2 | 2.7%
12~15개월 | 2 | 2.7%
이후 | 2 | 2.7%
[value lost] | 2 | 2.7%
0개월 | 2 | 2.7%
유행 | 1 | 1.4%
Other values (37) | 37 | 50.7%

Most occurring characters

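Value | Count | Frequency (%)
(space) | 44 | 13.5%
1 | 27 | 8.3%
2 | 26 | 8.0%
[character lost] | 20 | 6.2%
[character lost] | 20 | 6.2%
[character lost] | 16 | 4.9%
[character lost] | 13 | 4.0%
~ | 12 | 3.7%
[character lost] | 11 | 3.4%
0 | 9 | 2.8%
Other values (68) | 127 | 39.1%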

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 174 | 53.5%
Decimal Number | 82 | 25.2%
Space Separator | 44 | 13.5%
Math Symbol | 12 | 3.7%
Other Punctuation | 9 | 2.8%
Close Punctuation | 2 | 0.6%
Open Punctuation | 2 | 0.6%

Most frequent character per category

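Other Letter
Value | Count | Frequency (%)
[character lost] | 20 | 11.5%
[character lost] | 20 | 11.5%
[character lost] | 16 | 9.2%
[character lost] | 13 | 7.5%
[character lost] | 11 | 6.3%
[character lost] | 8 | 4.6%
[character lost] | 6 | 3.4%
[character lost] | 3 | 1.7%
[character lost] | 3 | 1.7%
[character lost] | 2 | 1.1%
Other values (52) | 72 | 41.4%

Decimal Number
Value | Count | Frequency (%)
1 | 27 | 32.9%
2 | 26 | 31.7%
0 | 9 | 11.0%
5 | 5 | 6.1%
6 | 5 | 6.1%
8 | 4 | 4.9%
3 | 3 | 3.7%
4 | 1 | 1.2%
7 | 1 | 1.2%
9 | 1 | 1.2%

Other Punctuation
Value | Count | Frequency (%)
, | 6 | 66.7%
. | 3 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 44 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 12 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 2 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 2 | 100.0%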

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 174 | 53.5%
Common | 151 | 46.5%

Most frequent character per script

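Hangul
Value | Count | Frequency (%)
[character lost] | 20 | 11.5%
[character lost] | 20 | 11.5%
[character lost] | 16 | 9.2%
[character lost] | 13 | 7.5%
[character lost] | 11 | 6.3%
[character lost] | 8 | 4.6%
[character lost] | 6 | 3.4%
[character lost] | 3 | 1.7%
[character lost] | 3 | 1.7%
[character lost] | 2 | 1.1%
Other values (52) | 72 | 41.4%

Common
Value | Count | Frequency (%)
(space) | 44 | 29.1%
1 | 27 | 17.9%
2 | 26 | 17.2%
~ | 12 | 7.9%
0 | 9 | 6.0%
, | 6 | 4.0%
5 | 5 | 3.3%
6 | 5 | 3.3%
8 | 4 | 2.6%
3 | 3 | 2.0%
Other values (6) | 10 | 6.6%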

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 174 | 53.5%
ASCII | 151 | 46.5%

Most frequent character per block

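ASCII
Value | Count | Frequency (%)
(space) | 44 | 29.1%
1 | 27 | 17.9%
2 | 26 | 17.2%
~ | 12 | 7.9%
0 | 9 | 6.0%
, | 6 | 4.0%
5 | 5 | 3.3%
6 | 5 | 3.3%
8 | 4 | 2.6%
3 | 3 | 2.0%
Other values (6) | 10 | 6.6%

Hangul
Value | Count | Frequency (%)
[character lost] | 20 | 11.5%
[character lost] | 20 | 11.5%
[character lost] | 16 | 9.2%
[character lost] | 13 | 7.5%
[character lost] | 11 | 6.3%
[character lost] | 8 | 4.6%
[character lost] | 6 | 3.4%
[character lost] | 3 | 1.7%
[character lost] | 3 | 1.7%
[character lost] | 2 | 1.1%
Other values (52) | 72 | 41.4%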

2차 접종대상
Categorical

HIGH CORRELATION 

Distinct: 12
Distinct (%): 37.5%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 14
Median length: 13
Mean length: 5.5
Min length: 2

Unique

Unique: 7
Unique (%): 21.9%

Sample

1st row: 생후4주이내
2nd row: 1개월
3rd row: <NA>
4th row: 4개월
5th row: <NA>

Common Values

Value | Count | Frequency (%)
4개월 | 10 | 31.2%
<NA> | 8 | 25.0%
8주(3주) | 3 | 9.4%
4주 | 2 | 6.2%
6개월 권고(3개월가능) | 2 | 6.2%
생후4주이내 | 1 | 3.1%
1개월 | 1 | 3.1%
만4~6세 | 1 | 3.1%
1차 접종 후 7일~30일 | 1 | 3.1%
1차 접종 후 1년 이후 | 1 | 3.1%
Other values (2) | 2 | 6.2%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
4개월 | 10 | 21.3%
na | 8 | 17.0%
6개월 | 4 | 8.5%
8주(3주 | 3 | 6.4%
1차 | 3 | 6.4%
접종 | 3 | 6.4%
[value lost] | 3 | 6.4%
4주 | 2 | 4.3%
권고(3개월가능 | 2 | 4.3%
이후 | 2 | 4.3%
Other values (7) | 7 | 14.9%

3차 접종대상
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 15.6%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 5
Median length: 4
Mean length: 3.53125
Min length: 2

Unique

Unique: 2
Unique (%): 6.2%

Sample

1st row: <NA>
2nd row: 6개월
3rd row: <NA>
4th row: 6개월
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 18 | 56.2%
6개월 | 10 | 31.2%
무료 | 2 | 6.2%
만4~6세 | 1 | 3.1%
8주 | 1 | 3.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 18 | 56.2%
6개월 | 10 | 31.2%
무료 | 2 | 6.2%
만4~6세 | 1 | 3.1%
8주 | 1 | 3.1%

4차 접종대상
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 15.6%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 22
Median length: 4
Mean length: 5.28125
Min length: 4

Unique

Unique: 2
Unique (%): 6.2%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: 4차(15~18개월), 5차(만4~6세)
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 23 | 71.9%
12~15개월 | 4 | 12.5%
만4~6세 | 3 | 9.4%
4차(15~18개월), 5차(만4~6세) | 1 | 3.1%
2차 접종 후 12개월 | 1 | 3.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 23 | 63.9%
12~15개월 | 4 | 11.1%
만4~6세 | 3 | 8.3%
4차(15~18개월 | 1 | 2.8%
5차(만4~6세 | 1 | 2.8%
2차 | 1 | 2.8%
접종 | 1 | 2.8%
[value lost] | 1 | 2.8%
12개월 | 1 | 2.8%

수수료(원)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 6.2%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 6
Median length: 2
Mean length: 2.125
Min length: 2

Unique

Unique: 1
Unique (%): 3.1%

Sample

1st row: 무료
2nd row: 무료
3rd row: 5,000원
4th row: 무료
5th row: 무료

Common Values

Value | Count | Frequency (%)
무료 | 31 | 96.9%
5,000원 | 1 | 3.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
무료 | 31 | 96.9%
5,000원 | 1 | 3.1%

접종기간
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 9.4%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 6
Median length: 2
Mean length: 2.375
Min length: 2

Unique

Unique: 1
Unique (%): 3.1%

Sample

1st row: 연중
2nd row: 연중
3rd row: 연중
4th row: 연중
5th row: 연중

Common Values

Value | Count | Frequency (%)
연중 | 29 | 90.6%
10월 예정 | 2 | 6.2%
9월~10월 | 1 | 3.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
연중 | 29 | 85.3%
10월 | 2 | 5.9%
예정 | 2 | 5.9%
9월~10월 | 1 | 2.9%

접종장소
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 6.2%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 12
Median length: 6
Mean length: 7.875
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지정의료기관
2nd row: 지정의료기관
3rd row: 지정의료기관 및 보건소
4th row: 지정의료기관
5th row: 지정의료기관

Common Values

Value | Count | Frequency (%)
지정의료기관 | 22 | 68.8%
지정의료기관 및 보건소 | 10 | 31.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
지정의료기관 | 32 | 61.5%
[value lost] | 10 | 19.2%
보건소 | 10 | 19.2%

Correlations

Variable | 접종명 | 1차 접종대상 | 2차 접종대상 | 3차 접종대상 | 4차 접종대상 | 수수료(원) | 접종기간 | 접종장소
접종명 | 1.000 | 0.946 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000
1차 접종대상 | 0.946 | 1.000 | 0.961 | 1.000 | 0.779 | 1.000 | 1.000 | 1.000
2차 접종대상 | 1.000 | 0.961 | 1.000 | 1.000 | 0.634 | NaN | NaN | 1.000
3차 접종대상 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | NaN | NaN | 1.000
4차 접종대상 | 1.000 | 0.779 | 0.634 | 0.000 | 1.000 | NaN | NaN | NaN
수수료(원) | 0.000 | 1.000 | NaN | NaN | NaN | 1.000 | 0.000 | 0.000
접종기간 | 1.000 | 1.000 | NaN | NaN | NaN | 0.000 | 1.000 | 0.000
접종장소 | 0.000 | 1.000 | 1.000 | 1.000 | NaN | 0.000 | 0.000 | 1.000

Variable | 2차 접종대상 | 접종장소 | 3차 접종대상 | 수수료(원) | 접종기간 | 4차 접종대상
2차 접종대상 | 1.000 | 0.769 | 0.949 | 1.000 | 1.000 | 0.570
접종장소 | 0.769 | 1.000 | 0.913 | 0.000 | 0.000 | 1.000
3차 접종대상 | 0.949 | 0.913 | 1.000 | 1.000 | 1.000 | 0.000
수수료(원) | 1.000 | 0.000 | 1.000 | 1.000 | 0.000 | 1.000
접종기간 | 1.000 | 0.000 | 1.000 | 0.000 | 1.000 | 1.000
4차 접종대상 | 0.570 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000

Variable | 2차 접종대상 | 3차 접종대상 | 4차 접종대상 | 수수료(원) | 접종기간 | 접종장소
2차 접종대상 | 1.000 | 0.949 | 0.570 | 1.000 | 1.000 | 0.769
3차 접종대상 | 0.949 | 1.000 | 0.000 | 1.000 | 1.000 | 0.913
4차 접종대상 | 0.570 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000
수수료(원) | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000
접종기간 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000
접종장소 | 0.769 | 0.913 | 1.000 | 0.000 | 0.000 | 1.000
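
The report does not state which association measure produced these matrices. For pairs of categorical variables, Cramér's V is one common choice, and the sketch below shows its plain (uncorrected) form using only the standard library. The function name `cramers_v` and the toy lists are assumptions for illustration, and because no bias correction is applied the results need not match the values above.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long sequences of categories."""
    if len(xs) != len(ys):
        raise ValueError("sequences must have the same length")
    n = len(xs)
    row = Counter(xs)             # marginal counts of the first variable
    col = Counter(ys)             # marginal counts of the second variable
    joint = Counter(zip(xs, ys))  # contingency table as a sparse mapping
    k = min(len(row), len(col)) - 1
    if n == 0 or k == 0:
        return float("nan")  # undefined when either variable is constant
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * k))


# Hypothetical toy data: the first pair is perfectly associated, the second independent.
print(cramers_v(["a", "a", "b", "b"], ["c", "c", "d", "d"]))  # 1.0
print(cramers_v(["a", "a", "b", "b"], ["c", "d", "c", "d"]))  # 0.0
```

Returning NaN for a constant column is a choice made in this sketch for the case where the statistic is undefined.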

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
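
The report counts 0 missing cells, yet `<NA>` is listed as an ordinary value in several columns (for example 23 times in 4차 접종대상). One plausible reading, stated here as an assumption, is that the placeholder was stored as a literal string. Under that assumption, the sketch below counts placeholder cells per column. The helper name `count_placeholders` is hypothetical, and the rows are copied from the report's sample (first five columns only).

```python
def count_placeholders(rows, columns, placeholder="<NA>"):
    """Count, per column, the cells whose value equals the placeholder string."""
    counts = {name: 0 for name in columns}
    for row in rows:
        for name, cell in zip(columns, row):
            if cell == placeholder:
                counts[name] += 1
    return counts


columns = ["접종명", "1차 접종대상", "2차 접종대상", "3차 접종대상", "4차 접종대상"]
# Three rows copied from the report's sample (rows 0, 1 and 2).
rows = [
    ["결핵BCG(피내용)", "0개월", "생후4주이내", "<NA>", "<NA>"],
    ["B형간염", "0개월", "1개월", "6개월", "<NA>"],
    ["B형간염", "성인", "<NA>", "<NA>", "<NA>"],
]
print(count_placeholders(rows, columns))
```

For these three rows the counts are 0, 0, 1, 2 and 3, in column order.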

Sample

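First rows

Row | 접종명 | 1차 접종대상 | 2차 접종대상 | 3차 접종대상 | 4차 접종대상 | 수수료(원) | 접종기간 | 접종장소
0 | 결핵BCG(피내용) | 0개월 | 생후4주이내 | <NA> | <NA> | 무료 | 연중 | 지정의료기관
1 | B형간염 | 0개월 | 1개월 | 6개월 | <NA> | 무료 | 연중 | 지정의료기관
2 | B형간염 | 성인 | <NA> | <NA> | <NA> | 5,000원 | 연중 | 지정의료기관 및 보건소
3 | DTaP | 2개월 | 4개월 | 6개월 | 4차(15~18개월), 5차(만4~6세) | 무료 | 연중 | 지정의료기관
4 | Td/Tdap | 만11~12세 | <NA> | <NA> | <NA> | 무료 | 연중 | 지정의료기관
5 | 폴리오 | 2개월 | 4개월 | 6개월 | 만4~6세 | 무료 | 연중 | 지정의료기관
6 | DTaP-IPV | 2개월 | 4개월 | 6개월 | 만4~6세 | 무료 | 연중 | 지정의료기관
7 | DTaP-IPV/hib | 2개월 | 4개월 | 6개월 | <NA> | 무료 | 연중 | 지정의료기관
8 | Hib(뇌수막염) | 2개월 | 4개월 | 6개월 | 12~15개월 | 무료 | 연중 | 지정의료기관
9 | 폐렴구균(PCV10) | 2개월 | 4개월 | 6개월 | 12~15개월 | 무료 | 연중 | 지정의료기관

Last rows

Row | 접종명 | 1차 접종대상 | 2차 접종대상 | 3차 접종대상 | 4차 접종대상 | 수수료(원) | 접종기간 | 접종장소
22 | 신증후군출혈열(유행성출혈열) | 농업 종사자 및 고위험군 | <NA> | <NA> | <NA> | 무료 | 9월~10월 | 지정의료기관
23 | 인플루엔자(독감) | 연수구민 중 만 65세이상 , 임신부, 1~3급 장애인, 국가유공자, 수급자 | <NA> | <NA> | <NA> | 무료 | 10월 예정 | 지정의료기관 및 보건소
24 | 화이자(코로나19) | 만 12세이상 | 8주(3주) | <NA> | <NA> | 무료 | 연중 | 지정의료기관 및 보건소
25 | 얀센(코로나19) | 만 18세이상 | <NA> | <NA> | <NA> | 무료 | 연중 | 지정의료기관 및 보건소
26 | 노바백스(코로나19) | 만 12세이상 | 4주 | <NA> | <NA> | 무료 | 연중 | 지정의료기관 및 보건소
27 | 스카이코비원 | 만 18세이상 | 4주 | <NA> | <NA> | 무료 | 연중 | 지정의료기관 및 보건소
28 | 화이자(소아용) | 만 5~11세 | 8주(3주) | <NA> | <NA> | 무료 | 연중 | 지정의료기관 및 보건소
29 | 화이자(영유아용) | 6개월~4세 | 8주(3주) | 8주 | <NA> | 무료 | 연중 | 지정의료기관 및 보건소
30 | 화이자 BA.4/5 | 만 12세이상 | 6개월 권고(3개월가능) | 무료 | <NA> | 무료 | 연중 | 지정의료기관 및 보건소
31 | 모더나 BA.4/5 | 만 18세이상 | 6개월 권고(3개월가능) | 무료 | <NA> | 무료 | 연중 | 지정의료기관 및 보건소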