Overview

Dataset statistics

Number of variables: 3
Number of observations: 6492
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 236
Duplicate rows (%): 3.6%
Total size in memory: 152.3 KiB
Average record size in memory: 24.0 B
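
The derived figures in this block follow from the raw counts by simple arithmetic. The sketch below re-derives the duplicate-row percentage and the average record size from the reported totals. The variable names are illustrative, and the one-decimal rounding is an assumption based on how the report prints its values.

```python
# Figures copied from the "Dataset statistics" block of the report.
n_rows = 6492
n_duplicate_rows = 236
total_kib = 152.3

# Share of duplicate rows, rounded to one decimal place (assumed display rounding).
duplicate_pct = round(100 * n_duplicate_rows / n_rows, 1)

# Average record size in bytes: total size converted from KiB, divided by row count.
avg_record_bytes = round(total_kib * 1024 / n_rows, 1)

print(duplicate_pct, avg_record_bytes)
```

Both results agree with the values printed in the report (3.6% and 24.0 B).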

Variable types

Categorical: 1
Text: 2

Dataset

Description: A list of facilities subject to mandatory disinfection in Chungcheongnam-do for 2023, including each facility's location, name, and address.
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=374&beforeMenuCd=DOM_000000201001001000&publicdatapk=15069638

Alerts

Dataset has 236 (3.6%) duplicate rows (alert type: Duplicates)

Reproduction

Analysis started: 2024-01-09 20:46:52.749597
Analysis finished: 2024-01-09 20:46:53.460475
Duration: 0.71 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

구 분
Categorical

Distinct: 15
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 50.8 KiB

천안시: 1663
아산시: 1016
당진시: 563
서산시: 558
논산시: 404
Other values (10): 2288

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3
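
As a minimal sketch of how length statistics like these can be obtained, the snippet below measures the character length of a few 구 분 values taken from this report and summarises them. It uses only the Python standard library and is not the profiler's own implementation.

```python
from statistics import mean, median

# A handful of 구 분 values copied from the report.
values = ["천안시", "아산시", "당진시", "서산시", "논산시"]

# len() counts Unicode code points, so each Hangul syllable counts as one character.
lengths = [len(v) for v in values]

summary = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(summary)
```

Every value here is three characters long, so all four statistics coincide, as they do in the report.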

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 천안시
2nd row: 천안시
3rd row: 천안시
4th row: 천안시
5th row: 천안시

Common Values

Value | Count | Frequency (%)
천안시 | 1663 | 25.6%
아산시 | 1016 | 15.7%
당진시 | 563 | 8.7%
서산시 | 558 | 8.6%
논산시 | 404 | 6.2%
보령시 | 402 | 6.2%
공주시 | 341 | 5.3%
예산군 | 305 | 4.7%
태안군 | 249 | 3.8%
홍성군 | 246 | 3.8%
Other values (5) | 745 | 11.5%
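
The frequency column is each count divided by the total number of observations. The sketch below recomputes it from the counts listed above using plain Python. The dictionary layout and the one-decimal formatting are assumptions made for illustration.

```python
# Counts copied from the "Common Values" table for the 구 분 column.
counts = {
    "천안시": 1663,
    "아산시": 1016,
    "당진시": 563,
    "서산시": 558,
    "논산시": 404,
    "보령시": 402,
    "공주시": 341,
    "예산군": 305,
    "태안군": 249,
    "홍성군": 246,
    "Other values (5)": 745,
}

# The listed counts should add up to the number of observations in the overview.
total = sum(counts.values())

# Print value, count, and share of the total formatted to one decimal place.
for value, count in counts.items():
    print(f"{value}\t{count}\t{100 * count / total:.1f}%")
```

The counts sum to 6492, which matches the number of observations given in the overview.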

Length

[Figure: Histogram of lengths of the category]

소독의무대상시설명
Text

Distinct: 6012
Distinct (%): 92.6%
Missing: 0
Missing (%): 0.0%
Memory size: 50.8 KiB

Length

Max length: 37
Median length: 30
Mean length: 7.8817006
Min length: 1

Characters and Unicode

Total characters: 51168
Distinct characters: 825
Distinct categories: 12
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
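
To illustrate what those character properties look like in practice, the sketch below tallies Unicode general categories for two facility names that appear in this report, using Python's standard `unicodedata` module. The helper name `category_counts` and the short-code-to-name mapping are illustrative; this is not the profiler's own code.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in the sample below.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in values:
        for ch in text:
            counts[unicodedata.category(ch)] += 1
    return counts


# Two facility names taken from the report's sample rows.
sample = ["신원모텔", "이화마을(1차)"]
counts = category_counts(sample)

for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the tables that follow.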

Unique

Unique: 5597
Unique (%): 86.2%

Sample

1st row: 신원모텔
2nd row: 능수모텔
3rd row: 유성장여관
4th row: 오룡여관
5th row: 호수모텔

Value | Count | Frequency (%)
주식회사 | 44 | 0.6%
기숙사 | 37 | 0.5%
모텔 | 36 | 0.5%
어린이집 | 29 | 0.4%
충청남도 | 22 | 0.3%
의료법인 | 20 | 0.3%
호텔 | 20 | 0.3%
주)아워홈 | 13 | 0.2%
건물 | 10 | 0.1%
주)현대그린푸드 | 9 | 0.1%
Other values (6428) | 7359 | 96.8%

Most occurring characters

ValueCountFrequency (%)
1311
 
2.6%
1182
 
2.3%
1166
 
2.3%
1150
 
2.2%
) 999
 
2.0%
( 998
 
2.0%
903
 
1.8%
820
 
1.6%
814
 
1.6%
812
 
1.6%
Other values (815) 41013
80.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 46356 | 90.6%
Space Separator | 1166 | 2.3%
Close Punctuation | 999 | 2.0%
Open Punctuation | 998 | 2.0%
Decimal Number | 710 | 1.4%
Uppercase Letter | 551 | 1.1%
Lowercase Letter | 215 | 0.4%
Other Punctuation | 72 | 0.1%
Other Symbol | 69 | 0.1%
Dash Punctuation | 26 | 0.1%
Other values (2) | 6 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1311
 
2.8%
1182
 
2.5%
1150
 
2.5%
903
 
1.9%
820
 
1.8%
814
 
1.8%
812
 
1.8%
799
 
1.7%
758
 
1.6%
744
 
1.6%
Other values (735) 37063
80.0%
Uppercase Letter
ValueCountFrequency (%)
T 52
 
9.4%
K 45
 
8.2%
S 43
 
7.8%
C 42
 
7.6%
A 38
 
6.9%
L 33
 
6.0%
D 33
 
6.0%
H 31
 
5.6%
E 28
 
5.1%
O 26
 
4.7%
Other values (16) 180
32.7%
Lowercase Letter
ValueCountFrequency (%)
e 46
21.4%
a 19
 
8.8%
l 17
 
7.9%
s 16
 
7.4%
o 14
 
6.5%
t 12
 
5.6%
r 11
 
5.1%
n 10
 
4.7%
u 9
 
4.2%
i 9
 
4.2%
Other values (15) 52
24.2%
Decimal Number
ValueCountFrequency (%)
2 174
24.5%
1 174
24.5%
3 75
10.6%
0 60
 
8.5%
7 48
 
6.8%
8 43
 
6.1%
5 40
 
5.6%
4 39
 
5.5%
6 31
 
4.4%
9 26
 
3.7%
Other Punctuation
ValueCountFrequency (%)
. 19
26.4%
& 13
18.1%
/ 13
18.1%
, 11
15.3%
· 10
13.9%
: 2
 
2.8%
" 2
 
2.8%
! 1
 
1.4%
' 1
 
1.4%
Letter Number
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Space Separator
ValueCountFrequency (%)
1166
100.0%
Close Punctuation
ValueCountFrequency (%)
) 999
100.0%
Open Punctuation
ValueCountFrequency (%)
( 998
100.0%
Other Symbol
ValueCountFrequency (%)
69
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Math Symbol
ValueCountFrequency (%)
< 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 46424 | 90.7%
Common | 3973 | 7.8%
Latin | 770 | 1.5%
Han | 1 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1311
 
2.8%
1182
 
2.5%
1150
 
2.5%
903
 
1.9%
820
 
1.8%
814
 
1.8%
812
 
1.7%
799
 
1.7%
758
 
1.6%
744
 
1.6%
Other values (735) 37131
80.0%
Latin
ValueCountFrequency (%)
T 52
 
6.8%
e 46
 
6.0%
K 45
 
5.8%
S 43
 
5.6%
C 42
 
5.5%
A 38
 
4.9%
L 33
 
4.3%
D 33
 
4.3%
H 31
 
4.0%
E 28
 
3.6%
Other values (45) 379
49.2%
Common
ValueCountFrequency (%)
1166
29.3%
) 999
25.1%
( 998
25.1%
2 174
 
4.4%
1 174
 
4.4%
3 75
 
1.9%
0 60
 
1.5%
7 48
 
1.2%
8 43
 
1.1%
5 40
 
1.0%
Other values (14) 196
 
4.9%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 46355 | 90.6%
ASCII | 4729 | 9.2%
None | 79 | 0.2%
Number Forms | 4 | < 0.1%
CJK | 1 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1311
 
2.8%
1182
 
2.5%
1150
 
2.5%
903
 
1.9%
820
 
1.8%
814
 
1.8%
812
 
1.8%
799
 
1.7%
758
 
1.6%
744
 
1.6%
Other values (734) 37062
80.0%
ASCII
ValueCountFrequency (%)
1166
24.7%
) 999
21.1%
( 998
21.1%
2 174
 
3.7%
1 174
 
3.7%
3 75
 
1.6%
0 60
 
1.3%
T 52
 
1.1%
7 48
 
1.0%
e 46
 
1.0%
Other values (64) 937
19.8%
None
ValueCountFrequency (%)
69
87.3%
· 10
 
12.7%
Number Forms
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
CJK
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct: 5832
Distinct (%): 89.8%
Missing: 0
Missing (%): 0.0%
Memory size: 50.8 KiB

Length

Max length: 79
Median length: 59
Mean length: 24.430838
Min length: 13

Characters and Unicode

Total characters: 158605
Distinct characters: 510
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5265
Unique (%): 81.1%

Sample

1st row: 충청남도 천안시 동남구 원성1길 29 (원성동)
2nd row: 충청남도 천안시 동남구 충절로 189 (원성동)
3rd row: 충청남도 천안시 동남구 공설시장3길 14 (대흥동)
4th row: 충청남도 천안시 동남구 중앙로 75-7 (오룡동)
5th row: 충청남도 천안시 동남구 망향로 111 (안서동)

Value | Count | Frequency (%)
충청남도 | 6467 | 18.5%
천안시 | 1662 | 4.7%
아산시 | 1015 | 2.9%
서북구 | 1001 | 2.9%
동남구 | 653 | 1.9%
당진시 | 563 | 1.6%
서산시 | 557 | 1.6%
논산시 | 404 | 1.2%
보령시 | 402 | 1.1%
1층 | 343 | 1.0%
Other values (5781) | 21937 | 62.7%

Most occurring characters

ValueCountFrequency (%)
28991
 
18.3%
7474
 
4.7%
6879
 
4.3%
6726
 
4.2%
6629
 
4.2%
1 5929
 
3.7%
5291
 
3.3%
4262
 
2.7%
2 3723
 
2.3%
3706
 
2.3%
Other values (500) 78995
49.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 97064 | 61.2%
Space Separator | 28991 | 18.3%
Decimal Number | 24430 | 15.4%
Open Punctuation | 2215 | 1.4%
Close Punctuation | 2213 | 1.4%
Dash Punctuation | 1801 | 1.1%
Other Punctuation | 1569 | 1.0%
Uppercase Letter | 173 | 0.1%
Math Symbol | 139 | 0.1%
Lowercase Letter | 9 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7474
 
7.7%
6879
 
7.1%
6726
 
6.9%
6629
 
6.8%
5291
 
5.5%
4262
 
4.4%
3706
 
3.8%
3508
 
3.6%
2740
 
2.8%
2540
 
2.6%
Other values (455) 47309
48.7%
Uppercase Letter
ValueCountFrequency (%)
B 43
24.9%
A 27
15.6%
L 16
 
9.2%
C 13
 
7.5%
S 13
 
7.5%
H 9
 
5.2%
D 6
 
3.5%
E 5
 
2.9%
I 5
 
2.9%
F 5
 
2.9%
Other values (11) 31
17.9%
Decimal Number
ValueCountFrequency (%)
1 5929
24.3%
2 3723
15.2%
3 2766
11.3%
4 2045
 
8.4%
5 2019
 
8.3%
7 1721
 
7.0%
0 1713
 
7.0%
6 1684
 
6.9%
8 1502
 
6.1%
9 1328
 
5.4%
Lowercase Letter
ValueCountFrequency (%)
e 4
44.4%
s 2
22.2%
l 2
22.2%
a 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 1554
99.0%
. 10
 
0.6%
/ 5
 
0.3%
Math Symbol
ValueCountFrequency (%)
~ 138
99.3%
1
 
0.7%
Space Separator
ValueCountFrequency (%)
28991
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2215
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2213
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1801
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 97065 | 61.2%
Common | 61358 | 38.7%
Latin | 182 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7474
 
7.7%
6879
 
7.1%
6726
 
6.9%
6629
 
6.8%
5291
 
5.5%
4262
 
4.4%
3706
 
3.8%
3508
 
3.6%
2740
 
2.8%
2540
 
2.6%
Other values (456) 47310
48.7%
Latin
ValueCountFrequency (%)
B 43
23.6%
A 27
14.8%
L 16
 
8.8%
C 13
 
7.1%
S 13
 
7.1%
H 9
 
4.9%
D 6
 
3.3%
E 5
 
2.7%
I 5
 
2.7%
F 5
 
2.7%
Other values (15) 40
22.0%
Common
ValueCountFrequency (%)
28991
47.2%
1 5929
 
9.7%
2 3723
 
6.1%
3 2766
 
4.5%
( 2215
 
3.6%
) 2213
 
3.6%
4 2045
 
3.3%
5 2019
 
3.3%
- 1801
 
2.9%
7 1721
 
2.8%
Other values (9) 7935
 
12.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 97064 | 61.2%
ASCII | 61539 | 38.8%
Math Operators | 1 | < 0.1%
None | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28991
47.1%
1 5929
 
9.6%
2 3723
 
6.0%
3 2766
 
4.5%
( 2215
 
3.6%
) 2213
 
3.6%
4 2045
 
3.3%
5 2019
 
3.3%
- 1801
 
2.9%
7 1721
 
2.8%
Other values (33) 8116
 
13.2%
Hangul
ValueCountFrequency (%)
7474
 
7.7%
6879
 
7.1%
6726
 
6.9%
6629
 
6.8%
5291
 
5.5%
4262
 
4.4%
3706
 
3.8%
3508
 
3.6%
2740
 
2.8%
2540
 
2.6%
Other values (455) 47309
48.7%
Math Operators
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
1
100.0%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
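
For readers who want to reproduce a per-column nullity count without the plots, here is a minimal standard-library sketch. The first two rows are copied from this report's sample. The third row, including the facility name 예시시설, is hypothetical and was added only so that a missing cell shows up, since the profiled dataset itself reports zero missing cells.

```python
# Column names as they appear in the report.
columns = ["구 분", "소독의무대상시설명", "주소"]

rows = [
    ("천안시", "신원모텔", "충청남도 천안시 동남구 원성1길 29 (원성동)"),
    ("천안시", "능수모텔", "충청남도 천안시 동남구 충절로 189 (원성동)"),
    # Hypothetical row with a missing address, for illustration only.
    ("천안시", "예시시설", None),
]

# Count None cells per column.
missing_per_column = {
    name: sum(1 for row in rows if row[i] is None)
    for i, name in enumerate(columns)
}

# Overall missing-cell count and share, as in the overview statistics.
missing_cells = sum(missing_per_column.values())
missing_pct = 100 * missing_cells / (len(rows) * len(columns))

print(missing_per_column, missing_cells, f"{missing_pct:.1f}%")
```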

Sample

First rows

Index | 구 분 | 소독의무대상시설명 | 주소
0 | 천안시 | 신원모텔 | 충청남도 천안시 동남구 원성1길 29 (원성동)
1 | 천안시 | 능수모텔 | 충청남도 천안시 동남구 충절로 189 (원성동)
2 | 천안시 | 유성장여관 | 충청남도 천안시 동남구 공설시장3길 14 (대흥동)
3 | 천안시 | 오룡여관 | 충청남도 천안시 동남구 중앙로 75-7 (오룡동)
4 | 천안시 | 호수모텔 | 충청남도 천안시 동남구 망향로 111 (안서동)
5 | 천안시 | 화이트텔 | 충청남도 천안시 동남구 서부역1길 14-1 (봉명동)
6 | 천안시 | 토마토모텔 | 충청남도 천안시 동남구 동면 충절로 2303
7 | 천안시 | 폴라리스 | 충청남도 천안시 동남구 목천읍 종합휴양지2길 12, 2~3층
8 | 천안시 | 시크릿모텔 | 충청남도 천안시 동남구 신부3길 29 (신부동)
9 | 천안시 | 광명장여관 | 충청남도 천안시 동남구 중앙로 151 (성황동)

Last rows

Index | 구 분 | 소독의무대상시설명 | 주소
6482 | 태안군 | 이화마을(1차) | 충청남도 태안군 원북면 원이로 820-32
6483 | 태안군 | 해송마을(1차) | 충청남도 태안군 태안읍 원이로 302
6484 | 태안군 | 동문주공(1차) | 충청남도 태안군 태안읍 동문7길 20
6485 | 태안군 | 동문주공(2차) | 충청남도 태안군 태안읍 동문7길 20
6486 | 태안군 | 진흥더블파크 | 충청남도 태안군 태안읍 군청10길 14
6487 | 태안군 | 평천 휴먼시아 | 충청남도 태안군 태안읍 동평로 42
6488 | 태안군 | 남문코아루 | 충청남도 태안군 태안읍 후곡로 16
6489 | 태안군 | 새빛마을 | 충청남도 태안군 태안읍 동평로 16
6490 | 태안군 | 동문코아루 | 충청남도 태안군 태안읍 동평로 45
6491 | 태안군 | 남문미소지움 | 충청남도 태안군 태안읍 환동길 43-12

Duplicate rows

Most frequently occurring

Index | 구 분 | 소독의무대상시설명 | 주소 | # duplicates
2 | 논산시 | 강경고등학교 | 충청남도 논산시 강경읍 계백로 188 | 3
31 | 논산시 | 충남인터넷고등학교 | 충청남도 논산시 연산면 계백로 1958-16 | 3
56 | 예산군 | 대흥고등학교 | 충청남도 예산군 대흥면 예당로 845 | 3
67 | 예산군 | 예산고등학교 | 충청남도 예산군 예산읍 예산로 101 | 3
143 | 천안시 | 오페라웨딩홀뷔페 | 충청남도 천안시 동남구 원거리14길 4 (원성동) | 3
0 | 금산군 | 금산고등학교 | 충청남도 금산군 금산읍 탑선길 5 | 2
1 | 금산군 | 금산효사랑요양병원 | 충청남도 금산군 남일면 무금로 2145 | 2
3 | 논산시 | 강경산양초등학교 | 충청남도 논산시 강경읍 산양길 45 | 2
4 | 논산시 | 강경상업고등학교 | 충청남도 논산시 계백로 220 (남교리 1번지) | 2
5 | 논산시 | 강경여자중학교 | 충청남도 논산시 강경읍 계백로 200 | 2
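
A table like this can be produced by grouping identical rows and keeping the groups that occur more than once. The sketch below does so with `collections.Counter` on placeholder rows. The row values are hypothetical, and the section does not say which counting convention lies behind the report's figure of 236 duplicate rows, so both common conventions are computed.

```python
from collections import Counter

# Hypothetical placeholder rows: (구 분, 소독의무대상시설명, 주소).
rows = [
    ("A시", "시설1", "주소1"),
    ("A시", "시설1", "주소1"),
    ("A시", "시설1", "주소1"),
    ("A시", "시설2", "주소2"),
    ("A시", "시설2", "주소2"),
    ("B군", "시설3", "주소3"),
]

# Number of occurrences of each distinct row.
group_sizes = Counter(rows)

# Keep only rows that occur more than once, most frequent first.
duplicates = {row: n for row, n in group_sizes.items() if n > 1}
most_frequent = sorted(duplicates.items(), key=lambda item: item[1], reverse=True)

# Convention 1: number of distinct rows that are repeated.
n_duplicate_groups = len(duplicates)
# Convention 2: number of surplus copies beyond the first occurrence.
n_extra_copies = sum(n - 1 for n in duplicates.values())

for row, n in most_frequent:
    print(*row, n, sep=" | ")
```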