Overview

Dataset statistics

Number of variables3
Number of observations6497
Missing cells0
Missing cells (%)0.0%
Duplicate rows122
Duplicate rows (%)1.9%
Total size in memory152.4 KiB
Average record size in memory24.0 B

Variable types

Categorical1
Text2

Dataset

Description2024년 충청남도 소독의무대상시설에 대한 명단을 제공하는 것으로 소재지, 시설명, 주소 등의 데이터를 제공합니다.
Author충청남도
URLhttps://www.data.go.kr/data/15069638/fileData.do

Alerts

Dataset has 122 (1.9%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-14 23:47:35.698441
Analysis finished2024-03-14 23:47:36.997537
Duration1.3 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구 분
Categorical

Distinct15
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size50.9 KiB
천안시
1508 
아산시
1014 
당진시
617 
서산시
610 
논산시
410 
Other values (10)
2338 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row천안시
2nd row천안시
3rd row천안시
4th row천안시
5th row천안시

Common Values

ValueCountFrequency (%)
천안시 1508
23.2%
아산시 1014
15.6%
당진시 617
9.5%
서산시 610
9.4%
논산시 410
 
6.3%
보령시 378
 
5.8%
공주시 345
 
5.3%
홍성군 324
 
5.0%
태안군 251
 
3.9%
예산군 248
 
3.8%
Other values (5) 792
12.2%

Length

2024-03-15T08:47:37.112416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
천안시 1508
23.2%
아산시 1014
15.6%
당진시 617
9.5%
서산시 610
9.4%
논산시 410
 
6.3%
보령시 378
 
5.8%
공주시 345
 
5.3%
홍성군 324
 
5.0%
태안군 251
 
3.9%
예산군 248
 
3.8%
Other values (5) 792
12.2%
Distinct6055
Distinct (%)93.2%
Missing0
Missing (%)0.0%
Memory size50.9 KiB
2024-03-15T08:47:38.116957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length35
Mean length7.8119132
Min length1

Characters and Unicode

Total characters50754
Distinct characters829
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5668 ?
Unique (%)87.2%

Sample

1st row천안메트로관광호텔
2nd row신라스테이 천안
3rd row슈바이처도르프
4th row베니키아 천안관광호텔
5th row천안상록리조트
ValueCountFrequency (%)
기숙사 78
 
1.0%
건물 40
 
0.5%
주식회사 40
 
0.5%
모텔 34
 
0.4%
급식소 31
 
0.4%
충청남도 30
 
0.4%
어린이집 26
 
0.3%
호텔 24
 
0.3%
의료법인 22
 
0.3%
서산 13
 
0.2%
Other values (6462) 7470
95.7%
2024-03-15T08:47:39.916643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1331
 
2.6%
1253
 
2.5%
1224
 
2.4%
1160
 
2.3%
840
 
1.7%
829
 
1.6%
826
 
1.6%
) 800
 
1.6%
( 797
 
1.6%
793
 
1.6%
Other values (819) 40901
80.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46019
90.7%
Space Separator 1331
 
2.6%
Close Punctuation 800
 
1.6%
Open Punctuation 797
 
1.6%
Decimal Number 719
 
1.4%
Uppercase Letter 593
 
1.2%
Lowercase Letter 221
 
0.4%
Other Punctuation 118
 
0.2%
Other Symbol 93
 
0.2%
Dash Punctuation 45
 
0.1%
Other values (4) 18
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1253
 
2.7%
1224
 
2.7%
1160
 
2.5%
840
 
1.8%
829
 
1.8%
826
 
1.8%
793
 
1.7%
790
 
1.7%
760
 
1.7%
749
 
1.6%
Other values (738) 36795
80.0%
Uppercase Letter
ValueCountFrequency (%)
T 59
 
9.9%
S 48
 
8.1%
K 47
 
7.9%
C 46
 
7.8%
A 38
 
6.4%
L 38
 
6.4%
D 37
 
6.2%
E 33
 
5.6%
H 33
 
5.6%
O 29
 
4.9%
Other values (16) 185
31.2%
Lowercase Letter
ValueCountFrequency (%)
e 46
20.8%
a 20
 
9.0%
s 15
 
6.8%
l 15
 
6.8%
r 13
 
5.9%
o 13
 
5.9%
t 12
 
5.4%
u 10
 
4.5%
i 9
 
4.1%
n 9
 
4.1%
Other values (15) 59
26.7%
Decimal Number
ValueCountFrequency (%)
2 175
24.3%
1 171
23.8%
0 69
 
9.6%
3 67
 
9.3%
7 56
 
7.8%
4 46
 
6.4%
8 42
 
5.8%
5 41
 
5.7%
6 27
 
3.8%
9 25
 
3.5%
Other Punctuation
ValueCountFrequency (%)
? 46
39.0%
. 19
16.1%
, 18
 
15.3%
& 16
 
13.6%
/ 8
 
6.8%
· 7
 
5.9%
: 2
 
1.7%
! 1
 
0.8%
' 1
 
0.8%
Math Symbol
ValueCountFrequency (%)
~ 2
66.7%
+ 1
33.3%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
1331
100.0%
Close Punctuation
ValueCountFrequency (%)
) 800
100.0%
Open Punctuation
ValueCountFrequency (%)
( 797
100.0%
Other Symbol
ValueCountFrequency (%)
93
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 45
100.0%
Control
ValueCountFrequency (%)
11
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46111
90.9%
Common 3826
 
7.5%
Latin 816
 
1.6%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1253
 
2.7%
1224
 
2.7%
1160
 
2.5%
840
 
1.8%
829
 
1.8%
826
 
1.8%
793
 
1.7%
790
 
1.7%
760
 
1.6%
749
 
1.6%
Other values (738) 36887
80.0%
Latin
ValueCountFrequency (%)
T 59
 
7.2%
S 48
 
5.9%
K 47
 
5.8%
e 46
 
5.6%
C 46
 
5.6%
A 38
 
4.7%
L 38
 
4.7%
D 37
 
4.5%
E 33
 
4.0%
H 33
 
4.0%
Other values (43) 391
47.9%
Common
ValueCountFrequency (%)
1331
34.8%
) 800
20.9%
( 797
20.8%
2 175
 
4.6%
1 171
 
4.5%
0 69
 
1.8%
3 67
 
1.8%
7 56
 
1.5%
? 46
 
1.2%
4 46
 
1.2%
Other values (17) 268
 
7.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46018
90.7%
ASCII 4633
 
9.1%
None 100
 
0.2%
Number Forms 2
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1331
28.7%
) 800
17.3%
( 797
17.2%
2 175
 
3.8%
1 171
 
3.7%
0 69
 
1.5%
3 67
 
1.4%
T 59
 
1.3%
7 56
 
1.2%
S 48
 
1.0%
Other values (67) 1060
22.9%
Hangul
ValueCountFrequency (%)
1253
 
2.7%
1224
 
2.7%
1160
 
2.5%
840
 
1.8%
829
 
1.8%
826
 
1.8%
793
 
1.7%
790
 
1.7%
760
 
1.7%
749
 
1.6%
Other values (737) 36794
80.0%
None
ValueCountFrequency (%)
93
93.0%
· 7
 
7.0%
CJK
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%

주소
Text

Distinct5979
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Memory size50.9 KiB
2024-03-15T08:47:41.127355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length324
Median length57
Mean length24.262275
Min length10

Characters and Unicode

Total characters157632
Distinct characters514
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5555 ?
Unique (%)85.5%

Sample

1st row충청남도 천안시 동남구 대흥로 241-11(대흥동)
2nd row충청남도 천안시 서북구 동서대로 177 (성정동)
3rd row충청남도 천안시 서북구 성정공원5로 42(성정동)
4th row충청남도 천안시 서북구 양지21길 42(성정동)
5th row충청남도 천안시 동남구 수신면 수신로 576
ValueCountFrequency (%)
충청남도 6385
 
18.3%
천안시 1502
 
4.3%
아산시 1015
 
2.9%
서북구 908
 
2.6%
당진시 617
 
1.8%
서산시 606
 
1.7%
동남구 597
 
1.7%
논산시 410
 
1.2%
1층 365
 
1.0%
보령시 350
 
1.0%
Other values (5931) 22052
63.4%
2024-03-15T08:47:42.875496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28854
 
18.3%
7421
 
4.7%
6795
 
4.3%
6719
 
4.3%
6558
 
4.2%
1 6044
 
3.8%
5224
 
3.3%
4213
 
2.7%
3728
 
2.4%
2 3706
 
2.4%
Other values (504) 78370
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 95928
60.9%
Space Separator 28854
 
18.3%
Decimal Number 24742
 
15.7%
Open Punctuation 2146
 
1.4%
Close Punctuation 2144
 
1.4%
Dash Punctuation 1879
 
1.2%
Other Punctuation 1625
 
1.0%
Uppercase Letter 154
 
0.1%
Math Symbol 149
 
0.1%
Lowercase Letter 7
 
< 0.1%
Other values (2) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7421
 
7.7%
6795
 
7.1%
6719
 
7.0%
6558
 
6.8%
5224
 
5.4%
4213
 
4.4%
3728
 
3.9%
3365
 
3.5%
2604
 
2.7%
2455
 
2.6%
Other values (458) 46846
48.8%
Uppercase Letter
ValueCountFrequency (%)
B 41
26.6%
A 25
16.2%
L 15
 
9.7%
C 12
 
7.8%
S 11
 
7.1%
H 7
 
4.5%
D 6
 
3.9%
O 4
 
2.6%
M 4
 
2.6%
K 4
 
2.6%
Other values (10) 25
16.2%
Decimal Number
ValueCountFrequency (%)
1 6044
24.4%
2 3706
15.0%
3 2796
11.3%
4 2053
 
8.3%
5 2047
 
8.3%
0 1782
 
7.2%
7 1743
 
7.0%
6 1706
 
6.9%
8 1518
 
6.1%
9 1347
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 1599
98.4%
. 15
 
0.9%
/ 9
 
0.6%
? 2
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
e 3
42.9%
l 2
28.6%
s 1
 
14.3%
a 1
 
14.3%
Math Symbol
ValueCountFrequency (%)
~ 148
99.3%
1
 
0.7%
Space Separator
ValueCountFrequency (%)
28854
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2146
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2144
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1879
100.0%
Control
ValueCountFrequency (%)
3
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 95929
60.9%
Common 61542
39.0%
Latin 161
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7421
 
7.7%
6795
 
7.1%
6719
 
7.0%
6558
 
6.8%
5224
 
5.4%
4213
 
4.4%
3728
 
3.9%
3365
 
3.5%
2604
 
2.7%
2455
 
2.6%
Other values (459) 46847
48.8%
Latin
ValueCountFrequency (%)
B 41
25.5%
A 25
15.5%
L 15
 
9.3%
C 12
 
7.5%
S 11
 
6.8%
H 7
 
4.3%
D 6
 
3.7%
O 4
 
2.5%
M 4
 
2.5%
K 4
 
2.5%
Other values (14) 32
19.9%
Common
ValueCountFrequency (%)
28854
46.9%
1 6044
 
9.8%
2 3706
 
6.0%
3 2796
 
4.5%
( 2146
 
3.5%
) 2144
 
3.5%
4 2053
 
3.3%
5 2047
 
3.3%
- 1879
 
3.1%
0 1782
 
2.9%
Other values (11) 8091
 
13.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 95927
60.9%
ASCII 61702
39.1%
None 1
 
< 0.1%
Math Operators 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28854
46.8%
1 6044
 
9.8%
2 3706
 
6.0%
3 2796
 
4.5%
( 2146
 
3.5%
) 2144
 
3.5%
4 2053
 
3.3%
5 2047
 
3.3%
- 1879
 
3.0%
0 1782
 
2.9%
Other values (34) 8251
 
13.4%
Hangul
ValueCountFrequency (%)
7421
 
7.7%
6795
 
7.1%
6719
 
7.0%
6558
 
6.8%
5224
 
5.4%
4213
 
4.4%
3728
 
3.9%
3365
 
3.5%
2604
 
2.7%
2455
 
2.6%
Other values (457) 46845
48.8%
None
ValueCountFrequency (%)
1
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Missing values

2024-03-15T08:47:36.801252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T08:47:36.935616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구 분소독의무대상시설명주소
0천안시천안메트로관광호텔충청남도 천안시 동남구 대흥로 241-11(대흥동)
1천안시신라스테이 천안충청남도 천안시 서북구 동서대로 177 (성정동)
2천안시슈바이처도르프충청남도 천안시 서북구 성정공원5로 42(성정동)
3천안시베니키아 천안관광호텔충청남도 천안시 서북구 양지21길 42(성정동)
4천안시천안상록리조트충청남도 천안시 동남구 수신면 수신로 576
5천안시소노벨 천안충청남도 천안시 동남구 성남면 종합휴양지로 200
6천안시단우 호스텔충청남도 천안시 서북구 검은들1길 19, 7층
7천안시신원모텔충청남도 천안시 동남구 원성1길 29 (원성동)
8천안시화성장여관충청남도 천안시 동남구 자유시장2길 7 (성황동)
9천안시능수모텔충청남도 천안시 동남구 충절로 189 (원성동)
구 분소독의무대상시설명주소
6487태안군해송마을(1차)충청남도 태안군 태안읍 원이로 302
6488태안군동문주공(1차)충청남도 태안군 태안읍 동문7길 20
6489태안군동문주공(2차)충청남도 태안군 태안읍 동문7길 20
6490태안군진흥더블파크충청남도 태안군 태안읍 군청10길 14
6491태안군평천 휴먼시아충청남도 태안군 태안읍 동평로 42
6492태안군남문코아루충청남도 태안군 태안읍 후곡로 16
6493태안군새빛마을충청남도 태안군 태안읍 동평로 16
6494태안군동문코아루충청남도 태안군 태안읍 동평로 45
6495태안군남문미소지움충청남도 태안군 태안읍 환동길 43-12
6496태안군태안평천3단지아파트충청남도 태안군 태안읍 동평로 32

Duplicate rows

Most frequently occurring

구 분소독의무대상시설명주소# duplicates
2논산시강경고등학교충청남도 논산시 강경읍 계백로 1883
31논산시충남인터넷고등학교충청남도 논산시 연산면 계백로 1958-163
82천안시오페라웨딩홀뷔페충청남도 천안시 동남구 원거리14길 4 (원성동)3
0공주시한국영상대학교부설유치원충청남도 공주시 정안면 화봉평정길 482
1금산군농업기술센터충청남도 금산군 금성면 의총길 252
3논산시강경산양초등학교충청남도 논산시 강경읍 산양길 452
4논산시강경상업고등학교충청남도 논산시 계백로 220 (남교리 1번지)2
5논산시강경여자중학교충청남도 논산시 강경읍 계백로 2002
6논산시강경중앙초등학교충청남도 논산시 강경읍 옥녀봉로 82
7논산시강경중학교충청남도 논산시 강경읍 계백로 282