Overview

Dataset statistics

Number of variables: 5
Number of observations: 1888
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 2
Duplicate rows (%): 0.1%
Total size in memory: 75.7 KiB
Average record size in memory: 41.1 B
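
The derived figures in this block follow from the raw counts by simple arithmetic. The sketch below recomputes the duplicate-row percentage and the average record size from the reported totals; it assumes the report uses 1 KiB = 1024 B and rounds to one decimal place, which matches the printed values but is not confirmed by the report itself.

```python
# Figures copied from the Dataset statistics block.
n_observations = 1888
duplicate_rows = 2
total_size_kib = 75.7

# Share of duplicate rows, as a percentage of all observations.
duplicate_pct = duplicate_rows / n_observations * 100

# Average record size: total bytes (assuming 1 KiB = 1024 B) per row.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```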

Variable types

Categorical: 3
Text: 2

Dataset

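Description: Results of the public hygiene service evaluation, which inspects the level of hygiene service at barber shops, beauty salons, and similar businesses located in Ansan-si, Gyeonggi-do. Scores on the evaluation checklist are converted to a 100-point scale: 90 points or more is graded green (녹색), 80 to under 90 points yellow (황색), and under 80 points white (백색). The dataset provides the evaluation year (평가연도), business type (업종), business name (업소명), address (소재지), and grade (등급).
URL: https://www.data.go.kr/data/15068676/fileData.do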
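
The grading rule in the description (90 or more is 녹색/green, 80 to under 90 is 황색/yellow, under 80 is 백색/white) can be written as a small function. This is only an illustrative sketch: the dataset publishes grades, not scores, and the function name `grade` and the range check are assumptions added here.

```python
def grade(score: float) -> str:
    """Map a score on the 100-point scale to the grade label used in 등급."""
    if not 0 <= score <= 100:
        raise ValueError("score must be between 0 and 100")
    if score >= 90:
        return "녹색"  # green: 90 points or more
    if score >= 80:
        return "황색"  # yellow: 80 to under 90 points
    return "백색"      # white: under 80 points


for s in (95, 85, 70):
    print(s, grade(s))
```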

Alerts

Dataset has 2 (0.1%) duplicate rows | Duplicates
평가연도 is highly overall correlated with 업종 | High correlation
업종 is highly overall correlated with 평가연도 | High correlation

Reproduction

Analysis started: 2023-12-13 00:43:05.256336
Analysis finished: 2023-12-13 00:43:05.821491
Duration: 0.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

평가연도
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.9 KiB
2021: 1445
2022: 443

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022
2nd row: 2022
3rd row: 2022
4th row: 2022
5th row: 2022

Common Values

Value | Count | Frequency (%)
2021 | 1445 | 76.5%
2022 | 443 | 23.5%
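
The frequency column is each count divided by the number of observations. As a minimal sketch, the table can be rebuilt from the two counts with `collections.Counter`; rounding to one decimal place is an assumption that happens to reproduce the printed percentages.

```python
from collections import Counter

# Counts for 평가연도 as reported in the Common Values table.
counts = Counter({"2021": 1445, "2022": 443})
total = sum(counts.values())

# One (value, count, percentage) tuple per category, most frequent first.
rows = [(value, n, round(n / total * 100, 1)) for value, n in counts.most_common()]

for value, n, pct in rows:
    print(f"{value} | {n} | {pct}%")
```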

Length

Histogram of lengths of the category


업종
Categorical

HIGH CORRELATION

Distinct: 8
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 14.9 KiB
미용업(종합): 870
세탁업: 259
미용업(피부): 240
미용업(네일): 191
숙박업: 154
Other values (3): 174

Length

Max length: 9
Median length: 7
Mean length: 6.095339
Min length: 3
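
These length statistics can be reproduced from the value counts of 업종 alone, since every row holds one of eight category labels. The sketch below expands the counts into one string length per row and summarizes them; it assumes length means the number of Unicode code points (Python's `len`) and that the parentheses in the labels are ASCII.

```python
from statistics import mean, median

# Value counts for 업종, copied from the Common Values table of this variable.
value_counts = {
    "미용업(종합)": 870,
    "세탁업": 259,
    "미용업(피부)": 240,
    "미용업(네일)": 191,
    "숙박업": 154,
    "미용업(일반)": 127,
    "목욕장업": 30,
    "미용업(화장분장)": 17,
}

# One length per observation: repeat each label's length by its count.
lengths = [len(value) for value, n in value_counts.items() for _ in range(n)]

stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 6),
    "min": min(lengths),
}
print(stats)
```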

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 숙박업
2nd row: 숙박업
3rd row: 숙박업
4th row: 숙박업
5th row: 숙박업

Common Values

Value | Count | Frequency (%)
미용업(종합) | 870 | 46.1%
세탁업 | 259 | 13.7%
미용업(피부) | 240 | 12.7%
미용업(네일) | 191 | 10.1%
숙박업 | 154 | 8.2%
미용업(일반) | 127 | 6.7%
목욕장업 | 30 | 1.6%
미용업(화장분장) | 17 | 0.9%

Length

Histogram of lengths of the category


업소명
Text

Distinct: 1783
Distinct (%): 94.4%
Missing: 0
Missing (%): 0.0%
Memory size: 14.9 KiB

Length

Max length: 35
Median length: 28
Mean length: 5.9698093
Min length: 1

Characters and Unicode

Total characters: 11271
Distinct characters: 674
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
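
As a small illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one business name taken from the Sample rows; the `CATEGORY_NAMES` mapping and the `category_counts` helper are names introduced here for illustration, and whether the report computes its category table this way is an assumption.

```python
import unicodedata
from collections import Counter

# Two-letter general category codes mapped to the labels used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(text: str) -> Counter:
    """Count characters of `text` by Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


counts = category_counts("힙(HIP)호텔")
print(counts)
```

Hangul syllables fall under Other Letter because the script has no upper or lower case, which is why that category dominates the tables for this variable.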

Unique

Unique: 1703
Unique (%): 90.2%
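
Distinct and Unique measure different things, which is why 1783 and 1703 differ. The sketch below assumes the usual reading: distinct counts the different values present, while unique counts the values that occur exactly once. The short list of names is a hypothetical sample, not the real column.

```python
from collections import Counter

# Hypothetical sample of business names; one name appears twice.
names = ["토리헤어", "토리헤어", "머리하는날", "마리호텔", "퀸즈호텔"]

freq = Counter(names)

# Distinct: how many different values occur at all.
n_distinct = len(freq)

# Unique: how many values occur exactly once.
n_unique = sum(1 for n in freq.values() if n == 1)

print(n_distinct, n_unique)
```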

Sample

1st row: 마리호텔
2nd row: 힙(HIP)호텔
3rd row: 퀸즈호텔
4th row: 호텔바인
5th row: 에비뉴나인호텔
Value | Count | Frequency (%)
nail | 10 | 0.5%
헤어 | 9 | 0.4%
hair | 9 | 0.4%
머리하는날 | 5 | 0.2%
토리헤어 | 5 | 0.2%
더헤어 | 4 | 0.2%
by | 4 | 0.2%
salon | 4 | 0.2%
세탁소 | 4 | 0.2%
명품세탁소 | 4 | 0.2%
Other values (1896) | 2010 | 97.2%

Most occurring characters

ValueCountFrequency (%)
641
 
5.7%
617
 
5.5%
259
 
2.3%
230
 
2.0%
226
 
2.0%
223
 
2.0%
219
 
1.9%
212
 
1.9%
205
 
1.8%
193
 
1.7%
Other values (664) 8246
73.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9649 | 85.6%
Lowercase Letter | 484 | 4.3%
Uppercase Letter | 451 | 4.0%
Space Separator | 181 | 1.6%
Open Punctuation | 171 | 1.5%
Close Punctuation | 171 | 1.5%
Decimal Number | 82 | 0.7%
Other Punctuation | 73 | 0.6%
Dash Punctuation | 6 | 0.1%
Math Symbol | 3 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
641
 
6.6%
617
 
6.4%
259
 
2.7%
230
 
2.4%
226
 
2.3%
223
 
2.3%
219
 
2.3%
212
 
2.2%
205
 
2.1%
193
 
2.0%
Other values (590) 6624
68.6%
Uppercase Letter
ValueCountFrequency (%)
N 47
 
10.4%
A 42
 
9.3%
I 35
 
7.8%
S 34
 
7.5%
H 27
 
6.0%
E 26
 
5.8%
T 24
 
5.3%
O 24
 
5.3%
M 23
 
5.1%
J 21
 
4.7%
Other values (16) 148
32.8%
Lowercase Letter
ValueCountFrequency (%)
a 71
14.7%
i 62
12.8%
e 48
9.9%
l 36
 
7.4%
r 33
 
6.8%
n 33
 
6.8%
o 28
 
5.8%
h 26
 
5.4%
y 23
 
4.8%
s 21
 
4.3%
Other values (14) 103
21.3%
Decimal Number
ValueCountFrequency (%)
2 22
26.8%
1 13
15.9%
5 11
13.4%
0 10
12.2%
9 7
 
8.5%
3 6
 
7.3%
7 6
 
7.3%
4 3
 
3.7%
6 3
 
3.7%
8 1
 
1.2%
Other Punctuation
ValueCountFrequency (%)
& 23
31.5%
. 20
27.4%
# 14
19.2%
, 7
 
9.6%
' 6
 
8.2%
* 1
 
1.4%
1
 
1.4%
/ 1
 
1.4%
Math Symbol
ValueCountFrequency (%)
~ 2
66.7%
+ 1
33.3%
Space Separator
ValueCountFrequency (%)
181
100.0%
Open Punctuation
ValueCountFrequency (%)
( 171
100.0%
Close Punctuation
ValueCountFrequency (%)
) 171
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 9645 | 85.6%
Latin | 935 | 8.3%
Common | 687 | 6.1%
Han | 4 | < 0.1%
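
Python's standard library does not expose the Unicode Script property directly, so the following is only a rough heuristic sketch: it guesses a script from the first word of each character's Unicode name. The helper names `rough_script` and `script_counts` are introduced here, and the heuristic will misclassify some characters (for example fullwidth Latin letters), so it should not be read as how the report derives this table.

```python
import unicodedata
from collections import Counter


def rough_script(ch: str) -> str:
    """Guess a script label from the Unicode character name (heuristic only)."""
    name = unicodedata.name(ch, "")
    if name.startswith("HANGUL"):
        return "Hangul"
    if name.startswith("LATIN"):
        return "Latin"
    if name.startswith("CJK"):
        return "Han"
    # Digits, spaces and punctuation are shared across scripts.
    return "Common"


def script_counts(text: str) -> Counter:
    """Count characters of `text` by guessed script."""
    return Counter(rough_script(ch) for ch in text)


scripts = script_counts("힙(HIP)호텔")
print(scripts)
```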

Most frequent character per script

Hangul
ValueCountFrequency (%)
641
 
6.6%
617
 
6.4%
259
 
2.7%
230
 
2.4%
226
 
2.3%
223
 
2.3%
219
 
2.3%
212
 
2.2%
205
 
2.1%
193
 
2.0%
Other values (588) 6620
68.6%
Latin
ValueCountFrequency (%)
a 71
 
7.6%
i 62
 
6.6%
e 48
 
5.1%
N 47
 
5.0%
A 42
 
4.5%
l 36
 
3.9%
I 35
 
3.7%
S 34
 
3.6%
r 33
 
3.5%
n 33
 
3.5%
Other values (40) 494
52.8%
Common
ValueCountFrequency (%)
181
26.3%
( 171
24.9%
) 171
24.9%
& 23
 
3.3%
2 22
 
3.2%
. 20
 
2.9%
# 14
 
2.0%
1 13
 
1.9%
5 11
 
1.6%
0 10
 
1.5%
Other values (14) 51
 
7.4%
Han
ValueCountFrequency (%)
3
75.0%
1
 
25.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9645 | 85.6%
ASCII | 1621 | 14.4%
CJK | 4 | < 0.1%
None | 1 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
641
 
6.6%
617
 
6.4%
259
 
2.7%
230
 
2.4%
226
 
2.3%
223
 
2.3%
219
 
2.3%
212
 
2.2%
205
 
2.1%
193
 
2.0%
Other values (588) 6620
68.6%
ASCII
ValueCountFrequency (%)
181
 
11.2%
( 171
 
10.5%
) 171
 
10.5%
a 71
 
4.4%
i 62
 
3.8%
e 48
 
3.0%
N 47
 
2.9%
A 42
 
2.6%
l 36
 
2.2%
I 35
 
2.2%
Other values (63) 757
46.7%
CJK
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
None
ValueCountFrequency (%)
1
100.0%

소재지
Text

Distinct: 1871
Distinct (%): 99.1%
Missing: 0
Missing (%): 0.0%
Memory size: 14.9 KiB

Length

Max length: 59
Median length: 47
Mean length: 30.004767
Min length: 15

Characters and Unicode

Total characters: 56649
Distinct characters: 337
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1855
Unique (%): 98.3%

Sample

1st row: 경기도 안산시 상록구 선진4길 29 4~6층 (사동)
2nd row: 경기도 안산시 상록구 용신로 360 1~6층 (본오동)
3rd row: 경기도 안산시 상록구 용신로 356 1~6층 (본오동)
4th row: 경기도 안산시 상록구 상록수로 22 1~6층 (본오동)
5th row: 경기도 안산시 단원구 광덕2로 186-11 (고잔동)
Value | Count | Frequency (%)
단원구 | 1006 | 8.3%
상록구 | 882 | 7.3%
1층 | 752 | 6.2%
안산시 | 443 | 3.6%
경기도 | 443 | 3.6%
고잔동 | 440 | 3.6%
본오동 | 242 | 2.0%
사동 | 188 | 1.5%
선부동 | 180 | 1.5%
일부 | 177 | 1.5%
Other values (2011) | 7404 | 60.9%

Most occurring characters

ValueCountFrequency (%)
10827
 
19.1%
1 3604
 
6.4%
2321
 
4.1%
) 1935
 
3.4%
( 1934
 
3.4%
1897
 
3.3%
, 1892
 
3.3%
1610
 
2.8%
2 1511
 
2.7%
1328
 
2.3%
Other values (327) 27790
49.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 28139 | 49.7%
Decimal Number | 11281 | 19.9%
Space Separator | 10827 | 19.1%
Close Punctuation | 1935 | 3.4%
Open Punctuation | 1934 | 3.4%
Other Punctuation | 1902 | 3.4%
Dash Punctuation | 504 | 0.9%
Uppercase Letter | 90 | 0.2%
Math Symbol | 35 | 0.1%
Lowercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2321
 
8.2%
1897
 
6.7%
1610
 
5.7%
1328
 
4.7%
1164
 
4.1%
1135
 
4.0%
1120
 
4.0%
1046
 
3.7%
908
 
3.2%
777
 
2.8%
Other values (290) 14833
52.7%
Uppercase Letter
ValueCountFrequency (%)
A 28
31.1%
B 20
22.2%
E 7
 
7.8%
T 5
 
5.6%
R 4
 
4.4%
P 4
 
4.4%
C 3
 
3.3%
O 3
 
3.3%
V 3
 
3.3%
N 3
 
3.3%
Other values (7) 10
 
11.1%
Decimal Number
ValueCountFrequency (%)
1 3604
31.9%
2 1511
13.4%
0 1269
 
11.2%
3 1055
 
9.4%
4 820
 
7.3%
5 740
 
6.6%
6 665
 
5.9%
7 637
 
5.6%
8 499
 
4.4%
9 481
 
4.3%
Other Punctuation
ValueCountFrequency (%)
, 1892
99.5%
. 5
 
0.3%
@ 3
 
0.2%
/ 2
 
0.1%
Space Separator
ValueCountFrequency (%)
10827
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1935
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1934
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 504
100.0%
Math Symbol
ValueCountFrequency (%)
~ 35
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 28418 | 50.2%
Hangul | 28139 | 49.7%
Latin | 92 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2321
 
8.2%
1897
 
6.7%
1610
 
5.7%
1328
 
4.7%
1164
 
4.1%
1135
 
4.0%
1120
 
4.0%
1046
 
3.7%
908
 
3.2%
777
 
2.8%
Other values (290) 14833
52.7%
Common
ValueCountFrequency (%)
10827
38.1%
1 3604
 
12.7%
) 1935
 
6.8%
( 1934
 
6.8%
, 1892
 
6.7%
2 1511
 
5.3%
0 1269
 
4.5%
3 1055
 
3.7%
4 820
 
2.9%
5 740
 
2.6%
Other values (9) 2831
 
10.0%
Latin
ValueCountFrequency (%)
A 28
30.4%
B 20
21.7%
E 7
 
7.6%
T 5
 
5.4%
R 4
 
4.3%
P 4
 
4.3%
C 3
 
3.3%
O 3
 
3.3%
V 3
 
3.3%
N 3
 
3.3%
Other values (8) 12
13.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 28510 | 50.3%
Hangul | 28139 | 49.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10827
38.0%
1 3604
 
12.6%
) 1935
 
6.8%
( 1934
 
6.8%
, 1892
 
6.6%
2 1511
 
5.3%
0 1269
 
4.5%
3 1055
 
3.7%
4 820
 
2.9%
5 740
 
2.6%
Other values (27) 2923
 
10.3%
Hangul
ValueCountFrequency (%)
2321
 
8.2%
1897
 
6.7%
1610
 
5.7%
1328
 
4.7%
1164
 
4.1%
1135
 
4.0%
1120
 
4.0%
1046
 
3.7%
908
 
3.2%
777
 
2.8%
Other values (290) 14833
52.7%

등급
Categorical

Distinct: 3
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 14.9 KiB
녹색: 742
황색: 701
백색: 445

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 녹색
2nd row: 녹색
3rd row: 녹색
4th row: 녹색
5th row: 녹색

Common Values

Value | Count | Frequency (%)
녹색 | 742 | 39.3%
황색 | 701 | 37.1%
백색 | 445 | 23.6%

Length

Histogram of lengths of the category


Correlations

Variable | 평가연도 | 업종 | 등급
평가연도 | 1.000 | 1.000 | 0.034
업종 | 1.000 | 1.000 | 0.367
등급 | 0.034 | 0.367 | 1.000

Variable | 평가연도 | 등급 | 업종
평가연도 | 1.000 | 0.057 | 0.998
등급 | 0.057 | 1.000 | 0.251
업종 | 0.998 | 0.251 | 1.000

Variable | 평가연도 | 업종 | 등급
평가연도 | 1.000 | 0.998 | 0.057
업종 | 0.998 | 1.000 | 0.251
등급 | 0.057 | 0.251 | 1.000
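
A coefficient near 1 between 평가연도 and 업종 is what one would see if each business type was evaluated in only one of the two years. For pairs of categorical variables a common association measure is Cramér's V; the sketch below is a plain, uncorrected implementation on a hypothetical toy sample. Which measure produced each matrix in this report, and whether a bias correction was applied, is not stated in the text, so treat the function as an illustration rather than the report's method.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    x_totals = Counter(xs)
    y_totals = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, nx in x_totals.items():
        for y, ny in y_totals.items():
            expected = nx * ny / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(x_totals), len(y_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Hypothetical toy data: every business type occurs in exactly one year.
years = ["2021"] * 4 + ["2022"] * 4
types = ["A", "A", "B", "B", "C", "C", "C", "C"]
v_dependent = cramers_v(years, types)

# Hypothetical toy data with no association at all.
v_independent = cramers_v(["a", "a", "b", "b"], ["u", "v", "u", "v"])

print(v_dependent, v_independent)
```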

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 평가연도 | 업종 | 업소명 | 소재지 | 등급
0 | 2022 | 숙박업 | 마리호텔 | 경기도 안산시 상록구 선진4길 29 4~6층 (사동) | 녹색
1 | 2022 | 숙박업 | 힙(HIP)호텔 | 경기도 안산시 상록구 용신로 360 1~6층 (본오동) | 녹색
2 | 2022 | 숙박업 | 퀸즈호텔 | 경기도 안산시 상록구 용신로 356 1~6층 (본오동) | 녹색
3 | 2022 | 숙박업 | 호텔바인 | 경기도 안산시 상록구 상록수로 22 1~6층 (본오동) | 녹색
4 | 2022 | 숙박업 | 에비뉴나인호텔 | 경기도 안산시 단원구 광덕2로 186-11 (고잔동) | 녹색
5 | 2022 | 숙박업 | 아이비모텔 | 경기도 안산시 단원구 선부광장로 91 (선부동 1071-14 ) | 녹색
6 | 2022 | 숙박업 | 모텔피아노 | 경기도 안산시 단원구 선부광장로 93 1 2 3 4 5층 (선부동 1071-6) | 녹색
7 | 2022 | 숙박업 | 호텔스튜디오9 | 경기도 안산시 단원구 선부광장로 27 501호 (선부동 1076-17) | 녹색
8 | 2022 | 숙박업 | 로자벨호텔 | 경기도 안산시 단원구 원곡공원로 1 3~10층 (원곡동) | 녹색
9 | 2022 | 숙박업 | 호텔727 | 경기도 안산시 단원구 광덕2로 58-14 1~6층 (초지동) | 녹색

Last rows

Index | 평가연도 | 업종 | 업소명 | 소재지 | 등급
1878 | 2021 | 미용업(종합) | 준미용실 | 단원구 다문화1길 60 (원곡동, 1층일부) | 백색
1879 | 2021 | 미용업(종합) | 외출준비 | 단원구 원본로1길 20-12, 1층 (원곡동) | 백색
1880 | 2021 | 미용업(종합) | 그린미용실 | 단원구 지곡로 31 (선부동, 1층 일부) | 백색
1881 | 2021 | 미용업(종합) | 지민헤어애드 | 단원구 라성로 48 (원곡동, 846-9 보성상가) | 백색
1882 | 2021 | 미용업(종합) | 미아뜨펌 | 상록구 호동로 63, 1층 (일동) | 백색
1883 | 2021 | 미용업(종합) | 빛나는헤어 | 단원구 선부광장서로 69, 1층일부 (선부동) | 백색
1884 | 2021 | 미용업(종합) | 우리들의 | 단원구 산단로 326, 유통상가 지하상가동 사-5호 (원곡동, 994-5) | 백색
1885 | 2021 | 미용업(종합) | 미스터남성컷트 | 상록구 본오로 100, 1층 (본오동) | 백색
1886 | 2021 | 미용업(종합) | 보미미용실 | 단원구 라성안길 7, 군자상가2동 151호 (원곡동) | 백색
1887 | 2021 | 미용업(종합) | 블랙베레미용실 | 상록구 안산대학로 123, 1층 (일동) | 백색

Duplicate rows

Most frequently occurring

Index | 평가연도 | 업종 | 업소명 | 소재지 | 등급 | # duplicates
0 | 2021 | 미용업(종합) | 그녀는예뻤다 | 상록구 건건로 122, 2층 206호 (건건동) | 황색 | 2
1 | 2021 | 미용업(종합) | 더(the) 끌림 | 단원구 초지로 114, 주공프라자 102호 (초지동, 726-5) | 황색 | 2
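
Rows count as duplicates here when all five fields match. A minimal sketch of that check, using `collections.Counter` over row tuples: the three rows below are copied from this report (one of them repeated on purpose), and the variable names are introduced only for illustration.

```python
from collections import Counter

# Rows as (평가연도, 업종, 업소명, 소재지, 등급) tuples; the first two are identical.
rows = [
    ("2021", "미용업(종합)", "그녀는예뻤다", "상록구 건건로 122, 2층 206호 (건건동)", "황색"),
    ("2021", "미용업(종합)", "그녀는예뻤다", "상록구 건건로 122, 2층 206호 (건건동)", "황색"),
    ("2021", "미용업(종합)", "준미용실", "단원구 다문화1길 60 (원곡동, 1층일부)", "백색"),
]

# Count identical tuples and keep those seen more than once.
row_counts = Counter(rows)
duplicates = {row: n for row, n in row_counts.items() if n > 1}

for row, n in duplicates.items():
    print(" | ".join(row), "|", n)
```

Before analysing grades per business, it may be worth dropping such repeats so that the same evaluation is not counted twice.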