Overview

Dataset statistics

Number of variables: 4
Number of observations: 2235
Missing cells: 57
Missing cells (%): 0.6%
Duplicate rows: 1
Duplicate rows (%): < 0.1%
Total size in memory: 70.0 KiB
Average record size in memory: 32.1 B
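
The percentages and the average record size follow from the raw counts by simple arithmetic. The sketch below reproduces them from the figures listed in this block; that the profiler derives them exactly this way is an assumption, although the results agree with the reported values after rounding.

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 4
n_observations = 2235
missing_cells = 57
duplicate_rows = 1
total_size_kib = 70.0

# Every observation has one cell per variable.
total_cells = n_variables * n_observations

# Share of empty cells and of duplicate rows, in percent.
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

# Average bytes per record: total size (KiB -> bytes) over the row count.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.2f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

The duplicate share comes out well below 0.1%, which is why the report prints it as "< 0.1%" rather than as a rounded number.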

Variable types

Text: 3
Categorical: 1

Dataset

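Description: Status of facilities subject to mandatory disinfection within Ansan-si, Gyeonggi-do. Provides list information including facility type (시설구분), facility name (시설명), road-name address (소재지도로명주소), and data reference date (데이터기준일자).
Author: Ansan-si, Gyeonggi-do (경기도 안산시)
URL: https://www.data.go.kr/data/15036785/fileData.do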

Alerts

데이터기준일자 has constant value "2023-10-05" (Constant)
Dataset has 1 (< 0.1%) duplicate rows (Duplicates)
시설명 has 56 (2.5%) missing values (Missing)
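
The Constant and Missing alerts can be reproduced with a simple per-column check. This is a minimal sketch, not the profiler's own implementation: the function name `column_alerts` and the three-row sample are hypothetical, and treating an empty string as missing is an assumption.

```python
def column_alerts(rows, columns):
    """Return {column: [alert, ...]} for columns with missing or constant values."""
    alerts = {}
    for col in columns:
        values = [row.get(col) for row in rows]
        # Assumption: None and the empty string both count as missing.
        present = [v for v in values if v not in (None, "")]
        found = []
        if len(present) < len(values):
            found.append(f"Missing ({len(values) - len(present)})")
        # A column is constant when all present values are identical.
        if present and len(set(present)) == 1:
            found.append("Constant")
        if found:
            alerts[col] = found
    return alerts


# Hypothetical three-row sample shaped like the dataset (not the real data).
rows = [
    {"시설명": "해미청", "데이터기준일자": "2023-10-05"},
    {"시설명": None, "데이터기준일자": "2023-10-05"},
    {"시설명": "연지프라자", "데이터기준일자": "2023-10-05"},
]
result = column_alerts(rows, ["시설명", "데이터기준일자"])
print(result)
```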

Reproduction

Analysis started: 2023-12-11 23:24:47.395910
Analysis finished: 2023-12-11 23:24:48.097026
Duration: 0.7 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시설구분
Text

Distinct: 55
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 17.6 KiB

Length

Max length: 35
Median length: 13
Mean length: 11.09396
Min length: 2
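
Length statistics of this kind can be computed by measuring each value in Unicode code points. The sketch below uses only the five sample rows shown for this variable, so its median and mean differ from the full-column figures; normalizing to NFC first is an assumption made so that each Hangul syllable counts as one character.

```python
import unicodedata
from statistics import mean, median

# The five sample rows of 시설구분 shown in this report.
sample = [
    "식품접객업소",
    "연면적 2,000m2 이상의 사무실용 건축물 및 복합용도 건축물",
    "학교",
    "집단급식소",
    "연면적 2,000m2 이상의 사무실용 건축물 및 복합용도 건축물",
]

# Assumption: count precomposed (NFC) code points, one per Hangul syllable.
lengths = [len(unicodedata.normalize("NFC", value)) for value in sample]

stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(stats)
```

The longest sample value has 35 characters and the shortest has 2, matching the reported maximum and minimum for the whole column.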

Characters and Unicode

Total characters: 24795
Distinct characters: 117
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
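
As an illustration of these character properties, the general category of each code point can be read from Python's `unicodedata` module. This is a sketch under stated assumptions: the `CATEGORY_NAMES` mapping and the `category_counts` helper are hypothetical names, and only the categories that occur in this report are mapped to long names.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter General_Category codes that occur in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Ll": "Lowercase Letter",
    "Nd": "Decimal Number", "Nl": "Letter Number", "Zs": "Space Separator",
    "Po": "Other Punctuation", "Ps": "Open Punctuation",
    "Pe": "Close Punctuation", "Pd": "Dash Punctuation",
    "Sm": "Math Symbol", "So": "Other Symbol", "Cc": "Control",
}


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in unicodedata.normalize("NFC", value):
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unmapped categories.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


value = "연면적 2,000m2 이상의 사무실용 건축물 및 복합용도 건축물"
counts = category_counts([value])
print(counts.most_common())
```

The standard library exposes the general category but not the script or block of a character, so the script and block tables would need another data source.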

Unique

Unique: 4
Unique (%): 0.2%
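
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts only the values that occur exactly once. A minimal sketch of that distinction, using the five sample rows of this variable and a hypothetical helper `distinct_and_unique`:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# The five sample rows of 시설구분 shown in this report.
sample = [
    "식품접객업소",
    "연면적 2,000m2 이상의 사무실용 건축물 및 복합용도 건축물",
    "학교",
    "집단급식소",
    "연면적 2,000m2 이상의 사무실용 건축물 및 복합용도 건축물",
]
distinct, unique = distinct_and_unique(sample)
print(distinct, unique)
```

In the sample, the long office-building category appears twice, so it is distinct but not unique.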

Sample

1st row: 식품접객업소
2nd row: 연면적 2,000m2 이상의 사무실용 건축물 및 복합용도 건축물
3rd row: 학교
4th row: 집단급식소
5th row: 연면적 2,000m2 이상의 사무실용 건축물 및 복합용도 건축물
Value | Count | Frequency (%)
건축물 | 860 | 16.3%
연면적 | 430 | 8.2%
2,000m2 | 430 | 8.2%
이상의 | 430 | 8.2%
사무실용 | 430 | 8.2%
및 | 430 | 8.2%
복합용도 | 430 | 8.2%
집단급식소 | 412 | 7.8%
어린이집 | 133 | 2.5%
공동주택 | 129 | 2.4%
Other values (53) | 1161 | 22.0%
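
The word counts above are consistent with splitting each value on whitespace and counting the resulting tokens. The sketch below assumes that tokenization (the profiler's exact rules may differ) and shows why 건축물 is counted twice per value, giving 860 against 430 for the other words of the same category string.

```python
from collections import Counter


def word_counts(values):
    """Count whitespace-separated tokens across all values."""
    counts = Counter()
    for value in values:
        counts.update(value.split())
    return counts


counts = word_counts([
    "연면적 2,000m2 이상의 사무실용 건축물 및 복합용도 건축물",
    "집단급식소",
])
print(counts.most_common(3))
```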

Most occurring characters

ValueCountFrequency (%)
3038
 
12.3%
0 1290
 
5.2%
2 975
 
3.9%
860
 
3.5%
860
 
3.5%
860
 
3.5%
860
 
3.5%
664
 
2.7%
563
 
2.3%
552
 
2.2%
Other values (107) 14273
57.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 18280 | 73.7%
Space Separator | 3038 | 12.3%
Decimal Number | 2365 | 9.5%
Other Punctuation | 506 | 2.0%
Lowercase Letter | 430 | 1.7%
Close Punctuation | 87 | 0.4%
Open Punctuation | 87 | 0.4%
Control | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
860
 
4.7%
860
 
4.7%
860
 
4.7%
860
 
4.7%
664
 
3.6%
563
 
3.1%
552
 
3.0%
540
 
3.0%
510
 
2.8%
508
 
2.8%
Other values (97) 11503
62.9%
Decimal Number
ValueCountFrequency (%)
0 1290
54.5%
2 975
41.2%
1 100
 
4.2%
Other Punctuation
ValueCountFrequency (%)
, 434
85.8%
· 72
 
14.2%
Space Separator
ValueCountFrequency (%)
3038
100.0%
Lowercase Letter
ValueCountFrequency (%)
m 430
100.0%
Close Punctuation
ValueCountFrequency (%)
) 87
100.0%
Open Punctuation
ValueCountFrequency (%)
( 87
100.0%
Control
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 18280 | 73.7%
Common | 6085 | 24.5%
Latin | 430 | 1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
860
 
4.7%
860
 
4.7%
860
 
4.7%
860
 
4.7%
664
 
3.6%
563
 
3.1%
552
 
3.0%
540
 
3.0%
510
 
2.8%
508
 
2.8%
Other values (97) 11503
62.9%
Common
ValueCountFrequency (%)
3038
49.9%
0 1290
21.2%
2 975
 
16.0%
, 434
 
7.1%
1 100
 
1.6%
) 87
 
1.4%
( 87
 
1.4%
· 72
 
1.2%
2
 
< 0.1%
Latin
ValueCountFrequency (%)
m 430
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 18280 | 73.7%
ASCII | 6443 | 26.0%
None | 72 | 0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3038
47.2%
0 1290
20.0%
2 975
 
15.1%
, 434
 
6.7%
m 430
 
6.7%
1 100
 
1.6%
) 87
 
1.4%
( 87
 
1.4%
2
 
< 0.1%
Hangul
ValueCountFrequency (%)
860
 
4.7%
860
 
4.7%
860
 
4.7%
860
 
4.7%
664
 
3.6%
563
 
3.1%
552
 
3.0%
540
 
3.0%
510
 
2.8%
508
 
2.8%
Other values (97) 11503
62.9%
None
ValueCountFrequency (%)
· 72
100.0%

시설명
Text

MISSING 

Distinct: 1908
Distinct (%): 87.6%
Missing: 56
Missing (%): 2.5%
Memory size: 17.6 KiB

Length

Max length: 31
Median length: 27
Mean length: 7.8609454
Min length: 1

Characters and Unicode

Total characters: 17129
Distinct characters: 611
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 5

Unique

Unique: 1680
Unique (%): 77.1%

Sample

1st row: 해미청
2nd row: 연지프라자
3rd row: 안산디자인문화고등학교
4th row: 안산디자인문화고등학교
5th row: 안산여자정보산업고등학교 A동(안산디자인문화고등학교)
Value | Count | Frequency (%)
한양대학교안산캠퍼스 | 22 | 0.8%
그랑시티자이 | 15 | 0.6%
교사동 | 14 | 0.5%
복합용도건축물 | 14 | 0.5%
사무실용건축물 | 12 | 0.5%
안산점 | 12 | 0.5%
오피스텔 | 10 | 0.4%
안산대학교 | 10 | 0.4%
주식회사 | 9 | 0.3%
안산고등학교 | 8 | 0.3%
Other values (2027) | 2510 | 95.2%

Most occurring characters

ValueCountFrequency (%)
525
 
3.1%
422
 
2.5%
417
 
2.4%
397
 
2.3%
396
 
2.3%
377
 
2.2%
( 374
 
2.2%
) 373
 
2.2%
343
 
2.0%
293
 
1.7%
Other values (601) 13212
77.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 15086 | 88.1%
Space Separator | 525 | 3.1%
Open Punctuation | 377 | 2.2%
Close Punctuation | 376 | 2.2%
Decimal Number | 322 | 1.9%
Uppercase Letter | 300 | 1.8%
Lowercase Letter | 62 | 0.4%
Control | 29 | 0.2%
Other Symbol | 21 | 0.1%
Other Punctuation | 19 | 0.1%
Other values (2) | 12 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
422
 
2.8%
417
 
2.8%
397
 
2.6%
396
 
2.6%
377
 
2.5%
343
 
2.3%
293
 
1.9%
270
 
1.8%
251
 
1.7%
247
 
1.6%
Other values (532) 11673
77.4%
Uppercase Letter
ValueCountFrequency (%)
A 42
14.0%
E 22
 
7.3%
I 21
 
7.0%
S 21
 
7.0%
R 18
 
6.0%
T 17
 
5.7%
B 16
 
5.3%
C 16
 
5.3%
G 14
 
4.7%
L 12
 
4.0%
Other values (15) 101
33.7%
Lowercase Letter
ValueCountFrequency (%)
s 10
16.1%
a 9
14.5%
e 9
14.5%
l 6
9.7%
i 5
8.1%
n 4
 
6.5%
y 3
 
4.8%
t 3
 
4.8%
k 2
 
3.2%
m 2
 
3.2%
Other values (8) 9
14.5%
Decimal Number
ValueCountFrequency (%)
2 105
32.6%
1 91
28.3%
3 24
 
7.5%
5 19
 
5.9%
4 17
 
5.3%
7 17
 
5.3%
9 16
 
5.0%
6 16
 
5.0%
8 10
 
3.1%
0 7
 
2.2%
Other Punctuation
ValueCountFrequency (%)
. 8
42.1%
& 5
26.3%
/ 4
21.1%
, 1
 
5.3%
; 1
 
5.3%
Open Punctuation
ValueCountFrequency (%)
( 374
99.2%
[ 3
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 373
99.2%
] 3
 
0.8%
Other Symbol
ValueCountFrequency (%)
20
95.2%
1
 
4.8%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
525
100.0%
Control
ValueCountFrequency (%)
29
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 15106 | 88.2%
Common | 1658 | 9.7%
Latin | 365 | 2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
422
 
2.8%
417
 
2.8%
397
 
2.6%
396
 
2.6%
377
 
2.5%
343
 
2.3%
293
 
1.9%
270
 
1.8%
251
 
1.7%
247
 
1.6%
Other values (533) 11693
77.4%
Latin
ValueCountFrequency (%)
A 42
 
11.5%
E 22
 
6.0%
I 21
 
5.8%
S 21
 
5.8%
R 18
 
4.9%
T 17
 
4.7%
B 16
 
4.4%
C 16
 
4.4%
G 14
 
3.8%
L 12
 
3.3%
Other values (35) 166
45.5%
Common
ValueCountFrequency (%)
525
31.7%
( 374
22.6%
) 373
22.5%
2 105
 
6.3%
1 91
 
5.5%
29
 
1.7%
3 24
 
1.4%
5 19
 
1.1%
4 17
 
1.0%
7 17
 
1.0%
Other values (13) 84
 
5.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 15086 | 88.1%
ASCII | 2019 | 11.8%
None | 20 | 0.1%
Number Forms | 3 | < 0.1%
Misc Symbols | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
525
26.0%
( 374
18.5%
) 373
18.5%
2 105
 
5.2%
1 91
 
4.5%
A 42
 
2.1%
29
 
1.4%
3 24
 
1.2%
E 22
 
1.1%
I 21
 
1.0%
Other values (55) 413
20.5%
Hangul
ValueCountFrequency (%)
422
 
2.8%
417
 
2.8%
397
 
2.6%
396
 
2.6%
377
 
2.5%
343
 
2.3%
293
 
1.9%
270
 
1.8%
251
 
1.7%
247
 
1.6%
Other values (532) 11673
77.4%
None
ValueCountFrequency (%)
20
100.0%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%
Misc Symbols
ValueCountFrequency (%)
1
100.0%

소재지도로명주소
Text

Distinct: 2005
Distinct (%): 89.7%
Missing: 1
Missing (%): < 0.1%
Memory size: 17.6 KiB

Length

Max length: 74
Median length: 58
Mean length: 25.017457
Min length: 2

Characters and Unicode

Total characters: 55889
Distinct characters: 377
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 3

Unique

Unique: 1857
Unique (%): 83.1%

Sample

1st row: 경기도 안산시 상록구 각골로 47, 1층 (본오동)
2nd row: 경기도 안산시 상록구 각골로 55 (본오동)
3rd row: 경기도 안산시 상록구 각골로 87
4th row: 경기도 안산시 상록구 각골로 87 (본오동)
5th row: 경기도 안산시 상록구 각골로 87 (본오동)
Value | Count | Frequency (%)
안산시 | 1568 | 12.3%
경기도 | 1542 | 12.1%
단원구 | 1286 | 10.1%
상록구 | 949 | 7.5%
본오동 | 213 | 1.7%
고잔동 | 197 | 1.5%
사동 | 196 | 1.5%
1층 | 92 | 0.7%
이동 | 90 | 0.7%
성곡동 | 83 | 0.7%
Other values (1770) | 6495 | 51.1%

Most occurring characters

ValueCountFrequency (%)
10691
 
19.1%
2265
 
4.1%
1 2129
 
3.8%
1976
 
3.5%
1813
 
3.2%
1786
 
3.2%
1738
 
3.1%
1656
 
3.0%
1644
 
2.9%
1586
 
2.8%
Other values (367) 28605
51.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 31642 | 56.6%
Space Separator | 10691 | 19.1%
Decimal Number | 9025 | 16.1%
Close Punctuation | 1507 | 2.7%
Open Punctuation | 1506 | 2.7%
Other Punctuation | 914 | 1.6%
Dash Punctuation | 376 | 0.7%
Uppercase Letter | 114 | 0.2%
Math Symbol | 72 | 0.1%
Lowercase Letter | 38 | 0.1%
Other values (2) | 4 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2265
 
7.2%
1976
 
6.2%
1813
 
5.7%
1786
 
5.6%
1738
 
5.5%
1656
 
5.2%
1644
 
5.2%
1586
 
5.0%
1563
 
4.9%
1556
 
4.9%
Other values (312) 14059
44.4%
Uppercase Letter
ValueCountFrequency (%)
B 26
22.8%
A 23
20.2%
C 9
 
7.9%
M 7
 
6.1%
E 7
 
6.1%
O 6
 
5.3%
G 5
 
4.4%
T 4
 
3.5%
D 4
 
3.5%
L 4
 
3.5%
Other values (9) 19
16.7%
Lowercase Letter
ValueCountFrequency (%)
e 12
31.6%
t 4
 
10.5%
s 4
 
10.5%
y 2
 
5.3%
a 2
 
5.3%
o 2
 
5.3%
j 2
 
5.3%
r 2
 
5.3%
v 2
 
5.3%
i 2
 
5.3%
Other values (2) 4
 
10.5%
Decimal Number
ValueCountFrequency (%)
1 2129
23.6%
2 1191
13.2%
5 914
10.1%
3 905
10.0%
4 851
 
9.4%
0 742
 
8.2%
6 631
 
7.0%
7 621
 
6.9%
8 545
 
6.0%
9 496
 
5.5%
Other Punctuation
ValueCountFrequency (%)
, 901
98.6%
. 7
 
0.8%
/ 3
 
0.3%
& 2
 
0.2%
* 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1500
99.5%
] 7
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 1499
99.5%
[ 7
 
0.5%
Space Separator
ValueCountFrequency (%)
10691
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 376
100.0%
Math Symbol
ValueCountFrequency (%)
~ 72
100.0%
Control
ValueCountFrequency (%)
3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 31642 | 56.6%
Common | 24094 | 43.1%
Latin | 153 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2265
 
7.2%
1976
 
6.2%
1813
 
5.7%
1786
 
5.6%
1738
 
5.5%
1656
 
5.2%
1644
 
5.2%
1586
 
5.0%
1563
 
4.9%
1556
 
4.9%
Other values (312) 14059
44.4%
Latin
ValueCountFrequency (%)
B 26
17.0%
A 23
15.0%
e 12
 
7.8%
C 9
 
5.9%
M 7
 
4.6%
E 7
 
4.6%
O 6
 
3.9%
G 5
 
3.3%
T 4
 
2.6%
t 4
 
2.6%
Other values (22) 50
32.7%
Common
ValueCountFrequency (%)
10691
44.4%
1 2129
 
8.8%
) 1500
 
6.2%
( 1499
 
6.2%
2 1191
 
4.9%
5 914
 
3.8%
3 905
 
3.8%
, 901
 
3.7%
4 851
 
3.5%
0 742
 
3.1%
Other values (13) 2771
 
11.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 31642 | 56.6%
ASCII | 24246 | 43.4%
Number Forms | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10691
44.1%
1 2129
 
8.8%
) 1500
 
6.2%
( 1499
 
6.2%
2 1191
 
4.9%
5 914
 
3.8%
3 905
 
3.7%
, 901
 
3.7%
4 851
 
3.5%
0 742
 
3.1%
Other values (44) 2923
 
12.1%
Hangul
ValueCountFrequency (%)
2265
 
7.2%
1976
 
6.2%
1813
 
5.7%
1786
 
5.6%
1738
 
5.5%
1656
 
5.2%
1644
 
5.2%
1586
 
5.0%
1563
 
4.9%
1556
 
4.9%
Other values (312) 14059
44.4%
Number Forms
ValueCountFrequency (%)
1
100.0%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 17.6 KiB
2023-10-05
2235 

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-10-05
2nd row: 2023-10-05
3rd row: 2023-10-05
4th row: 2023-10-05
5th row: 2023-10-05

Common Values

Value | Count | Frequency (%)
2023-10-05 | 2235 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-10-05 | 2235 | 100.0%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
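
Nullity correlation can be read as the Pearson correlation between the missing-value indicators of two columns. The sketch below is one way to compute it, not necessarily how the heatmap was produced; the helper name `nullity_correlation` and the four-row columns are hypothetical and are not taken from the dataset.

```python
from math import sqrt


def nullity_correlation(col_a, col_b):
    """Pearson correlation of the missing-value indicators of two columns."""
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    # Undefined when a column is never missing or always missing.
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)


# Hypothetical columns: names missing in rows 1 and 3, addresses in row 1 only.
names = ["a", None, "b", None]
addresses = ["x", None, "y", "z"]
r = nullity_correlation(names, addresses)
complete = nullity_correlation(names, ["x", "y", "z", "w"])
print(r, complete)
```

A column with no missing values, such as 시설구분 or 데이터기준일자 here, has no variance in its indicator, so its correlation with any other column is undefined.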

Sample

First rows

Index | 시설구분 | 시설명 | 소재지도로명주소 | 데이터기준일자
0 | 식품접객업소 | 해미청 | 경기도 안산시 상록구 각골로 47, 1층 (본오동) | 2023-10-05
1 | 연면적 2,000m2 이상의 사무실용 건축물 및 복합용도 건축물 | 연지프라자 | 경기도 안산시 상록구 각골로 55 (본오동) | 2023-10-05
2 | 학교 | 안산디자인문화고등학교 | 경기도 안산시 상록구 각골로 87 | 2023-10-05
3 | 집단급식소 | 안산디자인문화고등학교 | 경기도 안산시 상록구 각골로 87 (본오동) | 2023-10-05
4 | 연면적 2,000m2 이상의 사무실용 건축물 및 복합용도 건축물 | 안산여자정보산업고등학교 A동(안산디자인문화고등학교) | 경기도 안산시 상록구 각골로 87 (본오동) | 2023-10-05
5 | 연면적 2,000m2 이상의 사무실용 건축물 및 복합용도 건축물 | 안산여자정보산업고등학교 D동(안산디자인문화고등학교) | 경기도 안산시 상록구 각골로 87 (본오동) | 2023-10-05
6 | 어린이집 | 신에덴어린이집 | 경기도 안산시 상록구 각골로1안길 40 (본오동) | 2023-10-05
7 | 어린이집 | 용신어린이집 | 경기도 안산시 상록구 각골로2안길 38 (본오동) | 2023-10-05
8 | 학교 | 상록중학교 | 경기도 안산시 상록구 각골로4길 29 | 2023-10-05
9 | 연면적 2,000m2 이상의 사무실용 건축물 및 복합용도 건축물 | 상록중학교 상록중학교 | 경기도 안산시 상록구 각골로4길 29 (본오동) | 2023-10-05

Last rows

Index | 시설구분 | 시설명 | 소재지도로명주소 | 데이터기준일자
2225 | 판매시설 | 서울프라자 | 단원구 삼일로 310 | 2023-10-05
2226 | 판매시설 | 보성상가 | 단원구 라성로 48 | 2023-10-05
2227 | 판매시설 | 고잔프라자 | 단원구 광덕4로 108 | 2023-10-05
2228 | 판매시설 | 원곡동 808-3 판매시설 (삼흥제일산업(주) | 단원구 백성길38 | 2023-10-05
2229 | 판매시설 | 스타맥스타워2 | 단원구 광덕대로 194 | 2023-10-05
2230 | 판매시설 | 밀레니엄프라자 | 단원구 광덕4로 220 | 2023-10-05
2231 | 판매시설 | 부일프라자 | 단원구 광덕4로 96 | 2023-10-05
2232 | 판매시설 | 대덕프라자 | 단원구 광덕4로 116 | 2023-10-05
2233 | 판매시설 | 씨티프라자 | 단원구 광덕4로 250 | 2023-10-05
2234 | 판매시설 | 유통상가 | 단원구 산단로 326 | 2023-10-05

Duplicate rows

Most frequently occurring

Index | 시설구분 | 시설명 | 소재지도로명주소 | 데이터기준일자 | # duplicates
0 | 식품접객업소 | 게스트하우스컨벤션 | 경기도 안산시 상록구 한양대학로 55, 지하1층 (사동) | 2023-10-05 | 2
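
Duplicate rows can be found by counting identical tuples of column values. In this minimal sketch the helper `duplicate_rows` is a hypothetical name; the first two rows repeat the duplicated record listed above, and the third is the first row of the sample. A row that occurs twice is reported with a count of 2 but contributes one duplicate row to the dataset total.

```python
from collections import Counter


def duplicate_rows(rows):
    """Return {row: occurrences} for every row that occurs more than once."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}


dup = ("식품접객업소", "게스트하우스컨벤션",
       "경기도 안산시 상록구 한양대학로 55, 지하1층 (사동)", "2023-10-05")
other = ("식품접객업소", "해미청",
         "경기도 안산시 상록구 각골로 47, 1층 (본오동)", "2023-10-05")

dupes = duplicate_rows([dup, dup, other])

# Rows beyond the first occurrence of each repeated record.
extra_rows = sum(n - 1 for n in dupes.values())
print(dupes, extra_rows)
```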