Overview

Dataset statistics

Number of variables5
Number of observations3452
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)0.1%
Total size in memory135.0 KiB
Average record size in memory40.0 B

Variable types

Categorical2
Text2
DateTime1

Dataset

Description경기도 용인시 소독의무대상시설 현황입니다.(시군명, 시설구분, 업소명, 소재지도로명주소, 데이터기준일자)
Author경기도 용인시
URLhttps://www.data.go.kr/data/15112507/fileData.do

Alerts

시군명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
Dataset has 2 (0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-14 20:16:38.146720
Analysis finished2024-03-14 20:16:39.751439
Duration1.6 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size27.1 KiB
용인시
3452 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row용인시
2nd row용인시
3rd row용인시
4th row용인시
5th row용인시

Common Values

ValueCountFrequency (%)
용인시 3452
100.0%

Length

2024-03-15T05:16:39.972132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T05:16:40.287951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
용인시 3452
100.0%

시설구분
Categorical

Distinct46
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size27.1 KiB
11호(연면적2000㎡이상 사무실및복합건축물)
557 
2호(식품접객업소)
407 
6호(집단급식소)
375 
11호(건축물)
330 
11호
324 
Other values (41)
1459 

Length

Max length25
Median length9
Mean length9.7540556
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row1호(숙박업소)
2nd row1호(숙박업소)
3rd row1호(숙박업소)
4th row1호(숙박업소)
5th row1호(숙박업소)

Common Values

ValueCountFrequency (%)
11호(연면적2000㎡이상 사무실및복합건축물) 557
16.1%
2호(식품접객업소) 407
11.8%
6호(집단급식소) 375
10.9%
11호(건축물) 330
9.6%
11호 324
9.4%
13호(공동주택) 221
 
6.4%
13호 142
 
4.1%
9호(학교) 133
 
3.9%
12호(어린이집) 130
 
3.8%
2호 119
 
3.4%
Other values (36) 714
20.7%

Length

2024-03-15T05:16:40.640155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
11호(연면적2000㎡이상 557
13.9%
사무실및복합건축물 557
13.9%
2호(식품접객업소 407
10.1%
6호(집단급식소 375
9.3%
11호(건축물 330
 
8.2%
11호 324
 
8.1%
13호(공동주택 221
 
5.5%
13호 142
 
3.5%
9호(학교 133
 
3.3%
12호(어린이집 130
 
3.2%
Other values (38) 844
21.0%
Distinct3026
Distinct (%)87.7%
Missing0
Missing (%)0.0%
Memory size27.1 KiB
2024-03-15T05:16:41.642540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length36
Mean length8.6975666
Min length1

Characters and Unicode

Total characters30024
Distinct characters738
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2652 ?
Unique (%)76.8%

Sample

1st row(주)더숨디앤씨
2nd row(주)더트리니호텔
3rd rowG7
4th rowNO25 용인터미널점
5th rowQ(큐)호텔
ValueCountFrequency (%)
주식회사 36
 
0.8%
용인 26
 
0.6%
스타벅스 15
 
0.3%
호텔 11
 
0.2%
오피스텔 11
 
0.2%
어린이집 10
 
0.2%
동백 10
 
0.2%
용인공장 9
 
0.2%
용인점 9
 
0.2%
9
 
0.2%
Other values (3586) 4413
96.8%
2024-03-15T05:16:43.252925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1162
 
3.9%
687
 
2.3%
653
 
2.2%
551
 
1.8%
( 543
 
1.8%
) 542
 
1.8%
522
 
1.7%
514
 
1.7%
491
 
1.6%
488
 
1.6%
Other values (728) 23871
79.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25703
85.6%
Space Separator 1162
 
3.9%
Uppercase Letter 890
 
3.0%
Open Punctuation 543
 
1.8%
Close Punctuation 542
 
1.8%
Lowercase Letter 484
 
1.6%
Decimal Number 402
 
1.3%
Other Punctuation 150
 
0.5%
Other Symbol 100
 
0.3%
Dash Punctuation 22
 
0.1%
Other values (3) 26
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
687
 
2.7%
653
 
2.5%
551
 
2.1%
522
 
2.0%
514
 
2.0%
491
 
1.9%
488
 
1.9%
398
 
1.5%
380
 
1.5%
360
 
1.4%
Other values (649) 20659
80.4%
Uppercase Letter
ValueCountFrequency (%)
C 110
 
12.4%
A 77
 
8.7%
T 72
 
8.1%
S 60
 
6.7%
E 56
 
6.3%
D 53
 
6.0%
R 43
 
4.8%
I 39
 
4.4%
L 38
 
4.3%
G 36
 
4.0%
Other values (16) 306
34.4%
Lowercase Letter
ValueCountFrequency (%)
e 77
15.9%
a 61
12.6%
t 42
8.7%
r 38
7.9%
o 37
7.6%
i 35
7.2%
c 34
 
7.0%
n 28
 
5.8%
l 25
 
5.2%
s 21
 
4.3%
Other values (14) 86
17.8%
Decimal Number
ValueCountFrequency (%)
1 121
30.1%
2 96
23.9%
3 46
 
11.4%
5 38
 
9.5%
4 29
 
7.2%
0 22
 
5.5%
7 21
 
5.2%
6 11
 
2.7%
9 10
 
2.5%
8 8
 
2.0%
Other Punctuation
ValueCountFrequency (%)
, 70
46.7%
/ 36
24.0%
& 17
 
11.3%
. 16
 
10.7%
· 7
 
4.7%
' 3
 
2.0%
# 1
 
0.7%
Letter Number
ValueCountFrequency (%)
5
55.6%
3
33.3%
1
 
11.1%
Math Symbol
ValueCountFrequency (%)
~ 4
50.0%
> 3
37.5%
= 1
 
12.5%
Space Separator
ValueCountFrequency (%)
1162
100.0%
Open Punctuation
ValueCountFrequency (%)
( 543
100.0%
Close Punctuation
ValueCountFrequency (%)
) 542
100.0%
Other Symbol
ValueCountFrequency (%)
100
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25797
85.9%
Common 2838
 
9.5%
Latin 1383
 
4.6%
Han 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
687
 
2.7%
653
 
2.5%
551
 
2.1%
522
 
2.0%
514
 
2.0%
491
 
1.9%
488
 
1.9%
398
 
1.5%
380
 
1.5%
360
 
1.4%
Other values (644) 20753
80.4%
Latin
ValueCountFrequency (%)
C 110
 
8.0%
e 77
 
5.6%
A 77
 
5.6%
T 72
 
5.2%
a 61
 
4.4%
S 60
 
4.3%
E 56
 
4.0%
D 53
 
3.8%
R 43
 
3.1%
t 42
 
3.0%
Other values (43) 732
52.9%
Common
ValueCountFrequency (%)
1162
40.9%
( 543
19.1%
) 542
19.1%
1 121
 
4.3%
2 96
 
3.4%
, 70
 
2.5%
3 46
 
1.6%
5 38
 
1.3%
/ 36
 
1.3%
4 29
 
1.0%
Other values (15) 155
 
5.5%
Han
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25697
85.6%
ASCII 4205
 
14.0%
None 107
 
0.4%
Number Forms 9
 
< 0.1%
CJK 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1162
27.6%
( 543
12.9%
) 542
12.9%
1 121
 
2.9%
C 110
 
2.6%
2 96
 
2.3%
e 77
 
1.8%
A 77
 
1.8%
T 72
 
1.7%
, 70
 
1.7%
Other values (64) 1335
31.7%
Hangul
ValueCountFrequency (%)
687
 
2.7%
653
 
2.5%
551
 
2.1%
522
 
2.0%
514
 
2.0%
491
 
1.9%
488
 
1.9%
398
 
1.5%
380
 
1.5%
360
 
1.4%
Other values (643) 20653
80.4%
None
ValueCountFrequency (%)
100
93.5%
· 7
 
6.5%
Number Forms
ValueCountFrequency (%)
5
55.6%
3
33.3%
1
 
11.1%
CJK
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Distinct3267
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size27.1 KiB
2024-03-15T05:16:44.737869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length74
Median length59
Mean length28.826477
Min length16

Characters and Unicode

Total characters99509
Distinct characters436
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3136 ?
Unique (%)90.8%

Sample

1st row경기도 용인시 처인구 포곡읍 성산로 633, A,B,C,D,E동
2nd row경기도 용인시 처인구 중부대로 1218, THE TRINY HOTEL (역북동)
3rd row경기도 용인시 처인구 포곡읍 전대로78번길 14-6
4th row경기도 용인시 처인구 금령로117번길 17 (김량장동)
5th row경기도 용인시 처인구 포곡읍 전대로110번길 10-3
ValueCountFrequency (%)
경기도 3452
 
15.9%
용인시 3451
 
15.9%
처인구 1369
 
6.3%
기흥구 1216
 
5.6%
수지구 864
 
4.0%
포곡읍 220
 
1.0%
1층 207
 
1.0%
이동읍 192
 
0.9%
양지면 177
 
0.8%
죽전동 143
 
0.7%
Other values (3526) 10412
48.0%
2024-03-15T05:16:46.481413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18366
 
18.5%
4908
 
4.9%
4883
 
4.9%
1 4080
 
4.1%
3732
 
3.8%
3626
 
3.6%
3507
 
3.5%
3476
 
3.5%
3464
 
3.5%
3316
 
3.3%
Other values (426) 46151
46.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 58596
58.9%
Space Separator 18366
 
18.5%
Decimal Number 16434
 
16.5%
Open Punctuation 1703
 
1.7%
Close Punctuation 1700
 
1.7%
Dash Punctuation 1170
 
1.2%
Other Punctuation 1145
 
1.2%
Uppercase Letter 249
 
0.3%
Math Symbol 106
 
0.1%
Lowercase Letter 35
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4908
 
8.4%
4883
 
8.3%
3732
 
6.4%
3626
 
6.2%
3507
 
6.0%
3476
 
5.9%
3464
 
5.9%
3316
 
5.7%
2428
 
4.1%
1604
 
2.7%
Other values (365) 23652
40.4%
Uppercase Letter
ValueCountFrequency (%)
B 41
16.5%
A 39
15.7%
C 24
9.6%
E 17
 
6.8%
I 17
 
6.8%
T 17
 
6.8%
K 12
 
4.8%
R 10
 
4.0%
D 10
 
4.0%
H 9
 
3.6%
Other values (15) 53
21.3%
Lowercase Letter
ValueCountFrequency (%)
e 6
17.1%
a 5
14.3%
r 5
14.3%
o 4
11.4%
c 3
8.6%
t 2
 
5.7%
k 2
 
5.7%
n 1
 
2.9%
x 1
 
2.9%
l 1
 
2.9%
Other values (5) 5
14.3%
Decimal Number
ValueCountFrequency (%)
1 4080
24.8%
2 2458
15.0%
3 1596
 
9.7%
0 1307
 
8.0%
5 1304
 
7.9%
4 1267
 
7.7%
7 1238
 
7.5%
6 1159
 
7.1%
8 1038
 
6.3%
9 987
 
6.0%
Other Punctuation
ValueCountFrequency (%)
, 1126
98.3%
. 9
 
0.8%
/ 7
 
0.6%
& 3
 
0.3%
Other Symbol
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
Space Separator
ValueCountFrequency (%)
18366
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1703
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1700
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1170
100.0%
Math Symbol
ValueCountFrequency (%)
~ 106
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 58596
58.9%
Common 40628
40.8%
Latin 284
 
0.3%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4908
 
8.4%
4883
 
8.3%
3732
 
6.4%
3626
 
6.2%
3507
 
6.0%
3476
 
5.9%
3464
 
5.9%
3316
 
5.7%
2428
 
4.1%
1604
 
2.7%
Other values (365) 23652
40.4%
Latin
ValueCountFrequency (%)
B 41
14.4%
A 39
13.7%
C 24
 
8.5%
E 17
 
6.0%
I 17
 
6.0%
T 17
 
6.0%
K 12
 
4.2%
R 10
 
3.5%
D 10
 
3.5%
H 9
 
3.2%
Other values (30) 88
31.0%
Common
ValueCountFrequency (%)
18366
45.2%
1 4080
 
10.0%
2 2458
 
6.1%
( 1703
 
4.2%
) 1700
 
4.2%
3 1596
 
3.9%
0 1307
 
3.2%
5 1304
 
3.2%
4 1267
 
3.1%
7 1238
 
3.0%
Other values (10) 5609
 
13.8%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 58595
58.9%
ASCII 40908
41.1%
CJK Compat 4
 
< 0.1%
CJK 1
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18366
44.9%
1 4080
 
10.0%
2 2458
 
6.0%
( 1703
 
4.2%
) 1700
 
4.2%
3 1596
 
3.9%
0 1307
 
3.2%
5 1304
 
3.2%
4 1267
 
3.1%
7 1238
 
3.0%
Other values (49) 5889
 
14.4%
Hangul
ValueCountFrequency (%)
4908
 
8.4%
4883
 
8.3%
3732
 
6.4%
3626
 
6.2%
3507
 
6.0%
3476
 
5.9%
3464
 
5.9%
3316
 
5.7%
2428
 
4.1%
1604
 
2.7%
Other values (364) 23651
40.4%
CJK Compat
ValueCountFrequency (%)
4
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
1
100.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size27.1 KiB
Minimum2024-02-22 00:00:00
Maximum2024-02-22 00:00:00
2024-03-15T05:16:46.693410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T05:16:46.863415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2024-03-15T05:16:39.259342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T05:16:39.612814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명시설구분업소명소재지도로명주소데이터기준일자
0용인시1호(숙박업소)(주)더숨디앤씨경기도 용인시 처인구 포곡읍 성산로 633, A,B,C,D,E동2024-02-22
1용인시1호(숙박업소)(주)더트리니호텔경기도 용인시 처인구 중부대로 1218, THE TRINY HOTEL (역북동)2024-02-22
2용인시1호(숙박업소)G7경기도 용인시 처인구 포곡읍 전대로78번길 14-62024-02-22
3용인시1호(숙박업소)NO25 용인터미널점경기도 용인시 처인구 금령로117번길 17 (김량장동)2024-02-22
4용인시1호(숙박업소)Q(큐)호텔경기도 용인시 처인구 포곡읍 전대로110번길 10-32024-02-22
5용인시1호(숙박업소)SR 디자인 호텔경기도 용인시 처인구 금령로90번길 5 (김량장동)2024-02-22
6용인시1호(숙박업소)STAY HOTEL경기도 용인시 처인구 양지면 남평로 3672024-02-22
7용인시1호(숙박업소)T7 HOTEL경기도 용인시 처인구 백옥대로 1130 (김량장동)2024-02-22
8용인시1호(숙박업소)W(더블유)모텔경기도 용인시 처인구 백암면 백암로226번길 372024-02-22
9용인시1호(숙박업소)골든튤립에버용인호텔경기도 용인시 처인구 포곡읍 전대로78번길 19-2, 골든튤립에버용인호텔2024-02-22
시군명시설구분업소명소재지도로명주소데이터기준일자
3442용인시13호수지파크푸르지오경기도 용인시 수지구 풍덕천로 171번길 92024-02-22
3443용인시13호동천센트럴자이(동천자이2차)경기도 용인시 수지구 고기로45번길 40-182024-02-22
3444용인시13호성복역롯데캐슬골드타운경기도 용인시 수지구 성복2로 102024-02-22
3445용인시13호동천파크자이경기도 용인시 수지구 동천로153번길 792024-02-22
3446용인시13호더샵수지포레(상현더샵파크사이드)경기도 용인시 수지구 만현로67번길 202024-02-22
3447용인시13호더샵동천이스트포레경기도 용인시 수지구 수풍로 892024-02-22
3448용인시13호성복역롯데캐슬파크나인경기도 용인시 수지구 성복1로 132024-02-22
3449용인시13호성복역롯데캐슬클라시엘경기도 용인시 수지구 성복1로 352024-02-22
3450용인시13호힐스테이트광교산경기도 용인시 수지구 신봉2로 1542024-02-22
3451용인시13호수지스카이뷰푸르지오경기도 용인시 수지구 신봉3로7번길 312024-02-22

Duplicate rows

Most frequently occurring

시군명시설구분업소명소재지도로명주소데이터기준일자# duplicates
0용인시11호(연면적2000㎡이상 사무실및복합건축물)SK아카데미경기도 용인시 처인구 원삼면 모래실로 172024-02-222
1용인시11호(연면적2000㎡이상 사무실및복합건축물)에버랜드경기도 용인시 처인구 포곡읍 에버랜드로 1992024-02-222