Overview

Dataset statistics

Number of variables4
Number of observations535
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory17.4 KiB
Average record size in memory33.2 B

Variable types

Numeric1
Categorical1
Text2

Dataset

Description경기도 안양시 만안구 관내 소독의무대상시설 현황(관내 소독의무대상시설명, 관내 소독의무대상시설소재지)데이터 정보입니다.
Author경기도 안양시
URLhttps://www.data.go.kr/data/15055320/fileData.do

Alerts

연번 is highly overall correlated with 시설종류High correlation
시설종류 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:49:19.721482
Analysis finished2023-12-12 09:49:20.345090
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct535
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean268
Minimum1
Maximum535
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.8 KiB
2023-12-12T18:49:20.429511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile27.7
Q1134.5
median268
Q3401.5
95-th percentile508.3
Maximum535
Range534
Interquartile range (IQR)267

Descriptive statistics

Standard deviation154.58547
Coefficient of variation (CV)0.57681144
Kurtosis-1.2
Mean268
Median Absolute Deviation (MAD)134
Skewness0
Sum143380
Variance23896.667
MonotonicityStrictly increasing
2023-12-12T18:49:20.662884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
353 1
 
0.2%
367 1
 
0.2%
366 1
 
0.2%
365 1
 
0.2%
364 1
 
0.2%
363 1
 
0.2%
362 1
 
0.2%
361 1
 
0.2%
360 1
 
0.2%
Other values (525) 525
98.1%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
535 1
0.2%
534 1
0.2%
533 1
0.2%
532 1
0.2%
531 1
0.2%
530 1
0.2%
529 1
0.2%
528 1
0.2%
527 1
0.2%
526 1
0.2%

시설종류
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
건축물
156 
집단급식
95 
보육시설
65 
숙박업
47 
식품접객
45 
Other values (7)
127 

Length

Max length4
Median length3
Mean length3.3775701
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row숙박업
2nd row숙박업
3rd row숙박업
4th row숙박업
5th row숙박업

Common Values

ValueCountFrequency (%)
건축물 156
29.2%
집단급식 95
17.8%
보육시설 65
12.1%
숙박업 47
 
8.8%
식품접객 45
 
8.4%
학교 41
 
7.7%
공동주택 41
 
7.7%
교통시설 18
 
3.4%
병원 14
 
2.6%
유통 7
 
1.3%
Other values (2) 6
 
1.1%

Length

2023-12-12T18:49:20.861999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
건축물 156
29.2%
집단급식 95
17.8%
보육시설 65
12.1%
숙박업 47
 
8.8%
식품접객 45
 
8.4%
학교 41
 
7.7%
공동주택 41
 
7.7%
교통시설 18
 
3.4%
병원 14
 
2.6%
유통 7
 
1.3%
Other values (2) 6
 
1.1%
Distinct476
Distinct (%)89.0%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
2023-12-12T18:49:21.103707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length21
Mean length7.8056075
Min length2

Characters and Unicode

Total characters4176
Distinct characters433
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique418 ?
Unique (%)78.1%

Sample

1st row(주)삼원프라자호텔
2nd rowMOON모텔(호텔문)
3rd rowSS(상상)모텔
4th row거화파크
5th row그린로즈모텔/꿈꾸다모텔
ValueCountFrequency (%)
본병원 3
 
0.5%
안양 3
 
0.5%
성결대학교 3
 
0.5%
안양샘병원 3
 
0.5%
건물 3
 
0.5%
안양삼성어린이집 2
 
0.3%
양명여자고등학교 2
 
0.3%
안양초등학교 2
 
0.3%
경기캠퍼스 2
 
0.3%
경인교육대학교 2
 
0.3%
Other values (519) 579
95.9%
2023-12-12T18:49:21.512320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
137
 
3.3%
127
 
3.0%
99
 
2.4%
99
 
2.4%
96
 
2.3%
( 94
 
2.3%
) 93
 
2.2%
88
 
2.1%
69
 
1.7%
68
 
1.6%
Other values (423) 3206
76.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3774
90.4%
Open Punctuation 94
 
2.3%
Close Punctuation 93
 
2.2%
Space Separator 69
 
1.7%
Decimal Number 54
 
1.3%
Uppercase Letter 54
 
1.3%
Other Punctuation 18
 
0.4%
Other Symbol 11
 
0.3%
Lowercase Letter 6
 
0.1%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
137
 
3.6%
127
 
3.4%
99
 
2.6%
99
 
2.6%
96
 
2.5%
88
 
2.3%
68
 
1.8%
67
 
1.8%
63
 
1.7%
61
 
1.6%
Other values (381) 2869
76.0%
Uppercase Letter
ValueCountFrequency (%)
T 6
11.1%
C 6
11.1%
S 5
 
9.3%
O 5
 
9.3%
K 5
 
9.3%
B 3
 
5.6%
G 3
 
5.6%
M 3
 
5.6%
Y 3
 
5.6%
L 2
 
3.7%
Other values (9) 13
24.1%
Decimal Number
ValueCountFrequency (%)
1 20
37.0%
2 10
18.5%
0 7
 
13.0%
5 5
 
9.3%
3 4
 
7.4%
4 2
 
3.7%
9 2
 
3.7%
7 2
 
3.7%
8 1
 
1.9%
6 1
 
1.9%
Lowercase Letter
ValueCountFrequency (%)
e 2
33.3%
w 1
16.7%
o 1
16.7%
t 1
16.7%
r 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 12
66.7%
/ 3
 
16.7%
& 3
 
16.7%
Open Punctuation
ValueCountFrequency (%)
( 94
100.0%
Close Punctuation
ValueCountFrequency (%)
) 93
100.0%
Space Separator
ValueCountFrequency (%)
69
100.0%
Other Symbol
ValueCountFrequency (%)
11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3785
90.6%
Common 331
 
7.9%
Latin 60
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
137
 
3.6%
127
 
3.4%
99
 
2.6%
99
 
2.6%
96
 
2.5%
88
 
2.3%
68
 
1.8%
67
 
1.8%
63
 
1.7%
61
 
1.6%
Other values (382) 2880
76.1%
Latin
ValueCountFrequency (%)
T 6
 
10.0%
C 6
 
10.0%
S 5
 
8.3%
O 5
 
8.3%
K 5
 
8.3%
B 3
 
5.0%
G 3
 
5.0%
M 3
 
5.0%
Y 3
 
5.0%
L 2
 
3.3%
Other values (14) 19
31.7%
Common
ValueCountFrequency (%)
( 94
28.4%
) 93
28.1%
69
20.8%
1 20
 
6.0%
, 12
 
3.6%
2 10
 
3.0%
0 7
 
2.1%
5 5
 
1.5%
3 4
 
1.2%
/ 3
 
0.9%
Other values (7) 14
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3774
90.4%
ASCII 391
 
9.4%
None 11
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
137
 
3.6%
127
 
3.4%
99
 
2.6%
99
 
2.6%
96
 
2.5%
88
 
2.3%
68
 
1.8%
67
 
1.8%
63
 
1.7%
61
 
1.6%
Other values (381) 2869
76.0%
ASCII
ValueCountFrequency (%)
( 94
24.0%
) 93
23.8%
69
17.6%
1 20
 
5.1%
, 12
 
3.1%
2 10
 
2.6%
0 7
 
1.8%
T 6
 
1.5%
C 6
 
1.5%
S 5
 
1.3%
Other values (31) 69
17.6%
None
ValueCountFrequency (%)
11
100.0%
Distinct510
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
2023-12-12T18:49:21.892635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length46
Mean length26.607477
Min length17

Characters and Unicode

Total characters14235
Distinct characters183
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique489 ?
Unique (%)91.4%

Sample

1st row경기도 안양시 만안구 장내로139번길 7 (안양동)
2nd row경기도 안양시 만안구 예술공원로208번길 4 (안양동)
3rd row경기도 안양시 만안구 병목안로 12 (안양동)
4th row경기도 안양시 만안구 수리산로47번길 6-5 (안양동)
5th row경기도 안양시 만안구 안양로319번길 26 (안양동,.146)
ValueCountFrequency (%)
경기도 535
17.2%
만안구 535
17.2%
안양시 530
17.0%
안양동 158
 
5.1%
안양로 82
 
2.6%
석수동 31
 
1.0%
만안로 27
 
0.9%
박달동 24
 
0.8%
박달로 21
 
0.7%
병목안로 13
 
0.4%
Other values (567) 1159
37.2%
2023-12-12T18:49:22.481455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2601
18.3%
1488
 
10.5%
933
 
6.6%
569
 
4.0%
564
 
4.0%
541
 
3.8%
540
 
3.8%
537
 
3.8%
536
 
3.8%
535
 
3.8%
Other values (173) 5391
37.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8590
60.3%
Space Separator 2601
 
18.3%
Decimal Number 2172
 
15.3%
Open Punctuation 306
 
2.1%
Close Punctuation 306
 
2.1%
Other Punctuation 181
 
1.3%
Dash Punctuation 59
 
0.4%
Math Symbol 9
 
0.1%
Uppercase Letter 9
 
0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1488
17.3%
933
10.9%
569
 
6.6%
564
 
6.6%
541
 
6.3%
540
 
6.3%
537
 
6.3%
536
 
6.2%
535
 
6.2%
310
 
3.6%
Other values (151) 2037
23.7%
Decimal Number
ValueCountFrequency (%)
1 496
22.8%
3 302
13.9%
2 287
13.2%
4 212
9.8%
5 186
 
8.6%
0 149
 
6.9%
8 146
 
6.7%
7 140
 
6.4%
6 127
 
5.8%
9 127
 
5.8%
Other Punctuation
ValueCountFrequency (%)
, 173
95.6%
. 4
 
2.2%
/ 4
 
2.2%
Uppercase Letter
ValueCountFrequency (%)
A 6
66.7%
C 2
 
22.2%
K 1
 
11.1%
Space Separator
ValueCountFrequency (%)
2601
100.0%
Open Punctuation
ValueCountFrequency (%)
( 306
100.0%
Close Punctuation
ValueCountFrequency (%)
) 306
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 59
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8590
60.3%
Common 5636
39.6%
Latin 9
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1488
17.3%
933
10.9%
569
 
6.6%
564
 
6.6%
541
 
6.3%
540
 
6.3%
537
 
6.3%
536
 
6.2%
535
 
6.2%
310
 
3.6%
Other values (151) 2037
23.7%
Common
ValueCountFrequency (%)
2601
46.1%
1 496
 
8.8%
( 306
 
5.4%
) 306
 
5.4%
3 302
 
5.4%
2 287
 
5.1%
4 212
 
3.8%
5 186
 
3.3%
, 173
 
3.1%
0 149
 
2.6%
Other values (9) 618
 
11.0%
Latin
ValueCountFrequency (%)
A 6
66.7%
C 2
 
22.2%
K 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8590
60.3%
ASCII 5643
39.6%
CJK Compat 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2601
46.1%
1 496
 
8.8%
( 306
 
5.4%
) 306
 
5.4%
3 302
 
5.4%
2 287
 
5.1%
4 212
 
3.8%
5 186
 
3.3%
, 173
 
3.1%
0 149
 
2.6%
Other values (11) 625
 
11.1%
Hangul
ValueCountFrequency (%)
1488
17.3%
933
10.9%
569
 
6.6%
564
 
6.6%
541
 
6.3%
540
 
6.3%
537
 
6.3%
536
 
6.2%
535
 
6.2%
310
 
3.6%
Other values (151) 2037
23.7%
CJK Compat
ValueCountFrequency (%)
2
100.0%

Interactions

2023-12-12T18:49:20.086998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:49:22.578398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설종류
연번1.0000.930
시설종류0.9301.000
2023-12-12T18:49:22.687655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설종류
연번1.0000.750
시설종류0.7501.000

Missing values

2023-12-12T18:49:20.185053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:49:20.303259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시설종류시설명소재지
01숙박업(주)삼원프라자호텔경기도 안양시 만안구 장내로139번길 7 (안양동)
12숙박업MOON모텔(호텔문)경기도 안양시 만안구 예술공원로208번길 4 (안양동)
23숙박업SS(상상)모텔경기도 안양시 만안구 병목안로 12 (안양동)
34숙박업거화파크경기도 안양시 만안구 수리산로47번길 6-5 (안양동)
45숙박업그린로즈모텔/꿈꾸다모텔경기도 안양시 만안구 안양로319번길 26 (안양동,.146)
56숙박업그린파크장여관(그린모텔)경기도 안양시 만안구 장내로140번길 31-5 (안양동)
67숙박업돈키호텔(모텔)경기도 안양시 만안구 만안로 185 (안양동,30)
78숙박업라노비아호텔경기도 안양시 만안구 안양로304번길 21 (안양동)
89숙박업라인모텔경기도 안양시 만안구 만안로 107 (안양동422-12)
910숙박업이브모텔경기도 안양시 만안구 수리산로40번길 11 (안양동)
연번시설종류시설명소재지
525526공동주택안양1동주공뜨란채경기도 안양시 만안구 안양천서로 289
526527공동주택석수아이파크아파트경기도 안양시 만안구 충훈로 52
527528공동주택한라비발디아파트경기도 안양시 만안구 박달로 453
528529공동주택LG빌리지아파트경기도 안양시 만안구 연현로79번길 105
529530공동주택석수e편한세상경기도 안양시 만안구 경수대로 1193
530531공동주택삼성래미안아파트경기도 안양시 만안구 안양천서로 311
531532공동주택래미안안양메가트리아경기도 안양시 만안구 안양천서로 177
532533공동주택한양수자인에듀파크경기도 안양시 만안구 충훈로 14
533534공동주택안양역한양수자인리버파크경기도 안양시 만안구 안양천서로 357
534535공동주택씨엘포레자이경기도 안양시 만안구 소곡로 72