Overview

Dataset statistics

Number of variables: 4
Number of observations: 385
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.5 KiB
Average record size in memory: 33.3 B

Variable types

Numeric: 1
Text: 2
Categorical: 1

Dataset

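Description: 대구광역시 서구_의무소독대상시설_20240202 (Seo-gu, Daegu Metropolitan City: facilities subject to mandatory disinfection, 2024-02-02)
Author: 대구광역시 서구 (Seo-gu, Daegu Metropolitan City)
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=15103326&dataSetDetailId=151033261a8c2ca54c1e9&provdMethod=FILE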

Alerts

번호 is highly overall correlated with 구분 (High correlation)
구분 is highly overall correlated with 번호 (High correlation)
번호 has unique values (Unique)

Reproduction

Analysis started: 2024-03-13 14:09:49.794789
Analysis finished: 2024-03-13 14:09:50.311189
Duration: 0.52 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
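
The reported duration follows from the two timestamps above. A minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report:

```python
from datetime import datetime

# Timestamp layout used in the "Analysis started/finished" fields.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-03-13 14:09:49.794789", FMT)
finished = datetime.strptime("2024-03-13 14:09:50.311189", FMT)

# Elapsed wall-clock time in seconds, rounded the way the report shows it.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```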

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 385
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 193
Minimum: 1
Maximum: 385
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 20.2
Q1: 97
Median: 193
Q3: 289
95-th percentile: 365.8
Maximum: 385
Range: 384
Interquartile range (IQR): 192
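
The quantile values above are consistent with linear interpolation between order statistics, which is the default rule in NumPy and pandas. The sketch below reproduces them under the assumption that 번호 holds exactly the integers 1 through 385; the report's minimum, maximum, distinct count and monotonicity suggest this, but it does not list every value. The `quantile` helper is written for illustration only.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_values) - 1)        # fractional index into the sorted data
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


# Assumption: the column is the running index 1, 2, ..., 385.
values = list(range(1, 386))

for label, q in [("5%", 0.05), ("Q1", 0.25), ("median", 0.5), ("Q3", 0.75), ("95%", 0.95)]:
    print(label, round(quantile(values, q), 1))

iqr = quantile(values, 0.75) - quantile(values, 0.25)
print("IQR", iqr)
```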

Descriptive statistics

Standard deviation: 111.28417
Coefficient of variation (CV): 0.57660192
Kurtosis: -1.2
Mean: 193
Median Absolute Deviation (MAD): 96
Skewness: 0
Sum: 74305
Variance: 12384.167
Monotonicity: Strictly increasing
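
As a sanity check, most of these descriptive statistics can be recomputed with the Python standard library. This is a sketch under the assumption that 번호 is the sequence 1, 2, ..., 385, which the report implies but does not state outright. The variance here uses the sample (n - 1) denominator, which matches the reported figure.

```python
import math
import statistics

# Assumption: the column is the running index 1, 2, ..., 385.
values = list(range(1, 386))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)    # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                           # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

print("Sum", total)
print("Mean", mean)
print("Variance", round(variance, 3))
print("Standard deviation", round(std, 5))
print("CV", round(cv, 8))
print("MAD", mad)
```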
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.3%
290 | 1 | 0.3%
264 | 1 | 0.3%
263 | 1 | 0.3%
262 | 1 | 0.3%
261 | 1 | 0.3%
260 | 1 | 0.3%
259 | 1 | 0.3%
258 | 1 | 0.3%
257 | 1 | 0.3%
Other values (375) | 375 | 97.4%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
385 | 1 | 0.3%
384 | 1 | 0.3%
383 | 1 | 0.3%
382 | 1 | 0.3%
381 | 1 | 0.3%
380 | 1 | 0.3%
379 | 1 | 0.3%
378 | 1 | 0.3%
377 | 1 | 0.3%
376 | 1 | 0.3%

시설명
Text

Distinct: 357
Distinct (%): 92.7%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Length

Max length: 20
Median length: 17
Mean length: 8.2779221
Min length: 2

Characters and Unicode

Total characters: 3187
Distinct characters: 370
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
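
To illustrate what the category counts measure, the sketch below classifies the characters of two facility names that appear in the report's sample rows, using Python's standard `unicodedata` module. The module returns two-letter general-category codes; the `LONG_NAMES` mapping to the report's labels is written out here for illustration and covers only the codes these two names produce.

```python
import unicodedata
from collections import Counter

# Two facility names taken from the sample rows of this report.
names = ["M-월드", "한화생명보험(주)대구내당동사옥"]

# Two-letter general category of every character, e.g. "Lo" for a Hangul syllable.
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

# Illustrative mapping from category codes to the labels used in the report.
LONG_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

for code, count in category_counts.most_common():
    print(LONG_NAMES.get(code, code), count)
```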

Unique

Unique: 329
Unique (%): 85.5%
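
"Distinct" and "Unique" measure different things: distinct is the number of different values in the column, while unique is the number of values that occur exactly once. A minimal sketch on a made-up toy column (the names below are hypothetical, not rows from the dataset), followed by a check of the two percentages reported for this variable:

```python
from collections import Counter

# Hypothetical toy column, for illustration only.
values = ["A모텔", "B호텔", "A모텔", "C여관", "B호텔", "D식당"]

counts = Counter(values)
distinct = len(counts)                               # number of different values
unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once
print(distinct, unique)

# Percentages are taken relative to the number of observations (385 here).
distinct_pct = round(100 * 357 / 385, 1)
unique_pct = round(100 * 329 / 385, 1)
print(distinct_pct, unique_pct)
```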

Sample

1st row: 제니스호텔
2nd row: 호텔센텀
3rd row: 동원모텔
4th row: 하와이모텔
5th row: 기키모텔
ValueCountFrequency (%)
건물관리자 33
 
6.0%
서구 33
 
6.0%
구립 10
 
1.8%
서대구로 7
 
1.3%
국채보상로 6
 
1.1%
주식회사 3
 
0.5%
내당점 3
 
0.5%
3
 
0.5%
달서로 3
 
0.5%
모텔 3
 
0.5%
Other values (404) 445
81.1%

Most occurring characters

ValueCountFrequency (%)
164
 
5.1%
149
 
4.7%
121
 
3.8%
103
 
3.2%
77
 
2.4%
76
 
2.4%
73
 
2.3%
64
 
2.0%
59
 
1.9%
49
 
1.5%
Other values (360) 2252
70.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2814 | 88.3%
Space Separator | 164 | 5.1%
Decimal Number | 115 | 3.6%
Uppercase Letter | 40 | 1.3%
Close Punctuation | 20 | 0.6%
Open Punctuation | 20 | 0.6%
Lowercase Letter | 6 | 0.2%
Other Symbol | 4 | 0.1%
Dash Punctuation | 2 | 0.1%
Other Punctuation | 2 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
149
 
5.3%
121
 
4.3%
103
 
3.7%
77
 
2.7%
76
 
2.7%
73
 
2.6%
64
 
2.3%
59
 
2.1%
49
 
1.7%
46
 
1.6%
Other values (321) 1997
71.0%
Uppercase Letter
ValueCountFrequency (%)
T 8
20.0%
D 6
15.0%
M 4
10.0%
L 3
 
7.5%
E 3
 
7.5%
O 3
 
7.5%
H 3
 
7.5%
S 2
 
5.0%
N 1
 
2.5%
B 1
 
2.5%
Other values (6) 6
15.0%
Decimal Number
ValueCountFrequency (%)
1 28
24.3%
2 20
17.4%
0 11
 
9.6%
7 10
 
8.7%
3 9
 
7.8%
6 8
 
7.0%
8 8
 
7.0%
4 8
 
7.0%
5 7
 
6.1%
9 6
 
5.2%
Lowercase Letter
ValueCountFrequency (%)
e 1
16.7%
i 1
16.7%
t 1
16.7%
y 1
16.7%
k 1
16.7%
s 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 1
50.0%
& 1
50.0%
Space Separator
ValueCountFrequency (%)
164
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2818 | 88.4%
Common | 323 | 10.1%
Latin | 46 | 1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
149
 
5.3%
121
 
4.3%
103
 
3.7%
77
 
2.7%
76
 
2.7%
73
 
2.6%
64
 
2.3%
59
 
2.1%
49
 
1.7%
46
 
1.6%
Other values (322) 2001
71.0%
Latin
ValueCountFrequency (%)
T 8
17.4%
D 6
13.0%
M 4
 
8.7%
L 3
 
6.5%
E 3
 
6.5%
O 3
 
6.5%
H 3
 
6.5%
S 2
 
4.3%
e 1
 
2.2%
N 1
 
2.2%
Other values (12) 12
26.1%
Common
ValueCountFrequency (%)
164
50.8%
1 28
 
8.7%
2 20
 
6.2%
) 20
 
6.2%
( 20
 
6.2%
0 11
 
3.4%
7 10
 
3.1%
3 9
 
2.8%
6 8
 
2.5%
8 8
 
2.5%
Other values (6) 25
 
7.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2814 | 88.3%
ASCII | 369 | 11.6%
None | 4 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
164
44.4%
1 28
 
7.6%
2 20
 
5.4%
) 20
 
5.4%
( 20
 
5.4%
0 11
 
3.0%
7 10
 
2.7%
3 9
 
2.4%
6 8
 
2.2%
8 8
 
2.2%
Other values (28) 71
19.2%
Hangul
ValueCountFrequency (%)
149
 
5.3%
121
 
4.3%
103
 
3.7%
77
 
2.7%
76
 
2.7%
73
 
2.6%
64
 
2.3%
59
 
2.1%
49
 
1.7%
46
 
1.6%
Other values (321) 1997
71.0%
None
ValueCountFrequency (%)
4
100.0%

도로명주소
Text

Distinct: 360
Distinct (%): 93.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Length

Max length: 43
Median length: 38
Mean length: 25.522078
Min length: 14

Characters and Unicode

Total characters: 9826
Distinct characters: 159
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 335
Unique (%): 87.0%

Sample

1st row: 대구광역시 서구 국채보상로42길 17 (평리동)
2nd row: 대구광역시 서구 국채보상로42길 20 (평리동)
3rd row: 대구광역시 서구 서대구로 322 (비산동)
4th row: 대구광역시 서구 서대구로 356 (비산동)
5th row: 대구광역시 서구 국채보상로42길 41 (평리동)
ValueCountFrequency (%)
대구광역시 385
19.3%
서구 384
19.2%
평리동 116
 
5.8%
내당동 83
 
4.2%
비산동 74
 
3.7%
서대구로 64
 
3.2%
국채보상로 47
 
2.4%
중리동 36
 
1.8%
달구벌대로 28
 
1.4%
평리로 15
 
0.8%
Other values (419) 764
38.3%

Most occurring characters

ValueCountFrequency (%)
1611
16.4%
918
 
9.3%
552
 
5.6%
533
 
5.4%
389
 
4.0%
387
 
3.9%
387
 
3.9%
383
 
3.9%
365
 
3.7%
) 359
 
3.7%
Other values (149) 3942
40.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5909 | 60.1%
Space Separator | 1611 | 16.4%
Decimal Number | 1428 | 14.5%
Close Punctuation | 359 | 3.7%
Open Punctuation | 359 | 3.7%
Other Punctuation | 111 | 1.1%
Dash Punctuation | 40 | 0.4%
Math Symbol | 7 | 0.1%
Uppercase Letter | 1 | < 0.1%
Lowercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
918
15.5%
552
 
9.3%
533
 
9.0%
389
 
6.6%
387
 
6.5%
387
 
6.5%
383
 
6.5%
365
 
6.2%
201
 
3.4%
155
 
2.6%
Other values (130) 1639
27.7%
Decimal Number
ValueCountFrequency (%)
1 260
18.2%
2 230
16.1%
3 227
15.9%
4 126
8.8%
5 120
8.4%
7 111
7.8%
6 110
7.7%
0 95
 
6.7%
8 90
 
6.3%
9 59
 
4.1%
Other Punctuation
ValueCountFrequency (%)
, 110
99.1%
? 1
 
0.9%
Space Separator
ValueCountFrequency (%)
1611
100.0%
Close Punctuation
ValueCountFrequency (%)
) 359
100.0%
Open Punctuation
ValueCountFrequency (%)
( 359
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%
Uppercase Letter
ValueCountFrequency (%)
M 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5909 | 60.1%
Common | 3915 | 39.8%
Latin | 2 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
918
15.5%
552
 
9.3%
533
 
9.0%
389
 
6.6%
387
 
6.5%
387
 
6.5%
383
 
6.5%
365
 
6.2%
201
 
3.4%
155
 
2.6%
Other values (130) 1639
27.7%
Common
ValueCountFrequency (%)
1611
41.1%
) 359
 
9.2%
( 359
 
9.2%
1 260
 
6.6%
2 230
 
5.9%
3 227
 
5.8%
4 126
 
3.2%
5 120
 
3.1%
7 111
 
2.8%
6 110
 
2.8%
Other values (7) 402
 
10.3%
Latin
ValueCountFrequency (%)
M 1
50.0%
e 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5909 | 60.1%
ASCII | 3917 | 39.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1611
41.1%
) 359
 
9.2%
( 359
 
9.2%
1 260
 
6.6%
2 230
 
5.9%
3 227
 
5.8%
4 126
 
3.2%
5 120
 
3.1%
7 111
 
2.8%
6 110
 
2.8%
Other values (9) 404
 
10.3%
Hangul
ValueCountFrequency (%)
918
15.5%
552
 
9.3%
533
 
9.0%
389
 
6.6%
387
 
6.5%
387
 
6.5%
383
 
6.5%
365
 
6.2%
201
 
3.4%
155
 
2.6%
Other values (130) 1639
27.7%

구분
Categorical

HIGH CORRELATION 

Distinct: 13
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB
복합건축물: 90
집단급식소: 62
식품접객업: 50
숙박: 47
보육시설: 39
Other values (8): 97

Length

Max length: 5
Median length: 5
Mean length: 3.9636364
Min length: 2

Unique

Unique: 1
Unique (%): 0.3%

Sample

1st row: 숙박
2nd row: 숙박
3rd row: 숙박
4th row: 숙박
5th row: 숙박

Common Values

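Value | Count | Frequency (%)
복합건축물 (mixed-use building) | 90 | 23.4%
집단급식소 (group meal service facility) | 62 | 16.1%
식품접객업 (food service business) | 50 | 13.0%
숙박 (lodging) | 47 | 12.2%
보육시설 (childcare facility) | 39 | 10.1%
학교 (school) | 32 | 8.3%
병원 (hospital) | 21 | 5.5%
공동주택 (multi-unit housing) | 20 | 5.2%
전통시장 (traditional market) | 11 | 2.9%
여객 (passenger transport) | 5 | 1.3%
Other values (3) | 8 | 2.1%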
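
Each frequency in the table above is the category count divided by the 385 observations. The sketch below recomputes the percentages from the counts listed in the report; the dictionary is typed in by hand from that table rather than loaded from the dataset.

```python
# Category counts for 구분, copied from the Common Values table.
counts = {
    "복합건축물": 90,
    "집단급식소": 62,
    "식품접객업": 50,
    "숙박": 47,
    "보육시설": 39,
    "학교": 32,
    "병원": 21,
    "공동주택": 20,
    "전통시장": 11,
    "여객": 5,
    "Other values (3)": 8,
}

total = sum(counts.values())  # should equal the number of observations

# Share of each category as a percentage, rounded to one decimal place.
shares = {name: round(100 * n / total, 1) for name, n in counts.items()}

for name, pct in shares.items():
    print(f"{name}: {pct}%")
```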

Length

Histogram of lengths of the category

Interactions


Correlations

Variable | 번호 | 구분
번호 | 1.000 | 0.937
구분 | 0.937 | 1.000

Variable | 번호 | 구분
번호 | 1.000 | 0.764
구분 | 0.764 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 번호 (number) | 시설명 (facility name) | 도로명주소 (road-name address) | 구분 (category)
0 | 1 | 제니스호텔 | 대구광역시 서구 국채보상로42길 17 (평리동) | 숙박
1 | 2 | 호텔센텀 | 대구광역시 서구 국채보상로42길 20 (평리동) | 숙박
2 | 3 | 동원모텔 | 대구광역시 서구 서대구로 322 (비산동) | 숙박
3 | 4 | 하와이모텔 | 대구광역시 서구 서대구로 356 (비산동) | 숙박
4 | 5 | 기키모텔 | 대구광역시 서구 국채보상로42길 41 (평리동) | 숙박
5 | 6 | 장수장여관 | 대구광역시 서구 북비산로 303-1 (평리동) | 숙박
6 | 7 | 비바스호텔 | 대구광역시 서구 서대구로21길 6 (평리동) | 숙박
7 | 8 | 유성모텔 | 대구광역시 서구 국채보상로 236 (평리동) | 숙박
8 | 9 | 유진장모텔 | 대구광역시 서구 서대구로 200 (평리동) | 숙박
9 | 10 | 테마모텔 | 대구광역시 서구 평리로 316 (내당동) | 숙박

Last rows

Index | 번호 (number) | 시설명 (facility name) | 도로명주소 (road-name address) | 구분 (category)
375 | 376 | 서연빌딩 | 대구광역시 서구 통학로 57 (내당동) | 복합건축물
376 | 377 | 서구 가르뱅이로16길 11 건물관리자 | 대구광역시 서구 가르뱅이로16길 11 (상리동) | 복합건축물
377 | 378 | 서구 국채보상로 168 건물관리자 | 대구광역시 서구 국채보상로 168 (중리동) | 복합건축물
378 | 379 | 주식회사대안 | 대구광역시 서구 달서천로 68 (이현동) | 복합건축물
379 | 380 | 한화생명보험(주)대구내당동사옥 | 대구광역시 서구 달구벌대로 1833 (내당동) | 복합건축물
380 | 381 | 비산웰빙하와이 | 대구광역시 서구 문화로 322 (비산동) | 복합건축물
381 | 382 | M-월드 | 대구광역시 서구 문화로 37 (이현동) | 복합건축물
382 | 383 | 서구 달서로 201 건물관리자 | 대구광역시 서구 달서로 201 (비산동) | 복합건축물
383 | 384 | 대구N타워 | 대구광역시 서구 달구벌대로361길 1 (내당동) | 복합건축물
384 | 385 | M플라자 | 대구광역시 서구 달구벌대로 1691 (내당동) | 복합건축물