Overview

Dataset statistics

Number of variables: 5
Number of observations: 437
Missing cells: 20
Missing cells (%): 0.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 17.2 KiB
Average record size in memory: 40.3 B
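
The two derived figures in this block follow from the raw counts. A minimal sketch, assuming the missing-cell percentage is taken over all rows times columns and that 1 KiB is 1024 bytes; the function names are illustrative and not part of the profiler:

```python
# Figures copied from the dataset statistics block.
N_VARIABLES = 5
N_OBSERVATIONS = 437
MISSING_CELLS = 20
TOTAL_SIZE_KIB = 17.2


def missing_cell_pct(missing: int, rows: int, cols: int) -> float:
    """Share of empty cells among all rows * cols cells, in percent."""
    return 100.0 * missing / (rows * cols)


def avg_record_bytes(total_kib: float, rows: int) -> float:
    """Average bytes per row, assuming 1 KiB = 1024 bytes."""
    return total_kib * 1024 / rows


print(f"{missing_cell_pct(MISSING_CELLS, N_OBSERVATIONS, N_VARIABLES):.1f}%")  # 0.9%
print(f"{avg_record_bytes(TOTAL_SIZE_KIB, N_OBSERVATIONS):.1f} B")             # 40.3 B
```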

Variable types

Text: 3
Categorical: 2

Dataset

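Description: 대구광역시 달서구_음식물폐기물다량배출사업장_20220929 (Dalseo-gu, Daegu Metropolitan City: large-volume food waste producers, 2022-09-29)
Author: 대구광역시 달서구 (Dalseo-gu, Daegu Metropolitan City)
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=15106969&dataSetDetailId=151069691efdac531a474&provdMethod=FILE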

Alerts

High correlation: 데이터기준일자 is highly correlated overall with 사업장구분
High correlation: 사업장구분 is highly correlated overall with 데이터기준일자
Imbalance: 데이터기준일자 is highly imbalanced (97.7%)
Missing: 전화번호 has 18 (4.1%) missing values
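
The 97.7% imbalance figure can be checked by hand. The sketch below assumes the score is one minus the Shannon entropy of the class frequencies, normalised by log2 of the number of classes. That definition is an assumption about the profiler rather than something stated in this report, but it reproduces the reported value from the 436-to-1 split of 데이터기준일자. The name `imbalance_score` is illustrative.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for class counts.

    0.0 means perfectly balanced classes; values near 1.0 mean a single
    class holds almost every row.
    """
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# 데이터기준일자: 436 rows of "2022-09-29" and 1 row of "<NA>".
print(f"{imbalance_score([436, 1]):.1%}")  # 97.7%
```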

Reproduction

Analysis started: 2023-12-10 18:12:34.518241
Analysis finished: 2023-12-10 18:12:35.870322
Duration: 1.35 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
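
A report of this kind can be regenerated along the following lines. This is a sketch under stated assumptions, not the exact command used here: the CSV file name in the usage note is hypothetical, the `cp949` encoding is a guess that is common for Korean public data files, and the settings in `config.json` are not reproduced.

```python
def build_report(csv_path, output_html="report.html", encoding="cp949"):
    """Profile a CSV file and write the HTML report to output_html.

    The third-party imports are deferred so that this module can be
    loaded even where pandas and ydata-profiling are not installed.
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the source file is a CSV in the given encoding.
    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(
        df,
        title="대구광역시 달서구_음식물폐기물다량배출사업장_20220929",
    )
    report.to_file(output_html)
    return report
```

Calling `build_report("dalseo_food_waste_20220929.csv")` (hypothetical file name) would write `report.html` to the working directory.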

Variables

사업장명 (establishment name)
Text

Distinct: 432
Distinct (%): 99.1%
Missing: 1
Missing (%): 0.2%
Memory size: 3.5 KiB

Length

Max length: 24
Median length: 20
Mean length: 8.0802752
Min length: 2

Characters and Unicode

Total characters: 3523
Distinct characters: 421
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
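
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one of the sample values; the two-letter codes map to the names used in this report (`Lo` is Other Letter, `Ps` is Open Punctuation, `Pe` is Close Punctuation). Script and block lookups are not in the standard library and are left out. The helper name is illustrative.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category code (e.g. 'Lo', 'Ps')."""
    return Counter(unicodedata.category(ch) for ch in text)


counts = category_counts("(주)후레쉬케터링[효성대구공장]")
for code, n in counts.most_common():
    print(code, n)
```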

Unique

Unique: 428
Unique (%): 98.2%
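
Distinct and Unique measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. A minimal sketch with hypothetical sample values; it also checks that the reported percentages are consistent with dividing by the 436 non-missing rows, which is an inference from the figures rather than a documented rule.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique) for the non-missing values.

    distinct: number of different values
    unique:   number of values that appear exactly once
    """
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Hypothetical values: "B" repeats, None stands for a missing cell.
print(distinct_and_unique(["A", "B", "B", "C", None]))  # (3, 2)

# Reported figures: 437 rows, 1 missing, 432 distinct, 428 unique.
non_missing = 437 - 1
print(f"{100 * 432 / non_missing:.1f}%")  # 99.1%
print(f"{100 * 428 / non_missing:.1f}%")  # 98.2%
```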

Sample

1st row: 진월초등학교
2nd row: 장산초등학교
3rd row: 대구한샘초등학교
4th row: 열린아동병원
5th row: (주)후레쉬케터링[효성대구공장]
Value | Count | Frequency (%)
맥도날드 | 7 | 1.3%
주식회사 | 6 | 1.1%
주)아워홈 | 4 | 0.8%
주)한국피제스 | 3 | 0.6%
동원홈푸드 | 3 | 0.6%
월성점 | 3 | 0.6%
버거킹 | 3 | 0.6%
성서점 | 3 | 0.6%
구내식당 | 3 | 0.6%
본사 | 2 | 0.4%
Other values (473) | 492 | 93.0%
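
Entries such as 주)아워홈 suggest how the word counts are built. The sketch below assumes each value is split on whitespace and each token is stripped of leading and trailing ASCII punctuation; this is an assumption that would explain the lost opening parenthesis, not a confirmed description of the profiler. The input names are taken from the sample rows of this report.

```python
import string
from collections import Counter


def word_counts(values):
    """Count whitespace-separated tokens after stripping edge punctuation."""
    strip_chars = string.punctuation + string.whitespace
    counter = Counter()
    for value in values:
        for token in value.lower().split():
            token = token.strip(strip_chars)
            if token:
                counter[token] += 1
    return counter


names = [
    "(주)아워홈 덴티스대구점(성서공장)",
    "주식회사 페스티발푸드",
    "주식회사 규빈",
]
for word, n in word_counts(names).most_common():
    print(word, n)
```

Under this rule "(주)아워홈" becomes "주)아워홈": the leading parenthesis is at the edge of the token and is stripped, while the inner one is kept.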

Most occurring characters

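Value | Count | Frequency (%)
(not preserved) | 117 | 3.3%
(not preserved) | 102 | 2.9%
(not preserved) | 96 | 2.7%
(space) | 93 | 2.6%
(not preserved) | 90 | 2.6%
(not preserved) | 86 | 2.4%
) | 77 | 2.2%
( | 77 | 2.2%
(not preserved) | 69 | 2.0%
(not preserved) | 63 | 1.8%
Other values (411) | 2653 | 75.3%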

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3198 | 90.8%
Space Separator | 93 | 2.6%
Close Punctuation | 81 | 2.3%
Open Punctuation | 81 | 2.3%
Uppercase Letter | 37 | 1.1%
Decimal Number | 22 | 0.6%
Other Punctuation | 6 | 0.2%
Lowercase Letter | 5 | 0.1%

Most frequent character per category

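Other Letter
Value | Count | Frequency (%)
(not preserved) | 117 | 3.7%
(not preserved) | 102 | 3.2%
(not preserved) | 96 | 3.0%
(not preserved) | 90 | 2.8%
(not preserved) | 86 | 2.7%
(not preserved) | 69 | 2.2%
(not preserved) | 63 | 2.0%
(not preserved) | 61 | 1.9%
(not preserved) | 60 | 1.9%
(not preserved) | 59 | 1.8%
Other values (375) | 2395 | 74.9%

Uppercase Letter
Value | Count | Frequency (%)
D | 8 | 21.6%
T | 7 | 18.9%
S | 4 | 10.8%
O | 3 | 8.1%
K | 2 | 5.4%
C | 2 | 5.4%
A | 2 | 5.4%
I | 1 | 2.7%
U | 1 | 2.7%
G | 1 | 2.7%
Other values (6) | 6 | 16.2%

Decimal Number
Value | Count | Frequency (%)
2 | 7 | 31.8%
1 | 5 | 22.7%
0 | 3 | 13.6%
3 | 2 | 9.1%
9 | 2 | 9.1%
5 | 1 | 4.5%
4 | 1 | 4.5%
7 | 1 | 4.5%

Lowercase Letter
Value | Count | Frequency (%)
y | 1 | 20.0%
r | 1 | 20.0%
o | 1 | 20.0%
t | 1 | 20.0%
s | 1 | 20.0%

Close Punctuation
Value | Count | Frequency (%)
) | 77 | 95.1%
] | 4 | 4.9%

Open Punctuation
Value | Count | Frequency (%)
( | 77 | 95.1%
[ | 4 | 4.9%

Other Punctuation
Value | Count | Frequency (%)
& | 4 | 66.7%
. | 2 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 93 | 100.0%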

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3198 | 90.8%
Common | 283 | 8.0%
Latin | 42 | 1.2%

Most frequent character per script

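Hangul
Value | Count | Frequency (%)
(not preserved) | 117 | 3.7%
(not preserved) | 102 | 3.2%
(not preserved) | 96 | 3.0%
(not preserved) | 90 | 2.8%
(not preserved) | 86 | 2.7%
(not preserved) | 69 | 2.2%
(not preserved) | 63 | 2.0%
(not preserved) | 61 | 1.9%
(not preserved) | 60 | 1.9%
(not preserved) | 59 | 1.8%
Other values (375) | 2395 | 74.9%

Latin
Value | Count | Frequency (%)
D | 8 | 19.0%
T | 7 | 16.7%
S | 4 | 9.5%
O | 3 | 7.1%
K | 2 | 4.8%
C | 2 | 4.8%
A | 2 | 4.8%
I | 1 | 2.4%
U | 1 | 2.4%
y | 1 | 2.4%
Other values (11) | 11 | 26.2%

Common
Value | Count | Frequency (%)
(space) | 93 | 32.9%
) | 77 | 27.2%
( | 77 | 27.2%
2 | 7 | 2.5%
1 | 5 | 1.8%
& | 4 | 1.4%
[ | 4 | 1.4%
] | 4 | 1.4%
0 | 3 | 1.1%
3 | 2 | 0.7%
Other values (5) | 7 | 2.5%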

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3198 | 90.8%
ASCII | 325 | 9.2%

Most frequent character per block

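Hangul
Value | Count | Frequency (%)
(not preserved) | 117 | 3.7%
(not preserved) | 102 | 3.2%
(not preserved) | 96 | 3.0%
(not preserved) | 90 | 2.8%
(not preserved) | 86 | 2.7%
(not preserved) | 69 | 2.2%
(not preserved) | 63 | 2.0%
(not preserved) | 61 | 1.9%
(not preserved) | 60 | 1.9%
(not preserved) | 59 | 1.8%
Other values (375) | 2395 | 74.9%

ASCII
Value | Count | Frequency (%)
(space) | 93 | 28.6%
) | 77 | 23.7%
( | 77 | 23.7%
D | 8 | 2.5%
T | 7 | 2.2%
2 | 7 | 2.2%
1 | 5 | 1.5%
& | 4 | 1.2%
S | 4 | 1.2%
[ | 4 | 1.2%
Other values (26) | 39 | 12.0%
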
도로명주소 (road-name address)
Text

Distinct: 422
Distinct (%): 96.8%
Missing: 1
Missing (%): 0.2%
Memory size: 3.5 KiB

Length

Max length: 60
Median length: 39
Mean length: 25.56422
Min length: 1

Characters and Unicode

Total characters: 11146
Distinct characters: 191
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 408
Unique (%): 93.6%

Sample

1st row: 대구광역시 달서구 진천로4길 32 (진천동)
2nd row: 대구광역시 달서구 장산로 30 (용산동)
3rd row: 대구광역시 달서구 월배로11길 42 (대천동)
4th row: 대구광역시 달서구 달구벌대로 1542 (감삼동)
5th row: 대구광역시 달서구 성서공단로55길 45 (장동)
Value | Count | Frequency (%)
대구광역시 | 435 | 19.2%
달서구 | 435 | 19.2%
이곡동 | 52 | 2.3%
월성동 | 37 | 1.6%
상인동 | 30 | 1.3%
진천동 | 26 | 1.1%
호산동 | 26 | 1.1%
월암동 | 22 | 1.0%
신당동 | 21 | 0.9%
달구벌대로 | 20 | 0.9%
Other values (456) | 1166 | 51.4%

Most occurring characters

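Value | Count | Frequency (%)
(space) | 1843 | 16.5%
(not preserved) | 933 | 8.4%
(not preserved) | 578 | 5.2%
(not preserved) | 573 | 5.1%
(not preserved) | 510 | 4.6%
(not preserved) | 463 | 4.2%
(not preserved) | 436 | 3.9%
(not preserved) | 436 | 3.9%
(not preserved) | 435 | 3.9%
(not preserved) | 427 | 3.8%
Other values (181) | 4512 | 40.5%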

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6912 | 62.0%
Space Separator | 1843 | 16.5%
Decimal Number | 1417 | 12.7%
Close Punctuation | 426 | 3.8%
Open Punctuation | 426 | 3.8%
Connector Punctuation | 87 | 0.8%
Dash Punctuation | 23 | 0.2%
Uppercase Letter | 10 | 0.1%
Other Punctuation | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 933 | 13.5%
(not preserved) | 578 | 8.4%
(not preserved) | 573 | 8.3%
(not preserved) | 510 | 7.4%
(not preserved) | 463 | 6.7%
(not preserved) | 436 | 6.3%
(not preserved) | 436 | 6.3%
(not preserved) | 435 | 6.3%
(not preserved) | 427 | 6.2%
(not preserved) | 141 | 2.0%
Other values (157) | 1980 | 28.6%

Decimal Number
Value | Count | Frequency (%)
1 | 320 | 22.6%
2 | 183 | 12.9%
3 | 159 | 11.2%
5 | 142 | 10.0%
4 | 118 | 8.3%
0 | 109 | 7.7%
6 | 104 | 7.3%
9 | 102 | 7.2%
7 | 96 | 6.8%
8 | 84 | 5.9%

Uppercase Letter
Value | Count | Frequency (%)
A | 2 | 20.0%
C | 2 | 20.0%
K | 1 | 10.0%
M | 1 | 10.0%
J | 1 | 10.0%
B | 1 | 10.0%
L | 1 | 10.0%
F | 1 | 10.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1843 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 426 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 426 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 87 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 23 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6912 | 62.0%
Common | 4224 | 37.9%
Latin | 10 | 0.1%

Most frequent character per script

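Hangul
Value | Count | Frequency (%)
(not preserved) | 933 | 13.5%
(not preserved) | 578 | 8.4%
(not preserved) | 573 | 8.3%
(not preserved) | 510 | 7.4%
(not preserved) | 463 | 6.7%
(not preserved) | 436 | 6.3%
(not preserved) | 436 | 6.3%
(not preserved) | 435 | 6.3%
(not preserved) | 427 | 6.2%
(not preserved) | 141 | 2.0%
Other values (157) | 1980 | 28.6%

Common
Value | Count | Frequency (%)
(space) | 1843 | 43.6%
) | 426 | 10.1%
( | 426 | 10.1%
1 | 320 | 7.6%
2 | 183 | 4.3%
3 | 159 | 3.8%
5 | 142 | 3.4%
4 | 118 | 2.8%
0 | 109 | 2.6%
6 | 104 | 2.5%
Other values (6) | 394 | 9.3%

Latin
Value | Count | Frequency (%)
A | 2 | 20.0%
C | 2 | 20.0%
K | 1 | 10.0%
M | 1 | 10.0%
J | 1 | 10.0%
B | 1 | 10.0%
L | 1 | 10.0%
F | 1 | 10.0%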

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6912 | 62.0%
ASCII | 4234 | 38.0%

Most frequent character per block

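ASCII
Value | Count | Frequency (%)
(space) | 1843 | 43.5%
) | 426 | 10.1%
( | 426 | 10.1%
1 | 320 | 7.6%
2 | 183 | 4.3%
3 | 159 | 3.8%
5 | 142 | 3.4%
4 | 118 | 2.8%
0 | 109 | 2.6%
6 | 104 | 2.5%
Other values (14) | 404 | 9.5%

Hangul
Value | Count | Frequency (%)
(not preserved) | 933 | 13.5%
(not preserved) | 578 | 8.4%
(not preserved) | 573 | 8.3%
(not preserved) | 510 | 7.4%
(not preserved) | 463 | 6.7%
(not preserved) | 436 | 6.3%
(not preserved) | 436 | 6.3%
(not preserved) | 435 | 6.3%
(not preserved) | 427 | 6.2%
(not preserved) | 141 | 2.0%
Other values (157) | 1980 | 28.6%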

사업장구분 (establishment type)
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.5 KiB

일반음식점 (general restaurant): 256
집단급식소 (institutional food service facility): 172
휴게음식점 (snack bar or café): 8
<NA>: 1

Length

Max length: 5
Median length: 5
Mean length: 4.9977117
Min length: 4

Unique

Unique: 1
Unique (%): 0.2%

Sample

1st row: 집단급식소
2nd row: 집단급식소
3rd row: 집단급식소
4th row: 집단급식소
5th row: 집단급식소

Common Values

Value | Count | Frequency (%)
일반음식점 | 256 | 58.6%
집단급식소 | 172 | 39.4%
휴게음식점 | 8 | 1.8%
<NA> | 1 | 0.2%

Length

Histogram of lengths of the category

Common Values (Plot)

[Bar chart of the value counts for 사업장구분: 일반음식점 256, 집단급식소 172, 휴게음식점 8, <NA> 1]

전화번호 (phone number)
Text

MISSING 

Distinct: 409
Distinct (%): 97.6%
Missing: 18
Missing (%): 4.1%
Memory size: 3.5 KiB

Length

Max length: 15
Median length: 12
Mean length: 12.011933
Min length: 1

Characters and Unicode

Total characters: 5033
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 401
Unique (%): 95.7%

Sample

1st row: 053-234-2384
2nd row: 053-234-3902
3rd row: 053-234-5251
4th row: 053-269-7000
5th row: 053-382-3311
Value | Count | Frequency (%)
053-585-6210 | 2 | 0.5%
070-4571-7130 | 2 | 0.5%
053-583-0002 | 2 | 0.5%
053-656-8001 | 2 | 0.5%
053-561-0740 | 2 | 0.5%
053-561-0508 | 2 | 0.5%
053-522-7373 | 2 | 0.5%
053-634-9599 | 1 | 0.2%
053-634-2223 | 1 | 0.2%
053-234-2384 | 1 | 0.2%
Other values (398) | 398 | 95.9%

Most occurring characters

Value | Count | Frequency (%)
- | 829 | 16.5%
5 | 817 | 16.2%
0 | 783 | 15.6%
3 | 737 | 14.6%
2 | 356 | 7.1%
6 | 312 | 6.2%
8 | 286 | 5.7%
7 | 235 | 4.7%
4 | 234 | 4.6%
1 | 220 | 4.4%
Other values (2) | 224 | 4.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 4198 | 83.4%
Dash Punctuation | 829 | 16.5%
Space Separator | 6 | 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 817 | 19.5%
0 | 783 | 18.7%
3 | 737 | 17.6%
2 | 356 | 8.5%
6 | 312 | 7.4%
8 | 286 | 6.8%
7 | 235 | 5.6%
4 | 234 | 5.6%
1 | 220 | 5.2%
9 | 218 | 5.2%

Dash Punctuation
Value | Count | Frequency (%)
- | 829 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 6 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 5033 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 829 | 16.5%
5 | 817 | 16.2%
0 | 783 | 15.6%
3 | 737 | 14.6%
2 | 356 | 7.1%
6 | 312 | 6.2%
8 | 286 | 5.7%
7 | 235 | 4.7%
4 | 234 | 4.6%
1 | 220 | 4.4%
Other values (2) | 224 | 4.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 5033 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 829 | 16.5%
5 | 817 | 16.2%
0 | 783 | 15.6%
3 | 737 | 14.6%
2 | 356 | 7.1%
6 | 312 | 6.2%
8 | 286 | 5.7%
7 | 235 | 4.7%
4 | 234 | 4.6%
1 | 220 | 4.4%
Other values (2) | 224 | 4.5%

데이터기준일자 (data reference date)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.5 KiB

2022-09-29: 436
<NA>: 1

Length

Max length: 10
Median length: 10
Mean length: 9.98627
Min length: 4
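
The mean lengths of the two categorical variables can be recomputed from their value counts. The sketch below suggests that "<NA>" is counted here as a literal four-character string rather than as a missing value, which would also be consistent with Min length 4 and Missing 0. This is an inference from the figures, not something the report states; the helper name is illustrative.

```python
def mean_length(value_counts):
    """Mean string length, weighting each value by its row count."""
    total_rows = sum(value_counts.values())
    total_chars = sum(len(value) * n for value, n in value_counts.items())
    return total_chars / total_rows


date_counts = {"2022-09-29": 436, "<NA>": 1}
type_counts = {"일반음식점": 256, "집단급식소": 172, "휴게음식점": 8, "<NA>": 1}

print(round(mean_length(date_counts), 5))  # 9.98627
print(round(mean_length(type_counts), 7))  # 4.9977117
```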

Unique

Unique: 1
Unique (%): 0.2%

Sample

1st row: 2022-09-29
2nd row: 2022-09-29
3rd row: 2022-09-29
4th row: 2022-09-29
5th row: 2022-09-29

Common Values

Value | Count | Frequency (%)
2022-09-29 | 436 | 99.8%
<NA> | 1 | 0.2%

Length

Histogram of lengths of the category

Common Values (Plot)

[Bar chart of the value counts for 데이터기준일자: 2022-09-29 436, <NA> 1]

Correlations

 | 사업장구분
사업장구분 | 1.000

 | 데이터기준일자 | 사업장구분
데이터기준일자 | 1.000 | 1.000
사업장구분 | 1.000 | 1.000

 | 사업장구분 | 데이터기준일자
사업장구분 | 1.000 | 1.000
데이터기준일자 | 1.000 | 1.000

Missing values

Count: a simple visualization of nullity by column.
Matrix: the nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
Heatmap: the correlation heatmap measures nullity correlation, that is, how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

 | 사업장명 | 도로명주소 | 사업장구분 | 전화번호 | 데이터기준일자
0 | 진월초등학교 | 대구광역시 달서구 진천로4길 32 (진천동) | 집단급식소 | 053-234-2384 | 2022-09-29
1 | 장산초등학교 | 대구광역시 달서구 장산로 30 (용산동) | 집단급식소 | 053-234-3902 | 2022-09-29
2 | 대구한샘초등학교 | 대구광역시 달서구 월배로11길 42 (대천동) | 집단급식소 | 053-234-5251 | 2022-09-29
3 | 열린아동병원 | 대구광역시 달서구 달구벌대로 1542 (감삼동) | 집단급식소 | 053-269-7000 | 2022-09-29
4 | (주)후레쉬케터링[효성대구공장] | 대구광역시 달서구 성서공단로55길 45 (장동) | 집단급식소 | 053-382-3311 | 2022-09-29
5 | 황장군본리점 | 대구광역시 달서구 와룡로 110 (본리동) | 일반음식점 | 053-523-0335 | 2022-09-29
6 | 대밭골생오리 | 대구광역시 달서구 와룡로33길 7-2 (감삼동) | 일반음식점 | 053-522-9252 | 2022-09-29
7 | 장동초등학교 | 대구광역시 달서구 달구벌대로304길 112 (장기동) | 집단급식소 | 053-550-5576 | 2022-09-29
8 | 대구전자공업고등학교 | 대구광역시 달서구 용산로 113 (장기동) | 집단급식소 | 053-551-1480 | 2022-09-29
9 | 국빈식당 | 대구광역시 달서구 평리로 84 (죽전동) | 일반음식점 | 053-559-9800 | 2022-09-29

Last rows

 | 사업장명 | 도로명주소 | 사업장구분 | 전화번호 | 데이터기준일자
427 | (주)아워홈 덴티스대구점(성서공장) | 대구광역시 달서구 성서서로 99 (월암동) | 집단급식소 | 053-583-0788 | 2022-09-29
428 | 푸디스트(주) 삼보모터스점 | 대구광역시 달서구 성서동로 142 (월암동) | 집단급식소 | 053-582-9230 | 2022-09-29
429 | 주식회사 페스티발푸드 | 대구광역시 달서구 성서로71길 43 (갈산동) | 집단급식소 | <NA> | 2022-09-29
430 | 명륜진사갈비 대곡점 | 대구광역시 달서구 갈밭남로 39_ 중앙메디컬빌딩 (대곡동) | 일반음식점 |  | 2022-09-29
431 | 나눔 | 대구광역시 달서구 조암로6길 60 (월성동) | 일반음식점 |  | 2022-09-29
432 | 우즈베이커리 대곡점 | 대구광역시 달서구 갈밭로 76_ 1층 (대곡동) | 일반음식점 | 053-634-9700 | 2022-09-29
433 | 뵤뵤만두(본동) | 대구광역시 달서구 학산로7길 63 (본동) | 일반음식점 | 0507-1308-3395 | 2022-09-29
434 | 푸디스트주식회사삼익티에이치케이본사공장점 | 대구광역시 달서구 성서동로 163 (월암동) | 집단급식소 |  | 2022-09-29
435 | 주식회사 규빈 | 대구광역시 달서구 성서공단로22안길 18 (월암동) | 집단급식소 |  | 2022-09-29
436 | <NA> | <NA> | <NA> | <NA> | <NA>