Overview

Dataset statistics

Number of variables7
Number of observations109
Missing cells44
Missing cells (%)5.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.1 KiB
Average record size in memory57.2 B

Variable types

Categorical3
Text4

Dataset

Description음식물류를 다량 배출하는 사업장의 정보가 포함되어 있습니다. 업소명, 주소, 번호와 음식물류 배출량 등의 데이터가 있습니다.
Author대구광역시 서구
URLhttps://www.data.go.kr/data/15088625/fileData.do

Alerts

수거형태 has constant value ""Constant
데이터기준일 has constant value ""Constant
전화번호 has 37 (33.9%) missing valuesMissing
월배출량(킬로그램) has 7 (6.4%) missing valuesMissing

Reproduction

Analysis started2024-03-14 14:25:16.008717
Analysis finished2024-03-14 14:25:17.727710
Duration1.72 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct3
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size1000.0 B
일반음식점
60 
집단급식소
44 
휴게음식점
 
5

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row집단급식소
2nd row집단급식소
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 60
55.0%
집단급식소 44
40.4%
휴게음식점 5
 
4.6%

Length

2024-03-14T23:25:17.855331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T23:25:18.086023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 60
55.0%
집단급식소 44
40.4%
휴게음식점 5
 
4.6%
Distinct108
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size1000.0 B
2024-03-14T23:25:18.925371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length7.0550459
Min length2

Characters and Unicode

Total characters769
Distinct characters233
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique107 ?
Unique (%)98.2%

Sample

1st row곽호순병원
2nd row두류초등학교
3rd row손복자할매낙지
4th row거너실 흑태찜 전문점
5th row남강장어
ValueCountFrequency (%)
평리점 3
 
2.2%
경대요양병원 2
 
1.5%
무한리필 2
 
1.5%
감자탕 2
 
1.5%
교동면옥(내당점 1
 
0.7%
산록식당 1
 
0.7%
남다른 1
 
0.7%
북비산 1
 
0.7%
고기굽는 1
 
0.7%
롯데리아 1
 
0.7%
Other values (121) 121
89.0%
2024-03-14T23:25:20.033327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27
 
3.5%
23
 
3.0%
22
 
2.9%
22
 
2.9%
20
 
2.6%
19
 
2.5%
19
 
2.5%
17
 
2.2%
17
 
2.2%
16
 
2.1%
Other values (223) 567
73.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 711
92.5%
Space Separator 27
 
3.5%
Other Symbol 9
 
1.2%
Close Punctuation 7
 
0.9%
Open Punctuation 7
 
0.9%
Uppercase Letter 5
 
0.7%
Decimal Number 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
3.2%
22
 
3.1%
22
 
3.1%
20
 
2.8%
19
 
2.7%
19
 
2.7%
17
 
2.4%
17
 
2.4%
16
 
2.3%
14
 
2.0%
Other values (212) 522
73.4%
Uppercase Letter
ValueCountFrequency (%)
K 2
40.0%
C 1
20.0%
F 1
20.0%
S 1
20.0%
Decimal Number
ValueCountFrequency (%)
5 1
33.3%
6 1
33.3%
3 1
33.3%
Space Separator
ValueCountFrequency (%)
27
100.0%
Other Symbol
ValueCountFrequency (%)
9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 720
93.6%
Common 44
 
5.7%
Latin 5
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
3.2%
22
 
3.1%
22
 
3.1%
20
 
2.8%
19
 
2.6%
19
 
2.6%
17
 
2.4%
17
 
2.4%
16
 
2.2%
14
 
1.9%
Other values (213) 531
73.8%
Common
ValueCountFrequency (%)
27
61.4%
) 7
 
15.9%
( 7
 
15.9%
5 1
 
2.3%
6 1
 
2.3%
3 1
 
2.3%
Latin
ValueCountFrequency (%)
K 2
40.0%
C 1
20.0%
F 1
20.0%
S 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 711
92.5%
ASCII 49
 
6.4%
None 9
 
1.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
27
55.1%
) 7
 
14.3%
( 7
 
14.3%
K 2
 
4.1%
5 1
 
2.0%
6 1
 
2.0%
3 1
 
2.0%
C 1
 
2.0%
F 1
 
2.0%
S 1
 
2.0%
Hangul
ValueCountFrequency (%)
23
 
3.2%
22
 
3.1%
22
 
3.1%
20
 
2.8%
19
 
2.7%
19
 
2.7%
17
 
2.4%
17
 
2.4%
16
 
2.3%
14
 
2.0%
Other values (212) 522
73.4%
None
ValueCountFrequency (%)
9
100.0%
Distinct100
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Memory size1000.0 B
2024-03-14T23:25:21.093831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length20
Mean length17.522936
Min length15

Characters and Unicode

Total characters1910
Distinct characters53
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique91 ?
Unique (%)83.5%

Sample

1st row대구광역시 서구 통학로 31
2nd row대구광역시 서구 달구벌대로357길 22
3rd row대구광역시 서구 달구벌대로 1789
4th row대구광역시 서구 서대구로 36
5th row대구광역시 서구 서대구로 54
ValueCountFrequency (%)
대구광역시 109
27.5%
서구 109
27.5%
서대구로 24
 
6.1%
달구벌대로 9
 
2.3%
와룡로 4
 
1.0%
북비산로 4
 
1.0%
35 3
 
0.8%
통학로 3
 
0.8%
달서천로 3
 
0.8%
달서로 3
 
0.8%
Other values (110) 125
31.6%
2024-03-14T23:25:22.397861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
288
15.1%
270
14.1%
160
 
8.4%
156
 
8.2%
109
 
5.7%
109
 
5.7%
109
 
5.7%
109
 
5.7%
1 66
 
3.5%
3 53
 
2.8%
Other values (43) 481
25.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1245
65.2%
Decimal Number 357
 
18.7%
Space Separator 288
 
15.1%
Dash Punctuation 12
 
0.6%
Other Punctuation 4
 
0.2%
Open Punctuation 2
 
0.1%
Close Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
270
21.7%
160
12.9%
156
12.5%
109
8.8%
109
8.8%
109
8.8%
109
8.8%
28
 
2.2%
27
 
2.2%
18
 
1.4%
Other values (27) 150
12.0%
Decimal Number
ValueCountFrequency (%)
1 66
18.5%
3 53
14.8%
2 45
12.6%
7 38
10.6%
5 34
9.5%
6 32
9.0%
0 24
 
6.7%
9 24
 
6.7%
4 21
 
5.9%
8 20
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 3
75.0%
. 1
 
25.0%
Space Separator
ValueCountFrequency (%)
288
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1245
65.2%
Common 665
34.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
270
21.7%
160
12.9%
156
12.5%
109
8.8%
109
8.8%
109
8.8%
109
8.8%
28
 
2.2%
27
 
2.2%
18
 
1.4%
Other values (27) 150
12.0%
Common
ValueCountFrequency (%)
288
43.3%
1 66
 
9.9%
3 53
 
8.0%
2 45
 
6.8%
7 38
 
5.7%
5 34
 
5.1%
6 32
 
4.8%
0 24
 
3.6%
9 24
 
3.6%
4 21
 
3.2%
Other values (6) 40
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1245
65.2%
ASCII 665
34.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
288
43.3%
1 66
 
9.9%
3 53
 
8.0%
2 45
 
6.8%
7 38
 
5.7%
5 34
 
5.1%
6 32
 
4.8%
0 24
 
3.6%
9 24
 
3.6%
4 21
 
3.2%
Other values (6) 40
 
6.0%
Hangul
ValueCountFrequency (%)
270
21.7%
160
12.9%
156
12.5%
109
8.8%
109
8.8%
109
8.8%
109
8.8%
28
 
2.2%
27
 
2.2%
18
 
1.4%
Other values (27) 150
12.0%

전화번호
Text

MISSING 

Distinct71
Distinct (%)98.6%
Missing37
Missing (%)33.9%
Memory size1000.0 B
2024-03-14T23:25:23.220240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.013889
Min length12

Characters and Unicode

Total characters865
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)97.2%

Sample

1st row053-572-7770
2nd row053-233-2714
3rd row053-554-9475
4th row053-571-9595
5th row053-522-0770
ValueCountFrequency (%)
053-233-2832 2
 
2.8%
053-571-1003 1
 
1.4%
053-572-7770 1
 
1.4%
053-522-3232 1
 
1.4%
053-656-2119 1
 
1.4%
053-233-2380 1
 
1.4%
053-567-9777 1
 
1.4%
053-233-1606 1
 
1.4%
070-7209-1666 1
 
1.4%
053-555-5588 1
 
1.4%
Other values (61) 61
84.7%
2024-03-14T23:25:24.242202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 167
19.3%
3 153
17.7%
- 144
16.6%
0 128
14.8%
2 71
8.2%
1 42
 
4.9%
6 42
 
4.9%
7 40
 
4.6%
8 32
 
3.7%
9 26
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 721
83.4%
Dash Punctuation 144
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 167
23.2%
3 153
21.2%
0 128
17.8%
2 71
9.8%
1 42
 
5.8%
6 42
 
5.8%
7 40
 
5.5%
8 32
 
4.4%
9 26
 
3.6%
4 20
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 144
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 865
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 167
19.3%
3 153
17.7%
- 144
16.6%
0 128
14.8%
2 71
8.2%
1 42
 
4.9%
6 42
 
4.9%
7 40
 
4.6%
8 32
 
3.7%
9 26
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 865
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 167
19.3%
3 153
17.7%
- 144
16.6%
0 128
14.8%
2 71
8.2%
1 42
 
4.9%
6 42
 
4.9%
7 40
 
4.6%
8 32
 
3.7%
9 26
 
3.0%

수거형태
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1000.0 B
환경업체 위탁
109 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row환경업체 위탁
2nd row환경업체 위탁
3rd row환경업체 위탁
4th row환경업체 위탁
5th row환경업체 위탁

Common Values

ValueCountFrequency (%)
환경업체 위탁 109
100.0%

Length

2024-03-14T23:25:24.470058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T23:25:24.628142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
환경업체 109
50.0%
위탁 109
50.0%
Distinct72
Distinct (%)70.6%
Missing7
Missing (%)6.4%
Memory size1000.0 B
2024-03-14T23:25:25.371939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.5392157
Min length2

Characters and Unicode

Total characters361
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)55.9%

Sample

1st row2000
2nd row257
3rd row300
4th row3000
5th row1500
ValueCountFrequency (%)
감량기 9
 
8.8%
1000 5
 
4.9%
3000 4
 
3.9%
600 3
 
2.9%
800 3
 
2.9%
300 3
 
2.9%
340 2
 
2.0%
550 2
 
2.0%
1350 2
 
2.0%
230 2
 
2.0%
Other values (62) 67
65.7%
2024-03-14T23:25:26.427303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 136
37.7%
1 38
 
10.5%
5 32
 
8.9%
3 24
 
6.6%
2 23
 
6.4%
8 19
 
5.3%
7 17
 
4.7%
4 15
 
4.2%
6 14
 
3.9%
9 11
 
3.0%
Other values (4) 32
 
8.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 329
91.1%
Other Letter 27
 
7.5%
Space Separator 5
 
1.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 136
41.3%
1 38
 
11.6%
5 32
 
9.7%
3 24
 
7.3%
2 23
 
7.0%
8 19
 
5.8%
7 17
 
5.2%
4 15
 
4.6%
6 14
 
4.3%
9 11
 
3.3%
Other Letter
ValueCountFrequency (%)
9
33.3%
9
33.3%
9
33.3%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 334
92.5%
Hangul 27
 
7.5%

Most frequent character per script

Common
ValueCountFrequency (%)
0 136
40.7%
1 38
 
11.4%
5 32
 
9.6%
3 24
 
7.2%
2 23
 
6.9%
8 19
 
5.7%
7 17
 
5.1%
4 15
 
4.5%
6 14
 
4.2%
9 11
 
3.3%
Hangul
ValueCountFrequency (%)
9
33.3%
9
33.3%
9
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 334
92.5%
Hangul 27
 
7.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 136
40.7%
1 38
 
11.4%
5 32
 
9.6%
3 24
 
7.2%
2 23
 
6.9%
8 19
 
5.7%
7 17
 
5.1%
4 15
 
4.5%
6 14
 
4.2%
9 11
 
3.3%
Hangul
ValueCountFrequency (%)
9
33.3%
9
33.3%
9
33.3%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1000.0 B
2024-02-14
109 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-02-14
2nd row2024-02-14
3rd row2024-02-14
4th row2024-02-14
5th row2024-02-14

Common Values

ValueCountFrequency (%)
2024-02-14 109
100.0%

Length

2024-03-14T23:25:26.651228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T23:25:26.813439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-02-14 109
100.0%

Correlations

2024-03-14T23:25:26.912144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종소재지도로명주소전화번호월배출량(킬로그램)
업종1.0000.8570.8830.842
소재지도로명주소0.8571.0001.0000.984
전화번호0.8831.0001.0000.995
월배출량(킬로그램)0.8420.9840.9951.000

Missing values

2024-03-14T23:25:16.889354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T23:25:17.280123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T23:25:17.625844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업종업소명소재지도로명주소전화번호수거형태월배출량(킬로그램)데이터기준일
0집단급식소곽호순병원대구광역시 서구 통학로 31053-572-7770환경업체 위탁20002024-02-14
1집단급식소두류초등학교대구광역시 서구 달구벌대로357길 22053-233-2714환경업체 위탁2572024-02-14
2일반음식점손복자할매낙지대구광역시 서구 달구벌대로 1789053-554-9475환경업체 위탁3002024-02-14
3일반음식점거너실 흑태찜 전문점대구광역시 서구 서대구로 36053-571-9595환경업체 위탁30002024-02-14
4일반음식점남강장어대구광역시 서구 서대구로 54053-522-0770환경업체 위탁15002024-02-14
5일반음식점뼈큰 감자탕 내당점대구광역시 서구 달구벌대로371길35053-233-2832환경업체 위탁10002024-02-14
6집단급식소내서초등학교대구광역시 서구 달구벌대로 1877053-552-1119환경업체 위탁13502024-02-14
7집단급식소열린큰병원대구광역시 서구 달서로 35053-555-0660환경업체 위탁10002024-02-14
8집단급식소경운초등학교대구광역시 서구 평리로54길16053-233-1707환경업체 위탁17002024-02-14
9일반음식점어등대구광역시 서구 서대구로 39053-525-0043환경업체 위탁20802024-02-14
업종업소명소재지도로명주소전화번호수거형태월배출량(킬로그램)데이터기준일
99집단급식소연세요양병원대구광역시 서구 북비산로 156<NA>환경업체 위탁감량기2024-02-14
100집단급식소팔달요양병원대구광역시 서구 팔달로 152<NA>환경업체 위탁감량기2024-02-14
101집단급식소경대요양병원대구광역시 서구 국채보상로223<NA>환경업체 위탁감량기2024-02-14
102일반음식점㈜다함푸드대구광역시 서구 국채보상로6길12-12<NA>환경업체 위탁<NA>2024-02-14
103일반음식점쿠우쿠우 대구서구점대구광역시 서구와룡로307<NA>환경업체 위탁<NA>2024-02-14
104일반음식점다담뜰한식뷔페서구내당점대구광역시 서구 달구벌대로1783,1층<NA>환경업체 위탁<NA>2024-02-14
105일반음식점㈜이엠에스대구광역시 서구 문화로23길16<NA>환경업체 위탁<NA>2024-02-14
106일반음식점이삭푸드서비스대구광역시 서구 달서천로92.7층<NA>환경업체 위탁<NA>2024-02-14
107일반음식점㈜은성푸드 서대구역점대구광역시 서구 와룡로 527, 4층<NA>환경업체 위탁<NA>2024-02-14
108일반음식점디프트 카페테리아대구광역시 서구 와룡로 307, 2층<NA>환경업체 위탁<NA>2024-02-14