Overview

Dataset statistics

Number of variables5
Number of observations258
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.4%
Total size in memory10.2 KiB
Average record size in memory40.5 B

Variable types

Categorical2
Text3

Dataset

Description서산시의 음식물류 폐기물 다량배출사업장에 대한 데이터입니다. 항목명은 음식점 구분, 상호, 주소, 연락처로 구성되어 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=307&beforeMenuCd=DOM_000000201001001000&publicdatapk=15094309

Alerts

데이터기준일 has constant value ""Constant
Dataset has 1 (0.4%) duplicate rowsDuplicates

Reproduction

Analysis started2024-01-09 22:08:29.316232
Analysis finished2024-01-09 22:08:29.720942
Duration0.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct3
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
일반음식점
184 
집단급식소
73 
휴게음식점
 
1

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row휴게음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 184
71.3%
집단급식소 73
 
28.3%
휴게음식점 1
 
0.4%

Length

2024-01-10T07:08:29.769540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:08:29.840206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 184
71.3%
집단급식소 73
 
28.3%
휴게음식점 1
 
0.4%

상호
Text

Distinct257
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2024-01-10T07:08:29.986109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length20
Mean length7.3837209
Min length2

Characters and Unicode

Total characters1905
Distinct characters347
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique256 ?
Unique (%)99.2%

Sample

1st row맥도날드서산점
2nd row중앙병원장례식장
3rd row큰마당
4th row성심가든
5th row천수만회타운
ValueCountFrequency (%)
서산점 12
 
3.4%
주)현대그린푸드 4
 
1.1%
주)아워홈 4
 
1.1%
서산 4
 
1.1%
현대파워텍 3
 
0.9%
현대트랜시스 3
 
0.9%
주)동원홈푸드 3
 
0.9%
서산호수공원점 3
 
0.9%
한식 2
 
0.6%
푸디스트 2
 
0.6%
Other values (305) 310
88.6%
2024-01-10T07:08:30.269917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
103
 
5.4%
79
 
4.1%
77
 
4.0%
47
 
2.5%
45
 
2.4%
45
 
2.4%
34
 
1.8%
33
 
1.7%
( 32
 
1.7%
) 32
 
1.7%
Other values (337) 1378
72.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1701
89.3%
Space Separator 103
 
5.4%
Open Punctuation 32
 
1.7%
Close Punctuation 32
 
1.7%
Decimal Number 24
 
1.3%
Uppercase Letter 9
 
0.5%
Other Punctuation 2
 
0.1%
Connector Punctuation 1
 
0.1%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
79
 
4.6%
77
 
4.5%
47
 
2.8%
45
 
2.6%
45
 
2.6%
34
 
2.0%
33
 
1.9%
30
 
1.8%
28
 
1.6%
28
 
1.6%
Other values (316) 1255
73.8%
Decimal Number
ValueCountFrequency (%)
1 5
20.8%
0 4
16.7%
2 4
16.7%
4 3
12.5%
8 2
 
8.3%
3 2
 
8.3%
9 1
 
4.2%
7 1
 
4.2%
5 1
 
4.2%
6 1
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
C 5
55.6%
B 1
 
11.1%
H 1
 
11.1%
K 1
 
11.1%
D 1
 
11.1%
Space Separator
ValueCountFrequency (%)
103
100.0%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1702
89.3%
Common 194
 
10.2%
Latin 9
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
79
 
4.6%
77
 
4.5%
47
 
2.8%
45
 
2.6%
45
 
2.6%
34
 
2.0%
33
 
1.9%
30
 
1.8%
28
 
1.6%
28
 
1.6%
Other values (317) 1256
73.8%
Common
ValueCountFrequency (%)
103
53.1%
( 32
 
16.5%
) 32
 
16.5%
1 5
 
2.6%
0 4
 
2.1%
2 4
 
2.1%
4 3
 
1.5%
8 2
 
1.0%
3 2
 
1.0%
/ 2
 
1.0%
Other values (5) 5
 
2.6%
Latin
ValueCountFrequency (%)
C 5
55.6%
B 1
 
11.1%
H 1
 
11.1%
K 1
 
11.1%
D 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1701
89.3%
ASCII 203
 
10.7%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
103
50.7%
( 32
 
15.8%
) 32
 
15.8%
1 5
 
2.5%
C 5
 
2.5%
0 4
 
2.0%
2 4
 
2.0%
4 3
 
1.5%
8 2
 
1.0%
3 2
 
1.0%
Other values (10) 11
 
5.4%
Hangul
ValueCountFrequency (%)
79
 
4.6%
77
 
4.5%
47
 
2.8%
45
 
2.6%
45
 
2.6%
34
 
2.0%
33
 
1.9%
30
 
1.8%
28
 
1.6%
28
 
1.6%
Other values (316) 1255
73.8%
None
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct256
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2024-01-10T07:08:30.512704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length35.5
Mean length23.833333
Min length15

Characters and Unicode

Total characters6149
Distinct characters205
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique254 ?
Unique (%)98.4%

Sample

1st row충청남도 서산시 충의로 3 (예천동)
2nd row충청남도 서산시 수석산업로 5 (수석동), 지하2층
3rd row충청남도 서산시 시장2로 20 (동문동), 지하1층
4th row충청남도 서산시 운산면 운암로 1071-42
5th row충청남도 서산시 부석면 천수만로 602
ValueCountFrequency (%)
서산시 258
 
18.7%
충청남도 257
 
18.7%
대산읍 38
 
2.8%
동문동 37
 
2.7%
1층 31
 
2.2%
읍내동 24
 
1.7%
예천동 22
 
1.6%
2층 20
 
1.5%
성연면 18
 
1.3%
지곡면 11
 
0.8%
Other values (387) 662
48.0%
2024-01-10T07:08:30.877479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1122
18.2%
328
 
5.3%
281
 
4.6%
276
 
4.5%
265
 
4.3%
260
 
4.2%
260
 
4.2%
1 259
 
4.2%
257
 
4.2%
214
 
3.5%
Other values (195) 2627
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3674
59.7%
Space Separator 1122
 
18.2%
Decimal Number 956
 
15.5%
Open Punctuation 121
 
2.0%
Close Punctuation 121
 
2.0%
Other Punctuation 74
 
1.2%
Dash Punctuation 66
 
1.1%
Connector Punctuation 8
 
0.1%
Uppercase Letter 4
 
0.1%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
328
 
8.9%
281
 
7.6%
276
 
7.5%
265
 
7.2%
260
 
7.1%
260
 
7.1%
257
 
7.0%
214
 
5.8%
165
 
4.5%
85
 
2.3%
Other values (174) 1283
34.9%
Decimal Number
ValueCountFrequency (%)
1 259
27.1%
2 161
16.8%
3 110
11.5%
4 82
 
8.6%
5 72
 
7.5%
6 67
 
7.0%
7 64
 
6.7%
9 56
 
5.9%
8 43
 
4.5%
0 42
 
4.4%
Uppercase Letter
ValueCountFrequency (%)
K 1
25.0%
C 1
25.0%
V 1
25.0%
T 1
25.0%
Space Separator
ValueCountFrequency (%)
1122
100.0%
Open Punctuation
ValueCountFrequency (%)
( 121
100.0%
Close Punctuation
ValueCountFrequency (%)
) 121
100.0%
Other Punctuation
ValueCountFrequency (%)
, 74
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 66
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3674
59.7%
Common 2471
40.2%
Latin 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
328
 
8.9%
281
 
7.6%
276
 
7.5%
265
 
7.2%
260
 
7.1%
260
 
7.1%
257
 
7.0%
214
 
5.8%
165
 
4.5%
85
 
2.3%
Other values (174) 1283
34.9%
Common
ValueCountFrequency (%)
1122
45.4%
1 259
 
10.5%
2 161
 
6.5%
( 121
 
4.9%
) 121
 
4.9%
3 110
 
4.5%
4 82
 
3.3%
, 74
 
3.0%
5 72
 
2.9%
6 67
 
2.7%
Other values (7) 282
 
11.4%
Latin
ValueCountFrequency (%)
K 1
25.0%
C 1
25.0%
V 1
25.0%
T 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3674
59.7%
ASCII 2475
40.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1122
45.3%
1 259
 
10.5%
2 161
 
6.5%
( 121
 
4.9%
) 121
 
4.9%
3 110
 
4.4%
4 82
 
3.3%
, 74
 
3.0%
5 72
 
2.9%
6 67
 
2.7%
Other values (11) 286
 
11.6%
Hangul
ValueCountFrequency (%)
328
 
8.9%
281
 
7.6%
276
 
7.5%
265
 
7.2%
260
 
7.1%
260
 
7.1%
257
 
7.0%
214
 
5.8%
165
 
4.5%
85
 
2.3%
Other values (174) 1283
34.9%
Distinct244
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2024-01-10T07:08:31.106333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.94186
Min length9

Characters and Unicode

Total characters3081
Distinct characters20
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique242 ?
Unique (%)93.8%

Sample

1st row070-7209-0548
2nd row041-669-1414
3rd row041-669-9755
4th row041-662-0063
5th row041-664-4800
ValueCountFrequency (%)
개인 14
 
4.8%
휴대전화번호 14
 
4.8%
041 6
 
2.1%
041-663 5
 
1.7%
041-669 2
 
0.7%
041-664-3141 2
 
0.7%
041-666 2
 
0.7%
041-665 2
 
0.7%
041-667 2
 
0.7%
9292 2
 
0.7%
Other values (241) 241
82.5%
2024-01-10T07:08:31.461699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 554
18.0%
- 487
15.8%
0 414
13.4%
1 362
11.7%
4 354
11.5%
8 148
 
4.8%
2 134
 
4.3%
9 132
 
4.3%
5 123
 
4.0%
7 111
 
3.6%
Other values (10) 262
8.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2440
79.2%
Dash Punctuation 487
 
15.8%
Other Letter 112
 
3.6%
Space Separator 42
 
1.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 554
22.7%
0 414
17.0%
1 362
14.8%
4 354
14.5%
8 148
 
6.1%
2 134
 
5.5%
9 132
 
5.4%
5 123
 
5.0%
7 111
 
4.5%
3 108
 
4.4%
Other Letter
ValueCountFrequency (%)
14
12.5%
14
12.5%
14
12.5%
14
12.5%
14
12.5%
14
12.5%
14
12.5%
14
12.5%
Dash Punctuation
ValueCountFrequency (%)
- 487
100.0%
Space Separator
ValueCountFrequency (%)
42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2969
96.4%
Hangul 112
 
3.6%

Most frequent character per script

Common
ValueCountFrequency (%)
6 554
18.7%
- 487
16.4%
0 414
13.9%
1 362
12.2%
4 354
11.9%
8 148
 
5.0%
2 134
 
4.5%
9 132
 
4.4%
5 123
 
4.1%
7 111
 
3.7%
Other values (2) 150
 
5.1%
Hangul
ValueCountFrequency (%)
14
12.5%
14
12.5%
14
12.5%
14
12.5%
14
12.5%
14
12.5%
14
12.5%
14
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2969
96.4%
Hangul 112
 
3.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 554
18.7%
- 487
16.4%
0 414
13.9%
1 362
12.2%
4 354
11.9%
8 148
 
5.0%
2 134
 
4.5%
9 132
 
4.4%
5 123
 
4.1%
7 111
 
3.7%
Other values (2) 150
 
5.1%
Hangul
ValueCountFrequency (%)
14
12.5%
14
12.5%
14
12.5%
14
12.5%
14
12.5%
14
12.5%
14
12.5%
14
12.5%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2021-11-10
258 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-11-10
2nd row2021-11-10
3rd row2021-11-10
4th row2021-11-10
5th row2021-11-10

Common Values

ValueCountFrequency (%)
2021-11-10 258
100.0%

Length

2024-01-10T07:08:31.567146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:08:31.634016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-11-10 258
100.0%

Missing values

2024-01-10T07:08:29.623170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:08:29.692016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분상호주소연락처데이터기준일
0휴게음식점맥도날드서산점충청남도 서산시 충의로 3 (예천동)070-7209-05482021-11-10
1일반음식점중앙병원장례식장충청남도 서산시 수석산업로 5 (수석동), 지하2층041-669-14142021-11-10
2일반음식점큰마당충청남도 서산시 시장2로 20 (동문동), 지하1층041-669-97552021-11-10
3일반음식점성심가든충청남도 서산시 운산면 운암로 1071-42041-662-00632021-11-10
4일반음식점천수만회타운충청남도 서산시 부석면 천수만로 602041-664-48002021-11-10
5일반음식점가야관충청남도 서산시 명륜3길 24-35, 1층041-667-66812021-11-10
6일반음식점미담충청남도 서산시 대산읍 삼길포7로 37041-666-67002021-11-10
7일반음식점서산잠실감자탕충청남도 서산시 호수공원4로 3 (읍내동)041-666-00072021-11-10
8일반음식점아구촌충청남도 서산시 대산읍 충의로 1962041-681-66552021-11-10
9일반음식점서산 한우프라자충청남도 서산시 석지2길 14, 2~3층041-665-90062021-11-10
구분상호주소연락처데이터기준일
248집단급식소(재)서해안청소년수련원충청남도 서산시 운산면 봉운로 951-125041-669-91002021-11-10
249집단급식소동암초등학교충청남도 서산시 음암면 동암마을길 14-3041-663-22532021-11-10
250집단급식소부석초등학교충청남도 서산시 부석면 취평2길 37-6041-664-86012021-11-10
251집단급식소서산서남초등학교충청남도 서산시 예천2로 19, 서산서남초등학교 (예천동)041-666-70172021-11-10
252집단급식소다미식품(주식회사 우진)충청남도 서산시 음암면 노루골길 77-36, 2층041-689-95002021-11-10
253집단급식소(주)대우건설 서산푸르지오더센트럴충청남도 서산시 나무장2길 31 (예천동)개인 휴대전화번호2021-11-10
254집단급식소(주)우미건설 서산테크노밸리 우미린 신축현장 식당충청남도 서산시 성연면 성연3로 57-30 (고운라피네 서산테크노밸리)개인 휴대전화번호2021-11-10
255집단급식소육군 제2162부대세종특별자치시 조치원읍 새내12길 3, 조치원우체국개인 휴대전화번호2021-11-10
256집단급식소본우리집밥 신우에프에스점충청남도 서산시 고북면 고북1로 343-22, 2층개인 휴대전화번호2021-11-10
257집단급식소(주)아워홈 광성강관공업 서산점충청남도 서산시 성연면 해성산업로 148-36041-666-36042021-11-10

Duplicate rows

Most frequently occurring

구분상호주소연락처데이터기준일# duplicates
0일반음식점서산청와대충청남도 서산시 음암면 진동길 192-23041-664-31412021-11-102