Overview

Dataset statistics

Number of variables5
Number of observations1703
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory66.7 KiB
Average record size in memory40.1 B

Variable types

Categorical2
Text3

Dataset

Description서천군 관내 식품위생업소 현황으로 문화관광홈페이지를 통해서 맛집 등의 정보를 제공하고 있습니다(업종별, 업소명, 소재지 안내)
URLhttps://www.data.go.kr/data/3065992/fileData.do

Alerts

데이터기준일 has constant value ""Constant
Dataset has 1 (0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 20:12:12.179451
Analysis finished2023-12-12 20:12:13.077352
Duration0.9 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct18
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
일반음식점
781 
즉석판매제조가공업
233 
휴게음식점
187 
식품제조가공업
105 
건강기능식품일반판매업
 
71
Other values (13)
326 

Length

Max length11
Median length5
Mean length6.2190252
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 781
45.9%
즉석판매제조가공업 233
 
13.7%
휴게음식점 187
 
11.0%
식품제조가공업 105
 
6.2%
건강기능식품일반판매업 71
 
4.2%
집단급식소 70
 
4.1%
식품자동판매기영업 55
 
3.2%
식품소분업 44
 
2.6%
유통전문판매업 37
 
2.2%
유흥주점영업 29
 
1.7%
Other values (8) 91
 
5.3%

Length

2023-12-13T05:12:13.175857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반음식점 781
45.3%
즉석판매제조가공업 233
 
13.5%
휴게음식점 187
 
10.8%
식품제조가공업 105
 
6.1%
집단급식소 92
 
5.3%
건강기능식품일반판매업 71
 
4.1%
식품자동판매기영업 55
 
3.2%
식품소분업 44
 
2.6%
유통전문판매업 37
 
2.1%
유흥주점영업 29
 
1.7%
Other values (8) 91
 
5.3%
Distinct1607
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
2023-12-13T05:12:13.550789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length22
Mean length6.392249
Min length2

Characters and Unicode

Total characters10886
Distinct characters653
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1529 ?
Unique (%)89.8%

Sample

1st row신흥회관
2nd row삼거리식당
3rd row한미옥
4th row우이식당
5th row어울림
ValueCountFrequency (%)
주식회사 33
 
1.5%
서천점 21
 
1.0%
세븐일레븐 16
 
0.7%
장항점 15
 
0.7%
카페 13
 
0.6%
농업회사법인 12
 
0.6%
씨유 11
 
0.5%
gs25 11
 
0.5%
주)우양 9
 
0.4%
서천 7
 
0.3%
Other values (1778) 2026
93.2%
2023-12-13T05:12:14.186983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
473
 
4.3%
260
 
2.4%
238
 
2.2%
234
 
2.1%
227
 
2.1%
220
 
2.0%
177
 
1.6%
160
 
1.5%
144
 
1.3%
137
 
1.3%
Other values (643) 8616
79.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9792
90.0%
Space Separator 473
 
4.3%
Uppercase Letter 162
 
1.5%
Open Punctuation 113
 
1.0%
Close Punctuation 113
 
1.0%
Decimal Number 111
 
1.0%
Lowercase Letter 89
 
0.8%
Other Punctuation 27
 
0.2%
Dash Punctuation 4
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
260
 
2.7%
238
 
2.4%
234
 
2.4%
227
 
2.3%
220
 
2.2%
177
 
1.8%
160
 
1.6%
144
 
1.5%
137
 
1.4%
131
 
1.3%
Other values (581) 7864
80.3%
Uppercase Letter
ValueCountFrequency (%)
S 23
14.2%
G 20
12.3%
C 17
10.5%
O 13
 
8.0%
E 12
 
7.4%
U 11
 
6.8%
N 9
 
5.6%
A 9
 
5.6%
M 7
 
4.3%
F 6
 
3.7%
Other values (12) 35
21.6%
Lowercase Letter
ValueCountFrequency (%)
o 14
15.7%
e 13
14.6%
f 11
12.4%
c 8
9.0%
a 6
 
6.7%
l 5
 
5.6%
w 5
 
5.6%
s 4
 
4.5%
h 4
 
4.5%
i 3
 
3.4%
Other values (9) 16
18.0%
Decimal Number
ValueCountFrequency (%)
2 37
33.3%
5 16
14.4%
1 12
 
10.8%
4 8
 
7.2%
0 8
 
7.2%
8 8
 
7.2%
3 7
 
6.3%
9 7
 
6.3%
6 5
 
4.5%
7 3
 
2.7%
Other Punctuation
ValueCountFrequency (%)
. 11
40.7%
, 7
25.9%
/ 3
 
11.1%
· 3
 
11.1%
& 2
 
7.4%
' 1
 
3.7%
Space Separator
ValueCountFrequency (%)
473
100.0%
Open Punctuation
ValueCountFrequency (%)
( 113
100.0%
Close Punctuation
ValueCountFrequency (%)
) 113
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9792
90.0%
Common 843
 
7.7%
Latin 251
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
260
 
2.7%
238
 
2.4%
234
 
2.4%
227
 
2.3%
220
 
2.2%
177
 
1.8%
160
 
1.6%
144
 
1.5%
137
 
1.4%
131
 
1.3%
Other values (581) 7864
80.3%
Latin
ValueCountFrequency (%)
S 23
 
9.2%
G 20
 
8.0%
C 17
 
6.8%
o 14
 
5.6%
e 13
 
5.2%
O 13
 
5.2%
E 12
 
4.8%
U 11
 
4.4%
f 11
 
4.4%
N 9
 
3.6%
Other values (31) 108
43.0%
Common
ValueCountFrequency (%)
473
56.1%
( 113
 
13.4%
) 113
 
13.4%
2 37
 
4.4%
5 16
 
1.9%
1 12
 
1.4%
. 11
 
1.3%
4 8
 
0.9%
0 8
 
0.9%
8 8
 
0.9%
Other values (11) 44
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9791
89.9%
ASCII 1091
 
10.0%
None 3
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
473
43.4%
( 113
 
10.4%
) 113
 
10.4%
2 37
 
3.4%
S 23
 
2.1%
G 20
 
1.8%
C 17
 
1.6%
5 16
 
1.5%
o 14
 
1.3%
e 13
 
1.2%
Other values (51) 252
23.1%
Hangul
ValueCountFrequency (%)
260
 
2.7%
238
 
2.4%
234
 
2.4%
227
 
2.3%
220
 
2.2%
177
 
1.8%
160
 
1.6%
144
 
1.5%
137
 
1.4%
131
 
1.3%
Other values (580) 7863
80.3%
None
ValueCountFrequency (%)
· 3
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct1325
Distinct (%)77.8%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
2023-12-13T05:12:14.486848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length50
Mean length22.908397
Min length11

Characters and Unicode

Total characters39013
Distinct characters296
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1111 ?
Unique (%)65.2%

Sample

1st row충청남도 서천군 마산면
2nd row충청남도 서천군 한산면 한산모시길38번길 8
3rd row충청남도 서천군 한산면 한산모시길 42-7
4th row충청남도 서천군 판교면 종판로887번길 18
5th row충청남도 서천군 비인면 비인로 202
ValueCountFrequency (%)
충청남도 1703
19.0%
서천군 1703
19.0%
서천읍 576
 
6.4%
장항읍 468
 
5.2%
서면 193
 
2.2%
서천로 129
 
1.4%
마서면 129
 
1.4%
충절로 113
 
1.3%
1층 98
 
1.1%
한산면 84
 
0.9%
Other values (1087) 3779
42.1%
2023-12-13T05:12:15.096179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7272
18.6%
3102
 
8.0%
2612
 
6.7%
1939
 
5.0%
1779
 
4.6%
1747
 
4.5%
1732
 
4.4%
1729
 
4.4%
1 1513
 
3.9%
1368
 
3.5%
Other values (286) 14220
36.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24942
63.9%
Space Separator 7272
 
18.6%
Decimal Number 5960
 
15.3%
Dash Punctuation 416
 
1.1%
Other Punctuation 327
 
0.8%
Close Punctuation 44
 
0.1%
Open Punctuation 44
 
0.1%
Uppercase Letter 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3102
12.4%
2612
 
10.5%
1939
 
7.8%
1779
 
7.1%
1747
 
7.0%
1732
 
6.9%
1729
 
6.9%
1368
 
5.5%
1044
 
4.2%
1044
 
4.2%
Other values (263) 6846
27.4%
Decimal Number
ValueCountFrequency (%)
1 1513
25.4%
2 819
13.7%
4 608
10.2%
3 557
 
9.3%
5 497
 
8.3%
7 423
 
7.1%
8 419
 
7.0%
0 380
 
6.4%
6 372
 
6.2%
9 372
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
A 2
25.0%
X 2
25.0%
B 1
12.5%
T 1
12.5%
P 1
12.5%
E 1
12.5%
Other Punctuation
ValueCountFrequency (%)
, 324
99.1%
. 2
 
0.6%
/ 1
 
0.3%
Space Separator
ValueCountFrequency (%)
7272
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 416
100.0%
Close Punctuation
ValueCountFrequency (%)
) 44
100.0%
Open Punctuation
ValueCountFrequency (%)
( 44
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24942
63.9%
Common 14063
36.0%
Latin 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3102
12.4%
2612
 
10.5%
1939
 
7.8%
1779
 
7.1%
1747
 
7.0%
1732
 
6.9%
1729
 
6.9%
1368
 
5.5%
1044
 
4.2%
1044
 
4.2%
Other values (263) 6846
27.4%
Common
ValueCountFrequency (%)
7272
51.7%
1 1513
 
10.8%
2 819
 
5.8%
4 608
 
4.3%
3 557
 
4.0%
5 497
 
3.5%
7 423
 
3.0%
8 419
 
3.0%
- 416
 
3.0%
0 380
 
2.7%
Other values (7) 1159
 
8.2%
Latin
ValueCountFrequency (%)
A 2
25.0%
X 2
25.0%
B 1
12.5%
T 1
12.5%
P 1
12.5%
E 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24942
63.9%
ASCII 14071
36.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7272
51.7%
1 1513
 
10.8%
2 819
 
5.8%
4 608
 
4.3%
3 557
 
4.0%
5 497
 
3.5%
7 423
 
3.0%
8 419
 
3.0%
- 416
 
3.0%
0 380
 
2.7%
Other values (13) 1167
 
8.3%
Hangul
ValueCountFrequency (%)
3102
12.4%
2612
 
10.5%
1939
 
7.8%
1779
 
7.1%
1747
 
7.0%
1732
 
6.9%
1729
 
6.9%
1368
 
5.5%
1044
 
4.2%
1044
 
4.2%
Other values (263) 6846
27.4%
Distinct1272
Distinct (%)74.7%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
2023-12-13T05:12:15.597974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length43
Mean length23.565473
Min length11

Characters and Unicode

Total characters40132
Distinct characters297
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1057 ?
Unique (%)62.1%

Sample

1st row충청남도 서천군 마산면 신장리 250
2nd row충청남도 서천군 한산면 지현리 102
3rd row충청남도 서천군 한산면
4th row충청남도 서천군 판교면 현암리 147-12
5th row충청남도 서천군 비인면 성내리 500-1
ValueCountFrequency (%)
충청남도 1703
19.5%
서천군 1703
19.5%
서천읍 576
 
6.6%
장항읍 468
 
5.4%
군사리 376
 
4.3%
서면 193
 
2.2%
사곡리 145
 
1.7%
신창리 130
 
1.5%
마서면 129
 
1.5%
도둔리 122
 
1.4%
Other values (1446) 3183
36.5%
2023-12-13T05:12:16.269604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8649
21.6%
2695
 
6.7%
2452
 
6.1%
2084
 
5.2%
1891
 
4.7%
1726
 
4.3%
1711
 
4.3%
1704
 
4.2%
1646
 
4.1%
1 1432
 
3.6%
Other values (287) 14142
35.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 23256
57.9%
Space Separator 8649
 
21.6%
Decimal Number 6880
 
17.1%
Dash Punctuation 1316
 
3.3%
Open Punctuation 11
 
< 0.1%
Close Punctuation 11
 
< 0.1%
Uppercase Letter 7
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2695
11.6%
2452
10.5%
2084
 
9.0%
1891
 
8.1%
1726
 
7.4%
1711
 
7.4%
1704
 
7.3%
1646
 
7.1%
1044
 
4.5%
661
 
2.8%
Other values (266) 5642
24.3%
Decimal Number
ValueCountFrequency (%)
1 1432
20.8%
2 1022
14.9%
3 742
10.8%
6 677
9.8%
4 607
8.8%
5 598
8.7%
8 514
 
7.5%
7 510
 
7.4%
0 399
 
5.8%
9 379
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
X 2
28.6%
B 1
14.3%
P 1
14.3%
A 1
14.3%
E 1
14.3%
T 1
14.3%
Space Separator
ValueCountFrequency (%)
8649
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1316
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 23256
57.9%
Common 16869
42.0%
Latin 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2695
11.6%
2452
10.5%
2084
 
9.0%
1891
 
8.1%
1726
 
7.4%
1711
 
7.4%
1704
 
7.3%
1646
 
7.1%
1044
 
4.5%
661
 
2.8%
Other values (266) 5642
24.3%
Common
ValueCountFrequency (%)
8649
51.3%
1 1432
 
8.5%
- 1316
 
7.8%
2 1022
 
6.1%
3 742
 
4.4%
6 677
 
4.0%
4 607
 
3.6%
5 598
 
3.5%
8 514
 
3.0%
7 510
 
3.0%
Other values (5) 802
 
4.8%
Latin
ValueCountFrequency (%)
X 2
28.6%
B 1
14.3%
P 1
14.3%
A 1
14.3%
E 1
14.3%
T 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 23256
57.9%
ASCII 16876
42.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8649
51.3%
1 1432
 
8.5%
- 1316
 
7.8%
2 1022
 
6.1%
3 742
 
4.4%
6 677
 
4.0%
4 607
 
3.6%
5 598
 
3.5%
8 514
 
3.0%
7 510
 
3.0%
Other values (11) 809
 
4.8%
Hangul
ValueCountFrequency (%)
2695
11.6%
2452
10.5%
2084
 
9.0%
1891
 
8.1%
1726
 
7.4%
1711
 
7.4%
1704
 
7.3%
1646
 
7.1%
1044
 
4.5%
661
 
2.8%
Other values (266) 5642
24.3%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.4 KiB
2023-07-12
1703 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-12
2nd row2023-07-12
3rd row2023-07-12
4th row2023-07-12
5th row2023-07-12

Common Values

ValueCountFrequency (%)
2023-07-12 1703
100.0%

Length

2023-12-13T05:12:16.448358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:12:16.583393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-12 1703
100.0%

Missing values

2023-12-13T05:12:12.928625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:12:13.034932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지(도로명)소재지(지번)데이터기준일
0일반음식점신흥회관충청남도 서천군 마산면충청남도 서천군 마산면 신장리 2502023-07-12
1일반음식점삼거리식당충청남도 서천군 한산면 한산모시길38번길 8충청남도 서천군 한산면 지현리 1022023-07-12
2일반음식점한미옥충청남도 서천군 한산면 한산모시길 42-7충청남도 서천군 한산면2023-07-12
3일반음식점우이식당충청남도 서천군 판교면 종판로887번길 18충청남도 서천군 판교면 현암리 147-122023-07-12
4일반음식점어울림충청남도 서천군 비인면 비인로 202충청남도 서천군 비인면 성내리 500-12023-07-12
5일반음식점은해곱창충청남도 서천군 서천읍 충절로59번길 52충청남도 서천군 서천읍 군사리 513-52023-07-12
6일반음식점비인반점충청남도 서천군 비인면 비인로 201충청남도 서천군 비인면 성내리 503-32023-07-12
7일반음식점경주식당충청남도 서천군 서천읍 서문로 33충청남도 서천군 서천읍 군사리 159-62023-07-12
8일반음식점오라리집충청남도 서천군 한산면충청남도 서천군 한산면 지현리 1682023-07-12
9일반음식점두리식당충청남도 서천군 문산면충청남도 서천군 문산면 신농리 602023-07-12
업종명업소명소재지(도로명)소재지(지번)데이터기준일
1693건강기능식품일반판매업유한회사 티오디충청남도 서천군 장항읍 장서로 30, 장항종로약국충청남도 서천군 장항읍 창선1리 158-4 장항종로약국2023-07-12
1694건강기능식품일반판매업인셀덤우리대리점충청남도 서천군 장항읍 장서로 46충청남도 서천군 장항읍 창선2리 4902023-07-12
1695건강기능식품일반판매업J&J 뷰티샵충청남도 서천군 서천읍 사곡로 71충청남도 서천군 서천읍 사곡리 117-22023-07-12
1696건강기능식품일반판매업채쓰마켓충청남도 서천군 서천읍 서문로 39, 2층충청남도 서천군 서천읍 군사리 216-92023-07-12
1697건강기능식품일반판매업샤인몰충청남도 서천군 서천읍 충절로59번길 45충청남도 서천군 서천읍 군사리 5112023-07-12
1698건강기능식품일반판매업빛과소금충청남도 서천군 서천읍 서문로 31충청남도 서천군 서천읍 군사리 159-72023-07-12
1699건강기능식품일반판매업백조바다충청남도 서천군 장항읍 장서로 63, 502호 (골든 팰리스)충청남도 서천군 장항읍 창선2리 256 골든 팰리스 502호2023-07-12
1700건강기능식품일반판매업마켓보라 장항점충청남도 서천군 장항읍 성화로 104, 일반동 12호충청남도 서천군 장항읍 신창리 1102023-07-12
1701건강기능식품일반판매업방실이네알뜰상회충청남도 서천군 서천읍 서천로 134충청남도 서천군 서천읍 군사리 752-22023-07-12
1702건강기능식품일반판매업엔케이(NK)몰충청남도 서천군 마서면 송신로 427-7충청남도 서천군 마서면 한성리 428-12023-07-12

Duplicate rows

Most frequently occurring

업종명업소명소재지(도로명)소재지(지번)데이터기준일# duplicates
0식품자동판매기영업하구둑광장충청남도 서천군 마서면충청남도 서천군 마서면 도삼리 751-112023-07-122