Overview

Dataset statistics

Number of variables5
Number of observations1725
Missing cells275
Missing cells (%)3.2%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory67.5 KiB
Average record size in memory40.1 B

Variable types

Categorical1
Text4

Dataset

Description충청남도 공주시 일반 음식점 현황에 대한 데이터로 (업종명, 업소명, 영업자, 소재지(도로명) ) 등의 항목을 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=419&beforeMenuCd=DOM_000000201001001000&publicdatapk=15051147

Alerts

업종명 has constant value ""Constant
Dataset has 1 (0.1%) duplicate rowsDuplicates
소재지전화 has 275 (15.9%) missing valuesMissing

Reproduction

Analysis started2024-01-09 20:28:49.606718
Analysis finished2024-01-09 20:28:50.274803
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size13.6 KiB
일반음식점
1725 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 1725
100.0%

Length

2024-01-10T05:28:50.334290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:28:50.420609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 1725
100.0%
Distinct1666
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size13.6 KiB
2024-01-10T05:28:50.590463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length5.124058
Min length1

Characters and Unicode

Total characters8839
Distinct characters679
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1616 ?
Unique (%)93.7%

Sample

1st row#모아모아
2nd row(주)동양식품이인(상)휴게소
3rd row(주)동양식품탄천(하)휴게소
4th row(주)웅진
5th row12월의왈츠
ValueCountFrequency (%)
공주식당 6
 
0.3%
우리식당 4
 
0.2%
메아리식당 3
 
0.2%
전주식당 3
 
0.2%
현대식당 3
 
0.2%
원두막 2
 
0.1%
징기스칸치킨 2
 
0.1%
더쉼 2
 
0.1%
초원가든 2
 
0.1%
자매식당 2
 
0.1%
Other values (1666) 1709
98.3%
2024-01-10T05:28:50.929812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
340
 
3.8%
294
 
3.3%
200
 
2.3%
138
 
1.6%
123
 
1.4%
116
 
1.3%
115
 
1.3%
114
 
1.3%
111
 
1.3%
107
 
1.2%
Other values (669) 7181
81.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8571
97.0%
Uppercase Letter 63
 
0.7%
Open Punctuation 49
 
0.6%
Close Punctuation 49
 
0.6%
Decimal Number 43
 
0.5%
Lowercase Letter 39
 
0.4%
Space Separator 13
 
0.1%
Other Punctuation 11
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
340
 
4.0%
294
 
3.4%
200
 
2.3%
138
 
1.6%
123
 
1.4%
116
 
1.4%
115
 
1.3%
114
 
1.3%
111
 
1.3%
107
 
1.2%
Other values (613) 6913
80.7%
Uppercase Letter
ValueCountFrequency (%)
C 8
12.7%
B 7
11.1%
O 6
 
9.5%
N 5
 
7.9%
D 4
 
6.3%
H 4
 
6.3%
E 4
 
6.3%
I 3
 
4.8%
T 3
 
4.8%
P 2
 
3.2%
Other values (12) 17
27.0%
Lowercase Letter
ValueCountFrequency (%)
e 10
25.6%
o 5
12.8%
c 4
 
10.3%
a 3
 
7.7%
h 2
 
5.1%
r 2
 
5.1%
s 2
 
5.1%
f 2
 
5.1%
n 2
 
5.1%
y 1
 
2.6%
Other values (6) 6
15.4%
Decimal Number
ValueCountFrequency (%)
2 7
16.3%
0 6
14.0%
5 6
14.0%
7 6
14.0%
1 5
11.6%
8 4
9.3%
4 3
7.0%
6 3
7.0%
9 2
 
4.7%
3 1
 
2.3%
Other Punctuation
ValueCountFrequency (%)
. 6
54.5%
& 3
27.3%
# 1
 
9.1%
· 1
 
9.1%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Space Separator
ValueCountFrequency (%)
13
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8571
97.0%
Common 166
 
1.9%
Latin 102
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
340
 
4.0%
294
 
3.4%
200
 
2.3%
138
 
1.6%
123
 
1.4%
116
 
1.4%
115
 
1.3%
114
 
1.3%
111
 
1.3%
107
 
1.2%
Other values (613) 6913
80.7%
Latin
ValueCountFrequency (%)
e 10
 
9.8%
C 8
 
7.8%
B 7
 
6.9%
O 6
 
5.9%
N 5
 
4.9%
o 5
 
4.9%
c 4
 
3.9%
D 4
 
3.9%
H 4
 
3.9%
E 4
 
3.9%
Other values (28) 45
44.1%
Common
ValueCountFrequency (%)
( 49
29.5%
) 49
29.5%
13
 
7.8%
2 7
 
4.2%
0 6
 
3.6%
5 6
 
3.6%
7 6
 
3.6%
. 6
 
3.6%
1 5
 
3.0%
8 4
 
2.4%
Other values (8) 15
 
9.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8571
97.0%
ASCII 267
 
3.0%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
340
 
4.0%
294
 
3.4%
200
 
2.3%
138
 
1.6%
123
 
1.4%
116
 
1.4%
115
 
1.3%
114
 
1.3%
111
 
1.3%
107
 
1.2%
Other values (613) 6913
80.7%
ASCII
ValueCountFrequency (%)
( 49
18.4%
) 49
18.4%
13
 
4.9%
e 10
 
3.7%
C 8
 
3.0%
2 7
 
2.6%
B 7
 
2.6%
0 6
 
2.2%
5 6
 
2.2%
7 6
 
2.2%
Other values (45) 106
39.7%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct1557
Distinct (%)90.3%
Missing0
Missing (%)0.0%
Memory size13.6 KiB
2024-01-10T05:28:51.241009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length3
Mean length3.0278261
Min length2

Characters and Unicode

Total characters5223
Distinct characters224
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1438 ?
Unique (%)83.4%

Sample

1st row권동우
2nd row임태기
3rd row임태기
4th row권영기
5th row명정숙
ValueCountFrequency (%)
11
 
0.6%
1명 11
 
0.6%
이정숙 6
 
0.3%
김영희 5
 
0.3%
이명수 5
 
0.3%
김미숙 5
 
0.3%
김명자 5
 
0.3%
이현숙 4
 
0.2%
김정숙 4
 
0.2%
김영숙 4
 
0.2%
Other values (1548) 1687
96.6%
2024-01-10T05:28:51.916059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
355
 
6.8%
309
 
5.9%
215
 
4.1%
196
 
3.8%
190
 
3.6%
146
 
2.8%
138
 
2.6%
125
 
2.4%
123
 
2.4%
99
 
1.9%
Other values (214) 3327
63.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5190
99.4%
Space Separator 22
 
0.4%
Decimal Number 11
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
355
 
6.8%
309
 
6.0%
215
 
4.1%
196
 
3.8%
190
 
3.7%
146
 
2.8%
138
 
2.7%
125
 
2.4%
123
 
2.4%
99
 
1.9%
Other values (212) 3294
63.5%
Space Separator
ValueCountFrequency (%)
22
100.0%
Decimal Number
ValueCountFrequency (%)
1 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5190
99.4%
Common 33
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
355
 
6.8%
309
 
6.0%
215
 
4.1%
196
 
3.8%
190
 
3.7%
146
 
2.8%
138
 
2.7%
125
 
2.4%
123
 
2.4%
99
 
1.9%
Other values (212) 3294
63.5%
Common
ValueCountFrequency (%)
22
66.7%
1 11
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5190
99.4%
ASCII 33
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
355
 
6.8%
309
 
6.0%
215
 
4.1%
196
 
3.8%
190
 
3.7%
146
 
2.8%
138
 
2.7%
125
 
2.4%
123
 
2.4%
99
 
1.9%
Other values (212) 3294
63.5%
ASCII
ValueCountFrequency (%)
22
66.7%
1 11
33.3%
Distinct1578
Distinct (%)91.5%
Missing0
Missing (%)0.0%
Memory size13.6 KiB
2024-01-10T05:28:52.225622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length41
Mean length24.071884
Min length18

Characters and Unicode

Total characters41524
Distinct characters235
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1455 ?
Unique (%)84.3%

Sample

1st row충청남도 공주시 국고개길 21 (중동)
2nd row충청남도 공주시 이인면 논산천안고속도로 32
3rd row충청남도 공주시 탄천면 논산천안고속도로 27, 1층
4th row충청남도 공주시 의당면 신성말길 67
5th row충청남도 공주시 반포면 동학사2로 62 (672-1)
ValueCountFrequency (%)
충청남도 1725
18.9%
공주시 1725
18.9%
신관동 424
 
4.7%
1층 305
 
3.4%
반포면 202
 
2.2%
유구읍 117
 
1.3%
산성동 98
 
1.1%
번영2로 95
 
1.0%
중동 91
 
1.0%
계룡면 90
 
1.0%
Other values (1336) 4232
46.5%
2024-01-10T05:28:52.666924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7396
 
17.8%
1832
 
4.4%
1806
 
4.3%
1789
 
4.3%
1 1787
 
4.3%
1731
 
4.2%
1731
 
4.2%
1728
 
4.2%
1726
 
4.2%
1192
 
2.9%
Other values (225) 18806
45.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24292
58.5%
Space Separator 7396
 
17.8%
Decimal Number 6365
 
15.3%
Close Punctuation 1119
 
2.7%
Open Punctuation 1119
 
2.7%
Dash Punctuation 698
 
1.7%
Other Punctuation 521
 
1.3%
Uppercase Letter 13
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1832
 
7.5%
1806
 
7.4%
1789
 
7.4%
1731
 
7.1%
1731
 
7.1%
1728
 
7.1%
1726
 
7.1%
1192
 
4.9%
869
 
3.6%
848
 
3.5%
Other values (204) 9040
37.2%
Decimal Number
ValueCountFrequency (%)
1 1787
28.1%
2 1008
15.8%
3 630
 
9.9%
4 543
 
8.5%
5 481
 
7.6%
7 456
 
7.2%
6 413
 
6.5%
8 411
 
6.5%
0 346
 
5.4%
9 290
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
A 7
53.8%
B 4
30.8%
C 1
 
7.7%
E 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 519
99.6%
. 2
 
0.4%
Space Separator
ValueCountFrequency (%)
7396
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1119
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1119
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 698
100.0%
Lowercase Letter
ValueCountFrequency (%)
a 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24292
58.5%
Common 17218
41.5%
Latin 14
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1832
 
7.5%
1806
 
7.4%
1789
 
7.4%
1731
 
7.1%
1731
 
7.1%
1728
 
7.1%
1726
 
7.1%
1192
 
4.9%
869
 
3.6%
848
 
3.5%
Other values (204) 9040
37.2%
Common
ValueCountFrequency (%)
7396
43.0%
1 1787
 
10.4%
) 1119
 
6.5%
( 1119
 
6.5%
2 1008
 
5.9%
- 698
 
4.1%
3 630
 
3.7%
4 543
 
3.2%
, 519
 
3.0%
5 481
 
2.8%
Other values (6) 1918
 
11.1%
Latin
ValueCountFrequency (%)
A 7
50.0%
B 4
28.6%
a 1
 
7.1%
C 1
 
7.1%
E 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24292
58.5%
ASCII 17232
41.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7396
42.9%
1 1787
 
10.4%
) 1119
 
6.5%
( 1119
 
6.5%
2 1008
 
5.8%
- 698
 
4.1%
3 630
 
3.7%
4 543
 
3.2%
, 519
 
3.0%
5 481
 
2.8%
Other values (11) 1932
 
11.2%
Hangul
ValueCountFrequency (%)
1832
 
7.5%
1806
 
7.4%
1789
 
7.4%
1731
 
7.1%
1731
 
7.1%
1728
 
7.1%
1726
 
7.1%
1192
 
4.9%
869
 
3.6%
848
 
3.5%
Other values (204) 9040
37.2%

소재지전화
Text

MISSING 

Distinct1421
Distinct (%)98.0%
Missing275
Missing (%)15.9%
Memory size13.6 KiB
2024-01-10T05:28:52.893274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.002069
Min length12

Characters and Unicode

Total characters17403
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1393 ?
Unique (%)96.1%

Sample

1st row041-858-4455
2nd row041-856-8351
3rd row041-854-3521
4th row041-856-7108
5th row042-823-0050
ValueCountFrequency (%)
041-858-0522 3
 
0.2%
041-858-0082 2
 
0.1%
041-856-2345 2
 
0.1%
041-841-5181 2
 
0.1%
041-855-4706 2
 
0.1%
041-858-0561 2
 
0.1%
041-852-6565 2
 
0.1%
041-855-0974 2
 
0.1%
042-825-0450 2
 
0.1%
041-881-3161 2
 
0.1%
Other values (1411) 1429
98.6%
2024-01-10T05:28:53.236331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 2900
16.7%
8 2309
13.3%
4 2209
12.7%
1 2165
12.4%
0 2158
12.4%
5 1941
11.2%
2 1038
 
6.0%
6 713
 
4.1%
7 704
 
4.0%
3 699
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 14503
83.3%
Dash Punctuation 2900
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 2309
15.9%
4 2209
15.2%
1 2165
14.9%
0 2158
14.9%
5 1941
13.4%
2 1038
7.2%
6 713
 
4.9%
7 704
 
4.9%
3 699
 
4.8%
9 567
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 2900
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 17403
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 2900
16.7%
8 2309
13.3%
4 2209
12.7%
1 2165
12.4%
0 2158
12.4%
5 1941
11.2%
2 1038
 
6.0%
6 713
 
4.1%
7 704
 
4.0%
3 699
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17403
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 2900
16.7%
8 2309
13.3%
4 2209
12.7%
1 2165
12.4%
0 2158
12.4%
5 1941
11.2%
2 1038
 
6.0%
6 713
 
4.1%
7 704
 
4.0%
3 699
 
4.0%

Missing values

2024-01-10T05:28:50.150013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:28:50.237540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명영업자소재지(도로명)소재지전화
0일반음식점#모아모아권동우충청남도 공주시 국고개길 21 (중동)041-858-4455
1일반음식점(주)동양식품이인(상)휴게소임태기충청남도 공주시 이인면 논산천안고속도로 32041-856-8351
2일반음식점(주)동양식품탄천(하)휴게소임태기충청남도 공주시 탄천면 논산천안고속도로 27, 1층041-854-3521
3일반음식점(주)웅진권영기충청남도 공주시 의당면 신성말길 67041-856-7108
4일반음식점12월의왈츠명정숙충청남도 공주시 반포면 동학사2로 62 (672-1)042-823-0050
5일반음식점21세기호프광장조영자충청남도 공주시 번영2로 78-6 (신관동)041-856-3949
6일반음식점24시연자김밥이영란충청남도 공주시 공주대학로 90-2 (신관동)041-858-8863
7일반음식점24시전주명가콩나물국밥김혜란충청남도 공주시 신관로 63, 1층 (신관동)041-852-8529
8일반음식점24시청해루최용규 외 1명충청남도 공주시 공주대학로 94-9, 1,2층 (신관동)041-855-2345
9일반음식점25시뼈다귀탕이상형충청남도 공주시 흑수골길 38-6 (신관동)041-856-3882
업종명업소명영업자소재지(도로명)소재지전화
1715일반음식점훼미리한식뷔페오홍규 외 1명충청남도 공주시 월미농공단지길 20, 1층 (월미동)041-855-7600
1716일반음식점휴영최진숙충청남도 공주시 당간지주길 26-1 (중동)041-858-9006
1717일반음식점흑룡성임헌문충청남도 공주시 월미안터길 0 (월미동)041-881-1718
1718일반음식점흥덕골대화원한경자충청남도 공주시 이인면 괴재길 58-9041-856-4549
1719일반음식점흥부네진영분충청남도 공주시 매산동길 20 (신관동)041-854-1411
1720일반음식점흥부네칼국수박효성충청남도 공주시 우성면 동대리길 68041-852-0092
1721일반음식점흥부전놀부쩐우미자충청남도 공주시 번영3로 54-3, 1층 (신관동)<NA>
1722일반음식점희동이네국수이순주충청남도 공주시 무령로 302 (옥룡동)041-881-6671
1723일반음식점희망식당임오빈충청남도 공주시 전막2길 9 (신관동)041-858-2300
1724일반음식점힐하우스박진섭충청남도 공주시 반포면 동학사1로 215041-825-4046

Duplicate rows

Most frequently occurring

업종명업소명영업자소재지(도로명)소재지전화# duplicates
0일반음식점키다리식품(주)정안(상)휴게소이명수충청남도 공주시 정안면 장원장자울길 73-47041-858-05612