Overview

Dataset statistics

Number of variables6
Number of observations3841
Missing cells875
Missing cells (%)3.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory180.2 KiB
Average record size in memory48.0 B

Variable types

Categorical3
Text3

Dataset

Description충청북도 충주시 음식점 정보에 대한 데이터 제공(업종명, 업소명, 도로명 주소, 소재지 전화, 업태명, 문의전화 등)
URLhttps://www.data.go.kr/data/3037407/fileData.do

Alerts

업종명 has constant value ""Constant
문의전화 has constant value ""Constant
소재지전화 has 875 (22.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 18:08:06.132356
Analysis finished2023-12-12 18:08:07.312999
Duration1.18 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size30.1 KiB
일반음식점
3841 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 3841
100.0%

Length

2023-12-13T03:08:07.388094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:08:07.501133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 3841
100.0%
Distinct3834
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size30.1 KiB
2023-12-13T03:08:07.757280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length27
Mean length6.4043218
Min length1

Characters and Unicode

Total characters24599
Distinct characters901
Distinct categories10 ?
Distinct scripts5 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3827 ?
Unique (%)99.6%

Sample

1st row아서원
2nd row중원생고기마을
3rd row대미식당
4th row대동식당
5th row유선분식
ValueCountFrequency (%)
salad 3
 
0.1%
충주호암점 3
 
0.1%
서충주점 3
 
0.1%
충주연수점 3
 
0.1%
페리카나양념통닭 2
 
0.1%
슬기로운대학생활 2
 
0.1%
burger 2
 
0.1%
monster 2
 
0.1%
충주(상)휴게소 2
 
0.1%
우하하 2
 
0.1%
Other values (3908) 3918
99.4%
2023-12-13T03:08:08.236244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
833
 
3.4%
608
 
2.5%
476
 
1.9%
473
 
1.9%
444
 
1.8%
389
 
1.6%
388
 
1.6%
325
 
1.3%
280
 
1.1%
271
 
1.1%
Other values (891) 20112
81.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22418
91.1%
Uppercase Letter 662
 
2.7%
Lowercase Letter 459
 
1.9%
Decimal Number 336
 
1.4%
Open Punctuation 258
 
1.0%
Close Punctuation 258
 
1.0%
Other Punctuation 102
 
0.4%
Space Separator 101
 
0.4%
Dash Punctuation 4
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
833
 
3.7%
608
 
2.7%
476
 
2.1%
473
 
2.1%
444
 
2.0%
389
 
1.7%
388
 
1.7%
325
 
1.4%
280
 
1.2%
271
 
1.2%
Other values (815) 17931
80.0%
Uppercase Letter
ValueCountFrequency (%)
C 79
 
11.9%
A 67
 
10.1%
B 57
 
8.6%
O 40
 
6.0%
E 38
 
5.7%
I 35
 
5.3%
N 35
 
5.3%
G 29
 
4.4%
R 28
 
4.2%
S 27
 
4.1%
Other values (16) 227
34.3%
Lowercase Letter
ValueCountFrequency (%)
e 69
15.0%
a 52
11.3%
o 42
 
9.2%
n 31
 
6.8%
r 29
 
6.3%
s 28
 
6.1%
i 24
 
5.2%
t 24
 
5.2%
l 20
 
4.4%
u 18
 
3.9%
Other values (14) 122
26.6%
Decimal Number
ValueCountFrequency (%)
1 84
25.0%
2 56
16.7%
9 38
11.3%
0 35
10.4%
3 30
 
8.9%
5 21
 
6.2%
7 20
 
6.0%
6 20
 
6.0%
4 19
 
5.7%
8 13
 
3.9%
Other Punctuation
ValueCountFrequency (%)
& 55
53.9%
. 20
 
19.6%
, 12
 
11.8%
' 3
 
2.9%
# 3
 
2.9%
/ 3
 
2.9%
· 3
 
2.9%
! 2
 
2.0%
1
 
1.0%
Open Punctuation
ValueCountFrequency (%)
( 257
99.6%
[ 1
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 257
99.6%
] 1
 
0.4%
Space Separator
ValueCountFrequency (%)
101
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22387
91.0%
Latin 1121
 
4.6%
Common 1060
 
4.3%
Han 23
 
0.1%
Hiragana 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
833
 
3.7%
608
 
2.7%
476
 
2.1%
473
 
2.1%
444
 
2.0%
389
 
1.7%
388
 
1.7%
325
 
1.5%
280
 
1.3%
271
 
1.2%
Other values (789) 17900
80.0%
Latin
ValueCountFrequency (%)
C 79
 
7.0%
e 69
 
6.2%
A 67
 
6.0%
B 57
 
5.1%
a 52
 
4.6%
o 42
 
3.7%
O 40
 
3.6%
E 38
 
3.4%
I 35
 
3.1%
N 35
 
3.1%
Other values (40) 607
54.1%
Common
ValueCountFrequency (%)
( 257
24.2%
) 257
24.2%
101
 
9.5%
1 84
 
7.9%
2 56
 
5.3%
& 55
 
5.2%
9 38
 
3.6%
0 35
 
3.3%
3 30
 
2.8%
5 21
 
2.0%
Other values (16) 126
11.9%
Han
ValueCountFrequency (%)
3
 
13.0%
2
 
8.7%
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (9) 9
39.1%
Hiragana
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22387
91.0%
ASCII 2176
 
8.8%
CJK 22
 
0.1%
Hiragana 8
 
< 0.1%
None 4
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
833
 
3.7%
608
 
2.7%
476
 
2.1%
473
 
2.1%
444
 
2.0%
389
 
1.7%
388
 
1.7%
325
 
1.5%
280
 
1.3%
271
 
1.2%
Other values (789) 17900
80.0%
ASCII
ValueCountFrequency (%)
( 257
 
11.8%
) 257
 
11.8%
101
 
4.6%
1 84
 
3.9%
C 79
 
3.6%
e 69
 
3.2%
A 67
 
3.1%
B 57
 
2.6%
2 56
 
2.6%
& 55
 
2.5%
Other values (63) 1094
50.3%
CJK
ValueCountFrequency (%)
3
 
13.6%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (8) 8
36.4%
None
ValueCountFrequency (%)
· 3
75.0%
1
 
25.0%
Hiragana
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct3463
Distinct (%)90.2%
Missing0
Missing (%)0.0%
Memory size30.1 KiB
2023-12-13T03:08:08.583224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length55
Mean length24.959646
Min length9

Characters and Unicode

Total characters95870
Distinct characters372
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3177 ?
Unique (%)82.7%

Sample

1st row충청북도 충주시 관아1길 8-1 (성내동)
2nd row충청북도 충주시 관아3길 11 (성서동)
3rd row충청북도 충주시 금가면 대미길 59
4th row충청북도 충주시 수안보면 온천중앙길 23-1
5th row충청북도 충주시 예성로 165-1, 1층 (성서동)
ValueCountFrequency (%)
충청북도 3818
 
17.9%
충주시 3818
 
17.9%
1층 1180
 
5.5%
연수동 598
 
2.8%
교현동 371
 
1.7%
칠금동 250
 
1.2%
대소원면 243
 
1.1%
호암동 210
 
1.0%
용산동 181
 
0.8%
봉방동 169
 
0.8%
Other values (1927) 10551
49.3%
2023-12-13T03:08:09.110553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17549
18.3%
7964
 
8.3%
1 4422
 
4.6%
4074
 
4.2%
3902
 
4.1%
3898
 
4.1%
3860
 
4.0%
3822
 
4.0%
3238
 
3.4%
) 2776
 
2.9%
Other values (362) 40365
42.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56426
58.9%
Space Separator 17549
 
18.3%
Decimal Number 13632
 
14.2%
Close Punctuation 2776
 
2.9%
Open Punctuation 2774
 
2.9%
Other Punctuation 1753
 
1.8%
Dash Punctuation 664
 
0.7%
Uppercase Letter 214
 
0.2%
Math Symbol 60
 
0.1%
Lowercase Letter 22
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7964
 
14.1%
4074
 
7.2%
3902
 
6.9%
3898
 
6.9%
3860
 
6.8%
3822
 
6.8%
3238
 
5.7%
2152
 
3.8%
1773
 
3.1%
1414
 
2.5%
Other values (322) 20329
36.0%
Uppercase Letter
ValueCountFrequency (%)
E 46
21.5%
A 45
21.0%
C 30
14.0%
L 26
12.1%
R 24
11.2%
P 23
10.7%
B 14
 
6.5%
H 3
 
1.4%
T 1
 
0.5%
K 1
 
0.5%
Decimal Number
ValueCountFrequency (%)
1 4422
32.4%
2 2019
14.8%
3 1312
 
9.6%
4 1051
 
7.7%
0 992
 
7.3%
5 976
 
7.2%
6 774
 
5.7%
7 767
 
5.6%
9 665
 
4.9%
8 654
 
4.8%
Lowercase Letter
ValueCountFrequency (%)
e 12
54.5%
c 3
 
13.6%
l 2
 
9.1%
s 1
 
4.5%
i 1
 
4.5%
v 1
 
4.5%
u 1
 
4.5%
h 1
 
4.5%
Other Punctuation
ValueCountFrequency (%)
, 1749
99.8%
@ 2
 
0.1%
. 1
 
0.1%
· 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
> 23
38.3%
< 23
38.3%
~ 14
23.3%
Space Separator
ValueCountFrequency (%)
17549
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2776
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2774
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 664
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56426
58.9%
Common 39208
40.9%
Latin 236
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7964
 
14.1%
4074
 
7.2%
3902
 
6.9%
3898
 
6.9%
3860
 
6.8%
3822
 
6.8%
3238
 
5.7%
2152
 
3.8%
1773
 
3.1%
1414
 
2.5%
Other values (322) 20329
36.0%
Common
ValueCountFrequency (%)
17549
44.8%
1 4422
 
11.3%
) 2776
 
7.1%
( 2774
 
7.1%
2 2019
 
5.1%
, 1749
 
4.5%
3 1312
 
3.3%
4 1051
 
2.7%
0 992
 
2.5%
5 976
 
2.5%
Other values (11) 3588
 
9.2%
Latin
ValueCountFrequency (%)
E 46
19.5%
A 45
19.1%
C 30
12.7%
L 26
11.0%
R 24
10.2%
P 23
9.7%
B 14
 
5.9%
e 12
 
5.1%
H 3
 
1.3%
c 3
 
1.3%
Other values (9) 10
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56426
58.9%
ASCII 39443
41.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17549
44.5%
1 4422
 
11.2%
) 2776
 
7.0%
( 2774
 
7.0%
2 2019
 
5.1%
, 1749
 
4.4%
3 1312
 
3.3%
4 1051
 
2.7%
0 992
 
2.5%
5 976
 
2.5%
Other values (29) 3823
 
9.7%
Hangul
ValueCountFrequency (%)
7964
 
14.1%
4074
 
7.2%
3902
 
6.9%
3898
 
6.9%
3860
 
6.8%
3822
 
6.8%
3238
 
5.7%
2152
 
3.8%
1773
 
3.1%
1414
 
2.5%
Other values (322) 20329
36.0%
None
ValueCountFrequency (%)
· 1
100.0%

소재지전화
Text

MISSING 

Distinct2873
Distinct (%)96.9%
Missing875
Missing (%)22.8%
Memory size30.1 KiB
2023-12-13T03:08:09.388164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.017869
Min length9

Characters and Unicode

Total characters35645
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2808 ?
Unique (%)94.7%

Sample

1st row043-847-3639
2nd row043-847-1525
3rd row043-853-7066
4th row043-846-3406
5th row043-847-2003
ValueCountFrequency (%)
043-849-7071 5
 
0.2%
043-851-5773 5
 
0.2%
043-857-9339 5
 
0.2%
043-857-5002 4
 
0.1%
043-850-8614 4
 
0.1%
043-853-5555 4
 
0.1%
043-850-8615 4
 
0.1%
043-849-7926 4
 
0.1%
031-5171-1646 4
 
0.1%
043-857-5001 3
 
0.1%
Other values (2863) 2924
98.6%
2023-12-13T03:08:09.794372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 5931
16.6%
4 5554
15.6%
0 4524
12.7%
3 4483
12.6%
8 4427
12.4%
5 3366
9.4%
2 1760
 
4.9%
7 1631
 
4.6%
9 1389
 
3.9%
6 1325
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 29714
83.4%
Dash Punctuation 5931
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 5554
18.7%
0 4524
15.2%
3 4483
15.1%
8 4427
14.9%
5 3366
11.3%
2 1760
 
5.9%
7 1631
 
5.5%
9 1389
 
4.7%
6 1325
 
4.5%
1 1255
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 5931
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 35645
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 5931
16.6%
4 5554
15.6%
0 4524
12.7%
3 4483
12.6%
8 4427
12.4%
5 3366
9.4%
2 1760
 
4.9%
7 1631
 
4.6%
9 1389
 
3.9%
6 1325
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 35645
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 5931
16.6%
4 5554
15.6%
0 4524
12.7%
3 4483
12.6%
8 4427
12.4%
5 3366
9.4%
2 1760
 
4.9%
7 1631
 
4.6%
9 1389
 
3.9%
6 1325
 
3.7%

업태명
Categorical

Distinct23
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size30.1 KiB
한식
1888 
호프/통닭
480 
기타
302 
분식
225 
식육(숯불구이)
217 
Other values (18)
729 

Length

Max length15
Median length2
Mean length3.1830253
Min length2

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row중국식
2nd row식육(숯불구이)
3rd row한식
4th row한식
5th row분식

Common Values

ValueCountFrequency (%)
한식 1888
49.2%
호프/통닭 480
 
12.5%
기타 302
 
7.9%
분식 225
 
5.9%
식육(숯불구이) 217
 
5.6%
중국식 184
 
4.8%
경양식 139
 
3.6%
정종/대포집/소주방 77
 
2.0%
일식 74
 
1.9%
횟집 62
 
1.6%
Other values (13) 193
 
5.0%

Length

2023-12-13T03:08:09.988696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한식 1888
49.2%
호프/통닭 480
 
12.5%
기타 302
 
7.9%
분식 225
 
5.9%
식육(숯불구이 217
 
5.6%
중국식 184
 
4.8%
경양식 139
 
3.6%
정종/대포집/소주방 77
 
2.0%
일식 74
 
1.9%
횟집 62
 
1.6%
Other values (13) 193
 
5.0%

문의전화
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size30.1 KiB
043-850-3473
3841 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row043-850-3473
2nd row043-850-3473
3rd row043-850-3473
4th row043-850-3473
5th row043-850-3473

Common Values

ValueCountFrequency (%)
043-850-3473 3841
100.0%

Length

2023-12-13T03:08:10.159743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:08:10.247517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
043-850-3473 3841
100.0%

Missing values

2023-12-13T03:08:07.099038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:08:07.246938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지(도로명)소재지전화업태명문의전화
0일반음식점아서원충청북도 충주시 관아1길 8-1 (성내동)043-847-3639중국식043-850-3473
1일반음식점중원생고기마을충청북도 충주시 관아3길 11 (성서동)043-847-1525식육(숯불구이)043-850-3473
2일반음식점대미식당충청북도 충주시 금가면 대미길 59043-853-7066한식043-850-3473
3일반음식점대동식당충청북도 충주시 수안보면 온천중앙길 23-1043-846-3406한식043-850-3473
4일반음식점유선분식충청북도 충주시 예성로 165-1, 1층 (성서동)043-847-2003분식043-850-3473
5일반음식점만리식당충청북도 충주시 수안보면 물탕2길 5043-846-3206한식043-850-3473
6일반음식점목벌식당충청북도 충주시 충인6길 31-1 (충인동)043-843-4710한식043-850-3473
7일반음식점충북식당충청북도 충주시 봉계12길 32 (봉방동)043-847-6520한식043-850-3473
8일반음식점금미집충청북도 충주시 충인6길 37 (충인동)043-845-2771한식043-850-3473
9일반음식점들림횟집충청북도 충주시 살미면 팔봉향산길 374043-851-0083횟집043-850-3473
업종명업소명소재지(도로명)소재지전화업태명문의전화
3831일반음식점갓튀긴후라이드충주점충청북도 충주시 금곡로 5, 109호 (연수동)<NA>호프/통닭043-850-3473
3832일반음식점한촌설렁탕호암점충청북도 충주시 호암토성5길 42, 1층 (호암동)<NA>한식043-850-3473
3833일반음식점쿠니소바충주호암점충청북도 충주시 호암수청1로 60, 상가동 2층 203호 (호암동, 제일 풍경채)<NA>일식043-850-3473
3834일반음식점카페깬닙충청북도 충주시 월촌5길 16, 충주안림 LH천년나무 2단지 상가 상가동 1층 102호 (안림동)<NA>기타043-850-3473
3835일반음식점점프충주교현점(JUMP)충청북도 충주시 갱고개로 57, 1층 (교현동)<NA>기타043-850-3473
3836일반음식점명륜진사갈비서충주기업도시점충청북도 충주시 중앙탑면 기업도시로 214, 2층 201호043-7802-5254식육(숯불구이)043-850-3473
3837일반음식점웨스트.본1093(West.Bon1093)충청북도 충주시 대소원면 첨단쉼터3길 4, 1층<NA>기타043-850-3473
3838일반음식점천도국밥충청북도 충주시 봉현로 290, 1층 (교현동)<NA>한식043-850-3473
3839일반음식점인생대패충청북도 충주시 봉계1길 29, 1층 (봉방동)043-845-2394식육(숯불구이)043-850-3473
3840일반음식점정타코·붕어충청북도 충주시 사직산8길 50, 1층 (문화동)<NA>기타043-850-3473