Overview

Dataset statistics

Number of variables: 5
Number of observations: 344
Missing cells: 222
Missing cells (%): 12.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 13.6 KiB
Average record size in memory: 40.4 B
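
The missing-cell percentage follows from the counts above, since the number of cells is variables times observations. A minimal sketch of that arithmetic, using only the figures reported in this section (the variable names are illustrative):

```python
# Figures copied from the dataset statistics above.
n_variables = 5
n_observations = 344
missing_cells = 222

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

All 222 missing cells come from a single column (소재지전화), which is why the dataset-level rate is much lower than that column's own missing rate.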

Variable types

Text: 4
DateTime: 1

Dataset

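Description: Provides information on light-meal restaurants (휴게음식점) located in Jeongeup-si, Jeollabuk-do, including business name, road-name address, lot-number address, and contact number.
Author: Jeongeup-si, Jeollabuk-do (전라북도 정읍시)
URL: https://www.data.go.kr/data/15047930/fileData.do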

Alerts

데이터기준일자 (data reference date) has constant value "2023-12-13" (Constant)
소재지전화 (phone number) has 222 (64.5%) missing values (Missing)
업소명 (business name) has unique values (Unique)

Reproduction

Analysis started: 2023-12-16 15:30:57.900088
Analysis finished: 2023-12-16 15:31:00.068525
Duration: 2.17 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
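
The reported duration can be checked against the two timestamps. A small sketch using Python's standard library; the format string is an assumption that matches how the timestamps are printed here:

```python
from datetime import datetime

# Assumed format of the timestamps as printed in this report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-16 15:30:57.900088", FMT)
finished = datetime.strptime("2023-12-16 15:31:00.068525", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```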

Variables

업소명 (business name)
Text

UNIQUE

Distinct: 344
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 21
Median length: 17
Mean length: 7.7063953
Min length: 2

Characters and Unicode

Total characters: 2651
Distinct characters: 413
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
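
As an illustration of these character properties, the sketch below counts Unicode general categories for one business name from this report using Python's standard `unicodedata` module. It is not the profiler's own implementation. The module returns two-letter codes (`Lo` = Other Letter, `Lu` = Uppercase Letter, `Nd` = Decimal Number, `Ps` = Open Punctuation, `Pe` = Close Punctuation) and does not expose script or block names, so only categories are shown.

```python
import unicodedata
from collections import Counter

# Second sample row of 업소명 in this report.
name = "정읍녹두장군(하행)휴게소GS25편의점"

# Map each character to its Unicode general category and tally the results.
category_counts = Counter(unicodedata.category(ch) for ch in name)

for category, count in category_counts.most_common():
    print(category, count)
```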

Unique

Unique: 344
Unique (%): 100.0%

Sample

1st row: 삼화다방
2nd row: 정읍녹두장군(하행)휴게소GS25편의점
3rd row: 약속다방
4th row: 신세계다방
5th row: 모두랑쌍화탕
Value | Count | Frequency (%)
씨유 | 17 | 3.3%
세븐일레븐 | 11 | 2.1%
카페 | 10 | 1.9%
정읍점 | 9 | 1.7%
정읍상동점 | 9 | 1.7%
커피 | 7 | 1.3%
정읍수성점 | 7 | 1.3%
gs25 | 5 | 1.0%
지에스25 | 5 | 1.0%
상동점 | 4 | 0.8%
Other values (414) | 439 | 83.9%
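
The word table counts whitespace-separated tokens rather than whole names, which is why a fully unique column can still have repeated values such as 씨유. A minimal sketch of that kind of count follows. Lowercasing is an assumption inferred from the `gs25` entry, and the names are a handful of rows taken from the end of this report, not the full column.

```python
from collections import Counter

# A few business names from the last rows of this report (not the full column).
names = [
    "씨유 정읍제일점",
    "씨유 정읍스타점",
    "씨유 태인IC점",
    "지에스25 정읍서부로점",
    "지에스25 정읍상동힐스점",
    "컴포즈커피 수성점",
]

# Split on whitespace and lowercase each token (assumed normalisation).
word_counts = Counter(word.lower() for name in names for word in name.split())

for word, count in word_counts.most_common(2):
    print(word, count)
```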

Most occurring characters

Value | Count | Frequency (%)
(space) | 182 | 6.9%
[?] | 131 | 4.9%
[?] | 119 | 4.5%
[?] | 113 | 4.3%
[?] | 51 | 1.9%
[?] | 49 | 1.8%
[?] | 40 | 1.5%
[?] | 38 | 1.4%
) | 37 | 1.4%
( | 36 | 1.4%
Other values (403) | 1855 | 70.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2122 | 80.0%
Space Separator | 182 | 6.9%
Uppercase Letter | 145 | 5.5%
Decimal Number | 64 | 2.4%
Lowercase Letter | 57 | 2.2%
Close Punctuation | 37 | 1.4%
Open Punctuation | 36 | 1.4%
Other Punctuation | 7 | 0.3%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 131 | 6.2%
[?] | 119 | 5.6%
[?] | 113 | 5.3%
[?] | 51 | 2.4%
[?] | 49 | 2.3%
[?] | 40 | 1.9%
[?] | 38 | 1.8%
[?] | 35 | 1.6%
[?] | 33 | 1.6%
[?] | 32 | 1.5%
Other values (347) | 1481 | 69.8%

Uppercase Letter
Value | Count | Frequency (%)
C | 26 | 17.9%
G | 14 | 9.7%
E | 12 | 8.3%
S | 12 | 8.3%
U | 9 | 6.2%
P | 9 | 6.2%
A | 8 | 5.5%
O | 7 | 4.8%
I | 7 | 4.8%
T | 6 | 4.1%
Other values (11) | 35 | 24.1%

Lowercase Letter
Value | Count | Frequency (%)
e | 8 | 14.0%
a | 8 | 14.0%
c | 6 | 10.5%
f | 5 | 8.8%
r | 5 | 8.8%
n | 4 | 7.0%
o | 3 | 5.3%
p | 3 | 5.3%
i | 3 | 5.3%
l | 3 | 5.3%
Other values (6) | 9 | 15.8%

Decimal Number
Value | Count | Frequency (%)
5 | 19 | 29.7%
2 | 18 | 28.1%
1 | 8 | 12.5%
9 | 5 | 7.8%
3 | 4 | 6.2%
0 | 3 | 4.7%
4 | 3 | 4.7%
6 | 2 | 3.1%
8 | 1 | 1.6%
7 | 1 | 1.6%

Other Punctuation
Value | Count | Frequency (%)
& | 2 | 28.6%
, | 2 | 28.6%
! | 1 | 14.3%
: | 1 | 14.3%
. | 1 | 14.3%

Space Separator
Value | Count | Frequency (%)
(space) | 182 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 37 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 36 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2120 | 80.0%
Common | 327 | 12.3%
Latin | 202 | 7.6%
Han | 2 | 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[?] | 131 | 6.2%
[?] | 119 | 5.6%
[?] | 113 | 5.3%
[?] | 51 | 2.4%
[?] | 49 | 2.3%
[?] | 40 | 1.9%
[?] | 38 | 1.8%
[?] | 35 | 1.7%
[?] | 33 | 1.6%
[?] | 32 | 1.5%
Other values (345) | 1479 | 69.8%

Latin
Value | Count | Frequency (%)
C | 26 | 12.9%
G | 14 | 6.9%
E | 12 | 5.9%
S | 12 | 5.9%
U | 9 | 4.5%
P | 9 | 4.5%
e | 8 | 4.0%
a | 8 | 4.0%
A | 8 | 4.0%
O | 7 | 3.5%
Other values (27) | 89 | 44.1%

Common
Value | Count | Frequency (%)
(space) | 182 | 55.7%
) | 37 | 11.3%
( | 36 | 11.0%
5 | 19 | 5.8%
2 | 18 | 5.5%
1 | 8 | 2.4%
9 | 5 | 1.5%
3 | 4 | 1.2%
0 | 3 | 0.9%
4 | 3 | 0.9%
Other values (9) | 12 | 3.7%

Han
Value | Count | Frequency (%)
[?] | 1 | 50.0%
[?] | 1 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2120 | 80.0%
ASCII | 529 | 20.0%
CJK | 2 | 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 182 | 34.4%
) | 37 | 7.0%
( | 36 | 6.8%
C | 26 | 4.9%
5 | 19 | 3.6%
2 | 18 | 3.4%
G | 14 | 2.6%
E | 12 | 2.3%
S | 12 | 2.3%
U | 9 | 1.7%
Other values (46) | 164 | 31.0%

Hangul
Value | Count | Frequency (%)
[?] | 131 | 6.2%
[?] | 119 | 5.6%
[?] | 113 | 5.3%
[?] | 51 | 2.4%
[?] | 49 | 2.3%
[?] | 40 | 1.9%
[?] | 38 | 1.8%
[?] | 35 | 1.7%
[?] | 33 | 1.6%
[?] | 32 | 1.5%
Other values (345) | 1479 | 69.8%

CJK
Value | Count | Frequency (%)
[?] | 1 | 50.0%
[?] | 1 | 50.0%

소재지(도로명) (road-name address)
Text

Distinct: 331
Distinct (%): 96.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 51
Median length: 46
Mean length: 24.927326
Min length: 9

Characters and Unicode

Total characters: 8575
Distinct characters: 190
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 319
Unique (%): 92.7%

Sample

1st row: 전라북도 정읍시 칠보면 칠보중앙로 77-4
2nd row: 전라북도 정읍시 북면 정신로 135-61
3rd row: 전라북도 정읍시 중앙로 32 (연지동)
4th row: 전라북도 정읍시 중앙1길 96 (수성동)
5th row: 전라북도 정읍시 중앙1길 146 (수성동)
Value | Count | Frequency (%)
전라북도 | 343 | 18.1%
정읍시 | 343 | 18.1%
수성동 | 99 | 5.2%
1층 | 78 | 4.1%
상동 | 73 | 3.9%
시기동 | 47 | 2.5%
중앙로 | 39 | 2.1%
연지동 | 36 | 1.9%
충정로 | 27 | 1.4%
학산로 | 23 | 1.2%
Other values (412) | 788 | 41.6%

Most occurring characters

Value | Count | Frequency (%)
(space) | 1552 | 18.1%
[?] | 419 | 4.9%
[?] | 401 | 4.7%
[?] | 390 | 4.5%
[?] | 374 | 4.4%
1 | 369 | 4.3%
[?] | 353 | 4.1%
[?] | 344 | 4.0%
[?] | 344 | 4.0%
[?] | 326 | 3.8%
Other values (180) | 3703 | 43.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4916 | 57.3%
Space Separator | 1552 | 18.1%
Decimal Number | 1244 | 14.5%
Close Punctuation | 298 | 3.5%
Open Punctuation | 298 | 3.5%
Other Punctuation | 146 | 1.7%
Dash Punctuation | 103 | 1.2%
Uppercase Letter | 13 | 0.2%
Math Symbol | 3 | < 0.1%
Lowercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 419 | 8.5%
[?] | 401 | 8.2%
[?] | 390 | 7.9%
[?] | 374 | 7.6%
[?] | 353 | 7.2%
[?] | 344 | 7.0%
[?] | 344 | 7.0%
[?] | 326 | 6.6%
[?] | 248 | 5.0%
[?] | 157 | 3.2%
Other values (151) | 1560 | 31.7%

Decimal Number
Value | Count | Frequency (%)
1 | 369 | 29.7%
2 | 185 | 14.9%
5 | 111 | 8.9%
3 | 108 | 8.7%
4 | 93 | 7.5%
6 | 84 | 6.8%
0 | 83 | 6.7%
7 | 78 | 6.3%
8 | 69 | 5.5%
9 | 64 | 5.1%

Uppercase Letter
Value | Count | Frequency (%)
L | 3 | 23.1%
B | 2 | 15.4%
E | 2 | 15.4%
A | 2 | 15.4%
R | 1 | 7.7%
P | 1 | 7.7%
C | 1 | 7.7%
N | 1 | 7.7%

Math Symbol
Value | Count | Frequency (%)
< | 1 | 33.3%
> | 1 | 33.3%
~ | 1 | 33.3%

Other Punctuation
Value | Count | Frequency (%)
, | 145 | 99.3%
· | 1 | 0.7%

Lowercase Letter
Value | Count | Frequency (%)
e | 1 | 50.0%
t | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1552 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 298 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 298 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 103 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4916 | 57.3%
Common | 3644 | 42.5%
Latin | 15 | 0.2%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[?] | 419 | 8.5%
[?] | 401 | 8.2%
[?] | 390 | 7.9%
[?] | 374 | 7.6%
[?] | 353 | 7.2%
[?] | 344 | 7.0%
[?] | 344 | 7.0%
[?] | 326 | 6.6%
[?] | 248 | 5.0%
[?] | 157 | 3.2%
Other values (151) | 1560 | 31.7%

Common
Value | Count | Frequency (%)
(space) | 1552 | 42.6%
1 | 369 | 10.1%
) | 298 | 8.2%
( | 298 | 8.2%
2 | 185 | 5.1%
, | 145 | 4.0%
5 | 111 | 3.0%
3 | 108 | 3.0%
- | 103 | 2.8%
4 | 93 | 2.6%
Other values (9) | 382 | 10.5%

Latin
Value | Count | Frequency (%)
L | 3 | 20.0%
B | 2 | 13.3%
E | 2 | 13.3%
A | 2 | 13.3%
R | 1 | 6.7%
P | 1 | 6.7%
C | 1 | 6.7%
N | 1 | 6.7%
e | 1 | 6.7%
t | 1 | 6.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4916 | 57.3%
ASCII | 3658 | 42.7%
None | 1 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 1552 | 42.4%
1 | 369 | 10.1%
) | 298 | 8.1%
( | 298 | 8.1%
2 | 185 | 5.1%
, | 145 | 4.0%
5 | 111 | 3.0%
3 | 108 | 3.0%
- | 103 | 2.8%
4 | 93 | 2.5%
Other values (18) | 396 | 10.8%

Hangul
Value | Count | Frequency (%)
[?] | 419 | 8.5%
[?] | 401 | 8.2%
[?] | 390 | 7.9%
[?] | 374 | 7.6%
[?] | 353 | 7.2%
[?] | 344 | 7.0%
[?] | 344 | 7.0%
[?] | 326 | 6.6%
[?] | 248 | 5.0%
[?] | 157 | 3.2%
Other values (151) | 1560 | 31.7%

None
Value | Count | Frequency (%)
· | 1 | 100.0%

소재지(지번) (lot-number address)
Text

Distinct: 321
Distinct (%): 93.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB

Length

Max length: 40
Median length: 37
Mean length: 20.491279
Min length: 14

Characters and Unicode

Total characters: 7049
Distinct characters: 174
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 300
Unique (%): 87.2%

Sample

1st row: 전라북도 정읍시 칠보면 시산리 597
2nd row: 전라북도 정읍시 북면 남산리 19-1 외8필지
3rd row: 전라북도 정읍시 연지동 49-11
4th row: 전라북도 정읍시 수성동 596-2
5th row: 전라북도 정읍시 수성동 638-2
Value | Count | Frequency (%)
전라북도 | 344 | 22.9%
정읍시 | 344 | 22.9%
수성동 | 102 | 6.8%
상동 | 74 | 4.9%
시기동 | 48 | 3.2%
연지동 | 36 | 2.4%
신태인읍 | 12 | 0.8%
북면 | 11 | 0.7%
신태인리 | 9 | 0.6%
1층일부 | 9 | 0.6%
Other values (399) | 511 | 34.1%

Most occurring characters

Value | Count | Frequency (%)
(space) | 1485 | 21.1%
[?] | 399 | 5.7%
[?] | 364 | 5.2%
[?] | 360 | 5.1%
[?] | 357 | 5.1%
[?] | 346 | 4.9%
[?] | 345 | 4.9%
[?] | 344 | 4.9%
1 | 315 | 4.5%
[?] | 301 | 4.3%
Other values (164) | 2433 | 34.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3806 | 54.0%
Space Separator | 1485 | 21.1%
Decimal Number | 1442 | 20.5%
Dash Punctuation | 295 | 4.2%
Uppercase Letter | 7 | 0.1%
Close Punctuation | 5 | 0.1%
Open Punctuation | 5 | 0.1%
Lowercase Letter | 2 | < 0.1%
Math Symbol | 1 | < 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 399 | 10.5%
[?] | 364 | 9.6%
[?] | 360 | 9.5%
[?] | 357 | 9.4%
[?] | 346 | 9.1%
[?] | 345 | 9.1%
[?] | 344 | 9.0%
[?] | 301 | 7.9%
[?] | 107 | 2.8%
[?] | 102 | 2.7%
Other values (140) | 781 | 20.5%

Decimal Number
Value | Count | Frequency (%)
1 | 315 | 21.8%
2 | 171 | 11.9%
3 | 170 | 11.8%
4 | 144 | 10.0%
5 | 137 | 9.5%
9 | 113 | 7.8%
6 | 112 | 7.8%
0 | 105 | 7.3%
8 | 94 | 6.5%
7 | 81 | 5.6%

Uppercase Letter
Value | Count | Frequency (%)
L | 2 | 28.6%
N | 1 | 14.3%
A | 1 | 14.3%
Y | 1 | 14.3%
R | 1 | 14.3%
T | 1 | 14.3%

Lowercase Letter
Value | Count | Frequency (%)
t | 1 | 50.0%
e | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1485 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 295 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 5 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 5 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
· | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3806 | 54.0%
Common | 3234 | 45.9%
Latin | 9 | 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[?] | 399 | 10.5%
[?] | 364 | 9.6%
[?] | 360 | 9.5%
[?] | 357 | 9.4%
[?] | 346 | 9.1%
[?] | 345 | 9.1%
[?] | 344 | 9.0%
[?] | 301 | 7.9%
[?] | 107 | 2.8%
[?] | 102 | 2.7%
Other values (140) | 781 | 20.5%

Common
Value | Count | Frequency (%)
(space) | 1485 | 45.9%
1 | 315 | 9.7%
- | 295 | 9.1%
2 | 171 | 5.3%
3 | 170 | 5.3%
4 | 144 | 4.5%
5 | 137 | 4.2%
9 | 113 | 3.5%
6 | 112 | 3.5%
0 | 105 | 3.2%
Other values (6) | 187 | 5.8%

Latin
Value | Count | Frequency (%)
L | 2 | 22.2%
t | 1 | 11.1%
e | 1 | 11.1%
N | 1 | 11.1%
A | 1 | 11.1%
Y | 1 | 11.1%
R | 1 | 11.1%
T | 1 | 11.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3806 | 54.0%
ASCII | 3242 | 46.0%
None | 1 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 1485 | 45.8%
1 | 315 | 9.7%
- | 295 | 9.1%
2 | 171 | 5.3%
3 | 170 | 5.2%
4 | 144 | 4.4%
5 | 137 | 4.2%
9 | 113 | 3.5%
6 | 112 | 3.5%
0 | 105 | 3.2%
Other values (13) | 195 | 6.0%

Hangul
Value | Count | Frequency (%)
[?] | 399 | 10.5%
[?] | 364 | 9.6%
[?] | 360 | 9.5%
[?] | 357 | 9.4%
[?] | 346 | 9.1%
[?] | 345 | 9.1%
[?] | 344 | 9.0%
[?] | 301 | 7.9%
[?] | 107 | 2.8%
[?] | 102 | 2.7%
Other values (140) | 781 | 20.5%

None
Value | Count | Frequency (%)
· | 1 | 100.0%

소재지전화 (phone number)
Text

MISSING

Distinct: 120
Distinct (%): 98.4%
Missing: 222
Missing (%): 64.5%
Memory size: 2.8 KiB

Length

Max length: 14
Median length: 12
Mean length: 12.040984
Min length: 12

Characters and Unicode

Total characters: 1469
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 118
Unique (%): 96.7%
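
Distinct and Unique measure different things here: Distinct is the number of different values, while Unique is the number of values that occur exactly once. With 344 − 222 = 122 non-missing phone numbers, 120 distinct values and 118 unique ones imply that two numbers each appear twice. The sketch below illustrates the distinction on a small list built from numbers in this report. The helper name is hypothetical, and the non-missing denominator is an inference from the reported percentages, not a documented rule.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(v for v in values if v is not None)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Small illustrative list; None stands for a missing phone number.
phones = ["063-532-0510", "063-532-0510", "063-535-7206", None, "063-536-1444"]
print(distinct_and_unique(phones))

# The reported percentages are consistent with a non-missing denominator.
non_missing = 344 - 222
distinct_pct = 100 * 120 / non_missing
unique_pct = 100 * 118 / non_missing
print(f"{distinct_pct:.1f}% distinct, {unique_pct:.1f}% unique")
```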

Sample

1st row: 063-534-3077
2nd row: 063-532-0510
3rd row: 063-535-2414
4th row: 063-535-6050
5th row: 063-531-9635
Value | Count | Frequency (%)
063-532-0510 | 2 | 1.6%
063-532-2373 | 2 | 1.6%
063-535-7206 | 1 | 0.8%
063-536-1444 | 1 | 0.8%
063-535-9004 | 1 | 0.8%
063-570-7575 | 1 | 0.8%
063-536-3346 | 1 | 0.8%
070-7018-4500 | 1 | 0.8%
063-538-0579 | 1 | 0.8%
063-535-3246 | 1 | 0.8%
Other values (110) | 110 | 90.2%
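
The phone numbers in this column use only digits and hyphens, in three hyphen-separated groups with lengths between 12 and 14 characters. A hedged sketch of a format check follows. The regular expression is an assumption inferred from the values visible in this report (`063-…`, `070-…`, `0507-…`), not an official numbering rule.

```python
import re

# Assumed pattern: area/service prefix starting with 0, then two digit groups.
PHONE_RE = re.compile(r"0\d{1,3}-\d{3,4}-\d{4}")

def looks_like_phone(value):
    """Return True if the whole string matches the assumed phone pattern."""
    return PHONE_RE.fullmatch(value) is not None

# Values that appear in this report.
samples = ["063-534-3077", "070-7018-4500", "0507-1399-0432"]
for s in samples:
    print(s, looks_like_phone(s), len(s))
```

A check like this is mainly useful for spotting entries where a number was typed without hyphens or fused with a neighbouring field.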

Most occurring characters

Value | Count | Frequency (%)
3 | 290 | 19.7%
- | 244 | 16.6%
0 | 204 | 13.9%
5 | 190 | 12.9%
6 | 164 | 11.2%
7 | 83 | 5.7%
2 | 71 | 4.8%
8 | 67 | 4.6%
1 | 64 | 4.4%
4 | 47 | 3.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1225 | 83.4%
Dash Punctuation | 244 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 290 | 23.7%
0 | 204 | 16.7%
5 | 190 | 15.5%
6 | 164 | 13.4%
7 | 83 | 6.8%
2 | 71 | 5.8%
8 | 67 | 5.5%
1 | 64 | 5.2%
4 | 47 | 3.8%
9 | 45 | 3.7%

Dash Punctuation
Value | Count | Frequency (%)
- | 244 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1469 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
3 | 290 | 19.7%
- | 244 | 16.6%
0 | 204 | 13.9%
5 | 190 | 12.9%
6 | 164 | 11.2%
7 | 83 | 5.7%
2 | 71 | 4.8%
8 | 67 | 4.6%
1 | 64 | 4.4%
4 | 47 | 3.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1469 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
3 | 290 | 19.7%
- | 244 | 16.6%
0 | 204 | 13.9%
5 | 190 | 12.9%
6 | 164 | 11.2%
7 | 83 | 5.7%
2 | 71 | 4.8%
8 | 67 | 4.6%
1 | 64 | 4.4%
4 | 47 | 3.2%

데이터기준일자 (data reference date)
Date

CONSTANT

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.8 KiB
Minimum: 2023-12-13 00:00:00
Maximum: 2023-12-13 00:00:00

[Figure: Histogram with fixed size bins (bins=1)]

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]
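
As a textual stand-in for the nullity charts, the sketch below counts missing values per column with plain Python. The rows are a small subset taken from the sample at the end of this report, reduced to two columns, so the printed counts describe only this subset. The full-column missing rate is then recomputed from the reported totals.

```python
# Subset of rows from this report's sample; None stands for <NA>.
rows = [
    {"업소명": "씨유 정읍제일점", "소재지전화": None},
    {"업소명": "물멍커피", "소재지전화": "0507-1399-0432"},
    {"업소명": "컴포즈커피 수성점", "소재지전화": "063-532-7550"},
    {"업소명": "씨유 태인IC점", "소재지전화": None},
]

# Count missing cells per column.
missing_by_column = {
    column: sum(1 for row in rows if row[column] is None)
    for column in rows[0]
}
print(missing_by_column)

# Full-column rate for 소재지전화, from the totals reported above.
phone_missing_pct = 100 * 222 / 344
print(f"{phone_missing_pct:.1f}%")
```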

Sample

Index | 업소명 | 소재지(도로명) | 소재지(지번) | 소재지전화 | 데이터기준일자
0 | 삼화다방 | 전라북도 정읍시 칠보면 칠보중앙로 77-4 | 전라북도 정읍시 칠보면 시산리 597 | 063-534-3077 | 2023-12-13
1 | 정읍녹두장군(하행)휴게소GS25편의점 | 전라북도 정읍시 북면 정신로 135-61 | 전라북도 정읍시 북면 남산리 19-1 외8필지 | 063-532-0510 | 2023-12-13
2 | 약속다방 | 전라북도 정읍시 중앙로 32 (연지동) | 전라북도 정읍시 연지동 49-11 | 063-535-2414 | 2023-12-13
3 | 신세계다방 | 전라북도 정읍시 중앙1길 96 (수성동) | 전라북도 정읍시 수성동 596-2 | 063-535-6050 | 2023-12-13
4 | 모두랑쌍화탕 | 전라북도 정읍시 중앙1길 146 (수성동) | 전라북도 정읍시 수성동 638-2 | 063-531-9635 | 2023-12-13
5 | 애마다방 | 전라북도 정읍시 중앙로 23-1 (연지동) | 전라북도 정읍시 연지동 315-11 | 063-531-9384 | 2023-12-13
6 | 청수찻집 | 전라북도 정읍시 정우면 정신로 598 (초강리 55-17) | 전라북도 정읍시 정우면 초강리 55-17 | 063-537-5803 | 2023-12-13
7 | 쵸이스커피숍 | 전라북도 정읍시 중앙2길 19 (수성동) | 전라북도 정읍시 수성동 591-4 | 063-532-7281 | 2023-12-13
8 | 롯데리아정읍점 | 전라북도 정읍시 조곡천1길 55 (수성동) | 전라북도 정읍시 수성동 558-4 | 063-537-2000 | 2023-12-13
9 | 보리다방 | 전라북도 정읍시 신태인읍 서태길 38 | 전라북도 정읍시 신태인읍 신태인리 228-63 | 063-571-3111 | 2023-12-13

Index | 업소명 | 소재지(도로명) | 소재지(지번) | 소재지전화 | 데이터기준일자
334 | 씨유 정읍제일점 | 전라북도 정읍시 수성택지7길 30, 1층 (수성동) | 전라북도 정읍시 수성동 1034-10 | <NA> | 2023-12-13
335 | 텐퍼센트커피 정읍상동점 | 전라북도 정읍시 학산로 117-16, 부경타운 1동 101호 (상동) | 전라북도 정읍시 상동 241-1 | <NA> | 2023-12-13
336 | 물멍커피 | 전라북도 정읍시 상신경2길 1, 1층 (상동) | 전라북도 정읍시 상동 233-4 | 0507-1399-0432 | 2023-12-13
337 | 세븐일레븐 정읍미소중앙점 | 전라북도 정읍시 중앙로 286, 1층 (상동) | 전라북도 정읍시 상동 310-8 | <NA> | 2023-12-13
338 | 지에스25 정읍서부로점 | 전라북도 정읍시 서부산업도로 182-1 (상평동) | 전라북도 정읍시 상평동 95-7 | <NA> | 2023-12-13
339 | 씨유 정읍스타점 | 전라북도 정읍시 충정로 622, 우림주유소 (용계동) | 전라북도 정읍시 용계동 390-7 | <NA> | 2023-12-13
340 | 씨유 태인IC점 | 전라북도 정읍시 태인면 석지로 1427 | 전라북도 정읍시 태인면 태성리 628-32 | <NA> | 2023-12-13
341 | 컴포즈커피 수성점 | 전라북도 정읍시 수성로 45 (수성동) | 전라북도 정읍시 수성동 919-5 | 063-532-7550 | 2023-12-13
342 | 벌크커피 신태인점 | 전라북도 정읍시 신태인읍 신태인1길 107, 1층 | 전라북도 정읍시 신태인읍 신태인리 141-2 | 063-927-9920 | 2023-12-13
343 | 지에스25 정읍상동힐스점 | 전라북도 정읍시 상동중앙로 80 (상동) | 전라북도 정읍시 상동 133-8 | <NA> | 2023-12-13