Overview

Dataset statistics

Number of variables5
Number of observations2742
Missing cells308
Missing cells (%)2.2%
Duplicate rows3
Duplicate rows (%)0.1%
Total size in memory107.2 KiB
Average record size in memory40.0 B

Variable types

Text4
Categorical1

Dataset

Description칠곡사랑상품권 취급 가맹점 정보
Author경상북도 칠곡군
URLhttps://www.data.go.kr/data/3047587/fileData.do

Alerts

영업 has constant value ""Constant
Dataset has 3 (0.1%) duplicate rowsDuplicates
전화번호 has 308 (11.2%) missing valuesMissing

Reproduction

Analysis started2023-12-12 22:44:32.096186
Analysis finished2023-12-12 22:44:32.963696
Duration0.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct2651
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size21.6 KiB
2023-12-13T07:44:33.141829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length19
Mean length5.9657185
Min length1

Characters and Unicode

Total characters16358
Distinct characters770
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2604 ?
Unique (%)95.0%

Sample

1st row(사)경북장애인부모회칠곡군지부
2nd row(유)우현리베라웨딩뷔페
3rd row(존)삼일타이어
4th row(주)거양건설
5th row(주)뉴그린렌트카(왜관지점)
ValueCountFrequency (%)
개인택시 44
 
1.6%
왜관점 6
 
0.2%
큰집막창 3
 
0.1%
파워마트 3
 
0.1%
세븐일레븐 3
 
0.1%
신한우촌 3
 
0.1%
세븐일레븐칠곡왜관중앙점 2
 
0.1%
인평주유소 2
 
0.1%
현대철물 2
 
0.1%
기쁨주는머리방 2
 
0.1%
Other values (2680) 2726
97.5%
2023-12-13T07:44:33.512412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
273
 
1.7%
267
 
1.6%
259
 
1.6%
250
 
1.5%
222
 
1.4%
217
 
1.3%
210
 
1.3%
209
 
1.3%
198
 
1.2%
197
 
1.2%
Other values (760) 14056
85.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15590
95.3%
Uppercase Letter 224
 
1.4%
Close Punctuation 130
 
0.8%
Open Punctuation 129
 
0.8%
Decimal Number 116
 
0.7%
Lowercase Letter 75
 
0.5%
Space Separator 64
 
0.4%
Other Punctuation 29
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
273
 
1.8%
267
 
1.7%
259
 
1.7%
250
 
1.6%
222
 
1.4%
217
 
1.4%
210
 
1.3%
209
 
1.3%
198
 
1.3%
197
 
1.3%
Other values (691) 13288
85.2%
Uppercase Letter
ValueCountFrequency (%)
G 30
13.4%
S 26
11.6%
C 19
 
8.5%
P 18
 
8.0%
B 16
 
7.1%
L 14
 
6.2%
A 11
 
4.9%
K 11
 
4.9%
E 9
 
4.0%
T 9
 
4.0%
Other values (16) 61
27.2%
Lowercase Letter
ValueCountFrequency (%)
e 10
13.3%
s 7
 
9.3%
i 7
 
9.3%
c 6
 
8.0%
a 5
 
6.7%
n 5
 
6.7%
o 4
 
5.3%
k 3
 
4.0%
w 3
 
4.0%
b 3
 
4.0%
Other values (12) 22
29.3%
Decimal Number
ValueCountFrequency (%)
2 32
27.6%
5 18
15.5%
1 15
12.9%
0 14
12.1%
3 13
11.2%
6 7
 
6.0%
4 6
 
5.2%
8 5
 
4.3%
7 4
 
3.4%
9 2
 
1.7%
Other Punctuation
ValueCountFrequency (%)
. 10
34.5%
& 8
27.6%
, 4
 
13.8%
? 3
 
10.3%
/ 2
 
6.9%
# 1
 
3.4%
' 1
 
3.4%
Close Punctuation
ValueCountFrequency (%)
) 130
100.0%
Open Punctuation
ValueCountFrequency (%)
( 129
100.0%
Space Separator
ValueCountFrequency (%)
64
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15589
95.3%
Common 469
 
2.9%
Latin 299
 
1.8%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
273
 
1.8%
267
 
1.7%
259
 
1.7%
250
 
1.6%
222
 
1.4%
217
 
1.4%
210
 
1.3%
209
 
1.3%
198
 
1.3%
197
 
1.3%
Other values (690) 13287
85.2%
Latin
ValueCountFrequency (%)
G 30
 
10.0%
S 26
 
8.7%
C 19
 
6.4%
P 18
 
6.0%
B 16
 
5.4%
L 14
 
4.7%
A 11
 
3.7%
K 11
 
3.7%
e 10
 
3.3%
E 9
 
3.0%
Other values (38) 135
45.2%
Common
ValueCountFrequency (%)
) 130
27.7%
( 129
27.5%
64
13.6%
2 32
 
6.8%
5 18
 
3.8%
1 15
 
3.2%
0 14
 
3.0%
3 13
 
2.8%
. 10
 
2.1%
& 8
 
1.7%
Other values (11) 36
 
7.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15589
95.3%
ASCII 768
 
4.7%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
273
 
1.8%
267
 
1.7%
259
 
1.7%
250
 
1.6%
222
 
1.4%
217
 
1.4%
210
 
1.3%
209
 
1.3%
198
 
1.3%
197
 
1.3%
Other values (690) 13287
85.2%
ASCII
ValueCountFrequency (%)
) 130
16.9%
( 129
16.8%
64
 
8.3%
2 32
 
4.2%
G 30
 
3.9%
S 26
 
3.4%
C 19
 
2.5%
P 18
 
2.3%
5 18
 
2.3%
B 16
 
2.1%
Other values (59) 286
37.2%
CJK
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct2340
Distinct (%)96.1%
Missing308
Missing (%)11.2%
Memory size21.6 KiB
2023-12-13T07:44:33.778322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.991372
Min length5

Characters and Unicode

Total characters29187
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2283 ?
Unique (%)93.8%

Sample

1st row054-974-0159
2nd row054-975-8300
3rd row054-971-0985
4th row054-974-0025
5th row054-975-7007
ValueCountFrequency (%)
054-973-3875 33
 
1.4%
054-977-7777 6
 
0.2%
054-973-0904 3
 
0.1%
054 3
 
0.1%
054-973-8855 2
 
0.1%
054-973-2060 2
 
0.1%
054-974-9990 2
 
0.1%
054-975-8866 2
 
0.1%
054-977-0045 2
 
0.1%
054-975-9555 2
 
0.1%
Other values (2330) 2377
97.7%
2023-12-13T07:44:34.187754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 4868
16.7%
5 3810
13.1%
7 3791
13.0%
0 3729
12.8%
4 3534
12.1%
9 3347
11.5%
3 1465
 
5.0%
2 1293
 
4.4%
1 1249
 
4.3%
8 1064
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24319
83.3%
Dash Punctuation 4868
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 3810
15.7%
7 3791
15.6%
0 3729
15.3%
4 3534
14.5%
9 3347
13.8%
3 1465
 
6.0%
2 1293
 
5.3%
1 1249
 
5.1%
8 1064
 
4.4%
6 1037
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 4868
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 29187
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 4868
16.7%
5 3810
13.1%
7 3791
13.0%
0 3729
12.8%
4 3534
12.1%
9 3347
11.5%
3 1465
 
5.0%
2 1293
 
4.4%
1 1249
 
4.3%
8 1064
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29187
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 4868
16.7%
5 3810
13.1%
7 3791
13.0%
0 3729
12.8%
4 3534
12.1%
9 3347
11.5%
3 1465
 
5.0%
2 1293
 
4.4%
1 1249
 
4.3%
8 1064
 
3.6%

업태
Text

Distinct849
Distinct (%)31.0%
Missing0
Missing (%)0.0%
Memory size21.6 KiB
2023-12-13T07:44:34.460574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length4.0999271
Min length1

Characters and Unicode

Total characters11242
Distinct characters300
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique657 ?
Unique (%)24.0%

Sample

1st row비영리 장애인 단체
2nd row서비스/예식장
3rd row타이어
4th row제조/건설
5th row서비스
ValueCountFrequency (%)
음숙 297
 
10.4%
소매 172
 
6.0%
음숙/한식 166
 
5.8%
음식 148
 
5.2%
서비스 118
 
4.1%
소매업 86
 
3.0%
한식 80
 
2.8%
도소매 62
 
2.2%
학원 56
 
2.0%
택시 43
 
1.5%
Other values (830) 1639
57.2%
2023-12-13T07:44:34.891730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 988
 
8.8%
874
 
7.8%
854
 
7.6%
838
 
7.5%
757
 
6.7%
615
 
5.5%
347
 
3.1%
334
 
3.0%
331
 
2.9%
329
 
2.9%
Other values (290) 4975
44.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9924
88.3%
Other Punctuation 1159
 
10.3%
Space Separator 129
 
1.1%
Close Punctuation 9
 
0.1%
Open Punctuation 9
 
0.1%
Uppercase Letter 6
 
0.1%
Dash Punctuation 4
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
874
 
8.8%
854
 
8.6%
838
 
8.4%
757
 
7.6%
615
 
6.2%
347
 
3.5%
334
 
3.4%
331
 
3.3%
329
 
3.3%
313
 
3.2%
Other values (277) 4332
43.7%
Other Punctuation
ValueCountFrequency (%)
/ 988
85.2%
. 112
 
9.7%
, 58
 
5.0%
? 1
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
G 2
33.3%
P 2
33.3%
L 2
33.3%
Lowercase Letter
ValueCountFrequency (%)
c 1
50.0%
p 1
50.0%
Space Separator
ValueCountFrequency (%)
129
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9924
88.3%
Common 1310
 
11.7%
Latin 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
874
 
8.8%
854
 
8.6%
838
 
8.4%
757
 
7.6%
615
 
6.2%
347
 
3.5%
334
 
3.4%
331
 
3.3%
329
 
3.3%
313
 
3.2%
Other values (277) 4332
43.7%
Common
ValueCountFrequency (%)
/ 988
75.4%
129
 
9.8%
. 112
 
8.5%
, 58
 
4.4%
) 9
 
0.7%
( 9
 
0.7%
- 4
 
0.3%
? 1
 
0.1%
Latin
ValueCountFrequency (%)
G 2
25.0%
P 2
25.0%
L 2
25.0%
c 1
12.5%
p 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9924
88.3%
ASCII 1318
 
11.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 988
75.0%
129
 
9.8%
. 112
 
8.5%
, 58
 
4.4%
) 9
 
0.7%
( 9
 
0.7%
- 4
 
0.3%
G 2
 
0.2%
P 2
 
0.2%
L 2
 
0.2%
Other values (3) 3
 
0.2%
Hangul
ValueCountFrequency (%)
874
 
8.8%
854
 
8.6%
838
 
8.4%
757
 
7.6%
615
 
6.2%
347
 
3.5%
334
 
3.4%
331
 
3.3%
329
 
3.3%
313
 
3.2%
Other values (277) 4332
43.7%

주소
Text

Distinct2445
Distinct (%)89.2%
Missing0
Missing (%)0.0%
Memory size21.6 KiB
2023-12-13T07:44:35.316549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length39
Mean length20.118162
Min length12

Characters and Unicode

Total characters55164
Distinct characters198
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2240 ?
Unique (%)81.7%

Sample

1st row경북칠곡군약목면 복성리 562-5번지
2nd row경북칠곡군왜관읍 왜관리 746-1번지
3rd row경북칠곡군왜관읍 금산리 854-1
4th row경북칠곡군왜관읍 왜관리 788-12번지
5th row경북칠곡군왜관읍 삼청리 483-12번지
ValueCountFrequency (%)
경북칠곡군왜관읍 1171
 
13.5%
왜관리 652
 
7.5%
경북칠곡군북삼읍 494
 
5.7%
경북칠곡군석적읍 426
 
4.9%
인평리 325
 
3.7%
중리 262
 
3.0%
경북칠곡군약목면 204
 
2.3%
칠곡군 144
 
1.7%
경북 143
 
1.6%
경북칠곡군동명면 106
 
1.2%
Other values (2511) 4778
54.9%
2023-12-13T07:44:35.846876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6794
 
12.3%
3277
 
5.9%
2763
 
5.0%
2759
 
5.0%
2750
 
5.0%
2748
 
5.0%
1 2602
 
4.7%
2431
 
4.4%
2219
 
4.0%
2181
 
4.0%
Other values (188) 24640
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34204
62.0%
Decimal Number 11870
 
21.5%
Space Separator 6794
 
12.3%
Dash Punctuation 2134
 
3.9%
Other Punctuation 68
 
0.1%
Close Punctuation 32
 
0.1%
Open Punctuation 32
 
0.1%
Uppercase Letter 24
 
< 0.1%
Lowercase Letter 4
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3277
 
9.6%
2763
 
8.1%
2759
 
8.1%
2750
 
8.0%
2748
 
8.0%
2431
 
7.1%
2219
 
6.5%
2181
 
6.4%
2121
 
6.2%
1686
 
4.9%
Other values (162) 9269
27.1%
Decimal Number
ValueCountFrequency (%)
1 2602
21.9%
2 1884
15.9%
7 1123
9.5%
3 1088
9.2%
0 973
 
8.2%
8 866
 
7.3%
4 854
 
7.2%
6 849
 
7.2%
5 837
 
7.1%
9 794
 
6.7%
Other Punctuation
ValueCountFrequency (%)
@ 25
36.8%
/ 20
29.4%
, 13
19.1%
. 10
 
14.7%
Uppercase Letter
ValueCountFrequency (%)
B 10
41.7%
L 10
41.7%
A 3
 
12.5%
C 1
 
4.2%
Lowercase Letter
ValueCountFrequency (%)
s 2
50.0%
g 2
50.0%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
~ 1
50.0%
Space Separator
ValueCountFrequency (%)
6794
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2134
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34204
62.0%
Common 20932
37.9%
Latin 28
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3277
 
9.6%
2763
 
8.1%
2759
 
8.1%
2750
 
8.0%
2748
 
8.0%
2431
 
7.1%
2219
 
6.5%
2181
 
6.4%
2121
 
6.2%
1686
 
4.9%
Other values (162) 9269
27.1%
Common
ValueCountFrequency (%)
6794
32.5%
1 2602
 
12.4%
- 2134
 
10.2%
2 1884
 
9.0%
7 1123
 
5.4%
3 1088
 
5.2%
0 973
 
4.6%
8 866
 
4.1%
4 854
 
4.1%
6 849
 
4.1%
Other values (10) 1765
 
8.4%
Latin
ValueCountFrequency (%)
B 10
35.7%
L 10
35.7%
A 3
 
10.7%
s 2
 
7.1%
g 2
 
7.1%
C 1
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34204
62.0%
ASCII 20960
38.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6794
32.4%
1 2602
 
12.4%
- 2134
 
10.2%
2 1884
 
9.0%
7 1123
 
5.4%
3 1088
 
5.2%
0 973
 
4.6%
8 866
 
4.1%
4 854
 
4.1%
6 849
 
4.1%
Other values (16) 1793
 
8.6%
Hangul
ValueCountFrequency (%)
3277
 
9.6%
2763
 
8.1%
2759
 
8.1%
2750
 
8.0%
2748
 
8.0%
2431
 
7.1%
2219
 
6.5%
2181
 
6.4%
2121
 
6.2%
1686
 
4.9%
Other values (162) 9269
27.1%

영업
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size21.6 KiB
정상영업
2742 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상영업
2nd row정상영업
3rd row정상영업
4th row정상영업
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 2742
100.0%

Length

2023-12-13T07:44:35.969191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:44:36.047926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 2742
100.0%

Missing values

2023-12-13T07:44:32.837243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:44:32.924578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

가맹점명전화번호업태주소영업
0(사)경북장애인부모회칠곡군지부054-974-0159비영리 장애인 단체경북칠곡군약목면 복성리 562-5번지정상영업
1(유)우현리베라웨딩뷔페054-975-8300서비스/예식장경북칠곡군왜관읍 왜관리 746-1번지정상영업
2(존)삼일타이어054-971-0985타이어경북칠곡군왜관읍 금산리 854-1정상영업
3(주)거양건설054-974-0025제조/건설경북칠곡군왜관읍 왜관리 788-12번지정상영업
4(주)뉴그린렌트카(왜관지점)054-975-7007서비스경북칠곡군왜관읍 삼청리 483-12번지정상영업
5(주)대교삼창운수주유소054-972-3331소매,운수업경북칠곡군약목면 칠곡대로 1153정상영업
6(주)대화산기054-979-3001금속/철강경북칠곡군왜관읍 낙산리 676-1정상영업
7(주)더건강한나눔밥상된장과김치찌개<NA>음숙경북칠곡군석적읍 석적로 950정상영업
8(주)동서건설054-975-2539건설업경북칠곡군왜관읍 석전리 721-5정상영업
9(주)동서상사054-973-2945제조/도.소매경북칠곡군가산면 다부리 457정상영업
가맹점명전화번호업태주소영업
2732PAT왜관점054-973-4593의류업경북칠곡군왜관읍 왜관1리 211-172번지정상영업
2733S노래연습장<NA>서비스/노래방경북칠곡군북삼읍 인평리 1038-1번지정상영업
2734SK네트웍스(주)가산IC주유소054-976-6051도.소매/주유소경북칠곡군가산면 천평리 4번지정상영업
2735SK동행주유소054-971-5101주유소경북칠곡군왜관읍 금산리 860-22정상영업
2736SK제일주유소054-971-1085서비스/주유소경북칠곡군가산면 금화리 152-1번지정상영업
2737SK종합프라자054-971-8844도소매/통신장비.광고경북칠곡군왜관읍 왜관1리 211-218번지정상영업
2738SS식자재마트054-979-1214소매업,부동산업경북칠곡군석적읍 북중리7길 5정상영업
2739The # (더샵)<NA>소매경북 칠곡군 왜관읍 중앙로 235 (1층)정상영업
2740UGlZ054-977-1551도소매경북칠곡군왜관읍 왜관리 230-30번지정상영업
2741VIP종합카센타054-972-5517서비스경북칠곡군왜관읍 석전리 778정상영업

Duplicate rows

Most frequently occurring

가맹점명전화번호업태주소영업# duplicates
0개인택시054-973-3875택시경북칠곡군왜관읍 왜관리 197-2번지정상영업32
1개인택시054-977-7777택시경북칠곡군왜관읍 왜관리 197-2번지정상영업6
2개인택시<NA>택시경북칠곡군왜관읍 왜관리 197-2번지정상영업3