Overview

Dataset statistics

Number of variables4
Number of observations405
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.8 KiB
Average record size in memory32.3 B

Variable types

Text4

Dataset

Description삼척시 관리 담배소매인 사업 지정현황 정보를 통하여 누구나 데이터를 이용하여 담배소매인 사업 지정현황
Author강원도 삼척시
URLhttps://www.data.go.kr/data/15021279/fileData.do

Reproduction

Analysis started2023-12-11 23:41:49.909389
Analysis finished2023-12-11 23:41:50.461408
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct373
Distinct (%)92.1%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-12T08:41:50.726102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length3
Mean length3.6296296
Min length2

Characters and Unicode

Total characters1470
Distinct characters201
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique351 ?
Unique (%)86.7%

Sample

1st row황규호
2nd row양춘자
3rd row양춘자
4th row권옥랑
5th row이정임
ValueCountFrequency (%)
인제지원본부 9
 
2.1%
국군복지단 8
 
1.9%
김미경 4
 
0.9%
삼척농협장 3
 
0.7%
홍순용 3
 
0.7%
김유미 2
 
0.5%
주식회사 2
 
0.5%
김태훈 2
 
0.5%
김옥녀 2
 
0.5%
강원대학교삼척캠퍼스 2
 
0.5%
Other values (375) 391
91.4%
2023-12-12T08:41:51.225752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
111
 
7.6%
60
 
4.1%
41
 
2.8%
36
 
2.4%
35
 
2.4%
34
 
2.3%
32
 
2.2%
29
 
2.0%
26
 
1.8%
26
 
1.8%
Other values (191) 1040
70.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1428
97.1%
Space Separator 23
 
1.6%
Close Punctuation 8
 
0.5%
Open Punctuation 8
 
0.5%
Other Punctuation 2
 
0.1%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
111
 
7.8%
60
 
4.2%
41
 
2.9%
36
 
2.5%
35
 
2.5%
34
 
2.4%
32
 
2.2%
29
 
2.0%
26
 
1.8%
26
 
1.8%
Other values (186) 998
69.9%
Space Separator
ValueCountFrequency (%)
23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Other Punctuation
ValueCountFrequency (%)
2
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1428
97.1%
Common 42
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
111
 
7.8%
60
 
4.2%
41
 
2.9%
36
 
2.5%
35
 
2.5%
34
 
2.4%
32
 
2.2%
29
 
2.0%
26
 
1.8%
26
 
1.8%
Other values (186) 998
69.9%
Common
ValueCountFrequency (%)
23
54.8%
) 8
 
19.0%
( 8
 
19.0%
2
 
4.8%
1 1
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1428
97.1%
ASCII 40
 
2.7%
None 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
111
 
7.8%
60
 
4.2%
41
 
2.9%
36
 
2.5%
35
 
2.5%
34
 
2.4%
32
 
2.2%
29
 
2.0%
26
 
1.8%
26
 
1.8%
Other values (186) 998
69.9%
ASCII
ValueCountFrequency (%)
23
57.5%
) 8
 
20.0%
( 8
 
20.0%
1 1
 
2.5%
None
ValueCountFrequency (%)
2
100.0%
Distinct380
Distinct (%)93.8%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-12T08:41:51.493116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length7.017284
Min length2

Characters and Unicode

Total characters2842
Distinct characters342
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique374 ?
Unique (%)92.3%

Sample

1st row유워시 셀프세차장
2nd row교동새마을부녀회
3rd row교동새마을부녀회
4th row나원식당
5th row대성디자인건설
ValueCountFrequency (%)
상호없음 21
 
4.0%
세븐일레븐 15
 
2.8%
gs25 9
 
1.7%
씨유 7
 
1.3%
이마트24 6
 
1.1%
미니스톱 6
 
1.1%
슈퍼 5
 
0.9%
주식회사 3
 
0.6%
수퍼 3
 
0.6%
편의점 3
 
0.6%
Other values (422) 451
85.3%
2023-12-12T08:41:51.889137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
134
 
4.7%
124
 
4.4%
116
 
4.1%
111
 
3.9%
64
 
2.3%
55
 
1.9%
52
 
1.8%
2 51
 
1.8%
46
 
1.6%
44
 
1.5%
Other values (332) 2045
72.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2456
86.4%
Space Separator 124
 
4.4%
Decimal Number 114
 
4.0%
Uppercase Letter 88
 
3.1%
Open Punctuation 26
 
0.9%
Close Punctuation 26
 
0.9%
Lowercase Letter 4
 
0.1%
Other Punctuation 3
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
134
 
5.5%
116
 
4.7%
111
 
4.5%
64
 
2.6%
55
 
2.2%
52
 
2.1%
46
 
1.9%
44
 
1.8%
41
 
1.7%
38
 
1.5%
Other values (303) 1755
71.5%
Uppercase Letter
ValueCountFrequency (%)
S 25
28.4%
G 24
27.3%
C 15
17.0%
U 12
13.6%
E 3
 
3.4%
K 2
 
2.3%
D 2
 
2.3%
I 1
 
1.1%
Y 1
 
1.1%
V 1
 
1.1%
Other values (2) 2
 
2.3%
Decimal Number
ValueCountFrequency (%)
2 51
44.7%
5 32
28.1%
4 9
 
7.9%
1 7
 
6.1%
9 5
 
4.4%
0 4
 
3.5%
3 3
 
2.6%
7 2
 
1.8%
8 1
 
0.9%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
& 1
33.3%
Lowercase Letter
ValueCountFrequency (%)
s 2
50.0%
g 2
50.0%
Space Separator
ValueCountFrequency (%)
124
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2456
86.4%
Common 294
 
10.3%
Latin 92
 
3.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
134
 
5.5%
116
 
4.7%
111
 
4.5%
64
 
2.6%
55
 
2.2%
52
 
2.1%
46
 
1.9%
44
 
1.8%
41
 
1.7%
38
 
1.5%
Other values (303) 1755
71.5%
Common
ValueCountFrequency (%)
124
42.2%
2 51
17.3%
5 32
 
10.9%
( 26
 
8.8%
) 26
 
8.8%
4 9
 
3.1%
1 7
 
2.4%
9 5
 
1.7%
0 4
 
1.4%
3 3
 
1.0%
Other values (5) 7
 
2.4%
Latin
ValueCountFrequency (%)
S 25
27.2%
G 24
26.1%
C 15
16.3%
U 12
13.0%
E 3
 
3.3%
K 2
 
2.2%
s 2
 
2.2%
g 2
 
2.2%
D 2
 
2.2%
I 1
 
1.1%
Other values (4) 4
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2456
86.4%
ASCII 386
 
13.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
134
 
5.5%
116
 
4.7%
111
 
4.5%
64
 
2.6%
55
 
2.2%
52
 
2.1%
46
 
1.9%
44
 
1.8%
41
 
1.7%
38
 
1.5%
Other values (303) 1755
71.5%
ASCII
ValueCountFrequency (%)
124
32.1%
2 51
13.2%
5 32
 
8.3%
( 26
 
6.7%
) 26
 
6.7%
S 25
 
6.5%
G 24
 
6.2%
C 15
 
3.9%
U 12
 
3.1%
4 9
 
2.3%
Other values (19) 42
 
10.9%
Distinct330
Distinct (%)81.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-12T08:41:52.184890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length31
Mean length19.441975
Min length1

Characters and Unicode

Total characters7874
Distinct characters193
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique328 ?
Unique (%)81.0%

Sample

1st row강원도 삼척시 교동 222-2 .222-3
2nd row강원도 삼척시 교동 산 80-1
3rd row강원도 삼척시 교동 86-3
4th row강원도 삼척시 하장면 역둔리 3-4
5th row강원도 삼척시 당저동 140-2
ValueCountFrequency (%)
강원도 330
 
17.2%
삼척시 330
 
17.2%
도계읍 60
 
3.1%
원덕읍 44
 
2.3%
근덕면 42
 
2.2%
42
 
2.2%
남양동 39
 
2.0%
교동 34
 
1.8%
1호 32
 
1.7%
3호 28
 
1.5%
Other values (467) 934
48.8%
2023-12-12T08:41:52.637245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1953
24.8%
417
 
5.3%
412
 
5.2%
333
 
4.2%
333
 
4.2%
332
 
4.2%
331
 
4.2%
281
 
3.6%
1 271
 
3.4%
230
 
2.9%
Other values (183) 2981
37.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4615
58.6%
Space Separator 1953
24.8%
Decimal Number 1264
 
16.1%
Dash Punctuation 31
 
0.4%
Uppercase Letter 4
 
0.1%
Other Punctuation 3
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
417
 
9.0%
412
 
8.9%
333
 
7.2%
333
 
7.2%
332
 
7.2%
331
 
7.2%
281
 
6.1%
230
 
5.0%
228
 
4.9%
180
 
3.9%
Other values (164) 1538
33.3%
Decimal Number
ValueCountFrequency (%)
1 271
21.4%
3 174
13.8%
2 168
13.3%
4 111
8.8%
0 109
8.6%
5 109
8.6%
8 91
 
7.2%
6 91
 
7.2%
7 73
 
5.8%
9 67
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
A 1
25.0%
S 1
25.0%
G 1
25.0%
C 1
25.0%
Space Separator
ValueCountFrequency (%)
1953
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4615
58.6%
Common 3255
41.3%
Latin 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
417
 
9.0%
412
 
8.9%
333
 
7.2%
333
 
7.2%
332
 
7.2%
331
 
7.2%
281
 
6.1%
230
 
5.0%
228
 
4.9%
180
 
3.9%
Other values (164) 1538
33.3%
Common
ValueCountFrequency (%)
1953
60.0%
1 271
 
8.3%
3 174
 
5.3%
2 168
 
5.2%
4 111
 
3.4%
0 109
 
3.3%
5 109
 
3.3%
8 91
 
2.8%
6 91
 
2.8%
7 73
 
2.2%
Other values (5) 105
 
3.2%
Latin
ValueCountFrequency (%)
A 1
25.0%
S 1
25.0%
G 1
25.0%
C 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4615
58.6%
ASCII 3259
41.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1953
59.9%
1 271
 
8.3%
3 174
 
5.3%
2 168
 
5.2%
4 111
 
3.4%
0 109
 
3.3%
5 109
 
3.3%
8 91
 
2.8%
6 91
 
2.8%
7 73
 
2.2%
Other values (9) 109
 
3.3%
Hangul
ValueCountFrequency (%)
417
 
9.0%
412
 
8.9%
333
 
7.2%
333
 
7.2%
332
 
7.2%
331
 
7.2%
281
 
6.1%
230
 
5.0%
228
 
4.9%
180
 
3.9%
Other values (164) 1538
33.3%
Distinct309
Distinct (%)76.3%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-12T08:41:52.957873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length41
Mean length18.651852
Min length1

Characters and Unicode

Total characters7554
Distinct characters226
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique305 ?
Unique (%)75.3%

Sample

1st row
2nd row강원도 삼척시 새천년도로 233. 소망의탑 (교동)
3rd row강원도 삼척시 새천년도로 326. 조각공원 (교동)
4th row강원도 삼척시 하장면 역둔원동로 12-19
5th row강원도 삼척시 진주로 55 (당저동)
ValueCountFrequency (%)
강원도 312
 
18.3%
삼척시 312
 
18.3%
근덕면 47
 
2.8%
원덕읍 46
 
2.7%
도계읍 43
 
2.5%
남양동 40
 
2.3%
교동 34
 
2.0%
삼척로 33
 
1.9%
1층 22
 
1.3%
정상동 19
 
1.1%
Other values (484) 796
46.7%
2023-12-12T08:41:53.516940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1505
19.9%
405
 
5.4%
403
 
5.3%
355
 
4.7%
354
 
4.7%
327
 
4.3%
318
 
4.2%
1 246
 
3.3%
209
 
2.8%
182
 
2.4%
Other values (216) 3250
43.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4421
58.5%
Space Separator 1505
 
19.9%
Decimal Number 1114
 
14.7%
Open Punctuation 162
 
2.1%
Close Punctuation 162
 
2.1%
Other Punctuation 102
 
1.4%
Dash Punctuation 76
 
1.0%
Uppercase Letter 12
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
405
 
9.2%
403
 
9.1%
355
 
8.0%
354
 
8.0%
327
 
7.4%
318
 
7.2%
209
 
4.7%
182
 
4.1%
119
 
2.7%
98
 
2.2%
Other values (194) 1651
37.3%
Decimal Number
ValueCountFrequency (%)
1 246
22.1%
2 152
13.6%
4 117
10.5%
3 116
10.4%
6 88
 
7.9%
9 87
 
7.8%
5 86
 
7.7%
0 83
 
7.5%
7 71
 
6.4%
8 68
 
6.1%
Uppercase Letter
ValueCountFrequency (%)
B 3
25.0%
A 3
25.0%
C 3
25.0%
U 1
 
8.3%
S 1
 
8.3%
G 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
. 101
99.0%
& 1
 
1.0%
Space Separator
ValueCountFrequency (%)
1505
100.0%
Open Punctuation
ValueCountFrequency (%)
( 162
100.0%
Close Punctuation
ValueCountFrequency (%)
) 162
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 76
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4421
58.5%
Common 3121
41.3%
Latin 12
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
405
 
9.2%
403
 
9.1%
355
 
8.0%
354
 
8.0%
327
 
7.4%
318
 
7.2%
209
 
4.7%
182
 
4.1%
119
 
2.7%
98
 
2.2%
Other values (194) 1651
37.3%
Common
ValueCountFrequency (%)
1505
48.2%
1 246
 
7.9%
( 162
 
5.2%
) 162
 
5.2%
2 152
 
4.9%
4 117
 
3.7%
3 116
 
3.7%
. 101
 
3.2%
6 88
 
2.8%
9 87
 
2.8%
Other values (6) 385
 
12.3%
Latin
ValueCountFrequency (%)
B 3
25.0%
A 3
25.0%
C 3
25.0%
U 1
 
8.3%
S 1
 
8.3%
G 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4421
58.5%
ASCII 3133
41.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1505
48.0%
1 246
 
7.9%
( 162
 
5.2%
) 162
 
5.2%
2 152
 
4.9%
4 117
 
3.7%
3 116
 
3.7%
. 101
 
3.2%
6 88
 
2.8%
9 87
 
2.8%
Other values (12) 397
 
12.7%
Hangul
ValueCountFrequency (%)
405
 
9.2%
403
 
9.1%
355
 
8.0%
354
 
8.0%
327
 
7.4%
318
 
7.2%
209
 
4.7%
182
 
4.1%
119
 
2.7%
98
 
2.2%
Other values (194) 1651
37.3%

Missing values

2023-12-12T08:41:50.351933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:41:50.430566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

대표자명업소명업소지번주소업소도로명주소
0황규호유워시 셀프세차장강원도 삼척시 교동 222-2 .222-3
1양춘자교동새마을부녀회강원도 삼척시 교동 산 80-1강원도 삼척시 새천년도로 233. 소망의탑 (교동)
2양춘자교동새마을부녀회강원도 삼척시 교동 86-3강원도 삼척시 새천년도로 326. 조각공원 (교동)
3권옥랑나원식당강원도 삼척시 하장면 역둔리 3-4강원도 삼척시 하장면 역둔원동로 12-19
4이정임대성디자인건설강원도 삼척시 당저동 140-2강원도 삼척시 진주로 55 (당저동)
5천세열풍곡리마을회강원도 삼척시 가곡면 풍곡리 631강원도 삼척시 가곡면 풍곡안길 17-18
6이상용도계복권방강원도 삼척시 도계읍 도계리 393-1강원도 삼척시 도계읍 도계로 287-2
7김진경씨유(CU)삼척SK남양점강원도 삼척시 남양동 334-10강원도 삼척시 오십천로 457. 1층 (남양동)
8이현정미니스톱 삼척덕산해변점강원도 삼척시 근덕면 덕산리 119-1강원도 삼척시 근덕면 덕산해변길 82
9김준석세븐일레븐 삼척병원점강원도 삼척시 정상동 386-5강원도 삼척시 오십천로 500 (정상동)
대표자명업소명업소지번주소업소도로명주소
395김영화도원슈퍼강원도 삼척시 도계읍 도계리 호 1통 14반
396이화자화경미니슈퍼강원도 삼척시 도계읍 고사리 103번지 6호 0통 2반
397김분선상호없음강원도 삼척시 정하동 호 1통 6반
398홍정순평화사강원도 삼척시 남양동 호 13통 1반
399정관식세광상회강원도 삼척시 정하동 65번지 3호 11통 3반
400심순택상호없음강원도 삼척시 교동 호 1통 3반
401박성표중앙 상회강원도 삼척시 남양동 호 6통 2반
402최옥기중앙 문구사강원도 삼척시 남양동 55번지 4호 6통 3반강원도 삼척시 진주로 12-21 (남양동)
403박귀월상호없음강원도 삼척시 정상동 호 1통 2반
404김영자상호없음강원도 삼척시 성내동 16호 1통 4반