Overview

Dataset statistics

Number of variables4
Number of observations1238
Missing cells1063
Missing cells (%)21.5%
Duplicate rows12
Duplicate rows (%)1.0%
Total size in memory38.8 KiB
Average record size in memory32.1 B

Variable types

Text3
Categorical1

Dataset

Description총 1238건의 경상남도 진주시 소재의 쓰레기 봉투 판매소에 대한 정보(상호, 소재지, 전화번호, 행정구역)를 제공합니다.
Author경상남도 진주시
URLhttps://www.data.go.kr/data/15064524/fileData.do

Alerts

Dataset has 12 (1.0%) duplicate rowsDuplicates
전화번호 has 1063 (85.9%) missing valuesMissing

Reproduction

Analysis started2023-12-12 06:39:24.945690
Analysis finished2023-12-12 06:39:25.579276
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct1148
Distinct (%)92.7%
Missing0
Missing (%)0.0%
Memory size9.8 KiB
2023-12-12T15:39:25.747670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length20
Mean length7.7584814
Min length2

Characters and Unicode

Total characters9605
Distinct characters450
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1081 ?
Unique (%)87.3%

Sample

1st row씨유 진주가호올리움점
2nd row이마트24 진주명석점
3rd row세븐일레븐 진주신안스마일점
4th row작은가게
5th row플러스할인마트
ValueCountFrequency (%)
씨유 58
 
3.9%
gs25 43
 
2.9%
세븐일레븐 35
 
2.4%
이마트24 12
 
0.8%
지에스25 8
 
0.5%
진주사봉산업단지점 7
 
0.5%
하나로마트 6
 
0.4%
훼미리마트 5
 
0.3%
오렌지마트 5
 
0.3%
가좌점 5
 
0.3%
Other values (1175) 1304
87.6%
2023-12-12T15:39:26.169261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
593
 
6.2%
535
 
5.6%
505
 
5.3%
336
 
3.5%
322
 
3.4%
251
 
2.6%
2 221
 
2.3%
199
 
2.1%
184
 
1.9%
171
 
1.8%
Other values (440) 6288
65.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8298
86.4%
Decimal Number 461
 
4.8%
Uppercase Letter 423
 
4.4%
Space Separator 251
 
2.6%
Close Punctuation 63
 
0.7%
Open Punctuation 62
 
0.6%
Lowercase Letter 19
 
0.2%
Other Symbol 15
 
0.2%
Dash Punctuation 7
 
0.1%
Other Punctuation 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
593
 
7.1%
535
 
6.4%
505
 
6.1%
336
 
4.0%
322
 
3.9%
199
 
2.4%
184
 
2.2%
171
 
2.1%
169
 
2.0%
143
 
1.7%
Other values (391) 5141
62.0%
Uppercase Letter
ValueCountFrequency (%)
S 144
34.0%
G 141
33.3%
C 38
 
9.0%
U 27
 
6.4%
L 10
 
2.4%
H 9
 
2.1%
M 9
 
2.1%
K 9
 
2.1%
O 7
 
1.7%
B 6
 
1.4%
Other values (9) 23
 
5.4%
Lowercase Letter
ValueCountFrequency (%)
e 4
21.1%
o 3
15.8%
h 2
10.5%
n 1
 
5.3%
w 1
 
5.3%
r 1
 
5.3%
g 1
 
5.3%
s 1
 
5.3%
a 1
 
5.3%
p 1
 
5.3%
Other values (3) 3
15.8%
Decimal Number
ValueCountFrequency (%)
2 221
47.9%
5 154
33.4%
4 65
 
14.1%
1 6
 
1.3%
3 6
 
1.3%
6 3
 
0.7%
8 2
 
0.4%
0 2
 
0.4%
9 2
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 4
66.7%
, 1
 
16.7%
& 1
 
16.7%
Space Separator
ValueCountFrequency (%)
251
100.0%
Close Punctuation
ValueCountFrequency (%)
) 63
100.0%
Open Punctuation
ValueCountFrequency (%)
( 62
100.0%
Other Symbol
ValueCountFrequency (%)
15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8313
86.5%
Common 850
 
8.8%
Latin 442
 
4.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
593
 
7.1%
535
 
6.4%
505
 
6.1%
336
 
4.0%
322
 
3.9%
199
 
2.4%
184
 
2.2%
171
 
2.1%
169
 
2.0%
143
 
1.7%
Other values (392) 5156
62.0%
Latin
ValueCountFrequency (%)
S 144
32.6%
G 141
31.9%
C 38
 
8.6%
U 27
 
6.1%
L 10
 
2.3%
H 9
 
2.0%
M 9
 
2.0%
K 9
 
2.0%
O 7
 
1.6%
B 6
 
1.4%
Other values (22) 42
 
9.5%
Common
ValueCountFrequency (%)
251
29.5%
2 221
26.0%
5 154
18.1%
4 65
 
7.6%
) 63
 
7.4%
( 62
 
7.3%
- 7
 
0.8%
1 6
 
0.7%
3 6
 
0.7%
. 4
 
0.5%
Other values (6) 11
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8298
86.4%
ASCII 1292
 
13.5%
None 15
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
593
 
7.1%
535
 
6.4%
505
 
6.1%
336
 
4.0%
322
 
3.9%
199
 
2.4%
184
 
2.2%
171
 
2.1%
169
 
2.0%
143
 
1.7%
Other values (391) 5141
62.0%
ASCII
ValueCountFrequency (%)
251
19.4%
2 221
17.1%
5 154
11.9%
S 144
11.1%
G 141
10.9%
4 65
 
5.0%
) 63
 
4.9%
( 62
 
4.8%
C 38
 
2.9%
U 27
 
2.1%
Other values (38) 126
9.8%
None
ValueCountFrequency (%)
15
100.0%
Distinct1177
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size9.8 KiB
2023-12-12T15:39:26.496237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length44
Mean length19.449919
Min length9

Characters and Unicode

Total characters24079
Distinct characters261
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1147 ?
Unique (%)92.6%

Sample

1st row진주시 가호로 17-10 105동 101호(가좌동, 올리움아파트상가동)
2nd row진주시 명석면 광제산로 11-23, 5동 1층 2,3호(동신아파트)
3rd row진주시 평거로126번길 1(신안동)
4th row진주시 창렬로145번길 5 (상봉동)
5th row진주시 진주성로147번길 8, 상가동 4 (상봉동)
ValueCountFrequency (%)
진주시 1232
 
25.1%
상대동 74
 
1.5%
진주대로 60
 
1.2%
하대동 55
 
1.1%
평거동 44
 
0.9%
상봉동 44
 
0.9%
가좌동 44
 
0.9%
진양호로 38
 
0.8%
1층 38
 
0.8%
상평동 37
 
0.8%
Other values (1325) 3250
66.1%
2023-12-12T15:39:27.031919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3692
 
15.3%
1521
 
6.3%
1 1491
 
6.2%
1407
 
5.8%
1252
 
5.2%
1100
 
4.6%
1076
 
4.5%
( 824
 
3.4%
) 822
 
3.4%
2 642
 
2.7%
Other values (251) 10252
42.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13132
54.5%
Decimal Number 5035
 
20.9%
Space Separator 3692
 
15.3%
Open Punctuation 824
 
3.4%
Close Punctuation 822
 
3.4%
Other Punctuation 303
 
1.3%
Dash Punctuation 240
 
1.0%
Uppercase Letter 22
 
0.1%
Math Symbol 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1521
 
11.6%
1407
 
10.7%
1252
 
9.5%
1100
 
8.4%
1076
 
8.2%
595
 
4.5%
489
 
3.7%
432
 
3.3%
304
 
2.3%
290
 
2.2%
Other values (223) 4666
35.5%
Uppercase Letter
ValueCountFrequency (%)
B 6
27.3%
A 6
27.3%
S 2
 
9.1%
I 1
 
4.5%
O 1
 
4.5%
K 1
 
4.5%
C 1
 
4.5%
T 1
 
4.5%
Y 1
 
4.5%
M 1
 
4.5%
Decimal Number
ValueCountFrequency (%)
1 1491
29.6%
2 642
12.8%
0 450
 
8.9%
3 438
 
8.7%
5 406
 
8.1%
4 370
 
7.3%
6 332
 
6.6%
9 315
 
6.3%
8 314
 
6.2%
7 277
 
5.5%
Other Punctuation
ValueCountFrequency (%)
, 301
99.3%
. 2
 
0.7%
Space Separator
ValueCountFrequency (%)
3692
100.0%
Open Punctuation
ValueCountFrequency (%)
( 824
100.0%
Close Punctuation
ValueCountFrequency (%)
) 822
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 240
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13132
54.5%
Common 10925
45.4%
Latin 22
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1521
 
11.6%
1407
 
10.7%
1252
 
9.5%
1100
 
8.4%
1076
 
8.2%
595
 
4.5%
489
 
3.7%
432
 
3.3%
304
 
2.3%
290
 
2.2%
Other values (223) 4666
35.5%
Common
ValueCountFrequency (%)
3692
33.8%
1 1491
13.6%
( 824
 
7.5%
) 822
 
7.5%
2 642
 
5.9%
0 450
 
4.1%
3 438
 
4.0%
5 406
 
3.7%
4 370
 
3.4%
6 332
 
3.0%
Other values (7) 1458
 
13.3%
Latin
ValueCountFrequency (%)
B 6
27.3%
A 6
27.3%
S 2
 
9.1%
I 1
 
4.5%
O 1
 
4.5%
K 1
 
4.5%
C 1
 
4.5%
T 1
 
4.5%
Y 1
 
4.5%
M 1
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13132
54.5%
ASCII 10947
45.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3692
33.7%
1 1491
13.6%
( 824
 
7.5%
) 822
 
7.5%
2 642
 
5.9%
0 450
 
4.1%
3 438
 
4.0%
5 406
 
3.7%
4 370
 
3.4%
6 332
 
3.0%
Other values (18) 1480
13.5%
Hangul
ValueCountFrequency (%)
1521
 
11.6%
1407
 
10.7%
1252
 
9.5%
1100
 
8.4%
1076
 
8.2%
595
 
4.5%
489
 
3.7%
432
 
3.3%
304
 
2.3%
290
 
2.2%
Other values (223) 4666
35.5%

전화번호
Text

MISSING 

Distinct172
Distinct (%)98.3%
Missing1063
Missing (%)85.9%
Memory size9.8 KiB
2023-12-12T15:39:27.368698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.022857
Min length12

Characters and Unicode

Total characters2104
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique169 ?
Unique (%)96.6%

Sample

1st row055-747-0336
2nd row055-753-2243
3rd row055-752-8136
4th row055-747-6360
5th row055-745-6295
ValueCountFrequency (%)
055-756-5000 2
 
1.1%
055-745-4003 2
 
1.1%
055-763-9000 2
 
1.1%
055-762-1339 1
 
0.6%
055-748-6039 1
 
0.6%
055-747-6686 1
 
0.6%
055-747-8002 1
 
0.6%
055-741-0810 1
 
0.6%
055-743-0677 1
 
0.6%
055-757-5372 1
 
0.6%
Other values (162) 162
92.6%
2023-12-12T15:39:27.847087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 489
23.2%
- 350
16.6%
0 287
13.6%
7 267
12.7%
4 149
 
7.1%
2 122
 
5.8%
6 118
 
5.6%
8 86
 
4.1%
3 80
 
3.8%
9 79
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1754
83.4%
Dash Punctuation 350
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 489
27.9%
0 287
16.4%
7 267
15.2%
4 149
 
8.5%
2 122
 
7.0%
6 118
 
6.7%
8 86
 
4.9%
3 80
 
4.6%
9 79
 
4.5%
1 77
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 350
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2104
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 489
23.2%
- 350
16.6%
0 287
13.6%
7 267
12.7%
4 149
 
7.1%
2 122
 
5.8%
6 118
 
5.6%
8 86
 
4.1%
3 80
 
3.8%
9 79
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2104
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 489
23.2%
- 350
16.6%
0 287
13.6%
7 267
12.7%
4 149
 
7.1%
2 122
 
5.8%
6 118
 
5.6%
8 86
 
4.1%
3 80
 
3.8%
9 79
 
3.8%

행정구역
Categorical

Distinct33
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size9.8 KiB
천전동
103 
중앙동
101 
가호동
98 
성북동
89 
충무공동
86 
Other values (28)
761 

Length

Max length4
Median length3
Mean length3.1357027
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row가호동
2nd row명석면
3rd row신안동
4th row상봉동
5th row상봉동

Common Values

ValueCountFrequency (%)
천전동 103
 
8.3%
중앙동 101
 
8.2%
가호동 98
 
7.9%
성북동 89
 
7.2%
충무공동 86
 
6.9%
평거동 76
 
6.1%
상대동 73
 
5.9%
상평동 67
 
5.4%
상봉동 66
 
5.3%
초장동 54
 
4.4%
Other values (23) 425
34.3%

Length

2023-12-12T15:39:28.040664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
천전동 103
 
8.3%
중앙동 101
 
8.2%
가호동 98
 
7.9%
성북동 89
 
7.2%
충무공동 86
 
6.9%
평거동 76
 
6.1%
상대동 73
 
5.9%
상평동 67
 
5.4%
상봉동 66
 
5.3%
초장동 54
 
4.4%
Other values (23) 425
34.3%

Missing values

2023-12-12T15:39:25.414105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:39:25.536021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호소재지(도로명)전화번호행정구역
0씨유 진주가호올리움점진주시 가호로 17-10 105동 101호(가좌동, 올리움아파트상가동)<NA>가호동
1이마트24 진주명석점진주시 명석면 광제산로 11-23, 5동 1층 2,3호(동신아파트)<NA>명석면
2세븐일레븐 진주신안스마일점진주시 평거로126번길 1(신안동)<NA>신안동
3작은가게진주시 창렬로145번길 5 (상봉동)<NA>상봉동
4플러스할인마트진주시 진주성로147번길 8, 상가동 4 (상봉동)<NA>상봉동
5거성할인마트진주시 의병로150번길 12 (상봉동)<NA>상봉동
6세원슈퍼진주시 창렬로180번길 7 (상봉동)<NA>상봉동
7CU진주보건대점진주시 의병로 55 (상봉동)055-747-0336상봉동
8코리아세븐 진주보건대점진주시 창렬로 101 (상봉동)<NA>상봉동
9씨유 진주봉원점진주시 상봉대룡길 8 (상봉동)<NA>상봉동
상호소재지(도로명)전화번호행정구역
1228야묵자푸드마켓 진주상회진주시 초북로20번길 5<NA>초장동
1229진주우리먹거리협동조합 진주텃밭 진양호진주시 진양호로44번길6<NA>평거동
1230이마트24 진주혁신LH허브점진주시 에나로138,1층111<NA>충무공동
1231씨유 진주신안강변점진주시 남강로491번길 8<NA>신안동
1232리퍼모아진주시 정촌면 삼일로95번길50-25<NA>정촌면
1233이마트24 진주상평로드점진주시 돗골로 3<NA>상평동
1234씨유 진주남강댐점진주시 남강로58, 1층<NA>판문동
1235킹스할인마트진주시 공단로44번길 10<NA>상평동
1236이마트24 진주금산푸르지오점진주시 금산면 덕의길11-1,진주푸르지오2단지 상가 102호<NA>금산면
1237아엔지할인마트진주시 천수로 312<NA>천전동

Duplicate rows

Most frequently occurring

상호소재지(도로명)전화번호행정구역# duplicates
6씨유 진주사봉산업단지점진주시 사봉면 산업단지로 32<NA>사봉면7
5씨유 가좌점진주시 가좌길 52<NA>가호동4
3세븐일레븐만덕화점진주시 충의로 20-12(충무공동)<NA>충무공동3
0GS25진주혁신허브점진주시 에나로 138, 1층 111호(충무공동)<NA>충무공동2
1문산롯데슈퍼진주시 문산읍 월아산로 1080<NA>문산읍2
2세븐일레븐 만덕화점진주시 충의로 20-12(충무공동)<NA>충무공동2
4세븐일레븐진주혁신허브점진주시 사들로34번길 8 101-102호<NA>충무공동2
7씨유 진주산업단지점진주시 사봉면 산업단지로 32<NA>사봉면2
8우리슈퍼(주공안)진주시 가좌동 683<NA>가호동2
9진주씨유가좌원룸점진주시 개양로6번길 8<NA>가호동2