Overview

Dataset statistics

Number of variables3
Number of observations169
Missing cells0
Missing cells (%)0.0%
Duplicate rows6
Duplicate rows (%)3.6%
Total size in memory4.1 KiB
Average record size in memory24.8 B

Variable types

Text3

Dataset

Description울산광역시 남구 약국현황에 대한 데이터로 약국명칭, 약국전화번호(052-000-0000), 약국소재지(도로명) 항목을 제공합니다.
Author울산광역시 남구
URLhttps://www.data.go.kr/data/3076248/fileData.do

Alerts

Dataset has 6 (3.6%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 16:42:52.086618
Analysis finished2023-12-12 16:42:52.373780
Duration0.29 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct162
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-13T01:42:52.560061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length4
Mean length4.8224852
Min length3

Characters and Unicode

Total characters815
Distinct characters177
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique156 ?
Unique (%)92.3%

Sample

1st row사랑약국
2nd row나정약국
3rd row주약국
4th row해바라기약국
5th row울산현대약국
ValueCountFrequency (%)
신유명약국 3
 
1.8%
한솔약국 2
 
1.2%
강남약국 2
 
1.2%
아름약국 2
 
1.2%
중앙약국 2
 
1.2%
삼산현대약국 2
 
1.2%
두레약국 1
 
0.6%
올리브약국 1
 
0.6%
우리들약국 1
 
0.6%
태성약국 1
 
0.6%
Other values (154) 154
90.1%
2023-12-13T01:42:52.981413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
170
20.9%
169
20.7%
20
 
2.5%
15
 
1.8%
15
 
1.8%
13
 
1.6%
11
 
1.3%
9
 
1.1%
9
 
1.1%
9
 
1.1%
Other values (167) 375
46.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 808
99.1%
Decimal Number 4
 
0.5%
Space Separator 2
 
0.2%
Lowercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
170
21.0%
169
20.9%
20
 
2.5%
15
 
1.9%
15
 
1.9%
13
 
1.6%
11
 
1.4%
9
 
1.1%
9
 
1.1%
9
 
1.1%
Other values (163) 368
45.5%
Decimal Number
ValueCountFrequency (%)
2 2
50.0%
1 2
50.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 808
99.1%
Common 6
 
0.7%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
170
21.0%
169
20.9%
20
 
2.5%
15
 
1.9%
15
 
1.9%
13
 
1.6%
11
 
1.4%
9
 
1.1%
9
 
1.1%
9
 
1.1%
Other values (163) 368
45.5%
Common
ValueCountFrequency (%)
2
33.3%
2 2
33.3%
1 2
33.3%
Latin
ValueCountFrequency (%)
e 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 808
99.1%
ASCII 7
 
0.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
170
21.0%
169
20.9%
20
 
2.5%
15
 
1.9%
15
 
1.9%
13
 
1.6%
11
 
1.4%
9
 
1.1%
9
 
1.1%
9
 
1.1%
Other values (163) 368
45.5%
ASCII
ValueCountFrequency (%)
2
28.6%
2 2
28.6%
1 2
28.6%
e 1
14.3%
Distinct162
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-13T01:42:53.250688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12
Min length11

Characters and Unicode

Total characters2028
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique156 ?
Unique (%)92.3%

Sample

1st row052-266-5953
2nd row052-269-3999
3rd row052-267-7588
4th row052-257-1267
5th row052-260-4567
ValueCountFrequency (%)
052-260-8861 3
 
1.8%
052-267-2631 2
 
1.2%
052-271-3101 2
 
1.2%
052-260-1483 2
 
1.2%
052-266-9965 2
 
1.2%
052-268-5666 2
 
1.2%
052-223-6163 1
 
0.6%
052-257-5700 1
 
0.6%
052-269-1004 1
 
0.6%
052-265-1540 1
 
0.6%
Other values (152) 152
89.9%
2023-12-13T01:42:53.666872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 436
21.5%
- 337
16.6%
5 284
14.0%
0 258
12.7%
6 164
 
8.1%
7 151
 
7.4%
8 96
 
4.7%
1 90
 
4.4%
3 74
 
3.6%
4 69
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1691
83.4%
Dash Punctuation 337
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 436
25.8%
5 284
16.8%
0 258
15.3%
6 164
 
9.7%
7 151
 
8.9%
8 96
 
5.7%
1 90
 
5.3%
3 74
 
4.4%
4 69
 
4.1%
9 69
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 337
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2028
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 436
21.5%
- 337
16.6%
5 284
14.0%
0 258
12.7%
6 164
 
8.1%
7 151
 
7.4%
8 96
 
4.7%
1 90
 
4.4%
3 74
 
3.6%
4 69
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2028
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 436
21.5%
- 337
16.6%
5 284
14.0%
0 258
12.7%
6 164
 
8.1%
7 151
 
7.4%
8 96
 
4.7%
1 90
 
4.4%
3 74
 
3.6%
4 69
 
3.4%
Distinct160
Distinct (%)94.7%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-13T01:42:54.066875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length41
Mean length26.04142
Min length20

Characters and Unicode

Total characters4401
Distinct characters138
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique152 ?
Unique (%)89.9%

Sample

1st row울산광역시 남구 중앙로 234, 1층 102호 (신정동)
2nd row울산광역시 남구 문수로 377, 대공원센트럴하임더시티 1층 106호 (옥동)
3rd row울산광역시 남구 돋질로 369, 1층 102호 (삼산동)
4th row울산광역시 남구 삼산로 231, 울산 센트럴 자이 1층 106호 (달동)
5th row울산광역시 남구 월평로159번길 20, 1층 (신정동)
ValueCountFrequency (%)
울산광역시 169
17.7%
남구 169
17.7%
1층 56
 
5.9%
신정동 55
 
5.8%
달동 35
 
3.7%
삼산로 35
 
3.7%
삼산동 30
 
3.1%
무거동 24
 
2.5%
수암로 19
 
2.0%
중앙로 16
 
1.7%
Other values (221) 348
36.4%
2023-12-13T01:42:54.536796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
787
17.9%
256
 
5.8%
1 193
 
4.4%
175
 
4.0%
174
 
4.0%
173
 
3.9%
170
 
3.9%
( 169
 
3.8%
) 169
 
3.8%
169
 
3.8%
Other values (128) 1966
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2489
56.6%
Space Separator 787
 
17.9%
Decimal Number 669
 
15.2%
Open Punctuation 169
 
3.8%
Close Punctuation 169
 
3.8%
Other Punctuation 92
 
2.1%
Dash Punctuation 18
 
0.4%
Uppercase Letter 5
 
0.1%
Math Symbol 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
256
 
10.3%
175
 
7.0%
174
 
7.0%
173
 
7.0%
170
 
6.8%
169
 
6.8%
169
 
6.8%
169
 
6.8%
167
 
6.7%
78
 
3.1%
Other values (107) 789
31.7%
Decimal Number
ValueCountFrequency (%)
1 193
28.8%
2 101
15.1%
3 69
 
10.3%
4 57
 
8.5%
7 48
 
7.2%
8 46
 
6.9%
6 43
 
6.4%
0 42
 
6.3%
5 37
 
5.5%
9 33
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
R 1
20.0%
E 1
20.0%
M 1
20.0%
A 1
20.0%
T 1
20.0%
Space Separator
ValueCountFrequency (%)
787
100.0%
Open Punctuation
ValueCountFrequency (%)
( 169
100.0%
Close Punctuation
ValueCountFrequency (%)
) 169
100.0%
Other Punctuation
ValueCountFrequency (%)
, 92
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2489
56.6%
Common 1907
43.3%
Latin 5
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
256
 
10.3%
175
 
7.0%
174
 
7.0%
173
 
7.0%
170
 
6.8%
169
 
6.8%
169
 
6.8%
169
 
6.8%
167
 
6.7%
78
 
3.1%
Other values (107) 789
31.7%
Common
ValueCountFrequency (%)
787
41.3%
1 193
 
10.1%
( 169
 
8.9%
) 169
 
8.9%
2 101
 
5.3%
, 92
 
4.8%
3 69
 
3.6%
4 57
 
3.0%
7 48
 
2.5%
8 46
 
2.4%
Other values (6) 176
 
9.2%
Latin
ValueCountFrequency (%)
R 1
20.0%
E 1
20.0%
M 1
20.0%
A 1
20.0%
T 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2489
56.6%
ASCII 1912
43.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
787
41.2%
1 193
 
10.1%
( 169
 
8.8%
) 169
 
8.8%
2 101
 
5.3%
, 92
 
4.8%
3 69
 
3.6%
4 57
 
3.0%
7 48
 
2.5%
8 46
 
2.4%
Other values (11) 181
 
9.5%
Hangul
ValueCountFrequency (%)
256
 
10.3%
175
 
7.0%
174
 
7.0%
173
 
7.0%
170
 
6.8%
169
 
6.8%
169
 
6.8%
169
 
6.8%
167
 
6.7%
78
 
3.1%
Other values (107) 789
31.7%

Missing values

2023-12-13T01:42:52.277709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:42:52.346592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

약국명칭약국전화번호약국소재지(도로명)
0사랑약국052-266-5953울산광역시 남구 중앙로 234, 1층 102호 (신정동)
1나정약국052-269-3999울산광역시 남구 문수로 377, 대공원센트럴하임더시티 1층 106호 (옥동)
2주약국052-267-7588울산광역시 남구 돋질로 369, 1층 102호 (삼산동)
3해바라기약국052-257-1267울산광역시 남구 삼산로 231, 울산 센트럴 자이 1층 106호 (달동)
4울산현대약국052-260-4567울산광역시 남구 월평로159번길 20, 1층 (신정동)
5왕약국052-266-4171울산광역시 남구 삼산로 243, 1층 (달동)
6새울산약국052-266-5397울산광역시 남구 월평로171번길 26, 1층 125호 (신정동, 신정 지웰)
7삼산보령약국052-256-1240울산광역시 남구 삼산로 283, 소망빌딩 1층 (삼산동)
8정온누리약국052-710-5202울산광역시 남구 삼산로 57, 1층 (신정동)
9주차장약국052-260-2316울산광역시 남구 번영로245번길 12, 1~2층 (신정동)
약국명칭약국전화번호약국소재지(도로명)
159온누리양지약국052-275-4536울산광역시 남구 거마로134번길 17 (신정동)
160영진약국052-271-1292울산광역시 남구 수암로 230 (야음동)
161소망약국052-275-5552울산광역시 남구 중앙로 246 (신정동)
162울산선진약국052-272-8555울산광역시 남구 중앙로 258 (신정동)
163한양약국052-275-4172울산광역시 남구 산업로339번길 24, 한양상가 108호 (선암동)
164대가약국052-272-0720울산광역시 남구 수암로 246 (야음동)
165매일약국052-272-4418울산광역시 남구 수암로288번길 5 (야음동)
166왕자약국052-265-6842울산광역시 남구 봉월로102번길 42 (신정동)
167모범약국052-272-8359울산광역시 남구 수암로 115 (신정동)
168동보약국052-272-4845울산광역시 남구 중앙로241번길 1 (신정동)

Duplicate rows

Most frequently occurring

약국명칭약국전화번호약국소재지(도로명)# duplicates
2신유명약국052-260-8861울산광역시 남구 중앙로 243-1, 1층 (신정동)3
0강남약국052-271-3101울산광역시 남구 삼산로 266 (삼산동)2
1삼산현대약국052-268-5666울산광역시 남구 삼산중로 69 (달동)2
3아름약국052-260-1483울산광역시 남구 삼산중로 98 (삼산동)2
4중앙약국052-266-9965울산광역시 남구 신정로148번길 50 (신정동)2
5한솔약국052-267-2631울산광역시 남구 번영로107번길 1 (달동)2