Overview

Dataset statistics

Number of variables4
Number of observations145
Missing cells0
Missing cells (%)0.0%
Duplicate rows5
Duplicate rows (%)3.4%
Total size in memory4.7 KiB
Average record size in memory32.9 B

Variable types

Text3
Categorical1

Dataset

Description전라북도 군산시 소재한 약국 현황(약국명칭, 약국전화번호, 약국소재지(도로명), 약국구분) 관련된 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/3080249/fileData.do

Alerts

Dataset has 5 (3.4%) duplicate rowsDuplicates
약국구분 is highly imbalanced (69.2%)Imbalance

Reproduction

Analysis started2023-12-12 11:14:07.509760
Analysis finished2023-12-12 11:14:07.977789
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct140
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T20:14:08.278025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length4
Mean length4.8758621
Min length3

Characters and Unicode

Total characters707
Distinct characters160
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)93.1%

Sample

1st row라온약국
2nd row믿음약국
3rd row나라약국
4th row연합약국
5th row서봄약국
ValueCountFrequency (%)
한약국 3
 
2.0%
아이약국 2
 
1.3%
사랑약국 2
 
1.3%
백화점약국 2
 
1.3%
수송우리약국 2
 
1.3%
하늘약국 2
 
1.3%
유한약국 1
 
0.7%
백제약국 1
 
0.7%
조은약국 1
 
0.7%
참조은약국 1
 
0.7%
Other values (134) 134
88.7%
2023-12-12T20:14:08.985753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
147
20.8%
145
20.5%
16
 
2.3%
10
 
1.4%
8
 
1.1%
8
 
1.1%
7
 
1.0%
7
 
1.0%
7
 
1.0%
7
 
1.0%
Other values (150) 345
48.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 695
98.3%
Space Separator 6
 
0.8%
Decimal Number 6
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
147
21.2%
145
20.9%
16
 
2.3%
10
 
1.4%
8
 
1.2%
8
 
1.2%
7
 
1.0%
7
 
1.0%
7
 
1.0%
7
 
1.0%
Other values (146) 333
47.9%
Decimal Number
ValueCountFrequency (%)
6 2
33.3%
5 2
33.3%
3 2
33.3%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 695
98.3%
Common 12
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
147
21.2%
145
20.9%
16
 
2.3%
10
 
1.4%
8
 
1.2%
8
 
1.2%
7
 
1.0%
7
 
1.0%
7
 
1.0%
7
 
1.0%
Other values (146) 333
47.9%
Common
ValueCountFrequency (%)
6
50.0%
6 2
 
16.7%
5 2
 
16.7%
3 2
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 695
98.3%
ASCII 12
 
1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
147
21.2%
145
20.9%
16
 
2.3%
10
 
1.4%
8
 
1.2%
8
 
1.2%
7
 
1.0%
7
 
1.0%
7
 
1.0%
7
 
1.0%
Other values (146) 333
47.9%
ASCII
ValueCountFrequency (%)
6
50.0%
6 2
 
16.7%
5 2
 
16.7%
3 2
 
16.7%
Distinct140
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T20:14:09.413237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.013793
Min length12

Characters and Unicode

Total characters1742
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)93.1%

Sample

1st row063-462-1056
2nd row063-731-0093
3rd row063-731-0303
4th row063-451-2045
5th row063-466-6780
ValueCountFrequency (%)
063-467-4907 2
 
1.4%
063-442-7700 2
 
1.4%
063-445-4008 2
 
1.4%
063-466-7784 2
 
1.4%
063-466-8763 2
 
1.4%
063-443-4320 1
 
0.7%
063-465-7190 1
 
0.7%
063-451-9981 1
 
0.7%
063-471-6755 1
 
0.7%
063-442-3553 1
 
0.7%
Other values (130) 130
89.7%
2023-12-12T20:14:10.052780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 290
16.6%
6 275
15.8%
0 240
13.8%
3 234
13.4%
4 227
13.0%
7 107
 
6.1%
5 107
 
6.1%
2 84
 
4.8%
1 79
 
4.5%
8 64
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1452
83.4%
Dash Punctuation 290
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 275
18.9%
0 240
16.5%
3 234
16.1%
4 227
15.6%
7 107
 
7.4%
5 107
 
7.4%
2 84
 
5.8%
1 79
 
5.4%
8 64
 
4.4%
9 35
 
2.4%
Dash Punctuation
ValueCountFrequency (%)
- 290
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1742
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 290
16.6%
6 275
15.8%
0 240
13.8%
3 234
13.4%
4 227
13.0%
7 107
 
6.1%
5 107
 
6.1%
2 84
 
4.8%
1 79
 
4.5%
8 64
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1742
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 290
16.6%
6 275
15.8%
0 240
13.8%
3 234
13.4%
4 227
13.0%
7 107
 
6.1%
5 107
 
6.1%
2 84
 
4.8%
1 79
 
4.5%
8 64
 
3.7%
Distinct140
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T20:14:10.665061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length38
Mean length25.737931
Min length19

Characters and Unicode

Total characters3732
Distinct characters135
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)93.1%

Sample

1st row전라북도 군산시 월명로 219, 1층 (수송동)
2nd row전라북도 군산시 공단대로 252, 1층 (수송동)
3rd row전라북도 군산시 백릉안4길 12(조촌동)
4th row전라북도 군산시 대야면 번영로 904, 연합약국
5th row전라북도 군산시 공항로 86, 106호 (소룡동)
ValueCountFrequency (%)
전라북도 145
 
17.6%
군산시 145
 
17.6%
나운동 33
 
4.0%
수송동 32
 
3.9%
1층 31
 
3.8%
월명로 17
 
2.1%
조촌동 14
 
1.7%
대학로 14
 
1.7%
공단대로 10
 
1.2%
소룡동 8
 
1.0%
Other values (242) 373
45.4%
2023-12-12T20:14:11.472708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
685
18.4%
156
 
4.2%
151
 
4.0%
151
 
4.0%
151
 
4.0%
150
 
4.0%
149
 
4.0%
148
 
4.0%
1 147
 
3.9%
145
 
3.9%
Other values (125) 1699
45.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2175
58.3%
Space Separator 685
 
18.4%
Decimal Number 508
 
13.6%
Open Punctuation 138
 
3.7%
Close Punctuation 138
 
3.7%
Other Punctuation 76
 
2.0%
Dash Punctuation 12
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
156
 
7.2%
151
 
6.9%
151
 
6.9%
151
 
6.9%
150
 
6.9%
149
 
6.9%
148
 
6.8%
145
 
6.7%
121
 
5.6%
47
 
2.2%
Other values (109) 806
37.1%
Decimal Number
ValueCountFrequency (%)
1 147
28.9%
3 68
13.4%
2 68
13.4%
0 50
 
9.8%
4 47
 
9.3%
5 28
 
5.5%
7 27
 
5.3%
8 25
 
4.9%
6 24
 
4.7%
9 24
 
4.7%
Other Punctuation
ValueCountFrequency (%)
, 72
94.7%
. 4
 
5.3%
Space Separator
ValueCountFrequency (%)
685
100.0%
Open Punctuation
ValueCountFrequency (%)
( 138
100.0%
Close Punctuation
ValueCountFrequency (%)
) 138
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2175
58.3%
Common 1557
41.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
156
 
7.2%
151
 
6.9%
151
 
6.9%
151
 
6.9%
150
 
6.9%
149
 
6.9%
148
 
6.8%
145
 
6.7%
121
 
5.6%
47
 
2.2%
Other values (109) 806
37.1%
Common
ValueCountFrequency (%)
685
44.0%
1 147
 
9.4%
( 138
 
8.9%
) 138
 
8.9%
, 72
 
4.6%
3 68
 
4.4%
2 68
 
4.4%
0 50
 
3.2%
4 47
 
3.0%
5 28
 
1.8%
Other values (6) 116
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2175
58.3%
ASCII 1557
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
685
44.0%
1 147
 
9.4%
( 138
 
8.9%
) 138
 
8.9%
, 72
 
4.6%
3 68
 
4.4%
2 68
 
4.4%
0 50
 
3.2%
4 47
 
3.0%
5 28
 
1.8%
Other values (6) 116
 
7.5%
Hangul
ValueCountFrequency (%)
156
 
7.2%
151
 
6.9%
151
 
6.9%
151
 
6.9%
150
 
6.9%
149
 
6.9%
148
 
6.8%
145
 
6.7%
121
 
5.6%
47
 
2.2%
Other values (109) 806
37.1%

약국구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
약국
137 
한약국
 
8

Length

Max length3
Median length2
Mean length2.0551724
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row약국
2nd row약국
3rd row약국
4th row약국
5th row약국

Common Values

ValueCountFrequency (%)
약국 137
94.5%
한약국 8
 
5.5%

Length

2023-12-12T20:14:11.710726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:14:11.899398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
약국 137
94.5%
한약국 8
 
5.5%

Missing values

2023-12-12T20:14:07.807394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:14:07.929194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

약국명칭약국전화번호약국소재지(도로명)약국구분
0라온약국063-462-1056전라북도 군산시 월명로 219, 1층 (수송동)약국
1믿음약국063-731-0093전라북도 군산시 공단대로 252, 1층 (수송동)약국
2나라약국063-731-0303전라북도 군산시 백릉안4길 12(조촌동)약국
3연합약국063-451-2045전라북도 군산시 대야면 번영로 904, 연합약국약국
4서봄약국063-466-6780전라북도 군산시 공항로 86, 106호 (소룡동)약국
5다나은온누리약국063-471-5086전라북도 군산시 하나운로 71, 1층 (나운동)약국
6휴베이스등대약국063-443-3335전라북도 군산시 궁포안1길 33, 101호 (조촌동)약국
7영광약국063-734-2277전라북도 군산시 궁포3로 4, 1동 1층 104호 (조촌동)약국
8만수약국063-445-3603전라북도 군산시 대학로 51(명산동)약국
9중앙온누리약국063-442-5266전라북도 군산시 대명2길 2, 1층 (대명동)약국
약국명칭약국전화번호약국소재지(도로명)약국구분
135조흥약국063-445-8035전라북도 군산시 신영2길 1-1 (신영동)약국
136심약국063-463-9297전라북도 군산시 문화로 33 (문화동)약국
137명산약국063-453-2404전라북도 군산시 임피면 동군산로 708약국
138군산약국063-462-0069전라북도 군산시 월명로 300, 항장앤유외과의원 1층 (수송동)약국
139은파약국063-462-2710전라북도 군산시 의료원로 159, 신일아파트상가 (나운동)약국
140태광약국063-463-3571전라북도 군산시 대학로 414 (나운동)약국
141전북약국063-445-6436전라북도 군산시 구암3.1로 7 (대명동)약국
142아세아약국063-465-7337전라북도 군산시 수송로 14 (나운동)약국
143성도약국063-463-7247전라북도 군산시 백토로 32 (문화동)약국
144중국장수당약국063-445-2417전라북도 군산시 영동로 3-1 (영동)약국

Duplicate rows

Most frequently occurring

약국명칭약국전화번호약국소재지(도로명)약국구분# duplicates
0백화점약국063-445-4008전라북도 군산시 구암3.1로 137, 2층 (경암동, 이마트)약국2
1사랑약국063-442-7700전라북도 군산시 궁포안2길 24, 103호 (조촌동)약국2
2수송우리약국063-466-7784전라북도 군산시 월명로 205 (수송동)약국2
3아이약국063-467-4907전라북도 군산시 수송로 198, 1층 (수송동)약국2
4하늘약국063-466-8763전라북도 군산시 수송안길 50 (수송동)약국2