Overview

Dataset statistics

Number of variables5
Number of observations29
Missing cells3
Missing cells (%)2.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory44.6 B

Variable types

Text4
Categorical1

Dataset

Description전북특별자치도 장수군에 소재하고 있는 의약업소 현황(업소명, 소재지, 규모, 전화번호, 팩스번호)에 대한 데이터 정보를 제공하고자 합니다
Author전북특별자치도 장수군
URLhttps://www.data.go.kr/data/15041641/fileData.do

Alerts

규모 is highly imbalanced (78.4%)Imbalance
팩스번호 has 3 (10.3%) missing valuesMissing
업소명 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2024-04-06 08:07:06.557750
Analysis finished2024-04-06 08:07:07.426802
Duration0.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소명
Text

UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size364.0 B
2024-04-06T17:07:07.695935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length5.6206897
Min length3

Characters and Unicode

Total characters163
Distinct characters69
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)100.0%

Sample

1st row장수군보건의료원
2nd row박승민내과의원
3rd row한사랑의원
4th row동아가정의원
5th row중앙의원
ValueCountFrequency (%)
장수군보건의료원 1
 
3.4%
장수치과의원 1
 
3.4%
호남당한약방 1
 
3.4%
연수당한약방 1
 
3.4%
터미널약국 1
 
3.4%
태평양약국 1
 
3.4%
장수종로약국 1
 
3.4%
장계백제약국 1
 
3.4%
독일약국 1
 
3.4%
유약국 1
 
3.4%
Other values (19) 19
65.5%
2024-04-06T17:07:08.459707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18
 
11.0%
17
 
10.4%
12
 
7.4%
9
 
5.5%
8
 
4.9%
8
 
4.9%
6
 
3.7%
5
 
3.1%
5
 
3.1%
3
 
1.8%
Other values (59) 72
44.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 162
99.4%
Space Separator 1
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
11.1%
17
 
10.5%
12
 
7.4%
9
 
5.6%
8
 
4.9%
8
 
4.9%
6
 
3.7%
5
 
3.1%
5
 
3.1%
3
 
1.9%
Other values (58) 71
43.8%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 162
99.4%
Common 1
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
11.1%
17
 
10.5%
12
 
7.4%
9
 
5.6%
8
 
4.9%
8
 
4.9%
6
 
3.7%
5
 
3.1%
5
 
3.1%
3
 
1.9%
Other values (58) 71
43.8%
Common
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 162
99.4%
ASCII 1
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
18
 
11.1%
17
 
10.5%
12
 
7.4%
9
 
5.6%
8
 
4.9%
8
 
4.9%
6
 
3.7%
5
 
3.1%
5
 
3.1%
3
 
1.9%
Other values (58) 71
43.8%
ASCII
ValueCountFrequency (%)
1
100.0%
Distinct23
Distinct (%)79.3%
Missing0
Missing (%)0.0%
Memory size364.0 B
2024-04-06T17:07:08.850008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length23.068966
Min length21

Characters and Unicode

Total characters669
Distinct characters39
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)62.1%

Sample

1st row전북특별자치도 장수군 장수읍 장천로 247
2nd row전북특별자치도 장수군 장수읍 장천로 180
3rd row전북특별자치도 장수군 장수읍 시장로 3
4th row전북특별자치도 장수군 장수읍 장천로 175
5th row전북특별자치도 장수군 장계면 한들로106
ValueCountFrequency (%)
전북특별자치도 29
20.4%
장수군 29
20.4%
장수읍 14
9.9%
장계면 13
9.2%
장천로 12
8.5%
한들로 9
 
6.3%
175 3
 
2.1%
180 2
 
1.4%
107 2
 
1.4%
산서면 2
 
1.4%
Other values (25) 27
19.0%
2024-04-06T17:07:09.610951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
113
16.9%
71
 
10.6%
43
 
6.4%
29
 
4.3%
29
 
4.3%
29
 
4.3%
29
 
4.3%
29
 
4.3%
29
 
4.3%
29
 
4.3%
Other values (29) 239
35.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 467
69.8%
Space Separator 113
 
16.9%
Decimal Number 83
 
12.4%
Dash Punctuation 6
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
71
15.2%
43
 
9.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
Other values (17) 121
25.9%
Decimal Number
ValueCountFrequency (%)
1 28
33.7%
7 11
 
13.3%
0 10
 
12.0%
8 8
 
9.6%
5 6
 
7.2%
3 6
 
7.2%
9 5
 
6.0%
6 4
 
4.8%
2 3
 
3.6%
4 2
 
2.4%
Space Separator
ValueCountFrequency (%)
113
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 467
69.8%
Common 202
30.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
71
15.2%
43
 
9.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
Other values (17) 121
25.9%
Common
ValueCountFrequency (%)
113
55.9%
1 28
 
13.9%
7 11
 
5.4%
0 10
 
5.0%
8 8
 
4.0%
5 6
 
3.0%
3 6
 
3.0%
- 6
 
3.0%
9 5
 
2.5%
6 4
 
2.0%
Other values (2) 5
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 467
69.8%
ASCII 202
30.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
113
55.9%
1 28
 
13.9%
7 11
 
5.4%
0 10
 
5.0%
8 8
 
4.0%
5 6
 
3.0%
3 6
 
3.0%
- 6
 
3.0%
9 5
 
2.5%
6 4
 
2.0%
Other values (2) 5
 
2.5%
Hangul
ValueCountFrequency (%)
71
15.2%
43
 
9.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
29
 
6.2%
Other values (17) 121
25.9%

규모
Categorical

IMBALANCE 

Distinct2
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size364.0 B
1층
28 
4층이하
 
1

Length

Max length4
Median length2
Mean length2.0689655
Min length2

Unique

Unique1 ?
Unique (%)3.4%

Sample

1st row4층이하
2nd row1층
3rd row1층
4th row1층
5th row1층

Common Values

ValueCountFrequency (%)
1층 28
96.6%
4층이하 1
 
3.4%

Length

2024-04-06T17:07:10.000880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:07:10.234356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1층 28
96.6%
4층이하 1
 
3.4%

전화번호
Text

UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size364.0 B
2024-04-06T17:07:10.559827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters348
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)100.0%

Sample

1st row063-351-8000
2nd row063-351-1616
3rd row063-351-8575
4th row063-351-8275
5th row063-351-0110
ValueCountFrequency (%)
063-351-8000 1
 
3.4%
063-353-2828 1
 
3.4%
063-351-3456 1
 
3.4%
063-351-2242 1
 
3.4%
063-352-5588 1
 
3.4%
063-353-5677 1
 
3.4%
063-353-0202 1
 
3.4%
063-351-1333 1
 
3.4%
063-351-0276 1
 
3.4%
063-352-0786 1
 
3.4%
Other values (19) 19
65.5%
2024-04-06T17:07:11.143417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 73
21.0%
- 58
16.7%
0 48
13.8%
5 42
12.1%
6 40
11.5%
1 34
9.8%
2 20
 
5.7%
7 14
 
4.0%
8 12
 
3.4%
4 5
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 290
83.3%
Dash Punctuation 58
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 73
25.2%
0 48
16.6%
5 42
14.5%
6 40
13.8%
1 34
11.7%
2 20
 
6.9%
7 14
 
4.8%
8 12
 
4.1%
4 5
 
1.7%
9 2
 
0.7%
Dash Punctuation
ValueCountFrequency (%)
- 58
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 348
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 73
21.0%
- 58
16.7%
0 48
13.8%
5 42
12.1%
6 40
11.5%
1 34
9.8%
2 20
 
5.7%
7 14
 
4.0%
8 12
 
3.4%
4 5
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 348
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 73
21.0%
- 58
16.7%
0 48
13.8%
5 42
12.1%
6 40
11.5%
1 34
9.8%
2 20
 
5.7%
7 14
 
4.0%
8 12
 
3.4%
4 5
 
1.4%

팩스번호
Text

MISSING 

Distinct26
Distinct (%)100.0%
Missing3
Missing (%)10.3%
Memory size364.0 B
2024-04-06T17:07:11.498047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters312
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)100.0%

Sample

1st row063-351-0904
2nd row063-351-8574
3rd row063-351-8279
4th row063-351-0789
5th row063-351-0650
ValueCountFrequency (%)
063-351-0904 1
 
3.8%
063-351-8574 1
 
3.8%
063-351-3455 1
 
3.8%
063-352-5589 1
 
3.8%
063-353-5677 1
 
3.8%
063-900-3992 1
 
3.8%
063-351-1333 1
 
3.8%
063-351-0276 1
 
3.8%
063-352-0788 1
 
3.8%
063-351-6880 1
 
3.8%
Other values (16) 16
61.5%
2024-04-06T17:07:12.129424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 67
21.5%
- 52
16.7%
0 43
13.8%
6 37
11.9%
5 37
11.9%
1 22
 
7.1%
7 16
 
5.1%
2 14
 
4.5%
8 13
 
4.2%
9 8
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 260
83.3%
Dash Punctuation 52
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 67
25.8%
0 43
16.5%
6 37
14.2%
5 37
14.2%
1 22
 
8.5%
7 16
 
6.2%
2 14
 
5.4%
8 13
 
5.0%
9 8
 
3.1%
4 3
 
1.2%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 312
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 67
21.5%
- 52
16.7%
0 43
13.8%
6 37
11.9%
5 37
11.9%
1 22
 
7.1%
7 16
 
5.1%
2 14
 
4.5%
8 13
 
4.2%
9 8
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 312
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 67
21.5%
- 52
16.7%
0 43
13.8%
6 37
11.9%
5 37
11.9%
1 22
 
7.1%
7 16
 
5.1%
2 14
 
4.5%
8 13
 
4.2%
9 8
 
2.6%

Correlations

2024-04-06T17:07:12.350053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명소재지규모전화번호팩스번호
업소명1.0001.0001.0001.0001.000
소재지1.0001.0001.0001.0001.000
규모1.0001.0001.0001.000NaN
전화번호1.0001.0001.0001.0001.000
팩스번호1.0001.000NaN1.0001.000

Missing values

2024-04-06T17:07:07.164575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:07:07.361401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명소재지규모전화번호팩스번호
0장수군보건의료원전북특별자치도 장수군 장수읍 장천로 2474층이하063-351-8000<NA>
1박승민내과의원전북특별자치도 장수군 장수읍 장천로 1801층063-351-1616063-351-0904
2한사랑의원전북특별자치도 장수군 장수읍 시장로 31층063-351-8575063-351-8574
3동아가정의원전북특별자치도 장수군 장수읍 장천로 1751층063-351-8275063-351-8279
4중앙의원전북특별자치도 장수군 장계면 한들로1061층063-351-0110063-351-0789
5연세의원전북특별자치도 장수군 장계면 한들로 1071층063-352-1478063-351-0650
6참가정의학과의원전북특별자치도 장수군 장계면 한들로 931층063-351-7277063-351-7278
7김문철내과의원전북특별자치도 장수군 장계면 한들로 90-11층063-352-5575063-352-5576
8소망한의원전북특별자치도 장수군 장수읍 장천로 1751층063-351-7676063-351-7676
9장수바다한의원전북특별자치도 장수군 장수읍 장천로 1661층063-351-8900063-351-8900
업소명소재지규모전화번호팩스번호
19보건약국전북특별자치도 장수군 산서면 비행로 241층063-351-3440<NA>
20유약국전북특별자치도 장수군 장계면 한들로 1071층063-352-0786063-352-0788
21독일약국전북특별자치도 장수군 장계면 한들로 1081층063-351-0276063-351-0276
22장계백제약국전북특별자치도 장수군 장계면 한들로 931층063-351-1333063-351-1333
23장수종로약국전북특별자치도 장수군 장수읍 장천로 1891층063-353-0202063-900-3992
24태평양약국전북특별자치도 장수군 장수읍 장천로 1751층063-353-5677063-353-5677
25터미널약국전북특별자치도 장수군 장계면 한들로 90-11층063-352-5588063-352-5589
26연수당한약방전북특별자치도 장수군 장수읍 장천로 1711층063-351-2242<NA>
27호남당한약방전북특별자치도 장수군 산서면 보산로1873-11층063-351-3456063-351-3455
28백인당한약방전북특별자치도 장수군 장수읍 시장통길 151층063-351-3351063-351-3352