Overview

Dataset statistics

Number of variables: 6
Number of observations: 81
Missing cells: 2
Missing cells (%): 0.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.0 KiB
Average record size in memory: 50.6 B
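
The missing-cell percentage follows from the counts in this block: 2 missing cells out of 6 × 81 = 486 cells. A minimal sketch of that arithmetic, using hypothetical variable names:

```python
# Counts taken from the dataset statistics above.
n_variables = 6
n_observations = 81
missing_cells = 2

total_cells = n_variables * n_observations        # every cell in the table
missing_pct = 100 * missing_cells / total_cells   # share of cells that are missing

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

Rounded to one decimal place, this agrees with the 0.4% reported for missing cells.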

Variable types

Numeric: 1
Categorical: 1
Text: 3
DateTime: 1

Dataset

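Description: Provides the names, telephone numbers, locations, and other details of pharmaceutical businesses (의약업소) in Geoje-si, Gyeongsangnam-do, for the convenience of tourists and to help stimulate the local economy.
Author: Geoje-si, Gyeongsangnam-do (경상남도 거제시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3079321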

Alerts

기준일자 has constant value "2021-11-15" (Constant)
구분 is highly imbalanced (77.1%) (Imbalance)
전화번호 has 2 (2.5%) missing values (Missing)
순번 has unique values (Unique)
명칭 has unique values (Unique)
소재지(도로명) has unique values (Unique)
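
The 77.1% imbalance figure for 구분 is consistent with an entropy-based score: one minus the Shannon entropy of the class distribution divided by its maximum, log2 of the number of classes. The sketch below assumes that definition (the report itself does not state the formula) and uses the class counts 78 and 3 reported for 구분; `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy in bits."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # assumption: treat a single class as score 0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

score = imbalance_score([78, 3])   # 약국: 78, 한약방: 3
print(f"{score:.1%}")
```

Under this assumed definition a perfectly balanced variable scores 0 and a variable dominated by one class approaches 1.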

Reproduction

Analysis started: 2023-12-10 23:48:55.959237
Analysis finished: 2023-12-10 23:48:56.728943
Duration: 0.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 81
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 41
Minimum: 1
Maximum: 81
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 861.0 B

Quantile statistics

Minimum: 1
5-th percentile: 5
Q1: 21
median: 41
Q3: 61
95-th percentile: 77
Maximum: 81
Range: 80
Interquartile range (IQR): 40

Descriptive statistics

Standard deviation: 23.526581
Coefficient of variation (CV): 0.57381904
Kurtosis: -1.2
Mean: 41
Median Absolute Deviation (MAD): 20
Skewness: 0
Sum: 3321
Variance: 553.5
Monotonicity: Strictly increasing
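
These figures can be reproduced with the standard library. The sketch assumes 순번 is exactly the sequence 1, 2, ..., 81, which the unique, strictly increasing values with minimum 1 and maximum 81 imply. It also assumes the reported variance uses the sample (n - 1) denominator.

```python
import math
import statistics

# Assumption: 순번 is the sequence 1..81 (81 unique integers, min 1, max 81).
values = list(range(1, 82))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the distances from the median.
mad = statistics.median([abs(v - median) for v in values])

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
```

The sample variance reproduces the reported 553.5, which supports the n - 1 assumption; the population variance of the same sequence would be lower.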
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 1.2%
62 | 1 | 1.2%
60 | 1 | 1.2%
59 | 1 | 1.2%
58 | 1 | 1.2%
57 | 1 | 1.2%
56 | 1 | 1.2%
55 | 1 | 1.2%
54 | 1 | 1.2%
53 | 1 | 1.2%
Other values (71) | 71 | 87.7%

Smallest 10 values
Value | Count | Frequency (%)
1 | 1 | 1.2%
2 | 1 | 1.2%
3 | 1 | 1.2%
4 | 1 | 1.2%
5 | 1 | 1.2%
6 | 1 | 1.2%
7 | 1 | 1.2%
8 | 1 | 1.2%
9 | 1 | 1.2%
10 | 1 | 1.2%

Largest 10 values
Value | Count | Frequency (%)
81 | 1 | 1.2%
80 | 1 | 1.2%
79 | 1 | 1.2%
78 | 1 | 1.2%
77 | 1 | 1.2%
76 | 1 | 1.2%
75 | 1 | 1.2%
74 | 1 | 1.2%
73 | 1 | 1.2%
72 | 1 | 1.2%

구분 (category)
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 780.0 B
약국 (pharmacy): 78
한약방 (herbal medicine shop): 3

Length

Max length: 3
Median length: 2
Mean length: 2.037037
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 약국
2nd row: 약국
3rd row: 약국
4th row: 약국
5th row: 약국

Common Values

Value | Count | Frequency (%)
약국 | 78 | 96.3%
한약방 | 3 | 3.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
약국 | 78 | 96.3%
한약방 | 3 | 3.7%

명칭 (name)
Text

UNIQUE 

Distinct: 81
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 780.0 B

Length

Max length: 8
Median length: 7
Mean length: 4.9135802
Min length: 3

Characters and Unicode

Total characters: 398
Distinct characters: 114
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 81
Unique (%): 100.0%

Sample

1st row: 아주약국
2nd row: 진약국
3rd row: 사곡조제약국
4th row: 옥포로약국
5th row: 미리내약국

Value | Count | Frequency (%)
아주약국 | 1 | 1.2%
장평온누리약국 | 1 | 1.2%
대교건강약국 | 1 | 1.2%
거제프라자약국 | 1 | 1.2%
만세약국 | 1 | 1.2%
하담약국 | 1 | 1.2%
새조은약국 | 1 | 1.2%
건강약국 | 1 | 1.2%
신세계약국 | 1 | 1.2%
수월온누리약국 | 1 | 1.2%
Other values (71) | 71 | 87.7%

Most occurring characters

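Value | Count | Frequency (%)
(character not preserved) | 81 | 20.4%
(character not preserved) | 78 | 19.6%
(character not preserved) | 9 | 2.3%
(character not preserved) | 8 | 2.0%
(character not preserved) | 7 | 1.8%
(character not preserved) | 7 | 1.8%
(character not preserved) | 7 | 1.8%
(character not preserved) | 6 | 1.5%
(character not preserved) | 5 | 1.3%
(character not preserved) | 5 | 1.3%
Other values (104) | 185 | 46.5%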

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 397 | 99.7%
Lowercase Letter | 1 | 0.3%

Most frequent character per category

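Other Letter
Value | Count | Frequency (%)
(character not preserved) | 81 | 20.4%
(character not preserved) | 78 | 19.6%
(character not preserved) | 9 | 2.3%
(character not preserved) | 8 | 2.0%
(character not preserved) | 7 | 1.8%
(character not preserved) | 7 | 1.8%
(character not preserved) | 7 | 1.8%
(character not preserved) | 6 | 1.5%
(character not preserved) | 5 | 1.3%
(character not preserved) | 5 | 1.3%
Other values (103) | 184 | 46.3%

Lowercase Letter
Value | Count | Frequency (%)
i | 1 | 100.0%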

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 397 | 99.7%
Latin | 1 | 0.3%

Most frequent character per script

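Hangul
Value | Count | Frequency (%)
(character not preserved) | 81 | 20.4%
(character not preserved) | 78 | 19.6%
(character not preserved) | 9 | 2.3%
(character not preserved) | 8 | 2.0%
(character not preserved) | 7 | 1.8%
(character not preserved) | 7 | 1.8%
(character not preserved) | 7 | 1.8%
(character not preserved) | 6 | 1.5%
(character not preserved) | 5 | 1.3%
(character not preserved) | 5 | 1.3%
Other values (103) | 184 | 46.3%

Latin
Value | Count | Frequency (%)
i | 1 | 100.0%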

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 397 | 99.7%
ASCII | 1 | 0.3%

Most frequent character per block

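Hangul
Value | Count | Frequency (%)
(character not preserved) | 81 | 20.4%
(character not preserved) | 78 | 19.6%
(character not preserved) | 9 | 2.3%
(character not preserved) | 8 | 2.0%
(character not preserved) | 7 | 1.8%
(character not preserved) | 7 | 1.8%
(character not preserved) | 7 | 1.8%
(character not preserved) | 6 | 1.5%
(character not preserved) | 5 | 1.3%
(character not preserved) | 5 | 1.3%
Other values (103) | 184 | 46.3%

ASCII
Value | Count | Frequency (%)
i | 1 | 100.0%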

전화번호 (telephone number)
Text

MISSING 

Distinct: 79
Distinct (%): 100.0%
Missing: 2
Missing (%): 2.5%
Memory size: 780.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.037975
Min length: 12

Characters and Unicode

Total characters: 951
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 79
Unique (%): 100.0%

Sample

1st row: 055-637-9723
2nd row: 055-635-9855
3rd row: 055-688-3238
4th row: 055-633-7903
5th row: 055-682-0854

Value | Count | Frequency (%)
055-637-9723 | 1 | 1.3%
055-688-3044 | 1 | 1.3%
055-635-0069 | 1 | 1.3%
055-638-1351 | 1 | 1.3%
055-636-6248 | 1 | 1.3%
055-632-7323 | 1 | 1.3%
055-632-7870 | 1 | 1.3%
055-637-2177 | 1 | 1.3%
055-636-6888 | 1 | 1.3%
055-681-5180 | 1 | 1.3%
Other values (69) | 69 | 87.3%

Most occurring characters

Value | Count | Frequency (%)
5 | 192 | 20.2%
- | 158 | 16.6%
0 | 121 | 12.7%
6 | 108 | 11.4%
3 | 89 | 9.4%
8 | 87 | 9.1%
2 | 51 | 5.4%
7 | 49 | 5.2%
1 | 38 | 4.0%
4 | 30 | 3.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 793 | 83.4%
Dash Punctuation | 158 | 16.6%
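
The character counts for 전화번호 can be cross-checked against the length statistics. The sketch below only uses figures reported in this section (81 observations, 2 missing, 951 characters, 793 digits, 158 dashes, lengths between 12 and 13); the variable names are hypothetical.

```python
# Figures taken from the 전화번호 section of the report.
n_values = 81 - 2              # observations minus missing values
total_chars = 951
digits, dashes = 793, 158
min_len, max_len = 12, 13

chars_match = (digits + dashes == total_chars)
dashes_per_number = dashes / n_values        # average dashes per phone number
mean_len = total_chars / n_values            # should match the reported mean length

# Lengths are whole numbers between 12 and 13, so every character beyond
# 12 per value must come from a 13-character number.
n_long = total_chars - min_len * n_values
n_short = n_values - n_long

print(chars_match, dashes_per_number, round(mean_len, 6), n_short, n_long)
```

Read this way, the figures are mutually consistent: two dashes per number on average, and three of the 79 non-missing numbers are 13 characters long.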

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 192 | 24.2%
0 | 121 | 15.3%
6 | 108 | 13.6%
3 | 89 | 11.2%
8 | 87 | 11.0%
2 | 51 | 6.4%
7 | 49 | 6.2%
1 | 38 | 4.8%
4 | 30 | 3.8%
9 | 28 | 3.5%

Dash Punctuation
Value | Count | Frequency (%)
- | 158 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 951 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
5 | 192 | 20.2%
- | 158 | 16.6%
0 | 121 | 12.7%
6 | 108 | 11.4%
3 | 89 | 9.4%
8 | 87 | 9.1%
2 | 51 | 5.4%
7 | 49 | 5.2%
1 | 38 | 4.0%
4 | 30 | 3.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 951 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
5 | 192 | 20.2%
- | 158 | 16.6%
0 | 121 | 12.7%
6 | 108 | 11.4%
3 | 89 | 9.4%
8 | 87 | 9.1%
2 | 51 | 5.4%
7 | 49 | 5.2%
1 | 38 | 4.0%
4 | 30 | 3.2%

소재지(도로명) (location, road-name address)
Text

UNIQUE

Distinct: 81
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 780.0 B

Length

Max length: 58
Median length: 37
Mean length: 25.197531
Min length: 19

Characters and Unicode

Total characters: 2041
Distinct characters: 94
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 3

Unique

Unique: 81
Unique (%): 100.0%

Sample

1st row: 경상남도 거제시 아주1로2길 62,명주빌딩 1층 (아주동)
2nd row: 경상남도 거제시 수양로 458,1층 (수월동)
3rd row: 경상남도 거제시 사등면 두동로 14
4th row: 경상남도 거제시 옥포로 232 (옥포동)
5th row: 경상남도 거제시 상동5길 79,1층 4호 (상동동)

Value | Count | Frequency (%)
경상남도 | 81 | 19.4%
거제시 | 81 | 19.4%
고현동 | 25 | 6.0%
옥포동 | 13 | 3.1%
거제중앙로 | 12 | 2.9%
옥포로 | 8 | 1.9%
상동동 | 6 | 1.4%
서문로 | 5 | 1.2%
장평동 | 4 | 1.0%
고현로 | 4 | 1.0%
Other values (136) | 178 | 42.7%
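
Many of the frequent tokens in this table (고현동, 옥포동, 상동동, 장평동) are district names that appear in a trailing parenthesis in the address. A minimal sketch of extracting that suffix from the five sample rows shown in this section; `trailing_district` is a hypothetical helper, and the pattern is an assumption based on those samples only.

```python
import re

# The five sample rows listed for 소재지(도로명) in this section.
addresses = [
    "경상남도 거제시 아주1로2길 62,명주빌딩 1층 (아주동)",
    "경상남도 거제시 수양로 458,1층 (수월동)",
    "경상남도 거제시 사등면 두동로 14",
    "경상남도 거제시 옥포로 232 (옥포동)",
    "경상남도 거제시 상동5길 79,1층 4호 (상동동)",
]

# Assumed pattern: the district sits inside a final "(...)" at the end of the string.
TRAILING_PARENS = re.compile(r"\(([^()]*)\)\s*$")

def trailing_district(address):
    """Return the text inside a trailing parenthesis, or None if there is none."""
    match = TRAILING_PARENS.search(address)
    return match.group(1) if match else None

districts = [trailing_district(a) for a in addresses]
print(districts)
```

Addresses without a parenthesised suffix, such as the 사등면 row, yield None. Some rows in the full data put a building name after a comma inside the parenthesis, so a real cleaning step would need to split on that comma as well.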

Most occurring characters

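Value | Count | Frequency (%)
(space) | 336 | 16.5%
(character not preserved) | 105 | 5.1%
(character not preserved) | 104 | 5.1%
(character not preserved) | 94 | 4.6%
(character not preserved) | 87 | 4.3%
1 | 85 | 4.2%
(character not preserved) | 84 | 4.1%
(character not preserved) | 82 | 4.0%
(character not preserved) | 82 | 4.0%
(character not preserved) | 81 | 4.0%
Other values (84) | 901 | 44.1%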

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1225 | 60.0%
Space Separator | 336 | 16.5%
Decimal Number | 295 | 14.5%
Close Punctuation | 70 | 3.4%
Open Punctuation | 70 | 3.4%
Other Punctuation | 34 | 1.7%
Dash Punctuation | 11 | 0.5%

Most frequent character per category

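Other Letter
Value | Count | Frequency (%)
(character not preserved) | 105 | 8.6%
(character not preserved) | 104 | 8.5%
(character not preserved) | 94 | 7.7%
(character not preserved) | 87 | 7.1%
(character not preserved) | 84 | 6.9%
(character not preserved) | 82 | 6.7%
(character not preserved) | 82 | 6.7%
(character not preserved) | 81 | 6.6%
(character not preserved) | 73 | 6.0%
(character not preserved) | 38 | 3.1%
Other values (69) | 395 | 32.2%

Decimal Number
Value | Count | Frequency (%)
1 | 85 | 28.8%
2 | 45 | 15.3%
4 | 27 | 9.2%
8 | 26 | 8.8%
3 | 23 | 7.8%
5 | 22 | 7.5%
0 | 20 | 6.8%
9 | 19 | 6.4%
6 | 15 | 5.1%
7 | 13 | 4.4%

Space Separator
Value | Count | Frequency (%)
(space) | 336 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 70 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 70 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
(character not preserved) | 34 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 11 | 100.0%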

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1225 | 60.0%
Common | 816 | 40.0%

Most frequent character per script

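Hangul
Value | Count | Frequency (%)
(character not preserved) | 105 | 8.6%
(character not preserved) | 104 | 8.5%
(character not preserved) | 94 | 7.7%
(character not preserved) | 87 | 7.1%
(character not preserved) | 84 | 6.9%
(character not preserved) | 82 | 6.7%
(character not preserved) | 82 | 6.7%
(character not preserved) | 81 | 6.6%
(character not preserved) | 73 | 6.0%
(character not preserved) | 38 | 3.1%
Other values (69) | 395 | 32.2%

Common
Value | Count | Frequency (%)
(space) | 336 | 41.2%
1 | 85 | 10.4%
) | 70 | 8.6%
( | 70 | 8.6%
2 | 45 | 5.5%
(character not preserved) | 34 | 4.2%
4 | 27 | 3.3%
8 | 26 | 3.2%
3 | 23 | 2.8%
5 | 22 | 2.7%
Other values (5) | 78 | 9.6%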

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1225 | 60.0%
ASCII | 782 | 38.3%
None | 34 | 1.7%

Most frequent character per block

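ASCII
Value | Count | Frequency (%)
(space) | 336 | 43.0%
1 | 85 | 10.9%
) | 70 | 9.0%
( | 70 | 9.0%
2 | 45 | 5.8%
4 | 27 | 3.5%
8 | 26 | 3.3%
3 | 23 | 2.9%
5 | 22 | 2.8%
0 | 20 | 2.6%
Other values (4) | 58 | 7.4%

Hangul
Value | Count | Frequency (%)
(character not preserved) | 105 | 8.6%
(character not preserved) | 104 | 8.5%
(character not preserved) | 94 | 7.7%
(character not preserved) | 87 | 7.1%
(character not preserved) | 84 | 6.9%
(character not preserved) | 82 | 6.7%
(character not preserved) | 82 | 6.7%
(character not preserved) | 81 | 6.6%
(character not preserved) | 73 | 6.0%
(character not preserved) | 38 | 3.1%
Other values (69) | 395 | 32.2%

None
Value | Count | Frequency (%)
(character not preserved) | 34 | 100.0%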

기준일자 (reference date)
Date

CONSTANT 

Distinct: 1
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 780.0 B
Minimum: 2021-11-15 00:00:00
Maximum: 2021-11-15 00:00:00
Histogram with fixed size bins (bins=1)

Interactions

Interaction plot (image not preserved).

Correlations

Variable | 순번 | 구분 | 명칭 | 전화번호 | 소재지(도로명)
순번 | 1.000 | 0.668 | 1.000 | 1.000 | 1.000
구분 | 0.668 | 1.000 | 1.000 | 1.000 | 1.000
명칭 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
전화번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소재지(도로명) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 순번 | 구분
순번 | 1.000 | 0.491
구분 | 0.491 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First 10 rows
Index | 순번 | 구분 | 명칭 | 전화번호 | 소재지(도로명) | 기준일자
0 | 1 | 약국 | 아주약국 | <NA> | 경상남도 거제시 아주1로2길 62,명주빌딩 1층 (아주동) | 2021-11-15
1 | 2 | 약국 | 진약국 | 055-637-9723 | 경상남도 거제시 수양로 458,1층 (수월동) | 2021-11-15
2 | 3 | 약국 | 사곡조제약국 | 055-635-9855 | 경상남도 거제시 사등면 두동로 14 | 2021-11-15
3 | 4 | 약국 | 옥포로약국 | 055-688-3238 | 경상남도 거제시 옥포로 232 (옥포동) | 2021-11-15
4 | 5 | 약국 | 미리내약국 | 055-633-7903 | 경상남도 거제시 상동5길 79,1층 4호 (상동동) | 2021-11-15
5 | 6 | 약국 | 튼튼i약국 | <NA> | 경상남도 거제시 장평1로 7,거제 장평 유림노르웨이숲 상가동 215호 (장평동,거제 장평 유림노르웨이숲) | 2021-11-15
6 | 7 | 약국 | 아주조은약국 | 055-682-0854 | 경상남도 거제시 아주2로 45,1층 (아주동) | 2021-11-15
7 | 8 | 약국 | 다원온누리약국 | 055-638-4805 | 경상남도 거제시 거제중앙로 1938,1층 102,103호 (고현동) | 2021-11-15
8 | 9 | 약국 | 고은약국 | 055-633-3375 | 경상남도 거제시 고현로 88 (고현동) | 2021-11-15
9 | 10 | 약국 | 한마음약국 | 055-688-6145 | 경상남도 거제시 옥포로 221,동흥빌딩 1층 (옥포동) | 2021-11-15

Last 10 rows
Index | 순번 | 구분 | 명칭 | 전화번호 | 소재지(도로명) | 기준일자
71 | 72 | 약국 | 희보약국 | 055-681-2441 | 경상남도 거제시 능포로 134 (능포동) | 2021-11-15
72 | 73 | 약국 | 모범약국 | 055-687-2431 | 경상남도 거제시 옥포대첩로 33 (옥포동) | 2021-11-15
73 | 74 | 약국 | 대우약국 | 055-681-2695 | 경상남도 거제시 장승포로1길 4 (장승포동) | 2021-11-15
74 | 75 | 약국 | 옥수당약국 | 055-682-1616 | 경상남도 거제시 옥수로10길 40 (능포동) | 2021-11-15
75 | 76 | 약국 | 고현약국 | 055-635-2125 | 경상남도 거제시 거제중앙로 1867 (고현동) | 2021-11-15
76 | 77 | 약국 | 삼성약국 | 055-635-2408 | 경상남도 거제시 중곡로 45 (고현동) | 2021-11-15
77 | 78 | 약국 | 우리약국 | 055-633-4178 | 경상남도 거제시 거제면 읍내로2길 28 | 2021-11-15
78 | 79 | 한약방 | 세종당한약방 | 055-635-7800 | 경상남도 거제시 사등면 거제남서로 5340 | 2021-11-15
79 | 80 | 한약방 | 수인당한약방 | 055-632-8363 | 경상남도 거제시 서문로 79 (고현동) | 2021-11-15
80 | 81 | 한약방 | 동신당한약방 | 055-633-1496 | 경상남도 거제시 고현로 117 (고현동) | 2021-11-15