Overview

Dataset statistics

Number of variables5
Number of observations832
Missing cells544
Missing cells (%)13.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory33.4 KiB
Average record size in memory41.2 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description광주 서구 관내 의료기기 판매업소 현황입니다. 영업소명, 영업소재지(도로명)등의 항목이 제공됩니다. 또한 영업소에는 주식회사와 대리점, 개인사업장등이 포함되어 있습니다.
Author광주광역시 서구
URLhttps://www.data.go.kr/data/15011794/fileData.do

Alerts

종별 is highly imbalanced (65.8%)Imbalance
영업소전화번호 has 544 (65.4%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-23 07:28:55.821686
Analysis finished2023-12-23 07:29:00.102853
Duration4.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct832
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean416.5
Minimum1
Maximum832
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.4 KiB
2023-12-23T07:29:00.420959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile42.55
Q1208.75
median416.5
Q3624.25
95-th percentile790.45
Maximum832
Range831
Interquartile range (IQR)415.5

Descriptive statistics

Standard deviation240.32201
Coefficient of variation (CV)0.57700362
Kurtosis-1.2
Mean416.5
Median Absolute Deviation (MAD)208
Skewness0
Sum346528
Variance57754.667
MonotonicityStrictly increasing
2023-12-23T07:29:01.530488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
418 1
 
0.1%
550 1
 
0.1%
551 1
 
0.1%
552 1
 
0.1%
553 1
 
0.1%
554 1
 
0.1%
555 1
 
0.1%
556 1
 
0.1%
557 1
 
0.1%
Other values (822) 822
98.8%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
832 1
0.1%
831 1
0.1%
830 1
0.1%
829 1
0.1%
828 1
0.1%
827 1
0.1%
826 1
0.1%
825 1
0.1%
824 1
0.1%
823 1
0.1%

종별
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
판매업
779 
판매(임대)업
 
53

Length

Max length7
Median length3
Mean length3.2548077
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row판매업
2nd row판매업
3rd row판매업
4th row판매업
5th row판매업

Common Values

ValueCountFrequency (%)
판매업 779
93.6%
판매(임대)업 53
 
6.4%

Length

2023-12-23T07:29:02.108407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-23T07:29:03.380676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
판매업 779
93.6%
판매(임대)업 53
 
6.4%
Distinct827
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2023-12-23T07:29:05.017739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length16
Mean length8.2836538
Min length2

Characters and Unicode

Total characters6892
Distinct characters479
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique822 ?
Unique (%)98.8%

Sample

1st row클리어하우스
2nd row지에스25 금호마륵점
3rd rowenews 공간디자인
4th row알고뷰티
5th row티에스홀딩스
ValueCountFrequency (%)
세븐일레븐 55
 
4.6%
주식회사 54
 
4.5%
gs25 38
 
3.1%
씨유 16
 
1.3%
이마트24 15
 
1.2%
씨제이올리브영(주 10
 
0.8%
상무점 9
 
0.7%
광주 8
 
0.7%
지에스25 7
 
0.6%
주)하이프라자 6
 
0.5%
Other values (906) 990
82.0%
2023-12-23T07:29:07.894111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
377
 
5.5%
369
 
5.4%
249
 
3.6%
190
 
2.8%
183
 
2.7%
182
 
2.6%
174
 
2.5%
174
 
2.5%
) 171
 
2.5%
( 170
 
2.5%
Other values (469) 4653
67.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5672
82.3%
Space Separator 377
 
5.5%
Uppercase Letter 217
 
3.1%
Decimal Number 185
 
2.7%
Close Punctuation 171
 
2.5%
Open Punctuation 170
 
2.5%
Lowercase Letter 86
 
1.2%
Other Punctuation 9
 
0.1%
Other Symbol 3
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
369
 
6.5%
249
 
4.4%
190
 
3.3%
183
 
3.2%
182
 
3.2%
174
 
3.1%
174
 
3.1%
120
 
2.1%
104
 
1.8%
96
 
1.7%
Other values (413) 3831
67.5%
Uppercase Letter
ValueCountFrequency (%)
S 64
29.5%
G 55
25.3%
C 16
 
7.4%
U 12
 
5.5%
M 11
 
5.1%
O 7
 
3.2%
J 7
 
3.2%
B 6
 
2.8%
D 5
 
2.3%
K 5
 
2.3%
Other values (11) 29
13.4%
Lowercase Letter
ValueCountFrequency (%)
e 12
14.0%
a 10
11.6%
i 8
9.3%
r 7
8.1%
s 7
8.1%
n 7
8.1%
t 6
 
7.0%
o 5
 
5.8%
l 5
 
5.8%
k 3
 
3.5%
Other values (10) 16
18.6%
Decimal Number
ValueCountFrequency (%)
2 88
47.6%
5 66
35.7%
4 18
 
9.7%
1 5
 
2.7%
3 4
 
2.2%
6 2
 
1.1%
8 1
 
0.5%
0 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
. 6
66.7%
& 3
33.3%
Space Separator
ValueCountFrequency (%)
377
100.0%
Close Punctuation
ValueCountFrequency (%)
) 171
100.0%
Open Punctuation
ValueCountFrequency (%)
( 170
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5675
82.3%
Common 914
 
13.3%
Latin 303
 
4.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
369
 
6.5%
249
 
4.4%
190
 
3.3%
183
 
3.2%
182
 
3.2%
174
 
3.1%
174
 
3.1%
120
 
2.1%
104
 
1.8%
96
 
1.7%
Other values (414) 3834
67.6%
Latin
ValueCountFrequency (%)
S 64
21.1%
G 55
18.2%
C 16
 
5.3%
e 12
 
4.0%
U 12
 
4.0%
M 11
 
3.6%
a 10
 
3.3%
i 8
 
2.6%
r 7
 
2.3%
O 7
 
2.3%
Other values (31) 101
33.3%
Common
ValueCountFrequency (%)
377
41.2%
) 171
18.7%
( 170
18.6%
2 88
 
9.6%
5 66
 
7.2%
4 18
 
2.0%
. 6
 
0.7%
1 5
 
0.5%
3 4
 
0.4%
& 3
 
0.3%
Other values (4) 6
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5672
82.3%
ASCII 1217
 
17.7%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
377
31.0%
) 171
14.1%
( 170
14.0%
2 88
 
7.2%
5 66
 
5.4%
S 64
 
5.3%
G 55
 
4.5%
4 18
 
1.5%
C 16
 
1.3%
e 12
 
1.0%
Other values (45) 180
14.8%
Hangul
ValueCountFrequency (%)
369
 
6.5%
249
 
4.4%
190
 
3.3%
183
 
3.2%
182
 
3.2%
174
 
3.1%
174
 
3.1%
120
 
2.1%
104
 
1.8%
96
 
1.7%
Other values (413) 3831
67.5%
None
ValueCountFrequency (%)
3
100.0%
Distinct806
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size6.6 KiB
2023-12-23T07:29:08.956885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length46.5
Mean length31.378606
Min length19

Characters and Unicode

Total characters26107
Distinct characters285
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique783 ?
Unique (%)94.1%

Sample

1st row광주광역시 서구 유림로51번길 18, 1층 (쌍촌동)
2nd row광주광역시 서구 금화로85번길 30-14, eleven 1층 (금호동)
3rd row광주광역시 서구 유림로98번길 43, 503호 (동천동)
4th row광주광역시 서구 상무화원로12번길 4, 1층 (치평동)
5th row광주광역시 서구 유덕로 104, (주)대동에스엔엘) 2층 (덕흥동)
ValueCountFrequency (%)
광주광역시 832
 
15.7%
서구 831
 
15.7%
1층 256
 
4.8%
치평동 166
 
3.1%
2층 123
 
2.3%
쌍촌동 115
 
2.2%
화정동 111
 
2.1%
풍암동 89
 
1.7%
금호동 66
 
1.2%
농성동 63
 
1.2%
Other values (990) 2657
50.0%
2023-12-23T07:29:10.972026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4480
 
17.2%
1741
 
6.7%
1 1208
 
4.6%
951
 
3.6%
885
 
3.4%
884
 
3.4%
846
 
3.2%
846
 
3.2%
) 844
 
3.2%
( 843
 
3.2%
Other values (275) 12579
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14586
55.9%
Space Separator 4480
 
17.2%
Decimal Number 4308
 
16.5%
Close Punctuation 844
 
3.2%
Open Punctuation 843
 
3.2%
Other Punctuation 805
 
3.1%
Dash Punctuation 182
 
0.7%
Uppercase Letter 34
 
0.1%
Lowercase Letter 19
 
0.1%
Math Symbol 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1741
 
11.9%
951
 
6.5%
885
 
6.1%
884
 
6.1%
846
 
5.8%
846
 
5.8%
832
 
5.7%
807
 
5.5%
573
 
3.9%
357
 
2.4%
Other values (239) 5864
40.2%
Uppercase Letter
ValueCountFrequency (%)
B 6
17.6%
S 5
14.7%
G 4
11.8%
I 3
8.8%
K 3
8.8%
E 3
8.8%
C 3
8.8%
W 2
 
5.9%
L 2
 
5.9%
V 2
 
5.9%
Decimal Number
ValueCountFrequency (%)
1 1208
28.0%
2 623
14.5%
0 417
 
9.7%
3 397
 
9.2%
4 366
 
8.5%
5 299
 
6.9%
9 264
 
6.1%
7 252
 
5.8%
6 246
 
5.7%
8 236
 
5.5%
Lowercase Letter
ValueCountFrequency (%)
e 5
26.3%
n 3
15.8%
l 3
15.8%
a 2
 
10.5%
r 2
 
10.5%
t 2
 
10.5%
k 1
 
5.3%
v 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
, 802
99.6%
. 3
 
0.4%
Space Separator
ValueCountFrequency (%)
4480
100.0%
Close Punctuation
ValueCountFrequency (%)
) 844
100.0%
Open Punctuation
ValueCountFrequency (%)
( 843
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 182
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14586
55.9%
Common 11468
43.9%
Latin 53
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1741
 
11.9%
951
 
6.5%
885
 
6.1%
884
 
6.1%
846
 
5.8%
846
 
5.8%
832
 
5.7%
807
 
5.5%
573
 
3.9%
357
 
2.4%
Other values (239) 5864
40.2%
Latin
ValueCountFrequency (%)
B 6
 
11.3%
e 5
 
9.4%
S 5
 
9.4%
G 4
 
7.5%
n 3
 
5.7%
I 3
 
5.7%
l 3
 
5.7%
K 3
 
5.7%
E 3
 
5.7%
C 3
 
5.7%
Other values (9) 15
28.3%
Common
ValueCountFrequency (%)
4480
39.1%
1 1208
 
10.5%
) 844
 
7.4%
( 843
 
7.4%
, 802
 
7.0%
2 623
 
5.4%
0 417
 
3.6%
3 397
 
3.5%
4 366
 
3.2%
5 299
 
2.6%
Other values (7) 1189
 
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14586
55.9%
ASCII 11521
44.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4480
38.9%
1 1208
 
10.5%
) 844
 
7.3%
( 843
 
7.3%
, 802
 
7.0%
2 623
 
5.4%
0 417
 
3.6%
3 397
 
3.4%
4 366
 
3.2%
5 299
 
2.6%
Other values (26) 1242
 
10.8%
Hangul
ValueCountFrequency (%)
1741
 
11.9%
951
 
6.5%
885
 
6.1%
884
 
6.1%
846
 
5.8%
846
 
5.8%
832
 
5.7%
807
 
5.5%
573
 
3.9%
357
 
2.4%
Other values (239) 5864
40.2%

영업소전화번호
Text

MISSING 

Distinct276
Distinct (%)95.8%
Missing544
Missing (%)65.4%
Memory size6.6 KiB
2023-12-23T07:29:12.322502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.017361
Min length9

Characters and Unicode

Total characters3461
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique267 ?
Unique (%)92.7%

Sample

1st row062-609-7512
2nd row062-351-9396
3rd row070-7807-1241
4th row062-369-9296
5th row062-603-0089
ValueCountFrequency (%)
062-372-9005 5
 
1.7%
062-268-4470 2
 
0.7%
062-652-6622 2
 
0.7%
062-655-0881 2
 
0.7%
062-369-7979 2
 
0.7%
062-375-1500 2
 
0.7%
062-382-5860 2
 
0.7%
062-673-9913 2
 
0.7%
062-375-0208 2
 
0.7%
062-430-0011 1
 
0.3%
Other values (267) 267
92.4%
2023-12-23T07:29:14.519618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 573
16.6%
0 511
14.8%
6 510
14.7%
2 478
13.8%
3 300
8.7%
5 240
6.9%
1 196
 
5.7%
7 194
 
5.6%
8 191
 
5.5%
4 143
 
4.1%
Other values (2) 125
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2887
83.4%
Dash Punctuation 573
 
16.6%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 511
17.7%
6 510
17.7%
2 478
16.6%
3 300
10.4%
5 240
8.3%
1 196
 
6.8%
7 194
 
6.7%
8 191
 
6.6%
4 143
 
5.0%
9 124
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 573
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3461
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 573
16.6%
0 511
14.8%
6 510
14.7%
2 478
13.8%
3 300
8.7%
5 240
6.9%
1 196
 
5.7%
7 194
 
5.6%
8 191
 
5.5%
4 143
 
4.1%
Other values (2) 125
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3461
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 573
16.6%
0 511
14.8%
6 510
14.7%
2 478
13.8%
3 300
8.7%
5 240
6.9%
1 196
 
5.7%
7 194
 
5.6%
8 191
 
5.5%
4 143
 
4.1%
Other values (2) 125
 
3.6%

Interactions

2023-12-23T07:28:58.073732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-23T07:29:15.095085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번종별
순번1.0000.351
종별0.3511.000
2023-12-23T07:29:15.651448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번종별
순번1.0000.268
종별0.2681.000

Missing values

2023-12-23T07:28:58.807592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-23T07:28:59.631371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번종별영업소명영업소소재지(도로명)영업소전화번호
01판매업클리어하우스광주광역시 서구 유림로51번길 18, 1층 (쌍촌동)<NA>
12판매업지에스25 금호마륵점광주광역시 서구 금화로85번길 30-14, eleven 1층 (금호동)<NA>
23판매업enews 공간디자인광주광역시 서구 유림로98번길 43, 503호 (동천동)<NA>
34판매업알고뷰티광주광역시 서구 상무화원로12번길 4, 1층 (치평동)<NA>
45판매업티에스홀딩스광주광역시 서구 유덕로 104, (주)대동에스엔엘) 2층 (덕흥동)<NA>
56판매업지에스25 염주센터점광주광역시 서구 월드컵4강로 66-1, 1층 (화정동)<NA>
67판매업세븐일레븐 광주쌍촌엘리체점광주광역시 서구 월드컵4강로197번길 5-4 (쌍촌동)<NA>
78판매업With medical광주광역시 서구 칠성로 89, 109호 (쌍촌동, 광천모아엘가)<NA>
89판매업코리아세븐 광주광천터미널점광주광역시 서구 무진대로 904, 유스퀘어(광천터미널) 1층 (광천동)<NA>
910판매업주식회사 인사이드컴퍼니광주광역시 서구 화개1로 8, 5층 2호 (금호동)<NA>
순번종별영업소명영업소소재지(도로명)영업소전화번호
822823판매업독일디지털보청기광주광역시 서구 상무대로 1081-5 (화정동)062-365-3456
823824판매업토마토의료기광주광역시 서구 내방로398번길 5, C동 1층 (농성동)062-447-4747
824825판매업(주)롯데하이마트 상무점광주광역시 서구 상무대로 820, 1층 (치평동)062-371-1177
825826판매업지오메디칼광주광역시 서구 쌍학로 43, 상가동 201호 (쌍촌동, 쌍촌주공아파트)062-382-5860
826827판매업인화메디칼광주광역시 서구 상무중앙로 71, 4층 (치평동)062-375-1500
827828판매업(주)성암메테크광주광역시 서구 풍암중앙로86번길 36, 1층 (풍암동)062-653-9621
828829판매업현진엠엔에스광주광역시 서구 매월2로 53 (매월동, 광주산업용재유통센터)062-412-5341
829830판매업동강엑스선상사광주광역시 서구 매월2로 53, 10동 1층 136호 (매월동, 광주산업용재유통센터)062-603-4115
830831판매업케이지메디칼광주광역시 서구 풍암신흥로62번길 3-9, 1층 (풍암동, (K.G랜드))062-652-2470
831832판매업(주)천우의료기상사광주광역시 서구 풍서좌로 179 (매월동)062-529-6871