Overview

Dataset statistics

Number of variables6
Number of observations33
Missing cells6
Missing cells (%)3.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory53.0 B

Variable types

Numeric1
Text3
Categorical2

Dataset

Description경상남도 하동군에 있는 의료기기판매업 현황 (연번, 업체명, 읍면, 소재지, 전화번호 등)의 정보를 제공하고 있습니다.
URLhttps://www.data.go.kr/data/15086342/fileData.do

Alerts

읍면 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
기타유의사항 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 is highly overall correlated with 읍면 and 1 other fieldsHigh correlation
기타유의사항 is highly imbalanced (54.4%)Imbalance
전화번호 has 6 (18.2%) missing valuesMissing
연번 has unique valuesUnique
업체명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:11:04.403747
Analysis finished2023-12-12 19:11:05.073996
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17
Minimum1
Maximum33
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-13T04:11:05.145422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.6
Q19
median17
Q325
95-th percentile31.4
Maximum33
Range32
Interquartile range (IQR)16

Descriptive statistics

Standard deviation9.6695398
Coefficient of variation (CV)0.56879646
Kurtosis-1.2
Mean17
Median Absolute Deviation (MAD)8
Skewness0
Sum561
Variance93.5
MonotonicityStrictly increasing
2023-12-13T04:11:05.288033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
1 1
 
3.0%
26 1
 
3.0%
20 1
 
3.0%
21 1
 
3.0%
22 1
 
3.0%
23 1
 
3.0%
24 1
 
3.0%
25 1
 
3.0%
27 1
 
3.0%
2 1
 
3.0%
Other values (23) 23
69.7%
ValueCountFrequency (%)
1 1
3.0%
2 1
3.0%
3 1
3.0%
4 1
3.0%
5 1
3.0%
6 1
3.0%
7 1
3.0%
8 1
3.0%
9 1
3.0%
10 1
3.0%
ValueCountFrequency (%)
33 1
3.0%
32 1
3.0%
31 1
3.0%
30 1
3.0%
29 1
3.0%
28 1
3.0%
27 1
3.0%
26 1
3.0%
25 1
3.0%
24 1
3.0%

업체명
Text

UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size396.0 B
2023-12-13T04:11:05.521267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length7.6060606
Min length2

Characters and Unicode

Total characters251
Distinct characters107
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)100.0%

Sample

1st row세븐일레븐 하동점
2nd row다이소 하동점
3rd row씨유뉴하동터미널점
4th row씨유하동녹차마을점
5th rowGS25(하동연화점)
ValueCountFrequency (%)
하동점 3
 
7.5%
씨유 2
 
5.0%
gs25 2
 
5.0%
씨유하동진교점 1
 
2.5%
씨유하동금남대박점 1
 
2.5%
남해대교점 1
 
2.5%
cu전도365점 1
 
2.5%
하동금성점 1
 
2.5%
하동친환경 1
 
2.5%
세븐일레븐 1
 
2.5%
Other values (26) 26
65.0%
2023-12-13T04:11:05.939859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18
 
7.2%
16
 
6.4%
16
 
6.4%
8
 
3.2%
8
 
3.2%
8
 
3.2%
7
 
2.8%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (97) 155
61.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 218
86.9%
Decimal Number 11
 
4.4%
Uppercase Letter 11
 
4.4%
Space Separator 7
 
2.8%
Open Punctuation 2
 
0.8%
Close Punctuation 2
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
8.3%
16
 
7.3%
16
 
7.3%
8
 
3.7%
8
 
3.7%
8
 
3.7%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.8%
Other values (84) 125
57.3%
Decimal Number
ValueCountFrequency (%)
5 4
36.4%
2 4
36.4%
6 1
 
9.1%
3 1
 
9.1%
4 1
 
9.1%
Uppercase Letter
ValueCountFrequency (%)
S 3
27.3%
G 3
27.3%
U 2
18.2%
C 2
18.2%
R 1
 
9.1%
Space Separator
ValueCountFrequency (%)
7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 218
86.9%
Common 22
 
8.8%
Latin 11
 
4.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
8.3%
16
 
7.3%
16
 
7.3%
8
 
3.7%
8
 
3.7%
8
 
3.7%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.8%
Other values (84) 125
57.3%
Common
ValueCountFrequency (%)
7
31.8%
5 4
18.2%
2 4
18.2%
( 2
 
9.1%
) 2
 
9.1%
6 1
 
4.5%
3 1
 
4.5%
4 1
 
4.5%
Latin
ValueCountFrequency (%)
S 3
27.3%
G 3
27.3%
U 2
18.2%
C 2
18.2%
R 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 218
86.9%
ASCII 33
 
13.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
18
 
8.3%
16
 
7.3%
16
 
7.3%
8
 
3.7%
8
 
3.7%
8
 
3.7%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.8%
Other values (84) 125
57.3%
ASCII
ValueCountFrequency (%)
7
21.2%
5 4
12.1%
2 4
12.1%
S 3
9.1%
G 3
9.1%
U 2
 
6.1%
C 2
 
6.1%
( 2
 
6.1%
) 2
 
6.1%
6 1
 
3.0%
Other values (3) 3
9.1%

읍면
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)21.2%
Missing0
Missing (%)0.0%
Memory size396.0 B
하동읍
16 
진교면
금남면
옥종면
화개면
Other values (2)

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)3.0%

Sample

1st row하동읍
2nd row하동읍
3rd row하동읍
4th row하동읍
5th row하동읍

Common Values

ValueCountFrequency (%)
하동읍 16
48.5%
진교면 6
 
18.2%
금남면 3
 
9.1%
옥종면 3
 
9.1%
화개면 2
 
6.1%
금성면 2
 
6.1%
고전면 1
 
3.0%

Length

2023-12-13T04:11:06.115853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:11:06.254064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
하동읍 16
48.5%
진교면 6
 
18.2%
금남면 3
 
9.1%
옥종면 3
 
9.1%
화개면 2
 
6.1%
금성면 2
 
6.1%
고전면 1
 
3.0%
Distinct32
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size396.0 B
2023-12-13T04:11:06.499239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length22
Mean length16.909091
Min length14

Characters and Unicode

Total characters558
Distinct characters64
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)93.9%

Sample

1st row하동군 하동읍 중앙로 37
2nd row하동군 하동읍 경서대로 145
3rd row하동군 하동읍 중앙로 13
4th row하동군 하동읍 경서대로 243-2
5th row하동군 하동읍 연화길 14
ValueCountFrequency (%)
하동군 33
23.6%
하동읍 16
 
11.4%
중앙로 7
 
5.0%
진교면 6
 
4.3%
경서대로 6
 
4.3%
섬진강대로 3
 
2.1%
46 3
 
2.1%
금남면 3
 
2.1%
옥종면 3
 
2.1%
금성면 2
 
1.4%
Other values (48) 58
41.4%
2023-12-13T04:11:06.908302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
108
19.4%
51
 
9.1%
49
 
8.8%
33
 
5.9%
21
 
3.8%
1 20
 
3.6%
17
 
3.0%
16
 
2.9%
2 16
 
2.9%
13
 
2.3%
Other values (54) 214
38.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 339
60.8%
Space Separator 108
 
19.4%
Decimal Number 95
 
17.0%
Dash Punctuation 9
 
1.6%
Other Punctuation 5
 
0.9%
Close Punctuation 1
 
0.2%
Open Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
15.0%
49
14.5%
33
 
9.7%
21
 
6.2%
17
 
5.0%
16
 
4.7%
13
 
3.8%
12
 
3.5%
11
 
3.2%
11
 
3.2%
Other values (39) 105
31.0%
Decimal Number
ValueCountFrequency (%)
1 20
21.1%
2 16
16.8%
3 11
11.6%
0 10
10.5%
6 10
10.5%
4 9
9.5%
8 6
 
6.3%
5 5
 
5.3%
7 4
 
4.2%
9 4
 
4.2%
Space Separator
ValueCountFrequency (%)
108
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 339
60.8%
Common 219
39.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
15.0%
49
14.5%
33
 
9.7%
21
 
6.2%
17
 
5.0%
16
 
4.7%
13
 
3.8%
12
 
3.5%
11
 
3.2%
11
 
3.2%
Other values (39) 105
31.0%
Common
ValueCountFrequency (%)
108
49.3%
1 20
 
9.1%
2 16
 
7.3%
3 11
 
5.0%
0 10
 
4.6%
6 10
 
4.6%
4 9
 
4.1%
- 9
 
4.1%
8 6
 
2.7%
5 5
 
2.3%
Other values (5) 15
 
6.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 339
60.8%
ASCII 219
39.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
108
49.3%
1 20
 
9.1%
2 16
 
7.3%
3 11
 
5.0%
0 10
 
4.6%
6 10
 
4.6%
4 9
 
4.1%
- 9
 
4.1%
8 6
 
2.7%
5 5
 
2.3%
Other values (5) 15
 
6.8%
Hangul
ValueCountFrequency (%)
51
15.0%
49
14.5%
33
 
9.7%
21
 
6.2%
17
 
5.0%
16
 
4.7%
13
 
3.8%
12
 
3.5%
11
 
3.2%
11
 
3.2%
Other values (39) 105
31.0%

전화번호
Text

MISSING 

Distinct26
Distinct (%)96.3%
Missing6
Missing (%)18.2%
Memory size396.0 B
2023-12-13T04:11:07.144391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.074074
Min length12

Characters and Unicode

Total characters326
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)92.6%

Sample

1st row055-882-8800
2nd row055-883-6467
3rd row055-882-2568
4th row055-883-1504
5th row055-883-0235
ValueCountFrequency (%)
055-882-5431 2
 
7.4%
055-884-2211 1
 
3.7%
055-882-8800 1
 
3.7%
055-882-1544 1
 
3.7%
055-883-8744 1
 
3.7%
055-884-7133 1
 
3.7%
055-882-6909 1
 
3.7%
02-6916-1500 1
 
3.7%
055-884-6652 1
 
3.7%
055-883-4111 1
 
3.7%
Other values (16) 16
59.3%
2023-12-13T04:11:07.562058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 65
19.9%
8 56
17.2%
- 54
16.6%
0 42
12.9%
3 23
 
7.1%
1 22
 
6.7%
2 20
 
6.1%
4 17
 
5.2%
6 10
 
3.1%
9 9
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 272
83.4%
Dash Punctuation 54
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 65
23.9%
8 56
20.6%
0 42
15.4%
3 23
 
8.5%
1 22
 
8.1%
2 20
 
7.4%
4 17
 
6.2%
6 10
 
3.7%
9 9
 
3.3%
7 8
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 54
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 326
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 65
19.9%
8 56
17.2%
- 54
16.6%
0 42
12.9%
3 23
 
7.1%
1 22
 
6.7%
2 20
 
6.1%
4 17
 
5.2%
6 10
 
3.1%
9 9
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 326
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 65
19.9%
8 56
17.2%
- 54
16.6%
0 42
12.9%
3 23
 
7.1%
1 22
 
6.7%
2 20
 
6.1%
4 17
 
5.2%
6 10
 
3.1%
9 9
 
2.8%

기타유의사항
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Memory size396.0 B
<NA>
28 
데이터 미집계
개인정보 포함
 
1

Length

Max length7
Median length4
Mean length4.4545455
Min length4

Unique

Unique1 ?
Unique (%)3.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 28
84.8%
데이터 미집계 4
 
12.1%
개인정보 포함 1
 
3.0%

Length

2023-12-13T04:11:07.746632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:11:07.864270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 28
73.7%
데이터 4
 
10.5%
미집계 4
 
10.5%
개인정보 1
 
2.6%
포함 1
 
2.6%

Interactions

2023-12-13T04:11:04.753607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:11:07.952497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업체명읍면소재지전화번호기타유의사항
연번1.0001.0000.8061.0000.9001.000
업체명1.0001.0001.0001.0001.0001.000
읍면0.8061.0001.0001.0001.0001.000
소재지1.0001.0001.0001.0001.0001.000
전화번호0.9001.0001.0001.0001.000NaN
기타유의사항1.0001.0001.0001.000NaN1.000
2023-12-13T04:11:08.081596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
읍면기타유의사항
읍면1.0001.000
기타유의사항1.0001.000
2023-12-13T04:11:08.190300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번읍면기타유의사항
연번1.0000.5151.000
읍면0.5151.0001.000
기타유의사항1.0001.0001.000

Missing values

2023-12-13T04:11:04.915711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:11:05.029427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명읍면소재지전화번호기타유의사항
01세븐일레븐 하동점하동읍하동군 하동읍 중앙로 37055-882-8800<NA>
12다이소 하동점하동읍하동군 하동읍 경서대로 145055-883-6467<NA>
23씨유뉴하동터미널점하동읍하동군 하동읍 중앙로 13055-882-2568<NA>
34씨유하동녹차마을점하동읍하동군 하동읍 경서대로 243-2055-883-1504<NA>
45GS25(하동연화점)하동읍하동군 하동읍 연화길 14055-883-0235<NA>
56하동엘지전자하동읍하동군 하동읍 경서대로 167055-883-9555<NA>
67동성의료기하동읍하동군 하동읍 시장1길 16-1055-883-2114<NA>
78광성메디맥스하동읍하동군 하동읍 중앙로 21<NA>데이터 미집계
89소리샘보청기하동읍하동군 하동읍 중앙로 46055-883-6003<NA>
910넘버원메디컬하동읍하동군 하동읍 시장1길 20-1055-882-5431<NA>
연번업체명읍면소재지전화번호기타유의사항
2324하동친환경금성면하동군 금성면 금성로 286055-883-4111<NA>
2425씨유하동진교점진교면하동군 진교면 진교중앙길 21, 2호055-884-6652<NA>
2526이마트24 R진교터미널점진교면하동군 진교면 진교중앙길 1502-6916-1500<NA>
2627진교척추운동센타진교면하동군 진교면 민다리길 46<NA>개인정보 포함
2728장인의료기진교면하동군 진교면 민다리길 46055-882-6909<NA>
2829서희진교면하동군 진교면 들포길 38, 103동 804호 (미진스위트빌)<NA><NA>
2930씨유 진교금오점진교면하동군 진교면 들포길 40, 미진스위트빌 상가동055-884-7133<NA>
3031옥종농협하나로마트옥종면하동군 옥종면 주포중앙길 36055-883-8744<NA>
3132도경메디케어옥종면하동군 옥종면 덕천로 22<NA>데이터 미집계
3233CU하동옥종청룡점옥종면하동군 옥종면 옥종중앙길 50-1055-882-9898<NA>