Overview

Dataset statistics

Number of variables2
Number of observations429
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)0.5%
Total size in memory6.8 KiB
Average record size in memory16.3 B

Variable types

Text1
Categorical1

Dataset

Description강남구 의료관광DB(기관정보_영어)는 영어권 관광객을 위해 강남구 의료관광에 대해 다양한 자료를 영어로 소개하는 정보를 수록하였습니다
Author서울특별시 강남구
URLhttps://www.data.go.kr/data/15072591/fileData.do

Alerts

Dataset has 2 (0.5%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 10:43:01.625153
Analysis finished2023-12-12 10:43:01.889508
Duration0.26 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct416
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size3.5 KiB
2023-12-12T19:43:02.183367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length41
Mean length23.130536
Min length3

Characters and Unicode

Total characters9923
Distinct characters109
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique410 ?
Unique (%)95.6%

Sample

1st rowTako Plastic Surgery
2nd rowBright St. Mary's Eye Center
3rd rowHyundai Aesthetics Plastic Surgery Clinic
4th rowDEESSE PLASTIC SURGERY
5th rowMD Breast Surgery Clinic
ValueCountFrequency (%)
clinic 193
 
13.4%
surgery 126
 
8.8%
plastic 125
 
8.7%
dental 54
 
3.8%
hospital 38
 
2.6%
gangnam 26
 
1.8%
seoul 25
 
1.7%
center 25
 
1.7%
dermatology 21
 
1.5%
medical 18
 
1.3%
Other values (510) 787
54.7%
2023-12-12T19:43:02.827076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1047
 
10.6%
i 797
 
8.0%
e 711
 
7.2%
n 603
 
6.1%
l 603
 
6.1%
a 575
 
5.8%
r 461
 
4.6%
t 422
 
4.3%
c 413
 
4.2%
o 346
 
3.5%
Other values (99) 3945
39.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 6571
66.2%
Uppercase Letter 2105
 
21.2%
Space Separator 1047
 
10.6%
Other Letter 78
 
0.8%
Other Punctuation 52
 
0.5%
Dash Punctuation 20
 
0.2%
Decimal Number 20
 
0.2%
Modifier Symbol 10
 
0.1%
Close Punctuation 7
 
0.1%
Open Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
11.5%
8
 
10.3%
5
 
6.4%
5
 
6.4%
5
 
6.4%
4
 
5.1%
3
 
3.8%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (26) 33
42.3%
Lowercase Letter
ValueCountFrequency (%)
i 797
12.1%
e 711
10.8%
n 603
9.2%
l 603
9.2%
a 575
8.8%
r 461
 
7.0%
t 422
 
6.4%
c 413
 
6.3%
o 346
 
5.3%
s 300
 
4.6%
Other values (16) 1340
20.4%
Uppercase Letter
ValueCountFrequency (%)
C 293
13.9%
S 273
 
13.0%
P 153
 
7.3%
A 116
 
5.5%
D 109
 
5.2%
E 107
 
5.1%
I 98
 
4.7%
H 95
 
4.5%
L 90
 
4.3%
O 88
 
4.2%
Other values (16) 683
32.4%
Decimal Number
ValueCountFrequency (%)
2 3
15.0%
1 3
15.0%
8 2
10.0%
0 2
10.0%
9 2
10.0%
6 2
10.0%
4 2
10.0%
3 2
10.0%
7 1
 
5.0%
5 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
& 19
36.5%
. 19
36.5%
, 7
 
13.5%
' 6
 
11.5%
/ 1
 
1.9%
Space Separator
ValueCountFrequency (%)
1047
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Final Punctuation
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 8676
87.4%
Common 1169
 
11.8%
Hangul 78
 
0.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 797
 
9.2%
e 711
 
8.2%
n 603
 
7.0%
l 603
 
7.0%
a 575
 
6.6%
r 461
 
5.3%
t 422
 
4.9%
c 413
 
4.8%
o 346
 
4.0%
s 300
 
3.5%
Other values (42) 3445
39.7%
Hangul
ValueCountFrequency (%)
9
 
11.5%
8
 
10.3%
5
 
6.4%
5
 
6.4%
5
 
6.4%
4
 
5.1%
3
 
3.8%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (26) 33
42.3%
Common
ValueCountFrequency (%)
1047
89.6%
- 20
 
1.7%
& 19
 
1.6%
. 19
 
1.6%
` 10
 
0.9%
) 7
 
0.6%
( 7
 
0.6%
, 7
 
0.6%
6
 
0.5%
' 6
 
0.5%
Other values (11) 21
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9839
99.2%
Hangul 78
 
0.8%
Punctuation 6
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1047
 
10.6%
i 797
 
8.1%
e 711
 
7.2%
n 603
 
6.1%
l 603
 
6.1%
a 575
 
5.8%
r 461
 
4.7%
t 422
 
4.3%
c 413
 
4.2%
o 346
 
3.5%
Other values (62) 3861
39.2%
Hangul
ValueCountFrequency (%)
9
 
11.5%
8
 
10.3%
5
 
6.4%
5
 
6.4%
5
 
6.4%
4
 
5.1%
3
 
3.8%
2
 
2.6%
2
 
2.6%
2
 
2.6%
Other values (26) 33
42.3%
Punctuation
ValueCountFrequency (%)
6
100.0%

기관분류
Categorical

Distinct15
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size3.5 KiB
성형외과
143 
치과
62 
피부미용
45 
기타
29 
한방진료
22 
Other values (10)
128 

Length

Max length13
Median length4
Mean length4.002331
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row성형외과
2nd row안과
3rd row성형외과
4th row성형외과
5th row성형외과

Common Values

ValueCountFrequency (%)
성형외과 143
33.3%
치과 62
14.5%
피부미용 45
 
10.5%
기타 29
 
6.8%
한방진료 22
 
5.1%
스파+쇼핑+유치업체+기타 22
 
5.1%
안과 21
 
4.9%
종합검진 19
 
4.4%
척추+관절치료 18
 
4.2%
호텔 16
 
3.7%
Other values (5) 32
 
7.5%

Length

2023-12-12T19:43:03.020954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
성형외과 143
33.3%
치과 62
14.5%
피부미용 45
 
10.5%
기타 29
 
6.8%
한방진료 22
 
5.1%
스파+쇼핑+유치업체+기타 22
 
5.1%
안과 21
 
4.9%
종합검진 19
 
4.4%
척추+관절치료 18
 
4.2%
호텔 16
 
3.7%
Other values (5) 32
 
7.5%

Missing values

2023-12-12T19:43:01.777325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:43:01.855304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관명기관분류
0Tako Plastic Surgery성형외과
1Bright St. Mary's Eye Center안과
2Hyundai Aesthetics Plastic Surgery Clinic성형외과
3DEESSE PLASTIC SURGERY성형외과
4MD Breast Surgery Clinic성형외과
5BANTANG PLASTIC SURGERY성형외과
6PREMIER PLASTIC SURGERY성형외과
7Opera Plastic Surgery성형외과
8Dental Clinic SOJOONG치과
9For.B Plastic Surgery성형외과
기관명기관분류
419Foreheal호텔
420Gangnam Family Hotel호텔
421Grammos Hotel호텔
422HOTEL THE DESIGNERS호텔
423TRIA Hotel호텔
424Best Western Premier Gangnam호텔
425Novotel Seoul Ambassador Gangnam호텔
426Ritz-Carlton Seoul호텔
427JBIS Hotel호텔
428Oakwood Premier Coex Center호텔

Duplicate rows

Most frequently occurring

기관명기관분류# duplicates
0Glovi Plastic Surgery성형외과2
1Su Dental Hospital치과2