Overview

Dataset statistics

Number of variables: 5
Number of observations: 98
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.0 KiB
Average record size in memory: 41.3 B

Variable types

Text: 4
Categorical: 1

Dataset

Description: facility name (시설명), address (소재지), phone number (전화번호), remarks (비고), foreign languages (외국어)
Author: Gangdong-gu (강동구)
URL: https://data.seoul.go.kr/dataList/OA-12646/S/1/datasetView.do

Alerts

외국어 is highly imbalanced (53.2%) [Imbalance]
시설명 has unique values [Unique]
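
The 53.2% imbalance figure can be reproduced from the category counts of 외국어 listed in the Common Values table. The sketch below assumes the score is one minus the normalized Shannon entropy of the class distribution. That assumption matches the reported number, but the function name `imbalance_score` is illustrative and not taken from the report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k): 0 for a uniform distribution, approaching 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Shannon entropy of the observed class frequencies (natural log)
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    # Dividing by log(k) normalizes the entropy to the range [0, 1]
    return 1.0 - entropy / math.log(k)


# Counts of the nine 외국어 categories as listed in this report (98 rows in total)
counts = [72, 12, 3, 3, 2, 2, 2, 1, 1]
print(f"{imbalance_score(counts):.1%}")
```

A single category holding 72 of 98 rows pulls the entropy well below its maximum of log(9), which is what triggers the alert.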

Reproduction

Analysis started: 2024-05-04 05:36:17.736844
Analysis finished: 2024-05-04 05:36:19.110551
Duration: 1.37 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시설명
Text

UNIQUE 

Distinct: 98
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 916.0 B

Length

Max length: 12
Median length: 10
Mean length: 7.377551
Min length: 3
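
As a minimal sketch of how such length statistics are derived, the snippet below measures the five sample values of 시설명 that this report lists, counting one unit per Unicode code point (each precomposed Hangul syllable counts as one). The helper name `length_summary` is hypothetical, and the results describe only these five values, not the full column.

```python
from statistics import mean, median

# The five sample rows of 시설명 shown in this report
names = ["중앙보훈병원", "허리나은병원", "강동성심병원", "강동예치과의원", "최강피부과의원"]


def length_summary(values):
    """Summarize string lengths, measured in Unicode code points."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


summary = length_summary(names)
print(summary)
```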

Characters and Unicode

Total characters: 723
Distinct characters: 165
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
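
To illustrate, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories for one address value from this dataset (the 2nd sample row of the address column). The mapping from two-letter codes to the long names used in this report is written out by hand, and `category_counts` is a hypothetical helper.

```python
import unicodedata
from collections import Counter

# Long names as they appear in this report, keyed by Unicode general category code
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
}


def category_counts(text):
    """Count the Unicode general category of every character in text."""
    return Counter(unicodedata.category(ch) for ch in text)


address = "서울 강동구 성내2동 62-4"
counts = category_counts(address)
for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

Hangul syllables fall under Other Letter (`Lo`), which is why a column containing only Korean names reports a single distinct category.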

Unique

Unique: 98
Unique (%): 100.0%

Sample

1st row: 중앙보훈병원
2nd row: 허리나은병원
3rd row: 강동성심병원
4th row: 강동예치과의원
5th row: 최강피부과의원

Words

Value | Count | Frequency (%)
중앙보훈병원 | 1 | 1.0%
연세소아청소년과의원 | 1 | 1.0%
강동연세정형외과의원 | 1 | 1.0%
우리마취통증의학과의원 | 1 | 1.0%
본의원 | 1 | 1.0%
명성정형외과의원 | 1 | 1.0%
이엘씨김정우안과의원 | 1 | 1.0%
씨앤피차앤박피부과의원 | 1 | 1.0%
에스듈의원 | 1 | 1.0%
류마내과의원 | 1 | 1.0%
Other values (88) | 88 | 89.8%

Most occurring characters

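Value | Count | Frequency (%)
[glyph lost] | 100 | 13.8%
[glyph lost] | 89 | 12.3%
[glyph lost] | 65 | 9.0%
[glyph lost] | 24 | 3.3%
[glyph lost] | 15 | 2.1%
[glyph lost] | 15 | 2.1%
[glyph lost] | 15 | 2.1%
[glyph lost] | 15 | 2.1%
[glyph lost] | 14 | 1.9%
[glyph lost] | 13 | 1.8%
Other values (155) | 358 | 49.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 723 | 100.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[glyph lost] | 100 | 13.8%
[glyph lost] | 89 | 12.3%
[glyph lost] | 65 | 9.0%
[glyph lost] | 24 | 3.3%
[glyph lost] | 15 | 2.1%
[glyph lost] | 15 | 2.1%
[glyph lost] | 15 | 2.1%
[glyph lost] | 15 | 2.1%
[glyph lost] | 14 | 1.9%
[glyph lost] | 13 | 1.8%
Other values (155) | 358 | 49.5%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 723 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[glyph lost] | 100 | 13.8%
[glyph lost] | 89 | 12.3%
[glyph lost] | 65 | 9.0%
[glyph lost] | 24 | 3.3%
[glyph lost] | 15 | 2.1%
[glyph lost] | 15 | 2.1%
[glyph lost] | 15 | 2.1%
[glyph lost] | 15 | 2.1%
[glyph lost] | 14 | 1.9%
[glyph lost] | 13 | 1.8%
Other values (155) | 358 | 49.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 723 | 100.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
[glyph lost] | 100 | 13.8%
[glyph lost] | 89 | 12.3%
[glyph lost] | 65 | 9.0%
[glyph lost] | 24 | 3.3%
[glyph lost] | 15 | 2.1%
[glyph lost] | 15 | 2.1%
[glyph lost] | 15 | 2.1%
[glyph lost] | 15 | 2.1%
[glyph lost] | 14 | 1.9%
[glyph lost] | 13 | 1.8%
Other values (155) | 358 | 49.5%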

소재지
Text

Distinct: 95
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Memory size: 916.0 B

Length

Max length: 32
Median length: 27
Mean length: 16.683673
Min length: 8

Characters and Unicode

Total characters: 1635
Distinct characters: 79
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 93
Unique (%): 94.9%

Sample

1st row: 서울 강동구 진황도로61길 53 (둔촌동)
2nd row: 서울 강동구 성내2동 62-4
3rd row: 서울 강동구 성안로 150 (길동)
4th row: 서울 강동구 명일2동 48 주양쇼핑 4층
5th row: 서울 강동구 동남로71길 32 (명일동)

Words

Value | Count | Frequency (%)
천호동 | 27 | 7.5%
강동구 | 26 | 7.3%
서울 | 25 | 7.0%
천호대로 | 19 | 5.3%
성내동 | 19 | 5.3%
명일동 | 13 | 3.6%
양재대로 | 11 | 3.1%
올림픽로 | 9 | 2.5%
길동 | 9 | 2.5%
둔촌로 | 7 | 2.0%
Other values (145) | 193 | 53.9%

Most occurring characters

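Value | Count | Frequency (%)
(space) | 261 | 16.0%
[glyph lost] | 132 | 8.1%
) | 91 | 5.6%
( | 91 | 5.6%
[glyph lost] | 80 | 4.9%
1 | 74 | 4.5%
2 | 58 | 3.5%
[glyph lost] | 57 | 3.5%
[glyph lost] | 56 | 3.4%
3 | 42 | 2.6%
Other values (69) | 693 | 42.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 778 | 47.6%
Decimal Number | 391 | 23.9%
Space Separator | 261 | 16.0%
Close Punctuation | 91 | 5.6%
Open Punctuation | 91 | 5.6%
Dash Punctuation | 13 | 0.8%
Other Punctuation | 10 | 0.6%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[glyph lost] | 132 | 17.0%
[glyph lost] | 80 | 10.3%
[glyph lost] | 57 | 7.3%
[glyph lost] | 56 | 7.2%
[glyph lost] | 37 | 4.8%
[glyph lost] | 32 | 4.1%
[glyph lost] | 31 | 4.0%
[glyph lost] | 29 | 3.7%
[glyph lost] | 27 | 3.5%
[glyph lost] | 26 | 3.3%
Other values (54) | 271 | 34.8%

Decimal Number
Value | Count | Frequency (%)
1 | 74 | 18.9%
2 | 58 | 14.8%
3 | 42 | 10.7%
4 | 41 | 10.5%
7 | 35 | 9.0%
9 | 33 | 8.4%
0 | 32 | 8.2%
6 | 30 | 7.7%
5 | 29 | 7.4%
8 | 17 | 4.3%

Space Separator
Value | Count | Frequency (%)
(space) | 261 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 91 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 91 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 13 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 10 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 857 | 52.4%
Hangul | 778 | 47.6%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[glyph lost] | 132 | 17.0%
[glyph lost] | 80 | 10.3%
[glyph lost] | 57 | 7.3%
[glyph lost] | 56 | 7.2%
[glyph lost] | 37 | 4.8%
[glyph lost] | 32 | 4.1%
[glyph lost] | 31 | 4.0%
[glyph lost] | 29 | 3.7%
[glyph lost] | 27 | 3.5%
[glyph lost] | 26 | 3.3%
Other values (54) | 271 | 34.8%

Common
Value | Count | Frequency (%)
(space) | 261 | 30.5%
) | 91 | 10.6%
( | 91 | 10.6%
1 | 74 | 8.6%
2 | 58 | 6.8%
3 | 42 | 4.9%
4 | 41 | 4.8%
7 | 35 | 4.1%
9 | 33 | 3.9%
0 | 32 | 3.7%
Other values (5) | 99 | 11.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 857 | 52.4%
Hangul | 778 | 47.6%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 261 | 30.5%
) | 91 | 10.6%
( | 91 | 10.6%
1 | 74 | 8.6%
2 | 58 | 6.8%
3 | 42 | 4.9%
4 | 41 | 4.8%
7 | 35 | 4.1%
9 | 33 | 3.9%
0 | 32 | 3.7%
Other values (5) | 99 | 11.6%

Hangul
Value | Count | Frequency (%)
[glyph lost] | 132 | 17.0%
[glyph lost] | 80 | 10.3%
[glyph lost] | 57 | 7.3%
[glyph lost] | 56 | 7.2%
[glyph lost] | 37 | 4.8%
[glyph lost] | 32 | 4.1%
[glyph lost] | 31 | 4.0%
[glyph lost] | 29 | 3.7%
[glyph lost] | 27 | 3.5%
[glyph lost] | 26 | 3.3%
Other values (54) | 271 | 34.8%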

전화번호
Text

Distinct: 96
Distinct (%): 98.0%
Missing: 0
Missing (%): 0.0%
Memory size: 916.0 B

Length

Max length: 12
Median length: 11
Mean length: 11.05102
Min length: 9

Characters and Unicode

Total characters: 1083
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 95
Unique (%): 96.9%
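
The report separates distinct values (how many different values occur) from unique values (how many occur exactly once). A minimal sketch of that distinction, using the phone numbers of the first ten rows shown in the Sample section of this report, follows. The helper name `distinct_and_unique` is hypothetical.

```python
from collections import Counter

# Phone numbers of the first ten rows in this report's Sample section
phones = [
    "02-2225-1111", "02-472-0114", "02-2224-2114", "02-428-2277", "02-429-3300",
    "1577-5800", "1577-5800", "1577-5800", "02-480-5085", "02-475-3657",
]


def distinct_and_unique(values):
    """Return (number of different values, number of values occurring exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


print(distinct_and_unique(phones))
```

The shared number 1577-5800 counts once toward distinct and not at all toward unique, which is the same pattern that produces 96 distinct but 95 unique values across all 98 rows.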

Sample

1st row: 02-2225-1111
2nd row: 02-472-0114
3rd row: 02-2224-2114
4th row: 02-428-2277
5th row: 02-429-3300

Words

Value | Count | Frequency (%)
1577-5800 | 3 | 3.1%
02-2225-1111 | 1 | 1.0%
02-474-5450 | 1 | 1.0%
02-429-8275 | 1 | 1.0%
02-479-1175 | 1 | 1.0%
02-474-8555 | 1 | 1.0%
02-426-8575 | 1 | 1.0%
02-427-7533 | 1 | 1.0%
02-428-8275 | 1 | 1.0%
02-2202-8888 | 1 | 1.0%
Other values (86) | 86 | 87.8%

Most occurring characters

Value | Count | Frequency (%)
- | 193 | 17.8%
2 | 176 | 16.3%
0 | 156 | 14.4%
4 | 131 | 12.1%
7 | 108 | 10.0%
8 | 82 | 7.6%
5 | 76 | 7.0%
1 | 59 | 5.4%
3 | 42 | 3.9%
9 | 32 | 3.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 890 | 82.2%
Dash Punctuation | 193 | 17.8%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 176 | 19.8%
0 | 156 | 17.5%
4 | 131 | 14.7%
7 | 108 | 12.1%
8 | 82 | 9.2%
5 | 76 | 8.5%
1 | 59 | 6.6%
3 | 42 | 4.7%
9 | 32 | 3.6%
6 | 28 | 3.1%

Dash Punctuation
Value | Count | Frequency (%)
- | 193 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1083 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 193 | 17.8%
2 | 176 | 16.3%
0 | 156 | 14.4%
4 | 131 | 12.1%
7 | 108 | 10.0%
8 | 82 | 7.6%
5 | 76 | 7.0%
1 | 59 | 5.4%
3 | 42 | 3.9%
9 | 32 | 3.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1083 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 193 | 17.8%
2 | 176 | 16.3%
0 | 156 | 14.4%
4 | 131 | 12.1%
7 | 108 | 10.0%
8 | 82 | 7.6%
5 | 76 | 7.0%
1 | 59 | 5.4%
3 | 42 | 3.9%
9 | 32 | 3.0%

비고
Text

Distinct: 62
Distinct (%): 63.3%
Missing: 0
Missing (%): 0.0%
Memory size: 916.0 B

Length

Max length: 122
Median length: 53
Mean length: 17.27551
Min length: 2

Characters and Unicode

Total characters: 1693
Distinct characters: 75
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 55
Unique (%): 56.1%

Sample

1st row: 내과,소아청소년과,신경과,외과,흉부외과,정형외과,신경외과,산부인과,안과,이비인후과,피부과,비뇨기과,치과,재활의학과,마취통증의학과,영상의학과,핵의학과,진단검사의학과,병리과,성형외과,정신건강의학과,가정의학과,응급의학과
2nd row: 신경외과,내과,정형외과,영상의학과,마취통증의학과
3rd row: 내과,외과,소아청소년과,산부인과,정형외과,신경외과,흉부외과,성형외과,신경과,정신건강의학과,안과,이비인후과,마취통증의학과,비뇨기과,피부과,영상의학과,임상병리과,해부병리과,치과,재활의학과,가정의학과,방사선종양학과,응급의학과
4th row: 치과
5th row: 피부과

Words

Value | Count | Frequency (%)
치과 | 23 | 22.8%
한방각과 | 8 | 7.9%
안과 | 4 | 4.0%
내과,소아청소년과,이비인후과 | 4 | 4.0%
피부과 | 4 | 4.0%
이비인후과 | 3 | 3.0%
산부인과,소아청소년과 | 2 | 2.0%
성형외과,피부과,비뇨기과 | 1 | 1.0%
내과,신경과,소아청소년과,이비인후과 | 1 | 1.0%
정형외과,재활의학과,영상의학과,비뇨기과 | 1 | 1.0%
Other values (50) | 50 | 49.5%

Most occurring characters

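Value | Count | Frequency (%)
[glyph lost] | 355 | 21.0%
, | 253 | 14.9%
[glyph lost] | 67 | 4.0%
[glyph lost] | 66 | 3.9%
[glyph lost] | 64 | 3.8%
[glyph lost] | 64 | 3.8%
[glyph lost] | 43 | 2.5%
[glyph lost] | 43 | 2.5%
[glyph lost] | 43 | 2.5%
[glyph lost] | 42 | 2.5%
Other values (65) | 653 | 38.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1408 | 83.2%
Other Punctuation | 254 | 15.0%
Space Separator | 29 | 1.7%
Close Punctuation | 1 | 0.1%
Open Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[glyph lost] | 355 | 25.2%
[glyph lost] | 67 | 4.8%
[glyph lost] | 66 | 4.7%
[glyph lost] | 64 | 4.5%
[glyph lost] | 64 | 4.5%
[glyph lost] | 43 | 3.1%
[glyph lost] | 43 | 3.1%
[glyph lost] | 43 | 3.1%
[glyph lost] | 42 | 3.0%
[glyph lost] | 34 | 2.4%
Other values (60) | 587 | 41.7%

Other Punctuation
Value | Count | Frequency (%)
, | 253 | 99.6%
. | 1 | 0.4%

Space Separator
Value | Count | Frequency (%)
(space) | 29 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1408 | 83.2%
Common | 285 | 16.8%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[glyph lost] | 355 | 25.2%
[glyph lost] | 67 | 4.8%
[glyph lost] | 66 | 4.7%
[glyph lost] | 64 | 4.5%
[glyph lost] | 64 | 4.5%
[glyph lost] | 43 | 3.1%
[glyph lost] | 43 | 3.1%
[glyph lost] | 43 | 3.1%
[glyph lost] | 42 | 3.0%
[glyph lost] | 34 | 2.4%
Other values (60) | 587 | 41.7%

Common
Value | Count | Frequency (%)
, | 253 | 88.8%
(space) | 29 | 10.2%
) | 1 | 0.4%
( | 1 | 0.4%
. | 1 | 0.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1408 | 83.2%
ASCII | 285 | 16.8%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
[glyph lost] | 355 | 25.2%
[glyph lost] | 67 | 4.8%
[glyph lost] | 66 | 4.7%
[glyph lost] | 64 | 4.5%
[glyph lost] | 64 | 4.5%
[glyph lost] | 43 | 3.1%
[glyph lost] | 43 | 3.1%
[glyph lost] | 43 | 3.1%
[glyph lost] | 42 | 3.0%
[glyph lost] | 34 | 2.4%
Other values (60) | 587 | 41.7%

ASCII
Value | Count | Frequency (%)
, | 253 | 88.8%
(space) | 29 | 10.2%
) | 1 | 0.4%
( | 1 | 0.4%
. | 1 | 0.4%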

외국어
Categorical

IMBALANCE 

Distinct: 9
Distinct (%): 9.2%
Missing: 0
Missing (%): 0.0%
Memory size: 916.0 B

Value | Count
영어 | 72
영어,일어 | 12
영어, 러시아어 | 3
영어,일어,중국어 | 3
중국어 | 2
Other values (4) | 6

Length

Max length: 10
Median length: 2
Mean length: 3
Min length: 2

Unique

Unique: 2
Unique (%): 2.0%

Sample

1st row: 영어
2nd row: 영어,일어
3rd row: 영어
4th row: 영어
5th row: 영어

Common Values

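Value | Count | Frequency (%)
영어 (English) | 72 | 73.5%
영어,일어 (English, Japanese) | 12 | 12.2%
영어, 러시아어 (English, Russian; written with a space after the comma) | 3 | 3.1%
영어,일어,중국어 (English, Japanese, Chinese) | 3 | 3.1%
중국어 (Chinese) | 2 | 2.0%
영어,중국어 (English, Chinese) | 2 | 2.0%
일어 (Japanese) | 2 | 2.0%
영어,러시아어 (English, Russian; written without a space) | 1 | 1.0%
영어,우즈베키스탄어 (English, Uzbek) | 1 | 1.0%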

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words

Value | Count | Frequency (%)
영어 | 75 | 74.3%
영어,일어 | 12 | 11.9%
러시아어 | 3 | 3.0%
영어,일어,중국어 | 3 | 3.0%
중국어 | 2 | 2.0%
영어,중국어 | 2 | 2.0%
일어 | 2 | 2.0%
영어,러시아어 | 1 | 1.0%
영어,우즈베키스탄어 | 1 | 1.0%
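
The second frequency table for 외국어 (영어 at 75, total 101) differs from the Common Values table (영어 at 72, total 98). The sketch below reproduces those counts under the assumption that each value is split on whitespace and each token is stripped of surrounding punctuation. That tokenization is inferred from the numbers, not confirmed by the report, and `word_counts` is a hypothetical helper.

```python
import string
from collections import Counter

# Category counts of 외국어 as listed in the Common Values table
value_counts = {
    "영어": 72,
    "영어,일어": 12,
    "영어, 러시아어": 3,  # note the space after the comma
    "영어,일어,중국어": 3,
    "중국어": 2,
    "영어,중국어": 2,
    "일어": 2,
    "영어,러시아어": 1,
    "영어,우즈베키스탄어": 1,
}


def word_counts(value_counts):
    """Split each value on whitespace, strip punctuation, and sum counts per token."""
    words = Counter()
    for value, count in value_counts.items():
        for token in value.split():
            word = token.strip(string.punctuation)
            if word:
                words[word] += count
    return words


words = word_counts(value_counts)
total = sum(words.values())
print(words["영어"], words["러시아어"], total)
```

Only "영어, 러시아어" contains whitespace, so its three rows contribute one 영어 token and one 러시아어 token each, while comma-joined values without spaces stay as single tokens.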

Correlations

Variable | 시설명 | 소재지 | 전화번호 | 비고 | 외국어
시설명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소재지 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000
전화번호 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000
비고 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000
외국어 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

# | 시설명 | 소재지 | 전화번호 | 비고 | 외국어
0 | 중앙보훈병원 | 서울 강동구 진황도로61길 53 (둔촌동) | 02-2225-1111 | 내과,소아청소년과,신경과,외과,흉부외과,정형외과,신경외과,산부인과,안과,이비인후과,피부과,비뇨기과,치과,재활의학과,마취통증의학과,영상의학과,핵의학과,진단검사의학과,병리과,성형외과,정신건강의학과,가정의학과,응급의학과 | 영어
1 | 허리나은병원 | 서울 강동구 성내2동 62-4 | 02-472-0114 | 신경외과,내과,정형외과,영상의학과,마취통증의학과 | 영어,일어
2 | 강동성심병원 | 서울 강동구 성안로 150 (길동) | 02-2224-2114 | 내과,외과,소아청소년과,산부인과,정형외과,신경외과,흉부외과,성형외과,신경과,정신건강의학과,안과,이비인후과,마취통증의학과,비뇨기과,피부과,영상의학과,임상병리과,해부병리과,치과,재활의학과,가정의학과,방사선종양학과,응급의학과 | 영어
3 | 강동예치과의원 | 서울 강동구 명일2동 48 주양쇼핑 4층 | 02-428-2277 | 치과 | 영어
4 | 최강피부과의원 | 서울 강동구 동남로71길 32 (명일동) | 02-429-3300 | 피부과 | 영어
5 | 강동경희대학교의대병원 | 서울 강동구 동남로 892 (상일동) | 1577-5800 | 내과,외과,산부인과,소아청소년과,신경외과,정형외과,흉부외과,비뇨기과,안과,피부과,신경과,재활의학과,정신건강의학과,치과,응급의학과,이비인후과,영상의학과,마취통증의학과,진단검사의학과,병리과,핵의학과 | 영어, 러시아어
6 | 강동경희대학교한방병원 | 서울 강동구 동남로 892 (상일동) | 1577-5800 | 한방내과,한방소아과,한방안이비인후피부과,한방신경정신과,한방부인과,한방재활의학과,사상체질과,침구과 | 영어, 러시아어
7 | 강동경희대학교치과병원 | 서울 강동구 동남로 892 (상일동) | 1577-5800 | 구강악안면외과, 치과보철과,치과교정과.소아치과,치주과,치과보존과 | 영어, 러시아어
8 | 공안과병원 | 서울 강동구 성내2동 139 | 02-480-5085 | 안과,내과,소아과,가정의학과 | 영어
9 | 삼육오엠씨의원 | 서울 강동구 성내2동 9-14 탑메디칼센터 402호 | 02-475-3657 | 가정의학과 | 영어

Last rows

# | 시설명 | 소재지 | 전화번호 | 비고 | 외국어
88 | 강동뉴욕치과의원 | 둔촌로 329 (천호동) | 02-474-0028 | 치과 | 영어,중국어
89 | 연세미치과의원 | 천호대로 1048 (둔촌동) | 02-484-8787 | 치과 | 영어
90 | 엘림치과의원 | 선사로 25 (천호동) | 02-474-7522 | 치과 | 영어
91 | 고우넷하모니치과의원 | 천호대로 842 (천호동) | 02-484-1122 | 치과 | 영어
92 | 강동그린치과의원 | 둔촌로 287 (천호동) | 02-2217-7738 | 치과 | 영어
93 | 샬롬치과의원 | 강동대로53길 94 (성내동) | 02-2679-9596 | 치과 | 영어,우즈베키스탄어
94 | 올치과의원 | 동남로71길 38 (명일동) | 02-441-2879 | 치과 | 영어
95 | 미켈란치과의원 | 올림픽로 659 (천호동) | 02-1566-7426 | 치과 | 영어
96 | 자이연세치과의원 | 천호대로 1077 (길동) | 02-482-2875 | 치과 | 일어
97 | 코랄치과의원 | 서울특별시 강동구 강동대로 177 (성내동, 현대코랄) | 02-484-2879 | 치과 | 영어