Overview

Dataset statistics

Number of variables6
Number of observations326
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory15.7 KiB
Average record size in memory49.4 B

Variable types

Numeric1
Categorical1
Text4

Dataset

Description서울특별시 용산구 의료기관 현황(종별(종합병원, 병원, 의원), 의료기관 도로명주소, 전화번호, 진료과목)에 대한 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/3077890/fileData.do

Alerts

연번 is highly overall correlated with 종별High correlation
종별 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
명칭 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:26:53.859279
Analysis finished2023-12-12 01:26:54.595821
Duration0.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct326
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean163.5
Minimum1
Maximum326
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.0 KiB
2023-12-12T10:26:54.681560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile17.25
Q182.25
median163.5
Q3244.75
95-th percentile309.75
Maximum326
Range325
Interquartile range (IQR)162.5

Descriptive statistics

Standard deviation94.252321
Coefficient of variation (CV)0.57646679
Kurtosis-1.2
Mean163.5
Median Absolute Deviation (MAD)81.5
Skewness0
Sum53301
Variance8883.5
MonotonicityStrictly increasing
2023-12-12T10:26:54.862778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
206 1
 
0.3%
224 1
 
0.3%
223 1
 
0.3%
222 1
 
0.3%
221 1
 
0.3%
220 1
 
0.3%
219 1
 
0.3%
218 1
 
0.3%
217 1
 
0.3%
Other values (316) 316
96.9%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
326 1
0.3%
325 1
0.3%
324 1
0.3%
323 1
0.3%
322 1
0.3%
321 1
0.3%
320 1
0.3%
319 1
0.3%
318 1
0.3%
317 1
0.3%

종별
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
의원
159 
치과의원
102 
한의원
62 
병원
 
2
종합병원
 
1

Length

Max length4
Median length3
Mean length2.8220859
Min length2

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row종합병원
2nd row병원
3rd row병원
4th row의원
5th row의원

Common Values

ValueCountFrequency (%)
의원 159
48.8%
치과의원 102
31.3%
한의원 62
 
19.0%
병원 2
 
0.6%
종합병원 1
 
0.3%

Length

2023-12-12T10:26:55.052776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:26:55.222390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
의원 159
48.8%
치과의원 102
31.3%
한의원 62
 
19.0%
병원 2
 
0.6%
종합병원 1
 
0.3%

명칭
Text

UNIQUE 

Distinct326
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-12T10:26:55.495364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length7.2791411
Min length3

Characters and Unicode

Total characters2373
Distinct characters277
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique326 ?
Unique (%)100.0%

Sample

1st row순천향대학교 부속 서울병원
2nd row금강아산병원
3rd row소화병원
4th row수연제장의원
5th row고유피부과의원
ValueCountFrequency (%)
치과의원 2
 
0.6%
순천향대학교 1
 
0.3%
연세밝은미소치과의원 1
 
0.3%
나비드치과의원 1
 
0.3%
히카루치과의원 1
 
0.3%
이엔아이치과의원 1
 
0.3%
서울휴치과의원 1
 
0.3%
아인치과의원 1
 
0.3%
효창치과의원 1
 
0.3%
샘치과의원 1
 
0.3%
Other values (322) 322
96.7%
2023-12-12T10:26:55.987465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
344
 
14.5%
344
 
14.5%
207
 
8.7%
103
 
4.3%
78
 
3.3%
45
 
1.9%
32
 
1.3%
32
 
1.3%
31
 
1.3%
29
 
1.2%
Other values (267) 1128
47.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2355
99.2%
Space Separator 7
 
0.3%
Decimal Number 6
 
0.3%
Uppercase Letter 2
 
0.1%
Other Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
344
 
14.6%
344
 
14.6%
207
 
8.8%
103
 
4.4%
78
 
3.3%
45
 
1.9%
32
 
1.4%
32
 
1.4%
31
 
1.3%
29
 
1.2%
Other values (258) 1110
47.1%
Decimal Number
ValueCountFrequency (%)
3 2
33.3%
6 2
33.3%
5 2
33.3%
Uppercase Letter
ValueCountFrequency (%)
E 1
50.0%
W 1
50.0%
Space Separator
ValueCountFrequency (%)
7
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2355
99.2%
Common 16
 
0.7%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
344
 
14.6%
344
 
14.6%
207
 
8.8%
103
 
4.4%
78
 
3.3%
45
 
1.9%
32
 
1.4%
32
 
1.4%
31
 
1.3%
29
 
1.2%
Other values (258) 1110
47.1%
Common
ValueCountFrequency (%)
7
43.8%
3 2
 
12.5%
6 2
 
12.5%
5 2
 
12.5%
& 1
 
6.2%
) 1
 
6.2%
( 1
 
6.2%
Latin
ValueCountFrequency (%)
E 1
50.0%
W 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2355
99.2%
ASCII 18
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
344
 
14.6%
344
 
14.6%
207
 
8.8%
103
 
4.4%
78
 
3.3%
45
 
1.9%
32
 
1.4%
32
 
1.4%
31
 
1.3%
29
 
1.2%
Other values (258) 1110
47.1%
ASCII
ValueCountFrequency (%)
7
38.9%
3 2
 
11.1%
6 2
 
11.1%
5 2
 
11.1%
& 1
 
5.6%
) 1
 
5.6%
E 1
 
5.6%
W 1
 
5.6%
( 1
 
5.6%
Distinct312
Distinct (%)95.7%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-12T10:26:56.377859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length49.5
Mean length32.076687
Min length17

Characters and Unicode

Total characters10457
Distinct characters215
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique299 ?
Unique (%)91.7%

Sample

1st row서울특별시 용산구 대사관로 59
2nd row서울특별시 용산구 이촌로 318
3rd row서울특별시 용산구 청파로 383
4th row서울특별시 용산구 유엔빌리지길 72, 1층 (한남동)
5th row서울특별시 용산구 독서당로 124, 3층 (한남동)
ValueCountFrequency (%)
서울특별시 326
 
15.9%
용산구 326
 
15.9%
한강대로 65
 
3.2%
2층 50
 
2.4%
한강로2가 41
 
2.0%
이촌동 40
 
2.0%
이촌로 36
 
1.8%
3층 32
 
1.6%
한남동 31
 
1.5%
갈월동 28
 
1.4%
Other values (449) 1071
52.3%
2023-12-12T10:26:56.972598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1720
 
16.4%
414
 
4.0%
392
 
3.7%
389
 
3.7%
2 368
 
3.5%
360
 
3.4%
340
 
3.3%
329
 
3.1%
326
 
3.1%
326
 
3.1%
Other values (205) 5493
52.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6107
58.4%
Space Separator 1720
 
16.4%
Decimal Number 1578
 
15.1%
Open Punctuation 325
 
3.1%
Close Punctuation 325
 
3.1%
Other Punctuation 312
 
3.0%
Dash Punctuation 36
 
0.3%
Math Symbol 32
 
0.3%
Uppercase Letter 22
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
414
 
6.8%
392
 
6.4%
389
 
6.4%
360
 
5.9%
340
 
5.6%
329
 
5.4%
326
 
5.3%
326
 
5.3%
326
 
5.3%
259
 
4.2%
Other values (182) 2646
43.3%
Decimal Number
ValueCountFrequency (%)
2 368
23.3%
1 269
17.0%
3 201
12.7%
5 134
 
8.5%
0 130
 
8.2%
4 127
 
8.0%
6 100
 
6.3%
7 97
 
6.1%
9 87
 
5.5%
8 65
 
4.1%
Uppercase Letter
ValueCountFrequency (%)
A 7
31.8%
L 4
18.2%
B 4
18.2%
G 3
13.6%
D 2
 
9.1%
R 1
 
4.5%
S 1
 
4.5%
Space Separator
ValueCountFrequency (%)
1720
100.0%
Open Punctuation
ValueCountFrequency (%)
( 325
100.0%
Close Punctuation
ValueCountFrequency (%)
) 325
100.0%
Other Punctuation
ValueCountFrequency (%)
, 312
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 36
100.0%
Math Symbol
ValueCountFrequency (%)
~ 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6107
58.4%
Common 4328
41.4%
Latin 22
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
414
 
6.8%
392
 
6.4%
389
 
6.4%
360
 
5.9%
340
 
5.6%
329
 
5.4%
326
 
5.3%
326
 
5.3%
326
 
5.3%
259
 
4.2%
Other values (182) 2646
43.3%
Common
ValueCountFrequency (%)
1720
39.7%
2 368
 
8.5%
( 325
 
7.5%
) 325
 
7.5%
, 312
 
7.2%
1 269
 
6.2%
3 201
 
4.6%
5 134
 
3.1%
0 130
 
3.0%
4 127
 
2.9%
Other values (6) 417
 
9.6%
Latin
ValueCountFrequency (%)
A 7
31.8%
L 4
18.2%
B 4
18.2%
G 3
13.6%
D 2
 
9.1%
R 1
 
4.5%
S 1
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6107
58.4%
ASCII 4350
41.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1720
39.5%
2 368
 
8.5%
( 325
 
7.5%
) 325
 
7.5%
, 312
 
7.2%
1 269
 
6.2%
3 201
 
4.6%
5 134
 
3.1%
0 130
 
3.0%
4 127
 
2.9%
Other values (13) 439
 
10.1%
Hangul
ValueCountFrequency (%)
414
 
6.8%
392
 
6.4%
389
 
6.4%
360
 
5.9%
340
 
5.6%
329
 
5.4%
326
 
5.3%
326
 
5.3%
326
 
5.3%
259
 
4.2%
Other values (182) 2646
43.3%
Distinct323
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-12T10:26:57.318184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length11.168712
Min length11

Characters and Unicode

Total characters3641
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique320 ?
Unique (%)98.2%

Sample

1st row02-709-9114
2nd row02-799-5000
3rd row02-705-9000
4th row02-3438-0510
5th row02-749-5005
ValueCountFrequency (%)
02-794-4438 2
 
0.6%
02-1644-5275 2
 
0.6%
02-000-0000 2
 
0.6%
02-757-0675 1
 
0.3%
02-793-0455 1
 
0.3%
02-712-2804 1
 
0.3%
02-790-7119 1
 
0.3%
02-796-2822 1
 
0.3%
02-749-1150 1
 
0.3%
02-754-2855 1
 
0.3%
Other values (313) 313
96.0%
2023-12-12T10:26:58.139865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 652
17.9%
0 594
16.3%
2 579
15.9%
7 510
14.0%
9 264
7.3%
5 255
 
7.0%
1 196
 
5.4%
8 174
 
4.8%
3 157
 
4.3%
4 142
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2989
82.1%
Dash Punctuation 652
 
17.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 594
19.9%
2 579
19.4%
7 510
17.1%
9 264
8.8%
5 255
8.5%
1 196
 
6.6%
8 174
 
5.8%
3 157
 
5.3%
4 142
 
4.8%
6 118
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 652
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3641
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 652
17.9%
0 594
16.3%
2 579
15.9%
7 510
14.0%
9 264
7.3%
5 255
 
7.0%
1 196
 
5.4%
8 174
 
4.8%
3 157
 
4.3%
4 142
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3641
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 652
17.9%
0 594
16.3%
2 579
15.9%
7 510
14.0%
9 264
7.3%
5 255
 
7.0%
1 196
 
5.4%
8 174
 
4.8%
3 157
 
4.3%
4 142
 
3.9%
Distinct111
Distinct (%)34.0%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-12T10:26:58.399350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length147
Median length4
Mean length12.233129
Min length2

Characters and Unicode

Total characters3988
Distinct characters63
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)27.3%

Sample

1st row내과, 외과, 정형외과, 신경과, 산부인과, 소아과,이비인후과, 흉부외과, 정신과, 성형외과, 신경외과, 안과, 피부과, 치과, 마취통증의학과, 비뇨기과, 산업의학과, 영상의학과, 방사선종양학과, 진단검사 의학과, 병리과, 재활의학과, 가정의학과, 응급의학과
2nd row내과,정형외과,산부인과,영상의학과,가정의학과 진단검사의학과,마취통증의학과,재활의학과,외과
3rd row소아청소년과, 영상의학과, 진단검사의학과
4th row피부과 외과 신경과 내과
5th row피부과
ValueCountFrequency (%)
치과각과 102
12.1%
내과 96
11.4%
피부과 86
10.2%
소아청소년과 67
 
8.0%
한방각과 62
 
7.4%
이비인후과 58
 
6.9%
정형외과 47
 
5.6%
가정의학과 45
 
5.4%
외과 32
 
3.8%
산부인과 32
 
3.8%
Other values (28) 214
25.4%
2023-12-12T10:26:58.811168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
950
23.8%
515
 
12.9%
169
 
4.2%
167
 
4.2%
164
 
4.1%
135
 
3.4%
123
 
3.1%
123
 
3.1%
107
 
2.7%
103
 
2.6%
Other values (53) 1432
35.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3441
86.3%
Space Separator 515
 
12.9%
Other Punctuation 32
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
950
27.6%
169
 
4.9%
167
 
4.9%
164
 
4.8%
135
 
3.9%
123
 
3.6%
123
 
3.6%
107
 
3.1%
103
 
3.0%
97
 
2.8%
Other values (51) 1303
37.9%
Space Separator
ValueCountFrequency (%)
515
100.0%
Other Punctuation
ValueCountFrequency (%)
, 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3441
86.3%
Common 547
 
13.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
950
27.6%
169
 
4.9%
167
 
4.9%
164
 
4.8%
135
 
3.9%
123
 
3.6%
123
 
3.6%
107
 
3.1%
103
 
3.0%
97
 
2.8%
Other values (51) 1303
37.9%
Common
ValueCountFrequency (%)
515
94.1%
, 32
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3441
86.3%
ASCII 547
 
13.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
950
27.6%
169
 
4.9%
167
 
4.9%
164
 
4.8%
135
 
3.9%
123
 
3.6%
123
 
3.6%
107
 
3.1%
103
 
3.0%
97
 
2.8%
Other values (51) 1303
37.9%
ASCII
ValueCountFrequency (%)
515
94.1%
, 32
 
5.9%

Interactions

2023-12-12T10:26:54.294284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:26:58.917521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번종별
연번1.0000.950
종별0.9501.000
2023-12-12T10:26:59.001172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번종별
연번1.0000.689
종별0.6891.000

Missing values

2023-12-12T10:26:54.436507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:26:54.548686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번종별명칭도로명주소전화번호진료과목
01종합병원순천향대학교 부속 서울병원서울특별시 용산구 대사관로 5902-709-9114내과, 외과, 정형외과, 신경과, 산부인과, 소아과,이비인후과, 흉부외과, 정신과, 성형외과, 신경외과, 안과, 피부과, 치과, 마취통증의학과, 비뇨기과, 산업의학과, 영상의학과, 방사선종양학과, 진단검사 의학과, 병리과, 재활의학과, 가정의학과, 응급의학과
12병원금강아산병원서울특별시 용산구 이촌로 31802-799-5000내과,정형외과,산부인과,영상의학과,가정의학과 진단검사의학과,마취통증의학과,재활의학과,외과
23병원소화병원서울특별시 용산구 청파로 38302-705-9000소아청소년과, 영상의학과, 진단검사의학과
34의원수연제장의원서울특별시 용산구 유엔빌리지길 72, 1층 (한남동)02-3438-0510피부과 외과 신경과 내과
45의원고유피부과의원서울특별시 용산구 독서당로 124, 3층 (한남동)02-749-5005피부과
56의원페이브피부과의원서울특별시 용산구 한강대로 69, 2층 217-1, 2호 (한강로2가, 용산푸르지오써밋)02-2039-1656피부과
67의원마인드카페정신건강의학과의원서울특별시 용산구 장문로 23, 몬드리안 서울 이태원 지하2층 (이태원동)02-790-9811정신건강의학과
78의원봄빛의원서울특별시 용산구 이촌로88길 16, 미학빌딩 3층 (이촌동)02-6951-2870가정의학과 피부과
89의원제일성모이비인후과의원서울특별시 용산구 백범로 341, 128호 (원효로1가, 리첸시아 용산)02-706-3222가정의학과 이비인후과 소아청소년과 내과
910의원에스앤유제통의원서울특별시 용산구 이촌로 200, 206호 (이촌동)02-795-7575가정의학과 마취통증의학과 내과
연번종별명칭도로명주소전화번호진료과목
316317한의원한일한의원서울특별시 용산구 소월로2길 21 (후암동)02-777-8501한방각과
317318한의원조재형한의원서울특별시 용산구 후암로 32 (후암동)02-757-0675한방각과
318319한의원안아픈세상경희한의원서울특별시 용산구 한강대로 109, 용성비즈텔 506~507호 (한강로2가)02-796-8706한방각과
319320한의원강원한의원서울특별시 용산구 이태원로 270, 호성빌딩 4층 (한남동)02-792-1991한방각과
320321한의원안영한의원서울특별시 용산구 한남대로21길 9 (한남동)02-797-3566한방각과
321322한의원김재관한의원서울특별시 용산구 한강대로 69, 상가동 307-1호 (한강로2가, 용산푸르지오써밋)02-790-1047한방각과
322323한의원영화당한의원서울특별시 용산구 새창로33길 4 (용문동, 영화당한의원)02-718-9315한방각과
323324한의원보림한의원서울특별시 용산구 한강대로40길 5 (한강로2가)02-794-4454한방각과
324325한의원세정한의원서울특별시 용산구 이촌로 248 (이촌동)02-794-1464한방각과
325326한의원동성한의원서울특별시 용산구 후암로 64-1 (후암동)02-754-8742한방각과