Overview

Dataset statistics

Number of variables4
Number of observations36
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory36.7 B

Variable types

Text3
Categorical1

Dataset

Description부산광역시 서구에 주소지를 둔 치과병원의 사업장명, 주소(도로명주소), 의료인수, 전화번호 정보에 대한 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/15045205/fileData.do

Alerts

의료인(전체) is highly imbalanced (62.6%)Imbalance
의료기관주소 has unique valuesUnique
의료기관전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:50:08.560487
Analysis finished2023-12-12 09:50:08.984343
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct35
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-12T18:50:09.135558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9.5
Mean length6.6944444
Min length5

Characters and Unicode

Total characters241
Distinct characters68
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)94.4%

Sample

1st row서울리안치과의원
2nd row스타치과의원
3rd row에이스치과의원
4th row플러스치과의원
5th row경로치과의원
ValueCountFrequency (%)
강치과의원 2
 
5.6%
조호성치과의원 1
 
2.8%
임완재치과의원 1
 
2.8%
미치과의원 1
 
2.8%
연세치과의원 1
 
2.8%
손영경치과의원 1
 
2.8%
조영제치과의원 1
 
2.8%
정혜진치과의원 1
 
2.8%
유재현치과의원 1
 
2.8%
서울리안치과의원 1
 
2.8%
Other values (25) 25
69.4%
2023-12-12T18:50:09.493535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
36
14.9%
36
14.9%
36
14.9%
36
14.9%
6
 
2.5%
4
 
1.7%
4
 
1.7%
3
 
1.2%
3
 
1.2%
3
 
1.2%
Other values (58) 74
30.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 241
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
14.9%
36
14.9%
36
14.9%
36
14.9%
6
 
2.5%
4
 
1.7%
4
 
1.7%
3
 
1.2%
3
 
1.2%
3
 
1.2%
Other values (58) 74
30.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 241
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
14.9%
36
14.9%
36
14.9%
36
14.9%
6
 
2.5%
4
 
1.7%
4
 
1.7%
3
 
1.2%
3
 
1.2%
3
 
1.2%
Other values (58) 74
30.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 241
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
36
14.9%
36
14.9%
36
14.9%
36
14.9%
6
 
2.5%
4
 
1.7%
4
 
1.7%
3
 
1.2%
3
 
1.2%
3
 
1.2%
Other values (58) 74
30.7%

의료기관주소
Text

UNIQUE 

Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-12T18:50:09.748909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length37
Mean length29.888889
Min length21

Characters and Unicode

Total characters1076
Distinct characters78
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row부산광역시 서구 구덕로 234, 201호 (부용동1가, 남명더라우)
2nd row부산광역시 서구 구덕로 317-1, 3층 (서대신동2가)
3rd row부산광역시 서구 충무대로 14, 3층 (암남동)
4th row부산광역시 서구 구덕로 109, 7층 (충무동1가)
5th row부산광역시 서구 구덕로321번길 38, 1층 (서대신동3가)
ValueCountFrequency (%)
부산광역시 36
17.1%
서구 36
17.1%
구덕로 19
 
9.0%
3층 7
 
3.3%
2층 7
 
3.3%
대영로 5
 
2.4%
충무동1가 5
 
2.4%
동대신동2가 5
 
2.4%
서대신동1가 5
 
2.4%
서대신동2가 4
 
1.9%
Other values (68) 82
38.9%
2023-12-12T18:50:10.157583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
175
 
16.3%
56
 
5.2%
48
 
4.5%
1 44
 
4.1%
44
 
4.1%
41
 
3.8%
2 41
 
3.8%
37
 
3.4%
, 37
 
3.4%
36
 
3.3%
Other values (68) 517
48.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 607
56.4%
Decimal Number 181
 
16.8%
Space Separator 175
 
16.3%
Other Punctuation 37
 
3.4%
Close Punctuation 36
 
3.3%
Open Punctuation 36
 
3.3%
Dash Punctuation 4
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
56
 
9.2%
48
 
7.9%
44
 
7.2%
41
 
6.8%
37
 
6.1%
36
 
5.9%
36
 
5.9%
36
 
5.9%
36
 
5.9%
33
 
5.4%
Other values (53) 204
33.6%
Decimal Number
ValueCountFrequency (%)
1 44
24.3%
2 41
22.7%
3 32
17.7%
4 20
11.0%
9 13
 
7.2%
0 12
 
6.6%
5 5
 
2.8%
8 5
 
2.8%
6 5
 
2.8%
7 4
 
2.2%
Space Separator
ValueCountFrequency (%)
175
100.0%
Other Punctuation
ValueCountFrequency (%)
, 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 607
56.4%
Common 469
43.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
56
 
9.2%
48
 
7.9%
44
 
7.2%
41
 
6.8%
37
 
6.1%
36
 
5.9%
36
 
5.9%
36
 
5.9%
36
 
5.9%
33
 
5.4%
Other values (53) 204
33.6%
Common
ValueCountFrequency (%)
175
37.3%
1 44
 
9.4%
2 41
 
8.7%
, 37
 
7.9%
) 36
 
7.7%
( 36
 
7.7%
3 32
 
6.8%
4 20
 
4.3%
9 13
 
2.8%
0 12
 
2.6%
Other values (5) 23
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 607
56.4%
ASCII 469
43.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
175
37.3%
1 44
 
9.4%
2 41
 
8.7%
, 37
 
7.9%
) 36
 
7.7%
( 36
 
7.7%
3 32
 
6.8%
4 20
 
4.3%
9 13
 
2.8%
0 12
 
2.6%
Other values (5) 23
 
4.9%
Hangul
ValueCountFrequency (%)
56
 
9.2%
48
 
7.9%
44
 
7.2%
41
 
6.8%
37
 
6.1%
36
 
5.9%
36
 
5.9%
36
 
5.9%
36
 
5.9%
33
 
5.4%
Other values (53) 204
33.6%
Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-12T18:50:10.375007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.666667
Min length8

Characters and Unicode

Total characters420
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row051-791-2880
2nd row952-1000
3rd row051-255-1555
4th row051-257-2804
5th row051-254-7141
ValueCountFrequency (%)
051-791-2880 1
 
2.8%
952-1000 1
 
2.8%
051-241-2357 1
 
2.8%
051-242-8026 1
 
2.8%
051-241-2337 1
 
2.8%
051-241-7903 1
 
2.8%
051-255-3328 1
 
2.8%
051-254-8126 1
 
2.8%
051-254-2626 1
 
2.8%
051-254-1790 1
 
2.8%
Other values (26) 26
72.2%
2023-12-12T18:50:10.737086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 70
16.7%
- 69
16.4%
5 63
15.0%
0 51
12.1%
1 50
11.9%
4 33
7.9%
7 27
 
6.4%
8 21
 
5.0%
3 14
 
3.3%
6 12
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 351
83.6%
Dash Punctuation 69
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 70
19.9%
5 63
17.9%
0 51
14.5%
1 50
14.2%
4 33
9.4%
7 27
 
7.7%
8 21
 
6.0%
3 14
 
4.0%
6 12
 
3.4%
9 10
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 69
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 420
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 70
16.7%
- 69
16.4%
5 63
15.0%
0 51
12.1%
1 50
11.9%
4 33
7.9%
7 27
 
6.4%
8 21
 
5.0%
3 14
 
3.3%
6 12
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 420
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 70
16.7%
- 69
16.4%
5 63
15.0%
0 51
12.1%
1 50
11.9%
4 33
7.9%
7 27
 
6.4%
8 21
 
5.0%
3 14
 
3.3%
6 12
 
2.9%

의료인(전체)
Categorical

IMBALANCE 

Distinct3
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size420.0 B
1
32 
2
 
3
4
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)2.8%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 32
88.9%
2 3
 
8.3%
4 1
 
2.8%

Length

2023-12-12T18:50:10.907107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:50:11.014928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 32
88.9%
2 3
 
8.3%
4 1
 
2.8%

Correlations

2023-12-12T18:50:11.085824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
의료기관명의료기관주소의료기관전화번호의료인(전체)
의료기관명1.0001.0001.0001.000
의료기관주소1.0001.0001.0001.000
의료기관전화번호1.0001.0001.0001.000
의료인(전체)1.0001.0001.0001.000

Missing values

2023-12-12T18:50:08.831393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:50:08.942519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

의료기관명의료기관주소의료기관전화번호의료인(전체)
0서울리안치과의원부산광역시 서구 구덕로 234, 201호 (부용동1가, 남명더라우)051-791-28801
1스타치과의원부산광역시 서구 구덕로 317-1, 3층 (서대신동2가)952-10001
2에이스치과의원부산광역시 서구 충무대로 14, 3층 (암남동)051-255-15551
3플러스치과의원부산광역시 서구 구덕로 109, 7층 (충무동1가)051-257-28041
4경로치과의원부산광역시 서구 구덕로321번길 38, 1층 (서대신동3가)051-254-71411
5이창웅치과의원부산광역시 서구 보수대로 9, 3층 (충무동1가, 바른빌딩)051-622-28781
6대신스마일치과의원부산광역시 서구 구덕로 323, 2층 (서대신동3가)051-257-28721
7강치과의원부산광역시 서구 구덕로 296, 2층 (동대신동2가)244-20801
8송성국치과의원부산광역시 서구 구덕로 304, 2층 (동대신동2가)051-255-75281
9리더스치과의원부산광역시 서구 대영로 54, 3층,6층 (서대신동1가, 대신메디컬센터)051-243-75284
의료기관명의료기관주소의료기관전화번호의료인(전체)
26임완재치과의원부산광역시 서구 대영로 44-1 (서대신동1가)051-241-23571
27조호성치과의원부산광역시 서구 충무시장길 9 (충무동1가)051-254-17901
28박종을치과의원부산광역시 서구 대티로 178, 12,13,14호 (서대신동2가, 대신해모로센트럴아파트)051-254-28281
29프랜드성창수치과의원부산광역시 서구 구덕로 281, 3층 (서대신동1가)051-247-71231
30정치과의원부산광역시 서구 까치고개로 194 (아미동2가)051-246-28751
31이수영치과의원부산광역시 서구 구덕로 121 (토성동4가)051-254-75741
32강순일치과의원부산광역시 서구 구덕로 124, 4층 (토성동4가)051-244-74001
33강치과의원부산광역시 서구 구덕로 115 (충무동1가)051-242-10031
34유재현치과의원부산광역시 서구 구덕로 293, 203,204,205동 (서대신동2가, 희망센츄럴타운)051-244-26231
35선일치과의원부산광역시 서구 까치고개로233번길 4, 2,3층 (토성동2가)051-231-13021