Overview

Dataset statistics

Number of variables4
Number of observations7138
Missing cells3
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory230.2 KiB
Average record size in memory33.0 B

Variable types

Text2
Numeric1
DateTime1

Dataset

Description시도별 진료 정보를 교류하는 의료기관의 우편번호, 주소, 가입일자 등을 제공하는 의료기관 진료정보 교류 현황 조회 서비스
URLhttps://www.data.go.kr/data/15065402/fileData.do

Alerts

우편번호 has 104 (1.5%) zerosZeros

Reproduction

Analysis started2023-12-12 15:29:34.307365
Analysis finished2023-12-12 15:29:35.405845
Duration1.1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct6290
Distinct (%)88.1%
Missing0
Missing (%)0.0%
Memory size55.9 KiB
2023-12-13T00:29:35.579685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length25
Mean length8.2792099
Min length3

Characters and Unicode

Total characters59097
Distinct characters642
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5865 ?
Unique (%)82.2%

Sample

1st row가톨릭대학교 여의도성모병원
2nd row중앙대학교병원
3rd row서울대학교병원
4th row강북삼성병원
5th row서울적십자병원
ValueCountFrequency (%)
의료법인 104
 
1.4%
서울삼성내과의원 19
 
0.2%
연세내과의원 15
 
0.2%
의원 15
 
0.2%
속편한내과의원 14
 
0.2%
두리이비인후과의원 12
 
0.2%
이내과의원 12
 
0.2%
서울내과의원 11
 
0.1%
연세이비인후과의원 11
 
0.1%
근로복지공단 9
 
0.1%
Other values (6524) 7398
97.1%
2023-12-13T00:29:36.028091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7148
 
12.1%
6804
 
11.5%
5002
 
8.5%
1648
 
2.8%
1216
 
2.1%
1215
 
2.1%
1100
 
1.9%
1021
 
1.7%
848
 
1.4%
842
 
1.4%
Other values (632) 32253
54.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 58117
98.3%
Space Separator 489
 
0.8%
Decimal Number 222
 
0.4%
Uppercase Letter 101
 
0.2%
Close Punctuation 65
 
0.1%
Open Punctuation 60
 
0.1%
Other Punctuation 21
 
< 0.1%
Lowercase Letter 19
 
< 0.1%
Dash Punctuation 2
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7148
 
12.3%
6804
 
11.7%
5002
 
8.6%
1648
 
2.8%
1216
 
2.1%
1215
 
2.1%
1100
 
1.9%
1021
 
1.8%
848
 
1.5%
842
 
1.4%
Other values (586) 31273
53.8%
Uppercase Letter
ValueCountFrequency (%)
S 22
21.8%
K 10
9.9%
M 9
8.9%
B 7
 
6.9%
H 7
 
6.9%
W 6
 
5.9%
D 6
 
5.9%
N 6
 
5.9%
J 5
 
5.0%
O 4
 
4.0%
Other values (8) 19
18.8%
Lowercase Letter
ValueCountFrequency (%)
e 8
42.1%
r 3
 
15.8%
s 1
 
5.3%
n 1
 
5.3%
a 1
 
5.3%
g 1
 
5.3%
m 1
 
5.3%
c 1
 
5.3%
i 1
 
5.3%
h 1
 
5.3%
Decimal Number
ValueCountFrequency (%)
3 55
24.8%
5 55
24.8%
6 53
23.9%
2 23
10.4%
1 22
 
9.9%
8 6
 
2.7%
0 5
 
2.3%
9 2
 
0.9%
4 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
. 10
47.6%
& 7
33.3%
· 3
 
14.3%
, 1
 
4.8%
Space Separator
ValueCountFrequency (%)
489
100.0%
Close Punctuation
ValueCountFrequency (%)
) 65
100.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 58116
98.3%
Common 860
 
1.5%
Latin 120
 
0.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7148
 
12.3%
6804
 
11.7%
5002
 
8.6%
1648
 
2.8%
1216
 
2.1%
1215
 
2.1%
1100
 
1.9%
1021
 
1.8%
848
 
1.5%
842
 
1.4%
Other values (585) 31272
53.8%
Latin
ValueCountFrequency (%)
S 22
18.3%
K 10
 
8.3%
M 9
 
7.5%
e 8
 
6.7%
B 7
 
5.8%
H 7
 
5.8%
W 6
 
5.0%
D 6
 
5.0%
N 6
 
5.0%
J 5
 
4.2%
Other values (18) 34
28.3%
Common
ValueCountFrequency (%)
489
56.9%
) 65
 
7.6%
( 60
 
7.0%
3 55
 
6.4%
5 55
 
6.4%
6 53
 
6.2%
2 23
 
2.7%
1 22
 
2.6%
. 10
 
1.2%
& 7
 
0.8%
Other values (8) 21
 
2.4%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 58116
98.3%
ASCII 977
 
1.7%
None 3
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7148
 
12.3%
6804
 
11.7%
5002
 
8.6%
1648
 
2.8%
1216
 
2.1%
1215
 
2.1%
1100
 
1.9%
1021
 
1.8%
848
 
1.5%
842
 
1.4%
Other values (585) 31272
53.8%
ASCII
ValueCountFrequency (%)
489
50.1%
) 65
 
6.7%
( 60
 
6.1%
3 55
 
5.6%
5 55
 
5.6%
6 53
 
5.4%
2 23
 
2.4%
1 22
 
2.3%
S 22
 
2.3%
K 10
 
1.0%
Other values (35) 123
 
12.6%
None
ValueCountFrequency (%)
· 3
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

우편번호
Real number (ℝ)

ZEROS 

Distinct4361
Distinct (%)61.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25099.379
Minimum0
Maximum704939
Zeros104
Zeros (%)1.5%
Negative0
Negative (%)0.0%
Memory size62.9 KiB
2023-12-13T00:29:36.186472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2484
Q17774
median17117
Q341996
95-th percentile58651.45
Maximum704939
Range704939
Interquartile range (IQR)34222

Descriptive statistics

Standard deviation27435.693
Coefficient of variation (CV)1.0930825
Kurtosis253.39647
Mean25099.379
Median Absolute Deviation (MAD)11853.5
Skewness11.797121
Sum1.7915937 × 108
Variance7.5271727 × 108
MonotonicityNot monotonic
2023-12-13T00:29:36.349050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 104
 
1.5%
14072 14
 
0.2%
46576 13
 
0.2%
15360 13
 
0.2%
13618 12
 
0.2%
21405 12
 
0.2%
16329 12
 
0.2%
15865 12
 
0.2%
34909 11
 
0.2%
10386 11
 
0.2%
Other values (4351) 6924
97.0%
ValueCountFrequency (%)
0 104
1.5%
1034 1
 
< 0.1%
1041 1
 
< 0.1%
1043 1
 
< 0.1%
1044 1
 
< 0.1%
1046 1
 
< 0.1%
1054 1
 
< 0.1%
1055 1
 
< 0.1%
1058 1
 
< 0.1%
1062 4
 
0.1%
ValueCountFrequency (%)
704939 1
< 0.1%
704910 1
< 0.1%
619952 1
< 0.1%
617010 1
< 0.1%
614849 1
< 0.1%
614817 1
< 0.1%
501821 1
< 0.1%
471833 1
< 0.1%
425830 1
< 0.1%
63640 1
< 0.1%

주소
Text

Distinct7059
Distinct (%)98.9%
Missing3
Missing (%)< 0.1%
Memory size55.9 KiB
2023-12-13T00:29:36.698143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length68
Mean length28.908619
Min length8

Characters and Unicode

Total characters206263
Distinct characters674
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6990 ?
Unique (%)98.0%

Sample

1st row서울특별시 영등포구 63로 10 여의도성모병원 (여의도동)
2nd row서울특별시 동작구 흑석로 102 (흑석동)
3rd row서울특별시 종로구 대학로 101 (연건동)
4th row서울특별시 종로구 새문안로 29 (평동)
5th row서울특별시 종로구 새문안로 9 적십자병원 (평동)
ValueCountFrequency (%)
서울특별시 1783
 
4.1%
경기도 1729
 
4.0%
2층 859
 
2.0%
부산광역시 642
 
1.5%
3층 611
 
1.4%
대구광역시 337
 
0.8%
4층 336
 
0.8%
인천광역시 326
 
0.7%
수원시 303
 
0.7%
서구 254
 
0.6%
Other values (9165) 36538
83.6%
2023-12-13T00:29:37.362953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38096
 
18.5%
7147
 
3.5%
7108
 
3.4%
6823
 
3.3%
1 6241
 
3.0%
6088
 
3.0%
2 5652
 
2.7%
) 5411
 
2.6%
( 5411
 
2.6%
3 4600
 
2.2%
Other values (664) 113686
55.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 118199
57.3%
Space Separator 38096
 
18.5%
Decimal Number 34372
 
16.7%
Close Punctuation 5411
 
2.6%
Open Punctuation 5411
 
2.6%
Other Punctuation 2940
 
1.4%
Dash Punctuation 801
 
0.4%
Math Symbol 499
 
0.2%
Uppercase Letter 458
 
0.2%
Lowercase Letter 70
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7147
 
6.0%
7108
 
6.0%
6823
 
5.8%
6088
 
5.2%
3429
 
2.9%
3407
 
2.9%
3163
 
2.7%
2671
 
2.3%
2651
 
2.2%
2503
 
2.1%
Other values (592) 73209
61.9%
Uppercase Letter
ValueCountFrequency (%)
B 79
17.2%
A 69
15.1%
C 35
 
7.6%
S 29
 
6.3%
M 28
 
6.1%
K 25
 
5.5%
L 20
 
4.4%
D 17
 
3.7%
E 16
 
3.5%
I 14
 
3.1%
Other values (14) 126
27.5%
Lowercase Letter
ValueCountFrequency (%)
e 23
32.9%
o 5
 
7.1%
i 4
 
5.7%
w 4
 
5.7%
l 4
 
5.7%
k 3
 
4.3%
r 3
 
4.3%
s 3
 
4.3%
a 3
 
4.3%
t 2
 
2.9%
Other values (10) 16
22.9%
Decimal Number
ValueCountFrequency (%)
1 6241
18.2%
2 5652
16.4%
3 4600
13.4%
0 4074
11.9%
4 3218
9.4%
5 2874
8.4%
6 2319
 
6.7%
7 1998
 
5.8%
8 1737
 
5.1%
9 1659
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 2848
96.9%
. 63
 
2.1%
/ 15
 
0.5%
· 6
 
0.2%
& 5
 
0.2%
: 2
 
0.1%
@ 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 493
98.8%
+ 3
 
0.6%
2
 
0.4%
1
 
0.2%
Letter Number
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
38096
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5411
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5411
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 801
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 118199
57.3%
Common 87530
42.4%
Latin 534
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7147
 
6.0%
7108
 
6.0%
6823
 
5.8%
6088
 
5.2%
3429
 
2.9%
3407
 
2.9%
3163
 
2.7%
2671
 
2.3%
2651
 
2.2%
2503
 
2.1%
Other values (592) 73209
61.9%
Latin
ValueCountFrequency (%)
B 79
14.8%
A 69
 
12.9%
C 35
 
6.6%
S 29
 
5.4%
M 28
 
5.2%
K 25
 
4.7%
e 23
 
4.3%
L 20
 
3.7%
D 17
 
3.2%
E 16
 
3.0%
Other values (37) 193
36.1%
Common
ValueCountFrequency (%)
38096
43.5%
1 6241
 
7.1%
2 5652
 
6.5%
) 5411
 
6.2%
( 5411
 
6.2%
3 4600
 
5.3%
0 4074
 
4.7%
4 3218
 
3.7%
5 2874
 
3.3%
, 2848
 
3.3%
Other values (15) 9105
 
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 118195
57.3%
ASCII 88049
42.7%
None 8
 
< 0.1%
Number Forms 6
 
< 0.1%
Compat Jamo 4
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38096
43.3%
1 6241
 
7.1%
2 5652
 
6.4%
) 5411
 
6.1%
( 5411
 
6.1%
3 4600
 
5.2%
0 4074
 
4.6%
4 3218
 
3.7%
5 2874
 
3.3%
, 2848
 
3.2%
Other values (56) 9624
 
10.9%
Hangul
ValueCountFrequency (%)
7147
 
6.0%
7108
 
6.0%
6823
 
5.8%
6088
 
5.2%
3429
 
2.9%
3407
 
2.9%
3163
 
2.7%
2671
 
2.3%
2651
 
2.2%
2503
 
2.1%
Other values (591) 73205
61.9%
None
ValueCountFrequency (%)
· 6
75.0%
2
 
25.0%
Compat Jamo
ValueCountFrequency (%)
4
100.0%
Number Forms
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Math Operators
ValueCountFrequency (%)
1
100.0%
Distinct437
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size55.9 KiB
Minimum2017-12-13 00:00:00
Maximum2022-12-29 00:00:00
2023-12-13T00:29:37.517804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:29:37.691509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-13T00:29:35.027593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T00:29:35.198843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:29:35.368741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

의료기관명우편번호주소등록일
0가톨릭대학교 여의도성모병원7345서울특별시 영등포구 63로 10 여의도성모병원 (여의도동)2019-03-19
1중앙대학교병원6973서울특별시 동작구 흑석로 102 (흑석동)2020-09-22
2서울대학교병원3080서울특별시 종로구 대학로 101 (연건동)2019-02-26
3강북삼성병원3181서울특별시 종로구 새문안로 29 (평동)2020-06-04
4서울적십자병원3181서울특별시 종로구 새문안로 9 적십자병원 (평동)2019-02-11
5학교법인 고려중앙학원 고려대학교의과대학부속병원(안암병원)2841서울특별시 성북구 고려대로 73 고려대병원2020-09-04
6가톨릭대학교 은평성모병원3312서울특별시 은평구 통일로 1021 (진관동)2019-03-19
7경희대학교병원2447서울특별시 동대문구 경희대로 23 (회기동)2020-09-22
8한양대학교병원4763서울특별시 성동구 왕십리로 222-1 (사근동)2020-09-22
9연세대학교의과대학세브란스병원3722서울특별시 서대문구 연세로 50-1 (신촌동)2018-01-08
의료기관명우편번호주소등록일
7128눈애안과의원18309경기도 화성시 봉담읍 상리2길 51 3층2022-10-06
7129연세베스트내과의원16469경기도 수원시 팔달구 인계로 20 SK뷰 근린생활시설1 207~212호 (매교동)2022-10-24
7130늘서울치과의원21524인천광역시 남동구 만수서로 62 306, 307호 (만수동)2021-01-22
7131삼성이튼치과의원13835경기도 과천시 별양로 28 래미안 슈르 제상가 3층 3007~3010호,3011호호 (원문동)2021-03-23
7132서울올바른치과의원16509경기 수원시 영통구 에듀타운로 24 (이의동, 그린메디칼)2021-03-19
7133보아스이비인후과22695인천 서구 승학로 497 (검암동, 검암프라자)2022-06-17
7134광교JC정형외과16943경기 용인시 수지구 상현동 1131-22020-12-17
7135오치과의원57959전남 순천시 장평로 55 (인제동, 순천원예농협 인제지점)2021-04-06
7136오즈신경과의원18141경기도 오산시 대원로 6 4층 (원동, 역전빌딩)2020-10-14
7137근로복지공단44428울산광역시 중구 종가로 340 근로복지공단2021-03-16