Overview

Dataset statistics

Number of variables5
Number of observations298
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.1 KiB
Average record size in memory41.4 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description경상북도 23개 시군 보건진료소 현황 자료 입니다. 308개소/ 농어촌 등 보건의료를 위한 특별조치법에 따라 의료취약지역 주민의 건강증진, 예방, 치료, 및 재활 등의 통합된 보건의료서비스를 제공합니다.
URLhttps://www.data.go.kr/data/15056412/fileData.do

Alerts

연번 is highly overall correlated with 시군High correlation
시군 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:54:06.936743
Analysis finished2023-12-12 01:54:07.541855
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct298
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean149.5
Minimum1
Maximum298
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.7 KiB
2023-12-12T10:54:07.636715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile15.85
Q175.25
median149.5
Q3223.75
95-th percentile283.15
Maximum298
Range297
Interquartile range (IQR)148.5

Descriptive statistics

Standard deviation86.169407
Coefficient of variation (CV)0.57638399
Kurtosis-1.2
Mean149.5
Median Absolute Deviation (MAD)74.5
Skewness0
Sum44551
Variance7425.1667
MonotonicityStrictly increasing
2023-12-12T10:54:07.794651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
206 1
 
0.3%
204 1
 
0.3%
203 1
 
0.3%
202 1
 
0.3%
201 1
 
0.3%
200 1
 
0.3%
199 1
 
0.3%
198 1
 
0.3%
197 1
 
0.3%
Other values (288) 288
96.6%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
298 1
0.3%
297 1
0.3%
296 1
0.3%
295 1
0.3%
294 1
0.3%
293 1
0.3%
292 1
0.3%
291 1
0.3%
290 1
0.3%
289 1
0.3%

시군
Categorical

HIGH CORRELATION 

Distinct23
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
안동시
25 
상주시
25 
의성군
21 
경주시
 
16
김천시
 
16
Other values (18)
195 

Length

Max length6
Median length3
Mean length3.2214765
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row포항시 남구
2nd row포항시 남구
3rd row포항시 남구
4th row포항시 남구
5th row포항시 남구

Common Values

ValueCountFrequency (%)
안동시 25
 
8.4%
상주시 25
 
8.4%
의성군 21
 
7.0%
경주시 16
 
5.4%
김천시 16
 
5.4%
울진군 16
 
5.4%
예천군 16
 
5.4%
영주시 13
 
4.4%
영천시 13
 
4.4%
문경시 13
 
4.4%
Other values (13) 124
41.6%

Length

2023-12-12T10:54:07.992074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
안동시 25
 
7.8%
상주시 25
 
7.8%
포항시 22
 
6.9%
의성군 21
 
6.6%
경주시 16
 
5.0%
김천시 16
 
5.0%
울진군 16
 
5.0%
예천군 16
 
5.0%
영주시 13
 
4.1%
영천시 13
 
4.1%
Other values (14) 137
42.8%
Distinct284
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T10:54:08.299728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length7.0067114
Min length7

Characters and Unicode

Total characters2088
Distinct characters153
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique273 ?
Unique (%)91.6%

Sample

1st row구평보건진료소
2nd row금광보건진료소
3rd row대동배보건진료소
4th row두원보건진료소
5th row모포보건진료소
ValueCountFrequency (%)
봉산보건진료소 3
 
1.0%
금곡보건진료소 3
 
1.0%
옥산보건진료소 3
 
1.0%
지동보건진료소 2
 
0.7%
동부보건진료소 2
 
0.7%
용산보건진료소 2
 
0.7%
운곡보건진료소 2
 
0.7%
용화보건진료소 2
 
0.7%
덕촌보건진료소 2
 
0.7%
상송보건진료소 2
 
0.7%
Other values (274) 275
92.3%
2023-12-12T10:54:08.773991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
307
14.7%
303
14.5%
302
14.5%
298
14.3%
298
14.3%
25
 
1.2%
23
 
1.1%
16
 
0.8%
16
 
0.8%
15
 
0.7%
Other values (143) 485
23.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2088
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
307
14.7%
303
14.5%
302
14.5%
298
14.3%
298
14.3%
25
 
1.2%
23
 
1.1%
16
 
0.8%
16
 
0.8%
15
 
0.7%
Other values (143) 485
23.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2088
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
307
14.7%
303
14.5%
302
14.5%
298
14.3%
298
14.3%
25
 
1.2%
23
 
1.1%
16
 
0.8%
16
 
0.8%
15
 
0.7%
Other values (143) 485
23.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2088
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
307
14.7%
303
14.5%
302
14.5%
298
14.3%
298
14.3%
25
 
1.2%
23
 
1.1%
16
 
0.8%
16
 
0.8%
15
 
0.7%
Other values (143) 485
23.2%

주소
Text

Distinct296
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T10:54:09.162143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length28
Mean length21.902685
Min length16

Characters and Unicode

Total characters6527
Distinct characters233
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique294 ?
Unique (%)98.7%

Sample

1st row경상북도 포항시 남구 구룡포읍 동해안로4260번길 5-7
2nd row경상북도 포항시 남구 동해면 금광로 239
3rd row경상북도 포항시 남구 호미곶면 호미로 1871
4th row경상북도 포항시 남구 장기면 동해안로 2718
5th row경상북도 포항시 남구 장기면 모포길 67
ValueCountFrequency (%)
경상북도 297
 
19.6%
안동시 25
 
1.7%
상주시 25
 
1.7%
포항시 22
 
1.5%
의성군 21
 
1.4%
경주시 16
 
1.1%
김천시 16
 
1.1%
예천군 16
 
1.1%
울진군 16
 
1.1%
영주시 13
 
0.9%
Other values (751) 1047
69.2%
2023-12-12T10:54:09.761804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1434
22.0%
344
 
5.3%
332
 
5.1%
330
 
5.1%
313
 
4.8%
260
 
4.0%
1 200
 
3.1%
168
 
2.6%
167
 
2.6%
147
 
2.3%
Other values (223) 2832
43.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4073
62.4%
Space Separator 1434
 
22.0%
Decimal Number 949
 
14.5%
Dash Punctuation 68
 
1.0%
Other Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
344
 
8.4%
332
 
8.2%
330
 
8.1%
313
 
7.7%
260
 
6.4%
168
 
4.1%
167
 
4.1%
147
 
3.6%
133
 
3.3%
80
 
2.0%
Other values (210) 1799
44.2%
Decimal Number
ValueCountFrequency (%)
1 200
21.1%
3 117
12.3%
4 103
10.9%
2 97
10.2%
5 93
9.8%
7 77
 
8.1%
6 74
 
7.8%
0 65
 
6.8%
8 62
 
6.5%
9 61
 
6.4%
Space Separator
ValueCountFrequency (%)
1434
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 68
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4073
62.4%
Common 2454
37.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
344
 
8.4%
332
 
8.2%
330
 
8.1%
313
 
7.7%
260
 
6.4%
168
 
4.1%
167
 
4.1%
147
 
3.6%
133
 
3.3%
80
 
2.0%
Other values (210) 1799
44.2%
Common
ValueCountFrequency (%)
1434
58.4%
1 200
 
8.1%
3 117
 
4.8%
4 103
 
4.2%
2 97
 
4.0%
5 93
 
3.8%
7 77
 
3.1%
6 74
 
3.0%
- 68
 
2.8%
0 65
 
2.6%
Other values (3) 126
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4073
62.4%
ASCII 2454
37.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1434
58.4%
1 200
 
8.1%
3 117
 
4.8%
4 103
 
4.2%
2 97
 
4.0%
5 93
 
3.8%
7 77
 
3.1%
6 74
 
3.0%
- 68
 
2.8%
0 65
 
2.6%
Other values (3) 126
 
5.1%
Hangul
ValueCountFrequency (%)
344
 
8.4%
332
 
8.2%
330
 
8.1%
313
 
7.7%
260
 
6.4%
168
 
4.1%
167
 
4.1%
147
 
3.6%
133
 
3.3%
80
 
2.0%
Other values (210) 1799
44.2%

전화번호
Text

UNIQUE 

Distinct298
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2023-12-12T10:54:10.041249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters3576
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique298 ?
Unique (%)100.0%

Sample

1st row054-284-7091
2nd row054-284-4742
3rd row054-284-8992
4th row054-293-3752
5th row054-284-7121
ValueCountFrequency (%)
054-284-7091 1
 
0.3%
054-730-7032 1
 
0.3%
054-682-8852 1
 
0.3%
054-682-7186 1
 
0.3%
054-682-0445 1
 
0.3%
054-682-0553 1
 
0.3%
054-682-6725 1
 
0.3%
054-683-0205 1
 
0.3%
054-683-4972 1
 
0.3%
054-870-7481 1
 
0.3%
Other values (288) 288
96.6%
2023-12-12T10:54:10.524714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 596
16.7%
5 530
14.8%
0 494
13.8%
4 459
12.8%
3 277
7.7%
7 244
6.8%
8 241
6.7%
9 211
 
5.9%
2 208
 
5.8%
6 192
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2980
83.3%
Dash Punctuation 596
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 530
17.8%
0 494
16.6%
4 459
15.4%
3 277
9.3%
7 244
8.2%
8 241
8.1%
9 211
 
7.1%
2 208
 
7.0%
6 192
 
6.4%
1 124
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 596
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3576
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 596
16.7%
5 530
14.8%
0 494
13.8%
4 459
12.8%
3 277
7.7%
7 244
6.8%
8 241
6.7%
9 211
 
5.9%
2 208
 
5.8%
6 192
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3576
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 596
16.7%
5 530
14.8%
0 494
13.8%
4 459
12.8%
3 277
7.7%
7 244
6.8%
8 241
6.7%
9 211
 
5.9%
2 208
 
5.8%
6 192
 
5.4%

Interactions

2023-12-12T10:54:07.259659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:54:10.658315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군
연번1.0000.979
시군0.9791.000
2023-12-12T10:54:10.771223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군
연번1.0000.862
시군0.8621.000

Missing values

2023-12-12T10:54:07.390550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:54:07.493805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시군보건진료소명주소전화번호
01포항시 남구구평보건진료소경상북도 포항시 남구 구룡포읍 동해안로4260번길 5-7054-284-7091
12포항시 남구금광보건진료소경상북도 포항시 남구 동해면 금광로 239054-284-4742
23포항시 남구대동배보건진료소경상북도 포항시 남구 호미곶면 호미로 1871054-284-8992
34포항시 남구두원보건진료소경상북도 포항시 남구 장기면 동해안로 2718054-293-3752
45포항시 남구모포보건진료소경상북도 포항시 남구 장기면 모포길 67054-284-7121
56포항시 남구양포보건진료소경상북도 포항시 남구 장기면 양포항길 12054-276-1019
67포항시 남구용산보건진료소경상북도 포항시 남구 오천읍 기림로 2000054-291-6994
78포항시 남구장동보건진료소경상북도 포항시 남구 대송면 장동홍계길 222-18054-285-0970
89포항시 남구학전보건진료소경상북도 포항시 남구 연일읍 자명로 326-3054-278-3861
910포항시 남구흥환보건진료소경상북도 포항시 남구 동해면 호미로 2494054-291-5963
연번시군보건진료소명주소전화번호
288289울진군왕피보건진료소경상북도 울진군 금강송면 왕피길 1435054-789-5986
289290울진군갈면보건진료소경상북도 울진군 매화면 갈면대평길 237054-789-5988
290291울진군오산보건진료소경상북도 울진군 매화면 망양정로 49054-789-5989
291292울진군직산보건진료소경상북도 울진군 평해읍 직고개길 27054-789-5981
292293울진군진복보건진료소경상북도 울진군 근남면 매오길 443054-789-5987
293294울진군하당보건진료소경상북도 울진군 북면 하당3길 34054-789-5983
294295울진군후포보건진료소경상북도 울진군 후포면 울진대게로 333054-789-5995
295296울릉군저동보건진료소경상북도 울릉군 울릉읍 저동1길 21-20054-791-2808
296297울릉군태하보건진료소경상북도 울릉군 서면 태하길 214-1054-791-5307
297298울릉군현포보건진료소경상북도 울릉군 북면 울릉순환로 2624-21054-790-5788