Overview

Dataset statistics

Number of variables: 8
Number of observations: 34
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.3 KiB
Average record size in memory: 68.9 B

Variable types

DateTime: 2
Text: 4
Categorical: 1
Numeric: 1

Dataset

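Description: Status information for hospitals and clinics currently operating in Seocheon-gun (서천군). It lists the opening date, founder name, medical institution name, medical institution type, address, and telephone number.
URL: https://www.data.go.kr/data/3069260/fileData.do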

Alerts

데이터기준일자 has constant value "2023-06-30" [Constant]
우편번호 is highly overall correlated with 의료기관종별 [High correlation]
의료기관종별 is highly overall correlated with 우편번호 [High correlation]
의료기관종별 is highly imbalanced (55.3%) [Imbalance]
개설일자 has unique values [Unique]
개설자명 has unique values [Unique]
의료기관명 has unique values [Unique]
의료기관전화번호 has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 16:28:12.368121
Analysis finished: 2023-12-12 16:28:13.077644
Duration: 0.71 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

개설일자
Date

UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B
Minimum: 1988-06-03 00:00:00
Maximum: 2017-12-12 00:00:00
Histogram with fixed size bins (bins=34)

개설자명
Text

UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 4
Median length: 3
Mean length: 3.0294118
Min length: 3

Characters and Unicode

Total characters: 103
Distinct characters: 61
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
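
As a rough illustration of how such per-character properties can be derived, the sketch below uses Python's standard `unicodedata` module on one institution name from the sample rows. The helper name `char_properties` is hypothetical, and inferring the script from the character name is a simplifying assumption (the standard library does not expose the Unicode Script property directly); it is not necessarily how ydata-profiling computes these figures.

```python
import unicodedata
from collections import Counter

# Short general-category codes mapped to the long names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
}

def char_properties(text):
    """Count general categories and (approximate) scripts for a string."""
    categories = Counter()
    scripts = Counter()
    for ch in text:
        code = unicodedata.category(ch)
        categories[CATEGORY_NAMES.get(code, code)] += 1
        # Assumption: a character whose Unicode name starts with "HANGUL"
        # is counted as Hangul; everything else is lumped into "Common".
        name = unicodedata.name(ch, "")
        scripts["Hangul" if name.startswith("HANGUL") else "Common"] += 1
    return categories, scripts

categories, scripts = char_properties("서천 김안과의원")
print(dict(categories))
print(dict(scripts))
```

For this name the sketch counts seven letters and one space, which is the same kind of split the report shows between Other Letter and Space Separator.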

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: 이학수
2nd row: 조민수
3rd row: 김재환
4th row: 김홍섭
5th row: 김바울

Value    Count    Frequency (%)
이학수    1    2.9%
조민수    1    2.9%
김진오    1    2.9%
박정자    1    2.9%
서천군수    1    2.9%
손장신    1    2.9%
장명훈    1    2.9%
이규현    1    2.9%
장봉열    1    2.9%
양조환    1    2.9%
Other values (24)    24    70.6%

Most occurring characters

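Value    Count    Frequency (%)
(not shown)    10    9.7%
(not shown)    6    5.8%
(not shown)    5    4.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    2    1.9%
(not shown)    2    1.9%
Other values (51)    63    61.2%

Most occurring categories

Value    Count    Frequency (%)
Other Letter    103    100.0%

Most frequent character per category

Other Letter
Value    Count    Frequency (%)
(not shown)    10    9.7%
(not shown)    6    5.8%
(not shown)    5    4.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    2    1.9%
(not shown)    2    1.9%
Other values (51)    63    61.2%

Most occurring scripts

Value    Count    Frequency (%)
Hangul    103    100.0%

Most frequent character per script

Hangul
Value    Count    Frequency (%)
(not shown)    10    9.7%
(not shown)    6    5.8%
(not shown)    5    4.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    2    1.9%
(not shown)    2    1.9%
Other values (51)    63    61.2%

Most occurring blocks

Value    Count    Frequency (%)
Hangul    103    100.0%

Most frequent character per block

Hangul
Value    Count    Frequency (%)
(not shown)    10    9.7%
(not shown)    6    5.8%
(not shown)    5    4.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    3    2.9%
(not shown)    2    1.9%
(not shown)    2    1.9%
Other values (51)    63    61.2%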

의료기관명
Text

UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 11
Median length: 9
Mean length: 7.2352941
Min length: 4

Characters and Unicode

Total characters: 246
Distinct characters: 80
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: 서천이안과의원
2nd row: 서천항장외과의원
3rd row: 청담플러스의원
4th row: 늘봄정형외과의원
5th row: 김바울내과의원

Value    Count    Frequency (%)
서천이안과의원    1    2.8%
서울의원    1    2.8%
서천사랑병원    1    2.8%
서천요양병원    1    2.8%
서천군립노인요양병원    1    2.8%
서천한국요양병원    1    2.8%
동서천요양병원    1    2.8%
한산의원    1    2.8%
서천항장외과의원    1    2.8%
장봉열내과의원    1    2.8%
Other values (26)    26    72.2%

Most occurring characters

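Value    Count    Frequency (%)
(not shown)    34    13.8%
(not shown)    32    13.0%
(not shown)    19    7.7%
(not shown)    13    5.3%
(not shown)    10    4.1%
(not shown)    8    3.3%
(not shown)    8    3.3%
(not shown)    6    2.4%
(not shown)    6    2.4%
(not shown)    4    1.6%
Other values (70)    106    43.1%

Most occurring categories

Value    Count    Frequency (%)
Other Letter    244    99.2%
Space Separator    2    0.8%

Most frequent character per category

Other Letter
Value    Count    Frequency (%)
(not shown)    34    13.9%
(not shown)    32    13.1%
(not shown)    19    7.8%
(not shown)    13    5.3%
(not shown)    10    4.1%
(not shown)    8    3.3%
(not shown)    8    3.3%
(not shown)    6    2.5%
(not shown)    6    2.5%
(not shown)    4    1.6%
Other values (69)    104    42.6%
Space Separator
Value    Count    Frequency (%)
(space)    2    100.0%

Most occurring scripts

Value    Count    Frequency (%)
Hangul    244    99.2%
Common    2    0.8%

Most frequent character per script

Hangul
Value    Count    Frequency (%)
(not shown)    34    13.9%
(not shown)    32    13.1%
(not shown)    19    7.8%
(not shown)    13    5.3%
(not shown)    10    4.1%
(not shown)    8    3.3%
(not shown)    8    3.3%
(not shown)    6    2.5%
(not shown)    6    2.5%
(not shown)    4    1.6%
Other values (69)    104    42.6%
Common
Value    Count    Frequency (%)
(space)    2    100.0%

Most occurring blocks

Value    Count    Frequency (%)
Hangul    244    99.2%
ASCII    2    0.8%

Most frequent character per block

Hangul
Value    Count    Frequency (%)
(not shown)    34    13.9%
(not shown)    32    13.1%
(not shown)    19    7.8%
(not shown)    13    5.3%
(not shown)    10    4.1%
(not shown)    8    3.3%
(not shown)    8    3.3%
(not shown)    6    2.5%
(not shown)    6    2.5%
(not shown)    4    1.6%
Other values (69)    104    42.6%
ASCII
Value    Count    Frequency (%)
(space)    2    100.0%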

의료기관종별
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): 11.8%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Value    Count
의원    28
일반요양병원    4
요양병원(정신병원)    1
병원    1

Length

Max length: 10
Median length: 2
Mean length: 2.7058824
Min length: 2

Unique

Unique: 2
Unique (%): 5.9%
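
The difference between the Distinct and Unique figures can be illustrated with a small sketch. It assumes, consistent with the counts reported for this variable, that Distinct counts different values while Unique counts values that occur exactly once; the column is rebuilt from the reported category counts rather than read from the source file.

```python
from collections import Counter

# Column rebuilt from the reported category counts (28 + 4 + 1 + 1 = 34 rows).
column = ["의원"] * 28 + ["일반요양병원"] * 4 + ["요양병원(정신병원)"] + ["병원"]

counts = Counter(column)
distinct = len(counts)                               # number of different values
unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once

print(distinct, f"{distinct / len(column):.1%}")
print(unique, f"{unique / len(column):.1%}")
```

Under this reading the sketch yields 4 distinct values (11.8%) and 2 unique values (5.9%), matching the figures above.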

Sample

1st row: 의원
2nd row: 의원
3rd row: 의원
4th row: 의원
5th row: 의원

Common Values

Value    Count    Frequency (%)
의원    28    82.4%
일반요양병원    4    11.8%
요양병원(정신병원)    1    2.9%
병원    1    2.9%
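
The 55.3% imbalance figure in the alerts is consistent with one minus the normalized Shannon entropy of these counts. The sketch below recomputes it under that assumption; the function name `imbalance_score` is illustrative and not taken from the library.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class counts."""
    if len(counts) < 2:
        return 0.0  # a single class has no balance to measure
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    max_entropy = math.log2(len(counts))  # entropy of a perfectly balanced column
    return 1 - entropy / max_entropy

# Counts for 의원, 일반요양병원, 요양병원(정신병원), 병원.
score = imbalance_score([28, 4, 1, 1])
print(f"{score:.1%}")
```

A score of 0 would mean four equally frequent categories, and a score near 1 would mean almost every row falls in a single category.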

Length

Histogram of lengths of the category

Common Values (Plot)

Value    Count    Frequency (%)
의원    28    82.4%
일반요양병원    4    11.8%
요양병원(정신병원)    1    2.9%
병원    1    2.9%

우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 16
Distinct (%): 47.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 33641.5
Minimum: 33603
Maximum: 33673
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 438.0 B

Quantile statistics

Minimum: 33603
5-th percentile: 33608
Q1: 33636.75
Median: 33642
Q3: 33647
95-th percentile: 33671.35
Maximum: 33673
Range: 70
Interquartile range (IQR): 10.25

Descriptive statistics

Standard deviation: 18.035739
Coefficient of variation (CV): 0.00053611578
Kurtosis: 0.23399992
Mean: 33641.5
Median Absolute Deviation (MAD): 5
Skewness: -0.30038504
Sum: 1143811
Variance: 325.28788
Monotonicity: Not monotonic
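
Because the frequency tables reported for this variable list every distinct postal code with its count, several of these summary statistics can be cross-checked without the source file. The sketch below rebuilds the 34 values from those counts and recomputes them with the standard library; the `quantile` helper is a hypothetical name and assumes linear interpolation between order statistics, and the variance is assumed to be the sample variance (n - 1 denominator).

```python
import statistics

# Postal code -> count, taken from the frequency tables for 우편번호.
counts = {
    33603: 1, 33608: 2, 33610: 1, 33615: 1, 33624: 1, 33630: 1,
    33635: 2, 33642: 10, 33643: 3, 33647: 5, 33649: 1, 33654: 1,
    33670: 2, 33671: 1, 33672: 1, 33673: 1,
}
values = sorted(v for v, n in counts.items() for _ in range(n))

def quantile(sorted_values, q):
    """Quantile with linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])

mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = variance ** 0.5
q1, q3 = quantile(values, 0.25), quantile(values, 0.75)
mad = statistics.median(abs(v - median) for v in values)

print(len(values), sum(values), mean, median)
print(round(variance, 5), round(std, 6), round(std / mean, 11))
print(q1, q3, q3 - q1, mad)
```

Under these assumptions the recomputed sum, mean, median, quartiles, IQR, variance, and MAD agree with the values listed above. Skewness and kurtosis are left out because their exact estimator is not stated in the report.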
Histogram with fixed size bins (bins=16)

Common values

Value    Count    Frequency (%)
33642    10    29.4%
33647    5    14.7%
33643    3    8.8%
33608    2    5.9%
33635    2    5.9%
33670    2    5.9%
33603    1    2.9%
33672    1    2.9%
33673    1    2.9%
33671    1    2.9%
Other values (6)    6    17.6%

Minimum 10 values

Value    Count    Frequency (%)
33603    1    2.9%
33608    2    5.9%
33610    1    2.9%
33615    1    2.9%
33624    1    2.9%
33630    1    2.9%
33635    2    5.9%
33642    10    29.4%
33643    3    8.8%
33647    5    14.7%

Maximum 10 values

Value    Count    Frequency (%)
33673    1    2.9%
33672    1    2.9%
33671    1    2.9%
33670    2    5.9%
33654    1    2.9%
33649    1    2.9%
33647    5    14.7%
33643    3    8.8%
33642    10    29.4%
33635    2    5.9%

의료기관주소(도로명)
Text

Distinct: 31
Distinct (%): 91.2%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 29
Median length: 25
Mean length: 22.058824
Min length: 18

Characters and Unicode

Total characters: 750
Distinct characters: 60
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 29
Unique (%): 85.3%

Sample

1st row: 충청남도 서천군 서천읍 충절로59번길 11-6
2nd row: 충청남도 서천군 서천읍 충절로49번길 11-5
3rd row: 충청남도 서천군 서천읍 충절로41번길 2
4th row: 충청남도 서천군 서천읍 군청로 7
5th row: 충청남도 서천군 서천읍 충절로 50, 3층

Value    Count    Frequency (%)
충청남도    34    19.3%
서천군    34    19.3%
서천읍    21    11.9%
충절로    7    4.0%
50    5    2.8%
장항읍    5    2.8%
서천로    4    2.3%
2층    4    2.3%
96    3    1.7%
충절로59번길    3    1.7%
Other values (49)    56    31.8%

Most occurring characters

Value    Count    Frequency (%)
(space)    142    18.9%
(not shown)    65    8.7%
(not shown)    61    8.1%
(not shown)    49    6.5%
(not shown)    35    4.7%
(not shown)    35    4.7%
(not shown)    35    4.7%
(not shown)    34    4.5%
(not shown)    30    4.0%
(not shown)    26    3.5%
Other values (50)    238    31.7%

Most occurring categories

Value    Count    Frequency (%)
Other Letter    480    64.0%
Space Separator    142    18.9%
Decimal Number    111    14.8%
Dash Punctuation    11    1.5%
Other Punctuation    6    0.8%

Most frequent character per category

Other Letter
Value    Count    Frequency (%)
(not shown)    65    13.5%
(not shown)    61    12.7%
(not shown)    49    10.2%
(not shown)    35    7.3%
(not shown)    35    7.3%
(not shown)    35    7.3%
(not shown)    34    7.1%
(not shown)    30    6.2%
(not shown)    26    5.4%
(not shown)    14    2.9%
Other values (37)    96    20.0%
Decimal Number
Value    Count    Frequency (%)
1    22    19.8%
2    17    15.3%
5    14    12.6%
9    11    9.9%
0    9    8.1%
4    9    8.1%
3    8    7.2%
7    8    7.2%
6    7    6.3%
8    6    5.4%
Space Separator
Value    Count    Frequency (%)
(space)    142    100.0%
Dash Punctuation
Value    Count    Frequency (%)
-    11    100.0%
Other Punctuation
Value    Count    Frequency (%)
,    6    100.0%

Most occurring scripts

Value    Count    Frequency (%)
Hangul    480    64.0%
Common    270    36.0%

Most frequent character per script

Hangul
Value    Count    Frequency (%)
(not shown)    65    13.5%
(not shown)    61    12.7%
(not shown)    49    10.2%
(not shown)    35    7.3%
(not shown)    35    7.3%
(not shown)    35    7.3%
(not shown)    34    7.1%
(not shown)    30    6.2%
(not shown)    26    5.4%
(not shown)    14    2.9%
Other values (37)    96    20.0%
Common
Value    Count    Frequency (%)
(space)    142    52.6%
1    22    8.1%
2    17    6.3%
5    14    5.2%
-    11    4.1%
9    11    4.1%
0    9    3.3%
4    9    3.3%
3    8    3.0%
7    8    3.0%
Other values (3)    19    7.0%

Most occurring blocks

Value    Count    Frequency (%)
Hangul    480    64.0%
ASCII    270    36.0%

Most frequent character per block

ASCII
Value    Count    Frequency (%)
(space)    142    52.6%
1    22    8.1%
2    17    6.3%
5    14    5.2%
-    11    4.1%
9    11    4.1%
0    9    3.3%
4    9    3.3%
3    8    3.0%
7    8    3.0%
Other values (3)    19    7.0%
Hangul
Value    Count    Frequency (%)
(not shown)    65    13.5%
(not shown)    61    12.7%
(not shown)    49    10.2%
(not shown)    35    7.3%
(not shown)    35    7.3%
(not shown)    35    7.3%
(not shown)    34    7.1%
(not shown)    30    6.2%
(not shown)    26    5.4%
(not shown)    14    2.9%
Other values (37)    96    20.0%

의료기관전화번호
Text

UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 408
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 34
Unique (%): 100.0%

Sample

1st row: 041-952-9990
2nd row: 041-952-9582
3rd row: 041-953-8766
4th row: 041-951-0114
5th row: 041-951-0606

Value    Count    Frequency (%)
041-952-9990    1    2.9%
041-952-9582    1    2.9%
041-951-8114    1    2.9%
041-953-8376    1    2.9%
041-950-1001    1    2.9%
041-950-5200    1    2.9%
041-952-7147    1    2.9%
041-951-0002    1    2.9%
041-956-5747    1    2.9%
041-951-7887    1    2.9%
Other values (24)    24    70.6%

Most occurring characters

Value    Count    Frequency (%)
-    68    16.7%
1    61    15.0%
0    57    14.0%
5    50    12.3%
9    45    11.0%
4    39    9.6%
2    22    5.4%
7    17    4.2%
8    17    4.2%
3    17    4.2%

Most occurring categories

Value    Count    Frequency (%)
Decimal Number    340    83.3%
Dash Punctuation    68    16.7%

Most frequent character per category

Decimal Number
Value    Count    Frequency (%)
1    61    17.9%
0    57    16.8%
5    50    14.7%
9    45    13.2%
4    39    11.5%
2    22    6.5%
7    17    5.0%
8    17    5.0%
3    17    5.0%
6    15    4.4%
Dash Punctuation
Value    Count    Frequency (%)
-    68    100.0%

Most occurring scripts

Value    Count    Frequency (%)
Common    408    100.0%

Most frequent character per script

Common
Value    Count    Frequency (%)
-    68    16.7%
1    61    15.0%
0    57    14.0%
5    50    12.3%
9    45    11.0%
4    39    9.6%
2    22    5.4%
7    17    4.2%
8    17    4.2%
3    17    4.2%

Most occurring blocks

Value    Count    Frequency (%)
ASCII    408    100.0%

Most frequent character per block

ASCII
Value    Count    Frequency (%)
-    68    16.7%
1    61    15.0%
0    57    14.0%
5    50    12.3%
9    45    11.0%
4    39    9.6%
2    22    5.4%
7    17    4.2%
8    17    4.2%
3    17    4.2%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B
Minimum: 2023-06-30 00:00:00
Maximum: 2023-06-30 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

Variable    개설일자    개설자명    의료기관명    의료기관종별    우편번호    의료기관주소(도로명)    의료기관전화번호
개설일자    1.000    1.000    1.000    1.000    1.000    1.000    1.000
개설자명    1.000    1.000    1.000    1.000    1.000    1.000    1.000
의료기관명    1.000    1.000    1.000    1.000    1.000    1.000    1.000
의료기관종별    1.000    1.000    1.000    1.000    0.866    1.000    1.000
우편번호    1.000    1.000    1.000    0.866    1.000    1.000    1.000
의료기관주소(도로명)    1.000    1.000    1.000    1.000    1.000    1.000    1.000
의료기관전화번호    1.000    1.000    1.000    1.000    1.000    1.000    1.000

Variable    우편번호    의료기관종별
우편번호    1.000    0.691
의료기관종별    0.691    1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index    개설일자    개설자명    의료기관명    의료기관종별    우편번호    의료기관주소(도로명)    의료기관전화번호    데이터기준일자
0    2014-11-26    이학수    서천이안과의원    의원    33642    충청남도 서천군 서천읍 충절로59번길 11-6    041-952-9990    2023-06-30
1    2014-10-10    조민수    서천항장외과의원    의원    33642    충청남도 서천군 서천읍 충절로49번길 11-5    041-952-9582    2023-06-30
2    2014-08-05    김재환    청담플러스의원    의원    33642    충청남도 서천군 서천읍 충절로41번길 2    041-953-8766    2023-06-30
3    2011-09-06    김홍섭    늘봄정형외과의원    의원    33642    충청남도 서천군 서천읍 군청로 7    041-951-0114    2023-06-30
4    2010-09-02    김바울    김바울내과의원    의원    33647    충청남도 서천군 서천읍 충절로 50, 3층    041-951-0606    2023-06-30
5    2009-04-24    김용우    서천 김안과의원    의원    33647    충청남도 서천군 서천읍 충절로 50    041-951-1511    2023-06-30
6    2009-03-26    이영수    우리들의원    의원    33642    충청남도 서천군 서천읍 충절로41번길 1    041-952-3333    2023-06-30
7    2008-10-08    이우찬    장항삼성의원    의원    33670    충청남도 서천군 장항읍 장항로 129-1    041-956-7580    2023-06-30
8    2008-06-30    이종찬    위앤장서천내과의원    의원    33643    충청남도 서천군 서천읍 서천로 96, 2층    041-953-0111    2023-06-30
9    2008-06-03    양조환    서울의원    의원    33647    충청남도 서천군 서천읍 충절로 50    041-951-7887    2023-06-30

Last rows

Index    개설일자    개설자명    의료기관명    의료기관종별    우편번호    의료기관주소(도로명)    의료기관전화번호    데이터기준일자
24    1993-03-19    김성호    우리의원    의원    33608    충청남도 서천군 비인면 비인로 207-1    041-952-1790    2023-06-30
25    1992-11-03    한상배    한일의원    의원    33670    충청남도 서천군 장항읍 신창동로 39    041-956-8221    2023-06-30
26    1992-04-14    장봉열    장봉열내과의원    의원    33671    충청남도 서천군 장항읍 장서로29번길 3    041-956-5747    2023-06-30
27    1992-02-21    이규현    한산의원    의원    33624    충청남도 서천군 한산면 한산모시길 26-1    041-951-0002    2023-06-30
28    2017-12-12    장명훈    동서천요양병원    일반요양병원    33630    충청남도 서천군 화양면 활산로 245, 동서천요양병원    041-952-7147    2023-06-30
29    2012-04-25    손장신    서천한국요양병원    일반요양병원    33654    충청남도 서천군 마서면 어리길 205-7    041-950-5200    2023-06-30
30    2008-10-23    서천군수    서천군립노인요양병원    일반요양병원    33610    충청남도 서천군 종천면 충서로302번길 88-12    041-950-1001    2023-06-30
31    2007-01-29    박정자    서천요양병원    일반요양병원    33649    충청남도 서천군 서천읍 삼산북길 56    041-953-8376    2023-06-30
32    2006-09-08    김진오    서천사랑병원    요양병원(정신병원)    33615    충청남도 서천군 판교면 대백제로 2078    041-951-8114    2023-06-30
33    1988-06-03    김형주    의료법인 서해병원    병원    33635    충청남도 서천군 서천읍 서천로 184    041-951-8282    2023-06-30