Overview

Dataset statistics

Number of variables8
Number of observations34
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory68.9 B

Variable types

DateTime1
Text4
Categorical2
Numeric1

Alerts

데이터기준일자 has constant value ""Constant
우편번호 is highly overall correlated with 의료기관종별High correlation
의료기관종별 is highly overall correlated with 우편번호High correlation
의료기관종별 is highly imbalanced (55.3%)Imbalance
개설일자 has unique valuesUnique
개설자명 has unique valuesUnique
의료기관명 has unique valuesUnique
의료기관전화번호 has unique valuesUnique

Reproduction

Analysis started2024-01-09 22:13:56.183831
Analysis finished2024-01-09 22:13:56.705150
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개설일자
Date

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size404.0 B
Minimum1988-06-03 00:00:00
Maximum2017-12-12 00:00:00
2024-01-10T07:13:56.753647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:13:56.842771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)

개설자명
Text

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size404.0 B
2024-01-10T07:13:56.997612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3.0294118
Min length3

Characters and Unicode

Total characters103
Distinct characters61
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row이학수
2nd row조민수
3rd row김재환
4th row김홍섭
5th row김바울
ValueCountFrequency (%)
이학수 1
 
2.9%
조민수 1
 
2.9%
김진오 1
 
2.9%
박정자 1
 
2.9%
서천군수 1
 
2.9%
손장신 1
 
2.9%
장명훈 1
 
2.9%
이규현 1
 
2.9%
장봉열 1
 
2.9%
양조환 1
 
2.9%
Other values (24) 24
70.6%
2024-01-10T07:13:57.284464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
9.7%
6
 
5.8%
5
 
4.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
2
 
1.9%
2
 
1.9%
Other values (51) 63
61.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 103
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
9.7%
6
 
5.8%
5
 
4.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
2
 
1.9%
2
 
1.9%
Other values (51) 63
61.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 103
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
9.7%
6
 
5.8%
5
 
4.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
2
 
1.9%
2
 
1.9%
Other values (51) 63
61.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 103
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10
 
9.7%
6
 
5.8%
5
 
4.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
2
 
1.9%
2
 
1.9%
Other values (51) 63
61.2%

의료기관명
Text

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size404.0 B
2024-01-10T07:13:57.464928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length7.2352941
Min length4

Characters and Unicode

Total characters246
Distinct characters80
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row서천이안과의원
2nd row서천항장외과의원
3rd row청담플러스의원
4th row늘봄정형외과의원
5th row김바울내과의원
ValueCountFrequency (%)
서천이안과의원 1
 
2.8%
서울의원 1
 
2.8%
서천사랑병원 1
 
2.8%
서천요양병원 1
 
2.8%
서천군립노인요양병원 1
 
2.8%
서천한국요양병원 1
 
2.8%
동서천요양병원 1
 
2.8%
한산의원 1
 
2.8%
서천항장외과의원 1
 
2.8%
장봉열내과의원 1
 
2.8%
Other values (26) 26
72.2%
2024-01-10T07:13:57.749264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
13.8%
32
 
13.0%
19
 
7.7%
13
 
5.3%
10
 
4.1%
8
 
3.3%
8
 
3.3%
6
 
2.4%
6
 
2.4%
4
 
1.6%
Other values (70) 106
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 244
99.2%
Space Separator 2
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
13.9%
32
 
13.1%
19
 
7.8%
13
 
5.3%
10
 
4.1%
8
 
3.3%
8
 
3.3%
6
 
2.5%
6
 
2.5%
4
 
1.6%
Other values (69) 104
42.6%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 244
99.2%
Common 2
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
13.9%
32
 
13.1%
19
 
7.8%
13
 
5.3%
10
 
4.1%
8
 
3.3%
8
 
3.3%
6
 
2.5%
6
 
2.5%
4
 
1.6%
Other values (69) 104
42.6%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 244
99.2%
ASCII 2
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
34
 
13.9%
32
 
13.1%
19
 
7.8%
13
 
5.3%
10
 
4.1%
8
 
3.3%
8
 
3.3%
6
 
2.5%
6
 
2.5%
4
 
1.6%
Other values (69) 104
42.6%
ASCII
ValueCountFrequency (%)
2
100.0%

의료기관종별
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)11.8%
Missing0
Missing (%)0.0%
Memory size404.0 B
의원
28 
일반요양병원
요양병원(정신병원)
 
1
병원
 
1

Length

Max length10
Median length2
Mean length2.7058824
Min length2

Unique

Unique2 ?
Unique (%)5.9%

Sample

1st row의원
2nd row의원
3rd row의원
4th row의원
5th row의원

Common Values

ValueCountFrequency (%)
의원 28
82.4%
일반요양병원 4
 
11.8%
요양병원(정신병원) 1
 
2.9%
병원 1
 
2.9%

Length

2024-01-10T07:13:57.861566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:13:57.944997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
의원 28
82.4%
일반요양병원 4
 
11.8%
요양병원(정신병원 1
 
2.9%
병원 1
 
2.9%

우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct16
Distinct (%)47.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33641.5
Minimum33603
Maximum33673
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2024-01-10T07:13:58.021544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33603
5-th percentile33608
Q133636.75
median33642
Q333647
95-th percentile33671.35
Maximum33673
Range70
Interquartile range (IQR)10.25

Descriptive statistics

Standard deviation18.035739
Coefficient of variation (CV)0.00053611578
Kurtosis0.23399992
Mean33641.5
Median Absolute Deviation (MAD)5
Skewness-0.30038504
Sum1143811
Variance325.28788
MonotonicityNot monotonic
2024-01-10T07:13:58.107238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
33642 10
29.4%
33647 5
14.7%
33643 3
 
8.8%
33608 2
 
5.9%
33635 2
 
5.9%
33670 2
 
5.9%
33603 1
 
2.9%
33672 1
 
2.9%
33673 1
 
2.9%
33671 1
 
2.9%
Other values (6) 6
17.6%
ValueCountFrequency (%)
33603 1
 
2.9%
33608 2
 
5.9%
33610 1
 
2.9%
33615 1
 
2.9%
33624 1
 
2.9%
33630 1
 
2.9%
33635 2
 
5.9%
33642 10
29.4%
33643 3
 
8.8%
33647 5
14.7%
ValueCountFrequency (%)
33673 1
 
2.9%
33672 1
 
2.9%
33671 1
 
2.9%
33670 2
 
5.9%
33654 1
 
2.9%
33649 1
 
2.9%
33647 5
14.7%
33643 3
 
8.8%
33642 10
29.4%
33635 2
 
5.9%
Distinct31
Distinct (%)91.2%
Missing0
Missing (%)0.0%
Memory size404.0 B
2024-01-10T07:13:58.279966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length25
Mean length22.058824
Min length18

Characters and Unicode

Total characters750
Distinct characters60
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)85.3%

Sample

1st row충청남도 서천군 서천읍 충절로59번길 11-6
2nd row충청남도 서천군 서천읍 충절로49번길 11-5
3rd row충청남도 서천군 서천읍 충절로41번길 2
4th row충청남도 서천군 서천읍 군청로 7
5th row충청남도 서천군 서천읍 충절로 50, 3층
ValueCountFrequency (%)
충청남도 34
19.3%
서천군 34
19.3%
서천읍 21
 
11.9%
충절로 7
 
4.0%
50 5
 
2.8%
장항읍 5
 
2.8%
서천로 4
 
2.3%
2층 4
 
2.3%
96 3
 
1.7%
충절로59번길 3
 
1.7%
Other values (49) 56
31.8%
2024-01-10T07:13:58.552785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
142
18.9%
65
 
8.7%
61
 
8.1%
49
 
6.5%
35
 
4.7%
35
 
4.7%
35
 
4.7%
34
 
4.5%
30
 
4.0%
26
 
3.5%
Other values (50) 238
31.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 480
64.0%
Space Separator 142
 
18.9%
Decimal Number 111
 
14.8%
Dash Punctuation 11
 
1.5%
Other Punctuation 6
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
65
13.5%
61
12.7%
49
10.2%
35
 
7.3%
35
 
7.3%
35
 
7.3%
34
 
7.1%
30
 
6.2%
26
 
5.4%
14
 
2.9%
Other values (37) 96
20.0%
Decimal Number
ValueCountFrequency (%)
1 22
19.8%
2 17
15.3%
5 14
12.6%
9 11
9.9%
0 9
8.1%
4 9
8.1%
3 8
 
7.2%
7 8
 
7.2%
6 7
 
6.3%
8 6
 
5.4%
Space Separator
ValueCountFrequency (%)
142
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Other Punctuation
ValueCountFrequency (%)
, 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 480
64.0%
Common 270
36.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
65
13.5%
61
12.7%
49
10.2%
35
 
7.3%
35
 
7.3%
35
 
7.3%
34
 
7.1%
30
 
6.2%
26
 
5.4%
14
 
2.9%
Other values (37) 96
20.0%
Common
ValueCountFrequency (%)
142
52.6%
1 22
 
8.1%
2 17
 
6.3%
5 14
 
5.2%
- 11
 
4.1%
9 11
 
4.1%
0 9
 
3.3%
4 9
 
3.3%
3 8
 
3.0%
7 8
 
3.0%
Other values (3) 19
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 480
64.0%
ASCII 270
36.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
142
52.6%
1 22
 
8.1%
2 17
 
6.3%
5 14
 
5.2%
- 11
 
4.1%
9 11
 
4.1%
0 9
 
3.3%
4 9
 
3.3%
3 8
 
3.0%
7 8
 
3.0%
Other values (3) 19
 
7.0%
Hangul
ValueCountFrequency (%)
65
13.5%
61
12.7%
49
10.2%
35
 
7.3%
35
 
7.3%
35
 
7.3%
34
 
7.1%
30
 
6.2%
26
 
5.4%
14
 
2.9%
Other values (37) 96
20.0%
Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size404.0 B
2024-01-10T07:13:58.720150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters408
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row041-952-9990
2nd row041-952-9582
3rd row041-953-8766
4th row041-951-0114
5th row041-951-0606
ValueCountFrequency (%)
041-952-9990 1
 
2.9%
041-952-9582 1
 
2.9%
041-951-8114 1
 
2.9%
041-953-8376 1
 
2.9%
041-950-1001 1
 
2.9%
041-950-5200 1
 
2.9%
041-952-7147 1
 
2.9%
041-951-0002 1
 
2.9%
041-956-5747 1
 
2.9%
041-951-7887 1
 
2.9%
Other values (24) 24
70.6%
2024-01-10T07:13:58.976979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 68
16.7%
1 61
15.0%
0 57
14.0%
5 50
12.3%
9 45
11.0%
4 39
9.6%
2 22
 
5.4%
7 17
 
4.2%
8 17
 
4.2%
3 17
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 340
83.3%
Dash Punctuation 68
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 61
17.9%
0 57
16.8%
5 50
14.7%
9 45
13.2%
4 39
11.5%
2 22
 
6.5%
7 17
 
5.0%
8 17
 
5.0%
3 17
 
5.0%
6 15
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 68
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 408
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 68
16.7%
1 61
15.0%
0 57
14.0%
5 50
12.3%
9 45
11.0%
4 39
9.6%
2 22
 
5.4%
7 17
 
4.2%
8 17
 
4.2%
3 17
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 408
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 68
16.7%
1 61
15.0%
0 57
14.0%
5 50
12.3%
9 45
11.0%
4 39
9.6%
2 22
 
5.4%
7 17
 
4.2%
8 17
 
4.2%
3 17
 
4.2%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
2023-06-30
34 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06-30
2nd row2023-06-30
3rd row2023-06-30
4th row2023-06-30
5th row2023-06-30

Common Values

ValueCountFrequency (%)
2023-06-30 34
100.0%

Length

2024-01-10T07:13:59.092764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:13:59.164342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-06-30 34
100.0%

Interactions

2024-01-10T07:13:56.480738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:13:59.429861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
개설일자개설자명의료기관명의료기관종별우편번호의료기관주소(도로명)의료기관전화번호
개설일자1.0001.0001.0001.0001.0001.0001.000
개설자명1.0001.0001.0001.0001.0001.0001.000
의료기관명1.0001.0001.0001.0001.0001.0001.000
의료기관종별1.0001.0001.0001.0000.8661.0001.000
우편번호1.0001.0001.0000.8661.0001.0001.000
의료기관주소(도로명)1.0001.0001.0001.0001.0001.0001.000
의료기관전화번호1.0001.0001.0001.0001.0001.0001.000
2024-01-10T07:13:59.511431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
우편번호의료기관종별
우편번호1.0000.691
의료기관종별0.6911.000

Missing values

2024-01-10T07:13:56.574517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:13:56.666964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

개설일자개설자명의료기관명의료기관종별우편번호의료기관주소(도로명)의료기관전화번호데이터기준일자
02014-11-26이학수서천이안과의원의원33642충청남도 서천군 서천읍 충절로59번길 11-6041-952-99902023-06-30
12014-10-10조민수서천항장외과의원의원33642충청남도 서천군 서천읍 충절로49번길 11-5041-952-95822023-06-30
22014-08-05김재환청담플러스의원의원33642충청남도 서천군 서천읍 충절로41번길 2041-953-87662023-06-30
32011-09-06김홍섭늘봄정형외과의원의원33642충청남도 서천군 서천읍 군청로 7041-951-01142023-06-30
42010-09-02김바울김바울내과의원의원33647충청남도 서천군 서천읍 충절로 50, 3층041-951-06062023-06-30
52009-04-24김용우서천 김안과의원의원33647충청남도 서천군 서천읍 충절로 50041-951-15112023-06-30
62009-03-26이영수우리들의원의원33642충청남도 서천군 서천읍 충절로41번길 1041-952-33332023-06-30
72008-10-08이우찬장항삼성의원의원33670충청남도 서천군 장항읍 장항로 129-1041-956-75802023-06-30
82008-06-30이종찬위앤장서천내과의원의원33643충청남도 서천군 서천읍 서천로 96, 2층041-953-01112023-06-30
92008-06-03양조환서울의원의원33647충청남도 서천군 서천읍 충절로 50041-951-78872023-06-30
개설일자개설자명의료기관명의료기관종별우편번호의료기관주소(도로명)의료기관전화번호데이터기준일자
241993-03-19김성호우리의원의원33608충청남도 서천군 비인면 비인로 207-1041-952-17902023-06-30
251992-11-03한상배한일의원의원33670충청남도 서천군 장항읍 신창동로 39041-956-82212023-06-30
261992-04-14장봉열장봉열내과의원의원33671충청남도 서천군 장항읍 장서로29번길 3041-956-57472023-06-30
271992-02-21이규현한산의원의원33624충청남도 서천군 한산면 한산모시길 26-1041-951-00022023-06-30
282017-12-12장명훈동서천요양병원일반요양병원33630충청남도 서천군 화양면 활산로 245, 동서천요양병원041-952-71472023-06-30
292012-04-25손장신서천한국요양병원일반요양병원33654충청남도 서천군 마서면 어리길 205-7041-950-52002023-06-30
302008-10-23서천군수서천군립노인요양병원일반요양병원33610충청남도 서천군 종천면 충서로302번길 88-12041-950-10012023-06-30
312007-01-29박정자서천요양병원일반요양병원33649충청남도 서천군 서천읍 삼산북길 56041-953-83762023-06-30
322006-09-08김진오서천사랑병원요양병원(정신병원)33615충청남도 서천군 판교면 대백제로 2078041-951-81142023-06-30
331988-06-03김형주의료법인 서해병원병원33635충청남도 서천군 서천읍 서천로 184041-951-82822023-06-30