Overview

Dataset statistics

Number of variables: 8
Number of observations: 35
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.4 KiB
Average record size in memory: 68.8 B
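
The total and per-record memory figures can be checked against each other with a short calculation. This is only a sanity-check sketch using the numbers printed above; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics above.
KIB = 1024
n_observations = 35
avg_record_bytes = 68.8

# Total size implied by the average record size.
total_bytes = n_observations * avg_record_bytes
total_kib = total_bytes / KIB

print(f"{total_bytes:.0f} B = {total_kib:.1f} KiB")
```

The result rounds to the 2.4 KiB shown for the total size in memory.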

Variable types

DateTime: 2
Text: 4
Categorical: 1
Numeric: 1

Alerts

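데이터기준일자 (data reference date) has constant value "" | Constant
우편번호 (postal code) is highly overall correlated with 의료기관종별 (institution type) | High correlation
의료기관종별 (institution type) is highly overall correlated with 우편번호 (postal code) | High correlation
의료기관종별 (institution type) is highly imbalanced (56.2%) | Imbalance
개설일자 (opening date) has unique values | Unique
개설자명 (founder name) has unique values | Unique
의료기관명 (institution name) has unique values | Unique
의료기관전화번호 (institution phone number) has unique values | Unique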
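
The 56.2% imbalance figure can be reproduced from the category counts reported for 의료기관종별 (29, 4, 1, 1). The sketch below assumes the score is one minus the Shannon entropy normalised by the maximum entropy for the number of classes. That formula reproduces the reported value, but the function name and structure here are illustrative rather than the profiler's own code.

```python
import math

# Category counts for 의료기관종별, taken from the report's Common Values table.
counts = [29, 4, 1, 1]


def imbalance_score(counts):
    """Return 1 - normalised Shannon entropy.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    A single-class input is treated as 0 (assumption).
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)


print(f"{imbalance_score(counts):.1%}")  # 56.2%
```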

Reproduction

Analysis started: 2024-01-09 22:14:00.071950
Analysis finished: 2024-01-09 22:14:00.617490
Duration: 0.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
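
The reported duration follows directly from the two timestamps. A minimal check, with the timestamp format string as the only assumption:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-01-09 22:14:00.071950", FMT)
finished = datetime.strptime("2024-01-09 22:14:00.617490", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 0.55 seconds
```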

Variables

개설일자 (opening date)
Date

UNIQUE

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B
Minimum: 1988-06-03 00:00:00
Maximum: 2017-12-12 00:00:00
Histogram with fixed size bins (bins=35)

개설자명 (founder name)
Text

UNIQUE

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 4
Median length: 3
Mean length: 3.0285714
Min length: 3

Characters and Unicode

Total characters: 106
Distinct characters: 62
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 35
Unique (%): 100.0%

Sample

1st row: 이학수
2nd row: 조민수
3rd row: 김재환
4th row: 김홍섭
5th row: 김바울
Value | Count | Frequency (%)
이학수 | 1 | 2.9%
한광희 | 1 | 2.9%
서기원 | 1 | 2.9%
구도욱 | 1 | 2.9%
안대식 | 1 | 2.9%
김신호 | 1 | 2.9%
정기영 | 1 | 2.9%
김성호 | 1 | 2.9%
공경석 | 1 | 2.9%
한상배 | 1 | 2.9%
Other values (25) | 25 | 71.4%

Most occurring characters

ValueCountFrequency (%)
12
 
11.3%
6
 
5.7%
6
 
5.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (52) 65
61.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 106
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
11.3%
6
 
5.7%
6
 
5.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (52) 65
61.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 106
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
11.3%
6
 
5.7%
6
 
5.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (52) 65
61.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 106
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12
 
11.3%
6
 
5.7%
6
 
5.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (52) 65
61.3%

의료기관명 (institution name)
Text

UNIQUE

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 11
Median length: 9
Mean length: 7.1142857
Min length: 4

Characters and Unicode

Total characters: 249
Distinct characters: 82
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 35
Unique (%): 100.0%

Sample

1st row: 서천이안과의원
2nd row: 서천항장외과의원
3rd row: 청담플러스의원
4th row: 늘봄정형외과의원
5th row: 김바울내과의원
Value | Count | Frequency (%)
서천이안과의원 | 1 | 2.7%
한일의원 | 1 | 2.7%
연세정형외과의원 | 1 | 2.7%
해성정신건강의학과의원 | 1 | 2.7%
서울정형외과의원 | 1 | 2.7%
미래산부인과의원 | 1 | 2.7%
정소아청소년과의원 | 1 | 2.7%
우리의원 | 1 | 2.7%
제일의원 | 1 | 2.7%
공정형외과의원 | 1 | 2.7%
Other values (27) | 27 | 73.0%
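
The frequencies in this table are 2.7% rather than the 2.9% (1/35) seen for other unique text columns. One reading that fits the numbers is that the table counts whitespace-separated tokens rather than whole values: the variable has 2 space characters, so 35 names would yield 37 tokens. The sketch below checks that arithmetic; the tokenisation interpretation is an assumption, not something the report states.

```python
# Figures taken from the report; the whitespace-token reading is an assumption.
n_rows = 35      # one institution name per observation
n_spaces = 2     # "Space Separator" count reported for this variable
n_tokens = n_rows + n_spaces  # each single inner space splits off one extra token


def pct(count: int, total: int) -> str:
    """Format count/total as a percentage with one decimal place."""
    return f"{100 * count / total:.1f}%"


print(n_tokens)           # 37
print(pct(1, n_tokens))   # 2.7%
print(pct(27, n_tokens))  # 73.0%
```

The ten listed rows plus the 27 "Other values" also add up to 37, which is consistent with this reading.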

Most occurring characters

ValueCountFrequency (%)
35
 
14.1%
33
 
13.3%
19
 
7.6%
12
 
4.8%
9
 
3.6%
8
 
3.2%
8
 
3.2%
6
 
2.4%
6
 
2.4%
4
 
1.6%
Other values (72) 109
43.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 247
99.2%
Space Separator 2
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
14.2%
33
 
13.4%
19
 
7.7%
12
 
4.9%
9
 
3.6%
8
 
3.2%
8
 
3.2%
6
 
2.4%
6
 
2.4%
4
 
1.6%
Other values (71) 107
43.3%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 247
99.2%
Common 2
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
14.2%
33
 
13.4%
19
 
7.7%
12
 
4.9%
9
 
3.6%
8
 
3.2%
8
 
3.2%
6
 
2.4%
6
 
2.4%
4
 
1.6%
Other values (71) 107
43.3%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 247
99.2%
ASCII 2
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
35
 
14.2%
33
 
13.4%
19
 
7.7%
12
 
4.9%
9
 
3.6%
8
 
3.2%
8
 
3.2%
6
 
2.4%
6
 
2.4%
4
 
1.6%
Other values (71) 107
43.3%
ASCII
ValueCountFrequency (%)
2
100.0%

의료기관종별 (institution type)
Categorical

HIGH CORRELATION  IMBALANCE

Distinct: 4
Distinct (%): 11.4%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 10
Median length: 2
Mean length: 2.6857143
Min length: 2

Unique

Unique: 2
Unique (%): 5.7%

Sample

1st row: 의원
2nd row: 의원
3rd row: 의원
4th row: 의원
5th row: 의원

Common Values

Value | Count | Frequency (%)
의원 | 29 | 82.9%
일반요양병원 | 4 | 11.4%
요양병원(정신병원) | 1 | 2.9%
병원 | 1 | 2.9%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
의원 | 29 | 82.9%
일반요양병원 | 4 | 11.4%
요양병원(정신병원) | 1 | 2.9%
병원 | 1 | 2.9%

우편번호 (postal code)
Real number (ℝ)

HIGH CORRELATION

Distinct: 16
Distinct (%): 45.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 33640.743
Minimum: 33603
Maximum: 33673
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 447.0 B

Quantile statistics

Minimum: 33603
5-th percentile: 33608
Q1: 33635
median: 33642
Q3: 33647
95-th percentile: 33671.3
Maximum: 33673
Range: 70
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 18.324434
Coefficient of variation (CV): 0.00054470955
Kurtosis: 0.0082128703
Mean: 33640.743
Median Absolute Deviation (MAD): 5
Skewness: -0.24029051
Sum: 1177426
Variance: 335.78487
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=16)
Common values

Value | Count | Frequency (%)
33642 | 10 | 28.6%
33647 | 5 | 14.3%
33643 | 3 | 8.6%
33615 | 2 | 5.7%
33670 | 2 | 5.7%
33608 | 2 | 5.7%
33635 | 2 | 5.7%
33630 | 1 | 2.9%
33649 | 1 | 2.9%
33610 | 1 | 2.9%
Other values (6) | 6 | 17.1%

Minimum 10 values

Value | Count | Frequency (%)
33603 | 1 | 2.9%
33608 | 2 | 5.7%
33610 | 1 | 2.9%
33615 | 2 | 5.7%
33624 | 1 | 2.9%
33630 | 1 | 2.9%
33635 | 2 | 5.7%
33642 | 10 | 28.6%
33643 | 3 | 8.6%
33647 | 5 | 14.3%

Maximum 10 values

Value | Count | Frequency (%)
33673 | 1 | 2.9%
33672 | 1 | 2.9%
33671 | 1 | 2.9%
33670 | 2 | 5.7%
33654 | 1 | 2.9%
33649 | 1 | 2.9%
33647 | 5 | 14.3%
33643 | 3 | 8.6%
33642 | 10 | 28.6%
33635 | 2 | 5.7%
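
Taken together, the lowest-value and highest-value tables for 우편번호 cover all 16 distinct postal codes, so the quantile and descriptive statistics can be recomputed from the report alone. The sketch below rebuilds the 35 values and checks them with the standard library. The linear-interpolation quantile and the sample (n - 1) variance are assumptions, chosen because they reproduce the reported figures. The helper name `quantile` is illustrative.

```python
import statistics

# Frequency table for 우편번호, merged from the report's lowest and highest value tables.
postal_counts = {
    33603: 1, 33608: 2, 33610: 1, 33615: 2, 33624: 1, 33630: 1,
    33635: 2, 33642: 10, 33643: 3, 33647: 5, 33649: 1, 33654: 1,
    33670: 2, 33671: 1, 33672: 1, 33673: 1,
}
values = sorted(v for v, c in postal_counts.items() for _ in range(c))


def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])


mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = statistics.stdev(values)
mad = statistics.median(abs(v - median) for v in values)

print("n, sum:", len(values), sum(values))
print("mean:", round(mean, 3))
print("Q1, median, Q3:", quantile(values, 0.25), median, quantile(values, 0.75))
print("95th percentile:", round(quantile(values, 0.95), 1))
print("std, variance:", round(std, 6), round(variance, 5))
print("CV:", std / mean)
print("MAD:", mad)
```

The range (33673 - 33603 = 70) and the interquartile range (33647 - 33635 = 12) follow from the same values.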

의료기관주소(도로명) (institution address, road name)
Text

Distinct: 32
Distinct (%): 91.4%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 29
Median length: 25
Mean length: 22
Min length: 18

Characters and Unicode

Total characters: 770
Distinct characters: 60
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 30
Unique (%): 85.7%

Sample

1st row: 충청남도 서천군 서천읍 충절로59번길 11-6
2nd row: 충청남도 서천군 서천읍 충절로49번길 11-5
3rd row: 충청남도 서천군 서천읍 충절로41번길 2
4th row: 충청남도 서천군 서천읍 군청로 7
5th row: 충청남도 서천군 서천읍 충절로 50, 3층
Value | Count | Frequency (%)
충청남도 | 35 | 19.3%
서천군 | 35 | 19.3%
서천읍 | 21 | 11.6%
충절로 | 7 | 3.9%
장항읍 | 5 | 2.8%
50 | 5 | 2.8%
서천로 | 4 | 2.2%
2층 | 4 | 2.2%
96 | 3 | 1.7%
충절로59번길 | 3 | 1.7%
Other values (51) | 59 | 32.6%

Most occurring characters

ValueCountFrequency (%)
146
19.0%
66
 
8.6%
62
 
8.1%
50
 
6.5%
36
 
4.7%
36
 
4.7%
36
 
4.7%
35
 
4.5%
31
 
4.0%
26
 
3.4%
Other values (50) 246
31.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 493
64.0%
Space Separator 146
 
19.0%
Decimal Number 114
 
14.8%
Dash Punctuation 11
 
1.4%
Other Punctuation 6
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
66
13.4%
62
12.6%
50
10.1%
36
 
7.3%
36
 
7.3%
36
 
7.3%
35
 
7.1%
31
 
6.3%
26
 
5.3%
14
 
2.8%
Other values (37) 101
20.5%
Decimal Number
ValueCountFrequency (%)
1 23
20.2%
2 17
14.9%
5 14
12.3%
9 12
10.5%
0 9
 
7.9%
4 9
 
7.9%
7 8
 
7.0%
3 8
 
7.0%
8 7
 
6.1%
6 7
 
6.1%
Space Separator
ValueCountFrequency (%)
146
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Other Punctuation
ValueCountFrequency (%)
, 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 493
64.0%
Common 277
36.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
66
13.4%
62
12.6%
50
10.1%
36
 
7.3%
36
 
7.3%
36
 
7.3%
35
 
7.1%
31
 
6.3%
26
 
5.3%
14
 
2.8%
Other values (37) 101
20.5%
Common
ValueCountFrequency (%)
146
52.7%
1 23
 
8.3%
2 17
 
6.1%
5 14
 
5.1%
9 12
 
4.3%
- 11
 
4.0%
0 9
 
3.2%
4 9
 
3.2%
7 8
 
2.9%
3 8
 
2.9%
Other values (3) 20
 
7.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 493
64.0%
ASCII 277
36.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
146
52.7%
1 23
 
8.3%
2 17
 
6.1%
5 14
 
5.1%
9 12
 
4.3%
- 11
 
4.0%
0 9
 
3.2%
4 9
 
3.2%
7 8
 
2.9%
3 8
 
2.9%
Other values (3) 20
 
7.2%
Hangul
ValueCountFrequency (%)
66
13.4%
62
12.6%
50
10.1%
36
 
7.3%
36
 
7.3%
36
 
7.3%
35
 
7.1%
31
 
6.3%
26
 
5.3%
14
 
2.8%
Other values (37) 101
20.5%

의료기관전화번호 (institution phone number)
Text

UNIQUE

Distinct: 35
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 420
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 35
Unique (%): 100.0%

Sample

1st row: 041-952-9990
2nd row: 041-952-9582
3rd row: 041-953-8766
4th row: 041-951-0114
5th row: 041-951-0606
Value | Count | Frequency (%)
041-952-9990 | 1 | 2.9%
041-953-8292 | 1 | 2.9%
041-956-3836 | 1 | 2.9%
041-952-0079 | 1 | 2.9%
041-956-7351 | 1 | 2.9%
041-951-8900 | 1 | 2.9%
041-953-2676 | 1 | 2.9%
041-952-1790 | 1 | 2.9%
041-952-0788 | 1 | 2.9%
041-956-8221 | 1 | 2.9%
Other values (25) | 25 | 71.4%

Most occurring characters

ValueCountFrequency (%)
- 70
16.7%
1 63
15.0%
0 58
13.8%
5 52
12.4%
9 46
11.0%
4 40
9.5%
2 22
 
5.2%
8 19
 
4.5%
7 18
 
4.3%
3 17
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 350
83.3%
Dash Punctuation 70
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 63
18.0%
0 58
16.6%
5 52
14.9%
9 46
13.1%
4 40
11.4%
2 22
 
6.3%
8 19
 
5.4%
7 18
 
5.1%
3 17
 
4.9%
6 15
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 70
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 420
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 70
16.7%
1 63
15.0%
0 58
13.8%
5 52
12.4%
9 46
11.0%
4 40
9.5%
2 22
 
5.2%
8 19
 
4.5%
7 18
 
4.3%
3 17
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 420
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 70
16.7%
1 63
15.0%
0 58
13.8%
5 52
12.4%
9 46
11.0%
4 40
9.5%
2 22
 
5.2%
8 19
 
4.5%
7 18
 
4.3%
3 17
 
4.0%

데이터기준일자 (data reference date)
Date

CONSTANT

Distinct: 1
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 412.0 B
Minimum: 2019-05-23 00:00:00
Maximum: 2019-05-23 00:00:00
Histogram with fixed size bins (bins=1)

Interactions

[Figure: interactions plot]

Correlations

Variable | 개설일자 | 개설자명 | 의료기관명 | 의료기관종별 | 우편번호 | 의료기관주소(도로명) | 의료기관전화번호
개설일자 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
개설자명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
의료기관명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
의료기관종별 | 1.000 | 1.000 | 1.000 | 1.000 | 0.756 | 1.000 | 1.000
우편번호 | 1.000 | 1.000 | 1.000 | 0.756 | 1.000 | 1.000 | 1.000
의료기관주소(도로명) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
의료기관전화번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 우편번호 | 의료기관종별
우편번호 | 1.000 | 0.544
의료기관종별 | 0.544 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

index | 개설일자 | 개설자명 | 의료기관명 | 의료기관종별 | 우편번호 | 의료기관주소(도로명) | 의료기관전화번호 | 데이터기준일자
0 | 2014-11-26 | 이학수 | 서천이안과의원 | 의원 | 33642 | 충청남도 서천군 서천읍 충절로59번길 11-6 | 041-952-9990 | 2019-05-23
1 | 2014-10-10 | 조민수 | 서천항장외과의원 | 의원 | 33642 | 충청남도 서천군 서천읍 충절로49번길 11-5 | 041-952-9582 | 2019-05-23
2 | 2014-08-05 | 김재환 | 청담플러스의원 | 의원 | 33642 | 충청남도 서천군 서천읍 충절로41번길 2 | 041-953-8766 | 2019-05-23
3 | 2011-09-06 | 김홍섭 | 늘봄정형외과의원 | 의원 | 33642 | 충청남도 서천군 서천읍 군청로 7 | 041-951-0114 | 2019-05-23
4 | 2010-09-02 | 김바울 | 김바울내과의원 | 의원 | 33647 | 충청남도 서천군 서천읍 충절로 50, 3층 | 041-951-0606 | 2019-05-23
5 | 2009-04-24 | 김용우 | 서천 김안과의원 | 의원 | 33647 | 충청남도 서천군 서천읍 충절로 50 | 041-951-1511 | 2019-05-23
6 | 2009-03-26 | 이영수 | 우리들의원 | 의원 | 33642 | 충청남도 서천군 서천읍 충절로41번길 1 | 041-952-3333 | 2019-05-23
7 | 2008-11-04 | 김선수 | 소문난의원 | 의원 | 33615 | 충청남도 서천군 판교면 종판로 891 | 041-951-7588 | 2019-05-23
8 | 2008-10-08 | 이우찬 | 장항삼성의원 | 의원 | 33670 | 충청남도 서천군 장항읍 장항로 129-1 | 041-956-7580 | 2019-05-23
9 | 2008-06-30 | 이종찬 | 위앤장서천내과의원 | 의원 | 33643 | 충청남도 서천군 서천읍 서천로 96, 2층 | 041-953-0111 | 2019-05-23

Last rows

index | 개설일자 | 개설자명 | 의료기관명 | 의료기관종별 | 우편번호 | 의료기관주소(도로명) | 의료기관전화번호 | 데이터기준일자
25 | 1993-03-19 | 김성호 | 우리의원 | 의원 | 33608 | 충청남도 서천군 비인면 비인로 207-1 | 041-952-1790 | 2019-05-23
26 | 1992-11-03 | 한상배 | 한일의원 | 의원 | 33670 | 충청남도 서천군 장항읍 신창동로 39 | 041-956-8221 | 2019-05-23
27 | 1992-04-14 | 장봉열 | 장봉열내과의원 | 의원 | 33671 | 충청남도 서천군 장항읍 장서로29번길 3 | 041-956-5747 | 2019-05-23
28 | 1992-02-21 | 이규현 | 한산의원 | 의원 | 33624 | 충청남도 서천군 한산면 한산모시길 26-1 | 041-951-0002 | 2019-05-23
29 | 2017-12-12 | 장명훈 | 동서천요양병원 | 일반요양병원 | 33630 | 충청남도 서천군 화양면 활산로 245, 동서천요양병원 | 041-952-7147 | 2019-05-23
30 | 2012-04-25 | 김재겸 | 한국요양병원 | 일반요양병원 | 33654 | 충청남도 서천군 마서면 어리길 205-7 | 041-950-5200 | 2019-05-23
31 | 2008-10-23 | 서천군수 | 서천군립노인요양병원 | 일반요양병원 | 33610 | 충청남도 서천군 종천면 충서로302번길 88-12 | 041-950-1001 | 2019-05-23
32 | 2007-01-29 | 박정자 | 서천요양병원 | 일반요양병원 | 33649 | 충청남도 서천군 서천읍 삼산북길 56 | 041-953-8376 | 2019-05-23
33 | 2006-09-08 | 김진오 | 서천사랑병원 | 요양병원(정신병원) | 33615 | 충청남도 서천군 판교면 대백제로 2078 | 041-951-8114 | 2019-05-23
34 | 1988-06-03 | 김형주 | 의료법인 서해병원 | 병원 | 33635 | 충청남도 서천군 서천읍 서천로 184 | 041-951-8282 | 2019-05-23