Overview

Dataset statistics

Number of variables7
Number of observations56
Missing cells54
Missing cells (%)13.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.3 KiB
Average record size in memory60.4 B

Variable types

Numeric2
Categorical1
Text4

Dataset

Description영동군의 의료기관 병원, 일반의원, 한의원 치과 현황 자료 제공으로 번호, 구분, 의료기관명, 도로명주소, 병상수, 전화번호,팩스 정보가 있습니다.
Author충청북도 영동군
URLhttps://www.data.go.kr/data/3071331/fileData.do

Alerts

번호 is highly overall correlated with 병상 and 1 other fieldsHigh correlation
병상 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
구분 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
병상 has 49 (87.5%) missing valuesMissing
팩스 has 5 (8.9%) missing valuesMissing
번호 has unique valuesUnique
의료기관명 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:11:46.489950
Analysis finished2023-12-12 14:11:47.592921
Duration1.1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct56
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.5
Minimum1
Maximum56
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size636.0 B
2023-12-12T23:11:47.659023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.75
Q114.75
median28.5
Q342.25
95-th percentile53.25
Maximum56
Range55
Interquartile range (IQR)27.5

Descriptive statistics

Standard deviation16.309506
Coefficient of variation (CV)0.57226338
Kurtosis-1.2
Mean28.5
Median Absolute Deviation (MAD)14
Skewness0
Sum1596
Variance266
MonotonicityStrictly increasing
2023-12-12T23:11:47.821026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.8%
30 1
 
1.8%
32 1
 
1.8%
33 1
 
1.8%
34 1
 
1.8%
35 1
 
1.8%
36 1
 
1.8%
37 1
 
1.8%
38 1
 
1.8%
39 1
 
1.8%
Other values (46) 46
82.1%
ValueCountFrequency (%)
1 1
1.8%
2 1
1.8%
3 1
1.8%
4 1
1.8%
5 1
1.8%
6 1
1.8%
7 1
1.8%
8 1
1.8%
9 1
1.8%
10 1
1.8%
ValueCountFrequency (%)
56 1
1.8%
55 1
1.8%
54 1
1.8%
53 1
1.8%
52 1
1.8%
51 1
1.8%
50 1
1.8%
49 1
1.8%
48 1
1.8%
47 1
1.8%

구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)7.1%
Missing0
Missing (%)0.0%
Memory size580.0 B
일반의원
26 
한의원
15 
치과
11 
병원

Length

Max length4
Median length3
Mean length3.1964286
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row병원
2nd row병원
3rd row병원
4th row병원
5th row일반의원

Common Values

ValueCountFrequency (%)
일반의원 26
46.4%
한의원 15
26.8%
치과 11
19.6%
병원 4
 
7.1%

Length

2023-12-12T23:11:47.995312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:11:48.130284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반의원 26
46.4%
한의원 15
26.8%
치과 11
19.6%
병원 4
 
7.1%

의료기관명
Text

UNIQUE 

Distinct56
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size580.0 B
2023-12-12T23:11:48.368802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17.5
Mean length7.75
Min length4

Characters and Unicode

Total characters434
Distinct characters119
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)100.0%

Sample

1st row의료법인 다원의료재단 영동제일요양병원
2nd row의료법인 조윤의료재단 영동병원
3rd row학교법인 금강학원 영동군립노인전문병원
4th row의료법인 조윤의료재단 감고을요양병원
5th row서외과의원
ValueCountFrequency (%)
의료법인 3
 
4.7%
조윤의료재단 2
 
3.1%
광혜한의원 1
 
1.6%
속편한신내과의원 1
 
1.6%
노상필내과의원 1
 
1.6%
경희한의원 1
 
1.6%
영일당한의원 1
 
1.6%
성심한의원 1
 
1.6%
금강한의원 1
 
1.6%
북경한의원 1
 
1.6%
Other values (51) 51
79.7%
2023-12-12T23:11:48.884260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
62
 
14.3%
57
 
13.1%
29
 
6.7%
19
 
4.4%
11
 
2.5%
9
 
2.1%
9
 
2.1%
8
 
1.8%
7
 
1.6%
7
 
1.6%
Other values (109) 216
49.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 405
93.3%
Decimal Number 10
 
2.3%
Space Separator 9
 
2.1%
Close Punctuation 5
 
1.2%
Open Punctuation 5
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62
 
15.3%
57
 
14.1%
29
 
7.2%
19
 
4.7%
11
 
2.7%
9
 
2.2%
8
 
2.0%
7
 
1.7%
7
 
1.7%
6
 
1.5%
Other values (102) 190
46.9%
Decimal Number
ValueCountFrequency (%)
1 4
40.0%
8 3
30.0%
7 2
20.0%
9 1
 
10.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 405
93.3%
Common 29
 
6.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62
 
15.3%
57
 
14.1%
29
 
7.2%
19
 
4.7%
11
 
2.7%
9
 
2.2%
8
 
2.0%
7
 
1.7%
7
 
1.7%
6
 
1.5%
Other values (102) 190
46.9%
Common
ValueCountFrequency (%)
9
31.0%
) 5
17.2%
( 5
17.2%
1 4
13.8%
8 3
 
10.3%
7 2
 
6.9%
9 1
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 405
93.3%
ASCII 29
 
6.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
62
 
15.3%
57
 
14.1%
29
 
7.2%
19
 
4.7%
11
 
2.7%
9
 
2.2%
8
 
2.0%
7
 
1.7%
7
 
1.7%
6
 
1.5%
Other values (102) 190
46.9%
ASCII
ValueCountFrequency (%)
9
31.0%
) 5
17.2%
( 5
17.2%
1 4
13.8%
8 3
 
10.3%
7 2
 
6.9%
9 1
 
3.4%
Distinct47
Distinct (%)83.9%
Missing0
Missing (%)0.0%
Memory size580.0 B
2023-12-12T23:11:49.153813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length23
Mean length20.714286
Min length17

Characters and Unicode

Total characters1160
Distinct characters56
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)73.2%

Sample

1st row충청북도 영동군 양강면 양정죽촌로 53-12
2nd row충청북도 영동군 영동읍 대학로 106
3rd row충청북도 영동군 영동읍 대학로 290
4th row충청북도 영동군 영동읍 대학로 106
5th row충청북도 영동군 영동읍 학산영동로 1241
ValueCountFrequency (%)
영동군 56
19.0%
충청북도 55
18.7%
영동읍 46
15.6%
중앙로 21
 
7.1%
계산로 8
 
2.7%
3길 7
 
2.4%
황간면 6
 
2.0%
영동시장 4
 
1.4%
6 4
 
1.4%
33 4
 
1.4%
Other values (61) 83
28.2%
2023-12-12T23:11:49.597099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
239
20.6%
115
 
9.9%
114
 
9.8%
56
 
4.8%
56
 
4.8%
56
 
4.8%
55
 
4.7%
55
 
4.7%
47
 
4.1%
46
 
4.0%
Other values (46) 321
27.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 773
66.6%
Space Separator 239
 
20.6%
Decimal Number 136
 
11.7%
Dash Punctuation 11
 
0.9%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
115
14.9%
114
14.7%
56
 
7.2%
56
 
7.2%
56
 
7.2%
55
 
7.1%
55
 
7.1%
47
 
6.1%
46
 
6.0%
23
 
3.0%
Other values (33) 150
19.4%
Decimal Number
ValueCountFrequency (%)
1 31
22.8%
2 31
22.8%
3 24
17.6%
4 12
 
8.8%
6 9
 
6.6%
5 8
 
5.9%
8 6
 
4.4%
0 6
 
4.4%
9 5
 
3.7%
7 4
 
2.9%
Space Separator
ValueCountFrequency (%)
239
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 773
66.6%
Common 387
33.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
115
14.9%
114
14.7%
56
 
7.2%
56
 
7.2%
56
 
7.2%
55
 
7.1%
55
 
7.1%
47
 
6.1%
46
 
6.0%
23
 
3.0%
Other values (33) 150
19.4%
Common
ValueCountFrequency (%)
239
61.8%
1 31
 
8.0%
2 31
 
8.0%
3 24
 
6.2%
4 12
 
3.1%
- 11
 
2.8%
6 9
 
2.3%
5 8
 
2.1%
8 6
 
1.6%
0 6
 
1.6%
Other values (3) 10
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 773
66.6%
ASCII 387
33.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
239
61.8%
1 31
 
8.0%
2 31
 
8.0%
3 24
 
6.2%
4 12
 
3.1%
- 11
 
2.8%
6 9
 
2.3%
5 8
 
2.1%
8 6
 
1.6%
0 6
 
1.6%
Other values (3) 10
 
2.6%
Hangul
ValueCountFrequency (%)
115
14.9%
114
14.7%
56
 
7.2%
56
 
7.2%
56
 
7.2%
55
 
7.1%
55
 
7.1%
47
 
6.1%
46
 
6.0%
23
 
3.0%
Other values (33) 150
19.4%

병상
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct6
Distinct (%)85.7%
Missing49
Missing (%)87.5%
Infinite0
Infinite (%)0.0%
Mean100.85714
Minimum26
Maximum193
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size636.0 B
2023-12-12T23:11:49.735867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum26
5-th percentile26.9
Q129
median120
Q3154.5
95-th percentile183.4
Maximum193
Range167
Interquartile range (IQR)125.5

Descriptive statistics

Standard deviation71.445617
Coefficient of variation (CV)0.7083843
Kurtosis-2.2663062
Mean100.85714
Median Absolute Deviation (MAD)73
Skewness-0.031910567
Sum706
Variance5104.4762
MonotonicityNot monotonic
2023-12-12T23:11:49.859006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
29 2
 
3.6%
148 1
 
1.8%
161 1
 
1.8%
120 1
 
1.8%
193 1
 
1.8%
26 1
 
1.8%
(Missing) 49
87.5%
ValueCountFrequency (%)
26 1
1.8%
29 2
3.6%
120 1
1.8%
148 1
1.8%
161 1
1.8%
193 1
1.8%
ValueCountFrequency (%)
193 1
1.8%
161 1
1.8%
148 1
1.8%
120 1
1.8%
29 2
3.6%
26 1
1.8%

전화번호
Text

UNIQUE 

Distinct56
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size580.0 B
2023-12-12T23:11:50.169501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters672
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)100.0%

Sample

1st row043-745-3004
2nd row043-740-9000
3rd row043-745-1600
4th row043-740-9102
5th row043-742-1171
ValueCountFrequency (%)
043-745-3004 1
 
1.8%
043-740-9000 1
 
1.8%
043-742-5522 1
 
1.8%
043-743-6677 1
 
1.8%
043-744-2552 1
 
1.8%
043-743-1075 1
 
1.8%
043-744-7088 1
 
1.8%
043-744-4568 1
 
1.8%
043-743-1675 1
 
1.8%
043-743-6041 1
 
1.8%
Other values (46) 46
82.1%
2023-12-12T23:11:50.621766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 136
20.2%
- 112
16.7%
0 91
13.5%
7 91
13.5%
3 83
12.4%
2 47
 
7.0%
5 39
 
5.8%
8 27
 
4.0%
1 20
 
3.0%
9 14
 
2.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 560
83.3%
Dash Punctuation 112
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 136
24.3%
0 91
16.2%
7 91
16.2%
3 83
14.8%
2 47
 
8.4%
5 39
 
7.0%
8 27
 
4.8%
1 20
 
3.6%
9 14
 
2.5%
6 12
 
2.1%
Dash Punctuation
ValueCountFrequency (%)
- 112
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 672
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 136
20.2%
- 112
16.7%
0 91
13.5%
7 91
13.5%
3 83
12.4%
2 47
 
7.0%
5 39
 
5.8%
8 27
 
4.0%
1 20
 
3.0%
9 14
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 672
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 136
20.2%
- 112
16.7%
0 91
13.5%
7 91
13.5%
3 83
12.4%
2 47
 
7.0%
5 39
 
5.8%
8 27
 
4.0%
1 20
 
3.0%
9 14
 
2.1%

팩스
Text

MISSING 

Distinct51
Distinct (%)100.0%
Missing5
Missing (%)8.9%
Memory size580.0 B
2023-12-12T23:11:50.949913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters612
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)100.0%

Sample

1st row043-745-3006
2nd row043-740-9001
3rd row043-742-8275
4th row043-740-9101
5th row043-744-1839
ValueCountFrequency (%)
043-745-3006 1
 
2.0%
043-742-5523 1
 
2.0%
043-740-9001 1
 
2.0%
043-770-7583 1
 
2.0%
043-742-0818 1
 
2.0%
043-744-2551 1
 
2.0%
043-744-6411 1
 
2.0%
043-744-7088 1
 
2.0%
043-744-4560 1
 
2.0%
043-743-1698 1
 
2.0%
Other values (41) 41
80.4%
2023-12-12T23:11:51.380436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 123
20.1%
- 102
16.7%
3 78
12.7%
7 78
12.7%
0 75
12.3%
5 35
 
5.7%
2 34
 
5.6%
8 28
 
4.6%
1 21
 
3.4%
6 20
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 510
83.3%
Dash Punctuation 102
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 123
24.1%
3 78
15.3%
7 78
15.3%
0 75
14.7%
5 35
 
6.9%
2 34
 
6.7%
8 28
 
5.5%
1 21
 
4.1%
6 20
 
3.9%
9 18
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 102
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 612
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 123
20.1%
- 102
16.7%
3 78
12.7%
7 78
12.7%
0 75
12.3%
5 35
 
5.7%
2 34
 
5.6%
8 28
 
4.6%
1 21
 
3.4%
6 20
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 612
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 123
20.1%
- 102
16.7%
3 78
12.7%
7 78
12.7%
0 75
12.3%
5 35
 
5.7%
2 34
 
5.6%
8 28
 
4.6%
1 21
 
3.4%
6 20
 
3.3%

Interactions

2023-12-12T23:11:47.047008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:11:46.846473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:11:47.160398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:11:46.948625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:11:51.499664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호구분의료기관명도로명주소병상전화번호팩스
번호1.0000.9581.0000.8650.0001.0001.000
구분0.9581.0001.0000.7891.0001.0001.000
의료기관명1.0001.0001.0001.0001.0001.0001.000
도로명주소0.8650.7891.0001.0000.4841.0001.000
병상0.0001.0001.0000.4841.0001.0001.000
전화번호1.0001.0001.0001.0001.0001.0001.000
팩스1.0001.0001.0001.0001.0001.0001.000
2023-12-12T23:11:51.638942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호병상구분
번호1.000-0.6850.836
병상-0.6851.0000.632
구분0.8360.6321.000

Missing values

2023-12-12T23:11:47.290006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:11:47.435956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T23:11:47.548535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호구분의료기관명도로명주소병상전화번호팩스
01병원의료법인 다원의료재단 영동제일요양병원충청북도 영동군 양강면 양정죽촌로 53-12148043-745-3004043-745-3006
12병원의료법인 조윤의료재단 영동병원충청북도 영동군 영동읍 대학로 106161043-740-9000043-740-9001
23병원학교법인 금강학원 영동군립노인전문병원충청북도 영동군 영동읍 대학로 290120043-745-1600043-742-8275
34병원의료법인 조윤의료재단 감고을요양병원충청북도 영동군 영동읍 대학로 106193043-740-9102043-740-9101
45일반의원서외과의원충청북도 영동군 영동읍 학산영동로 124129043-742-1171043-744-1839
56일반의원한내과의원충청북도 영동군 영동읍 중앙로 3길 5-1<NA>043-742-2271043-742-2271
67일반의원소화의원충청북도 영동군 영동읍 중앙로 48-1<NA>043-742-2277043-744-3232
78일반의원강남의원충청북도 영동군 영동읍 영동시장 4길 12<NA>043-743-5522043-744-7189
89일반의원오정형외과의원충청북도 영동군 영동읍 계산로 526043-742-3711043-743-3711
910일반의원현대의원충청북도 영동군 영동읍 계산로 29-1<NA>043-743-0088043-743-7288
번호구분의료기관명도로명주소병상전화번호팩스
4647치과오치과의원충청북도 영동군 영동읍 중앙로 21<NA>043-743-3043043-743-9700
4748치과허치과의원충청북도 영동군 영동읍 중앙로 25<NA>043-744-0512043-744-0512
4849치과박치과의원충청북도 영동군 영동읍 중앙로 33<NA>043-744-9329043-745-0135
4950치과영동치과의원충청북도 영동군 영동읍 계산로 3<NA>043-745-2804043-745-2805
5051치과미래치과의원충청북도 영동군 영동읍 중앙로 28<NA>043-744-7572043-744-7573
5152치과서울하이치과의원(17호)충청북도 영동군 영동읍 영산로 7<NA>043-743-2875043-745-2878
5253치과백세치과의원(18호)충청북도 영동군 영동읍 중앙로 3길 2-2<NA>043-743-3328043-743-7528
5354치과임플라인치과의원(18호)충청북도 영동군 영동읍 중앙로 3길 2-2<NA>043-743-2872043-743-8872
5455치과소망치과의원충청북도 영동군 황간면 영동황간로 1690<NA>043-745-0288<NA>
5556치과연세치과의원충청북도 영동군 황간면 영동황간로 1704<NA>043-744-0028043-745-0028