Overview

Dataset statistics

Number of variables9
Number of observations3718
Missing cells52
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory276.1 KiB
Average record size in memory76.0 B

Variable types

Numeric3
Categorical3
Text3

Dataset

Description인천광역시 소재의 병원에 대한 의료기관명, 소재지, 진료과목과 관련된 정보를 제공합니다. * 연번,군구명,병원종별,의료기관명,소재지,병상수,진료과목,비고
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=3045143&srcSe=7661IVAWM27C61E190

Alerts

군구명 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
Unnamed: 8 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 is highly overall correlated with 군구명 and 1 other fieldsHigh correlation
병실수 is highly overall correlated with 병상수High correlation
병상수 is highly overall correlated with 병실수High correlation
병원종별 is highly imbalanced (50.0%)Imbalance
Unnamed: 8 is highly imbalanced (89.4%)Imbalance
연번 has unique valuesUnique
병실수 has 3182 (85.6%) zerosZeros
병상수 has 3211 (86.4%) zerosZeros

Reproduction

Analysis started2024-01-28 13:37:10.082445
Analysis finished2024-01-28 13:37:11.997736
Duration1.92 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct3718
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1859.5
Minimum1
Maximum3718
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size32.8 KiB
2024-01-28T22:37:12.386478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile186.85
Q1930.25
median1859.5
Q32788.75
95-th percentile3532.15
Maximum3718
Range3717
Interquartile range (IQR)1858.5

Descriptive statistics

Standard deviation1073.4385
Coefficient of variation (CV)0.57727264
Kurtosis-1.2
Mean1859.5
Median Absolute Deviation (MAD)929.5
Skewness0
Sum6913621
Variance1152270.2
MonotonicityStrictly increasing
2024-01-28T22:37:12.498408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
2485 1
 
< 0.1%
2473 1
 
< 0.1%
2474 1
 
< 0.1%
2475 1
 
< 0.1%
2476 1
 
< 0.1%
2477 1
 
< 0.1%
2478 1
 
< 0.1%
2479 1
 
< 0.1%
2480 1
 
< 0.1%
Other values (3708) 3708
99.7%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
3718 1
< 0.1%
3717 1
< 0.1%
3716 1
< 0.1%
3715 1
< 0.1%
3714 1
< 0.1%
3713 1
< 0.1%
3712 1
< 0.1%
3711 1
< 0.1%
3710 1
< 0.1%
3709 1
< 0.1%

군구명
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size29.2 KiB
남동구
761 
부평구
666 
서구
610 
미추홀구
482 
연수구
470 
Other values (5)
729 

Length

Max length4
Median length3
Mean length2.9020979
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강화군
2nd row강화군
3rd row강화군
4th row강화군
5th row강화군

Common Values

ValueCountFrequency (%)
남동구 761
20.5%
부평구 666
17.9%
서구 610
16.4%
미추홀구 482
13.0%
연수구 470
12.6%
계양구 378
10.2%
중구 145
 
3.9%
동구 91
 
2.4%
강화군 87
 
2.3%
옹진군 28
 
0.8%

Length

2024-01-28T22:37:12.625635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T22:37:12.763571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남동구 761
20.5%
부평구 666
17.9%
서구 610
16.4%
미추홀구 482
13.0%
연수구 470
12.6%
계양구 378
10.2%
중구 145
 
3.9%
동구 91
 
2.4%
강화군 87
 
2.3%
옹진군 28
 
0.8%

병원종별
Categorical

IMBALANCE 

Distinct16
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size29.2 KiB
의원
1763 
치과의원
987 
한의원
678 
병원
 
70
요양병원
 
47
Other values (11)
 
173

Length

Max length12
Median length5
Mean length2.8636364
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row종합병원
2nd row병원
3rd row정신병원
4th row정신병원
5th row요양병원

Common Values

ValueCountFrequency (%)
의원 1763
47.4%
치과의원 987
26.5%
한의원 678
 
18.2%
병원 70
 
1.9%
요양병원 47
 
1.3%
한방병원 45
 
1.2%
보건지소 26
 
0.7%
보건진료소 25
 
0.7%
종합병원 19
 
0.5%
정신병원 18
 
0.5%
Other values (6) 40
 
1.1%

Length

2024-01-28T22:37:12.879482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
의원 1763
47.4%
치과의원 987
26.5%
한의원 678
 
18.2%
병원 70
 
1.9%
요양병원 47
 
1.3%
한방병원 45
 
1.2%
보건지소 26
 
0.7%
보건진료소 25
 
0.7%
종합병원 19
 
0.5%
정신병원 18
 
0.5%
Other values (6) 40
 
1.1%
Distinct3429
Distinct (%)92.2%
Missing0
Missing (%)0.0%
Memory size29.2 KiB
2024-01-28T22:37:13.074538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length23
Mean length7.697149
Min length3

Characters and Unicode

Total characters28618
Distinct characters562
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3217 ?
Unique (%)86.5%

Sample

1st row비에스종합병원
2nd row강화병원
3rd row해주병원
4th row예담병원
5th row의료법인우진의료재단한길병원
ValueCountFrequency (%)
의료법인 17
 
0.4%
굿모닝치과의원 8
 
0.2%
의원 7
 
0.2%
동의보감한의원 6
 
0.2%
연세세브란스치과의원 5
 
0.1%
아가파의원 5
 
0.1%
이사랑치과의원 5
 
0.1%
서울이비인후과의원 5
 
0.1%
코아이비인후과의원 5
 
0.1%
정강의료재단 5
 
0.1%
Other values (3480) 3761
98.2%
2024-01-28T22:37:13.395055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3751
 
13.1%
3704
 
12.9%
2354
 
8.2%
1041
 
3.6%
874
 
3.1%
505
 
1.8%
481
 
1.7%
423
 
1.5%
412
 
1.4%
303
 
1.1%
Other values (552) 14770
51.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28183
98.5%
Decimal Number 161
 
0.6%
Space Separator 111
 
0.4%
Uppercase Letter 66
 
0.2%
Close Punctuation 30
 
0.1%
Open Punctuation 29
 
0.1%
Lowercase Letter 18
 
0.1%
Dash Punctuation 12
 
< 0.1%
Other Punctuation 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3751
 
13.3%
3704
 
13.1%
2354
 
8.4%
1041
 
3.7%
874
 
3.1%
505
 
1.8%
481
 
1.7%
423
 
1.5%
412
 
1.5%
303
 
1.1%
Other values (508) 14335
50.9%
Uppercase Letter
ValueCountFrequency (%)
S 10
15.2%
C 7
 
10.6%
J 5
 
7.6%
B 5
 
7.6%
N 4
 
6.1%
T 4
 
6.1%
H 4
 
6.1%
D 3
 
4.5%
K 3
 
4.5%
P 3
 
4.5%
Other values (9) 18
27.3%
Decimal Number
ValueCountFrequency (%)
5 39
24.2%
3 38
23.6%
6 33
20.5%
2 16
9.9%
0 14
 
8.7%
1 10
 
6.2%
7 4
 
2.5%
8 4
 
2.5%
4 2
 
1.2%
9 1
 
0.6%
Lowercase Letter
ValueCountFrequency (%)
e 7
38.9%
m 3
16.7%
h 2
 
11.1%
i 2
 
11.1%
t 1
 
5.6%
n 1
 
5.6%
c 1
 
5.6%
r 1
 
5.6%
Other Punctuation
ValueCountFrequency (%)
. 6
75.0%
& 1
 
12.5%
1
 
12.5%
Space Separator
ValueCountFrequency (%)
111
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Open Punctuation
ValueCountFrequency (%)
( 29
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28180
98.5%
Common 351
 
1.2%
Latin 84
 
0.3%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3751
 
13.3%
3704
 
13.1%
2354
 
8.4%
1041
 
3.7%
874
 
3.1%
505
 
1.8%
481
 
1.7%
423
 
1.5%
412
 
1.5%
303
 
1.1%
Other values (505) 14332
50.9%
Latin
ValueCountFrequency (%)
S 10
 
11.9%
e 7
 
8.3%
C 7
 
8.3%
J 5
 
6.0%
B 5
 
6.0%
N 4
 
4.8%
T 4
 
4.8%
H 4
 
4.8%
m 3
 
3.6%
D 3
 
3.6%
Other values (17) 32
38.1%
Common
ValueCountFrequency (%)
111
31.6%
5 39
 
11.1%
3 38
 
10.8%
6 33
 
9.4%
) 30
 
8.5%
( 29
 
8.3%
2 16
 
4.6%
0 14
 
4.0%
- 12
 
3.4%
1 10
 
2.8%
Other values (7) 19
 
5.4%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28180
98.5%
ASCII 434
 
1.5%
CJK 3
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3751
 
13.3%
3704
 
13.1%
2354
 
8.4%
1041
 
3.7%
874
 
3.1%
505
 
1.8%
481
 
1.7%
423
 
1.5%
412
 
1.5%
303
 
1.1%
Other values (505) 14332
50.9%
ASCII
ValueCountFrequency (%)
111
25.6%
5 39
 
9.0%
3 38
 
8.8%
6 33
 
7.6%
) 30
 
6.9%
( 29
 
6.7%
2 16
 
3.7%
0 14
 
3.2%
- 12
 
2.8%
S 10
 
2.3%
Other values (33) 102
23.5%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
None
ValueCountFrequency (%)
1
100.0%
Distinct3553
Distinct (%)95.6%
Missing0
Missing (%)0.0%
Memory size29.2 KiB
2024-01-28T22:37:13.704651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length81
Median length63
Mean length33.823023
Min length18

Characters and Unicode

Total characters125754
Distinct characters494
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3428 ?
Unique (%)92.2%

Sample

1st row인천광역시 강화군 강화읍 충렬사로 31
2nd row인천광역시 강화군 강화읍 강화대로312번길 11
3rd row인천광역시 강화군 하점면 창후로 286
4th row인천광역시 강화군 내가면 해안서로 964
5th row인천광역시 강화군 송해면 강화대로 902
ValueCountFrequency (%)
인천광역시 3719
 
15.2%
남동구 761
 
3.1%
부평구 666
 
2.7%
서구 610
 
2.5%
2층 532
 
2.2%
미추홀구 483
 
2.0%
연수구 470
 
1.9%
3층 387
 
1.6%
계양구 378
 
1.5%
구월동 304
 
1.2%
Other values (3446) 16221
66.1%
2024-01-28T22:37:14.154023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20825
 
16.6%
4882
 
3.9%
, 4870
 
3.9%
4152
 
3.3%
4089
 
3.3%
3875
 
3.1%
3855
 
3.1%
3786
 
3.0%
3767
 
3.0%
3743
 
3.0%
Other values (484) 67910
54.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 70872
56.4%
Decimal Number 20843
 
16.6%
Space Separator 20825
 
16.6%
Other Punctuation 4891
 
3.9%
Open Punctuation 3657
 
2.9%
Close Punctuation 3657
 
2.9%
Uppercase Letter 362
 
0.3%
Dash Punctuation 307
 
0.2%
Math Symbol 292
 
0.2%
Lowercase Letter 48
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4882
 
6.9%
4152
 
5.9%
4089
 
5.8%
3875
 
5.5%
3855
 
5.4%
3786
 
5.3%
3767
 
5.3%
3743
 
5.3%
2000
 
2.8%
1830
 
2.6%
Other values (428) 34893
49.2%
Uppercase Letter
ValueCountFrequency (%)
A 79
21.8%
C 33
 
9.1%
S 28
 
7.7%
B 27
 
7.5%
I 19
 
5.2%
M 18
 
5.0%
E 17
 
4.7%
K 15
 
4.1%
H 12
 
3.3%
W 11
 
3.0%
Other values (15) 103
28.5%
Decimal Number
ValueCountFrequency (%)
1 3308
15.9%
2 3265
15.7%
0 2919
14.0%
3 2838
13.6%
4 2261
10.8%
5 1725
8.3%
6 1357
6.5%
8 1167
 
5.6%
7 1131
 
5.4%
9 872
 
4.2%
Lowercase Letter
ValueCountFrequency (%)
e 15
31.2%
s 7
14.6%
d 7
14.6%
a 7
14.6%
r 7
14.6%
t 1
 
2.1%
h 1
 
2.1%
c 1
 
2.1%
i 1
 
2.1%
y 1
 
2.1%
Other Punctuation
ValueCountFrequency (%)
, 4870
99.6%
. 11
 
0.2%
' 7
 
0.1%
@ 1
 
< 0.1%
/ 1
 
< 0.1%
· 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
20825
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3657
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3657
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 307
100.0%
Math Symbol
ValueCountFrequency (%)
~ 292
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 70872
56.4%
Common 54472
43.3%
Latin 410
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4882
 
6.9%
4152
 
5.9%
4089
 
5.8%
3875
 
5.5%
3855
 
5.4%
3786
 
5.3%
3767
 
5.3%
3743
 
5.3%
2000
 
2.8%
1830
 
2.6%
Other values (428) 34893
49.2%
Latin
ValueCountFrequency (%)
A 79
19.3%
C 33
 
8.0%
S 28
 
6.8%
B 27
 
6.6%
I 19
 
4.6%
M 18
 
4.4%
E 17
 
4.1%
e 15
 
3.7%
K 15
 
3.7%
H 12
 
2.9%
Other values (25) 147
35.9%
Common
ValueCountFrequency (%)
20825
38.2%
, 4870
 
8.9%
( 3657
 
6.7%
) 3657
 
6.7%
1 3308
 
6.1%
2 3265
 
6.0%
0 2919
 
5.4%
3 2838
 
5.2%
4 2261
 
4.2%
5 1725
 
3.2%
Other values (11) 5147
 
9.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 70872
56.4%
ASCII 54881
43.6%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
20825
37.9%
, 4870
 
8.9%
( 3657
 
6.7%
) 3657
 
6.7%
1 3308
 
6.0%
2 3265
 
5.9%
0 2919
 
5.3%
3 2838
 
5.2%
4 2261
 
4.1%
5 1725
 
3.1%
Other values (45) 5556
 
10.1%
Hangul
ValueCountFrequency (%)
4882
 
6.9%
4152
 
5.9%
4089
 
5.8%
3875
 
5.5%
3855
 
5.4%
3786
 
5.3%
3767
 
5.3%
3743
 
5.3%
2000
 
2.8%
1830
 
2.6%
Other values (428) 34893
49.2%
None
ValueCountFrequency (%)
· 1
100.0%

병실수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct72
Distinct (%)2.0%
Missing27
Missing (%)0.7%
Infinite0
Infinite (%)0.0%
Mean2.4524519
Minimum0
Maximum300
Zeros3182
Zeros (%)85.6%
Negative0
Negative (%)0.0%
Memory size32.8 KiB
2024-01-28T22:37:14.279690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile15
Maximum300
Range300
Interquartile range (IQR)0

Descriptive statistics

Standard deviation11.557801
Coefficient of variation (CV)4.7127534
Kurtosis202.64336
Mean2.4524519
Median Absolute Deviation (MAD)0
Skewness11.342016
Sum9052
Variance133.58277
MonotonicityNot monotonic
2024-01-28T22:37:14.395209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 3182
85.6%
1 76
 
2.0%
2 34
 
0.9%
6 31
 
0.8%
9 26
 
0.7%
7 24
 
0.6%
10 22
 
0.6%
3 21
 
0.6%
8 20
 
0.5%
4 16
 
0.4%
Other values (62) 239
 
6.4%
(Missing) 27
 
0.7%
ValueCountFrequency (%)
0 3182
85.6%
1 76
 
2.0%
2 34
 
0.9%
3 21
 
0.6%
4 16
 
0.4%
5 12
 
0.3%
6 31
 
0.8%
7 24
 
0.6%
8 20
 
0.5%
9 26
 
0.7%
ValueCountFrequency (%)
300 1
 
< 0.1%
230 1
 
< 0.1%
184 1
 
< 0.1%
154 1
 
< 0.1%
140 1
 
< 0.1%
124 1
 
< 0.1%
104 1
 
< 0.1%
92 1
 
< 0.1%
90 1
 
< 0.1%
85 3
0.1%

병상수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct157
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.1818182
Minimum0
Maximum1137
Zeros3211
Zeros (%)86.4%
Negative0
Negative (%)0.0%
Memory size32.8 KiB
2024-01-28T22:37:14.517909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile42.3
Maximum1137
Range1137
Interquartile range (IQR)0

Descriptive statistics

Standard deviation46.891098
Coefficient of variation (CV)5.1069513
Kurtosis173.19465
Mean9.1818182
Median Absolute Deviation (MAD)0
Skewness10.593202
Sum34138
Variance2198.7751
MonotonicityNot monotonic
2024-01-28T22:37:14.637855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 3211
86.4%
1 47
 
1.3%
29 39
 
1.0%
2 36
 
1.0%
3 22
 
0.6%
4 19
 
0.5%
10 15
 
0.4%
24 11
 
0.3%
18 10
 
0.3%
9 9
 
0.2%
Other values (147) 299
 
8.0%
ValueCountFrequency (%)
0 3211
86.4%
1 47
 
1.3%
2 36
 
1.0%
3 22
 
0.6%
4 19
 
0.5%
5 6
 
0.2%
6 5
 
0.1%
7 4
 
0.1%
8 9
 
0.2%
9 9
 
0.2%
ValueCountFrequency (%)
1137 1
< 0.1%
901 1
< 0.1%
850 1
< 0.1%
544 1
< 0.1%
468 1
< 0.1%
414 1
< 0.1%
390 1
< 0.1%
368 1
< 0.1%
357 1
< 0.1%
355 1
< 0.1%
Distinct1233
Distinct (%)33.4%
Missing25
Missing (%)0.7%
Memory size29.2 KiB
2024-01-28T22:37:14.833562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length255
Median length132
Mean length30.530734
Min length1

Characters and Unicode

Total characters112750
Distinct characters81
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1006 ?
Unique (%)27.2%

Sample

1st row내과,외과,정형외과,신경외과,흉부외과,마취통증의학과,산부인과,소아청소년과,안과,비뇨의학과,영상의학과,진단검사의학과, 가정의학과,응급의학과,치과교정과
2nd row가정의학과 영상의학과 비뇨의학과 소아청소년과 정형외과 외과 내과 산부인과 영상의학과 마취통증의학과 진단검사의학과 응급의학과 피부/비뇨기과 흉부외과 안과 가정의학과
3rd row정신건강의학과
4th row정신건강의학과
5th row이비인후과 내과 외과 비뇨의학과 신경과 가정의학과
ValueCountFrequency (%)
내과 1183
 
5.9%
피부과 875
 
4.4%
소아청소년과 871
 
4.4%
한방내과 779
 
3.9%
한방소아과 719
 
3.6%
침구과 717
 
3.6%
한방부인과 713
 
3.6%
한방신경정신과 675
 
3.4%
한방안·이비인후·피부과 664
 
3.3%
이비인후과 657
 
3.3%
Other values (58) 12077
60.6%
2024-01-28T22:37:15.148875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21633
19.2%
16567
 
14.7%
4964
 
4.4%
4436
 
3.9%
4357
 
3.9%
3512
 
3.1%
3490
 
3.1%
3052
 
2.7%
2524
 
2.2%
2505
 
2.2%
Other values (71) 45710
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 94478
83.8%
Space Separator 16567
 
14.7%
Other Punctuation 1705
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21633
22.9%
4964
 
5.3%
4436
 
4.7%
4357
 
4.6%
3512
 
3.7%
3490
 
3.7%
3052
 
3.2%
2524
 
2.7%
2505
 
2.7%
2492
 
2.6%
Other values (66) 41513
43.9%
Other Punctuation
ValueCountFrequency (%)
· 1330
78.0%
, 373
 
21.9%
. 1
 
0.1%
/ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
16567
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 94478
83.8%
Common 18272
 
16.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21633
22.9%
4964
 
5.3%
4436
 
4.7%
4357
 
4.6%
3512
 
3.7%
3490
 
3.7%
3052
 
3.2%
2524
 
2.7%
2505
 
2.7%
2492
 
2.6%
Other values (66) 41513
43.9%
Common
ValueCountFrequency (%)
16567
90.7%
· 1330
 
7.3%
, 373
 
2.0%
. 1
 
< 0.1%
/ 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 94478
83.8%
ASCII 16942
 
15.0%
None 1330
 
1.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
21633
22.9%
4964
 
5.3%
4436
 
4.7%
4357
 
4.6%
3512
 
3.7%
3490
 
3.7%
3052
 
3.2%
2524
 
2.7%
2505
 
2.7%
2492
 
2.6%
Other values (66) 41513
43.9%
ASCII
ValueCountFrequency (%)
16567
97.8%
, 373
 
2.2%
. 1
 
< 0.1%
/ 1
 
< 0.1%
None
ValueCountFrequency (%)
· 1330
100.0%

Unnamed: 8
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size29.2 KiB
<NA>
3627 
0
 
90
1
 
1

Length

Max length4
Median length4
Mean length3.9265734
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 3627
97.6%
0 90
 
2.4%
1 1
 
< 0.1%

Length

2024-01-28T22:37:15.274816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T22:37:15.411080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 3627
97.6%
0 90
 
2.4%
1 1
 
< 0.1%

Interactions

2024-01-28T22:37:11.471557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:37:10.997297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:37:11.234539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:37:11.554552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:37:11.077936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:37:11.315689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:37:11.628749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:37:11.162224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-28T22:37:11.392565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T22:37:15.481819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번군구명병원종별병실수병상수Unnamed: 8
연번1.0000.9680.5070.0910.096NaN
군구명0.9681.0000.4970.0260.000NaN
병원종별0.5070.4971.0000.6800.6910.000
병실수0.0910.0260.6801.0000.9270.000
병상수0.0960.0000.6910.9271.0000.000
Unnamed: 8NaNNaN0.0000.0000.0001.000
2024-01-28T22:37:15.574635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
병원종별군구명Unnamed: 8
병원종별1.0000.2200.000
군구명0.2201.0001.000
Unnamed: 80.0001.0001.000
2024-01-28T22:37:15.655941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번병실수병상수군구명병원종별Unnamed: 8
연번1.000-0.052-0.0490.6900.2251.000
병실수-0.0521.0000.9980.0160.3570.000
병상수-0.0490.9981.0000.0000.4050.000
군구명0.6900.0160.0001.0000.2201.000
병원종별0.2250.3570.4050.2201.0000.000
Unnamed: 81.0000.0000.0001.0000.0001.000

Missing values

2024-01-28T22:37:11.732603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T22:37:11.851698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-28T22:37:11.946931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번군구명병원종별의료기관명소재지병실수병상수진료과목Unnamed: 8
01강화군종합병원비에스종합병원인천광역시 강화군 강화읍 충렬사로 3174266내과,외과,정형외과,신경외과,흉부외과,마취통증의학과,산부인과,소아청소년과,안과,비뇨의학과,영상의학과,진단검사의학과, 가정의학과,응급의학과,치과교정과<NA>
12강화군병원강화병원인천광역시 강화군 강화읍 강화대로312번길 112498가정의학과 영상의학과 비뇨의학과 소아청소년과 정형외과 외과 내과 산부인과 영상의학과 마취통증의학과 진단검사의학과 응급의학과 피부/비뇨기과 흉부외과 안과 가정의학과<NA>
23강화군정신병원해주병원인천광역시 강화군 하점면 창후로 28633180정신건강의학과<NA>
34강화군정신병원예담병원인천광역시 강화군 내가면 해안서로 9641668정신건강의학과<NA>
45강화군요양병원의료법인우진의료재단한길병원인천광역시 강화군 송해면 강화대로 9021271이비인후과 내과 외과 비뇨의학과 신경과 가정의학과<NA>
56강화군요양병원강화희망요양병원인천광역시 강화군 송해면 강화대로 7431790가정의학과 재활의학과 피부과 외과 신경과 내과<NA>
67강화군요양병원올바른 요양병원인천광역시 강화군 강화읍 강화대로 393번길 520104침구과 사상체질과 한방재활의학과 한방신경정신과 한방안·이비인후·피부과 한방내과 응급의학과 가정의학과 재활의학과 신경외과 정형외과 외과 신경과 내과<NA>
78강화군요양병원강화요양병원인천광역시 강화군 강화동로 18132110내과 신경과 외과 정형외과 흉부외과 성형외과 소아청소년과 안과 이비인후과 피부과 재활의학과<NA>
89강화군치과의원강화본치과의원인천광역시 강화군 강화읍 중앙로 9, 부경빌딩 3층00예방치과 구강병리과 영상치의학과 구강내과 치과보존과 치주과 소아치과 치과교정과 치과보철과 구강악안면외과<NA>
910강화군치과의원강화 탑치과의원인천광역시 강화군 강화읍 강화대로 403, 2층00치과<NA>
연번군구명병원종별의료기관명소재지병실수병상수진료과목Unnamed: 8
37083709중구병원영종국제병원인천광역시 중구 하늘별빛로65번길 7-9, 3,4층 (중산동)2177내과,정형외과,신경외과,영상의학과,신경과, 마취통증의학과<NA>
37093710중구요양병원힐락암요양병원인천광역시 중구 영종대로 106, 지하1층(일부), 5~6층, 8~10층 (운서동)43164한방내과 내과 비뇨의학과 정형외과 외과<NA>
37103711중구한방병원스카이한방병원인천광역시 중구 자연대로 47, 4층, 5층(중산동)1640내과, 한방내과, 한방부인과, 한방소아과, 한방안·이비인후·피부과, 한방신경정신과, 침구과, 한방재활의학과, 사상체질과<NA>
37113712중구치과병원슈어치과교정과치과병원인천광역시 중구 하늘중앙로225번길 3, 301~302호(중산동)00치과보철과, 치과교정과, 소아치과, 치주과, 치과보존과, 구강내과, 영상치의학과<NA>
37123713연수구병원송도미소어린이병원인천광역시 연수구 하모니로 158 송도타임스페이스 7~9층(송도동)3446내과, 소아청소년과<NA>
37133714연수구치과병원22세기서울치과병원인천광역시 연수구 컨벤시아대로 69, 415호 (송도동, 송도 밀레니엄)00치과보존과 치주과 소아치과 치과교정과 치과보철과 구강악안면외과<NA>
37143715연수구치과병원서울시카고치과병원인천광역시 연수구 컨벤시아대로 81, 701,704호 (송도동, 드림시티)00예방치과 구강병리과 영상치의학과 구강내과 치과보존과 치주과 소아치과 치과교정과 치과보철과 구강악안면외과<NA>
37153716연수구한방병원송도한방병원인천광역시 연수구 하모니로 158 송도타임스페이스 6층(송도동)1244침구과 사상체질과 한방재활의학과 한방신경정신과 한방안·이비인후·피부과 한방소아과 한방부인과 한방내과<NA>
37163717연수구한방병원자양한방병원인천광역시 연수구 신송로 154, 4층(송도동)1339가정의학과, 한방내과, 한방부인과, 한방소아과, 한방안· 이비인후·피부과, 침구과, 한방재활의학과, 사상체질과<NA>
37173718연수구병원송도연세병원인천광역시 연수구 신송로 159-7, 3층~8층 일부(송도동)2646내과, 신경과, 외과, 정형외과, 신경외과, 마취통증의학과, 피부과, 영상의학과, 재활의학과, 가정의학과<NA>