Overview

Dataset statistics

Number of variables16
Number of observations313
Missing cells278
Missing cells (%)5.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory39.3 KiB
Average record size in memory128.4 B

Variable types

DateTime3
Categorical7
Text6

Dataset

Description충청남도 공주시 진단용방사선발생장치 현황에 대한 데이터로 (의료기관명, 장비명, 장비수 ) 등의 항목을 제공합니다.
Author충청남도 공주시
URLhttps://www.data.go.kr/data/15030706/fileData.do

Alerts

장비용도 is highly overall correlated with 장비형태High correlation
장비형태 is highly overall correlated with 장비용도High correlation
제조국명 is highly imbalanced (52.8%)Imbalance
의료기관영업구분 is highly imbalanced (60.7%)Imbalance
판매회사 has 263 (84.0%) missing valuesMissing
제조사 has 15 (4.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 06:12:07.018746
Analysis finished2023-12-12 06:12:08.330922
Duration1.31 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct240
Distinct (%)76.7%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
Minimum1994-05-23 00:00:00
Maximum2018-08-24 00:00:00
2023-12-12T15:12:08.449344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:12:08.625272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct5
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
사용중
159 
양도양수
70 
폐기
41 
사용중지
34 
이전
 
9

Length

Max length4
Median length3
Mean length3.172524
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사용중
2nd row사용중
3rd row사용중
4th row사용중
5th row사용중

Common Values

ValueCountFrequency (%)
사용중 159
50.8%
양도양수 70
22.4%
폐기 41
 
13.1%
사용중지 34
 
10.9%
이전 9
 
2.9%

Length

2023-12-12T15:12:08.752547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:12:08.857048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사용중 159
50.8%
양도양수 70
22.4%
폐기 41
 
13.1%
사용중지 34
 
10.9%
이전 9
 
2.9%

장비용도
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
치과일반
71 
일반
48 
촬영 및 투시
48 
골밀도
35 
치과용 파노라마
33 
Other values (10)
78 

Length

Max length12
Median length8
Mean length4.7380192
Min length2

Unique

Unique3 ?
Unique (%)1.0%

Sample

1st row골밀도
2nd row일반
3rd row유방촬영
4th row골밀도
5th row일반

Common Values

ValueCountFrequency (%)
치과일반 71
22.7%
일반 48
15.3%
촬영 및 투시 48
15.3%
골밀도 35
11.2%
치과용 파노라마 33
10.5%
전신 16
 
5.1%
치과용CT 및 파노라마 14
 
4.5%
C-arm 14
 
4.5%
이동용 14
 
4.5%
유방촬영 11
 
3.5%
Other values (5) 9
 
2.9%

Length

2023-12-12T15:12:08.969720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
치과일반 71
15.0%
62
13.1%
촬영 49
10.4%
투시 49
10.4%
일반 48
10.2%
파노라마 47
10.0%
골밀도 35
7.4%
치과용 33
7.0%
치과용ct 18
 
3.8%
전신 16
 
3.4%
Other values (7) 44
9.3%

장비형태
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
거치형
273 
이동형
40 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row거치형
2nd row거치형
3rd row거치형
4th row거치형
5th row거치형

Common Values

ValueCountFrequency (%)
거치형 273
87.2%
이동형 40
 
12.8%

Length

2023-12-12T15:12:09.086343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:12:09.179170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
거치형 273
87.2%
이동형 40
 
12.8%

장비상태
Categorical

Distinct3
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
신제품
216 
중고제품
86 
기타
 
11

Length

Max length4
Median length3
Mean length3.2396166
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신제품
2nd row중고제품
3rd row중고제품
4th row중고제품
5th row중고제품

Common Values

ValueCountFrequency (%)
신제품 216
69.0%
중고제품 86
 
27.5%
기타 11
 
3.5%

Length

2023-12-12T15:12:09.276229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:12:09.368731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신제품 216
69.0%
중고제품 86
 
27.5%
기타 11
 
3.5%

판매회사
Text

MISSING 

Distinct38
Distinct (%)76.0%
Missing263
Missing (%)84.0%
Memory size2.6 KiB
2023-12-12T15:12:09.521847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length6.1
Min length2

Characters and Unicode

Total characters305
Distinct characters94
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)62.0%

Sample

1st row(주)한결메디칼
2nd row바텍(주)
3rd row(주)바텍
4th row한결메디칼
5th row영한엑스레이(주)
ValueCountFrequency (%)
한결메디칼 5
 
10.0%
리스템 3
 
6.0%
바텍 3
 
6.0%
리스템대전충남북대리점 2
 
4.0%
제노레이 2
 
4.0%
바텍코리아 2
 
4.0%
포인트닉스 2
 
4.0%
주)아시아방사선 1
 
2.0%
도시바메디칼시스템즈코리아 1
 
2.0%
주)코메드메디칼 1
 
2.0%
Other values (28) 28
56.0%
2023-12-12T15:12:09.815899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
 
5.6%
16
 
5.2%
16
 
5.2%
14
 
4.6%
) 14
 
4.6%
( 14
 
4.6%
13
 
4.3%
12
 
3.9%
10
 
3.3%
9
 
3.0%
Other values (84) 170
55.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 275
90.2%
Close Punctuation 14
 
4.6%
Open Punctuation 14
 
4.6%
Uppercase Letter 2
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
6.2%
16
 
5.8%
16
 
5.8%
14
 
5.1%
13
 
4.7%
12
 
4.4%
10
 
3.6%
9
 
3.3%
9
 
3.3%
9
 
3.3%
Other values (80) 150
54.5%
Uppercase Letter
ValueCountFrequency (%)
T 1
50.0%
I 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 275
90.2%
Common 28
 
9.2%
Latin 2
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
6.2%
16
 
5.8%
16
 
5.8%
14
 
5.1%
13
 
4.7%
12
 
4.4%
10
 
3.6%
9
 
3.3%
9
 
3.3%
9
 
3.3%
Other values (80) 150
54.5%
Common
ValueCountFrequency (%)
) 14
50.0%
( 14
50.0%
Latin
ValueCountFrequency (%)
T 1
50.0%
I 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 275
90.2%
ASCII 30
 
9.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
17
 
6.2%
16
 
5.8%
16
 
5.8%
14
 
5.1%
13
 
4.7%
12
 
4.4%
10
 
3.6%
9
 
3.3%
9
 
3.3%
9
 
3.3%
Other values (80) 150
54.5%
ASCII
ValueCountFrequency (%)
) 14
46.7%
( 14
46.7%
T 1
 
3.3%
I 1
 
3.3%

장치명칭
Categorical

Distinct16
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
치과진단용 엑스선 발생장치
99 
진단용 엑스선 장치
94 
진단용 엑스선 발생기
71 
치과용 전산화 단층 촬영장치
20 
전산화 단층 촬영장치
 
9
Other values (11)
20 

Length

Max length15
Median length14
Mean length11.776358
Min length7

Unique

Unique7 ?
Unique (%)2.2%

Sample

1st row진단용 엑스선 발생기
2nd row진단용 엑스선 장치
3rd row유방촬영용장치
4th row진단용 엑스선 발생기
5th row진단용 엑스선 장치

Common Values

ValueCountFrequency (%)
치과진단용 엑스선 발생장치 99
31.6%
진단용 엑스선 장치 94
30.0%
진단용 엑스선 발생기 71
22.7%
치과용 전산화 단층 촬영장치 20
 
6.4%
전산화 단층 촬영장치 9
 
2.9%
유방촬영용장치 5
 
1.6%
유방촬영용 장치 3
 
1.0%
유방촬영용 장치 등 3
 
1.0%
진단용엑스선촬영장치 2
 
0.6%
전신용엑스선골밀도측정기 1
 
0.3%
Other values (6) 6
 
1.9%

Length

2023-12-12T15:12:09.931518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
엑스선 264
28.4%
진단용 165
17.8%
장치 100
 
10.8%
치과진단용 99
 
10.7%
발생장치 99
 
10.7%
발생기 71
 
7.7%
전산화 29
 
3.1%
단층 29
 
3.1%
촬영장치 29
 
3.1%
치과용 20
 
2.2%
Other values (11) 23
 
2.5%
Distinct207
Distinct (%)66.1%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T15:12:10.182481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length42
Mean length9.5463259
Min length3

Characters and Unicode

Total characters2988
Distinct characters73
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique159 ?
Unique (%)50.8%

Sample

1st rowDEXXUM T
2nd rowMXHF-1500R
3rd rowAffinity Mammography System & Accessories
4th rowOSTEOPRIMA
5th rowDKⅡ-525RF
ValueCountFrequency (%)
dexxum 18
 
4.0%
max-gls 18
 
4.0%
t 18
 
4.0%
zeus 8
 
1.8%
esx 6
 
1.3%
max-gl 6
 
1.3%
point 5
 
1.1%
pht-30lfo 5
 
1.1%
dxg-5125 4
 
0.9%
제우스 4
 
0.9%
Other values (267) 353
79.3%
2023-12-12T15:12:10.561598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 227
 
7.6%
- 222
 
7.4%
X 176
 
5.9%
R 132
 
4.4%
132
 
4.4%
S 128
 
4.3%
A 120
 
4.0%
D 119
 
4.0%
E 104
 
3.5%
T 98
 
3.3%
Other values (63) 1530
51.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 1583
53.0%
Decimal Number 563
 
18.8%
Lowercase Letter 462
 
15.5%
Dash Punctuation 222
 
7.4%
Space Separator 132
 
4.4%
Other Letter 12
 
0.4%
Other Punctuation 6
 
0.2%
Letter Number 6
 
0.2%
Math Symbol 2
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
X 176
 
11.1%
R 132
 
8.3%
S 128
 
8.1%
A 120
 
7.6%
D 119
 
7.5%
E 104
 
6.6%
T 98
 
6.2%
M 90
 
5.7%
O 79
 
5.0%
H 74
 
4.7%
Other values (16) 463
29.2%
Lowercase Letter
ValueCountFrequency (%)
a 55
11.9%
o 49
10.6%
i 46
 
10.0%
r 36
 
7.8%
e 33
 
7.1%
t 31
 
6.7%
n 26
 
5.6%
s 23
 
5.0%
y 22
 
4.8%
m 21
 
4.5%
Other values (16) 120
26.0%
Decimal Number
ValueCountFrequency (%)
0 227
40.3%
5 88
 
15.6%
3 65
 
11.5%
2 64
 
11.4%
1 62
 
11.0%
6 23
 
4.1%
7 12
 
2.1%
4 10
 
1.8%
8 10
 
1.8%
9 2
 
0.4%
Other Letter
ValueCountFrequency (%)
4
33.3%
4
33.3%
4
33.3%
Letter Number
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Other Punctuation
ValueCountFrequency (%)
& 3
50.0%
/ 3
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 222
100.0%
Space Separator
ValueCountFrequency (%)
132
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2050
68.6%
Common 925
31.0%
Hangul 12
 
0.4%
Greek 1
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
X 176
 
8.6%
R 132
 
6.4%
S 128
 
6.2%
A 120
 
5.9%
D 119
 
5.8%
E 104
 
5.1%
T 98
 
4.8%
M 90
 
4.4%
O 79
 
3.9%
H 74
 
3.6%
Other values (44) 930
45.4%
Common
ValueCountFrequency (%)
0 227
24.5%
- 222
24.0%
132
14.3%
5 88
 
9.5%
3 65
 
7.0%
2 64
 
6.9%
1 62
 
6.7%
6 23
 
2.5%
7 12
 
1.3%
4 10
 
1.1%
Other values (5) 20
 
2.2%
Hangul
ValueCountFrequency (%)
4
33.3%
4
33.3%
4
33.3%
Greek
ValueCountFrequency (%)
α 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2969
99.4%
Hangul 12
 
0.4%
Number Forms 6
 
0.2%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 227
 
7.6%
- 222
 
7.5%
X 176
 
5.9%
R 132
 
4.4%
132
 
4.4%
S 128
 
4.3%
A 120
 
4.0%
D 119
 
4.0%
E 104
 
3.5%
T 98
 
3.3%
Other values (56) 1511
50.9%
Hangul
ValueCountFrequency (%)
4
33.3%
4
33.3%
4
33.3%
Number Forms
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
None
ValueCountFrequency (%)
α 1
100.0%
Distinct100
Distinct (%)31.9%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T15:12:10.730992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length19
Mean length8.4153355
Min length6

Characters and Unicode

Total characters2634
Distinct characters31
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique56 ?
Unique (%)17.9%

Sample

1st rowBHR-83-P
2nd rowR-500-125
3rd rowMR-100-39
4th rowBHR-86-P
5th rowRF-500-125
ValueCountFrequency (%)
d-60-p 35
 
11.1%
r-500-125 23
 
7.3%
dp-90-p 16
 
5.1%
bhr-83-p 13
 
4.1%
hr-60-p 11
 
3.5%
r-300-125 10
 
3.2%
r-100-100 10
 
3.2%
d-70-p 9
 
2.9%
dp-80-p 8
 
2.5%
hrf-110-p 8
 
2.5%
Other values (91) 171
54.5%
2023-12-12T15:12:11.002968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 642
24.4%
0 438
16.6%
P 254
 
9.6%
1 200
 
7.6%
R 197
 
7.5%
5 147
 
5.6%
D 116
 
4.4%
H 91
 
3.5%
6 81
 
3.1%
2 77
 
2.9%
Other values (21) 391
14.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1140
43.3%
Uppercase Letter 831
31.5%
Dash Punctuation 642
24.4%
Open Punctuation 10
 
0.4%
Close Punctuation 10
 
0.4%
Space Separator 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
P 254
30.6%
R 197
23.7%
D 116
14.0%
H 91
 
11.0%
B 37
 
4.5%
F 33
 
4.0%
C 32
 
3.9%
T 31
 
3.7%
M 14
 
1.7%
W 9
 
1.1%
Other values (7) 17
 
2.0%
Decimal Number
ValueCountFrequency (%)
0 438
38.4%
1 200
17.5%
5 147
 
12.9%
6 81
 
7.1%
2 77
 
6.8%
3 60
 
5.3%
9 58
 
5.1%
8 49
 
4.3%
7 21
 
1.8%
4 9
 
0.8%
Dash Punctuation
ValueCountFrequency (%)
- 642
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1803
68.5%
Latin 831
31.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
P 254
30.6%
R 197
23.7%
D 116
14.0%
H 91
 
11.0%
B 37
 
4.5%
F 33
 
4.0%
C 32
 
3.9%
T 31
 
3.7%
M 14
 
1.7%
W 9
 
1.1%
Other values (7) 17
 
2.0%
Common
ValueCountFrequency (%)
- 642
35.6%
0 438
24.3%
1 200
 
11.1%
5 147
 
8.2%
6 81
 
4.5%
2 77
 
4.3%
3 60
 
3.3%
9 58
 
3.2%
8 49
 
2.7%
7 21
 
1.2%
Other values (4) 30
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2634
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 642
24.4%
0 438
16.6%
P 254
 
9.6%
1 200
 
7.6%
R 197
 
7.5%
5 147
 
5.6%
D 116
 
4.4%
H 91
 
3.5%
6 81
 
3.1%
2 77
 
2.9%
Other values (21) 391
14.8%
Distinct296
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T15:12:11.266655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length15
Mean length8.1214058
Min length2

Characters and Unicode

Total characters2542
Distinct characters53
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique279 ?
Unique (%)89.1%

Sample

1st rowDT1800229
2nd row405009
3rd row21103060365
4th rowMBA0801-123
5th rowGJ2-1020
ValueCountFrequency (%)
92271 2
 
0.6%
15372 2
 
0.6%
51 2
 
0.6%
ah1ff 2
 
0.6%
920902 2
 
0.6%
78722 2
 
0.6%
961237 2
 
0.6%
30110801153 2
 
0.6%
606780 2
 
0.6%
vx70-226 2
 
0.6%
Other values (287) 295
93.7%
2023-12-12T15:12:11.648556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 491
19.3%
1 333
13.1%
2 188
 
7.4%
3 150
 
5.9%
6 136
 
5.4%
4 135
 
5.3%
9 120
 
4.7%
5 119
 
4.7%
7 117
 
4.6%
- 114
 
4.5%
Other values (43) 639
25.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1891
74.4%
Uppercase Letter 501
 
19.7%
Dash Punctuation 114
 
4.5%
Lowercase Letter 19
 
0.7%
Other Punctuation 6
 
0.2%
Math Symbol 4
 
0.2%
Other Letter 3
 
0.1%
Space Separator 2
 
0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 79
15.8%
X 33
 
6.6%
P 30
 
6.0%
M 30
 
6.0%
G 29
 
5.8%
B 29
 
5.8%
H 27
 
5.4%
S 26
 
5.2%
E 25
 
5.0%
D 24
 
4.8%
Other values (15) 169
33.7%
Decimal Number
ValueCountFrequency (%)
0 491
26.0%
1 333
17.6%
2 188
 
9.9%
3 150
 
7.9%
6 136
 
7.2%
4 135
 
7.1%
9 120
 
6.3%
5 119
 
6.3%
7 117
 
6.2%
8 102
 
5.4%
Lowercase Letter
ValueCountFrequency (%)
c 5
26.3%
a 5
26.3%
k 4
21.1%
m 2
 
10.5%
h 1
 
5.3%
f 1
 
5.3%
e 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 4
66.7%
& 1
 
16.7%
/ 1
 
16.7%
Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 114
100.0%
Math Symbol
ValueCountFrequency (%)
+ 4
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2019
79.4%
Latin 520
 
20.5%
Hangul 3
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 79
15.2%
X 33
 
6.3%
P 30
 
5.8%
M 30
 
5.8%
G 29
 
5.6%
B 29
 
5.6%
H 27
 
5.2%
S 26
 
5.0%
E 25
 
4.8%
D 24
 
4.6%
Other values (22) 188
36.2%
Common
ValueCountFrequency (%)
0 491
24.3%
1 333
16.5%
2 188
 
9.3%
3 150
 
7.4%
6 136
 
6.7%
4 135
 
6.7%
9 120
 
5.9%
5 119
 
5.9%
7 117
 
5.8%
- 114
 
5.6%
Other values (8) 116
 
5.7%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2539
99.9%
Hangul 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 491
19.3%
1 333
13.1%
2 188
 
7.4%
3 150
 
5.9%
6 136
 
5.4%
4 135
 
5.3%
9 120
 
4.7%
5 119
 
4.7%
7 117
 
4.6%
- 114
 
4.5%
Other values (40) 636
25.0%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

제조사
Text

MISSING 

Distinct136
Distinct (%)45.6%
Missing15
Missing (%)4.8%
Memory size2.6 KiB
2023-12-12T15:12:11.959851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length89
Median length21
Mean length6.1107383
Min length2

Characters and Unicode

Total characters1821
Distinct characters154
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique86 ?
Unique (%)28.9%

Sample

1st row(주)오스테오시스
2nd row(주)엠아이에스
3rd rowLORAD
4th row(주)메디레이
5th rowDK메디칼
ValueCountFrequency (%)
신흥 22
 
6.8%
바텍 19
 
5.9%
오스테오시스 14
 
4.3%
ge 11
 
3.4%
동아 8
 
2.5%
hitachi 8
 
2.5%
주)오스테오시스 8
 
2.5%
동아x-선기계 7
 
2.2%
주)바텍 7
 
2.2%
sirona 7
 
2.2%
Other values (131) 211
65.5%
2023-12-12T15:12:12.344733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
92
 
5.1%
) 79
 
4.3%
78
 
4.3%
( 77
 
4.2%
I 49
 
2.7%
47
 
2.6%
A 46
 
2.5%
44
 
2.4%
41
 
2.3%
36
 
2.0%
Other values (144) 1232
67.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 983
54.0%
Uppercase Letter 453
24.9%
Lowercase Letter 164
 
9.0%
Close Punctuation 79
 
4.3%
Open Punctuation 77
 
4.2%
Space Separator 27
 
1.5%
Dash Punctuation 20
 
1.1%
Decimal Number 15
 
0.8%
Other Punctuation 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
92
 
9.4%
78
 
7.9%
47
 
4.8%
44
 
4.5%
41
 
4.2%
36
 
3.7%
34
 
3.5%
34
 
3.5%
32
 
3.3%
31
 
3.2%
Other values (91) 514
52.3%
Uppercase Letter
ValueCountFrequency (%)
I 49
 
10.8%
A 46
 
10.2%
T 35
 
7.7%
S 34
 
7.5%
H 33
 
7.3%
N 30
 
6.6%
E 30
 
6.6%
M 25
 
5.5%
R 23
 
5.1%
O 23
 
5.1%
Other values (13) 125
27.6%
Lowercase Letter
ValueCountFrequency (%)
a 22
13.4%
i 17
10.4%
e 14
 
8.5%
o 13
 
7.9%
t 12
 
7.3%
r 11
 
6.7%
h 11
 
6.7%
y 9
 
5.5%
s 9
 
5.5%
d 8
 
4.9%
Other values (9) 38
23.2%
Decimal Number
ValueCountFrequency (%)
1 6
40.0%
0 5
33.3%
9 2
 
13.3%
3 1
 
6.7%
8 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
/ 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 79
100.0%
Open Punctuation
ValueCountFrequency (%)
( 77
100.0%
Space Separator
ValueCountFrequency (%)
27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 983
54.0%
Latin 617
33.9%
Common 221
 
12.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
92
 
9.4%
78
 
7.9%
47
 
4.8%
44
 
4.5%
41
 
4.2%
36
 
3.7%
34
 
3.5%
34
 
3.5%
32
 
3.3%
31
 
3.2%
Other values (91) 514
52.3%
Latin
ValueCountFrequency (%)
I 49
 
7.9%
A 46
 
7.5%
T 35
 
5.7%
S 34
 
5.5%
H 33
 
5.3%
N 30
 
4.9%
E 30
 
4.9%
M 25
 
4.1%
R 23
 
3.7%
O 23
 
3.7%
Other values (32) 289
46.8%
Common
ValueCountFrequency (%)
) 79
35.7%
( 77
34.8%
27
 
12.2%
- 20
 
9.0%
1 6
 
2.7%
0 5
 
2.3%
. 2
 
0.9%
9 2
 
0.9%
3 1
 
0.5%
8 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 983
54.0%
ASCII 838
46.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
92
 
9.4%
78
 
7.9%
47
 
4.8%
44
 
4.5%
41
 
4.2%
36
 
3.7%
34
 
3.5%
34
 
3.5%
32
 
3.3%
31
 
3.2%
Other values (91) 514
52.3%
ASCII
ValueCountFrequency (%)
) 79
 
9.4%
( 77
 
9.2%
I 49
 
5.8%
A 46
 
5.5%
T 35
 
4.2%
S 34
 
4.1%
H 33
 
3.9%
N 30
 
3.6%
E 30
 
3.6%
27
 
3.2%
Other values (43) 398
47.5%

제조국명
Categorical

IMBALANCE 

Distinct10
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
대한민국
226 
일본
33 
<NA>
 
14
미국
 
13
독일
 
10
Other values (5)
 
17

Length

Max length4
Median length4
Mean length3.5942492
Min length2

Unique

Unique2 ?
Unique (%)0.6%

Sample

1st row대한민국
2nd row대한민국
3rd row미국
4th row대한민국
5th row대한민국

Common Values

ValueCountFrequency (%)
대한민국 226
72.2%
일본 33
 
10.5%
<NA> 14
 
4.5%
미국 13
 
4.2%
독일 10
 
3.2%
프랑스 6
 
1.9%
핀란드 6
 
1.9%
이탈리아 3
 
1.0%
멕시코 1
 
0.3%
터키 1
 
0.3%

Length

2023-12-12T15:12:12.485481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:12:12.636193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대한민국 226
72.2%
일본 33
 
10.5%
na 14
 
4.5%
미국 13
 
4.2%
독일 10
 
3.2%
프랑스 6
 
1.9%
핀란드 6
 
1.9%
이탈리아 3
 
1.0%
멕시코 1
 
0.3%
터키 1
 
0.3%
Distinct102
Distinct (%)32.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
Minimum1970-12-08 00:00:00
Maximum2018-05-18 00:00:00
2023-12-12T15:12:12.824705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:12:12.993925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

의료기관영업구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
영업중
269 
폐업
42 
직권폐업
 
2

Length

Max length4
Median length3
Mean length2.8722045
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업중
2nd row영업중
3rd row영업중
4th row영업중
5th row영업중

Common Values

ValueCountFrequency (%)
영업중 269
85.9%
폐업 42
 
13.4%
직권폐업 2
 
0.6%

Length

2023-12-12T15:12:13.159492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:12:13.364166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업중 269
85.9%
폐업 42
 
13.4%
직권폐업 2
 
0.6%
Distinct104
Distinct (%)33.2%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T15:12:13.721932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length13
Mean length7.3354633
Min length4

Characters and Unicode

Total characters2296
Distinct characters152
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)9.3%

Sample

1st row장내과의원
2nd row공주푸르메요양병원
3rd row속튼튼내과의원
4th row속튼튼내과의원
5th row속튼튼내과의원
ValueCountFrequency (%)
충청남도 23
 
6.7%
공주의료원 23
 
6.7%
공주현대병원 15
 
4.3%
이치과의원 8
 
2.3%
공주푸르메요양병원 7
 
2.0%
윤정형외과의원 7
 
2.0%
정연채내과의원 7
 
2.0%
유구박치과의원 6
 
1.7%
국립공주병원 6
 
1.7%
순풍외과의원 6
 
1.7%
Other values (101) 237
68.7%
2023-12-12T15:12:14.264924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
299
 
13.0%
273
 
11.9%
196
 
8.5%
105
 
4.6%
88
 
3.8%
86
 
3.7%
50
 
2.2%
40
 
1.7%
37
 
1.6%
32
 
1.4%
Other values (142) 1090
47.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2264
98.6%
Space Separator 32
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
299
 
13.2%
273
 
12.1%
196
 
8.7%
105
 
4.6%
88
 
3.9%
86
 
3.8%
50
 
2.2%
40
 
1.8%
37
 
1.6%
31
 
1.4%
Other values (141) 1059
46.8%
Space Separator
ValueCountFrequency (%)
32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2264
98.6%
Common 32
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
299
 
13.2%
273
 
12.1%
196
 
8.7%
105
 
4.6%
88
 
3.9%
86
 
3.8%
50
 
2.2%
40
 
1.8%
37
 
1.6%
31
 
1.4%
Other values (141) 1059
46.8%
Common
ValueCountFrequency (%)
32
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2264
98.6%
ASCII 32
 
1.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
299
 
13.2%
273
 
12.1%
196
 
8.7%
105
 
4.6%
88
 
3.9%
86
 
3.8%
50
 
2.2%
40
 
1.8%
37
 
1.6%
31
 
1.4%
Other values (141) 1059
46.8%
ASCII
ValueCountFrequency (%)
32
100.0%
Distinct212
Distinct (%)67.7%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
Minimum1997-07-18 00:00:00
Maximum2018-08-23 00:00:00
2023-12-12T15:12:14.536895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:12:14.834395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Correlations

2023-12-12T15:12:15.061366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
장비사용구분장비용도장비형태장비상태판매회사장치명칭장치형식명제조국명의료기관영업구분
장비사용구분1.0000.4690.0400.1520.9100.4470.5870.2760.414
장비용도0.4691.0000.7010.5530.9130.8500.9710.5910.000
장비형태0.0400.7011.0000.0280.8000.4130.9200.0000.000
장비상태0.1520.5530.0281.0000.8030.5030.7370.3270.000
판매회사0.9100.9130.8000.8031.0000.9850.9590.9881.000
장치명칭0.4470.8500.4130.5030.9851.0000.9480.5960.000
장치형식명0.5870.9710.9200.7370.9590.9481.0000.9200.000
제조국명0.2760.5910.0000.3270.9880.5960.9201.0000.120
의료기관영업구분0.4140.0000.0000.0001.0000.0000.0000.1201.000
2023-12-12T15:12:15.238567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
장비상태장비형태장비사용구분제조국명장치명칭장비용도의료기관영업구분
장비상태1.0000.0460.1140.1510.3110.2970.000
장비형태0.0461.0000.0480.0000.3170.6390.000
장비사용구분0.1140.0481.0000.1620.2420.2150.343
제조국명0.1510.0000.1621.0000.2980.2840.119
장치명칭0.3110.3170.2420.2981.0000.4840.000
장비용도0.2970.6390.2150.2840.4841.0000.000
의료기관영업구분0.0000.0000.3430.1190.0000.0001.000
2023-12-12T15:12:15.380066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
장비사용구분장비용도장비형태장비상태장치명칭제조국명의료기관영업구분
장비사용구분1.0000.2150.0480.1140.2420.1620.343
장비용도0.2151.0000.6390.2970.4840.2840.000
장비형태0.0480.6391.0000.0460.3170.0000.000
장비상태0.1140.2970.0461.0000.3110.1510.000
장치명칭0.2420.4840.3170.3111.0000.2980.000
제조국명0.1620.2840.0000.1510.2981.0000.119
의료기관영업구분0.3430.0000.0000.0000.0000.1191.000

Missing values

2023-12-12T15:12:07.900425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:12:08.110809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T15:12:08.226562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

신고일자장비사용구분장비용도장비형태장비상태판매회사장치명칭장치모델명장치형식명제조번호제조사제조국명개설일자의료기관영업구분의료기관명장비검사일자
02018-08-24사용중골밀도거치형신제품(주)한결메디칼진단용 엑스선 발생기DEXXUM TBHR-83-PDT1800229(주)오스테오시스대한민국2004-04-22영업중장내과의원2018-08-23
12018-07-09사용중일반거치형중고제품<NA>진단용 엑스선 장치MXHF-1500RR-500-125405009(주)엠아이에스대한민국2009-02-25영업중공주푸르메요양병원2018-07-06
22018-06-15사용중유방촬영거치형중고제품<NA>유방촬영용장치Affinity Mammography System & AccessoriesMR-100-3921103060365LORAD미국2018-05-18영업중속튼튼내과의원2018-06-07
32018-06-15사용중골밀도거치형중고제품<NA>진단용 엑스선 발생기OSTEOPRIMABHR-86-PMBA0801-123(주)메디레이대한민국2018-05-18영업중속튼튼내과의원2018-06-07
42018-06-15사용중일반거치형중고제품<NA>진단용 엑스선 장치DKⅡ-525RFRF-500-125GJ2-1020DK메디칼대한민국2018-05-18영업중속튼튼내과의원2018-06-07
52018-06-05사용중유방촬영거치형중고제품<NA>유방촬영용장치Senographe DSMR-100-49580279BU8GE프랑스2000-10-23영업중공주현대병원2018-05-31
62018-05-15사용중지검진차량이동형중고제품<NA>진단용 엑스선 장치RDR-MR-500-150RM-60-0501(주)금산메디칼대한민국2009-02-25영업중공주푸르메요양병원2016-11-11
72018-04-24사용중치과일반이동형신제품<NA>진단용 엑스선 발생기PROXHR-60-PP18A8002(주)디지메드대한민국2018-04-20영업중더바른치과의원2018-04-23
82018-04-24사용중치과용CT 및 파노라마거치형신제품<NA>치과용 전산화 단층 촬영장치Point 3D Combi 500SDCTP-90-PPNC5S-KEL018A16T(주)포인트닉스대한민국2018-04-20영업중더바른치과의원2018-04-23
92018-04-24사용중치과일반거치형신제품<NA>치과진단용 엑스선 발생장치ZEUS 100SDP-70-PPNZS-KEM011D0(주)포인트닉스대한민국2018-04-20영업중더바른치과의원2018-04-23
신고일자장비사용구분장비용도장비형태장비상태판매회사장치명칭장치모델명장치형식명제조번호제조사제조국명개설일자의료기관영업구분의료기관명장비검사일자
3031997-07-25폐기치과용 파노라마거치형신제품<NA>치과진단용 엑스선 발생장치CRANEX3+CEPHDP-81-PH64211대한민국대한민국1996-06-18폐업뿌리연합치과의원1997-07-18
3041997-07-25양도양수일반거치형신제품<NA>진단용 엑스선 장치C-300R-ADR-300-125960313Hyun-Dai Medical대한민국1997-07-23영업중대성새마을금고의원2012-03-15
3051997-12-20사용중촬영 및 투시거치형중고제품<NA>진단용 엑스선 장치R-300-100/ASA 300R-RR-300-10064아시아엑스선기계대한민국1995-04-21영업중유구성심의원2016-03-10
3061997-11-22사용중지촬영 및 투시거치형중고제품<NA>진단용 엑스선 장치DA-3001R-100-10052아시아엑스선기계대한민국1987-01-21영업중허외과의원2012-09-17
3071997-06-21양도양수촬영 및 투시거치형신제품<NA>진단용 엑스선 발생기DHF-105CX-6HRF-100-P(R-100-80)SX-16265503HITACHI일본1985-01-08폐업윤정형외과의원2015-11-09
3081996-06-03사용중치과일반거치형신제품<NA>치과진단용 엑스선 발생장치MAX-GLSD-60-P2353신흥대한민국1998-04-24영업중정치과의원2018-04-19
3091995-01-11사용중지치과일반거치형신제품<NA>치과진단용 엑스선 발생장치MAX-GLD-60-P11154신흥대한민국1989-01-01영업중공주시보건소2003-07-28
3101995-01-11폐기치과일반거치형신제품<NA>치과진단용 엑스선 발생장치DS-SD-60-S11889동서대한민국1992-04-30영업중서울치과의원2012-02-02
3111995-01-11폐기치과일반거치형신제품<NA>치과진단용 엑스선 발생장치REX-601D-60-PBI-079MYOSHIDA일본1991-11-05영업중최유상치과의원2007-02-21
3121995-01-11폐기촬영 및 투시거치형중고제품<NA>진단용 엑스선 장치DA-3001R-300-100K0200RSN0007현대의료기기대한민국1974-08-23폐업이건우의원2015-09-11