Overview

Dataset statistics

Number of variables8
Number of observations771
Missing cells125
Missing cells (%)2.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory48.3 KiB
Average record size in memory64.2 B

Variable types

Text5
Categorical2
Boolean1

Dataset

Description국립소록도병원에 입원중인 한센인의 질병 치료를 목적으로 사용하는 의약품 정보로 약품코드, 주성분, 약품분류, 마약류여부, 약품명칭, 한글명칭, 사용여부를 제공합니다.
Author공공데이터포털
URLhttps://www.data.go.kr/data/3069474/fileData.do

Alerts

마약류 is highly imbalanced (83.7%)Imbalance
주성분 has 67 (8.7%) missing valuesMissing
약품분류 has 26 (3.4%) missing valuesMissing
한글명칭 has 28 (3.6%) missing valuesMissing

Reproduction

Analysis started2024-04-21 16:28:53.981535
Analysis finished2024-04-21 16:28:56.078249
Duration2.1 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct770
Distinct (%)100.0%
Missing1
Missing (%)0.1%
Memory size6.1 KiB
2024-04-22T01:28:57.204070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length4.9168831
Min length2

Characters and Unicode

Total characters3786
Distinct characters40
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique770 ?
Unique (%)100.0%

Sample

1st rowAAP650
2nd rowABDZ
3rd rowABLI2
4th rowABLI5
5th rowABOD10
ValueCountFrequency (%)
aap650 1
 
0.1%
rsp2 1
 
0.1%
doxa 1
 
0.1%
d-lind 1
 
0.1%
d-loc 1
 
0.1%
dmd 1
 
0.1%
d-nzr 1
 
0.1%
doapcr 1
 
0.1%
dobe 1
 
0.1%
dom10 1
 
0.1%
Other values (760) 760
98.7%
2024-04-22T01:28:58.951999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 376
 
9.9%
I 325
 
8.6%
A 229
 
6.0%
T 217
 
5.7%
P 202
 
5.3%
L 201
 
5.3%
D 189
 
5.0%
O 185
 
4.9%
C 174
 
4.6%
M 155
 
4.1%
Other values (30) 1533
40.5%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 3048
80.5%
Dash Punctuation 376
 
9.9%
Decimal Number 354
 
9.4%
Other Punctuation 7
 
0.2%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
I 325
 
10.7%
A 229
 
7.5%
T 217
 
7.1%
P 202
 
6.6%
L 201
 
6.6%
D 189
 
6.2%
O 185
 
6.1%
C 174
 
5.7%
M 155
 
5.1%
S 151
 
5.0%
Other values (16) 1020
33.5%
Decimal Number
ValueCountFrequency (%)
0 97
27.4%
5 83
23.4%
1 64
18.1%
2 58
16.4%
3 23
 
6.5%
4 13
 
3.7%
6 6
 
1.7%
7 5
 
1.4%
8 4
 
1.1%
9 1
 
0.3%
Other Punctuation
ValueCountFrequency (%)
/ 5
71.4%
. 2
 
28.6%
Dash Punctuation
ValueCountFrequency (%)
- 376
100.0%
Lowercase Letter
ValueCountFrequency (%)
a 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3049
80.5%
Common 737
 
19.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
I 325
 
10.7%
A 229
 
7.5%
T 217
 
7.1%
P 202
 
6.6%
L 201
 
6.6%
D 189
 
6.2%
O 185
 
6.1%
C 174
 
5.7%
M 155
 
5.1%
S 151
 
5.0%
Other values (17) 1021
33.5%
Common
ValueCountFrequency (%)
- 376
51.0%
0 97
 
13.2%
5 83
 
11.3%
1 64
 
8.7%
2 58
 
7.9%
3 23
 
3.1%
4 13
 
1.8%
6 6
 
0.8%
/ 5
 
0.7%
7 5
 
0.7%
Other values (3) 7
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3786
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 376
 
9.9%
I 325
 
8.6%
A 229
 
6.0%
T 217
 
5.7%
P 202
 
5.3%
L 201
 
5.3%
D 189
 
5.0%
O 185
 
4.9%
C 174
 
4.6%
M 155
 
4.1%
Other values (30) 1533
40.5%

주성분
Text

MISSING 

Distinct693
Distinct (%)98.4%
Missing67
Missing (%)8.7%
Memory size6.1 KiB
2024-04-22T01:28:59.960754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length9.0014205
Min length8

Characters and Unicode

Total characters6337
Distinct characters30
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique683 ?
Unique (%)97.0%

Sample

1st row101430ATR
2nd row104303ATB
3rd row451504ATB
4th row451503ATB
5th row451501ATD
ValueCountFrequency (%)
101305atb 3
 
0.4%
333300cos 2
 
0.3%
101501atb 2
 
0.3%
247402bij 2
 
0.3%
246132ccm 2
 
0.3%
216201clq 2
 
0.3%
635300csi 2
 
0.3%
204002atb 2
 
0.3%
256500atr 2
 
0.3%
152103bij 2
 
0.3%
Other values (684) 684
97.0%
2024-04-22T01:29:01.420535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 925
14.6%
1 820
12.9%
2 551
 
8.7%
3 468
 
7.4%
B 452
 
7.1%
A 382
 
6.0%
4 343
 
5.4%
T 308
 
4.9%
5 294
 
4.6%
C 223
 
3.5%
Other values (20) 1571
24.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4215
66.5%
Uppercase Letter 2121
33.5%
Space Separator 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
B 452
21.3%
A 382
18.0%
T 308
14.5%
C 223
10.5%
I 198
9.3%
J 183
8.6%
S 92
 
4.3%
O 68
 
3.2%
H 37
 
1.7%
L 35
 
1.7%
Other values (9) 143
 
6.7%
Decimal Number
ValueCountFrequency (%)
0 925
21.9%
1 820
19.5%
2 551
13.1%
3 468
11.1%
4 343
 
8.1%
5 294
 
7.0%
6 222
 
5.3%
7 219
 
5.2%
8 197
 
4.7%
9 176
 
4.2%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4216
66.5%
Latin 2121
33.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
B 452
21.3%
A 382
18.0%
T 308
14.5%
C 223
10.5%
I 198
9.3%
J 183
8.6%
S 92
 
4.3%
O 68
 
3.2%
H 37
 
1.7%
L 35
 
1.7%
Other values (9) 143
 
6.7%
Common
ValueCountFrequency (%)
0 925
21.9%
1 820
19.4%
2 551
13.1%
3 468
11.1%
4 343
 
8.1%
5 294
 
7.0%
6 222
 
5.3%
7 219
 
5.2%
8 197
 
4.7%
9 176
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6337
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 925
14.6%
1 820
12.9%
2 551
 
8.7%
3 468
 
7.4%
B 452
 
7.1%
A 382
 
6.0%
4 343
 
5.4%
T 308
 
4.9%
5 294
 
4.6%
C 223
 
3.5%
Other values (20) 1571
24.8%

약품분류
Text

MISSING 

Distinct104
Distinct (%)14.0%
Missing26
Missing (%)3.4%
Memory size6.1 KiB
2024-04-22T01:29:02.435955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length22
Mean length11.914094
Min length7

Characters and Unicode

Total characters8876
Distinct characters202
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)3.2%

Sample

1st row114 해열.진통.소염제
2nd row642 구충제
3rd row117 정신신경용제
4th row117 정신신경용제
5th row117 정신신경용제
ValueCountFrequency (%)
기타의 125
 
6.3%
117 58
 
2.9%
정신신경용제 58
 
2.9%
131 52
 
2.6%
안과용제 52
 
2.6%
주로 33
 
1.7%
32
 
1.6%
의약품 30
 
1.5%
해열.진통.소염제 29
 
1.5%
114 29
 
1.5%
Other values (229) 1476
74.8%
2024-04-22T01:29:03.930993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1231
 
13.9%
1 677
 
7.6%
659
 
7.4%
2 479
 
5.4%
3 376
 
4.2%
351
 
4.0%
9 207
 
2.3%
186
 
2.1%
171
 
1.9%
6 168
 
1.9%
Other values (192) 4371
49.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5198
58.6%
Decimal Number 2240
25.2%
Space Separator 1231
 
13.9%
Other Punctuation 135
 
1.5%
Uppercase Letter 32
 
0.4%
Close Punctuation 20
 
0.2%
Open Punctuation 20
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
659
 
12.7%
351
 
6.8%
186
 
3.6%
171
 
3.3%
160
 
3.1%
155
 
3.0%
122
 
2.3%
112
 
2.2%
111
 
2.1%
96
 
1.8%
Other values (169) 3075
59.2%
Decimal Number
ValueCountFrequency (%)
1 677
30.2%
2 479
21.4%
3 376
16.8%
9 207
 
9.2%
6 168
 
7.5%
4 131
 
5.8%
7 80
 
3.6%
5 66
 
2.9%
8 47
 
2.1%
0 9
 
0.4%
Uppercase Letter
ValueCountFrequency (%)
B 8
25.0%
A 6
18.8%
D 5
15.6%
P 4
12.5%
C 4
12.5%
X 3
 
9.4%
K 1
 
3.1%
E 1
 
3.1%
Other Punctuation
ValueCountFrequency (%)
. 92
68.1%
, 43
31.9%
Space Separator
ValueCountFrequency (%)
1231
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5198
58.6%
Common 3646
41.1%
Latin 32
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
659
 
12.7%
351
 
6.8%
186
 
3.6%
171
 
3.3%
160
 
3.1%
155
 
3.0%
122
 
2.3%
112
 
2.2%
111
 
2.1%
96
 
1.8%
Other values (169) 3075
59.2%
Common
ValueCountFrequency (%)
1231
33.8%
1 677
18.6%
2 479
 
13.1%
3 376
 
10.3%
9 207
 
5.7%
6 168
 
4.6%
4 131
 
3.6%
. 92
 
2.5%
7 80
 
2.2%
5 66
 
1.8%
Other values (5) 139
 
3.8%
Latin
ValueCountFrequency (%)
B 8
25.0%
A 6
18.8%
D 5
15.6%
P 4
12.5%
C 4
12.5%
X 3
 
9.4%
K 1
 
3.1%
E 1
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5198
58.6%
ASCII 3678
41.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1231
33.5%
1 677
18.4%
2 479
 
13.0%
3 376
 
10.2%
9 207
 
5.6%
6 168
 
4.6%
4 131
 
3.6%
. 92
 
2.5%
7 80
 
2.2%
5 66
 
1.8%
Other values (13) 171
 
4.6%
Hangul
ValueCountFrequency (%)
659
 
12.7%
351
 
6.8%
186
 
3.6%
171
 
3.3%
160
 
3.1%
155
 
3.0%
122
 
2.3%
112
 
2.2%
111
 
2.1%
96
 
1.8%
Other values (169) 3075
59.2%

약물구분
Categorical

Distinct5
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
내복약(조제)
304 
주사제
208 
내복약(비조제)
116 
외용약(기타)
101 
외용약(안약)
42 

Length

Max length8
Median length7
Mean length6.0713359
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row내복약(조제)
2nd row내복약(비조제)
3rd row내복약(조제)
4th row내복약(조제)
5th row내복약(비조제)

Common Values

ValueCountFrequency (%)
내복약(조제) 304
39.4%
주사제 208
27.0%
내복약(비조제) 116
 
15.0%
외용약(기타) 101
 
13.1%
외용약(안약) 42
 
5.4%

Length

2024-04-22T01:29:04.343617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T01:29:04.679396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
내복약(조제 304
39.4%
주사제 208
27.0%
내복약(비조제 116
 
15.0%
외용약(기타 101
 
13.1%
외용약(안약 42
 
5.4%

마약류
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
일반
743 
향정
 
19
마약
 
9

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 743
96.4%
향정 19
 
2.5%
마약 9
 
1.2%

Length

2024-04-22T01:29:05.049986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T01:29:05.357300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 743
96.4%
향정 19
 
2.5%
마약 9
 
1.2%
Distinct765
Distinct (%)99.6%
Missing3
Missing (%)0.4%
Memory size6.1 KiB
2024-04-22T01:29:06.309525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length100
Median length61
Mean length27.950521
Min length3

Characters and Unicode

Total characters21466
Distinct characters262
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique762 ?
Unique (%)99.2%

Sample

1st rowAcetaminophen 650mg/Tab
2nd rowAlbendazole 400mg/Tab
3rd rowAripiprazole 2mg
4th rowAripiprazole 5mg
5th rowAripiprazole OD 10mg/Tab
ValueCountFrequency (%)
hcl 108
 
4.2%
sodium 58
 
2.2%
eye 40
 
1.5%
drop 39
 
1.5%
10mg/tab 21
 
0.8%
5mg/tab 18
 
0.7%
acid 18
 
0.7%
chloride 16
 
0.6%
sulfate 15
 
0.6%
100mg/tab 15
 
0.6%
Other values (1347) 2239
86.5%
2024-04-22T01:29:07.792816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1824
 
8.5%
a 1320
 
6.1%
e 1264
 
5.9%
i 1213
 
5.7%
m 1155
 
5.4%
l 1059
 
4.9%
o 1056
 
4.9%
0 836
 
3.9%
n 808
 
3.8%
r 772
 
3.6%
Other values (252) 10159
47.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 13345
62.2%
Decimal Number 2297
 
10.7%
Uppercase Letter 1937
 
9.0%
Space Separator 1824
 
8.5%
Other Punctuation 1141
 
5.3%
Other Letter 584
 
2.7%
Other Symbol 179
 
0.8%
Open Punctuation 61
 
0.3%
Close Punctuation 59
 
0.3%
Dash Punctuation 36
 
0.2%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
5.7%
33
 
5.7%
29
 
5.0%
20
 
3.4%
17
 
2.9%
14
 
2.4%
13
 
2.2%
13
 
2.2%
12
 
2.1%
11
 
1.9%
Other values (171) 389
66.6%
Lowercase Letter
ValueCountFrequency (%)
a 1320
 
9.9%
e 1264
 
9.5%
i 1213
 
9.1%
m 1155
 
8.7%
l 1059
 
7.9%
o 1056
 
7.9%
n 808
 
6.1%
r 772
 
5.8%
t 732
 
5.5%
g 647
 
4.8%
Other values (18) 3319
24.9%
Uppercase Letter
ValueCountFrequency (%)
T 337
17.4%
C 298
15.4%
A 173
8.9%
H 156
 
8.1%
B 123
 
6.4%
S 118
 
6.1%
D 88
 
4.5%
P 86
 
4.4%
L 76
 
3.9%
E 69
 
3.6%
Other values (15) 413
21.3%
Decimal Number
ValueCountFrequency (%)
0 836
36.4%
5 420
18.3%
1 355
15.5%
2 301
 
13.1%
3 119
 
5.2%
4 100
 
4.4%
6 64
 
2.8%
7 39
 
1.7%
8 38
 
1.7%
9 25
 
1.1%
Other Punctuation
ValueCountFrequency (%)
/ 757
66.3%
. 231
 
20.2%
% 100
 
8.8%
, 41
 
3.6%
: 6
 
0.5%
& 3
 
0.3%
' 2
 
0.2%
· 1
 
0.1%
Other Symbol
ValueCountFrequency (%)
138
77.1%
26
 
14.5%
14
 
7.8%
1
 
0.6%
Space Separator
ValueCountFrequency (%)
1824
100.0%
Open Punctuation
ValueCountFrequency (%)
( 61
100.0%
Close Punctuation
ValueCountFrequency (%)
) 59
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 36
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 15279
71.2%
Common 5600
 
26.1%
Hangul 584
 
2.7%
Greek 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
5.7%
33
 
5.7%
29
 
5.0%
20
 
3.4%
17
 
2.9%
14
 
2.4%
13
 
2.2%
13
 
2.2%
12
 
2.1%
11
 
1.9%
Other values (171) 389
66.6%
Latin
ValueCountFrequency (%)
a 1320
 
8.6%
e 1264
 
8.3%
i 1213
 
7.9%
m 1155
 
7.6%
l 1059
 
6.9%
o 1056
 
6.9%
n 808
 
5.3%
r 772
 
5.1%
t 732
 
4.8%
g 647
 
4.2%
Other values (41) 5253
34.4%
Common
ValueCountFrequency (%)
1824
32.6%
0 836
14.9%
/ 757
13.5%
5 420
 
7.5%
1 355
 
6.3%
2 301
 
5.4%
. 231
 
4.1%
138
 
2.5%
3 119
 
2.1%
% 100
 
1.8%
Other values (18) 519
 
9.3%
Greek
ValueCountFrequency (%)
α 2
66.7%
β 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 20699
96.4%
Hangul 584
 
2.7%
CJK Compat 179
 
0.8%
None 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1824
 
8.8%
a 1320
 
6.4%
e 1264
 
6.1%
i 1213
 
5.9%
m 1155
 
5.6%
l 1059
 
5.1%
o 1056
 
5.1%
0 836
 
4.0%
n 808
 
3.9%
r 772
 
3.7%
Other values (64) 9392
45.4%
CJK Compat
ValueCountFrequency (%)
138
77.1%
26
 
14.5%
14
 
7.8%
1
 
0.6%
Hangul
ValueCountFrequency (%)
33
 
5.7%
33
 
5.7%
29
 
5.0%
20
 
3.4%
17
 
2.9%
14
 
2.4%
13
 
2.2%
13
 
2.2%
12
 
2.1%
11
 
1.9%
Other values (171) 389
66.6%
None
ValueCountFrequency (%)
α 2
50.0%
β 1
25.0%
· 1
25.0%

한글명칭
Text

MISSING 

Distinct738
Distinct (%)99.3%
Missing28
Missing (%)3.6%
Memory size6.1 KiB
2024-04-22T01:29:08.772407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length80
Median length36
Mean length14.397039
Min length2

Characters and Unicode

Total characters10697
Distinct characters444
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique733 ?
Unique (%)98.7%

Sample

1st row아세트아미노펜서방정 650mg
2nd row알벤다졸정 400mg
3rd row아리피프라졸 2mg
4th row아리피프라졸정 5mg
5th row아리피프라졸 OD 10mg 정
ValueCountFrequency (%)
102
 
5.0%
88
 
4.3%
점안액 39
 
1.9%
캡슐 29
 
1.4%
염산 27
 
1.3%
5mg 24
 
1.2%
10mg 24
 
1.2%
100mg 21
 
1.0%
50mg 16
 
0.8%
크림 15
 
0.7%
Other values (1134) 1641
81.0%
2024-04-22T01:29:10.258816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1284
 
12.0%
0 613
 
5.7%
m 507
 
4.7%
g 434
 
4.1%
5 313
 
2.9%
1 256
 
2.4%
256
 
2.4%
/ 252
 
2.4%
2 209
 
2.0%
160
 
1.5%
Other values (434) 6413
60.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5521
51.6%
Decimal Number 1620
 
15.1%
Lowercase Letter 1448
 
13.5%
Space Separator 1284
 
12.0%
Other Punctuation 468
 
4.4%
Uppercase Letter 183
 
1.7%
Other Symbol 82
 
0.8%
Close Punctuation 34
 
0.3%
Open Punctuation 34
 
0.3%
Dash Punctuation 23
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
256
 
4.6%
160
 
2.9%
160
 
2.9%
145
 
2.6%
140
 
2.5%
123
 
2.2%
113
 
2.0%
111
 
2.0%
100
 
1.8%
99
 
1.8%
Other values (366) 4114
74.5%
Lowercase Letter
ValueCountFrequency (%)
m 507
35.0%
g 434
30.0%
l 135
 
9.3%
a 57
 
3.9%
e 50
 
3.5%
i 36
 
2.5%
n 32
 
2.2%
o 29
 
2.0%
r 26
 
1.8%
t 22
 
1.5%
Other values (13) 120
 
8.3%
Uppercase Letter
ValueCountFrequency (%)
C 20
10.9%
T 17
 
9.3%
L 16
 
8.7%
B 14
 
7.7%
I 13
 
7.1%
S 13
 
7.1%
D 12
 
6.6%
U 12
 
6.6%
P 11
 
6.0%
A 10
 
5.5%
Other values (11) 45
24.6%
Decimal Number
ValueCountFrequency (%)
0 613
37.8%
5 313
19.3%
1 256
15.8%
2 209
 
12.9%
3 74
 
4.6%
4 60
 
3.7%
6 39
 
2.4%
7 27
 
1.7%
8 20
 
1.2%
9 9
 
0.6%
Other Punctuation
ValueCountFrequency (%)
/ 252
53.8%
. 121
25.9%
% 74
 
15.8%
, 15
 
3.2%
: 4
 
0.9%
' 1
 
0.2%
· 1
 
0.2%
Other Symbol
ValueCountFrequency (%)
59
72.0%
13
 
15.9%
10
 
12.2%
Space Separator
ValueCountFrequency (%)
1284
100.0%
Close Punctuation
ValueCountFrequency (%)
) 34
100.0%
Open Punctuation
ValueCountFrequency (%)
( 34
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5521
51.6%
Common 3545
33.1%
Latin 1631
 
15.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
256
 
4.6%
160
 
2.9%
160
 
2.9%
145
 
2.6%
140
 
2.5%
123
 
2.2%
113
 
2.0%
111
 
2.0%
100
 
1.8%
99
 
1.8%
Other values (366) 4114
74.5%
Latin
ValueCountFrequency (%)
m 507
31.1%
g 434
26.6%
l 135
 
8.3%
a 57
 
3.5%
e 50
 
3.1%
i 36
 
2.2%
n 32
 
2.0%
o 29
 
1.8%
r 26
 
1.6%
t 22
 
1.3%
Other values (34) 303
18.6%
Common
ValueCountFrequency (%)
1284
36.2%
0 613
17.3%
5 313
 
8.8%
1 256
 
7.2%
/ 252
 
7.1%
2 209
 
5.9%
. 121
 
3.4%
3 74
 
2.1%
% 74
 
2.1%
4 60
 
1.7%
Other values (14) 289
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5521
51.6%
ASCII 5093
47.6%
CJK Compat 82
 
0.8%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1284
25.2%
0 613
12.0%
m 507
 
10.0%
g 434
 
8.5%
5 313
 
6.1%
1 256
 
5.0%
/ 252
 
4.9%
2 209
 
4.1%
l 135
 
2.7%
. 121
 
2.4%
Other values (54) 969
19.0%
Hangul
ValueCountFrequency (%)
256
 
4.6%
160
 
2.9%
160
 
2.9%
145
 
2.6%
140
 
2.5%
123
 
2.2%
113
 
2.0%
111
 
2.0%
100
 
1.8%
99
 
1.8%
Other values (366) 4114
74.5%
CJK Compat
ValueCountFrequency (%)
59
72.0%
13
 
15.9%
10
 
12.2%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size899.0 B
True
462 
False
309 
ValueCountFrequency (%)
True 462
59.9%
False 309
40.1%
2024-04-22T01:29:10.609838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-22T01:29:10.800842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
약물구분마약류사용여부
약물구분1.0000.1030.144
마약류0.1031.0000.038
사용여부0.1440.0381.000
2024-04-22T01:29:11.044159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용여부약물구분마약류
사용여부1.0000.1760.062
약물구분0.1761.0000.077
마약류0.0620.0771.000
2024-04-22T01:29:11.285365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
약물구분마약류사용여부
약물구분1.0000.0770.176
마약류0.0771.0000.062
사용여부0.1760.0621.000

Missing values

2024-04-22T01:28:54.930064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-22T01:28:55.592239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-22T01:28:55.909993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

약품코드주성분약품분류약물구분마약류약품명칭한글명칭사용여부
0AAP650101430ATR114 해열.진통.소염제내복약(조제)일반Acetaminophen 650mg/Tab아세트아미노펜서방정 650mgY
1ABDZ104303ATB642 구충제내복약(비조제)일반Albendazole 400mg/Tab알벤다졸정 400mgY
2ABLI2451504ATB117 정신신경용제내복약(조제)일반Aripiprazole 2mg아리피프라졸 2mgY
3ABLI5451503ATB117 정신신경용제내복약(조제)일반Aripiprazole 5mg아리피프라졸정 5mgY
4ABOD10451501ATD117 정신신경용제내복약(비조제)일반Aripiprazole OD 10mg/Tab아리피프라졸 OD 10mg 정Y
5ACTI260300ATB141 항히스타민제내복약(조제)일반Triprolidine HCl 2.5mg/Pseudoephedrine HCl 60mg Tab트리프롤리딘/슈도에페드린정Y
6ACTN150442330ATB399 따로 분류되지 않는 대사성 의약품내복약(비조제)일반Risedronate sodium 150mg/Tab리세드론산 나트륨 정 150mgY
7ACTN35511200ATB399 따로 분류되지 않는 대사성 의약품내복약(비조제)일반Risedronate sodium 35㎎/Cholecalciferol 5600IU/Tab리세드론산 나트륨 35㎎/콜레칼시페롤 5600IU/정Y
8ADT25231101ATB213 이뇨제내복약(조제)일반Spironolactone 25mg/Tab스피로노락톤정 25mgY
9AGIO214630AGN238 하제,완장제내복약(비조제)일반Agiocur pregranules 6g/포아기오쿨원료과립 6g/포Y
약품코드주성분약품분류약물구분마약류약품명칭한글명칭사용여부
761VENIT163601ATB215 혈관보강제내복약(조제)일반Fraction Flavonoid Purifiee Micronise 500mg/Tab베니톨정 500mgN
762VITB6221603ATB313 비타민 B 제(비타민 B1 제외)내복약(조제)일반Pyridoxine HCl 50mg/Tab염산 피리독신 정 50mgN
763VITC110404ATB314 비타민 C 및 P 제내복약(조제)일반Ascorbic Acid 500mg/Tab<NA>N
764VIV617101ATB399 따로 분류되지 않는 대사성 의약품내복약(조제)일반Bazedoxifene acetate 20mg바제독시펜정N
765VLIDO183902CLQ121 국소마취제외용약(기타)일반Lidocaine Viscous Soln. 2% 100ml/Btl<NA>N
766VPA247001ACS113 항전간제내복약(조제)일반Valproic acid 250mg/Cap발프로익산 연질캅셀 250mgN
767WEL150428102 ATR117 정신신경용제내복약(조제)일반Bupropion HCl 150mg부프로피온 XL 150mgN
768X2032<NA><NA>주사제일반WRC(Washed RBC 400ml)WRC(Washed RBC 400ml) (혈액)N
769ZOCO227801ATB218 동맥경화용제내복약(조제)일반Simvastatin 20mg/Tab심바스타틴정 20mgN
770ZPD6250503ATR112 최면진정제내복약(조제)향정Zolpidem tartrate 6.25㎎/Tab졸피뎀 6.25mg 정N