Overview

Dataset statistics

Number of variables: 6
Number of observations: 31
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 KiB
Average record size in memory: 54.3 B
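
The counts above can be reproduced from the raw rows. The following is a minimal sketch, not the profiler's own implementation; it assumes rows are held as dictionaries and that `None` or an empty string counts as a missing cell. The function name `dataset_statistics` is hypothetical, and the three rows are taken from the Sample section of this report.

```python
def dataset_statistics(rows):
    """Summarise a list of row dicts; None and "" are treated as missing (an assumption)."""
    n_obs = len(rows)
    n_vars = len(rows[0]) if rows else 0
    total_cells = n_obs * n_vars
    missing = sum(1 for row in rows for value in row.values() if value in (None, ""))

    # A row is a duplicate if an identical row was already seen.
    seen = set()
    duplicates = 0
    for row in rows:
        key = tuple(sorted(row.items()))
        if key in seen:
            duplicates += 1
        else:
            seen.add(key)

    return {
        "variables": n_vars,
        "observations": n_obs,
        "missing_cells": missing,
        "missing_cells_pct": 100.0 * missing / total_cells if total_cells else 0.0,
        "duplicate_rows": duplicates,
        "duplicate_rows_pct": 100.0 * duplicates / n_obs if n_obs else 0.0,
    }


# Three rows copied from the Sample section (column names as in the dataset).
rows = [
    {"약품명": "딜라트렌이스알정16밀리그램(카르베딜롤)_(16mg/1정)", "투여구분코드": "1",
     "주성분명": "carvedilol 16mg", "보건복지부효능코드": "214",
     "질병코드": "A", "등록일시": "2021-05-24"},
    {"약품명": "발트리오정10/160/20밀리그램_(1정)", "투여구분코드": "1",
     "주성분명": "amlodipine besylate (as amlodipine 10mg)", "보건복지부효능코드": "219",
     "질병코드": "AC", "등록일시": "2021-05-27"},
    {"약품명": "네시나메트정12.5/850밀리그램_(1정)", "투여구분코드": "1",
     "주성분명": "alogliptin benzoate (as alogliptin 12.5mg)", "보건복지부효능코드": "396",
     "질병코드": "B", "등록일시": "2021-05-24"},
]
stats = dataset_statistics(rows)
```

On the full 31-row file the same function would be expected to report zero missing cells and zero duplicate rows, matching the figures above, provided the file encodes missing values the way the sketch assumes.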

Variable types

Text: 1
Categorical: 4
DateTime: 1

Dataset

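Description: Drug master list used to identify hypertension and diabetes patients who are not receiving appropriate medication.
- Content: drugs newly registered in 2021 for identifying patients targeted for appropriate-medication review
- Fields: drug name, administration route code, main ingredient name, Ministry of Health and Welfare efficacy code, disease code
1. 약품명 (drug name): the drug name as registered in Korea
2. 투여구분코드 (administration route code): 1 = oral, 2 = injection, 3 = external use
3. 주성분명 (main ingredient name): the active ingredient contained in the drug
4. 보건복지부효능코드 (Ministry of Health and Welfare efficacy code): 212 = antiarrhythmic agents, 213 = diuretics, 214 = antihypertensive agents, 217 = vasodilators, 218 = agents for arteriosclerosis, 219 = other cardiovascular agents, 396 = antidiabetic agents
5. 질병코드 (disease code): A = hypertension, B = diabetes, C = hyperlipidemia, AC = hypertension + hyperlipidemia, BC = diabetes + hyperlipidemia
6. 시스템등록일시 (system registration date): the date the record was entered on screen (yyyy-mm-dd)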
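
The code tables in the description lend themselves to simple lookup dictionaries. The sketch below is illustrative only: the English labels are translations of the Korean description, and the names `ROUTE_CODES`, `EFFICACY_CODES`, `DISEASE_CODES` and `decode_row` are hypothetical helpers, not part of the dataset or of any library.

```python
# Code tables transcribed from the dataset description (labels translated to English).
ROUTE_CODES = {"1": "oral", "2": "injection", "3": "external use"}

EFFICACY_CODES = {
    "212": "antiarrhythmic agents",
    "213": "diuretics",
    "214": "antihypertensive agents",
    "217": "vasodilators",
    "218": "agents for arteriosclerosis",
    "219": "other cardiovascular agents",
    "396": "antidiabetic agents",
}

DISEASE_CODES = {
    "A": "hypertension",
    "B": "diabetes",
    "C": "hyperlipidemia",
    "AC": "hypertension + hyperlipidemia",
    "BC": "diabetes + hyperlipidemia",
}


def decode_row(route_code, efficacy_code, disease_code):
    """Map the three coded columns to readable labels; unknown codes map to 'unknown'."""
    return {
        "route": ROUTE_CODES.get(route_code, "unknown"),
        "efficacy": EFFICACY_CODES.get(efficacy_code, "unknown"),
        "disease": DISEASE_CODES.get(disease_code, "unknown"),
    }


# The most common combination in this report: oral, code 219, code AC.
example = decode_row("1", "219", "AC")
```

Keeping codes as strings rather than integers mirrors how the report treats these columns (as categorical text of fixed length).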
URL: https://www.data.go.kr/data/15120897/fileData.do

Alerts

투여구분코드 has constant value "1" [Constant]
질병코드 is highly overall correlated with 주성분명 and 1 other field [High correlation]
보건복지부효능코드 is highly overall correlated with 주성분명 and 1 other field [High correlation]
주성분명 is highly overall correlated with 보건복지부효능코드 and 1 other field [High correlation]
약품명 has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 10:56:59.331790
Analysis finished: 2023-12-12 10:56:59.968153
Duration: 0.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

약품명
Text

UNIQUE 

Distinct: 31
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 39
Median length: 30
Mean length: 23.806452
Min length: 18
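
Length statistics of this kind can be computed with the standard library. The sketch below uses only the five drug names listed under Sample, so its results differ from the full-dataset figures above; it also assumes length is counted in Unicode code points, which is what Python's `len` returns for a string. As a cross-check on the report itself, the mean length of 23.806452 equals the total character count (738) divided by the number of observations (31).

```python
from statistics import mean, median

# The five drug names shown under "Sample" for this variable.
names = [
    "딜라트렌이스알정16밀리그램(카르베딜롤)_(16mg/1정)",
    "발트리오정10/160/20밀리그램_(1정)",
    "네비로스타정2.5/5밀리그램_(1정)",
    "엑스원에이정5/80/10밀리그램_(1정)",
    "아발탄에이플러스정5/160/20밀리그램_(1정)",
]

lengths = [len(name) for name in names]  # code points, not bytes

summary = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}

# Cross-check of the reported mean: total characters / observations.
reported_mean = round(738 / 31, 6)
```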

Characters and Unicode

Total characters: 738
Distinct characters: 77
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
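
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one drug name from the Sample rows; the helper names `category_counts` and `hangul_syllable_count` are hypothetical, and the Hangul check uses the Hangul Syllables range U+AC00 to U+D7A3 as a stand-in for a script or block lookup, which the standard library does not provide directly.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count Unicode general categories (e.g. 'Lo', 'Nd', 'Po') in a string."""
    return Counter(unicodedata.category(ch) for ch in text)


def hangul_syllable_count(text):
    """Count characters in the Hangul Syllables range U+AC00..U+D7A3."""
    return sum(1 for ch in text if "\uac00" <= ch <= "\ud7a3")


name = "발트리오정10/160/20밀리그램_(1정)"
counts = category_counts(name)
hangul = hangul_syllable_count(name)
```

The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Nd` is Decimal Number, `Po` is Other Punctuation, `Ps` and `Pe` are Open and Close Punctuation, `Pc` is Connector Punctuation, and `Ll` is Lowercase Letter.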

Unique

Unique: 31
Unique (%): 100.0%

Sample

1st row: 딜라트렌이스알정16밀리그램(카르베딜롤)_(16mg/1정)
2nd row: 발트리오정10/160/20밀리그램_(1정)
3rd row: 네비로스타정2.5/5밀리그램_(1정)
4th row: 엑스원에이정5/80/10밀리그램_(1정)
5th row: 아발탄에이플러스정5/160/20밀리그램_(1정)
Value | Count | Frequency (%)
딜라트렌이스알정16밀리그램(카르베딜롤)_(16mg/1정 | 1 | 3.2%
네비로스타정1.25/5밀리그램_(1정 | 1 | 3.2%
네시나메트정12.5/850밀리그램_(1정 | 1 | 3.2%
아카브정120/40밀리그램_(1정 | 1 | 3.2%
아카브정60/10밀리그램_(1정 | 1 | 3.2%
아모디핀정2.5밀리그램(암로디핀칸실산염)_(3.921mg/1정 | 1 | 3.2%
엑스원에이정5/160/10밀리그램_(1정 | 1 | 3.2%
아바트리정5/160/20밀리그램_(1정 | 1 | 3.2%
아바트리정5/80/10밀리그램_(1정 | 1 | 3.2%
올로맥스정40/5/5밀리그램_1(정 | 1 | 3.2%
Other values (21) | 21 | 67.7%

Most occurring characters

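Value | Count | Frequency (%)
1 | 62 | 8.4%
[Hangul character not captured] | 58 | 7.9%
0 | 50 | 6.8%
/ | 47 | 6.4%
) | 39 | 5.3%
( | 39 | 5.3%
[Hangul character not captured] | 36 | 4.9%
_ | 31 | 4.2%
[Hangul character not captured] | 30 | 4.1%
[Hangul character not captured] | 30 | 4.1%
Other values (67) | 316 | 42.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 377 | 51.1%
Decimal Number | 185 | 25.1%
Other Punctuation | 54 | 7.3%
Close Punctuation | 39 | 5.3%
Open Punctuation | 39 | 5.3%
Connector Punctuation | 31 | 4.2%
Lowercase Letter | 13 | 1.8%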

Most frequent character per category

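Other Letter
Value | Count | Frequency (%)
[Hangul character not captured] | 58 | 15.4%
[Hangul character not captured] | 36 | 9.5%
[Hangul character not captured] | 30 | 8.0%
[Hangul character not captured] | 30 | 8.0%
[Hangul character not captured] | 29 | 7.7%
[Hangul character not captured] | 18 | 4.8%
[Hangul character not captured] | 14 | 3.7%
[Hangul character not captured] | 13 | 3.4%
[Hangul character not captured] | 11 | 2.9%
[Hangul character not captured] | 9 | 2.4%
Other values (50) | 129 | 34.2%

Decimal Number
Value | Count | Frequency (%)
1 | 62 | 33.5%
0 | 50 | 27.0%
5 | 23 | 12.4%
2 | 20 | 10.8%
6 | 12 | 6.5%
8 | 10 | 5.4%
3 | 3 | 1.6%
4 | 3 | 1.6%
9 | 2 | 1.1%

Other Punctuation
Value | Count | Frequency (%)
/ | 47 | 87.0%
. | 6 | 11.1%
, | 1 | 1.9%

Lowercase Letter
Value | Count | Frequency (%)
g | 7 | 53.8%
m | 6 | 46.2%

Close Punctuation
Value | Count | Frequency (%)
) | 39 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 39 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 31 | 100.0%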

Most occurring scripts

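Value | Count | Frequency (%)
Hangul | 377 | 51.1%
Common | 348 | 47.2%
Latin | 13 | 1.8%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[Hangul character not captured] | 58 | 15.4%
[Hangul character not captured] | 36 | 9.5%
[Hangul character not captured] | 30 | 8.0%
[Hangul character not captured] | 30 | 8.0%
[Hangul character not captured] | 29 | 7.7%
[Hangul character not captured] | 18 | 4.8%
[Hangul character not captured] | 14 | 3.7%
[Hangul character not captured] | 13 | 3.4%
[Hangul character not captured] | 11 | 2.9%
[Hangul character not captured] | 9 | 2.4%
Other values (50) | 129 | 34.2%

Common
Value | Count | Frequency (%)
1 | 62 | 17.8%
0 | 50 | 14.4%
/ | 47 | 13.5%
) | 39 | 11.2%
( | 39 | 11.2%
_ | 31 | 8.9%
5 | 23 | 6.6%
2 | 20 | 5.7%
6 | 12 | 3.4%
8 | 10 | 2.9%
Other values (5) | 15 | 4.3%

Latin
Value | Count | Frequency (%)
g | 7 | 53.8%
m | 6 | 46.2%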

Most occurring blocks

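Value | Count | Frequency (%)
Hangul | 377 | 51.1%
ASCII | 361 | 48.9%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 62 | 17.2%
0 | 50 | 13.9%
/ | 47 | 13.0%
) | 39 | 10.8%
( | 39 | 10.8%
_ | 31 | 8.6%
5 | 23 | 6.4%
2 | 20 | 5.5%
6 | 12 | 3.3%
8 | 10 | 2.8%
Other values (7) | 28 | 7.8%

Hangul
Value | Count | Frequency (%)
[Hangul character not captured] | 58 | 15.4%
[Hangul character not captured] | 36 | 9.5%
[Hangul character not captured] | 30 | 8.0%
[Hangul character not captured] | 30 | 8.0%
[Hangul character not captured] | 29 | 7.7%
[Hangul character not captured] | 18 | 4.8%
[Hangul character not captured] | 14 | 3.7%
[Hangul character not captured] | 13 | 3.4%
[Hangul character not captured] | 11 | 2.9%
[Hangul character not captured] | 9 | 2.4%
Other values (50) | 129 | 34.2%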

투여구분코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 31 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
1 | 31 | 100.0%

주성분명
Categorical

HIGH CORRELATION 

Distinct: 14
Distinct (%): 45.2%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 45
Median length: 44
Mean length: 36.258065
Min length: 14

Unique

Unique: 9
Unique (%): 29.0%
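
The difference between Distinct (14) and Unique (9) is easy to miss: Distinct counts how many different values occur, while Unique counts values that occur exactly once. A minimal sketch with `collections.Counter` follows. It rebuilds the column from the counts in this variable's Common Values table; the four labels `other_1` to `other_4` are placeholders for the values the report groups under "Other values (4)", and `distinct_and_unique` is a hypothetical helper name.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values occurring exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Counts taken from the Common Values table of 주성분명.
table = {
    "amlodipine besylate (as amlodipine 5mg)": 14,
    "amlodipine besylate (as amlodipine 10mg)": 2,
    "ibudilast 10mg": 2,
    "atorvastatin calcium (as atorvastatin 10mg)": 2,
    "atorvastatin calcium (as atorvastatin 20mg)": 2,
    "carvedilol 16mg": 1,
    "nebivolol hydrochloride (as nebivolol 2.5mg)": 1,
    "bosentan hydrate (as bosentan 0.125g)": 1,
    "carvedilil 8mg": 1,  # spelled as it appears in the data
    "telmisartan 20mg": 1,
    # Placeholders for the four values grouped as "Other values (4)".
    "other_1": 1, "other_2": 1, "other_3": 1, "other_4": 1,
}
values = [value for value, count in table.items() for _ in range(count)]

distinct, unique = distinct_and_unique(values)
distinct_pct = round(100 * distinct / len(values), 1)
unique_pct = round(100 * unique / len(values), 1)
```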

Sample

1st row: carvedilol 16mg
2nd row: amlodipine besylate (as amlodipine 10mg)
3rd row: nebivolol hydrochloride (as nebivolol 2.5mg)
4th row: amlodipine besylate (as amlodipine 5mg)
5th row: amlodipine besylate (as amlodipine 5mg)

Common Values

Value | Count | Frequency (%)
amlodipine besylate (as amlodipine 5mg) | 14 | 45.2%
amlodipine besylate (as amlodipine 10mg) | 2 | 6.5%
ibudilast 10mg | 2 | 6.5%
atorvastatin calcium (as atorvastatin 10mg) | 2 | 6.5%
atorvastatin calcium (as atorvastatin 20mg) | 2 | 6.5%
carvedilol 16mg | 1 | 3.2%
nebivolol hydrochloride (as nebivolol 2.5mg) | 1 | 3.2%
bosentan hydrate (as bosentan 0.125g) | 1 | 3.2%
carvedilil 8mg | 1 | 3.2%
telmisartan 20mg | 1 | 3.2%
Other values (4) | 4 | 12.9%

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
amlodipine | 34 | 24.3%
as | 26 | 18.6%
besylate | 16 | 11.4%
5mg | 14 | 10.0%
atorvastatin | 10 | 7.1%
10mg | 6 | 4.3%
calcium | 5 | 3.6%
nebivolol | 4 | 2.9%
20mg | 3 | 2.1%
hydrochloride | 2 | 1.4%
Other values (16) | 20 | 14.3%

보건복지부효능코드
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 9.7%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 1
Unique (%): 3.2%

Sample

1st row: 214
2nd row: 219
3rd row: 219
4th row: 219
5th row: 219

Common Values

Value | Count | Frequency (%)
219 | 25 | 80.6%
214 | 5 | 16.1%
396 | 1 | 3.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
219 | 25 | 80.6%
214 | 5 | 16.1%
396 | 1 | 3.2%

질병코드
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 9.7%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 2
Median length: 2
Mean length: 1.7419355
Min length: 1

Unique

Unique: 1
Unique (%): 3.2%

Sample

1st row: A
2nd row: AC
3rd row: AC
4th row: AC
5th row: AC

Common Values

Value | Count | Frequency (%)
AC | 23 | 74.2%
A | 7 | 22.6%
B | 1 | 3.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
ac | 23 | 74.2%
a | 7 | 22.6%
b | 1 | 3.2%

등록일시
DateTime

Distinct: 2
Distinct (%): 6.5%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B
Minimum: 2021-05-24 00:00:00
Maximum: 2021-05-27 00:00:00

[Figure: Histogram with fixed size bins (bins=2)]

Correlations

Variable | 약품명 | 주성분명 | 보건복지부효능코드 | 질병코드 | 등록일시
약품명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
주성분명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
보건복지부효능코드 | 1.000 | 1.000 | 1.000 | 0.994 | 0.334
질병코드 | 1.000 | 1.000 | 0.994 | 1.000 | 0.427
등록일시 | 1.000 | 1.000 | 0.334 | 0.427 | 1.000

Variable | 질병코드 | 보건복지부효능코드 | 주성분명
질병코드 | 1.000 | 0.904 | 0.779
보건복지부효능코드 | 0.904 | 1.000 | 0.779
주성분명 | 0.779 | 0.779 | 1.000

Variable | 주성분명 | 보건복지부효능코드 | 질병코드
주성분명 | 1.000 | 0.779 | 0.779
보건복지부효능코드 | 0.779 | 1.000 | 0.904
질병코드 | 0.779 | 0.904 | 1.000
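
The report text does not name the association measure behind each matrix. One measure commonly used for pairs of categorical columns is Cramér's V, so the following is a sketch under that assumption, using the plain (uncorrected) formula V = sqrt(chi2 / (n * (min(r, c) - 1))). It is not claimed to reproduce the values above, which cover all 31 rows and may use a corrected variant. The function name `cramers_v` is hypothetical, and the inputs are the ten first rows from the Sample section.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length sequences of categories."""
    n = len(xs)
    if n == 0 or n != len(ys):
        raise ValueError("xs and ys must be non-empty and of equal length")
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association information

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for r, r_count in row_totals.items():
        for c, c_count in col_totals.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * k))


# 보건복지부효능코드 and 질병코드 for rows 0-9 of the Sample section.
efficacy = ["214", "219", "219", "219", "219", "219", "214", "219", "219", "219"]
disease = ["A", "AC", "AC", "AC", "AC", "AC", "A", "AC", "AC", "AC"]
v_head = cramers_v(efficacy, disease)
```

In these ten rows each efficacy code maps to exactly one disease code, so the association is perfect. Over the full file it is not: the last sample row pairs code 219 with disease code A, which is consistent with the reported values falling below 1.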

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 약품명 | 투여구분코드 | 주성분명 | 보건복지부효능코드 | 질병코드 | 등록일시
0 | 딜라트렌이스알정16밀리그램(카르베딜롤)_(16mg/1정) | 1 | carvedilol 16mg | 214 | A | 2021-05-24
1 | 발트리오정10/160/20밀리그램_(1정) | 1 | amlodipine besylate (as amlodipine 10mg) | 219 | AC | 2021-05-27
2 | 네비로스타정2.5/5밀리그램_(1정) | 1 | nebivolol hydrochloride (as nebivolol 2.5mg) | 219 | AC | 2021-05-27
3 | 엑스원에이정5/80/10밀리그램_(1정) | 1 | amlodipine besylate (as amlodipine 5mg) | 219 | AC | 2021-05-27
4 | 아발탄에이플러스정5/160/20밀리그램_(1정) | 1 | amlodipine besylate (as amlodipine 5mg) | 219 | AC | 2021-05-27
5 | 아발탄에이플러스정5/160/10밀리그램_(1정) | 1 | amlodipine besylate (as amlodipine 5mg) | 219 | AC | 2021-05-27
6 | 카나보센정125밀리그램(보센탄수화물(미분화))_(1.29082g/1정) | 1 | bosentan hydrate (as bosentan 0.125g) | 214 | A | 2021-05-24
7 | 엑스원에이정5/80/20밀리그램_(1정) | 1 | amlodipine besylate (as amlodipine 5mg) | 219 | AC | 2021-05-27
8 | 아바트리정5/80/20밀리그램_(1정) | 1 | amlodipine besylate (as amlodipine 5mg) | 219 | AC | 2021-05-27
9 | 아발탄에이플러스정5/80/10밀리그램_(1정) | 1 | amlodipine besylate (as amlodipine 5mg) | 219 | AC | 2021-05-27

Last rows

Index | 약품명 | 투여구분코드 | 주성분명 | 보건복지부효능코드 | 질병코드 | 등록일시
21 | 아카브정60/20밀리그램_(1정) | 1 | atorvastatin calcium (as atorvastatin 20mg) | 219 | AC | 2021-05-24
22 | 올로맥스정40/5/5밀리그램_1(정) | 1 | amlodipine besylate (as amlodipine 5mg) | 219 | AC | 2021-05-27
23 | 아바트리정5/80/10밀리그램_(1정), | 1 | amlodipine besylate (as amlodipine 5mg) | 219 | AC | 2021-05-27
24 | 아바트리정5/160/20밀리그램_(1정) | 1 | amlodipine besylate (as amlodipine 5mg) | 219 | AC | 2021-05-27
25 | 엑스원에이정5/160/10밀리그램_(1정) | 1 | amlodipine besylate (as amlodipine 5mg) | 219 | AC | 2021-05-27
26 | 아모디핀정2.5밀리그램(암로디핀칸실산염)_(3.921mg/1정) | 1 | amlodipine camsylate (as amlodipine 2.5mg) | 214 | A | 2021-05-24
27 | 아카브정60/10밀리그램_(1정) | 1 | atorvastatin calcium (as atorvastatin 10mg) | 219 | AC | 2021-05-24
28 | 아카브정120/40밀리그램_(1정) | 1 | atorvastatin calcium (as atorvastatin 40mg) | 219 | AC | 2021-05-24
29 | 네시나메트정12.5/850밀리그램_(1정) | 1 | alogliptin benzoate (as alogliptin 12.5mg) | 396 | B | 2021-05-24
30 | 딜라스트캡슐(이부딜라스트)_(10mg/1캡슐) | 1 | ibudilast 10mg | 219 | A | 2021-05-24