Overview

Dataset statistics

Number of variables: 5
Number of observations: 10000
Missing cells: 22
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 468.8 KiB
Average record size in memory: 48.0 B
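
The derived figures above follow from the raw counts. A minimal sketch of the arithmetic, assuming the missing-cell percentage is taken over all cells (variables times observations) and that 1 KiB is 1024 bytes; the variable names are illustrative only:

```python
# Raw figures copied from the dataset statistics above.
n_variables = 5
n_observations = 10_000
missing_cells = 22
total_size_kib = 468.8

# Share of missing cells over all cells (assumed denominator: variables x observations).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size in bytes, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"{missing_pct:.3f}%")        # 0.044%, shown in the report as "< 0.1%"
print(f"{avg_record_bytes:.1f} B")  # 48.0 B
```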

Variable types

Text: 1
Boolean: 3
DateTime: 1

Dataset

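Description: Drug information used by the digital medical support system of the pilot project for medical support in medically underserved areas (의료취약지 의료지원 시범사업). It lists the details needed to prescribe in the system, such as the drug name and EDI code.
Author: 한국사회보장정보원 (Korea Social Security Information Service)
URL: https://www.data.go.kr/data/15090584/fileData.do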

Alerts

용법 has constant value "" (Constant)
사용여부 has constant value "" (Constant)

Reproduction

Analysis started: 2023-12-12 16:25:45.979820
Analysis finished: 2023-12-12 16:25:47.003625
Duration: 1.02 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
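
A report of this kind can be regenerated with the ydata-profiling package named above. The following is a minimal sketch, not the configuration actually used for this report: the helper name `build_profile`, the file paths, the title, and the default encoding are all assumptions, and the original `config.json` settings are not reproduced.

```python
def build_profile(csv_path, html_path, encoding="utf-8"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so the helper can be defined even where the
    # third-party packages (pandas, ydata-profiling) are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Build the profile with default settings and export it as HTML.
    profile = ProfileReport(df, title="Dataset profile")
    profile.to_file(html_path)
    return profile
```

Calling `build_profile("drugs.csv", "report.html")` (both paths are placeholders) would write the report next to the script.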

Variables

약품명
Text

Distinct: 9804
Distinct (%): 98.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 209
Median length: 97
Mean length: 24.1456
Min length: 3

Characters and Unicode

Total characters: 241456
Distinct characters: 731
Distinct categories: 13
Distinct scripts: 5
Distinct blocks: 10
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
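
As an illustration of these character properties, the sketch below counts the Unicode general category of each character in one of the sample values, using Python's standard `unicodedata` module. It covers categories only (the standard library does not expose scripts or blocks), and the helper name `category_counts` is illustrative. The two-letter codes correspond to the category names used in this report, for example `Lo` for Other Letter and `Nd` for Decimal Number.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# One of the sample values from this report.
counts = category_counts("시에스캡슐_(1캡슐)")

# Hangul syllables are "Lo", the digit is "Nd", "_" is "Pc",
# "(" is "Ps" and ")" is "Pe".
for code, n in counts.most_common():
    print(code, n)
```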

Unique

Unique: 9608
Unique (%): 96.1%
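
The report gives separate figures for distinct and unique values. A minimal sketch of the difference, assuming the usual reading in which "distinct" counts the different values present and "unique" counts the values that occur exactly once; the function name is illustrative:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# "b" appears twice, so it is distinct but not unique.
print(distinct_and_unique(["a", "b", "b", "c"]))  # (3, 2)
```

Under that reading, the figures for this variable would mean that 9804 - 9608 = 196 values occur more than once.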

Sample

1st row: 넥모클린정625mg(아목시실린·클라불란산칼륨)_(1정)
2nd row: 스텍신정(애엽95%에탄올연조엑스(20→1)_(60mg/1정)
3rd row: 후나콘주사액<피프린히드리네이트>_(3mg/2mL)
4th row: 시에스캡슐_(1캡슐)
5th row: 씨록신정250밀리그램(시프로플록사신염산염수화물)_(0.2915g/1정)

Value | Count | Frequency (%)
500밀리리터 | 28 | 0.3%
1000밀리리터 | 19 | 0.2%
 | 17 | 0.2%
250밀리리터 | 16 | 0.1%
10밀리리터 | 8 | 0.1%
싸이로키(i-131)치료용 | 8 | 0.1%
100밀리리터 | 8 | 0.1%
20밀리리터 | 7 | 0.1%
eye | 7 | 0.1%
2밀리리터 | 6 | 0.1%
Other values (10185) | 10581 | 98.8%

Most occurring characters

Value | Count | Frequency (%)
) | 13687 | 5.7%
( | 13685 | 5.7%
0 | 9691 | 4.0%
1 | 8145 | 3.4%
 | 7094 | 2.9%
 | 5946 | 2.5%
g | 5602 | 2.3%
5 | 5373 | 2.2%
/ | 5088 | 2.1%
_ | 5056 | 2.1%
Other values (721) | 162089 | 67.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 144048 | 59.7%
Decimal Number | 33149 | 13.7%
Lowercase Letter | 14006 | 5.8%
Close Punctuation | 13728 | 5.7%
Open Punctuation | 13727 | 5.7%
Other Punctuation | 11341 | 4.7%
Connector Punctuation | 5056 | 2.1%
Uppercase Letter | 4870 | 2.0%
Space Separator | 777 | 0.3%
Dash Punctuation | 404 | 0.2%
Other values (3) | 350 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 7094 | 4.9%
 | 5946 | 4.1%
 | 4017 | 2.8%
 | 3472 | 2.4%
 | 3392 | 2.4%
 | 3372 | 2.3%
 | 3166 | 2.2%
 | 3165 | 2.2%
 | 3131 | 2.2%
 | 3077 | 2.1%
Other values (626) | 104216 | 72.3%

Lowercase Letter
Value | Count | Frequency (%)
g | 5602 | 40.0%
m | 4938 | 35.3%
n | 393 | 2.8%
a | 324 | 2.3%
i | 316 | 2.3%
e | 315 | 2.2%
l | 291 | 2.1%
o | 281 | 2.0%
t | 207 | 1.5%
r | 185 | 1.3%
Other values (19) | 1154 | 8.2%

Uppercase Letter
Value | Count | Frequency (%)
L | 1782 | 36.6%
I | 521 | 10.7%
O | 228 | 4.7%
T | 210 | 4.3%
A | 210 | 4.3%
N | 204 | 4.2%
C | 204 | 4.2%
E | 193 | 4.0%
U | 179 | 3.7%
R | 121 | 2.5%
Other values (16) | 1018 | 20.9%

Decimal Number
Value | Count | Frequency (%)
0 | 9691 | 29.2%
1 | 8145 | 24.6%
5 | 5373 | 16.2%
2 | 3645 | 11.0%
3 | 1553 | 4.7%
4 | 1302 | 3.9%
6 | 1089 | 3.3%
8 | 965 | 2.9%
7 | 912 | 2.8%
9 | 474 | 1.4%

Other Punctuation
Value | Count | Frequency (%)
/ | 5088 | 44.9%
. | 4194 | 37.0%
: | 765 | 6.7%
% | 660 | 5.8%
, | 422 | 3.7%
· | 211 | 1.9%
; | 1 | < 0.1%

Math Symbol
Value | Count | Frequency (%)
 | 117 | 56.0%
~ | 71 | 34.0%
 | 9 | 4.3%
> | 5 | 2.4%
× | 3 | 1.4%
+ | 2 | 1.0%
< | 2 | 1.0%

Other Symbol
Value | Count | Frequency (%)
 | 88 | 69.3%
 | 24 | 18.9%
 | 7 | 5.5%
 | 6 | 4.7%
 | 2 | 1.6%

Letter Number
Value | Count | Frequency (%)
 | 10 | 71.4%
 | 2 | 14.3%
 | 1 | 7.1%
 | 1 | 7.1%

Close Punctuation
Value | Count | Frequency (%)
) | 13687 | 99.7%
] | 41 | 0.3%

Open Punctuation
Value | Count | Frequency (%)
( | 13685 | 99.7%
[ | 42 | 0.3%

Connector Punctuation
Value | Count | Frequency (%)
_ | 5056 | 100.0%

Space Separator
Value | Count | Frequency (%)
 | 777 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 404 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 144046 | 59.7%
Common | 78526 | 32.5%
Latin | 18844 | 7.8%
Greek | 38 | < 0.1%
Han | 2 | < 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 7094 | 4.9%
 | 5946 | 4.1%
 | 4017 | 2.8%
 | 3472 | 2.4%
 | 3392 | 2.4%
 | 3372 | 2.3%
 | 3166 | 2.2%
 | 3165 | 2.2%
 | 3131 | 2.2%
 | 3077 | 2.1%
Other values (624) | 104214 | 72.3%

Latin
Value | Count | Frequency (%)
g | 5602 | 29.7%
m | 4938 | 26.2%
L | 1782 | 9.5%
I | 521 | 2.8%
n | 393 | 2.1%
a | 324 | 1.7%
i | 316 | 1.7%
e | 315 | 1.7%
l | 291 | 1.5%
o | 281 | 1.5%
Other values (46) | 4081 | 21.7%

Common
Value | Count | Frequency (%)
) | 13687 | 17.4%
( | 13685 | 17.4%
0 | 9691 | 12.3%
1 | 8145 | 10.4%
5 | 5373 | 6.8%
/ | 5088 | 6.5%
_ | 5056 | 6.4%
. | 4194 | 5.3%
2 | 3645 | 4.6%
3 | 1553 | 2.0%
Other values (27) | 8409 | 10.7%

Greek
Value | Count | Frequency (%)
μ | 31 | 81.6%
β | 7 | 18.4%

Han
Value | Count | Frequency (%)
 | 1 | 50.0%
 | 1 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 144034 | 59.7%
ASCII | 96881 | 40.1%
None | 252 | 0.1%
CJK Compat | 127 | 0.1%
Arrows | 117 | < 0.1%
Number Forms | 14 | < 0.1%
Compat Jamo | 12 | < 0.1%
Math Operators | 9 | < 0.1%
Letterlike Symbols | 8 | < 0.1%
CJK | 2 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
) | 13687 | 14.1%
( | 13685 | 14.1%
0 | 9691 | 10.0%
1 | 8145 | 8.4%
g | 5602 | 5.8%
5 | 5373 | 5.5%
/ | 5088 | 5.3%
_ | 5056 | 5.2%
m | 4938 | 5.1%
. | 4194 | 4.3%
Other values (69) | 21422 | 22.1%

Hangul
Value | Count | Frequency (%)
 | 7094 | 4.9%
 | 5946 | 4.1%
 | 4017 | 2.8%
 | 3472 | 2.4%
 | 3392 | 2.4%
 | 3372 | 2.3%
 | 3166 | 2.2%
 | 3165 | 2.2%
 | 3131 | 2.2%
 | 3077 | 2.1%
Other values (623) | 104202 | 72.3%

None
Value | Count | Frequency (%)
· | 211 | 83.7%
μ | 31 | 12.3%
β | 7 | 2.8%
× | 3 | 1.2%

Arrows
Value | Count | Frequency (%)
 | 117 | 100.0%

CJK Compat
Value | Count | Frequency (%)
 | 88 | 69.3%
 | 24 | 18.9%
 | 7 | 5.5%
 | 6 | 4.7%
 | 2 | 1.6%

Compat Jamo
Value | Count | Frequency (%)
 | 12 | 100.0%

Number Forms
Value | Count | Frequency (%)
 | 10 | 71.4%
 | 2 | 14.3%
 | 1 | 7.1%
 | 1 | 7.1%

Math Operators
Value | Count | Frequency (%)
 | 9 | 100.0%

Letterlike Symbols
Value | Count | Frequency (%)
 | 8 | 100.0%

CJK
Value | Count | Frequency (%)
 | 1 | 50.0%
 | 1 | 50.0%

용법
Boolean

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 87.9 KiB

Value | Count | Frequency (%)
True | 10000 | 100.0%

급여여부
Boolean

Distinct: 2
Distinct (%): < 0.1%
Missing: 22
Missing (%): 0.2%
Memory size: 97.7 KiB

Value | Count | Frequency (%)
True | 7542 | 75.4%
False | 2436 | 24.4%
(Missing) | 22 | 0.2%

사용여부
Boolean

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 87.9 KiB

Value | Count | Frequency (%)
True | 10000 | 100.0%

등록일
DateTime

Distinct: 20
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2017-08-23 00:00:00
Maximum: 2023-09-15 00:00:00
Histogram with fixed size bins (bins=20)
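
Fixed-size binning splits the range between the minimum and maximum into equally wide intervals. A minimal sketch for the date range reported above, assuming that the last bin also includes the maximum value; the function names are illustrative and this is not the plotting code behind the report:

```python
from datetime import datetime

def fixed_bin_edges(lo, hi, bins=20):
    """Return bins + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / bins
    return [lo + i * width for i in range(bins + 1)]

def bin_index(value, lo, hi, bins=20):
    """Index of the bin holding value; the maximum goes into the last bin."""
    width = (hi - lo) / bins
    return min(int((value - lo) / width), bins - 1)

lo = datetime(2017, 8, 23)   # Minimum from the report
hi = datetime(2023, 9, 15)   # Maximum from the report
edges = fixed_bin_edges(lo, hi)

print(len(edges))              # 21 edges delimit 20 bins
print(bin_index(lo, lo, hi))   # 0
print(bin_index(hi, lo, hi))   # 19
```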

Correlations

 | 급여여부 | 등록일
급여여부 | 1.000 | NaN
등록일 | NaN | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
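
Both displays are built from the same information: whether each cell is present or missing. A minimal sketch using only the standard library, with a few made-up rows (toy data, not taken from the dataset) and illustrative function names:

```python
def nullity_by_column(rows, columns):
    """Count missing (None) cells per column in a list of dict rows."""
    return {col: sum(1 for row in rows if row.get(col) is None) for col in columns}

def nullity_matrix(rows, columns):
    """One list per row: True where the cell is present, False where missing."""
    return [[row.get(col) is not None for col in columns] for row in rows]

# Toy rows for illustration only.
columns = ["급여여부", "등록일"]
rows = [
    {"급여여부": "Y", "등록일": "2017-08-23"},
    {"급여여부": None, "등록일": "2017-08-23"},
    {"급여여부": "N", "등록일": "2017-08-23"},
]

print(nullity_by_column(rows, columns))  # {'급여여부': 1, '등록일': 0}
print(nullity_matrix(rows, columns))     # [[True, True], [False, True], [True, True]]
```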

Sample

 | 약품명 | 용법 | 급여여부 | 사용여부 | 등록일
17553 | 넥모클린정625mg(아목시실린·클라불란산칼륨)_(1정) | Y | Y | Y | 2017-08-23
6111 | 스텍신정(애엽95%에탄올연조엑스(20→1)_(60mg/1정) | Y | Y | Y | 2017-08-23
9928 | 후나콘주사액<피프린히드리네이트>_(3mg/2mL) | Y | Y | Y | 2017-08-23
28889 | 시에스캡슐_(1캡슐) | Y | Y | Y | 2017-08-23
16056 | 씨록신정250밀리그램(시프로플록사신염산염수화물)_(0.2915g/1정) | Y | Y | Y | 2017-08-23
10946 | 프래빅스정(클로피도그렐황산염)_(97.875mg/1정) | Y | Y | Y | 2017-08-23
6669 | 아로베스트정(아플로쿠알론) | Y | Y | Y | 2017-08-23
32962 | 더블셋정 | Y | N | Y | 2017-08-23
28771 | 그라트릴주(그라니세트론염산염)_(1mg/1mL) | Y | Y | Y | 2017-08-23
28056 | 펜타사관장액(메살라진) | Y | Y | Y | 2017-08-23

 | 약품명 | 용법 | 급여여부 | 사용여부 | 등록일
45270 | 테리드정(나테글리니드) | Y | N | Y | 2017-08-23
38794 | 발싸이트정450밀리그람(염산발간시클로버)_(0.4963g/1정) | Y | Y | Y | 2017-08-23
15380 | 로큐론주(로쿠로니움브롬화물) | Y | N | Y | 2017-08-23
36673 | 휴토졸정(판토프라졸나트륨세스키히드레이트) | Y | N | Y | 2017-08-23
20293 | 훼럼키드액(수산화제이철폴리말토스복염) | Y | N | Y | 2017-08-23
32959 | 엘도라캡슐(에르도스테인)_(0.3g/1캡슐) | Y | Y | Y | 2017-08-23
9168 | 크린세프시럽125mg/5ml(세파클러수화물)_(3.75g/150mL) | Y | Y | Y | 2017-08-23
25646 | 치옥티아에이취알정600밀리그램(티옥트산)_(0.6g/1정) | Y | Y | Y | 2017-08-23
42128 | 케포돈1그람주(세파제돈나트륨) | Y | Y | Y | 2017-08-23
16232 | 히야론퍼스트주사(히알우론산나트륨)(프리필드) | Y | Y | Y | 2017-08-23
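
Many of the sampled 약품명 values end with an amount in the form `_(...)`, while others have no such suffix. A minimal sketch of splitting the two parts, assuming that pattern holds; it is inferred only from the rows shown here, not from any documented format, and the function name is hypothetical:

```python
def split_drug_name(raw):
    """Split 'name_(amount)' into (name, amount); amount is None when absent."""
    # rpartition searches from the right, so parentheses inside the name are kept.
    name, sep, tail = raw.rpartition("_(")
    if sep and tail.endswith(")"):
        return name, tail[:-1]
    return raw, None

print(split_drug_name("시에스캡슐_(1캡슐)"))  # ('시에스캡슐', '1캡슐')
print(split_drug_name("더블셋정"))            # ('더블셋정', None)
```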