Overview

Dataset statistics

Number of variables: 7
Number of observations: 479
Missing cells: 479
Missing cells (%): 14.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 28.2 KiB
Average record size in memory: 60.3 B
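
The two derived figures in this table can be checked by hand. The sketch below assumes that the missing-cell percentage is taken over all cells (variables times observations) and that 1 KiB is 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the dataset statistics table.
n_variables = 7
n_observations = 479
missing_cells = 479
total_size_kib = 28.2

total_cells = n_variables * n_observations        # 7 * 479 = 3353 cells
missing_pct = 100 * missing_cells / total_cells   # share of empty cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

All 479 missing cells belong to a single column, which is why the share is exactly one seventh of the table.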

Variable types

Numeric: 3
Text: 1
Unsupported: 1
DateTime: 2

Dataset

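Description: Curriculum subgroup classification (education group code, subgroup code, subgroup name, former subgroup name, sort order, etc.)
Author: 한국의료기기안전정보원 (National Institute of Medical Device Safety Information)
URL: https://www.data.go.kr/data/15067079/fileData.do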

Alerts

SGROUP_CODE is highly overall correlated with SSORT_SEQ [High correlation]
SSORT_SEQ is highly overall correlated with SGROUP_CODE [High correlation]
SGROUP_ORG has 479 (100.0%) missing values [Missing]
IN_DTIME has unique values [Unique]
UP_DTIME has unique values [Unique]
SGROUP_ORG is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
SSORT_SEQ has 9 (1.9%) zeros [Zeros]
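
As a rough illustration of how alerts like these can be derived from per-column counts, here is a minimal sketch. The `ColumnSummary` class, the `column_alerts` helper and the 1% zero threshold are all hypothetical and are not the profiler's actual rules. The counts are taken from the variable sections of this report.

```python
from dataclasses import dataclass


@dataclass
class ColumnSummary:
    name: str
    n: int               # number of observations
    n_missing: int = 0
    n_distinct: int = 0
    n_zeros: int = 0


def column_alerts(col, zero_threshold=0.01):
    """Return alert strings for one column (illustrative rules only)."""
    alerts = []
    if col.n_missing > 0:
        pct = 100 * col.n_missing / col.n
        alerts.append(f"{col.name} has {col.n_missing} ({pct:.1f}%) missing values")
    if col.n_missing == 0 and col.n_distinct == col.n:
        alerts.append(f"{col.name} has unique values")
    # Assumed cut-off: only report zeros once they exceed zero_threshold of the rows.
    if col.n_zeros / col.n > zero_threshold:
        pct = 100 * col.n_zeros / col.n
        alerts.append(f"{col.name} has {col.n_zeros} ({pct:.1f}%) zeros")
    return alerts


summaries = [
    ColumnSummary("SGROUP_CODE", n=479, n_distinct=73, n_zeros=1),
    ColumnSummary("SGROUP_ORG", n=479, n_missing=479),
    ColumnSummary("SSORT_SEQ", n=479, n_distinct=60, n_zeros=9),
    ColumnSummary("IN_DTIME", n=479, n_distinct=479),
]
all_alerts = [a for s in summaries for a in column_alerts(s)]
for alert in all_alerts:
    print(alert)
```

With the assumed 1% threshold, the single zero in SGROUP_CODE (0.2%) is not reported while the nine zeros in SSORT_SEQ (1.9%) are, which matches the alert list.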

Reproduction

Analysis started: 2023-12-12 20:31:50.391173
Analysis finished: 2023-12-12 20:31:52.182127
Duration: 1.79 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
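
A report of this kind could be regenerated along the following lines. This is a sketch under assumptions, not the exact procedure used here: the CSV path, the file encoding and the report title are placeholders, the settings stored in config.json are not reproduced, and the third-party packages pandas and ydata-profiling must be installed.

```python
def build_report(csv_path, out_html="report.html", encoding="utf-8"):
    """Profile a CSV file and write an HTML report to out_html."""
    # Imported lazily so this module still loads where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    # The title is a placeholder; the original report title is not known.
    report = ProfileReport(df, title="Curriculum subgroup classification")
    report.to_file(out_html)
    return out_html
```

Calling `build_report("subgroups.csv")` with a hypothetical local copy of the dataset would write `report.html` next to the script.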

Variables

GROUP_CODE
Real number (ℝ)

Distinct: 83
Distinct (%): 17.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 38.634656
Minimum: 1
Maximum: 99
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.3 KiB

Quantile statistics

Minimum: 1
5th percentile: 5
Q1: 11
Median: 29
Q3: 60
95th percentile: 97
Maximum: 99
Range: 98
Interquartile range (IQR): 49

Descriptive statistics

Standard deviation: 31.183123
Coefficient of variation (CV): 0.80712828
Kurtosis: -1.0664477
Mean: 38.634656
Median Absolute Deviation (MAD): 22
Skewness: 0.55594181
Sum: 18506
Variance: 972.38716
Monotonicity: Not monotonic
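
Several of the GROUP_CODE statistics follow from one another, which gives a quick consistency check. The sketch below recomputes the mean, variance, coefficient of variation, IQR and range from the reported values using the standard definitions; the variable names are illustrative.

```python
# Values copied from the GROUP_CODE statistics.
n = 479
total = 18506
std = 31.183123
q1, q3 = 11, 60
minimum, maximum = 1, 99

mean = total / n                  # sum divided by number of observations
variance = std ** 2               # variance is the squared standard deviation
cv = std / mean                   # coefficient of variation
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum

print(f"mean={mean:.6f} variance={variance:.5f} cv={cv:.8f}")
print(f"iqr={iqr} range={value_range}")
```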
Histogram with fixed size bins (bins=50)
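
A histogram with fixed size bins divides the observed range into equal-width intervals and counts the values in each. The helper below is a minimal sketch of that idea; `fixed_width_histogram` is a hypothetical name and the toy input is not taken from the dataset.

```python
def fixed_width_histogram(values, bins):
    """Count values in `bins` equal-width intervals spanning min..max."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    if width == 0:
        # All values are identical: put everything in the first bin.
        return [len(values)] + [0] * (bins - 1)
    counts = [0] * bins
    for v in values:
        # The maximum sits exactly on the upper edge, so clamp it into the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return counts


toy_counts = fixed_width_histogram([1, 2, 3, 4, 5, 6, 7, 8, 9, 10], bins=3)
print(toy_counts)
```

For a variable ranging from 1 to 99 with bins=50, each bin would be 1.96 units wide.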
Common values
Value | Count | Frequency (%)
5 | 60 | 12.5%
11 | 36 | 7.5%
50 | 35 | 7.3%
97 | 32 | 6.7%
41 | 23 | 4.8%
80 | 22 | 4.6%
18 | 22 | 4.6%
6 | 15 | 3.1%
29 | 14 | 2.9%
81 | 13 | 2.7%
Other values (73) | 207 | 43.2%

Minimum 10 values
Value | Count | Frequency (%)
1 | 2 | 0.4%
2 | 12 | 2.5%
3 | 2 | 0.4%
4 | 1 | 0.2%
5 | 60 | 12.5%
6 | 15 | 3.1%
7 | 6 | 1.3%
8 | 3 | 0.6%
9 | 9 | 1.9%
10 | 1 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
99 | 1 | 0.2%
98 | 1 | 0.2%
97 | 32 | 6.7%
96 | 1 | 0.2%
95 | 1 | 0.2%
93 | 1 | 0.2%
92 | 1 | 0.2%
91 | 3 | 0.6%
90 | 1 | 0.2%
89 | 1 | 0.2%

SGROUP_CODE
Real number (ℝ)

HIGH CORRELATION 

Distinct: 73
Distinct (%): 15.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15.843424
Minimum: 0
Maximum: 99
Zeros: 1
Zeros (%): 0.2%
Negative: 0
Negative (%): 0.0%
Memory size: 4.3 KiB

Quantile statistics

Minimum: 0
5th percentile: 1
Q1: 3
Median: 8
Q3: 19.5
95th percentile: 59.1
Maximum: 99
Range: 99
Interquartile range (IQR): 16.5

Descriptive statistics

Standard deviation: 20.874163
Coefficient of variation (CV): 1.3175285
Kurtosis: 5.1272301
Mean: 15.843424
Median Absolute Deviation (MAD): 6
Skewness: 2.2637827
Sum: 7589
Variance: 435.73066
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 73 | 15.2%
2 | 40 | 8.4%
3 | 32 | 6.7%
4 | 27 | 5.6%
5 | 23 | 4.8%
6 | 21 | 4.4%
7 | 18 | 3.8%
10 | 16 | 3.3%
11 | 15 | 3.1%
9 | 15 | 3.1%
Other values (63) | 199 | 41.5%

Minimum 10 values
Value | Count | Frequency (%)
0 | 1 | 0.2%
1 | 73 | 15.2%
2 | 40 | 8.4%
3 | 32 | 6.7%
4 | 27 | 5.6%
5 | 23 | 4.8%
6 | 21 | 4.4%
7 | 18 | 3.8%
8 | 14 | 2.9%
9 | 15 | 3.1%

Maximum 10 values
Value | Count | Frequency (%)
99 | 2 | 0.4%
98 | 2 | 0.4%
97 | 1 | 0.2%
96 | 1 | 0.2%
95 | 1 | 0.2%
94 | 2 | 0.4%
93 | 2 | 0.4%
91 | 2 | 0.4%
90 | 1 | 0.2%
89 | 1 | 0.2%

SGROUP_NAME
Text

Distinct: 453
Distinct (%): 94.6%
Missing: 0
Missing (%): 0.0%
Memory size: 3.9 KiB

Length

Max length: 65
Median length: 45
Mean length: 18.607516
Min length: 2

Characters and Unicode

Total characters: 8913
Distinct characters: 358
Distinct categories: 13
Distinct scripts: 3
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
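
The category breakdown can be reproduced for a single value with Python's standard `unicodedata` module. The sketch below counts general categories for the second sample row; the `LONG_NAMES` mapping is a hand-written subset covering only the codes that occur in this string, and the profiler's own implementation may differ.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter general category codes used below.
LONG_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
}

sample = "[디딤돌플러스] 실무/기초"
category_counts = Counter(unicodedata.category(ch) for ch in sample)

for code, count in category_counts.most_common():
    print(f"{LONG_NAMES.get(code, code)}: {count}")
```

Hangul syllables fall under Other Letter, which is why that category dominates the counts for this variable.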

Unique

Unique: 427
Unique (%): 89.1%

Sample

1st row: [디딤돌(기초)] 임상(용품) 4차산업을 선도하는 의료기기 R&D 연구개발자 교육
2nd row: [디딤돌플러스] 실무/기초
3rd row: [디딤돌플러스] 인허가/심화
4th row: [디딤돌플러스] 인허가/특성화
5th row: [디딤돌플러스] 해외전문가 초청과정
Value | Count | Frequency (%)
의료기기 | 153 | 9.3%
교육 | 43 | 2.6%
(blank) | 42 | 2.6%
세미나 | 35 | 2.1%
임상시험 | 20 | 1.2%
gmp | 19 | 1.2%
(blank) | 17 | 1.0%
사전컨설팅 | 16 | 1.0%
민원설명회 | 16 | 1.0%
디딤돌플러스 | 16 | 1.0%
Other values (695) | 1263 | 77.0%

Most occurring characters

Value | Count | Frequency (%)
(space) | 1171 | 13.1%
(blank) | 499 | 5.6%
(blank) | 258 | 2.9%
(blank) | 228 | 2.6%
( | 207 | 2.3%
) | 207 | 2.3%
1 | 129 | 1.4%
(blank) | 125 | 1.4%
[ | 105 | 1.2%
] | 105 | 1.2%
Other values (348) | 5879 | 66.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5907 | 66.3%
Space Separator | 1171 | 13.1%
Decimal Number | 488 | 5.5%
Uppercase Letter | 371 | 4.2%
Open Punctuation | 319 | 3.6%
Close Punctuation | 319 | 3.6%
Lowercase Letter | 146 | 1.6%
Other Punctuation | 97 | 1.1%
Dash Punctuation | 75 | 0.8%
Connector Punctuation | 9 | 0.1%
Other values (3) | 11 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(blank) | 499 | 8.4%
(blank) | 258 | 4.4%
(blank) | 228 | 3.9%
(blank) | 125 | 2.1%
(blank) | 97 | 1.6%
(blank) | 89 | 1.5%
(blank) | 86 | 1.5%
(blank) | 77 | 1.3%
(blank) | 76 | 1.3%
(blank) | 74 | 1.3%
Other values (278) | 4298 | 72.8%

Uppercase Letter
Value | Count | Frequency (%)
D | 42 | 11.3%
I | 40 | 10.8%
P | 37 | 10.0%
C | 32 | 8.6%
A | 31 | 8.4%
E | 30 | 8.1%
M | 29 | 7.8%
G | 29 | 7.8%
R | 22 | 5.9%
S | 20 | 5.4%
Other values (10) | 59 | 15.9%

Lowercase Letter
Value | Count | Frequency (%)
i | 25 | 17.1%
a | 19 | 13.0%
n | 16 | 11.0%
d | 12 | 8.2%
e | 12 | 8.2%
t | 11 | 7.5%
o | 10 | 6.8%
r | 9 | 6.2%
l | 8 | 5.5%
s | 5 | 3.4%
Other values (10) | 19 | 13.0%

Decimal Number
Value | Count | Frequency (%)
1 | 129 | 26.4%
2 | 90 | 18.4%
0 | 85 | 17.4%
6 | 54 | 11.1%
3 | 37 | 7.6%
4 | 32 | 6.6%
8 | 19 | 3.9%
9 | 16 | 3.3%
7 | 13 | 2.7%
5 | 13 | 2.7%

Other Punctuation
Value | Count | Frequency (%)
/ | 42 | 43.3%
: | 16 | 16.5%
& | 12 | 12.4%
, | 9 | 9.3%
· | 8 | 8.2%
. | 8 | 8.2%
* | 2 | 2.1%

Open Punctuation
Value | Count | Frequency (%)
( | 207 | 64.9%
[ | 105 | 32.9%
(blank) | 7 | 2.2%

Close Punctuation
Value | Count | Frequency (%)
) | 207 | 64.9%
] | 105 | 32.9%
(blank) | 7 | 2.2%

Letter Number
Value | Count | Frequency (%)
(blank) | 3 | 50.0%
(blank) | 3 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1171 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 75 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 9 | 100.0%

Modifier Symbol
Value | Count | Frequency (%)
` | 4 | 100.0%

Initial Punctuation
Value | Count | Frequency (%)
(blank) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5907 | 66.3%
Common | 2483 | 27.9%
Latin | 523 | 5.9%

Most frequent character per script

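Hangul
Value | Count | Frequency (%)
(blank) | 499 | 8.4%
(blank) | 258 | 4.4%
(blank) | 228 | 3.9%
(blank) | 125 | 2.1%
(blank) | 97 | 1.6%
(blank) | 89 | 1.5%
(blank) | 86 | 1.5%
(blank) | 77 | 1.3%
(blank) | 76 | 1.3%
(blank) | 74 | 1.3%
Other values (278) | 4298 | 72.8%

Latin
Value | Count | Frequency (%)
D | 42 | 8.0%
I | 40 | 7.6%
P | 37 | 7.1%
C | 32 | 6.1%
A | 31 | 5.9%
E | 30 | 5.7%
M | 29 | 5.5%
G | 29 | 5.5%
i | 25 | 4.8%
R | 22 | 4.2%
Other values (32) | 206 | 39.4%

Common
Value | Count | Frequency (%)
(space) | 1171 | 47.2%
( | 207 | 8.3%
) | 207 | 8.3%
1 | 129 | 5.2%
[ | 105 | 4.2%
] | 105 | 4.2%
2 | 90 | 3.6%
0 | 85 | 3.4%
- | 75 | 3.0%
6 | 54 | 2.2%
Other values (18) | 255 | 10.3%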

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5907 | 66.3%
ASCII | 2977 | 33.4%
None | 22 | 0.2%
Number Forms | 6 | 0.1%
Punctuation | 1 | < 0.1%

Most frequent character per block

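ASCII
Value | Count | Frequency (%)
(space) | 1171 | 39.3%
( | 207 | 7.0%
) | 207 | 7.0%
1 | 129 | 4.3%
[ | 105 | 3.5%
] | 105 | 3.5%
2 | 90 | 3.0%
0 | 85 | 2.9%
- | 75 | 2.5%
6 | 54 | 1.8%
Other values (54) | 749 | 25.2%

Hangul
Value | Count | Frequency (%)
(blank) | 499 | 8.4%
(blank) | 258 | 4.4%
(blank) | 228 | 3.9%
(blank) | 125 | 2.1%
(blank) | 97 | 1.6%
(blank) | 89 | 1.5%
(blank) | 86 | 1.5%
(blank) | 77 | 1.3%
(blank) | 76 | 1.3%
(blank) | 74 | 1.3%
Other values (278) | 4298 | 72.8%

None
Value | Count | Frequency (%)
· | 8 | 36.4%
(blank) | 7 | 31.8%
(blank) | 7 | 31.8%

Number Forms
Value | Count | Frequency (%)
(blank) | 3 | 50.0%
(blank) | 3 | 50.0%

Punctuation
Value | Count | Frequency (%)
(blank) | 1 | 100.0%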

SGROUP_ORG
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 479
Missing (%): 100.0%
Memory size: 4.3 KiB

SSORT_SEQ
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 60
Distinct (%): 12.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.678497
Minimum: 0
Maximum: 99
Zeros: 9
Zeros (%): 1.9%
Negative: 0
Negative (%): 0.0%
Memory size: 4.3 KiB

Quantile statistics

Minimum: 0
5th percentile: 1
Q1: 2
Median: 6
Q3: 14
95th percentile: 43
Maximum: 99
Range: 99
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 15.950531
Coefficient of variation (CV): 1.3658034
Kurtosis: 10.556742
Mean: 11.678497
Median Absolute Deviation (MAD): 5
Skewness: 2.8940081
Sum: 5594
Variance: 254.41943
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 98 | 20.5%
2 | 44 | 9.2%
3 | 30 | 6.3%
4 | 28 | 5.8%
5 | 23 | 4.8%
6 | 20 | 4.2%
7 | 17 | 3.5%
11 | 16 | 3.3%
10 | 15 | 3.1%
12 | 14 | 2.9%
Other values (50) | 174 | 36.3%

Minimum 10 values
Value | Count | Frequency (%)
0 | 9 | 1.9%
1 | 98 | 20.5%
2 | 44 | 9.2%
3 | 30 | 6.3%
4 | 28 | 5.8%
5 | 23 | 4.8%
6 | 20 | 4.2%
7 | 17 | 3.5%
8 | 13 | 2.7%
9 | 13 | 2.7%

Maximum 10 values
Value | Count | Frequency (%)
99 | 2 | 0.4%
98 | 1 | 0.2%
97 | 1 | 0.2%
96 | 1 | 0.2%
86 | 1 | 0.2%
85 | 1 | 0.2%
81 | 1 | 0.2%
66 | 1 | 0.2%
62 | 1 | 0.2%
59 | 1 | 0.2%

IN_DTIME
Date

UNIQUE 

Distinct: 479
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.9 KiB
Minimum: 2013-01-21 00:00:00
Maximum: 2020-08-19 14:21:40
Histogram with fixed size bins (bins=50)

UP_DTIME
Date

UNIQUE 

Distinct: 479
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.9 KiB
Minimum: 2013-01-24 19:49:43.763000
Maximum: 2020-08-19 14:21:40
Histogram with fixed size bins (bins=50)

Interactions

[Figures: interaction plots; images not preserved]

Correlations

Variable | GROUP_CODE | SGROUP_CODE | SSORT_SEQ
GROUP_CODE | 1.000 | 0.525 | 0.360
SGROUP_CODE | 0.525 | 1.000 | 0.914
SSORT_SEQ | 0.360 | 0.914 | 1.000

Variable | GROUP_CODE | SGROUP_CODE | SSORT_SEQ
GROUP_CODE | 1.000 | -0.091 | -0.181
SGROUP_CODE | -0.091 | 1.000 | 0.747
SSORT_SEQ | -0.181 | 0.747 | 1.000
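
The report does not state which coefficient each matrix uses. Two matrices for the same variables can disagree this much because different measures react differently to skewed data. The sketch below contrasts a linear (Pearson) and a rank-based (Spearman) coefficient on toy data that is not taken from the dataset; the function names are illustrative and nothing here asserts which method the profiler applied.

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation: covariance scaled by both standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy data: perfectly monotonic, but one extreme value breaks linearity.
x = [1, 2, 3, 4, 100]
y = [1, 2, 3, 4, 5]
r_pearson = pearson(x, y)
r_spearman = spearman(x, y)
print(f"pearson={r_pearson:.3f} spearman={r_spearman:.3f}")
```

A single outlier lowers the linear coefficient while the rank-based one stays at 1, so heavy-tailed columns such as SGROUP_CODE and SSORT_SEQ can plausibly score differently under the two measures.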

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
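
Nullity by column is simply the count of empty cells per variable. The sketch below computes it for three illustrative records that use names and sort orders from the sample rows, with `None` standing in for `<NA>`; the `nullity_by_column` helper is hypothetical and only three of the seven columns are shown.

```python
def nullity_by_column(rows):
    """Return the number of missing (None) cells for each column."""
    columns = rows[0].keys()
    return {col: sum(1 for row in rows if row[col] is None) for col in columns}


rows = [
    {"SGROUP_NAME": "[디딤돌플러스] 실무/기초", "SGROUP_ORG": None, "SSORT_SEQ": 9},
    {"SGROUP_NAME": "[디딤돌플러스] 인허가/심화", "SGROUP_ORG": None, "SSORT_SEQ": 41},
    {"SGROUP_NAME": "[디딤돌플러스] 인허가/특성화", "SGROUP_ORG": None, "SSORT_SEQ": 42},
]
nullity = nullity_by_column(rows)
print(nullity)
```

On the full dataset the same count would give 479 for SGROUP_ORG and 0 for every other column, matching the alerts.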

Sample

GROUP_CODESGROUP_CODESGROUP_NAMESGROUP_ORGSSORT_SEQIN_DTIMEUP_DTIME
09757[디딤돌(기초)] 임상(용품) 4차산업을 선도하는 의료기기 R&D 연구개발자 교육<NA>132018-06-12 21:14:50.02020-07-02 13:48:20.0
1819[디딤돌플러스] 실무/기초<NA>92019-04-22 09:40:23.02019-04-22 09:40:23.0
29741[디딤돌플러스] 인허가/심화<NA>412019-04-22 09:47:52.02019-04-22 09:47:52.0
39742[디딤돌플러스] 인허가/특성화<NA>422019-04-22 09:48:10.02019-04-22 09:48:10.0
49747[디딤돌플러스] 해외전문가 초청과정<NA>472019-04-22 09:50:05.02019-04-22 09:50:05.0
5541전자의료기기시험검사법<NA>12019-09-06 09:40:40.02019-09-06 09:40:40.0
6273IEC60601-1-2(4판)<NA>12019-09-06 10:21:25.02019-09-06 10:21:42.0
7844[단과반] 사후관리<NA>42020-05-19 16:11:33.02020-05-19 16:11:33.0
886219년도 온라인 보수교육<NA>22020-05-29 17:18:24.02020-05-29 17:18:24.0
99759[디딤돌플러스](실시간온라인) 기초_체외진단<NA>132020-07-02 13:48:58.02020-07-02 13:48:58.0
GROUP_CODESGROUP_CODESGROUP_NAMESGROUP_ORGSSORT_SEQIN_DTIMEUP_DTIME
4698585ISO 13485:2016 전환을 위한 업계 교육<NA>852020-06-19 09:22:50.02020-06-19 10:34:30.0
4704116제조/수입-공정밸리데이션<NA>162020-06-29 14:04:58.02020-06-29 14:05:28.0
4714120제조/수입-클린룸밸리데이션<NA>202020-06-29 14:06:40.02020-06-29 14:07:07.0
4729755[디딤돌플러스](실시간온라인) 기초_기구기계<NA>112020-07-02 13:44:34.02020-07-02 13:44:34.0
473313의료기기 통합정보시스템 가이드라인 교육(대구)<NA>32018-08-20 18:48:01.02018-08-20 18:48:01.0
4746132018년 제 6회 의료기기 안전성정보 사례연구 워크숍<NA>132018-10-30 09:21:07.02018-10-30 09:21:07.0
4755912019년 의료기기심사부 허가·심사 민원설명회<NA>22019-02-14 17:20:12.02019-02-14 17:20:12.0
476826[1:1멘토] 루트로닉 RA팀장<NA>12019-11-01 14:32:07.02019-11-01 14:32:07.0
477827[1:1멘토] 한국의료기기공업협동조합 전략기획팀 선임<NA>12019-11-01 14:32:39.02019-11-01 14:32:39.0
4788412019 온라인 RA 교육<NA>12020-04-23 11:26:50.02020-04-23 11:26:50.0