Overview

Dataset statistics

Number of variables8
Number of observations98
Missing cells4
Missing cells (%)0.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.3 KiB
Average record size in memory65.3 B

Variable types

Categorical5
Text2
DateTime1

Dataset

Description등록민간자격, 국가공인민간자격 등 민간자격에 대한 정보데이터 중 공인민간자격등록현황 데이터입니다. 현재 국가공인민간자격 중 소관부처, 자격종목, 등급, 자격관리자, 공인유효기간, 기공인기간 데이터로 구성되어 있으며, 실시간 자료를 원하실 경우 민간자격정보서비스(www.pqi.or.kr)에서 확인하실 수 있습니다.
URLhttps://www.data.go.kr/data/15069557/fileData.do

Alerts

기공인기간(만료) is highly overall correlated with 공인유효기간(시작) and 1 other fieldsHigh correlation
공인유효기간(시작) is highly overall correlated with 공인유효기간(만료) and 1 other fieldsHigh correlation
공인유효기간(만료) is highly overall correlated with 공인유효기간(시작) and 1 other fieldsHigh correlation
기공인기간(시작) has 4 (4.1%) missing valuesMissing

Reproduction

Analysis started2023-12-12 21:11:09.912707
Analysis finished2023-12-12 21:11:11.214226
Duration1.3 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

소관 부처
Categorical

Distinct17
Distinct (%)17.3%
Missing0
Missing (%)0.0%
Memory size916.0 B
교육부
21 
과학기술정보통신부
20 
금융위원회
11 
산업통상자원부
기획재정부
Other values (12)
29 

Length

Max length9
Median length8
Mean length5.5612245
Min length3

Unique

Unique5 ?
Unique (%)5.1%

Sample

1st row식품의약품안전처
2nd row금융위원회
3rd row금융위원회
4th row금융위원회
5th row금융위원회

Common Values

ValueCountFrequency (%)
교육부 21
21.4%
과학기술정보통신부 20
20.4%
금융위원회 11
11.2%
산업통상자원부 9
9.2%
기획재정부 8
 
8.2%
문화체육관광부 5
 
5.1%
보건복지부 4
 
4.1%
국토교통부 4
 
4.1%
경찰청 3
 
3.1%
산림청 3
 
3.1%
Other values (7) 10
10.2%

Length

2023-12-13T06:11:11.305680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
교육부 21
21.4%
과학기술정보통신부 20
20.4%
금융위원회 11
11.2%
산업통상자원부 9
9.2%
기획재정부 8
 
8.2%
문화체육관광부 5
 
5.1%
보건복지부 4
 
4.1%
국토교통부 4
 
4.1%
행정안전부 3
 
3.1%
경찰청 3
 
3.1%
Other values (7) 10
10.2%
Distinct95
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size916.0 B
2023-12-13T06:11:11.584107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length20.5
Mean length9.3367347
Min length3

Characters and Unicode

Total characters915
Distinct characters202
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique92 ?
Unique (%)93.9%

Sample

1st row의료기기 RA전문가
2nd row신용관리사
3rd row신용위험분석사(CRA)
4th row신용분석사
5th row여신심사역
ValueCountFrequency (%)
flex 6
 
5.1%
한자급수자격검정 2
 
1.7%
한자.한문 2
 
1.7%
전문지도사 2
 
1.7%
한자능력급수 2
 
1.7%
상공회의소 2
 
1.7%
kbs한국어능력시험 1
 
0.8%
실천예절지도사 1
 
0.8%
종이접기 1
 
0.8%
한국실용글쓰기검정 1
 
0.8%
Other values (98) 98
83.1%
2023-12-13T06:11:12.039453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
44
 
4.8%
) 26
 
2.8%
( 26
 
2.8%
25
 
2.7%
25
 
2.7%
24
 
2.6%
T 21
 
2.3%
21
 
2.3%
21
 
2.3%
20
 
2.2%
Other values (192) 662
72.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 643
70.3%
Uppercase Letter 138
 
15.1%
Lowercase Letter 46
 
5.0%
Close Punctuation 26
 
2.8%
Open Punctuation 26
 
2.8%
Space Separator 20
 
2.2%
Other Punctuation 12
 
1.3%
Dash Punctuation 3
 
0.3%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
6.8%
25
 
3.9%
25
 
3.9%
24
 
3.7%
21
 
3.3%
21
 
3.3%
18
 
2.8%
16
 
2.5%
15
 
2.3%
12
 
1.9%
Other values (150) 422
65.6%
Uppercase Letter
ValueCountFrequency (%)
T 21
15.2%
E 17
12.3%
S 13
9.4%
P 13
9.4%
I 9
 
6.5%
F 9
 
6.5%
L 9
 
6.5%
A 9
 
6.5%
C 8
 
5.8%
X 7
 
5.1%
Other values (8) 23
16.7%
Lowercase Letter
ValueCountFrequency (%)
e 7
15.2%
s 6
13.0%
i 5
10.9%
n 5
10.9%
a 4
8.7%
c 4
8.7%
r 3
6.5%
o 3
6.5%
t 3
6.5%
d 1
 
2.2%
Other values (5) 5
10.9%
Other Punctuation
ValueCountFrequency (%)
/ 7
58.3%
· 2
 
16.7%
. 2
 
16.7%
, 1
 
8.3%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Space Separator
ValueCountFrequency (%)
20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 643
70.3%
Latin 184
 
20.1%
Common 88
 
9.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
6.8%
25
 
3.9%
25
 
3.9%
24
 
3.7%
21
 
3.3%
21
 
3.3%
18
 
2.8%
16
 
2.5%
15
 
2.3%
12
 
1.9%
Other values (150) 422
65.6%
Latin
ValueCountFrequency (%)
T 21
 
11.4%
E 17
 
9.2%
S 13
 
7.1%
P 13
 
7.1%
I 9
 
4.9%
F 9
 
4.9%
L 9
 
4.9%
A 9
 
4.9%
C 8
 
4.3%
X 7
 
3.8%
Other values (23) 69
37.5%
Common
ValueCountFrequency (%)
) 26
29.5%
( 26
29.5%
20
22.7%
/ 7
 
8.0%
- 3
 
3.4%
· 2
 
2.3%
. 2
 
2.3%
, 1
 
1.1%
+ 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 643
70.3%
ASCII 270
29.5%
None 2
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
44
 
6.8%
25
 
3.9%
25
 
3.9%
24
 
3.7%
21
 
3.3%
21
 
3.3%
18
 
2.8%
16
 
2.5%
15
 
2.3%
12
 
1.9%
Other values (150) 422
65.6%
ASCII
ValueCountFrequency (%)
) 26
 
9.6%
( 26
 
9.6%
T 21
 
7.8%
20
 
7.4%
E 17
 
6.3%
S 13
 
4.8%
P 13
 
4.8%
I 9
 
3.3%
F 9
 
3.3%
L 9
 
3.3%
Other values (31) 107
39.6%
None
ValueCountFrequency (%)
· 2
100.0%

등급
Categorical

Distinct40
Distinct (%)40.8%
Missing0
Missing (%)0.0%
Memory size916.0 B
등급없음
30 
1급, 2급
11 
1급, 2급, 3급
1A급, 1B급, 1C급, 2A급, 2B급, 2C급, 3A급, 3B급, 3C급
2급
Other values (35)
38 

Length

Max length43
Median length34
Mean length11.040816
Min length2

Unique

Unique32 ?
Unique (%)32.7%

Sample

1st row2급
2nd row등급없음
3rd row등급없음
4th row등급없음
5th row등급없음

Common Values

ValueCountFrequency (%)
등급없음 30
30.6%
1급, 2급 11
 
11.2%
1급, 2급, 3급 8
 
8.2%
1A급, 1B급, 1C급, 2A급, 2B급, 2C급, 3A급, 3B급, 3C급 6
 
6.1%
2급 5
 
5.1%
S급, 1급, 2급, 3급 2
 
2.0%
사범, 1급, 준1급, 2급, 준2급 2
 
2.0%
1급, 2급, 3급, 4급 2
 
2.0%
전문가 1
 
1.0%
Ⅰ종, Ⅱ종 1
 
1.0%
Other values (30) 30
30.6%

Length

2023-12-13T06:11:12.237755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2급 38
 
14.8%
1급 33
 
12.8%
등급없음 30
 
11.7%
3급 18
 
7.0%
2b급 7
 
2.7%
3c급 7
 
2.7%
3b급 7
 
2.7%
2c급 7
 
2.7%
3a급 7
 
2.7%
2a급 7
 
2.7%
Other values (65) 96
37.4%
Distinct59
Distinct (%)60.2%
Missing0
Missing (%)0.0%
Memory size916.0 B
2023-12-13T06:11:12.524336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length10.234694
Min length6

Characters and Unicode

Total characters1003
Distinct characters146
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)42.9%

Sample

1st row한국의료기기안전정보원
2nd row(사)신용정보협회
3rd row(사)한국금융연수원
4th row(사)한국금융연수원
5th row(사)한국금융연수원
ValueCountFrequency (%)
대한상공회의소 10
 
10.0%
한국생산성본부 9
 
9.0%
사)한국금융연수원 6
 
6.0%
사)대한민국한자교육연구회·대한검정회 4
 
4.0%
재)한국데이터산업진흥원 3
 
3.0%
매일경제신문사 2
 
2.0%
사)한국시각장애인연합회 2
 
2.0%
사)한자교육진흥회 2
 
2.0%
사)한국어문회 2
 
2.0%
사)한국정보평가협회 2
 
2.0%
Other values (50) 58
58.0%
2023-12-13T06:11:12.972075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
93
 
9.3%
75
 
7.5%
66
 
6.6%
61
 
6.1%
( 56
 
5.6%
) 56
 
5.6%
29
 
2.9%
24
 
2.4%
23
 
2.3%
19
 
1.9%
Other values (136) 501
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 880
87.7%
Open Punctuation 56
 
5.6%
Close Punctuation 56
 
5.6%
Other Punctuation 5
 
0.5%
Space Separator 3
 
0.3%
Uppercase Letter 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
93
 
10.6%
75
 
8.5%
66
 
7.5%
61
 
6.9%
29
 
3.3%
24
 
2.7%
23
 
2.6%
19
 
2.2%
19
 
2.2%
16
 
1.8%
Other values (128) 455
51.7%
Uppercase Letter
ValueCountFrequency (%)
S 1
33.3%
B 1
33.3%
K 1
33.3%
Other Punctuation
ValueCountFrequency (%)
· 4
80.0%
/ 1
 
20.0%
Open Punctuation
ValueCountFrequency (%)
( 56
100.0%
Close Punctuation
ValueCountFrequency (%)
) 56
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 880
87.7%
Common 120
 
12.0%
Latin 3
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
93
 
10.6%
75
 
8.5%
66
 
7.5%
61
 
6.9%
29
 
3.3%
24
 
2.7%
23
 
2.6%
19
 
2.2%
19
 
2.2%
16
 
1.8%
Other values (128) 455
51.7%
Common
ValueCountFrequency (%)
( 56
46.7%
) 56
46.7%
· 4
 
3.3%
3
 
2.5%
/ 1
 
0.8%
Latin
ValueCountFrequency (%)
S 1
33.3%
B 1
33.3%
K 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 880
87.7%
ASCII 119
 
11.9%
None 4
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
93
 
10.6%
75
 
8.5%
66
 
7.5%
61
 
6.9%
29
 
3.3%
24
 
2.7%
23
 
2.6%
19
 
2.2%
19
 
2.2%
16
 
1.8%
Other values (128) 455
51.7%
ASCII
ValueCountFrequency (%)
( 56
47.1%
) 56
47.1%
3
 
2.5%
/ 1
 
0.8%
S 1
 
0.8%
B 1
 
0.8%
K 1
 
0.8%
None
ValueCountFrequency (%)
· 4
100.0%

공인유효기간(시작)
Categorical

HIGH CORRELATION 

Distinct40
Distinct (%)40.8%
Missing0
Missing (%)0.0%
Memory size916.0 B
2022-01-01
25 
2023-01-01
18 
2021-01-01
2020-01-01
 
4
2020-01-20
 
3
Other values (35)
42 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique28 ?
Unique (%)28.6%

Sample

1st row2019-01-01
2nd row2023-02-15
3rd row2021-02-15
4th row2020-01-20
5th row2020-01-20

Common Values

ValueCountFrequency (%)
2022-01-01 25
25.5%
2023-01-01 18
18.4%
2021-01-01 6
 
6.1%
2020-01-01 4
 
4.1%
2020-01-20 3
 
3.1%
2021-02-17 2
 
2.0%
2023-02-17 2
 
2.0%
2021-12-13 2
 
2.0%
2021-01-20 2
 
2.0%
2023-04-01 2
 
2.0%
Other values (30) 32
32.7%

Length

2023-12-13T06:11:13.140963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2022-01-01 25
25.5%
2023-01-01 18
18.4%
2021-01-01 6
 
6.1%
2020-01-01 4
 
4.1%
2020-01-20 3
 
3.1%
2023-04-01 2
 
2.0%
2019-01-01 2
 
2.0%
2021-09-30 2
 
2.0%
2021-01-20 2
 
2.0%
2021-12-13 2
 
2.0%
Other values (30) 32
32.7%

공인유효기간(만료)
Categorical

HIGH CORRELATION 

Distinct40
Distinct (%)40.8%
Missing0
Missing (%)0.0%
Memory size916.0 B
2023-12-31
17 
2025-12-31
14 
2024-12-31
13 
2027-12-31
2026-02-16
 
3
Other values (35)
42 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique29 ?
Unique (%)29.6%

Sample

1st row2023-12-31
2nd row2028-02-14
3rd row2024-02-14
4th row2024-12-31
5th row2024-12-31

Common Values

ValueCountFrequency (%)
2023-12-31 17
17.3%
2025-12-31 14
14.3%
2024-12-31 13
13.3%
2027-12-31 9
 
9.2%
2026-02-16 3
 
3.1%
2026-12-31 3
 
3.1%
2024-12-12 2
 
2.0%
2026-03-31 2
 
2.0%
2026-09-29 2
 
2.0%
2026-01-19 2
 
2.0%
Other values (30) 31
31.6%

Length

2023-12-13T06:11:13.289081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2023-12-31 17
17.3%
2025-12-31 14
14.3%
2024-12-31 13
13.3%
2027-12-31 9
 
9.2%
2026-02-16 3
 
3.1%
2026-12-31 3
 
3.1%
2026-01-19 2
 
2.0%
2025-11-30 2
 
2.0%
2026-09-29 2
 
2.0%
2026-03-31 2
 
2.0%
Other values (30) 31
31.6%
Distinct59
Distinct (%)62.8%
Missing4
Missing (%)4.1%
Memory size916.0 B
Minimum2000-12-22 00:00:00
Maximum2021-05-01 00:00:00
2023-12-13T06:11:13.442982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:11:13.623886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

기공인기간(만료)
Categorical

HIGH CORRELATION 

Distinct42
Distinct (%)42.9%
Missing0
Missing (%)0.0%
Memory size916.0 B
2021-12-31
24 
2022-12-31
17 
2020-12-31
<NA>
 
4
2019-12-31
 
3
Other values (37)
45 

Length

Max length10
Median length10
Mean length9.755102
Min length4

Unique

Unique30 ?
Unique (%)30.6%

Sample

1st row<NA>
2nd row2023-02-14
3rd row2021-02-14
4th row2020-01-19
5th row2020-01-19

Common Values

ValueCountFrequency (%)
2021-12-31 24
24.5%
2022-12-31 17
17.3%
2020-12-31 5
 
5.1%
<NA> 4
 
4.1%
2019-12-31 3
 
3.1%
2020-01-19 3
 
3.1%
2023-02-16 2
 
2.0%
2021-01-19 2
 
2.0%
2021-02-16 2
 
2.0%
2021-09-29 2
 
2.0%
Other values (32) 34
34.7%

Length

2023-12-13T06:11:13.764971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2021-12-31 24
24.5%
2022-12-31 17
17.3%
2020-12-31 5
 
5.1%
na 4
 
4.1%
2019-12-31 3
 
3.1%
2020-01-19 3
 
3.1%
2021-12-12 2
 
2.0%
2023-03-31 2
 
2.0%
2021-09-29 2
 
2.0%
2021-02-16 2
 
2.0%
Other values (32) 34
34.7%

Correlations

2023-12-13T06:11:13.864852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소관 부처자격종목등급자격관리자공인유효기간(시작)공인유효기간(만료)기공인기간(시작)기공인기간(만료)
소관 부처1.0001.0000.0000.9920.8720.8180.9600.873
자격종목1.0001.0000.0001.0001.0000.9960.8350.997
등급0.0000.0001.0000.8410.0000.0000.9710.000
자격관리자0.9921.0000.8411.0000.9810.9800.9760.979
공인유효기간(시작)0.8721.0000.0000.9811.0000.9990.9901.000
공인유효기간(만료)0.8180.9960.0000.9800.9991.0000.9860.996
기공인기간(시작)0.9600.8350.9710.9760.9900.9861.0000.989
기공인기간(만료)0.8730.9970.0000.9791.0000.9960.9891.000
2023-12-13T06:11:14.012245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기공인기간(만료)등급소관 부처공인유효기간(시작)공인유효기간(만료)
기공인기간(만료)1.0000.0000.3690.9910.844
등급0.0001.0000.0000.0000.000
소관 부처0.3690.0001.0000.3700.304
공인유효기간(시작)0.9910.0000.3701.0000.859
공인유효기간(만료)0.8440.0000.3040.8591.000
2023-12-13T06:11:14.120425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소관 부처등급공인유효기간(시작)공인유효기간(만료)기공인기간(만료)
소관 부처1.0000.0000.3700.3040.369
등급0.0001.0000.0000.0000.000
공인유효기간(시작)0.3700.0001.0000.8590.991
공인유효기간(만료)0.3040.0000.8591.0000.844
기공인기간(만료)0.3690.0000.9910.8441.000

Missing values

2023-12-13T06:11:10.979295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:11:11.157231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

소관 부처자격종목등급자격관리자공인유효기간(시작)공인유효기간(만료)기공인기간(시작)기공인기간(만료)
0식품의약품안전처의료기기 RA전문가2급한국의료기기안전정보원2019-01-012023-12-31<NA><NA>
1금융위원회신용관리사등급없음(사)신용정보협회2023-02-152028-02-142006-02-152023-02-14
2금융위원회신용위험분석사(CRA)등급없음(사)한국금융연수원2021-02-152024-02-142006-02-152021-02-14
3금융위원회신용분석사등급없음(사)한국금융연수원2020-01-202024-12-312001-01-202020-01-19
4금융위원회여신심사역등급없음(사)한국금융연수원2020-01-202024-12-312001-01-202020-01-19
5금융위원회자산관리사(FP)등급없음(사)한국금융연수원2021-01-052026-01-042005-01-052021-01-04
6금융위원회재경관리사등급없음삼일회계법인2023-04-012026-03-312007-04-012023-03-31
7금융위원회회계관리1급, 2급삼일회계법인2023-04-012026-03-312007-04-012023-03-31
8금융위원회AT(Accounting Technician)TAT1급, TAT2급, FAT1급, FAT2급한국공인회계사회2022-12-012025-11-302015-12-012022-11-30
9금융위원회개인보험심사역등급없음사단법인 보험연수원2021-01-012025-12-312016-01-012020-12-31
소관 부처자격종목등급자격관리자공인유효기간(시작)공인유효기간(만료)기공인기간(시작)기공인기간(만료)
88국토교통부실내디자이너등급없음한국실내건축가협회2023-01-012025-12-312017-01-012022-12-31
89국토교통부보상관리사등급없음(사)한국토지보상관리사협회2023-01-012027-12-31<NA><NA>
90관세청원산지관리사등급없음(재)국제원산지정보원2022-01-012024-12-312013-01-012021-12-31
91경찰청열쇠관리사1급, 2급(사)한국열쇠협회2023-01-032026-01-022005-01-032023-01-02
92경찰청도로교통사고감정사등급없음도로교통공단2021-04-062024-04-052007-04-062021-04-05
93경찰청신변보호사등급없음(사)한국경비협회2021-01-012025-12-312013-01-012020-12-31
94산림청수목보호기술자격기술자(사)한국수목보호협회2020-01-152025-01-142001-04-012020-01-14
95산림청분재관리사전문관리사, 1급, 2급(사)한국분재조합2019-02-012024-01-312002-02-012019-01-31
96산림청조경수조성관리사2급, 3급(사)한국조경수협회2020-11-162025-11-152010-11-162020-11-15
97특허청지식재산능력시험1급, 2급, 3급, 4급한국발명진흥회2023-01-012027-12-312018-01-012022-12-31