Overview

Dataset statistics

Number of variables: 5
Number of observations: 49
Missing cells: 3
Missing cells (%): 1.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.0 KiB
Average record size in memory: 42.7 B
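
The missing-cell percentage above can be checked by hand. The sketch below assumes it is simply missing cells divided by all cells (variables times observations), which reproduces the reported 1.2%. The helper name `missing_share` is hypothetical and not part of the profiling tool.

```python
def missing_share(missing: int, total: int) -> float:
    """Return the share of missing entries as a percentage."""
    return 100.0 * missing / total


# Figures taken from the dataset statistics in this report.
n_variables, n_observations, missing_cells = 5, 49, 3

# Assumption: the dataset-level figure is missing cells over all cells.
cell_pct = missing_share(missing_cells, n_variables * n_observations)

# The 3 missing values all sit in one column, so the per-column share is over 49 rows.
column_pct = missing_share(missing_cells, n_observations)

print(f"Missing cells: {cell_pct:.1f}%")
print(f"Missing in one column: {column_pct:.1f}%")
```

The second figure corresponds to the 6.1% reported for 수료일 in the Alerts section.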

Variable types

Text: 2
Categorical: 1
DateTime: 2

Dataset

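Description: Information on the certification auditors registered in the hygiene and safety standard certification registration network of the Korea Institute for Water Technology Certification (한국물기술인증원): name, training institution name, course name, and completion date.
Author: Ministry of Environment (환경부)
URL: https://www.data.go.kr/data/15071367/fileData.do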

Alerts

교육기관명 is highly imbalanced (81.9%) [Imbalance]
수료일 has 3 (6.1%) missing values [Missing]
성명 has unique values [Unique]
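
The 81.9% imbalance figure can be reproduced from the category counts reported for 교육기관명 (47, 1, 1). The sketch below assumes the score is one minus the Shannon entropy of the class shares divided by the maximum possible entropy, log2 of the number of classes. That formula matches the reported value, but whether the profiler computes it in exactly this way is an assumption. The function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the entropy (bits) of the class shares.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k <= 1:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Category counts for 교육기관명 as listed in this report.
counts = [47, 1, 1]
score = imbalance_score(counts)
print(f"{score:.1%}")
```

A perfectly balanced column such as `[10, 10, 10]` would score 0.0 under the same definition.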

Reproduction

Analysis started: 2024-05-04 07:41:28.127620
Analysis finished: 2024-05-04 07:41:29.070911
Duration: 0.94 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

성명 (name)
Text

UNIQUE

Distinct: 49
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B

Length

Max length: 3
Median length: 3
Mean length: 2.9795918
Min length: 2

Characters and Unicode

Total characters: 146
Distinct characters: 70
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
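
As a rough illustration of these character properties, Python's standard `unicodedata` module returns the general category code of each code point. The category names used in this report map to those codes as follows: "Other Letter" is `Lo`, "Uppercase Letter" is `Lu`, "Space Separator" is `Zs`, "Other Punctuation" is `Po`, "Open Punctuation" is `Ps`, and "Close Punctuation" is `Pe`. The sample strings come from the Sample rows of this report. The helper `category_counts` is hypothetical, and script and block lookups are not covered because the standard library does not expose them.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in the given strings."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


# First five 성명 values from the Sample section: Hangul syllables only.
names = ["김대수", "김종수", "남복현", "류승현", "박치순"]
name_counts = category_counts(names)

# First 교육명 value from the Sample section: mixed Latin, Hangul, and punctuation.
course_counts = category_counts(["KS인증심사원(B, D, F, M, P)"])

print(dict(name_counts))
print(dict(course_counts))
```

Hangul syllables all fall into `Lo`, which is why 성명 reports a single distinct category, while the course names spread over six categories.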

Unique

Unique: 49
Unique (%): 100.0%

Sample

1st row: 김대수
2nd row: 김종수
3rd row: 남복현
4th row: 류승현
5th row: 박치순

Value  Count  Frequency (%)
김대수  1  2.0%
정용  1  2.0%
최강하  1  2.0%
최재본  1  2.0%
한상덕  1  2.0%
허영태  1  2.0%
오정석  1  2.0%
박경도  1  2.0%
박순덕  1  2.0%
정종한  1  2.0%
Other values (39)  39  79.6%

Most occurring characters

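Value  Count  Frequency (%)
(not preserved)  9  6.2%
(not preserved)  8  5.5%
(not preserved)  7  4.8%
(not preserved)  5  3.4%
(not preserved)  5  3.4%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
Other values (60)  92  63.0%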

Most occurring categories

Value  Count  Frequency (%)
Other Letter  146  100.0%

Most frequent character per category

Other Letter
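Value  Count  Frequency (%)
(not preserved)  9  6.2%
(not preserved)  8  5.5%
(not preserved)  7  4.8%
(not preserved)  5  3.4%
(not preserved)  5  3.4%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
Other values (60)  92  63.0%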

Most occurring scripts

Value  Count  Frequency (%)
Hangul  146  100.0%

Most frequent character per script

Hangul
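Value  Count  Frequency (%)
(not preserved)  9  6.2%
(not preserved)  8  5.5%
(not preserved)  7  4.8%
(not preserved)  5  3.4%
(not preserved)  5  3.4%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
Other values (60)  92  63.0%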

Most occurring blocks

Value  Count  Frequency (%)
Hangul  146  100.0%

Most frequent character per block

Hangul
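Value  Count  Frequency (%)
(not preserved)  9  6.2%
(not preserved)  8  5.5%
(not preserved)  7  4.8%
(not preserved)  5  3.4%
(not preserved)  5  3.4%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
(not preserved)  4  2.7%
Other values (60)  92  63.0%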

교육기관명 (training institution name)
Categorical

IMBALANCE

Distinct: 3
Distinct (%): 6.1%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B

Value  Count
한국표준협회  47
한국계량측정협회  1
한국인증지원센터  1

Length

Max length: 8
Median length: 6
Mean length: 6.0816327
Min length: 6

Unique

Unique: 2
Unique (%): 4.1%

Sample

1st row: 한국표준협회
2nd row: 한국표준협회
3rd row: 한국표준협회
4th row: 한국표준협회
5th row: 한국표준협회

Common Values

Value  Count  Frequency (%)
한국표준협회  47  95.9%
한국계량측정협회  1  2.0%
한국인증지원센터  1  2.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
한국표준협회  47  95.9%
한국계량측정협회  1  2.0%
한국인증지원센터  1  2.0%

교육명 (course name)
Text

Distinct: 37
Distinct (%): 75.5%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B

Length

Max length: 28
Median length: 22
Mean length: 20.612245
Min length: 10

Characters and Unicode

Total characters: 1010
Distinct characters: 37
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 32
Unique (%): 65.3%

Sample

1st row: KS인증심사원(B, D, F, M, P)
2nd row: KS인증심사원(B, D, F, M, J)
3rd row: KS인증심사원(B, D, G, P, R)
4th row: KS인증심사원(B, D, F, M, R)
5th row: KS인증심사원(B, C, D, F, M)

Value  Count  Frequency (%)
f  34  15.4%
ks인증심사원(b  33  14.9%
m  33  14.9%
d  29  13.1%
l  18  8.1%
g  12  5.4%
c  11  5.0%
ks인증심사원(d  6  2.7%
p  5  2.3%
t  5  2.3%
Other values (16)  35  15.8%

Most occurring characters

Value  Count  Frequency (%)
(space)  172  17.0%
,  169  16.7%
S  53  5.2%
(not preserved)  50  5.0%
(  49  4.9%
)  49  4.9%
K  49  4.9%
(not preserved)  47  4.7%
(not preserved)  47  4.7%
(not preserved)  47  4.7%
Other values (27)  278  27.5%

Most occurring categories

Value  Count  Frequency (%)
Uppercase Letter  318  31.5%
Other Letter  253  25.0%
Space Separator  172  17.0%
Other Punctuation  169  16.7%
Open Punctuation  49  4.9%
Close Punctuation  49  4.9%

Most frequent character per category

Uppercase Letter
Value  Count  Frequency (%)
S  53  16.7%
K  49  15.4%
F  37  11.6%
D  35  11.0%
B  33  10.4%
M  33  10.4%
L  20  6.3%
C  15  4.7%
G  12  3.8%
P  5  1.6%
Other values (9)  26  8.2%

Other Letter
Value  Count  Frequency (%)
(not preserved)  50  19.8%
(not preserved)  47  18.6%
(not preserved)  47  18.6%
(not preserved)  47  18.6%
(not preserved)  47  18.6%
(not preserved)  2  0.8%
(not preserved)  2  0.8%
(not preserved)  2  0.8%
(not preserved)  2  0.8%
(not preserved)  2  0.8%
Other values (4)  5  2.0%

Space Separator
Value  Count  Frequency (%)
(space)  172  100.0%

Other Punctuation
Value  Count  Frequency (%)
,  169  100.0%

Open Punctuation
Value  Count  Frequency (%)
(  49  100.0%

Close Punctuation
Value  Count  Frequency (%)
)  49  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Common  439  43.5%
Latin  318  31.5%
Hangul  253  25.0%

Most frequent character per script

Latin
Value  Count  Frequency (%)
S  53  16.7%
K  49  15.4%
F  37  11.6%
D  35  11.0%
B  33  10.4%
M  33  10.4%
L  20  6.3%
C  15  4.7%
G  12  3.8%
P  5  1.6%
Other values (9)  26  8.2%

Hangul
Value  Count  Frequency (%)
(not preserved)  50  19.8%
(not preserved)  47  18.6%
(not preserved)  47  18.6%
(not preserved)  47  18.6%
(not preserved)  47  18.6%
(not preserved)  2  0.8%
(not preserved)  2  0.8%
(not preserved)  2  0.8%
(not preserved)  2  0.8%
(not preserved)  2  0.8%
Other values (4)  5  2.0%

Common
Value  Count  Frequency (%)
(space)  172  39.2%
,  169  38.5%
(  49  11.2%
)  49  11.2%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  757  75.0%
Hangul  253  25.0%

Most frequent character per block

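ASCII
Value  Count  Frequency (%)
(space)  172  22.7%
,  169  22.3%
S  53  7.0%
(  49  6.5%
)  49  6.5%
K  49  6.5%
F  37  4.9%
D  35  4.6%
B  33  4.4%
M  33  4.4%
Other values (13)  78  10.3%

Hangul
Value  Count  Frequency (%)
(not preserved)  50  19.8%
(not preserved)  47  18.6%
(not preserved)  47  18.6%
(not preserved)  47  18.6%
(not preserved)  47  18.6%
(not preserved)  2  0.8%
(not preserved)  2  0.8%
(not preserved)  2  0.8%
(not preserved)  2  0.8%
(not preserved)  2  0.8%
Other values (4)  5  2.0%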

수료일 (completion date)
Date

MISSING

Distinct: 22
Distinct (%): 47.8%
Missing: 3
Missing (%): 6.1%
Memory size: 524.0 B
Minimum: 2020-10-13 00:00:00
Maximum: 2022-05-04 00:00:00
Histogram with fixed size bins (bins=22)

만료일 (expiration date)
Date

Distinct: 35
Distinct (%): 71.4%
Missing: 0
Missing (%): 0.0%
Memory size: 524.0 B
Minimum: 2022-02-26 00:00:00
Maximum: 2025-03-28 00:00:00
Histogram with fixed size bins (bins=35)

Correlations

(variable)  성명  교육기관명  교육명  수료일  만료일
성명  1.000  1.000  1.000  1.000  1.000
교육기관명  1.000  1.000  1.000  1.000  1.000
교육명  1.000  1.000  1.000  0.916  0.606
수료일  1.000  1.000  0.916  1.000  0.771
만료일  1.000  1.000  0.606  0.771  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

index  성명  교육기관명  교육명  수료일  만료일
0  김대수  한국표준협회  KS인증심사원(B, D, F, M, P)  2022-05-04  2022-12-18
1  김종수  한국표준협회  KS인증심사원(B, D, F, M, J)  2022-02-25  2022-08-26
2  남복현  한국표준협회  KS인증심사원(B, D, G, P, R)  2022-02-25  2022-12-05
3  류승현  한국표준협회  KS인증심사원(B, D, F, M, R)  2021-09-17  2022-09-01
4  박치순  한국표준협회  KS인증심사원(B, C, D, F, M)  2022-04-22  2022-11-13
5  박현출  한국표준협회  KS인증심사원(B, D, F, G, M)  2021-07-23  2023-02-10
6  이환성  한국표준협회  KS인증심사원(B, D, F, L, M)  2022-02-25  2022-12-04
7  임춘순  한국표준협회  KS인증심사원(B, C, D, F, M)  2022-01-28  2022-11-03
8  김동식  한국표준협회  KS인증심사원(B, F, M, I, R)  2021-10-05  2022-12-05
9  남선광  한국표준협회  KS인증심사원(I, M)  2021-10-05  2023-02-10

Last rows

index  성명  교육기관명  교육명  수료일  만료일
39  이종현  한국표준협회  KS인증심사원(B, D, F, L, W)  2022-01-10  2025-01-16
40  한용석  한국표준협회  KS인증심사원(B, C, P)  2022-04-24  2022-11-04
41  박영우  한국표준협회  KS인증심사원(B, D, F, L, M)  2022-01-10  2025-02-22
42  정현구  한국표준협회  KS인증심사원(B, D, F, G, M)  2022-04-22  2023-03-31
43  조구영  한국표준협회  KS인증심사원(D, F, G, L, M)  2022-05-02  2025-01-17
44  홍현기  한국표준협회  KS인증심사원(D, F, G, L, M)  2022-03-11  2022-02-26
45  노용수  한국표준협회  KS인증심사원(B, C, D, P, X)  2022-04-22  2024-05-30
46  이재환  한국표준협회  KS인증심사원(F, Q)  2021-09-17  2024-10-31
47  이상규  한국표준협회  KS인증심사원(C)  2021-04-16  2024-05-30
48  이승덕  한국인증지원센터  KOLAS 평가사(시험, 검사, 교정 분야)  2020-10-13  2023-11-29
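
In the sample rows, row 44 lists a 만료일 (2022-02-26) that is earlier than its 수료일 (2022-03-11). The source does not say whether this is a data-entry error or whether the two dates refer to different events, so the sketch below only flags such rows for review. It uses three rows copied from the sample, and the helper name `expired_before_completion` is hypothetical.

```python
from datetime import date

# (index, 수료일, 만료일) copied from rows 43 to 45 of the sample.
rows = [
    (43, date(2022, 5, 2), date(2025, 1, 17)),
    (44, date(2022, 3, 11), date(2022, 2, 26)),
    (45, date(2022, 4, 22), date(2024, 5, 30)),
]


def expired_before_completion(records):
    """Return the indexes of rows whose expiration date precedes the completion date."""
    return [idx for idx, completed, expires in records if expires < completed]


suspect = expired_before_completion(rows)
print(suspect)
```

Rows with a missing 수료일 would need to be skipped or handled separately before applying the same comparison to the full dataset.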