Overview

Dataset statistics

Number of variables: 4
Number of observations: 58
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.0 KiB
Average record size in memory: 35.3 B
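
The average record size is the total memory footprint divided by the number of observations. A quick check of that arithmetic, assuming the reported 2.0 KiB is itself a rounded figure (so the result can only approximate the report):

```python
# Figures copied from the dataset statistics; total_kib is rounded in the report.
total_kib = 2.0
n_observations = 58

total_bytes = total_kib * 1024              # 1 KiB = 1024 bytes
avg_record_bytes = total_bytes / n_observations

print(f"{avg_record_bytes:.1f} B per record")
```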

Variable types

Text: 3
Categorical: 1

Dataset

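Description: Status of the training courses that develop technical personnel for the accountable maintenance of natural gas production and supply facilities. The information is meant to be useful to private-sector organizations that run gas-facility training or manufacture such equipment.
Author: (주)한국가스기술공사
URL: https://www.data.go.kr/data/15020719/fileData.do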

Alerts

과정명 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 16:50:56.735824
Analysis finished: 2023-12-12 16:50:57.204967
Duration: 0.47 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

과정명 (course name)
Text

UNIQUE 

Distinct: 58
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 596.0 B

Length

Max length: 50
Median length: 34
Mean length: 23.793103
Min length: 12

Characters and Unicode

Total characters: 1380
Distinct characters: 185
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
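
The character properties mentioned above can be read with Python's standard `unicodedata` module. The sketch below counts general categories for one sample value of 과정명; `category_counts` is a hypothetical helper for illustration, and the report does not state that the profiling tool computes its tables this way.

```python
import unicodedata
from collections import Counter

def category_counts(text: str) -> Counter:
    """Count Unicode general-category codes (e.g. 'Lu', 'Lo', 'Zs') in text."""
    return Counter(unicodedata.category(ch) for ch in text)

sample = "GNSS 측량과정(GB)"  # 3rd sample row of 과정명
counts = category_counts(sample)

# Lu = Uppercase Letter, Lo = Other Letter (Hangul syllables),
# Zs = Space Separator, Ps/Pe = Open/Close Punctuation.
for code, n in counts.most_common():
    print(code, n)
```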

Unique

Unique: 58
Unique (%): 100.0%

Sample

1st row 매설배관 관리 실무과정(YB) 1차
2nd row 측량 및 공간정보 실무과정(GB)_위탁
3rd row GNSS 측량과정(GB)
4th row 초경량 비행장치 무인멀티콥터 (드론) 운영 능력 향상과정(GB)_위탁
5th row 감압 System 진단 및 정비과정(YB)
Value               Count  Frequency (%)
(not captured)      14     6.5%
집체                7      3.2%
교육과정(bb         6      2.8%
진단                5      2.3%
신설과정            4      1.8%
실무과정(gb         4      1.8%
실무                4      1.8%
관리                4      1.8%
과정(gb             4      1.8%
위탁                4      1.8%
Other values (136)  161    74.2%

Most occurring characters

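Value                   Count  Frequency (%)
(space)                 271    19.6%
(Hangul, not captured)  74     5.4%
(                       66     4.8%
B                       66     4.8%
)                       66     4.8%
(Hangul, not captured)  59     4.3%
G                       33     2.4%
(Hangul, not captured)  27     2.0%
(Hangul, not captured)  20     1.4%
(Hangul, not captured)  20     1.4%
Other values (175)      678    49.1%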

Most occurring categories

Value                  Count  Frequency (%)
Other Letter           621    45.0%
Space Separator        271    19.6%
Uppercase Letter       196    14.2%
Lowercase Letter       101    7.3%
Open Punctuation       70     5.1%
Close Punctuation      69     5.0%
Decimal Number         27     2.0%
Connector Punctuation  14     1.0%
Other Punctuation      9      0.7%
Dash Punctuation       2      0.1%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
(not captured)      74     11.9%
(not captured)      59     9.5%
(not captured)      27     4.3%
(not captured)      20     3.2%
(not captured)      20     3.2%
(not captured)      18     2.9%
(not captured)      17     2.7%
(not captured)      14     2.3%
(not captured)      14     2.3%
(not captured)      14     2.3%
Other values (119)  344    55.4%

Uppercase Letter
Value              Count  Frequency (%)
B                  66     33.7%
G                  33     16.8%
Y                  17     8.7%
S                  14     7.1%
C                  9      4.6%
L                  8      4.1%
I                  6      3.1%
A                  5      2.6%
O                  5      2.6%
P                  5      2.6%
Other values (11)  28     14.3%

Lowercase Letter
Value             Count  Frequency (%)
e                 14     13.9%
n                 11     10.9%
o                 10     9.9%
t                 10     9.9%
a                 8      7.9%
m                 7      6.9%
s                 7      6.9%
i                 7      6.9%
l                 5      5.0%
r                 5      5.0%
Other values (8)  17     16.8%

Decimal Number
Value  Count  Frequency (%)
2      8      29.6%
0      6      22.2%
1      6      22.2%
3      4      14.8%
5      2      7.4%
7      1      3.7%

Other Punctuation
Value  Count  Frequency (%)
'      4      44.4%
,      3      33.3%
&      1      11.1%
/      1      11.1%

Open Punctuation
Value  Count  Frequency (%)
(      66     94.3%
[      4      5.7%

Close Punctuation
Value  Count  Frequency (%)
)      66     95.7%
]      3      4.3%

Space Separator
Value    Count  Frequency (%)
(space)  271    100.0%

Connector Punctuation
Value  Count  Frequency (%)
_      14     100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      2      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  621    45.0%
Common  462    33.5%
Latin   297    21.5%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
(not captured)      74     11.9%
(not captured)      59     9.5%
(not captured)      27     4.3%
(not captured)      20     3.2%
(not captured)      20     3.2%
(not captured)      18     2.9%
(not captured)      17     2.7%
(not captured)      14     2.3%
(not captured)      14     2.3%
(not captured)      14     2.3%
Other values (119)  344    55.4%

Latin
Value              Count  Frequency (%)
B                  66     22.2%
G                  33     11.1%
Y                  17     5.7%
e                  14     4.7%
S                  14     4.7%
n                  11     3.7%
o                  10     3.4%
t                  10     3.4%
C                  9      3.0%
a                  8      2.7%
Other values (29)  105    35.4%

Common
Value             Count  Frequency (%)
(space)           271    58.7%
(                 66     14.3%
)                 66     14.3%
_                 14     3.0%
2                 8      1.7%
0                 6      1.3%
1                 6      1.3%
'                 4      0.9%
3                 4      0.9%
[                 4      0.9%
Other values (7)  13     2.8%

Most occurring blocks

Value   Count  Frequency (%)
ASCII   759    55.0%
Hangul  621    45.0%

Most frequent character per block

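ASCII
Value              Count  Frequency (%)
(space)            271    35.7%
(                  66     8.7%
B                  66     8.7%
)                  66     8.7%
G                  33     4.3%
Y                  17     2.2%
e                  14     1.8%
S                  14     1.8%
_                  14     1.8%
n                  11     1.4%
Other values (46)  187    24.6%

Hangul
Value               Count  Frequency (%)
(not captured)      74     11.9%
(not captured)      59     9.5%
(not captured)      27     4.3%
(not captured)      20     3.2%
(not captured)      20     3.2%
(not captured)      18     2.9%
(not captured)      17     2.7%
(not captured)      14     2.3%
(not captured)      14     2.3%
(not captured)      14     2.3%
Other values (119)  344    55.4%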

교육인원 (number of trainees)
Categorical

Distinct: 5
Distinct (%): 8.6%
Missing: 0
Missing (%): 0.0%
Memory size: 596.0 B

Length

Max length: 2
Median length: 2
Mean length: 1.7931034
Min length: 1

Unique

Unique: 1
Unique (%): 1.7%

Sample

1st row 12
2nd row 10
3rd row 10
4th row 10
5th row 10

Common Values

Value  Count  Frequency (%)
10     39     67.2%
8      8      13.8%
12     7      12.1%
7      3      5.2%
5      1      1.7%
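
The frequency column is each count divided by the 58 observations. A short sketch that recomputes it from the counts in the Common Values table (the dictionary literal is transcribed from the table, not read from the original file):

```python
from collections import Counter

# Counts copied from the Common Values table for 교육인원.
counts = Counter({"10": 39, "8": 8, "12": 7, "7": 3, "5": 1})
total = sum(counts.values())

# Percentage of rows per value, rounded to one decimal as in the report.
freq_pct = {value: round(100 * n / total, 1) for value, n in counts.items()}

# Values that occur exactly once correspond to the "Unique" statistic.
unique_once = [value for value, n in counts.items() if n == 1]

print(total, freq_pct, unique_once)
```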

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values; its counts repeat the Common Values table]

교육시작일 (training start date)
Text

Distinct: 44
Distinct (%): 75.9%
Missing: 0
Missing (%): 0.0%
Memory size: 596.0 B

Length

Max length: 10
Median length: 10
Mean length: 9.4482759
Min length: 2

Characters and Unicode

Total characters: 548
Distinct characters: 13
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 33
Unique (%): 56.9%

Sample

1st row 2023-02-15
2nd row 2023-03-13
3rd row 2023-04-12
4th row 2023-04-25
5th row 2023-05-10

Value              Count  Frequency (%)
미정               4      6.9%
2023-04-25         3      5.2%
2023-07-19         2      3.4%
2023-02-15         2      3.4%
2023-10-23         2      3.4%
2023-10-10         2      3.4%
2023-11-15         2      3.4%
2023-02-06         2      3.4%
2023-03-06         2      3.4%
2023-06-19         2      3.4%
Other values (34)  35     60.3%

Most occurring characters

Value             Count  Frequency (%)
2                 140    25.5%
0                 119    21.7%
-                 108    19.7%
3                 67     12.2%
1                 43     7.8%
6                 14     2.6%
9                 13     2.4%
5                 11     2.0%
4                 9      1.6%
8                 9      1.6%
Other values (3)  15     2.7%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    432    78.8%
Dash Punctuation  108    19.7%
Other Letter      8      1.5%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
2      140    32.4%
0      119    27.5%
3      67     15.5%
1      43     10.0%
6      14     3.2%
9      13     3.0%
5      11     2.5%
4      9      2.1%
8      9      2.1%
7      7      1.6%

Other Letter
Value           Count  Frequency (%)
(not captured)  4      50.0%
(not captured)  4      50.0%

Dash Punctuation
Value  Count  Frequency (%)
-      108    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  540    98.5%
Hangul  8      1.5%

Most frequent character per script

Common
Value  Count  Frequency (%)
2      140    25.9%
0      119    22.0%
-      108    20.0%
3      67     12.4%
1      43     8.0%
6      14     2.6%
9      13     2.4%
5      11     2.0%
4      9      1.7%
8      9      1.7%

Hangul
Value           Count  Frequency (%)
(not captured)  4      50.0%
(not captured)  4      50.0%

Most occurring blocks

Value   Count  Frequency (%)
ASCII   540    98.5%
Hangul  8      1.5%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
2      140    25.9%
0      119    22.0%
-      108    20.0%
3      67     12.4%
1      43     8.0%
6      14     2.6%
9      13     2.4%
5      11     2.0%
4      9      1.7%
8      9      1.7%

Hangul
Value           Count  Frequency (%)
(not captured)  4      50.0%
(not captured)  4      50.0%

교육종료일 (training end date)
Text

Distinct: 34
Distinct (%): 58.6%
Missing: 0
Missing (%): 0.0%
Memory size: 596.0 B

Length

Max length: 10
Median length: 10
Mean length: 9.4482759
Min length: 2

Characters and Unicode

Total characters: 548
Distinct characters: 13
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 18
Unique (%): 31.0%

Sample

1st row 2023-02-17
2nd row 2023-03-17
3rd row 2023-04-14
4th row 2023-04-28
5th row 2023-05-12

Value              Count  Frequency (%)
미정               4      6.9%
2023-04-28         4      6.9%
2023-02-17         3      5.2%
2023-10-13         3      5.2%
2023-11-17         3      5.2%
2023-10-27         3      5.2%
2023-07-21         2      3.4%
2023-10-20         2      3.4%
2023-04-14         2      3.4%
2023-06-09         2      3.4%
Other values (24)  30     51.7%

Most occurring characters

Value             Count  Frequency (%)
2                 143    26.1%
0                 116    21.2%
-                 108    19.7%
3                 66     12.0%
1                 45     8.2%
4                 14     2.6%
7                 14     2.6%
6                 10     1.8%
8                 9      1.6%
9                 9      1.6%
Other values (3)  14     2.6%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    432    78.8%
Dash Punctuation  108    19.7%
Other Letter      8      1.5%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
2      143    33.1%
0      116    26.9%
3      66     15.3%
1      45     10.4%
4      14     3.2%
7      14     3.2%
6      10     2.3%
8      9      2.1%
9      9      2.1%
5      6      1.4%

Other Letter
Value           Count  Frequency (%)
(not captured)  4      50.0%
(not captured)  4      50.0%

Dash Punctuation
Value  Count  Frequency (%)
-      108    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  540    98.5%
Hangul  8      1.5%

Most frequent character per script

Common
Value  Count  Frequency (%)
2      143    26.5%
0      116    21.5%
-      108    20.0%
3      66     12.2%
1      45     8.3%
4      14     2.6%
7      14     2.6%
6      10     1.9%
8      9      1.7%
9      9      1.7%

Hangul
Value           Count  Frequency (%)
(not captured)  4      50.0%
(not captured)  4      50.0%

Most occurring blocks

Value   Count  Frequency (%)
ASCII   540    98.5%
Hangul  8      1.5%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
2      143    26.5%
0      116    21.5%
-      108    20.0%
3      66     12.2%
1      45     8.3%
4      14     2.6%
7      14     2.6%
6      10     1.9%
8      9      1.7%
9      9      1.7%

Hangul
Value           Count  Frequency (%)
(not captured)  4      50.0%
(not captured)  4      50.0%

Correlations

            과정명  교육인원  교육시작일  교육종료일
과정명      1.000   1.000     1.000       1.000
교육인원    1.000   1.000     0.541       0.000
교육시작일  1.000   0.541     1.000       0.997
교육종료일  1.000   0.000     0.997       1.000
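
The report does not say which association measure produced this matrix. For label-valued columns such as these, Cramér's V is one common choice, so the following is a minimal sketch of the uncorrected statistic under that assumption. `cramers_v` is a hypothetical helper written for illustration, not part of the profiling tool's API, and the tool may apply corrections this sketch omits.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed count per (x, y) cell
    row = Counter(xs)              # marginal counts for x
    col = Counter(ys)              # marginal counts for y

    # Pearson chi-square over every cell of the contingency table.
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row), len(col)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

perfect = cramers_v(list("aabb"), [1, 1, 2, 2])      # y determined by x
independent = cramers_v(list("aabb"), [1, 2, 1, 2])  # no association
all_unique = cramers_v(list("abcd"), [1, 1, 2, 2])   # x has no repeats
print(perfect, independent, all_unique)
```

Under this uncorrected formula, a column whose values are all distinct scores 1.0 against any non-constant column, which would be consistent with the 1.000 entries in the 과정명 row.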

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

    과정명  교육인원  교육시작일  교육종료일
0   매설배관 관리 실무과정(YB) 1차  12  2023-02-15  2023-02-17
1   측량 및 공간정보 실무과정(GB)_위탁  10  2023-03-13  2023-03-17
2   GNSS 측량과정(GB)  10  2023-04-12  2023-04-14
3   초경량 비행장치 무인멀티콥터 (드론) 운영 능력 향상과정(GB)_위탁  10  2023-04-25  2023-04-28
4   감압 System 진단 및 정비과정(YB)  10  2023-05-10  2023-05-12
5   밸브 분해 정비 실무과정(GB)  10  2023-06-14  2023-06-16
6   CAD 실무 기초과정(YB) 2차(기계/관로)_위탁  10  2023-06-19  2023-06-23
7   매설배관 관리 실무과정(YB) 2차  12  2023-07-19  2023-07-21
8   열교환 System 진단 및 정비과정(YB)  10  2023-08-23  2023-08-25
9   매설배관 관리 집체 교육과정(BB)  10  2023-09-07  2023-09-08

    과정명  교육인원  교육시작일  교육종료일
48  Siemens HMI 프로그래밍 중급과정(GB)_위탁  12  2023-08-28  2023-08-09
49  K-POS 활용과정(GB)  10  2023-09-13  2023-09-15
50  LNG RECEIVING과정(YB)  8  2023-09-19  2023-09-22
51  기화설비 (SCV, HP-ORV)과정(GB)  8  2023-10-18  2023-10-20
52  가스분석기 실무과정 (GB) ['23년 신설과정]  10  2023-10-24  2023-10-27
53  승압설비 LNG펌프(GB) ['23년 신설과정]  8  2023-11-15  2023-11-17
54  유량계 컴퓨터 실무과정(GB) ['23년 신설과정  10  2023-11-29  2023-11-12
55  유틸리티 (공기압축기,차염처리,부취설비)살비 과정(GB) ['23년 신설과정]  8  미정  미정
56  기지분야 계측 제어설비 진단 및 정비 집체 교육과정(BB)  8  미정  미정
57  통신설비정비 집체 교육과정(BB) 2차  10  미정  미정
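
The date columns are stored as text and mix ISO dates with the placeholder 미정 ("undecided"), so they need parsing before any date arithmetic. The sketch below shows one way to do that and to flag rows whose end date precedes the start date. The row tuples are transcribed from the sample rows in this report, and `parse_date` is a hypothetical helper, not something the dataset or the profiling tool provides.

```python
from datetime import date

def parse_date(text):
    """Return a date for ISO 'YYYY-MM-DD' text, or None for the placeholder '미정'."""
    return None if text == "미정" else date.fromisoformat(text)

# (row index, 교육시작일, 교육종료일) copied from the sample rows.
rows = [
    (48, "2023-08-28", "2023-08-09"),
    (49, "2023-09-13", "2023-09-15"),
    (54, "2023-11-29", "2023-11-12"),
    (55, "미정", "미정"),
]

# Collect rows where both dates are known and the end precedes the start.
suspect = []
for idx, start_text, end_text in rows:
    start, end = parse_date(start_text), parse_date(end_text)
    if start is not None and end is not None and end < start:
        suspect.append(idx)

print(suspect)
```

Among the transcribed rows, the check flags rows 48 and 54; whether those are entry errors in the source data is not something the report states.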