Overview

Dataset statistics

Number of variables4
Number of observations600
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory18.9 KiB
Average record size in memory32.2 B

Variable types

Categorical3
Text1

Dataset

Description한국산업인력공단에서 시행하는 국가기술자격(등급별) 및 국가전문자격 현황에 대한 종목의 리스트를 제공합니다.
URLhttps://www.data.go.kr/data/15082998/fileData.do

Alerts

계열명 is highly overall correlated with 자격구분코드 and 1 other fieldsHigh correlation
자격구분명 is highly overall correlated with 자격구분코드 and 1 other fieldsHigh correlation
자격구분코드 is highly overall correlated with 자격구분명 and 1 other fieldsHigh correlation
종목명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:26:12.486006
Analysis finished2023-12-12 20:26:12.959715
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

자격구분코드
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
T
496 
S
104 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowS
2nd rowS
3rd rowS
4th rowS
5th rowS

Common Values

ValueCountFrequency (%)
T 496
82.7%
S 104
 
17.3%

Length

2023-12-13T05:26:13.027505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:26:13.138082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
t 496
82.7%
s 104
 
17.3%

자격구분명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
국가기술자격
496 
국가전문자격
104 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국가전문자격
2nd row국가전문자격
3rd row국가전문자격
4th row국가전문자격
5th row국가전문자격

Common Values

ValueCountFrequency (%)
국가기술자격 496
82.7%
국가전문자격 104
 
17.3%

Length

2023-12-13T05:26:13.277140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:26:13.386546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국가기술자격 496
82.7%
국가전문자격 104
 
17.3%

계열명
Categorical

HIGH CORRELATION 

Distinct45
Distinct (%)7.5%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
기사
235 
기능사
154 
기술사
79 
기능장
28 
문화재수리기능자(24종목)
24 
Other values (40)
80 

Length

Max length16
Median length14
Mean length3.495
Min length2

Unique

Unique28 ?
Unique (%)4.7%

Sample

1st row세무사
2nd row관세사
3rd row관광통역안내사
4th row국내여행안내사
5th row호텔경영사

Common Values

ValueCountFrequency (%)
기사 235
39.2%
기능사 154
25.7%
기술사 79
 
13.2%
기능장 28
 
4.7%
문화재수리기능자(24종목) 24
 
4.0%
관광통역안내사 12
 
2.0%
문화재수리기술자 6
 
1.0%
경매사 6
 
1.0%
경영지도사 5
 
0.8%
산업안전지도사 4
 
0.7%
Other values (35) 47
 
7.8%

Length

2023-12-13T05:26:13.517971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
기사 235
38.8%
기능사 154
25.4%
기술사 79
 
13.0%
기능장 28
 
4.6%
문화재수리기능자(24종목 24
 
4.0%
관광통역안내사 12
 
2.0%
문화재수리기술자 6
 
1.0%
경매사 6
 
1.0%
경영지도사 5
 
0.8%
산업안전지도사 4
 
0.7%
Other values (41) 53
 
8.7%

종목명
Text

UNIQUE 

Distinct600
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2023-12-13T05:26:13.789062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length7.8966667
Min length3

Characters and Unicode

Total characters4738
Distinct characters291
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique600 ?
Unique (%)100.0%

Sample

1st row세무사
2nd row관세사
3rd row관광통역안내사(영어)
4th row국내여행안내사
5th row호텔경영사
ValueCountFrequency (%)
1급 3
 
0.5%
청소년상담사 3
 
0.5%
청소년지도사 3
 
0.5%
3급 2
 
0.3%
2급 2
 
0.3%
세무사 1
 
0.2%
상하수도기술사 1
 
0.2%
전자기사 1
 
0.2%
비파괴검사기술사 1
 
0.2%
산업위생관리기술사 1
 
0.2%
Other values (590) 590
97.0%
2023-12-13T05:26:14.514621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
601
 
12.7%
572
 
12.1%
199
 
4.2%
161
 
3.4%
141
 
3.0%
101
 
2.1%
91
 
1.9%
85
 
1.8%
83
 
1.8%
) 78
 
1.6%
Other values (281) 2626
55.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4546
95.9%
Close Punctuation 78
 
1.6%
Open Punctuation 78
 
1.6%
Decimal Number 25
 
0.5%
Space Separator 8
 
0.2%
Uppercase Letter 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
601
 
13.2%
572
 
12.6%
199
 
4.4%
161
 
3.5%
141
 
3.1%
101
 
2.2%
91
 
2.0%
85
 
1.9%
83
 
1.8%
76
 
1.7%
Other values (272) 2436
53.6%
Decimal Number
ValueCountFrequency (%)
1 11
44.0%
2 8
32.0%
3 5
20.0%
7 1
 
4.0%
Close Punctuation
ValueCountFrequency (%)
) 78
100.0%
Open Punctuation
ValueCountFrequency (%)
( 78
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%
Uppercase Letter
ValueCountFrequency (%)
D 2
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4546
95.9%
Common 190
 
4.0%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
601
 
13.2%
572
 
12.6%
199
 
4.4%
161
 
3.5%
141
 
3.1%
101
 
2.2%
91
 
2.0%
85
 
1.9%
83
 
1.8%
76
 
1.7%
Other values (272) 2436
53.6%
Common
ValueCountFrequency (%)
) 78
41.1%
( 78
41.1%
1 11
 
5.8%
8
 
4.2%
2 8
 
4.2%
3 5
 
2.6%
7 1
 
0.5%
/ 1
 
0.5%
Latin
ValueCountFrequency (%)
D 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4546
95.9%
ASCII 192
 
4.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
601
 
13.2%
572
 
12.6%
199
 
4.4%
161
 
3.5%
141
 
3.1%
101
 
2.2%
91
 
2.0%
85
 
1.9%
83
 
1.8%
76
 
1.7%
Other values (272) 2436
53.6%
ASCII
ValueCountFrequency (%)
) 78
40.6%
( 78
40.6%
1 11
 
5.7%
8
 
4.2%
2 8
 
4.2%
3 5
 
2.6%
D 2
 
1.0%
7 1
 
0.5%
/ 1
 
0.5%

Correlations

2023-12-13T05:26:14.604959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자격구분코드자격구분명계열명
자격구분코드1.0001.0001.000
자격구분명1.0001.0001.000
계열명1.0001.0001.000
2023-12-13T05:26:14.686921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계열명자격구분명자격구분코드
계열명1.0000.9630.963
자격구분명0.9631.0000.994
자격구분코드0.9630.9941.000
2023-12-13T05:26:14.801576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자격구분코드자격구분명계열명
자격구분코드1.0000.9940.963
자격구분명0.9941.0000.963
계열명0.9630.9631.000

Missing values

2023-12-13T05:26:12.829100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:26:12.924930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

자격구분코드자격구분명계열명종목명
0S국가전문자격세무사세무사
1S국가전문자격관세사관세사
2S국가전문자격관광통역안내사관광통역안내사(영어)
3S국가전문자격국내여행안내사국내여행안내사
4S국가전문자격호텔경영사호텔경영사
5S국가전문자격호텔관리사호텔관리사
6S국가전문자격관광통역안내사관광통역안내사(불어)
7S국가전문자격정수시설운영관리사정수시설운영관리사3급
8S국가전문자격정수시설운영관리사정수시설운영관리사2급
9S국가전문자격정수시설운영관리사정수시설운영관리사1급
자격구분코드자격구분명계열명종목명
590T국가기술자격기사제과산업기사
591T국가기술자격기사버섯산업기사
592T국가기술자격기사농작업안전보건기사
593T국가기술자격기사신발산업기사
594T국가기술자격기사로봇하드웨어개발기사
595T국가기술자격기사가구제작산업기사
596T국가기술자격기사보석디자인산업기사
597T국가기술자격기능장잠수기능장
598T국가기술자격기사환경위해관리기사
599T국가기술자격기사로봇기구개발기사