Overview

Dataset statistics

Number of variables4
Number of observations80
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory33.6 B

Variable types

Text2
Categorical2

Dataset

Description연도별 인적자원개발 우수기관의 인증 현황- 제공 주요 정보(인증 번호, 신규 또는 재인증 여부, 인증대상기관명, 기업규모)
URLhttps://www.data.go.kr/data/15055718/fileData.do

Alerts

인증번호 has unique valuesUnique
신청기관명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:19:31.781932
Analysis finished2023-12-12 17:19:32.135632
Duration0.35 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

인증번호
Text

UNIQUE 

Distinct80
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size772.0 B
2023-12-13T02:19:32.462869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.8875
Min length7

Characters and Unicode

Total characters631
Distinct characters12
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique80 ?
Unique (%)100.0%

Sample

1st row2021-1호
2nd row2021-2호
3rd row2021-3호
4th row2021-4호
5th row2021-5호
ValueCountFrequency (%)
2021-1호 1
 
1.2%
2021-2호 1
 
1.2%
2021-59호 1
 
1.2%
2021-58호 1
 
1.2%
2021-57호 1
 
1.2%
2021-56호 1
 
1.2%
2021-55호 1
 
1.2%
2021-54호 1
 
1.2%
2021-53호 1
 
1.2%
2021-60호 1
 
1.2%
Other values (70) 70
87.5%
2023-12-13T02:19:32.924696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 178
28.2%
1 98
15.5%
0 88
13.9%
- 80
12.7%
80
12.7%
3 18
 
2.9%
4 18
 
2.9%
5 18
 
2.9%
6 18
 
2.9%
7 18
 
2.9%
Other values (2) 17
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 471
74.6%
Dash Punctuation 80
 
12.7%
Other Letter 80
 
12.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 178
37.8%
1 98
20.8%
0 88
18.7%
3 18
 
3.8%
4 18
 
3.8%
5 18
 
3.8%
6 18
 
3.8%
7 18
 
3.8%
8 9
 
1.9%
9 8
 
1.7%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%
Other Letter
ValueCountFrequency (%)
80
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 551
87.3%
Hangul 80
 
12.7%

Most frequent character per script

Common
ValueCountFrequency (%)
2 178
32.3%
1 98
17.8%
0 88
16.0%
- 80
14.5%
3 18
 
3.3%
4 18
 
3.3%
5 18
 
3.3%
6 18
 
3.3%
7 18
 
3.3%
8 9
 
1.6%
Hangul
ValueCountFrequency (%)
80
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 551
87.3%
Hangul 80
 
12.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 178
32.3%
1 98
17.8%
0 88
16.0%
- 80
14.5%
3 18
 
3.3%
4 18
 
3.3%
5 18
 
3.3%
6 18
 
3.3%
7 18
 
3.3%
8 9
 
1.6%
Hangul
ValueCountFrequency (%)
80
100.0%

신청기관명
Text

UNIQUE 

Distinct80
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size772.0 B
2023-12-13T02:19:33.154495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length7.2875
Min length3

Characters and Unicode

Total characters583
Distinct characters166
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique80 ?
Unique (%)100.0%

Sample

1st row하나금융티아이
2nd rowSK이노베이션㈜
3rd row㈜네패스
4th row에스케이아이이테크놀로지㈜
5th row㈜에이텍씨앤
ValueCountFrequency (%)
18
 
16.7%
주식회사 7
 
6.5%
하나금융티아이 1
 
0.9%
탑엔지니어링 1
 
0.9%
중앙전력주식회사 1
 
0.9%
골프존 1
 
0.9%
주)휴넷 1
 
0.9%
에이텍티앤 1
 
0.9%
㈜휴넥트 1
 
0.9%
다우기술 1
 
0.9%
Other values (75) 75
69.4%
2023-12-13T02:19:33.567765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52
 
8.9%
30
 
5.1%
29
 
5.0%
22
 
3.8%
20
 
3.4%
19
 
3.3%
18
 
3.1%
18
 
3.1%
15
 
2.6%
11
 
1.9%
Other values (156) 349
59.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 487
83.5%
Other Symbol 52
 
8.9%
Space Separator 29
 
5.0%
Close Punctuation 6
 
1.0%
Uppercase Letter 5
 
0.9%
Open Punctuation 4
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
6.2%
22
 
4.5%
20
 
4.1%
19
 
3.9%
18
 
3.7%
18
 
3.7%
15
 
3.1%
11
 
2.3%
10
 
2.1%
9
 
1.8%
Other values (148) 315
64.7%
Uppercase Letter
ValueCountFrequency (%)
S 2
40.0%
K 1
20.0%
M 1
20.0%
B 1
20.0%
Other Symbol
ValueCountFrequency (%)
52
100.0%
Space Separator
ValueCountFrequency (%)
29
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 539
92.5%
Common 39
 
6.7%
Latin 5
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
9.6%
30
 
5.6%
22
 
4.1%
20
 
3.7%
19
 
3.5%
18
 
3.3%
18
 
3.3%
15
 
2.8%
11
 
2.0%
10
 
1.9%
Other values (149) 324
60.1%
Latin
ValueCountFrequency (%)
S 2
40.0%
K 1
20.0%
M 1
20.0%
B 1
20.0%
Common
ValueCountFrequency (%)
29
74.4%
) 6
 
15.4%
( 4
 
10.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 487
83.5%
None 52
 
8.9%
ASCII 44
 
7.5%

Most frequent character per block

None
ValueCountFrequency (%)
52
100.0%
Hangul
ValueCountFrequency (%)
30
 
6.2%
22
 
4.5%
20
 
4.1%
19
 
3.9%
18
 
3.7%
18
 
3.7%
15
 
3.1%
11
 
2.3%
10
 
2.1%
9
 
1.8%
Other values (148) 315
64.7%
ASCII
ValueCountFrequency (%)
29
65.9%
) 6
 
13.6%
( 4
 
9.1%
S 2
 
4.5%
K 1
 
2.3%
M 1
 
2.3%
B 1
 
2.3%

기업규모
Categorical

Distinct3
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size772.0 B
중소기업
61 
선취업후학습 기업
12 
대기업

Length

Max length9
Median length4
Mean length4.6625
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대기업
2nd row대기업
3rd row대기업
4th row대기업
5th row중소기업

Common Values

ValueCountFrequency (%)
중소기업 61
76.2%
선취업후학습 기업 12
 
15.0%
대기업 7
 
8.8%

Length

2023-12-13T02:19:33.757223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:19:33.883200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중소기업 61
66.3%
선취업후학습 12
 
13.0%
기업 12
 
13.0%
대기업 7
 
7.6%

신청구분
Categorical

Distinct2
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size772.0 B
신규
52 
재인증
28 

Length

Max length3
Median length2
Mean length2.35
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row신규
2nd row신규
3rd row신규
4th row신규
5th row신규

Common Values

ValueCountFrequency (%)
신규 52
65.0%
재인증 28
35.0%

Length

2023-12-13T02:19:34.038761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:19:34.192124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
신규 52
65.0%
재인증 28
35.0%

Correlations

2023-12-13T02:19:34.294982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인증번호신청기관명기업규모신청구분
인증번호1.0001.0001.0001.000
신청기관명1.0001.0001.0001.000
기업규모1.0001.0001.0000.163
신청구분1.0001.0000.1631.000
2023-12-13T02:19:34.415303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기업규모신청구분
기업규모1.0000.266
신청구분0.2661.000
2023-12-13T02:19:34.553133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기업규모신청구분
기업규모1.0000.266
신청구분0.2661.000

Missing values

2023-12-13T02:19:32.023180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:19:32.102141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인증번호신청기관명기업규모신청구분
02021-1호하나금융티아이대기업신규
12021-2호SK이노베이션㈜대기업신규
22021-3호㈜네패스대기업신규
32021-4호에스케이아이이테크놀로지㈜대기업신규
42021-5호㈜에이텍씨앤중소기업신규
52021-6호㈜대진중소기업신규
62021-7호㈜세아씨엔티중소기업신규
72021-8호더화이트커뮤니케이션㈜중소기업신규
82021-9호㈜에이텍에이피중소기업신규
92021-10호시앤피컨설팅주식회사중소기업신규
인증번호신청기관명기업규모신청구분
702021-71호주식회사화인폰중소기업재인증
712021-72호㈜ 럭키산업중소기업재인증
722021-73호주식회사마루에이치알중소기업재인증
732021-74호주식회사씨엠테크중소기업재인증
742021-75호㈜ 에이티씨중소기업재인증
752021-76호㈜ 골프존뉴딘홀딩스중소기업재인증
762021-77호유양기술주식회사중소기업재인증
772021-78호㈜ 원일중소기업재인증
782021-79호한국BMS제약중소기업재인증
792021-80호금강엔지니어링㈜중소기업재인증