Overview

Dataset statistics

Number of variables5
Number of observations89
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.7 KiB
Average record size in memory42.5 B

Variable types

Text2
Categorical2
Numeric1

Dataset

Description국가기술자격 필기시험 CBT 시행에 사용되는 감독관 프로그램, 수험자 프로그램, 점검 프로그램 등의 등록 정보입니다.프로그램 버전, 파일 이름, 파일 크기 등의 정보를 제공합니다.
Author한국산업인력공단
URLhttps://www.data.go.kr/data/15081997/fileData.do

Alerts

파일크기 is highly overall correlated with 종류High correlation
종류 is highly overall correlated with 파일크기High correlation
파일명 has unique valuesUnique
파일크기 has unique valuesUnique

Reproduction

Analysis started2024-04-21 01:55:17.477241
Analysis finished2024-04-21 01:55:19.620390
Duration2.14 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

버전
Text

Distinct45
Distinct (%)50.6%
Missing0
Missing (%)0.0%
Memory size844.0 B
2024-04-21T10:55:19.776660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters623
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)1.1%

Sample

1st row2.2.0.1
2nd row2.0.1.0
3rd row2.0.1.0
4th row2.0.0.9
5th row2.0.0.9
ValueCountFrequency (%)
1.0.0.2 2
 
2.2%
1.0.0.1 2
 
2.2%
0.9.4.3 2
 
2.2%
0.9.4.2 2
 
2.2%
0.9.4.1 2
 
2.2%
0.9.4.0 2
 
2.2%
0.9.3.7 2
 
2.2%
0.9.3.6 2
 
2.2%
0.9.3.5 2
 
2.2%
0.9.3.4 2
 
2.2%
Other values (35) 69
77.5%
2024-04-21T10:55:20.107033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 267
42.9%
0 143
23.0%
2 52
 
8.3%
1 47
 
7.5%
9 44
 
7.1%
3 26
 
4.2%
4 16
 
2.6%
7 8
 
1.3%
6 8
 
1.3%
5 8
 
1.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 356
57.1%
Other Punctuation 267
42.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 143
40.2%
2 52
 
14.6%
1 47
 
13.2%
9 44
 
12.4%
3 26
 
7.3%
4 16
 
4.5%
7 8
 
2.2%
6 8
 
2.2%
5 8
 
2.2%
8 4
 
1.1%
Other Punctuation
ValueCountFrequency (%)
. 267
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 623
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
. 267
42.9%
0 143
23.0%
2 52
 
8.3%
1 47
 
7.5%
9 44
 
7.1%
3 26
 
4.2%
4 16
 
2.6%
7 8
 
1.3%
6 8
 
1.3%
5 8
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 623
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 267
42.9%
0 143
23.0%
2 52
 
8.3%
1 47
 
7.5%
9 44
 
7.1%
3 26
 
4.2%
4 16
 
2.6%
7 8
 
1.3%
6 8
 
1.3%
5 8
 
1.3%

종류
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size844.0 B
수험자 프로그램
44 
감독위원 프로그램
44 
VPN 접속 프로그램
 
1

Length

Max length11
Median length9
Mean length8.5280899
Min length8

Unique

Unique1 ?
Unique (%)1.1%

Sample

1st rowVPN 접속 프로그램
2nd row수험자 프로그램
3rd row감독위원 프로그램
4th row수험자 프로그램
5th row감독위원 프로그램

Common Values

ValueCountFrequency (%)
수험자 프로그램 44
49.4%
감독위원 프로그램 44
49.4%
VPN 접속 프로그램 1
 
1.1%

Length

2024-04-21T10:55:20.245099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:55:20.362644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
프로그램 89
49.7%
수험자 44
24.6%
감독위원 44
24.6%
vpn 1
 
0.6%
접속 1
 
0.6%

파일명
Text

UNIQUE 

Distinct89
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size844.0 B
2024-04-21T10:55:20.552369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length30
Mean length30.460674
Min length27

Characters and Unicode

Total characters2711
Distinct characters42
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)100.0%

Sample

1st rowBLUEMAXCLIENT-Installer.exe
2nd rowHRDCBTViewer_Setup_2.0.1.0.exe
3rd rowHRDCBTManager_Setup_2.0.1.0.exe
4th rowHRDCBTViewer_Setup_2.0.0.9.exe
5th rowHRDCBTManager_Setup_2.0.0.9.exe
ValueCountFrequency (%)
bluemaxclient-installer.exe 1
 
1.1%
hrdcbtmanager_setup_1.0.0.1.exe 1
 
1.1%
hrdcbtmanager_setup_0.9.3.3.exe 1
 
1.1%
hrdcbtmanager_setup_0.9.3.4.exe 1
 
1.1%
hrdcbtviewer_setup_0.9.3.4.exe 1
 
1.1%
hrdcbtviewer_setup_0.9.3.5.exe 1
 
1.1%
hrdcbtmanager_setup_0.9.3.5.exe 1
 
1.1%
hrdcbtmanager_setup_0.9.3.6.exe 1
 
1.1%
hrdcbtviewer_setup_0.9.3.6.exe 1
 
1.1%
hrdcbtviewer_setup_0.9.3.7.exe 1
 
1.1%
Other values (79) 79
88.8%
2024-04-21T10:55:20.901290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 399
 
14.7%
. 353
 
13.0%
_ 176
 
6.5%
0 142
 
5.2%
B 89
 
3.3%
C 89
 
3.3%
T 89
 
3.3%
x 89
 
3.3%
r 89
 
3.3%
t 89
 
3.3%
Other values (32) 1107
40.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1111
41.0%
Uppercase Letter 718
26.5%
Other Punctuation 353
 
13.0%
Decimal Number 352
 
13.0%
Connector Punctuation 176
 
6.5%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
B 89
12.4%
C 89
12.4%
T 89
12.4%
S 88
12.3%
D 88
12.3%
R 88
12.3%
H 88
12.3%
M 45
6.3%
V 44
6.1%
E 2
 
0.3%
Other values (6) 8
 
1.1%
Lowercase Letter
ValueCountFrequency (%)
e 399
35.9%
x 89
 
8.0%
r 89
 
8.0%
t 89
 
8.0%
a 89
 
8.0%
p 88
 
7.9%
u 88
 
7.9%
n 45
 
4.1%
w 44
 
4.0%
g 44
 
4.0%
Other values (3) 47
 
4.2%
Decimal Number
ValueCountFrequency (%)
0 142
40.3%
2 50
 
14.2%
1 46
 
13.1%
9 44
 
12.5%
3 26
 
7.4%
4 16
 
4.5%
7 8
 
2.3%
6 8
 
2.3%
5 8
 
2.3%
8 4
 
1.1%
Other Punctuation
ValueCountFrequency (%)
. 353
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 176
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1829
67.5%
Common 882
32.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 399
21.8%
B 89
 
4.9%
C 89
 
4.9%
T 89
 
4.9%
x 89
 
4.9%
r 89
 
4.9%
t 89
 
4.9%
a 89
 
4.9%
p 88
 
4.8%
u 88
 
4.8%
Other values (19) 631
34.5%
Common
ValueCountFrequency (%)
. 353
40.0%
_ 176
20.0%
0 142
16.1%
2 50
 
5.7%
1 46
 
5.2%
9 44
 
5.0%
3 26
 
2.9%
4 16
 
1.8%
7 8
 
0.9%
6 8
 
0.9%
Other values (3) 13
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2711
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 399
 
14.7%
. 353
 
13.0%
_ 176
 
6.5%
0 142
 
5.2%
B 89
 
3.3%
C 89
 
3.3%
T 89
 
3.3%
x 89
 
3.3%
r 89
 
3.3%
t 89
 
3.3%
Other values (32) 1107
40.8%

파일크기
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct89
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25930143
Minimum15295048
Maximum38350220
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size933.0 B
2024-04-21T10:55:21.040669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15295048
5-th percentile15966858
Q117296666
median18114731
Q335495420
95-th percentile37570289
Maximum38350220
Range23055172
Interquartile range (IQR)18198754

Descriptive statistics

Standard deviation9217341.4
Coefficient of variation (CV)0.3554682
Kurtosis-1.9288957
Mean25930143
Median Absolute Deviation (MAD)2819683
Skewness0.088371711
Sum2.3077828 × 109
Variance8.4959382 × 1013
MonotonicityNot monotonic
2024-04-21T10:55:21.178810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15295048 1
 
1.1%
16976230 1
 
1.1%
34114681 1
 
1.1%
34114454 1
 
1.1%
16975205 1
 
1.1%
16436171 1
 
1.1%
33564152 1
 
1.1%
33572430 1
 
1.1%
16439538 1
 
1.1%
16439851 1
 
1.1%
Other values (79) 79
88.8%
ValueCountFrequency (%)
15295048 1
1.1%
15457655 1
1.1%
15964031 1
1.1%
15965029 1
1.1%
15966588 1
1.1%
15967262 1
1.1%
15967455 1
1.1%
15969334 1
1.1%
15973729 1
1.1%
16313743 1
1.1%
ValueCountFrequency (%)
38350220 1
1.1%
38293692 1
1.1%
37945192 1
1.1%
37917443 1
1.1%
37912762 1
1.1%
37056579 1
1.1%
37054946 1
1.1%
37054842 1
1.1%
37054421 1
1.1%
37048759 1
1.1%

적용일
Categorical

Distinct44
Distinct (%)49.4%
Missing0
Missing (%)0.0%
Memory size844.0 B
2014-10-23
 
4
2018-08-30
 
2
2020-09-10
 
2
2023-07-24
 
2
2023-01-04
 
2
Other values (39)
77 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique1 ?
Unique (%)1.1%

Sample

1st row2022-05-10
2nd row2023-12-26
3rd row2023-12-26
4th row2023-07-24
5th row2023-07-24

Common Values

ValueCountFrequency (%)
2014-10-23 4
 
4.5%
2018-08-30 2
 
2.2%
2020-09-10 2
 
2.2%
2023-07-24 2
 
2.2%
2023-01-04 2
 
2.2%
2022-12-26 2
 
2.2%
2022-08-01 2
 
2.2%
2021-08-30 2
 
2.2%
2021-07-07 2
 
2.2%
2021-03-24 2
 
2.2%
Other values (34) 67
75.3%

Length

2024-04-21T10:55:21.314871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2014-10-23 4
 
4.5%
2018-08-30 2
 
2.2%
2016-08-26 2
 
2.2%
2016-06-26 2
 
2.2%
2018-06-27 2
 
2.2%
2018-03-18 2
 
2.2%
2017-12-29 2
 
2.2%
2017-07-21 2
 
2.2%
2017-06-04 2
 
2.2%
2017-03-13 2
 
2.2%
Other values (34) 67
75.3%

Interactions

2024-04-21T10:55:19.298390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T10:55:21.386204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
버전종류파일명파일크기적용일
버전1.0000.2241.0000.0001.000
종류0.2241.0001.0001.0000.344
파일명1.0001.0001.0001.0001.000
파일크기0.0001.0001.0001.0000.000
적용일1.0000.3441.0000.0001.000
2024-04-21T10:55:21.483743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종류적용일
종류1.0000.108
적용일0.1081.000
2024-04-21T10:55:21.562974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
파일크기종류적용일
파일크기1.0000.6750.000
종류0.6751.0000.108
적용일0.0000.1081.000

Missing values

2024-04-21T10:55:19.468618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T10:55:19.569011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

버전종류파일명파일크기적용일
02.2.0.1VPN 접속 프로그램BLUEMAXCLIENT-Installer.exe152950482022-05-10
12.0.1.0수험자 프로그램HRDCBTViewer_Setup_2.0.1.0.exe163137432023-12-26
22.0.1.0감독위원 프로그램HRDCBTManager_Setup_2.0.1.0.exe383502202023-12-26
32.0.0.9수험자 프로그램HRDCBTViewer_Setup_2.0.0.9.exe165754452023-07-24
42.0.0.9감독위원 프로그램HRDCBTManager_Setup_2.0.0.9.exe379451922023-07-24
52.0.0.8감독위원 프로그램HRDCBTManager_Setup_2.0.0.8.exe379174432023-01-04
62.0.0.8수험자 프로그램HRDCBTViewer_Setup_2.0.0.8.exe165453672023-01-04
72.0.0.7감독위원 프로그램HRDCBTManager_Setup_2.0.0.7.exe379127622022-12-26
82.0.0.7수험자 프로그램HRDCBTViewer_Setup_2.0.0.7.exe165447262022-12-26
92.0.0.6감독위원 프로그램HRDCBTManager_Setup_2.0.0.6.exe382936922022-08-01
버전종류파일명파일크기적용일
790.9.2.4감독위원 프로그램HRDCBTManager_Setup_0.9.2.4.exe315788232014-11-04
800.9.2.4수험자 프로그램HRDCBTViewer_Setup_0.9.2.4.exe159650292014-11-04
810.9.2.3수험자 프로그램HRDCBTViewer_Setup_0.9.2.3.exe159674552014-11-02
820.9.2.3감독위원 프로그램HRDCBTManager_Setup_0.9.2.3.exe315809292014-11-02
830.9.2.2수험자 프로그램HRDCBTViewer_Setup_0.9.2.2.exe159672622014-10-23
840.9.2.2감독위원 프로그램HRDCBTManager_Setup_0.9.2.2.exe315820442014-10-23
850.9.2.1수험자 프로그램HRDCBTViewer_Setup_0.9.2.1.exe159640312014-10-23
860.9.2.1감독위원 프로그램HRDCBTManager_Setup_0.9.2.1.exe315812882014-10-23
870.9.2.0감독위원 프로그램HRDCBTManager_Setup_0.9.2.0.exe309855692014-10-20
880.9.2.0수험자 프로그램HRDCBTViewer_Setup_0.9.2.0.exe154576552014-10-20