Overview

Dataset statistics

Number of variables7
Number of observations147
Missing cells7
Missing cells (%)0.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.2 KiB
Average record size in memory56.9 B

Variable types

Text4
Boolean1
Categorical1
DateTime1

Dataset

Description교육시스템_공통_마스터에 대한 데이터로 마스터 코드, 마스터명, 컬럼아이디, 사용여부, 사용구분 등에 대한 항목들을 제공합니다.
Author소상공인시장진흥공단
URLhttps://www.data.go.kr/data/15100665/fileData.do

Alerts

사용여부 has constant value ""Constant
컬럼아이디(사용되는 컬럼명값) has 7 (4.8%) missing valuesMissing
마스터코드 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:18:34.276035
Analysis finished2023-12-12 14:18:34.762796
Duration0.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

마스터코드
Text

UNIQUE 

Distinct147
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T23:18:35.083169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length4.6054422
Min length2

Characters and Unicode

Total characters677
Distinct characters32
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique147 ?
Unique (%)100.0%

Sample

1st rowBIZ105
2nd rowBIZ106
3rd rowTMP001
4th rowFE
5th rowUT
ValueCountFrequency (%)
biz105 1
 
0.7%
biz065 1
 
0.7%
biz104 1
 
0.7%
biz021 1
 
0.7%
biz022 1
 
0.7%
biz099 1
 
0.7%
biz100 1
 
0.7%
biz039 1
 
0.7%
biz103 1
 
0.7%
biz076 1
 
0.7%
Other values (137) 137
93.2%
2023-12-12T23:18:35.541406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
B 98
14.5%
I 97
14.3%
Z 94
13.9%
0 77
11.4%
1 67
9.9%
2 29
 
4.3%
9 19
 
2.8%
3 19
 
2.8%
6 17
 
2.5%
4 16
 
2.4%
Other values (22) 144
21.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 392
57.9%
Decimal Number 285
42.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
B 98
25.0%
I 97
24.7%
Z 94
24.0%
C 14
 
3.6%
T 11
 
2.8%
A 11
 
2.8%
S 10
 
2.6%
P 9
 
2.3%
F 7
 
1.8%
J 5
 
1.3%
Other values (12) 36
 
9.2%
Decimal Number
ValueCountFrequency (%)
0 77
27.0%
1 67
23.5%
2 29
 
10.2%
9 19
 
6.7%
3 19
 
6.7%
6 17
 
6.0%
4 16
 
5.6%
7 15
 
5.3%
5 13
 
4.6%
8 13
 
4.6%

Most occurring scripts

ValueCountFrequency (%)
Latin 392
57.9%
Common 285
42.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
B 98
25.0%
I 97
24.7%
Z 94
24.0%
C 14
 
3.6%
T 11
 
2.8%
A 11
 
2.8%
S 10
 
2.6%
P 9
 
2.3%
F 7
 
1.8%
J 5
 
1.3%
Other values (12) 36
 
9.2%
Common
ValueCountFrequency (%)
0 77
27.0%
1 67
23.5%
2 29
 
10.2%
9 19
 
6.7%
3 19
 
6.7%
6 17
 
6.0%
4 16
 
5.6%
7 15
 
5.3%
5 13
 
4.6%
8 13
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 677
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
B 98
14.5%
I 97
14.3%
Z 94
13.9%
0 77
11.4%
1 67
9.9%
2 29
 
4.3%
9 19
 
2.8%
3 19
 
2.8%
6 17
 
2.5%
4 16
 
2.4%
Other values (22) 144
21.3%
Distinct140
Distinct (%)95.2%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T23:18:35.801408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length8.1020408
Min length4

Characters and Unicode

Total characters1191
Distinct characters201
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique137 ?
Unique (%)93.2%

Sample

1st row수정보완상태코드
2nd row신청상태코드
3rd row회원조회 URL
4th row첨부파일 등록가능 형식
5th row사용자 구분코드
ValueCountFrequency (%)
코드 13
 
5.7%
소공인특화지원 12
 
5.2%
제출구분코드 6
 
2.6%
구분코드 4
 
1.7%
수혜자 4
 
1.7%
구분 3
 
1.3%
작업환경개선사업 3
 
1.3%
유형 3
 
1.3%
강사 2
 
0.9%
게시판 2
 
0.9%
Other values (167) 178
77.4%
2023-12-12T23:18:36.169504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
111
 
9.3%
111
 
9.3%
83
 
7.0%
54
 
4.5%
40
 
3.4%
26
 
2.2%
24
 
2.0%
23
 
1.9%
20
 
1.7%
18
 
1.5%
Other values (191) 681
57.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1078
90.5%
Space Separator 83
 
7.0%
Uppercase Letter 13
 
1.1%
Connector Punctuation 10
 
0.8%
Decimal Number 3
 
0.3%
Close Punctuation 2
 
0.2%
Open Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
111
 
10.3%
111
 
10.3%
54
 
5.0%
40
 
3.7%
26
 
2.4%
24
 
2.2%
23
 
2.1%
20
 
1.9%
18
 
1.7%
18
 
1.7%
Other values (177) 633
58.7%
Uppercase Letter
ValueCountFrequency (%)
S 4
30.8%
M 3
23.1%
L 1
 
7.7%
U 1
 
7.7%
T 1
 
7.7%
R 1
 
7.7%
P 1
 
7.7%
I 1
 
7.7%
Decimal Number
ValueCountFrequency (%)
1 2
66.7%
2 1
33.3%
Space Separator
ValueCountFrequency (%)
83
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1078
90.5%
Common 100
 
8.4%
Latin 13
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
111
 
10.3%
111
 
10.3%
54
 
5.0%
40
 
3.7%
26
 
2.4%
24
 
2.2%
23
 
2.1%
20
 
1.9%
18
 
1.7%
18
 
1.7%
Other values (177) 633
58.7%
Latin
ValueCountFrequency (%)
S 4
30.8%
M 3
23.1%
L 1
 
7.7%
U 1
 
7.7%
T 1
 
7.7%
R 1
 
7.7%
P 1
 
7.7%
I 1
 
7.7%
Common
ValueCountFrequency (%)
83
83.0%
_ 10
 
10.0%
1 2
 
2.0%
) 2
 
2.0%
( 2
 
2.0%
2 1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1078
90.5%
ASCII 113
 
9.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
111
 
10.3%
111
 
10.3%
54
 
5.0%
40
 
3.7%
26
 
2.4%
24
 
2.2%
23
 
2.1%
20
 
1.9%
18
 
1.7%
18
 
1.7%
Other values (177) 633
58.7%
ASCII
ValueCountFrequency (%)
83
73.5%
_ 10
 
8.8%
S 4
 
3.5%
M 3
 
2.7%
1 2
 
1.8%
) 2
 
1.8%
( 2
 
1.8%
L 1
 
0.9%
2 1
 
0.9%
U 1
 
0.9%
Other values (4) 4
 
3.5%
Distinct120
Distinct (%)85.7%
Missing7
Missing (%)4.8%
Memory size1.3 KiB
2023-12-12T23:18:36.429680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length18
Mean length13.492857
Min length8

Characters and Unicode

Total characters1889
Distinct characters29
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique110 ?
Unique (%)78.6%

Sample

1st rowCORR_CODE
2nd rowREQST_URL
3rd rowFILE_EXTSN
4th rowUSER_TYPE_CODE
5th rowCORR_CODE
ValueCountFrequency (%)
presentn_se_code 7
 
5.0%
status_code 4
 
2.9%
corr_code 4
 
2.9%
punsh_resn_code 3
 
2.1%
rcrit_realm_code 2
 
1.4%
mt_cate_code 2
 
1.4%
qustnr_se_code 2
 
1.4%
mt_ctype_code 2
 
1.4%
reqst_code 2
 
1.4%
cate_code 2
 
1.4%
Other values (110) 110
78.6%
2023-12-12T23:18:36.947847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 277
14.7%
_ 264
14.0%
C 195
10.3%
T 176
9.3%
O 165
8.7%
D 156
8.3%
S 112
 
5.9%
R 82
 
4.3%
N 79
 
4.2%
M 60
 
3.2%
Other values (19) 323
17.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 1623
85.9%
Connector Punctuation 264
 
14.0%
Space Separator 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E 277
17.1%
C 195
12.0%
T 176
10.8%
O 165
10.2%
D 156
9.6%
S 112
6.9%
R 82
 
5.1%
N 79
 
4.9%
M 60
 
3.7%
A 54
 
3.3%
Other values (16) 267
16.5%
Connector Punctuation
ValueCountFrequency (%)
_ 264
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1623
85.9%
Common 266
 
14.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 277
17.1%
C 195
12.0%
T 176
10.8%
O 165
10.2%
D 156
9.6%
S 112
6.9%
R 82
 
5.1%
N 79
 
4.9%
M 60
 
3.7%
A 54
 
3.3%
Other values (16) 267
16.5%
Common
ValueCountFrequency (%)
_ 264
99.2%
1
 
0.4%
. 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1889
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E 277
14.7%
_ 264
14.0%
C 195
10.3%
T 176
9.3%
O 165
8.7%
D 156
8.3%
S 112
 
5.9%
R 82
 
4.3%
N 79
 
4.2%
M 60
 
3.2%
Other values (19) 323
17.1%

사용여부
Boolean

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size279.0 B
True
147 
ValueCountFrequency (%)
True 147
100.0%
2023-12-12T23:18:37.128218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
U
116 
S
31 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowU
2nd rowU
3rd rowU
4th rowU
5th rowU

Common Values

ValueCountFrequency (%)
U 116
78.9%
S 31
 
21.1%

Length

2023-12-12T23:18:37.227977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:18:37.349145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
u 116
78.9%
s 31
 
21.1%

비고
Text

Distinct140
Distinct (%)95.2%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T23:18:37.593977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length328
Median length30
Mean length12.272109
Min length2

Characters and Unicode

Total characters1804
Distinct characters254
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)91.8%

Sample

1st row수정보완상태코드
2nd row신청상태코드
3rd row회원조회 URL (마당 회원 조회 임시 페이지 URL용
4th row첨부파일 등록가능 형식
5th row사용자 구분값
ValueCountFrequency (%)
소공인특화지원 13
 
4.0%
12
 
3.7%
c 9
 
2.8%
코드 9
 
2.8%
사용 6
 
1.8%
b 6
 
1.8%
제출구분코드 6
 
1.8%
시스템 4
 
1.2%
수혜자 4
 
1.2%
구분 3
 
0.9%
Other values (222) 255
78.0%
2023-12-12T23:18:38.407081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
359
 
19.9%
99
 
5.5%
96
 
5.3%
49
 
2.7%
37
 
2.1%
31
 
1.7%
29
 
1.6%
26
 
1.4%
24
 
1.3%
24
 
1.3%
Other values (244) 1030
57.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1307
72.5%
Space Separator 359
 
19.9%
Uppercase Letter 46
 
2.5%
Other Punctuation 20
 
1.1%
Connector Punctuation 19
 
1.1%
Lowercase Letter 17
 
0.9%
Open Punctuation 12
 
0.7%
Close Punctuation 11
 
0.6%
Dash Punctuation 7
 
0.4%
Decimal Number 6
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
99
 
7.6%
96
 
7.3%
49
 
3.7%
37
 
2.8%
31
 
2.4%
29
 
2.2%
26
 
2.0%
24
 
1.8%
24
 
1.8%
24
 
1.8%
Other values (208) 868
66.4%
Uppercase Letter
ValueCountFrequency (%)
C 9
19.6%
B 6
13.0%
D 5
10.9%
R 4
8.7%
A 3
 
6.5%
E 3
 
6.5%
T 2
 
4.3%
S 2
 
4.3%
N 2
 
4.3%
U 2
 
4.3%
Other values (6) 8
17.4%
Lowercase Letter
ValueCountFrequency (%)
p 3
17.6%
t 3
17.6%
h 3
17.6%
m 2
11.8%
e 2
11.8%
d 2
11.8%
c 1
 
5.9%
l 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
1 3
50.0%
2 2
33.3%
4 1
 
16.7%
Other Punctuation
ValueCountFrequency (%)
. 13
65.0%
/ 7
35.0%
Open Punctuation
ValueCountFrequency (%)
( 9
75.0%
[ 3
 
25.0%
Close Punctuation
ValueCountFrequency (%)
) 8
72.7%
] 3
 
27.3%
Space Separator
ValueCountFrequency (%)
359
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1307
72.5%
Common 434
 
24.1%
Latin 63
 
3.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
99
 
7.6%
96
 
7.3%
49
 
3.7%
37
 
2.8%
31
 
2.4%
29
 
2.2%
26
 
2.0%
24
 
1.8%
24
 
1.8%
24
 
1.8%
Other values (208) 868
66.4%
Latin
ValueCountFrequency (%)
C 9
14.3%
B 6
 
9.5%
D 5
 
7.9%
R 4
 
6.3%
p 3
 
4.8%
t 3
 
4.8%
h 3
 
4.8%
A 3
 
4.8%
E 3
 
4.8%
m 2
 
3.2%
Other values (14) 22
34.9%
Common
ValueCountFrequency (%)
359
82.7%
_ 19
 
4.4%
. 13
 
3.0%
( 9
 
2.1%
) 8
 
1.8%
/ 7
 
1.6%
- 7
 
1.6%
1 3
 
0.7%
] 3
 
0.7%
[ 3
 
0.7%
Other values (2) 3
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1307
72.5%
ASCII 497
 
27.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
359
72.2%
_ 19
 
3.8%
. 13
 
2.6%
( 9
 
1.8%
C 9
 
1.8%
) 8
 
1.6%
/ 7
 
1.4%
- 7
 
1.4%
B 6
 
1.2%
D 5
 
1.0%
Other values (26) 55
 
11.1%
Hangul
ValueCountFrequency (%)
99
 
7.6%
96
 
7.3%
49
 
3.7%
37
 
2.8%
31
 
2.4%
29
 
2.2%
26
 
2.0%
24
 
1.8%
24
 
1.8%
24
 
1.8%
Other values (208) 868
66.4%
Distinct22
Distinct (%)15.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
Minimum2019-03-01 00:00:00
Maximum2021-10-13 09:22:00
2023-12-12T23:18:38.570723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:18:38.709048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)

Correlations

2023-12-12T23:18:38.796172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용구분-S(시스템)_U(업무)등록일
사용구분-S(시스템)_U(업무)1.0000.000
등록일0.0001.000

Missing values

2023-12-12T23:18:34.603151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:18:34.713778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

마스터코드마스터명컬럼아이디(사용되는 컬럼명값)사용여부사용구분-S(시스템)_U(업무)비고등록일
0BIZ105수정보완상태코드CORR_CODEYU수정보완상태코드2019-03-01 00:00:00
1BIZ106신청상태코드<NA>YU신청상태코드2019-03-01 00:00:00
2TMP001회원조회 URLREQST_URLYU회원조회 URL (마당 회원 조회 임시 페이지 URL용2021-06-15 15:59:00
3FE첨부파일 등록가능 형식FILE_EXTSNYU첨부파일 등록가능 형식2019-03-01 00:00:00
4UT사용자 구분코드USER_TYPE_CODEYU사용자 구분값2019-03-01 00:00:00
5BIZ112진로상담 담당자 구분CORR_CODEYU진로상담 담당자 구분2019-03-01 00:00:00
6BIZ109제출구분코드PRESENTN_SE_CODEYU제출구분코드 - 워크숍2019-03-01 00:00:00
7BIZ113멘토링신청상태MNTR_REQST_CODEYU멘토링신청상태2019-03-01 00:00:00
8BIZ124소공인특화지원 작업환경개선사업 상태코드STATUS_CODEYU소공인특화지원 작업환경개선사업 상태코드2020-08-10 17:57:00
9BIZ125소공인특화지원 작업환경개선사업 지원항목 대분류STATUS_CODEYU소공인특화지원 작업환경개선사업 지원항목 대분류2020-08-10 17:57:00
마스터코드마스터명컬럼아이디(사용되는 컬럼명값)사용여부사용구분-S(시스템)_U(업무)비고등록일
137BIZ073경력구분코드CAREER_SE_CODEYU경력구분코드2019-03-01 00:00:00
138BIZ092산업세분류코드INDUTY_SE_CODEYU산업세분류코드2019-03-01 00:00:00
139BIZ018결과점수코드RSLT_SCORE_CODEYU현장점검 결과점수코드2019-03-01 00:00:00
140PW비밀번호초기화 기본코드MT_PWD_CLEAR_CODEYS회원2019-03-01 00:00:00
141SF자주묻는질문_분류코드MT_CATE_CODEYS자주묻는질문_분류코드2019-03-01 00:00:00
142BIZ006창업지역코드FNTN_AREA_CODEYU창업지역코드2019-03-01 00:00:00
143BIZ024창업시기코드FNTN_ERA_CODEYU창업시기코드2019-03-01 00:00:00
144BIZ046결재구분코드SANCTN_SE_CODEYU결재구분코드2019-03-01 00:00:00
145BIZ094연령별코드<NA>YU연령별코드2019-03-01 00:00:00
146BIZ121신청상태REQST_CODEYU공모전_신청상태2020-07-27 18:54:00