Overview

Dataset statistics

Number of variables3
Number of observations165
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)1.2%
Total size in memory4.3 KiB
Average record size in memory26.8 B

Variable types

Categorical1
Text1
Numeric1

Dataset

Description중소벤처기업 재직자의 역량강화를 위해 연수사업을 담당하는 중소벤처기업연수원이 운영하는 과정 중 교재를 포함한 과정의 교재 운영현황입니다. 해당 목록에서 아래의 칼럼명에 해당하는 데이터를 확인해 주십시오.- 칼럼명: 기준년, 과정명, 교재 신청건수
Author중소벤처기업진흥공단
URLhttps://www.data.go.kr/data/15093800/fileData.do

Alerts

기준년 has constant value ""Constant
Dataset has 2 (1.2%) duplicate rowsDuplicates
교재 신청건수 has 97 (58.8%) zerosZeros

Reproduction

Analysis started2023-12-12 08:03:56.446110
Analysis finished2023-12-12 08:03:56.939341
Duration0.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2022
165 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 165
100.0%

Length

2023-12-12T17:03:57.025425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:03:57.139360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 165
100.0%
Distinct159
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-12T17:03:57.366253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length37
Mean length24.848485
Min length12

Characters and Unicode

Total characters4100
Distinct characters333
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique153 ?
Unique (%)92.7%

Sample

1st row(폐강)[2021 경비지도사] 1차 법학개론 기본이론
2nd row(폐강)[2021 회계관리 1급] 세무회계 기본이론
3rd rowAI 초격차 시대! 미래 비즈니스로의 접근 - AI 이론편
4th row(폐강)베이시스 일본어 1
5th row[2022 공인중개사 1차] 부동산학개론 기초이론
ValueCountFrequency (%)
2022 54
 
6.6%
공인중개사 28
 
3.4%
기본이론 24
 
2.9%
1 23
 
2.8%
2차 21
 
2.6%
폐강)[2021 18
 
2.2%
16
 
2.0%
일본어 14
 
1.7%
1차 13
 
1.6%
문제풀이 12
 
1.5%
Other values (323) 595
72.7%
2023-12-12T17:03:57.846741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
654
 
16.0%
2 247
 
6.0%
106
 
2.6%
0 103
 
2.5%
( 98
 
2.4%
) 98
 
2.4%
[ 91
 
2.2%
] 91
 
2.2%
87
 
2.1%
79
 
1.9%
Other values (323) 2446
59.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2167
52.9%
Space Separator 654
 
16.0%
Decimal Number 462
 
11.3%
Lowercase Letter 209
 
5.1%
Open Punctuation 189
 
4.6%
Close Punctuation 189
 
4.6%
Uppercase Letter 157
 
3.8%
Other Punctuation 39
 
1.0%
Math Symbol 15
 
0.4%
Dash Punctuation 14
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
106
 
4.9%
87
 
4.0%
79
 
3.6%
66
 
3.0%
61
 
2.8%
58
 
2.7%
58
 
2.7%
46
 
2.1%
42
 
1.9%
41
 
1.9%
Other values (258) 1523
70.3%
Uppercase Letter
ValueCountFrequency (%)
E 19
12.1%
A 16
 
10.2%
D 11
 
7.0%
G 10
 
6.4%
L 10
 
6.4%
C 10
 
6.4%
P 10
 
6.4%
I 9
 
5.7%
O 8
 
5.1%
B 8
 
5.1%
Other values (13) 46
29.3%
Lowercase Letter
ValueCountFrequency (%)
i 28
13.4%
n 23
11.0%
e 22
10.5%
r 18
8.6%
d 18
8.6%
t 16
7.7%
s 15
7.2%
a 14
 
6.7%
o 12
 
5.7%
c 7
 
3.3%
Other values (9) 36
17.2%
Decimal Number
ValueCountFrequency (%)
2 247
53.5%
0 103
22.3%
1 79
 
17.1%
3 15
 
3.2%
9 8
 
1.7%
4 4
 
0.9%
7 2
 
0.4%
8 2
 
0.4%
6 2
 
0.4%
Other Punctuation
ValueCountFrequency (%)
, 16
41.0%
/ 13
33.3%
& 6
 
15.4%
! 3
 
7.7%
. 1
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 98
51.9%
[ 91
48.1%
Close Punctuation
ValueCountFrequency (%)
) 98
51.9%
] 91
48.1%
Math Symbol
ValueCountFrequency (%)
+ 13
86.7%
~ 2
 
13.3%
Space Separator
ValueCountFrequency (%)
654
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2167
52.9%
Common 1567
38.2%
Latin 366
 
8.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
106
 
4.9%
87
 
4.0%
79
 
3.6%
66
 
3.0%
61
 
2.8%
58
 
2.7%
58
 
2.7%
46
 
2.1%
42
 
1.9%
41
 
1.9%
Other values (258) 1523
70.3%
Latin
ValueCountFrequency (%)
i 28
 
7.7%
n 23
 
6.3%
e 22
 
6.0%
E 19
 
5.2%
r 18
 
4.9%
d 18
 
4.9%
t 16
 
4.4%
A 16
 
4.4%
s 15
 
4.1%
a 14
 
3.8%
Other values (32) 177
48.4%
Common
ValueCountFrequency (%)
654
41.7%
2 247
 
15.8%
0 103
 
6.6%
( 98
 
6.3%
) 98
 
6.3%
[ 91
 
5.8%
] 91
 
5.8%
1 79
 
5.0%
, 16
 
1.0%
3 15
 
1.0%
Other values (13) 75
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2167
52.9%
ASCII 1933
47.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
654
33.8%
2 247
 
12.8%
0 103
 
5.3%
( 98
 
5.1%
) 98
 
5.1%
[ 91
 
4.7%
] 91
 
4.7%
1 79
 
4.1%
i 28
 
1.4%
n 23
 
1.2%
Other values (55) 421
21.8%
Hangul
ValueCountFrequency (%)
106
 
4.9%
87
 
4.0%
79
 
3.6%
66
 
3.0%
61
 
2.8%
58
 
2.7%
58
 
2.7%
46
 
2.1%
42
 
1.9%
41
 
1.9%
Other values (258) 1523
70.3%

교재 신청건수
Real number (ℝ)

ZEROS 

Distinct21
Distinct (%)12.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0060606
Minimum0
Maximum36
Zeros97
Zeros (%)58.8%
Negative0
Negative (%)0.0%
Memory size1.6 KiB
2023-12-12T17:03:57.983326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q34
95-th percentile12.8
Maximum36
Range36
Interquartile range (IQR)4

Descriptive statistics

Standard deviation5.593438
Coefficient of variation (CV)1.8607203
Kurtosis9.8239163
Mean3.0060606
Median Absolute Deviation (MAD)0
Skewness2.8031058
Sum496
Variance31.286548
MonotonicityNot monotonic
2023-12-12T17:03:58.114972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
0 97
58.8%
2 9
 
5.5%
3 8
 
4.8%
4 8
 
4.8%
1 7
 
4.2%
6 6
 
3.6%
9 5
 
3.0%
10 5
 
3.0%
5 3
 
1.8%
11 3
 
1.8%
Other values (11) 14
 
8.5%
ValueCountFrequency (%)
0 97
58.8%
1 7
 
4.2%
2 9
 
5.5%
3 8
 
4.8%
4 8
 
4.8%
5 3
 
1.8%
6 6
 
3.6%
7 1
 
0.6%
8 2
 
1.2%
9 5
 
3.0%
ValueCountFrequency (%)
36 1
 
0.6%
26 1
 
0.6%
22 2
1.2%
21 1
 
0.6%
18 1
 
0.6%
16 1
 
0.6%
15 1
 
0.6%
13 1
 
0.6%
12 2
1.2%
11 3
1.8%

Interactions

2023-12-12T17:03:56.625342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T17:03:56.795174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:03:56.897831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준년과정명교재 신청건수
02022(폐강)[2021 경비지도사] 1차 법학개론 기본이론0
12022(폐강)[2021 회계관리 1급] 세무회계 기본이론0
22022AI 초격차 시대! 미래 비즈니스로의 접근 - AI 이론편3
32022(폐강)베이시스 일본어 16
42022[2022 공인중개사 1차] 부동산학개론 기초이론0
52022[2022 공인중개사 2차] 부동산공법 기초이론0
62022(폐강)[2021 공인중개사 1차] 부동산학개론 기출공략 & 핵심정리0
72022[2022 경비지도사] 2차 경비업법 기본이론0
82022IoT(사물인터넷) 전문가 되기(아두이노)0
92022티엔티엔 중국어 초급회화 2 (상)4
기준년과정명교재 신청건수
1552022(폐강)일단 합격! JLPT N2 실전모의고사(1/2)2
1562022[2022 공인중개사 2차] 부동산세법 기본이론0
1572022(폐강)[2021 공인중개사 2차] 부동산공법 문제풀이0
1582022[2022 재경관리사] 세무회계 기본이론0
1592022직장인 휴테크, 스마트한 여행의 기술12
1602022일상 드로잉, 나를 찾는 그림 그리기0
1612022한 달 만에 끝내는 재무제표분석0
1622022실전에 바로 쓰는 ZOOM 비즈니스 중국어회화 step10
1632022월터의 낫(not)뻔한 영어 (1)0
1642022[2022 공인중개사 1차] 민법 및 민사특별법 문제풀이0

Duplicate rows

Most frequently occurring

기준년과정명교재 신청건수# duplicates
02022[2022 회계관리 1급] 세무회계 기본이론02
12022[2022 회계관리 1급] 재무회계 기본이론02