Overview

Dataset statistics

Number of variables8
Number of observations116
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.8 KiB
Average record size in memory69.1 B

Variable types

Text1
Categorical2
Numeric3
DateTime2

Dataset

Description인천광역시 인재개발원 내 공무원 교육과정에 대한 정보(과정명, 교육 시작 및 종료일, 교육시간, 정원 등)를 제공합니다.
Author인천광역시
URLhttps://www.data.go.kr/data/15061606/fileData.do

Alerts

개설년도 has constant value ""Constant
데이터기준일자 has constant value ""Constant

Reproduction

Analysis started2023-12-12 02:42:23.655671
Analysis finished2023-12-12 02:42:25.536945
Duration1.88 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct54
Distinct (%)46.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T11:42:25.769544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length9.887931
Min length5

Characters and Unicode

Total characters1147
Distinct characters201
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)19.0%

Sample

1st row신임인재양성과정
2nd row신임인재양성과정
3rd row신임인재양성과정
4th row신임인재양성과정
5th row신임인재양성과정
ValueCountFrequency (%)
실무 16
 
5.7%
역량 15
 
5.4%
승진후보자 11
 
3.9%
핵심가치 11
 
3.9%
신임인재양성과정 9
 
3.2%
균형·창조·소통 8
 
2.9%
공감 6
 
2.2%
up 6
 
2.2%
소통 6
 
2.2%
in 6
 
2.2%
Other values (100) 185
66.3%
2023-12-12T11:42:26.285637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
163
 
14.2%
29
 
2.5%
21
 
1.8%
21
 
1.8%
20
 
1.7%
19
 
1.7%
· 19
 
1.7%
18
 
1.6%
18
 
1.6%
16
 
1.4%
Other values (191) 803
70.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 893
77.9%
Space Separator 163
 
14.2%
Lowercase Letter 25
 
2.2%
Other Punctuation 19
 
1.7%
Decimal Number 15
 
1.3%
Uppercase Letter 12
 
1.0%
Close Punctuation 10
 
0.9%
Open Punctuation 10
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
3.2%
21
 
2.4%
21
 
2.4%
20
 
2.2%
19
 
2.1%
18
 
2.0%
18
 
2.0%
16
 
1.8%
16
 
1.8%
15
 
1.7%
Other values (173) 700
78.4%
Lowercase Letter
ValueCountFrequency (%)
i 6
24.0%
u 6
24.0%
p 6
24.0%
n 6
24.0%
o 1
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
S 4
33.3%
I 3
25.0%
A 2
16.7%
N 2
16.7%
T 1
 
8.3%
Decimal Number
ValueCountFrequency (%)
4 5
33.3%
6 4
26.7%
5 4
26.7%
2 2
 
13.3%
Space Separator
ValueCountFrequency (%)
163
100.0%
Other Punctuation
ValueCountFrequency (%)
· 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 893
77.9%
Common 217
 
18.9%
Latin 37
 
3.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
3.2%
21
 
2.4%
21
 
2.4%
20
 
2.2%
19
 
2.1%
18
 
2.0%
18
 
2.0%
16
 
1.8%
16
 
1.8%
15
 
1.7%
Other values (173) 700
78.4%
Latin
ValueCountFrequency (%)
i 6
16.2%
u 6
16.2%
p 6
16.2%
n 6
16.2%
S 4
10.8%
I 3
8.1%
A 2
 
5.4%
N 2
 
5.4%
o 1
 
2.7%
T 1
 
2.7%
Common
ValueCountFrequency (%)
163
75.1%
· 19
 
8.8%
) 10
 
4.6%
( 10
 
4.6%
4 5
 
2.3%
6 4
 
1.8%
5 4
 
1.8%
2 2
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 893
77.9%
ASCII 235
 
20.5%
None 19
 
1.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
163
69.4%
) 10
 
4.3%
( 10
 
4.3%
i 6
 
2.6%
u 6
 
2.6%
p 6
 
2.6%
n 6
 
2.6%
4 5
 
2.1%
S 4
 
1.7%
6 4
 
1.7%
Other values (7) 15
 
6.4%
Hangul
ValueCountFrequency (%)
29
 
3.2%
21
 
2.4%
21
 
2.4%
20
 
2.2%
19
 
2.1%
18
 
2.0%
18
 
2.0%
16
 
1.8%
16
 
1.8%
15
 
1.7%
Other values (173) 700
78.4%
None
ValueCountFrequency (%)
· 19
100.0%

개설년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023
116 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2023 116
100.0%

Length

2023-12-12T11:42:26.424771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:42:26.524636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 116
100.0%

기수
Real number (ℝ)

Distinct9
Distinct (%)7.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.1982759
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-12T11:42:26.634543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile6
Maximum9
Range8
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.7155039
Coefficient of variation (CV)0.78038607
Kurtosis3.7758829
Mean2.1982759
Median Absolute Deviation (MAD)1
Skewness1.9572613
Sum255
Variance2.9429535
MonotonicityNot monotonic
2023-12-12T11:42:26.828233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
1 54
46.6%
2 32
27.6%
3 11
 
9.5%
4 8
 
6.9%
5 3
 
2.6%
6 3
 
2.6%
7 2
 
1.7%
8 2
 
1.7%
9 1
 
0.9%
ValueCountFrequency (%)
1 54
46.6%
2 32
27.6%
3 11
 
9.5%
4 8
 
6.9%
5 3
 
2.6%
6 3
 
2.6%
7 2
 
1.7%
8 2
 
1.7%
9 1
 
0.9%
ValueCountFrequency (%)
9 1
 
0.9%
8 2
 
1.7%
7 2
 
1.7%
6 3
 
2.6%
5 3
 
2.6%
4 8
 
6.9%
3 11
 
9.5%
2 32
27.6%
1 54
46.6%
Distinct84
Distinct (%)72.4%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
Minimum2023-02-06 00:00:00
Maximum2023-12-11 00:00:00
2023-12-12T11:42:26.976102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:42:27.196260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct83
Distinct (%)71.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
Minimum2023-02-14 00:00:00
Maximum2023-12-15 00:00:00
2023-12-12T11:42:27.364255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:42:27.545728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

교육시간
Real number (ℝ)

Distinct8
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean59.715517
Minimum4
Maximum1442
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-12T11:42:27.657223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile7
Q114
median14
Q321
95-th percentile101
Maximum1442
Range1438
Interquartile range (IQR)7

Descriptive statistics

Standard deviation227.40452
Coefficient of variation (CV)3.8081311
Kurtosis34.442855
Mean59.715517
Median Absolute Deviation (MAD)7
Skewness5.9554735
Sum6927
Variance51712.814
MonotonicityNot monotonic
2023-12-12T11:42:27.772670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
14 56
48.3%
21 33
28.4%
101 9
 
7.8%
7 7
 
6.0%
1442 3
 
2.6%
4 3
 
2.6%
28 3
 
2.6%
35 2
 
1.7%
ValueCountFrequency (%)
4 3
 
2.6%
7 7
 
6.0%
14 56
48.3%
21 33
28.4%
28 3
 
2.6%
35 2
 
1.7%
101 9
 
7.8%
1442 3
 
2.6%
ValueCountFrequency (%)
1442 3
 
2.6%
101 9
 
7.8%
35 2
 
1.7%
28 3
 
2.6%
21 33
28.4%
14 56
48.3%
7 7
 
6.0%
4 3
 
2.6%

정원
Real number (ℝ)

Distinct12
Distinct (%)10.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean41.439655
Minimum10
Maximum170
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-12T11:42:27.909199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile20
Q130
median35
Q340
95-th percentile88
Maximum170
Range160
Interquartile range (IQR)10

Descriptive statistics

Standard deviation26.262981
Coefficient of variation (CV)0.63376446
Kurtosis12.153476
Mean41.439655
Median Absolute Deviation (MAD)5
Skewness3.2260031
Sum4807
Variance689.74415
MonotonicityNot monotonic
2023-12-12T11:42:28.029024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
40 40
34.5%
35 18
15.5%
30 18
15.5%
25 13
 
11.2%
88 9
 
7.8%
20 7
 
6.0%
45 4
 
3.4%
170 2
 
1.7%
15 2
 
1.7%
60 1
 
0.9%
Other values (2) 2
 
1.7%
ValueCountFrequency (%)
10 1
 
0.9%
15 2
 
1.7%
20 7
 
6.0%
25 13
 
11.2%
30 18
15.5%
35 18
15.5%
40 40
34.5%
45 4
 
3.4%
60 1
 
0.9%
88 9
 
7.8%
ValueCountFrequency (%)
170 2
 
1.7%
160 1
 
0.9%
88 9
 
7.8%
60 1
 
0.9%
45 4
 
3.4%
40 40
34.5%
35 18
15.5%
30 18
15.5%
25 13
 
11.2%
20 7
 
6.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-02-15
116 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-02-15
2nd row2023-02-15
3rd row2023-02-15
4th row2023-02-15
5th row2023-02-15

Common Values

ValueCountFrequency (%)
2023-02-15 116
100.0%

Length

2023-12-12T11:42:28.139551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T11:42:28.239381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-02-15 116
100.0%

Interactions

2023-12-12T11:42:24.979796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:42:24.335222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:42:24.657105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:42:25.082932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:42:24.448243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:42:24.759906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:42:25.178537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:42:24.556959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T11:42:24.862027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T11:42:28.303807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과정명기수교육시작일교육종료일교육시간정원
과정명1.0000.0000.9020.7531.0001.000
기수0.0001.0000.0000.0000.0000.000
교육시작일0.9020.0001.0000.9910.3920.000
교육종료일0.7530.0000.9911.0000.0000.000
교육시간1.0000.0000.3920.0001.0000.784
정원1.0000.0000.0000.0000.7841.000
2023-12-12T11:42:28.392830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기수교육시간정원
기수1.0000.2410.256
교육시간0.2411.0000.064
정원0.2560.0641.000

Missing values

2023-12-12T11:42:25.322046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T11:42:25.478305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

과정명개설년도기수교육시작일교육종료일교육시간정원데이터기준일자
0신임인재양성과정202312023-02-062023-02-24101882023-02-15
1신임인재양성과정202322023-03-062023-03-24101882023-02-15
2신임인재양성과정202332023-04-032023-04-21101882023-02-15
3신임인재양성과정202342023-05-012023-05-19101882023-02-15
4신임인재양성과정202352023-06-052023-06-23101882023-02-15
5신임인재양성과정202362023-09-042023-09-22101882023-02-15
6신임인재양성과정202372023-10-022023-10-20101882023-02-15
7신임인재양성과정202382023-10-302023-11-17101882023-02-15
8신임인재양성과정202392023-11-272023-12-15101882023-02-15
9핵심중견간부양성과정202312023-02-062023-12-011442602023-02-15
과정명개설년도기수교육시작일교육종료일교육시간정원데이터기준일자
106행복한 인문학202322023-10-122023-10-1314452023-02-15
107행복한 제2막 인생도약202312023-05-082023-05-1235402023-02-15
108행복한 제2막 인생도약202322023-10-162023-10-2035402023-02-15
109문화가 있는 삶202312023-04-272023-04-2814402023-02-15
110문화가 있는 삶202322023-06-142023-06-1514402023-02-15
111문화가 있는 삶202332023-08-312023-09-0114402023-02-15
112문화가 있는 삶202342023-11-092023-11-1014402023-02-15
113마음 힐링202312023-04-242023-04-2514402023-02-15
114마음 힐링202322023-06-122023-06-1314402023-02-15
115마음 힐링202332023-10-302023-10-3114402023-02-15