Overview

Dataset statistics

Number of variables: 6
Number of observations: 559
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 0.2%
Total size in memory: 27.4 KiB
Average record size in memory: 50.2 B
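
The two derived figures in this block can be checked by simple arithmetic. The sketch below is only a sanity check; it assumes that the duplicate percentage is duplicate rows divided by observations, and that the average record size is the total memory size divided by observations, with 1 KiB = 1024 B.

```python
# Figures copied from the "Dataset statistics" block of the report.
n_rows = 559
duplicate_rows = 1
total_size_kib = 27.4

# Assumed definitions: share of duplicate rows, and bytes per observation.
duplicate_pct = 100 * duplicate_rows / n_rows
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Both results round to the values shown in the report (0.2% and 50.2 B).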

Variable types

Categorical: 1
DateTime: 3
Numeric: 2

Dataset

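Description: This dataset shows the training status of skilled workers in the forestry sector. For forest management crews (영림단), it lists the training periods, completion dates, and number of trainees who completed training, by year.
Author: Korea Forest Service (산림청)
URL: https://www.data.go.kr/data/15091322/fileData.do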

Alerts

Dataset has 1 (0.2%) duplicate rows (Duplicates)
교육주수 is highly overall correlated with 교육명 (High correlation)
교육명 is highly overall correlated with 교육주수 (High correlation)

Reproduction

Analysis started: 2023-12-12 12:20:48.467054
Analysis finished: 2023-12-12 12:20:49.463915
Duration: 1 second
Software version: ydata-profiling v4.5.1
Download configuration: config.json
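
A report of this kind is typically generated with a few lines of Python. The following is a minimal sketch, not the configuration actually used here (the original config.json is not reproduced in this document). The function name `build_report`, the file paths, the report title, and the `cp949` encoding are all assumptions for illustration; only the three date column names come from the report itself.

```python
# Date columns as they appear in the report's sample table.
DATE_COLUMNS = ["교육기간(시작일)", "교육기간(종료일)", "이수일자"]


def build_report(csv_path, out_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (hypothetical helper).

    The imports live inside the function so that this sketch can be loaded
    even where pandas and ydata-profiling are not installed.
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parse the three date columns so they are profiled as dates.
    df = pd.read_csv(csv_path, encoding=encoding, parse_dates=DATE_COLUMNS)
    profile = ProfileReport(df, title="Forestry training status")
    profile.to_file(out_path)
    return profile


# Example call (the file name is a placeholder, not the real download name):
# build_report("forestry_training.csv")
```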

Variables

교육명 (training course name)
Categorical

HIGH CORRELATION

Distinct: 34
Distinct (%): 6.1%
Missing: 0
Missing (%): 0.0%
Memory size: 4.5 KiB

민유림영림단기본교육: 98
산림바이오매스수집단교육: 94
숲가꾸기공공근로사업단기술교육: 69
숲가꾸기기술교육: 59
숲가꾸기근로자기술교육: 51
Other values (29): 188

Length

Max length: 19
Median length: 15
Mean length: 10.214669
Min length: 3

Unique

Unique: 8
Unique (%): 1.4%

Sample

1st row: 강원도작업단
2nd row: 기능인영림단과정
3rd row: 기능인영림단과정
4th row: 기능인영림단과정
5th row: 기능인영림단과정

Common Values

Value  Count  Frequency (%)
민유림영림단기본교육  98  17.5%
산림바이오매스수집단교육  94  16.8%
숲가꾸기공공근로사업단기술교육  69  12.3%
숲가꾸기기술교육  59  10.6%
숲가꾸기근로자기술교육  51  9.1%
기능인영림단과정  35  6.3%
영림과정  35  6.3%
산림보호강화사업교육  20  3.6%
숲가꾸기자원조사단교육  13  2.3%
민유림영림단교육  11  2.0%
Other values (24)  74  13.2%
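
The percentages and the "Other values" row follow directly from the counts. The sketch below recomputes them, assuming each frequency is the count divided by the 559 observations and rounded to one decimal place.

```python
# Top-10 counts copied from the Common Values table for 교육명.
top_counts = {
    "민유림영림단기본교육": 98,
    "산림바이오매스수집단교육": 94,
    "숲가꾸기공공근로사업단기술교육": 69,
    "숲가꾸기기술교육": 59,
    "숲가꾸기근로자기술교육": 51,
    "기능인영림단과정": 35,
    "영림과정": 35,
    "산림보호강화사업교육": 20,
    "숲가꾸기자원조사단교육": 13,
    "민유림영림단교육": 11,
}
n_rows = 559

# Share of each listed value, in percent.
shares = {name: round(100 * count / n_rows, 1) for name, count in top_counts.items()}

# Everything not listed individually falls into "Other values".
other_count = n_rows - sum(top_counts.values())
other_share = round(100 * other_count / n_rows, 1)

for name, share in shares.items():
    print(name, top_counts[name], f"{share}%")
print("Other values", other_count, f"{other_share}%")
```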

Length

[Figure: Histogram of lengths of the category]
Value  Count  Frequency (%)
민유림영림단기본교육  98  17.5%
산림바이오매스수집단교육  94  16.8%
숲가꾸기공공근로사업단기술교육  69  12.3%
숲가꾸기기술교육  59  10.5%
숲가꾸기근로자기술교육  51  9.1%
기능인영림단과정  35  6.2%
영림과정  35  6.2%
산림보호강화사업교육  20  3.6%
숲가꾸기자원조사단교육  13  2.3%
민유림영림단교육  11  2.0%
Other values (24)  75  13.4%

교육기간(시작일) (training period, start date)
DateTime

Distinct: 441
Distinct (%): 78.9%
Missing: 0
Missing (%): 0.0%
Memory size: 4.5 KiB
Minimum: 1984-08-13 00:00:00
Maximum: 2015-11-23 00:00:00
[Figure: Histogram with fixed size bins (bins=50)]

교육기간(종료일) (training period, end date)
DateTime

Distinct: 449
Distinct (%): 80.3%
Missing: 0
Missing (%): 0.0%
Memory size: 4.5 KiB
Minimum: 1984-09-08 00:00:00
Maximum: 2015-12-11 00:00:00
[Figure: Histogram with fixed size bins (bins=50)]

이수일자 (completion date)
DateTime

Distinct: 451
Distinct (%): 80.7%
Missing: 0
Missing (%): 0.0%
Memory size: 4.5 KiB
Minimum: 1984-09-08 00:00:00
Maximum: 2015-12-11 00:00:00
[Figure: Histogram with fixed size bins (bins=50)]

교육주수 (training duration in weeks)
Real number (ℝ)

HIGH CORRELATION

Distinct: 6
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.391771
Minimum: 1
Maximum: 7
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
median: 2
Q3: 3
95-th percentile: 6
Maximum: 7
Range: 6
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.2820809
Coefficient of variation (CV): 0.53603831
Kurtosis: 2.4109642
Mean: 2.391771
Median Absolute Deviation (MAD): 1
Skewness: 1.5066629
Sum: 1337
Variance: 1.6437314
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=6)]
Value  Count  Frequency (%)
2  219  39.2%
3  167  29.9%
1  127  22.7%
6  42  7.5%
4  3  0.5%
7  1  0.2%

Value  Count  Frequency (%)
1  127  22.7%
2  219  39.2%
3  167  29.9%
4  3  0.5%
6  42  7.5%
7  1  0.2%

Value  Count  Frequency (%)
7  1  0.2%
6  42  7.5%
4  3  0.5%
3  167  29.9%
2  219  39.2%
1  127  22.7%
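
Because 교육주수 takes only six distinct values, its summary statistics can be rebuilt from the frequency table alone. The sketch below does this with the Python standard library. It assumes the report uses the sample variance (divisor n - 1) and defines the coefficient of variation as standard deviation divided by mean; both assumptions reproduce the reported figures.

```python
import math
import statistics

# Value -> count, copied from the 교육주수 frequency table.
freq = {1: 127, 2: 219, 3: 167, 4: 3, 6: 42, 7: 1}

# Expand the table back into one value per observation.
values = [value for value, count in freq.items() for _ in range(count)]

n = len(values)
total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divisor n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(f"n={n} sum={total} mean={mean:.6f}")
print(f"variance={variance:.7f} std={std:.7f} cv={cv:.8f}")
```

The count (559), sum (1337), mean, variance, standard deviation, and CV all agree with the values listed under Descriptive statistics to the precision shown there.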

이수인원 (number of trainees who completed training)
Real number (ℝ)

Distinct: 139
Distinct (%): 24.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 57.162791
Minimum: 1
Maximum: 234
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 15.9
Q1: 31
median: 46
Q3: 74
95-th percentile: 142.3
Maximum: 234
Range: 233
Interquartile range (IQR): 43

Descriptive statistics

Standard deviation: 38.977435
Coefficient of variation (CV): 0.68186726
Kurtosis: 3.0546694
Mean: 57.162791
Median Absolute Deviation (MAD): 18
Skewness: 1.6450081
Sum: 31954
Variance: 1519.2405
Monotonicity: Not monotonic
[Figure: Histogram with fixed size bins (bins=50)]
Value  Count  Frequency (%)
30  19  3.4%
31  14  2.5%
34  14  2.5%
36  13  2.3%
76  12  2.1%
32  11  2.0%
29  11  2.0%
26  11  2.0%
41  11  2.0%
46  10  1.8%
Other values (129)  433  77.5%

Value  Count  Frequency (%)
1  2  0.4%
5  1  0.2%
6  2  0.4%
7  1  0.2%
8  1  0.2%
9  1  0.2%
10  5  0.9%
11  3  0.5%
12  6  1.1%
14  3  0.5%

Value  Count  Frequency (%)
234  1  0.2%
230  1  0.2%
195  1  0.2%
190  1  0.2%
189  1  0.2%
188  1  0.2%
186  2  0.4%
185  2  0.4%
184  1  0.2%
179  1  0.2%

Interactions

[Figures: interaction plots]

Correlations

          교육명  교육주수  이수인원
교육명  1.000  0.953  0.602
교육주수  0.953  1.000  0.335
이수인원  0.602  0.335  1.000

          교육주수  이수인원  교육명
교육주수  1.000  -0.358  0.777
이수인원  -0.358  1.000  0.253
교육명  0.777  0.253  1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

     교육명  교육기간(시작일)  교육기간(종료일)  이수일자  교육주수  이수인원
0  강원도작업단  1985-03-11  1985-03-23  1985-03-23  2  17
1  기능인영림단과정  2010-01-11  2010-01-29  2010-01-29  3  68
2  기능인영림단과정  2010-01-25  2010-02-12  2010-02-12  3  46
3  기능인영림단과정  2010-02-22  2010-03-12  2010-03-12  3  25
4  기능인영림단과정  2010-08-30  2010-09-17  2010-09-17  3  23
5  기능인영림단과정  2010-10-04  2010-10-22  2010-10-22  3  26
6  기능인영림단과정  2010-10-25  2010-11-12  2010-11-12  3  30
7  기능인영림단과정  2010-11-15  2010-12-03  2010-12-03  3  37
8  기능인영림단과정  2010-12-06  2010-12-24  2010-12-24  3  27
9  기능인영림단과정  2011-01-03  2011-01-28  2011-01-28  3  41

     교육명  교육기간(시작일)  교육기간(종료일)  이수일자  교육주수  이수인원
549  작업단  1985-07-29  1985-08-24  1985-08-24  4  31
550  작업단  1985-10-28  1985-11-09  1985-11-09  2  14
551  작업단  1985-11-25  1985-12-07  1985-12-07  2  10
552  작업단  1986-02-17  1986-03-08  1986-03-08  3  12
553  작업단  1987-11-23  1987-12-12  1987-12-12  3  12
554  작업단  1988-10-24  1988-11-12  1988-11-12  3  12
555  작업단  1988-11-14  1988-11-26  1988-11-26  2  12
556  작업단  1989-04-10  1989-04-15  1989-04-15  1  24
557  전북작업단  1984-08-13  1984-09-08  1984-09-08  4  9
558  전북작업단  1985-07-08  1985-07-27  1985-07-27  4  24
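
The sample rows allow a quick plausibility check of 교육주수 against the training period. The sketch below compares the recorded week count with the number of calendar weeks spanned by the start and end dates. The rounding rule (inclusive day count divided by 7, rounded up) and the helper name `calendar_weeks` are assumptions for illustration; the dataset does not document how 교육주수 was derived.

```python
import math
from datetime import date

# (교육명, start date, end date, recorded 교육주수), copied from four sample rows.
rows = [
    ("강원도작업단", date(1985, 3, 11), date(1985, 3, 23), 2),
    ("기능인영림단과정", date(2010, 1, 11), date(2010, 1, 29), 3),
    ("기능인영림단과정", date(2011, 1, 3), date(2011, 1, 28), 3),
    ("작업단", date(1989, 4, 10), date(1989, 4, 15), 1),
]


def calendar_weeks(start, end):
    """Weeks covered by an inclusive date range, rounded up (assumed rule)."""
    return math.ceil(((end - start).days + 1) / 7)


# Keep only the rows whose recorded weeks differ from the calendar span.
mismatches = [
    (name, start, end, weeks, calendar_weeks(start, end))
    for name, start, end, weeks in rows
    if calendar_weeks(start, end) != weeks
]

for name, start, end, recorded, computed in mismatches:
    print(f"{name} {start} to {end}: recorded {recorded}, calendar {computed}")
```

Under this rule, three of the four rows agree, while the course running from 2011-01-03 to 2011-01-28 spans four calendar weeks but is recorded as three. A discrepancy of this kind may simply mean that 교육주수 reflects the nominal course length rather than the calendar dates.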

Duplicate rows

Most frequently occurring

     교육명  교육기간(시작일)  교육기간(종료일)  이수일자  교육주수  이수인원  # duplicates
0  산림바이오매스수집단교육  2014-03-17  2014-03-28  2014-03-28  3  91  2
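
The overview reports 1 duplicate row, while this table lists 2 under "# duplicates". One reading consistent with both numbers is that the overview counts extra copies and the table counts total occurrences. The sketch below illustrates that reading with the standard library. The split of the fused digits into 교육주수 3, 이수인원 91, and 2 occurrences is an assumption, and the third row is taken from the sample only to provide a non-duplicated record.

```python
from collections import Counter

# Each row: (교육명, start, end, 이수일자, 교육주수, 이수인원).
rows = [
    ("산림바이오매스수집단교육", "2014-03-17", "2014-03-28", "2014-03-28", 3, 91),
    ("산림바이오매스수집단교육", "2014-03-17", "2014-03-28", "2014-03-28", 3, 91),
    ("강원도작업단", "1985-03-11", "1985-03-23", "1985-03-23", 2, 17),
]

# Count how often each complete row occurs.
occurrences = Counter(rows)

# Rows that appear more than once, with their total number of occurrences.
repeated = {row: count for row, count in occurrences.items() if count > 1}

# Extra copies beyond the first occurrence of each repeated row.
duplicate_rows = sum(count - 1 for count in repeated.values())

for row, count in repeated.items():
    print(row[0], "occurrences:", count)
print("duplicate rows:", duplicate_rows)
```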