Overview

Dataset statistics

Number of variables6
Number of observations70
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.8 KiB
Average record size in memory54.9 B

Variable types

Text1
Numeric5

Dataset

Description산림청 산림교육원 2019년 ~ 2023년 교육 연령 현황에 관한 데이터로 연령별 교육 인원수- 집합(대면)교육 기준 (나라배움터 사이버교육 제외)- 연령 정보는 출생연도 기준 (미성년자는 교육기관별 분류)- 연령 정보를 제공받지 않거나 없는 경우 '정보없음'으로 표시
Author산림청 산림교육원
URLhttps://www.data.go.kr/data/15127247/fileData.do

Alerts

2019년 집합교육 is highly overall correlated with 2020년 집합교육 and 3 other fieldsHigh correlation
2020년 집합교육 is highly overall correlated with 2019년 집합교육 and 3 other fieldsHigh correlation
2021년 집합교육 is highly overall correlated with 2019년 집합교육 and 3 other fieldsHigh correlation
2022년 집합교육 is highly overall correlated with 2019년 집합교육 and 3 other fieldsHigh correlation
2023년 집합교육 is highly overall correlated with 2019년 집합교육 and 3 other fieldsHigh correlation
교육 연령 has unique valuesUnique
2019년 집합교육 has 8 (11.4%) zerosZeros
2020년 집합교육 has 15 (21.4%) zerosZeros
2021년 집합교육 has 8 (11.4%) zerosZeros
2022년 집합교육 has 7 (10.0%) zerosZeros
2023년 집합교육 has 9 (12.9%) zerosZeros

Reproduction

Analysis started2024-03-23 05:38:27.883559
Analysis finished2024-03-23 05:38:38.223316
Duration10.34 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

교육 연령
Text

UNIQUE 

Distinct70
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size692.0 B
2024-03-23T05:38:38.765716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length5.8571429
Min length3

Characters and Unicode

Total characters410
Distinct characters25
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)100.0%

Sample

1st row1940년생
2nd row1941년생
3rd row1942년생
4th row1943년생
5th row1944년생
ValueCountFrequency (%)
1940년생 1
 
1.4%
1983년생 1
 
1.4%
1989년생 1
 
1.4%
1988년생 1
 
1.4%
1987년생 1
 
1.4%
1986년생 1
 
1.4%
1985년생 1
 
1.4%
1995년생 1
 
1.4%
1982년생 1
 
1.4%
1991년생 1
 
1.4%
Other values (60) 60
85.7%
2024-03-23T05:38:40.012353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 76
18.5%
68
16.6%
1 67
16.3%
65
15.9%
4 17
 
4.1%
0 17
 
4.1%
5 16
 
3.9%
7 16
 
3.9%
6 16
 
3.9%
8 16
 
3.9%
Other values (15) 36
8.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 260
63.4%
Other Letter 150
36.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
68
45.3%
65
43.3%
4
 
2.7%
2
 
1.3%
1
 
0.7%
1
 
0.7%
1
 
0.7%
1
 
0.7%
1
 
0.7%
1
 
0.7%
Other values (5) 5
 
3.3%
Decimal Number
ValueCountFrequency (%)
9 76
29.2%
1 67
25.8%
4 17
 
6.5%
0 17
 
6.5%
5 16
 
6.2%
7 16
 
6.2%
6 16
 
6.2%
8 16
 
6.2%
2 12
 
4.6%
3 7
 
2.7%

Most occurring scripts

ValueCountFrequency (%)
Common 260
63.4%
Hangul 150
36.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
68
45.3%
65
43.3%
4
 
2.7%
2
 
1.3%
1
 
0.7%
1
 
0.7%
1
 
0.7%
1
 
0.7%
1
 
0.7%
1
 
0.7%
Other values (5) 5
 
3.3%
Common
ValueCountFrequency (%)
9 76
29.2%
1 67
25.8%
4 17
 
6.5%
0 17
 
6.5%
5 16
 
6.2%
7 16
 
6.2%
6 16
 
6.2%
8 16
 
6.2%
2 12
 
4.6%
3 7
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 260
63.4%
Hangul 150
36.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 76
29.2%
1 67
25.8%
4 17
 
6.5%
0 17
 
6.5%
5 16
 
6.2%
7 16
 
6.2%
6 16
 
6.2%
8 16
 
6.2%
2 12
 
4.6%
3 7
 
2.7%
Hangul
ValueCountFrequency (%)
68
45.3%
65
43.3%
4
 
2.7%
2
 
1.3%
1
 
0.7%
1
 
0.7%
1
 
0.7%
1
 
0.7%
1
 
0.7%
1
 
0.7%
Other values (5) 5
 
3.3%

2019년 집합교육
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct49
Distinct (%)70.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean60.971429
Minimum0
Maximum321
Zeros8
Zeros (%)11.4%
Negative0
Negative (%)0.0%
Memory size762.0 B
2024-03-23T05:38:40.434046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q18.75
median64
Q392.5
95-th percentile131.15
Maximum321
Range321
Interquartile range (IQR)83.75

Descriptive statistics

Standard deviation56.467246
Coefficient of variation (CV)0.92612634
Kurtosis5.2871662
Mean60.971429
Median Absolute Deviation (MAD)43.5
Skewness1.5488227
Sum4268
Variance3188.5499
MonotonicityNot monotonic
2024-03-23T05:38:40.856406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
0 8
 
11.4%
4 3
 
4.3%
85 2
 
2.9%
82 2
 
2.9%
1 2
 
2.9%
91 2
 
2.9%
88 2
 
2.9%
113 2
 
2.9%
108 2
 
2.9%
73 2
 
2.9%
Other values (39) 43
61.4%
ValueCountFrequency (%)
0 8
11.4%
1 2
 
2.9%
2 1
 
1.4%
3 2
 
2.9%
4 3
 
4.3%
6 1
 
1.4%
8 1
 
1.4%
11 2
 
2.9%
12 1
 
1.4%
15 1
 
1.4%
ValueCountFrequency (%)
321 1
1.4%
206 1
1.4%
140 1
1.4%
137 1
1.4%
124 1
1.4%
113 2
2.9%
112 1
1.4%
111 1
1.4%
108 2
2.9%
107 1
1.4%

2020년 집합교육
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct37
Distinct (%)52.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68.414286
Minimum0
Maximum2452
Zeros15
Zeros (%)21.4%
Negative0
Negative (%)0.0%
Memory size762.0 B
2024-03-23T05:38:41.326456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median43
Q359.5
95-th percentile81.1
Maximum2452
Range2452
Interquartile range (IQR)58.5

Descriptive statistics

Standard deviation290.57831
Coefficient of variation (CV)4.2473338
Kurtosis68.446674
Mean68.414286
Median Absolute Deviation (MAD)33
Skewness8.2291447
Sum4789
Variance84435.753
MonotonicityNot monotonic
2024-03-23T05:38:41.722171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
0 15
21.4%
1 4
 
5.7%
7 3
 
4.3%
47 3
 
4.3%
51 3
 
4.3%
66 2
 
2.9%
80 2
 
2.9%
54 2
 
2.9%
38 2
 
2.9%
58 2
 
2.9%
Other values (27) 32
45.7%
ValueCountFrequency (%)
0 15
21.4%
1 4
 
5.7%
3 2
 
2.9%
4 1
 
1.4%
6 1
 
1.4%
7 3
 
4.3%
10 2
 
2.9%
12 1
 
1.4%
13 1
 
1.4%
16 1
 
1.4%
ValueCountFrequency (%)
2452 1
1.4%
87 1
1.4%
85 1
1.4%
82 1
1.4%
80 2
2.9%
79 2
2.9%
76 1
1.4%
75 1
1.4%
70 1
1.4%
69 1
1.4%

2021년 집합교육
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct46
Distinct (%)65.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean53.757143
Minimum0
Maximum393
Zeros8
Zeros (%)11.4%
Negative0
Negative (%)0.0%
Memory size762.0 B
2024-03-23T05:38:42.162327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q16.5
median59.5
Q376
95-th percentile111.3
Maximum393
Range393
Interquartile range (IQR)69.5

Descriptive statistics

Standard deviation58.158812
Coefficient of variation (CV)1.0818806
Kurtosis16.093681
Mean53.757143
Median Absolute Deviation (MAD)35
Skewness3.0724322
Sum3763
Variance3382.4474
MonotonicityNot monotonic
2024-03-23T05:38:42.637685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)
ValueCountFrequency (%)
0 8
 
11.4%
1 5
 
7.1%
73 3
 
4.3%
62 3
 
4.3%
76 3
 
4.3%
89 2
 
2.9%
75 2
 
2.9%
2 2
 
2.9%
80 2
 
2.9%
57 2
 
2.9%
Other values (36) 38
54.3%
ValueCountFrequency (%)
0 8
11.4%
1 5
7.1%
2 2
 
2.9%
3 1
 
1.4%
4 1
 
1.4%
6 1
 
1.4%
8 1
 
1.4%
9 1
 
1.4%
13 1
 
1.4%
16 1
 
1.4%
ValueCountFrequency (%)
393 1
1.4%
195 1
1.4%
144 1
1.4%
114 1
1.4%
108 1
1.4%
106 1
1.4%
102 1
1.4%
99 1
1.4%
91 1
1.4%
89 2
2.9%

2022년 집합교육
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct48
Distinct (%)68.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean51.771429
Minimum0
Maximum123
Zeros7
Zeros (%)10.0%
Negative0
Negative (%)0.0%
Memory size762.0 B
2024-03-23T05:38:43.100543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q112.25
median61.5
Q387.75
95-th percentile106.7
Maximum123
Range123
Interquartile range (IQR)75.5

Descriptive statistics

Standard deviation38.772533
Coefficient of variation (CV)0.74891758
Kurtosis-1.4401299
Mean51.771429
Median Absolute Deviation (MAD)34
Skewness0.0032007057
Sum3624
Variance1503.3093
MonotonicityNot monotonic
2024-03-23T05:38:43.690984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
0 7
 
10.0%
96 3
 
4.3%
6 3
 
4.3%
91 3
 
4.3%
1 3
 
4.3%
38 2
 
2.9%
65 2
 
2.9%
80 2
 
2.9%
97 2
 
2.9%
74 2
 
2.9%
Other values (38) 41
58.6%
ValueCountFrequency (%)
0 7
10.0%
1 3
4.3%
2 2
 
2.9%
5 1
 
1.4%
6 3
4.3%
12 2
 
2.9%
13 1
 
1.4%
15 1
 
1.4%
16 1
 
1.4%
22 2
 
2.9%
ValueCountFrequency (%)
123 1
 
1.4%
119 1
 
1.4%
116 1
 
1.4%
113 1
 
1.4%
99 1
 
1.4%
97 2
2.9%
96 3
4.3%
94 1
 
1.4%
92 1
 
1.4%
91 3
4.3%

2023년 집합교육
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct40
Distinct (%)57.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean53.342857
Minimum0
Maximum281
Zeros9
Zeros (%)12.9%
Negative0
Negative (%)0.0%
Memory size762.0 B
2024-03-23T05:38:44.211490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q18
median59.5
Q380.75
95-th percentile107.95
Maximum281
Range281
Interquartile range (IQR)72.75

Descriptive statistics

Standard deviation51.051519
Coefficient of variation (CV)0.95704507
Kurtosis5.2594713
Mean53.342857
Median Absolute Deviation (MAD)38.5
Skewness1.6545067
Sum3734
Variance2606.2576
MonotonicityNot monotonic
2024-03-23T05:38:45.004708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
0 9
 
12.9%
97 4
 
5.7%
1 4
 
5.7%
74 3
 
4.3%
8 2
 
2.9%
7 2
 
2.9%
62 2
 
2.9%
68 2
 
2.9%
3 2
 
2.9%
79 2
 
2.9%
Other values (30) 38
54.3%
ValueCountFrequency (%)
0 9
12.9%
1 4
5.7%
3 2
 
2.9%
7 2
 
2.9%
8 2
 
2.9%
10 1
 
1.4%
16 2
 
2.9%
17 1
 
1.4%
20 2
 
2.9%
21 2
 
2.9%
ValueCountFrequency (%)
281 1
 
1.4%
215 1
 
1.4%
126 1
 
1.4%
112 1
 
1.4%
103 2
2.9%
102 1
 
1.4%
100 1
 
1.4%
98 1
 
1.4%
97 4
5.7%
90 1
 
1.4%

Interactions

2024-03-23T05:38:35.932133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:28.446970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:30.214267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:31.982759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:33.873108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:36.222258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:28.833980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:30.444708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:32.312457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:34.159838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:36.608353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:29.159173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:30.779010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:32.675556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:34.796653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:36.952107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:29.557508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:31.203506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:33.084689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:35.306688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:37.195584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:29.907027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:31.606581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:33.502951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T05:38:35.591116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T05:38:45.321315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교육 연령2019년 집합교육2020년 집합교육2021년 집합교육2022년 집합교육2023년 집합교육
교육 연령1.0001.0001.0001.0001.0001.000
2019년 집합교육1.0001.0001.0000.7600.7010.885
2020년 집합교육1.0001.0001.0001.0000.4631.000
2021년 집합교육1.0000.7601.0001.0000.7500.760
2022년 집합교육1.0000.7010.4630.7501.0000.671
2023년 집합교육1.0000.8851.0000.7600.6711.000
2024-03-23T05:38:45.651002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2019년 집합교육2020년 집합교육2021년 집합교육2022년 집합교육2023년 집합교육
2019년 집합교육1.0000.7420.7260.7660.649
2020년 집합교육0.7421.0000.8370.8520.744
2021년 집합교육0.7260.8371.0000.8770.709
2022년 집합교육0.7660.8520.8771.0000.828
2023년 집합교육0.6490.7440.7090.8281.000

Missing values

2024-03-23T05:38:37.596406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T05:38:37.998677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

교육 연령2019년 집합교육2020년 집합교육2021년 집합교육2022년 집합교육2023년 집합교육
01940년생00100
11941년생10000
21942년생00000
31943년생00000
41944년생10101
51945년생30111
61946년생30010
71947년생40120
81948년생41063
91949년생40261
교육 연령2019년 집합교육2020년 집합교육2021년 집합교육2022년 집합교육2023년 집합교육
602000년생24253920
612001년생079621
622002년생0142621
632003년생0011216
642004년생00013
65고등학생0000281
66중학생66044290
67초등학생280000
68미취학아동450195380
69정보없음3212452393123215