Overview

Dataset statistics

Number of variables: 5
Number of observations: 37
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.7 KiB
Average record size in memory: 47.6 B

Variable types

Numeric: 4
Categorical: 1

Dataset

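Description: Information on Health Educator (보건교육사) certificate applications and issuance, by year and by grade. Figures are broken down into issued, not issued, and number of successful candidates (data provided from 2010 onward). The 2020 national Health Educator examination was not held because of COVID-19; the Grade 2 figures for that year are the results of promotion reviews.
Author: 한국건강증진개발원 (Korea Health Promotion Institute)
URL: https://www.data.go.kr/data/15042489/fileData.do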

Alerts

발급 (issued) is highly overall correlated with 미발급 (not issued) and 2 other fields [High correlation]
미발급 (not issued) is highly overall correlated with 발급 (issued) and 2 other fields [High correlation]
합격자 수 (number of successful candidates) is highly overall correlated with 발급 (issued) and 2 other fields [High correlation]
자격증 등급 (certificate grade) is highly overall correlated with 발급 (issued) and 2 other fields [High correlation]
미발급 (not issued) has 7 (18.9%) zeros [Zeros]

Reproduction

Analysis started: 2023-12-12 17:44:08.978271
Analysis finished: 2023-12-12 17:44:10.770861
Duration: 1.79 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
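
A report of this kind can be regenerated roughly as sketched below. This is a minimal sketch, not the configuration that produced this report (that is what config.json records). The helper name `build_report`, the report title, and the `cp949` default encoding are assumptions made for illustration; `ProfileReport` and its `to_file` method are the standard ydata-profiling entry points.

```python
def build_report(csv_path, html_path, encoding="cp949"):
    """Profile a CSV file and write the result as an HTML report.

    The default encoding is an assumption; pass "utf-8" if the
    downloaded file is encoded that way.
    """
    # Imported here so the helper can be defined even when the
    # third-party packages are not installed yet.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Health Educator certificates")
    report.to_file(html_path)
    return report
```

It would be called with the path of the CSV downloaded from the URL above and the desired output path, for example `build_report("certificates.csv", "report.html")`, where both file names are hypothetical.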

Variables

년도 (Year)
Real number (ℝ)

Distinct: 14
Distinct (%): 37.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2016.6216
Minimum: 2010
Maximum: 2023
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 465.0 B

Quantile statistics

Minimum: 2010
5-th percentile: 2010.8
Q1: 2013
Median: 2017
Q3: 2020
95-th percentile: 2023
Maximum: 2023
Range: 13
Interquartile range (IQR): 7

Descriptive statistics

Standard deviation: 4.0644212
Coefficient of variation (CV): 0.0020154605
Kurtosis: -1.1929069
Mean: 2016.6216
Median Absolute Deviation (MAD): 4
Skewness: 0.027585054
Sum: 74615
Variance: 16.51952
Monotonicity: Increasing
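
Several of the figures above are derived from others: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the range and IQR are differences of quantiles. The sketch below recomputes them from the reported values as a sanity check. The inputs are the rounded numbers printed in this report, so the results agree only up to rounding.

```python
import math

# Values copied from the 년도 summary in this report.
n_observations = 37
total = 74615
std_dev = 4.0644212
minimum, maximum = 2010, 2023
q1, q3 = 2013, 2020

mean = total / n_observations
coefficient_of_variation = std_dev / mean
variance = std_dev ** 2
value_range = maximum - minimum
iqr = q3 - q1

# Compare against the reported figures, allowing for rounding.
checks = {
    "mean": math.isclose(mean, 2016.6216, abs_tol=1e-4),
    "cv": math.isclose(coefficient_of_variation, 0.0020154605, abs_tol=1e-8),
    "variance": math.isclose(variance, 16.51952, abs_tol=1e-4),
    "range": value_range == 13,
    "iqr": iqr == 7,
}
print(checks)
```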
Histogram with fixed size bins (bins=14)
Common values

Value               Count   Frequency (%)
2011                3       8.1%
2013                3       8.1%
2014                3       8.1%
2015                3       8.1%
2017                3       8.1%
2018                3       8.1%
2019                3       8.1%
2021                3       8.1%
2022                3       8.1%
2023                3       8.1%
Other values (4)    7       18.9%

Minimum 10 values

Value               Count   Frequency (%)
2010                2       5.4%
2011                3       8.1%
2012                2       5.4%
2013                3       8.1%
2014                3       8.1%
2015                3       8.1%
2016                2       5.4%
2017                3       8.1%
2018                3       8.1%
2019                3       8.1%

Maximum 10 values

Value               Count   Frequency (%)
2023                3       8.1%
2022                3       8.1%
2021                3       8.1%
2020                1       2.7%
2019                3       8.1%
2018                3       8.1%
2017                3       8.1%
2016                2       5.4%
2015                3       8.1%
2014                3       8.1%

자격증 등급 (Certificate grade)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 8.1%
Missing: 0
Missing (%): 0.0%
Memory size: 428.0 B
보건교육사 2급 (Health Educator, Grade 2): 14
보건교육사 3급 (Health Educator, Grade 3): 13
보건교육사 1급 (Health Educator, Grade 1): 10

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 보건교육사 2급
2nd row: 보건교육사 3급
3rd row: 보건교육사 1급
4th row: 보건교육사 2급
5th row: 보건교육사 3급

Common Values

Value               Count   Frequency (%)
보건교육사 2급      14      37.8%
보건교육사 3급      13      35.1%
보건교육사 1급      10      27.0%

Length

Histogram of lengths of the category

Common Values (Plot)

[Plot of the common values; image not included.]

Words

Value               Count   Frequency (%)
보건교육사          37      50.0%
2급                 14      18.9%
3급                 13      17.6%
1급                 10      13.5%
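
The word counts above are consistent with splitting each category value on whitespace and counting the resulting tokens: 37 values of two words each give 74 tokens, of which 보건교육사 accounts for 37 (50.0%). The sketch below reproduces the table from the category counts under that assumption. Whitespace splitting is inferred from the numbers, not confirmed by the report itself.

```python
from collections import Counter

# Category counts copied from the 자격증 등급 summary in this report.
category_counts = {
    "보건교육사 2급": 14,
    "보건교육사 3급": 13,
    "보건교육사 1급": 10,
}

# Count every whitespace-separated word, weighted by how often
# the category value occurs.
word_counts = Counter()
for value, count in category_counts.items():
    for word in value.split():
        word_counts[word] += count

total_words = sum(word_counts.values())
word_share = {
    word: round(100 * count / total_words, 1)
    for word, count in word_counts.items()
}

for word, count in word_counts.most_common():
    print(word, count, f"{word_share[word]}%")
```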

발급 (Issued)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 33
Distinct (%): 89.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 377.35135
Minimum: 1
Maximum: 2186
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 465.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 9
Median: 83
Q3: 554
95-th percentile: 1509
Maximum: 2186
Range: 2185
Interquartile range (IQR): 545

Descriptive statistics

Standard deviation: 550.99804
Coefficient of variation (CV): 1.4601724
Kurtosis: 3.7251092
Mean: 377.35135
Median Absolute Deviation (MAD): 82
Skewness: 1.9725333
Sum: 13962
Variance: 303598.85
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=33)
Common values

Value               Count   Frequency (%)
1                   4       10.8%
5                   2       5.4%
16                  1       2.7%
83                  1       2.7%
543                 1       2.7%
8                   1       2.7%
61                  1       2.7%
521                 1       2.7%
30                  1       2.7%
2                   1       2.7%
Other values (23)   23      62.2%

Minimum 10 values

Value               Count   Frequency (%)
1                   4       10.8%
2                   1       2.7%
3                   1       2.7%
5                   2       5.4%
8                   1       2.7%
9                   1       2.7%
16                  1       2.7%
17                  1       2.7%
30                  1       2.7%
31                  1       2.7%

Maximum 10 values

Value               Count   Frequency (%)
2186                1       2.7%
1993                1       2.7%
1388                1       2.7%
1289                1       2.7%
796                 1       2.7%
775                 1       2.7%
662                 1       2.7%
660                 1       2.7%
602                 1       2.7%
554                 1       2.7%

미발급 (Not issued)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 24
Distinct (%): 64.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 89.216216
Minimum: 0
Maximum: 790
Zeros: 7
Zeros (%): 18.9%
Negative: 0
Negative (%): 0.0%
Memory size: 465.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
Median: 9
Q3: 141
95-th percentile: 333
Maximum: 790
Range: 790
Interquartile range (IQR): 140

Descriptive statistics

Standard deviation: 158.01057
Coefficient of variation (CV): 1.771097
Kurtosis: 10.076599
Mean: 89.216216
Median Absolute Deviation (MAD): 9
Skewness: 2.8243236
Sum: 3301
Variance: 24967.341
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=24)
Common values

Value               Count   Frequency (%)
0                   7       18.9%
1                   5       13.5%
2                   2       5.4%
5                   2       5.4%
3                   2       5.4%
278                 1       2.7%
21                  1       2.7%
222                 1       2.7%
11                  1       2.7%
272                 1       2.7%
Other values (14)   14      37.8%

Minimum 10 values

Value               Count   Frequency (%)
0                   7       18.9%
1                   5       13.5%
2                   2       5.4%
3                   2       5.4%
5                   2       5.4%
9                   1       2.7%
11                  1       2.7%
18                  1       2.7%
20                  1       2.7%
21                  1       2.7%

Maximum 10 values

Value               Count   Frequency (%)
790                 1       2.7%
353                 1       2.7%
328                 1       2.7%
278                 1       2.7%
272                 1       2.7%
222                 1       2.7%
211                 1       2.7%
151                 1       2.7%
150                 1       2.7%
141                 1       2.7%

합격자 수 (Number of successful candidates)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 32
Distinct (%): 86.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 466.56757
Minimum: 1
Maximum: 2246
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 465.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 10
Median: 96
Q3: 732
95-th percentile: 2142.8
Maximum: 2246
Range: 2245
Interquartile range (IQR): 722

Descriptive statistics

Standard deviation: 649.47323
Coefficient of variation (CV): 1.392024
Kurtosis: 1.9425342
Mean: 466.56757
Median Absolute Deviation (MAD): 95
Skewness: 1.6131892
Sum: 17263
Variance: 421815.47
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=32)
Common values

Value               Count   Frequency (%)
1                   4       10.8%
18                  2       5.4%
88                  2       5.4%
871                 1       2.7%
9                   1       2.7%
63                  1       2.7%
732                 1       2.7%
33                  1       2.7%
5                   1       2.7%
934                 1       2.7%
Other values (22)   22      59.5%

Minimum 10 values

Value               Count   Frequency (%)
1                   4       10.8%
2                   1       2.7%
3                   1       2.7%
5                   1       2.7%
6                   1       2.7%
9                   1       2.7%
10                  1       2.7%
18                  2       5.4%
32                  1       2.7%
33                  1       2.7%

Maximum 10 values

Value               Count   Frequency (%)
2246                1       2.7%
2178                1       2.7%
2134                1       2.7%
1408                1       2.7%
1149                1       2.7%
934                 1       2.7%
925                 1       2.7%
871                 1       2.7%
811                 1       2.7%
732                 1       2.7%

Interactions

[16 interaction plots; images not included.]

Correlations

              년도     자격증 등급   발급     미발급   합격자 수
년도          1.000    0.000         0.256    0.000    0.000
자격증 등급   0.000    1.000         0.659    0.850    0.712
발급          0.256    0.659         1.000    0.782    0.899
미발급        0.000    0.850         0.782    1.000    0.741
합격자 수     0.000    0.712         0.899    0.741    1.000

              년도     발급     미발급   합격자 수   자격증 등급
년도          1.000    -0.126   0.083    -0.077      0.000
발급          -0.126   1.000    0.920    0.993       0.525
미발급        0.083    0.920    1.000    0.942       0.516
합격자 수     -0.077   0.993    0.942    1.000       0.553
자격증 등급   0.000    0.525    0.516    0.553       1.000
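
The "High correlation" alerts near the top of the report can be related to the first of the two matrices above by flagging every pair whose absolute coefficient reaches a threshold. The sketch below uses a threshold of 0.5, which is an assumption: it reproduces the alerts listed in this report (four variables, each flagged with three partners, and 년도 not flagged), but the report does not state the value actually used. The function name `highly_correlated` is hypothetical.

```python
# First correlation matrix from this report, row by row.
columns = ["년도", "자격증 등급", "발급", "미발급", "합격자 수"]
matrix = [
    [1.000, 0.000, 0.256, 0.000, 0.000],
    [0.000, 1.000, 0.659, 0.850, 0.712],
    [0.256, 0.659, 1.000, 0.782, 0.899],
    [0.000, 0.850, 0.782, 1.000, 0.741],
    [0.000, 0.712, 0.899, 0.741, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not taken from the report


def highly_correlated(names, values, threshold):
    """Map each variable to the other variables it is strongly correlated with."""
    flagged = {}
    for i, name in enumerate(names):
        partners = [
            names[j]
            for j, coefficient in enumerate(values[i])
            if j != i and abs(coefficient) >= threshold
        ]
        if partners:
            flagged[name] = partners
    return flagged


flagged = highly_correlated(columns, matrix, THRESHOLD)
for name, partners in flagged.items():
    print(name, "->", ", ".join(partners))
```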

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

      년도   자격증 등급       발급    미발급   합격자 수
0     2010   보건교육사 2급    16      2        18
1     2010   보건교육사 3급    2186    60       2246
2     2011   보건교육사 1급    1       0        1
3     2011   보건교육사 2급    31      1        32
4     2011   보건교육사 3급    1993    141      2134
5     2012   보건교육사 2급    17      1        18
6     2012   보건교육사 3급    602     53       655
7     2013   보건교육사 1급    1       0        1
8     2013   보건교육사 2급    44      3        47
9     2013   보건교육사 3급    1289    119      1408

Last rows

      년도   자격증 등급       발급    미발급   합격자 수
27    2020   보건교육사 2급    30      3        33
28    2021   보건교육사 1급    5       0        5
29    2021   보건교육사 2급    83      5        88
30    2021   보건교육사 3급    662     272      934
31    2022   보건교육사 1급    2       0        2
32    2022   보건교육사 2급    77      11       88
33    2022   보건교육사 3급    414     222      636
34    2023   보건교육사 1급    3       0        3
35    2023   보건교육사 2급    75      21       96
36    2023   보건교육사 3급    438     278      716
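
In the sample rows above, 발급 (issued) plus 미발급 (not issued) equals 합격자 수 (number of successful candidates) in every row, and the column sums reported in the variable summaries agree as well (13962 + 3301 = 17263). The dataset description does not state this as a rule, so the sketch below treats it as an observation and checks it on the 20 sample rows as read from this report.

```python
# (year, grade, issued, not_issued, passed) for the 20 sample rows above.
rows = [
    (2010, "보건교육사 2급", 16, 2, 18),
    (2010, "보건교육사 3급", 2186, 60, 2246),
    (2011, "보건교육사 1급", 1, 0, 1),
    (2011, "보건교육사 2급", 31, 1, 32),
    (2011, "보건교육사 3급", 1993, 141, 2134),
    (2012, "보건교육사 2급", 17, 1, 18),
    (2012, "보건교육사 3급", 602, 53, 655),
    (2013, "보건교육사 1급", 1, 0, 1),
    (2013, "보건교육사 2급", 44, 3, 47),
    (2013, "보건교육사 3급", 1289, 119, 1408),
    (2020, "보건교육사 2급", 30, 3, 33),
    (2021, "보건교육사 1급", 5, 0, 5),
    (2021, "보건교육사 2급", 83, 5, 88),
    (2021, "보건교육사 3급", 662, 272, 934),
    (2022, "보건교육사 1급", 2, 0, 2),
    (2022, "보건교육사 2급", 77, 11, 88),
    (2022, "보건교육사 3급", 414, 222, 636),
    (2023, "보건교육사 1급", 3, 0, 3),
    (2023, "보건교육사 2급", 75, 21, 96),
    (2023, "보건교육사 3급", 438, 278, 716),
]

# Rows where issued + not issued differs from the number who passed.
mismatches = [row for row in rows if row[2] + row[3] != row[4]]

# Column sums reported in the variable summaries of this report.
sum_issued, sum_not_issued, sum_passed = 13962, 3301, 17263
totals_agree = sum_issued + sum_not_issued == sum_passed

print("mismatching sample rows:", mismatches)
print("column totals agree:", totals_agree)
```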