Overview

Dataset statistics

Number of variables6
Number of observations60
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.1 KiB
Average record size in memory52.2 B

Variable types

Numeric2
Categorical3
DateTime1

Dataset

Description공군사관학교 종교활동 통계: 공군사관학교 내 재학중인 생도 인원 중 기독교/천주교/불교/원불교 4개종파의 종교활동 참석인원 및 비율(1주단위)을 알수있음
Author국방부
URLhttps://www.data.go.kr/data/15087550/fileData.do

Alerts

비고 is highly overall correlated with 순번 and 2 other fieldsHigh correlation
비율 is highly overall correlated with 참석자수 and 1 other fieldsHigh correlation
순번 is highly overall correlated with 참석자수 and 1 other fieldsHigh correlation
참석자수 is highly overall correlated with 순번 and 2 other fieldsHigh correlation
순번 has unique valuesUnique
참석자수 has 35 (58.3%) zerosZeros

Reproduction

Analysis started2024-04-16 07:46:24.911458
Analysis finished2024-04-16 07:46:25.537852
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct60
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean30.5
Minimum1
Maximum60
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size672.0 B
2024-04-16T16:46:25.591332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.95
Q115.75
median30.5
Q345.25
95-th percentile57.05
Maximum60
Range59
Interquartile range (IQR)29.5

Descriptive statistics

Standard deviation17.464249
Coefficient of variation (CV)0.57259833
Kurtosis-1.2
Mean30.5
Median Absolute Deviation (MAD)15
Skewness0
Sum1830
Variance305
MonotonicityStrictly increasing
2024-04-16T16:46:25.715226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.7%
32 1
 
1.7%
34 1
 
1.7%
35 1
 
1.7%
36 1
 
1.7%
37 1
 
1.7%
38 1
 
1.7%
39 1
 
1.7%
40 1
 
1.7%
41 1
 
1.7%
Other values (50) 50
83.3%
ValueCountFrequency (%)
1 1
1.7%
2 1
1.7%
3 1
1.7%
4 1
1.7%
5 1
1.7%
6 1
1.7%
7 1
1.7%
8 1
1.7%
9 1
1.7%
10 1
1.7%
ValueCountFrequency (%)
60 1
1.7%
59 1
1.7%
58 1
1.7%
57 1
1.7%
56 1
1.7%
55 1
1.7%
54 1
1.7%
53 1
1.7%
52 1
1.7%
51 1
1.7%

종교구분
Categorical

Distinct4
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size612.0 B
기독교
15 
천주교
15 
불교
15 
원불교
15 

Length

Max length3
Median length3
Mean length2.75
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기독교
2nd row천주교
3rd row불교
4th row원불교
5th row기독교

Common Values

ValueCountFrequency (%)
기독교 15
25.0%
천주교 15
25.0%
불교 15
25.0%
원불교 15
25.0%

Length

2024-04-16T16:46:25.822482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T16:46:25.917179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기독교 15
25.0%
천주교 15
25.0%
불교 15
25.0%
원불교 15
25.0%

참석자수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct16
Distinct (%)26.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.133333
Minimum0
Maximum80
Zeros35
Zeros (%)58.3%
Negative0
Negative (%)0.0%
Memory size672.0 B
2024-04-16T16:46:26.016744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q330
95-th percentile60.75
Maximum80
Range80
Interquartile range (IQR)30

Descriptive statistics

Standard deviation22.092615
Coefficient of variation (CV)1.4598644
Kurtosis1.5135267
Mean15.133333
Median Absolute Deviation (MAD)0
Skewness1.465778
Sum908
Variance488.08362
MonotonicityNot monotonic
2024-04-16T16:46:26.133443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
0 35
58.3%
30 4
 
6.7%
25 3
 
5.0%
15 2
 
3.3%
34 2
 
3.3%
80 2
 
3.3%
35 2
 
3.3%
40 2
 
3.3%
45 1
 
1.7%
75 1
 
1.7%
Other values (6) 6
 
10.0%
ValueCountFrequency (%)
0 35
58.3%
5 1
 
1.7%
15 2
 
3.3%
16 1
 
1.7%
20 1
 
1.7%
25 3
 
5.0%
28 1
 
1.7%
30 4
 
6.7%
34 2
 
3.3%
35 2
 
3.3%
ValueCountFrequency (%)
80 2
3.3%
75 1
 
1.7%
60 1
 
1.7%
56 1
 
1.7%
45 1
 
1.7%
40 2
3.3%
35 2
3.3%
34 2
3.3%
30 4
6.7%
28 1
 
1.7%

비율
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size612.0 B
0%
35 
4%
5%
3%
10%
 
3
Other values (5)

Length

Max length3
Median length2
Mean length2.05
Min length2

Unique

Unique4 ?
Unique (%)6.7%

Sample

1st row10%
2nd row4%
3rd row5%
4th row2%
5th row10%

Common Values

ValueCountFrequency (%)
0% 35
58.3%
4% 7
 
11.7%
5% 4
 
6.7%
3% 4
 
6.7%
10% 3
 
5.0%
2% 3
 
5.0%
6% 1
 
1.7%
7% 1
 
1.7%
1% 1
 
1.7%
8% 1
 
1.7%

Length

2024-04-16T16:46:26.245089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T16:46:26.362411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 35
58.3%
4 7
 
11.7%
5 4
 
6.7%
3 4
 
6.7%
10 3
 
5.0%
2 3
 
5.0%
6 1
 
1.7%
7 1
 
1.7%
1 1
 
1.7%
8 1
 
1.7%
Distinct15
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size612.0 B
Minimum2021-06-09 00:00:00
Maximum2021-09-15 00:00:00
2024-04-16T16:46:26.453650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-16T16:46:26.534237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)

비고
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size612.0 B
<NA>
28 
생도하계방학기간
24 
코로나예방적관찰기간

Length

Max length10
Median length8
Mean length6.4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 28
46.7%
생도하계방학기간 24
40.0%
코로나예방적관찰기간 8
 
13.3%

Length

2024-04-16T16:46:26.626134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T16:46:26.708653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 28
46.7%
생도하계방학기간 24
40.0%
코로나예방적관찰기간 8
 
13.3%

Interactions

2024-04-16T16:46:25.267688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-16T16:46:25.129965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-16T16:46:25.334582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-16T16:46:25.194362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-16T16:46:26.764584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번종교구분참석자수비율발생일자비고
순번1.0000.0000.5290.6110.9840.975
종교구분0.0001.0000.4490.5520.0000.000
참석자수0.5290.4491.0000.9660.499NaN
비율0.6110.5520.9661.0000.444NaN
발생일자0.9840.0000.4990.4441.0001.000
비고0.9750.000NaNNaN1.0001.000
2024-04-16T16:46:26.849585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종교구분비고비율
종교구분1.0000.0000.340
비고0.0001.0001.000
비율0.3401.0001.000
2024-04-16T16:46:26.916708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번참석자수종교구분비율비고
순번1.000-0.8100.0000.2210.798
참석자수-0.8101.0000.3240.8931.000
종교구분0.0000.3241.0000.3400.000
비율0.2210.8930.3401.0001.000
비고0.7981.0000.0001.0001.000

Missing values

2024-04-16T16:46:25.426912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-16T16:46:25.507775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번종교구분참석자수비율발생일자비고
01기독교8010%2021-06-09<NA>
12천주교344%2021-06-09<NA>
23불교405%2021-06-09<NA>
34원불교162%2021-06-09<NA>
45기독교8010%2021-06-16<NA>
56천주교355%2021-06-16<NA>
67불교456%2021-06-16<NA>
78원불교00%2021-06-16<NA>
89기독교7510%2021-06-23<NA>
910천주교253%2021-06-23<NA>
순번종교구분참석자수비율발생일자비고
5051불교00%2021-09-01생도하계방학기간
5152원불교00%2021-09-01생도하계방학기간
5253기독교00%2021-09-08코로나예방적관찰기간
5354천주교00%2021-09-08코로나예방적관찰기간
5455불교00%2021-09-08코로나예방적관찰기간
5556원불교00%2021-09-08코로나예방적관찰기간
5657기독교00%2021-09-15코로나예방적관찰기간
5758천주교00%2021-09-15코로나예방적관찰기간
5859불교00%2021-09-15코로나예방적관찰기간
5960원불교00%2021-09-15코로나예방적관찰기간