Overview

Dataset statistics

Number of variables5
Number of observations26
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory50.1 B

Variable types

Numeric5

Dataset

Description육군사관학교의 남여생도 비율과 문이과 비율입니다.
Author국방부
URLhttps://www.data.go.kr/data/15089913/fileData.do

Alerts

입학년도 is highly overall correlated with 문과(남생도) and 2 other fieldsHigh correlation
문과(남생도) is highly overall correlated with 입학년도 and 3 other fieldsHigh correlation
문과(여생도) is highly overall correlated with 입학년도 and 2 other fieldsHigh correlation
이과(남생도) is highly overall correlated with 문과(남생도)High correlation
이과(여생도) is highly overall correlated with 입학년도 and 2 other fieldsHigh correlation
입학년도 has unique valuesUnique
문과(여생도) has 2 (7.7%) zerosZeros
이과(여생도) has 2 (7.7%) zerosZeros

Reproduction

Analysis started2024-04-17 18:26:22.627613
Analysis finished2024-04-17 18:26:24.708360
Duration2.08 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

입학년도
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2008.5
Minimum1996
Maximum2021
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2024-04-18T03:26:24.755148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1996
5-th percentile1997.25
Q12002.25
median2008.5
Q32014.75
95-th percentile2019.75
Maximum2021
Range25
Interquartile range (IQR)12.5

Descriptive statistics

Standard deviation7.6485293
Coefficient of variation (CV)0.0038080803
Kurtosis-1.2
Mean2008.5
Median Absolute Deviation (MAD)6.5
Skewness0
Sum52221
Variance58.5
MonotonicityStrictly increasing
2024-04-18T03:26:24.844529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
1996 1
 
3.8%
2010 1
 
3.8%
2021 1
 
3.8%
2020 1
 
3.8%
2019 1
 
3.8%
2018 1
 
3.8%
2017 1
 
3.8%
2016 1
 
3.8%
2015 1
 
3.8%
2014 1
 
3.8%
Other values (16) 16
61.5%
ValueCountFrequency (%)
1996 1
3.8%
1997 1
3.8%
1998 1
3.8%
1999 1
3.8%
2000 1
3.8%
2001 1
3.8%
2002 1
3.8%
2003 1
3.8%
2004 1
3.8%
2005 1
3.8%
ValueCountFrequency (%)
2021 1
3.8%
2020 1
3.8%
2019 1
3.8%
2018 1
3.8%
2017 1
3.8%
2016 1
3.8%
2015 1
3.8%
2014 1
3.8%
2013 1
3.8%
2012 1
3.8%

문과(남생도)
Real number (ℝ)

HIGH CORRELATION 

Distinct9
Distinct (%)34.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean110.26923
Minimum83
Maximum145
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2024-04-18T03:26:24.926601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum83
5-th percentile83
Q191.75
median100
Q3140
95-th percentile145
Maximum145
Range62
Interquartile range (IQR)48.25

Descriptive statistics

Standard deviation23.546648
Coefficient of variation (CV)0.21353779
Kurtosis-1.5258017
Mean110.26923
Median Absolute Deviation (MAD)14
Skewness0.48593571
Sum2867
Variance554.44462
MonotonicityNot monotonic
2024-04-18T03:26:25.009385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
100 6
23.1%
140 5
19.2%
83 3
11.5%
97 3
11.5%
145 3
11.5%
90 2
 
7.7%
86 2
 
7.7%
109 1
 
3.8%
131 1
 
3.8%
ValueCountFrequency (%)
83 3
11.5%
86 2
 
7.7%
90 2
 
7.7%
97 3
11.5%
100 6
23.1%
109 1
 
3.8%
131 1
 
3.8%
140 5
19.2%
145 3
11.5%
ValueCountFrequency (%)
145 3
11.5%
140 5
19.2%
131 1
 
3.8%
109 1
 
3.8%
100 6
23.1%
97 3
11.5%
90 2
 
7.7%
86 2
 
7.7%
83 3
11.5%

문과(여생도)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct7
Distinct (%)26.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.269231
Minimum0
Maximum24
Zeros2
Zeros (%)7.7%
Negative0
Negative (%)0.0%
Memory size366.0 B
2024-04-18T03:26:25.089405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3.5
Q114
median15
Q318
95-th percentile24
Maximum24
Range24
Interquartile range (IQR)4

Descriptive statistics

Standard deviation5.4739945
Coefficient of variation (CV)0.35849838
Kurtosis3.7677069
Mean15.269231
Median Absolute Deviation (MAD)1
Skewness-1.3295301
Sum397
Variance29.964615
MonotonicityNot monotonic
2024-04-18T03:26:25.173564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
14 8
30.8%
15 6
23.1%
18 5
19.2%
24 3
 
11.5%
0 2
 
7.7%
16 1
 
3.8%
17 1
 
3.8%
ValueCountFrequency (%)
0 2
 
7.7%
14 8
30.8%
15 6
23.1%
16 1
 
3.8%
17 1
 
3.8%
18 5
19.2%
24 3
 
11.5%
ValueCountFrequency (%)
24 3
 
11.5%
18 5
19.2%
17 1
 
3.8%
16 1
 
3.8%
15 6
23.1%
14 8
30.8%
0 2
 
7.7%

이과(남생도)
Real number (ℝ)

HIGH CORRELATION 

Distinct9
Distinct (%)34.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean136.88462
Minimum119
Maximum150
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2024-04-18T03:26:25.251078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum119
5-th percentile119
Q1130
median140
Q3145
95-th percentile150
Maximum150
Range31
Interquartile range (IQR)15

Descriptive statistics

Standard deviation10.734345
Coefficient of variation (CV)0.078418927
Kurtosis-1.1499642
Mean136.88462
Median Absolute Deviation (MAD)10
Skewness-0.32167983
Sum3559
Variance115.22615
MonotonicityNot monotonic
2024-04-18T03:26:25.330015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
150 6
23.1%
140 5
19.2%
124 3
11.5%
119 3
11.5%
145 3
11.5%
135 2
 
7.7%
130 2
 
7.7%
134 1
 
3.8%
131 1
 
3.8%
ValueCountFrequency (%)
119 3
11.5%
124 3
11.5%
130 2
 
7.7%
131 1
 
3.8%
134 1
 
3.8%
135 2
 
7.7%
140 5
19.2%
145 3
11.5%
150 6
23.1%
ValueCountFrequency (%)
150 6
23.1%
145 3
11.5%
140 5
19.2%
135 2
 
7.7%
134 1
 
3.8%
131 1
 
3.8%
130 2
 
7.7%
124 3
11.5%
119 3
11.5%

이과(여생도)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct6
Distinct (%)23.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.269231
Minimum0
Maximum16
Zeros2
Zeros (%)7.7%
Negative0
Negative (%)0.0%
Memory size366.0 B
2024-04-18T03:26:25.408817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2.25
Q110
median10
Q312
95-th percentile16
Maximum16
Range16
Interquartile range (IQR)2

Descriptive statistics

Standard deviation3.6393152
Coefficient of variation (CV)0.35439025
Kurtosis4.007053
Mean10.269231
Median Absolute Deviation (MAD)1
Skewness-1.4212822
Sum267
Variance13.244615
MonotonicityNot monotonic
2024-04-18T03:26:25.488649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
10 11
42.3%
12 5
19.2%
9 3
 
11.5%
16 3
 
11.5%
0 2
 
7.7%
11 2
 
7.7%
ValueCountFrequency (%)
0 2
 
7.7%
9 3
 
11.5%
10 11
42.3%
11 2
 
7.7%
12 5
19.2%
16 3
 
11.5%
ValueCountFrequency (%)
16 3
 
11.5%
12 5
19.2%
11 2
 
7.7%
10 11
42.3%
9 3
 
11.5%
0 2
 
7.7%

Interactions

2024-04-18T03:26:24.260990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:22.737704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.045588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.369583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.687242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:24.325674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:22.800666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.113094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.438163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.746719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:24.395481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:22.861621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.175158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.499047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.809336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:24.456792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:22.918554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.237347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.557263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.870492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:24.523630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:22.980399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.298997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.620675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T03:26:23.931707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-18T03:26:25.563996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
입학년도문과(남생도)문과(여생도)이과(남생도)이과(여생도)
입학년도1.0000.9330.9880.9570.985
문과(남생도)0.9331.0000.6790.9530.650
문과(여생도)0.9880.6791.0000.8850.988
이과(남생도)0.9570.9530.8851.0000.916
이과(여생도)0.9850.6500.9880.9161.000
2024-04-18T03:26:25.643740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
입학년도문과(남생도)문과(여생도)이과(남생도)이과(여생도)
입학년도1.0000.6210.739-0.2270.836
문과(남생도)0.6211.0000.8320.5590.833
문과(여생도)0.7390.8321.0000.3390.942
이과(남생도)-0.2270.5590.3391.0000.175
이과(여생도)0.8360.8330.9420.1751.000

Missing values

2024-04-18T03:26:24.611587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-18T03:26:24.680583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

입학년도문과(남생도)문과(여생도)이과(남생도)이과(여생도)
0199610001500
1199710001500
219981001515010
319991001515010
420001001515010
520011001515010
62002901513510
72003901513510
82004861413010
92005861413010
입학년도문과(남생도)문과(여생도)이과(남생도)이과(여생도)
1620121091613411
1720131311713111
1820141401814012
1920151401814012
2020161401814012
2120171401814012
2220181401814012
2320191452414516
2420201452414516
2520211452414516