Overview

Dataset statistics

Number of variables5
Number of observations31
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory48.3 B

Variable types

Numeric4
DateTime1

Dataset

Description1994학년도부터 연도별로 시행된 대학수학능력시험(본수능) 응시현황(시험일정, 지원인원, 응시인원, 응시율)에 대한 통계 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15098904/fileData.do

Alerts

학년도 is highly overall correlated with 지원인원(명) and 2 other fieldsHigh correlation
지원인원(명) is highly overall correlated with 학년도 and 2 other fieldsHigh correlation
응시인원(명) is highly overall correlated with 학년도 and 2 other fieldsHigh correlation
응시율(퍼센트) is highly overall correlated with 학년도 and 2 other fieldsHigh correlation
시험일정 has unique valuesUnique
지원인원(명) has unique valuesUnique
응시인원(명) has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:51:22.341380
Analysis finished2023-12-12 09:51:24.219308
Duration1.88 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

학년도
Real number (ℝ)

HIGH CORRELATION 

Distinct30
Distinct (%)96.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2008.0323
Minimum1994
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size411.0 B
2023-12-12T18:51:24.286932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1994
5-th percentile1994.5
Q12000.5
median2008
Q32015.5
95-th percentile2021.5
Maximum2023
Range29
Interquartile range (IQR)15

Descriptive statistics

Standard deviation9.0387457
Coefficient of variation (CV)0.0045012951
Kurtosis-1.2242473
Mean2008.0323
Median Absolute Deviation (MAD)8
Skewness0.019002164
Sum62249
Variance81.698925
MonotonicityIncreasing
2023-12-12T18:51:24.399251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1994 2
 
6.5%
2010 1
 
3.2%
2023 1
 
3.2%
2022 1
 
3.2%
2021 1
 
3.2%
2020 1
 
3.2%
2019 1
 
3.2%
2018 1
 
3.2%
2017 1
 
3.2%
2016 1
 
3.2%
Other values (20) 20
64.5%
ValueCountFrequency (%)
1994 2
6.5%
1995 1
3.2%
1996 1
3.2%
1997 1
3.2%
1998 1
3.2%
1999 1
3.2%
2000 1
3.2%
2001 1
3.2%
2002 1
3.2%
2003 1
3.2%
ValueCountFrequency (%)
2023 1
3.2%
2022 1
3.2%
2021 1
3.2%
2020 1
3.2%
2019 1
3.2%
2018 1
3.2%
2017 1
3.2%
2016 1
3.2%
2015 1
3.2%
2014 1
3.2%

시험일정
Date

UNIQUE 

Distinct31
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size380.0 B
Minimum1993-08-20 00:00:00
Maximum2022-11-17 00:00:00
2023-12-12T18:51:24.520952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:24.649461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)

지원인원(명)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct31
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean678941.35
Minimum493434
Maximum896122
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size411.0 B
2023-12-12T18:51:24.755072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum493434
5-th percentile508925.5
Q1593666.5
median668522
Q3746424.5
95-th percentile878809
Maximum896122
Range402688
Interquartile range (IQR)152758

Descriptive statistics

Standard deviation116284.1
Coefficient of variation (CV)0.17127267
Kurtosis-0.74198508
Mean678941.35
Median Absolute Deviation (MAD)74995
Skewness0.42857916
Sum21047182
Variance1.3521991 × 1010
MonotonicityNot monotonic
2023-12-12T18:51:25.207313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
742668 1
 
3.2%
750181 1
 
3.2%
508030 1
 
3.2%
509821 1
 
3.2%
493434 1
 
3.2%
548734 1
 
3.2%
594924 1
 
3.2%
593527 1
 
3.2%
605987 1
 
3.2%
631187 1
 
3.2%
Other values (21) 21
67.7%
ValueCountFrequency (%)
493434 1
3.2%
508030 1
3.2%
509821 1
3.2%
548734 1
3.2%
584934 1
3.2%
588839 1
3.2%
588899 1
3.2%
593527 1
3.2%
593806 1
3.2%
594924 1
3.2%
ValueCountFrequency (%)
896122 1
3.2%
885321 1
3.2%
872297 1
3.2%
868643 1
3.2%
840661 1
3.2%
824374 1
3.2%
781749 1
3.2%
750181 1
3.2%
742668 1
3.2%
739129 1
3.2%

응시인원(명)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct31
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean638878.77
Minimum421034
Maximum868366
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size411.0 B
2023-12-12T18:51:25.334955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum421034
5-th percentile447903.5
Q1552090.5
median621336
Q3722537.5
95-th percentile852288.5
Maximum868366
Range447332
Interquartile range (IQR)170447

Descriptive statistics

Standard deviation127465.09
Coefficient of variation (CV)0.19951374
Kurtosis-0.7968436
Mean638878.77
Median Absolute Deviation (MAD)90009
Skewness0.30477998
Sum19805242
Variance1.624735 × 1010
MonotonicityNot monotonic
2023-12-12T18:51:25.451941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
716326 1
 
3.2%
726634 1
 
3.2%
447669 1
 
3.2%
448138 1
 
3.2%
421034 1
 
3.2%
484737 1
 
3.2%
530220 1
 
3.2%
531327 1
 
3.2%
552297 1
 
3.2%
585332 1
 
3.2%
Other values (21) 21
67.7%
ValueCountFrequency (%)
421034 1
3.2%
447669 1
3.2%
448138 1
3.2%
484737 1
3.2%
530220 1
3.2%
531327 1
3.2%
550588 1
3.2%
551884 1
3.2%
552297 1
3.2%
554345 1
3.2%
ValueCountFrequency (%)
868366 1
3.2%
854272 1
3.2%
850305 1
3.2%
839837 1
3.2%
809867 1
3.2%
795338 1
3.2%
757488 1
3.2%
726634 1
3.2%
718441 1
3.2%
716326 1
3.2%

응시율(퍼센트)
Real number (ℝ)

HIGH CORRELATION 

Distinct25
Distinct (%)80.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean93.651613
Minimum85.3
Maximum97.5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size411.0 B
2023-12-12T18:51:25.575536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum85.3
5-th percentile88
Q192.8
median94.1
Q396.5
95-th percentile97.1
Maximum97.5
Range12.2
Interquartile range (IQR)3.7

Descriptive statistics

Standard deviation3.2928074
Coefficient of variation (CV)0.035160178
Kurtosis0.028357462
Mean93.651613
Median Absolute Deviation (MAD)2.4
Skewness-0.91407321
Sum2903.2
Variance10.842581
MonotonicityNot monotonic
2023-12-12T18:51:25.717715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
96.5 3
 
9.7%
96.9 3
 
9.7%
94.1 2
 
6.5%
92.9 2
 
6.5%
93.6 1
 
3.2%
88.1 1
 
3.2%
87.9 1
 
3.2%
85.3 1
 
3.2%
88.3 1
 
3.2%
89.1 1
 
3.2%
Other values (15) 15
48.4%
ValueCountFrequency (%)
85.3 1
3.2%
87.9 1
3.2%
88.1 1
3.2%
88.3 1
3.2%
89.1 1
3.2%
89.5 1
3.2%
91.1 1
3.2%
92.7 1
3.2%
92.9 2
6.5%
93.2 1
3.2%
ValueCountFrequency (%)
97.5 1
 
3.2%
97.2 1
 
3.2%
97.0 1
 
3.2%
96.9 3
9.7%
96.7 1
 
3.2%
96.5 3
9.7%
96.3 1
 
3.2%
95.3 1
 
3.2%
95.0 1
 
3.2%
94.2 1
 
3.2%

Interactions

2023-12-12T18:51:23.699261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:22.494323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:22.901011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:23.353086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:23.793189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:22.590458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:23.010716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:23.432835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:23.890075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:22.704594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:23.139625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:23.517883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:23.970475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:22.806522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:23.243343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:51:23.596092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:51:25.827881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학년도시험일정지원인원(명)응시인원(명)응시율(퍼센트)
학년도1.0001.0000.8990.8950.808
시험일정1.0001.0001.0001.0001.000
지원인원(명)0.8991.0001.0000.9940.680
응시인원(명)0.8951.0000.9941.0000.647
응시율(퍼센트)0.8081.0000.6800.6471.000
2023-12-12T18:51:26.002528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학년도지원인원(명)응시인원(명)응시율(퍼센트)
학년도1.000-0.802-0.844-0.910
지원인원(명)-0.8021.0000.9850.809
응시인원(명)-0.8440.9851.0000.865
응시율(퍼센트)-0.9100.8090.8651.000

Missing values

2023-12-12T18:51:24.094849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:51:24.182903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

학년도시험일정지원인원(명)응시인원(명)응시율(퍼센트)
019941993-08-2074266871632696.5
119941993-11-1675018172663496.9
219951994-11-2378174975748896.9
319961995-11-2284066180986796.3
419971996-11-1382437479533896.5
519981997-11-1988532185427296.5
619991998-11-1886864383983796.7
720001999-11-1789612286836696.9
820012000-11-1587229785030597.5
920022001-11-0773912971844197.2
학년도시험일정지원인원(명)응시인원(명)응시율(퍼센트)
2120142013-11-0765074760681393.2
2220152014-11-1364062159483592.9
2320162015-11-1263118758533292.7
2420172016-11-1760598755229791.1
2520182017-11-2359352753132789.5
2620192018-11-1559492453022089.1
2720202019-11-1454873448473788.3
2820212020-12-0349343442103485.3
2920222021-11-1850982144813887.9
3020232022-11-1750803044766988.1