Overview

Dataset statistics

Number of variables: 5
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.4 KiB
Average record size in memory: 48.3 B

Variable types

Numeric: 4
Categorical: 1

Dataset

Description: Analysis data based on the news database "BIGKinds". For other metadata and further information, visit https://www.bigkinds.or.kr.
Author: Korea Press Foundation (한국언론진흥재단)
URL: https://www.data.go.kr/data/15126554/fileData.do

Alerts

High correlation: 언론의 추정 난이도 평균 (mean press-estimated difficulty) is highly overall correlated with 언론의 추정 난이도 표준편차 (standard deviation of press-estimated difficulty)
High correlation: 언론의 추정 난이도 표준편차 is highly overall correlated with 언론의 추정 난이도 평균
Unique: 언론의 추정 난이도 평균 has unique values
Unique: 언론의 추정 난이도 표준편차 has unique values
Zeros: 언론의 추정 난이도 표준편차 has 1 (3.3%) zeros
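
The Unique and Zeros alerts can be read off the per-variable counts reported under Variables. The sketch below is a minimal, hypothetical re-derivation in Python: `ColumnCounts` and `count_alerts` are illustrative names that are not part of ydata-profiling, and the rule that any zero at all raises a Zeros alert is an assumption chosen because it matches this report.

```python
from dataclasses import dataclass


@dataclass
class ColumnCounts:
    """Counts copied from the per-variable tables of the report."""
    name: str
    n: int         # number of observations
    distinct: int  # number of distinct values
    zeros: int     # number of values equal to zero


def count_alerts(col: ColumnCounts) -> list[str]:
    """Return alert labels for one column (assumed rules, not the library's code)."""
    alerts = []
    if col.distinct == col.n:
        alerts.append("Unique")
    if col.zeros > 0:
        share = 100 * col.zeros / col.n
        alerts.append(f"Zeros: {col.zeros} ({share:.1f}%)")
    return alerts


columns = [
    ColumnCounts("연도", n=30, distinct=10, zeros=0),
    ColumnCounts("보도 기사 수", n=30, distinct=27, zeros=0),
    ColumnCounts("언론의 추정 난이도 평균", n=30, distinct=30, zeros=0),
    ColumnCounts("언론의 추정 난이도 표준편차", n=30, distinct=30, zeros=1),
]

for col in columns:
    print(col.name, count_alerts(col))
```

Only the two difficulty columns produce alerts under these rules, which agrees with the list above.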

Reproduction

Analysis started: 2024-03-14 23:06:20.336343
Analysis finished: 2024-03-14 23:06:24.607493
Duration: 4.27 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
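
A report of this kind can presumably be regenerated with the `ProfileReport` class of ydata-profiling. The following is a minimal sketch, not the configuration actually used for this report (that is recorded in config.json): the helper name `build_report`, the `cp949` encoding, the report title, and the output file name are all assumptions. The two third-party imports sit inside the function so that the module still loads where pandas and ydata-profiling are not installed.

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html",
                 encoding: str = "cp949") -> Path:
    """Profile a CSV file and write the HTML report to out_path."""
    # Third-party dependencies, imported lazily (pip install ydata-profiling).
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)  # encoding is an assumption
    profile = ProfileReport(df, title="Profiling report")
    profile.to_file(out_path)
    return Path(out_path)


# Example call with a hypothetical file name:
# build_report("difficulty.csv")
```

Timestamps, durations and memory figures will differ from run to run, so only the statistics themselves should be expected to match.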

Variables

연도 (year)
Real number (ℝ)

Distinct: 10
Distinct (%): 33.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2017.5
Minimum: 2013
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 398.0 B

Quantile statistics

Minimum: 2013
5th percentile: 2013
Q1: 2015
Median: 2017.5
Q3: 2020
95th percentile: 2022
Maximum: 2022
Range: 9
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 2.9213837
Coefficient of variation (CV): 0.0014480217
Kurtosis: -1.2256534
Mean: 2017.5
Median Absolute Deviation (MAD): 2.5
Skewness: 0
Sum: 60525
Variance: 8.5344828
Monotonicity: Increasing
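
The figures for 연도 can be re-derived by hand, because the value table shows every year from 2013 to 2022 occurring exactly three times. The sketch below uses only Python's standard `statistics` module. It assumes the report uses the sample (n - 1) variance and linearly interpolated quantiles, which is what the printed numbers are consistent with.

```python
import statistics

# Each year 2013..2022 appears three times (30 observations).
years = sorted(list(range(2013, 2023)) * 3)

mean = statistics.mean(years)
variance = statistics.variance(years)  # sample variance, divisor n - 1
std = statistics.stdev(years)
cv = std / mean                        # coefficient of variation
# "inclusive" interpolates linearly between the observed minimum and maximum.
q1, median, q3 = statistics.quantiles(years, n=4, method="inclusive")
# Median absolute deviation: median of |x - median|.
mad = statistics.median(abs(y - median) for y in years)

print(f"sum={sum(years)} mean={mean} variance={variance:.7f}")
print(f"std={std:.7f} cv={cv:.10f}")
print(f"Q1={q1} median={median} Q3={q3} IQR={q3 - q1} MAD={mad}")
```

The coefficient of variation is tiny here only because calendar years sit far from zero. It says little about the spread of a column like this one.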
Histogram with fixed size bins (bins=10)

Common values (the listings of the 10 minimum and 10 maximum values contain the same ten rows)

Value   Count   Frequency (%)
2013    3       10.0%
2014    3       10.0%
2015    3       10.0%
2016    3       10.0%
2017    3       10.0%
2018    3       10.0%
2019    3       10.0%
2020    3       10.0%
2021    3       10.0%
2022    3       10.0%

과목 (subject)
Categorical

Distinct: 3
Distinct (%): 10.0%
Missing: 0
Missing (%): 0.0%
Memory size: 368.0 B
국어 (Korean): 10
수학 (mathematics): 10
영어 (English): 10

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 국어
2nd row: 수학
3rd row: 영어
4th row: 국어
5th row: 수학

Common Values

Value   Count   Frequency (%)
국어      10      33.3%
수학      10      33.3%
영어      10      33.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Plot of the category counts: 국어 10 (33.3%), 수학 10 (33.3%), 영어 10 (33.3%)

보도 기사 수 (number of news articles)
Real number (ℝ)

Distinct: 27
Distinct (%): 90.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 75.066667
Minimum: 28
Maximum: 147
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 398.0 B

Quantile statistics

Minimum: 28
5th percentile: 43.35
Q1: 54
Median: 75.5
Q3: 86.5
95th percentile: 141.1
Maximum: 147
Range: 119
Interquartile range (IQR): 32.5

Descriptive statistics

Standard deviation: 29.094298
Coefficient of variation (CV): 0.38757945
Kurtosis: 1.0826772
Mean: 75.066667
Median Absolute Deviation (MAD): 18
Skewness: 1.0364797
Sum: 2252
Variance: 846.47816
Monotonicity: Not monotonic
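
Several of the figures above are functions of one another, which gives a quick consistency check on the table. The sketch below copies the reported values for 보도 기사 수 into a dictionary (the key names are illustrative) and verifies the standard identities mean = sum / n, variance = std², CV = std / mean, range = max - min and IQR = Q3 - Q1, allowing for rounding in the printed digits.

```python
import math

# Values copied from the report for 보도 기사 수 (30 observations).
reported = {
    "n": 30, "sum": 2252, "mean": 75.066667,
    "std": 29.094298, "variance": 846.47816, "cv": 0.38757945,
    "min": 28, "max": 147, "range": 119,
    "q1": 54, "q3": 86.5, "iqr": 32.5,
}


def close(a: float, b: float) -> bool:
    """Compare two numbers, tolerating rounding in the printed digits."""
    return math.isclose(a, b, rel_tol=1e-6)


checks = {
    "mean = sum / n": close(reported["sum"] / reported["n"], reported["mean"]),
    "variance = std ** 2": close(reported["std"] ** 2, reported["variance"]),
    "cv = std / mean": close(reported["std"] / reported["mean"], reported["cv"]),
    "range = max - min": reported["max"] - reported["min"] == reported["range"],
    "iqr = q3 - q1": reported["q3"] - reported["q1"] == reported["iqr"],
}

for name, ok in checks.items():
    print(f"{name}: {'ok' if ok else 'MISMATCH'}")
```

The same checks apply unchanged to the other numeric variables once their values are substituted.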
Histogram with fixed size bins (bins=27)

Common values

Value               Count   Frequency (%)
54                  2       6.7%
45                  2       6.7%
85                  2       6.7%
92                  1       3.3%
51                  1       3.3%
88                  1       3.3%
79                  1       3.3%
74                  1       3.3%
28                  1       3.3%
57                  1       3.3%
Other values (17)   17      56.7%

Minimum 10 values

Value   Count   Frequency (%)
28      1       3.3%
42      1       3.3%
45      2       6.7%
49      1       3.3%
50      1       3.3%
51      1       3.3%
54      2       6.7%
56      1       3.3%
57      1       3.3%
60      1       3.3%

Maximum 10 values

Value   Count   Frequency (%)
147     1       3.3%
142     1       3.3%
140     1       3.3%
94      1       3.3%
93      1       3.3%
92      1       3.3%
88      1       3.3%
87      1       3.3%
85      2       6.7%
84      1       3.3%

언론의 추정 난이도 평균 (mean press-estimated difficulty)
Real number (ℝ)

Alerts: High correlation, Unique

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.42035253
Minimum: -0.76428571
Maximum: 1
Zeros: 0
Zeros (%): 0.0%
Negative: 6
Negative (%): 20.0%
Memory size: 398.0 B

Quantile statistics

Minimum: -0.76428571
5th percentile: -0.5914193
Q1: 0.12152778
Median: 0.61439888
Q3: 0.75642857
95th percentile: 0.97366164
Maximum: 1
Range: 1.7642857
Interquartile range (IQR): 0.63490079

Descriptive statistics

Standard deviation: 0.51329298
Coefficient of variation (CV): 1.2211012
Kurtosis: -0.038232705
Mean: 0.42035253
Median Absolute Deviation (MAD): 0.21534057
Skewness: -1.0609406
Sum: 12.610576
Variance: 0.26346969
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=30)

Common values

Value               Count   Frequency (%)
0.375               1       3.3%
0.517857143         1       3.3%
0.62745098          1       3.3%
0.6                 1       3.3%
-0.590909091        1       3.3%
0.722222222         1       3.3%
0.835443038         1       3.3%
0.608108108         1       3.3%
-0.214285714        1       3.3%
0.466666667         1       3.3%
Other values (20)   20      66.7%

Minimum 10 values

Value           Count   Frequency (%)
-0.764285714    1       3.3%
-0.591836735    1       3.3%
-0.590909091    1       3.3%
-0.355555556    1       3.3%
-0.294117647    1       3.3%
-0.214285714    1       3.3%
0.035087719     1       3.3%
0.037037037     1       3.3%
0.375           1       3.3%
0.404761905     1       3.3%

Maximum 10 values

Value           Count   Frequency (%)
1.0             1       3.3%
0.985507246     1       3.3%
0.959183673     1       3.3%
0.909090909     1       3.3%
0.9             1       3.3%
0.835443038     1       3.3%
0.795180723     1       3.3%
0.761904762     1       3.3%
0.74            1       3.3%
0.722222222     1       3.3%

언론의 추정 난이도 표준편차 (standard deviation of press-estimated difficulty)
Real number (ℝ)

Alerts: High correlation, Unique, Zeros

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.54971492
Minimum: 0
Maximum: 0.96297367
Zeros: 1
Zeros (%): 3.3%
Negative: 0
Negative (%): 0.0%
Memory size: 398.0 B

Quantile statistics

Minimum: 0
5th percentile: 0.15617395
Q1: 0.49911549
Median: 0.57270605
Q3: 0.65373481
95th percentile: 0.80746868
Maximum: 0.96297367
Range: 0.96297367
Interquartile range (IQR): 0.15461932

Descriptive statistics

Standard deviation: 0.2036439
Coefficient of variation (CV): 0.37045366
Kurtosis: 1.3745995
Mean: 0.54971492
Median Absolute Deviation (MAD): 0.083783245
Skewness: -0.8738736
Sum: 16.491448
Variance: 0.04147084
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=30)

Common values

Value               Count   Frequency (%)
0.752632511         1       3.3%
0.571793715         1       3.3%
0.598691383         1       3.3%
0.601585208         1       3.3%
0.654539649         1       3.3%
0.56356904          1       3.3%
0.40605607          1       3.3%
0.637150497         1       3.3%
0.568112069         1       3.3%
0.625227231         1       3.3%
Other values (20)   20      66.7%

Minimum 10 values

Value          Count   Frequency (%)
0.0            1       3.3%
0.120385853    1       3.3%
0.199914948    1       3.3%
0.289364921    1       3.3%
0.30253169     1       3.3%
0.40605607     1       3.3%
0.484366512    1       3.3%
0.486973159    1       3.3%
0.535542491    1       3.3%
0.556914139    1       3.3%

Maximum 10 values

Value          Count   Frequency (%)
0.962973673    1       3.3%
0.851915597    1       3.3%
0.753144668    1       3.3%
0.752632511    1       3.3%
0.679423404    1       3.3%
0.672195401    1       3.3%
0.671887315    1       3.3%
0.654539649    1       3.3%
0.651320293    1       3.3%
0.637150497    1       3.3%

Interactions

Sixteen pairwise scatter plots (one per ordered pair of the four numeric variables); images not preserved.

Correlations

First correlation heatmap and table:

Variable  연도  과목  보도 기사 수  언론의 추정 난이도 평균  언론의 추정 난이도 표준편차
연도  1.000  0.000  0.590  0.556  0.695
과목  0.000  1.000  0.335  0.000  0.000
보도 기사 수  0.590  0.335  1.000  0.618  0.000
언론의 추정 난이도 평균  0.556  0.000  0.618  1.000  0.704
언론의 추정 난이도 표준편차  0.695  0.000  0.000  0.704  1.000

Second correlation heatmap and table:

Variable  연도  보도 기사 수  언론의 추정 난이도 평균  언론의 추정 난이도 표준편차  과목
연도  1.000  -0.427  -0.091  0.046  0.000
보도 기사 수  -0.427  1.000  -0.094  0.100  0.205
언론의 추정 난이도 평균  -0.091  -0.094  1.000  -0.705  0.000
언론의 추정 난이도 표준편차  0.046  0.100  -0.705  1.000  0.000
과목  0.000  0.205  0.000  0.000  1.000
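
The single High correlation alert can be cross-checked against the second table above. The sketch below assumes that an alert is raised when the absolute coefficient reaches 0.5. Both that threshold and the choice of the second table as the source of the alert are assumptions, since the report states neither. Under those assumptions exactly one pair qualifies, which matches the Alerts section.

```python
from itertools import combinations

names = ["연도", "보도 기사 수", "언론의 추정 난이도 평균",
         "언론의 추정 난이도 표준편차", "과목"]

# Second correlation table of the report, rows and columns in the order of `names`.
matrix = [
    [1.000, -0.427, -0.091, 0.046, 0.000],
    [-0.427, 1.000, -0.094, 0.100, 0.205],
    [-0.091, -0.094, 1.000, -0.705, 0.000],
    [0.046, 0.100, -0.705, 1.000, 0.000],
    [0.000, 0.205, 0.000, 0.000, 1.000],
]

THRESHOLD = 0.5  # assumed alert threshold, not stated in the report


def high_pairs(names, matrix, threshold):
    """Return (name_a, name_b, r) for every off-diagonal pair with |r| >= threshold."""
    return [
        (names[i], names[j], matrix[i][j])
        for i, j in combinations(range(len(names)), 2)
        if abs(matrix[i][j]) >= threshold
    ]


for a, b, r in high_pairs(names, matrix, THRESHOLD):
    print(f"{a} / {b}: r = {r}")
```

Applying the same threshold to the first table would flag several more pairs (for example 연도 and 언론의 추정 난이도 표준편차 at 0.695), so the two tables evidently report different measures.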

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Row  연도  과목  보도 기사 수  언론의 추정 난이도 평균  언론의 추정 난이도 표준편차
0  2013  국어  80  0.375  0.752633
1  2013  수학  92  0.641304  0.60368
2  2013  영어  87  0.62069  0.65132
3  2014  국어  142  0.647887  0.573618
4  2014  수학  147  -0.591837  0.558313
5  2014  영어  140  -0.764286  0.557625
6  2015  국어  93  0.634409  0.672195
7  2015  수학  83  0.795181  0.535542
8  2015  영어  84  0.404762  0.851916
9  2016  국어  69  0.985507  0.120386

Last rows

Row  연도  과목  보도 기사 수  언론의 추정 난이도 평균  언론의 추정 난이도 표준편차
20  2019  영어  45  -0.355556  0.679423
21  2020  국어  57  0.035088  0.962974
22  2020  수학  45  0.466667  0.625227
23  2020  영어  28  -0.214286  0.568112
24  2021  국어  74  0.608108  0.63715
25  2021  수학  79  0.835443  0.406056
26  2021  영어  54  0.722222  0.563569
27  2022  국어  88  -0.590909  0.65454
28  2022  수학  85  0.6  0.601585
29  2022  영어  51  0.627451  0.598691