Overview

Dataset statistics

Number of variables3
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory882.0 B
Average record size in memory29.4 B

Variable types

Numeric1
Categorical1
Text1

Dataset

Description독학학위제의 전공 분야 별 동일 전공 인정학과 정보 목록에 대한 데이터로 전공분야와 동일 전공 항목을 제공합니다.
Author국가평생교육진흥원
URLhttps://www.data.go.kr/data/15050110/fileData.do

Alerts

연번 is highly overall correlated with 전공분야High correlation
전공분야 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
동일전공 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:16:23.076127
Analysis finished2023-12-12 20:16:23.470817
Duration0.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.5
Minimum1
Maximum30
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-13T05:16:23.558683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.45
Q18.25
median15.5
Q322.75
95-th percentile28.55
Maximum30
Range29
Interquartile range (IQR)14.5

Descriptive statistics

Standard deviation8.8034084
Coefficient of variation (CV)0.56796183
Kurtosis-1.2
Mean15.5
Median Absolute Deviation (MAD)7.5
Skewness0
Sum465
Variance77.5
MonotonicityStrictly increasing
2023-12-13T05:16:23.728150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1 1
 
3.3%
17 1
 
3.3%
30 1
 
3.3%
29 1
 
3.3%
28 1
 
3.3%
27 1
 
3.3%
26 1
 
3.3%
25 1
 
3.3%
24 1
 
3.3%
23 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
1 1
3.3%
2 1
3.3%
3 1
3.3%
4 1
3.3%
5 1
3.3%
6 1
3.3%
7 1
3.3%
8 1
3.3%
9 1
3.3%
10 1
3.3%
ValueCountFrequency (%)
30 1
3.3%
29 1
3.3%
28 1
3.3%
27 1
3.3%
26 1
3.3%
25 1
3.3%
24 1
3.3%
23 1
3.3%
22 1
3.3%
21 1
3.3%

전공분야
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
영어영문학
13 
법학
국어국문학
심리학

Length

Max length5
Median length5
Mean length4.0333333
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국어국문학
2nd row국어국문학
3rd row국어국문학
4th row국어국문학
5th row국어국문학

Common Values

ValueCountFrequency (%)
영어영문학 13
43.3%
법학 7
23.3%
국어국문학 6
20.0%
심리학 4
 
13.3%

Length

2023-12-13T05:16:23.880343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:16:24.003910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영어영문학 13
43.3%
법학 7
23.3%
국어국문학 6
20.0%
심리학 4
 
13.3%

동일전공
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-13T05:16:24.273744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length4.1333333
Min length2

Characters and Unicode

Total characters124
Distinct characters45
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row국어
2nd row국어교육
3rd row영상문예
4th row문예창작
5th row미디어문예창작
ValueCountFrequency (%)
국어 1
 
3.3%
국어교육 1
 
3.3%
법무행정 1
 
3.3%
법률실무 1
 
3.3%
법률 1
 
3.3%
사법 1
 
3.3%
공법 1
 
3.3%
국제법무 1
 
3.3%
상담심리학 1
 
3.3%
산업심리학 1
 
3.3%
Other values (20) 20
66.7%
2023-12-13T05:16:24.701588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15
 
12.1%
11
 
8.9%
7
 
5.6%
6
 
4.8%
5
 
4.0%
5
 
4.0%
5
 
4.0%
4
 
3.2%
4
 
3.2%
4
 
3.2%
Other values (35) 58
46.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 119
96.0%
Uppercase Letter 5
 
4.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
12.6%
11
 
9.2%
7
 
5.9%
6
 
5.0%
5
 
4.2%
5
 
4.2%
5
 
4.2%
4
 
3.4%
4
 
3.4%
4
 
3.4%
Other values (30) 53
44.5%
Uppercase Letter
ValueCountFrequency (%)
E 1
20.0%
L 1
20.0%
O 1
20.0%
S 1
20.0%
T 1
20.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 119
96.0%
Latin 5
 
4.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
 
12.6%
11
 
9.2%
7
 
5.9%
6
 
5.0%
5
 
4.2%
5
 
4.2%
5
 
4.2%
4
 
3.4%
4
 
3.4%
4
 
3.4%
Other values (30) 53
44.5%
Latin
ValueCountFrequency (%)
E 1
20.0%
L 1
20.0%
O 1
20.0%
S 1
20.0%
T 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 119
96.0%
ASCII 5
 
4.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
15
 
12.6%
11
 
9.2%
7
 
5.9%
6
 
5.0%
5
 
4.2%
5
 
4.2%
5
 
4.2%
4
 
3.4%
4
 
3.4%
4
 
3.4%
Other values (30) 53
44.5%
ASCII
ValueCountFrequency (%)
E 1
20.0%
L 1
20.0%
O 1
20.0%
S 1
20.0%
T 1
20.0%

Interactions

2023-12-13T05:16:23.196244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:16:24.824768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번전공분야동일전공
연번1.0000.9521.000
전공분야0.9521.0001.000
동일전공1.0001.0001.000
2023-12-13T05:16:24.941272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번전공분야
연번1.0000.769
전공분야0.7691.000

Missing values

2023-12-13T05:16:23.335435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:16:23.431032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번전공분야동일전공
01국어국문학국어
12국어국문학국어교육
23국어국문학영상문예
34국어국문학문예창작
45국어국문학미디어문예창작
56국어국문학한국어문학
67영어영문학국제문화
78영어영문학관광영어
89영어영문학관광영어통역
910영어영문학관광통역
연번전공분야동일전공
2021심리학가족상담학
2122심리학산업심리학
2223심리학상담심리학
2324법학국제법무
2425법학공법
2526법학사법
2627법학법률
2728법학법률실무
2829법학법무행정
2930법학해사법