Overview

Dataset statistics

Number of variables5
Number of observations448
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.2%
Total size in memory18.1 KiB
Average record size in memory41.3 B

Variable types

Text1
Numeric1
Boolean1
DateTime2

Dataset

Description학점은행제 정보공시 시스템의 공통코드 상세 내용이며, 공통코드상세, 순번, 사용여부, 생성일시, 수정일시 항목의 정보를 제공합니다.
Author국가평생교육진흥원
URLhttps://www.data.go.kr/data/15090540/fileData.do

Alerts

사용여부 has constant value ""Constant
Dataset has 1 (0.2%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 01:12:53.186319
Analysis finished2023-12-12 01:12:53.749727
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct434
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
2023-12-12T10:12:53.942399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length76
Median length41.5
Mean length18.227679
Min length1

Characters and Unicode

Total characters8166
Distinct characters304
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique421 ?
Unique (%)94.0%

Sample

1st row중요무형문화재기관
2nd row전공심화 및 특별과정
3rd row평생교육시설
4th row원격교육
5th row공지사항
ValueCountFrequency (%)
공시 158
 
9.5%
2020년 92
 
5.5%
따른 65
 
3.9%
예외처리 60
 
3.6%
46
 
2.8%
2월 42
 
2.5%
8월 40
 
2.4%
2019년 38
 
2.3%
시정명령 29
 
1.7%
9월 26
 
1.6%
Other values (472) 1069
64.2%
2023-12-12T10:12:54.521193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1370
 
16.8%
2 329
 
4.0%
300
 
3.7%
0 294
 
3.6%
282
 
3.5%
244
 
3.0%
221
 
2.7%
188
 
2.3%
1 167
 
2.0%
157
 
1.9%
Other values (294) 4614
56.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5435
66.6%
Space Separator 1370
 
16.8%
Decimal Number 1060
 
13.0%
Other Punctuation 105
 
1.3%
Math Symbol 52
 
0.6%
Close Punctuation 48
 
0.6%
Open Punctuation 47
 
0.6%
Uppercase Letter 44
 
0.5%
Dash Punctuation 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
300
 
5.5%
282
 
5.2%
244
 
4.5%
221
 
4.1%
188
 
3.5%
157
 
2.9%
151
 
2.8%
133
 
2.4%
129
 
2.4%
120
 
2.2%
Other values (253) 3510
64.6%
Uppercase Letter
ValueCountFrequency (%)
C 9
20.5%
O 8
18.2%
K 7
15.9%
M 5
11.4%
I 3
 
6.8%
A 2
 
4.5%
X 2
 
4.5%
D 1
 
2.3%
H 1
 
2.3%
R 1
 
2.3%
Other values (5) 5
11.4%
Decimal Number
ValueCountFrequency (%)
2 329
31.0%
0 294
27.7%
1 167
15.8%
8 70
 
6.6%
9 69
 
6.5%
7 49
 
4.6%
3 33
 
3.1%
6 26
 
2.5%
4 13
 
1.2%
5 10
 
0.9%
Other Punctuation
ValueCountFrequency (%)
. 84
80.0%
: 7
 
6.7%
/ 4
 
3.8%
! 4
 
3.8%
* 3
 
2.9%
, 2
 
1.9%
· 1
 
1.0%
Math Symbol
ValueCountFrequency (%)
= 20
38.5%
< 17
32.7%
> 15
28.8%
Close Punctuation
ValueCountFrequency (%)
) 26
54.2%
] 22
45.8%
Open Punctuation
ValueCountFrequency (%)
( 25
53.2%
[ 22
46.8%
Space Separator
ValueCountFrequency (%)
1370
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5433
66.5%
Common 2687
32.9%
Latin 44
 
0.5%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
300
 
5.5%
282
 
5.2%
244
 
4.5%
221
 
4.1%
188
 
3.5%
157
 
2.9%
151
 
2.8%
133
 
2.4%
129
 
2.4%
120
 
2.2%
Other values (251) 3508
64.6%
Common
ValueCountFrequency (%)
1370
51.0%
2 329
 
12.2%
0 294
 
10.9%
1 167
 
6.2%
. 84
 
3.1%
8 70
 
2.6%
9 69
 
2.6%
7 49
 
1.8%
3 33
 
1.2%
6 26
 
1.0%
Other values (16) 196
 
7.3%
Latin
ValueCountFrequency (%)
C 9
20.5%
O 8
18.2%
K 7
15.9%
M 5
11.4%
I 3
 
6.8%
A 2
 
4.5%
X 2
 
4.5%
D 1
 
2.3%
H 1
 
2.3%
R 1
 
2.3%
Other values (5) 5
11.4%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5433
66.5%
ASCII 2730
33.4%
CJK 2
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1370
50.2%
2 329
 
12.1%
0 294
 
10.8%
1 167
 
6.1%
. 84
 
3.1%
8 70
 
2.6%
9 69
 
2.5%
7 49
 
1.8%
3 33
 
1.2%
6 26
 
1.0%
Other values (30) 239
 
8.8%
Hangul
ValueCountFrequency (%)
300
 
5.5%
282
 
5.2%
244
 
4.5%
221
 
4.1%
188
 
3.5%
157
 
2.9%
151
 
2.8%
133
 
2.4%
129
 
2.4%
120
 
2.2%
Other values (251) 3508
64.6%
None
ValueCountFrequency (%)
· 1
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

순번
Real number (ℝ)

Distinct149
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.611607
Minimum1
Maximum158
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.1 KiB
2023-12-12T10:12:54.731103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median10
Q349.25
95-th percentile134.65
Maximum158
Range157
Interquartile range (IQR)47.25

Descriptive statistics

Standard deviation44.117963
Coefficient of variation (CV)1.3125812
Kurtosis0.59793713
Mean33.611607
Median Absolute Deviation (MAD)9
Skewness1.3708986
Sum15058
Variance1946.3947
MonotonicityNot monotonic
2023-12-12T10:12:54.924042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 72
 
16.1%
2 49
 
10.9%
3 28
 
6.2%
4 21
 
4.7%
5 16
 
3.6%
6 13
 
2.9%
7 10
 
2.2%
8 7
 
1.6%
9 7
 
1.6%
12 6
 
1.3%
Other values (139) 219
48.9%
ValueCountFrequency (%)
1 72
16.1%
2 49
10.9%
3 28
 
6.2%
4 21
 
4.7%
5 16
 
3.6%
6 13
 
2.9%
7 10
 
2.2%
8 7
 
1.6%
9 7
 
1.6%
10 5
 
1.1%
ValueCountFrequency (%)
158 1
0.2%
157 1
0.2%
154 1
0.2%
153 1
0.2%
152 1
0.2%
151 1
0.2%
150 1
0.2%
149 1
0.2%
148 1
0.2%
147 1
0.2%

사용여부
Boolean

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size580.0 B
True
448 
ValueCountFrequency (%)
True 448
100.0%
2023-12-12T10:12:55.077567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct173
Distinct (%)38.6%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
Minimum2017-11-20 00:00:00
Maximum2022-09-16 10:25:23
2023-12-12T10:12:55.204774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:12:55.379732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct164
Distinct (%)36.6%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
Minimum2017-11-20 00:00:00
Maximum2022-11-17 11:05:19
2023-12-12T10:12:55.581385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:12:55.766092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T10:12:53.411714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T10:12:53.583867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:12:53.701679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공통코드상세순번사용여부생성일시수정일시
0중요무형문화재기관1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
1전공심화 및 특별과정1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
2평생교육시설1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
3원격교육1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
4공지사항1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
5FAQ2Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
6자료실3Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
7문서보관실1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
8매체제작실3Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
9기관 운영규칙 및 평가인정 학습과정 운영에 관한 각종 규정1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
공통코드상세순번사용여부생성일시수정일시
438대학부설평생교육원1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
439전문대학부설평생교육원1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
440기술계학원1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
441사회계학원1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
442예능계학원1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
443공공직업훈련원1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
444인정직업훈련원1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
445정부관련기관1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
446고등기술학교1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0
447특수학교1Y2017-11-20 00:00:00.02017-11-20 00:00:00.0

Duplicate rows

Most frequently occurring

공통코드상세순번사용여부생성일시수정일시# duplicates
0공통1Y2017-11-20 00:00:00.02017-11-20 00:00:00.02