Overview

Dataset statistics

Number of variables4
Number of observations34
Missing cells8
Missing cells (%)5.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory35.9 B

Variable types

Text1
DateTime2
Categorical1

Dataset

Description"2021년도 공군사관학교 학사일정입니다.
Author국방부
URLhttps://www.data.go.kr/data/15089929/fileData.do

Alerts

시작일 has 4 (11.8%) missing valuesMissing
종료일 has 4 (11.8%) missing valuesMissing
행사명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:33:11.926335
Analysis finished2023-12-12 04:33:12.398987
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행사명
Text

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size404.0 B
2023-12-12T13:33:12.566754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length14
Mean length9.1470588
Min length3

Characters and Unicode

Total characters311
Distinct characters113
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row1학기 개강
2nd row제73기 생도 입학식/재학생 진급식
3rd row인성·리더십 집중개발 기간
4th row어버이날 행사
5th row리더십 심포지엄
ValueCountFrequency (%)
제74기 3
 
4.6%
1학기 2
 
3.1%
선발 2
 
3.1%
개강 2
 
3.1%
기말시험 2
 
3.1%
시험 2
 
3.1%
무용기 2
 
3.1%
2학기 2
 
3.1%
1
 
1.5%
동계휴가 1
 
1.5%
Other values (46) 46
70.8%
2023-12-12T13:33:12.915606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
31
 
10.0%
18
 
5.8%
17
 
5.5%
) 8
 
2.6%
8
 
2.6%
( 8
 
2.6%
1 6
 
1.9%
4 6
 
1.9%
6
 
1.9%
6
 
1.9%
Other values (103) 197
63.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 236
75.9%
Space Separator 31
 
10.0%
Decimal Number 24
 
7.7%
Close Punctuation 8
 
2.6%
Open Punctuation 8
 
2.6%
Other Punctuation 4
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
7.6%
17
 
7.2%
8
 
3.4%
6
 
2.5%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
Other values (92) 156
66.1%
Decimal Number
ValueCountFrequency (%)
1 6
25.0%
4 6
25.0%
2 4
16.7%
3 4
16.7%
7 4
16.7%
Other Punctuation
ValueCountFrequency (%)
/ 2
50.0%
, 1
25.0%
· 1
25.0%
Space Separator
ValueCountFrequency (%)
31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 236
75.9%
Common 75
 
24.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
7.6%
17
 
7.2%
8
 
3.4%
6
 
2.5%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
Other values (92) 156
66.1%
Common
ValueCountFrequency (%)
31
41.3%
) 8
 
10.7%
( 8
 
10.7%
1 6
 
8.0%
4 6
 
8.0%
2 4
 
5.3%
3 4
 
5.3%
7 4
 
5.3%
/ 2
 
2.7%
, 1
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 236
75.9%
ASCII 74
 
23.8%
None 1
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
31
41.9%
) 8
 
10.8%
( 8
 
10.8%
1 6
 
8.1%
4 6
 
8.1%
2 4
 
5.4%
3 4
 
5.4%
7 4
 
5.4%
/ 2
 
2.7%
, 1
 
1.4%
Hangul
ValueCountFrequency (%)
18
 
7.6%
17
 
7.2%
8
 
3.4%
6
 
2.5%
6
 
2.5%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
5
 
2.1%
Other values (92) 156
66.1%
None
ValueCountFrequency (%)
· 1
100.0%

시작일
Date

MISSING 

Distinct29
Distinct (%)96.7%
Missing4
Missing (%)11.8%
Memory size404.0 B
Minimum2021-02-22 00:00:00
Maximum2022-02-07 00:00:00
2023-12-12T13:33:13.026247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:33:13.151456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)

종료일
Date

MISSING 

Distinct25
Distinct (%)83.3%
Missing4
Missing (%)11.8%
Memory size404.0 B
Minimum2021-03-02 00:00:00
Maximum2022-02-18 00:00:00
2023-12-12T13:33:13.259332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T13:33:13.348656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)

비고
Categorical

Distinct2
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
<NA>
30 
취소

Length

Max length4
Median length4
Mean length3.7647059
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row취소
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 30
88.2%
취소 4
 
11.8%

Length

2023-12-12T13:33:13.780387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:33:13.885500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 30
88.2%
취소 4
 
11.8%

Correlations

2023-12-12T13:33:13.960501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행사명시작일종료일
행사명1.0001.0001.000
시작일1.0001.0000.977
종료일1.0000.9771.000

Missing values

2023-12-12T13:33:12.142569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:33:12.256234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T13:33:12.348650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

행사명시작일종료일비고
01학기 개강2021-02-222021-06-24<NA>
1제73기 생도 입학식/재학생 진급식2021-03-022021-03-02<NA>
2인성·리더십 집중개발 기간2021-04-152021-04-15<NA>
3어버이날 행사<NA><NA>취소
4리더십 심포지엄2021-05-182021-05-18<NA>
5개교기념일2021-06-102021-06-10<NA>
61학기 기말시험2021-06-212021-06-24<NA>
7하계군사훈련준비2021-06-252021-06-25<NA>
8하계군사훈련2021-06-282021-07-23<NA>
9항공우주캠프(대학생)<NA><NA>취소
행사명시작일종료일비고
24성무철인 경기2021-10-262021-10-26<NA>
25성무제2021-10-272021-10-29<NA>
26해외항법2021-11-152021-11-20<NA>
27국토순례(4학년)2021-11-222021-11-26<NA>
28합동교육(1학년, 3학년)2021-11-152021-11-26<NA>
29합동교육(2학년)2021-11-052021-12-03<NA>
302학기 기말시험2021-12-272021-12-30<NA>
31동계휴가2021-12-312022-02-06<NA>
32제74기 기초군사훈련2022-01-212022-02-18<NA>
33동계학기2022-02-072022-02-18<NA>