Overview

Dataset statistics

Number of variables3
Number of observations365
Missing cells363
Missing cells (%)33.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.7 KiB
Average record size in memory24.4 B

Variable types

DateTime1
Categorical1
Text1

Dataset

Description'2021년도 육군사관학교 학사일정입니다.
Author국방부
URLhttps://www.data.go.kr/data/15089912/fileData.do

Alerts

비고 has 363 (99.5%) missing valuesMissing
일자 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:15:25.713837
Analysis finished2023-12-12 19:15:25.971495
Duration0.26 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일자
Date

UNIQUE 

Distinct365
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
Minimum2021-01-01 00:00:00
Maximum2021-12-31 00:00:00
2023-12-13T04:15:26.062489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:15:26.224328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

주요내용
Categorical

Distinct18
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
<NA>
213 
동계휴가
35 
하계군사훈련
35 
하계휴가
27 
3군사관학교 합동교육
 
12
Other values (13)
43 

Length

Max length23
Median length4
Mean length4.8164384
Min length4

Unique

Unique5 ?
Unique (%)1.4%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 213
58.4%
동계휴가 35
 
9.6%
하계군사훈련 35
 
9.6%
하계휴가 27
 
7.4%
3군사관학교 합동교육 12
 
3.3%
동계교육훈련 12
 
3.3%
기말시험 8
 
2.2%
중간시험 5
 
1.4%
후반기 생도 체력검정 4
 
1.1%
고교방문 입시홍보 3
 
0.8%
Other values (8) 11
 
3.0%

Length

2023-12-13T04:15:26.412913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 213
53.2%
하계군사훈련 35
 
8.8%
동계휴가 35
 
8.8%
하계휴가 27
 
6.8%
3군사관학교 12
 
3.0%
합동교육 12
 
3.0%
동계교육훈련 12
 
3.0%
기말시험 8
 
2.0%
생도 6
 
1.5%
체력검정 6
 
1.5%
Other values (19) 34
 
8.5%

비고
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing363
Missing (%)99.5%
Memory size3.0 KiB
2023-12-13T04:15:26.626579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length24
Mean length24
Min length23

Characters and Unicode

Total characters48
Distinct characters27
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row1. 8.(금) ~ 2. 5.(토), 4주
2nd row코로나 19로 인해 6주에서 5주로 조정 시행
ValueCountFrequency (%)
1 1
 
7.7%
8.(금 1
 
7.7%
1
 
7.7%
2 1
 
7.7%
5.(토 1
 
7.7%
4주 1
 
7.7%
코로나 1
 
7.7%
19로 1
 
7.7%
인해 1
 
7.7%
6주에서 1
 
7.7%
Other values (3) 3
23.1%
2023-12-13T04:15:26.977866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
22.9%
. 4
 
8.3%
3
 
6.2%
3
 
6.2%
) 2
 
4.2%
5 2
 
4.2%
1 2
 
4.2%
( 2
 
4.2%
~ 1
 
2.1%
1
 
2.1%
Other values (17) 17
35.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18
37.5%
Space Separator 11
22.9%
Decimal Number 9
18.8%
Other Punctuation 5
 
10.4%
Close Punctuation 2
 
4.2%
Open Punctuation 2
 
4.2%
Math Symbol 1
 
2.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
16.7%
3
16.7%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (4) 4
22.2%
Decimal Number
ValueCountFrequency (%)
5 2
22.2%
1 2
22.2%
6 1
11.1%
9 1
11.1%
2 1
11.1%
8 1
11.1%
4 1
11.1%
Other Punctuation
ValueCountFrequency (%)
. 4
80.0%
, 1
 
20.0%
Space Separator
ValueCountFrequency (%)
11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 30
62.5%
Hangul 18
37.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3
16.7%
3
16.7%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (4) 4
22.2%
Common
ValueCountFrequency (%)
11
36.7%
. 4
 
13.3%
) 2
 
6.7%
5 2
 
6.7%
1 2
 
6.7%
( 2
 
6.7%
~ 1
 
3.3%
6 1
 
3.3%
9 1
 
3.3%
2 1
 
3.3%
Other values (3) 3
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 30
62.5%
Hangul 18
37.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11
36.7%
. 4
 
13.3%
) 2
 
6.7%
5 2
 
6.7%
1 2
 
6.7%
( 2
 
6.7%
~ 1
 
3.3%
6 1
 
3.3%
9 1
 
3.3%
2 1
 
3.3%
Other values (3) 3
 
10.0%
Hangul
ValueCountFrequency (%)
3
16.7%
3
16.7%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (4) 4
22.2%

Correlations

2023-12-13T04:15:27.080609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주요내용비고
주요내용1.0000.000
비고0.0001.000

Missing values

2023-12-13T04:15:25.849667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:15:25.936695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일자주요내용비고
02021-01-01<NA><NA>
12021-01-02<NA><NA>
22021-01-03<NA><NA>
32021-01-04<NA><NA>
42021-01-05<NA><NA>
52021-01-06<NA><NA>
62021-01-07<NA><NA>
72021-01-08동계휴가1. 8.(금) ~ 2. 5.(토), 4주
82021-01-09동계휴가<NA>
92021-01-10동계휴가<NA>
일자주요내용비고
3552021-12-22기말시험<NA>
3562021-12-23기말시험<NA>
3572021-12-24기말시험<NA>
3582021-12-25동계휴가<NA>
3592021-12-26동계휴가<NA>
3602021-12-27동계휴가<NA>
3612021-12-28동계휴가<NA>
3622021-12-29동계휴가<NA>
3632021-12-30동계휴가<NA>
3642021-12-31동계휴가<NA>