Overview

Dataset statistics

Number of variables8
Number of observations33
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory69.0 B

Variable types

DateTime2
Categorical4
Numeric1
Text1

Dataset

Description2020년 1년간 한국항공우주연구원 메인 홈페이지로 접수된 견학통계입니다.
Author한국항공우주연구원
URLhttps://www.data.go.kr/data/15091999/fileData.do

Alerts

견학시간 has constant value ""Constant
인원 is highly overall correlated with 견학코드High correlation
견학코드 is highly overall correlated with 인원 and 1 other fieldsHigh correlation
시간구분 is highly overall correlated with 견학코드High correlation
인원 has 10 (30.3%) zerosZeros

Reproduction

Analysis started2023-12-12 17:36:30.849845
Analysis finished2023-12-12 17:36:31.380784
Duration0.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct21
Distinct (%)63.6%
Missing0
Missing (%)0.0%
Memory size396.0 B
Minimum2020-01-06 00:00:00
Maximum2020-03-23 00:00:00
2023-12-13T02:36:31.425517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:36:31.521854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)

견학코드
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size396.0 B
단체
18 
개인
15 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row단체
3rd row단체
4th row단체
5th row단체

Common Values

ValueCountFrequency (%)
단체 18
54.5%
개인 15
45.5%

Length

2023-12-13T02:36:31.628483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:36:31.710987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
단체 18
54.5%
개인 15
45.5%

시간구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size396.0 B
오후
23 
오전
10 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row오후
2nd row오전
3rd row오후
4th row오전
5th row오후

Common Values

ValueCountFrequency (%)
오후 23
69.7%
오전 10
30.3%

Length

2023-12-13T02:36:31.812021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:36:31.900082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
오후 23
69.7%
오전 10
30.3%

견학시간
Categorical

CONSTANT 

Distinct1
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size396.0 B
1시간
33 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1시간
2nd row1시간
3rd row1시간
4th row1시간
5th row1시간

Common Values

ValueCountFrequency (%)
1시간 33
100.0%

Length

2023-12-13T02:36:31.986564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:36:32.070061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1시간 33
100.0%

단체구분
Categorical

Distinct8
Distinct (%)24.2%
Missing0
Missing (%)0.0%
Memory size396.0 B
초등학교
일반
공무원
중학교
대학생
Other values (3)

Length

Max length4
Median length3
Mean length2.969697
Min length2

Unique

Unique1 ?
Unique (%)3.0%

Sample

1st row중학교
2nd row일반
3rd row교사
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
초등학교 7
21.2%
일반 6
18.2%
공무원 6
18.2%
중학교 5
15.2%
대학생 3
9.1%
군인 3
9.1%
고등학교 2
 
6.1%
교사 1
 
3.0%

Length

2023-12-13T02:36:32.173564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:36:32.314250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
초등학교 7
21.2%
일반 6
18.2%
공무원 6
18.2%
중학교 5
15.2%
대학생 3
9.1%
군인 3
9.1%
고등학교 2
 
6.1%
교사 1
 
3.0%

인원
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct17
Distinct (%)51.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.30303
Minimum0
Maximum65
Zeros10
Zeros (%)30.3%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-13T02:36:32.439561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median3
Q328
95-th percentile44.4
Maximum65
Range65
Interquartile range (IQR)28

Descriptive statistics

Standard deviation17.595249
Coefficient of variation (CV)1.3226497
Kurtosis0.83921221
Mean13.30303
Median Absolute Deviation (MAD)3
Skewness1.2913186
Sum439
Variance309.5928
MonotonicityNot monotonic
2023-12-13T02:36:32.617775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%)
0 10
30.3%
3 4
 
12.1%
2 4
 
12.1%
13 2
 
6.1%
40 1
 
3.0%
15 1
 
3.0%
22 1
 
3.0%
65 1
 
3.0%
44 1
 
3.0%
30 1
 
3.0%
Other values (7) 7
21.2%
ValueCountFrequency (%)
0 10
30.3%
1 1
 
3.0%
2 4
 
12.1%
3 4
 
12.1%
6 1
 
3.0%
13 2
 
6.1%
15 1
 
3.0%
22 1
 
3.0%
28 1
 
3.0%
30 1
 
3.0%
ValueCountFrequency (%)
65 1
3.0%
45 1
3.0%
44 1
3.0%
40 1
3.0%
34 1
3.0%
32 1
3.0%
31 1
3.0%
30 1
3.0%
28 1
3.0%
22 1
3.0%
Distinct30
Distinct (%)90.9%
Missing0
Missing (%)0.0%
Memory size396.0 B
Minimum2020-01-02 09:31:00
Maximum2020-02-19 15:52:00
2023-12-13T02:36:32.732422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:36:32.842694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
Distinct20
Distinct (%)60.6%
Missing0
Missing (%)0.0%
Memory size396.0 B
2023-12-13T02:36:33.014481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15
Mean length10.787879
Min length1

Characters and Unicode

Total characters356
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)48.5%

Sample

1st row
2nd row
3rd row
4th row
5th row
ValueCountFrequency (%)
2020-02-06 5
 
11.4%
2020-02-21 5
 
11.4%
10:14 2
 
4.5%
10:13 2
 
4.5%
2020-01-13 2
 
4.5%
9:01 2
 
4.5%
15:57 2
 
4.5%
2020-02-04 2
 
4.5%
9:21 1
 
2.3%
2020-01-23 1
 
2.3%
Other values (20) 20
45.5%
2023-12-13T02:36:33.588287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 84
23.6%
2 73
20.5%
1 45
12.6%
- 44
12.4%
33
 
9.3%
: 22
 
6.2%
5 16
 
4.5%
6 9
 
2.5%
4 8
 
2.2%
3 8
 
2.2%
Other values (3) 14
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 257
72.2%
Dash Punctuation 44
 
12.4%
Space Separator 33
 
9.3%
Other Punctuation 22
 
6.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 84
32.7%
2 73
28.4%
1 45
17.5%
5 16
 
6.2%
6 9
 
3.5%
4 8
 
3.1%
3 8
 
3.1%
9 8
 
3.1%
7 4
 
1.6%
8 2
 
0.8%
Dash Punctuation
ValueCountFrequency (%)
- 44
100.0%
Space Separator
ValueCountFrequency (%)
33
100.0%
Other Punctuation
ValueCountFrequency (%)
: 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 356
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 84
23.6%
2 73
20.5%
1 45
12.6%
- 44
12.4%
33
 
9.3%
: 22
 
6.2%
5 16
 
4.5%
6 9
 
2.5%
4 8
 
2.2%
3 8
 
2.2%
Other values (3) 14
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 356
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 84
23.6%
2 73
20.5%
1 45
12.6%
- 44
12.4%
33
 
9.3%
: 22
 
6.2%
5 16
 
4.5%
6 9
 
2.5%
4 8
 
2.2%
3 8
 
2.2%
Other values (3) 14
 
3.9%

Interactions

2023-12-13T02:36:31.114252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:36:33.691997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
견학신청일견학코드시간구분단체구분인원등록일자수정일자
견학신청일1.0001.0000.7460.0000.9460.9760.577
견학코드1.0001.0000.7230.5470.6160.0000.868
시간구분0.7460.7231.0000.0000.5140.7770.470
단체구분0.0000.5470.0001.0000.6580.7890.399
인원0.9460.6160.5140.6581.0000.9290.000
등록일자0.9760.0000.7770.7890.9291.0000.892
수정일자0.5770.8680.4700.3990.0000.8921.000
2023-12-13T02:36:33.799364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
견학코드단체구분시간구분
견학코드1.0000.3630.514
단체구분0.3631.0000.000
시간구분0.5140.0001.000
2023-12-13T02:36:33.899789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인원견학코드시간구분단체구분
인원1.0000.5770.4940.356
견학코드0.5771.0000.5140.363
시간구분0.4940.5141.0000.000
단체구분0.3560.3630.0001.000

Missing values

2023-12-13T02:36:31.231368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:36:31.341542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

견학신청일견학코드시간구분견학시간단체구분인원등록일자수정일자
02020-01-06개인오후1시간중학교32020-01-02 9:31
12020-01-09단체오전1시간일반302020-01-02 15:28
22020-01-08단체오후1시간교사652020-01-02 17:03
32020-01-08단체오전1시간일반152020-01-06 11:27
42020-01-14단체오후1시간일반402020-01-07 8:47
52020-02-10개인오후1시간초등학교02020-01-07 8:472020-02-07 9:01
62020-01-17단체오전1시간중학교222020-01-08 8:53
72020-01-20개인오후1시간중학교22020-01-08 8:542020-01-13 14:23
82020-01-15단체오후1시간초등학교442020-01-08 13:07
92020-02-10개인오후1시간중학교02020-01-08 16:102020-02-04 17:25
견학신청일견학코드시간구분견학시간단체구분인원등록일자수정일자
232020-02-10개인오후1시간대학생32020-01-22 8:482020-02-06 15:57
242020-01-30단체오전1시간군인132020-01-22 11:222020-01-23 8:52
252020-02-03단체오후1시간일반62020-01-30 13:47
262020-02-06단체오전1시간공무원02020-01-31 14:412020-02-04 9:21
272020-02-24개인오후1시간초등학교02020-02-04 10:552020-02-20 9:02
282020-02-11단체오후1시간공무원32020-02-06 14:372020-02-06 16:51
292020-03-23개인오후1시간초등학교02020-02-06 15:062020-02-21 10:13
302020-03-09개인오후1시간초등학교02020-02-06 15:502020-02-21 10:13
312020-03-09개인오후1시간대학생02020-02-10 13:412020-02-21 10:14
322020-02-28단체오전1시간초등학교02020-02-19 15:522020-02-21 10:11