Overview

Dataset statistics

Number of variables4
Number of observations95
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory34.4 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description사이언스레벨업 사이트에서 제공하는 생활주변에서 찾아볼 수 있는 생활과학에 대한 정보입니다.
Author한국과학창의재단
URLhttps://www.data.go.kr/data/15093441/fileData.do

Alerts

등록일자 is highly imbalanced (64.9%)Imbalance
연번 has unique valuesUnique
제목 has unique valuesUnique
내용 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:44:32.629609
Analysis finished2023-12-12 12:44:33.349698
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct95
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49.168421
Minimum2
Maximum97
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size987.0 B
2023-12-12T21:44:33.427871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile6.7
Q125.5
median49
Q372.5
95-th percentile92.3
Maximum97
Range95
Interquartile range (IQR)47

Descriptive statistics

Standard deviation27.813456
Coefficient of variation (CV)0.56567723
Kurtosis-1.1861397
Mean49.168421
Median Absolute Deviation (MAD)24
Skewness0.020749197
Sum4671
Variance773.58835
MonotonicityStrictly increasing
2023-12-12T21:44:33.568418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 1
 
1.1%
3 1
 
1.1%
72 1
 
1.1%
71 1
 
1.1%
70 1
 
1.1%
69 1
 
1.1%
68 1
 
1.1%
67 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
Other values (85) 85
89.5%
ValueCountFrequency (%)
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
11 1
1.1%
ValueCountFrequency (%)
97 1
1.1%
96 1
1.1%
95 1
1.1%
94 1
1.1%
93 1
1.1%
92 1
1.1%
91 1
1.1%
90 1
1.1%
89 1
1.1%
88 1
1.1%

제목
Text

UNIQUE 

Distinct95
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size892.0 B
2023-12-12T21:44:33.944421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length26
Mean length18.663158
Min length3

Characters and Unicode

Total characters1773
Distinct characters347
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)100.0%

Sample

1st row평창동계올림픽_강릉코스탈클러스터
2nd row바이애슬론_평창동계올림픽
3rd row스키점프_평창동계올림픽
4th row피겨스케이팅_평창동계올림픽
5th row아이스하키_평창동계올림픽
ValueCountFrequency (%)
10
 
2.2%
첨단 7
 
1.6%
가장 5
 
1.1%
위한 5
 
1.1%
5
 
1.1%
인공지능 5
 
1.1%
얼음 4
 
0.9%
로봇 4
 
0.9%
과학 4
 
0.9%
음성인식기반 4
 
0.9%
Other values (351) 395
88.2%
2023-12-12T21:44:34.462952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
371
 
20.9%
37
 
2.1%
37
 
2.1%
32
 
1.8%
28
 
1.6%
27
 
1.5%
25
 
1.4%
24
 
1.4%
21
 
1.2%
20
 
1.1%
Other values (337) 1151
64.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1349
76.1%
Space Separator 371
 
20.9%
Decimal Number 16
 
0.9%
Other Punctuation 12
 
0.7%
Dash Punctuation 11
 
0.6%
Uppercase Letter 9
 
0.5%
Connector Punctuation 5
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
 
2.7%
37
 
2.7%
32
 
2.4%
28
 
2.1%
27
 
2.0%
25
 
1.9%
24
 
1.8%
21
 
1.6%
20
 
1.5%
20
 
1.5%
Other values (315) 1078
79.9%
Decimal Number
ValueCountFrequency (%)
1 4
25.0%
0 3
18.8%
3 2
12.5%
4 1
 
6.2%
9 1
 
6.2%
2 1
 
6.2%
8 1
 
6.2%
5 1
 
6.2%
7 1
 
6.2%
6 1
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
S 2
22.2%
M 1
11.1%
D 1
11.1%
R 1
11.1%
V 1
11.1%
T 1
11.1%
C 1
11.1%
I 1
11.1%
Space Separator
ValueCountFrequency (%)
371
100.0%
Other Punctuation
ValueCountFrequency (%)
. 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1349
76.1%
Common 415
 
23.4%
Latin 9
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
 
2.7%
37
 
2.7%
32
 
2.4%
28
 
2.1%
27
 
2.0%
25
 
1.9%
24
 
1.8%
21
 
1.6%
20
 
1.5%
20
 
1.5%
Other values (315) 1078
79.9%
Common
ValueCountFrequency (%)
371
89.4%
. 12
 
2.9%
- 11
 
2.7%
_ 5
 
1.2%
1 4
 
1.0%
0 3
 
0.7%
3 2
 
0.5%
4 1
 
0.2%
9 1
 
0.2%
2 1
 
0.2%
Other values (4) 4
 
1.0%
Latin
ValueCountFrequency (%)
S 2
22.2%
M 1
11.1%
D 1
11.1%
R 1
11.1%
V 1
11.1%
T 1
11.1%
C 1
11.1%
I 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1349
76.1%
ASCII 424
 
23.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
371
87.5%
. 12
 
2.8%
- 11
 
2.6%
_ 5
 
1.2%
1 4
 
0.9%
0 3
 
0.7%
3 2
 
0.5%
S 2
 
0.5%
4 1
 
0.2%
9 1
 
0.2%
Other values (12) 12
 
2.8%
Hangul
ValueCountFrequency (%)
37
 
2.7%
37
 
2.7%
32
 
2.4%
28
 
2.1%
27
 
2.0%
25
 
1.9%
24
 
1.8%
21
 
1.6%
20
 
1.5%
20
 
1.5%
Other values (315) 1078
79.9%

내용
Text

UNIQUE 

Distinct95
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size892.0 B
2023-12-12T21:44:34.732936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length252
Median length28
Mean length20.884211
Min length3

Characters and Unicode

Total characters1984
Distinct characters356
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)100.0%

Sample

1st row평창동계올림픽_강릉코스탈클러스터
2nd row바이애슬론_평창동계올림픽
3rd row스키점프_평창동계올림픽
4th row피겨스케이팅_평창동계올림픽
5th row아이스하키_평창동계올림픽
ValueCountFrequency (%)
10
 
2.0%
위한 7
 
1.4%
첨단 7
 
1.4%
인공지능 5
 
1.0%
가장 5
 
1.0%
기술 4
 
0.8%
얼음 4
 
0.8%
음성인식기반 4
 
0.8%
로봇 4
 
0.8%
4
 
0.8%
Other values (397) 448
89.2%
2023-12-12T21:44:35.130680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
434
 
21.9%
39
 
2.0%
37
 
1.9%
35
 
1.8%
31
 
1.6%
30
 
1.5%
28
 
1.4%
25
 
1.3%
25
 
1.3%
22
 
1.1%
Other values (346) 1278
64.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1507
76.0%
Space Separator 434
 
21.9%
Decimal Number 13
 
0.7%
Dash Punctuation 11
 
0.6%
Uppercase Letter 9
 
0.5%
Other Punctuation 5
 
0.3%
Connector Punctuation 5
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
2.6%
37
 
2.5%
35
 
2.3%
31
 
2.1%
30
 
2.0%
28
 
1.9%
25
 
1.7%
25
 
1.7%
22
 
1.5%
22
 
1.5%
Other values (329) 1213
80.5%
Uppercase Letter
ValueCountFrequency (%)
S 2
22.2%
D 1
11.1%
M 1
11.1%
T 1
11.1%
C 1
11.1%
I 1
11.1%
V 1
11.1%
R 1
11.1%
Decimal Number
ValueCountFrequency (%)
1 6
46.2%
0 4
30.8%
2 1
 
7.7%
3 1
 
7.7%
8 1
 
7.7%
Space Separator
ValueCountFrequency (%)
434
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Other Punctuation
ValueCountFrequency (%)
. 5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1507
76.0%
Common 468
 
23.6%
Latin 9
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
2.6%
37
 
2.5%
35
 
2.3%
31
 
2.1%
30
 
2.0%
28
 
1.9%
25
 
1.7%
25
 
1.7%
22
 
1.5%
22
 
1.5%
Other values (329) 1213
80.5%
Common
ValueCountFrequency (%)
434
92.7%
- 11
 
2.4%
1 6
 
1.3%
. 5
 
1.1%
_ 5
 
1.1%
0 4
 
0.9%
2 1
 
0.2%
3 1
 
0.2%
8 1
 
0.2%
Latin
ValueCountFrequency (%)
S 2
22.2%
D 1
11.1%
M 1
11.1%
T 1
11.1%
C 1
11.1%
I 1
11.1%
V 1
11.1%
R 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1507
76.0%
ASCII 477
 
24.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
434
91.0%
- 11
 
2.3%
1 6
 
1.3%
. 5
 
1.0%
_ 5
 
1.0%
0 4
 
0.8%
S 2
 
0.4%
D 1
 
0.2%
M 1
 
0.2%
T 1
 
0.2%
Other values (7) 7
 
1.5%
Hangul
ValueCountFrequency (%)
39
 
2.6%
37
 
2.5%
35
 
2.3%
31
 
2.1%
30
 
2.0%
28
 
1.9%
25
 
1.7%
25
 
1.7%
22
 
1.5%
22
 
1.5%
Other values (329) 1213
80.5%

등록일자
Categorical

IMBALANCE 

Distinct6
Distinct (%)6.3%
Missing0
Missing (%)0.0%
Memory size892.0 B
2020-04-13
79 
2020-08-27
11 
2020-08-28
 
2
2021-02-09
 
1
2021-02-10
 
1

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique3 ?
Unique (%)3.2%

Sample

1st row2020-04-13
2nd row2020-04-13
3rd row2020-04-13
4th row2020-04-13
5th row2020-04-13

Common Values

ValueCountFrequency (%)
2020-04-13 79
83.2%
2020-08-27 11
 
11.6%
2020-08-28 2
 
2.1%
2021-02-09 1
 
1.1%
2021-02-10 1
 
1.1%
2021-03-25 1
 
1.1%

Length

2023-12-12T21:44:35.296729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:44:35.398534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-04-13 79
83.2%
2020-08-27 11
 
11.6%
2020-08-28 2
 
2.1%
2021-02-09 1
 
1.1%
2021-02-10 1
 
1.1%
2021-03-25 1
 
1.1%

Interactions

2023-12-12T21:44:33.068654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:44:35.485913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번제목내용등록일자
연번1.0001.0001.0000.609
제목1.0001.0001.0001.000
내용1.0001.0001.0001.000
등록일자0.6091.0001.0001.000
2023-12-12T21:44:35.599921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등록일자
연번1.0000.367
등록일자0.3671.000

Missing values

2023-12-12T21:44:33.223853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:44:33.313982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번제목내용등록일자
02평창동계올림픽_강릉코스탈클러스터평창동계올림픽_강릉코스탈클러스터2020-04-13
13바이애슬론_평창동계올림픽바이애슬론_평창동계올림픽2020-04-13
24스키점프_평창동계올림픽스키점프_평창동계올림픽2020-04-13
35피겨스케이팅_평창동계올림픽피겨스케이팅_평창동계올림픽2020-04-13
46아이스하키_평창동계올림픽아이스하키_평창동계올림픽2020-04-13
57강릉코스탈클러스터강릉코스탈클러스터2020-04-13
68바이애슬론바이애슬론2020-04-13
79스키점프스키점프2020-04-13
810피겨스케이팅피겨스케이팅2020-04-13
911아이스하키아이스하키2020-04-13
연번제목내용등록일자
8588스트레스는 어떻게 흰머리를 만들까스트레스는 어떻게 흰머리를 만들까2020-08-27
8689인간은 왜 잠들까인간은 왜 잠들까2020-08-27
8790미세플라스틱의 위험성미세플라스틱의 위험성2020-08-27
8891동물도 범죄사건의 증인이 될 수 있을까동물도 범죄사건의 증인이 될 수 있을까2020-08-27
8992겨울에는 자외선 차단제를 바르지 않아도 될까겨울에는 자외선 차단제를 바르지 않아도 될까2020-08-27
9093네모 속 아홉가지 경고 MSDS네모 속 아홉가지 경고 MSDS2020-08-28
9194미세먼지 초미세먼지 극초미세먼지 그 다음은미세먼지 초미세먼지 극초미세먼지 그 다음은2020-08-28
9295액체 괴물의 비밀액체 괴물의 비밀2021-02-09
9396물방울의 여행 같이 따라가 볼까요물방울의 여행 같이 따라가 볼까요2021-02-10
9497환경변화에 대비하는 지속가능한 미래 과학기술중고등학교 과학 11 12학년 교육과정에 나오는 태양 풍력 조력 파력 지열 바이오 등과 같은 재생 에너지와 핵융합이나 수소 등과 같은 신에너지 자원을 이해하고 지속가능한 발전의 관점에서 신재생 에너지를 활용하는 방안을 설명할 수 있다.와 연계하여 사용하실 수 있습니다. 환경을 위한 온난화 감소 기술 교통 기술 혁신 에너지 효율 개선 제품 개발 자연환경 보호 등을 통해 미래를 위한 좀 더 나은 방안을 설명하는 인포그래픽 콘텐츠입니다.2021-03-25