Overview

Dataset statistics

Number of variables1
Number of observations453
Missing cells0
Missing cells (%)0.0%
Duplicate rows38
Duplicate rows (%)8.4%
Total size in memory3.7 KiB
Average record size in memory8.3 B

Variable types

Text1

Dataset

Description농림수산식품 생산시스템R&D 과제 정보
Author농림식품기술기획평가원
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220211000000001841

Alerts

Dataset has 38 (8.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-11 03:45:16.680597
Analysis finished2023-12-11 03:45:16.900647
Duration0.22 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables


Text

Distinct352
Distinct (%)77.7%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-11T12:45:17.029763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length146
Median length76
Mean length34.677704
Min length2

Characters and Unicode

Total characters15709
Distinct characters265
Distinct categories14 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique314 ?
Unique (%)69.3%

Sample

1st row<html lang="ko">
2nd row<head>
3rd row <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
4th row <meta name="viewport" content="width=device-width" />
5th row <title>서비스 장애</title>
ValueCountFrequency (%)
164
 
15.9%
div 70
 
6.8%
li><a 60
 
5.8%
script 28
 
2.7%
ul 25
 
2.4%
li 21
 
2.0%
target="_blank 15
 
1.5%
button 11
 
1.1%
ul></li 10
 
1.0%
function 9
 
0.9%
Other values (427) 621
60.1%
2023-12-11T12:45:17.459682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1797
 
11.4%
i 751
 
4.8%
a 714
 
4.5%
702
 
4.5%
e 673
 
4.3%
t 623
 
4.0%
> 608
 
3.9%
< 600
 
3.8%
" 586
 
3.7%
/ 583
 
3.7%
Other values (255) 8072
51.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 7920
50.4%
Control 1797
 
11.4%
Other Punctuation 1723
 
11.0%
Math Symbol 1538
 
9.8%
Other Letter 809
 
5.1%
Space Separator 702
 
4.5%
Decimal Number 254
 
1.6%
Uppercase Letter 230
 
1.5%
Dash Punctuation 219
 
1.4%
Close Punctuation 208
 
1.3%
Other values (4) 309
 
2.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
4.7%
28
 
3.5%
26
 
3.2%
25
 
3.1%
25
 
3.1%
17
 
2.1%
17
 
2.1%
16
 
2.0%
15
 
1.9%
15
 
1.9%
Other values (167) 587
72.6%
Lowercase Letter
ValueCountFrequency (%)
i 751
 
9.5%
a 714
 
9.0%
e 673
 
8.5%
t 623
 
7.9%
s 570
 
7.2%
l 522
 
6.6%
r 474
 
6.0%
n 473
 
6.0%
d 378
 
4.8%
o 374
 
4.7%
Other values (16) 2368
29.9%
Uppercase Letter
ValueCountFrequency (%)
L 31
13.5%
C 27
11.7%
I 25
10.9%
M 24
10.4%
P 21
9.1%
D 18
7.8%
O 14
 
6.1%
A 12
 
5.2%
F 10
 
4.3%
R 10
 
4.3%
Other values (12) 38
16.5%
Other Punctuation
ValueCountFrequency (%)
" 586
34.0%
/ 583
33.8%
. 262
15.2%
' 86
 
5.0%
; 84
 
4.9%
: 46
 
2.7%
! 27
 
1.6%
? 19
 
1.1%
# 17
 
1.0%
* 8
 
0.5%
Other values (2) 5
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 90
35.4%
0 69
27.2%
3 44
17.3%
1 25
 
9.8%
5 9
 
3.5%
9 4
 
1.6%
8 4
 
1.6%
7 4
 
1.6%
6 3
 
1.2%
4 2
 
0.8%
Math Symbol
ValueCountFrequency (%)
> 608
39.5%
< 600
39.0%
= 315
20.5%
+ 8
 
0.5%
| 6
 
0.4%
~ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 167
80.3%
} 35
 
16.8%
] 6
 
2.9%
Open Punctuation
ValueCountFrequency (%)
( 166
81.4%
{ 32
 
15.7%
[ 6
 
2.9%
Control
ValueCountFrequency (%)
1797
100.0%
Space Separator
ValueCountFrequency (%)
702
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 219
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 62
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 42
100.0%
Other Symbol
ValueCountFrequency (%)
© 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 8150
51.9%
Common 6750
43.0%
Hangul 809
 
5.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
4.7%
28
 
3.5%
26
 
3.2%
25
 
3.1%
25
 
3.1%
17
 
2.1%
17
 
2.1%
16
 
2.0%
15
 
1.9%
15
 
1.9%
Other values (167) 587
72.6%
Latin
ValueCountFrequency (%)
i 751
 
9.2%
a 714
 
8.8%
e 673
 
8.3%
t 623
 
7.6%
s 570
 
7.0%
l 522
 
6.4%
r 474
 
5.8%
n 473
 
5.8%
d 378
 
4.6%
o 374
 
4.6%
Other values (38) 2598
31.9%
Common
ValueCountFrequency (%)
1797
26.6%
702
 
10.4%
> 608
 
9.0%
< 600
 
8.9%
" 586
 
8.7%
/ 583
 
8.6%
= 315
 
4.7%
. 262
 
3.9%
- 219
 
3.2%
) 167
 
2.5%
Other values (30) 911
13.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14899
94.8%
Hangul 809
 
5.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1797
 
12.1%
i 751
 
5.0%
a 714
 
4.8%
702
 
4.7%
e 673
 
4.5%
t 623
 
4.2%
> 608
 
4.1%
< 600
 
4.0%
" 586
 
3.9%
/ 583
 
3.9%
Other values (77) 7262
48.7%
Hangul
ValueCountFrequency (%)
38
 
4.7%
28
 
3.5%
26
 
3.2%
25
 
3.1%
25
 
3.1%
17
 
2.1%
17
 
2.1%
16
 
2.0%
15
 
1.9%
15
 
1.9%
Other values (167) 587
72.6%
None
ValueCountFrequency (%)
© 1
100.0%

Missing values

2023-12-11T12:45:16.819777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:45:16.877257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

<!DOCTYPE html>
0<html lang="ko">
1<head>
2<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
3<meta name="viewport" content="width=device-width" />
4<title>서비스 장애</title>
5<link href="/css/common.css?v=20220323" rel="stylesheet" />
6<link href="/css/style.css?v=20220323" rel="stylesheet" />
7<link href="/css/mobile.css?v=20220323" rel="stylesheet" media="(max-width:1280px)" />
8<!-- HTLM5shiv ie6~8 -->
9<!--[if lt IE 9]>
<!DOCTYPE html>
443}
444})();
445</script>
446<noscript><p><img src="//weblog.epis.or.kr/piwik/matomo.php?idsite=15&amp;rec=1" style="border:0;" alt="" /></p></noscript>
447<!-- End Matomo Code -->
448</footer>
449</div>
450<div id="popupAlertMessage"></div>
451</body>
452</html>

Duplicate rows

Most frequently occurring

<!DOCTYPE html># duplicates
25}9
11</div>7
24</div>6
31}6
3</ul></li>5
4<ul>5
7</ul></li>5
9<ul>5
12</div>5
18</div>5