Overview

Dataset statistics

Number of variables1
Number of observations489
Missing cells0
Missing cells (%)0.0%
Duplicate rows42
Duplicate rows (%)8.6%
Total size in memory3.9 KiB
Average record size in memory8.3 B

Variable types

Text1

Dataset

Description국립농산물품질관리원에서 관리하는 생산, 유통 단계에서의 농산물 독소류 분석결과(품목, 수거단계, 재배양식, 생산 지역, 재배면적, 조사물량, 등록일자, 분석결과)
Author국립농산물품질관리원
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20170912000000000793

Alerts

Dataset has 42 (8.6%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-23 07:38:35.718164
Analysis finished2024-03-23 07:38:36.300940
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables


Text

Distinct381
Distinct (%)77.9%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2024-03-23T07:38:36.521249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length146
Median length89
Mean length36.274029
Min length2

Characters and Unicode

Total characters17738
Distinct characters270
Distinct categories14 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique339 ?
Unique (%)69.3%

Sample

1st row<html lang="ko">
2nd row<head>
3rd row <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
4th row <meta name="viewport" content="width=device-width" />
5th row <title>서비스 장애</title>
ValueCountFrequency (%)
183
 
16.3%
div 70
 
6.2%
li><a 62
 
5.5%
script 28
 
2.5%
ul 25
 
2.2%
li 21
 
1.9%
target="_blank 16
 
1.4%
function 13
 
1.2%
button 11
 
1.0%
let 10
 
0.9%
Other values (471) 685
60.9%
2024-03-23T07:38:37.478128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1868
 
10.5%
i 858
 
4.8%
a 811
 
4.6%
e 797
 
4.5%
793
 
4.5%
t 746
 
4.2%
" 648
 
3.7%
s 636
 
3.6%
> 621
 
3.5%
< 613
 
3.5%
Other values (260) 9347
52.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 9225
52.0%
Other Punctuation 1935
 
10.9%
Control 1868
 
10.5%
Math Symbol 1602
 
9.0%
Other Letter 849
 
4.8%
Space Separator 793
 
4.5%
Uppercase Letter 332
 
1.9%
Decimal Number 284
 
1.6%
Open Punctuation 260
 
1.5%
Close Punctuation 237
 
1.3%
Other values (4) 353
 
2.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
4.6%
28
 
3.3%
28
 
3.3%
26
 
3.1%
26
 
3.1%
18
 
2.1%
18
 
2.1%
17
 
2.0%
16
 
1.9%
15
 
1.8%
Other values (170) 618
72.8%
Lowercase Letter
ValueCountFrequency (%)
i 858
 
9.3%
a 811
 
8.8%
e 797
 
8.6%
t 746
 
8.1%
s 636
 
6.9%
n 578
 
6.3%
l 575
 
6.2%
r 542
 
5.9%
o 491
 
5.3%
d 441
 
4.8%
Other values (16) 2750
29.8%
Uppercase Letter
ValueCountFrequency (%)
L 43
13.0%
C 33
9.9%
I 31
9.3%
M 30
9.0%
P 29
8.7%
D 28
 
8.4%
O 22
 
6.6%
A 16
 
4.8%
F 16
 
4.8%
S 13
 
3.9%
Other values (14) 71
21.4%
Other Punctuation
ValueCountFrequency (%)
" 648
33.5%
/ 608
31.4%
. 318
16.4%
' 134
 
6.9%
; 93
 
4.8%
: 51
 
2.6%
! 30
 
1.6%
# 21
 
1.1%
? 19
 
1.0%
* 8
 
0.4%
Other values (2) 5
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 86
30.3%
0 81
28.5%
3 49
17.3%
1 37
13.0%
5 9
 
3.2%
6 7
 
2.5%
7 5
 
1.8%
8 5
 
1.8%
9 4
 
1.4%
4 1
 
0.4%
Math Symbol
ValueCountFrequency (%)
> 621
38.8%
< 613
38.3%
= 348
21.7%
+ 12
 
0.7%
| 6
 
0.4%
~ 2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 215
82.7%
{ 33
 
12.7%
[ 12
 
4.6%
Close Punctuation
ValueCountFrequency (%)
) 192
81.0%
} 39
 
16.5%
] 6
 
2.5%
Control
ValueCountFrequency (%)
1868
100.0%
Space Separator
ValueCountFrequency (%)
793
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 230
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 71
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 51
100.0%
Other Symbol
ValueCountFrequency (%)
© 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 9557
53.9%
Common 7332
41.3%
Hangul 849
 
4.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
4.6%
28
 
3.3%
28
 
3.3%
26
 
3.1%
26
 
3.1%
18
 
2.1%
18
 
2.1%
17
 
2.0%
16
 
1.9%
15
 
1.8%
Other values (170) 618
72.8%
Latin
ValueCountFrequency (%)
i 858
 
9.0%
a 811
 
8.5%
e 797
 
8.3%
t 746
 
7.8%
s 636
 
6.7%
n 578
 
6.0%
l 575
 
6.0%
r 542
 
5.7%
o 491
 
5.1%
d 441
 
4.6%
Other values (40) 3082
32.2%
Common
ValueCountFrequency (%)
1868
25.5%
793
10.8%
" 648
 
8.8%
> 621
 
8.5%
< 613
 
8.4%
/ 608
 
8.3%
= 348
 
4.7%
. 318
 
4.3%
- 230
 
3.1%
( 215
 
2.9%
Other values (30) 1070
14.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 16888
95.2%
Hangul 849
 
4.8%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1868
 
11.1%
i 858
 
5.1%
a 811
 
4.8%
e 797
 
4.7%
793
 
4.7%
t 746
 
4.4%
" 648
 
3.8%
s 636
 
3.8%
> 621
 
3.7%
< 613
 
3.6%
Other values (79) 8497
50.3%
Hangul
ValueCountFrequency (%)
39
 
4.6%
28
 
3.3%
28
 
3.3%
26
 
3.1%
26
 
3.1%
18
 
2.1%
18
 
2.1%
17
 
2.0%
16
 
1.9%
15
 
1.8%
Other values (170) 618
72.8%
None
ValueCountFrequency (%)
© 1
100.0%

Missing values

2024-03-23T07:38:36.043610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T07:38:36.222330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

<!DOCTYPE html>
0<html lang="ko">
1<head>
2<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
3<meta name="viewport" content="width=device-width" />
4<title>서비스 장애</title>
5<link href="/css/common.css?v=20220323" rel="stylesheet" />
6<link href="/css/style.css?v=20220323" rel="stylesheet" />
7<link href="/css/mobile.css?v=20220323" rel="stylesheet" media="(max-width:1280px)" />
8<!-- HTLM5shiv ie6~8 -->
9<!--[if lt IE 9]>
<!DOCTYPE html>
479g.type='text/javascript'; g.async=true; g.src=u+'matomo.js'; s.parentNode.insertBefore(g
480})();
481</script>
482<noscript><p><img src="//weblog.epis.or.kr/piwik/matomo.php?idsite=15&amp;rec=1" style="border:0;" alt="" /></p></noscript>
483<!-- End Matomo Code -->
484</footer>
485</div>
486<div id="popupAlertMessage"></div>
487</body>
488</html>

Duplicate rows

Most frequently occurring

<!DOCTYPE html># duplicates
29}9
11</div>7
28</div>6
35}6
36});6
3</ul></li>5
4<ul>5
7</ul></li>5
9<ul>5
12</div>5