Overview

Dataset statistics

Number of variables1
Number of observations454
Missing cells0
Missing cells (%)0.0%
Duplicate rows38
Duplicate rows (%)8.4%
Total size in memory3.7 KiB
Average record size in memory8.3 B

Variable types

Text1

Dataset

Description국립농산물품질관리원에서 관리하는 유기농업자재 시험연구기관 지정현황(지정번호, 지정일자, 기관명, 대표자, 소재지, 연락처, 시험분야)
Author국립농산물품질관리원
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20191011000000001207

Alerts

Dataset has 38 (8.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-11 03:15:59.418603
Analysis finished2023-12-11 03:15:59.660315
Duration0.24 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables


Text

Distinct353
Distinct (%)77.8%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-11T12:15:59.827487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length146
Median length78
Mean length35.046256
Min length2

Characters and Unicode

Total characters15911
Distinct characters266
Distinct categories14 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique315 ?
Unique (%)69.4%

Sample

1st row<html lang="ko">
2nd row<head>
3rd row <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
4th row <meta name="viewport" content="width=device-width" />
5th row <title>서비스 장애</title>
ValueCountFrequency (%)
171
 
16.3%
div 70
 
6.7%
li><a 62
 
5.9%
script 28
 
2.7%
ul 25
 
2.4%
li 21
 
2.0%
target="_blank 16
 
1.5%
button 11
 
1.1%
ul></li 10
 
1.0%
function 9
 
0.9%
Other values (429) 623
59.6%
2023-12-11T12:16:00.642878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1800
 
11.3%
i 767
 
4.8%
a 723
 
4.5%
712
 
4.5%
e 683
 
4.3%
t 633
 
4.0%
> 619
 
3.9%
< 611
 
3.8%
/ 594
 
3.7%
" 594
 
3.7%
Other values (256) 8175
51.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 8031
50.5%
Control 1800
 
11.3%
Other Punctuation 1747
 
11.0%
Math Symbol 1564
 
9.8%
Other Letter 824
 
5.2%
Space Separator 712
 
4.5%
Decimal Number 253
 
1.6%
Uppercase Letter 235
 
1.5%
Dash Punctuation 229
 
1.4%
Close Punctuation 207
 
1.3%
Other values (4) 309
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
4.6%
28
 
3.4%
26
 
3.2%
25
 
3.0%
24
 
2.9%
18
 
2.2%
17
 
2.1%
17
 
2.1%
16
 
1.9%
15
 
1.8%
Other values (168) 600
72.8%
Lowercase Letter
ValueCountFrequency (%)
i 767
 
9.6%
a 723
 
9.0%
e 683
 
8.5%
t 633
 
7.9%
s 573
 
7.1%
l 536
 
6.7%
n 481
 
6.0%
r 479
 
6.0%
o 384
 
4.8%
d 382
 
4.8%
Other values (16) 2390
29.8%
Uppercase Letter
ValueCountFrequency (%)
L 32
13.6%
C 29
12.3%
M 26
11.1%
I 25
10.6%
P 21
8.9%
D 17
7.2%
O 14
 
6.0%
R 12
 
5.1%
A 12
 
5.1%
F 10
 
4.3%
Other values (12) 37
15.7%
Other Punctuation
ValueCountFrequency (%)
/ 594
34.0%
" 594
34.0%
. 265
15.2%
' 86
 
4.9%
; 84
 
4.8%
: 46
 
2.6%
! 30
 
1.7%
? 19
 
1.1%
# 16
 
0.9%
* 8
 
0.5%
Other values (2) 5
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 84
33.2%
0 65
25.7%
3 46
18.2%
1 30
 
11.9%
5 9
 
3.6%
6 9
 
3.6%
8 3
 
1.2%
7 3
 
1.2%
9 3
 
1.2%
4 1
 
0.4%
Math Symbol
ValueCountFrequency (%)
> 619
39.6%
< 611
39.1%
= 319
20.4%
+ 8
 
0.5%
| 6
 
0.4%
~ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 166
80.2%
} 35
 
16.9%
] 6
 
2.9%
Open Punctuation
ValueCountFrequency (%)
( 165
81.3%
{ 32
 
15.8%
[ 6
 
3.0%
Control
ValueCountFrequency (%)
1800
100.0%
Space Separator
ValueCountFrequency (%)
712
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 229
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 62
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 43
100.0%
Other Symbol
ValueCountFrequency (%)
© 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 8266
52.0%
Common 6821
42.9%
Hangul 824
 
5.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
4.6%
28
 
3.4%
26
 
3.2%
25
 
3.0%
24
 
2.9%
18
 
2.2%
17
 
2.1%
17
 
2.1%
16
 
1.9%
15
 
1.8%
Other values (168) 600
72.8%
Latin
ValueCountFrequency (%)
i 767
 
9.3%
a 723
 
8.7%
e 683
 
8.3%
t 633
 
7.7%
s 573
 
6.9%
l 536
 
6.5%
n 481
 
5.8%
r 479
 
5.8%
o 384
 
4.6%
d 382
 
4.6%
Other values (38) 2625
31.8%
Common
ValueCountFrequency (%)
1800
26.4%
712
 
10.4%
> 619
 
9.1%
< 611
 
9.0%
/ 594
 
8.7%
" 594
 
8.7%
= 319
 
4.7%
. 265
 
3.9%
- 229
 
3.4%
) 166
 
2.4%
Other values (30) 912
13.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15086
94.8%
Hangul 824
 
5.2%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1800
 
11.9%
i 767
 
5.1%
a 723
 
4.8%
712
 
4.7%
e 683
 
4.5%
t 633
 
4.2%
> 619
 
4.1%
< 611
 
4.1%
/ 594
 
3.9%
" 594
 
3.9%
Other values (77) 7350
48.7%
Hangul
ValueCountFrequency (%)
38
 
4.6%
28
 
3.4%
26
 
3.2%
25
 
3.0%
24
 
2.9%
18
 
2.2%
17
 
2.1%
17
 
2.1%
16
 
1.9%
15
 
1.8%
Other values (168) 600
72.8%
None
ValueCountFrequency (%)
© 1
100.0%

Missing values

2023-12-11T12:15:59.565230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:15:59.628829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

<!DOCTYPE html>
0<html lang="ko">
1<head>
2<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
3<meta name="viewport" content="width=device-width" />
4<title>서비스 장애</title>
5<link href="/css/common.css?v=20220323" rel="stylesheet" />
6<link href="/css/style.css?v=20220323" rel="stylesheet" />
7<link href="/css/mobile.css?v=20220323" rel="stylesheet" media="(max-width:1280px)" />
8<!-- HTLM5shiv ie6~8 -->
9<!--[if lt IE 9]>
<!DOCTYPE html>
444}
445})();
446</script>
447<noscript><p><img src="//weblog.epis.or.kr/piwik/matomo.php?idsite=15&amp;rec=1" style="border:0;" alt="" /></p></noscript>
448<!-- End Matomo Code -->
449</footer>
450</div>
451<div id="popupAlertMessage"></div>
452</body>
453</html>

Duplicate rows

Most frequently occurring

<!DOCTYPE html># duplicates
25}9
11</div>7
24</div>6
31}6
3</ul></li>5
4<ul>5
7</ul></li>5
9<ul>5
12</div>5
18</div>5