Overview

Dataset statistics

Number of variables1
Number of observations460
Missing cells0
Missing cells (%)0.0%
Duplicate rows39
Duplicate rows (%)8.5%
Total size in memory3.7 KiB
Average record size in memory8.3 B

Variable types

Text1

Dataset

Description수출화물 목재포장재 열처리 지역별, 품목별 처리수량
Author농림축산검역본부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220214000000001866

Alerts

Dataset has 39 (8.5%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-11 03:17:34.877521
Analysis finished2023-12-11 03:17:35.125937
Duration0.25 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables


Text

Distinct355
Distinct (%)77.2%
Missing0
Missing (%)0.0%
Memory size3.7 KiB
2023-12-11T12:17:35.281288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length146
Median length87
Mean length36.213043
Min length2

Characters and Unicode

Total characters16658
Distinct characters266
Distinct categories14 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique316 ?
Unique (%)68.7%

Sample

1st row<html lang="ko">
2nd row<head>
3rd row <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
4th row <meta name="viewport" content="width=device-width" />
5th row <title>서비스 장애</title>
ValueCountFrequency (%)
175
 
16.5%
div 70
 
6.6%
li><a 62
 
5.8%
script 28
 
2.6%
ul 25
 
2.4%
li 21
 
2.0%
target="_blank 16
 
1.5%
button 11
 
1.0%
href="javascript:void(0 10
 
0.9%
ul></li 10
 
0.9%
Other values (433) 635
59.7%
2023-12-11T12:17:35.815533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1814
 
10.9%
i 830
 
5.0%
a 765
 
4.6%
e 735
 
4.4%
723
 
4.3%
t 674
 
4.0%
> 619
 
3.7%
" 614
 
3.7%
< 611
 
3.7%
/ 602
 
3.6%
Other values (256) 8671
52.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 8583
51.5%
Other Punctuation 1822
 
10.9%
Control 1814
 
10.9%
Math Symbol 1584
 
9.5%
Other Letter 824
 
4.9%
Space Separator 723
 
4.3%
Uppercase Letter 265
 
1.6%
Decimal Number 263
 
1.6%
Dash Punctuation 229
 
1.4%
Close Punctuation 226
 
1.4%
Other values (4) 325
 
2.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
4.6%
28
 
3.4%
26
 
3.2%
25
 
3.0%
24
 
2.9%
18
 
2.2%
17
 
2.1%
17
 
2.1%
16
 
1.9%
15
 
1.8%
Other values (168) 600
72.8%
Lowercase Letter
ValueCountFrequency (%)
i 830
 
9.7%
a 765
 
8.9%
e 735
 
8.6%
t 674
 
7.9%
s 593
 
6.9%
l 556
 
6.5%
n 527
 
6.1%
r 503
 
5.9%
o 451
 
5.3%
d 420
 
4.9%
Other values (16) 2529
29.5%
Uppercase Letter
ValueCountFrequency (%)
L 42
15.8%
C 29
10.9%
M 26
9.8%
P 25
9.4%
I 25
9.4%
D 23
8.7%
O 18
6.8%
A 14
 
5.3%
R 12
 
4.5%
F 10
 
3.8%
Other values (12) 41
15.5%
Other Punctuation
ValueCountFrequency (%)
" 614
33.7%
/ 602
33.0%
. 287
15.8%
' 106
 
5.8%
; 89
 
4.9%
: 46
 
2.5%
! 30
 
1.6%
? 19
 
1.0%
# 16
 
0.9%
* 8
 
0.4%
Other values (2) 5
 
0.3%
Decimal Number
ValueCountFrequency (%)
2 84
31.9%
0 77
29.3%
3 47
17.9%
1 29
 
11.0%
5 10
 
3.8%
6 6
 
2.3%
9 3
 
1.1%
7 3
 
1.1%
8 3
 
1.1%
4 1
 
0.4%
Math Symbol
ValueCountFrequency (%)
> 619
39.1%
< 611
38.6%
= 339
21.4%
+ 8
 
0.5%
| 6
 
0.4%
~ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 182
80.5%
} 38
 
16.8%
] 6
 
2.7%
Open Punctuation
ValueCountFrequency (%)
( 179
82.1%
{ 33
 
15.1%
[ 6
 
2.8%
Control
ValueCountFrequency (%)
1814
100.0%
Space Separator
ValueCountFrequency (%)
723
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 229
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 63
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 43
100.0%
Other Symbol
ValueCountFrequency (%)
© 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 8848
53.1%
Common 6986
41.9%
Hangul 824
 
4.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
4.6%
28
 
3.4%
26
 
3.2%
25
 
3.0%
24
 
2.9%
18
 
2.2%
17
 
2.1%
17
 
2.1%
16
 
1.9%
15
 
1.8%
Other values (168) 600
72.8%
Latin
ValueCountFrequency (%)
i 830
 
9.4%
a 765
 
8.6%
e 735
 
8.3%
t 674
 
7.6%
s 593
 
6.7%
l 556
 
6.3%
n 527
 
6.0%
r 503
 
5.7%
o 451
 
5.1%
d 420
 
4.7%
Other values (38) 2794
31.6%
Common
ValueCountFrequency (%)
1814
26.0%
723
 
10.3%
> 619
 
8.9%
" 614
 
8.8%
< 611
 
8.7%
/ 602
 
8.6%
= 339
 
4.9%
. 287
 
4.1%
- 229
 
3.3%
) 182
 
2.6%
Other values (30) 966
13.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15833
95.0%
Hangul 824
 
4.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1814
 
11.5%
i 830
 
5.2%
a 765
 
4.8%
e 735
 
4.6%
723
 
4.6%
t 674
 
4.3%
> 619
 
3.9%
" 614
 
3.9%
< 611
 
3.9%
/ 602
 
3.8%
Other values (77) 7846
49.6%
Hangul
ValueCountFrequency (%)
38
 
4.6%
28
 
3.4%
26
 
3.2%
25
 
3.0%
24
 
2.9%
18
 
2.2%
17
 
2.1%
17
 
2.1%
16
 
1.9%
15
 
1.8%
Other values (168) 600
72.8%
None
ValueCountFrequency (%)
© 1
100.0%

Missing values

2023-12-11T12:17:35.032494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:17:35.095580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

<!DOCTYPE html>
0<html lang="ko">
1<head>
2<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
3<meta name="viewport" content="width=device-width" />
4<title>서비스 장애</title>
5<link href="/css/common.css?v=20220323" rel="stylesheet" />
6<link href="/css/style.css?v=20220323" rel="stylesheet" />
7<link href="/css/mobile.css?v=20220323" rel="stylesheet" media="(max-width:1280px)" />
8<!-- HTLM5shiv ie6~8 -->
9<!--[if lt IE 9]>
<!DOCTYPE html>
450}
451})();
452</script>
453<noscript><p><img src="//weblog.epis.or.kr/piwik/matomo.php?idsite=15&amp;rec=1" style="border:0;" alt="" /></p></noscript>
454<!-- End Matomo Code -->
455</footer>
456</div>
457<div id="popupAlertMessage"></div>
458</body>
459</html>

Duplicate rows

Most frequently occurring

<!DOCTYPE html># duplicates
26}9
11</div>7
25</div>6
32}6
33});6
3</ul></li>5
4<ul>5
7</ul></li>5
9<ul>5
12</div>5