Overview

Dataset statistics

Number of variables5
Number of observations52
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory44.5 B

Variable types

Numeric2
Text2
DateTime1

Dataset

Description해양환경공단에서 보유하고 있는 지식재산권 보유 자료에 대한 정보로 출원번호, 등록번호, 국문명칭 등에 대한 정보를 포함
URLhttps://www.data.go.kr/data/15044007/fileData.do

Alerts

연번 has unique valuesUnique
출원번호 has unique valuesUnique
등록번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 03:08:44.605298
Analysis finished2023-12-12 03:08:45.663510
Duration1.06 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.5
Minimum1
Maximum52
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size600.0 B
2023-12-12T12:08:45.778056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.55
Q113.75
median26.5
Q339.25
95-th percentile49.45
Maximum52
Range51
Interquartile range (IQR)25.5

Descriptive statistics

Standard deviation15.154757
Coefficient of variation (CV)0.57187763
Kurtosis-1.2
Mean26.5
Median Absolute Deviation (MAD)13
Skewness0
Sum1378
Variance229.66667
MonotonicityStrictly increasing
2023-12-12T12:08:45.967639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.9%
28 1
 
1.9%
30 1
 
1.9%
31 1
 
1.9%
32 1
 
1.9%
33 1
 
1.9%
34 1
 
1.9%
35 1
 
1.9%
36 1
 
1.9%
37 1
 
1.9%
Other values (42) 42
80.8%
ValueCountFrequency (%)
1 1
1.9%
2 1
1.9%
3 1
1.9%
4 1
1.9%
5 1
1.9%
6 1
1.9%
7 1
1.9%
8 1
1.9%
9 1
1.9%
10 1
1.9%
ValueCountFrequency (%)
52 1
1.9%
51 1
1.9%
50 1
1.9%
49 1
1.9%
48 1
1.9%
47 1
1.9%
46 1
1.9%
45 1
1.9%
44 1
1.9%
43 1
1.9%

출원번호
Text

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-12T12:08:46.205580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length15
Mean length14.884615
Min length14

Characters and Unicode

Total characters774
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)100.0%

Sample

1st row10-2005-0111552
2nd row10-2009-0018288
3rd row10-2009-0130383
4th row10-2011-0131546
5th row10-2014-0086556
ValueCountFrequency (%)
10-2005-0111552 1
 
1.9%
10-2009-0018288 1
 
1.9%
40-2018-0039874 1
 
1.9%
30-2005-0039234 1
 
1.9%
30-2016-0053716 1
 
1.9%
30-2017-0013442 1
 
1.9%
30-2017-0015272 1
 
1.9%
30-2018-0025169 1
 
1.9%
30-20200054223 1
 
1.9%
30-20210052970 1
 
1.9%
Other values (42) 42
80.8%
2023-12-12T12:08:46.534646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 214
27.6%
1 123
15.9%
- 98
12.7%
2 92
11.9%
8 48
 
6.2%
3 45
 
5.8%
4 33
 
4.3%
9 32
 
4.1%
7 32
 
4.1%
6 30
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 676
87.3%
Dash Punctuation 98
 
12.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 214
31.7%
1 123
18.2%
2 92
13.6%
8 48
 
7.1%
3 45
 
6.7%
4 33
 
4.9%
9 32
 
4.7%
7 32
 
4.7%
6 30
 
4.4%
5 27
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 98
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 774
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 214
27.6%
1 123
15.9%
- 98
12.7%
2 92
11.9%
8 48
 
6.2%
3 45
 
5.8%
4 33
 
4.3%
9 32
 
4.1%
7 32
 
4.1%
6 30
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 774
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 214
27.6%
1 123
15.9%
- 98
12.7%
2 92
11.9%
8 48
 
6.2%
3 45
 
5.8%
4 33
 
4.3%
9 32
 
4.1%
7 32
 
4.1%
6 30
 
3.9%

등록번호
Real number (ℝ)

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1388658.6
Minimum40430
Maximum2497024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size600.0 B
2023-12-12T12:08:46.677722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum40430
5-th percentile179378
Q1966547
median1467434
Q31981221.8
95-th percentile2401093.3
Maximum2497024
Range2456594
Interquartile range (IQR)1014674.8

Descriptive statistics

Standard deviation709249.65
Coefficient of variation (CV)0.51074444
Kurtosis-0.7530959
Mean1388658.6
Median Absolute Deviation (MAD)513789.5
Skewness-0.45477061
Sum72210246
Variance5.0303507 × 1011
MonotonicityNot monotonic
2023-12-12T12:08:46.869556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
711296 1
 
1.9%
2497024 1
 
1.9%
904393 1
 
1.9%
931050 1
 
1.9%
931495 1
 
1.9%
985737 1
 
1.9%
1092496 1
 
1.9%
1179686 1
 
1.9%
1179685 1
 
1.9%
1190754 1
 
1.9%
Other values (42) 42
80.8%
ValueCountFrequency (%)
40430 1
1.9%
60507 1
1.9%
159028 1
1.9%
196028 1
1.9%
196029 1
1.9%
196030 1
1.9%
196031 1
1.9%
211559 1
1.9%
422315 1
1.9%
711296 1
1.9%
ValueCountFrequency (%)
2497024 1
1.9%
2494774 1
1.9%
2429866 1
1.9%
2377552 1
1.9%
2187660 1
1.9%
2183203 1
1.9%
2128896 1
1.9%
2100637 1
1.9%
2077405 1
1.9%
2076735 1
1.9%
Distinct46
Distinct (%)88.5%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-12T12:08:47.189526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length75
Median length40
Mean length23.115385
Min length4

Characters and Unicode

Total characters1202
Distinct characters199
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)76.9%

Sample

1st row친유성 드럼디스크형 유회수기
2nd row자갈 세척기
3rd row인공해안이 설치된 조파수조
4th row오일 붐 인양장치
5th row오일펜스 전개판
ValueCountFrequency (%)
해양환경공단 10
 
4.4%
8
 
3.5%
koem 7
 
3.1%
6
 
2.6%
marine 5
 
2.2%
management 5
 
2.2%
environment 5
 
2.2%
korea 5
 
2.2%
이용한 4
 
1.8%
corporation 4
 
1.8%
Other values (124) 169
74.1%
2023-12-12T12:08:47.720334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
178
 
14.8%
n 35
 
2.9%
e 25
 
2.1%
a 25
 
2.1%
r 25
 
2.1%
o 25
 
2.1%
M 25
 
2.1%
23
 
1.9%
K 20
 
1.7%
( 20
 
1.7%
Other values (189) 801
66.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 611
50.8%
Lowercase Letter 190
 
15.8%
Space Separator 178
 
14.8%
Uppercase Letter 103
 
8.6%
Decimal Number 52
 
4.3%
Open Punctuation 20
 
1.7%
Close Punctuation 20
 
1.7%
Connector Punctuation 15
 
1.2%
Other Punctuation 12
 
1.0%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
3.8%
18
 
2.9%
17
 
2.8%
17
 
2.8%
16
 
2.6%
16
 
2.6%
15
 
2.5%
14
 
2.3%
14
 
2.3%
14
 
2.3%
Other values (154) 447
73.2%
Lowercase Letter
ValueCountFrequency (%)
n 35
18.4%
e 25
13.2%
a 25
13.2%
r 25
13.2%
o 25
13.2%
t 15
7.9%
i 15
7.9%
m 10
 
5.3%
p 5
 
2.6%
v 5
 
2.6%
Uppercase Letter
ValueCountFrequency (%)
M 25
24.3%
K 20
19.4%
E 20
19.4%
O 15
14.6%
P 6
 
5.8%
C 6
 
5.8%
N 3
 
2.9%
A 3
 
2.9%
S 2
 
1.9%
G 2
 
1.9%
Decimal Number
ValueCountFrequency (%)
4 18
34.6%
0 9
17.3%
2 8
15.4%
3 8
15.4%
9 7
 
13.5%
1 2
 
3.8%
Other Punctuation
ValueCountFrequency (%)
, 10
83.3%
/ 2
 
16.7%
Space Separator
ValueCountFrequency (%)
178
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 611
50.8%
Common 298
24.8%
Latin 293
24.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
3.8%
18
 
2.9%
17
 
2.8%
17
 
2.8%
16
 
2.6%
16
 
2.6%
15
 
2.5%
14
 
2.3%
14
 
2.3%
14
 
2.3%
Other values (154) 447
73.2%
Latin
ValueCountFrequency (%)
n 35
11.9%
e 25
 
8.5%
a 25
 
8.5%
r 25
 
8.5%
o 25
 
8.5%
M 25
 
8.5%
K 20
 
6.8%
E 20
 
6.8%
t 15
 
5.1%
O 15
 
5.1%
Other values (12) 63
21.5%
Common
ValueCountFrequency (%)
178
59.7%
( 20
 
6.7%
) 20
 
6.7%
4 18
 
6.0%
_ 15
 
5.0%
, 10
 
3.4%
0 9
 
3.0%
2 8
 
2.7%
3 8
 
2.7%
9 7
 
2.3%
Other values (3) 5
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 611
50.8%
ASCII 591
49.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
178
30.1%
n 35
 
5.9%
e 25
 
4.2%
a 25
 
4.2%
r 25
 
4.2%
o 25
 
4.2%
M 25
 
4.2%
K 20
 
3.4%
( 20
 
3.4%
E 20
 
3.4%
Other values (25) 193
32.7%
Hangul
ValueCountFrequency (%)
23
 
3.8%
18
 
2.9%
17
 
2.8%
17
 
2.8%
16
 
2.6%
16
 
2.6%
15
 
2.5%
14
 
2.3%
14
 
2.3%
14
 
2.3%
Other values (154) 447
73.2%
Distinct36
Distinct (%)69.2%
Missing0
Missing (%)0.0%
Memory size548.0 B
Minimum2021-08-08 00:00:00
Maximum2042-02-18 00:00:00
2023-12-12T12:08:47.921959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:08:48.121164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)

Interactions

2023-12-12T12:08:45.248640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:08:45.015413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:08:45.355006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:08:45.143585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:08:48.242279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번출원번호등록번호국문명칭존속만료일
연번1.0001.0000.9190.7960.976
출원번호1.0001.0001.0001.0001.000
등록번호0.9191.0001.0000.9940.989
국문명칭0.7961.0000.9941.0000.989
존속만료일0.9761.0000.9890.9891.000
2023-12-12T12:08:48.362290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번등록번호
연번1.000-0.331
등록번호-0.3311.000

Missing values

2023-12-12T12:08:45.481445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:08:45.610590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번출원번호등록번호국문명칭존속만료일
0110-2005-0111552711296친유성 드럼디스크형 유회수기2025-11-22
1210-2009-00182881033520자갈 세척기2029-03-04
2310-2009-0130383978231인공해안이 설치된 조파수조2029-12-24
3410-2011-01315461362001오일 붐 인양장치2031-12-09
4510-2014-00865561605620오일펜스 전개판2034-07-10
5610-2015-00838291699687파워팩용 포터블 냉각장치2035-06-15
6710-2016-01498861865403스마트 스키머2036-11-10
7810-2016-01554831754703금아말감-원자흡수분광법을 이용한 해양시료 내의 메틸수은 분석법2036-11-22
8910-2016-01627751981220선박평형수에 존재하는 병원성 대장균 검출을 위한 PNA 프로브 및 이의 용도2036-12-01
91010-2016-01627761981227선박평형수에 존재하는 비브리오 콜레라균 검출을 위한 PNA 프로브 및 이의 용도2036-12-01
연번출원번호등록번호국문명칭존속만료일
424340-2018-00398791466334해양환경공단 ( KOEM Korea Marine Environment Management Corporation )_42류2029-04-05
434440-2018-00398841466336해양환경공단 (KOEM)_39류2029-04-05
444540-2018-00398851468534해양환경공단 (KOEM)_40류2029-04-01
454640-2018-00398861466337해양환경공단 (KOEM)_42류2029-04-01
464740-2019-00133571579232해양환경공단 (KOEM)_41류(13)2029-04-01
474841-2008-0031160196028해양환경관리공단 ( KOEM )_39,40,42류2029-04-01
484941-2008-0031161196029KOEM Korea Marine Environment Management Corporation_39,40,42류2030-04-13
495041-2008-0031162196030해양환경관리공단 ( KOEM )_39,40,42류2030-04-13
505141-2008-0031163196031해양환경관리공단 ( KOEM Korea Marine Environment Management Corporation )_39,40,42류2030-04-13
515241-2010-0002389211559KOEM_39,40,42류2031-06-09