Overview

Dataset statistics

Number of variables: 3
Number of observations: 48
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 2.1%
Total size in memory: 1.3 KiB
Average record size in memory: 26.8 B

Variable types

Categorical: 1
DateTime: 1
Text: 1

Dataset

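Description: Data on the application, registration, and management status of intellectual property rights held by Busan Transportation Corporation (부산교통공사). It provides the type of intellectual property right, the registration date, and the invention title.
URL: https://www.data.go.kr/data/3057237/fileData.do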

Alerts

Dataset has 1 (2.1%) duplicate rows (Duplicates)

Reproduction

Analysis started: 2023-12-12 15:58:15.936762
Analysis finished: 2023-12-12 15:58:16.401361
Duration: 0.46 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
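
A report of this kind can be regenerated from the source file. The following is a minimal sketch, assuming the CSV has been downloaded from the dataset URL to a local path. `ProfileReport` and `to_file` come from ydata-profiling; the function name `build_report`, the paths, the report title, and the `cp949` encoding are illustrative assumptions and should be checked against the actual file.

```python
def build_report(csv_path, html_path="report.html"):
    """Profile a CSV with ydata-profiling and write an HTML report."""
    # Imported inside the function so the sketch can be loaded
    # even where pandas / ydata-profiling are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumptions: the file is CP949-encoded and has a 등록일자 column
    # that should be parsed as a date.
    df = pd.read_csv(csv_path, encoding="cp949", parse_dates=["등록일자"])

    report = ProfileReport(df, title="Busan Transportation Corporation IP rights")
    report.to_file(html_path)
    return html_path
```

Calling `build_report("ip_rights.csv")` with a hypothetical file name would write `report.html` next to the script.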

Variables

구분 (Category)
Categorical

Distinct: 3
Distinct (%): 6.2%
Missing: 0
Missing (%): 0.0%
Memory size: 516.0 B

Value | Count
특허 (patent) | 32
디자인 (design) | 15
특허출원 (patent application) | 1

Length

Max length: 4
Median length: 2
Mean length: 2.3541667
Min length: 2
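
The length figures can be checked from the value counts alone. This is a small sketch using only the standard library; the counts are copied from the report, and the variable names are illustrative.

```python
from statistics import mean, median

# Value counts for 구분, copied from the report.
value_counts = {"특허": 32, "디자인": 15, "특허출원": 1}

# One string length per observation (48 in total).
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 7),
    "min": min(lengths),
}
print(length_stats)
```

The mean works out to 113 / 48, which rounds to the 2.3541667 shown above.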

Unique

Unique: 1
Unique (%): 2.1%
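
"Distinct" and "Unique" measure different things: distinct counts the different values present, while unique counts the values that occur exactly once. A short sketch of that distinction, using the value counts reported for 구분 (variable names are illustrative):

```python
# Value counts for 구분, copied from the report.
value_counts = {"특허": 32, "디자인": 15, "특허출원": 1}

n_rows = sum(value_counts.values())

# Distinct: how many different values appear at all.
n_distinct = len(value_counts)

# Unique: how many values appear exactly once.
n_unique = sum(1 for count in value_counts.values() if count == 1)

distinct_pct = 100 * n_distinct / n_rows
unique_pct = 100 * n_unique / n_rows
print(n_distinct, n_unique, distinct_pct, unique_pct)
```

Only 특허출원 occurs once, which is why Unique is 1 while Distinct is 3.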

Sample

1st row: 특허 (patent)
2nd row: 특허 (patent)
3rd row: 특허 (patent)
4th row: 특허 (patent)
5th row: 특허 (patent)

Common Values

Value | Count | Frequency (%)
특허 (patent) | 32 | 66.7%
디자인 (design) | 15 | 31.2%
특허출원 (patent application) | 1 | 2.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 구분]

등록일자 (Registration date)
DateTime

Distinct: 41
Distinct (%): 85.4%
Missing: 0
Missing (%): 0.0%
Memory size: 516.0 B
Minimum: 2008-06-13 00:00:00
Maximum: 2021-12-30 00:00:00

[Figure: Histogram with fixed size bins (bins=41)]

발명명칭 (Invention title)
Text

Distinct: 45
Distinct (%): 93.8%
Missing: 0
Missing (%): 0.0%
Memory size: 516.0 B

Length

Max length: 45
Median length: 27
Mean length: 20.020833
Min length: 3

Characters and Unicode

Total characters: 961
Distinct characters: 212
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
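
As an illustration of those character properties, the sketch below counts Unicode general categories for one sample title using Python's standard `unicodedata` module. The standard library exposes the general category but not the script or block, so only the category is shown. The two-letter codes map to the names used in this report: `Lo` is Other Letter, `Lu` is Uppercase Letter, `Zs` is Space Separator, and `Po` is Other Punctuation.

```python
import unicodedata
from collections import Counter

# One of the sample values of 발명명칭 in this report.
title = "열차의 ATO/ATC 차량시스템"

# Map every character to its general category and tally the results.
category_counts = Counter(unicodedata.category(ch) for ch in title)
print(category_counts)
```

Hangul syllables fall under Other Letter because they have no upper or lower case.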

Unique

Unique: 43
Unique (%): 89.6%

Sample

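1st row: 전동차 역사의 피에스디 제어장치 (PSD control device for electric train stations)
2nd row: 고압비접지전력계통보호설비의결합시험장치 (combined test device for protection equipment of high-voltage ungrounded power systems)
3rd row: 화재감지시스템 (fire detection system)
4th row: 열차의 ATO/ATC 차량시스템 (train ATO/ATC vehicle system)
5th row: 고무차륜 에이지티 경량전철용 전차선로 (contact line for rubber-tired AGT light rail)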
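
Value | Count | Frequency (%)
강체 (rigid) | 10 | 4.6%
전차선용 (for contact line) | 10 | 4.6%
[not preserved] | 8 | 3.7%
시스템 (system) | 5 | 2.3%
전동차 (electric train) | 4 | 1.8%
애자 (insulator) | 4 | 1.8%
방법 (method) | 4 | 1.8%
익스펜션조인트 (expansion joint) | 4 | 1.8%
철도차량용 (for railway vehicles) | 3 | 1.4%
엘리베이터 (elevator) | 3 | 1.4%
Other values (148) | 163 | 74.8%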

Most occurring characters

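Value | Count | Frequency (%)
(space) | 171 | 17.8%
[not preserved] | 30 | 3.1%
[not preserved] | 29 | 3.0%
[not preserved] | 28 | 2.9%
[not preserved] | 25 | 2.6%
[not preserved] | 21 | 2.2%
[not preserved] | 20 | 2.1%
[not preserved] | 19 | 2.0%
[not preserved] | 18 | 1.9%
[not preserved] | 16 | 1.7%
Other values (202) | 584 | 60.8%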

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 764 | 79.5%
Space Separator | 171 | 17.8%
Uppercase Letter | 15 | 1.6%
Dash Punctuation | 3 | 0.3%
Decimal Number | 3 | 0.3%
Open Punctuation | 2 | 0.2%
Close Punctuation | 2 | 0.2%
Other Punctuation | 1 | 0.1%

Most frequent character per category

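Other Letter
Value | Count | Frequency (%)
[not preserved] | 30 | 3.9%
[not preserved] | 29 | 3.8%
[not preserved] | 28 | 3.7%
[not preserved] | 25 | 3.3%
[not preserved] | 21 | 2.7%
[not preserved] | 20 | 2.6%
[not preserved] | 19 | 2.5%
[not preserved] | 18 | 2.4%
[not preserved] | 16 | 2.1%
[not preserved] | 14 | 1.8%
Other values (186) | 544 | 71.2%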

Uppercase Letter
Value | Count | Frequency (%)
T | 4 | 26.7%
L | 2 | 13.3%
A | 2 | 13.3%
E | 2 | 13.3%
R | 2 | 13.3%
W | 1 | 6.7%
C | 1 | 6.7%
O | 1 | 6.7%

Decimal Number
Value | Count | Frequency (%)
1 | 1 | 33.3%
3 | 1 | 33.3%
2 | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 171 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 3 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 2 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 2 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
/ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 764 | 79.5%
Common | 182 | 18.9%
Latin | 15 | 1.6%

Most frequent character per script

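Hangul
Value | Count | Frequency (%)
[not preserved] | 30 | 3.9%
[not preserved] | 29 | 3.8%
[not preserved] | 28 | 3.7%
[not preserved] | 25 | 3.3%
[not preserved] | 21 | 2.7%
[not preserved] | 20 | 2.6%
[not preserved] | 19 | 2.5%
[not preserved] | 18 | 2.4%
[not preserved] | 16 | 2.1%
[not preserved] | 14 | 1.8%
Other values (186) | 544 | 71.2%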

Common
Value | Count | Frequency (%)
(space) | 171 | 94.0%
- | 3 | 1.6%
( | 2 | 1.1%
) | 2 | 1.1%
1 | 1 | 0.5%
3 | 1 | 0.5%
/ | 1 | 0.5%
2 | 1 | 0.5%

Latin
Value | Count | Frequency (%)
T | 4 | 26.7%
L | 2 | 13.3%
A | 2 | 13.3%
E | 2 | 13.3%
R | 2 | 13.3%
W | 1 | 6.7%
C | 1 | 6.7%
O | 1 | 6.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 764 | 79.5%
ASCII | 197 | 20.5%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 171 | 86.8%
T | 4 | 2.0%
- | 3 | 1.5%
L | 2 | 1.0%
A | 2 | 1.0%
( | 2 | 1.0%
) | 2 | 1.0%
E | 2 | 1.0%
R | 2 | 1.0%
W | 1 | 0.5%
Other values (6) | 6 | 3.0%
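
Hangul
Value | Count | Frequency (%)
[not preserved] | 30 | 3.9%
[not preserved] | 29 | 3.8%
[not preserved] | 28 | 3.7%
[not preserved] | 25 | 3.3%
[not preserved] | 21 | 2.7%
[not preserved] | 20 | 2.6%
[not preserved] | 19 | 2.5%
[not preserved] | 18 | 2.4%
[not preserved] | 16 | 2.1%
[not preserved] | 14 | 1.8%
Other values (186) | 544 | 71.2%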

Correlations

[Figure: correlation plot]

 | 구분 (Category) | 등록일자 (Registration date) | 발명명칭 (Invention title)
구분 | 1.000 | 1.000 | 1.000
등록일자 | 1.000 | 1.000 | 0.991
발명명칭 | 1.000 | 0.991 | 1.000
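
The report does not say which association measure produced these values. One measure often used for pairs of non-numeric columns is Cramér's V. The sketch below is the plain, uncorrected textbook version, offered only as an illustration and not as the computation behind the matrix above; the function name `cramers_v` is hypothetical. It also suggests why near-unique columns tend to score close to 1: when almost every value occurs only once, one column nearly determines the other.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed cell counts
    row = Counter(xs)              # marginal counts for xs
    col = Counter(ys)              # marginal counts for ys

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row.items():
        for y, col_total in col.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row), len(col)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return sqrt(chi2 / (n * k))


# An all-distinct column paired with any other column scores 1.
ids = [1, 2, 3, 4]
labels = ["a", "a", "b", "b"]
print(cramers_v(ids, labels))
```

With 41 and 45 distinct values among 48 rows, 등록일자 and 발명명칭 are close to that all-distinct case, so values near 1 should be read with caution.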

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly pick out visual patterns in data completion.]

Sample

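Index | 구분 (Category) | 등록일자 (Registration date) | 발명명칭 (Invention title)
0 | 특허 (patent) | 2008-06-13 | 전동차 역사의 피에스디 제어장치 (PSD control device for electric train stations)
1 | 특허 (patent) | 2009-09-01 | 고압비접지전력계통보호설비의결합시험장치 (combined test device for protection equipment of high-voltage ungrounded power systems)
2 | 특허 (patent) | 2010-05-13 | 화재감지시스템 (fire detection system)
3 | 특허 (patent) | 2011-08-18 | 열차의 ATO/ATC 차량시스템 (train ATO/ATC vehicle system)
4 | 특허 (patent) | 2012-01-09 | 고무차륜 에이지티 경량전철용 전차선로 (contact line for rubber-tired AGT light rail)
5 | 특허 (patent) | 2012-04-18 | 철도차량용 차축 베어링의 그리스 주입장치 (grease injection device for axle bearings of railway vehicles)
6 | 특허 (patent) | 2012-09-06 | 경량전철 제3궤조 전차선 연결작업용 공기구 (tool for third-rail contact line connection work on light rail)
7 | 특허 (patent) | 2013-03-08 | 철도차량 연결기 작업용 받침대 (support stand for railway vehicle coupler work)
8 | 특허 (patent) | 2013-06-17 | 철도차량용 공기건조기의 건조통 분해조립장치 (disassembly and assembly device for the drying canister of a railway vehicle air dryer)
9 | 특허 (patent) | 2013-06-25 | 철도레일 신축이음매의 변위감지와 살수장치 및 그방법 (displacement detection and water-spraying device for rail expansion joints, and method thereof)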
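
Index | 구분 (Category) | 등록일자 (Registration date) | 발명명칭 (Invention title)
38 | 디자인 (design) | 2011-10-14 | 강체 전차선용 익스펜션조인트 (expansion joint for rigid contact line)
39 | 디자인 (design) | 2011-03-11 | 강체 전차선용 익스펜션조인트 (expansion joint for rigid contact line)
40 | 디자인 (design) | 2011-03-14 | 강체 전차선용 앤드 어프로치 애자 (end approach insulator for rigid contact line)
41 | 디자인 (design) | 2011-03-11 | 강체 전차선용 익스펜션조인트 (expansion joint for rigid contact line)
42 | 디자인 (design) | 2011-07-01 | 강체 전차선용 잉카링 디바이스 애자 (anchoring device insulator for rigid contact line)
43 | 디자인 (design) | 2011-07-01 | 강체 전차선용 익스펜션조인트 애자 (expansion joint insulator for rigid contact line)
44 | 디자인 (design) | 2011-03-11 | 강체 전차선용 앙카링 디바이스 (anchoring device for rigid contact line)
45 | 디자인 (design) | 2011-03-14 | 강체 전차선용 표준형 애자 (standard-type insulator for rigid contact line)
46 | 디자인 (design) | 2021-08-27 | 도시철도 벤치(1인용) (urban railway bench, one-person)
47 | 디자인 (design) | 2021-08-27 | 도시철도 벤치(2인용) (urban railway bench, two-person)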

Duplicate rows

Most frequently occurring

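Index | 구분 (Category) | 등록일자 (Registration date) | 발명명칭 (Invention title) | # duplicates
0 | 디자인 (design) | 2011-03-11 | 강체 전차선용 익스펜션조인트 (expansion joint for rigid contact line) | 2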
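
Duplicate detection of this kind amounts to counting identical rows. A minimal standard-library sketch over the ten rows with index 38 to 47 shown in the Sample section, treating each row as a tuple (variable names are illustrative):

```python
from collections import Counter

# Rows 38-47 from the Sample section: (구분, 등록일자, 발명명칭).
rows = [
    ("디자인", "2011-10-14", "강체 전차선용 익스펜션조인트"),
    ("디자인", "2011-03-11", "강체 전차선용 익스펜션조인트"),
    ("디자인", "2011-03-14", "강체 전차선용 앤드 어프로치 애자"),
    ("디자인", "2011-03-11", "강체 전차선용 익스펜션조인트"),
    ("디자인", "2011-07-01", "강체 전차선용 잉카링 디바이스 애자"),
    ("디자인", "2011-07-01", "강체 전차선용 익스펜션조인트 애자"),
    ("디자인", "2011-03-11", "강체 전차선용 앙카링 디바이스"),
    ("디자인", "2011-03-14", "강체 전차선용 표준형 애자"),
    ("디자인", "2021-08-27", "도시철도 벤치(1인용)"),
    ("디자인", "2021-08-27", "도시철도 벤치(2인용)"),
]

# A row is a duplicate when the full tuple occurs more than once.
row_counts = Counter(rows)
duplicates = {row: count for row, count in row_counts.items() if count > 1}

for row, count in duplicates.items():
    print(row, count)
```

Rows 39 and 41 match on all three columns, while row 38 has the same title but a different date, so it does not count as a duplicate.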