Overview

Dataset statistics

Number of variables3
Number of observations47
Missing cells0
Missing cells (%)0.0%
Duplicate rows3
Duplicate rows (%)6.4%
Total size in memory1.2 KiB
Average record size in memory26.8 B

Variable types

Categorical1
DateTime1
Text1

Dataset

Description부산교통공사_지식재산권관리정보_20200526
Author부산교통공사
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3057237

Alerts

Dataset has 3 (6.4%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-10 16:16:03.778353
Analysis finished2023-12-10 16:16:04.508603
Duration0.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct3
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Memory size508.0 B
특허
30 
디자인
16 
출원 중
 
1

Length

Max length4
Median length2
Mean length2.3829787
Min length2

Unique

Unique1 ?
Unique (%)2.1%

Sample

1st row특허
2nd row특허
3rd row특허
4th row특허
5th row특허

Common Values

ValueCountFrequency (%)
특허 30
63.8%
디자인 16
34.0%
출원 중 1
 
2.1%

Length

2023-12-11T01:16:04.655446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:16:04.816744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
특허 30
62.5%
디자인 16
33.3%
출원 1
 
2.1%
1
 
2.1%
Distinct39
Distinct (%)83.0%
Missing0
Missing (%)0.0%
Memory size508.0 B
Minimum2007-10-12 00:00:00
Maximum2019-11-19 00:00:00
2023-12-11T01:16:04.985994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:16:05.221042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
Distinct42
Distinct (%)89.4%
Missing0
Missing (%)0.0%
Memory size508.0 B
2023-12-11T01:16:05.586712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length23
Mean length18.978723
Min length3

Characters and Unicode

Total characters892
Distinct characters197
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)80.9%

Sample

1st row전동차 역사의 피에스디 제어장치
2nd row고압비접지전력계통보호설비의결합시험장치
3rd row화재감지시스템
4th row열차운행정보를 이용한 화상무선 설비 핸드오프
5th row열차의 ATO/ATC 차량시스템
ValueCountFrequency (%)
강체 12
 
6.0%
전차선용 12
 
6.0%
6
 
3.0%
애자 5
 
2.5%
시스템 4
 
2.0%
엘리베이터 4
 
2.0%
전동차 4
 
2.0%
익스펜션조인트 4
 
2.0%
이용한 3
 
1.5%
디바이스 3
 
1.5%
Other values (129) 142
71.4%
2023-12-11T01:16:06.165560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
152
 
17.0%
33
 
3.7%
31
 
3.5%
29
 
3.3%
21
 
2.4%
19
 
2.1%
18
 
2.0%
18
 
2.0%
17
 
1.9%
16
 
1.8%
Other values (187) 538
60.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 726
81.4%
Space Separator 152
 
17.0%
Uppercase Letter 10
 
1.1%
Dash Punctuation 2
 
0.2%
Decimal Number 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
4.5%
31
 
4.3%
29
 
4.0%
21
 
2.9%
19
 
2.6%
18
 
2.5%
18
 
2.5%
17
 
2.3%
16
 
2.2%
16
 
2.2%
Other values (176) 508
70.0%
Uppercase Letter
ValueCountFrequency (%)
T 3
30.0%
A 2
20.0%
R 1
 
10.0%
E 1
 
10.0%
L 1
 
10.0%
C 1
 
10.0%
O 1
 
10.0%
Space Separator
ValueCountFrequency (%)
152
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 726
81.4%
Common 156
 
17.5%
Latin 10
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
4.5%
31
 
4.3%
29
 
4.0%
21
 
2.9%
19
 
2.6%
18
 
2.5%
18
 
2.5%
17
 
2.3%
16
 
2.2%
16
 
2.2%
Other values (176) 508
70.0%
Latin
ValueCountFrequency (%)
T 3
30.0%
A 2
20.0%
R 1
 
10.0%
E 1
 
10.0%
L 1
 
10.0%
C 1
 
10.0%
O 1
 
10.0%
Common
ValueCountFrequency (%)
152
97.4%
- 2
 
1.3%
3 1
 
0.6%
/ 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 726
81.4%
ASCII 166
 
18.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
152
91.6%
T 3
 
1.8%
- 2
 
1.2%
A 2
 
1.2%
R 1
 
0.6%
E 1
 
0.6%
L 1
 
0.6%
3 1
 
0.6%
C 1
 
0.6%
/ 1
 
0.6%
Hangul
ValueCountFrequency (%)
33
 
4.5%
31
 
4.3%
29
 
4.0%
21
 
2.9%
19
 
2.6%
18
 
2.5%
18
 
2.5%
17
 
2.3%
16
 
2.2%
16
 
2.2%
Other values (176) 508
70.0%

Correlations

2023-12-11T01:16:06.325633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분등록일자발 명 명 칭
구분1.0001.0001.000
등록일자1.0001.0000.995
발 명 명 칭1.0000.9951.000

Missing values

2023-12-11T01:16:04.339086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:16:04.461949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분등록일자발 명 명 칭
0특허2008-06-13전동차 역사의 피에스디 제어장치
1특허2009-09-01고압비접지전력계통보호설비의결합시험장치
2특허2010-05-13화재감지시스템
3특허2010-09-20열차운행정보를 이용한 화상무선 설비 핸드오프
4특허2011-08-18열차의 ATO/ATC 차량시스템
5특허2012-01-09고무차륜 에이지티 경량전철용 전차선로
6특허2012-04-18철도차량용 차축 베어링의 그리스 주입장치
7특허2012-09-06경량전철 제3궤조 전차선 연결작업용 공기구
8특허2013-03-08철도차량 연결기 작업용 받침대
9특허2013-06-17철도차량용 공기건조기의 건조통 분해조립장치
구분등록일자발 명 명 칭
37디자인2011-10-14강체 전차선용 익스펜션조인트
38디자인2011-03-11강체 전차선용 익스펜션조인트
39디자인2011-03-14강체 전차선용 앤드 어프로치 애자
40디자인2011-03-11강체 전차선용 익스펜션조인트
41디자인2011-07-01강체 전차선용 잉카링 디바이스 애자
42디자인2011-07-01강체 전차선용 익스펜션조인트 애자
43디자인2011-03-11강체 전차선용 앙카링 디바이스
44디자인2011-03-14강체 전차선용 표준형 애자
45디자인2011-03-11강체 전차선용 앙카링 디바이스
46디자인2011-03-14강체 전차선용 표준형 애자

Duplicate rows

Most frequently occurring

구분등록일자발 명 명 칭# duplicates
0디자인2011-03-11강체 전차선용 앙카링 디바이스2
1디자인2011-03-11강체 전차선용 익스펜션조인트2
2디자인2011-03-14강체 전차선용 표준형 애자2