Overview

Dataset statistics

Number of variables3
Number of observations47
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)2.1%
Total size in memory1.2 KiB
Average record size in memory26.8 B

Variable types

Categorical1
DateTime1
Text1

Dataset

Description부산교통공사_지식재산권관리정보_20210526
Author부산교통공사
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3057237

Alerts

Dataset has 1 (2.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-10 16:16:00.039981
Analysis finished2023-12-10 16:16:00.565607
Duration0.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct3
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Memory size508.0 B
특허
30 
디자인
14 
출원중
 
3

Length

Max length3
Median length2
Mean length2.3617021
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row특허
2nd row특허
3rd row특허
4th row특허
5th row특허

Common Values

ValueCountFrequency (%)
특허 30
63.8%
디자인 14
29.8%
출원중 3
 
6.4%

Length

2023-12-11T01:16:00.677034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:16:00.874934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
특허 30
63.8%
디자인 14
29.8%
출원중 3
 
6.4%
Distinct40
Distinct (%)85.1%
Missing0
Missing (%)0.0%
Memory size508.0 B
Minimum2007-10-12 00:00:00
Maximum2020-12-22 00:00:00
2023-12-11T01:16:01.112763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:16:01.360426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
Distinct44
Distinct (%)93.6%
Missing0
Missing (%)0.0%
Memory size508.0 B
2023-12-11T01:16:01.706316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length23
Mean length18.851064
Min length3

Characters and Unicode

Total characters886
Distinct characters202
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)89.4%

Sample

1st row전동차 역사의 피에스디 제어장치
2nd row고압비접지전력계통보호설비의결합시험장치
3rd row화재감지시스템
4th row열차운행정보를 이용한 화상무선 설비 핸드오프
5th row열차의 ATO/ATC 차량시스템
ValueCountFrequency (%)
강체 10
 
5.1%
전차선용 10
 
5.1%
6
 
3.1%
익스펜션조인트 4
 
2.1%
시스템 4
 
2.1%
엘리베이터 4
 
2.1%
애자 4
 
2.1%
전동차 4
 
2.1%
이용한 3
 
1.5%
철도차량용 3
 
1.5%
Other values (132) 143
73.3%
2023-12-11T01:16:02.433074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
148
 
16.7%
31
 
3.5%
29
 
3.3%
29
 
3.3%
21
 
2.4%
18
 
2.0%
18
 
2.0%
18
 
2.0%
17
 
1.9%
17
 
1.9%
Other values (192) 540
60.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 718
81.0%
Space Separator 148
 
16.7%
Uppercase Letter 10
 
1.1%
Decimal Number 3
 
0.3%
Open Punctuation 2
 
0.2%
Dash Punctuation 2
 
0.2%
Close Punctuation 2
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
4.3%
29
 
4.0%
29
 
4.0%
21
 
2.9%
18
 
2.5%
18
 
2.5%
18
 
2.5%
17
 
2.4%
17
 
2.4%
16
 
2.2%
Other values (177) 504
70.2%
Uppercase Letter
ValueCountFrequency (%)
T 3
30.0%
A 2
20.0%
R 1
 
10.0%
E 1
 
10.0%
L 1
 
10.0%
C 1
 
10.0%
O 1
 
10.0%
Decimal Number
ValueCountFrequency (%)
1 1
33.3%
3 1
33.3%
2 1
33.3%
Space Separator
ValueCountFrequency (%)
148
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 718
81.0%
Common 158
 
17.8%
Latin 10
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
4.3%
29
 
4.0%
29
 
4.0%
21
 
2.9%
18
 
2.5%
18
 
2.5%
18
 
2.5%
17
 
2.4%
17
 
2.4%
16
 
2.2%
Other values (177) 504
70.2%
Common
ValueCountFrequency (%)
148
93.7%
( 2
 
1.3%
- 2
 
1.3%
) 2
 
1.3%
1 1
 
0.6%
3 1
 
0.6%
/ 1
 
0.6%
2 1
 
0.6%
Latin
ValueCountFrequency (%)
T 3
30.0%
A 2
20.0%
R 1
 
10.0%
E 1
 
10.0%
L 1
 
10.0%
C 1
 
10.0%
O 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 718
81.0%
ASCII 168
 
19.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
148
88.1%
T 3
 
1.8%
( 2
 
1.2%
- 2
 
1.2%
A 2
 
1.2%
) 2
 
1.2%
R 1
 
0.6%
E 1
 
0.6%
L 1
 
0.6%
1 1
 
0.6%
Other values (5) 5
 
3.0%
Hangul
ValueCountFrequency (%)
31
 
4.3%
29
 
4.0%
29
 
4.0%
21
 
2.9%
18
 
2.5%
18
 
2.5%
18
 
2.5%
17
 
2.4%
17
 
2.4%
16
 
2.2%
Other values (177) 504
70.2%

Correlations

2023-12-11T01:16:02.604408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분등록일자발 명 명 칭
구분1.0001.0001.000
등록일자1.0001.0000.991
발 명 명 칭1.0000.9911.000

Missing values

2023-12-11T01:16:00.411359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:16:00.522105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분등록일자발 명 명 칭
0특허2008-06-13전동차 역사의 피에스디 제어장치
1특허2009-09-01고압비접지전력계통보호설비의결합시험장치
2특허2010-05-13화재감지시스템
3특허2010-09-20열차운행정보를 이용한 화상무선 설비 핸드오프
4특허2011-08-18열차의 ATO/ATC 차량시스템
5특허2012-01-09고무차륜 에이지티 경량전철용 전차선로
6특허2012-04-18철도차량용 차축 베어링의 그리스 주입장치
7특허2012-09-06경량전철 제3궤조 전차선 연결작업용 공기구
8특허2013-03-08철도차량 연결기 작업용 받침대
9특허2013-06-17철도차량용 공기건조기의 건조통 분해조립장치
구분등록일자발 명 명 칭
37디자인2011-10-14강체 전차선용 익스펜션조인트
38디자인2011-03-11강체 전차선용 익스펜션조인트
39디자인2011-03-14강체 전차선용 앤드 어프로치 애자
40디자인2011-03-11강체 전차선용 익스펜션조인트
41디자인2011-07-01강체 전차선용 잉카링 디바이스 애자
42디자인2011-07-01강체 전차선용 익스펜션조인트 애자
43디자인2011-03-11강체 전차선용 앙카링 디바이스
44디자인2011-03-14강체 전차선용 표준형 애자
45출원중2020-12-22도시철도 벤치(1인용)
46출원중2020-12-22도시철도 벤치(2인용)

Duplicate rows

Most frequently occurring

구분등록일자발 명 명 칭# duplicates
0디자인2011-03-11강체 전차선용 익스펜션조인트2