Overview

Dataset statistics

Number of variables: 4
Number of observations: 513
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 17.2 KiB
Average record size in memory: 34.3 B
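
The average record size can be derived from the two figures above as total size divided by number of observations. A minimal check, assuming 1 KiB = 1024 bytes (the variable names are illustrative and not part of the report):

```python
# Figures copied from the dataset statistics above.
total_size_kib = 17.2
n_observations = 513

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```

The result rounds to the 34.3 B shown in the report.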

Variable types

Numeric: 2
Text: 1
DateTime: 1

Dataset

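Description: Patent holdings of KEPCO KDN (한전KDN) as of April 3, 2023. The dataset provides each patent's invention title (발명명칭), registration number (등록번호), and registration date (등록일).
URL: https://www.data.go.kr/data/3070856/fileData.do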

Alerts

연번 is highly overall correlated with 등록번호 (High correlation)
등록번호 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
등록번호 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 16:10:19.810419
Analysis finished: 2023-12-12 16:10:20.580095
Duration: 0.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 513
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 257
Minimum: 1
Maximum: 513
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 26.6
Q1: 129
Median: 257
Q3: 385
95-th percentile: 487.4
Maximum: 513
Range: 512
Interquartile range (IQR): 256
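
The quantiles above are consistent with linear interpolation over the sequence 1..513. A minimal sketch using only the Python standard library, assuming 연번 is exactly that sequence (which the minimum, maximum, distinct count and monotonicity suggest, but the report does not list every value):

```python
import statistics

# Assumption: 연번 is the sequence 1..513.
values = list(range(1, 514))

# method="inclusive" interpolates linearly between the observed min and max.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
ventiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

value_range = max(values) - min(values)
iqr = q3 - q1
print(p5, q1, median, q3, p95, value_range, iqr)
```

Under that assumption the 5-th percentile is 1 + 0.05 × 512 = 26.6 and the 95-th percentile is 1 + 0.95 × 512 = 487.4, matching the reported values.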

Descriptive statistics

Standard deviation: 148.23461
Coefficient of variation (CV): 0.57678837
Kurtosis: -1.2
Mean: 257
Median Absolute Deviation (MAD): 128
Skewness: 0
Sum: 131841
Variance: 21973.5
Monotonicity: Strictly increasing
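
Most of the descriptive statistics above can be reproduced with the standard library. This is a sketch under the assumption that 연번 is the sequence 1..513; skewness and kurtosis are left out because the exact estimator used by the report is not stated here.

```python
import statistics

# Assumption: 연번 is the sequence 1..513.
values = list(range(1, 514))

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)
total = sum(values)
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mean, variance, round(std, 5), round(cv, 8), mad, total, strictly_increasing)
```

The variance of 21973.5 equals 513 × 514 / 12, the sample variance of the integers 1..513, which suggests the report uses the n - 1 denominator.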
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | 0.2%
386 | 1 | 0.2%
352 | 1 | 0.2%
351 | 1 | 0.2%
350 | 1 | 0.2%
349 | 1 | 0.2%
348 | 1 | 0.2%
347 | 1 | 0.2%
346 | 1 | 0.2%
345 | 1 | 0.2%
Other values (503) | 503 | 98.1%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
513 | 1 | 0.2%
512 | 1 | 0.2%
511 | 1 | 0.2%
510 | 1 | 0.2%
509 | 1 | 0.2%
508 | 1 | 0.2%
507 | 1 | 0.2%
506 | 1 | 0.2%
505 | 1 | 0.2%
504 | 1 | 0.2%

발명명칭
Text

Distinct: 505
Distinct (%): 98.4%
Missing: 0
Missing (%): 0.0%
Memory size: 4.1 KiB

Length

Max length: 71
Median length: 45
Mean length: 24.838207
Min length: 5

Characters and Unicode

Total characters: 12742
Distinct characters: 412
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
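
As an illustration of such character properties, the sketch below counts Unicode general categories for one title from the Sample rows, using Python's standard `unicodedata` module. It covers categories only; scripts and blocks are not exposed by `unicodedata`, so they are left out. This is not necessarily how the report itself computes its figures.

```python
import unicodedata
from collections import Counter

# One title taken from the Sample rows of this report.
title = "AMI 시공 관리 서비스를 제공하는 장치, 방법 및 프로그램"

# unicodedata.category returns the two-letter General Category code,
# e.g. "Lo" (Other Letter), "Lu" (Uppercase Letter), "Zs" (Space Separator),
# "Po" (Other Punctuation).
category_counts = Counter(unicodedata.category(ch) for ch in title)

for code, count in category_counts.most_common():
    print(code, count)
```

Hangul syllables fall under "Lo" (Other Letter), which is why that category dominates the counts in this column.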

Unique

Unique: 499
Unique (%): 97.3%
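
Distinct (505) and Unique (499) differ for this column. A plausible reading, stated here as an assumption, is that Distinct counts the different values while Unique counts the values that occur exactly once. The toy column below is hypothetical and only illustrates the two definitions:

```python
from collections import Counter

# Hypothetical toy column, not taken from the dataset.
column = ["a", "b", "a", "c"]

counts = Counter(column)
n_distinct = len(counts)                              # different values
n_unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once
print(n_distinct, n_unique)

# Applying the same reading to the figures in this report:
rows, distinct, unique = 513, 505, 499
repeated_values = distinct - unique   # titles that occur more than once
rows_in_repeats = rows - unique       # rows taken up by those titles
print(repeated_values, rows_in_repeats)
```

Under that reading, 6 titles occur more than once and together account for 14 of the 513 rows.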

Sample

1st row: 태양광 발전소의 기상정보 추정 장치 및 그 방법
2nd row: 머신러닝 기반 맞춤형 챗봇 추천 시스템 및 그 방법
3rd row: 기술유출 방지를 위한 스마트 콤비카드와 위치정보 시스템
4th row: AMI 시공 관리 서비스를 제공하는 장치, 방법 및 프로그램
5th row: 블록체인 기반의 AMI기기 검증 시스템 및 방법

Value | Count | Frequency (%)
(not shown) | 283 | 8.1%
시스템 | 243 | 6.9%
방법 | 223 | 6.4%
장치 | 122 | 3.5%
이용한 | 83 | 2.4%
관리 | 52 | 1.5%
(not shown) | 51 | 1.5%
데이터 | 46 | 1.3%
이를 | 43 | 1.2%
위한 | 42 | 1.2%
Other values (1119) | 2315 | 66.1%
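
A word-frequency table of this kind can be approximated by splitting each title on whitespace and counting tokens. The sketch below does this for the five titles listed under Sample. Whitespace-only tokenization is an assumption, and the report may normalize case or punctuation differently.

```python
from collections import Counter

# The five titles listed under Sample in this report.
titles = [
    "태양광 발전소의 기상정보 추정 장치 및 그 방법",
    "머신러닝 기반 맞춤형 챗봇 추천 시스템 및 그 방법",
    "기술유출 방지를 위한 스마트 콤비카드와 위치정보 시스템",
    "AMI 시공 관리 서비스를 제공하는 장치, 방법 및 프로그램",
    "블록체인 기반의 AMI기기 검증 시스템 및 방법",
]

# Assumption: tokens are split on whitespace only, so punctuation stays
# attached to the word (for example "장치," is counted separately from "장치").
word_counts = Counter(word for title in titles for word in title.split())

for word, count in word_counts.most_common(3):
    print(word, count)
```

Even on five rows, generic patent-title words such as 방법 (method) and 시스템 (system) come out on top, in line with the full-column table.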

Most occurring characters

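Value | Count | Frequency (%)
(space) | 2990 | 23.5%
(not shown) | 389 | 3.1%
(not shown) | 376 | 3.0%
(not shown) | 343 | 2.7%
(not shown) | 301 | 2.4%
(not shown) | 301 | 2.4%
(not shown) | 300 | 2.4%
(not shown) | 285 | 2.2%
(not shown) | 260 | 2.0%
(not shown) | 251 | 2.0%
Other values (402) | 6946 | 54.5%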

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9416 | 73.9%
Space Separator | 2990 | 23.5%
Uppercase Letter | 286 | 2.2%
Other Punctuation | 26 | 0.2%
Decimal Number | 14 | 0.1%
Close Punctuation | 4 | < 0.1%
Open Punctuation | 4 | < 0.1%
Dash Punctuation | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 389 | 4.1%
(not shown) | 376 | 4.0%
(not shown) | 343 | 3.6%
(not shown) | 301 | 3.2%
(not shown) | 301 | 3.2%
(not shown) | 300 | 3.2%
(not shown) | 285 | 3.0%
(not shown) | 260 | 2.8%
(not shown) | 251 | 2.7%
(not shown) | 226 | 2.4%
Other values (365) | 6384 | 67.8%

Uppercase Letter
Value | Count | Frequency (%)
I | 36 | 12.6%
M | 29 | 10.1%
A | 29 | 10.1%
C | 26 | 9.1%
S | 25 | 8.7%
P | 23 | 8.0%
D | 23 | 8.0%
L | 16 | 5.6%
T | 12 | 4.2%
R | 12 | 4.2%
Other values (11) | 55 | 19.2%

Decimal Number
Value | Count | Frequency (%)
6 | 2 | 14.3%
1 | 2 | 14.3%
8 | 2 | 14.3%
5 | 2 | 14.3%
0 | 2 | 14.3%
3 | 2 | 14.3%
2 | 1 | 7.1%
4 | 1 | 7.1%

Other Punctuation
Value | Count | Frequency (%)
, | 20 | 76.9%
/ | 3 | 11.5%
· | 2 | 7.7%
. | 1 | 3.8%

Space Separator
Value | Count | Frequency (%)
(space) | 2990 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 4 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 4 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 9416 | 73.9%
Common | 3040 | 23.9%
Latin | 286 | 2.2%

Most frequent character per script

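Hangul
Value | Count | Frequency (%)
(not shown) | 389 | 4.1%
(not shown) | 376 | 4.0%
(not shown) | 343 | 3.6%
(not shown) | 301 | 3.2%
(not shown) | 301 | 3.2%
(not shown) | 300 | 3.2%
(not shown) | 285 | 3.0%
(not shown) | 260 | 2.8%
(not shown) | 251 | 2.7%
(not shown) | 226 | 2.4%
Other values (365) | 6384 | 67.8%

Latin
Value | Count | Frequency (%)
I | 36 | 12.6%
M | 29 | 10.1%
A | 29 | 10.1%
C | 26 | 9.1%
S | 25 | 8.7%
P | 23 | 8.0%
D | 23 | 8.0%
L | 16 | 5.6%
T | 12 | 4.2%
R | 12 | 4.2%
Other values (11) | 55 | 19.2%

Common
Value | Count | Frequency (%)
(space) | 2990 | 98.4%
, | 20 | 0.7%
) | 4 | 0.1%
( | 4 | 0.1%
/ | 3 | 0.1%
· | 2 | 0.1%
6 | 2 | 0.1%
1 | 2 | 0.1%
8 | 2 | 0.1%
5 | 2 | 0.1%
Other values (6) | 9 | 0.3%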

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9416 | 73.9%
ASCII | 3324 | 26.1%
None | 2 | < 0.1%

Most frequent character per block

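ASCII
Value | Count | Frequency (%)
(space) | 2990 | 90.0%
I | 36 | 1.1%
M | 29 | 0.9%
A | 29 | 0.9%
C | 26 | 0.8%
S | 25 | 0.8%
P | 23 | 0.7%
D | 23 | 0.7%
, | 20 | 0.6%
L | 16 | 0.5%
Other values (26) | 107 | 3.2%

Hangul
Value | Count | Frequency (%)
(not shown) | 389 | 4.1%
(not shown) | 376 | 4.0%
(not shown) | 343 | 3.6%
(not shown) | 301 | 3.2%
(not shown) | 301 | 3.2%
(not shown) | 300 | 3.2%
(not shown) | 285 | 3.0%
(not shown) | 260 | 2.8%
(not shown) | 251 | 2.7%
(not shown) | 226 | 2.4%
Other values (365) | 6384 | 67.8%

None
Value | Count | Frequency (%)
· | 2 | 100.0%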

등록번호
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 513
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.0171888 × 10^8
Minimum: 1.00446 × 10^8
Maximum: 1.0246116 × 10^8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.6 KiB

Quantile statistics

Minimum: 1.00446 × 10^8
5-th percentile: 1.0090216 × 10^8
Q1: 1.0132102 × 10^8
Median: 1.0174306 × 10^8
Q3: 1.0218587 × 10^8
95-th percentile: 1.024075 × 10^8
Maximum: 1.0246116 × 10^8
Range: 2015160
Interquartile range (IQR): 864848

Descriptive statistics

Standard deviation: 498192.75
Coefficient of variation (CV): 0.004897741
Kurtosis: -0.96419974
Mean: 1.0171888 × 10^8
Median Absolute Deviation (MAD): 432252
Skewness: -0.26904166
Sum: 5.2181788 × 10^10
Variance: 2.4819601 × 10^11
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
102461156 | 1 | 0.2%
101321021 | 1 | 0.2%
101417794 | 1 | 0.2%
101417795 | 1 | 0.2%
101419595 | 1 | 0.2%
101430176 | 1 | 0.2%
101430177 | 1 | 0.2%
101435181 | 1 | 0.2%
101437595 | 1 | 0.2%
101443201 | 1 | 0.2%
Other values (503) | 503 | 98.1%

Minimum 10 values
Value | Count | Frequency (%)
100445996 | 1 | 0.2%
100478219 | 1 | 0.2%
100489244 | 1 | 0.2%
100575228 | 1 | 0.2%
100581719 | 1 | 0.2%
100610353 | 1 | 0.2%
100622985 | 1 | 0.2%
100622986 | 1 | 0.2%
100629399 | 1 | 0.2%
100699332 | 1 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
102461156 | 1 | 0.2%
102461155 | 1 | 0.2%
102454796 | 1 | 0.2%
102453832 | 1 | 0.2%
102442283 | 1 | 0.2%
102442282 | 1 | 0.2%
102442281 | 1 | 0.2%
102442280 | 1 | 0.2%
102442134 | 1 | 0.2%
102442133 | 1 | 0.2%

등록일
DateTime

Distinct: 309
Distinct (%): 60.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.1 KiB
Minimum: 2004-08-17 00:00:00
Maximum: 2022-10-26 00:00:00
Histogram with fixed size bins (bins=50)

Interactions

[Figures: interaction plots for 연번 and 등록번호]

Correlations

 | 연번 | 등록번호
연번 | 1.000 | 0.983
등록번호 | 0.983 | 1.000

 | 연번 | 등록번호
연번 | 1.000 | -1.000
등록번호 | -1.000 | 1.000
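
The report does not state which correlation coefficient each matrix uses. As a point of reference, the sketch below computes the two most common ones, Pearson and Spearman, on the first ten (연번, 등록번호) pairs from the Sample section. Because it uses ten rows rather than all 513, the results are not expected to reproduce the matrices above. The helper functions are illustrative, not part of any library.

```python
import math

# First ten (연번, 등록번호) pairs from the Sample section of this report.
serial = list(range(1, 11))
reg_no = [102461156, 102461155, 102454796, 102453832, 102442283,
          102442282, 102442281, 102442280, 102442134, 102442133]

def pearson(x, y):
    """Pearson correlation: covariance divided by the product of spreads."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def ranks(values):
    """Rank 1 = smallest value. Assumes no ties, which holds for these columns."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

pearson_r = pearson(serial, reg_no)
spearman_r = spearman(serial, reg_no)
print(round(pearson_r, 3), round(spearman_r, 3))
```

In these ten rows 등록번호 falls as 연번 rises, so both coefficients are negative, and the rank-based one is exactly -1 because the ordering is perfectly reversed.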

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows
 | 연번 | 발명명칭 | 등록번호 | 등록일
0 | 1 | 태양광 발전소의 기상정보 추정 장치 및 그 방법 | 102461156 | 2022-10-26
1 | 2 | 머신러닝 기반 맞춤형 챗봇 추천 시스템 및 그 방법 | 102461155 | 2022-10-26
2 | 3 | 기술유출 방지를 위한 스마트 콤비카드와 위치정보 시스템 | 102454796 | 2022-10-11
3 | 4 | AMI 시공 관리 서비스를 제공하는 장치, 방법 및 프로그램 | 102453832 | 2022-10-06
4 | 5 | 블록체인 기반의 AMI기기 검증 시스템 및 방법 | 102442283 | 2022-09-06
5 | 6 | 실내 위치 측위 장치 및 방법 | 102442282 | 2022-09-06
6 | 7 | 무인 비행체 및 무인 비행체 배터리 공급 방법 | 102442281 | 2022-09-06
7 | 8 | 배전지능화용 단말장치의 암호서비스 및 방법 | 102442280 | 2022-09-06
8 | 9 | 배전자동화 시스템의 단말장치 및 그의 동작 제어 방법 | 102442134 | 2022-09-05
9 | 10 | 블록체인 환경에서의 제안서 평가 진행 시스템 및 방법 | 102442133 | 2022-09-05

Last rows
 | 연번 | 발명명칭 | 등록번호 | 등록일
503 | 504 | 전력선통신을 이용한 무인 원격감시 보안시스템 | 100699332 | 2007-03-19
504 | 505 | 원격검침시스템 | 100629399 | 2006-09-21
505 | 506 | 수용가의 도전 및 누전 감시가 가능한 원격검침 시스템 | 100622985 | 2006-09-05
506 | 507 | 원격 검침 시스템의 데이터 수집장치 | 100622986 | 2006-09-05
507 | 508 | 무선 자주기 | 100610353 | 2006-08-01
508 | 509 | 지그비 통신방식을 이용한 통합 원격 검침 시스템 및 그방법 | 100581719 | 2006-05-12
509 | 510 | 전력선 통신을 이용한 피디에이 검침시스템 및 그 검침방법 | 100575228 | 2006-04-24
510 | 511 | 무인 중계소를 감시 및 제어하기 위한 단말 장치 및 그 방법 | 100489244 | 2005-05-03
511 | 512 | FCI용 단말 장치 | 100478219 | 2005-03-11
512 | 513 | 인터넷을 이용한 교통정보의 생성/제공시스템 및 그생성/제공방법 | 100445996 | 2004-08-17