Overview

Dataset statistics

Number of variables: 3
Number of observations: 560
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 13.8 KiB
Average record size in memory: 25.2 B
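
The average record size is consistent with dividing the total memory footprint by the number of observations. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes (the variable names are illustrative and not part of the report):

```python
# Figures copied from the "Dataset statistics" block.
total_size_kib = 13.8
n_observations = 560

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```

Rounded to one decimal place, the result agrees with the 25.2 B shown in the report.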

Variable types

Numeric: 1
Text: 2

Dataset

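Description: A list of promising technologies for commercialization in 2022 held by the Special Zone Foundation (특구재단). The dataset contains the following columns: 순번 (serial number), 기술명 (technology name), 특허출원번호 (patent application number).
Author: (재)연구개발특구진흥재단
URL: https://www.data.go.kr/data/15106294/fileData.do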

Alerts

순번 has unique values (Unique)
기술명 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 16:23:11.411500
Analysis finished: 2023-12-12 16:23:12.390427
Duration: 0.98 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 560
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 314.35714
Minimum: 1
Maximum: 639
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 28.95
Q1: 140.75
Median: 280.5
Q3: 499.25
95-th percentile: 611.05
Maximum: 639
Range: 638
Interquartile range (IQR): 358.5
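
The fractional quantiles (28.95, 140.75, 499.25) are consistent with linear interpolation between neighboring sorted values, although the report does not state which method was used. The sketch below shows that interpolation on illustrative data; `quantile_linear` is a hypothetical helper, not a function of the profiling tool.

```python
def quantile_linear(sorted_values, q):
    """Quantile with linear interpolation between the two nearest ranks."""
    if not sorted_values:
        raise ValueError("need at least one value")
    pos = q * (len(sorted_values) - 1)  # fractional 0-based rank
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])

# Illustrative data only: the integers 1..10, not the real 순번 column.
data = list(range(1, 11))
q1 = quantile_linear(data, 0.25)
median = quantile_linear(data, 0.5)
q3 = quantile_linear(data, 0.75)
iqr = q3 - q1

# The IQR reported for 순번 is simply Q3 - Q1.
reported_iqr = 499.25 - 140.75
```

For the real column, the same definition gives 499.25 - 140.75 = 358.5, which matches the reported interquartile range.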

Descriptive statistics

Standard deviation: 196.37925
Coefficient of variation (CV): 0.62470109
Kurtosis: -1.3930879
Mean: 314.35714
Median Absolute Deviation (MAD): 179.5
Skewness: 0.084169119
Sum: 176040
Variance: 38564.81
Monotonicity: Strictly increasing
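
Several of these figures are derived from one another, so they can be cross-checked. The sketch below recomputes the variance, coefficient of variation, and sum from the reported mean, standard deviation, and row count. Because the inputs are already rounded, small differences in the last digits are expected.

```python
# Figures copied from the report for the 순번 column.
n = 560
mean = 314.35714
std = 196.37925

variance = std ** 2  # reported: 38564.81
cv = std / mean      # reported: 0.62470109
total = mean * n     # reported: 176040

print(round(variance, 2), round(cv, 5), round(total))
```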
Histogram with fixed size bins (bins=50)
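
With 50 fixed-size bins over the range 1 to 639, each bin would span (639 - 1) / 50 = 12.76 units. The sketch below shows one way to count values into equal-width bins. `fixed_width_histogram` is a hypothetical helper that assumes every value lies within [lo, hi], and it is not necessarily how the plotting library bins the data.

```python
def fixed_width_histogram(values, bins, lo, hi):
    """Count values into `bins` equal-width bins spanning [lo, hi]."""
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum value is placed in the last bin rather than a new one.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return width, counts

# Bin width implied by the report: range 1..639 split into 50 bins.
width, _ = fixed_width_histogram([], bins=50, lo=1, hi=639)

# Illustrative data only (not the real column): ten values, five bins.
_, counts = fixed_width_histogram(range(1, 11), bins=5, lo=1, hi=10)
```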
Common values
Value | Count | Frequency (%)
1 | 1 | 0.2%
457 | 1 | 0.2%
451 | 1 | 0.2%
452 | 1 | 0.2%
453 | 1 | 0.2%
454 | 1 | 0.2%
455 | 1 | 0.2%
456 | 1 | 0.2%
458 | 1 | 0.2%
449 | 1 | 0.2%
Other values (550) | 550 | 98.2%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
639 | 1 | 0.2%
638 | 1 | 0.2%
637 | 1 | 0.2%
636 | 1 | 0.2%
635 | 1 | 0.2%
634 | 1 | 0.2%
633 | 1 | 0.2%
632 | 1 | 0.2%
631 | 1 | 0.2%
630 | 1 | 0.2%

기술명
Text

UNIQUE 

Distinct: 560
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 4.5 KiB

Length

Max length: 89
Median length: 51
Mean length: 28.905357
Min length: 3

Characters and Unicode

Total characters: 16187
Distinct characters: 610
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
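
As a rough illustration of how such properties can be read per character, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It covers categories only. Scripts and blocks are not exposed by the standard library, and the report does not say which library it used for them. `category_counts` is a hypothetical helper.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category (e.g. 'Lo', 'Lu', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)

# Fifth sample row of 기술명 from the report.
counts = category_counts("NK면역 치료제")
print(counts)
```

Here `Lo` corresponds to Other Letter (the Hangul syllables), `Lu` to Uppercase Letter, and `Zs` to Space Separator, the same category names the report uses.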

Unique

Unique: 560
Unique (%): 100.0%

Sample

1st row: 3D 프린팅 기반의 유전체 공진기와 구조체 및 그 제조 방법
2nd row: AGTR-1에 특이적으로 결합 유방암 치료용 핵산압타머
3rd row: CAR 유전자가 도입된 NK 세포, 면역세포의 제조방법 용도
4th row: CRISP/Cas 시스템에 사용하기 위한 융합 단백질, 이의 복합체
5th row: NK면역 치료제
Value | Count | Frequency (%)
(not recovered) | 256 | 6.1%
방법 | 111 | 2.7%
시스템 | 90 | 2.2%
이용한 | 83 | 2.0%
장치 | 76 | 1.8%
위한 | 53 | 1.3%
기술 | 53 | 1.3%
조성물 | 51 | 1.2%
제조방법 | 44 | 1.1%
이를 | 39 | 0.9%
Other values (2119) | 3318 | 79.5%

Most occurring characters

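Value | Count | Frequency (%)
(space) | 3652 | 22.6%
(Hangul character not recovered) | 366 | 2.3%
(Hangul character not recovered) | 285 | 1.8%
(Hangul character not recovered) | 281 | 1.7%
(Hangul character not recovered) | 257 | 1.6%
(Hangul character not recovered) | 255 | 1.6%
(Hangul character not recovered) | 247 | 1.5%
(Hangul character not recovered) | 212 | 1.3%
(Hangul character not recovered) | 205 | 1.3%
(Hangul character not recovered) | 200 | 1.2%
Other values (600) | 10227 | 63.2%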

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 12026 | 74.3%
Space Separator | 3653 | 22.6%
Uppercase Letter | 237 | 1.5%
Lowercase Letter | 74 | 0.5%
Decimal Number | 71 | 0.4%
Other Punctuation | 70 | 0.4%
Dash Punctuation | 36 | 0.2%
Open Punctuation | 10 | 0.1%
Close Punctuation | 10 | 0.1%

Most frequent character per category

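Other Letter
Value | Count | Frequency (%)
(Hangul character not recovered) | 366 | 3.0%
(Hangul character not recovered) | 285 | 2.4%
(Hangul character not recovered) | 281 | 2.3%
(Hangul character not recovered) | 257 | 2.1%
(Hangul character not recovered) | 255 | 2.1%
(Hangul character not recovered) | 247 | 2.1%
(Hangul character not recovered) | 212 | 1.8%
(Hangul character not recovered) | 205 | 1.7%
(Hangul character not recovered) | 200 | 1.7%
(Hangul character not recovered) | 196 | 1.6%
Other values (535) | 9522 | 79.2%

Uppercase Letter
Value | Count | Frequency (%)
D | 24 | 10.1%
I | 20 | 8.4%
S | 16 | 6.8%
L | 16 | 6.8%
C | 15 | 6.3%
A | 14 | 5.9%
E | 13 | 5.5%
P | 13 | 5.5%
M | 12 | 5.1%
N | 12 | 5.1%
Other values (15) | 82 | 34.6%

Lowercase Letter
Value | Count | Frequency (%)
i | 11 | 14.9%
o | 9 | 12.2%
a | 8 | 10.8%
n | 6 | 8.1%
r | 5 | 6.8%
l | 5 | 6.8%
e | 4 | 5.4%
t | 4 | 5.4%
s | 4 | 5.4%
p | 3 | 4.1%
Other values (9) | 15 | 20.3%

Decimal Number
Value | Count | Frequency (%)
3 | 24 | 33.8%
1 | 15 | 21.1%
2 | 12 | 16.9%
9 | 5 | 7.0%
5 | 4 | 5.6%
4 | 4 | 5.6%
0 | 3 | 4.2%
6 | 3 | 4.2%
8 | 1 | 1.4%

Other Punctuation
Value | Count | Frequency (%)
, | 46 | 65.7%
/ | 15 | 21.4%
· | 7 | 10.0%
! | 1 | 1.4%
. | 1 | 1.4%

Space Separator
Value | Count | Frequency (%)
(space) | 3652 | > 99.9%
(non-ASCII space) | 1 | < 0.1%

Open Punctuation
Value | Count | Frequency (%)
( | 8 | 80.0%
[ | 2 | 20.0%

Close Punctuation
Value | Count | Frequency (%)
) | 8 | 80.0%
] | 2 | 20.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 36 | 100.0%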

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 12026 | 74.3%
Common | 3850 | 23.8%
Latin | 311 | 1.9%

Most frequent character per script

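Hangul
Value | Count | Frequency (%)
(Hangul character not recovered) | 366 | 3.0%
(Hangul character not recovered) | 285 | 2.4%
(Hangul character not recovered) | 281 | 2.3%
(Hangul character not recovered) | 257 | 2.1%
(Hangul character not recovered) | 255 | 2.1%
(Hangul character not recovered) | 247 | 2.1%
(Hangul character not recovered) | 212 | 1.8%
(Hangul character not recovered) | 205 | 1.7%
(Hangul character not recovered) | 200 | 1.7%
(Hangul character not recovered) | 196 | 1.6%
Other values (535) | 9522 | 79.2%

Latin
Value | Count | Frequency (%)
D | 24 | 7.7%
I | 20 | 6.4%
S | 16 | 5.1%
L | 16 | 5.1%
C | 15 | 4.8%
A | 14 | 4.5%
E | 13 | 4.2%
P | 13 | 4.2%
M | 12 | 3.9%
N | 12 | 3.9%
Other values (34) | 156 | 50.2%

Common
Value | Count | Frequency (%)
(space) | 3652 | 94.9%
, | 46 | 1.2%
- | 36 | 0.9%
3 | 24 | 0.6%
/ | 15 | 0.4%
1 | 15 | 0.4%
2 | 12 | 0.3%
( | 8 | 0.2%
) | 8 | 0.2%
· | 7 | 0.2%
Other values (11) | 27 | 0.7%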

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 12026 | 74.3%
ASCII | 4150 | 25.6%
None | 11 | 0.1%

Most frequent character per block

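ASCII
Value | Count | Frequency (%)
(space) | 3652 | 88.0%
, | 46 | 1.1%
- | 36 | 0.9%
D | 24 | 0.6%
3 | 24 | 0.6%
I | 20 | 0.5%
S | 16 | 0.4%
L | 16 | 0.4%
/ | 15 | 0.4%
C | 15 | 0.4%
Other values (50) | 286 | 6.9%

Hangul
Value | Count | Frequency (%)
(Hangul character not recovered) | 366 | 3.0%
(Hangul character not recovered) | 285 | 2.4%
(Hangul character not recovered) | 281 | 2.3%
(Hangul character not recovered) | 257 | 2.1%
(Hangul character not recovered) | 255 | 2.1%
(Hangul character not recovered) | 247 | 2.1%
(Hangul character not recovered) | 212 | 1.8%
(Hangul character not recovered) | 205 | 1.7%
(Hangul character not recovered) | 200 | 1.7%
(Hangul character not recovered) | 196 | 1.6%
Other values (535) | 9522 | 79.2%

None
Value | Count | Frequency (%)
· | 7 | 63.6%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(character not recovered) | 1 | 9.1%
(non-ASCII space) | 1 | 9.1%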

특허 출원번호
Text

Distinct: 559
Distinct (%): 99.8%
Missing: 0
Missing (%): 0.0%
Memory size: 4.5 KiB

Length

Max length: 16
Median length: 15
Mean length: 14.994643
Min length: 12

Characters and Unicode

Total characters: 8397
Distinct characters: 16
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 558
Unique (%): 99.6%
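
The report distinguishes "Distinct" (559) from "Unique" (558). These figures are consistent with reading distinct as the number of different values and unique as the number of values that occur exactly once: 558 values seen once plus one value seen twice gives 559 distinct values across 560 rows. A minimal sketch of that reading, with `distinct_and_unique` as a hypothetical helper:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique): different values vs. values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Illustrative list only: one application number repeated once.
sample = [
    "10-2017-0136246",
    "10-2019-0031047",
    "10-2017-0136246",
    "10-2017-0050485",
]
distinct, unique = distinct_and_unique(sample)
```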

Sample

1st row: 10-2017-0166666
2nd row: 10-2018-0089025
3rd row: 10-2020-0037719
4th row: 10-2017-0131637
5th row: 10-2016-7036895
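
All five sample values share an NN-YYYY-NNNNNNN layout of 15 characters, while the reported minimum length of 12 and maximum of 16 suggest that not every value does. The sketch below flags values that deviate from that layout. The pattern is an assumption inferred from the samples rather than an official specification, and the two malformed strings are invented for illustration.

```python
import re

# Assumed layout: 2 digits, 4-digit year, 7-digit serial, joined by hyphens.
APPLICATION_NO = re.compile(r"[0-9]{2}-[0-9]{4}-[0-9]{7}")

def is_well_formed(value):
    """True if the whole string matches the assumed NN-YYYY-NNNNNNN layout."""
    return APPLICATION_NO.fullmatch(value) is not None

samples = [
    "10-2017-0166666",       # sample row from the report
    "10-2016-7036895",       # sample row from the report
    "1020170166666",         # hypothetical: hyphens missing
    "10-2017-0166666 (가)",  # hypothetical: trailing annotation
]
flags = [is_well_formed(s) for s in samples]
```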
Value | Count | Frequency (%)
10-2017-0136246 | 2 | 0.4%
10-2019-0031047 | 1 | 0.2%
10-2017-0050485 | 1 | 0.2%
10-2019-0093705 | 1 | 0.2%
10-2021-0044421 | 1 | 0.2%
10-2019-0009412 | 1 | 0.2%
10-2021-0028907 | 1 | 0.2%
10-2019-0156671 | 1 | 0.2%
10-2021-0007351 | 1 | 0.2%
10-2020-0110959 | 1 | 0.2%
Other values (549) | 549 | 98.0%

Most occurring characters

Value | Count | Frequency (%)
0 | 2346 | 27.9%
1 | 1604 | 19.1%
- | 1118 | 13.3%
2 | 999 | 11.9%
8 | 380 | 4.5%
7 | 364 | 4.3%
9 | 356 | 4.2%
6 | 346 | 4.1%
5 | 305 | 3.6%
4 | 302 | 3.6%
Other values (6) | 277 | 3.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 7274 | 86.6%
Dash Punctuation | 1118 | 13.3%
Other Letter | 2 | < 0.1%
Space Separator | 1 | < 0.1%
Open Punctuation | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%
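
These category totals hint that a few values are irregular. If all 560 values followed an NN-YYYY-NNNNNNN layout (an assumption based on the sample rows), the column would hold 8400 characters and 1120 hyphens. The sketch below compares those expectations with the reported totals. It does not identify which rows deviate.

```python
# Figures copied from the report for 특허 출원번호.
n_rows = 560
observed_chars = 8397
observed_hyphens = 1118

# Assumption: a regular value is 15 characters long and has 2 hyphens.
expected_chars = n_rows * 15
expected_hyphens = n_rows * 2

char_gap = expected_chars - observed_chars        # net characters short
hyphen_gap = expected_hyphens - observed_hyphens  # hyphens short

# Characters that are neither digits nor hyphens, per the category table:
# 2 Other Letter + 1 Space Separator + 1 Open + 1 Close Punctuation.
stray_chars = 2 + 1 + 1 + 1
```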

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 2346 | 32.3%
1 | 1604 | 22.1%
2 | 999 | 13.7%
8 | 380 | 5.2%
7 | 364 | 5.0%
9 | 356 | 4.9%
6 | 346 | 4.8%
5 | 305 | 4.2%
4 | 302 | 4.2%
3 | 272 | 3.7%

Other Letter
Value | Count | Frequency (%)
(Hangul character not recovered) | 1 | 50.0%
(Hangul character not recovered) | 1 | 50.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1118 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 8395 | > 99.9%
Hangul | 2 | < 0.1%

Most frequent character per script

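Common
Value | Count | Frequency (%)
0 | 2346 | 27.9%
1 | 1604 | 19.1%
- | 1118 | 13.3%
2 | 999 | 11.9%
8 | 380 | 4.5%
7 | 364 | 4.3%
9 | 356 | 4.2%
6 | 346 | 4.1%
5 | 305 | 3.6%
4 | 302 | 3.6%
Other values (4) | 275 | 3.3%

Hangul
Value | Count | Frequency (%)
(Hangul character not recovered) | 1 | 50.0%
(Hangul character not recovered) | 1 | 50.0%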

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 8395 | > 99.9%
Hangul | 2 | < 0.1%

Most frequent character per block

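ASCII
Value | Count | Frequency (%)
0 | 2346 | 27.9%
1 | 1604 | 19.1%
- | 1118 | 13.3%
2 | 999 | 11.9%
8 | 380 | 4.5%
7 | 364 | 4.3%
9 | 356 | 4.2%
6 | 346 | 4.1%
5 | 305 | 3.6%
4 | 302 | 3.6%
Other values (4) | 275 | 3.3%

Hangul
Value | Count | Frequency (%)
(Hangul character not recovered) | 1 | 50.0%
(Hangul character not recovered) | 1 | 50.0%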

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
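
Both nullity views reduce to counting missing cells per column, which for this dataset is zero everywhere. The sketch below shows that count on plain Python rows. `missing_counts` is a hypothetical helper, and treating an empty string as missing is an assumption (a dataframe library would typically test for NaN instead).

```python
def missing_counts(rows, columns):
    """Count None or empty-string cells per column in a list of dict rows."""
    return {
        col: sum(1 for row in rows if row.get(col) in (None, ""))
        for col in columns
    }

columns = ["순번", "기술명", "특허 출원번호"]

# First two rows of the report's sample table.
rows = [
    {"순번": 1, "기술명": "3D 프린팅 기반의 유전체 공진기와 구조체 및 그 제조 방법",
     "특허 출원번호": "10-2017-0166666"},
    {"순번": 2, "기술명": "AGTR-1에 특이적으로 결합 유방암 치료용 핵산압타머",
     "특허 출원번호": "10-2018-0089025"},
]
counts = missing_counts(rows, columns)
```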

Sample

First rows
(index) | 순번 | 기술명 | 특허 출원번호
0 | 1 | 3D 프린팅 기반의 유전체 공진기와 구조체 및 그 제조 방법 | 10-2017-0166666
1 | 2 | AGTR-1에 특이적으로 결합 유방암 치료용 핵산압타머 | 10-2018-0089025
2 | 3 | CAR 유전자가 도입된 NK 세포, 면역세포의 제조방법 용도 | 10-2020-0037719
3 | 4 | CRISP/Cas 시스템에 사용하기 위한 융합 단백질, 이의 복합체 | 10-2017-0131637
4 | 5 | NK면역 치료제 | 10-2016-7036895
5 | 6 | 감태 추출물을 포함하는 초미세먼지로 인한 질환 예방 및 치료용 조성물 | 10-2019-0112283
6 | 7 | 견방사의 고온밀폐식 정련방법 및 이에 의해 제조된 견방사 | 10-2017-0055630
7 | 8 | 고에너지 파장 가변 근적외선 레이저 제작 기술 | 10-2016-0105100
8 | 9 | 고에너지밀도 전고체 이차전지 기술 | 10-2018-0135867
9 | 10 | 고주파 및 저주파 저감 클램프 | 10-2020-0026449

Last rows
(index) | 순번 | 기술명 | 특허 출원번호
550 | 630 | 차량에 설치되는 화물 적재장치 | 10-2021-0158033
551 | 631 | 차량의 유리를 이용한 전기자동차의 무접점 충전장치 | 10-2015-0122005
552 | 632 | 파워팩을 적용한 수평 유지기술 | 10-2021-0103063
553 | 633 | Sediment 미생물연료전지 | 10-2014-0073798
554 | 634 | 생물학적 C1 가스 전환 공정을 위한 생물전기화학반응기 및 이를 이용한 공정방법 | 10-2018-0117357
555 | 635 | 수소 생산 장치를 구비한 LNG 운반선 | 10-2019-0069256
556 | 636 | 수소 정제 장치를 포함하는 부생수소 운반선 | 10-2019-0086755
557 | 637 | 수소연료전지 추진 선박의 압축 수소 연료공급 방법 | 10-2020-0166248
558 | 638 | 음식물쓰레기 분해 혼합균주 및 이를 이용한 음식물쓰레기 분해 방법 | 10-2018-0035254
559 | 639 | 제조합 HcRNAV 34 바이러스 유사 입자 단백질을 발현하는 형질전환 담배 | 10-2014-0114211
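
The sample rows show the row index and 순번 drifting apart: index 0 pairs with 순번 1, while index 559 pairs with 순번 639. That is consistent with some serial numbers being skipped in the source file. The sketch below estimates how many are skipped, using only figures from the report. `count_gaps` is a hypothetical helper.

```python
def count_gaps(first, last, n_rows):
    """Number of integers in [first, last] not used by n_rows distinct values."""
    return (last - first + 1) - n_rows

# Figures from the 순번 summary: minimum 1, maximum 639, 560 distinct values.
missing_serials = count_gaps(1, 639, 560)

# The last sample row has 0-based index 559 and 순번 639.
# Without gaps 순번 would equal index + 1, so the offset equals the gap count.
offset_at_tail = 639 - (559 + 1)
```

Both calculations give 79, so about 79 serial numbers between 1 and 639 do not appear in the dataset.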