Overview

Dataset statistics

Number of variables: 7
Number of observations: 47
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.7 KiB
Average record size in memory: 59.8 B

Variable types

Numeric: 1
Categorical: 3
DateTime: 1
Text: 2

Dataset

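Description: List of industrial property rights technology transfers by Korea Western Power (한국서부발전). The provided fields are No., 종류 (type), 계약일 (contract date), 기간 (term), 기술명 (technology name), 기술료 (royalty), and 이전기업 (transferee company).
Author: Public Data Portal (공공데이터포털)
URL: https://www.data.go.kr/data/15090242/fileData.do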

Alerts

종류 is highly overall correlated with 기술료 (High correlation)
기술료 is highly overall correlated with 종류 (High correlation)
종류 is highly imbalanced (74.7%) (Imbalance)
기간 is highly imbalanced (58.8%) (Imbalance)
순번 has unique values (Unique)
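
The two imbalance percentages can be reproduced from the category counts reported for 종류 (44, 2, 1) and 기간 (38, 5, 1, 1, 1, 1). The sketch below assumes the score is one minus the Shannon entropy normalized by the log of the number of classes. That formula is an assumption checked only against the two figures in this report, and the function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the class
    frequencies and k is the number of non-empty classes (assumed formula)."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    if len(probs) < 2:
        return 0.0  # a single class has no entropy to normalize
    entropy = -sum(p * math.log(p) for p in probs)
    return 1 - entropy / math.log(len(probs))

# Category counts taken from the Common Values tables in this report.
kind_counts = [44, 2, 1]            # 종류
term_counts = [38, 5, 1, 1, 1, 1]   # 기간

print(f"종류: {imbalance_score(kind_counts):.1%}")
print(f"기간: {imbalance_score(term_counts):.1%}")
```

A score near 100% means one class dominates; a score of 0% means all classes are equally frequent.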

Reproduction

Analysis started: 2024-04-17 19:10:42.341821
Analysis finished: 2024-04-17 19:10:42.907497
Duration: 0.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 47
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 24
Minimum: 1
Maximum: 47
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 555.0 B

Quantile statistics

Minimum: 1
5th percentile: 3.3
Q1: 12.5
Median: 24
Q3: 35.5
95th percentile: 44.7
Maximum: 47
Range: 46
Interquartile range (IQR): 23

Descriptive statistics

Standard deviation: 13.711309
Coefficient of variation (CV): 0.57130455
Kurtosis: -1.2
Mean: 24
Median Absolute Deviation (MAD): 12
Skewness: 0
Sum: 1128
Variance: 188
Monotonicity: Strictly increasing
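
Because 순번 is unique, strictly increasing, and spans 1 to 47 over 47 rows, its values must be the integers 1 through 47, so the quantile and descriptive statistics above can be checked by hand. The following is a minimal sketch using only the Python standard library. It assumes linear interpolation for percentiles and the sample (n - 1) variance, which match the reported figures. The helper `percentile` is illustrative, not the report's own code.

```python
import statistics

values = list(range(1, 48))  # 순번: the integers 1..47

def percentile(data, q):
    """Percentile with linear interpolation between order statistics, q in [0, 1]."""
    ordered = sorted(data)
    position = (len(ordered) - 1) * q
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (position - lower)

median = statistics.median(values)
std = statistics.stdev(values)  # sample standard deviation

summary = {
    "sum": sum(values),
    "mean": statistics.mean(values),
    "variance": statistics.variance(values),  # sample variance
    "std": std,
    "cv": std / statistics.mean(values),
    "q1": percentile(values, 0.25),
    "q3": percentile(values, 0.75),
    "p05": percentile(values, 0.05),
    "p95": percentile(values, 0.95),
    "mad": statistics.median([abs(v - median) for v in values]),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```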
Histogram with fixed size bins (bins=47)
Common values

Value | Count | Frequency (%)
1 | 1 | 2.1%
2 | 1 | 2.1%
27 | 1 | 2.1%
28 | 1 | 2.1%
29 | 1 | 2.1%
30 | 1 | 2.1%
31 | 1 | 2.1%
32 | 1 | 2.1%
33 | 1 | 2.1%
34 | 1 | 2.1%
Other values (37) | 37 | 78.7%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 2.1%
2 | 1 | 2.1%
3 | 1 | 2.1%
4 | 1 | 2.1%
5 | 1 | 2.1%
6 | 1 | 2.1%
7 | 1 | 2.1%
8 | 1 | 2.1%
9 | 1 | 2.1%
10 | 1 | 2.1%

Maximum 10 values

Value | Count | Frequency (%)
47 | 1 | 2.1%
46 | 1 | 2.1%
45 | 1 | 2.1%
44 | 1 | 2.1%
43 | 1 | 2.1%
42 | 1 | 2.1%
41 | 1 | 2.1%
40 | 1 | 2.1%
39 | 1 | 2.1%
38 | 1 | 2.1%

종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 6.4%
Missing: 0
Missing (%): 0.0%
Memory size: 508.0 B

Length

Max length: 5
Median length: 2
Mean length: 2.1702128
Min length: 2

Unique

Unique: 1
Unique (%): 2.1%

Sample

1st row: 특허
2nd row: 특허
3rd row: 특허
4th row: 특허
5th row: 특허

Common Values

Value | Count | Frequency (%)
특허 (patent) | 44 | 93.6%
연구성과물 (research output) | 2 | 4.3%
프로그램 (program) | 1 | 2.1%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the most common 종류 values]

계약년월
DateTime

Distinct: 33
Distinct (%): 70.2%
Missing: 0
Missing (%): 0.0%
Memory size: 508.0 B
Minimum: 2008-05-01 00:00:00
Maximum: 2021-10-01 00:00:00
Histogram with fixed size bins (bins=33)

기간
Categorical

IMBALANCE 

Distinct: 6
Distinct (%): 12.8%
Missing: 0
Missing (%): 0.0%
Memory size: 508.0 B

Length

Max length: 8
Median length: 2
Mean length: 2.2553191
Min length: 2

Unique

Unique: 4
Unique (%): 8.5%

Sample

1st row: 15년
2nd row: 10년
3rd row: 10년
4th row: 10년
5th row: 10년

Common Values

Value | Count | Frequency (%)
5년 | 38 | 80.9%
10년 | 5 | 10.6%
15년 | 1 | 2.1%
2년 | 1 | 2.1%
23.03.25 | 1 | 2.1%
3년 | 1 | 2.1%
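
The 기간 values are year durations such as 5년 (5 years), except for one date-like entry, 23.03.25. A minimal sketch of separating the two follows. It assumes that a valid duration always has the form of an integer followed by 년, an assumption based only on the six values listed here. The helper name `parse_term_years` is hypothetical.

```python
import re

# Assumed format: one or more digits followed by 년 ("years").
TERM_PATTERN = re.compile(r"^(\d+)년$")

def parse_term_years(value):
    """Return the duration in years, or None if the value is not '<integer>년'."""
    match = TERM_PATTERN.match(value.strip())
    return int(match.group(1)) if match else None

# The six distinct 기간 values reported in the Common Values table.
values = ["5년", "10년", "15년", "2년", "23.03.25", "3년"]

parsed = {value: parse_term_years(value) for value in values}
unparsed = [value for value, years in parsed.items() if years is None]

print(parsed)
print("Needs manual review:", unparsed)
```

Flagging the non-conforming entry for review, rather than guessing whether it is an end date, keeps the cleaning step from inventing a duration.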

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the most common 기간 values]

기술명
Text

Distinct: 41
Distinct (%): 87.2%
Missing: 0
Missing (%): 0.0%
Memory size: 508.0 B

Length

Max length: 131
Median length: 31
Mean length: 27.12766
Min length: 3

Characters and Unicode

Total characters: 1275
Distinct characters: 221
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 35
Unique (%): 74.5%

Sample

1st row: 고전압 설비의 부분방전 감시센서 및 그 제조방법
2nd row: 과열증기를 이용한 석탄건조 시스템
3rd row: 배관의 모재 및 용접부 초음파 검사장치
4th row: 2차전지의 수명 예측장치 및 방법
5th row: 보부재 보강장치 및 이를 이용한 경량지붕틀의 보강구조
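Value | Count | Frequency (%)
(token not preserved) | 29 | 8.5%
장치 | 12 | 3.5%
방법 | 12 | 3.5%
이용한 | 8 | 2.4%
발전기 | 7 | 2.1%
(token not preserved) | 7 | 2.1%
시스템 | 6 | 1.8%
고정자 | 6 | 1.8%
제조방법 | 5 | 1.5%
(token not preserved) | 3 | 0.9%
Other values (194) | 245 | 72.1%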

Most occurring characters

ValueCountFrequency (%)
293
 
23.0%
32
 
2.5%
31
 
2.4%
29
 
2.3%
26
 
2.0%
26
 
2.0%
25
 
2.0%
25
 
2.0%
23
 
1.8%
23
 
1.8%
Other values (211) 742
58.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 964 | 75.6%
Space Separator | 293 | 23.0%
Decimal Number | 9 | 0.7%
Other Punctuation | 5 | 0.4%
Uppercase Letter | 2 | 0.2%
Open Punctuation | 1 | 0.1%
Close Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
3.3%
31
 
3.2%
29
 
3.0%
26
 
2.7%
26
 
2.7%
25
 
2.6%
25
 
2.6%
23
 
2.4%
23
 
2.4%
23
 
2.4%
Other values (199) 701
72.7%
Decimal Number
Value | Count | Frequency (%)
5 | 3 | 33.3%
2 | 3 | 33.3%
0 | 2 | 22.2%
6 | 1 | 11.1%

Other Punctuation
Value | Count | Frequency (%)
, | 2 | 40.0%
" | 2 | 40.0%
/ | 1 | 20.0%

Uppercase Letter
Value | Count | Frequency (%)
M | 1 | 50.0%
W | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 293 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 964 | 75.6%
Common | 309 | 24.2%
Latin | 2 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
3.3%
31
 
3.2%
29
 
3.0%
26
 
2.7%
26
 
2.7%
25
 
2.6%
25
 
2.6%
23
 
2.4%
23
 
2.4%
23
 
2.4%
Other values (199) 701
72.7%
Common
ValueCountFrequency (%)
293
94.8%
5 3
 
1.0%
2 3
 
1.0%
, 2
 
0.6%
" 2
 
0.6%
0 2
 
0.6%
( 1
 
0.3%
) 1
 
0.3%
6 1
 
0.3%
/ 1
 
0.3%
Latin
ValueCountFrequency (%)
M 1
50.0%
W 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 964 | 75.6%
ASCII | 311 | 24.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
293
94.2%
5 3
 
1.0%
2 3
 
1.0%
, 2
 
0.6%
" 2
 
0.6%
0 2
 
0.6%
M 1
 
0.3%
( 1
 
0.3%
) 1
 
0.3%
W 1
 
0.3%
Other values (2) 2
 
0.6%
Hangul
ValueCountFrequency (%)
32
 
3.3%
31
 
3.2%
29
 
3.0%
26
 
2.7%
26
 
2.7%
25
 
2.6%
25
 
2.6%
23
 
2.4%
23
 
2.4%
23
 
2.4%
Other values (199) 701
72.7%

기술료
Categorical

HIGH CORRELATION 

Distinct: 23
Distinct (%): 48.9%
Missing: 0
Missing (%): 0.0%
Memory size: 508.0 B

Length

Max length: 10
Median length: 8
Mean length: 8.3617021
Min length: 6

Unique

Unique: 16
Unique (%): 34.0%

Sample

1st row: 순이익×4.0%
2nd row: 매출액×1.25%
3rd row: 매출액×1.25%
4th row: 매출액×1.25%
5th row: 매출액×1.25%

Common Values

Value | Count | Frequency (%)
매출액×1.0% | 11 | 23.4%
매출액×1.25% | 6 | 12.8%
매출액×2.0% | 5 | 10.6%
매출액×2.5% | 3 | 6.4%
매출액×2.3% | 2 | 4.3%
매출액×2.47% | 2 | 4.3%
매출액x1.0% | 2 | 4.3%
매출액×1.71% | 1 | 2.1%
매출액×3.5% | 1 | 2.1%
매출액×2.9% | 1 | 2.1%
Other values (13) | 13 | 27.7%
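
The table lists 매출액×1.0% and 매출액x1.0% as separate categories, although they differ only in the multiplication character (the × sign versus the Latin letter x). A minimal sketch of normalizing such values before counting follows. It assumes every 기술료 value has the form basis, multiplication mark, percentage, where the basis is 매출액 (revenue) or 순이익 (net profit). That pattern is inferred from the values visible in this report, and `parse_royalty` is a hypothetical helper.

```python
import re
from collections import Counter

# Assumed format: basis, then × or x, then a percentage such as 1.25%.
ROYALTY_PATTERN = re.compile(r"^(매출액|순이익)\s*[×xX]\s*(\d+(?:\.\d+)?)%$")

def parse_royalty(value):
    """Return (basis, rate_percent), or None if the value does not match."""
    match = ROYALTY_PATTERN.match(value.strip())
    if match is None:
        return None
    return match.group(1), float(match.group(2))

# Counts for two spellings of the same rate, taken from the table above.
raw_counts = {"매출액×1.0%": 11, "매출액x1.0%": 2, "매출액×1.25%": 6}

merged = Counter()
for value, count in raw_counts.items():
    merged[parse_royalty(value)] += count

print(merged)
```

After normalization the two spellings of the 1.0% revenue royalty collapse into a single category with 13 rows.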

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
매출액×1.0 | 11 | 23.4%
매출액×1.25 | 6 | 12.8%
매출액×2.0 | 5 | 10.6%
매출액×2.5 | 3 | 6.4%
매출액×2.3 | 2 | 4.3%
매출액×2.47 | 2 | 4.3%
매출액x1.0 | 2 | 4.3%
매출액x2.62 | 1 | 2.1%
순이익×4.0 | 1 | 2.1%
매출액×0.925 | 1 | 2.1%
Other values (13) | 13 | 27.7%

이전기업
Text

Distinct: 41
Distinct (%): 87.2%
Missing: 0
Missing (%): 0.0%
Memory size: 508.0 B

Length

Max length: 8
Median length: 7
Mean length: 4.9574468
Min length: 2

Characters and Unicode

Total characters: 233
Distinct characters: 102
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 36
Unique (%): 76.6%

Sample

1st row: 대덕시스템
2nd row: 한국테크놀로지
3rd row: 에네스지
4th row: 티엠에스비
5th row: 동양구조안전기술
Value | Count | Frequency (%)
피레타 | 3 | 6.2%
토다이수㈜ | 2 | 4.2%
엔지원 | 2 | 4.2%
대덕시스템 | 2 | 4.2%
두산중공업 | 2 | 4.2%
거명파워㈜ | 1 | 2.1%
대윤계기산업 | 1 | 2.1%
프린스 | 1 | 2.1%
㈜씨이씨 | 1 | 2.1%
시너지 | 1 | 2.1%
Other values (32) | 32 | 66.7%

Most occurring characters

ValueCountFrequency (%)
12
 
5.2%
11
 
4.7%
11
 
4.7%
10
 
4.3%
8
 
3.4%
7
 
3.0%
5
 
2.1%
5
 
2.1%
5
 
2.1%
4
 
1.7%
Other values (92) 155
66.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 217 | 93.1%
Other Symbol | 10 | 4.3%
Uppercase Letter | 3 | 1.3%
Close Punctuation | 1 | 0.4%
Open Punctuation | 1 | 0.4%
Space Separator | 1 | 0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
5.5%
11
 
5.1%
11
 
5.1%
8
 
3.7%
7
 
3.2%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.8%
4
 
1.8%
Other values (85) 145
66.8%
Uppercase Letter
ValueCountFrequency (%)
X 1
33.3%
B 1
33.3%
T 1
33.3%
Other Symbol
ValueCountFrequency (%)
10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 227 | 97.4%
Latin | 3 | 1.3%
Common | 3 | 1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
5.3%
11
 
4.8%
11
 
4.8%
10
 
4.4%
8
 
3.5%
7
 
3.1%
5
 
2.2%
5
 
2.2%
5
 
2.2%
4
 
1.8%
Other values (86) 149
65.6%
Latin
ValueCountFrequency (%)
X 1
33.3%
B 1
33.3%
T 1
33.3%
Common
ValueCountFrequency (%)
) 1
33.3%
( 1
33.3%
1
33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 217 | 93.1%
None | 10 | 4.3%
ASCII | 6 | 2.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12
 
5.5%
11
 
5.1%
11
 
5.1%
8
 
3.7%
7
 
3.2%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.8%
4
 
1.8%
Other values (85) 145
66.8%
None
ValueCountFrequency (%)
10
100.0%
ASCII
ValueCountFrequency (%)
X 1
16.7%
B 1
16.7%
T 1
16.7%
) 1
16.7%
( 1
16.7%
1
16.7%

Interactions

[Figure: interactions plot]

Correlations

Variable | 순번 | 종류 | 계약년월 | 기간 | 기술명 | 기술료 | 이전기업
순번 | 1.000 | 0.495 | 0.945 | 0.579 | 0.949 | 0.782 | 0.889
종류 | 0.495 | 1.000 | 0.000 | 0.000 | 1.000 | 0.889 | 1.000
계약년월 | 0.945 | 0.000 | 1.000 | 1.000 | 0.983 | 0.791 | 0.962
기간 | 0.579 | 0.000 | 1.000 | 1.000 | 0.843 | 0.765 | 0.000
기술명 | 0.949 | 1.000 | 0.983 | 0.843 | 1.000 | 0.990 | 0.992
기술료 | 0.782 | 0.889 | 0.791 | 0.765 | 0.990 | 1.000 | 0.956
이전기업 | 0.889 | 1.000 | 0.962 | 0.000 | 0.992 | 0.956 | 1.000

Variable | 기간 | 기술료 | 종류
기간 | 1.000 | 0.352 | 0.000
기술료 | 0.352 | 1.000 | 0.545
종류 | 0.000 | 0.545 | 1.000

Variable | 순번 | 종류 | 기간 | 기술료
순번 | 1.000 | 0.308 | 0.328 | 0.296
종류 | 0.308 | 1.000 | 0.000 | 0.545
기간 | 0.328 | 0.000 | 1.000 | 0.352
기술료 | 0.296 | 0.545 | 0.352 | 1.000
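
Most variables here are categorical, so the coefficients above are measures of association rather than linear correlation. One common choice for two categorical columns is Cramér's V, sketched below in plain Python without bias correction. Whether any of the matrices above was computed this way is an assumption; the report text does not name the measure. The toy inputs are made up for illustration and are not rows from the dataset.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Cramér's V (no bias correction) between two equal-length label sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return sqrt(chi2 / (n * k))

# Toy labels: the first pair is perfectly associated, the second independent.
print(cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"]))
print(cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"]))
```

With only 47 rows and many near-unique categories, values close to 1.000 are expected almost by construction, so the high coefficients involving 기술명 and 이전기업 should be read with caution.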

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 순번 | 종류 | 계약년월 | 기간 | 기술명 | 기술료 | 이전기업
0 | 1 | 특허 | 2008-05 | 15년 | 고전압 설비의 부분방전 감시센서 및 그 제조방법 | 순이익×4.0% | 대덕시스템
1 | 2 | 특허 | 2013-04 | 10년 | 과열증기를 이용한 석탄건조 시스템 | 매출액×1.25% | 한국테크놀로지
2 | 3 | 특허 | 2014-08 | 10년 | 배관의 모재 및 용접부 초음파 검사장치 | 매출액×1.25% | 에네스지
3 | 4 | 특허 | 2014-12 | 10년 | 2차전지의 수명 예측장치 및 방법 | 매출액×1.25% | 티엠에스비
4 | 5 | 특허 | 2015-01 | 10년 | 보부재 보강장치 및 이를 이용한 경량지붕틀의 보강구조 | 매출액×1.25% | 동양구조안전기술
5 | 6 | 특허 | 2016-06 | 5년 | 전동기 제어반 고장예측 진단장치 | 매출액×1.0% | 제스 엔지니어링
6 | 7 | 특허 | 2016-06 | 5년 | 음성경보시스템 | 매출액×2.5% | 소프트커널
7 | 8 | 특허 | 2016-08 | 5년 | 비계용 단위 조립체 및 그것을 구비한 비계 등 특허 6건 | 매출액×2.3% | 한국가설협회
8 | 9 | 특허 | 2016-08 | 5년 | 증기터빈제어시스템 검증용 시뮬레이터 | 매출액×2.47% | 지엔피시스템
9 | 10 | 특허 | 2017-06 | 5년 | 탈황 설비내 가스 쿨러 및 리히터의 세정 장치 | 매출액×1.0% | 케이엘이에스

Last rows

Index | 순번 | 종류 | 계약년월 | 기간 | 기술명 | 기술료 | 이전기업
37 | 38 | 특허 | 2021-05 | 5년 | 수냉식 발전기 고정자 권선에 대한 누수시험 장치 및 그 진단방법 | 매출액×2.0% | 엔지원
38 | 39 | 특허 | 2021-05 | 5년 | 수냉식 발전기 고정자 권선에 대한 누수시험 장치 및 그 진단방법 | 매출액×2.0% | 피레타
39 | 40 | 특허 | 2021-05 | 5년 | 발전기 고정자 권선의 전용 진동 진단 방법 및 분석시스템 | 매출액×2.0% | 엔지원
40 | 41 | 특허 | 2021-05 | 5년 | 발전기 고정자 권선의 전용 진동 진단 방법 및 분석시스템 | 매출액×2.0% | 피레타
41 | 42 | 특허 | 2021-09 | 5년 | 볼밸브 | 매출액×1.0% | BTX
42 | 43 | 특허 | 2021-09 | 5년 | 화염온도 측정이 가능한 스마트형 화염 검출장치 | 매출액×2.5% | ㈜에스텍
43 | 44 | 특허 | 2020-11 | 5년 | 점화 장치 이중화 시스템 | 매출액×1.0% | ㈜고려엔지니어링
44 | 45 | 프로그램 | 2018-02 | 5년 | 500MW급 시뮬레이터 탑재형 디지털 삼중화 발전기 자동전압제어시스템(대상기술 5건, 서부 보유기술 프로그램 2건) | 매출액×1.64% | 금화씨앤이㈜
45 | 46 | 연구성과물 | 2018-09 | 5년 | "황연저감설비 성능개선을 위한 기술개발" 과제의 성과물 | 매출액×1.0% | (주)이엠코
46 | 47 | 연구성과물 | 2019-11 | 5년 | 발전현장 스마트 모바일 점검 시스템 기술 | 매출액×0.32% | 한울주식회사