Overview

Dataset statistics

Number of variables3
Number of observations75
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory26.8 B

Variable types

Numeric1
Categorical1
Text1

Dataset

Description중소기업 구매조건부신제품개발사업 공동투자형 사업 관련, 중소기업과의 기술협력 촉진을 위해 중소벤처기업부와 공동투자 기술개발 자금을 출연하는 투자기업 형태, 기관명 등 정보
Author중소기업기술정보진흥원
URLhttps://www.data.go.kr/data/15089155/fileData.do

Alerts

구분 is highly overall correlated with 투자기업형태High correlation
투자기업형태 is highly overall correlated with 구분High correlation
구분 has unique valuesUnique
투자기업명 has unique valuesUnique

Reproduction

Analysis started2023-12-23 07:19:03.948049
Analysis finished2023-12-23 07:19:05.427019
Duration1.48 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct75
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean38
Minimum1
Maximum75
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size807.0 B
2023-12-23T07:19:05.729491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.7
Q119.5
median38
Q356.5
95-th percentile71.3
Maximum75
Range74
Interquartile range (IQR)37

Descriptive statistics

Standard deviation21.794495
Coefficient of variation (CV)0.57353933
Kurtosis-1.2
Mean38
Median Absolute Deviation (MAD)19
Skewness0
Sum2850
Variance475
MonotonicityStrictly increasing
2023-12-23T07:19:06.392634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.3%
49 1
 
1.3%
56 1
 
1.3%
55 1
 
1.3%
54 1
 
1.3%
53 1
 
1.3%
52 1
 
1.3%
51 1
 
1.3%
50 1
 
1.3%
48 1
 
1.3%
Other values (65) 65
86.7%
ValueCountFrequency (%)
1 1
1.3%
2 1
1.3%
3 1
1.3%
4 1
1.3%
5 1
1.3%
6 1
1.3%
7 1
1.3%
8 1
1.3%
9 1
1.3%
10 1
1.3%
ValueCountFrequency (%)
75 1
1.3%
74 1
1.3%
73 1
1.3%
72 1
1.3%
71 1
1.3%
70 1
1.3%
69 1
1.3%
68 1
1.3%
67 1
1.3%
66 1
1.3%

투자기업형태
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size732.0 B
중견기업
30 
공공기관
27 
대기업
18 

Length

Max length4
Median length4
Mean length3.76
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대기업
2nd row대기업
3rd row대기업
4th row대기업
5th row대기업

Common Values

ValueCountFrequency (%)
중견기업 30
40.0%
공공기관 27
36.0%
대기업 18
24.0%

Length

2023-12-23T07:19:07.145779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-23T07:19:07.616743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중견기업 30
40.0%
공공기관 27
36.0%
대기업 18
24.0%

투자기업명
Text

UNIQUE 

Distinct75
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size732.0 B
2023-12-23T07:19:08.393669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length7
Mean length5.4266667
Min length2

Characters and Unicode

Total characters407
Distinct characters142
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)100.0%

Sample

1st row포스코
2nd row르노코리아자동차
3rd row삼성디스플레이
4th row삼성전기
5th row삼성에스디아이
ValueCountFrequency (%)
포스코 1
 
1.3%
아세아텍 1
 
1.3%
한국수력원자력 1
 
1.3%
한국중부발전 1
 
1.3%
한국남부발전 1
 
1.3%
한국남동발전 1
 
1.3%
한국서부발전 1
 
1.3%
한국동서발전 1
 
1.3%
한국전력공사 1
 
1.3%
광동제약 1
 
1.3%
Other values (65) 65
86.7%
2023-12-23T07:19:10.152978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23
 
5.7%
21
 
5.2%
20
 
4.9%
20
 
4.9%
16
 
3.9%
16
 
3.9%
12
 
2.9%
10
 
2.5%
9
 
2.2%
8
 
2.0%
Other values (132) 252
61.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 397
97.5%
Uppercase Letter 10
 
2.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
5.8%
21
 
5.3%
20
 
5.0%
20
 
5.0%
16
 
4.0%
16
 
4.0%
12
 
3.0%
10
 
2.5%
9
 
2.3%
8
 
2.0%
Other values (126) 242
61.0%
Uppercase Letter
ValueCountFrequency (%)
J 2
20.0%
C 2
20.0%
W 2
20.0%
K 2
20.0%
N 1
10.0%
D 1
10.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 397
97.5%
Latin 10
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
5.8%
21
 
5.3%
20
 
5.0%
20
 
5.0%
16
 
4.0%
16
 
4.0%
12
 
3.0%
10
 
2.5%
9
 
2.3%
8
 
2.0%
Other values (126) 242
61.0%
Latin
ValueCountFrequency (%)
J 2
20.0%
C 2
20.0%
W 2
20.0%
K 2
20.0%
N 1
10.0%
D 1
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 397
97.5%
ASCII 10
 
2.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
23
 
5.8%
21
 
5.3%
20
 
5.0%
20
 
5.0%
16
 
4.0%
16
 
4.0%
12
 
3.0%
10
 
2.5%
9
 
2.3%
8
 
2.0%
Other values (126) 242
61.0%
ASCII
ValueCountFrequency (%)
J 2
20.0%
C 2
20.0%
W 2
20.0%
K 2
20.0%
N 1
10.0%
D 1
10.0%

Interactions

2023-12-23T07:19:04.415378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-23T07:19:10.660466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분투자기업형태투자기업명
구분1.0000.9351.000
투자기업형태0.9351.0001.000
투자기업명1.0001.0001.000
2023-12-23T07:19:11.127242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분투자기업형태
구분1.0000.869
투자기업형태0.8691.000

Missing values

2023-12-23T07:19:04.976744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-23T07:19:05.278419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분투자기업형태투자기업명
01대기업포스코
12대기업르노코리아자동차
23대기업삼성디스플레이
34대기업삼성전기
45대기업삼성에스디아이
56대기업삼성전자
67대기업한화에어로스페이스
78대기업엔에스쇼핑
89대기업엘지전자
910대기업현대자동차
구분투자기업형태투자기업명
6566공공기관부산항만공사
6667공공기관한국도로공사
6768공공기관한국가스기술공사
6869공공기관한국조폐공사
6970공공기관한전KDN
7071공공기관에스알
7172공공기관도로교통공단
7273공공기관여수광양항만공사
7374공공기관한국교통안전공단
7475공공기관강원랜드