Overview

Dataset statistics

Number of variables6
Number of observations23
Missing cells1
Missing cells (%)0.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory54.7 B

Variable types

Text1
Numeric1
Categorical4

Dataset

Description비대면 기술 보유기업의 기술사업화 가능성 진단 결과 및 지원내용1. 진단결과 사업화 유형분류에 따른 지원항목 및 내용① TC(사업화유망기업) -사업화 지원 - 사업화 기획, 제품 성능 테스트, 시장 마케팅② MC(기술강화추진기업) - 시장친화형 기능개선 - 상용화를 위한 기능개선, 성능향상 등을 위한 RND 지원③ TM(사업화기술보유기업) - 기술이전 - 기술보증기금 Tech Bridge 등록 지원2. 시장친화형기능개선① 추가RND : 상용화를 위한 기능개선, 성능향상 등을 위한 RND 지원
Author중소벤처기업진흥공단
URLhttps://www.data.go.kr/data/15071332/fileData.do

Alerts

진단결과 is highly overall correlated with 추가 지원내용High correlation
추가 지원내용 is highly overall correlated with 진단결과High correlation
종업원수 has 1 (4.3%) missing valuesMissing
업체명 has unique valuesUnique
종업원수 has 2 (8.7%) zerosZeros

Reproduction

Analysis started2023-12-12 22:21:29.549657
Analysis finished2023-12-12 22:21:30.162578
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체명
Text

UNIQUE 

Distinct23
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size316.0 B
2023-12-13T07:21:30.258052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length5.6086957
Min length3

Characters and Unicode

Total characters129
Distinct characters23
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)100.0%

Sample

1st row씨*******
2nd row팩******
3rd row데****
4th row달******
5th row그*****
ValueCountFrequency (%)
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (12) 12
52.2%
2023-12-13T07:21:30.542487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 106
82.2%
2
 
1.6%
1
 
0.8%
1
 
0.8%
1
 
0.8%
1
 
0.8%
1
 
0.8%
1
 
0.8%
1
 
0.8%
1
 
0.8%
Other values (13) 13
 
10.1%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 106
82.2%
Other Letter 23
 
17.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (12) 12
52.2%
Other Punctuation
ValueCountFrequency (%)
* 106
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 106
82.2%
Hangul 23
 
17.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (12) 12
52.2%
Common
ValueCountFrequency (%)
* 106
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 106
82.2%
Hangul 23
 
17.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 106
100.0%
Hangul
ValueCountFrequency (%)
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (12) 12
52.2%

종업원수
Real number (ℝ)

MISSING  ZEROS 

Distinct16
Distinct (%)72.7%
Missing1
Missing (%)4.3%
Infinite0
Infinite (%)0.0%
Mean34.545455
Minimum0
Maximum397
Zeros2
Zeros (%)8.7%
Negative0
Negative (%)0.0%
Memory size339.0 B
2023-12-13T07:21:30.649933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.05
Q14
median7
Q313.5
95-th percentile175.95
Maximum397
Range397
Interquartile range (IQR)9.5

Descriptive statistics

Standard deviation89.505375
Coefficient of variation (CV)2.5909451
Kurtosis14.062712
Mean34.545455
Median Absolute Deviation (MAD)4.5
Skewness3.6858671
Sum760
Variance8011.2121
MonotonicityNot monotonic
2023-12-13T07:21:30.757230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
7 3
13.0%
9 2
 
8.7%
14 2
 
8.7%
4 2
 
8.7%
0 2
 
8.7%
2 1
 
4.3%
12 1
 
4.3%
397 1
 
4.3%
183 1
 
4.3%
42 1
 
4.3%
Other values (6) 6
26.1%
ValueCountFrequency (%)
0 2
8.7%
1 1
 
4.3%
2 1
 
4.3%
3 1
 
4.3%
4 2
8.7%
5 1
 
4.3%
6 1
 
4.3%
7 3
13.0%
9 2
8.7%
10 1
 
4.3%
ValueCountFrequency (%)
397 1
 
4.3%
183 1
 
4.3%
42 1
 
4.3%
24 1
 
4.3%
14 2
8.7%
12 1
 
4.3%
10 1
 
4.3%
9 2
8.7%
7 3
13.0%
6 1
 
4.3%

지역
Categorical

Distinct9
Distinct (%)39.1%
Missing0
Missing (%)0.0%
Memory size316.0 B
경기도
서울특별시
광주광역시
경상남도
충청남도
Other values (4)

Length

Max length5
Median length4
Mean length4.0434783
Min length3

Unique

Unique6 ?
Unique (%)26.1%

Sample

1st row경상남도
2nd row충청남도
3rd row서울특별시
4th row서울특별시
5th row광주광역시

Common Values

ValueCountFrequency (%)
경기도 9
39.1%
서울특별시 6
26.1%
광주광역시 2
 
8.7%
경상남도 1
 
4.3%
충청남도 1
 
4.3%
경상북도 1
 
4.3%
대전광역시 1
 
4.3%
전라북도 1
 
4.3%
대구광역시 1
 
4.3%

Length

2023-12-13T07:21:30.885017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:21:30.994123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 9
39.1%
서울특별시 6
26.1%
광주광역시 2
 
8.7%
경상남도 1
 
4.3%
충청남도 1
 
4.3%
경상북도 1
 
4.3%
대전광역시 1
 
4.3%
전라북도 1
 
4.3%
대구광역시 1
 
4.3%
Distinct6
Distinct (%)26.1%
Missing0
Missing (%)0.0%
Memory size316.0 B
빅데이터, AI
10 
사물인터넷(IoT)
정보보안
지능형/차세대 반도체
드론·무인기, 자율차
 
1

Length

Max length22
Median length11
Mean length8.7826087
Min length4

Unique

Unique2 ?
Unique (%)8.7%

Sample

1st row드론·무인기, 자율차
2nd row빅데이터, AI
3rd row빅데이터, AI
4th row빅데이터, AI
5th row빅데이터, AI

Common Values

ValueCountFrequency (%)
빅데이터, AI 10
43.5%
사물인터넷(IoT) 4
 
17.4%
정보보안 4
 
17.4%
지능형/차세대 반도체 3
 
13.0%
드론·무인기, 자율차 1
 
4.3%
빅데이터, AI, 원격의료(진단, 치료) 1
 
4.3%

Length

2023-12-13T07:21:31.114332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:21:31.219857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
빅데이터 11
27.5%
ai 11
27.5%
사물인터넷(iot 4
 
10.0%
정보보안 4
 
10.0%
지능형/차세대 3
 
7.5%
반도체 3
 
7.5%
드론·무인기 1
 
2.5%
자율차 1
 
2.5%
원격의료(진단 1
 
2.5%
치료 1
 
2.5%

진단결과
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)17.4%
Missing0
Missing (%)0.0%
Memory size316.0 B
TC
10 
MC
TM
해당없음

Length

Max length4
Median length2
Mean length2.2608696
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowTC
2nd rowTM
3rd rowTM
4th rowMC
5th rowTC

Common Values

ValueCountFrequency (%)
TC 10
43.5%
MC 7
30.4%
TM 3
 
13.0%
해당없음 3
 
13.0%

Length

2023-12-13T07:21:31.353617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:21:31.447796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
tc 10
43.5%
mc 7
30.4%
tm 3
 
13.0%
해당없음 3
 
13.0%

추가 지원내용
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)13.0%
Missing0
Missing (%)0.0%
Memory size316.0 B
사업화지원
시장친화형기능개선(R&D)
<NA>

Length

Max length14
Median length5
Mean length7.826087
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업화지원
2nd row<NA>
3rd row<NA>
4th row시장친화형기능개선(R&D)
5th row사업화지원

Common Values

ValueCountFrequency (%)
사업화지원 8
34.8%
시장친화형기능개선(R&D) 8
34.8%
<NA> 7
30.4%

Length

2023-12-13T07:21:31.572076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:21:31.666306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업화지원 8
34.8%
시장친화형기능개선(r&d 8
34.8%
na 7
30.4%

Interactions

2023-12-13T07:21:29.828160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:21:31.742852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명종업원수지역비대면 신청기술진단결과추가 지원내용
업체명1.0001.0001.0001.0001.0001.000
종업원수1.0001.0000.0000.0000.0000.116
지역1.0000.0001.0000.7360.2830.000
비대면 신청기술1.0000.0000.7361.0000.4710.000
진단결과1.0000.0000.2830.4711.0000.916
추가 지원내용1.0000.1160.0000.0000.9161.000
2023-12-13T07:21:31.837177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역추가 지원내용진단결과비대면 신청기술
지역1.0000.0000.0860.413
추가 지원내용0.0001.0000.7350.000
진단결과0.0860.7351.0000.287
비대면 신청기술0.4130.0000.2871.000
2023-12-13T07:21:31.923651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종업원수지역비대면 신청기술진단결과추가 지원내용
종업원수1.0000.0000.0000.0000.000
지역0.0001.0000.4130.0860.000
비대면 신청기술0.0000.4131.0000.2870.000
진단결과0.0000.0860.2871.0000.735
추가 지원내용0.0000.0000.0000.7351.000

Missing values

2023-12-13T07:21:29.974557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:21:30.119732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명종업원수지역비대면 신청기술진단결과추가 지원내용
0씨*******9경상남도드론·무인기, 자율차TC사업화지원
1팩******7충청남도빅데이터, AITM<NA>
2데****14서울특별시빅데이터, AITM<NA>
3달******4서울특별시빅데이터, AIMC시장친화형기능개선(R&D)
4그*****7광주광역시빅데이터, AITC사업화지원
5힐****10서울특별시빅데이터, AI, 원격의료(진단, 치료)TM<NA>
6나*******4경기도사물인터넷(IoT)TC사업화지원
7위**7경기도사물인터넷(IoT)TC사업화지원
8케********<NA>경상북도사물인터넷(IoT)해당없음<NA>
9제***0경기도정보보안MC시장친화형기능개선(R&D)
업체명종업원수지역비대면 신청기술진단결과추가 지원내용
13더*****9경기도지능형/차세대 반도체해당없음<NA>
14브****0경기도지능형/차세대 반도체해당없음<NA>
15지*********5대구광역시빅데이터, AITC사업화지원
16우*****6경기도사물인터넷(IoT)MC시장친화형기능개선(R&D)
17담*******2광주광역시빅데이터, AIMC시장친화형기능개선(R&D)
18청****14경기도빅데이터, AITC사업화지원
19기***42서울특별시빅데이터, AITC사업화지원
20스**183서울특별시빅데이터, AIMC시장친화형기능개선(R&D)
21인**397서울특별시빅데이터, AIMC시장친화형기능개선(R&D)
22알**12경기도정보보안TC사업화지원