Overview

Dataset statistics

Number of variables: 8
Number of observations: 152
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.8 KiB
Average record size in memory: 65.9 B
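
The average record size is the total in-memory size divided by the number of observations. A minimal check of that arithmetic follows, using only the rounded figures reported above; because 9.8 KiB is itself rounded, the result should only be expected to land close to the reported 65.9 B, not match it exactly.

```python
# Figures copied from the dataset statistics (already rounded by the report).
total_size_kib = 9.8
n_observations = 152

# 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B per record")
```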

Variable types

Categorical: 4
Text: 2
DateTime: 2

Dataset

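Description: Information on the technology-specialist companies (K-ESP) selected in 2017-2018 by the Korea Technology and Information Promotion Agency for SMEs (중소기업기술정보진흥원), including each company's service field, region, and designation period. The companies hold field-specific expertise in areas such as testing and analysis, research and development, design, clinical trials, and prototype production. Companies that withdrew from the selection or closed after designation are excluded.
Author: 중소기업기술정보진흥원 (Korea Technology and Information Promotion Agency for SMEs)
URL: https://www.data.go.kr/data/15038898/fileData.do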

Alerts

업체명 has unique values (Unique)
연락처 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 03:55:42.295704
Analysis finished: 2023-12-12 03:55:43.133239
Duration: 0.84 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

지정연도
Categorical

Distinct: 2
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
2017 | 92
2018 | 60

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2017
2nd row: 2017
3rd row: 2017
4th row: 2017
5th row: 2017

Common Values

Value | Count | Frequency (%)
2017 | 92 | 60.5%
2018 | 60 | 39.5%
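
Each frequency is the value's count divided by the 152 observations. The following sketch reproduces the two percentages from the counts reported for 지정연도; the helper name `frequency_percent` is illustrative and not part of ydata-profiling.

```python
from collections import Counter

# Counts taken from the Common Values table for 지정연도.
counts = Counter({"2017": 92, "2018": 60})
total = sum(counts.values())

def frequency_percent(value: str) -> float:
    """Share of rows holding `value`, in percent, rounded to one decimal."""
    return round(100 * counts[value] / total, 1)

for value, count in counts.most_common():
    print(value, count, f"{frequency_percent(value)}%")
```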

Length

Histogram of lengths of the category


업체명
Text

UNIQUE 

Distinct: 152
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 15
Median length: 13
Mean length: 7.2302632
Min length: 2

Characters and Unicode

Total characters: 1099
Distinct characters: 215
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
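
As a rough illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module to count general categories for three company names from this variable's sample rows. It is not the profiler's own implementation; the two-letter codes map to the category names used in this report (`Lo` = Other Letter, `Lu` = Uppercase Letter, `Ps` = Open Punctuation, `Pe` = Close Punctuation).

```python
import unicodedata
from collections import Counter

# Three company names taken from this variable's sample rows.
names = ["(주)코엔", "젠디자인", "지디피(GDP)"]

def category_counts(values):
    """Count Unicode general categories (e.g. 'Lo', 'Ps') over all characters."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)

counts = category_counts(names)
for code, n in counts.most_common():
    print(code, n)
```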

Unique

Unique: 152
Unique (%): 100.0%

Sample

1st row: (주)가람디자인컨설팅
2nd row: (주)고스디자인
3rd row: (주)코엔
4th row: 젠디자인
5th row: 지디피(GDP)

Value | Count | Frequency (%)
주)가람디자인컨설팅 | 1 | 0.7%
아이듀오 | 1 | 0.7%
파노이앤디 | 1 | 0.7%
joy디자인 | 1 | 0.7%
다래전략사업화센터 | 1 | 0.7%
디자인부산 | 1 | 0.7%
디토 | 1 | 0.7%
마코 | 1 | 0.7%
모트 | 1 | 0.7%
아트핸즈 | 1 | 0.7%
Other values (142) | 142 | 93.4%

Most occurring characters

ValueCountFrequency (%)
( 110
 
10.0%
) 110
 
10.0%
108
 
9.8%
45
 
4.1%
38
 
3.5%
35
 
3.2%
25
 
2.3%
20
 
1.8%
18
 
1.6%
17
 
1.5%
Other values (205) 573
52.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 855 | 77.8%
Open Punctuation | 110 | 10.0%
Close Punctuation | 110 | 10.0%
Uppercase Letter | 15 | 1.4%
Other Punctuation | 3 | 0.3%
Decimal Number | 3 | 0.3%
Lowercase Letter | 2 | 0.2%
Other Symbol | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
108
 
12.6%
45
 
5.3%
38
 
4.4%
35
 
4.1%
25
 
2.9%
20
 
2.3%
18
 
2.1%
17
 
2.0%
14
 
1.6%
12
 
1.4%
Other values (185) 523
61.2%
Uppercase Letter
ValueCountFrequency (%)
I 3
20.0%
D 2
13.3%
V 2
13.3%
J 1
 
6.7%
N 1
 
6.7%
R 1
 
6.7%
G 1
 
6.7%
P 1
 
6.7%
S 1
 
6.7%
T 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
& 1
33.3%
Decimal Number
ValueCountFrequency (%)
2 2
66.7%
1 1
33.3%
Lowercase Letter
ValueCountFrequency (%)
o 1
50.0%
y 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 110
100.0%
Close Punctuation
ValueCountFrequency (%)
) 110
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 856 | 77.9%
Common | 226 | 20.6%
Latin | 17 | 1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
108
 
12.6%
45
 
5.3%
38
 
4.4%
35
 
4.1%
25
 
2.9%
20
 
2.3%
18
 
2.1%
17
 
2.0%
14
 
1.6%
12
 
1.4%
Other values (186) 524
61.2%
Latin
ValueCountFrequency (%)
I 3
17.6%
D 2
11.8%
V 2
11.8%
J 1
 
5.9%
o 1
 
5.9%
y 1
 
5.9%
N 1
 
5.9%
R 1
 
5.9%
G 1
 
5.9%
P 1
 
5.9%
Other values (3) 3
17.6%
Common
ValueCountFrequency (%)
( 110
48.7%
) 110
48.7%
. 2
 
0.9%
2 2
 
0.9%
1 1
 
0.4%
& 1
 
0.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 855 | 77.8%
ASCII | 243 | 22.1%
None | 1 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 110
45.3%
) 110
45.3%
I 3
 
1.2%
D 2
 
0.8%
V 2
 
0.8%
. 2
 
0.8%
2 2
 
0.8%
J 1
 
0.4%
o 1
 
0.4%
y 1
 
0.4%
Other values (9) 9
 
3.7%
Hangul
ValueCountFrequency (%)
108
 
12.6%
45
 
5.3%
38
 
4.4%
35
 
4.1%
25
 
2.9%
20
 
2.3%
18
 
2.1%
17
 
2.0%
14
 
1.6%
12
 
1.4%
Other values (185) 523
61.2%
None
ValueCountFrequency (%)
1
100.0%

서비스 분야
Categorical

Distinct: 6
Distinct (%): 3.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
디자인 | 53
연구개발 | 47
시제품제작 | 20
시험ㆍ분석 | 16
설계ㆍ해석 | 12

Length

Max length: 5
Median length: 4
Mean length: 3.9144737
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 디자인
2nd row: 디자인
3rd row: 디자인
4th row: 디자인
5th row: 디자인

Common Values

Value | Count | Frequency (%)
디자인 | 53 | 34.9%
연구개발 | 47 | 30.9%
시제품제작 | 20 | 13.2%
시험ㆍ분석 | 16 | 10.5%
설계ㆍ해석 | 12 | 7.9%
임상 | 4 | 2.6%

Length

Histogram of lengths of the category


지역
Categorical

Distinct: 13
Distinct (%): 8.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
서울 | 47
경기 | 28
대전 | 19
부산 | 15
대구 | 12
Other values (8) | 31

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 2
Unique (%): 1.3%

Sample

1st row: 경기
2nd row: 경기
3rd row: 경기
4th row: 경기
5th row: 경북

Common Values

Value | Count | Frequency (%)
서울 | 47 | 30.9%
경기 | 28 | 18.4%
대전 | 19 | 12.5%
부산 | 15 | 9.9%
대구 | 12 | 7.9%
광주 | 10 | 6.6%
전북 | 5 | 3.3%
경남 | 5 | 3.3%
경북 | 4 | 2.6%
인천 | 3 | 2.0%
Other values (3) | 4 | 2.6%

Length

Histogram of lengths of the category

산업기술분류
Categorical

Distinct: 11
Distinct (%): 7.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count
지식서비스 | 85
정보통신 | 26
기계ㆍ소재 | 12
바이오ㆍ의료 | 9
기계소재 | 7
Other values (6) | 13

Length

Max length: 6
Median length: 5
Mean length: 4.7894737
Min length: 2

Unique

Unique: 2
Unique (%): 1.3%

Sample

1st row: 지식서비스
2nd row: 지식서비스
3rd row: 지식서비스
4th row: 지식서비스
5th row: 지식서비스

Common Values

Value | Count | Frequency (%)
지식서비스 | 85 | 55.9%
정보통신 | 26 | 17.1%
기계ㆍ소재 | 12 | 7.9%
바이오ㆍ의료 | 9 | 5.9%
기계소재 | 7 | 4.6%
전기전자 | 4 | 2.6%
전기ㆍ전자 | 3 | 2.0%
화학 | 2 | 1.3%
에너지자원 | 2 | 1.3%
에너지ㆍ자원 | 1 | 0.7%
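
Several labels in this column differ only by the ㆍ separator (for example 기계ㆍ소재 and 기계소재, or 전기전자 and 전기ㆍ전자). The report does not say whether these denote the same category; assuming they do, a minimal sketch of merging them could look like the following. The `normalize` rule is hypothetical and the counts are copied from the ten values listed above.

```python
from collections import Counter

# Counts copied from the Common Values table (ten listed values).
raw_counts = {
    "지식서비스": 85, "정보통신": 26, "기계ㆍ소재": 12, "바이오ㆍ의료": 9,
    "기계소재": 7, "전기전자": 4, "전기ㆍ전자": 3, "화학": 2,
    "에너지자원": 2, "에너지ㆍ자원": 1,
}

SEPARATOR = "ㆍ"  # the dot character exactly as it appears in the labels

def normalize(label: str) -> str:
    """Hypothetical rule: drop the separator and any surrounding whitespace."""
    return label.replace(SEPARATOR, "").strip()

# Sum the counts of labels that collapse to the same normalized form.
merged = Counter()
for label, count in raw_counts.items():
    merged[normalize(label)] += count

for label, count in merged.most_common():
    print(label, count)
```

Under this assumption the ten listed labels collapse to seven; whether that merge is appropriate depends on how the source agency defines the categories.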

Length

Histogram of lengths of the category

연락처
Text

UNIQUE 

Distinct: 152
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.993421
Min length: 9

Characters and Unicode

Total characters: 1823
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 152
Unique (%): 100.0%

Sample

1st row: 031-753-5596
2nd row: 070-8633-8510
3rd row: 031-476-1390
4th row: 031-8041-1767
5th row: 054-471-2328
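
The sample values all follow a prefix-block-number layout separated by hyphens. The sketch below checks values against an assumed pattern (2 to 3 digit prefix, 3 to 4 digit middle block, 4 digit line number); the pattern is inferred from the samples only and is not stated by the data provider. Strings matching it are 11 to 13 characters long, so the reported minimum length of 9 implies that at least one row in the full column does not fit.

```python
import re

# Sample values copied from this variable's sample rows.
samples = [
    "031-753-5596",
    "070-8633-8510",
    "031-476-1390",
    "031-8041-1767",
    "054-471-2328",
]

# Assumed layout: 2-3 digit prefix, 3-4 digit middle block, 4 digit line number.
PHONE_PATTERN = re.compile(r"\d{2,3}-\d{3,4}-\d{4}")

def is_standard_format(value: str) -> bool:
    """True when the whole string matches the assumed three-part layout."""
    return PHONE_PATTERN.fullmatch(value) is not None

for value in samples:
    print(value, is_standard_format(value))
```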

Value | Count | Frequency (%)
031-753-5596 | 1 | 0.7%
02-3448-5244 | 1 | 0.7%
062-383-1370 | 1 | 0.7%
042-825-8564 | 1 | 0.7%
02-3475-7880 | 1 | 0.7%
051-936-3782 | 1 | 0.7%
02-3446-0971 | 1 | 0.7%
051-417-8969 | 1 | 0.7%
02-3477-7793 | 1 | 0.7%
051-741-8342 | 1 | 0.7%
Other values (142) | 142 | 93.4%

Most occurring characters

Value | Count | Frequency (%)
- | 301 | 16.5%
0 | 289 | 15.9%
2 | 186 | 10.2%
5 | 162 | 8.9%
3 | 156 | 8.6%
1 | 149 | 8.2%
7 | 131 | 7.2%
4 | 128 | 7.0%
8 | 115 | 6.3%
6 | 113 | 6.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1522 | 83.5%
Dash Punctuation | 301 | 16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 289
19.0%
2 186
12.2%
5 162
10.6%
3 156
10.2%
1 149
9.8%
7 131
8.6%
4 128
8.4%
8 115
 
7.6%
6 113
 
7.4%
9 93
 
6.1%
Dash Punctuation
ValueCountFrequency (%)
- 301
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1823
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 301
16.5%
0 289
15.9%
2 186
10.2%
5 162
8.9%
3 156
8.6%
1 149
8.2%
7 131
7.2%
4 128
7.0%
8 115
 
6.3%
6 113
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1823
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 301
16.5%
0 289
15.9%
2 186
10.2%
5 162
8.9%
3 156
8.6%
1 149
8.2%
7 131
7.2%
4 128
7.0%
8 115
 
6.3%
6 113
 
6.2%

기술전문기업 지정일
DateTime

Distinct: 3
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB
Minimum: 2017-03-27 00:00:00
Maximum: 2018-04-05 00:00:00
Histogram with fixed size bins (bins=3)

기술전문기업 지정만료일
DateTime

Distinct: 3
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB
Minimum: 2020-03-26 00:00:00
Maximum: 2020-07-16 00:00:00
Histogram with fixed size bins (bins=3)
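
The designation date (기술전문기업 지정일) and expiry date (기술전문기업 지정만료일) can be compared row by row to get the length of each designation period. The sketch below does this for the two date pairs that appear in the report's sample rows; it suggests a three-year period for the 2017-07-17 cohort and a two-year period for the 2018-04-05 cohort, but that reading rests on those sample rows alone. The helper name `designation_days` is illustrative.

```python
from datetime import date

# (designation date, expiry date) pairs copied from the report's sample rows.
pairs = [
    (date(2017, 7, 17), date(2020, 7, 16)),
    (date(2018, 4, 5), date(2020, 4, 4)),
]

def designation_days(start: date, end: date) -> int:
    """Length of the designation period in days, counting both endpoints."""
    return (end - start).days + 1

for start, end in pairs:
    print(start, end, designation_days(start, end))
```

Counting both endpoints gives 1096 and 731 days, which are exactly three and two calendar years once the leap day in February 2020 is included.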

Correlations

Variable | 지정연도 | 서비스 분야 | 지역 | 산업기술분류 | 기술전문기업 지정일 | 기술전문기업 지정만료일
지정연도 | 1.000 | 0.492 | 0.171 | 0.328 | 1.000 | 1.000
서비스 분야 | 0.492 | 1.000 | 0.519 | 0.648 | 0.716 | 0.716
지역 | 0.171 | 0.519 | 1.000 | 0.485 | 0.249 | 0.249
산업기술분류 | 0.328 | 0.648 | 0.485 | 1.000 | 0.518 | 0.518
기술전문기업 지정일 | 1.000 | 0.716 | 0.249 | 0.518 | 1.000 | 1.000
기술전문기업 지정만료일 | 1.000 | 0.716 | 0.249 | 0.518 | 1.000 | 1.000

Variable | 서비스 분야 | 지역 | 지정연도 | 산업기술분류
서비스 분야 | 1.000 | 0.280 | 0.350 | 0.395
지역 | 0.280 | 1.000 | 0.152 | 0.214
지정연도 | 0.350 | 0.152 | 1.000 | 0.304
산업기술분류 | 0.395 | 0.214 | 0.304 | 1.000

Variable | 지정연도 | 서비스 분야 | 지역 | 산업기술분류
지정연도 | 1.000 | 0.350 | 0.152 | 0.304
서비스 분야 | 0.350 | 1.000 | 0.280 | 0.395
지역 | 0.152 | 0.280 | 1.000 | 0.214
산업기술분류 | 0.304 | 0.395 | 0.214 | 1.000
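
This extract does not say which coefficient each matrix uses. For pairs of categorical variables, one common association measure is Cramér's V, and the sketch below implements its plain, uncorrected textbook form. It is offered only to show how a value between 0 and 1 can arise from two categorical columns; it is not necessarily what ydata-profiling computed here, and the input lists are illustrative rather than the real columns.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    x_totals = Counter(xs)
    y_totals = Counter(ys)
    # Pearson chi-square statistic over the full contingency table.
    chi2 = 0.0
    for x, nx in x_totals.items():
        for y, ny in y_totals.items():
            expected = nx * ny / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_totals), len(y_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Illustrative values: each year maps to exactly one date (perfect association).
years = ["2017", "2017", "2018", "2018"]
dates = ["2017-07-17", "2017-07-17", "2018-04-05", "2018-04-05"]
print(cramers_v(years, dates))

# Illustrative values: the two columns vary independently (no association).
print(cramers_v(["a", "a", "b", "b"], ["p", "q", "p", "q"]))
```

A column that is fully determined by another, as the designation date is by the designation year in the sample rows, gives the maximum value, which is in line with the 1.000 entries between 지정연도 and the two date variables.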

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 지정연도 | 업체명 | 서비스 분야 | 지역 | 산업기술분류 | 연락처 | 기술전문기업 지정일 | 기술전문기업 지정만료일
0 | 2017 | (주)가람디자인컨설팅 | 디자인 | 경기 | 지식서비스 | 031-753-5596 | 2017-07-17 | 2020-07-16
1 | 2017 | (주)고스디자인 | 디자인 | 경기 | 지식서비스 | 070-8633-8510 | 2017-07-17 | 2020-07-16
2 | 2017 | (주)코엔 | 디자인 | 경기 | 지식서비스 | 031-476-1390 | 2017-07-17 | 2020-07-16
3 | 2017 | 젠디자인 | 디자인 | 경기 | 지식서비스 | 031-8041-1767 | 2017-07-17 | 2020-07-16
4 | 2017 | 지디피(GDP) | 디자인 | 경북 | 지식서비스 | 054-471-2328 | 2017-07-17 | 2020-07-16
5 | 2017 | (주)가화 | 디자인 | 광주 | 지식서비스 | 062-611-5619 | 2017-07-17 | 2020-07-16
6 | 2017 | (주)디자인바이 | 디자인 | 광주 | 지식서비스 | 062-961-9900 | 2017-07-17 | 2020-07-16
7 | 2017 | 바우디자인 | 디자인 | 광주 | 지식서비스 | 063-274-4937 | 2017-07-17 | 2020-07-16
8 | 2017 | 아이디(주) | 디자인 | 광주 | 지식서비스 | 062-611-5608 | 2017-07-17 | 2020-07-16
9 | 2017 | (주)낫씽디자인그룹 | 디자인 | 광주 | 지식서비스 | 062-531-6633 | 2017-07-17 | 2020-07-16

Index | 지정연도 | 업체명 | 서비스 분야 | 지역 | 산업기술분류 | 연락처 | 기술전문기업 지정일 | 기술전문기업 지정만료일
142 | 2018 | (주)세자에너지 | 연구개발 | 전북 | 정보통신 | 063-471-1116 | 2018-04-05 | 2020-04-04
143 | 2018 | (주)피디젠 | 연구개발 | 대전 | 바이오ㆍ의료 | 070-4603-4272 | 2018-04-05 | 2020-04-04
144 | 2018 | (주)핀텔 | 연구개발 | 경기 | 정보통신 | 031-259-6678 | 2018-04-05 | 2020-04-04
145 | 2018 | (주)씨더스 | 연구개발 | 대전 | 지식서비스 | 042-710-4035 | 2018-04-05 | 2020-04-04
146 | 2018 | (주)유디엠텍 | 연구개발 | 경기 | 정보통신 | 031-8064-1888 | 2018-04-05 | 2020-04-04
147 | 2018 | 켐아이넷(주) | 연구개발 | 서울 | 정보통신 | 02-2647-4930 | 2018-04-05 | 2020-04-04
148 | 2018 | (주)비트윈 | 임상 | 전북 | 바이오ㆍ의료 | 063-850-0951 | 2018-04-05 | 2020-04-04
149 | 2018 | (주)엘리드 | 임상 | 경기 | 지식서비스 | 031-709-9070 | 2018-04-05 | 2020-04-04
150 | 2018 | (주)크로엔 | 임상 | 경기 | 바이오ㆍ의료 | 031-548-1502 | 2018-04-05 | 2020-04-04
151 | 2018 | (주)인비보(INVIVO) | 임상 | 전북 | 바이오ㆍ의료 | 063-857-1254 | 2018-04-05 | 2020-04-04