Overview

Dataset statistics

Number of variables5
Number of observations32
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory44.1 B

Variable types

Text2
Categorical3

Dataset

Description중소벤처기업진흥공단의 수출지원사업 중 하나로 중소기업의 해외조달시장 진출을 지원하기 위한 목적으로 지원하며, 지원받은 기업의 주요현황을 제공합니다.
Author중소벤처기업진흥공단
URLhttps://www.data.go.kr/data/15107052/fileData.do

Reproduction

Analysis started2023-12-12 04:54:58.449836
Analysis finished2023-12-12 04:54:59.002385
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct22
Distinct (%)68.8%
Missing0
Missing (%)0.0%
Memory size388.0 B
2023-12-12T13:54:59.131533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters128
Distinct characters23
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)53.1%

Sample

1st row브***
2nd row플***
3rd row웨***
4th row아***
5th row이***
ValueCountFrequency (%)
4
 
12.5%
4
 
12.5%
3
 
9.4%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (12) 12
37.5%
2023-12-12T13:54:59.525341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 96
75.0%
4
 
3.1%
4
 
3.1%
3
 
2.3%
2
 
1.6%
2
 
1.6%
1
 
0.8%
1
 
0.8%
1
 
0.8%
1
 
0.8%
Other values (13) 13
 
10.2%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 96
75.0%
Other Letter 32
 
25.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
12.5%
4
 
12.5%
3
 
9.4%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (12) 12
37.5%
Other Punctuation
ValueCountFrequency (%)
* 96
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 96
75.0%
Hangul 32
 
25.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
12.5%
4
 
12.5%
3
 
9.4%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (12) 12
37.5%
Common
ValueCountFrequency (%)
* 96
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 96
75.0%
Hangul 32
 
25.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 96
100.0%
Hangul
ValueCountFrequency (%)
4
 
12.5%
4
 
12.5%
3
 
9.4%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (12) 12
37.5%
Distinct27
Distinct (%)84.4%
Missing0
Missing (%)0.0%
Memory size388.0 B
2023-12-12T13:54:59.904901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length35
Mean length27
Min length8

Characters and Unicode

Total characters864
Distinct characters127
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)75.0%

Sample

1st row스킨 케어 (Skin Care)
2nd row살균 설비 (Sterilization Equipments)
3rd row전원 공급 장치 (Power Supplies)
4th row기타 측정 분석기기 (Other Measuring & Analysing Instruments)
5th row임플란트 및 인터 벤션 재료 (Implants & Interventional Materials)
ValueCountFrequency (%)
care 6
 
4.0%
기타 6
 
4.0%
5
 
3.4%
스킨 4
 
2.7%
skin 4
 
2.7%
케어 4
 
2.7%
other 4
 
2.7%
4
 
2.7%
장비 3
 
2.0%
equipment 3
 
2.0%
Other values (89) 106
71.1%
2023-12-12T13:55:00.400374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
117
 
13.5%
e 57
 
6.6%
i 41
 
4.7%
a 40
 
4.6%
n 38
 
4.4%
t 37
 
4.3%
r 33
 
3.8%
( 32
 
3.7%
) 32
 
3.7%
s 31
 
3.6%
Other values (117) 406
47.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 443
51.3%
Other Letter 160
 
18.5%
Space Separator 117
 
13.5%
Uppercase Letter 75
 
8.7%
Open Punctuation 32
 
3.7%
Close Punctuation 32
 
3.7%
Other Punctuation 5
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
8.8%
7
 
4.4%
7
 
4.4%
6
 
3.8%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (73) 102
63.7%
Lowercase Letter
ValueCountFrequency (%)
e 57
12.9%
i 41
 
9.3%
a 40
 
9.0%
n 38
 
8.6%
t 37
 
8.4%
r 33
 
7.4%
s 31
 
7.0%
l 24
 
5.4%
p 19
 
4.3%
o 19
 
4.3%
Other values (14) 104
23.5%
Uppercase Letter
ValueCountFrequency (%)
S 11
14.7%
C 10
13.3%
M 9
12.0%
E 8
10.7%
P 8
10.7%
A 5
6.7%
I 5
6.7%
O 4
 
5.3%
H 3
 
4.0%
T 3
 
4.0%
Other values (6) 9
12.0%
Space Separator
ValueCountFrequency (%)
117
100.0%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%
Other Punctuation
ValueCountFrequency (%)
& 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 518
60.0%
Common 186
 
21.5%
Hangul 160
 
18.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
8.8%
7
 
4.4%
7
 
4.4%
6
 
3.8%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (73) 102
63.7%
Latin
ValueCountFrequency (%)
e 57
 
11.0%
i 41
 
7.9%
a 40
 
7.7%
n 38
 
7.3%
t 37
 
7.1%
r 33
 
6.4%
s 31
 
6.0%
l 24
 
4.6%
p 19
 
3.7%
o 19
 
3.7%
Other values (30) 179
34.6%
Common
ValueCountFrequency (%)
117
62.9%
( 32
 
17.2%
) 32
 
17.2%
& 5
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 704
81.5%
Hangul 160
 
18.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
117
16.6%
e 57
 
8.1%
i 41
 
5.8%
a 40
 
5.7%
n 38
 
5.4%
t 37
 
5.3%
r 33
 
4.7%
( 32
 
4.5%
) 32
 
4.5%
s 31
 
4.4%
Other values (34) 246
34.9%
Hangul
ValueCountFrequency (%)
14
 
8.8%
7
 
4.4%
7
 
4.4%
6
 
3.8%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (73) 102
63.7%

소재지
Categorical

Distinct12
Distinct (%)37.5%
Missing0
Missing (%)0.0%
Memory size388.0 B
경기
서울
대전
전북
광주
Other values (7)

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique5 ?
Unique (%)15.6%

Sample

1st row경기
2nd row대전
3rd row경기
4th row서울
5th row대전

Common Values

ValueCountFrequency (%)
경기 9
28.1%
서울 6
18.8%
대전 3
 
9.4%
전북 3
 
9.4%
광주 2
 
6.2%
대구 2
 
6.2%
인천 2
 
6.2%
전남 1
 
3.1%
제주 1
 
3.1%
충남 1
 
3.1%
Other values (2) 2
 
6.2%

Length

2023-12-12T13:55:00.585807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기 9
28.1%
서울 6
18.8%
대전 3
 
9.4%
전북 3
 
9.4%
광주 2
 
6.2%
대구 2
 
6.2%
인천 2
 
6.2%
전남 1
 
3.1%
제주 1
 
3.1%
충남 1
 
3.1%
Other values (2) 2
 
6.2%

자산규모
Categorical

Distinct6
Distinct (%)18.8%
Missing0
Missing (%)0.0%
Memory size388.0 B
300억 미만
30억 미만
100억 미만
10억 미만
50억 미만

Length

Max length7
Median length6.5
Mean length6.5
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row30억 미만
2nd row300억 미만
3rd row100억 미만
4th row30억 미만
5th row300억 이상

Common Values

ValueCountFrequency (%)
300억 미만 9
28.1%
30억 미만 8
25.0%
100억 미만 5
15.6%
10억 미만 5
15.6%
50억 미만 3
 
9.4%
300억 이상 2
 
6.2%

Length

2023-12-12T13:55:00.730911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:55:00.906327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미만 30
46.9%
300억 11
 
17.2%
30억 8
 
12.5%
100억 5
 
7.8%
10억 5
 
7.8%
50억 3
 
4.7%
이상 2
 
3.1%

매출규모
Categorical

Distinct6
Distinct (%)18.8%
Missing0
Missing (%)0.0%
Memory size388.0 B
30억 미만
10 
200억 미만
10억 미만
100억 미만
50억 미만

Length

Max length7
Median length6
Mean length6.375
Min length6

Unique

Unique1 ?
Unique (%)3.1%

Sample

1st row30억 미만
2nd row100억 미만
3rd row100억 미만
4th row10억 미만
5th row200억 이상

Common Values

ValueCountFrequency (%)
30억 미만 10
31.2%
200억 미만 7
21.9%
10억 미만 6
18.8%
100억 미만 4
 
12.5%
50억 미만 4
 
12.5%
200억 이상 1
 
3.1%

Length

2023-12-12T13:55:01.055036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:55:01.193792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미만 31
48.4%
30억 10
 
15.6%
200억 8
 
12.5%
10억 6
 
9.4%
100억 4
 
6.2%
50억 4
 
6.2%
이상 1
 
1.6%

Correlations

2023-12-12T13:55:01.296207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기업명주요수출품목소재지자산규모매출규모
기업명1.0000.5960.8470.5120.812
주요수출품목0.5961.0000.0000.8240.881
소재지0.8470.0001.0000.3930.000
자산규모0.5120.8240.3931.0000.863
매출규모0.8120.8810.0000.8631.000
2023-12-12T13:55:01.415890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자산규모소재지매출규모
자산규모1.0000.0890.485
소재지0.0891.0000.000
매출규모0.4850.0001.000
2023-12-12T13:55:01.519356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지자산규모매출규모
소재지1.0000.0890.000
자산규모0.0891.0000.485
매출규모0.0000.4851.000

Missing values

2023-12-12T13:54:58.844486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:54:58.950365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기업명주요수출품목소재지자산규모매출규모
0브***스킨 케어 (Skin Care)경기30억 미만30억 미만
1플***살균 설비 (Sterilization Equipments)대전300억 미만100억 미만
2웨***전원 공급 장치 (Power Supplies)경기100억 미만100억 미만
3아***기타 측정 분석기기 (Other Measuring & Analysing Instruments)서울30억 미만10억 미만
4이***임플란트 및 인터 벤션 재료 (Implants & Interventional Materials)대전300억 이상200억 이상
5랜***기타 기계 및 산업 장비 (Other Machinery & Industry Equipment)전북10억 미만30억 미만
6뷰***화장품 (Makeup)서울100억 미만200억 미만
7일***보트 및 배 (Boats & Ships)경기100억 미만30억 미만
8에***무선 네트워킹 장비 (Wireless Networking Equipment)경기30억 미만30억 미만
9아***스킨 케어 (Skin Care)광주10억 미만10억 미만
기업명주요수출품목소재지자산규모매출규모
22가***의료기기 (Medical Device)경기30억 미만30억 미만
23케***건강 관리 보충 (Health Care Supplement)전북100억 미만50억 미만
24나***내화 재료 (Fireproofing Materials)경기300억 미만50억 미만
25플***유량 측정 계기 (Flow Measuring Instruments)서울300억 미만100억 미만
26나***긴급 사태 및 원칙 준수 (Emergency & Clinics Apparatuses)광주300억 미만200억 미만
27디***방역 마스크 (Protective Mask)대구30억 미만30억 미만
28다***볼밸브 (Ball Valves)부산100억 미만100억 미만
29디***건강 관리 용품 (Health Care Supplies)인천10억 미만10억 미만
30디***기타 (ETC)인천300억 미만200억 미만
31아***기타 (ETC)충북50억 미만30억 미만