Overview

Dataset statistics

Number of variables: 4
Number of observations: 32
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.1 KiB
Average record size in memory: 36.1 B
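
The overview figures can be recomputed from the raw rows. The following is a minimal sketch in plain Python, not the profiler's own implementation. It assumes the table is held as a list of row tuples, that a missing cell is represented by `None`, and that every repeat of an already-seen row counts as one duplicate; the helper name `overview` is hypothetical. The two example rows are taken from the Sample section of this report.

```python
from collections import Counter

def overview(rows, n_columns):
    """Recompute a few overview statistics for a list of row tuples."""
    n_rows = len(rows)
    n_cells = n_rows * n_columns
    # Assumption: a cell is "missing" when it is None.
    missing = sum(1 for row in rows for cell in row if cell is None)
    # Assumption: each repeat of an already-seen row is one duplicate row.
    duplicates = sum(count - 1 for count in Counter(rows).values())
    return {
        "variables": n_columns,
        "observations": n_rows,
        "missing_cells": missing,
        "missing_cells_pct": 100 * missing / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicates,
        "duplicate_rows_pct": 100 * duplicates / n_rows if n_rows else 0.0,
    }

# Column order: 기업명, 업종, 국적, 소재지 (rows 3 and 4 of the sample).
example_rows = [
    ("NPK", "플라스틱 컴파운드", "일본", "구미시"),
    ("SDFLEX", "산업용 특수부품", "미국", "구미시"),
]
stats = overview(example_rows, n_columns=4)
```

With the full 32-row table as input, the same function would be expected to report zero missing cells and zero duplicate rows, matching the figures above.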

Variable types

Text: 2
Categorical: 2

Dataset

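Description: Provides information on major foreign-invested companies located in Gyeongsangbuk-do, organized by company name (기업명), industry (업종), nationality (국적), and location (소재지).
Author: Gyeongsangbuk-do (경상북도)
URL: https://www.data.go.kr/data/15070503/fileData.do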

Alerts

기업명 (company name) has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 18:03:33.659858
Analysis finished: 2023-12-12 18:03:34.097232
Duration: 0.44 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
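
The reported duration follows from the two timestamps. A short check using only the standard library (the variable names are illustrative):

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 18:03:33.659858", FMT)
finished = datetime.strptime("2023-12-12 18:03:34.097232", FMT)

# Elapsed wall-clock time in seconds, rounded to two decimals for display.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # prints "0.44 seconds"
```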

Variables

기업명 (company name)
Text

UNIQUE

Distinct: 32
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 15
Median length: 11.5
Mean length: 8.46875
Min length: 2
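
Length statistics of this kind can be computed directly from the string values. The sketch below uses only the five 기업명 values shown in this variable's Sample section, so its results describe those five rows and are not expected to match the figures for all 32 rows. Lengths are counted in Unicode code points, which is what Python's `len` returns for a `str`; whether the profiler counts the same way is an assumption.

```python
from statistics import mean, median

# The five sample values listed for 기업명 in this report.
sample_names = ["(주)엑세스바이오코리아", "(주)케이디에스", "LB 루셈", "NPK", "SDFLEX"]

# len() on a str counts code points, so each Hangul syllable counts as 1.
lengths = [len(name) for name in sample_names]

length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
```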

Characters and Unicode

Total characters: 271
Distinct characters: 117
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
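
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories over two of the sample values; the function name `category_counts` is hypothetical. The two-letter codes map to the names used in this report: `Lo` is Other Letter, `Lu` Uppercase Letter, `Ps` Open Punctuation, `Pe` Close Punctuation, `Zs` Space Separator, and `So` Other Symbol. The standard library does not expose the script or block properties, so those counts would need another data source.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over all characters in `values`."""
    return Counter(
        unicodedata.category(ch) for value in values for ch in value
    )

# Two of the 기업명 sample values from this report.
counts = category_counts(["(주)케이디에스", "LB 루셈"])
for code, count in counts.most_common():
    print(code, count)
```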

Unique

Unique: 32
Unique (%): 100.0%

Sample

1st row: (주)엑세스바이오코리아
2nd row: (주)케이디에스
3rd row: LB 루셈
4th row: NPK
5th row: SDFLEX
Value | Count | Frequency (%)
주)엑세스바이오코리아 | 1 | 2.9%
제트에프렘페더샤시(주 | 1 | 2.9%
에코프로gem | 1 | 2.9%
엘링크링거코리아 | 1 | 2.9%
올리콘발저스코팅 | 1 | 2.9%
코리아 | 1 | 2.9%
동국제강 | 1 | 2.9%
이비덴그라파이트코리아 | 1 | 2.9%
지멘스헬시니어스 | 1 | 2.9%
주)케이디에스 | 1 | 2.9%
Other values (24) | 24 | 70.6%

Most occurring characters

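Value | Count | Frequency (%)
[not preserved] | 21 | 7.7%
[not preserved] | 16 | 5.9%
[not preserved] | 15 | 5.5%
[not preserved] | 13 | 4.8%
[not preserved] | 9 | 3.3%
[not preserved] | 8 | 3.0%
( | 6 | 2.2%
) | 6 | 2.2%
[not preserved] | 5 | 1.8%
[not preserved] | 5 | 1.8%
Other values (107) | 167 | 61.6%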

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 236 | 87.1%
Uppercase Letter | 20 | 7.4%
Open Punctuation | 6 | 2.2%
Close Punctuation | 6 | 2.2%
Space Separator | 2 | 0.7%
Other Symbol | 1 | 0.4%

Most frequent character per category

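Other Letter
Value | Count | Frequency (%)
[not preserved] | 21 | 8.9%
[not preserved] | 16 | 6.8%
[not preserved] | 15 | 6.4%
[not preserved] | 13 | 5.5%
[not preserved] | 9 | 3.8%
[not preserved] | 8 | 3.4%
[not preserved] | 5 | 2.1%
[not preserved] | 5 | 2.1%
[not preserved] | 5 | 2.1%
[not preserved] | 5 | 2.1%
Other values (90) | 134 | 56.8%

Uppercase Letter
Value | Count | Frequency (%)
N | 3 | 15.0%
S | 2 | 10.0%
L | 2 | 10.0%
B | 2 | 10.0%
E | 2 | 10.0%
F | 2 | 10.0%
P | 1 | 5.0%
K | 1 | 5.0%
X | 1 | 5.0%
M | 1 | 5.0%
Other values (3) | 3 | 15.0%

Open Punctuation
Value | Count | Frequency (%)
( | 6 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 6 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 2 | 100.0%

Other Symbol
Value | Count | Frequency (%)
[not preserved] | 1 | 100.0%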

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 237 | 87.5%
Latin | 20 | 7.4%
Common | 14 | 5.2%

Most frequent character per script

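Hangul
Value | Count | Frequency (%)
[not preserved] | 21 | 8.9%
[not preserved] | 16 | 6.8%
[not preserved] | 15 | 6.3%
[not preserved] | 13 | 5.5%
[not preserved] | 9 | 3.8%
[not preserved] | 8 | 3.4%
[not preserved] | 5 | 2.1%
[not preserved] | 5 | 2.1%
[not preserved] | 5 | 2.1%
[not preserved] | 5 | 2.1%
Other values (91) | 135 | 57.0%

Latin
Value | Count | Frequency (%)
N | 3 | 15.0%
S | 2 | 10.0%
L | 2 | 10.0%
B | 2 | 10.0%
E | 2 | 10.0%
F | 2 | 10.0%
P | 1 | 5.0%
K | 1 | 5.0%
X | 1 | 5.0%
M | 1 | 5.0%
Other values (3) | 3 | 15.0%

Common
Value | Count | Frequency (%)
( | 6 | 42.9%
) | 6 | 42.9%
(space) | 2 | 14.3%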

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 236 | 87.1%
ASCII | 34 | 12.5%
None | 1 | 0.4%

Most frequent character per block

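Hangul
Value | Count | Frequency (%)
[not preserved] | 21 | 8.9%
[not preserved] | 16 | 6.8%
[not preserved] | 15 | 6.4%
[not preserved] | 13 | 5.5%
[not preserved] | 9 | 3.8%
[not preserved] | 8 | 3.4%
[not preserved] | 5 | 2.1%
[not preserved] | 5 | 2.1%
[not preserved] | 5 | 2.1%
[not preserved] | 5 | 2.1%
Other values (90) | 134 | 56.8%

ASCII
Value | Count | Frequency (%)
( | 6 | 17.6%
) | 6 | 17.6%
N | 3 | 8.8%
S | 2 | 5.9%
L | 2 | 5.9%
B | 2 | 5.9%
(space) | 2 | 5.9%
E | 2 | 5.9%
F | 2 | 5.9%
P | 1 | 2.9%
Other values (6) | 6 | 17.6%

None
Value | Count | Frequency (%)
[not preserved] | 1 | 100.0%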

업종 (industry)
Text

Distinct: 28
Distinct (%): 87.5%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 19
Median length: 16
Mean length: 10.21875
Min length: 4

Characters and Unicode

Total characters: 327
Distinct characters: 113
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 26
Unique (%): 81.2%

Sample

1st row: 의학 및 약학 연구개발
2nd row: 자동차 부품
3rd row: LED BLU의 PKG제조
4th row: 플라스틱 컴파운드
5th row: 산업용 특수부품
Value | Count | Frequency (%)
제조 | 17 | 19.8%
자동차부품 | 5 | 5.8%
[not preserved] | 5 | 5.8%
자동차 | 3 | 3.5%
유리 | 2 | 2.3%
부품 | 2 | 2.3%
반도체 | 2 | 2.3%
플라스틱 | 2 | 2.3%
1차 | 1 | 1.2%
피막처리업 | 1 | 1.2%
Other values (46) | 46 | 53.5%

Most occurring characters

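Value | Count | Frequency (%)
(space) | 54 | 16.5%
[not preserved] | 24 | 7.3%
[not preserved] | 20 | 6.1%
[not preserved] | 11 | 3.4%
[not preserved] | 11 | 3.4%
[not preserved] | 11 | 3.4%
[not preserved] | 9 | 2.8%
[not preserved] | 8 | 2.4%
[not preserved] | 8 | 2.4%
[not preserved] | 5 | 1.5%
Other values (103) | 166 | 50.8%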

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 260 | 79.5%
Space Separator | 54 | 16.5%
Uppercase Letter | 9 | 2.8%
Open Punctuation | 1 | 0.3%
Decimal Number | 1 | 0.3%
Close Punctuation | 1 | 0.3%
Other Punctuation | 1 | 0.3%

Most frequent character per category

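Other Letter
Value | Count | Frequency (%)
[not preserved] | 24 | 9.2%
[not preserved] | 20 | 7.7%
[not preserved] | 11 | 4.2%
[not preserved] | 11 | 4.2%
[not preserved] | 11 | 4.2%
[not preserved] | 9 | 3.5%
[not preserved] | 8 | 3.1%
[not preserved] | 8 | 3.1%
[not preserved] | 5 | 1.9%
[not preserved] | 5 | 1.9%
Other values (90) | 148 | 56.9%

Uppercase Letter
Value | Count | Frequency (%)
L | 2 | 22.2%
G | 1 | 11.1%
K | 1 | 11.1%
P | 1 | 11.1%
U | 1 | 11.1%
B | 1 | 11.1%
D | 1 | 11.1%
E | 1 | 11.1%

Space Separator
Value | Count | Frequency (%)
(space) | 54 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Decimal Number
Value | Count | Frequency (%)
1 | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%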

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 260 | 79.5%
Common | 58 | 17.7%
Latin | 9 | 2.8%

Most frequent character per script

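Hangul
Value | Count | Frequency (%)
[not preserved] | 24 | 9.2%
[not preserved] | 20 | 7.7%
[not preserved] | 11 | 4.2%
[not preserved] | 11 | 4.2%
[not preserved] | 11 | 4.2%
[not preserved] | 9 | 3.5%
[not preserved] | 8 | 3.1%
[not preserved] | 8 | 3.1%
[not preserved] | 5 | 1.9%
[not preserved] | 5 | 1.9%
Other values (90) | 148 | 56.9%

Latin
Value | Count | Frequency (%)
L | 2 | 22.2%
G | 1 | 11.1%
K | 1 | 11.1%
P | 1 | 11.1%
U | 1 | 11.1%
B | 1 | 11.1%
D | 1 | 11.1%
E | 1 | 11.1%

Common
Value | Count | Frequency (%)
(space) | 54 | 93.1%
( | 1 | 1.7%
1 | 1 | 1.7%
) | 1 | 1.7%
, | 1 | 1.7%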
1.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 260 | 79.5%
ASCII | 67 | 20.5%

Most frequent character per block

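ASCII
Value | Count | Frequency (%)
(space) | 54 | 80.6%
L | 2 | 3.0%
( | 1 | 1.5%
1 | 1 | 1.5%
) | 1 | 1.5%
G | 1 | 1.5%
K | 1 | 1.5%
P | 1 | 1.5%
U | 1 | 1.5%
B | 1 | 1.5%
Other values (3) | 3 | 4.5%

Hangul
Value | Count | Frequency (%)
[not preserved] | 24 | 9.2%
[not preserved] | 20 | 7.7%
[not preserved] | 11 | 4.2%
[not preserved] | 11 | 4.2%
[not preserved] | 11 | 4.2%
[not preserved] | 9 | 3.5%
[not preserved] | 8 | 3.1%
[not preserved] | 8 | 3.1%
[not preserved] | 5 | 1.9%
[not preserved] | 5 | 1.9%
Other values (90) | 148 | 56.9%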

국적 (nationality)
Categorical

Distinct: 9
Distinct (%): 28.1%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 5
Median length: 2
Mean length: 2.28125
Min length: 2

Unique

Unique: 5
Unique (%): 15.6%
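
The difference between "Distinct" and "Unique" can be checked against the counts in this variable's Common Values table: distinct counts the different values, while unique counts only the values that occur exactly once. The sketch below rebuilds the 32-entry column from those counts; the variable names are illustrative.

```python
from collections import Counter

# Counts copied from the 국적 Common Values table of this report.
nationality_counts = {
    "미국": 11, "일본": 8, "독일": 5, "프랑스": 3, "인도": 1,
    "룩셈부르크": 1, "중국": 1, "스위스": 1, "네덜란드": 1,
}
column = [value for value, n in nationality_counts.items() for _ in range(n)]

counts = Counter(column)
n_rows = len(column)
distinct = len(counts)                              # number of different values
unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

print(f"Distinct: {distinct} ({100 * distinct / n_rows:.1f}%)")  # Distinct: 9 (28.1%)
print(f"Unique: {unique} ({100 * unique / n_rows:.1f}%)")        # Unique: 5 (15.6%)
```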

Sample

1st row: 미국
2nd row: 미국
3rd row: 일본
4th row: 일본
5th row: 미국

Common Values

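Value | Count | Frequency (%)
미국 (United States) | 11 | 34.4%
일본 (Japan) | 8 | 25.0%
독일 (Germany) | 5 | 15.6%
프랑스 (France) | 3 | 9.4%
인도 (India) | 1 | 3.1%
룩셈부르크 (Luxembourg) | 1 | 3.1%
중국 (China) | 1 | 3.1%
스위스 (Switzerland) | 1 | 3.1%
네덜란드 (Netherlands) | 1 | 3.1%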

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

소재지 (location)
Categorical

Distinct: 8
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 388.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 2
Unique (%): 6.2%

Sample

1st row: 고령군
2nd row: 경산시
3rd row: 구미시
4th row: 구미시
5th row: 구미시

Common Values

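Value | Count | Frequency (%)
구미시 (Gumi) | 14 | 43.8%
포항시 (Pohang) | 6 | 18.8%
경산시 (Gyeongsan) | 4 | 12.5%
경주시 (Gyeongju) | 2 | 6.2%
김천시 (Gimcheon) | 2 | 6.2%
영천시 (Yeongcheon) | 2 | 6.2%
고령군 (Goryeong) | 1 | 3.1%
영주시 (Yeongju) | 1 | 3.1%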
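
A frequency table like the one above, optionally truncated to the top entries with an "Other values (k)" remainder, can be sketched as follows. The counts are copied from the 소재지 table; the function name `frequency_table` and its `top_n` parameter are hypothetical, and ties are kept in the order in which the values were entered.

```python
from collections import Counter

# Counts copied from the 소재지 Common Values table of this report.
location_counts = Counter({
    "구미시": 14, "포항시": 6, "경산시": 4, "경주시": 2,
    "김천시": 2, "영천시": 2, "고령군": 1, "영주시": 1,
})

def frequency_table(counts, top_n=None):
    """Return (value, count, percent) rows, folding the tail into one row."""
    total = sum(counts.values())
    ranked = counts.most_common()  # descending by count, stable for ties
    shown = ranked if top_n is None else ranked[:top_n]
    rows = [(v, c, f"{100 * c / total:.1f}%") for v, c in shown]
    rest = ranked[len(shown):]
    if rest:
        rest_count = sum(c for _, c in rest)
        rows.append(
            (f"Other values ({len(rest)})", rest_count,
             f"{100 * rest_count / total:.1f}%")
        )
    return rows

table = frequency_table(location_counts, top_n=5)
for row in table:
    print(" | ".join(str(cell) for cell in row))
```

With `top_n=5` the last row folds the three remaining locations into a single "Other values (3)" entry.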

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

Correlations

[Figure: correlation plot]

Variable | 기업명 | 업종 | 국적 | 소재지
기업명 | 1.000 | 1.000 | 1.000 | 1.000
업종 | 1.000 | 1.000 | 0.907 | 0.711
국적 | 1.000 | 0.907 | 1.000 | 0.687
소재지 | 1.000 | 0.711 | 0.687 | 1.000

[Figure: correlation plot]

Variable | 국적 | 소재지
국적 | 1.000 | 0.406
소재지 | 0.406 | 1.000

[Figure: correlation plot]

Variable | 국적 | 소재지
국적 | 1.000 | 0.406
소재지 | 0.406 | 1.000
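
The report text does not state which association measure produced these matrices. For two categorical variables such as 국적 and 소재지, one widely used choice is Cramér's V, which is derived from the chi-squared statistic of their contingency table. The sketch below implements the plain, uncorrected formula V = sqrt(chi2 / (n * (min(r, c) - 1))) in pure Python. It is an illustration on made-up inputs and not a reproduction of the values above, which would need the full 32 rows and possibly a bias correction; the function name `cramers_v` is hypothetical.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equally long sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed cell counts
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    k = min(len(row_totals), len(col_totals))
    if n == 0 or k < 2:
        # V is undefined when a variable has one level; 0.0 is a convention here.
        return 0.0
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * (k - 1)))

# Made-up inputs: perfectly associated labels give 1, independent ones give 0.
perfect = cramers_v(list("aabb"), list("xxyy"))
independent = cramers_v(list("aabb"), list("xyxy"))
```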

Missing values

[Figure: a simple visualization of nullity by column.]
[Figure: nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

Index | 기업명 | 업종 | 국적 | 소재지
0 | (주)엑세스바이오코리아 | 의학 및 약학 연구개발 | 미국 | 고령군
1 | (주)케이디에스 | 자동차 부품 | 미국 | 경산시
2 | LB 루셈 | LED BLU의 PKG제조 | 일본 | 구미시
3 | NPK | 플라스틱 컴파운드 | 일본 | 구미시
4 | SDFLEX | 산업용 특수부품 | 미국 | 구미시
5 | 한국전기초자주식회사 | 브라운관 유리 제조 | 일본 | 구미시
6 | 노벨리스코리아 | 알루미늄 리싸이클링 | 인도 | 영주시
7 | 델코 | 자동차밧데리 제조 | 미국 | 구미시
8 | 도레이BSF한국(유) | 전자분리막 필름 제조 | 일본 | 구미시
9 | 도레이첨단소재 | 합성섬유, 필름제조 | 일본 | 구미시

Index | 기업명 | 업종 | 국적 | 소재지
22 | 이비덴그라파이트코리아 | 그라파이트(흑연) 제조 | 일본 | 포항시
23 | 제트에프렘페더샤시(주) | 자동차부품 제조 | 독일 | 구미시
24 | 지멘스헬시니어스 | 초음파의료기기 제조 | 독일 | 포항시
25 | 코오롱바스프이노폼 | 엔지니어링 플라스틱 | 독일 | 김천시
26 | 쿠어스텍코리아(유) | 반도체 세라믹 장비 제조 | 미국 | 구미시
27 | 타이코에이엠피 | 자동차 커넥터 제조 | 미국 | 경산시
28 | 파워카본테크놀로지 | 전기용 탄소제품 및 절연제품 제조 | 미국 | 경산시
29 | 포레시아배기컨트롤시스템코리아 | 자동차부품 제조 | 프랑스 | 영천시
30 | 포레시아오토모티브시팅코리아 | 자동차부품 제조 | 프랑스 | 영천시
31 | 한국오웬스코닝 | 유리장섬유 제작 | 미국 | 김천시