Overview

Dataset statistics

Number of variables: 8
Number of observations: 1560
Missing cells: 645
Missing cells (%): 5.2%
Duplicate rows: 149
Duplicate rows (%): 9.6%
Total size in memory: 97.6 KiB
Average record size in memory: 64.1 B
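
The percentages and the average record size follow from the raw counts. A minimal sketch that re-derives them, assuming the missing-cell percentage is taken over all cells (variables times observations) and that 1 KiB is 1024 bytes; the helper name `pct` is illustrative only:

```python
def pct(part: float, whole: float) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


# Counts copied from the dataset statistics.
n_vars, n_obs = 8, 1560
missing_cells, duplicate_rows = 645, 149
total_kib = 97.6

missing_pct = pct(missing_cells, n_vars * n_obs)        # share of all cells
duplicate_pct = pct(duplicate_rows, n_obs)              # share of all rows
avg_record_bytes = round(total_kib * 1024 / n_obs, 1)   # bytes per row

print(missing_pct, duplicate_pct, avg_record_bytes)
```

The three values agree with the figures reported: 5.2%, 9.6% and 64.1 B.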

Variable types

Text: 7
Categorical: 1

Dataset

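Description: A list of technology information, by company, held by small but strong ventures in the defense sector, as provided through 열린정보마당 (Internet DTiMS)
Author: 국방기술품질원 (Defense Agency for Technology and Quality)
URL: https://www.data.go.kr/data/15040975/fileData.do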

Alerts

Dataset has 149 (9.6%) duplicate rows [Duplicates]
홈페이지주소 has 115 (7.4%) missing values [Missing]
무기체계분류 has 25 (1.6%) missing values [Missing]
국방기술분류 has 24 (1.5%) missing values [Missing]
키워드 has 480 (30.8%) missing values [Missing]
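
The per-column missing counts can be reconciled with the overall figure of 645 missing cells. A small sketch using the counts shown in the variable sections of this report (주소 has 1 missing value and raises no alert; the rule that decides which columns get an alert is not stated in the report):

```python
n_obs = 1560

# Missing counts per column, copied from the variable sections.
missing_by_column = {
    "회사명": 0,
    "홈페이지주소": 115,
    "업종": 0,
    "주소": 1,
    "인력규모": 0,
    "무기체계분류": 25,
    "국방기술분류": 24,
    "키워드": 480,
}

total_missing = sum(missing_by_column.values())

# Percentage of rows with a missing value, per column.
missing_pct = {
    column: round(100.0 * count / n_obs, 1)
    for column, count in missing_by_column.items()
}

print(total_missing)
for column, value in missing_pct.items():
    print(column, value)
```

The column totals add up to the 645 missing cells listed under the dataset statistics.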

Reproduction

Analysis started: 2023-12-12 06:04:57.060589
Analysis finished: 2023-12-12 06:04:58.566376
Duration: 1.51 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
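
A report of this kind can be regenerated along the following lines. This is a hedged sketch, not the configuration actually used: the file paths, the report title and the default encoding are assumptions (CSV files from Korean public-data portals may need `encoding="cp949"`), and the settings stored in config.json are not reproduced here.

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html",
                 encoding: str = "utf-8") -> Path:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so this module loads even when the optional
    # third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Dataset profile")  # title is a placeholder
    report.to_file(out_path)
    return Path(out_path)
```

Calling `build_report("data.csv")` with a hypothetical local copy of the dataset would write `report.html` to the working directory.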

Variables

회사명 (company name)
Text

Distinct: 797
Distinct (%): 51.1%
Missing: 0
Missing (%): 0.0%
Memory size: 12.3 KiB

Length

Max length: 26
Median length: 15
Mean length: 7.2551282
Min length: 2

Characters and Unicode

Total characters: 11318
Distinct characters: 394
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
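
The category counts in these sections can be approximated with Python's standard `unicodedata` module. A minimal sketch over the five sample values of 회사명; the function name `category_counts` is illustrative, and the two-letter codes map to the names used in the report (Lo = Other Letter, Ps = Open Punctuation, Pe = Close Punctuation, Zs = Space Separator, So = Other Symbol):

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in values."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# The five sample rows shown for 회사명.
sample = [
    "(주)한라이비텍",
    "(주)한라이비텍",
    "(주)제일기공",
    "(주)우리해양기술",
    "주식회사 와이지엠",
]
counts = category_counts(sample)
print(counts)

# The single-character form ㈜ (U+3231) is a symbol, not a letter.
print(unicodedata.category("㈜"))
```

Script and block detection are not part of the standard library, so they are left out of this sketch.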

Unique

Unique: 391
Unique (%): 25.1%

Sample

1st row: (주)한라이비텍
2nd row: (주)한라이비텍
3rd row: (주)제일기공
4th row: (주)우리해양기술
5th row: 주식회사 와이지엠

Value | Count | Frequency (%)
주식회사 | 114 | 6.6%
(value not preserved) | 20 | 1.2%
주)제이더블류시스텍 | 12 | 0.7%
주)이노시뮬레이션 | 11 | 0.6%
디알비동일 | 9 | 0.5%
주)디앤비 | 8 | 0.5%
주)지엔티 | 8 | 0.5%
주)에이엔에이치스트럭쳐 | 8 | 0.5%
주)웨이비스 | 7 | 0.4%
아소아 | 7 | 0.4%
Other values (808) | 1529 | 88.2%
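
The word-frequency rows above look like whitespace-separated tokens with the leading punctuation removed (for example `주)제이더블류시스텍` without its opening parenthesis). The report does not show the tokenizer, so the following is only a sketch of one scheme that yields tokens of that shape; the stripping and lower-casing rules are assumptions:

```python
import string
from collections import Counter


def token_counts(values):
    """Split on whitespace, strip ASCII punctuation at both ends, lower-case."""
    counts = Counter()
    for value in values:
        for token in value.split():
            token = token.strip(string.punctuation).lower()
            if token:
                counts[token] += 1
    return counts


# The five sample rows shown for 회사명.
sample = [
    "(주)한라이비텍",
    "(주)한라이비텍",
    "(주)제일기공",
    "(주)우리해양기술",
    "주식회사 와이지엠",
]
tokens = token_counts(sample)
for token, count in tokens.most_common():
    print(token, count)
```

Only the opening parenthesis is removed because stripping stops at the first non-punctuation character on each side.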

Most occurring characters

ValueCountFrequency (%)
1086
 
9.6%
( 954
 
8.4%
) 954
 
8.4%
483
 
4.3%
479
 
4.2%
224
 
2.0%
211
 
1.9%
203
 
1.8%
174
 
1.5%
153
 
1.4%
Other values (384) 6397
56.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8740 | 77.2%
Open Punctuation | 954 | 8.4%
Close Punctuation | 954 | 8.4%
Other Symbol | 224 | 2.0%
Space Separator | 174 | 1.5%
Uppercase Letter | 135 | 1.2%
Lowercase Letter | 116 | 1.0%
Other Punctuation | 13 | 0.1%
Dash Punctuation | 6 | 0.1%
Decimal Number | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1086
 
12.4%
483
 
5.5%
479
 
5.5%
211
 
2.4%
203
 
2.3%
153
 
1.8%
143
 
1.6%
140
 
1.6%
138
 
1.6%
136
 
1.6%
Other values (342) 5568
63.7%
Uppercase Letter
ValueCountFrequency (%)
S 19
14.1%
E 18
13.3%
T 12
8.9%
C 11
 
8.1%
A 10
 
7.4%
M 10
 
7.4%
F 7
 
5.2%
K 7
 
5.2%
D 7
 
5.2%
H 6
 
4.4%
Other values (10) 28
20.7%
Lowercase Letter
ValueCountFrequency (%)
e 28
24.1%
n 12
10.3%
c 11
 
9.5%
r 11
 
9.5%
o 9
 
7.8%
a 8
 
6.9%
y 6
 
5.2%
i 6
 
5.2%
s 6
 
5.2%
g 6
 
5.2%
Other values (4) 13
11.2%
Other Punctuation
ValueCountFrequency (%)
. 8
61.5%
& 5
38.5%
Open Punctuation
ValueCountFrequency (%)
( 954
100.0%
Close Punctuation
ValueCountFrequency (%)
) 954
100.0%
Other Symbol
ValueCountFrequency (%)
224
100.0%
Space Separator
ValueCountFrequency (%)
174
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Decimal Number
ValueCountFrequency (%)
4 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8964 | 79.2%
Common | 2103 | 18.6%
Latin | 251 | 2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1086
 
12.1%
483
 
5.4%
479
 
5.3%
224
 
2.5%
211
 
2.4%
203
 
2.3%
153
 
1.7%
143
 
1.6%
140
 
1.6%
138
 
1.5%
Other values (343) 5704
63.6%
Latin
ValueCountFrequency (%)
e 28
 
11.2%
S 19
 
7.6%
E 18
 
7.2%
n 12
 
4.8%
T 12
 
4.8%
c 11
 
4.4%
r 11
 
4.4%
C 11
 
4.4%
A 10
 
4.0%
M 10
 
4.0%
Other values (24) 109
43.4%
Common
ValueCountFrequency (%)
( 954
45.4%
) 954
45.4%
174
 
8.3%
. 8
 
0.4%
- 6
 
0.3%
& 5
 
0.2%
4 2
 
0.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8740 | 77.2%
ASCII | 2354 | 20.8%
None | 224 | 2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1086
 
12.4%
483
 
5.5%
479
 
5.5%
211
 
2.4%
203
 
2.3%
153
 
1.8%
143
 
1.6%
140
 
1.6%
138
 
1.6%
136
 
1.6%
Other values (342) 5568
63.7%
ASCII
ValueCountFrequency (%)
( 954
40.5%
) 954
40.5%
174
 
7.4%
e 28
 
1.2%
S 19
 
0.8%
E 18
 
0.8%
n 12
 
0.5%
T 12
 
0.5%
c 11
 
0.5%
r 11
 
0.5%
Other values (31) 161
 
6.8%
None
ValueCountFrequency (%)
224
100.0%

홈페이지주소 (homepage URL)
Text

MISSING

Distinct: 731
Distinct (%): 50.6%
Missing: 115
Missing (%): 7.4%
Memory size: 12.3 KiB

Length

Max length: 44
Median length: 29
Mean length: 19.379239
Min length: 1

Characters and Unicode

Total characters: 28003
Distinct characters: 83
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 360
Unique (%): 24.9%

Sample

1st row: http://www.ebwel.com
2nd row: http://www.ebwel.com
3rd row: http://www.jeaim.co.kr
4th row: http://www.wooriocean.com
5th row: http://www.ygmarine.co.kr

Value | Count | Frequency (%)
(value not preserved) | 34 | 2.3%
www.innosim.com | 11 | 0.8%
www.drbworld.com | 9 | 0.6%
http://www.dnb2003.com | 8 | 0.5%
http://www.anhstructure.com | 8 | 0.5%
www.danam.co.kr | 7 | 0.5%
http://www.wavice.com | 7 | 0.5%
www.asoa.co.kr | 7 | 0.5%
http://www.winxen.com | 7 | 0.5%
www.envinode.com | 6 | 0.4%
Other values (728) | 1357 | 92.9%

Most occurring characters

ValueCountFrequency (%)
w 4021
14.4%
. 3328
 
11.9%
o 2187
 
7.8%
t 2125
 
7.6%
c 1892
 
6.8%
/ 1514
 
5.4%
r 1247
 
4.5%
e 1207
 
4.3%
h 1068
 
3.8%
m 1057
 
3.8%
Other values (73) 8357
29.8%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 22016 | 78.6%
Other Punctuation | 5573 | 19.9%
Decimal Number | 177 | 0.6%
Dash Punctuation | 155 | 0.6%
Other Letter | 34 | 0.1%
Uppercase Letter | 29 | 0.1%
Space Separator | 18 | 0.1%
Connector Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
 
8.8%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (17) 17
50.0%
Lowercase Letter
ValueCountFrequency (%)
w 4021
18.3%
o 2187
9.9%
t 2125
 
9.7%
c 1892
 
8.6%
r 1247
 
5.7%
e 1207
 
5.5%
h 1068
 
4.9%
m 1057
 
4.8%
k 964
 
4.4%
p 932
 
4.2%
Other values (16) 5316
24.1%
Uppercase Letter
ValueCountFrequency (%)
W 6
20.7%
S 3
10.3%
N 3
10.3%
C 3
10.3%
K 3
10.3%
R 2
 
6.9%
O 2
 
6.9%
E 2
 
6.9%
U 1
 
3.4%
T 1
 
3.4%
Other values (3) 3
10.3%
Decimal Number
ValueCountFrequency (%)
2 40
22.6%
1 40
22.6%
0 35
19.8%
3 27
15.3%
5 14
 
7.9%
4 10
 
5.6%
6 8
 
4.5%
8 2
 
1.1%
9 1
 
0.6%
Other Punctuation
ValueCountFrequency (%)
. 3328
59.7%
/ 1514
27.2%
: 727
 
13.0%
@ 3
 
0.1%
, 1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 155
100.0%
Space Separator
ValueCountFrequency (%)
18
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 22045 | 78.7%
Common | 5924 | 21.2%
Hangul | 34 | 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 4021
18.2%
o 2187
9.9%
t 2125
 
9.6%
c 1892
 
8.6%
r 1247
 
5.7%
e 1207
 
5.5%
h 1068
 
4.8%
m 1057
 
4.8%
k 964
 
4.4%
p 932
 
4.2%
Other values (29) 5345
24.2%
Hangul
ValueCountFrequency (%)
3
 
8.8%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (17) 17
50.0%
Common
ValueCountFrequency (%)
. 3328
56.2%
/ 1514
25.6%
: 727
 
12.3%
- 155
 
2.6%
2 40
 
0.7%
1 40
 
0.7%
0 35
 
0.6%
3 27
 
0.5%
18
 
0.3%
5 14
 
0.2%
Other values (7) 26
 
0.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 27969 | 99.9%
Hangul | 33 | 0.1%
Compat Jamo | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
w 4021
14.4%
. 3328
 
11.9%
o 2187
 
7.8%
t 2125
 
7.6%
c 1892
 
6.8%
/ 1514
 
5.4%
r 1247
 
4.5%
e 1207
 
4.3%
h 1068
 
3.8%
m 1057
 
3.8%
Other values (46) 8323
29.8%
Hangul
ValueCountFrequency (%)
3
 
9.1%
2
 
6.1%
2
 
6.1%
2
 
6.1%
2
 
6.1%
2
 
6.1%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
Other values (16) 16
48.5%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

업종 (business type)
Text

Distinct: 264
Distinct (%): 16.9%
Missing: 0
Missing (%): 0.0%
Memory size: 12.3 KiB

Length

Max length: 59
Median length: 3
Mean length: 6.4794872
Min length: 2

Characters and Unicode

Total characters: 10108
Distinct characters: 243
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 111
Unique (%): 7.1%

Sample

1st row: 제조업
2nd row: 제조업
3rd row: 제조업
4th row: 복합소재선박 건조, 연구 개발
5th row: 제조업

Value | Count | Frequency (%)
제조업 | 1029 | 39.6%
(value not preserved) | 130 | 5.0%
개발 | 111 | 4.3%
제조 | 99 | 3.8%
소프트웨어 | 74 | 2.8%
서비스업 | 65 | 2.5%
개발업 | 52 | 2.0%
공급 | 48 | 1.8%
공급업 | 38 | 1.5%
서비스 | 34 | 1.3%
Other values (316) | 918 | 35.3%

Most occurring characters

ValueCountFrequency (%)
1330
 
13.2%
1230
 
12.2%
1194
 
11.8%
1039
 
10.3%
, 406
 
4.0%
268
 
2.7%
267
 
2.6%
208
 
2.1%
187
 
1.9%
167
 
1.7%
Other values (233) 3812
37.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8353 | 82.6%
Space Separator | 1039 | 10.3%
Other Punctuation | 481 | 4.8%
Uppercase Letter | 151 | 1.5%
Lowercase Letter | 36 | 0.4%
Decimal Number | 16 | 0.2%
Open Punctuation | 15 | 0.1%
Close Punctuation | 15 | 0.1%
Dash Punctuation | 1 | < 0.1%
Modifier Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1330
15.9%
1230
 
14.7%
1194
 
14.3%
268
 
3.2%
267
 
3.2%
208
 
2.5%
187
 
2.2%
167
 
2.0%
165
 
2.0%
162
 
1.9%
Other values (190) 3175
38.0%
Uppercase Letter
ValueCountFrequency (%)
W 38
25.2%
S 36
23.8%
T 22
14.6%
I 17
11.3%
C 6
 
4.0%
V 6
 
4.0%
H 5
 
3.3%
E 5
 
3.3%
P 3
 
2.0%
R 3
 
2.0%
Other values (5) 10
 
6.6%
Lowercase Letter
ValueCountFrequency (%)
w 8
22.2%
e 5
13.9%
s 4
11.1%
d 4
11.1%
r 3
 
8.3%
i 3
 
8.3%
c 2
 
5.6%
l 2
 
5.6%
a 2
 
5.6%
t 1
 
2.8%
Other values (2) 2
 
5.6%
Decimal Number
ValueCountFrequency (%)
1 7
43.8%
3 3
18.8%
4 2
 
12.5%
0 2
 
12.5%
2 1
 
6.2%
5 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 406
84.4%
/ 68
 
14.1%
. 4
 
0.8%
· 2
 
0.4%
& 1
 
0.2%
Space Separator
ValueCountFrequency (%)
1039
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8353 | 82.6%
Common | 1568 | 15.5%
Latin | 187 | 1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1330
15.9%
1230
 
14.7%
1194
 
14.3%
268
 
3.2%
267
 
3.2%
208
 
2.5%
187
 
2.2%
167
 
2.0%
165
 
2.0%
162
 
1.9%
Other values (190) 3175
38.0%
Latin
ValueCountFrequency (%)
W 38
20.3%
S 36
19.3%
T 22
11.8%
I 17
9.1%
w 8
 
4.3%
C 6
 
3.2%
V 6
 
3.2%
H 5
 
2.7%
e 5
 
2.7%
E 5
 
2.7%
Other values (17) 39
20.9%
Common
ValueCountFrequency (%)
1039
66.3%
, 406
 
25.9%
/ 68
 
4.3%
( 15
 
1.0%
) 15
 
1.0%
1 7
 
0.4%
. 4
 
0.3%
3 3
 
0.2%
· 2
 
0.1%
4 2
 
0.1%
Other values (6) 7
 
0.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8350 | 82.6%
ASCII | 1753 | 17.3%
Compat Jamo | 3 | < 0.1%
None | 2 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1330
15.9%
1230
 
14.7%
1194
 
14.3%
268
 
3.2%
267
 
3.2%
208
 
2.5%
187
 
2.2%
167
 
2.0%
165
 
2.0%
162
 
1.9%
Other values (188) 3172
38.0%
ASCII
ValueCountFrequency (%)
1039
59.3%
, 406
 
23.2%
/ 68
 
3.9%
W 38
 
2.2%
S 36
 
2.1%
T 22
 
1.3%
I 17
 
1.0%
( 15
 
0.9%
) 15
 
0.9%
w 8
 
0.5%
Other values (32) 89
 
5.1%
None
ValueCountFrequency (%)
· 2
100.0%
Compat Jamo
ValueCountFrequency (%)
2
66.7%
1
33.3%

주소 (address)
Text

Distinct: 856
Distinct (%): 54.9%
Missing: 1
Missing (%): 0.1%
Memory size: 12.3 KiB

Length

Max length: 51
Median length: 44
Mean length: 25.305965
Min length: 3

Characters and Unicode

Total characters: 39452
Distinct characters: 450
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 471
Unique (%): 30.2%

Sample

1st row: 부산시 사하구 다산로 106번길 71(다대동)
2nd row: 부산시 사하구 다산로 106번길 71(다대동)
3rd row: 경기도 이천시 부발읍 황무로 2065번길 72-75
4th row: 부산시 강서구 녹산산단335로 24-12 산프라자빌딩 4층 (우 46754)
5th row: 경상남도 김해시 인제로 197, 창조관 425호 (어방동, 인제대학교)

Value | Count | Frequency (%)
경기도 | 290 | 3.4%
유성구 | 189 | 2.2%
경남 | 184 | 2.2%
창원시 | 150 | 1.8%
서울시 | 118 | 1.4%
부산광역시 | 112 | 1.3%
대전광역시 | 108 | 1.3%
강서구 | 90 | 1.1%
성남시 | 84 | 1.0%
부산시 | 76 | 0.9%
Other values (2042) | 7089 | 83.5%

Most occurring characters

ValueCountFrequency (%)
6940
 
17.6%
1 1532
 
3.9%
1448
 
3.7%
1419
 
3.6%
1273
 
3.2%
2 1184
 
3.0%
3 806
 
2.0%
756
 
1.9%
0 754
 
1.9%
4 711
 
1.8%
Other values (440) 22629
57.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 23209 | 58.8%
Decimal Number | 7635 | 19.4%
Space Separator | 6940 | 17.6%
Other Punctuation | 473 | 1.2%
Dash Punctuation | 353 | 0.9%
Open Punctuation | 297 | 0.8%
Close Punctuation | 290 | 0.7%
Uppercase Letter | 205 | 0.5%
Lowercase Letter | 39 | 0.1%
Math Symbol | 8 | < 0.1%
Other values (2) | 3 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1448
 
6.2%
1419
 
6.1%
1273
 
5.5%
756
 
3.3%
656
 
2.8%
637
 
2.7%
610
 
2.6%
610
 
2.6%
577
 
2.5%
494
 
2.1%
Other values (384) 14729
63.5%
Uppercase Letter
ValueCountFrequency (%)
B 34
16.6%
I 31
15.1%
A 30
14.6%
T 22
10.7%
C 16
7.8%
K 12
 
5.9%
S 10
 
4.9%
E 8
 
3.9%
L 8
 
3.9%
F 7
 
3.4%
Other values (8) 27
13.2%
Lowercase Letter
ValueCountFrequency (%)
h 6
15.4%
t 5
12.8%
d 5
12.8%
e 5
12.8%
w 3
7.7%
n 3
7.7%
c 2
 
5.1%
k 2
 
5.1%
p 2
 
5.1%
i 1
 
2.6%
Other values (5) 5
12.8%
Decimal Number
ValueCountFrequency (%)
1 1532
20.1%
2 1184
15.5%
3 806
10.6%
0 754
9.9%
4 711
9.3%
5 669
8.8%
6 650
8.5%
7 495
 
6.5%
8 420
 
5.5%
9 414
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 453
95.8%
. 9
 
1.9%
/ 5
 
1.1%
: 4
 
0.8%
& 1
 
0.2%
# 1
 
0.2%
Space Separator
ValueCountFrequency (%)
6940
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 353
100.0%
Open Punctuation
ValueCountFrequency (%)
( 297
100.0%
Close Punctuation
ValueCountFrequency (%)
) 290
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 23211 | 58.8%
Common | 15997 | 40.5%
Latin | 244 | 0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1448
 
6.2%
1419
 
6.1%
1273
 
5.5%
756
 
3.3%
656
 
2.8%
637
 
2.7%
610
 
2.6%
610
 
2.6%
577
 
2.5%
494
 
2.1%
Other values (385) 14731
63.5%
Latin
ValueCountFrequency (%)
B 34
13.9%
I 31
12.7%
A 30
12.3%
T 22
 
9.0%
C 16
 
6.6%
K 12
 
4.9%
S 10
 
4.1%
E 8
 
3.3%
L 8
 
3.3%
F 7
 
2.9%
Other values (23) 66
27.0%
Common
ValueCountFrequency (%)
6940
43.4%
1 1532
 
9.6%
2 1184
 
7.4%
3 806
 
5.0%
0 754
 
4.7%
4 711
 
4.4%
5 669
 
4.2%
6 650
 
4.1%
7 495
 
3.1%
, 453
 
2.8%
Other values (12) 1803
 
11.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 23209 | 58.8%
ASCII | 16241 | 41.2%
None | 2 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6940
42.7%
1 1532
 
9.4%
2 1184
 
7.3%
3 806
 
5.0%
0 754
 
4.6%
4 711
 
4.4%
5 669
 
4.1%
6 650
 
4.0%
7 495
 
3.0%
, 453
 
2.8%
Other values (45) 2047
 
12.6%
Hangul
ValueCountFrequency (%)
1448
 
6.2%
1419
 
6.1%
1273
 
5.5%
756
 
3.3%
656
 
2.8%
637
 
2.7%
610
 
2.6%
610
 
2.6%
577
 
2.5%
494
 
2.1%
Other values (384) 14729
63.5%
None
ValueCountFrequency (%)
2
100.0%

인력규모 (workforce size)
Categorical

Distinct: 4
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 12.3 KiB

1명이상~50명미만: 978
50명이상~100명미만: 296
100명이상: 282
<NA>: 4

Length

Max length: 12
Median length: 10
Mean length: 9.6410256
Min length: 4
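
These length statistics can be re-derived from the value counts of 인력규모. A minimal sketch, assuming lengths are counted in Unicode code points and that `<NA>` is treated as a literal four-character string (which would explain a minimum length of 4 alongside zero missing values):

```python
from statistics import mean, median

# Value counts copied from the Common Values table for 인력규모.
value_counts = {
    "1명이상~50명미만": 978,
    "50명이상~100명미만": 296,
    "100명이상": 282,
    "<NA>": 4,
}

# One length per observation, repeated by its count.
lengths = [len(value) for value, n in value_counts.items() for _ in range(n)]

max_len = max(lengths)
median_len = median(lengths)
mean_len = round(mean(lengths), 7)
min_len = min(lengths)

print(max_len, median_len, mean_len, min_len)
```

The result matches the reported maximum of 12, median of 10, mean of 9.6410256 and minimum of 4.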

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1명이상~50명미만
2nd row: 1명이상~50명미만
3rd row: 50명이상~100명미만
4th row: 1명이상~50명미만
5th row: 1명이상~50명미만

Common Values

Value | Count | Frequency (%)
1명이상~50명미만 | 978 | 62.7%
50명이상~100명미만 | 296 | 19.0%
100명이상 | 282 | 18.1%
<NA> | 4 | 0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
1명이상~50명미만 | 978 | 62.7%
50명이상~100명미만 | 296 | 19.0%
100명이상 | 282 | 18.1%
na | 4 | 0.3%

무기체계분류 (weapon system classification)
Text

MISSING

Distinct: 106
Distinct (%): 6.9%
Missing: 25
Missing (%): 1.6%
Memory size: 12.3 KiB

Length

Max length: 42
Median length: 35
Mean length: 5.904886
Min length: 2

Characters and Unicode

Total characters: 9064
Distinct characters: 31
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 37
Unique (%): 2.4%

Sample

1st row: 화력
2nd row: 화력
3rd row: 전력지원체계
4th row: 함정
5th row: 전력지원체계

Value | Count | Frequency (%)
감시/정찰 | 188 | 11.1%
항공 | 169 | 9.9%
기동 | 162 | 9.5%
지휘통제·통신 | 150 | 8.8%
기타 | 126 | 7.4%
함정 | 116 | 6.8%
비무기체계 | 93 | 5.5%
무기체계 | 78 | 4.6%
화력 | 76 | 4.5%
전력지원체계 | 71 | 4.2%
Other values (92) | 472 | 27.7%

Most occurring characters

ValueCountFrequency (%)
852
 
9.4%
. 597
 
6.6%
579
 
6.4%
548
 
6.0%
548
 
6.0%
480
 
5.3%
370
 
4.1%
361
 
4.0%
325
 
3.6%
280
 
3.1%
Other values (21) 4124
45.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 7781 | 85.8%
Other Punctuation | 1117 | 12.3%
Space Separator | 166 | 1.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
852
 
10.9%
579
 
7.4%
548
 
7.0%
548
 
7.0%
480
 
6.2%
370
 
4.8%
361
 
4.6%
325
 
4.2%
280
 
3.6%
280
 
3.6%
Other values (17) 3158
40.6%
Other Punctuation
ValueCountFrequency (%)
. 597
53.4%
/ 280
25.1%
· 240
21.5%
Space Separator
ValueCountFrequency (%)
166
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 7781 | 85.8%
Common | 1283 | 14.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
852
 
10.9%
579
 
7.4%
548
 
7.0%
548
 
7.0%
480
 
6.2%
370
 
4.8%
361
 
4.6%
325
 
4.2%
280
 
3.6%
280
 
3.6%
Other values (17) 3158
40.6%
Common
ValueCountFrequency (%)
. 597
46.5%
/ 280
21.8%
· 240
18.7%
166
 
12.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 7781 | 85.8%
ASCII | 1043 | 11.5%
None | 240 | 2.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
852
 
10.9%
579
 
7.4%
548
 
7.0%
548
 
7.0%
480
 
6.2%
370
 
4.8%
361
 
4.6%
325
 
4.2%
280
 
3.6%
280
 
3.6%
Other values (17) 3158
40.6%
ASCII
ValueCountFrequency (%)
. 597
57.2%
/ 280
26.8%
166
 
15.9%
None
ValueCountFrequency (%)
· 240
100.0%

국방기술분류 (defense technology classification)
Text

MISSING

Distinct: 57
Distinct (%): 3.7%
Missing: 24
Missing (%): 1.5%
Memory size: 12.3 KiB

Length

Max length: 23
Median length: 22
Mean length: 4.6165365
Min length: 2

Characters and Unicode

Total characters: 7091
Distinct characters: 31
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 20
Unique (%): 1.3%

Sample

1st row: 플랫폼/구조
2nd row: 플랫폼/구조
3rd row: 탄약/에너지
4th row: 소재.플랫폼/구조
5th row: 소재.플랫폼/구조

Value | Count | Frequency (%)
정보통신 | 347 | 22.6%
플랫폼/구조 | 223 | 14.5%
센서 | 204 | 13.3%
소재 | 197 | 12.8%
제어전자 | 151 | 9.8%
탄약/에너지 | 96 | 6.2%
추진 | 41 | 2.7%
화생방 | 38 | 2.5%
센서.정보통신 | 32 | 2.1%
소재.플랫폼/구조 | 27 | 1.8%
Other values (47) | 180 | 11.7%

Most occurring characters

ValueCountFrequency (%)
440
 
6.2%
440
 
6.2%
440
 
6.2%
440
 
6.2%
/ 434
 
6.1%
298
 
4.2%
298
 
4.2%
298
 
4.2%
298
 
4.2%
298
 
4.2%
Other values (21) 3407
48.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6372 | 89.9%
Other Punctuation | 719 | 10.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
440
 
6.9%
440
 
6.9%
440
 
6.9%
440
 
6.9%
298
 
4.7%
298
 
4.7%
298
 
4.7%
298
 
4.7%
298
 
4.7%
296
 
4.6%
Other values (19) 2826
44.4%
Other Punctuation
ValueCountFrequency (%)
/ 434
60.4%
. 285
39.6%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6372 | 89.9%
Common | 719 | 10.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
440
 
6.9%
440
 
6.9%
440
 
6.9%
440
 
6.9%
298
 
4.7%
298
 
4.7%
298
 
4.7%
298
 
4.7%
298
 
4.7%
296
 
4.6%
Other values (19) 2826
44.4%
Common
ValueCountFrequency (%)
/ 434
60.4%
. 285
39.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6372 | 89.9%
ASCII | 719 | 10.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
440
 
6.9%
440
 
6.9%
440
 
6.9%
440
 
6.9%
298
 
4.7%
298
 
4.7%
298
 
4.7%
298
 
4.7%
298
 
4.7%
296
 
4.6%
Other values (19) 2826
44.4%
ASCII
ValueCountFrequency (%)
/ 434
60.4%
. 285
39.6%

키워드 (keywords)
Text

MISSING

Distinct: 954
Distinct (%): 88.3%
Missing: 480
Missing (%): 30.8%
Memory size: 12.3 KiB

Length

Max length: 167
Median length: 79
Mean length: 25.883333
Min length: 1

Characters and Unicode

Total characters: 27954
Distinct characters: 608
Distinct categories: 13
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 864
Unique (%): 80.0%

Sample

1st row: 탄도미사일의 탄두부 외피, 탄도미사일의 연소관, 전자빔용접, 고밀도 에너지 용접
2nd row: 탄도미사일의 탄두부 외피, 탄도미사일의 연소관, 전자빔용접, 고밀도 에너지 용접
3rd row: 추진제, 제조 설비
4th row: 탄소복합소재, 초고속정
5th row: 방탄, 의장, 커플링, 마그네틱 커플링

Value | Count | Frequency (%)
드론 | 62 | 1.1%
(value not preserved) | 59 | 1.1%
(value not preserved) | 38 | 0.7%
시뮬레이터 | 32 | 0.6%
시스템 | 27 | 0.5%
센서 | 27 | 0.5%
데이터 | 23 | 0.4%
3d | 22 | 0.4%
시뮬레이션 | 22 | 0.4%
항공기 | 21 | 0.4%
Other values (2897) | 5196 | 94.0%

Most occurring characters

ValueCountFrequency (%)
4450
 
15.9%
, 2783
 
10.0%
416
 
1.5%
383
 
1.4%
373
 
1.3%
e 359
 
1.3%
r 279
 
1.0%
S 267
 
1.0%
i 263
 
0.9%
a 250
 
0.9%
Other values (598) 18131
64.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 14642 | 52.4%
Space Separator | 4450 | 15.9%
Other Punctuation | 2904 | 10.4%
Lowercase Letter | 2842 | 10.2%
Uppercase Letter | 2720 | 9.7%
Decimal Number | 186 | 0.7%
Open Punctuation | 76 | 0.3%
Close Punctuation | 75 | 0.3%
Dash Punctuation | 52 | 0.2%
Connector Punctuation | 3 | < 0.1%
Other values (3) | 4 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
416
 
2.8%
383
 
2.6%
373
 
2.5%
249
 
1.7%
239
 
1.6%
232
 
1.6%
231
 
1.6%
228
 
1.6%
193
 
1.3%
174
 
1.2%
Other values (521) 11924
81.4%
Lowercase Letter
ValueCountFrequency (%)
e 359
12.6%
r 279
9.8%
i 263
 
9.3%
a 250
 
8.8%
o 233
 
8.2%
t 206
 
7.2%
n 204
 
7.2%
l 152
 
5.3%
s 116
 
4.1%
c 109
 
3.8%
Other values (16) 671
23.6%
Uppercase Letter
ValueCountFrequency (%)
S 267
 
9.8%
C 205
 
7.5%
A 188
 
6.9%
I 187
 
6.9%
T 186
 
6.8%
M 170
 
6.2%
D 170
 
6.2%
P 168
 
6.2%
E 145
 
5.3%
R 133
 
4.9%
Other values (16) 901
33.1%
Decimal Number
ValueCountFrequency (%)
3 59
31.7%
0 33
17.7%
2 26
14.0%
5 19
 
10.2%
1 18
 
9.7%
6 15
 
8.1%
8 6
 
3.2%
7 5
 
2.7%
4 4
 
2.2%
9 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
, 2783
95.8%
/ 81
 
2.8%
& 16
 
0.6%
. 12
 
0.4%
* 6
 
0.2%
· 4
 
0.1%
; 2
 
0.1%
Space Separator
ValueCountFrequency (%)
4450
100.0%
Open Punctuation
ValueCountFrequency (%)
( 76
100.0%
Close Punctuation
ValueCountFrequency (%)
) 75
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 14642 | 52.4%
Common | 7750 | 27.7%
Latin | 5562 | 19.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
416
 
2.8%
383
 
2.6%
373
 
2.5%
249
 
1.7%
239
 
1.6%
232
 
1.6%
231
 
1.6%
228
 
1.6%
193
 
1.3%
174
 
1.2%
Other values (521) 11924
81.4%
Latin
ValueCountFrequency (%)
e 359
 
6.5%
r 279
 
5.0%
S 267
 
4.8%
i 263
 
4.7%
a 250
 
4.5%
o 233
 
4.2%
t 206
 
3.7%
C 205
 
3.7%
n 204
 
3.7%
A 188
 
3.4%
Other values (42) 3108
55.9%
Common
ValueCountFrequency (%)
4450
57.4%
, 2783
35.9%
/ 81
 
1.0%
( 76
 
1.0%
) 75
 
1.0%
3 59
 
0.8%
- 52
 
0.7%
0 33
 
0.4%
2 26
 
0.3%
5 19
 
0.2%
Other values (15) 96
 
1.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 14642 | 52.4%
ASCII | 13307 | 47.6%
None | 4 | < 0.1%
Letterlike Symbols | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4450
33.4%
, 2783
20.9%
e 359
 
2.7%
r 279
 
2.1%
S 267
 
2.0%
i 263
 
2.0%
a 250
 
1.9%
o 233
 
1.8%
t 206
 
1.5%
C 205
 
1.5%
Other values (65) 4012
30.1%
Hangul
ValueCountFrequency (%)
416
 
2.8%
383
 
2.6%
373
 
2.5%
249
 
1.7%
239
 
1.6%
232
 
1.6%
231
 
1.6%
228
 
1.6%
193
 
1.3%
174
 
1.2%
Other values (521) 11924
81.4%
None
ValueCountFrequency (%)
· 4
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%

Correlations

 | 인력규모 | 국방기술분류
인력규모 | 1.000 | 0.244
국방기술분류 | 0.244 | 1.000
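
The report does not say which coefficient produced the value 0.244. For a pair of categorical variables a common choice is Cramér's V, and that is assumed in the sketch below. It uses the plain, uncorrected formula, so it would not necessarily reproduce the reported figure even on the real data. The function name and the toy inputs are illustrative.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for r, r_count in row_totals.items():
        for c, c_count in col_totals.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return math.sqrt(chi2 / (n * k))


# Toy data: perfectly associated, then independent.
perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(perfect, independent)
```

A value near 0 means little association and a value near 1 means the categories of one variable largely determine the other, so 0.244 would indicate a weak association under this reading.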

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
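
Nullity correlation can be illustrated as the Pearson correlation between two presence indicators (1 where a value is present, 0 where it is missing). This is a sketch on toy columns, with `None` standing in for a missing value; the function name is illustrative and the report's own plotting code is not shown.

```python
import math


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return float("nan")  # undefined when a column is fully present or fully missing
    return cov / math.sqrt(var_a * var_b)


# Toy columns: missing together, then missing in opposite rows.
together = nullity_correlation([None, "x", None, "y"], [None, "p", None, "q"])
opposite = nullity_correlation([None, "x", None, "y"], ["p", None, "q", None])
print(together, opposite)
```

A value of 1 means two columns are always missing together, and -1 means one is present exactly when the other is missing.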

Sample

index | 회사명 | 홈페이지주소 | 업종 | 주소 | 인력규모 | 무기체계분류 | 국방기술분류 | 키워드
0 | (주)한라이비텍 | http://www.ebwel.com | 제조업 | 부산시 사하구 다산로 106번길 71(다대동) | 1명이상~50명미만 | 화력 | 플랫폼/구조 | 탄도미사일의 탄두부 외피, 탄도미사일의 연소관, 전자빔용접, 고밀도 에너지 용접
1 | (주)한라이비텍 | http://www.ebwel.com | 제조업 | 부산시 사하구 다산로 106번길 71(다대동) | 1명이상~50명미만 | 화력 | 플랫폼/구조 | 탄도미사일의 탄두부 외피, 탄도미사일의 연소관, 전자빔용접, 고밀도 에너지 용접
2 | (주)제일기공 | http://www.jeaim.co.kr | 제조업 | 경기도 이천시 부발읍 황무로 2065번길 72-75 | 50명이상~100명미만 | 전력지원체계 | 탄약/에너지 | 추진제, 제조 설비
3 | (주)우리해양기술 | http://www.wooriocean.com | 복합소재선박 건조, 연구 개발 | 부산시 강서구 녹산산단335로 24-12 산프라자빌딩 4층 (우 46754) | 1명이상~50명미만 | 함정 | 소재.플랫폼/구조 | 탄소복합소재, 초고속정
4 | 주식회사 와이지엠 | http://www.ygmarine.co.kr | 제조업 | 경상남도 김해시 인제로 197, 창조관 425호 (어방동, 인제대학교) | 1명이상~50명미만 | 전력지원체계 | 소재.플랫폼/구조 | 방탄, 의장, 커플링, 마그네틱 커플링
5 | 주식회사 와이지엠 | http://www.ygmarine.co.kr | 제조업 | 경상남도 김해시 인제로 197, 창조관 425호 (어방동, 인제대학교) | 1명이상~50명미만 | 전력지원체계 | 소재.플랫폼/구조 | 방탄, 의장, 커플링, 마그네틱 커플링
6 | 에버그린텍(주) | http://www.egt1014.com | 제조업 | 충남 공주시 이인면 내건너길21 | 1명이상~50명미만 | 전력지원체계 | 소재 | <NA>
7 | 명 비엔비머티리얼(주) | http://www.bnbmat.co.kr | 제조업 | 부산 부산진구 엄광동 176 | 1명이상~50명미만 | 전력지원체계 | 소재 | 열전도 및 방열 복합재료
8 | (주)무스마 | http://www.musma.net | ㅈ제조업 | 부산광역시 해운대구 센텀중앙로 78 센텀그린타워 1708호 | 1명이상~50명미만 | 지휘통제·통신.감시/정찰 | 정보통신.제어전자 | 안전 관리 시스템
9 | 드론올레 | <NA> | 제조업 | 제주 서귀포시 안덕면 일주서로1836 안덕비행장 | 1명이상~50명미만 | 지휘통제·통신.항공.전력지원체계 | 센서.정보통신.제어전자.탄약/에너지.화생방 | <NA>

index | 회사명 | 홈페이지주소 | 업종 | 주소 | 인력규모 | 무기체계분류 | 국방기술분류 | 키워드
1550 | 알티엑스 | www.irtx.co.kr | 제조업 | 대전광역시 유성구 테크노2로 167-6 | 1명이상~50명미만 | 감시/정찰 | 탄약/에너지 | <NA>
1551 | 소나테크(주) | www.sonartech.com | 전자장비 제조업 | 부산시 남구 황령대로 353번길 9-37 | 1명이상~50명미만 | 감시/정찰 | 센서 | <NA>
1552 | ㈜서광 | www.seog.co.kr | 생활가전 부품제조 | 전라남도 장성군 동화면 농공단지길 64 | 100명이상 | 감시/정찰.함정 | 소재 | <NA>
1553 | 비나텍 | www.vian.co.kr | 제조업 | 전라북도 전주시 덕진구 운암로 15 | 100명이상 | 화력 | 제어전자 | <NA>
1554 | ㈜디엠티 | www.dmtflex.com | 제조업 | 전남 순천시 해룡면 율촌산단 1로 80-9 | 1명이상~50명미만 | 기타 무기체계.비무기체계 | 추진 | <NA>
1555 | ㈜두시텍 | www.dusi.co.kr | 제조업, 도매업, 서비스업 | 대전시 유성구 테크노 10로 44-15(탑림동) | 1명이상~50명미만 | 감시/정찰 | 정보통신 | <NA>
1556 | 데크카본 | www.dacc21.co.kr | 제조업 | 전북 전주시 덕진구 운암로 30 | 50명이상~100명미만 | 화력 | 소재 | <NA>
1557 | 구름네트웍스 | gurum.cc | 소프트웨어자문개발 및 공급업 | 서울시 광진구 능동로 120, 창의관 212호(건국대) | 1명이상~50명미만 | 지휘통제·통신 | 정보통신 | <NA>
1558 | (주)코셋 | www.coset.com | 제조업 | 광주광역시 북구 첨단벤처로 60번길 39 | 1명이상~50명미만 | 화력 | 탄약/에너지 | <NA>
1559 | (주)이상테크 | www.lstech.kr | 제조업 | 광주광역시 북구 첨단벤처소로38번길 33-3 | 1명이상~50명미만 | 화력 | 탄약/에너지 | <NA>

Duplicate rows

Most frequently occurring

index | 회사명 | 홈페이지주소 | 업종 | 주소 | 인력규모 | 무기체계분류 | 국방기술분류 | 키워드 | # duplicates
29 | (주)웨이비스 | http://www.wavice.com | 제조업 | 경기도 화성시 삼성1로5길 46 | 50명이상~100명미만 | 감시/정찰 | 센서 | <NA> | 6
100 | 라온시큐어 | https://www.raonsecure.com | 소프트웨어 공급 및 개발 | 서울특별시 강남구 테헤란로 145 | 100명이상 | 지휘통제·통신 | 정보통신 | <NA> | 6
101 | 마린전자상사 | http://www.mecys.com | 제조 및 도소매 외 | 부산광역시 동구 고관로 62 | 1명이상~50명미만 | 지휘통제·통신.함정 | 정보통신 | <NA> | 6
16 | (주)삼영엠아이텍 | http://www.symit.co.kr | 비파괴검사 | 경상남도 창원시 의창구 동읍 자여로 115번길 2 | 1명이상~50명미만 | 함정.항공.전력지원체계 | 소재 | <NA> | 5
99 | 라온시스템즈(주) | http:/www.laonsys.co.kr | 제조업 | 경기도 군포시 공단로298-8 2층 | 1명이상~50명미만 | 감시/정찰 | 정보통신 | 신호탐지, 정보탐지, 방향탐지 | 5
103 | 범한산업(주) | http://www.bumhan.com | 제조업 | 경남 창원시 마산회원구 자유무역4길 61 | 50명이상~100명미만 | 함정 | 추진 | <NA> | 5
121 | 유콘시스템(주) | http://www.uconsystem.com | 무인항공기, 드론, 지상제어 시스템 | 대전시 유성구 테크노 2로 40-9 | 50명이상~100명미만 | 항공 | 센서.정보통신.제어전자 | <NA> | 5
129 | 주식회사 아벡스 | www.a-vex.com | 제조업, 서비스업, 도소매업 | 경기 화성시 봉담읍 최루백로72 협성대학교 산학협력관 202호 | 1명이상~50명미만 | 비무기체계 | <NA> | <NA> | 5
141 | 하이게인안테나 | www.highgain.co.kr | 통신장비 외 | 경기도 안산시 단원구 산단로 224(원시동) | 100명이상 | 지휘통제·통신 | 정보통신 | <NA> | 5
10 | (주)리얼타임웨이브 | http://www.realtimewave.com | 시스템소프트웨어 개발 및 공급, 군수항공용 시험장비 제작 | 경기도 성남시 분당구 판교역로 240, 삼환하이펙스 A동 710호 | 1명이상~50명미만 | 항공.화력 | 정보통신 | 항공전자, MC, 임무컴퓨터, SMC, FACE, IMA, Arinc653, Hot bench, SIL, STE | 4
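
A "most frequently occurring" listing of this kind can be built by counting identical rows and sorting by count. The sketch below uses the first two columns of a few sample rows from this report. The function name is illustrative, and the exact meaning of the report's `# duplicates` column (total occurrences or extra copies) is not stated, so the count here is simply the number of occurrences.

```python
from collections import Counter


def duplicate_groups(rows):
    """Return (row, occurrences) for rows seen more than once, most frequent first."""
    counts = Counter(tuple(row) for row in rows)
    groups = [(row, n) for row, n in counts.items() if n > 1]
    groups.sort(key=lambda item: item[1], reverse=True)
    return groups


# (회사명, 홈페이지주소) for sample rows 0, 1, 2, 4 and 5.
rows = [
    ("(주)한라이비텍", "http://www.ebwel.com"),
    ("(주)한라이비텍", "http://www.ebwel.com"),
    ("(주)제일기공", "http://www.jeaim.co.kr"),
    ("주식회사 와이지엠", "http://www.ygmarine.co.kr"),
    ("주식회사 와이지엠", "http://www.ygmarine.co.kr"),
]
groups = duplicate_groups(rows)
for row, n in groups:
    print(n, row)
```

On the full dataset all eight columns would be compared, not just the two used here.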