Overview

Dataset statistics

Number of variables8
Number of observations1290
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory80.8 KiB
Average record size in memory64.1 B

Variable types

Text5
Categorical3

Dataset

Description- 중소벤처기업진흥공단의 중소기업 계약학과 참여 중소기업의 기업명, 사업자번호, 소재지, 기업유형, 대표자 성명, 스마트공장 사업참여 여부에 대한 리스트
Author공공데이터포털
URLhttps://www.data.go.kr/data/15019774/fileData.do

Alerts

참여대학 is highly overall correlated with 학위High correlation
학위 is highly overall correlated with 참여대학High correlation
기업유형 is highly imbalanced (76.7%)Imbalance
사업자 번호 has unique valuesUnique

Reproduction

Analysis started2024-04-17 13:24:13.228924
Analysis finished2024-04-17 13:24:14.463875
Duration1.23 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1288
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2024-04-17T22:24:14.628222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length7.6147287
Min length2

Characters and Unicode

Total characters9823
Distinct characters504
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1286 ?
Unique (%)99.7%

Sample

1st row효동기계공업(주)
2nd row(주)이너트론
3rd row신일이엔티㈜
4th row주식회사 이엠아이
5th row코라스
ValueCountFrequency (%)
주식회사 83
 
6.0%
주)비에스이엔지 2
 
0.1%
제이 2
 
0.1%
선보유니텍(주 2
 
0.1%
주)우진정밀 1
 
0.1%
주)금경라이팅 1
 
0.1%
나라드라이브(주 1
 
0.1%
삼우엠씨피(주 1
 
0.1%
주)서영 1
 
0.1%
세원이앤씨(주 1
 
0.1%
Other values (1292) 1292
93.2%
2024-04-17T22:24:14.978076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1001
 
10.2%
) 860
 
8.8%
( 860
 
8.8%
461
 
4.7%
322
 
3.3%
200
 
2.0%
198
 
2.0%
167
 
1.7%
159
 
1.6%
140
 
1.4%
Other values (494) 5455
55.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7840
79.8%
Close Punctuation 860
 
8.8%
Open Punctuation 860
 
8.8%
Space Separator 99
 
1.0%
Uppercase Letter 84
 
0.9%
Other Symbol 54
 
0.5%
Other Punctuation 11
 
0.1%
Lowercase Letter 9
 
0.1%
Decimal Number 4
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1001
 
12.8%
461
 
5.9%
322
 
4.1%
200
 
2.6%
198
 
2.5%
167
 
2.1%
159
 
2.0%
140
 
1.8%
122
 
1.6%
114
 
1.5%
Other values (453) 4956
63.2%
Uppercase Letter
ValueCountFrequency (%)
T 11
13.1%
S 11
13.1%
C 10
11.9%
A 8
 
9.5%
K 5
 
6.0%
O 5
 
6.0%
I 4
 
4.8%
X 3
 
3.6%
E 3
 
3.6%
W 3
 
3.6%
Other values (10) 21
25.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
11.1%
a 1
11.1%
l 1
11.1%
o 1
11.1%
d 1
11.1%
i 1
11.1%
n 1
11.1%
g 1
11.1%
s 1
11.1%
Decimal Number
ValueCountFrequency (%)
2 1
25.0%
3 1
25.0%
6 1
25.0%
5 1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 8
72.7%
& 2
 
18.2%
/ 1
 
9.1%
Close Punctuation
ValueCountFrequency (%)
) 860
100.0%
Open Punctuation
ValueCountFrequency (%)
( 860
100.0%
Space Separator
ValueCountFrequency (%)
99
100.0%
Other Symbol
ValueCountFrequency (%)
54
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7894
80.4%
Common 1836
 
18.7%
Latin 93
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1001
 
12.7%
461
 
5.8%
322
 
4.1%
200
 
2.5%
198
 
2.5%
167
 
2.1%
159
 
2.0%
140
 
1.8%
122
 
1.5%
114
 
1.4%
Other values (454) 5010
63.5%
Latin
ValueCountFrequency (%)
T 11
 
11.8%
S 11
 
11.8%
C 10
 
10.8%
A 8
 
8.6%
K 5
 
5.4%
O 5
 
5.4%
I 4
 
4.3%
X 3
 
3.2%
E 3
 
3.2%
W 3
 
3.2%
Other values (19) 30
32.3%
Common
ValueCountFrequency (%)
) 860
46.8%
( 860
46.8%
99
 
5.4%
. 8
 
0.4%
& 2
 
0.1%
- 2
 
0.1%
2 1
 
0.1%
3 1
 
0.1%
6 1
 
0.1%
5 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7840
79.8%
ASCII 1929
 
19.6%
None 54
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1001
 
12.8%
461
 
5.9%
322
 
4.1%
200
 
2.6%
198
 
2.5%
167
 
2.1%
159
 
2.0%
140
 
1.8%
122
 
1.6%
114
 
1.5%
Other values (453) 4956
63.2%
ASCII
ValueCountFrequency (%)
) 860
44.6%
( 860
44.6%
99
 
5.1%
T 11
 
0.6%
S 11
 
0.6%
C 10
 
0.5%
. 8
 
0.4%
A 8
 
0.4%
K 5
 
0.3%
O 5
 
0.3%
Other values (30) 52
 
2.7%
None
ValueCountFrequency (%)
54
100.0%

참여대학
Categorical

HIGH CORRELATION 

Distinct50
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
한성대학교
 
82
경상국립대학교
 
76
동아대학교
 
66
한국공학대학교
 
65
전주대학교
 
65
Other values (45)
936 

Length

Max length9
Median length5
Mean length5.5968992
Min length5

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row한국공학대학교
2nd row한국공학대학교
3rd row한국공학대학교
4th row한국공학대학교
5th row한국공학대학교

Common Values

ValueCountFrequency (%)
한성대학교 82
 
6.4%
경상국립대학교 76
 
5.9%
동아대학교 66
 
5.1%
한국공학대학교 65
 
5.0%
전주대학교 65
 
5.0%
숭실대학교 56
 
4.3%
한밭대학교 50
 
3.9%
명지대학교 49
 
3.8%
경희대학교 49
 
3.8%
대구대학교 47
 
3.6%
Other values (40) 685
53.1%

Length

2024-04-17T22:24:15.105948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한성대학교 82
 
6.4%
경상국립대학교 76
 
5.9%
동아대학교 66
 
5.1%
한국공학대학교 65
 
5.0%
전주대학교 65
 
5.0%
숭실대학교 56
 
4.3%
한밭대학교 50
 
3.9%
경희대학교 49
 
3.8%
충북대학교 49
 
3.8%
명지대학교 49
 
3.8%
Other values (39) 683
52.9%

학과
Text

Distinct68
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2024-04-17T22:24:15.304268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length9.0085271
Min length5

Characters and Unicode

Total characters11621
Distinct characters123
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row반도체기계시스템공학과
2nd row반도체기계시스템공학과
3rd row반도체기계시스템공학과
4th row반도체기계시스템공학과
5th row반도체기계시스템공학과
ValueCountFrequency (%)
스마트융합컨설팅학과 58
 
4.5%
it융합학과 51
 
3.9%
스마트생산융합시스템공학과 40
 
3.1%
ai·sw융합학과 35
 
2.7%
산업시스템공학과 34
 
2.6%
ai기술경영학과 33
 
2.5%
스마트소재부품공학과 32
 
2.5%
스마트사회인프라유지관리학과 32
 
2.5%
스마트생산경영공학과 31
 
2.4%
화장품산업학과 30
 
2.3%
Other values (59) 926
71.1%
2024-04-17T22:24:15.618183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1213
 
10.4%
1153
 
9.9%
748
 
6.4%
619
 
5.3%
518
 
4.5%
518
 
4.5%
404
 
3.5%
380
 
3.3%
332
 
2.9%
255
 
2.2%
Other values (113) 5481
47.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10666
91.8%
Uppercase Letter 772
 
6.6%
Other Punctuation 95
 
0.8%
Dash Punctuation 30
 
0.3%
Lowercase Letter 30
 
0.3%
Decimal Number 16
 
0.1%
Space Separator 12
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1213
 
11.4%
1153
 
10.8%
748
 
7.0%
619
 
5.8%
518
 
4.9%
518
 
4.9%
404
 
3.8%
380
 
3.6%
332
 
3.1%
255
 
2.4%
Other values (98) 4526
42.4%
Uppercase Letter
ValueCountFrequency (%)
I 239
31.0%
T 159
20.6%
A 128
16.6%
C 92
 
11.9%
S 53
 
6.9%
M 42
 
5.4%
W 35
 
4.5%
D 24
 
3.1%
Other Punctuation
ValueCountFrequency (%)
· 45
47.4%
. 26
27.4%
/ 24
25.3%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 30
100.0%
Decimal Number
ValueCountFrequency (%)
6 16
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10666
91.8%
Latin 802
 
6.9%
Common 153
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1213
 
11.4%
1153
 
10.8%
748
 
7.0%
619
 
5.8%
518
 
4.9%
518
 
4.9%
404
 
3.8%
380
 
3.6%
332
 
3.1%
255
 
2.4%
Other values (98) 4526
42.4%
Latin
ValueCountFrequency (%)
I 239
29.8%
T 159
19.8%
A 128
16.0%
C 92
 
11.5%
S 53
 
6.6%
M 42
 
5.2%
W 35
 
4.4%
e 30
 
3.7%
D 24
 
3.0%
Common
ValueCountFrequency (%)
· 45
29.4%
- 30
19.6%
. 26
17.0%
/ 24
15.7%
6 16
 
10.5%
12
 
7.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10666
91.8%
ASCII 910
 
7.8%
None 45
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1213
 
11.4%
1153
 
10.8%
748
 
7.0%
619
 
5.8%
518
 
4.9%
518
 
4.9%
404
 
3.8%
380
 
3.6%
332
 
3.1%
255
 
2.4%
Other values (98) 4526
42.4%
ASCII
ValueCountFrequency (%)
I 239
26.3%
T 159
17.5%
A 128
14.1%
C 92
 
10.1%
S 53
 
5.8%
M 42
 
4.6%
W 35
 
3.8%
- 30
 
3.3%
e 30
 
3.3%
. 26
 
2.9%
Other values (4) 76
 
8.4%
None
ValueCountFrequency (%)
· 45
100.0%

학위
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
석사
636 
학사
402 
전문학사
151 
박사
101 

Length

Max length4
Median length2
Mean length2.2341085
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row학사
2nd row학사
3rd row학사
4th row학사
5th row학사

Common Values

ValueCountFrequency (%)
석사 636
49.3%
학사 402
31.2%
전문학사 151
 
11.7%
박사 101
 
7.8%

Length

2024-04-17T22:24:15.752058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T22:24:15.849931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
석사 636
49.3%
학사 402
31.2%
전문학사 151
 
11.7%
박사 101
 
7.8%

사업자 번호
Text

UNIQUE 

Distinct1290
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2024-04-17T22:24:16.075557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters15480
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1290 ?
Unique (%)100.0%

Sample

1st row134-81-35081
2nd row214-87-06910
3rd row134-81-56470
4th row135-86-25219
5th row138-01-85320
ValueCountFrequency (%)
134-81-35081 1
 
0.1%
220-81-91007 1
 
0.1%
603-81-76856 1
 
0.1%
622-81-06588 1
 
0.1%
606-81-48927 1
 
0.1%
107-86-80236 1
 
0.1%
736-87-00982 1
 
0.1%
204-86-02491 1
 
0.1%
606-81-70074 1
 
0.1%
606-81-11468 1
 
0.1%
Other values (1280) 1280
99.2%
2024-04-17T22:24:16.439070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 2580
16.7%
1 2230
14.4%
8 1969
12.7%
0 1717
11.1%
6 1253
8.1%
2 1166
7.5%
3 1066
6.9%
4 1002
 
6.5%
5 911
 
5.9%
7 877
 
5.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 12900
83.3%
Dash Punctuation 2580
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2230
17.3%
8 1969
15.3%
0 1717
13.3%
6 1253
9.7%
2 1166
9.0%
3 1066
8.3%
4 1002
7.8%
5 911
7.1%
7 877
 
6.8%
9 709
 
5.5%
Dash Punctuation
ValueCountFrequency (%)
- 2580
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 15480
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 2580
16.7%
1 2230
14.4%
8 1969
12.7%
0 1717
11.1%
6 1253
8.1%
2 1166
7.5%
3 1066
6.9%
4 1002
 
6.5%
5 911
 
5.9%
7 877
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15480
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 2580
16.7%
1 2230
14.4%
8 1969
12.7%
0 1717
11.1%
6 1253
8.1%
2 1166
7.5%
3 1066
6.9%
4 1002
 
6.5%
5 911
 
5.9%
7 877
 
5.7%
Distinct1266
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2024-04-17T22:24:16.745337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length45
Mean length22.277519
Min length5

Characters and Unicode

Total characters28738
Distinct characters466
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1243 ?
Unique (%)96.4%

Sample

1st row경기 화성시 향남읍 발안공단로4길 97-17(발안지방산업단지 5-3B)
2nd row인천 연수
3rd row경기 시흥시 시화벤처로 299
4th row경기 오산시 세남로14번길 11
5th row경기 안양시 동안구 호계동 1072-4
ValueCountFrequency (%)
경기 202
 
3.1%
서울 157
 
2.4%
경남 147
 
2.2%
부산 116
 
1.8%
대전 82
 
1.3%
전북 81
 
1.2%
충북 78
 
1.2%
유성구 68
 
1.0%
강서구 63
 
1.0%
창원시 62
 
0.9%
Other values (2587) 5502
83.9%
2024-04-17T22:24:17.170037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5295
 
18.4%
1 1274
 
4.4%
1092
 
3.8%
892
 
3.1%
2 845
 
2.9%
761
 
2.6%
3 701
 
2.4%
615
 
2.1%
0 579
 
2.0%
542
 
1.9%
Other values (456) 16142
56.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16386
57.0%
Decimal Number 5963
 
20.7%
Space Separator 5295
 
18.4%
Other Punctuation 367
 
1.3%
Dash Punctuation 304
 
1.1%
Close Punctuation 157
 
0.5%
Open Punctuation 157
 
0.5%
Uppercase Letter 91
 
0.3%
Math Symbol 11
 
< 0.1%
Lowercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1092
 
6.7%
892
 
5.4%
761
 
4.6%
615
 
3.8%
542
 
3.3%
467
 
2.8%
435
 
2.7%
408
 
2.5%
404
 
2.5%
382
 
2.3%
Other values (419) 10388
63.4%
Uppercase Letter
ValueCountFrequency (%)
A 23
25.3%
B 20
22.0%
C 12
13.2%
T 9
 
9.9%
I 7
 
7.7%
D 4
 
4.4%
K 3
 
3.3%
G 3
 
3.3%
R 2
 
2.2%
L 2
 
2.2%
Other values (5) 6
 
6.6%
Decimal Number
ValueCountFrequency (%)
1 1274
21.4%
2 845
14.2%
3 701
11.8%
0 579
9.7%
4 535
9.0%
5 487
 
8.2%
6 453
 
7.6%
7 417
 
7.0%
8 347
 
5.8%
9 325
 
5.5%
Other Punctuation
ValueCountFrequency (%)
, 363
98.9%
. 3
 
0.8%
& 1
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
b 2
50.0%
s 1
25.0%
e 1
25.0%
Space Separator
ValueCountFrequency (%)
5295
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 304
100.0%
Close Punctuation
ValueCountFrequency (%)
) 157
100.0%
Open Punctuation
ValueCountFrequency (%)
( 157
100.0%
Math Symbol
ValueCountFrequency (%)
~ 11
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16389
57.0%
Common 12254
42.6%
Latin 95
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1092
 
6.7%
892
 
5.4%
761
 
4.6%
615
 
3.8%
542
 
3.3%
467
 
2.8%
435
 
2.7%
408
 
2.5%
404
 
2.5%
382
 
2.3%
Other values (420) 10391
63.4%
Common
ValueCountFrequency (%)
5295
43.2%
1 1274
 
10.4%
2 845
 
6.9%
3 701
 
5.7%
0 579
 
4.7%
4 535
 
4.4%
5 487
 
4.0%
6 453
 
3.7%
7 417
 
3.4%
, 363
 
3.0%
Other values (8) 1305
 
10.6%
Latin
ValueCountFrequency (%)
A 23
24.2%
B 20
21.1%
C 12
12.6%
T 9
 
9.5%
I 7
 
7.4%
D 4
 
4.2%
K 3
 
3.2%
G 3
 
3.2%
R 2
 
2.1%
b 2
 
2.1%
Other values (8) 10
10.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16386
57.0%
ASCII 12349
43.0%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5295
42.9%
1 1274
 
10.3%
2 845
 
6.8%
3 701
 
5.7%
0 579
 
4.7%
4 535
 
4.3%
5 487
 
3.9%
6 453
 
3.7%
7 417
 
3.4%
, 363
 
2.9%
Other values (26) 1400
 
11.3%
Hangul
ValueCountFrequency (%)
1092
 
6.7%
892
 
5.4%
761
 
4.6%
615
 
3.8%
542
 
3.3%
467
 
2.8%
435
 
2.7%
408
 
2.5%
404
 
2.5%
382
 
2.3%
Other values (419) 10388
63.4%
None
ValueCountFrequency (%)
3
100.0%

기업유형
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
중소기업
1241 
중견기업
 
49

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중소기업
2nd row중소기업
3rd row중소기업
4th row중소기업
5th row중소기업

Common Values

ValueCountFrequency (%)
중소기업 1241
96.2%
중견기업 49
 
3.8%

Length

2024-04-17T22:24:17.295069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T22:24:17.383077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중소기업 1241
96.2%
중견기업 49
 
3.8%
Distinct74
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2024-04-17T22:24:17.540420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters3870
Distinct characters75
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)1.6%

Sample

1st row김**
2nd row조**
3rd row권**
4th row이**
5th row고**
ValueCountFrequency (%)
265
20.5%
197
15.3%
102
 
7.9%
72
 
5.6%
57
 
4.4%
46
 
3.6%
32
 
2.5%
32
 
2.5%
30
 
2.3%
24
 
1.9%
Other values (64) 433
33.6%
2024-04-17T22:24:17.823541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 2580
66.7%
265
 
6.8%
197
 
5.1%
102
 
2.6%
72
 
1.9%
57
 
1.5%
46
 
1.2%
32
 
0.8%
32
 
0.8%
30
 
0.8%
Other values (65) 457
 
11.8%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 2580
66.7%
Other Letter 1287
33.3%
Uppercase Letter 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
265
20.6%
197
15.3%
102
 
7.9%
72
 
5.6%
57
 
4.4%
46
 
3.6%
32
 
2.5%
32
 
2.5%
30
 
2.3%
24
 
1.9%
Other values (61) 430
33.4%
Uppercase Letter
ValueCountFrequency (%)
S 1
33.3%
J 1
33.3%
A 1
33.3%
Other Punctuation
ValueCountFrequency (%)
* 2580
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2580
66.7%
Hangul 1287
33.3%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
265
20.6%
197
15.3%
102
 
7.9%
72
 
5.6%
57
 
4.4%
46
 
3.6%
32
 
2.5%
32
 
2.5%
30
 
2.3%
24
 
1.9%
Other values (61) 430
33.4%
Latin
ValueCountFrequency (%)
S 1
33.3%
J 1
33.3%
A 1
33.3%
Common
ValueCountFrequency (%)
* 2580
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2583
66.7%
Hangul 1287
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 2580
99.9%
S 1
 
< 0.1%
J 1
 
< 0.1%
A 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
265
20.6%
197
15.3%
102
 
7.9%
72
 
5.6%
57
 
4.4%
46
 
3.6%
32
 
2.5%
32
 
2.5%
30
 
2.3%
24
 
1.9%
Other values (61) 430
33.4%

Correlations

2024-04-17T22:24:17.904991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
참여대학학과학위기업유형대표자
참여대학1.0001.0000.9370.1880.000
학과1.0001.0000.9860.2740.353
학위0.9370.9861.0000.0920.000
기업유형0.1880.2740.0921.0000.000
대표자0.0000.3530.0000.0001.000
2024-04-17T22:24:17.993194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기업유형학위참여대학
기업유형1.0000.0610.147
학위0.0611.0000.753
참여대학0.1470.7531.000
2024-04-17T22:24:18.074841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
참여대학학위기업유형
참여대학1.0000.7530.147
학위0.7531.0000.061
기업유형0.1470.0611.000

Missing values

2024-04-17T22:24:14.293869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T22:24:14.419135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기업명참여대학학과학위사업자 번호소재지기업유형대표자
0효동기계공업(주)한국공학대학교반도체기계시스템공학과학사134-81-35081경기 화성시 향남읍 발안공단로4길 97-17(발안지방산업단지 5-3B)중소기업김**
1(주)이너트론한국공학대학교반도체기계시스템공학과학사214-87-06910인천 연수중소기업조**
2신일이엔티㈜한국공학대학교반도체기계시스템공학과학사134-81-56470경기 시흥시 시화벤처로 299중소기업권**
3주식회사 이엠아이한국공학대학교반도체기계시스템공학과학사135-86-25219경기 오산시 세남로14번길 11중소기업이**
4코라스한국공학대학교반도체기계시스템공학과학사138-01-85320경기 안양시 동안구 호계동 1072-4중소기업고**
5㈜엠엠티케이한국공학대학교반도체기계시스템공학과학사142-81-54342경기 용인시 처인구 모현읍 곡현로 750중소기업차**
6주식회사 기산테크한국공학대학교반도체기계시스템공학과학사140-81-93989경기 시흥시 마유로 136 (시화공단3라 124호)중소기업이**
7현대합성공업 주식회사한국공학대학교반도체기계시스템공학과학사134-81-03560경기 안산시 단원구 해안로 130중견기업정**
8주식회사 지.티.아이한국공학대학교반도체기계시스템공학과학사123-81-78741경기 화성시 남양읍 주석로184번길 22-23중소기업이**
9하이스큐한국공학대학교반도체기계시스템공학과학사140-11-06012경기 시흥시 시흥대로 1115중소기업이**
기업명참여대학학과학위사업자 번호소재지기업유형대표자
1280(주)한진산업부산대학교기계부품시스템전공석사621-81-06057경남 양산시 하북면 양산대로 1981-48중소기업윤**
1281유원산업(주)부산대학교기계부품시스템전공석사603-81-19272부산광역시 사하구 을숙도대로677번길 23중소기업권**
1282(주)유진코메탈부산대학교기계부품시스템전공석사606-81-64285부산 강서구 녹산산업북로313번길 28중소기업오**
1283(주)디오부산대학교기계부품시스템전공석사621-81-01561부산광역시 해운대구 센텀서로 66중소기업김**
1284동보체인공업(주)부산대학교기계부품시스템전공석사621-81-13956부산 기장군 정관읍 농공길 27중소기업이**
1285(주)한미유압기계부산대학교기계부품시스템전공석사606-81-22403부산광역시 사상구 낙동대로 1420번길 24중소기업전**
1286(주)킴부산대학교기계부품시스템전공석사609-81-88952경상남도 창원시 의창구 대산면 진산대로 269중소기업김**
1287(주)대건테크부산대학교기계부품시스템전공석사609-81-64848경남 창원시 의창구 사화로 138중소기업신**
1288(주)우진정밀부산대학교기계부품시스템전공석사606-81-58444경상남도 김해시 생림면 생림대로 713번길 4중소기업김**
1289태광후지킨(주)부산대학교기계부품시스템전공석사135-81-92203부산광역시 강서구 녹산산단261 로 88번길 7중견기업김**