Overview

Dataset statistics

Number of variables6
Number of observations26
Missing cells59
Missing cells (%)37.8%
Duplicate rows2
Duplicate rows (%)7.7%
Total size in memory1.3 KiB
Average record size in memory53.1 B

Variable types

Unsupported1
Text4
Categorical1

Dataset

Description탄소융합부품소재창업보육센터입주업체현황20146
Author전라북도
URLhttps://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=202133

Alerts

Dataset has 2 (7.7%) duplicate rowsDuplicates
탄소융합부품소재 창업보육센터 입주업체 현황 (13개사) has 11 (42.3%) missing valuesMissing
Unnamed: 1 has 12 (46.2%) missing valuesMissing
Unnamed: 2 has 12 (46.2%) missing valuesMissing
Unnamed: 3 has 1 (3.8%) missing valuesMissing
Unnamed: 5 has 23 (88.5%) missing valuesMissing
탄소융합부품소재 창업보육센터 입주업체 현황 (13개사) is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-14 01:27:23.673154
Analysis finished2024-03-14 01:27:24.383357
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Missing11
Missing (%)42.3%
Memory size340.0 B

Unnamed: 1
Text

MISSING 

Distinct14
Distinct (%)100.0%
Missing12
Missing (%)46.2%
Memory size340.0 B
2024-03-14T10:27:24.487402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length5.2142857
Min length3

Characters and Unicode

Total characters73
Distinct characters51
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)100.0%

Sample

1st row업체명
2nd row이지컴퍼지스
3rd row신아티앤씨
4th rowHK&CT(주)
5th row알티모
ValueCountFrequency (%)
업체명 1
 
6.7%
이지컴퍼지스 1
 
6.7%
신아티앤씨 1
 
6.7%
hk&ct(주 1
 
6.7%
알티모 1
 
6.7%
㈜유메코 1
 
6.7%
㈜r&dt 1
 
6.7%
휴먼컴퍼지트 1
 
6.7%
㈜유광화학 1
 
6.7%
㈜아이즈텍 1
 
6.7%
Other values (5) 5
33.3%
2024-03-14T10:27:24.800690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7
 
9.6%
4
 
5.5%
& 3
 
4.1%
3
 
4.1%
2
 
2.7%
o 2
 
2.7%
T 2
 
2.7%
C 2
 
2.7%
2
 
2.7%
2
 
2.7%
Other values (41) 44
60.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 44
60.3%
Uppercase Letter 11
 
15.1%
Other Symbol 7
 
9.6%
Lowercase Letter 4
 
5.5%
Other Punctuation 3
 
4.1%
Space Separator 1
 
1.4%
Decimal Number 1
 
1.4%
Open Punctuation 1
 
1.4%
Close Punctuation 1
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
9.1%
3
 
6.8%
2
 
4.5%
2
 
4.5%
2
 
4.5%
2
 
4.5%
2
 
4.5%
2
 
4.5%
1
 
2.3%
1
 
2.3%
Other values (23) 23
52.3%
Uppercase Letter
ValueCountFrequency (%)
T 2
18.2%
C 2
18.2%
S 1
9.1%
F 1
9.1%
H 1
9.1%
K 1
9.1%
R 1
9.1%
D 1
9.1%
W 1
9.1%
Lowercase Letter
ValueCountFrequency (%)
o 2
50.0%
t 1
25.0%
l 1
25.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%
Other Punctuation
ValueCountFrequency (%)
& 3
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51
69.9%
Latin 15
 
20.5%
Common 7
 
9.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
13.7%
4
 
7.8%
3
 
5.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%
2
 
3.9%
1
 
2.0%
Other values (24) 24
47.1%
Latin
ValueCountFrequency (%)
o 2
13.3%
T 2
13.3%
C 2
13.3%
t 1
6.7%
S 1
6.7%
l 1
6.7%
F 1
6.7%
H 1
6.7%
K 1
6.7%
R 1
6.7%
Other values (2) 2
13.3%
Common
ValueCountFrequency (%)
& 3
42.9%
1
 
14.3%
2 1
 
14.3%
( 1
 
14.3%
) 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 44
60.3%
ASCII 22
30.1%
None 7
 
9.6%

Most frequent character per block

None
ValueCountFrequency (%)
7
100.0%
Hangul
ValueCountFrequency (%)
4
 
9.1%
3
 
6.8%
2
 
4.5%
2
 
4.5%
2
 
4.5%
2
 
4.5%
2
 
4.5%
2
 
4.5%
1
 
2.3%
1
 
2.3%
Other values (23) 23
52.3%
ASCII
ValueCountFrequency (%)
& 3
13.6%
o 2
 
9.1%
T 2
 
9.1%
C 2
 
9.1%
t 1
 
4.5%
S 1
 
4.5%
l 1
 
4.5%
1
 
4.5%
F 1
 
4.5%
2 1
 
4.5%
Other values (7) 7
31.8%

Unnamed: 2
Text

MISSING 

Distinct14
Distinct (%)100.0%
Missing12
Missing (%)46.2%
Memory size340.0 B
2024-03-14T10:27:24.959465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters42
Distinct characters35
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)100.0%

Sample

1st row대표자
2nd row정연중
3rd row최춘구
4th row이주영
5th row홍성환
ValueCountFrequency (%)
대표자 1
 
7.1%
정연중 1
 
7.1%
최춘구 1
 
7.1%
이주영 1
 
7.1%
홍성환 1
 
7.1%
권기철 1
 
7.1%
조계현 1
 
7.1%
양승운 1
 
7.1%
국광호 1
 
7.1%
김철웅 1
 
7.1%
Other values (4) 4
28.6%
2024-03-14T10:27:25.197711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (25) 25
59.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (25) 25
59.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (25) 25
59.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (25) 25
59.5%

Unnamed: 3
Text

MISSING 

Distinct22
Distinct (%)88.0%
Missing1
Missing (%)3.8%
Memory size340.0 B
2024-03-14T10:27:25.398636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length15
Mean length10
Min length2

Characters and Unicode

Total characters250
Distinct characters106
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)76.0%

Sample

1st row주생산품
2nd row열가소성 수지 프리프레그 및 성형품
3rd rowCFRP용 열경화성 에폭시 시스켐
4th row캠핑 카라반, 생물체운송용 트럭, 소형선박
5th row냉장고 탈취제
ValueCountFrequency (%)
회수 2
 
3.6%
탄소섬유 2
 
3.6%
sic복합재료 2
 
3.6%
filler 2
 
3.6%
방열판 2
 
3.6%
탄소를 2
 
3.6%
포장재 1
 
1.8%
개발 1
 
1.8%
cfrp풍력 1
 
1.8%
blade개발 1
 
1.8%
Other values (39) 39
70.9%
2024-03-14T10:27:25.906798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30
 
12.0%
10
 
4.0%
, 9
 
3.6%
l 7
 
2.8%
F 5
 
2.0%
5
 
2.0%
e 5
 
2.0%
5
 
2.0%
C 5
 
2.0%
5
 
2.0%
Other values (96) 164
65.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 167
66.8%
Space Separator 30
 
12.0%
Lowercase Letter 24
 
9.6%
Uppercase Letter 20
 
8.0%
Other Punctuation 9
 
3.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
6.0%
5
 
3.0%
5
 
3.0%
5
 
3.0%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (80) 116
69.5%
Lowercase Letter
ValueCountFrequency (%)
l 7
29.2%
e 5
20.8%
i 4
16.7%
r 4
16.7%
o 1
 
4.2%
a 1
 
4.2%
p 1
 
4.2%
d 1
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
F 5
25.0%
C 5
25.0%
P 4
20.0%
R 3
15.0%
S 2
 
10.0%
B 1
 
5.0%
Space Separator
ValueCountFrequency (%)
30
100.0%
Other Punctuation
ValueCountFrequency (%)
, 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 167
66.8%
Latin 44
 
17.6%
Common 39
 
15.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
6.0%
5
 
3.0%
5
 
3.0%
5
 
3.0%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (80) 116
69.5%
Latin
ValueCountFrequency (%)
l 7
15.9%
F 5
11.4%
e 5
11.4%
C 5
11.4%
i 4
9.1%
P 4
9.1%
r 4
9.1%
R 3
6.8%
S 2
 
4.5%
B 1
 
2.3%
Other values (4) 4
9.1%
Common
ValueCountFrequency (%)
30
76.9%
, 9
 
23.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 167
66.8%
ASCII 83
33.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30
36.1%
, 9
 
10.8%
l 7
 
8.4%
F 5
 
6.0%
e 5
 
6.0%
C 5
 
6.0%
i 4
 
4.8%
P 4
 
4.8%
r 4
 
4.8%
R 3
 
3.6%
Other values (6) 7
 
8.4%
Hangul
ValueCountFrequency (%)
10
 
6.0%
5
 
3.0%
5
 
3.0%
5
 
3.0%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (80) 116
69.5%

Unnamed: 4
Categorical

Distinct7
Distinct (%)26.9%
Missing0
Missing (%)0.0%
Memory size340.0 B
<NA>
13 
-
대표번호
 
1
212-2220
 
1
214-9234
 
1
Other values (2)

Length

Max length8
Median length4
Mean length3.6923077
Min length1

Unique

Unique5 ?
Unique (%)19.2%

Sample

1st row<NA>
2nd row대표번호
3rd row-
4th row-
5th row212-2220

Common Values

ValueCountFrequency (%)
<NA> 13
50.0%
- 8
30.8%
대표번호 1
 
3.8%
212-2220 1
 
3.8%
214-9234 1
 
3.8%
842-1803 1
 
3.8%
211-2967 1
 
3.8%

Length

2024-03-14T10:27:26.024602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T10:27:26.120871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 13
50.0%
8
30.8%
대표번호 1
 
3.8%
212-2220 1
 
3.8%
214-9234 1
 
3.8%
842-1803 1
 
3.8%
211-2967 1
 
3.8%

Unnamed: 5
Text

MISSING 

Distinct2
Distinct (%)66.7%
Missing23
Missing (%)88.5%
Memory size340.0 B
2024-03-14T10:27:26.217171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters6
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)33.3%

Sample

1st row비고
2nd row퇴거
3rd row퇴거
ValueCountFrequency (%)
퇴거 2
66.7%
비고 1
33.3%
2024-03-14T10:27:26.401833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%

Correlations

2024-03-14T10:27:26.473472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5
Unnamed: 11.0001.0001.0001.0001.000
Unnamed: 21.0001.0001.0001.0001.000
Unnamed: 31.0001.0001.0000.8601.000
Unnamed: 41.0001.0000.8601.0000.000
Unnamed: 51.0001.0001.0000.0001.000

Missing values

2024-03-14T10:27:24.135457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T10:27:24.218759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T10:27:24.312814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

탄소융합부품소재 창업보육센터 입주업체 현황 (13개사)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5
02014. 06. 17.<NA><NA><NA><NA><NA>
1연번업체명대표자주생산품대표번호비고
21이지컴퍼지스정연중열가소성 수지 프리프레그 및 성형품-<NA>
32신아티앤씨최춘구CFRP용 열경화성 에폭시 시스켐-퇴거
43HK&CT(주)이주영캠핑 카라반, 생물체운송용 트럭, 소형선박212-2220<NA>
54알티모홍성환냉장고 탈취제<NA><NA>
65㈜유메코권기철탐소섬유 롤러, 이중관, 전자기기케이스214-9234<NA>
76㈜R&DT조계현CFRP Propeller 개발-<NA>
8NaN<NA><NA>CFRP풍력 Blade개발<NA><NA>
97휴먼컴퍼지트양승운SiC복합재료 Filler,-<NA>
탄소융합부품소재 창업보육센터 입주업체 현황 (13개사)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5
16NaN<NA><NA>보안등<NA><NA>
1710Solto C&F최기운탄소를 이용한 식품용-<NA>
18NaN<NA><NA>플라스틱 포장재<NA><NA>
1911㈜에이치엘김봉원복합소재 패스트너-퇴거
2012㈜피치케이블임동욱탄소를 활용한211-2967<NA>
21NaN<NA><NA>교통신호등주, 전기<NA><NA>
22NaN<NA><NA>발열체 제품<NA><NA>
23132W㈜원태연친환경-<NA>
24NaN<NA><NA>탄소복합소재<NA><NA>
25NaN<NA><NA>창호개발<NA><NA>

Duplicate rows

Most frequently occurring

Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5# duplicates
0<NA><NA>방열판, 탄소섬유<NA><NA>2
1<NA><NA>회수<NA><NA>2