Overview

Dataset statistics

Number of variables4
Number of observations26
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory964.0 B
Average record size in memory37.1 B

Variable types

Categorical2
Text2

Dataset

Description인천광역시 바이오기관과 관련한 현황으로 기관명, 구분( 학교 연구소 공공기관 등), 지역, 주요분야(또는 관련 학과) 등의 항목값에 대한 데이터 정보를 제공합니다.
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15038423&srcSe=7661IVAWM27C61E190

Alerts

지역 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 지역High correlation
기업명 has unique valuesUnique

Reproduction

Analysis started2024-01-28 13:24:40.978436
Analysis finished2024-01-28 13:24:41.261207
Duration0.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)34.6%
Missing0
Missing (%)0.0%
Memory size340.0 B
연수구
13 
미추홀구
서구
남동구
계양구
Other values (4)

Length

Max length4
Median length3
Mean length2.9615385
Min length2

Unique

Unique4 ?
Unique (%)15.4%

Sample

1st row옹진군
2nd row서구
3rd row서구
4th row중구
5th row연수구

Common Values

ValueCountFrequency (%)
연수구 13
50.0%
미추홀구 3
 
11.5%
서구 2
 
7.7%
남동구 2
 
7.7%
계양구 2
 
7.7%
옹진군 1
 
3.8%
중구 1
 
3.8%
부평구 1
 
3.8%
동구 1
 
3.8%

Length

2024-01-28T22:24:41.321384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T22:24:41.426054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
연수구 13
50.0%
미추홀구 3
 
11.5%
서구 2
 
7.7%
남동구 2
 
7.7%
계양구 2
 
7.7%
옹진군 1
 
3.8%
중구 1
 
3.8%
부평구 1
 
3.8%
동구 1
 
3.8%

구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)15.4%
Missing0
Missing (%)0.0%
Memory size340.0 B
학교
14 
연구소
공공기관
연구원
 
1

Length

Max length4
Median length2
Mean length2.6153846
Min length2

Unique

Unique1 ?
Unique (%)3.8%

Sample

1st row공공기관
2nd row공공기관
3rd row공공기관
4th row공공기관
5th row연구소

Common Values

ValueCountFrequency (%)
학교 14
53.8%
연구소 7
26.9%
공공기관 4
 
15.4%
연구원 1
 
3.8%

Length

2024-01-28T22:24:41.533573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T22:24:41.625988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
학교 14
53.8%
연구소 7
26.9%
공공기관 4
 
15.4%
연구원 1
 
3.8%

기업명
Text

UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size340.0 B
2024-01-28T22:24:41.788434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length18
Mean length10.461538
Min length5

Characters and Unicode

Total characters272
Distinct characters102
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)100.0%

Sample

1st row인천광역시 수산자원연구소
2nd row국립생물자원관
3rd row국립환경과학원 본원
4th row국립수의과학검역원
5th rowIFEZ바이오분석지원센터
ValueCountFrequency (%)
가천대학교 2
 
5.7%
연세대학교 1
 
2.9%
kcl(한국건설생활환경시험연구원 1
 
2.9%
인천경기지원 1
 
2.9%
인천뷰티예술고등학교 1
 
2.9%
인천바이오과학고등학교 1
 
2.9%
인천해양과학고 1
 
2.9%
송도글로벌캠퍼스 1
 
2.9%
인천대학교 1
 
2.9%
수산자원연구소 1
 
2.9%
Other values (24) 24
68.6%
2024-01-28T22:24:42.098761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20
 
7.4%
12
 
4.4%
12
 
4.4%
12
 
4.4%
10
 
3.7%
9
 
3.3%
9
 
3.3%
9
 
3.3%
8
 
2.9%
7
 
2.6%
Other values (92) 164
60.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 250
91.9%
Uppercase Letter 10
 
3.7%
Space Separator 9
 
3.3%
Open Punctuation 1
 
0.4%
Close Punctuation 1
 
0.4%
Dash Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
8.0%
12
 
4.8%
12
 
4.8%
12
 
4.8%
10
 
4.0%
9
 
3.6%
9
 
3.6%
8
 
3.2%
7
 
2.8%
6
 
2.4%
Other values (79) 145
58.0%
Uppercase Letter
ValueCountFrequency (%)
D 2
20.0%
K 1
10.0%
L 1
10.0%
C 1
10.0%
S 1
10.0%
Z 1
10.0%
E 1
10.0%
F 1
10.0%
I 1
10.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 250
91.9%
Common 12
 
4.4%
Latin 10
 
3.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
8.0%
12
 
4.8%
12
 
4.8%
12
 
4.8%
10
 
4.0%
9
 
3.6%
9
 
3.6%
8
 
3.2%
7
 
2.8%
6
 
2.4%
Other values (79) 145
58.0%
Latin
ValueCountFrequency (%)
D 2
20.0%
K 1
10.0%
L 1
10.0%
C 1
10.0%
S 1
10.0%
Z 1
10.0%
E 1
10.0%
F 1
10.0%
I 1
10.0%
Common
ValueCountFrequency (%)
9
75.0%
( 1
 
8.3%
) 1
 
8.3%
- 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 250
91.9%
ASCII 22
 
8.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
20
 
8.0%
12
 
4.8%
12
 
4.8%
12
 
4.8%
10
 
4.0%
9
 
3.6%
9
 
3.6%
8
 
3.2%
7
 
2.8%
6
 
2.4%
Other values (79) 145
58.0%
ASCII
ValueCountFrequency (%)
9
40.9%
D 2
 
9.1%
K 1
 
4.5%
L 1
 
4.5%
( 1
 
4.5%
) 1
 
4.5%
C 1
 
4.5%
- 1
 
4.5%
S 1
 
4.5%
Z 1
 
4.5%
Other values (3) 3
 
13.6%
Distinct25
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size340.0 B
2024-01-28T22:24:42.308102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length27
Mean length15.115385
Min length4

Characters and Unicode

Total characters393
Distinct characters105
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)92.3%

Sample

1st row수산자원연구
2nd row생물자원연구
3rd row환경연구
4th row수의과학기술개발 연구
5th row바이오의약품 분석지원
ValueCountFrequency (%)
연구 5
 
6.8%
3
 
4.1%
화학생명공학과 2
 
2.7%
2
 
2.7%
치료제 2
 
2.7%
관련 2
 
2.7%
바이오제약과 1
 
1.4%
화학과 1
 
1.4%
뷰티아트과 1
 
1.4%
뷰티디자인과 1
 
1.4%
Other values (53) 53
72.6%
2024-01-28T22:24:42.659564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
47
 
12.0%
29
 
7.4%
24
 
6.1%
, 20
 
5.1%
15
 
3.8%
13
 
3.3%
10
 
2.5%
10
 
2.5%
9
 
2.3%
8
 
2.0%
Other values (95) 208
52.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 319
81.2%
Space Separator 47
 
12.0%
Other Punctuation 20
 
5.1%
Uppercase Letter 3
 
0.8%
Open Punctuation 2
 
0.5%
Close Punctuation 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
9.1%
24
 
7.5%
15
 
4.7%
13
 
4.1%
10
 
3.1%
10
 
3.1%
9
 
2.8%
8
 
2.5%
8
 
2.5%
7
 
2.2%
Other values (88) 186
58.3%
Uppercase Letter
ValueCountFrequency (%)
Y 1
33.3%
S 1
33.3%
P 1
33.3%
Space Separator
ValueCountFrequency (%)
47
100.0%
Other Punctuation
ValueCountFrequency (%)
, 20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 319
81.2%
Common 71
 
18.1%
Latin 3
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
9.1%
24
 
7.5%
15
 
4.7%
13
 
4.1%
10
 
3.1%
10
 
3.1%
9
 
2.8%
8
 
2.5%
8
 
2.5%
7
 
2.2%
Other values (88) 186
58.3%
Common
ValueCountFrequency (%)
47
66.2%
, 20
28.2%
( 2
 
2.8%
) 2
 
2.8%
Latin
ValueCountFrequency (%)
Y 1
33.3%
S 1
33.3%
P 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 319
81.2%
ASCII 74
 
18.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
47
63.5%
, 20
27.0%
( 2
 
2.7%
) 2
 
2.7%
Y 1
 
1.4%
S 1
 
1.4%
P 1
 
1.4%
Hangul
ValueCountFrequency (%)
29
 
9.1%
24
 
7.5%
15
 
4.7%
13
 
4.1%
10
 
3.1%
10
 
3.1%
9
 
2.8%
8
 
2.5%
8
 
2.5%
7
 
2.2%
Other values (88) 186
58.3%

Correlations

2024-01-28T22:24:42.737308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역구분기업명주요분야(또는 관련학과)
지역1.0000.7741.0001.000
구분0.7741.0001.0001.000
기업명1.0001.0001.0001.000
주요분야(또는 관련학과)1.0001.0001.0001.000
2024-01-28T22:24:42.808372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역구분
지역1.0000.542
구분0.5421.000
2024-01-28T22:24:42.874184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역구분
지역1.0000.542
구분0.5421.000

Missing values

2024-01-28T22:24:41.162616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T22:24:41.231397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지역구분기업명주요분야(또는 관련학과)
0옹진군공공기관인천광역시 수산자원연구소수산자원연구
1서구공공기관국립생물자원관생물자원연구
2서구공공기관국립환경과학원 본원환경연구
3중구공공기관국립수의과학검역원수의과학기술개발 연구
4연수구연구소IFEZ바이오분석지원센터바이오의약품 분석지원
5연수구연구소유타-인하DDS 및 신의료기술개발공동연구소약물전달기술 및 나노기술 기반 치료제 개발
6연수구연구소이길여암당뇨연구원인간 대사성 질환 및 암 관련 연구
7연수구연구소제이씨비공동생물과학연구소생물학분야 기초연구 및 생명공학기술의 산업화 연구 등
8연수구연구소한국건설생활환경시험연구원 바이오본부신뢰성보증, 기술심사, 환경의료, 비임상연구 지원
9연수구연구소한국해양과학기술원 부설 극지연구소국지 생명 연구
지역구분기업명주요분야(또는 관련학과)
16연수구학교연세대학교약학대학, YSP추진본부, 공과대학, 생명시스템대학
17연수구학교인천대학교생명과학기술대학(생명과학부, 생명공학부)
18연수구학교가천대학교약학과, 의예과
19계양구학교경인여자대학교뷰티스킨케어학과
20계양구학교계산공업고등학교식품생명과학과
21부평구학교인천미래생활고등학교바이오식품과
22미추홀구학교인하공업전문대학화학생명공학과
23미추홀구학교인하대학교생명과학과, 화학과
24미추홀구학교청운대학교화학생명공학과
25동구학교인천재능대학교송도바이오생명과, 코스메틱개발과, 뷰티스타일리스트과