Overview

Dataset statistics

Number of variables4
Number of observations35
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory35.8 B

Variable types

Categorical2
Text2

Dataset

Description벤처타운(다산관, 장영실관) 입주현황에 대한 데이터로(번호, 업체명, 업종, 주생산품 등)의 항목을 제공합니다.
Author대전광역시
URLhttps://www.data.go.kr/data/15077576/fileData.do

Alerts

기업명 has unique valuesUnique
주생산품 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:53:52.025024
Analysis finished2023-12-12 21:53:52.397622
Duration0.37 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct2
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size412.0 B
장영실관
27 
다산관

Length

Max length4
Median length4
Mean length3.7714286
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row다산관
2nd row다산관
3rd row다산관
4th row다산관
5th row다산관

Common Values

ValueCountFrequency (%)
장영실관 27
77.1%
다산관 8
 
22.9%

Length

2023-12-13T06:53:52.471227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:53:52.570434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
장영실관 27
77.1%
다산관 8
 
22.9%

기업명
Text

UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-13T06:53:52.784706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length5.5714286
Min length3

Characters and Unicode

Total characters195
Distinct characters91
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row㈜성산기업
2nd row㈜피코팩
3rd row㈜포휴먼스
4th row이지엠㈜
5th row태평양이노베이션(주)
ValueCountFrequency (%)
㈜성산기업 1
 
2.8%
㈜키프로젠 1
 
2.8%
브라이튼 1
 
2.8%
세야산업 1
 
2.8%
㈜지에스지 1
 
2.8%
㈜휴텍스 1
 
2.8%
㈜유니플라텍 1
 
2.8%
㈜로드텍 1
 
2.8%
㈜삼정이엔에스 1
 
2.8%
㈜파이오셀 1
 
2.8%
Other values (26) 26
72.2%
2023-12-13T06:53:53.285511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25
 
12.8%
14
 
7.2%
12
 
6.2%
7
 
3.6%
5
 
2.6%
5
 
2.6%
5
 
2.6%
5
 
2.6%
) 4
 
2.1%
4
 
2.1%
Other values (81) 109
55.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 161
82.6%
Other Symbol 25
 
12.8%
Close Punctuation 4
 
2.1%
Open Punctuation 4
 
2.1%
Space Separator 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
8.7%
12
 
7.5%
7
 
4.3%
5
 
3.1%
5
 
3.1%
5
 
3.1%
5
 
3.1%
4
 
2.5%
3
 
1.9%
3
 
1.9%
Other values (77) 98
60.9%
Other Symbol
ValueCountFrequency (%)
25
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 186
95.4%
Common 9
 
4.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
13.4%
14
 
7.5%
12
 
6.5%
7
 
3.8%
5
 
2.7%
5
 
2.7%
5
 
2.7%
5
 
2.7%
4
 
2.2%
3
 
1.6%
Other values (78) 101
54.3%
Common
ValueCountFrequency (%)
) 4
44.4%
( 4
44.4%
1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 161
82.6%
None 25
 
12.8%
ASCII 9
 
4.6%

Most frequent character per block

None
ValueCountFrequency (%)
25
100.0%
Hangul
ValueCountFrequency (%)
14
 
8.7%
12
 
7.5%
7
 
4.3%
5
 
3.1%
5
 
3.1%
5
 
3.1%
5
 
3.1%
4
 
2.5%
3
 
1.9%
3
 
1.9%
Other values (77) 98
60.9%
ASCII
ValueCountFrequency (%)
) 4
44.4%
( 4
44.4%
1
 
11.1%

업종
Categorical

Distinct15
Distinct (%)42.9%
Missing0
Missing (%)0.0%
Memory size412.0 B
제조업
10 
전기전자
제조
원자력
정보통신
Other values (10)
11 

Length

Max length14
Median length13
Mean length4.1714286
Min length2

Unique

Unique9 ?
Unique (%)25.7%

Sample

1st row화학
2nd row반도체 및 기타 전자부품
3rd row전기전자
4th row전기전자
5th row전기전자

Common Values

ValueCountFrequency (%)
제조업 10
28.6%
전기전자 7
20.0%
제조 3
 
8.6%
원자력 2
 
5.7%
정보통신 2
 
5.7%
기계 2
 
5.7%
화학 1
 
2.9%
반도체 및 기타 전자부품 1
 
2.9%
생명공학 1
 
2.9%
단열패키지 1
 
2.9%
Other values (5) 5
14.3%

Length

2023-12-13T06:53:53.440887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제조업 10
23.8%
전기전자 7
16.7%
제조 3
 
7.1%
3
 
7.1%
원자력 2
 
4.8%
정보통신 2
 
4.8%
기계 2
 
4.8%
제조업,서비스 1
 
2.4%
부품 1
 
2.4%
통신장비 1
 
2.4%
Other values (10) 10
23.8%

주생산품
Text

UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-13T06:53:53.719557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length10
Mean length6.9714286
Min length3

Characters and Unicode

Total characters244
Distinct characters128
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row코팅재(도료)
2nd row반도체
3rd row기초화학물질
4th row전자식 가바나
5th row정수기
ValueCountFrequency (%)
3
 
5.4%
제조 2
 
3.6%
코팅재(도료 1
 
1.8%
과학기자재 1
 
1.8%
위성통신용 1
 
1.8%
수신기 1
 
1.8%
시뮬레이터 1
 
1.8%
수력모듈 1
 
1.8%
통신장비 1
 
1.8%
부품 1
 
1.8%
Other values (43) 43
76.8%
2023-12-13T06:53:54.151192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
9.0%
8
 
3.3%
7
 
2.9%
6
 
2.5%
5
 
2.0%
5
 
2.0%
5
 
2.0%
4
 
1.6%
4
 
1.6%
4
 
1.6%
Other values (118) 174
71.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 213
87.3%
Space Separator 22
 
9.0%
Lowercase Letter 4
 
1.6%
Open Punctuation 2
 
0.8%
Close Punctuation 2
 
0.8%
Other Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
3.8%
7
 
3.3%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (110) 161
75.6%
Lowercase Letter
ValueCountFrequency (%)
n 1
25.0%
s 1
25.0%
e 1
25.0%
r 1
25.0%
Space Separator
ValueCountFrequency (%)
22
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 213
87.3%
Common 27
 
11.1%
Latin 4
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
3.8%
7
 
3.3%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (110) 161
75.6%
Common
ValueCountFrequency (%)
22
81.5%
( 2
 
7.4%
) 2
 
7.4%
, 1
 
3.7%
Latin
ValueCountFrequency (%)
n 1
25.0%
s 1
25.0%
e 1
25.0%
r 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 213
87.3%
ASCII 31
 
12.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
22
71.0%
( 2
 
6.5%
) 2
 
6.5%
n 1
 
3.2%
s 1
 
3.2%
e 1
 
3.2%
r 1
 
3.2%
, 1
 
3.2%
Hangul
ValueCountFrequency (%)
8
 
3.8%
7
 
3.3%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (110) 161
75.6%

Correlations

2023-12-13T06:53:54.258039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분기업명업종주생산품
구분1.0001.0000.4361.000
기업명1.0001.0001.0001.000
업종0.4361.0001.0001.000
주생산품1.0001.0001.0001.000
2023-12-13T06:53:54.363307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분업종
구분1.0000.293
업종0.2931.000
2023-12-13T06:53:54.460321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분업종
구분1.0000.293
업종0.2931.000

Missing values

2023-12-13T06:53:52.269012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:53:52.362785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분기업명업종주생산품
0다산관㈜성산기업화학코팅재(도료)
1다산관㈜피코팩반도체 및 기타 전자부품반도체
2다산관㈜포휴먼스전기전자기초화학물질
3다산관이지엠㈜전기전자전자식 가바나
4다산관태평양이노베이션(주)전기전자정수기
5다산관㈜엔바이로코리아원자력방사선동위원소분배 제조
6다산관시크제네시스제조업식품신선도 유지 기능성 제품
7다산관㈜로카스전기전자가스 분석기
8장영실관㈜엑스엠더블유정보통신위성통신용 수신기
9장영실관㈜키프로젠생명공학resn(단백질 분리정체 메진)
구분기업명업종주생산품
25장영실관㈜로드텍정보기술 및 컴퓨터운영관리정보기술 및 컴퓨터운영 관리
26장영실관㈜에이엔씨통신장비 및 부품통신장비 및 부품
27장영실관선샤인광학(주)제조안경렌즈
28장영실관(주)미래뷰제조방독면
29장영실관㈜필리아바이오제조업탈취제
30장영실관제스텍제조업반도체부품조립
31장영실관㈜에카제조특수냉각장치
32장영실관㈜탑드림제조업미용재료
33장영실관㈜부국보일러제조업보일러
34장영실관맛있는 밥상위탁급식업위탁급식