Overview

Dataset statistics

Number of variables: 6
Number of observations: 89
Missing cells: 4
Missing cells (%): 0.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.4 KiB
Average record size in memory: 50.5 B

Variable types

Numeric: 1
Categorical: 1
Text: 4

Dataset

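Description: 대구광역시_첨단의료복합단지 입주기업 현황_20220517 (Daegu Metropolitan City, status of tenant companies in the high-tech medical complex, 2022-05-17)
Author: 대구광역시 (Daegu Metropolitan City)
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=15036286&dataSetDetailId=150362861a574ae3bfb54&provdMethod=FILE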

Alerts

연번 is highly overall correlated with 입주형태 (High correlation)
입주형태 is highly overall correlated with 연번 (High correlation)
대표전화 has 4 (4.5%) missing values (Missing)
연번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 18:23:21.896301
Analysis finished: 2023-12-10 18:23:23.779707
Duration: 1.88 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 89
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 45
Minimum: 1
Maximum: 89
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 933.0 B

Quantile statistics

Minimum: 1
5th percentile: 5.4
Q1: 23
Median: 45
Q3: 67
95th percentile: 84.6
Maximum: 89
Range: 88
Interquartile range (IQR): 44

Descriptive statistics

Standard deviation: 25.836021
Coefficient of variation (CV): 0.57413381
Kurtosis: -1.2
Mean: 45
Median Absolute Deviation (MAD): 22
Skewness: 0
Sum: 4005
Variance: 667.5
Monotonicity: Strictly increasing
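
The figures above are consistent with 연번 being the integers 1 through 89. The following is a minimal sketch under that assumption, and under the further assumptions that percentiles are linearly interpolated and that the variance is the sample (n - 1) variance. The helper `percentile` is written for this illustration and is not part of ydata-profiling.

```python
import math
import statistics

# Assumption: 연번 holds the serial numbers 1..89 (Minimum 1, Maximum 89,
# 89 distinct values, strictly increasing).
values = list(range(1, 90))


def percentile(sorted_values, q):
    """Linearly interpolated percentile of a sorted list, q in [0, 1]."""
    position = q * (len(sorted_values) - 1)
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std_dev = math.sqrt(variance)
cv = std_dev / mean                     # coefficient of variation
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

p05 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)

print("mean", mean, "median", median, "sum", sum(values))
print("variance", variance, "std", round(std_dev, 6), "cv", round(cv, 8))
print("p05", round(p05, 1), "Q1", q1, "Q3", q3, "p95", round(p95, 1), "IQR", q3 - q1)
print("MAD", mad)
```

Comparing the printed values with the Quantile statistics and Descriptive statistics entries is a quick way to check which variance and percentile conventions a report uses.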
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 1.1%
68 | 1 | 1.1%
66 | 1 | 1.1%
65 | 1 | 1.1%
64 | 1 | 1.1%
63 | 1 | 1.1%
62 | 1 | 1.1%
61 | 1 | 1.1%
60 | 1 | 1.1%
59 | 1 | 1.1%
Other values (79) | 79 | 88.8%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 1.1%
2 | 1 | 1.1%
3 | 1 | 1.1%
4 | 1 | 1.1%
5 | 1 | 1.1%
6 | 1 | 1.1%
7 | 1 | 1.1%
8 | 1 | 1.1%
9 | 1 | 1.1%
10 | 1 | 1.1%

Maximum 10 values

Value | Count | Frequency (%)
89 | 1 | 1.1%
88 | 1 | 1.1%
87 | 1 | 1.1%
86 | 1 | 1.1%
85 | 1 | 1.1%
84 | 1 | 1.1%
83 | 1 | 1.1%
82 | 1 | 1.1%
81 | 1 | 1.1%
80 | 1 | 1.1%

입주형태 (tenancy type)
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B

Length

Max length: 13
Median length: 11
Mean length: 7.6629213
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%
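
The report distinguishes "Distinct" from "Unique". A small sketch of one reading of that distinction, assuming "Distinct" counts the different values present and "Unique" counts the values that occur exactly once. The function name `distinct_and_unique` is hypothetical, and the list is rebuilt from the category counts this report gives for 입주형태.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique


# Rebuilt from the reported category counts (39 + 27 + 12 + 7 + 4 = 89 rows).
tenancy = (
    ["공동연구센터 분양"] * 39
    + ["토지"] * 27
    + ["3D융합기술지원센터 임대"] * 12
    + ["커뮤니케이션센터 임대"] * 7
    + ["신약개발지원센터 임대"] * 4
)

n_distinct, n_unique = distinct_and_unique(tenancy)
print(n_distinct, n_unique)
```

Under this reading every category repeats, so no value is unique even though five distinct values exist.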

Sample

1st row: 토지
2nd row: 토지
3rd row: 토지
4th row: 토지
5th row: 토지

Common Values

Value | Count | Frequency (%)
공동연구센터 분양 | 39 | 43.8%
토지 | 27 | 30.3%
3D융합기술지원센터 임대 | 12 | 13.5%
커뮤니케이션센터 임대 | 7 | 7.9%
신약개발지원센터 임대 | 4 | 4.5%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the common values of 입주형태]

Words

Value | Count | Frequency (%)
공동연구센터 | 39 | 25.8%
분양 | 39 | 25.8%
토지 | 27 | 17.9%
임대 | 23 | 15.2%
3d융합기술지원센터 | 12 | 7.9%
커뮤니케이션센터 | 7 | 4.6%
신약개발지원센터 | 4 | 2.6%

한글명 (company name in Korean)
Text

Distinct: 88
Distinct (%): 98.9%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B

Length

Max length: 16
Median length: 14
Mean length: 6.752809
Min length: 3

Characters and Unicode

Total characters: 601
Distinct characters: 157
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
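
As an illustration of those character properties, the sketch below counts general categories with Python's standard `unicodedata` module. It is not how ydata-profiling computes its tables; `CATEGORY_NAMES` and `category_counts` are names made up for this example, and the sample string is one 한글명 value taken from this report. The standard library exposes general categories only, so scripts and blocks are not covered here.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "So": "Other Symbol",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
}


def category_counts(text):
    """Count the characters of `text` by Unicode general category name."""
    codes = Counter(unicodedata.category(ch) for ch in text)
    return {CATEGORY_NAMES.get(code, code): n for code, n in codes.items()}


counts = category_counts("㈜인트인(구 ㈜종로의료기)")
for name, n in sorted(counts.items(), key=lambda item: -item[1]):
    print(f"{name}: {n}")
```

Hangul syllables fall under Other Letter and the enclosed symbol ㈜ under Other Symbol, which matches the category names used in the tables of this section.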

Unique

Unique: 87
Unique (%): 97.8%

Sample

1st row: ㈜오성하이텍
2nd row: 동성제약㈜
3rd row: ㈜제이에스테크윈
4th row: ㈜세신정밀
5th row: ㈜플라즈맵

Words

Value | Count | Frequency (%)
㈜멘티스로지텍 | 2 | 2.2%
㈜오성하이텍 | 1 | 1.1%
올패스바이오 | 1 | 1.1%
㈜코넥스트 | 1 | 1.1%
㈜이롭(토지분양 | 1 | 1.1%
㈜레인보우앤네이처코리아 | 1 | 1.1%
코리아향진원 | 1 | 1.1%
㈜나노레이 | 1 | 1.1%
㈜래현 | 1 | 1.1%
㈜아크에이르 | 1 | 1.1%
Other values (81) | 81 | 88.0%

Most occurring characters

Value | Count | Frequency (%)
 | 81 | 13.5%
 | 37 | 6.2%
 | 31 | 5.2%
 | 20 | 3.3%
 | 17 | 2.8%
 | 14 | 2.3%
) | 13 | 2.2%
( | 13 | 2.2%
 | 12 | 2.0%
 | 12 | 2.0%
Other values (147) | 351 | 58.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 488 | 81.2%
Other Symbol | 81 | 13.5%
Close Punctuation | 13 | 2.2%
Open Punctuation | 13 | 2.2%
Space Separator | 3 | 0.5%
Uppercase Letter | 3 | 0.5%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 37 | 7.6%
 | 31 | 6.4%
 | 20 | 4.1%
 | 17 | 3.5%
 | 14 | 2.9%
 | 12 | 2.5%
 | 12 | 2.5%
 | 11 | 2.3%
 | 11 | 2.3%
 | 9 | 1.8%
Other values (140) | 314 | 64.3%

Uppercase Letter
Value | Count | Frequency (%)
G | 1 | 33.3%
S | 1 | 33.3%
O | 1 | 33.3%

Other Symbol
Value | Count | Frequency (%)
 | 81 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 13 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 13 | 100.0%

Space Separator
Value | Count | Frequency (%)
 | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 569 | 94.7%
Common | 29 | 4.8%
Latin | 3 | 0.5%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 81 | 14.2%
 | 37 | 6.5%
 | 31 | 5.4%
 | 20 | 3.5%
 | 17 | 3.0%
 | 14 | 2.5%
 | 12 | 2.1%
 | 12 | 2.1%
 | 11 | 1.9%
 | 11 | 1.9%
Other values (141) | 323 | 56.8%

Common
Value | Count | Frequency (%)
) | 13 | 44.8%
( | 13 | 44.8%
 | 3 | 10.3%

Latin
Value | Count | Frequency (%)
G | 1 | 33.3%
S | 1 | 33.3%
O | 1 | 33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 488 | 81.2%
None | 81 | 13.5%
ASCII | 32 | 5.3%

Most frequent character per block

None
Value | Count | Frequency (%)
 | 81 | 100.0%

Hangul
Value | Count | Frequency (%)
 | 37 | 7.6%
 | 31 | 6.4%
 | 20 | 4.1%
 | 17 | 3.5%
 | 14 | 2.9%
 | 12 | 2.5%
 | 12 | 2.5%
 | 11 | 2.3%
 | 11 | 2.3%
 | 9 | 1.8%
Other values (140) | 314 | 64.3%

ASCII
Value | Count | Frequency (%)
) | 13 | 40.6%
( | 13 | 40.6%
 | 3 | 9.4%
G | 1 | 3.1%
S | 1 | 3.1%
O | 1 | 3.1%

소재지 주소 (location address)
Text

Distinct: 45
Distinct (%): 50.6%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B

Length

Max length: 12
Median length: 9
Mean length: 8.2808989
Min length: 3

Characters and Unicode

Total characters: 737
Distinct characters: 58
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 34
Unique (%): 38.2%

Sample

1st row: 대구시 동구(첨복단지)
2nd row: 서울시 도봉구
3rd row: 대구시 동구(첨복단지)
4th row: 대구시 달성군
5th row: 대전시 유성구

Words

Value | Count | Frequency (%)
대구시 | 36 | 20.3%
동구(첨복단지 | 29 | 16.4%
대구 | 11 | 6.2%
경기도 | 8 | 4.5%
서울시 | 7 | 4.0%
북구 | 7 | 4.0%
서울 | 5 | 2.8%
수성구 | 5 | 2.8%
동구 | 4 | 2.3%
경북 | 4 | 2.3%
Other values (41) | 61 | 34.5%

Most occurring characters

Value | Count | Frequency (%)
 | 118 | 16.0%
 | 88 | 11.9%
 | 71 | 9.6%
 | 54 | 7.3%
 | 34 | 4.6%
( | 29 | 3.9%
 | 29 | 3.9%
 | 29 | 3.9%
 | 29 | 3.9%
 | 29 | 3.9%
Other values (48) | 227 | 30.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 591 | 80.2%
Space Separator | 88 | 11.9%
Open Punctuation | 29 | 3.9%
Close Punctuation | 29 | 3.9%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 118 | 20.0%
 | 71 | 12.0%
 | 54 | 9.1%
 | 34 | 5.8%
 | 29 | 4.9%
 | 29 | 4.9%
 | 29 | 4.9%
 | 29 | 4.9%
 | 22 | 3.7%
 | 18 | 3.0%
Other values (45) | 158 | 26.7%

Space Separator
Value | Count | Frequency (%)
 | 88 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 29 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 29 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 591 | 80.2%
Common | 146 | 19.8%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 118 | 20.0%
 | 71 | 12.0%
 | 54 | 9.1%
 | 34 | 5.8%
 | 29 | 4.9%
 | 29 | 4.9%
 | 29 | 4.9%
 | 29 | 4.9%
 | 22 | 3.7%
 | 18 | 3.0%
Other values (45) | 158 | 26.7%

Common
Value | Count | Frequency (%)
 | 88 | 60.3%
( | 29 | 19.9%
) | 29 | 19.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 591 | 80.2%
ASCII | 146 | 19.8%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 118 | 20.0%
 | 71 | 12.0%
 | 54 | 9.1%
 | 34 | 5.8%
 | 29 | 4.9%
 | 29 | 4.9%
 | 29 | 4.9%
 | 29 | 4.9%
 | 22 | 3.7%
 | 18 | 3.0%
Other values (45) | 158 | 26.7%

ASCII
Value | Count | Frequency (%)
 | 88 | 60.3%
( | 29 | 19.9%
) | 29 | 19.9%

주요연구분야(제품) (main research field (product))
Text

Distinct: 82
Distinct (%): 92.1%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B

Length

Max length: 61
Median length: 28
Mean length: 16.505618
Min length: 4

Characters and Unicode

Total characters: 1469
Distinct characters: 281
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 78
Unique (%): 87.6%

Sample

1st row: 고주파가열기, 전원공급장치, 의료기기 부품
2nd row: 염모제, 정로환, 세븐에이트 외
3rd row: 방사선 검출기 및 부품, 섬광체 등
4th row: 치과용 핸드피스 등 의료기기
5th row: 플라즈마 멸균기, 플라즈마 피부치료기 등

Words

Value | Count | Frequency (%)
 | 16 | 5.1%
의료기기 | 15 | 4.8%
 | 14 | 4.5%
 | 11 | 3.5%
3d | 5 | 1.6%
임플란트 | 4 | 1.3%
개발 | 3 | 1.0%
창상피복제 | 3 | 1.0%
부품 | 3 | 1.0%
정형외과용 | 3 | 1.0%
Other values (208) | 235 | 75.3%

Most occurring characters

Value | Count | Frequency (%)
 | 231 | 15.7%
 | 67 | 4.6%
, | 53 | 3.6%
 | 33 | 2.2%
 | 32 | 2.2%
 | 22 | 1.5%
 | 21 | 1.4%
 | 20 | 1.4%
 | 19 | 1.3%
 | 19 | 1.3%
Other values (271) | 952 | 64.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1025 | 69.8%
Space Separator | 231 | 15.7%
Lowercase Letter | 97 | 6.6%
Other Punctuation | 54 | 3.7%
Uppercase Letter | 30 | 2.0%
Close Punctuation | 10 | 0.7%
Open Punctuation | 10 | 0.7%
Decimal Number | 9 | 0.6%
Dash Punctuation | 3 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 67 | 6.5%
 | 33 | 3.2%
 | 32 | 3.1%
 | 22 | 2.1%
 | 21 | 2.0%
 | 20 | 2.0%
 | 19 | 1.9%
 | 19 | 1.9%
 | 17 | 1.7%
 | 16 | 1.6%
Other values (238) | 759 | 74.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 13 | 13.4%
i | 12 | 12.4%
l | 10 | 10.3%
s | 9 | 9.3%
t | 8 | 8.2%
n | 8 | 8.2%
r | 7 | 7.2%
a | 7 | 7.2%
o | 7 | 7.2%
f | 4 | 4.1%
Other values (7) | 12 | 12.4%

Uppercase Letter
Value | Count | Frequency (%)
D | 10 | 33.3%
P | 8 | 26.7%
S | 4 | 13.3%
B | 3 | 10.0%
A | 1 | 3.3%
C | 1 | 3.3%
X | 1 | 3.3%
V | 1 | 3.3%
I | 1 | 3.3%

Other Punctuation
Value | Count | Frequency (%)
, | 53 | 98.1%
/ | 1 | 1.9%

Space Separator
Value | Count | Frequency (%)
 | 231 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 10 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 10 | 100.0%

Decimal Number
Value | Count | Frequency (%)
3 | 9 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1025 | 69.8%
Common | 317 | 21.6%
Latin | 127 | 8.6%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 67 | 6.5%
 | 33 | 3.2%
 | 32 | 3.1%
 | 22 | 2.1%
 | 21 | 2.0%
 | 20 | 2.0%
 | 19 | 1.9%
 | 19 | 1.9%
 | 17 | 1.7%
 | 16 | 1.6%
Other values (238) | 759 | 74.0%

Latin
Value | Count | Frequency (%)
e | 13 | 10.2%
i | 12 | 9.4%
l | 10 | 7.9%
D | 10 | 7.9%
s | 9 | 7.1%
t | 8 | 6.3%
n | 8 | 6.3%
P | 8 | 6.3%
r | 7 | 5.5%
a | 7 | 5.5%
Other values (16) | 35 | 27.6%

Common
Value | Count | Frequency (%)
 | 231 | 72.9%
, | 53 | 16.7%
) | 10 | 3.2%
( | 10 | 3.2%
3 | 9 | 2.8%
- | 3 | 0.9%
/ | 1 | 0.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1025 | 69.8%
ASCII | 444 | 30.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
 | 231 | 52.0%
, | 53 | 11.9%
e | 13 | 2.9%
i | 12 | 2.7%
l | 10 | 2.3%
D | 10 | 2.3%
) | 10 | 2.3%
( | 10 | 2.3%
s | 9 | 2.0%
3 | 9 | 2.0%
Other values (23) | 77 | 17.3%

Hangul
Value | Count | Frequency (%)
 | 67 | 6.5%
 | 33 | 3.2%
 | 32 | 3.1%
 | 22 | 2.1%
 | 21 | 2.0%
 | 20 | 2.0%
 | 19 | 1.9%
 | 19 | 1.9%
 | 17 | 1.7%
 | 16 | 1.6%
Other values (238) | 759 | 74.0%

대표전화 (main phone number)
Text

MISSING 

Distinct: 80
Distinct (%): 94.1%
Missing: 4
Missing (%): 4.5%
Memory size: 844.0 B

Length

Max length: 13
Median length: 12
Mean length: 11.905882
Min length: 9

Characters and Unicode

Total characters: 1012
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 75
Unique (%): 88.2%

Sample

1st row: 053-355-8228
2nd row: 02-6911-3600
3rd row: 070-7794-4596
4th row: 053-580-0902
5th row: 042-716-2115

Words

Value | Count | Frequency (%)
053-811-8191 | 2 | 2.4%
053-312-2582 | 2 | 2.4%
053-942-1117 | 2 | 2.4%
053-355-8228 | 2 | 2.4%
1577-2805 | 2 | 2.4%
053-962-4900 | 1 | 1.2%
054-337-6333 | 1 | 1.2%
053-745-5447 | 1 | 1.2%
053-817-9891 | 1 | 1.2%
053-217-2477 | 1 | 1.2%
Other values (70) | 70 | 82.4%

Most occurring characters

Value | Count | Frequency (%)
- | 168 | 16.6%
0 | 149 | 14.7%
3 | 127 | 12.5%
5 | 122 | 12.1%
1 | 101 | 10.0%
2 | 80 | 7.9%
8 | 66 | 6.5%
7 | 55 | 5.4%
4 | 53 | 5.2%
9 | 48 | 4.7%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 844 | 83.4%
Dash Punctuation | 168 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 149 | 17.7%
3 | 127 | 15.0%
5 | 122 | 14.5%
1 | 101 | 12.0%
2 | 80 | 9.5%
8 | 66 | 7.8%
7 | 55 | 6.5%
4 | 53 | 6.3%
9 | 48 | 5.7%
6 | 43 | 5.1%

Dash Punctuation
Value | Count | Frequency (%)
- | 168 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1012 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 168 | 16.6%
0 | 149 | 14.7%
3 | 127 | 12.5%
5 | 122 | 12.1%
1 | 101 | 10.0%
2 | 80 | 7.9%
8 | 66 | 6.5%
7 | 55 | 5.4%
4 | 53 | 5.2%
9 | 48 | 4.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1012 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 168 | 16.6%
0 | 149 | 14.7%
3 | 127 | 12.5%
5 | 122 | 12.1%
1 | 101 | 10.0%
2 | 80 | 7.9%
8 | 66 | 6.5%
7 | 55 | 5.4%
4 | 53 | 5.2%
9 | 48 | 4.7%

Interactions

[Figure: interactions plot]

Correlations

Variable | 연번 | 입주형태 | 한글명 | 소재지 주소 | 주요연구분야(제품) | 대표전화
연번 | 1.000 | 0.979 | 0.940 | 0.548 | 0.913 | 0.861
입주형태 | 0.979 | 1.000 | 0.850 | 0.645 | 0.828 | 0.944
한글명 | 0.940 | 0.850 | 1.000 | 1.000 | 0.996 | 0.996
소재지 주소 | 0.548 | 0.645 | 1.000 | 1.000 | 0.794 | 0.999
주요연구분야(제품) | 0.913 | 0.828 | 0.996 | 0.794 | 1.000 | 0.985
대표전화 | 0.861 | 0.944 | 0.996 | 0.999 | 0.985 | 1.000

Variable | 연번 | 입주형태
연번 | 1.000 | 0.771
입주형태 | 0.771 | 1.000

Missing values

[Figure: nullity count by column] A simple visualization of nullity by column.
[Figure: nullity matrix] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
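
The nullity figures behind these plots can be checked by hand. The sketch below is a plain-Python illustration, not the report's own code: `missing_summary` is a hypothetical helper, the rows are placeholders, and only rows 5 and 85 are known from the sample to lack a phone number, so the other two missing positions (40 and 41) are assumed for the example.

```python
def missing_summary(rows, columns):
    """Return (missing count per column, overall share of missing cells)."""
    per_column = {
        col: sum(1 for row in rows if row.get(col) is None) for col in columns
    }
    total_cells = len(rows) * len(columns)
    share = sum(per_column.values()) / total_cells if total_cells else 0.0
    return per_column, share


columns = ["연번", "입주형태", "한글명", "소재지 주소", "주요연구분야(제품)", "대표전화"]

# Placeholder data: 89 fully populated records.
rows = [{col: "x" for col in columns} for _ in range(89)]

# Blank out 대표전화 in four rows; positions 40 and 41 are assumed.
for index in (5, 40, 41, 85):
    rows[index]["대표전화"] = None

per_column, share = missing_summary(rows, columns)
phone_share = per_column["대표전화"] / len(rows)

print(per_column)
print(f"missing cells: {share:.1%}, 대표전화 missing: {phone_share:.1%}")
```

Four missing cells out of 89 x 6 = 534 gives the 0.7% in the overview, while four out of 89 rows gives the 4.5% reported for 대표전화.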

Sample

First rows

Index | 연번 | 입주형태 | 한글명 | 소재지 주소 | 주요연구분야(제품) | 대표전화
0 | 1 | 토지 | ㈜오성하이텍 | 대구시 동구(첨복단지) | 고주파가열기, 전원공급장치, 의료기기 부품 | 053-355-8228
1 | 2 | 토지 | 동성제약㈜ | 서울시 도봉구 | 염모제, 정로환, 세븐에이트 외 | 02-6911-3600
2 | 3 | 토지 | ㈜제이에스테크윈 | 대구시 동구(첨복단지) | 방사선 검출기 및 부품, 섬광체 등 | 070-7794-4596
3 | 4 | 토지 | ㈜세신정밀 | 대구시 달성군 | 치과용 핸드피스 등 의료기기 | 053-580-0902
4 | 5 | 토지 | ㈜플라즈맵 | 대전시 유성구 | 플라즈마 멸균기, 플라즈마 피부치료기 등 | 042-716-2115
5 | 6 | 토지 | ㈜이노벡테크놀러지 | 대구시 동구 | 진공부품장치(가스분석기, 승화정제정비 등), 의료기기 외 | <NA>
6 | 7 | 토지 | ㈜인트인(구 ㈜종로의료기) | 대구시 동구 | 의료기기 | 02-843-2085
7 | 8 | 토지 | ㈜루트로닉 | 경기도 고양시 | facial lotion 외, 기타 레이저, 의료기기 외 | 070-4714-6006
8 | 9 | 토지 | 대우제약㈜ | 부산시 사하구 | 의약제제품(완제품) | 051-204-3831
9 | 10 | 토지 | ㈜쎄텍 | 대구광역시 달서구 | 너트러너, 토르크변환기, 로드셀 외 | 053-585-5261

Last rows

Index | 연번 | 입주형태 | 한글명 | 소재지 주소 | 주요연구분야(제품) | 대표전화
79 | 80 | 3D융합기술지원센터 임대 | 애니메디솔루션㈜ | 서울 송파구 | 3D프린팅 환자맞춤형 의료기기(수술시뮬레이터, 수술가이드, 맞춤형임플란드) | 02-6591-1311
80 | 81 | 3D융합기술지원센터 임대 | 바이오코엔㈜ | 경기 평택시 | 척추 나사삽입 가이드 | 031-8053-9631
81 | 82 | 3D융합기술지원센터 임대 | ㈜멘티스로지텍 | 대구 수성구 | 3D 프린터 의료기기 | 053-961-5833
82 | 83 | 3D융합기술지원센터 임대 | ㈜코렌텍 | 충남 천안시 | 인공관절 | 041-585-7114
83 | 84 | 3D융합기술지원센터 임대 | ㈜지에스메디칼 | 충북 청주시 | 정형외과용 의료기기 | 043-237-7397
84 | 85 | 3D융합기술지원센터 임대 | ㈜지비에스커먼웰스 | 서울 금천구 | 정형외과용 의료기기 | 02-6925-4469
85 | 86 | 3D융합기술지원센터 임대 | ㈜비트러스트메디텍 | 서울 서초구 | 임플란트, 인공관절 | <NA>
86 | 87 | 3D융합기술지원센터 임대 | 휴카시스템㈜ | 대구시 동구(첨복단지) | 재활, 케어 로봇시스템 | 02-978-0225
87 | 88 | 3D융합기술지원센터 임대 | ㈜솔고바이오메디칼 | 경기도 평택시 | 3D프린팅 척추임플란트 | 031-610-4000
88 | 89 | 3D융합기술지원센터 임대 | ㈜티큐브잇 | 세종시 | 면역 세포기반 면역함암제 연구개발 | 044-866-7200