Overview

Dataset statistics

Number of variables4
Number of observations59
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory35.2 B

Variable types

Text3
Numeric1

Dataset

Description대구경북첨단의료산업진흥재단에서 보유하고 있는 입주기업 현황 자료입니다.
Author대구경북첨단의료산업진흥재단
URLhttps://www.data.go.kr/data/15020969/fileData.do

Alerts

필지번호 has unique valuesUnique
면적(㎡) has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:58:01.413435
Analysis finished2023-12-12 13:58:01.964289
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

필지번호
Text

UNIQUE 

Distinct59
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size604.0 B
2023-12-12T22:58:02.488105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length4
Mean length4.3389831
Min length2

Characters and Unicode

Total characters256
Distinct characters12
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)100.0%

Sample

1st row첨1-1
2nd row첨1-2
3rd row첨2-1
4th row첨2-2
5th row첨3-1
ValueCountFrequency (%)
첨1-1 1
 
1.7%
첨9-5 1
 
1.7%
첨11-1 1
 
1.7%
첨11-2 1
 
1.7%
첨11-3 1
 
1.7%
첨11-4 1
 
1.7%
첨12-1 1
 
1.7%
첨12-2 1
 
1.7%
첨12-3 1
 
1.7%
첨12-4 1
 
1.7%
Other values (49) 49
83.1%
2023-12-12T22:58:02.886956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
59
23.0%
- 55
21.5%
1 46
18.0%
3 20
 
7.8%
4 18
 
7.0%
2 18
 
7.0%
5 15
 
5.9%
6 13
 
5.1%
9 5
 
2.0%
7 4
 
1.6%
Other values (2) 3
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 142
55.5%
Other Letter 59
23.0%
Dash Punctuation 55
 
21.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 46
32.4%
3 20
14.1%
4 18
 
12.7%
2 18
 
12.7%
5 15
 
10.6%
6 13
 
9.2%
9 5
 
3.5%
7 4
 
2.8%
8 2
 
1.4%
0 1
 
0.7%
Other Letter
ValueCountFrequency (%)
59
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 55
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 197
77.0%
Hangul 59
 
23.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 55
27.9%
1 46
23.4%
3 20
 
10.2%
4 18
 
9.1%
2 18
 
9.1%
5 15
 
7.6%
6 13
 
6.6%
9 5
 
2.5%
7 4
 
2.0%
8 2
 
1.0%
Hangul
ValueCountFrequency (%)
59
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 197
77.0%
Hangul 59
 
23.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
59
100.0%
ASCII
ValueCountFrequency (%)
- 55
27.9%
1 46
23.4%
3 20
 
10.2%
4 18
 
9.1%
2 18
 
9.1%
5 15
 
7.6%
6 13
 
6.6%
9 5
 
2.5%
7 4
 
2.0%
8 2
 
1.0%

주소
Text

Distinct58
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size604.0 B
2023-12-12T22:58:03.098579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length20
Mean length16.525424
Min length14

Characters and Unicode

Total characters975
Distinct characters24
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)96.6%

Sample

1st row대구시 동구 신서동 1142
2nd row대구시 동구 신서동 1142-1
3rd row대구시 동구 신서동 1146
4th row대구시 동구 신서동 1146-1
5th row대구시 동구 동내동 산 78번지 일원
ValueCountFrequency (%)
대구시 59
24.0%
동구 59
24.0%
동내동 34
13.8%
대림동 16
 
6.5%
신서동 9
 
3.7%
5
 
2.0%
일원 5
 
2.0%
78번지 2
 
0.8%
1118-1 1
 
0.4%
881-1 1
 
0.4%
Other values (55) 55
22.4%
2023-12-12T22:58:03.487900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
187
19.2%
152
15.6%
1 119
12.2%
118
12.1%
75
7.7%
59
 
6.1%
- 43
 
4.4%
8 42
 
4.3%
34
 
3.5%
2 24
 
2.5%
Other values (14) 122
12.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 495
50.8%
Decimal Number 250
25.6%
Space Separator 187
 
19.2%
Dash Punctuation 43
 
4.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
152
30.7%
118
23.8%
75
15.2%
59
 
11.9%
34
 
6.9%
16
 
3.2%
9
 
1.8%
9
 
1.8%
5
 
1.0%
5
 
1.0%
Other values (3) 13
 
2.6%
Decimal Number
ValueCountFrequency (%)
1 119
47.6%
8 42
 
16.8%
2 24
 
9.6%
3 21
 
8.4%
5 13
 
5.2%
6 13
 
5.2%
4 11
 
4.4%
7 4
 
1.6%
0 3
 
1.2%
Space Separator
ValueCountFrequency (%)
187
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 43
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 495
50.8%
Common 480
49.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
152
30.7%
118
23.8%
75
15.2%
59
 
11.9%
34
 
6.9%
16
 
3.2%
9
 
1.8%
9
 
1.8%
5
 
1.0%
5
 
1.0%
Other values (3) 13
 
2.6%
Common
ValueCountFrequency (%)
187
39.0%
1 119
24.8%
- 43
 
9.0%
8 42
 
8.8%
2 24
 
5.0%
3 21
 
4.4%
5 13
 
2.7%
6 13
 
2.7%
4 11
 
2.3%
7 4
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 495
50.8%
ASCII 480
49.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
187
39.0%
1 119
24.8%
- 43
 
9.0%
8 42
 
8.8%
2 24
 
5.0%
3 21
 
4.4%
5 13
 
2.7%
6 13
 
2.7%
4 11
 
2.3%
7 4
 
0.8%
Hangul
ValueCountFrequency (%)
152
30.7%
118
23.8%
75
15.2%
59
 
11.9%
34
 
6.9%
16
 
3.2%
9
 
1.8%
9
 
1.8%
5
 
1.0%
5
 
1.0%
Other values (3) 13
 
2.6%

면적(㎡)
Real number (ℝ)

UNIQUE 

Distinct59
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8202.4746
Minimum1650
Maximum70068.4
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size663.0 B
2023-12-12T22:58:03.634559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1650
5-th percentile1744.64
Q12270.6
median4020.1
Q39858.2
95-th percentile21343.06
Maximum70068.4
Range68418.4
Interquartile range (IQR)7587.6

Descriptive statistics

Standard deviation11714.079
Coefficient of variation (CV)1.4281152
Kurtosis16.04921
Mean8202.4746
Median Absolute Deviation (MAD)2131.9
Skewness3.7494626
Sum483946
Variance1.3721964 × 108
MonotonicityNot monotonic
2023-12-12T22:58:03.810217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35113.6 1
 
1.7%
51988.7 1
 
1.7%
3945.1 1
 
1.7%
4020.1 1
 
1.7%
1650.0 1
 
1.7%
4655.1 1
 
1.7%
2298.1 1
 
1.7%
4435.2 1
 
1.7%
2256.2 1
 
1.7%
2186.3 1
 
1.7%
Other values (49) 49
83.1%
ValueCountFrequency (%)
1650.0 1
1.7%
1687.3 1
1.7%
1740.5 1
1.7%
1745.1 1
1.7%
1848.1 1
1.7%
1886.5 1
1.7%
1887.1 1
1.7%
1887.4 1
1.7%
1888.2 1
1.7%
1889.0 1
1.7%
ValueCountFrequency (%)
70068.4 1
1.7%
51988.7 1
1.7%
35113.6 1
1.7%
19813.0 1
1.7%
17031.0 1
1.7%
16660.3 1
1.7%
15710.0 1
1.7%
12039.0 1
1.7%
11991.0 1
1.7%
11660.0 1
1.7%
Distinct34
Distinct (%)57.6%
Missing0
Missing (%)0.0%
Memory size604.0 B
2023-12-12T22:58:04.038873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length19
Mean length8.0677966
Min length3

Characters and Unicode

Total characters476
Distinct characters131
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)49.2%

Sample

1st row한국뇌병원 예정
2nd row한국뇌연구원
3rd row한국한의학연구원
4th row한의학연구원 예정
5th row국가분자이미징센터 예정
ValueCountFrequency (%)
예정 31
32.0%
분양 20
20.6%
국가분자이미징센터 3
 
3.1%
의료기술시험훈련원 3
 
3.1%
경북대학교 2
 
2.1%
첨단정보통신융합산업기술원(3d융합연구센터 2
 
2.1%
㈜세신정밀 2
 
2.1%
㈜한국전통의학연구소 1
 
1.0%
㈜쎄텍 1
 
1.0%
한약진흥재단 1
 
1.0%
Other values (31) 31
32.0%
2023-12-12T22:58:04.428508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
 
8.0%
36
 
7.6%
32
 
6.7%
24
 
5.0%
21
 
4.4%
16
 
3.4%
14
 
2.9%
11
 
2.3%
11
 
2.3%
10
 
2.1%
Other values (121) 263
55.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 397
83.4%
Space Separator 38
 
8.0%
Other Symbol 16
 
3.4%
Lowercase Letter 13
 
2.7%
Open Punctuation 3
 
0.6%
Close Punctuation 3
 
0.6%
Uppercase Letter 3
 
0.6%
Decimal Number 2
 
0.4%
Dash Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
 
9.1%
32
 
8.1%
24
 
6.0%
21
 
5.3%
14
 
3.5%
11
 
2.8%
11
 
2.8%
10
 
2.5%
10
 
2.5%
8
 
2.0%
Other values (103) 220
55.4%
Lowercase Letter
ValueCountFrequency (%)
e 3
23.1%
c 2
15.4%
n 1
 
7.7%
r 1
 
7.7%
t 1
 
7.7%
l 1
 
7.7%
a 1
 
7.7%
i 1
 
7.7%
d 1
 
7.7%
m 1
 
7.7%
Uppercase Letter
ValueCountFrequency (%)
D 2
66.7%
K 1
33.3%
Space Separator
ValueCountFrequency (%)
38
100.0%
Other Symbol
ValueCountFrequency (%)
16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Decimal Number
ValueCountFrequency (%)
3 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 413
86.8%
Common 47
 
9.9%
Latin 16
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
 
8.7%
32
 
7.7%
24
 
5.8%
21
 
5.1%
16
 
3.9%
14
 
3.4%
11
 
2.7%
11
 
2.7%
10
 
2.4%
10
 
2.4%
Other values (104) 228
55.2%
Latin
ValueCountFrequency (%)
e 3
18.8%
c 2
12.5%
D 2
12.5%
n 1
 
6.2%
r 1
 
6.2%
t 1
 
6.2%
l 1
 
6.2%
a 1
 
6.2%
i 1
 
6.2%
d 1
 
6.2%
Other values (2) 2
12.5%
Common
ValueCountFrequency (%)
38
80.9%
( 3
 
6.4%
) 3
 
6.4%
3 2
 
4.3%
- 1
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 397
83.4%
ASCII 63
 
13.2%
None 16
 
3.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38
60.3%
( 3
 
4.8%
e 3
 
4.8%
) 3
 
4.8%
c 2
 
3.2%
3 2
 
3.2%
D 2
 
3.2%
n 1
 
1.6%
r 1
 
1.6%
t 1
 
1.6%
Other values (7) 7
 
11.1%
Hangul
ValueCountFrequency (%)
36
 
9.1%
32
 
8.1%
24
 
6.0%
21
 
5.3%
14
 
3.5%
11
 
2.8%
11
 
2.8%
10
 
2.5%
10
 
2.5%
8
 
2.0%
Other values (103) 220
55.4%
None
ValueCountFrequency (%)
16
100.0%

Interactions

2023-12-12T22:58:01.696074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:58:04.515131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
필지번호주소면적(㎡)기업 및 기관명
필지번호1.0001.0001.0001.000
주소1.0001.0001.0000.980
면적(㎡)1.0001.0001.0000.950
기업 및 기관명1.0000.9800.9501.000

Missing values

2023-12-12T22:58:01.826865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:58:01.931427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

필지번호주소면적(㎡)기업 및 기관명
0첨1-1대구시 동구 신서동 114235113.6한국뇌병원 예정
1첨1-2대구시 동구 신서동 1142-151988.7한국뇌연구원
2첨2-1대구시 동구 신서동 114611539.5한국한의학연구원
3첨2-2대구시 동구 신서동 1146-19334.5한의학연구원 예정
4첨3-1대구시 동구 동내동 산 78번지 일원8775.7국가분자이미징센터 예정
5첨3-2대구시 동구 동내동 산 78번지 일원11660.0한국생명공학연구원(첨단유전체연구소)
6첨3-3대구시 동구 동내동 산 34-1번지 일원11392.9국가분자이미징센터 예정
7첨3-4대구시 동구 동내동 산 37번지 일원9974.4국가분자이미징센터 예정
8첨3-5대구시 동구 동내동 산 230-111991.0식품의약품안전평가원 실험동물자원은행
9첨4-1대구시 동구 동내동 11137817.1분양 예정
필지번호주소면적(㎡)기업 및 기관명
49첨15-2대구시 동구 대림동 886-13713.0분양예정
50첨15-3대구시 동구 대림동 886-215710.0임상시험센터 예정
51첨15-4대구시 동구 대림동 886-317031.0분양 예정
52첨15-5대구시 동구 대림동 886-43072.5분양 예정
53첨15-6대구시 동구 대림동 886-53090.5분양 예정
54첨15-7대구시 동구 대림동 886-63302.2분양 예정
55첨15-8대구시 동구 대림동 886-73293.6분양 예정
56첨16-1대구시 동구 동내동 158-19039.8분양 예정
57첨16-2대구시 동구 동내동 158-29742.0분양 예정
58첨16-3대구시 동구 동내동 158-39010.3분양 예정