Overview

Dataset statistics

Number of variables7
Number of observations490
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory27.4 KiB
Average record size in memory57.3 B

Variable types

Text3
Categorical1
DateTime2
Numeric1

Dataset

Description대구도시개발공사 사업별 담당자 정보 입니다. 메타데이터기반 공공데이터 개방자료이기 때문에 가공되지 않은 원본 테이블의 데이터가 등록되었습니다.
URLhttps://www.data.go.kr/data/15120605/fileData.do

Alerts

정렬순서 has 27 (5.5%) zerosZeros

Reproduction

Analysis started2023-12-12 01:24:30.633480
Analysis finished2023-12-12 01:24:31.346349
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사번
Text

Distinct89
Distinct (%)18.2%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-12T10:24:31.512225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.9938776
Min length5

Characters and Unicode

Total characters3917
Distinct characters17
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)7.8%

Sample

1st row19910091
2nd row20169063
3rd row20040171
4th row20040171
5th row20040171
ValueCountFrequency (%)
20160238 34
 
6.9%
20189093 29
 
5.9%
20189092 29
 
5.9%
20170257 27
 
5.5%
20210320 24
 
4.9%
20150237 22
 
4.5%
20200308 20
 
4.1%
20160248 20
 
4.1%
20040171 17
 
3.5%
19940154 17
 
3.5%
Other values (79) 251
51.2%
2023-12-12T10:24:31.897471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1063
27.1%
2 725
18.5%
9 660
16.8%
1 519
13.2%
3 194
 
5.0%
8 177
 
4.5%
4 159
 
4.1%
7 145
 
3.7%
6 139
 
3.5%
5 126
 
3.2%
Other values (7) 10
 
0.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3907
99.7%
Lowercase Letter 10
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1063
27.2%
2 725
18.6%
9 660
16.9%
1 519
13.3%
3 194
 
5.0%
8 177
 
4.5%
4 159
 
4.1%
7 145
 
3.7%
6 139
 
3.6%
5 126
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
a 2
20.0%
d 2
20.0%
i 2
20.0%
u 1
10.0%
t 1
10.0%
m 1
10.0%
n 1
10.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3907
99.7%
Latin 10
 
0.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1063
27.2%
2 725
18.6%
9 660
16.9%
1 519
13.3%
3 194
 
5.0%
8 177
 
4.5%
4 159
 
4.1%
7 145
 
3.7%
6 139
 
3.6%
5 126
 
3.2%
Latin
ValueCountFrequency (%)
a 2
20.0%
d 2
20.0%
i 2
20.0%
u 1
10.0%
t 1
10.0%
m 1
10.0%
n 1
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3917
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1063
27.1%
2 725
18.5%
9 660
16.8%
1 519
13.2%
3 194
 
5.0%
8 177
 
4.5%
4 159
 
4.1%
7 145
 
3.7%
6 139
 
3.5%
5 126
 
3.2%
Other values (7) 10
 
0.3%

사업지역
Categorical

Distinct34
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
금호워터폴리스산업단지 개발사업
44 
수성의료지구개발사업
43 
안심뉴타운조성사업
42 
국가과학산업단지
42 
장기미집행공원 조성사업
 
27
Other values (29)
292 

Length

Max length27
Median length21.5
Mean length13.471429
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구선폐선부지개발사업(동촌지구)
2nd row국가과학산업단지
3rd row대구선폐선부지개발사업(각산지구)
4th row국가과학산업단지
5th row수성의료지구개발사업

Common Values

ValueCountFrequency (%)
금호워터폴리스산업단지 개발사업 44
 
9.0%
수성의료지구개발사업 43
 
8.8%
안심뉴타운조성사업 42
 
8.6%
국가과학산업단지 42
 
8.6%
장기미집행공원 조성사업 27
 
5.5%
대구대공원개발사업 26
 
5.3%
복현지구주거환경개선사업 24
 
4.9%
엑스코 제2전시장 건립(대구전시컨벤션센터) 24
 
4.9%
대구선폐선부지개발사업(동촌지구) 16
 
3.3%
대구선폐선부지개발사업(각산지구) 16
 
3.3%
Other values (24) 186
38.0%

Length

2023-12-12T10:24:32.050871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
개발사업 59
 
7.7%
금호워터폴리스산업단지 44
 
5.7%
수성의료지구개발사업 43
 
5.6%
국가과학산업단지 42
 
5.5%
안심뉴타운조성사업 42
 
5.5%
조성사업 37
 
4.8%
택지개발사업 27
 
3.5%
장기미집행공원 27
 
3.5%
대구대공원개발사업 26
 
3.4%
복현지구주거환경개선사업 24
 
3.1%
Other values (42) 395
51.6%
Distinct69
Distinct (%)14.1%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-12T10:24:32.274519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.7673469
Min length4

Characters and Unicode

Total characters3806
Distinct characters19
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)3.9%

Sample

1st row19910091
2nd row20160963
3rd row20040171
4th row20040171
5th row20040171
ValueCountFrequency (%)
20160238 34
 
6.9%
20189092 29
 
5.9%
20189093 29
 
5.9%
자료이관 28
 
5.7%
20170257 27
 
5.5%
20210320 24
 
4.9%
20150237 22
 
4.5%
20200308 20
 
4.1%
20160248 20
 
4.1%
20040171 17
 
3.5%
Other values (59) 240
49.0%
2023-12-12T10:24:32.716094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 988
26.0%
2 702
18.4%
9 647
17.0%
1 473
12.4%
3 198
 
5.2%
8 175
 
4.6%
7 141
 
3.7%
4 133
 
3.5%
6 120
 
3.2%
5 111
 
2.9%
Other values (9) 118
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3688
96.9%
Other Letter 112
 
2.9%
Uppercase Letter 6
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 988
26.8%
2 702
19.0%
9 647
17.5%
1 473
12.8%
3 198
 
5.4%
8 175
 
4.7%
7 141
 
3.8%
4 133
 
3.6%
6 120
 
3.3%
5 111
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
S 2
33.3%
Y 1
16.7%
T 1
16.7%
E 1
16.7%
M 1
16.7%
Other Letter
ValueCountFrequency (%)
28
25.0%
28
25.0%
28
25.0%
28
25.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3688
96.9%
Hangul 112
 
2.9%
Latin 6
 
0.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 988
26.8%
2 702
19.0%
9 647
17.5%
1 473
12.8%
3 198
 
5.4%
8 175
 
4.7%
7 141
 
3.8%
4 133
 
3.6%
6 120
 
3.3%
5 111
 
3.0%
Latin
ValueCountFrequency (%)
S 2
33.3%
Y 1
16.7%
T 1
16.7%
E 1
16.7%
M 1
16.7%
Hangul
ValueCountFrequency (%)
28
25.0%
28
25.0%
28
25.0%
28
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3694
97.1%
Hangul 112
 
2.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 988
26.7%
2 702
19.0%
9 647
17.5%
1 473
12.8%
3 198
 
5.4%
8 175
 
4.7%
7 141
 
3.8%
4 133
 
3.6%
6 120
 
3.2%
5 111
 
3.0%
Other values (5) 6
 
0.2%
Hangul
ValueCountFrequency (%)
28
25.0%
28
25.0%
28
25.0%
28
25.0%
Distinct254
Distinct (%)51.8%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
Minimum2016-09-02 19:46:45
Maximum2023-08-21 13:59:12
2023-12-12T10:24:32.869522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:24:33.009557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct69
Distinct (%)14.1%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
2023-12-12T10:24:33.267073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.7755102
Min length4

Characters and Unicode

Total characters3810
Distinct characters19
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)3.9%

Sample

1st row19910091
2nd row20160963
3rd row20040171
4th row20040171
5th row20040171
ValueCountFrequency (%)
20160238 34
 
6.9%
20189092 29
 
5.9%
20189093 29
 
5.9%
20170257 27
 
5.5%
자료이관 27
 
5.5%
20210320 24
 
4.9%
20150237 22
 
4.5%
20200308 20
 
4.1%
20160248 20
 
4.1%
20040171 17
 
3.5%
Other values (59) 241
49.2%
2023-12-12T10:24:33.684659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 991
26.0%
2 702
18.4%
9 649
17.0%
1 474
12.4%
3 198
 
5.2%
8 175
 
4.6%
7 142
 
3.7%
4 133
 
3.5%
6 121
 
3.2%
5 111
 
2.9%
Other values (9) 114
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3696
97.0%
Other Letter 108
 
2.8%
Uppercase Letter 6
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 991
26.8%
2 702
19.0%
9 649
17.6%
1 474
12.8%
3 198
 
5.4%
8 175
 
4.7%
7 142
 
3.8%
4 133
 
3.6%
6 121
 
3.3%
5 111
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
S 2
33.3%
Y 1
16.7%
T 1
16.7%
E 1
16.7%
M 1
16.7%
Other Letter
ValueCountFrequency (%)
27
25.0%
27
25.0%
27
25.0%
27
25.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3696
97.0%
Hangul 108
 
2.8%
Latin 6
 
0.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 991
26.8%
2 702
19.0%
9 649
17.6%
1 474
12.8%
3 198
 
5.4%
8 175
 
4.7%
7 142
 
3.8%
4 133
 
3.6%
6 121
 
3.3%
5 111
 
3.0%
Latin
ValueCountFrequency (%)
S 2
33.3%
Y 1
16.7%
T 1
16.7%
E 1
16.7%
M 1
16.7%
Hangul
ValueCountFrequency (%)
27
25.0%
27
25.0%
27
25.0%
27
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3702
97.2%
Hangul 108
 
2.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 991
26.8%
2 702
19.0%
9 649
17.5%
1 474
12.8%
3 198
 
5.3%
8 175
 
4.7%
7 142
 
3.8%
4 133
 
3.6%
6 121
 
3.3%
5 111
 
3.0%
Other values (5) 6
 
0.2%
Hangul
ValueCountFrequency (%)
27
25.0%
27
25.0%
27
25.0%
27
25.0%
Distinct237
Distinct (%)48.4%
Missing0
Missing (%)0.0%
Memory size4.0 KiB
Minimum2016-09-02 19:46:45
Maximum2023-08-21 13:59:12
2023-12-12T10:24:33.841599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:24:34.026057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

정렬순서
Real number (ℝ)

ZEROS 

Distinct27
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.4387755
Minimum0
Maximum26
Zeros27
Zeros (%)5.5%
Negative0
Negative (%)0.0%
Memory size4.4 KiB
2023-12-12T10:24:34.158812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median4
Q39.75
95-th percentile19
Maximum26
Range26
Interquartile range (IQR)7.75

Descriptive statistics

Standard deviation6.2344552
Coefficient of variation (CV)0.96826722
Kurtosis0.51410315
Mean6.4387755
Median Absolute Deviation (MAD)3
Skewness1.1985052
Sum3155
Variance38.868432
MonotonicityNot monotonic
2023-12-12T10:24:34.285486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
2 99
20.2%
1 68
13.9%
3 37
 
7.6%
4 32
 
6.5%
0 27
 
5.5%
5 27
 
5.5%
6 22
 
4.5%
7 19
 
3.9%
9 18
 
3.7%
8 18
 
3.7%
Other values (17) 123
25.1%
ValueCountFrequency (%)
0 27
 
5.5%
1 68
13.9%
2 99
20.2%
3 37
 
7.6%
4 32
 
6.5%
5 27
 
5.5%
6 22
 
4.5%
7 19
 
3.9%
8 18
 
3.7%
9 18
 
3.7%
ValueCountFrequency (%)
26 2
 
0.4%
25 3
 
0.6%
24 3
 
0.6%
23 3
 
0.6%
22 4
 
0.8%
21 4
 
0.8%
20 5
 
1.0%
19 6
 
1.2%
18 15
3.1%
17 7
1.4%

Interactions

2023-12-12T10:24:31.041357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:24:34.370378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사번사업지역등록자번호수정자번호정렬순서
사번1.0000.0001.0001.0000.000
사업지역0.0001.0000.0000.0000.613
등록자번호1.0000.0001.0001.0000.000
수정자번호1.0000.0001.0001.0000.000
정렬순서0.0000.6130.0000.0001.000
2023-12-12T10:24:34.467498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정렬순서사업지역
정렬순서1.0000.245
사업지역0.2451.000

Missing values

2023-12-12T10:24:31.184315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:24:31.302422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사번사업지역등록자번호등록일시수정자번호수정일시정렬순서
019910091대구선폐선부지개발사업(동촌지구)199100912016-09-12 13:08:23199100912016-09-12 13:08:233
120169063국가과학산업단지201609632016-09-13 10:50:07201609632016-09-13 10:50:071
220040171대구선폐선부지개발사업(각산지구)200401712016-09-26 14:07:21200401712016-09-26 14:07:211
320040171국가과학산업단지200401712016-09-26 14:07:21200401712016-09-26 14:07:212
420040171수성의료지구개발사업200401712016-09-26 14:07:21200401712016-09-26 14:07:213
520040171대구선폐선부지개발사업(동촌지구)200401712016-09-26 14:07:21200401712016-09-26 14:07:214
620040171대구출판산업단지200401712016-09-26 14:07:21200401712016-09-26 14:07:215
720169064대구선폐선부지개발사업(동촌지구)201699932016-10-12 17:28:00201699932016-10-12 17:28:009
820169064대구선폐선부지개발사업(각산지구)201699932016-10-12 17:28:00201699932016-10-12 17:28:0010
920040171신암2-2지구 주거환경개선사업200401712016-11-21 15:07:21200401712016-11-21 15:07:217
사번사업지역등록자번호등록일시수정자번호수정일시정렬순서
48019930135장기미집행공원 조성사업199301352021-12-13 09:40:35199301352021-12-13 09:40:353
48120210320달성2차산업단지 도로(소로2-구27호선)공사202103202022-02-03 14:34:39202103202022-10-25 09:08:1815
48220210320달성2차산업단지하수도(낙동강방류)설치공사202103202022-02-03 14:34:39202103202022-10-25 09:08:1816
48399999999안심뉴타운조성사업999999992022-02-09 09:07:16999999992022-02-09 09:07:161
48499999999수성의료지구개발사업999999992022-02-09 09:07:16999999992022-02-09 09:07:162
48599999999국가과학산업단지999999992022-02-09 09:07:16999999992022-02-09 09:07:163
48699999999죽곡2지구 택지개발사업999999992022-02-09 09:07:16999999992022-02-09 09:07:164
48799999999금호워터폴리스산업단지 개발사업999999992022-02-09 09:07:16999999992022-02-09 09:07:165
48899999999엑스코 제2전시장 건립(대구전시컨벤션센터)999999992022-02-09 09:07:16999999992022-02-09 09:07:166
48999999999장기미집행공원 조성사업999999992022-02-09 09:07:16999999992022-02-09 09:07:167