Overview

Dataset statistics

Number of variables6
Number of observations309
Missing cells309
Missing cells (%)16.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.9 KiB
Average record size in memory49.4 B

Variable types

Text3
Numeric1
DateTime1
Categorical1

Dataset

Description대구광역시 달서구 기계설비 성능점검 대상 건축물(일반건축물) 현황 자료입니다. 건물명, 주소, 연면적을 포함하고 있습니다.
Author대구광역시 달서구
URLhttps://www.data.go.kr/data/15112162/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
세대수 is highly overall correlated with 비고High correlation
비고 is highly overall correlated with 세대수High correlation
연면적 has 146 (47.2%) missing valuesMissing
세대수 has 163 (52.8%) missing valuesMissing

Reproduction

Analysis started2024-04-21 02:32:07.479957
Analysis finished2024-04-21 02:32:09.931116
Duration2.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct308
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2024-04-21T11:32:10.636217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length18
Mean length8.2071197
Min length2

Characters and Unicode

Total characters2536
Distinct characters321
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique307 ?
Unique (%)99.4%

Sample

1st row계명대학교
2nd row계명대학교 동산의료원
3rd row진천역 계룡리슈빌 상가
4th row삼정브리티시용산 상가
5th row빌리브 스카이 주상복합 상가
ValueCountFrequency (%)
상가 16
 
3.6%
오피스텔 8
 
1.8%
7
 
1.6%
진천역 6
 
1.3%
월배 6
 
1.3%
월성 6
 
1.3%
죽전역 6
 
1.3%
아파트 5
 
1.1%
빌리브 5
 
1.1%
월성동 4
 
0.9%
Other values (344) 381
84.7%
2024-04-21T11:32:11.965344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
143
 
5.6%
96
 
3.8%
66
 
2.6%
57
 
2.2%
55
 
2.2%
55
 
2.2%
51
 
2.0%
49
 
1.9%
48
 
1.9%
46
 
1.8%
Other values (311) 1870
73.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2280
89.9%
Space Separator 143
 
5.6%
Decimal Number 49
 
1.9%
Uppercase Letter 26
 
1.0%
Other Punctuation 8
 
0.3%
Lowercase Letter 7
 
0.3%
Dash Punctuation 7
 
0.3%
Open Punctuation 7
 
0.3%
Close Punctuation 7
 
0.3%
Other Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
96
 
4.2%
66
 
2.9%
57
 
2.5%
55
 
2.4%
55
 
2.4%
51
 
2.2%
49
 
2.1%
48
 
2.1%
46
 
2.0%
46
 
2.0%
Other values (282) 1711
75.0%
Uppercase Letter
ValueCountFrequency (%)
S 5
19.2%
C 3
11.5%
G 3
11.5%
A 3
11.5%
K 2
 
7.7%
O 2
 
7.7%
R 2
 
7.7%
M 2
 
7.7%
E 1
 
3.8%
P 1
 
3.8%
Other values (2) 2
 
7.7%
Decimal Number
ValueCountFrequency (%)
2 20
40.8%
1 16
32.7%
3 5
 
10.2%
6 2
 
4.1%
5 2
 
4.1%
4 2
 
4.1%
8 1
 
2.0%
7 1
 
2.0%
Other Punctuation
ValueCountFrequency (%)
, 6
75.0%
· 1
 
12.5%
& 1
 
12.5%
Space Separator
ValueCountFrequency (%)
143
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2282
90.0%
Common 221
 
8.7%
Latin 33
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
96
 
4.2%
66
 
2.9%
57
 
2.5%
55
 
2.4%
55
 
2.4%
51
 
2.2%
49
 
2.1%
48
 
2.1%
46
 
2.0%
46
 
2.0%
Other values (283) 1713
75.1%
Common
ValueCountFrequency (%)
143
64.7%
2 20
 
9.0%
1 16
 
7.2%
- 7
 
3.2%
( 7
 
3.2%
) 7
 
3.2%
, 6
 
2.7%
3 5
 
2.3%
6 2
 
0.9%
5 2
 
0.9%
Other values (5) 6
 
2.7%
Latin
ValueCountFrequency (%)
e 7
21.2%
S 5
15.2%
C 3
9.1%
G 3
9.1%
A 3
9.1%
K 2
 
6.1%
O 2
 
6.1%
R 2
 
6.1%
M 2
 
6.1%
E 1
 
3.0%
Other values (3) 3
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2280
89.9%
ASCII 253
 
10.0%
None 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
143
56.5%
2 20
 
7.9%
1 16
 
6.3%
e 7
 
2.8%
- 7
 
2.8%
( 7
 
2.8%
) 7
 
2.8%
, 6
 
2.4%
S 5
 
2.0%
3 5
 
2.0%
Other values (17) 30
 
11.9%
Hangul
ValueCountFrequency (%)
96
 
4.2%
66
 
2.9%
57
 
2.5%
55
 
2.4%
55
 
2.4%
51
 
2.2%
49
 
2.1%
48
 
2.1%
46
 
2.0%
46
 
2.0%
Other values (282) 1711
75.0%
None
ValueCountFrequency (%)
2
66.7%
· 1
33.3%

주소
Text

Distinct307
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
2024-04-21T11:32:13.062228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length37
Mean length28.919094
Min length19

Characters and Unicode

Total characters8936
Distinct characters221
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique305 ?
Unique (%)98.7%

Sample

1st row대구광역시 달서구 달구벌대로 1095 (신당동)
2nd row대구광역시 달서구 달구벌대로 1035 (신당동)
3rd row대구광역시 달서구 진천로 77 (진천동)
4th row대구광역시 달서구 달구벌대로 1530 (감삼동)
5th row대구광역시 달서구 와룡로 186 (감삼동)
ValueCountFrequency (%)
대구광역시 309
 
17.7%
달서구 309
 
17.7%
월성동 38
 
2.2%
용산동 36
 
2.1%
상인동 25
 
1.4%
이곡동 23
 
1.3%
도원동 17
 
1.0%
달구벌대로 17
 
1.0%
호산동 17
 
1.0%
감삼동 16
 
0.9%
Other values (485) 935
53.7%
2024-04-21T11:32:14.638479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1433
 
16.0%
657
 
7.4%
427
 
4.8%
398
 
4.5%
349
 
3.9%
335
 
3.7%
320
 
3.6%
312
 
3.5%
310
 
3.5%
( 305
 
3.4%
Other values (211) 4090
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5739
64.2%
Space Separator 1433
 
16.0%
Decimal Number 972
 
10.9%
Open Punctuation 305
 
3.4%
Close Punctuation 305
 
3.4%
Other Punctuation 152
 
1.7%
Dash Punctuation 16
 
0.2%
Lowercase Letter 8
 
0.1%
Uppercase Letter 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
657
 
11.4%
427
 
7.4%
398
 
6.9%
349
 
6.1%
335
 
5.8%
320
 
5.6%
312
 
5.4%
310
 
5.4%
305
 
5.3%
178
 
3.1%
Other values (190) 2148
37.4%
Decimal Number
ValueCountFrequency (%)
1 211
21.7%
2 141
14.5%
3 131
13.5%
5 95
9.8%
0 83
 
8.5%
4 72
 
7.4%
7 71
 
7.3%
6 71
 
7.3%
9 55
 
5.7%
8 42
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
C 2
33.3%
K 2
33.3%
A 1
16.7%
L 1
16.7%
Lowercase Letter
ValueCountFrequency (%)
e 7
87.5%
h 1
 
12.5%
Space Separator
ValueCountFrequency (%)
1433
100.0%
Open Punctuation
ValueCountFrequency (%)
( 305
100.0%
Close Punctuation
ValueCountFrequency (%)
) 305
100.0%
Other Punctuation
ValueCountFrequency (%)
, 152
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5739
64.2%
Common 3183
35.6%
Latin 14
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
657
 
11.4%
427
 
7.4%
398
 
6.9%
349
 
6.1%
335
 
5.8%
320
 
5.6%
312
 
5.4%
310
 
5.4%
305
 
5.3%
178
 
3.1%
Other values (190) 2148
37.4%
Common
ValueCountFrequency (%)
1433
45.0%
( 305
 
9.6%
) 305
 
9.6%
1 211
 
6.6%
, 152
 
4.8%
2 141
 
4.4%
3 131
 
4.1%
5 95
 
3.0%
0 83
 
2.6%
4 72
 
2.3%
Other values (5) 255
 
8.0%
Latin
ValueCountFrequency (%)
e 7
50.0%
C 2
 
14.3%
K 2
 
14.3%
A 1
 
7.1%
L 1
 
7.1%
h 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5739
64.2%
ASCII 3197
35.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1433
44.8%
( 305
 
9.5%
) 305
 
9.5%
1 211
 
6.6%
, 152
 
4.8%
2 141
 
4.4%
3 131
 
4.1%
5 95
 
3.0%
0 83
 
2.6%
4 72
 
2.3%
Other values (11) 269
 
8.4%
Hangul
ValueCountFrequency (%)
657
 
11.4%
427
 
7.4%
398
 
6.9%
349
 
6.1%
335
 
5.8%
320
 
5.6%
312
 
5.4%
310
 
5.4%
305
 
5.3%
178
 
3.1%
Other values (190) 2148
37.4%

연면적
Text

MISSING 

Distinct163
Distinct (%)100.0%
Missing146
Missing (%)47.2%
Memory size2.5 KiB
2024-04-21T11:32:15.797410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length8
Mean length8.2453988
Min length5

Characters and Unicode

Total characters1344
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique163 ?
Unique (%)100.0%

Sample

1st row383853.76
2nd row179,152.96
3rd row174115.4956
4th row125045.3168
5th row121,755.02
ValueCountFrequency (%)
39723.65 1
 
0.6%
12881.86 1
 
0.6%
13797 1
 
0.6%
14270.18 1
 
0.6%
14217.24 1
 
0.6%
14049.25 1
 
0.6%
13991.95 1
 
0.6%
13915.14 1
 
0.6%
13908.758 1
 
0.6%
13821.94 1
 
0.6%
Other values (153) 153
93.9%
2024-04-21T11:32:17.162696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 212
15.8%
. 161
12.0%
2 136
10.1%
3 130
9.7%
9 111
8.3%
5 107
8.0%
7 103
7.7%
4 102
7.6%
6 99
7.4%
8 97
7.2%
Other values (2) 86
6.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1180
87.8%
Other Punctuation 164
 
12.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 212
18.0%
2 136
11.5%
3 130
11.0%
9 111
9.4%
5 107
9.1%
7 103
8.7%
4 102
8.6%
6 99
8.4%
8 97
8.2%
0 83
 
7.0%
Other Punctuation
ValueCountFrequency (%)
. 161
98.2%
, 3
 
1.8%

Most occurring scripts

ValueCountFrequency (%)
Common 1344
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 212
15.8%
. 161
12.0%
2 136
10.1%
3 130
9.7%
9 111
8.3%
5 107
8.0%
7 103
7.7%
4 102
7.6%
6 99
7.4%
8 97
7.2%
Other values (2) 86
6.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1344
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 212
15.8%
. 161
12.0%
2 136
10.1%
3 130
9.7%
9 111
8.3%
5 107
8.0%
7 103
7.7%
4 102
7.6%
6 99
7.4%
8 97
7.2%
Other values (2) 86
6.4%

세대수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct137
Distinct (%)93.8%
Missing163
Missing (%)52.8%
Infinite0
Infinite (%)0.0%
Mean896.28082
Minimum300
Maximum2827
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.8 KiB
2024-04-21T11:32:17.404699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum300
5-th percentile377.5
Q1557.25
median787
Q31087.75
95-th percentile1836
Maximum2827
Range2527
Interquartile range (IQR)530.5

Descriptive statistics

Standard deviation472.78809
Coefficient of variation (CV)0.52749995
Kurtosis2.3461841
Mean896.28082
Median Absolute Deviation (MAD)245
Skewness1.4614516
Sum130857
Variance223528.58
MonotonicityDecreasing
2024-04-21T11:32:17.653155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
570 2
 
0.6%
672 2
 
0.6%
1234 2
 
0.6%
504 2
 
0.6%
810 2
 
0.6%
823 2
 
0.6%
794 2
 
0.6%
760 2
 
0.6%
767 2
 
0.6%
574 1
 
0.3%
Other values (127) 127
41.1%
(Missing) 163
52.8%
ValueCountFrequency (%)
300 1
0.3%
303 1
0.3%
310 1
0.3%
345 1
0.3%
352 1
0.3%
356 1
0.3%
365 1
0.3%
375 1
0.3%
385 1
0.3%
390 1
0.3%
ValueCountFrequency (%)
2827 1
0.3%
2420 1
0.3%
2364 1
0.3%
2160 1
0.3%
2134 1
0.3%
1999 1
0.3%
1844 1
0.3%
1840 1
0.3%
1824 1
0.3%
1740 1
0.3%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
Minimum2024-01-29 00:00:00
Maximum2024-01-29 00:00:00
2024-04-21T11:32:17.860728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:32:18.024082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

비고
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
<NA>
163 
공동주택
146 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 163
52.8%
공동주택 146
47.2%

Length

2024-04-21T11:32:18.215264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T11:32:18.383718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 163
52.8%
공동주택 146
47.2%

Interactions

2024-04-21T11:32:08.828375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T11:32:18.491013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세대수
세대수1.000
2024-04-21T11:32:18.617672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세대수비고
세대수1.0001.000
비고1.0001.000

Missing values

2024-04-21T11:32:09.144543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T11:32:09.499231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-21T11:32:09.790684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

건물명주소연면적세대수데이터기준일자비고
0계명대학교대구광역시 달서구 달구벌대로 1095 (신당동)383853.76<NA>2024-01-29<NA>
1계명대학교 동산의료원대구광역시 달서구 달구벌대로 1035 (신당동)179,152.96<NA>2024-01-29<NA>
2진천역 계룡리슈빌 상가대구광역시 달서구 진천로 77 (진천동)174115.4956<NA>2024-01-29<NA>
3삼정브리티시용산 상가대구광역시 달서구 달구벌대로 1530 (감삼동)125045.3168<NA>2024-01-29<NA>
4빌리브 스카이 주상복합 상가대구광역시 달서구 와룡로 186 (감삼동)121,755.02<NA>2024-01-29<NA>
5상인 대성스카이렉스 상가대구광역시 달서구 월배로 183 (상인동)119669.638<NA>2024-01-29<NA>
6진천역 라온프라이빗 센텀 상가대구광역시 달서구 진천로16길 26 (진천동)112885.952<NA>2024-01-29<NA>
7힐스테이트 죽전역 더 퍼스트 오피스텔대구광역시 달서구 와룡로31길 26 (감삼동, 힐스테이트 죽전역 더 퍼스트)105509.3909<NA>2024-01-29<NA>
8진천역 대성스카이렉스 상가대구광역시 달서구 월배로 80 (진천동)92772.613<NA>2024-01-29<NA>
9계명문화대학교대구광역시 달서구 달서대로 675 (신당동, 계명문화대학교)83775.189<NA>2024-01-29<NA>
건물명주소연면적세대수데이터기준일자비고
299용산2차우방타운대구광역시 달서구 용산서로 38 (용산동, 용산2차우방타운)<NA>3902024-01-29공동주택
300월성화성타운대구광역시 달서구 학산로2길 27 (월성동, 월성화성타운)<NA>3852024-01-29공동주택
301월성영남맨션대구광역시 달서구 학산로2길 39 (월성동, 월성영남맨션)<NA>3752024-01-29공동주택
302성서2차화성타운대구광역시 달서구 성서동로 355-20 (이곡동, 성서2차화성타운)<NA>3652024-01-29공동주택
303장기누림타운대구광역시 달서구 한들로 31 (장기동, 장기누림타운)<NA>3562024-01-29공동주택
304용산영남타운대구광역시 달서구 용산로 266 (용산동, 용산영남타운)<NA>3522024-01-29공동주택
305용산보성타운대구광역시 달서구 용산서로 37 (용산동, 용산보성타운)<NA>3452024-01-29공동주택
306대곡역금강 1단지대구광역시 달서구 비슬로540길 38 (대곡동, 금강1차아파트)<NA>3102024-01-29공동주택
307월성코오롱하늘채2단지대구광역시 달서구 조암남로16길 20 (월성동, 월성코오롱하늘채2단지)<NA>3032024-01-29공동주택
308월성동서타운대구광역시 달서구 학산로7길 5 (월성동, 월성동서타운)<NA>3002024-01-29공동주택