Overview

Dataset statistics

Number of variables6
Number of observations590
Missing cells590
Missing cells (%)16.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory29.5 KiB
Average record size in memory51.2 B

Variable types

Numeric3
Text2
DateTime1

Dataset

Description서울특별시 강남구 기계설비성능점검 대상 건축물 입니다. 기타 사항은 서울특별시 강남구 건축과(02-3423-6178)로 문의주시면 자세히 안내해 드리도록 하겠습니다.
Author서울특별시 강남구
URLhttps://www.data.go.kr/data/15111865/fileData.do

Alerts

데이터기준일 has constant value ""Constant
연번 is highly overall correlated with 세대수(공동주택)High correlation
세대수(공동주택) is highly overall correlated with 연번High correlation
연면적(건축물) has 107 (18.1%) missing valuesMissing
세대수(공동주택) has 483 (81.9%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:41:53.453725
Analysis finished2023-12-12 05:41:54.938182
Duration1.48 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct590
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean295.5
Minimum1
Maximum590
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2023-12-12T14:41:55.026062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile30.45
Q1148.25
median295.5
Q3442.75
95-th percentile560.55
Maximum590
Range589
Interquartile range (IQR)294.5

Descriptive statistics

Standard deviation170.46261
Coefficient of variation (CV)0.57686161
Kurtosis-1.2
Mean295.5
Median Absolute Deviation (MAD)147.5
Skewness0
Sum174345
Variance29057.5
MonotonicityStrictly increasing
2023-12-12T14:41:55.213809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
407 1
 
0.2%
391 1
 
0.2%
392 1
 
0.2%
393 1
 
0.2%
394 1
 
0.2%
395 1
 
0.2%
396 1
 
0.2%
397 1
 
0.2%
398 1
 
0.2%
Other values (580) 580
98.3%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
590 1
0.2%
589 1
0.2%
588 1
0.2%
587 1
0.2%
586 1
0.2%
585 1
0.2%
584 1
0.2%
583 1
0.2%
582 1
0.2%
581 1
0.2%
Distinct588
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
2023-12-12T14:41:55.507791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length25
Mean length7.4457627
Min length2

Characters and Unicode

Total characters4393
Distinct characters434
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique586 ?
Unique (%)99.3%

Sample

1st row현대고등학교
2nd row단국고등학교
3rd row수서중학교
4th row경기여자고등학교
5th row역삼중학교
ValueCountFrequency (%)
강남 11
 
1.4%
오피스텔 8
 
1.0%
tower 7
 
0.9%
호텔 7
 
0.9%
강남역 6
 
0.8%
빌딩 6
 
0.8%
역삼 5
 
0.6%
타워 5
 
0.6%
대치 3
 
0.4%
3
 
0.4%
Other values (684) 714
92.1%
2023-12-12T14:41:55.946693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
189
 
4.3%
133
 
3.0%
121
 
2.8%
106
 
2.4%
80
 
1.8%
78
 
1.8%
74
 
1.7%
71
 
1.6%
68
 
1.5%
63
 
1.4%
Other values (424) 3410
77.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3539
80.6%
Uppercase Letter 322
 
7.3%
Space Separator 189
 
4.3%
Decimal Number 129
 
2.9%
Lowercase Letter 96
 
2.2%
Close Punctuation 50
 
1.1%
Open Punctuation 50
 
1.1%
Dash Punctuation 8
 
0.2%
Other Punctuation 6
 
0.1%
Letter Number 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
133
 
3.8%
121
 
3.4%
106
 
3.0%
80
 
2.3%
78
 
2.2%
74
 
2.1%
71
 
2.0%
68
 
1.9%
63
 
1.8%
59
 
1.7%
Other values (358) 2686
75.9%
Uppercase Letter
ValueCountFrequency (%)
E 30
 
9.3%
T 29
 
9.0%
L 24
 
7.5%
A 23
 
7.1%
S 20
 
6.2%
O 19
 
5.9%
H 19
 
5.9%
I 18
 
5.6%
K 17
 
5.3%
G 16
 
5.0%
Other values (16) 107
33.2%
Lowercase Letter
ValueCountFrequency (%)
e 15
15.6%
o 12
12.5%
r 9
 
9.4%
w 7
 
7.3%
a 6
 
6.2%
n 6
 
6.2%
s 5
 
5.2%
t 5
 
5.2%
l 5
 
5.2%
k 3
 
3.1%
Other values (11) 23
24.0%
Decimal Number
ValueCountFrequency (%)
1 40
31.0%
2 25
19.4%
3 14
 
10.9%
8 9
 
7.0%
4 9
 
7.0%
6 8
 
6.2%
5 7
 
5.4%
7 6
 
4.7%
9 6
 
4.7%
0 5
 
3.9%
Other Punctuation
ValueCountFrequency (%)
, 5
83.3%
? 1
 
16.7%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
189
100.0%
Close Punctuation
ValueCountFrequency (%)
) 50
100.0%
Open Punctuation
ValueCountFrequency (%)
( 50
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3539
80.6%
Common 432
 
9.8%
Latin 421
 
9.6%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
133
 
3.8%
121
 
3.4%
106
 
3.0%
80
 
2.3%
78
 
2.2%
74
 
2.1%
71
 
2.0%
68
 
1.9%
63
 
1.8%
59
 
1.7%
Other values (358) 2686
75.9%
Latin
ValueCountFrequency (%)
E 30
 
7.1%
T 29
 
6.9%
L 24
 
5.7%
A 23
 
5.5%
S 20
 
4.8%
O 19
 
4.5%
H 19
 
4.5%
I 18
 
4.3%
K 17
 
4.0%
G 16
 
3.8%
Other values (39) 206
48.9%
Common
ValueCountFrequency (%)
189
43.8%
) 50
 
11.6%
( 50
 
11.6%
1 40
 
9.3%
2 25
 
5.8%
3 14
 
3.2%
8 9
 
2.1%
4 9
 
2.1%
6 8
 
1.9%
- 8
 
1.9%
Other values (6) 30
 
6.9%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3538
80.5%
ASCII 850
 
19.3%
Number Forms 3
 
0.1%
CJK 1
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
189
22.2%
) 50
 
5.9%
( 50
 
5.9%
1 40
 
4.7%
E 30
 
3.5%
T 29
 
3.4%
2 25
 
2.9%
L 24
 
2.8%
A 23
 
2.7%
S 20
 
2.4%
Other values (53) 370
43.5%
Hangul
ValueCountFrequency (%)
133
 
3.8%
121
 
3.4%
106
 
3.0%
80
 
2.3%
78
 
2.2%
74
 
2.1%
71
 
2.0%
68
 
1.9%
63
 
1.8%
59
 
1.7%
Other values (357) 2685
75.9%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%
CJK
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
1
100.0%
Distinct587
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
2023-12-12T14:41:56.235719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length33
Mean length23.267797
Min length15

Characters and Unicode

Total characters13728
Distinct characters84
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique584 ?
Unique (%)99.0%

Sample

1st row서울특별시 강남구 압구정로 127 (압구정동)
2nd row서울특별시 강남구 도곡로64길 21 (대치동)
3rd row서울특별시 강남구 광평로59길 57 (수서동)
4th row서울특별시 강남구 삼성로 29 (개포동)
5th row서울특별시 강남구 도곡로43길 10 (역삼동)
ValueCountFrequency (%)
서울특별시 591
20.7%
강남구 590
20.7%
역삼동 145
 
5.1%
테헤란로 98
 
3.4%
대치동 80
 
2.8%
삼성동 65
 
2.3%
논현동 48
 
1.7%
언주로 37
 
1.3%
강남대로 35
 
1.2%
영동대로 29
 
1.0%
Other values (476) 1135
39.8%
2023-12-12T14:41:56.747629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2264
 
16.5%
649
 
4.7%
634
 
4.6%
622
 
4.5%
604
 
4.4%
591
 
4.3%
591
 
4.3%
591
 
4.3%
591
 
4.3%
591
 
4.3%
Other values (74) 6000
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8501
61.9%
Space Separator 2264
 
16.5%
Decimal Number 1967
 
14.3%
Open Punctuation 486
 
3.5%
Close Punctuation 486
 
3.5%
Dash Punctuation 10
 
0.1%
Uppercase Letter 9
 
0.1%
Other Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
649
 
7.6%
634
 
7.5%
622
 
7.3%
604
 
7.1%
591
 
7.0%
591
 
7.0%
591
 
7.0%
591
 
7.0%
591
 
7.0%
538
 
6.3%
Other values (52) 2499
29.4%
Decimal Number
ValueCountFrequency (%)
1 372
18.9%
2 284
14.4%
3 226
11.5%
4 214
10.9%
5 191
9.7%
0 188
9.6%
6 168
8.5%
7 118
 
6.0%
8 117
 
5.9%
9 89
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
T 2
22.2%
K 2
22.2%
D 1
11.1%
C 1
11.1%
I 1
11.1%
B 1
11.1%
A 1
11.1%
Space Separator
ValueCountFrequency (%)
2264
100.0%
Open Punctuation
ValueCountFrequency (%)
( 486
100.0%
Close Punctuation
ValueCountFrequency (%)
) 486
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8501
61.9%
Common 5218
38.0%
Latin 9
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
649
 
7.6%
634
 
7.5%
622
 
7.3%
604
 
7.1%
591
 
7.0%
591
 
7.0%
591
 
7.0%
591
 
7.0%
591
 
7.0%
538
 
6.3%
Other values (52) 2499
29.4%
Common
ValueCountFrequency (%)
2264
43.4%
( 486
 
9.3%
) 486
 
9.3%
1 372
 
7.1%
2 284
 
5.4%
3 226
 
4.3%
4 214
 
4.1%
5 191
 
3.7%
0 188
 
3.6%
6 168
 
3.2%
Other values (5) 339
 
6.5%
Latin
ValueCountFrequency (%)
T 2
22.2%
K 2
22.2%
D 1
11.1%
C 1
11.1%
I 1
11.1%
B 1
11.1%
A 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8501
61.9%
ASCII 5227
38.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2264
43.3%
( 486
 
9.3%
) 486
 
9.3%
1 372
 
7.1%
2 284
 
5.4%
3 226
 
4.3%
4 214
 
4.1%
5 191
 
3.7%
0 188
 
3.6%
6 168
 
3.2%
Other values (12) 348
 
6.7%
Hangul
ValueCountFrequency (%)
649
 
7.6%
634
 
7.5%
622
 
7.3%
604
 
7.1%
591
 
7.0%
591
 
7.0%
591
 
7.0%
591
 
7.0%
591
 
7.0%
538
 
6.3%
Other values (52) 2499
29.4%

연면적(건축물)
Real number (ℝ)

MISSING 

Distinct481
Distinct (%)99.6%
Missing107
Missing (%)18.1%
Infinite0
Infinite (%)0.0%
Mean27371.268
Minimum10000
Maximum715848.09
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2023-12-12T14:41:56.904740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10000
5-th percentile10517.756
Q113000
median18205.17
Q328700.955
95-th percentile58878.289
Maximum715848.09
Range705848.09
Interquartile range (IQR)15700.955

Descriptive statistics

Standard deviation42102.379
Coefficient of variation (CV)1.5381962
Kurtosis157.53977
Mean27371.268
Median Absolute Deviation (MAD)6212.91
Skewness10.856147
Sum13220322
Variance1.7726103 × 109
MonotonicityNot monotonic
2023-12-12T14:41:57.081839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13000.0 3
 
0.5%
11288.98 1
 
0.2%
12399.44 1
 
0.2%
12378.94 1
 
0.2%
12341.01 1
 
0.2%
12276.41 1
 
0.2%
12101.0 1
 
0.2%
12004.426 1
 
0.2%
11950.96 1
 
0.2%
11766.14 1
 
0.2%
Other values (471) 471
79.8%
(Missing) 107
 
18.1%
ValueCountFrequency (%)
10000.0 1
0.2%
10011.41 1
0.2%
10027.79 1
0.2%
10052.34 1
0.2%
10101.36 1
0.2%
10111.71 1
0.2%
10122.43 1
0.2%
10166.89 1
0.2%
10203.11 1
0.2%
10288.37 1
0.2%
ValueCountFrequency (%)
715848.09 1
0.2%
343314.54 1
0.2%
239252.0 1
0.2%
219385.0 1
0.2%
212615.29 1
0.2%
181061.23 1
0.2%
141551.69 1
0.2%
114129.26 1
0.2%
110467.64 1
0.2%
99963.74 1
0.2%

세대수(공동주택)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct102
Distinct (%)95.3%
Missing483
Missing (%)81.9%
Infinite0
Infinite (%)0.0%
Mean906.5514
Minimum321
Maximum4424
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.3 KiB
2023-12-12T14:41:57.244679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum321
5-th percentile357
Q1470
median716
Q31055
95-th percentile2271.4
Maximum4424
Range4103
Interquartile range (IQR)585

Descriptive statistics

Standard deviation668.84158
Coefficient of variation (CV)0.73778672
Kurtosis8.0801969
Mean906.5514
Median Absolute Deviation (MAD)267
Skewness2.4924809
Sum97001
Variance447349.06
MonotonicityNot monotonic
2023-12-12T14:41:57.406678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
630 3
 
0.5%
364 2
 
0.3%
416 2
 
0.3%
476 2
 
0.3%
813 1
 
0.2%
496 1
 
0.2%
402 1
 
0.2%
679 1
 
0.2%
926 1
 
0.2%
515 1
 
0.2%
Other values (92) 92
 
15.6%
(Missing) 483
81.9%
ValueCountFrequency (%)
321 1
0.2%
322 1
0.2%
330 1
0.2%
332 1
0.2%
339 1
0.2%
354 1
0.2%
364 2
0.3%
368 1
0.2%
384 1
0.2%
390 1
0.2%
ValueCountFrequency (%)
4424 1
0.2%
3130 1
0.2%
3002 1
0.2%
2565 1
0.2%
2436 1
0.2%
2296 1
0.2%
2214 1
0.2%
1996 1
0.2%
1957 1
0.2%
1924 1
0.2%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.7 KiB
Minimum2023-10-23 00:00:00
Maximum2023-10-23 00:00:00
2023-12-12T14:41:57.564132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:41:57.654486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T14:41:54.354999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:41:53.812456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:41:54.085461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:41:54.429695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:41:53.912617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:41:54.180825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:41:54.514890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:41:54.012583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:41:54.279221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:41:57.730309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번연면적(건축물)세대수(공동주택)
연번1.0000.1180.835
연면적(건축물)0.1181.000NaN
세대수(공동주택)0.835NaN1.000
2023-12-12T14:41:57.848298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번연면적(건축물)세대수(공동주택)
연번1.0000.142-0.896
연면적(건축물)0.1421.000NaN
세대수(공동주택)-0.896NaN1.000

Missing values

2023-12-12T14:41:54.624938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:41:54.760773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T14:41:54.879682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번건물명도로명주소연면적(건축물)세대수(공동주택)데이터기준일
01현대고등학교서울특별시 강남구 압구정로 127 (압구정동)19120.87<NA>2023-10-23
12단국고등학교서울특별시 강남구 도곡로64길 21 (대치동)13669.14<NA>2023-10-23
23수서중학교서울특별시 강남구 광평로59길 57 (수서동)10307.11<NA>2023-10-23
34경기여자고등학교서울특별시 강남구 삼성로 29 (개포동)28097.93<NA>2023-10-23
45역삼중학교서울특별시 강남구 도곡로43길 10 (역삼동)11597.56<NA>2023-10-23
56서울언북초등학교서울특별시 강남구 삼성로135길 42 (청담동)25100.81<NA>2023-10-23
67국악고등학교서울특별시 강남구 개포로22길 65 (개포동)21743.47<NA>2023-10-23
78개원중학교서울특별시 강남구 영동대로 101 (개포동)11819.08<NA>2023-10-23
89세종고등학교서울특별시 강남구 광평로51길 36 (수서동)12748.78<NA>2023-10-23
910서울대도초등학교서울특별시 강남구 선릉로 209 (도곡동)11377.01<NA>2023-10-23
연번건물명도로명주소연면적(건축물)세대수(공동주택)데이터기준일
580581쌍용2차서울특별시 강남구 영동대로 220<NA>3642023-10-23
581582아이파크삼성동서울특별시 강남구 영동대로 640<NA>4492023-10-23
582583아카데미스위트서울특별시 강남구 언주로30길 21<NA>4142023-10-23
583584아크로힐스논현서울특별시 강남구 언주로 604<NA>3682023-10-23
584585도곡우성(구역삼우성)서울특별시 강남구 남부순환로363길 49<NA>3902023-10-23
585586역삼자이서울특별시 강남구 언주로 420<NA>4082023-10-23
586587우성1차서울특별시 강남구 영동대로 230<NA>4762023-10-23
587588타워팰리스3차서울특별시 강남구 언주로30길 26<NA>4802023-10-23
588589테헤란아이파크서울특별시 강남구 테헤란로 52길 16<NA>4112023-10-23
589590한아름서울특별시 강남구 광평로51길 22<NA>4982023-10-23