Overview

Dataset statistics

Number of variables6
Number of observations203
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.8 KiB
Average record size in memory49.7 B

Variable types

Numeric1
Categorical1
Text2
Boolean1
DateTime1

Dataset

Description서울시 서초구에서 제공하는 공공건축물 정보(건축물 구분, 건물명, 석면건축물 여부, 주소)에 대한 데이터 자료입니다,
Author서울특별시 서초구
URLhttps://www.data.go.kr/data/15052329/fileData.do

Alerts

연번 is highly overall correlated with 건축물구분High correlation
건축물구분 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 16:51:59.696526
Analysis finished2023-12-12 16:52:00.266263
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct203
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean102
Minimum1
Maximum203
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-13T01:52:00.335595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile11.1
Q151.5
median102
Q3152.5
95-th percentile192.9
Maximum203
Range202
Interquartile range (IQR)101

Descriptive statistics

Standard deviation58.745213
Coefficient of variation (CV)0.57593346
Kurtosis-1.2
Mean102
Median Absolute Deviation (MAD)51
Skewness0
Sum20706
Variance3451
MonotonicityStrictly increasing
2023-12-13T01:52:00.468326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.5%
141 1
 
0.5%
131 1
 
0.5%
132 1
 
0.5%
133 1
 
0.5%
134 1
 
0.5%
135 1
 
0.5%
136 1
 
0.5%
137 1
 
0.5%
138 1
 
0.5%
Other values (193) 193
95.1%
ValueCountFrequency (%)
1 1
0.5%
2 1
0.5%
3 1
0.5%
4 1
0.5%
5 1
0.5%
6 1
0.5%
7 1
0.5%
8 1
0.5%
9 1
0.5%
10 1
0.5%
ValueCountFrequency (%)
203 1
0.5%
202 1
0.5%
201 1
0.5%
200 1
0.5%
199 1
0.5%
198 1
0.5%
197 1
0.5%
196 1
0.5%
195 1
0.5%
194 1
0.5%

건축물구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
행정기관
141 
공공기관
37 
특수법인
 
14
지방공사/공단
 
11

Length

Max length7
Median length4
Mean length4.1625616
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row행정기관
2nd row행정기관
3rd row행정기관
4th row행정기관
5th row행정기관

Common Values

ValueCountFrequency (%)
행정기관 141
69.5%
공공기관 37
 
18.2%
특수법인 14
 
6.9%
지방공사/공단 11
 
5.4%

Length

2023-12-13T01:52:00.602169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:52:00.691622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
행정기관 141
69.5%
공공기관 37
 
18.2%
특수법인 14
 
6.9%
지방공사/공단 11
 
5.4%
Distinct202
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T01:52:00.914127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length19
Mean length10.389163
Min length3

Characters and Unicode

Total characters2109
Distinct characters253
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique201 ?
Unique (%)99.0%

Sample

1st row신중초등학교 서관
2nd row전라북도서울장학숙 별관
3rd row서울소방학교 본관
4th row서울검찰청사 다솜어린이집
5th row국립중앙도서관 본관2
ValueCountFrequency (%)
한국농수산식품유통공사 7
 
2.2%
서울고등법원 6
 
1.9%
서초소방서 6
 
1.9%
서울특별시교육연수원 6
 
1.9%
한전아트센터 6
 
1.9%
본관 5
 
1.6%
국립국악원 5
 
1.6%
ibk기업은행 4
 
1.3%
3호선 4
 
1.3%
반포빗물펌프장 4
 
1.3%
Other values (231) 262
83.2%
2023-12-13T01:52:01.274493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
112
 
5.3%
94
 
4.5%
51
 
2.4%
44
 
2.1%
42
 
2.0%
41
 
1.9%
39
 
1.8%
39
 
1.8%
38
 
1.8%
38
 
1.8%
Other values (243) 1571
74.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1858
88.1%
Space Separator 112
 
5.3%
Decimal Number 52
 
2.5%
Uppercase Letter 27
 
1.3%
Close Punctuation 26
 
1.2%
Open Punctuation 26
 
1.2%
Other Punctuation 4
 
0.2%
Other Symbol 2
 
0.1%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
94
 
5.1%
51
 
2.7%
44
 
2.4%
42
 
2.3%
41
 
2.2%
39
 
2.1%
39
 
2.1%
38
 
2.0%
38
 
2.0%
37
 
2.0%
Other values (216) 1395
75.1%
Uppercase Letter
ValueCountFrequency (%)
K 6
22.2%
I 5
18.5%
B 4
14.8%
T 2
 
7.4%
A 2
 
7.4%
C 1
 
3.7%
D 1
 
3.7%
F 1
 
3.7%
G 1
 
3.7%
S 1
 
3.7%
Other values (3) 3
11.1%
Decimal Number
ValueCountFrequency (%)
1 20
38.5%
9 10
19.2%
2 10
19.2%
3 7
 
13.5%
7 3
 
5.8%
4 2
 
3.8%
Other Punctuation
ValueCountFrequency (%)
, 3
75.0%
/ 1
 
25.0%
Lowercase Letter
ValueCountFrequency (%)
v 1
50.0%
t 1
50.0%
Space Separator
ValueCountFrequency (%)
112
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1860
88.2%
Common 220
 
10.4%
Latin 29
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
94
 
5.1%
51
 
2.7%
44
 
2.4%
42
 
2.3%
41
 
2.2%
39
 
2.1%
39
 
2.1%
38
 
2.0%
38
 
2.0%
37
 
2.0%
Other values (217) 1397
75.1%
Latin
ValueCountFrequency (%)
K 6
20.7%
I 5
17.2%
B 4
13.8%
T 2
 
6.9%
A 2
 
6.9%
C 1
 
3.4%
D 1
 
3.4%
F 1
 
3.4%
G 1
 
3.4%
S 1
 
3.4%
Other values (5) 5
17.2%
Common
ValueCountFrequency (%)
112
50.9%
) 26
 
11.8%
( 26
 
11.8%
1 20
 
9.1%
9 10
 
4.5%
2 10
 
4.5%
3 7
 
3.2%
, 3
 
1.4%
7 3
 
1.4%
4 2
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1858
88.1%
ASCII 249
 
11.8%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
112
45.0%
) 26
 
10.4%
( 26
 
10.4%
1 20
 
8.0%
9 10
 
4.0%
2 10
 
4.0%
3 7
 
2.8%
K 6
 
2.4%
I 5
 
2.0%
B 4
 
1.6%
Other values (16) 23
 
9.2%
Hangul
ValueCountFrequency (%)
94
 
5.1%
51
 
2.7%
44
 
2.4%
42
 
2.3%
41
 
2.2%
39
 
2.1%
39
 
2.1%
38
 
2.0%
38
 
2.0%
37
 
2.0%
Other values (216) 1395
75.1%
None
ValueCountFrequency (%)
2
100.0%
Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size335.0 B
False
116 
True
87 
ValueCountFrequency (%)
False 116
57.1%
True 87
42.9%
2023-12-13T01:52:01.386733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

주소
Text

Distinct147
Distinct (%)72.4%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-13T01:52:01.622057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length29
Mean length25.20197
Min length21

Characters and Unicode

Total characters5116
Distinct characters103
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique121 ?
Unique (%)59.6%

Sample

1st row서울특별시 서초구 남부순환로317길 15 (서초동)
2nd row서울특별시 서초구 명달로9길 117-17 (방배동)
3rd row서울특별시 서초구 남부순환로340길 29 (서초동)
4th row서울특별시 서초구 반포대로 158 (서초동)
5th row서울특별시 서초구 반포대로 201 (반포동)
ValueCountFrequency (%)
서울특별시 203
19.6%
서초구 203
19.6%
서초동 74
 
7.1%
방배동 35
 
3.4%
반포동 32
 
3.1%
양재동 30
 
2.9%
남부순환로 26
 
2.5%
지하 17
 
1.6%
반포대로 13
 
1.3%
서초중앙로 11
 
1.1%
Other values (204) 392
37.8%
2023-12-13T01:52:02.039599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
833
16.3%
507
 
9.9%
300
 
5.9%
210
 
4.1%
203
 
4.0%
203
 
4.0%
203
 
4.0%
203
 
4.0%
203
 
4.0%
198
 
3.9%
Other values (93) 2053
40.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3170
62.0%
Space Separator 833
 
16.3%
Decimal Number 706
 
13.8%
Close Punctuation 181
 
3.5%
Open Punctuation 181
 
3.5%
Dash Punctuation 41
 
0.8%
Other Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
507
16.0%
300
 
9.5%
210
 
6.6%
203
 
6.4%
203
 
6.4%
203
 
6.4%
203
 
6.4%
203
 
6.4%
198
 
6.2%
76
 
2.4%
Other values (78) 864
27.3%
Decimal Number
ValueCountFrequency (%)
1 132
18.7%
2 114
16.1%
0 74
10.5%
5 72
10.2%
7 69
9.8%
4 68
9.6%
3 64
9.1%
9 39
 
5.5%
8 37
 
5.2%
6 37
 
5.2%
Space Separator
ValueCountFrequency (%)
833
100.0%
Close Punctuation
ValueCountFrequency (%)
) 181
100.0%
Open Punctuation
ValueCountFrequency (%)
( 181
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3170
62.0%
Common 1946
38.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
507
16.0%
300
 
9.5%
210
 
6.6%
203
 
6.4%
203
 
6.4%
203
 
6.4%
203
 
6.4%
203
 
6.4%
198
 
6.2%
76
 
2.4%
Other values (78) 864
27.3%
Common
ValueCountFrequency (%)
833
42.8%
) 181
 
9.3%
( 181
 
9.3%
1 132
 
6.8%
2 114
 
5.9%
0 74
 
3.8%
5 72
 
3.7%
7 69
 
3.5%
4 68
 
3.5%
3 64
 
3.3%
Other values (5) 158
 
8.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3170
62.0%
ASCII 1946
38.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
833
42.8%
) 181
 
9.3%
( 181
 
9.3%
1 132
 
6.8%
2 114
 
5.9%
0 74
 
3.8%
5 72
 
3.7%
7 69
 
3.5%
4 68
 
3.5%
3 64
 
3.3%
Other values (5) 158
 
8.1%
Hangul
ValueCountFrequency (%)
507
16.0%
300
 
9.5%
210
 
6.6%
203
 
6.4%
203
 
6.4%
203
 
6.4%
203
 
6.4%
203
 
6.4%
198
 
6.2%
76
 
2.4%
Other values (78) 864
27.3%
Distinct94
Distinct (%)46.3%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
Minimum2008-06-30 00:00:00
Maximum2014-11-14 00:00:00
2023-12-13T01:52:02.178083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:52:02.334724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-13T01:52:00.000340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:52:02.426662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번건축물구분석면건축물여부조사일자
연번1.0000.7250.3100.927
건축물구분0.7251.0000.2220.994
석면건축물여부0.3100.2221.0000.541
조사일자0.9270.9940.5411.000
2023-12-13T01:52:02.515009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
석면건축물여부건축물구분
석면건축물여부1.0000.146
건축물구분0.1461.000
2023-12-13T01:52:02.586829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번건축물구분석면건축물여부
연번1.0000.5220.220
건축물구분0.5221.0000.146
석면건축물여부0.2200.1461.000

Missing values

2023-12-13T01:52:00.127862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:52:00.230421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번건축물구분건물명석면건축물여부주소조사일자
01행정기관신중초등학교 서관N서울특별시 서초구 남부순환로317길 15 (서초동)2010-09-06
12행정기관전라북도서울장학숙 별관Y서울특별시 서초구 명달로9길 117-17 (방배동)2013-03-26
23행정기관서울소방학교 본관N서울특별시 서초구 남부순환로340길 29 (서초동)2010-02-02
34행정기관서울검찰청사 다솜어린이집N서울특별시 서초구 반포대로 158 (서초동)2013-06-10
45행정기관국립중앙도서관 본관2Y서울특별시 서초구 반포대로 201 (반포동)2014-06-16
56행정기관전라북도서울장학숙 본관Y서울특별시 서초구 명달로9길 117-17 (방배동)2013-03-26
67행정기관잠원스포츠파크N서울특별시 서초구 신반포로23길 66 (잠원동)2012-08-30
78행정기관잠원빗물펌프장N서울특별시 서초구 잠원로14길 10 (잠원동)2012-09-18
89행정기관잠원동주민센터Y서울특별시 서초구 나루터로 38 (잠원동)2011-12-07
910행정기관우면산배수지N서울특별시 서초구 남부순환로344길 10 (서초동)2012-09-11
연번건축물구분건물명석면건축물여부주소조사일자
193194공공기관한국농수산식품유통공사 화훼공판장(중매인점포 지하동)Y서울특별시 서초구 강남대로 27 (양재동)2013-04-22
194195공공기관한국농수산식품유통공사 화훼공판장(자재점포동)Y서울특별시 서초구 강남대로 27 (양재동)2013-04-22
195196공공기관한국농수산식품유통공사 화훼공판장(본관)Y서울특별시 서초구 강남대로 27 (양재동)2013-04-22
196197공공기관한국농수산식품유통공사 AT센터N서울특별시 서초구 강남대로 27 (양재동)2013-10-10
197198특수법인한국교육단체총연합회Y서울특별시 서초구 태봉로 114 (우면동)2014-03-18
198199행정기관한강홍수통제소N서울특별시 서초구 동작대로 328 (반포동)2014-04-01
199200행정기관한강공원 잠원안내센터Y서울특별시 서초구 잠원로 221-124 (잠원동)2011-09-03
200201행정기관한강공원 반포안내센터N서울특별시 서초구 신반포로11길 40 (반포동)2010-07-21
201202행정기관품질시험소 본관Y서울특별시 서초구 태봉로 131 (우면동)2010-03-10
202203행정기관품질시험소 별관Y서울특별시 서초구 태봉로 108 (우면동)2010-07-21