Overview

Dataset statistics

Number of variables6
Number of observations99
Missing cells6
Missing cells (%)1.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.9 KiB
Average record size in memory50.3 B

Variable types

Text4
Numeric1
Categorical1

Dataset

Description서울특별시 강남구 공공건축물 현황 입니다. 기타 사항은 서울특별 강남구 디지털도시과로 주시면 안내해 드리도록 하겠습니다.
URLhttps://www.data.go.kr/data/15112835/fileData.do

Alerts

건축면적(제곱미터) has 1 (1.0%) missing valuesMissing
연면적(제곱미터) has 1 (1.0%) missing valuesMissing
준공일자 has 4 (4.0%) missing valuesMissing
시설명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:40:38.761817
Analysis finished2023-12-12 01:40:39.981217
Duration1.22 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시설명
Text

UNIQUE 

Distinct99
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size924.0 B
2023-12-12T10:40:40.214017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length12
Mean length7.5252525
Min length5

Characters and Unicode

Total characters745
Distinct characters136
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique99 ?
Unique (%)100.0%

Sample

1st row강남지역자활센터+강남구어린이급식관리지원센터
2nd row강남구보건소
3rd row신사동 주민센터
4th row논현1동 주민센터
5th row논현2동 주민센터
ValueCountFrequency (%)
주민센터 10
 
8.6%
구립 2
 
1.7%
강남구립 2
 
1.7%
청담경로당 1
 
0.9%
상방교경로당 1
 
0.9%
하방교경로당 1
 
0.9%
선정경로당 1
 
0.9%
테헤란경로당 1
 
0.9%
역삼제3경로당 1
 
0.9%
한티경로당 1
 
0.9%
Other values (95) 95
81.9%
2023-12-12T10:40:40.659593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
49
 
6.6%
49
 
6.6%
41
 
5.5%
41
 
5.5%
41
 
5.5%
17
 
2.3%
17
 
2.3%
15
 
2.0%
15
 
2.0%
15
 
2.0%
Other values (126) 445
59.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 703
94.4%
Decimal Number 24
 
3.2%
Space Separator 17
 
2.3%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
7.0%
49
 
7.0%
41
 
5.8%
41
 
5.8%
41
 
5.8%
17
 
2.4%
15
 
2.1%
15
 
2.1%
15
 
2.1%
15
 
2.1%
Other values (119) 405
57.6%
Decimal Number
ValueCountFrequency (%)
1 9
37.5%
2 8
33.3%
3 3
 
12.5%
4 3
 
12.5%
6 1
 
4.2%
Space Separator
ValueCountFrequency (%)
17
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 703
94.4%
Common 42
 
5.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
7.0%
49
 
7.0%
41
 
5.8%
41
 
5.8%
41
 
5.8%
17
 
2.4%
15
 
2.1%
15
 
2.1%
15
 
2.1%
15
 
2.1%
Other values (119) 405
57.6%
Common
ValueCountFrequency (%)
17
40.5%
1 9
21.4%
2 8
19.0%
3 3
 
7.1%
4 3
 
7.1%
6 1
 
2.4%
+ 1
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 703
94.4%
ASCII 42
 
5.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
49
 
7.0%
49
 
7.0%
41
 
5.8%
41
 
5.8%
41
 
5.8%
17
 
2.4%
15
 
2.1%
15
 
2.1%
15
 
2.1%
15
 
2.1%
Other values (119) 405
57.6%
ASCII
ValueCountFrequency (%)
17
40.5%
1 9
21.4%
2 8
19.0%
3 3
 
7.1%
4 3
 
7.1%
6 1
 
2.4%
+ 1
 
2.4%
Distinct85
Distinct (%)85.9%
Missing0
Missing (%)0.0%
Memory size924.0 B
2023-12-12T10:40:41.117192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length32
Mean length19.818182
Min length16

Characters and Unicode

Total characters1962
Distinct characters63
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)77.8%

Sample

1st row서울특별시 강남구 개포로38길 12
2nd row서울특별시 강남구 선릉로668
3rd row서울특별시 강남구 압구정로 128
4th row서울특별시 강남구 학동로20길 25
5th row서울특별시 강남구 학동로43길 17
ValueCountFrequency (%)
서울특별시 99
24.6%
강남구 99
24.6%
22 7
 
1.7%
27 5
 
1.2%
개포로 5
 
1.2%
도곡로27길 4
 
1.0%
18 4
 
1.0%
5 4
 
1.0%
24 3
 
0.7%
11 3
 
0.7%
Other values (137) 169
42.0%
2023-12-12T10:40:41.773878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
306
15.6%
107
 
5.5%
102
 
5.2%
102
 
5.2%
100
 
5.1%
100
 
5.1%
99
 
5.0%
99
 
5.0%
99
 
5.0%
99
 
5.0%
Other values (53) 749
38.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1221
62.2%
Decimal Number 410
 
20.9%
Space Separator 306
 
15.6%
Dash Punctuation 16
 
0.8%
Open Punctuation 4
 
0.2%
Close Punctuation 4
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
107
8.8%
102
8.4%
102
8.4%
100
8.2%
100
8.2%
99
8.1%
99
8.1%
99
8.1%
99
8.1%
78
 
6.4%
Other values (38) 236
19.3%
Decimal Number
ValueCountFrequency (%)
1 75
18.3%
2 66
16.1%
5 41
10.0%
7 39
9.5%
6 38
9.3%
3 38
9.3%
4 31
7.6%
8 30
 
7.3%
9 26
 
6.3%
0 26
 
6.3%
Space Separator
ValueCountFrequency (%)
306
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Other Punctuation
ValueCountFrequency (%)
: 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1221
62.2%
Common 741
37.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
107
8.8%
102
8.4%
102
8.4%
100
8.2%
100
8.2%
99
8.1%
99
8.1%
99
8.1%
99
8.1%
78
 
6.4%
Other values (38) 236
19.3%
Common
ValueCountFrequency (%)
306
41.3%
1 75
 
10.1%
2 66
 
8.9%
5 41
 
5.5%
7 39
 
5.3%
6 38
 
5.1%
3 38
 
5.1%
4 31
 
4.2%
8 30
 
4.0%
9 26
 
3.5%
Other values (5) 51
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1221
62.2%
ASCII 741
37.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
306
41.3%
1 75
 
10.1%
2 66
 
8.9%
5 41
 
5.5%
7 39
 
5.3%
6 38
 
5.1%
3 38
 
5.1%
4 31
 
4.2%
8 30
 
4.0%
9 26
 
3.5%
Other values (5) 51
 
6.9%
Hangul
ValueCountFrequency (%)
107
8.8%
102
8.4%
102
8.4%
100
8.2%
100
8.2%
99
8.1%
99
8.1%
99
8.1%
99
8.1%
78
 
6.4%
Other values (38) 236
19.3%

건축면적(제곱미터)
Real number (ℝ)

MISSING 

Distinct80
Distinct (%)81.6%
Missing1
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean853.38449
Minimum60.48
Maximum18610.13
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1023.0 B
2023-12-12T10:40:41.973914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum60.48
5-th percentile81.4
Q1123.8475
median222.23
Q3585.5075
95-th percentile1728.8675
Maximum18610.13
Range18549.65
Interquartile range (IQR)461.66

Descriptive statistics

Standard deviation2350.062
Coefficient of variation (CV)2.7538138
Kurtosis38.023124
Mean853.38449
Median Absolute Deviation (MAD)133.385
Skewness5.842361
Sum83631.68
Variance5522791.3
MonotonicityNot monotonic
2023-12-12T10:40:42.182999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
222.23 4
 
4.0%
81.4 3
 
3.0%
729.84 3
 
3.0%
233.45 3
 
3.0%
155.54 3
 
3.0%
182.72 3
 
3.0%
125.85 2
 
2.0%
1377.65 2
 
2.0%
213.6 2
 
2.0%
169.07 2
 
2.0%
Other values (70) 71
71.7%
ValueCountFrequency (%)
60.48 1
 
1.0%
65.46 1
 
1.0%
73.59 1
 
1.0%
81.2 1
 
1.0%
81.4 3
3.0%
83.6 1
 
1.0%
84.28 1
 
1.0%
84.3 1
 
1.0%
86.5 1
 
1.0%
91.19 2
2.0%
ValueCountFrequency (%)
18610.13 1
1.0%
10969.04 1
1.0%
8886.19 1
1.0%
4958.82 1
1.0%
2370.66 1
1.0%
1615.61 1
1.0%
1571.72 1
1.0%
1538.16 1
1.0%
1399.55 1
1.0%
1377.65 2
2.0%
Distinct91
Distinct (%)92.9%
Missing1
Missing (%)1.0%
Memory size924.0 B
2023-12-12T10:40:42.588000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length5.6530612
Min length3

Characters and Unicode

Total characters554
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85 ?
Unique (%)86.7%

Sample

1st row825
2nd row7533.33
3rd row1722.8
4th row4003.3
5th row7472.9
ValueCountFrequency (%)
81.4 3
 
3.1%
1649.17 2
 
2.0%
792.92 2
 
2.0%
1029.72 2
 
2.0%
5232.08 2
 
2.0%
751.42 2
 
2.0%
95.5 1
 
1.0%
90.09 1
 
1.0%
93.13 1
 
1.0%
87.39 1
 
1.0%
Other values (81) 81
82.7%
2023-12-12T10:40:43.093548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 88
15.9%
2 67
12.1%
1 63
11.4%
4 51
9.2%
3 51
9.2%
9 49
8.8%
8 44
7.9%
6 44
7.9%
7 42
7.6%
0 28
 
5.1%
Other values (2) 27
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 464
83.8%
Other Punctuation 90
 
16.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 67
14.4%
1 63
13.6%
4 51
11.0%
3 51
11.0%
9 49
10.6%
8 44
9.5%
6 44
9.5%
7 42
9.1%
0 28
6.0%
5 25
 
5.4%
Other Punctuation
ValueCountFrequency (%)
. 88
97.8%
, 2
 
2.2%

Most occurring scripts

ValueCountFrequency (%)
Common 554
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
. 88
15.9%
2 67
12.1%
1 63
11.4%
4 51
9.2%
3 51
9.2%
9 49
8.8%
8 44
7.9%
6 44
7.9%
7 42
7.6%
0 28
 
5.1%
Other values (2) 27
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 554
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 88
15.9%
2 67
12.1%
1 63
11.4%
4 51
9.2%
3 51
9.2%
9 49
8.8%
8 44
7.9%
6 44
7.9%
7 42
7.6%
0 28
 
5.1%
Other values (2) 27
 
4.9%

관리부서
Categorical

Distinct30
Distinct (%)30.3%
Missing0
Missing (%)0.0%
Memory size924.0 B
어르신복지과
65 
생활체육과(강남구 도시관리공단 위탁 운영)
 
5
장애인복지과
 
2
도곡1동 주민센터
 
1
신사동 주민센터
 
1
Other values (25)
25 

Length

Max length23
Median length6
Mean length7.4444444
Min length5

Unique

Unique27 ?
Unique (%)27.3%

Sample

1st row사회보장과
2nd row보건행정과
3rd row신사동 주민센터
4th row논현1동 주민센터
5th row논현2동 주민센터

Common Values

ValueCountFrequency (%)
어르신복지과 65
65.7%
생활체육과(강남구 도시관리공단 위탁 운영) 5
 
5.1%
장애인복지과 2
 
2.0%
도곡1동 주민센터 1
 
1.0%
신사동 주민센터 1
 
1.0%
논현1동 주민센터 1
 
1.0%
논현2동 주민센터 1
 
1.0%
압구정동 주민센터 1
 
1.0%
청담동 주민센터 1
 
1.0%
삼성1동 주민센터 1
 
1.0%
Other values (20) 20
 
20.2%

Length

2023-12-12T10:40:43.265901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
어르신복지과 65
47.8%
주민센터 22
 
16.2%
도시관리공단 5
 
3.7%
위탁 5
 
3.7%
운영 5
 
3.7%
생활체육과(강남구 5
 
3.7%
장애인복지과 2
 
1.5%
일원1동 1
 
0.7%
개포2동 1
 
0.7%
개포3동 1
 
0.7%
Other values (24) 24
 
17.6%

준공일자
Text

MISSING 

Distinct80
Distinct (%)84.2%
Missing4
Missing (%)4.0%
Memory size924.0 B
2023-12-12T10:40:43.577058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length10.031579
Min length10

Characters and Unicode

Total characters953
Distinct characters17
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)76.8%

Sample

1st row1989-08-27
2nd row1975-12-26
3rd row2001-11-12
4th row2001-11-23
5th row1997-08-31
ValueCountFrequency (%)
2009-06-22 6
 
6.3%
1995-06-15 3
 
3.2%
2011-07-20 3
 
3.2%
2006-09-26 3
 
3.2%
1988-11-24 3
 
3.2%
2011-06-07 2
 
2.1%
1981-12-16 2
 
2.1%
1982-07-30 1
 
1.1%
1998-10-23 1
 
1.1%
1978-10-26 1
 
1.1%
Other values (70) 70
73.7%
2023-12-12T10:40:44.071181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 189
19.8%
0 188
19.7%
1 164
17.2%
2 137
14.4%
9 106
11.1%
8 40
 
4.2%
6 33
 
3.5%
7 30
 
3.1%
3 27
 
2.8%
5 18
 
1.9%
Other values (7) 21
 
2.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 758
79.5%
Dash Punctuation 189
 
19.8%
Other Letter 4
 
0.4%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 188
24.8%
1 164
21.6%
2 137
18.1%
9 106
14.0%
8 40
 
5.3%
6 33
 
4.4%
7 30
 
4.0%
3 27
 
3.6%
5 18
 
2.4%
4 15
 
2.0%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 189
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 949
99.6%
Hangul 4
 
0.4%

Most frequent character per script

Common
ValueCountFrequency (%)
- 189
19.9%
0 188
19.8%
1 164
17.3%
2 137
14.4%
9 106
11.2%
8 40
 
4.2%
6 33
 
3.5%
7 30
 
3.2%
3 27
 
2.8%
5 18
 
1.9%
Other values (3) 17
 
1.8%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 949
99.6%
Hangul 4
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 189
19.9%
0 188
19.8%
1 164
17.3%
2 137
14.4%
9 106
11.2%
8 40
 
4.2%
6 33
 
3.5%
7 30
 
3.2%
3 27
 
2.8%
5 18
 
1.9%
Other values (3) 17
 
1.8%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Interactions

2023-12-12T10:40:39.539754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:40:44.209216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설명주소(도로명)건축면적(제곱미터)연면적(제곱미터)관리부서준공일자
시설명1.0001.0001.0001.0001.0001.000
주소(도로명)1.0001.0000.8790.9991.0000.999
건축면적(제곱미터)1.0000.8791.0001.0000.0001.000
연면적(제곱미터)1.0000.9991.0001.0001.0001.000
관리부서1.0001.0000.0001.0001.0001.000
준공일자1.0000.9991.0001.0001.0001.000
2023-12-12T10:40:44.326470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
건축면적(제곱미터)관리부서
건축면적(제곱미터)1.0000.000
관리부서0.0001.000

Missing values

2023-12-12T10:40:39.671690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:40:39.804752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T10:40:39.904240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시설명주소(도로명)건축면적(제곱미터)연면적(제곱미터)관리부서준공일자
0강남지역자활센터+강남구어린이급식관리지원센터서울특별시 강남구 개포로38길 12218.1825사회보장과1989-08-27
1강남구보건소서울특별시 강남구 선릉로6681399.557533.33보건행정과1975-12-26
2신사동 주민센터서울특별시 강남구 압구정로 128254.341722.8신사동 주민센터2001-11-12
3논현1동 주민센터서울특별시 강남구 학동로20길 25549.754003.3논현1동 주민센터2001-11-23
4논현2동 주민센터서울특별시 강남구 학동로43길 171010.467472.9논현2동 주민센터1997-08-31
5압구정동 주민센터서울특별시 강남구 압구정로33길 48439.71402.2압구정동 주민센터1979-05-09
6청담문화센터서울특별시 강남구 압구정로79길 26348.053203.2청담동 주민센터1991-10-12
7삼성1문화센터서울특별시 강남구 봉은사로 616575.486543.8삼성1동 주민센터2009-09-06
8삼성2문화센터서울특별시 강남구 봉은사로 419426.584144.9삼성2동 주민센터2004-11-12
9대치1문화센터서울특별시 강남구 남부순환로391길 19283.112513.9대치1동 주민센터2008-03-05
시설명주소(도로명)건축면적(제곱미터)연면적(제곱미터)관리부서준공일자
89반고개경로당서울특별시 강남구 헌릉로622길 994.498.4어르신복지과1992-10-06
90방죽2경로당서울특별시 강남구 밤고개로23길 14-3199.45198.9어르신복지과1986-06-22
91방죽1경로당서울특별시 강남구 밤고개로24길 60186.65364.66어르신복지과2011-01-15
92윗반고개마을경로당서울특별시 강남구 헌릉로645길 111153.56209.97어르신복지과2015-11-05
93강남도시관제센터서울특별시 강남구 역삼동 687-10(언주로108길 20)575.541088.93재난안전과2004-06-17
94강남스포츠문화센터서울특별시 강남구 밤고개로1길 521571.729914.44생활체육과(강남구 도시관리공단 위탁 운영)2004-12-08
95구민체육관서울특별시 강남구 개포로28길 47978.072494.36생활체육과(강남구 도시관리공단 위탁 운영)1993-12-31
96대진체육관서울특별시 강남구 개포로109길 62885.91885.91생활체육과(강남구 도시관리공단 위탁 운영)1997-12-03
97매봉산실내배드민턴장서울특별시 강남구 남부순환로 2711812.83987.99생활체육과(강남구 도시관리공단 위탁 운영)2017-06-08
98일원스포츠문화센터서울특별시 강남구 영동대로 2218610.139682.27생활체육과(강남구 도시관리공단 위탁 운영)2021-07-30