Overview

Dataset statistics

Number of variables12
Number of observations3965
Missing cells5707
Missing cells (%)12.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory383.5 KiB
Average record size in memory99.0 B

Variable types

Text4
Categorical4
DateTime1
Numeric3

Dataset

Description한국부동산원(구.한국감정원)에서 제공하는 녹색건축 인증현황 정보 중 인증번호,시도,인증기준,인증구분,건축물명,인증대상 건축물군,인증등급,인증일자,건축물 주소 데이터입니다.
Author한국부동산원
URLhttps://www.data.go.kr/data/15003214/fileData.do

Alerts

인증기준 has constant value ""Constant
연면적 is highly overall correlated with 지상층수High correlation
지상층수 is highly overall correlated with 연면적 and 1 other fieldsHigh correlation
지하층수 is highly overall correlated with 지상층수High correlation
연면적 has 1886 (47.6%) missing valuesMissing
지상층수 has 1909 (48.1%) missing valuesMissing
지하층수 has 1912 (48.2%) missing valuesMissing
지하층수 has 109 (2.7%) zerosZeros

Reproduction

Analysis started2024-01-14 13:54:34.831935
Analysis finished2024-01-14 13:54:37.756393
Duration2.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct3963
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size31.1 KiB
2024-01-14T22:54:38.074124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length8
Mean length10.817654
Min length8

Characters and Unicode

Total characters42892
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3961 ?
Unique (%)99.9%

Sample

1st row2015-001
2nd row2015-002
3rd row2015-003
4th row2015-004
5th row2015-005
ValueCountFrequency (%)
2019-089 2
 
0.1%
2017-155 2
 
0.1%
g-seed-p-2023-1437-5 1
 
< 0.1%
2021-263 1
 
< 0.1%
2021-268 1
 
< 0.1%
2021-295 1
 
< 0.1%
2021-294 1
 
< 0.1%
2015-001 1
 
< 0.1%
2021-281 1
 
< 0.1%
2021-269 1
 
< 0.1%
Other values (3953) 3953
99.7%
2024-01-14T22:54:38.629096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 8024
18.7%
- 7689
17.9%
0 6855
16.0%
1 4345
10.1%
5 2066
 
4.8%
3 1963
 
4.6%
E 1862
 
4.3%
9 1370
 
3.2%
8 1341
 
3.1%
4 1286
 
3.0%
Other values (7) 6091
14.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 29617
69.1%
Dash Punctuation 7689
 
17.9%
Uppercase Letter 5586
 
13.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 8024
27.1%
0 6855
23.1%
1 4345
14.7%
5 2066
 
7.0%
3 1963
 
6.6%
9 1370
 
4.6%
8 1341
 
4.5%
4 1286
 
4.3%
7 1218
 
4.1%
6 1149
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
E 1862
33.3%
G 931
16.7%
S 931
16.7%
D 931
16.7%
P 541
 
9.7%
C 390
 
7.0%
Dash Punctuation
ValueCountFrequency (%)
- 7689
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 37306
87.0%
Latin 5586
 
13.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 8024
21.5%
- 7689
20.6%
0 6855
18.4%
1 4345
11.6%
5 2066
 
5.5%
3 1963
 
5.3%
9 1370
 
3.7%
8 1341
 
3.6%
4 1286
 
3.4%
7 1218
 
3.3%
Latin
ValueCountFrequency (%)
E 1862
33.3%
G 931
16.7%
S 931
16.7%
D 931
16.7%
P 541
 
9.7%
C 390
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 42892
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 8024
18.7%
- 7689
17.9%
0 6855
16.0%
1 4345
10.1%
5 2066
 
4.8%
3 1963
 
4.6%
E 1862
 
4.3%
9 1370
 
3.2%
8 1341
 
3.1%
4 1286
 
3.0%
Other values (7) 6091
14.2%

시도
Categorical

Distinct23
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size31.1 KiB
서울특별시
1297 
경기도
1065 
인천광역시
282 
경상남도
157 
경상북도
139 
Other values (18)
1025 

Length

Max length11
Median length5
Mean length4.3508197
Min length2

Unique

Unique4 ?
Unique (%)0.1%

Sample

1st row경기도
2nd row서울특별시
3rd row세종특별자치시
4th row충청남도
5th row경기도

Common Values

ValueCountFrequency (%)
서울특별시 1297
32.7%
경기도 1065
26.9%
인천광역시 282
 
7.1%
경상남도 157
 
4.0%
경상북도 139
 
3.5%
대구광역시 138
 
3.5%
충청남도 135
 
3.4%
세종특별자치시 117
 
3.0%
부산광역시 91
 
2.3%
강원도 84
 
2.1%
Other values (13) 460
 
11.6%

Length

2024-01-14T22:54:38.795801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시 1297
32.7%
경기도 1065
26.9%
인천광역시 282
 
7.1%
경상남도 157
 
4.0%
경상북도 139
 
3.5%
대구광역시 138
 
3.5%
충청남도 135
 
3.4%
세종특별자치시 117
 
3.0%
부산광역시 91
 
2.3%
강원도 84
 
2.1%
Other values (13) 460
 
11.6%

인증기준
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size31.1 KiB
녹색
3965 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row녹색
2nd row녹색
3rd row녹색
4th row녹색
5th row녹색

Common Values

ValueCountFrequency (%)
녹색 3965
100.0%

Length

2024-01-14T22:54:39.262363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-14T22:54:39.390223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
녹색 3965
100.0%

인증구분
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size31.1 KiB
예비인증
2392 
본인증
1570 
유효기간연장(본인증)
 
3

Length

Max length11
Median length4
Mean length3.6093317
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row본인증
2nd row본인증
3rd row본인증
4th row예비인증
5th row예비인증

Common Values

ValueCountFrequency (%)
예비인증 2392
60.3%
본인증 1570
39.6%
유효기간연장(본인증) 3
 
0.1%

Length

2024-01-14T22:54:39.541155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-14T22:54:39.667749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
예비인증 2392
60.3%
본인증 1570
39.6%
유효기간연장(본인증 3
 
0.1%
Distinct3688
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size31.1 KiB
2024-01-14T22:54:40.019997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length36
Mean length16.616393
Min length3

Characters and Unicode

Total characters65884
Distinct characters632
Distinct categories14 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3416 ?
Unique (%)86.2%

Sample

1st row고양원흥 2초등학교(경비실 제외)
2nd row마곡 에스비타운
3rd rowNH농협 세종통합센터
4th row장항국가생태산단 A-1BL공동주택 건설공사
5th row시흥배곧신도시B11BL 호반베르디움
ValueCountFrequency (%)
신축공사 649
 
5.2%
공동주택 529
 
4.3%
아파트 165
 
1.3%
128
 
1.0%
오피스텔 99
 
0.8%
건설공사 92
 
0.7%
정비사업 79
 
0.6%
주상복합 78
 
0.6%
근린생활시설 66
 
0.5%
업무시설 64
 
0.5%
Other values (5272) 10463
84.3%
2024-01-14T22:54:40.630039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8461
 
12.8%
2023
 
3.1%
1875
 
2.8%
1808
 
2.7%
1721
 
2.6%
1454
 
2.2%
1322
 
2.0%
1255
 
1.9%
1041
 
1.6%
1 1025
 
1.6%
Other values (622) 43899
66.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 48307
73.3%
Space Separator 8461
 
12.8%
Decimal Number 4144
 
6.3%
Uppercase Letter 2715
 
4.1%
Dash Punctuation 853
 
1.3%
Close Punctuation 533
 
0.8%
Open Punctuation 528
 
0.8%
Lowercase Letter 163
 
0.2%
Other Punctuation 150
 
0.2%
Letter Number 8
 
< 0.1%
Other values (4) 22
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2023
 
4.2%
1875
 
3.9%
1808
 
3.7%
1721
 
3.6%
1454
 
3.0%
1322
 
2.7%
1255
 
2.6%
1041
 
2.2%
999
 
2.1%
971
 
2.0%
Other values (545) 33838
70.0%
Uppercase Letter
ValueCountFrequency (%)
B 739
27.2%
L 647
23.8%
A 405
14.9%
C 129
 
4.8%
S 100
 
3.7%
R 72
 
2.7%
M 67
 
2.5%
T 61
 
2.2%
D 59
 
2.2%
I 59
 
2.2%
Other values (16) 377
13.9%
Lowercase Letter
ValueCountFrequency (%)
e 46
28.2%
a 17
 
10.4%
t 10
 
6.1%
b 9
 
5.5%
h 9
 
5.5%
l 9
 
5.5%
c 8
 
4.9%
r 8
 
4.9%
o 7
 
4.3%
i 6
 
3.7%
Other values (10) 34
20.9%
Decimal Number
ValueCountFrequency (%)
1 1025
24.7%
2 917
22.1%
3 534
12.9%
4 367
 
8.9%
5 288
 
6.9%
6 247
 
6.0%
0 237
 
5.7%
7 188
 
4.5%
9 173
 
4.2%
8 168
 
4.1%
Other Punctuation
ValueCountFrequency (%)
, 101
67.3%
& 21
 
14.0%
. 13
 
8.7%
· 7
 
4.7%
/ 6
 
4.0%
' 2
 
1.3%
Letter Number
ValueCountFrequency (%)
6
75.0%
1
 
12.5%
1
 
12.5%
Close Punctuation
ValueCountFrequency (%)
) 524
98.3%
] 9
 
1.7%
Open Punctuation
ValueCountFrequency (%)
( 519
98.3%
[ 9
 
1.7%
Other Symbol
ValueCountFrequency (%)
4
66.7%
2
33.3%
Math Symbol
ValueCountFrequency (%)
+ 4
50.0%
~ 4
50.0%
Space Separator
ValueCountFrequency (%)
8461
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 853
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 7
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 48310
73.3%
Common 14687
 
22.3%
Latin 2886
 
4.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2023
 
4.2%
1875
 
3.9%
1808
 
3.7%
1721
 
3.6%
1454
 
3.0%
1322
 
2.7%
1255
 
2.6%
1041
 
2.2%
999
 
2.1%
971
 
2.0%
Other values (545) 33841
70.0%
Latin
ValueCountFrequency (%)
B 739
25.6%
L 647
22.4%
A 405
14.0%
C 129
 
4.5%
S 100
 
3.5%
R 72
 
2.5%
M 67
 
2.3%
T 61
 
2.1%
D 59
 
2.0%
I 59
 
2.0%
Other values (39) 548
19.0%
Common
ValueCountFrequency (%)
8461
57.6%
1 1025
 
7.0%
2 917
 
6.2%
- 853
 
5.8%
3 534
 
3.6%
) 524
 
3.6%
( 519
 
3.5%
4 367
 
2.5%
5 288
 
2.0%
6 247
 
1.7%
Other values (17) 952
 
6.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 48304
73.3%
ASCII 17555
 
26.6%
None 11
 
< 0.1%
Number Forms 8
 
< 0.1%
Compat Jamo 2
 
< 0.1%
Geometric Shapes 2
 
< 0.1%
CJK 1
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8461
48.2%
1 1025
 
5.8%
2 917
 
5.2%
- 853
 
4.9%
B 739
 
4.2%
L 647
 
3.7%
3 534
 
3.0%
) 524
 
3.0%
( 519
 
3.0%
A 405
 
2.3%
Other values (60) 2931
 
16.7%
Hangul
ValueCountFrequency (%)
2023
 
4.2%
1875
 
3.9%
1808
 
3.7%
1721
 
3.6%
1454
 
3.0%
1322
 
2.7%
1255
 
2.6%
1041
 
2.2%
999
 
2.1%
971
 
2.0%
Other values (543) 33835
70.0%
None
ValueCountFrequency (%)
· 7
63.6%
4
36.4%
Number Forms
ValueCountFrequency (%)
6
75.0%
1
 
12.5%
1
 
12.5%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Geometric Shapes
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
Distinct68
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size31.1 KiB
2024-01-14T22:54:40.898167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length4
Mean length6.0116015
Min length4

Characters and Unicode

Total characters23836
Distinct characters49
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)0.3%

Sample

1st row학교시설
2nd row그 밖의 건축물
3rd row업무용 건축물
4th row공동주택
5th row공동주택
ValueCountFrequency (%)
공동주택 1629
33.9%
일반건축물 634
 
13.2%
건축물 368
 
7.7%
업무용건축물 294
 
6.1%
237
 
4.9%
밖의 237
 
4.9%
신축 217
 
4.5%
학교시설 207
 
4.3%
일반주택 148
 
3.1%
업무용 125
 
2.6%
Other values (63) 714
14.8%
2024-01-14T22:54:41.307435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2352
 
9.9%
2127
 
8.9%
2037
 
8.5%
1864
 
7.8%
1825
 
7.7%
1799
 
7.5%
1799
 
7.5%
1176
 
4.9%
1176
 
4.9%
845
 
3.5%
Other values (39) 6836
28.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21541
90.4%
Space Separator 845
 
3.5%
Other Punctuation 452
 
1.9%
Open Punctuation 407
 
1.7%
Close Punctuation 407
 
1.7%
Math Symbol 92
 
0.4%
Uppercase Letter 92
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2352
10.9%
2127
9.9%
2037
9.5%
1864
 
8.7%
1825
 
8.5%
1799
 
8.4%
1799
 
8.4%
1176
 
5.5%
1176
 
5.5%
724
 
3.4%
Other values (29) 4662
21.6%
Open Punctuation
ValueCountFrequency (%)
( 406
99.8%
[ 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 406
99.8%
] 1
 
0.2%
Math Symbol
ValueCountFrequency (%)
> 46
50.0%
< 46
50.0%
Uppercase Letter
ValueCountFrequency (%)
R 46
50.0%
B 46
50.0%
Space Separator
ValueCountFrequency (%)
845
100.0%
Other Punctuation
ValueCountFrequency (%)
, 452
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21541
90.4%
Common 2203
 
9.2%
Latin 92
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2352
10.9%
2127
9.9%
2037
9.5%
1864
 
8.7%
1825
 
8.5%
1799
 
8.4%
1799
 
8.4%
1176
 
5.5%
1176
 
5.5%
724
 
3.4%
Other values (29) 4662
21.6%
Common
ValueCountFrequency (%)
845
38.4%
, 452
20.5%
( 406
18.4%
) 406
18.4%
> 46
 
2.1%
< 46
 
2.1%
[ 1
 
< 0.1%
] 1
 
< 0.1%
Latin
ValueCountFrequency (%)
R 46
50.0%
B 46
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21541
90.4%
ASCII 2295
 
9.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2352
10.9%
2127
9.9%
2037
9.5%
1864
 
8.7%
1825
 
8.5%
1799
 
8.4%
1799
 
8.4%
1176
 
5.5%
1176
 
5.5%
724
 
3.4%
Other values (29) 4662
21.6%
ASCII
ValueCountFrequency (%)
845
36.8%
, 452
19.7%
( 406
17.7%
) 406
17.7%
> 46
 
2.0%
R 46
 
2.0%
B 46
 
2.0%
< 46
 
2.0%
[ 1
 
< 0.1%
] 1
 
< 0.1%

인증등급
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size31.1 KiB
일반(그린4등급)
2236 
우수(그린2등급)
1022 
우량(그린3등급)
560 
최우수(그린1등급)
 
146
우수
 
1

Length

Max length10
Median length9
Mean length9.0350567
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row일반(그린4등급)
2nd row우수(그린2등급)
3rd row우량(그린3등급)
4th row일반(그린4등급)
5th row일반(그린4등급)

Common Values

ValueCountFrequency (%)
일반(그린4등급) 2236
56.4%
우수(그린2등급) 1022
25.8%
우량(그린3등급) 560
 
14.1%
최우수(그린1등급) 146
 
3.7%
우수 1
 
< 0.1%

Length

2024-01-14T22:54:41.490194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-14T22:54:41.625060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반(그린4등급 2236
56.4%
우수(그린2등급 1022
25.8%
우량(그린3등급 560
 
14.1%
최우수(그린1등급 146
 
3.7%
우수 1
 
< 0.1%
Distinct972
Distinct (%)24.5%
Missing0
Missing (%)0.0%
Memory size31.1 KiB
Minimum2015-01-08 00:00:00
Maximum2023-12-29 00:00:00
2024-01-14T22:54:41.768296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-14T22:54:41.927949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct3671
Distinct (%)92.6%
Missing0
Missing (%)0.0%
Memory size31.1 KiB
2024-01-14T22:54:42.270434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length85
Median length57
Mean length24.543253
Min length7

Characters and Unicode

Total characters97314
Distinct characters467
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3402 ?
Unique (%)85.8%

Sample

1st row경기도 고양시 덕양구 도내동 1012(고양원흥보금자리주택지구 내 초2용지)
2nd row서울특별시 강서구 방화동 마곡지구 상업용지 B2-2BL
3rd row세종특별자치시 어진동 63-2(행복도시 1-5생활권 C3-4)
4th row충청남도 서천군 마서면 계동리 마서면, 장항읍 일원 장항국가생태산업단지 내 A-1BL
5th row경기도 시흥시 정왕동 시흥배곧신도시 B11BL
ValueCountFrequency (%)
서울특별시 1290
 
6.3%
경기도 1041
 
5.1%
일원 600
 
2.9%
350
 
1.7%
인천광역시 280
 
1.4%
강서구 192
 
0.9%
176
 
0.9%
강남구 166
 
0.8%
경상남도 149
 
0.7%
경상북도 138
 
0.7%
Other values (6128) 16121
78.6%
2024-01-14T22:54:42.838621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17025
 
17.5%
3944
 
4.1%
3851
 
4.0%
1 3393
 
3.5%
3325
 
3.4%
2648
 
2.7%
- 2389
 
2.5%
2 2324
 
2.4%
2146
 
2.2%
2025
 
2.1%
Other values (457) 54244
55.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59729
61.4%
Space Separator 17025
 
17.5%
Decimal Number 15494
 
15.9%
Dash Punctuation 2389
 
2.5%
Uppercase Letter 1229
 
1.3%
Other Punctuation 543
 
0.6%
Close Punctuation 445
 
0.5%
Open Punctuation 443
 
0.5%
Lowercase Letter 14
 
< 0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3944
 
6.6%
3851
 
6.4%
3325
 
5.6%
2648
 
4.4%
2146
 
3.6%
2025
 
3.4%
1478
 
2.5%
1470
 
2.5%
1454
 
2.4%
1385
 
2.3%
Other values (410) 36003
60.3%
Uppercase Letter
ValueCountFrequency (%)
B 411
33.4%
L 332
27.0%
A 219
17.8%
C 84
 
6.8%
D 46
 
3.7%
S 37
 
3.0%
M 32
 
2.6%
H 16
 
1.3%
E 10
 
0.8%
P 8
 
0.7%
Other values (9) 34
 
2.8%
Decimal Number
ValueCountFrequency (%)
1 3393
21.9%
2 2324
15.0%
3 1709
11.0%
4 1353
 
8.7%
5 1343
 
8.7%
6 1245
 
8.0%
0 1076
 
6.9%
7 1063
 
6.9%
9 996
 
6.4%
8 992
 
6.4%
Other Punctuation
ValueCountFrequency (%)
, 526
96.9%
' 6
 
1.1%
/ 4
 
0.7%
· 4
 
0.7%
: 2
 
0.4%
. 1
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
b 8
57.1%
c 4
28.6%
d 1
 
7.1%
a 1
 
7.1%
Close Punctuation
ValueCountFrequency (%)
) 441
99.1%
] 4
 
0.9%
Open Punctuation
ValueCountFrequency (%)
( 439
99.1%
[ 4
 
0.9%
Space Separator
ValueCountFrequency (%)
17025
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2389
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59729
61.4%
Common 36342
37.3%
Latin 1243
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3944
 
6.6%
3851
 
6.4%
3325
 
5.6%
2648
 
4.4%
2146
 
3.6%
2025
 
3.4%
1478
 
2.5%
1470
 
2.5%
1454
 
2.4%
1385
 
2.3%
Other values (410) 36003
60.3%
Common
ValueCountFrequency (%)
17025
46.8%
1 3393
 
9.3%
- 2389
 
6.6%
2 2324
 
6.4%
3 1709
 
4.7%
4 1353
 
3.7%
5 1343
 
3.7%
6 1245
 
3.4%
0 1076
 
3.0%
7 1063
 
2.9%
Other values (14) 3422
 
9.4%
Latin
ValueCountFrequency (%)
B 411
33.1%
L 332
26.7%
A 219
17.6%
C 84
 
6.8%
D 46
 
3.7%
S 37
 
3.0%
M 32
 
2.6%
H 16
 
1.3%
E 10
 
0.8%
P 8
 
0.6%
Other values (13) 48
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59728
61.4%
ASCII 37579
38.6%
None 4
 
< 0.1%
Geometric Shapes 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17025
45.3%
1 3393
 
9.0%
- 2389
 
6.4%
2 2324
 
6.2%
3 1709
 
4.5%
4 1353
 
3.6%
5 1343
 
3.6%
6 1245
 
3.3%
0 1076
 
2.9%
7 1063
 
2.8%
Other values (35) 4659
 
12.4%
Hangul
ValueCountFrequency (%)
3944
 
6.6%
3851
 
6.4%
3325
 
5.6%
2648
 
4.4%
2146
 
3.6%
2025
 
3.4%
1478
 
2.5%
1470
 
2.5%
1454
 
2.4%
1385
 
2.3%
Other values (409) 36002
60.3%
None
ValueCountFrequency (%)
· 4
100.0%
Geometric Shapes
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

연면적
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct1962
Distinct (%)94.4%
Missing1886
Missing (%)47.6%
Infinite0
Infinite (%)0.0%
Mean64976.663
Minimum166.15
Maximum2216298.6
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size35.0 KiB
2024-01-14T22:54:42.993325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum166.15
5-th percentile2989.064
Q15927.98
median19787.89
Q398263.775
95-th percentile228875.14
Maximum2216298.6
Range2216132.4
Interquartile range (IQR)92335.795

Descriptive statistics

Standard deviation109291.23
Coefficient of variation (CV)1.6820075
Kurtosis140.45023
Mean64976.663
Median Absolute Deviation (MAD)16322.94
Skewness8.3693454
Sum1.3508648 × 108
Variance1.1944573 × 1010
MonotonicityNot monotonic
2024-01-14T22:54:43.166438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
94804.59 5
 
0.1%
7616.49 3
 
0.1%
172336.418 3
 
0.1%
17336.164 2
 
0.1%
51128.667 2
 
0.1%
6980.86 2
 
0.1%
3116.08 2
 
0.1%
28485.49 2
 
0.1%
209384.55 2
 
0.1%
5859.08 2
 
0.1%
Other values (1952) 2054
51.8%
(Missing) 1886
47.6%
ValueCountFrequency (%)
166.15 1
< 0.1%
196.84 2
0.1%
306.0 2
0.1%
430.8 1
< 0.1%
433.25 2
0.1%
499.79 1
< 0.1%
729.04 1
< 0.1%
795.66 1
< 0.1%
801.04 1
< 0.1%
808.0 1
< 0.1%
ValueCountFrequency (%)
2216298.57 1
< 0.1%
2163961.07 1
< 0.1%
755721.943 1
< 0.1%
673754.28 1
< 0.1%
673552.93 1
< 0.1%
662314.126 1
< 0.1%
658007.225 1
< 0.1%
650907.408 1
< 0.1%
590683.53 1
< 0.1%
556547.555 1
< 0.1%

지상층수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct50
Distinct (%)2.4%
Missing1909
Missing (%)48.1%
Infinite0
Infinite (%)0.0%
Mean16.019455
Minimum0
Maximum59
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size35.0 KiB
2024-01-14T22:54:43.344321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q15
median15
Q325
95-th percentile35
Maximum59
Range59
Interquartile range (IQR)20

Descriptive statistics

Standard deviation11.376957
Coefficient of variation (CV)0.71019623
Kurtosis-0.23647152
Mean16.019455
Median Absolute Deviation (MAD)10
Skewness0.66209135
Sum32936
Variance129.43514
MonotonicityNot monotonic
2024-01-14T22:54:43.536514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4 194
 
4.9%
29 159
 
4.0%
3 130
 
3.3%
5 129
 
3.3%
20 127
 
3.2%
25 119
 
3.0%
15 118
 
3.0%
7 78
 
2.0%
6 77
 
1.9%
2 74
 
1.9%
Other values (40) 851
21.5%
(Missing) 1909
48.1%
ValueCountFrequency (%)
0 1
 
< 0.1%
1 18
 
0.5%
2 74
 
1.9%
3 130
3.3%
4 194
4.9%
5 129
3.3%
6 77
 
1.9%
7 78
2.0%
8 46
 
1.2%
9 29
 
0.7%
ValueCountFrequency (%)
59 1
 
< 0.1%
57 1
 
< 0.1%
49 22
0.6%
48 4
 
0.1%
47 6
 
0.2%
46 2
 
0.1%
45 5
 
0.1%
43 6
 
0.2%
42 4
 
0.1%
40 9
0.2%

지하층수
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct11
Distinct (%)0.5%
Missing1912
Missing (%)48.2%
Infinite0
Infinite (%)0.0%
Mean2.2459815
Minimum0
Maximum29
Zeros109
Zeros (%)2.7%
Negative0
Negative (%)0.0%
Memory size35.0 KiB
2024-01-14T22:54:43.652538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q33
95-th percentile5
Maximum29
Range29
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.632811
Coefficient of variation (CV)0.7269922
Kurtosis36.011619
Mean2.2459815
Median Absolute Deviation (MAD)1
Skewness3.1655272
Sum4611
Variance2.6660718
MonotonicityNot monotonic
2024-01-14T22:54:43.754880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
2 663
 
16.7%
1 633
 
16.0%
3 290
 
7.3%
4 173
 
4.4%
0 109
 
2.7%
5 100
 
2.5%
6 49
 
1.2%
7 20
 
0.5%
8 8
 
0.2%
9 7
 
0.2%
(Missing) 1912
48.2%
ValueCountFrequency (%)
0 109
 
2.7%
1 633
16.0%
2 663
16.7%
3 290
7.3%
4 173
 
4.4%
5 100
 
2.5%
6 49
 
1.2%
7 20
 
0.5%
8 8
 
0.2%
9 7
 
0.2%
ValueCountFrequency (%)
29 1
 
< 0.1%
9 7
 
0.2%
8 8
 
0.2%
7 20
 
0.5%
6 49
 
1.2%
5 100
 
2.5%
4 173
 
4.4%
3 290
7.3%
2 663
16.7%
1 633
16.0%

Interactions

2024-01-14T22:54:36.903179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-14T22:54:36.162323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-14T22:54:36.519065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-14T22:54:37.023995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-14T22:54:36.286510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-14T22:54:36.653771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-14T22:54:37.157226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-14T22:54:36.398741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-14T22:54:36.777363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-14T22:54:43.836059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도인증구분인증대상 건축물군인증등급연면적지상층수지하층수
시도1.0000.0650.4310.3730.0000.4440.278
인증구분0.0651.0000.2600.0880.0000.0000.000
인증대상 건축물군0.4310.2601.0000.4580.1250.7020.422
인증등급0.3730.0880.4581.0000.2720.2790.391
연면적0.0000.0000.1250.2721.0000.4650.145
지상층수0.4440.0000.7020.2790.4651.0000.458
지하층수0.2780.0000.4220.3910.1450.4581.000
2024-01-14T22:54:43.930268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도인증구분인증등급
시도1.0000.0330.193
인증구분0.0331.0000.066
인증등급0.1930.0661.000
2024-01-14T22:54:44.019151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연면적지상층수지하층수시도인증구분인증등급
연면적1.0000.7820.4390.0000.0000.104
지상층수0.7821.0000.5130.1800.0000.119
지하층수0.4390.5131.0000.1400.0000.155
시도0.0000.1800.1401.0000.0330.193
인증구분0.0000.0000.0000.0331.0000.066
인증등급0.1040.1190.1550.1930.0661.000

Missing values

2024-01-14T22:54:37.305690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-14T22:54:37.498883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-14T22:54:37.656066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

인증번호시도인증기준인증구분건축물명인증대상 건축물군인증등급인증일자건축물 주소연면적지상층수지하층수
02015-001경기도녹색본인증고양원흥 2초등학교(경비실 제외)학교시설일반(그린4등급)2015-01-08경기도 고양시 덕양구 도내동 1012(고양원흥보금자리주택지구 내 초2용지)<NA><NA><NA>
12015-002서울특별시녹색본인증마곡 에스비타운그 밖의 건축물우수(그린2등급)2015-01-08서울특별시 강서구 방화동 마곡지구 상업용지 B2-2BL<NA><NA><NA>
22015-003세종특별자치시녹색본인증NH농협 세종통합센터업무용 건축물우량(그린3등급)2015-01-16세종특별자치시 어진동 63-2(행복도시 1-5생활권 C3-4)<NA><NA><NA>
32015-004충청남도녹색예비인증장항국가생태산단 A-1BL공동주택 건설공사공동주택일반(그린4등급)2015-01-16충청남도 서천군 마서면 계동리 마서면, 장항읍 일원 장항국가생태산업단지 내 A-1BL<NA><NA><NA>
42015-005경기도녹색예비인증시흥배곧신도시B11BL 호반베르디움공동주택일반(그린4등급)2015-01-21경기도 시흥시 정왕동 시흥배곧신도시 B11BL<NA><NA><NA>
52015-006세종특별자치시녹색예비인증소담고등학교학교시설일반(그린4등급)2015-01-21세종특별자치시 소담동 3-3생활권 소담동 산9-147대 일원<NA><NA><NA>
62015-007경기도녹색본인증에스케이뷰 파크 아파트공동주택우량(그린3등급)2015-01-29경기도 화성시 반월동 777번지 일원(동탄원천로 382-37)<NA><NA><NA>
72015-008서울특별시녹색예비인증강서세무서 청사업무용 건축물우수(그린2등급)2015-01-29서울특별시 강서구 공항동 944<NA><NA><NA>
82015-009서울특별시녹색예비인증여의도 K-Tower 신축공사업무용 건축물우수(그린2등급)2015-02-02서울특별시 영등포구 여의도동 45-1<NA><NA><NA>
92015-010서울특별시녹색예비인증대치동빌딩 신축공사업무용 건축물일반(그린4등급)2015-02-09서울특별시 강남구 대치동 942-15<NA><NA><NA>
인증번호시도인증기준인증구분건축물명인증대상 건축물군인증등급인증일자건축물 주소연면적지상층수지하층수
3955G-SEED-C-2023-1065-5경기도녹색본인증식사2지구 상1-5BL 근린생활시설일반건축물일반(그린4등급)2023-12-26경기도 고양시 일산동구 원중1길 164368.85102
3956G-SEED-P-2023-1425-5충청남도녹색예비인증(가칭)홍성주촌중학교 신축공사학교시설일반(그린4등급)2023-12-26충청남도 홍성군 홍북읍 신경리 120818699.0341
3957G-SEED-P-2023-1424-5서울특별시녹색예비인증성수동2가 278-3번지 외2필지 업무시설 신축공사업무용건축물우수(그린2등급)2023-12-26서울특별시 성동구 성수동2가 278-3번지 외 2필지60661.9186
3958G-SEED-P-2023-1423-5울산광역시녹색예비인증그린스마트 스쿨 농소중 등 2교(농소중학교)학교시설우량(그린3등급)2023-12-26울산광역시 북구 호계7길 13(호계동 726)14505.0941
3959G-SEED-P-2023-1421-5경상남도녹색예비인증창원 가음4구역 재건축사업공동주택우수(그린2등급)2023-12-26경상남도 창원시 성산구 가음동 24-1 일원85531.1936332
3960G-SEED-P-2023-1438-5경기도녹색예비인증세림 Plaza일반건축물우량(그린3등급)2023-12-28경기도 광주시 중앙로 136 외 1필지10985.6572
3961G-SEED-P-2023-1437-5서울특별시녹색예비인증장안동 현대아파트 재건축정비사업공동주택최우수(그린1등급)2023-12-28서울특별시 동대문구 장안동 95-1번지 일대125769.15293
3962G-SEED-P-2023-1436-5서울특별시녹색예비인증VIP 재활병동일반건축물일반(그린4등급)2023-12-28서울특별시 영등포구 대림동 664-17번지3305.373
3963G-SEED-P-2023-1440-5충청남도녹색예비인증공주 월송지구 B-4BL 공동주택공동주택우량(그린3등급)2023-12-29충청남도 공주시 월송동 671-153691.2671202
3964G-SEED-P-2023-1441-5서울특별시녹색예비인증천왕역세권 도시환경정비사업공동주택우수(그린2등급)2023-12-29서울특별시 구로구 오류동73339.78264