Overview

Dataset statistics

Number of variables6
Number of observations537
Missing cells0
Missing cells (%)0.0%
Duplicate rows5
Duplicate rows (%)0.9%
Total size in memory25.8 KiB
Average record size in memory49.2 B

Variable types

Categorical2
Text2
Numeric1
DateTime1

Dataset

Description안양시에 존재하는 공공건축물현황(시군명, 명칭, 주소(도로명 or 지번), 면적, 취득일, 데이터기준일자)입니다.
URLhttps://www.data.go.kr/data/15114534/fileData.do

Alerts

시군명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
Dataset has 5 (0.9%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 01:09:16.157471
Analysis finished2023-12-12 01:09:16.742852
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
안양시
537 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안양시
2nd row안양시
3rd row안양시
4th row안양시
5th row안양시

Common Values

ValueCountFrequency (%)
안양시 537
100.0%

Length

2023-12-12T10:09:16.810773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:09:16.914824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
안양시 537
100.0%

명칭
Text

Distinct493
Distinct (%)91.8%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
2023-12-12T10:09:17.137142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length17
Mean length8.7374302
Min length3

Characters and Unicode

Total characters4692
Distinct characters304
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique474 ?
Unique (%)88.3%

Sample

1st row희성놀이터화장실
2nd row후생관(구내식당)
3rd row화창경로당
4th row화장실및 음수대
5th row평화공원 화장실
ValueCountFrequency (%)
관양두산벤처다임 21
 
3.1%
행정복지센터 17
 
2.5%
박달하수처리장 15
 
2.2%
다목적복지회관 15
 
2.2%
화장실 7
 
1.0%
만안구청 7
 
1.0%
농수산물도매시장 6
 
0.9%
시내버스공영차고지 5
 
0.7%
김중업관 4
 
0.6%
지하주차장 4
 
0.6%
Other values (509) 569
84.9%
2023-12-12T10:09:17.561339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
223
 
4.8%
211
 
4.5%
161
 
3.4%
149
 
3.2%
134
 
2.9%
127
 
2.7%
106
 
2.3%
92
 
2.0%
88
 
1.9%
87
 
1.9%
Other values (294) 3314
70.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4240
90.4%
Decimal Number 192
 
4.1%
Space Separator 134
 
2.9%
Close Punctuation 40
 
0.9%
Open Punctuation 40
 
0.9%
Uppercase Letter 33
 
0.7%
Dash Punctuation 13
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
223
 
5.3%
211
 
5.0%
161
 
3.8%
149
 
3.5%
127
 
3.0%
106
 
2.5%
92
 
2.2%
88
 
2.1%
87
 
2.1%
80
 
1.9%
Other values (270) 2916
68.8%
Decimal Number
ValueCountFrequency (%)
1 76
39.6%
2 61
31.8%
3 14
 
7.3%
7 8
 
4.2%
9 8
 
4.2%
8 7
 
3.6%
5 6
 
3.1%
4 6
 
3.1%
6 4
 
2.1%
0 2
 
1.0%
Uppercase Letter
ValueCountFrequency (%)
B 7
21.2%
A 6
18.2%
C 5
15.2%
D 5
15.2%
E 4
12.1%
F 2
 
6.1%
V 1
 
3.0%
H 1
 
3.0%
G 1
 
3.0%
U 1
 
3.0%
Space Separator
ValueCountFrequency (%)
134
100.0%
Close Punctuation
ValueCountFrequency (%)
) 40
100.0%
Open Punctuation
ValueCountFrequency (%)
( 40
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4240
90.4%
Common 419
 
8.9%
Latin 33
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
223
 
5.3%
211
 
5.0%
161
 
3.8%
149
 
3.5%
127
 
3.0%
106
 
2.5%
92
 
2.2%
88
 
2.1%
87
 
2.1%
80
 
1.9%
Other values (270) 2916
68.8%
Common
ValueCountFrequency (%)
134
32.0%
1 76
18.1%
2 61
14.6%
) 40
 
9.5%
( 40
 
9.5%
3 14
 
3.3%
- 13
 
3.1%
7 8
 
1.9%
9 8
 
1.9%
8 7
 
1.7%
Other values (4) 18
 
4.3%
Latin
ValueCountFrequency (%)
B 7
21.2%
A 6
18.2%
C 5
15.2%
D 5
15.2%
E 4
12.1%
F 2
 
6.1%
V 1
 
3.0%
H 1
 
3.0%
G 1
 
3.0%
U 1
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4240
90.4%
ASCII 452
 
9.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
223
 
5.3%
211
 
5.0%
161
 
3.8%
149
 
3.5%
127
 
3.0%
106
 
2.5%
92
 
2.2%
88
 
2.1%
87
 
2.1%
80
 
1.9%
Other values (270) 2916
68.8%
ASCII
ValueCountFrequency (%)
134
29.6%
1 76
16.8%
2 61
13.5%
) 40
 
8.8%
( 40
 
8.8%
3 14
 
3.1%
- 13
 
2.9%
7 8
 
1.8%
9 8
 
1.8%
8 7
 
1.5%
Other values (14) 51
 
11.3%

주소
Text

Distinct267
Distinct (%)49.7%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
2023-12-12T10:09:17.860132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length37
Mean length20.824953
Min length14

Characters and Unicode

Total characters11183
Distinct characters119
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique210 ?
Unique (%)39.1%

Sample

1st row경기도 안양시 동안구 비산동 1101-3
2nd row경기도 안양시 만안구 안양로 175
3rd row경기도 안양시 만안구 화창로 84
4th row경기도 안양시 동안구 관양동 1609
5th row경기도 안양시 동안구 비산동 1104-2
ValueCountFrequency (%)
경기도 537
19.8%
안양시 501
18.5%
동안구 256
 
9.4%
만안구 245
 
9.0%
비산동 60
 
2.2%
의왕시 36
 
1.3%
안양로 30
 
1.1%
29
 
1.1%
예술공원로103번길 28
 
1.0%
4 27
 
1.0%
Other values (371) 962
35.5%
2023-12-12T10:09:18.308708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2174
19.4%
1134
 
10.1%
598
 
5.3%
556
 
5.0%
551
 
4.9%
538
 
4.8%
537
 
4.8%
508
 
4.5%
1 418
 
3.7%
404
 
3.6%
Other values (109) 3765
33.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6966
62.3%
Space Separator 2174
 
19.4%
Decimal Number 1932
 
17.3%
Dash Punctuation 106
 
0.9%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1134
16.3%
598
8.6%
556
 
8.0%
551
 
7.9%
538
 
7.7%
537
 
7.7%
508
 
7.3%
404
 
5.8%
400
 
5.7%
254
 
3.6%
Other values (94) 1486
21.3%
Decimal Number
ValueCountFrequency (%)
1 418
21.6%
3 234
12.1%
5 227
11.7%
2 217
11.2%
6 174
9.0%
4 164
 
8.5%
0 153
 
7.9%
9 130
 
6.7%
8 114
 
5.9%
7 101
 
5.2%
Space Separator
ValueCountFrequency (%)
2174
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 106
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6966
62.3%
Common 4216
37.7%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1134
16.3%
598
8.6%
556
 
8.0%
551
 
7.9%
538
 
7.7%
537
 
7.7%
508
 
7.3%
404
 
5.8%
400
 
5.7%
254
 
3.6%
Other values (94) 1486
21.3%
Common
ValueCountFrequency (%)
2174
51.6%
1 418
 
9.9%
3 234
 
5.6%
5 227
 
5.4%
2 217
 
5.1%
6 174
 
4.1%
4 164
 
3.9%
0 153
 
3.6%
9 130
 
3.1%
8 114
 
2.7%
Other values (4) 211
 
5.0%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6966
62.3%
ASCII 4217
37.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2174
51.6%
1 418
 
9.9%
3 234
 
5.5%
5 227
 
5.4%
2 217
 
5.1%
6 174
 
4.1%
4 164
 
3.9%
0 153
 
3.6%
9 130
 
3.1%
8 114
 
2.7%
Other values (5) 212
 
5.0%
Hangul
ValueCountFrequency (%)
1134
16.3%
598
8.6%
556
 
8.0%
551
 
7.9%
538
 
7.7%
537
 
7.7%
508
 
7.3%
404
 
5.8%
400
 
5.7%
254
 
3.6%
Other values (94) 1486
21.3%

면적
Real number (ℝ)

Distinct443
Distinct (%)82.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1551.0016
Minimum1
Maximum72915.77
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.8 KiB
2023-12-12T10:09:18.466486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.96
Q152.11
median237.92
Q3983.5
95-th percentile5931.704
Maximum72915.77
Range72914.77
Interquartile range (IQR)931.39

Descriptive statistics

Standard deviation5264.2873
Coefficient of variation (CV)3.3941211
Kurtosis85.321888
Mean1551.0016
Median Absolute Deviation (MAD)209.09
Skewness8.2182836
Sum832887.86
Variance27712721
MonotonicityNot monotonic
2023-12-12T10:09:18.659208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12.96 20
 
3.7%
232.81 13
 
2.4%
36.0 6
 
1.1%
29.0 4
 
0.7%
244.19 4
 
0.7%
40.0 4
 
0.7%
20.25 4
 
0.7%
36.63 3
 
0.6%
34.0 3
 
0.6%
4.0 3
 
0.6%
Other values (433) 473
88.1%
ValueCountFrequency (%)
1.0 1
 
0.2%
3.89 1
 
0.2%
4.0 3
0.6%
5.0 1
 
0.2%
7.2 1
 
0.2%
7.3 1
 
0.2%
7.6 1
 
0.2%
8.79 1
 
0.2%
9.2 2
0.4%
10.0 1
 
0.2%
ValueCountFrequency (%)
72915.77 1
0.2%
45451.78 1
0.2%
42664.94 1
0.2%
35254.2 1
0.2%
32580.0 1
0.2%
32361.74 1
0.2%
17840.4 1
0.2%
15824.0 1
0.2%
15621.25 1
0.2%
15188.4 1
0.2%
Distinct326
Distinct (%)60.7%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
Minimum1973-07-01 00:00:00
Maximum2023-02-06 00:00:00
2023-12-12T10:09:19.108081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:09:19.262977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.3 KiB
2023-06-14
537 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06-14
2nd row2023-06-14
3rd row2023-06-14
4th row2023-06-14
5th row2023-06-14

Common Values

ValueCountFrequency (%)
2023-06-14 537
100.0%

Length

2023-12-12T10:09:19.438365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:09:19.559868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-06-14 537
100.0%

Interactions

2023-12-12T10:09:16.452887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T10:09:16.593327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:09:16.701245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명명칭주소면적취득일데이터기준일자
0안양시희성놀이터화장실경기도 안양시 동안구 비산동 1101-312.961995-02-282023-06-14
1안양시후생관(구내식당)경기도 안양시 만안구 안양로 175315.02018-01-312023-06-14
2안양시화창경로당경기도 안양시 만안구 화창로 84144.181999-12-272023-06-14
3안양시화장실및 음수대경기도 안양시 동안구 관양동 160947.612006-05-232023-06-14
4안양시평화공원 화장실경기도 안양시 동안구 비산동 1104-261.672013-08-232023-06-14
5안양시호계 배드민턴장 화장실경기도 안양시 동안구 엘에스로144번길 8632.642006-09-212023-06-14
6안양시호현경로당경기도 안양시 만안구 박달로275번길 39175.561999-12-292023-06-14
7안양시호암배수지관리사택경기도 안양시 만안구 경수대로 135257.961993-10-022023-06-14
8안양시호암공원 화장실경기도 안양시 만안구 석수동 497-518.62016-04-302023-06-14
9안양시호계체육관경기도 안양시 동안구 호계동 177-012391.32008-08-062023-06-14
시군명명칭주소면적취득일데이터기준일자
527안양시1단계유입펌프동경기도 안양시 동안구 비산동 655707.01992-12-302023-06-14
528안양시1단계오니탈수기동경기도 안양시 동안구 비산동 583252.01992-12-302023-06-14
529안양시1단계슬러지탈수동경기도 안양시 동안구 비산동 6551596.01992-12-302023-06-14
530안양시1단계반송슬러지동경기도 안양시 동안구 비산동 655252.01992-12-302023-06-14
531안양시1단계모래여과및 염소투입동경기도 안양시 동안구 비산동 655872.01992-12-302023-06-14
532안양시1단계기존관리동사무실경기도 안양시 동안구 비산동 655161.31993-01-102023-06-14
533안양시1단계관리동경기도 안양시 동안구 비산동 6551063.01992-12-302023-06-14
534안양시1단계공기송풍기동경기도 안양시 동안구 비산동 655360.01992-12-302023-06-14
535안양시1단계경비실경기도 안양시 동안구 비산동 65521.01992-12-302023-06-14
536안양시1단계가스송풍기동경기도 안양시 동안구 비산동 655342.01992-12-302023-06-14

Duplicate rows

Most frequently occurring

시군명명칭주소면적취득일데이터기준일자# duplicates
0안양시관양두산벤처다임경기도 안양시 동안구 학의로 250232.812007-12-282023-06-1413
1안양시관양두산벤처다임경기도 안양시 동안구 학의로 250244.192007-12-282023-06-144
2안양시관양두산벤처다임경기도 안양시 동안구 학의로 250477.692007-12-282023-06-142
3안양시관양두산벤처다임경기도 안양시 동안구 학의로 250519.952007-12-282023-06-142
4안양시자유공원화장실경기도 안양시 동안구 평촌대로 8636.01995-12-052023-06-142