Overview

Dataset statistics

Number of variables5
Number of observations59
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 KiB
Average record size in memory43.2 B

Variable types

Categorical1
Text2
Numeric1
DateTime1

Dataset

Description경기도 의왕시에서 담당하는 공공건축물에 대한 정보를 제공하고 있습니다.명칭, 주소, 면적, 취득일 등에 대한 정보를 제공하고 있습니다.
Author경기도 의왕시
URLhttps://www.data.go.kr/data/15112879/fileData.do

Alerts

시군명 has constant value ""Constant
명칭 has unique valuesUnique
면적 has unique valuesUnique

Reproduction

Analysis started2024-04-29 23:03:37.865819
Analysis finished2024-04-29 23:03:40.038867
Duration2.17 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size604.0 B
의왕시
59 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의왕시
2nd row의왕시
3rd row의왕시
4th row의왕시
5th row의왕시

Common Values

ValueCountFrequency (%)
의왕시 59
100.0%

Length

2024-04-30T08:03:40.110544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T08:03:40.214370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
의왕시 59
100.0%

명칭
Text

UNIQUE 

Distinct59
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size604.0 B
2024-04-30T08:03:40.420182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length17
Mean length9.559322
Min length4

Characters and Unicode

Total characters564
Distinct characters162
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique59 ?
Unique (%)100.0%

Sample

1st row청소년수련관(체육관)
2nd row청소년수련관(본관)
3rd row노인복지회관
4th row보건소(본관동)
5th row보건소(정신보건센터)
ValueCountFrequency (%)
의왕시 3
 
3.5%
주민센터 2
 
2.3%
내손1동 2
 
2.3%
백운커뮤니티2 1
 
1.2%
의왕시복지회관 1
 
1.2%
글로벌인재센터 1
 
1.2%
의왕국민체육센터 1
 
1.2%
밝은누리어린이집 1
 
1.2%
백운커뮤니티센터 1
 
1.2%
주차타워 1
 
1.2%
Other values (72) 72
83.7%
2024-04-30T08:03:40.786453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27
 
4.8%
24
 
4.3%
20
 
3.5%
18
 
3.2%
17
 
3.0%
16
 
2.8%
15
 
2.7%
15
 
2.7%
14
 
2.5%
( 14
 
2.5%
Other values (152) 384
68.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 494
87.6%
Space Separator 27
 
4.8%
Open Punctuation 14
 
2.5%
Close Punctuation 14
 
2.5%
Decimal Number 13
 
2.3%
Dash Punctuation 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
4.9%
20
 
4.0%
18
 
3.6%
17
 
3.4%
16
 
3.2%
15
 
3.0%
15
 
3.0%
14
 
2.8%
12
 
2.4%
11
 
2.2%
Other values (142) 332
67.2%
Decimal Number
ValueCountFrequency (%)
1 5
38.5%
6 2
 
15.4%
0 2
 
15.4%
2 2
 
15.4%
4 1
 
7.7%
7 1
 
7.7%
Space Separator
ValueCountFrequency (%)
27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 494
87.6%
Common 70
 
12.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
4.9%
20
 
4.0%
18
 
3.6%
17
 
3.4%
16
 
3.2%
15
 
3.0%
15
 
3.0%
14
 
2.8%
12
 
2.4%
11
 
2.2%
Other values (142) 332
67.2%
Common
ValueCountFrequency (%)
27
38.6%
( 14
20.0%
) 14
20.0%
1 5
 
7.1%
6 2
 
2.9%
0 2
 
2.9%
- 2
 
2.9%
2 2
 
2.9%
4 1
 
1.4%
7 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 494
87.6%
ASCII 70
 
12.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
27
38.6%
( 14
20.0%
) 14
20.0%
1 5
 
7.1%
6 2
 
2.9%
0 2
 
2.9%
- 2
 
2.9%
2 2
 
2.9%
4 1
 
1.4%
7 1
 
1.4%
Hangul
ValueCountFrequency (%)
24
 
4.9%
20
 
4.0%
18
 
3.6%
17
 
3.4%
16
 
3.2%
15
 
3.0%
15
 
3.0%
14
 
2.8%
12
 
2.4%
11
 
2.2%
Other values (142) 332
67.2%

주소
Text

Distinct51
Distinct (%)86.4%
Missing0
Missing (%)0.0%
Memory size604.0 B
2024-04-30T08:03:41.002608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length24
Mean length16.525424
Min length14

Characters and Unicode

Total characters975
Distinct characters53
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)76.3%

Sample

1st row경기도 의왕시 고천동 100-1
2nd row경기도 의왕시 고천동 100-1
3rd row경기도 의왕시 고천동 100
4th row경기도 의왕시 고천동 108
5th row경기도 의왕시 고천동 108
ValueCountFrequency (%)
경기도 59
24.7%
의왕시 59
24.7%
고천동 15
 
6.3%
오전동 11
 
4.6%
삼동 8
 
3.3%
내손동 6
 
2.5%
월암동 5
 
2.1%
포일동 5
 
2.1%
왕곡동 4
 
1.7%
171 4
 
1.7%
Other values (55) 63
26.4%
2024-04-30T08:03:41.332605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
180
18.5%
64
 
6.6%
62
 
6.4%
60
 
6.2%
59
 
6.1%
59
 
6.1%
59
 
6.1%
59
 
6.1%
1 45
 
4.6%
2 31
 
3.2%
Other values (43) 297
30.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 547
56.1%
Decimal Number 218
 
22.4%
Space Separator 180
 
18.5%
Dash Punctuation 30
 
3.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
64
11.7%
62
11.3%
60
11.0%
59
10.8%
59
10.8%
59
10.8%
59
10.8%
15
 
2.7%
15
 
2.7%
11
 
2.0%
Other values (31) 84
15.4%
Decimal Number
ValueCountFrequency (%)
1 45
20.6%
2 31
14.2%
8 26
11.9%
6 21
9.6%
4 20
9.2%
7 19
8.7%
0 17
 
7.8%
5 15
 
6.9%
3 14
 
6.4%
9 10
 
4.6%
Space Separator
ValueCountFrequency (%)
180
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 547
56.1%
Common 428
43.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
64
11.7%
62
11.3%
60
11.0%
59
10.8%
59
10.8%
59
10.8%
59
10.8%
15
 
2.7%
15
 
2.7%
11
 
2.0%
Other values (31) 84
15.4%
Common
ValueCountFrequency (%)
180
42.1%
1 45
 
10.5%
2 31
 
7.2%
- 30
 
7.0%
8 26
 
6.1%
6 21
 
4.9%
4 20
 
4.7%
7 19
 
4.4%
0 17
 
4.0%
5 15
 
3.5%
Other values (2) 24
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 547
56.1%
ASCII 428
43.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
180
42.1%
1 45
 
10.5%
2 31
 
7.2%
- 30
 
7.0%
8 26
 
6.1%
6 21
 
4.9%
4 20
 
4.7%
7 19
 
4.4%
0 17
 
4.0%
5 15
 
3.5%
Other values (2) 24
 
5.6%
Hangul
ValueCountFrequency (%)
64
11.7%
62
11.3%
60
11.0%
59
10.8%
59
10.8%
59
10.8%
59
10.8%
15
 
2.7%
15
 
2.7%
11
 
2.0%
Other values (31) 84
15.4%

면적
Real number (ℝ)

UNIQUE 

Distinct59
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3435.6915
Minimum105.6
Maximum17819.41
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size663.0 B
2024-04-30T08:03:41.490915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum105.6
5-th percentile150.142
Q1755.46
median2205.53
Q34792.265
95-th percentile8890.949
Maximum17819.41
Range17713.81
Interquartile range (IQR)4036.805

Descriptive statistics

Standard deviation3725.0446
Coefficient of variation (CV)1.0842198
Kurtosis5.3464255
Mean3435.6915
Median Absolute Deviation (MAD)1560.12
Skewness2.0596902
Sum202705.8
Variance13875958
MonotonicityNot monotonic
2024-04-30T08:03:41.631011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
776.0 1
 
1.7%
4418.0 1
 
1.7%
725.66 1
 
1.7%
844.28 1
 
1.7%
495.06 1
 
1.7%
698.13 1
 
1.7%
4908.67 1
 
1.7%
7990.39 1
 
1.7%
645.41 1
 
1.7%
2640.87 1
 
1.7%
Other values (49) 49
83.1%
ValueCountFrequency (%)
105.6 1
1.7%
130.0 1
1.7%
136.3 1
1.7%
151.68 1
1.7%
216.66 1
1.7%
271.2 1
1.7%
495.06 1
1.7%
548.9 1
1.7%
567.08 1
1.7%
645.41 1
1.7%
ValueCountFrequency (%)
17819.41 1
1.7%
17246.27 1
1.7%
10497.17 1
1.7%
8712.48 1
1.7%
7990.39 1
1.7%
7508.23 1
1.7%
7453.48 1
1.7%
7267.5 1
1.7%
6941.67 1
1.7%
6595.07 1
1.7%
Distinct51
Distinct (%)86.4%
Missing0
Missing (%)0.0%
Memory size604.0 B
Minimum1986-06-10 00:00:00
Maximum2022-12-30 00:00:00
2024-04-30T08:03:41.765511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T08:03:41.887509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-04-30T08:03:39.733223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T08:03:41.967904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
명칭주소면적취득일
명칭1.0001.0001.0001.000
주소1.0001.0000.0001.000
면적1.0000.0001.0000.000
취득일1.0001.0000.0001.000

Missing values

2024-04-30T08:03:39.901769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T08:03:39.989861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명명칭주소면적취득일
0의왕시청소년수련관(체육관)경기도 의왕시 고천동 100-1776.02008-02-14
1의왕시청소년수련관(본관)경기도 의왕시 고천동 100-14418.02008-02-14
2의왕시노인복지회관경기도 의왕시 고천동 1003760.972008-02-14
3의왕시보건소(본관동)경기도 의왕시 고천동 1086231.652004-01-27
4의왕시보건소(정신보건센터)경기도 의왕시 고천동 108648.682004-01-27
5의왕시의왕시중앙도서관경기도 의왕시 고천동 1597267.52007-05-17
6의왕시의왕시청(행정동)경기도 의왕시 고천동 17110497.171993-10-18
7의왕시의왕시청(민원동)경기도 의왕시 고천동 1713621.831993-10-18
8의왕시의왕시청(의회동)경기도 의왕시 고천동 1712205.531993-10-18
9의왕시시 청사 직장어린이집경기도 의왕시 고천동 171654.852022-12-30
시군명명칭주소면적취득일
49의왕시청계동주민센터경기도 의왕시 포일동 602-153777.062010-12-22
50의왕시산빛근린공원 공영주차장 조성비경기도 의왕시 포일동 6544675.862022-01-13
51의왕시포일어울림센터경기도 의왕시 포일동 68617819.412020-11-25
52의왕시포일스포츠센터경기도 의왕시 포일동 6865536.372020-11-25
53의왕시거점공간 들락날락경기도 의왕시 포일동 687136.32021-04-13
54의왕시부곡체육공원경기도 의왕시 월암동 421734.922006-01-06
55의왕시의왕조류생태과학관경기도 의왕시 월암동 525-101980.02012-03-15
56의왕시의왕레일바이크매표소 미디어체험관경기도 의왕시 월암동 541130.02016-11-25
57의왕시부곡자연학습공원경기도 의왕시 월암동 543-3946.72001-11-28
58의왕시월암동 공영차고지경기도 의왕시 월암동 576-2105.61998-12-08