Overview

Dataset statistics

Number of variables: 5
Number of observations: 227
Missing cells: 1
Missing cells (%): 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.4 KiB
Average record size in memory: 42.6 B
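
The missing-cell percentage can be checked from the counts above. This is a minimal sketch that assumes the percentage is taken over all cells (observations times variables); the variable names are illustrative and not part of the profiling tool.

```python
# Counts copied from the dataset statistics above.
n_variables = 5
n_observations = 227
missing_cells = 1

# Assumption: the percentage is taken over all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

Under that assumption a single missing cell out of 1135 rounds to the reported 0.1%.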

Variable types

Categorical: 1
Text: 2
Numeric: 2

Dataset

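Description: Status of buildings in Uijeongbu-si, Gyeonggi-do that are subject to mechanical equipment performance inspection. The dataset lists building use (건축물용도), building name (건물명), road-name address (도로명주소), postal code (우편번호), and building area in m2 (건축물면적(m2)).
Author: Uijeongbu-si, Gyeonggi-do (경기도 의정부시)
URL: https://www.data.go.kr/data/15124728/fileData.do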

Alerts

건축물연먼적(m2) has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 02:01:21.103988
Analysis finished: 2023-12-12 02:01:22.020264
Duration: 0.92 seconds
Software version: ydata-profiling v4.5.1

Variables

건축물용도
Categorical

Distinct: 16
Distinct (%): 7.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Value | Count
공동주택(아파트) | 104
교육연구시설 | 37
업무시설 | 17
제1종근린생활시설 | 16
제2종근린생활시설 | 14
Other values (11) | 39

Length

Max length: 18
Median length: 9
Mean length: 7.4537445
Min length: 2

Unique

Unique: 4
Unique (%): 1.8%

Sample

1st row: 공동주택(아파트)
2nd row: 공동주택(아파트)
3rd row: 공동주택(아파트)
4th row: 공동주택(아파트)
5th row: 공동주택(아파트)

Common Values

Value | Count | Frequency (%)
공동주택(아파트) | 104 | 45.8%
교육연구시설 | 37 | 16.3%
업무시설 | 17 | 7.5%
제1종근린생활시설 | 16 | 7.0%
제2종근린생활시설 | 14 | 6.2%
판매시설 | 11 | 4.8%
공장 | 7 | 3.1%
자동차관련시설 | 5 | 2.2%
의료시설 | 4 | 1.8%
종교시설 | 3 | 1.3%
Other values (6) | 9 | 4.0%
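
The frequency column is each count divided by the number of observations. The sketch below recomputes it from the counts in the table; the helper name `frequency_pct` is illustrative, and the six categories that the report only gives in aggregate are kept as a single bucket.

```python
from collections import Counter

# Counts copied from the Common Values table; the remaining six categories
# are only reported in aggregate, so they stay in one bucket here.
counts = Counter({
    "공동주택(아파트)": 104,
    "교육연구시설": 37,
    "업무시설": 17,
    "제1종근린생활시설": 16,
    "제2종근린생활시설": 14,
    "판매시설": 11,
    "공장": 7,
    "자동차관련시설": 5,
    "의료시설": 4,
    "종교시설": 3,
    "Other values (6)": 9,
})

total = sum(counts.values())  # equals the number of observations


def frequency_pct(count: int, total: int) -> str:
    """Format a count as a percentage of the total with one decimal place."""
    return f"{100 * count / total:.1f}%"


for value, count in counts.most_common():
    print(value, count, frequency_pct(count, total))
```

The counts sum to 227, which matches the number of observations, so the percentages are taken over all rows.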

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
공동주택(아파트 | 104 | 45.6%
교육연구시설 | 37 | 16.2%
업무시설 | 17 | 7.5%
제1종근린생활시설 | 16 | 7.0%
제2종근린생활시설 | 14 | 6.1%
판매시설 | 11 | 4.8%
공장 | 7 | 3.1%
자동차관련시설 | 5 | 2.2%
의료시설 | 4 | 1.8%
종교시설 | 3 | 1.3%
Other values (7) | 10 | 4.4%

건물명
Text

Distinct: 222
Distinct (%): 98.2%
Missing: 1
Missing (%): 0.4%
Memory size: 1.9 KiB

Length

Max length: 24
Median length: 19
Mean length: 8.2389381
Min length: 2

Characters and Unicode

Total characters: 1862
Distinct characters: 279
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
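
As an illustration of such character properties, the sketch below counts Unicode general categories for the first sample value of this variable using Python's standard `unicodedata` module. That module exposes the general category but not script or block, so only categories are covered; this is an illustrative sketch, not the profiling tool's own implementation.

```python
import unicodedata
from collections import Counter

# First sample value of this variable, copied from its sample rows.
name = "민락센트럴(구 민락17단지)"

# unicodedata.category returns the two-letter General Category code,
# e.g. "Lo" (Other Letter), "Nd" (Decimal Number), "Zs" (Space Separator),
# "Ps" (Open Punctuation), "Pe" (Close Punctuation).
category_counts = Counter(unicodedata.category(ch) for ch in name)

for code, count in category_counts.most_common():
    print(code, count)
```

Hangul syllables fall under "Lo" (Other Letter), which is why that category dominates the counts reported for this variable.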

Unique

Unique: 220
Unique (%): 97.3%

Sample

1st row: 민락센트럴(구 민락17단지)
2nd row: 브라운빌리지
3rd row: 대광로제비앙포레스트
4th row: 호반베르디움 3차
5th row: 수락리버시티 2단지

Words

Value | Count | Frequency (%)
신곡 | 11 | 3.1%
장암 | 10 | 2.8%
호원 | 8 | 2.2%
의정부 | 8 | 2.2%
민락 | 5 | 1.4%
고산 | 5 | 1.4%
금오 | 5 | 1.4%
호반베르디움 | 3 | 0.8%
송산 | 3 | 0.8%
건영 | 3 | 0.8%
Other values (266) | 295 | 82.9%

Most occurring characters

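Value | Count | Frequency (%)
(space) | 130 | 7.0%
(lost) | 55 | 3.0%
(lost) | 46 | 2.5%
(lost) | 41 | 2.2%
(lost) | 39 | 2.1%
(lost) | 38 | 2.0%
(lost) | 36 | 1.9%
(lost) | 35 | 1.9%
(lost) | 32 | 1.7%
(lost) | 26 | 1.4%
Other values (269) | 1384 | 74.3%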

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1610 | 86.5%
Space Separator | 130 | 7.0%
Decimal Number | 64 | 3.4%
Uppercase Letter | 21 | 1.1%
Other Punctuation | 8 | 0.4%
Close Punctuation | 7 | 0.4%
Open Punctuation | 7 | 0.4%
Letter Number | 6 | 0.3%
Dash Punctuation | 5 | 0.3%
Lowercase Letter | 4 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(lost) | 55 | 3.4%
(lost) | 46 | 2.9%
(lost) | 41 | 2.5%
(lost) | 39 | 2.4%
(lost) | 38 | 2.4%
(lost) | 36 | 2.2%
(lost) | 35 | 2.2%
(lost) | 32 | 2.0%
(lost) | 26 | 1.6%
(lost) | 26 | 1.6%
Other values (238) | 1236 | 76.8%

Decimal Number
Value | Count | Frequency (%)
1 | 22 | 34.4%
2 | 15 | 23.4%
3 | 8 | 12.5%
4 | 7 | 10.9%
7 | 3 | 4.7%
0 | 2 | 3.1%
6 | 2 | 3.1%
5 | 2 | 3.1%
9 | 2 | 3.1%
8 | 1 | 1.6%

Uppercase Letter
Value | Count | Frequency (%)
I | 6 | 28.6%
H | 4 | 19.0%
L | 3 | 14.3%
S | 2 | 9.5%
K | 2 | 9.5%
G | 1 | 4.8%
T | 1 | 4.8%
N | 1 | 4.8%
F | 1 | 4.8%

Other Punctuation
Value | Count | Frequency (%)
, | 6 | 75.0%
. | 1 | 12.5%
& | 1 | 12.5%

Letter Number
Value | Count | Frequency (%)
(lost) | 3 | 50.0%
(lost) | 2 | 33.3%
(lost) | 1 | 16.7%

Lowercase Letter
Value | Count | Frequency (%)
e | 2 | 50.0%
c | 2 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 130 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 7 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 7 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 5 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1610 | 86.5%
Common | 221 | 11.9%
Latin | 31 | 1.7%

Most frequent character per script

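Hangul
Value | Count | Frequency (%)
(lost) | 55 | 3.4%
(lost) | 46 | 2.9%
(lost) | 41 | 2.5%
(lost) | 39 | 2.4%
(lost) | 38 | 2.4%
(lost) | 36 | 2.2%
(lost) | 35 | 2.2%
(lost) | 32 | 2.0%
(lost) | 26 | 1.6%
(lost) | 26 | 1.6%
Other values (238) | 1236 | 76.8%

Common
Value | Count | Frequency (%)
(space) | 130 | 58.8%
1 | 22 | 10.0%
2 | 15 | 6.8%
3 | 8 | 3.6%
4 | 7 | 3.2%
) | 7 | 3.2%
( | 7 | 3.2%
, | 6 | 2.7%
- | 5 | 2.3%
7 | 3 | 1.4%
Other values (7) | 11 | 5.0%

Latin
Value | Count | Frequency (%)
I | 6 | 19.4%
H | 4 | 12.9%
(lost) | 3 | 9.7%
L | 3 | 9.7%
(lost) | 2 | 6.5%
e | 2 | 6.5%
c | 2 | 6.5%
S | 2 | 6.5%
K | 2 | 6.5%
G | 1 | 3.2%
Other values (4) | 4 | 12.9%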

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1610 | 86.5%
ASCII | 246 | 13.2%
Number Forms | 6 | 0.3%

Most frequent character per block

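ASCII
Value | Count | Frequency (%)
(space) | 130 | 52.8%
1 | 22 | 8.9%
2 | 15 | 6.1%
3 | 8 | 3.3%
4 | 7 | 2.8%
) | 7 | 2.8%
( | 7 | 2.8%
I | 6 | 2.4%
, | 6 | 2.4%
- | 5 | 2.0%
Other values (18) | 33 | 13.4%

Hangul
Value | Count | Frequency (%)
(lost) | 55 | 3.4%
(lost) | 46 | 2.9%
(lost) | 41 | 2.5%
(lost) | 39 | 2.4%
(lost) | 38 | 2.4%
(lost) | 36 | 2.2%
(lost) | 35 | 2.2%
(lost) | 32 | 2.0%
(lost) | 26 | 1.6%
(lost) | 26 | 1.6%
Other values (238) | 1236 | 76.8%

Number Forms
Value | Count | Frequency (%)
(lost) | 3 | 50.0%
(lost) | 2 | 33.3%
(lost) | 1 | 16.7%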

도로명주소
Text

Distinct: 219
Distinct (%): 96.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 23
Median length: 22
Mean length: 16.60793
Min length: 14

Characters and Unicode

Total characters: 3770
Distinct characters: 87
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 214
Unique (%): 94.3%

Sample

1st row: 경기도 의정부시 송양로 75
2nd row: 경기도 의정부시 용민로 441
3rd row: 경기도 의정부시 용민로 373-20
4th row: 경기도 의정부시 송양로 93
5th row: 경기도 의정부시 누원로 52

Words

Value | Count | Frequency (%)
경기도 | 227 | 24.9%
의정부시 | 227 | 24.9%
용민로 | 16 | 1.8%
오목로 | 15 | 1.6%
시민로 | 11 | 1.2%
부용로 | 11 | 1.2%
장곡로 | 11 | 1.2%
송양로 | 10 | 1.1%
동일로 | 9 | 1.0%
천보로 | 7 | 0.8%
Other values (239) | 366 | 40.2%

Most occurring characters

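Value | Count | Frequency (%)
(space) | 683 | 18.1%
(lost) | 241 | 6.4%
(lost) | 239 | 6.3%
(lost) | 230 | 6.1%
(lost) | 229 | 6.1%
(lost) | 229 | 6.1%
(lost) | 227 | 6.0%
(lost) | 227 | 6.0%
(lost) | 224 | 5.9%
2 | 128 | 3.4%
Other values (77) | 1113 | 29.5%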

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2378 | 63.1%
Decimal Number | 698 | 18.5%
Space Separator | 683 | 18.1%
Dash Punctuation | 11 | 0.3%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(lost) | 241 | 10.1%
(lost) | 239 | 10.1%
(lost) | 230 | 9.7%
(lost) | 229 | 9.6%
(lost) | 229 | 9.6%
(lost) | 227 | 9.5%
(lost) | 227 | 9.5%
(lost) | 224 | 9.4%
(lost) | 56 | 2.4%
(lost) | 53 | 2.2%
Other values (65) | 423 | 17.8%

Decimal Number
Value | Count | Frequency (%)
2 | 128 | 18.3%
1 | 116 | 16.6%
5 | 73 | 10.5%
3 | 71 | 10.2%
6 | 64 | 9.2%
4 | 61 | 8.7%
7 | 53 | 7.6%
0 | 52 | 7.4%
9 | 42 | 6.0%
8 | 38 | 5.4%

Space Separator
Value | Count | Frequency (%)
(space) | 683 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 11 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2378 | 63.1%
Common | 1392 | 36.9%

Most frequent character per script

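Hangul
Value | Count | Frequency (%)
(lost) | 241 | 10.1%
(lost) | 239 | 10.1%
(lost) | 230 | 9.7%
(lost) | 229 | 9.6%
(lost) | 229 | 9.6%
(lost) | 227 | 9.5%
(lost) | 227 | 9.5%
(lost) | 224 | 9.4%
(lost) | 56 | 2.4%
(lost) | 53 | 2.2%
Other values (65) | 423 | 17.8%

Common
Value | Count | Frequency (%)
(space) | 683 | 49.1%
2 | 128 | 9.2%
1 | 116 | 8.3%
5 | 73 | 5.2%
3 | 71 | 5.1%
6 | 64 | 4.6%
4 | 61 | 4.4%
7 | 53 | 3.8%
0 | 52 | 3.7%
9 | 42 | 3.0%
Other values (2) | 49 | 3.5%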

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2378 | 63.1%
ASCII | 1392 | 36.9%

Most frequent character per block

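ASCII
Value | Count | Frequency (%)
(space) | 683 | 49.1%
2 | 128 | 9.2%
1 | 116 | 8.3%
5 | 73 | 5.2%
3 | 71 | 5.1%
6 | 64 | 4.6%
4 | 61 | 4.4%
7 | 53 | 3.8%
0 | 52 | 3.7%
9 | 42 | 3.0%
Other values (2) | 49 | 3.5%

Hangul
Value | Count | Frequency (%)
(lost) | 241 | 10.1%
(lost) | 239 | 10.1%
(lost) | 230 | 9.7%
(lost) | 229 | 9.6%
(lost) | 229 | 9.6%
(lost) | 227 | 9.5%
(lost) | 227 | 9.5%
(lost) | 224 | 9.4%
(lost) | 56 | 2.4%
(lost) | 53 | 2.2%
Other values (65) | 423 | 17.8%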

우편번호
Real number (ℝ)

Distinct: 112
Distinct (%): 49.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11749.52
Minimum: 11601
Maximum: 11815
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 11601
5-th percentile: 11622
Q1: 11716
Median: 11769
Q3: 11803
95-th percentile: 11813
Maximum: 11815
Range: 214
Interquartile range (IQR): 87

Descriptive statistics

Standard deviation: 63.271387
Coefficient of variation (CV): 0.005385019
Kurtosis: -0.41157983
Mean: 11749.52
Median Absolute Deviation (MAD): 43
Skewness: -0.86107599
Sum: 2667141
Variance: 4003.2684
Monotonicity: Not monotonic
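
Several of these figures are derived from others in the report. The sketch below recomputes the range, interquartile range, coefficient of variation, and variance from the reported minimum, maximum, quartiles, mean, and standard deviation. It uses the textbook definitions, which reproduce the reported values; the variable names are illustrative.

```python
# Values copied from the statistics reported for this variable.
mean = 11749.52
std = 63.271387
minimum, maximum = 11601, 11815
q1, q3 = 11716, 11803

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)
cv = std / mean                  # Coefficient of variation (CV)
variance = std ** 2              # Variance is the squared standard deviation

print("range:", value_range)
print("IQR:", iqr)
print(f"CV: {cv:.9f}")
print(f"variance: {variance:.4f}")
```

The very small CV reflects that postal codes in one city sit in a narrow numeric band; the values are identifiers, so the moments carry little meaning beyond describing that band.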
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
11813 | 20 | 8.8%
11812 | 14 | 6.2%
11800 | 11 | 4.8%
11801 | 10 | 4.4%
11815 | 8 | 3.5%
11757 | 5 | 2.2%
11759 | 4 | 1.8%
11644 | 3 | 1.3%
11752 | 3 | 1.3%
11811 | 3 | 1.3%
Other values (102) | 146 | 64.3%

Minimum 10 values

Value | Count | Frequency (%)
11601 | 1 | 0.4%
11602 | 2 | 0.9%
11606 | 1 | 0.4%
11609 | 1 | 0.4%
11611 | 1 | 0.4%
11612 | 1 | 0.4%
11615 | 2 | 0.9%
11618 | 1 | 0.4%
11621 | 1 | 0.4%
11622 | 2 | 0.9%

Maximum 10 values

Value | Count | Frequency (%)
11815 | 8 | 3.5%
11814 | 2 | 0.9%
11813 | 20 | 8.8%
11812 | 14 | 6.2%
11811 | 3 | 1.3%
11810 | 2 | 0.9%
11808 | 1 | 0.4%
11807 | 1 | 0.4%
11806 | 2 | 0.9%
11805 | 2 | 0.9%

건축물연먼적(m2)
Real number (ℝ)

UNIQUE 

Distinct: 227
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 61960.243
Minimum: 10015.31
Maximum: 360789.01
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 10015.31
5-th percentile: 10459.579
Q1: 13348.36
Median: 45178
Q3: 90496.72
95-th percentile: 170459.53
Maximum: 360789.01
Range: 350773.7
Interquartile range (IQR): 77148.36

Descriptive statistics

Standard deviation: 60378.04
Coefficient of variation (CV): 0.97446422
Kurtosis: 4.1775324
Mean: 61960.243
Median Absolute Deviation (MAD): 32300.57
Skewness: 1.7458072
Sum: 14064975
Variance: 3.6455077 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
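
To make "fixed size bins" concrete, the sketch below splits the interval from the reported minimum to the reported maximum into 50 equal-width bins and counts values per bin. It assumes the bins span exactly that interval; the report's own plotting code may differ. The function name `fixed_width_histogram` is hypothetical, and only the ten largest values listed for this variable are binned, since the full column is not reproduced in the report.

```python
def fixed_width_histogram(values, bins, lo, hi):
    """Count values into `bins` equal-width bins covering [lo, hi]."""
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # A value equal to `hi` sits on the last edge; clamp it into the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return width, counts


# Minimum and maximum copied from the statistics reported for this variable.
lo, hi = 10015.31, 360789.01

# The ten largest values reported for this variable.
largest = [360789.01, 319882.27, 283286.33, 248685.49, 240523.0,
           231705.31, 212492.0, 210676.47, 183536.25, 175898.02]

width, counts = fixed_width_histogram(largest, bins=50, lo=lo, hi=hi)
print(f"bin width: {width:.3f}")
print("non-empty bins:", {i: c for i, c in enumerate(counts) if c})
```

With a bin width of roughly 7015 m2, these ten largest buildings spread over the upper half of the bins, which is consistent with the long right tail indicated by the positive skewness.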

Common values

Value | Count | Frequency (%)
46013.96 | 1 | 0.4%
12637.74 | 1 | 0.4%
12776.12 | 1 | 0.4%
12877.43 | 1 | 0.4%
12892.59 | 1 | 0.4%
12934.17 | 1 | 0.4%
12961.12 | 1 | 0.4%
13009.5 | 1 | 0.4%
13135.33 | 1 | 0.4%
13140.34 | 1 | 0.4%
Other values (217) | 217 | 95.6%

Minimum 10 values

Value | Count | Frequency (%)
10015.31 | 1 | 0.4%
10018.05 | 1 | 0.4%
10055.55 | 1 | 0.4%
10115.8 | 1 | 0.4%
10119.59 | 1 | 0.4%
10155.13 | 1 | 0.4%
10181.66 | 1 | 0.4%
10218.67 | 1 | 0.4%
10237.25 | 1 | 0.4%
10269.45 | 1 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
360789.01 | 1 | 0.4%
319882.27 | 1 | 0.4%
283286.33 | 1 | 0.4%
248685.49 | 1 | 0.4%
240523.0 | 1 | 0.4%
231705.31 | 1 | 0.4%
212492.0 | 1 | 0.4%
210676.47 | 1 | 0.4%
183536.25 | 1 | 0.4%
175898.02 | 1 | 0.4%

Interactions

[Figure: interaction plots]

Correlations

Variable | 건축물용도 | 우편번호 | 건축물연먼적(m2)
건축물용도 | 1.000 | 0.512 | 0.430
우편번호 | 0.512 | 1.000 | 0.339
건축물연먼적(m2) | 0.430 | 0.339 | 1.000

Variable | 우편번호 | 건축물연먼적(m2) | 건축물용도
우편번호 | 1.000 | -0.039 | 0.238
건축물연먼적(m2) | -0.039 | 1.000 | 0.182
건축물용도 | 0.238 | 0.182 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
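
A text-only stand-in for the nullity-by-column view can be built from the per-variable missing counts reported earlier. This is a minimal sketch with illustrative names; it does not reproduce the plots themselves.

```python
n_observations = 227

# Missing counts per column, copied from the variable summaries in this report.
missing = {
    "건축물용도": 0,
    "건물명": 1,
    "도로명주소": 0,
    "우편번호": 0,
    "건축물연먼적(m2)": 0,
}

# Nullity by column: the number of non-missing values in each column.
present = {col: n_observations - m for col, m in missing.items()}

for col, count in present.items():
    completeness = count / n_observations
    bar = "#" * round(completeness * 20)  # 20-character text bar
    print(f"{col}: {count} ({completeness:.1%}) {bar}")
```

Only 건물명 has a gap (226 of 227 values present), which accounts for the single missing cell in the dataset statistics.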

Sample

First rows

Row | 건축물용도 | 건물명 | 도로명주소 | 우편번호 | 건축물연먼적(m2)
0 | 공동주택(아파트) | 민락센트럴(구 민락17단지) | 경기도 의정부시 송양로 75 | 11812 | 46013.96
1 | 공동주택(아파트) | 브라운빌리지 | 경기도 의정부시 용민로 441 | 11815 | 40500.62
2 | 공동주택(아파트) | 대광로제비앙포레스트 | 경기도 의정부시 용민로 373-20 | 11770 | 60336.0
3 | 공동주택(아파트) | 호반베르디움 3차 | 경기도 의정부시 송양로 93 | 11812 | 71346.84
4 | 공동주택(아파트) | 수락리버시티 2단지 | 경기도 의정부시 누원로 52 | 11727 | 65197.82
5 | 공동주택(아파트) | 정음마을 고산1단지 | 경기도 의정부시 입암로 14 | 11801 | 29219.31
6 | 공동주택(아파트) | 장암 우성 | 경기도 의정부시 동일로 400 | 11720 | 61020.66
7 | 공동주택(아파트) | 신곡 성원1차 | 경기도 의정부시 시민로 333 | 11782 | 51521.68
8 | 공동주택(아파트) | 호반베르디움 2차 | 경기도 의정부시 송양로 16 | 11812 | 80942.0
9 | 공동주택(아파트) | 민락15단지 | 경기도 의정부시 용민로 442 | 11812 | 77068.62

Last rows

Row | 건축물용도 | 건물명 | 도로명주소 | 우편번호 | 건축물연먼적(m2)
217 | 교육연구시설 | 을지대학교 의정부캠퍼스 및 부속병원 | 경기도 의정부시 동일로 712 | 11759 | 210676.47
218 | 판매시설 | 역지하상가 | 경기도 의정부시 시민로 지하 100 | 11696 | 283286.33
219 | 공장 | 한강듀클래스 의정부 고산 | 경기도 의정부시 문화로 10 | 11801 | 37044.97
220 | 공장 | 의정부 더리브 센텀스퀘어Ⅲ 지식산업센터 | 경기도 의정부시 배꽃길 63 | 11815 | 41963.58
221 | 공장 | 의정부 더리브 센텀스퀘어Ⅱ 지식산업센터 | 경기도 의정부시 배꽃길 7 | 11815 | 42297.63
222 | 공장 | 의정부 더리브 센텀스퀘어Ⅰ 지식산업센터 | 경기도 의정부시 배꽃길 105 | 11815 | 42390.09
223 | 교육연구및복지시설 | 경기북과학고등학교 | 경기도 의정부시 체육로135번길 32 | 11601 | 18553.22
224 | 교육연구및복지시설 | 광동고등학교 | 경기도 의정부시 서부로 691 | 11615 | 10269.45
225 | 제2종근린생활시설 | 그린타워 | 경기도 의정부시 천보로 64 | 11813 | 13293.92
226 | 업무시설 | 경기도북부경찰청 | 경기도 의정부시 금오로23번길 22-49 | 11763 | 10898.02