Overview

Dataset statistics

Number of variables: 6
Number of observations: 89
Missing cells: 10
Missing cells (%): 1.9%
Duplicate rows: 1
Duplicate rows (%): 1.1%
Total size in memory: 4.4 KiB
Average record size in memory: 50.5 B
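
The two percentage figures can be reproduced from the counts in this table. The sketch below assumes that the missing-cell share is taken over all cells (variables × observations) and the duplicate share over all rows; the report does not state these denominators, but the results match its rounded values.

```python
# Counts copied from the dataset statistics of this report.
n_variables = 6
n_observations = 89
missing_cells = 10
duplicate_rows = 1

# Assumption: the missing-cell share is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = round(100 * missing_cells / total_cells, 1)

# Assumption: the duplicate share is computed over all rows.
duplicate_pct = round(100 * duplicate_rows / n_observations, 1)

print(total_cells, missing_pct, duplicate_pct)
```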

Variable types

Text: 3
Numeric: 1
Categorical: 1
DateTime: 1

Dataset

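Description: A catalog of buildings subject to asbestos inspection in Pohang-si, Gyeongsangbuk-do, listing each building's name (건물명), site location (대지위치), total floor area (연면적), and main use (주용도), compiled as reference material for building maintenance.
URL: https://www.data.go.kr/data/15113223/fileData.do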

Alerts

데이터기준일자 (data reference date) has constant value "2023-04-07" (Constant)
Dataset has 1 (1.1%) duplicate rows (Duplicates)
도로명주소 (road-name address) has 10 (11.2%) missing values (Missing)

Reproduction

Analysis started: 2023-12-12 08:33:48.457043
Analysis finished: 2023-12-12 08:33:49.872679
Duration: 1.42 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

건물명 (building name)
Text

Distinct: 82
Distinct (%): 92.1%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B

Length

Max length: 22
Median length: 13
Mean length: 7.752809
Min length: 2

Characters and Unicode

Total characters: 690
Distinct characters: 172
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
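
As a rough illustration of the category counts reported for text variables, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. The two building names come from the sample rows of this report; the helper name `category_counts` is hypothetical. The standard library exposes general categories only, so scripts and blocks are not covered here.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count Unicode general categories (e.g. 'Lo', 'Zs') in a string."""
    return Counter(unicodedata.category(ch) for ch in text)


# Two building names taken from the sample rows of this report.
names = ["중계 펌프장", "클라이밍장(인공암벽장)"]

totals = Counter()
for name in names:
    totals += category_counts(name)

# 'Lo' = Other Letter, 'Zs' = Space Separator,
# 'Ps' = Open Punctuation, 'Pe' = Close Punctuation.
print(dict(totals))
```

Hangul syllables fall under Other Letter, which is why that category dominates the tables for every text variable in this dataset.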

Unique

Unique: 75
Unique (%): 84.3%

Sample

1st row: 포항전통문화체험관
2nd row: 중앙동사무소
3rd row: 포항전통문화체험관
4th row: 중계 펌프장
5th row: 교육원

Value | Count | Frequency (%)
포항시 | 7 | 5.7%
포항전통문화체험관 | 2 | 1.6%
새천년기념관 | 2 | 1.6%
(not preserved) | 2 | 1.6%
장량하수처리장 | 2 | 1.6%
남구보건소 | 2 | 1.6%
복지회관 | 2 | 1.6%
북구보건소 | 2 | 1.6%
행정복지센터 | 2 | 1.6%
버스공영차고지 | 2 | 1.6%
Other values (92) | 97 | 79.5%

Most occurring characters

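Value | Count | Frequency (%)
(space) | 35 | 5.1%
(not preserved) | 27 | 3.9%
(not preserved) | 26 | 3.8%
(not preserved) | 25 | 3.6%
(not preserved) | 20 | 2.9%
(not preserved) | 20 | 2.9%
(not preserved) | 18 | 2.6%
(not preserved) | 17 | 2.5%
(not preserved) | 15 | 2.2%
(not preserved) | 15 | 2.2%
Other values (162) | 472 | 68.4%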

Most occurring categories

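Value | Count | Frequency (%)
Other Letter | 644 | 93.3%
Space Separator | 35 | 5.1%
Decimal Number | 5 | 0.7%
Open Punctuation | 3 | 0.4%
Close Punctuation | 3 | 0.4%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 27 | 4.2%
(not preserved) | 26 | 4.0%
(not preserved) | 25 | 3.9%
(not preserved) | 20 | 3.1%
(not preserved) | 20 | 3.1%
(not preserved) | 18 | 2.8%
(not preserved) | 17 | 2.6%
(not preserved) | 15 | 2.3%
(not preserved) | 15 | 2.3%
(not preserved) | 13 | 2.0%
Other values (155) | 448 | 69.6%

Decimal Number
Value | Count | Frequency (%)
2 | 2 | 40.0%
9 | 1 | 20.0%
4 | 1 | 20.0%
1 | 1 | 20.0%

Space Separator
Value | Count | Frequency (%)
(space) | 35 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%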

Most occurring scripts

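Value | Count | Frequency (%)
Hangul | 644 | 93.3%
Common | 46 | 6.7%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 27 | 4.2%
(not preserved) | 26 | 4.0%
(not preserved) | 25 | 3.9%
(not preserved) | 20 | 3.1%
(not preserved) | 20 | 3.1%
(not preserved) | 18 | 2.8%
(not preserved) | 17 | 2.6%
(not preserved) | 15 | 2.3%
(not preserved) | 15 | 2.3%
(not preserved) | 13 | 2.0%
Other values (155) | 448 | 69.6%

Common
Value | Count | Frequency (%)
(space) | 35 | 76.1%
( | 3 | 6.5%
) | 3 | 6.5%
2 | 2 | 4.3%
9 | 1 | 2.2%
4 | 1 | 2.2%
1 | 1 | 2.2%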

Most occurring blocks

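Value | Count | Frequency (%)
Hangul | 644 | 93.3%
ASCII | 46 | 6.7%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 35 | 76.1%
( | 3 | 6.5%
) | 3 | 6.5%
2 | 2 | 4.3%
9 | 1 | 2.2%
4 | 1 | 2.2%
1 | 1 | 2.2%

Hangul
Value | Count | Frequency (%)
(not preserved) | 27 | 4.2%
(not preserved) | 26 | 4.0%
(not preserved) | 25 | 3.9%
(not preserved) | 20 | 3.1%
(not preserved) | 20 | 3.1%
(not preserved) | 18 | 2.8%
(not preserved) | 17 | 2.6%
(not preserved) | 15 | 2.3%
(not preserved) | 15 | 2.3%
(not preserved) | 13 | 2.0%
Other values (155) | 448 | 69.6%
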
대지위치 (site location)
Text

Distinct: 72
Distinct (%): 80.9%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B

Length

Max length: 32
Median length: 27
Mean length: 22.404494
Min length: 17

Characters and Unicode

Total characters: 1994
Distinct characters: 83
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 62
Unique (%): 69.7%

Sample

1st row: 경상북도 포항시 북구 기북면 오덕리 235-1
2nd row: 경상북도 포항시 북구 신흥동 693-19
3rd row: 경상북도 포항시 북구 기북면 오덕리 235-1
4th row: 경상북도 포항시 북구 흥해읍 흥안리 329
5th row: 경상북도 포항시 북구 기계면 봉계리 258-1

Value | Count | Frequency (%)
경상북도 | 89 | 18.2%
포항시 | 89 | 18.2%
남구 | 49 | 10.0%
북구 | 40 | 8.2%
대도동 | 14 | 2.9%
흥해읍 | 9 | 1.8%
313-1 | 8 | 1.6%
(not preserved) | 7 | 1.4%
오천읍 | 5 | 1.0%
대보리 | 4 | 0.8%
Other values (120) | 175 | 35.8%

Most occurring characters

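Value | Count | Frequency (%)
(space) | 400 | 20.1%
(not preserved) | 132 | 6.6%
(not preserved) | 112 | 5.6%
(not preserved) | 96 | 4.8%
(not preserved) | 95 | 4.8%
1 | 93 | 4.7%
(not preserved) | 92 | 4.6%
(not preserved) | 89 | 4.5%
(not preserved) | 89 | 4.5%
(not preserved) | 89 | 4.5%
Other values (73) | 707 | 35.5%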

Most occurring categories

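Value | Count | Frequency (%)
Other Letter | 1194 | 59.9%
Space Separator | 400 | 20.1%
Decimal Number | 337 | 16.9%
Dash Punctuation | 59 | 3.0%
Uppercase Letter | 4 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 132 | 11.1%
(not preserved) | 112 | 9.4%
(not preserved) | 96 | 8.0%
(not preserved) | 95 | 8.0%
(not preserved) | 92 | 7.7%
(not preserved) | 89 | 7.5%
(not preserved) | 89 | 7.5%
(not preserved) | 89 | 7.5%
(not preserved) | 58 | 4.9%
(not preserved) | 49 | 4.1%
Other values (59) | 293 | 24.5%

Decimal Number
Value | Count | Frequency (%)
1 | 93 | 27.6%
3 | 59 | 17.5%
2 | 35 | 10.4%
6 | 31 | 9.2%
5 | 27 | 8.0%
4 | 25 | 7.4%
9 | 23 | 6.8%
8 | 19 | 5.6%
7 | 13 | 3.9%
0 | 12 | 3.6%

Uppercase Letter
Value | Count | Frequency (%)
L | 2 | 50.0%
B | 2 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 400 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 59 | 100.0%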

Most occurring scripts

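Value | Count | Frequency (%)
Hangul | 1194 | 59.9%
Common | 796 | 39.9%
Latin | 4 | 0.2%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 132 | 11.1%
(not preserved) | 112 | 9.4%
(not preserved) | 96 | 8.0%
(not preserved) | 95 | 8.0%
(not preserved) | 92 | 7.7%
(not preserved) | 89 | 7.5%
(not preserved) | 89 | 7.5%
(not preserved) | 89 | 7.5%
(not preserved) | 58 | 4.9%
(not preserved) | 49 | 4.1%
Other values (59) | 293 | 24.5%

Common
Value | Count | Frequency (%)
(space) | 400 | 50.3%
1 | 93 | 11.7%
3 | 59 | 7.4%
- | 59 | 7.4%
2 | 35 | 4.4%
6 | 31 | 3.9%
5 | 27 | 3.4%
4 | 25 | 3.1%
9 | 23 | 2.9%
8 | 19 | 2.4%
Other values (2) | 25 | 3.1%

Latin
Value | Count | Frequency (%)
L | 2 | 50.0%
B | 2 | 50.0%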

Most occurring blocks

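Value | Count | Frequency (%)
Hangul | 1194 | 59.9%
ASCII | 800 | 40.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 400 | 50.0%
1 | 93 | 11.6%
3 | 59 | 7.4%
- | 59 | 7.4%
2 | 35 | 4.4%
6 | 31 | 3.9%
5 | 27 | 3.4%
4 | 25 | 3.1%
9 | 23 | 2.9%
8 | 19 | 2.4%
Other values (4) | 29 | 3.6%

Hangul
Value | Count | Frequency (%)
(not preserved) | 132 | 11.1%
(not preserved) | 112 | 9.4%
(not preserved) | 96 | 8.0%
(not preserved) | 95 | 8.0%
(not preserved) | 92 | 7.7%
(not preserved) | 89 | 7.5%
(not preserved) | 89 | 7.5%
(not preserved) | 89 | 7.5%
(not preserved) | 58 | 4.9%
(not preserved) | 49 | 4.1%
Other values (59) | 293 | 24.5%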

도로명주소 (road-name address)
Text

MISSING

Distinct: 66
Distinct (%): 83.5%
Missing: 10
Missing (%): 11.2%
Memory size: 844.0 B
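
The Distinct (%) value for this variable only matches the counts if it is computed over the non-missing rows rather than all 89 observations. That denominator is an inference from the numbers, not something the report states; the sketch below checks both possibilities.

```python
# Counts copied from the 도로명주소 summary of this report.
n_rows = 89       # observations in the dataset
n_missing = 10    # missing 도로명주소 values
n_distinct = 66   # distinct non-missing values

n_present = n_rows - n_missing

missing_pct = round(100 * n_missing / n_rows, 1)
# Assumption under test: distinct share relative to non-missing rows.
distinct_pct_present = round(100 * n_distinct / n_present, 1)
# Alternative: distinct share relative to all rows.
distinct_pct_all = round(100 * n_distinct / n_rows, 1)

print(missing_pct, distinct_pct_present, distinct_pct_all)
```

Only the non-missing denominator reproduces the reported 83.5%; dividing by all rows would give 74.2%.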

Length

Max length: 31
Median length: 29
Mean length: 25.379747
Min length: 22

Characters and Unicode

Total characters: 2005
Distinct characters: 103
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 58
Unique (%): 73.4%

Sample

1st row: 경상북도 포항시 북구 기북면 덕동문화길 7
2nd row: 경상북도 포항시 북구 용당로 142 (신흥동)
3rd row: 경상북도 포항시 북구 기북면 덕동문화길 7
4th row: 경상북도 포항시 북구 흥해읍 칠포로258번길 40-24
5th row: 경상북도 포항시 북구 기계면 봉계길 34-71

Value | Count | Frequency (%)
경상북도 | 79 | 16.7%
포항시 | 79 | 16.7%
남구 | 43 | 9.1%
북구 | 36 | 7.6%
대도동 | 14 | 3.0%
희망대로 | 9 | 1.9%
흥해읍 | 8 | 1.7%
810 | 6 | 1.3%
해맞이로 | 4 | 0.8%
14 | 4 | 0.8%
Other values (129) | 192 | 40.5%

Most occurring characters

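Value | Count | Frequency (%)
(space) | 395 | 19.7%
(not preserved) | 120 | 6.0%
(not preserved) | 100 | 5.0%
(not preserved) | 85 | 4.2%
(not preserved) | 85 | 4.2%
(not preserved) | 83 | 4.1%
(not preserved) | 82 | 4.1%
(not preserved) | 80 | 4.0%
(not preserved) | 79 | 3.9%
(not preserved) | 63 | 3.1%
Other values (93) | 833 | 41.5%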

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1259 | 62.8%
Space Separator | 395 | 19.7%
Decimal Number | 243 | 12.1%
Open Punctuation | 49 | 2.4%
Close Punctuation | 49 | 2.4%
Dash Punctuation | 10 | 0.5%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 120 | 9.5%
(not preserved) | 100 | 7.9%
(not preserved) | 85 | 6.8%
(not preserved) | 85 | 6.8%
(not preserved) | 83 | 6.6%
(not preserved) | 82 | 6.5%
(not preserved) | 80 | 6.4%
(not preserved) | 79 | 6.3%
(not preserved) | 63 | 5.0%
(not preserved) | 58 | 4.6%
Other values (79) | 424 | 33.7%

Decimal Number
Value | Count | Frequency (%)
1 | 48 | 19.8%
0 | 34 | 14.0%
3 | 32 | 13.2%
4 | 26 | 10.7%
2 | 24 | 9.9%
7 | 17 | 7.0%
8 | 16 | 6.6%
6 | 16 | 6.6%
5 | 15 | 6.2%
9 | 15 | 6.2%

Space Separator
Value | Count | Frequency (%)
(space) | 395 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 49 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 49 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 10 | 100.0%

Most occurring scripts

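Value | Count | Frequency (%)
Hangul | 1259 | 62.8%
Common | 746 | 37.2%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 120 | 9.5%
(not preserved) | 100 | 7.9%
(not preserved) | 85 | 6.8%
(not preserved) | 85 | 6.8%
(not preserved) | 83 | 6.6%
(not preserved) | 82 | 6.5%
(not preserved) | 80 | 6.4%
(not preserved) | 79 | 6.3%
(not preserved) | 63 | 5.0%
(not preserved) | 58 | 4.6%
Other values (79) | 424 | 33.7%

Common
Value | Count | Frequency (%)
(space) | 395 | 52.9%
( | 49 | 6.6%
) | 49 | 6.6%
1 | 48 | 6.4%
0 | 34 | 4.6%
3 | 32 | 4.3%
4 | 26 | 3.5%
2 | 24 | 3.2%
7 | 17 | 2.3%
8 | 16 | 2.1%
Other values (4) | 56 | 7.5%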

Most occurring blocks

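Value | Count | Frequency (%)
Hangul | 1259 | 62.8%
ASCII | 746 | 37.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 395 | 52.9%
( | 49 | 6.6%
) | 49 | 6.6%
1 | 48 | 6.4%
0 | 34 | 4.6%
3 | 32 | 4.3%
4 | 26 | 3.5%
2 | 24 | 3.2%
7 | 17 | 2.3%
8 | 16 | 2.1%
Other values (4) | 56 | 7.5%

Hangul
Value | Count | Frequency (%)
(not preserved) | 120 | 9.5%
(not preserved) | 100 | 7.9%
(not preserved) | 85 | 6.8%
(not preserved) | 85 | 6.8%
(not preserved) | 83 | 6.6%
(not preserved) | 82 | 6.5%
(not preserved) | 80 | 6.4%
(not preserved) | 79 | 6.3%
(not preserved) | 63 | 5.0%
(not preserved) | 58 | 4.6%
Other values (79) | 424 | 33.7%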

연면적 (total floor area)
Real number (ℝ)

Distinct: 87
Distinct (%): 97.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3323.2004
Minimum: 544.32
Maximum: 36333.17
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 933.0 B

Quantile statistics

Minimum: 544.32
5th percentile: 582
Q1: 868.22
Median: 1312.02
Q3: 3579.26
95th percentile: 9957.308
Maximum: 36333.17
Range: 35788.85
Interquartile range (IQR): 2711.04

Descriptive statistics

Standard deviation: 5031.7395
Coefficient of variation (CV): 1.5141246
Kurtosis: 22.132594
Mean: 3323.2004
Median Absolute Deviation (MAD): 667
Skewness: 4.1269851
Sum: 295764.83
Variance: 25318402
Monotonicity: Increasing
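
Several of the figures for 연면적 are tied together by standard definitions: CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, range = maximum - minimum, and mean = sum / n. The sketch below checks the reported values against those relations, assuming the report uses these textbook definitions; inputs are the rounded numbers printed above, so agreement is only expected up to rounding.

```python
# Values copied from the quantile and descriptive statistics of this report.
mean = 3323.2004
std = 5031.7395
total = 295764.83
n = 89
q1, q3 = 868.22, 3579.26
minimum, maximum = 544.32, 36333.17

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance from the standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
mean_from_sum = total / n        # mean recomputed from the sum

print(round(cv, 7), round(variance), round(iqr, 2),
      round(value_range, 2), round(mean_from_sum, 4))
```

A CV above 1.5 together with a mean (3323.2) well above the median (1312.02) is consistent with the strong right skew reported for this variable.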
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
868.22 | 2 | 2.2%
5447.7 | 2 | 2.2%
544.32 | 1 | 1.1%
2575.53 | 1 | 1.1%
3312.08 | 1 | 1.1%
3226.78 | 1 | 1.1%
3206.3 | 1 | 1.1%
3150.0 | 1 | 1.1%
2952.49 | 1 | 1.1%
2732.65 | 1 | 1.1%
Other values (77) | 77 | 86.5%

Minimum 10 values

Value | Count | Frequency (%)
544.32 | 1 | 1.1%
565.0 | 1 | 1.1%
570.54 | 1 | 1.1%
572.25 | 1 | 1.1%
572.8 | 1 | 1.1%
595.8 | 1 | 1.1%
608.34 | 1 | 1.1%
628.63 | 1 | 1.1%
631.96 | 1 | 1.1%
658.56 | 1 | 1.1%

Maximum 10 values

Value | Count | Frequency (%)
36333.17 | 1 | 1.1%
20334.84 | 1 | 1.1%
17609.0 | 1 | 1.1%
12760.28 | 1 | 1.1%
10053.98 | 1 | 1.1%
9812.3 | 1 | 1.1%
9195.98 | 1 | 1.1%
8875.34 | 1 | 1.1%
8380.8 | 1 | 1.1%
7534.41 | 1 | 1.1%

주용도 (main use)
Categorical

Distinct: 16
Distinct (%): 18.0%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B

제1종근린생활시설: 19
업무시설: 11
문화및집회시설: 10
운동시설: 10
노유자시설: 10
Other values (11): 29

Length

Max length: 10
Median length: 9
Mean length: 6.4044944
Min length: 2

Unique

Unique: 3
Unique (%): 3.4%

Sample

1st row: 문화및집회시설
2nd row: 제2종근린생활시설
3rd row: 수련시설
4th row: 분뇨.쓰레기처리시설
5th row: 교육연구시설

Common Values

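Value | Count | Frequency (%)
제1종근린생활시설 (Class 1 neighborhood living facility) | 19 | 21.3%
업무시설 (business facility) | 11 | 12.4%
문화및집회시설 (cultural and assembly facility) | 10 | 11.2%
운동시설 (sports facility) | 10 | 11.2%
노유자시설 (facility for the elderly and children) | 10 | 11.2%
교육연구시설 (education and research facility) | 5 | 5.6%
제2종근린생활시설 (Class 2 neighborhood living facility) | 4 | 4.5%
분뇨.쓰레기처리시설 (excreta and waste treatment facility) | 4 | 4.5%
자원순환관련시설 (resource-recycling facility) | 4 | 4.5%
자동차관련시설 (automobile-related facility) | 3 | 3.4%
Other values (6) | 9 | 10.1%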

Length

Histogram of lengths of the category

데이터기준일자 (data reference date)
Date

CONSTANT

Distinct: 1
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 844.0 B
Minimum: 2023-04-07 00:00:00
Maximum: 2023-04-07 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

Variable | 건물명 | 대지위치 | 도로명주소 | 연면적 | 주용도
건물명 | 1.000 | 0.998 | 0.999 | 1.000 | 0.985
대지위치 | 0.998 | 1.000 | 1.000 | 0.000 | 0.973
도로명주소 | 0.999 | 1.000 | 1.000 | 0.000 | 0.987
연면적 | 1.000 | 0.000 | 0.000 | 1.000 | 0.000
주용도 | 0.985 | 0.973 | 0.987 | 0.000 | 1.000

Variable | 연면적 | 주용도
연면적 | 1.000 | 0.000
주용도 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 건물명 | 대지위치 | 도로명주소 | 연면적 | 주용도 | 데이터기준일자
0 | 포항전통문화체험관 | 경상북도 포항시 북구 기북면 오덕리 235-1 | 경상북도 포항시 북구 기북면 덕동문화길 7 | 544.32 | 문화및집회시설 | 2023-04-07
1 | 중앙동사무소 | 경상북도 포항시 북구 신흥동 693-19 | 경상북도 포항시 북구 용당로 142 (신흥동) | 565.0 | 제2종근린생활시설 | 2023-04-07
2 | 포항전통문화체험관 | 경상북도 포항시 북구 기북면 오덕리 235-1 | 경상북도 포항시 북구 기북면 덕동문화길 7 | 570.54 | 수련시설 | 2023-04-07
3 | 중계 펌프장 | 경상북도 포항시 북구 흥해읍 흥안리 329 | 경상북도 포항시 북구 흥해읍 칠포로258번길 40-24 | 572.25 | 분뇨.쓰레기처리시설 | 2023-04-07
4 | 교육원 | 경상북도 포항시 북구 기계면 봉계리 258-1 | 경상북도 포항시 북구 기계면 봉계길 34-71 | 572.8 | 교육연구시설 | 2023-04-07
5 | 동빈동사무소 | 경상북도 포항시 북구 동빈1가 74-15 | 경상북도 포항시 북구 삼호로46번길 14 (동빈1가) | 595.8 | 제1종근린생활시설 | 2023-04-07
6 | 클라이밍장(인공암벽장) | 경상북도 포항시 남구 대도동 313-1 | 경상북도 포항시 남구 희망대로 810 (대도동) | 608.34 | 운동시설 | 2023-04-07
7 | 제철동다목적복지회관 | 경상북도 포항시 남구 인덕동 161-2 | 경상북도 포항시 남구 인덕로 52 (인덕동) | 628.63 | 제1종근린생활시설 | 2023-04-07
8 | 용한리복지시설 | 경상북도 포항시 북구 흥해읍 용한리 산 55-2 | <NA> | 631.96 | 제1종근린생활시설 | 2023-04-07
9 | 청하면사무소 | 경상북도 포항시 북구 청하면 덕성리 276-3 | 경상북도 포항시 북구 청하면 청하로217번길 22 | 658.56 | 제1종근린생활시설 | 2023-04-07

Last rows

Index | 건물명 | 대지위치 | 도로명주소 | 연면적 | 주용도 | 데이터기준일자
79 | 포항시 하수처리수 재이용시설 신축공사 | 경상북도 포항시 남구 상도동 125-25 | 경상북도 포항시 남구 형산강북로 203 (상도동) | 7534.41 | 자원순환관련시설 | 2023-04-07
80 | 포항시청 | 경상북도 포항시 남구 대잠동 1001 | 경상북도 포항시 남구 시청로 1 (대잠동) | 8380.8 | 업무시설 | 2023-04-07
81 | 청과 2동 | 경상북도 포항시 북구 흥해읍 학천리 4 | 경상북도 포항시 북구 흥해읍 동해대로 1182 | 8875.34 | 판매시설 | 2023-04-07
82 | 포항시청 | 경상북도 포항시 남구 대잠동 1001 | 경상북도 포항시 남구 시청로 1 (대잠동) | 9195.98 | 업무시설 | 2023-04-07
83 | 중앙도서관 | 경상북도 포항시 북구 덕수동 35-19 | 경상북도 포항시 북구 삼호로 31 (덕수동) | 9812.3 | 교육연구시설 | 2023-04-07
84 | 실내체육관 | 경상북도 포항시 남구 대도동 313-1 | 경상북도 포항시 남구 희망대로 810 (대도동) | 10053.98 | 운동시설 | 2023-04-07
85 | 포항시 평생학습원 | 경상북도 포항시 남구 상도동 672 | 경상북도 포항시 남구 뱃머리길 39 (상도동) | 12760.28 | 노유자시설 | 2023-04-07
86 | 운동장 | 경상북도 포항시 남구 대도동 313-1 | 경상북도 포항시 남구 희망대로 810 (대도동) | 17609.0 | 운동시설 | 2023-04-07
87 | 야구장 | 경상북도 포항시 남구 대도동 313-1 | 경상북도 포항시 남구 희망대로 790 (대도동) | 20334.84 | 문화및집회시설 | 2023-04-07
88 | 시청사 | 경상북도 포항시 남구 대잠동 1001 | 경상북도 포항시 남구 시청로 1 (대잠동) | 36333.17 | 업무시설 | 2023-04-07

Duplicate rows

Most frequently occurring

Index | 건물명 | 대지위치 | 도로명주소 | 연면적 | 주용도 | 데이터기준일자 | # duplicates
0 | 새천년기념관 | 경상북도 포항시 남구 호미곶면 대보리 293-1 | 경상북도 포항시 남구 호미곶면 해맞이로 136 | 5447.7 | 문화및집회시설 | 2023-04-07 | 2
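
A duplicate here means a row whose every field matches another row. The sketch below is one way to find such rows with the standard library, using shortened records (건물명, 연면적, 주용도) taken from this report as a stand-in for the full six-column rows. It is an illustration of the idea, not the profiler's own implementation.

```python
from collections import Counter

# Shortened stand-ins for full records: (건물명, 연면적, 주용도).
# All values are taken from the tables in this report.
rows = [
    ("새천년기념관", 5447.7, "문화및집회시설"),
    ("포항시청", 8380.8, "업무시설"),
    ("새천년기념관", 5447.7, "문화및집회시설"),
    ("포항시청", 9195.98, "업무시설"),
]

# Count identical rows; anything seen more than once is a duplicate group.
counts = Counter(rows)
duplicates = {row: n for row, n in counts.items() if n > 1}

# Number of redundant rows (occurrences beyond the first in each group).
extra_rows = sum(n - 1 for n in duplicates.values())

print(duplicates, extra_rows)
```

The two 포항시청 rows share a name but differ in floor area, so they are not duplicates. The 새천년기념관 group has 2 occurrences and therefore 1 redundant row, which is consistent with the "# duplicates 2" and "Duplicate rows 1" figures if the overview counts only the redundant copies.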