Overview

Dataset statistics

Number of variables4
Number of observations135
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.6 KiB
Average record size in memory35.0 B

Variable types

Numeric2
Text2

Dataset

Description서울특별시 서초구 기계설비 성능점검 대상 건축물 현황입니다. 건물명, 우편번호, 도로명주소 정보를 제공하고 있습니다.
Author서울특별시 서초구
URLhttps://www.data.go.kr/data/15125082/fileData.do

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 16:36:12.206078
Analysis finished2023-12-12 16:36:13.109456
Duration0.9 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct135
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68
Minimum1
Maximum135
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-13T01:36:13.226501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.7
Q134.5
median68
Q3101.5
95-th percentile128.3
Maximum135
Range134
Interquartile range (IQR)67

Descriptive statistics

Standard deviation39.115214
Coefficient of variation (CV)0.57522374
Kurtosis-1.2
Mean68
Median Absolute Deviation (MAD)34
Skewness0
Sum9180
Variance1530
MonotonicityStrictly increasing
2023-12-13T01:36:13.454900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
94 1
 
0.7%
88 1
 
0.7%
89 1
 
0.7%
90 1
 
0.7%
91 1
 
0.7%
92 1
 
0.7%
93 1
 
0.7%
95 1
 
0.7%
2 1
 
0.7%
Other values (125) 125
92.6%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
135 1
0.7%
134 1
0.7%
133 1
0.7%
132 1
0.7%
131 1
0.7%
130 1
0.7%
129 1
0.7%
128 1
0.7%
127 1
0.7%
126 1
0.7%
Distinct134
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T01:36:13.773262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length7.3037037
Min length3

Characters and Unicode

Total characters986
Distinct characters239
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique133 ?
Unique (%)98.5%

Sample

1st row양진빌딩
2nd row서초타운트라팰리스
3rd row신반포상가
4th row서초구청
5th row대각빌딩
ValueCountFrequency (%)
서초 4
 
2.4%
남서울빌딩 2
 
1.2%
국립국악원 2
 
1.2%
래미안 2
 
1.2%
서초동 2
 
1.2%
서초지웰타워 1
 
0.6%
국악누리동 1
 
0.6%
가산빌딩 1
 
0.6%
블루콤타워 1
 
0.6%
우면초등학교(본관,별관 1
 
0.6%
Other values (147) 147
89.6%
2023-12-13T01:36:14.199027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
4.3%
31
 
3.1%
30
 
3.0%
27
 
2.7%
24
 
2.4%
23
 
2.3%
21
 
2.1%
20
 
2.0%
18
 
1.8%
17
 
1.7%
Other values (229) 733
74.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 896
90.9%
Space Separator 31
 
3.1%
Decimal Number 18
 
1.8%
Uppercase Letter 15
 
1.5%
Open Punctuation 11
 
1.1%
Close Punctuation 11
 
1.1%
Dash Punctuation 2
 
0.2%
Other Punctuation 1
 
0.1%
Letter Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
4.7%
30
 
3.3%
27
 
3.0%
24
 
2.7%
23
 
2.6%
21
 
2.3%
20
 
2.2%
18
 
2.0%
17
 
1.9%
17
 
1.9%
Other values (206) 657
73.3%
Uppercase Letter
ValueCountFrequency (%)
T 3
20.0%
C 2
13.3%
D 1
 
6.7%
I 1
 
6.7%
K 1
 
6.7%
O 1
 
6.7%
W 1
 
6.7%
E 1
 
6.7%
R 1
 
6.7%
A 1
 
6.7%
Other values (2) 2
13.3%
Decimal Number
ValueCountFrequency (%)
1 8
44.4%
5 3
 
16.7%
3 3
 
16.7%
6 2
 
11.1%
2 2
 
11.1%
Space Separator
ValueCountFrequency (%)
31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 896
90.9%
Common 74
 
7.5%
Latin 16
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
4.7%
30
 
3.3%
27
 
3.0%
24
 
2.7%
23
 
2.6%
21
 
2.3%
20
 
2.2%
18
 
2.0%
17
 
1.9%
17
 
1.9%
Other values (206) 657
73.3%
Latin
ValueCountFrequency (%)
T 3
18.8%
C 2
12.5%
D 1
 
6.2%
I 1
 
6.2%
K 1
 
6.2%
O 1
 
6.2%
W 1
 
6.2%
E 1
 
6.2%
R 1
 
6.2%
A 1
 
6.2%
Other values (3) 3
18.8%
Common
ValueCountFrequency (%)
31
41.9%
( 11
 
14.9%
) 11
 
14.9%
1 8
 
10.8%
5 3
 
4.1%
3 3
 
4.1%
6 2
 
2.7%
- 2
 
2.7%
2 2
 
2.7%
, 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 896
90.9%
ASCII 89
 
9.0%
Number Forms 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
4.7%
30
 
3.3%
27
 
3.0%
24
 
2.7%
23
 
2.6%
21
 
2.3%
20
 
2.2%
18
 
2.0%
17
 
1.9%
17
 
1.9%
Other values (206) 657
73.3%
ASCII
ValueCountFrequency (%)
31
34.8%
( 11
 
12.4%
) 11
 
12.4%
1 8
 
9.0%
5 3
 
3.4%
T 3
 
3.4%
3 3
 
3.4%
6 2
 
2.2%
- 2
 
2.2%
C 2
 
2.2%
Other values (12) 13
14.6%
Number Forms
ValueCountFrequency (%)
1
100.0%

우편번호
Real number (ℝ)

Distinct93
Distinct (%)68.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6651.363
Minimum6307
Maximum6802
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-13T01:36:14.370846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6307
5-th percentile6515.2
Q16599.5
median6639
Q36733.5
95-th percentile6783.6
Maximum6802
Range495
Interquartile range (IQR)134

Descriptive statistics

Standard deviation88.890475
Coefficient of variation (CV)0.01336425
Kurtosis0.38541403
Mean6651.363
Median Absolute Deviation (MAD)63
Skewness-0.29339626
Sum897934
Variance7901.5165
MonotonicityNot monotonic
2023-12-13T01:36:14.575671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6621 5
 
3.7%
6757 5
 
3.7%
6765 4
 
3.0%
6654 4
 
3.0%
6644 3
 
2.2%
6768 3
 
2.2%
6627 3
 
2.2%
6628 3
 
2.2%
6695 3
 
2.2%
6521 3
 
2.2%
Other values (83) 99
73.3%
ValueCountFrequency (%)
6307 1
 
0.7%
6503 1
 
0.7%
6504 2
1.5%
6508 1
 
0.7%
6509 1
 
0.7%
6511 1
 
0.7%
6517 1
 
0.7%
6518 2
1.5%
6521 3
2.2%
6524 1
 
0.7%
ValueCountFrequency (%)
6802 2
1.5%
6801 1
 
0.7%
6800 1
 
0.7%
6799 1
 
0.7%
6798 1
 
0.7%
6792 1
 
0.7%
6780 1
 
0.7%
6774 1
 
0.7%
6768 3
2.2%
6767 3
2.2%
Distinct130
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T01:36:14.964168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length22
Mean length18.437037
Min length15

Characters and Unicode

Total characters2489
Distinct characters72
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique126 ?
Unique (%)93.3%

Sample

1st row서울특별시 서초구 반포대로 138
2nd row서울특별시 서초구 서초대로74길 23
3rd row서울특별시 서초구 신반포로15길 29
4th row서울특별시 서초구 남부순환로 2584
5th row서울특별시 서초구 서초대로78길 5
ValueCountFrequency (%)
서울특별시 135
25.0%
서초구 135
25.0%
강남대로 14
 
2.6%
남부순환로 11
 
2.0%
서초중앙로 11
 
2.0%
효령로 6
 
1.1%
서초대로 6
 
1.1%
양재대로2길 5
 
0.9%
사임당로 5
 
0.9%
반포대로 4
 
0.7%
Other values (158) 208
38.5%
2023-12-13T01:36:15.530925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
405
16.3%
297
11.9%
158
 
6.3%
135
 
5.4%
135
 
5.4%
135
 
5.4%
135
 
5.4%
135
 
5.4%
133
 
5.3%
1 88
 
3.5%
Other values (62) 733
29.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1651
66.3%
Decimal Number 428
 
17.2%
Space Separator 405
 
16.3%
Dash Punctuation 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
297
18.0%
158
9.6%
135
8.2%
135
8.2%
135
8.2%
135
8.2%
135
8.2%
133
8.1%
48
 
2.9%
46
 
2.8%
Other values (50) 294
17.8%
Decimal Number
ValueCountFrequency (%)
1 88
20.6%
2 69
16.1%
3 61
14.3%
4 44
10.3%
5 44
10.3%
7 31
 
7.2%
6 26
 
6.1%
8 25
 
5.8%
0 21
 
4.9%
9 19
 
4.4%
Space Separator
ValueCountFrequency (%)
405
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1651
66.3%
Common 838
33.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
297
18.0%
158
9.6%
135
8.2%
135
8.2%
135
8.2%
135
8.2%
135
8.2%
133
8.1%
48
 
2.9%
46
 
2.8%
Other values (50) 294
17.8%
Common
ValueCountFrequency (%)
405
48.3%
1 88
 
10.5%
2 69
 
8.2%
3 61
 
7.3%
4 44
 
5.3%
5 44
 
5.3%
7 31
 
3.7%
6 26
 
3.1%
8 25
 
3.0%
0 21
 
2.5%
Other values (2) 24
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1651
66.3%
ASCII 838
33.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
405
48.3%
1 88
 
10.5%
2 69
 
8.2%
3 61
 
7.3%
4 44
 
5.3%
5 44
 
5.3%
7 31
 
3.7%
6 26
 
3.1%
8 25
 
3.0%
0 21
 
2.5%
Other values (2) 24
 
2.9%
Hangul
ValueCountFrequency (%)
297
18.0%
158
9.6%
135
8.2%
135
8.2%
135
8.2%
135
8.2%
135
8.2%
133
8.1%
48
 
2.9%
46
 
2.8%
Other values (50) 294
17.8%

Interactions

2023-12-13T01:36:12.720446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:36:12.485495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:36:12.835960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:36:12.608649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:36:15.647895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번우편번호
연번1.0000.188
우편번호0.1881.000
2023-12-13T01:36:15.749288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번우편번호
연번1.0000.104
우편번호0.1041.000

Missing values

2023-12-13T01:36:12.954307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:36:13.062194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번건물명우편번호도로명주소
01양진빌딩6595서울특별시 서초구 반포대로 138
12서초타운트라팰리스6621서울특별시 서초구 서초대로74길 23
23신반포상가6503서울특별시 서초구 신반포로15길 29
34서초구청6750서울특별시 서초구 남부순환로 2584
45대각빌딩6620서울특별시 서초구 서초대로78길 5
56서초동 광일빌딩6627서울특별시 서초구 강남대로 331
67그린빌오피스텔6654서울특별시 서초구 효령로55길 22
78청화오피스텔6628서울특별시 서초구 효령로 431
89케이타워6633서울특별시 서초구 서초대로 320
910강남역 한화오벨리스크6621서울특별시 서초구 서초대로74길 27
연번건물명우편번호도로명주소
125126진흥아파트6617서울특별시 서초구 서초대로 385
126127신반포자이6521서울특별시 서초구 잠원로 60
127128아크로리버뷰신반포6508서울특별시 서초구 잠원로 117
128129래미안서초에스티지에스6625서울특별시 서초구 서운로 104
129130래미안방배아트힐6715서울특별시 서초구 남부순환로 2311-12
130131서초포레스타6단지6802서울특별시 서초구 청계산로9길 1-12
131132서초호반써밋6767서울특별시 서초구 양재대로2길 109
132133서초포레스타5단지6798서울특별시 서초구 청계산로7길 43
133134잠원한신아파트6518서울특별시 서초구 잠원로 150
134135서초네이처힐5단지6764서울특별시 서초구 태봉로2길 5