Overview

Dataset statistics

Number of variables: 6
Number of observations: 1134
Missing cells: 278
Missing cells (%): 4.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 54.4 KiB
Average record size in memory: 49.1 B
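
The percentages and averages above follow directly from the raw counts. A minimal sketch of that arithmetic, assuming 1 KiB means 1024 bytes (the variable names are illustrative and not part of the report):

```python
# Figures copied from the dataset statistics above.
n_variables = 6
n_observations = 1134
missing_cells = 278
total_size_kib = 54.4  # assumption: 1 KiB = 1024 bytes

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

All 278 missing cells come from a single column, so the dataset-wide share is much lower than that column's own missing rate.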

Variable types

Numeric: 1
Text: 4
Categorical: 1

Dataset

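Description: 부산광역시_건축사사무소현황_20240118 (Busan Metropolitan City, status of architectural firms, 2024-01-18)
Author: 부산광역시 (Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15034666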

Alerts

데이터기준일자 (data reference date) has constant value "2024-01-18" [Constant]
전화번호 (phone number) has 278 (24.5%) missing values [Missing]
순번 (sequence number) has unique values [Unique]
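
Each alert corresponds to a simple per-column check. The sketch below applies such checks to four rows taken from the end of the dataset; the function name `column_alerts` and its rules are illustrative assumptions, not the profiler's actual implementation.

```python
def column_alerts(values):
    """Return alert labels for one column; None marks a missing value."""
    present = [v for v in values if v is not None]
    alerts = []
    if len(present) < len(values):
        alerts.append("Missing")
    if present and len(set(present)) == 1:
        alerts.append("Constant")
    if len(present) > 1 and len(set(present)) == len(values):
        alerts.append("Unique")
    return alerts


# Rows with 순번 1127-1130: (순번, 전화번호, 데이터기준일자).
rows = [
    (1127, None, "2024-01-18"),
    (1128, "051-462-4712", "2024-01-18"),
    (1129, None, "2024-01-18"),
    (1130, "051-781-2144", "2024-01-18"),
]
names = ["순번", "전화번호", "데이터기준일자"]
for name, column in zip(names, zip(*rows)):
    print(name, column_alerts(list(column)))
```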

Reproduction

Analysis started: 2024-03-13 13:19:58.565921
Analysis finished: 2024-03-13 13:19:59.620204
Duration: 1.05 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
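
The duration is the difference between the two timestamps. A short check using only the standard library:

```python
from datetime import datetime

started = datetime.fromisoformat("2024-03-13 13:19:58.565921")
finished = datetime.fromisoformat("2024-03-13 13:19:59.620204")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```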

Variables

순번 (sequence number)
Real number (ℝ)

UNIQUE 

Distinct: 1134
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 567.5
Minimum: 1
Maximum: 1134
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 10.1 KiB

Quantile statistics

Minimum: 1
5th percentile: 57.65
Q1: 284.25
Median: 567.5
Q3: 850.75
95th percentile: 1077.35
Maximum: 1134
Range: 1133
Interquartile range (IQR): 566.5

Descriptive statistics

Standard deviation: 327.50191
Coefficient of variation (CV): 0.57709587
Kurtosis: -1.2
Mean: 567.5
Median Absolute Deviation (MAD): 283.5
Skewness: 0
Sum: 643545
Variance: 107257.5
Monotonicity: Strictly increasing
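
These figures are what a gap-free sequence 1, 2, ..., 1134 produces. The sketch below recomputes them with Python's `statistics` module, assuming the column holds exactly those integers (consistent with 1134 distinct values, minimum 1, maximum 1134, strictly increasing) and that percentiles use linear interpolation between data points.

```python
import statistics

values = list(range(1, 1135))  # assumption: 순번 is exactly 1..1134

mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std_dev = statistics.stdev(values)
cv = std_dev / mean
# Median absolute deviation: median distance from the median.
mad = statistics.median(abs(v - median) for v in values)

# "inclusive" interpolates linearly between the minimum and maximum.
quartiles = statistics.quantiles(values, n=4, method="inclusive")
ventiles = statistics.quantiles(values, n=20, method="inclusive")

print("sum:", sum(values))
print("mean, median:", mean, median)
print("variance, std:", variance, round(std_dev, 5))
print("CV, MAD:", round(cv, 8), mad)
print("Q1, Q3:", quartiles[0], quartiles[2])
print("5th, 95th percentile:", ventiles[0], ventiles[-1])
```

Because the column is just a row counter, these statistics describe the row count rather than any property of the architectural firms.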
Histogram with fixed size bins (bins=50)
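
Fixed-size binning splits the value range into equal-width intervals and counts the values in each. A minimal sketch with 50 bins, assuming the column is the sequence 1..1134 and that the maximum is placed in the last bin (a common convention, not confirmed by the report):

```python
values = range(1, 1135)  # assumption: 순번 is exactly 1..1134
bins = 50

lo, hi = min(values), max(values)
width = (hi - lo) / bins

counts = [0] * bins
for v in values:
    # The maximum sits on the right edge, so clamp it into the last bin.
    index = min(int((v - lo) / width), bins - 1)
    counts[index] += 1

print(f"bin width: {width:.2f}")
print("distinct bin counts:", sorted(set(counts)))
```

For an evenly spaced sequence, every bin holds nearly the same number of values, so the histogram is essentially flat.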

Common values

Value   Count   Frequency (%)
1   1   0.1%
755   1   0.1%
761   1   0.1%
760   1   0.1%
759   1   0.1%
758   1   0.1%
757   1   0.1%
756   1   0.1%
754   1   0.1%
763   1   0.1%
Other values (1124)   1124   99.1%

Minimum 10 values

Value   Count   Frequency (%)
1   1   0.1%
2   1   0.1%
3   1   0.1%
4   1   0.1%
5   1   0.1%
6   1   0.1%
7   1   0.1%
8   1   0.1%
9   1   0.1%
10   1   0.1%

Maximum 10 values

Value   Count   Frequency (%)
1134   1   0.1%
1133   1   0.1%
1132   1   0.1%
1131   1   0.1%
1130   1   0.1%
1129   1   0.1%
1128   1   0.1%
1127   1   0.1%
1126   1   0.1%
1125   1   0.1%

사무소명 (office name)
Text

Distinct: 1050
Distinct (%): 92.6%
Missing: 0
Missing (%): 0.0%
Memory size: 9.0 KiB

Length

Max length: 21
Median length: 19
Mean length: 11.032628
Min length: 7

Characters and Unicode

Total characters: 12511
Distinct characters: 357
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 992
Unique (%): 87.5%

Sample

1st row: 건축사사무소 한국
2nd row: 신기종합건축사사무소
3rd row: 삼아 건축사사무소
4th row: 혁진 건축사사무소
5th row: (주)상지엔지니어링건축사사무소

Value   Count   Frequency (%)
건축사사무소   520   27.5%
주식회사   87   4.6%
종합건축사사무소   82   4.3%
주)종합건축사사무소   16   0.8%
주)상지엔지니어링건축사사무소   11   0.6%
주)건축사사무소   10   0.5%
건축사   9   0.5%
사무소   9   0.5%
[illegible]   5   0.3%
주)일신설계종합건축사사무소   5   0.3%
Other values (1051)   1136   60.1%
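
The table above counts individual words across all office names. A minimal sketch of such a count on the five sample rows, assuming words are split on whitespace only (the report's own tokenisation evidently differs in detail, since it lists tokens such as "주)종합건축사사무소" without the opening parenthesis):

```python
from collections import Counter

# The five sample values of 사무소명 listed in the report.
names = [
    "건축사사무소 한국",
    "신기종합건축사사무소",
    "삼아 건축사사무소",
    "혁진 건축사사무소",
    "(주)상지엔지니어링건축사사무소",
]

# Assumption: tokens are separated by whitespace and nothing else.
word_counts = Counter(word for name in names for word in name.split())

for word, count in word_counts.most_common(3):
    print(word, count)
```

Splitting on whitespace explains why the generic word 건축사사무소 dominates the table: many names are written as a proper name plus that word, separated by a space.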

Most occurring characters

Value   Count   Frequency (%)
[illegible]   2374   19.0%
[illegible]   1211   9.7%
[illegible]   1180   9.4%
[illegible]   1146   9.2%
[illegible]   1144   9.1%
(space)   777   6.2%
[illegible]   341   2.7%
(   240   1.9%
)   240   1.9%
[illegible]   220   1.8%
Other values (347)   3638   29.1%

Most occurring categories

Value   Count   Frequency (%)
Other Letter   11031   88.2%
Space Separator   777   6.2%
Open Punctuation   240   1.9%
Close Punctuation   240   1.9%
Uppercase Letter   142   1.1%
Other Punctuation   39   0.3%
Decimal Number   22   0.2%
Lowercase Letter   14   0.1%
Dash Punctuation   4   < 0.1%
Final Punctuation   1   < 0.1%
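
The categories in this table are Unicode general categories. Python's `unicodedata.category` returns their two-letter codes, so the breakdown for a single value can be sketched as follows (the choice of sample value is illustrative):

```python
import unicodedata
from collections import Counter

name = "(주)상지엔지니어링건축사사무소"  # 5th sample row of 사무소명

# Two-letter codes: Lo = Other Letter, Ps = Open Punctuation,
# Pe = Close Punctuation, Zs = Space Separator.
categories = Counter(unicodedata.category(ch) for ch in name)

for code, count in categories.most_common():
    print(code, count)
```

Hangul syllables fall under Other Letter (Lo) because the script has no upper or lower case, which is why that category dominates the table.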

Most frequent character per category

Other Letter
Value   Count   Frequency (%)
[illegible]   2374   21.5%
[illegible]   1211   11.0%
[illegible]   1180   10.7%
[illegible]   1146   10.4%
[illegible]   1144   10.4%
[illegible]   341   3.1%
[illegible]   220   2.0%
[illegible]   215   1.9%
[illegible]   192   1.7%
[illegible]   134   1.2%
Other values (298)   2874   26.1%

Uppercase Letter
Value   Count   Frequency (%)
A   30   21.1%
T   11   7.7%
M   11   7.7%
C   11   7.7%
N   11   7.7%
S   9   6.3%
E   9   6.3%
U   8   5.6%
J   7   4.9%
P   7   4.9%
Other values (12)   28   19.7%

Lowercase Letter
Value   Count   Frequency (%)
m   2   14.3%
n   2   14.3%
a   2   14.3%
l   2   14.3%
p   2   14.3%
h   1   7.1%
e   1   7.1%
s   1   7.1%
c   1   7.1%

Other Punctuation
Value   Count   Frequency (%)
.   19   48.7%
&   14   35.9%
·   2   5.1%
'   2   5.1%
#   1   2.6%
,   1   2.6%

Decimal Number
Value   Count   Frequency (%)
1   9   40.9%
2   8   36.4%
5   2   9.1%
0   1   4.5%
8   1   4.5%
4   1   4.5%

Space Separator
Value   Count   Frequency (%)
(space)   777   100.0%

Open Punctuation
Value   Count   Frequency (%)
(   240   100.0%

Close Punctuation
Value   Count   Frequency (%)
)   240   100.0%

Dash Punctuation
Value   Count   Frequency (%)
-   4   100.0%

Final Punctuation
Value   Count   Frequency (%)
[illegible]   1   100.0%

Other Symbol
Value   Count   Frequency (%)
[illegible]   1   100.0%

Most occurring scripts

Value   Count   Frequency (%)
Hangul   11030   88.2%
Common   1323   10.6%
Latin   156   1.2%
Han   2   < 0.1%

Most frequent character per script

Hangul
Value   Count   Frequency (%)
[illegible]   2374   21.5%
[illegible]   1211   11.0%
[illegible]   1180   10.7%
[illegible]   1146   10.4%
[illegible]   1144   10.4%
[illegible]   341   3.1%
[illegible]   220   2.0%
[illegible]   215   1.9%
[illegible]   192   1.7%
[illegible]   134   1.2%
Other values (297)   2873   26.0%

Latin
Value   Count   Frequency (%)
A   30   19.2%
T   11   7.1%
M   11   7.1%
C   11   7.1%
N   11   7.1%
S   9   5.8%
E   9   5.8%
U   8   5.1%
J   7   4.5%
P   7   4.5%
Other values (21)   42   26.9%

Common
Value   Count   Frequency (%)
(space)   777   58.7%
(   240   18.1%
)   240   18.1%
.   19   1.4%
&   14   1.1%
1   9   0.7%
2   8   0.6%
-   4   0.3%
·   2   0.2%
5   2   0.2%
Other values (7)   8   0.6%

Han
Value   Count   Frequency (%)
[illegible]   1   50.0%
[illegible]   1   50.0%

Most occurring blocks

Value   Count   Frequency (%)
Hangul   11029   88.2%
ASCII   1476   11.8%
None   3   < 0.1%
CJK   2   < 0.1%
Punctuation   1   < 0.1%
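
A Unicode block is a contiguous range of code points, so a character's block can be found by comparing its code point against range boundaries. The sketch below covers only three ranges; the function name `block_of` and the catch-all label "Other" are illustrative assumptions, and the profiler's own block table is more complete.

```python
from collections import Counter


def block_of(ch):
    """Coarse block label for one character (illustrative ranges only)."""
    cp = ord(ch)
    if cp < 0x80:
        return "ASCII"
    if 0xAC00 <= cp <= 0xD7A3:
        return "Hangul"  # assigned range of the Hangul Syllables block
    if 0x4E00 <= cp <= 0x9FFF:
        return "CJK"  # CJK Unified Ideographs block
    return "Other"


name = "HAUS 건축사사무소"  # an office name from the last rows of the dataset
blocks = Counter(block_of(ch) for ch in name)
print(blocks)
```

Note that the space character belongs to the ASCII block, which is why ASCII accounts for a larger share here than Latin letters alone would suggest.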

Most frequent character per block

Hangul
Value   Count   Frequency (%)
[illegible]   2374   21.5%
[illegible]   1211   11.0%
[illegible]   1180   10.7%
[illegible]   1146   10.4%
[illegible]   1144   10.4%
[illegible]   341   3.1%
[illegible]   220   2.0%
[illegible]   215   1.9%
[illegible]   192   1.7%
[illegible]   134   1.2%
Other values (296)   2872   26.0%

ASCII
Value   Count   Frequency (%)
(space)   777   52.6%
(   240   16.3%
)   240   16.3%
A   30   2.0%
.   19   1.3%
&   14   0.9%
T   11   0.7%
M   11   0.7%
C   11   0.7%
N   11   0.7%
Other values (36)   112   7.6%

None
Value   Count   Frequency (%)
·   2   66.7%
[illegible]   1   33.3%

Punctuation
Value   Count   Frequency (%)
[illegible]   1   100.0%

CJK
Value   Count   Frequency (%)
[illegible]   1   50.0%
[illegible]   1   50.0%

도로명주소 (road-name address)
Text

Distinct: 945
Distinct (%): 83.3%
Missing: 0
Missing (%): 0.0%
Memory size: 9.0 KiB

Length

Max length: 49
Median length: 39
Mean length: 28.267196
Min length: 1

Characters and Unicode

Total characters: 32055
Distinct characters: 365
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3

Unique

Unique: 824
Unique (%): 72.7%

Sample

1st row: 부산광역시 부산진구 부전로152번길 31-7
2nd row: 부산광역시 사상구 학감대로 238-10 (감전동 , 협성빌딩202호)
3rd row: 부산광역시 부산진구 새싹로 31
4th row: 부산광역시 동구 중앙대로320번길 7-8, 대양빌딩 302
5th row: 부산광역시 중구 자갈치로 42

Value   Count   Frequency (%)
부산광역시   1108   18.3%
해운대구   178   2.9%
연제구   126   2.1%
부산진구   118   2.0%
3층   95   1.6%
수영구   95   1.6%
2층   94   1.6%
동래구   94   1.6%
금정구   93   1.5%
동구   89   1.5%
Other values (1534)   3960   65.5%

Most occurring characters

Value   Count   Frequency (%)
(space)   4978   15.5%
[illegible]   1301   4.1%
1   1292   4.0%
[illegible]   1262   3.9%
[illegible]   1177   3.7%
[illegible]   1159   3.6%
[illegible]   1109   3.5%
[illegible]   1102   3.4%
[illegible]   1101   3.4%
,   1085   3.4%
Other values (355)   16489   51.4%

Most occurring categories

Value   Count   Frequency (%)
Other Letter   18756   58.5%
Decimal Number   6317   19.7%
Space Separator   4978   15.5%
Other Punctuation   1095   3.4%
Open Punctuation   300   0.9%
Close Punctuation   299   0.9%
Dash Punctuation   180   0.6%
Uppercase Letter   108   0.3%
Lowercase Letter   18   0.1%
Control   4   < 0.1%

Most frequent character per category

Other Letter
Value   Count   Frequency (%)
[illegible]   1301   6.9%
[illegible]   1262   6.7%
[illegible]   1177   6.3%
[illegible]   1159   6.2%
[illegible]   1109   5.9%
[illegible]   1102   5.9%
[illegible]   1101   5.9%
[illegible]   689   3.7%
[illegible]   588   3.1%
[illegible]   528   2.8%
Other values (311)   8740   46.6%

Uppercase Letter
Value   Count   Frequency (%)
A   18   16.7%
B   15   13.9%
C   11   10.2%
H   7   6.5%
K   7   6.5%
T   7   6.5%
E   6   5.6%
O   6   5.6%
P   4   3.7%
S   4   3.7%
Other values (9)   23   21.3%

Decimal Number
Value   Count   Frequency (%)
1   1292   20.5%
2   989   15.7%
0   796   12.6%
3   744   11.8%
4   475   7.5%
7   449   7.1%
6   437   6.9%
5   434   6.9%
9   403   6.4%
8   298   4.7%

Other Punctuation
Value   Count   Frequency (%)
,   1085   99.1%
/   6   0.5%
.   2   0.2%
·   1   0.1%
@   1   0.1%

Lowercase Letter
Value   Count   Frequency (%)
e   12   66.7%
s   2   11.1%
k   2   11.1%
y   1   5.6%
h   1   5.6%

Space Separator
Value   Count   Frequency (%)
(space)   4978   100.0%

Open Punctuation
Value   Count   Frequency (%)
(   300   100.0%

Close Punctuation
Value   Count   Frequency (%)
)   299   100.0%

Dash Punctuation
Value   Count   Frequency (%)
-   180   100.0%

Control
Value   Count   Frequency (%)
[illegible]   4   100.0%

Most occurring scripts

Value   Count   Frequency (%)
Hangul   18756   58.5%
Common   13173   41.1%
Latin   126   0.4%

Most frequent character per script

Hangul
Value   Count   Frequency (%)
[illegible]   1301   6.9%
[illegible]   1262   6.7%
[illegible]   1177   6.3%
[illegible]   1159   6.2%
[illegible]   1109   5.9%
[illegible]   1102   5.9%
[illegible]   1101   5.9%
[illegible]   689   3.7%
[illegible]   588   3.1%
[illegible]   528   2.8%
Other values (311)   8740   46.6%

Latin
Value   Count   Frequency (%)
A   18   14.3%
B   15   11.9%
e   12   9.5%
C   11   8.7%
H   7   5.6%
K   7   5.6%
T   7   5.6%
E   6   4.8%
O   6   4.8%
P   4   3.2%
Other values (14)   33   26.2%

Common
Value   Count   Frequency (%)
(space)   4978   37.8%
1   1292   9.8%
,   1085   8.2%
2   989   7.5%
0   796   6.0%
3   744   5.6%
4   475   3.6%
7   449   3.4%
6   437   3.3%
5   434   3.3%
Other values (10)   1494   11.3%

Most occurring blocks

Value   Count   Frequency (%)
Hangul   18756   58.5%
ASCII   13298   41.5%
None   1   < 0.1%

Most frequent character per block

ASCII
Value   Count   Frequency (%)
(space)   4978   37.4%
1   1292   9.7%
,   1085   8.2%
2   989   7.4%
0   796   6.0%
3   744   5.6%
4   475   3.6%
7   449   3.4%
6   437   3.3%
5   434   3.3%
Other values (33)   1619   12.2%

Hangul
Value   Count   Frequency (%)
[illegible]   1301   6.9%
[illegible]   1262   6.7%
[illegible]   1177   6.3%
[illegible]   1159   6.2%
[illegible]   1109   5.9%
[illegible]   1102   5.9%
[illegible]   1101   5.9%
[illegible]   689   3.7%
[illegible]   588   3.1%
[illegible]   528   2.8%
Other values (311)   8740   46.6%

None
Value   Count   Frequency (%)
·   1   100.0%

전화번호 (phone number)
Text

MISSING 

Distinct: 729
Distinct (%): 85.2%
Missing: 278
Missing (%): 24.5%
Memory size: 9.0 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.044393
Min length: 12

Characters and Unicode

Total characters: 10310
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 641
Unique (%): 74.9%

Sample

1st row: 051-817-2119
2nd row: 051-328-4474
3rd row: 051-816-8193
4th row: 051-469-1285
5th row: 051-247-0208

Value   Count   Frequency (%)
051-247-0208   13   1.5%
051-462-4712   6   0.7%
051-632-8634   6   0.7%
070-4044-7174   4   0.5%
051-781-5781   4   0.5%
051-462-0463   4   0.5%
051-514-8008   3   0.4%
051-463-3355   3   0.4%
051-626-7341   3   0.4%
051-502-1520   3   0.4%
Other values (719)   807   94.3%
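
The length range (12 to 13 characters) and the character set (digits and hyphens only) suggest a uniform hyphenated format, and the frequency table shows that several records share a number. A minimal sketch of both checks on a handful of values from the report; the pattern `PHONE_PATTERN` is a hypothetical rule inferred from those statistics, not an official numbering specification.

```python
import re
from collections import Counter

# Hypothetical pattern: 2-3 digit prefix, 3-4 digit exchange, 4 digit line.
PHONE_PATTERN = re.compile(r"\d{2,3}-\d{3,4}-\d{4}")

# Values taken from the report; None stands for a missing phone number.
phones = [
    "051-817-2119",
    "051-328-4474",
    "051-247-0208",
    "051-247-0208",
    "070-4044-7174",
    None,
]

present = [p for p in phones if p is not None]
malformed = [p for p in present if not PHONE_PATTERN.fullmatch(p)]
shared = {p: n for p, n in Counter(present).items() if n > 1}

print("lengths:", sorted({len(p) for p in present}))
print("malformed:", malformed)
print("shared numbers:", shared)
```

Shared numbers are expected here, since several registered architects can belong to the same office.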

Most occurring characters

Value   Count   Frequency (%)
-   1712   16.6%
5   1549   15.0%
0   1545   15.0%
1   1485   14.4%
7   668   6.5%
2   666   6.5%
4   639   6.2%
6   613   5.9%
8   570   5.5%
3   512   5.0%

Most occurring categories

Value   Count   Frequency (%)
Decimal Number   8598   83.4%
Dash Punctuation   1712   16.6%

Most frequent character per category

Decimal Number
Value   Count   Frequency (%)
5   1549   18.0%
0   1545   18.0%
1   1485   17.3%
7   668   7.8%
2   666   7.7%
4   639   7.4%
6   613   7.1%
8   570   6.6%
3   512   6.0%
9   351   4.1%

Dash Punctuation
Value   Count   Frequency (%)
-   1712   100.0%

Most occurring scripts

Value   Count   Frequency (%)
Common   10310   100.0%

Most frequent character per script

Common
Value   Count   Frequency (%)
-   1712   16.6%
5   1549   15.0%
0   1545   15.0%
1   1485   14.4%
7   668   6.5%
2   666   6.5%
4   639   6.2%
6   613   5.9%
8   570   5.5%
3   512   5.0%

Most occurring blocks

Value   Count   Frequency (%)
ASCII   10310   100.0%

Most frequent character per block

ASCII
Value   Count   Frequency (%)
-   1712   16.6%
5   1549   15.0%
0   1545   15.0%
1   1485   14.4%
7   668   6.5%
2   666   6.5%
4   639   6.2%
6   613   5.9%
8   570   5.5%
3   512   5.0%

신고건축사 (registered architect)
Text

Distinct: 1106
Distinct (%): 97.5%
Missing: 0
Missing (%): 0.0%
Memory size: 9.0 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.9814815
Min length: 2

Characters and Unicode

Total characters: 3381
Distinct characters: 206
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 1078
Unique (%): 95.1%

Sample

1st row: 김창신
2nd row: 김건수
3rd row: 김성곤
4th row: 임창수
5th row: 김동균

Value   Count   Frequency (%)
박재현   2   0.2%
이상호   2   0.2%
최재훈   2   0.2%
이상일   2   0.2%
이창훈   2   0.2%
이동준   2   0.2%
김동준   2   0.2%
김영환   2   0.2%
이현정   2   0.2%
김도형   2   0.2%
Other values (1096)   1114   98.2%

Most occurring characters

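Value   Count   Frequency (%)
[illegible]   216   6.4%
[illegible]   180   5.3%
[illegible]   128   3.8%
[illegible]   114   3.4%
[illegible]   93   2.8%
[illegible]   74   2.2%
[illegible]   70   2.1%
[illegible]   65   1.9%
[illegible]   62   1.8%
[illegible]   61   1.8%
Other values (196)   2318   68.6%

Most occurring categories

Value   Count   Frequency (%)
Other Letter   3381   100.0%

Most frequent character per category

Other Letter
Value   Count   Frequency (%)
[illegible]   216   6.4%
[illegible]   180   5.3%
[illegible]   128   3.8%
[illegible]   114   3.4%
[illegible]   93   2.8%
[illegible]   74   2.2%
[illegible]   70   2.1%
[illegible]   65   1.9%
[illegible]   62   1.8%
[illegible]   61   1.8%
Other values (196)   2318   68.6%

Most occurring scripts

Value   Count   Frequency (%)
Hangul   3381   100.0%

Most frequent character per script

Hangul
Value   Count   Frequency (%)
[illegible]   216   6.4%
[illegible]   180   5.3%
[illegible]   128   3.8%
[illegible]   114   3.4%
[illegible]   93   2.8%
[illegible]   74   2.2%
[illegible]   70   2.1%
[illegible]   65   1.9%
[illegible]   62   1.8%
[illegible]   61   1.8%
Other values (196)   2318   68.6%

Most occurring blocks

Value   Count   Frequency (%)
Hangul   3381   100.0%

Most frequent character per block

Hangul
Value   Count   Frequency (%)
[illegible]   216   6.4%
[illegible]   180   5.3%
[illegible]   128   3.8%
[illegible]   114   3.4%
[illegible]   93   2.8%
[illegible]   74   2.2%
[illegible]   70   2.1%
[illegible]   65   1.9%
[illegible]   62   1.8%
[illegible]   61   1.8%
Other values (196)   2318   68.6%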

데이터기준일자 (data reference date)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 9.0 KiB
2024-01-18: 1134

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024-01-18
2nd row: 2024-01-18
3rd row: 2024-01-18
4th row: 2024-01-18
5th row: 2024-01-18

Common Values

Value   Count   Frequency (%)
2024-01-18   1134   100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value   Count   Frequency (%)
2024-01-18   1134   100.0%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
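
Both displays are built from a per-cell "is this value missing?" flag. A minimal sketch for the 전화번호 column, using the last ten rows of the dataset as listed in the Sample section; the text rendering with `#` and `.` is an illustrative stand-in for the plotted matrix.

```python
# 전화번호 for the last ten rows (순번 1125-1134); None marks <NA>.
tail_phones = [
    None,
    None,
    None,
    "051-462-4712",
    None,
    "051-781-2144",
    "051-516-0482",
    "051-714-2040",
    "051-632-8634",
    "051-791-1026",
]

# Nullity count for the column (what the bar chart summarises).
missing = sum(p is None for p in tail_phones)
print(f"{missing} of {len(tail_phones)} values missing")

# One nullity-matrix column: '#' where a value is present, '.' where missing.
matrix_column = "".join("." if p is None else "#" for p in tail_phones)
print(matrix_column)

# The same ratio for the whole column, from the reported counts.
print(f"{100 * 278 / 1134:.1f}% missing overall")
```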

Sample

First rows

(index)   순번   사무소명   도로명주소   전화번호   신고건축사   데이터기준일자
0   1   건축사사무소 한국   부산광역시 부산진구 부전로152번길 31-7   051-817-2119   김창신   2024-01-18
1   2   신기종합건축사사무소   부산광역시 사상구 학감대로 238-10 (감전동 , 협성빌딩202호)   051-328-4474   김건수   2024-01-18
2   3   삼아 건축사사무소   부산광역시 부산진구 새싹로 31   051-816-8193   김성곤   2024-01-18
3   4   혁진 건축사사무소   부산광역시 동구 중앙대로320번길 7-8, 대양빌딩 302   051-469-1285   임창수   2024-01-18
4   5   (주)상지엔지니어링건축사사무소   부산광역시 중구 자갈치로 42   051-247-0208   김동균   2024-01-18
5   6   (주)상지엔지니어링건축사사무소   부산광역시 중구 자갈치로 42   051-247-0208   이광택   2024-01-18
6   7   (주)상지엔지니어링건축사사무소   부산광역시 중구 자갈치로 42   051-247-0208   김태선   2024-01-18
7   8   (주)상지엔지니어링건축사사무소   부산광역시 중구 자갈치로 42   051-247-0208   박영목   2024-01-18
8   9   (주)상지엔지니어링건축사사무소   부산광역시 중구 자갈치로 42   051-247-0208   윤정아   2024-01-18
9   10   (주)상지엔지니어링건축사사무소   부산광역시 중구 자갈치로 42   051-247-0208   조웅제   2024-01-18

Last rows

(index)   순번   사무소명   도로명주소   전화번호   신고건축사   데이터기준일자
1124   1125   (주)나무종합건축사사무소   부산광역시 연제구 과정로344번길 50, 상가2동 201호   <NA>   김쌍용   2024-01-18
1125   1126   건축사사무소 무무   부산광역시 남구 동명로 182, 1층   <NA>   문철민   2024-01-18
1126   1127   HAUS 건축사사무소   부산광역시 영도구 태종로 413, 3층   <NA>   하남구   2024-01-18
1127   1128   (주)일신이엔지종합건축사사무소   부산광역시 동구 중앙대로320번길 3-2 (초량동)   051-462-4712   위동록   2024-01-18
1128   1129   경부건축종합건축사사무소   부산광역시 연제구 중앙대로1133번길 14, 3층(연산동)   <NA>   강은민   2024-01-18
1129   1130   주원건축사사무소   부산광역시 해운대구 센텀중앙로 97, A동 1208호   051-781-2144   김준기   2024-01-18
1130   1131   시매스건축사사무소   부산광역시 금정구 금정로 191, 3층   051-516-0482   홍동연   2024-01-18
1131   1132   (주)시소건축사사무소   부산광역시 연제구 거제천로124번길 43, 2층(연산동)   051-714-2040   정광수   2024-01-18
1132   1133   아우라 건축사사무소   부산광역시 동구 조방로26번길 7, 103동301호   051-632-8634   박재현   2024-01-18
1133   1134   나우에이앤디건축사사무소   부산광역시 연제구 안연로 38, 그린빌딩3층   051-791-1026   김성중   2024-01-18
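
The sample rows show one record per registered architect rather than per office, so the same office name, address and phone number can repeat. A minimal sketch that groups a few of the rows above by office (the variable names are illustrative):

```python
from collections import defaultdict

# (사무소명, 신고건축사) pairs copied from the first sample rows.
records = [
    ("건축사사무소 한국", "김창신"),
    ("신기종합건축사사무소", "김건수"),
    ("(주)상지엔지니어링건축사사무소", "김동균"),
    ("(주)상지엔지니어링건축사사무소", "이광택"),
    ("(주)상지엔지니어링건축사사무소", "김태선"),
]

architects_by_office = defaultdict(list)
for office, architect in records:
    architects_by_office[office].append(architect)

for office, architects in architects_by_office.items():
    print(office, len(architects), architects)
```

This repetition is consistent with 사무소명 having fewer distinct values (1050) than there are rows (1134).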