Overview

Dataset statistics

Number of variables4
Number of observations70
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory34.9 B

Variable types

Numeric1
Text3

Dataset

Description서울특별시 중랑구에 위치한 건축사사무소 정보를 제공합니다. 사무소주소, 전화번호를 나타냅니다. 업무에 참고해주시기 바랍니다.
URLhttps://www.data.go.kr/data/15034664/fileData.do

Alerts

연번 has unique valuesUnique
사무소명 has unique valuesUnique

Reproduction

Analysis started2023-12-13 01:00:20.485715
Analysis finished2023-12-13 01:00:21.005025
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct70
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.5
Minimum1
Maximum70
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size762.0 B
2023-12-13T10:00:21.056206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.45
Q118.25
median35.5
Q352.75
95-th percentile66.55
Maximum70
Range69
Interquartile range (IQR)34.5

Descriptive statistics

Standard deviation20.351085
Coefficient of variation (CV)0.57327
Kurtosis-1.2
Mean35.5
Median Absolute Deviation (MAD)17.5
Skewness0
Sum2485
Variance414.16667
MonotonicityStrictly increasing
2023-12-13T10:00:21.158515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.4%
46 1
 
1.4%
52 1
 
1.4%
51 1
 
1.4%
50 1
 
1.4%
49 1
 
1.4%
48 1
 
1.4%
47 1
 
1.4%
45 1
 
1.4%
54 1
 
1.4%
Other values (60) 60
85.7%
ValueCountFrequency (%)
1 1
1.4%
2 1
1.4%
3 1
1.4%
4 1
1.4%
5 1
1.4%
6 1
1.4%
7 1
1.4%
8 1
1.4%
9 1
1.4%
10 1
1.4%
ValueCountFrequency (%)
70 1
1.4%
69 1
1.4%
68 1
1.4%
67 1
1.4%
66 1
1.4%
65 1
1.4%
64 1
1.4%
63 1
1.4%
62 1
1.4%
61 1
1.4%

사무소명
Text

UNIQUE 

Distinct70
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size692.0 B
2023-12-13T10:00:21.352976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length10
Min length8

Characters and Unicode

Total characters700
Distinct characters115
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)100.0%

Sample

1st row건축사사무소 창건
2nd row건축사사무소삼진건축
3rd row삼중건축사사무소
4th row박을식건축사사무소
5th row건축사사무소 세윤
ValueCountFrequency (%)
건축사사무소 17
 
18.7%
주식회사 3
 
3.3%
주식회사정다움건축사사무소 1
 
1.1%
엠엠엠웍스건축사사무소(주 1
 
1.1%
소유건축사사무소 1
 
1.1%
주)운율건축사사무소 1
 
1.1%
건축사사무소지선 1
 
1.1%
제이원종합건축사사무소 1
 
1.1%
고요 1
 
1.1%
1
 
1.1%
Other values (63) 63
69.2%
2023-12-13T10:00:21.653686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
145
20.7%
73
 
10.4%
72
 
10.3%
72
 
10.3%
70
 
10.0%
21
 
3.0%
17
 
2.4%
) 11
 
1.6%
( 11
 
1.6%
7
 
1.0%
Other values (105) 201
28.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 652
93.1%
Space Separator 21
 
3.0%
Close Punctuation 11
 
1.6%
Open Punctuation 11
 
1.6%
Uppercase Letter 3
 
0.4%
Decimal Number 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
145
22.2%
73
11.2%
72
11.0%
72
11.0%
70
 
10.7%
17
 
2.6%
7
 
1.1%
7
 
1.1%
7
 
1.1%
7
 
1.1%
Other values (98) 175
26.8%
Uppercase Letter
ValueCountFrequency (%)
D 2
66.7%
L 1
33.3%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Space Separator
ValueCountFrequency (%)
21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 652
93.1%
Common 45
 
6.4%
Latin 3
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
145
22.2%
73
11.2%
72
11.0%
72
11.0%
70
 
10.7%
17
 
2.6%
7
 
1.1%
7
 
1.1%
7
 
1.1%
7
 
1.1%
Other values (98) 175
26.8%
Common
ValueCountFrequency (%)
21
46.7%
) 11
24.4%
( 11
24.4%
2 1
 
2.2%
1 1
 
2.2%
Latin
ValueCountFrequency (%)
D 2
66.7%
L 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 652
93.1%
ASCII 48
 
6.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
145
22.2%
73
11.2%
72
11.0%
72
11.0%
70
 
10.7%
17
 
2.6%
7
 
1.1%
7
 
1.1%
7
 
1.1%
7
 
1.1%
Other values (98) 175
26.8%
ASCII
ValueCountFrequency (%)
21
43.8%
) 11
22.9%
( 11
22.9%
D 2
 
4.2%
2 1
 
2.1%
1 1
 
2.1%
L 1
 
2.1%
Distinct68
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size692.0 B
2023-12-13T10:00:21.857951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length40
Mean length29.885714
Min length17

Characters and Unicode

Total characters2092
Distinct characters112
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)94.3%

Sample

1st row서울특별시 중랑구 망우로32길 23, 2층 208호
2nd row서울특별시 중랑구 면목동 대지 472-32 3층
3rd row서울특별시 중랑구 봉화산로 189, 406호(신내11단지 상가동)
4th row서울특별시 중랑구 망우동 대지 515-39
5th row서울특별시 중랑구 중화동 대지 438-0 삼익아파트 상가동 206호
ValueCountFrequency (%)
서울특별시 70
 
17.0%
중랑구 70
 
17.0%
대지 14
 
3.4%
2층 10
 
2.4%
동일로 8
 
1.9%
중랑역로 6
 
1.5%
40-36 5
 
1.2%
신내역로3길 5
 
1.2%
면목동 5
 
1.2%
상가동 5
 
1.2%
Other values (147) 214
51.9%
2023-12-13T10:00:22.200807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
348
 
16.6%
1 125
 
6.0%
84
 
4.0%
78
 
3.7%
77
 
3.7%
70
 
3.3%
70
 
3.3%
70
 
3.3%
70
 
3.3%
70
 
3.3%
Other values (102) 1030
49.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1151
55.0%
Decimal Number 463
22.1%
Space Separator 348
 
16.6%
Other Punctuation 60
 
2.9%
Dash Punctuation 29
 
1.4%
Uppercase Letter 23
 
1.1%
Close Punctuation 9
 
0.4%
Open Punctuation 9
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
84
 
7.3%
78
 
6.8%
77
 
6.7%
70
 
6.1%
70
 
6.1%
70
 
6.1%
70
 
6.1%
70
 
6.1%
56
 
4.9%
51
 
4.4%
Other values (74) 455
39.5%
Uppercase Letter
ValueCountFrequency (%)
B 6
26.1%
A 5
21.7%
E 2
 
8.7%
S 2
 
8.7%
K 1
 
4.3%
V 1
 
4.3%
C 1
 
4.3%
G 1
 
4.3%
T 1
 
4.3%
O 1
 
4.3%
Other values (2) 2
 
8.7%
Decimal Number
ValueCountFrequency (%)
1 125
27.0%
2 66
14.3%
0 59
12.7%
3 53
11.4%
4 38
 
8.2%
9 28
 
6.0%
5 25
 
5.4%
8 24
 
5.2%
7 23
 
5.0%
6 22
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 59
98.3%
@ 1
 
1.7%
Space Separator
ValueCountFrequency (%)
348
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 29
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1151
55.0%
Common 918
43.9%
Latin 23
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
84
 
7.3%
78
 
6.8%
77
 
6.7%
70
 
6.1%
70
 
6.1%
70
 
6.1%
70
 
6.1%
70
 
6.1%
56
 
4.9%
51
 
4.4%
Other values (74) 455
39.5%
Common
ValueCountFrequency (%)
348
37.9%
1 125
 
13.6%
2 66
 
7.2%
, 59
 
6.4%
0 59
 
6.4%
3 53
 
5.8%
4 38
 
4.1%
- 29
 
3.2%
9 28
 
3.1%
5 25
 
2.7%
Other values (6) 88
 
9.6%
Latin
ValueCountFrequency (%)
B 6
26.1%
A 5
21.7%
E 2
 
8.7%
S 2
 
8.7%
K 1
 
4.3%
V 1
 
4.3%
C 1
 
4.3%
G 1
 
4.3%
T 1
 
4.3%
O 1
 
4.3%
Other values (2) 2
 
8.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1151
55.0%
ASCII 941
45.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
348
37.0%
1 125
 
13.3%
2 66
 
7.0%
, 59
 
6.3%
0 59
 
6.3%
3 53
 
5.6%
4 38
 
4.0%
- 29
 
3.1%
9 28
 
3.0%
5 25
 
2.7%
Other values (18) 111
 
11.8%
Hangul
ValueCountFrequency (%)
84
 
7.3%
78
 
6.8%
77
 
6.7%
70
 
6.1%
70
 
6.1%
70
 
6.1%
70
 
6.1%
70
 
6.1%
56
 
4.9%
51
 
4.4%
Other values (74) 455
39.5%
Distinct54
Distinct (%)77.1%
Missing0
Missing (%)0.0%
Memory size692.0 B
2023-12-13T10:00:22.381505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length12
Min length11

Characters and Unicode

Total characters840
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)74.3%

Sample

1st row02-436-0714
2nd row02-491-5901
3rd row02-438-2900
4th row02-496-3900
5th row02-2209-6300
ValueCountFrequency (%)
000-0000-0000 16
 
22.2%
02 2
 
2.8%
02-436-0714 2
 
2.8%
02-743-8505 1
 
1.4%
02-6953-9661 1
 
1.4%
02-508-1836 1
 
1.4%
0507-1317-4734 1
 
1.4%
02-2208-5514 1
 
1.4%
070-7459-9000 1
 
1.4%
02-494-1378 1
 
1.4%
Other values (45) 45
62.5%
2023-12-13T10:00:22.704667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 274
32.6%
- 140
16.7%
2 85
 
10.1%
4 63
 
7.5%
3 50
 
6.0%
9 41
 
4.9%
5 41
 
4.9%
7 40
 
4.8%
1 36
 
4.3%
6 34
 
4.0%
Other values (2) 36
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 698
83.1%
Dash Punctuation 140
 
16.7%
Space Separator 2
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 274
39.3%
2 85
 
12.2%
4 63
 
9.0%
3 50
 
7.2%
9 41
 
5.9%
5 41
 
5.9%
7 40
 
5.7%
1 36
 
5.2%
6 34
 
4.9%
8 34
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 140
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 840
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 274
32.6%
- 140
16.7%
2 85
 
10.1%
4 63
 
7.5%
3 50
 
6.0%
9 41
 
4.9%
5 41
 
4.9%
7 40
 
4.8%
1 36
 
4.3%
6 34
 
4.0%
Other values (2) 36
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 840
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 274
32.6%
- 140
16.7%
2 85
 
10.1%
4 63
 
7.5%
3 50
 
6.0%
9 41
 
4.9%
5 41
 
4.9%
7 40
 
4.8%
1 36
 
4.3%
6 34
 
4.0%
Other values (2) 36
 
4.3%

Interactions

2023-12-13T10:00:20.828095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T10:00:22.783725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사무소명사무소주소전화번호
연번1.0001.0001.0000.773
사무소명1.0001.0001.0001.000
사무소주소1.0001.0001.0000.995
전화번호0.7731.0000.9951.000

Missing values

2023-12-13T10:00:20.914890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T10:00:20.979923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사무소명사무소주소전화번호
01건축사사무소 창건서울특별시 중랑구 망우로32길 23, 2층 208호02-436-0714
12건축사사무소삼진건축서울특별시 중랑구 면목동 대지 472-32 3층02-491-5901
23삼중건축사사무소서울특별시 중랑구 봉화산로 189, 406호(신내11단지 상가동)02-438-2900
34박을식건축사사무소서울특별시 중랑구 망우동 대지 515-3902-496-3900
45건축사사무소 세윤서울특별시 중랑구 중화동 대지 438-0 삼익아파트 상가동 206호02-2209-6300
56미성건축사사무소서울특별시 중랑구 중화동 대지 306-5302-974-2352
67건축사사무소 창성서울특별시 중랑구 망우로32길 23, 2층 208호02-436-0714
78국원건축사사무소서울특별시 중랑구 중랑역로 124, 삼익@상가동 201호02-491-8321
89예일건축사사무소서울특별시 중랑구 동일로 59502-439-7779
910박노철건축사무소서울특별시 중랑구 신내동 대지 661-002-3423-2696
연번사무소명사무소주소전화번호
6061건축사사무소 그레서울특별시 중랑구 동일로 897, 블루포스빌딩 8층, 828호070-8064-3978
6162박기정종합건축사사무소서울특별시 중랑구 중랑천로 77, 상봉오피스텔419호02-436-5854
6263송윤건축사사무소서울특별시 중랑구 신내역로3길 40-36, 신내데시앙플렉스 B동911호02-743-8505
6364엘브이건축사사무소서울특별시 중랑구 동일로 859, 2층 E-114000-0000-0000
6465선그릇건축사사무소서울특별시 중랑구 겸재로35길 37, 1층0507-1492-0832
6566예일씨엔씨건축사사무소서울특별시 중랑구 동일로 483, 1층 103호02-508-1836
6667아름건축사사무소서울특별시 중랑구 중랑역로 193, 지하000-0000-0000
6768(주)한맥건축건축사사무소서울특별시 중랑구 신내역로3길 40-36, , A동 1110호 (신내동, 신내데시앙플렉스)02-515-0854
6869오아제건축사사무소서울특별시 중랑구 신내역로 111, B동 829-B호000-0000-0000
6970아우르건축사사무소서울특별시 중랑구 신내역로3길 40-36, B동 206호000-0000-0000