Overview

Dataset statistics

Number of variables: 5
Number of observations: 56
Missing cells: 22
Missing cells (%): 7.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.4 KiB
Average record size in memory: 43.3 B
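
The percentages in this panel can be checked from the raw counts. The sketch below assumes the missing-cell percentage is taken over all cells (variables × observations) and that total memory is roughly the average record size times the row count. The variable names are illustrative and not part of the report.

```python
# Counts as reported in the dataset statistics panel.
n_variables = 5
n_observations = 56
missing_cells = 22

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size ~= average record size * number of rows.
avg_record_bytes = 43.3
total_kib = avg_record_bytes * n_observations / 1024

print(f"total cells:   {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"total size:    {total_kib:.1f} KiB")
```

All 22 missing cells come from the 전화번호 column, as the alerts section confirms.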

Variable types

Numeric: 1
Text: 3
Categorical: 1

Dataset

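Description: The Nowon-gu (Seoul) architectural office status dataset contains the name, address, and telephone number of each architectural office (건축사사무소) located in Nowon-gu.
Author: Nowon-gu, Seoul Metropolitan City (서울특별시 노원구)
URL: https://www.data.go.kr/data/15126270/fileData.do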

Alerts

데이터기준일자 has constant value "2024-01-05" (Constant)
전화번호 has 22 (39.3%) missing values (Missing)
순번 has unique values (Unique)
사무소명 has unique values (Unique)

Reproduction

Analysis started: 2024-03-14 15:40:08.869587
Analysis finished: 2024-03-14 15:40:10.565434
Duration: 1.7 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
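
The reported duration follows directly from the two timestamps. A minimal sketch of that arithmetic, assuming both timestamps are in the same time zone (the format string is inferred from how they are printed):

```python
from datetime import datetime

# Timestamp layout inferred from the report: date, time, microseconds.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-03-14 15:40:08.869587", FMT)
finished = datetime.strptime("2024-03-14 15:40:10.565434", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.1f} seconds")
```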

Variables

순번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 56
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 28.5
Minimum: 1
Maximum: 56
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 632.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.75
Q1: 14.75
Median: 28.5
Q3: 42.25
95-th percentile: 53.25
Maximum: 56
Range: 55
Interquartile range (IQR): 27.5

Descriptive statistics

Standard deviation: 16.309506
Coefficient of variation (CV): 0.57226338
Kurtosis: -1.2
Mean: 28.5
Median Absolute Deviation (MAD): 14
Skewness: 0
Sum: 1596
Variance: 266
Monotonicity: Strictly increasing
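
Because 순번 is unique, strictly increasing, and spans 1 to 56, it must be the integer sequence 1..56, so most of the figures above can be reproduced with the standard library. The sketch below assumes sample variance (n − 1 denominator) and linearly interpolated quantiles. Those conventions are inferred from the reported values matching, not taken from the report itself.

```python
import statistics

# The column is the integer sequence 1..56 (unique, strictly increasing).
values = list(range(1, 57))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)

# Median absolute deviation around the median.
mad = statistics.median([abs(v - median) for v in values])

# "inclusive" interpolates linearly between the min and max of the data.
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

print(total, mean, variance, round(std, 6), round(cv, 8))
print(median, mad, q1, q3, iqr)
```

Skewness is 0 because the sequence is symmetric about its mean.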
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 1.8%
30 | 1 | 1.8%
32 | 1 | 1.8%
33 | 1 | 1.8%
34 | 1 | 1.8%
35 | 1 | 1.8%
36 | 1 | 1.8%
37 | 1 | 1.8%
38 | 1 | 1.8%
39 | 1 | 1.8%
Other values (46) | 46 | 82.1%
Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.8%
2 | 1 | 1.8%
3 | 1 | 1.8%
4 | 1 | 1.8%
5 | 1 | 1.8%
6 | 1 | 1.8%
7 | 1 | 1.8%
8 | 1 | 1.8%
9 | 1 | 1.8%
10 | 1 | 1.8%
Maximum 10 values
Value | Count | Frequency (%)
56 | 1 | 1.8%
55 | 1 | 1.8%
54 | 1 | 1.8%
53 | 1 | 1.8%
52 | 1 | 1.8%
51 | 1 | 1.8%
50 | 1 | 1.8%
49 | 1 | 1.8%
48 | 1 | 1.8%
47 | 1 | 1.8%

사무소명 (office name)
Text

UNIQUE 

Distinct: 56
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 576.0 B

Length

Max length: 21
Median length: 16
Mean length: 10.714286
Min length: 7

Characters and Unicode

Total characters: 600
Distinct characters: 104
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
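
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below is not the profiler's own implementation. It shows how the two-letter codes map to the category names used in this report: `Lo` is Other Letter, `Zs` is Space Separator, `Ps`/`Pe` are Open/Close Punctuation, `Lu` is Uppercase Letter, and `So` is Other Symbol. The helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category code (e.g. 'Lo', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)


# One office name taken from the sample rows of this report.
name = "(주)청구건축사사무소"
counts = category_counts(name)
print(counts)

# Latin capitals fall under 'Lu'; Hangul syllables fall under 'Lo'.
print(category_counts("JK건축사사무소"))
```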

Unique

Unique: 56
Unique (%): 100.0%

Sample

1st row: 주식회사 시원건축사사무소
2nd row: 주식회사 씨밀레건축사사무소
3rd row: (주)대승이엔지건축사사무소
4th row: (주)청구건축사사무소
5th row: 상민건축사사무소
Value | Count | Frequency (%)
건축사사무소 | 17 | 21.2%
주식회사 | 6 | 7.5%
시원건축사사무소 | 2 | 2.5%
더환건축사사무소 | 1 | 1.2%
건안건축사사무소 | 1 | 1.2%
이현 | 1 | 1.2%
주)에스건축사사무소 | 1 | 1.2%
주)재원피앤씨건축사사무소 | 1 | 1.2%
공유 | 1 | 1.2%
두림건축사사무소 | 1 | 1.2%
Other values (48) | 48 | 60.0%

Most occurring characters

ValueCountFrequency (%)
118
19.7%
58
 
9.7%
57
 
9.5%
57
 
9.5%
56
 
9.3%
25
 
4.2%
16
 
2.7%
15
 
2.5%
) 10
 
1.7%
( 10
 
1.7%
Other values (94) 178
29.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 551 | 91.8%
Space Separator | 25 | 4.2%
Close Punctuation | 10 | 1.7%
Open Punctuation | 10 | 1.7%
Other Symbol | 2 | 0.3%
Uppercase Letter | 2 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
118
21.4%
58
 
10.5%
57
 
10.3%
57
 
10.3%
56
 
10.2%
16
 
2.9%
15
 
2.7%
7
 
1.3%
6
 
1.1%
6
 
1.1%
Other values (88) 155
28.1%
Uppercase Letter
Value | Count | Frequency (%)
J | 1 | 50.0%
K | 1 | 50.0%
Space Separator
Value | Count | Frequency (%)
(space) | 25 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 10 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 10 | 100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 553 | 92.2%
Common | 45 | 7.5%
Latin | 2 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
118
21.3%
58
 
10.5%
57
 
10.3%
57
 
10.3%
56
 
10.1%
16
 
2.9%
15
 
2.7%
7
 
1.3%
6
 
1.1%
6
 
1.1%
Other values (89) 157
28.4%
Common
Value | Count | Frequency (%)
(space) | 25 | 55.6%
) | 10 | 22.2%
( | 10 | 22.2%
Latin
Value | Count | Frequency (%)
J | 1 | 50.0%
K | 1 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 551 | 91.8%
ASCII | 47 | 7.8%
None | 2 | 0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
118
21.4%
58
 
10.5%
57
 
10.3%
57
 
10.3%
56
 
10.2%
16
 
2.9%
15
 
2.7%
7
 
1.3%
6
 
1.1%
6
 
1.1%
Other values (88) 155
28.1%
ASCII
Value | Count | Frequency (%)
(space) | 25 | 53.2%
) | 10 | 21.3%
( | 10 | 21.3%
J | 1 | 2.1%
K | 1 | 2.1%
None
ValueCountFrequency (%)
2
100.0%

사무소 주소 (office address)
Text

Distinct: 53
Distinct (%): 94.6%
Missing: 0
Missing (%): 0.0%
Memory size: 576.0 B

Length

Max length: 31
Median length: 26
Mean length: 21.375
Min length: 16

Characters and Unicode

Total characters: 1197
Distinct characters: 93
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 52
Unique (%): 92.9%

Sample

1st row: 서울특별시 노원구 노해로75길 14-2
2nd row: 서울특별시 노원구 공릉로46길 6 (현대프라자)
3rd row: 서울특별시 노원구 동일로192길 45
4th row: 서울특별시 노원구 동일로241길 44
5th row: 서울특별시 노원구 동일로241길 44(수락넥스트죤)
Value | Count | Frequency (%)
서울특별시 | 56 | 23.1%
노원구 | 56 | 23.1%
노해로75길 | 5 | 2.1%
동일로241길 | 4 | 1.7%
노해로 | 4 | 1.7%
491 | 4 | 1.7%
공릉동 | 4 | 1.7%
월계로 | 3 | 1.2%
동일로 | 3 | 1.2%
25 | 3 | 1.2%
Other values (86) | 100 | 41.3%

Most occurring characters

ValueCountFrequency (%)
186
 
15.5%
66
 
5.5%
58
 
4.8%
56
 
4.7%
56
 
4.7%
56
 
4.7%
56
 
4.7%
56
 
4.7%
56
 
4.7%
52
 
4.3%
Other values (83) 499
41.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 757 | 63.2%
Decimal Number | 234 | 19.5%
Space Separator | 186 | 15.5%
Dash Punctuation | 13 | 1.1%
Other Punctuation | 3 | 0.3%
Close Punctuation | 2 | 0.2%
Open Punctuation | 2 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
66
 
8.7%
58
 
7.7%
56
 
7.4%
56
 
7.4%
56
 
7.4%
56
 
7.4%
56
 
7.4%
56
 
7.4%
52
 
6.9%
36
 
4.8%
Other values (68) 209
27.6%
Decimal Number
Value | Count | Frequency (%)
1 | 41 | 17.5%
4 | 39 | 16.7%
2 | 34 | 14.5%
7 | 30 | 12.8%
3 | 24 | 10.3%
5 | 18 | 7.7%
6 | 15 | 6.4%
0 | 14 | 6.0%
9 | 12 | 5.1%
8 | 7 | 3.0%
Space Separator
Value | Count | Frequency (%)
(space) | 186 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 13 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
, | 3 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 2 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 757 | 63.2%
Common | 440 | 36.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
66
 
8.7%
58
 
7.7%
56
 
7.4%
56
 
7.4%
56
 
7.4%
56
 
7.4%
56
 
7.4%
56
 
7.4%
52
 
6.9%
36
 
4.8%
Other values (68) 209
27.6%
Common
Value | Count | Frequency (%)
(space) | 186 | 42.3%
1 | 41 | 9.3%
4 | 39 | 8.9%
2 | 34 | 7.7%
7 | 30 | 6.8%
3 | 24 | 5.5%
5 | 18 | 4.1%
6 | 15 | 3.4%
0 | 14 | 3.2%
- | 13 | 3.0%
Other values (5) | 26 | 5.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 757 | 63.2%
ASCII | 440 | 36.8%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 186 | 42.3%
1 | 41 | 9.3%
4 | 39 | 8.9%
2 | 34 | 7.7%
7 | 30 | 6.8%
3 | 24 | 5.5%
5 | 18 | 4.1%
6 | 15 | 3.4%
0 | 14 | 3.2%
- | 13 | 3.0%
Other values (5) | 26 | 5.9%
Hangul
ValueCountFrequency (%)
66
 
8.7%
58
 
7.7%
56
 
7.4%
56
 
7.4%
56
 
7.4%
56
 
7.4%
56
 
7.4%
56
 
7.4%
52
 
6.9%
36
 
4.8%
Other values (68) 209
27.6%

전화번호 (telephone number)
Text

MISSING 

Distinct: 33
Distinct (%): 97.1%
Missing: 22
Missing (%): 39.3%
Memory size: 576.0 B

Length

Max length: 13
Median length: 11
Mean length: 11.411765
Min length: 11

Characters and Unicode

Total characters: 388
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 32
Unique (%): 94.1%
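
The percentages for this column use two different denominators, which is easy to miss. The sketch below reproduces them under the assumption that the missing percentage is taken over all 56 rows, while the distinct and unique percentages and the mean length are taken over the 34 non-missing values. This is inferred from the numbers agreeing, not stated by the report.

```python
# Counts as reported for the 전화번호 column.
n_rows = 56
n_missing = 22
n_distinct = 33
n_unique = 32
total_characters = 388

# Values actually present in the column.
n_present = n_rows - n_missing

missing_pct = 100 * n_missing / n_rows        # over all rows
distinct_pct = 100 * n_distinct / n_present   # over non-missing values
unique_pct = 100 * n_unique / n_present       # over non-missing values
mean_length = total_characters / n_present

print(f"present: {n_present}")
print(f"missing: {missing_pct:.1f}%  distinct: {distinct_pct:.1f}%  "
      f"unique: {unique_pct:.1f}%")
print(f"mean length: {mean_length:.6f}")
```

The one non-unique value is 02-937-1100, which appears twice in the frequency table for this column.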

Sample

1st row: 02-937-1100
2nd row: 02-975-9006
3rd row: 02-3296-0132
4th row: 02-971-7087
5th row: 02-949-7305
Value | Count | Frequency (%)
02-937-1100 | 2 | 5.9%
02-3391-2627 | 1 | 2.9%
02-972-5152 | 1 | 2.9%
02-974-9810 | 1 | 2.9%
02-931-5032 | 1 | 2.9%
02-933-5021 | 1 | 2.9%
02-469-6234 | 1 | 2.9%
02-6397-2146 | 1 | 2.9%
02-511-5034 | 1 | 2.9%
02-3291-3158 | 1 | 2.9%
Other values (23) | 23 | 67.6%

Most occurring characters

Value | Count | Frequency (%)
- | 68 | 17.5%
0 | 60 | 15.5%
2 | 59 | 15.2%
7 | 34 | 8.8%
9 | 32 | 8.2%
3 | 29 | 7.5%
5 | 27 | 7.0%
1 | 26 | 6.7%
4 | 20 | 5.2%
6 | 19 | 4.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 320 | 82.5%
Dash Punctuation | 68 | 17.5%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 60 | 18.8%
2 | 59 | 18.4%
7 | 34 | 10.6%
9 | 32 | 10.0%
3 | 29 | 9.1%
5 | 27 | 8.4%
1 | 26 | 8.1%
4 | 20 | 6.2%
6 | 19 | 5.9%
8 | 14 | 4.4%
Dash Punctuation
Value | Count | Frequency (%)
- | 68 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 388 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 68 | 17.5%
0 | 60 | 15.5%
2 | 59 | 15.2%
7 | 34 | 8.8%
9 | 32 | 8.2%
3 | 29 | 7.5%
5 | 27 | 7.0%
1 | 26 | 6.7%
4 | 20 | 5.2%
6 | 19 | 4.9%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 388 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 68 | 17.5%
0 | 60 | 15.5%
2 | 59 | 15.2%
7 | 34 | 8.8%
9 | 32 | 8.2%
3 | 29 | 7.5%
5 | 27 | 7.0%
1 | 26 | 6.7%
4 | 20 | 5.2%
6 | 19 | 4.9%

데이터기준일자 (data reference date)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 576.0 B
Value | Count
2024-01-05 | 56

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024-01-05
2nd row: 2024-01-05
3rd row: 2024-01-05
4th row: 2024-01-05
5th row: 2024-01-05

Common Values

Value | Count | Frequency (%)
2024-01-05 | 56 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2024-01-05 | 56 | 100.0%

Interactions

[Figure: interactions plot]

Correlations

Variable | 순번 | 사무소명 | 사무소 주소 | 전화번호
순번 | 1.000 | 1.000 | 0.706 | 0.950
사무소명 | 1.000 | 1.000 | 1.000 | 1.000
사무소 주소 | 0.706 | 1.000 | 1.000 | 0.977
전화번호 | 0.950 | 1.000 | 0.977 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 순번 | 사무소명 | 사무소 주소 | 전화번호 | 데이터기준일자
0 | 1 | 주식회사 시원건축사사무소 | 서울특별시 노원구 노해로75길 14-2 | 02-937-1100 | 2024-01-05
1 | 2 | 주식회사 씨밀레건축사사무소 | 서울특별시 노원구 공릉로46길 6 (현대프라자) | 02-975-9006 | 2024-01-05
2 | 3 | (주)대승이엔지건축사사무소 | 서울특별시 노원구 동일로192길 45 | 02-3296-0132 | 2024-01-05
3 | 4 | (주)청구건축사사무소 | 서울특별시 노원구 동일로241길 44 | 02-971-7087 | 2024-01-05
4 | 5 | 상민건축사사무소 | 서울특별시 노원구 동일로241길 44(수락넥스트죤) | 02-949-7305 | 2024-01-05
5 | 6 | (주)보강건축사사무소 | 서울특별시 노원구 동일로232길 16 연호빌딩 | 02-977-8810 | 2024-01-05
6 | 7 | 시원건축사사무소 | 서울특별시 노원구 노해로75길 14-2 중앙빌딩 | 02-937-1100 | 2024-01-05
7 | 8 | 건축사사무소 노아 | 서울특별시 노원구 상계로23길 52-7 | 02-3446-0382 | 2024-01-05
8 | 9 | 제이풀종합건축사사무소 | 서울특별시 노원구 동일로203가길 29, 브라운스톤 중계 | 02-3462-1977 | 2024-01-05
9 | 10 | ㈜예진인건축사사무소 건축컨설턴트 | 서울특별시 노원구 한글비석로 396 | 02-597-2311 | 2024-01-05

Last rows
Index | 순번 | 사무소명 | 사무소 주소 | 전화번호 | 데이터기준일자
46 | 47 | 진터에이앤씨건축사사무소 | 서울특별시 노원구 동일로174길 7 | <NA> | 2024-01-05
47 | 48 | 지아건축사사무소 | 서울특별시 노원구 동일로204가길 34 | <NA> | 2024-01-05
48 | 49 | 주식회사 유월건축사사무소 | 서울특별시 노원구 노해로 491 | <NA> | 2024-01-05
49 | 50 | 김영아건축사사무소 | 서울특별시 노원구 한글비석로46가길 38 | <NA> | 2024-01-05
50 | 51 | 두꺼비종합건축사사무소 | 서울특별시 노원구 동일로207길 50 | <NA> | 2024-01-05
51 | 52 | JK건축사사무소 | 서울특별시 노원구 노해로 491 | 02-6925-7472 | 2024-01-05
52 | 53 | 더환건축사사무소 | 서울특별시 노원구 동일로186길 31-6 | <NA> | 2024-01-05
53 | 54 | 디스이즈낫 건축사사무소 | 서울특별시 노원구 덕릉로127길 25 | <NA> | 2024-01-05
54 | 55 | 정이제건축사사무소 | 서울특별시 노원구 동일로 1055 | <NA> | 2024-01-05
55 | 56 | 세움건축사사무소 | 서울특별시 노원구 동일로241길 | <NA> | 2024-01-05