Overview

Dataset statistics

Number of variables3
Number of observations68
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory26.9 B

Variable types

Numeric1
Text2

Dataset

Description부산광역시_금정구_실내건축공사업등록현황_20231012
Author부산광역시 금정구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15025831

Alerts

연번 has unique valuesUnique
상호 has unique valuesUnique
영업소재지(도로명주소) has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:11:39.677317
Analysis finished2023-12-10 16:11:40.057306
Duration0.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct68
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34.5
Minimum1
Maximum68
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size744.0 B
2023-12-11T01:11:40.127238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.35
Q117.75
median34.5
Q351.25
95-th percentile64.65
Maximum68
Range67
Interquartile range (IQR)33.5

Descriptive statistics

Standard deviation19.77372
Coefficient of variation (CV)0.5731513
Kurtosis-1.2
Mean34.5
Median Absolute Deviation (MAD)17
Skewness0
Sum2346
Variance391
MonotonicityStrictly increasing
2023-12-11T01:11:40.250672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.5%
45 1
 
1.5%
51 1
 
1.5%
50 1
 
1.5%
49 1
 
1.5%
48 1
 
1.5%
47 1
 
1.5%
46 1
 
1.5%
44 1
 
1.5%
36 1
 
1.5%
Other values (58) 58
85.3%
ValueCountFrequency (%)
1 1
1.5%
2 1
1.5%
3 1
1.5%
4 1
1.5%
5 1
1.5%
6 1
1.5%
7 1
1.5%
8 1
1.5%
9 1
1.5%
10 1
1.5%
ValueCountFrequency (%)
68 1
1.5%
67 1
1.5%
66 1
1.5%
65 1
1.5%
64 1
1.5%
63 1
1.5%
62 1
1.5%
61 1
1.5%
60 1
1.5%
59 1
1.5%

상호
Text

UNIQUE 

Distinct68
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-12-11T01:11:40.480728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length7.9852941
Min length2

Characters and Unicode

Total characters543
Distinct characters140
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)100.0%

Sample

1st row(주)가람디자인
2nd row(주)강윤
3rd row(주)까미노
4th row(주)너른건축
5th row(주)네스티지
ValueCountFrequency (%)
주)가람디자인 1
 
1.5%
오즈디자인랩 1
 
1.5%
더이룸컴퍼니 1
 
1.5%
더휴 1
 
1.5%
명성건업(주 1
 
1.5%
성림종합건설(주 1
 
1.5%
오알크루(orcrew 1
 
1.5%
주)강윤 1
 
1.5%
이오건설(주 1
 
1.5%
주)라움디자인 1
 
1.5%
Other values (58) 58
85.3%
2023-12-11T01:11:40.853130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
59
 
10.9%
( 41
 
7.6%
) 41
 
7.6%
23
 
4.2%
23
 
4.2%
21
 
3.9%
20
 
3.7%
20
 
3.7%
20
 
3.7%
18
 
3.3%
Other values (130) 257
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 446
82.1%
Open Punctuation 41
 
7.6%
Close Punctuation 41
 
7.6%
Uppercase Letter 9
 
1.7%
Decimal Number 5
 
0.9%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
59
 
13.2%
23
 
5.2%
23
 
5.2%
21
 
4.7%
20
 
4.5%
20
 
4.5%
20
 
4.5%
18
 
4.0%
18
 
4.0%
11
 
2.5%
Other values (117) 213
47.8%
Uppercase Letter
ValueCountFrequency (%)
R 2
22.2%
O 2
22.2%
C 1
11.1%
E 1
11.1%
W 1
11.1%
Y 1
11.1%
J 1
11.1%
Decimal Number
ValueCountFrequency (%)
0 2
40.0%
4 2
40.0%
9 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 41
100.0%
Close Punctuation
ValueCountFrequency (%)
) 41
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 446
82.1%
Common 88
 
16.2%
Latin 9
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
59
 
13.2%
23
 
5.2%
23
 
5.2%
21
 
4.7%
20
 
4.5%
20
 
4.5%
20
 
4.5%
18
 
4.0%
18
 
4.0%
11
 
2.5%
Other values (117) 213
47.8%
Latin
ValueCountFrequency (%)
R 2
22.2%
O 2
22.2%
C 1
11.1%
E 1
11.1%
W 1
11.1%
Y 1
11.1%
J 1
11.1%
Common
ValueCountFrequency (%)
( 41
46.6%
) 41
46.6%
0 2
 
2.3%
4 2
 
2.3%
. 1
 
1.1%
9 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 446
82.1%
ASCII 97
 
17.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
59
 
13.2%
23
 
5.2%
23
 
5.2%
21
 
4.7%
20
 
4.5%
20
 
4.5%
20
 
4.5%
18
 
4.0%
18
 
4.0%
11
 
2.5%
Other values (117) 213
47.8%
ASCII
ValueCountFrequency (%)
( 41
42.3%
) 41
42.3%
0 2
 
2.1%
4 2
 
2.1%
R 2
 
2.1%
O 2
 
2.1%
C 1
 
1.0%
E 1
 
1.0%
W 1
 
1.0%
Y 1
 
1.0%
Other values (3) 3
 
3.1%
Distinct68
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-12-11T01:11:41.102258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length39
Mean length28.367647
Min length22

Characters and Unicode

Total characters1929
Distinct characters108
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)100.0%

Sample

1st row부산광역시 금정구 삼어로 219 (금사동)
2nd row부산광역시 금정구 개좌로272번길 21-13 (회동동)
3rd row부산광역시 금정구 구서온천천로 59 202호 (구서동, 구서동하나로아파트3차)
4th row부산광역시 금정구 금샘로 401,4층(구서동)
5th row부산광역시 금정구 무학송로 158 (부곡동)
ValueCountFrequency (%)
부산광역시 68
 
17.8%
금정구 68
 
17.8%
부곡동 14
 
3.7%
2층 11
 
2.9%
1층 9
 
2.4%
남산동 9
 
2.4%
구서동 8
 
2.1%
금사동 8
 
2.1%
장전동 7
 
1.8%
두구동 7
 
1.8%
Other values (127) 172
45.1%
2023-12-11T01:11:41.440704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
313
 
16.2%
95
 
4.9%
89
 
4.6%
83
 
4.3%
82
 
4.3%
79
 
4.1%
71
 
3.7%
1 70
 
3.6%
70
 
3.6%
69
 
3.6%
Other values (98) 908
47.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1118
58.0%
Decimal Number 322
 
16.7%
Space Separator 313
 
16.2%
Open Punctuation 67
 
3.5%
Close Punctuation 67
 
3.5%
Other Punctuation 21
 
1.1%
Dash Punctuation 18
 
0.9%
Lowercase Letter 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
95
 
8.5%
89
 
8.0%
83
 
7.4%
82
 
7.3%
79
 
7.1%
71
 
6.4%
70
 
6.3%
69
 
6.2%
68
 
6.1%
68
 
6.1%
Other values (79) 344
30.8%
Decimal Number
ValueCountFrequency (%)
1 70
21.7%
2 61
18.9%
0 31
9.6%
4 27
 
8.4%
3 26
 
8.1%
5 24
 
7.5%
9 22
 
6.8%
6 22
 
6.8%
7 21
 
6.5%
8 18
 
5.6%
Lowercase Letter
ValueCountFrequency (%)
c 1
33.3%
m 1
33.3%
d 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 14
66.7%
7
33.3%
Space Separator
ValueCountFrequency (%)
313
100.0%
Open Punctuation
ValueCountFrequency (%)
( 67
100.0%
Close Punctuation
ValueCountFrequency (%)
) 67
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1118
58.0%
Common 808
41.9%
Latin 3
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
95
 
8.5%
89
 
8.0%
83
 
7.4%
82
 
7.3%
79
 
7.1%
71
 
6.4%
70
 
6.3%
69
 
6.2%
68
 
6.1%
68
 
6.1%
Other values (79) 344
30.8%
Common
ValueCountFrequency (%)
313
38.7%
1 70
 
8.7%
( 67
 
8.3%
) 67
 
8.3%
2 61
 
7.5%
0 31
 
3.8%
4 27
 
3.3%
3 26
 
3.2%
5 24
 
3.0%
9 22
 
2.7%
Other values (6) 100
 
12.4%
Latin
ValueCountFrequency (%)
c 1
33.3%
m 1
33.3%
d 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1118
58.0%
ASCII 804
41.7%
None 7
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
313
38.9%
1 70
 
8.7%
( 67
 
8.3%
) 67
 
8.3%
2 61
 
7.6%
0 31
 
3.9%
4 27
 
3.4%
3 26
 
3.2%
5 24
 
3.0%
9 22
 
2.7%
Other values (8) 96
 
11.9%
Hangul
ValueCountFrequency (%)
95
 
8.5%
89
 
8.0%
83
 
7.4%
82
 
7.3%
79
 
7.1%
71
 
6.4%
70
 
6.3%
69
 
6.2%
68
 
6.1%
68
 
6.1%
Other values (79) 344
30.8%
None
ValueCountFrequency (%)
7
100.0%

Interactions

2023-12-11T01:11:39.845499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:11:41.517922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상호영업소재지(도로명주소)
연번1.0001.0001.000
상호1.0001.0001.000
영업소재지(도로명주소)1.0001.0001.000

Missing values

2023-12-11T01:11:39.955296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:11:40.028926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호영업소재지(도로명주소)
01(주)가람디자인부산광역시 금정구 삼어로 219 (금사동)
12(주)강윤부산광역시 금정구 개좌로272번길 21-13 (회동동)
23(주)까미노부산광역시 금정구 구서온천천로 59 202호 (구서동, 구서동하나로아파트3차)
34(주)너른건축부산광역시 금정구 금샘로 401,4층(구서동)
45(주)네스티지부산광역시 금정구 무학송로 158 (부곡동)
56(주)대경아이디부산광역시 금정구 금강로 298 2층 (장전동)
67(주)디자인단부산광역시 금정구 중앙대로1826번길 48-55 ,2층 (구서동)
78(주)디자인엠에스건설부산광역시 금정구 체육공원로 702 (두구동)
89(주)라움디자인부산광역시 금정구 수림로 22-9 1층 (부곡동)
910(주)메트로아이.디부산광역시 금정구 동현로43번길 8 (부곡동)
연번상호영업소재지(도로명주소)
5859주식회사제이엠케이디자인부산광역시 금정구 체육공원로 541-2 (두구동)
5960주식회사조이디자인(JOY)부산광역시 금정구 두실로 66 1층 (남산동)
6061주식회사청담건설부산광역시 금정구 무학송로 124, 2층 (부곡동)
6162주식회사초원부산광역시 금정구 범어천로 38 명천빌딩3층 (남산동)
6263주식회사하논홀딩스부산광역시 금정구 회천로14번길 20-12 2층 201호 (회동동)
6364주식회사하랑기획부산광역시 금정구 반송로 437,302호(금사동)
6465지니텍부산광역시 금정구 공단동로 51 (금사동)
6566지오인테리어주식회사부산광역시 금정구 금강로 680-1 (남산동)
6667지올부산광역시 금정구 수림로62번길 19 2층 (장전동)
6768태흥건축디자인주식회사부산광역시 금정구 동현로16번길 30 2층 (부곡동)