Overview

Dataset statistics

Number of variables4
Number of observations52
Missing cells28
Missing cells (%)13.5%
Duplicate rows1
Duplicate rows (%)1.9%
Total size in memory1.8 KiB
Average record size in memory35.5 B

Variable types

Numeric1
Text3

Dataset

Description부산광역시 부산진구 인테리어업체 현황에 대한 데이터로 인테리어 업체의 업체명, 전화번호, 주소 등의 항목을 제공합니다.
Author부산광역시 부산진구
URLhttps://www.data.go.kr/data/15126478/fileData.do

Alerts

Dataset has 1 (1.9%) duplicate rowsDuplicates
연번 has 6 (11.5%) missing valuesMissing
업체명 has 6 (11.5%) missing valuesMissing
전화번호 has 10 (19.2%) missing valuesMissing
주소 has 6 (11.5%) missing valuesMissing

Reproduction

Analysis started2024-03-14 16:10:17.941333
Analysis finished2024-03-14 16:10:19.057080
Duration1.12 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

MISSING 

Distinct46
Distinct (%)100.0%
Missing6
Missing (%)11.5%
Infinite0
Infinite (%)0.0%
Mean23.5
Minimum1
Maximum46
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size596.0 B
2024-03-15T01:10:19.294335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.25
Q112.25
median23.5
Q334.75
95-th percentile43.75
Maximum46
Range45
Interquartile range (IQR)22.5

Descriptive statistics

Standard deviation13.422618
Coefficient of variation (CV)0.57117522
Kurtosis-1.2
Mean23.5
Median Absolute Deviation (MAD)11.5
Skewness0
Sum1081
Variance180.16667
MonotonicityStrictly increasing
2024-03-15T01:10:19.725453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)
ValueCountFrequency (%)
36 1
 
1.9%
27 1
 
1.9%
28 1
 
1.9%
29 1
 
1.9%
30 1
 
1.9%
31 1
 
1.9%
32 1
 
1.9%
33 1
 
1.9%
34 1
 
1.9%
35 1
 
1.9%
Other values (36) 36
69.2%
(Missing) 6
 
11.5%
ValueCountFrequency (%)
1 1
1.9%
2 1
1.9%
3 1
1.9%
4 1
1.9%
5 1
1.9%
6 1
1.9%
7 1
1.9%
8 1
1.9%
9 1
1.9%
10 1
1.9%
ValueCountFrequency (%)
46 1
1.9%
45 1
1.9%
44 1
1.9%
43 1
1.9%
42 1
1.9%
41 1
1.9%
40 1
1.9%
39 1
1.9%
38 1
1.9%
37 1
1.9%

업체명
Text

MISSING 

Distinct46
Distinct (%)100.0%
Missing6
Missing (%)11.5%
Memory size544.0 B
2024-03-15T01:10:20.684129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length7.9130435
Min length4

Characters and Unicode

Total characters364
Distinct characters103
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)100.0%

Sample

1st row(주)경원건축디자인
2nd row(주)금강고려하우징
3rd row(주)끌레어림피드
4th row(주)다미건설
5th row(주)동성
ValueCountFrequency (%)
주)수가이앤씨 1
 
2.2%
거림건설(주 1
 
2.2%
공간보감인테리어디자인 1
 
2.2%
다원디자인 1
 
2.2%
대림종합건설(주 1
 
2.2%
동양건구산업 1
 
2.2%
디자인노보주식회사 1
 
2.2%
디자인하우스 1
 
2.2%
마지공간 1
 
2.2%
명성개발기업(주 1
 
2.2%
Other values (36) 36
78.3%
2024-03-15T01:10:22.066275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
41
 
11.3%
( 29
 
8.0%
) 29
 
8.0%
15
 
4.1%
15
 
4.1%
14
 
3.8%
12
 
3.3%
12
 
3.3%
12
 
3.3%
12
 
3.3%
Other values (93) 173
47.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 306
84.1%
Open Punctuation 29
 
8.0%
Close Punctuation 29
 
8.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
41
 
13.4%
15
 
4.9%
15
 
4.9%
14
 
4.6%
12
 
3.9%
12
 
3.9%
12
 
3.9%
12
 
3.9%
12
 
3.9%
6
 
2.0%
Other values (91) 155
50.7%
Open Punctuation
ValueCountFrequency (%)
( 29
100.0%
Close Punctuation
ValueCountFrequency (%)
) 29
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 306
84.1%
Common 58
 
15.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
41
 
13.4%
15
 
4.9%
15
 
4.9%
14
 
4.6%
12
 
3.9%
12
 
3.9%
12
 
3.9%
12
 
3.9%
12
 
3.9%
6
 
2.0%
Other values (91) 155
50.7%
Common
ValueCountFrequency (%)
( 29
50.0%
) 29
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 306
84.1%
ASCII 58
 
15.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
41
 
13.4%
15
 
4.9%
15
 
4.9%
14
 
4.6%
12
 
3.9%
12
 
3.9%
12
 
3.9%
12
 
3.9%
12
 
3.9%
6
 
2.0%
Other values (91) 155
50.7%
ASCII
ValueCountFrequency (%)
( 29
50.0%
) 29
50.0%

전화번호
Text

MISSING 

Distinct42
Distinct (%)100.0%
Missing10
Missing (%)19.2%
Memory size544.0 B
2024-03-15T01:10:22.882770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.095238
Min length12

Characters and Unicode

Total characters508
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)100.0%

Sample

1st row051-817-8727
2nd row051-861-1881
3rd row051-936-8989
4th row051-895-5108
5th row070-7703-8257
ValueCountFrequency (%)
051-501-5925 1
 
2.4%
051-808-9164 1
 
2.4%
051-853-9505 1
 
2.4%
051-862-7431 1
 
2.4%
051-807-6216 1
 
2.4%
070-7710-1250 1
 
2.4%
051-636-5616 1
 
2.4%
051-809-3764 1
 
2.4%
051-805-6457 1
 
2.4%
051-942-6061 1
 
2.4%
Other values (32) 32
76.2%
2024-03-15T01:10:24.488351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 84
16.5%
1 75
14.8%
0 70
13.8%
5 65
12.8%
8 54
10.6%
7 40
7.9%
9 31
 
6.1%
6 27
 
5.3%
4 24
 
4.7%
2 21
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 424
83.5%
Dash Punctuation 84
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 75
17.7%
0 70
16.5%
5 65
15.3%
8 54
12.7%
7 40
9.4%
9 31
7.3%
6 27
 
6.4%
4 24
 
5.7%
2 21
 
5.0%
3 17
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 84
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 508
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 84
16.5%
1 75
14.8%
0 70
13.8%
5 65
12.8%
8 54
10.6%
7 40
7.9%
9 31
 
6.1%
6 27
 
5.3%
4 24
 
4.7%
2 21
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 508
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 84
16.5%
1 75
14.8%
0 70
13.8%
5 65
12.8%
8 54
10.6%
7 40
7.9%
9 31
 
6.1%
6 27
 
5.3%
4 24
 
4.7%
2 21
 
4.1%

주소
Text

MISSING 

Distinct46
Distinct (%)100.0%
Missing6
Missing (%)11.5%
Memory size544.0 B
2024-03-15T01:10:25.522177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length41
Mean length32.891304
Min length24

Characters and Unicode

Total characters1513
Distinct characters108
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)100.0%

Sample

1st row부산광역시 부산진구 백양순환로 65 (당감동)
2nd row부산광역시 부산진구 거제대로36번가길 8 (양정동)
3rd row부산광역시 부산진구 중앙대로666번길 50 상가동 1층 102-2호 (부전동)
4th row부산광역시 부산진구 엄광로178번길 9, 지하층 (가야동)
5th row부산광역시 부산진구 중앙대로 770, 7층 (부전동)
ValueCountFrequency (%)
부산광역시 46
 
16.5%
부산진구 46
 
16.5%
양정동 8
 
2.9%
전포동 6
 
2.2%
초읍동 6
 
2.2%
당감동 5
 
1.8%
2층 5
 
1.8%
1층 5
 
1.8%
부전동 5
 
1.8%
가야동 4
 
1.4%
Other values (123) 143
51.3%
2024-03-15T01:10:27.280780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
233
 
15.4%
102
 
6.7%
92
 
6.1%
58
 
3.8%
1 57
 
3.8%
48
 
3.2%
48
 
3.2%
47
 
3.1%
) 47
 
3.1%
( 47
 
3.1%
Other values (98) 734
48.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 895
59.2%
Decimal Number 249
 
16.5%
Space Separator 233
 
15.4%
Close Punctuation 47
 
3.1%
Open Punctuation 47
 
3.1%
Other Punctuation 30
 
2.0%
Dash Punctuation 11
 
0.7%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
102
 
11.4%
92
 
10.3%
58
 
6.5%
48
 
5.4%
48
 
5.4%
47
 
5.3%
46
 
5.1%
46
 
5.1%
46
 
5.1%
25
 
2.8%
Other values (81) 337
37.7%
Decimal Number
ValueCountFrequency (%)
1 57
22.9%
5 32
12.9%
2 31
12.4%
0 23
9.2%
7 21
 
8.4%
3 21
 
8.4%
6 19
 
7.6%
4 17
 
6.8%
9 15
 
6.0%
8 13
 
5.2%
Other Punctuation
ValueCountFrequency (%)
, 27
90.0%
3
 
10.0%
Space Separator
ValueCountFrequency (%)
233
100.0%
Close Punctuation
ValueCountFrequency (%)
) 47
100.0%
Open Punctuation
ValueCountFrequency (%)
( 47
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 895
59.2%
Common 617
40.8%
Latin 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
102
 
11.4%
92
 
10.3%
58
 
6.5%
48
 
5.4%
48
 
5.4%
47
 
5.3%
46
 
5.1%
46
 
5.1%
46
 
5.1%
25
 
2.8%
Other values (81) 337
37.7%
Common
ValueCountFrequency (%)
233
37.8%
1 57
 
9.2%
) 47
 
7.6%
( 47
 
7.6%
5 32
 
5.2%
2 31
 
5.0%
, 27
 
4.4%
0 23
 
3.7%
7 21
 
3.4%
3 21
 
3.4%
Other values (6) 78
 
12.6%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 895
59.2%
ASCII 615
40.6%
None 3
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
233
37.9%
1 57
 
9.3%
) 47
 
7.6%
( 47
 
7.6%
5 32
 
5.2%
2 31
 
5.0%
, 27
 
4.4%
0 23
 
3.7%
7 21
 
3.4%
3 21
 
3.4%
Other values (6) 76
 
12.4%
Hangul
ValueCountFrequency (%)
102
 
11.4%
92
 
10.3%
58
 
6.5%
48
 
5.4%
48
 
5.4%
47
 
5.3%
46
 
5.1%
46
 
5.1%
46
 
5.1%
25
 
2.8%
Other values (81) 337
37.7%
None
ValueCountFrequency (%)
3
100.0%

Interactions

2024-03-15T01:10:18.263355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T01:10:27.540938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업체명전화번호주소
연번1.0001.0001.0001.000
업체명1.0001.0001.0001.000
전화번호1.0001.0001.0001.000
주소1.0001.0001.0001.000

Missing values

2024-03-15T01:10:18.572591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T01:10:18.743360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-15T01:10:18.903178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번업체명전화번호주소
01(주)경원건축디자인051-817-8727부산광역시 부산진구 백양순환로 65 (당감동)
12(주)금강고려하우징051-861-1881부산광역시 부산진구 거제대로36번가길 8 (양정동)
23(주)끌레어림피드051-936-8989부산광역시 부산진구 중앙대로666번길 50 상가동 1층 102-2호 (부전동)
34(주)다미건설051-895-5108부산광역시 부산진구 엄광로178번길 9, 지하층 (가야동)
45(주)동성070-7703-8257부산광역시 부산진구 중앙대로 770, 7층 (부전동)
56(주)디자인혜윰051-911-9995부산광역시 부산진구 경마장로 20, 1층 (범전동)
67(주)모스앤파트너스070-8719-8888부산광역시 부산진구 새싹로 89-10, 3층 (부암동)
78(주)미래디자인051-862-6894부산광역시 부산진구 성지로 26, 1층(연지동)
89(주)비손051-715-4880부산광역시 부산진구 거제천로 56 501호 도운빌딩 (양정동)
910(주)상호산업051-501-5925부산광역시 부산진구 개금본동로17번길 32, 2층 (개금동)
연번업체명전화번호주소
4243주식회사유니브원051-515-8184부산광역시 부산진구 황령대로17번길 29-3 (범천동)
4344주식회사제이엠디자인070-8727-3915부산광역시 부산진구 동평로55번길 27 ,1층 (당감동)
4445주식회사플랜디051-727-7999부산광역시 부산진구 동평로 341 3층 (양정동)
4546청수건설(주)051-867-5708부산광역시 부산진구 중앙대로691번가길 27 (부전동)
46<NA><NA><NA><NA>
47<NA><NA><NA><NA>
48<NA><NA><NA><NA>
49<NA><NA><NA><NA>
50<NA><NA><NA><NA>
51<NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

연번업체명전화번호주소# duplicates
0<NA><NA><NA><NA>6