Overview

Dataset statistics

Number of variables: 4
Number of observations: 54
Missing cells: 14
Missing cells (%): 6.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.9 KiB
Average record size in memory: 35.4 B
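
The missing-cell percentage can be checked from the other figures in this block. The sketch below assumes the percentage is the number of missing cells divided by the total number of cells (variables times observations); the variable names are illustrative, not part of the report.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 4
n_observations = 54
missing_cells = 14

# Assumption: the percentage is taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

Under that assumption the result agrees with the reported 6.5%.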

Variable types

Numeric: 1
Text: 3

Dataset

Description: Data on disinfection businesses operating in Nam-gu, Busan; it provides each business's name, address, and telephone number.
Author: 부산광역시 남구 (Nam-gu, Busan Metropolitan City)
URL: https://www.data.go.kr/data/3082369/fileData.do

Alerts

영업소전화번호 has 14 (25.9%) missing values (Missing)
순번 has unique values (Unique)
소독업소명칭 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 10:11:22.114473
Analysis finished: 2023-12-12 10:11:22.694248
Duration: 0.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 54
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 27.5
Minimum: 1
Maximum: 54
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 618.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.65
Q1: 14.25
Median: 27.5
Q3: 40.75
95-th percentile: 51.35
Maximum: 54
Range: 53
Interquartile range (IQR): 26.5
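
The quantiles above are consistent with linear interpolation between neighbouring order statistics of the sequence 1 to 54. The sketch below assumes that interpolation rule; `percentile` is a hypothetical helper written for illustration, not a function of the profiling library.

```python
def percentile(values, q):
    """Linearly interpolated percentile (q in [0, 100]) of a non-empty sequence."""
    ordered = sorted(values)
    position = (len(ordered) - 1) * q / 100  # fractional index into the sorted data
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# 순번 runs from 1 to 54 with no gaps (54 distinct values, strictly increasing).
sequence = list(range(1, 55))

p5 = percentile(sequence, 5)
q1 = percentile(sequence, 25)
q3 = percentile(sequence, 75)
p95 = percentile(sequence, 95)
iqr = q3 - q1

print(round(p5, 2), q1, q3, round(p95, 2), iqr)
```

With this rule the values match the reported 3.65, 14.25, 40.75, 51.35 and an IQR of 26.5.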

Descriptive statistics

Standard deviation: 15.732133
Coefficient of variation (CV): 0.57207755
Kurtosis: -1.2
Mean: 27.5
Median Absolute Deviation (MAD): 13.5
Skewness: 0
Sum: 1485
Variance: 247.5
Monotonicity: Strictly increasing
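
Most of these figures can be reproduced with the Python standard library. The sketch below assumes the sample (n - 1) forms of variance and standard deviation, which is what the reported 247.5 implies for the values 1 to 54; kurtosis and skewness are left out because their exact estimator is not stated in the report.

```python
import statistics

# 순번 takes every integer from 1 to 54 exactly once.
sequence = list(range(1, 55))

total = sum(sequence)
mean = statistics.mean(sequence)
variance = statistics.variance(sequence)  # sample variance (n - 1 denominator)
std = statistics.stdev(sequence)          # sample standard deviation
cv = std / mean                           # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
median = statistics.median(sequence)
mad = statistics.median(abs(x - median) for x in sequence)

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
```

The outputs line up with the table: sum 1485, mean 27.5, variance 247.5, standard deviation about 15.732133, CV about 0.57207755, and MAD 13.5.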
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | 1.9%
42 | 1 | 1.9%
31 | 1 | 1.9%
32 | 1 | 1.9%
33 | 1 | 1.9%
34 | 1 | 1.9%
35 | 1 | 1.9%
36 | 1 | 1.9%
37 | 1 | 1.9%
38 | 1 | 1.9%
Other values (44) | 44 | 81.5%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.9%
2 | 1 | 1.9%
3 | 1 | 1.9%
4 | 1 | 1.9%
5 | 1 | 1.9%
6 | 1 | 1.9%
7 | 1 | 1.9%
8 | 1 | 1.9%
9 | 1 | 1.9%
10 | 1 | 1.9%

Maximum 10 values
Value | Count | Frequency (%)
54 | 1 | 1.9%
53 | 1 | 1.9%
52 | 1 | 1.9%
51 | 1 | 1.9%
50 | 1 | 1.9%
49 | 1 | 1.9%
48 | 1 | 1.9%
47 | 1 | 1.9%
46 | 1 | 1.9%
45 | 1 | 1.9%

소독업소명칭 (disinfection business name)
Text

UNIQUE

Distinct: 54
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 564.0 B

Length

Max length: 13
Median length: 10
Mean length: 6.9074074
Min length: 2

Characters and Unicode

Total characters: 373
Distinct characters: 132
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
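
As an illustration of these character properties, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. It uses only a handful of business names copied from this report, so the counts cover that subset and not the full column; the `CATEGORY_NAMES` mapping is an illustrative helper.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Ll": "Lowercase Letter",
    "Zs": "Space Separator", "Ps": "Open Punctuation", "Pe": "Close Punctuation",
    "Po": "Other Punctuation", "So": "Other Symbol", "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
}

# A few names taken from the report's sample rows (not the whole column).
names = ["부산방역공사", "(주)구구환경공사", "주식회사 태극산업", "㈜물과공기", "P.C.M환경"]

# Count the general category of every character across the sample names.
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), "unknown")
    for name in names
    for ch in name
)

for label, count in category_counts.most_common():
    print(label, count)
```

Hangul syllables fall under Other Letter, while the enclosed symbol ㈜ is classified as Other Symbol, which is why that category shows up for this column.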

Unique

Unique: 54
Unique (%): 100.0%

Sample

1st row: 부산방역공사
2nd row: 영남방역공사
3rd row: (주)구구환경공사
4th row: 주식회사 태극산업
5th row: 산동방역
Value | Count | Frequency (%)
주식회사 | 10 | 14.5%
부산방역공사 | 1 | 1.4%
총각들 | 1 | 1.4%
㈜물과공기 | 1 | 1.4%
다올(daall | 1 | 1.4%
주)대명에프엠 | 1 | 1.4%
바이제로(vi-zero | 1 | 1.4%
클린에어존 | 1 | 1.4%
그린f5 | 1 | 1.4%
부산남구본부 | 1 | 1.4%
Other values (50) | 50 | 72.5%

Most occurring characters

ValueCountFrequency (%)
20
 
5.4%
17
 
4.6%
17
 
4.6%
15
 
4.0%
15
 
4.0%
( 12
 
3.2%
) 12
 
3.2%
10
 
2.7%
10
 
2.7%
10
 
2.7%
Other values (122) 235
63.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 302 | 81.0%
Space Separator | 15 | 4.0%
Uppercase Letter | 15 | 4.0%
Open Punctuation | 12 | 3.2%
Close Punctuation | 12 | 3.2%
Lowercase Letter | 8 | 2.1%
Other Symbol | 4 | 1.1%
Other Punctuation | 3 | 0.8%
Decimal Number | 1 | 0.3%
Dash Punctuation | 1 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
6.6%
17
 
5.6%
17
 
5.6%
15
 
5.0%
10
 
3.3%
10
 
3.3%
10
 
3.3%
10
 
3.3%
8
 
2.6%
7
 
2.3%
Other values (97) 178
58.9%
Uppercase Letter
ValueCountFrequency (%)
M 2
13.3%
C 2
13.3%
D 2
13.3%
P 2
13.3%
F 1
6.7%
U 1
6.7%
G 1
6.7%
Z 1
6.7%
V 1
6.7%
B 1
6.7%
Lowercase Letter
ValueCountFrequency (%)
l 2
25.0%
a 2
25.0%
o 1
12.5%
e 1
12.5%
r 1
12.5%
i 1
12.5%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
& 1
33.3%
Space Separator
ValueCountFrequency (%)
15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Decimal Number
ValueCountFrequency (%)
5 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 306 | 82.0%
Common | 44 | 11.8%
Latin | 23 | 6.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
6.5%
17
 
5.6%
17
 
5.6%
15
 
4.9%
10
 
3.3%
10
 
3.3%
10
 
3.3%
10
 
3.3%
8
 
2.6%
7
 
2.3%
Other values (98) 182
59.5%
Latin
ValueCountFrequency (%)
M 2
 
8.7%
C 2
 
8.7%
l 2
 
8.7%
a 2
 
8.7%
D 2
 
8.7%
P 2
 
8.7%
F 1
 
4.3%
o 1
 
4.3%
U 1
 
4.3%
e 1
 
4.3%
Other values (7) 7
30.4%
Common
ValueCountFrequency (%)
15
34.1%
( 12
27.3%
) 12
27.3%
. 2
 
4.5%
5 1
 
2.3%
- 1
 
2.3%
& 1
 
2.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 302 | 81.0%
ASCII | 67 | 18.0%
None | 4 | 1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
20
 
6.6%
17
 
5.6%
17
 
5.6%
15
 
5.0%
10
 
3.3%
10
 
3.3%
10
 
3.3%
10
 
3.3%
8
 
2.6%
7
 
2.3%
Other values (97) 178
58.9%
ASCII
ValueCountFrequency (%)
15
22.4%
( 12
17.9%
) 12
17.9%
M 2
 
3.0%
C 2
 
3.0%
l 2
 
3.0%
a 2
 
3.0%
D 2
 
3.0%
. 2
 
3.0%
P 2
 
3.0%
Other values (14) 14
20.9%
None
ValueCountFrequency (%)
4
100.0%

사무실소재지(도로명) (office location, road-name address)
Text

Distinct: 51
Distinct (%): 94.4%
Missing: 0
Missing (%): 0.0%
Memory size: 564.0 B

Length

Max length: 51
Median length: 38
Mean length: 31.944444
Min length: 17

Characters and Unicode

Total characters: 1725
Distinct characters: 127
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 48
Unique (%): 88.9%

Sample

1st row: 부산광역시 남구 유엔로 104-1 (대연동, (2층))
2nd row: 부산광역시 남구 석포로127번길 23 (대연동)
3rd row: 부산광역시 남구 우암로154번길 57 (우암동)
4th row: 부산광역시 남구 유엔평화로4번길 81, 3층 (대연동)
5th row: 부산광역시 남구 용호로110번길 17 (용호동)
Value | Count | Frequency (%)
부산광역시 | 54 | 15.7%
남구 | 54 | 15.7%
대연동 | 20 | 5.8%
2층 | 7 | 2.0%
수영로 | 7 | 2.0%
문현동 | 7 | 2.0%
용호동 | 6 | 1.7%
1층 | 6 | 1.7%
3층 | 5 | 1.5%
4층 | 4 | 1.2%
Other values (136) | 173 | 50.4%

Most occurring characters

ValueCountFrequency (%)
291
 
16.9%
1 82
 
4.8%
59
 
3.4%
56
 
3.2%
56
 
3.2%
55
 
3.2%
54
 
3.1%
54
 
3.1%
54
 
3.1%
54
 
3.1%
Other values (117) 910
52.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 969 | 56.2%
Decimal Number | 301 | 17.4%
Space Separator | 291 | 16.9%
Close Punctuation | 49 | 2.8%
Open Punctuation | 49 | 2.8%
Other Punctuation | 46 | 2.7%
Uppercase Letter | 10 | 0.6%
Dash Punctuation | 6 | 0.3%
Lowercase Letter | 4 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
59
 
6.1%
56
 
5.8%
56
 
5.8%
55
 
5.7%
54
 
5.6%
54
 
5.6%
54
 
5.6%
54
 
5.6%
53
 
5.5%
35
 
3.6%
Other values (90) 439
45.3%
Decimal Number
ValueCountFrequency (%)
1 82
27.2%
2 54
17.9%
3 38
12.6%
4 29
 
9.6%
0 26
 
8.6%
6 18
 
6.0%
5 16
 
5.3%
9 15
 
5.0%
8 12
 
4.0%
7 11
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
I 2
20.0%
F 1
10.0%
S 1
10.0%
K 1
10.0%
V 1
10.0%
E 1
10.0%
W 1
10.0%
H 1
10.0%
C 1
10.0%
Lowercase Letter
ValueCountFrequency (%)
l 2
50.0%
i 1
25.0%
s 1
25.0%
Space Separator
ValueCountFrequency (%)
291
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Other Punctuation
ValueCountFrequency (%)
, 46
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 969 | 56.2%
Common | 742 | 43.0%
Latin | 14 | 0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
59
 
6.1%
56
 
5.8%
56
 
5.8%
55
 
5.7%
54
 
5.6%
54
 
5.6%
54
 
5.6%
54
 
5.6%
53
 
5.5%
35
 
3.6%
Other values (90) 439
45.3%
Common
ValueCountFrequency (%)
291
39.2%
1 82
 
11.1%
2 54
 
7.3%
) 49
 
6.6%
( 49
 
6.6%
, 46
 
6.2%
3 38
 
5.1%
4 29
 
3.9%
0 26
 
3.5%
6 18
 
2.4%
Other values (5) 60
 
8.1%
Latin
ValueCountFrequency (%)
I 2
14.3%
l 2
14.3%
F 1
7.1%
i 1
7.1%
s 1
7.1%
S 1
7.1%
K 1
7.1%
V 1
7.1%
E 1
7.1%
W 1
7.1%
Other values (2) 2
14.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 969 | 56.2%
ASCII | 756 | 43.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
291
38.5%
1 82
 
10.8%
2 54
 
7.1%
) 49
 
6.5%
( 49
 
6.5%
, 46
 
6.1%
3 38
 
5.0%
4 29
 
3.8%
0 26
 
3.4%
6 18
 
2.4%
Other values (17) 74
 
9.8%
Hangul
ValueCountFrequency (%)
59
 
6.1%
56
 
5.8%
56
 
5.8%
55
 
5.7%
54
 
5.6%
54
 
5.6%
54
 
5.6%
54
 
5.6%
53
 
5.5%
35
 
3.6%
Other values (90) 439
45.3%

영업소전화번호 (business office phone number)
Text

MISSING

Distinct: 38
Distinct (%): 95.0%
Missing: 14
Missing (%): 25.9%
Memory size: 564.0 B

Length

Max length: 13
Median length: 12
Mean length: 11.5
Min length: 9

Characters and Unicode

Total characters: 460
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 36
Unique (%): 90.0%

Sample

1st row: 051-624-8464
2nd row: 051-628-2886
3rd row: 051-632-9941
4th row: 051-627-4842
5th row: 051-611-2739
Value | Count | Frequency (%)
051-623-0931 | 2 | 5.0%
051-627-4842 | 2 | 5.0%
051-611-4150 | 1 | 2.5%
051-644-9988 | 1 | 2.5%
051-624-8464 | 1 | 2.5%
1899-9671 | 1 | 2.5%
1833-2919 | 1 | 2.5%
1566-6730 | 1 | 2.5%
1644-3044 | 1 | 2.5%
051-711-1444 | 1 | 2.5%
Other values (28) | 28 | 70.0%

Most occurring characters

Value | Count | Frequency (%)
- | 73 | 15.9%
1 | 61 | 13.3%
0 | 59 | 12.8%
5 | 55 | 12.0%
6 | 52 | 11.3%
3 | 35 | 7.6%
2 | 33 | 7.2%
9 | 25 | 5.4%
4 | 25 | 5.4%
8 | 23 | 5.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 387 | 84.1%
Dash Punctuation | 73 | 15.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 61
15.8%
0 59
15.2%
5 55
14.2%
6 52
13.4%
3 35
9.0%
2 33
8.5%
9 25
6.5%
4 25
6.5%
8 23
 
5.9%
7 19
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 73
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 460
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 73
15.9%
1 61
13.3%
0 59
12.8%
5 55
12.0%
6 52
11.3%
3 35
7.6%
2 33
7.2%
9 25
 
5.4%
4 25
 
5.4%
8 23
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 460
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 73
15.9%
1 61
13.3%
0 59
12.8%
5 55
12.0%
6 52
11.3%
3 35
7.6%
2 33
7.2%
9 25
 
5.4%
4 25
 
5.4%
8 23
 
5.0%

Interactions


Correlations

Variable | 순번 | 소독업소명칭 | 사무실소재지(도로명) | 영업소전화번호
순번 | 1.000 | 1.000 | 0.823 | 0.627
소독업소명칭 | 1.000 | 1.000 | 1.000 | 1.000
사무실소재지(도로명) | 0.823 | 1.000 | 1.000 | 1.000
영업소전화번호 | 0.627 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
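
For a numeric counterpart to these plots, the sketch below counts missing entries in the 영업소전화번호 column for the last ten rows listed in the Sample section, treating `<NA>` as `None`. It is an illustration on that subset only; the full-column figures (14 of 54) come from the report itself.

```python
# 영업소전화번호 values for the last ten sample rows; <NA> is written as None.
tail_phones = [
    "051-622-6263", "051-635-3368", None, None, "051-633-8828",
    None, "051-644-9988", "051-635-7055", "1566-9362", None,
]

# Nullity of the subset: how many entries are missing, and what share.
tail_missing = sum(value is None for value in tail_phones)
tail_missing_pct = 100 * tail_missing / len(tail_phones)

# Full-column figures as reported: 14 missing out of 54 observations.
column_missing_pct = 100 * 14 / 54

print(f"tail: {tail_missing} of {len(tail_phones)} missing ({tail_missing_pct:.1f}%)")
print(f"column: {column_missing_pct:.1f}% missing")
```

The last ten rows are missing phone numbers more often than the column as a whole, which is the kind of pattern a nullity matrix makes visible.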

Sample

First rows
Index | 순번 | 소독업소명칭 | 사무실소재지(도로명) | 영업소전화번호
0 | 1 | 부산방역공사 | 부산광역시 남구 유엔로 104-1 (대연동, (2층)) | 051-624-8464
1 | 2 | 영남방역공사 | 부산광역시 남구 석포로127번길 23 (대연동) | 051-628-2886
2 | 3 | (주)구구환경공사 | 부산광역시 남구 우암로154번길 57 (우암동) | 051-632-9941
3 | 4 | 주식회사 태극산업 | 부산광역시 남구 유엔평화로4번길 81, 3층 (대연동) | 051-627-4842
4 | 5 | 산동방역 | 부산광역시 남구 용호로110번길 17 (용호동) | 051-611-2739
5 | 6 | 케이피엠 | 부산광역시 남구 우암로104번길 29, 111동 201호 (감만동) | 051-635-8086
6 | 7 | 제명 방역공사 | 부산광역시 남구 유엔로201번길 43 (대연동) | 051-623-7139
7 | 8 | 주식회사 보승 | 부산광역시 남구 지게골로 101-22 (문현동) | 051-643-9365
8 | 9 | (주)미화실업 | 부산광역시 남구 유엔평화로4번길 61 (대연동, (4층)) | 051-469-0900
9 | 10 | P.C.M환경 | 부산광역시 남구 동명로146번길 134 (용호동) | 051-611-2006

Last rows
Index | 순번 | 소독업소명칭 | 사무실소재지(도로명) | 영업소전화번호
44 | 45 | 주식회사 이오테크 | 부산광역시 남구 신선로 365, 부경대학교용당캠퍼스(용당동) | 051-622-6263
45 | 46 | 일진환경 | 부산광역시 남구 양지골로 169번길 25-1, 1층(감만동) | 051-635-3368
46 | 47 | 제로킬 | 부산광역시 남구 동명로 146번길 132, 풍성숯불갈비(용호동) | <NA>
47 | 48 | 방력 | 부산광역시 남구 전포대로 133, 문현금융단지 IFC부산 오피스텔 23층 2319호(문현동) | <NA>
48 | 49 | 남도방역공사 | 부산광역시 남구 고동골로 105-2, 1층 (문현동) | 051-633-8828
49 | 50 | 주식회사 주안시스템 | 부산광역시 남구 유엔평화로41번가길 42, 4층 (대연동) | <NA>
50 | 51 | 부산소독 | 부산광역시 남구 석포로26번가길 24, 감만2동새마을금고 5층 (감만동) | 051-644-9988
51 | 52 | ㈜케이피엠 | 부산광역시 남구 우암로104번길 29, 111동 201호 (감만동) | 051-635-7055
52 | 53 | ㈜물과공기 | 부산광역시 남구 신선로 365 | 1566-9362
53 | 54 | UDT방역 | 부산광역시 남구 대연동 39-29 | <NA>