Overview

Dataset statistics

Number of variables4
Number of observations53
Missing cells13
Missing cells (%)6.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory35.5 B

Variable types

Numeric1
Text3

Dataset

Description부산광역시남구소독정보_20230807
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3082369

Alerts

영업소전화번호 has 13 (24.5%) missing valuesMissing
순번 has unique valuesUnique
소독업소명칭 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:31:24.285833
Analysis finished2023-12-10 17:31:25.894429
Duration1.61 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct53
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27
Minimum1
Maximum53
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size609.0 B
2023-12-11T02:31:26.198723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.6
Q114
median27
Q340
95-th percentile50.4
Maximum53
Range52
Interquartile range (IQR)26

Descriptive statistics

Standard deviation15.443445
Coefficient of variation (CV)0.57197945
Kurtosis-1.2
Mean27
Median Absolute Deviation (MAD)13
Skewness0
Sum1431
Variance238.5
MonotonicityStrictly increasing
2023-12-11T02:31:26.584560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.9%
41 1
 
1.9%
30 1
 
1.9%
31 1
 
1.9%
32 1
 
1.9%
33 1
 
1.9%
34 1
 
1.9%
35 1
 
1.9%
36 1
 
1.9%
37 1
 
1.9%
Other values (43) 43
81.1%
ValueCountFrequency (%)
1 1
1.9%
2 1
1.9%
3 1
1.9%
4 1
1.9%
5 1
1.9%
6 1
1.9%
7 1
1.9%
8 1
1.9%
9 1
1.9%
10 1
1.9%
ValueCountFrequency (%)
53 1
1.9%
52 1
1.9%
51 1
1.9%
50 1
1.9%
49 1
1.9%
48 1
1.9%
47 1
1.9%
46 1
1.9%
45 1
1.9%
44 1
1.9%

소독업소명칭
Text

UNIQUE 

Distinct53
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size556.0 B
2023-12-11T02:31:27.160843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length6.9433962
Min length2

Characters and Unicode

Total characters368
Distinct characters130
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)100.0%

Sample

1st row부산방역공사
2nd row영남방역공사
3rd row(주)구구환경공사
4th row주식회사 태극산업
5th row산동방역
ValueCountFrequency (%)
주식회사 10
 
14.7%
부산방역공사 1
 
1.5%
총각들 1
 
1.5%
주)유칼릭스 1
 
1.5%
다올(daall 1
 
1.5%
주)대명에프엠 1
 
1.5%
바이제로(vi-zero 1
 
1.5%
클린에어존 1
 
1.5%
그린f5 1
 
1.5%
부산남구본부 1
 
1.5%
Other values (49) 49
72.1%
2023-12-11T02:31:28.000224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20
 
5.4%
17
 
4.6%
16
 
4.3%
15
 
4.1%
14
 
3.8%
) 12
 
3.3%
( 12
 
3.3%
10
 
2.7%
10
 
2.7%
10
 
2.7%
Other values (120) 232
63.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 300
81.5%
Space Separator 15
 
4.1%
Close Punctuation 12
 
3.3%
Open Punctuation 12
 
3.3%
Uppercase Letter 12
 
3.3%
Lowercase Letter 8
 
2.2%
Other Symbol 4
 
1.1%
Other Punctuation 3
 
0.8%
Decimal Number 1
 
0.3%
Dash Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
6.7%
17
 
5.7%
16
 
5.3%
14
 
4.7%
10
 
3.3%
10
 
3.3%
10
 
3.3%
10
 
3.3%
8
 
2.7%
7
 
2.3%
Other values (97) 178
59.3%
Uppercase Letter
ValueCountFrequency (%)
M 2
16.7%
P 2
16.7%
C 2
16.7%
F 1
8.3%
B 1
8.3%
G 1
8.3%
Z 1
8.3%
V 1
8.3%
D 1
8.3%
Lowercase Letter
ValueCountFrequency (%)
a 2
25.0%
l 2
25.0%
o 1
12.5%
r 1
12.5%
e 1
12.5%
i 1
12.5%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
& 1
33.3%
Space Separator
ValueCountFrequency (%)
15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Decimal Number
ValueCountFrequency (%)
5 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 304
82.6%
Common 44
 
12.0%
Latin 20
 
5.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
6.6%
17
 
5.6%
16
 
5.3%
14
 
4.6%
10
 
3.3%
10
 
3.3%
10
 
3.3%
10
 
3.3%
8
 
2.6%
7
 
2.3%
Other values (98) 182
59.9%
Latin
ValueCountFrequency (%)
a 2
 
10.0%
l 2
 
10.0%
M 2
 
10.0%
P 2
 
10.0%
C 2
 
10.0%
F 1
 
5.0%
o 1
 
5.0%
B 1
 
5.0%
G 1
 
5.0%
r 1
 
5.0%
Other values (5) 5
25.0%
Common
ValueCountFrequency (%)
15
34.1%
) 12
27.3%
( 12
27.3%
. 2
 
4.5%
5 1
 
2.3%
& 1
 
2.3%
- 1
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 300
81.5%
ASCII 64
 
17.4%
None 4
 
1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
20
 
6.7%
17
 
5.7%
16
 
5.3%
14
 
4.7%
10
 
3.3%
10
 
3.3%
10
 
3.3%
10
 
3.3%
8
 
2.7%
7
 
2.3%
Other values (97) 178
59.3%
ASCII
ValueCountFrequency (%)
15
23.4%
) 12
18.8%
( 12
18.8%
a 2
 
3.1%
l 2
 
3.1%
M 2
 
3.1%
. 2
 
3.1%
P 2
 
3.1%
C 2
 
3.1%
5 1
 
1.6%
Other values (12) 12
18.8%
None
ValueCountFrequency (%)
4
100.0%
Distinct50
Distinct (%)94.3%
Missing0
Missing (%)0.0%
Memory size556.0 B
2023-12-11T02:31:28.573136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length38
Mean length32.207547
Min length17

Characters and Unicode

Total characters1707
Distinct characters127
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)88.7%

Sample

1st row부산광역시 남구 유엔로 104-1 (대연동, (2층))
2nd row부산광역시 남구 석포로127번길 23 (대연동)
3rd row부산광역시 남구 우암로154번길 57 (우암동)
4th row부산광역시 남구 유엔평화로4번길 81, 3층 (대연동)
5th row부산광역시 남구 용호로110번길 17 (용호동)
ValueCountFrequency (%)
부산광역시 53
 
15.6%
남구 53
 
15.6%
대연동 19
 
5.6%
수영로 7
 
2.1%
문현동 7
 
2.1%
2층 7
 
2.1%
용호동 6
 
1.8%
1층 6
 
1.8%
3층 5
 
1.5%
감만동 4
 
1.2%
Other values (135) 172
50.7%
2023-12-11T02:31:29.656950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
288
 
16.9%
1 82
 
4.8%
58
 
3.4%
55
 
3.2%
55
 
3.2%
54
 
3.2%
53
 
3.1%
53
 
3.1%
53
 
3.1%
53
 
3.1%
Other values (117) 903
52.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 959
56.2%
Decimal Number 297
 
17.4%
Space Separator 288
 
16.9%
Close Punctuation 49
 
2.9%
Open Punctuation 49
 
2.9%
Other Punctuation 46
 
2.7%
Uppercase Letter 10
 
0.6%
Dash Punctuation 5
 
0.3%
Lowercase Letter 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
58
 
6.0%
55
 
5.7%
55
 
5.7%
54
 
5.6%
53
 
5.5%
53
 
5.5%
53
 
5.5%
53
 
5.5%
53
 
5.5%
35
 
3.6%
Other values (90) 437
45.6%
Decimal Number
ValueCountFrequency (%)
1 82
27.6%
2 53
17.8%
3 37
12.5%
4 29
 
9.8%
0 26
 
8.8%
6 18
 
6.1%
5 16
 
5.4%
9 13
 
4.4%
8 12
 
4.0%
7 11
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
I 2
20.0%
W 1
10.0%
F 1
10.0%
H 1
10.0%
E 1
10.0%
S 1
10.0%
K 1
10.0%
V 1
10.0%
C 1
10.0%
Lowercase Letter
ValueCountFrequency (%)
l 2
50.0%
i 1
25.0%
s 1
25.0%
Space Separator
ValueCountFrequency (%)
288
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Other Punctuation
ValueCountFrequency (%)
, 46
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 959
56.2%
Common 734
43.0%
Latin 14
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
58
 
6.0%
55
 
5.7%
55
 
5.7%
54
 
5.6%
53
 
5.5%
53
 
5.5%
53
 
5.5%
53
 
5.5%
53
 
5.5%
35
 
3.6%
Other values (90) 437
45.6%
Common
ValueCountFrequency (%)
288
39.2%
1 82
 
11.2%
2 53
 
7.2%
) 49
 
6.7%
( 49
 
6.7%
, 46
 
6.3%
3 37
 
5.0%
4 29
 
4.0%
0 26
 
3.5%
6 18
 
2.5%
Other values (5) 57
 
7.8%
Latin
ValueCountFrequency (%)
l 2
14.3%
I 2
14.3%
i 1
7.1%
s 1
7.1%
W 1
7.1%
F 1
7.1%
H 1
7.1%
E 1
7.1%
S 1
7.1%
K 1
7.1%
Other values (2) 2
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 959
56.2%
ASCII 748
43.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
288
38.5%
1 82
 
11.0%
2 53
 
7.1%
) 49
 
6.6%
( 49
 
6.6%
, 46
 
6.1%
3 37
 
4.9%
4 29
 
3.9%
0 26
 
3.5%
6 18
 
2.4%
Other values (17) 71
 
9.5%
Hangul
ValueCountFrequency (%)
58
 
6.0%
55
 
5.7%
55
 
5.7%
54
 
5.6%
53
 
5.5%
53
 
5.5%
53
 
5.5%
53
 
5.5%
53
 
5.5%
35
 
3.6%
Other values (90) 437
45.6%

영업소전화번호
Text

MISSING 

Distinct38
Distinct (%)95.0%
Missing13
Missing (%)24.5%
Memory size556.0 B
2023-12-11T02:31:30.257245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.5
Min length9

Characters and Unicode

Total characters460
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)90.0%

Sample

1st row051-624-8464
2nd row051-628-2886
3rd row051-632-9941
4th row051-627-4842
5th row051-611-2739
ValueCountFrequency (%)
051-623-0931 2
 
5.0%
051-627-4842 2
 
5.0%
051-611-4150 1
 
2.5%
051-644-9988 1
 
2.5%
051-624-8464 1
 
2.5%
1899-9671 1
 
2.5%
1833-2919 1
 
2.5%
1566-6730 1
 
2.5%
1644-3044 1
 
2.5%
051-711-1444 1
 
2.5%
Other values (28) 28
70.0%
2023-12-11T02:31:31.217827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 73
15.9%
1 61
13.3%
0 59
12.8%
5 55
12.0%
6 52
11.3%
3 35
7.6%
2 33
7.2%
9 25
 
5.4%
4 25
 
5.4%
8 23
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 387
84.1%
Dash Punctuation 73
 
15.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 61
15.8%
0 59
15.2%
5 55
14.2%
6 52
13.4%
3 35
9.0%
2 33
8.5%
9 25
6.5%
4 25
6.5%
8 23
 
5.9%
7 19
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 73
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 460
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 73
15.9%
1 61
13.3%
0 59
12.8%
5 55
12.0%
6 52
11.3%
3 35
7.6%
2 33
7.2%
9 25
 
5.4%
4 25
 
5.4%
8 23
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 460
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 73
15.9%
1 61
13.3%
0 59
12.8%
5 55
12.0%
6 52
11.3%
3 35
7.6%
2 33
7.2%
9 25
 
5.4%
4 25
 
5.4%
8 23
 
5.0%

Interactions

2023-12-11T02:31:24.966677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:31:31.452774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번소독업소명칭사무실소재지(도로명)영업소전화번호
순번1.0001.0000.8520.627
소독업소명칭1.0001.0001.0001.000
사무실소재지(도로명)0.8521.0001.0001.000
영업소전화번호0.6271.0001.0001.000

Missing values

2023-12-11T02:31:25.380395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:31:25.745476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번소독업소명칭사무실소재지(도로명)영업소전화번호
01부산방역공사부산광역시 남구 유엔로 104-1 (대연동, (2층))051-624-8464
12영남방역공사부산광역시 남구 석포로127번길 23 (대연동)051-628-2886
23(주)구구환경공사부산광역시 남구 우암로154번길 57 (우암동)051-632-9941
34주식회사 태극산업부산광역시 남구 유엔평화로4번길 81, 3층 (대연동)051-627-4842
45산동방역부산광역시 남구 용호로110번길 17 (용호동)051-611-2739
56케이피엠부산광역시 남구 우암로104번길 29, 111동 201호 (감만동)051-635-8086
67제명 방역공사부산광역시 남구 유엔로201번길 43 (대연동)051-623-7139
78주식회사 보승부산광역시 남구 지게골로 101-22 (문현동)051-643-9365
89(주)미화실업부산광역시 남구 유엔평화로4번길 61 (대연동, (4층))051-469-0900
910P.C.M환경부산광역시 남구 동명로146번길 134 (용호동)051-611-2006
순번소독업소명칭사무실소재지(도로명)영업소전화번호
4344㈜시티캅부산광역시 남구 자성로 152, 한일오피스텔 3층051-630-9312
4445주식회사 이오테크부산광역시 남구 신선로 365, 부경대학교용당캠퍼스(용당동)051-622-6263
4546일진환경부산광역시 남구 양지골로 169번길 25-1, 1층(감만동)051-635-3368
4647제로킬부산광역시 남구 동명로 146번길 132, 풍성숯불갈비(용호동)<NA>
4748방력부산광역시 남구 전포대로 133, 문현금융단지 IFC부산 오피스텔 23층 2319호(문현동)<NA>
4849남도방역공사부산광역시 남구 고동골로 105-2, 1층 (문현동)051-633-8828
4950주식회사 주안시스템부산광역시 남구 유엔평화로41번가길 42, 4층 (대연동)<NA>
5051부산소독부산광역시 남구 석포로26번가길 24, 감만2동새마을금고 5층 (감만동)051-644-9988
5152㈜케이피엠부산광역시 남구 우암로104번길 29, 111동 201호 (감만동)051-635-7055
5253㈜물과공기부산광역시 남구 신선로 3651566-9362