Overview

Dataset statistics

Number of variables4
Number of observations52
Missing cells15
Missing cells (%)7.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory35.5 B

Variable types

Numeric1
Text3

Dataset

Description부산광역시남구소독정보_20220725
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3082369

Alerts

영업소전화번호 has 15 (28.8%) missing valuesMissing
순번 has unique valuesUnique
소독업소명칭 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:31:33.214879
Analysis finished2023-12-10 17:31:34.694950
Duration1.48 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.5
Minimum1
Maximum52
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size600.0 B
2023-12-11T02:31:34.850321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.55
Q113.75
median26.5
Q339.25
95-th percentile49.45
Maximum52
Range51
Interquartile range (IQR)25.5

Descriptive statistics

Standard deviation15.154757
Coefficient of variation (CV)0.57187763
Kurtosis-1.2
Mean26.5
Median Absolute Deviation (MAD)13
Skewness0
Sum1378
Variance229.66667
MonotonicityStrictly increasing
2023-12-11T02:31:35.150323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.9%
28 1
 
1.9%
30 1
 
1.9%
31 1
 
1.9%
32 1
 
1.9%
33 1
 
1.9%
34 1
 
1.9%
35 1
 
1.9%
36 1
 
1.9%
37 1
 
1.9%
Other values (42) 42
80.8%
ValueCountFrequency (%)
1 1
1.9%
2 1
1.9%
3 1
1.9%
4 1
1.9%
5 1
1.9%
6 1
1.9%
7 1
1.9%
8 1
1.9%
9 1
1.9%
10 1
1.9%
ValueCountFrequency (%)
52 1
1.9%
51 1
1.9%
50 1
1.9%
49 1
1.9%
48 1
1.9%
47 1
1.9%
46 1
1.9%
45 1
1.9%
44 1
1.9%
43 1
1.9%

소독업소명칭
Text

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-11T02:31:35.682876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length12
Mean length7.25
Min length2

Characters and Unicode

Total characters377
Distinct characters139
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)100.0%

Sample

1st row부산방역공사
2nd row영남방역공사
3rd row(주)구구환경공사
4th row주식회사 태극산업
5th row산동방역
ValueCountFrequency (%)
주식회사 10
 
14.3%
방역 2
 
2.9%
부산방역공사 1
 
1.4%
주)선진솔루션 1
 
1.4%
후드솔로몬 1
 
1.4%
에스에이씨 1
 
1.4%
주)유칼릭스 1
 
1.4%
다올(daall 1
 
1.4%
주)대명에프엠 1
 
1.4%
바이제로(vi-zero 1
 
1.4%
Other values (50) 50
71.4%
2023-12-11T02:31:36.458546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
5.0%
18
 
4.8%
16
 
4.2%
16
 
4.2%
14
 
3.7%
( 13
 
3.4%
) 13
 
3.4%
10
 
2.7%
10
 
2.7%
10
 
2.7%
Other values (129) 238
63.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 295
78.2%
Space Separator 18
 
4.8%
Lowercase Letter 17
 
4.5%
Uppercase Letter 14
 
3.7%
Open Punctuation 13
 
3.4%
Close Punctuation 13
 
3.4%
Other Punctuation 3
 
0.8%
Other Symbol 2
 
0.5%
Decimal Number 1
 
0.3%
Dash Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
6.4%
16
 
5.4%
16
 
5.4%
14
 
4.7%
10
 
3.4%
10
 
3.4%
10
 
3.4%
9
 
3.1%
6
 
2.0%
6
 
2.0%
Other values (99) 179
60.7%
Lowercase Letter
ValueCountFrequency (%)
a 3
17.6%
o 3
17.6%
l 2
11.8%
e 2
11.8%
r 1
 
5.9%
d 1
 
5.9%
g 1
 
5.9%
c 1
 
5.9%
p 1
 
5.9%
s 1
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
P 2
14.3%
M 2
14.3%
C 2
14.3%
Z 1
7.1%
F 1
7.1%
V 1
7.1%
B 1
7.1%
G 1
7.1%
D 1
7.1%
R 1
7.1%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
& 1
33.3%
Space Separator
ValueCountFrequency (%)
18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Decimal Number
ValueCountFrequency (%)
5 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 297
78.8%
Common 49
 
13.0%
Latin 31
 
8.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
6.4%
16
 
5.4%
16
 
5.4%
14
 
4.7%
10
 
3.4%
10
 
3.4%
10
 
3.4%
9
 
3.0%
6
 
2.0%
6
 
2.0%
Other values (100) 181
60.9%
Latin
ValueCountFrequency (%)
a 3
 
9.7%
o 3
 
9.7%
l 2
 
6.5%
e 2
 
6.5%
P 2
 
6.5%
M 2
 
6.5%
C 2
 
6.5%
Z 1
 
3.2%
F 1
 
3.2%
r 1
 
3.2%
Other values (12) 12
38.7%
Common
ValueCountFrequency (%)
18
36.7%
( 13
26.5%
) 13
26.5%
. 2
 
4.1%
5 1
 
2.0%
- 1
 
2.0%
& 1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 295
78.2%
ASCII 80
 
21.2%
None 2
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
19
 
6.4%
16
 
5.4%
16
 
5.4%
14
 
4.7%
10
 
3.4%
10
 
3.4%
10
 
3.4%
9
 
3.1%
6
 
2.0%
6
 
2.0%
Other values (99) 179
60.7%
ASCII
ValueCountFrequency (%)
18
22.5%
( 13
16.2%
) 13
16.2%
a 3
 
3.8%
o 3
 
3.8%
l 2
 
2.5%
e 2
 
2.5%
P 2
 
2.5%
M 2
 
2.5%
. 2
 
2.5%
Other values (19) 20
25.0%
None
ValueCountFrequency (%)
2
100.0%
Distinct50
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-11T02:31:36.962683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length40
Mean length32.692308
Min length19

Characters and Unicode

Total characters1700
Distinct characters130
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)92.3%

Sample

1st row부산광역시 남구 유엔로 104-1 (대연동, (2층))
2nd row부산광역시 남구 석포로127번길 23 (대연동)
3rd row부산광역시 남구 우암로154번길 57 (우암동)
4th row부산광역시 남구 유엔평화로4번길 81, 3층 (대연동)
5th row부산광역시 남구 용호로110번길 17 (용호동)
ValueCountFrequency (%)
부산광역시 52
 
15.4%
남구 52
 
15.4%
대연동 18
 
5.3%
2층 8
 
2.4%
문현동 8
 
2.4%
1층 7
 
2.1%
수영로 7
 
2.1%
용호동 6
 
1.8%
3층 5
 
1.5%
유엔로 4
 
1.2%
Other values (138) 171
50.6%
2023-12-11T02:31:37.723997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
287
 
16.9%
1 80
 
4.7%
58
 
3.4%
54
 
3.2%
54
 
3.2%
53
 
3.1%
52
 
3.1%
52
 
3.1%
52
 
3.1%
52
 
3.1%
Other values (120) 906
53.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 961
56.5%
Space Separator 287
 
16.9%
Decimal Number 287
 
16.9%
Close Punctuation 49
 
2.9%
Open Punctuation 49
 
2.9%
Other Punctuation 48
 
2.8%
Uppercase Letter 11
 
0.6%
Dash Punctuation 4
 
0.2%
Lowercase Letter 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
58
 
6.0%
54
 
5.6%
54
 
5.6%
53
 
5.5%
52
 
5.4%
52
 
5.4%
52
 
5.4%
52
 
5.4%
52
 
5.4%
34
 
3.5%
Other values (92) 448
46.6%
Decimal Number
ValueCountFrequency (%)
1 80
27.9%
2 49
17.1%
3 36
12.5%
4 26
 
9.1%
0 25
 
8.7%
6 20
 
7.0%
5 14
 
4.9%
9 13
 
4.5%
8 13
 
4.5%
7 11
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
I 2
18.2%
H 1
9.1%
F 1
9.1%
V 1
9.1%
W 1
9.1%
A 1
9.1%
E 1
9.1%
S 1
9.1%
K 1
9.1%
C 1
9.1%
Lowercase Letter
ValueCountFrequency (%)
l 2
50.0%
s 1
25.0%
i 1
25.0%
Space Separator
ValueCountFrequency (%)
287
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Other Punctuation
ValueCountFrequency (%)
, 48
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 961
56.5%
Common 724
42.6%
Latin 15
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
58
 
6.0%
54
 
5.6%
54
 
5.6%
53
 
5.5%
52
 
5.4%
52
 
5.4%
52
 
5.4%
52
 
5.4%
52
 
5.4%
34
 
3.5%
Other values (92) 448
46.6%
Common
ValueCountFrequency (%)
287
39.6%
1 80
 
11.0%
) 49
 
6.8%
( 49
 
6.8%
2 49
 
6.8%
, 48
 
6.6%
3 36
 
5.0%
4 26
 
3.6%
0 25
 
3.5%
6 20
 
2.8%
Other values (5) 55
 
7.6%
Latin
ValueCountFrequency (%)
I 2
13.3%
l 2
13.3%
s 1
 
6.7%
H 1
 
6.7%
F 1
 
6.7%
i 1
 
6.7%
V 1
 
6.7%
W 1
 
6.7%
A 1
 
6.7%
E 1
 
6.7%
Other values (3) 3
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 961
56.5%
ASCII 739
43.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
287
38.8%
1 80
 
10.8%
) 49
 
6.6%
( 49
 
6.6%
2 49
 
6.6%
, 48
 
6.5%
3 36
 
4.9%
4 26
 
3.5%
0 25
 
3.4%
6 20
 
2.7%
Other values (18) 70
 
9.5%
Hangul
ValueCountFrequency (%)
58
 
6.0%
54
 
5.6%
54
 
5.6%
53
 
5.5%
52
 
5.4%
52
 
5.4%
52
 
5.4%
52
 
5.4%
52
 
5.4%
34
 
3.5%
Other values (92) 448
46.6%

영업소전화번호
Text

MISSING 

Distinct35
Distinct (%)94.6%
Missing15
Missing (%)28.8%
Memory size548.0 B
2023-12-11T02:31:38.083032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.540541
Min length9

Characters and Unicode

Total characters427
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)89.2%

Sample

1st row051-624-8464
2nd row051-628-2886
3rd row051-632-9941
4th row051-627-4842
5th row051-611-2739
ValueCountFrequency (%)
051-627-4842 2
 
5.4%
051-623-0931 2
 
5.4%
051-711-1444 1
 
2.7%
051-625-5567 1
 
2.7%
051-714-0050 1
 
2.7%
051-626-3322 1
 
2.7%
1833-2919 1
 
2.7%
1566-6730 1
 
2.7%
1644-3044 1
 
2.7%
051-958-1919 1
 
2.7%
Other values (25) 25
67.6%
2023-12-11T02:31:38.658940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 68
15.9%
1 60
14.1%
0 56
13.1%
5 50
11.7%
6 46
10.8%
2 31
7.3%
3 31
7.3%
9 25
 
5.9%
4 23
 
5.4%
8 19
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 359
84.1%
Dash Punctuation 68
 
15.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 60
16.7%
0 56
15.6%
5 50
13.9%
6 46
12.8%
2 31
8.6%
3 31
8.6%
9 25
7.0%
4 23
 
6.4%
8 19
 
5.3%
7 18
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 68
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 427
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 68
15.9%
1 60
14.1%
0 56
13.1%
5 50
11.7%
6 46
10.8%
2 31
7.3%
3 31
7.3%
9 25
 
5.9%
4 23
 
5.4%
8 19
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 427
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 68
15.9%
1 60
14.1%
0 56
13.1%
5 50
11.7%
6 46
10.8%
2 31
7.3%
3 31
7.3%
9 25
 
5.9%
4 23
 
5.4%
8 19
 
4.4%

Interactions

2023-12-11T02:31:34.197733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:31:38.844793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번소독업소명칭사무실소재지(도로명)영업소전화번호
순번1.0001.0000.8820.663
소독업소명칭1.0001.0001.0001.000
사무실소재지(도로명)0.8821.0001.0001.000
영업소전화번호0.6631.0001.0001.000

Missing values

2023-12-11T02:31:34.422359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:31:34.618631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번소독업소명칭사무실소재지(도로명)영업소전화번호
01부산방역공사부산광역시 남구 유엔로 104-1 (대연동, (2층))051-624-8464
12영남방역공사부산광역시 남구 석포로127번길 23 (대연동)051-628-2886
23(주)구구환경공사부산광역시 남구 우암로154번길 57 (우암동)051-632-9941
34주식회사 태극산업부산광역시 남구 유엔평화로4번길 81, 3층 (대연동)051-627-4842
45산동방역부산광역시 남구 용호로110번길 17 (용호동)051-611-2739
56케이피엠부산광역시 남구 우암로104번길 29, 111동 201호 (감만동)051-635-8086
67제명 방역공사부산광역시 남구 유엔로201번길 43 (대연동)051-623-7139
78주식회사 보승부산광역시 남구 지게골로 101-22 (문현동)051-643-9365
89(주)미화실업부산광역시 남구 유엔평화로4번길 61 (대연동, (4층))051-469-0900
910P.C.M환경부산광역시 남구 동명로146번길 134 (용호동)051-611-2006
순번소독업소명칭사무실소재지(도로명)영업소전화번호
4243㈜부산방역부산광역시 남구 수영로 39번 나길 15, 1층<NA>
4344클린앤케어부산광역시 남구 유엔로 120번 가길 19, 1층<NA>
4445청조방역부산광역시 남구 수영로 26, 2층 271호(문현동, 대림문현시티프라자)<NA>
4546클린구조대부산광역시 남구 우암로 2번길 30, 상가동 102호(감만 현대3차 아파트)1833-7735
4647㈜시티캅부산광역시 남구 자성로 152, 한일오피스텔 3층051-630-9312
4748주식회사 이오테크부산광역시 남구 신선로 365, 부경대학교용당캠퍼스(용당동)051-622-6263
4849일진환경부산광역시 남구 양지골로 169번길 25-1, 1층(감만동)051-635-3368
4950제로킬부산광역시 남구 동명로 146번길 132, 풍성숯불갈비(용호동)<NA>
5051우경부산광역시 남구 유엔로 66, 상가A동 104호(우암동, 대상하이아트)<NA>
5152방력부산광역시 남구 전포대로 133, 문현금융단지 IFC부산 오피스텔 23층 2319호(문현동)<NA>