Overview

Dataset statistics

Number of variables: 5
Number of observations: 39
Missing cells: 18
Missing cells (%): 9.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.7 KiB
Average record size in memory: 44.4 B
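
The missing-cell percentage follows directly from the counts above: 18 missing cells out of 39 × 5 = 195 cells. A minimal sketch of that arithmetic in Python (the variable names are illustrative, not part of the report):

```python
# Counts copied from the dataset statistics above.
n_variables = 5
n_observations = 39
missing_cells = 18

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

All 18 missing cells belong to a single column (영업소전화번호), which is why that column alone shows 18 / 39 = 46.2% missing.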

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

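Description: 부산광역시연제구_소독업소현황_20221110 (Busan Metropolitan City, Yeonje-gu: status of disinfection businesses, 2022-11-10)
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15030010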

Alerts

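업종 (business type) has constant value "소독업" (disinfection business): Constant
영업소전화번호 (office phone number) has 18 (46.2%) missing values: Missing
연번 (serial number) has unique values: Unique
소독업소명칭 (disinfection business name) has unique values: Unique
사무실소재지(도로명) (office location, road-name address) has unique values: Unique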

Reproduction

Analysis started: 2023-12-10 16:17:53.628051
Analysis finished: 2023-12-10 16:17:54.208519
Duration: 0.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 39
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20
Minimum: 1
Maximum: 39
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 483.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.9
Q1: 10.5
Median: 20
Q3: 29.5
95-th percentile: 37.1
Maximum: 39
Range: 38
Interquartile range (IQR): 19
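
The quantile figures above can be reproduced by linear interpolation between neighbouring order statistics, since 연번 is simply the sequence 1 to 39. The sketch below assumes that interpolation rule (it is the default in pandas' `quantile`); the helper name `percentile` is hypothetical and not taken from the profiling tool.

```python
def percentile(sorted_values, q):
    """Linear-interpolation percentile (q in [0, 100]) of a pre-sorted list."""
    pos = (len(sorted_values) - 1) * q / 100   # fractional index into the list
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


values = list(range(1, 40))  # 연번 runs from 1 to 39 with no gaps

p5 = percentile(values, 5)
q1 = percentile(values, 25)
median = percentile(values, 50)
q3 = percentile(values, 75)
p95 = percentile(values, 95)
iqr = q3 - q1

print(round(p5, 1), q1, median, q3, round(p95, 1), iqr)
```

Under that assumption the 5-th percentile sits at fractional index 0.05 × 38 = 1.9, that is 90% of the way from the value 2 to the value 3, which gives 2.9.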

Descriptive statistics

Standard deviation: 11.401754
Coefficient of variation (CV): 0.57008771
Kurtosis: -1.2
Mean: 20
Median Absolute Deviation (MAD): 10
Skewness: 0
Sum: 780
Variance: 130
Monotonicity: Strictly increasing
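
Several of the descriptive statistics above can be checked with the Python standard library. This is a sketch under two assumptions: that the reported variance uses the sample (n - 1) denominator, and that MAD is the median of absolute deviations from the median. Both reproduce the reported values for the sequence 1 to 39.

```python
import statistics

values = list(range(1, 40))  # 연번 runs from 1 to 39

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation

median = statistics.median(values)
# Median absolute deviation: median of |x - median(x)|.
mad = statistics.median([abs(v - median) for v in values])

# A strictly increasing column has every value larger than its predecessor.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, mean, variance, round(std, 6), round(cv, 8), mad, strictly_increasing)
```

Skewness is 0 because the values are symmetric around the mean, and the negative kurtosis reflects the flat, uniform shape of a plain row counter.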
Histogram with fixed size bins (bins=39)
Value | Count | Frequency (%)
1 | 1 | 2.6%
2 | 1 | 2.6%
23 | 1 | 2.6%
24 | 1 | 2.6%
25 | 1 | 2.6%
26 | 1 | 2.6%
27 | 1 | 2.6%
28 | 1 | 2.6%
29 | 1 | 2.6%
30 | 1 | 2.6%
Other values (29) | 29 | 74.4%

Value | Count | Frequency (%)
1 | 1 | 2.6%
2 | 1 | 2.6%
3 | 1 | 2.6%
4 | 1 | 2.6%
5 | 1 | 2.6%
6 | 1 | 2.6%
7 | 1 | 2.6%
8 | 1 | 2.6%
9 | 1 | 2.6%
10 | 1 | 2.6%

Value | Count | Frequency (%)
39 | 1 | 2.6%
38 | 1 | 2.6%
37 | 1 | 2.6%
36 | 1 | 2.6%
35 | 1 | 2.6%
34 | 1 | 2.6%
33 | 1 | 2.6%
32 | 1 | 2.6%
31 | 1 | 2.6%
30 | 1 | 2.6%

업종
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.6%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B
소독업
39 

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 소독업
2nd row: 소독업
3rd row: 소독업
4th row: 소독업
5th row: 소독업

Common Values

Value | Count | Frequency (%)
소독업 | 39 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
소독업 | 39 | 100.0%

소독업소명칭
Text

UNIQUE 

Distinct: 39
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B

Length

Max length: 20
Median length: 13
Mean length: 7.3846154
Min length: 2

Characters and Unicode

Total characters: 288
Distinct characters: 118
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
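
As a small illustration of these character properties, Python's standard `unicodedata` module returns the general category of each code point. The sketch below tallies categories for one business name from this column, "그린 F5 연제본부". The report's long names map to the two-letter codes as follows: Lo is Other Letter, Zs is Space Separator, Lu is Uppercase Letter, Nd is Decimal Number. Script and block lookups are not covered here because the standard library does not expose them directly.

```python
import unicodedata
from collections import Counter

name = "그린 F5 연제본부"  # one of the values in 소독업소명칭

# Two-letter general category for every character in the name.
categories = Counter(unicodedata.category(ch) for ch in name)

for code, count in categories.most_common():
    print(code, count)
```

Hangul syllables fall under Other Letter because the script has no upper or lower case, which is why that category dominates the tables that follow.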

Unique

Unique: 39
Unique (%): 100.0%

Sample

1st row: 부산본사 퍼펙트방역 소독 전문업체
2nd row: 창하토탈관리
3rd row: 삼인IP
4th row: 그린 F5 연제본부
5th row: 수인방역
Value | Count | Frequency (%)
주식회사 | 4 | 7.7%
부산본사 | 1 | 1.9%
부일환경(주 | 1 | 1.9%
에스비엠 | 1 | 1.9%
유니케어 | 1 | 1.9%
허브원 | 1 | 1.9%
주)케이제이씨에스 | 1 | 1.9%
조은피앤피 | 1 | 1.9%
우재방역 | 1 | 1.9%
주)삼성 | 1 | 1.9%
Other values (39) | 39 | 75.0%

Most occurring characters

ValueCountFrequency (%)
16
 
5.6%
) 14
 
4.9%
( 14
 
4.9%
13
 
4.5%
8
 
2.8%
8
 
2.8%
7
 
2.4%
6
 
2.1%
6
 
2.1%
6
 
2.1%
Other values (108) 190
66.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 225
78.1%
Close Punctuation 14
 
4.9%
Open Punctuation 14
 
4.9%
Space Separator 13
 
4.5%
Uppercase Letter 11
 
3.8%
Lowercase Letter 10
 
3.5%
Decimal Number 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
7.1%
8
 
3.6%
8
 
3.6%
7
 
3.1%
6
 
2.7%
6
 
2.7%
6
 
2.7%
5
 
2.2%
5
 
2.2%
5
 
2.2%
Other values (87) 153
68.0%
Uppercase Letter
ValueCountFrequency (%)
O 2
18.2%
P 2
18.2%
E 1
9.1%
N 1
9.1%
T 1
9.1%
G 1
9.1%
Y 1
9.1%
F 1
9.1%
I 1
9.1%
Lowercase Letter
ValueCountFrequency (%)
t 2
20.0%
o 2
20.0%
n 1
10.0%
l 1
10.0%
r 1
10.0%
e 1
10.0%
s 1
10.0%
c 1
10.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Space Separator
ValueCountFrequency (%)
13
100.0%
Decimal Number
ValueCountFrequency (%)
5 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 225
78.1%
Common 42
 
14.6%
Latin 21
 
7.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
7.1%
8
 
3.6%
8
 
3.6%
7
 
3.1%
6
 
2.7%
6
 
2.7%
6
 
2.7%
5
 
2.2%
5
 
2.2%
5
 
2.2%
Other values (87) 153
68.0%
Latin
ValueCountFrequency (%)
O 2
 
9.5%
t 2
 
9.5%
o 2
 
9.5%
P 2
 
9.5%
n 1
 
4.8%
E 1
 
4.8%
N 1
 
4.8%
T 1
 
4.8%
G 1
 
4.8%
l 1
 
4.8%
Other values (7) 7
33.3%
Common
ValueCountFrequency (%)
) 14
33.3%
( 14
33.3%
13
31.0%
5 1
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 225
78.1%
ASCII 63
 
21.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
16
 
7.1%
8
 
3.6%
8
 
3.6%
7
 
3.1%
6
 
2.7%
6
 
2.7%
6
 
2.7%
5
 
2.2%
5
 
2.2%
5
 
2.2%
Other values (87) 153
68.0%
ASCII
ValueCountFrequency (%)
) 14
22.2%
( 14
22.2%
13
20.6%
O 2
 
3.2%
t 2
 
3.2%
o 2
 
3.2%
P 2
 
3.2%
n 1
 
1.6%
E 1
 
1.6%
N 1
 
1.6%
Other values (11) 11
17.5%

사무실소재지(도로명)
Text

UNIQUE 

Distinct: 39
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B

Length

Max length: 52
Median length: 36
Mean length: 30.512821
Min length: 23

Characters and Unicode

Total characters: 1190
Distinct characters: 82
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 39
Unique (%): 100.0%

Sample

1st row: 부산광역시 연제구 과정로283번나길 33, 2층 (연산동)
2nd row: 부산광역시 연제구 중앙대로 1080, 디오빌 905호 (연산동)
3rd row: 부산광역시 연제구 월드컵대로 160, 6층 615호 (연산동)
4th row: 부산광역시 연제구 과정로191번가길 39, 1층 (연산동)
5th row: 부산광역시 연제구 중앙대로 1116-5, 602호 (연산동)
Value | Count | Frequency (%)
부산광역시 | 39 | 16.7%
연제구 | 39 | 16.7%
연산동 | 30 | 12.8%
2층 | 9 | 3.8%
거제동 | 8 | 3.4%
3층 | 4 | 1.7%
과정로 | 4 | 1.7%
1층 | 4 | 1.7%
중앙대로 | 3 | 1.3%
9 | 3 | 1.3%
Other values (79) | 91 | 38.9%

Most occurring characters

ValueCountFrequency (%)
195
 
16.4%
75
 
6.3%
70
 
5.9%
50
 
4.2%
43
 
3.6%
1 42
 
3.5%
41
 
3.4%
39
 
3.3%
) 39
 
3.3%
( 39
 
3.3%
Other values (72) 557
46.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 698
58.7%
Space Separator 195
 
16.4%
Decimal Number 187
 
15.7%
Close Punctuation 39
 
3.3%
Open Punctuation 39
 
3.3%
Other Punctuation 30
 
2.5%
Dash Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
75
 
10.7%
70
 
10.0%
50
 
7.2%
43
 
6.2%
41
 
5.9%
39
 
5.6%
39
 
5.6%
39
 
5.6%
39
 
5.6%
39
 
5.6%
Other values (57) 224
32.1%
Decimal Number
ValueCountFrequency (%)
1 42
22.5%
2 37
19.8%
3 19
10.2%
0 19
10.2%
5 13
 
7.0%
7 13
 
7.0%
8 13
 
7.0%
9 12
 
6.4%
4 10
 
5.3%
6 9
 
4.8%
Space Separator
ValueCountFrequency (%)
195
100.0%
Close Punctuation
ValueCountFrequency (%)
) 39
100.0%
Open Punctuation
ValueCountFrequency (%)
( 39
100.0%
Other Punctuation
ValueCountFrequency (%)
, 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 698
58.7%
Common 492
41.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
75
 
10.7%
70
 
10.0%
50
 
7.2%
43
 
6.2%
41
 
5.9%
39
 
5.6%
39
 
5.6%
39
 
5.6%
39
 
5.6%
39
 
5.6%
Other values (57) 224
32.1%
Common
ValueCountFrequency (%)
195
39.6%
1 42
 
8.5%
) 39
 
7.9%
( 39
 
7.9%
2 37
 
7.5%
, 30
 
6.1%
3 19
 
3.9%
0 19
 
3.9%
5 13
 
2.6%
7 13
 
2.6%
Other values (5) 46
 
9.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 698
58.7%
ASCII 492
41.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
195
39.6%
1 42
 
8.5%
) 39
 
7.9%
( 39
 
7.9%
2 37
 
7.5%
, 30
 
6.1%
3 19
 
3.9%
0 19
 
3.9%
5 13
 
2.6%
7 13
 
2.6%
Other values (5) 46
 
9.3%
Hangul
ValueCountFrequency (%)
75
 
10.7%
70
 
10.0%
50
 
7.2%
43
 
6.2%
41
 
5.9%
39
 
5.6%
39
 
5.6%
39
 
5.6%
39
 
5.6%
39
 
5.6%
Other values (57) 224
32.1%

영업소전화번호
Text

MISSING 

Distinct: 20
Distinct (%): 95.2%
Missing: 18
Missing (%): 46.2%
Memory size: 444.0 B

Length

Max length: 12
Median length: 12
Mean length: 11.857143
Min length: 9

Characters and Unicode

Total characters: 249
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 19
Unique (%): 90.5%
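
The gap between Distinct (20) and Unique (19) comes from one phone number, 051-441-9955, that appears twice among the 39 - 18 = 21 non-missing values. Distinct counts different values; Unique counts values that occur exactly once. The sketch below uses stand-in data with the same shape (the `placeholder-*` strings are made up for illustration and are not values from the dataset):

```python
from collections import Counter

# Stand-in data: 19 placeholder values that occur once, plus one value that
# occurs twice, mirroring the shape reported for 영업소전화번호.
non_missing = [f"placeholder-{i}" for i in range(19)] + ["051-441-9955"] * 2

counts = Counter(non_missing)
n = len(non_missing)                                  # 21 non-missing values
distinct = len(counts)                                # number of different values
unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once

print(f"Distinct {distinct} ({100 * distinct / n:.1f}%)")
print(f"Unique {unique} ({100 * unique / n:.1f}%)")
```

Both percentages are taken over the 21 non-missing values, not over all 39 rows, which matches the 95.2% and 90.5% shown above.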

Sample

1st row: 1668-1361
2nd row: 051-747-3604
3rd row: 051-927-2978
4th row: 051-441-9955
5th row: 051-851-5580
Value | Count | Frequency (%)
051-441-9955 | 2 | 9.5%
051-866-5065 | 1 | 4.8%
051-532-5200 | 1 | 4.8%
051-464-5100 | 1 | 4.8%
051-852-8219 | 1 | 4.8%
051-867-4432 | 1 | 4.8%
051-865-0166 | 1 | 4.8%
051-864-1804 | 1 | 4.8%
051-944-7171 | 1 | 4.8%
051-755-3835 | 1 | 4.8%
Other values (10) | 10 | 47.6%

Most occurring characters

ValueCountFrequency (%)
5 41
16.5%
1 41
16.5%
- 41
16.5%
0 37
14.9%
8 18
7.2%
4 16
 
6.4%
6 16
 
6.4%
7 11
 
4.4%
2 10
 
4.0%
9 9
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 208
83.5%
Dash Punctuation 41
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 41
19.7%
1 41
19.7%
0 37
17.8%
8 18
8.7%
4 16
 
7.7%
6 16
 
7.7%
7 11
 
5.3%
2 10
 
4.8%
9 9
 
4.3%
3 9
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 249
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 41
16.5%
1 41
16.5%
- 41
16.5%
0 37
14.9%
8 18
7.2%
4 16
 
6.4%
6 16
 
6.4%
7 11
 
4.4%
2 10
 
4.0%
9 9
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 249
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 41
16.5%
1 41
16.5%
- 41
16.5%
0 37
14.9%
8 18
7.2%
4 16
 
6.4%
6 16
 
6.4%
7 11
 
4.4%
2 10
 
4.0%
9 9
 
3.6%

Interactions


Correlations

(variable) | 연번 | 소독업소명칭 | 사무실소재지(도로명) | 영업소전화번호
연번 | 1.000 | 1.000 | 1.000 | 0.911
소독업소명칭 | 1.000 | 1.000 | 1.000 | 1.000
사무실소재지(도로명) | 1.000 | 1.000 | 1.000 | 1.000
영업소전화번호 | 0.911 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 업종 | 소독업소명칭 | 사무실소재지(도로명) | 영업소전화번호
0 | 1 | 소독업 | 부산본사 퍼펙트방역 소독 전문업체 | 부산광역시 연제구 과정로283번나길 33, 2층 (연산동) | <NA>
1 | 2 | 소독업 | 창하토탈관리 | 부산광역시 연제구 중앙대로 1080, 디오빌 905호 (연산동) | <NA>
2 | 3 | 소독업 | 삼인IP | 부산광역시 연제구 월드컵대로 160, 6층 615호 (연산동) | 1668-1361
3 | 4 | 소독업 | 그린 F5 연제본부 | 부산광역시 연제구 과정로191번가길 39, 1층 (연산동) | <NA>
4 | 5 | 소독업 | 수인방역 | 부산광역시 연제구 중앙대로 1116-5, 602호 (연산동) | <NA>
5 | 6 | 소독업 | 바른 | 부산광역시 연제구 월드컵대로187번길 25 (거제동) | <NA>
6 | 7 | 소독업 | 비앤케이케어 | 부산광역시 연제구 해맞이로 23 (거제동, 거제유림아시아드) | <NA>
7 | 8 | 소독업 | (주)꿈드림키즈 | 부산광역시 연제구 과정로225번길 23, 2층 (연산동) | <NA>
8 | 9 | 소독업 | (주)영인하우징 | 부산광역시 연제구 중앙대로1249번길 27 (거제동) | 051-747-3604
9 | 10 | 소독업 | (주)클라우드스토리 | 부산광역시 연제구 반송로 22, 1층 55호 (연산동, 동서상가) | <NA>

(index) | 연번 | 업종 | 소독업소명칭 | 사무실소재지(도로명) | 영업소전화번호
29 | 30 | 소독업 | 유니케어 | 부산광역시 연제구 거제천로87번길 30, 101동 3층 309호 (거제동, 연제그린타워 상가) | <NA>
30 | 31 | 소독업 | 허브원 | 부산광역시 연제구 대리로22번길 51, 지하1층 (연산동) | <NA>
31 | 32 | 소독업 | (주)한솔이앤씨 | 부산광역시 연제구 과정로 254, 2층 (연산동) | 051-755-3835
32 | 33 | 소독업 | (주)대신에이스 | 부산광역시 연제구 아시아드대로28번길 9 (거제동, 대우아파트 상가동 304호) | 051-944-7171
33 | 34 | 소독업 | 주식회사 대한시스템 | 부산광역시 연제구 마곡천로30번길 2, 2층 (연산동) | 051-864-1804
34 | 35 | 소독업 | (주)TO YO | 부산광역시 연제구 쌍미천로135번길 22, 2층 (연산동) | 051-865-0166
35 | 36 | 소독업 | 대한방역 | 부산광역시 연제구 연동로8번길 26 (연산동) | 051-867-4432
36 | 37 | 소독업 | 부산연제지역자활센터 | 부산광역시 연제구 봉수로 17, 3층 2호 (연산동) | 051-852-8219
37 | 38 | 소독업 | 대동종합환경 | 부산광역시 연제구 연미로13번길 20, 3층 (연산동) | 051-464-5100
38 | 39 | 소독업 | 롯데방역 | 부산광역시 연제구 해맞이로 71, 3층 (거제동) | 051-501-0118