Overview

Dataset statistics

Number of variables4
Number of observations105
Missing cells22
Missing cells (%)5.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.4 KiB
Average record size in memory33.3 B

Variable types

Text3
DateTime1

Dataset

Description부산광역시기장군_위생관리업_현황_20220316
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15069114

Alerts

소재지전화 has 21 (20.0%) missing valuesMissing
업소명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:18:05.396984
Analysis finished2023-12-10 17:18:07.143813
Duration1.75 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소명
Text

UNIQUE 

Distinct105
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size972.0 B
2023-12-11T02:18:07.478858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length15
Mean length7.4
Min length2

Characters and Unicode

Total characters777
Distinct characters187
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique105 ?
Unique (%)100.0%

Sample

1st row(주)동래위생공사
2nd row제이그린테크
3rd row경남상사
4th row홍익환경산업
5th row양일건업
ValueCountFrequency (%)
주식회사 10
 
8.3%
동해상사 1
 
0.8%
주)해성씨앤에이 1
 
0.8%
삼광지에스(gs 1
 
0.8%
율성 1
 
0.8%
유창홀딩스주식회사 1
 
0.8%
대아이앤씨(주 1
 
0.8%
희망기장협동조합 1
 
0.8%
인성기업 1
 
0.8%
주식회사라이크개발 1
 
0.8%
Other values (101) 101
84.2%
2023-12-11T02:18:08.928207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
55
 
7.1%
) 42
 
5.4%
( 41
 
5.3%
35
 
4.5%
24
 
3.1%
18
 
2.3%
17
 
2.2%
15
 
1.9%
15
 
1.9%
13
 
1.7%
Other values (177) 502
64.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 666
85.7%
Close Punctuation 42
 
5.4%
Open Punctuation 41
 
5.3%
Space Separator 15
 
1.9%
Uppercase Letter 11
 
1.4%
Other Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
55
 
8.3%
35
 
5.3%
24
 
3.6%
18
 
2.7%
17
 
2.6%
15
 
2.3%
13
 
2.0%
13
 
2.0%
13
 
2.0%
12
 
1.8%
Other values (164) 451
67.7%
Uppercase Letter
ValueCountFrequency (%)
E 2
18.2%
G 2
18.2%
M 1
9.1%
H 1
9.1%
C 1
9.1%
J 1
9.1%
T 1
9.1%
S 1
9.1%
N 1
9.1%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%
Open Punctuation
ValueCountFrequency (%)
( 41
100.0%
Space Separator
ValueCountFrequency (%)
15
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 666
85.7%
Common 100
 
12.9%
Latin 11
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
55
 
8.3%
35
 
5.3%
24
 
3.6%
18
 
2.7%
17
 
2.6%
15
 
2.3%
13
 
2.0%
13
 
2.0%
13
 
2.0%
12
 
1.8%
Other values (164) 451
67.7%
Latin
ValueCountFrequency (%)
E 2
18.2%
G 2
18.2%
M 1
9.1%
H 1
9.1%
C 1
9.1%
J 1
9.1%
T 1
9.1%
S 1
9.1%
N 1
9.1%
Common
ValueCountFrequency (%)
) 42
42.0%
( 41
41.0%
15
 
15.0%
, 2
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 666
85.7%
ASCII 111
 
14.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
55
 
8.3%
35
 
5.3%
24
 
3.6%
18
 
2.7%
17
 
2.6%
15
 
2.3%
13
 
2.0%
13
 
2.0%
13
 
2.0%
12
 
1.8%
Other values (164) 451
67.7%
ASCII
ValueCountFrequency (%)
) 42
37.8%
( 41
36.9%
15
 
13.5%
, 2
 
1.8%
E 2
 
1.8%
G 2
 
1.8%
M 1
 
0.9%
H 1
 
0.9%
C 1
 
0.9%
J 1
 
0.9%
Other values (3) 3
 
2.7%
Distinct96
Distinct (%)92.3%
Missing1
Missing (%)1.0%
Memory size972.0 B
2023-12-11T02:18:09.743174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length36
Mean length26.625
Min length20

Characters and Unicode

Total characters2769
Distinct characters125
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)85.6%

Sample

1st row부산광역시 기장군 장안읍 해맞이로 18, 1층
2nd row부산광역시 기장군 장안읍 월내해안4길 9-1
3rd row부산광역시 기장군 장안읍 월내1길 3
4th row부산광역시 기장군 기장읍 차성로 314
5th row부산광역시 기장군 장안읍 월내1길 3
ValueCountFrequency (%)
부산광역시 104
16.8%
기장군 104
16.8%
장안읍 55
 
8.9%
기장읍 26
 
4.2%
1층 24
 
3.9%
2층 15
 
2.4%
일광면 14
 
2.3%
길천길 11
 
1.8%
정관읍 9
 
1.5%
해맞이로 8
 
1.3%
Other values (169) 250
40.3%
2023-12-11T02:18:10.675214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
516
18.6%
193
 
7.0%
132
 
4.8%
124
 
4.5%
1 114
 
4.1%
112
 
4.0%
106
 
3.8%
105
 
3.8%
104
 
3.8%
104
 
3.8%
Other values (115) 1159
41.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1681
60.7%
Space Separator 516
 
18.6%
Decimal Number 453
 
16.4%
Other Punctuation 76
 
2.7%
Dash Punctuation 20
 
0.7%
Uppercase Letter 8
 
0.3%
Open Punctuation 7
 
0.3%
Close Punctuation 7
 
0.3%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
193
 
11.5%
132
 
7.9%
124
 
7.4%
112
 
6.7%
106
 
6.3%
105
 
6.2%
104
 
6.2%
104
 
6.2%
90
 
5.4%
80
 
4.8%
Other values (95) 531
31.6%
Decimal Number
ValueCountFrequency (%)
1 114
25.2%
3 70
15.5%
2 70
15.5%
4 53
11.7%
5 29
 
6.4%
0 26
 
5.7%
7 24
 
5.3%
9 23
 
5.1%
8 22
 
4.9%
6 22
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
B 4
50.0%
A 2
25.0%
D 1
 
12.5%
L 1
 
12.5%
Space Separator
ValueCountFrequency (%)
516
100.0%
Other Punctuation
ValueCountFrequency (%)
, 76
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1681
60.7%
Common 1080
39.0%
Latin 8
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
193
 
11.5%
132
 
7.9%
124
 
7.4%
112
 
6.7%
106
 
6.3%
105
 
6.2%
104
 
6.2%
104
 
6.2%
90
 
5.4%
80
 
4.8%
Other values (95) 531
31.6%
Common
ValueCountFrequency (%)
516
47.8%
1 114
 
10.6%
, 76
 
7.0%
3 70
 
6.5%
2 70
 
6.5%
4 53
 
4.9%
5 29
 
2.7%
0 26
 
2.4%
7 24
 
2.2%
9 23
 
2.1%
Other values (6) 79
 
7.3%
Latin
ValueCountFrequency (%)
B 4
50.0%
A 2
25.0%
D 1
 
12.5%
L 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1681
60.7%
ASCII 1088
39.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
516
47.4%
1 114
 
10.5%
, 76
 
7.0%
3 70
 
6.4%
2 70
 
6.4%
4 53
 
4.9%
5 29
 
2.7%
0 26
 
2.4%
7 24
 
2.2%
9 23
 
2.1%
Other values (10) 87
 
8.0%
Hangul
ValueCountFrequency (%)
193
 
11.5%
132
 
7.9%
124
 
7.4%
112
 
6.7%
106
 
6.3%
105
 
6.2%
104
 
6.2%
104
 
6.2%
90
 
5.4%
80
 
4.8%
Other values (95) 531
31.6%

소재지전화
Text

MISSING 

Distinct83
Distinct (%)98.8%
Missing21
Missing (%)20.0%
Memory size972.0 B
2023-12-11T02:18:11.102582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length14
Min length14

Characters and Unicode

Total characters1176
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique82 ?
Unique (%)97.6%

Sample

1st row 051- 727-3345
2nd row 051- 727-3427
3rd row 051- 727-5586
4th row 051- 722-5711
5th row 051- 727-1558
ValueCountFrequency (%)
051 79
33.8%
727 25
 
10.7%
722 9
 
3.8%
728 6
 
2.6%
724 6
 
2.6%
070 3
 
1.3%
723 3
 
1.3%
1722 2
 
0.9%
4567 2
 
0.9%
758 2
 
0.9%
Other values (95) 97
41.5%
2023-12-11T02:18:11.774286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 168
14.3%
165
14.0%
7 149
12.7%
5 137
11.6%
0 135
11.5%
1 118
10.0%
2 118
10.0%
4 53
 
4.5%
6 37
 
3.1%
8 36
 
3.1%
Other values (2) 60
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 843
71.7%
Dash Punctuation 168
 
14.3%
Space Separator 165
 
14.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
7 149
17.7%
5 137
16.3%
0 135
16.0%
1 118
14.0%
2 118
14.0%
4 53
 
6.3%
6 37
 
4.4%
8 36
 
4.3%
3 36
 
4.3%
9 24
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 168
100.0%
Space Separator
ValueCountFrequency (%)
165
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1176
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 168
14.3%
165
14.0%
7 149
12.7%
5 137
11.6%
0 135
11.5%
1 118
10.0%
2 118
10.0%
4 53
 
4.5%
6 37
 
3.1%
8 36
 
3.1%
Other values (2) 60
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1176
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 168
14.3%
165
14.0%
7 149
12.7%
5 137
11.6%
0 135
11.5%
1 118
10.0%
2 118
10.0%
4 53
 
4.5%
6 37
 
3.1%
8 36
 
3.1%
Other values (2) 60
 
5.1%
Distinct101
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size972.0 B
Minimum1996-04-25 00:00:00
Maximum2022-02-15 00:00:00
2023-12-11T02:18:12.071931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T02:18:12.486993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Correlations

2023-12-11T02:18:12.705664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업소 주소(도로명)소재지전화
영업소 주소(도로명)1.0000.995
소재지전화0.9951.000

Missing values

2023-12-11T02:18:06.617512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:18:06.830825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T02:18:07.031123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업소명영업소 주소(도로명)소재지전화영업자시작일
0(주)동래위생공사부산광역시 기장군 장안읍 해맞이로 18, 1층051- 727-33452021-01-15
1제이그린테크부산광역시 기장군 장안읍 월내해안4길 9-1051- 727-34271996-04-25
2경남상사부산광역시 기장군 장안읍 월내1길 3051- 727-55861998-02-17
3홍익환경산업부산광역시 기장군 기장읍 차성로 314051- 722-57111998-04-27
4양일건업부산광역시 기장군 장안읍 월내1길 3051- 727-15582003-03-20
5명진개발부산광역시 기장군 장안읍 월내해안4길 9-1, 2층051 -727 -45672003-03-27
6고성공영부산광역시 기장군 기장읍 차성로 314051- 722-06062002-09-12
7국일산업부산광역시 기장군 장안읍 해맞이로 381-10051- 727-54742003-05-13
8(주)에이스플러스부산광역시 기장군 기장읍 차성로418번길 14, 2층051- 724-47002012-09-10
9대명엔지니어링부산광역시 기장군 장안읍 길천1길 17051-727 -64462005-11-15
업소명영업소 주소(도로명)소재지전화영업자시작일
95주식회사올바른부산광역시 기장군 장안읍 장곡길 173, 107호051 -971 -70022020-07-07
96더바른클린부산광역시 기장군 기장읍 차성로 342, 3층051 -722 -21702021-02-17
97(주)클린엔클린부산광역시 기장군 기장읍 차성로344번길 13, 상가동 307호 (한신아파트)051 -754 -40042021-04-16
98(주)동연부산광역시 기장군 장안읍 좌천4길 9051 -532 -47282021-05-07
99현이엔지(주)부산광역시 기장군 장안읍 길천길 65, 102호 일부호 (진주델링타운)051 -727 -84102021-07-13
100원전지역장애인고용창출협회부산광역시 기장군 일광면 이천8길 44, 1,2층<NA>2021-08-06
101에스엠씨이앤아이티(주)부산광역시 기장군 장안읍 길천2길 33, 1층070 -7327-33742021-08-30
102주식회사 더깨끗한환경부산광역시 기장군 기장읍 차성로249번길 8, 1층<NA>2021-12-22
103(주)다은유통부산광역시 기장군 기장읍 반송로 1582, 화승주유소 2층051- 722-22982022-01-19
104요기요마켓두원부산광역시 기장군 기장읍 기장해안로 480, 1층<NA>2022-02-15