Overview

Dataset statistics

Number of variables: 3
Number of observations: 96
Missing cells: 1
Missing cells (%): 0.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.5 KiB
Average record size in memory: 26.4 B

Variable types

Numeric: 1
Text: 2

Dataset

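Description: 부산광역시기장군_위생관리업_현황_20230321 (Gijang-gun, Busan Metropolitan City: status of sanitation management businesses, 20230321)
Author: 부산광역시 기장군 (Gijang-gun, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15069114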

Alerts

소재지(도로명) (road-name address) has 1 (1.0%) missing value [Missing]
Unnamed: 0 has unique values [Unique]
업소명 (business name) has unique values [Unique]
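
The percentages in the overview and alerts can be re-derived from the reported counts. The following is a minimal sketch, assuming the missing-cell percentage is the number of missing cells divided by rows times columns; the variable names are illustrative and not part of the report.

```python
# Counts taken from the dataset statistics in this report.
n_rows, n_columns = 96, 3
missing_cells = 1  # the single missing 소재지(도로명) value

# Share of missing cells across the whole table (assumed: rows x columns).
missing_cells_pct = 100 * missing_cells / (n_rows * n_columns)

# Share of missing values within the one affected column.
column_missing_pct = 100 * missing_cells / n_rows

print(f"{missing_cells_pct:.1f}%")   # overview figure
print(f"{column_missing_pct:.1f}%")  # alert figure
```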

Reproduction

Analysis started: 2023-12-10 17:17:58.828484
Analysis finished: 2023-12-10 17:17:59.944016
Duration: 1.12 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
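
A report of this kind could be regenerated along the following lines. This is a sketch only: the CSV path, the cp949 encoding, the report title, and the output file name are assumptions and do not come from the report; only `pandas.read_csv`, `ProfileReport`, and `to_file` are existing library calls.

```python
def build_report(csv_path: str, output_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report (sketch, see assumptions)."""
    # Imported lazily so the function can be defined without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the source is a CSV encoded as cp949, which is common for
    # Korean public data; use encoding="utf-8" if that is what the file uses.
    df = pd.read_csv(csv_path, encoding="cp949")

    # A column named "Unnamed: 0" typically appears when a saved index column
    # with an empty header is read back in.
    report = ProfileReport(df, title="Dataset profile")  # hypothetical title
    report.to_file(output_path)
```

Calling `build_report("data.csv")` with a hypothetical local file would write `report.html` next to the script.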

Variables

Unnamed: 0
Real number (ℝ)

UNIQUE 

Distinct: 96
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 48.5
Minimum: 1
Maximum: 96
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 996.0 B

Quantile statistics

Minimum: 1
5-th percentile: 5.75
Q1: 24.75
Median: 48.5
Q3: 72.25
95-th percentile: 91.25
Maximum: 96
Range: 95
Interquartile range (IQR): 47.5

Descriptive statistics

Standard deviation: 27.856777
Coefficient of variation (CV): 0.57436653
Kurtosis: -1.2
Mean: 48.5
Median Absolute Deviation (MAD): 24
Skewness: 0
Sum: 4656
Variance: 776
Monotonicity: Strictly increasing
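
Because Unnamed: 0 runs from 1 to 96 without gaps, most of the figures above can be checked with the standard library. The sketch below assumes sample variance (divisor n - 1) and linearly interpolated quantiles, which is what the reported values are consistent with; kurtosis and skewness are left out.

```python
import math
import statistics

values = list(range(1, 97))  # 1, 2, ..., 96, as in the Unnamed: 0 column

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divisor n - 1)
std_dev = math.sqrt(variance)
cv = std_dev / mean                      # coefficient of variation

# Quartiles with linear interpolation between the closest ranks.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std_dev, 6), round(cv, 8))
print(q1, median, q3, iqr, mad)
```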
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | 1.0%
50 | 1 | 1.0%
72 | 1 | 1.0%
71 | 1 | 1.0%
70 | 1 | 1.0%
69 | 1 | 1.0%
68 | 1 | 1.0%
67 | 1 | 1.0%
66 | 1 | 1.0%
65 | 1 | 1.0%
Other values (86) | 86 | 89.6%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.0%
2 | 1 | 1.0%
3 | 1 | 1.0%
4 | 1 | 1.0%
5 | 1 | 1.0%
6 | 1 | 1.0%
7 | 1 | 1.0%
8 | 1 | 1.0%
9 | 1 | 1.0%
10 | 1 | 1.0%

Maximum 10 values
Value | Count | Frequency (%)
96 | 1 | 1.0%
95 | 1 | 1.0%
94 | 1 | 1.0%
93 | 1 | 1.0%
92 | 1 | 1.0%
91 | 1 | 1.0%
90 | 1 | 1.0%
89 | 1 | 1.0%
88 | 1 | 1.0%
87 | 1 | 1.0%

업소명
Text

UNIQUE 

Distinct: 96
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 900.0 B

Length

Max length: 20
Median length: 15
Mean length: 7.7708333
Min length: 2

Characters and Unicode

Total characters: 746
Distinct characters: 185
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
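
The character properties mentioned above can be inspected with Python's `unicodedata` module. The sketch below counts general categories for the first sample value of 업소명; the script check based on the character name is a rough heuristic added for illustration and is not necessarily how the profiling tool detects scripts.

```python
import unicodedata
from collections import Counter

name = "(주)동래위생공사"  # first sample row of 업소명

# General category of each code point, e.g. "Lo" = Other Letter,
# "Ps" = Open Punctuation, "Pe" = Close Punctuation.
categories = Counter(unicodedata.category(ch) for ch in name)

# Rough script guess from the Unicode character name (heuristic only).
hangul = sum(unicodedata.name(ch, "").startswith("HANGUL") for ch in name)

print(categories)
print(hangul)
```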

Unique

Unique: 96
Unique (%): 100.0%

Sample

1st row: (주)동래위생공사
2nd row: 제이그린테크
3rd row: 경남상사
4th row: 홍익환경산업
5th row: 명진개발

Value | Count | Frequency (%)
주식회사 | 12 | 10.5%
주)동래위생공사 | 1 | 0.9%
인성기업 | 1 | 0.9%
서주건설(주 | 1 | 0.9%
남경엔지니어링 | 1 | 0.9%
프리즘 | 1 | 0.9%
푸른에너지 | 1 | 0.9%
미래종합환경 | 1 | 0.9%
주)해성씨앤에이 | 1 | 0.9%
삼광지에스(gs | 1 | 0.9%
Other values (93) | 93 | 81.6%

Most occurring characters

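Value | Count | Frequency (%)
(character not preserved) | 55 | 7.4%
) | 40 | 5.4%
( | 39 | 5.2%
(character not preserved) | 34 | 4.6%
(character not preserved) | 26 | 3.5%
(character not preserved) | 19 | 2.5%
(space) | 18 | 2.4%
(character not preserved) | 17 | 2.3%
(character not preserved) | 17 | 2.3%
(character not preserved) | 13 | 1.7%
Other values (175) | 468 | 62.7%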

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 638 | 85.5%
Close Punctuation | 40 | 5.4%
Open Punctuation | 39 | 5.2%
Space Separator | 18 | 2.4%
Uppercase Letter | 9 | 1.2%
Other Punctuation | 2 | 0.3%

Most frequent character per category

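Other Letter
Value | Count | Frequency (%)
(character not preserved) | 55 | 8.6%
(character not preserved) | 34 | 5.3%
(character not preserved) | 26 | 4.1%
(character not preserved) | 19 | 3.0%
(character not preserved) | 17 | 2.7%
(character not preserved) | 17 | 2.7%
(character not preserved) | 13 | 2.0%
(character not preserved) | 13 | 2.0%
(character not preserved) | 12 | 1.9%
(character not preserved) | 12 | 1.9%
Other values (164) | 420 | 65.8%

Uppercase Letter
Value | Count | Frequency (%)
E | 2 | 22.2%
G | 2 | 22.2%
S | 1 | 11.1%
C | 1 | 11.1%
T | 1 | 11.1%
H | 1 | 11.1%
N | 1 | 11.1%

Close Punctuation
Value | Count | Frequency (%)
) | 40 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 39 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 18 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 2 | 100.0%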

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 638 | 85.5%
Common | 99 | 13.3%
Latin | 9 | 1.2%

Most frequent character per script

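Hangul
Value | Count | Frequency (%)
(character not preserved) | 55 | 8.6%
(character not preserved) | 34 | 5.3%
(character not preserved) | 26 | 4.1%
(character not preserved) | 19 | 3.0%
(character not preserved) | 17 | 2.7%
(character not preserved) | 17 | 2.7%
(character not preserved) | 13 | 2.0%
(character not preserved) | 13 | 2.0%
(character not preserved) | 12 | 1.9%
(character not preserved) | 12 | 1.9%
Other values (164) | 420 | 65.8%

Latin
Value | Count | Frequency (%)
E | 2 | 22.2%
G | 2 | 22.2%
S | 1 | 11.1%
C | 1 | 11.1%
T | 1 | 11.1%
H | 1 | 11.1%
N | 1 | 11.1%

Common
Value | Count | Frequency (%)
) | 40 | 40.4%
( | 39 | 39.4%
(space) | 18 | 18.2%
, | 2 | 2.0%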

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 638 | 85.5%
ASCII | 108 | 14.5%

Most frequent character per block

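Hangul
Value | Count | Frequency (%)
(character not preserved) | 55 | 8.6%
(character not preserved) | 34 | 5.3%
(character not preserved) | 26 | 4.1%
(character not preserved) | 19 | 3.0%
(character not preserved) | 17 | 2.7%
(character not preserved) | 17 | 2.7%
(character not preserved) | 13 | 2.0%
(character not preserved) | 13 | 2.0%
(character not preserved) | 12 | 1.9%
(character not preserved) | 12 | 1.9%
Other values (164) | 420 | 65.8%

ASCII
Value | Count | Frequency (%)
) | 40 | 37.0%
( | 39 | 36.1%
(space) | 18 | 16.7%
, | 2 | 1.9%
E | 2 | 1.9%
G | 2 | 1.9%
S | 1 | 0.9%
C | 1 | 0.9%
T | 1 | 0.9%
H | 1 | 0.9%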

소재지(도로명)
Text

MISSING 

Distinct: 90
Distinct (%): 94.7%
Missing: 1
Missing (%): 1.0%
Memory size: 900.0 B

Length

Max length: 47
Median length: 37
Mean length: 27.610526
Min length: 20

Characters and Unicode

Total characters: 2623
Distinct characters: 133
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 86
Unique (%): 90.5%
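
Distinct and Unique differ for this variable (90 versus 86). Under the usual reading, distinct counts the different values present, while unique counts the values that occur exactly once. The sketch below illustrates the difference on rows 3 to 6 of the sample shown later in this report; the variable names are illustrative.

```python
from collections import Counter

# Rows 3 to 6 of the sample; "차성로 314" appears twice.
addresses = [
    "부산광역시 기장군 장안읍 월내1길 3",
    "부산광역시 기장군 기장읍 차성로 314",
    "부산광역시 기장군 장안읍 월내해안4길 9-1, 2층",
    "부산광역시 기장군 기장읍 차성로 314",
]

counts = Counter(addresses)
distinct = len(counts)                              # different values
unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

print(distinct, unique)
```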

Sample

1st row: 부산광역시 기장군 장안읍 해맞이로 18, 1층
2nd row: 부산광역시 기장군 장안읍 월내해안4길 9-1
3rd row: 부산광역시 기장군 장안읍 월내1길 3
4th row: 부산광역시 기장군 기장읍 차성로 314
5th row: 부산광역시 기장군 장안읍 월내해안4길 9-1, 2층

Value | Count | Frequency (%)
부산광역시 | 95 | 16.4%
기장군 | 95 | 16.4%
장안읍 | 50 | 8.7%
기장읍 | 26 | 4.5%
1층 | 21 | 3.6%
2층 | 16 | 2.8%
정관읍 | 11 | 1.9%
일광읍 | 8 | 1.4%
길천길 | 8 | 1.4%
해맞이로 | 7 | 1.2%
Other values (169) | 241 | 41.7%
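
A word-frequency table like the one above can be approximated by splitting each address on whitespace and counting the tokens. This is a sketch on the five sample rows only; stripping trailing commas is an assumption, and the tokenizer used by the profiling tool may behave differently.

```python
from collections import Counter

# The five sample rows of 소재지(도로명) listed above.
sample_rows = [
    "부산광역시 기장군 장안읍 해맞이로 18, 1층",
    "부산광역시 기장군 장안읍 월내해안4길 9-1",
    "부산광역시 기장군 장안읍 월내1길 3",
    "부산광역시 기장군 기장읍 차성로 314",
    "부산광역시 기장군 장안읍 월내해안4길 9-1, 2층",
]

# Split on whitespace and drop trailing commas (assumed normalization).
tokens = [tok.strip(",") for row in sample_rows for tok in row.split()]
word_counts = Counter(tokens)

for word, count in word_counts.most_common(4):
    print(word, count)
```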

Most occurring characters

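Value | Count | Frequency (%)
(space) | 483 | 18.4%
(character not preserved) | 182 | 6.9%
(character not preserved) | 121 | 4.6%
1 | 108 | 4.1%
(character not preserved) | 106 | 4.0%
(character not preserved) | 106 | 4.0%
(character not preserved) | 98 | 3.7%
(character not preserved) | 97 | 3.7%
(character not preserved) | 95 | 3.6%
(character not preserved) | 95 | 3.6%
Other values (123) | 1132 | 43.2%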

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1586 | 60.5%
Space Separator | 483 | 18.4%
Decimal Number | 434 | 16.5%
Other Punctuation | 74 | 2.8%
Dash Punctuation | 21 | 0.8%
Close Punctuation | 8 | 0.3%
Open Punctuation | 8 | 0.3%
Uppercase Letter | 8 | 0.3%
Math Symbol | 1 | < 0.1%

Most frequent character per category

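Other Letter
Value | Count | Frequency (%)
(character not preserved) | 182 | 11.5%
(character not preserved) | 121 | 7.6%
(character not preserved) | 106 | 6.7%
(character not preserved) | 106 | 6.7%
(character not preserved) | 98 | 6.2%
(character not preserved) | 97 | 6.1%
(character not preserved) | 95 | 6.0%
(character not preserved) | 95 | 6.0%
(character not preserved) | 95 | 6.0%
(character not preserved) | 71 | 4.5%
Other values (103) | 520 | 32.8%

Decimal Number
Value | Count | Frequency (%)
1 | 108 | 24.9%
2 | 72 | 16.6%
3 | 68 | 15.7%
4 | 49 | 11.3%
5 | 31 | 7.1%
6 | 24 | 5.5%
0 | 23 | 5.3%
8 | 21 | 4.8%
7 | 20 | 4.6%
9 | 18 | 4.1%

Uppercase Letter
Value | Count | Frequency (%)
A | 3 | 37.5%
B | 3 | 37.5%
D | 1 | 12.5%
L | 1 | 12.5%

Space Separator
Value | Count | Frequency (%)
(space) | 483 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 74 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 21 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 8 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 8 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%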

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1586 | 60.5%
Common | 1029 | 39.2%
Latin | 8 | 0.3%

Most frequent character per script

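Hangul
Value | Count | Frequency (%)
(character not preserved) | 182 | 11.5%
(character not preserved) | 121 | 7.6%
(character not preserved) | 106 | 6.7%
(character not preserved) | 106 | 6.7%
(character not preserved) | 98 | 6.2%
(character not preserved) | 97 | 6.1%
(character not preserved) | 95 | 6.0%
(character not preserved) | 95 | 6.0%
(character not preserved) | 95 | 6.0%
(character not preserved) | 71 | 4.5%
Other values (103) | 520 | 32.8%

Common
Value | Count | Frequency (%)
(space) | 483 | 46.9%
1 | 108 | 10.5%
, | 74 | 7.2%
2 | 72 | 7.0%
3 | 68 | 6.6%
4 | 49 | 4.8%
5 | 31 | 3.0%
6 | 24 | 2.3%
0 | 23 | 2.2%
8 | 21 | 2.0%
Other values (6) | 76 | 7.4%

Latin
Value | Count | Frequency (%)
A | 3 | 37.5%
B | 3 | 37.5%
D | 1 | 12.5%
L | 1 | 12.5%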

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1586 | 60.5%
ASCII | 1037 | 39.5%

Most frequent character per block

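ASCII
Value | Count | Frequency (%)
(space) | 483 | 46.6%
1 | 108 | 10.4%
, | 74 | 7.1%
2 | 72 | 6.9%
3 | 68 | 6.6%
4 | 49 | 4.7%
5 | 31 | 3.0%
6 | 24 | 2.3%
0 | 23 | 2.2%
8 | 21 | 2.0%
Other values (10) | 84 | 8.1%

Hangul
Value | Count | Frequency (%)
(character not preserved) | 182 | 11.5%
(character not preserved) | 121 | 7.6%
(character not preserved) | 106 | 6.7%
(character not preserved) | 106 | 6.7%
(character not preserved) | 98 | 6.2%
(character not preserved) | 97 | 6.1%
(character not preserved) | 95 | 6.0%
(character not preserved) | 95 | 6.0%
(character not preserved) | 95 | 6.0%
(character not preserved) | 71 | 4.5%
Other values (103) | 520 | 32.8%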

Interactions


Correlations

Variable | Unnamed: 0 | 업소명 | 소재지(도로명)
Unnamed: 0 | 1.000 | 1.000 | 0.927
업소명 | 1.000 | 1.000 | 1.000
소재지(도로명) | 0.927 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
(index) | Unnamed: 0 | 업소명 | 소재지(도로명)
0 | 1 | (주)동래위생공사 | 부산광역시 기장군 장안읍 해맞이로 18, 1층
1 | 2 | 제이그린테크 | 부산광역시 기장군 장안읍 월내해안4길 9-1
2 | 3 | 경남상사 | 부산광역시 기장군 장안읍 월내1길 3
3 | 4 | 홍익환경산업 | 부산광역시 기장군 기장읍 차성로 314
4 | 5 | 명진개발 | 부산광역시 기장군 장안읍 월내해안4길 9-1, 2층
5 | 6 | 고성공영 | 부산광역시 기장군 기장읍 차성로 314
6 | 7 | 국일산업 | 부산광역시 기장군 장안읍 해맞이로 381-10
7 | 8 | (주)에이스플러스 | 부산광역시 기장군 기장읍 차성로418번길 14, 2층
8 | 9 | 대명엔지니어링 | 부산광역시 기장군 장안읍 길천1길 17
9 | 10 | (주)오에스산업개발 | 부산광역시 기장군 장안읍 해맞이로 173

Last rows
(index) | Unnamed: 0 | 업소명 | 소재지(도로명)
86 | 87 | 에스엠씨이앤아이티(주) | 부산광역시 기장군 장안읍 길천2길 33, 1층
87 | 88 | 주식회사 더깨끗한환경 | 부산광역시 기장군 기장읍 차성로249번길 8, 1층
88 | 89 | 주식회사 더율 시스템 | 부산광역시 기장군 기장읍 청강로43번길 36, 1층
89 | 90 | (주)다은유통 | 부산광역시 기장군 기장읍 반송로 1582, 화승주유소 2층
90 | 91 | (주)금하 | 부산광역시 기장군 정관읍 방곡1로 23-1, 1층
91 | 92 | (주)스쿨케어 | 부산광역시 기장군 장안읍 좌천5길 25, 장안그린빌 상가1층
92 | 93 | 깨끗한세상만들기유클린(주) | 부산광역시 기장군 기장읍 소정안길 68, 상가동 2층 201호 (내리휴먼시아)
93 | 94 | (주)한국토탈서비스 | 부산광역시 기장군 장안읍 장안산단3로 43, 301호
94 | 95 | 엔에스케어 | 부산광역시 기장군 정관읍 정관중앙로 45, 2층 205호
95 | 96 | 주식회사 파이브스타 | 부산광역시 기장군 장안읍 반룡산단1로 6-18, 1동 일부호