Overview

Dataset statistics

Number of variables: 4
Number of observations: 70
Missing cells: 35
Missing cells (%): 12.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.4 KiB
Average record size in memory: 34.9 B
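As a quick consistency check, the missing-cell percentage follows from the counts above: 35 missing cells out of 4 × 70 = 280 cells. A minimal sketch in plain Python (the variable names are illustrative and not part of the report):

```python
# Counts copied from the dataset statistics above.
n_variables = 4
n_observations = 70
missing_cells = 35

total_cells = n_variables * n_observations        # 280 cells in total
missing_pct = 100 * missing_cells / total_cells   # share of empty cells

print(f"{missing_pct:.1f}%")  # prints 12.5%
```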

Variable types

Numeric: 1
Text: 3

Dataset

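Description: 부산광역시부산진구_소독업소현황_20230412 (Busanjin-gu, Busan Metropolitan City: status of disinfection businesses, 2023-04-12)
Author: 부산광역시 부산진구 (Busanjin-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15080671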

Alerts

전화번호 (phone number) has 35 (50.0%) missing values [Missing]
연번 (serial number) has unique values [Unique]
상호 (business name) has unique values [Unique]

Reproduction

Analysis started: 2023-12-10 16:36:32.592727
Analysis finished: 2023-12-10 16:36:33.195249
Duration: 0.6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
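A report of this kind could be regenerated roughly as follows. This is a minimal sketch, not the exact procedure used here: the CSV path, the `cp949` encoding, the report title, and the helper name `build_report` are all assumptions, and the configuration stored in config.json is not reproduced.

```python
def build_report(csv_path, output_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so the function can be defined even where the
    # third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is a guess; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is a hypothetical label, not the one used in this report.
    profile = ProfileReport(df, title="Disinfection businesses in Busanjin-gu")
    profile.to_file(output_path)
    return output_path
```

Calling `build_report("path/to/file.csv")` with the downloaded file would write `report.html` to the working directory; timings and memory figures will differ from the ones listed above.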

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 70
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 35.5
Minimum: 1
Maximum: 70
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 762.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.45
Q1: 18.25
Median: 35.5
Q3: 52.75
95-th percentile: 66.55
Maximum: 70
Range: 69
Interquartile range (IQR): 34.5

Descriptive statistics

Standard deviation: 20.351085
Coefficient of variation (CV): 0.57327
Kurtosis: -1.2
Mean: 35.5
Median Absolute Deviation (MAD): 17.5
Skewness: 0
Sum: 2485
Variance: 414.16667
Monotonicity: Strictly increasing
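Most of these figures can be re-derived with the Python standard library. The sketch below assumes that 연번 is exactly the integers 1 to 70, which the minimum, maximum, distinct count and strict monotonicity suggest but the report does not state outright. It uses the sample variance (n - 1 denominator) and linear interpolation for the quantiles.

```python
import statistics

# Assumption: the column holds exactly the integers 1..70.
values = list(range(1, 71))

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation

# method="inclusive" interpolates linearly between the data points.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(sum(values), mean, round(variance, 5), round(std, 6), round(cv, 5))
print(p5, q1, median, q3, p95, q3 - q1, mad)
```

The printed values can be compared line by line with the quantile and descriptive statistics listed above.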
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 1.4%
46 | 1 | 1.4%
52 | 1 | 1.4%
51 | 1 | 1.4%
50 | 1 | 1.4%
49 | 1 | 1.4%
48 | 1 | 1.4%
47 | 1 | 1.4%
45 | 1 | 1.4%
54 | 1 | 1.4%
Other values (60) | 60 | 85.7%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.4%
2 | 1 | 1.4%
3 | 1 | 1.4%
4 | 1 | 1.4%
5 | 1 | 1.4%
6 | 1 | 1.4%
7 | 1 | 1.4%
8 | 1 | 1.4%
9 | 1 | 1.4%
10 | 1 | 1.4%

Maximum 10 values
Value | Count | Frequency (%)
70 | 1 | 1.4%
69 | 1 | 1.4%
68 | 1 | 1.4%
67 | 1 | 1.4%
66 | 1 | 1.4%
65 | 1 | 1.4%
64 | 1 | 1.4%
63 | 1 | 1.4%
62 | 1 | 1.4%
61 | 1 | 1.4%

상호 (business name)
Text

UNIQUE

Distinct: 70
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 692.0 B

Length

Max length: 12
Median length: 10
Mean length: 6
Min length: 3

Characters and Unicode

Total characters: 420
Distinct characters: 151
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
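To illustrate what such character properties look like, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. This is a stand-in for illustration and not necessarily how the profiler computes its tables; `unicodedata` does not expose scripts or blocks, so only categories are shown. The helper name `category_counts` is hypothetical, and the two names come from the sample rows of this report.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count Unicode general categories (e.g. 'Lo', 'So') in a string."""
    return Counter(unicodedata.category(ch) for ch in text)

# Lo = Other Letter, So = Other Symbol, Lu = Uppercase Letter, Zs = Space Separator
for name in ["㈜금성ENT", "주식회사 벌떼에이취알"]:
    print(name, dict(category_counts(name)))
```

Hangul syllables fall under Other Letter, the ㈜ sign under Other Symbol, and Latin capitals under Uppercase Letter, matching the category names used in the tables of this report.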

Unique

Unique: 70
Unique (%): 100.0%

Sample

1st row: 동원클린시스템
2nd row: 제이티케이시스템
3rd row: ㈜삼부컴퍼니
4th row: 비케이서비스
5th row: ㈜태창환경

Value | Count | Frequency (%)
주식회사 | 2 | 2.6%
㈜거린 | 1 | 1.3%
㈜크린케어 | 1 | 1.3%
㈜일신산업개발 | 1 | 1.3%
㈜원엔터프라이즈 | 1 | 1.3%
㈜씨티원 | 1 | 1.3%
㈜태협 | 1 | 1.3%
㈜비비드 | 1 | 1.3%
㈜소명환경 | 1 | 1.3%
부산경남청소 | 1 | 1.3%
Other values (67) | 67 | 85.9%

Most occurring characters

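Value | Count | Frequency (%)
㈜ | 31 | 7.4%
(lost) | 15 | 3.6%
(lost) | 15 | 3.6%
(lost) | 12 | 2.9%
(lost) | 12 | 2.9%
(lost) | 11 | 2.6%
(lost) | 9 | 2.1%
(lost) | 8 | 1.9%
(lost) | 8 | 1.9%
(lost) | 8 | 1.9%
Other values (141) | 291 | 69.3%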

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 365 | 86.9%
Other Symbol | 31 | 7.4%
Uppercase Letter | 14 | 3.3%
Space Separator | 8 | 1.9%
Other Punctuation | 1 | 0.2%
Decimal Number | 1 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(lost) | 15 | 4.1%
(lost) | 15 | 4.1%
(lost) | 12 | 3.3%
(lost) | 12 | 3.3%
(lost) | 11 | 3.0%
(lost) | 9 | 2.5%
(lost) | 8 | 2.2%
(lost) | 8 | 2.2%
(lost) | 7 | 1.9%
(lost) | 7 | 1.9%
Other values (126) | 261 | 71.5%

Uppercase Letter
Value | Count | Frequency (%)
E | 2 | 14.3%
T | 2 | 14.3%
C | 2 | 14.3%
H | 1 | 7.1%
G | 1 | 7.1%
S | 1 | 7.1%
A | 1 | 7.1%
P | 1 | 7.1%
N | 1 | 7.1%
M | 1 | 7.1%

Other Symbol
Value | Count | Frequency (%)
㈜ | 31 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 8 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%

Decimal Number
Value | Count | Frequency (%)
5 | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 396 | 94.3%
Latin | 14 | 3.3%
Common | 10 | 2.4%

Most frequent character per script

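Hangul
Value | Count | Frequency (%)
㈜ | 31 | 7.8%
(lost) | 15 | 3.8%
(lost) | 15 | 3.8%
(lost) | 12 | 3.0%
(lost) | 12 | 3.0%
(lost) | 11 | 2.8%
(lost) | 9 | 2.3%
(lost) | 8 | 2.0%
(lost) | 8 | 2.0%
(lost) | 7 | 1.8%
Other values (127) | 268 | 67.7%

Latin
Value | Count | Frequency (%)
E | 2 | 14.3%
T | 2 | 14.3%
C | 2 | 14.3%
H | 1 | 7.1%
G | 1 | 7.1%
S | 1 | 7.1%
A | 1 | 7.1%
P | 1 | 7.1%
N | 1 | 7.1%
M | 1 | 7.1%

Common
Value | Count | Frequency (%)
(space) | 8 | 80.0%
& | 1 | 10.0%
5 | 1 | 10.0%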

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 365 | 86.9%
None | 31 | 7.4%
ASCII | 24 | 5.7%

Most frequent character per block

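None
Value | Count | Frequency (%)
㈜ | 31 | 100.0%

Hangul
Value | Count | Frequency (%)
(lost) | 15 | 4.1%
(lost) | 15 | 4.1%
(lost) | 12 | 3.3%
(lost) | 12 | 3.3%
(lost) | 11 | 3.0%
(lost) | 9 | 2.5%
(lost) | 8 | 2.2%
(lost) | 8 | 2.2%
(lost) | 7 | 1.9%
(lost) | 7 | 1.9%
Other values (126) | 261 | 71.5%

ASCII
Value | Count | Frequency (%)
(space) | 8 | 33.3%
E | 2 | 8.3%
T | 2 | 8.3%
C | 2 | 8.3%
& | 1 | 4.2%
H | 1 | 4.2%
G | 1 | 4.2%
S | 1 | 4.2%
A | 1 | 4.2%
P | 1 | 4.2%
Other values (4) | 4 | 16.7%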

사업장소재지 (business address)
Text

Distinct: 67
Distinct (%): 95.7%
Missing: 0
Missing (%): 0.0%
Memory size: 692.0 B

Length

Max length: 49
Median length: 37
Mean length: 30.514286
Min length: 23

Characters and Unicode

Total characters: 2136
Distinct characters: 108
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 65
Unique (%): 92.9%

Sample

1st row: 부산광역시 부산진구 백양순환로 35번길 43,103호(당감동,태우선파크맨션)
2nd row: 부산광역시 부산진구 엄광로 176, 동의대학교 산학협력관 329호(가야동)
3rd row: 부산광역시 부산진구 양지로 11번길 16,대학빌딩 602호(양정동)
4th row: 부산광역시 부산진구 중앙대로 969번길 29-12,영진빌딩 402호(양정동)
5th row: 부산광역시 부산진구 부전로 196,부산전자종합상가 3층 38호(부전동)

Value | Count | Frequency (%)
부산광역시 | 70 | 20.1%
부산진구 | 70 | 20.1%
엄광로 | 6 | 1.7%
동평로 | 5 | 1.4%
동천로 | 4 | 1.1%
서전로 | 4 | 1.1%
중앙대로 | 4 | 1.1%
진남로 | 4 | 1.1%
새싹로 | 3 | 0.9%
116 | 3 | 0.9%
Other values (141) | 175 | 50.3%

Most occurring characters

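Value | Count | Frequency (%)
(space) | 278 | 13.0%
(lost) | 154 | 7.2%
(lost) | 143 | 6.7%
(lost) | 89 | 4.2%
1 | 84 | 3.9%
(lost) | 76 | 3.6%
(lost) | 76 | 3.6%
(lost) | 71 | 3.3%
(lost) | 71 | 3.3%
(lost) | 71 | 3.3%
Other values (98) | 1023 | 47.9%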

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1316 | 61.6%
Decimal Number | 348 | 16.3%
Space Separator | 278 | 13.0%
Open Punctuation | 71 | 3.3%
Close Punctuation | 70 | 3.3%
Other Punctuation | 41 | 1.9%
Dash Punctuation | 12 | 0.6%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(lost) | 154 | 11.7%
(lost) | 143 | 10.9%
(lost) | 89 | 6.8%
(lost) | 76 | 5.8%
(lost) | 76 | 5.8%
(lost) | 71 | 5.4%
(lost) | 71 | 5.4%
(lost) | 71 | 5.4%
(lost) | 70 | 5.3%
(lost) | 35 | 2.7%
Other values (83) | 460 | 35.0%

Decimal Number
Value | Count | Frequency (%)
1 | 84 | 24.1%
2 | 54 | 15.5%
6 | 41 | 11.8%
3 | 32 | 9.2%
0 | 25 | 7.2%
7 | 25 | 7.2%
5 | 24 | 6.9%
4 | 24 | 6.9%
9 | 20 | 5.7%
8 | 19 | 5.5%

Space Separator
Value | Count | Frequency (%)
(space) | 278 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 71 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 70 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 41 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 12 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1316 | 61.6%
Common | 820 | 38.4%

Most frequent character per script

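Hangul
Value | Count | Frequency (%)
(lost) | 154 | 11.7%
(lost) | 143 | 10.9%
(lost) | 89 | 6.8%
(lost) | 76 | 5.8%
(lost) | 76 | 5.8%
(lost) | 71 | 5.4%
(lost) | 71 | 5.4%
(lost) | 71 | 5.4%
(lost) | 70 | 5.3%
(lost) | 35 | 2.7%
Other values (83) | 460 | 35.0%

Common
Value | Count | Frequency (%)
(space) | 278 | 33.9%
1 | 84 | 10.2%
( | 71 | 8.7%
) | 70 | 8.5%
2 | 54 | 6.6%
, | 41 | 5.0%
6 | 41 | 5.0%
3 | 32 | 3.9%
0 | 25 | 3.0%
7 | 25 | 3.0%
Other values (5) | 99 | 12.1%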

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1316 | 61.6%
ASCII | 820 | 38.4%

Most frequent character per block

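ASCII
Value | Count | Frequency (%)
(space) | 278 | 33.9%
1 | 84 | 10.2%
( | 71 | 8.7%
) | 70 | 8.5%
2 | 54 | 6.6%
, | 41 | 5.0%
6 | 41 | 5.0%
3 | 32 | 3.9%
0 | 25 | 3.0%
7 | 25 | 3.0%
Other values (5) | 99 | 12.1%

Hangul
Value | Count | Frequency (%)
(lost) | 154 | 11.7%
(lost) | 143 | 10.9%
(lost) | 89 | 6.8%
(lost) | 76 | 5.8%
(lost) | 76 | 5.8%
(lost) | 71 | 5.4%
(lost) | 71 | 5.4%
(lost) | 71 | 5.4%
(lost) | 70 | 5.3%
(lost) | 35 | 2.7%
Other values (83) | 460 | 35.0%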

전화번호 (phone number)
Text

MISSING

Distinct: 34
Distinct (%): 97.1%
Missing: 35
Missing (%): 50.0%
Memory size: 692.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.028571
Min length: 12

Characters and Unicode

Total characters: 421
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 33
Unique (%): 94.3%

Sample

1st row: 051-646-1942
2nd row: 051-867-2003
3rd row: 051-503-6237
4th row: 051-896-7601
5th row: 051-808-4306

Value | Count | Frequency (%)
051-863-7991 | 2 | 5.7%
051-804-7475 | 1 | 2.9%
051-805-7722 | 1 | 2.9%
051-807-2124 | 1 | 2.9%
051-807-5460 | 1 | 2.9%
051-807-1413 | 1 | 2.9%
051-803-9898 | 1 | 2.9%
051-818-1381 | 1 | 2.9%
051-817-7750 | 1 | 2.9%
051-638-9934 | 1 | 2.9%
Other values (24) | 24 | 68.6%
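The values shown follow the pattern 051-XXX-XXXX (12 characters). The reported maximum length of 13 would be consistent with a single number that has a four-digit middle group, although the report does not show that value. A minimal validation sketch under that assumption; the pattern, the helper name `is_valid_phone`, and the malformed last sample are illustrative and not taken from the dataset:

```python
import re

# Assumed format: area code 051, a 3- or 4-digit middle group, a 4-digit line number.
PHONE_RE = re.compile(r"051-\d{3,4}-\d{4}")

def is_valid_phone(value):
    """Return True if value is a string that fully matches the assumed format."""
    return isinstance(value, str) and PHONE_RE.fullmatch(value) is not None

# Two sample rows from the report, a missing value, and a made-up malformed entry.
samples = ["051-646-1942", "051-867-2003", None, "051-8637991"]
print([is_valid_phone(s) for s in samples])  # prints [True, True, False, False]
```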

Most occurring characters

Value | Count | Frequency (%)
0 | 70 | 16.6%
- | 70 | 16.6%
1 | 59 | 14.0%
5 | 55 | 13.1%
8 | 36 | 8.6%
7 | 29 | 6.9%
6 | 27 | 6.4%
3 | 25 | 5.9%
9 | 18 | 4.3%
4 | 18 | 4.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 351 | 83.4%
Dash Punctuation | 70 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 70 | 19.9%
1 | 59 | 16.8%
5 | 55 | 15.7%
8 | 36 | 10.3%
7 | 29 | 8.3%
6 | 27 | 7.7%
3 | 25 | 7.1%
9 | 18 | 5.1%
4 | 18 | 5.1%
2 | 14 | 4.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 70 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 421 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 70 | 16.6%
- | 70 | 16.6%
1 | 59 | 14.0%
5 | 55 | 13.1%
8 | 36 | 8.6%
7 | 29 | 6.9%
6 | 27 | 6.4%
3 | 25 | 5.9%
9 | 18 | 4.3%
4 | 18 | 4.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 421 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 70 | 16.6%
- | 70 | 16.6%
1 | 59 | 14.0%
5 | 55 | 13.1%
8 | 36 | 8.6%
7 | 29 | 6.9%
6 | 27 | 6.4%
3 | 25 | 5.9%
9 | 18 | 4.3%
4 | 18 | 4.3%

Interactions


Correlations

Variable | 연번 | 상호 | 사업장소재지 | 전화번호
연번 | 1.000 | 1.000 | 0.794 | 0.941
상호 | 1.000 | 1.000 | 1.000 | 1.000
사업장소재지 | 0.794 | 1.000 | 1.000 | 0.987
전화번호 | 0.941 | 1.000 | 0.987 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 연번 | 상호 | 사업장소재지 | 전화번호
0 | 1 | 동원클린시스템 | 부산광역시 부산진구 백양순환로 35번길 43,103호(당감동,태우선파크맨션) | <NA>
1 | 2 | 제이티케이시스템 | 부산광역시 부산진구 엄광로 176, 동의대학교 산학협력관 329호(가야동) | <NA>
2 | 3 | ㈜삼부컴퍼니 | 부산광역시 부산진구 양지로 11번길 16,대학빌딩 602호(양정동) | <NA>
3 | 4 | 비케이서비스 | 부산광역시 부산진구 중앙대로 969번길 29-12,영진빌딩 402호(양정동) | <NA>
4 | 5 | ㈜태창환경 | 부산광역시 부산진구 부전로 196,부산전자종합상가 3층 38호(부전동) | 051-646-1942
5 | 6 | 주식회사 벌떼에이취알 | 부산광역시 부산진구 거제대로 16-4 3층 303호(양정동) | <NA>
6 | 7 | 자바코공사 | 부산광역시 부산진구 진연로 9번길 22-10(양정동) | 051-867-2003
7 | 8 | ㈜에덴종합관리 | 부산광역시 부산진구 성지곡로 29-2, 1층(초읍동) | <NA>
8 | 9 | 주식회사 이엔지플러스 | 부산광역시 부산진구 개금본동로 17,3층(개금동) | <NA>
9 | 10 | 하나방역 남부산지점 | 부산광역시 부산진구 엄광로 68,220호(가야동,가야벽산아파트) | <NA>

Last rows
Index | 연번 | 상호 | 사업장소재지 | 전화번호
60 | 61 | ㈜원봉 | 부산광역시 부산진구 거제대로 36번길 26(양정동) | 051-863-7991
61 | 62 | 도시환경 | 부산광역시 부산진구 서전로 67번길 15,1층(전포동) | <NA>
62 | 63 | ㈜고려그린산업 | 부산광역시 부산진구 신암로 70,제상가비동 107,108호(범천동) | 051-803-9898
63 | 64 | 한국특수기업 | 부산광역시 부산진구 새싹로 207-6,57호(초읍동) | 051-807-1413
64 | 65 | 가가수조방역 | 부산광역시 부산진구 동평로 291번길 61(연지동) | 051-807-5460
65 | 66 | ㈜한특 | 부산광역시 부산진구 중앙대로 993,시청역롯데골드로즈 811호(양정동) | 051-807-2124
66 | 67 | 엔가드부산극동산업 | 부산광역시 부산진구 진남로 581,2층(양정동) | <NA>
67 | 68 | ㈜한국서비스 | 부산광역시 부산진구 동천로 116, 한신밴 1201호(전포동) | 051-805-7722
68 | 69 | ㈜금성ENT | 부산광역시 부산진구 전포대로 132번길 12(전포동) | <NA>
69 | 70 | ㈜백송기업 | 부산광역시 부산진구 복지로 21번길 13(개금동) | 051-959-3200