Overview

Dataset statistics

Number of variables5
Number of observations70
Missing cells26
Missing cells (%)7.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.9 KiB
Average record size in memory42.9 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description부산광역시남구_건물위생관리업현황_20230510
Author부산광역시 남구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15047966

Alerts

업종명 has constant value ""Constant
소재지전화 has 26 (37.1%) missing valuesMissing
연번 has unique valuesUnique
업소명 has unique valuesUnique
영업소 주소(도로명) has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:26:30.718002
Analysis finished2023-12-10 16:26:31.355349
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct70
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.5
Minimum1
Maximum70
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size762.0 B
2023-12-11T01:26:31.455612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.45
Q118.25
median35.5
Q352.75
95-th percentile66.55
Maximum70
Range69
Interquartile range (IQR)34.5

Descriptive statistics

Standard deviation20.351085
Coefficient of variation (CV)0.57327
Kurtosis-1.2
Mean35.5
Median Absolute Deviation (MAD)17.5
Skewness0
Sum2485
Variance414.16667
MonotonicityStrictly increasing
2023-12-11T01:26:31.637177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.4%
46 1
 
1.4%
52 1
 
1.4%
51 1
 
1.4%
50 1
 
1.4%
49 1
 
1.4%
48 1
 
1.4%
47 1
 
1.4%
45 1
 
1.4%
54 1
 
1.4%
Other values (60) 60
85.7%
ValueCountFrequency (%)
1 1
1.4%
2 1
1.4%
3 1
1.4%
4 1
1.4%
5 1
1.4%
6 1
1.4%
7 1
1.4%
8 1
1.4%
9 1
1.4%
10 1
1.4%
ValueCountFrequency (%)
70 1
1.4%
69 1
1.4%
68 1
1.4%
67 1
1.4%
66 1
1.4%
65 1
1.4%
64 1
1.4%
63 1
1.4%
62 1
1.4%
61 1
1.4%

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size692.0 B
건물위생관리업
70 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건물위생관리업
2nd row건물위생관리업
3rd row건물위생관리업
4th row건물위생관리업
5th row건물위생관리업

Common Values

ValueCountFrequency (%)
건물위생관리업 70
100.0%

Length

2023-12-11T01:26:31.811037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:26:31.934822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건물위생관리업 70
100.0%

업소명
Text

UNIQUE 

Distinct70
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size692.0 B
2023-12-11T01:26:32.193819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length11
Mean length7.4142857
Min length3

Characters and Unicode

Total characters519
Distinct characters151
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)100.0%

Sample

1st row(주)동해환경
2nd row(주)대평
3rd row미진기업(주)
4th row코리아멘파워
5th row(주)미성기업
ValueCountFrequency (%)
주식회사 14
 
15.6%
미래공조시스템 1
 
1.1%
부산경남본점 1
 
1.1%
청소하는사람들 1
 
1.1%
청소협동조합 1
 
1.1%
기보메이트 1
 
1.1%
케이피엠 1
 
1.1%
부산남구시니어클럽(청소 1
 
1.1%
에스에이씨 1
 
1.1%
거손환경산업 1
 
1.1%
Other values (67) 67
74.4%
2023-12-11T01:26:32.797506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
8.1%
) 26
 
5.0%
( 26
 
5.0%
20
 
3.9%
18
 
3.5%
17
 
3.3%
16
 
3.1%
16
 
3.1%
13
 
2.5%
10
 
1.9%
Other values (141) 315
60.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 444
85.5%
Close Punctuation 26
 
5.0%
Open Punctuation 26
 
5.0%
Space Separator 20
 
3.9%
Uppercase Letter 2
 
0.4%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
9.5%
18
 
4.1%
17
 
3.8%
16
 
3.6%
16
 
3.6%
13
 
2.9%
10
 
2.3%
9
 
2.0%
8
 
1.8%
8
 
1.8%
Other values (135) 287
64.6%
Uppercase Letter
ValueCountFrequency (%)
G 1
50.0%
B 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Space Separator
ValueCountFrequency (%)
20
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 444
85.5%
Common 73
 
14.1%
Latin 2
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
9.5%
18
 
4.1%
17
 
3.8%
16
 
3.6%
16
 
3.6%
13
 
2.9%
10
 
2.3%
9
 
2.0%
8
 
1.8%
8
 
1.8%
Other values (135) 287
64.6%
Common
ValueCountFrequency (%)
) 26
35.6%
( 26
35.6%
20
27.4%
& 1
 
1.4%
Latin
ValueCountFrequency (%)
G 1
50.0%
B 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 444
85.5%
ASCII 75
 
14.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
9.5%
18
 
4.1%
17
 
3.8%
16
 
3.6%
16
 
3.6%
13
 
2.9%
10
 
2.3%
9
 
2.0%
8
 
1.8%
8
 
1.8%
Other values (135) 287
64.6%
ASCII
ValueCountFrequency (%)
) 26
34.7%
( 26
34.7%
20
26.7%
G 1
 
1.3%
& 1
 
1.3%
B 1
 
1.3%
Distinct70
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size692.0 B
2023-12-11T01:26:33.164539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length39
Mean length32.671429
Min length22

Characters and Unicode

Total characters2287
Distinct characters122
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)100.0%

Sample

1st row부산광역시 남구 진남로 207 (문현동)
2nd row부산광역시 남구 신선로301번길 12, 4층 (용당동)
3rd row부산광역시 남구 자성로 152, 한일오피스텔 1822호 (문현동)
4th row부산광역시 남구 황령대로492번길 24, 5층 502호 (대연동)
5th row부산광역시 남구 수영로 12, 207호 (문현동, 세종그랑시아아파트)
ValueCountFrequency (%)
부산광역시 70
 
15.4%
남구 70
 
15.4%
대연동 32
 
7.0%
문현동 14
 
3.1%
2층 12
 
2.6%
1층 11
 
2.4%
용당동 9
 
2.0%
신선로 8
 
1.8%
수영로 7
 
1.5%
지하1층 6
 
1.3%
Other values (153) 216
47.5%
2023-12-11T01:26:33.796943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
385
 
16.8%
1 96
 
4.2%
83
 
3.6%
74
 
3.2%
74
 
3.2%
2 74
 
3.2%
72
 
3.1%
) 72
 
3.1%
( 72
 
3.1%
71
 
3.1%
Other values (112) 1214
53.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1271
55.6%
Decimal Number 412
 
18.0%
Space Separator 385
 
16.8%
Close Punctuation 72
 
3.1%
Open Punctuation 72
 
3.1%
Other Punctuation 66
 
2.9%
Dash Punctuation 9
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
83
 
6.5%
74
 
5.8%
74
 
5.8%
72
 
5.7%
71
 
5.6%
70
 
5.5%
70
 
5.5%
70
 
5.5%
70
 
5.5%
50
 
3.9%
Other values (97) 567
44.6%
Decimal Number
ValueCountFrequency (%)
1 96
23.3%
2 74
18.0%
4 53
12.9%
3 46
11.2%
0 38
 
9.2%
6 28
 
6.8%
5 22
 
5.3%
8 20
 
4.9%
7 18
 
4.4%
9 17
 
4.1%
Space Separator
ValueCountFrequency (%)
385
100.0%
Close Punctuation
ValueCountFrequency (%)
) 72
100.0%
Open Punctuation
ValueCountFrequency (%)
( 72
100.0%
Other Punctuation
ValueCountFrequency (%)
, 66
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1271
55.6%
Common 1016
44.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
83
 
6.5%
74
 
5.8%
74
 
5.8%
72
 
5.7%
71
 
5.6%
70
 
5.5%
70
 
5.5%
70
 
5.5%
70
 
5.5%
50
 
3.9%
Other values (97) 567
44.6%
Common
ValueCountFrequency (%)
385
37.9%
1 96
 
9.4%
2 74
 
7.3%
) 72
 
7.1%
( 72
 
7.1%
, 66
 
6.5%
4 53
 
5.2%
3 46
 
4.5%
0 38
 
3.7%
6 28
 
2.8%
Other values (5) 86
 
8.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1271
55.6%
ASCII 1016
44.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
385
37.9%
1 96
 
9.4%
2 74
 
7.3%
) 72
 
7.1%
( 72
 
7.1%
, 66
 
6.5%
4 53
 
5.2%
3 46
 
4.5%
0 38
 
3.7%
6 28
 
2.8%
Other values (5) 86
 
8.5%
Hangul
ValueCountFrequency (%)
83
 
6.5%
74
 
5.8%
74
 
5.8%
72
 
5.7%
71
 
5.6%
70
 
5.5%
70
 
5.5%
70
 
5.5%
70
 
5.5%
50
 
3.9%
Other values (97) 567
44.6%

소재지전화
Text

MISSING 

Distinct44
Distinct (%)100.0%
Missing26
Missing (%)37.1%
Memory size692.0 B
2023-12-11T01:26:34.082061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.977273
Min length13

Characters and Unicode

Total characters615
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)100.0%

Sample

1st row 051- 643-8225
2nd row051 -337 -5511
3rd row 051- 466-9403
4th row 051- 802-7036
5th row 051- 634-0134
ValueCountFrequency (%)
051 42
35.6%
626 2
 
1.7%
628 2
 
1.7%
643 2
 
1.7%
624 2
 
1.7%
621 2
 
1.7%
611 2
 
1.7%
7514-7889 1
 
0.8%
8800 1
 
0.8%
632 1
 
0.8%
Other values (61) 61
51.7%
2023-12-11T01:26:34.552763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 88
14.3%
85
13.8%
1 79
12.8%
0 77
12.5%
5 72
11.7%
6 47
7.6%
2 36
5.9%
4 32
 
5.2%
7 29
 
4.7%
3 24
 
3.9%
Other values (2) 46
7.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 442
71.9%
Dash Punctuation 88
 
14.3%
Space Separator 85
 
13.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 79
17.9%
0 77
17.4%
5 72
16.3%
6 47
10.6%
2 36
8.1%
4 32
7.2%
7 29
 
6.6%
3 24
 
5.4%
8 23
 
5.2%
9 23
 
5.2%
Dash Punctuation
ValueCountFrequency (%)
- 88
100.0%
Space Separator
ValueCountFrequency (%)
85
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 615
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 88
14.3%
85
13.8%
1 79
12.8%
0 77
12.5%
5 72
11.7%
6 47
7.6%
2 36
5.9%
4 32
 
5.2%
7 29
 
4.7%
3 24
 
3.9%
Other values (2) 46
7.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 615
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 88
14.3%
85
13.8%
1 79
12.8%
0 77
12.5%
5 72
11.7%
6 47
7.6%
2 36
5.9%
4 32
 
5.2%
7 29
 
4.7%
3 24
 
3.9%
Other values (2) 46
7.5%

Interactions

2023-12-11T01:26:31.003132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:26:34.706508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업소명영업소 주소(도로명)소재지전화
연번1.0001.0001.0001.000
업소명1.0001.0001.0001.000
영업소 주소(도로명)1.0001.0001.0001.000
소재지전화1.0001.0001.0001.000

Missing values

2023-12-11T01:26:31.159013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:26:31.306025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종명업소명영업소 주소(도로명)소재지전화
01건물위생관리업(주)동해환경부산광역시 남구 진남로 207 (문현동)051- 643-8225
12건물위생관리업(주)대평부산광역시 남구 신선로301번길 12, 4층 (용당동)051 -337 -5511
23건물위생관리업미진기업(주)부산광역시 남구 자성로 152, 한일오피스텔 1822호 (문현동)051- 466-9403
34건물위생관리업코리아멘파워부산광역시 남구 황령대로492번길 24, 5층 502호 (대연동)051- 802-7036
45건물위생관리업(주)미성기업부산광역시 남구 수영로 12, 207호 (문현동, 세종그랑시아아파트)051- 634-0134
56건물위생관리업주식회사 혜강종합관리부산광역시 남구 유엔평화로41번가길 42, 3층 (대연동)051- 897-2784
67건물위생관리업주식회사 성화에스씨부산광역시 남구 신선로 102 (감만동)051- 640-5851
78건물위생관리업(주)구구환경공사부산광역시 남구 우암로154번길 57 (우암동)<NA>
89건물위생관리업주식회사 보승부산광역시 남구 지게골로 101-22, 4층 (문현동)051 -643 -9365
910건물위생관리업(주)미화실업부산광역시 남구 유엔평화로4번길 61 (대연동)051 -469 -0900
연번업종명업소명영업소 주소(도로명)소재지전화
6061건물위생관리업서진크린 메이드부산광역시 남구 유엔평화로13번길 55, 4층 401호 (대연동)051 -755 -0795
6162건물위생관리업클린앤케어부산광역시 남구 유엔로120번가길 19, 1층 (대연동)<NA>
6263건물위생관리업클린구조대부산광역시 남구 우암로2번길 30, 상가동 1층 102호 (감만동, 감만 현대3차아파트)051- 711-1131
6364건물위생관리업부경하이택 주식회사부산광역시 남구 천제등로28번길 45, 지하1층 (대연동)051- 643-1476
6465건물위생관리업(주)상동실업부산광역시 남구 못골번영로40번길 19, 2층 (대연동)<NA>
6566건물위생관리업주식회사 제이앤케이에너지부산광역시 남구 수영로 312, 21 센츄리시티 오피스텔 724호 (대연동)<NA>
6667건물위생관리업제로킬부산광역시 남구 동명로146번길 132, 지하1층 (용호동)<NA>
6768건물위생관리업(주)네오하이텍부산광역시 남구 동명로 26, 현대아이파크 108동 304호 (용당동)<NA>
6869건물위생관리업미노시스템부산광역시 남구 수영로 298, 산암빌딩 1001-393호 (대연동)070-4917-2131
6970건물위생관리업주식회사 주안시스템부산광역시 남구 유엔평화로41번가길 42, 4층 (대연동)<NA>