Overview

Dataset statistics

Number of variables: 5
Number of observations: 65
Missing cells: 22
Missing cells (%): 6.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.7 KiB
Average record size in memory: 43.0 B
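
The missing-cell percentage can be recomputed from the other figures in this table. The following is a minimal sketch of that arithmetic, assuming the percentage is taken over every cell (variables × observations); the variable names are illustrative only.

```python
# Figures copied from the Dataset statistics table.
n_variables = 5
n_observations = 65
missing_cells = 22

# Assumption: the denominator is the total number of cells in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```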

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

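Description: 부산광역시남구_건물위생관리업현황_20210802 (building sanitation management businesses in Nam-gu, Busan, as of 2021-08-02)
Author: 부산광역시 남구 (Nam-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15047966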

Alerts

업종명 has constant value "건물위생관리업" (Constant)
소재지전화 has 22 (33.8%) missing values (Missing)
연번 has unique values (Unique)
업소명 has unique values (Unique)
영업소 주소(도로명) has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 16:26:36.577798
Analysis finished: 2023-12-10 16:26:37.242402
Duration: 0.66 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
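
A report of this kind could be regenerated roughly as follows. This is a sketch, not the configuration actually used: the CSV path, the CP949 encoding, and the `build_report` helper are assumptions, and the real settings are in the config.json referenced here.

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html") -> Path:
    """Profile a CSV file and write the HTML report to out_path."""
    # Third-party imports stay inside the function so this module can be
    # imported even where pandas / ydata-profiling are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the source file is CP949-encoded; adjust the encoding
    # if the CSV was saved differently.
    df = pd.read_csv(csv_path, encoding="cp949")

    report = ProfileReport(
        df,
        title="부산광역시남구_건물위생관리업현황_20210802",
        # Dataset metadata as shown in the Dataset section of the report.
        dataset={
            "description": "부산광역시남구_건물위생관리업현황_20210802",
            "author": "부산광역시 남구",
            "url": "http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15047966",
        },
    )
    report.to_file(out_path)
    return Path(out_path)
```

Calling `build_report("data.csv")` with a hypothetical local copy of the dataset would write `report.html` next to the script.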

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 65
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 33
Minimum: 1
Maximum: 65
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 717.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.2
Q1: 17
Median: 33
Q3: 49
95-th percentile: 61.8
Maximum: 65
Range: 64
Interquartile range (IQR): 32

Descriptive statistics

Standard deviation: 18.90767
Coefficient of variation (CV): 0.57295971
Kurtosis: -1.2
Mean: 33
Median Absolute Deviation (MAD): 16
Skewness: 0
Sum: 2145
Variance: 357.5
Monotonicity: Strictly increasing
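
These figures are what one would expect if 연번 is simply the integers 1 through 65, which matches the reported minimum, maximum, distinct count, sum and monotonicity. The sketch below recomputes them under that assumption, using only the standard library; the `percentile` helper is illustrative and uses linear interpolation between neighbouring ranks.

```python
import statistics

# Assumption: 연번 holds the integers 1..65 in order.
values = list(range(1, 66))


def percentile(sorted_data, q):
    """Percentile with linear interpolation; q is a fraction in [0, 1]."""
    pos = q * (len(sorted_data) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_data) - 1)
    return sorted_data[lower] + (pos - lower) * (sorted_data[upper] - sorted_data[lower])


mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation
mad = statistics.median([abs(v - median) for v in values])

print(sum(values), mean, variance, round(std, 5), round(cv, 8), mad)
print(percentile(values, 0.05), percentile(values, 0.25), percentile(values, 0.95))
```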
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | 1.5%
50 | 1 | 1.5%
36 | 1 | 1.5%
37 | 1 | 1.5%
38 | 1 | 1.5%
39 | 1 | 1.5%
40 | 1 | 1.5%
41 | 1 | 1.5%
42 | 1 | 1.5%
43 | 1 | 1.5%
Other values (55) | 55 | 84.6%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.5%
2 | 1 | 1.5%
3 | 1 | 1.5%
4 | 1 | 1.5%
5 | 1 | 1.5%
6 | 1 | 1.5%
7 | 1 | 1.5%
8 | 1 | 1.5%
9 | 1 | 1.5%
10 | 1 | 1.5%

Maximum 10 values
Value | Count | Frequency (%)
65 | 1 | 1.5%
64 | 1 | 1.5%
63 | 1 | 1.5%
62 | 1 | 1.5%
61 | 1 | 1.5%
60 | 1 | 1.5%
59 | 1 | 1.5%
58 | 1 | 1.5%
57 | 1 | 1.5%
56 | 1 | 1.5%

업종명 (business type)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 652.0 B
건물위생관리업 (building sanitation management business): 65

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 건물위생관리업
2nd row: 건물위생관리업
3rd row: 건물위생관리업
4th row: 건물위생관리업
5th row: 건물위생관리업

Common Values

Value | Count | Frequency (%)
건물위생관리업 | 65 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
건물위생관리업 | 65 | 100.0%

업소명 (business name)
Text

UNIQUE 

Distinct: 65
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 652.0 B

Length

Max length: 21
Median length: 10
Mean length: 7.2923077
Min length: 3

Characters and Unicode

Total characters: 474
Distinct characters: 154
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
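
As an illustration of how such character properties can be read, the sketch below counts Unicode general categories with Python's standard `unicodedata` module over a few 업소명 values taken from this report. The `category_counts` helper is hypothetical, and the two-letter codes correspond to the category names used in the tables (for example `Lo` is Other Letter, `Zs` is Space Separator, `Ps`/`Pe` are Open/Close Punctuation). The standard library does not expose the script or block properties, so only categories are shown.

```python
import unicodedata
from collections import Counter

# A few sample values copied from the 업소명 column of this report.
names = ["올앤원 크린", "개미인력", "주식회사 벌떼에이취알", "(주)라다", "광진개발(주)"]


def category_counts(values):
    """Count Unicode general categories over all characters in values."""
    counts = Counter()
    for value in values:
        # Normalise to NFC so each Hangul syllable is a single code point.
        for ch in unicodedata.normalize("NFC", value):
            counts[unicodedata.category(ch)] += 1
    return counts


counts = category_counts(names)
for category, count in counts.most_common():
    print(category, count)
```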

Unique

Unique: 65
Unique (%): 100.0%

Sample

1st row: 올앤원 크린
2nd row: 개미인력
3rd row: 주식회사 벌떼에이취알
4th row: 월드시스템
5th row: 세영씨엔에스 주식회사

Value | Count | Frequency (%)
주식회사 | 14 | 16.5%
올앤원 | 1 | 1.2%
맨파워 | 1 | 1.2%
국보환경산업 | 1 | 1.2%
베스트원전력 | 1 | 1.2%
후드솔로몬 | 1 | 1.2%
모범청소 | 1 | 1.2%
종로전기(주 | 1 | 1.2%
주)구구환경공사 | 1 | 1.2%
아이에스 | 1 | 1.2%
Other values (62) | 62 | 72.9%
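
The counts in this table add up to 85 tokens (14 + 9 + 62), which is consistent with splitting each business name on whitespace: 14 / 85 ≈ 16.5% and 1 / 85 ≈ 1.2%. The sketch below reproduces that kind of word count on the ten names shown in the Sample section. It is only an approximation of the profiler's tokenizer, which also appears to trim some punctuation at token edges (for example `종로전기(주`).

```python
from collections import Counter

# Business names from the first ten sample rows of this report.
names = [
    "올앤원 크린", "개미인력", "주식회사 벌떼에이취알", "월드시스템",
    "세영씨엔에스 주식회사", "(주)라다", "주식회사 지엠네트웍스",
    "광진개발(주)", "에스에이씨 주식회사", "주식회사 기보메이트",
]

# Split every name on whitespace and count the resulting tokens.
tokens = [token for name in names for token in name.split()]
freq = Counter(tokens)
total = len(tokens)

for token, count in freq.most_common(3):
    print(f"{token}\t{count}\t{100 * count / total:.1f}%")
```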

Most occurring characters

ValueCountFrequency (%)
34
 
7.2%
20
 
4.2%
( 19
 
4.0%
) 19
 
4.0%
18
 
3.8%
16
 
3.4%
15
 
3.2%
13
 
2.7%
13
 
2.7%
10
 
2.1%
Other values (144) 297
62.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 409
86.3%
Space Separator 20
 
4.2%
Open Punctuation 19
 
4.0%
Close Punctuation 19
 
4.0%
Uppercase Letter 5
 
1.1%
Other Punctuation 1
 
0.2%
Other Symbol 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
8.3%
18
 
4.4%
16
 
3.9%
15
 
3.7%
13
 
3.2%
13
 
3.2%
10
 
2.4%
7
 
1.7%
7
 
1.7%
7
 
1.7%
Other values (134) 269
65.8%
Uppercase Letter
ValueCountFrequency (%)
B 1
20.0%
G 1
20.0%
C 1
20.0%
E 1
20.0%
P 1
20.0%
Space Separator
ValueCountFrequency (%)
20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 410
86.5%
Common 59
 
12.4%
Latin 5
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
8.3%
18
 
4.4%
16
 
3.9%
15
 
3.7%
13
 
3.2%
13
 
3.2%
10
 
2.4%
7
 
1.7%
7
 
1.7%
7
 
1.7%
Other values (135) 270
65.9%
Latin
ValueCountFrequency (%)
B 1
20.0%
G 1
20.0%
C 1
20.0%
E 1
20.0%
P 1
20.0%
Common
ValueCountFrequency (%)
20
33.9%
( 19
32.2%
) 19
32.2%
& 1
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 409
86.3%
ASCII 64
 
13.5%
None 1
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
34
 
8.3%
18
 
4.4%
16
 
3.9%
15
 
3.7%
13
 
3.2%
13
 
3.2%
10
 
2.4%
7
 
1.7%
7
 
1.7%
7
 
1.7%
Other values (134) 269
65.8%
ASCII
ValueCountFrequency (%)
20
31.2%
( 19
29.7%
) 19
29.7%
B 1
 
1.6%
& 1
 
1.6%
G 1
 
1.6%
C 1
 
1.6%
E 1
 
1.6%
P 1
 
1.6%
None
ValueCountFrequency (%)
1
100.0%

영업소 주소(도로명) (business address, road name)
Text

UNIQUE

Distinct: 65
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 652.0 B

Length

Max length: 52
Median length: 40
Mean length: 32.430769
Min length: 22

Characters and Unicode

Total characters: 2108
Distinct characters: 129
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 65
Unique (%): 100.0%

Sample

1st row: 부산광역시 남구 수영로13번길 16, 태광빌라 2층 101호 (문현동)
2nd row: 부산광역시 남구 고동골로 22, 1층 (문현동)
3rd row: 부산광역시 남구 자유평화로59번길 2, 늘송빌딩 2층 (문현동)
4th row: 부산광역시 남구 수영로39번가길 40-3, 1층 (문현동)
5th row: 부산광역시 남구 수영로 283, 102동 1401호 (대연동, 벽산솔렌스힐)

Value | Count | Frequency (%)
부산광역시 | 65 | 15.6%
남구 | 65 | 15.6%
대연동 | 28 | 6.7%
문현동 | 16 | 3.8%
2층 | 13 | 3.1%
1층 | 10 | 2.4%
신선로 | 8 | 1.9%
용당동 | 8 | 1.9%
4층 | 6 | 1.4%
수영로 | 6 | 1.4%
Other values (150) | 192 | 46.0%

Most occurring characters

ValueCountFrequency (%)
352
 
16.7%
1 94
 
4.5%
77
 
3.7%
2 73
 
3.5%
68
 
3.2%
68
 
3.2%
67
 
3.2%
) 67
 
3.2%
( 67
 
3.2%
66
 
3.1%
Other values (119) 1109
52.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1171
55.6%
Decimal Number 381
 
18.1%
Space Separator 352
 
16.7%
Close Punctuation 67
 
3.2%
Open Punctuation 67
 
3.2%
Other Punctuation 61
 
2.9%
Dash Punctuation 9
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
77
 
6.6%
68
 
5.8%
68
 
5.8%
67
 
5.7%
66
 
5.6%
65
 
5.6%
65
 
5.6%
65
 
5.6%
65
 
5.6%
49
 
4.2%
Other values (104) 516
44.1%
Decimal Number
ValueCountFrequency (%)
1 94
24.7%
2 73
19.2%
4 46
12.1%
3 42
11.0%
0 30
 
7.9%
6 26
 
6.8%
5 22
 
5.8%
8 18
 
4.7%
7 16
 
4.2%
9 14
 
3.7%
Space Separator
ValueCountFrequency (%)
352
100.0%
Close Punctuation
ValueCountFrequency (%)
) 67
100.0%
Open Punctuation
ValueCountFrequency (%)
( 67
100.0%
Other Punctuation
ValueCountFrequency (%)
, 61
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1171
55.6%
Common 937
44.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
77
 
6.6%
68
 
5.8%
68
 
5.8%
67
 
5.7%
66
 
5.6%
65
 
5.6%
65
 
5.6%
65
 
5.6%
65
 
5.6%
49
 
4.2%
Other values (104) 516
44.1%
Common
ValueCountFrequency (%)
352
37.6%
1 94
 
10.0%
2 73
 
7.8%
) 67
 
7.2%
( 67
 
7.2%
, 61
 
6.5%
4 46
 
4.9%
3 42
 
4.5%
0 30
 
3.2%
6 26
 
2.8%
Other values (5) 79
 
8.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1171
55.6%
ASCII 937
44.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
352
37.6%
1 94
 
10.0%
2 73
 
7.8%
) 67
 
7.2%
( 67
 
7.2%
, 61
 
6.5%
4 46
 
4.9%
3 42
 
4.5%
0 30
 
3.2%
6 26
 
2.8%
Other values (5) 79
 
8.4%
Hangul
ValueCountFrequency (%)
77
 
6.6%
68
 
5.8%
68
 
5.8%
67
 
5.7%
66
 
5.6%
65
 
5.6%
65
 
5.6%
65
 
5.6%
65
 
5.6%
49
 
4.2%
Other values (104) 516
44.1%

소재지전화 (business phone number)
Text

MISSING 

Distinct: 43
Distinct (%): 100.0%
Missing: 22
Missing (%): 33.8%
Memory size: 652.0 B
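
Note that the two percentages here appear to use different denominators: 22 / 65 ≈ 33.8% of all rows are missing, while the 43 distinct values are 100.0% of the 43 non-missing rows. The sketch below shows that convention on a handful of values from this report, with `None` standing in for `<NA>`; the `column_summary` helper is hypothetical.

```python
def column_summary(values):
    """Missing and distinct counts, with distinct % over non-missing values."""
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)
    n_distinct = len(set(present))
    return {
        "missing": n_missing,
        "missing_pct": 100 * n_missing / len(values),
        "distinct": n_distinct,
        # Assumption: distinct % is relative to non-missing values only.
        "distinct_pct": 100 * n_distinct / len(present) if present else 0.0,
    }


# Three phone numbers from the first sample rows plus two missing entries.
phones = ["070-7514-7889", "051-987-7890", "051-958-1919", None, None]
summary = column_summary(phones)
print(summary)
```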

Length

Max length: 13
Median length: 12
Mean length: 12.023256
Min length: 12

Characters and Unicode

Total characters: 517
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 43
Unique (%): 100.0%

Sample

1st row: 070-7514-7889
2nd row: 051-987-7890
3rd row: 051-958-1919
4th row: 051-931-0122
5th row: 051-927-0137

Value | Count | Frequency (%)
051-987-7890 | 1 | 2.3%
051-337-5511 | 1 | 2.3%
051-623-6090 | 1 | 2.3%
051-621-9183 | 1 | 2.3%
051-621-6999 | 1 | 2.3%
051-620-0551 | 1 | 2.3%
051-611-7567 | 1 | 2.3%
051-611-1002 | 1 | 2.3%
051-611-0022 | 1 | 2.3%
051-501-9752 | 1 | 2.3%
Other values (33) | 33 | 76.7%

Most occurring characters

ValueCountFrequency (%)
- 86
16.6%
0 78
15.1%
1 74
14.3%
5 72
13.9%
6 44
8.5%
2 36
7.0%
7 29
 
5.6%
9 27
 
5.2%
8 26
 
5.0%
4 26
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 431
83.4%
Dash Punctuation 86
 
16.6%
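
The character counts for 소재지전화 can be cross-checked against the length statistics. The sketch below uses only figures printed in this report (the per-digit counts come from the Decimal Number table): 43 non-missing values with two dashes each give 86 dashes, and digits plus dashes give the 517 total characters and the mean length of about 12.02.

```python
# Per-digit counts as listed in the Decimal Number table of this report.
digit_counts = {
    "0": 78, "1": 74, "5": 72, "6": 44, "2": 36,
    "7": 29, "9": 27, "8": 26, "4": 26, "3": 19,
}
dash_count = 86
n_values = 43  # non-missing phone numbers

total_digits = sum(digit_counts.values())
total_chars = total_digits + dash_count
mean_length = total_chars / n_values
dashes_per_value = dash_count / n_values

print(total_digits, total_chars, round(mean_length, 6), dashes_per_value)
```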

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 78
18.1%
1 74
17.2%
5 72
16.7%
6 44
10.2%
2 36
8.4%
7 29
 
6.7%
9 27
 
6.3%
8 26
 
6.0%
4 26
 
6.0%
3 19
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 86
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 517
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 86
16.6%
0 78
15.1%
1 74
14.3%
5 72
13.9%
6 44
8.5%
2 36
7.0%
7 29
 
5.6%
9 27
 
5.2%
8 26
 
5.0%
4 26
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 517
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 86
16.6%
0 78
15.1%
1 74
14.3%
5 72
13.9%
6 44
8.5%
2 36
7.0%
7 29
 
5.6%
9 27
 
5.2%
8 26
 
5.0%
4 26
 
5.0%

Interactions


Correlations

Variable | 연번 | 업소명 | 영업소 주소(도로명) | 소재지전화
연번 | 1.000 | 1.000 | 1.000 | 1.000
업소명 | 1.000 | 1.000 | 1.000 | 1.000
영업소 주소(도로명) | 1.000 | 1.000 | 1.000 | 1.000
소재지전화 | 1.000 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
(index) | 연번 | 업종명 | 업소명 | 영업소 주소(도로명) | 소재지전화
0 | 1 | 건물위생관리업 | 올앤원 크린 | 부산광역시 남구 수영로13번길 16, 태광빌라 2층 101호 (문현동) | 070-7514-7889
1 | 2 | 건물위생관리업 | 개미인력 | 부산광역시 남구 고동골로 22, 1층 (문현동) | 051-987-7890
2 | 3 | 건물위생관리업 | 주식회사 벌떼에이취알 | 부산광역시 남구 자유평화로59번길 2, 늘송빌딩 2층 (문현동) | 051-958-1919
3 | 4 | 건물위생관리업 | 월드시스템 | 부산광역시 남구 수영로39번가길 40-3, 1층 (문현동) | 051-931-0122
4 | 5 | 건물위생관리업 | 세영씨엔에스 주식회사 | 부산광역시 남구 수영로 283, 102동 1401호 (대연동, 벽산솔렌스힐) | 051-927-0137
5 | 6 | 건물위생관리업 | (주)라다 | 부산광역시 남구 유엔평화로3번길 42 (대연동,1층) | 051-919-2425
6 | 7 | 건물위생관리업 | 주식회사 지엠네트웍스 | 부산광역시 남구 수영로266번길 77, 센트럴지엠빌딩 2층 (대연동) | 051-806-8251
7 | 8 | 건물위생관리업 | 광진개발(주) | 부산광역시 남구 못골번영로 41, 1층 102호 (대연동) | 051-744-2564
8 | 9 | 건물위생관리업 | 에스에이씨 주식회사 | 부산광역시 남구 전포대로92번나길 14, 1층 (문현동) | 051-714-0050
9 | 10 | 건물위생관리업 | 주식회사 기보메이트 | 부산광역시 남구 전포대로 133, 48층 4802호 (문현동) | 051-710-6103

Last rows
(index) | 연번 | 업종명 | 업소명 | 영업소 주소(도로명) | 소재지전화
55 | 56 | 건물위생관리업 | 오케이환경 | 부산광역시 남구 홍곡로 15-6 (감만동) | <NA>
56 | 57 | 건물위생관리업 | 덕유개발(주) | 부산광역시 남구 신선로 433, 4층 (용당동) | <NA>
57 | 58 | 건물위생관리업 | 현대실업 | 부산광역시 남구 유엔평화로125번길 43 (대연동) | <NA>
58 | 59 | 건물위생관리업 | (주)한정기업 | 부산광역시 남구 우암로362번길 27 (문현동) | <NA>
59 | 60 | 건물위생관리업 | 에어몬 | 부산광역시 남구 신선로329번길 34-18, 1층 (용당동) | <NA>
60 | 61 | 건물위생관리업 | (주)테무진가드 | 부산광역시 남구 유엔로 138 (대연동) | <NA>
61 | 62 | 건물위생관리업 | 클린앤케어 | 부산광역시 남구 유엔로120번가길 19, 1층 (대연동) | <NA>
62 | 63 | 건물위생관리업 | 청소협동조합 청소하는사람들 부산경남본점 | 부산광역시 남구 수영로250번길 47, 살렘피아노과외교습소 지하1층(대연동) | <NA>
63 | 64 | 건물위생관리업 | 클린에어존 | 부산광역시 남구 신선로 417, 3층 301호 (용당동) | <NA>
64 | 65 | 건물위생관리업 | 선진솔루션 | 부산광역시 남구 수영로 312, 21센츄리시티 오피스텔 635,636호 (대연동) | <NA>