Overview

Dataset statistics

Number of variables: 5
Number of observations: 48
Missing cells: 1
Missing cells (%): 0.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.0 KiB
Average record size in memory: 42.8 B
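
The percentage and per-record figures above can be rechecked from the counts in the same block. This is a minimal sketch, assuming the displayed 2.0 KiB is itself a rounded value, so the recomputed record size lands near the reported 42.8 B rather than exactly on it:

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 5
n_observations = 48
missing_cells = 1
total_size_bytes = 2.0 * 1024  # "2.0 KiB" as displayed (already rounded)

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_size_bytes / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: about {avg_record_bytes:.1f} B")
```

With one missing cell out of 240, the share rounds to 0.4%, matching the report.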

Variable types

Text: 4
Categorical: 1

Dataset

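Description: 부산광역시연제구_물가안정착한가격모범업소현황_20200928 (Busan Metropolitan City, Yeonje-gu: status of "Good Price" model businesses for price stabilization, 2020-09-28)
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15067671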

Alerts

연락처 (contact number) has 1 (2.1%) missing values [Missing]
업소명 (business name) has unique values [Unique]
새주소 (new road-name address) has unique values [Unique]

Reproduction

Analysis started: 2023-12-10 16:29:45.006266
Analysis finished: 2023-12-10 16:29:45.571726
Duration: 0.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
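
A report of this kind can in principle be regenerated with the `ProfileReport` class from ydata-profiling. The sketch below is an assumption about how that might look, not the configuration actually used for this report: the function name `build_report`, the CSV path, the encoding, and the choice to read every column as text are all hypothetical.

```python
def build_report(csv_path, out_path="report.html", encoding="utf-8"):
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so that defining this function does not require the
    # third-party packages to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: read every column as text so that phone numbers and dates
    # are not converted to numeric or datetime types.
    df = pd.read_csv(csv_path, encoding=encoding, dtype=str)

    report = ProfileReport(df, title="Profile of " + csv_path)
    report.to_file(out_path)
    return out_path

# Example call (hypothetical file name and encoding):
# build_report("yeonje_good_price_shops.csv", "report.html", encoding="cp949")
```

The encoding is left as a parameter because the source file's encoding is not stated in this report.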

Variables

업소명 (business name)
Text

UNIQUE 

Distinct: 48
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 516.0 B

Length

Max length: 9
Median length: 8
Mean length: 4.8125
Min length: 2

Characters and Unicode

Total characters: 231
Distinct characters: 137
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
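
As a rough illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for values quoted in this report. The `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative names, and scripts and blocks are left out because the standard library does not expose them.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# The five sample values shown for 업소명 in this report: Hangul only.
names = ["청담골", "우정", "현대탕", "책임밧데리", "참추어탕"]
print(category_counts(names))

# One address from the report's sample rows, to show mixed categories.
address = "고분로 200, 208호 (연산9동)"
print(category_counts([address]))
```

Hangul syllables fall under "Other Letter", which is why a name-only column reports a single category while an address column reports several.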

Unique

Unique: 48
Unique (%): 100.0%

Sample

1st row: 청담골
2nd row: 우정
3rd row: 현대탕
4th row: 책임밧데리
5th row: 참추어탕

Value | Count | Frequency (%)
청담골 | 1 | 2.1%
우정 | 1 | 2.1%
포항식당 | 1 | 2.1%
할매보쌈 | 1 | 2.1%
밥심 | 1 | 2.1%
미진반점 | 1 | 2.1%
전설의노가리 | 1 | 2.1%
도리오헤어 | 1 | 2.1%
태양세탁 | 1 | 2.1%
보거스 | 1 | 2.1%
Other values (38) | 38 | 79.2%

Most occurring characters

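Value | Count | Frequency (%)
(Hangul character) | 9 | 3.9%
(Hangul character) | 6 | 2.6%
(Hangul character) | 5 | 2.2%
(Hangul character) | 5 | 2.2%
(Hangul character) | 4 | 1.7%
(Hangul character) | 4 | 1.7%
(Hangul character) | 3 | 1.3%
(Hangul character) | 3 | 1.3%
(Hangul character) | 3 | 1.3%
(Hangul character) | 3 | 1.3%
Other values (127) | 186 | 80.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 231 | 100.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(Hangul character) | 9 | 3.9%
(Hangul character) | 6 | 2.6%
(Hangul character) | 5 | 2.2%
(Hangul character) | 5 | 2.2%
(Hangul character) | 4 | 1.7%
(Hangul character) | 4 | 1.7%
(Hangul character) | 3 | 1.3%
(Hangul character) | 3 | 1.3%
(Hangul character) | 3 | 1.3%
(Hangul character) | 3 | 1.3%
Other values (127) | 186 | 80.5%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 231 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(Hangul character) | 9 | 3.9%
(Hangul character) | 6 | 2.6%
(Hangul character) | 5 | 2.2%
(Hangul character) | 5 | 2.2%
(Hangul character) | 4 | 1.7%
(Hangul character) | 4 | 1.7%
(Hangul character) | 3 | 1.3%
(Hangul character) | 3 | 1.3%
(Hangul character) | 3 | 1.3%
(Hangul character) | 3 | 1.3%
Other values (127) | 186 | 80.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 231 | 100.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(Hangul character) | 9 | 3.9%
(Hangul character) | 6 | 2.6%
(Hangul character) | 5 | 2.2%
(Hangul character) | 5 | 2.2%
(Hangul character) | 4 | 1.7%
(Hangul character) | 4 | 1.7%
(Hangul character) | 3 | 1.3%
(Hangul character) | 3 | 1.3%
(Hangul character) | 3 | 1.3%
(Hangul character) | 3 | 1.3%
Other values (127) | 186 | 80.5%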

지정일자 (designation date)
Categorical

Distinct: 8
Distinct (%): 16.7%
Missing: 0
Missing (%): 0.0%
Memory size: 516.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2015-02-26
2nd row: 2016-07-31
3rd row: 2017-03-01
4th row: 2019-04-01
5th row: 2013-06-18

Common Values

Value | Count | Frequency (%)
2017-03-01 | 13 | 27.1%
2019-04-01 | 12 | 25.0%
2016-07-31 | 6 | 12.5%
2015-02-26 | 4 | 8.3%
2013-06-18 | 4 | 8.3%
2012-06-18 | 4 | 8.3%
2020-03-25 | 3 | 6.2%
2014-02-14 | 2 | 4.2%
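
The counts and percentages in this table, together with the Distinct and Unique figures reported for 지정일자, can be rederived from a column with the same value distribution. The sketch below rebuilds such a column from the reported counts; the variable names are illustrative and the original row order is not preserved.

```python
from collections import Counter

# Counts per designation date, copied from the Common Values table.
reported_counts = {
    "2017-03-01": 13,
    "2019-04-01": 12,
    "2016-07-31": 6,
    "2015-02-26": 4,
    "2013-06-18": 4,
    "2012-06-18": 4,
    "2020-03-25": 3,
    "2014-02-14": 2,
}

# Rebuild a column with the same distribution of values.
column = [date for date, n in reported_counts.items() for _ in range(n)]

counts = Counter(column)
total = len(column)
for value, count in counts.most_common():
    print(f"{value}  {count:>2}  {100 * count / total:.1f}%")

# "Distinct" counts different values; "Unique" counts values seen exactly once.
distinct = len(counts)
unique = sum(1 for c in counts.values() if c == 1)
print(f"distinct: {distinct} ({100 * distinct / total:.1f}%), unique: {unique}")
```

Every date occurs at least twice, which is why the column has 8 distinct values but no unique ones.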

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

대표품목 (representative item)
Text

Distinct: 33
Distinct (%): 68.8%
Missing: 0
Missing (%): 0.0%
Memory size: 516.0 B

Length

Max length: 6
Median length: 5
Mean length: 3.3958333
Min length: 2

Characters and Unicode

Total characters: 163
Distinct characters: 79
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 25
Unique (%): 52.1%

Sample

1st row: 김치찌개
2nd row: 돌솥비빔밥
3rd row: 목욕
4th row: 차량밧데리
5th row: 추어탕

Value | Count | Frequency (%)
헤어컷 | 5 | 10.2%
삼겹살 | 4 | 8.2%
세탁 | 3 | 6.1%
짜장면 | 3 | 6.1%
보쌈정식 | 2 | 4.1%
각종찌개 | 2 | 4.1%
칼국수 | 2 | 4.1%
목욕 | 2 | 4.1%
노가리 | 1 | 2.0%
돼지불백 | 1 | 2.0%
Other values (24) | 24 | 49.0%

Most occurring characters

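Value | Count | Frequency (%)
(Hangul character) | 6 | 3.7%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 4 | 2.5%
(Hangul character) | 4 | 2.5%
(Hangul character) | 4 | 2.5%
Other values (69) | 115 | 70.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 161 | 98.8%
Other Punctuation | 1 | 0.6%
Space Separator | 1 | 0.6%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(Hangul character) | 6 | 3.7%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 4 | 2.5%
(Hangul character) | 4 | 2.5%
(Hangul character) | 4 | 2.5%
Other values (67) | 113 | 70.2%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 161 | 98.8%
Common | 2 | 1.2%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(Hangul character) | 6 | 3.7%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 4 | 2.5%
(Hangul character) | 4 | 2.5%
(Hangul character) | 4 | 2.5%
Other values (67) | 113 | 70.2%

Common
Value | Count | Frequency (%)
, | 1 | 50.0%
(space) | 1 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 161 | 98.8%
ASCII | 2 | 1.2%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(Hangul character) | 6 | 3.7%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 5 | 3.1%
(Hangul character) | 4 | 2.5%
(Hangul character) | 4 | 2.5%
(Hangul character) | 4 | 2.5%
Other values (67) | 113 | 70.2%

ASCII
Value | Count | Frequency (%)
, | 1 | 50.0%
(space) | 1 | 50.0%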

새주소 (new road-name address)
Text

UNIQUE 

Distinct: 48
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 516.0 B

Length

Max length: 21
Median length: 18.5
Mean length: 16.604167
Min length: 10

Characters and Unicode

Total characters: 797
Distinct characters: 50
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 48
Unique (%): 100.0%

Sample

1st row: 월드컵대로188번길 40 (거제1동)
2nd row: 교대로 9 (거제1동)
3rd row: 여고로52번길 45 (거제1동)
4th row: 거제천로 195-1 (거제1동)
5th row: 월드컵대로 227-1 (거제2동)

Value | Count | Frequency (%)
연산9동 | 9 | 6.4%
연산2동 | 6 | 4.3%
연산5동 | 6 | 4.3%
거제2동 | 5 | 3.5%
월드컵대로 | 5 | 3.5%
연산1동 | 4 | 2.8%
거제1동 | 4 | 2.8%
연산6동 | 4 | 2.8%
40 | 3 | 2.1%
연수로87번길 | 3 | 2.1%
Other values (79) | 92 | 65.2%

Most occurring characters

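Value | Count | Frequency (%)
(space) | 93 | 11.7%
(Hangul character) | 48 | 6.0%
(Hangul character) | 46 | 5.8%
1 | 46 | 5.8%
) | 45 | 5.6%
( | 45 | 5.6%
(Hangul character) | 42 | 5.3%
2 | 39 | 4.9%
(Hangul character) | 36 | 4.5%
(Hangul character) | 28 | 3.5%
Other values (40) | 329 | 41.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 375 | 47.1%
Decimal Number | 228 | 28.6%
Space Separator | 93 | 11.7%
Close Punctuation | 45 | 5.6%
Open Punctuation | 45 | 5.6%
Dash Punctuation | 10 | 1.3%
Other Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(Hangul character) | 48 | 12.8%
(Hangul character) | 46 | 12.3%
(Hangul character) | 42 | 11.2%
(Hangul character) | 36 | 9.6%
(Hangul character) | 28 | 7.5%
(Hangul character) | 28 | 7.5%
(Hangul character) | 18 | 4.8%
(Hangul character) | 17 | 4.5%
(Hangul character) | 15 | 4.0%
(Hangul character) | 12 | 3.2%
Other values (25) | 85 | 22.7%

Decimal Number
Value | Count | Frequency (%)
1 | 46 | 20.2%
2 | 39 | 17.1%
5 | 26 | 11.4%
3 | 22 | 9.6%
4 | 21 | 9.2%
9 | 19 | 8.3%
8 | 16 | 7.0%
7 | 14 | 6.1%
6 | 13 | 5.7%
0 | 12 | 5.3%

Space Separator
Value | Count | Frequency (%)
(space) | 93 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 45 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 45 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 10 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 422 | 52.9%
Hangul | 375 | 47.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(Hangul character) | 48 | 12.8%
(Hangul character) | 46 | 12.3%
(Hangul character) | 42 | 11.2%
(Hangul character) | 36 | 9.6%
(Hangul character) | 28 | 7.5%
(Hangul character) | 28 | 7.5%
(Hangul character) | 18 | 4.8%
(Hangul character) | 17 | 4.5%
(Hangul character) | 15 | 4.0%
(Hangul character) | 12 | 3.2%
Other values (25) | 85 | 22.7%

Common
Value | Count | Frequency (%)
(space) | 93 | 22.0%
1 | 46 | 10.9%
) | 45 | 10.7%
( | 45 | 10.7%
2 | 39 | 9.2%
5 | 26 | 6.2%
3 | 22 | 5.2%
4 | 21 | 5.0%
9 | 19 | 4.5%
8 | 16 | 3.8%
Other values (5) | 50 | 11.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 422 | 52.9%
Hangul | 375 | 47.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 93 | 22.0%
1 | 46 | 10.9%
) | 45 | 10.7%
( | 45 | 10.7%
2 | 39 | 9.2%
5 | 26 | 6.2%
3 | 22 | 5.2%
4 | 21 | 5.0%
9 | 19 | 4.5%
8 | 16 | 3.8%
Other values (5) | 50 | 11.8%

Hangul
Value | Count | Frequency (%)
(Hangul character) | 48 | 12.8%
(Hangul character) | 46 | 12.3%
(Hangul character) | 42 | 11.2%
(Hangul character) | 36 | 9.6%
(Hangul character) | 28 | 7.5%
(Hangul character) | 28 | 7.5%
(Hangul character) | 18 | 4.8%
(Hangul character) | 17 | 4.5%
(Hangul character) | 15 | 4.0%
(Hangul character) | 12 | 3.2%
Other values (25) | 85 | 22.7%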

연락처 (contact number)
Text

MISSING 

Distinct: 47
Distinct (%): 100.0%
Missing: 1
Missing (%): 2.1%
Memory size: 516.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.021277
Min length: 12
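
The narrow length range of 12 to 13 suggests that every non-missing value follows one hyphenated layout. The sketch below checks values quoted in this report against a hypothetical pattern: `PHONE_RE` and `is_phone` are illustrative names, and the pattern is an assumption inferred from the visible values rather than a rule stated by the dataset.

```python
import re

# Hypothetical pattern: a three-digit prefix starting with 0, a 3- or 4-digit
# middle group, and a 4-digit final group, separated by hyphens.
PHONE_RE = re.compile(r"0\d{2}-\d{3,4}-\d{4}")

def is_phone(value):
    """Return True if value is a string matching the assumed phone layout."""
    return isinstance(value, str) and PHONE_RE.fullmatch(value) is not None

# Values quoted in this report, plus None to stand in for the missing cell.
samples = ["051-861-5582", "051-504-3072", "070-7721-5955", None]
for s in samples:
    print(repr(s), is_phone(s), len(s) if s is not None else "missing")

# One reading of the length statistics: 46 values of length 12 and one of
# length 13 among the 47 non-missing values.
mean_length = (46 * 12 + 1 * 13) / 47
print(round(mean_length, 6))
```

Under that reading the mean works out to the reported 12.021277, which is consistent with a single 13-character number among otherwise 12-character ones.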

Characters and Unicode

Total characters: 565
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 47
Unique (%): 100.0%

Sample

1st row: 051-861-5582
2nd row: 051-504-3072
3rd row: 051-503-1067
4th row: 051-865-5151
5th row: 051-507-1717

Value | Count | Frequency (%)
051-861-5582 | 1 | 2.1%
051-751-2911 | 1 | 2.1%
051-867-1713 | 1 | 2.1%
051-853-8005 | 1 | 2.1%
051-852-6333 | 1 | 2.1%
051-861-8588 | 1 | 2.1%
051-912-0609 | 1 | 2.1%
051-864-0017 | 1 | 2.1%
051-867-8993 | 1 | 2.1%
051-758-2046 | 1 | 2.1%
Other values (37) | 37 | 78.7%

Most occurring characters

Value | Count | Frequency (%)
- | 94 | 16.6%
5 | 92 | 16.3%
0 | 82 | 14.5%
1 | 81 | 14.3%
8 | 58 | 10.3%
6 | 40 | 7.1%
2 | 31 | 5.5%
7 | 31 | 5.5%
3 | 25 | 4.4%
9 | 16 | 2.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 471 | 83.4%
Dash Punctuation | 94 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 92 | 19.5%
0 | 82 | 17.4%
1 | 81 | 17.2%
8 | 58 | 12.3%
6 | 40 | 8.5%
2 | 31 | 6.6%
7 | 31 | 6.6%
3 | 25 | 5.3%
9 | 16 | 3.4%
4 | 15 | 3.2%

Dash Punctuation
Value | Count | Frequency (%)
- | 94 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 565 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 94 | 16.6%
5 | 92 | 16.3%
0 | 82 | 14.5%
1 | 81 | 14.3%
8 | 58 | 10.3%
6 | 40 | 7.1%
2 | 31 | 5.5%
7 | 31 | 5.5%
3 | 25 | 4.4%
9 | 16 | 2.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 565 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 94 | 16.6%
5 | 92 | 16.3%
0 | 82 | 14.5%
1 | 81 | 14.3%
8 | 58 | 10.3%
6 | 40 | 7.1%
2 | 31 | 5.5%
7 | 31 | 5.5%
3 | 25 | 4.4%
9 | 16 | 2.8%

Correlations

[Figure: correlation heatmap]

Variable | 업소명 | 지정일자 | 대표품목 | 새주소 | 연락처
업소명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
지정일자 | 1.000 | 1.000 | 0.757 | 1.000 | 1.000
대표품목 | 1.000 | 0.757 | 1.000 | 1.000 | 1.000
새주소 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
연락처 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
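
The report does not say which association measure fills this matrix. Assuming it is a Cramér's V-style statistic (an assumption, and the uncorrected textbook form at that), a column whose values are all distinct scores 1 against any non-constant column. That would explain the rows of 1.000 for the unique and near-unique columns, so those entries say little about real relationships. The sketch below uses the first ten sample rows of this report; `cramers_v` is an illustrative helper, not the library's implementation.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long sequences of labels."""
    n = len(xs)
    x_counts = Counter(xs)
    y_counts = Counter(ys)
    pair_counts = Counter(zip(xs, ys))
    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, nx in x_counts.items():
        for y, ny in y_counts.items():
            expected = nx * ny / n
            observed = pair_counts.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_counts), len(y_counts))
    if k < 2:
        return float("nan")  # undefined when either column is constant
    return math.sqrt(chi2 / (n * (k - 1)))

# 업소명 and 지정일자 from the first ten sample rows of this report.
names = ["청담골", "우정", "현대탕", "책임밧데리", "참추어탕",
         "울진가자미", "제주흑돼지뒷고기", "평양갈비", "반도이용원", "동진가셀프식당"]
dates = ["2015-02-26", "2016-07-31", "2017-03-01", "2019-04-01", "2013-06-18",
         "2016-07-31", "2017-03-01", "2017-03-01", "2017-03-01", "2012-06-18"]

# All names are distinct, so each name determines its date exactly.
print(cramers_v(names, dates))

# Contrast: two labels that are independent of each other.
print(cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"]))
```

The only off-diagonal entry below 1.000, the 0.757 between 지정일자 and 대표품목, involves the two columns with repeated values, and it cannot be recomputed from the sample rows alone.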

Missing values

Figure: A simple visualization of nullity by column.
Figure: Nullity matrix, a data-dense display which lets you quickly pick out patterns in data completion.

Sample

Index | 업소명 | 지정일자 | 대표품목 | 새주소 | 연락처
0 | 청담골 | 2015-02-26 | 김치찌개 | 월드컵대로188번길 40 (거제1동) | 051-861-5582
1 | 우정 | 2016-07-31 | 돌솥비빔밥 | 교대로 9 (거제1동) | 051-504-3072
2 | 현대탕 | 2017-03-01 | 목욕 | 여고로52번길 45 (거제1동) | 051-503-1067
3 | 책임밧데리 | 2019-04-01 | 차량밧데리 | 거제천로 195-1 (거제1동) | 051-865-5151
4 | 참추어탕 | 2013-06-18 | 추어탕 | 월드컵대로 227-1 (거제2동) | 051-507-1717
5 | 울진가자미 | 2016-07-31 | 가자미조림 | 월드컵대로 245-2 (거제2동) | 051-501-0434
6 | 제주흑돼지뒷고기 | 2017-03-01 | 뒷고기 | 월드컵대로 223 (거제2동) | 051-503-5630
7 | 평양갈비 | 2017-03-01 | 불고기정식 | 아시아드대로28번길 13 (거제2동) | 051-502-8038
8 | 반도이용원 | 2017-03-01 | 헤어컷 | 월드컵대로235번길 33 (거제2동) | 051-502-0298
9 | 동진가셀프식당 | 2012-06-18 | 갈비탕 | 거제시장로14번길 55 (거제3동) | 051-867-2658

Index | 업소명 | 지정일자 | 대표품목 | 새주소 | 연락처
38 | 맨남성컷전문 | 2017-03-01 | 헤어컷 | 고분로236번길 70 (연산9동) | 051-756-0815
39 | 하나둘세탁 | 2017-03-01 | 세탁 | 고분로 200, 208호 (연산9동) | 051-759-8449
40 | 서울세탁 | 2015-02-26 | 세탁 | 연안로13번길 51 (연산9동) | 051-755-1735
41 | 시원탕 | 2017-03-01 | 목욕 | 과정로348번길 23 (연산9동) | 051-864-8919
42 | 호텔오마이 | 2017-03-01 | 숙박 | 과정로191번길 59 (연산9동) | 051-751-5800
43 | 최짬뽕달인 | 2019-04-01 | 짬뽕 | 과정로 105-7 (연산9동) | 051-757-3183
44 | 맨인블랙 | 2019-04-01 | 헤어컷 | 과정로191번가길 62 (연산9동) | 051-761-0833
45 | 맑은생고기식육식당 | 2020-03-25 | 삼겹살 | 연수로87번길 38 | 051-867-0311
46 | 연제왕칼국수 | 2020-03-25 | 칼국수 | 연수로87번길 15 | 051-853-2209
47 | 민진로스팅 | 2020-03-25 | 커피, 원두 | 거제천로 126-1 | 070-7721-5955