Overview

Dataset statistics

Number of variables5
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 KiB
Average record size in memory44.4 B

Variable types

Categorical1
Text4

Dataset

Description대구광역시 동구_착한가격업소_20200428
Author대구광역시 동구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15059765&dataSetDetailId=150597651a6e80796d9ee&provdMethod=FILE

Alerts

업종 is highly imbalanced (64.6%)Imbalance
업소명 has unique valuesUnique
소재지 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2024-04-17 19:33:29.112200
Analysis finished2024-04-17 19:33:29.389071
Duration0.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

IMBALANCE 

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
대중음식점
27 
세탁업
 
2
미용업
 
1

Length

Max length5
Median length5
Mean length4.8
Min length3

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st row대중음식점
2nd row대중음식점
3rd row대중음식점
4th row대중음식점
5th row대중음식점

Common Values

ValueCountFrequency (%)
대중음식점 27
90.0%
세탁업 2
 
6.7%
미용업 1
 
3.3%

Length

2024-04-18T04:33:29.447862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-18T04:33:29.544209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대중음식점 27
90.0%
세탁업 2
 
6.7%
미용업 1
 
3.3%

업소명
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2024-04-18T04:33:29.690517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length7
Mean length5.3666667
Min length2

Characters and Unicode

Total characters161
Distinct characters88
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row고두밭숯불촌
2nd row대구삼계탕
3rd row두곡동숯불갈비
4th row영남루반점
5th row유정갈비
ValueCountFrequency (%)
고두밭숯불촌 1
 
3.1%
대구삼계탕 1
 
3.1%
헤어 1
 
3.1%
헐리우드 1
 
3.1%
면사랑칼국수 1
 
3.1%
소반 1
 
3.1%
잔치국수 1
 
3.1%
희망나눔장수 1
 
3.1%
청도추어탕 1
 
3.1%
대청봉 1
 
3.1%
Other values (22) 22
68.8%
2024-04-18T04:33:29.948560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6
 
3.7%
6
 
3.7%
6
 
3.7%
5
 
3.1%
5
 
3.1%
5
 
3.1%
5
 
3.1%
5
 
3.1%
4
 
2.5%
4
 
2.5%
Other values (78) 110
68.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 159
98.8%
Space Separator 2
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
3.8%
6
 
3.8%
6
 
3.8%
5
 
3.1%
5
 
3.1%
5
 
3.1%
5
 
3.1%
5
 
3.1%
4
 
2.5%
4
 
2.5%
Other values (77) 108
67.9%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 159
98.8%
Common 2
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
 
3.8%
6
 
3.8%
6
 
3.8%
5
 
3.1%
5
 
3.1%
5
 
3.1%
5
 
3.1%
5
 
3.1%
4
 
2.5%
4
 
2.5%
Other values (77) 108
67.9%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 159
98.8%
ASCII 2
 
1.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6
 
3.8%
6
 
3.8%
6
 
3.8%
5
 
3.1%
5
 
3.1%
5
 
3.1%
5
 
3.1%
5
 
3.1%
4
 
2.5%
4
 
2.5%
Other values (77) 108
67.9%
ASCII
ValueCountFrequency (%)
2
100.0%

소재지
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2024-04-18T04:33:30.135853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length13.2
Min length10

Characters and Unicode

Total characters396
Distinct characters55
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row큰고개로 24(신암동)
2nd row대현로 118-7(신암동)
3rd row아양로49길 6(신암동)
4th row해동로 18(지저동)
5th row아양로 6(신암동)
ValueCountFrequency (%)
아양로 4
 
6.5%
동촌로 4
 
6.5%
해동로 3
 
4.8%
동부로 2
 
3.2%
6(신암동 2
 
3.2%
장등로 2
 
3.2%
37길 1
 
1.6%
아양로34길 1
 
1.6%
18(신암동 1
 
1.6%
파계로 1
 
1.6%
Other values (41) 41
66.1%
2024-04-18T04:33:30.425715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
43
 
10.9%
32
 
8.1%
32
 
8.1%
( 30
 
7.6%
) 30
 
7.6%
1 22
 
5.6%
2 14
 
3.5%
13
 
3.3%
3 12
 
3.0%
12
 
3.0%
Other values (45) 156
39.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 194
49.0%
Decimal Number 99
25.0%
Space Separator 32
 
8.1%
Open Punctuation 30
 
7.6%
Close Punctuation 30
 
7.6%
Dash Punctuation 10
 
2.5%
Other Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
43
22.2%
32
16.5%
13
 
6.7%
12
 
6.2%
9
 
4.6%
7
 
3.6%
7
 
3.6%
6
 
3.1%
6
 
3.1%
5
 
2.6%
Other values (30) 54
27.8%
Decimal Number
ValueCountFrequency (%)
1 22
22.2%
2 14
14.1%
3 12
12.1%
8 9
9.1%
0 9
9.1%
4 9
9.1%
6 8
 
8.1%
7 6
 
6.1%
5 6
 
6.1%
9 4
 
4.0%
Space Separator
ValueCountFrequency (%)
32
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 202
51.0%
Hangul 194
49.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
43
22.2%
32
16.5%
13
 
6.7%
12
 
6.2%
9
 
4.6%
7
 
3.6%
7
 
3.6%
6
 
3.1%
6
 
3.1%
5
 
2.6%
Other values (30) 54
27.8%
Common
ValueCountFrequency (%)
32
15.8%
( 30
14.9%
) 30
14.9%
1 22
10.9%
2 14
6.9%
3 12
 
5.9%
- 10
 
5.0%
8 9
 
4.5%
0 9
 
4.5%
4 9
 
4.5%
Other values (5) 25
12.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 202
51.0%
Hangul 194
49.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
43
22.2%
32
16.5%
13
 
6.7%
12
 
6.2%
9
 
4.6%
7
 
3.6%
7
 
3.6%
6
 
3.1%
6
 
3.1%
5
 
2.6%
Other values (30) 54
27.8%
ASCII
ValueCountFrequency (%)
32
15.8%
( 30
14.9%
) 30
14.9%
1 22
10.9%
2 14
6.9%
3 12
 
5.9%
- 10
 
5.0%
8 9
 
4.5%
0 9
 
4.5%
4 9
 
4.5%
Other values (5) 25
12.4%
Distinct27
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2024-04-18T04:33:30.574593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length17
Mean length7.2333333
Min length2

Characters and Unicode

Total characters217
Distinct characters74
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)80.0%

Sample

1st row돼지갈비 (미국산) 200g
2nd row삼계탕
3rd row돼지왕갈비(국내산) 200g
4th row자장면
5th row왕갈비(국내산) 200g
ValueCountFrequency (%)
200g 4
 
10.3%
잔치국수 2
 
5.1%
복어탕 2
 
5.1%
삼계탕 2
 
5.1%
삼겹살(국내산 2
 
5.1%
추어탕 1
 
2.6%
해물칼국수 1
 
2.6%
돼지찌개 1
 
2.6%
얼큰냉면 1
 
2.6%
회덮밥 1
 
2.6%
Other values (22) 22
56.4%
2024-04-18T04:33:30.827601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14
 
6.5%
0 13
 
6.0%
12
 
5.5%
) 10
 
4.6%
10
 
4.6%
( 10
 
4.6%
g 9
 
4.1%
7
 
3.2%
6
 
2.8%
6
 
2.8%
Other values (64) 120
55.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 146
67.3%
Decimal Number 28
 
12.9%
Space Separator 12
 
5.5%
Close Punctuation 10
 
4.6%
Open Punctuation 10
 
4.6%
Lowercase Letter 9
 
4.1%
Other Punctuation 2
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
9.6%
10
 
6.8%
7
 
4.8%
6
 
4.1%
6
 
4.1%
6
 
4.1%
6
 
4.1%
5
 
3.4%
5
 
3.4%
5
 
3.4%
Other values (52) 76
52.1%
Decimal Number
ValueCountFrequency (%)
0 13
46.4%
2 6
21.4%
1 5
 
17.9%
3 1
 
3.6%
8 1
 
3.6%
4 1
 
3.6%
5 1
 
3.6%
Space Separator
ValueCountFrequency (%)
12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Lowercase Letter
ValueCountFrequency (%)
g 9
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 146
67.3%
Common 62
28.6%
Latin 9
 
4.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
9.6%
10
 
6.8%
7
 
4.8%
6
 
4.1%
6
 
4.1%
6
 
4.1%
6
 
4.1%
5
 
3.4%
5
 
3.4%
5
 
3.4%
Other values (52) 76
52.1%
Common
ValueCountFrequency (%)
0 13
21.0%
12
19.4%
) 10
16.1%
( 10
16.1%
2 6
9.7%
1 5
 
8.1%
, 2
 
3.2%
3 1
 
1.6%
8 1
 
1.6%
4 1
 
1.6%
Latin
ValueCountFrequency (%)
g 9
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 146
67.3%
ASCII 71
32.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
14
 
9.6%
10
 
6.8%
7
 
4.8%
6
 
4.1%
6
 
4.1%
6
 
4.1%
6
 
4.1%
5
 
3.4%
5
 
3.4%
5
 
3.4%
Other values (52) 76
52.1%
ASCII
ValueCountFrequency (%)
0 13
18.3%
12
16.9%
) 10
14.1%
( 10
14.1%
g 9
12.7%
2 6
8.5%
1 5
 
7.0%
, 2
 
2.8%
3 1
 
1.4%
8 1
 
1.4%
Other values (2) 2
 
2.8%

전화번호
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2024-04-18T04:33:31.004197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters360
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row053-956-1442
2nd row053-955-7848
3rd row053-942-8495
4th row053-981-9881
5th row053-943-6616
ValueCountFrequency (%)
053-956-1442 1
 
3.3%
053-955-7848 1
 
3.3%
053-000-0000 1
 
3.3%
053-954-2001 1
 
3.3%
053-752-1269 1
 
3.3%
053-269-8045 1
 
3.3%
053-959-9964 1
 
3.3%
053-941-6540 1
 
3.3%
053-982-0890 1
 
3.3%
053-981-4044 1
 
3.3%
Other values (20) 20
66.7%
2024-04-18T04:33:31.290983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 60
16.7%
0 53
14.7%
5 53
14.7%
3 44
12.2%
9 41
11.4%
8 26
7.2%
4 23
 
6.4%
2 20
 
5.6%
1 17
 
4.7%
6 14
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 300
83.3%
Dash Punctuation 60
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 53
17.7%
5 53
17.7%
3 44
14.7%
9 41
13.7%
8 26
8.7%
4 23
7.7%
2 20
 
6.7%
1 17
 
5.7%
6 14
 
4.7%
7 9
 
3.0%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 360
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 60
16.7%
0 53
14.7%
5 53
14.7%
3 44
12.2%
9 41
11.4%
8 26
7.2%
4 23
 
6.4%
2 20
 
5.6%
1 17
 
4.7%
6 14
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 360
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 60
16.7%
0 53
14.7%
5 53
14.7%
3 44
12.2%
9 41
11.4%
8 26
7.2%
4 23
 
6.4%
2 20
 
5.6%
1 17
 
4.7%
6 14
 
3.9%

Correlations

2024-04-18T04:33:31.365621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종업소명소재지취급품목(원산지)전화번호
업종1.0001.0001.0001.0001.000
업소명1.0001.0001.0001.0001.000
소재지1.0001.0001.0001.0001.000
취급품목(원산지)1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000

Missing values

2024-04-18T04:33:29.358331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종업소명소재지취급품목(원산지)전화번호
0대중음식점고두밭숯불촌큰고개로 24(신암동)돼지갈비 (미국산) 200g053-956-1442
1대중음식점대구삼계탕대현로 118-7(신암동)삼계탕053-955-7848
2대중음식점두곡동숯불갈비아양로49길 6(신암동)돼지왕갈비(국내산) 200g053-942-8495
3대중음식점영남루반점해동로 18(지저동)자장면053-981-9881
4대중음식점유정갈비아양로 6(신암동)왕갈비(국내산) 200g053-943-6616
5대중음식점태종대팔공로24길 19-10(불로동)삼겹살(국내산) 140g053-983-3477
6대중음식점팔공식당송라로32길 5(신암동)돌솥비빔밭053-941-1289
7대중음식점흥부고을숯불갈비동촌로 80-14(검사동)돼지갈비(독일,칠레산)250g053-986-0092
8대중음식점고향손칼국수장등로 35(신천동)잔치국수053-752-8894
9세탁업무한세탁소동호로2길 3(동호동)정장1벌053-961-8250
업종업소명소재지취급품목(원산지)전화번호
20세탁업방촌식육식당해동로 244-1(검사동)돼지찌개053-981-5511
21대중음식점시장냉면팔공로30길 10-3(불로동)얼큰냉면053-981-4044
22대중음식점정동진동촌로 31(입석동)회덮밥053-982-0890
23대중음식점대청봉아양로 206-1(효목동)삼계탕053-941-6540
24대중음식점청도추어탕아양로 37길 7(신암동)추어탕053-959-9964
25대중음식점희망나눔장수 잔치국수동부로 32길 128(신천동)잔치국수053-269-8045
26대중음식점소반장등로 85-1(신천동)정식053-752-1269
27대중음식점면사랑칼국수효동로 126, 2층(효목동)칼국수053-954-2001
28미용업헐리우드 헤어동부로 162(신천동)헤어컷(남성)053-000-0000
29대중음식점부산복해물칼국수팔공로28길 8-10(불로동)복어탕053-959-2830