Overview

Dataset statistics

Number of variables: 4
Number of observations: 222
Missing cells: 116
Missing cells (%): 13.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 7.1 KiB
Average record size in memory: 32.6 B
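The missing-cell percentage follows from the other figures in this list: the number of cells is the number of variables times the number of observations. A minimal sketch of that arithmetic, with the variable names chosen here purely for illustration:

```python
# Figures copied from the "Dataset statistics" list above.
n_variables = 4
n_observations = 222
missing_cells = 116

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations

# Share of cells that are missing, as a percentage.
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```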

Variable types

Categorical: 1
Text: 3

Dataset

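Description: 부산광역시연제구축산물판매업현황_20201118 (Busan Metropolitan City, Yeonje-gu: status of livestock product sales businesses, 2020-11-18)
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15025158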

Alerts

소재지전화 (location phone number) has 115 (51.8%) missing values. Alert type: Missing

Reproduction

Analysis started: 2023-12-10 17:35:59.452054
Analysis finished: 2023-12-10 17:36:00.563642
Duration: 1.11 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
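A report of this kind can be regenerated along the following lines. This is only a sketch: the function name `build_report`, the CSV file name in the usage comment, and the `cp949` encoding are assumptions (Korean public-data CSV files often use that encoding, but the source does not say), and the original `config.json` settings are not reproduced here.

```python
def build_report(csv_path, output_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported inside the function so the module loads even when
    # pandas / ydata-profiling are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the actual file.
    df = pd.read_csv(csv_path, encoding=encoding)

    report = ProfileReport(df, title="부산광역시연제구축산물판매업현황_20201118")
    report.to_file(output_path)
    return report


# Hypothetical usage (the file name is not given in the report):
# build_report("yeonje_livestock_sales.csv")
```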

Variables

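상호 (trade name)
Categorical

Distinct: 5
Distinct (%): 2.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Value | Count
식육판매업 (meat sales business) | 91
식육즉석판매가공업 (instant meat sales and processing business) | 72
우유류판매업 (milk products sales business) | 43
축산물유통전문판매업 (livestock products distribution and sales business) | 9
식용란수집판매업 (edible egg collection and sales business) | 7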

Length

Max length: 10
Median length: 9
Mean length: 6.7882883
Min length: 5
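The maximum, minimum, and mean lengths can be checked against the category counts reported for this variable. A minimal sketch, assuming length means the number of Unicode code points in each value:

```python
# Category counts as reported for the 상호 variable.
counts = {
    "식육판매업": 91,
    "식육즉석판매가공업": 72,
    "우유류판매업": 43,
    "축산물유통전문판매업": 9,
    "식용란수집판매업": 7,
}

total_rows = sum(counts.values())

# Weight each value's length by how often it occurs.
total_chars = sum(len(value) * n for value, n in counts.items())
mean_length = total_chars / total_rows

min_length = min(len(value) for value in counts)
max_length = max(len(value) for value in counts)

print(total_rows, min_length, max_length, round(mean_length, 7))
```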

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 우유류판매업
2nd row: 우유류판매업
3rd row: 우유류판매업
4th row: 우유류판매업
5th row: 우유류판매업

Common Values

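Value | Count | Frequency (%)
식육판매업 (meat sales business) | 91 | 41.0%
식육즉석판매가공업 (instant meat sales and processing business) | 72 | 32.4%
우유류판매업 (milk products sales business) | 43 | 19.4%
축산물유통전문판매업 (livestock products distribution and sales business) | 9 | 4.1%
식용란수집판매업 (edible egg collection and sales business) | 7 | 3.2%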

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the common values listed in the table above.]

사업장명칭 (business establishment name)
Text

Distinct: 214
Distinct (%): 96.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 16
Median length: 14
Mean length: 6.7837838
Min length: 2

Characters and Unicode

Total characters: 1506
Distinct characters: 246
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
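As an illustration of how such character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The `CATEGORY_NAMES` mapping and the `category_counts` helper are names introduced here for illustration, covering only the categories that appear in this report. Scripts and blocks are not covered, since the standard library does not expose them.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(text):
    """Count characters in `text` by Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    # Fall back to the two-letter code for categories not in the mapping.
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


# One business name taken from the sample rows of this dataset.
sample = "부경양돈M&F 거제점"
for name, n in category_counts(sample).most_common():
    print(name, n)
```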

Unique

Unique: 206
Unique (%): 92.8%

Sample

1st row: (주)한국야쿠르트연산점
2nd row: 부산우유 제5대리점
3rd row: (주)한국야쿠르트양정점
4th row: (주)한국야쿠르트 연제점
5th row: 부산우유 연산8대리점
Value | Count | Frequency (%)
부산우유 | 7 | 2.5%
남양유업 | 3 | 1.1%
한우촌 | 3 | 1.1%
주식회사 | 3 | 1.1%
정육점 | 3 | 1.1%
서울우유 | 2 | 0.7%
진주통닭 | 2 | 0.7%
식육 | 2 | 0.7%
남양우유 | 2 | 0.7%
거제점 | 2 | 0.7%
Other values (238) | 246 | 89.5%

Most occurring characters

Value | Count | Frequency (%)
 | 85 | 5.6%
 | 71 | 4.7%
 | 54 | 3.6%
 | 53 | 3.5%
 | 53 | 3.5%
 | 49 | 3.3%
 | 46 | 3.1%
 | 38 | 2.5%
 | 35 | 2.3%
 | 34 | 2.3%
Other values (236) | 988 | 65.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1370 | 91.0%
Space Separator | 53 | 3.5%
Close Punctuation | 29 | 1.9%
Open Punctuation | 29 | 1.9%
Uppercase Letter | 11 | 0.7%
Decimal Number | 11 | 0.7%
Math Symbol | 2 | 0.1%
Other Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 85 | 6.2%
 | 71 | 5.2%
 | 54 | 3.9%
 | 53 | 3.9%
 | 49 | 3.6%
 | 46 | 3.4%
 | 38 | 2.8%
 | 35 | 2.6%
 | 34 | 2.5%
 | 34 | 2.5%
Other values (218) | 871 | 63.6%

Uppercase Letter
Value | Count | Frequency (%)
J | 4 | 36.4%
N | 2 | 18.2%
H | 1 | 9.1%
T | 1 | 9.1%
E | 1 | 9.1%
F | 1 | 9.1%
M | 1 | 9.1%

Decimal Number
Value | Count | Frequency (%)
1 | 4 | 36.4%
5 | 2 | 18.2%
2 | 2 | 18.2%
0 | 1 | 9.1%
3 | 1 | 9.1%
8 | 1 | 9.1%

Space Separator
Value | Count | Frequency (%)
(space) | 53 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 29 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 29 | 100.0%

Math Symbol
Value | Count | Frequency (%)
+ | 2 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1370 | 91.0%
Common | 125 | 8.3%
Latin | 11 | 0.7%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 85 | 6.2%
 | 71 | 5.2%
 | 54 | 3.9%
 | 53 | 3.9%
 | 49 | 3.6%
 | 46 | 3.4%
 | 38 | 2.8%
 | 35 | 2.6%
 | 34 | 2.5%
 | 34 | 2.5%
Other values (218) | 871 | 63.6%

Common
Value | Count | Frequency (%)
(space) | 53 | 42.4%
) | 29 | 23.2%
( | 29 | 23.2%
1 | 4 | 3.2%
+ | 2 | 1.6%
5 | 2 | 1.6%
2 | 2 | 1.6%
& | 1 | 0.8%
0 | 1 | 0.8%
3 | 1 | 0.8%

Latin
Value | Count | Frequency (%)
J | 4 | 36.4%
N | 2 | 18.2%
H | 1 | 9.1%
T | 1 | 9.1%
E | 1 | 9.1%
F | 1 | 9.1%
M | 1 | 9.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1370 | 91.0%
ASCII | 136 | 9.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 85 | 6.2%
 | 71 | 5.2%
 | 54 | 3.9%
 | 53 | 3.9%
 | 49 | 3.6%
 | 46 | 3.4%
 | 38 | 2.8%
 | 35 | 2.6%
 | 34 | 2.5%
 | 34 | 2.5%
Other values (218) | 871 | 63.6%

ASCII
Value | Count | Frequency (%)
(space) | 53 | 39.0%
) | 29 | 21.3%
( | 29 | 21.3%
J | 4 | 2.9%
1 | 4 | 2.9%
N | 2 | 1.5%
+ | 2 | 1.5%
5 | 2 | 1.5%
2 | 2 | 1.5%
H | 1 | 0.7%
Other values (8) | 8 | 5.9%

소재지주소(도로명) (location address, road name)
Text

Distinct: 210
Distinct (%): 95.0%
Missing: 1
Missing (%): 0.5%
Memory size: 1.9 KiB

Length

Max length: 51
Median length: 46
Mean length: 28.253394
Min length: 21

Characters and Unicode

Total characters: 6244
Distinct characters: 146
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 200
Unique (%): 90.5%

Sample

1st row: 부산광역시 연제구 중앙대로1076번길 26, 1층, 지하1층 (연산동)
2nd row: 부산광역시 연제구 고분로32번길 68 (연산동)
3rd row: 부산광역시 연제구 거제천로 33 (거제동)
4th row: 부산광역시 연제구 안연로 20 (연산동)
5th row: 부산광역시 연제구 과정로225번길 15, 106호 (연산동,남산맨션)
Value | Count | Frequency (%)
부산광역시 | 221 | 18.4%
연제구 | 220 | 18.3%
연산동 | 163 | 13.6%
거제동 | 47 | 3.9%
1층 | 22 | 1.8%
7 | 10 | 0.8%
과정로 | 10 | 0.8%
28 | 8 | 0.7%
과정로276번길 | 8 | 0.7%
8 | 8 | 0.7%
Other values (267) | 483 | 40.2%

Most occurring characters

Value | Count | Frequency (%)
(space) | 979 | 15.7%
 | 418 | 6.7%
 | 406 | 6.5%
 | 305 | 4.9%
 | 245 | 3.9%
 | 241 | 3.9%
 | 224 | 3.6%
 | 222 | 3.6%
( | 221 | 3.5%
) | 221 | 3.5%
Other values (136) | 2762 | 44.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3824 | 61.2%
Space Separator | 979 | 15.7%
Decimal Number | 899 | 14.4%
Open Punctuation | 221 | 3.5%
Close Punctuation | 221 | 3.5%
Other Punctuation | 78 | 1.2%
Dash Punctuation | 17 | 0.3%
Uppercase Letter | 3 | < 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 418 | 10.9%
 | 406 | 10.6%
 | 305 | 8.0%
 | 245 | 6.4%
 | 241 | 6.3%
 | 224 | 5.9%
 | 222 | 5.8%
 | 221 | 5.8%
 | 221 | 5.8%
 | 220 | 5.8%
Other values (117) | 1101 | 28.8%

Decimal Number
Value | Count | Frequency (%)
1 | 221 | 24.6%
2 | 136 | 15.1%
3 | 100 | 11.1%
4 | 77 | 8.6%
0 | 75 | 8.3%
8 | 67 | 7.5%
7 | 61 | 6.8%
6 | 61 | 6.8%
5 | 59 | 6.6%
9 | 42 | 4.7%

Uppercase Letter
Value | Count | Frequency (%)
E | 1 | 33.3%
C | 1 | 33.3%
B | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 979 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 221 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 221 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 78 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 17 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3824 | 61.2%
Common | 2417 | 38.7%
Latin | 3 | < 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 418 | 10.9%
 | 406 | 10.6%
 | 305 | 8.0%
 | 245 | 6.4%
 | 241 | 6.3%
 | 224 | 5.9%
 | 222 | 5.8%
 | 221 | 5.8%
 | 221 | 5.8%
 | 220 | 5.8%
Other values (117) | 1101 | 28.8%

Common
Value | Count | Frequency (%)
(space) | 979 | 40.5%
( | 221 | 9.1%
) | 221 | 9.1%
1 | 221 | 9.1%
2 | 136 | 5.6%
3 | 100 | 4.1%
, | 78 | 3.2%
4 | 77 | 3.2%
0 | 75 | 3.1%
8 | 67 | 2.8%
Other values (6) | 242 | 10.0%

Latin
Value | Count | Frequency (%)
E | 1 | 33.3%
C | 1 | 33.3%
B | 1 | 33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3824 | 61.2%
ASCII | 2420 | 38.8%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 979 | 40.5%
( | 221 | 9.1%
) | 221 | 9.1%
1 | 221 | 9.1%
2 | 136 | 5.6%
3 | 100 | 4.1%
, | 78 | 3.2%
4 | 77 | 3.2%
0 | 75 | 3.1%
8 | 67 | 2.8%
Other values (9) | 245 | 10.1%

Hangul
Value | Count | Frequency (%)
 | 418 | 10.9%
 | 406 | 10.6%
 | 305 | 8.0%
 | 245 | 6.4%
 | 241 | 6.3%
 | 224 | 5.9%
 | 222 | 5.8%
 | 221 | 5.8%
 | 221 | 5.8%
 | 220 | 5.8%
Other values (117) | 1101 | 28.8%

소재지전화 (location phone number)
Text

Alert: MISSING

Distinct: 106
Distinct (%): 99.1%
Missing: 115
Missing (%): 51.8%
Memory size: 1.9 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.009346
Min length: 12

Characters and Unicode

Total characters: 1285
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 105
Unique (%): 98.1%

Sample

1st row: 051-867-1497
2nd row: 051-863-2643
3rd row: 051-867-1531
4th row: 051-759-9052
5th row: 051-861-6986
Value | Count | Frequency (%)
051-868-6311 | 2 | 1.9%
051-754-9992 | 1 | 0.9%
051-704-8874 | 1 | 0.9%
051-523-8884 | 1 | 0.9%
051-503-5741 | 1 | 0.9%
051-867-7189 | 1 | 0.9%
051-852-2002 | 1 | 0.9%
051-853-1236 | 1 | 0.9%
051-850-0321 | 1 | 0.9%
051-507-2100 | 1 | 0.9%
Other values (96) | 96 | 89.7%

Most occurring characters

Value | Count | Frequency (%)
- | 214 | 16.7%
5 | 203 | 15.8%
1 | 180 | 14.0%
0 | 167 | 13.0%
8 | 134 | 10.4%
6 | 97 | 7.5%
7 | 71 | 5.5%
2 | 71 | 5.5%
3 | 52 | 4.0%
4 | 48 | 3.7%
Other values (2) | 48 | 3.7%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1070 | 83.3%
Dash Punctuation | 214 | 16.7%
Space Separator | 1 | 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 203 | 19.0%
1 | 180 | 16.8%
0 | 167 | 15.6%
8 | 134 | 12.5%
6 | 97 | 9.1%
7 | 71 | 6.6%
2 | 71 | 6.6%
3 | 52 | 4.9%
4 | 48 | 4.5%
9 | 47 | 4.4%

Dash Punctuation
Value | Count | Frequency (%)
- | 214 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%
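
The single Space Separator character, together with a minimum length of 12 and a maximum of 13, suggests that one phone number carries a stray space. The sketch below shows one way such values could be flagged and cleaned. The helper names and the 3-3-4 digit grouping are assumptions based on the sample rows, and the padded value in the usage lines is hypothetical rather than taken from the dataset.

```python
import re

# Assumed format: three digits, three digits, four digits, dash-separated,
# as in the sample rows (for example "051-867-1497").
PHONE_PATTERN = re.compile(r"\d{3}-\d{3}-\d{4}")


def is_well_formed(value):
    """Return True if `value` matches the assumed format exactly."""
    return PHONE_PATTERN.fullmatch(value) is not None


def clean_phone(value):
    """Remove all whitespace from a phone number string."""
    return "".join(value.split())


# Hypothetical example of a value with a trailing space.
raw = "051-867-1497 "
print(is_well_formed(raw), is_well_formed(clean_phone(raw)))
```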

Most occurring scripts

Value | Count | Frequency (%)
Common | 1285 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 214 | 16.7%
5 | 203 | 15.8%
1 | 180 | 14.0%
0 | 167 | 13.0%
8 | 134 | 10.4%
6 | 97 | 7.5%
7 | 71 | 5.5%
2 | 71 | 5.5%
3 | 52 | 4.0%
4 | 48 | 3.7%
Other values (2) | 48 | 3.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1285 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 214 | 16.7%
5 | 203 | 15.8%
1 | 180 | 14.0%
0 | 167 | 13.0%
8 | 134 | 10.4%
6 | 97 | 7.5%
7 | 71 | 5.5%
2 | 71 | 5.5%
3 | 52 | 4.0%
4 | 48 | 3.7%
Other values (2) | 48 | 3.7%

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Index | 상호 | 사업장명칭 | 소재지주소(도로명) | 소재지전화
0 | 우유류판매업 | (주)한국야쿠르트연산점 | 부산광역시 연제구 중앙대로1076번길 26, 1층, 지하1층 (연산동) | 051-867-1497
1 | 우유류판매업 | 부산우유 제5대리점 | 부산광역시 연제구 고분로32번길 68 (연산동) | 051-863-2643
2 | 우유류판매업 | (주)한국야쿠르트양정점 | 부산광역시 연제구 거제천로 33 (거제동) | <NA>
3 | 우유류판매업 | (주)한국야쿠르트 연제점 | 부산광역시 연제구 안연로 20 (연산동) | 051-867-1531
4 | 우유류판매업 | 부산우유 연산8대리점 | 부산광역시 연제구 과정로225번길 15, 106호 (연산동,남산맨션) | 051-759-9052
5 | 식육판매업 | 현대식육점 | 부산광역시 연제구 고분로 24 (연산동,연일상가 19호) | 051-861-6986
6 | 우유류판매업 | 남양유업연미보급소 | 부산광역시 연제구 월드컵대로46번길 3 (연산동) | <NA>
7 | 식육판매업 | 대조축농산물 | 부산광역시 연제구 교대로54번길 20 (거제동) | 051-504-5140
8 | 식육판매업 | 연산식육점 | 부산광역시 연제구 월드컵대로3번길 21 (연산동) | <NA>
9 | 식육판매업 | 대성식육점 | 부산광역시 연제구 금련로18번길 17-1 (연산동) | <NA>

Index | 상호 | 사업장명칭 | 소재지주소(도로명) | 소재지전화
212 | 식육즉석판매가공업 | 브라더축산 | 부산광역시 연제구 신금로 7 (연산동) | <NA>
213 | 식육즉석판매가공업 | (주)일출축산 | 부산광역시 연제구 과정로 115 (연산동) | <NA>
214 | 식육즉석판매가공업 | 부경양돈M&F 거제점 | 부산광역시 연제구 거제천로87번길 17 (거제동) | 051-868-8551
215 | 식육즉석판매가공업 | 연산식육 | 부산광역시 연제구 월드컵대로3번길 32 (연산동) | <NA>
216 | 식육즉석판매가공업 | 미광유통 | 부산광역시 연제구 과정로278번길 16 (연산동) | <NA>
217 | 식육즉석판매가공업 | 우리축산마트 | 부산광역시 연제구 반송로 21, 화성빌딩 (연산동) | 051-710-0245
218 | 식육즉석판매가공업 | 더드림축산센터 | 부산광역시 연제구 과정로 157 (연산동) | <NA>
219 | 식육즉석판매가공업 | 드림마트 | 부산광역시 연제구 과정로287번길 51, 2층 102호 (연산동, 시원드림타워) | <NA>
220 | 식육즉석판매가공업 | 미호축산 | 부산광역시 연제구 쌍미천로 160 (연산동) | 051-867-5567
221 | 식육즉석판매가공업 | (주) 참축산 연산동지점 | 부산광역시 연제구 세병로 19 (연산동) | <NA>