Overview

Dataset statistics

Number of variables4
Number of observations327
Missing cells145
Missing cells (%)11.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.3 KiB
Average record size in memory32.4 B

Variable types

Text4

Dataset

Description부산광역시기장군_즉석판매제조가공업_20210715
Author부산광역시 기장군
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15047916

Alerts

소재지전화번호 has 141 (43.1%) missing valuesMissing

Reproduction

Analysis started2023-12-10 16:35:43.904428
Analysis finished2023-12-10 16:35:44.704678
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct323
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-11T01:35:44.945302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length21
Mean length6.5321101
Min length2

Characters and Unicode

Total characters2136
Distinct characters417
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique319 ?
Unique (%)97.6%

Sample

1st row경주상회
2nd row송정기름집
3rd row안동참기름
4th row기장상회
5th row일광참기름상회
ValueCountFrequency (%)
주식회사 8
 
2.1%
정관점 5
 
1.3%
주)에이치앤디이 4
 
1.0%
일광점 3
 
0.8%
반찬 3
 
0.8%
부산정관점 3
 
0.8%
부대찌개 2
 
0.5%
장안부산휴게소 2
 
0.5%
열린매장 2
 
0.5%
장안울산휴게소 2
 
0.5%
Other values (350) 355
91.3%
2023-12-11T01:35:45.425249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
62
 
2.9%
44
 
2.1%
43
 
2.0%
) 39
 
1.8%
( 39
 
1.8%
36
 
1.7%
36
 
1.7%
36
 
1.7%
32
 
1.5%
31
 
1.5%
Other values (407) 1738
81.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1893
88.6%
Space Separator 62
 
2.9%
Lowercase Letter 46
 
2.2%
Close Punctuation 39
 
1.8%
Open Punctuation 39
 
1.8%
Uppercase Letter 35
 
1.6%
Decimal Number 16
 
0.7%
Other Punctuation 4
 
0.2%
Letter Number 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
2.3%
43
 
2.3%
36
 
1.9%
36
 
1.9%
36
 
1.9%
32
 
1.7%
31
 
1.6%
31
 
1.6%
31
 
1.6%
31
 
1.6%
Other values (360) 1542
81.5%
Lowercase Letter
ValueCountFrequency (%)
e 7
15.2%
o 5
10.9%
m 5
10.9%
y 4
8.7%
r 4
8.7%
a 3
 
6.5%
s 3
 
6.5%
b 3
 
6.5%
k 2
 
4.3%
u 2
 
4.3%
Other values (7) 8
17.4%
Uppercase Letter
ValueCountFrequency (%)
F 5
14.3%
E 5
14.3%
M 3
8.6%
C 3
8.6%
A 3
8.6%
D 3
8.6%
O 3
8.6%
S 2
 
5.7%
B 2
 
5.7%
G 1
 
2.9%
Other values (5) 5
14.3%
Decimal Number
ValueCountFrequency (%)
9 3
18.8%
1 3
18.8%
2 2
12.5%
4 2
12.5%
0 2
12.5%
5 2
12.5%
6 1
 
6.2%
8 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
& 3
75.0%
' 1
 
25.0%
Space Separator
ValueCountFrequency (%)
62
100.0%
Close Punctuation
ValueCountFrequency (%)
) 39
100.0%
Open Punctuation
ValueCountFrequency (%)
( 39
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1892
88.6%
Common 161
 
7.5%
Latin 82
 
3.8%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
2.3%
43
 
2.3%
36
 
1.9%
36
 
1.9%
36
 
1.9%
32
 
1.7%
31
 
1.6%
31
 
1.6%
31
 
1.6%
31
 
1.6%
Other values (359) 1541
81.4%
Latin
ValueCountFrequency (%)
e 7
 
8.5%
F 5
 
6.1%
E 5
 
6.1%
o 5
 
6.1%
m 5
 
6.1%
y 4
 
4.9%
r 4
 
4.9%
M 3
 
3.7%
a 3
 
3.7%
s 3
 
3.7%
Other values (23) 38
46.3%
Common
ValueCountFrequency (%)
62
38.5%
) 39
24.2%
( 39
24.2%
& 3
 
1.9%
9 3
 
1.9%
1 3
 
1.9%
2 2
 
1.2%
4 2
 
1.2%
0 2
 
1.2%
5 2
 
1.2%
Other values (4) 4
 
2.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1892
88.6%
ASCII 242
 
11.3%
CJK 1
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
62
25.6%
) 39
16.1%
( 39
16.1%
e 7
 
2.9%
F 5
 
2.1%
E 5
 
2.1%
o 5
 
2.1%
m 5
 
2.1%
y 4
 
1.7%
r 4
 
1.7%
Other values (36) 67
27.7%
Hangul
ValueCountFrequency (%)
44
 
2.3%
43
 
2.3%
36
 
1.9%
36
 
1.9%
36
 
1.9%
32
 
1.7%
31
 
1.6%
31
 
1.6%
31
 
1.6%
31
 
1.6%
Other values (359) 1541
81.4%
CJK
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

소재지전화번호
Text

MISSING 

Distinct180
Distinct (%)96.8%
Missing141
Missing (%)43.1%
Memory size2.7 KiB
2023-12-11T01:35:45.778129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.935484
Min length10

Characters and Unicode

Total characters2592
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique175 ?
Unique (%)94.1%

Sample

1st row051 -721 -2179
2nd row 051- 515-1796
3rd row 051- 721-2537
4th row 051- 721-1612
5th row 051- 727-0548
ValueCountFrequency (%)
051 179
35.0%
727 36
 
7.0%
728 34
 
6.6%
722 18
 
3.5%
721 16
 
3.1%
723 14
 
2.7%
724 7
 
1.4%
922 3
 
0.6%
3714 3
 
0.6%
519 3
 
0.6%
Other values (194) 199
38.9%
2023-12-11T01:35:46.341630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 372
14.4%
362
14.0%
2 300
11.6%
7 299
11.5%
1 288
11.1%
0 281
10.8%
5 278
10.7%
8 106
 
4.1%
3 95
 
3.7%
9 78
 
3.0%
Other values (2) 133
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1858
71.7%
Dash Punctuation 372
 
14.4%
Space Separator 362
 
14.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 300
16.1%
7 299
16.1%
1 288
15.5%
0 281
15.1%
5 278
15.0%
8 106
 
5.7%
3 95
 
5.1%
9 78
 
4.2%
4 74
 
4.0%
6 59
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 372
100.0%
Space Separator
ValueCountFrequency (%)
362
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2592
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 372
14.4%
362
14.0%
2 300
11.6%
7 299
11.5%
1 288
11.1%
0 281
10.8%
5 278
10.7%
8 106
 
4.1%
3 95
 
3.7%
9 78
 
3.0%
Other values (2) 133
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2592
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 372
14.4%
362
14.0%
2 300
11.6%
7 299
11.5%
1 288
11.1%
0 281
10.8%
5 278
10.7%
8 106
 
4.1%
3 95
 
3.7%
9 78
 
3.0%
Other values (2) 133
 
5.1%
Distinct314
Distinct (%)96.6%
Missing2
Missing (%)0.6%
Memory size2.7 KiB
2023-12-11T01:35:46.668931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length47
Mean length29.738462
Min length19

Characters and Unicode

Total characters9665
Distinct characters199
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique304 ?
Unique (%)93.5%

Sample

1st row부산광역시 기장군 철마면 두송길 33-5, 1층
2nd row부산광역시 기장군 정관읍 정관1로 18, 123동 B-103호 (이지 더원1차 아파트)
3rd row부산광역시 기장군 기장읍 읍내로104번길 19
4th row부산광역시 기장군 일광면 일광로 128
5th row부산광역시 기장군 일광면 기장해안로 1291
ValueCountFrequency (%)
부산광역시 325
 
15.6%
기장군 325
 
15.6%
기장읍 127
 
6.1%
정관읍 122
 
5.9%
1층 120
 
5.8%
일광면 39
 
1.9%
장안읍 30
 
1.4%
기장해안로 23
 
1.1%
정관로 14
 
0.7%
기장대로 13
 
0.6%
Other values (480) 947
45.4%
2023-12-11T01:35:47.125605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1761
18.2%
533
 
5.5%
498
 
5.2%
1 460
 
4.8%
381
 
3.9%
361
 
3.7%
343
 
3.5%
331
 
3.4%
325
 
3.4%
325
 
3.4%
Other values (189) 4347
45.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5788
59.9%
Space Separator 1761
 
18.2%
Decimal Number 1602
 
16.6%
Other Punctuation 233
 
2.4%
Close Punctuation 79
 
0.8%
Open Punctuation 79
 
0.8%
Dash Punctuation 77
 
0.8%
Uppercase Letter 42
 
0.4%
Lowercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
533
 
9.2%
498
 
8.6%
381
 
6.6%
361
 
6.2%
343
 
5.9%
331
 
5.7%
325
 
5.6%
325
 
5.6%
294
 
5.1%
268
 
4.6%
Other values (163) 2129
36.8%
Decimal Number
ValueCountFrequency (%)
1 460
28.7%
2 203
12.7%
0 171
 
10.7%
3 160
 
10.0%
4 152
 
9.5%
5 125
 
7.8%
6 99
 
6.2%
7 95
 
5.9%
8 78
 
4.9%
9 59
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
B 25
59.5%
A 8
 
19.0%
H 2
 
4.8%
C 2
 
4.8%
L 2
 
4.8%
D 1
 
2.4%
K 1
 
2.4%
P 1
 
2.4%
Lowercase Letter
ValueCountFrequency (%)
a 2
50.0%
l 1
25.0%
z 1
25.0%
Space Separator
ValueCountFrequency (%)
1761
100.0%
Other Punctuation
ValueCountFrequency (%)
, 233
100.0%
Close Punctuation
ValueCountFrequency (%)
) 79
100.0%
Open Punctuation
ValueCountFrequency (%)
( 79
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 77
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5788
59.9%
Common 3831
39.6%
Latin 46
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
533
 
9.2%
498
 
8.6%
381
 
6.6%
361
 
6.2%
343
 
5.9%
331
 
5.7%
325
 
5.6%
325
 
5.6%
294
 
5.1%
268
 
4.6%
Other values (163) 2129
36.8%
Common
ValueCountFrequency (%)
1761
46.0%
1 460
 
12.0%
, 233
 
6.1%
2 203
 
5.3%
0 171
 
4.5%
3 160
 
4.2%
4 152
 
4.0%
5 125
 
3.3%
6 99
 
2.6%
7 95
 
2.5%
Other values (5) 372
 
9.7%
Latin
ValueCountFrequency (%)
B 25
54.3%
A 8
 
17.4%
a 2
 
4.3%
H 2
 
4.3%
C 2
 
4.3%
L 2
 
4.3%
D 1
 
2.2%
K 1
 
2.2%
P 1
 
2.2%
l 1
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5788
59.9%
ASCII 3877
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1761
45.4%
1 460
 
11.9%
, 233
 
6.0%
2 203
 
5.2%
0 171
 
4.4%
3 160
 
4.1%
4 152
 
3.9%
5 125
 
3.2%
6 99
 
2.6%
7 95
 
2.5%
Other values (16) 418
 
10.8%
Hangul
ValueCountFrequency (%)
533
 
9.2%
498
 
8.6%
381
 
6.6%
361
 
6.2%
343
 
5.9%
331
 
5.7%
325
 
5.6%
325
 
5.6%
294
 
5.1%
268
 
4.6%
Other values (163) 2129
36.8%
Distinct173
Distinct (%)53.2%
Missing2
Missing (%)0.6%
Memory size2.7 KiB
2023-12-11T01:35:47.401488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length155
Median length79
Mean length19.833846
Min length4

Characters and Unicode

Total characters6446
Distinct characters153
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique133 ?
Unique (%)40.9%

Sample

1st row 식용유지류(압착식으로 착유하는 전품목), 조미식품(고추가루또는실고추), 조미식품(천연향신료)
2nd row 식용유지류(압착식으로 착유하는 전품목)
3rd row 참기름, 들기름, 고춧가루
4th row 식용유지류(압착식으로 착유하는 전품목), 조미식품(고추가루또는실고추), 조미식품(천연향신료)
5th row 식용유지류(압착식으로 착유하는 전품목)
ValueCountFrequency (%)
즉석조리식품 74
 
8.1%
즉석섭취식품 43
 
4.7%
떡류 37
 
4.0%
과자 37
 
4.0%
절임식품 29
 
3.2%
양념젓갈 28
 
3.1%
액상차 28
 
3.1%
조림류 25
 
2.7%
김치 24
 
2.6%
빵류 23
 
2.5%
Other values (109) 569
62.1%
2023-12-11T01:35:47.899595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1766
27.4%
, 527
 
8.2%
341
 
5.3%
276
 
4.3%
221
 
3.4%
170
 
2.6%
143
 
2.2%
127
 
2.0%
117
 
1.8%
117
 
1.8%
Other values (143) 2641
41.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3973
61.6%
Space Separator 1766
27.4%
Other Punctuation 547
 
8.5%
Close Punctuation 80
 
1.2%
Open Punctuation 80
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
341
 
8.6%
276
 
6.9%
221
 
5.6%
170
 
4.3%
143
 
3.6%
127
 
3.2%
117
 
2.9%
117
 
2.9%
85
 
2.1%
83
 
2.1%
Other values (137) 2293
57.7%
Other Punctuation
ValueCountFrequency (%)
, 527
96.3%
. 17
 
3.1%
· 3
 
0.5%
Space Separator
ValueCountFrequency (%)
1766
100.0%
Close Punctuation
ValueCountFrequency (%)
) 80
100.0%
Open Punctuation
ValueCountFrequency (%)
( 80
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3973
61.6%
Common 2473
38.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
341
 
8.6%
276
 
6.9%
221
 
5.6%
170
 
4.3%
143
 
3.6%
127
 
3.2%
117
 
2.9%
117
 
2.9%
85
 
2.1%
83
 
2.1%
Other values (137) 2293
57.7%
Common
ValueCountFrequency (%)
1766
71.4%
, 527
 
21.3%
) 80
 
3.2%
( 80
 
3.2%
. 17
 
0.7%
· 3
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3973
61.6%
ASCII 2470
38.3%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1766
71.5%
, 527
 
21.3%
) 80
 
3.2%
( 80
 
3.2%
. 17
 
0.7%
Hangul
ValueCountFrequency (%)
341
 
8.6%
276
 
6.9%
221
 
5.6%
170
 
4.3%
143
 
3.6%
127
 
3.2%
117
 
2.9%
117
 
2.9%
85
 
2.1%
83
 
2.1%
Other values (137) 2293
57.7%
None
ValueCountFrequency (%)
· 3
100.0%

Missing values

2023-12-11T01:35:44.391741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:35:44.502375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T01:35:44.629232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업소명소재지전화번호소재지(도로명)식품의종류
0경주상회051 -721 -2179<NA>식용유지류(압착식으로 착유하는 전품목), 조미식품(고추가루또는실고추), 조미식품(천연향신료)
1송정기름집<NA>부산광역시 기장군 철마면 두송길 33-5, 1층식용유지류(압착식으로 착유하는 전품목)
2안동참기름051- 515-1796부산광역시 기장군 정관읍 정관1로 18, 123동 B-103호 (이지 더원1차 아파트)참기름, 들기름, 고춧가루
3기장상회051- 721-2537부산광역시 기장군 기장읍 읍내로104번길 19식용유지류(압착식으로 착유하는 전품목), 조미식품(고추가루또는실고추), 조미식품(천연향신료)
4일광참기름상회<NA>부산광역시 기장군 일광면 일광로 128식용유지류(압착식으로 착유하는 전품목)
5안동기름집<NA>부산광역시 기장군 일광면 기장해안로 1291식용유지류(압착식으로 착유하는 전품목)
6하서떡방앗간051- 721-1612부산광역시 기장군 기장읍 차성남로65번길 4과자류(떡류)
7칠암제분업051- 727-0548부산광역시 기장군 일광면 일광로 646-1과자류(떡류)
8송정떡방앗간051 -508 -4422부산광역시 기장군 철마면 여락송정로 334-16, 1층과자류(떡류)
9풍년상회051 -721 -2022부산광역시 기장군 기장읍 차성로287번길 10, 1층식용유지류(압착식으로 착유하는 전품목)
업소명소재지전화번호소재지(도로명)식품의종류
317우리동네호두과자(기장지역자활센터)<NA>부산광역시 기장군 일광면 해송1로 33, 상가동 108호 (일광 신도시 비스타 동원 2차)빵류
318마마스낵051 -723 -1222부산광역시 기장군 기장읍 기장대로 82-8, 302동 102호 (삼정그린코아 더베스트)소스, 즉석조리식품
319짠지<NA>부산광역시 기장군 정관읍 방곡4로 30, 1층즉석조리식품
320식사준비 부산정관점051 -728 -0992부산광역시 기장군 정관읍 정관로 545, 502동 102호 (이진 캐스빌 아파트)즉석조리식품
321헤스터<NA>부산광역시 기장군 일광면 일광로 468, 1층커피
322주식회사 주경에프엔비<NA>부산광역시 기장군 일광면 기장대로 673, 메가마트 1층과자, 빵류, 기타가공품
323란희맛있는 반찬051 -728 -7122부산광역시 기장군 정관읍 정관5로 12, 227동 B109호 (동원로얄듀크2차아파트)김치, 절임식품, 조림류, 젓갈, 즉석섭취식품, 즉석조리식품
324주식회사 현승에프앤디<NA>부산광역시 기장군 기장읍 대청로71번길 5, 탑마트기장서부점어묵
325주식회사 현승에프앤디<NA>부산광역시 기장군 기장읍 읍내로 49, 탑마트 기장서부점과자, 빵류, 기타엿, 절임식품, 곡류가공품, 식육함유가공품, 어묵, 기타 어육가공품, 조미건어포
326(주)미트벨리<NA>부산광역시 기장군 기장읍 기장해안로 147, 롯데몰 동부산점양념육