Overview

Dataset statistics

Number of variables7
Number of observations4948
Missing cells2711
Missing cells (%)7.8%
Duplicate rows880
Duplicate rows (%)17.8%
Total size in memory270.7 KiB
Average record size in memory56.0 B

Variable types

DateTime1
Categorical1
Text5

Dataset

Description부산광역시 동래구에 소재한 식품위생업소 현황에 대한 데이터로 업종명, 업소명, 소재지(도로명), 소재지(지번), 우편번호(도로명), 우편번호(지번) 등에 대한 항목을 제공합니다.
Author부산광역시 동래구
URLhttps://www.data.go.kr/data/15026505/fileData.do

Alerts

Dataset has 880 (17.8%) duplicate rowsDuplicates
업종명 is highly imbalanced (65.8%)Imbalance
소재지전화 has 2696 (54.5%) missing valuesMissing

Reproduction

Analysis started2023-12-23 07:48:57.910508
Analysis finished2023-12-23 07:49:08.764684
Duration10.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct2813
Distinct (%)56.9%
Missing0
Missing (%)0.0%
Memory size38.8 KiB
Minimum1965-10-02 00:00:00
Maximum2023-12-15 00:00:00
2023-12-23T07:49:09.467264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-23T07:49:10.720950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

업종명
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size38.8 KiB
일반음식점
4069 
휴게음식점
682 
유흥주점영업
 
77
단란주점
 
54
제과점영업
 
53

Length

Max length6
Median length5
Mean length5.0072757
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row휴게음식점
2nd row휴게음식점
3rd row휴게음식점
4th row휴게음식점
5th row휴게음식점

Common Values

ValueCountFrequency (%)
일반음식점 4069
82.2%
휴게음식점 682
 
13.8%
유흥주점영업 77
 
1.6%
단란주점 54
 
1.1%
제과점영업 53
 
1.1%
위탁급식영업 13
 
0.3%

Length

2023-12-23T07:49:11.771421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-23T07:49:12.512910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 4069
82.2%
휴게음식점 682
 
13.8%
유흥주점영업 77
 
1.6%
단란주점 54
 
1.1%
제과점영업 53
 
1.1%
위탁급식영업 13
 
0.3%
Distinct3927
Distinct (%)79.4%
Missing0
Missing (%)0.0%
Memory size38.8 KiB
2023-12-23T07:49:13.614617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length27
Mean length6.5299111
Min length1

Characters and Unicode

Total characters32310
Distinct characters928
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2997 ?
Unique (%)60.6%

Sample

1st row레드애플 부산아시아드점
2nd row슬랫
3rd row셔플
4th row샌드위치 박스
5th row강화당
ValueCountFrequency (%)
동래점 134
 
2.1%
사직점 68
 
1.1%
안락점 50
 
0.8%
부산사직점 41
 
0.6%
온천점 29
 
0.5%
부산동래점 27
 
0.4%
명륜점 26
 
0.4%
세븐일레븐 24
 
0.4%
온천장점 18
 
0.3%
coffee 17
 
0.3%
Other values (4218) 5889
93.1%
2023-12-23T07:49:15.404865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1379
 
4.3%
1160
 
3.6%
650
 
2.0%
561
 
1.7%
556
 
1.7%
410
 
1.3%
400
 
1.2%
( 384
 
1.2%
) 384
 
1.2%
379
 
1.2%
Other values (918) 26047
80.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27921
86.4%
Space Separator 1379
 
4.3%
Uppercase Letter 910
 
2.8%
Lowercase Letter 708
 
2.2%
Decimal Number 498
 
1.5%
Open Punctuation 384
 
1.2%
Close Punctuation 384
 
1.2%
Other Punctuation 115
 
0.4%
Dash Punctuation 9
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1160
 
4.2%
650
 
2.3%
561
 
2.0%
556
 
2.0%
410
 
1.5%
400
 
1.4%
379
 
1.4%
370
 
1.3%
358
 
1.3%
344
 
1.2%
Other values (844) 22733
81.4%
Uppercase Letter
ValueCountFrequency (%)
E 82
 
9.0%
O 78
 
8.6%
C 73
 
8.0%
A 65
 
7.1%
S 53
 
5.8%
B 47
 
5.2%
G 46
 
5.1%
N 42
 
4.6%
T 41
 
4.5%
F 41
 
4.5%
Other values (16) 342
37.6%
Lowercase Letter
ValueCountFrequency (%)
e 118
16.7%
o 76
10.7%
a 66
 
9.3%
i 49
 
6.9%
s 40
 
5.6%
n 40
 
5.6%
r 38
 
5.4%
t 37
 
5.2%
f 34
 
4.8%
c 34
 
4.8%
Other values (13) 176
24.9%
Decimal Number
ValueCountFrequency (%)
2 92
18.5%
1 79
15.9%
0 53
10.6%
3 52
10.4%
5 49
9.8%
9 49
9.8%
4 38
7.6%
8 32
 
6.4%
7 28
 
5.6%
6 26
 
5.2%
Other Punctuation
ValueCountFrequency (%)
& 51
44.3%
. 20
 
17.4%
, 20
 
17.4%
· 9
 
7.8%
' 9
 
7.8%
/ 2
 
1.7%
# 2
 
1.7%
: 1
 
0.9%
! 1
 
0.9%
Space Separator
ValueCountFrequency (%)
1379
100.0%
Open Punctuation
ValueCountFrequency (%)
( 384
100.0%
Close Punctuation
ValueCountFrequency (%)
) 384
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27887
86.3%
Common 2770
 
8.6%
Latin 1619
 
5.0%
Han 34
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1160
 
4.2%
650
 
2.3%
561
 
2.0%
556
 
2.0%
410
 
1.5%
400
 
1.4%
379
 
1.4%
370
 
1.3%
358
 
1.3%
344
 
1.2%
Other values (820) 22699
81.4%
Latin
ValueCountFrequency (%)
e 118
 
7.3%
E 82
 
5.1%
O 78
 
4.8%
o 76
 
4.7%
C 73
 
4.5%
a 66
 
4.1%
A 65
 
4.0%
S 53
 
3.3%
i 49
 
3.0%
B 47
 
2.9%
Other values (40) 912
56.3%
Common
ValueCountFrequency (%)
1379
49.8%
( 384
 
13.9%
) 384
 
13.9%
2 92
 
3.3%
1 79
 
2.9%
0 53
 
1.9%
3 52
 
1.9%
& 51
 
1.8%
5 49
 
1.8%
9 49
 
1.8%
Other values (14) 198
 
7.1%
Han
ValueCountFrequency (%)
4
 
11.8%
3
 
8.8%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (14) 14
41.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27886
86.3%
ASCII 4379
 
13.6%
CJK 30
 
0.1%
None 9
 
< 0.1%
CJK Compat Ideographs 4
 
< 0.1%
Number Forms 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1379
31.5%
( 384
 
8.8%
) 384
 
8.8%
e 118
 
2.7%
2 92
 
2.1%
E 82
 
1.9%
1 79
 
1.8%
O 78
 
1.8%
o 76
 
1.7%
C 73
 
1.7%
Other values (62) 1634
37.3%
Hangul
ValueCountFrequency (%)
1160
 
4.2%
650
 
2.3%
561
 
2.0%
556
 
2.0%
410
 
1.5%
400
 
1.4%
379
 
1.4%
370
 
1.3%
358
 
1.3%
344
 
1.2%
Other values (819) 22698
81.4%
None
ValueCountFrequency (%)
· 9
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
4
100.0%
CJK
ValueCountFrequency (%)
3
 
10.0%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (13) 13
43.3%
Number Forms
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct3575
Distinct (%)72.4%
Missing11
Missing (%)0.2%
Memory size38.8 KiB
2023-12-23T07:49:16.461277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length60
Mean length29.87543
Min length21

Characters and Unicode

Total characters147495
Distinct characters321
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2565 ?
Unique (%)52.0%

Sample

1st row부산광역시 동래구 종합운동장로40번길 8, 103동 1층 101호 (사직동, 부산 아시아드 코오롱 하늘채 아파트)
2nd row부산광역시 동래구 충렬대로200번길 26, 1층 (명륜동)
3rd row부산광역시 동래구 충렬대로 134-1, 제일빌딩 1층 (온천동)
4th row부산광역시 동래구 미남로 87, 1층 일부호 (온천동)
5th row부산광역시 동래구 석사북로 14-1, 1층 (사직동)
ValueCountFrequency (%)
동래구 4938
17.4%
부산광역시 4937
17.4%
1층 1441
 
5.1%
온천동 1344
 
4.7%
안락동 912
 
3.2%
사직동 911
 
3.2%
명륜동 822
 
2.9%
명장동 317
 
1.1%
수안동 316
 
1.1%
2층 234
 
0.8%
Other values (1879) 12143
42.9%
2023-12-23T07:49:18.995440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23379
 
15.9%
10524
 
7.1%
1 6894
 
4.7%
5395
 
3.7%
5279
 
3.6%
5007
 
3.4%
( 4993
 
3.4%
) 4993
 
3.4%
4959
 
3.4%
4957
 
3.4%
Other values (311) 71115
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 85982
58.3%
Decimal Number 23861
 
16.2%
Space Separator 23379
 
15.9%
Open Punctuation 4993
 
3.4%
Close Punctuation 4993
 
3.4%
Other Punctuation 2761
 
1.9%
Dash Punctuation 1063
 
0.7%
Uppercase Letter 417
 
0.3%
Math Symbol 34
 
< 0.1%
Lowercase Letter 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10524
 
12.2%
5395
 
6.3%
5279
 
6.1%
5007
 
5.8%
4959
 
5.8%
4957
 
5.8%
4954
 
5.8%
4944
 
5.8%
4840
 
5.6%
3089
 
3.6%
Other values (272) 32034
37.3%
Uppercase Letter
ValueCountFrequency (%)
B 79
18.9%
K 65
15.6%
S 60
14.4%
A 51
12.2%
W 21
 
5.0%
I 21
 
5.0%
V 21
 
5.0%
E 21
 
5.0%
C 20
 
4.8%
H 18
 
4.3%
Other values (7) 40
9.6%
Decimal Number
ValueCountFrequency (%)
1 6894
28.9%
2 3301
13.8%
3 2612
 
10.9%
4 1933
 
8.1%
5 1716
 
7.2%
9 1565
 
6.6%
7 1548
 
6.5%
0 1511
 
6.3%
8 1429
 
6.0%
6 1352
 
5.7%
Lowercase Letter
ValueCountFrequency (%)
e 9
75.0%
k 1
 
8.3%
s 1
 
8.3%
c 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
, 2757
99.9%
. 2
 
0.1%
@ 2
 
0.1%
Space Separator
ValueCountFrequency (%)
23379
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4993
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4993
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1063
100.0%
Math Symbol
ValueCountFrequency (%)
~ 34
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 85982
58.3%
Common 61084
41.4%
Latin 429
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10524
 
12.2%
5395
 
6.3%
5279
 
6.1%
5007
 
5.8%
4959
 
5.8%
4957
 
5.8%
4954
 
5.8%
4944
 
5.8%
4840
 
5.6%
3089
 
3.6%
Other values (272) 32034
37.3%
Latin
ValueCountFrequency (%)
B 79
18.4%
K 65
15.2%
S 60
14.0%
A 51
11.9%
W 21
 
4.9%
I 21
 
4.9%
V 21
 
4.9%
E 21
 
4.9%
C 20
 
4.7%
H 18
 
4.2%
Other values (11) 52
12.1%
Common
ValueCountFrequency (%)
23379
38.3%
1 6894
 
11.3%
( 4993
 
8.2%
) 4993
 
8.2%
2 3301
 
5.4%
, 2757
 
4.5%
3 2612
 
4.3%
4 1933
 
3.2%
5 1716
 
2.8%
9 1565
 
2.6%
Other values (8) 6941
 
11.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 85982
58.3%
ASCII 61513
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23379
38.0%
1 6894
 
11.2%
( 4993
 
8.1%
) 4993
 
8.1%
2 3301
 
5.4%
, 2757
 
4.5%
3 2612
 
4.2%
4 1933
 
3.1%
5 1716
 
2.8%
9 1565
 
2.5%
Other values (29) 7370
 
12.0%
Hangul
ValueCountFrequency (%)
10524
 
12.2%
5395
 
6.3%
5279
 
6.1%
5007
 
5.8%
4959
 
5.8%
4957
 
5.8%
4954
 
5.8%
4944
 
5.8%
4840
 
5.6%
3089
 
3.6%
Other values (272) 32034
37.3%
Distinct3018
Distinct (%)61.0%
Missing2
Missing (%)< 0.1%
Memory size38.8 KiB
2023-12-23T07:49:20.440092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length44
Mean length21.704408
Min length16

Characters and Unicode

Total characters107350
Distinct characters303
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1834 ?
Unique (%)37.1%

Sample

1st row부산광역시 동래구 사직동 1056 부산 아시아드 코오롱 하늘채 아파트
2nd row부산광역시 동래구 명륜동 512-14 1층일부
3rd row부산광역시 동래구 온천동 1442-8 제일빌딩
4th row부산광역시 동래구 온천동 1377-35
5th row부산광역시 동래구 사직동 24-11
ValueCountFrequency (%)
동래구 4947
23.5%
부산광역시 4946
23.5%
온천동 1392
 
6.6%
사직동 928
 
4.4%
안락동 919
 
4.4%
명륜동 872
 
4.1%
수안동 326
 
1.6%
명장동 321
 
1.5%
복천동 126
 
0.6%
낙민동 82
 
0.4%
Other values (3137) 6156
29.3%
2023-12-23T07:49:22.617841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20947
19.5%
10166
 
9.5%
5121
 
4.8%
5039
 
4.7%
4971
 
4.6%
4966
 
4.6%
4966
 
4.6%
4960
 
4.6%
4953
 
4.6%
- 4716
 
4.4%
Other values (293) 36545
34.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 58399
54.4%
Decimal Number 22772
 
21.2%
Space Separator 20947
 
19.5%
Dash Punctuation 4716
 
4.4%
Uppercase Letter 318
 
0.3%
Open Punctuation 63
 
0.1%
Close Punctuation 63
 
0.1%
Other Punctuation 56
 
0.1%
Lowercase Letter 11
 
< 0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10166
17.4%
5121
8.8%
5039
8.6%
4971
8.5%
4966
8.5%
4966
8.5%
4960
8.5%
4953
8.5%
1556
 
2.7%
1421
 
2.4%
Other values (256) 10280
17.6%
Uppercase Letter
ValueCountFrequency (%)
K 68
21.4%
S 63
19.8%
B 27
 
8.5%
V 21
 
6.6%
E 21
 
6.6%
W 21
 
6.6%
I 21
 
6.6%
H 19
 
6.0%
U 17
 
5.3%
Y 17
 
5.3%
Other values (6) 23
 
7.2%
Decimal Number
ValueCountFrequency (%)
1 4327
19.0%
4 2943
12.9%
2 2901
12.7%
5 2559
11.2%
3 2513
11.0%
6 1715
 
7.5%
7 1637
 
7.2%
9 1477
 
6.5%
0 1442
 
6.3%
8 1258
 
5.5%
Other Punctuation
ValueCountFrequency (%)
, 51
91.1%
. 3
 
5.4%
@ 2
 
3.6%
Lowercase Letter
ValueCountFrequency (%)
e 9
81.8%
k 1
 
9.1%
s 1
 
9.1%
Space Separator
ValueCountFrequency (%)
20947
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4716
100.0%
Open Punctuation
ValueCountFrequency (%)
( 63
100.0%
Close Punctuation
ValueCountFrequency (%)
) 63
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 58399
54.4%
Common 48622
45.3%
Latin 329
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10166
17.4%
5121
8.8%
5039
8.6%
4971
8.5%
4966
8.5%
4966
8.5%
4960
8.5%
4953
8.5%
1556
 
2.7%
1421
 
2.4%
Other values (256) 10280
17.6%
Latin
ValueCountFrequency (%)
K 68
20.7%
S 63
19.1%
B 27
 
8.2%
V 21
 
6.4%
E 21
 
6.4%
W 21
 
6.4%
I 21
 
6.4%
H 19
 
5.8%
U 17
 
5.2%
Y 17
 
5.2%
Other values (9) 34
10.3%
Common
ValueCountFrequency (%)
20947
43.1%
- 4716
 
9.7%
1 4327
 
8.9%
4 2943
 
6.1%
2 2901
 
6.0%
5 2559
 
5.3%
3 2513
 
5.2%
6 1715
 
3.5%
7 1637
 
3.4%
9 1477
 
3.0%
Other values (8) 2887
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 58399
54.4%
ASCII 48951
45.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
20947
42.8%
- 4716
 
9.6%
1 4327
 
8.8%
4 2943
 
6.0%
2 2901
 
5.9%
5 2559
 
5.2%
3 2513
 
5.1%
6 1715
 
3.5%
7 1637
 
3.3%
9 1477
 
3.0%
Other values (27) 3216
 
6.6%
Hangul
ValueCountFrequency (%)
10166
17.4%
5121
8.8%
5039
8.6%
4971
8.5%
4966
8.5%
4966
8.5%
4960
8.5%
4953
8.5%
1556
 
2.7%
1421
 
2.4%
Other values (256) 10280
17.6%

소재지전화
Text

MISSING 

Distinct1692
Distinct (%)75.1%
Missing2696
Missing (%)54.5%
Memory size38.8 KiB
2023-12-23T07:49:23.997527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length14.011989
Min length12

Characters and Unicode

Total characters31555
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1157 ?
Unique (%)51.4%

Sample

1st row051 -558 -0077
2nd row 051- 522-4853
3rd row 051- 555-0080
4th row051 -715 -6688
5th row051 -831 -1622
ValueCountFrequency (%)
051 2168
39.2%
555 99
 
1.8%
552 77
 
1.4%
554 63
 
1.1%
553 55
 
1.0%
556 54
 
1.0%
557 52
 
0.9%
503 49
 
0.9%
558 45
 
0.8%
525 40
 
0.7%
Other values (1721) 2825
51.1%
2023-12-23T07:49:26.269957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 6834
21.7%
4531
14.4%
- 4504
14.3%
0 3913
12.4%
1 3240
10.3%
2 1924
 
6.1%
3 1344
 
4.3%
8 1146
 
3.6%
7 1081
 
3.4%
4 1013
 
3.2%
Other values (2) 2025
 
6.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 22520
71.4%
Space Separator 4531
 
14.4%
Dash Punctuation 4504
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 6834
30.3%
0 3913
17.4%
1 3240
14.4%
2 1924
 
8.5%
3 1344
 
6.0%
8 1146
 
5.1%
7 1081
 
4.8%
4 1013
 
4.5%
9 1013
 
4.5%
6 1012
 
4.5%
Space Separator
ValueCountFrequency (%)
4531
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4504
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 31555
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 6834
21.7%
4531
14.4%
- 4504
14.3%
0 3913
12.4%
1 3240
10.3%
2 1924
 
6.1%
3 1344
 
4.3%
8 1146
 
3.6%
7 1081
 
3.4%
4 1013
 
3.2%
Other values (2) 2025
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 31555
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 6834
21.7%
4531
14.4%
- 4504
14.3%
0 3913
12.4%
1 3240
10.3%
2 1924
 
6.1%
3 1344
 
4.3%
8 1146
 
3.6%
7 1081
 
3.4%
4 1013
 
3.2%
Other values (2) 2025
 
6.4%
Distinct70
Distinct (%)1.4%
Missing2
Missing (%)< 0.1%
Memory size38.8 KiB
2023-12-23T07:49:27.175102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters34622
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)0.1%

Sample

1st row607-120
2nd row607-804
3rd row607-836
4th row607-840
5th row607-120
ValueCountFrequency (%)
607-804 691
 
14.0%
607-815 335
 
6.8%
607-833 253
 
5.1%
607-831 227
 
4.6%
607-824 184
 
3.7%
607-817 174
 
3.5%
607-826 173
 
3.5%
607-835 169
 
3.4%
607-828 160
 
3.2%
607-830 158
 
3.2%
Other values (60) 2422
49.0%
2023-12-23T07:49:28.664546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6780
19.6%
7 5534
16.0%
6 5305
15.3%
8 5018
14.5%
- 4946
14.3%
3 1709
 
4.9%
2 1601
 
4.6%
1 1426
 
4.1%
4 1374
 
4.0%
5 648
 
1.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 29676
85.7%
Dash Punctuation 4946
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6780
22.8%
7 5534
18.6%
6 5305
17.9%
8 5018
16.9%
3 1709
 
5.8%
2 1601
 
5.4%
1 1426
 
4.8%
4 1374
 
4.6%
5 648
 
2.2%
9 281
 
0.9%
Dash Punctuation
ValueCountFrequency (%)
- 4946
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 34622
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6780
19.6%
7 5534
16.0%
6 5305
15.3%
8 5018
14.5%
- 4946
14.3%
3 1709
 
4.9%
2 1601
 
4.6%
1 1426
 
4.1%
4 1374
 
4.0%
5 648
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 34622
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6780
19.6%
7 5534
16.0%
6 5305
15.3%
8 5018
14.5%
- 4946
14.3%
3 1709
 
4.9%
2 1601
 
4.6%
1 1426
 
4.1%
4 1374
 
4.0%
5 648
 
1.9%

Correlations

2023-12-23T07:49:29.268091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명우편번호(지번)
업종명1.0000.358
우편번호(지번)0.3581.000

Missing values

2023-12-23T07:49:07.128833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-23T07:49:07.894104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-23T07:49:08.444521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

신고일자업종명업소명소재지(도로명)소재지(지번)소재지전화우편번호(지번)
02023-12-12휴게음식점레드애플 부산아시아드점부산광역시 동래구 종합운동장로40번길 8, 103동 1층 101호 (사직동, 부산 아시아드 코오롱 하늘채 아파트)부산광역시 동래구 사직동 1056 부산 아시아드 코오롱 하늘채 아파트<NA>607-120
12023-12-05휴게음식점슬랫부산광역시 동래구 충렬대로200번길 26, 1층 (명륜동)부산광역시 동래구 명륜동 512-14 1층일부<NA>607-804
22023-12-05휴게음식점셔플부산광역시 동래구 충렬대로 134-1, 제일빌딩 1층 (온천동)부산광역시 동래구 온천동 1442-8 제일빌딩<NA>607-836
32023-12-04휴게음식점샌드위치 박스부산광역시 동래구 미남로 87, 1층 일부호 (온천동)부산광역시 동래구 온천동 1377-35<NA>607-840
42023-11-27휴게음식점강화당부산광역시 동래구 석사북로 14-1, 1층 (사직동)부산광역시 동래구 사직동 24-11<NA>607-120
52023-11-24휴게음식점경아분식부산광역시 동래구 온천천로339번길 51, 202동 1층 14호 (낙민동, 포레나 동래)부산광역시 동래구 낙민동 326 포레나 동래 202동 14호<NA>607-040
62023-11-24휴게음식점놀러갈게 온천천부산광역시 동래구 연안로59번길 28, 1층 (안락동)부산광역시 동래구 안락동 144-9<NA>607-825
72023-11-22휴게음식점리버(River)630부산광역시 동래구 삼어로 1, 1층 (안락동)부산광역시 동래구 안락동 471-48<NA>607-826
82023-11-21휴게음식점백억커피 동래점부산광역시 동래구 중앙대로1367번길 53, 세현상가B동 1층 108호 (온천동)부산광역시 동래구 온천동 707-5<NA>607-835
92023-11-17휴게음식점123카페부산광역시 동래구 여고로 123, 1층 (사직동)부산광역시 동래구 사직동 158-51<NA>607-813
신고일자업종명업소명소재지(도로명)소재지(지번)소재지전화우편번호(지번)
49381996-07-26단란주점블랙부산광역시 동래구 명륜로 127 (명륜동)부산광역시 동래구 명륜동 501-3051- 554-9383607-804
49391996-07-15단란주점샤넬노래방부산광역시 동래구 반송로 234 (안락동)부산광역시 동래구 안락동 426-39<NA>607-827
49401996-06-28단란주점불꽃원탁가라오케부산광역시 동래구 반송로250번길 18 (안락동)부산광역시 동래구 안락동 947-15051- 526-1922607-827
49411995-11-22단란주점빙고부산광역시 동래구 아시아드대로161번길 86 (사직동)부산광역시 동래구 사직동 77-26051- 504-1710607-817
49421995-11-02단란주점미라클 라이브 단란주점부산광역시 동래구 금강로131번길 3 (온천동)부산광역시 동래구 온천동 135-21051- 552-3110607-832
49431995-04-17단란주점부산광역시 동래구 반송로 236 (안락동)부산광역시 동래구 안락동 426-35051- 521-3590607-827
49441995-04-12단란주점복면가왕부산광역시 동래구 충렬대로 396 (안락동)부산광역시 동래구 안락동 436-51051- 528-4989607-829
49451995-01-07단란주점그녀S노래방부산광역시 동래구 반송로 220 (안락동)부산광역시 동래구 안락동 425-45051- 528-8423607-827
49461994-04-20단란주점리치부산광역시 동래구 명륜로129번길 3 (명륜동)부산광역시 동래구 명륜동 502-8051- 552-6043607-804
49471994-03-23단란주점신세계단란주점부산광역시 동래구 사직북로13번길 7 (사직동)부산광역시 동래구 사직동 91-6051- 502-0842607-815

Duplicate rows

Most frequently occurring

신고일자업종명업소명소재지(도로명)소재지(지번)소재지전화우편번호(지번)# duplicates
01977-06-25일반음식점원조조방낙지부산광역시 동래구 명륜로94번길 37 (명륜동)부산광역시 동래구 명륜동 400-1051- 555-7763607-8042
11978-10-08일반음식점차밭골부산광역시 동래구 미남로 137 (온천동)부산광역시 동래구 온천동 1422-25051- 503-7009607-8422
21980-10-31일반음식점박가네부산광역시 동래구 금정마을로 105 (온천동)부산광역시 동래구 온천동 1123-1051- 558-5679607-8382
31981-02-19일반음식점마지막샤부부산광역시 동래구 사직북로 17 (사직동)부산광역시 동래구 사직동 78-8<NA>607-8152
41983-06-13일반음식점엄용백 낙돈부산광역시 동래구 명륜로117번길 33, 1,2층 (명륜동)부산광역시 동래구 명륜동 450-4<NA>607-8042
51984-02-15일반음식점MOON(문)부산광역시 동래구 사직북로19번길 6 (사직동)부산광역시 동래구 사직동 46-8051- 501-3332607-8152
61985-03-25일반음식점새 청하루부산광역시 동래구 동래로 69 (명륜동)부산광역시 동래구 명륜동 680-8 기존 지번(명륜동 680-5)과 합병<NA>607-8062
71985-04-08일반음식점족보막걸리부산광역시 동래구 사직북로57번길 56 (사직동)부산광역시 동래구 사직동 42-7051- 506-4852607-8152
81986-12-15일반음식점잇츠비건(IT'S VEGAN)부산광역시 동래구 명륜로94번길 21 (명륜동)부산광역시 동래구 명륜동 401-10<NA>607-8042
91987-04-30일반음식점라무진부산광역시 동래구 삼성대길 34-20 (명륜동)부산광역시 동래구 명륜동 502-9051- 558-5550607-8042