Overview

Dataset statistics

Number of variables4
Number of observations438
Missing cells325
Missing cells (%)18.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.8 KiB
Average record size in memory32.3 B

Variable types

Categorical1
Text3

Dataset

Description수영구에 영업신고한 건강기능식품 일반판매업 및 유통전문판매업의 업소명, 소재지(도로명주소), 전화번호를 포함하고 있습니다.
Author부산광역시 수영구
URLhttps://www.data.go.kr/data/3046167/fileData.do

Alerts

업종명 is highly imbalanced (83.1%)Imbalance
소재지전화 has 325 (74.2%) missing valuesMissing

Reproduction

Analysis started2023-12-12 05:23:59.173169
Analysis finished2023-12-12 05:23:59.707268
Duration0.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
건강기능식품일반판매업
427 
건강기능식품유통전문판매업
 
11

Length

Max length13
Median length11
Mean length11.050228
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건강기능식품일반판매업
2nd row건강기능식품일반판매업
3rd row건강기능식품일반판매업
4th row건강기능식품일반판매업
5th row건강기능식품일반판매업

Common Values

ValueCountFrequency (%)
건강기능식품일반판매업 427
97.5%
건강기능식품유통전문판매업 11
 
2.5%

Length

2023-12-12T14:23:59.791203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:23:59.910672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건강기능식품일반판매업 427
97.5%
건강기능식품유통전문판매업 11
 
2.5%
Distinct435
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
2023-12-12T14:24:00.224548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length6.4497717
Min length2

Characters and Unicode

Total characters2825
Distinct characters463
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique433 ?
Unique (%)98.9%

Sample

1st row아모레 새수영특약점
2nd row아모레화장품수연점
3rd row정관장 수영역점
4th row자산의원
5th row금오약품
ValueCountFrequency (%)
주식회사 10
 
1.9%
세븐일레븐 5
 
0.9%
씨제이올리브영(주 5
 
0.9%
수영점 5
 
0.9%
남천점 4
 
0.7%
제이컴퍼니 3
 
0.6%
망미점 3
 
0.6%
유니베라 3
 
0.6%
인셀덤 3
 
0.6%
인터내셔널 2
 
0.4%
Other values (483) 491
91.9%
2023-12-12T14:24:00.683062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
98
 
3.5%
96
 
3.4%
81
 
2.9%
( 60
 
2.1%
) 60
 
2.1%
57
 
2.0%
54
 
1.9%
46
 
1.6%
45
 
1.6%
43
 
1.5%
Other values (453) 2185
77.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2417
85.6%
Space Separator 96
 
3.4%
Lowercase Letter 89
 
3.2%
Uppercase Letter 76
 
2.7%
Open Punctuation 60
 
2.1%
Close Punctuation 60
 
2.1%
Decimal Number 20
 
0.7%
Dash Punctuation 3
 
0.1%
Other Punctuation 2
 
0.1%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
98
 
4.1%
81
 
3.4%
57
 
2.4%
54
 
2.2%
46
 
1.9%
45
 
1.9%
43
 
1.8%
33
 
1.4%
32
 
1.3%
29
 
1.2%
Other values (398) 1899
78.6%
Uppercase Letter
ValueCountFrequency (%)
C 7
 
9.2%
S 6
 
7.9%
I 5
 
6.6%
N 5
 
6.6%
H 5
 
6.6%
J 5
 
6.6%
E 5
 
6.6%
T 4
 
5.3%
K 4
 
5.3%
L 3
 
3.9%
Other values (12) 27
35.5%
Lowercase Letter
ValueCountFrequency (%)
a 17
19.1%
o 16
18.0%
e 8
9.0%
m 6
 
6.7%
l 6
 
6.7%
k 5
 
5.6%
r 5
 
5.6%
c 5
 
5.6%
t 4
 
4.5%
i 4
 
4.5%
Other values (8) 13
14.6%
Decimal Number
ValueCountFrequency (%)
3 6
30.0%
1 5
25.0%
2 4
20.0%
5 2
 
10.0%
8 1
 
5.0%
0 1
 
5.0%
7 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
' 1
50.0%
& 1
50.0%
Math Symbol
ValueCountFrequency (%)
< 1
50.0%
> 1
50.0%
Space Separator
ValueCountFrequency (%)
96
100.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Close Punctuation
ValueCountFrequency (%)
) 60
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2417
85.6%
Common 243
 
8.6%
Latin 165
 
5.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
98
 
4.1%
81
 
3.4%
57
 
2.4%
54
 
2.2%
46
 
1.9%
45
 
1.9%
43
 
1.8%
33
 
1.4%
32
 
1.3%
29
 
1.2%
Other values (398) 1899
78.6%
Latin
ValueCountFrequency (%)
a 17
 
10.3%
o 16
 
9.7%
e 8
 
4.8%
C 7
 
4.2%
m 6
 
3.6%
l 6
 
3.6%
S 6
 
3.6%
I 5
 
3.0%
N 5
 
3.0%
H 5
 
3.0%
Other values (30) 84
50.9%
Common
ValueCountFrequency (%)
96
39.5%
( 60
24.7%
) 60
24.7%
3 6
 
2.5%
1 5
 
2.1%
2 4
 
1.6%
- 3
 
1.2%
5 2
 
0.8%
8 1
 
0.4%
' 1
 
0.4%
Other values (5) 5
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2417
85.6%
ASCII 408
 
14.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
98
 
4.1%
81
 
3.4%
57
 
2.4%
54
 
2.2%
46
 
1.9%
45
 
1.9%
43
 
1.8%
33
 
1.4%
32
 
1.3%
29
 
1.2%
Other values (398) 1899
78.6%
ASCII
ValueCountFrequency (%)
96
23.5%
( 60
14.7%
) 60
14.7%
a 17
 
4.2%
o 16
 
3.9%
e 8
 
2.0%
C 7
 
1.7%
m 6
 
1.5%
l 6
 
1.5%
3 6
 
1.5%
Other values (45) 126
30.9%
Distinct429
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
2023-12-12T14:24:01.109067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length66
Median length51
Mean length38.057078
Min length22

Characters and Unicode

Total characters16669
Distinct characters267
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique424 ?
Unique (%)96.8%

Sample

1st row부산광역시 수영구 수영로 685, 영동빌딩 9층 (광안동)
2nd row부산광역시 수영구 연수로 405 (수영동,수용빌딩 3층)
3rd row부산광역시 수영구 연수로 400 (광안동)
4th row부산광역시 수영구 광남로 223 (민락동)
5th row부산광역시 수영구 광서로 55-1, 2층 (광안동)
ValueCountFrequency (%)
부산광역시 438
 
13.7%
수영구 438
 
13.7%
광안동 154
 
4.8%
남천동 99
 
3.1%
1층 95
 
3.0%
망미동 80
 
2.5%
수영로 76
 
2.4%
민락동 69
 
2.2%
2층 47
 
1.5%
수영동 46
 
1.4%
Other values (756) 1658
51.8%
2023-12-12T14:24:01.700637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2762
 
16.6%
730
 
4.4%
1 729
 
4.4%
707
 
4.2%
671
 
4.0%
605
 
3.6%
, 589
 
3.5%
464
 
2.8%
457
 
2.7%
456
 
2.7%
Other values (257) 8499
51.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9265
55.6%
Decimal Number 3058
 
18.3%
Space Separator 2762
 
16.6%
Other Punctuation 591
 
3.5%
Open Punctuation 439
 
2.6%
Close Punctuation 439
 
2.6%
Dash Punctuation 58
 
0.3%
Uppercase Letter 36
 
0.2%
Lowercase Letter 18
 
0.1%
Letter Number 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
730
 
7.9%
707
 
7.6%
671
 
7.2%
605
 
6.5%
464
 
5.0%
457
 
4.9%
456
 
4.9%
450
 
4.9%
445
 
4.8%
442
 
4.8%
Other values (227) 3838
41.4%
Decimal Number
ValueCountFrequency (%)
1 729
23.8%
0 449
14.7%
2 399
13.0%
3 317
10.4%
4 270
 
8.8%
5 237
 
7.8%
6 216
 
7.1%
8 150
 
4.9%
7 149
 
4.9%
9 142
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
V 6
16.7%
W 5
13.9%
E 5
13.9%
I 5
13.9%
S 5
13.9%
K 5
13.9%
A 3
8.3%
B 2
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 589
99.7%
& 1
 
0.2%
. 1
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
e 16
88.9%
w 1
 
5.6%
i 1
 
5.6%
Space Separator
ValueCountFrequency (%)
2762
100.0%
Open Punctuation
ValueCountFrequency (%)
( 439
100.0%
Close Punctuation
ValueCountFrequency (%)
) 439
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 58
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9264
55.6%
Common 7348
44.1%
Latin 56
 
0.3%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
730
 
7.9%
707
 
7.6%
671
 
7.2%
605
 
6.5%
464
 
5.0%
457
 
4.9%
456
 
4.9%
450
 
4.9%
445
 
4.8%
442
 
4.8%
Other values (226) 3837
41.4%
Common
ValueCountFrequency (%)
2762
37.6%
1 729
 
9.9%
, 589
 
8.0%
0 449
 
6.1%
( 439
 
6.0%
) 439
 
6.0%
2 399
 
5.4%
3 317
 
4.3%
4 270
 
3.7%
5 237
 
3.2%
Other values (8) 718
 
9.8%
Latin
ValueCountFrequency (%)
e 16
28.6%
V 6
 
10.7%
W 5
 
8.9%
E 5
 
8.9%
I 5
 
8.9%
S 5
 
8.9%
K 5
 
8.9%
A 3
 
5.4%
2
 
3.6%
B 2
 
3.6%
Other values (2) 2
 
3.6%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9264
55.6%
ASCII 7402
44.4%
Number Forms 2
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2762
37.3%
1 729
 
9.8%
, 589
 
8.0%
0 449
 
6.1%
( 439
 
5.9%
) 439
 
5.9%
2 399
 
5.4%
3 317
 
4.3%
4 270
 
3.6%
5 237
 
3.2%
Other values (19) 772
 
10.4%
Hangul
ValueCountFrequency (%)
730
 
7.9%
707
 
7.6%
671
 
7.2%
605
 
6.5%
464
 
5.0%
457
 
4.9%
456
 
4.9%
450
 
4.9%
445
 
4.8%
442
 
4.8%
Other values (226) 3837
41.4%
Number Forms
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

소재지전화
Text

MISSING 

Distinct112
Distinct (%)99.1%
Missing325
Missing (%)74.2%
Memory size3.6 KiB
2023-12-12T14:24:02.001458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.123894
Min length11

Characters and Unicode

Total characters1370
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique111 ?
Unique (%)98.2%

Sample

1st row051-758-8491
2nd row051-758-8111
3rd row051-754-2304
4th row051-753-9755
5th row051-758-9015
ValueCountFrequency (%)
051-755-4999 2
 
1.8%
051-612-1119 1
 
0.9%
051-752-4879 1
 
0.9%
051-611-8187 1
 
0.9%
051-752-9696 1
 
0.9%
02-430-9471 1
 
0.9%
051-752-4490 1
 
0.9%
051-753-7583 1
 
0.9%
051-759-5389 1
 
0.9%
051-757-5353 1
 
0.9%
Other values (102) 102
90.3%
2023-12-12T14:24:02.494767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 226
16.5%
5 206
15.0%
0 201
14.7%
1 176
12.8%
7 143
10.4%
2 77
 
5.6%
9 75
 
5.5%
8 73
 
5.3%
6 66
 
4.8%
3 65
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1144
83.5%
Dash Punctuation 226
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 206
18.0%
0 201
17.6%
1 176
15.4%
7 143
12.5%
2 77
 
6.7%
9 75
 
6.6%
8 73
 
6.4%
6 66
 
5.8%
3 65
 
5.7%
4 62
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 226
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1370
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 226
16.5%
5 206
15.0%
0 201
14.7%
1 176
12.8%
7 143
10.4%
2 77
 
5.6%
9 75
 
5.5%
8 73
 
5.3%
6 66
 
4.8%
3 65
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1370
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 226
16.5%
5 206
15.0%
0 201
14.7%
1 176
12.8%
7 143
10.4%
2 77
 
5.6%
9 75
 
5.5%
8 73
 
5.3%
6 66
 
4.8%
3 65
 
4.7%

Missing values

2023-12-12T14:23:59.558597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:23:59.666055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지(도로명)소재지전화
0건강기능식품일반판매업아모레 새수영특약점부산광역시 수영구 수영로 685, 영동빌딩 9층 (광안동)051-758-8491
1건강기능식품일반판매업아모레화장품수연점부산광역시 수영구 연수로 405 (수영동,수용빌딩 3층)051-758-8111
2건강기능식품일반판매업정관장 수영역점부산광역시 수영구 연수로 400 (광안동)051-754-2304
3건강기능식품일반판매업자산의원부산광역시 수영구 광남로 223 (민락동)051-753-9755
4건강기능식품일반판매업금오약품부산광역시 수영구 광서로 55-1, 2층 (광안동)051-758-9015
5건강기능식품일반판매업뉴랜드올네이처부산광역시 수영구 과정로41번길 58 (망미동)051-758-5050
6건강기능식품일반판매업비알엠연구소부산광역시 수영구 광안해변로294번길 7, 103동 107호 (민락동, 진로비치아파트)051-625-6800
7건강기능식품일반판매업남부산농협 신광안지점부산광역시 수영구 광일로 20 (광안동)051-0753-1400
8건강기능식품일반판매업일동후디스(주)부산지점부산광역시 수영구 수영로606번길 24 (광안동)051-751-2066
9건강기능식품일반판매업정관장홍삼남천점부산광역시 수영구 수영로 382, 1층 (남천동)051-0627-1939
업종명업소명소재지(도로명)소재지전화
428건강기능식품유통전문판매업터닝포인트부산광역시 수영구 광안해변로 311, 서희스타힐스 센텀프리모 19층 1931호 (민락동)<NA>
429건강기능식품유통전문판매업주식회사티읕부산광역시 수영구 수영로507번길 14, 세종엠제이드 503호 (광안동)070-4693-0528
430건강기능식품유통전문판매업부산시약사신용협동조합부산광역시 수영구 광남로 18, 부산시약사신용협동조합 4층 (남천동)051-663-3445
431건강기능식품유통전문판매업매이들부산광역시 수영구 광안로61번길 60, 18층 1803호 (민락동)<NA>
432건강기능식품유통전문판매업섬꽃부산광역시 수영구 수영로 421, 7층 일부 (남천동)051-621-7770
433건강기능식품유통전문판매업주식회사 기억부산광역시 수영구 수영로 759, 알파오피스텔 지하1층 1509호 (수영동)<NA>
434건강기능식품유통전문판매업주식회사 프렌즈엠부산광역시 수영구 광남로 211, 미래파크빌딩 6층 (민락동)<NA>
435건강기능식품유통전문판매업제이컴퍼니부산광역시 수영구 남천동로9번길 41, 인재빌딩 4층 (남천동)<NA>
436건강기능식품유통전문판매업주식회사 씨엘블루부산광역시 수영구 망미번영로16번나길 7, 1층 102호 (광안동)<NA>
437건강기능식품유통전문판매업플러스엑스팜부산광역시 수영구 수영로 710-1, 프라임메디컬 빌딩 1층 102호 (광안동)<NA>