Overview

Dataset statistics

Number of variables: 5
Number of observations: 413
Missing cells: 270
Missing cells (%): 13.1%
Duplicate rows: 1
Duplicate rows (%): 0.2%
Total size in memory: 16.3 KiB
Average record size in memory: 40.3 B
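
The two percentages above can be re-derived from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (rows times columns) and the duplicate percentage over all rows; that assumption reproduces the reported figures, but it is inferred from the numbers rather than stated in the report.

```python
# Counts copied from the dataset statistics above.
n_rows = 413
n_cols = 5
missing_cells = 270
duplicate_rows = 1

# Assumed denominators: all cells for missing values, all rows for duplicates.
missing_pct = 100 * missing_cells / (n_rows * n_cols)
duplicate_pct = 100 * duplicate_rows / n_rows

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```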

Variable types

Categorical: 2
Text: 3

Dataset

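Description: Status of health functional food businesses in Jung-gu, Incheon Metropolitan City (인천광역시 중구 건강기능식품관련업 현황). Lists the business name, address, and phone number of general health functional food retailers and health functional food distribution-specialist sellers in Jung-gu.
Author: Jung-gu, Incheon Metropolitan City (인천광역시 중구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15062353&srcSe=7661IVAWM27C61E190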

Alerts

Constant: 데이터기준일자 has constant value "2023-07-05"
Duplicates: Dataset has 1 (0.2%) duplicate row
Imbalance: 업종명 is highly imbalanced (82.3%)
Missing: 소재지전화 has 270 (65.4%) missing values
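
The 82.3% imbalance figure for 업종명 can be reproduced from the two category counts (402 and 11) with an entropy-based score, 1 - H / log2(k), where H is the Shannon entropy in bits and k is the number of categories. This is a sketch of one formula that matches the reported value; the function name and the handling of a single-category column are assumptions made for illustration, not taken from the report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of category counts (k = number of categories)."""
    k = len(counts)
    if k < 2:
        # A single category has no balance to measure (choice made for this sketch).
        return 0.0
    total = sum(counts)
    # Shannon entropy in bits; zero counts contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Counts of the two 업종명 categories from the report.
score = imbalance_score([402, 11])
print(f"{100 * score:.1f}%")
```

A perfectly balanced column scores 0 under this definition, and the score approaches 1 as one category dominates.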

Reproduction

Analysis started: 2024-03-18 04:37:11.240988
Analysis finished: 2024-03-18 04:37:11.672212
Duration: 0.43 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종명
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.4 KiB
건강기능식품일반판매업 402
건강기능식품유통전문판매업 11

Length

Max length: 13
Median length: 11
Mean length: 11.053269
Min length: 11
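
The length statistics above follow directly from the two category values and their counts. The sketch below assumes lengths are counted in Unicode code points on NFC-normalized strings; the normalization step is an assumption added so the count does not depend on how the Hangul syllables are encoded.

```python
import unicodedata

# The two 업종명 values and their counts, as listed in the report.
value_counts = {
    "건강기능식품일반판매업": 402,
    "건강기능식품유통전문판매업": 11,
}

# Length in code points after NFC normalization (assumed counting rule).
lengths = {v: len(unicodedata.normalize("NFC", v)) for v in value_counts}

n = sum(value_counts.values())
mean_length = sum(lengths[v] * c for v, c in value_counts.items()) / n

print("Min length:", min(lengths.values()))
print("Max length:", max(lengths.values()))
print(f"Mean length: {mean_length:.6f}")
```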

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 건강기능식품일반판매업
2nd row: 건강기능식품일반판매업
3rd row: 건강기능식품일반판매업
4th row: 건강기능식품일반판매업
5th row: 건강기능식품일반판매업

Common Values

Value | Count | Frequency (%)
건강기능식품일반판매업 | 402 | 97.3%
건강기능식품유통전문판매업 | 11 | 2.7%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Bar chart of common values: 건강기능식품일반판매업 402 (97.3%), 건강기능식품유통전문판매업 11 (2.7%)]

업소명
Text

Distinct: 401
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.4 KiB

Length

Max length: 37
Median length: 23
Mean length: 7.4624697
Min length: 2

Characters and Unicode

Total characters: 3082
Distinct characters: 436
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
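
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module for two strings that appear in this column. The long category names are mapped by hand for the codes that occur here; the helper name `category_counts` is hypothetical, and the standard library does not expose scripts or blocks, so only categories are shown.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this example.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Nd": "Decimal Number",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        # NFC keeps each Hangul syllable as a single code point.
        for ch in unicodedata.normalize("NFC", value):
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Two strings taken from the 업소명 column of the report.
counts = category_counts(["(주)이마트 동인천점", "지에스25"])
for name, n in counts.most_common():
    print(name, n)
```

Hangul syllables fall under "Other Letter" because the script has no upper or lower case, which is why that category dominates the tables for this column.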

Unique

Unique: 391
Unique (%): 94.7%

Sample

1st row: 아모레 인천중앙 특약점
2nd row: (주)이마트 동인천점
3rd row: 정관장 동인천 전시판매장
4th row: 마임신포지사
5th row: 김정문알로에

Value | Count | Frequency (%)
주식회사 | 18 | 3.3%
인셀덤 | 12 | 2.2%
주)신세계면세점 | 5 | 0.9%
인셀덤코리아 | 4 | 0.7%
영종하늘도시점 | 4 | 0.7%
씨제이올리브영(주 | 4 | 0.7%
정관장 | 4 | 0.7%
코리아 | 3 | 0.5%
입국장점 | 3 | 0.5%
지에스25 | 3 | 0.5%
Other values (462) | 486 | 89.0%

Most occurring characters

ValueCountFrequency (%)
133
 
4.3%
96
 
3.1%
87
 
2.8%
81
 
2.6%
( 78
 
2.5%
) 78
 
2.5%
78
 
2.5%
66
 
2.1%
53
 
1.7%
52
 
1.7%
Other values (426) 2280
74.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2558 | 83.0%
Space Separator | 133 | 4.3%
Uppercase Letter | 101 | 3.3%
Open Punctuation | 78 | 2.5%
Close Punctuation | 78 | 2.5%
Lowercase Letter | 71 | 2.3%
Decimal Number | 56 | 1.8%
Other Punctuation | 6 | 0.2%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
96
 
3.8%
87
 
3.4%
81
 
3.2%
78
 
3.0%
66
 
2.6%
53
 
2.1%
52
 
2.0%
46
 
1.8%
40
 
1.6%
38
 
1.5%
Other values (367) 1921
75.1%
Uppercase Letter
ValueCountFrequency (%)
S 12
 
11.9%
A 10
 
9.9%
N 8
 
7.9%
L 7
 
6.9%
E 7
 
6.9%
J 7
 
6.9%
T 6
 
5.9%
H 5
 
5.0%
G 4
 
4.0%
I 4
 
4.0%
Other values (13) 31
30.7%
Lowercase Letter
ValueCountFrequency (%)
l 12
16.9%
e 10
14.1%
a 7
9.9%
o 6
8.5%
i 5
 
7.0%
t 4
 
5.6%
y 4
 
5.6%
b 3
 
4.2%
s 3
 
4.2%
r 3
 
4.2%
Other values (9) 14
19.7%
Decimal Number
ValueCountFrequency (%)
2 20
35.7%
5 10
17.9%
1 10
17.9%
0 7
 
12.5%
7 5
 
8.9%
3 2
 
3.6%
8 1
 
1.8%
9 1
 
1.8%
Other Punctuation
ValueCountFrequency (%)
& 2
33.3%
: 1
16.7%
. 1
16.7%
/ 1
16.7%
# 1
16.7%
Space Separator
ValueCountFrequency (%)
133
100.0%
Open Punctuation
ValueCountFrequency (%)
( 78
100.0%
Close Punctuation
ValueCountFrequency (%)
) 78
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2558 | 83.0%
Common | 352 | 11.4%
Latin | 172 | 5.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
96
 
3.8%
87
 
3.4%
81
 
3.2%
78
 
3.0%
66
 
2.6%
53
 
2.1%
52
 
2.0%
46
 
1.8%
40
 
1.6%
38
 
1.5%
Other values (367) 1921
75.1%
Latin
ValueCountFrequency (%)
l 12
 
7.0%
S 12
 
7.0%
A 10
 
5.8%
e 10
 
5.8%
N 8
 
4.7%
a 7
 
4.1%
L 7
 
4.1%
E 7
 
4.1%
J 7
 
4.1%
T 6
 
3.5%
Other values (32) 86
50.0%
Common
ValueCountFrequency (%)
133
37.8%
( 78
22.2%
) 78
22.2%
2 20
 
5.7%
5 10
 
2.8%
1 10
 
2.8%
0 7
 
2.0%
7 5
 
1.4%
3 2
 
0.6%
& 2
 
0.6%
Other values (7) 7
 
2.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2558 | 83.0%
ASCII | 524 | 17.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
133
25.4%
( 78
14.9%
) 78
14.9%
2 20
 
3.8%
l 12
 
2.3%
S 12
 
2.3%
5 10
 
1.9%
A 10
 
1.9%
e 10
 
1.9%
1 10
 
1.9%
Other values (49) 151
28.8%
Hangul
ValueCountFrequency (%)
96
 
3.8%
87
 
3.4%
81
 
3.2%
78
 
3.0%
66
 
2.6%
53
 
2.1%
52
 
2.0%
46
 
1.8%
40
 
1.6%
38
 
1.5%
Other values (367) 1921
75.1%

소재지
Text

Distinct: 405
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.4 KiB

Length

Max length: 66
Median length: 51
Mean length: 40.803874
Min length: 19

Characters and Unicode

Total characters: 16852
Distinct characters: 292
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 397
Unique (%): 96.1%

Sample

1st row: 인천광역시 중구 제물량로 122, 3층 (답동)
2nd row: 인천광역시 중구 인중로 134 (신생동, 1층)
3rd row: 인천광역시 중구 참외전로 120-2 (인현동)
4th row: 인천광역시 중구 우현로 3 (사동, 범진빌딩5층)
5th row: 인천광역시 중구 우현로39번길 6-8, 1층 (신포동)

Value | Count | Frequency (%)
인천광역시 | 413 | 12.7%
중구 | 413 | 12.7%
중산동 | 118 | 3.6%
운서동 | 100 | 3.1%
1층 | 83 | 2.6%
운남동 | 40 | 1.2%
2층 | 33 | 1.0%
3층 | 32 | 1.0%
하늘별빛로 | 27 | 0.8%
4층 | 25 | 0.8%
Other values (831) | 1959 | 60.4%

Most occurring characters

ValueCountFrequency (%)
2831
 
16.8%
1 792
 
4.7%
590
 
3.5%
, 583
 
3.5%
573
 
3.4%
515
 
3.1%
2 495
 
2.9%
477
 
2.8%
459
 
2.7%
0 424
 
2.5%
Other values (282) 9113
54.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9138 | 54.2%
Decimal Number | 3252 | 19.3%
Space Separator | 2831 | 16.8%
Other Punctuation | 585 | 3.5%
Close Punctuation | 421 | 2.5%
Open Punctuation | 419 | 2.5%
Dash Punctuation | 90 | 0.5%
Uppercase Letter | 82 | 0.5%
Lowercase Letter | 32 | 0.2%
Math Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
590
 
6.5%
573
 
6.3%
515
 
5.6%
477
 
5.2%
459
 
5.0%
417
 
4.6%
416
 
4.6%
415
 
4.5%
412
 
4.5%
286
 
3.1%
Other values (243) 4578
50.1%
Uppercase Letter
ValueCountFrequency (%)
S 15
18.3%
K 12
14.6%
B 8
9.8%
C 8
9.8%
L 6
 
7.3%
I 5
 
6.1%
A 5
 
6.1%
H 4
 
4.9%
G 4
 
4.9%
W 3
 
3.7%
Other values (6) 12
14.6%
Decimal Number
ValueCountFrequency (%)
1 792
24.4%
2 495
15.2%
0 424
13.0%
3 355
10.9%
4 320
9.8%
6 228
 
7.0%
5 202
 
6.2%
7 182
 
5.6%
8 129
 
4.0%
9 125
 
3.8%
Lowercase Letter
ValueCountFrequency (%)
e 14
43.8%
y 6
18.8%
k 3
 
9.4%
t 3
 
9.4%
i 3
 
9.4%
c 3
 
9.4%
Other Punctuation
ValueCountFrequency (%)
, 583
99.7%
/ 2
 
0.3%
Space Separator
ValueCountFrequency (%)
2831
100.0%
Close Punctuation
ValueCountFrequency (%)
) 421
100.0%
Open Punctuation
ValueCountFrequency (%)
( 419
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 90
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 9138 | 54.2%
Common | 7600 | 45.1%
Latin | 114 | 0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
590
 
6.5%
573
 
6.3%
515
 
5.6%
477
 
5.2%
459
 
5.0%
417
 
4.6%
416
 
4.6%
415
 
4.5%
412
 
4.5%
286
 
3.1%
Other values (243) 4578
50.1%
Latin
ValueCountFrequency (%)
S 15
13.2%
e 14
12.3%
K 12
 
10.5%
B 8
 
7.0%
C 8
 
7.0%
y 6
 
5.3%
L 6
 
5.3%
I 5
 
4.4%
A 5
 
4.4%
H 4
 
3.5%
Other values (12) 31
27.2%
Common
ValueCountFrequency (%)
2831
37.2%
1 792
 
10.4%
, 583
 
7.7%
2 495
 
6.5%
0 424
 
5.6%
) 421
 
5.5%
( 419
 
5.5%
3 355
 
4.7%
4 320
 
4.2%
6 228
 
3.0%
Other values (7) 732
 
9.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9138 | 54.2%
ASCII | 7714 | 45.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2831
36.7%
1 792
 
10.3%
, 583
 
7.6%
2 495
 
6.4%
0 424
 
5.5%
) 421
 
5.5%
( 419
 
5.4%
3 355
 
4.6%
4 320
 
4.1%
6 228
 
3.0%
Other values (29) 846
 
11.0%
Hangul
ValueCountFrequency (%)
590
 
6.5%
573
 
6.3%
515
 
5.6%
477
 
5.2%
459
 
5.0%
417
 
4.6%
416
 
4.6%
415
 
4.5%
412
 
4.5%
286
 
3.1%
Other values (243) 4578
50.1%

소재지전화
Text

MISSING 

Distinct: 132
Distinct (%): 92.3%
Missing: 270
Missing (%): 65.4%
Memory size: 3.4 KiB

Length

Max length: 14
Median length: 12
Mean length: 11.965035
Min length: 9

Characters and Unicode

Total characters: 1711
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 122
Unique (%): 85.3%
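
For a column with missing values, the denominators differ between statistics. The sketch below assumes that the missing percentage is relative to all 413 rows, while the distinct percentage, unique percentage, and mean length are relative to the 143 non-missing values. That reading reproduces every figure reported for 소재지전화, but it is inferred from the numbers rather than stated in the report.

```python
# Figures copied from the 소재지전화 section of the report.
n_rows = 413
missing = 270
distinct = 132
unique = 122
total_chars = 1711

# Non-missing values: the assumed denominator for the per-value statistics.
present = n_rows - missing

missing_pct = 100 * missing / n_rows
distinct_pct = 100 * distinct / present
unique_pct = 100 * unique / present
mean_length = total_chars / present

print(f"Present values: {present}")
print(f"Missing (%): {missing_pct:.1f}%")
print(f"Distinct (%): {distinct_pct:.1f}%")
print(f"Unique (%): {unique_pct:.1f}%")
print(f"Mean length: {mean_length:.6f}")
```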

Sample

1st row: 032-451-1234
2nd row: 032-777-2304
3rd row: 032-762-4795
4th row: 032-761-3356
5th row: 032-761-2381

Value | Count | Frequency (%)
02-6048-5358 | 3 | 2.1%
032-762-0088 | 2 | 1.4%
032-743-2078 | 2 | 1.4%
032-682-1476 | 2 | 1.4%
032-752-6663 | 2 | 1.4%
032-766-0075 | 2 | 1.4%
032-747-1134 | 2 | 1.4%
02-742-7171 | 2 | 1.4%
032-746-0989 | 2 | 1.4%
070-8821-8880 | 2 | 1.4%
Other values (122) | 122 | 85.3%

Most occurring characters

ValueCountFrequency (%)
- 280
16.4%
0 251
14.7%
2 221
12.9%
7 192
11.2%
3 190
11.1%
8 134
7.8%
5 102
 
6.0%
6 98
 
5.7%
4 91
 
5.3%
1 88
 
5.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1431 | 83.6%
Dash Punctuation | 280 | 16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 251
17.5%
2 221
15.4%
7 192
13.4%
3 190
13.3%
8 134
9.4%
5 102
7.1%
6 98
 
6.8%
4 91
 
6.4%
1 88
 
6.1%
9 64
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 280
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1711 | 100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 280
16.4%
0 251
14.7%
2 221
12.9%
7 192
11.2%
3 190
11.1%
8 134
7.8%
5 102
 
6.0%
6 98
 
5.7%
4 91
 
5.3%
1 88
 
5.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1711 | 100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 280
16.4%
0 251
14.7%
2 221
12.9%
7 192
11.2%
3 190
11.1%
8 134
7.8%
5 102
 
6.0%
6 98
 
5.7%
4 91
 
5.3%
1 88
 
5.1%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.4 KiB
2023-07-05 413

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-07-05
2nd row: 2023-07-05
3rd row: 2023-07-05
4th row: 2023-07-05
5th row: 2023-07-05

Common Values

Value | Count | Frequency (%)
2023-07-05 | 413 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Bar chart of common values: 2023-07-05 413 (100.0%)]

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

First rows

index | 업종명 | 업소명 | 소재지 | 소재지전화 | 데이터기준일자
0 | 건강기능식품일반판매업 | 아모레 인천중앙 특약점 | 인천광역시 중구 제물량로 122, 3층 (답동) | <NA> | 2023-07-05
1 | 건강기능식품일반판매업 | (주)이마트 동인천점 | 인천광역시 중구 인중로 134 (신생동, 1층) | 032-451-1234 | 2023-07-05
2 | 건강기능식품일반판매업 | 정관장 동인천 전시판매장 | 인천광역시 중구 참외전로 120-2 (인현동) | 032-777-2304 | 2023-07-05
3 | 건강기능식품일반판매업 | 마임신포지사 | 인천광역시 중구 우현로 3 (사동, 범진빌딩5층) | 032-762-4795 | 2023-07-05
4 | 건강기능식품일반판매업 | 김정문알로에 | 인천광역시 중구 우현로39번길 6-8, 1층 (신포동) | 032-761-3356 | 2023-07-05
5 | 건강기능식품일반판매업 | 중구농업협동조합신흥지점 | 인천광역시 중구 제물량로80번길 1 (신흥동2가) | 032-761-2381 | 2023-07-05
6 | 건강기능식품일반판매업 | 중구농업협동조합 하인천지점 | 인천광역시 중구 제물량로 276-1 (북성동2가) | 032-763-4602 | 2023-07-05
7 | 건강기능식품일반판매업 | 세븐일레븐(동인천점) | 인천광역시 중구 우현로90번길 19-13 (인현동) | 02-2127-5801 | 2023-07-05
8 | 건강기능식품일반판매업 | 중구농협하나로마트본점 | 인천광역시 중구 운남로 166 (운남동, 중구농협종합청사) | 032-746-0989 | 2023-07-05
9 | 건강기능식품일반판매업 | 한국관광공사인천공항 | 인천광역시 중구 공항로 272, 면세동 (운서동, 인천국제공항여객터미널) | 032-743-2078 | 2023-07-05

Last rows

index | 업종명 | 업소명 | 소재지 | 소재지전화 | 데이터기준일자
403 | 건강기능식품유통전문판매업 | 유밸눈편한안과 | 인천광역시 중구 우현로 90, 4층 (인현동) | 032-762-0088 | 2023-07-05
404 | 건강기능식품유통전문판매업 | (주)정신무역 | 인천광역시 중구 연안부두로53번길 2, 1층 (항동7가) | 070-8821-8880 | 2023-07-05
405 | 건강기능식품유통전문판매업 | 라이온코리아주식회사 | 인천광역시 중구 서해대로140번길 23, 씨제이라이온 (신흥동3가) | <NA> | 2023-07-05
406 | 건강기능식품유통전문판매업 | 주식회사 아인스 | 인천광역시 중구 자유공원로 26, 1층 (전동) | 031-684-6987 | 2023-07-05
407 | 건강기능식품유통전문판매업 | 기프트스타일 | 인천광역시 중구 중산로 118-21, 1층 (중산동) | <NA> | 2023-07-05
408 | 건강기능식품유통전문판매업 | (주)제이앤코스 | 인천광역시 중구 제물량로166번길 14, 4층 (신포동) | 070-8877-1959 | 2023-07-05
409 | 건강기능식품유통전문판매업 | 주식회사 피엘코리아무역 | 인천광역시 중구 백운로228번길 51, 1동 1층 (운북동) | 032-752-6663 | 2023-07-05
410 | 건강기능식품유통전문판매업 | (주)대관 | 인천광역시 중구 인중로 178, 정우빌딩 4층 403호 (사동) | 032-766-7919 | 2023-07-05
411 | 건강기능식품유통전문판매업 | (주)유니온제이 | 인천광역시 중구 영종대로 120, 두손에어파스텔 4층 414호 (운서동) | 032-682-1476 | 2023-07-05
412 | 건강기능식품유통전문판매업 | 올베이스 | 인천광역시 중구 축항대로86번길 47, 6동 5층 3호 (항동7가, 비취맨숀) | <NA> | 2023-07-05

Duplicate rows

Most frequently occurring

index | 업종명 | 업소명 | 소재지 | 소재지전화 | 데이터기준일자 | # duplicates
0 | 건강기능식품일반판매업 | 연세참가정의학과 | 인천광역시 중구 우현로 47 (신포동, 4층) | 032-766-0075 | 2023-07-05 | 2
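
The duplicate count can be cross-checked by counting identical rows. The sketch below uses a small hand-built list (the duplicated row from the table above, entered twice, plus one other row from the sample) rather than the full dataset. It also assumes that "# duplicates" means the number of occurrences of a row, while the overview figure of 1 duplicate row counts only the surplus copy; that interpretation is inferred from the two numbers, not stated in the report.

```python
from collections import Counter

# Illustrative rows only: (업종명, 업소명, 소재지, 소재지전화, 데이터기준일자).
duplicated_row = (
    "건강기능식품일반판매업",
    "연세참가정의학과",
    "인천광역시 중구 우현로 47 (신포동, 4층)",
    "032-766-0075",
    "2023-07-05",
)
other_row = (
    "건강기능식품일반판매업",
    "김정문알로에",
    "인천광역시 중구 우현로39번길 6-8, 1층 (신포동)",
    "032-761-3356",
    "2023-07-05",
)
rows = [duplicated_row, other_row, duplicated_row]

# Count how often each full row occurs, then keep rows seen more than once.
occurrences = Counter(rows)
duplicated = {row: n for row, n in occurrences.items() if n > 1}

# Surplus copies: occurrences beyond the first for each duplicated row.
surplus = sum(n - 1 for n in duplicated.values())

for row, n in duplicated.items():
    print(row[1], "occurs", n, "times")
print("Surplus duplicate rows:", surplus)
```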