Overview

Dataset statistics

Number of variables4
Number of observations272
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.6 KiB
Average record size in memory32.5 B

Variable types

Categorical2
Text2

Dataset

Description제주국제자유도시개발센터에서 운영하는 JDC지정면세점의 2018년 4월 기준 입점 업체별 브랜드 현황 정보
Author제주국제자유도시개발센터
URLhttps://www.data.go.kr/data/3050975/fileData.do

Reproduction

Analysis started2023-12-12 04:28:59.017677
Analysis finished2023-12-12 04:28:59.523791
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

품목
Categorical

Distinct11
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
주류
57 
향수, 화장품
57 
패션
38 
선글라스
36 
시계
26 
Other values (6)
58 

Length

Max length7
Median length2
Mean length3.4558824
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주류
2nd row주류
3rd row주류
4th row주류
5th row주류

Common Values

ValueCountFrequency (%)
주류 57
21.0%
향수, 화장품 57
21.0%
패션 38
14.0%
선글라스 36
13.2%
시계 26
9.6%
담배 22
 
8.1%
초콜렛 15
 
5.5%
액세서리 12
 
4.4%
문구 5
 
1.8%
완구 2
 
0.7%

Length

2023-12-12T13:28:59.608832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
주류 57
17.3%
향수 57
17.3%
화장품 57
17.3%
패션 38
11.6%
선글라스 36
10.9%
시계 26
7.9%
담배 22
 
6.7%
초콜렛 15
 
4.6%
액세서리 12
 
3.6%
문구 5
 
1.5%
Other values (2) 4
 
1.2%
Distinct83
Distinct (%)30.5%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-12T13:28:59.893420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length12
Mean length6.2830882
Min length3

Characters and Unicode

Total characters1709
Distinct characters165
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)11.8%

Sample

1st row다린앤컴퍼니
2nd row다은컴퍼니
3rd row다은컴퍼니
4th row다은컴퍼니
5th row정우 인터내셔날
ValueCountFrequency (%)
인터내셔날 21
 
6.4%
lk 18
 
5.5%
인비트윈 13
 
4.0%
kt&g 12
 
3.7%
b&f통상 10
 
3.0%
주)디엘이노베이션 9
 
2.7%
주)우림에프엠지 8
 
2.4%
티디코 7
 
2.1%
ntc 7
 
2.1%
엘코잉크 7
 
2.1%
Other values (88) 216
65.9%
2023-12-12T13:29:00.377426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
65
 
3.8%
62
 
3.6%
62
 
3.6%
( 60
 
3.5%
) 60
 
3.5%
54
 
3.2%
51
 
3.0%
49
 
2.9%
45
 
2.6%
43
 
2.5%
Other values (155) 1158
67.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1233
72.1%
Uppercase Letter 249
 
14.6%
Space Separator 62
 
3.6%
Open Punctuation 60
 
3.5%
Close Punctuation 60
 
3.5%
Other Punctuation 30
 
1.8%
Other Symbol 15
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
65
 
5.3%
62
 
5.0%
54
 
4.4%
51
 
4.1%
49
 
4.0%
45
 
3.6%
43
 
3.5%
33
 
2.7%
27
 
2.2%
27
 
2.2%
Other values (127) 777
63.0%
Uppercase Letter
ValueCountFrequency (%)
K 35
14.1%
T 33
13.3%
L 26
10.4%
C 21
8.4%
B 17
 
6.8%
G 16
 
6.4%
N 16
 
6.4%
F 14
 
5.6%
D 10
 
4.0%
I 10
 
4.0%
Other values (11) 51
20.5%
Other Punctuation
ValueCountFrequency (%)
14
46.7%
& 12
40.0%
. 4
 
13.3%
Space Separator
ValueCountFrequency (%)
62
100.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Close Punctuation
ValueCountFrequency (%)
) 60
100.0%
Other Symbol
ValueCountFrequency (%)
15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1248
73.0%
Latin 249
 
14.6%
Common 212
 
12.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
65
 
5.2%
62
 
5.0%
54
 
4.3%
51
 
4.1%
49
 
3.9%
45
 
3.6%
43
 
3.4%
33
 
2.6%
27
 
2.2%
27
 
2.2%
Other values (128) 792
63.5%
Latin
ValueCountFrequency (%)
K 35
14.1%
T 33
13.3%
L 26
10.4%
C 21
8.4%
B 17
 
6.8%
G 16
 
6.4%
N 16
 
6.4%
F 14
 
5.6%
D 10
 
4.0%
I 10
 
4.0%
Other values (11) 51
20.5%
Common
ValueCountFrequency (%)
62
29.2%
( 60
28.3%
) 60
28.3%
14
 
6.6%
& 12
 
5.7%
. 4
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1233
72.1%
ASCII 447
 
26.2%
None 29
 
1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
65
 
5.3%
62
 
5.0%
54
 
4.4%
51
 
4.1%
49
 
4.0%
45
 
3.6%
43
 
3.5%
33
 
2.7%
27
 
2.2%
27
 
2.2%
Other values (127) 777
63.0%
ASCII
ValueCountFrequency (%)
62
13.9%
( 60
13.4%
) 60
13.4%
K 35
 
7.8%
T 33
 
7.4%
L 26
 
5.8%
C 21
 
4.7%
B 17
 
3.8%
G 16
 
3.6%
N 16
 
3.6%
Other values (16) 101
22.6%
None
ValueCountFrequency (%)
15
51.7%
14
48.3%

상품군
Categorical

Distinct7
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
수입
152 
환급
62 
국산
45 
수입,환급
 
10
국산,수입
 
1
Other values (2)
 
2

Length

Max length6
Median length2
Mean length2.1470588
Min length2

Unique

Unique3 ?
Unique (%)1.1%

Sample

1st row수입
2nd row수입
3rd row수입
4th row수입
5th row수입

Common Values

ValueCountFrequency (%)
수입 152
55.9%
환급 62
22.8%
국산 45
 
16.5%
수입,환급 10
 
3.7%
국산,수입 1
 
0.4%
수입, 환급 1
 
0.4%
수입,국산 1
 
0.4%

Length

2023-12-12T13:29:00.538971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:29:00.713710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수입 153
56.0%
환급 63
23.1%
국산 45
 
16.5%
수입,환급 10
 
3.7%
국산,수입 1
 
0.4%
수입,국산 1
 
0.4%
Distinct254
Distinct (%)93.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2023-12-12T13:29:01.085020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length15
Mean length7.5294118
Min length2

Characters and Unicode

Total characters2048
Distinct characters156
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique238 ?
Unique (%)87.5%

Sample

1st rowBALLANTINE
2nd rowJACK DANIEL
3rd rowGLENDRONACH
4th rowWOODFORD RESERVE
5th rowCHIVAS REGAL
ValueCountFrequency (%)
prada 3
 
0.9%
lanvin 3
 
0.9%
gucci 3
 
0.9%
sui 2
 
0.6%
bottega 2
 
0.6%
sun 2
 
0.6%
montblanc 2
 
0.6%
kors 2
 
0.6%
london 2
 
0.6%
swarovski 2
 
0.6%
Other values (308) 324
93.4%
2023-12-12T13:29:01.621327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 104
 
5.1%
E 90
 
4.4%
a 85
 
4.2%
O 80
 
3.9%
e 78
 
3.8%
77
 
3.8%
L 75
 
3.7%
N 74
 
3.6%
I 74
 
3.6%
r 69
 
3.4%
Other values (146) 1242
60.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 1100
53.7%
Lowercase Letter 689
33.6%
Other Letter 145
 
7.1%
Space Separator 77
 
3.8%
Other Punctuation 27
 
1.3%
Decimal Number 4
 
0.2%
Dash Punctuation 2
 
0.1%
Close Punctuation 2
 
0.1%
Open Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
8.3%
5
 
3.4%
5
 
3.4%
5
 
3.4%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (83) 100
69.0%
Uppercase Letter
ValueCountFrequency (%)
A 104
 
9.5%
E 90
 
8.2%
O 80
 
7.3%
L 75
 
6.8%
N 74
 
6.7%
I 74
 
6.7%
S 68
 
6.2%
R 67
 
6.1%
T 54
 
4.9%
C 52
 
4.7%
Other values (16) 362
32.9%
Lowercase Letter
ValueCountFrequency (%)
a 85
12.3%
e 78
11.3%
r 69
10.0%
i 63
 
9.1%
o 58
 
8.4%
n 50
 
7.3%
s 40
 
5.8%
l 32
 
4.6%
t 26
 
3.8%
u 24
 
3.5%
Other values (15) 164
23.8%
Other Punctuation
ValueCountFrequency (%)
. 17
63.0%
' 4
 
14.8%
& 3
 
11.1%
/ 2
 
7.4%
, 1
 
3.7%
Decimal Number
ValueCountFrequency (%)
7 2
50.0%
3 1
25.0%
9 1
25.0%
Space Separator
ValueCountFrequency (%)
77
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1789
87.4%
Hangul 145
 
7.1%
Common 114
 
5.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
8.3%
5
 
3.4%
5
 
3.4%
5
 
3.4%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (83) 100
69.0%
Latin
ValueCountFrequency (%)
A 104
 
5.8%
E 90
 
5.0%
a 85
 
4.8%
O 80
 
4.5%
e 78
 
4.4%
L 75
 
4.2%
N 74
 
4.1%
I 74
 
4.1%
r 69
 
3.9%
S 68
 
3.8%
Other values (41) 992
55.4%
Common
ValueCountFrequency (%)
77
67.5%
. 17
 
14.9%
' 4
 
3.5%
& 3
 
2.6%
- 2
 
1.8%
/ 2
 
1.8%
) 2
 
1.8%
( 2
 
1.8%
7 2
 
1.8%
3 1
 
0.9%
Other values (2) 2
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1903
92.9%
Hangul 145
 
7.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 104
 
5.5%
E 90
 
4.7%
a 85
 
4.5%
O 80
 
4.2%
e 78
 
4.1%
77
 
4.0%
L 75
 
3.9%
N 74
 
3.9%
I 74
 
3.9%
r 69
 
3.6%
Other values (53) 1097
57.6%
Hangul
ValueCountFrequency (%)
12
 
8.3%
5
 
3.4%
5
 
3.4%
5
 
3.4%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
3
 
2.1%
Other values (83) 100
69.0%

Correlations

2023-12-12T13:29:01.779499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목업체명상품군
품목1.0000.9970.596
업체명0.9971.0000.897
상품군0.5960.8971.000
2023-12-12T13:29:01.897890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목상품군
품목1.0000.344
상품군0.3441.000
2023-12-12T13:29:01.982728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목상품군
품목1.0000.344
상품군0.3441.000

Missing values

2023-12-12T13:28:59.345859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:28:59.488743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

품목업체명상품군브랜드명
0주류다린앤컴퍼니수입BALLANTINE
1주류다은컴퍼니수입JACK DANIEL
2주류다은컴퍼니수입GLENDRONACH
3주류다은컴퍼니수입WOODFORD RESERVE
4주류정우 인터내셔날수입CHIVAS REGAL
5주류정우 인터내셔날수입ROYAL SALUTE
6주류정우 인터내셔날수입GLENLIVET
7주류NTC수입J. WALKER
8주류NTC수입Baileys
9주류NTC수입SHUI JING FANG
품목업체명상품군브랜드명
262향수, 화장품KL 리미티드수입PRADA
263향수, 화장품KL 리미티드수입VALENTINO
264향수, 화장품(주)마운티너스국산DUFT&DOFT
265향수, 화장품(주)이니스프리국산Innisfree
266향수, 화장품(주)해브앤비국산Dr.jart
267향수, 화장품아모레퍼시픽국산IOPE
268향수, 화장품엘지생활건강국산Belif
269향수, 화장품엘지생활건강국산L/G
270향수, 화장품엘지생활건강국산Sum37
271향수, 화장품제주 경제통상진흥원국산제주중소기업화장품