Overview

Dataset statistics

Number of variables: 5
Number of observations: 118
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 0.8%
Total size in memory: 4.7 KiB
Average record size in memory: 41.1 B

Variable types

Categorical: 4
Text: 1

Dataset

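Description: JDC지정면세점_브랜드별 AS 연락처 및 주소(2014년 7월 기준) (JDC designated duty-free shop: after-sales service contact numbers and addresses by brand, as of July 2014)
Author: 제주국제자유도시개발센터 (Jeju Free International City Development Center, JDC)
URL: https://www.data.go.kr/data/15044056/fileData.do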

Alerts

[Duplicates] Dataset has 1 (0.8%) duplicate rows
[High correlation] 연착처 is highly overall correlated with 품종 and 2 other fields
[High correlation] 주소-세부주소 is highly overall correlated with 품종 and 2 other fields
[High correlation] 품종 is highly overall correlated with 연착처 and 2 other fields
[High correlation] 주소-시도구분 is highly overall correlated with 품종 and 2 other fields
[Imbalance] 주소-시도구분 is highly imbalanced (68.7%)
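
The 68.7% imbalance figure is consistent with one minus the normalized Shannon entropy of the class frequencies. The sketch below assumes that definition (the report itself does not state the formula) and uses the value counts listed for 주소-시도구분 later in this report; the function name `imbalance_score` is illustrative.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the class
    frequencies and k is the number of classes.
    0 means perfectly balanced, values near 1 mean one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

# Value counts reported for 주소-시도구분: 서울시, 경기도, 서울, 제주도
counts = [105, 9, 3, 1]
print(f"{imbalance_score(counts):.1%}")  # 68.7%
```

Because 서울시 accounts for 105 of the 118 rows, the entropy is far below its maximum of log(4), which is what pushes the score up.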

Reproduction

Analysis started: 2023-12-12 15:49:35.394650
Analysis finished: 2023-12-12 15:49:35.933222
Duration: 0.54 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

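품종 (product category)
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): 4.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count
선글라스 (sunglasses) | 42
패션 (fashion) | 34
시계 (watches) | 25
액세서리 (accessories) | 15
완구 (toys) | 2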

Length

Max length: 4
Median length: 2
Mean length: 2.9661017
Min length: 2
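
The length statistics can be re-derived from the category counts listed for 품종 in this section. The following is a minimal sketch using only the Python standard library; it assumes each Hangul syllable counts as one character, which is how Python's `len` treats precomposed text.

```python
from statistics import mean, median

# Category counts reported for 품종 in this section.
counts = {"선글라스": 42, "패션": 34, "시계": 25, "액세서리": 15, "완구": 2}

# Expand to one string length per observation (118 in total).
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

summary = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(summary)
```

The mean works out to 350 / 118, which rounds to the 2.9661017 shown above.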

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 패션
2nd row: 패션
3rd row: 패션
4th row: 패션
5th row: 패션

Common Values

Value | Count | Frequency (%)
선글라스 | 42 | 35.6%
패션 | 34 | 28.8%
시계 | 25 | 21.2%
액세서리 | 15 | 12.7%
완구 | 2 | 1.7%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

브랜드명 (brand name)
Text

Distinct: 111
Distinct (%): 94.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 23
Median length: 15
Mean length: 8.5254237
Min length: 2

Characters and Unicode

Total characters: 1006
Distinct characters: 76
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
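
As an illustration of these character properties, the sketch below uses Python's standard `unicodedata` module to count characters by general category for three values that appear in this report. The two-letter codes map to the category names used in the tables of this section: `Lu` is Uppercase Letter, `Ll` is Lowercase Letter, `Lo` is Other Letter, `Zs` is Space Separator, and `Po` is Other Punctuation. The helper name `category_counts` is illustrative.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category (e.g. 'Lu', 'Ll', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)

# Two brand names and one Hangul value that appear in this report.
for name in ["Louis Quatorze", "S.T.DUPONT", "라베트리나"]:
    print(name, dict(category_counts(name)))
```

Hangul syllables fall under Other Letter because they have no upper or lower case.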

Unique

Unique: 104
Unique (%): 88.1%

Sample

1st row: Longchamp
2nd row: Etro
3rd row: Kipling
4th row: Louis Quatorze
5th row: PRIMA CLASSE

Value | Count | Frequency (%)
[value missing] | 13 | 7.6%
butti | 6 | 3.5%
라베트리나 | 5 | 2.9%
fendi | 3 | 1.8%
kors | 3 | 1.8%
karl | 2 | 1.2%
lagerfeld | 2 | 1.2%
ferragamo | 2 | 1.2%
prada | 2 | 1.2%
aigner | 2 | 1.2%
Other values (120) | 131 | 76.6%

Most occurring characters

Value | Count | Frequency (%)
A | 55 | 5.5%
(space) | 53 | 5.3%
E | 45 | 4.5%
I | 42 | 4.2%
T | 40 | 4.0%
O | 40 | 4.0%
L | 38 | 3.8%
N | 38 | 3.8%
a | 35 | 3.5%
i | 33 | 3.3%
Other values (66) | 587 | 58.3%

Most occurring categories

Value | Count | Frequency (%)
Uppercase Letter | 587 | 58.3%
Lowercase Letter | 285 | 28.3%
Space Separator | 53 | 5.3%
Other Letter | 46 | 4.6%
Other Punctuation | 19 | 1.9%
Dash Punctuation | 12 | 1.2%
Math Symbol | 1 | 0.1%
Decimal Number | 1 | 0.1%
Close Punctuation | 1 | 0.1%
Open Punctuation | 1 | 0.1%

Most frequent character per category

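Uppercase Letter
Value | Count | Frequency (%)
A | 55 | 9.4%
E | 45 | 7.7%
I | 42 | 7.2%
T | 40 | 6.8%
O | 40 | 6.8%
L | 38 | 6.5%
N | 38 | 6.5%
R | 32 | 5.5%
S | 32 | 5.5%
C | 27 | 4.6%
Other values (16) | 198 | 33.7%

Lowercase Letter
Value | Count | Frequency (%)
a | 35 | 12.3%
i | 33 | 11.6%
e | 29 | 10.2%
o | 24 | 8.4%
r | 23 | 8.1%
s | 21 | 7.4%
l | 17 | 6.0%
c | 17 | 6.0%
n | 14 | 4.9%
u | 10 | 3.5%
Other values (12) | 62 | 21.8%

Other Letter
Value | Count | Frequency (%)
[character missing] | 7 | 15.2%
[character missing] | 6 | 13.0%
[character missing] | 6 | 13.0%
[character missing] | 6 | 13.0%
[character missing] | 6 | 13.0%
[character missing] | 2 | 4.3%
[character missing] | 2 | 4.3%
[character missing] | 1 | 2.2%
[character missing] | 1 | 2.2%
[character missing] | 1 | 2.2%
Other values (8) | 8 | 17.4%

Other Punctuation
Value | Count | Frequency (%)
. | 14 | 73.7%
& | 3 | 15.8%
' | 1 | 5.3%
, | 1 | 5.3%

Space Separator
Value | Count | Frequency (%)
(space) | 53 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 12 | 100.0%

Math Symbol
Value | Count | Frequency (%)
+ | 1 | 100.0%

Decimal Number
Value | Count | Frequency (%)
2 | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%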

Most occurring scripts

Value | Count | Frequency (%)
Latin | 872 | 86.7%
Common | 88 | 8.7%
Hangul | 46 | 4.6%

Most frequent character per script

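Latin
Value | Count | Frequency (%)
A | 55 | 6.3%
E | 45 | 5.2%
I | 42 | 4.8%
T | 40 | 4.6%
O | 40 | 4.6%
L | 38 | 4.4%
N | 38 | 4.4%
a | 35 | 4.0%
i | 33 | 3.8%
R | 32 | 3.7%
Other values (38) | 474 | 54.4%

Hangul
Value | Count | Frequency (%)
[character missing] | 7 | 15.2%
[character missing] | 6 | 13.0%
[character missing] | 6 | 13.0%
[character missing] | 6 | 13.0%
[character missing] | 6 | 13.0%
[character missing] | 2 | 4.3%
[character missing] | 2 | 4.3%
[character missing] | 1 | 2.2%
[character missing] | 1 | 2.2%
[character missing] | 1 | 2.2%
Other values (8) | 8 | 17.4%

Common
Value | Count | Frequency (%)
(space) | 53 | 60.2%
. | 14 | 15.9%
- | 12 | 13.6%
& | 3 | 3.4%
' | 1 | 1.1%
+ | 1 | 1.1%
2 | 1 | 1.1%
) | 1 | 1.1%
, | 1 | 1.1%
( | 1 | 1.1%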

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 960 | 95.4%
Hangul | 46 | 4.6%

Most frequent character per block

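ASCII
Value | Count | Frequency (%)
A | 55 | 5.7%
(space) | 53 | 5.5%
E | 45 | 4.7%
I | 42 | 4.4%
T | 40 | 4.2%
O | 40 | 4.2%
L | 38 | 4.0%
N | 38 | 4.0%
a | 35 | 3.6%
i | 33 | 3.4%
Other values (48) | 541 | 56.4%

Hangul
Value | Count | Frequency (%)
[character missing] | 7 | 15.2%
[character missing] | 6 | 13.0%
[character missing] | 6 | 13.0%
[character missing] | 6 | 13.0%
[character missing] | 6 | 13.0%
[character missing] | 2 | 4.3%
[character missing] | 2 | 4.3%
[character missing] | 1 | 2.2%
[character missing] | 1 | 2.2%
[character missing] | 1 | 2.2%
Other values (8) | 8 | 17.4%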

연착처 (contact number)
Categorical

HIGH CORRELATION

Distinct: 46
Distinct (%): 39.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count
<NA> | 20
02-466-4557 | 9
02-6712-0812 | 7
02-2156-0732 | 6
02-2658-0013 | 6
Other values (41) | 70

Length

Max length: 13
Median length: 12
Mean length: 10.118644
Min length: 4

Unique

Unique: 27
Unique (%): 22.9%

Sample

1st row: 02-513-2278
2nd row: 02-3018-2308
3rd row: 02-3489-6415
4th row: 02-582-5271
5th row: 02-761-0891

Common Values

Value | Count | Frequency (%)
<NA> | 20 | 16.9%
02-466-4557 | 9 | 7.6%
02-6712-0812 | 7 | 5.9%
02-2156-0732 | 6 | 5.1%
02-2658-0013 | 6 | 5.1%
02-717-3990 | 6 | 5.1%
1599-3016 | 5 | 4.2%
02-790-6738 | 4 | 3.4%
02-513-2331 | 4 | 3.4%
02-3414-0607 | 3 | 2.5%
Other values (36) | 48 | 40.7%
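
The table above lists `<NA>` 20 times (16.9%), yet the variable reports 0 missing values and a minimum length of 4. That pattern is consistent with the placeholder being stored as the literal four-character string `<NA>` rather than as a true null, although the report does not confirm this. A minimal sketch of counting such placeholders as missing follows; the placeholder set and the function name `count_missing` are assumptions for illustration.

```python
# Hypothetical placeholder set; adjust to whatever the source file actually uses.
PLACEHOLDERS = {"<NA>", "NA", "N/A", ""}

def count_missing(values, placeholders=PLACEHOLDERS):
    """Count entries that are None or match a placeholder string."""
    return sum(1 for v in values if v is None or v.strip() in placeholders)

# Toy column: two phone numbers from the table above and two placeholders.
column = ["02-466-4557", "<NA>", "02-6712-0812", "<NA>"]
print(count_missing(column))  # 2
```

Normalizing placeholders before profiling would move these rows from the value counts into the Missing statistics.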

Length

[Figure: histogram of lengths of the category]

주소-시도구분 (address: province/city)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 4
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count
서울시 | 105
경기도 | 9
서울 | 3
제주도 | 1

Length

Max length: 3
Median length: 3
Mean length: 2.9745763
Min length: 2

Unique

Unique: 1
Unique (%): 0.8%
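
The report distinguishes Distinct (how many different values occur) from Unique (how many values occur exactly once). The sketch below reproduces both figures for 주소-시도구분 from its reported value counts, under that reading of the two terms.

```python
from collections import Counter

# Value counts reported for 주소-시도구분.
counts = Counter({"서울시": 105, "경기도": 9, "서울": 3, "제주도": 1})
n = sum(counts.values())  # 118 observations

distinct = len(counts)                              # different values seen
unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once

print(distinct, f"{distinct / n:.1%}")  # 4 3.4%
print(unique, f"{unique / n:.1%}")      # 1 0.8%
```

Only 제주도 occurs once, so it is the single unique value.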

Sample

1st row: 서울시
2nd row: 서울시
3rd row: 서울시
4th row: 서울시
5th row: 서울시

Common Values

Value | Count | Frequency (%)
서울시 | 105 | 89.0%
경기도 | 9 | 7.6%
서울 | 3 | 2.5%
제주도 | 1 | 0.8%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

주소-세부주소 (address: detail)
Categorical

HIGH CORRELATION

Distinct: 47
Distinct (%): 39.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count
강남구 신사동 665-1 한양타운 2층 | 9
서초구 서초동 1696-13번지 애니빌딩 5층 | 8
강남구 신사동 588-8번지 카라코람 빌딩 3층 | 7
동작구 신대방동 395-70 전문건설회관 23층 | 7
중구 회현동 1가 100-83번지 ㈜디케이 | 6
Other values (42) | 81

Length

Max length: 38
Median length: 32.5
Mean length: 24.525424
Min length: 14

Unique

Unique: 25
Unique (%): 21.2%

Sample

1st row: 강남구 논현동 231-13번지 팍스타워 8층
2nd row: 강남구 삼성로 133길 12 청담동 백운빌딩 2층 면세사업부
3rd row: 서초구 효령로 317 대한건축사협회 빌딩 5층 ㈜리노스
4th row: 서초구 서초동 1446-11번지 현대슈퍼빌 상가동 301호
5th row: 영등포구 국제금융로 70 미원빌딩 1504-1호

Common Values

Value | Count | Frequency (%)
강남구 신사동 665-1 한양타운 2층 | 9 | 7.6%
서초구 서초동 1696-13번지 애니빌딩 5층 | 8 | 6.8%
강남구 신사동 588-8번지 카라코람 빌딩 3층 | 7 | 5.9%
동작구 신대방동 395-70 전문건설회관 23층 | 7 | 5.9%
중구 회현동 1가 100-83번지 ㈜디케이 | 6 | 5.1%
강서구 가양 1동 192-12 | 6 | 5.1%
강남구 논현동 50번지 삼익전자빌딩 6층 | 6 | 5.1%
강남구 봉은사로 44길 62 역삼동 룩옵틱스 빌딩 본관2층 AS팀 | 5 | 4.2%
강남구 언주로 609 팍스타워 B동 지하1층 | 5 | 4.2%
강남구 삼성동 145-18 구구빌딩 8층 135-090 | 4 | 3.4%
Other values (37) | 55 | 46.6%

Length

[Figure: histogram of lengths of the category]

Value | Count | Frequency (%)
강남구 | 58 | 8.9%
신사동 | 17 | 2.6%
2층 | 16 | 2.5%
빌딩 | 15 | 2.3%
5층 | 11 | 1.7%
역삼동 | 11 | 1.7%
서초구 | 10 | 1.5%
3층 | 10 | 1.5%
논현동 | 10 | 1.5%
665-1 | 9 | 1.4%
Other values (182) | 484 | 74.3%

Correlations

Variable | 품종 | 연착처 | 주소-시도구분 | 주소-세부주소
품종 | 1.000 | 1.000 | 0.589 | 0.991
연착처 | 1.000 | 1.000 | 1.000 | 1.000
주소-시도구분 | 0.589 | 1.000 | 1.000 | 1.000
주소-세부주소 | 0.991 | 1.000 | 1.000 | 1.000

Variable | 품종 | 연착처 | 주소-시도구분 | 주소-세부주소
품종 | 1.000 | 0.755 | 0.514 | 0.746
연착처 | 0.755 | 1.000 | 0.751 | 0.964
주소-시도구분 | 0.514 | 0.751 | 1.000 | 0.789
주소-세부주소 | 0.746 | 0.964 | 0.789 | 1.000

Variable | 품종 | 연착처 | 주소-시도구분 | 주소-세부주소
품종 | 1.000 | 0.755 | 0.514 | 0.746
연착처 | 0.755 | 1.000 | 0.751 | 0.964
주소-시도구분 | 0.514 | 0.751 | 1.000 | 0.789
주소-세부주소 | 0.746 | 0.964 | 0.789 | 1.000
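
The matrices above report pairwise association between categorical variables, but the extracted text does not name the measure behind each one. One common choice for categorical pairs is Cramér's V, sketched below in its uncorrected form with the standard library only. The toy inputs are hypothetical and will not reproduce the reported numbers, which would require the full dataset and the exact method used by the tool.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equal-length sequences of categories.
    Returns a value in [0, 1]; 0 = no association, 1 = perfect association."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row = Counter(xs)
    col = Counter(ys)
    k = min(len(row), len(col)) - 1
    if n == 0 or k == 0:
        return 0.0
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * k))

# Hypothetical toy data: the second variable is fully determined by the first.
print(cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"]))  # 1.0
# Hypothetical toy data: the two variables are independent.
print(cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"]))  # 0.0
```

With many categories and few rows, as for 연착처 and 주소-세부주소 here, uncorrected values tend to be inflated, which is why bias-corrected variants are often preferred.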

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 품종 | 브랜드명 | 연착처 | 주소-시도구분 | 주소-세부주소
0 | 패션 | Longchamp | 02-513-2278 | 서울시 | 강남구 논현동 231-13번지 팍스타워 8층
1 | 패션 | Etro | 02-3018-2308 | 서울시 | 강남구 삼성로 133길 12 청담동 백운빌딩 2층 면세사업부
2 | 패션 | Kipling | 02-3489-6415 | 서울시 | 서초구 효령로 317 대한건축사협회 빌딩 5층 ㈜리노스
3 | 패션 | Louis Quatorze | 02-582-5271 | 서울시 | 서초구 서초동 1446-11번지 현대슈퍼빌 상가동 301호
4 | 패션 | PRIMA CLASSE | 02-761-0891 | 서울시 | 영등포구 국제금융로 70 미원빌딩 1504-1호
5 | 패션 | BEANPOLE | 02-3702-7915 | 경기도 | 김포시 고천읍 전호리 743번지 삼성물류센터
6 | 패션 | Daks | 1544-5114 | 경기도 | 군포시 산본동 1026-19번지 LF CS 센터
7 | 패션 | J.ESTINA | 031-8028-5577 | 경기도 | 광주시 장지9길 48 제이에스티나 물류센터
8 | 패션 | COURONNE | 031-218-9612 | 경기도 | 수원시 영통구 원천동 380-1번지
9 | 패션 | S.T.DUPONT | 02-2106-3418 | 서울시 | 강남구 논현로 149길 11 세중빌딩 1층

Index | 품종 | 브랜드명 | 연착처 | 주소-시도구분 | 주소-세부주소
108 | 선글라스 | adidas | 02-717-3990 | 서울시 | 중구 회현동 1가 100-83번지 ㈜디케이
109 | 선글라스 | BALLY | 02-717-3990 | 서울시 | 중구 회현동 1가 100-83번지 ㈜디케이
110 | 선글라스 | BALMAIN | 02-717-3990 | 서울시 | 중구 회현동 1가 100-83번지 ㈜디케이
111 | 선글라스 | KENZO | 02-717-3990 | 서울시 | 중구 회현동 1가 100-83번지 ㈜디케이
112 | 선글라스 | Shiseido | 02-717-3990 | 서울시 | 중구 회현동 1가 100-83번지 ㈜디케이
113 | 선글라스 | S.T.DUPONT | 02-717-3990 | 서울시 | 중구 회현동 1가 100-83번지 ㈜디케이
114 | 선글라스 | RUDY PROJECT | 02-563-8264 | 서울시 | 강남구 봉은사로 20길 14 지하 2층(역삼동, 주미빌딩)
115 | 선글라스 | TAGHeuer | 02-563-8264 | 서울시 | 강남구 봉은사로 20길 14 지하 2층(역삼동, 주미빌딩)
116 | 완구 | 신우토이 | <NA> | 서울시 | 마포구 서교동 465-9 평화빌딩 3층
117 | 완구 | 테디베어 | 064-733-4627 | 제주도 | 서귀포시 도홍동 130-1번지 인화빌딩 1층

Duplicate rows

Most frequently occurring

Index | 품종 | 브랜드명 | 연착처 | 주소-시도구분 | 주소-세부주소 | # duplicates
0 | 액세서리 | Facco | 02-3414-0607 | 서울 | 강남구 역삼동 840 한은빌딩 2층 | 2
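
A minimal sketch of how such a duplicate summary could be computed with the standard library: group identical rows, keep those seen more than once, and divide by the row count. The function name `find_duplicates` and the 116 filler rows are hypothetical and only pad the example to the report's 118 observations; the duplicated row itself is taken from the table above.

```python
from collections import Counter

def find_duplicates(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    return {row: c for row, c in Counter(rows).items() if c > 1}

# The duplicated row from the table above, stored twice.
dup_row = ("액세서리", "Facco", "02-3414-0607", "서울", "강남구 역삼동 840 한은빌딩 2층")
# Hypothetical filler rows so the total matches the 118 observations.
rows = [dup_row, dup_row] + [("filler", str(i), "", "", "") for i in range(116)]

dupes = find_duplicates(rows)
print(dupes[dup_row])                    # 2
print(f"{len(dupes) / len(rows):.1%}")   # 0.8%
```

One duplicated row out of 118 gives the 0.8% shown in the Overview.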