Overview

Dataset statistics

Number of variables5
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory41.3 B

Variable types

Text3
Categorical2

Alerts

cstmr_intrst_anals_info_cn is highly overall correlated with tursm_cstmr_tyHigh correlation
tursm_cstmr_ty is highly overall correlated with cstmr_intrst_anals_info_cnHigh correlation
trrsrt_cd has unique valuesUnique

Reproduction

Analysis started2023-12-10 09:43:58.504554
Analysis finished2023-12-10 09:43:59.756530
Duration1.25 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

trrsrt_cd
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T18:44:00.064549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1200
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st rowP00000000008
2nd rowP00000002553
3rd rowP00000000051
4th rowP00000000052
5th rowP00000000053
ValueCountFrequency (%)
p00000000008 1
 
1.0%
p00000001162 1
 
1.0%
p00000001275 1
 
1.0%
p00000001274 1
 
1.0%
p00000001232 1
 
1.0%
p00000001231 1
 
1.0%
p00000001230 1
 
1.0%
p00000001229 1
 
1.0%
p00000001220 1
 
1.0%
p00000001177 1
 
1.0%
Other values (90) 90
90.0%
2023-12-10T18:44:00.826118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 789
65.8%
P 100
 
8.3%
1 66
 
5.5%
7 54
 
4.5%
2 36
 
3.0%
3 31
 
2.6%
4 27
 
2.2%
8 26
 
2.2%
5 26
 
2.2%
9 24
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1100
91.7%
Uppercase Letter 100
 
8.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 789
71.7%
1 66
 
6.0%
7 54
 
4.9%
2 36
 
3.3%
3 31
 
2.8%
4 27
 
2.5%
8 26
 
2.4%
5 26
 
2.4%
9 24
 
2.2%
6 21
 
1.9%
Uppercase Letter
ValueCountFrequency (%)
P 100
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1100
91.7%
Latin 100
 
8.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 789
71.7%
1 66
 
6.0%
7 54
 
4.9%
2 36
 
3.3%
3 31
 
2.8%
4 27
 
2.5%
8 26
 
2.4%
5 26
 
2.4%
9 24
 
2.2%
6 21
 
1.9%
Latin
ValueCountFrequency (%)
P 100
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1200
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 789
65.8%
P 100
 
8.3%
1 66
 
5.5%
7 54
 
4.5%
2 36
 
3.0%
3 31
 
2.6%
4 27
 
2.2%
8 26
 
2.2%
5 26
 
2.2%
9 24
 
2.0%
Distinct99
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T18:44:01.296609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length21
Mean length14.73
Min length2

Characters and Unicode

Total characters1473
Distinct characters230
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)98.0%

Sample

1st row[강릉]커피커퍼박물관(경포점)
2nd row[강원]속초_한화워터피아_로우시즌이용권(소인)
3rd row[강릉] 정동심곡바다부채길
4th row[강릉]커피커퍼박물관(왕산점or경포점)
5th row커피커퍼 전지점 음료이용권
ValueCountFrequency (%)
속초 12
 
7.5%
강원 8
 
5.0%
망상해수욕장 8
 
5.0%
강릉 5
 
3.1%
눈꽃축제 4
 
2.5%
양주 4
 
2.5%
강릉]애니멀스토리_입장(일반 2
 
1.2%
파로호 2
 
1.2%
ar 2
 
1.2%
파라솔or튜브 2
 
1.2%
Other values (108) 112
69.6%
2023-12-10T18:44:02.103467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
[ 81
 
5.5%
] 81
 
5.5%
61
 
4.1%
38
 
2.6%
_ 38
 
2.6%
37
 
2.5%
) 34
 
2.3%
( 34
 
2.3%
33
 
2.2%
33
 
2.2%
Other values (220) 1003
68.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1098
74.5%
Open Punctuation 115
 
7.8%
Close Punctuation 115
 
7.8%
Space Separator 61
 
4.1%
Connector Punctuation 38
 
2.6%
Decimal Number 17
 
1.2%
Other Punctuation 8
 
0.5%
Lowercase Letter 8
 
0.5%
Uppercase Letter 7
 
0.5%
Dash Punctuation 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
3.5%
37
 
3.4%
33
 
3.0%
33
 
3.0%
30
 
2.7%
24
 
2.2%
23
 
2.1%
23
 
2.1%
22
 
2.0%
20
 
1.8%
Other values (194) 815
74.2%
Decimal Number
ValueCountFrequency (%)
2 6
35.3%
1 3
17.6%
4 2
 
11.8%
6 1
 
5.9%
8 1
 
5.9%
5 1
 
5.9%
3 1
 
5.9%
0 1
 
5.9%
9 1
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
A 2
28.6%
R 2
28.6%
I 1
14.3%
N 1
14.3%
P 1
14.3%
Open Punctuation
ValueCountFrequency (%)
[ 81
70.4%
( 34
29.6%
Close Punctuation
ValueCountFrequency (%)
] 81
70.4%
) 34
29.6%
Other Punctuation
ValueCountFrequency (%)
/ 5
62.5%
& 3
37.5%
Lowercase Letter
ValueCountFrequency (%)
o 4
50.0%
r 4
50.0%
Space Separator
ValueCountFrequency (%)
61
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 38
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1098
74.5%
Common 360
 
24.4%
Latin 15
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
3.5%
37
 
3.4%
33
 
3.0%
33
 
3.0%
30
 
2.7%
24
 
2.2%
23
 
2.1%
23
 
2.1%
22
 
2.0%
20
 
1.8%
Other values (194) 815
74.2%
Common
ValueCountFrequency (%)
[ 81
22.5%
] 81
22.5%
61
16.9%
_ 38
10.6%
) 34
9.4%
( 34
9.4%
2 6
 
1.7%
/ 5
 
1.4%
- 4
 
1.1%
& 3
 
0.8%
Other values (9) 13
 
3.6%
Latin
ValueCountFrequency (%)
o 4
26.7%
r 4
26.7%
A 2
13.3%
R 2
13.3%
I 1
 
6.7%
N 1
 
6.7%
P 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1098
74.5%
ASCII 375
 
25.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
[ 81
21.6%
] 81
21.6%
61
16.3%
_ 38
10.1%
) 34
9.1%
( 34
9.1%
2 6
 
1.6%
/ 5
 
1.3%
- 4
 
1.1%
o 4
 
1.1%
Other values (16) 27
 
7.2%
Hangul
ValueCountFrequency (%)
38
 
3.5%
37
 
3.4%
33
 
3.0%
33
 
3.0%
30
 
2.7%
24
 
2.2%
23
 
2.1%
23
 
2.1%
22
 
2.0%
20
 
1.8%
Other values (194) 815
74.2%
Distinct57
Distinct (%)57.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T18:44:02.742402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length24
Mean length18.95
Min length3

Characters and Unicode

Total characters1895
Distinct characters153
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)40.0%

Sample

1st row강원도 강릉시 강문동 146-7
2nd row강원도 속초시 미시령로2983번길 111
3rd row강원도 강릉시 강동면 헌화로 950-39
4th row강원도 강릉시 왕산면 왕산로 2171-19
5th row강원도 강릉시 강문동 146-7
ValueCountFrequency (%)
강원도 32
 
7.5%
동해시 30
 
7.0%
속초시 24
 
5.6%
강원 20
 
4.7%
동해대로 19
 
4.4%
강릉시 19
 
4.4%
6270-10 17
 
4.0%
16 9
 
2.1%
고성군 7
 
1.6%
토성면 7
 
1.6%
Other values (130) 243
56.9%
2023-12-10T18:44:03.602265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
330
 
17.4%
1 98
 
5.2%
91
 
4.8%
82
 
4.3%
78
 
4.1%
64
 
3.4%
2 61
 
3.2%
57
 
3.0%
56
 
3.0%
0 49
 
2.6%
Other values (143) 929
49.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1073
56.6%
Decimal Number 403
 
21.3%
Space Separator 330
 
17.4%
Dash Punctuation 45
 
2.4%
Open Punctuation 18
 
0.9%
Close Punctuation 18
 
0.9%
Uppercase Letter 6
 
0.3%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
91
 
8.5%
82
 
7.6%
78
 
7.3%
64
 
6.0%
57
 
5.3%
56
 
5.2%
40
 
3.7%
31
 
2.9%
29
 
2.7%
27
 
2.5%
Other values (127) 518
48.3%
Decimal Number
ValueCountFrequency (%)
1 98
24.3%
2 61
15.1%
0 49
12.2%
6 38
 
9.4%
7 37
 
9.2%
4 34
 
8.4%
3 30
 
7.4%
5 21
 
5.2%
8 20
 
5.0%
9 15
 
3.7%
Space Separator
ValueCountFrequency (%)
330
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 45
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 6
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1073
56.6%
Common 816
43.1%
Latin 6
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
91
 
8.5%
82
 
7.6%
78
 
7.3%
64
 
6.0%
57
 
5.3%
56
 
5.2%
40
 
3.7%
31
 
2.9%
29
 
2.7%
27
 
2.5%
Other values (127) 518
48.3%
Common
ValueCountFrequency (%)
330
40.4%
1 98
 
12.0%
2 61
 
7.5%
0 49
 
6.0%
- 45
 
5.5%
6 38
 
4.7%
7 37
 
4.5%
4 34
 
4.2%
3 30
 
3.7%
5 21
 
2.6%
Other values (5) 73
 
8.9%
Latin
ValueCountFrequency (%)
B 6
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1073
56.6%
ASCII 822
43.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
330
40.1%
1 98
 
11.9%
2 61
 
7.4%
0 49
 
6.0%
- 45
 
5.5%
6 38
 
4.6%
7 37
 
4.5%
4 34
 
4.1%
3 30
 
3.6%
5 21
 
2.6%
Other values (6) 79
 
9.6%
Hangul
ValueCountFrequency (%)
91
 
8.5%
82
 
7.6%
78
 
7.3%
64
 
6.0%
57
 
5.3%
56
 
5.2%
40
 
3.7%
31
 
2.9%
29
 
2.7%
27
 
2.5%
Other values (127) 518
48.3%

tursm_cstmr_ty
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)31.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
가족-가족여행
15 
가족-가족여행, 그룹-친척
14 
계절 휴가-여름휴가
11 
그룹-동아리, 친구
 
5
가족-가족여행, 그룹-친척, 친구
 
5
Other values (26)
50 

Length

Max length51
Median length33
Mean length17.72
Min length6

Unique

Unique16 ?
Unique (%)16.0%

Sample

1st row당일치기여행
2nd row가족-가족여행, 그룹-직장동료, 그룹-친척, 연인-연인여행, 친구
3rd row가족-가족여행, 개별, 그룹-동아리, 그룹-직장동료, 당일치기여행, 연인-연인여행
4th row당일치기여행
5th row당일치기여행

Common Values

ValueCountFrequency (%)
가족-가족여행 15
15.0%
가족-가족여행, 그룹-친척 14
14.0%
계절 휴가-여름휴가 11
 
11.0%
그룹-동아리, 친구 5
 
5.0%
가족-가족여행, 그룹-친척, 친구 5
 
5.0%
가족-가족여행, 계절 휴가-여름휴가, 그룹-친척, 친구 5
 
5.0%
가족-가족여행, 계절 휴가-겨울휴가 4
 
4.0%
가족-가족여행, 계절 휴가-여름휴가, 그룹-친척, 장기여행 4
 
4.0%
가족-가족여행, 연인-연인여행 4
 
4.0%
당일치기여행 4
 
4.0%
Other values (21) 29
29.0%

Length

2023-12-10T18:44:03.856632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
가족-가족여행 69
25.9%
그룹-친척 33
12.4%
계절 30
11.3%
친구 27
 
10.2%
휴가-여름휴가 21
 
7.9%
연인-연인여행 20
 
7.5%
그룹-동아리 14
 
5.3%
연인-데이트코스 12
 
4.5%
휴가-겨울휴가 7
 
2.6%
그룹-직장동료 7
 
2.6%
Other values (10) 26
 
9.8%

cstmr_intrst_anals_info_cn
Categorical

HIGH CORRELATION 

Distinct20
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
레포츠
18 
자연 및 풍경 감상-자연
15 
교육-인문
15 
장난
11 
맛집-카페
Other values (15)
34 

Length

Max length71
Median length51
Mean length10.89
Min length2

Unique

Unique10 ?
Unique (%)10.0%

Sample

1st row맛집-카페
2nd row휴식/휴양
3rd row자연 및 풍경 감상-관광지, 자연 및 풍경 감상-자연
4th row맛집-카페
5th row맛집-카페

Common Values

ValueCountFrequency (%)
레포츠 18
18.0%
자연 및 풍경 감상-자연 15
15.0%
교육-인문 15
15.0%
장난 11
11.0%
맛집-카페 7
 
7.0%
자연 및 풍경 감상-관광지, 자연 및 풍경 감상-자연 6
 
6.0%
자연 및 풍경 감상-자연, 장난 6
 
6.0%
휴식/휴양 6
 
6.0%
교육-인문, 자연 및 풍경 감상-자연, 장난 4
 
4.0%
맛집-술집, 맛집-한식 2
 
2.0%
Other values (10) 10
10.0%

Length

2023-12-10T18:44:04.146972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
자연 48
16.7%
48
16.7%
풍경 48
16.7%
감상-자연 34
11.8%
장난 24
8.4%
교육-인문 20
7.0%
레포츠 18
 
6.3%
감상-관광지 9
 
3.1%
휴식/휴양 8
 
2.8%
맛집-카페 7
 
2.4%
Other values (17) 23
8.0%

Correlations

2023-12-10T18:44:04.341367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
trrsrt_cdtrrsrt_nmtrrsrt_addrtursm_cstmr_tycstmr_intrst_anals_info_cn
trrsrt_cd1.0001.0001.0001.0001.000
trrsrt_nm1.0001.0001.0001.0001.000
trrsrt_addr1.0001.0001.0001.0000.999
tursm_cstmr_ty1.0001.0001.0001.0001.000
cstmr_intrst_anals_info_cn1.0001.0000.9991.0001.000
2023-12-10T18:44:04.552371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
cstmr_intrst_anals_info_cntursm_cstmr_ty
cstmr_intrst_anals_info_cn1.0000.929
tursm_cstmr_ty0.9291.000
2023-12-10T18:44:04.746621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
tursm_cstmr_tycstmr_intrst_anals_info_cn
tursm_cstmr_ty1.0000.929
cstmr_intrst_anals_info_cn0.9291.000

Missing values

2023-12-10T18:43:59.344860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T18:43:59.660468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

trrsrt_cdtrrsrt_nmtrrsrt_addrtursm_cstmr_tycstmr_intrst_anals_info_cn
0P00000000008[강릉]커피커퍼박물관(경포점)강원도 강릉시 강문동 146-7당일치기여행맛집-카페
1P00000002553[강원]속초_한화워터피아_로우시즌이용권(소인)강원도 속초시 미시령로2983번길 111가족-가족여행, 그룹-직장동료, 그룹-친척, 연인-연인여행, 친구휴식/휴양
2P00000000051[강릉] 정동심곡바다부채길강원도 강릉시 강동면 헌화로 950-39가족-가족여행, 개별, 그룹-동아리, 그룹-직장동료, 당일치기여행, 연인-연인여행자연 및 풍경 감상-관광지, 자연 및 풍경 감상-자연
3P00000000052[강릉]커피커퍼박물관(왕산점or경포점)강원도 강릉시 왕산면 왕산로 2171-19당일치기여행맛집-카페
4P00000000053커피커퍼 전지점 음료이용권강원도 강릉시 강문동 146-7당일치기여행맛집-카페
5P00000000690[강릉] 해마루(할인쿠폰)강원도 강릉시 남항진동 101-23가족-가족여행, 그룹-동아리, 그룹-직장동료맛집-일식
6P00000000703[춘천]소양강스카이워크강원도 춘천시 근화동 영서로 2675가족-가족여행, 당일치기여행, 연인-데이트코스자연 및 풍경 감상-관광지
7P00000002554[강원]속초_한화워터피아_튜브스터골드시즌이용권(통합)강원도 속초시 미시령로2983번길 111가족-가족여행, 그룹-직장동료, 그룹-친척, 연인-연인여행, 친구휴식/휴양
8P00000000705[춘천]애니메이션박물관+토이로봇관강원도 춘천시 서면 박사로 854가족-가족여행, 그룹-동아리, 여행유형-포상휴가TV쇼-드라마, TV쇼-예능, TV쇼-음악회, 개그, 게임-참여, 게임-리액션, 게임-스트리밍, 장난, 품평-드라마, 품평-영화
9P00000000706[춘천]커피음료이용권춘천시 동면 순환대로 1154-105가족-가족여행, 연인-연인여행, 이박자연 및 풍경 감상-관광지, 자연 및 풍경 감상-도시, 자연 및 풍경 감상-인문, 자연 및 풍경 감상-자연
trrsrt_cdtrrsrt_nmtrrsrt_addrtursm_cstmr_tycstmr_intrst_anals_info_cn
90P00000001427[속초]일성설악콘도_온천(대인)강원 고성군 토성면 고성대로 47-24가족-가족여행, 그룹-친척자연 및 풍경 감상-자연
91P00000001428[속초]청초마리나_세일요트(대인)속초시 조양동 1544-5연인-데이트코스, 연인-연인여행휴식/휴양
92P00000001429[속초]켄싱턴리조트설악비치_해수사우나(일반)강원 고성군 토성면 봉포리 40-9가족-가족여행, 그룹-친척자연 및 풍경 감상-자연
93P00000001431[속초]테라크랩팜_입장(대인)강원도 속초시 학사평2길 16가족-가족여행, 연인-연인여행교육-인문, 자연 및 풍경 감상-자연, 장난
94P00000001432[속초]해피아울하우스_입장(대인)강원도 속초시 바람꽃마을길 118가족-가족여행교육-인문
95P00000001434[강릉]강릉통일공원_입장(성인)강원 강릉시 율곡로 1715-38가족-가족여행, 그룹-동아리, 그룹-종교단체자연 및 풍경 감상-인문, 자연 및 풍경 감상-자연
96P00000001435[강릉]경포아쿠아리움_입장(성인)강원 강릉시 난설헌로 131가족-가족여행, 연인-데이트코스장난, 제품 추천-장난감
97P00000001436[강릉]대관령박물관_입장(성인)강원 강릉시 창해로 14번길 51-20가족-가족여행교육-인문
98P00000001437[강릉]동양자수박물관_입장(성인)강원 강릉시 죽헌동 140-2가족-가족여행교육-인문
99P00000001440[강릉]애니멀스토리_입장(일반)강릉시 성산면 성연로 212-10가족-가족여행교육-인문