Overview

Dataset statistics

Number of variables5
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.0 KiB
Average record size in memory41.3 B

Variable types

Text2
Categorical3

Alerts

trrsrt_addr is highly overall correlated with tursm_cstmr_ty and 1 other fieldsHigh correlation
tursm_cstmr_ty is highly overall correlated with trrsrt_addr and 1 other fieldsHigh correlation
cstmr_intrst_anals_info_cn is highly overall correlated with trrsrt_addr and 1 other fieldsHigh correlation
trrsrt_cd has unique valuesUnique
trrsrt_nm has unique valuesUnique

Reproduction

Analysis started2023-12-10 10:09:20.524152
Analysis finished2023-12-10 10:09:21.451741
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

trrsrt_cd
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:09:21.771703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1200
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st rowP00000000907
2nd rowP00000002560
3rd rowP00000000909
4th rowP00000000910
5th rowP00000000911
ValueCountFrequency (%)
p00000000907 1
 
1.0%
p00000001052 1
 
1.0%
p00000001063 1
 
1.0%
p00000001062 1
 
1.0%
p00000001061 1
 
1.0%
p00000001060 1
 
1.0%
p00000001059 1
 
1.0%
p00000001058 1
 
1.0%
p00000001057 1
 
1.0%
p00000001056 1
 
1.0%
Other values (90) 90
90.0%
2023-12-10T19:09:22.356280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 822
68.5%
P 100
 
8.3%
1 85
 
7.1%
9 59
 
4.9%
5 23
 
1.9%
6 22
 
1.8%
4 21
 
1.8%
7 18
 
1.5%
2 17
 
1.4%
3 17
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1100
91.7%
Uppercase Letter 100
 
8.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 822
74.7%
1 85
 
7.7%
9 59
 
5.4%
5 23
 
2.1%
6 22
 
2.0%
4 21
 
1.9%
7 18
 
1.6%
2 17
 
1.5%
3 17
 
1.5%
8 16
 
1.5%
Uppercase Letter
ValueCountFrequency (%)
P 100
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1100
91.7%
Latin 100
 
8.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 822
74.7%
1 85
 
7.7%
9 59
 
5.4%
5 23
 
2.1%
6 22
 
2.0%
4 21
 
1.9%
7 18
 
1.6%
2 17
 
1.5%
3 17
 
1.5%
8 16
 
1.5%
Latin
ValueCountFrequency (%)
P 100
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1200
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 822
68.5%
P 100
 
8.3%
1 85
 
7.1%
9 59
 
4.9%
5 23
 
1.9%
6 22
 
1.8%
4 21
 
1.8%
7 18
 
1.5%
2 17
 
1.4%
3 17
 
1.4%

trrsrt_nm
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:09:22.711508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length20
Mean length14.14
Min length7

Characters and Unicode

Total characters1414
Distinct characters173
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row[안동]하회마을_성인
2nd row[청도]홍차리에_애프터눈티
3rd row[안동]안동시립민속박물관_성인
4th row[안동]안동전통문화 컨텐츠박물관_성인
5th row[안동]도산서원_성인
ValueCountFrequency (%)
울진]엑스포공원 6
 
4.9%
울진]금강송 4
 
3.3%
영천]보현산 2
 
1.6%
문경]문경새재 2
 
1.6%
문경]불정자연휴양림 2
 
1.6%
문경]오미자 2
 
1.6%
안동]안동전통문화 2
 
1.6%
예천 2
 
1.6%
울진]성류굴_성인 1
 
0.8%
아쿠아리움_소인 1
 
0.8%
Other values (99) 99
80.5%
2023-12-10T19:09:23.395015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
[ 100
 
7.1%
] 100
 
7.1%
_ 96
 
6.8%
75
 
5.3%
59
 
4.2%
43
 
3.0%
40
 
2.8%
26
 
1.8%
26
 
1.8%
23
 
1.6%
Other values (163) 826
58.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1063
75.2%
Open Punctuation 106
 
7.5%
Close Punctuation 106
 
7.5%
Connector Punctuation 96
 
6.8%
Space Separator 23
 
1.6%
Other Punctuation 12
 
0.8%
Decimal Number 8
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
75
 
7.1%
59
 
5.6%
43
 
4.0%
40
 
3.8%
26
 
2.4%
26
 
2.4%
23
 
2.2%
21
 
2.0%
20
 
1.9%
20
 
1.9%
Other values (151) 710
66.8%
Decimal Number
ValueCountFrequency (%)
0 2
25.0%
9 2
25.0%
1 2
25.0%
2 2
25.0%
Open Punctuation
ValueCountFrequency (%)
[ 100
94.3%
( 6
 
5.7%
Close Punctuation
ValueCountFrequency (%)
] 100
94.3%
) 6
 
5.7%
Other Punctuation
ValueCountFrequency (%)
/ 10
83.3%
& 2
 
16.7%
Connector Punctuation
ValueCountFrequency (%)
_ 96
100.0%
Space Separator
ValueCountFrequency (%)
23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1063
75.2%
Common 351
 
24.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
75
 
7.1%
59
 
5.6%
43
 
4.0%
40
 
3.8%
26
 
2.4%
26
 
2.4%
23
 
2.2%
21
 
2.0%
20
 
1.9%
20
 
1.9%
Other values (151) 710
66.8%
Common
ValueCountFrequency (%)
[ 100
28.5%
] 100
28.5%
_ 96
27.4%
23
 
6.6%
/ 10
 
2.8%
) 6
 
1.7%
( 6
 
1.7%
0 2
 
0.6%
9 2
 
0.6%
1 2
 
0.6%
Other values (2) 4
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1063
75.2%
ASCII 351
 
24.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
[ 100
28.5%
] 100
28.5%
_ 96
27.4%
23
 
6.6%
/ 10
 
2.8%
) 6
 
1.7%
( 6
 
1.7%
0 2
 
0.6%
9 2
 
0.6%
1 2
 
0.6%
Other values (2) 4
 
1.1%
Hangul
ValueCountFrequency (%)
75
 
7.1%
59
 
5.6%
43
 
4.0%
40
 
3.8%
26
 
2.4%
26
 
2.4%
23
 
2.2%
21
 
2.0%
20
 
1.9%
20
 
1.9%
Other values (151) 710
66.8%

trrsrt_addr
Categorical

HIGH CORRELATION 

Distinct44
Distinct (%)44.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
경북 울진군 근남면 수산리 63
 
6
경북 포항시 북구 두호동 1017
 
6
경북 울진군 금강송면 십이령로 552
 
4
경북 경주시 흥무로 71
 
3
경상북도 청도군 화양읍 합천리 590
 
3
Other values (39)
78 

Length

Max length30
Median length24
Mean length18.49
Min length13

Unique

Unique7 ?
Unique (%)7.0%

Sample

1st row경북 안동시 풍천면 전서로 186
2nd row경상북도 청도군 화양읍 합천리 590
3rd row경북 안동시 민속촌길 13
4th row경북 안동시 서동문로 203
5th row경북 안동시 도산면 도산서원길 154

Common Values

ValueCountFrequency (%)
경북 울진군 근남면 수산리 63 6
 
6.0%
경북 포항시 북구 두호동 1017 6
 
6.0%
경북 울진군 금강송면 십이령로 552 4
 
4.0%
경북 경주시 흥무로 71 3
 
3.0%
경상북도 청도군 화양읍 합천리 590 3
 
3.0%
경북 영천시 임고면 승마휴양림길 105 3
 
3.0%
경북 울진군 근남면 성류굴로 225 3
 
3.0%
경상북도 영천시 화북면 배나무정길 334 3
 
3.0%
경북 경주시 배동 454-3 3
 
3.0%
경북 경주시 인왕동 517 3
 
3.0%
Other values (34) 63
63.0%

Length

2023-12-10T19:09:23.644608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경북 80
 
16.7%
경상북도 19
 
4.0%
울진군 17
 
3.5%
안동시 17
 
3.5%
경주시 16
 
3.3%
포항시 11
 
2.3%
근남면 10
 
2.1%
상주시 10
 
2.1%
북구 9
 
1.9%
영천시 8
 
1.7%
Other values (115) 283
59.0%

tursm_cstmr_ty
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)29.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
가족-가족여행, 당일치기여행, 연인-데이트코스, 연인-연인여행
15 
가족-가족여행, 당일치기여행, 연인-데이트코스, 연인-연인여행, 친구
여행유형-연휴
가족-가족여행, 당일치기여행, 연인-데이트코스
당일치기여행
 
5
Other values (24)
57 

Length

Max length58
Median length34
Mean length26.04
Min length6

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row개별, 여행유형-연휴, 친구
2nd row개별, 당일치기여행, 여행유형-연휴, 친구
3rd row가족-가족여행, 친구
4th row가족-가족여행, 계절 휴가-겨울휴가, 계절 휴가-여름휴가
5th row가족-가족여행, 계절 휴가-가을휴가, 계절 휴가-봄휴가, 당일치기여행

Common Values

ValueCountFrequency (%)
가족-가족여행, 당일치기여행, 연인-데이트코스, 연인-연인여행 15
 
15.0%
가족-가족여행, 당일치기여행, 연인-데이트코스, 연인-연인여행, 친구 8
 
8.0%
여행유형-연휴 8
 
8.0%
가족-가족여행, 당일치기여행, 연인-데이트코스 7
 
7.0%
당일치기여행 5
 
5.0%
당일치기여행, 여행유형-연휴 5
 
5.0%
가족-가족여행, 개별, 여행유형-연휴 4
 
4.0%
가족-가족여행, 계절 휴가-여름휴가, 여행유형-연휴 3
 
3.0%
개별, 당일치기여행, 여행유형-연휴, 친구 3
 
3.0%
가족-가족여행, 그룹-동아리, 그룹-학교/.단체, 당일치기여행, 연인-데이트코스, 연인-연인여행, 친구 3
 
3.0%
Other values (19) 39
39.0%

Length

2023-12-10T19:09:23.877985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
가족-가족여행 70
19.7%
당일치기여행 64
18.0%
연인-데이트코스 43
12.1%
여행유형-연휴 39
11.0%
연인-연인여행 32
9.0%
친구 30
8.4%
개별 22
 
6.2%
계절 19
 
5.3%
그룹-학교/.단체 9
 
2.5%
휴가-여름휴가 9
 
2.5%
Other values (7) 19
 
5.3%

cstmr_intrst_anals_info_cn
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)11.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
자연 및 풍경 감상-자연
33 
자연 및 풍경 감상-관광지
13 
자연 및 풍경 감상-관광지, 자연 및 풍경 감상-자연
10 
역사 유적지 방문-문화
10 
휴식/휴양
10 
Other values (6)
24 

Length

Max length29
Median length15
Mean length12.13
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자연 및 풍경 감상-관광지, 자연 및 풍경 감상-자연
2nd row자연 및 풍경 감상-자연
3rd row역사 유적지 방문-문화
4th row교육-인문
5th row자연 및 풍경 감상-관광지

Common Values

ValueCountFrequency (%)
자연 및 풍경 감상-자연 33
33.0%
자연 및 풍경 감상-관광지 13
 
13.0%
자연 및 풍경 감상-관광지, 자연 및 풍경 감상-자연 10
 
10.0%
역사 유적지 방문-문화 10
 
10.0%
휴식/휴양 10
 
10.0%
<NA> 8
 
8.0%
맛집-카페 5
 
5.0%
교육-인문 4
 
4.0%
교육-과학 3
 
3.0%
역사 유적지 방문-음식/요리 2
 
2.0%

Length

2023-12-10T19:09:24.134728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
자연 66
19.8%
66
19.8%
풍경 66
19.8%
감상-자연 43
12.9%
감상-관광지 23
 
6.9%
역사 12
 
3.6%
유적지 12
 
3.6%
휴식/휴양 12
 
3.6%
방문-문화 10
 
3.0%
na 8
 
2.4%
Other values (5) 16
 
4.8%

Correlations

2023-12-10T19:09:24.270969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
trrsrt_cdtrrsrt_nmtrrsrt_addrtursm_cstmr_tycstmr_intrst_anals_info_cn
trrsrt_cd1.0001.0001.0001.0001.000
trrsrt_nm1.0001.0001.0001.0001.000
trrsrt_addr1.0001.0001.0001.0001.000
tursm_cstmr_ty1.0001.0001.0001.0001.000
cstmr_intrst_anals_info_cn1.0001.0001.0001.0001.000
2023-12-10T19:09:24.468300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
tursm_cstmr_tycstmr_intrst_anals_info_cntrrsrt_addr
tursm_cstmr_ty1.0000.8970.888
cstmr_intrst_anals_info_cn0.8971.0000.789
trrsrt_addr0.8880.7891.000
2023-12-10T19:09:24.618609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
trrsrt_addrtursm_cstmr_tycstmr_intrst_anals_info_cn
trrsrt_addr1.0000.8880.789
tursm_cstmr_ty0.8881.0000.897
cstmr_intrst_anals_info_cn0.7890.8971.000

Missing values

2023-12-10T19:09:21.193221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:09:21.375244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

trrsrt_cdtrrsrt_nmtrrsrt_addrtursm_cstmr_tycstmr_intrst_anals_info_cn
0P00000000907[안동]하회마을_성인경북 안동시 풍천면 전서로 186개별, 여행유형-연휴, 친구자연 및 풍경 감상-관광지, 자연 및 풍경 감상-자연
1P00000002560[청도]홍차리에_애프터눈티경상북도 청도군 화양읍 합천리 590개별, 당일치기여행, 여행유형-연휴, 친구자연 및 풍경 감상-자연
2P00000000909[안동]안동시립민속박물관_성인경북 안동시 민속촌길 13가족-가족여행, 친구역사 유적지 방문-문화
3P00000000910[안동]안동전통문화 컨텐츠박물관_성인경북 안동시 서동문로 203가족-가족여행, 계절 휴가-겨울휴가, 계절 휴가-여름휴가교육-인문
4P00000000911[안동]도산서원_성인경북 안동시 도산면 도산서원길 154가족-가족여행, 계절 휴가-가을휴가, 계절 휴가-봄휴가, 당일치기여행자연 및 풍경 감상-관광지
5P00000000912[안동]이육사문학관_성인경북 안동시 도산면 백운로 525가족-가족여행, 개별, 여행유형-연휴역사 유적지 방문-문화
6P00000000913[안동]하회마을_소인경북 안동시 풍천면 전서로 186개별, 여행유형-연휴, 친구자연 및 풍경 감상-관광지, 자연 및 풍경 감상-자연
7P00000002561[청도]홍차리에_티릴레이경상북도 청도군 화양읍 합천리 590개별, 당일치기여행, 여행유형-연휴, 친구자연 및 풍경 감상-자연
8P00000000915[안동]안동시립민속박물관_소인경북 안동시 민속촌길 13가족-가족여행, 친구역사 유적지 방문-문화
9P00000000916[안동]안동전통문화 컨텐츠박물관_소인경북 안동시 서동문로 203가족-가족여행, 계절 휴가-겨울휴가, 계절 휴가-여름휴가교육-인문
trrsrt_cdtrrsrt_nmtrrsrt_addrtursm_cstmr_tycstmr_intrst_anals_info_cn
90P00000001087[포항]요트데이_야간/소인경북 포항시 북구 두호동 1017가족-가족여행, 당일치기여행, 연인-데이트코스, 연인-연인여행자연 및 풍경 감상-자연
91P00000001088[포항]영일대 게스트하우스경북 포항시 북구 삼호로 73개별, 삼박, 이박, 일박, 친구휴식/휴양
92P00000001089[영천]운주산승마자연휴양림_성인경북 영천시 임고면 승마휴양림길 105여행유형-연휴자연 및 풍경 감상-자연
93P00000001090[영천]운주산승마자연휴양림_청소년경북 영천시 임고면 승마휴양림길 105여행유형-연휴자연 및 풍경 감상-자연
94P00000001091[영천]운주산승마자연휴양림_소인경북 영천시 임고면 승마휴양림길 105여행유형-연휴자연 및 풍경 감상-자연
95P00000001092[영천]보현산 천문전시체험관_성인/청소년경상북도 영천시 화북면 별빛로 681-32당일치기여행, 여행유형-연휴휴식/휴양
96P00000001093[영천]보현산 천문전시체험관_소인경상북도 영천시 화북면 별빛로 681-32당일치기여행, 여행유형-연휴휴식/휴양
97P00000001094[영천]목재문화체험장_대품경상북도 영천시 화북면 배나무정길 334여행유형-연휴자연 및 풍경 감상-자연
98P00000001095[영천]목재문화체험장_중품경상북도 영천시 화북면 배나무정길 334여행유형-연휴자연 및 풍경 감상-자연
99P00000001096[영천]목재문화체험장_소품경상북도 영천시 화북면 배나무정길 334여행유형-연휴자연 및 풍경 감상-자연