Overview

Dataset statistics

Number of variables2
Number of observations3963
Missing cells0
Missing cells (%)0.0%
Duplicate rows265
Duplicate rows (%)6.7%
Total size in memory62.1 KiB
Average record size in memory16.0 B

Variable types

Text1
DateTime1

Dataset

Description제주관광정보시스템(VISITJEJU)의 찜(담기)한 여행목록 정보로 여행일정, 등록일시 등의 정보를 제공합니다.
Author제주관광공사
URLhttps://www.data.go.kr/data/15049997/fileData.do

Alerts

Dataset has 265 (6.7%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-23 06:58:02.822741
Analysis finished2024-03-23 06:58:04.259308
Duration1.44 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct217
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Memory size31.1 KiB
2024-03-23T06:58:04.792206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length28
Mean length13.482463
Min length3

Characters and Unicode

Total characters53431
Distinct characters455
Distinct categories13 ?
Distinct scripts6 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)0.6%

Sample

1st row제주에서 버킷리스트 이루기
2nd row제주에서 버킷리스트 이루기
3rd row제주에서 버킷리스트 이루기
4th row제주에서 버킷리스트 이루기
5th row제주에서 버킷리스트 이루기
ValueCountFrequency (%)
여행 1345
 
10.0%
제주 482
 
3.6%
제주여행 307
 
2.3%
뚜벅이 224
 
1.7%
함께하는 220
 
1.6%
3박 200
 
1.5%
4일 200
 
1.5%
제주의 195
 
1.4%
우리가족 182
 
1.4%
제주에서만 178
 
1.3%
Other values (384) 9943
73.8%
2024-03-23T06:58:06.250417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9573
 
17.9%
2285
 
4.3%
2275
 
4.3%
1722
 
3.2%
1651
 
3.1%
1072
 
2.0%
e 840
 
1.6%
809
 
1.5%
801
 
1.5%
t 709
 
1.3%
Other values (445) 31694
59.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34223
64.1%
Space Separator 9573
 
17.9%
Lowercase Letter 6096
 
11.4%
Decimal Number 1711
 
3.2%
Uppercase Letter 922
 
1.7%
Other Punctuation 573
 
1.1%
Close Punctuation 100
 
0.2%
Open Punctuation 96
 
0.2%
Math Symbol 80
 
0.1%
Modifier Symbol 20
 
< 0.1%
Other values (3) 37
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2285
 
6.7%
2275
 
6.6%
1722
 
5.0%
1651
 
4.8%
1072
 
3.1%
809
 
2.4%
801
 
2.3%
592
 
1.7%
580
 
1.7%
573
 
1.7%
Other values (376) 21863
63.9%
Lowercase Letter
ValueCountFrequency (%)
e 840
13.8%
t 709
11.6%
i 563
 
9.2%
s 496
 
8.1%
r 478
 
7.8%
n 364
 
6.0%
u 358
 
5.9%
o 282
 
4.6%
h 272
 
4.5%
p 240
 
3.9%
Other values (14) 1494
24.5%
Uppercase Letter
ValueCountFrequency (%)
J 185
20.1%
S 163
17.7%
T 122
13.2%
M 92
10.0%
N 69
 
7.5%
V 62
 
6.7%
A 54
 
5.9%
D 40
 
4.3%
H 38
 
4.1%
O 32
 
3.5%
Other values (7) 65
 
7.0%
Decimal Number
ValueCountFrequency (%)
3 449
26.2%
4 365
21.3%
2 346
20.2%
1 283
16.5%
0 111
 
6.5%
9 79
 
4.6%
5 39
 
2.3%
8 25
 
1.5%
7 14
 
0.8%
Other Punctuation
ValueCountFrequency (%)
? 188
32.8%
! 174
30.4%
, 92
16.1%
. 62
 
10.8%
# 38
 
6.6%
& 15
 
2.6%
: 4
 
0.7%
Math Symbol
ValueCountFrequency (%)
~ 78
97.5%
+ 1
 
1.2%
= 1
 
1.2%
Close Punctuation
ValueCountFrequency (%)
84
84.0%
) 16
 
16.0%
Open Punctuation
ValueCountFrequency (%)
84
87.5%
( 12
 
12.5%
Space Separator
ValueCountFrequency (%)
9573
100.0%
Modifier Symbol
ValueCountFrequency (%)
^ 20
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 33617
62.9%
Common 12190
 
22.8%
Latin 7018
 
13.1%
Han 520
 
1.0%
Hiragana 64
 
0.1%
Katakana 22
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2285
 
6.8%
2275
 
6.8%
1722
 
5.1%
1651
 
4.9%
1072
 
3.2%
809
 
2.4%
801
 
2.4%
592
 
1.8%
580
 
1.7%
573
 
1.7%
Other values (280) 21257
63.2%
Han
ValueCountFrequency (%)
51
 
9.8%
50
 
9.6%
40
 
7.7%
27
 
5.2%
26
 
5.0%
24
 
4.6%
18
 
3.5%
14
 
2.7%
12
 
2.3%
12
 
2.3%
Other values (55) 246
47.3%
Latin
ValueCountFrequency (%)
e 840
 
12.0%
t 709
 
10.1%
i 563
 
8.0%
s 496
 
7.1%
r 478
 
6.8%
n 364
 
5.2%
u 358
 
5.1%
o 282
 
4.0%
h 272
 
3.9%
p 240
 
3.4%
Other values (31) 2416
34.4%
Common
ValueCountFrequency (%)
9573
78.5%
3 449
 
3.7%
4 365
 
3.0%
2 346
 
2.8%
1 283
 
2.3%
? 188
 
1.5%
! 174
 
1.4%
0 111
 
0.9%
, 92
 
0.8%
84
 
0.7%
Other values (18) 525
 
4.3%
Katakana
ValueCountFrequency (%)
3
13.6%
3
13.6%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (6) 6
27.3%
Hiragana
ValueCountFrequency (%)
10
15.6%
7
10.9%
6
9.4%
5
7.8%
5
7.8%
4
 
6.2%
4
 
6.2%
4
 
6.2%
4
 
6.2%
3
 
4.7%
Other values (5) 12
18.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 33613
62.9%
ASCII 19035
35.6%
CJK 520
 
1.0%
None 168
 
0.3%
Hiragana 64
 
0.1%
Katakana 22
 
< 0.1%
Misc Symbols 5
 
< 0.1%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9573
50.3%
e 840
 
4.4%
t 709
 
3.7%
i 563
 
3.0%
s 496
 
2.6%
r 478
 
2.5%
3 449
 
2.4%
4 365
 
1.9%
n 364
 
1.9%
u 358
 
1.9%
Other values (56) 4840
25.4%
Hangul
ValueCountFrequency (%)
2285
 
6.8%
2275
 
6.8%
1722
 
5.1%
1651
 
4.9%
1072
 
3.2%
809
 
2.4%
801
 
2.4%
592
 
1.8%
580
 
1.7%
573
 
1.7%
Other values (277) 21253
63.2%
None
ValueCountFrequency (%)
84
50.0%
84
50.0%
CJK
ValueCountFrequency (%)
51
 
9.8%
50
 
9.6%
40
 
7.7%
27
 
5.2%
26
 
5.0%
24
 
4.6%
18
 
3.5%
14
 
2.7%
12
 
2.3%
12
 
2.3%
Other values (55) 246
47.3%
Hiragana
ValueCountFrequency (%)
10
15.6%
7
10.9%
6
9.4%
5
7.8%
5
7.8%
4
 
6.2%
4
 
6.2%
4
 
6.2%
4
 
6.2%
3
 
4.7%
Other values (5) 12
18.8%
Misc Symbols
ValueCountFrequency (%)
5
100.0%
Katakana
ValueCountFrequency (%)
3
13.6%
3
13.6%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (6) 6
27.3%
Compat Jamo
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Distinct161
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size31.1 KiB
Minimum2016-12-25 00:00:00
Maximum2023-09-28 00:00:00
2024-03-23T06:58:06.784984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T06:58:07.424613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Missing values

2024-03-23T06:58:03.785701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T06:58:04.114845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

여행일정등록일시
0제주에서 버킷리스트 이루기2018-04-19
1제주에서 버킷리스트 이루기2018-04-19
2제주에서 버킷리스트 이루기2018-04-19
3제주에서 버킷리스트 이루기2018-04-19
4제주에서 버킷리스트 이루기2018-04-19
5제주에서 버킷리스트 이루기2018-04-19
6제주에서 버킷리스트 이루기2018-04-19
7제주에서 버킷리스트 이루기2018-04-19
8제주에서 버킷리스트 이루기2018-04-19
9제주에서 버킷리스트 이루기2018-04-19
여행일정등록일시
3953부모님과 가족여행2021-05-24
3954부모님과 가족여행2021-05-24
3955부모님과 가족여행2021-05-24
3956부모님과 가족여행2021-05-24
3957부모님과 가족여행2021-05-24
3958부모님과 가족여행2021-05-24
3959부모님과 가족여행2021-05-24
3960부모님과 가족여행2021-05-24
3961부모님과 가족여행2021-05-24
3962엄마랑 함께하는 동쪽마을 여행2023-09-28

Duplicate rows

Most frequently occurring

여행일정등록일시# duplicates
256한 템포 쉬어가는 여행2018-05-18139
184오직 제주에서만 가능한 여행2018-12-21118
245초등학생 어린이 맞춤 여행2018-12-20102
194우리가족 힐링 여행2019-11-2795
163박 4일 제주여행2019-07-0494
171엄마랑 함께하는 동쪽마을 여행2018-09-1376
120동남지역 뚜벅이 여행2018-05-1075
251트렌디한 제주의 감성 소품점 투어리스트2018-12-2071
155서북지역 뚜벅이 여행2018-05-1063
124동북지역 뚜벅이 여행2018-05-1062