Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory312.5 KiB
Average record size in memory32.0 B

Variable types

Text2
DateTime1

Dataset

Description제주관광정보시스템(VISITJEJU)의 나의콘텐츠로 콘텐츠명, 사용자아이디, 등록일시를 제공합니다.
URLhttps://www.data.go.kr/data/15118433/fileData.do

Reproduction

Analysis started2023-12-12 12:23:18.321658
Analysis finished2023-12-12 12:23:19.059479
Duration0.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1510
Distinct (%)15.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T21:23:19.250916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length46
Mean length7.0829
Min length1

Characters and Unicode

Total characters70829
Distinct characters804
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique652 ?
Unique (%)6.5%

Sample

1st row월정리해변
2nd row지니어스 로사이
3rd row국립제주박물관
4th row중문관광단지
5th row붉은제주
ValueCountFrequency (%)
세계자연유산 221
 
1.8%
카멜리아힐 180
 
1.4%
성산일출봉(unesco 179
 
1.4%
제주 174
 
1.4%
월정리해변 162
 
1.3%
오설록티뮤지엄 161
 
1.3%
사려니숲길 161
 
1.3%
협재해수욕장 158
 
1.3%
섭지코지 155
 
1.2%
테마파크 141
 
1.1%
Other values (1960) 10777
86.4%
2023-12-12T21:23:19.764681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2481
 
3.5%
1793
 
2.5%
1609
 
2.3%
1584
 
2.2%
1369
 
1.9%
1232
 
1.7%
1170
 
1.7%
( 1028
 
1.5%
) 1028
 
1.5%
1018
 
1.4%
Other values (794) 56517
79.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62953
88.9%
Space Separator 2481
 
3.5%
Uppercase Letter 1694
 
2.4%
Open Punctuation 1061
 
1.5%
Close Punctuation 1061
 
1.5%
Decimal Number 543
 
0.8%
Lowercase Letter 540
 
0.8%
Other Punctuation 228
 
0.3%
Math Symbol 164
 
0.2%
Dash Punctuation 53
 
0.1%
Other values (3) 51
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1793
 
2.8%
1609
 
2.6%
1584
 
2.5%
1369
 
2.2%
1232
 
2.0%
1170
 
1.9%
1018
 
1.6%
1015
 
1.6%
881
 
1.4%
822
 
1.3%
Other values (718) 50460
80.2%
Lowercase Letter
ValueCountFrequency (%)
a 75
13.9%
d 74
13.7%
l 69
12.8%
o 63
11.7%
e 47
8.7%
n 30
 
5.6%
i 30
 
5.6%
u 29
 
5.4%
p 19
 
3.5%
m 19
 
3.5%
Other values (14) 85
15.7%
Uppercase Letter
ValueCountFrequency (%)
O 273
16.1%
S 238
14.0%
N 237
14.0%
U 229
13.5%
E 228
13.5%
C 226
13.3%
H 85
 
5.0%
A 67
 
4.0%
B 27
 
1.6%
L 21
 
1.2%
Other values (12) 63
 
3.7%
Decimal Number
ValueCountFrequency (%)
1 154
28.4%
2 78
14.4%
0 75
13.8%
7 56
 
10.3%
3 45
 
8.3%
6 37
 
6.8%
4 31
 
5.7%
8 29
 
5.3%
9 21
 
3.9%
5 17
 
3.1%
Other Punctuation
ValueCountFrequency (%)
, 68
29.8%
/ 46
20.2%
. 43
18.9%
& 31
13.6%
' 16
 
7.0%
· 11
 
4.8%
! 9
 
3.9%
? 4
 
1.8%
Math Symbol
ValueCountFrequency (%)
> 61
37.2%
< 61
37.2%
~ 42
25.6%
Open Punctuation
ValueCountFrequency (%)
( 1028
96.9%
[ 33
 
3.1%
Close Punctuation
ValueCountFrequency (%)
) 1028
96.9%
] 33
 
3.1%
Space Separator
ValueCountFrequency (%)
2481
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 53
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 47
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62920
88.8%
Common 5642
 
8.0%
Latin 2234
 
3.2%
Han 33
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1793
 
2.8%
1609
 
2.6%
1584
 
2.5%
1369
 
2.2%
1232
 
2.0%
1170
 
1.9%
1018
 
1.6%
1015
 
1.6%
881
 
1.4%
822
 
1.3%
Other values (702) 50427
80.1%
Latin
ValueCountFrequency (%)
O 273
12.2%
S 238
10.7%
N 237
10.6%
U 229
10.3%
E 228
10.2%
C 226
10.1%
H 85
 
3.8%
a 75
 
3.4%
d 74
 
3.3%
l 69
 
3.1%
Other values (36) 500
22.4%
Common
ValueCountFrequency (%)
2481
44.0%
( 1028
18.2%
) 1028
18.2%
1 154
 
2.7%
2 78
 
1.4%
0 75
 
1.3%
, 68
 
1.2%
> 61
 
1.1%
< 61
 
1.1%
7 56
 
1.0%
Other values (20) 552
 
9.8%
Han
ValueCountFrequency (%)
4
12.1%
4
12.1%
4
12.1%
4
12.1%
3
9.1%
2
 
6.1%
2
 
6.1%
2
 
6.1%
1
 
3.0%
1
 
3.0%
Other values (6) 6
18.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 62920
88.8%
ASCII 7861
 
11.1%
CJK 32
 
< 0.1%
None 11
 
< 0.1%
Punctuation 4
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2481
31.6%
( 1028
13.1%
) 1028
13.1%
O 273
 
3.5%
S 238
 
3.0%
N 237
 
3.0%
U 229
 
2.9%
E 228
 
2.9%
C 226
 
2.9%
1 154
 
2.0%
Other values (63) 1739
22.1%
Hangul
ValueCountFrequency (%)
1793
 
2.8%
1609
 
2.6%
1584
 
2.5%
1369
 
2.2%
1232
 
2.0%
1170
 
1.9%
1018
 
1.6%
1015
 
1.6%
881
 
1.4%
822
 
1.3%
Other values (702) 50427
80.1%
None
ValueCountFrequency (%)
· 11
100.0%
CJK
ValueCountFrequency (%)
4
12.5%
4
12.5%
4
12.5%
4
12.5%
3
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
Other values (5) 5
15.6%
Punctuation
ValueCountFrequency (%)
2
50.0%
2
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct4397
Distinct (%)44.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T21:23:20.068362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length16
Mean length15.5572
Min length15

Characters and Unicode

Total characters155572
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2163 ?
Unique (%)21.6%

Sample

1st rowUSR_00000026786
2nd rowU000000000003559
3rd rowU000000000011515
4th rowU000000000014085
5th rowU000000000018178
ValueCountFrequency (%)
usr_00000018578 121
 
1.2%
u000000000012216 56
 
0.6%
usr_00000010511 25
 
0.2%
usr_00000026240 22
 
0.2%
u000000000004386 21
 
0.2%
usr_00000020160 21
 
0.2%
u000000000002850 20
 
0.2%
usr_00000014970 20
 
0.2%
u000000000012515 18
 
0.2%
usr_00000018872 17
 
0.2%
Other values (4387) 9659
96.6%
2023-12-12T21:23:20.576489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 89867
57.8%
U 10000
 
6.4%
1 8471
 
5.4%
2 6324
 
4.1%
S 4428
 
2.8%
R 4428
 
2.8%
_ 4428
 
2.8%
6 4208
 
2.7%
8 4039
 
2.6%
3 3974
 
2.6%
Other values (4) 15405
 
9.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 132288
85.0%
Uppercase Letter 18856
 
12.1%
Connector Punctuation 4428
 
2.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 89867
67.9%
1 8471
 
6.4%
2 6324
 
4.8%
6 4208
 
3.2%
8 4039
 
3.1%
3 3974
 
3.0%
4 3949
 
3.0%
5 3855
 
2.9%
7 3839
 
2.9%
9 3762
 
2.8%
Uppercase Letter
ValueCountFrequency (%)
U 10000
53.0%
S 4428
23.5%
R 4428
23.5%
Connector Punctuation
ValueCountFrequency (%)
_ 4428
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 136716
87.9%
Latin 18856
 
12.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 89867
65.7%
1 8471
 
6.2%
2 6324
 
4.6%
_ 4428
 
3.2%
6 4208
 
3.1%
8 4039
 
3.0%
3 3974
 
2.9%
4 3949
 
2.9%
5 3855
 
2.8%
7 3839
 
2.8%
Latin
ValueCountFrequency (%)
U 10000
53.0%
S 4428
23.5%
R 4428
23.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 155572
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 89867
57.8%
U 10000
 
6.4%
1 8471
 
5.4%
2 6324
 
4.1%
S 4428
 
2.8%
R 4428
 
2.8%
_ 4428
 
2.8%
6 4208
 
2.7%
8 4039
 
2.6%
3 3974
 
2.6%
Other values (4) 15405
 
9.9%
Distinct977
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2016-06-04 00:00:00
Maximum2021-09-23 00:00:00
2023-12-12T21:23:20.809462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:23:21.024459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Missing values

2023-12-12T21:23:18.901020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:23:19.012415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

콘텐츠명사용자아이디등록일시
40711월정리해변USR_000000267862018-03-01
68285지니어스 로사이U0000000000035592018-08-08
83676국립제주박물관U0000000000115152019-03-25
92185중문관광단지U0000000000140852019-06-03
93033붉은제주U0000000000181782019-09-13
40718카멜리아힐USR_000000268422018-03-02
65240월정리해변U0000000000034542018-08-03
1837함덕해수욕장USR_000000129612017-05-02
19654제주별빛누리공원USR_000000261862018-01-23
21926중문색달해수욕장USR_000000266522018-02-22
콘텐츠명사용자아이디등록일시
75109한림공원U0000000000103692019-02-19
91336제주커피박물관 바움 BaumU0000000000188632019-09-29
59677제주러브랜드U0000000000035282018-08-05
82602중문관광단지U0000000000132502019-05-14
29821아날로그감귤밭USR_000000252112017-12-12
32051제주항공우주박물관USR_000000264232018-02-08
82304플레이케이팝U0000000000117442019-04-01
70470거문오름(UNESCO 세계자연유산)U0000000000113432019-03-19
62481제주아이브리조트U0000000000048742018-09-09
14019덕성원USR_000000188692017-08-16