Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 157 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 9.2 KiB |
Average record size in memory | 59.8 B |
Variable types
Numeric | 2 |
---|---|
Text | 2 |
Categorical | 1 |
DateTime | 1 |
Boolean | 1 |
Dataset
Description | 제주관광정보시스템(VISITJEJU)의 여행일정댓글 정보로 댓글ID, 여행일정, 상위댓글ID, 깊이, 댓글, 등록일시, 사용여부 등의 정보를 제공합니다. |
---|---|
Author | 제주관광공사 |
URL | https://www.data.go.kr/data/15049996/fileData.do |
Reproduction
Analysis started | 2024-03-23 05:36:23.381511 |
---|---|
Analysis finished | 2024-03-23 05:36:25.292954 |
Duration | 1.91 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
댓글ID
Real number (ℝ)
UNIQUE
 
Distinct | 157 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 240.94268 |
Minimum | 112 |
---|---|
Maximum | 441 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 112 |
---|---|
5-th percentile | 130.8 |
Q1 | 169 |
median | 222 |
Q3 | 319 |
95-th percentile | 389.4 |
Maximum | 441 |
Range | 329 |
Interquartile range (IQR) | 150 |
Descriptive statistics
Standard deviation | 87.015931 |
---|---|
Coefficient of variation (CV) | 0.36114786 |
Kurtosis | -0.96647463 |
Mean | 240.94268 |
Median Absolute Deviation (MAD) | 63 |
Skewness | 0.5141605 |
Sum | 37828 |
Variance | 7571.7723 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
249 | 1 | 0.6% |
112 | 1 | 0.6% |
222 | 1 | 0.6% |
226 | 1 | 0.6% |
344 | 1 | 0.6% |
260 | 1 | 0.6% |
323 | 1 | 0.6% |
340 | 1 | 0.6% |
355 | 1 | 0.6% |
143 | 1 | 0.6% |
Other values (147) | 147 |
Value | Count | Frequency (%) |
112 | 1 | |
123 | 1 | |
124 | 1 | |
125 | 1 | |
127 | 1 | |
128 | 1 | |
129 | 1 | |
130 | 1 | |
131 | 1 | |
132 | 1 |
Value | Count | Frequency (%) |
441 | 1 | |
430 | 1 | |
418 | 1 | |
406 | 1 | |
405 | 1 | |
394 | 1 | |
393 | 1 | |
391 | 1 | |
389 | 1 | |
387 | 1 |
여행일정
Text
Distinct | 82 |
---|---|
Distinct (%) | 52.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
Value | Count | Frequency (%) |
여행 | 52 | 13.6% |
제주 | 20 | 5.2% |
제주여행 | 16 | 4.2% |
2박3일 | 11 | 2.9% |
3박4일 | 10 | 2.6% |
겨울제주 | 10 | 2.6% |
여행기 | 8 | 2.1% |
템포 | 8 | 2.1% |
한 | 8 | 2.1% |
쉬어가는 | 8 | 2.1% |
Other values (148) | 231 |
Most occurring characters
Value | Count | Frequency (%) |
228 | 15.7% | |
여 | 100 | 6.9% |
행 | 99 | 6.8% |
제 | 75 | 5.2% |
주 | 74 | 5.1% |
일 | 33 | 2.3% |
3 | 33 | 2.3% |
박 | 32 | 2.2% |
가 | 27 | 1.9% |
이 | 23 | 1.6% |
Other values (211) | 727 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 984 | |
Space Separator | 228 | 15.7% |
Lowercase Letter | 98 | 6.8% |
Decimal Number | 96 | 6.6% |
Other Punctuation | 20 | 1.4% |
Uppercase Letter | 13 | 0.9% |
Math Symbol | 5 | 0.3% |
Connector Punctuation | 4 | 0.3% |
Other Symbol | 1 | 0.1% |
Close Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
여 | 100 | 10.2% |
행 | 99 | 10.1% |
제 | 75 | 7.6% |
주 | 74 | 7.5% |
일 | 33 | 3.4% |
박 | 32 | 3.3% |
가 | 27 | 2.7% |
이 | 23 | 2.3% |
의 | 18 | 1.8% |
한 | 17 | 1.7% |
Other values (159) | 486 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 12 | |
t | 11 | |
i | 10 | 10.2% |
s | 8 | 8.2% |
u | 6 | 6.1% |
c | 5 | 5.1% |
n | 5 | 5.1% |
d | 5 | 5.1% |
y | 5 | 5.1% |
o | 4 | 4.1% |
Other values (12) | 27 |
Decimal Number
Value | Count | Frequency (%) |
3 | 33 | |
2 | 20 | |
4 | 17 | |
1 | 11 | 11.5% |
0 | 6 | 6.2% |
6 | 5 | 5.2% |
5 | 2 | 2.1% |
9 | 1 | 1.0% |
8 | 1 | 1.0% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 2 | |
B | 2 | |
M | 2 | |
L | 2 | |
N | 1 | |
W | 1 | |
E | 1 | |
T | 1 | |
A | 1 |
Other Punctuation
Value | Count | Frequency (%) |
. | 7 | |
! | 5 | |
, | 5 | |
? | 1 | 5.0% |
& | 1 | 5.0% |
/ | 1 | 5.0% |
Space Separator
Value | Count | Frequency (%) |
228 |
Math Symbol
Value | Count | Frequency (%) |
~ | 5 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 4 |
Other Symbol
Value | Count | Frequency (%) |
♥ | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 979 | |
Common | 356 | 24.5% |
Latin | 111 | 7.6% |
Han | 5 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
여 | 100 | 10.2% |
행 | 99 | 10.1% |
제 | 75 | 7.7% |
주 | 74 | 7.6% |
일 | 33 | 3.4% |
박 | 32 | 3.3% |
가 | 27 | 2.8% |
이 | 23 | 2.3% |
의 | 18 | 1.8% |
한 | 17 | 1.7% |
Other values (154) | 481 |
Latin
Value | Count | Frequency (%) |
e | 12 | 10.8% |
t | 11 | 9.9% |
i | 10 | 9.0% |
s | 8 | 7.2% |
u | 6 | 5.4% |
c | 5 | 4.5% |
n | 5 | 4.5% |
d | 5 | 4.5% |
y | 5 | 4.5% |
o | 4 | 3.6% |
Other values (21) | 40 |
Common
Value | Count | Frequency (%) |
228 | ||
3 | 33 | 9.3% |
2 | 20 | 5.6% |
4 | 17 | 4.8% |
1 | 11 | 3.1% |
. | 7 | 2.0% |
0 | 6 | 1.7% |
! | 5 | 1.4% |
~ | 5 | 1.4% |
6 | 5 | 1.4% |
Other values (11) | 19 | 5.3% |
Han
Value | Count | Frequency (%) |
州 | 1 | |
島 | 1 | |
行 | 1 | |
旅 | 1 | |
美 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 979 | |
ASCII | 466 | |
CJK | 5 | 0.3% |
Misc Symbols | 1 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
228 | ||
3 | 33 | 7.1% |
2 | 20 | 4.3% |
4 | 17 | 3.6% |
e | 12 | 2.6% |
t | 11 | 2.4% |
1 | 11 | 2.4% |
i | 10 | 2.1% |
s | 8 | 1.7% |
. | 7 | 1.5% |
Other values (41) | 109 |
Hangul
Value | Count | Frequency (%) |
여 | 100 | 10.2% |
행 | 99 | 10.1% |
제 | 75 | 7.7% |
주 | 74 | 7.6% |
일 | 33 | 3.4% |
박 | 32 | 3.3% |
가 | 27 | 2.8% |
이 | 23 | 2.3% |
의 | 18 | 1.8% |
한 | 17 | 1.7% |
Other values (154) | 481 |
Misc Symbols
Value | Count | Frequency (%) |
♥ | 1 |
CJK
Value | Count | Frequency (%) |
州 | 1 | |
島 | 1 | |
行 | 1 | |
旅 | 1 | |
美 | 1 |
상위댓글ID
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 21 |
---|---|
Distinct (%) | 13.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 45.235669 |
Minimum | 0 |
---|---|
Maximum | 342 |
Zeros | 129 |
Zeros (%) | 82.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 311.6 |
Maximum | 342 |
Range | 342 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 102.24763 |
---|---|
Coefficient of variation (CV) | 2.2603321 |
Kurtosis | 2.3401271 |
Mean | 45.235669 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.0003383 |
Sum | 7102 |
Variance | 10454.579 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 129 | |
297 | 4 | 2.5% |
317 | 4 | 2.5% |
135 | 2 | 1.3% |
159 | 2 | 1.3% |
342 | 1 | 0.6% |
162 | 1 | 0.6% |
257 | 1 | 0.6% |
192 | 1 | 0.6% |
132 | 1 | 0.6% |
Other values (11) | 11 | 7.0% |
Value | Count | Frequency (%) |
0 | 129 | |
132 | 1 | 0.6% |
135 | 2 | 1.3% |
150 | 1 | 0.6% |
152 | 1 | 0.6% |
159 | 2 | 1.3% |
162 | 1 | 0.6% |
192 | 1 | 0.6% |
231 | 1 | 0.6% |
251 | 1 | 0.6% |
Value | Count | Frequency (%) |
342 | 1 | 0.6% |
327 | 1 | 0.6% |
317 | 4 | |
316 | 1 | 0.6% |
314 | 1 | 0.6% |
311 | 1 | 0.6% |
309 | 1 | 0.6% |
308 | 1 | 0.6% |
304 | 1 | 0.6% |
297 | 4 |
깊이
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 1.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
0 | |
---|---|
1 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 1 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 129 | |
1 | 28 | 17.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 129 | |
1 | 28 | 17.8% |
댓글
Text
Distinct | 153 |
---|---|
Distinct (%) | 97.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
Length
Max length | 311 |
---|---|
Median length | 60 |
Mean length | 19.33121 |
Min length | 5 |
Characters and Unicode
Total characters | 3035 |
---|---|
Distinct characters | 470 |
Distinct categories | 12 ? |
Distinct scripts | 4 ? |
Distinct blocks | 4 ? |
Unique
Unique | 149 ? |
---|---|
Unique (%) | 94.9% |
Sample
1st row | fgfdg |
---|---|
2nd row | 2/16 간식 녹차 오프레도 녹차 라떼 그린티 롤케익 호지 초코 다쿠아즈 |
3rd row | 2/16 저녁 |
4th row | 2/17 짱구분식 모닥치기(대) 라면 |
5th row | 2/17 간식 오는정김밥 치즈김밥 |
Value | Count | Frequency (%) |
8 | 1.3% | |
2/17 | 7 | 1.1% |
저녁 | 7 | 1.1% |
제주 | 6 | 1.0% |
2일차 | 5 | 0.8% |
점심 | 5 | 0.8% |
간식 | 5 | 0.8% |
퍼시픽 | 3 | 0.5% |
i | 3 | 0.5% |
좋아요 | 3 | 0.5% |
Other values (497) | 557 |
Most occurring characters
Value | Count | Frequency (%) |
480 | 15.8% | |
1 | 68 | 2.2% |
시 | 53 | 1.7% |
2 | 48 | 1.6% |
~ | 39 | 1.3% |
ㅡ | 38 | 1.3% |
이 | 32 | 1.1% |
0 | 32 | 1.1% |
일 | 31 | 1.0% |
3 | 31 | 1.0% |
Other values (460) | 2183 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1921 | |
Space Separator | 480 | 15.8% |
Decimal Number | 260 | 8.6% |
Lowercase Letter | 182 | 6.0% |
Other Punctuation | 73 | 2.4% |
Math Symbol | 42 | 1.4% |
Open Punctuation | 19 | 0.6% |
Close Punctuation | 18 | 0.6% |
Uppercase Letter | 17 | 0.6% |
Dash Punctuation | 14 | 0.5% |
Other values (2) | 9 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
시 | 53 | 2.8% |
ㅡ | 38 | 2.0% |
이 | 32 | 1.7% |
일 | 31 | 1.6% |
다 | 30 | 1.6% |
주 | 27 | 1.4% |
스 | 25 | 1.3% |
지 | 25 | 1.3% |
제 | 22 | 1.1% |
도 | 21 | 1.1% |
Other values (403) | 1617 |
Lowercase Letter
Value | Count | Frequency (%) |
t | 19 | |
o | 18 | |
i | 17 | |
s | 15 | 8.2% |
e | 14 | 7.7% |
d | 13 | 7.1% |
a | 12 | 6.6% |
f | 11 | 6.0% |
g | 11 | 6.0% |
r | 11 | 6.0% |
Other values (12) | 41 |
Decimal Number
Value | Count | Frequency (%) |
1 | 68 | |
2 | 48 | |
0 | 32 | |
3 | 31 | |
5 | 20 | 7.7% |
7 | 18 | 6.9% |
4 | 14 | 5.4% |
6 | 12 | 4.6% |
8 | 10 | 3.8% |
9 | 7 | 2.7% |
Uppercase Letter
Value | Count | Frequency (%) |
D | 3 | |
P | 3 | |
I | 2 | |
H | 2 | |
T | 2 | |
V | 1 | 5.9% |
C | 1 | 5.9% |
A | 1 | 5.9% |
Y | 1 | 5.9% |
M | 1 | 5.9% |
Other Punctuation
Value | Count | Frequency (%) |
. | 29 | |
? | 14 | |
/ | 10 | 13.7% |
, | 7 | 9.6% |
: | 5 | 6.8% |
! | 5 | 6.8% |
; | 3 | 4.1% |
Math Symbol
Value | Count | Frequency (%) |
~ | 39 | |
> | 3 | 7.1% |
Space Separator
Value | Count | Frequency (%) |
480 |
Open Punctuation
Value | Count | Frequency (%) |
( | 19 |
Close Punctuation
Value | Count | Frequency (%) |
) | 18 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 14 |
Modifier Symbol
Value | Count | Frequency (%) |
^ | 8 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1917 | |
Common | 915 | |
Latin | 199 | 6.6% |
Han | 4 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
시 | 53 | 2.8% |
ㅡ | 38 | 2.0% |
이 | 32 | 1.7% |
일 | 31 | 1.6% |
다 | 30 | 1.6% |
주 | 27 | 1.4% |
스 | 25 | 1.3% |
지 | 25 | 1.3% |
제 | 22 | 1.1% |
도 | 21 | 1.1% |
Other values (399) | 1613 |
Latin
Value | Count | Frequency (%) |
t | 19 | 9.5% |
o | 18 | 9.0% |
i | 17 | 8.5% |
s | 15 | 7.5% |
e | 14 | 7.0% |
d | 13 | 6.5% |
a | 12 | 6.0% |
f | 11 | 5.5% |
g | 11 | 5.5% |
r | 11 | 5.5% |
Other values (22) | 58 |
Common
Value | Count | Frequency (%) |
480 | ||
1 | 68 | 7.4% |
2 | 48 | 5.2% |
~ | 39 | 4.3% |
0 | 32 | 3.5% |
3 | 31 | 3.4% |
. | 29 | 3.2% |
5 | 20 | 2.2% |
( | 19 | 2.1% |
7 | 18 | 2.0% |
Other values (15) | 131 | 14.3% |
Han
Value | Count | Frequency (%) |
行 | 1 | |
旅 | 1 | |
州 | 1 | |
島 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1810 | |
ASCII | 1114 | |
Compat Jamo | 107 | 3.5% |
CJK | 4 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
480 | ||
1 | 68 | 6.1% |
2 | 48 | 4.3% |
~ | 39 | 3.5% |
0 | 32 | 2.9% |
3 | 31 | 2.8% |
. | 29 | 2.6% |
5 | 20 | 1.8% |
t | 19 | 1.7% |
( | 19 | 1.7% |
Other values (47) | 329 |
Hangul
Value | Count | Frequency (%) |
시 | 53 | 2.9% |
이 | 32 | 1.8% |
일 | 31 | 1.7% |
다 | 30 | 1.7% |
주 | 27 | 1.5% |
스 | 25 | 1.4% |
지 | 25 | 1.4% |
제 | 22 | 1.2% |
도 | 21 | 1.2% |
로 | 21 | 1.2% |
Other values (386) | 1523 |
Compat Jamo
Value | Count | Frequency (%) |
ㅡ | 38 | |
ㅇ | 20 | |
ㅋ | 15 | 14.0% |
ㅎ | 14 | 13.1% |
ㄴ | 6 | 5.6% |
ㅁ | 6 | 5.6% |
ㄹ | 2 | 1.9% |
ㄶ | 1 | 0.9% |
ㅓ | 1 | 0.9% |
ㅗ | 1 | 0.9% |
Other values (3) | 3 | 2.8% |
CJK
Value | Count | Frequency (%) |
行 | 1 | |
旅 | 1 | |
州 | 1 | |
島 | 1 |
등록일시
Date
Distinct | 102 |
---|---|
Distinct (%) | 65.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.4 KiB |
Minimum | 2018-05-04 00:00:00 |
---|---|
Maximum | 2024-01-31 00:00:00 |
사용여부
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 289.0 B |
True |
---|
Value | Count | Frequency (%) |
True | 157 |
댓글ID | 여행일정 | 상위댓글ID | 깊이 | |
---|---|---|---|---|
댓글ID | 1.000 | 0.976 | 0.697 | 0.644 |
여행일정 | 0.976 | 1.000 | 0.000 | 0.431 |
상위댓글ID | 0.697 | 0.000 | 1.000 | 1.000 |
깊이 | 0.644 | 0.431 | 1.000 | 1.000 |
댓글ID | 상위댓글ID | 깊이 | |
---|---|---|---|
댓글ID | 1.000 | 0.123 | 0.487 |
상위댓글ID | 0.123 | 1.000 | 0.980 |
깊이 | 0.487 | 0.980 | 1.000 |
댓글ID | 여행일정 | 상위댓글ID | 깊이 | 댓글 | 등록일시 | 사용여부 | |
---|---|---|---|---|---|---|---|
0 | 249 | 하하하 | 0 | 0 | fgfdg | 2020-02-16 | y |
1 | 250 | 겨울제주 | 0 | 0 | 2/16 간식 녹차 오프레도 녹차 라떼 그린티 롤케익 호지 초코 다쿠아즈 | 2020-02-17 | y |
2 | 252 | 겨울제주 | 251 | 1 | 2/16 저녁 | 2020-02-17 | y |
3 | 253 | 겨울제주 | 0 | 0 | 2/17 짱구분식 모닥치기(대) 라면 | 2020-02-17 | y |
4 | 254 | 겨울제주 | 0 | 0 | 2/17 간식 오는정김밥 치즈김밥 | 2020-02-17 | y |
5 | 256 | 겨울제주 | 0 | 0 | 2/17 저녁 제주 해녀짬뽕 특 해녀짬뽕 특 해녀짜장 탕수육(중) | 2020-02-17 | y |
6 | 441 | 웰빙2 | 0 | 0 | 2박 3일 | 2024-01-31 | y |
7 | 430 | 준우가족3박4일여행 | 0 | 0 | 좋음코스별로 잘짬 | 2023-09-19 | y |
8 | 257 | 겨울제주 | 0 | 0 | 2/17 간식 스타벅스 성산D주푸치노(저지방, 에스프레소휘핑 | 2020-02-17 | y |
9 | 273 | 바이크투어 | 0 | 0 | 제주바이크투어 | 2020-11-19 | y |
댓글ID | 여행일정 | 상위댓글ID | 깊이 | 댓글 | 등록일시 | 사용여부 | |
---|---|---|---|---|---|---|---|
147 | 169 | 2박3일 여행기 | 0 | 0 | 좋네요~~~~ | 2018-10-01 | y |
148 | 174 | 10월 제주여행 | 0 | 0 | 1일 운정이네 본점 | 2018-10-28 | y |
149 | 173 | 10월 제주여행 | 0 | 0 | 점심 운정이네 본점 | 2018-10-28 | y |
150 | 175 | 10월 제주여행 | 0 | 0 | 5일 도두반점 | 2018-10-28 | y |
151 | 186 | 겨울 제주의 모든 것, 윈터 제주 여행 | 0 | 0 | 111111 | 2019-01-19 | y |
152 | 200 | 제주 여행 | 0 | 0 | 첫째낭 아침 먹고 바이제주 | 2019-04-24 | y |
153 | 229 | 6666 | 0 | 0 | 허경영!허경영!허경영! | 2019-11-28 | y |
154 | 216 | 삐뽀와 3박4일 제주여행 | 0 | 0 | 케니스토리 1008호 | 2019-06-23 | y |
155 | 217 | 삐뽀와 3박4일 제주여행 | 0 | 0 | 1033호 | 2019-06-27 | y |
156 | 230 | 세식구만의 여행 | 0 | 0 | 한림칼국수 | 2019-12-09 | y |