Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 6323 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 302.7 KiB |
Average record size in memory | 49.0 B |
Variable types
Numeric | 1 |
---|---|
Text | 2 |
Categorical | 2 |
DateTime | 1 |
Dataset
Description | 경상남도 진주시 관광상품에 대하여 SNS에서 수집한 음식점, 숙소, 관광지, 행사 데이터 및 관광상품 후기 url를 제공합니다. |
---|---|
Author | 경상남도 진주시 |
URL | https://www.data.go.kr/data/15097737/fileData.do |
번호 is highly overall correlated with 관광상품분류 | High correlation |
관광상품분류 is highly overall correlated with 번호 | High correlation |
번호 has unique values | Unique |
홈페이지주소(URL) has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 14:57:22.060168 |
---|---|
Analysis finished | 2023-12-12 14:57:23.045810 |
Duration | 0.99 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 6323 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5003162 |
Minimum | 5000001 |
---|---|
Maximum | 5006323 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 55.7 KiB |
Quantile statistics
Minimum | 5000001 |
---|---|
5-th percentile | 5000317.1 |
Q1 | 5001581.5 |
median | 5003162 |
Q3 | 5004742.5 |
95-th percentile | 5006006.9 |
Maximum | 5006323 |
Range | 6322 |
Interquartile range (IQR) | 3161 |
Descriptive statistics
Standard deviation | 1825.4372 |
---|---|
Coefficient of variation (CV) | 0.00036485671 |
Kurtosis | -1.2 |
Mean | 5003162 |
Median Absolute Deviation (MAD) | 1581 |
Skewness | 0 |
Sum | 3.1634993 × 1010 |
Variance | 3332221 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5005529 | 1 | < 0.1% |
5006094 | 1 | < 0.1% |
5005169 | 1 | < 0.1% |
5005168 | 1 | < 0.1% |
5005167 | 1 | < 0.1% |
5004552 | 1 | < 0.1% |
5002611 | 1 | < 0.1% |
5002153 | 1 | < 0.1% |
5000755 | 1 | < 0.1% |
5005161 | 1 | < 0.1% |
Other values (6313) | 6313 |
Value | Count | Frequency (%) |
5000001 | 1 | |
5000002 | 1 | |
5000003 | 1 | |
5000004 | 1 | |
5000005 | 1 | |
5000006 | 1 | |
5000007 | 1 | |
5000008 | 1 | |
5000009 | 1 | |
5000010 | 1 |
Value | Count | Frequency (%) |
5006323 | 1 | |
5006322 | 1 | |
5006321 | 1 | |
5006320 | 1 | |
5006319 | 1 | |
5006318 | 1 | |
5006317 | 1 | |
5006316 | 1 | |
5006315 | 1 | |
5006314 | 1 |
관광상품명
Text
Distinct | 1608 |
---|---|
Distinct (%) | 25.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 49.5 KiB |
Length
Max length | 192 |
---|---|
Median length | 97 |
Mean length | 7.7031472 |
Min length | 1 |
Characters and Unicode
Total characters | 48707 |
---|---|
Distinct characters | 754 |
Distinct categories | 11 ? |
Distinct scripts | 3 ? |
Distinct blocks | 3 ? |
Unique
Unique | 980 ? |
---|---|
Unique (%) | 15.5% |
Sample
1st row | 진주국제재즈페스티벌 |
---|---|
2nd row | 진주국제재즈페스티벌 |
3rd row | 경해여자고등학교 |
4th row | 에나뮤직오픈마이크 |
5th row | 진주국제재즈페스티벌 |
Value | Count | Frequency (%) |
진주남강유등축제 | 504 | 6.4% |
진주레일바이크놀이공원 | 269 | 3.4% |
진주성 | 256 | 3.2% |
경상남도수목원 | 136 | 1.7% |
진양호 | 134 | 1.7% |
하연옥 | 116 | 1.5% |
진주 | 82 | 1.0% |
본점 | 81 | 1.0% |
월아산 | 79 | 1.0% |
진주익룡발자국전시관 | 74 | 0.9% |
Other values (1821) | 6158 |
Most occurring characters
Value | Count | Frequency (%) |
진 | 2503 | 5.1% |
주 | 2292 | 4.7% |
1569 | 3.2% | |
이 | 1195 | 2.5% |
남 | 888 | 1.8% |
제 | 828 | 1.7% |
호 | 803 | 1.6% |
강 | 765 | 1.6% |
원 | 761 | 1.6% |
스 | 673 | 1.4% |
Other values (744) | 36430 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 45321 | |
Space Separator | 1569 | 3.2% |
Other Punctuation | 751 | 1.5% |
Decimal Number | 235 | 0.5% |
Uppercase Letter | 216 | 0.4% |
Lowercase Letter | 194 | 0.4% |
Open Punctuation | 189 | 0.4% |
Close Punctuation | 189 | 0.4% |
Math Symbol | 24 | < 0.1% |
Dash Punctuation | 12 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
진 | 2503 | 5.5% |
주 | 2292 | 5.1% |
이 | 1195 | 2.6% |
남 | 888 | 2.0% |
제 | 828 | 1.8% |
호 | 803 | 1.8% |
강 | 765 | 1.7% |
원 | 761 | 1.7% |
스 | 673 | 1.5% |
점 | 643 | 1.4% |
Other values (677) | 33970 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 46 | |
M | 29 | |
B | 22 | |
S | 16 | 7.4% |
I | 16 | 7.4% |
O | 14 | 6.5% |
K | 10 | 4.6% |
E | 9 | 4.2% |
C | 8 | 3.7% |
N | 8 | 3.7% |
Other values (13) | 38 |
Lowercase Letter
Value | Count | Frequency (%) |
o | 26 | |
e | 22 | |
t | 19 | |
i | 17 | |
r | 14 | 7.2% |
a | 14 | 7.2% |
f | 13 | 6.7% |
h | 12 | 6.2% |
w | 11 | 5.7% |
c | 10 | 5.2% |
Other values (10) | 36 |
Decimal Number
Value | Count | Frequency (%) |
5 | 46 | |
2 | 41 | |
0 | 35 | |
4 | 27 | |
1 | 26 | |
6 | 18 | 7.7% |
9 | 13 | 5.5% |
7 | 11 | 4.7% |
3 | 11 | 4.7% |
8 | 7 | 3.0% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 641 | |
: | 49 | 6.5% |
& | 27 | 3.6% |
' | 24 | 3.2% |
. | 5 | 0.7% |
! | 4 | 0.5% |
, | 1 | 0.1% |
Math Symbol
Value | Count | Frequency (%) |
> | 12 | |
< | 12 |
Space Separator
Value | Count | Frequency (%) |
1569 |
Open Punctuation
Value | Count | Frequency (%) |
( | 189 |
Close Punctuation
Value | Count | Frequency (%) |
) | 189 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 12 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 7 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 45321 | |
Common | 2969 | 6.1% |
Latin | 417 | 0.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
진 | 2503 | 5.5% |
주 | 2292 | 5.1% |
이 | 1195 | 2.6% |
남 | 888 | 2.0% |
제 | 828 | 1.8% |
호 | 803 | 1.8% |
강 | 765 | 1.7% |
원 | 761 | 1.7% |
스 | 673 | 1.5% |
점 | 643 | 1.4% |
Other values (677) | 33970 |
Latin
Value | Count | Frequency (%) |
A | 46 | 11.0% |
M | 29 | 7.0% |
o | 26 | 6.2% |
B | 22 | 5.3% |
e | 22 | 5.3% |
t | 19 | 4.6% |
i | 17 | 4.1% |
S | 16 | 3.8% |
I | 16 | 3.8% |
O | 14 | 3.4% |
Other values (34) | 190 |
Common
Value | Count | Frequency (%) |
1569 | ||
/ | 641 | |
( | 189 | 6.4% |
) | 189 | 6.4% |
: | 49 | 1.7% |
5 | 46 | 1.5% |
2 | 41 | 1.4% |
0 | 35 | 1.2% |
& | 27 | 0.9% |
4 | 27 | 0.9% |
Other values (13) | 156 | 5.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 45321 | |
ASCII | 3379 | 6.9% |
Number Forms | 7 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
진 | 2503 | 5.5% |
주 | 2292 | 5.1% |
이 | 1195 | 2.6% |
남 | 888 | 2.0% |
제 | 828 | 1.8% |
호 | 803 | 1.8% |
강 | 765 | 1.7% |
원 | 761 | 1.7% |
스 | 673 | 1.5% |
점 | 643 | 1.4% |
Other values (677) | 33970 |
ASCII
Value | Count | Frequency (%) |
1569 | ||
/ | 641 | |
( | 189 | 5.6% |
) | 189 | 5.6% |
: | 49 | 1.5% |
5 | 46 | 1.4% |
A | 46 | 1.4% |
2 | 41 | 1.2% |
0 | 35 | 1.0% |
M | 29 | 0.9% |
Other values (56) | 545 | 16.1% |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 7 |
관광상품분류
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 49.5 KiB |
음식점 | |
---|---|
관광지 | |
행사 | |
숙소 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.7186462 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 행사 |
---|---|
2nd row | 행사 |
3rd row | 관광지 |
4th row | 행사 |
5th row | 행사 |
Common Values
Value | Count | Frequency (%) |
음식점 | 2683 | |
관광지 | 1861 | |
행사 | 1310 | |
숙소 | 469 | 7.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
음식점 | 2683 | |
관광지 | 1861 | |
행사 | 1310 | |
숙소 | 469 | 7.4% |
수집원분류
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 49.5 KiB |
네이버블로그 | |
---|---|
인스타그램 | |
티스토리블로그 | |
유튜브 | |
다음블로그 | 247 |
Length
Max length | 7 |
---|---|
Median length | 6 |
Mean length | 5.5715641 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 인스타그램 |
---|---|
2nd row | 인스타그램 |
3rd row | 인스타그램 |
4th row | 인스타그램 |
5th row | 인스타그램 |
Common Values
Value | Count | Frequency (%) |
네이버블로그 | 2986 | |
인스타그램 | 1850 | |
티스토리블로그 | 777 | 12.3% |
유튜브 | 463 | 7.3% |
다음블로그 | 247 | 3.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
네이버블로그 | 2986 | |
인스타그램 | 1850 | |
티스토리블로그 | 777 | 12.3% |
유튜브 | 463 | 7.3% |
다음블로그 | 247 | 3.9% |
홈페이지주소(URL)
Text
UNIQUE
 
Distinct | 6323 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 49.5 KiB |
Length
Max length | 635 |
---|---|
Median length | 350 |
Mean length | 52.13095 |
Min length | 26 |
Characters and Unicode
Total characters | 329624 |
---|---|
Distinct characters | 71 |
Distinct categories | 7 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 6323 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | https://www.instagram.com/p/B5CUQdEAiL6/ |
---|---|
2nd row | https://www.instagram.com/p/B5CYU27lOMu/ |
3rd row | https://www.instagram.com/p/B5o5288FxpJ/ |
4th row | https://www.instagram.com/p/B5rfX9aAE7C/ |
5th row | https://www.instagram.com/p/B5t7IiylHri/ |
Value | Count | Frequency (%) |
https://www.instagram.com/p/b5cuqdeail6 | 1 | < 0.1% |
https://www.instagram.com/p/b59ag_ylpbe | 1 | < 0.1% |
https://www.youtube.com/watch?v=dkl5lw8_zum | 1 | < 0.1% |
https://www.youtube.com/watch?v=qqzhtcs8slu | 1 | < 0.1% |
https://blog.naver.com/mammy200104?redirect=log&logno=221735027378 | 1 | < 0.1% |
https://blog.naver.com/su_heeeeee?redirect=log&logno=221734416873 | 1 | < 0.1% |
https://www.instagram.com/p/b5_gv9ygzbn | 1 | < 0.1% |
https://blog.naver.com/strychinin?redirect=log&logno=221736348549 | 1 | < 0.1% |
https://blog.naver.com/ouou1111?redirect=log&logno=221736578025 | 1 | < 0.1% |
https://blog.naver.com/hotelraonstay?redirect=log&logno=221735372811 | 1 | < 0.1% |
Other values (6313) | 6313 |
Most occurring characters
Value | Count | Frequency (%) |
/ | 23085 | 7.0% |
o | 21670 | 6.6% |
t | 21340 | 6.5% |
. | 12649 | 3.8% |
2 | 12426 | 3.8% |
g | 12336 | 3.7% |
e | 12103 | 3.7% |
s | 10658 | 3.2% |
c | 10467 | 3.2% |
r | 10174 | 3.1% |
Other values (61) | 182716 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 190204 | |
Decimal Number | 53702 | 16.3% |
Other Punctuation | 52353 | 15.9% |
Uppercase Letter | 25668 | 7.8% |
Math Symbol | 6306 | 1.9% |
Dash Punctuation | 813 | 0.2% |
Connector Punctuation | 578 | 0.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 21670 | 11.4% |
t | 21340 | 11.2% |
g | 12336 | 6.5% |
e | 12103 | 6.4% |
s | 10658 | 5.6% |
c | 10467 | 5.5% |
r | 10174 | 5.3% |
a | 9725 | 5.1% |
m | 9588 | 5.0% |
p | 9414 | 4.9% |
Other values (16) | 62729 |
Uppercase Letter
Value | Count | Frequency (%) |
R | 3454 | |
N | 3328 | |
L | 3293 | |
C | 2752 | |
B | 1960 | 7.6% |
E | 1695 | 6.6% |
A | 1409 | 5.5% |
D | 671 | 2.6% |
F | 622 | 2.4% |
M | 580 | 2.3% |
Other values (16) | 5904 |
Decimal Number
Value | Count | Frequency (%) |
2 | 12426 | |
1 | 6220 | |
0 | 4809 | 9.0% |
8 | 4649 | 8.7% |
3 | 4619 | 8.6% |
9 | 4603 | 8.6% |
4 | 4509 | 8.4% |
7 | 4052 | 7.5% |
6 | 3957 | 7.4% |
5 | 3858 | 7.2% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 23085 | |
. | 12649 | |
: | 6323 | 12.1% |
% | 3990 | 7.6% |
? | 3397 | 6.5% |
& | 2909 | 5.6% |
Math Symbol
Value | Count | Frequency (%) |
= | 6306 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 813 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 578 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 215872 | |
Common | 113752 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 21670 | 10.0% |
t | 21340 | 9.9% |
g | 12336 | 5.7% |
e | 12103 | 5.6% |
s | 10658 | 4.9% |
c | 10467 | 4.8% |
r | 10174 | 4.7% |
a | 9725 | 4.5% |
m | 9588 | 4.4% |
p | 9414 | 4.4% |
Other values (42) | 88397 |
Common
Value | Count | Frequency (%) |
/ | 23085 | |
. | 12649 | |
2 | 12426 | |
: | 6323 | 5.6% |
= | 6306 | 5.5% |
1 | 6220 | 5.5% |
0 | 4809 | 4.2% |
8 | 4649 | 4.1% |
3 | 4619 | 4.1% |
9 | 4603 | 4.0% |
Other values (9) | 28063 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 329624 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
/ | 23085 | 7.0% |
o | 21670 | 6.6% |
t | 21340 | 6.5% |
. | 12649 | 3.8% |
2 | 12426 | 3.8% |
g | 12336 | 3.7% |
e | 12103 | 3.7% |
s | 10658 | 3.2% |
c | 10467 | 3.2% |
r | 10174 | 3.1% |
Other values (61) | 182716 |
게시글작성일
Date
Distinct | 1542 |
---|---|
Distinct (%) | 24.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 49.5 KiB |
Minimum | 2005-10-28 00:00:00 |
---|---|
Maximum | 2021-12-17 00:00:00 |
번호 | 관광상품분류 | 수집원분류 | |
---|---|---|---|
번호 | 1.000 | 0.974 | 0.573 |
관광상품분류 | 0.974 | 1.000 | 0.309 |
수집원분류 | 0.573 | 0.309 | 1.000 |
관광상품분류 | 수집원분류 | |
---|---|---|
관광상품분류 | 1.000 | 0.256 |
수집원분류 | 0.256 | 1.000 |
번호 | 관광상품분류 | 수집원분류 | |
---|---|---|---|
번호 | 1.000 | 0.919 | 0.273 |
관광상품분류 | 0.919 | 1.000 | 0.256 |
수집원분류 | 0.273 | 0.256 | 1.000 |
번호 | 관광상품명 | 관광상품분류 | 수집원분류 | 홈페이지주소(URL) | 게시글작성일 | |
---|---|---|---|---|---|---|
0 | 5005529 | 진주국제재즈페스티벌 | 행사 | 인스타그램 | https://www.instagram.com/p/B5CUQdEAiL6/ | 2019-11-19 |
1 | 5005530 | 진주국제재즈페스티벌 | 행사 | 인스타그램 | https://www.instagram.com/p/B5CYU27lOMu/ | 2019-11-19 |
2 | 5000270 | 경해여자고등학교 | 관광지 | 인스타그램 | https://www.instagram.com/p/B5o5288FxpJ/ | 2021-12-17 |
3 | 5005323 | 에나뮤직오픈마이크 | 행사 | 인스타그램 | https://www.instagram.com/p/B5rfX9aAE7C/ | 2021-12-15 |
4 | 5005540 | 진주국제재즈페스티벌 | 행사 | 인스타그램 | https://www.instagram.com/p/B5t7IiylHri/ | 2021-12-13 |
5 | 5004900 | 헤이데이 | 음식점 | 인스타그램 | https://www.instagram.com/p/B6UfGTZhlxi/ | 2021-12-03 |
6 | 5004901 | 헤이데이 | 음식점 | 인스타그램 | https://www.instagram.com/p/B6afex0hBIj/ | 2021-11-30 |
7 | 5001518 | 진주성 | 관광지 | 인스타그램 | https://www.instagram.com/p/B62h0jrFISu/ | 2021-11-25 |
8 | 5000954 | 진양호 | 관광지 | 인스타그램 | https://www.instagram.com/p/B645cSsl3PE/ | 2021-11-24 |
9 | 5004899 | 헤이데이 | 음식점 | 인스타그램 | https://www.instagram.com/p/B65cvWOBi7c/ | 2021-11-23 |
번호 | 관광상품명 | 관광상품분류 | 수집원분류 | 홈페이지주소(URL) | 게시글작성일 | |
---|---|---|---|---|---|---|
6313 | 5000542 | 남강습지원 | 관광지 | 티스토리블로그 | https://neowind.tistory.com/224 | 2009-04-06 |
6314 | 5005957 | 진주남강유등축제 | 행사 | 티스토리블로그 | https://lalawin.tistory.com/399 | 2009-01-11 |
6315 | 5005067 | 국화작품전시회 | 행사 | 티스토리블로그 | https://heysukim114.tistory.com/470 | 2008-11-03 |
6316 | 5001630 | 진주남강유등축제 | 행사 | 티스토리블로그 | https://kimchi39.tistory.com/entry/jinju-korail-train | 2008-10-12 |
6317 | 5003702 | 안의갈비탕 | 음식점 | 티스토리블로그 | https://kimchi39.tistory.com/entry/jinju-anui | 2008-10-12 |
6318 | 5005952 | 진주남강유등축제 | 행사 | 티스토리블로그 | https://kimchi39.tistory.com/entry/jinju-namkang | 2008-10-12 |
6319 | 5006021 | 진주남강유등축제 | 행사 | 티스토리블로그 | https://gyeongnamtravel.tistory.com/entry/%EC%A7%84%EC%A3%BC-%EC%9C%A0%EB%93%B1%EC%B6%95%EC%A0%9C | 2008-09-29 |
6320 | 5006063 | 진주남강유등축제 | 행사 | 티스토리블로그 | https://5gangsan.tistory.com/entry/%EC%A7%84%EC%A3%BC-%EC%9C%A0%EB%93%B1%EC%B6%95%EC%A0%9C200610 | 2008-09-27 |
6321 | 5001835 | 촉석루 | 관광지 | 티스토리블로그 | https://talktravel.tistory.com/28283 | 2008-01-07 |
6322 | 5006031 | 진주남강유등축제 | 행사 | 티스토리블로그 | https://kym5219.tistory.com/7105 | 2005-10-28 |