Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory478.5 KiB
Average record size in memory49.0 B

Variable types

DateTime1
Categorical3
Numeric1

Dataset

Description제주관광정보시스템(VISITJEJU)의 검색어트랜드 내역으로 통계일, 성별, 연령대, 키워드, 점수 등을 제공합니다.
URLhttps://www.data.go.kr/data/15118422/fileData.do

Alerts

성별 is highly overall correlated with 연령대High correlation
연령대 is highly overall correlated with 성별High correlation

Reproduction

Analysis started2023-12-12 12:28:05.229663
Analysis finished2023-12-12 12:28:06.061169
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct857
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-12-24 00:00:00
Maximum2023-07-29 00:00:00
2023-12-12T21:28:06.173347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:28:06.374774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

성별
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
남성
4604 
여성
4603 
전체
793 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row여성
2nd row남성
3rd row여성
4th row여성
5th row여성

Common Values

ValueCountFrequency (%)
남성 4604
46.0%
여성 4603
46.0%
전체 793
 
7.9%

Length

2023-12-12T21:28:06.562272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:28:06.696985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남성 4604
46.0%
여성 4603
46.0%
전체 793
 
7.9%

연령대
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
40대
1566 
60대이상
1558 
20대
1536 
30대
1527 
10대
1522 
Other values (2)
2291 

Length

Max length5
Median length3
Mean length3.2323
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row30대
2nd row40대
3rd row10대
4th row10대
5th row10대

Common Values

ValueCountFrequency (%)
40대 1566
15.7%
60대이상 1558
15.6%
20대 1536
15.4%
30대 1527
15.3%
10대 1522
15.2%
50대 1498
15.0%
전체 793
7.9%

Length

2023-12-12T21:28:06.902047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:28:07.077313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
40대 1566
15.7%
60대이상 1558
15.6%
20대 1536
15.4%
30대 1527
15.3%
10대 1522
15.2%
50대 1498
15.0%
전체 793
7.9%

키워드
Categorical

Distinct29
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
캠핑
1278 
성산일출봉
1124 
힐링
1115 
코스모스
937 
수국
792 
Other values (24)
4754 

Length

Max length5
Median length2
Mean length2.7379
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row들불축제
2nd row과일빙수
3rd row일출
4th row들불축제
5th row성산일출봉

Common Values

ValueCountFrequency (%)
캠핑 1278
12.8%
성산일출봉 1124
11.2%
힐링 1115
11.2%
코스모스 937
9.4%
수국 792
7.9%
방어 731
 
7.3%
한치 655
 
6.6%
동백 577
 
5.8%
들불축제 426
 
4.3%
단풍 399
 
4.0%
Other values (19) 1966
19.7%

Length

2023-12-12T21:28:07.290217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
캠핑 1278
12.8%
성산일출봉 1124
11.2%
힐링 1115
11.1%
코스모스 937
9.4%
수국 792
7.9%
방어 731
 
7.3%
한치 655
 
6.5%
동백 577
 
5.8%
들불축제 426
 
4.3%
단풍 399
 
4.0%
Other values (20) 1975
19.7%

점수
Real number (ℝ)

Distinct9873
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean183.89639
Minimum0.58911
Maximum453.00811
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T21:28:07.471700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.58911
5-th percentile29.461805
Q182.571285
median159.03582
Q3299.18903
95-th percentile369.37197
Maximum453.00811
Range452.419
Interquartile range (IQR)216.61775

Descriptive statistics

Standard deviation115.61059
Coefficient of variation (CV)0.62867243
Kurtosis-1.2903034
Mean183.89639
Median Absolute Deviation (MAD)92.369165
Skewness0.31115794
Sum1838963.9
Variance13365.809
MonotonicityNot monotonic
2023-12-12T21:28:07.669121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.99998 7
 
0.1%
199.99998 6
 
0.1%
70.58822 3
 
< 0.1%
66.66665 3
 
< 0.1%
313.63635 3
 
< 0.1%
349.99999 3
 
< 0.1%
102.05578 3
 
< 0.1%
299.99999 3
 
< 0.1%
327.27271 3
 
< 0.1%
63.63634 3
 
< 0.1%
Other values (9863) 9963
99.6%
ValueCountFrequency (%)
0.5891099999999999 1
< 0.1%
0.6556299999999999 1
< 0.1%
0.93199 1
< 0.1%
1.26987 1
< 0.1%
1.30653 1
< 0.1%
1.40581 1
< 0.1%
1.42965 1
< 0.1%
1.5223 1
< 0.1%
1.80905 1
< 0.1%
2.28814 1
< 0.1%
ValueCountFrequency (%)
453.00811 1
< 0.1%
438.53953 1
< 0.1%
426.16821 1
< 0.1%
413.21625 1
< 0.1%
405.06911 1
< 0.1%
398.87172 1
< 0.1%
397.51379 1
< 0.1%
397.26739 1
< 0.1%
396.44837 1
< 0.1%
395.30385 1
< 0.1%

Interactions

2023-12-12T21:28:05.665689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:28:07.814449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별연령대키워드점수
성별1.0000.7690.2670.107
연령대0.7691.0000.3980.138
키워드0.2670.3981.0000.512
점수0.1070.1380.5121.000
2023-12-12T21:28:07.958569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별키워드연령대
성별1.0000.1390.707
키워드0.1391.0000.177
연령대0.7070.1771.000
2023-12-12T21:28:08.078792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
점수성별연령대키워드
점수1.0000.0640.0700.207
성별0.0641.0000.7070.139
연령대0.0700.7071.0000.177
키워드0.2070.1390.1771.000

Missing values

2023-12-12T21:28:05.839720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:28:05.990720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

통계일성별연령대키워드점수
395472023-01-08여성30대들불축제129.86109
355572022-08-13남성40대과일빙수35.63732
231352023-01-14여성10대일출301.55037
105362021-10-04여성10대들불축제14.70333
260112022-06-28여성10대성산일출봉222.05881
132292021-12-08남성60대이상방어335.84069
279302023-03-23남성40대한치66.73864
158242022-02-28전체전체캠핑361.58272
422021-01-01전체전체방어313.03765
21782021-02-17남성50대캠핑161.30135
통계일성별연령대키워드점수
366592022-07-18여성40대성산일출봉356.97327
83592021-08-12여성30대한치69.19538
276642023-02-26남성50대청보리271.11109
178012022-03-19남성30대성산일출봉111.68656
32672021-03-10여성30대벚꽃361.93446
392462023-02-28남성10대힐링28.37836
62732021-05-24남성30대캠핑356.51407
294782023-04-23남성30대일출144.50702
310512023-04-11여성10대코스모스185.08286
44602021-04-11전체전체코스모스51.62996