Overview

Dataset statistics

Number of variables: 5
Number of observations: 237
Missing cells: 17
Missing cells (%): 1.4%
Duplicate rows: 4
Duplicate rows (%): 1.7%
Total size in memory: 9.6 KiB
Average record size in memory: 41.6 B
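
The two percentages can be re-derived from the counts. The following is a minimal sketch, assuming the missing-cell percentage is taken over all cells (observations × variables) and the duplicate percentage over rows. The helper name pct is illustrative and is not part of the profiling tool.

```python
# Counts copied from the dataset statistics of this report.
n_variables = 5
n_observations = 237
missing_cells = 17
duplicate_rows = 4


def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Assumption: missing cells are measured against every cell in the table.
missing_pct = pct(missing_cells, n_observations * n_variables)
# Assumption: duplicate rows are measured against the number of rows.
duplicate_pct = pct(duplicate_rows, n_observations)

print(f"Missing cells (%): {missing_pct}%")
print(f"Duplicate rows (%): {duplicate_pct}%")
```

Under these assumptions both values agree with the figures reported above.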

Variable types

Text: 2
Categorical: 1
Numeric: 1
DateTime: 1

Dataset

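Description: Data on the domestic tour products operated by Korail Tourism Development Co., Ltd. (코레일관광개발(주)) from 2022 through April 2023, including product name, product price, and product operating period.
URL: https://www.data.go.kr/data/15108755/fileData.do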

Alerts

Dataset has 4 (1.7%) duplicate rows (Duplicates)
상품가격 is highly overall correlated with 여행기간 (High correlation)
여행기간 is highly overall correlated with 상품가격 (High correlation)
운행종료일 has 17 (7.2%) missing values (Missing)

Reproduction

Analysis started: 2023-12-12 00:02:37.134780
Analysis finished: 2023-12-12 00:02:37.842629
Duration: 0.71 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
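
The reported duration follows from the two timestamps. A small sketch using only the Python standard library, with the timestamp strings copied from this section:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-12 00:02:37.134780")
finished = datetime.fromisoformat("2023-12-12 00:02:37.842629")

# Elapsed wall-clock time in seconds, formatted to two decimals.
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```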

Variables

상품명
Text

Distinct: 214
Distinct (%): 90.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

Length

Max length: 110
Median length: 89
Mean length: 74.029536
Min length: 15

Characters and Unicode

Total characters: 17545
Distinct characters: 574
Distinct categories: 14
Distinct scripts: 4
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
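
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard unicodedata module. The input string is one sample value of this variable; the report itself may compute its category statistics differently, so treat this as an approximation of the idea rather than the tool's implementation.

```python
import unicodedata
from collections import Counter

# One sample value of this variable (the third sample row of the report).
text = "동해의 랜드마크! 동해안의 명물! 환상적인 해안선을 따라 달리는 유일무이 바다열차 동해 여행 당일"

# unicodedata.category returns the two-letter General Category code,
# e.g. "Lo" (Other Letter), "Zs" (Space Separator), "Po" (Other Punctuation).
category_counts = Counter(unicodedata.category(ch) for ch in text)

for code, count in category_counts.most_common():
    print(code, count)
```

The same call explains the category names used in this report: "[" is Open Punctuation (Ps), "]" is Close Punctuation (Pe), and "|" is a Math Symbol (Sm).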

Unique

Unique: 194
Unique (%): 81.9%

Sample

1st row: [수도권출발][경남/전남][KTX][섬_남해]|[별미]한려수도 구석구석 명소탐방! 남해/통영/여수/순천/광양 남해안 별미 기차여행 2박3일
2nd row: [수도권출발][전북/충남][KTX][겨울여행]|*눈꽃 위를 걷는 하루* 대둔산 구름다리 & 케이블카 · 하늘물빛정원 당일
3rd row: 동해의 랜드마크! 동해안의 명물! 환상적인 해안선을 따라 달리는 유일무이 바다열차 동해 여행 당일
4th row: [수도권출발][전북/충남][KTX][겨울여행]|*겨울 설산의 백미* 무주구천동&덕유산 향적봉 곤돌라 · 하늘물빛정원 당일
5th row: [수도권출발][섬_제주][겨울여행][연합]|[제주]★겨울 풍경이 있는 제주★ 가파도(or 마라도) & 동백힐링정원 & 광치기해안 제주 2박3일
Value                  Count   Frequency (%)
당일                   125     5.3%
·                      88      3.8%
(blank)                44      1.9%
1박2일                 41      1.7%
떠나는                 36      1.5%
제공                   20      0.9%
출발                   20      0.9%
여행                   18      0.8%
타고                   18      0.8%
2박3일                 17      0.7%
Other values (1171)    1918    81.8%

Most occurring characters

ValueCountFrequency (%)
2127
 
12.1%
] 1038
 
5.9%
[ 1037
 
5.9%
430
 
2.5%
324
 
1.8%
323
 
1.8%
320
 
1.8%
319
 
1.8%
243
 
1.4%
| 202
 
1.2%
Other values (564) 11182
63.7%

Most occurring categories

Value                Count   Frequency (%)
Other Letter         10809   61.6%
Space Separator      2127    12.1%
Close Punctuation    1137    6.5%
Open Punctuation     1136    6.5%
Other Punctuation    586     3.3%
Uppercase Letter     494     2.8%
Decimal Number       481     2.7%
Math Symbol          292     1.7%
Lowercase Letter     280     1.6%
Other Symbol         114     0.6%
Other values (4)     89      0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
430
 
4.0%
324
 
3.0%
323
 
3.0%
320
 
3.0%
319
 
3.0%
243
 
2.2%
192
 
1.8%
174
 
1.6%
171
 
1.6%
155
 
1.4%
Other values (485) 8158
75.5%
Uppercase Letter
ValueCountFrequency (%)
T 136
27.5%
X 126
25.5%
K 124
25.1%
E 16
 
3.2%
A 14
 
2.8%
C 8
 
1.6%
O 8
 
1.6%
S 8
 
1.6%
G 7
 
1.4%
B 7
 
1.4%
Other values (11) 40
 
8.1%
Lowercase Letter
ValueCountFrequency (%)
a 39
13.9%
r 39
13.9%
t 29
10.4%
i 29
10.4%
n 25
8.9%
e 19
6.8%
s 16
 
5.7%
m 15
 
5.4%
o 14
 
5.0%
g 12
 
4.3%
Other values (8) 43
15.4%
Decimal Number
ValueCountFrequency (%)
2 166
34.5%
1 134
27.9%
3 64
 
13.3%
0 40
 
8.3%
4 30
 
6.2%
8 19
 
4.0%
6 10
 
2.1%
5 9
 
1.9%
9 8
 
1.7%
7 1
 
0.2%
Other Punctuation
ValueCountFrequency (%)
! 173
29.5%
, 125
21.3%
· 97
16.6%
/ 85
14.5%
& 50
 
8.5%
. 34
 
5.8%
* 18
 
3.1%
" 2
 
0.3%
: 2
 
0.3%
Other Symbol
ValueCountFrequency (%)
33
28.9%
32
28.1%
16
14.0%
8
 
7.0%
8
 
7.0%
7
 
6.1%
4
 
3.5%
4
 
3.5%
2
 
1.8%
Math Symbol
ValueCountFrequency (%)
| 202
69.2%
+ 50
 
17.1%
~ 40
 
13.7%
Close Punctuation
ValueCountFrequency (%)
] 1038
91.3%
) 99
 
8.7%
Open Punctuation
ValueCountFrequency (%)
[ 1037
91.3%
( 99
 
8.7%
Space Separator
ValueCountFrequency (%)
2127
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 48
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 34
100.0%
Modifier Symbol
ValueCountFrequency (%)
´ 6
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value     Count   Frequency (%)
Hangul    10794   61.5%
Common    5962    34.0%
Latin     774     4.4%
Han       15      0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
430
 
4.0%
324
 
3.0%
323
 
3.0%
320
 
3.0%
319
 
3.0%
243
 
2.3%
192
 
1.8%
174
 
1.6%
171
 
1.6%
155
 
1.4%
Other values (480) 8143
75.4%
Common
ValueCountFrequency (%)
2127
35.7%
] 1038
17.4%
[ 1037
17.4%
| 202
 
3.4%
! 173
 
2.9%
2 166
 
2.8%
1 134
 
2.2%
, 125
 
2.1%
) 99
 
1.7%
( 99
 
1.7%
Other values (30) 762
 
12.8%
Latin
ValueCountFrequency (%)
T 136
17.6%
X 126
16.3%
K 124
16.0%
a 39
 
5.0%
r 39
 
5.0%
t 29
 
3.7%
i 29
 
3.7%
n 25
 
3.2%
e 19
 
2.5%
s 16
 
2.1%
Other values (29) 192
24.8%
Han
ValueCountFrequency (%)
6
40.0%
4
26.7%
2
 
13.3%
2
 
13.3%
1
 
6.7%

Most occurring blocks

Value                Count   Frequency (%)
Hangul               10679   60.9%
ASCII                6518    37.2%
Compat Jamo          115     0.7%
None                 103     0.6%
Geometric Shapes     73      0.4%
Misc Symbols         41      0.2%
CJK                  15      0.1%
Enclosed Alphanum    1       < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2127
32.6%
] 1038
15.9%
[ 1037
15.9%
| 202
 
3.1%
! 173
 
2.7%
2 166
 
2.5%
T 136
 
2.1%
1 134
 
2.1%
X 126
 
1.9%
, 125
 
1.9%
Other values (57) 1254
19.2%
Hangul
ValueCountFrequency (%)
430
 
4.0%
324
 
3.0%
323
 
3.0%
320
 
3.0%
319
 
3.0%
243
 
2.3%
192
 
1.8%
174
 
1.6%
171
 
1.6%
155
 
1.5%
Other values (479) 8028
75.2%
Compat Jamo
ValueCountFrequency (%)
115
100.0%
None
ValueCountFrequency (%)
· 97
94.2%
´ 6
 
5.8%
Geometric Shapes
ValueCountFrequency (%)
33
45.2%
32
43.8%
8
 
11.0%
Misc Symbols
ValueCountFrequency (%)
16
39.0%
8
19.5%
7
17.1%
4
 
9.8%
4
 
9.8%
2
 
4.9%
CJK
ValueCountFrequency (%)
6
40.0%
4
26.7%
2
 
13.3%
2
 
13.3%
1
 
6.7%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%

여행기간
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

Length

Max length: 4
Median length: 2
Mean length: 2.6244726
Min length: 2

Unique

Unique: 1
Unique (%): 0.4%

Sample

1st row: 2박3일
2nd row: 당일
3rd row: 당일
4th row: 당일
5th row: 2박3일

Common Values

Value      Count   Frequency (%)
당일       162     68.4%
1박2일     47      19.8%
2박3일     25      10.5%
무박2일    2       0.8%
무박       1       0.4%
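
The length, distinct, and unique statistics of this variable can be reproduced from the value counts alone. The sketch below is a plain-Python cross-check, assuming "distinct" means values that occur at all and "unique" means values that occur exactly once; it is not the profiling tool's own code.

```python
from collections import Counter
from statistics import mean, median

# Value counts copied from the Common Values table of 여행기간.
value_counts = Counter({"당일": 162, "1박2일": 47, "2박3일": 25, "무박2일": 2, "무박": 1})

n = sum(value_counts.values())  # number of observations

# One string length per observation.
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

distinct = len(value_counts)                                       # values that occur at all
unique = sum(1 for count in value_counts.values() if count == 1)   # values that occur exactly once

# Relative frequency of each value, in percent with one decimal.
frequencies = {value: round(100 * count / n, 1) for value, count in value_counts.items()}

print(n, distinct, unique)
print(max(lengths), median(lengths), round(mean(lengths), 7), min(lengths))
print(frequencies)
```

With these inputs the sketch yields 237 observations, 5 distinct values, 1 unique value, and a mean length of 2.6244726, in agreement with the figures reported for this variable.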

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of the common values listed above]

상품가격
Real number (ℝ)

HIGH CORRELATION 

Distinct: 110
Distinct (%): 46.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 150810.97
Minimum: 31000
Maximum: 509000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.2 KiB

Quantile statistics

Minimum: 31000
5-th percentile: 48800
Q1: 69000
Median: 99000
Q3: 224000
95-th percentile: 381400
Maximum: 509000
Range: 478000
Interquartile range (IQR): 155000

Descriptive statistics

Standard deviation: 111961.36
Coefficient of variation (CV): 0.74239533
Kurtosis: 0.41403901
Mean: 150810.97
Median Absolute Deviation (MAD): 34000
Skewness: 1.2463981
Sum: 35742200
Variance: 1.2535346 × 10^10
Monotonicity: Not monotonic
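
Several of the 상품가격 statistics are derived from others, which makes a quick consistency check possible. The sketch below uses the standard definitions (range = max − min, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared, sum = mean × n) with the rounded values printed in this report, so agreement is only expected up to rounding.

```python
import math

# Values copied from the statistics reported for 상품가격.
n = 237
mean = 150810.97
std = 111961.36
minimum, q1, q3, maximum = 31000, 69000, 224000, 509000

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
total = mean * n                  # Sum, recovered from the rounded mean

print(value_range, iqr)
print(round(cv, 8), f"{variance:.7e}", round(total))

# Compare against the reported figures, allowing for rounding of the inputs.
assert math.isclose(cv, 0.74239533, rel_tol=1e-6)
assert math.isclose(variance, 1.2535346e10, rel_tol=1e-6)
assert math.isclose(total, 35742200, rel_tol=1e-6)
```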
[Figure: Histogram with fixed size bins (bins=50)]
Value                 Count   Frequency (%)
69000                 14      5.9%
79000                 13      5.5%
129000                9       3.8%
99000                 9       3.8%
119000                9       3.8%
84000                 8       3.4%
149000                7       3.0%
89000                 6       2.5%
109000                5       2.1%
59900                 5       2.1%
Other values (100)    152     64.1%

Minimum 10 values

Value     Count   Frequency (%)
31000     1       0.4%
39000     1       0.4%
39900     3       1.3%
42000     1       0.4%
46000     1       0.4%
47000     1       0.4%
48000     4       1.7%
49000     2       0.8%
49900     1       0.4%
50000     2       0.8%

Maximum 10 values

Value     Count   Frequency (%)
509000    1       0.4%
505000    1       0.4%
446000    1       0.4%
443000    1       0.4%
433000    1       0.4%
428000    1       0.4%
420000    1       0.4%
404000    1       0.4%
399000    4       1.7%
377000    1       0.4%

운행시작일
Text

Distinct: 55
Distinct (%): 23.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.0 KiB

Length

Max length: 10
Median length: 10
Mean length: 9.4261603
Min length: 2

Characters and Unicode

Total characters: 2234
Distinct characters: 13
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 27
Unique (%): 11.4%

Sample

1st row: 2022-01-25
2nd row: 2022-01-26
3rd row: 2022-02-08
4th row: 2022-01-26
5th row: 2022-01-25

Value                  Count   Frequency (%)
2022-03-01             31      13.1%
2022-04-01             24      10.1%
2022-09-01             18      7.6%
연중 (year-round)      17      7.2%
2023-03-30             14      5.9%
2023-02-14             13      5.5%
2023-04-08             9       3.8%
2022-05-01             9       3.8%
2023-03-31             6       2.5%
2022-01-26             5       2.1%
Other values (45)      91      38.4%

Most occurring characters

ValueCountFrequency (%)
2 672
30.1%
0 562
25.2%
- 440
19.7%
1 184
 
8.2%
3 162
 
7.3%
4 78
 
3.5%
9 25
 
1.1%
8 23
 
1.0%
5 22
 
1.0%
6 19
 
0.9%
Other values (3) 47
 
2.1%

Most occurring categories

Value               Count   Frequency (%)
Decimal Number      1760    78.8%
Dash Punctuation    440     19.7%
Other Letter        34      1.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 672
38.2%
0 562
31.9%
1 184
 
10.5%
3 162
 
9.2%
4 78
 
4.4%
9 25
 
1.4%
8 23
 
1.3%
5 22
 
1.2%
6 19
 
1.1%
7 13
 
0.7%
Other Letter
ValueCountFrequency (%)
17
50.0%
17
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 440
100.0%

Most occurring scripts

Value     Count   Frequency (%)
Common    2200    98.5%
Hangul    34      1.5%

Most frequent character per script

Common
ValueCountFrequency (%)
2 672
30.5%
0 562
25.5%
- 440
20.0%
1 184
 
8.4%
3 162
 
7.4%
4 78
 
3.5%
9 25
 
1.1%
8 23
 
1.0%
5 22
 
1.0%
6 19
 
0.9%
Hangul
ValueCountFrequency (%)
17
50.0%
17
50.0%

Most occurring blocks

Value     Count   Frequency (%)
ASCII     2200    98.5%
Hangul    34      1.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 672
30.5%
0 562
25.5%
- 440
20.0%
1 184
 
8.4%
3 162
 
7.4%
4 78
 
3.5%
9 25
 
1.1%
8 23
 
1.0%
5 22
 
1.0%
6 19
 
0.9%
Hangul
ValueCountFrequency (%)
17
50.0%
17
50.0%

운행종료일
Date

MISSING 

Distinct: 61
Distinct (%): 27.7%
Missing: 17
Missing (%): 7.2%
Memory size: 2.0 KiB
Minimum: 2022-02-26 00:00:00
Maximum: 2023-12-27 00:00:00
[Figure: Histogram with fixed size bins (bins=50)]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

              여행기간   상품가격   운행시작일   운행종료일
여행기간      1.000      0.854      0.854        0.667
상품가격      0.854      1.000      0.000        0.609
운행시작일    0.854      0.000      1.000        0.996
운행종료일    0.667      0.609      0.996        1.000

[Figure: correlation heatmap]

              상품가격   여행기간
상품가격      1.000      0.510
여행기간      0.510      1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

First rows

index; 상품명; 여행기간; 상품가격; 운행시작일; 운행종료일
0; [수도권출발][경남/전남][KTX][섬_남해]|[별미]한려수도 구석구석 명소탐방! 남해/통영/여수/순천/광양 남해안 별미 기차여행 2박3일; 2박3일; 399000; 2022-01-25; 2022-04-30
1; [수도권출발][전북/충남][KTX][겨울여행]|*눈꽃 위를 걷는 하루* 대둔산 구름다리 & 케이블카 · 하늘물빛정원 당일; 당일; 79000; 2022-01-26; 2022-02-27
2; 동해의 랜드마크! 동해안의 명물! 환상적인 해안선을 따라 달리는 유일무이 바다열차 동해 여행 당일; 당일; 129000; 2022-02-08; 2022-03-29
3; [수도권출발][전북/충남][KTX][겨울여행]|*겨울 설산의 백미* 무주구천동&덕유산 향적봉 곤돌라 · 하늘물빛정원 당일; 당일; 87000; 2022-01-26; 2022-02-27
4; [수도권출발][섬_제주][겨울여행][연합]|[제주]★겨울 풍경이 있는 제주★ 가파도(or 마라도) & 동백힐링정원 & 광치기해안 제주 2박3일; 2박3일; 219000; 2022-01-25; 2022-03-01
5; [수도권출발][충남][무궁화-새마을][관광택시]|[예산택시A]우리끼리 기차+택시 타고 예산 핵심 알짜배기 당일 여행(수덕사ㆍ봉수산수목원ㆍ예당호); 당일; 48000; 2022-01-26; 2022-02-27
6; [수도권출발][전남][섬_서해][겨울여행][용산,천안아산,오송출발][연합]|[KTX] 천년의 신비를 간직한 홍도ㆍ흑산도 1박2일 (2명이상 출발); 1박2일; 252000; 2022-01-26; 2022-06-30
7; 겨울왕국 강원도! 새하얀 낭만이 넘실대는 동해안 윈터 패키지 당일(오대산 월정사 · 대관령 하늘목장 · 강릉 안목해변 카페거리); 당일; 119000; 2022-02-09; 2022-02-27
8; [수도권출발][전남][먹방][KTX][겨울여행]|남도 별미여행 맛따라 멋따라 담양/곡성/순천/광양/사천 기차여행 1박2일; 1박2일; 293000; 2022-01-25; 2022-06-29
9; [수도권출발][전남][섬_서해][겨울여행]|[KTX][별미+섬길트레킹]보라빛 향연 신안 퍼플섬 반월/박지도ㆍ천사대교ㆍ목포 해상케이블카 1박2일; 1박2일; 359000; 2022-01-26; 2022-02-26

Last rows

index; 상품명; 여행기간; 상품가격; 운행시작일; 운행종료일
227; [수도권출발][충북][무궁화-ITX새마을][옥천]| ´옥천 벚꽃 맞이´ 장계관광지ㆍ부소담악 당일 (중식제공); 당일; 42000; 2023-04-07; 2023-04-08
228; [수도권출발][강원][봄여행][KTX][바다열차]|동해안 명물! 바다열차 타고 떠나보는 노오란 물결속 꽃놀이! 2023 삼척 맹방 유채꽃 축제 당일(삼척레일시티투어); 당일; 119000; 2023-04-08; 2023-04-23
229; [수도권출발][충남][봄여행][G-Train]| 주산 봄꽃, 보령에서 봄의 정취를 느끼다! 개화예술공원ㆍ보령중앙시장 당일 (쌈밥제공); 당일; 84000; 2023-04-07; 2023-04-09
230; [수도권출발][충북][E-train][제휴]|단 하루!! 제천으로 떠나는 봄 나들이 특가 당일; 당일; 108000; 2023-04-22; 2023-04-22
231; [수도권출발][충남][전통고택][서해금빛열차]| *단4회 한정* 여운의 고택, 문헌서원을 찾아서 서천고택투어 · 장항스카이워크 & 송림욕장 당일; 당일; 100000; 2023-04-07; 2023-10-13
232; [수도권출발][전북][KTX][진안+전주][연합][용산,광명출발][봄여행]| 전북최고 명승지를 내품에! My 마이산 벚꽃 & 전주한옥마을 당일; 당일; 94000; 2023-04-01; 2023-06-30
233; [수도권출발][전남][순천][봄여행][특별열차][4월29일][청량리,영등포,수원,천안]|[E-train] See you again! 함께하는 순천형 정원, 2023순천만 국제정원 박람회 당일; 당일; 105000; 2023-04-29; 2023-04-29
234; [수도권출발][경북]|문경으로 떠나는 친환경 기차여행♥ 문경새재ㆍ오미자테마터널ㆍ문경점촌점빵길 당일 (문경사랑상품권 1만원 제공); 당일; 79000; 2023-04-26; 2023-06-24
235; [수도권출발][충남][KTX]| 웰던 백제 완성 여행 ▶부여 & 공주◀ 궁남지ㆍ무령왕릉ㆍ부소산성ㆍ공산성 당일; 당일; 69000; 2023-04-12; 2023-04-29
236; [수도권출발][충남][서해금빛열차-무궁화]| 뷰가 다 했다! 예산 수덕사ㆍ황새공원ㆍ예당호 출렁다리 당일; 당일; 60000; 2023-04-14; 2023-06-30

Duplicate rows

Most frequently occurring

index; 상품명; 여행기간; 상품가격; 운행시작일; 운행종료일; # duplicates
0; [수도권출발][강원][여름][KTX]|여름하면 삼척! 동해안 원더~뷰 삼척! 에메랄드 빛 동해안을 달리는 바다열차와 무더위를 날려줄 삼척 환선굴 특급 모노레일 당일; 당일; 129000; 2022-07-16; 2022-08-28; 2
1; [수도권출발][봄][KTX] 홍도/흑산도/목포 해상케이블카 2박3일; 2박3일; 399000; 2022-03-01; 2022-06-30; 2
2; [수도권출발][전남][섬_남해][연합][KTX][봄여행]| 삶의 쉼표가 되는 그 곳, 청산도 유채꽃ㆍ보길도ㆍ땅끝마을ㆍ가우도 1박2일; 1박2일; 273000; 2023-03-30; 2023-11-30; 2
3; [수도권출발][충북][무궁화-ITX새마을][옥천]|ECO 식목일 HAPPY! 묘목축제와 옥천 봄여행 ♪ 장계관광지ㆍ부소담악 당일 * 묘목제공; 당일; 62000; 2023-03-31; 2023-04-01; 2