Overview

Dataset statistics

Number of variables: 5
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.2 KiB
Average record size in memory: 43.3 B
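
The derived figures in this block follow from the raw counts. A minimal Python sketch of that arithmetic, using only the numbers printed above (the variable names are illustrative, not part of the report):

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 5
n_observations = 100
missing_cells = 0
avg_record_bytes = 43.3

# Total number of cells and the share of them that is missing.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Total size implied by the average record size (1 KiB = 1024 B).
total_kib = avg_record_bytes * n_observations / 1024

print(total_cells)              # 500
print(f"{missing_pct:.1f}%")    # 0.0%
print(f"{total_kib:.1f} KiB")   # 4.2 KiB
```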

Variable types

Categorical: 3
Numeric: 1
Text: 1

Alerts

tour_nd_trnsport_goods_rnewl_de has constant value "20211130" (Constant)
tour_nd_trnsport_goods_cd is highly overall correlated with tour_nd_trnsport_goods_addr (High correlation)
area_nm is highly overall correlated with tour_nd_trnsport_goods_addr (High correlation)
tour_nd_trnsport_goods_addr is highly overall correlated with tour_nd_trnsport_goods_cd and 1 other field (High correlation)
tour_nd_trnsport_goods_cd has unique values (Unique)
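
The Constant and Unique alerts can be reproduced by counting distinct values per column. The sketch below is a plain re-implementation for illustration, not the profiler's own code. The function name and the four rows (taken from the Sample section) are assumptions made for the example.

```python
def classify_columns(rows):
    """Label each column of a list of dict rows as 'constant', 'unique' or None."""
    alerts = {}
    for column in rows[0]:
        values = [row[column] for row in rows]
        distinct = len(set(values))
        if distinct == 1:
            alerts[column] = "constant"   # every row holds the same value
        elif distinct == len(values):
            alerts[column] = "unique"     # no value is repeated
        else:
            alerts[column] = None
    return alerts

rows = [
    {"area_nm": "충북 충주시", "tour_nd_trnsport_goods_cd": 41465, "tour_nd_trnsport_goods_rnewl_de": "20211130"},
    {"area_nm": "강원 강릉시", "tour_nd_trnsport_goods_cd": 37250, "tour_nd_trnsport_goods_rnewl_de": "20211130"},
    {"area_nm": "강원 강릉시", "tour_nd_trnsport_goods_cd": 40050, "tour_nd_trnsport_goods_rnewl_de": "20211130"},
    {"area_nm": "강원 강릉시", "tour_nd_trnsport_goods_cd": 40112, "tour_nd_trnsport_goods_rnewl_de": "20211130"},
]
alerts = classify_columns(rows)
print(alerts)
```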

Reproduction

Analysis started: 2023-12-10 09:47:51.282422
Analysis finished: 2023-12-10 09:47:52.691029
Duration: 1.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
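
The reported duration is the difference between the two timestamps. A short check with the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

started = datetime.fromisoformat("2023-12-10 09:47:51.282422")
finished = datetime.fromisoformat("2023-12-10 09:47:52.691029")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 1.41 seconds
```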

Variables

area_nm
Categorical

HIGH CORRELATION 

Distinct: 21
Distinct (%): 21.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
강원 평창군 | 20
강원 정선군 | 12
강원 삼척시 | 11
강원도 강릉시 | 10
강원 강릉시 | 8
Other values (16) | 39

Length

Max length: 7
Median length: 6
Mean length: 6.17
Min length: 4
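
Length statistics of this kind are computed over the character length of each value. A minimal sketch, assuming the five sample rows listed for this variable as input (so the numbers differ from the full-column figures above). `length_stats` is a hypothetical helper name.

```python
from statistics import mean, median

def length_stats(values):
    """Max, median, mean and min character length of a list of strings."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }

# The first five area_nm values shown in the report.
sample = ["<NA>", "충북 충주시", "강원 강릉시", "강원 강릉시", "강원 강릉시"]
stats = length_stats(sample)
print(stats)  # {'max': 6, 'median': 6, 'mean': 5.6, 'min': 4}
```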

Unique

Unique: 6
Unique (%): 6.0%
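
Distinct and Unique measure different things: Distinct counts the different values present, while Unique counts the values that occur exactly once. A small sketch of the distinction on made-up input (the six values below are illustrative, not the full column):

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of distinct values, number of values occurring exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

values = ["강원 강릉시"] * 3 + ["충북 충주시"] * 2 + ["강원 평창군"]
result = distinct_and_unique(values)
print(result)  # (3, 1)
```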

Sample

1st row: <NA>
2nd row: 충북 충주시
3rd row: 강원 강릉시
4th row: 강원 강릉시
5th row: 강원 강릉시

Common Values

Value | Count | Frequency (%)
강원 평창군 | 20 | 20.0%
강원 정선군 | 12 | 12.0%
강원 삼척시 | 11 | 11.0%
강원도 강릉시 | 10 | 10.0%
강원 강릉시 | 8 | 8.0%
강원 영월군 | 7 | 7.0%
강원 춘천시 | 5 | 5.0%
강원 동해시 | 3 | 3.0%
충북 충주시 | 3 | 3.0%
강원 속초시 | 3 | 3.0%
Other values (11) | 18 | 18.0%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
강원 | 76 | 38.2%
강원도 | 20 | 10.1%
평창군 | 20 | 10.1%
강릉시 | 18 | 9.0%
정선군 | 13 | 6.5%
삼척시 | 11 | 5.5%
영월군 | 7 | 3.5%
춘천시 | 5 | 2.5%
동해시 | 4 | 2.0%
고성군 | 4 | 2.0%
Other values (10) | 21 | 10.6%
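
The word table is consistent with splitting each value on whitespace and reporting each word as a share of all words: the counts sum to 199, and 76 / 199 is about 38.2%. A minimal sketch of that computation on four illustrative values (the tokenisation rule is inferred from the numbers, not taken from the profiler's source):

```python
from collections import Counter

def word_frequencies(values):
    """Count whitespace-separated words and their share of all words, in percent."""
    words = Counter(word for value in values for word in value.split())
    total = sum(words.values())
    return [(word, count, round(100 * count / total, 1))
            for word, count in words.most_common()]

sample = ["강원 평창군", "강원 정선군", "강원도 강릉시", "강원 강릉시"]
freq = word_frequencies(sample)
for word, count, pct in freq:
    print(word, count, f"{pct}%")
```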

tour_nd_trnsport_goods_cd
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 56092.05
Minimum: 37250
Maximum: 78768
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 37250
5-th percentile: 39516.25
Q1: 43006
Median: 56032
Q3: 65096.75
95-th percentile: 78763.05
Maximum: 78768
Range: 41518
Interquartile range (IQR): 22090.75
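
Range and IQR are simple differences of the quantiles listed above. A short check using the reported values:

```python
# Quantiles copied from the table above.
minimum, q1, q3, maximum = 37250, 43006, 65096.75, 78768

value_range = maximum - minimum   # spread between the extremes
iqr = q3 - q1                     # spread of the middle 50% of values

print(value_range)  # 41518
print(iqr)          # 22090.75
```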

Descriptive statistics

Standard deviation: 12751.992
Coefficient of variation (CV): 0.22734046
Kurtosis: -1.0267616
Mean: 56092.05
Median Absolute Deviation (MAD): 10370
Skewness: 0.23801099
Sum: 5609205
Variance: 1.6261331 × 10^8
Monotonicity: Not monotonic
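
Several of these statistics are tied together by simple identities: CV is the standard deviation divided by the mean, variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below checks them against the rounded values printed above, so the results agree only up to rounding.

```python
import math

n = 100
mean = 56092.05
std = 12751.992

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the squared standard deviation
total = mean * n       # sum of all values

print(round(cv, 4))                            # 0.2273
print(math.isclose(variance, 1.6261331e8, rel_tol=1e-6))  # True
print(round(total))                            # 5609205
```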
Histogram with fixed size bins (bins=50)
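
Fixed-size binning divides the interval between the minimum and maximum into equal-width bins. The following is a plain-Python sketch of that idea, not the plotting code used by the report. The function name is hypothetical, and the input is the ten smallest codes plus the maximum from this section.

```python
def fixed_width_histogram(values, bins, lo, hi):
    """Count values in `bins` equal-width bins spanning [lo, hi]."""
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum value is placed in the last bin rather than past it.
        index = min(int((v - lo) / width), bins - 1)
        counts[index] += 1
    return counts, width

values = [37250, 37782, 37804, 37808, 38590, 39565,
          39593, 39650, 40045, 40050, 78768]
counts, width = fixed_width_histogram(values, bins=50, lo=37250, hi=78768)
print(round(width, 2))  # 830.36
print(counts[:4])       # [4, 1, 3, 2]
```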

Common Values

Value | Count | Frequency (%)
60843 | 1 | 1.0%
55650 | 1 | 1.0%
61903 | 1 | 1.0%
61883 | 1 | 1.0%
61114 | 1 | 1.0%
59830 | 1 | 1.0%
59565 | 1 | 1.0%
58987 | 1 | 1.0%
57757 | 1 | 1.0%
56337 | 1 | 1.0%
Other values (90) | 90 | 90.0%

Minimum 10 values

Value | Count | Frequency (%)
37250 | 1 | 1.0%
37782 | 1 | 1.0%
37804 | 1 | 1.0%
37808 | 1 | 1.0%
38590 | 1 | 1.0%
39565 | 1 | 1.0%
39593 | 1 | 1.0%
39650 | 1 | 1.0%
40045 | 1 | 1.0%
40050 | 1 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
78768 | 1 | 1.0%
78767 | 1 | 1.0%
78766 | 1 | 1.0%
78765 | 1 | 1.0%
78764 | 1 | 1.0%
78763 | 1 | 1.0%
78762 | 1 | 1.0%
78760 | 1 | 1.0%
78758 | 1 | 1.0%
76080 | 1 | 1.0%

tour_nd_trnsport_goods_nm
Text

Distinct: 87
Distinct (%): 87.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 39
Median length: 33
Mean length: 24.96
Min length: 11

Characters and Unicode

Total characters: 2496
Distinct characters: 236
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
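
As an illustration of these character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories in one product name from the Sample section. The mapping from two-letter codes to the labels used in this report is written out by hand and covers only the eleven categories that appear here.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general categories mapped to the labels used in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Ll": "Lowercase Letter",
    "Nd": "Decimal Number", "Zs": "Space Separator",
    "Ps": "Open Punctuation", "Pe": "Close Punctuation",
    "Po": "Other Punctuation", "Pc": "Connector Punctuation",
    "Sm": "Math Symbol", "So": "Other Symbol",
}

def category_counts(text):
    """Count the characters of `text` per Unicode general category."""
    counts = Counter()
    for ch in text:
        code = unicodedata.category(ch)
        counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

counts = category_counts("[강원 강릉] 강릉투어패스 1+1 Event")
for name, count in counts.most_common():
    print(name, count)
```

Hangul syllables fall under Other Letter, which is why that category dominates the tables that follow.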

Unique

Unique: 77
Unique (%): 77.0%

Sample

1st row: [제주] 추석투어패스
2nd row: [충북] 힐링투어패스 + 충주 라이트월드
3rd row: [강원 강릉] 강릉시티투어 3코스 바다부채길
4th row: [강원 강릉] 강릉시티투어 2코스 바다열차
5th row: [강원 강릉] 정동심곡 부채길+강릉항커피거리+솔향수목원 ★ 당일여행

Value | Count | Frequency (%)
강원 | 82 | 16.7%
 | 20 | 4.1%
용평리조트 | 18 | 3.7%
평창 | 14 | 2.9%
강원투어패스 | 12 | 2.4%
투어패스 | 12 | 2.4%
삼척투어패스 | 12 | 2.4%
1+1 | 9 | 1.8%
 | 8 | 1.6%
정선평창 | 8 | 1.6%
Other values (160) | 295 | 60.2%

Most occurring characters

Value | Count | Frequency (%)
(space) | 391 | 15.7%
 | 117 | 4.7%
 | 113 | 4.5%
[ | 102 | 4.1%
] | 102 | 4.1%
 | 76 | 3.0%
 | 74 | 3.0%
 | 70 | 2.8%
 | 66 | 2.6%
1 | 52 | 2.1%
Other values (226) | 1333 | 53.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1586 | 63.5%
Space Separator | 391 | 15.7%
Open Punctuation | 126 | 5.0%
Close Punctuation | 126 | 5.0%
Decimal Number | 88 | 3.5%
Math Symbol | 56 | 2.2%
Uppercase Letter | 47 | 1.9%
Other Punctuation | 31 | 1.2%
Lowercase Letter | 28 | 1.1%
Other Symbol | 16 | 0.6%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 117 | 7.4%
 | 113 | 7.1%
 | 76 | 4.8%
 | 74 | 4.7%
 | 70 | 4.4%
 | 66 | 4.2%
 | 51 | 3.2%
 | 40 | 2.5%
 | 27 | 1.7%
 | 25 | 1.6%
Other values (190) | 927 | 58.4%

Uppercase Letter
Value | Count | Frequency (%)
K | 10 | 21.3%
G | 10 | 21.3%
P | 10 | 21.3%
E | 7 | 14.9%
X | 4 | 8.5%
S | 1 | 2.1%
B | 1 | 2.1%
F | 1 | 2.1%
Z | 1 | 2.1%
M | 1 | 2.1%

Decimal Number
Value | Count | Frequency (%)
1 | 52 | 59.1%
2 | 13 | 14.8%
5 | 9 | 10.2%
8 | 6 | 6.8%
4 | 4 | 4.5%
3 | 3 | 3.4%
7 | 1 | 1.1%

Other Punctuation
Value | Count | Frequency (%)
/ | 18 | 58.1%
& | 10 | 32.3%
: | 2 | 6.5%
, | 1 | 3.2%

Lowercase Letter
Value | Count | Frequency (%)
v | 7 | 25.0%
n | 7 | 25.0%
e | 7 | 25.0%
t | 7 | 25.0%

Open Punctuation
Value | Count | Frequency (%)
[ | 102 | 81.0%
( | 24 | 19.0%

Close Punctuation
Value | Count | Frequency (%)
] | 102 | 81.0%
) | 24 | 19.0%

Math Symbol
Value | Count | Frequency (%)
+ | 45 | 80.4%
~ | 11 | 19.6%

Other Symbol
Value | Count | Frequency (%)
 | 14 | 87.5%
 | 2 | 12.5%

Space Separator
Value | Count | Frequency (%)
(space) | 391 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1586 | 63.5%
Common | 835 | 33.5%
Latin | 75 | 3.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 117 | 7.4%
 | 113 | 7.1%
 | 76 | 4.8%
 | 74 | 4.7%
 | 70 | 4.4%
 | 66 | 4.2%
 | 51 | 3.2%
 | 40 | 2.5%
 | 27 | 1.7%
 | 25 | 1.6%
Other values (190) | 927 | 58.4%

Common
Value | Count | Frequency (%)
(space) | 391 | 46.8%
[ | 102 | 12.2%
] | 102 | 12.2%
1 | 52 | 6.2%
+ | 45 | 5.4%
( | 24 | 2.9%
) | 24 | 2.9%
/ | 18 | 2.2%
 | 14 | 1.7%
2 | 13 | 1.6%
Other values (11) | 50 | 6.0%

Latin
Value | Count | Frequency (%)
K | 10 | 13.3%
G | 10 | 13.3%
P | 10 | 13.3%
v | 7 | 9.3%
n | 7 | 9.3%
E | 7 | 9.3%
e | 7 | 9.3%
t | 7 | 9.3%
X | 4 | 5.3%
S | 1 | 1.3%
Other values (5) | 5 | 6.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1586 | 63.5%
ASCII | 894 | 35.8%
Misc Symbols | 16 | 0.6%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 391 | 43.7%
[ | 102 | 11.4%
] | 102 | 11.4%
1 | 52 | 5.8%
+ | 45 | 5.0%
( | 24 | 2.7%
) | 24 | 2.7%
/ | 18 | 2.0%
2 | 13 | 1.5%
~ | 11 | 1.2%
Other values (24) | 112 | 12.5%

Hangul
Value | Count | Frequency (%)
 | 117 | 7.4%
 | 113 | 7.1%
 | 76 | 4.8%
 | 74 | 4.7%
 | 70 | 4.4%
 | 66 | 4.2%
 | 51 | 3.2%
 | 40 | 2.5%
 | 27 | 1.7%
 | 25 | 1.6%
Other values (190) | 927 | 58.4%

Misc Symbols
Value | Count | Frequency (%)
 | 14 | 87.5%
 | 2 | 12.5%

tour_nd_trnsport_goods_addr
Categorical

HIGH CORRELATION 

Distinct: 44
Distinct (%): 44.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
강원 평창군 대관령면 올림픽로 715 | 18
강원도 강릉시 사임당로 129 | 10
강원 정선군 화암면 화암동굴길 12-8 | 9
강원 삼척시 신기면 환선로 800 | 6
강원 영월군 김삿갓면 진별리 산 262 | 4
Other values (39) | 53

Length

Max length: 22
Median length: 20
Mean length: 17.43
Min length: 4

Unique

Unique: 27
Unique (%): 27.0%

Sample

1st row: 제주특별자치도
2nd row: 충북 충주시 남한강로 24
3rd row: 강원 강릉시 용지로 176
4th row: 강원 강릉시 해안로535번길 11
5th row: 강원 강릉시 강동면 심곡리 114-3

Common Values

Value | Count | Frequency (%)
강원 평창군 대관령면 올림픽로 715 | 18 | 18.0%
강원도 강릉시 사임당로 129 | 10 | 10.0%
강원 정선군 화암면 화암동굴길 12-8 | 9 | 9.0%
강원 삼척시 신기면 환선로 800 | 6 | 6.0%
강원 영월군 김삿갓면 진별리 산 262 | 4 | 4.0%
강원 춘천시 남산면 남이섬길 1 | 3 | 3.0%
강원 강릉시 | 3 | 3.0%
강원 정선군 북평면 어도원길 12 | 2 | 2.0%
강원 강릉시 해안로535번길 11 | 2 | 2.0%
강원 동해시 추암동 474-5 | 2 | 2.0%
Other values (34) | 41 | 41.0%
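
One plausible reason for the high correlation between this variable and area_nm is that, in the rows shown in this report, the address begins with the same two words as area_nm. The sketch below extracts that prefix. It is checked only against sample rows, not against all 100 observations, and `area_from_address` is a hypothetical helper.

```python
def area_from_address(address):
    """Return the first two whitespace-separated words of an address, or None."""
    parts = address.split()
    return " ".join(parts[:2]) if len(parts) >= 2 else None

# (area_nm, address) pairs taken from the Sample section of the report.
pairs = [
    ("충북 충주시", "충북 충주시 남한강로 24"),
    ("강원 강릉시", "강원 강릉시 용지로 176"),
    ("강원 강릉시", "강원 강릉시"),
]
matches = [area_from_address(addr) == area for area, addr in pairs]
print(matches)  # [True, True, True]
```

A single-word address such as 제주특별자치도 yields None, which lines up with the one sample row whose area_nm is shown as <NA>.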

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
강원 | 75 | 16.8%
강원도 | 20 | 4.5%
평창군 | 20 | 4.5%
올림픽로 | 18 | 4.0%
715 | 18 | 4.0%
대관령면 | 18 | 4.0%
강릉시 | 18 | 4.0%
정선군 | 13 | 2.9%
삼척시 | 11 | 2.5%
사임당로 | 10 | 2.2%
Other values (97) | 225 | 50.4%

tour_nd_trnsport_goods_rnewl_de
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Value | Count
20211130 | 100

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20211130
2nd row: 20211130
3rd row: 20211130
4th row: 20211130
5th row: 20211130

Common Values

Value | Count | Frequency (%)
20211130 | 100 | 100.0%
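
The constant value has the shape of a date written as YYYYMMDD. Assuming that reading is correct (the report itself treats the column as categorical and does not confirm it), the value could be parsed as follows:

```python
from datetime import datetime

raw = "20211130"

# Assumption: the eight digits encode year, month and day.
parsed = datetime.strptime(raw, "%Y%m%d").date()
print(parsed.isoformat())  # 2021-11-30
```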

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
20211130 | 100 | 100.0%

Interactions


Correlations

Variable | area_nm | tour_nd_trnsport_goods_cd | tour_nd_trnsport_goods_nm | tour_nd_trnsport_goods_addr
area_nm | 1.000 | 0.867 | 0.985 | 1.000
tour_nd_trnsport_goods_cd | 0.867 | 1.000 | 0.954 | 0.936
tour_nd_trnsport_goods_nm | 0.985 | 0.954 | 1.000 | 0.999
tour_nd_trnsport_goods_addr | 1.000 | 0.936 | 0.999 | 1.000

Variable | tour_nd_trnsport_goods_addr | area_nm
tour_nd_trnsport_goods_addr | 1.000 | 0.842
area_nm | 0.842 | 1.000

Variable | tour_nd_trnsport_goods_cd | area_nm | tour_nd_trnsport_goods_addr
tour_nd_trnsport_goods_cd | 1.000 | 0.444 | 0.536
area_nm | 0.444 | 1.000 | 0.842
tour_nd_trnsport_goods_addr | 0.536 | 0.842 | 1.000
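
The extracted text does not say which coefficient each matrix uses. One standard measure of association between two categorical variables is Cramér's V, sketched below in its uncorrected form as an illustration only. It is not claimed to reproduce any of the values above, and the function and the toy inputs are assumptions made for the example.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long lists of categories."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Perfectly associated toy columns give 1.0; independent ones give 0.0.
print(cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"]))  # 1.0
print(cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"]))  # 0.0
```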

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
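
The report counts 0 missing cells, yet the first sample row of area_nm shows <NA> and the minimum length of that variable is 4, which is consistent with a literal four-character string rather than a true null. Under that assumption, a minimal sketch for counting such placeholder strings is shown below (the placeholder list is hypothetical and should be adapted to the data):

```python
# Hypothetical set of strings that stand in for a missing value.
PLACEHOLDERS = {"<NA>", "NA", "N/A", "null", ""}

def count_placeholders(values):
    """Count string values that look like a missing-value placeholder."""
    return sum(
        1 for v in values
        if isinstance(v, str) and v.strip() in PLACEHOLDERS
    )

# The first five area_nm values shown in the report.
sample = ["<NA>", "충북 충주시", "강원 강릉시", "강원 강릉시", "강원 강릉시"]
n_placeholders = count_placeholders(sample)
print(n_placeholders)  # 1
```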

Sample

First rows

Index | area_nm | tour_nd_trnsport_goods_cd | tour_nd_trnsport_goods_nm | tour_nd_trnsport_goods_addr | tour_nd_trnsport_goods_rnewl_de
0 | <NA> | 60843 | [제주] 추석투어패스 | 제주특별자치도 | 20211130
1 | 충북 충주시 | 41465 | [충북] 힐링투어패스 + 충주 라이트월드 | 충북 충주시 남한강로 24 | 20211130
2 | 강원 강릉시 | 37250 | [강원 강릉] 강릉시티투어 3코스 바다부채길 | 강원 강릉시 용지로 176 | 20211130
3 | 강원 강릉시 | 40050 | [강원 강릉] 강릉시티투어 2코스 바다열차 | 강원 강릉시 해안로535번길 11 | 20211130
4 | 강원 강릉시 | 40112 | [강원 강릉] 정동심곡 부채길+강릉항커피거리+솔향수목원 ★ 당일여행 | 강원 강릉시 강동면 심곡리 114-3 | 20211130
5 | 강원 강릉시 | 41942 | [강원 강릉] 정동진&하늘목장 ★ 당일여행 | 강원 강릉시 강동면 정동역길 17 | 20211130
6 | 강원 강릉시 | 43108 | [강원 강릉] 강릉시티투어+바다열차 숙박패키지(1박) | 강원 강릉시 해안로535번길 11 | 20211130
7 | 충북 충주시 | 42566 | [충북] 힐링투어패스 + 충주 라이트월드 | 충북 충주시 남한강로 24 | 20211130
8 | 강원 강릉시 | 58365 | [강원 강릉] 강릉투어패스 1+1 Event | 강원 강릉시 | 20211130
9 | 강원 강릉시 | 58439 | [강원 춘천] 춘천투어패스 1+1 Event | 강원 강릉시 | 20211130

Last rows

Index | area_nm | tour_nd_trnsport_goods_cd | tour_nd_trnsport_goods_nm | tour_nd_trnsport_goods_addr | tour_nd_trnsport_goods_rnewl_de
90 | 강원도 고성군 | 72686 | [해양관광] 강원 고성 봉수대 호핑투어 PKG | 강원도 고성군 토성면 고성대로 77-24 | 20211130
91 | 강원도 고성군 | 74630 | [강원 고성] 강원투어패스 고성 봉수대해수욕장 (~8/23) | 강원도 고성군 죽왕면 오호리 | 20211130
92 | 강원도 고성군 | 74631 | [강원 고성] 강원투어패스 봉수대해수욕장 호핑투어 | 강원도 고성군 죽왕면 오호리 | 20211130
93 | 강원도 동해시 | 74629 | [강원 동해] 동해투어패스 망상해수욕장 | 강원도 동해시 | 20211130
94 | 강원도 양구군 | 68875 | [강원 양구] 진목어촌체험마을 숙박 | 강원도 양구군 양구읍 소양호로 2522 | 20211130
95 | 강원도 양구군 | 68876 | [강원 양구] 진목어촌체험마을 매운탕 | 강원도 양구군 양구읍 소양호로 2522 | 20211130
96 | 강원도 양양군 | 68493 | [강원 양양] 수산어촌체험마을 갈매기펜션 | 강원도 양양군 손양면 문화마을길 5 | 20211130
97 | 강원도 양양군 | 68498 | [강원 양양] 수산어촌체험마을 연어의집 펜션 | 강원도 양양군 손양면 수산1길 20-16 | 20211130
98 | 강원도 양양군 | 76080 | [강원 양양] 가비마린 요트투어 | 강원도 양양군 손양면 수산1길 20-24 | 20211130
99 | 강원도 정선군 | 65689 | [강원] 정선평창 투어패스 | 강원도 정선군 화암면 화암동굴길 12-8 | 20211130