Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory732.4 KiB
Average record size in memory75.0 B

Variable types

Numeric3
Categorical3
Text2

Dataset

Description경상남도 밀양시 관광상품카데고리별해시태그에 대한 자료로, 관광상품번호, 관광상품분류, 관광상품명, 해시태그, 해시태그분위기에 대한 정보를 제공합니다.
Author경상남도 밀양시
URLhttps://www.data.go.kr/data/15111117/fileData.do

Alerts

생성일 has constant value ""Constant
관광상품번호(NameID) is highly overall correlated with 관광상품분류High correlation
관광상품분류 is highly overall correlated with 관광상품번호(NameID)High correlation
번호(ID) has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:30:45.673405
Analysis finished2023-12-12 05:30:48.251385
Duration2.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호(ID)
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8005988.7
Minimum8000001
Maximum8012012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T14:30:48.338625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8000001
5-th percentile8000608
Q18002991.8
median8005968.5
Q38008964.5
95-th percentile8011384
Maximum8012012
Range12011
Interquartile range (IQR)5972.75

Descriptive statistics

Standard deviation3453.2652
Coefficient of variation (CV)0.00043133525
Kurtosis-1.1937359
Mean8005988.7
Median Absolute Deviation (MAD)2986.5
Skewness0.0059536249
Sum8.0059887 × 1010
Variance11925040
MonotonicityNot monotonic
2023-12-12T14:30:48.527846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8004290 1
 
< 0.1%
8008190 1
 
< 0.1%
8005693 1
 
< 0.1%
8005571 1
 
< 0.1%
8008424 1
 
< 0.1%
8001880 1
 
< 0.1%
8004430 1
 
< 0.1%
8001235 1
 
< 0.1%
8001044 1
 
< 0.1%
8008232 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
8000001 1
< 0.1%
8000002 1
< 0.1%
8000003 1
< 0.1%
8000004 1
< 0.1%
8000006 1
< 0.1%
8000007 1
< 0.1%
8000008 1
< 0.1%
8000009 1
< 0.1%
8000010 1
< 0.1%
8000011 1
< 0.1%
ValueCountFrequency (%)
8012012 1
< 0.1%
8012011 1
< 0.1%
8012010 1
< 0.1%
8012008 1
< 0.1%
8012007 1
< 0.1%
8012006 1
< 0.1%
8012005 1
< 0.1%
8012003 1
< 0.1%
8012002 1
< 0.1%
8012001 1
< 0.1%

관광상품번호(NameID)
Real number (ℝ)

HIGH CORRELATION 

Distinct905
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3230623.3
Minimum1000001
Maximum4001470
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T14:30:48.693751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000001
5-th percentile1000034
Q13000098
median4000132
Q34000728
95-th percentile4001315
Maximum4001470
Range3001469
Interquartile range (IQR)1000630

Descriptive statistics

Standard deviation1066229.8
Coefficient of variation (CV)0.33003841
Kurtosis0.17184821
Mean3230623.3
Median Absolute Deviation (MAD)1181
Skewness-1.244535
Sum3.2306233 × 1010
Variance1.1368459 × 1012
MonotonicityNot monotonic
2023-12-12T14:30:48.893500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4000728 41
 
0.4%
1000071 38
 
0.4%
1000003 37
 
0.4%
4000605 37
 
0.4%
4000353 36
 
0.4%
1000063 36
 
0.4%
1000009 35
 
0.4%
1000005 34
 
0.3%
4000267 34
 
0.3%
4001462 34
 
0.3%
Other values (895) 9638
96.4%
ValueCountFrequency (%)
1000001 13
 
0.1%
1000003 37
0.4%
1000004 14
 
0.1%
1000005 34
0.3%
1000006 11
 
0.1%
1000007 19
0.2%
1000009 35
0.4%
1000010 18
0.2%
1000011 10
 
0.1%
1000012 12
 
0.1%
ValueCountFrequency (%)
4001470 16
0.2%
4001469 18
0.2%
4001468 12
 
0.1%
4001466 8
 
0.1%
4001463 14
0.1%
4001462 34
0.3%
4001461 4
 
< 0.1%
4001460 6
 
0.1%
4001458 6
 
0.1%
4001451 3
 
< 0.1%

관광상품분류
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반음식점
5164 
숙박업소
2812 
관광지
1568 
모범음식점
 
365
문화축제
 
91

Length

Max length5
Median length5
Mean length4.3961
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row관광지
2nd row일반음식점
3rd row일반음식점
4th row숙박업소
5th row관광지

Common Values

ValueCountFrequency (%)
일반음식점 5164
51.6%
숙박업소 2812
28.1%
관광지 1568
 
15.7%
모범음식점 365
 
3.6%
문화축제 91
 
0.9%

Length

2023-12-12T14:30:49.109541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:30:49.286211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 5164
51.6%
숙박업소 2812
28.1%
관광지 1568
 
15.7%
모범음식점 365
 
3.6%
문화축제 91
 
0.9%
Distinct905
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T14:30:49.560804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length19
Mean length6.1574
Min length1

Characters and Unicode

Total characters61574
Distinct characters576
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)0.4%

Sample

1st row밀양시국궁장
2nd row막창꾼
3rd row미소육개장
4th row밀양밸리리조트
5th row영남루
ValueCountFrequency (%)
밀양점 234
 
2.0%
카페 103
 
0.9%
밀양삼문점 77
 
0.7%
키즈풀빌라 74
 
0.6%
풀빌라 73
 
0.6%
캠핑장 67
 
0.6%
밀양 67
 
0.6%
펜션 66
 
0.6%
표충사 55
 
0.5%
밀양본점 55
 
0.5%
Other values (963) 10837
92.6%
2023-12-12T14:30:50.063909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1922
 
3.1%
1769
 
2.9%
1708
 
2.8%
1325
 
2.2%
1103
 
1.8%
1069
 
1.7%
1052
 
1.7%
884
 
1.4%
844
 
1.4%
809
 
1.3%
Other values (566) 49089
79.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56691
92.1%
Space Separator 1708
 
2.8%
Decimal Number 1075
 
1.7%
Lowercase Letter 964
 
1.6%
Uppercase Letter 336
 
0.5%
Open Punctuation 280
 
0.5%
Close Punctuation 280
 
0.5%
Other Punctuation 240
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1922
 
3.4%
1769
 
3.1%
1325
 
2.3%
1103
 
1.9%
1069
 
1.9%
1052
 
1.9%
884
 
1.6%
844
 
1.5%
809
 
1.4%
785
 
1.4%
Other values (517) 45129
79.6%
Uppercase Letter
ValueCountFrequency (%)
D 82
24.4%
C 51
15.2%
R 38
11.3%
M 27
 
8.0%
B 22
 
6.5%
A 16
 
4.8%
G 15
 
4.5%
V 14
 
4.2%
I 14
 
4.2%
P 14
 
4.2%
Other values (7) 43
12.8%
Lowercase Letter
ValueCountFrequency (%)
o 168
17.4%
e 146
15.1%
f 132
13.7%
a 86
8.9%
s 76
7.9%
g 67
 
7.0%
r 62
 
6.4%
t 52
 
5.4%
n 44
 
4.6%
c 42
 
4.4%
Other values (5) 89
9.2%
Decimal Number
ValueCountFrequency (%)
1 256
23.8%
9 169
15.7%
8 147
13.7%
3 144
13.4%
0 95
 
8.8%
4 81
 
7.5%
2 66
 
6.1%
5 62
 
5.8%
7 55
 
5.1%
Other Punctuation
ValueCountFrequency (%)
& 170
70.8%
/ 44
 
18.3%
: 12
 
5.0%
, 9
 
3.8%
. 5
 
2.1%
Space Separator
ValueCountFrequency (%)
1708
100.0%
Open Punctuation
ValueCountFrequency (%)
( 280
100.0%
Close Punctuation
ValueCountFrequency (%)
) 280
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56691
92.1%
Common 3583
 
5.8%
Latin 1300
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1922
 
3.4%
1769
 
3.1%
1325
 
2.3%
1103
 
1.9%
1069
 
1.9%
1052
 
1.9%
884
 
1.6%
844
 
1.5%
809
 
1.4%
785
 
1.4%
Other values (517) 45129
79.6%
Latin
ValueCountFrequency (%)
o 168
12.9%
e 146
 
11.2%
f 132
 
10.2%
a 86
 
6.6%
D 82
 
6.3%
s 76
 
5.8%
g 67
 
5.2%
r 62
 
4.8%
t 52
 
4.0%
C 51
 
3.9%
Other values (22) 378
29.1%
Common
ValueCountFrequency (%)
1708
47.7%
( 280
 
7.8%
) 280
 
7.8%
1 256
 
7.1%
& 170
 
4.7%
9 169
 
4.7%
8 147
 
4.1%
3 144
 
4.0%
0 95
 
2.7%
4 81
 
2.3%
Other values (7) 253
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56691
92.1%
ASCII 4883
 
7.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1922
 
3.4%
1769
 
3.1%
1325
 
2.3%
1103
 
1.9%
1069
 
1.9%
1052
 
1.9%
884
 
1.6%
844
 
1.5%
809
 
1.4%
785
 
1.4%
Other values (517) 45129
79.6%
ASCII
ValueCountFrequency (%)
1708
35.0%
( 280
 
5.7%
) 280
 
5.7%
1 256
 
5.2%
& 170
 
3.5%
9 169
 
3.5%
o 168
 
3.4%
8 147
 
3.0%
e 146
 
3.0%
3 144
 
2.9%
Other values (39) 1415
29.0%
Distinct150
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T14:30:50.427090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length3.2186
Min length1

Characters and Unicode

Total characters32186
Distinct characters257
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row외국인국내관광
2nd row전통주
3rd row비오는날가기좋은
4th row수영장
5th row시끌벅적한
ValueCountFrequency (%)
가족과함께 507
 
5.1%
맛집 445
 
4.5%
힐링 364
 
3.6%
편안한 292
 
2.9%
데이트 281
 
2.8%
단체 256
 
2.6%
감성 246
 
2.5%
아이 220
 
2.2%
모임 210
 
2.1%
뷰맛집 208
 
2.1%
Other values (140) 6971
69.7%
2023-12-12T14:30:50.918182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1357
 
4.2%
1209
 
3.8%
1093
 
3.4%
845
 
2.6%
828
 
2.6%
542
 
1.7%
507
 
1.6%
507
 
1.6%
507
 
1.6%
507
 
1.6%
Other values (247) 24284
75.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31546
98.0%
Uppercase Letter 316
 
1.0%
Other Punctuation 206
 
0.6%
Close Punctuation 56
 
0.2%
Open Punctuation 56
 
0.2%
Decimal Number 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1357
 
4.3%
1209
 
3.8%
1093
 
3.5%
845
 
2.7%
828
 
2.6%
542
 
1.7%
507
 
1.6%
507
 
1.6%
507
 
1.6%
507
 
1.6%
Other values (238) 23644
75.0%
Uppercase Letter
ValueCountFrequency (%)
Z 149
47.2%
M 149
47.2%
S 12
 
3.8%
N 6
 
1.9%
Other Punctuation
ValueCountFrequency (%)
& 113
54.9%
/ 93
45.1%
Close Punctuation
ValueCountFrequency (%)
) 56
100.0%
Open Punctuation
ValueCountFrequency (%)
( 56
100.0%
Decimal Number
ValueCountFrequency (%)
8 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31546
98.0%
Common 324
 
1.0%
Latin 316
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1357
 
4.3%
1209
 
3.8%
1093
 
3.5%
845
 
2.7%
828
 
2.6%
542
 
1.7%
507
 
1.6%
507
 
1.6%
507
 
1.6%
507
 
1.6%
Other values (238) 23644
75.0%
Common
ValueCountFrequency (%)
& 113
34.9%
/ 93
28.7%
) 56
17.3%
( 56
17.3%
8 6
 
1.9%
Latin
ValueCountFrequency (%)
Z 149
47.2%
M 149
47.2%
S 12
 
3.8%
N 6
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31546
98.0%
ASCII 640
 
2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1357
 
4.3%
1209
 
3.8%
1093
 
3.5%
845
 
2.7%
828
 
2.6%
542
 
1.7%
507
 
1.6%
507
 
1.6%
507
 
1.6%
507
 
1.6%
Other values (238) 23644
75.0%
ASCII
ValueCountFrequency (%)
Z 149
23.3%
M 149
23.3%
& 113
17.7%
/ 93
14.5%
) 56
 
8.8%
( 56
 
8.8%
S 12
 
1.9%
8 6
 
0.9%
N 6
 
0.9%
Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
방문목적
4406 
분위기
2179 
동반유형
2112 
편의시설
945 
방문시기
 
358

Length

Max length4
Median length4
Mean length3.7821
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row방문목적
2nd row방문목적
3rd row방문목적
4th row편의시설
5th row분위기

Common Values

ValueCountFrequency (%)
방문목적 4406
44.1%
분위기 2179
21.8%
동반유형 2112
21.1%
편의시설 945
 
9.4%
방문시기 358
 
3.6%

Length

2023-12-12T14:30:51.075089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:30:51.180798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
방문목적 4406
44.1%
분위기 2179
21.8%
동반유형 2112
21.1%
편의시설 945
 
9.4%
방문시기 358
 
3.6%

정렬순서
Real number (ℝ)

Distinct48
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.1703
Minimum1
Maximum48
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T14:30:51.316305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median8
Q314
95-th percentile26
Maximum48
Range47
Interquartile range (IQR)10

Descriptive statistics

Standard deviation7.8711558
Coefficient of variation (CV)0.77393546
Kurtosis1.3281432
Mean10.1703
Median Absolute Deviation (MAD)5
Skewness1.1722177
Sum101703
Variance61.955093
MonotonicityNot monotonic
2023-12-12T14:30:51.482101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
1 758
 
7.6%
3 721
 
7.2%
2 721
 
7.2%
4 666
 
6.7%
5 595
 
5.9%
6 569
 
5.7%
8 523
 
5.2%
7 521
 
5.2%
9 489
 
4.9%
10 485
 
4.9%
Other values (38) 3952
39.5%
ValueCountFrequency (%)
1 758
7.6%
2 721
7.2%
3 721
7.2%
4 666
6.7%
5 595
5.9%
6 569
5.7%
7 521
5.2%
8 523
5.2%
9 489
4.9%
10 485
4.9%
ValueCountFrequency (%)
48 1
 
< 0.1%
47 2
 
< 0.1%
46 3
 
< 0.1%
45 2
 
< 0.1%
44 5
0.1%
43 3
 
< 0.1%
42 7
0.1%
41 8
0.1%
40 7
0.1%
39 8
0.1%

생성일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022-11-21
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-11-21
2nd row2022-11-21
3rd row2022-11-21
4th row2022-11-21
5th row2022-11-21

Common Values

ValueCountFrequency (%)
2022-11-21 10000
100.0%

Length

2023-12-12T14:30:51.635201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:30:51.736580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-11-21 10000
100.0%

Interactions

2023-12-12T14:30:47.594470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:30:46.761030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:30:47.200938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:30:47.702605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:30:46.898275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:30:47.350676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:30:47.837739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:30:47.046971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:30:47.479324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:30:51.801127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호(ID)관광상품번호(NameID)관광상품분류해시태그분류정렬순서
번호(ID)1.0000.3140.3910.1320.116
관광상품번호(NameID)0.3141.0001.0000.3700.212
관광상품분류0.3911.0001.0000.6110.262
해시태그분류0.1320.3700.6111.0000.279
정렬순서0.1160.2120.2620.2791.000
2023-12-12T14:30:51.914722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해시태그분류관광상품분류
해시태그분류1.0000.268
관광상품분류0.2681.000
2023-12-12T14:30:52.006550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호(ID)관광상품번호(NameID)정렬순서관광상품분류해시태그분류
번호(ID)1.000-0.013-0.0100.1730.055
관광상품번호(NameID)-0.0131.000-0.1761.0000.310
정렬순서-0.010-0.1761.0000.1120.119
관광상품분류0.1731.0000.1121.0000.268
해시태그분류0.0550.3100.1190.2681.000

Missing values

2023-12-12T14:30:47.987145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:30:48.169645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호(ID)관광상품번호(NameID)관광상품분류관광상품명해시태그해시태그분류정렬순서생성일
428980042901000031관광지밀양시국궁장외국인국내관광방문목적32022-11-21
287680028774001205일반음식점막창꾼전통주방문목적52022-11-21
349480034954000619일반음식점미소육개장비오는날가기좋은방문목적52022-11-21
419680041973000189숙박업소밀양밸리리조트수영장편의시설112022-11-21
798380079841000063관광지영남루시끌벅적한분위기162022-11-21
816980081704000296일반음식점옛날꼼장어집단체동반유형112022-11-21
726080072614000149일반음식점암새들뷰맛집방문목적32022-11-21
773780077384001221일반음식점에르모사편안한분위기132022-11-21
410180041023000144숙박업소밀양로그펜션고급스러운분위기112022-11-21
1029680102974000827일반음식점카페달리아 삼랑진깔끔한분위기82022-11-21
번호(ID)관광상품번호(NameID)관광상품분류관광상품명해시태그해시태그분류정렬순서생성일
38580003864000683일반음식점가자미가힐링분위기52022-11-21
1105980110603000163숙박업소푸른계곡펜션고급스러운분위기192022-11-21
1053080105314000252일반음식점카페평리조용한분위기192022-11-21
384080038411000100관광지밀양교동손씨고가역사방문목적42022-11-21
954480095453000106숙박업소좋은카라반효도여행동반유형102022-11-21
1049980105004001419일반음식점카페테라스뷰맛집방문목적12022-11-21
1019080101914000739일반음식점카페 잇소디저트&간식방문목적232022-11-21
834980083504000073일반음식점온오프뷰맛집방문목적22022-11-21
392880039291000029관광지밀양댐생태공원아이동반동반유형82022-11-21
928680092874000150일반음식점장어본가얼큰한방문목적22022-11-21