Overview

Dataset statistics

Number of variables11
Number of observations207
Missing cells2
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory18.7 KiB
Average record size in memory92.6 B

Variable types

Numeric3
Categorical2
Text1
Boolean3
DateTime2

Dataset

Description제주관광정보시스템(VISIT JEJU)의 태그관리 정보로 태그아이디, 언어, 상위태그아이디, 태그명, 깊이, 정렬순서등의 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15118457/fileData.do

Alerts

사용여부 has constant value ""Constant
주요태그노출여부 is highly overall correlated with 깊이 and 1 other fieldsHigh correlation
깊이 is highly overall correlated with 상위태그아이디 and 2 other fieldsHigh correlation
태그아이디 is highly overall correlated with 상위태그아이디 and 1 other fieldsHigh correlation
상위태그아이디 is highly overall correlated with 태그아이디 and 2 other fieldsHigh correlation
언어 is highly overall correlated with 태그아이디 and 1 other fieldsHigh correlation
여행큐레이션노출여부 is highly overall correlated with 깊이 and 1 other fieldsHigh correlation
태그아이디 has unique valuesUnique
상위태그아이디 has 32 (15.5%) zerosZeros
정렬순서 has 21 (10.1%) zerosZeros

Reproduction

Analysis started2023-12-12 23:13:06.226504
Analysis finished2023-12-12 23:13:07.916096
Duration1.69 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

태그아이디
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct207
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean242.69565
Minimum101
Maximum373
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-13T08:13:07.995990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum101
5-th percentile111.6
Q1163.5
median253
Q3316.5
95-th percentile360.7
Maximum373
Range272
Interquartile range (IQR)153

Descriptive statistics

Standard deviation83.573694
Coefficient of variation (CV)0.34435596
Kurtosis-1.3301818
Mean242.69565
Median Absolute Deviation (MAD)74
Skewness-0.15729079
Sum50238
Variance6984.5623
MonotonicityNot monotonic
2023-12-13T08:13:08.134314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
347 1
 
0.5%
351 1
 
0.5%
212 1
 
0.5%
213 1
 
0.5%
214 1
 
0.5%
215 1
 
0.5%
216 1
 
0.5%
316 1
 
0.5%
218 1
 
0.5%
219 1
 
0.5%
Other values (197) 197
95.2%
ValueCountFrequency (%)
101 1
0.5%
102 1
0.5%
103 1
0.5%
104 1
0.5%
105 1
0.5%
106 1
0.5%
107 1
0.5%
108 1
0.5%
109 1
0.5%
110 1
0.5%
ValueCountFrequency (%)
373 1
0.5%
370 1
0.5%
369 1
0.5%
368 1
0.5%
367 1
0.5%
366 1
0.5%
365 1
0.5%
364 1
0.5%
363 1
0.5%
362 1
0.5%

언어
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
국문
41 
말레이문
39 
일문
35 
영문
34 
중문간체
29 

Length

Max length4
Median length2
Mean length2.9371981
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국문
2nd row영문
3rd row중문간체
4th row중문간체
5th row중문간체

Common Values

ValueCountFrequency (%)
국문 41
19.8%
말레이문 39
18.8%
일문 35
16.9%
영문 34
16.4%
중문간체 29
14.0%
중문번체 29
14.0%

Length

2023-12-13T08:13:08.288291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:13:08.446124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국문 41
19.8%
말레이문 39
18.8%
일문 35
16.9%
영문 34
16.4%
중문간체 29
14.0%
중문번체 29
14.0%

상위태그아이디
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct38
Distinct (%)18.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean173.90338
Minimum0
Maximum347
Zeros32
Zeros (%)15.5%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-13T08:13:08.593956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1110
median181
Q3259
95-th percentile312
Maximum347
Range347
Interquartile range (IQR)149

Descriptive statistics

Standard deviation99.447065
Coefficient of variation (CV)0.57185239
Kurtosis-0.80145772
Mean173.90338
Median Absolute Deviation (MAD)76
Skewness-0.34491194
Sum35998
Variance9889.7188
MonotonicityNot monotonic
2023-12-13T08:13:08.727444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=38)
ValueCountFrequency (%)
0 32
 
15.5%
296 12
 
5.8%
188 10
 
4.8%
110 9
 
4.3%
220 8
 
3.9%
181 8
 
3.9%
259 8
 
3.9%
142 8
 
3.9%
103 8
 
3.9%
266 7
 
3.4%
Other values (28) 97
46.9%
ValueCountFrequency (%)
0 32
15.5%
101 5
 
2.4%
103 8
 
3.9%
106 3
 
1.4%
110 9
 
4.3%
117 5
 
2.4%
123 3
 
1.4%
140 5
 
2.4%
142 8
 
3.9%
145 2
 
1.0%
ValueCountFrequency (%)
347 2
 
1.0%
323 2
 
1.0%
322 2
 
1.0%
318 3
 
1.4%
312 5
2.4%
305 6
2.9%
301 3
 
1.4%
296 12
5.8%
279 3
 
1.4%
273 2
 
1.0%
Distinct180
Distinct (%)87.8%
Missing2
Missing (%)1.0%
Memory size1.7 KiB
2023-12-13T08:13:09.153003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length18
Mean length4.7365854
Min length1

Characters and Unicode

Total characters971
Distinct characters247
Distinct categories7 ?
Distinct scripts6 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique161 ?
Unique (%)78.5%

Sample

1st row컨셉
2nd rowFour seasons
3rd row
4th row
5th row4.3
ValueCountFrequency (%)
4.3 5
 
2.2%
musim 5
 
2.2%
4
 
1.8%
3
 
1.3%
父母 2
 
0.9%
emas 2
 
0.9%
一行 2
 
0.9%
春天 2
 
0.9%
夏天 2
 
0.9%
中/老年 2
 
0.9%
Other values (185) 197
87.2%
2023-12-13T08:13:09.738685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 85
 
8.8%
e 56
 
5.8%
n 51
 
5.3%
i 42
 
4.3%
s 37
 
3.8%
r 37
 
3.8%
u 34
 
3.5%
l 24
 
2.5%
t 23
 
2.4%
/ 22
 
2.3%
Other values (237) 560
57.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 530
54.6%
Other Letter 301
31.0%
Uppercase Letter 74
 
7.6%
Other Punctuation 32
 
3.3%
Space Separator 21
 
2.2%
Decimal Number 10
 
1.0%
Dash Punctuation 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
2.7%
7
 
2.3%
6
 
2.0%
4
 
1.3%
4
 
1.3%
3
 
1.0%
3
 
1.0%
3
 
1.0%
3
 
1.0%
3
 
1.0%
Other values (186) 257
85.4%
Lowercase Letter
ValueCountFrequency (%)
a 85
16.0%
e 56
10.6%
n 51
9.6%
i 42
 
7.9%
s 37
 
7.0%
r 37
 
7.0%
u 34
 
6.4%
l 24
 
4.5%
t 23
 
4.3%
m 21
 
4.0%
Other values (14) 120
22.6%
Uppercase Letter
ValueCountFrequency (%)
S 12
16.2%
P 9
12.2%
M 9
12.2%
C 5
 
6.8%
W 5
 
6.8%
A 4
 
5.4%
T 4
 
5.4%
L 3
 
4.1%
E 3
 
4.1%
K 3
 
4.1%
Other values (10) 17
23.0%
Other Punctuation
ValueCountFrequency (%)
/ 22
68.8%
. 9
28.1%
, 1
 
3.1%
Decimal Number
ValueCountFrequency (%)
4 5
50.0%
3 5
50.0%
Space Separator
ValueCountFrequency (%)
21
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 604
62.2%
Han 161
 
16.6%
Hangul 103
 
10.6%
Common 66
 
6.8%
Katakana 29
 
3.0%
Hiragana 8
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3
 
2.9%
3
 
2.9%
3
 
2.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (76) 80
77.7%
Han
ValueCountFrequency (%)
8
 
5.0%
7
 
4.3%
6
 
3.7%
4
 
2.5%
4
 
2.5%
3
 
1.9%
3
 
1.9%
3
 
1.9%
3
 
1.9%
3
 
1.9%
Other values (70) 117
72.7%
Latin
ValueCountFrequency (%)
a 85
14.1%
e 56
 
9.3%
n 51
 
8.4%
i 42
 
7.0%
s 37
 
6.1%
r 37
 
6.1%
u 34
 
5.6%
l 24
 
4.0%
t 23
 
3.8%
m 21
 
3.5%
Other values (34) 194
32.1%
Katakana
ValueCountFrequency (%)
3
 
10.3%
2
 
6.9%
2
 
6.9%
2
 
6.9%
2
 
6.9%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
Other values (13) 13
44.8%
Common
ValueCountFrequency (%)
/ 22
33.3%
21
31.8%
. 9
13.6%
4 5
 
7.6%
3 5
 
7.6%
- 3
 
4.5%
, 1
 
1.5%
Hiragana
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 670
69.0%
CJK 161
 
16.6%
Hangul 103
 
10.6%
Katakana 29
 
3.0%
Hiragana 8
 
0.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 85
 
12.7%
e 56
 
8.4%
n 51
 
7.6%
i 42
 
6.3%
s 37
 
5.5%
r 37
 
5.5%
u 34
 
5.1%
l 24
 
3.6%
t 23
 
3.4%
/ 22
 
3.3%
Other values (41) 259
38.7%
CJK
ValueCountFrequency (%)
8
 
5.0%
7
 
4.3%
6
 
3.7%
4
 
2.5%
4
 
2.5%
3
 
1.9%
3
 
1.9%
3
 
1.9%
3
 
1.9%
3
 
1.9%
Other values (70) 117
72.7%
Katakana
ValueCountFrequency (%)
3
 
10.3%
2
 
6.9%
2
 
6.9%
2
 
6.9%
2
 
6.9%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
Other values (13) 13
44.8%
Hangul
ValueCountFrequency (%)
3
 
2.9%
3
 
2.9%
3
 
2.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
2
 
1.9%
Other values (76) 80
77.7%
Hiragana
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

깊이
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2
125 
3
50 
1
32 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row2
4th row2
5th row3

Common Values

ValueCountFrequency (%)
2 125
60.4%
3 50
 
24.2%
1 32
 
15.5%

Length

2023-12-13T08:13:09.906477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:13:10.026235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 125
60.4%
3 50
 
24.2%
1 32
 
15.5%

정렬순서
Real number (ℝ)

ZEROS 

Distinct11
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.5507246
Minimum0
Maximum10
Zeros21
Zeros (%)10.1%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-13T08:13:10.165017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median3
Q35.5
95-th percentile9
Maximum10
Range10
Interquartile range (IQR)4.5

Descriptive statistics

Standard deviation2.6814876
Coefficient of variation (CV)0.75519446
Kurtosis-0.37140631
Mean3.5507246
Median Absolute Deviation (MAD)2
Skewness0.67629382
Sum735
Variance7.1903757
MonotonicityNot monotonic
2023-12-13T08:13:10.293151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
1 34
16.4%
2 34
16.4%
3 30
14.5%
6 22
10.6%
0 21
10.1%
4 20
9.7%
5 16
7.7%
7 9
 
4.3%
8 8
 
3.9%
10 7
 
3.4%
ValueCountFrequency (%)
0 21
10.1%
1 34
16.4%
2 34
16.4%
3 30
14.5%
4 20
9.7%
5 16
7.7%
6 22
10.6%
7 9
 
4.3%
8 8
 
3.9%
9 6
 
2.9%
ValueCountFrequency (%)
10 7
 
3.4%
9 6
 
2.9%
8 8
 
3.9%
7 9
 
4.3%
6 22
10.6%
5 16
7.7%
4 20
9.7%
3 30
14.5%
2 34
16.4%
1 34
16.4%

사용여부
Boolean

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size339.0 B
True
207 
ValueCountFrequency (%)
True 207
100.0%
2023-12-13T08:13:10.414801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct12
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
Minimum2018-02-14 00:00:00
Maximum2019-02-26 00:00:00
2023-12-13T08:13:10.496014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:13:10.606126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
Distinct12
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
Minimum2018-04-05 00:00:00
Maximum2019-02-26 00:00:00
2023-12-13T08:13:10.731073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:13:10.862952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)

여행큐레이션노출여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size339.0 B
True
139 
False
68 
ValueCountFrequency (%)
True 139
67.1%
False 68
32.9%
2023-12-13T08:13:10.967148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

주요태그노출여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size339.0 B
False
157 
True
50 
ValueCountFrequency (%)
False 157
75.8%
True 50
 
24.2%
2023-12-13T08:13:11.076231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-13T08:13:07.301689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:13:06.692364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:13:06.980480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:13:07.386259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:13:06.779640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:13:07.095263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:13:07.499018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:13:06.874007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:13:07.215621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:13:11.165111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
태그아이디언어상위태그아이디깊이정렬순서등록일시수정일시여행큐레이션노출여부주요태그노출여부
태그아이디1.0000.9060.8200.3750.5990.8300.7290.3650.414
언어0.9061.0000.9820.0000.2810.9210.9190.0000.000
상위태그아이디0.8200.9821.0000.9460.3530.7910.7520.1720.335
깊이0.3750.0000.9461.0000.4970.3420.0000.5411.000
정렬순서0.5990.2810.3530.4971.0000.5300.0000.3230.470
등록일시0.8300.9210.7910.3420.5301.0000.8640.2060.265
수정일시0.7290.9190.7520.0000.0000.8641.0000.2490.000
여행큐레이션노출여부0.3650.0000.1720.5410.3230.2060.2491.0000.948
주요태그노출여부0.4140.0000.3351.0000.4700.2650.0000.9481.000
2023-12-13T08:13:11.659659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주요태그노출여부깊이여행큐레이션노출여부언어
주요태그노출여부1.0000.9980.7940.000
깊이0.9981.0000.8060.000
여행큐레이션노출여부0.7940.8061.0000.000
언어0.0000.0000.0001.000
2023-12-13T08:13:11.794699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
태그아이디상위태그아이디정렬순서언어깊이여행큐레이션노출여부주요태그노출여부
태그아이디1.0000.540-0.1760.7610.2390.2740.311
상위태그아이디0.5401.000-0.1450.8810.7160.1680.329
정렬순서-0.176-0.1451.0000.0000.3270.2440.356
언어0.7610.8810.0001.0000.0000.0000.000
깊이0.2390.7160.3270.0001.0000.8060.998
여행큐레이션노출여부0.2740.1680.2440.0000.8061.0000.794
주요태그노출여부0.3110.3290.3560.0000.9980.7941.000

Missing values

2023-12-13T08:13:07.631467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:13:07.841473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

태그아이디언어상위태그아이디태그명깊이정렬순서사용여부등록일시수정일시여행큐레이션노출여부주요태그노출여부
0347국문0컨셉16y2018-05-312018-05-31nn
1351영문140Four seasons20y2018-05-312018-05-31yn
2352중문간체22729y2018-05-312018-05-31yn
3353중문간체227210y2018-05-312018-05-31yn
4354중문간체2204.330y2018-05-312018-05-31ny
5355일문19520y2018-05-312018-05-31yn
6356일문195友達20y2018-05-312018-05-31yn
7357일문195カップル20y2018-05-312018-05-31yn
8358일문195ひとり20y2018-05-312018-05-31yn
9359일문195子供20y2018-05-312018-05-31yn
태그아이디언어상위태그아이디태그명깊이정렬순서사용여부등록일시수정일시여행큐레이션노출여부주요태그노출여부
197281중문번체279中/老年21y2019-02-112019-02-26nn
198282중문번체279老年22y2019-02-112019-02-11yn
199288중문번체259室內32y2019-02-112019-02-26ny
200289중문번체259日出33y2019-02-112019-02-26ny
201292중문번체259偶來小路36y2019-02-112019-02-26ny
202293중문번체259小火山37y2019-02-112019-02-26ny
203294중문번체259海邊38y2019-02-112019-02-26ny
204298말레이문296Musim Panas22y2019-02-112019-02-11yn
205299말레이문296Musim Luruh23y2019-02-112019-02-11yn
206300말레이문296Musim Sejuk24y2019-02-112019-02-11yn