Overview

Dataset statistics

Number of variables5
Number of observations4188
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory167.8 KiB
Average record size in memory41.0 B

Variable types

Numeric1
Text2
Categorical2

Dataset

Description제주특별자치도 관내 관광지의 다국어, 음성 등의 기준으로 구분되어 기재되어있는 제주특별자치도 관광지의 음성 현황에 대한 정보를 제공합니다.
Author제주특별자치도
URLhttps://www.data.go.kr/data/15111743/fileData.do

Alerts

관광지 분야 has constant value ""Constant
연번 is highly overall correlated with 언어High correlation
언어 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
관광지명 has unique valuesUnique
관광지 안내 음성 (URL) has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:21:39.795580
Analysis finished2023-12-12 04:21:41.331172
Duration1.54 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct4188
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2094.5
Minimum1
Maximum4188
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size36.9 KiB
2023-12-12T13:21:41.432234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile210.35
Q11047.75
median2094.5
Q33141.25
95-th percentile3978.65
Maximum4188
Range4187
Interquartile range (IQR)2093.5

Descriptive statistics

Standard deviation1209.1158
Coefficient of variation (CV)0.57728135
Kurtosis-1.2
Mean2094.5
Median Absolute Deviation (MAD)1047
Skewness0
Sum8771766
Variance1461961
MonotonicityStrictly increasing
2023-12-12T13:21:41.614797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
2784 1
 
< 0.1%
2786 1
 
< 0.1%
2787 1
 
< 0.1%
2788 1
 
< 0.1%
2789 1
 
< 0.1%
2790 1
 
< 0.1%
2791 1
 
< 0.1%
2792 1
 
< 0.1%
2793 1
 
< 0.1%
Other values (4178) 4178
99.8%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
4188 1
< 0.1%
4187 1
< 0.1%
4186 1
< 0.1%
4185 1
< 0.1%
4184 1
< 0.1%
4183 1
< 0.1%
4182 1
< 0.1%
4181 1
< 0.1%
4180 1
< 0.1%
4179 1
< 0.1%

관광지명
Text

UNIQUE 

Distinct4188
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size32.8 KiB
2023-12-12T13:21:41.994931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length35
Mean length19.709885
Min length13

Characters and Unicode

Total characters82545
Distinct characters594
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4188 ?
Unique (%)100.0%

Sample

1st row1_kor_1100고지(휴게소).mp3
2nd row2_kor_1100고지습지.mp3
3rd row3_kor_1112도로.mp3
4th row4_kor_4.3해원방사탑.mp3
5th row5_kor_5.16도로숲터널.mp3
ValueCountFrequency (%)
지질트레일 32
 
0.6%
길.mp3 24
 
0.5%
옛터.mp3 24
 
0.5%
순례길 20
 
0.4%
축제.mp3 16
 
0.3%
16
 
0.3%
밭담길.mp3 12
 
0.2%
용머리해안 12
 
0.2%
해안도로.mp3 12
 
0.2%
절로 12
 
0.2%
Other values (4408) 5124
96.6%
2023-12-12T13:21:42.578955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 8376
 
10.1%
3 6499
 
7.9%
p 5235
 
6.3%
. 4240
 
5.1%
m 4188
 
5.1%
1 2472
 
3.0%
2 2291
 
2.8%
n 2102
 
2.5%
4 1492
 
1.8%
5 1267
 
1.5%
Other values (584) 44383
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 26304
31.9%
Lowercase Letter 20968
25.4%
Decimal Number 20281
24.6%
Connector Punctuation 8376
 
10.1%
Other Punctuation 4344
 
5.3%
Space Separator 1124
 
1.4%
Open Punctuation 460
 
0.6%
Close Punctuation 460
 
0.6%
Uppercase Letter 132
 
0.2%
Dash Punctuation 88
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
756
 
2.9%
748
 
2.8%
652
 
2.5%
600
 
2.3%
560
 
2.1%
452
 
1.7%
440
 
1.7%
400
 
1.5%
396
 
1.5%
372
 
1.4%
Other values (538) 20928
79.6%
Uppercase Letter
ValueCountFrequency (%)
E 24
18.2%
B 16
12.1%
A 16
12.1%
W 12
9.1%
S 12
9.1%
C 8
 
6.1%
D 8
 
6.1%
L 8
 
6.1%
N 4
 
3.0%
R 4
 
3.0%
Other values (5) 20
15.2%
Lowercase Letter
ValueCountFrequency (%)
p 5235
25.0%
m 4188
20.0%
n 2102
10.0%
i 1059
 
5.1%
c 1055
 
5.0%
e 1047
 
5.0%
h 1047
 
5.0%
j 1047
 
5.0%
a 1047
 
5.0%
r 1047
 
5.0%
Other values (2) 2094
 
10.0%
Decimal Number
ValueCountFrequency (%)
3 6499
32.0%
1 2472
 
12.2%
2 2291
 
11.3%
4 1492
 
7.4%
5 1267
 
6.2%
0 1260
 
6.2%
6 1255
 
6.2%
8 1254
 
6.2%
7 1251
 
6.2%
9 1240
 
6.1%
Other Punctuation
ValueCountFrequency (%)
. 4240
97.6%
, 64
 
1.5%
& 40
 
0.9%
Connector Punctuation
ValueCountFrequency (%)
_ 8376
100.0%
Space Separator
ValueCountFrequency (%)
1124
100.0%
Open Punctuation
ValueCountFrequency (%)
( 460
100.0%
Close Punctuation
ValueCountFrequency (%)
) 460
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 88
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 35141
42.6%
Hangul 26304
31.9%
Latin 21100
25.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
756
 
2.9%
748
 
2.8%
652
 
2.5%
600
 
2.3%
560
 
2.1%
452
 
1.7%
440
 
1.7%
400
 
1.5%
396
 
1.5%
372
 
1.4%
Other values (538) 20928
79.6%
Latin
ValueCountFrequency (%)
p 5235
24.8%
m 4188
19.8%
n 2102
10.0%
i 1059
 
5.0%
c 1055
 
5.0%
e 1047
 
5.0%
h 1047
 
5.0%
j 1047
 
5.0%
a 1047
 
5.0%
r 1047
 
5.0%
Other values (17) 2226
10.5%
Common
ValueCountFrequency (%)
_ 8376
23.8%
3 6499
18.5%
. 4240
12.1%
1 2472
 
7.0%
2 2291
 
6.5%
4 1492
 
4.2%
5 1267
 
3.6%
0 1260
 
3.6%
6 1255
 
3.6%
8 1254
 
3.6%
Other values (9) 4735
13.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 56241
68.1%
Hangul 26300
31.9%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 8376
14.9%
3 6499
 
11.6%
p 5235
 
9.3%
. 4240
 
7.5%
m 4188
 
7.4%
1 2472
 
4.4%
2 2291
 
4.1%
n 2102
 
3.7%
4 1492
 
2.7%
5 1267
 
2.3%
Other values (36) 18079
32.1%
Hangul
ValueCountFrequency (%)
756
 
2.9%
748
 
2.8%
652
 
2.5%
600
 
2.3%
560
 
2.1%
452
 
1.7%
440
 
1.7%
400
 
1.5%
396
 
1.5%
372
 
1.4%
Other values (537) 20924
79.6%
Compat Jamo
ValueCountFrequency (%)
4
100.0%

관광지 분야
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size32.8 KiB
관광지
4188 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row관광지
2nd row관광지
3rd row관광지
4th row관광지
5th row관광지

Common Values

ValueCountFrequency (%)
관광지 4188
100.0%

Length

2023-12-12T13:21:42.762313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:21:42.887664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
관광지 4188
100.0%

언어
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size32.8 KiB
한국어
1047 
중국어
1047 
영어
1047 
일본어
1047 

Length

Max length3
Median length3
Mean length2.75
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row한국어
2nd row한국어
3rd row한국어
4th row한국어
5th row한국어

Common Values

ValueCountFrequency (%)
한국어 1047
25.0%
중국어 1047
25.0%
영어 1047
25.0%
일본어 1047
25.0%

Length

2023-12-12T13:21:43.023619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:21:43.198581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한국어 1047
25.0%
중국어 1047
25.0%
영어 1047
25.0%
일본어 1047
25.0%
Distinct4188
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size32.8 KiB
2023-12-12T13:21:43.528339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length85
Median length81
Mean length64.707975
Min length54

Characters and Unicode

Total characters270997
Distinct characters601
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4188 ?
Unique (%)100.0%

Sample

1st rowhttp://api.brandcontents.or.kr/jejuVoice/kor/1_kor_1100고지(휴게소).mp3
2nd rowhttp://api.brandcontents.or.kr/jejuVoice/kor/2_kor_1100고지습지.mp3
3rd rowhttp://api.brandcontents.or.kr/jejuVoice/kor/3_kor_1112도로.mp3
4th rowhttp://api.brandcontents.or.kr/jejuVoice/kor/4_kor_4.3해원방사탑.mp3
5th rowhttp://api.brandcontents.or.kr/jejuVoice/kor/5_kor_5.16도로숲터널.mp3
ValueCountFrequency (%)
지질트레일 32
 
0.6%
길.mp3 24
 
0.5%
옛터.mp3 24
 
0.5%
순례길 20
 
0.4%
축제.mp3 16
 
0.3%
16
 
0.3%
밭담길.mp3 12
 
0.2%
용머리해안 12
 
0.2%
해안도로.mp3 12
 
0.2%
절로 12
 
0.2%
Other values (4408) 5124
96.6%
2023-12-12T13:21:44.079976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 20940
 
7.7%
. 16802
 
6.2%
n 16760
 
6.2%
t 16752
 
6.2%
e 14658
 
5.4%
o 14658
 
5.4%
r 14658
 
5.4%
p 14656
 
5.4%
i 10482
 
3.9%
c 10478
 
3.9%
Other values (591) 120153
44.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 167544
61.8%
Other Punctuation 42034
 
15.5%
Other Letter 26304
 
9.7%
Decimal Number 20279
 
7.5%
Connector Punctuation 8376
 
3.1%
Uppercase Letter 4320
 
1.6%
Space Separator 1124
 
0.4%
Close Punctuation 460
 
0.2%
Open Punctuation 460
 
0.2%
Dash Punctuation 88
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
756
 
2.9%
748
 
2.8%
652
 
2.5%
600
 
2.3%
560
 
2.1%
452
 
1.7%
440
 
1.7%
400
 
1.5%
396
 
1.5%
372
 
1.4%
Other values (538) 20928
79.6%
Lowercase Letter
ValueCountFrequency (%)
n 16760
10.0%
t 16752
10.0%
e 14658
8.7%
o 14658
8.7%
r 14658
8.7%
p 14656
8.7%
i 10482
 
6.3%
c 10478
 
6.3%
a 10470
 
6.2%
j 10470
 
6.2%
Other values (7) 33502
20.0%
Uppercase Letter
ValueCountFrequency (%)
V 4192
97.0%
E 24
 
0.6%
B 16
 
0.4%
A 16
 
0.4%
S 12
 
0.3%
W 12
 
0.3%
C 8
 
0.2%
L 8
 
0.2%
D 8
 
0.2%
N 4
 
0.1%
Other values (5) 20
 
0.5%
Decimal Number
ValueCountFrequency (%)
3 6497
32.0%
1 2472
 
12.2%
2 2291
 
11.3%
4 1492
 
7.4%
5 1267
 
6.2%
0 1260
 
6.2%
6 1255
 
6.2%
8 1254
 
6.2%
7 1251
 
6.2%
9 1240
 
6.1%
Other Punctuation
ValueCountFrequency (%)
/ 20940
49.8%
. 16802
40.0%
: 4188
 
10.0%
, 64
 
0.2%
& 40
 
0.1%
Connector Punctuation
ValueCountFrequency (%)
_ 8376
100.0%
Space Separator
ValueCountFrequency (%)
1124
100.0%
Close Punctuation
ValueCountFrequency (%)
) 460
100.0%
Open Punctuation
ValueCountFrequency (%)
( 460
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 88
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 171864
63.4%
Common 72829
26.9%
Hangul 26304
 
9.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
756
 
2.9%
748
 
2.8%
652
 
2.5%
600
 
2.3%
560
 
2.1%
452
 
1.7%
440
 
1.7%
400
 
1.5%
396
 
1.5%
372
 
1.4%
Other values (538) 20928
79.6%
Latin
ValueCountFrequency (%)
n 16760
9.8%
t 16752
9.7%
e 14658
 
8.5%
o 14658
 
8.5%
r 14658
 
8.5%
p 14656
 
8.5%
i 10482
 
6.1%
c 10478
 
6.1%
a 10470
 
6.1%
j 10470
 
6.1%
Other values (22) 37822
22.0%
Common
ValueCountFrequency (%)
/ 20940
28.8%
. 16802
23.1%
_ 8376
 
11.5%
3 6497
 
8.9%
: 4188
 
5.8%
1 2472
 
3.4%
2 2291
 
3.1%
4 1492
 
2.0%
5 1267
 
1.7%
0 1260
 
1.7%
Other values (11) 7244
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 244693
90.3%
Hangul 26300
 
9.7%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 20940
 
8.6%
. 16802
 
6.9%
n 16760
 
6.8%
t 16752
 
6.8%
e 14658
 
6.0%
o 14658
 
6.0%
r 14658
 
6.0%
p 14656
 
6.0%
i 10482
 
4.3%
c 10478
 
4.3%
Other values (43) 93849
38.4%
Hangul
ValueCountFrequency (%)
756
 
2.9%
748
 
2.8%
652
 
2.5%
600
 
2.3%
560
 
2.1%
452
 
1.7%
440
 
1.7%
400
 
1.5%
396
 
1.5%
372
 
1.4%
Other values (537) 20924
79.6%
Compat Jamo
ValueCountFrequency (%)
4
100.0%

Interactions

2023-12-12T13:21:40.990257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:21:44.181493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번언어
연번1.0000.980
언어0.9801.000
2023-12-12T13:21:44.284655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번언어
연번1.0000.930
언어0.9301.000

Missing values

2023-12-12T13:21:41.135986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:21:41.276642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번관광지명관광지 분야언어관광지 안내 음성 (URL)
011_kor_1100고지(휴게소).mp3관광지한국어http://api.brandcontents.or.kr/jejuVoice/kor/1_kor_1100고지(휴게소).mp3
122_kor_1100고지습지.mp3관광지한국어http://api.brandcontents.or.kr/jejuVoice/kor/2_kor_1100고지습지.mp3
233_kor_1112도로.mp3관광지한국어http://api.brandcontents.or.kr/jejuVoice/kor/3_kor_1112도로.mp3
344_kor_4.3해원방사탑.mp3관광지한국어http://api.brandcontents.or.kr/jejuVoice/kor/4_kor_4.3해원방사탑.mp3
455_kor_5.16도로숲터널.mp3관광지한국어http://api.brandcontents.or.kr/jejuVoice/kor/5_kor_5.16도로숲터널.mp3
566_kor_9연대 본부 옛터.mp3관광지한국어http://api.brandcontents.or.kr/jejuVoice/kor/6_kor_9연대 본부 옛터.mp3
677_kor_SM디지털아트뮤지엄.mp3관광지한국어http://api.brandcontents.or.kr/jejuVoice/kor/7_kor_SM디지털아트뮤지엄.mp3
788_kor_SOS박물관.mp3관광지한국어http://api.brandcontents.or.kr/jejuVoice/kor/8_kor_SOS박물관.mp3
899_kor_THE WE.mp3관광지한국어http://api.brandcontents.or.kr/jejuVoice/kor/9_kor_THE WE.mp3
91010_kor_WE호텔 웰니스센터.mp3관광지한국어http://api.brandcontents.or.kr/jejuVoice/kor/10_kor_WE호텔 웰니스센터.mp3
연번관광지명관광지 분야언어관광지 안내 음성 (URL)
417841794179_jap_황금굴(한림공원).mp3관광지일본어http://api.brandcontents.or.kr/jejuVoice/jap/4179_jap_황금굴(한림공원).mp3
417941804180_jap_황우럭만화천국사회적협동조함.mp3관광지일본어http://api.brandcontents.or.kr/jejuVoice/jap/4180_jap_황우럭만화천국사회적협동조함.mp3
418041814181_jap_황우지해안.mp3관광지일본어http://api.brandcontents.or.kr/jejuVoice/jap/4181_jap_황우지해안.mp3
418141824182_jap_황우지해안 열두굴.mp3관광지일본어http://api.brandcontents.or.kr/jejuVoice/jap/4182_jap_황우지해안 열두굴.mp3
418241834183_jap_효명사.mp3관광지일본어http://api.brandcontents.or.kr/jejuVoice/jap/4183_jap_효명사.mp3
418341844184_jap_후포해변.mp3관광지일본어http://api.brandcontents.or.kr/jejuVoice/jap/4184_jap_후포해변.mp3
418441854185_jap_훈데르트바서파크.mp3관광지일본어http://api.brandcontents.or.kr/jejuVoice/jap/4185_jap_훈데르트바서파크.mp3
418541864186_jap_휴애리 매화축제.mp3관광지일본어http://api.brandcontents.or.kr/jejuVoice/jap/4186_jap_휴애리 매화축제.mp3
418641874187_jap_휴애리 자연생활공원.mp3관광지일본어http://api.brandcontents.or.kr/jejuVoice/jap/4187_jap_휴애리 자연생활공원.mp3
418741884188_jap_흙불은오름.mp3관광지일본어http://api.brandcontents.or.kr/jejuVoice/jap/4188_jap_흙불은오름.mp3