Overview

Dataset statistics

Number of variables5
Number of observations2785
Missing cells0
Missing cells (%)0.0%
Duplicate rows302
Duplicate rows (%)10.8%
Total size in memory114.4 KiB
Average record size in memory42.0 B

Variable types

Text1
Numeric2
Categorical2

Dataset

Description전라남도 공간정보 통합 플랫폼의 관광 관련 정보입니다. 관광지별 위경도 정보와 테마 등의 정보를 조회하실 수 있습니다.
URLhttps://www.data.go.kr/data/15122190/fileData.do

Alerts

Dataset has 302 (10.8%) duplicate rowsDuplicates
실내구분 is highly imbalanced (67.7%)Imbalance

Reproduction

Analysis started2023-12-12 16:11:59.494719
Analysis finished2023-12-12 16:12:00.939171
Duration1.44 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct2104
Distinct (%)75.5%
Missing0
Missing (%)0.0%
Memory size21.9 KiB
2023-12-13T01:12:01.176760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length21
Mean length8.9048474
Min length3

Characters and Unicode

Total characters24800
Distinct characters611
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1695 ?
Unique (%)60.9%

Sample

1st row통영_세병관통제영지_
2nd row통영_충렬사
3rd row통영_해저터널
4th row통영_착량묘
5th row통영_도남관광지
ValueCountFrequency (%)
자연휴양림 11
 
0.3%
전주_한옥마을 10
 
0.3%
해수욕장 9
 
0.3%
전망대 9
 
0.3%
광주_국립5·18민주묘지 9
 
0.3%
광주_시립민속박물관 8
 
0.2%
통영_제승당 8
 
0.2%
거제_해금강 8
 
0.2%
광주_전통문화관 8
 
0.2%
김해_국립김해박물관 8
 
0.2%
Other values (2390) 3261
97.4%
2023-12-13T01:12:01.630169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 2916
 
11.8%
681
 
2.7%
659
 
2.7%
654
 
2.6%
433
 
1.7%
397
 
1.6%
397
 
1.6%
394
 
1.6%
358
 
1.4%
350
 
1.4%
Other values (601) 17561
70.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20978
84.6%
Connector Punctuation 2916
 
11.8%
Space Separator 681
 
2.7%
Decimal Number 98
 
0.4%
Uppercase Letter 63
 
0.3%
Other Punctuation 41
 
0.2%
Lowercase Letter 20
 
0.1%
Dash Punctuation 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
659
 
3.1%
654
 
3.1%
433
 
2.1%
397
 
1.9%
397
 
1.9%
394
 
1.9%
358
 
1.7%
350
 
1.7%
348
 
1.7%
329
 
1.6%
Other values (558) 16659
79.4%
Uppercase Letter
ValueCountFrequency (%)
F 10
15.9%
T 9
14.3%
K 6
9.5%
I 5
7.9%
C 4
 
6.3%
B 4
 
6.3%
N 4
 
6.3%
S 3
 
4.8%
G 3
 
4.8%
M 3
 
4.8%
Other values (8) 12
19.0%
Lowercase Letter
ValueCountFrequency (%)
u 3
15.0%
t 2
10.0%
s 2
10.0%
e 2
10.0%
r 2
10.0%
o 2
10.0%
f 2
10.0%
b 2
10.0%
y 2
10.0%
m 1
 
5.0%
Decimal Number
ValueCountFrequency (%)
1 27
27.6%
5 17
17.3%
8 17
17.3%
3 16
16.3%
2 9
 
9.2%
4 6
 
6.1%
6 4
 
4.1%
0 2
 
2.0%
Other Punctuation
ValueCountFrequency (%)
· 24
58.5%
& 12
29.3%
. 5
 
12.2%
Connector Punctuation
ValueCountFrequency (%)
_ 2916
100.0%
Space Separator
ValueCountFrequency (%)
681
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20974
84.6%
Common 3739
 
15.1%
Latin 83
 
0.3%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
659
 
3.1%
654
 
3.1%
433
 
2.1%
397
 
1.9%
397
 
1.9%
394
 
1.9%
358
 
1.7%
350
 
1.7%
348
 
1.7%
329
 
1.6%
Other values (555) 16655
79.4%
Latin
ValueCountFrequency (%)
F 10
 
12.0%
T 9
 
10.8%
K 6
 
7.2%
I 5
 
6.0%
C 4
 
4.8%
B 4
 
4.8%
N 4
 
4.8%
u 3
 
3.6%
S 3
 
3.6%
G 3
 
3.6%
Other values (18) 32
38.6%
Common
ValueCountFrequency (%)
_ 2916
78.0%
681
 
18.2%
1 27
 
0.7%
· 24
 
0.6%
5 17
 
0.5%
8 17
 
0.5%
3 16
 
0.4%
& 12
 
0.3%
2 9
 
0.2%
4 6
 
0.2%
Other values (5) 14
 
0.4%
Han
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20974
84.6%
ASCII 3798
 
15.3%
None 24
 
0.1%
CJK 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 2916
76.8%
681
 
17.9%
1 27
 
0.7%
5 17
 
0.4%
8 17
 
0.4%
3 16
 
0.4%
& 12
 
0.3%
F 10
 
0.3%
2 9
 
0.2%
T 9
 
0.2%
Other values (32) 84
 
2.2%
Hangul
ValueCountFrequency (%)
659
 
3.1%
654
 
3.1%
433
 
2.1%
397
 
1.9%
397
 
1.9%
394
 
1.9%
358
 
1.7%
350
 
1.7%
348
 
1.7%
329
 
1.6%
Other values (555) 16655
79.4%
None
ValueCountFrequency (%)
· 24
100.0%
CJK
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

경도
Real number (ℝ)

Distinct2175
Distinct (%)78.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.76621
Minimum124.63418
Maximum131.85972
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size24.6 KiB
2023-12-13T01:12:01.787938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum124.63418
5-th percentile126.42079
Q1126.92674
median127.7409
Q3128.58147
95-th percentile129.18254
Maximum131.85972
Range7.225543
Interquartile range (IQR)1.654731

Descriptive statistics

Standard deviation0.94001288
Coefficient of variation (CV)0.0073572887
Kurtosis-0.57974324
Mean127.76621
Median Absolute Deviation (MAD)0.8255452
Skewness0.15986896
Sum355828.89
Variance0.88362422
MonotonicityNot monotonic
2023-12-13T01:12:01.931986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.9404829 9
 
0.3%
128.6735 9
 
0.3%
126.9523632 8
 
0.3%
126.8885119 8
 
0.3%
128.873502 8
 
0.3%
127.146416 8
 
0.3%
126.9555142 7
 
0.3%
128.472291 7
 
0.3%
128.076013 7
 
0.3%
127.0014328 6
 
0.2%
Other values (2165) 2708
97.2%
ValueCountFrequency (%)
124.63418 1
< 0.1%
124.698629 1
< 0.1%
124.713848 1
< 0.1%
124.723704 1
< 0.1%
124.730989 1
< 0.1%
125.827461 1
< 0.1%
125.844488 1
< 0.1%
125.895341 1
< 0.1%
125.901859 1
< 0.1%
125.938087 1
< 0.1%
ValueCountFrequency (%)
131.859723 1
 
< 0.1%
130.937683 1
 
< 0.1%
130.920326 1
 
< 0.1%
130.909956 6
0.2%
130.908649 1
 
< 0.1%
130.900928 1
 
< 0.1%
130.899223 1
 
< 0.1%
130.88354 1
 
< 0.1%
130.866153 1
 
< 0.1%
129.566653 1
 
< 0.1%

위도
Real number (ℝ)

Distinct2171
Distinct (%)78.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.83793
Minimum33.207767
Maximum38.514711
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size24.6 KiB
2023-12-13T01:12:02.110138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33.207767
5-th percentile33.387826
Q135.11493
median35.760939
Q336.785731
95-th percentile37.788491
Maximum38.514711
Range5.3069437
Interquartile range (IQR)1.6708014

Descriptive statistics

Standard deviation1.27869
Coefficient of variation (CV)0.035679796
Kurtosis-0.55563168
Mean35.83793
Median Absolute Deviation (MAD)0.855841
Skewness-0.17907201
Sum99808.634
Variance1.6350482
MonotonicityNot monotonic
2023-12-13T01:12:02.293756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
34.737714 9
 
0.3%
35.23543943 9
 
0.3%
35.242827 8
 
0.3%
35.18328296 8
 
0.3%
35.13357345 8
 
0.3%
35.824768 8
 
0.3%
35.13395322 7
 
0.3%
34.793922 7
 
0.3%
35.188478 7
 
0.3%
35.227134 6
 
0.2%
Other values (2161) 2708
97.2%
ValueCountFrequency (%)
33.20776729 1
< 0.1%
33.22234037 1
< 0.1%
33.22261902 2
0.1%
33.2296628 1
< 0.1%
33.23007721 1
< 0.1%
33.23169414 2
0.1%
33.23535816 1
< 0.1%
33.23565139 1
< 0.1%
33.2362215 1
< 0.1%
33.23711594 2
0.1%
ValueCountFrequency (%)
38.514711 1
 
< 0.1%
38.5145523 3
0.1%
38.4820331 1
 
< 0.1%
38.46362424 1
 
< 0.1%
38.44764887 1
 
< 0.1%
38.4028797 1
 
< 0.1%
38.3957417 1
 
< 0.1%
38.35727077 1
 
< 0.1%
38.34397674 1
 
< 0.1%
38.34111051 1
 
< 0.1%

실내구분
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size21.9 KiB
실외
2335 
실내
448 
살내
 
1
 
1

Length

Max length2
Median length2
Mean length1.9996409
Min length1

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row실외
2nd row실외
3rd row실외
4th row실외
5th row실외

Common Values

ValueCountFrequency (%)
실외 2335
83.8%
실내 448
 
16.1%
살내 1
 
< 0.1%
1
 
< 0.1%

Length

2023-12-13T01:12:02.455290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:12:02.589011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
실외 2335
83.8%
실내 448
 
16.1%
살내 1
 
< 0.1%
1
 
< 0.1%

테마명
Categorical

Distinct6
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size21.9 KiB
자연_힐링
1221 
종교_역사_전통
813 
체험_학습_산업
319 
문화_예술
173 
캠핑_스포츠
161 

Length

Max length8
Median length5
Mean length6.2771993
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row종교_역사_전통
2nd row종교_역사_전통
3rd row체험_학습_산업
4th row종교_역사_전통
5th row자연_힐링

Common Values

ValueCountFrequency (%)
자연_힐링 1221
43.8%
종교_역사_전통 813
29.2%
체험_학습_산업 319
 
11.5%
문화_예술 173
 
6.2%
캠핑_스포츠 161
 
5.8%
쇼핑_놀이 98
 
3.5%

Length

2023-12-13T01:12:02.728751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:12:02.870337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자연_힐링 1221
43.8%
종교_역사_전통 813
29.2%
체험_학습_산업 319
 
11.5%
문화_예술 173
 
6.2%
캠핑_스포츠 161
 
5.8%
쇼핑_놀이 98
 
3.5%

Interactions

2023-12-13T01:12:00.546396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:12:00.301274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:12:00.657407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:12:00.436340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:12:03.009151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
경도위도실내구분테마명
경도1.0000.5140.0960.190
위도0.5141.0000.4170.305
실내구분0.0960.4171.0000.324
테마명0.1900.3050.3241.000
2023-12-13T01:12:03.139305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
테마명실내구분
테마명1.0000.214
실내구분0.2141.000
2023-12-13T01:12:03.237695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
경도위도실내구분테마명
경도1.0000.1910.0610.095
위도0.1911.0000.2610.165
실내구분0.0610.2611.0000.214
테마명0.0950.1650.2141.000

Missing values

2023-12-13T01:12:00.787638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:12:00.896197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

관광지명경도위도실내구분테마명
0통영_세병관통제영지_128.42323834.847749실외종교_역사_전통
1통영_충렬사128.41784734.846626실외종교_역사_전통
2통영_해저터널128.40990834.834504실외체험_학습_산업
3통영_착량묘128.41055834.835804실외종교_역사_전통
4통영_도남관광지128.43276634.828362실외자연_힐링
5통영_전혁림미술관128.41515734.826556실내문화_예술
6통영_중앙활어시장128.42427934.845701실외쇼핑_놀이
7통영_남망산국제조각공원128.42971534.841168실외문화_예술
8통영_도남관광지128.43276634.828362실외자연_힐링
9통영_한산도 제승당128.47229134.793922실외종교_역사_전통
관광지명경도위도실내구분테마명
2775해남_미황사126.57755234.382703실외종교_역사_전통
2776해남_녹우당126.62261334.55123실외자연_힐링
2777영암_왕인박사유적지126.6331634.755387실외종교_역사_전통
2778나주_반남고분군126.64733734.912172실외종교_역사_전통
2779나주_나주천연염색박물관126.66505734.999032실내체험_학습_산업
2780광주_빛고을공예창작촌126.86608235.085762실외체험_학습_산업
2781광주_포충사126.84862135.08979실외종교_역사_전통
2782광주_국립5·18민주묘지126.94048335.235439실외종교_역사_전통
2783담양_죽녹원126.98646835.32823실외자연_힐링
2784담양_메타세퀘이어길127.00201535.323443실외자연_힐링

Duplicate rows

Most frequently occurring

관광지명경도위도실내구분테마명# duplicates
36광주_국립5·18민주묘지126.94048335.235439실외종교_역사_전통9
19거제_해금강128.673534.737714실외자연_힐링8
45광주_시립민속박물관126.88851235.183283실내종교_역사_전통8
52광주_전통문화관126.95236335.133573실외종교_역사_전통8
59김해_국립김해박물관128.87350235.242827실내종교_역사_전통8
53광주_증심사126.95551435.133953실외종교_역사_전통7
212전주_한옥마을127.14641635.824768실외종교_역사_전통7
241진주_진주성128.07601335.188478실외종교_역사_전통7
37광주_국립광주박물관126.88395335.189657실내종교_역사_전통6
17거제_외도해상농원128.7120234.769069실외자연_힐링5