Overview

Dataset statistics

Number of variables8
Number of observations55
Missing cells45
Missing cells (%)10.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.7 KiB
Average record size in memory69.3 B

Variable types

Numeric3
Text3
Categorical1
DateTime1

Dataset

Description충청남도 공주시 주요관광지 현황에 대한 데이터로 컬럼은 관광지명, 주소, 위도, 경도, 유네스코 세계유산 등재여부, 비고, 데이터기준일 이 포함되어 있음
Author공공데이터포털
URLhttps://www.data.go.kr/data/15119151/fileData.do

Alerts

데이터기준일 has constant value ""Constant
유네스코 세계문화유산 등재여부 is highly imbalanced (69.5%)Imbalance
비고 has 45 (81.8%) missing valuesMissing
번호 has unique valuesUnique
관광지명 has unique valuesUnique

Reproduction

Analysis started2024-04-21 14:43:29.864251
Analysis finished2024-04-21 14:43:33.237292
Duration3.37 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct55
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28
Minimum1
Maximum55
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size623.0 B
2024-04-21T23:43:33.647449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.7
Q114.5
median28
Q341.5
95-th percentile52.3
Maximum55
Range54
Interquartile range (IQR)27

Descriptive statistics

Standard deviation16.02082
Coefficient of variation (CV)0.57217214
Kurtosis-1.2
Mean28
Median Absolute Deviation (MAD)14
Skewness0
Sum1540
Variance256.66667
MonotonicityStrictly increasing
2024-04-21T23:43:34.081640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.8%
2 1
 
1.8%
31 1
 
1.8%
32 1
 
1.8%
33 1
 
1.8%
34 1
 
1.8%
35 1
 
1.8%
36 1
 
1.8%
37 1
 
1.8%
38 1
 
1.8%
Other values (45) 45
81.8%
ValueCountFrequency (%)
1 1
1.8%
2 1
1.8%
3 1
1.8%
4 1
1.8%
5 1
1.8%
6 1
1.8%
7 1
1.8%
8 1
1.8%
9 1
1.8%
10 1
1.8%
ValueCountFrequency (%)
55 1
1.8%
54 1
1.8%
53 1
1.8%
52 1
1.8%
51 1
1.8%
50 1
1.8%
49 1
1.8%
48 1
1.8%
47 1
1.8%
46 1
1.8%

관광지명
Text

UNIQUE 

Distinct55
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size568.0 B
2024-04-21T23:43:34.934290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length6.6727273
Min length2

Characters and Unicode

Total characters367
Distinct characters144
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)100.0%

Sample

1st row계룡산
2nd row갑사
3rd row금강
4th row공산성
5th row고마나루
ValueCountFrequency (%)
공주 5
 
6.7%
동학사 2
 
2.7%
계룡산 2
 
2.7%
지당자연사박물관 1
 
1.3%
이안숲속 1
 
1.3%
유허지 1
 
1.3%
의당 1
 
1.3%
메타세콰이어길 1
 
1.3%
신원사 1
 
1.3%
도예촌 1
 
1.3%
Other values (59) 59
78.7%
2024-04-21T23:43:36.198727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21
 
5.7%
17
 
4.6%
16
 
4.4%
13
 
3.5%
11
 
3.0%
9
 
2.5%
9
 
2.5%
7
 
1.9%
7
 
1.9%
7
 
1.9%
Other values (134) 250
68.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 340
92.6%
Space Separator 21
 
5.7%
Close Punctuation 3
 
0.8%
Open Punctuation 3
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
5.0%
16
 
4.7%
13
 
3.8%
11
 
3.2%
9
 
2.6%
9
 
2.6%
7
 
2.1%
7
 
2.1%
7
 
2.1%
6
 
1.8%
Other values (131) 238
70.0%
Space Separator
ValueCountFrequency (%)
21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 340
92.6%
Common 27
 
7.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
5.0%
16
 
4.7%
13
 
3.8%
11
 
3.2%
9
 
2.6%
9
 
2.6%
7
 
2.1%
7
 
2.1%
7
 
2.1%
6
 
1.8%
Other values (131) 238
70.0%
Common
ValueCountFrequency (%)
21
77.8%
) 3
 
11.1%
( 3
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 340
92.6%
ASCII 27
 
7.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
21
77.8%
) 3
 
11.1%
( 3
 
11.1%
Hangul
ValueCountFrequency (%)
17
 
5.0%
16
 
4.7%
13
 
3.8%
11
 
3.2%
9
 
2.6%
9
 
2.6%
7
 
2.1%
7
 
2.1%
7
 
2.1%
6
 
1.8%
Other values (131) 238
70.0%

주소
Text

Distinct52
Distinct (%)94.5%
Missing0
Missing (%)0.0%
Memory size568.0 B
2024-04-21T23:43:37.149210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length22
Mean length18.745455
Min length14

Characters and Unicode

Total characters1031
Distinct characters101
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)89.1%

Sample

1st row충청남도 공주시 반포면 동학사1로 327-6
2nd row충청남도 공주시 계룡면 갑사로 567-3
3rd row충청남도 공주시 금벽로 551(반포면~탄천면)
4th row충청남도 공주시 웅진로 280
5th row충청남도 공주시 백제큰길 2045
ValueCountFrequency (%)
충청남도 55
23.9%
공주시 55
23.9%
반포면 6
 
2.6%
의당면 5
 
2.2%
수원지공원길 4
 
1.7%
유구읍 4
 
1.7%
관광단지길 3
 
1.3%
왕릉로 3
 
1.3%
계룡면 3
 
1.3%
봉황로 2
 
0.9%
Other values (84) 90
39.1%
2024-04-21T23:43:38.509123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
175
17.0%
59
 
5.7%
56
 
5.4%
56
 
5.4%
56
 
5.4%
56
 
5.4%
55
 
5.3%
55
 
5.3%
1 31
 
3.0%
30
 
2.9%
Other values (91) 402
39.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 675
65.5%
Space Separator 175
 
17.0%
Decimal Number 161
 
15.6%
Dash Punctuation 17
 
1.6%
Open Punctuation 1
 
0.1%
Math Symbol 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
59
 
8.7%
56
 
8.3%
56
 
8.3%
56
 
8.3%
56
 
8.3%
55
 
8.1%
55
 
8.1%
30
 
4.4%
20
 
3.0%
17
 
2.5%
Other values (76) 215
31.9%
Decimal Number
ValueCountFrequency (%)
1 31
19.3%
2 28
17.4%
3 17
10.6%
0 15
9.3%
8 14
8.7%
4 13
8.1%
5 13
8.1%
7 11
 
6.8%
9 10
 
6.2%
6 9
 
5.6%
Space Separator
ValueCountFrequency (%)
175
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 675
65.5%
Common 356
34.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
59
 
8.7%
56
 
8.3%
56
 
8.3%
56
 
8.3%
56
 
8.3%
55
 
8.1%
55
 
8.1%
30
 
4.4%
20
 
3.0%
17
 
2.5%
Other values (76) 215
31.9%
Common
ValueCountFrequency (%)
175
49.2%
1 31
 
8.7%
2 28
 
7.9%
- 17
 
4.8%
3 17
 
4.8%
0 15
 
4.2%
8 14
 
3.9%
4 13
 
3.7%
5 13
 
3.7%
7 11
 
3.1%
Other values (5) 22
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 675
65.5%
ASCII 356
34.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
175
49.2%
1 31
 
8.7%
2 28
 
7.9%
- 17
 
4.8%
3 17
 
4.8%
0 15
 
4.2%
8 14
 
3.9%
4 13
 
3.7%
5 13
 
3.7%
7 11
 
3.1%
Other values (5) 22
 
6.2%
Hangul
ValueCountFrequency (%)
59
 
8.7%
56
 
8.3%
56
 
8.3%
56
 
8.3%
56
 
8.3%
55
 
8.1%
55
 
8.1%
30
 
4.4%
20
 
3.0%
17
 
2.5%
Other values (76) 215
31.9%

위도
Real number (ℝ)

Distinct51
Distinct (%)92.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.454975
Minimum36.324741
Maximum36.624617
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size623.0 B
2024-04-21T23:43:38.904678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.324741
5-th percentile36.355773
Q136.433402
median36.454413
Q336.467101
95-th percentile36.555651
Maximum36.624617
Range0.299876
Interquartile range (IQR)0.0336985

Descriptive statistics

Standard deviation0.060080779
Coefficient of variation (CV)0.0016480817
Kurtosis1.2954223
Mean36.454975
Median Absolute Deviation (MAD)0.018744
Skewness0.38878559
Sum2005.0236
Variance0.0036097
MonotonicityNot monotonic
2024-04-21T23:43:39.327700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
36.356884 2
 
3.6%
36.461293 2
 
3.6%
36.447565 2
 
3.6%
36.421765 2
 
3.6%
36.545107 1
 
1.8%
36.390008 1
 
1.8%
36.366461 1
 
1.8%
36.478553 1
 
1.8%
36.5544 1
 
1.8%
36.554894 1
 
1.8%
Other values (41) 41
74.5%
ValueCountFrequency (%)
36.324741 1
1.8%
36.335303 1
1.8%
36.353182 1
1.8%
36.356884 2
3.6%
36.365366 1
1.8%
36.366461 1
1.8%
36.390008 1
1.8%
36.396136 1
1.8%
36.420944 1
1.8%
36.421765 2
3.6%
ValueCountFrequency (%)
36.624617 1
1.8%
36.613005 1
1.8%
36.557416 1
1.8%
36.554894 1
1.8%
36.5544 1
1.8%
36.545107 1
1.8%
36.531232 1
1.8%
36.501318 1
1.8%
36.493809 1
1.8%
36.491406 1
1.8%

경도
Real number (ℝ)

Distinct51
Distinct (%)92.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.12333
Minimum126.95164
Maximum127.2478
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size623.0 B
2024-04-21T23:43:39.729017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.95164
5-th percentile126.96959
Q1127.11171
median127.12378
Q3127.1459
95-th percentile127.2232
Maximum127.2478
Range0.296163
Interquartile range (IQR)0.0341895

Descriptive statistics

Standard deviation0.063868607
Coefficient of variation (CV)0.0005024145
Kurtosis1.732621
Mean127.12333
Median Absolute Deviation (MAD)0.014697
Skewness-0.88831717
Sum6991.7834
Variance0.0040791989
MonotonicityNot monotonic
2024-04-21T23:43:40.067552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.232444 2
 
3.6%
127.113364 2
 
3.6%
127.189653 2
 
3.6%
127.120633 2
 
3.6%
127.155362 1
 
1.8%
127.215327 1
 
1.8%
127.247804 1
 
1.8%
127.042682 1
 
1.8%
126.956257 1
 
1.8%
126.951641 1
 
1.8%
Other values (41) 41
74.5%
ValueCountFrequency (%)
126.951641 1
1.8%
126.954933 1
1.8%
126.956257 1
1.8%
126.975298 1
1.8%
127.011608 1
1.8%
127.042682 1
1.8%
127.066957 1
1.8%
127.071556 1
1.8%
127.102603 1
1.8%
127.106327 1
1.8%
ValueCountFrequency (%)
127.247804 1
1.8%
127.232444 2
3.6%
127.219233 1
1.8%
127.215327 1
1.8%
127.202705 1
1.8%
127.189653 2
3.6%
127.187194 1
1.8%
127.18362 1
1.8%
127.172894 1
1.8%
127.155362 1
1.8%
Distinct2
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size568.0 B
52 
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
52
94.5%
3
 
5.5%

Length

2024-04-21T23:43:40.299295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T23:43:40.462398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
52
94.5%
3
 
5.5%

비고
Text

MISSING 

Distinct10
Distinct (%)100.0%
Missing45
Missing (%)81.8%
Memory size568.0 B
2024-04-21T23:43:40.943185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length4
Mean length4.1
Min length4

Characters and Unicode

Total characters41
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)100.0%

Sample

1st row공주1경
2nd row공주2경
3rd row공주3경
4th row공주4경
5th row공주5경
ValueCountFrequency (%)
공주1경 1
10.0%
공주2경 1
10.0%
공주3경 1
10.0%
공주4경 1
10.0%
공주5경 1
10.0%
공주6경 1
10.0%
공주7경 1
10.0%
공주8경 1
10.0%
공주9경 1
10.0%
공주10경 1
10.0%
2024-04-21T23:43:41.673103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
24.4%
10
24.4%
10
24.4%
1 2
 
4.9%
2 1
 
2.4%
3 1
 
2.4%
4 1
 
2.4%
5 1
 
2.4%
6 1
 
2.4%
7 1
 
2.4%
Other values (3) 3
 
7.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30
73.2%
Decimal Number 11
 
26.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2
18.2%
2 1
9.1%
3 1
9.1%
4 1
9.1%
5 1
9.1%
6 1
9.1%
7 1
9.1%
8 1
9.1%
9 1
9.1%
0 1
9.1%
Other Letter
ValueCountFrequency (%)
10
33.3%
10
33.3%
10
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 30
73.2%
Common 11
 
26.8%

Most frequent character per script

Common
ValueCountFrequency (%)
1 2
18.2%
2 1
9.1%
3 1
9.1%
4 1
9.1%
5 1
9.1%
6 1
9.1%
7 1
9.1%
8 1
9.1%
9 1
9.1%
0 1
9.1%
Hangul
ValueCountFrequency (%)
10
33.3%
10
33.3%
10
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 30
73.2%
ASCII 11
 
26.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10
33.3%
10
33.3%
10
33.3%
ASCII
ValueCountFrequency (%)
1 2
18.2%
2 1
9.1%
3 1
9.1%
4 1
9.1%
5 1
9.1%
6 1
9.1%
7 1
9.1%
8 1
9.1%
9 1
9.1%
0 1
9.1%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size568.0 B
Minimum2023-08-18 00:00:00
Maximum2023-08-18 00:00:00
2024-04-21T23:43:41.849098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:42.008705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-04-21T23:43:31.834323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:30.384996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:31.109876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:32.077555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:30.620277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:31.348425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:32.323015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:30.863404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T23:43:31.587369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T23:43:42.140258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호관광지명주소위도경도유네스코 세계문화유산 등재여부비고
번호1.0001.0000.7830.4230.0820.4131.000
관광지명1.0001.0001.0001.0001.0001.0001.000
주소0.7831.0001.0001.0001.0000.0001.000
위도0.4231.0001.0001.0000.8950.0001.000
경도0.0821.0001.0000.8951.0000.4801.000
유네스코 세계문화유산 등재여부0.4131.0000.0000.0000.4801.0001.000
비고1.0001.0001.0001.0001.0001.0001.000
2024-04-21T23:43:42.326473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호위도경도유네스코 세계문화유산 등재여부
번호1.0000.078-0.0870.288
위도0.0781.000-0.4900.000
경도-0.087-0.4901.0000.446
유네스코 세계문화유산 등재여부0.2880.0000.4461.000

Missing values

2024-04-21T23:43:32.671047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T23:43:33.080530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호관광지명주소위도경도유네스코 세계문화유산 등재여부비고데이터기준일
01계룡산충청남도 공주시 반포면 동학사1로 327-636.356884127.232444공주1경2023-08-18
12갑사충청남도 공주시 계룡면 갑사로 567-336.365366127.187194공주2경2023-08-18
23금강충청남도 공주시 금벽로 551(반포면~탄천면)36.461569127.14465공주3경2023-08-18
34공산성충청남도 공주시 웅진로 28036.464603127.123512공주4경2023-08-18
45고마나루충청남도 공주시 백제큰길 204536.468657127.106327공주5경2023-08-18
56공주 무령왕릉과 왕릉원충청남도 공주시 왕릉로 3736.461293127.113364공주6경2023-08-18
67마곡사충청남도 공주시 사곡면 마곡사로96636.557416127.011608공주7경2023-08-18
78동학사 은선폭포충청남도 공주시 반포면 동학사1로 327-636.356884127.232444공주8경2023-08-18
89석장리풍경충청남도 공주시 금벽로99036.447565127.189653공주9경2023-08-18
910금학생태공원충청남도 공주시 수원지공원길 7436.434033127.123784공주10경2023-08-18
번호관광지명주소위도경도유네스코 세계문화유산 등재여부비고데이터기준일
4546임립미술관충청남도 공주시 계룡면 봉곡길 77-1336.396136127.13952<NA>2023-08-18
4647대통사지(반죽동당간지주)충청남도 공주시 반죽동 300-236.451711127.122458<NA>2023-08-18
4748구 선교사의가옥충청남도 공주시 쪽지골길18-1336.449453127.126441<NA>2023-08-18
4849독립운동 기념관충청남도 공주시 중동 30836.452387127.12734<NA>2023-08-18
4950공주산성시장충청남도 공주시 산성동 181-8236.457161127.122988<NA>2023-08-18
5051우금티전적지충청남도 공주시 금학동 산80-236.432771127.110079<NA>2023-08-18
5152이안숲속충청남도 공주시 반포면 수목원길 2536.420944127.202705<NA>2023-08-18
5253지당자연사박물관충청남도 공주시 탄천면 장마루로1241-9036.324741127.066957<NA>2023-08-18
5354공주민속극박물관충청남도 공주시 의당면 돌모루1길4036.501318127.147157<NA>2023-08-18
5455요골공소충청남도 공주시 유구읍 명곡요골길 1536.613005126.954933<NA>2023-08-18