Overview

Dataset statistics

Number of variables: 5
Number of observations: 2557
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 105.0 KiB
Average record size in memory: 42.1 B

Variable types

DateTime: 1
Numeric: 2
Categorical: 2

Dataset

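Description: Daily visitor counts (foreign / domestic) for Seongsan Ilchulbong from 2015 to 2021, managed by the Jeju Special Self-Governing Province World Heritage Office.
Author: Jeju Special Self-Governing Province
URL: https://www.data.go.kr/data/15102851/fileData.do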

Alerts

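데이터기준일자 (data reference date) has constant value "2022-07-26" [Constant]
외국인 (foreign visitors) is highly overall correlated with 내국인 (domestic visitors) [High correlation]
내국인 (domestic visitors) is highly overall correlated with 외국인 (foreign visitors) [High correlation]
특이사항 (remarks) is highly imbalanced (93.1%) [Imbalance]
해당일 (date) has unique values [Unique]
외국인 (foreign visitors) has 151 (5.9%) zeros [Zeros]
내국인 (domestic visitors) has 71 (2.8%) zeros [Zeros]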

Reproduction

Analysis started: 2023-12-12 17:05:42.390326
Analysis finished: 2023-12-12 17:05:43.668622
Duration: 1.28 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
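
The reported duration can be checked against the two timestamps. The sketch below uses only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-12 17:05:42.390326")
finished = datetime.fromisoformat("2023-12-12 17:05:43.668622")

# total_seconds() keeps the microsecond part of the difference.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} s")
```

Rounded to two decimals, the difference agrees with the duration shown in the report.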

Variables

해당일 (date)
Date

UNIQUE 

Distinct: 2557
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 20.1 KiB
Minimum: 2015-01-01 00:00:00
Maximum: 2021-12-31 00:00:00
Histogram with fixed size bins (bins=50)
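
A distinct count of 2557 over this date range suggests one row per calendar day with no gaps. A quick check, assuming every value is a whole date as the minimum and maximum are (the variable names are illustrative):

```python
from datetime import date

first_day = date(2015, 1, 1)   # Minimum reported for 해당일
last_day = date(2021, 12, 31)  # Maximum reported for 해당일

# Inclusive number of calendar days between the two dates.
calendar_days = (last_day - first_day).days + 1

reported_distinct = 2557  # "Distinct" value reported for 해당일

# If both numbers agree, every day in the range appears exactly once.
has_no_gaps = calendar_days == reported_distinct
print(calendar_days, has_no_gaps)
```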

외국인 (foreign visitors)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 1510
Distinct (%): 59.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1484.4865
Minimum: 0
Maximum: 9274
Zeros: 151
Zeros (%): 5.9%
Negative: 0
Negative (%): 0.0%
Memory size: 22.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 32
Median: 879
Q3: 2296
95-th percentile: 5122.2
Maximum: 9274
Range: 9274
Interquartile range (IQR): 2264

Descriptive statistics

Standard deviation: 1686.3027
Coefficient of variation (CV): 1.1359502
Kurtosis: 1.1404359
Mean: 1484.4865
Median Absolute Deviation (MAD): 863
Skewness: 1.3505481
Sum: 3795832
Variance: 2843616.8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
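
Several of the 외국인 figures above are derived from one another: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The sketch below re-derives them from the reported values. The variable names are illustrative, and small differences come from rounding in the report.

```python
# Values copied from the 외국인 statistics in this report.
n_observations = 2557
total = 3795832
std_dev = 1686.3027
q1, q3 = 32, 2296
minimum, maximum = 0, 9274

mean = total / n_observations   # arithmetic mean
cv = std_dev / mean             # coefficient of variation
variance = std_dev ** 2         # variance from the standard deviation
iqr = q3 - q1                   # interquartile range
value_range = maximum - minimum

print(round(mean, 4), round(cv, 4), round(variance, 1), iqr, value_range)
```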
Common Values

Value                Count  Frequency (%)
0                    151    5.9%
2                    44     1.7%
3                    37     1.4%
4                    33     1.3%
6                    30     1.2%
7                    28     1.1%
9                    25     1.0%
10                   25     1.0%
14                   23     0.9%
1                    22     0.9%
Other values (1500)  2139   83.7%

Minimum 10 values

Value  Count  Frequency (%)
0      151    5.9%
1      22     0.9%
2      44     1.7%
3      37     1.4%
4      33     1.3%
5      21     0.8%
6      30     1.2%
7      28     1.1%
8      20     0.8%
9      25     1.0%

Maximum 10 values

Value  Count  Frequency (%)
9274   1      < 0.1%
8341   1      < 0.1%
7833   1      < 0.1%
7672   1      < 0.1%
7444   1      < 0.1%
7386   1      < 0.1%
7345   1      < 0.1%
7310   1      < 0.1%
7196   1      < 0.1%
7156   1      < 0.1%

내국인 (domestic visitors)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 2068
Distinct (%): 80.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3560.8357
Minimum: 0
Maximum: 14955
Zeros: 71
Zeros (%): 2.8%
Negative: 0
Negative (%): 0.0%
Memory size: 22.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 662.8
Q1: 1800
Median: 3465
Q3: 4962
95-th percentile: 7318.8
Maximum: 14955
Range: 14955
Interquartile range (IQR): 3162
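
The interquartile range can be turned into a rough outlier screen. The sketch below applies Tukey's 1.5 × IQR rule to the 내국인 quartiles and to the ten largest 내국인 values listed in this report. That rule is a common convention, not something the report itself computes, and the variable names are illustrative.

```python
# Quartiles copied from the 내국인 quantile statistics.
q1, q3 = 1800, 4962
iqr = q3 - q1

# Tukey's rule: values more than 1.5 * IQR outside the quartiles are flagged.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# Ten largest 내국인 values listed in the report.
largest_values = [14955, 12730, 12534, 11441, 11186,
                  11079, 10279, 10219, 10007, 9976]
flagged = [value for value in largest_values if value > upper_fence]

print(iqr, lower_fence, upper_fence, len(flagged))
```

Because the lower fence is negative and visitor counts cannot be, this screen flags only unusually busy days.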

Descriptive statistics

Standard deviation: 2126.038
Coefficient of variation (CV): 0.59706151
Kurtosis: 0.32668498
Mean: 3560.8357
Median Absolute Deviation (MAD): 1594
Skewness: 0.59213147
Sum: 9105057
Variance: 4520037.5
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common Values

Value                Count  Frequency (%)
0                    71     2.8%
1199                 6      0.2%
1204                 5      0.2%
1200                 5      0.2%
1795                 4      0.2%
704                  4      0.2%
1205                 4      0.2%
1999                 4      0.2%
4789                 4      0.2%
3042                 4      0.2%
Other values (2058)  2446   95.7%

Minimum 10 values

Value  Count  Frequency (%)
0      71     2.8%
66     1      < 0.1%
78     1      < 0.1%
112    1      < 0.1%
118    1      < 0.1%
178    1      < 0.1%
197    1      < 0.1%
256    1      < 0.1%
278    1      < 0.1%
311    1      < 0.1%

Maximum 10 values

Value  Count  Frequency (%)
14955  1      < 0.1%
12730  1      < 0.1%
12534  1      < 0.1%
11441  1      < 0.1%
11186  1      < 0.1%
11079  1      < 0.1%
10279  1      < 0.1%
10219  1      < 0.1%
10007  1      < 0.1%
9976   1      < 0.1%

특이사항 (remarks)
Categorical

IMBALANCE 

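Distinct: 14
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 20.1 KiB

정상운영 (normal operation): 2486
휴관일 (closed day): 30
코로나19 확산방지 입장통제 (admission restricted to prevent the spread of COVID-19): 11
기상악화(태풍) 통제 (closed due to severe weather, typhoon): 7
대설특보 입장통제 (admission restricted under a heavy snow advisory): 4
Other values (9): 19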

Length

Max length: 16
Median length: 4
Mean length: 4.1286664
Min length: 3

Unique

Unique: 4
Unique (%): 0.2%

Sample

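1st row: 정상운영 (normal operation)
2nd row: 정상운영 (normal operation)
3rd row: 정상운영 (normal operation)
4th row: 정상운영 (normal operation)
5th row: 정상운영 (normal operation)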

Common Values

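Value                      Count  Frequency (%)  English
정상운영                    2486   97.2%          normal operation
휴관일                      30     1.2%           closed day
코로나19 확산방지 입장통제   11     0.4%           admission restricted to prevent the spread of COVID-19
기상악화(태풍) 통제          7      0.3%           closed due to severe weather (typhoon)
대설특보 입장통제            4      0.2%           admission restricted under a heavy snow advisory
기상특보(폭설) 입장통제      4      0.2%           admission restricted under a weather advisory (heavy snow)
기상악화(태풍 찬투) 통제     4      0.2%           closed due to severe weather (Typhoon Chanthu)
기상악화(폭설)통제           3      0.1%           closed due to severe weather (heavy snow)
입장통제(기상특보)           2      0.1%           admission restricted (weather advisory)
2019 문화의달 무료입장       2      0.1%           free admission for Culture Month 2019
Other values (4)           4      0.2%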
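
The 93.1% imbalance figure in the alerts is consistent with one minus the normalized Shannon entropy of these category counts. The sketch below assumes that definition, which the report text itself does not state, and expands the "Other values (4)" row into four categories of one row each, in line with the reported unique count of 4.

```python
import math

# Category counts for 특이사항; the last four entries stand for the
# "Other values (4)" row, assumed to be four categories with one row each.
counts = [2486, 30, 11, 7, 4, 4, 4, 3, 2, 2, 1, 1, 1, 1]

total = sum(counts)
probabilities = [count / total for count in counts]

# Shannon entropy of the category distribution, in bits.
entropy_bits = -sum(p * math.log2(p) for p in probabilities)

# Assumed definition: 1 - entropy / largest possible entropy for this
# number of categories (0 = perfectly balanced, 1 = a single category).
imbalance = 1 - entropy_bits / math.log2(len(counts))
print(f"{imbalance:.1%}")
```

A score this close to 1 reflects that a single category, 정상운영 (normal operation), covers 97.2% of the rows.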

Length

Histogram of lengths of the category
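Words

Value             Count  Frequency (%)  English
정상운영           2486   95.2%          normal operation
휴관일             30     1.1%           closed day
입장통제           19     0.7%           admission restricted
기상악화(태풍      14     0.5%           severe weather, typhoon
통제               13     0.5%           closed
코로나19           11     0.4%           COVID-19
확산방지           11     0.4%           spread prevention
기상특보(폭설      4      0.2%           weather advisory, heavy snow
찬투               4      0.2%           Chanthu (typhoon name)
대설특보           4      0.2%           heavy snow advisory
Other values (9)  15     0.6%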
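
The word counts above, including truncated tokens such as 기상악화(태풍, are consistent with splitting each category on whitespace and then stripping ASCII punctuation from the edges of every token. The sketch below assumes that procedure, which the report does not confirm. It uses only the ten categories listed individually under Common Values, so tokens that also occur in the four unlisted single-row categories come out lower than in the report (for example 11 rather than 14 for 기상악화(태풍).

```python
import string
from collections import Counter

# The ten 특이사항 categories listed individually under Common Values.
# The four single-row categories grouped as "Other values" are unknown.
category_counts = {
    "정상운영": 2486,
    "휴관일": 30,
    "코로나19 확산방지 입장통제": 11,
    "기상악화(태풍) 통제": 7,
    "대설특보 입장통제": 4,
    "기상특보(폭설) 입장통제": 4,
    "기상악화(태풍 찬투) 통제": 4,
    "기상악화(폭설)통제": 3,
    "입장통제(기상특보)": 2,
    "2019 문화의달 무료입장": 2,
}

word_counts = Counter()
for category, rows in category_counts.items():
    for token in category.split():
        # Assumed cleanup: remove ASCII punctuation at the token edges only,
        # so a trailing ")" is dropped while an inner "(" is kept.
        word = token.strip(string.punctuation)
        if word:
            word_counts[word] += rows

for word, count in word_counts.most_common(8):
    print(word, count)
```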

데이터기준일자 (data reference date)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 20.1 KiB

2022-07-26: 2557

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022-07-26
2nd row: 2022-07-26
3rd row: 2022-07-26
4th row: 2022-07-26
5th row: 2022-07-26

Common Values

Value       Count  Frequency (%)
2022-07-26  2557   100.0%

Length

Histogram of lengths of the category

Common Values (Plot)


Words

Value       Count  Frequency (%)
2022-07-26  2557   100.0%

Interactions

[Four interaction plots for the numeric variables 외국인 (foreign visitors) and 내국인 (domestic visitors); images not included]

Correlations

Variables: 외국인 = foreign visitors, 내국인 = domestic visitors, 특이사항 = remarks.

First correlation matrix

          외국인  내국인  특이사항
외국인     1.000   0.552   0.000
내국인     0.552   1.000   0.213
특이사항   0.000   0.213   1.000

Second correlation matrix

          외국인  내국인  특이사항
외국인     1.000   0.708   0.000
내국인     0.708   1.000   0.087
특이사항   0.000   0.087   1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

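First rows

Columns: 해당일 = date, 외국인 = foreign visitors, 내국인 = domestic visitors, 특이사항 = remarks (정상운영 = normal operation), 데이터기준일자 = data reference date.

      해당일       외국인  내국인  특이사항  데이터기준일자
0     2015-01-01  273     2030    정상운영  2022-07-26
1     2015-01-02  2545    5843    정상운영  2022-07-26
2     2015-01-03  2013    6998    정상운영  2022-07-26
3     2015-01-04  2776    4969    정상운영  2022-07-26
4     2015-01-05  2073    4052    정상운영  2022-07-26
5     2015-01-06  1778    3700    정상운영  2022-07-26
6     2015-01-07  1826    3901    정상운영  2022-07-26
7     2015-01-08  2061    3758    정상운영  2022-07-26
8     2015-01-09  2550    3946    정상운영  2022-07-26
9     2015-01-10  1626    4864    정상운영  2022-07-26
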
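Last rows

Columns: 해당일 = date, 외국인 = foreign visitors, 내국인 = domestic visitors, 특이사항 = remarks (정상운영 = normal operation), 데이터기준일자 = data reference date.

      해당일       외국인  내국인  특이사항  데이터기준일자
2547  2021-12-22  22      1723    정상운영  2022-07-26
2548  2021-12-23  19      1468    정상운영  2022-07-26
2549  2021-12-24  17      1295    정상운영  2022-07-26
2550  2021-12-25  6       1152    정상운영  2022-07-26
2551  2021-12-26  3       1027    정상운영  2022-07-26
2552  2021-12-27  9       1723    정상운영  2022-07-26
2553  2021-12-28  45      2225    정상운영  2022-07-26
2554  2021-12-29  8       2210    정상운영  2022-07-26
2555  2021-12-30  64      2421    정상운영  2022-07-26
2556  2021-12-31  8       2071    정상운영  2022-07-26

In rows 2547, 2548, 2552 and 2555 the source runs the two counts together without a separator; the split shown assumes foreign counts in the tens, as in the neighboring rows.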