Overview

Dataset statistics

Number of variables: 6
Number of observations: 67
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.5 KiB
Average record size in memory: 54.0 B

Variable types

Numeric: 3
Categorical: 2
Text: 1

Dataset

Description: Sample data
Author: 지디에스컨설팅그룹
URL: https://www.bigdata-environment.kr/user/data_market/detail.do?id=46839e50-2e00-11ea-9713-eb3e5186fb38

Alerts

전국 미세먼지 수치 has constant value "34.842" (Constant)
생태관광고유번호 is highly overall correlated with 생태관광지군 (High correlation)
생태관광지별 미세먼지 수치 is highly overall correlated with 생태관광지별 미세먼지 비율 (High correlation)
생태관광지별 미세먼지 비율 is highly overall correlated with 생태관광지별 미세먼지 수치 (High correlation)
생태관광지군 is highly overall correlated with 생태관광고유번호 (High correlation)
생태관광고유번호 has unique values (Unique)
생태관광지명 has unique values (Unique)
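
The Constant and Unique alerts can be read straight off the distinct counts listed in the Variables section. The sketch below assumes that a variable is flagged Constant when it has exactly one distinct value and Unique when its distinct count equals the number of observations. The function name `flag_alerts` is hypothetical, and this is an illustration of the rule, not ydata-profiling's own implementation.

```python
N_OBSERVATIONS = 67

# Distinct counts as reported per variable in this report.
DISTINCT_COUNTS = {
    "생태관광고유번호": 67,
    "생태관광지군": 26,
    "생태관광지명": 67,
    "생태관광지별 미세먼지 수치": 28,
    "전국 미세먼지 수치": 1,
    "생태관광지별 미세먼지 비율": 25,
}


def flag_alerts(distinct_counts, n_rows):
    """Return {variable: alert} for constant and all-unique variables (assumed rule)."""
    alerts = {}
    for name, distinct in distinct_counts.items():
        if distinct == 1:
            alerts[name] = "Constant"
        elif distinct == n_rows:
            alerts[name] = "Unique"
    return alerts


ALERTS = flag_alerts(DISTINCT_COUNTS, N_OBSERVATIONS)
for name, alert in ALERTS.items():
    print(f"{name}: {alert}")
```

Under these assumptions the three flagged variables match the Constant and Unique alerts listed above.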

Reproduction

Analysis started: 2023-12-10 13:16:20.470961
Analysis finished: 2023-12-10 13:16:23.345904
Duration: 2.87 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

생태관광고유번호 (ecotourism site ID)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 67
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 34
Minimum: 1
Maximum: 67
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 735.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.3
Q1: 17.5
Median: 34
Q3: 50.5
95-th percentile: 63.7
Maximum: 67
Range: 66
Interquartile range (IQR): 33

Descriptive statistics

Standard deviation: 19.485037
Coefficient of variation (CV): 0.57308932
Kurtosis: -1.2
Mean: 34
Median Absolute Deviation (MAD): 17
Skewness: 0
Sum: 2278
Variance: 379.66667
Monotonicity: Strictly increasing
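
These figures are what one would expect if the IDs are simply the integers 1 through 67. That is an assumption (the report only states 67 distinct values between 1 and 67), but it is consistent with the reported sum of 2278. The sketch below recomputes the main statistics with the standard library, using the sample variance (n - 1 denominator) and linear-interpolation percentiles; the helper `percentile` is a hypothetical name.

```python
import math
import statistics

values = list(range(1, 68))  # assumption: IDs are the integers 1..67

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std_dev = math.sqrt(variance)
cv = std_dev / mean
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)


def percentile(sorted_values, q):
    """Percentile by linear interpolation between neighbours, q in [0, 1]."""
    pos = q * (len(sorted_values) - 1)
    lo = math.floor(pos)
    hi = math.ceil(pos)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)
print(total, mean, round(variance, 5), round(std_dev, 6), q1, q3, mad)
```

With that assumption the recomputed sum, mean, variance, quartiles and MAD agree with the values reported above.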
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 1.5%
44 | 1 | 1.5%
50 | 1 | 1.5%
49 | 1 | 1.5%
48 | 1 | 1.5%
47 | 1 | 1.5%
46 | 1 | 1.5%
45 | 1 | 1.5%
43 | 1 | 1.5%
2 | 1 | 1.5%
Other values (57) | 57 | 85.1%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 1.5%
2 | 1 | 1.5%
3 | 1 | 1.5%
4 | 1 | 1.5%
5 | 1 | 1.5%
6 | 1 | 1.5%
7 | 1 | 1.5%
8 | 1 | 1.5%
9 | 1 | 1.5%
10 | 1 | 1.5%

Maximum 10 values

Value | Count | Frequency (%)
67 | 1 | 1.5%
66 | 1 | 1.5%
65 | 1 | 1.5%
64 | 1 | 1.5%
63 | 1 | 1.5%
62 | 1 | 1.5%
61 | 1 | 1.5%
60 | 1 | 1.5%
59 | 1 | 1.5%
58 | 1 | 1.5%

생태관광지군 (ecotourism site group)
Categorical

HIGH CORRELATION

Distinct: 26
Distinct (%): 38.8%
Missing: 0
Missing (%): 0.0%
Memory size: 668.0 B

Length

Max length: 21
Median length: 13
Mean length: 10.626866
Min length: 6

Unique

Unique: 6
Unique (%): 9.0%

Sample

1st row: 제주 동백동산습지
2nd row: 고창 고인돌운곡습지
3rd row: 고창 고인돌운곡습지
4th row: 인제 생태마을(용늪)
5th row: 인제 생태마을(용늪)

Common Values

Value | Count | Frequency (%)
밀양 사자평습지와 재약산 | 5 | 7.5%
남해 앵강만 | 4 | 6.0%
제주 저지곶자왈과 저지오름 | 4 | 6.0%
안산 대부도·대송습지 | 4 | 6.0%
양구 DMZ | 4 | 6.0%
영양 밤하늘 반딧불이 공원 | 4 | 6.0%
철원DMZ 두루미평화타운 및 철새도래지 | 4 | 6.0%
김해 화포천습지 | 3 | 4.5%
울산 태화강 | 3 | 4.5%
강릉 가시연습지경포호 | 3 | 4.5%
Other values (16) | 29 | 43.3%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
(token not preserved) | 7 | 4.0%
밀양 | 5 | 2.9%
재약산 | 5 | 2.9%
사자평습지와 | 5 | 2.9%
제주 | 5 | 2.9%
영양 | 4 | 2.3%
철새도래지 | 4 | 2.3%
두루미평화타운 | 4 | 2.3%
철원dmz | 4 | 2.3%
공원 | 4 | 2.3%
Other values (53) | 126 | 72.8%

생태관광지명 (ecotourism site name)
Text

UNIQUE

Distinct: 67
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 668.0 B

Length

Max length: 18
Median length: 13
Mean length: 7.0746269
Min length: 2

Characters and Unicode

Total characters: 474
Distinct characters: 175
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
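
As an illustration of how such character properties can be read, the sketch below counts the Unicode general categories in one site name using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and the middle dot is written as U+00B7, which is an assumption about the exact code point used in the data. The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Zs` is Space Separator, `Ps` and `Pe` are Open and Close Punctuation, and `Po` is Other Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of `text` by Unicode general category (e.g. 'Lo', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)


# One site name from this report; the middle dot is assumed to be U+00B7.
name = "저지오름 (당오름\u00b7닥오름\u00b7새오름)"
counts = category_counts(name)

for category, count in sorted(counts.items()):
    print(category, count)
```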

Unique

Unique: 67
Unique (%): 100.0%

Sample

1st row: 동백동산습지센터
2nd row: 고창고인돌유적 안내소
3rd row: 고인돌박물관
4th row: 대암산과 용늪
5th row: 자작나무숲

Words

Value | Count | Frequency (%)
봉하마을 | 2 | 2.2%
백제가요 | 1 | 1.1%
가천 | 1 | 1.1%
영양군청소년수련원 | 1 | 1.1%
반딧불이생태공원 | 1 | 1.1%
생태학교 | 1 | 1.1%
(token not preserved) | 1 | 1.1%
천문대 | 1 | 1.1%
반딧불이 | 1 | 1.1%
오솔길 | 1 | 1.1%
Other values (82) | 82 | 88.2%

Most occurring characters

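Value | Count | Frequency (%)
(space) | 26 | 5.5%
(not preserved) | 17 | 3.6%
(not preserved) | 14 | 3.0%
(not preserved) | 12 | 2.5%
(not preserved) | 12 | 2.5%
(not preserved) | 12 | 2.5%
(not preserved) | 9 | 1.9%
(not preserved) | 8 | 1.7%
(not preserved) | 8 | 1.7%
(not preserved) | 7 | 1.5%
Other values (165) | 349 | 73.6%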

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 435 | 91.8%
Space Separator | 26 | 5.5%
Close Punctuation | 5 | 1.1%
Open Punctuation | 5 | 1.1%
Other Punctuation | 2 | 0.4%
Decimal Number | 1 | 0.2%

Most frequent character per category

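Other Letter

Value | Count | Frequency (%)
(not preserved) | 17 | 3.9%
(not preserved) | 14 | 3.2%
(not preserved) | 12 | 2.8%
(not preserved) | 12 | 2.8%
(not preserved) | 12 | 2.8%
(not preserved) | 9 | 2.1%
(not preserved) | 8 | 1.8%
(not preserved) | 8 | 1.8%
(not preserved) | 7 | 1.6%
(not preserved) | 7 | 1.6%
Other values (160) | 329 | 75.6%

Space Separator

Value | Count | Frequency (%)
(space) | 26 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 5 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 5 | 100.0%

Other Punctuation

Value | Count | Frequency (%)
· | 2 | 100.0%

Decimal Number

Value | Count | Frequency (%)
1 | 1 | 100.0%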

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 435 | 91.8%
Common | 39 | 8.2%

Most frequent character per script

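Hangul

Value | Count | Frequency (%)
(not preserved) | 17 | 3.9%
(not preserved) | 14 | 3.2%
(not preserved) | 12 | 2.8%
(not preserved) | 12 | 2.8%
(not preserved) | 12 | 2.8%
(not preserved) | 9 | 2.1%
(not preserved) | 8 | 1.8%
(not preserved) | 8 | 1.8%
(not preserved) | 7 | 1.6%
(not preserved) | 7 | 1.6%
Other values (160) | 329 | 75.6%

Common

Value | Count | Frequency (%)
(space) | 26 | 66.7%
) | 5 | 12.8%
( | 5 | 12.8%
· | 2 | 5.1%
1 | 1 | 2.6%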

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 435 | 91.8%
ASCII | 37 | 7.8%
None | 2 | 0.4%

Most frequent character per block

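ASCII

Value | Count | Frequency (%)
(space) | 26 | 70.3%
) | 5 | 13.5%
( | 5 | 13.5%
1 | 1 | 2.7%

Hangul

Value | Count | Frequency (%)
(not preserved) | 17 | 3.9%
(not preserved) | 14 | 3.2%
(not preserved) | 12 | 2.8%
(not preserved) | 12 | 2.8%
(not preserved) | 12 | 2.8%
(not preserved) | 9 | 2.1%
(not preserved) | 8 | 1.8%
(not preserved) | 8 | 1.8%
(not preserved) | 7 | 1.6%
(not preserved) | 7 | 1.6%
Other values (160) | 329 | 75.6%

None

Value | Count | Frequency (%)
· | 2 | 100.0%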

생태관광지별 미세먼지 수치 (fine dust level by ecotourism site)
Real number (ℝ)

HIGH CORRELATION

Distinct: 28
Distinct (%): 41.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 34.842343
Minimum: 26.114
Maximum: 39.544
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 735.0 B

Quantile statistics

Minimum: 26.114
5-th percentile: 30.7517
Q1: 35.097
Median: 35.339
Q3: 35.339
95-th percentile: 37.1981
Maximum: 39.544
Range: 13.43
Interquartile range (IQR): 0.242

Descriptive statistics

Standard deviation: 2.0826685
Coefficient of variation (CV): 0.059774065
Kurtosis: 5.5843879
Mean: 34.842343
Median Absolute Deviation (MAD): 0
Skewness: -1.6567907
Sum: 2334.437
Variance: 4.337508
Monotonicity: Not monotonic
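
Several of these figures are derived from others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the CV is the standard deviation divided by the mean, and the variance is the standard deviation squared. The sketch below recomputes them from the values printed in this section as a consistency check. The dictionary keys are hypothetical names, and the inputs are the rounded figures shown in the report, so small rounding differences are expected.

```python
# Values copied from the statistics reported for 생태관광지별 미세먼지 수치.
reported = {
    "minimum": 26.114,
    "maximum": 39.544,
    "q1": 35.097,
    "q3": 35.339,
    "mean": 34.842343,
    "std": 2.0826685,
    "sum": 2334.437,
    "n": 67,  # number of observations in the dataset
}

derived = {
    "range": reported["maximum"] - reported["minimum"],
    "iqr": reported["q3"] - reported["q1"],
    "cv": reported["std"] / reported["mean"],
    "variance": reported["std"] ** 2,
    "mean_from_sum": reported["sum"] / reported["n"],
}

for name, value in derived.items():
    print(f"{name}: {value:.6f}")
```

The recomputed mean (sum divided by 67) also rounds to 34.842, the same figure that appears as the constant 전국 미세먼지 수치.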
Histogram with fixed size bins (bins=28)

Common values

Value | Count | Frequency (%)
35.339 | 36 | 53.7%
36.128 | 3 | 4.5%
32.948 | 2 | 3.0%
34.503 | 2 | 3.0%
39.48 | 1 | 1.5%
35.111 | 1 | 1.5%
34.372 | 1 | 1.5%
35.907 | 1 | 1.5%
35.404 | 1 | 1.5%
33.186 | 1 | 1.5%
Other values (18) | 18 | 26.9%

Minimum 10 values

Value | Count | Frequency (%)
26.114 | 1 | 1.5%
28.361 | 1 | 1.5%
30.69 | 1 | 1.5%
30.701 | 1 | 1.5%
30.87 | 1 | 1.5%
31.14 | 1 | 1.5%
31.889 | 1 | 1.5%
32.948 | 2 | 3.0%
33.135 | 1 | 1.5%
33.186 | 1 | 1.5%

Maximum 10 values

Value | Count | Frequency (%)
39.544 | 1 | 1.5%
39.48 | 1 | 1.5%
38.37 | 1 | 1.5%
37.562 | 1 | 1.5%
36.349 | 1 | 1.5%
36.128 | 3 | 4.5%
35.907 | 1 | 1.5%
35.668 | 1 | 1.5%
35.404 | 1 | 1.5%
35.339 | 36 | 53.7%

전국 미세먼지 수치 (national fine dust level)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 668.0 B

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 34.842
2nd row: 34.842
3rd row: 34.842
4th row: 34.842
5th row: 34.842

Common Values

Value | Count | Frequency (%)
34.842 | 67 | 100.0%

Length

Histogram of lengths of the category


생태관광지별 미세먼지 비율 (fine dust ratio by ecotourism site)
Real number (ℝ)

HIGH CORRELATION

Distinct: 25
Distinct (%): 37.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.99986567
Minimum: 0.75
Maximum: 1.135
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 735.0 B

Quantile statistics

Minimum: 0.75
5-th percentile: 0.8825
Q1: 1.0075
Median: 1.014
Q3: 1.014
95-th percentile: 1.0675
Maximum: 1.135
Range: 0.385
Interquartile range (IQR): 0.0065

Descriptive statistics

Standard deviation: 0.05969922
Coefficient of variation (CV): 0.05970724
Kurtosis: 5.5663843
Mean: 0.99986567
Median Absolute Deviation (MAD): 0
Skewness: -1.6493752
Sum: 66.991
Variance: 0.0035639968
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=25)

Common values

Value | Count | Frequency (%)
1.014 | 37 | 55.2%
1.037 | 3 | 4.5%
0.99 | 2 | 3.0%
0.881 | 2 | 3.0%
0.946 | 2 | 3.0%
1.008 | 2 | 3.0%
0.915 | 1 | 1.5%
1.133 | 1 | 1.5%
0.986 | 1 | 1.5%
1.031 | 1 | 1.5%
Other values (15) | 15 | 22.4%

Minimum 10 values

Value | Count | Frequency (%)
0.75 | 1 | 1.5%
0.814 | 1 | 1.5%
0.881 | 2 | 3.0%
0.886 | 1 | 1.5%
0.894 | 1 | 1.5%
0.915 | 1 | 1.5%
0.946 | 2 | 3.0%
0.951 | 1 | 1.5%
0.952 | 1 | 1.5%
0.986 | 1 | 1.5%

Maximum 10 values

Value | Count | Frequency (%)
1.135 | 1 | 1.5%
1.133 | 1 | 1.5%
1.101 | 1 | 1.5%
1.078 | 1 | 1.5%
1.043 | 1 | 1.5%
1.037 | 3 | 4.5%
1.031 | 1 | 1.5%
1.024 | 1 | 1.5%
1.016 | 1 | 1.5%
1.014 | 37 | 55.2%

Interactions

[Figures: nine pairwise interaction plots of the numeric variables; images not preserved.]

Correlations

Variable | 생태관광고유번호 | 생태관광지군 | 생태관광지명 | 생태관광지별 미세먼지 수치 | 생태관광지별 미세먼지 비율
생태관광고유번호 | 1.000 | 0.982 | 1.000 | 0.428 | 0.428
생태관광지군 | 0.982 | 1.000 | 1.000 | 0.855 | 0.855
생태관광지명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
생태관광지별 미세먼지 수치 | 0.428 | 0.855 | 1.000 | 1.000 | 1.000
생태관광지별 미세먼지 비율 | 0.428 | 0.855 | 1.000 | 1.000 | 1.000

Variable | 생태관광고유번호 | 생태관광지별 미세먼지 수치 | 생태관광지별 미세먼지 비율 | 생태관광지군
생태관광고유번호 | 1.000 | 0.113 | 0.106 | 0.751
생태관광지별 미세먼지 수치 | 0.113 | 1.000 | 0.992 | 0.445
생태관광지별 미세먼지 비율 | 0.106 | 0.992 | 1.000 | 0.445
생태관광지군 | 0.751 | 0.445 | 0.445 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 생태관광고유번호 | 생태관광지군 | 생태관광지명 | 생태관광지별 미세먼지 수치 | 전국 미세먼지 수치 | 생태관광지별 미세먼지 비율
0 | 1 | 제주 동백동산습지 | 동백동산습지센터 | 35.339 | 34.842 | 1.014
1 | 2 | 고창 고인돌운곡습지 | 고창고인돌유적 안내소 | 34.503 | 34.842 | 0.99
2 | 3 | 고창 고인돌운곡습지 | 고인돌박물관 | 34.503 | 34.842 | 0.99
3 | 4 | 인제 생태마을(용늪) | 대암산과 용늪 | 35.339 | 34.842 | 1.014
4 | 5 | 인제 생태마을(용늪) | 자작나무숲 | 35.133 | 34.842 | 1.008
5 | 6 | 신안 영산도 명품마을 | 영산도 명품마을 | 35.339 | 34.842 | 1.014
6 | 7 | 부산 낙동강하구 | 낙동강하구에코센터 | 38.37 | 34.842 | 1.101
7 | 8 | 부산 낙동강하구 | 낙동강하구아미산전망대 | 37.562 | 34.842 | 1.078
8 | 9 | 울산 태화강 | 태화강십리대숲 | 31.889 | 34.842 | 0.915
9 | 10 | 울산 태화강 | 태화강전망대 | 28.361 | 34.842 | 0.814

Last rows

Index | 생태관광고유번호 | 생태관광지군 | 생태관광지명 | 생태관광지별 미세먼지 수치 | 전국 미세먼지 수치 | 생태관광지별 미세먼지 비율
57 | 58 | 밀양 사자평습지와 재약산 | 사자평습지 | 35.339 | 34.842 | 1.014
58 | 59 | 밀양 사자평습지와 재약산 | 재약산(수미봉) | 35.339 | 34.842 | 1.014
59 | 60 | 제주 저지곶자왈과 저지오름 | 저지곶자왈 (한수풀) | 35.339 | 34.842 | 1.014
60 | 61 | 제주 저지곶자왈과 저지오름 | 저지오름 (당오름·닥오름·새오름) | 35.339 | 34.842 | 1.014
61 | 62 | 제주 저지곶자왈과 저지오름 | 용선달리 | 34.372 | 34.842 | 0.986
62 | 63 | 제주 저지곶자왈과 저지오름 | 현대미술관과 예술인 마을 | 35.339 | 34.842 | 1.014
63 | 64 | 철원DMZ 두루미평화타운 및 철새도래지 | 삽슬봉(아이스크림고지) | 35.111 | 34.842 | 1.008
64 | 65 | 철원DMZ 두루미평화타운 및 철새도래지 | 약천교 | 35.339 | 34.842 | 1.014
65 | 66 | 철원DMZ 두루미평화타운 및 철새도래지 | 철원 근대문화유적센터 | 35.339 | 34.842 | 1.014
66 | 67 | 철원DMZ 두루미평화타운 및 철새도래지 | 월정리역 | 35.339 | 34.842 | 1.014
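
The sample rows suggest that 생태관광지별 미세먼지 비율 is the site-level value divided by the constant 전국 미세먼지 수치 (34.842). This relationship is inferred from the displayed figures and is not stated in the source, but it would explain why the two site-level columns are flagged as highly correlated. The sketch below checks it on the distinct (level, ratio) pairs from the sample rows above; the function name `ratio_to_national` is hypothetical.

```python
NATIONAL_LEVEL = 34.842  # 전국 미세먼지 수치, constant across all rows

# Distinct (site level, reported ratio) pairs copied from the sample rows.
SAMPLE_ROWS = [
    (35.339, 1.014),
    (34.503, 0.99),
    (35.133, 1.008),
    (38.37, 1.101),
    (37.562, 1.078),
    (31.889, 0.915),
    (28.361, 0.814),
    (34.372, 0.986),
    (35.111, 1.008),
]


def ratio_to_national(site_level, national_level=NATIONAL_LEVEL):
    """Site-level value relative to the national value (assumed definition)."""
    return site_level / national_level


# Largest absolute difference between the recomputed and the reported ratio.
max_gap = max(abs(ratio_to_national(level) - ratio) for level, ratio in SAMPLE_ROWS)
print(f"largest gap: {max_gap:.5f}")
```

The recomputed ratios agree with the reported ones to within about 0.001, which is compatible with the displayed figures having been rounded to three decimals.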