Overview

Dataset statistics

Number of variables8
Number of observations23
Missing cells33
Missing cells (%)17.9%
Duplicate rows1
Duplicate rows (%)4.3%
Total size in memory1.6 KiB
Average record size in memory69.7 B

Variable types

Text2
Unsupported6

Dataset

Description2014데미샘자연휴양림통계
Author전라북도
URLhttps://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=202262

Alerts

Dataset has 1 (4.3%) duplicate rowsDuplicates
2014년 데미샘자연휴양림 통계 has 20 (87.0%) missing valuesMissing
Unnamed: 1 has 3 (13.0%) missing valuesMissing
Unnamed: 2 has 1 (4.3%) missing valuesMissing
Unnamed: 3 has 2 (8.7%) missing valuesMissing
Unnamed: 4 has 1 (4.3%) missing valuesMissing
Unnamed: 5 has 2 (8.7%) missing valuesMissing
Unnamed: 6 has 2 (8.7%) missing valuesMissing
Unnamed: 7 has 2 (8.7%) missing valuesMissing
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-14 03:00:04.998698
Analysis finished2024-03-14 03:00:05.448231
Duration0.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct3
Distinct (%)100.0%
Missing20
Missing (%)87.0%
Memory size316.0 B
2024-03-14T12:00:05.539550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length3
Min length2

Characters and Unicode

Total characters9
Distinct characters9
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)100.0%

Sample

1st row구분
2nd row휴양관
3rd row숲속의집
ValueCountFrequency (%)
구분 1
33.3%
휴양관 1
33.3%
숲속의집 1
33.3%
2024-03-14T12:00:05.792902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

Unnamed: 1
Text

MISSING 

Distinct20
Distinct (%)100.0%
Missing3
Missing (%)13.0%
Memory size316.0 B
2024-03-14T12:00:05.940660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length4
Mean length3.75
Min length3

Characters and Unicode

Total characters75
Distinct characters38
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)100.0%

Sample

1st row101호
2nd row102호
3rd row103호
4th row104호
5th row105호
ValueCountFrequency (%)
101호 1
 
5.0%
102호 1
 
5.0%
하늘다람쥐 1
 
5.0%
잠자리 1
 
5.0%
산토끼 1
 
5.0%
부엉이 1
 
5.0%
반딧불이 1
 
5.0%
무당벌레 1
 
5.0%
메뚜기 1
 
5.0%
너구리 1
 
5.0%
Other values (10) 10
50.0%
2024-03-14T12:00:06.230265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
13.3%
0 10
 
13.3%
1 7
 
9.3%
2 7
 
9.3%
2
 
2.7%
2
 
2.7%
2
 
2.7%
5 2
 
2.7%
4 2
 
2.7%
3 2
 
2.7%
Other values (28) 29
38.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45
60.0%
Decimal Number 30
40.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
22.2%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Other values (22) 22
48.9%
Decimal Number
ValueCountFrequency (%)
0 10
33.3%
1 7
23.3%
2 7
23.3%
5 2
 
6.7%
4 2
 
6.7%
3 2
 
6.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45
60.0%
Common 30
40.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
22.2%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Other values (22) 22
48.9%
Common
ValueCountFrequency (%)
0 10
33.3%
1 7
23.3%
2 7
23.3%
5 2
 
6.7%
4 2
 
6.7%
3 2
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45
60.0%
ASCII 30
40.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10
22.2%
2
 
4.4%
2
 
4.4%
2
 
4.4%
2
 
4.4%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Other values (22) 22
48.9%
ASCII
ValueCountFrequency (%)
0 10
33.3%
1 7
23.3%
2 7
23.3%
5 2
 
6.7%
4 2
 
6.7%
3 2
 
6.7%

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)4.3%
Memory size316.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)8.7%
Memory size316.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)4.3%
Memory size316.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)8.7%
Memory size316.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)8.7%
Memory size316.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)8.7%
Memory size316.0 B

Correlations

2024-03-14T12:00:06.341926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2014년 데미샘자연휴양림 통계Unnamed: 1
2014년 데미샘자연휴양림 통계1.0000.000
Unnamed: 10.0001.000

Missing values

2024-03-14T12:00:05.121414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T12:00:05.249650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T12:00:05.364808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

2014년 데미샘자연휴양림 통계Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7
0<NA><NA>NaNNaNNaNNaNNaNNaN
1구분<NA>단가NaN예약NaN예약률공실율
2<NA><NA>비수기성수기비수기성수기NaNNaN
3휴양관101호35000500007740.22190.7781
4<NA>102호35000500006680.20170.7973
5<NA>103호35000500004710.20550.7945
6<NA>104호28000400007670.20270.7973
7<NA>105호280004000015800.26030.7397
8<NA>201호35000500005750.21920.7808
9<NA>202호35000500007700.2110.789
2014년 데미샘자연휴양림 통계Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7
13숲속의집고슴도치91000130000141060.32880.6712
14<NA>너구리9100013000013940.29320.7068
15<NA>메뚜기4900070000301100.38360.6164
16<NA>무당벌레4900070000461150.44110.5589
17<NA>반딧불이4900070000411100.43840.5616
18<NA>부엉이91000130000121030.31510.6849
19<NA>산토끼91000130000131040.32050.6795
20<NA>잠자리4900070000481160.44930.5507
21<NA>하늘다람쥐112000160000391320.46850.5315
22<NA>하늘소4900070000281110.38080.6192

Duplicate rows

Most frequently occurring

2014년 데미샘자연휴양림 통계Unnamed: 1# duplicates
0<NA><NA>2