Overview

Dataset statistics

Number of variables4
Number of observations39
Missing cells7
Missing cells (%)4.5%
Duplicate rows1
Duplicate rows (%)2.6%
Total size in memory1.3 KiB
Average record size in memory35.4 B

Variable types

Unsupported2
Categorical1
Text1

Alerts

Dataset has 1 (2.6%) duplicate rowsDuplicates
도유림 조림 현황 has 3 (7.7%) missing valuesMissing
Unnamed: 2 has 2 (5.1%) missing valuesMissing
Unnamed: 3 has 2 (5.1%) missing valuesMissing
도유림 조림 현황 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-14 00:33:35.113841
Analysis finished2024-03-14 00:33:35.426163
Duration0.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

도유림 조림 현황
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)7.7%
Memory size444.0 B

Unnamed: 1
Categorical

Distinct18
Distinct (%)46.2%
Missing0
Missing (%)0.0%
Memory size444.0 B
완주군 동상면 대아리 산1-2
진안군 백운면 신암리 산1-11
<NA>
장수군 장계면 명덕리 산154-1
진안군 백운면 신암리 산1
Other values (13)
16 

Length

Max length21
Median length17
Mean length15.538462
Min length4

Unique

Unique10 ?
Unique (%)25.6%

Sample

1st row<NA>
2nd row<NA>
3rd row위 치
4th row<NA>
5th row장수군 장계면 명덕리 산154-1

Common Values

ValueCountFrequency (%)
완주군 동상면 대아리 산1-2 6
15.4%
진안군 백운면 신암리 산1-11 5
12.8%
<NA> 4
10.3%
장수군 장계면 명덕리 산154-1 4
10.3%
진안군 백운면 신암리 산1 4
10.3%
장수군 장계면 명덕리 산154-93 2
 
5.1%
완주군 운주면 고당리 산30 2
 
5.1%
진안군 백운면 신암리 산1, 산1-11 2
 
5.1%
완주군 소양면 신촌리 산18-1외 2필 1
 
2.6%
진안군 백운면 노촌리 산1외 1필 1
 
2.6%
Other values (8) 8
20.5%

Length

2024-03-14T09:33:35.484525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
진안군 14
 
9.3%
백운면 14
 
9.3%
신암리 11
 
7.3%
완주군 10
 
6.7%
동상면 7
 
4.7%
산1-11 7
 
4.7%
산1 7
 
4.7%
대아리 6
 
4.0%
산1-2 6
 
4.0%
장수군 6
 
4.0%
Other values (33) 62
41.3%

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)5.1%
Memory size444.0 B

Unnamed: 3
Text

MISSING 

Distinct36
Distinct (%)97.3%
Missing2
Missing (%)5.1%
Memory size444.0 B
2024-03-14T09:33:35.645789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length9.5675676
Min length4

Characters and Unicode

Total characters354
Distinct characters24
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)94.6%

Sample

1st row (단위 : ha)
2nd row사업기간
3rd row10.22~11.20
4th row2.26~4.16
5th row2.28~4.19
ValueCountFrequency (%)
03.30~05.16 2
 
5.1%
03.13~04.23 1
 
2.6%
11.07~12.05 1
 
2.6%
03.31~05.29 1
 
2.6%
10.8~11.6 1
 
2.6%
4.22~5.20 1
 
2.6%
10.19~11.08 1
 
2.6%
03.08~04.27 1
 
2.6%
03.14~06.11 1
 
2.6%
10.26~11.23 1
 
2.6%
Other values (28) 28
71.8%
2024-03-14T09:33:35.942251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 70
19.8%
1 69
19.5%
0 37
10.5%
~ 35
9.9%
2 34
9.6%
3 29
8.2%
4 22
 
6.2%
5 12
 
3.4%
6 9
 
2.5%
8 8
 
2.3%
Other values (14) 29
8.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 235
66.4%
Other Punctuation 71
 
20.1%
Math Symbol 35
 
9.9%
Other Letter 6
 
1.7%
Space Separator 3
 
0.8%
Lowercase Letter 2
 
0.6%
Open Punctuation 1
 
0.3%
Close Punctuation 1
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 69
29.4%
0 37
15.7%
2 34
14.5%
3 29
12.3%
4 22
 
9.4%
5 12
 
5.1%
6 9
 
3.8%
8 8
 
3.4%
9 8
 
3.4%
7 7
 
3.0%
Other Letter
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Other Punctuation
ValueCountFrequency (%)
. 70
98.6%
: 1
 
1.4%
Lowercase Letter
ValueCountFrequency (%)
a 1
50.0%
h 1
50.0%
Math Symbol
ValueCountFrequency (%)
~ 35
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 346
97.7%
Hangul 6
 
1.7%
Latin 2
 
0.6%

Most frequent character per script

Common
ValueCountFrequency (%)
. 70
20.2%
1 69
19.9%
0 37
10.7%
~ 35
10.1%
2 34
9.8%
3 29
8.4%
4 22
 
6.4%
5 12
 
3.5%
6 9
 
2.6%
8 8
 
2.3%
Other values (6) 21
 
6.1%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Latin
ValueCountFrequency (%)
a 1
50.0%
h 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 348
98.3%
Hangul 6
 
1.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 70
20.1%
1 69
19.8%
0 37
10.6%
~ 35
10.1%
2 34
9.8%
3 29
8.3%
4 22
 
6.3%
5 12
 
3.4%
6 9
 
2.6%
8 8
 
2.3%
Other values (8) 23
 
6.6%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Correlations

2024-03-14T09:33:36.015436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 1Unnamed: 3
Unnamed: 11.0000.988
Unnamed: 30.9881.000

Missing values

2024-03-14T09:33:35.214715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T09:33:35.277060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T09:33:35.368677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

도유림 조림 현황Unnamed: 1Unnamed: 2Unnamed: 3
0NaN<NA>NaN<NA>
1NaN<NA>NaN(단위 : ha)
2시행년도위 치사업량사업기간
3합계<NA>296.1<NA>
42001장수군 장계면 명덕리 산154-12.410.22~11.20
52001장수군 장계면 명덕리 산154-1152.26~4.16
62001완주군 소양면 신촌리 산18-1외 2필102.28~4.19
72001진안군 백운면 신암리 산153.2~4.17
82002장수군 장계면 명덕리 산154-1123.15~4.23
92002순창군 쌍치면 금성리 산434.610.23~11.22
도유림 조림 현황Unnamed: 1Unnamed: 2Unnamed: 3
292011진안군 백운면 신암리 산1, 산1-11303.30~05.16
302011진안군 백운면 신암리 산11003.30~05.16
312011완주군 동상면 대아리 산1-21503.31~05.29
322012진안군 백운면 신암리 산11503.13~04.23
332012완주군 동상면 대아리 산1-21003.14~06.11
342012진안군 백운면 신암리 산1, 산1-111003.08~04.27
352012완주군 동상면 대아리 산1-2510.19~11.08
362013진안군 백운면 신암리 산1104.22~5.20
37NaN<NA>0.510.8~11.6
382013진안군 백운면 신암리 산1-1123.29~4.28

Duplicate rows

Most frequently occurring

Unnamed: 1Unnamed: 3# duplicates
0<NA><NA>2