Overview

Dataset statistics

Number of variables10
Number of observations82
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.5 KiB
Average record size in memory81.6 B

Variable types

Categorical8
Text2

Dataset

Description시군별농촌체험휴양마을지정현황2014
Author전라북도
URLhttps://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=202294

Alerts

Unnamed: 7 is highly overall correlated with 농촌체험휴양마을 지정현황(전북) and 6 other fieldsHigh correlation
Unnamed: 5 is highly overall correlated with Unnamed: 3 and 3 other fieldsHigh correlation
Unnamed: 8 is highly overall correlated with 농촌체험휴양마을 지정현황(전북) and 6 other fieldsHigh correlation
농촌체험휴양마을 지정현황(전북) is highly overall correlated with Unnamed: 7 and 2 other fieldsHigh correlation
Unnamed: 9 is highly overall correlated with 농촌체험휴양마을 지정현황(전북) and 6 other fieldsHigh correlation
Unnamed: 3 is highly overall correlated with Unnamed: 5 and 3 other fieldsHigh correlation
Unnamed: 6 is highly overall correlated with Unnamed: 7 and 2 other fieldsHigh correlation
Unnamed: 2 is highly overall correlated with Unnamed: 7 and 2 other fieldsHigh correlation
Unnamed: 7 is highly imbalanced (83.6%)Imbalance
Unnamed: 8 is highly imbalanced (90.5%)Imbalance
Unnamed: 9 is highly imbalanced (67.6%)Imbalance
Unnamed: 1 has unique valuesUnique

Reproduction

Analysis started2024-03-14 02:19:33.426019
Analysis finished2024-03-14 02:19:34.179653
Duration0.75 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

농촌체험휴양마을 지정현황(전북)
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)17.1%
Missing0
Missing (%)0.0%
Memory size788.0 B
남원시
12 
진안군
10 
무주군
정읍시
김제시
Other values (9)
36 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique2 ?
Unique (%)2.4%

Sample

1st row시군명
2nd row전주시
3rd row익산시
4th row익산시
5th row익산시

Common Values

ValueCountFrequency (%)
남원시 12
14.6%
진안군 10
12.2%
무주군 9
11.0%
정읍시 8
9.8%
김제시 7
8.5%
완주군 7
8.5%
임실군 6
7.3%
부안군 6
7.3%
순창군 5
6.1%
익산시 4
 
4.9%
Other values (4) 8
9.8%

Length

2024-03-14T11:19:34.226694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
남원시 12
14.6%
진안군 10
12.2%
무주군 9
11.0%
정읍시 8
9.8%
김제시 7
8.5%
완주군 7
8.5%
임실군 6
7.3%
부안군 6
7.3%
순창군 5
6.1%
익산시 4
 
4.9%
Other values (4) 8
9.8%

Unnamed: 1
Text

UNIQUE 

Distinct82
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size788.0 B
2024-03-14T11:19:34.399667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length6.5243902
Min length3

Characters and Unicode

Total characters535
Distinct characters163
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique82 ?
Unique (%)100.0%

Sample

1st row마을명
2nd row학전마을
3rd row성당포구마을
4th row검지마을
5th row두동편백
ValueCountFrequency (%)
마을 2
 
2.3%
마을명 1
 
1.1%
용계(당그래)마을 1
 
1.1%
원촌마을 1
 
1.1%
치목삼베마을 1
 
1.1%
덕유산신선명품마을 1
 
1.1%
후도마을 1
 
1.1%
休무풍승지(철목)마을 1
 
1.1%
미항마을 1
 
1.1%
물숲(명천)마을 1
 
1.1%
Other values (76) 76
87.4%
2024-03-14T11:19:34.759546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
87
 
16.3%
87
 
16.3%
) 27
 
5.0%
( 27
 
5.0%
8
 
1.5%
7
 
1.3%
6
 
1.1%
6
 
1.1%
5
 
0.9%
5
 
0.9%
Other values (153) 270
50.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 476
89.0%
Close Punctuation 27
 
5.0%
Open Punctuation 27
 
5.0%
Space Separator 5
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
87
 
18.3%
87
 
18.3%
8
 
1.7%
7
 
1.5%
6
 
1.3%
6
 
1.3%
5
 
1.1%
5
 
1.1%
5
 
1.1%
5
 
1.1%
Other values (150) 255
53.6%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 475
88.8%
Common 59
 
11.0%
Han 1
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
87
 
18.3%
87
 
18.3%
8
 
1.7%
7
 
1.5%
6
 
1.3%
6
 
1.3%
5
 
1.1%
5
 
1.1%
5
 
1.1%
5
 
1.1%
Other values (149) 254
53.5%
Common
ValueCountFrequency (%)
) 27
45.8%
( 27
45.8%
5
 
8.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 475
88.8%
ASCII 59
 
11.0%
CJK 1
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
87
 
18.3%
87
 
18.3%
8
 
1.7%
7
 
1.5%
6
 
1.3%
6
 
1.3%
5
 
1.1%
5
 
1.1%
5
 
1.1%
5
 
1.1%
Other values (149) 254
53.5%
ASCII
ValueCountFrequency (%)
) 27
45.8%
( 27
45.8%
5
 
8.5%
CJK
ValueCountFrequency (%)
1
100.0%

Unnamed: 2
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)23.2%
Missing0
Missing (%)0.0%
Memory size788.0 B
녹색농촌
28 
전통테마
11 
녹색농촌종합개발
10 
지자체
10 
종합개발
Other values (14)
18 

Length

Max length12
Median length4
Mean length4.9512195
Min length3

Unique

Unique10 ?
Unique (%)12.2%

Sample

1st row조성유형
2nd row정보화
3rd row전통테마
4th row녹색농촌
5th row정보화

Common Values

ValueCountFrequency (%)
녹색농촌 28
34.1%
전통테마 11
 
13.4%
녹색농촌종합개발 10
 
12.2%
지자체 10
 
12.2%
종합개발 5
 
6.1%
산촌생태 2
 
2.4%
농촌종합 2
 
2.4%
정보화 2
 
2.4%
정보화녹색농촌 2
 
2.4%
녹색농촌정보화 1
 
1.2%
Other values (9) 9
 
11.0%

Length

2024-03-14T11:19:34.885700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
녹색농촌 28
34.1%
전통테마 11
 
13.4%
녹색농촌종합개발 10
 
12.2%
지자체 10
 
12.2%
종합개발 5
 
6.1%
산촌생태 2
 
2.4%
농촌종합 2
 
2.4%
정보화 2
 
2.4%
정보화녹색농촌 2
 
2.4%
조성유형 1
 
1.2%
Other values (9) 9
 
11.0%

Unnamed: 3
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)15.9%
Missing0
Missing (%)0.0%
Memory size788.0 B
2007년
14 
2008년
10 
2006년
2010년
2005년
Other values (8)
32 

Length

Max length5
Median length5
Mean length4.9878049
Min length4

Unique

Unique1 ?
Unique (%)1.2%

Sample

1st row조성년도
2nd row2005년
3rd row2006년
4th row2010년
5th row2008년

Common Values

ValueCountFrequency (%)
2007년 14
17.1%
2008년 10
12.2%
2006년 9
11.0%
2010년 9
11.0%
2005년 8
9.8%
2009년 8
9.8%
2011년 7
8.5%
2012년 5
 
6.1%
2004년 4
 
4.9%
2002년 3
 
3.7%
Other values (3) 5
 
6.1%

Length

2024-03-14T11:19:34.984142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2007년 14
17.1%
2008년 10
12.2%
2006년 9
11.0%
2010년 9
11.0%
2005년 8
9.8%
2009년 8
9.8%
2011년 7
8.5%
2012년 5
 
6.1%
2004년 4
 
4.9%
2002년 3
 
3.7%
Other values (3) 5
 
6.1%
Distinct81
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size788.0 B
2024-03-14T11:19:35.232560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length7
Mean length8.1219512
Min length2

Characters and Unicode

Total characters666
Distinct characters136
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique80 ?
Unique (%)97.6%

Sample

1st row주소
2nd row완산구 원당동
3rd row성당면 성당리
4th row삼기면 오룡리
5th row성당면 두동리
ValueCountFrequency (%)
산내면 4
 
2.2%
안성면 3
 
1.7%
운봉읍 3
 
1.7%
보안면 2
 
1.1%
쌍치면 2
 
1.1%
무풍면 2
 
1.1%
설천면 2
 
1.1%
구이면 2
 
1.1%
삼계면 2
 
1.1%
부귀면 2
 
1.1%
Other values (147) 156
86.7%
2024-03-14T11:19:35.651733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
105
 
15.8%
69
 
10.4%
61
 
9.2%
16
 
2.4%
2 15
 
2.3%
1 12
 
1.8%
12
 
1.8%
12
 
1.8%
12
 
1.8%
11
 
1.7%
Other values (126) 341
51.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 489
73.4%
Space Separator 105
 
15.8%
Decimal Number 63
 
9.5%
Dash Punctuation 9
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
69
 
14.1%
61
 
12.5%
16
 
3.3%
12
 
2.5%
12
 
2.5%
12
 
2.5%
11
 
2.2%
10
 
2.0%
9
 
1.8%
9
 
1.8%
Other values (115) 268
54.8%
Decimal Number
ValueCountFrequency (%)
2 15
23.8%
1 12
19.0%
4 8
12.7%
3 5
 
7.9%
0 5
 
7.9%
6 5
 
7.9%
8 5
 
7.9%
5 4
 
6.3%
7 4
 
6.3%
Space Separator
ValueCountFrequency (%)
105
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 489
73.4%
Common 177
 
26.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
69
 
14.1%
61
 
12.5%
16
 
3.3%
12
 
2.5%
12
 
2.5%
12
 
2.5%
11
 
2.2%
10
 
2.0%
9
 
1.8%
9
 
1.8%
Other values (115) 268
54.8%
Common
ValueCountFrequency (%)
105
59.3%
2 15
 
8.5%
1 12
 
6.8%
- 9
 
5.1%
4 8
 
4.5%
3 5
 
2.8%
0 5
 
2.8%
6 5
 
2.8%
8 5
 
2.8%
5 4
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 489
73.4%
ASCII 177
 
26.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
105
59.3%
2 15
 
8.5%
1 12
 
6.8%
- 9
 
5.1%
4 8
 
4.5%
3 5
 
2.8%
0 5
 
2.8%
6 5
 
2.8%
8 5
 
2.8%
5 4
 
2.3%
Hangul
ValueCountFrequency (%)
69
 
14.1%
61
 
12.5%
16
 
3.3%
12
 
2.5%
12
 
2.5%
12
 
2.5%
11
 
2.2%
10
 
2.0%
9
 
1.8%
9
 
1.8%
Other values (115) 268
54.8%

Unnamed: 5
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size788.0 B
2011년
34 
2012년
24 
2010년
11 
2013년
2014년
 
2
Other values (3)
 
3

Length

Max length6
Median length5
Mean length5
Min length4

Unique

Unique3 ?
Unique (%)3.7%

Sample

1st row지정연도
2nd row2011년
3rd row2009년
4th row2011년
5th row2011년

Common Values

ValueCountFrequency (%)
2011년 34
41.5%
2012년 24
29.3%
2010년 11
 
13.4%
2013년 8
 
9.8%
2014년 2
 
2.4%
지정연도 1
 
1.2%
2009년 1
 
1.2%
2013년 1
 
1.2%

Length

2024-03-14T11:19:35.789542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T11:19:35.900736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2011년 34
41.5%
2012년 24
29.3%
2010년 11
 
13.4%
2013년 9
 
11.0%
2014년 2
 
2.4%
지정연도 1
 
1.2%
2009년 1
 
1.2%

Unnamed: 6
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)15.9%
Missing0
Missing (%)0.0%
Memory size788.0 B
11월
16 
10월
11 
3월
10 
1월
2월
Other values (8)
30 

Length

Max length3
Median length2
Mean length2.4268293
Min length2

Unique

Unique1 ?
Unique (%)1.2%

Sample

1st row지정월
2nd row11월
3rd row9월
4th row11월
5th row11월

Common Values

ValueCountFrequency (%)
11월 16
19.5%
10월 11
13.4%
3월 10
12.2%
1월 8
9.8%
2월 7
8.5%
12월 7
8.5%
8월 4
 
4.9%
6월 4
 
4.9%
7월 4
 
4.9%
4월 4
 
4.9%
Other values (3) 7
8.5%

Length

2024-03-14T11:19:36.016917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
11월 16
19.5%
10월 11
13.4%
3월 10
12.2%
1월 8
9.8%
2월 7
8.5%
12월 7
8.5%
8월 4
 
4.9%
6월 4
 
4.9%
7월 4
 
4.9%
4월 4
 
4.9%
Other values (3) 7
8.5%

Unnamed: 7
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size788.0 B
79 
-
 
2
숙박
 
1

Length

Max length2
Median length1
Mean length1.0121951
Min length1

Unique

Unique1 ?
Unique (%)1.2%

Sample

1st row숙박
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
79
96.3%
- 2
 
2.4%
숙박 1
 
1.2%

Length

2024-03-14T11:19:36.133993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T11:19:36.215502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
79
96.3%
2
 
2.4%
숙박 1
 
1.2%

Unnamed: 8
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size788.0 B
81 
체험
 
1

Length

Max length2
Median length1
Mean length1.0121951
Min length1

Unique

Unique1 ?
Unique (%)1.2%

Sample

1st row체험
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
81
98.8%
체험 1
 
1.2%

Length

2024-03-14T11:19:36.297246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T11:19:36.377686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
81
98.8%
체험 1
 
1.2%

Unnamed: 9
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size788.0 B
74 
-
 
7
음식
 
1

Length

Max length2
Median length1
Mean length1.0121951
Min length1

Unique

Unique1 ?
Unique (%)1.2%

Sample

1st row음식
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
74
90.2%
- 7
 
8.5%
음식 1
 
1.2%

Length

2024-03-14T11:19:36.479076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T11:19:36.576432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
74
90.2%
7
 
8.5%
음식 1
 
1.2%

Correlations

2024-03-14T11:19:36.638502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
농촌체험휴양마을 지정현황(전북)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
농촌체험휴양마을 지정현황(전북)1.0001.0000.6720.5901.0000.7270.6270.8221.0000.897
Unnamed: 11.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
Unnamed: 20.6721.0001.0000.6110.9940.8150.7760.8111.0000.884
Unnamed: 30.5901.0000.6111.0000.9790.7950.8260.8291.0000.842
Unnamed: 41.0001.0000.9940.9791.0000.9880.9680.0001.0001.000
Unnamed: 50.7271.0000.8150.7950.9881.0000.7700.7811.0000.778
Unnamed: 60.6271.0000.7760.8260.9680.7701.0000.8171.0000.857
Unnamed: 70.8221.0000.8110.8290.0000.7810.8171.0001.0000.940
Unnamed: 81.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
Unnamed: 90.8971.0000.8840.8421.0000.7780.8570.9401.0001.000
2024-03-14T11:19:36.768018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 7Unnamed: 5Unnamed: 8Unnamed: 2농촌체험휴양마을 지정현황(전북)Unnamed: 9Unnamed: 3Unnamed: 6
Unnamed: 71.0000.6740.9940.5670.6300.6990.6530.636
Unnamed: 50.6741.0000.9620.4770.4090.6700.5030.470
Unnamed: 80.9940.9621.0000.8870.9220.9940.9290.929
Unnamed: 20.5670.4770.8871.0000.2760.6680.2390.374
농촌체험휴양마을 지정현황(전북)0.6300.4090.9220.2761.0000.7440.2520.277
Unnamed: 90.6990.6700.9940.6680.7441.0000.6710.694
Unnamed: 30.6530.5030.9290.2390.2520.6711.0000.361
Unnamed: 60.6360.4700.9290.3740.2770.6940.3611.000
2024-03-14T11:19:36.890070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
농촌체험휴양마을 지정현황(전북)Unnamed: 2Unnamed: 3Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
농촌체험휴양마을 지정현황(전북)1.0000.2760.2520.4090.2770.6300.9220.744
Unnamed: 20.2761.0000.2390.4770.3740.5670.8870.668
Unnamed: 30.2520.2391.0000.5030.3610.6530.9290.671
Unnamed: 50.4090.4770.5031.0000.4700.6740.9620.670
Unnamed: 60.2770.3740.3610.4701.0000.6360.9290.694
Unnamed: 70.6300.5670.6530.6740.6361.0000.9940.699
Unnamed: 80.9220.8870.9290.9620.9290.9941.0000.994
Unnamed: 90.7440.6680.6710.6700.6940.6990.9941.000

Missing values

2024-03-14T11:19:34.021039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T11:19:34.134410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

농촌체험휴양마을 지정현황(전북)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
0시군명마을명조성유형조성년도주소지정연도지정월숙박체험음식
1전주시학전마을정보화2005년완산구 원당동2011년11월
2익산시성당포구마을전통테마2006년성당면 성당리2009년9월
3익산시검지마을녹색농촌2010년삼기면 오룡리2011년11월
4익산시두동편백정보화2008년성당면 두동리2011년11월
5익산시산들강웅포마을(고창)녹색농촌종합개발2007년웅포면 강변로 2842012년3월
6정읍시천단마을녹색농촌종합개발2007년신태인읍 백산리2010년11월
7정읍시공동마을녹색농촌2005년산외면 오공리2010년11월
8정읍시사교마을(달고운마을)녹색농촌2008년산내면 두월리2011년2월
9정읍시신기마을(십장생마을)녹색농촌2005년산내면 능교리2011년2월
농촌체험휴양마을 지정현황(전북)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9
72순창군도라지마을건강장수종합개발2005년팔덕면 평창길 42013년9월
73순창군종곡마을종합개발2013년쌍치면 순정로 12532014년1월
74고창군고색창연마을전통테마2007년신림면 가평리 651-42011년12월
75고창군고산돌맹(상금)마을녹색농촌2011년대산면 상금리 28-112012년11월
76부안군우동우리밀마을녹색농촌종합개발2007년보안면 우동리2011년3월
77부안군사랑감(용사만회)마을녹색농촌2007년보안면 남포리2011년9월
78부안군계화도(계상)마을녹색농촌종합개발2006년계화면 계상길2012년10월
79부안군각동마을녹색농촌2010년줄포면 선돌로2012년10월
80부안군운호(구름호수)마을녹색농촌2007년진서면 운호길2012년10월
81부안군후촌 갈대숲 마을녹색농촌2007년줄포면 후촌길 672012년12월