Overview

Dataset statistics

Number of variables6
Number of observations72
Missing cells70
Missing cells (%)16.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.6 KiB
Average record size in memory50.8 B

Variable types

Categorical3
Text1
Boolean1
Numeric1

Dataset

Description시군별 6차산업 인증을 받은 도내 농촌교육농장 현황으로 지역, 업체명, 유형, 매출액 비증, 고용임금 등의 정보를 제공합니다.
Author충청남도
URLhttps://www.data.go.kr/data/15040663/fileData.do

Alerts

유형 has constant value ""Constant
6차산업인증 has constant value ""Constant
고용임금(백만원) is highly overall correlated with 매출액 비중High correlation
매출액 비중 is highly overall correlated with 고용임금(백만원)High correlation
6차산업인증 has 70 (97.2%) missing valuesMissing
업체명 has unique valuesUnique
고용임금(백만원) has 53 (73.6%) zerosZeros

Reproduction

Analysis started2023-12-12 16:43:48.989369
Analysis finished2023-12-12 16:43:49.481792
Duration0.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역
Categorical

Distinct14
Distinct (%)19.4%
Missing0
Missing (%)0.0%
Memory size708.0 B
공주시
예산군
태안군
보령시
논산시
Other values (9)
39 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique1 ?
Unique (%)1.4%

Sample

1st row천안시
2nd row천안시
3rd row천안시
4th row천안시
5th row천안시

Common Values

ValueCountFrequency (%)
공주시 7
9.7%
예산군 7
9.7%
태안군 7
9.7%
보령시 6
8.3%
논산시 6
8.3%
청양군 6
8.3%
천안시 5
 
6.9%
서산시 5
 
6.9%
부여군 5
 
6.9%
서천군 5
 
6.9%
Other values (4) 13
18.1%

Length

2023-12-13T01:43:49.562119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
공주시 7
9.7%
예산군 7
9.7%
태안군 7
9.7%
보령시 6
8.3%
논산시 6
8.3%
청양군 6
8.3%
천안시 5
 
6.9%
서산시 5
 
6.9%
부여군 5
 
6.9%
서천군 5
 
6.9%
Other values (4) 13
18.1%

업체명
Text

UNIQUE 

Distinct72
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size708.0 B
2023-12-13T01:43:49.847819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8.5
Mean length5.2083333
Min length3

Characters and Unicode

Total characters375
Distinct characters190
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)100.0%

Sample

1st row아빠사랑팜
2nd row자연누리성
3rd row진제자연농원
4th row봉황농장
5th row썬러브치즈
ValueCountFrequency (%)
아빠사랑팜 1
 
1.3%
갯벌도예체험장 1
 
1.3%
세아유농장 1
 
1.3%
설레임농장 1
 
1.3%
칠갑산그린헬스 1
 
1.3%
계봉농원 1
 
1.3%
혜선식품 1
 
1.3%
혜지원 1
 
1.3%
칠갑산무지개 1
 
1.3%
리꼬베리농장 1
 
1.3%
Other values (66) 66
86.8%
2023-12-13T01:43:50.268853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
9.1%
25
 
6.7%
18
 
4.8%
11
 
2.9%
9
 
2.4%
6
 
1.6%
5
 
1.3%
5
 
1.3%
4
 
1.1%
4
 
1.1%
Other values (180) 254
67.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 367
97.9%
Space Separator 4
 
1.1%
Open Punctuation 2
 
0.5%
Close Punctuation 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
9.3%
25
 
6.8%
18
 
4.9%
11
 
3.0%
9
 
2.5%
6
 
1.6%
5
 
1.4%
5
 
1.4%
4
 
1.1%
3
 
0.8%
Other values (177) 247
67.3%
Space Separator
ValueCountFrequency (%)
4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 367
97.9%
Common 8
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
9.3%
25
 
6.8%
18
 
4.9%
11
 
3.0%
9
 
2.5%
6
 
1.6%
5
 
1.4%
5
 
1.4%
4
 
1.1%
3
 
0.8%
Other values (177) 247
67.3%
Common
ValueCountFrequency (%)
4
50.0%
( 2
25.0%
) 2
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 367
97.9%
ASCII 8
 
2.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
34
 
9.3%
25
 
6.8%
18
 
4.9%
11
 
3.0%
9
 
2.5%
6
 
1.6%
5
 
1.4%
5
 
1.4%
4
 
1.1%
3
 
0.8%
Other values (177) 247
67.3%
ASCII
ValueCountFrequency (%)
4
50.0%
( 2
25.0%
) 2
25.0%

유형
Categorical

CONSTANT 

Distinct1
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size708.0 B
농촌교육농장
72 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row농촌교육농장
2nd row농촌교육농장
3rd row농촌교육농장
4th row농촌교육농장
5th row농촌교육농장

Common Values

ValueCountFrequency (%)
농촌교육농장 72
100.0%

Length

2023-12-13T01:43:50.410160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:43:50.514244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
농촌교육농장 72
100.0%

6차산업인증
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)50.0%
Missing70
Missing (%)97.2%
Memory size276.0 B
True
 
2
(Missing)
70 
ValueCountFrequency (%)
True 2
 
2.8%
(Missing) 70
97.2%
2023-12-13T01:43:50.595204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

매출액 비중
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)30.6%
Missing0
Missing (%)0.0%
Memory size708.0 B
0.0%/0.0%/100.0%
42 
0.0%/100.0%/0.0%
 
4
0.0%/0.0%/100%
 
4
20.0%/0.0%/80.0%
 
2
70.0%/0.0%/30.0%
 
2
Other values (17)
18 

Length

Max length17
Median length16
Mean length15.888889
Min length14

Unique

Unique16 ?
Unique (%)22.2%

Sample

1st row0.0%/0.0%/100%
2nd row0.0%/0.0%/100.0%
3rd row20.0%/0.0%/80.0%
4th row81.0%/0.0%/19.0%
5th row0.0%/0.0%/100.0%

Common Values

ValueCountFrequency (%)
0.0%/0.0%/100.0% 42
58.3%
0.0%/100.0%/0.0% 4
 
5.6%
0.0%/0.0%/100% 4
 
5.6%
20.0%/0.0%/80.0% 2
 
2.8%
70.0%/0.0%/30.0% 2
 
2.8%
60.0%/0.0%/40.0% 2
 
2.8%
47.0%/0.0%/53.0% 1
 
1.4%
81.0%/0.0%/19.0% 1
 
1.4%
10.0%/80.0%/10.0% 1
 
1.4%
10.0%/20.0%/70.0% 1
 
1.4%
Other values (12) 12
 
16.7%

Length

2023-12-13T01:43:50.714662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
0.0%/0.0%/100.0 42
58.3%
0.0%/0.0%/100 4
 
5.6%
0.0%/100.0%/0.0 4
 
5.6%
20.0%/0.0%/80.0 2
 
2.8%
70.0%/0.0%/30.0 2
 
2.8%
60.0%/0.0%/40.0 2
 
2.8%
53.0%/0.0%/47.0 1
 
1.4%
25.0%/0.0%/75.0 1
 
1.4%
30.0%/0.0%/70.0 1
 
1.4%
85.0%/0.0%/15.0 1
 
1.4%
Other values (12) 12
 
16.7%

고용임금(백만원)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct15
Distinct (%)20.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.1111111
Minimum0
Maximum56
Zeros53
Zeros (%)73.6%
Negative0
Negative (%)0.0%
Memory size780.0 B
2023-12-13T01:43:50.863815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile15
Maximum56
Range56
Interquartile range (IQR)1

Descriptive statistics

Standard deviation8.9604853
Coefficient of variation (CV)2.880156
Kurtosis20.567715
Mean3.1111111
Median Absolute Deviation (MAD)0
Skewness4.277088
Sum224
Variance80.290297
MonotonicityNot monotonic
2023-12-13T01:43:50.970933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
0.0 53
73.6%
4.0 3
 
4.2%
15.0 2
 
2.8%
2.0 2
 
2.8%
1.0 2
 
2.8%
2.5 1
 
1.4%
12.0 1
 
1.4%
56.0 1
 
1.4%
12.5 1
 
1.4%
10.0 1
 
1.4%
Other values (5) 5
 
6.9%
ValueCountFrequency (%)
0.0 53
73.6%
1.0 2
 
2.8%
2.0 2
 
2.8%
2.5 1
 
1.4%
3.0 1
 
1.4%
4.0 3
 
4.2%
7.0 1
 
1.4%
8.0 1
 
1.4%
10.0 1
 
1.4%
12.0 1
 
1.4%
ValueCountFrequency (%)
56.0 1
 
1.4%
40.0 1
 
1.4%
25.0 1
 
1.4%
15.0 2
2.8%
12.5 1
 
1.4%
12.0 1
 
1.4%
10.0 1
 
1.4%
8.0 1
 
1.4%
7.0 1
 
1.4%
4.0 3
4.2%

Interactions

2023-12-13T01:43:49.220736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:43:51.064531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역업체명매출액 비중고용임금(백만원)
지역1.0001.0000.0880.000
업체명1.0001.0001.0001.000
매출액 비중0.0881.0001.0000.922
고용임금(백만원)0.0001.0000.9221.000
2023-12-13T01:43:51.161813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
매출액 비중지역
매출액 비중1.0000.000
지역0.0001.000
2023-12-13T01:43:51.237179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
고용임금(백만원)지역매출액 비중
고용임금(백만원)1.0000.0000.644
지역0.0001.0000.000
매출액 비중0.6440.0001.000

Missing values

2023-12-13T01:43:49.329041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:43:49.440409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지역업체명유형6차산업인증매출액 비중고용임금(백만원)
0천안시아빠사랑팜농촌교육농장<NA>0.0%/0.0%/100%0.0
1천안시자연누리성농촌교육농장<NA>0.0%/0.0%/100.0%15.0
2천안시진제자연농원농촌교육농장<NA>20.0%/0.0%/80.0%2.5
3천안시봉황농장농촌교육농장<NA>81.0%/0.0%/19.0%0.0
4천안시썬러브치즈농촌교육농장<NA>0.0%/0.0%/100.0%0.0
5공주시엔젤농장농촌교육농장<NA>70.0%/0.0%/30.0%12.0
6공주시상보안농원농촌교육농장<NA>10.0%/80.0%/10.0%56.0
7공주시이삭농원농촌교육농장<NA>10.0%/20.0%/70.0%0.0
8공주시풀향기(시비)농촌교육농장<NA>80.0%/0.0%/20.0%0.0
9공주시아이러브벅스농촌교육농장<NA>0.0%/0.0%/100%0.0
지역업체명유형6차산업인증매출액 비중고용임금(백만원)
62예산군아람농장농촌교육농장<NA>0.0%/100.0%/0.0%0.0
63예산군게으름뱅이농장농촌교육농장<NA>90.0%/0.0%/10.0%8.0
64예산군전통예산옹기농촌교육농장<NA>0.0%/0.0%/100.0%0.0
65태안군다솜농원농촌교육농장<NA>60.0%/0.0%/40.0%3.0
66태안군상옥농장농촌교육농장<NA>0.0%/0.0%/100.0%1.0
67태안군산들바농장농촌교육농장<NA>0.0%/0.0%/100.0%0.0
68태안군놀샘터농촌교육농장<NA>0.0%/0.0%/100.0%0.0
69태안군연휴일농촌교육농장<NA>0.0%/0.0%/100.0%0.0
70태안군뜨락애농촌교육농장<NA>0.0%/0.0%/100.0%0.0
71태안군나오리 생태예술원농촌교육농장<NA>0.0%/0.0%/100.0%0.0