Overview

Dataset statistics

Number of variables: 4
Number of observations: 256
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.1 KiB
Average record size in memory: 32.5 B

Variable types

Categorical: 2
Text: 2

Dataset

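Description: Status of certified rural convergence industry (sixth-industry) business entities in Chungcheongnam-do, including business name, industry type, and manufactured products.
Author: Chungcheongnam-do (충청남도)
URL: https://www.data.go.kr/data/15040666/fileData.do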

Alerts

업체명 (business name) has unique values [Unique]

Reproduction

Analysis started: 2024-03-14 20:54:32.281473
Analysis finished: 2024-03-14 20:54:33.253362
Duration: 0.97 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

지역 (region)
Categorical

Distinct: 30
Distinct (%): 11.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Value | Count
논산시 | 26
공주시 | 19
청양군 | 17
예산군 | 17
서산시 | 17
Other values (25) | 160

Length

Max length: 6
Median length: 5
Mean length: 4.6445312
Min length: 3

Unique

Unique: 2
Unique (%): 0.8%
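
The report lists "Distinct" and "Unique" separately. As a minimal sketch, assuming "distinct" means the number of different values and "unique" means the number of values that occur exactly once, the two counts could be computed as follows. The function name and the sample list are illustrative and not taken from the dataset.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique) for a list of values.

    distinct: how many different values appear at all.
    unique:   how many values appear exactly once (assumed definition).
    """
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Illustrative input only: one region repeated, two appearing once.
values = ["논산시", "논산시", "공주시", "홍성군"]
distinct, unique = distinct_and_unique(values)
print(distinct, unique)
```

Under these definitions a column can have many distinct values and very few unique ones, which is the pattern reported for 지역 (30 distinct, 2 unique).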

Sample

1st row 홍성군
2nd row 당진시
3rd row 예산군
4th row 예산군
5th row 천안시

Common Values

Value | Count | Frequency (%)
논산시 | 26 | 10.2%
공주시 | 19 | 7.4%
청양군 | 17 | 6.6%
예산군 | 17 | 6.6%
서산시 | 17 | 6.6%
아산시 | 15 | 5.9%
서천군 | 15 | 5.9%
금산군 | 15 | 5.9%
부여군 | 15 | 5.9%
천안시 | 13 | 5.1%
Other values (20) | 87 | 34.0%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
논산시 | 31 | 12.1%
공주시 | 23 | 9.0%
예산군 | 20 | 7.8%
부여군 | 20 | 7.8%
청양군 | 19 | 7.4%
서산시 | 19 | 7.4%
아산시 | 18 | 7.0%
서천군 | 18 | 7.0%
금산군 | 18 | 7.0%
천안시 | 16 | 6.2%
Other values (5) | 54 | 21.1%

업체명 (business name)
Text

UNIQUE 

Distinct: 256
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 23
Median length: 17
Mean length: 12.125
Min length: 2

Characters and Unicode

Total characters: 3104
Distinct characters: 344
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
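
As a rough illustration of how such per-character statistics can be derived, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own implementation. The helper name is hypothetical, and the two input strings are taken from the sample rows of this variable.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories (e.g. 'Lo', 'Zs') over all characters."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Two business names from the report's sample rows.
sample = ["(농)마동이㈜", "Sun-Love 치즈"]
counts = category_counts(sample)
for category, n in counts.most_common():
    print(category, n)
```

The two-letter codes map onto the category names in the report: `Lo` is Other Letter, `Zs` is Space Separator, `So` is Other Symbol, `Ps` and `Pe` are Open and Close Punctuation, and `Pd` is Dash Punctuation.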

Unique

Unique: 256
Unique (%): 100.0%

Sample

1st row (농)마동이㈜
2nd row 백석올미(영)
3rd row (농)예산사과와인㈜
4th row (농.주)태신목장
5th row Sun-Love 치즈

Value | Count | Frequency (%)
농업회사법인 | 78 | 17.4%
주식회사 | 69 | 15.4%
영농조합법인 | 11 | 2.4%
(not preserved) | 9 | 2.0%
농업회사법인㈜ | 2 | 0.4%
유한회사 | 2 | 0.4%
한국흑홍삼 | 2 | 0.4%
베릴리 | 1 | 0.2%
berily | 1 | 0.2%
than | 1 | 0.2%
Other values (273) | 273 | 60.8%

Most occurring characters

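Value | Count | Frequency (%)
(space) | 543 | 17.5%
(not preserved) | 182 | 5.9%
(not preserved) | 180 | 5.8%
(not preserved) | 176 | 5.7%
(not preserved) | 149 | 4.8%
(not preserved) | 136 | 4.4%
(not preserved) | 104 | 3.4%
(not preserved) | 96 | 3.1%
(not preserved) | 93 | 3.0%
(not preserved) | 61 | 2.0%
Other values (334) | 1384 | 44.6%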

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2453 | 79.0%
Space Separator | 543 | 17.5%
Other Symbol | 27 | 0.9%
Open Punctuation | 22 | 0.7%
Close Punctuation | 22 | 0.7%
Uppercase Letter | 20 | 0.6%
Lowercase Letter | 11 | 0.4%
Other Punctuation | 3 | 0.1%
Decimal Number | 2 | 0.1%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 182 | 7.4%
(not preserved) | 180 | 7.3%
(not preserved) | 176 | 7.2%
(not preserved) | 149 | 6.1%
(not preserved) | 136 | 5.5%
(not preserved) | 104 | 4.2%
(not preserved) | 96 | 3.9%
(not preserved) | 93 | 3.8%
(not preserved) | 61 | 2.5%
(not preserved) | 51 | 2.1%
Other values (303) | 1225 | 49.9%

Uppercase Letter
Value | Count | Frequency (%)
A | 4 | 20.0%
T | 3 | 15.0%
E | 2 | 10.0%
V | 2 | 10.0%
H | 2 | 10.0%
L | 1 | 5.0%
S | 1 | 5.0%
N | 1 | 5.0%
R | 1 | 5.0%
B | 1 | 5.0%
Other values (2) | 2 | 10.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 2 | 18.2%
n | 1 | 9.1%
o | 1 | 9.1%
r | 1 | 9.1%
i | 1 | 9.1%
l | 1 | 9.1%
y | 1 | 9.1%
v | 1 | 9.1%
u | 1 | 9.1%
b | 1 | 9.1%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 33.3%
. | 1 | 33.3%
· | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 543 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not preserved) | 27 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 22 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 22 | 100.0%

Decimal Number
Value | Count | Frequency (%)
8 | 2 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2480 | 79.9%
Common | 593 | 19.1%
Latin | 31 | 1.0%

Most frequent character per script

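Hangul
Value | Count | Frequency (%)
(not preserved) | 182 | 7.3%
(not preserved) | 180 | 7.3%
(not preserved) | 176 | 7.1%
(not preserved) | 149 | 6.0%
(not preserved) | 136 | 5.5%
(not preserved) | 104 | 4.2%
(not preserved) | 96 | 3.9%
(not preserved) | 93 | 3.8%
(not preserved) | 61 | 2.5%
(not preserved) | 51 | 2.1%
Other values (304) | 1252 | 50.5%

Latin
Value | Count | Frequency (%)
A | 4 | 12.9%
T | 3 | 9.7%
E | 2 | 6.5%
V | 2 | 6.5%
H | 2 | 6.5%
e | 2 | 6.5%
n | 1 | 3.2%
L | 1 | 3.2%
o | 1 | 3.2%
r | 1 | 3.2%
Other values (12) | 12 | 38.7%

Common
Value | Count | Frequency (%)
(space) | 543 | 91.6%
( | 22 | 3.7%
) | 22 | 3.7%
8 | 2 | 0.3%
- | 1 | 0.2%
, | 1 | 0.2%
. | 1 | 0.2%
· | 1 | 0.2%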

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2453 | 79.0%
ASCII | 623 | 20.1%
None | 28 | 0.9%

Most frequent character per block

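ASCII
Value | Count | Frequency (%)
(space) | 543 | 87.2%
( | 22 | 3.5%
) | 22 | 3.5%
A | 4 | 0.6%
T | 3 | 0.5%
E | 2 | 0.3%
V | 2 | 0.3%
H | 2 | 0.3%
8 | 2 | 0.3%
e | 2 | 0.3%
Other values (19) | 19 | 3.0%

Hangul
Value | Count | Frequency (%)
(not preserved) | 182 | 7.4%
(not preserved) | 180 | 7.3%
(not preserved) | 176 | 7.2%
(not preserved) | 149 | 6.1%
(not preserved) | 136 | 5.5%
(not preserved) | 104 | 4.2%
(not preserved) | 96 | 3.9%
(not preserved) | 93 | 3.8%
(not preserved) | 61 | 2.5%
(not preserved) | 51 | 2.1%
Other values (303) | 1225 | 49.9%

None
Value | Count | Frequency (%)
(not preserved) | 27 | 96.4%
· | 1 | 3.6%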

업태 (business type)
Text

Distinct: 191
Distinct (%): 74.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 126
Median length: 64
Mean length: 14.605469
Min length: 2

Characters and Unicode

Total characters: 3739
Distinct characters: 219
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 180
Unique (%): 70.3%

Sample

1st row 제조업, 농업, 도소매
2nd row 도,소매,제조
3rd row 제조업
4th row 축산업 육우사육
5th row 유제품

Value | Count | Frequency (%)
제조업 | 97 | 18.0%
(not preserved) | 31 | 5.8%
농업 | 28 | 5.2%
소매업 | 23 | 4.3%
제조 | 21 | 3.9%
도소매 | 19 | 3.5%
도매 | 17 | 3.2%
서비스 | 10 | 1.9%
도소매업 | 9 | 1.7%
서비스업 | 8 | 1.5%
Other values (234) | 275 | 51.1%

Most occurring characters

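Value | Count | Frequency (%)
(space) | 605 | 16.2%
(not preserved) | 338 | 9.0%
, | 268 | 7.2%
(not preserved) | 224 | 6.0%
(not preserved) | 216 | 5.8%
(not preserved) | 181 | 4.8%
(not preserved) | 133 | 3.6%
(not preserved) | 115 | 3.1%
(not preserved) | 115 | 3.1%
/ | 107 | 2.9%
Other values (209) | 1437 | 38.4%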

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2672 | 71.5%
Space Separator | 605 | 16.2%
Other Punctuation | 393 | 10.5%
Open Punctuation | 34 | 0.9%
Close Punctuation | 34 | 0.9%
Lowercase Letter | 1 | < 0.1%

Most frequent character per category

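Other Letter
Value | Count | Frequency (%)
(not preserved) | 338 | 12.6%
(not preserved) | 224 | 8.4%
(not preserved) | 216 | 8.1%
(not preserved) | 181 | 6.8%
(not preserved) | 133 | 5.0%
(not preserved) | 115 | 4.3%
(not preserved) | 115 | 4.3%
(not preserved) | 63 | 2.4%
(not preserved) | 61 | 2.3%
(not preserved) | 55 | 2.1%
Other values (200) | 1171 | 43.8%

Other Punctuation
Value | Count | Frequency (%)
, | 268 | 68.2%
/ | 107 | 27.2%
. | 14 | 3.6%
: | 3 | 0.8%
· | 1 | 0.3%

Space Separator
Value | Count | Frequency (%)
(space) | 605 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 34 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 34 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
o | 1 | 100.0%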

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2672 | 71.5%
Common | 1066 | 28.5%
Latin | 1 | < 0.1%

Most frequent character per script

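Hangul
Value | Count | Frequency (%)
(not preserved) | 338 | 12.6%
(not preserved) | 224 | 8.4%
(not preserved) | 216 | 8.1%
(not preserved) | 181 | 6.8%
(not preserved) | 133 | 5.0%
(not preserved) | 115 | 4.3%
(not preserved) | 115 | 4.3%
(not preserved) | 63 | 2.4%
(not preserved) | 61 | 2.3%
(not preserved) | 55 | 2.1%
Other values (200) | 1171 | 43.8%

Common
Value | Count | Frequency (%)
(space) | 605 | 56.8%
, | 268 | 25.1%
/ | 107 | 10.0%
( | 34 | 3.2%
) | 34 | 3.2%
. | 14 | 1.3%
: | 3 | 0.3%
· | 1 | 0.1%

Latin
Value | Count | Frequency (%)
o | 1 | 100.0%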

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2672 | 71.5%
ASCII | 1066 | 28.5%
None | 1 | < 0.1%

Most frequent character per block

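ASCII
Value | Count | Frequency (%)
(space) | 605 | 56.8%
, | 268 | 25.1%
/ | 107 | 10.0%
( | 34 | 3.2%
) | 34 | 3.2%
. | 14 | 1.3%
: | 3 | 0.3%
o | 1 | 0.1%

Hangul
Value | Count | Frequency (%)
(not preserved) | 338 | 12.6%
(not preserved) | 224 | 8.4%
(not preserved) | 216 | 8.1%
(not preserved) | 181 | 6.8%
(not preserved) | 133 | 5.0%
(not preserved) | 115 | 4.3%
(not preserved) | 115 | 4.3%
(not preserved) | 63 | 2.4%
(not preserved) | 61 | 2.3%
(not preserved) | 55 | 2.1%
Other values (200) | 1171 | 43.8%

None
Value | Count | Frequency (%)
· | 1 | 100.0%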

농촌융복합산업 유형 (rural convergence industry type)
Categorical

Distinct: 12
Distinct (%): 4.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Value | Count
1*2*3형 | 126
1*2*3형 | 54
1*2형 | 25
1*2형 | 23
1*3형 | 9
Other values (7) | 19

Length

Max length: 10
Median length: 8
Mean length: 6.7070312
Min length: 3

Unique

Unique: 5
Unique (%): 2.0%

Sample

1st row 1*2*3형
2nd row 1*2*3형
3rd row 1*2*3형
4th row 1*2*3형
5th row 1*2*3형

Common Values

Value | Count | Frequency (%)
1*2*3형 | 126 | 49.2%
1*2*3형 | 54 | 21.1%
1*2형 | 25 | 9.8%
1*2형 | 23 | 9.0%
1*3형 | 9 | 3.5%
1*3형 | 8 | 3.1%
1*2*3 | 6 | 2.3%
1.,2.,3차 | 1 | 0.4%
1*2차 | 1 | 0.4%
1*2*3차형 | 1 | 0.4%
Other values (2) | 2 | 0.8%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
1*2*3형 | 180 | 70.3%
1*2형 | 48 | 18.8%
1*3형 | 17 | 6.6%
1*2*3 | 6 | 2.3%
1.,2.,3차 | 1 | 0.4%
1*2차 | 1 | 0.4%
1*2*3차형 | 1 | 0.4%
1*3 | 1 | 0.4%
1*2 | 1 | 0.4%
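
The Common Values table lists labels such as 1*2*3형 more than once (126 and 54), while the table above merges them into a single entry (180). One plausible explanation, stated here as an assumption rather than something the report confirms, is that the raw values differ only in whitespace. A minimal sketch of normalizing such labels before counting follows. The raw list is made up for illustration and `normalize_label` is a hypothetical helper.

```python
import re
from collections import Counter


def normalize_label(label):
    """Remove all whitespace so visually identical labels compare equal."""
    return re.sub(r"\s+", "", label)


# Hypothetical raw values: same labels with stray spaces or tabs.
raw = ["1*2*3형", "1*2*3형 ", " 1*2형", "1*2형", "1*3형\t"]

before = Counter(raw)
after = Counter(normalize_label(value) for value in raw)
print(len(before), "distinct labels before,", len(after), "after")
```

If the differences turn out to be something other than whitespace (for example a full-width asterisk), the normalization rule would need to be adjusted accordingly.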

Correlations

 | 지역 | 농촌융복합산업 유형
지역 | 1.000 | 0.569
농촌융복합산업 유형 | 0.569 | 1.000

 | 농촌융복합산업 유형 | 지역
농촌융복합산업 유형 | 1.000 | 0.208
지역 | 0.208 | 1.000

 | 지역 | 농촌융복합산업 유형
지역 | 1.000 | 0.208
농촌융복합산업 유형 | 0.208 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
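
The per-column nullity counts behind such a chart can be sketched in a few lines. This is an illustration only: the helper name is hypothetical, treating `None` and empty strings as missing is an assumption, and the third row is invented to show a non-zero count (the report itself records no missing cells).

```python
def missing_per_column(rows, columns):
    """Return {column: number of rows whose cell is None or an empty string}."""
    return {
        col: sum(1 for row in rows if row.get(col) in (None, ""))
        for col in columns
    }


columns = ["지역", "업체명"]
rows = [
    {"지역": "홍성군", "업체명": "(농)마동이㈜"},
    {"지역": "당진시", "업체명": "백석올미(영)"},
    {"지역": None, "업체명": "hypothetical"},  # invented incomplete row
]
missing = missing_per_column(rows, columns)
print(missing)
```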

Sample

(index) | 지역 | 업체명 | 업태 | 농촌융복합산업 유형
0 | 홍성군 | (농)마동이㈜ | 제조업, 농업, 도소매 | 1*2*3형
1 | 당진시 | 백석올미(영) | 도,소매,제조 | 1*2*3형
2 | 예산군 | (농)예산사과와인㈜ | 제조업 | 1*2*3형
3 | 예산군 | (농.주)태신목장 | 축산업 육우사육 | 1*2*3형
4 | 천안시 | Sun-Love 치즈 | 유제품 | 1*2*3형
5 | 서천군 | 해가마을(영) | 제조업 (o) | 1*2*3형
6 | 공주시 | 농가애 주식회사 농업회사법인 | 제조업 | 1*2형
7 | 공주시 | (농)미마지㈜ | 농업 | 1*2*3형
8 | 논산시 | 궁골식품(영) | 제조업 | 1*2*3형
9 | 논산시 | 수림원 농업회사법인 주식회사 | 젓갈류,장류,절임식품,농산물,수산물,농수산물 연구개발,체험 및 교육농장 | 1*2*3형

(index) | 지역 | 업체명 | 업태 | 농촌융복합산업 유형
246 | 서천군 | 강산소곡주 | 제조업 | 1*2*3형
247 | 보령시 | 농업회사법인 우유창고 주식회사 | 휴게음식점, 카페 | 1*2*3형
248 | 아산시 | ㈜이미선텍스타일아트 | 섬유제품 염색 및 임가공 | 1*2*3형
249 | 논산시 | 성동식품 | 제조업 | 1*2형
250 | 금산군 | 금산88홍삼 | 제조업 | 1*2형
251 | 금산군 | 농업회사법인 ㈜ 한국흑홍삼 | 제조업 | 1*2*3형
252 | 금산군 | 농업회사법인 ㈜ 순하늘홍삼 | 제조업 | 1*2*3형
253 | 금산군 | 손끝으로만드는세상 | 제과점업, 과자, 빵류, 인삼빵 | 1*2*3형
254 | 예산군 | 엘캄포 | 농업, 소매업 | 1*2형
255 | 태안군 | 더맘유가공연구소 | 제조업(축산물가공업) | 1*2*3형