Overview

Dataset statistics

Number of variables: 4
Number of observations: 254
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.1 KiB
Average record size in memory: 32.5 B

Variable types

Categorical: 2
Text: 2

Dataset

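Description: Data on the status of certified rural convergence industry (농촌융복합산업) businesses in Chungcheongnam-do, including business name, industry type, and manufactured items for these sixth-industry (6차산업) businesses.
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=391&beforeMenuCd=DOM_000000201001001000&publicdatapk=15040666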

Alerts

Column2 has unique values (Unique)

Reproduction

Analysis started: 2024-01-09 20:37:46.111618
Analysis finished: 2024-01-09 20:37:46.522942
Duration: 0.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

Column1
Categorical

Distinct: 31
Distinct (%): 12.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Value | Count
논산시 | 25
공주시 | 19
서산시 | 18
청양군 | 17
예산군 | 16
Other values (26) | 159

Length

Max length: 6
Median length: 5
Mean length: 4.6299213
Min length: 2

Unique

Unique: 3
Unique (%): 1.2%

Sample

1st row: 지역
2nd row: 홍성군
3rd row: 당진시
4th row: 예산군
5th row: 예산군
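
The first sample row (`지역`, "region") looks like the source file's header line rather than a region name, which suggests the header was profiled as data. The following is a minimal sketch of one way to load such a file so the first line becomes the column names and stray whitespace around cells is dropped. It assumes the source is a CSV; the inline text `RAW` and the helper `read_rows` are hypothetical and only reuse values from the samples in this report.

```python
import csv
import io

# Hypothetical excerpt shaped like the sample rows in this report.
# The leading spaces are an assumption about the raw file.
RAW = "지역,업체명\n 홍성군, (농)마동이㈜\n 당진시, 백석올미(영)\n"


def read_rows(text):
    """Use the first CSV line as the header and strip whitespace around each cell."""
    reader = csv.reader(io.StringIO(text))
    header = [name.strip() for name in next(reader)]
    return [dict(zip(header, (cell.strip() for cell in row))) for row in reader]


rows = read_rows(RAW)
```

With the header consumed this way, the header labels would no longer be counted among the column values.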

Common Values

Value | Count | Frequency (%)
논산시 | 25 | 9.8%
공주시 | 19 | 7.5%
서산시 | 18 | 7.1%
청양군 | 17 | 6.7%
예산군 | 16 | 6.3%
부여군 | 16 | 6.3%
아산시 | 15 | 5.9%
서천군 | 14 | 5.5%
금산군 | 14 | 5.5%
천안시 | 13 | 5.1%
Other values (21) | 87 | 34.3%
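
The "Other values" row groups every value outside the ten most frequent ones, and each percentage is the count divided by the number of observations (for example 25 / 254 ≈ 9.8%). A minimal sketch of that kind of summary, using only the standard library; the function name `top_values` and its output shape are illustrative assumptions, not the profiler's implementation.

```python
from collections import Counter


def top_values(values, n=10):
    """Return (label, count, percent) rows for the n most common values,
    followed by one 'Other values (k)' row covering the remaining k labels."""
    counts = Counter(values)
    total = len(values)
    top = counts.most_common(n)
    rows = [(label, count, round(100 * count / total, 1)) for label, count in top]
    rest = total - sum(count for _, count in top)
    if rest:
        label = f"Other values ({len(counts) - len(top)})"
        rows.append((label, rest, round(100 * rest / total, 1)))
    return rows


example = top_values(["a"] * 3 + ["b"] * 2 + ["c"], n=2)
```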

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
논산시 | 30 | 11.8%
공주시 | 23 | 9.1%
부여군 | 21 | 8.3%
서산시 | 20 | 7.9%
청양군 | 19 | 7.5%
예산군 | 19 | 7.5%
아산시 | 18 | 7.1%
서천군 | 17 | 6.7%
금산군 | 17 | 6.7%
천안시 | 16 | 6.3%
Other values (6) | 54 | 21.3%

Column2
Text

UNIQUE 

Distinct: 254
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 23
Median length: 17
Mean length: 12.161417
Min length: 2

Characters and Unicode

Total characters: 3089
Distinct characters: 341
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
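
As an illustration of these character properties, the sketch below counts the Unicode general category of each character in one business name from this report, using Python's standard `unicodedata` module. The helper name `category_counts` is an assumption made for this example; the two-letter codes map to the category names used in the report (`Lo` = Other Letter, `So` = Other Symbol, `Ps` = Open Punctuation, `Pe` = Close Punctuation, `Zs` = Space Separator).

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category code (e.g. 'Lo', 'Zs', 'Ps')."""
    return Counter(unicodedata.category(ch) for ch in text)


# One value taken from the Column2 sample rows.
counts = category_counts("(농)마동이㈜")
```

Python's standard library exposes the general category but not the script or block of a character, so those two breakdowns would need another source of Unicode data.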

Unique

Unique: 254
Unique (%): 100.0%

Sample

1st row: 업체명
2nd row: (농)마동이㈜
3rd row: 백석올미(영)
4th row: (농)예산사과와인㈜
5th row: (농.주)태신목장

Value | Count | Frequency (%)
농업회사법인 | 75 | 16.9%
주식회사 | 68 | 15.3%
영농조합법인 | 12 | 2.7%
(not shown) | 7 | 1.6%
유한회사 | 2 | 0.5%
농업회사법인㈜ | 2 | 0.5%
berily | 1 | 0.2%
송풍방앗간 | 1 | 0.2%
한산소곡주 | 1 | 0.2%
백제향 | 1 | 0.2%
Other values (274) | 274 | 61.7%

Most occurring characters

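Value | Count | Frequency (%)
(space) | 560 | 18.1%
(not shown) | 180 | 5.8%
(not shown) | 179 | 5.8%
(not shown) | 175 | 5.7%
(not shown) | 147 | 4.8%
(not shown) | 134 | 4.3%
(not shown) | 103 | 3.3%
(not shown) | 95 | 3.1%
(not shown) | 92 | 3.0%
(not shown) | 62 | 2.0%
Other values (331) | 1362 | 44.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2423 | 78.4%
Space Separator | 560 | 18.1%
Other Symbol | 24 | 0.8%
Open Punctuation | 23 | 0.7%
Close Punctuation | 23 | 0.7%
Uppercase Letter | 20 | 0.6%
Lowercase Letter | 11 | 0.4%
Other Punctuation | 4 | 0.1%
Dash Punctuation | 1 | < 0.1%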

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 180 | 7.4%
(not shown) | 179 | 7.4%
(not shown) | 175 | 7.2%
(not shown) | 147 | 6.1%
(not shown) | 134 | 5.5%
(not shown) | 103 | 4.3%
(not shown) | 95 | 3.9%
(not shown) | 92 | 3.8%
(not shown) | 62 | 2.6%
(not shown) | 52 | 2.1%
Other values (301) | 1204 | 49.7%

Uppercase Letter
Value | Count | Frequency (%)
A | 4 | 20.0%
T | 3 | 15.0%
V | 2 | 10.0%
H | 2 | 10.0%
E | 2 | 10.0%
B | 1 | 5.0%
R | 1 | 5.0%
N | 1 | 5.0%
L | 1 | 5.0%
M | 1 | 5.0%
Other values (2) | 2 | 10.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 2 | 18.2%
l | 1 | 9.1%
y | 1 | 9.1%
i | 1 | 9.1%
r | 1 | 9.1%
b | 1 | 9.1%
v | 1 | 9.1%
o | 1 | 9.1%
n | 1 | 9.1%
u | 1 | 9.1%

Other Punctuation
Value | Count | Frequency (%)
· | 2 | 50.0%
, | 1 | 25.0%
. | 1 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 560 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not shown) | 24 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 23 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 23 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2447 | 79.2%
Common | 611 | 19.8%
Latin | 31 | 1.0%

Most frequent character per script

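Hangul
Value | Count | Frequency (%)
(not shown) | 180 | 7.4%
(not shown) | 179 | 7.3%
(not shown) | 175 | 7.2%
(not shown) | 147 | 6.0%
(not shown) | 134 | 5.5%
(not shown) | 103 | 4.2%
(not shown) | 95 | 3.9%
(not shown) | 92 | 3.8%
(not shown) | 62 | 2.5%
(not shown) | 52 | 2.1%
Other values (302) | 1228 | 50.2%

Latin
Value | Count | Frequency (%)
A | 4 | 12.9%
T | 3 | 9.7%
e | 2 | 6.5%
V | 2 | 6.5%
H | 2 | 6.5%
E | 2 | 6.5%
l | 1 | 3.2%
y | 1 | 3.2%
i | 1 | 3.2%
r | 1 | 3.2%
Other values (12) | 12 | 38.7%

Common
Value | Count | Frequency (%)
(space) | 560 | 91.7%
( | 23 | 3.8%
) | 23 | 3.8%
· | 2 | 0.3%
, | 1 | 0.2%
- | 1 | 0.2%
. | 1 | 0.2%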

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2423 | 78.4%
ASCII | 640 | 20.7%
None | 26 | 0.8%

Most frequent character per block

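ASCII
Value | Count | Frequency (%)
(space) | 560 | 87.5%
( | 23 | 3.6%
) | 23 | 3.6%
A | 4 | 0.6%
T | 3 | 0.5%
e | 2 | 0.3%
V | 2 | 0.3%
H | 2 | 0.3%
E | 2 | 0.3%
l | 1 | 0.2%
Other values (18) | 18 | 2.8%

Hangul
Value | Count | Frequency (%)
(not shown) | 180 | 7.4%
(not shown) | 179 | 7.4%
(not shown) | 175 | 7.2%
(not shown) | 147 | 6.1%
(not shown) | 134 | 5.5%
(not shown) | 103 | 4.3%
(not shown) | 95 | 3.9%
(not shown) | 92 | 3.8%
(not shown) | 62 | 2.6%
(not shown) | 52 | 2.1%
Other values (301) | 1204 | 49.7%

None
Value | Count | Frequency (%)
(not shown) | 24 | 92.3%
· | 2 | 7.7%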

Column3
Text

Distinct: 193
Distinct (%): 76.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 126
Median length: 64
Mean length: 14.996063
Min length: 2

Characters and Unicode

Total characters: 3809
Distinct characters: 213
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 182
Unique (%): 71.7%

Sample

1st row: 업태
2nd row: 제조업, 농업, 도소매
3rd row: 도,소매,제조
4th row: 제조업
5th row: 축산업 육우사육

Value | Count | Frequency (%)
제조업 | 92 | 17.1%
(not shown) | 33 | 6.1%
농업 | 30 | 5.6%
소매업 | 23 | 4.3%
제조 | 21 | 3.9%
도소매 | 20 | 3.7%
도매 | 18 | 3.3%
서비스 | 11 | 2.0%
도소매업 | 9 | 1.7%
서비스업 | 8 | 1.5%
Other values (233) | 274 | 50.8%

Most occurring characters

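Value | Count | Frequency (%)
(space) | 628 | 16.5%
(not shown) | 343 | 9.0%
, | 269 | 7.1%
(not shown) | 221 | 5.8%
(not shown) | 214 | 5.6%
(not shown) | 187 | 4.9%
(not shown) | 137 | 3.6%
(not shown) | 118 | 3.1%
(not shown) | 118 | 3.1%
/ | 115 | 3.0%
Other values (203) | 1459 | 38.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2714 | 71.3%
Space Separator | 628 | 16.5%
Other Punctuation | 403 | 10.6%
Close Punctuation | 32 | 0.8%
Open Punctuation | 32 | 0.8%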

Most frequent character per category

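Other Letter
Value | Count | Frequency (%)
(not shown) | 343 | 12.6%
(not shown) | 221 | 8.1%
(not shown) | 214 | 7.9%
(not shown) | 187 | 6.9%
(not shown) | 137 | 5.0%
(not shown) | 118 | 4.3%
(not shown) | 118 | 4.3%
(not shown) | 62 | 2.3%
(not shown) | 60 | 2.2%
(not shown) | 58 | 2.1%
Other values (195) | 1196 | 44.1%

Other Punctuation
Value | Count | Frequency (%)
, | 269 | 66.7%
/ | 115 | 28.5%
. | 15 | 3.7%
: | 3 | 0.7%
· | 1 | 0.2%

Space Separator
Value | Count | Frequency (%)
(space) | 628 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 32 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 32 | 100.0%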

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2714 | 71.3%
Common | 1095 | 28.7%

Most frequent character per script

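Hangul
Value | Count | Frequency (%)
(not shown) | 343 | 12.6%
(not shown) | 221 | 8.1%
(not shown) | 214 | 7.9%
(not shown) | 187 | 6.9%
(not shown) | 137 | 5.0%
(not shown) | 118 | 4.3%
(not shown) | 118 | 4.3%
(not shown) | 62 | 2.3%
(not shown) | 60 | 2.2%
(not shown) | 58 | 2.1%
Other values (195) | 1196 | 44.1%

Common
Value | Count | Frequency (%)
(space) | 628 | 57.4%
, | 269 | 24.6%
/ | 115 | 10.5%
) | 32 | 2.9%
( | 32 | 2.9%
. | 15 | 1.4%
: | 3 | 0.3%
· | 1 | 0.1%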

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2714 | 71.3%
ASCII | 1094 | 28.7%
None | 1 | < 0.1%

Most frequent character per block

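ASCII
Value | Count | Frequency (%)
(space) | 628 | 57.4%
, | 269 | 24.6%
/ | 115 | 10.5%
) | 32 | 2.9%
( | 32 | 2.9%
. | 15 | 1.4%
: | 3 | 0.3%

Hangul
Value | Count | Frequency (%)
(not shown) | 343 | 12.6%
(not shown) | 221 | 8.1%
(not shown) | 214 | 7.9%
(not shown) | 187 | 6.9%
(not shown) | 137 | 5.0%
(not shown) | 118 | 4.3%
(not shown) | 118 | 4.3%
(not shown) | 62 | 2.3%
(not shown) | 60 | 2.2%
(not shown) | 58 | 2.1%
Other values (195) | 1196 | 44.1%

None
Value | Count | Frequency (%)
· | 1 | 100.0%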

Column4
Categorical

Distinct: 13
Distinct (%): 5.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Value | Count
1*2*3형 | 134
1*2*3형 | 46
1*2형 | 27
1*2형 | 18
1*3형 | 9
Other values (8) | 20

Length

Max length: 10
Median length: 8
Mean length: 6.8307087
Min length: 3

Unique

Unique: 6
Unique (%): 2.4%

Sample

1st row: 농촌융복합산업 유형
2nd row: 1*2*3형
3rd row: 1*2*3형
4th row: 1*2*3형
5th row: 1*2*3형

Common Values

Value | Count | Frequency (%)
1*2*3형 | 134 | 52.8%
1*2*3형 | 46 | 18.1%
1*2형 | 27 | 10.6%
1*2형 | 18 | 7.1%
1*3형 | 9 | 3.5%
1*3형 | 8 | 3.1%
1*2*3 | 6 | 2.4%
농촌융복합산업 유형 | 1 | 0.4%
1.,2.,3차 | 1 | 0.4%
1*2차 | 1 | 0.4%
Other values (3) | 3 | 1.2%
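
The labels `1*2*3형`, `1*2형`, and `1*3형` each appear twice in the table above with different counts, which suggests the raw values differ only in whitespace or in visually identical characters. That is an assumption; the report does not show the underlying bytes. Under that assumption, a minimal sketch of folding such variants together before counting follows. The helper `normalize_label` and the `raw` list are hypothetical.

```python
import unicodedata
from collections import Counter


def normalize_label(value):
    """Apply NFKC normalization, then remove all whitespace from the label."""
    folded = unicodedata.normalize("NFKC", value)
    return "".join(folded.split())


# Hypothetical raw labels: the whitespace variants are assumed, not observed.
raw = ["1*2*3형", " 1*2*3형", "1*2형", "1*2형\u00a0", "1*3형"]
counts = Counter(normalize_label(v) for v in raw)
```

Variants such as `1*2*3` or `1*2*3차형` differ in actual characters, so they would still need an explicit mapping if they are meant to be the same category.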

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
1*2*3형 | 180 | 70.6%
1*2형 | 45 | 17.6%
1*3형 | 17 | 6.7%
1*2*3 | 6 | 2.4%
농촌융복합산업 | 1 | 0.4%
유형 | 1 | 0.4%
1.,2.,3차 | 1 | 0.4%
1*2차 | 1 | 0.4%
1*2*3차형 | 1 | 0.4%
1*3 | 1 | 0.4%

Correlations

 | Column1 | Column4
Column1 | 1.000 | 0.765
Column4 | 0.765 | 1.000

 | Column1 | Column4
Column1 | 1.000 | 0.344
Column4 | 0.344 | 1.000

 | Column1 | Column4
Column1 | 1.000 | 0.344
Column4 | 0.344 | 1.000
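
The extracted text does not say which association measure each matrix uses. For two categorical columns, one common choice is Cramér's V, which is derived from the chi-squared statistic of the contingency table. The sketch below computes the uncorrected form with the standard library only; it is an illustration under that assumption, not the report's actual computation, and `cramers_v` is a name chosen for this example.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    row = Counter(xs)
    col = Counter(ys)
    joint = Counter(zip(xs, ys))
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row), len(col)) - 1
    # With only one category on either side the measure is undefined; return 0.0.
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0


perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
```

With many sparse categories and only 254 rows, the uncorrected statistic tends to be inflated, which is one possible reason for matrices over the same two columns to disagree.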

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

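 | Column1 | Column2 | Column3 | Column4
0 | 지역 | 업체명 | 업태 | 농촌융복합산업 유형
1 | 홍성군 | (농)마동이㈜ | 제조업, 농업, 도소매 | 1*2*3형
2 | 당진시 | 백석올미(영) | 도,소매,제조 | 1*2*3형
3 | 예산군 | (농)예산사과와인㈜ | 제조업 | 1*2*3형
4 | 예산군 | (농.주)태신목장 | 축산업 육우사육 | 1*2*3형
5 | 천안시 | Sun-Love 치즈유제품 [Column2/Column3 boundary not recoverable] | 1*2*3형
6 | 서천군 | 해가마을(영) | 제조업 | 1*2*3형
7 | 공주시 | 농가애 주식회사 농업회사법인 | 제조업 | 1*2형
8 | 공주시 | (농)미마지㈜ | 농업 | 1*2*3형
9 | 논산시 | 궁골식품(영) | 제조업 | 1*2*3형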

 | Column1 | Column2 | Column3 | Column4
244 | 아산시 | 농업회사법인 주식회사 인주라이스 | 제조업 | 1*2*3형
245 | 서산시 | 농업회사법인 주식회사 정담 | 농업, 도매 및 소매업, 서비스업, | 1*2*3형
246 | 논산시 | 농업회사법인 주식회사 양촌윤가농원 | 농업, 도매 및 소매업 | 1*2*3형
247 | 논산시 | 충남승마클럽 농업회사법인 주식회사 | 승마장업, 마필사육 및 판매, 농업, 교육서비스업 | 1*3형
248 | 당진시 | (사)반딧불나눔복지재단 | 교육서비스업, 제조업, 도매 및 소매업, 서비스업 | 1*2형
249 | 금산군 | 금산홍삼랜드 | 제조, 도소매, 소매 | 1*2*3형
250 | 부여군 | 주암 | 농업, 임원, 제조업, 도소매업, 서비스업 | 1*3형
251 | 서천군 | 서천군표고버섯 영농조합법인 | 농업, 도매 및 소매업 | 1*2*3형
252 | 홍성군 | 홍성베리팜 농업회사법인 주식회사 | 제조업, 도소매 | 1*2*3형
253 | 예산군 | 농업회사법인 엠에스바이오 주식회사 | 제조업, 농업, 도매 및 소매업 | 1*2*3형