Overview

Dataset statistics

Number of variables: 1
Number of observations: 714
Missing cells: 1
Missing cells (%): 0.1%
Duplicate rows: 26
Duplicate rows (%): 3.6%
Total size in memory: 5.7 KiB
Average record size in memory: 8.2 B

Variable types

Text: 1

Dataset

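Description: Vegetable cultivation status and production results lookup service
Author: Ministry of Agriculture, Food and Rural Affairs (농림축산식품부)
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220216000000001975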

Alerts

Dataset has 26 (3.6%) duplicate rows (Duplicates)

Reproduction

Analysis started: 2023-12-11 03:46:43.797307
Analysis finished: 2023-12-11 03:46:44.124751
Duration: 0.33 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

Distinct: 406
Distinct (%): 56.9%
Missing: 1
Missing (%): 0.1%
Memory size: 5.7 KiB

Length

Max length: 195
Median length: 103
Mean length: 17.72791
Min length: 1
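The length figures can be cross-checked directly from the column. The mean agrees with the totals reported elsewhere in this report (12640 characters over 713 non-missing rows is about 17.7279). The median of 103 is worth re-checking, since 713 non-negative lengths with a median that high would need far more than 12640 characters in total. The sketch below is a minimal recomputation using only the standard library; `length_stats` and the sample rows are hypothetical stand-ins, not part of the report.

```python
from statistics import mean, median

def length_stats(values):
    """Summarise string lengths, skipping missing (None) entries."""
    lengths = [len(v) for v in values if v is not None]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }

# Hypothetical rows standing in for the profiled column.
rows = ["1. Survey purpose", ";", None, "(Unit: ha)"]
stats = length_stats(rows)
print(stats)
```

Running the same function over the real column would show whether the median in the report reflects the data or a reporting quirk.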

Characters and Unicode

Total characters: 12640
Distinct characters: 449
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
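As a rough illustration of how such character properties can be read, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The `CATEGORY_NAMES` mapping and the `category_counts` helper are hypothetical names introduced here. The standard library exposes the general category but not the script or block, so those two breakdowns would need an additional data source.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter general-category codes seen in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Nd": "Decimal Number",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "So": "Other Symbol",
    "Sm": "Math Symbol",
    "Pf": "Final Punctuation",
}

def category_counts(values):
    """Count characters per general category, skipping missing entries."""
    counts = Counter()
    for text in values:
        if text is None:
            continue
        for ch in text:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not in the mapping.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# Hypothetical sample values in the style of the profiled column.
sample = ["(단위 : ha)", None, "2020.1.1. ~ 2020.12.31"]
counts = category_counts(sample)
print(counts.most_common())
```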

Unique

Unique: 380
Unique (%): 53.3%
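Distinct counts the different values among the 713 non-missing rows (406 / 713 is about 56.9%), while Unique counts the values that occur exactly once (380 / 713 is about 53.3%). The difference, 406 - 380 = 26, equals the number of duplicate rows given in the overview, which is consistent with duplicates being counted as values that occur more than once. The sketch below shows the arithmetic; `distinct_unique` and the sample rows are hypothetical.

```python
from collections import Counter

def distinct_unique(values):
    """Distinct/unique counts over non-missing values."""
    present = [v for v in values if v is not None]
    counts = Counter(present)
    n_distinct = len(counts)
    # A value is "unique" when it occurs exactly once.
    n_unique = sum(1 for c in counts.values() if c == 1)
    return {
        "distinct": n_distinct,
        "distinct_pct": n_distinct / len(present),
        "unique": n_unique,
        "unique_pct": n_unique / len(present),
        "repeated_values": n_distinct - n_unique,
    }

# Hypothetical rows: two values repeat, one occurs once, one is missing.
rows = [";", ";", "(Unit: ha)", "(Unit: ha)", "2. Survey period", None]
result = distinct_unique(rows)
print(result)
```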

Sample

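1st row: 1. Survey purpose and basis
2nd row: ⚪Through the survey of protected-vegetable greenhouse status and vegetable production results, the aim is to provide the basic data needed for planning agricultural production and supply-demand measures, academic research, agricultural policy, and related uses
3rd row: ⚪The survey of protected-vegetable greenhouse status and vegetable production results is a nationally approved statistic, conducted under Article 18 of the 「Statistics Act」 and the 「Agricultural Statistics Survey Rules」, Ordinance of the Ministry of Strategy and Finance No. 509 (2015.11.16)
4th row: 2. Survey reference period
5th row: ⚪Protected cultivation: 2020.1.1. ~ 2020.12.31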
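Value | Count | Frequency (%)
(not shown) | 552 | 18.9%
단위 (unit) | 99 | 3.4%
ha | 98 | 3.3%
(not shown) | 84 | 2.9%
kg/10a | 70 | 2.4%
(not shown) | 38 | 1.3%
자료 (source) | 33 | 1.1%
상품기준 | 31 | 1.1%
서울시농수산식품공사 (Seoul Agro-Fisheries & Food Corporation) | 29 | 1.0%
(not shown) | 24 | 0.8%
Other values (938) | 1868 | 63.8%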
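The word counts above sum to 2926 tokens, so each frequency is the token's count divided by that total (552 / 2926 is about 18.9%). The sketch below is one way to produce such a table, assuming tokens are split on whitespace, lower-cased, and stripped of surrounding ASCII punctuation; that tokenisation is an assumption, not something the report states. Under it, punctuation-only tokens such as ";" or ":" collapse to an empty string, which would be one possible explanation for rows that have no visible value.

```python
import string
from collections import Counter

def word_frequencies(values):
    """Count whitespace-separated tokens after stripping ASCII punctuation."""
    counts = Counter()
    for text in values:
        if text is None:
            continue
        for token in text.lower().split():
            # Punctuation-only tokens become the empty string here.
            counts[token.strip(string.punctuation)] += 1
    return counts

# Hypothetical rows in the style of the profiled column.
values = ["(단위 : ha, kg/10a, 톤)", ";", None, "(단위 : ha)"]
freq = word_frequencies(values)
total = sum(freq.values())
for word, count in freq.most_common():
    print(repr(word), count, f"{count / total:.1%}")
```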

Most occurring characters

Value | Count | Frequency (%)
(space) | 2525 | 20.0%
; | 369 | 2.9%
, | 326 | 2.6%
1 | 273 | 2.2%
: | 268 | 2.1%
) | 226 | 1.8%
( | 221 | 1.7%
a | 174 | 1.4%
0 | 171 | 1.4%
] | 153 | 1.2%
Other values (439) | 7934 | 62.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6158 | 48.7%
Space Separator | 2525 | 20.0%
Other Punctuation | 1301 | 10.3%
Decimal Number | 1080 | 8.5%
Lowercase Letter | 504 | 4.0%
Close Punctuation | 382 | 3.0%
Open Punctuation | 377 | 3.0%
Dash Punctuation | 106 | 0.8%
Uppercase Letter | 101 | 0.8%
Other Symbol | 56 | 0.4%
Other values (2) | 50 | 0.4%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 153 | 2.5%
(not shown) | 130 | 2.1%
(not shown) | 116 | 1.9%
(not shown) | 109 | 1.8%
(not shown) | 108 | 1.8%
(not shown) | 107 | 1.7%
(not shown) | 104 | 1.7%
(not shown) | 101 | 1.6%
(not shown) | 99 | 1.6%
(not shown) | 96 | 1.6%
Other values (367) | 5035 | 81.8%

Uppercase Letter
Value | Count | Frequency (%)
P | 20 | 19.8%
C | 12 | 11.9%
E | 11 | 10.9%
V | 10 | 9.9%
A | 8 | 7.9%
T | 7 | 6.9%
W | 6 | 5.9%
H | 4 | 4.0%
B | 4 | 4.0%
F | 4 | 4.0%
Other values (8) | 15 | 14.9%

Lowercase Letter
Value | Count | Frequency (%)
a | 174 | 34.5%
h | 102 | 20.2%
k | 102 | 20.2%
g | 101 | 20.0%
m | 7 | 1.4%
w | 6 | 1.2%
r | 3 | 0.6%
t | 2 | 0.4%
o | 2 | 0.4%
s | 2 | 0.4%
Other values (3) | 3 | 0.6%

Other Punctuation
Value | Count | Frequency (%)
; | 369 | 28.4%
, | 326 | 25.1%
: | 268 | 20.6%
/ | 106 | 8.1%
& | 63 | 4.8%
# | 63 | 4.8%
. | 54 | 4.2%
* | 43 | 3.3%
· | 4 | 0.3%
(not shown) | 4 | 0.3%

Decimal Number
Value | Count | Frequency (%)
1 | 273 | 25.3%
0 | 171 | 15.8%
8 | 129 | 11.9%
2 | 113 | 10.5%
9 | 110 | 10.2%
3 | 76 | 7.0%
6 | 59 | 5.5%
5 | 58 | 5.4%
4 | 52 | 4.8%
7 | 39 | 3.6%

Other Symbol
Value | Count | Frequency (%)
(not shown) | 38 | 67.9%
(not shown) | 11 | 19.6%
(not shown) | 4 | 7.1%
(not shown) | 2 | 3.6%
(not shown) | 1 | 1.8%

Close Punctuation
Value | Count | Frequency (%)
) | 226 | 59.2%
] | 153 | 40.1%
(not shown) | 2 | 0.5%
(not shown) | 1 | 0.3%

Open Punctuation
Value | Count | Frequency (%)
( | 221 | 58.6%
[ | 153 | 40.6%
(not shown) | 2 | 0.5%
(not shown) | 1 | 0.3%

Math Symbol
Value | Count | Frequency (%)
< | 14 | 36.8%
> | 14 | 36.8%
~ | 8 | 21.1%
+ | 2 | 5.3%

Space Separator
Value | Count | Frequency (%)
(space) | 2525 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 106 | 100.0%

Final Punctuation
Value | Count | Frequency (%)
(not shown) | 12 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6159 | 48.7%
Common | 5876 | 46.5%
Latin | 605 | 4.8%

Most frequent character per script

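Hangul
Value | Count | Frequency (%)
(not shown) | 153 | 2.5%
(not shown) | 130 | 2.1%
(not shown) | 116 | 1.9%
(not shown) | 109 | 1.8%
(not shown) | 108 | 1.8%
(not shown) | 107 | 1.7%
(not shown) | 104 | 1.7%
(not shown) | 101 | 1.6%
(not shown) | 99 | 1.6%
(not shown) | 96 | 1.6%
Other values (368) | 5036 | 81.8%

Common
Value | Count | Frequency (%)
(space) | 2525 | 43.0%
; | 369 | 6.3%
, | 326 | 5.5%
1 | 273 | 4.6%
: | 268 | 4.6%
) | 226 | 3.8%
( | 221 | 3.8%
0 | 171 | 2.9%
] | 153 | 2.6%
[ | 153 | 2.6%
Other values (30) | 1191 | 20.3%

Latin
Value | Count | Frequency (%)
a | 174 | 28.8%
h | 102 | 16.9%
k | 102 | 16.9%
g | 101 | 16.7%
P | 20 | 3.3%
C | 12 | 2.0%
E | 11 | 1.8%
V | 10 | 1.7%
A | 8 | 1.3%
m | 7 | 1.2%
Other values (21) | 58 | 9.6%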

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 6400 | 50.6%
Hangul | 6158 | 48.7%
Geometric Shapes | 49 | 0.4%
None | 15 | 0.1%
Punctuation | 12 | 0.1%
Letterlike Symbols | 4 | < 0.1%
CJK Compat | 2 | < 0.1%

Most frequent character per block

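ASCII
Value | Count | Frequency (%)
(space) | 2525 | 39.5%
; | 369 | 5.8%
, | 326 | 5.1%
1 | 273 | 4.3%
: | 268 | 4.2%
) | 226 | 3.5%
( | 221 | 3.5%
a | 174 | 2.7%
0 | 171 | 2.7%
] | 153 | 2.4%
Other values (50) | 1694 | 26.5%

Hangul
Value | Count | Frequency (%)
(not shown) | 153 | 2.5%
(not shown) | 130 | 2.1%
(not shown) | 116 | 1.9%
(not shown) | 109 | 1.8%
(not shown) | 108 | 1.8%
(not shown) | 107 | 1.7%
(not shown) | 104 | 1.7%
(not shown) | 101 | 1.6%
(not shown) | 99 | 1.6%
(not shown) | 96 | 1.6%
Other values (367) | 5035 | 81.8%

Geometric Shapes
Value | Count | Frequency (%)
(not shown) | 38 | 77.6%
(not shown) | 11 | 22.4%

Punctuation
Value | Count | Frequency (%)
(not shown) | 12 | 100.0%

None
Value | Count | Frequency (%)
· | 4 | 26.7%
(not shown) | 4 | 26.7%
(not shown) | 2 | 13.3%
(not shown) | 2 | 13.3%
(not shown) | 1 | 6.7%
(not shown) | 1 | 6.7%
(not shown) | 1 | 6.7%

Letterlike Symbols
Value | Count | Frequency (%)
(not shown) | 4 | 100.0%

CJK Compat
Value | Count | Frequency (%)
(not shown) | 2 | 100.0%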

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

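First rows

Index | I. 조사개요 (I. Survey overview)
0 | 1. Survey purpose and basis
1 | &#9898;Through the survey of protected-vegetable greenhouse status and vegetable production results, the aim is to provide the basic data needed for planning agricultural production and supply-demand measures, academic research, agricultural policy, and related uses
2 | &#9898;The survey of protected-vegetable greenhouse status and vegetable production results is a nationally approved statistic, conducted under Article 18 of the &#65378;Statistics Act&#65379; and the &#65378;Agricultural Statistics Survey Rules&#65379;, Ordinance of the Ministry of Strategy and Finance No. 509 (2015.11.16)
3 | 2. Survey reference period
4 | &#9898;Protected cultivation: 2020.1.1. ~ 2020.12.31
5 | &#9898;Open field: 2020.1.1. ~ 2020.12.31.
6 | 3. Survey coverage: all vegetable items
7 | 4. Survey method
8 | &#9898;Sample survey (5 items): autumn radish, autumn napa cabbage, chili pepper, garlic, onion
9 | &#9898;Administrative survey (38 items): spring radish, highland radish, winter radish, carrot, spring napa cabbage, highland napa cabbage, winter napa cabbage, cabbage, spinach, lettuce, watermelon, oriental melon, cucumber, squash, tomato, strawberry, green chili pepper, green onion, ginger, lotus root, burdock, taro, water dropwort, crown daisy, garlic chives, eggplant, melon, paprika, head lettuce, celery, bell pepper, red cabbage, parsley, cauliflower, broccoli, kale, ashitaba (신선초), other vegetables

Last rows

Index | I. 조사개요 (I. Survey overview)
704 | 2020 import results for major vegetables
705 | (Unit: tonnes, thousand USD, %)
706 | ;[152]
707 | ;
708 | * Source: 2020 Agriculture, Forestry, Fisheries and Food Export/Import Trends and Statistics (aT)
709 | ;[153]
710 | 2020 Greenhouse Status for Protected Vegetables and Vegetable Production Results
711 | Printed: October 2021 (day left blank). Published: October 2021 (day left blank). Publisher: Ministry of Agriculture, Food and Rural Affairs (농림축산식품부), Distribution and Consumption Policy Bureau, Horticulture Industry Division. Address: Government Complex Sejong, 94 Dasom 2-ro (Eojin-dong), Sejong Special Self-Governing City. http://www.mafra.go.kr Tel: 044-201-2236~7. Fax: 044-868-5310. Printing: ㈜아르빛 044-863-0933
712 | <NA>
713 | ;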
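Several sample rows carry numeric character references such as `&#9898;` and `&#65378;` as literal text. That is consistent with the character tables earlier in the report, where `&` and `#` each occur 63 times and `;` is the most frequent punctuation mark. If the intent is to profile the text as a reader would see it, one option is to decode the references before profiling. The sketch below uses the standard `html.unescape`; `decode_entities` and the sample rows are hypothetical.

```python
import html

def decode_entities(values):
    """Decode HTML character references, leaving missing entries as None."""
    return [None if v is None else html.unescape(v) for v in values]

# Hypothetical rows in the style of the sample above.
rows = [
    "&#9898;시설 : 2020.1.1. ~ 2020.12.31",
    "&#65378;통계법&#65379;",
    None,
    ";[152]",
]
decoded = decode_entities(rows)
for before, after in zip(rows, decoded):
    print(repr(before), "->", repr(after))
```

Rows without references, such as `;[152]`, pass through unchanged, so the step can be applied to the whole column.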

Duplicate rows

Most frequently occurring

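Index | I. 조사개요 (I. Survey overview) | # duplicates
24 | ; | 153
19 | (Unit: ha, kg/10a, tonnes) | 70
21 | * Source: Seoul Agro-Fisheries & Food Corporation (서울시농수산식품공사) | 29
18 | (Unit: ha) | 15
20 | (Unit: ha, tonnes) | 13
25 | < Glossary > | 9
0 | (not shown) | 5
22 | * Source: Korea Agro-Fisheries & Food Trade Corporation (KAMIS) | 3
1 | - EVA: a covering material known as vinyl acetate | 2
2 | - PC: a transparent corrugated or multi-wall sheet covering material, known as polycarbonate | 2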