Overview

Dataset statistics

Number of variables: 3
Number of observations: 6767
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1107
Duplicate rows (%): 16.4%
Total size in memory: 158.7 KiB
Average record size in memory: 24.0 B
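
The percentage and per-record figures above follow from the raw counts. A minimal sketch of that arithmetic, assuming 1 KiB means 1024 bytes and that the report rounds to one decimal place (both are assumptions, not stated in the report):

```python
# Raw figures copied from the dataset statistics in this report.
n_rows = 6767
n_duplicate_rows = 1107
total_size_kib = 158.7  # assumption: 1 KiB = 1024 bytes

# Share of rows flagged as duplicates, as a percentage.
duplicate_pct = 100 * n_duplicate_rows / n_rows

# Average in-memory size of one record, in bytes.
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Both values round to the figures the report shows (16.4% and 24.0 B).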

Variable types

Text: 1
DateTime: 2

Dataset

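Description: Data on cyber-trading item listings. It provides information on products sold through the Agri-Food Exchange (농식품거래소), namely the product creation date and the agricultural product harvest date.
URL: https://www.data.go.kr/data/15072483/fileData.do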

Alerts

Dataset has 1107 (16.4%) duplicate rows [Duplicates]

Reproduction

Analysis started: 2023-12-11 23:00:51.817613
Analysis finished: 2023-12-11 23:00:52.177881
Duration: 0.36 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
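
The reported duration can be checked against the two timestamps. A small sketch using only the standard library; the format string is an assumption about how the timestamps are written:

```python
from datetime import datetime

# Assumed timestamp layout: date, time, microseconds.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-11 23:00:51.817613", FMT)
finished = datetime.strptime("2023-12-11 23:00:52.177881", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()

print(f"Duration: {duration_s:.2f} seconds")
```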

Variables

카탈로그상품명 (catalog product name)
Text

Distinct: 1604
Distinct (%): 23.7%
Missing: 0
Missing (%): 0.0%
Memory size: 53.0 KiB

Length

Max length: 39
Median length: 34
Mean length: 26.619329
Min length: 3
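
Length statistics of this kind can be reproduced by measuring each value in code points. A minimal sketch over the five sample rows shown in this report (not the full 6767-row column, so the results differ from the figures above):

```python
import statistics

# The five sample rows listed for this variable in the report.
rows = [
    "수산물 냉동 해면어류 삼치 은삼치 10 kg 상자(c/s)",
    "경매상품_양파_왕특품_13000(망)",
    "축산물 생축(가축)류 돼지류 돼지류(일반) 110.0 Kg 마리",
    "축산물 생축(가축)류 돼지류 돼지류(일반) 110.0 Kg 마리",
    "농산물 산채류 곤드레나물 곤드레나물 1.0 kg 없음",
]

# len() counts Unicode code points, so each Hangul syllable counts as one.
lengths = [len(row) for row in rows]

max_len = max(lengths)
median_len = statistics.median(lengths)
mean_len = statistics.mean(lengths)
min_len = min(lengths)

print(max_len, median_len, mean_len, min_len)
```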

Characters and Unicode

Total characters: 180133
Distinct characters: 399
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
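
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. A minimal sketch on one sample value from this report; the two-letter codes correspond to the category names used in the tables (for example `Lo` is Other Letter, `Nd` is Decimal Number, `Pc` is Connector Punctuation):

```python
import unicodedata
from collections import Counter

# One sample value from this report (the 2nd sample row).
text = "경매상품_양파_왕특품_13000(망)"

# Two-letter general category code for every character in the value.
category_counts = Counter(unicodedata.category(ch) for ch in text)

for code, count in category_counts.most_common():
    print(code, count)
```

Summing such counters over every value in the column would give a table like "Most occurring categories".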

Unique

Unique: 947
Unique (%): 14.0%
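
Distinct (1604) and Unique (947) are different counts. A minimal sketch of the difference, assuming Distinct means the number of different values and Unique means the number of values that occur exactly once (an assumption about the profiler's definitions, not stated in the report):

```python
from collections import Counter

# The five sample rows from this report; rows 3 and 4 are identical.
values = [
    "수산물 냉동 해면어류 삼치 은삼치 10 kg 상자(c/s)",
    "경매상품_양파_왕특품_13000(망)",
    "축산물 생축(가축)류 돼지류 돼지류(일반) 110.0 Kg 마리",
    "축산물 생축(가축)류 돼지류 돼지류(일반) 110.0 Kg 마리",
    "농산물 산채류 곤드레나물 곤드레나물 1.0 kg 없음",
]

counts = Counter(values)

# Distinct: how many different values appear at all.
n_distinct = len(counts)

# Unique: how many values appear exactly once.
n_unique = sum(1 for n in counts.values() if n == 1)

print(n_distinct, n_unique)
```

On these five rows the repeated value counts toward Distinct but not toward Unique.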

Sample

1st row: 수산물 냉동 해면어류 삼치 은삼치 10 kg 상자(c/s)
2nd row: 경매상품_양파_왕특품_13000(망)
3rd row: 축산물 생축(가축)류 돼지류 돼지류(일반) 110.0 Kg 마리
4th row: 축산물 생축(가축)류 돼지류 돼지류(일반) 110.0 Kg 마리
5th row: 농산물 산채류 곤드레나물 곤드레나물 1.0 kg 없음
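
Value | Count | Frequency (%) | English gloss
kg | 5425 | 12.0% | 
농산물 | 3567 | 7.9% | agricultural products
축산물 | 1907 | 4.2% | livestock products
없음 | 1709 | 3.8% | none
미곡 | 1487 | 3.3% | rice
1.0 | 1484 | 3.3% | 
마리 | 1103 | 2.4% | head (counter for animals)
상자 | 1041 | 2.3% | box
생축(가축)류 | 1000 | 2.2% | live animals (livestock)
돼지류(일반 | 991 | 2.2% | pigs (general
Other values (1096) | 25378 | 56.3% | 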
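
A word table like this can be approximated by splitting each value on whitespace and counting tokens. The sketch below assumes tokens are lowercased and trimmed of ASCII punctuation at both ends; the report does not document its tokenizer, but the token `돼지류(일반` (closing parenthesis missing) is consistent with that assumption. The `tokenize` helper is hypothetical.

```python
import string
from collections import Counter

# Two sample rows from this report.
rows = [
    "축산물 생축(가축)류 돼지류 돼지류(일반) 110.0 Kg 마리",
    "농산물 산채류 곤드레나물 곤드레나물 1.0 kg 없음",
]

def tokenize(text):
    """Lowercase, split on whitespace, trim ASCII punctuation from both ends."""
    tokens = (tok.strip(string.punctuation) for tok in text.lower().split())
    return [tok for tok in tokens if tok]

word_counts = Counter(tok for row in rows for tok in tokenize(row))

for word, count in word_counts.most_common(3):
    print(word, count)
```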

Most occurring characters

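Value | Count | Frequency (%)
(space) | 38512 | 21.4%
0 | 8945 | 5.0%
[Hangul character not preserved] | 7011 | 3.9%
[Hangul character not preserved] | 6640 | 3.7%
[Hangul character not preserved] | 6249 | 3.5%
g | 5921 | 3.3%
. | 5264 | 2.9%
1 | 5171 | 2.9%
k | 4631 | 2.6%
[Hangul character not preserved] | 3992 | 2.2%
Other values (389) | 87797 | 48.7%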

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 97217 | 54.0%
Space Separator | 38512 | 21.4%
Decimal Number | 18048 | 10.0%
Lowercase Letter | 11098 | 6.2%
Other Punctuation | 5507 | 3.1%
Open Punctuation | 3726 | 2.1%
Close Punctuation | 3720 | 2.1%
Uppercase Letter | 2069 | 1.1%
Dash Punctuation | 126 | 0.1%
Connector Punctuation | 95 | 0.1%

Most frequent character per category

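Other Letter
Value | Count | Frequency (%)
[character not preserved] | 7011 | 7.2%
[character not preserved] | 6640 | 6.8%
[character not preserved] | 6249 | 6.4%
[character not preserved] | 3992 | 4.1%
[character not preserved] | 3942 | 4.1%
[character not preserved] | 2848 | 2.9%
[character not preserved] | 2772 | 2.9%
[character not preserved] | 2411 | 2.5%
[character not preserved] | 2316 | 2.4%
[character not preserved] | 2232 | 2.3%
Other values (339) | 56804 | 58.4%

Lowercase Letter
Value | Count | Frequency (%)
g | 5921 | 53.4%
k | 4631 | 41.7%
p | 128 | 1.2%
s | 111 | 1.0%
c | 104 | 0.9%
x | 47 | 0.4%
b | 43 | 0.4%
i | 34 | 0.3%
r | 34 | 0.3%
o | 18 | 0.2%
Other values (4) | 27 | 0.2%

Uppercase Letter
Value | Count | Frequency (%)
K | 1072 | 51.8%
P | 324 | 15.7%
E | 165 | 8.0%
B | 132 | 6.4%
O | 127 | 6.1%
X | 127 | 6.1%
A | 39 | 1.9%
N | 29 | 1.4%
T | 28 | 1.4%
G | 9 | 0.4%
Other values (4) | 17 | 0.8%

Decimal Number
Value | Count | Frequency (%)
0 | 8945 | 49.6%
1 | 5171 | 28.7%
5 | 983 | 5.4%
2 | 879 | 4.9%
4 | 634 | 3.5%
6 | 447 | 2.5%
8 | 336 | 1.9%
3 | 294 | 1.6%
7 | 207 | 1.1%
9 | 152 | 0.8%

Other Punctuation
Value | Count | Frequency (%)
. | 5264 | 95.6%
/ | 239 | 4.3%
* | 3 | 0.1%
' | 1 | < 0.1%

Open Punctuation
Value | Count | Frequency (%)
( | 3697 | 99.2%
[ | 29 | 0.8%

Close Punctuation
Value | Count | Frequency (%)
) | 3691 | 99.2%
] | 29 | 0.8%

Space Separator
Value | Count | Frequency (%)
(space) | 38512 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 126 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 95 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 15 | 100.0%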

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 97214 | 54.0%
Common | 69752 | 38.7%
Latin | 13164 | 7.3%
Han | 3 | < 0.1%

Most frequent character per script

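Hangul
Value | Count | Frequency (%)
[character not preserved] | 7011 | 7.2%
[character not preserved] | 6640 | 6.8%
[character not preserved] | 6249 | 6.4%
[character not preserved] | 3992 | 4.1%
[character not preserved] | 3942 | 4.1%
[character not preserved] | 2848 | 2.9%
[character not preserved] | 2772 | 2.9%
[character not preserved] | 2411 | 2.5%
[character not preserved] | 2316 | 2.4%
[character not preserved] | 2232 | 2.3%
Other values (337) | 56801 | 58.4%

Latin
Value | Count | Frequency (%)
g | 5921 | 45.0%
k | 4631 | 35.2%
K | 1072 | 8.1%
P | 324 | 2.5%
E | 165 | 1.3%
B | 132 | 1.0%
p | 128 | 1.0%
O | 127 | 1.0%
X | 127 | 1.0%
s | 111 | 0.8%
Other values (17) | 426 | 3.2%

Common
Value | Count | Frequency (%)
(space) | 38512 | 55.2%
0 | 8945 | 12.8%
. | 5264 | 7.5%
1 | 5171 | 7.4%
( | 3697 | 5.3%
) | 3691 | 5.3%
5 | 983 | 1.4%
2 | 879 | 1.3%
4 | 634 | 0.9%
6 | 447 | 0.6%
Other values (13) | 1529 | 2.2%

Han
Value | Count | Frequency (%)
[character not preserved] | 2 | 66.7%
[character not preserved] | 1 | 33.3%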

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 97214 | 54.0%
ASCII | 82913 | 46.0%
Letterlike Symbols | 3 | < 0.1%
CJK | 3 | < 0.1%
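
A block is a contiguous range of code points, so a character's block can be found by range lookup. The sketch below hard-codes four ranges from the Unicode block definitions. Mapping the report's short labels "Hangul" and "CJK" to the Hangul Syllables and CJK Unified Ideographs blocks is an assumption, and `block_of` is a hypothetical helper, not part of any library.

```python
from collections import Counter

# (label, first code point, last code point)
BLOCKS = [
    ("ASCII", 0x0000, 0x007F),               # Basic Latin
    ("Letterlike Symbols", 0x2100, 0x214F),
    ("CJK", 0x4E00, 0x9FFF),                 # assumed: CJK Unified Ideographs
    ("Hangul", 0xAC00, 0xD7AF),              # assumed: Hangul Syllables
]

def block_of(ch):
    """Return the label of the block containing ch, or "Other"."""
    cp = ord(ch)
    for name, first, last in BLOCKS:
        if first <= cp <= last:
            return name
    return "Other"

# One sample value from this report (the 5th sample row).
text = "농산물 산채류 곤드레나물 곤드레나물 1.0 kg 없음"
block_counts = Counter(block_of(ch) for ch in text)

print(block_counts)
```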

Most frequent character per block

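ASCII
Value | Count | Frequency (%)
(space) | 38512 | 46.4%
0 | 8945 | 10.8%
g | 5921 | 7.1%
. | 5264 | 6.3%
1 | 5171 | 6.2%
k | 4631 | 5.6%
( | 3697 | 4.5%
) | 3691 | 4.5%
K | 1072 | 1.3%
5 | 983 | 1.2%
Other values (39) | 5026 | 6.1%

Hangul
Value | Count | Frequency (%)
[character not preserved] | 7011 | 7.2%
[character not preserved] | 6640 | 6.8%
[character not preserved] | 6249 | 6.4%
[character not preserved] | 3992 | 4.1%
[character not preserved] | 3942 | 4.1%
[character not preserved] | 2848 | 2.9%
[character not preserved] | 2772 | 2.9%
[character not preserved] | 2411 | 2.5%
[character not preserved] | 2316 | 2.4%
[character not preserved] | 2232 | 2.3%
Other values (337) | 56801 | 58.4%

Letterlike Symbols
Value | Count | Frequency (%)
[character not preserved] | 3 | 100.0%

CJK
Value | Count | Frequency (%)
[character not preserved] | 2 | 66.7%
[character not preserved] | 1 | 33.3%
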
수확일자 (harvest date)
DateTime

Distinct: 550
Distinct (%): 8.1%
Missing: 0
Missing (%): 0.0%
Memory size: 53.0 KiB
Minimum: 2020-01-02 00:00:00
Maximum: 2021-12-31 00:00:00
Histogram with fixed size bins (bins=50)

시스템 등록일자 (system registration date)
DateTime

Distinct: 550
Distinct (%): 8.1%
Missing: 0
Missing (%): 0.0%
Memory size: 53.0 KiB
Minimum: 2020-01-02 00:00:00
Maximum: 2021-12-31 00:00:00
Histogram with fixed size bins (bins=50)
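
Fixed-size binning divides the range between the minimum and maximum into equal-width intervals. A minimal sketch with 50 bins over the reported date range; the `bin_index` helper and the choice to place the maximum in the last bin are assumptions, and the four dates are taken from the sample rows in this report:

```python
from datetime import date

N_BINS = 50
lo, hi = date(2020, 1, 2), date(2021, 12, 31)  # reported minimum and maximum

span_days = (hi - lo).days
bin_width_days = span_days / N_BINS

def bin_index(d):
    """Index of the equal-width bin containing d; the maximum goes in the last bin."""
    idx = int((d - lo).days / bin_width_days)
    return min(idx, N_BINS - 1)

sample_dates = [date(2020, 2, 17), date(2020, 3, 16), date(2020, 3, 18), date(2021, 12, 30)]

counts = [0] * N_BINS
for d in sample_dates:
    counts[bin_index(d)] += 1

print(span_days, round(bin_width_days, 2))
```

With this range each bin covers a little over two weeks.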

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
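
Nullity by column is a count of missing cells per column. A minimal sketch over three rows from the sample table in this report, treating `None` as a missing cell (an assumption about how missing values would be represented):

```python
# Three rows copied from the sample table; None would mark a missing cell.
rows = [
    {"카탈로그상품명": "경매상품_양파_왕특품_13000(망)", "수확일자": "2020-03-16", "시스템 등록일자": "2020-03-16"},
    {"카탈로그상품명": "농산물 산채류 곤드레나물 곤드레나물 1.0 kg 없음", "수확일자": "2020-03-18", "시스템 등록일자": "2020-03-18"},
    {"카탈로그상품명": "농산물 조미채소류 양파 만생양파 1125 kg 파렛트", "수확일자": "2020-03-20", "시스템 등록일자": "2020-03-20"},
]

columns = list(rows[0])

# Number of missing cells in each column.
missing_per_column = {col: sum(row.get(col) is None for row in rows) for col in columns}

missing_cells = sum(missing_per_column.values())
missing_pct = 100 * missing_cells / (len(rows) * len(columns))

print(missing_per_column, f"{missing_pct:.1f}%")
```

These rows have no missing cells, in line with the report's figure of 0 for the full dataset.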

Sample

First rows

Index | 카탈로그상품명 (catalog product name) | 수확일자 (harvest date) | 시스템 등록일자 (system registration date)
0 | 수산물 냉동 해면어류 삼치 은삼치 10 kg 상자(c/s) | 2020-02-17 | 2020-02-17
1 | 경매상품_양파_왕특품_13000(망) | 2020-03-16 | 2020-03-16
2 | 축산물 생축(가축)류 돼지류 돼지류(일반) 110.0 Kg 마리 | 2020-03-18 | 2020-03-18
3 | 축산물 생축(가축)류 돼지류 돼지류(일반) 110.0 Kg 마리 | 2020-03-18 | 2020-03-18
4 | 농산물 산채류 곤드레나물 곤드레나물 1.0 kg 없음 | 2020-03-18 | 2020-03-18
5 | 농산물 조미채소류 양파 만생양파 1125 kg 파렛트 | 2020-03-20 | 2020-03-20
6 | 농산물 조미채소류 양파 만생양파 1125 kg 파렛트 | 2020-03-20 | 2020-03-20
7 | 농산물 조미채소류 양파 만생양파 1125 kg 파렛트 | 2020-03-20 | 2020-03-20
8 | 농산물 조미채소류 양파 만생양파 1125 kg 파렛트 | 2020-03-20 | 2020-03-20
9 | 농산물 조미채소류 양파 만생양파 1125 kg 파렛트 | 2020-03-20 | 2020-03-20

Last rows

Index | 카탈로그상품명 (catalog product name) | 수확일자 (harvest date) | 시스템 등록일자 (system registration date)
6757 | 축산물 기타동물생산물(계란 등) 조란 왕란 1.0 개 개(EA) | 2021-12-29 | 2021-12-29
6758 | 덕연(주) 일반란 대란_보통 | 2021-12-27 | 2021-12-27
6759 | 덕연(주) 일반란 특란_보통 | 2021-12-27 | 2021-12-27
6760 | 덕연(주) 일반란 특란_보통 | 2021-12-29 | 2021-12-29
6761 | 덕연(주) 일반란 대란_보통 | 2021-12-29 | 2021-12-29
6762 | 축산물 생축(가축)류 돼지류 돼지류(일반) 60.0 마리(수) 마리 | 2021-12-28 | 2021-12-28
6763 | 축산물 기타동물생산물(계란 등) 조란 계란 1.0 개 벌크 | 2021-12-29 | 2021-12-29
6764 | 축산물 생축(가축)류 돼지류 돼지류(일반) 60.0 마리(수) 마리 | 2021-12-30 | 2021-12-30
6765 | 농산물 농림가공 조미제품 간장 13.5 kg 상자 | 2021-12-30 | 2021-12-30
6766 | 농산물 농림가공 조미제품 간장 13.5 kg 상자 | 2021-12-30 | 2021-12-30

Duplicate rows

Most frequently occurring

Index | 카탈로그상품명 (catalog product name) | 수확일자 (harvest date) | 시스템 등록일자 (system registration date) | # duplicates
141 | 농산물 과일과채류 딸기 설향 1.0 kg 없음 | 2021-05-10 | 2021-05-10 | 12
218 | 농산물 과일과채류 수박 수박(일반) 8.0 kg 없음 | 2020-06-10 | 2020-06-10 | 12
382 | 농산물 미곡 벼 일반계 666667.0 kg 없음 | 2021-10-12 | 2021-10-12 | 12
360 | 농산물 미곡 벼 일반계 40.0 kg 없음 | 2020-10-28 | 2020-10-28 | 11
195 | 농산물 과일과채류 설향1kg없음 | 2021-12-27 | 2021-12-27 | 10
335 | 농산물 미곡 벼 일반계 1.0 kg 없음 | 2020-11-24 | 2020-11-24 | 10
362 | 농산물 미곡 벼 일반계 40.0 kg 없음 | 2020-11-24 | 2020-11-24 | 10
769 | 축산물 국내산육류 돈육 이분도체 100.0 kg 박스 | 2021-11-08 | 2021-11-08 | 10
924 | 축산물 생축(가축)류 돼지류 돼지류(일반) 110.0 Kg 마리 | 2020-06-09 | 2020-06-09 | 10
28 | 가공식품 면류 기타면류 기타 500.0 g | 2020-02-18 | 2020-02-18 | 9
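
Duplicate rows can be found by counting whole rows, with all three columns compared together. A minimal sketch over rows 2 to 9 of the first-rows sample in this report; it assumes "# duplicates" means the number of times an identical row occurs, which the report does not state:

```python
from collections import Counter

PIG = ("축산물 생축(가축)류 돼지류 돼지류(일반) 110.0 Kg 마리", "2020-03-18", "2020-03-18")
GONDRE = ("농산물 산채류 곤드레나물 곤드레나물 1.0 kg 없음", "2020-03-18", "2020-03-18")
ONION = ("농산물 조미채소류 양파 만생양파 1125 kg 파렛트", "2020-03-20", "2020-03-20")

# Rows 2 to 9 of the first-rows sample: (product name, harvest date, registration date).
rows = [PIG, PIG, GONDRE, ONION, ONION, ONION, ONION, ONION]

row_counts = Counter(rows)

# Keep only rows that occur more than once, most frequent first.
duplicates = [(row, n) for row, n in row_counts.most_common() if n > 1]

for row, n in duplicates:
    print(n, row[0])
```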