Overview

Dataset statistics

Number of variables: 8
Number of observations: 9628
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): < 0.1%
Total size in memory: 630.1 KiB
Average record size in memory: 67.0 B
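
The average record size follows from the two figures above it. A minimal sketch of that arithmetic, assuming the report's KiB means 1024 bytes (the variable names are illustrative, not taken from the profiling tool):

```python
# Figures copied from the "Dataset statistics" block of the report.
total_size_kib = 630.1
n_rows = 9628

# Convert KiB to bytes (assuming 1 KiB = 1024 bytes) and divide by the row count.
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"{avg_record_bytes:.1f} B")
```

The result rounds to the 67.0 B shown in the report.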

Variable types

Categorical: 2
Numeric: 3
Text: 1
DateTime: 2

Dataset

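Description: Certification information for antibiotic-free livestock products (certification number, certification type name, certified farm, certified item name, cultivation (workplace) area (number of animals raised), planned production (import) quantity, certification period (start date), certification period (end date), raw-material certification category)
Author: 국립농산물품질관리원 (National Agricultural Products Quality Management Service)
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220711000000002154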

Alerts

Dataset has 1 (< 0.1%) duplicate row [Duplicates]
인증번호 is highly overall correlated with 인증구분명 [High correlation]
사육두수 is highly overall correlated with 생산계획량 [High correlation]
생산계획량 is highly overall correlated with 사육두수 [High correlation]
인증구분명 is highly overall correlated with 인증번호 [High correlation]
생산계획량 is highly skewed (γ1 = 97.8346063) [Skewed]
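
The skewness alert reports a moment-based asymmetry measure. The sketch below computes the plain (population) moment coefficient of skewness, m3 / m2^1.5, on made-up numbers to show how a single extreme value pushes it upward. The helper name and the sample lists are hypothetical, and the report's γ1 may come from a bias-corrected estimator, so this is not expected to reproduce 97.83 on the real data.

```python
from statistics import fmean


def moment_skewness(values):
    """Population moment coefficient of skewness: m3 / m2**1.5."""
    n = len(values)
    mean = fmean(values)
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    return m3 / m2 ** 1.5


# Hypothetical data: a symmetric sample and the same sample with one outlier.
symmetric = [1, 2, 3, 4, 5]
with_outlier = [1, 2, 3, 4, 500]

print(moment_skewness(symmetric))
print(moment_skewness(with_outlier))
```

A symmetric sample gives a value near zero, while the outlier produces a clearly positive value. This matches the shape of 생산계획량, where one record near 9.68 × 10^9 sits far above a median of 33000.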

Reproduction

Analysis started: 2024-01-05 22:18:20.887134
Analysis finished: 2024-01-05 22:18:30.462397
Duration: 9.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
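
The duration is the difference between the two timestamps. A short check using only the standard library, with the timestamps copied from the lines above:

```python
from datetime import datetime

# Timestamps as printed in the Reproduction block.
started = datetime.fromisoformat("2024-01-05 22:18:20.887134")
finished = datetime.fromisoformat("2024-01-05 22:18:30.462397")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```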

Variables

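인증구분명 (certification category name)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 75.3 KiB

Value | Count
무항생제축산물 (antibiotic-free livestock product) | 7286
취급자 (handler) | 2342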

Length

Max length: 7
Median length: 7
Mean length: 6.0270046
Min length: 3
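
The mean length can be recovered from the two category counts, since each value has a fixed character length. A minimal sketch, assuming length is measured in Unicode code points as Python's `len` does:

```python
# Category counts copied from the report for 인증구분명.
counts = {"무항생제축산물": 7286, "취급자": 2342}

total_rows = sum(counts.values())
# Weighted average of the string lengths (7 and 3 characters).
mean_length = sum(len(value) * n for value, n in counts.items()) / total_rows

print(total_rows, round(mean_length, 7))
```

The counts add up to the 9628 observations, and the weighted mean agrees with the reported 6.0270046.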

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 무항생제축산물
2nd row: 무항생제축산물
3rd row: 무항생제축산물
4th row: 무항생제축산물
5th row: 무항생제축산물

Common Values

Value | Count | Frequency (%)
무항생제축산물 | 7286 | 75.7%
취급자 | 2342 | 24.3%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: category shares: 무항생제축산물 7286 (75.7%), 취급자 2342 (24.3%)]

인증번호 (certification number)
Real number (ℝ)

HIGH CORRELATION

Distinct: 7774
Distinct (%): 80.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12918935
Minimum: 1600002
Maximum: 18600122
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 84.8 KiB

Quantile statistics

Minimum: 1600002
5-th percentile: 3600325.4
Q1: 10502741
Median: 13502414
Q3: 15502623
95-th percentile: 17501904
Maximum: 18600122
Range: 17000120
Interquartile range (IQR): 4999882.5

Descriptive statistics

Standard deviation: 3746970.3
Coefficient of variation (CV): 0.29003708
Kurtosis: 0.93854119
Mean: 12918935
Median Absolute Deviation (MAD): 2901528.5
Skewness: -1.0738371
Sum: 1.2438351 × 10^11
Variance: 1.4039786 × 10^13
Monotonicity: Not monotonic
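
Several of these statistics are derived from others. The sketch below recomputes the range, coefficient of variation, and variance for 인증번호 from the rounded figures printed in the report. Because the inputs are rounded, the results agree with the report only to about seven significant digits.

```python
# Rounded figures copied from the report for 인증번호.
minimum = 1600002
maximum = 18600122
mean = 12918935
std = 3746970.3

value_range = maximum - minimum  # Range = Maximum - Minimum
cv = std / mean                  # Coefficient of variation = std / mean
variance = std ** 2              # Variance = std squared

print(value_range)
print(f"{cv:.6f}")
print(f"{variance:.6e}")
```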
Histogram with fixed size bins (bins=50)

Most common values
Value | Count | Frequency (%)
17501679 | 80 | 0.8%
12501871 | 50 | 0.5%
17501796 | 44 | 0.5%
17501795 | 37 | 0.4%
16502137 | 26 | 0.3%
13501838 | 21 | 0.2%
15501734 | 21 | 0.2%
15500004 | 19 | 0.2%
13502324 | 16 | 0.2%
16501986 | 15 | 0.2%
Other values (7764) | 9299 | 96.6%

Smallest 10 values
Value | Count | Frequency (%)
1600002 | 1 | < 0.1%
1600010 | 1 | < 0.1%
1600011 | 2 | < 0.1%
1600013 | 4 | < 0.1%
1600014 | 1 | < 0.1%
1600017 | 1 | < 0.1%
1600018 | 1 | < 0.1%
1600020 | 2 | < 0.1%
1600021 | 1 | < 0.1%
1600022 | 2 | < 0.1%

Largest 10 values
Value | Count | Frequency (%)
18600122 | 1 | < 0.1%
18600120 | 1 | < 0.1%
18600115 | 2 | < 0.1%
18600113 | 1 | < 0.1%
18600112 | 1 | < 0.1%
18600109 | 1 | < 0.1%
18600107 | 1 | < 0.1%
18600105 | 1 | < 0.1%
18600093 | 2 | < 0.1%
18600089 | 2 | < 0.1%

인증농가 (certified farm)
Text

Distinct: 7141
Distinct (%): 74.2%
Missing: 0
Missing (%): 0.0%
Memory size: 75.3 KiB

Length

Max length: 25
Median length: 3
Mean length: 4.1635854
Min length: 2

Characters and Unicode

Total characters: 40087
Distinct characters: 467
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5600
Unique (%): 58.2%

Sample

1st row: 이철민
2nd row: 서순식
3rd row: 임종호
4th row: 임종호
5th row: 오복심

Most frequent words
Value | Count | Frequency (%)
강희석 | 131 | 1.2%
농업회사법인 | 37 | 0.4%
박홍진 | 34 | 0.3%
예당한우 | 18 | 0.2%
주식회사 | 16 | 0.2%
이범호 | 15 | 0.1%
이제훈 | 14 | 0.1%
권봉수 | 14 | 0.1%
인용식 | 11 | 0.1%
조현정 | 10 | 0.1%
Other values (7614) | 10183 | 97.1%

Most occurring characters

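Value | Count | Frequency (%)
(glyph not preserved) | 2002 | 5.0%
(glyph not preserved) | 1639 | 4.1%
(glyph not preserved) | 1226 | 3.1%
(glyph not preserved) | 996 | 2.5%
(glyph not preserved) | 973 | 2.4%
(glyph not preserved) | 973 | 2.4%
( | 961 | 2.4%
) | 960 | 2.4%
(space) | 856 | 2.1%
(glyph not preserved) | 853 | 2.1%
Other values (457) | 28648 | 71.5%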

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 36949 | 92.2%
Open Punctuation | 961 | 2.4%
Close Punctuation | 960 | 2.4%
Space Separator | 856 | 2.1%
Other Punctuation | 202 | 0.5%
Decimal Number | 99 | 0.2%
Uppercase Letter | 60 | 0.1%

Most frequent character per category

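Other Letter
Value | Count | Frequency (%)
(glyph not preserved) | 2002 | 5.4%
(glyph not preserved) | 1639 | 4.4%
(glyph not preserved) | 1226 | 3.3%
(glyph not preserved) | 996 | 2.7%
(glyph not preserved) | 973 | 2.6%
(glyph not preserved) | 973 | 2.6%
(glyph not preserved) | 853 | 2.3%
(glyph not preserved) | 645 | 1.7%
(glyph not preserved) | 586 | 1.6%
(glyph not preserved) | 555 | 1.5%
Other values (424) | 26501 | 71.7%

Uppercase Letter
Value | Count | Frequency (%)
B | 16 | 26.7%
F | 7 | 11.7%
S | 6 | 10.0%
H | 6 | 10.0%
M | 5 | 8.3%
D | 4 | 6.7%
N | 3 | 5.0%
Y | 2 | 3.3%
A | 2 | 3.3%
G | 2 | 3.3%
Other values (6) | 7 | 11.7%

Decimal Number
Value | Count | Frequency (%)
2 | 41 | 41.4%
1 | 38 | 38.4%
3 | 6 | 6.1%
5 | 4 | 4.0%
4 | 3 | 3.0%
0 | 3 | 3.0%
8 | 2 | 2.0%
7 | 1 | 1.0%
9 | 1 | 1.0%

Other Punctuation
Value | Count | Frequency (%)
, | 189 | 93.6%
/ | 6 | 3.0%
& | 5 | 2.5%
: | 1 | 0.5%
. | 1 | 0.5%

Open Punctuation
Value | Count | Frequency (%)
( | 961 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 960 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 856 | 100.0%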

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 36949 | 92.2%
Common | 3078 | 7.7%
Latin | 60 | 0.1%

Most frequent character per script

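Hangul
Value | Count | Frequency (%)
(glyph not preserved) | 2002 | 5.4%
(glyph not preserved) | 1639 | 4.4%
(glyph not preserved) | 1226 | 3.3%
(glyph not preserved) | 996 | 2.7%
(glyph not preserved) | 973 | 2.6%
(glyph not preserved) | 973 | 2.6%
(glyph not preserved) | 853 | 2.3%
(glyph not preserved) | 645 | 1.7%
(glyph not preserved) | 586 | 1.6%
(glyph not preserved) | 555 | 1.5%
Other values (424) | 26501 | 71.7%

Common
Value | Count | Frequency (%)
( | 961 | 31.2%
) | 960 | 31.2%
(space) | 856 | 27.8%
, | 189 | 6.1%
2 | 41 | 1.3%
1 | 38 | 1.2%
/ | 6 | 0.2%
3 | 6 | 0.2%
& | 5 | 0.2%
5 | 4 | 0.1%
Other values (7) | 12 | 0.4%

Latin
Value | Count | Frequency (%)
B | 16 | 26.7%
F | 7 | 11.7%
S | 6 | 10.0%
H | 6 | 10.0%
M | 5 | 8.3%
D | 4 | 6.7%
N | 3 | 5.0%
Y | 2 | 3.3%
A | 2 | 3.3%
G | 2 | 3.3%
Other values (6) | 7 | 11.7%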

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 36949 | 92.2%
ASCII | 3138 | 7.8%

Most frequent character per block

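Hangul
Value | Count | Frequency (%)
(glyph not preserved) | 2002 | 5.4%
(glyph not preserved) | 1639 | 4.4%
(glyph not preserved) | 1226 | 3.3%
(glyph not preserved) | 996 | 2.7%
(glyph not preserved) | 973 | 2.6%
(glyph not preserved) | 973 | 2.6%
(glyph not preserved) | 853 | 2.3%
(glyph not preserved) | 645 | 1.7%
(glyph not preserved) | 586 | 1.6%
(glyph not preserved) | 555 | 1.5%
Other values (424) | 26501 | 71.7%

ASCII
Value | Count | Frequency (%)
( | 961 | 30.6%
) | 960 | 30.6%
(space) | 856 | 27.3%
, | 189 | 6.0%
2 | 41 | 1.3%
1 | 38 | 1.2%
B | 16 | 0.5%
F | 7 | 0.2%
S | 6 | 0.2%
H | 6 | 0.2%
Other values (23) | 58 | 1.8%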

인증품목명 (certified item name)
Categorical

Distinct: 18
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 75.3 KiB

Value | Count
한우(식육) | 4513
돼지(식육) | 1750
육계(식육) | 1219
산란계(알) | 798
오리(식육) | 662
Other values (13) | 686

Length

Max length: 9
Median length: 6
Mean length: 5.9879518
Min length: 2

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: 젖소(시유)
2nd row: 육계(식육)
3rd row: 산란계(알)
4th row: 산란육성계
5th row: 젖소(시유)

Common Values

Value | Count | Frequency (%)
한우(식육) | 4513 | 46.9%
돼지(식육) | 1750 | 18.2%
육계(식육) | 1219 | 12.7%
산란계(알) | 798 | 8.3%
오리(식육) | 662 | 6.9%
젖소(시유) | 222 | 2.3%
육우(식육) | 183 | 1.9%
산란육성계 | 146 | 1.5%
메추리 알 | 73 | 0.8%
재래 산양(염소) | 33 | 0.3%
Other values (8) | 29 | 0.3%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
한우(식육 | 4513 | 46.3%
돼지(식육 | 1750 | 18.0%
육계(식육 | 1219 | 12.5%
산란계(알 | 798 | 8.2%
오리(식육 | 662 | 6.8%
젖소(시유 | 222 | 2.3%
육우(식육 | 183 | 1.9%
산란육성계 | 146 | 1.5%
메추리 | 73 | 0.7%
(value not preserved) | 73 | 0.7%
Other values (12) | 107 | 1.1%

사육두수 (number of animals raised)
Real number (ℝ)

HIGH CORRELATION

Distinct: 1981
Distinct (%): 20.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22879.688
Minimum: 0
Maximum: 3000000
Zeros: 3
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 84.8 KiB

Quantile statistics

Minimum: 0
5-th percentile: 29
Q1: 85
Median: 210
Q3: 6143.25
95-th percentile: 111000
Maximum: 3000000
Range: 3000000
Interquartile range (IQR): 6058.25

Descriptive statistics

Standard deviation: 83936.926
Coefficient of variation (CV): 3.668622
Kurtosis: 275.89152
Mean: 22879.688
Median Absolute Deviation (MAD): 170
Skewness: 12.562978
Sum: 2.2028564 × 10^8
Variance: 7.0454075 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Most common values
Value | Count | Frequency (%)
100 | 95 | 1.0%
50 | 91 | 0.9%
40 | 80 | 0.8%
80 | 78 | 0.8%
60 | 77 | 0.8%
30 | 70 | 0.7%
200 | 68 | 0.7%
120 | 67 | 0.7%
70 | 63 | 0.7%
150 | 61 | 0.6%
Other values (1971) | 8878 | 92.2%

Smallest 10 values
Value | Count | Frequency (%)
0 | 3 | < 0.1%
1 | 13 | 0.1%
2 | 10 | 0.1%
3 | 9 | 0.1%
4 | 12 | 0.1%
5 | 18 | 0.2%
6 | 5 | 0.1%
7 | 7 | 0.1%
8 | 22 | 0.2%
9 | 8 | 0.1%

Largest 10 values
Value | Count | Frequency (%)
3000000 | 1 | < 0.1%
1840000 | 1 | < 0.1%
1756032 | 1 | < 0.1%
1750000 | 1 | < 0.1%
1540000 | 1 | < 0.1%
1330000 | 1 | < 0.1%
1285200 | 1 | < 0.1%
1160000 | 1 | < 0.1%
1050000 | 3 | < 0.1%
963600 | 1 | < 0.1%

생산계획량 (planned production quantity)
Real number (ℝ)

HIGH CORRELATION, SKEWED

Distinct: 2806
Distinct (%): 29.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1473952.2
Minimum: 0
Maximum: 9.6808062 × 10^9
Zeros: 35
Zeros (%): 0.4%
Negative: 0
Negative (%): 0.0%
Memory size: 84.8 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1000
Q1: 7000
Median: 33000
Q3: 365100
95-th percentile: 1574384
Maximum: 9.6808062 × 10^9
Range: 9.6808062 × 10^9
Interquartile range (IQR): 358100

Descriptive statistics

Standard deviation: 98754101
Coefficient of variation (CV): 66.999526
Kurtosis: 9589.7114
Mean: 1473952.2
Median Absolute Deviation (MAD): 31775
Skewness: 97.834606
Sum: 1.4191212 × 10^10
Variance: 9.7523724 × 10^15
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Most common values
Value | Count | Frequency (%)
10000.0 | 255 | 2.6%
5000.0 | 205 | 2.1%
2000.0 | 157 | 1.6%
30000.0 | 152 | 1.6%
1000.0 | 150 | 1.6%
20000.0 | 134 | 1.4%
1.0 | 128 | 1.3%
50000.0 | 122 | 1.3%
3000.0 | 116 | 1.2%
4900.0 | 115 | 1.2%
Other values (2796) | 8094 | 84.1%

Smallest 10 values
Value | Count | Frequency (%)
0.0 | 35 | 0.4%
0.1 | 11 | 0.1%
0.2 | 6 | 0.1%
1.0 | 128 | 1.3%
2.0 | 26 | 0.3%
3.0 | 10 | 0.1%
4.0 | 5 | 0.1%
5.0 | 3 | < 0.1%
6.0 | 2 | < 0.1%
29.0 | 1 | < 0.1%

Largest 10 values
Value | Count | Frequency (%)
9680806200.0 | 1 | < 0.1%
329000000.0 | 1 | < 0.1%
220000000.0 | 1 | < 0.1%
90337500.0 | 1 | < 0.1%
73000000.0 | 1 | < 0.1%
49000000.0 | 1 | < 0.1%
40000000.0 | 1 | < 0.1%
30000000.0 | 1 | < 0.1%
29000000.0 | 1 | < 0.1%
27260000.0 | 1 | < 0.1%

인증기간(시작일) (certification period, start date)
DateTime

Distinct: 361
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Memory size: 75.3 KiB
Minimum: 2021-07-07 00:00:00
Maximum: 2022-07-06 00:00:00
Histogram with fixed size bins (bins=50)

인증기간(종료일) (certification period, end date)
DateTime

Distinct: 361
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Memory size: 75.3 KiB
Minimum: 2022-07-06 00:00:00
Maximum: 2023-07-05 00:00:00
Histogram with fixed size bins (bins=50)
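
The two date ranges are offset by one day short of a year (2021-07-07 to 2022-07-06 against 2022-07-06 to 2023-07-05), and the sample rows show the same pattern, for example 2022-02-25 to 2023-02-24. The sketch below encodes that observed pattern. It is an assumption drawn from these figures, not a documented certification rule, and the function name is hypothetical.

```python
from datetime import date, timedelta


def assumed_end_date(start: date) -> date:
    """End date under the assumed pattern: one year after start, minus one day."""
    try:
        next_year = start.replace(year=start.year + 1)
    except ValueError:
        # 29 February has no counterpart in a non-leap year; fall back to 1 March.
        next_year = start.replace(year=start.year + 1, month=3, day=1)
    return next_year - timedelta(days=1)


print(assumed_end_date(date(2021, 7, 7)))
print(assumed_end_date(date(2022, 2, 25)))
```

Both results match values that appear in the report: the minimum end date and the end date of sample row 0.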

Interactions

[Figures: nine pairwise interaction plots; images not preserved]

Correlations

Variable | 인증구분명 | 인증번호 | 인증품목명 | 사육두수 | 생산계획량
인증구분명 | 1.000 | 0.559 | 0.465 | 0.075 | 0.000
인증번호 | 0.559 | 1.000 | 0.355 | 0.023 | 0.000
인증품목명 | 0.465 | 0.355 | 1.000 | 0.180 | 0.000
사육두수 | 0.075 | 0.023 | 0.180 | 1.000 | 0.000
생산계획량 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000

Variable | 인증품목명 | 인증구분명
인증품목명 | 1.000 | 0.367
인증구분명 | 0.367 | 1.000

Variable | 인증번호 | 사육두수 | 생산계획량 | 인증구분명 | 인증품목명
인증번호 | 1.000 | 0.130 | 0.086 | 0.562 | 0.125
사육두수 | 0.130 | 1.000 | 0.757 | 0.056 | 0.076
생산계획량 | 0.086 | 0.757 | 1.000 | 0.000 | 0.000
인증구분명 | 0.562 | 0.056 | 0.000 | 1.000 | 0.367
인증품목명 | 0.125 | 0.076 | 0.000 | 0.367 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First 10 rows
Index | 인증구분명 | 인증번호 | 인증농가 | 인증품목명 | 사육두수 and 생산계획량 (run together in the source) | 인증기간(시작일) | 인증기간(종료일)
0 | 무항생제축산물 | 14501979 | 이철민 | 젖소(시유) | 150730000.0 | 2022-02-25 | 2023-02-24
1 | 무항생제축산물 | 15502197 | 서순식 | 육계(식육) | 1000001120000.0 | 2022-04-23 | 2023-04-22
2 | 무항생제축산물 | 15500074 | 임종호 | 산란계(알) | 28000593938.0 | 2021-08-18 | 2022-08-17
3 | 무항생제축산물 | 15500074 | 임종호 | 산란육성계 | 1450045600.0 | 2021-08-18 | 2022-08-17
4 | 무항생제축산물 | 15502413 | 오복심 | 젖소(시유) | 36292000.0 | 2022-04-28 | 2023-04-27
5 | 무항생제축산물 | 15502413 | 오복심 | 한우(식육) | 81200.0 | 2022-04-28 | 2023-04-27
6 | 무항생제축산물 | 18500305 | 제주양계영농조합법인(오종근) | 산란계(알) | 1860008200000.0 | 2022-05-07 | 2023-05-06
7 | 무항생제축산물 | 18500305 | 제주양계영농조합법인(오종근) | 산란육성계 | 340001.0 | 2022-05-07 | 2023-05-06
8 | 무항생제축산물 | 14501422 | 마리아농장 송귀자 | 산란계(알) | 700001226400.0 | 2022-05-20 | 2023-05-19
9 | 무항생제축산물 | 14500485 | 송제근 | 한우(식육) | 1000440000.0 | 2021-08-06 | 2022-08-05

Last 10 rows
Index | 인증구분명 | 인증번호 | 인증농가 | 인증품목명 | 사육두수 and 생산계획량 (run together in the source) | 인증기간(시작일) | 인증기간(종료일)
9618 | 취급자 | 11600267 | 구본태 | 육계(식육) | 251000.0 | 2022-06-15 | 2023-06-14
9619 | 취급자 | 11600267 | 구본태 | 한우(식육) | 31120000.0 | 2022-06-15 | 2023-06-14
9620 | 취급자 | 10600102 | 이상규 | 돼지(식육) | 1173000.0 | 2021-08-02 | 2022-08-01
9621 | 취급자 | 10600102 | 이상규 | 육우(식육) | 1171000.0 | 2021-08-02 | 2022-08-01
9622 | 취급자 | 10600102 | 이상규 | 한우(식육) | 1171000.0 | 2021-08-02 | 2022-08-01
9623 | 취급자 | 16600452 | 박선엽 | 돼지(식육) | 19420000.0 | 2021-08-26 | 2022-08-25
9624 | 취급자 | 6600013 | 이주헌 | 돼지(식육) | 27350000.0 | 2021-08-02 | 2022-08-01
9625 | 취급자 | 10600590 | 양두한 | 돼지(식육) | 4810000.0 | 2022-03-06 | 2023-03-05
9626 | 취급자 | 10600590 | 양두한 | 한우(식육) | 485000.0 | 2022-03-06 | 2023-03-05
9627 | 취급자 | 6600013 | 이주헌 | 한우(식육) | 27310000.0 | 2021-08-02 | 2022-08-01

Duplicate rows

Most frequently occurring

Index | 인증구분명 | 인증번호 | 인증농가 | 인증품목명 | 사육두수 and 생산계획량 (run together in the source) | 인증기간(시작일) | 인증기간(종료일) | # duplicates
0 | 무항생제축산물 | 14502035 | 삼복3농장 김영범 | 한우(식육) | 521.0 | 2021-08-24 | 2022-08-23 | 2
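
The table above reads as one distinct row that occurs twice, which is consistent with the overview's count of 1 duplicate row. A minimal sketch of that kind of grouping using only the standard library: the rows are made-up placeholders, and the function is illustrative rather than the profiling tool's own implementation.

```python
from collections import Counter


def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}


# Hypothetical rows; each tuple stands for one full record.
rows = [
    ("A", 1, "x"),
    ("B", 2, "y"),
    ("A", 1, "x"),
]

groups = duplicate_groups(rows)
print(groups)
```

Rows must be hashable (tuples here) so that identical records collapse onto the same key.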