Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows7
Duplicate rows (%)0.1%
Total size in memory732.4 KiB
Average record size in memory75.0 B

Variable types

Categorical2
Numeric3
Text1
DateTime2

Dataset

Description무항생제축산물 인증정보에 관한 사항(인증번호, 인증종류명, 인증농가, 인증품목명, 재배(작업장)면적(사육두수), 생산(수입)계획량, 인증기간(시작일), 인증기간(종료일), 원재료인증구분)
Author국립농산물품질관리원
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220711000000002154

Alerts

Dataset has 7 (0.1%) duplicate rowsDuplicates
인증번호 is highly overall correlated with 인증구분명High correlation
사육두수 is highly overall correlated with 생산계획량High correlation
생산계획량 is highly overall correlated with 사육두수High correlation
인증구분명 is highly overall correlated with 인증번호High correlation
생산계획량 is highly skewed (γ1 = 65.17170844)Skewed

Reproduction

Analysis started2024-01-05 22:19:02.211734
Analysis finished2024-01-05 22:19:10.556995
Duration8.35 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

인증구분명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
무항생제축산물
7756 
취급자
2244 

Length

Max length7
Median length7
Mean length6.1024
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row무항생제축산물
2nd row취급자
3rd row무항생제축산물
4th row무항생제축산물
5th row무항생제축산물

Common Values

ValueCountFrequency (%)
무항생제축산물 7756
77.6%
취급자 2244
 
22.4%

Length

2024-01-05T22:19:10.790942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-05T22:19:11.147482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
무항생제축산물 7756
77.6%
취급자 2244
 
22.4%

인증번호
Real number (ℝ)

HIGH CORRELATION 

Distinct8084
Distinct (%)80.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13051677
Minimum1600002
Maximum18600127
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-05T22:19:11.537114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1600002
5-th percentile4600073
Q110503115
median13600088
Q315502999
95-th percentile17501883
Maximum18600127
Range17000125
Interquartile range (IQR)4999884.5

Descriptive statistics

Standard deviation3605006.7
Coefficient of variation (CV)0.27621022
Kurtosis1.1813269
Mean13051677
Median Absolute Deviation (MAD)2099913
Skewness-1.1239218
Sum1.3051677 × 1011
Variance1.2996073 × 1013
MonotonicityNot monotonic
2024-01-05T22:19:12.145496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
17501679 82
 
0.8%
12501871 52
 
0.5%
17501796 48
 
0.5%
17501795 41
 
0.4%
15500096 23
 
0.2%
15500004 19
 
0.2%
13501838 19
 
0.2%
15501734 19
 
0.2%
16500183 18
 
0.2%
16501757 16
 
0.2%
Other values (8074) 9663
96.6%
ValueCountFrequency (%)
1600002 1
< 0.1%
1600010 1
< 0.1%
1600011 2
< 0.1%
1600017 1
< 0.1%
1600018 1
< 0.1%
1600020 2
< 0.1%
1600021 1
< 0.1%
1600022 2
< 0.1%
1600024 1
< 0.1%
1600028 1
< 0.1%
ValueCountFrequency (%)
18600127 1
< 0.1%
18600126 1
< 0.1%
18600124 1
< 0.1%
18600122 1
< 0.1%
18600120 1
< 0.1%
18600115 1
< 0.1%
18600113 1
< 0.1%
18600112 1
< 0.1%
18600107 1
< 0.1%
18600105 1
< 0.1%
Distinct7337
Distinct (%)73.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-05T22:19:13.238486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length3
Mean length4.0839
Min length2

Characters and Unicode

Total characters40839
Distinct characters460
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5716 ?
Unique (%)57.2%

Sample

1st row김주연
2nd row정연수
3rd row김성홍
4th row농업회사법인(주)에버그린 율면지점
5th row홍경학
ValueCountFrequency (%)
강희석 124
 
1.1%
농업회사법인 40
 
0.4%
박홍진 36
 
0.3%
이제훈 30
 
0.3%
인용식 22
 
0.2%
권민석 20
 
0.2%
주식회사 19
 
0.2%
권혁수 13
 
0.1%
한우진 12
 
0.1%
권봉수 12
 
0.1%
Other values (7809) 10556
97.0%
2024-01-05T22:19:15.047674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2052
 
5.0%
1702
 
4.2%
1241
 
3.0%
1003
 
2.5%
989
 
2.4%
979
 
2.4%
886
 
2.2%
881
 
2.2%
) 870
 
2.1%
( 870
 
2.1%
Other values (450) 29366
71.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37845
92.7%
Space Separator 886
 
2.2%
Close Punctuation 870
 
2.1%
Open Punctuation 870
 
2.1%
Other Punctuation 240
 
0.6%
Decimal Number 78
 
0.2%
Uppercase Letter 49
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2052
 
5.4%
1702
 
4.5%
1241
 
3.3%
1003
 
2.7%
989
 
2.6%
979
 
2.6%
881
 
2.3%
705
 
1.9%
572
 
1.5%
563
 
1.5%
Other values (418) 27158
71.8%
Uppercase Letter
ValueCountFrequency (%)
H 8
16.3%
S 8
16.3%
F 8
16.3%
M 6
12.2%
D 3
 
6.1%
N 3
 
6.1%
C 2
 
4.1%
B 2
 
4.1%
G 2
 
4.1%
U 1
 
2.0%
Other values (6) 6
12.2%
Other Punctuation
ValueCountFrequency (%)
, 217
90.4%
/ 12
 
5.0%
& 6
 
2.5%
· 2
 
0.8%
. 2
 
0.8%
: 1
 
0.4%
Decimal Number
ValueCountFrequency (%)
2 40
51.3%
1 29
37.2%
3 5
 
6.4%
5 2
 
2.6%
4 1
 
1.3%
8 1
 
1.3%
Space Separator
ValueCountFrequency (%)
886
100.0%
Close Punctuation
ValueCountFrequency (%)
) 870
100.0%
Open Punctuation
ValueCountFrequency (%)
( 870
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37845
92.7%
Common 2945
 
7.2%
Latin 49
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2052
 
5.4%
1702
 
4.5%
1241
 
3.3%
1003
 
2.7%
989
 
2.6%
979
 
2.6%
881
 
2.3%
705
 
1.9%
572
 
1.5%
563
 
1.5%
Other values (418) 27158
71.8%
Common
ValueCountFrequency (%)
886
30.1%
) 870
29.5%
( 870
29.5%
, 217
 
7.4%
2 40
 
1.4%
1 29
 
1.0%
/ 12
 
0.4%
& 6
 
0.2%
3 5
 
0.2%
· 2
 
0.1%
Other values (6) 8
 
0.3%
Latin
ValueCountFrequency (%)
H 8
16.3%
S 8
16.3%
F 8
16.3%
M 6
12.2%
D 3
 
6.1%
N 3
 
6.1%
C 2
 
4.1%
B 2
 
4.1%
G 2
 
4.1%
U 1
 
2.0%
Other values (6) 6
12.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37845
92.7%
ASCII 2992
 
7.3%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2052
 
5.4%
1702
 
4.5%
1241
 
3.3%
1003
 
2.7%
989
 
2.6%
979
 
2.6%
881
 
2.3%
705
 
1.9%
572
 
1.5%
563
 
1.5%
Other values (418) 27158
71.8%
ASCII
ValueCountFrequency (%)
886
29.6%
) 870
29.1%
( 870
29.1%
, 217
 
7.3%
2 40
 
1.3%
1 29
 
1.0%
/ 12
 
0.4%
H 8
 
0.3%
S 8
 
0.3%
F 8
 
0.3%
Other values (21) 44
 
1.5%
None
ValueCountFrequency (%)
· 2
100.0%

인증품목명
Categorical

Distinct22
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
한우(식육)
4687 
돼지(식육)
1670 
육계(식육)
1344 
산란계(알)
876 
오리(식육)
664 
Other values (17)
759 

Length

Max length9
Median length6
Mean length5.9887
Min length2

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row육계(식육)
2nd row돼지(식육)
3rd row육계(식육)
4th row메추리 알
5th row한우(식육)

Common Values

ValueCountFrequency (%)
한우(식육) 4687
46.9%
돼지(식육) 1670
 
16.7%
육계(식육) 1344
 
13.4%
산란계(알) 876
 
8.8%
오리(식육) 664
 
6.6%
젖소(시유) 225
 
2.2%
육우(식육) 211
 
2.1%
산란육성계 166
 
1.7%
메추리 알 74
 
0.7%
재래 산양(염소) 41
 
0.4%
Other values (12) 42
 
0.4%

Length

2024-01-05T22:19:15.777027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우(식육 4687
46.3%
돼지(식육 1670
 
16.5%
육계(식육 1344
 
13.3%
산란계(알 876
 
8.6%
오리(식육 664
 
6.6%
젖소(시유 225
 
2.2%
육우(식육 211
 
2.1%
산란육성계 166
 
1.6%
메추리 77
 
0.8%
74
 
0.7%
Other values (15) 137
 
1.4%

사육두수
Real number (ℝ)

HIGH CORRELATION 

Distinct2112
Distinct (%)21.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23563.018
Minimum0
Maximum2220000
Zeros2
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-05T22:19:16.660477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile27
Q187
median217
Q38725
95-th percentile115215
Maximum2220000
Range2220000
Interquartile range (IQR)8638

Descriptive statistics

Standard deviation79347.492
Coefficient of variation (CV)3.3674587
Kurtosis165.86516
Mean23563.018
Median Absolute Deviation (MAD)177
Skewness10.036015
Sum2.3563018 × 108
Variance6.2960245 × 109
MonotonicityNot monotonic
2024-01-05T22:19:17.453811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100 102
 
1.0%
70 83
 
0.8%
50 78
 
0.8%
60 76
 
0.8%
40 71
 
0.7%
200 66
 
0.7%
150 65
 
0.7%
120 62
 
0.6%
80 62
 
0.6%
70000 56
 
0.6%
Other values (2102) 9279
92.8%
ValueCountFrequency (%)
0 2
 
< 0.1%
1 16
0.2%
2 17
0.2%
3 15
0.1%
4 12
0.1%
5 12
0.1%
6 8
0.1%
7 14
0.1%
8 14
0.1%
9 9
0.1%
ValueCountFrequency (%)
2220000 1
< 0.1%
1840000 1
< 0.1%
1750000 1
< 0.1%
1380000 1
< 0.1%
1330000 1
< 0.1%
1285200 1
< 0.1%
1057536 1
< 0.1%
1050000 2
< 0.1%
1025610 1
< 0.1%
960000 2
< 0.1%

생산계획량
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct2999
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean471801.2
Minimum0
Maximum3.29 × 108
Zeros25
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-05T22:19:18.023505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile900
Q19600
median42000
Q3407790.75
95-th percentile1590435
Maximum3.29 × 108
Range3.29 × 108
Interquartile range (IQR)398190.75

Descriptive statistics

Standard deviation3930352.8
Coefficient of variation (CV)8.3305274
Kurtosis5113.5231
Mean471801.2
Median Absolute Deviation (MAD)40545.5
Skewness65.171708
Sum4.718012 × 109
Variance1.5447673 × 1013
MonotonicityNot monotonic
2024-01-05T22:19:18.691550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10000.0 264
 
2.6%
1.0 168
 
1.7%
20000.0 141
 
1.4%
7000.0 140
 
1.4%
50000.0 138
 
1.4%
5000.0 135
 
1.4%
30000.0 135
 
1.4%
2000.0 127
 
1.3%
14000.0 124
 
1.2%
1000.0 120
 
1.2%
Other values (2989) 8508
85.1%
ValueCountFrequency (%)
0.0 25
 
0.2%
0.1 6
 
0.1%
0.2 1
 
< 0.1%
1.0 168
1.7%
2.0 34
 
0.3%
3.0 13
 
0.1%
4.0 8
 
0.1%
5.0 2
 
< 0.1%
6.0 1
 
< 0.1%
30.0 1
 
< 0.1%
ValueCountFrequency (%)
329000000.0 1
< 0.1%
151920000.0 1
< 0.1%
60000000.0 1
< 0.1%
49000000.0 1
< 0.1%
36000000.0 2
< 0.1%
32850000.0 1
< 0.1%
30000000.0 2
< 0.1%
25500000.0 1
< 0.1%
24000000.0 1
< 0.1%
20844000.0 1
< 0.1%
Distinct362
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-12-22 00:00:00
Maximum2023-12-21 00:00:00
2024-01-05T22:19:19.224943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-05T22:19:19.639193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct362
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-12-21 00:00:00
Maximum2024-12-20 00:00:00
2024-01-05T22:19:20.162065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-05T22:19:20.761543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-01-05T22:19:08.541954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-05T22:19:05.631176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-05T22:19:07.605648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-05T22:19:08.823161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-05T22:19:06.299683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-05T22:19:07.929739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-05T22:19:09.082943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-05T22:19:06.818850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-05T22:19:08.179105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-05T22:19:21.369525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인증구분명인증번호인증품목명사육두수생산계획량
인증구분명1.0000.5570.4620.0870.048
인증번호0.5571.0000.3160.0370.000
인증품목명0.4620.3161.0000.2430.000
사육두수0.0870.0370.2431.0000.065
생산계획량0.0480.0000.0000.0651.000
2024-01-05T22:19:21.748173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인증품목명인증구분명
인증품목명1.0000.366
인증구분명0.3661.000
2024-01-05T22:19:22.034835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인증번호사육두수생산계획량인증구분명인증품목명
인증번호1.0000.0970.0590.5600.128
사육두수0.0971.0000.7740.0660.092
생산계획량0.0590.7741.0000.0320.000
인증구분명0.5600.0660.0321.0000.366
인증품목명0.1280.0920.0000.3661.000

Missing values

2024-01-05T22:19:09.706252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-05T22:19:10.134922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인증구분명인증번호인증농가인증품목명사육두수생산계획량인증기간(시작일)인증기간(종료일)
5384무항생제축산물14500012김주연육계(식육)70000890000.02023-07-232024-07-22
9133취급자17600095정연수돼지(식육)1397000.02023-11-272024-11-26
2216무항생제축산물18500021김성홍육계(식육)34409280800.02023-09-182024-09-17
3528무항생제축산물10501497농업회사법인(주)에버그린 율면지점메추리 알4920001288576.02023-10-022024-10-01
1548무항생제축산물14502256홍경학한우(식육)321800.02023-06-232024-06-22
1419무항생제축산물12502250정한섭한우(식육)523750.02023-07-242024-07-23
6590무항생제축산물13502131최흥오한우(식육)16814000.02023-03-272024-03-26
7204무항생제축산물13502302김석태한우(식육)9020000.02023-02-082024-02-07
6444무항생제축산물10502077최석철한우(식육)1250.02023-02-172024-02-16
1212무항생제축산물13502562송우농장 정원영한우(식육)1601.02023-04-052024-04-04
인증구분명인증번호인증농가인증품목명사육두수생산계획량인증기간(시작일)인증기간(종료일)
393무항생제축산물14502147이은성육계(식육)30000357000.02023-06-202024-06-19
3332무항생제축산물10502109한영한(용인축협)한우(식육)59750.02023-04-302024-04-29
4222무항생제축산물13501930이무희한우(식육)294000.02023-12-082024-12-07
9186취급자3600218최애경오리(식육)3782000.02023-01-092024-01-08
7164무항생제축산물11501069조근형돼지(식육)630132000.02023-05-162024-05-15
886무항생제축산물16502422방인성육우(식육)18467500.02022-12-272023-12-26
3025무항생제축산물10501568이우재한우(식육)47452500.02023-10-272024-10-26
3119무항생제축산물12501829한형석한우(식육)22.02023-11-072024-11-06
2169무항생제축산물17501679진정순한우(식육)12411250.02023-11-292024-11-28
5201무항생제축산물13501980이재욱한우(식육)7031500.02023-10-162024-10-15

Duplicate rows

Most frequently occurring

인증구분명인증번호인증농가인증품목명사육두수생산계획량인증기간(시작일)인증기간(종료일)# duplicates
0무항생제축산물10503120김동휘한우(식육)444500.02023-10-062024-10-052
1무항생제축산물11501298임학선육계(식육)27500308000.02023-06-272024-06-262
2무항생제축산물13501781유길자한우(식육)6110000.02023-08-052024-08-042
3무항생제축산물14501181이동준육계(식육)65000791700.02023-12-062024-12-052
4무항생제축산물14502283농업회사법인 대지산업(유) 오길자육계(식육)300000360000.02023-10-112024-10-102
5무항생제축산물15500284박남주육계(식육)31500362000.02023-09-302024-09-292
6무항생제축산물16502026권민석한우(식육)446200700.02023-04-252024-04-242