Overview

Dataset statistics

Number of variables6
Number of observations627
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory32.0 KiB
Average record size in memory52.2 B

Variable types

Numeric3
Categorical2
Text1

Dataset

Description국립농산물품질관리원에서 관리하는 농산물 방사능 검사현황 정보(년도, 구분, 품목, 검사건수, 적합건수, 부적합건수)
Author국립농산물품질관리원
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220204000000001682

Alerts

구분 has constant value ""Constant
부적합건수 has constant value ""Constant
검사건수 is highly overall correlated with 적합건수High correlation
적합건수 is highly overall correlated with 검사건수High correlation

Reproduction

Analysis started2024-03-23 07:52:45.892369
Analysis finished2024-03-23 07:52:49.339234
Duration3.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년도
Real number (ℝ)

Distinct11
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2015.1356
Minimum2010
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2024-03-23T07:52:49.562944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2010
5-th percentile2011
Q12013
median2015
Q32017
95-th percentile2020
Maximum2020
Range10
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.6594448
Coefficient of variation (CV)0.0013197349
Kurtosis-0.97506129
Mean2015.1356
Median Absolute Deviation (MAD)2
Skewness0.19466434
Sum1263490
Variance7.0726468
MonotonicityDecreasing
2024-03-23T07:52:49.931760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
2015 90
14.4%
2014 81
12.9%
2013 76
12.1%
2012 73
11.6%
2018 63
10.0%
2016 58
9.3%
2017 53
8.5%
2019 43
6.9%
2020 42
6.7%
2011 42
6.7%
ValueCountFrequency (%)
2010 6
 
1.0%
2011 42
6.7%
2012 73
11.6%
2013 76
12.1%
2014 81
12.9%
2015 90
14.4%
2016 58
9.3%
2017 53
8.5%
2018 63
10.0%
2019 43
6.9%
ValueCountFrequency (%)
2020 42
6.7%
2019 43
6.9%
2018 63
10.0%
2017 53
8.5%
2016 58
9.3%
2015 90
14.4%
2014 81
12.9%
2013 76
12.1%
2012 73
11.6%
2011 42
6.7%

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
국내산
627 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내산
2nd row국내산
3rd row국내산
4th row국내산
5th row국내산

Common Values

ValueCountFrequency (%)
국내산 627
100.0%

Length

2024-03-23T07:52:50.318079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T07:52:50.617833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내산 627
100.0%

품목
Text

Distinct128
Distinct (%)20.4%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
2024-03-23T07:52:51.078463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length3.0701754
Min length1

Characters and Unicode

Total characters1925
Distinct characters170
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)4.5%

Sample

1st row가지
2nd row감귤
3rd row감자
4th row고구마
5th row고구마순(고구마줄기)
ValueCountFrequency (%)
풋고추 11
 
1.8%
대파 11
 
1.8%
10
 
1.6%
마늘 10
 
1.6%
시금치 10
 
1.6%
홍고추(붉은고추 10
 
1.6%
쪽파 10
 
1.6%
오이 10
 
1.6%
열무 10
 
1.6%
호박 10
 
1.6%
Other values (118) 525
83.7%
2024-03-23T07:52:52.120868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
119
 
6.2%
84
 
4.4%
66
 
3.4%
) 56
 
2.9%
( 56
 
2.9%
53
 
2.8%
51
 
2.6%
42
 
2.2%
42
 
2.2%
41
 
2.1%
Other values (160) 1315
68.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1810
94.0%
Close Punctuation 56
 
2.9%
Open Punctuation 56
 
2.9%
Other Punctuation 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
119
 
6.6%
84
 
4.6%
66
 
3.6%
53
 
2.9%
51
 
2.8%
42
 
2.3%
42
 
2.3%
41
 
2.3%
40
 
2.2%
38
 
2.1%
Other values (157) 1234
68.2%
Close Punctuation
ValueCountFrequency (%)
) 56
100.0%
Open Punctuation
ValueCountFrequency (%)
( 56
100.0%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1810
94.0%
Common 115
 
6.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
119
 
6.6%
84
 
4.6%
66
 
3.6%
53
 
2.9%
51
 
2.8%
42
 
2.3%
42
 
2.3%
41
 
2.3%
40
 
2.2%
38
 
2.1%
Other values (157) 1234
68.2%
Common
ValueCountFrequency (%)
) 56
48.7%
( 56
48.7%
. 3
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1810
94.0%
ASCII 115
 
6.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
119
 
6.6%
84
 
4.6%
66
 
3.6%
53
 
2.9%
51
 
2.8%
42
 
2.3%
42
 
2.3%
41
 
2.3%
40
 
2.2%
38
 
2.1%
Other values (157) 1234
68.2%
ASCII
ValueCountFrequency (%)
) 56
48.7%
( 56
48.7%
. 3
 
2.6%

검사건수
Real number (ℝ)

HIGH CORRELATION 

Distinct87
Distinct (%)13.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.735247
Minimum1
Maximum238
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2024-03-23T07:52:52.716280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median4
Q316
95-th percentile65.7
Maximum238
Range237
Interquartile range (IQR)15

Descriptive statistics

Standard deviation29.473171
Coefficient of variation (CV)1.8730669
Kurtosis20.202172
Mean15.735247
Median Absolute Deviation (MAD)3
Skewness4.0075233
Sum9866
Variance868.66781
MonotonicityNot monotonic
2024-03-23T07:52:53.208766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 159
25.4%
2 71
 
11.3%
3 56
 
8.9%
4 40
 
6.4%
5 23
 
3.7%
6 19
 
3.0%
7 16
 
2.6%
8 16
 
2.6%
16 14
 
2.2%
9 11
 
1.8%
Other values (77) 202
32.2%
ValueCountFrequency (%)
1 159
25.4%
2 71
11.3%
3 56
 
8.9%
4 40
 
6.4%
5 23
 
3.7%
6 19
 
3.0%
7 16
 
2.6%
8 16
 
2.6%
9 11
 
1.8%
10 9
 
1.4%
ValueCountFrequency (%)
238 1
0.2%
231 1
0.2%
198 1
0.2%
196 1
0.2%
195 1
0.2%
169 1
0.2%
162 1
0.2%
160 1
0.2%
158 1
0.2%
152 1
0.2%

적합건수
Real number (ℝ)

HIGH CORRELATION 

Distinct87
Distinct (%)13.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.735247
Minimum1
Maximum238
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.6 KiB
2024-03-23T07:52:53.583245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median4
Q316
95-th percentile65.7
Maximum238
Range237
Interquartile range (IQR)15

Descriptive statistics

Standard deviation29.473171
Coefficient of variation (CV)1.8730669
Kurtosis20.202172
Mean15.735247
Median Absolute Deviation (MAD)3
Skewness4.0075233
Sum9866
Variance868.66781
MonotonicityNot monotonic
2024-03-23T07:52:54.043655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 159
25.4%
2 71
 
11.3%
3 56
 
8.9%
4 40
 
6.4%
5 23
 
3.7%
6 19
 
3.0%
7 16
 
2.6%
8 16
 
2.6%
16 14
 
2.2%
9 11
 
1.8%
Other values (77) 202
32.2%
ValueCountFrequency (%)
1 159
25.4%
2 71
11.3%
3 56
 
8.9%
4 40
 
6.4%
5 23
 
3.7%
6 19
 
3.0%
7 16
 
2.6%
8 16
 
2.6%
9 11
 
1.8%
10 9
 
1.4%
ValueCountFrequency (%)
238 1
0.2%
231 1
0.2%
198 1
0.2%
196 1
0.2%
195 1
0.2%
169 1
0.2%
162 1
0.2%
160 1
0.2%
158 1
0.2%
152 1
0.2%

부적합건수
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size5.0 KiB
0
627 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 627
100.0%

Length

2024-03-23T07:52:54.602824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T07:52:54.899415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 627
100.0%

Interactions

2024-03-23T07:52:47.784241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:52:46.235766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:52:47.054273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:52:48.104291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:52:46.487613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:52:47.305551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:52:48.350507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:52:46.737741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:52:47.544533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T07:52:55.069198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도검사건수적합건수
년도1.0000.0000.000
검사건수0.0001.0001.000
적합건수0.0001.0001.000
2024-03-23T07:52:55.345552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년도검사건수적합건수
년도1.000-0.031-0.031
검사건수-0.0311.0001.000
적합건수-0.0311.0001.000

Missing values

2024-03-23T07:52:48.795090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T07:52:49.200079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

년도구분품목검사건수적합건수부적합건수
02020국내산가지440
12020국내산감귤440
22020국내산감자39390
32020국내산고구마25250
42020국내산고구마순(고구마줄기)440
52020국내산고사리110
62020국내산근대110
72020국내산단감220
82020국내산대파42420
92020국내산들깻잎440
년도구분품목검사건수적합건수부적합건수
6172011국내산포도220
6182011국내산풋고추28280
6192011국내산호박880
6202011국내산홍고추(붉은고추)10100
6212010국내산당근110
6222010국내산대파110
6232010국내산330
6242010국내산50500
6252010국내산440
6262010국내산풋고추110