Dataset statistics
Number of variables | 12 |
---|---|
Number of observations | 2388 |
Missing cells | 14328 |
Missing cells (%) | 50.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 242.7 KiB |
Average record size in memory | 104.1 B |
Variable types
Categorical | 3 |
---|---|
Text | 2 |
Numeric | 1 |
Unsupported | 6 |
Dataset
Description | 국립농산물품질관리원에서 관리하는 쌀 등 정곡에 대한 검사 실적 정보(신청년도, 시군, 연산, 용도, 원산지, 검사수량 등) |
---|---|
Author | 국립농산물품질관리원 |
URL | https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220204000000001690 |
용도 is highly imbalanced (91.1%) | Imbalance |
Unnamed: 6 has 2388 (100.0%) missing values | Missing |
Unnamed: 7 has 2388 (100.0%) missing values | Missing |
Unnamed: 8 has 2388 (100.0%) missing values | Missing |
Unnamed: 9 has 2388 (100.0%) missing values | Missing |
Unnamed: 10 has 2388 (100.0%) missing values | Missing |
Unnamed: 11 has 2388 (100.0%) missing values | Missing |
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-03-23 07:43:08.785148 |
---|---|
Analysis finished | 2024-03-23 07:43:10.396607 |
Duration | 1.61 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
신청년도
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 18.8 KiB |
2011 | |
---|---|
2013 | |
2010 | |
2012 | |
2009 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2009 |
---|---|
2nd row | 2009 |
3rd row | 2009 |
4th row | 2009 |
5th row | 2009 |
Common Values
Value | Count | Frequency (%) |
2011 | 598 | |
2013 | 575 | |
2010 | 554 | |
2012 | 552 | |
2009 | 109 | 4.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2011 | 598 | |
2013 | 575 | |
2010 | 554 | |
2012 | 552 | |
2009 | 109 | 4.6% |
시도
Text
Distinct | 90 |
---|---|
Distinct (%) | 3.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 18.8 KiB |
Value | Count | Frequency (%) |
전라남도 | 449 | 9.1% |
경상북도 | 394 | 8.0% |
전라북도 | 256 | 5.2% |
경상남도 | 252 | 5.1% |
경기도 | 245 | 5.0% |
강원도 | 221 | 4.5% |
충청북도 | 217 | 4.4% |
충청남도 | 205 | 4.2% |
북구 | 52 | 1.1% |
논산시 | 43 | 0.9% |
Other values (97) | 2600 |
Most occurring characters
Value | Count | Frequency (%) |
2546 | 13.1% | |
도 | 2239 | 11.5% |
시 | 1274 | 6.6% |
군 | 1151 | 5.9% |
남 | 995 | 5.1% |
경 | 968 | 5.0% |
북 | 919 | 4.7% |
전 | 757 | 3.9% |
라 | 705 | 3.6% |
상 | 672 | 3.5% |
Other values (77) | 7173 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 16853 | |
Space Separator | 2546 | 13.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
도 | 2239 | 13.3% |
시 | 1274 | 7.6% |
군 | 1151 | 6.8% |
남 | 995 | 5.9% |
경 | 968 | 5.7% |
북 | 919 | 5.5% |
전 | 757 | 4.5% |
라 | 705 | 4.2% |
상 | 672 | 4.0% |
청 | 479 | 2.8% |
Other values (76) | 6694 |
Space Separator
Value | Count | Frequency (%) |
2546 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 16853 | |
Common | 2546 | 13.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
도 | 2239 | 13.3% |
시 | 1274 | 7.6% |
군 | 1151 | 6.8% |
남 | 995 | 5.9% |
경 | 968 | 5.7% |
북 | 919 | 5.5% |
전 | 757 | 4.5% |
라 | 705 | 4.2% |
상 | 672 | 4.0% |
청 | 479 | 2.8% |
Other values (76) | 6694 |
Common
Value | Count | Frequency (%) |
2546 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 16853 | |
ASCII | 2546 | 13.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2546 |
Hangul
Value | Count | Frequency (%) |
도 | 2239 | 13.3% |
시 | 1274 | 7.6% |
군 | 1151 | 6.8% |
남 | 995 | 5.9% |
경 | 968 | 5.7% |
북 | 919 | 5.5% |
전 | 757 | 4.5% |
라 | 705 | 4.2% |
상 | 672 | 4.0% |
청 | 479 | 2.8% |
Other values (76) | 6694 |
연산
Real number (ℝ)
Distinct | 9 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2009.1323 |
Minimum | 2005 |
---|---|
Maximum | 2013 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 21.1 KiB |
Quantile statistics
Minimum | 2005 |
---|---|
5-th percentile | 2005 |
Q1 | 2008 |
median | 2009 |
Q3 | 2011 |
95-th percentile | 2012 |
Maximum | 2013 |
Range | 8 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 1.8817342 |
---|---|
Coefficient of variation (CV) | 0.00093659045 |
Kurtosis | -0.40265705 |
Mean | 2009.1323 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.45401308 |
Sum | 4797808 |
Variance | 3.5409234 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2009 | 491 | |
2010 | 447 | |
2011 | 425 | |
2008 | 386 | |
2012 | 204 | |
2007 | 168 | 7.0% |
2005 | 140 | 5.9% |
2006 | 117 | 4.9% |
2013 | 10 | 0.4% |
Value | Count | Frequency (%) |
2005 | 140 | 5.9% |
2006 | 117 | 4.9% |
2007 | 168 | 7.0% |
2008 | 386 | |
2009 | 491 | |
2010 | 447 | |
2011 | 425 | |
2012 | 204 | |
2013 | 10 | 0.4% |
Value | Count | Frequency (%) |
2013 | 10 | 0.4% |
2012 | 204 | |
2011 | 425 | |
2010 | 447 | |
2009 | 491 | |
2008 | 386 | |
2007 | 168 | 7.0% |
2006 | 117 | 4.9% |
2005 | 140 | 5.9% |
용도
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 18.8 KiB |
정곡 | |
---|---|
대북 | 27 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 정곡 |
---|---|
2nd row | 정곡 |
3rd row | 정곡 |
4th row | 정곡 |
5th row | 정곡 |
Common Values
Value | Count | Frequency (%) |
정곡 | 2361 | |
대북 | 27 | 1.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
정곡 | 2361 | |
대북 | 27 | 1.1% |
원산지
Categorical
Distinct | 8 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 18.8 KiB |
국산 | |
---|---|
중국 | |
미국 | |
태국 | |
호주 | 5 |
Other values (3) | 6 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.001675 |
Min length | 2 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 국산 |
---|---|
2nd row | 미국 |
3rd row | 중국 |
4th row | 태국 |
5th row | 국산 |
Common Values
Value | Count | Frequency (%) |
국산 | 1275 | |
중국 | 455 | 19.1% |
미국 | 415 | 17.4% |
태국 | 232 | 9.7% |
호주 | 5 | 0.2% |
인도 | 3 | 0.1% |
베트남 | 2 | 0.1% |
파키스탄 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
국산 | 1275 | |
중국 | 455 | 19.1% |
미국 | 415 | 17.4% |
태국 | 232 | 9.7% |
호주 | 5 | 0.2% |
인도 | 3 | 0.1% |
베트남 | 2 | 0.1% |
파키스탄 | 1 | < 0.1% |
검사수량(kg)
Text
Distinct | 1969 |
---|---|
Distinct (%) | 82.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 18.8 KiB |
Length
Max length | 11 |
---|---|
Median length | 8 |
Mean length | 8.2039363 |
Min length | 3 |
Characters and Unicode
Total characters | 19591 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1808 ? |
---|---|
Unique (%) | 75.7% |
Sample
1st row | 3,657,080 |
---|---|
2nd row | 137,800 |
3rd row | 482,000 |
4th row | 15,000 |
5th row | 2,423,600 |
Value | Count | Frequency (%) |
100,000 | 27 | 1.1% |
50,000 | 24 | 1.0% |
200,000 | 20 | 0.8% |
150,000 | 16 | 0.7% |
30,000 | 15 | 0.6% |
20,000 | 13 | 0.5% |
120,000 | 13 | 0.5% |
10,000 | 13 | 0.5% |
80,000 | 10 | 0.4% |
140,000 | 9 | 0.4% |
Other values (1959) | 2228 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 5106 | |
, | 2973 | |
2388 | ||
2 | 1455 | 7.4% |
1 | 1294 | 6.6% |
4 | 1188 | 6.1% |
6 | 1093 | 5.6% |
8 | 975 | 5.0% |
3 | 933 | 4.8% |
5 | 863 | 4.4% |
Other values (2) | 1323 | 6.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 14230 | |
Other Punctuation | 2973 | 15.2% |
Space Separator | 2388 | 12.2% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 5106 | |
2 | 1455 | 10.2% |
1 | 1294 | 9.1% |
4 | 1188 | 8.3% |
6 | 1093 | 7.7% |
8 | 975 | 6.9% |
3 | 933 | 6.6% |
5 | 863 | 6.1% |
9 | 687 | 4.8% |
7 | 636 | 4.5% |
Other Punctuation
Value | Count | Frequency (%) |
, | 2973 |
Space Separator
Value | Count | Frequency (%) |
2388 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 19591 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 5106 | |
, | 2973 | |
2388 | ||
2 | 1455 | 7.4% |
1 | 1294 | 6.6% |
4 | 1188 | 6.1% |
6 | 1093 | 5.6% |
8 | 975 | 5.0% |
3 | 933 | 4.8% |
5 | 863 | 4.4% |
Other values (2) | 1323 | 6.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 19591 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 5106 | |
, | 2973 | |
2388 | ||
2 | 1455 | 7.4% |
1 | 1294 | 6.6% |
4 | 1188 | 6.1% |
6 | 1093 | 5.6% |
8 | 975 | 5.0% |
3 | 933 | 4.8% |
5 | 863 | 4.4% |
Other values (2) | 1323 | 6.8% |
Unnamed: 6
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 2388 |
---|---|
Missing (%) | 100.0% |
Memory size | 21.1 KiB |
Unnamed: 7
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 2388 |
---|---|
Missing (%) | 100.0% |
Memory size | 21.1 KiB |
Unnamed: 8
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 2388 |
---|---|
Missing (%) | 100.0% |
Memory size | 21.1 KiB |
Unnamed: 9
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 2388 |
---|---|
Missing (%) | 100.0% |
Memory size | 21.1 KiB |
Unnamed: 10
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 2388 |
---|---|
Missing (%) | 100.0% |
Memory size | 21.1 KiB |
Unnamed: 11
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 2388 |
---|---|
Missing (%) | 100.0% |
Memory size | 21.1 KiB |
신청년도 | 시도 | 연산 | 용도 | 원산지 | |
---|---|---|---|---|---|
신청년도 | 1.000 | 0.000 | 0.653 | 0.140 | 0.134 |
시도 | 0.000 | 1.000 | 0.000 | 0.000 | 0.233 |
연산 | 0.653 | 0.000 | 1.000 | 0.469 | 0.525 |
용도 | 0.140 | 0.000 | 0.469 | 1.000 | 0.100 |
원산지 | 0.134 | 0.233 | 0.525 | 0.100 | 1.000 |
용도 | 신청년도 | 원산지 | |
---|---|---|---|
용도 | 1.000 | 0.171 | 0.075 |
신청년도 | 0.171 | 1.000 | 0.082 |
원산지 | 0.075 | 0.082 | 1.000 |
연산 | 신청년도 | 용도 | 원산지 | |
---|---|---|---|---|
연산 | 1.000 | 0.475 | 0.353 | 0.213 |
신청년도 | 0.475 | 1.000 | 0.171 | 0.082 |
용도 | 0.353 | 0.171 | 1.000 | 0.075 |
원산지 | 0.213 | 0.082 | 0.075 | 1.000 |
신청년도 | 시도 | 연산 | 용도 | 원산지 | 검사수량(kg) | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2009 | 강원도 고성군 | 2008 | 정곡 | 국산 | 3,657,080 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
1 | 2009 | 강원도 고성군 | 2008 | 정곡 | 미국 | 137,800 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2 | 2009 | 강원도 고성군 | 2008 | 정곡 | 중국 | 482,000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
3 | 2009 | 강원도 고성군 | 2008 | 정곡 | 태국 | 15,000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
4 | 2009 | 강원도 인제군 | 2008 | 정곡 | 국산 | 2,423,600 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
5 | 2009 | 강원도 인제군 | 2008 | 정곡 | 중국 | 302,880 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
6 | 2009 | 강원도 인제군 | 2008 | 정곡 | 태국 | 200,000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
7 | 2009 | 강원도 춘천시 | 2008 | 정곡 | 국산 | 583,440 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
8 | 2009 | 강원도 춘천시 | 2008 | 정곡 | 중국 | 504,000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9 | 2009 | 강원도 홍천군 | 2007 | 정곡 | 중국 | 56,760 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
신청년도 | 시도 | 연산 | 용도 | 원산지 | 검사수량(kg) | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2378 | 2013 | 충청북도 청주시 흥덕구 | 2012 | 정곡 | 미국 | 165,880 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2379 | 2013 | 충청북도 청주시 흥덕구 | 2012 | 정곡 | 중국 | 462,880 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2380 | 2013 | 충청북도 충주시 | 2009 | 정곡 | 국산 | 94,440 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2381 | 2013 | 충청북도 충주시 | 2010 | 정곡 | 국산 | 56,040 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2382 | 2013 | 충청북도 충주시 | 2010 | 정곡 | 중국 | 139,200 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2383 | 2013 | 충청북도 충주시 | 2011 | 정곡 | 중국 | 423,560 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2384 | 2013 | 충청북도 충주시 | 2011 | 정곡 | 태국 | 84,120 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2385 | 2013 | 충청북도 충주시 | 2012 | 정곡 | 국산 | 2,040,600 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2386 | 2013 | 충청북도 충주시 | 2012 | 정곡 | 미국 | 80,000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
2387 | 2013 | 충청북도 충주시 | 2012 | 정곡 | 중국 | 165,000 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |