Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 31 |
Missing cells | 63 |
Missing cells (%) | 33.9% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.7 KiB |
Average record size in memory | 57.3 B |
Variable types
Unsupported | 2 |
---|---|
Text | 1 |
Numeric | 2 |
Categorical | 1 |
Dataset
Description | 유통식품및자가품질검사현황2017년상반기 |
---|---|
Author | 전라북도 |
URL | https://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=202925 |
자가품질건수 is highly overall correlated with 적합 and 1 other fields | High correlation |
적합 is highly overall correlated with 자가품질건수 and 1 other fields | High correlation |
부적합 is highly overall correlated with 자가품질건수 and 1 other fields | High correlation |
부적합 is highly imbalanced (54.5%) | Imbalance |
Unnamed: 0 has 31 (100.0%) missing values | Missing |
Unnamed: 1 has 31 (100.0%) missing values | Missing |
식품 유형 has 1 (3.2%) missing values | Missing |
Unnamed: 0 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
자가품질건수 has 4 (12.9%) zeros | Zeros |
적합 has 4 (12.9%) zeros | Zeros |
Reproduction
Analysis started | 2024-03-14 01:52:51.816897 |
---|---|
Analysis finished | 2024-03-14 01:52:52.367197 |
Duration | 0.55 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
Unnamed: 0
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 31 |
---|---|
Missing (%) | 100.0% |
Memory size | 411.0 B |
Unnamed: 1
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 31 |
---|---|
Missing (%) | 100.0% |
Memory size | 411.0 B |
식품 유형
Text
MISSING
 
Distinct | 30 |
---|---|
Distinct (%) | 100.0% |
Missing | 1 |
Missing (%) | 3.2% |
Memory size | 380.0 B |
Value | Count | Frequency (%) |
과자류 | 1 | 2.9% |
규격외일반가공품 | 1 | 2.9% |
젓갈류 | 1 | 2.9% |
절임식품 | 1 | 2.9% |
조림식품 | 1 | 2.9% |
주류 | 1 | 2.9% |
건포류 | 1 | 2.9% |
기타식품류 | 1 | 2.9% |
장기보존식품 | 1 | 2.9% |
드레싱류 | 1 | 2.9% |
Other values (25) | 25 |
Most occurring characters
Value | Count | Frequency (%) |
류 | 19 | 14.3% |
품 | 11 | 8.3% |
식 | 8 | 6.0% |
기 | 5 | 3.8% |
5 | 3.8% | |
가 | 4 | 3.0% |
장 | 3 | 2.3% |
물 | 3 | 2.3% |
공 | 3 | 2.3% |
용 | 3 | 2.3% |
Other values (61) | 69 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 128 | |
Space Separator | 5 | 3.8% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
류 | 19 | 14.8% |
품 | 11 | 8.6% |
식 | 8 | 6.2% |
기 | 5 | 3.9% |
가 | 4 | 3.1% |
장 | 3 | 2.3% |
물 | 3 | 2.3% |
공 | 3 | 2.3% |
용 | 3 | 2.3% |
및 | 2 | 1.6% |
Other values (60) | 67 |
Space Separator
Value | Count | Frequency (%) |
5 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 128 | |
Common | 5 | 3.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
류 | 19 | 14.8% |
품 | 11 | 8.6% |
식 | 8 | 6.2% |
기 | 5 | 3.9% |
가 | 4 | 3.1% |
장 | 3 | 2.3% |
물 | 3 | 2.3% |
공 | 3 | 2.3% |
용 | 3 | 2.3% |
및 | 2 | 1.6% |
Other values (60) | 67 |
Common
Value | Count | Frequency (%) |
5 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 128 | |
ASCII | 5 | 3.8% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
류 | 19 | 14.8% |
품 | 11 | 8.6% |
식 | 8 | 6.2% |
기 | 5 | 3.9% |
가 | 4 | 3.1% |
장 | 3 | 2.3% |
물 | 3 | 2.3% |
공 | 3 | 2.3% |
용 | 3 | 2.3% |
및 | 2 | 1.6% |
Other values (60) | 67 |
ASCII
Value | Count | Frequency (%) |
5 |
자가품질건수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 24 |
---|---|
Distinct (%) | 77.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 47.677419 |
Minimum | 0 |
---|---|
Maximum | 739 |
Zeros | 4 |
Zeros (%) | 12.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2 |
median | 9 |
Q3 | 26.5 |
95-th percentile | 157 |
Maximum | 739 |
Range | 739 |
Interquartile range (IQR) | 24.5 |
Descriptive statistics
Standard deviation | 135.71671 |
---|---|
Coefficient of variation (CV) | 2.8465616 |
Kurtosis | 24.228008 |
Mean | 47.677419 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 4.7709373 |
Sum | 1478 |
Variance | 18419.026 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 4 | 12.9% |
2 | 3 | 9.7% |
7 | 2 | 6.5% |
1 | 2 | 6.5% |
45 | 1 | 3.2% |
74 | 1 | 3.2% |
739 | 1 | 3.2% |
86 | 1 | 3.2% |
11 | 1 | 3.2% |
18 | 1 | 3.2% |
Other values (14) | 14 |
Value | Count | Frequency (%) |
0 | 4 | |
1 | 2 | |
2 | 3 | |
4 | 1 | 3.2% |
5 | 1 | 3.2% |
6 | 1 | 3.2% |
7 | 2 | |
8 | 1 | 3.2% |
9 | 1 | 3.2% |
10 | 1 | 3.2% |
Value | Count | Frequency (%) |
739 | 1 | |
228 | 1 | |
86 | 1 | |
75 | 1 | |
74 | 1 | |
45 | 1 | |
36 | 1 | |
29 | 1 | |
24 | 1 | |
19 | 1 |
적합
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 22 |
---|---|
Distinct (%) | 71.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 47.032258 |
Minimum | 0 |
---|---|
Maximum | 729 |
Zeros | 4 |
Zeros (%) | 12.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 411.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2 |
median | 9 |
Q3 | 26.5 |
95-th percentile | 154.5 |
Maximum | 729 |
Range | 729 |
Interquartile range (IQR) | 24.5 |
Descriptive statistics
Standard deviation | 133.82463 |
---|---|
Coefficient of variation (CV) | 2.8453797 |
Kurtosis | 24.268527 |
Mean | 47.032258 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 4.774219 |
Sum | 1458 |
Variance | 17909.032 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 4 | 12.9% |
2 | 3 | 9.7% |
74 | 2 | 6.5% |
5 | 2 | 6.5% |
1 | 2 | 6.5% |
7 | 2 | 6.5% |
45 | 1 | 3.2% |
729 | 1 | 3.2% |
86 | 1 | 3.2% |
11 | 1 | 3.2% |
Other values (12) | 12 |
Value | Count | Frequency (%) |
0 | 4 | |
1 | 2 | |
2 | 3 | |
4 | 1 | 3.2% |
5 | 2 | |
6 | 1 | 3.2% |
7 | 2 | |
9 | 1 | 3.2% |
10 | 1 | 3.2% |
11 | 1 | 3.2% |
Value | Count | Frequency (%) |
729 | 1 | |
223 | 1 | |
86 | 1 | |
74 | 2 | |
45 | 1 | |
35 | 1 | |
29 | 1 | |
24 | 1 | |
19 | 1 | |
18 | 1 |
부적합
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 16.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 380.0 B |
0 | |
---|---|
1 | |
2 | 1 |
5 | 1 |
10 | 1 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.0322581 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 9.7% |
Sample
1st row | 0 |
---|---|
2nd row | 1 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 25 | |
1 | 3 | 9.7% |
2 | 1 | 3.2% |
5 | 1 | 3.2% |
10 | 1 | 3.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 25 | |
1 | 3 | 9.7% |
2 | 1 | 3.2% |
5 | 1 | 3.2% |
10 | 1 | 3.2% |
식품 유형 | 자가품질건수 | 적합 | 부적합 | |
---|---|---|---|---|
식품 유형 | 1.000 | 1.000 | 1.000 | 1.000 |
자가품질건수 | 1.000 | 1.000 | 1.000 | 0.831 |
적합 | 1.000 | 1.000 | 1.000 | 0.831 |
부적합 | 1.000 | 0.831 | 0.831 | 1.000 |
자가품질건수 | 적합 | 부적합 | |
---|---|---|---|
자가품질건수 | 1.000 | 0.999 | 0.786 |
적합 | 0.999 | 1.000 | 0.786 |
부적합 | 0.786 | 0.786 | 1.000 |
Unnamed: 0 | Unnamed: 1 | 식품 유형 | 자가품질건수 | 적합 | 부적합 | |
---|---|---|---|---|---|---|
0 | <NA> | <NA> | 과자류 | 45 | 45 | 0 |
1 | <NA> | <NA> | 빵또는 떡류 | 36 | 35 | 1 |
2 | <NA> | <NA> | 코코아가공품 및 초콜릿류 | 2 | 2 | 0 |
3 | <NA> | <NA> | 잼류 | 2 | 2 | 0 |
4 | <NA> | <NA> | 올리고당류 | 0 | 0 | 0 |
5 | <NA> | <NA> | 두부류 또는 묵류 | 5 | 5 | 0 |
6 | <NA> | <NA> | 식용유지류 | 6 | 6 | 0 |
7 | <NA> | <NA> | 면류 | 9 | 9 | 0 |
8 | <NA> | <NA> | 다류 | 19 | 19 | 0 |
9 | <NA> | <NA> | 커피 | 13 | 13 | 0 |
Unnamed: 0 | Unnamed: 1 | 식품 유형 | 자가품질건수 | 적합 | 부적합 | |
---|---|---|---|---|---|---|
21 | <NA> | <NA> | 규격외일반가공품 | 75 | 74 | 1 |
22 | <NA> | <NA> | 장기보존식품 | 1 | 1 | 0 |
23 | <NA> | <NA> | 건강기능식품 | 0 | 0 | 0 |
24 | <NA> | <NA> | 식품첨가물 | 29 | 29 | 0 |
25 | <NA> | <NA> | 기구및용기포장 | 18 | 18 | 0 |
26 | <NA> | <NA> | 위생용품 | 11 | 11 | 0 |
27 | <NA> | <NA> | 농산물 | 7 | 7 | 0 |
28 | <NA> | <NA> | 서류가공품 | 1 | 1 | 0 |
29 | <NA> | <NA> | 수산물 | 86 | 86 | 0 |
30 | <NA> | <NA> | <NA> | 739 | 729 | 10 |