Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 36 |
Missing cells | 4 |
Missing cells (%) | 2.8% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.4 KiB |
Average record size in memory | 38.7 B |
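The missing-cell percentage above can be checked by hand. The following is a minimal sketch, assuming the percentage is the number of missing cells divided by the total number of cells (observations times variables); the function name `missing_cell_percentage` is hypothetical and not part of the report.

```python
def missing_cell_percentage(n_missing: int, n_rows: int, n_columns: int) -> float:
    """Return the share of empty cells among all cells, in percent."""
    total_cells = n_rows * n_columns  # every observation has one cell per variable
    return 100.0 * n_missing / total_cells


# Figures taken from the table above: 4 missing cells, 36 observations, 4 variables.
pct = missing_cell_percentage(n_missing=4, n_rows=36, n_columns=4)
print(f"{pct:.1f}%")
```

Under that assumption the 4 missing cells are spread over 144 cells in total, which agrees with the reported 2.8%.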
Variable types
Text | 1 |
---|---|
Numeric | 2 |
Categorical | 1 |
Dataset
Description | Inspection status of food products, agricultural products, and marine products distributed in the province |
---|---|
Author | 전라북도 (Jeollabuk-do) |
URL | https://www.bigdatahub.go.kr/index.jeonbuk?startPage=24&menuCd=DOM_000000103007001000&pListTypeStr=&pId=3084463 |
Alerts
비율 is highly overall correlated with 유통식품 and 1 other fields | High correlation |
유통식품 is highly overall correlated with 비율 and 1 other fields | High correlation |
부적합 is highly overall correlated with 비율 and 1 other fields | High correlation |
부적합 is highly imbalanced (69.1%) | Imbalance |
유통식품 has 4 (11.1%) missing values | Missing |
식품유형 has unique values | Unique |
비율 has 4 (11.1%) zeros | Zeros |
Reproduction
Analysis started | 2024-03-14 03:24:55.201795 |
---|---|
Analysis finished | 2024-03-14 03:24:55.692913 |
Duration | 0.49 seconds |
Software version | ydata-profiling v4.5.1 |
식품유형 (food type)
Text
UNIQUE
Distinct | 36 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 420.0 B |
Value | Count | Frequency (%) |
과자류 | 1 | 2.4% |
기타가공품 | 1 | 2.4% |
드레싱류 | 1 | 2.4% |
기타식품류 | 1 | 2.4% |
김치류 | 1 | 2.4% |
젓갈류 | 1 | 2.4% |
절임식품 | 1 | 2.4% |
조림식품 | 1 | 2.4% |
주류 | 1 | 2.4% |
건포류 | 1 | 2.4% |
Other values (32) | 32 | 76.2% |
Most occurring characters
Value | Count | Frequency (%) |
류 | 19 | 11.6% |
품 | 14 | 8.5% |
식 | 13 | 7.9% |
기 | 6 | 3.7% |
(space) | 6 | 3.7% |
가 | 4 | 2.4% |
용 | 4 | 2.4% |
장 | 3 | 1.8% |
조 | 3 | 1.8% |
공 | 3 | 1.8% |
Other values (72) | 89 | 54.3% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 156 | 95.1% |
Space Separator | 6 | 3.7% |
Open Punctuation | 1 | 0.6% |
Close Punctuation | 1 | 0.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
류 | 19 | 12.2% |
품 | 14 | 9.0% |
식 | 13 | 8.3% |
기 | 6 | 3.8% |
가 | 4 | 2.6% |
용 | 4 | 2.6% |
장 | 3 | 1.9% |
조 | 3 | 1.9% |
공 | 3 | 1.9% |
포 | 3 | 1.9% |
Other values (69) | 84 | 53.8% |
Space Separator
Value | Count | Frequency (%) |
(space) | 6 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 156 | 95.1% |
Common | 8 | 4.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
류 | 19 | 12.2% |
품 | 14 | 9.0% |
식 | 13 | 8.3% |
기 | 6 | 3.8% |
가 | 4 | 2.6% |
용 | 4 | 2.6% |
장 | 3 | 1.9% |
조 | 3 | 1.9% |
공 | 3 | 1.9% |
포 | 3 | 1.9% |
Other values (69) | 84 | 53.8% |
Common
Value | Count | Frequency (%) |
(space) | 6 | 75.0% |
( | 1 | 12.5% |
) | 1 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 156 | 95.1% |
ASCII | 8 | 4.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
류 | 19 | 12.2% |
품 | 14 | 9.0% |
식 | 13 | 8.3% |
기 | 6 | 3.8% |
가 | 4 | 2.6% |
용 | 4 | 2.6% |
장 | 3 | 1.9% |
조 | 3 | 1.9% |
공 | 3 | 1.9% |
포 | 3 | 1.9% |
Other values (69) | 84 | 53.8% |
ASCII
Value | Count | Frequency (%) |
(space) | 6 | 75.0% |
( | 1 | 12.5% |
) | 1 | 12.5% |
비율 (ratio)
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 23 |
---|---|
Distinct (%) | 63.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.7805556 |
Minimum | 0 |
---|---|
Maximum | 23.5 |
Zeros | 4 |
Zeros (%) | 11.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 456.0 B |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0.2 |
median | 0.85 |
Q3 | 2.85 |
95-th percentile | 12.325 |
Maximum | 23.5 |
Range | 23.5 |
Interquartile range (IQR) | 2.65 |
Descriptive statistics
Standard deviation | 4.8734745 |
---|---|
Coefficient of variation (CV) | 1.7526981 |
Kurtosis | 9.7377647 |
Mean | 2.7805556 |
Median Absolute Deviation (MAD) | 0.75 |
Skewness | 2.9687576 |
Sum | 100.1 |
Variance | 23.750754 |
Monotonicity | Not monotonic |
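Several of the figures above follow from one another. As a rough sketch (the function name `derived_stats` is hypothetical, and the inputs are simply copied from the tables above), the coefficient of variation, interquartile range, range, and variance can be recomputed from the mean, standard deviation, quartiles, and extremes:

```python
def derived_stats(mean, std, q1, q3, minimum, maximum):
    """Recompute summary figures that follow from the basic statistics."""
    return {
        "cv": std / mean,            # coefficient of variation
        "iqr": q3 - q1,              # interquartile range
        "range": maximum - minimum,  # spread between the extremes
        "variance": std ** 2,        # variance is the squared standard deviation
    }


# Inputs copied from the 비율 tables above.
stats = derived_stats(
    mean=2.7805556, std=4.8734745, q1=0.2, q3=2.85, minimum=0.0, maximum=23.5
)
for name, value in stats.items():
    print(f"{name}: {value:.4f}")
```

The recomputed values agree with the reported CV (1.7526981), IQR (2.65), range (23.5), and variance (23.750754) up to rounding of the inputs.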
Common values
Value | Count | Frequency (%) |
0.1 | 4 | 11.1% |
0.0 | 4 | 11.1% |
0.3 | 4 | 11.1% |
0.4 | 2 | 5.6% |
1.6 | 2 | 5.6% |
0.2 | 2 | 5.6% |
2.4 | 2 | 5.6% |
2.8 | 1 | 2.8% |
3.6 | 1 | 2.8% |
23.5 | 1 | 2.8% |
Other values (13) | 13 | 36.1% |
Minimum 10 values
Value | Count | Frequency (%) |
0.0 | 4 | 11.1% |
0.1 | 4 | 11.1% |
0.2 | 2 | 5.6% |
0.3 | 4 | 11.1% |
0.4 | 2 | 5.6% |
0.5 | 1 | 2.8% |
0.6 | 1 | 2.8% |
1.1 | 1 | 2.8% |
1.2 | 1 | 2.8% |
1.4 | 1 | 2.8% |
Maximum 10 values
Value | Count | Frequency (%) |
23.5 | 1 | 2.8% |
15.4 | 1 | 2.8% |
11.3 | 1 | 2.8% |
7.6 | 1 | 2.8% |
6.0 | 1 | 2.8% |
5.0 | 1 | 2.8% |
4.0 | 1 | 2.8% |
3.6 | 1 | 2.8% |
3.0 | 1 | 2.8% |
2.8 | 1 | 2.8% |
유통식품 (distributed food products)
Real number (ℝ)
HIGH CORRELATION, MISSING
Distinct | 28 |
---|---|
Distinct (%) | 87.5% |
Missing | 4 |
Missing (%) | 11.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 85.46875 |
Minimum | 2 |
---|---|
Maximum | 644 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 456.0 B |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 2.55 |
Q1 | 7.75 |
median | 35.5 |
Q3 | 85.25 |
95-th percentile | 358.75 |
Maximum | 644 |
Range | 642 |
Interquartile range (IQR) | 77.5 |
Descriptive statistics
Standard deviation | 138.80503 |
---|---|
Coefficient of variation (CV) | 1.6240443 |
Kurtosis | 8.645448 |
Mean | 85.46875 |
Median Absolute Deviation (MAD) | 30.5 |
Skewness | 2.8096172 |
Sum | 2735 |
Variance | 19266.838 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
3 | 2 | 5.6% |
2 | 2 | 5.6% |
7 | 2 | 5.6% |
8 | 2 | 5.6% |
10 | 1 | 2.8% |
6 | 1 | 2.8% |
98 | 1 | 2.8% |
644 | 1 | 2.8% |
307 | 1 | 2.8% |
43 | 1 | 2.8% |
Other values (18) | 18 | 50.0% |
(Missing) | 4 | 11.1% |
Minimum 10 values
Value | Count | Frequency (%) |
2 | 2 | 5.6% |
3 | 2 | 5.6% |
5 | 1 | 2.8% |
6 | 1 | 2.8% |
7 | 2 | 5.6% |
8 | 2 | 5.6% |
10 | 1 | 2.8% |
12 | 1 | 2.8% |
13 | 1 | 2.8% |
17 | 1 | 2.8% |
Maximum 10 values
Value | Count | Frequency (%) |
644 | 1 | 2.8% |
422 | 1 | 2.8% |
307 | 1 | 2.8% |
208 | 1 | 2.8% |
164 | 1 | 2.8% |
136 | 1 | 2.8% |
110 | 1 | 2.8% |
98 | 1 | 2.8% |
81 | 1 | 2.8% |
76 | 1 | 2.8% |
부적합 (nonconforming)
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 3 |
---|---|
Distinct (%) | 8.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 420.0 B |
<NA> | 33 |
---|---|
1 | 2 |
3 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.75 |
Min length | 1 |
Unique
Unique | 1 |
---|---|
Unique (%) | 2.8% |
Sample
1st row | 1 |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 33 | 91.7% |
1 | 2 | 5.6% |
3 | 1 | 2.8% |
Common Values (Plot)
Value | Count | Frequency (%) |
<NA> | 33 | 91.7% |
1 | 2 | 5.6% |
3 | 1 | 2.8% |
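The 69.1% imbalance figure reported for 부적합 appears consistent with a score of one minus the normalized Shannon entropy of the class counts. That formula is an assumption here, since the report does not state how the figure is derived, and the function name `imbalance_score` is hypothetical. A minimal sketch using the counts 33, 2, and 1 from the table above:

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of the class
    distribution and k is the number of classes.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # a single class has no balance to measure
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)


# Counts from the table above: <NA> = 33, "1" = 2, "3" = 1.
score = imbalance_score([33, 2, 1])
print(f"{score:.1%}")
```

With these counts the sketch yields roughly 0.69, which matches the alert. Note that <NA> is counted as a class of its own, which is why a column with only three non-missing rows is flagged as imbalanced rather than as mostly missing.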
Variable | 식품유형 | 비율 | 유통식품 | 부적합 |
---|---|---|---|---|
식품유형 | 1.000 | 1.000 | 1.000 | 1.000 |
비율 | 1.000 | 1.000 | 1.000 | 1.000 |
유통식품 | 1.000 | 1.000 | 1.000 | 1.000 |
부적합 | 1.000 | 1.000 | 1.000 | 1.000 |
Variable | 비율 | 유통식품 | 부적합 |
---|---|---|---|
비율 | 1.000 | 0.998 | 1.000 |
유통식품 | 0.998 | 1.000 | 1.000 |
부적합 | 1.000 | 1.000 | 1.000 |
Row index | 식품유형 | 비율 | 유통식품 | 부적합 |
---|---|---|---|---|
0 | 과자류 | 15.4 | 422 | 1 |
1 | 빵또는 떡류 | 2.4 | 67 | <NA> |
2 | 초콜릿류 | 3.0 | 81 | <NA> |
3 | 잼류 | 0.1 | 3 | <NA> |
4 | 설탕 | 0.1 | 2 | <NA> |
5 | 과당 | 0.0 | <NA> | <NA> |
6 | 엿류 | 0.3 | 7 | <NA> |
7 | 올리고당류 | 0.5 | 13 | <NA> |
8 | 식육또는 알가공품 | 0.0 | <NA> | <NA> |
9 | 어육가공품 | 1.6 | 45 | <NA> |
Row index | 식품유형 | 비율 | 유통식품 | 부적합 |
---|---|---|---|---|
26 | 기타식품류 | 5.0 | 136 | <NA> |
27 | 기타가공품 | 2.4 | 66 | <NA> |
28 | 장기보존식품 | 1.6 | 43 | <NA> |
29 | 건강기능식품 | 0.3 | 8 | <NA> |
30 | 식품첨가물 | 0.0 | <NA> | <NA> |
31 | 기구및용기포장 | 0.3 | 8 | <NA> |
32 | 식품접객업소(집단금식소 포함)의 조리식품 | 11.3 | 307 | 3 |
33 | 농산물 | 23.5 | 644 | <NA> |
34 | 수산물 | 3.6 | 98 | <NA> |
35 | 위생용품 | 0.2 | 6 | <NA> |