Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 752.0 KiB |
Average record size in memory | 77.0 B |
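The average record size follows from the total memory footprint divided by the number of observations. A minimal arithmetic sketch, assuming 1 KiB = 1024 bytes (the variable names are illustrative and not part of the report):

```python
# Figures copied from the "Dataset statistics" table.
total_kib = 752.0   # Total size in memory, in KiB
rows = 10000        # Number of observations

# Convert KiB to bytes, then spread the total across all rows.
avg_bytes = total_kib * 1024 / rows
print(f"{avg_bytes:.1f} B")  # 77.0 B
```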
Variable types
Text | 1 |
---|---|
Categorical | 2 |
Numeric | 5 |
Dataset
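Description | Distribution inspection data for agricultural and livestock products, managed by the National Agricultural Products Quality Management Service (국립농산물품질관리원). Fields: 처분년월 (disposition year-month), 업무구분명 (task category), 시도명 (province or city), 조사장소수 (sites inspected), 위반업소수 (businesses in violation), 형사처벌건수 (criminal penalties), 고발건수 (accusations filed), 과태료부과건수 (administrative fines imposed) |
---|---|
Author | National Agricultural Products Quality Management Service (국립농산물품질관리원) |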
URL | https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220204000000001683 |
Alerts
조사장소수 is highly correlated overall with 과태료부과건수 | High correlation |
위반업소수 is highly correlated overall with 형사처벌건수 and 과태료부과건수 | High correlation |
형사처벌건수 is highly correlated overall with 위반업소수 | High correlation |
과태료부과건수 is highly correlated overall with 조사장소수 and 위반업소수 | High correlation |
위반업소수 has 4163 (41.6%) zeros | Zeros |
형사처벌건수 has 6175 (61.8%) zeros | Zeros |
고발건수 has 9527 (95.3%) zeros | Zeros |
과태료부과건수 has 5555 (55.5%) zeros | Zeros |
Reproduction
Analysis started | 2024-03-23 07:51:21.845403 |
---|---|
Analysis finished | 2024-03-23 07:51:31.135188 |
Duration | 9.29 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
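A report of this kind can usually be regenerated from the source CSV. The following is a minimal sketch, assuming pandas and ydata-profiling are installed; the function name `build_report`, the file paths, and the report title are hypothetical, and the exact settings used for this run live in config.json rather than in this example.

```python
from pathlib import Path


def build_report(csv_path: str, out_path: str = "report.html") -> Path:
    """Profile the CSV at csv_path and write an HTML report to out_path."""
    # Imported lazily so this module still loads where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    # Default settings are used here; they may differ from this report's config.json.
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(out_path)
    return Path(out_path)
```

Calling `build_report("inspections.csv")` (a placeholder file name) would write `report.html` to the working directory.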
처분년월
Text
Distinct | 262 |
---|---|
Distinct (%) | 2.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
oct-11 | 74 | 0.7% |
jul-12 | 74 | 0.7% |
dec-12 | 74 | 0.7% |
nov-11 | 73 | 0.7% |
jan-12 | 73 | 0.7% |
may-12 | 72 | 0.7% |
jul-11 | 71 | 0.7% |
sep-12 | 71 | 0.7% |
apr-12 | 71 | 0.7% |
sep-11 | 69 | 0.7% |
Other values (252) | 9278 | 92.8% |
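처분년월 is profiled as text, but its values are month-year strings such as `Apr-01` or `May-98`. A minimal sketch of turning them into dates with the standard library, assuming English month abbreviations and the default locale; the helper name `parse_disposal_month` is hypothetical:

```python
from datetime import date, datetime


def parse_disposal_month(value: str) -> date:
    """Parse a month-year string such as 'Oct-11' into the first day of that month."""
    # %b matches the abbreviated month name (English under the default C locale).
    # %y is a two-digit year: Python maps 69-99 to 19xx and 00-68 to 20xx.
    return datetime.strptime(value, "%b-%y").date()


print(parse_disposal_month("Oct-11"))  # 2011-10-01
print(parse_disposal_month("May-98"))  # 1998-05-01
```

Parsing the column this way would let it be treated as a date rather than as 262 distinct text values.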
Most occurring characters
Value | Count | Frequency (%) |
- | 10000 | 16.7% |
1 | 7749 | 12.9% |
0 | 4008 | 6.7% |
a | 2536 | 4.2% |
u | 2533 | 4.2% |
J | 2529 | 4.2% |
e | 2469 | 4.1% |
A | 1715 | 2.9% |
r | 1688 | 2.8% |
M | 1684 | 2.8% |
Other values (23) | 23089 | 38.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 20000 | 33.3% |
Lowercase Letter | 20000 | 33.3% |
Dash Punctuation | 10000 | 16.7% |
Uppercase Letter | 10000 | 16.7% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 2536 | 12.7% |
u | 2533 | 12.7% |
e | 2469 | 12.3% |
r | 1688 | 8.4% |
n | 1665 | 8.3% |
p | 1661 | 8.3% |
c | 1609 | 8.0% |
l | 864 | 4.3% |
b | 860 | 4.3% |
g | 856 | 4.3% |
Other values (4) | 3259 | 16.3% |
Decimal Number
Value | Count | Frequency (%) |
1 | 7749 | 38.7% |
0 | 4008 | 20.0% |
2 | 1416 | 7.1% |
9 | 1215 | 6.1% |
8 | 1127 | 5.6% |
7 | 1004 | 5.0% |
6 | 964 | 4.8% |
3 | 864 | 4.3% |
5 | 855 | 4.3% |
4 | 798 | 4.0% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 2529 | 25.3% |
A | 1715 | 17.2% |
M | 1684 | 16.8% |
F | 860 | 8.6% |
D | 807 | 8.1% |
S | 802 | 8.0% |
O | 802 | 8.0% |
N | 801 | 8.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 10000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 30000 | 50.0% |
Latin | 30000 | 50.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
a | 2536 | 8.5% |
u | 2533 | 8.4% |
J | 2529 | 8.4% |
e | 2469 | 8.2% |
A | 1715 | 5.7% |
r | 1688 | 5.6% |
M | 1684 | 5.6% |
n | 1665 | 5.5% |
p | 1661 | 5.5% |
c | 1609 | 5.4% |
Other values (12) | 9911 | 33.0% |
Common
Value | Count | Frequency (%) |
- | 10000 | 33.3% |
1 | 7749 | 25.8% |
0 | 4008 | 13.4% |
2 | 1416 | 4.7% |
9 | 1215 | 4.0% |
8 | 1127 | 3.8% |
7 | 1004 | 3.3% |
6 | 964 | 3.2% |
3 | 864 | 2.9% |
5 | 855 | 2.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 60000 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 10000 | 16.7% |
1 | 7749 | 12.9% |
0 | 4008 | 6.7% |
a | 2536 | 4.2% |
u | 2533 | 4.2% |
J | 2529 | 4.2% |
e | 2469 | 4.1% |
A | 1715 | 2.9% |
r | 1688 | 2.8% |
M | 1684 | 2.8% |
Other values (23) | 23089 | 38.5% |
업무구분명
Categorical
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
원산지단속 | 3514 |
---|---|
양곡표시 | 2134 |
축산물이력 | 1893 |
GMO | 1244 |
미검사품 | 1206 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.4172 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 원산지단속 |
---|---|
2nd row | 양곡표시 |
3rd row | GMO |
4th row | 원산지단속 |
5th row | 원산지단속 |
Common Values
Value | Count | Frequency (%) |
원산지단속 | 3514 | 35.1% |
양곡표시 | 2134 | 21.3% |
축산물이력 | 1893 | 18.9% |
GMO | 1244 | 12.4% |
미검사품 | 1206 | 12.1% |
재사용화환 | 9 | 0.1% |
시도명
Categorical
Distinct | 17 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
전라남도 | 635 |
---|---|
전라북도 | 629 |
경상북도 | 621 |
경기도 | 615 |
충청북도 | 615 |
Other values (12) | 6885 |
Length
Max length | 7 |
---|---|
Median length | 5 |
Mean length | 4.6001 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 전라북도 |
---|---|
2nd row | 인천광역시 |
3rd row | 강원도 |
4th row | 충청남도 |
5th row | 대전광역시 |
Common Values
Value | Count | Frequency (%) |
전라남도 | 635 | 6.3% |
전라북도 | 629 | 6.3% |
경상북도 | 621 | 6.2% |
경기도 | 615 | 6.2% |
충청북도 | 615 | 6.2% |
충청남도 | 613 | 6.1% |
강원도 | 609 | 6.1% |
경상남도 | 604 | 6.0% |
서울특별시 | 592 | 5.9% |
제주특별자치도 | 591 | 5.9% |
Other values (7) | 3876 | 38.8% |
조사장소수
Real number (ℝ)
Alerts: High correlation
Distinct | 1996 |
---|---|
Distinct (%) | 20.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 474.2994 |
Minimum | 0 |
---|---|
Maximum | 13387 |
Zeros | 22 |
Zeros (%) | 0.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 3 |
Q1 | 31 |
median | 147 |
Q3 | 482 |
95-th percentile | 2306.6 |
Maximum | 13387 |
Range | 13387 |
Interquartile range (IQR) | 451 |
Descriptive statistics
Standard deviation | 858.98611 |
---|---|
Coefficient of variation (CV) | 1.811063 |
Kurtosis | 19.339293 |
Mean | 474.2994 |
Median Absolute Deviation (MAD) | 135 |
Skewness | 3.6136975 |
Sum | 4742994 |
Variance | 737857.14 |
Monotonicity | Not monotonic |
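Several of the statistics above for 조사장소수 can be cross-checked against one another: the coefficient of variation is the standard deviation divided by the mean, the variance is the standard deviation squared, and the IQR and range are simple differences. A minimal sketch using the reported figures (variable names are illustrative):

```python
# Figures copied from the 조사장소수 tables.
std = 858.98611
mean = 474.2994
q1, q3 = 31, 482
minimum, maximum = 0, 13387

cv = std / mean                  # compare with the reported CV of 1.811063
variance = std ** 2              # compare with the reported variance of 737857.14
iqr = q3 - q1                    # compare with the reported IQR of 451
value_range = maximum - minimum  # compare with the reported range of 13387

print(round(cv, 6), round(variance, 2), iqr, value_range)
```

Small differences in the last digits are expected, since the inputs are themselves rounded.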
Common Values
Value | Count | Frequency (%) |
1 | 211 | 2.1% |
2 | 148 | 1.5% |
3 | 124 | 1.2% |
4 | 100 | 1.0% |
7 | 99 | 1.0% |
6 | 98 | 1.0% |
5 | 95 | 0.9% |
11 | 90 | 0.9% |
10 | 89 | 0.9% |
8 | 85 | 0.9% |
Other values (1986) | 8861 | 88.6% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 22 | 0.2% |
1 | 211 | 2.1% |
2 | 148 | 1.5% |
3 | 124 | 1.2% |
4 | 100 | 1.0% |
5 | 95 | 0.9% |
6 | 98 | 1.0% |
7 | 99 | 1.0% |
8 | 85 | 0.9% |
9 | 73 | 0.7% |
Maximum 10 values
Value | Count | Frequency (%) |
13387 | 1 | < 0.1% |
10421 | 1 | < 0.1% |
8803 | 1 | < 0.1% |
8055 | 1 | < 0.1% |
7923 | 1 | < 0.1% |
7658 | 1 | < 0.1% |
7540 | 1 | < 0.1% |
7167 | 1 | < 0.1% |
6690 | 1 | < 0.1% |
6637 | 1 | < 0.1% |
위반업소수
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 102 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.6384 |
Minimum | 0 |
---|---|
Maximum | 189 |
Zeros | 4163 |
Zeros (%) | 41.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 9 |
95-th percentile | 38 |
Maximum | 189 |
Range | 189 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 13.91101 |
---|---|
Coefficient of variation (CV) | 1.8211942 |
Kurtosis | 15.268332 |
Mean | 7.6384 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 3.1512628 |
Sum | 76384 |
Variance | 193.5162 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 4163 | 41.6% |
1 | 941 | 9.4% |
2 | 623 | 6.2% |
3 | 459 | 4.6% |
4 | 345 | 3.5% |
5 | 304 | 3.0% |
6 | 241 | 2.4% |
7 | 193 | 1.9% |
8 | 187 | 1.9% |
9 | 175 | 1.8% |
Other values (92) | 2369 | 23.7% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 4163 | 41.6% |
1 | 941 | 9.4% |
2 | 623 | 6.2% |
3 | 459 | 4.6% |
4 | 345 | 3.5% |
5 | 304 | 3.0% |
6 | 241 | 2.4% |
7 | 193 | 1.9% |
8 | 187 | 1.9% |
9 | 175 | 1.8% |
Maximum 10 values
Value | Count | Frequency (%) |
189 | 1 | < 0.1% |
169 | 1 | < 0.1% |
141 | 1 | < 0.1% |
138 | 1 | < 0.1% |
132 | 1 | < 0.1% |
131 | 1 | < 0.1% |
128 | 1 | < 0.1% |
123 | 1 | < 0.1% |
116 | 1 | < 0.1% |
110 | 1 | < 0.1% |
형사처벌건수
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 79 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.5517 |
Minimum | 0 |
---|---|
Maximum | 131 |
Zeros | 6175 |
Zeros (%) | 61.8% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 4 |
95-th percentile | 25 |
Maximum | 131 |
Range | 131 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 9.5090993 |
---|---|
Coefficient of variation (CV) | 2.0891314 |
Kurtosis | 14.587192 |
Mean | 4.5517 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.1883696 |
Sum | 45517 |
Variance | 90.422969 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 6175 | 61.8% |
1 | 571 | 5.7% |
2 | 305 | 3.0% |
3 | 260 | 2.6% |
4 | 225 | 2.2% |
5 | 186 | 1.9% |
6 | 162 | 1.6% |
7 | 154 | 1.5% |
9 | 140 | 1.4% |
10 | 123 | 1.2% |
Other values (69) | 1699 | 17.0% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 6175 | 61.8% |
1 | 571 | 5.7% |
2 | 305 | 3.0% |
3 | 260 | 2.6% |
4 | 225 | 2.2% |
5 | 186 | 1.9% |
6 | 162 | 1.6% |
7 | 154 | 1.5% |
8 | 122 | 1.2% |
9 | 140 | 1.4% |
Maximum 10 values
Value | Count | Frequency (%) |
131 | 1 | < 0.1% |
93 | 1 | < 0.1% |
91 | 2 | < 0.1% |
86 | 1 | < 0.1% |
84 | 1 | < 0.1% |
81 | 1 | < 0.1% |
80 | 1 | < 0.1% |
74 | 1 | < 0.1% |
73 | 1 | < 0.1% |
70 | 2 | < 0.1% |
고발건수
Real number (ℝ)
Alerts: Zeros
Distinct | 19 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.1199 |
Minimum | 0 |
---|---|
Maximum | 19 |
Zeros | 9527 |
Zeros (%) | 95.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 19 |
Range | 19 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.8111657 |
---|---|
Coefficient of variation (CV) | 6.7653519 |
Kurtosis | 182.62246 |
Mean | 0.1199 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 11.925742 |
Sum | 1199 |
Variance | 0.65798979 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 9527 | 95.3% |
1 | 233 | 2.3% |
2 | 117 | 1.2% |
3 | 43 | 0.4% |
4 | 17 | 0.2% |
5 | 15 | 0.1% |
6 | 13 | 0.1% |
9 | 8 | 0.1% |
8 | 4 | < 0.1% |
10 | 4 | < 0.1% |
Other values (9) | 19 | 0.2% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 9527 | 95.3% |
1 | 233 | 2.3% |
2 | 117 | 1.2% |
3 | 43 | 0.4% |
4 | 17 | 0.2% |
5 | 15 | 0.1% |
6 | 13 | 0.1% |
7 | 4 | < 0.1% |
8 | 4 | < 0.1% |
9 | 8 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
19 | 1 | < 0.1% |
18 | 1 | < 0.1% |
17 | 1 | < 0.1% |
15 | 1 | < 0.1% |
14 | 4 | < 0.1% |
13 | 3 | < 0.1% |
12 | 2 | < 0.1% |
11 | 2 | < 0.1% |
10 | 4 | < 0.1% |
9 | 8 | 0.1% |
과태료부과건수
Real number (ℝ)
Alerts: High correlation, Zeros
Distinct | 68 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.9668 |
Minimum | 0 |
---|---|
Maximum | 182 |
Zeros | 5555 |
Zeros (%) | 55.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 3 |
95-th percentile | 14.05 |
Maximum | 182 |
Range | 182 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 6.9511531 |
---|---|
Coefficient of variation (CV) | 2.34298 |
Kurtosis | 110.9794 |
Mean | 2.9668 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 7.313083 |
Sum | 29668 |
Variance | 48.31853 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 5555 | 55.5% |
1 | 989 | 9.9% |
2 | 670 | 6.7% |
3 | 485 | 4.9% |
4 | 379 | 3.8% |
5 | 286 | 2.9% |
6 | 204 | 2.0% |
7 | 200 | 2.0% |
8 | 159 | 1.6% |
9 | 132 | 1.3% |
Other values (58) | 941 | 9.4% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 5555 | 55.5% |
1 | 989 | 9.9% |
2 | 670 | 6.7% |
3 | 485 | 4.9% |
4 | 379 | 3.8% |
5 | 286 | 2.9% |
6 | 204 | 2.0% |
7 | 200 | 2.0% |
8 | 159 | 1.6% |
9 | 132 | 1.3% |
Maximum 10 values
Value | Count | Frequency (%) |
182 | 1 | < 0.1% |
159 | 1 | < 0.1% |
136 | 1 | < 0.1% |
120 | 1 | < 0.1% |
93 | 1 | < 0.1% |
89 | 1 | < 0.1% |
87 | 1 | < 0.1% |
86 | 1 | < 0.1% |
72 | 1 | < 0.1% |
68 | 1 | < 0.1% |
Variable | 업무구분명 | 시도명 | 조사장소수 | 위반업소수 | 형사처벌건수 | 고발건수 | 과태료부과건수 |
---|---|---|---|---|---|---|---|
업무구분명 | 1.000 | 0.079 | 0.367 | 0.423 | 0.431 | 0.163 | 0.170 |
시도명 | 0.079 | 1.000 | 0.309 | 0.235 | 0.228 | 0.138 | 0.137 |
조사장소수 | 0.367 | 0.309 | 1.000 | 0.381 | 0.445 | 0.212 | 0.232 |
위반업소수 | 0.423 | 0.235 | 0.381 | 1.000 | 0.722 | 0.346 | 0.971 |
형사처벌건수 | 0.431 | 0.228 | 0.445 | 0.722 | 1.000 | 0.163 | 0.237 |
고발건수 | 0.163 | 0.138 | 0.212 | 0.346 | 0.163 | 1.000 | 0.207 |
과태료부과건수 | 0.170 | 0.137 | 0.232 | 0.971 | 0.237 | 0.207 | 1.000 |
Variable | 업무구분명 | 시도명 |
---|---|---|
업무구분명 | 1.000 | 0.037 |
시도명 | 0.037 | 1.000 |
Variable | 조사장소수 | 위반업소수 | 형사처벌건수 | 고발건수 | 과태료부과건수 | 업무구분명 | 시도명 |
---|---|---|---|---|---|---|---|
조사장소수 | 1.000 | 0.490 | 0.367 | 0.177 | 0.596 | 0.192 | 0.128 |
위반업소수 | 0.490 | 1.000 | 0.835 | 0.274 | 0.757 | 0.238 | 0.093 |
형사처벌건수 | 0.367 | 0.835 | 1.000 | 0.268 | 0.407 | 0.230 | 0.093 |
고발건수 | 0.177 | 0.274 | 0.268 | 1.000 | 0.197 | 0.086 | 0.054 |
과태료부과건수 | 0.596 | 0.757 | 0.407 | 0.197 | 1.000 | 0.090 | 0.053 |
업무구분명 | 0.192 | 0.238 | 0.230 | 0.086 | 0.090 | 1.000 | 0.037 |
시도명 | 0.128 | 0.093 | 0.093 | 0.054 | 0.053 | 0.037 | 1.000 |
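The high-correlation alerts in this report line up with the larger off-diagonal entries of the matrix above. A minimal sketch that lists those pairs, restricted to the five numeric columns and assuming a cutoff of 0.5; the cutoff and the helper name `high_pairs` are assumptions for illustration, not values stated in the report:

```python
from itertools import combinations

# Numeric block of the matrix above, copied row by row.
names = ["조사장소수", "위반업소수", "형사처벌건수", "고발건수", "과태료부과건수"]
matrix = [
    [1.000, 0.490, 0.367, 0.177, 0.596],
    [0.490, 1.000, 0.835, 0.274, 0.757],
    [0.367, 0.835, 1.000, 0.268, 0.407],
    [0.177, 0.274, 0.268, 1.000, 0.197],
    [0.596, 0.757, 0.407, 0.197, 1.000],
]
THRESHOLD = 0.5  # assumed cutoff, not taken from the report


def high_pairs(names, matrix, threshold):
    """Return (name_a, name_b, value) for each unordered pair at or above threshold."""
    return [
        (names[i], names[j], matrix[i][j])
        for i, j in combinations(range(len(names)), 2)
        if abs(matrix[i][j]) >= threshold
    ]


for a, b, value in high_pairs(names, matrix, THRESHOLD):
    print(f"{a} ~ {b}: {value:.3f}")
```

Under this assumed cutoff, the three pairs printed are the same ones named in the report's high-correlation alerts, and 고발건수 appears in none of them.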
Index | 처분년월 | 업무구분명 | 시도명 | 조사장소수 | 위반업소수 | 형사처벌건수 | 고발건수 | 과태료부과건수 |
---|---|---|---|---|---|---|---|---|
11727 | Apr-01 | 원산지단속 | 전라북도 | 46 | 24 | 24 | 0 | 0 |
1442 | Sep-18 | 양곡표시 | 인천광역시 | 392 | 0 | 0 | 0 | 0 |
10039 | Aug-07 | GMO | 강원도 | 247 | 0 | 0 | 0 | 0 |
9799 | Mar-08 | 원산지단속 | 충청남도 | 536 | 17 | 9 | 0 | 8 |
10858 | Jun-05 | 원산지단속 | 대전광역시 | 2 | 2 | 2 | 0 | 0 |
6105 | Oct-12 | 양곡표시 | 충청북도 | 254 | 0 | 0 | 0 | 0 |
2196 | Sep-17 | 양곡표시 | 세종특별자치시 | 40 | 0 | 0 | 0 | 0 |
6230 | Aug-12 | GMO | 인천광역시 | 39 | 0 | 0 | 0 | 0 |
9932 | Dec-07 | 원산지단속 | 전라남도 | 1619 | 9 | 2 | 0 | 7 |
12013 | May-98 | 원산지단속 | 부산광역시 | 1 | 1 | 0 | 1 | 0 |
Index | 처분년월 | 업무구분명 | 시도명 | 조사장소수 | 위반업소수 | 형사처벌건수 | 고발건수 | 과태료부과건수 |
---|---|---|---|---|---|---|---|---|
3041 | Aug-16 | 원산지단속 | 대구광역시 | 707 | 43 | 27 | 0 | 16 |
2974 | Sep-16 | 원산지단속 | 경상남도 | 4719 | 37 | 20 | 0 | 17 |
4190 | Feb-15 | 양곡표시 | 인천광역시 | 71 | 1 | 0 | 0 | 1 |
4621 | Jul-14 | 미검사품 | 광주광역시 | 2 | 0 | 0 | 0 | 0 |
6196 | Sep-12 | 원산지단속 | 인천광역시 | 873 | 18 | 14 | 0 | 4 |
6487 | May-12 | 미검사품 | 경상남도 | 231 | 3 | 3 | 0 | 0 |
7267 | Aug-11 | 양곡표시 | 서울특별시 | 758 | 5 | 1 | 0 | 4 |
4877 | Mar-14 | 미검사품 | 인천광역시 | 13 | 0 | 0 | 0 | 0 |
10353 | Dec-06 | 원산지단속 | 제주특별자치도 | 55 | 0 | 0 | 0 | 0 |
6152 | Sep-12 | GMO | 제주특별자치도 | 153 | 0 | 0 | 0 | 0 |