Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 830.1 KiB |
Average record size in memory | 85.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 5 |
Dataset
Description | 충청남도 전역에서 포획한 유해야생동물 중, 야생멧돼지의 포획일자, 포획 개체 수, 포획한 지점의 지목 등을 격자 단위로 제공하는 자료임※ 좌표 : 카텍좌표 |
---|---|
Author | 충청남도 |
URL | https://www.data.go.kr/data/15109317/fileData.do |
데이터 생성일 has constant value "" | Constant |
포획건수 is highly overall correlated with 동물국문명 | High correlation |
발생연월 is highly overall correlated with 동물국문명 | High correlation |
동물국문명 is highly overall correlated with 격자 아이디(ID) and 6 other fields | High correlation |
대표 지목명 is highly overall correlated with 대표 지목 코드 and 1 other fields | High correlation |
격자 아이디(ID) is highly overall correlated with 격자 가로 (X축) 좌표 and 1 other fields | High correlation |
격자 가로 (X축) 좌표 is highly overall correlated with 격자 아이디(ID) and 1 other fields | High correlation |
격자 세로 (Y축) 좌표 is highly overall correlated with 동물국문명 | High correlation |
대표 지목 코드 is highly overall correlated with 동물국문명 and 1 other fields | High correlation |
동물국문명 is highly imbalanced (88.9%) | Imbalance |
포획건수 is highly imbalanced (94.7%) | Imbalance |
Reproduction
Analysis started | 2024-03-15 01:10:02.005295 |
---|---|
Analysis finished | 2024-03-15 01:10:08.789094 |
Duration | 6.78 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
격자 아이디(ID)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 5940 |
---|---|
Distinct (%) | 59.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 296585.74 |
Minimum | 216459 |
---|---|
Maximum | 366386 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 216459 |
---|---|
5-th percentile | 250463 |
Q1 | 275409.5 |
median | 294433.5 |
Q3 | 318429 |
95-th percentile | 348392.1 |
Maximum | 366386 |
Range | 149927 |
Interquartile range (IQR) | 43019.5 |
Descriptive statistics
Standard deviation | 29076.461 |
---|---|
Coefficient of variation (CV) | 0.098037285 |
Kurtosis | -0.58549722 |
Mean | 296585.74 |
Median Absolute Deviation (MAD) | 21953.5 |
Skewness | 0.1484223 |
Sum | 2.9658574 × 109 |
Variance | 8.4544057 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
279486 | 6 | 0.1% |
324458 | 6 | 0.1% |
281477 | 6 | 0.1% |
279483 | 6 | 0.1% |
326438 | 5 | 0.1% |
347397 | 5 | 0.1% |
270413 | 5 | 0.1% |
278430 | 5 | 0.1% |
332415 | 5 | 0.1% |
323421 | 5 | 0.1% |
Other values (5930) | 9946 |
Value | Count | Frequency (%) |
216459 | 1 | < 0.1% |
217405 | 3 | |
222403 | 3 | |
223402 | 1 | < 0.1% |
227404 | 3 | |
228404 | 2 | |
228454 | 2 | |
231464 | 3 | |
232454 | 3 | |
232459 | 1 | < 0.1% |
Value | Count | Frequency (%) |
366386 | 2 | |
366381 | 2 | |
365389 | 1 | < 0.1% |
365388 | 1 | < 0.1% |
365387 | 3 | |
365385 | 1 | < 0.1% |
365384 | 1 | < 0.1% |
365382 | 1 | < 0.1% |
365381 | 1 | < 0.1% |
365380 | 2 |
격자 가로 (X축) 좌표
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 142 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 296855.3 |
Minimum | 216706 |
---|---|
Maximum | 366706 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 216706 |
---|---|
5-th percentile | 250706 |
Q1 | 275706 |
median | 294706 |
Q3 | 318706 |
95-th percentile | 348706 |
Maximum | 366706 |
Range | 150000 |
Interquartile range (IQR) | 43000 |
Descriptive statistics
Standard deviation | 29088.744 |
---|---|
Coefficient of variation (CV) | 0.09798964 |
Kurtosis | -0.58437407 |
Mean | 296855.3 |
Median Absolute Deviation (MAD) | 22000 |
Skewness | 0.14864985 |
Sum | 2.968553 × 109 |
Variance | 8.4615503 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
283706 | 154 | 1.5% |
278706 | 149 | 1.5% |
285706 | 148 | 1.5% |
290706 | 144 | 1.4% |
281706 | 143 | 1.4% |
279706 | 133 | 1.3% |
287706 | 132 | 1.3% |
320706 | 131 | 1.3% |
284706 | 131 | 1.3% |
282706 | 130 | 1.3% |
Other values (132) | 8605 |
Value | Count | Frequency (%) |
216706 | 1 | < 0.1% |
217706 | 3 | < 0.1% |
222706 | 3 | < 0.1% |
223706 | 1 | < 0.1% |
227706 | 3 | < 0.1% |
228706 | 4 | < 0.1% |
231706 | 3 | < 0.1% |
232706 | 7 | |
233706 | 14 | |
234706 | 13 |
Value | Count | Frequency (%) |
366706 | 4 | < 0.1% |
365706 | 12 | 0.1% |
364706 | 12 | 0.1% |
363706 | 14 | 0.1% |
362706 | 22 | |
361706 | 29 | |
360706 | 23 | |
359706 | 29 | |
358706 | 35 | |
357706 | 29 |
격자 세로 (Y축) 좌표
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 122 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 437409.1 |
Minimum | 375969 |
---|---|
Maximum | 496969 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 375969 |
---|---|
5-th percentile | 388969 |
Q1 | 409969 |
median | 439969 |
Q3 | 463969 |
95-th percentile | 481969 |
Maximum | 496969 |
Range | 121000 |
Interquartile range (IQR) | 54000 |
Descriptive statistics
Standard deviation | 30688.027 |
---|---|
Coefficient of variation (CV) | 0.070158638 |
Kurtosis | -1.228374 |
Mean | 437409.1 |
Median Absolute Deviation (MAD) | 26000 |
Skewness | -0.14473215 |
Sum | 4.374091 × 109 |
Variance | 9.4175499 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
463969 | 152 | 1.5% |
467969 | 146 | 1.5% |
462969 | 140 | 1.4% |
464969 | 136 | 1.4% |
461969 | 135 | 1.4% |
465969 | 134 | 1.3% |
459969 | 134 | 1.3% |
460969 | 132 | 1.3% |
468969 | 132 | 1.3% |
471969 | 128 | 1.3% |
Other values (112) | 8631 |
Value | Count | Frequency (%) |
375969 | 1 | < 0.1% |
376969 | 15 | 0.1% |
377969 | 14 | 0.1% |
378969 | 12 | 0.1% |
379969 | 26 | |
380969 | 31 | |
381969 | 43 | |
382969 | 38 | |
383969 | 49 | |
384969 | 39 |
Value | Count | Frequency (%) |
496969 | 1 | < 0.1% |
495969 | 7 | 0.1% |
494969 | 3 | < 0.1% |
493969 | 3 | < 0.1% |
492969 | 6 | 0.1% |
491969 | 12 | 0.1% |
490969 | 23 | |
489969 | 43 | |
488969 | 45 | |
487969 | 32 |
동물국문명
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
멧돼지 | 148 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9852 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9852 | |
멧돼지 | 148 | 1.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9852 | |
멧돼지 | 148 | 1.5% |
발생연월
Categorical
HIGH CORRELATION
 
Distinct | 13 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2020-03 | |
---|---|
2020-01 | |
2020-10 | |
2020-05 | |
2020-07 | |
Other values (8) |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-06 |
---|---|
2nd row | 2020-07 |
3rd row | 2020-10 |
4th row | 2020-05 |
5th row | 2020-01 |
Common Values
Value | Count | Frequency (%) |
2020-03 | 872 | |
2020-01 | 857 | |
2020-10 | 856 | |
2020-05 | 854 | |
2020-07 | 848 | |
2020-12 | 841 | |
2020-02 | 823 | |
2020-06 | 811 | |
2020-04 | 802 | |
2020-08 | 793 | |
Other values (3) | 1643 |
Length
Value | Count | Frequency (%) |
2020-03 | 872 | |
2020-01 | 857 | |
2020-10 | 856 | |
2020-05 | 854 | |
2020-07 | 848 | |
2020-12 | 841 | |
2020-02 | 823 | |
2020-06 | 811 | |
2020-04 | 802 | |
2020-08 | 793 | |
Other values (3) | 1643 |
대표 지목명
Categorical
HIGH CORRELATION
 
Distinct | 18 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
답 | |
---|---|
전 | |
임야 | |
대지 | |
도로 | |
Other values (13) | 270 |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.3434 |
Min length | 1 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 대지 |
---|---|
2nd row | 전 |
3rd row | 전 |
4th row | 답 |
5th row | 답 |
Common Values
Value | Count | Frequency (%) |
답 | 3997 | |
전 | 2759 | |
임야 | 2079 | |
대지 | 483 | 4.8% |
도로 | 412 | 4.1% |
잡종지 | 87 | 0.9% |
유지 | 60 | 0.6% |
공장용지 | 33 | 0.3% |
하천 | 30 | 0.3% |
구거 | 19 | 0.2% |
Other values (8) | 41 | 0.4% |
Length
Value | Count | Frequency (%) |
답 | 3997 | |
전 | 2759 | |
임야 | 2079 | |
대지 | 483 | 4.8% |
도로 | 412 | 4.1% |
잡종지 | 87 | 0.9% |
유지 | 60 | 0.6% |
공장용지 | 33 | 0.3% |
하천 | 30 | 0.3% |
구거 | 19 | 0.2% |
Other values (8) | 41 | 0.4% |
대표 지목 코드
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 18 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.604 |
Minimum | 1 |
---|---|
Maximum | 28 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 5 |
95-th percentile | 14 |
Maximum | 28 |
Range | 27 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 4.1305072 |
---|---|
Coefficient of variation (CV) | 1.1460897 |
Kurtosis | 12.254299 |
Mean | 3.604 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 3.1416838 |
Sum | 36040 |
Variance | 17.06109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 3997 | |
1 | 2759 | |
5 | 2079 | |
8 | 483 | 4.8% |
14 | 412 | 4.1% |
28 | 87 | 0.9% |
19 | 60 | 0.6% |
9 | 33 | 0.3% |
17 | 30 | 0.3% |
18 | 19 | 0.2% |
Other values (8) | 41 | 0.4% |
Value | Count | Frequency (%) |
1 | 2759 | |
2 | 3997 | |
3 | 10 | 0.1% |
4 | 1 | < 0.1% |
5 | 2079 | |
7 | 13 | 0.1% |
8 | 483 | 4.8% |
9 | 33 | 0.3% |
14 | 412 | 4.1% |
16 | 1 | < 0.1% |
Value | Count | Frequency (%) |
28 | 87 | 0.9% |
26 | 5 | 0.1% |
25 | 6 | 0.1% |
23 | 4 | < 0.1% |
22 | 1 | < 0.1% |
19 | 60 | 0.6% |
18 | 19 | 0.2% |
17 | 30 | 0.3% |
16 | 1 | < 0.1% |
14 | 412 |
포획건수
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
1 | 126 |
2 | 17 |
3 | 4 |
5 | 1 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9556 |
Min length | 1 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9852 | |
1 | 126 | 1.3% |
2 | 17 | 0.2% |
3 | 4 | < 0.1% |
5 | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9852 | |
1 | 126 | 1.3% |
2 | 17 | 0.2% |
3 | 4 | < 0.1% |
5 | 1 | < 0.1% |
데이터 생성일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2022-11-07 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2022-11-07 |
---|---|
2nd row | 2022-11-07 |
3rd row | 2022-11-07 |
4th row | 2022-11-07 |
5th row | 2022-11-07 |
Common Values
Value | Count | Frequency (%) |
2022-11-07 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2022-11-07 | 10000 |
격자 아이디(ID) | 격자 가로 (X축) 좌표 | 격자 세로 (Y축) 좌표 | 발생연월 | 대표 지목명 | 대표 지목 코드 | 포획건수 | |
---|---|---|---|---|---|---|---|
격자 아이디(ID) | 1.000 | 0.999 | 0.620 | 0.000 | 0.249 | 0.165 | 0.267 |
격자 가로 (X축) 좌표 | 0.999 | 1.000 | 0.626 | 0.000 | 0.242 | 0.165 | 0.318 |
격자 세로 (Y축) 좌표 | 0.620 | 0.626 | 1.000 | 0.021 | 0.327 | 0.136 | 0.244 |
발생연월 | 0.000 | 0.000 | 0.021 | 1.000 | 0.050 | 0.024 | 0.126 |
대표 지목명 | 0.249 | 0.242 | 0.327 | 0.050 | 1.000 | 1.000 | 0.000 |
대표 지목 코드 | 0.165 | 0.165 | 0.136 | 0.024 | 1.000 | 1.000 | 0.000 |
포획건수 | 0.267 | 0.318 | 0.244 | 0.126 | 0.000 | 0.000 | 1.000 |
포획건수 | 발생연월 | 동물국문명 | 대표 지목명 | |
---|---|---|---|---|
포획건수 | 1.000 | 0.071 | 1.000 | 0.000 |
발생연월 | 0.071 | 1.000 | 1.000 | 0.017 |
동물국문명 | 1.000 | 1.000 | 1.000 | 1.000 |
대표 지목명 | 0.000 | 0.017 | 1.000 | 1.000 |
격자 아이디(ID) | 격자 가로 (X축) 좌표 | 격자 세로 (Y축) 좌표 | 대표 지목 코드 | 동물국문명 | 발생연월 | 대표 지목명 | 포획건수 | |
---|---|---|---|---|---|---|---|---|
격자 아이디(ID) | 1.000 | 1.000 | -0.353 | -0.003 | 1.000 | 0.000 | 0.098 | 0.170 |
격자 가로 (X축) 좌표 | 1.000 | 1.000 | -0.362 | -0.003 | 1.000 | 0.000 | 0.095 | 0.204 |
격자 세로 (Y축) 좌표 | -0.353 | -0.362 | 1.000 | 0.031 | 1.000 | 0.009 | 0.131 | 0.143 |
대표 지목 코드 | -0.003 | -0.003 | 0.031 | 1.000 | 1.000 | 0.010 | 1.000 | 0.000 |
동물국문명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
발생연월 | 0.000 | 0.000 | 0.009 | 0.010 | 1.000 | 1.000 | 0.017 | 0.071 |
대표 지목명 | 0.098 | 0.095 | 0.131 | 1.000 | 1.000 | 0.017 | 1.000 | 0.000 |
포획건수 | 0.170 | 0.204 | 0.143 | 0.000 | 1.000 | 0.071 | 0.000 | 1.000 |
격자 아이디(ID) | 격자 가로 (X축) 좌표 | 격자 세로 (Y축) 좌표 | 동물국문명 | 발생연월 | 대표 지목명 | 대표 지목 코드 | 포획건수 | 데이터 생성일 | |
---|---|---|---|---|---|---|---|---|---|
41307 | 318391 | 318706 | 391969 | <NA> | 2020-06 | 대지 | 8 | <NA> | 2022-11-07 |
50046 | 286446 | 286706 | 446969 | <NA> | 2020-07 | 전 | 1 | <NA> | 2022-11-07 |
80639 | 316416 | 316706 | 416969 | <NA> | 2020-10 | 전 | 1 | <NA> | 2022-11-07 |
35262 | 251460 | 251706 | 460969 | <NA> | 2020-05 | 답 | 2 | <NA> | 2022-11-07 |
1782 | 273436 | 273706 | 436969 | <NA> | 2020-01 | 답 | 2 | <NA> | 2022-11-07 |
92856 | 249454 | 249706 | 454969 | <NA> | 2020-12 | 답 | 2 | <NA> | 2022-11-07 |
83912 | 289421 | 289706 | 421969 | <NA> | 2020-11 | 답 | 2 | <NA> | 2022-11-07 |
27068 | 252460 | 252706 | 460969 | <NA> | 2020-04 | 답 | 2 | <NA> | 2022-11-07 |
94256 | 276436 | 276706 | 436969 | <NA> | 2020-12 | 전 | 1 | <NA> | 2022-11-07 |
98185 | 336455 | 336706 | 455969 | <NA> | 2020-12 | 임야 | 5 | <NA> | 2022-11-07 |
격자 아이디(ID) | 격자 가로 (X축) 좌표 | 격자 세로 (Y축) 좌표 | 동물국문명 | 발생연월 | 대표 지목명 | 대표 지목 코드 | 포획건수 | 데이터 생성일 | |
---|---|---|---|---|---|---|---|---|---|
18824 | 252452 | 252706 | 452969 | <NA> | 2020-03 | 유지 | 19 | <NA> | 2022-11-07 |
83076 | 284476 | 284706 | 476969 | <NA> | 2020-11 | 답 | 2 | <NA> | 2022-11-07 |
72149 | 312434 | 312706 | 434969 | <NA> | 2020-09 | 도로 | 14 | <NA> | 2022-11-07 |
3090 | 269420 | 269706 | 420969 | <NA> | 2020-01 | 전 | 1 | <NA> | 2022-11-07 |
62964 | 297429 | 297706 | 429969 | <NA> | 2020-08 | 도로 | 14 | <NA> | 2022-11-07 |
34265 | 284473 | 284706 | 473969 | <NA> | 2020-05 | 임야 | 5 | <NA> | 2022-11-07 |
18697 | 248459 | 248706 | 459969 | <NA> | 2020-03 | 전 | 1 | <NA> | 2022-11-07 |
10864 | 259486 | 259706 | 486969 | <NA> | 2020-02 | 임야 | 5 | <NA> | 2022-11-07 |
19880 | 273419 | 273706 | 419969 | <NA> | 2020-03 | 답 | 2 | <NA> | 2022-11-07 |
52799 | 272479 | 272706 | 479969 | <NA> | 2020-07 | 임야 | 5 | <NA> | 2022-11-07 |