Overview

Dataset statistics

Number of variables9
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory830.1 KiB
Average record size in memory85.0 B

Variable types

Numeric4
Categorical5

Dataset

Description충청남도 전역에서 포획한 유해야생동물 중, 야생멧돼지의 포획일자, 포획 개체 수, 포획한 지점의 지목 등을 격자 단위로 제공하는 자료임※ 좌표 : 카텍좌표
Author충청남도
URLhttps://www.data.go.kr/data/15109317/fileData.do

Alerts

데이터 생성일 has constant value ""Constant
포획건수 is highly overall correlated with 동물국문명High correlation
발생연월 is highly overall correlated with 동물국문명High correlation
동물국문명 is highly overall correlated with 격자 아이디(ID) and 6 other fieldsHigh correlation
대표 지목명 is highly overall correlated with 대표 지목 코드 and 1 other fieldsHigh correlation
격자 아이디(ID) is highly overall correlated with 격자 가로 (X축) 좌표 and 1 other fieldsHigh correlation
격자 가로 (X축) 좌표 is highly overall correlated with 격자 아이디(ID) and 1 other fieldsHigh correlation
격자 세로 (Y축) 좌표 is highly overall correlated with 동물국문명High correlation
대표 지목 코드 is highly overall correlated with 동물국문명 and 1 other fieldsHigh correlation
동물국문명 is highly imbalanced (88.9%)Imbalance
포획건수 is highly imbalanced (94.7%)Imbalance

Reproduction

Analysis started2024-03-15 01:10:02.005295
Analysis finished2024-03-15 01:10:08.789094
Duration6.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

격자 아이디(ID)
Real number (ℝ)

HIGH CORRELATION 

Distinct5940
Distinct (%)59.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean296585.74
Minimum216459
Maximum366386
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T10:10:08.976582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum216459
5-th percentile250463
Q1275409.5
median294433.5
Q3318429
95-th percentile348392.1
Maximum366386
Range149927
Interquartile range (IQR)43019.5

Descriptive statistics

Standard deviation29076.461
Coefficient of variation (CV)0.098037285
Kurtosis-0.58549722
Mean296585.74
Median Absolute Deviation (MAD)21953.5
Skewness0.1484223
Sum2.9658574 × 109
Variance8.4544057 × 108
MonotonicityNot monotonic
2024-03-15T10:10:09.420438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
279486 6
 
0.1%
324458 6
 
0.1%
281477 6
 
0.1%
279483 6
 
0.1%
326438 5
 
0.1%
347397 5
 
0.1%
270413 5
 
0.1%
278430 5
 
0.1%
332415 5
 
0.1%
323421 5
 
0.1%
Other values (5930) 9946
99.5%
ValueCountFrequency (%)
216459 1
 
< 0.1%
217405 3
< 0.1%
222403 3
< 0.1%
223402 1
 
< 0.1%
227404 3
< 0.1%
228404 2
< 0.1%
228454 2
< 0.1%
231464 3
< 0.1%
232454 3
< 0.1%
232459 1
 
< 0.1%
ValueCountFrequency (%)
366386 2
< 0.1%
366381 2
< 0.1%
365389 1
 
< 0.1%
365388 1
 
< 0.1%
365387 3
< 0.1%
365385 1
 
< 0.1%
365384 1
 
< 0.1%
365382 1
 
< 0.1%
365381 1
 
< 0.1%
365380 2
< 0.1%

격자 가로 (X축) 좌표
Real number (ℝ)

HIGH CORRELATION 

Distinct142
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean296855.3
Minimum216706
Maximum366706
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T10:10:09.837232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum216706
5-th percentile250706
Q1275706
median294706
Q3318706
95-th percentile348706
Maximum366706
Range150000
Interquartile range (IQR)43000

Descriptive statistics

Standard deviation29088.744
Coefficient of variation (CV)0.09798964
Kurtosis-0.58437407
Mean296855.3
Median Absolute Deviation (MAD)22000
Skewness0.14864985
Sum2.968553 × 109
Variance8.4615503 × 108
MonotonicityNot monotonic
2024-03-15T10:10:10.257053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
283706 154
 
1.5%
278706 149
 
1.5%
285706 148
 
1.5%
290706 144
 
1.4%
281706 143
 
1.4%
279706 133
 
1.3%
287706 132
 
1.3%
320706 131
 
1.3%
284706 131
 
1.3%
282706 130
 
1.3%
Other values (132) 8605
86.1%
ValueCountFrequency (%)
216706 1
 
< 0.1%
217706 3
 
< 0.1%
222706 3
 
< 0.1%
223706 1
 
< 0.1%
227706 3
 
< 0.1%
228706 4
 
< 0.1%
231706 3
 
< 0.1%
232706 7
0.1%
233706 14
0.1%
234706 13
0.1%
ValueCountFrequency (%)
366706 4
 
< 0.1%
365706 12
 
0.1%
364706 12
 
0.1%
363706 14
 
0.1%
362706 22
0.2%
361706 29
0.3%
360706 23
0.2%
359706 29
0.3%
358706 35
0.4%
357706 29
0.3%

격자 세로 (Y축) 좌표
Real number (ℝ)

HIGH CORRELATION 

Distinct122
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean437409.1
Minimum375969
Maximum496969
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T10:10:10.791624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum375969
5-th percentile388969
Q1409969
median439969
Q3463969
95-th percentile481969
Maximum496969
Range121000
Interquartile range (IQR)54000

Descriptive statistics

Standard deviation30688.027
Coefficient of variation (CV)0.070158638
Kurtosis-1.228374
Mean437409.1
Median Absolute Deviation (MAD)26000
Skewness-0.14473215
Sum4.374091 × 109
Variance9.4175499 × 108
MonotonicityNot monotonic
2024-03-15T10:10:11.255550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
463969 152
 
1.5%
467969 146
 
1.5%
462969 140
 
1.4%
464969 136
 
1.4%
461969 135
 
1.4%
465969 134
 
1.3%
459969 134
 
1.3%
460969 132
 
1.3%
468969 132
 
1.3%
471969 128
 
1.3%
Other values (112) 8631
86.3%
ValueCountFrequency (%)
375969 1
 
< 0.1%
376969 15
 
0.1%
377969 14
 
0.1%
378969 12
 
0.1%
379969 26
0.3%
380969 31
0.3%
381969 43
0.4%
382969 38
0.4%
383969 49
0.5%
384969 39
0.4%
ValueCountFrequency (%)
496969 1
 
< 0.1%
495969 7
 
0.1%
494969 3
 
< 0.1%
493969 3
 
< 0.1%
492969 6
 
0.1%
491969 12
 
0.1%
490969 23
0.2%
489969 43
0.4%
488969 45
0.4%
487969 32
0.3%

동물국문명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9852 
멧돼지
 
148

Length

Max length4
Median length4
Mean length3.9852
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9852
98.5%
멧돼지 148
 
1.5%

Length

2024-03-15T10:10:11.719812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T10:10:11.890926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9852
98.5%
멧돼지 148
 
1.5%

발생연월
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2020-03
872 
2020-01
857 
2020-10
856 
2020-05
854 
2020-07
848 
Other values (8)
5713 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-06
2nd row2020-07
3rd row2020-10
4th row2020-05
5th row2020-01

Common Values

ValueCountFrequency (%)
2020-03 872
8.7%
2020-01 857
8.6%
2020-10 856
8.6%
2020-05 854
8.5%
2020-07 848
8.5%
2020-12 841
8.4%
2020-02 823
8.2%
2020-06 811
8.1%
2020-04 802
8.0%
2020-08 793
7.9%
Other values (3) 1643
16.4%

Length

2024-03-15T10:10:12.062545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2020-03 872
8.7%
2020-01 857
8.6%
2020-10 856
8.6%
2020-05 854
8.5%
2020-07 848
8.5%
2020-12 841
8.4%
2020-02 823
8.2%
2020-06 811
8.1%
2020-04 802
8.0%
2020-08 793
7.9%
Other values (3) 1643
16.4%

대표 지목명
Categorical

HIGH CORRELATION 

Distinct18
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3997 
2759 
임야
2079 
대지
483 
도로
412 
Other values (13)
 
270

Length

Max length4
Median length1
Mean length1.3434
Min length1

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row대지
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
3997
40.0%
2759
27.6%
임야 2079
20.8%
대지 483
 
4.8%
도로 412
 
4.1%
잡종지 87
 
0.9%
유지 60
 
0.6%
공장용지 33
 
0.3%
하천 30
 
0.3%
구거 19
 
0.2%
Other values (8) 41
 
0.4%

Length

2024-03-15T10:10:12.307218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
3997
40.0%
2759
27.6%
임야 2079
20.8%
대지 483
 
4.8%
도로 412
 
4.1%
잡종지 87
 
0.9%
유지 60
 
0.6%
공장용지 33
 
0.3%
하천 30
 
0.3%
구거 19
 
0.2%
Other values (8) 41
 
0.4%

대표 지목 코드
Real number (ℝ)

HIGH CORRELATION 

Distinct18
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.604
Minimum1
Maximum28
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T10:10:12.682892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q35
95-th percentile14
Maximum28
Range27
Interquartile range (IQR)4

Descriptive statistics

Standard deviation4.1305072
Coefficient of variation (CV)1.1460897
Kurtosis12.254299
Mean3.604
Median Absolute Deviation (MAD)1
Skewness3.1416838
Sum36040
Variance17.06109
MonotonicityNot monotonic
2024-03-15T10:10:12.992810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
2 3997
40.0%
1 2759
27.6%
5 2079
20.8%
8 483
 
4.8%
14 412
 
4.1%
28 87
 
0.9%
19 60
 
0.6%
9 33
 
0.3%
17 30
 
0.3%
18 19
 
0.2%
Other values (8) 41
 
0.4%
ValueCountFrequency (%)
1 2759
27.6%
2 3997
40.0%
3 10
 
0.1%
4 1
 
< 0.1%
5 2079
20.8%
7 13
 
0.1%
8 483
 
4.8%
9 33
 
0.3%
14 412
 
4.1%
16 1
 
< 0.1%
ValueCountFrequency (%)
28 87
 
0.9%
26 5
 
0.1%
25 6
 
0.1%
23 4
 
< 0.1%
22 1
 
< 0.1%
19 60
 
0.6%
18 19
 
0.2%
17 30
 
0.3%
16 1
 
< 0.1%
14 412
4.1%

포획건수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9852 
1
 
126
2
 
17
3
 
4
5
 
1

Length

Max length4
Median length4
Mean length3.9556
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9852
98.5%
1 126
 
1.3%
2 17
 
0.2%
3 4
 
< 0.1%
5 1
 
< 0.1%

Length

2024-03-15T10:10:13.220883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T10:10:13.521024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9852
98.5%
1 126
 
1.3%
2 17
 
0.2%
3 4
 
< 0.1%
5 1
 
< 0.1%

데이터 생성일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022-11-07
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-11-07
2nd row2022-11-07
3rd row2022-11-07
4th row2022-11-07
5th row2022-11-07

Common Values

ValueCountFrequency (%)
2022-11-07 10000
100.0%

Length

2024-03-15T10:10:13.877600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T10:10:14.123574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-11-07 10000
100.0%

Interactions

2024-03-15T10:10:06.941017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:03.190299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:04.606547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:05.814319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:07.215148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:03.463300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:04.894373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:06.104443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:07.490492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:03.953666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:05.179894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:06.391219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:07.759549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:04.236673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:05.542029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T10:10:06.665716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T10:10:14.228627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
격자 아이디(ID)격자 가로 (X축) 좌표격자 세로 (Y축) 좌표발생연월대표 지목명대표 지목 코드포획건수
격자 아이디(ID)1.0000.9990.6200.0000.2490.1650.267
격자 가로 (X축) 좌표0.9991.0000.6260.0000.2420.1650.318
격자 세로 (Y축) 좌표0.6200.6261.0000.0210.3270.1360.244
발생연월0.0000.0000.0211.0000.0500.0240.126
대표 지목명0.2490.2420.3270.0501.0001.0000.000
대표 지목 코드0.1650.1650.1360.0241.0001.0000.000
포획건수0.2670.3180.2440.1260.0000.0001.000
2024-03-15T10:10:14.442327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
포획건수발생연월동물국문명대표 지목명
포획건수1.0000.0711.0000.000
발생연월0.0711.0001.0000.017
동물국문명1.0001.0001.0001.000
대표 지목명0.0000.0171.0001.000
2024-03-15T10:10:14.934978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
격자 아이디(ID)격자 가로 (X축) 좌표격자 세로 (Y축) 좌표대표 지목 코드동물국문명발생연월대표 지목명포획건수
격자 아이디(ID)1.0001.000-0.353-0.0031.0000.0000.0980.170
격자 가로 (X축) 좌표1.0001.000-0.362-0.0031.0000.0000.0950.204
격자 세로 (Y축) 좌표-0.353-0.3621.0000.0311.0000.0090.1310.143
대표 지목 코드-0.003-0.0030.0311.0001.0000.0101.0000.000
동물국문명1.0001.0001.0001.0001.0001.0001.0001.000
발생연월0.0000.0000.0090.0101.0001.0000.0170.071
대표 지목명0.0980.0950.1311.0001.0000.0171.0000.000
포획건수0.1700.2040.1430.0001.0000.0710.0001.000

Missing values

2024-03-15T10:10:08.115789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T10:10:08.588597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

격자 아이디(ID)격자 가로 (X축) 좌표격자 세로 (Y축) 좌표동물국문명발생연월대표 지목명대표 지목 코드포획건수데이터 생성일
41307318391318706391969<NA>2020-06대지8<NA>2022-11-07
50046286446286706446969<NA>2020-071<NA>2022-11-07
80639316416316706416969<NA>2020-101<NA>2022-11-07
35262251460251706460969<NA>2020-052<NA>2022-11-07
1782273436273706436969<NA>2020-012<NA>2022-11-07
92856249454249706454969<NA>2020-122<NA>2022-11-07
83912289421289706421969<NA>2020-112<NA>2022-11-07
27068252460252706460969<NA>2020-042<NA>2022-11-07
94256276436276706436969<NA>2020-121<NA>2022-11-07
98185336455336706455969<NA>2020-12임야5<NA>2022-11-07
격자 아이디(ID)격자 가로 (X축) 좌표격자 세로 (Y축) 좌표동물국문명발생연월대표 지목명대표 지목 코드포획건수데이터 생성일
18824252452252706452969<NA>2020-03유지19<NA>2022-11-07
83076284476284706476969<NA>2020-112<NA>2022-11-07
72149312434312706434969<NA>2020-09도로14<NA>2022-11-07
3090269420269706420969<NA>2020-011<NA>2022-11-07
62964297429297706429969<NA>2020-08도로14<NA>2022-11-07
34265284473284706473969<NA>2020-05임야5<NA>2022-11-07
18697248459248706459969<NA>2020-031<NA>2022-11-07
10864259486259706486969<NA>2020-02임야5<NA>2022-11-07
19880273419273706419969<NA>2020-032<NA>2022-11-07
52799272479272706479969<NA>2020-07임야5<NA>2022-11-07