Overview

Dataset statistics

Number of variables: 9
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 830.1 KiB
Average record size in memory: 85.0 B
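
The average record size follows from the two figures above it. A minimal sketch of that arithmetic, assuming only that 1 KiB is 1024 bytes (the variable names are illustrative, not taken from the report):

```python
# Figures copied from the Dataset statistics block.
total_size_kib = 830.1
n_observations = 10_000

# 1 KiB = 1024 bytes, so convert and divide by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")  # 85.0 B
```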

Variable types

Numeric: 4
Categorical: 3
DateTime: 2

Dataset

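Description: Data on wild boars among the harmful wild animals captured across Chungcheongnam-do, providing the capture date, the number of individuals captured, and the land category (지목) of the capture location, aggregated by grid cell.
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=71&beforeMenuCd=DOM_000000201001001000&publicdatapk=15109317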

Alerts

데이터 생성일 has constant value "" (Constant)
동물국문명 is highly overall correlated with 격자 아이디(ID) and 5 other fields (High correlation)
포획건수 is highly overall correlated with 동물국문명 (High correlation)
대표 지목명 is highly overall correlated with 대표 지목 코드 and 1 other fields (High correlation)
격자 아이디(ID) is highly overall correlated with 격자 가로 (X축) 좌표 and 1 other fields (High correlation)
격자 가로 (X축) 좌표 is highly overall correlated with 격자 아이디(ID) and 1 other fields (High correlation)
격자 세로 (Y축) 좌표 is highly overall correlated with 동물국문명 (High correlation)
대표 지목 코드 is highly overall correlated with 동물국문명 and 1 other fields (High correlation)
동물국문명 is highly imbalanced (88.6%) (Imbalance)
포획건수 is highly imbalanced (94.6%) (Imbalance)
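
The two imbalance percentages are consistent with a normalized-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the class frequencies and k is the number of classes. That the profiler computes the score exactly this way is an assumption. The sketch below only shows that the formula reproduces the reported figures from the value counts listed later in the report; `imbalance_score` is an illustrative name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/log2(k): 0 for a uniform distribution, near 1 for a dominant class."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Shannon entropy in bits, skipping empty classes.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

animal_counts = [9848, 152]              # 동물국문명: <NA>, 멧돼지
capture_counts = [9848, 130, 13, 5, 4]   # 포획건수: <NA>, 1, 2, 3, 4

print(f"{imbalance_score(animal_counts):.1%}")   # 88.6%
print(f"{imbalance_score(capture_counts):.1%}")  # 94.6%
```

Both columns are dominated by the 9848 rows without a capture, which is why the scores sit close to 100%.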

Reproduction

Analysis started: 2024-01-09 21:47:30.267107
Analysis finished: 2024-01-09 21:47:32.721689
Duration: 2.45 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

격자 아이디(ID)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 5991
Distinct (%): 59.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 296512.83
Minimum: 217405
Maximum: 366385
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 217405
5-th percentile: 251458.95
Q1: 275407.5
median: 294407
Q3: 318391.25
95-th percentile: 348393
Maximum: 366385
Range: 148980
Interquartile range (IQR): 42983.75

Descriptive statistics

Standard deviation: 28717.465
Coefficient of variation (CV): 0.096850664
Kurtosis: -0.56779947
Mean: 296512.83
Median Absolute Deviation (MAD): 21051
Skewness: 0.20429805
Sum: 2.9651283 × 10^9
Variance: 8.2469279 × 10^8
Monotonicity: Not monotonic
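
Several of the figures above are derived from the others. A short sketch of those relationships, using the values reported for 격자 아이디(ID); the inputs are the rounded numbers shown in the report, so the results match only to display precision:

```python
# Values copied from the statistics for 격자 아이디(ID).
minimum, maximum = 217405, 366385
q1, q3 = 275407.5, 318391.25
mean, std = 296512.83, 28717.465
n = 10_000

value_range = maximum - minimum   # 148980
iqr = q3 - q1                     # 42983.75
cv = std / mean                   # about 0.09685 (coefficient of variation)
variance = std ** 2               # about 8.2469e8
total = mean * n                  # about 2.9651e9 (the reported Sum)

print(value_range, iqr, round(cv, 6), f"{variance:.4e}", f"{total:.4e}")
```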
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
306446 | 7 | 0.1%
294475 | 7 | 0.1%
348393 | 6 | 0.1%
263477 | 6 | 0.1%
323481 | 6 | 0.1%
264478 | 6 | 0.1%
273452 | 5 | 0.1%
343467 | 5 | 0.1%
263466 | 5 | 0.1%
328422 | 5 | 0.1%
Other values (5981) | 9942 | 99.4%

Value | Count | Frequency (%)
217405 | 2 | < 0.1%
222403 | 2 | < 0.1%
223402 | 1 | < 0.1%
225405 | 1 | < 0.1%
228454 | 2 | < 0.1%
231464 | 1 | < 0.1%
232454 | 2 | < 0.1%
232464 | 1 | < 0.1%
233454 | 1 | < 0.1%
233459 | 1 | < 0.1%

Value | Count | Frequency (%)
366385 | 1 | < 0.1%
365389 | 1 | < 0.1%
365388 | 1 | < 0.1%
365387 | 3 | < 0.1%
365386 | 2 | < 0.1%
365385 | 1 | < 0.1%
365384 | 2 | < 0.1%
365382 | 1 | < 0.1%
365381 | 1 | < 0.1%
365380 | 2 | < 0.1%

격자 가로 (X축) 좌표
Real number (ℝ)

HIGH CORRELATION 

Distinct: 141
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 296782.2
Minimum: 217706
Maximum: 366706
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 217706
5-th percentile: 251706
Q1: 275706
median: 294706
Q3: 318706
95-th percentile: 348706
Maximum: 366706
Range: 149000
Interquartile range (IQR): 43000

Descriptive statistics

Standard deviation: 28729.576
Coefficient of variation (CV): 0.096803568
Kurtosis: -0.56654353
Mean: 296782.2
Median Absolute Deviation (MAD): 21000
Skewness: 0.20453133
Sum: 2.967822 × 10^9
Variance: 8.2538853 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
279706 | 155 | 1.6%
287706 | 154 | 1.5%
276706 | 151 | 1.5%
288706 | 143 | 1.4%
283706 | 140 | 1.4%
284706 | 138 | 1.4%
270706 | 136 | 1.4%
275706 | 133 | 1.3%
289706 | 132 | 1.3%
277706 | 131 | 1.3%
Other values (131) | 8587 | 85.9%

Value | Count | Frequency (%)
217706 | 2 | < 0.1%
222706 | 2 | < 0.1%
223706 | 1 | < 0.1%
225706 | 1 | < 0.1%
228706 | 2 | < 0.1%
231706 | 1 | < 0.1%
232706 | 3 | < 0.1%
233706 | 7 | 0.1%
234706 | 3 | < 0.1%
235706 | 10 | 0.1%

Value | Count | Frequency (%)
366706 | 1 | < 0.1%
365706 | 16 | 0.2%
364706 | 12 | 0.1%
363706 | 23 | 0.2%
362706 | 29 | 0.3%
361706 | 30 | 0.3%
360706 | 35 | 0.4%
359706 | 27 | 0.3%
358706 | 32 | 0.3%
357706 | 42 | 0.4%

격자 세로 (Y축) 좌표
Real number (ℝ)

HIGH CORRELATION 

Distinct: 122
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 437603.1
Minimum: 375969
Maximum: 496969
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 375969
5-th percentile: 387969
Q1: 409969
median: 440969
Q3: 463969
95-th percentile: 480969
Maximum: 496969
Range: 121000
Interquartile range (IQR): 54000

Descriptive statistics

Standard deviation: 30711.883
Coefficient of variation (CV): 0.07018205
Kurtosis: -1.2179782
Mean: 437603.1
Median Absolute Deviation (MAD): 26000
Skewness: -0.16091028
Sum: 4.376031 × 10^9
Variance: 9.4321974 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
463969 | 156 | 1.6%
466969 | 150 | 1.5%
461969 | 144 | 1.4%
462969 | 141 | 1.4%
464969 | 141 | 1.4%
465969 | 136 | 1.4%
468969 | 136 | 1.4%
459969 | 135 | 1.4%
458969 | 131 | 1.3%
471969 | 130 | 1.3%
Other values (112) | 8600 | 86.0%

Value | Count | Frequency (%)
375969 | 4 | < 0.1%
376969 | 13 | 0.1%
377969 | 10 | 0.1%
378969 | 16 | 0.2%
379969 | 26 | 0.3%
380969 | 39 | 0.4%
381969 | 36 | 0.4%
382969 | 42 | 0.4%
383969 | 47 | 0.5%
384969 | 59 | 0.6%

Value | Count | Frequency (%)
496969 | 4 | < 0.1%
495969 | 1 | < 0.1%
494969 | 13 | 0.1%
493969 | 10 | 0.1%
492969 | 10 | 0.1%
491969 | 12 | 0.1%
490969 | 20 | 0.2%
489969 | 24 | 0.2%
488969 | 37 | 0.4%
487969 | 40 | 0.4%

동물국문명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

<NA> | 9848
멧돼지 | 152

Length

Max length: 4
Median length: 4
Mean length: 3.9848
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 9848 | 98.5%
멧돼지 | 152 | 1.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 9848 | 98.5%
멧돼지 | 152 | 1.5%

발생연월
Date

Distinct: 12
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2020-01-01 00:00:00
Maximum: 2020-12-01 00:00:00
Histogram with fixed size bins (bins=12)

대표 지목명
Categorical

HIGH CORRELATION 

Distinct: 18
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

 | 3855
 | 2811
임야 | 2147
대지 | 503
도로 | 428
Other values (13) | 256

Length

Max length: 4
Median length: 1
Mean length: 1.3527
Min length: 1

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 도로
2nd row:
3rd row: 임야
4th row:
5th row: 임야

Common Values

Value | Count | Frequency (%)
 | 3855 | 38.6%
 | 2811 | 28.1%
임야 | 2147 | 21.5%
대지 | 503 | 5.0%
도로 | 428 | 4.3%
잡종지 | 80 | 0.8%
유지 | 56 | 0.6%
공장용지 | 23 | 0.2%
구거 | 22 | 0.2%
하천 | 19 | 0.2%
Other values (8) | 56 | 0.6%

Length

Histogram of lengths of the category

대표 지목 코드
Real number (ℝ)

HIGH CORRELATION 

Distinct: 18
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.6282
Minimum: 1
Maximum: 28
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
median: 2
Q3: 5
95-th percentile: 14
Maximum: 28
Range: 27
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 4.1217549
Coefficient of variation (CV): 1.136033
Kurtosis: 11.814556
Mean: 3.6282
Median Absolute Deviation (MAD): 1
Skewness: 3.0803838
Sum: 36282
Variance: 16.988864
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=18)
Value | Count | Frequency (%)
2 | 3855 | 38.6%
1 | 2811 | 28.1%
5 | 2147 | 21.5%
8 | 503 | 5.0%
14 | 428 | 4.3%
28 | 80 | 0.8%
19 | 56 | 0.6%
9 | 23 | 0.2%
18 | 22 | 0.2%
17 | 19 | 0.2%
Other values (8) | 56 | 0.6%

Value | Count | Frequency (%)
1 | 2811 | 28.1%
2 | 3855 | 38.6%
3 | 15 | 0.1%
4 | 4 | < 0.1%
5 | 2147 | 21.5%
7 | 9 | 0.1%
8 | 503 | 5.0%
9 | 23 | 0.2%
14 | 428 | 4.3%
16 | 1 | < 0.1%

Value | Count | Frequency (%)
28 | 80 | 0.8%
26 | 2 | < 0.1%
25 | 8 | 0.1%
23 | 14 | 0.1%
22 | 3 | < 0.1%
19 | 56 | 0.6%
18 | 22 | 0.2%
17 | 19 | 0.2%
16 | 1 | < 0.1%
14 | 428 | 4.3%

포획건수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

<NA> | 9848
1 | 130
2 | 13
3 | 5
4 | 4

Length

Max length: 4
Median length: 4
Mean length: 3.9544
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 9848 | 98.5%
1 | 130 | 1.3%
2 | 13 | 0.1%
3 | 5 | 0.1%
4 | 4 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 9848 | 98.5%
1 | 130 | 1.3%
2 | 13 | 0.1%
3 | 5 | < 0.1%
4 | 4 | < 0.1%

데이터 생성일
Date

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2022-11-07 00:00:00
Maximum: 2022-11-07 00:00:00
Histogram with fixed size bins (bins=1)

Interactions

[Figure: 16 interaction plots; images not preserved]

Correlations

(variable) | 격자 아이디(ID) | 격자 가로 (X축) 좌표 | 격자 세로 (Y축) 좌표 | 발생연월 | 대표 지목명 | 대표 지목 코드 | 포획건수
격자 아이디(ID) | 1.000 | 1.000 | 0.618 | 0.129 | 0.239 | 0.175 | 0.218
격자 가로 (X축) 좌표 | 1.000 | 1.000 | 0.618 | 0.129 | 0.239 | 0.175 | 0.218
격자 세로 (Y축) 좌표 | 0.618 | 0.618 | 1.000 | 0.045 | 0.283 | 0.146 | 0.156
발생연월 | 0.129 | 0.129 | 0.045 | 1.000 | 0.000 | 0.000 | 0.189
대표 지목명 | 0.239 | 0.239 | 0.283 | 0.000 | 1.000 | 1.000 | 0.000
대표 지목 코드 | 0.175 | 0.175 | 0.146 | 0.000 | 1.000 | 1.000 | 0.000
포획건수 | 0.218 | 0.218 | 0.156 | 0.189 | 0.000 | 0.000 | 1.000

(variable) | 동물국문명 | 포획건수 | 대표 지목명
동물국문명 | 1.000 | 1.000 | 1.000
포획건수 | 1.000 | 1.000 | 0.000
대표 지목명 | 1.000 | 0.000 | 1.000

(variable) | 격자 아이디(ID) | 격자 가로 (X축) 좌표 | 격자 세로 (Y축) 좌표 | 대표 지목 코드 | 동물국문명 | 대표 지목명 | 포획건수
격자 아이디(ID) | 1.000 | 1.000 | -0.344 | 0.010 | 1.000 | 0.094 | 0.137
격자 가로 (X축) 좌표 | 1.000 | 1.000 | -0.353 | 0.010 | 1.000 | 0.094 | 0.137
격자 세로 (Y축) 좌표 | -0.344 | -0.353 | 1.000 | 0.047 | 1.000 | 0.112 | 0.089
대표 지목 코드 | 0.010 | 0.010 | 0.047 | 1.000 | 1.000 | 1.000 | 0.000
동물국문명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
대표 지목명 | 0.094 | 0.094 | 0.112 | 1.000 | 1.000 | 1.000 | 0.000
포획건수 | 0.137 | 0.137 | 0.089 | 0.000 | 1.000 | 0.000 | 1.000
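
The 1.000 between 동물국문명 and 포획건수 is what a categorical association measure gives when one column fully determines the other. The sketch below uses plain (uncorrected) Cramér's V as an illustration. Whether the report's matrices use this measure, and with which bias correction, is an assumption. The cross-tabulation is also an assumption: it supposes that the 152 멧돼지 rows are exactly the rows with a capture count (130 + 13 + 5 + 4 = 152).

```python
import math

def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# Rows: 동물국문명 = <NA>, 멧돼지. Columns: 포획건수 = <NA>, 1, 2, 3, 4.
assumed_crosstab = [
    [9848, 0, 0, 0, 0],
    [0, 130, 13, 5, 4],
]
print(round(cramers_v(assumed_crosstab), 3))  # 1.0
```

Under that assumption the association is perfect because both columns carry the same "no capture" marker on the same rows, not because the capture count depends on the species name in any interesting way.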

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
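
The report counts 0 missing cells even though <NA> appears 9848 times in 동물국문명 and 포획건수. The Length statistics suggest why: the mean lengths match what you get if "<NA>" is stored as a literal four-character string rather than a real missing value. The following check uses the value counts from the report; the helper name `mean_length` is illustrative.

```python
def mean_length(value_counts):
    """Mean string length over all rows, given a mapping of value -> row count."""
    total_rows = sum(value_counts.values())
    return sum(len(value) * count for value, count in value_counts.items()) / total_rows

animal_counts = {"<NA>": 9848, "멧돼지": 152}
capture_counts = {"<NA>": 9848, "1": 130, "2": 13, "3": 5, "4": 4}

print(mean_length(animal_counts))   # 3.9848, matching the reported mean length
print(mean_length(capture_counts))  # 3.9544, matching the reported mean length
```

If that reading is right, the placeholder strings would need to be converted to real missing values before profiling for the nullity charts to reflect them.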

Sample

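(index) | 격자 아이디(ID) | 격자 가로 (X축) 좌표 | 격자 세로 (Y축) 좌표 | 동물국문명 | 발생연월 | 대표 지목명 | 대표 지목 코드 | 포획건수 | 데이터 생성일
20106 | 276414 | 276706 | 414969 | <NA> | 2020-03 | 도로 | 14 | <NA> | 2022-11-07
93756 | 270402 | 270706 | 402969 | <NA> | 2020-12 |  | 2 | <NA> | 2022-11-07
7986 | 354376 | 354706 | 376969 | <NA> | 2020-01 | 임야 | 5 | <NA> | 2022-11-07
32317 | 337398 | 337706 | 398969 | <NA> | 2020-04 |  | 1 | <NA> | 2022-11-07
73234 | 330414 | 330706 | 414969 | <NA> | 2020-09 | 임야 | 5 | <NA> | 2022-11-07
14780 | 316459 | 316706 | 459969 | <NA> | 2020-02 |  | 2 | <NA> | 2022-11-07
62181 | 287400 | 287706 | 400969 | <NA> | 2020-08 | 도로 | 14 | <NA> | 2022-11-07
21423 | 292429 | 292706 | 429969 | <NA> | 2020-03 | 도로 | 14 | <NA> | 2022-11-07
10144 | 268414 | 268706 | 414969 | <NA> | 2020-02 |  | 2 | <NA> | 2022-11-07
90299 | 351390 | 351706 | 390969 | 멧돼지 | 2020-11 |  | 2 | 1 | 2022-11-07

(index) | 격자 아이디(ID) | 격자 가로 (X축) 좌표 | 격자 세로 (Y축) 좌표 | 동물국문명 | 발생연월 | 대표 지목명 | 대표 지목 코드 | 포획건수 | 데이터 생성일
30326 | 302440 | 302706 | 440969 | <NA> | 2020-04 |  | 1 | <NA> | 2022-11-07
1377 | 331479 | 331706 | 479969 | <NA> | 2020-01 |  | 2 | <NA> | 2022-11-07
91061 | 335455 | 335706 | 455969 | <NA> | 2020-12 |  | 1 | <NA> | 2022-11-07
58536 | 304475 | 304706 | 475969 | <NA> | 2020-08 |  | 2 | <NA> | 2022-11-07
4089 | 282384 | 282706 | 384969 | <NA> | 2020-01 |  | 2 | <NA> | 2022-11-07
95045 | 286401 | 286706 | 401969 | 멧돼지 | 2020-12 | 임야 | 5 | 1 | 2022-11-07
26583 | 322459 | 322706 | 459969 | <NA> | 2020-04 |  | 2 | <NA> | 2022-11-07
60161 | 256457 | 256706 | 457969 | <NA> | 2020-08 |  | 1 | <NA> | 2022-11-07
13086 | 291396 | 291706 | 396969 | <NA> | 2020-02 |  | 1 | <NA> | 2022-11-07
70936 | 293460 | 293706 | 460969 | <NA> | 2020-09 |  | 1 | <NA> | 2022-11-07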
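
In the sample rows, each 격자 아이디(ID) looks like the thousands part of the X coordinate followed by the thousands part of the Y coordinate. This is an inference from the 20 rows shown here, not a documented encoding, and the 1000-unit grid spacing is likewise an assumption read off the coordinate values. If it holds for the whole dataset, it would explain why the ID and the X coordinate correlate at 1.000. A minimal sketch with a hypothetical helper `grid_id`:

```python
def grid_id(x, y):
    """Assumed encoding: thousands of X as the leading digits, thousands of Y as the trailing three."""
    return (x // 1000) * 1000 + (y // 1000)

# (격자 아이디(ID), X, Y) triples copied from the sample rows.
sample_rows = [
    (276414, 276706, 414969),
    (354376, 354706, 376969),
    (351390, 351706, 390969),
    (286401, 286706, 401969),
]

for expected_id, x, y in sample_rows:
    print(expected_id, grid_id(x, y), grid_id(x, y) == expected_id)
```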