Overview

Dataset statistics

Number of variables9
Number of observations10000
Missing cells9873
Missing cells (%)11.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory830.1 KiB
Average record size in memory85.0 B

Variable types

Numeric5
Categorical4

Dataset

Description충남 야생동물 구조센터에서 구조한 충청남도 전역의 구조 동물, 구조시점, 구조지역의 지목 등의 자료를 격자 단위의 데이터로 제공※ 좌표 : 카텍좌표
Author충청남도
URLhttps://www.data.go.kr/data/15109316/fileData.do

Alerts

데이터 생성일 has constant value ""Constant
격자 아이디(ID) is highly overall correlated with 격자 가로 (X축) 좌표High correlation
격자 가로 (X축) 좌표 is highly overall correlated with 격자 아이디(ID)High correlation
대표 지목 코드 is highly overall correlated with 대표 지목명High correlation
구조 건수 is highly overall correlated with 동물국문명High correlation
동물국문명 is highly overall correlated with 구조 건수High correlation
대표 지목명 is highly overall correlated with 대표 지목 코드High correlation
동물국문명 is highly imbalanced (97.1%)Imbalance
구조 건수 has 9873 (98.7%) missing valuesMissing

Reproduction

Analysis started2024-03-14 16:58:25.067486
Analysis finished2024-03-14 16:58:32.326783
Duration7.26 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

격자 아이디(ID)
Real number (ℝ)

HIGH CORRELATION 

Distinct5942
Distinct (%)59.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean296547.69
Minimum216459
Maximum366386
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T01:58:32.511476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum216459
5-th percentile250461.95
Q1274470
median294465
Q3318401
95-th percentile347461.05
Maximum366386
Range149927
Interquartile range (IQR)43931

Descriptive statistics

Standard deviation29019.896
Coefficient of variation (CV)0.097859122
Kurtosis-0.59824955
Mean296547.69
Median Absolute Deviation (MAD)21968
Skewness0.16424597
Sum2.9654769 × 109
Variance8.4215437 × 108
MonotonicityNot monotonic
2024-03-15T01:58:32.959314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
268478 7
 
0.1%
303476 7
 
0.1%
279485 6
 
0.1%
269405 6
 
0.1%
259451 6
 
0.1%
289397 5
 
0.1%
313452 5
 
0.1%
279398 5
 
0.1%
331475 5
 
0.1%
254430 5
 
0.1%
Other values (5932) 9943
99.4%
ValueCountFrequency (%)
216459 2
< 0.1%
217405 1
 
< 0.1%
223402 2
< 0.1%
228404 2
< 0.1%
231464 2
< 0.1%
232459 1
 
< 0.1%
232462 3
< 0.1%
232463 3
< 0.1%
233454 1
 
< 0.1%
233459 1
 
< 0.1%
ValueCountFrequency (%)
366386 1
 
< 0.1%
366385 1
 
< 0.1%
365388 3
< 0.1%
365385 1
 
< 0.1%
365384 1
 
< 0.1%
365381 1
 
< 0.1%
365380 3
< 0.1%
364389 2
< 0.1%
364388 1
 
< 0.1%
364387 1
 
< 0.1%

격자 가로 (X축) 좌표
Real number (ℝ)

HIGH CORRELATION 

Distinct140
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean296816.8
Minimum216706
Maximum366706
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T01:58:33.497411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum216706
5-th percentile250706
Q1274706
median294706
Q3318706
95-th percentile347706
Maximum366706
Range150000
Interquartile range (IQR)44000

Descriptive statistics

Standard deviation29032.306
Coefficient of variation (CV)0.097812207
Kurtosis-0.59703559
Mean296816.8
Median Absolute Deviation (MAD)22000
Skewness0.16444521
Sum2.968168 × 109
Variance8.4287481 × 108
MonotonicityNot monotonic
2024-03-15T01:58:33.886512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
284706 141
 
1.4%
272706 140
 
1.4%
322706 137
 
1.4%
279706 136
 
1.4%
317706 136
 
1.4%
321706 134
 
1.3%
283706 133
 
1.3%
282706 133
 
1.3%
287706 132
 
1.3%
288706 132
 
1.3%
Other values (130) 8646
86.5%
ValueCountFrequency (%)
216706 2
 
< 0.1%
217706 1
 
< 0.1%
223706 2
 
< 0.1%
228706 2
 
< 0.1%
231706 2
 
< 0.1%
232706 7
0.1%
233706 13
0.1%
234706 10
0.1%
235706 15
0.1%
236706 13
0.1%
ValueCountFrequency (%)
366706 2
 
< 0.1%
365706 9
 
0.1%
364706 11
 
0.1%
363706 32
0.3%
362706 23
0.2%
361706 33
0.3%
360706 29
0.3%
359706 25
0.2%
358706 36
0.4%
357706 39
0.4%
Distinct121
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean437856.4
Minimum375969
Maximum495969
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T01:58:34.165076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum375969
5-th percentile388969
Q1409969
median441469
Q3464969
95-th percentile481969
Maximum495969
Range120000
Interquartile range (IQR)55000

Descriptive statistics

Standard deviation30798.519
Coefficient of variation (CV)0.070339315
Kurtosis-1.2268203
Mean437856.4
Median Absolute Deviation (MAD)26500
Skewness-0.17163767
Sum4.378564 × 109
Variance9.4854878 × 108
MonotonicityNot monotonic
2024-03-15T01:58:34.444667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
467969 158
 
1.6%
465969 151
 
1.5%
468969 142
 
1.4%
471969 141
 
1.4%
462969 141
 
1.4%
461969 140
 
1.4%
466969 139
 
1.4%
464969 136
 
1.4%
395969 135
 
1.4%
463969 135
 
1.4%
Other values (111) 8582
85.8%
ValueCountFrequency (%)
375969 4
 
< 0.1%
376969 5
 
0.1%
377969 14
 
0.1%
378969 14
 
0.1%
379969 25
0.2%
380969 37
0.4%
381969 41
0.4%
382969 46
0.5%
383969 47
0.5%
384969 55
0.5%
ValueCountFrequency (%)
495969 8
 
0.1%
494969 8
 
0.1%
493969 6
 
0.1%
492969 11
 
0.1%
491969 9
 
0.1%
490969 20
 
0.2%
489969 30
0.3%
488969 39
0.4%
487969 52
0.5%
486969 48
0.5%

동물국문명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct41
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9873 
고라니
 
26
너구리
 
12
멧비둘기
 
10
참새
 
9
Other values (36)
 
70

Length

Max length7
Median length4
Mean length3.9917
Min length1

Unique

Unique20 ?
Unique (%)0.2%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9873
98.7%
고라니 26
 
0.3%
너구리 12
 
0.1%
멧비둘기 10
 
0.1%
참새 9
 
0.1%
황조롱이 8
 
0.1%
까치 7
 
0.1%
수리부엉이 6
 
0.1%
소쩍새 4
 
< 0.1%
중대백로㉵ 3
 
< 0.1%
Other values (31) 42
 
0.4%

Length

2024-03-15T01:58:34.921456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 9873
98.7%
고라니 26
 
0.3%
너구리 12
 
0.1%
멧비둘기 10
 
0.1%
참새 9
 
0.1%
황조롱이 8
 
0.1%
까치 7
 
0.1%
수리부엉이 6
 
0.1%
소쩍새 4
 
< 0.1%
중대백로㉵ 3
 
< 0.1%
Other values (31) 42
 
0.4%

발생연월
Categorical

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2019-04
885 
2019-06
857 
2019-03
840 
2019-07
835 
2019-11
818 
Other values (8)
5765 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019-05
2nd row2019-05
3rd row2020-01
4th row2019-11
5th row2019-09

Common Values

ValueCountFrequency (%)
2019-04 885
8.8%
2019-06 857
8.6%
2019-03 840
8.4%
2019-07 835
8.3%
2019-11 818
8.2%
2019-10 818
8.2%
2019-05 817
8.2%
2019-08 816
8.2%
2019-12 810
8.1%
2019-09 798
8.0%
Other values (3) 1706
17.1%

Length

2024-03-15T01:58:35.321832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2019-04 885
8.8%
2019-06 857
8.6%
2019-03 840
8.4%
2019-07 835
8.3%
2019-11 818
8.2%
2019-10 818
8.2%
2019-05 817
8.2%
2019-08 816
8.2%
2019-12 810
8.1%
2019-09 798
8.0%
Other values (3) 1706
17.1%

대표 지목명
Categorical

HIGH CORRELATION 

Distinct18
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3983 
2739 
임야
2025 
대지
511 
도로
457 
Other values (13)
 
285

Length

Max length4
Median length1
Mean length1.3446
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row
2nd row잡종지
3rd row대지
4th row
5th row

Common Values

ValueCountFrequency (%)
3983
39.8%
2739
27.4%
임야 2025
20.2%
대지 511
 
5.1%
도로 457
 
4.6%
잡종지 88
 
0.9%
유지 72
 
0.7%
하천 34
 
0.3%
구거 28
 
0.3%
공장용지 21
 
0.2%
Other values (8) 42
 
0.4%

Length

2024-03-15T01:58:35.740374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
3983
39.8%
2739
27.4%
임야 2025
20.2%
대지 511
 
5.1%
도로 457
 
4.6%
잡종지 88
 
0.9%
유지 72
 
0.7%
하천 34
 
0.3%
구거 28
 
0.3%
공장용지 21
 
0.2%
Other values (8) 42
 
0.4%

대표 지목 코드
Real number (ℝ)

HIGH CORRELATION 

Distinct18
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.6927
Minimum1
Maximum28
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T01:58:36.154196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q35
95-th percentile14
Maximum28
Range27
Interquartile range (IQR)4

Descriptive statistics

Standard deviation4.2484199
Coefficient of variation (CV)1.1504915
Kurtosis10.839317
Mean3.6927
Median Absolute Deviation (MAD)1
Skewness2.9856425
Sum36927
Variance18.049072
MonotonicityNot monotonic
2024-03-15T01:58:36.574184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
2 3983
39.8%
1 2739
27.4%
5 2025
20.2%
8 511
 
5.1%
14 457
 
4.6%
28 88
 
0.9%
19 72
 
0.7%
17 34
 
0.3%
18 28
 
0.3%
9 21
 
0.2%
Other values (8) 42
 
0.4%
ValueCountFrequency (%)
1 2739
27.4%
2 3983
39.8%
3 12
 
0.1%
4 2
 
< 0.1%
5 2025
20.2%
7 11
 
0.1%
8 511
 
5.1%
9 21
 
0.2%
14 457
 
4.6%
16 3
 
< 0.1%
ValueCountFrequency (%)
28 88
 
0.9%
26 2
 
< 0.1%
25 6
 
0.1%
23 5
 
0.1%
22 1
 
< 0.1%
19 72
 
0.7%
18 28
 
0.3%
17 34
 
0.3%
16 3
 
< 0.1%
14 457
4.6%

구조 건수
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct8
Distinct (%)6.3%
Missing9873
Missing (%)98.7%
Infinite0
Infinite (%)0.0%
Mean1.4488189
Minimum1
Maximum22
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T01:58:36.987632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q31
95-th percentile2
Maximum22
Range21
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.3560414
Coefficient of variation (CV)1.6261808
Kurtosis51.670036
Mean1.4488189
Median Absolute Deviation (MAD)0
Skewness6.8165387
Sum184
Variance5.5509311
MonotonicityNot monotonic
2024-03-15T01:58:37.367861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1 118
 
1.2%
2 3
 
< 0.1%
13 1
 
< 0.1%
22 1
 
< 0.1%
7 1
 
< 0.1%
5 1
 
< 0.1%
3 1
 
< 0.1%
10 1
 
< 0.1%
(Missing) 9873
98.7%
ValueCountFrequency (%)
1 118
1.2%
2 3
 
< 0.1%
3 1
 
< 0.1%
5 1
 
< 0.1%
7 1
 
< 0.1%
10 1
 
< 0.1%
13 1
 
< 0.1%
22 1
 
< 0.1%
ValueCountFrequency (%)
22 1
 
< 0.1%
13 1
 
< 0.1%
10 1
 
< 0.1%
7 1
 
< 0.1%
5 1
 
< 0.1%
3 1
 
< 0.1%
2 3
 
< 0.1%
1 118
1.2%

데이터 생성일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022-11-07
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-11-07
2nd row2022-11-07
3rd row2022-11-07
4th row2022-11-07
5th row2022-11-07

Common Values

ValueCountFrequency (%)
2022-11-07 10000
100.0%

Length

2024-03-15T01:58:37.784173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T01:58:38.108483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-11-07 10000
100.0%

Interactions

2024-03-15T01:58:30.481236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:25.922724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:27.110011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:28.427592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:29.434529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:30.732423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:26.213883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:27.526966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:28.643950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:29.619800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:30.984220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:26.495987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:27.802129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:28.827438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:29.808339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:31.228843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:26.779009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:28.088619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:29.086866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:30.012993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:31.431827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:26.950832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:28.282222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:29.290885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T01:58:30.247674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T01:58:38.371636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
격자 아이디(ID)격자 가로 (X축) 좌표격자 세로 (Y축) 좌표동물국문명발생연월대표 지목명대표 지목 코드구조 건수
격자 아이디(ID)1.0000.9990.6150.5800.0000.2440.1700.000
격자 가로 (X축) 좌표0.9991.0000.6200.4950.0000.2400.1700.000
격자 세로 (Y축) 좌표0.6150.6201.0000.0000.0140.2810.1320.000
동물국문명0.5800.4950.0001.0000.7650.0000.0000.906
발생연월0.0000.0000.0140.7651.0000.0450.0560.000
대표 지목명0.2440.2400.2810.0000.0451.0001.0000.000
대표 지목 코드0.1700.1700.1320.0000.0561.0001.0000.000
구조 건수0.0000.0000.0000.9060.0000.0000.0001.000
2024-03-15T01:58:38.727666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동물국문명대표 지목명발생연월
동물국문명1.0000.0000.300
대표 지목명0.0001.0000.016
발생연월0.3000.0161.000
2024-03-15T01:58:39.205767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
격자 아이디(ID)격자 가로 (X축) 좌표격자 세로 (Y축) 좌표대표 지목 코드구조 건수동물국문명발생연월대표 지목명
격자 아이디(ID)1.0001.000-0.3550.0090.0190.2110.0000.095
격자 가로 (X축) 좌표1.0001.000-0.3630.0090.0170.1680.0000.093
격자 세로 (Y축) 좌표-0.355-0.3631.0000.034-0.0280.0000.0060.111
대표 지목 코드0.0090.0090.0341.0000.0230.0000.0241.000
구조 건수0.0190.017-0.0280.0231.0000.5690.0000.000
동물국문명0.2110.1680.0000.0000.5691.0000.3000.000
발생연월0.0000.0000.0060.0240.0000.3001.0000.016
대표 지목명0.0950.0930.1111.0000.0000.0000.0161.000

Missing values

2024-03-15T01:58:31.652634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T01:58:32.123531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

격자 아이디(ID)격자 가로 (X축) 좌표격자 세로 (Y축) 좌표동물국문명발생연월대표 지목명대표 지목 코드구조 건수데이터 생성일
36510274477274706477969<NA>2019-051<NA>2022-11-07
34396295483295706483969<NA>2019-05잡종지28<NA>2022-11-07
99430300399300706399969<NA>2020-01대지8<NA>2022-11-07
90691364385364706385969<NA>2019-112<NA>2022-11-07
67671281468281706468969<NA>2019-091<NA>2022-11-07
19013257449257706449969<NA>2019-032<NA>2022-11-07
48764334475334706475969<NA>2019-06임야5<NA>2022-11-07
59542316462316706462969<NA>2019-082<NA>2022-11-07
94320275484275706484969<NA>2019-121<NA>2022-11-07
76614253440253706440969<NA>2019-102<NA>2022-11-07
격자 아이디(ID)격자 가로 (X축) 좌표격자 세로 (Y축) 좌표동물국문명발생연월대표 지목명대표 지목 코드구조 건수데이터 생성일
66545309472309706472969<NA>2019-09대지8<NA>2022-11-07
14497312440312706440969<NA>2019-021<NA>2022-11-07
51536237456237706456969<NA>2019-07임야5<NA>2022-11-07
50211291454291706454969<NA>2019-071<NA>2022-11-07
57345347393347706393969<NA>2019-071<NA>2022-11-07
11517271457271706457969<NA>2019-022<NA>2022-11-07
69496274444274706444969<NA>2019-091<NA>2022-11-07
22296305453305706453969<NA>2019-031<NA>2022-11-07
99709281486281706486969<NA>2020-01임야5<NA>2022-11-07
56907332393332706393969<NA>2019-071<NA>2022-11-07