Overview

Dataset statistics

Number of variables: 10
Number of observations: 1526
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 119.3 KiB
Average record size in memory: 80.1 B
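
The average record size follows from the two figures above it. A minimal arithmetic sketch, assuming 1 KiB = 1024 bytes and rounding to one decimal place as the report does:

```python
# Figures copied from the "Dataset statistics" block.
total_kib = 119.3
n_rows = 1526

# Convert KiB to bytes, then divide by the number of observations.
avg_bytes = total_kib * 1024 / n_rows
print(f"{avg_bytes:.1f} B per record")
```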

Variable types

Categorical: 6
Text: 2
DateTime: 2

Dataset

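Description: The Ulsan Facilities Corporation (울산시설공단) Wildlife Rescue Center provides wildlife rescue records with fields such as receipt number, species name, rescue date, rescue outcome date, and rescue outcome.
URL: https://www.data.go.kr/data/15089174/fileData.do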

Alerts

기준일자 (reference date) has constant value "2023-05-01" (Constant)
천연기념물 (natural monument) is highly overall correlated with 멸종 위기종 (endangered species) (High correlation)
멸종 위기종 is highly overall correlated with 천연기념물 (High correlation)
천연기념물 is highly imbalanced (82.4%) (Imbalance)
멸종 위기종 is highly imbalanced (81.2%) (Imbalance)
접수번호 (receipt number) has unique values (Unique)
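
The imbalance percentages can be sanity-checked by hand. The sketch below assumes the score is one minus the normalized Shannon entropy of the category counts. That formula is an assumption, not something the report states, but it reproduces the 81.2% alert for 멸종 위기종 from the counts listed for that variable (1452, 69, 5). The function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k): 0 for a uniform column, near 1 when one category dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # A single category has no distribution to compare; this sketch returns 0.0.
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

# Category counts for 멸종 위기종 as listed in this report.
endangered_counts = [1452, 69, 5]
score = imbalance_score(endangered_counts)
print(f"{score:.1%}")
```

The same check cannot be repeated for 천연기념물 from this report alone, because six of its sixteen categories are collapsed into "Other values".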

Reproduction

Analysis started: 2023-12-12 23:35:35.788751
Analysis finished: 2023-12-12 23:35:36.730405
Duration: 0.94 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
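
A report of this kind can be regenerated along the following lines. This is a minimal sketch, not the exact invocation used here: the file paths, the report title, and the `cp949` encoding (common for Korean public-data CSV files) are all assumptions, and the settings stored in config.json are not reproduced.

```python
def build_report(csv_path, html_path, encoding="cp949"):
    """Profile a CSV file and write the HTML report to html_path."""
    # Imported lazily so this module loads even without the optional dependencies.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="Wildlife rescue records")
    profile.to_file(html_path)
    return profile
```

A call such as `build_report("rescue.csv", "report.html")` would write the report; both file names are placeholders.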

Variables


(unnamed)
Categorical

Distinct: 3
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB

조강(Aves): 1183
포유강(Mammalia): 336
파충강(Reptilia): 7

Length

Max length: 13
Median length: 8
Mean length: 9.1238532
Min length: 8
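
These length statistics can be reproduced from the three category labels and their counts. A short sketch, assuming length means the number of Unicode code points, which is what Python's `len` returns for a string:

```python
import statistics

# Labels and counts as listed for this variable.
counts = {"조강(Aves)": 1183, "포유강(Mammalia)": 336, "파충강(Reptilia)": 7}

# Repeat each label's character length once per occurrence of the label.
lengths = [len(label) for label, n in counts.items() for _ in range(n)]

length_stats = {
    "max": max(lengths),
    "median": statistics.median(lengths),
    "mean": statistics.mean(lengths),
    "min": min(lengths),
}
print(length_stats)
```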

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 조강(Aves)
2nd row: 포유강(Mammalia)
3rd row: 포유강(Mammalia)
4th row: 조강(Aves)
5th row: 조강(Aves)

Common Values

Value | Count | Frequency (%)
조강(Aves) | 1183 | 77.5%
포유강(Mammalia) | 336 | 22.0%
파충강(Reptilia) | 7 | 0.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

접수번호 (receipt number)
Text

UNIQUE

Distinct: 1526
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Characters and Unicode

Total characters: 13734
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
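
As an illustration of those character properties, Python's standard `unicodedata` module returns the general category of each code point. The sketch below tallies categories for the five sample values listed for this variable; `Nd` is the code for Decimal Number and `Pd` for Dash Punctuation.

```python
import unicodedata
from collections import Counter

# The five sample values shown for 접수번호.
sample_ids = ["2021-0440", "2021-0441", "2021-0442", "2021-0443", "2021-0444"]

# Count the Unicode general category of every character in every value.
category_counts = Counter(
    unicodedata.category(ch) for value in sample_ids for ch in value
)
print(category_counts)
```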

Unique

Unique: 1526
Unique (%): 100.0%

Sample

1st row: 2021-0440
2nd row: 2021-0441
3rd row: 2021-0442
4th row: 2021-0443
5th row: 2021-0444

Value | Count | Frequency (%)
2021-0440 | 1 | 0.1%
2022-0578 | 1 | 0.1%
2022-0587 | 1 | 0.1%
2022-0586 | 1 | 0.1%
2022-0585 | 1 | 0.1%
2022-0584 | 1 | 0.1%
2022-0583 | 1 | 0.1%
2022-0582 | 1 | 0.1%
2022-0581 | 1 | 0.1%
2022-0580 | 1 | 0.1%
Other values (1516) | 1516 | 99.3%

Most occurring characters

Value | Count | Frequency (%)
2 | 4360 | 31.7%
0 | 3555 | 25.9%
- | 1526 | 11.1%
1 | 927 | 6.7%
3 | 590 | 4.3%
5 | 513 | 3.7%
6 | 512 | 3.7%
7 | 507 | 3.7%
4 | 474 | 3.5%
8 | 474 | 3.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 12208 | 88.9%
Dash Punctuation | 1526 | 11.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 4360 | 35.7%
0 | 3555 | 29.1%
1 | 927 | 7.6%
3 | 590 | 4.8%
5 | 513 | 4.2%
6 | 512 | 4.2%
7 | 507 | 4.2%
4 | 474 | 3.9%
8 | 474 | 3.9%
9 | 296 | 2.4%

Dash Punctuation
Value | Count | Frequency (%)
- | 1526 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 13734 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
2 | 4360 | 31.7%
0 | 3555 | 25.9%
- | 1526 | 11.1%
1 | 927 | 6.7%
3 | 590 | 4.3%
5 | 513 | 3.7%
6 | 512 | 3.7%
7 | 507 | 3.7%
4 | 474 | 3.5%
8 | 474 | 3.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 13734 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
2 | 4360 | 31.7%
0 | 3555 | 25.9%
- | 1526 | 11.1%
1 | 927 | 6.7%
3 | 590 | 4.3%
5 | 513 | 3.7%
6 | 512 | 3.7%
7 | 507 | 3.7%
4 | 474 | 3.5%
8 | 474 | 3.5%

국 명 (species common name)
Text

Distinct: 123
Distinct (%): 8.1%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB

Length

Max length: 9
Median length: 8
Mean length: 3.6284404
Min length: 1

Characters and Unicode

Total characters: 5537
Distinct characters: 169
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 36
Unique (%): 2.4%

Sample

1st row: (blank)
2nd row: 고라니
3rd row: 고라니
4th row: 직박구리
5th row: 황조롱이

Value | Count | Frequency (%)
집비둘기 (feral pigeon) | 209 | 13.7%
고라니 (water deer) | 164 | 10.7%
멧비둘기 (Oriental turtle dove) | 105 | 6.9%
너구리 (raccoon dog) | 102 | 6.7%
직박구리 (brown-eared bulbul) | 77 | 5.0%
까치 (magpie) | 71 | 4.7%
떼까마귀 (rook) | 66 | 4.3%
참새 (tree sparrow) | 65 | 4.3%
흰뺨검둥오리 (spot-billed duck) | 48 | 3.1%
딱새 (Daurian redstart) | 32 | 2.1%
Other values (113) | 587 | 38.5%

Most occurring characters

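Value | Count | Frequency (%)
(missing) | 383 | 6.9%
(missing) | 380 | 6.9%
(missing) | 353 | 6.4%
(missing) | 314 | 5.7%
(missing) | 218 | 3.9%
(missing) | 212 | 3.8%
(missing) | 208 | 3.8%
(missing) | 173 | 3.1%
(missing) | 165 | 3.0%
(missing) | 164 | 3.0%
Other values (159) | 2967 | 53.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5535 | > 99.9%
Open Punctuation | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(missing) | 383 | 6.9%
(missing) | 380 | 6.9%
(missing) | 353 | 6.4%
(missing) | 314 | 5.7%
(missing) | 218 | 3.9%
(missing) | 212 | 3.8%
(missing) | 208 | 3.8%
(missing) | 173 | 3.1%
(missing) | 165 | 3.0%
(missing) | 164 | 3.0%
Other values (157) | 2965 | 53.6%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%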

Most occurring scripts

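Value | Count | Frequency (%)
Hangul | 5535 | > 99.9%
Common | 2 | < 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(missing) | 383 | 6.9%
(missing) | 380 | 6.9%
(missing) | 353 | 6.4%
(missing) | 314 | 5.7%
(missing) | 218 | 3.9%
(missing) | 212 | 3.8%
(missing) | 208 | 3.8%
(missing) | 173 | 3.1%
(missing) | 165 | 3.0%
(missing) | 164 | 3.0%
Other values (157) | 2965 | 53.6%

Common
Value | Count | Frequency (%)
( | 1 | 50.0%
) | 1 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5535 | > 99.9%
ASCII | 2 | < 0.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(missing) | 383 | 6.9%
(missing) | 380 | 6.9%
(missing) | 353 | 6.4%
(missing) | 314 | 5.7%
(missing) | 218 | 3.9%
(missing) | 212 | 3.8%
(missing) | 208 | 3.8%
(missing) | 173 | 3.1%
(missing) | 165 | 3.0%
(missing) | 164 | 3.0%
Other values (157) | 2965 | 53.6%

ASCII
Value | Count | Frequency (%)
( | 1 | 50.0%
) | 1 | 50.0%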

천연기념물 (natural monument)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 16
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB

해당없음 (not applicable): 1398
323-8: 23
324-3: 20
323-4: 16
324-2: 13
Other values (11): 56

Length

Max length: 5
Median length: 4
Mean length: 4.0655308
Min length: 3

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 해당없음
2nd row: 해당없음
3rd row: 해당없음
4th row: 해당없음
5th row: 323-8

Common Values

Value | Count | Frequency (%)
해당없음 (not applicable) | 1398 | 91.6%
323-8 | 23 | 1.5%
324-3 | 20 | 1.3%
323-4 | 16 | 1.0%
324-2 | 13 | 0.9%
323-1 | 11 | 0.7%
243-1 | 11 | 0.7%
324-6 | 10 | 0.7%
324-7 | 6 | 0.4%
328 | 5 | 0.3%
Other values (6) | 13 | 0.9%

Length

[Figure: Histogram of lengths of the category]

멸종 위기종 (endangered species)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 3
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB

해당없음 (not applicable): 1452
2 급 (Class II): 69
1 급 (Class I): 5

Length

Max length: 4
Median length: 4
Mean length: 3.9515072
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 해당없음
2nd row: 해당없음
3rd row: 해당없음
4th row: 해당없음
5th row: 해당없음

Common Values

Value | Count | Frequency (%)
해당없음 (not applicable) | 1452 | 95.2%
2 급 (Class II) | 69 | 4.5%
1 급 (Class I) | 5 | 0.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Words

Value | Count | Frequency (%)
해당없음 | 1452 | 90.8%
(missing) | 74 | 4.6%
2 | 69 | 4.3%
1 | 5 | 0.3%

구조 일자 (rescue date)
DateTime

Distinct: 553
Distinct (%): 36.2%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB
Minimum: 2021-07-01 00:00:00
Maximum: 2023-12-31 00:00:00

[Figure: Histogram with fixed size bins (bins=50)]

구조결과일자 (rescue outcome date)
DateTime

Distinct: 533
Distinct (%): 34.9%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB
Minimum: 2021-07-01 00:00:00
Maximum: 2023-04-25 00:00:00

[Figure: Histogram with fixed size bins (bins=50)]

구조 결과 (rescue outcome)
Categorical

Distinct: 7
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB

방생 (released): 434
DOA: 384
안락사 (euthanized): 353
폐사 (died): 223
계류 (in care): 105
Other values (2): 27

Length

Max length: 3
Median length: 2.5
Mean length: 2.5
Min length: 2

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 안락사
2nd row: 방생
3rd row: 방생
4th row: 폐사
5th row: 방생

Common Values

Value | Count | Frequency (%)
방생 (released) | 434 | 28.4%
DOA | 384 | 25.2%
안락사 (euthanized) | 353 | 23.1%
폐사 (died) | 223 | 14.6%
계류 (in care) | 105 | 6.9%
폐사체 (carcass) | 26 | 1.7%
이첩 (transferred) | 1 | 0.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]
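
발견장소 특징 (discovery location type)
Categorical

Distinct: 7
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB

건물옆 (beside a building): 794
도로변 (roadside): 365
기타 (other): 115
(missing): 87
농경지 (farmland): 81
Other values (2): 84

Length

Max length: 5
Median length: 3
Mean length: 2.9108781
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 도로변
2nd row: 농경지
3rd row: 농경지
4th row: 건물옆
5th row: 건물옆

Common Values

Value | Count | Frequency (%)
건물옆 (beside a building) | 794 | 52.0%
도로변 (roadside) | 365 | 23.9%
기타 (other) | 115 | 7.5%
(missing) | 87 | 5.7%
농경지 (farmland) | 81 | 5.3%
강, 바다 (river, sea) | 69 | 4.5%
강 바다 (river sea) | 15 | 1.0%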

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Words

Value | Count | Frequency (%)
건물옆 | 794 | 49.3%
도로변 | 365 | 22.7%
기타 | 115 | 7.1%
(missing) | 87 | 5.4%
(missing) | 84 | 5.2%
바다 | 84 | 5.2%
농경지 | 81 | 5.0%

기준일자 (reference date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 12.1 KiB

2023-05-01: 1526

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-05-01
2nd row: 2023-05-01
3rd row: 2023-05-01
4th row: 2023-05-01
5th row: 2023-05-01

Common Values

Value | Count | Frequency (%)
2023-05-01 | 1526 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Correlations

Variable | (unnamed) | 천연기념물 | 멸종 위기종 | 구조 결과 | 발견장소 특징
(unnamed) | 1.000 | 0.670 | 0.288 | 0.013 | 0.250
천연기념물 | 0.670 | 1.000 | 0.980 | 0.330 | 0.275
멸종 위기종 | 0.288 | 0.980 | 1.000 | 0.139 | 0.233
구조 결과 | 0.013 | 0.330 | 0.139 | 1.000 | 0.222
발견장소 특징 | 0.250 | 0.275 | 0.233 | 0.222 | 1.000

Variable | 천연기념물 | (unnamed) | 구조 결과 | 발견장소 특징 | 멸종 위기종
천연기념물 | 1.000 | 0.474 | 0.157 | 0.129 | 0.965
(unnamed) | 0.474 | 1.000 | 0.009 | 0.173 | 0.096
구조 결과 | 0.157 | 0.009 | 1.000 | 0.079 | 0.093
발견장소 특징 | 0.129 | 0.173 | 0.079 | 1.000 | 0.160
멸종 위기종 | 0.965 | 0.096 | 0.093 | 0.160 | 1.000

Variable | (unnamed) | 천연기념물 | 멸종 위기종 | 구조 결과 | 발견장소 특징
(unnamed) | 1.000 | 0.474 | 0.096 | 0.009 | 0.173
천연기념물 | 0.474 | 1.000 | 0.965 | 0.157 | 0.129
멸종 위기종 | 0.096 | 0.965 | 1.000 | 0.093 | 0.160
구조 결과 | 0.009 | 0.157 | 0.093 | 1.000 | 0.079
발견장소 특징 | 0.173 | 0.129 | 0.160 | 0.079 | 1.000
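
The report does not state which association measure fills these matrices. For pairs of categorical columns, one common choice is Cramér's V, derived from the chi-squared statistic of the contingency table. The sketch below implements the plain, uncorrected form on toy inputs. It is offered as an illustration of the idea, not as the computation behind the numbers above, and the function name `cramers_v` is hypothetical.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    min_dim = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or min_dim == 0:
        return 0.0  # association is undefined when either column is constant
    chi2 = 0.0
    for r, r_n in row_totals.items():
        for c, c_n in col_totals.items():
            expected = r_n * c_n / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * min_dim))

# Toy data: perfectly associated columns give 1.0, independent ones give 0.0.
perfect = cramers_v(["a", "b"] * 10, ["x", "y"] * 10)
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(perfect, independent)
```

A value near 1, such as the 0.965 to 0.980 reported between 천연기념물 and 멸종 위기종, indicates that one column largely determines the other.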

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly pick out patterns in data completion.]

Sample

First rows

Index | (unnamed) | 접수번호 | 국 명 | 천연기념물 | 멸종 위기종 | 구조 일자 | 구조결과일자 | 구조 결과 | 발견장소 특징 | 기준일자
0 | 조강(Aves) | 2021-0440 | (blank) | 해당없음 | 해당없음 | 2021-07-01 | 2021-07-12 | 안락사 | 도로변 | 2023-05-01
1 | 포유강(Mammalia) | 2021-0441 | 고라니 | 해당없음 | 해당없음 | 2021-07-01 | 2021-07-01 | 방생 | 농경지 | 2023-05-01
2 | 포유강(Mammalia) | 2021-0442 | 고라니 | 해당없음 | 해당없음 | 2021-07-01 | 2021-07-01 | 방생 | 농경지 | 2023-05-01
3 | 조강(Aves) | 2021-0443 | 직박구리 | 해당없음 | 해당없음 | 2021-07-01 | 2021-07-04 | 폐사 | 건물옆 | 2023-05-01
4 | 조강(Aves) | 2021-0444 | 황조롱이 | 323-8 | 해당없음 | 2021-07-01 | 2021-08-05 | 방생 | 건물옆 | 2023-05-01
5 | 포유강(Mammalia) | 2021-0445 | 고라니 | 해당없음 | 해당없음 | 2021-07-01 | 2021-07-14 | 안락사 | 도로변 | 2023-05-01
6 | 조강(Aves) | 2021-0446 | 까치 | 해당없음 | 해당없음 | 2021-07-02 | 2021-07-02 | 안락사 | 농경지 | 2023-05-01
7 | 조강(Aves) | 2021-0447 | 딱새 | 해당없음 | 해당없음 | 2021-07-02 | 2021-07-10 | 안락사 | 건물옆 | 2023-05-01
8 | 조강(Aves) | 2021-0448 | 딱새 | 해당없음 | 해당없음 | 2021-07-02 | 2021-07-11 | 폐사 | 건물옆 | 2023-05-01
9 | 조강(Aves) | 2021-0449 | 참새 | 해당없음 | 해당없음 | 2021-07-02 | 2021-07-10 | 방생 | 건물옆 | 2023-05-01

Last rows

Index | (unnamed) | 접수번호 | 국 명 | 천연기념물 | 멸종 위기종 | 구조 일자 | 구조결과일자 | 구조 결과 | 발견장소 특징 | 기준일자
1516 | 조강(Aves) | 2023-0177 | 딱새 | 해당없음 | 해당없음 | 2023-04-23 | 2023-04-23 | DOA | 건물옆 | 2023-05-01
1517 | 포유강(Mammalia) | 2023-0178 | 고라니 | 해당없음 | 해당없음 | 2023-04-23 | 2023-04-24 | DOA | 도로변 | 2023-05-01
1518 | 조강(Aves) | 2023-0179 | 까치 | 해당없음 | 해당없음 | 2023-04-23 | 2023-04-23 | 계류 | 도로변 | 2023-05-01
1519 | 포유강(Mammalia) | 2023-0180 | 고라니 | 해당없음 | 해당없음 | 2023-04-23 | 2023-04-24 | DOA | 도로변 | 2023-05-01
1520 | 조강(Aves) | 2023-0181 | 까치 | 해당없음 | 해당없음 | 2023-04-24 | 2023-04-24 | 계류 | 도로변 | 2023-05-01
1521 | 포유강(Mammalia) | 2023-0182 | 고라니 | 해당없음 | 해당없음 | 2023-04-24 | 2023-04-24 | DOA | 도로변 | 2023-05-01
1522 | 포유강(Mammalia) | 2023-0183 | 노루 | 해당없음 | 해당없음 | 2023-04-24 | 2023-04-24 | DOA | 도로변 | 2023-05-01
1523 | 조강(Aves) | 2023-0184 | 까치 | 해당없음 | 해당없음 | 2023-04-24 | 2023-04-24 | 계류 | 건물옆 | 2023-05-01
1524 | 포유강(Mammalia) | 2023-0185 | 노루 | 해당없음 | 해당없음 | 2023-04-24 | 2023-04-24 | DOA | 도로변 | 2023-05-01
1525 | 포유강(Mammalia) | 2023-0186 | 고라니 | 해당없음 | 해당없음 | 2023-04-25 | 2023-04-25 | 계류 | 도로변 | 2023-05-01