Overview

Dataset statistics

Number of variables8
Number of observations67
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.4 KiB
Average record size in memory68.0 B

Variable types

Numeric1
Categorical4
DateTime2
Text1

Alerts

조사분야구분코드명 has constant value ""Constant
종국명 is highly overall correlated with 종학명 and 1 other fieldsHigh correlation
조사년도 is highly overall correlated with 종국명 and 1 other fieldsHigh correlation
종학명 is highly overall correlated with 종국명 and 1 other fieldsHigh correlation
조사년도 is highly imbalanced (88.8%)Imbalance
공간아이디 has unique valuesUnique

Reproduction

Analysis started2023-12-10 11:25:02.691911
Analysis finished2023-12-10 11:25:05.661765
Duration2.97 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공간아이디
Real number (ℝ)

UNIQUE 

Distinct67
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8 × 1015
Minimum8 × 1015
Maximum8 × 1015
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size735.0 B
2023-12-10T20:25:05.797861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8 × 1015
5-th percentile8 × 1015
Q18 × 1015
median8 × 1015
Q38 × 1015
95-th percentile8 × 1015
Maximum8 × 1015
Range67
Interquartile range (IQR)34

Descriptive statistics

Standard deviation19.723083
Coefficient of variation (CV)2.4653854 × 10-15
Kurtosis-1.1610997
Mean8 × 1015
Median Absolute Deviation (MAD)17
Skewness-0.27621506
Sum5.36 × 1017
Variance389
MonotonicityStrictly increasing
2023-12-10T20:25:06.106984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8000000002096166 1
 
1.5%
8000000002096209 1
 
1.5%
8000000002096215 1
 
1.5%
8000000002096214 1
 
1.5%
8000000002096213 1
 
1.5%
8000000002096212 1
 
1.5%
8000000002096211 1
 
1.5%
8000000002096210 1
 
1.5%
8000000002096208 1
 
1.5%
8000000002096167 1
 
1.5%
Other values (57) 57
85.1%
ValueCountFrequency (%)
8000000002096166 1
1.5%
8000000002096167 1
1.5%
8000000002096168 1
1.5%
8000000002096169 1
1.5%
8000000002096170 1
1.5%
8000000002096171 1
1.5%
8000000002096172 1
1.5%
8000000002096173 1
1.5%
8000000002096174 1
1.5%
8000000002096175 1
1.5%
ValueCountFrequency (%)
8000000002096233 1
1.5%
8000000002096232 1
1.5%
8000000002096231 1
1.5%
8000000002096230 1
1.5%
8000000002096229 1
1.5%
8000000002096228 1
1.5%
8000000002096226 1
1.5%
8000000002096225 1
1.5%
8000000002096224 1
1.5%
8000000002096223 1
1.5%

조사분야구분코드명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size668.0 B
조류
67 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row조류
2nd row조류
3rd row조류
4th row조류
5th row조류

Common Values

ValueCountFrequency (%)
조류 67
100.0%

Length

2023-12-10T20:25:06.373271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:25:06.550810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
조류 67
100.0%

종국명
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)17.9%
Missing0
Missing (%)0.0%
Memory size668.0 B
황조롱이
20 
원앙
12 
꾀꼬리
12 
파랑새
뻐꾸기
Other values (7)
12 

Length

Max length6
Median length5
Mean length3.3731343
Min length2

Unique

Unique4 ?
Unique (%)6.0%

Sample

1st row원앙
2nd row황조롱이
3rd row황조롱이
4th row황조롱이
5th row꾀꼬리

Common Values

ValueCountFrequency (%)
황조롱이 20
29.9%
원앙 12
17.9%
꾀꼬리 12
17.9%
파랑새 6
 
9.0%
뻐꾸기 5
 
7.5%
청딱다구리 4
 
6.0%
물총새 2
 
3.0%
오색딱다구리 2
 
3.0%
칼새 1
 
1.5%
뿔논병아리 1
 
1.5%
Other values (2) 2
 
3.0%

Length

2023-12-10T20:25:07.078755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
황조롱이 20
29.9%
원앙 12
17.9%
꾀꼬리 12
17.9%
파랑새 6
 
9.0%
뻐꾸기 5
 
7.5%
청딱다구리 4
 
6.0%
물총새 2
 
3.0%
오색딱다구리 2
 
3.0%
칼새 1
 
1.5%
뿔논병아리 1
 
1.5%
Other values (2) 2
 
3.0%

종학명
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)17.9%
Missing0
Missing (%)0.0%
Memory size668.0 B
Falco tinnunculus Linnaeus, 1758
20 
Aix galericulata (Linnaeus, 1758)
12 
Oriolus chinensis Linnaeus, 1766
12 
Eurystomus orientalis (Linnaeus, 1766)
Cuculus canorus Linnaeus, 1758
Other values (7)
12 

Length

Max length38
Median length35
Mean length32.089552
Min length24

Unique

Unique4 ?
Unique (%)6.0%

Sample

1st rowAix galericulata (Linnaeus, 1758)
2nd rowFalco tinnunculus Linnaeus, 1758
3rd rowFalco tinnunculus Linnaeus, 1758
4th rowFalco tinnunculus Linnaeus, 1758
5th rowOriolus chinensis Linnaeus, 1766

Common Values

ValueCountFrequency (%)
Falco tinnunculus Linnaeus, 1758 20
29.9%
Aix galericulata (Linnaeus, 1758) 12
17.9%
Oriolus chinensis Linnaeus, 1766 12
17.9%
Eurystomus orientalis (Linnaeus, 1766) 6
 
9.0%
Cuculus canorus Linnaeus, 1758 5
 
7.5%
Picus canus Gmelin, 1788 4
 
6.0%
Alcedo atthis (Linnaeus, 1758) 2
 
3.0%
Dendrocopos major (Linnaeus, 1758) 2
 
3.0%
Apus pacificus (Latham, 1801) 1
 
1.5%
Podiceps cristatus (Linnaeus, 1758) 1
 
1.5%
Other values (2) 2
 
3.0%

Length

2023-12-10T20:25:07.238326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
linnaeus 62
23.1%
1758 43
16.0%
falco 20
 
7.5%
tinnunculus 20
 
7.5%
1766 19
 
7.1%
aix 12
 
4.5%
galericulata 12
 
4.5%
oriolus 12
 
4.5%
chinensis 12
 
4.5%
eurystomus 6
 
2.2%
Other values (20) 50
18.7%

조사년도
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size668.0 B
2014
66 
2017
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)1.5%

Sample

1st row2014
2nd row2014
3rd row2014
4th row2014
5th row2014

Common Values

ValueCountFrequency (%)
2014 66
98.5%
2017 1
 
1.5%

Length

2023-12-10T20:25:07.386576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:25:07.528412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2014 66
98.5%
2017 1
 
1.5%
Distinct16
Distinct (%)23.9%
Missing0
Missing (%)0.0%
Memory size668.0 B
Minimum2014-04-15 00:00:00
Maximum2017-09-26 00:00:00
2023-12-10T20:25:07.641019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:25:07.812491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
Distinct16
Distinct (%)23.9%
Missing0
Missing (%)0.0%
Memory size668.0 B
Minimum2014-04-15 00:00:00
Maximum2017-09-26 00:00:00
2023-12-10T20:25:07.980269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:25:08.105354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
Distinct37
Distinct (%)55.2%
Missing0
Missing (%)0.0%
Memory size668.0 B
2023-12-10T20:25:08.315400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length44
Mean length43.970149
Min length37

Characters and Unicode

Total characters2946
Distinct characters19
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)38.8%

Sample

1st rowPOINT (179527.16590000037 462224.5848999992)
2nd rowPOINT (180499.72429999989 462324.0987)
3rd rowPOINT (178838.9386 462744.2013000008)
4th rowPOINT (178653.08270000014 462800.15059999935)
5th rowPOINT (178277.36459999997 463328.2248999998)
ValueCountFrequency (%)
point 67
33.3%
463694.9548000004 10
 
5.0%
178293.22410000023 10
 
5.0%
184582.9428000003 6
 
3.0%
472257.1621000003 6
 
3.0%
168934.97890000045 5
 
2.5%
468091.95160000026 5
 
2.5%
171523.80810000002 4
 
2.0%
467669.8738000002 4
 
2.0%
177045.9084999999 3
 
1.5%
Other values (65) 81
40.3%
2023-12-10T20:25:08.763555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 517
17.5%
9 360
12.2%
4 215
 
7.3%
6 195
 
6.6%
8 166
 
5.6%
1 164
 
5.6%
7 160
 
5.4%
2 153
 
5.2%
3 146
 
5.0%
134
 
4.5%
Other values (9) 736
25.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2209
75.0%
Uppercase Letter 335
 
11.4%
Space Separator 134
 
4.5%
Other Punctuation 134
 
4.5%
Open Punctuation 67
 
2.3%
Close Punctuation 67
 
2.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 517
23.4%
9 360
16.3%
4 215
9.7%
6 195
 
8.8%
8 166
 
7.5%
1 164
 
7.4%
7 160
 
7.2%
2 153
 
6.9%
3 146
 
6.6%
5 133
 
6.0%
Uppercase Letter
ValueCountFrequency (%)
P 67
20.0%
O 67
20.0%
T 67
20.0%
N 67
20.0%
I 67
20.0%
Space Separator
ValueCountFrequency (%)
134
100.0%
Other Punctuation
ValueCountFrequency (%)
. 134
100.0%
Open Punctuation
ValueCountFrequency (%)
( 67
100.0%
Close Punctuation
ValueCountFrequency (%)
) 67
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2611
88.6%
Latin 335
 
11.4%

Most frequent character per script

Common
ValueCountFrequency (%)
0 517
19.8%
9 360
13.8%
4 215
8.2%
6 195
 
7.5%
8 166
 
6.4%
1 164
 
6.3%
7 160
 
6.1%
2 153
 
5.9%
3 146
 
5.6%
134
 
5.1%
Other values (4) 401
15.4%
Latin
ValueCountFrequency (%)
P 67
20.0%
O 67
20.0%
T 67
20.0%
N 67
20.0%
I 67
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2946
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 517
17.5%
9 360
12.2%
4 215
 
7.3%
6 195
 
6.6%
8 166
 
5.6%
1 164
 
5.6%
7 160
 
5.4%
2 153
 
5.2%
3 146
 
5.0%
134
 
4.5%
Other values (9) 736
25.0%

Interactions

2023-12-10T20:25:04.978410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T20:25:08.931550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공간아이디종국명종학명조사년도조사시작일자조사종료일자지오메트리
공간아이디1.0000.4460.4460.0000.8380.8380.977
종국명0.4461.0001.0001.0000.8090.8090.917
종학명0.4461.0001.0001.0000.8090.8090.917
조사년도0.0001.0001.0001.0001.0001.0001.000
조사시작일자0.8380.8090.8091.0001.0001.0001.000
조사종료일자0.8380.8090.8091.0001.0001.0001.000
지오메트리0.9770.9170.9171.0001.0001.0001.000
2023-12-10T20:25:09.079933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종국명조사년도종학명
종국명1.0000.9201.000
조사년도0.9201.0000.920
종학명1.0000.9201.000
2023-12-10T20:25:09.210885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공간아이디종국명종학명조사년도
공간아이디1.0000.1940.1940.000
종국명0.1941.0001.0000.920
종학명0.1941.0001.0000.920
조사년도0.0000.9200.9201.000

Missing values

2023-12-10T20:25:05.324572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T20:25:05.555543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공간아이디조사분야구분코드명종국명종학명조사년도조사시작일자조사종료일자지오메트리
08000000002096166조류원앙Aix galericulata (Linnaeus, 1758)20142014-06-142014-06-14POINT (179527.16590000037 462224.5848999992)
18000000002096167조류황조롱이Falco tinnunculus Linnaeus, 175820142014-06-142014-06-14POINT (180499.72429999989 462324.0987)
28000000002096168조류황조롱이Falco tinnunculus Linnaeus, 175820142014-04-202014-04-20POINT (178838.9386 462744.2013000008)
38000000002096169조류황조롱이Falco tinnunculus Linnaeus, 175820142014-04-202014-04-20POINT (178653.08270000014 462800.15059999935)
48000000002096170조류꾀꼬리Oriolus chinensis Linnaeus, 176620142014-06-142014-06-14POINT (178277.36459999997 463328.2248999998)
58000000002096171조류청딱다구리Picus canus Gmelin, 178820142014-09-282014-09-28POINT (177902.9754999997 463347.58770000003)
68000000002096172조류황조롱이Falco tinnunculus Linnaeus, 175820142014-10-272014-10-27POINT (178394.07610000018 463367.9718999993)
78000000002096173조류꾀꼬리Oriolus chinensis Linnaeus, 176620142014-06-142014-06-14POINT (178419.20859999955 463488.1002999991)
88000000002096174조류원앙Aix galericulata (Linnaeus, 1758)20142014-04-202014-04-20POINT (178330.09310000017 463553.03050000034)
98000000002096175조류원앙Aix galericulata (Linnaeus, 1758)20142014-09-292014-09-29POINT (182330.2577999998 463556.1219999995)
공간아이디조사분야구분코드명종국명종학명조사년도조사시작일자조사종료일자지오메트리
578000000002096223조류꾀꼬리Oriolus chinensis Linnaeus, 176620142014-06-262014-06-26POINT (168934.97890000045 468091.95160000026)
588000000002096224조류황조롱이Falco tinnunculus Linnaeus, 175820142014-06-262014-06-26POINT (168934.97890000045 468091.95160000026)
598000000002096225조류노랑때까치Lanius cristatus Linnaeus, 175820142014-10-262014-10-26POINT (169656.5081000002 468169.47440000065)
608000000002096226조류청딱다구리Picus canus Gmelin, 178820142014-04-152014-04-15POINT (167270.4765999997 469041.47389999963)
618000000002096228조류황조롱이Falco tinnunculus Linnaeus, 175820142014-06-132014-06-13POINT (184582.9428000003 472257.1621000003)
628000000002096229조류뻐꾸기Cuculus canorus Linnaeus, 175820142014-06-132014-06-13POINT (184582.9428000003 472257.1621000003)
638000000002096230조류원앙Aix galericulata (Linnaeus, 1758)20142014-06-132014-06-13POINT (184582.9428000003 472257.1621000003)
648000000002096231조류원앙Aix galericulata (Linnaeus, 1758)20142014-06-132014-06-13POINT (184582.9428000003 472257.1621000003)
658000000002096232조류원앙Aix galericulata (Linnaeus, 1758)20142014-06-132014-06-13POINT (184582.9428000003 472257.1621000003)
668000000002096233조류원앙Aix galericulata (Linnaeus, 1758)20142014-06-132014-06-13POINT (184582.9428000003 472257.1621000003)