Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.6 KiB
Average record size in memory67.3 B

Variable types

Numeric1
Categorical5
Text2

Alerts

조사분야구분코드명 has constant value ""Constant
조사년도 has constant value ""Constant
지오메트리 is highly overall correlated with 공간아이디 and 2 other fieldsHigh correlation
조사시작일자 is highly overall correlated with 공간아이디 and 2 other fieldsHigh correlation
조사종료일자 is highly overall correlated with 공간아이디 and 2 other fieldsHigh correlation
공간아이디 is highly overall correlated with 조사시작일자 and 2 other fieldsHigh correlation
공간아이디 has unique valuesUnique

Reproduction

Analysis started2023-12-10 11:08:00.680255
Analysis finished2023-12-10 11:08:04.564875
Duration3.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공간아이디
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4 × 1015
Minimum4 × 1015
Maximum4 × 1015
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T20:08:04.714388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4 × 1015
5-th percentile4 × 1015
Q14 × 1015
median4 × 1015
Q34 × 1015
95-th percentile4 × 1015
Maximum4 × 1015
Range99
Interquartile range (IQR)49

Descriptive statistics

Standard deviation29.015844
Coefficient of variation (CV)7.253961 × 10-15
Kurtosis-1.199243
Mean4 × 1015
Median Absolute Deviation (MAD)25
Skewness-0.052740365
Sum4 × 1017
Variance841.91919
MonotonicityStrictly increasing
2023-12-10T20:08:05.026060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4000000000003848 1
 
1.0%
4000000000003912 1
 
1.0%
4000000000003922 1
 
1.0%
4000000000003921 1
 
1.0%
4000000000003920 1
 
1.0%
4000000000003919 1
 
1.0%
4000000000003918 1
 
1.0%
4000000000003917 1
 
1.0%
4000000000003916 1
 
1.0%
4000000000003915 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
4000000000003848 1
1.0%
4000000000003849 1
1.0%
4000000000003850 1
1.0%
4000000000003851 1
1.0%
4000000000003852 1
1.0%
4000000000003853 1
1.0%
4000000000003854 1
1.0%
4000000000003855 1
1.0%
4000000000003856 1
1.0%
4000000000003857 1
1.0%
ValueCountFrequency (%)
4000000000003947 1
1.0%
4000000000003946 1
1.0%
4000000000003945 1
1.0%
4000000000003944 1
1.0%
4000000000003943 1
1.0%
4000000000003942 1
1.0%
4000000000003941 1
1.0%
4000000000003940 1
1.0%
4000000000003939 1
1.0%
4000000000003938 1
1.0%

조사분야구분코드명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
저서무척추동물
100 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row저서무척추동물
2nd row저서무척추동물
3rd row저서무척추동물
4th row저서무척추동물
5th row저서무척추동물

Common Values

ValueCountFrequency (%)
저서무척추동물 100
100.0%

Length

2023-12-10T20:08:05.275246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:08:05.457070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
저서무척추동물 100
100.0%
Distinct58
Distinct (%)58.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T20:08:05.794201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length6.52
Min length2

Characters and Unicode

Total characters652
Distinct characters129
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)28.0%

Sample

1st row물벌레
2nd row피라미하루살이
3rd row무늬하루살이
4th row좀뱀잠자리 KUa
5th row플라나리아
ValueCountFrequency (%)
kua 13
 
11.4%
피라미하루살이 4
 
3.5%
무늬하루살이 4
 
3.5%
큰등그물강도래 3
 
2.6%
흰부채하루살이 3
 
2.6%
플라나리아 3
 
2.6%
두갈래하루살이 3
 
2.6%
민하루살이 3
 
2.6%
개똥하루살이 3
 
2.6%
두점하루살이 3
 
2.6%
Other values (50) 72
63.2%
2023-12-10T20:08:06.439489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
 
8.3%
50
 
7.7%
50
 
7.7%
50
 
7.7%
27
 
4.1%
24
 
3.7%
17
 
2.6%
K 15
 
2.3%
U 15
 
2.3%
a 14
 
2.1%
Other values (119) 336
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 582
89.3%
Uppercase Letter 31
 
4.8%
Lowercase Letter 25
 
3.8%
Space Separator 14
 
2.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
9.3%
50
 
8.6%
50
 
8.6%
50
 
8.6%
27
 
4.6%
24
 
4.1%
17
 
2.9%
12
 
2.1%
12
 
2.1%
11
 
1.9%
Other values (106) 275
47.3%
Lowercase Letter
ValueCountFrequency (%)
a 14
56.0%
b 2
 
8.0%
e 2
 
8.0%
o 2
 
8.0%
r 1
 
4.0%
d 1
 
4.0%
s 1
 
4.0%
k 1
 
4.0%
u 1
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
K 15
48.4%
U 15
48.4%
G 1
 
3.2%
Space Separator
ValueCountFrequency (%)
14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 582
89.3%
Latin 56
 
8.6%
Common 14
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
9.3%
50
 
8.6%
50
 
8.6%
50
 
8.6%
27
 
4.6%
24
 
4.1%
17
 
2.9%
12
 
2.1%
12
 
2.1%
11
 
1.9%
Other values (106) 275
47.3%
Latin
ValueCountFrequency (%)
K 15
26.8%
U 15
26.8%
a 14
25.0%
b 2
 
3.6%
e 2
 
3.6%
o 2
 
3.6%
G 1
 
1.8%
r 1
 
1.8%
d 1
 
1.8%
s 1
 
1.8%
Other values (2) 2
 
3.6%
Common
ValueCountFrequency (%)
14
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 582
89.3%
ASCII 70
 
10.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
 
9.3%
50
 
8.6%
50
 
8.6%
50
 
8.6%
27
 
4.6%
24
 
4.1%
17
 
2.9%
12
 
2.1%
12
 
2.1%
11
 
1.9%
Other values (106) 275
47.3%
ASCII
ValueCountFrequency (%)
K 15
21.4%
U 15
21.4%
a 14
20.0%
14
20.0%
b 2
 
2.9%
e 2
 
2.9%
o 2
 
2.9%
G 1
 
1.4%
r 1
 
1.4%
d 1
 
1.4%
Other values (3) 3
 
4.3%
Distinct59
Distinct (%)59.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T20:08:06.988886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length38
Mean length28.55
Min length10

Characters and Unicode

Total characters2855
Distinct characters60
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)30.0%

Sample

1st rowAsellus (Asellus) hilgendorfii Bovalius, 1886
2nd rowAmeletus costalis (Matsumura), 1931
3rd rowEphemera strigata Eaton, 1892
4th rowSialis KUa
5th rowDugesia japonica
ValueCountFrequency (%)
kua 14
 
4.1%
1931 11
 
3.3%
japonica 8
 
2.4%
matsumura 8
 
2.4%
ecdyonurus 7
 
2.1%
ephemera 6
 
1.8%
eaton 6
 
1.8%
tshernova 6
 
1.8%
1952 5
 
1.5%
drunella 5
 
1.5%
Other values (134) 262
77.5%
2023-12-10T20:08:07.794345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 279
 
9.8%
238
 
8.3%
e 176
 
6.2%
s 171
 
6.0%
i 146
 
5.1%
o 144
 
5.0%
n 137
 
4.8%
u 126
 
4.4%
t 118
 
4.1%
l 117
 
4.1%
Other values (50) 1203
42.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2009
70.4%
Decimal Number 268
 
9.4%
Space Separator 238
 
8.3%
Uppercase Letter 199
 
7.0%
Other Punctuation 69
 
2.4%
Close Punctuation 36
 
1.3%
Open Punctuation 36
 
1.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 279
13.9%
e 176
 
8.8%
s 171
 
8.5%
i 146
 
7.3%
o 144
 
7.2%
n 137
 
6.8%
u 126
 
6.3%
t 118
 
5.9%
l 117
 
5.8%
r 110
 
5.5%
Other values (15) 485
24.1%
Uppercase Letter
ValueCountFrequency (%)
E 27
13.6%
K 20
10.1%
U 19
 
9.5%
M 15
 
7.5%
A 15
 
7.5%
B 12
 
6.0%
N 11
 
5.5%
D 11
 
5.5%
S 11
 
5.5%
C 9
 
4.5%
Other values (10) 49
24.6%
Decimal Number
ValueCountFrequency (%)
1 94
35.1%
9 60
22.4%
3 24
 
9.0%
2 21
 
7.8%
8 18
 
6.7%
6 18
 
6.7%
7 15
 
5.6%
5 8
 
3.0%
4 7
 
2.6%
0 3
 
1.1%
Other Punctuation
ValueCountFrequency (%)
, 67
97.1%
? 2
 
2.9%
Space Separator
ValueCountFrequency (%)
238
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2208
77.3%
Common 647
 
22.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 279
12.6%
e 176
 
8.0%
s 171
 
7.7%
i 146
 
6.6%
o 144
 
6.5%
n 137
 
6.2%
u 126
 
5.7%
t 118
 
5.3%
l 117
 
5.3%
r 110
 
5.0%
Other values (35) 684
31.0%
Common
ValueCountFrequency (%)
238
36.8%
1 94
 
14.5%
, 67
 
10.4%
9 60
 
9.3%
) 36
 
5.6%
( 36
 
5.6%
3 24
 
3.7%
2 21
 
3.2%
8 18
 
2.8%
6 18
 
2.8%
Other values (5) 35
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2855
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 279
 
9.8%
238
 
8.3%
e 176
 
6.2%
s 171
 
6.0%
i 146
 
5.1%
o 144
 
5.0%
n 137
 
4.8%
u 126
 
4.4%
t 118
 
4.1%
l 117
 
4.1%
Other values (50) 1203
42.1%

조사년도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2015
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2015
2nd row2015
3rd row2015
4th row2015
5th row2015

Common Values

ValueCountFrequency (%)
2015 100
100.0%

Length

2023-12-10T20:08:08.045203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:08:08.196033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2015 100
100.0%

조사시작일자
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2015-04-18
61 
2015-04-17
39 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2015-04-18
2nd row2015-04-18
3rd row2015-04-18
4th row2015-04-18
5th row2015-04-18

Common Values

ValueCountFrequency (%)
2015-04-18 61
61.0%
2015-04-17 39
39.0%

Length

2023-12-10T20:08:08.355740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:08:08.509443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2015-04-18 61
61.0%
2015-04-17 39
39.0%

조사종료일자
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2015-04-18
61 
2015-04-17
39 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2015-04-18
2nd row2015-04-18
3rd row2015-04-18
4th row2015-04-18
5th row2015-04-18

Common Values

ValueCountFrequency (%)
2015-04-18 61
61.0%
2015-04-17 39
39.0%

Length

2023-12-10T20:08:08.702474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:08:08.867185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2015-04-18 61
61.0%
2015-04-17 39
39.0%

지오메트리
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
POINT (316880.4672395752 479586.75579616614)
39 
POINT (316053.2204866735 477405.9025693582)
36 
POINT (314196.1671657277 478214.53178796865)
19 
POINT (314784.8683120534 477937.2377623521)
POINT (316016.5581539987 477406.3271925916)
 
2

Length

Max length44
Median length44
Mean length43.58
Min length43

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPOINT (314784.8683120534 477937.2377623521)
2nd rowPOINT (314784.8683120534 477937.2377623521)
3rd rowPOINT (314784.8683120534 477937.2377623521)
4th rowPOINT (314784.8683120534 477937.2377623521)
5th rowPOINT (316053.2204866735 477405.9025693582)

Common Values

ValueCountFrequency (%)
POINT (316880.4672395752 479586.75579616614) 39
39.0%
POINT (316053.2204866735 477405.9025693582) 36
36.0%
POINT (314196.1671657277 478214.53178796865) 19
19.0%
POINT (314784.8683120534 477937.2377623521) 4
 
4.0%
POINT (316016.5581539987 477406.3271925916) 2
 
2.0%

Length

2023-12-10T20:08:09.038641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:08:09.227239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
point 100
33.3%
316880.4672395752 39
 
13.0%
479586.75579616614 39
 
13.0%
316053.2204866735 36
 
12.0%
477405.9025693582 36
 
12.0%
314196.1671657277 19
 
6.3%
478214.53178796865 19
 
6.3%
314784.8683120534 4
 
1.3%
477937.2377623521 4
 
1.3%
316016.5581539987 2
 
0.7%

Interactions

2023-12-10T20:08:03.832533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T20:08:09.386341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공간아이디종국명종학명조사시작일자조사종료일자지오메트리
공간아이디1.0000.0000.0000.9950.9950.964
종국명0.0001.0001.0000.0000.0000.000
종학명0.0001.0001.0000.0000.0000.000
조사시작일자0.9950.0000.0001.0000.9991.000
조사종료일자0.9950.0000.0000.9991.0001.000
지오메트리0.9640.0000.0001.0001.0001.000
2023-12-10T20:08:09.538485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지오메트리조사시작일자조사종료일자
지오메트리1.0000.9850.985
조사시작일자0.9851.0000.979
조사종료일자0.9850.9791.000
2023-12-10T20:08:09.677966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공간아이디조사시작일자조사종료일자지오메트리
공간아이디1.0000.9000.9000.715
조사시작일자0.9001.0000.9790.985
조사종료일자0.9000.9791.0000.985
지오메트리0.7150.9850.9851.000

Missing values

2023-12-10T20:08:04.168804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T20:08:04.448724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공간아이디조사분야구분코드명종국명종학명조사년도조사시작일자조사종료일자지오메트리
04000000000003848저서무척추동물물벌레Asellus (Asellus) hilgendorfii Bovalius, 188620152015-04-182015-04-18POINT (314784.8683120534 477937.2377623521)
14000000000003849저서무척추동물피라미하루살이Ameletus costalis (Matsumura), 193120152015-04-182015-04-18POINT (314784.8683120534 477937.2377623521)
24000000000003850저서무척추동물무늬하루살이Ephemera strigata Eaton, 189220152015-04-182015-04-18POINT (314784.8683120534 477937.2377623521)
34000000000003851저서무척추동물좀뱀잠자리 KUaSialis KUa20152015-04-182015-04-18POINT (314784.8683120534 477937.2377623521)
44000000000003852저서무척추동물플라나리아Dugesia japonica20152015-04-182015-04-18POINT (316053.2204866735 477405.9025693582)
54000000000003853저서무척추동물다슬기Semisulcospira libertina20152015-04-182015-04-18POINT (316053.2204866735 477405.9025693582)
64000000000003854저서무척추동물왼돌이물달팽이Physa acuta20152015-04-182015-04-18POINT (316053.2204866735 477405.9025693582)
74000000000003855저서무척추동물피라미하루살이Ameletus costalis (Matsumura), 193120152015-04-182015-04-18POINT (316053.2204866735 477405.9025693582)
84000000000003856저서무척추동물개똥하루살이Baetis fuscatus (Linnaeus), 176120152015-04-182015-04-18POINT (316053.2204866735 477405.9025693582)
94000000000003857저서무척추동물방울하루살이Baetis ursinus Kazlauskas, 196320152015-04-182015-04-18POINT (316053.2204866735 477405.9025693582)
공간아이디조사분야구분코드명종국명종학명조사년도조사시작일자조사종료일자지오메트리
904000000000003938저서무척추동물무늬하루살이Ephemera strigata Eaton, 189220152015-04-182015-04-18POINT (314196.1671657277 478214.53178796865)
914000000000003939저서무척추동물가는무늬하루살이Ephemera separigata Bae, 199520152015-04-182015-04-18POINT (314196.1671657277 478214.53178796865)
924000000000003940저서무척추동물두점하루살이Ecdyonurus kibunensis Imanishi, 193620152015-04-182015-04-18POINT (314196.1671657277 478214.53178796865)
934000000000003941저서무척추동물흰부채하루살이Epeorus nipponicus (Ueno), 193120152015-04-182015-04-18POINT (314196.1671657277 478214.53178796865)
944000000000003942저서무척추동물봄처녀하루살이Cinygmula grandifolia Tshernova, 195220152015-04-182015-04-18POINT (314196.1671657277 478214.53178796865)
954000000000003943저서무척추동물두갈래하루살이Paraleptophlebia japonica (Matsumura), 193120152015-04-182015-04-18POINT (314196.1671657277 478214.53178796865)
964000000000003944저서무척추동물쇠측범잠자리Davidius lunatus (Bartenef), 191420152015-04-182015-04-18POINT (314196.1671657277 478214.53178796865)
974000000000003945저서무척추동물녹색강도래Sweltsa nikkoensis (Okamoto), 191220152015-04-182015-04-18POINT (314196.1671657277 478214.53178796865)
984000000000003946저서무척추동물총채민강도래Amphinemura coreana Zwick, 197320152015-04-182015-04-18POINT (314196.1671657277 478214.53178796865)
994000000000003947저서무척추동물토우민강도래Nemoura tau Zwick, 197320152015-04-182015-04-18POINT (314196.1671657277 478214.53178796865)