Overview

Dataset statistics

Number of variables: 8
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.6 KiB
Average record size in memory: 67.3 B

Variable types

Numeric: 1
Categorical: 5
Text: 2

Alerts

조사분야구분코드명 has constant value "곤충" (Constant)
조사년도 has constant value "2015" (Constant)
조사종료일자 is highly overall correlated with 공간아이디 and 2 other fields (High correlation)
지오메트리 is highly overall correlated with 공간아이디 and 2 other fields (High correlation)
조사시작일자 is highly overall correlated with 공간아이디 and 2 other fields (High correlation)
공간아이디 is highly overall correlated with 조사시작일자 and 2 other fields (High correlation)
공간아이디 has unique values (Unique)
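The Constant and Unique alerts above can be reproduced with a simple count of distinct values. The following is a minimal sketch under that assumption; `basic_alerts` is a hypothetical helper, not the profiling library's own rule set, and the ID list is a stand-in rather than the real column.

```python
from collections import Counter


def basic_alerts(values):
    """Return illustrative 'Constant' / 'Unique' labels for one column."""
    counts = Counter(values)
    alerts = []
    # One distinct value across all rows: the column carries no information.
    if len(counts) == 1:
        alerts.append("Constant")
    # As many distinct values as rows: every row has its own value.
    if len(values) > 1 and len(counts) == len(values):
        alerts.append("Unique")
    return alerts


# A column like 조사년도, where every row is "2015".
print(basic_alerts(["2015"] * 100))
# Stand-in identifiers (not the real 공간아이디 values), all different.
print(basic_alerts(list(range(100))))
```

A column that is neither constant nor fully unique, such as 조사시작일자 with three distinct dates, returns an empty list under these rules.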

Reproduction

Analysis started: 2023-12-10 10:55:57.152596
Analysis finished: 2023-12-10 10:55:58.379715
Duration: 1.23 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
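The reported duration follows directly from the two timestamps. A short check, using only the values printed above:

```python
from datetime import datetime

# Timestamps copied from the Reproduction panel.
started = datetime.fromisoformat("2023-12-10 10:55:57.152596")
finished = datetime.fromisoformat("2023-12-10 10:55:58.379715")

# timedelta keeps microsecond precision; convert to seconds as a float.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```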

Variables

공간아이디
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5 × 10^15
Minimum: 5 × 10^15
Maximum: 5 × 10^15
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 5 × 10^15
5-th percentile: 5 × 10^15
Q1: 5 × 10^15
Median: 5 × 10^15
Q3: 5 × 10^15
95-th percentile: 5 × 10^15
Maximum: 5 × 10^15
Range: 100
Interquartile range (IQR): 50

Descriptive statistics

Standard deviation: 29.460811
Coefficient of variation (CV): 5.8921622 × 10^-15
Kurtosis: -1.2147219
Mean: 5 × 10^15
Median Absolute Deviation (MAD): 25.5
Skewness: 0.10380678
Sum: 5 × 10^17
Variance: 867.93939
Monotonicity: Strictly increasing
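Two of the descriptive statistics are derived from the others: the variance is the square of the standard deviation, and the coefficient of variation is the standard deviation divided by the mean. The sketch below checks this with the figures shown in the panel; the mean is taken as the rounded 5 × 10^15 displayed there, which is an approximation of the true column mean.

```python
# Figures copied from the report; the mean is the rounded display value.
mean = 5e15
std_dev = 29.460811

variance = std_dev ** 2   # compare with the reported 867.93939
cv = std_dev / mean       # compare with the reported 5.8921622e-15

print(f"variance = {variance:.5f}")
print(f"CV       = {cv:.7e}")
```

The tiny CV reflects that the identifiers differ only in their last few digits relative to a magnitude of about 5 × 10^15.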
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5000000000024948 1
 
1.0%
5000000000025013 1
 
1.0%
5000000000025023 1
 
1.0%
5000000000025022 1
 
1.0%
5000000000025021 1
 
1.0%
5000000000025020 1
 
1.0%
5000000000025019 1
 
1.0%
5000000000025018 1
 
1.0%
5000000000025017 1
 
1.0%
5000000000025016 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
5000000000024948 1
1.0%
5000000000024949 1
1.0%
5000000000024950 1
1.0%
5000000000024951 1
1.0%
5000000000024952 1
1.0%
5000000000024953 1
1.0%
5000000000024954 1
1.0%
5000000000024955 1
1.0%
5000000000024956 1
1.0%
5000000000024957 1
1.0%
ValueCountFrequency (%)
5000000000025048 1
1.0%
5000000000025047 1
1.0%
5000000000025046 1
1.0%
5000000000025045 1
1.0%
5000000000025044 1
1.0%
5000000000025043 1
1.0%
5000000000025042 1
1.0%
5000000000025041 1
1.0%
5000000000025040 1
1.0%
5000000000025039 1
1.0%

조사분야구분코드명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
곤충: 100

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 곤충
2nd row: 곤충
3rd row: 곤충
4th row: 곤충
5th row: 곤충

Common Values

ValueCountFrequency (%)
곤충 100
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
곤충 100
100.0%

종국명
Text

Distinct: 87
Distinct (%): 87.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 37
Median length: 17
Mean length: 6.75
Min length: 3

Characters and Unicode

Total characters: 675
Distinct characters: 204
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
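The category names used in the tables below (Other Letter, Lowercase Letter, Space Separator, and so on) correspond to Unicode general categories. As a minimal sketch, Python's standard `unicodedata` module can reproduce such a tally for a single value; `category_counts` and the `CATEGORY_NAMES` mapping are illustrative helpers, not part of the profiling tool.

```python
import unicodedata
from collections import Counter

# Two-letter general-category codes mapped to the labels used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}


def category_counts(text):
    """Count characters of `text` per Unicode general category."""
    counts = Counter()
    for ch in text:
        code = unicodedata.category(ch)
        # Fall back to the raw code for categories outside the mapping.
        counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


print(category_counts("소금쟁이"))
print(category_counts("Mantis religiosa"))
```

Hangul syllables fall under Other Letter, which is why that category dominates a column made up mostly of Korean species names.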

Unique

Unique: 77
Unique (%): 77.0%
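"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. With 87 distinct and 77 unique values, 10 values must appear more than once. A small sketch of the distinction, using a hypothetical helper and a made-up four-row sample:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Made-up sample: one name repeated, two names appearing once.
sample = ["날개알락파리", "참밑들이", "날개알락파리", "소금쟁이"]
print(distinct_and_unique(sample))
```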

Sample

1st row: Mantis religiosa
2nd row: 배노랑물결자나방
3rd row: 소금쟁이
4th row: 긴다리범하늘소
5th row: 날개알락파리
ValueCountFrequency (%)
날개알락파리 4
 
3.8%
참밑들이 3
 
2.8%
왜무잎벌 2
 
1.9%
삿포로수염치레꽃등에 2
 
1.9%
대벌레 2
 
1.9%
모시밑들이 2
 
1.9%
잎벌레붙이 2
 
1.9%
가시털바구미 2
 
1.9%
딱부리소바구미 2
 
1.9%
사과곰보바구미 2
 
1.9%
Other values (83) 83
78.3%

Most occurring characters

ValueCountFrequency (%)
31
 
4.6%
19
 
2.8%
19
 
2.8%
18
 
2.7%
17
 
2.5%
17
 
2.5%
16
 
2.4%
12
 
1.8%
12
 
1.8%
11
 
1.6%
Other values (194) 503
74.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 581
86.1%
Lowercase Letter 76
 
11.3%
Space Separator 6
 
0.9%
Uppercase Letter 5
 
0.7%
Decimal Number 4
 
0.6%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
5.3%
19
 
3.3%
19
 
3.3%
18
 
3.1%
17
 
2.9%
17
 
2.9%
16
 
2.8%
12
 
2.1%
12
 
2.1%
11
 
1.9%
Other values (164) 409
70.4%
Lowercase Letter
ValueCountFrequency (%)
i 11
14.5%
s 9
11.8%
a 8
10.5%
t 6
7.9%
n 6
7.9%
r 6
7.9%
o 6
7.9%
e 5
 
6.6%
c 3
 
3.9%
p 3
 
3.9%
Other values (8) 13
17.1%
Uppercase Letter
ValueCountFrequency (%)
M 2
40.0%
L 1
20.0%
C 1
20.0%
P 1
20.0%
Decimal Number
ValueCountFrequency (%)
4 1
25.0%
6 1
25.0%
8 1
25.0%
1 1
25.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 581
86.1%
Latin 81
 
12.0%
Common 13
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
5.3%
19
 
3.3%
19
 
3.3%
18
 
3.1%
17
 
2.9%
17
 
2.9%
16
 
2.8%
12
 
2.1%
12
 
2.1%
11
 
1.9%
Other values (164) 409
70.4%
Latin
ValueCountFrequency (%)
i 11
13.6%
s 9
11.1%
a 8
9.9%
t 6
 
7.4%
n 6
 
7.4%
r 6
 
7.4%
o 6
 
7.4%
e 5
 
6.2%
c 3
 
3.7%
p 3
 
3.7%
Other values (12) 18
22.2%
Common
ValueCountFrequency (%)
6
46.2%
( 1
 
7.7%
) 1
 
7.7%
4 1
 
7.7%
6 1
 
7.7%
8 1
 
7.7%
1 1
 
7.7%
, 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 581
86.1%
ASCII 94
 
13.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
31
 
5.3%
19
 
3.3%
19
 
3.3%
18
 
3.1%
17
 
2.9%
17
 
2.9%
16
 
2.8%
12
 
2.1%
12
 
2.1%
11
 
1.9%
Other values (164) 409
70.4%
ASCII
ValueCountFrequency (%)
i 11
 
11.7%
s 9
 
9.6%
a 8
 
8.5%
t 6
 
6.4%
6
 
6.4%
n 6
 
6.4%
r 6
 
6.4%
o 6
 
6.4%
e 5
 
5.3%
c 3
 
3.2%
Other values (20) 28
29.8%

종학명
Text

Distinct: 87
Distinct (%): 87.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 58
Median length: 43.5
Mean length: 34.05
Min length: 15

Characters and Unicode

Total characters: 3405
Distinct characters: 66
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 77
Unique (%): 77.0%

Sample

1st row: Mantis religiosa
2nd row: Callygris compositata
3rd row: Aquarius paludum paludum (Fabricius, 1794)
4th row: Rhaphuma gracilipes (Falbermann)
5th row: Prosthiochaeta bifasciata Hara, 1987
ValueCountFrequency (%)
japonica 7
 
1.8%
motschulsky 6
 
1.6%
coreana 4
 
1.0%
bifasciata 4
 
1.0%
lagria 4
 
1.0%
prosthiochaeta 4
 
1.0%
panorpa 4
 
1.0%
hara 4
 
1.0%
1987 4
 
1.0%
matsumura 3
 
0.8%
Other values (266) 338
88.5%

Most occurring characters

ValueCountFrequency (%)
a 330
 
9.7%
282
 
8.3%
i 216
 
6.3%
s 202
 
5.9%
e 179
 
5.3%
r 178
 
5.2%
o 174
 
5.1%
t 146
 
4.3%
n 141
 
4.1%
u 139
 
4.1%
Other values (56) 1418
41.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2397
70.4%
Decimal Number 320
 
9.4%
Space Separator 282
 
8.3%
Uppercase Letter 205
 
6.0%
Other Punctuation 89
 
2.6%
Open Punctuation 55
 
1.6%
Close Punctuation 55
 
1.6%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 330
13.8%
i 216
 
9.0%
s 202
 
8.4%
e 179
 
7.5%
r 178
 
7.4%
o 174
 
7.3%
t 146
 
6.1%
n 141
 
5.9%
u 139
 
5.8%
l 122
 
5.1%
Other values (16) 570
23.8%
Uppercase Letter
ValueCountFrequency (%)
P 24
11.7%
S 20
9.8%
M 20
9.8%
C 18
 
8.8%
L 17
 
8.3%
A 15
 
7.3%
H 14
 
6.8%
B 13
 
6.3%
R 8
 
3.9%
K 7
 
3.4%
Other values (12) 49
23.9%
Decimal Number
ValueCountFrequency (%)
1 97
30.3%
8 67
20.9%
9 40
12.5%
7 35
 
10.9%
5 19
 
5.9%
6 16
 
5.0%
3 13
 
4.1%
2 12
 
3.8%
0 11
 
3.4%
4 10
 
3.1%
Other Punctuation
ValueCountFrequency (%)
, 80
89.9%
? 9
 
10.1%
Open Punctuation
ValueCountFrequency (%)
( 54
98.2%
[ 1
 
1.8%
Close Punctuation
ValueCountFrequency (%)
) 54
98.2%
] 1
 
1.8%
Space Separator
ValueCountFrequency (%)
282
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2602
76.4%
Common 803
 
23.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 330
12.7%
i 216
 
8.3%
s 202
 
7.8%
e 179
 
6.9%
r 178
 
6.8%
o 174
 
6.7%
t 146
 
5.6%
n 141
 
5.4%
u 139
 
5.3%
l 122
 
4.7%
Other values (38) 775
29.8%
Common
ValueCountFrequency (%)
282
35.1%
1 97
 
12.1%
, 80
 
10.0%
8 67
 
8.3%
( 54
 
6.7%
) 54
 
6.7%
9 40
 
5.0%
7 35
 
4.4%
5 19
 
2.4%
6 16
 
2.0%
Other values (8) 59
 
7.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3405
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 330
 
9.7%
282
 
8.3%
i 216
 
6.3%
s 202
 
5.9%
e 179
 
5.3%
r 178
 
5.2%
o 174
 
5.1%
t 146
 
4.3%
n 141
 
4.1%
u 139
 
4.1%
Other values (56) 1418
41.6%

조사년도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
2015: 100

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2015
2nd row: 2015
3rd row: 2015
4th row: 2015
5th row: 2015

Common Values

ValueCountFrequency (%)
2015 100
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
2015 100
100.0%

조사시작일자
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
2015-07-14: 43
2015-07-10: 34
2015-07-16: 23

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2015-07-10
2nd row: 2015-07-10
3rd row: 2015-07-10
4th row: 2015-07-10
5th row: 2015-07-10

Common Values

ValueCountFrequency (%)
2015-07-14 43
43.0%
2015-07-10 34
34.0%
2015-07-16 23
23.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
2015-07-14 43
43.0%
2015-07-10 34
34.0%
2015-07-16 23
23.0%

조사종료일자
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
2015-07-14: 43
2015-07-10: 34
2015-07-16: 23

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2015-07-10
2nd row: 2015-07-10
3rd row: 2015-07-10
4th row: 2015-07-10
5th row: 2015-07-10

Common Values

ValueCountFrequency (%)
2015-07-14 43
43.0%
2015-07-10 34
34.0%
2015-07-16 23
23.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
2015-07-14 43
43.0%
2015-07-10 34
34.0%
2015-07-16 23
23.0%

지오메트리
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B
POINT (309268.62484358996 471832.6557016486): 43
POINT (310367.83953518246 469369.6240978787): 34
POINT (312371.48108733847 488267.6946266303): 23

Length

Max length: 44
Median length: 44
Mean length: 44
Min length: 44

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: POINT (310367.83953518246 469369.6240978787)
2nd row: POINT (310367.83953518246 469369.6240978787)
3rd row: POINT (310367.83953518246 469369.6240978787)
4th row: POINT (310367.83953518246 469369.6240978787)
5th row: POINT (310367.83953518246 469369.6240978787)

Common Values

ValueCountFrequency (%)
POINT (309268.62484358996 471832.6557016486) 43
43.0%
POINT (310367.83953518246 469369.6240978787) 34
34.0%
POINT (312371.48108733847 488267.6946266303) 23
23.0%
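The 지오메트리 values are stored as text in well-known text (WKT) point notation, which is why the profiler treats the column as categorical. To work with the coordinates numerically they have to be parsed first. The sketch below is one way to do that with the standard library; `parse_point` is a hypothetical helper that handles only plain 2D `POINT (x y)` strings, and the coordinate reference system is not stated in the report, so the numbers are returned as-is.

```python
import re

# Matches "POINT (x y)" with optional sign and decimals; 2D points only.
_POINT_RE = re.compile(
    r"^\s*POINT\s*\(\s*(-?\d+(?:\.\d+)?)\s+(-?\d+(?:\.\d+)?)\s*\)\s*$"
)


def parse_point(wkt):
    """Parse a 2D WKT point string into an (x, y) tuple of floats."""
    match = _POINT_RE.match(wkt)
    if match is None:
        raise ValueError(f"not a 2D WKT point: {wkt!r}")
    return float(match.group(1)), float(match.group(2))


x, y = parse_point("POINT (310367.83953518246 469369.6240978787)")
print(x, y)
```

For anything beyond simple points, a dedicated geometry library would be a more robust choice than a regular expression.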

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
point 100
33.3%
309268.62484358996 43
14.3%
471832.6557016486 43
14.3%
310367.83953518246 34
 
11.3%
469369.6240978787 34
 
11.3%
312371.48108733847 23
 
7.7%
488267.6946266303 23
 
7.7%

Interactions


Correlations

 | 공간아이디 | 종국명 | 종학명 | 조사시작일자 | 조사종료일자 | 지오메트리
공간아이디 | 1.000 | 0.550 | 0.550 | 0.936 | 0.936 | 0.936
종국명 | 0.550 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000
종학명 | 0.550 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000
조사시작일자 | 0.936 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000
조사종료일자 | 0.936 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000
지오메트리 | 0.936 | 0.000 | 0.000 | 1.000 | 1.000 | 1.000

 | 조사종료일자 | 지오메트리 | 조사시작일자
조사종료일자 | 1.000 | 1.000 | 1.000
지오메트리 | 1.000 | 1.000 | 1.000
조사시작일자 | 1.000 | 1.000 | 1.000

 | 공간아이디 | 조사시작일자 | 조사종료일자 | 지오메트리
공간아이디 | 1.000 | 0.883 | 0.883 | 0.883
조사시작일자 | 0.883 | 1.000 | 1.000 | 1.000
조사종료일자 | 0.883 | 1.000 | 1.000 | 1.000
지오메트리 | 0.883 | 1.000 | 1.000 | 1.000
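The 1.000 entries between 조사시작일자, 조사종료일자 and 지오메트리 are what one would expect when each value of one column maps to exactly one value of the other. The extracted text does not name the coefficient behind each matrix, so the sketch below assumes plain (uncorrected) Cramér's V as an illustration; the tool's own computation may differ, for example by applying a bias correction. The site labels are hypothetical stand-ins for the three POINT strings.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    x_totals = Counter(xs)
    y_totals = Counter(ys)
    chi2 = 0.0
    for x, cx in x_totals.items():
        for y, cy in y_totals.items():
            expected = cx * cy / n          # count expected under independence
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_totals), len(y_totals)) - 1
    if k == 0:
        return float("nan")                 # undefined for a constant column
    return sqrt(chi2 / (n * k))


# Row counts per survey date as reported (34, 43, 23); labels are stand-ins.
dates = ["2015-07-10"] * 34 + ["2015-07-14"] * 43 + ["2015-07-16"] * 23
sites = ["site_A"] * 34 + ["site_B"] * 43 + ["site_C"] * 23
print(round(cramers_v(dates, sites), 3))
```

Because each date pairs with a single location, the columns carry the same information, and keeping all three in a downstream model would be redundant.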

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

 | 공간아이디 | 조사분야구분코드명 | 종국명 | 종학명 | 조사년도 | 조사시작일자 | 조사종료일자 | 지오메트리
0 | 5000000000024948 | 곤충 | Mantis religiosa | Mantis religiosa | 2015 | 2015-07-10 | 2015-07-10 | POINT (310367.83953518246 469369.6240978787)
1 | 5000000000024949 | 곤충 | 배노랑물결자나방 | Callygris compositata | 2015 | 2015-07-10 | 2015-07-10 | POINT (310367.83953518246 469369.6240978787)
2 | 5000000000024950 | 곤충 | 소금쟁이 | Aquarius paludum paludum (Fabricius, 1794) | 2015 | 2015-07-10 | 2015-07-10 | POINT (310367.83953518246 469369.6240978787)
3 | 5000000000024951 | 곤충 | 긴다리범하늘소 | Rhaphuma gracilipes (Falbermann) | 2015 | 2015-07-10 | 2015-07-10 | POINT (310367.83953518246 469369.6240978787)
4 | 5000000000024952 | 곤충 | 날개알락파리 | Prosthiochaeta bifasciata Hara, 1987 | 2015 | 2015-07-10 | 2015-07-10 | POINT (310367.83953518246 469369.6240978787)
5 | 5000000000024953 | 곤충 | 북방목가는병대벌레 | Podabrus annulatus Mannerheim, 1825 | 2015 | 2015-07-10 | 2015-07-10 | POINT (310367.83953518246 469369.6240978787)
6 | 5000000000024954 | 곤충 | 일본광채꽃벌 | Ceratina (Ceratinida) japonica Cockerell, 1911 | 2015 | 2015-07-10 | 2015-07-10 | POINT (310367.83953518246 469369.6240978787)
7 | 5000000000024955 | 곤충 | 소나무무당벌레 | Harmonia yedoensis (Takizawa, 1917) | 2015 | 2015-07-10 | 2015-07-10 | POINT (310367.83953518246 469369.6240978787)
8 | 5000000000024956 | 곤충 | 모시밑들이 | Panorpodes paradoxus MacLachlan, 1875 | 2015 | 2015-07-10 | 2015-07-10 | POINT (310367.83953518246 469369.6240978787)
9 | 5000000000024957 | 곤충 | 사과곰보바구미 | Dyscerus exsculptus | 2015 | 2015-07-10 | 2015-07-10 | POINT (310367.83953518246 469369.6240978787)

 | 공간아이디 | 조사분야구분코드명 | 종국명 | 종학명 | 조사년도 | 조사시작일자 | 조사종료일자 | 지오메트리
90 | 5000000000025039 | 곤충 | 총채민강도래 | Amphinemura coreana Zwick, 1973 | 2015 | 2015-07-14 | 2015-07-14 | POINT (309268.62484358996 471832.6557016486)
91 | 5000000000025040 | 곤충 | 탈장님노린재 | Eurystylus coelestialium (Kirkaldy), 1902 | 2015 | 2015-07-14 | 2015-07-14 | POINT (309268.62484358996 471832.6557016486)
92 | 5000000000025041 | 곤충 | 끝검정알락꽃등에 | Allobaccha apicalis (Loew, 1858) | 2015 | 2015-07-14 | 2015-07-14 | POINT (309268.62484358996 471832.6557016486)
93 | 5000000000025042 | 곤충 | 날개알락파리 | Prosthiochaeta bifasciata Hara, 1987 | 2015 | 2015-07-14 | 2015-07-14 | POINT (309268.62484358996 471832.6557016486)
94 | 5000000000025043 | 곤충 | 루리허리꽃등에 | Xylota coquilletti Herv?Bazin, 1914 | 2015 | 2015-07-14 | 2015-07-14 | POINT (309268.62484358996 471832.6557016486)
95 | 5000000000025044 | 곤충 | 수중다리꽃등에 | Helophilus virgatus Coquillett, 1898 | 2015 | 2015-07-14 | 2015-07-14 | POINT (309268.62484358996 471832.6557016486)
96 | 5000000000025045 | 곤충 | 알락꽃등에 | Baccha maculata Walker, 1852 | 2015 | 2015-07-14 | 2015-07-14 | POINT (309268.62484358996 471832.6557016486)
97 | 5000000000025046 | 곤충 | 울보꽃등에 | Pseudovolucella decipiens (Herv?Bazin, 1914) | 2015 | 2015-07-14 | 2015-07-14 | POINT (309268.62484358996 471832.6557016486)
98 | 5000000000025047 | 곤충 | 쟈바꽃등에 | Allograpta javana (Wiedemann), 1824 | 2015 | 2015-07-14 | 2015-07-14 | POINT (309268.62484358996 471832.6557016486)
99 | 5000000000025048 | 곤충 | 참꽃등에 | Xanthogramma coreanum Shiraki, 1930 | 2015 | 2015-07-14 | 2015-07-14 | POINT (309268.62484358996 471832.6557016486)