Overview

Dataset statistics

Number of variables8
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.6 KiB
Average record size in memory67.3 B

Variable types

Numeric1
Categorical3
Text2
DateTime2

Alerts

조사분야구분코드명 has constant value ""Constant
조사년도 has constant value ""Constant
공간아이디 is highly overall correlated with 지오메트리High correlation
지오메트리 is highly overall correlated with 공간아이디High correlation
공간아이디 has unique valuesUnique

Reproduction

Analysis started2023-12-10 12:30:48.051800
Analysis finished2023-12-10 12:30:49.236899
Duration1.19 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공간아이디
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3 × 1015
Minimum3 × 1015
Maximum3 × 1015
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:30:49.691575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3 × 1015
5-th percentile3 × 1015
Q13 × 1015
median3 × 1015
Q33 × 1015
95-th percentile3 × 1015
Maximum3 × 1015
Range100
Interquartile range (IQR)50

Descriptive statistics

Standard deviation29.393361
Coefficient of variation (CV)9.7977871 × 10-15
Kurtosis-1.204652
Mean3 × 1015
Median Absolute Deviation (MAD)25.5
Skewness0.049701676
Sum3 × 1017
Variance863.9697
MonotonicityStrictly increasing
2023-12-10T21:30:50.103517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3000000001060775 1
 
1.0%
3000000001060839 1
 
1.0%
3000000001060850 1
 
1.0%
3000000001060849 1
 
1.0%
3000000001060848 1
 
1.0%
3000000001060847 1
 
1.0%
3000000001060846 1
 
1.0%
3000000001060845 1
 
1.0%
3000000001060844 1
 
1.0%
3000000001060842 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
3000000001060775 1
1.0%
3000000001060776 1
1.0%
3000000001060777 1
1.0%
3000000001060778 1
1.0%
3000000001060779 1
1.0%
3000000001060780 1
1.0%
3000000001060781 1
1.0%
3000000001060782 1
1.0%
3000000001060783 1
1.0%
3000000001060784 1
1.0%
ValueCountFrequency (%)
3000000001060875 1
1.0%
3000000001060874 1
1.0%
3000000001060873 1
1.0%
3000000001060872 1
1.0%
3000000001060871 1
1.0%
3000000001060870 1
1.0%
3000000001060869 1
1.0%
3000000001060868 1
1.0%
3000000001060867 1
1.0%
3000000001060866 1
1.0%

조사분야구분코드명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
식물상
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row식물상
2nd row식물상
3rd row식물상
4th row식물상
5th row식물상

Common Values

ValueCountFrequency (%)
식물상 100
100.0%

Length

2023-12-10T21:30:50.318422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:30:50.450994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
식물상 100
100.0%
Distinct74
Distinct (%)74.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T21:30:50.804599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.02
Min length2

Characters and Unicode

Total characters402
Distinct characters131
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)55.0%

Sample

1st row일본잎갈나무
2nd row멸가치
3rd row그늘사초
4th row생강나무
5th row졸방제비꽃
ValueCountFrequency (%)
산괴불주머니 4
 
4.0%
소나무 4
 
4.0%
일본잎갈나무 3
 
3.0%
생강나무 3
 
3.0%
꽃다지 3
 
3.0%
미국쑥부쟁이 2
 
2.0%
버드나무 2
 
2.0%
은행나무 2
 
2.0%
층층나무 2
 
2.0%
쪽동백나무 2
 
2.0%
Other values (64) 73
73.0%
2023-12-10T21:30:51.482574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52
 
12.9%
46
 
11.4%
12
 
3.0%
8
 
2.0%
8
 
2.0%
8
 
2.0%
7
 
1.7%
7
 
1.7%
7
 
1.7%
6
 
1.5%
Other values (121) 241
60.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 402
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
52
 
12.9%
46
 
11.4%
12
 
3.0%
8
 
2.0%
8
 
2.0%
8
 
2.0%
7
 
1.7%
7
 
1.7%
7
 
1.7%
6
 
1.5%
Other values (121) 241
60.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 402
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
12.9%
46
 
11.4%
12
 
3.0%
8
 
2.0%
8
 
2.0%
8
 
2.0%
7
 
1.7%
7
 
1.7%
7
 
1.7%
6
 
1.5%
Other values (121) 241
60.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 402
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
52
 
12.9%
46
 
11.4%
12
 
3.0%
8
 
2.0%
8
 
2.0%
8
 
2.0%
7
 
1.7%
7
 
1.7%
7
 
1.7%
6
 
1.5%
Other values (121) 241
60.0%
Distinct74
Distinct (%)74.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T21:30:52.001181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length29
Mean length17.86
Min length11

Characters and Unicode

Total characters1786
Distinct characters50
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)55.0%

Sample

1st rowLarix kaempferi
2nd rowAdenocaulon himalaicum
3rd rowCarex lanceolata
4th rowLindera obtusiloba
5th rowViola acuminata
ValueCountFrequency (%)
corydalis 6
 
2.8%
salix 6
 
2.8%
pinus 5
 
2.3%
prunus 4
 
1.9%
densiflora 4
 
1.9%
subsp 4
 
1.9%
viola 4
 
1.9%
speciosa 4
 
1.9%
var 3
 
1.4%
quercus 3
 
1.4%
Other values (129) 173
80.1%
2023-12-10T21:30:53.026790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 233
13.0%
i 162
 
9.1%
s 134
 
7.5%
e 117
 
6.6%
116
 
6.5%
r 110
 
6.2%
o 100
 
5.6%
n 96
 
5.4%
u 95
 
5.3%
l 92
 
5.2%
Other values (40) 531
29.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1560
87.3%
Space Separator 116
 
6.5%
Uppercase Letter 100
 
5.6%
Other Punctuation 7
 
0.4%
Math Symbol 2
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 233
14.9%
i 162
10.4%
s 134
 
8.6%
e 117
 
7.5%
r 110
 
7.1%
o 100
 
6.4%
n 96
 
6.2%
u 95
 
6.1%
l 92
 
5.9%
t 70
 
4.5%
Other values (16) 351
22.5%
Uppercase Letter
ValueCountFrequency (%)
C 16
16.0%
P 16
16.0%
A 14
14.0%
S 11
11.0%
D 7
7.0%
L 7
7.0%
M 4
 
4.0%
R 4
 
4.0%
V 4
 
4.0%
Q 3
 
3.0%
Other values (10) 14
14.0%
Space Separator
ValueCountFrequency (%)
116
100.0%
Other Punctuation
ValueCountFrequency (%)
. 7
100.0%
Math Symbol
ValueCountFrequency (%)
× 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1660
92.9%
Common 126
 
7.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 233
14.0%
i 162
 
9.8%
s 134
 
8.1%
e 117
 
7.0%
r 110
 
6.6%
o 100
 
6.0%
n 96
 
5.8%
u 95
 
5.7%
l 92
 
5.5%
t 70
 
4.2%
Other values (36) 451
27.2%
Common
ValueCountFrequency (%)
116
92.1%
. 7
 
5.6%
× 2
 
1.6%
- 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1784
99.9%
None 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 233
13.1%
i 162
 
9.1%
s 134
 
7.5%
e 117
 
6.6%
116
 
6.5%
r 110
 
6.2%
o 100
 
5.6%
n 96
 
5.4%
u 95
 
5.3%
l 92
 
5.2%
Other values (39) 529
29.7%
None
ValueCountFrequency (%)
× 2
100.0%

조사년도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2016
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2016
2nd row2016
3rd row2016
4th row2016
5th row2016

Common Values

ValueCountFrequency (%)
2016 100
100.0%

Length

2023-12-10T21:30:53.270900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:30:53.504652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2016 100
100.0%
Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2016-04-02 00:00:00
Maximum2016-04-27 00:00:00
2023-12-10T21:30:53.681149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:30:53.861039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2016-04-02 00:00:00
Maximum2016-04-27 00:00:00
2023-12-10T21:30:54.042966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:30:54.233287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=2)

지오메트리
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
POINT (256189.1797000002 591938.8450000007)
26 
POINT (257568.9894000003 595435.7544)
22 
POINT (254891.85039999988 596373.2191000003)
21 
POINT (257158.4592000004 593040.1538999993)
13 
POINT (256692.8049999997 592574.4085000008)
Other values (2)

Length

Max length44
Median length43
Mean length41.98
Min length37

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPOINT (257158.4592000004 593040.1538999993)
2nd rowPOINT (257158.4592000004 593040.1538999993)
3rd rowPOINT (257158.4592000004 593040.1538999993)
4th rowPOINT (257158.4592000004 593040.1538999993)
5th rowPOINT (257158.4592000004 593040.1538999993)

Common Values

ValueCountFrequency (%)
POINT (256189.1797000002 591938.8450000007) 26
26.0%
POINT (257568.9894000003 595435.7544) 22
22.0%
POINT (254891.85039999988 596373.2191000003) 21
21.0%
POINT (257158.4592000004 593040.1538999993) 13
13.0%
POINT (256692.8049999997 592574.4085000008) 9
 
9.0%
POINT (258488.64910000004 593145.1279000007) 7
 
7.0%
POINT (257294.08449999988 592853.0142000001) 2
 
2.0%

Length

2023-12-10T21:30:54.468040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:30:54.661382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
point 100
33.3%
256189.1797000002 26
 
8.7%
591938.8450000007 26
 
8.7%
257568.9894000003 22
 
7.3%
595435.7544 22
 
7.3%
254891.85039999988 21
 
7.0%
596373.2191000003 21
 
7.0%
257158.4592000004 13
 
4.3%
593040.1538999993 13
 
4.3%
256692.8049999997 9
 
3.0%
Other values (5) 27
 
9.0%

Interactions

2023-12-10T21:30:48.625309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:30:54.828658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공간아이디종국명종학명조사시작일자조사종료일자지오메트리
공간아이디1.0000.0000.0000.9930.9930.916
종국명0.0001.0001.0000.6140.6140.000
종학명0.0001.0001.0000.6140.6140.000
조사시작일자0.9930.6140.6141.0000.9991.000
조사종료일자0.9930.6140.6140.9991.0001.000
지오메트리0.9160.0000.0001.0001.0001.000
2023-12-10T21:30:54.985773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공간아이디지오메트리
공간아이디1.0000.772
지오메트리0.7721.000

Missing values

2023-12-10T21:30:48.931743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T21:30:49.157548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공간아이디조사분야구분코드명종국명종학명조사년도조사시작일자조사종료일자지오메트리
03000000001060775식물상일본잎갈나무Larix kaempferi20162016-04-022016-04-02POINT (257158.4592000004 593040.1538999993)
13000000001060776식물상멸가치Adenocaulon himalaicum20162016-04-022016-04-02POINT (257158.4592000004 593040.1538999993)
23000000001060777식물상그늘사초Carex lanceolata20162016-04-022016-04-02POINT (257158.4592000004 593040.1538999993)
33000000001060778식물상생강나무Lindera obtusiloba20162016-04-022016-04-02POINT (257158.4592000004 593040.1538999993)
43000000001060779식물상졸방제비꽃Viola acuminata20162016-04-022016-04-02POINT (257158.4592000004 593040.1538999993)
53000000001060780식물상가락지나물Potentilla kleiniana20162016-04-022016-04-02POINT (257158.4592000004 593040.1538999993)
63000000001060781식물상개별꽃Pseudostellaria heterophylla20162016-04-022016-04-02POINT (257158.4592000004 593040.1538999993)
73000000001060782식물상올괴불나무Lonicera praeflorens20162016-04-022016-04-02POINT (257158.4592000004 593040.1538999993)
83000000001060783식물상점현호색Corydalis maculata20162016-04-022016-04-02POINT (257158.4592000004 593040.1538999993)
93000000001060784식물상개고사리Athyrium niponicum20162016-04-022016-04-02POINT (257158.4592000004 593040.1538999993)
공간아이디조사분야구분코드명종국명종학명조사년도조사시작일자조사종료일자지오메트리
903000000001060866식물상십자고사리Polystichum tripteron20162016-04-272016-04-27POINT (256189.1797000002 591938.8450000007)
913000000001060867식물상양지꽃Potentilla fragarioides20162016-04-272016-04-27POINT (256189.1797000002 591938.8450000007)
923000000001060868식물상갈참나무Quercus aliena20162016-04-272016-04-27POINT (256189.1797000002 591938.8450000007)
933000000001060869식물상고광나무Philadelphus schrenkii20162016-04-272016-04-27POINT (256189.1797000002 591938.8450000007)
943000000001060870식물상산뽕나무Morus bombycis20162016-04-272016-04-27POINT (256189.1797000002 591938.8450000007)
953000000001060871식물상제비꽃Viola mandshurica20162016-04-272016-04-27POINT (256189.1797000002 591938.8450000007)
963000000001060872식물상실청사초Carex sabynensis20162016-04-272016-04-27POINT (256189.1797000002 591938.8450000007)
973000000001060873식물상점나도나물Cerastium fontanum subsp. vulgare20162016-04-272016-04-27POINT (256189.1797000002 591938.8450000007)
983000000001060874식물상당느릅나무Ulmus davidiana20162016-04-272016-04-27POINT (256189.1797000002 591938.8450000007)
993000000001060875식물상애기수영Rumex acetosella20162016-04-272016-04-27POINT (256189.1797000002 591938.8450000007)