Dataset statistics
Number of variables | 17 |
---|---|
Number of observations | 10000 |
Missing cells | 22206 |
Missing cells (%) | 13.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.4 MiB |
Average record size in memory | 142.0 B |
Variable types
Numeric | 1 |
---|---|
Categorical | 14 |
Unsupported | 2 |
Dataset
Description | 식물자원정보 |
---|---|
Author | 농림수산식품교육문화정보원 |
URL | https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220210000000001803 |
LIFE_RESRCE_LTTOT_AT has constant value "-" | Constant |
RESRCE_NO has a high cardinality: 10000 distinct values | High cardinality |
SCNCENM_CD has a high cardinality: 1344 distinct values | High cardinality |
SCNCENM has a high cardinality: 1461 distinct values | High cardinality |
TNOAC has a high cardinality: 1409 distinct values | High cardinality |
IMAGE_URL has a high cardinality: 848 distinct values | High cardinality |
LIFE_RESRCE_LTTOT_AT is highly correlated with LAST_UPDT_DE and 7 other fields | High correlation |
LAST_UPDT_DE is highly correlated with LIFE_RESRCE_LTTOT_AT and 2 other fields | High correlation |
LIFE_RESRCE_STLE_CD_NM is highly correlated with LIFE_RESRCE_LTTOT_AT and 3 other fields | High correlation |
INSTT_CD_KOREA_NM is highly correlated with LIFE_RESRCE_LTTOT_AT and 1 other fields | High correlation |
OUTNATN_TKOUT_AT is highly correlated with LIFE_RESRCE_LTTOT_AT | High correlation |
INSTT_CD is highly correlated with LIFE_RESRCE_LTTOT_AT and 1 other fields | High correlation |
LIFE_RESRCE_KND_CD_NM is highly correlated with LIFE_RESRCE_LTTOT_AT and 4 other fields | High correlation |
LIFE_RESRCE_KND_CD is highly correlated with LIFE_RESRCE_LTTOT_AT and 4 other fields | High correlation |
LIFE_RESRCE_STLE_CD is highly correlated with LIFE_RESRCE_LTTOT_AT and 3 other fields | High correlation |
DETAIL_INFO_URL has 10000 (100.0%) missing values | Missing |
IMAGE_URL has 2186 (21.9%) missing values | Missing |
SPCIES_PRTC_APLC_AT has 10000 (100.0%) missing values | Missing |
df_index has unique values | Unique |
RESRCE_NO has unique values | Unique |
DETAIL_INFO_URL is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
SPCIES_PRTC_APLC_AT is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2022-08-12 14:44:46.341854 |
---|---|
Analysis finished | 2022-08-12 14:44:50.248796 |
Duration | 3.91 seconds |
Software version | pandas-profiling v3.2.0 |
Download configuration | config.json |
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 23734.2535 |
Minimum | 15 |
---|---|
Maximum | 47825 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 78.2 KiB |
Quantile statistics
Minimum | 15 |
---|---|
5-th percentile | 2279.9 |
Q1 | 11760.5 |
median | 23708.5 |
Q3 | 35684.5 |
95-th percentile | 45357.15 |
Maximum | 47825 |
Range | 47810 |
Interquartile range (IQR) | 23924 |
Descriptive statistics
Standard deviation | 13854.30096 |
---|---|
Coefficient of variation (CV) | 0.5837260043 |
Kurtosis | -1.20696792 |
Mean | 23734.2535 |
Median Absolute Deviation (MAD) | 11963.5 |
Skewness | 0.0127921497 |
Sum | 237342535 |
Variance | 191941655.1 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
786 | 1 | < 0.1% |
24310 | 1 | < 0.1% |
4314 | 1 | < 0.1% |
32803 | 1 | < 0.1% |
21444 | 1 | < 0.1% |
45918 | 1 | < 0.1% |
11277 | 1 | < 0.1% |
3720 | 1 | < 0.1% |
29717 | 1 | < 0.1% |
34661 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
15 | 1 | |
16 | 1 | |
21 | 1 | |
23 | 1 | |
24 | 1 | |
26 | 1 | |
27 | 1 | |
33 | 1 | |
35 | 1 | |
40 | 1 |
Value | Count | Frequency (%) |
47825 | 1 | |
47822 | 1 | |
47813 | 1 | |
47809 | 1 | |
47799 | 1 | |
47789 | 1 | |
47786 | 1 | |
47783 | 1 | |
47779 | 1 | |
47776 | 1 |
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
14001191150200-071-00036799 | 1 |
---|---|
14001191120200-020-00013204 | 1 |
14003771110102-010-00002132 | 1 |
14003771110103-010-00002249 | 1 |
14001191120200-020-00001220 | 1 |
Other values (9995) |
Length
Max length | 27 |
---|---|
Median length | 27 |
Mean length | 27 |
Min length | 27 |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 14001191150200-071-00036799 |
---|---|
2nd row | 14001191120200-020-00001235 |
3rd row | 14003771110103-010-00002249 |
4th row | 14001191120200-020-00001220 |
5th row | 14001191110200-010-00003358 |
Common Values
Value | Count | Frequency (%) |
14001191150200-071-00036799 | 1 | < 0.1% |
14001191120200-020-00013204 | 1 | < 0.1% |
14003771110102-010-00002132 | 1 | < 0.1% |
14003771110103-010-00002249 | 1 | < 0.1% |
14001191120200-020-00001220 | 1 | < 0.1% |
14001191110200-010-00003358 | 1 | < 0.1% |
14003771110102-010-00005164 | 1 | < 0.1% |
14003771110103-010-00001813 | 1 | < 0.1% |
14001191120200-020-00007281 | 1 | < 0.1% |
14001191120200-020-00009602 | 1 | < 0.1% |
Other values (9990) | 9990 |
Length
Value | Count | Frequency (%) |
14001191150200-071-00036799 | 1 | < 0.1% |
14003771110102-010-00009472 | 1 | < 0.1% |
14003771110102-010-00001342 | 1 | < 0.1% |
14003771110102-010-00004960 | 1 | < 0.1% |
14001191120200-020-00001055 | 1 | < 0.1% |
14001191110200-010-00003838 | 1 | < 0.1% |
14001191120200-020-00017104 | 1 | < 0.1% |
14003771110102-010-00007390 | 1 | < 0.1% |
14003771110102-010-00002653 | 1 | < 0.1% |
14001191120200-020-00003870 | 1 | < 0.1% |
Other values (9990) | 9990 |
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
1 | |
---|---|
2 | |
5 | 224 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 5 |
---|---|
2nd row | 2 |
3rd row | 1 |
4th row | 2 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 6272 | |
2 | 3504 | |
5 | 224 | 2.2% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
1 | 6272 | |
2 | 3504 | |
5 | 224 | 2.2% |
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
종자 | |
---|---|
영양체 | |
표본 | 224 |
Length
Max length | 3 |
---|---|
Median length | 2 |
Mean length | 2.3504 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 표본 |
---|---|
2nd row | 영양체 |
3rd row | 종자 |
4th row | 영양체 |
5th row | 종자 |
Common Values
Value | Count | Frequency (%) |
종자 | 6272 | |
영양체 | 3504 | |
표본 | 224 | 2.2% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
종자 | 6272 | |
영양체 | 3504 | |
표본 | 224 | 2.2% |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
1 | |
---|---|
7 | 224 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 7 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 9776 | |
7 | 224 | 2.2% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
1 | 9776 | |
7 | 224 | 2.2% |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
식량작물 | |
---|---|
수목류 | 224 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9776 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 수목류 |
---|---|
2nd row | 식량작물 |
3rd row | 식량작물 |
4th row | 식량작물 |
5th row | 식량작물 |
Common Values
Value | Count | Frequency (%) |
식량작물 | 9776 | |
수목류 | 224 | 2.2% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
식량작물 | 9776 | |
수목류 | 224 | 2.2% |
Distinct | 1344 |
---|---|
Distinct (%) | 13.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
BSD0003395965 | 249 |
---|---|
BSD0001596891 | 198 |
BSD0001918795 | 164 |
BSD0003483320 | 144 |
BSD0001723541 | 133 |
Other values (1339) |
Length
Max length | 13 |
---|---|
Median length | 13 |
Mean length | 13 |
Min length | 13 |
Unique
Unique | 478 ? |
---|---|
Unique (%) | 4.8% |
Sample
1st row | BSD0000755943 |
---|---|
2nd row | BSD0000792728 |
3rd row | BSD0003129056 |
4th row | BSD0002676251 |
5th row | BSD0001384123 |
Common Values
Value | Count | Frequency (%) |
BSD0003395965 | 249 | 2.5% |
BSD0001596891 | 198 | 2.0% |
BSD0001918795 | 164 | 1.6% |
BSD0003483320 | 144 | 1.4% |
BSD0001723541 | 133 | 1.3% |
BSD0003400683 | 128 | 1.3% |
BSD0000846780 | 125 | 1.2% |
BSD0003330177 | 115 | 1.1% |
BSD0003250789 | 114 | 1.1% |
BSD0002997219 | 110 | 1.1% |
Other values (1334) | 8520 |
Length
Value | Count | Frequency (%) |
bsd0003395965 | 249 | 2.5% |
bsd0001596891 | 198 | 2.0% |
bsd0001918795 | 164 | 1.6% |
bsd0003483320 | 144 | 1.4% |
bsd0001723541 | 133 | 1.3% |
bsd0003400683 | 128 | 1.3% |
bsd0000846780 | 125 | 1.2% |
bsd0003330177 | 115 | 1.1% |
bsd0003250789 | 114 | 1.1% |
bsd0002997219 | 110 | 1.1% |
Other values (1334) | 8520 |
Distinct | 1461 |
---|---|
Distinct (%) | 14.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
Abies koreana Wilson | 234 |
---|---|
Pinus densiflora Siebold & Zucc. | 167 |
Rhododendron yedoense for. poukhanense (H.Lev.) Sugim. | 161 |
Acer pseudosieboldianum (Pax) Kom. | 144 |
Rhododendron schlippenbachii Maxim. | 133 |
Other values (1456) |
Length
Max length | 72 |
---|---|
Median length | 58 |
Mean length | 31.958 |
Min length | 8 |
Unique
Unique | 569 ? |
---|---|
Unique (%) | 5.7% |
Sample
1st row | Carduus crispus L. |
---|---|
2nd row | Magnolia sieboldii k.koch |
3rd row | Machilus thunbergii Siebold & Zucc. |
4th row | Berberis thunbergii DC. |
5th row | Bidens frondosa L. |
Common Values
Value | Count | Frequency (%) |
Abies koreana Wilson | 234 | 2.3% |
Pinus densiflora Siebold & Zucc. | 167 | 1.7% |
Rhododendron yedoense for. poukhanense (H.Lev.) Sugim. | 161 | 1.6% |
Acer pseudosieboldianum (Pax) Kom. | 144 | 1.4% |
Rhododendron schlippenbachii Maxim. | 133 | 1.3% |
Viburnum odoratissimum var. awabuki (K.Koch) Zabel ex Rumpler | 128 | 1.3% |
Camellia sinensis L. | 125 | 1.2% |
Melia azedarach L. | 115 | 1.1% |
Quercus aliena Blume | 114 | 1.1% |
Pinus koraiensis Siebold & Zucc. | 110 | 1.1% |
Other values (1451) | 8569 |
Length
Value | Count | Frequency (%) |
1481 | 3.5% | |
var | 1433 | 3.4% |
thunb | 1299 | 3.1% |
l | 1123 | 2.7% |
siebold | 1098 | 2.6% |
zucc | 1061 | 2.5% |
ex | 893 | 2.1% |
nakai | 875 | 2.1% |
maxim | 781 | 1.9% |
for | 651 | 1.5% |
Other values (2136) | 31460 |
Distinct | 1409 |
---|---|
Distinct (%) | 14.1% |
Missing | 20 |
Missing (%) | 0.2% |
Memory size | 78.2 KiB |
구상나무 | 245 |
---|---|
소나무 | 167 |
산철쭉 | 164 |
당단풍나무 | 144 |
철쭉 | 133 |
Other values (1404) |
Length
Max length | 17 |
---|---|
Median length | 14 |
Mean length | 3.9 |
Min length | 1 |
Unique
Unique | 533 ? |
---|---|
Unique (%) | 5.3% |
Sample
1st row | 지느러미엉겅퀴 |
---|---|
2nd row | 함박꽃나무 |
3rd row | 후박나무 |
4th row | 일본매자나무 |
5th row | 미국가막사리 |
Common Values
Value | Count | Frequency (%) |
구상나무 | 245 | 2.5% |
소나무 | 167 | 1.7% |
산철쭉 | 164 | 1.6% |
당단풍나무 | 144 | 1.4% |
철쭉 | 133 | 1.3% |
아왜나무 | 128 | 1.3% |
차나무 | 125 | 1.2% |
멀구슬나무 | 115 | 1.1% |
갈참나무 | 114 | 1.1% |
잣나무 | 110 | 1.1% |
Other values (1399) | 8535 |
Length
Value | Count | Frequency (%) |
구상나무 | 245 | 2.4% |
소나무 | 167 | 1.7% |
산철쭉 | 164 | 1.6% |
당단풍나무 | 144 | 1.4% |
철쭉 | 133 | 1.3% |
아왜나무 | 128 | 1.3% |
차나무 | 125 | 1.2% |
멀구슬나무 | 115 | 1.1% |
갈참나무 | 114 | 1.1% |
잣나무 | 110 | 1.1% |
Other values (1406) | 8582 |
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
1400119 | |
---|---|
1400377 | |
1400573 | 49 |
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1400119 |
---|---|
2nd row | 1400119 |
3rd row | 1400377 |
4th row | 1400119 |
5th row | 1400119 |
Common Values
Value | Count | Frequency (%) |
1400119 | 5440 | |
1400377 | 4511 | |
1400573 | 49 | 0.5% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
1400119 | 5440 | |
1400377 | 4511 | |
1400573 | 49 | 0.5% |
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
국립수목원 | |
---|---|
국립산림과학원 | |
국립산림품종관리센터 | 49 |
Length
Max length | 10 |
---|---|
Median length | 5 |
Mean length | 5.9267 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 국립수목원 |
---|---|
2nd row | 국립수목원 |
3rd row | 국립산림과학원 |
4th row | 국립수목원 |
5th row | 국립수목원 |
Common Values
Value | Count | Frequency (%) |
국립수목원 | 5440 | |
국립산림과학원 | 4511 | |
국립산림품종관리센터 | 49 | 0.5% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
국립수목원 | 5440 | |
국립산림과학원 | 4511 | |
국립산림품종관리센터 | 49 | 0.5% |
Distinct | 848 |
---|---|
Distinct (%) | 10.9% |
Missing | 2186 |
Missing (%) | 21.9% |
Memory size | 78.2 KiB |
http://www.forest.go.kr/images/fgri/2012/image/10000004-xxxxx-01.jpg | 249 |
---|---|
http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg | 198 |
http://www.forest.go.kr/images/fgri/2012/image/10003015-xxxxx-01.jpg | 164 |
http://www.forest.go.kr/images/fgri/2012/image/10002397-41343-01.jpg | 144 |
http://www.forest.go.kr/images/fgri/2012/image/10003017-xxxxx-01.jpg | 133 |
Other values (843) |
Length
Max length | 94 |
---|---|
Median length | 68 |
Mean length | 68.00934221 |
Min length | 57 |
Unique
Unique | 253 ? |
---|---|
Unique (%) | 3.2% |
Sample
1st row | http://www.forest.go.kr/images/fgri/2012/image/10000951-40779-01.jpg |
---|---|
2nd row | http://www.forest.go.kr/images/fgri/2012/image/10013994-29745-02.jpg |
3rd row | http://www.forest.go.kr/images/fgri/2012/image/10012789-28540-01.jpg |
4th row | http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg |
5th row | http://www.forest.go.kr/images/fgri/2012/image/10001073-xxxxx-01.jpg |
Common Values
Value | Count | Frequency (%) |
http://www.forest.go.kr/images/fgri/2012/image/10000004-xxxxx-01.jpg | 249 | 2.5% |
http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg | 198 | 2.0% |
http://www.forest.go.kr/images/fgri/2012/image/10003015-xxxxx-01.jpg | 164 | 1.6% |
http://www.forest.go.kr/images/fgri/2012/image/10002397-41343-01.jpg | 144 | 1.4% |
http://www.forest.go.kr/images/fgri/2012/image/10003017-xxxxx-01.jpg | 133 | 1.3% |
http://www.forest.go.kr/images/fgri/2012/image/10013889-29640-02.jpg | 128 | 1.3% |
http://www.forest.go.kr/images/fgri/2012/image/10002504-xxxxx-01.jpg | 125 | 1.2% |
http://www.forest.go.kr/images/fgri/2012/image/10013810-29561-01.jpg | 115 | 1.1% |
http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 110 | 1.1% |
http://www.forest.go.kr/images/fgri/2012/image/10001815-xxxxx-01.jpg | 98 | 1.0% |
Other values (838) | 6350 | |
(Missing) | 2186 | 21.9% |
Length
Value | Count | Frequency (%) |
http://www.forest.go.kr/images/fgri/2012/image/10000004-xxxxx-01.jpg | 249 | 3.2% |
http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg | 198 | 2.5% |
http://www.forest.go.kr/images/fgri/2012/image/10003015-xxxxx-01.jpg | 164 | 2.1% |
http://www.forest.go.kr/images/fgri/2012/image/10002397-41343-01.jpg | 144 | 1.8% |
http://www.forest.go.kr/images/fgri/2012/image/10003017-xxxxx-01.jpg | 133 | 1.7% |
http://www.forest.go.kr/images/fgri/2012/image/10013889-29640-02.jpg | 128 | 1.6% |
http://www.forest.go.kr/images/fgri/2012/image/10002504-xxxxx-01.jpg | 125 | 1.6% |
http://www.forest.go.kr/images/fgri/2012/image/10013810-29561-01.jpg | 115 | 1.5% |
http://www.forest.go.kr/images/fgri/2012/image/10004710-30684-01.jpg | 110 | 1.4% |
http://www.forest.go.kr/images/fgri/2012/image/10001815-xxxxx-01.jpg | 98 | 1.3% |
Other values (838) | 6350 |
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
불가능 | |
---|---|
가능 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.7474 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 가능 |
---|---|
2nd row | 불가능 |
3rd row | 불가능 |
4th row | 가능 |
5th row | 가능 |
Common Values
Value | Count | Frequency (%) |
불가능 | 7474 | |
가능 | 2526 | 25.3% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
불가능 | 7474 | |
가능 | 2526 | 25.3% |
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
- |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | - |
---|---|
2nd row | - |
3rd row | - |
4th row | - |
5th row | - |
Common Values
Value | Count | Frequency (%) |
- | 10000 |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
10000 |
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 78.2 KiB |
20121210 | |
---|---|
20121209 | |
20121203 | 224 |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20121203 |
---|---|
2nd row | 20121210 |
3rd row | 20121209 |
4th row | 20121210 |
5th row | 20121210 |
Common Values
Value | Count | Frequency (%) |
20121210 | 5265 | |
20121209 | 4511 | |
20121203 | 224 | 2.2% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
20121210 | 5265 | |
20121209 | 4511 | |
20121203 | 224 | 2.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
df_index | RESRCE_NO | LIFE_RESRCE_STLE_CD | LIFE_RESRCE_STLE_CD_NM | LIFE_RESRCE_KND_CD | LIFE_RESRCE_KND_CD_NM | SCNCENM_CD | SCNCENM | TNOAC | INSTT_CD | INSTT_CD_KOREA_NM | DETAIL_INFO_URL | IMAGE_URL | OUTNATN_TKOUT_AT | SPCIES_PRTC_APLC_AT | LIFE_RESRCE_LTTOT_AT | LAST_UPDT_DE | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 786 | 14001191150200-071-00036799 | 5 | 표본 | 7 | 수목류 | BSD0000755943 | Carduus crispus L. | 지느러미엉겅퀴 | 1400119 | 국립수목원 | <NA> | <NA> | 가능 | <NA> | - | 20121203 |
1 | 30704 | 14001191120200-020-00001235 | 2 | 영양체 | 1 | 식량작물 | BSD0000792728 | Magnolia sieboldii k.koch | 함박꽃나무 | 1400119 | 국립수목원 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10000951-40779-01.jpg | 불가능 | <NA> | - | 20121210 |
2 | 7303 | 14003771110103-010-00002249 | 1 | 종자 | 1 | 식량작물 | BSD0003129056 | Machilus thunbergii Siebold & Zucc. | 후박나무 | 1400377 | 국립산림과학원 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10013994-29745-02.jpg | 불가능 | <NA> | - | 20121209 |
3 | 30589 | 14001191120200-020-00001220 | 2 | 영양체 | 1 | 식량작물 | BSD0002676251 | Berberis thunbergii DC. | 일본매자나무 | 1400119 | 국립수목원 | <NA> | <NA> | 가능 | <NA> | - | 20121210 |
4 | 19640 | 14001191110200-010-00003358 | 1 | 종자 | 1 | 식량작물 | BSD0001384123 | Bidens frondosa L. | 미국가막사리 | 1400119 | 국립수목원 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10012789-28540-01.jpg | 가능 | <NA> | - | 20121210 |
5 | 5578 | 14003771110102-010-00005164 | 1 | 종자 | 1 | 식량작물 | BSD0001596891 | Pinus densiflora Siebold & Zucc. | 소나무 | 1400377 | 국립산림과학원 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10000652-xxxxx-01.jpg | 불가능 | <NA> | - | 20121209 |
6 | 9402 | 14003771110103-010-00001813 | 1 | 종자 | 1 | 식량작물 | BSD0004075848 | Actinodaphne lancifolia (Siebold & Zucc.) Meisn. | 육박나무 | 1400377 | 국립산림과학원 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10001073-xxxxx-01.jpg | 가능 | <NA> | - | 20121209 |
7 | 34236 | 14001191120200-020-00007281 | 2 | 영양체 | 1 | 식량작물 | BSD0000262747 | Taxus cuspidata Siebold & Zucc. | 주목 | 1400119 | 국립수목원 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10012841-28592-01.jpg | 불가능 | <NA> | - | 20121210 |
8 | 41874 | 14001191120200-020-00013204 | 2 | 영양체 | 1 | 식량작물 | BSD0000394084 | Chaenomeles speciosa (Sweet) Nakai | 산당화 | 1400119 | 국립수목원 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10001815-xxxxx-01.jpg | 가능 | <NA> | - | 20121210 |
9 | 44288 | 14001191120200-020-00009602 | 2 | 영양체 | 1 | 식량작물 | BSD0000484275 | Rhododendron sp. | 산철쭉속 | 1400119 | 국립수목원 | <NA> | <NA> | 가능 | <NA> | - | 20121210 |
Last rows
df_index | RESRCE_NO | LIFE_RESRCE_STLE_CD | LIFE_RESRCE_STLE_CD_NM | LIFE_RESRCE_KND_CD | LIFE_RESRCE_KND_CD_NM | SCNCENM_CD | SCNCENM | TNOAC | INSTT_CD | INSTT_CD_KOREA_NM | DETAIL_INFO_URL | IMAGE_URL | OUTNATN_TKOUT_AT | SPCIES_PRTC_APLC_AT | LIFE_RESRCE_LTTOT_AT | LAST_UPDT_DE | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
9990 | 34104 | 14001191120200-020-00004746 | 2 | 영양체 | 1 | 식량작물 | BSD0001991875 | Forsythia koreana (Rehder) Nakai | 개나리 | 1400119 | 국립수목원 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10003197-xxxxx-01.jpg | 불가능 | <NA> | - | 20121210 |
9991 | 14415 | 14003771110103-010-00006080 | 1 | 종자 | 1 | 식량작물 | BSD0003437192 | Viburnum dilatatum Thunb. ex Murray | 가막살나무 | 1400377 | 국립산림과학원 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10012796-28547-01.jpg | 불가능 | <NA> | - | 20121209 |
9992 | 46274 | 14001191120200-020-00014451 | 2 | 영양체 | 1 | 식량작물 | BSD0002414104 | Rhododendron japonicum for. flavum (Miyoshi) Nakai | 황철쭉 | 1400119 | 국립수목원 | <NA> | <NA> | 불가능 | <NA> | - | 20121210 |
9993 | 21853 | 14001191110200-010-00006099 | 1 | 종자 | 1 | 식량작물 | BSD0003401159 | Akebia quinata (Thunb.) Decne. | 으름덩굴 | 1400119 | 국립수목원 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10000915-xxxxx-02.jpg | 불가능 | <NA> | - | 20121210 |
9994 | 1545 | 14003771110102-010-00002374 | 1 | 종자 | 1 | 식량작물 | BSD0000047354 | Liriodendron tulipifera L. | 튜울립나무 | 1400377 | 국립산림과학원 | <NA> | <NA> | 불가능 | <NA> | - | 20121209 |
9995 | 47243 | 14001191120200-020-00014671 | 2 | 영양체 | 1 | 식량작물 | BSD0003250789 | Quercus aliena Blume | 갈참나무 | 1400119 | 국립수목원 | <NA> | <NA> | 불가능 | <NA> | - | 20121210 |
9996 | 730 | 14001191150200-071-00031890 | 5 | 표본 | 7 | 수목류 | BSD0003919899 | Gnaphalium uliginosum L. | 왜떡쑥 | 1400119 | 국립수목원 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10004424-xxxxx-01.jpg | 가능 | <NA> | - | 20121203 |
9997 | 24474 | 14001191110200-010-00002796 | 1 | 종자 | 1 | 식량작물 | BSD0004000012 | Patrinia villosa (Thunb.) Juss. | 뚝갈 | 1400119 | 국립수목원 | <NA> | http://www.forest.go.kr/images/fgri/2012/image/10013074-28825-01.jpg | 불가능 | <NA> | - | 20121210 |
9998 | 33909 | 14001191120200-020-00004400 | 2 | 영양체 | 1 | 식량작물 | BSD0002487332 | Eucommia ulmoides Oliv. | 두충 | 1400119 | 국립수목원 | <NA> | <NA> | 불가능 | <NA> | - | 20121210 |
9999 | 29060 | 14001191110200-010-00006675 | 1 | 종자 | 1 | 식량작물 | BSD0000599862 | Thalictrum kemense var. hypoleucum (Siebold & Zucc.) Kitag. | 좀꿩의다리 | 1400119 | 국립수목원 | <NA> | <NA> | 가능 | <NA> | - | 20121210 |