Overview

Dataset statistics

Number of variables17
Number of observations10000
Missing cells25797
Missing cells (%)15.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 MiB
Average record size in memory138.0 B

Variable types

Numeric2
Categorical14
Unsupported1

Dataset

Description동물자원 정보
Author농림수산식품교육문화정보원
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220210000000001809

Alerts

RESRCE_NO has a high cardinality: 10000 distinct values High cardinality
SCNCENM_CD has a high cardinality: 150 distinct values High cardinality
SCNCENM has a high cardinality: 190 distinct values High cardinality
TNOAC has a high cardinality: 124 distinct values High cardinality
DETAIL_INFO_URL has a high cardinality: 9672 distinct values High cardinality
LIFE_RESRCE_LTTOT_AT is highly correlated with INSTT_CD_KOREA_NM and 5 other fieldsHigh correlation
LIFE_RESRCE_STLE_CD_NM is highly correlated with LIFE_RESRCE_STLE_CDHigh correlation
INSTT_CD_KOREA_NM is highly correlated with LIFE_RESRCE_LTTOT_AT and 5 other fieldsHigh correlation
IMAGE_URL is highly correlated with LIFE_RESRCE_LTTOT_AT and 5 other fieldsHigh correlation
OUTNATN_TKOUT_AT is highly correlated with IMAGE_URLHigh correlation
INSTT_CD is highly correlated with LIFE_RESRCE_LTTOT_AT and 5 other fieldsHigh correlation
LIFE_RESRCE_KND_CD_NM is highly correlated with LIFE_RESRCE_LTTOT_AT and 4 other fieldsHigh correlation
LIFE_RESRCE_KND_CD is highly correlated with LIFE_RESRCE_LTTOT_AT and 4 other fieldsHigh correlation
LIFE_RESRCE_STLE_CD is highly correlated with LIFE_RESRCE_LTTOT_AT and 3 other fieldsHigh correlation
LIFE_RESRCE_STLE_CD_NM has 5993 (59.9%) missing values Missing
TNOAC has 366 (3.7%) missing values Missing
IMAGE_URL has 9433 (94.3%) missing values Missing
SPCIES_PRTC_APLC_AT has 10000 (100.0%) missing values Missing
df_index has unique values Unique
RESRCE_NO has unique values Unique
SPCIES_PRTC_APLC_AT is an unsupported type, check if it needs cleaning or further analysis Unsupported

Reproduction

Analysis started2022-08-12 14:47:46.015115
Analysis finished2022-08-12 14:47:50.333637
Duration4.32 seconds
Software versionpandas-profiling v3.2.0
Download configurationconfig.json

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25504.6775
Minimum3
Maximum51283
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size78.2 KiB
2022-08-12T23:47:50.432946image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile2677.95
Q112565.5
median25461.5
Q338324.25
95-th percentile48723.3
Maximum51283
Range51280
Interquartile range (IQR)25758.75

Descriptive statistics

Standard deviation14814.89231
Coefficient of variation (CV)0.5808696195
Kurtosis-1.211374215
Mean25504.6775
Median Absolute Deviation (MAD)12871.5
Skewness0.02068099282
Sum255046775
Variance219481034.3
MonotonicityNot monotonic
2022-08-12T23:47:50.658096image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
259161
 
< 0.1%
460151
 
< 0.1%
62521
 
< 0.1%
200711
 
< 0.1%
472051
 
< 0.1%
403091
 
< 0.1%
324751
 
< 0.1%
141101
 
< 0.1%
331041
 
< 0.1%
349211
 
< 0.1%
Other values (9990)9990
99.9%
ValueCountFrequency (%)
31
< 0.1%
101
< 0.1%
111
< 0.1%
121
< 0.1%
361
< 0.1%
381
< 0.1%
411
< 0.1%
421
< 0.1%
461
< 0.1%
511
< 0.1%
ValueCountFrequency (%)
512831
< 0.1%
512731
< 0.1%
512701
< 0.1%
512671
< 0.1%
512631
< 0.1%
512591
< 0.1%
512481
< 0.1%
512451
< 0.1%
512431
< 0.1%
512401
< 0.1%

RESRCE_NO
Categorical

HIGH CARDINALITY
UNIQUE

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
F00001113Z20422260
 
1
F00001113Z20439774
 
1
F00001113Z20426998
 
1
139090613122000136897
 
1
139090613114000003664
 
1
Other values (9995)
9995 

Length

Max length25
Median length18
Mean length19.7709
Min length17

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowF00001113Z20422260
2nd rowF00001113Z20440158
3rd row139090613122000136897
4th row139090613114000003664
5th row1390906131KOR022000106017

Common Values

ValueCountFrequency (%)
F00001113Z204222601
 
< 0.1%
F00001113Z204397741
 
< 0.1%
F00001113Z204269981
 
< 0.1%
1390906131220001368971
 
< 0.1%
1390906131140000036641
 
< 0.1%
1390906131KOR0220001060171
 
< 0.1%
F00001113Z204211521
 
< 0.1%
F00001113Z204395121
 
< 0.1%
F00001113Z207878121
 
< 0.1%
F00001113Z204366651
 
< 0.1%
Other values (9990)9990
99.9%

Length

2022-08-12T23:47:50.834048image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
f00001113z204222601
 
< 0.1%
f00001113z82976191
 
< 0.1%
f00001113z203978041
 
< 0.1%
1390906131kor0120001627641
 
< 0.1%
f00001113z204376291
 
< 0.1%
f00001113z204033531
 
< 0.1%
f00001113z206411441
 
< 0.1%
f000011131202937051
 
< 0.1%
1390906131220001514231
 
< 0.1%
f00001113z203976281
 
< 0.1%
Other values (9990)9990
99.9%

LIFE_RESRCE_STLE_CD
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
Z
5993 
1
3986 
3
 
21

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowZ
2nd rowZ
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
Z5993
59.9%
13986
39.9%
321
 
0.2%

Length

2022-08-12T23:47:51.007007image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-08-12T23:47:51.150765image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
ValueCountFrequency (%)
z5993
59.9%
13986
39.9%
321
 
0.2%

LIFE_RESRCE_STLE_CD_NM
Categorical

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)< 0.1%
Missing5993
Missing (%)59.9%
Memory size78.2 KiB
개체
3986 
세포주
 
21

Length

Max length3
Median length2
Mean length2.005240829
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개체
2nd row개체
3rd row개체
4th row개체
5th row개체

Common Values

ValueCountFrequency (%)
개체3986
39.9%
세포주21
 
0.2%
(Missing)5993
59.9%

Length

2022-08-12T23:47:51.301005image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-08-12T23:47:51.530206image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
ValueCountFrequency (%)
개체3986
99.5%
세포주21
 
0.5%

LIFE_RESRCE_KND_CD
Categorical

HIGH CORRELATION

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
Z
6172 
3
1536 
2
1496 
1
796 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowZ
2nd rowZ
3rd row3
4th row2
5th row3

Common Values

ValueCountFrequency (%)
Z6172
61.7%
31536
 
15.4%
21496
 
15.0%
1796
 
8.0%

Length

2022-08-12T23:47:51.675220image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-08-12T23:47:51.870239image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
ValueCountFrequency (%)
z6172
61.7%
31536
 
15.4%
21496
 
15.0%
1796
 
8.0%

LIFE_RESRCE_KND_CD_NM
Categorical

HIGH CORRELATION

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
기타
6172 
1536 
돼지
1496 
796 

Length

Max length2
Median length2
Mean length1.7668
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타
2nd row기타
3rd row
4th row돼지
5th row

Common Values

ValueCountFrequency (%)
기타6172
61.7%
1536
 
15.4%
돼지1496
 
15.0%
796
 
8.0%

Length

2022-08-12T23:47:52.056250image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-08-12T23:47:52.254825image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
ValueCountFrequency (%)
기타6172
61.7%
1536
 
15.4%
돼지1496
 
15.0%
796
 
8.0%

SCNCENM_CD
Categorical

HIGH CARDINALITY

Distinct150
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
BSD0001646277
4571 
BSD0002973837
1536 
BSD0002036977
1529 
BSD0001760258
796 
BSD0001408785
502 
Other values (145)
1066 

Length

Max length13
Median length13
Mean length13
Min length13

Unique

Unique76 ?
Unique (%)0.8%

Sample

1st rowBSD0001646277
2nd rowBSD0001646277
3rd rowBSD0002973837
4th rowBSD0002036977
5th rowBSD0002973837

Common Values

ValueCountFrequency (%)
BSD00016462774571
45.7%
BSD00029738371536
 
15.4%
BSD00020369771529
 
15.3%
BSD0001760258796
 
8.0%
BSD0001408785502
 
5.0%
BSD0004271405457
 
4.6%
BSD000197931857
 
0.6%
BSD000038720450
 
0.5%
BSD000076271549
 
0.5%
BSD000043170743
 
0.4%
Other values (140)410
 
4.1%

Length

2022-08-12T23:47:52.401148image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
bsd00016462774571
45.7%
bsd00029738371536
 
15.4%
bsd00020369771529
 
15.3%
bsd0001760258796
 
8.0%
bsd0001408785502
 
5.0%
bsd0004271405457
 
4.6%
bsd000197931857
 
0.6%
bsd000038720450
 
0.5%
bsd000076271549
 
0.5%
bsd000043170743
 
0.4%
Other values (140)410
 
4.1%

SCNCENM
Categorical

HIGH CARDINALITY

Distinct190
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
Mus musculus
4571 
Gallus gallus
1536 
Sus scrofa
1529 
Bos taurus
796 
Carthamus tinctorius
502 
Other values (185)
1066 

Length

Max length30
Median length29
Mean length12.7076
Min length7

Unique

Unique113 ?
Unique (%)1.1%

Sample

1st rowMus musculus
2nd rowMus musculus
3rd rowGallus gallus
4th rowSus scrofa
5th rowGallus gallus

Common Values

ValueCountFrequency (%)
Mus musculus4571
45.7%
Gallus gallus1536
 
15.4%
Sus scrofa1529
 
15.3%
Bos taurus796
 
8.0%
Carthamus tinctorius502
 
5.0%
Pisidium coreanum457
 
4.6%
Macaca fascicularis57
 
0.6%
Danio rerio49
 
0.5%
Plasmodium falciparum43
 
0.4%
Macaca mulatta36
 
0.4%
Other values (180)424
 
4.2%

Length

2022-08-12T23:47:52.590037image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
mus4571
22.9%
musculus4571
22.9%
gallus3072
15.4%
sus1529
 
7.6%
scrofa1529
 
7.6%
bos796
 
4.0%
taurus796
 
4.0%
carthamus502
 
2.5%
tinctorius502
 
2.5%
pisidium457
 
2.3%
Other values (250)1679
 
8.4%

TNOAC
Categorical

HIGH CARDINALITY
MISSING

Distinct124
Distinct (%)1.3%
Missing366
Missing (%)3.7%
Memory size78.2 KiB
생쥐
4571 
Berkshire
1210 
Korean chicken
558 
Korean native
555 
잇꽃
502 
Other values (119)
2238 

Length

Max length16
Median length2
Mean length5.401390907
Min length2

Unique

Unique54 ?
Unique (%)0.6%

Sample

1st row생쥐
2nd row생쥐
3rd rowKorean native
4th rowBerkshire
5th rowKorean chicken

Common Values

ValueCountFrequency (%)
생쥐4571
45.7%
Berkshire1210
 
12.1%
Korean chicken558
 
5.6%
Korean native555
 
5.5%
잇꽃502
 
5.0%
산골조개457
 
4.6%
Hanwoo383
 
3.8%
Korean brown297
 
3.0%
Cornish296
 
3.0%
Korean pig84
 
0.8%
Other values (114)721
 
7.2%
(Missing)366
 
3.7%

Length

2022-08-12T23:47:52.888387image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
생쥐4571
40.5%
korean1592
 
14.1%
berkshire1210
 
10.7%
chicken558
 
4.9%
native555
 
4.9%
잇꽃502
 
4.4%
산골조개457
 
4.0%
hanwoo383
 
3.4%
brown297
 
2.6%
cornish296
 
2.6%
Other values (115)876
 
7.8%

INSTT_CD
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
F000011
6172 
1390906
3828 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowF000011
2nd rowF000011
3rd row1390906
4th row1390906
5th row1390906

Common Values

ValueCountFrequency (%)
F0000116172
61.7%
13909063828
38.3%

Length

2022-08-12T23:47:53.068668image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-08-12T23:47:53.259913image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
ValueCountFrequency (%)
f0000116172
61.7%
13909063828
38.3%

INSTT_CD_KOREA_NM
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
국가생명연구자원정보센터
6172 
국립축산과학원
3828 

Length

Max length12
Median length12
Mean length10.086
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국가생명연구자원정보센터
2nd row국가생명연구자원정보센터
3rd row국립축산과학원
4th row국립축산과학원
5th row국립축산과학원

Common Values

ValueCountFrequency (%)
국가생명연구자원정보센터6172
61.7%
국립축산과학원3828
38.3%

Length

2022-08-12T23:47:53.365993image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-08-12T23:47:53.507816image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
ValueCountFrequency (%)
국가생명연구자원정보센터6172
61.7%
국립축산과학원3828
38.3%

DETAIL_INFO_URL
Categorical

HIGH CARDINALITY

Distinct9672
Distinct (%)96.8%
Missing5
Missing (%)< 0.1%
Memory size78.2 KiB
http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR022000138401
 
2
http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=AGR014000001157
 
2
http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR022000075252
 
2
http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR022000151232
 
2
http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=AGR014000001506
 
2
Other values (9667)
9985 

Length

Max length95
Median length74
Mean length81.12556278
Min length74

Unique

Unique9349 ?
Unique (%)93.5%

Sample

1st rowhttp://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180416386
2nd rowhttp://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180434284
3rd rowhttp://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR022000136897
4th rowhttp://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=AGR014000003664
5th rowhttp://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR022000106017

Common Values

ValueCountFrequency (%)
http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR0220001384012
 
< 0.1%
http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=AGR0140000011572
 
< 0.1%
http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR0220000752522
 
< 0.1%
http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR0220001512322
 
< 0.1%
http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=AGR0140000015062
 
< 0.1%
http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR0220001507032
 
< 0.1%
http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR0220001130392
 
< 0.1%
http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=KOR0120000386392
 
< 0.1%
http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR0220001514402
 
< 0.1%
http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=KOR0120002331572
 
< 0.1%
Other values (9662)9975
99.8%
(Missing)5
 
0.1%

Length

2022-08-12T23:47:53.671333image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=kor0220001384012
 
< 0.1%
http://angr.nias.go.kr/agrims/species/cow/individualdetail.do?individual_id=kor0001906172552
 
< 0.1%
http://angr.nias.go.kr/agrims/species/cow/individualdetail.do?individual_id=kor0002214556712
 
< 0.1%
http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=kor0120002359372
 
< 0.1%
http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=kor0120000373322
 
< 0.1%
http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=agr0140000010572
 
< 0.1%
http://angr.nias.go.kr/agrims/species/cow/individualdetail.do?individual_id=kor0001853238642
 
< 0.1%
http://angr.nias.go.kr/agrims/species/cow/individualdetail.do?individual_id=kor0001526078452
 
< 0.1%
http://angr.nias.go.kr/agrims/species/cow/individualdetail.do?individual_id=agr0040000003702
 
< 0.1%
http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=kor0220001563152
 
< 0.1%
Other values (9662)9975
99.8%

IMAGE_URL
Categorical

HIGH CORRELATION
MISSING

Distinct15
Distinct (%)2.6%
Missing9433
Missing (%)94.3%
Memory size78.2 KiB
http://genebank.rda.go.kr:8080/attachfile/gp/image_info/watermark/01186/20151104171547_086.jpg
502 
http://www.naris.go.kr/specIMG/2/7/31/28631/GNHM-MM-0000067-05.JPG
 
33
http://www.bris.go.kr/life/images/2013/01/25/GD13004205_1.jpg
 
10
http://usgs.wildlifeinformation.org/S/0MRodenti/muridae/rattus/rattus_norvegicus/Rattus_norvegicus_DT.jpg
 
7
http://www.naris.go.kr/specIMG/4/8/77/1290877/JNHM-IN-0001689_4.jpg
 
2
Other values (10)
 
13

Length

Max length131
Median length94
Mean length91.27513228
Min length56

Unique

Unique7 ?
Unique (%)1.2%

Sample

1st rowhttp://genebank.rda.go.kr:8080/attachfile/gp/image_info/watermark/01186/20151104171547_086.jpg
2nd rowhttp://genebank.rda.go.kr:8080/attachfile/gp/image_info/watermark/01186/20151104171547_086.jpg
3rd rowhttp://genebank.rda.go.kr:8080/attachfile/gp/image_info/watermark/01186/20151104171547_086.jpg
4th rowhttp://www.naris.go.kr/specIMG/2/7/31/28631/GNHM-MM-0000067-05.JPG
5th rowhttp://genebank.rda.go.kr:8080/attachfile/gp/image_info/watermark/01186/20151104171547_086.jpg

Common Values

ValueCountFrequency (%)
http://genebank.rda.go.kr:8080/attachfile/gp/image_info/watermark/01186/20151104171547_086.jpg502
 
5.0%
http://www.naris.go.kr/specIMG/2/7/31/28631/GNHM-MM-0000067-05.JPG33
 
0.3%
http://www.bris.go.kr/life/images/2013/01/25/GD13004205_1.jpg10
 
0.1%
http://usgs.wildlifeinformation.org/S/0MRodenti/muridae/rattus/rattus_norvegicus/Rattus_norvegicus_DT.jpg7
 
0.1%
http://www.naris.go.kr/specIMG/4/8/77/1290877/JNHM-IN-0001689_4.jpg2
 
< 0.1%
http://www.bris.go.kr/life/images/2013/01/25/GD13004208_1.jpg2
 
< 0.1%
http://www.bris.go.kr/life/images/2013/01/25/GD13004207_1.jpg2
 
< 0.1%
http://rexee-12.vo.llnwd.net/d1/video_image_1/2334/151964332_1896.jpg2
 
< 0.1%
http://www.naris.go.kr/specIMG/6/8/69/1415769/NHMC-IN-0000318-02.JPG1
 
< 0.1%
http://www.radioactiverobins.com/wagtails-eastrn-othr%20white-/motaccilla%20alba%20baicalensis-maasvlakte-08-05-2001%20b20cmweb.jpg1
 
< 0.1%
Other values (5)5
 
0.1%
(Missing)9433
94.3%

Length

2022-08-12T23:47:53.839296image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
http://genebank.rda.go.kr:8080/attachfile/gp/image_info/watermark/01186/20151104171547_086.jpg502
88.4%
http://www.naris.go.kr/specimg/2/7/31/28631/gnhm-mm-0000067-05.jpg33
 
5.8%
http://www.bris.go.kr/life/images/2013/01/25/gd13004205_1.jpg10
 
1.8%
http://usgs.wildlifeinformation.org/s/0mrodenti/muridae/rattus/rattus_norvegicus/rattus_norvegicus_dt.jpg7
 
1.2%
http://www.naris.go.kr/specimg/4/8/77/1290877/jnhm-in-0001689_4.jpg2
 
0.4%
http://www.bris.go.kr/life/images/2013/01/25/gd13004208_1.jpg2
 
0.4%
http://www.bris.go.kr/life/images/2013/01/25/gd13004207_1.jpg2
 
0.4%
http://rexee-12.vo.llnwd.net/d1/video_image_1/2334/151964332_1896.jpg2
 
0.4%
http://www.naris.go.kr/specimg/6/8/69/1415769/nhmc-in-0000318-02.jpg1
 
0.2%
http://www.radioactiverobins.com/wagtails-eastrn-othr%20white-/motaccilla%20alba%20baicalensis-maasvlakte-08-05-2001%20b20cmweb.jpg1
 
0.2%
Other values (6)6
 
1.1%

OUTNATN_TKOUT_AT
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
가능
5637 
불가능
4363 

Length

Max length3
Median length2
Mean length2.4363
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가능
2nd row가능
3rd row불가능
4th row불가능
5th row불가능

Common Values

ValueCountFrequency (%)
가능5637
56.4%
불가능4363
43.6%

Length

2022-08-12T23:47:54.051747image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-08-12T23:47:54.213223image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
ValueCountFrequency (%)
가능5637
56.4%
불가능4363
43.6%

SPCIES_PRTC_APLC_AT
Unsupported

MISSING
REJECTED
UNSUPPORTED

Missing10000
Missing (%)100.0%
Memory size88.0 KiB

LIFE_RESRCE_LTTOT_AT
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size78.2 KiB
-
6172 
불가능
3828 

Length

Max length3
Median length1
Mean length1.7656
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row불가능
4th row불가능
5th row불가능

Common Values

ValueCountFrequency (%)
-6172
61.7%
불가능3828
38.3%

Length

2022-08-12T23:47:54.398074image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram of lengths of the category

Category Frequency Plot

2022-08-12T23:47:54.635754image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
ValueCountFrequency (%)
6172
61.7%
불가능3828
38.3%

LAST_UPDT_DE
Real number (ℝ≥0)

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20175870.36
Minimum20140811
Maximum20201201
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size88.0 KiB
2022-08-12T23:47:54.795778image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum20140811
5-th percentile20140811
Q120171020
median20181226
Q320181226
95-th percentile20201122
Maximum20201201
Range60390
Interquartile range (IQR)10206

Descriptive statistics

Standard deviation16917.95703
Coefficient of variation (CV)0.0008385242732
Kurtosis0.4823906332
Mean20175870.36
Median Absolute Deviation (MAD)0
Skewness-1.024032208
Sum2.017587036 × 1011
Variance286217270.1
MonotonicityNot monotonic
2022-08-12T23:47:54.994821image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
201812266172
61.7%
201408111584
 
15.8%
202011221111
 
11.1%
201710201088
 
10.9%
2017101529
 
0.3%
201610118
 
0.1%
201610153
 
< 0.1%
201710143
 
< 0.1%
201710131
 
< 0.1%
202012011
 
< 0.1%
ValueCountFrequency (%)
201408111584
 
15.8%
201610118
 
0.1%
201610153
 
< 0.1%
201710131
 
< 0.1%
201710143
 
< 0.1%
2017101529
 
0.3%
201710201088
 
10.9%
201812266172
61.7%
202011221111
 
11.1%
202012011
 
< 0.1%
ValueCountFrequency (%)
202012011
 
< 0.1%
202011221111
 
11.1%
201812266172
61.7%
201710201088
 
10.9%
2017101529
 
0.3%
201710143
 
< 0.1%
201710131
 
< 0.1%
201610153
 
< 0.1%
201610118
 
0.1%
201408111584
 
15.8%

Interactions

2022-08-12T23:47:48.416703image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2022-08-12T23:47:47.963942image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2022-08-12T23:47:48.636061image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2022-08-12T23:47:48.231790image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Correlations

2022-08-12T23:47:55.155357image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2022-08-12T23:47:55.406508image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2022-08-12T23:47:55.810841image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2022-08-12T23:47:56.070738image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.
2022-08-12T23:47:56.411460image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

2022-08-12T23:47:49.052925image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
A simple visualization of nullity by column.
2022-08-12T23:47:49.516245image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2022-08-12T23:47:49.821331image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
2022-08-12T23:47:50.138358image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.

Sample

First rows

df_indexRESRCE_NOLIFE_RESRCE_STLE_CDLIFE_RESRCE_STLE_CD_NMLIFE_RESRCE_KND_CDLIFE_RESRCE_KND_CD_NMSCNCENM_CDSCNCENMTNOACINSTT_CDINSTT_CD_KOREA_NMDETAIL_INFO_URLIMAGE_URLOUTNATN_TKOUT_ATSPCIES_PRTC_APLC_ATLIFE_RESRCE_LTTOT_ATLAST_UPDT_DE
025916F00001113Z20422260Z<NA>Z기타BSD0001646277Mus musculus생쥐F000011국가생명연구자원정보센터http://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180416386<NA>가능<NA>-20181226
141050F00001113Z20440158Z<NA>Z기타BSD0001646277Mus musculus생쥐F000011국가생명연구자원정보센터http://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180434284<NA>가능<NA>-20181226
2144921390906131220001368971개체3BSD0002973837Gallus gallusKorean native1390906국립축산과학원http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR022000136897<NA>불가능<NA>불가능20201122
3198861390906131140000036641개체2돼지BSD0002036977Sus scrofaBerkshire1390906국립축산과학원http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=AGR014000003664<NA>불가능<NA>불가능20171015
499791390906131KOR0220001060171개체3BSD0002973837Gallus gallusKorean chicken1390906국립축산과학원http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR022000106017<NA>불가능<NA>불가능20140811
526271F00001113Z20421152Z<NA>Z기타BSD0001646277Mus musculus생쥐F000011국가생명연구자원정보센터http://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180415278<NA>가능<NA>-20181226
640868F00001113Z20439512Z<NA>Z기타BSD0001646277Mus musculus생쥐F000011국가생명연구자원정보센터http://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180433638<NA>가능<NA>-20181226
748996F00001113Z20787812Z<NA>Z기타BSD0004271405Pisidium coreanum산골조개F000011국가생명연구자원정보센터http://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180877943<NA>가능<NA>-20181226
820012F00001113Z20439774Z<NA>Z기타BSD0001646277Mus musculus생쥐F000011국가생명연구자원정보센터http://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180433900<NA>가능<NA>-20181226
934037F00001113Z20436665Z<NA>Z기타BSD0001646277Mus musculus생쥐F000011국가생명연구자원정보센터http://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180430791<NA>가능<NA>-20181226

Last rows

df_indexRESRCE_NOLIFE_RESRCE_STLE_CDLIFE_RESRCE_STLE_CD_NMLIFE_RESRCE_KND_CDLIFE_RESRCE_KND_CD_NMSCNCENM_CDSCNCENMTNOACINSTT_CDINSTT_CD_KOREA_NMDETAIL_INFO_URLIMAGE_URLOUTNATN_TKOUT_ATSPCIES_PRTC_APLC_ATLIFE_RESRCE_LTTOT_ATLAST_UPDT_DE
9990174821390906131120002376231개체2돼지BSD0002036977Sus scrofaBerkshire1390906국립축산과학원http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=KOR012000237623<NA>불가능<NA>불가능20171020
9991171541390906131001952893891개체1BSD0001760258Bos taurusHanwoo1390906국립축산과학원http://angr.nias.go.kr/agrims/species/cow/individualdetail.do?individual_id=KOR000195289389<NA>불가능<NA>불가능20201122
9992170451390906131001952825951개체1BSD0001760258Bos taurusHanwoo1390906국립축산과학원http://angr.nias.go.kr/agrims/species/cow/individualdetail.do?individual_id=KOR000195282595<NA>불가능<NA>불가능20171020
9993104111390906131140000046631개체2돼지BSD0002036977Sus scrofaBerkshire1390906국립축산과학원http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=AGR014000004663<NA>불가능<NA>불가능20201122
9994127411390906131140000024171개체2돼지BSD0002036977Sus scrofaBerkshire1390906국립축산과학원http://angr.nias.go.kr/agrims/species/pig/individualdetail.do?individual_id=AGR014000002417<NA>불가능<NA>불가능20171020
9995169991390906131220001054011개체3BSD0002973837Gallus gallusKorean native1390906국립축산과학원http://angr.nias.go.kr/agrims/species/chicken/individualdetail.do?individual_id=KOR022000105401<NA>불가능<NA>불가능20201122
999648315F00001113Z20433764Z<NA>Z기타BSD0001646277Mus musculus생쥐F000011국가생명연구자원정보센터http://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180427890<NA>가능<NA>-20181226
999732118F00001113Z20425708Z<NA>Z기타BSD0001646277Mus musculus생쥐F000011국가생명연구자원정보센터http://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180419834<NA>가능<NA>-20181226
999829392F00001113Z20443823Z<NA>Z기타BSD0001646277Mus musculus생쥐F000011국가생명연구자원정보센터http://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180437949<NA>가능<NA>-20181226
999930243F00001113Z20429122Z<NA>Z기타BSD0001646277Mus musculus생쥐F000011국가생명연구자원정보센터http://210.218.221.118/app/resources/resClsComm.do?resorceDistNo=180423248<NA>가능<NA>-20181226