Overview

Dataset statistics

Number of variables17
Number of observations3991
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory534.1 KiB
Average record size in memory137.0 B

Variable types

Categorical10
Text6
DateTime1

Dataset

Description제주 민속자연사박물관에서 소장하고 있는 자연사자료(해양생물)의 분류, 속명, 종명, 수량, 수집방법, 현위치 등 정보 제공
Author제주특별자치도
URLhttps://www.data.go.kr/data/15045470/fileData.do

Alerts

기관코드 has constant value ""Constant
대분류 has constant value ""Constant
수량 has constant value ""Constant
현 상태 has constant value ""Constant
데이터기준일자 has constant value ""Constant
중분류 is highly overall correlated with 소분류 and 2 other fieldsHigh correlation
소분류 is highly overall correlated with 중분류 and 2 other fieldsHigh correlation
사진분류 is highly overall correlated with 중분류 and 2 other fieldsHigh correlation
상태 is highly overall correlated with 중분류 and 2 other fieldsHigh correlation
이명 is highly imbalanced (90.0%)Imbalance
자료수집번호(통합) has unique valuesUnique
자료세부분류번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 11:42:49.196610
Analysis finished2023-12-12 11:42:51.819835
Duration2.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
JFNM
3991 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowJFNM
2nd rowJFNM
3rd rowJFNM
4th rowJFNM
5th rowJFNM

Common Values

ValueCountFrequency (%)
JFNM 3991
100.0%

Length

2023-12-12T20:42:51.950264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:42:52.146673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
jfnm 3991
100.0%

대분류
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
동물
3991 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동물
2nd row동물
3rd row동물
4th row동물
5th row동물

Common Values

ValueCountFrequency (%)
동물 3991
100.0%

Length

2023-12-12T20:42:52.351129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:42:52.539301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
동물 3991
100.0%

중분류
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
무척추동물
2582 
어류
1409 

Length

Max length5
Median length5
Mean length3.940867
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row어류
2nd row어류
3rd row어류
4th row어류
5th row어류

Common Values

ValueCountFrequency (%)
무척추동물 2582
64.7%
어류 1409
35.3%

Length

2023-12-12T20:42:52.772429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:42:52.983628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
무척추동물 2582
64.7%
어류 1409
35.3%

소분류
Categorical

HIGH CORRELATION 

Distinct46
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
패류
1449 
농어목
740 
갑각류
585 
강장동물
253 
극피동물
200 
Other values (41)
764 

Length

Max length6
Median length5
Mean length2.89777
Min length2

Unique

Unique8 ?
Unique (%)0.2%

Sample

1st row홍어목
2nd row농어목
3rd row농어목
4th row쏨뱅이목
5th row농어목

Common Values

ValueCountFrequency (%)
패류 1449
36.3%
농어목 740
18.5%
갑각류 585
14.7%
강장동물 253
 
6.3%
극피동물 200
 
5.0%
쏨뱅이목 174
 
4.4%
복어목 128
 
3.2%
해면동물 46
 
1.2%
홍어목 42
 
1.1%
해당없음 41
 
1.0%
Other values (36) 333
 
8.3%

Length

2023-12-12T20:42:53.226687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
패류 1449
36.3%
농어목 740
18.5%
갑각류 585
14.7%
강장동물 253
 
6.3%
극피동물 200
 
5.0%
쏨뱅이목 175
 
4.4%
복어목 128
 
3.2%
해면동물 46
 
1.2%
홍어목 44
 
1.1%
해당없음 41
 
1.0%
Other values (32) 330
 
8.3%
Distinct160
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
2023-12-12T20:42:53.757367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length3
Mean length3.7607116
Min length2

Characters and Unicode

Total characters15009
Distinct characters182
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)0.6%

Sample

1st row수구리과
2nd row도미과
3rd row바리과
4th row양볼락과
5th row바리과
ValueCountFrequency (%)
십각목 584
 
14.6%
원시복족목 442
 
11.1%
흡강목 334
 
8.4%
해당없음 333
 
8.3%
산호류 232
 
5.8%
백합목 177
 
4.4%
양볼락과 105
 
2.6%
놀래기과 83
 
2.1%
자리돔과 78
 
2.0%
참복과 64
 
1.6%
Other values (148) 1559
39.1%
2023-12-12T20:42:54.544513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1829
 
12.2%
1525
 
10.2%
597
 
4.0%
588
 
3.9%
586
 
3.9%
519
 
3.5%
468
 
3.1%
442
 
2.9%
396
 
2.6%
354
 
2.4%
Other values (172) 7705
51.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14976
99.8%
Space Separator 33
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1829
 
12.2%
1525
 
10.2%
597
 
4.0%
588
 
3.9%
586
 
3.9%
519
 
3.5%
468
 
3.1%
442
 
3.0%
396
 
2.6%
354
 
2.4%
Other values (171) 7672
51.2%
Space Separator
ValueCountFrequency (%)
33
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14976
99.8%
Common 33
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1829
 
12.2%
1525
 
10.2%
597
 
4.0%
588
 
3.9%
586
 
3.9%
519
 
3.5%
468
 
3.1%
442
 
3.0%
396
 
2.6%
354
 
2.4%
Other values (171) 7672
51.2%
Common
ValueCountFrequency (%)
33
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14976
99.8%
ASCII 33
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1829
 
12.2%
1525
 
10.2%
597
 
4.0%
588
 
3.9%
586
 
3.9%
519
 
3.5%
468
 
3.1%
442
 
3.0%
396
 
2.6%
354
 
2.4%
Other values (171) 7672
51.2%
ASCII
ValueCountFrequency (%)
33
100.0%

속명
Text

Distinct478
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
2023-12-12T20:42:55.030517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length16
Mean length8.599098
Min length3

Characters and Unicode

Total characters34319
Distinct characters56
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique117 ?
Unique (%)2.9%

Sample

1st rowRhina
2nd rowPagrus
3rd rowEpinephelus
4th rowInimicus
5th rowEpinephelus
ValueCountFrequency (%)
해당없음 492
 
12.3%
turbo 74
 
1.8%
chromis 60
 
1.5%
nodilittorina 58
 
1.4%
pomaulax 55
 
1.4%
cyclina 50
 
1.2%
sebastes 50
 
1.2%
sulculus 47
 
1.2%
gomphina 42
 
1.0%
takifugu 40
 
1.0%
Other values (452) 3034
75.8%
2023-12-12T20:42:55.772525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 3596
 
10.5%
o 2579
 
7.5%
i 2446
 
7.1%
s 2358
 
6.9%
e 2085
 
6.1%
r 2055
 
6.0%
u 1877
 
5.5%
l 1802
 
5.3%
t 1762
 
5.1%
n 1616
 
4.7%
Other values (46) 12143
35.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 28540
83.2%
Uppercase Letter 3508
 
10.2%
Other Letter 1968
 
5.7%
Space Separator 285
 
0.8%
Open Punctuation 9
 
< 0.1%
Close Punctuation 9
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 3596
12.6%
o 2579
9.0%
i 2446
 
8.6%
s 2358
 
8.3%
e 2085
 
7.3%
r 2055
 
7.2%
u 1877
 
6.6%
l 1802
 
6.3%
t 1762
 
6.2%
n 1616
 
5.7%
Other values (16) 6364
22.3%
Uppercase Letter
ValueCountFrequency (%)
C 608
17.3%
S 405
11.5%
P 398
11.3%
T 284
8.1%
H 268
7.6%
M 241
 
6.9%
A 231
 
6.6%
N 200
 
5.7%
O 178
 
5.1%
L 129
 
3.7%
Other values (13) 566
16.1%
Other Letter
ValueCountFrequency (%)
492
25.0%
492
25.0%
492
25.0%
492
25.0%
Space Separator
ValueCountFrequency (%)
285
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 32048
93.4%
Hangul 1968
 
5.7%
Common 303
 
0.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 3596
 
11.2%
o 2579
 
8.0%
i 2446
 
7.6%
s 2358
 
7.4%
e 2085
 
6.5%
r 2055
 
6.4%
u 1877
 
5.9%
l 1802
 
5.6%
t 1762
 
5.5%
n 1616
 
5.0%
Other values (39) 9872
30.8%
Hangul
ValueCountFrequency (%)
492
25.0%
492
25.0%
492
25.0%
492
25.0%
Common
ValueCountFrequency (%)
285
94.1%
( 9
 
3.0%
) 9
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 32351
94.3%
Hangul 1968
 
5.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 3596
 
11.1%
o 2579
 
8.0%
i 2446
 
7.6%
s 2358
 
7.3%
e 2085
 
6.4%
r 2055
 
6.4%
u 1877
 
5.8%
l 1802
 
5.6%
t 1762
 
5.4%
n 1616
 
5.0%
Other values (42) 10175
31.5%
Hangul
ValueCountFrequency (%)
492
25.0%
492
25.0%
492
25.0%
492
25.0%

종명
Text

Distinct508
Distinct (%)12.7%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
2023-12-12T20:42:56.367446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length17
Mean length8.4349787
Min length3

Characters and Unicode

Total characters33664
Distinct characters34
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique144 ?
Unique (%)3.6%

Sample

1st rowancylostoma
2nd rowmajor
3rd rowfario
4th rowjaponicus
5th rowseptemfasciatus
ValueCountFrequency (%)
해당없음 492
 
12.1%
japonicus 223
 
5.5%
japonica 94
 
2.3%
sinensis 63
 
1.6%
notatus 59
 
1.5%
radiata 58
 
1.4%
cornutus 53
 
1.3%
diversicolor 50
 
1.2%
discus 46
 
1.1%
supertexta 44
 
1.1%
Other values (487) 2877
70.9%
2023-12-12T20:42:57.065295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 3573
 
10.6%
i 3516
 
10.4%
s 3132
 
9.3%
u 2330
 
6.9%
r 2198
 
6.5%
n 2184
 
6.5%
e 2139
 
6.4%
t 1951
 
5.8%
o 1917
 
5.7%
c 1608
 
4.8%
Other values (24) 9116
27.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 31101
92.4%
Other Letter 1968
 
5.8%
Space Separator 589
 
1.7%
Uppercase Letter 6
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 3573
11.5%
i 3516
11.3%
s 3132
10.1%
u 2330
 
7.5%
r 2198
 
7.1%
n 2184
 
7.0%
e 2139
 
6.9%
t 1951
 
6.3%
o 1917
 
6.2%
c 1608
 
5.2%
Other values (16) 6553
21.1%
Other Letter
ValueCountFrequency (%)
492
25.0%
492
25.0%
492
25.0%
492
25.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
33.3%
G 2
33.3%
O 2
33.3%
Space Separator
ValueCountFrequency (%)
589
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 31107
92.4%
Hangul 1968
 
5.8%
Common 589
 
1.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 3573
11.5%
i 3516
11.3%
s 3132
10.1%
u 2330
 
7.5%
r 2198
 
7.1%
n 2184
 
7.0%
e 2139
 
6.9%
t 1951
 
6.3%
o 1917
 
6.2%
c 1608
 
5.2%
Other values (19) 6559
21.1%
Hangul
ValueCountFrequency (%)
492
25.0%
492
25.0%
492
25.0%
492
25.0%
Common
ValueCountFrequency (%)
589
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 31696
94.2%
Hangul 1968
 
5.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 3573
11.3%
i 3516
11.1%
s 3132
9.9%
u 2330
 
7.4%
r 2198
 
6.9%
n 2184
 
6.9%
e 2139
 
6.7%
t 1951
 
6.2%
o 1917
 
6.0%
c 1608
 
5.1%
Other values (20) 7148
22.6%
Hangul
ValueCountFrequency (%)
492
25.0%
492
25.0%
492
25.0%
492
25.0%

명칭
Text

Distinct844
Distinct (%)21.1%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
2023-12-12T20:42:57.656764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length21
Mean length4.1743924
Min length1

Characters and Unicode

Total characters16660
Distinct characters461
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique315 ?
Unique (%)7.9%

Sample

1st row목탁수구리
2nd row참돔
3rd row별우럭
4th row쑤기미
5th row능성어
ValueCountFrequency (%)
자리돔 59
 
1.4%
납작소라 54
 
1.3%
소라 52
 
1.3%
가무락조개 50
 
1.2%
오분자기 46
 
1.1%
참돔 37
 
0.9%
전복 37
 
0.9%
보라성게 36
 
0.9%
개조개 33
 
0.8%
부채새우 31
 
0.8%
Other values (859) 3668
89.4%
2023-12-12T20:42:58.578587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
632
 
3.8%
566
 
3.4%
454
 
2.7%
449
 
2.7%
393
 
2.4%
336
 
2.0%
306
 
1.8%
300
 
1.8%
266
 
1.6%
254
 
1.5%
Other values (451) 12704
76.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15830
95.0%
Lowercase Letter 654
 
3.9%
Space Separator 132
 
0.8%
Uppercase Letter 40
 
0.2%
Math Symbol 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
632
 
4.0%
566
 
3.6%
454
 
2.9%
449
 
2.8%
393
 
2.5%
336
 
2.1%
306
 
1.9%
300
 
1.9%
266
 
1.7%
254
 
1.6%
Other values (414) 11874
75.0%
Lowercase Letter
ValueCountFrequency (%)
a 92
14.1%
s 65
9.9%
o 62
 
9.5%
i 56
 
8.6%
l 44
 
6.7%
r 44
 
6.7%
e 39
 
6.0%
u 35
 
5.4%
h 29
 
4.4%
m 27
 
4.1%
Other values (11) 161
24.6%
Uppercase Letter
ValueCountFrequency (%)
R 16
40.0%
M 6
 
15.0%
P 4
 
10.0%
A 3
 
7.5%
E 3
 
7.5%
C 2
 
5.0%
N 2
 
5.0%
T 1
 
2.5%
H 1
 
2.5%
Q 1
 
2.5%
Space Separator
ValueCountFrequency (%)
132
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15830
95.0%
Latin 694
 
4.2%
Common 136
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
632
 
4.0%
566
 
3.6%
454
 
2.9%
449
 
2.8%
393
 
2.5%
336
 
2.1%
306
 
1.9%
300
 
1.9%
266
 
1.7%
254
 
1.6%
Other values (414) 11874
75.0%
Latin
ValueCountFrequency (%)
a 92
13.3%
s 65
 
9.4%
o 62
 
8.9%
i 56
 
8.1%
l 44
 
6.3%
r 44
 
6.3%
e 39
 
5.6%
u 35
 
5.0%
h 29
 
4.2%
m 27
 
3.9%
Other values (22) 201
29.0%
Common
ValueCountFrequency (%)
132
97.1%
+ 1
 
0.7%
) 1
 
0.7%
( 1
 
0.7%
' 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15830
95.0%
ASCII 830
 
5.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
632
 
4.0%
566
 
3.6%
454
 
2.9%
449
 
2.8%
393
 
2.5%
336
 
2.1%
306
 
1.9%
300
 
1.9%
266
 
1.7%
254
 
1.6%
Other values (414) 11874
75.0%
ASCII
ValueCountFrequency (%)
132
15.9%
a 92
 
11.1%
s 65
 
7.8%
o 62
 
7.5%
i 56
 
6.7%
l 44
 
5.3%
r 44
 
5.3%
e 39
 
4.7%
u 35
 
4.2%
h 29
 
3.5%
Other values (27) 232
28.0%

이명
Categorical

IMBALANCE 

Distinct38
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
해당없음
3785 
오분자기의 성장과정
 
21
전복(성장과정 0.3~6년)
 
20
좁살무늬총알고둥
 
18
좁깔무늬총알고둥
 
15
Other values (33)
 
132

Length

Max length19
Median length4
Mean length4.1666249
Min length2

Unique

Unique13 ?
Unique (%)0.3%

Sample

1st row해당없음
2nd row해당없음
3rd row해당없음
4th row해당없음
5th row구문쟁이

Common Values

ValueCountFrequency (%)
해당없음 3785
94.8%
오분자기의 성장과정 21
 
0.5%
전복(성장과정 0.3~6년) 20
 
0.5%
좁살무늬총알고둥 18
 
0.5%
좁깔무늬총알고둥 15
 
0.4%
백산호 14
 
0.4%
성장과정 12
 
0.3%
성장과정 1~6년 12
 
0.3%
산호붙이 히드라 10
 
0.3%
전복의성장과정 10
 
0.3%
Other values (28) 74
 
1.9%

Length

2023-12-12T20:42:58.836961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
해당없음 3785
93.3%
성장과정 45
 
1.1%
오분자기의 21
 
0.5%
전복(성장과정 20
 
0.5%
0.3~6년 20
 
0.5%
좁살무늬총알고둥 18
 
0.4%
좁깔무늬총알고둥 15
 
0.4%
백산호 14
 
0.3%
1~6년 12
 
0.3%
전복의성장과정 10
 
0.2%
Other values (31) 95
 
2.3%
Distinct3991
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
2023-12-12T20:42:59.875097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.6469557
Min length9

Characters and Unicode

Total characters38501
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3991 ?
Unique (%)100.0%

Sample

1st rowJFNM-1404
2nd rowJFNM-1405
3rd rowJFNM-1406
4th rowJFNM-1407
5th rowJFNM-1408
ValueCountFrequency (%)
jfnm-1404 1
 
< 0.1%
jfnm-15648 1
 
< 0.1%
jfnm-15663 1
 
< 0.1%
jfnm-15636 1
 
< 0.1%
jfnm-15637 1
 
< 0.1%
jfnm-15638 1
 
< 0.1%
jfnm-15639 1
 
< 0.1%
jfnm-15640 1
 
< 0.1%
jfnm-15641 1
 
< 0.1%
jfnm-15642 1
 
< 0.1%
Other values (3981) 3981
99.7%
2023-12-12T20:43:00.671849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 4280
11.1%
J 3991
10.4%
F 3991
10.4%
N 3991
10.4%
M 3991
10.4%
- 3991
10.4%
5 2299
 
6.0%
6 2273
 
5.9%
2 1913
 
5.0%
4 1903
 
4.9%
Other values (5) 5878
15.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18546
48.2%
Uppercase Letter 15964
41.5%
Dash Punctuation 3991
 
10.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 4280
23.1%
5 2299
12.4%
6 2273
12.3%
2 1913
10.3%
4 1903
10.3%
7 1293
 
7.0%
8 1202
 
6.5%
9 1171
 
6.3%
3 1107
 
6.0%
0 1105
 
6.0%
Uppercase Letter
ValueCountFrequency (%)
J 3991
25.0%
F 3991
25.0%
N 3991
25.0%
M 3991
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 3991
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 22537
58.5%
Latin 15964
41.5%

Most frequent character per script

Common
ValueCountFrequency (%)
1 4280
19.0%
- 3991
17.7%
5 2299
10.2%
6 2273
10.1%
2 1913
8.5%
4 1903
8.4%
7 1293
 
5.7%
8 1202
 
5.3%
9 1171
 
5.2%
3 1107
 
4.9%
Latin
ValueCountFrequency (%)
J 3991
25.0%
F 3991
25.0%
N 3991
25.0%
M 3991
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 38501
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 4280
11.1%
J 3991
10.4%
F 3991
10.4%
N 3991
10.4%
M 3991
10.4%
- 3991
10.4%
5 2299
 
6.0%
6 2273
 
5.9%
2 1913
 
5.0%
4 1903
 
4.9%
Other values (5) 5878
15.3%
Distinct3991
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
2023-12-12T20:43:01.149336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length11.445252
Min length9

Characters and Unicode

Total characters45678
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3991 ?
Unique (%)100.0%

Sample

1st rowJFNM-FI-1
2nd rowJFNM-FI-2
3rd rowJFNM-FI-3
4th rowJFNM-FI-4
5th rowJFNM-FI-5
ValueCountFrequency (%)
jfnm-fi-1 1
 
< 0.1%
jfnm-iv-1257 1
 
< 0.1%
jfnm-iv-1272 1
 
< 0.1%
jfnm-iv-1245 1
 
< 0.1%
jfnm-iv-1246 1
 
< 0.1%
jfnm-iv-1247 1
 
< 0.1%
jfnm-iv-1248 1
 
< 0.1%
jfnm-iv-1249 1
 
< 0.1%
jfnm-iv-1250 1
 
< 0.1%
jfnm-iv-1251 1
 
< 0.1%
Other values (3981) 3981
99.7%
2023-12-12T20:43:01.867443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 7982
17.5%
F 5400
11.8%
J 3991
8.7%
N 3991
8.7%
M 3991
8.7%
I 3991
8.7%
1 2710
 
5.9%
V 2582
 
5.7%
2 1883
 
4.1%
3 1299
 
2.8%
Other values (7) 7858
17.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 23946
52.4%
Decimal Number 13750
30.1%
Dash Punctuation 7982
 
17.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2710
19.7%
2 1883
13.7%
3 1299
9.4%
4 1209
8.8%
5 1182
8.6%
6 1099
8.0%
7 1099
8.0%
8 1092
7.9%
9 1089
7.9%
0 1088
7.9%
Uppercase Letter
ValueCountFrequency (%)
F 5400
22.6%
J 3991
16.7%
N 3991
16.7%
M 3991
16.7%
I 3991
16.7%
V 2582
10.8%
Dash Punctuation
ValueCountFrequency (%)
- 7982
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 23946
52.4%
Common 21732
47.6%

Most frequent character per script

Common
ValueCountFrequency (%)
- 7982
36.7%
1 2710
 
12.5%
2 1883
 
8.7%
3 1299
 
6.0%
4 1209
 
5.6%
5 1182
 
5.4%
6 1099
 
5.1%
7 1099
 
5.1%
8 1092
 
5.0%
9 1089
 
5.0%
Latin
ValueCountFrequency (%)
F 5400
22.6%
J 3991
16.7%
N 3991
16.7%
M 3991
16.7%
I 3991
16.7%
V 2582
10.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 45678
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 7982
17.5%
F 5400
11.8%
J 3991
8.7%
N 3991
8.7%
M 3991
8.7%
I 3991
8.7%
1 2710
 
5.9%
V 2582
 
5.7%
2 1883
 
4.1%
3 1299
 
2.8%
Other values (7) 7858
17.2%

수량
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
1
3991 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 3991
100.0%

Length

2023-12-12T20:43:02.081591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:43:02.241105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 3991
100.0%

사진분류
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
패류
1449 
어류
1409 
갑각류
585 
산호류
232 
극피동물
200 
Other values (6)
 
116

Length

Max length4
Median length2
Mean length2.351541
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row어류
2nd row어류
3rd row어류
4th row어류
5th row어류

Common Values

ValueCountFrequency (%)
패류 1449
36.3%
어류 1409
35.3%
갑각류 585
14.7%
산호류 232
 
5.8%
극피동물 200
 
5.0%
해면류 46
 
1.2%
연체동물 39
 
1.0%
강장동물 21
 
0.5%
완흉동물 8
 
0.2%
태형동물 1
 
< 0.1%

Length

2023-12-12T20:43:02.513807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
패류 1449
36.3%
어류 1409
35.3%
갑각류 585
14.7%
산호류 232
 
5.8%
극피동물 200
 
5.0%
해면류 46
 
1.2%
연체동물 39
 
1.0%
강장동물 21
 
0.5%
완흉동물 8
 
0.2%
태형동물 1
 
< 0.1%

상태
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
건조
1605 
액침
1551 
박제
835 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row박제
2nd row박제
3rd row박제
4th row박제
5th row박제

Common Values

ValueCountFrequency (%)
건조 1605
40.2%
액침 1551
38.9%
박제 835
20.9%

Length

2023-12-12T20:43:02.769723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:43:02.968233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건조 1605
40.2%
액침 1551
38.9%
박제 835
20.9%

수집방법
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
구입
1719 
자체
1683 
기증
589 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row구입
2nd row구입
3rd row구입
4th row구입
5th row구입

Common Values

ValueCountFrequency (%)
구입 1719
43.1%
자체 1683
42.2%
기증 589
 
14.8%

Length

2023-12-12T20:43:03.178522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:43:03.419706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
구입 1719
43.1%
자체 1683
42.2%
기증 589
 
14.8%

현 상태
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
해양수장고
3991 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row해양수장고
2nd row해양수장고
3rd row해양수장고
4th row해양수장고
5th row해양수장고

Common Values

ValueCountFrequency (%)
해양수장고 3991
100.0%

Length

2023-12-12T20:43:03.628869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:43:03.830519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해양수장고 3991
100.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size31.3 KiB
Minimum2016-09-30 00:00:00
Maximum2016-09-30 00:00:00
2023-12-12T20:43:03.997428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:43:04.176089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-12T20:43:04.341277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
중분류소분류이명사진분류상태수집방법
중분류1.0001.0000.2151.0000.5100.232
소분류1.0001.0000.0000.9950.8190.550
이명0.2150.0001.0000.6180.3440.353
사진분류1.0000.9950.6181.0000.7430.441
상태0.5100.8190.3440.7431.0000.640
수집방법0.2320.5500.3530.4410.6401.000
2023-12-12T20:43:04.542360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
중분류소분류사진분류이명상태수집방법
중분류1.0000.9940.9990.1710.7720.379
소분류0.9941.0000.9440.0000.6000.323
사진분류0.9990.9441.0000.2550.5970.288
이명0.1710.0000.2551.0000.1810.187
상태0.7720.6000.5970.1811.0000.303
수집방법0.3790.3230.2880.1870.3031.000
2023-12-12T20:43:04.766269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
중분류소분류이명사진분류상태수집방법
중분류1.0000.9940.1710.9990.7720.379
소분류0.9941.0000.0000.9440.6000.323
이명0.1710.0001.0000.2550.1810.187
사진분류0.9990.9440.2551.0000.5970.288
상태0.7720.6000.1810.5971.0000.303
수집방법0.3790.3230.1870.2880.3031.000

Missing values

2023-12-12T20:42:51.209170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:42:51.625611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관코드대분류중분류소분류세분류속명종명명칭이명자료수집번호(통합)자료세부분류번호수량사진분류상태수집방법현 상태데이터기준일자
0JFNM동물어류홍어목수구리과Rhinaancylostoma목탁수구리해당없음JFNM-1404JFNM-FI-11어류박제구입해양수장고2016-09-30
1JFNM동물어류농어목도미과Pagrusmajor참돔해당없음JFNM-1405JFNM-FI-21어류박제구입해양수장고2016-09-30
2JFNM동물어류농어목바리과Epinephelusfario별우럭해당없음JFNM-1406JFNM-FI-31어류박제구입해양수장고2016-09-30
3JFNM동물어류쏨뱅이목양볼락과Inimicusjaponicus쑤기미해당없음JFNM-1407JFNM-FI-41어류박제구입해양수장고2016-09-30
4JFNM동물어류농어목바리과Epinephelusseptemfasciatus능성어구문쟁이JFNM-1408JFNM-FI-51어류박제구입해양수장고2016-09-30
5JFNM동물어류농어목바리과Epinephelusbruneus자바리자바리(다금바리)JFNM-1409JFNM-FI-61어류박제구입해양수장고2016-09-30
6JFNM동물어류홍어목색가오리과Neotrygonkuhlii꽁지가오리해당없음JFNM-1410JFNM-FI-71어류박제기증해양수장고2016-09-30
7JFNM동물어류홍어목색가오리과Neotrygonkuhlii꽁지가오리해당없음JFNM-1411JFNM-FI-81어류박제기증해양수장고2016-09-30
8JFNM동물어류홍어목색가오리과Neotrygonkuhlii꽁지가오리해당없음JFNM-1412JFNM-FI-91어류박제기증해양수장고2016-09-30
9JFNM동물어류농어목돌돔과Oplegnathusfasciatus돌돔해당없음JFNM-1413JFNM-FI-101어류박제구입해양수장고2016-09-30
기관코드대분류중분류소분류세분류속명종명명칭이명자료수집번호(통합)자료세부분류번호수량사진분류상태수집방법현 상태데이터기준일자
3981JFNM동물무척추동물완흉동물해당없음해당없음해당없음검은큰따개비해당없음JFNM-16964JFNM-IV-25731완흉동물건조자체해양수장고2016-09-30
3982JFNM동물무척추동물완흉동물해당없음해당없음해당없음검은큰따개비해당없음JFNM-16965JFNM-IV-25741완흉동물건조자체해양수장고2016-09-30
3983JFNM동물무척추동물완흉동물해당없음해당없음해당없음검은큰따개비해당없음JFNM-16966JFNM-IV-25751완흉동물건조자체해양수장고2016-09-30
3984JFNM동물무척추동물완흉동물해당없음해당없음해당없음검은큰따개비해당없음JFNM-16967JFNM-IV-25761완흉동물건조자체해양수장고2016-09-30
3985JFNM동물무척추동물완흉동물해당없음해당없음해당없음검은큰따개비해당없음JFNM-16968JFNM-IV-25771완흉동물건조자체해양수장고2016-09-30
3986JFNM동물무척추동물완흉동물해당없음해당없음해당없음거북손해당없음JFNM-16969JFNM-IV-25781완흉동물건조자체해양수장고2016-09-30
3987JFNM동물무척추동물완흉동물해당없음해당없음해당없음거북손해당없음JFNM-16970JFNM-IV-25791완흉동물건조자체해양수장고2016-09-30
3988JFNM동물무척추동물완흉동물해당없음해당없음해당없음거북손해당없음JFNM-16971JFNM-IV-25801완흉동물건조자체해양수장고2016-09-30
3989JFNM동물무척추동물태형동물해당없음해당없음해당없음빨간망이끼벌레해당없음JFNM-16972JFNM-IV-25811태형동물건조자체해양수장고2016-09-30
3990JFNM동물무척추동물환형동물해당없음해당없음해당없음바위털갯지렁이해당없음JFNM-16973JFNM-IV-25821환형동물액침자체해양수장고2016-09-30