Overview

Dataset statistics

Number of variables21
Number of observations2179
Missing cells41312
Missing cells (%)90.3%
Duplicate rows6
Duplicate rows (%)0.3%
Total size in memory395.9 KiB
Average record size in memory186.1 B

Variable types

Categorical2
Text1
Unsupported18

Dataset

Description식물검역기술개발센터의 식물검역연구장비 보유 현황 정보에 대한 데이터로
Author농림축산식품부 농림축산검역본부
URLhttps://www.data.go.kr/data/15047547/fileData.do

Alerts

Dataset has 6 (0.3%) duplicate rowsDuplicates
장비등록검역지명 is highly imbalanced (75.4%)Imbalance
장비명 is highly imbalanced (91.7%)Imbalance
규격 has 2090 (95.9%) missing valuesMissing
Unnamed: 3 has 2179 (100.0%) missing valuesMissing
Unnamed: 4 has 2179 (100.0%) missing valuesMissing
Unnamed: 5 has 2179 (100.0%) missing valuesMissing
Unnamed: 6 has 2179 (100.0%) missing valuesMissing
Unnamed: 7 has 2179 (100.0%) missing valuesMissing
Unnamed: 8 has 2179 (100.0%) missing valuesMissing
Unnamed: 9 has 2179 (100.0%) missing valuesMissing
Unnamed: 10 has 2179 (100.0%) missing valuesMissing
Unnamed: 11 has 2179 (100.0%) missing valuesMissing
Unnamed: 12 has 2179 (100.0%) missing valuesMissing
Unnamed: 13 has 2179 (100.0%) missing valuesMissing
Unnamed: 14 has 2179 (100.0%) missing valuesMissing
Unnamed: 15 has 2179 (100.0%) missing valuesMissing
Unnamed: 16 has 2179 (100.0%) missing valuesMissing
Unnamed: 17 has 2179 (100.0%) missing valuesMissing
Unnamed: 18 has 2179 (100.0%) missing valuesMissing
Unnamed: 19 has 2179 (100.0%) missing valuesMissing
Unnamed: 20 has 2179 (100.0%) missing valuesMissing
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 20 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 01:14:25.674945
Analysis finished2023-12-12 01:14:25.933864
Duration0.26 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

장비등록검역지명
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size17.2 KiB
<NA>
2090 
식물검역기술개발센터
 
89

Length

Max length10
Median length4
Mean length4.2450665
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row식물검역기술개발센터
2nd row식물검역기술개발센터
3rd row식물검역기술개발센터
4th row식물검역기술개발센터
5th row식물검역기술개발센터

Common Values

ValueCountFrequency (%)
<NA> 2090
95.9%
식물검역기술개발센터 89
 
4.1%

Length

2023-12-12T10:14:26.004845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:14:26.115787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2090
95.9%
식물검역기술개발센터 89
 
4.1%

장비명
Categorical

IMBALANCE 

Distinct38
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size17.2 KiB
<NA>
2090 
해부현미경
 
13
유전자증폭기
 
9
원심분리기
 
7
전기영동장치
 
7
Other values (33)
 
53

Length

Max length13
Median length4
Mean length4.0761817
Min length3

Unique

Unique21 ?
Unique (%)1.0%

Sample

1st row해부현미경
2nd row해부현미경
3rd row해부현미경
4th row저온저장고
5th row항온수조

Common Values

ValueCountFrequency (%)
<NA> 2090
95.9%
해부현미경 13
 
0.6%
유전자증폭기 9
 
0.4%
원심분리기 7
 
0.3%
전기영동장치 7
 
0.3%
광학현미경 5
 
0.2%
무균상 4
 
0.2%
가스크로마토그라프악세사리 3
 
0.1%
초순수제조기 3
 
0.1%
고압멸균기 3
 
0.1%
Other values (28) 35
 
1.6%

Length

2023-12-12T10:14:26.228354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 2090
95.9%
해부현미경 13
 
0.6%
유전자증폭기 9
 
0.4%
원심분리기 7
 
0.3%
전기영동장치 7
 
0.3%
광학현미경 5
 
0.2%
무균상 4
 
0.2%
가스크로마토그라프악세사리 3
 
0.1%
초순수제조기 3
 
0.1%
고압멸균기 3
 
0.1%
Other values (28) 35
 
1.6%

규격
Text

MISSING 

Distinct75
Distinct (%)84.3%
Missing2090
Missing (%)95.9%
Memory size17.2 KiB
2023-12-12T10:14:26.563534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length34
Mean length26.651685
Min length5

Characters and Unicode

Total characters2372
Distinct characters201
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique70 ?
Unique (%)78.7%

Sample

1st rowLeica microsystem, DE/S8 APO
2nd rowLeica microsystem, DE/S8 APO
3rd rowCarl zeiss, DE/V20, Motorized stereo microscope
4th row엘지전자, R-B315GB, 314L
5th row항온수조, 다솔과학, DS-21L, 45L
ValueCountFrequency (%)
leica 11
 
3.8%
carl 6
 
2.1%
bioer 5
 
1.7%
biosciences 5
 
1.7%
cn/tc-s/byq6067 5
 
1.7%
us/he99x 5
 
1.7%
zeiss 5
 
1.7%
amersham 5
 
1.7%
apo 5
 
1.7%
de/s8 5
 
1.7%
Other values (205) 232
80.3%
2023-12-12T10:14:27.055322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
200
 
8.4%
e 118
 
5.0%
, 107
 
4.5%
i 97
 
4.1%
0 97
 
4.1%
r 72
 
3.0%
o 70
 
3.0%
s 70
 
3.0%
a 68
 
2.9%
c 66
 
2.8%
Other values (191) 1407
59.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 888
37.4%
Uppercase Letter 483
20.4%
Decimal Number 285
 
12.0%
Other Letter 273
 
11.5%
Space Separator 200
 
8.4%
Other Punctuation 177
 
7.5%
Dash Punctuation 48
 
2.0%
Open Punctuation 6
 
0.3%
Close Punctuation 6
 
0.3%
Math Symbol 6
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
4.4%
10
 
3.7%
10
 
3.7%
8
 
2.9%
6
 
2.2%
6
 
2.2%
5
 
1.8%
5
 
1.8%
5
 
1.8%
5
 
1.8%
Other values (119) 201
73.6%
Uppercase Letter
ValueCountFrequency (%)
S 53
 
11.0%
C 38
 
7.9%
D 35
 
7.2%
E 34
 
7.0%
A 33
 
6.8%
L 30
 
6.2%
M 28
 
5.8%
B 27
 
5.6%
T 26
 
5.4%
P 25
 
5.2%
Other values (16) 154
31.9%
Lowercase Letter
ValueCountFrequency (%)
e 118
13.3%
i 97
10.9%
r 72
 
8.1%
o 70
 
7.9%
s 70
 
7.9%
a 68
 
7.7%
c 66
 
7.4%
t 52
 
5.9%
m 51
 
5.7%
l 41
 
4.6%
Other values (15) 183
20.6%
Decimal Number
ValueCountFrequency (%)
0 97
34.0%
1 41
14.4%
2 33
 
11.6%
5 22
 
7.7%
6 21
 
7.4%
9 18
 
6.3%
8 17
 
6.0%
7 12
 
4.2%
3 12
 
4.2%
4 12
 
4.2%
Other Punctuation
ValueCountFrequency (%)
, 107
60.5%
/ 60
33.9%
. 5
 
2.8%
* 4
 
2.3%
& 1
 
0.6%
Math Symbol
ValueCountFrequency (%)
× 4
66.7%
~ 2
33.3%
Space Separator
ValueCountFrequency (%)
200
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 48
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1371
57.8%
Common 728
30.7%
Hangul 273
 
11.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
4.4%
10
 
3.7%
10
 
3.7%
8
 
2.9%
6
 
2.2%
6
 
2.2%
5
 
1.8%
5
 
1.8%
5
 
1.8%
5
 
1.8%
Other values (119) 201
73.6%
Latin
ValueCountFrequency (%)
e 118
 
8.6%
i 97
 
7.1%
r 72
 
5.3%
o 70
 
5.1%
s 70
 
5.1%
a 68
 
5.0%
c 66
 
4.8%
S 53
 
3.9%
t 52
 
3.8%
m 51
 
3.7%
Other values (41) 654
47.7%
Common
ValueCountFrequency (%)
200
27.5%
, 107
14.7%
0 97
13.3%
/ 60
 
8.2%
- 48
 
6.6%
1 41
 
5.6%
2 33
 
4.5%
5 22
 
3.0%
6 21
 
2.9%
9 18
 
2.5%
Other values (11) 81
11.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2095
88.3%
Hangul 273
 
11.5%
None 4
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
200
 
9.5%
e 118
 
5.6%
, 107
 
5.1%
i 97
 
4.6%
0 97
 
4.6%
r 72
 
3.4%
o 70
 
3.3%
s 70
 
3.3%
a 68
 
3.2%
c 66
 
3.2%
Other values (61) 1130
53.9%
Hangul
ValueCountFrequency (%)
12
 
4.4%
10
 
3.7%
10
 
3.7%
8
 
2.9%
6
 
2.2%
6
 
2.2%
5
 
1.8%
5
 
1.8%
5
 
1.8%
5
 
1.8%
Other values (119) 201
73.6%
None
ValueCountFrequency (%)
× 4
100.0%

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Unnamed: 20
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2179
Missing (%)100.0%
Memory size19.3 KiB

Sample

장비등록검역지명장비명규격Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20
0식물검역기술개발센터해부현미경Leica microsystem, DE/S8 APO<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
1식물검역기술개발센터해부현미경Leica microsystem, DE/S8 APO<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2식물검역기술개발센터해부현미경Carl zeiss, DE/V20, Motorized stereo microscope<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
3식물검역기술개발센터저온저장고엘지전자, R-B315GB, 314L<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
4식물검역기술개발센터항온수조항온수조, 다솔과학, DS-21L, 45L<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
5식물검역기술개발센터유전자증폭기MyGenie96ThermalBlock<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
6식물검역기술개발센터해부현미경Leica microsystem, DE/S8 APO<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
7식물검역기술개발센터해부현미경Leica Microsystems, CH/M205C, 7.8~160배<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
8식물검역기술개발센터겔다큐멘테이션시스템DNR BioImaging Systems, IT/MiniBIS Pro<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
9식물검역기술개발센터원심분리기17000rpm<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
장비등록검역지명장비명규격Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20
2169<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2170<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2171<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2172<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2173<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2174<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2175<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2176<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2177<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>
2178<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

장비등록검역지명장비명규격# duplicates
5<NA><NA><NA>2090
2식물검역기술개발센터유전자증폭기Bioer, CN/TC-S/BYQ60675
3식물검역기술개발센터전기영동장치Amersham biosciences, US/HE99X5
4식물검역기술개발센터해부현미경Leica microsystem, DE/S8 APO5
0식물검역기술개발센터겔다큐멘테이션시스템DNR BioImaging Systems, IT/MiniBIS Pro2
1식물검역기술개발센터무균상Eriab, FR/captair biotair biocap RNA/DNA2