Overview

Dataset statistics

Number of variables: 13
Number of observations: 44
Missing cells: 115
Missing cells (%): 20.1%
Duplicate rows: 2
Duplicate rows (%): 4.5%
Total size in memory: 4.6 KiB
Average record size in memory: 107.0 B
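
The percentages above follow directly from the reported counts. The sketch below is a minimal illustration, not the profiler's own code: `missing_cell_stats` is a hypothetical helper that treats `None` as a missing cell, and the final lines simply redo the arithmetic with the figures from this report.

```python
def missing_cell_stats(rows):
    """Return (n_missing, fraction_missing) for a list of rows.

    Hypothetical helper: a cell counts as missing when it is None.
    """
    n_cells = sum(len(row) for row in rows)
    n_missing = sum(1 for row in rows for cell in row if cell is None)
    fraction = n_missing / n_cells if n_cells else 0.0
    return n_missing, fraction


# Redo the headline arithmetic from the counts reported above.
n_variables, n_observations = 13, 44
missing_cells, duplicate_rows = 115, 2

print(f"{missing_cells / (n_variables * n_observations):.1%}")  # 20.1%
print(f"{duplicate_rows / n_observations:.1%}")                 # 4.5%
```

The denominator for missing cells is the full grid of 13 × 44 = 572 cells, while the duplicate-row percentage is taken over the 44 observations.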

Variable types

DateTime: 1
Text: 6
Categorical: 6

Dataset

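Description: Natural gas compressor data (applicant, model name, etc.) from the Environmental Policy Fund Loan System of the Korea Environmental Industry & Technology Institute (한국환경산업기술원), 2020 edition.
URL: https://www.data.go.kr/data/15120433/fileData.do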

Alerts

Dataset has 2 (4.5%) duplicate rows [Duplicates]
프레임 (frame) is highly correlated overall with 모델명 (model name) and 3 other fields [High correlation]
모델명 (model name) is highly correlated overall with 프레임 (frame) and 3 other fields [High correlation]
구동기 종류 (driver type) is highly correlated overall with 프레임 (frame) and 1 other field [High correlation]
구동기 용량 (driver capacity) is highly correlated overall with 프레임 (frame) and 4 other fields [High correlation]
압축단수 (number of compression stages) is highly correlated overall with 모델명 (model name) and 1 other field [High correlation]
냉각방식 (cooling method) is highly correlated overall with 프레임 (frame) and 2 other fields [High correlation]
흡입측 압력 (suction-side pressure) has 19 (43.2%) missing values [Missing]
토출용량 (discharge capacity) has 19 (43.2%) missing values [Missing]
윤활방식 (lubrication method) has 19 (43.2%) missing values [Missing]
첫번째 단의 크기 (first-stage size) has 29 (65.9%) missing values [Missing]
최종단의 크기 (final-stage size) has 29 (65.9%) missing values [Missing]

Reproduction

Analysis started: 2023-12-12 13:57:52.656773
Analysis finished: 2023-12-12 13:57:54.294927
Duration: 1.64 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

등록일자 (registration date)
DateTime

Distinct: 38
Distinct (%): 86.4%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B
Minimum: 2015-01-13 00:00:00
Maximum: 2020-11-30 00:00:00
Histogram with fixed size bins (bins=38)

등록자 (registrant)
Text

Distinct: 24
Distinct (%): 54.5%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 10
Median length: 5
Mean length: 6.5681818
Min length: 4

Characters and Unicode

Total characters: 289
Distinct characters: 15
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
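
As a rough illustration of that idea, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. It is an assumption about how such counts can be obtained, not the profiler's implementation; `category_counts` is a hypothetical helper, and the input is the five sample values listed for this variable. The codes `Nd` and `Lu` correspond to the "Decimal Number" and "Uppercase Letter" labels used in this report.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character of every value."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# The five sample values shown for this variable.
sample = ["22421", "20033", "A210111534", "20036", "10712"]
counts = category_counts(sample)
print(counts["Nd"], counts["Lu"])  # 29 1
```

Script and block membership are not exposed by `unicodedata`, so reproducing those parts of the report would need additional data or a third-party package.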

Unique

Unique: 16
Unique (%): 36.4%

Sample

1st row: 22421
2nd row: 20033
3rd row: A210111534
4th row: 20036
5th row: 10712

Value | Count | Frequency (%)
20036 | 8 | 18.2%
11090 | 5 | 11.4%
mgr0003914 | 4 | 9.1%
10712 | 3 | 6.8%
a210111534 | 2 | 4.5%
c201090001 | 2 | 4.5%
mgr0005400 | 2 | 4.5%
8453 | 2 | 4.5%
16749 | 1 | 2.3%
22421 | 1 | 2.3%
Other values (14) | 14 | 31.8%

Most occurring characters

Value | Count | Frequency (%)
0 | 78 | 27.0%
1 | 53 | 18.3%
2 | 27 | 9.3%
3 | 26 | 9.0%
6 | 19 | 6.6%
9 | 17 | 5.9%
4 | 14 | 4.8%
5 | 10 | 3.5%
M | 8 | 2.8%
G | 8 | 2.8%
Other values (5) | 29 | 10.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 258 | 89.3%
Uppercase Letter | 31 | 10.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 78 | 30.2%
1 | 53 | 20.5%
2 | 27 | 10.5%
3 | 26 | 10.1%
6 | 19 | 7.4%
9 | 17 | 6.6%
4 | 14 | 5.4%
5 | 10 | 3.9%
8 | 8 | 3.1%
7 | 6 | 2.3%

Uppercase Letter
Value | Count | Frequency (%)
M | 8 | 25.8%
G | 8 | 25.8%
R | 8 | 25.8%
A | 5 | 16.1%
C | 2 | 6.5%

Most occurring scripts

Value | Count | Frequency (%)
Common | 258 | 89.3%
Latin | 31 | 10.7%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 78 | 30.2%
1 | 53 | 20.5%
2 | 27 | 10.5%
3 | 26 | 10.1%
6 | 19 | 7.4%
9 | 17 | 6.6%
4 | 14 | 5.4%
5 | 10 | 3.9%
8 | 8 | 3.1%
7 | 6 | 2.3%

Latin
Value | Count | Frequency (%)
M | 8 | 25.8%
G | 8 | 25.8%
R | 8 | 25.8%
A | 5 | 16.1%
C | 2 | 6.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 289 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 78 | 27.0%
1 | 53 | 18.3%
2 | 27 | 9.3%
3 | 26 | 9.0%
6 | 19 | 6.6%
9 | 17 | 5.9%
4 | 14 | 4.8%
5 | 10 | 3.5%
M | 8 | 2.8%
G | 8 | 2.8%
Other values (5) | 29 | 10.0%

프레임 (frame)
Categorical

HIGH CORRELATION

Distinct: 11
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 22
Median length: 4
Mean length: 6.9772727
Min length: 3

Unique

Unique: 6
Unique (%): 13.6%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: LNG PUMP CASING VESSEL

Common Values

Value | Count | Frequency (%)
<NA> | 23 | 52.3%
JGQ | 7 | 15.9%
LNG PUMP CASING VESSEL | 3 | 6.8%
초저온LNG펌프 | 3 | 6.8%
초저온 LNG 펌프 | 2 | 4.5%
초저온 LNG펌프 | 1 | 2.3%
W3420*L2520*H2645 | 1 | 2.3%
Horizontal heavy duty | 1 | 2.3%
CC250-001 | 1 | 2.3%
W4000*L3080*H3968 | 1 | 2.3%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
na | 23 | 37.7%
jgq | 8 | 13.1%
lng | 5 | 8.2%
pump | 3 | 4.9%
casing | 3 | 4.9%
vessel | 3 | 4.9%
초저온lng펌프 | 3 | 4.9%
초저온 | 3 | 4.9%
펌프 | 2 | 3.3%
lng펌프 | 1 | 1.6%
Other values (7) | 7 | 11.5%

모델명 (model name)
Categorical

HIGH CORRELATION

Distinct: 12
Distinct (%): 27.3%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 22
Median length: 16
Mean length: 6.2727273
Min length: 4

Unique

Unique: 5
Unique (%): 11.4%

Sample

1st row: 250HP
2nd row: <NA>
3rd row: GEO-B250
4th row: <NA>
5th row: AC34 164508211-2

Common Values

Value | Count | Frequency (%)
<NA> | 19 | 43.2%
AC-32 | 6 | 13.6%
JGQ2/4 | 5 | 11.4%
JGQ/2/4 | 3 | 6.8%
AC34 164508211-2 | 2 | 4.5%
JGQ 2/4 | 2 | 4.5%
GEO-B-250 | 2 | 4.5%
250HP | 1 | 2.3%
GEO-B250 | 1 | 2.3%
75~400LPM | 1 | 2.3%
Other values (2) | 2 | 4.5%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
na | 19 | 39.6%
ac-32 | 6 | 12.5%
jgq2/4 | 5 | 10.4%
jgq/2/4 | 3 | 6.2%
ac34 | 2 | 4.2%
164508211-2 | 2 | 4.2%
jgq | 2 | 4.2%
2/4 | 2 | 4.2%
geo-b-250 | 2 | 4.2%
250hp | 1 | 2.1%
Other values (4) | 4 | 8.3%

구동기 종류 (driver type)
Categorical

HIGH CORRELATION

Distinct: 12
Distinct (%): 27.3%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 14
Median length: 4
Mean length: 6.7727273
Min length: 4

Unique

Unique: 5
Unique (%): 11.4%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 왕복동식
4th row: <NA>
5th row: 잠수펌프

Common Values

Value | Count | Frequency (%)
<NA> | 21 | 47.7%
Electric Motor | 5 | 11.4%
ELECTRIC MOTOR | 4 | 9.1%
잠수펌프 | 3 | 6.8%
왕복동식 | 2 | 4.5%
AC-32 | 2 | 4.5%
초저운액중펌프 | 2 | 4.5%
ElectricMotor | 1 | 2.3%
초저운 액중펌프 | 1 | 2.3%
초저온 액중펌프 | 1 | 2.3%
Other values (2) | 2 | 4.5%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
na | 21 | 37.5%
motor | 11 | 19.6%
electric | 9 | 16.1%
잠수펌프 | 3 | 5.4%
왕복동식 | 2 | 3.6%
ac-32 | 2 | 3.6%
초저운액중펌프 | 2 | 3.6%
액중펌프 | 2 | 3.6%
electricmotor | 1 | 1.8%
초저운 | 1 | 1.8%
Other values (2) | 2 | 3.6%

구동기 용량 (driver capacity)
Categorical

HIGH CORRELATION

Distinct: 7
Distinct (%): 15.9%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 17
Median length: 11
Mean length: 5.1590909
Min length: 3

Unique

Unique: 2
Unique (%): 4.5%

Sample

1st row: 250
2nd row: <NA>
3rd row: 250
4th row: <NA>
5th row: 30HP 120HZ 460VAC

Common Values

Value | Count | Frequency (%)
<NA> | 19 | 43.2%
250 | 11 | 25.0%
18.6Kw | 6 | 13.6%
30HP 120HZ 460VAC | 3 | 6.8%
250Hp | 3 | 6.8%
250HP/185KW | 1 | 2.3%
250HP | 1 | 2.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 19 | 38.0%
250 | 11 | 22.0%
18.6kw | 6 | 12.0%
250hp | 4 | 8.0%
30hp | 3 | 6.0%
120hz | 3 | 6.0%
460vac | 3 | 6.0%
250hp/185kw | 1 | 2.0%

흡입측 압력 (suction-side pressure)
Text

MISSING

Distinct: 15
Distinct (%): 60.0%
Missing: 19
Missing (%): 43.2%
Memory size: 484.0 B

Length

Max length: 18
Median length: 11
Mean length: 5.12
Min length: 1

Characters and Unicode

Total characters: 128
Distinct characters: 30
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 12
Unique (%): 48.0%

Sample

1st row: 4BAR
2nd row: 0.3292Mpa
3rd row: 12BAR
4th row: 4.5
5th row: 2bar

Value | Count | Frequency (%)
4.5 | 7 | 23.3%
2bar | 5 | 16.7%
12bar | 3 | 10.0%
bar | 2 | 6.7%
0.3292mpa | 1 | 3.3%
0.5 | 1 | 3.3%
2.5 | 1 | 3.3%
2 | 1 | 3.3%
4bar | 1 | 3.3%
2.5~8.5 | 1 | 3.3%
Other values (7) | 7 | 23.3%

Most occurring characters

ValueCountFrequency (%)
. 17
13.3%
2 13
10.2%
5 12
 
9.4%
a 12
 
9.4%
4 11
 
8.6%
b 9
 
7.0%
r 9
 
7.0%
0 6
 
4.7%
5
 
3.9%
~ 3
 
2.3%
Other values (20) 31
24.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 51
39.8%
Lowercase Letter 33
25.8%
Other Punctuation 17
 
13.3%
Uppercase Letter 11
 
8.6%
Other Letter 6
 
4.7%
Space Separator 5
 
3.9%
Math Symbol 3
 
2.3%
Close Punctuation 1
 
0.8%
Open Punctuation 1
 
0.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 13
25.5%
5 12
23.5%
4 11
21.6%
0 6
11.8%
1 3
 
5.9%
8 2
 
3.9%
9 2
 
3.9%
3 1
 
2.0%
6 1
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
M 3
27.3%
R 2
18.2%
A 2
18.2%
B 2
18.2%
G 1
 
9.1%
K 1
 
9.1%
Other Letter
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Lowercase Letter
ValueCountFrequency (%)
a 12
36.4%
b 9
27.3%
r 9
27.3%
p 3
 
9.1%
Other Punctuation
ValueCountFrequency (%)
. 17
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 78
60.9%
Latin 44
34.4%
Hangul 6
 
4.7%

Most frequent character per script

Common
ValueCountFrequency (%)
. 17
21.8%
2 13
16.7%
5 12
15.4%
4 11
14.1%
0 6
 
7.7%
5
 
6.4%
~ 3
 
3.8%
1 3
 
3.8%
8 2
 
2.6%
9 2
 
2.6%
Other values (4) 4
 
5.1%
Latin
ValueCountFrequency (%)
a 12
27.3%
b 9
20.5%
r 9
20.5%
M 3
 
6.8%
p 3
 
6.8%
R 2
 
4.5%
A 2
 
4.5%
B 2
 
4.5%
G 1
 
2.3%
K 1
 
2.3%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 122
95.3%
Hangul 6
 
4.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 17
13.9%
2 13
10.7%
5 12
9.8%
a 12
9.8%
4 11
9.0%
b 9
 
7.4%
r 9
 
7.4%
0 6
 
4.9%
5
 
4.1%
~ 3
 
2.5%
Other values (14) 25
20.5%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

토출용량 (discharge capacity)
Text

MISSING

Distinct: 13
Distinct (%): 52.0%
Missing: 19
Missing (%): 43.2%
Memory size: 484.0 B

Length

Max length: 22
Median length: 19
Mean length: 8
Min length: 3

Characters and Unicode

Total characters: 200
Distinct characters: 35
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 8
Unique (%): 32.0%

Sample

1st row: 250BAR
2nd row: 925Nm3/hr
3rd row: 75~400LPM
4th row: 900
5th row: 340L/MIN

Value | Count | Frequency (%)
340l/min | 6 | 19.4%
881 | 4 | 12.9%
75~400lpm | 3 | 9.7%
925nm3/hr | 2 | 6.5%
900 | 2 | 6.5%
925 | 2 | 6.5%
5.5 | 1 | 3.2%
1205nm3/hr | 1 | 3.2%
n㎥/hr | 1 | 3.2%
1200.0n㎥/hr@0.85mpa | 1 | 3.2%
Other values (8) | 8 | 25.8%

Most occurring characters

ValueCountFrequency (%)
0 24
 
12.0%
5 15
 
7.5%
/ 12
 
6.0%
N 12
 
6.0%
M 10
 
5.0%
1 10
 
5.0%
3 9
 
4.5%
4 9
 
4.5%
L 9
 
4.5%
8 9
 
4.5%
Other values (25) 81
40.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 98
49.0%
Uppercase Letter 44
22.0%
Lowercase Letter 26
 
13.0%
Other Punctuation 20
 
10.0%
Space Separator 6
 
3.0%
Math Symbol 3
 
1.5%
Other Symbol 2
 
1.0%
Other Number 1
 
0.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 24
24.5%
5 15
15.3%
1 10
10.2%
3 9
 
9.2%
4 9
 
9.2%
8 9
 
9.2%
2 9
 
9.2%
9 7
 
7.1%
7 4
 
4.1%
6 2
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
N 12
27.3%
M 10
22.7%
L 9
20.5%
I 6
13.6%
P 3
 
6.8%
H 1
 
2.3%
R 1
 
2.3%
A 1
 
2.3%
B 1
 
2.3%
Lowercase Letter
ValueCountFrequency (%)
r 8
30.8%
h 5
19.2%
m 4
15.4%
a 4
15.4%
b 2
 
7.7%
p 1
 
3.8%
t 1
 
3.8%
g 1
 
3.8%
Other Punctuation
ValueCountFrequency (%)
/ 12
60.0%
. 5
25.0%
@ 2
 
10.0%
, 1
 
5.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Other Number
ValueCountFrequency (%)
³ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 130
65.0%
Latin 70
35.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 24
18.5%
5 15
11.5%
/ 12
9.2%
1 10
7.7%
3 9
 
6.9%
4 9
 
6.9%
8 9
 
6.9%
2 9
 
6.9%
9 7
 
5.4%
6
 
4.6%
Other values (8) 20
15.4%
Latin
ValueCountFrequency (%)
N 12
17.1%
M 10
14.3%
L 9
12.9%
r 8
11.4%
I 6
8.6%
h 5
7.1%
m 4
 
5.7%
a 4
 
5.7%
P 3
 
4.3%
b 2
 
2.9%
Other values (7) 7
10.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 197
98.5%
CJK Compat 2
 
1.0%
None 1
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 24
 
12.2%
5 15
 
7.6%
/ 12
 
6.1%
N 12
 
6.1%
M 10
 
5.1%
1 10
 
5.1%
3 9
 
4.6%
4 9
 
4.6%
L 9
 
4.6%
8 9
 
4.6%
Other values (23) 78
39.6%
CJK Compat
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
³ 1
100.0%

압축단수 (number of compression stages)
Categorical

HIGH CORRELATION

Distinct: 8
Distinct (%): 18.2%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 12
Median length: 9
Mean length: 3.2045455
Min length: 1

Unique

Unique: 4
Unique (%): 9.1%

Sample

1st row: 4
2nd row: <NA>
3rd row: 4단
4th row: <NA>
5th row: 1단1회전

Common Values

Value | Count | Frequency (%)
<NA> | 19 | 43.2%
4 | 10 | 22.7%
1단 | 6 | 13.6%
4단 | 5 | 11.4%
1단1회전 | 1 | 2.3%
1단 1회전 타입 | 1 | 2.3%
1 단 1회전 type | 1 | 2.3%
4 Stage | 1 | 2.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 19 | 38.0%
4 | 11 | 22.0%
1단 | 7 | 14.0%
4단 | 5 | 10.0%
1회전 | 2 | 4.0%
1단1회전 | 1 | 2.0%
타입 | 1 | 2.0%
1 | 1 | 2.0%
(blank) | 1 | 2.0%
type | 1 | 2.0%

윤활방식 (lubrication method)
Text

MISSING

Distinct: 17
Distinct (%): 68.0%
Missing: 19
Missing (%): 43.2%
Memory size: 484.0 B

Length

Max length: 19
Median length: 17
Mean length: 10.08
Min length: 2

Characters and Unicode

Total characters: 252
Distinct characters: 46
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 13
Unique (%): 52.0%

Sample

1st row: 급유식
2nd row: Oil Pump 가동방식
3rd row: 없음(LNG 잠수 TYPE
4th row: OilPump에의한강제윤활
5th row: LNG

Value | Count | Frequency (%)
oil | 8 | 12.7%
의한 | 8 | 12.7%
lng | 6 | 9.5%
pump에 | 5 | 7.9%
강제 | 5 | 7.9%
강제윤활 | 4 | 6.3%
윤활 | 4 | 6.3%
없음(lng | 3 | 4.8%
pump | 3 | 4.8%
잠수 | 3 | 4.8%
Other values (11) | 14 | 22.2%

Most occurring characters

ValueCountFrequency (%)
38
 
15.1%
P 13
 
5.2%
L 12
 
4.8%
11
 
4.4%
11
 
4.4%
11
 
4.4%
11
 
4.4%
G 9
 
3.6%
N 9
 
3.6%
9
 
3.6%
Other values (36) 118
46.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 103
40.9%
Uppercase Letter 67
26.6%
Lowercase Letter 40
 
15.9%
Space Separator 38
 
15.1%
Open Punctuation 3
 
1.2%
Close Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
10.7%
11
10.7%
11
10.7%
11
10.7%
9
8.7%
9
8.7%
9
8.7%
5
 
4.9%
3
 
2.9%
3
 
2.9%
Other values (11) 21
20.4%
Uppercase Letter
ValueCountFrequency (%)
P 13
19.4%
L 12
17.9%
G 9
13.4%
N 9
13.4%
O 9
13.4%
I 3
 
4.5%
F 2
 
3.0%
E 2
 
3.0%
Y 2
 
3.0%
T 2
 
3.0%
Other values (2) 4
 
6.0%
Lowercase Letter
ValueCountFrequency (%)
m 7
17.5%
p 7
17.5%
u 7
17.5%
l 6
15.0%
i 6
15.0%
e 3
7.5%
o 1
 
2.5%
r 1
 
2.5%
c 1
 
2.5%
d 1
 
2.5%
Space Separator
ValueCountFrequency (%)
38
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 107
42.5%
Hangul 103
40.9%
Common 42
 
16.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
P 13
12.1%
L 12
11.2%
G 9
8.4%
N 9
8.4%
O 9
8.4%
m 7
 
6.5%
p 7
 
6.5%
u 7
 
6.5%
l 6
 
5.6%
i 6
 
5.6%
Other values (12) 22
20.6%
Hangul
ValueCountFrequency (%)
11
10.7%
11
10.7%
11
10.7%
11
10.7%
9
8.7%
9
8.7%
9
8.7%
5
 
4.9%
3
 
2.9%
3
 
2.9%
Other values (11) 21
20.4%
Common
ValueCountFrequency (%)
38
90.5%
( 3
 
7.1%
) 1
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 149
59.1%
Hangul 103
40.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38
25.5%
P 13
 
8.7%
L 12
 
8.1%
G 9
 
6.0%
N 9
 
6.0%
O 9
 
6.0%
m 7
 
4.7%
p 7
 
4.7%
u 7
 
4.7%
l 6
 
4.0%
Other values (15) 32
21.5%
Hangul
ValueCountFrequency (%)
11
10.7%
11
10.7%
11
10.7%
11
10.7%
9
8.7%
9
8.7%
9
8.7%
5
 
4.9%
3
 
2.9%
3
 
2.9%
Other values (11) 21
20.4%

냉각방식 (cooling method)
Categorical

HIGH CORRELATION

Distinct: 12
Distinct (%): 27.3%
Missing: 0
Missing (%): 0.0%
Memory size: 484.0 B

Length

Max length: 10
Median length: 8
Mean length: 4.5909091
Min length: 3

Unique

Unique: 6
Unique (%): 13.6%

Sample

1st row: 공냉식
2nd row: <NA>
3rd row: 공냉식 열교환기
4th row: <NA>
5th row: LNG 냉열

Common Values

Value | Count | Frequency (%)
<NA> | 19 | 43.2%
LNG | 6 | 13.6%
공냉식 | 5 | 11.4%
LNG 냉열 | 3 | 6.8%
공랭식(Air) | 3 | 6.8%
공냉식 열교환기 | 2 | 4.5%
공랭식 | 1 | 2.3%
공냉식(Air) | 1 | 2.3%
공냉식(AIR) | 1 | 2.3%
AIR | 1 | 2.3%
Other values (2) | 2 | 4.5%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
na | 19 | 38.0%
lng | 9 | 18.0%
공냉식 | 7 | 14.0%
냉열 | 3 | 6.0%
공랭식(air | 3 | 6.0%
air | 3 | 6.0%
열교환기 | 2 | 4.0%
공냉식(air | 2 | 4.0%
공랭식 | 1 | 2.0%
cooled | 1 | 2.0%

첫번째 단의 크기 (first-stage size)
Text

MISSING

Distinct: 9
Distinct (%): 60.0%
Missing: 29
Missing (%): 65.9%
Memory size: 484.0 B

Length

Max length: 24
Median length: 1
Mean length: 3.5333333
Min length: 1

Characters and Unicode

Total characters: 53
Distinct characters: 27
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 8
Unique (%): 53.3%

Sample

1st row: 2-1/2&quot; ANSI150lb RF
2nd row: 8
3rd row: -
4th row: 8
5th row: 8

Value | Count | Frequency (%)
8 | 7 | 41.2%
2-1/2&quot | 1 | 5.9%
ansi150lb | 1 | 5.9%
rf | 1 | 5.9%
(blank) | 1 | 5.9%
8“ | 1 | 5.9%
270 | 1 | 5.9%
80a | 1 | 5.9%
215mm | 1 | 5.9%
280 | 1 | 5.9%

Most occurring characters

ValueCountFrequency (%)
8 10
18.9%
2 7
13.2%
0 5
 
9.4%
1 3
 
5.7%
5 2
 
3.8%
- 2
 
3.8%
m 2
 
3.8%
2
 
3.8%
A 2
 
3.8%
3 1
 
1.9%
Other values (17) 17
32.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 29
54.7%
Lowercase Letter 8
 
15.1%
Uppercase Letter 7
 
13.2%
Other Punctuation 4
 
7.5%
Dash Punctuation 2
 
3.8%
Space Separator 2
 
3.8%
Initial Punctuation 1
 
1.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 10
34.5%
2 7
24.1%
0 5
17.2%
1 3
 
10.3%
5 2
 
6.9%
3 1
 
3.4%
7 1
 
3.4%
Lowercase Letter
ValueCountFrequency (%)
m 2
25.0%
b 1
12.5%
l 1
12.5%
t 1
12.5%
o 1
12.5%
u 1
12.5%
q 1
12.5%
Uppercase Letter
ValueCountFrequency (%)
A 2
28.6%
F 1
14.3%
R 1
14.3%
N 1
14.3%
I 1
14.3%
S 1
14.3%
Other Punctuation
ValueCountFrequency (%)
; 1
25.0%
& 1
25.0%
/ 1
25.0%
. 1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 38
71.7%
Latin 15
 
28.3%

Most frequent character per script

Common
ValueCountFrequency (%)
8 10
26.3%
2 7
18.4%
0 5
13.2%
1 3
 
7.9%
5 2
 
5.3%
- 2
 
5.3%
2
 
5.3%
3 1
 
2.6%
7 1
 
2.6%
1
 
2.6%
Other values (4) 4
 
10.5%
Latin
ValueCountFrequency (%)
m 2
13.3%
A 2
13.3%
F 1
 
6.7%
R 1
 
6.7%
b 1
 
6.7%
l 1
 
6.7%
N 1
 
6.7%
I 1
 
6.7%
S 1
 
6.7%
t 1
 
6.7%
Other values (3) 3
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 52
98.1%
Punctuation 1
 
1.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 10
19.2%
2 7
13.5%
0 5
 
9.6%
1 3
 
5.8%
5 2
 
3.8%
- 2
 
3.8%
m 2
 
3.8%
2
 
3.8%
A 2
 
3.8%
3 1
 
1.9%
Other values (16) 16
30.8%
Punctuation
ValueCountFrequency (%)
1
100.0%

최종단의 크기 (final-stage size)
Text

MISSING

Distinct: 8
Distinct (%): 53.3%
Missing: 29
Missing (%): 65.9%
Memory size: 484.0 B

Length

Max length: 12
Median length: 4
Mean length: 4.5333333
Min length: 1

Characters and Unicode

Total characters: 68
Distinct characters: 22
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6
Unique (%): 40.0%

Sample

1st row: 3/4&quot;NPT
2nd row: 1.75
3rd row: -
4th row: 1.75
5th row: 1.75

Value | Count | Frequency (%)
1.75 | 7 | 46.7%
52 | 2 | 13.3%
3/4&quot;npt | 1 | 6.7%
(blank) | 1 | 6.7%
1-3/4“ | 1 | 6.7%
3/4&quot | 1 | 6.7%
45mm | 1 | 6.7%
50.8 | 1 | 6.7%

Most occurring characters

ValueCountFrequency (%)
5 11
16.2%
1 8
11.8%
. 8
11.8%
7 7
 
10.3%
4 4
 
5.9%
3 3
 
4.4%
/ 3
 
4.4%
m 2
 
2.9%
- 2
 
2.9%
; 2
 
2.9%
Other values (12) 18
26.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 37
54.4%
Other Punctuation 15
22.1%
Lowercase Letter 10
 
14.7%
Uppercase Letter 3
 
4.4%
Dash Punctuation 2
 
2.9%
Initial Punctuation 1
 
1.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 11
29.7%
1 8
21.6%
7 7
18.9%
4 4
 
10.8%
3 3
 
8.1%
2 2
 
5.4%
0 1
 
2.7%
8 1
 
2.7%
Lowercase Letter
ValueCountFrequency (%)
m 2
20.0%
t 2
20.0%
o 2
20.0%
u 2
20.0%
q 2
20.0%
Other Punctuation
ValueCountFrequency (%)
. 8
53.3%
/ 3
 
20.0%
; 2
 
13.3%
& 2
 
13.3%
Uppercase Letter
ValueCountFrequency (%)
N 1
33.3%
P 1
33.3%
T 1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 55
80.9%
Latin 13
 
19.1%

Most frequent character per script

Common
ValueCountFrequency (%)
5 11
20.0%
1 8
14.5%
. 8
14.5%
7 7
12.7%
4 4
 
7.3%
3 3
 
5.5%
/ 3
 
5.5%
- 2
 
3.6%
; 2
 
3.6%
& 2
 
3.6%
Other values (4) 5
9.1%
Latin
ValueCountFrequency (%)
m 2
15.4%
t 2
15.4%
o 2
15.4%
u 2
15.4%
q 2
15.4%
N 1
7.7%
P 1
7.7%
T 1
7.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 67
98.5%
Punctuation 1
 
1.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 11
16.4%
1 8
11.9%
. 8
11.9%
7 7
10.4%
4 4
 
6.0%
3 3
 
4.5%
/ 3
 
4.5%
m 2
 
3.0%
- 2
 
3.0%
; 2
 
3.0%
Other values (11) 17
25.4%
Punctuation
ValueCountFrequency (%)
1
100.0%

Correlations

Variable | 등록일자 | 등록자 | 프레임 | 모델명 | 구동기 종류 | 구동기 용량 | 흡입측 압력 | 토출용량 | 압축단수 | 윤활방식 | 냉각방식 | 첫번째 단의 크기 | 최종단의 크기
등록일자 | 1.000 | 0.973 | 0.000 | 0.928 | 0.580 | 0.573 | 0.581 | 0.942 | 1.000 | 0.973 | 0.781 | 0.521 | 0.740
등록자 | 0.973 | 1.000 | 0.887 | 0.889 | 0.000 | 0.879 | 0.916 | 0.971 | 0.415 | 0.912 | 0.928 | 0.778 | 0.650
프레임 | 0.000 | 0.887 | 1.000 | 0.922 | 0.944 | 0.963 | 0.952 | 0.985 | 0.754 | 0.678 | 0.966 | 1.000 | 1.000
모델명 | 0.928 | 0.889 | 0.922 | 1.000 | 0.605 | 0.853 | 0.964 | 0.924 | 0.911 | 0.908 | 0.930 | 0.944 | 0.972
구동기 종류 | 0.580 | 0.000 | 0.944 | 0.605 | 1.000 | 0.852 | 0.830 | 0.888 | 0.808 | 0.720 | 0.683 | 0.865 | 0.972
구동기 용량 | 0.573 | 0.879 | 0.963 | 0.853 | 0.852 | 1.000 | 0.985 | 0.979 | 0.796 | 0.984 | 0.941 | 0.947 | 0.940
흡입측 압력 | 0.581 | 0.916 | 0.952 | 0.964 | 0.830 | 0.985 | 1.000 | 0.982 | 0.913 | 0.880 | 0.918 | 1.000 | 1.000
토출용량 | 0.942 | 0.971 | 0.985 | 0.924 | 0.888 | 0.979 | 0.982 | 1.000 | 0.763 | 0.927 | 0.918 | 0.951 | 0.953
압축단수 | 1.000 | 0.415 | 0.754 | 0.911 | 0.808 | 0.796 | 0.913 | 0.763 | 1.000 | 0.962 | 0.635 | 0.931 | 0.994
윤활방식 | 0.973 | 0.912 | 0.678 | 0.908 | 0.720 | 0.984 | 0.880 | 0.927 | 0.962 | 1.000 | 0.943 | 0.865 | 1.000
냉각방식 | 0.781 | 0.928 | 0.966 | 0.930 | 0.683 | 0.941 | 0.918 | 0.918 | 0.635 | 0.943 | 1.000 | 0.886 | 0.839
첫번째 단의 크기 | 0.521 | 0.778 | 1.000 | 0.944 | 0.865 | 0.947 | 1.000 | 0.951 | 0.931 | 0.865 | 0.886 | 1.000 | 1.000
최종단의 크기 | 0.740 | 0.650 | 1.000 | 0.972 | 0.972 | 0.940 | 1.000 | 0.953 | 0.994 | 1.000 | 0.839 | 1.000 | 1.000

Variable | 구동기 용량 | 구동기 종류 | 프레임 | 냉각방식 | 모델명 | 압축단수
구동기 용량 | 1.000 | 0.546 | 0.768 | 0.716 | 0.553 | 0.607
구동기 종류 | 0.546 | 1.000 | 0.598 | 0.188 | 0.242 | 0.479
프레임 | 0.768 | 0.598 | 1.000 | 0.665 | 0.707 | 0.430
냉각방식 | 0.716 | 0.188 | 0.665 | 1.000 | 0.533 | 0.306
모델명 | 0.553 | 0.242 | 0.707 | 0.533 | 1.000 | 0.667
압축단수 | 0.607 | 0.479 | 0.430 | 0.306 | 0.667 | 1.000

Variable | 프레임 | 모델명 | 구동기 종류 | 구동기 용량 | 압축단수 | 냉각방식
프레임 | 1.000 | 0.707 | 0.598 | 0.768 | 0.430 | 0.665
모델명 | 0.707 | 1.000 | 0.242 | 0.553 | 0.667 | 0.533
구동기 종류 | 0.598 | 0.242 | 1.000 | 0.546 | 0.479 | 0.188
구동기 용량 | 0.768 | 0.553 | 0.546 | 1.000 | 0.607 | 0.716
압축단수 | 0.430 | 0.667 | 0.479 | 0.607 | 1.000 | 0.306
냉각방식 | 0.665 | 0.533 | 0.188 | 0.716 | 0.306 | 1.000
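
The report does not state which association measure produced these coefficients. One common choice for pairs of categorical columns is Cramér's V, and the sketch below shows that measure under this assumption. `cramers_v` is a hypothetical helper written with the standard library only; it omits any bias correction, so its values need not match the tables above.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length sequences of labels."""
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    joint = Counter(zip(xs, ys))
    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k == 0:
        return 0.0  # undefined when either variable is constant
    chi2 = 0.0
    for r, n_r in row_totals.items():
        for c, n_c in col_totals.items():
            expected = n_r * n_c / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * k))


# Missing entries are kept as their own "<NA>" label, mirroring how the
# categorical columns in this report list <NA> among their values.
frame = ["JGQ", "JGQ", "<NA>", "<NA>"]
model = ["JGQ2/4", "JGQ2/4", "<NA>", "<NA>"]
print(cramers_v(frame, model))  # 1.0
```

With 44 rows and many rare labels, a measure of this kind is easily pushed towards 1, which is one reason to read the high-correlation alerts cautiously.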

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
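
A nullity correlation of this kind can be sketched as the Pearson correlation between two presence indicators (1 where a value is present, 0 where it is missing). The helper below is an illustrative assumption, not the plotting library's code; `nullity_correlation` is a hypothetical name and `None` stands in for a missing cell.

```python
import math


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return float("nan")  # a column that is never (or always) missing
    return cov / math.sqrt(var_a * var_b)


# Two columns that are always missing together, and one with the opposite pattern.
pressure = ["4BAR", None, "12BAR", None]
capacity = ["250BAR", None, "75~400LPM", None]
other = [None, "x", None, "y"]
print(nullity_correlation(pressure, capacity))  # 1.0
print(nullity_correlation(pressure, other))     # -1.0
```

In this dataset, 흡입측 압력, 토출용량 and 윤활방식 each have 19 missing values, so a value near 1 between them would indicate that they tend to be empty in the same rows.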

Sample

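Index | 등록일자 | 등록자 | 프레임 | 모델명 | 구동기 종류 | 구동기 용량 | 흡입측 압력 | 토출용량 | 압축단수 | 윤활방식 | 냉각방식 | 첫번째 단의 크기 | 최종단의 크기
0 | 2020-11-30 | 22421 | <NA> | 250HP | <NA> | 250 | 4BAR | 250BAR | 4 | 급유식 | 공냉식 | <NA> | <NA>
1 | 2020-05-20 | 20033 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
2 | 2020-03-19 | A210111534 | <NA> | GEO-B250 | 왕복동식 | 250 | 0.3292Mpa | 925Nm3/hr | 4단 | Oil Pump 가동방식 | 공냉식 열교환기 | 2-1/2&quot; ANSI150lb RF | 3/4&quot;NPT
3 | 2020-03-18 | 20036 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
4 | 2020-03-20 | 10712 | LNG PUMP CASING VESSEL | AC34 164508211-2 | 잠수펌프 | 30HP 120HZ 460VAC | 12BAR | 75~400LPM | 1단1회전 | 없음(LNG 잠수 TYPE | LNG 냉열 | <NA> | <NA>
5 | 2020-03-11 | 20036 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
6 | 2020-03-11 | 20036 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
7 | 2020-02-05 | 20036 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
8 | 2020-01-20 | 20036 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
9 | 2020-01-20 | 20036 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>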

Index | 등록일자 | 등록자 | 프레임 | 모델명 | 구동기 종류 | 구동기 용량 | 흡입측 압력 | 토출용량 | 압축단수 | 윤활방식 | 냉각방식 | 첫번째 단의 크기 | 최종단의 크기
34 | 2015-12-21 | MGR0003914 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
35 | 2015-12-02 | A210113258 | Horizontal heavy duty | GEO-B-250 | <NA> | 250HP/185KW | 0.8~0.9Mpa(정압시설없음) | 1200.0N㎥/hr@0.85Mpa | 4단 | OIL PUMP 에 의한 강제윤활 | 공냉식 열교환기 | 80A | 3/4&quot;
36 | 2015-11-06 | MGR0005400 | CC250-001 | WPT-250HP-2C(2.5~4.0K) | motor type | 250Hp | 0.4 Mpa | 925 N㎥/hr | 4 Stage | 급유식 | 공냉식 | 215mm | 45mm
37 | 2015-06-26 | 8356 | JGQ | JGQ2/4 | Electric Motor | 250 | 4.5 | 881 | 4 | Oil Pump 에 의한 강제 윤활 | 공랭식(Air) | 8 | 1.75
38 | 2015-05-27 | C201090001 | JGQ | JGQ2/4 | Electric Motor | 250 | 4.5 | 900 | 4 | Oil Pump에 의한 강제윤활 | 공랭식(Air) | 8 | 1.75
39 | 2015-04-13 | A210112289 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
40 | 2015-01-26 | 7643 | W4000*L3080*H3968 | GEO-B11-250 | ELECTRIC MOTOR | 250 | 4.0KG | 925Nm3/hr | 4 | 강제윤활 | air | 280 | 52
41 | 2015-01-26 | A210111534 | 미)ARIEL JGQ | JGQ/2/4 | Electric Motor | 250HP | 4.5 ~ 6.5 | 1205Nm3/Hr @6.2barg | 4 | Force Feed | Air Cooled | 203.2 | 50.8
42 | 2015-01-20 | MGR0003914 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
43 | 2015-01-13 | MGR0003914 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>

Duplicate rows

Most frequently occurring

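Index | 등록일자 | 등록자 | 프레임 | 모델명 | 구동기 종류 | 구동기 용량 | 흡입측 압력 | 토출용량 | 압축단수 | 윤활방식 | 냉각방식 | 첫번째 단의 크기 | 최종단의 크기 | # duplicates
0 | 2020-01-20 | 20036 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2
1 | 2020-03-11 | 20036 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2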
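
A table like this can be rebuilt by counting identical rows. The sketch below is a minimal standard-library illustration and an assumption about the approach, not the profiler's code. `duplicate_groups` is a hypothetical helper, and the rows are shortened to three fields for readability.

```python
from collections import Counter


def duplicate_groups(rows):
    """Map each row that occurs more than once to its number of occurrences."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}


# Shortened rows: (등록일자, 등록자, 프레임); None stands for <NA>.
rows = [
    ("2020-01-20", "20036", None),
    ("2020-01-20", "20036", None),
    ("2020-03-11", "20036", None),
    ("2020-03-11", "20036", None),
    ("2020-05-20", "20033", None),
]
groups = duplicate_groups(rows)
print(len(groups))  # 2
```

Both duplicated rows here consist of a date, a registrant and otherwise empty fields, so they may be repeated submissions rather than distinct records.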