Overview

Dataset statistics

Number of variables3
Number of observations246
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.9 KiB
Average record size in memory24.5 B

Variable types

Categorical2
Text1

Dataset

Description본 데이터는 관리원 내 시험장비(가스크로마토그래피, 고성능액체크로마토그래피, 황분시험기, 동점도시험기 등)의 유종별 품명을 포함하고 있는 데이터입니다.
Author한국석유관리원
URLhttps://www.data.go.kr/data/15090322/fileData.do

Reproduction

Analysis started2023-12-12 12:22:53.807964
Analysis finished2023-12-12 12:22:54.277814
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

분야별
Categorical

Distinct10
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
연료유
85 
윤활유
57 
성능평가
35 
보조
24 
공용
20 
Other values (5)
25 

Length

Max length6
Median length3
Mean length3.1056911
Min length2

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row성능평가
2nd row성능평가
3rd row성능평가
4th row윤활유
5th row윤활유

Common Values

ValueCountFrequency (%)
연료유 85
34.6%
윤활유 57
23.2%
성능평가 35
14.2%
보조 24
 
9.8%
공용 20
 
8.1%
석유대체연료 13
 
5.3%
부속 5
 
2.0%
항공유 3
 
1.2%
정제유 3
 
1.2%
토양분석 1
 
0.4%

Length

2023-12-12T21:22:54.392626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:22:54.615421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
연료유 85
34.6%
윤활유 57
23.2%
성능평가 35
14.2%
보조 24
 
9.8%
공용 20
 
8.1%
석유대체연료 13
 
5.3%
부속 5
 
2.0%
항공유 3
 
1.2%
정제유 3
 
1.2%
토양분석 1
 
0.4%
Distinct245
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T21:22:54.980208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length21
Mean length10.361789
Min length2

Characters and Unicode

Total characters2549
Distinct characters306
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique244 ?
Unique (%)99.2%

Sample

1st rowEURO6 차대동력계 업그레이드
2nd row차대동력계WLTP평가시스템-Horiba
3rd rowAC동력계(AVL)
4th rowASTM 및 Saybolt 색도계
5th rowASTM색도계
ValueCountFrequency (%)
시험기 6
 
1.9%
4
 
1.3%
euro6 3
 
1.0%
배출가스 3
 
1.0%
가스크로마토그래피(등경유분 2
 
0.6%
차대동력계 2
 
0.6%
분석기 2
 
0.6%
황분시험기(uv 2
 
0.6%
측정 2
 
0.6%
연소해석기 2
 
0.6%
Other values (281) 286
91.1%
2023-12-12T21:22:55.536944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
149
 
5.8%
101
 
4.0%
92
 
3.6%
) 80
 
3.1%
( 80
 
3.1%
70
 
2.7%
62
 
2.4%
59
 
2.3%
56
 
2.2%
47
 
1.8%
Other values (296) 1753
68.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1929
75.7%
Uppercase Letter 214
 
8.4%
Lowercase Letter 125
 
4.9%
Close Punctuation 80
 
3.1%
Open Punctuation 80
 
3.1%
Space Separator 70
 
2.7%
Decimal Number 20
 
0.8%
Other Punctuation 16
 
0.6%
Dash Punctuation 14
 
0.5%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
149
 
7.7%
101
 
5.2%
92
 
4.8%
62
 
3.2%
59
 
3.1%
56
 
2.9%
47
 
2.4%
41
 
2.1%
37
 
1.9%
36
 
1.9%
Other values (238) 1249
64.7%
Uppercase Letter
ValueCountFrequency (%)
P 22
 
10.3%
C 18
 
8.4%
G 18
 
8.4%
L 17
 
7.9%
M 16
 
7.5%
A 13
 
6.1%
E 12
 
5.6%
R 11
 
5.1%
D 11
 
5.1%
S 11
 
5.1%
Other values (11) 65
30.4%
Lowercase Letter
ValueCountFrequency (%)
r 17
13.6%
a 14
11.2%
e 13
10.4%
o 11
8.8%
l 9
 
7.2%
t 8
 
6.4%
s 8
 
6.4%
y 6
 
4.8%
u 6
 
4.8%
c 6
 
4.8%
Other values (10) 27
21.6%
Decimal Number
ValueCountFrequency (%)
6 5
25.0%
2 5
25.0%
1 2
 
10.0%
8 2
 
10.0%
0 2
 
10.0%
4 2
 
10.0%
5 1
 
5.0%
3 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 6
37.5%
/ 5
31.2%
& 4
25.0%
. 1
 
6.2%
Close Punctuation
ValueCountFrequency (%)
) 80
100.0%
Open Punctuation
ValueCountFrequency (%)
( 80
100.0%
Space Separator
ValueCountFrequency (%)
70
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1929
75.7%
Latin 340
 
13.3%
Common 280
 
11.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
149
 
7.7%
101
 
5.2%
92
 
4.8%
62
 
3.2%
59
 
3.1%
56
 
2.9%
47
 
2.4%
41
 
2.1%
37
 
1.9%
36
 
1.9%
Other values (238) 1249
64.7%
Latin
ValueCountFrequency (%)
P 22
 
6.5%
C 18
 
5.3%
G 18
 
5.3%
r 17
 
5.0%
L 17
 
5.0%
M 16
 
4.7%
a 14
 
4.1%
e 13
 
3.8%
A 13
 
3.8%
E 12
 
3.5%
Other values (32) 180
52.9%
Common
ValueCountFrequency (%)
) 80
28.6%
( 80
28.6%
70
25.0%
- 14
 
5.0%
, 6
 
2.1%
6 5
 
1.8%
/ 5
 
1.8%
2 5
 
1.8%
& 4
 
1.4%
1 2
 
0.7%
Other values (6) 9
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1929
75.7%
ASCII 619
 
24.3%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
149
 
7.7%
101
 
5.2%
92
 
4.8%
62
 
3.2%
59
 
3.1%
56
 
2.9%
47
 
2.4%
41
 
2.1%
37
 
1.9%
36
 
1.9%
Other values (238) 1249
64.7%
ASCII
ValueCountFrequency (%)
) 80
 
12.9%
( 80
 
12.9%
70
 
11.3%
P 22
 
3.6%
C 18
 
2.9%
G 18
 
2.9%
r 17
 
2.7%
L 17
 
2.7%
M 16
 
2.6%
a 14
 
2.3%
Other values (47) 267
43.1%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct7
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
6
96 
12
77 
측정시
42 
3
24 
측정일
 
4
Other values (2)
 
3

Length

Max length3
Median length2
Mean length1.695122
Min length1

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row12
2nd row12
3rd row12
4th row측정시
5th row6

Common Values

ValueCountFrequency (%)
6 96
39.0%
12 77
31.3%
측정시 42
17.1%
3 24
 
9.8%
측정일 4
 
1.6%
24 2
 
0.8%
5 1
 
0.4%

Length

2023-12-12T21:22:55.739272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:22:55.902638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6 96
39.0%
12 77
31.3%
측정시 42
17.1%
3 24
 
9.8%
측정일 4
 
1.6%
24 2
 
0.8%
5 1
 
0.4%

Correlations

2023-12-12T21:22:56.000389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분야별점검주기(개월)
분야별1.0000.443
점검주기(개월)0.4431.000
2023-12-12T21:22:56.099181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분야별점검주기(개월)
분야별1.0000.241
점검주기(개월)0.2411.000
2023-12-12T21:22:56.188359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분야별점검주기(개월)
분야별1.0000.241
점검주기(개월)0.2411.000

Missing values

2023-12-12T21:22:54.111384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:22:54.234107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

분야별장비명점검주기(개월)
0성능평가EURO6 차대동력계 업그레이드12
1성능평가차대동력계WLTP평가시스템-Horiba12
2성능평가AC동력계(AVL)12
3윤활유ASTM 및 Saybolt 색도계측정시
4윤활유ASTM색도계6
5부속DME-LPG혼합연료차량적용평가 아반테LPI하이브리드6
6성능평가EC동력계(AVL)12
7성능평가EURO6 대응 차대동력계용 냉각팬12
8성능평가EURO6 차량 배출가스 분석기12
9성능평가EuroⅥ엔진동력계용12
분야별장비명점검주기(개월)
236연료유황분시험기(UV, LPG용)12
237연료유황분시험기(WD-XR)3
238연료유황분시험기(X-ray)3
239연료유황분시험기(가스연료용)3
240윤활유황분시험기(고온법)6
241석유대체연료황화수소(H2S) 분석기6
242윤활유회전 점도계6
243윤활유회전봄베식산화안정도6
244보조휘발성유기화합물측정기기12
245보조히팅멘틀12