Overview

Dataset statistics

Number of variables4
Number of observations95
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory34.4 B

Variable types

Numeric1
Categorical2
Text1

Dataset

Description가스시설 검사를 위해 한국가스안전공사가 보유하고 있는 검사장비 리스트 현황(검사분야, 검사장비명, 장비관리자)에 관한 데이터입니다.
Author한국가스안전공사
URLhttps://www.data.go.kr/data/15001478/fileData.do

Alerts

No is highly overall correlated with 검사 분야 and 1 other fieldsHigh correlation
검사 분야 is highly overall correlated with No and 1 other fieldsHigh correlation
장비 관리자 is highly overall correlated with No and 1 other fieldsHigh correlation
No has unique valuesUnique
검사장비명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:56:46.011492
Analysis finished2023-12-12 05:56:46.395162
Duration0.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

No
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct95
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean48
Minimum1
Maximum95
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size987.0 B
2023-12-12T14:56:46.461140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.7
Q124.5
median48
Q371.5
95-th percentile90.3
Maximum95
Range94
Interquartile range (IQR)47

Descriptive statistics

Standard deviation27.568098
Coefficient of variation (CV)0.57433536
Kurtosis-1.2
Mean48
Median Absolute Deviation (MAD)24
Skewness0
Sum4560
Variance760
MonotonicityStrictly increasing
2023-12-12T14:56:46.581030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
2 1
 
1.1%
71 1
 
1.1%
70 1
 
1.1%
69 1
 
1.1%
68 1
 
1.1%
67 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
Other values (85) 85
89.5%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
95 1
1.1%
94 1
1.1%
93 1
1.1%
92 1
1.1%
91 1
1.1%
90 1
1.1%
89 1
1.1%
88 1
1.1%
87 1
1.1%
86 1
1.1%

검사 분야
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size892.0 B
시설분야
52 
기타공구
25 
용기특정설비분야
15 
비파괴분야
 
3

Length

Max length8
Median length4
Mean length4.6631579
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row용기특정설비분야
2nd row용기특정설비분야
3rd row용기특정설비분야
4th row용기특정설비분야
5th row용기특정설비분야

Common Values

ValueCountFrequency (%)
시설분야 52
54.7%
기타공구 25
26.3%
용기특정설비분야 15
 
15.8%
비파괴분야 3
 
3.2%

Length

2023-12-12T14:56:46.710812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:56:46.809581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
시설분야 52
54.7%
기타공구 25
26.3%
용기특정설비분야 15
 
15.8%
비파괴분야 3
 
3.2%

검사장비명
Text

UNIQUE 

Distinct95
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size892.0 B
2023-12-12T14:56:47.070366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length16
Mean length8.6105263
Min length2

Characters and Unicode

Total characters818
Distinct characters191
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)100.0%

Sample

1st row도막두께측정기
2nd row링및플러그게이지(CNG용기)
3rd row링및플러그게이지(LPG용기)
4th row마이크로메타(400이하)
5th row버어니어캘리퍼스
ValueCountFrequency (%)
도막두께측정기 1
 
1.1%
보링바 1
 
1.1%
필름판독기 1
 
1.1%
필름농도측정기 1
 
1.1%
필름감광도 1
 
1.1%
co/co₂측정장치 1
 
1.1%
회전계 1
 
1.1%
피치게이지 1
 
1.1%
풍압계 1
 
1.1%
절연저항측정기 1
 
1.1%
Other values (85) 85
89.5%
2023-12-12T14:56:47.888667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
75
 
9.2%
27
 
3.3%
( 26
 
3.2%
) 26
 
3.2%
25
 
3.1%
M 24
 
2.9%
23
 
2.8%
22
 
2.7%
21
 
2.6%
20
 
2.4%
Other values (181) 529
64.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 605
74.0%
Decimal Number 68
 
8.3%
Uppercase Letter 64
 
7.8%
Open Punctuation 26
 
3.2%
Close Punctuation 26
 
3.2%
Lowercase Letter 13
 
1.6%
Math Symbol 11
 
1.3%
Other Punctuation 4
 
0.5%
Other Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
75
 
12.4%
27
 
4.5%
25
 
4.1%
23
 
3.8%
22
 
3.6%
21
 
3.5%
20
 
3.3%
18
 
3.0%
16
 
2.6%
13
 
2.1%
Other values (157) 345
57.0%
Uppercase Letter
ValueCountFrequency (%)
M 24
37.5%
P 15
23.4%
D 11
17.2%
G 4
 
6.2%
C 3
 
4.7%
L 3
 
4.7%
O 2
 
3.1%
H 1
 
1.6%
N 1
 
1.6%
Decimal Number
ValueCountFrequency (%)
0 19
27.9%
1 15
22.1%
3 14
20.6%
6 11
16.2%
2 4
 
5.9%
5 4
 
5.9%
4 1
 
1.5%
Lowercase Letter
ValueCountFrequency (%)
a 12
92.3%
p 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
. 2
50.0%
/ 2
50.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Math Symbol
ValueCountFrequency (%)
~ 11
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 605
74.0%
Common 136
 
16.6%
Latin 77
 
9.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
75
 
12.4%
27
 
4.5%
25
 
4.1%
23
 
3.8%
22
 
3.6%
21
 
3.5%
20
 
3.3%
18
 
3.0%
16
 
2.6%
13
 
2.1%
Other values (157) 345
57.0%
Common
ValueCountFrequency (%)
( 26
19.1%
) 26
19.1%
0 19
14.0%
1 15
11.0%
3 14
10.3%
6 11
8.1%
~ 11
8.1%
2 4
 
2.9%
5 4
 
2.9%
. 2
 
1.5%
Other values (3) 4
 
2.9%
Latin
ValueCountFrequency (%)
M 24
31.2%
P 15
19.5%
a 12
15.6%
D 11
14.3%
G 4
 
5.2%
C 3
 
3.9%
L 3
 
3.9%
O 2
 
2.6%
H 1
 
1.3%
p 1
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 605
74.0%
ASCII 212
 
25.9%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
75
 
12.4%
27
 
4.5%
25
 
4.1%
23
 
3.8%
22
 
3.6%
21
 
3.5%
20
 
3.3%
18
 
3.0%
16
 
2.6%
13
 
2.1%
Other values (157) 345
57.0%
ASCII
ValueCountFrequency (%)
( 26
12.3%
) 26
12.3%
M 24
11.3%
0 19
9.0%
P 15
 
7.1%
1 15
 
7.1%
3 14
 
6.6%
a 12
 
5.7%
6 11
 
5.2%
~ 11
 
5.2%
Other values (13) 39
18.4%
None
ValueCountFrequency (%)
1
100.0%

장비 관리자
Categorical

HIGH CORRELATION 

Distinct20
Distinct (%)21.1%
Missing0
Missing (%)0.0%
Memory size892.0 B
장비 담당자
23 
가스용품 검사원
10 
용기 검사원
냉동제조시설 검사원
고법 시설 검사원
Other values (15)
38 

Length

Max length16
Median length12
Mean length8.6210526
Min length5

Unique

Unique7 ?
Unique (%)7.4%

Sample

1st row용기 검사원
2nd row용기 검사원
3rd row용기 검사원
4th row용기, 특정설비 검사원
5th row용기, 특정설비 검사원

Common Values

ValueCountFrequency (%)
장비 담당자 23
24.2%
가스용품 검사원 10
10.5%
용기 검사원 8
 
8.4%
냉동제조시설 검사원 8
 
8.4%
고법 시설 검사원 8
 
8.4%
용기, 특정설비 검사원 7
 
7.4%
도법 공급시설 검사원 6
 
6.3%
액법, 도법 시설 검사원 4
 
4.2%
PE융착기 성능확인 4
 
4.2%
고법, 액법 시설 검사원 3
 
3.2%
Other values (10) 14
14.7%

Length

2023-12-12T14:56:48.078099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
검사원 67
28.8%
장비 23
 
9.9%
담당자 23
 
9.9%
시설 21
 
9.0%
용기 16
 
6.9%
도법 14
 
6.0%
고법 13
 
5.6%
가스용품 10
 
4.3%
액법 10
 
4.3%
특정설비 8
 
3.4%
Other values (7) 28
12.0%

Interactions

2023-12-12T14:56:46.200645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:56:48.175286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
No검사 분야검사장비명장비 관리자
No1.0000.9071.0000.925
검사 분야0.9071.0001.0000.997
검사장비명1.0001.0001.0001.000
장비 관리자0.9250.9971.0001.000
2023-12-12T14:56:48.269966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
장비 관리자검사 분야
장비 관리자1.0000.837
검사 분야0.8371.000
2023-12-12T14:56:48.359860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
No검사 분야장비 관리자
No1.0000.7670.548
검사 분야0.7671.0000.837
장비 관리자0.5480.8371.000

Missing values

2023-12-12T14:56:46.286851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:56:46.364678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

No검사 분야검사장비명장비 관리자
01용기특정설비분야도막두께측정기용기 검사원
12용기특정설비분야링및플러그게이지(CNG용기)용기 검사원
23용기특정설비분야링및플러그게이지(LPG용기)용기 검사원
34용기특정설비분야마이크로메타(400이하)용기, 특정설비 검사원
45용기특정설비분야버어니어캘리퍼스용기, 특정설비 검사원
56용기특정설비분야복합재료두께측정기용기 검사원
67용기특정설비분야부식측정기(깊이게이지)용기, 특정설비 검사원
78용기특정설비분야용기밸브용링게이지(국내)용기 검사원
89용기특정설비분야용기파열시험기용기 검사원
910용기특정설비분야용기표면조도측정기용기 검사원
No검사 분야검사장비명장비 관리자
8586기타공구파이프렌치장비 담당자
8687기타공구몽키장비 담당자
8788기타공구스패너장비 담당자
8889기타공구바이스플라이어장비 담당자
8990기타공구바이스장비 담당자
9091기타공구동관확장기장비 담당자
9192기타공구공구박스장비 담당자
9293기타공구공구세트장비 담당자
9394기타공구쇠톱장비 담당자
9495기타공구무전기장비 담당자