Overview

Dataset statistics

Number of variables5
Number of observations105
Missing cells129
Missing cells (%)24.6%
Duplicate rows1
Duplicate rows (%)1.0%
Total size in memory4.2 KiB
Average record size in memory41.3 B

Variable types

Categorical2
Text2
DateTime1

Dataset

Description고압가스 가스안전분야 검사,기술,시설 상세기준인 KGS CODE의 개정현황(분류, 주요내용, 코드명, 개정일자)을 공개하여 가스업계 종사자분들에게 도움이 되고자 제공하는 데이터입니다.
Author한국가스안전공사
URLhttps://www.data.go.kr/data/15091486/fileData.do

Alerts

Dataset has 1 (1.0%) duplicate rowsDuplicates
구분 is highly overall correlated with 제개정 분류High correlation
제개정 분류 is highly overall correlated with 구분High correlation
주요내용 has 43 (41.0%) missing valuesMissing
코 드 명 has 43 (41.0%) missing valuesMissing
개정일자 has 43 (41.0%) missing valuesMissing

Reproduction

Analysis started2023-12-11 22:49:56.260990
Analysis finished2023-12-11 22:49:56.743838
Duration0.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size972.0 B
고법
62 
<NA>
43 

Length

Max length4
Median length2
Mean length2.8190476
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고법
2nd row고법
3rd row고법
4th row고법
5th row고법

Common Values

ValueCountFrequency (%)
고법 62
59.0%
<NA> 43
41.0%

Length

2023-12-12T07:49:56.811203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T07:49:56.896860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고법 62
59.0%
na 43
41.0%

제개정 분류
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size972.0 B
개정
62 
<NA>
43 

Length

Max length4
Median length2
Mean length2.8190476
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개정
2nd row개정
3rd row개정
4th row개정
5th row개정

Common Values

ValueCountFrequency (%)
개정 62
59.0%
<NA> 43
41.0%

Length

2023-12-12T07:49:56.988348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T07:49:57.100208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개정 62
59.0%
na 43
41.0%

주요내용
Text

MISSING 

Distinct32
Distinct (%)51.6%
Missing43
Missing (%)41.0%
Memory size972.0 B
2023-12-12T07:49:57.336384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length208
Median length88
Mean length53.435484
Min length11

Characters and Unicode

Total characters3313
Distinct characters223
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)37.1%

Sample

1st rowKGS Code의 체계적인 운영을 위해, 5년 이상 개정 이력이 전무한 Code에 대한 유효성 검토 실시
2nd rowKGS Code의 체계적인 운영을 위해, 5년 이상 개정 이력이 전무한 Code에 대한 유효성 검토 실시
3rd rowKGS Code의 체계적인 운영을 위해, 5년 이상 개정 이력이 전무한 Code에 대한 유효성 검토 실시
4th row국민이 이해하기 쉬운 KGS Code(상세기준) 문화 정착을 위한 문구·용어 전면개편
5th row국민이 이해하기 쉬운 KGS Code(상세기준) 문화 정착을 위한 문구·용어 전면개편
ValueCountFrequency (%)
개정 20
 
2.7%
kgs 17
 
2.3%
명확화 16
 
2.2%
방호벽 15
 
2.0%
위한 15
 
2.0%
국민이 14
 
1.9%
문구·용어 14
 
1.9%
전면개편 14
 
1.9%
이해하기 14
 
1.9%
정착을 14
 
1.9%
Other values (210) 581
79.2%
2023-12-12T07:49:57.754028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
672
 
20.3%
84
 
2.5%
67
 
2.0%
56
 
1.7%
, 55
 
1.7%
50
 
1.5%
46
 
1.4%
45
 
1.4%
44
 
1.3%
43
 
1.3%
Other values (213) 2151
64.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2243
67.7%
Space Separator 672
 
20.3%
Uppercase Letter 104
 
3.1%
Other Punctuation 95
 
2.9%
Decimal Number 85
 
2.6%
Lowercase Letter 60
 
1.8%
Open Punctuation 27
 
0.8%
Close Punctuation 27
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
84
 
3.7%
67
 
3.0%
56
 
2.5%
50
 
2.2%
46
 
2.1%
45
 
2.0%
44
 
2.0%
43
 
1.9%
43
 
1.9%
41
 
1.8%
Other values (182) 1724
76.9%
Uppercase Letter
ValueCountFrequency (%)
S 26
25.0%
K 24
23.1%
C 22
21.2%
G 19
18.3%
I 2
 
1.9%
N 2
 
1.9%
F 2
 
1.9%
P 2
 
1.9%
A 1
 
1.0%
M 1
 
1.0%
Other values (3) 3
 
2.9%
Decimal Number
ValueCountFrequency (%)
2 34
40.0%
6 18
21.2%
0 9
 
10.6%
9 8
 
9.4%
4 8
 
9.4%
5 6
 
7.1%
1 2
 
2.4%
Other Punctuation
ValueCountFrequency (%)
, 55
57.9%
. 24
25.3%
· 14
 
14.7%
' 1
 
1.1%
1
 
1.1%
Lowercase Letter
ValueCountFrequency (%)
d 20
33.3%
e 20
33.3%
o 20
33.3%
Space Separator
ValueCountFrequency (%)
672
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2243
67.7%
Common 906
27.3%
Latin 164
 
5.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
84
 
3.7%
67
 
3.0%
56
 
2.5%
50
 
2.2%
46
 
2.1%
45
 
2.0%
44
 
2.0%
43
 
1.9%
43
 
1.9%
41
 
1.8%
Other values (182) 1724
76.9%
Latin
ValueCountFrequency (%)
S 26
15.9%
K 24
14.6%
C 22
13.4%
d 20
12.2%
e 20
12.2%
o 20
12.2%
G 19
11.6%
I 2
 
1.2%
N 2
 
1.2%
F 2
 
1.2%
Other values (6) 7
 
4.3%
Common
ValueCountFrequency (%)
672
74.2%
, 55
 
6.1%
2 34
 
3.8%
( 27
 
3.0%
) 27
 
3.0%
. 24
 
2.6%
6 18
 
2.0%
· 14
 
1.5%
0 9
 
1.0%
9 8
 
0.9%
Other values (5) 18
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2243
67.7%
ASCII 1055
31.8%
None 14
 
0.4%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
672
63.7%
, 55
 
5.2%
2 34
 
3.2%
( 27
 
2.6%
) 27
 
2.6%
S 26
 
2.5%
. 24
 
2.3%
K 24
 
2.3%
C 22
 
2.1%
d 20
 
1.9%
Other values (19) 124
 
11.8%
Hangul
ValueCountFrequency (%)
84
 
3.7%
67
 
3.0%
56
 
2.5%
50
 
2.2%
46
 
2.1%
45
 
2.0%
44
 
2.0%
43
 
1.9%
43
 
1.9%
41
 
1.8%
Other values (182) 1724
76.9%
None
ValueCountFrequency (%)
· 14
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

코 드 명
Text

MISSING 

Distinct41
Distinct (%)66.1%
Missing43
Missing (%)41.0%
Memory size972.0 B
2023-12-12T07:49:57.935952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters310
Distinct characters15
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)41.9%

Sample

1st rowAA314
2nd rowAA315
3rd rowAC312
4th rowAA211
5th rowAA212
ValueCountFrequency (%)
ac111 3
 
4.8%
fs112 3
 
4.8%
fu111 3
 
4.8%
fp112 3
 
4.8%
fp111 3
 
4.8%
fu211 3
 
4.8%
aa312 2
 
3.2%
aa318 2
 
3.2%
aa913 2
 
3.2%
fp217 2
 
3.2%
Other values (31) 36
58.1%
2023-12-12T07:49:58.217719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 102
32.9%
A 57
18.4%
2 37
 
11.9%
F 25
 
8.1%
3 21
 
6.8%
C 17
 
5.5%
P 13
 
4.2%
U 8
 
2.6%
4 7
 
2.3%
9 7
 
2.3%
Other values (5) 16
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 186
60.0%
Uppercase Letter 124
40.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 102
54.8%
2 37
 
19.9%
3 21
 
11.3%
4 7
 
3.8%
9 7
 
3.8%
6 3
 
1.6%
7 3
 
1.6%
8 3
 
1.6%
5 3
 
1.6%
Uppercase Letter
ValueCountFrequency (%)
A 57
46.0%
F 25
20.2%
C 17
 
13.7%
P 13
 
10.5%
U 8
 
6.5%
S 4
 
3.2%

Most occurring scripts

ValueCountFrequency (%)
Common 186
60.0%
Latin 124
40.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 102
54.8%
2 37
 
19.9%
3 21
 
11.3%
4 7
 
3.8%
9 7
 
3.8%
6 3
 
1.6%
7 3
 
1.6%
8 3
 
1.6%
5 3
 
1.6%
Latin
ValueCountFrequency (%)
A 57
46.0%
F 25
20.2%
C 17
 
13.7%
P 13
 
10.5%
U 8
 
6.5%
S 4
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 310
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 102
32.9%
A 57
18.4%
2 37
 
11.9%
F 25
 
8.1%
3 21
 
6.8%
C 17
 
5.5%
P 13
 
4.2%
U 8
 
2.6%
4 7
 
2.3%
9 7
 
2.3%
Other values (5) 16
 
5.2%

개정일자
Date

MISSING 

Distinct13
Distinct (%)21.0%
Missing43
Missing (%)41.0%
Memory size972.0 B
Minimum2022-03-28 00:00:00
Maximum2023-11-07 00:00:00
2023-12-12T07:49:58.329727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:49:58.448395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)

Correlations

2023-12-12T07:49:58.558309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주요내용코 드 명개정일자
주요내용1.0000.0000.985
코 드 명0.0001.0000.574
개정일자0.9850.5741.000
2023-12-12T07:49:58.637156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분제개정 분류
구분1.0001.000
제개정 분류1.0001.000
2023-12-12T07:49:58.706212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분제개정 분류
구분1.0001.000
제개정 분류1.0001.000

Missing values

2023-12-12T07:49:56.519414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T07:49:56.601021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T07:49:56.684550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분제개정 분류주요내용코 드 명개정일자
0고법개정KGS Code의 체계적인 운영을 위해, 5년 이상 개정 이력이 전무한 Code에 대한 유효성 검토 실시AA3142022-03-28
1고법개정KGS Code의 체계적인 운영을 위해, 5년 이상 개정 이력이 전무한 Code에 대한 유효성 검토 실시AA3152022-03-28
2고법개정KGS Code의 체계적인 운영을 위해, 5년 이상 개정 이력이 전무한 Code에 대한 유효성 검토 실시AC3122022-03-28
3고법개정국민이 이해하기 쉬운 KGS Code(상세기준) 문화 정착을 위한 문구·용어 전면개편AA2112022-06-14
4고법개정국민이 이해하기 쉬운 KGS Code(상세기준) 문화 정착을 위한 문구·용어 전면개편AA2122022-06-14
5고법개정국민이 이해하기 쉬운 KGS Code(상세기준) 문화 정착을 위한 문구·용어 전면개편AA2132022-06-14
6고법개정국민이 이해하기 쉬운 KGS Code(상세기준) 문화 정착을 위한 문구·용어 전면개편AA3112022-06-14
7고법개정국민이 이해하기 쉬운 KGS Code(상세기준) 문화 정착을 위한 문구·용어 전면개편AA3122022-06-14
8고법개정국민이 이해하기 쉬운 KGS Code(상세기준) 문화 정착을 위한 문구·용어 전면개편AA3132022-06-14
9고법개정과압안전장치 방출관 설치 위치 현실화FS1122022-06-14
구분제개정 분류주요내용코 드 명개정일자
95<NA><NA><NA><NA><NA>
96<NA><NA><NA><NA><NA>
97<NA><NA><NA><NA><NA>
98<NA><NA><NA><NA><NA>
99<NA><NA><NA><NA><NA>
100<NA><NA><NA><NA><NA>
101<NA><NA><NA><NA><NA>
102<NA><NA><NA><NA><NA>
103<NA><NA><NA><NA><NA>
104<NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

구분제개정 분류주요내용코 드 명개정일자# duplicates
0<NA><NA><NA><NA><NA>43