Overview

Dataset statistics

Number of variables5
Number of observations76
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory43.7 B

Variable types

Categorical1
Text2
Numeric2

Dataset

DescriptionKGS Code는 정부에서 승인한 가스안전분야 시설,검사,기술 상세기준으로, 한국가스안공사에서 보유하고 있는 액화석유가스 코드 현황(코드번호, 코드명, 최신버젼 등) 데이터입니다.
Author한국가스안전공사
URLhttps://www.data.go.kr/data/15091491/fileData.do

Alerts

내용쪽수 is highly overall correlated with 분류High correlation
분류 is highly overall correlated with 내용쪽수High correlation
코드번호 has unique valuesUnique
코드명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:23:56.680038
Analysis finished2023-12-12 08:23:57.633383
Duration0.95 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

분류
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size740.0 B
용품
58 
시설
18 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row용품
2nd row용품
3rd row용품
4th row용품
5th row용품

Common Values

ValueCountFrequency (%)
용품 58
76.3%
시설 18
 
23.7%

Length

2023-12-12T17:23:57.705677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:23:57.818976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
용품 58
76.3%
시설 18
 
23.7%

코드번호
Text

UNIQUE 

Distinct76
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size740.0 B
2023-12-12T17:23:58.033118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length9.0263158
Min length9

Characters and Unicode

Total characters686
Distinct characters20
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique76 ?
Unique (%)100.0%

Sample

1st rowKGS AA231
2nd rowKGS AA232
3rd rowKGS AA233
4th rowKGS AA234
5th rowKGS AA235
ValueCountFrequency (%)
kgs 76
49.7%
ab336 1
 
0.7%
ab934 1
 
0.7%
ab933 1
 
0.7%
ab932 1
 
0.7%
ab931 1
 
0.7%
ab341 1
 
0.7%
ab339 1
 
0.7%
ab338 1
 
0.7%
aa232 1
 
0.7%
Other values (68) 68
44.4%
2023-12-12T17:23:58.472849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 114
16.6%
A 93
13.6%
S 83
12.1%
G 78
11.4%
77
11.2%
K 76
11.1%
2 28
 
4.1%
4 24
 
3.5%
1 23
 
3.4%
B 23
 
3.4%
Other values (10) 67
9.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 381
55.5%
Decimal Number 228
33.2%
Space Separator 77
 
11.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 114
50.0%
2 28
 
12.3%
4 24
 
10.5%
1 23
 
10.1%
5 14
 
6.1%
6 9
 
3.9%
9 7
 
3.1%
7 4
 
1.8%
8 3
 
1.3%
0 2
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
A 93
24.4%
S 83
21.8%
G 78
20.5%
K 76
19.9%
B 23
 
6.0%
F 16
 
4.2%
U 6
 
1.6%
P 4
 
1.0%
C 2
 
0.5%
Space Separator
ValueCountFrequency (%)
77
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 381
55.5%
Common 305
44.5%

Most frequent character per script

Common
ValueCountFrequency (%)
3 114
37.4%
77
25.2%
2 28
 
9.2%
4 24
 
7.9%
1 23
 
7.5%
5 14
 
4.6%
6 9
 
3.0%
9 7
 
2.3%
7 4
 
1.3%
8 3
 
1.0%
Latin
ValueCountFrequency (%)
A 93
24.4%
S 83
21.8%
G 78
20.5%
K 76
19.9%
B 23
 
6.0%
F 16
 
4.2%
U 6
 
1.6%
P 4
 
1.0%
C 2
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 686
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 114
16.6%
A 93
13.6%
S 83
12.1%
G 78
11.4%
77
11.2%
K 76
11.1%
2 28
 
4.1%
4 24
 
3.5%
1 23
 
3.4%
B 23
 
3.4%
Other values (10) 67
9.8%

코드명
Text

UNIQUE 

Distinct76
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size740.0 B
2023-12-12T17:23:58.857860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length35.5
Mean length28.763158
Min length15

Characters and Unicode

Total characters2186
Distinct characters157
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique76 ?
Unique (%)100.0%

Sample

1st row가스용 전기절연이음관 제조의 시설ㆍ기술ㆍ검사 기준
2nd row가스용 전기융착폴리에틸렌이음관 제조의 시설ㆍ기술ㆍ검사 기준
3rd row가스용 이형질이음관 제조의 시설ㆍ기술ㆍ검사 기준
4th row가스용 퀵카플러 제조의 시설ㆍ기술ㆍ검사 기준
5th row액화석유가스용 세이프티커플링 제조의 시설ㆍ기술ㆍ검사 기준
ValueCountFrequency (%)
기준 76
18.9%
시설ㆍ기술ㆍ검사 62
15.4%
제조의 58
14.4%
액화석유가스 24
 
6.0%
가스용 14
 
3.5%
압력조정기 6
 
1.5%
시설ㆍ기술ㆍ검사ㆍ정밀안전진단ㆍ안전성평가 5
 
1.2%
의한 5
 
1.2%
밖의 5
 
1.2%
배관용 4
 
1.0%
Other values (106) 144
35.7%
2023-12-12T17:23:59.431854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
327
 
15.0%
192
 
8.8%
144
 
6.6%
86
 
3.9%
80
 
3.7%
79
 
3.6%
78
 
3.6%
76
 
3.5%
74
 
3.4%
74
 
3.4%
Other values (147) 976
44.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1840
84.2%
Space Separator 327
 
15.0%
Other Punctuation 17
 
0.8%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
192
 
10.4%
144
 
7.8%
86
 
4.7%
80
 
4.3%
79
 
4.3%
78
 
4.2%
76
 
4.1%
74
 
4.0%
74
 
4.0%
72
 
3.9%
Other values (142) 885
48.1%
Other Punctuation
ValueCountFrequency (%)
· 15
88.2%
? 2
 
11.8%
Space Separator
ValueCountFrequency (%)
327
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1840
84.2%
Common 346
 
15.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
192
 
10.4%
144
 
7.8%
86
 
4.7%
80
 
4.3%
79
 
4.3%
78
 
4.2%
76
 
4.1%
74
 
4.0%
74
 
4.0%
72
 
3.9%
Other values (142) 885
48.1%
Common
ValueCountFrequency (%)
327
94.5%
· 15
 
4.3%
? 2
 
0.6%
( 1
 
0.3%
) 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1696
77.6%
ASCII 331
 
15.1%
Compat Jamo 144
 
6.6%
None 15
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
327
98.8%
? 2
 
0.6%
( 1
 
0.3%
) 1
 
0.3%
Hangul
ValueCountFrequency (%)
192
 
11.3%
86
 
5.1%
80
 
4.7%
79
 
4.7%
78
 
4.6%
76
 
4.5%
74
 
4.4%
74
 
4.4%
72
 
4.2%
71
 
4.2%
Other values (141) 814
48.0%
Compat Jamo
ValueCountFrequency (%)
144
100.0%
None
ValueCountFrequency (%)
· 15
100.0%

최신버전
Real number (ℝ)

Distinct19
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean221102.62
Minimum170929
Maximum230825
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size816.0 B
2023-12-12T17:23:59.582127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum170929
5-th percentile188263.75
Q1220715
median221201
Q3230504
95-th percentile230825
Maximum230825
Range59896
Interquartile range (IQR)9789

Descriptive statistics

Standard deviation12319.697
Coefficient of variation (CV)0.055719361
Kurtosis6.3396638
Mean221102.62
Median Absolute Deviation (MAD)9105
Skewness-2.3991411
Sum16803799
Variance1.5177493 × 108
MonotonicityNot monotonic
2023-12-12T17:23:59.732721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
221201 10
13.2%
221012 10
13.2%
230504 9
11.8%
230825 9
11.8%
220715 8
10.5%
221230 5
6.6%
211118 4
 
5.3%
230306 4
 
5.3%
230614 3
 
3.9%
181213 3
 
3.9%
Other values (9) 11
14.5%
ValueCountFrequency (%)
170929 1
 
1.3%
181213 3
 
3.9%
190614 1
 
1.3%
210112 1
 
1.3%
210402 1
 
1.3%
211118 4
 
5.3%
220328 1
 
1.3%
220331 1
 
1.3%
220715 8
10.5%
221012 10
13.2%
ValueCountFrequency (%)
230825 9
11.8%
230703 2
 
2.6%
230614 3
 
3.9%
230504 9
11.8%
230331 1
 
1.3%
230306 4
 
5.3%
221230 5
6.6%
221201 10
13.2%
221104 2
 
2.6%
221012 10
13.2%

내용쪽수
Real number (ℝ)

HIGH CORRELATION 

Distinct42
Distinct (%)55.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean56.302632
Minimum15
Maximum165
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size816.0 B
2023-12-12T17:23:59.864586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile27.75
Q131
median38
Q354.75
95-th percentile153.25
Maximum165
Range150
Interquartile range (IQR)23.75

Descriptive statistics

Standard deviation40.615603
Coefficient of variation (CV)0.72138019
Kurtosis0.99247763
Mean56.302632
Median Absolute Deviation (MAD)8
Skewness1.559542
Sum4279
Variance1649.6272
MonotonicityNot monotonic
2023-12-12T17:24:00.017503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
31 8
 
10.5%
30 7
 
9.2%
38 5
 
6.6%
34 4
 
5.3%
33 3
 
3.9%
46 2
 
2.6%
51 2
 
2.6%
54 2
 
2.6%
45 2
 
2.6%
123 2
 
2.6%
Other values (32) 39
51.3%
ValueCountFrequency (%)
15 1
 
1.3%
16 1
 
1.3%
19 1
 
1.3%
27 1
 
1.3%
28 1
 
1.3%
29 1
 
1.3%
30 7
9.2%
31 8
10.5%
32 2
 
2.6%
33 3
 
3.9%
ValueCountFrequency (%)
165 1
1.3%
157 1
1.3%
155 1
1.3%
154 1
1.3%
153 1
1.3%
139 1
1.3%
137 1
1.3%
133 1
1.3%
132 1
1.3%
125 1
1.3%

Interactions

2023-12-12T17:23:57.219672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:23:57.002661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:23:57.339656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:23:57.111615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:24:00.156705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분류코드번호코드명최신버전내용쪽수
분류1.0001.0001.0000.1650.739
코드번호1.0001.0001.0001.0001.000
코드명1.0001.0001.0001.0001.000
최신버전0.1651.0001.0001.0000.000
내용쪽수0.7391.0001.0000.0001.000
2023-12-12T17:24:00.271136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
최신버전내용쪽수분류
최신버전1.0000.3500.171
내용쪽수0.3501.0000.671
분류0.1710.6711.000

Missing values

2023-12-12T17:23:57.475688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:23:57.591070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

분류코드번호코드명최신버전내용쪽수
0용품KGS AA231가스용 전기절연이음관 제조의 시설ㆍ기술ㆍ검사 기준22120133
1용품KGS AA232가스용 전기융착폴리에틸렌이음관 제조의 시설ㆍ기술ㆍ검사 기준22120129
2용품KGS AA233가스용 이형질이음관 제조의 시설ㆍ기술ㆍ검사 기준22120131
3용품KGS AA234가스용 퀵카플러 제조의 시설ㆍ기술ㆍ검사 기준22120131
4용품KGS AA235액화석유가스용 세이프티커플링 제조의 시설ㆍ기술ㆍ검사 기준22120128
5용품KGS AA236가스용 로딩암 제조의 시설ㆍ기술ㆍ검사 기준22071530
6용품KGS AA331그 밖의 배관용 밸브 제조의 시설ㆍ기술ㆍ검사 기준22101241
7용품KGS AA332매몰용접형 가스용 볼밸브 제조의 시설ㆍ기술ㆍ검사 기준22101236
8용품KGS AA333가스용 폴리에틸렌밸브 제조의 시설ㆍ기술ㆍ검사 기준22101231
9용품KGS AA334가스용 콕 제조의 시설ㆍ기술ㆍ검사 기준22101238
분류코드번호코드명최신버전내용쪽수
66시설KGS FS334액화석유가스 배관망공급 제조소 밖의 배관의 시설·기술·검사·정밀안전진단 기준230825153
67시설KGS FU331저장탱크에 의한 액화석유가스 저장소의 시설ㆍ기술ㆍ검사ㆍ정밀안전진단ㆍ안전성평가 기준230825154
68시설KGS FU332용기에 의한 액화석유가스 저장소의 시설ㆍ기술ㆍ검사 기준19061430
69시설KGS FU431용기에 의한 액화석유가스 사용시설의 시설ㆍ기술ㆍ검사 기준230614137
70시설KGS FU432소형저장탱크에 의한 액화석유가스 사용시설의 시설ㆍ기술ㆍ검사 기준230614155
71시설KGS FU433저장탱크에 의한 액화석유가스 사용시설의 시설ㆍ기술ㆍ검사 기준230614157
72시설KGS FU434액화석유가스 자동차 연료장치의 시설ㆍ기술ㆍ검사 기준21111819
73시설KGS GC231액화석유가스 안전성평가 기준21111815
74시설KGS GC232액화석유가스 배관망공급 시공감리 기준22123016
75용품KGS S AA012막음조치용 안전 퓨즈콕 제조의 시설?기술?검사 기준22033137