Overview

Dataset statistics

Number of variables3
Number of observations245
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.1 KiB
Average record size in memory25.5 B

Variable types

Numeric1
DateTime1
Text1

Dataset

Description한국가스공사 "Gas Inform" 논문 현황 자료 데이터로 논문 제목과 발간일(연월일)의 항목을 제공하고 있습니다.
URLhttps://www.data.go.kr/data/15065812/fileData.do

Alerts

순번 has unique valuesUnique
제목 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:23:57.127251
Analysis finished2023-12-12 21:23:57.553571
Duration0.43 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct245
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean123
Minimum1
Maximum245
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-13T06:23:57.649900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.2
Q162
median123
Q3184
95-th percentile232.8
Maximum245
Range244
Interquartile range (IQR)122

Descriptive statistics

Standard deviation70.869599
Coefficient of variation (CV)0.5761756
Kurtosis-1.2
Mean123
Median Absolute Deviation (MAD)61
Skewness0
Sum30135
Variance5022.5
MonotonicityStrictly increasing
2023-12-13T06:23:57.782057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
155 1
 
0.4%
157 1
 
0.4%
158 1
 
0.4%
159 1
 
0.4%
160 1
 
0.4%
161 1
 
0.4%
162 1
 
0.4%
163 1
 
0.4%
164 1
 
0.4%
Other values (235) 235
95.9%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
245 1
0.4%
244 1
0.4%
243 1
0.4%
242 1
0.4%
241 1
0.4%
240 1
0.4%
239 1
0.4%
238 1
0.4%
237 1
0.4%
236 1
0.4%
Distinct174
Distinct (%)71.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum2019-01-25 00:00:00
Maximum2023-04-12 00:00:00
2023-12-13T06:23:57.911657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:23:58.070428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

제목
Text

UNIQUE 

Distinct245
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-13T06:23:58.383430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length39
Mean length27.995918
Min length10

Characters and Unicode

Total characters6859
Distinct characters449
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique245 ?
Unique (%)100.0%

Sample

1st row중국 동절기 수요약세로 국내 도입예정 LNG 해외 판매
2nd row월성3 한빛2 정지
3rd row독일 탈원전 이어 2038년 석탄화력발전 전면 중단 결정
4th row미국 LNG수출프로젝트 FERC 인허가 리스크 대두
5th row신고리원전 4호기 운영허가
ValueCountFrequency (%)
lng 56
 
3.3%
발표 27
 
1.6%
전망 25
 
1.5%
천연가스 22
 
1.3%
가격 17
 
1.0%
16
 
0.9%
유럽 16
 
0.9%
프로젝트 15
 
0.9%
계획 15
 
0.9%
중국 14
 
0.8%
Other values (919) 1482
86.9%
2023-12-13T06:23:58.810114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1609
 
23.5%
120
 
1.7%
93
 
1.4%
G 80
 
1.2%
79
 
1.2%
0 75
 
1.1%
N 75
 
1.1%
71
 
1.0%
70
 
1.0%
L 68
 
1.0%
Other values (439) 4519
65.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3967
57.8%
Space Separator 1609
23.5%
Uppercase Letter 558
 
8.1%
Lowercase Letter 390
 
5.7%
Decimal Number 256
 
3.7%
Other Punctuation 40
 
0.6%
Dash Punctuation 11
 
0.2%
Math Symbol 9
 
0.1%
Close Punctuation 7
 
0.1%
Open Punctuation 7
 
0.1%
Other values (3) 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
120
 
3.0%
93
 
2.3%
79
 
2.0%
71
 
1.8%
70
 
1.8%
64
 
1.6%
62
 
1.6%
62
 
1.6%
61
 
1.5%
55
 
1.4%
Other values (364) 3230
81.4%
Uppercase Letter
ValueCountFrequency (%)
G 80
14.3%
N 75
13.4%
L 68
12.2%
E 54
9.7%
C 41
 
7.3%
P 35
 
6.3%
S 22
 
3.9%
A 22
 
3.9%
U 21
 
3.8%
O 18
 
3.2%
Other values (15) 122
21.9%
Lowercase Letter
ValueCountFrequency (%)
e 49
12.6%
a 42
10.8%
r 39
10.0%
o 38
9.7%
i 28
 
7.2%
t 26
 
6.7%
n 26
 
6.7%
l 22
 
5.6%
h 14
 
3.6%
s 14
 
3.6%
Other values (14) 92
23.6%
Decimal Number
ValueCountFrequency (%)
0 75
29.3%
2 62
24.2%
1 30
 
11.7%
5 26
 
10.2%
3 19
 
7.4%
4 16
 
6.2%
9 11
 
4.3%
7 9
 
3.5%
6 5
 
2.0%
8 3
 
1.2%
Other Punctuation
ValueCountFrequency (%)
% 13
32.5%
. 10
25.0%
' 9
22.5%
· 4
 
10.0%
& 3
 
7.5%
/ 1
 
2.5%
Math Symbol
ValueCountFrequency (%)
+ 5
55.6%
~ 2
 
22.2%
2
 
22.2%
Space Separator
ValueCountFrequency (%)
1609
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3966
57.8%
Common 1944
28.3%
Latin 948
 
13.8%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
120
 
3.0%
93
 
2.3%
79
 
2.0%
71
 
1.8%
70
 
1.8%
64
 
1.6%
62
 
1.6%
62
 
1.6%
61
 
1.5%
55
 
1.4%
Other values (363) 3229
81.4%
Latin
ValueCountFrequency (%)
G 80
 
8.4%
N 75
 
7.9%
L 68
 
7.2%
E 54
 
5.7%
e 49
 
5.2%
a 42
 
4.4%
C 41
 
4.3%
r 39
 
4.1%
o 38
 
4.0%
P 35
 
3.7%
Other values (39) 427
45.0%
Common
ValueCountFrequency (%)
1609
82.8%
0 75
 
3.9%
2 62
 
3.2%
1 30
 
1.5%
5 26
 
1.3%
3 19
 
1.0%
4 16
 
0.8%
% 13
 
0.7%
9 11
 
0.6%
- 11
 
0.6%
Other values (16) 72
 
3.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3964
57.8%
ASCII 2885
42.1%
None 4
 
0.1%
Compat Jamo 2
 
< 0.1%
Arrows 2
 
< 0.1%
Punctuation 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1609
55.8%
G 80
 
2.8%
0 75
 
2.6%
N 75
 
2.6%
L 68
 
2.4%
2 62
 
2.1%
E 54
 
1.9%
e 49
 
1.7%
a 42
 
1.5%
C 41
 
1.4%
Other values (62) 730
25.3%
Hangul
ValueCountFrequency (%)
120
 
3.0%
93
 
2.3%
79
 
2.0%
71
 
1.8%
70
 
1.8%
64
 
1.6%
62
 
1.6%
62
 
1.6%
61
 
1.5%
55
 
1.4%
Other values (362) 3227
81.4%
None
ValueCountFrequency (%)
· 4
100.0%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Arrows
ValueCountFrequency (%)
2
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-13T06:23:57.353565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T06:23:57.452483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:23:57.524549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번발간일제목
012019-01-25중국 동절기 수요약세로 국내 도입예정 LNG 해외 판매
122019-01-25월성3 한빛2 정지
232019-01-30독일 탈원전 이어 2038년 석탄화력발전 전면 중단 결정
342019-01-30미국 LNG수출프로젝트 FERC 인허가 리스크 대두
452019-02-07신고리원전 4호기 운영허가
562019-02-14Golden Pass LNG FID 결정
672019-02-19모잠비크 Area1 LNG프로젝트 연이은 장기공급계약 체결로 FID임박
782019-02-19독일 탈원전탈석탄 결정 이후 LNG 터미널 건설 논의
892019-02-27네덜란드 가스 순수입국으로 전환
9102019-02-27美 FERC 위원회 Calcasieu Pass LNG 프로젝트 최종 인허가 결정
순번발간일제목
2352362022-11-30러시아산 원유 금수 및 가격 상한제 도입 영향
2362372022-12-20유럽 천연가스 가격 상한제 최종 합의
2372382023-01-18인도 수소산업 강화위해 20억 달러 규모 인센티브 계획
2382392023-02-09EU 그린딜 산업 계획 발표
2392402023-02-23러시아산 석유제품 가격상한제 시행 및 영향
2402412023-03-06유럽 배출권거래제 가격 톤당 100유로 첫 돌파
2412422023-03-17산업부 세계 최초 수소발전 입찰시장 개설 예정
2422432023-03-23EU 집행위 탄소중립산업법 핵심원자재법 초안 발표
2432442023-04-06OPEC+ 석유시장 안정을 위한 자발적 감산 선언
2442452023-04-12일본 수소기본전략 개정 내용 발표 예정