Overview

Dataset statistics

Number of variables5
Number of observations151
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.2 KiB
Average record size in memory41.9 B

Variable types

Text2
Numeric1
Categorical2

Dataset

Description수소유통 전담기관에서 조사한 충전소별 현재 판매단가와 충전소별 할인정보(할인여부 및 할인 상세 내용)를 제공합니다.
Author한국가스공사
URLhttps://www.data.go.kr/data/15102841/fileData.do

Alerts

할인 상세 내용 is highly overall correlated with 할인여부High correlation
할인여부 is highly overall correlated with 할인 상세 내용High correlation
할인여부 is highly imbalanced (89.8%)Imbalance
할인 상세 내용 is highly imbalanced (91.4%)Imbalance
충전소 코드 has unique valuesUnique
충전소 명 has unique valuesUnique

Reproduction

Analysis started2024-04-06 09:02:04.351222
Analysis finished2024-04-06 09:02:05.199303
Duration0.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

충전소 코드
Text

UNIQUE 

Distinct151
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2024-04-06T18:02:05.443562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length19
Mean length19
Min length19

Characters and Unicode

Total characters2869
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique151 ?
Unique (%)100.0%

Sample

1st row2920020121HS2014001
2nd row4480020121HS2015001
3rd row3114020121HS2017001
4th row4812120121HS2017002
5th row2771020121HS2017003
ValueCountFrequency (%)
2920020121hs2014001 1
 
0.7%
4686020121hs2022035 1
 
0.7%
4372020121hs2022018 1
 
0.7%
4157020121hs2022019 1
 
0.7%
2671020121hs2022020 1
 
0.7%
2917020121hs2022021 1
 
0.7%
4281020121hs2022022 1
 
0.7%
4128120121hs2022023 1
 
0.7%
4623020121hs2022024 1
 
0.7%
1150020121hs2022025 1
 
0.7%
Other values (141) 141
93.4%
2024-04-06T18:02:05.947472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 757
26.4%
0 679
23.7%
1 587
20.5%
4 161
 
5.6%
H 151
 
5.3%
S 151
 
5.3%
3 123
 
4.3%
5 65
 
2.3%
7 60
 
2.1%
8 49
 
1.7%
Other values (2) 86
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2567
89.5%
Uppercase Letter 302
 
10.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 757
29.5%
0 679
26.5%
1 587
22.9%
4 161
 
6.3%
3 123
 
4.8%
5 65
 
2.5%
7 60
 
2.3%
8 49
 
1.9%
9 45
 
1.8%
6 41
 
1.6%
Uppercase Letter
ValueCountFrequency (%)
H 151
50.0%
S 151
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2567
89.5%
Latin 302
 
10.5%

Most frequent character per script

Common
ValueCountFrequency (%)
2 757
29.5%
0 679
26.5%
1 587
22.9%
4 161
 
6.3%
3 123
 
4.8%
5 65
 
2.5%
7 60
 
2.3%
8 49
 
1.9%
9 45
 
1.8%
6 41
 
1.6%
Latin
ValueCountFrequency (%)
H 151
50.0%
S 151
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2869
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 757
26.4%
0 679
23.7%
1 587
20.5%
4 161
 
5.6%
H 151
 
5.3%
S 151
 
5.3%
3 123
 
4.3%
5 65
 
2.3%
7 60
 
2.1%
8 49
 
1.7%
Other values (2) 86
 
3.0%

충전소 명
Text

UNIQUE 

Distinct151
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2024-04-06T18:02:06.353712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length20
Mean length12.695364
Min length6

Characters and Unicode

Total characters1917
Distinct characters209
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique151 ?
Unique (%)100.0%

Sample

1st row광주 진곡수소충전소
2nd row내포수소충전소
3rd row옥동LPG 수소 복합충전소
4th row창원팔룡 수소충전소
5th row대구 주행시험장 수소충전소
ValueCountFrequency (%)
수소충전소 99
27.7%
하이넷 41
 
11.5%
광주 4
 
1.1%
충전소 4
 
1.1%
서울특별시 3
 
0.8%
e1 3
 
0.8%
수소 3
 
0.8%
린데 3
 
0.8%
서울 2
 
0.6%
버스 2
 
0.6%
Other values (188) 193
54.1%
2024-04-06T18:02:07.035606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
321
16.7%
212
 
11.1%
158
 
8.2%
153
 
8.0%
151
 
7.9%
49
 
2.6%
49
 
2.6%
42
 
2.2%
27
 
1.4%
) 27
 
1.4%
Other values (199) 728
38.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1587
82.8%
Space Separator 212
 
11.1%
Uppercase Letter 36
 
1.9%
Close Punctuation 27
 
1.4%
Open Punctuation 27
 
1.4%
Lowercase Letter 12
 
0.6%
Decimal Number 11
 
0.6%
Other Punctuation 5
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
321
20.2%
158
 
10.0%
153
 
9.6%
151
 
9.5%
49
 
3.1%
49
 
3.1%
42
 
2.6%
27
 
1.7%
26
 
1.6%
26
 
1.6%
Other values (173) 585
36.9%
Uppercase Letter
ValueCountFrequency (%)
H 13
36.1%
E 4
 
11.1%
S 3
 
8.3%
G 3
 
8.3%
P 3
 
8.3%
K 2
 
5.6%
L 2
 
5.6%
T 2
 
5.6%
M 1
 
2.8%
C 1
 
2.8%
Other values (2) 2
 
5.6%
Lowercase Letter
ValueCountFrequency (%)
o 2
16.7%
n 2
16.7%
i 2
16.7%
t 2
16.7%
g 1
8.3%
a 1
8.3%
v 1
8.3%
e 1
8.3%
Decimal Number
ValueCountFrequency (%)
1 6
54.5%
2 5
45.5%
Space Separator
ValueCountFrequency (%)
212
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1587
82.8%
Common 282
 
14.7%
Latin 48
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
321
20.2%
158
 
10.0%
153
 
9.6%
151
 
9.5%
49
 
3.1%
49
 
3.1%
42
 
2.6%
27
 
1.7%
26
 
1.6%
26
 
1.6%
Other values (173) 585
36.9%
Latin
ValueCountFrequency (%)
H 13
27.1%
E 4
 
8.3%
S 3
 
6.2%
G 3
 
6.2%
P 3
 
6.2%
K 2
 
4.2%
o 2
 
4.2%
n 2
 
4.2%
i 2
 
4.2%
L 2
 
4.2%
Other values (10) 12
25.0%
Common
ValueCountFrequency (%)
212
75.2%
) 27
 
9.6%
( 27
 
9.6%
1 6
 
2.1%
/ 5
 
1.8%
2 5
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1587
82.8%
ASCII 330
 
17.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
321
20.2%
158
 
10.0%
153
 
9.6%
151
 
9.5%
49
 
3.1%
49
 
3.1%
42
 
2.6%
27
 
1.7%
26
 
1.6%
26
 
1.6%
Other values (173) 585
36.9%
ASCII
ValueCountFrequency (%)
212
64.2%
) 27
 
8.2%
( 27
 
8.2%
H 13
 
3.9%
1 6
 
1.8%
/ 5
 
1.5%
2 5
 
1.5%
E 4
 
1.2%
S 3
 
0.9%
G 3
 
0.9%
Other values (16) 25
 
7.6%

판매단가(vat포함)
Real number (ℝ)

Distinct22
Distinct (%)14.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9692.4503
Minimum7700
Maximum12400
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2024-04-06T18:02:07.308459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7700
5-th percentile8500
Q19400
median9900
Q39900
95-th percentile10400
Maximum12400
Range4700
Interquartile range (IQR)500

Descriptive statistics

Standard deviation703.47231
Coefficient of variation (CV)0.072579408
Kurtosis4.4306509
Mean9692.4503
Median Absolute Deviation (MAD)0
Skewness0.23764607
Sum1463560
Variance494873.29
MonotonicityNot monotonic
2024-04-06T18:02:07.530145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
9900 83
55.0%
9400 15
 
9.9%
9100 6
 
4.0%
8500 6
 
4.0%
8800 6
 
4.0%
10400 5
 
3.3%
9700 5
 
3.3%
9600 4
 
2.6%
7700 3
 
2.0%
9500 3
 
2.0%
Other values (12) 15
 
9.9%
ValueCountFrequency (%)
7700 3
 
2.0%
7800 1
 
0.7%
7900 1
 
0.7%
8300 2
 
1.3%
8500 6
 
4.0%
8800 6
 
4.0%
9100 6
 
4.0%
9200 1
 
0.7%
9400 15
9.9%
9500 3
 
2.0%
ValueCountFrequency (%)
12400 2
 
1.3%
12100 1
 
0.7%
11900 1
 
0.7%
10900 1
 
0.7%
10600 1
 
0.7%
10560 1
 
0.7%
10400 5
 
3.3%
10000 1
 
0.7%
9900 83
55.0%
9800 2
 
1.3%

할인여부
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
x
149 
o
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowx
2nd rowx
3rd rowx
4th rowx
5th rowx

Common Values

ValueCountFrequency (%)
x 149
98.7%
o 2
 
1.3%

Length

2024-04-06T18:02:07.759097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:02:07.956730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
x 149
98.7%
o 2
 
1.3%

할인 상세 내용
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
없음
148 
셀프충전시 9,400원
 
1
중구민 30%할인(23.09까지)
 
1
버스 11,000원
 
1

Length

Max length18
Median length2
Mean length2.2251656
Min length2

Unique

Unique3 ?
Unique (%)2.0%

Sample

1st row없음
2nd row없음
3rd row없음
4th row없음
5th row없음

Common Values

ValueCountFrequency (%)
없음 148
98.0%
셀프충전시 9,400원 1
 
0.7%
중구민 30%할인(23.09까지) 1
 
0.7%
버스 11,000원 1
 
0.7%

Length

2024-04-06T18:02:08.166939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:02:08.382468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
없음 148
96.1%
셀프충전시 1
 
0.6%
9,400원 1
 
0.6%
중구민 1
 
0.6%
30%할인(23.09까지 1
 
0.6%
버스 1
 
0.6%
11,000원 1
 
0.6%

Interactions

2024-04-06T18:02:04.691493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T18:02:08.523505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
판매단가(vat포함)할인여부할인 상세 내용
판매단가(vat포함)1.0000.0000.000
할인여부0.0001.0001.000
할인 상세 내용0.0001.0001.000
2024-04-06T18:02:08.695749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
할인 상세 내용할인여부
할인 상세 내용1.0000.993
할인여부0.9931.000
2024-04-06T18:02:08.864273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
판매단가(vat포함)할인여부할인 상세 내용
판매단가(vat포함)1.0000.0000.000
할인여부0.0001.0000.993
할인 상세 내용0.0000.9931.000

Missing values

2024-04-06T18:02:04.923251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T18:02:05.099662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

충전소 코드충전소 명판매단가(vat포함)할인여부할인 상세 내용
02920020121HS2014001광주 진곡수소충전소9100x없음
14480020121HS2015001내포수소충전소9900x없음
23114020121HS2017001옥동LPG 수소 복합충전소8500x없음
34812120121HS2017002창원팔룡 수소충전소9400x없음
42771020121HS2017003대구 주행시험장 수소충전소9900x없음
52920020121HS2018001광주 동곡수소충전소9100x없음
63120020121HS2018002경동수소복합충전소8500x없음
74812320121HS2018003창원성주수소충전소9400x없음
83171020121HS2019001신일복합 수소충전소8500x없음
94155020121HS2019002H안성휴게소 수소충전소 (서울 상행)10400x없음
충전소 코드충전소 명판매단가(vat포함)할인여부할인 상세 내용
1414812720121HS2023009마산자유무역지역 수소충전소9400x없음
1424215020121HS2022056하이넷 강릉시청 수소충전소9900x없음
1434155020121HS2023010안성맞춤(제천)휴게소 수소충전소12400x없음
1444427020121HS2023011현대제철 H수소충전소9900x없음
1453114020121HS2023012울산상개 SK수소충전소8500x없음
1464143020121HS2023013하이넷 의왕왕곡 수소충전소9900x없음
1474146320121HS2023014기흥휴게소(부산방향) 수소충전소10400x없음
1483011020121HS2023015하이넷 대전삼정 수소충전소9900x없음
1494418020121HS2023016보령1호 수소충전소9900x없음
1504580020121HS2023017부안 곰소 수소충전소9500x없음