Overview

Dataset statistics

Number of variables6
Number of observations7727
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory369.9 KiB
Average record size in memory49.0 B

Variable types

Numeric1
DateTime1
Categorical2
Text2

Dataset

Description전국 주유소 현황 (전국 주유소 신규 등록 , 휴업, 폐업 현황 정보)에 대해 알려드립니다. (연도, 변동사유발생연월일, 판매업종류 등)
Author산업통상자원부
URLhttps://www.data.go.kr/data/3076606/fileData.do

Alerts

판매업종류 has constant value ""Constant

Reproduction

Analysis started2024-03-23 04:44:14.913615
Analysis finished2024-03-23 04:44:18.511283
Duration3.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2018.9508
Minimum2015
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size68.0 KiB
2024-03-23T04:44:18.668108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2015
5-th percentile2015
Q12017
median2019
Q32021
95-th percentile2023
Maximum2023
Range8
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.5822056
Coefficient of variation (CV)0.0012789839
Kurtosis-1.2040442
Mean2018.9508
Median Absolute Deviation (MAD)2
Skewness-0.026414192
Sum15600433
Variance6.6677857
MonotonicityIncreasing
2024-03-23T04:44:19.074464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
2015 1030
13.3%
2018 999
12.9%
2021 894
11.6%
2022 881
11.4%
2019 864
11.2%
2020 842
10.9%
2023 770
10.0%
2017 727
9.4%
2016 720
9.3%
ValueCountFrequency (%)
2015 1030
13.3%
2016 720
9.3%
2017 727
9.4%
2018 999
12.9%
2019 864
11.2%
2020 842
10.9%
2021 894
11.6%
2022 881
11.4%
2023 770
10.0%
ValueCountFrequency (%)
2023 770
10.0%
2022 881
11.4%
2021 894
11.6%
2020 842
10.9%
2019 864
11.2%
2018 999
12.9%
2017 727
9.4%
2016 720
9.3%
2015 1030
13.3%
Distinct2205
Distinct (%)28.5%
Missing0
Missing (%)0.0%
Memory size60.5 KiB
Minimum2015-01-02 00:00:00
Maximum2023-12-29 00:00:00
2024-03-23T04:44:19.621390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T04:44:20.060266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

판매업종류
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size60.5 KiB
주유소
7727 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주유소
2nd row주유소
3rd row주유소
4th row주유소
5th row주유소

Common Values

ValueCountFrequency (%)
주유소 7727
100.0%

Length

2024-03-23T04:44:20.622490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T04:44:21.080335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주유소 7727
100.0%

구분
Categorical

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size60.5 KiB
휴업
4452 
폐업
2212 
신규등록
774 
등록취소
 
177
신규
 
112

Length

Max length4
Median length2
Mean length2.2461499
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row휴업
2nd row신규등록
3rd row폐업
4th row휴업
5th row신규등록

Common Values

ValueCountFrequency (%)
휴업 4452
57.6%
폐업 2212
28.6%
신규등록 774
 
10.0%
등록취소 177
 
2.3%
신규 112
 
1.4%

Length

2024-03-23T04:44:21.422040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-23T04:44:21.790300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
휴업 4452
57.6%
폐업 2212
28.6%
신규등록 774
 
10.0%
등록취소 177
 
2.3%
신규 112
 
1.4%
Distinct4169
Distinct (%)54.0%
Missing0
Missing (%)0.0%
Memory size60.5 KiB
2024-03-23T04:44:22.688825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length26
Mean length7.4652517
Min length2

Characters and Unicode

Total characters57684
Distinct characters608
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2741 ?
Unique (%)35.5%

Sample

1st row(주)유림에너지 대기주유소
2nd row광나는 주유소
3rd row㈜소모석유 상무제일주유소
4th row활주로주유소
5th row남상주농협주유소
ValueCountFrequency (%)
현대오일뱅크㈜직영 79
 
0.9%
주유소 66
 
0.7%
주식회사 60
 
0.7%
sk네트웍스(주 59
 
0.7%
현대주유소 45
 
0.5%
현대오일뱅크(주)직영 38
 
0.4%
제일주유소 35
 
0.4%
지에스칼텍스(주 33
 
0.4%
대성주유소 32
 
0.4%
삼성주유소 31
 
0.3%
Other values (4374) 8538
94.7%
2024-03-23T04:44:24.743280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8604
 
14.9%
7644
 
13.3%
7355
 
12.8%
) 1295
 
2.2%
( 1292
 
2.2%
1289
 
2.2%
1007
 
1.7%
813
 
1.4%
688
 
1.2%
625
 
1.1%
Other values (598) 27072
46.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 51949
90.1%
Close Punctuation 1296
 
2.2%
Open Punctuation 1293
 
2.2%
Space Separator 1289
 
2.2%
Uppercase Letter 1009
 
1.7%
Other Symbol 348
 
0.6%
Decimal Number 249
 
0.4%
Lowercase Letter 217
 
0.4%
Dash Punctuation 22
 
< 0.1%
Other Punctuation 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8604
 
16.6%
7644
 
14.7%
7355
 
14.2%
1007
 
1.9%
813
 
1.6%
688
 
1.3%
625
 
1.2%
607
 
1.2%
533
 
1.0%
528
 
1.0%
Other values (536) 23545
45.3%
Uppercase Letter
ValueCountFrequency (%)
S 298
29.5%
K 251
24.9%
I 117
 
11.6%
C 112
 
11.1%
G 47
 
4.7%
H 34
 
3.4%
O 34
 
3.4%
J 19
 
1.9%
D 19
 
1.9%
L 15
 
1.5%
Other values (14) 63
 
6.2%
Lowercase Letter
ValueCountFrequency (%)
e 46
21.2%
f 45
20.7%
l 45
20.7%
s 44
20.3%
p 6
 
2.8%
k 5
 
2.3%
o 5
 
2.3%
h 4
 
1.8%
t 4
 
1.8%
r 4
 
1.8%
Other values (7) 9
 
4.1%
Decimal Number
ValueCountFrequency (%)
2 85
34.1%
1 66
26.5%
0 22
 
8.8%
3 18
 
7.2%
4 17
 
6.8%
5 16
 
6.4%
8 13
 
5.2%
9 8
 
3.2%
6 3
 
1.2%
7 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
& 6
54.5%
. 3
27.3%
; 2
 
18.2%
Close Punctuation
ValueCountFrequency (%)
) 1295
99.9%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1292
99.9%
1
 
0.1%
Space Separator
ValueCountFrequency (%)
1289
100.0%
Other Symbol
ValueCountFrequency (%)
348
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 52297
90.7%
Common 4160
 
7.2%
Latin 1227
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8604
 
16.5%
7644
 
14.6%
7355
 
14.1%
1007
 
1.9%
813
 
1.6%
688
 
1.3%
625
 
1.2%
607
 
1.2%
533
 
1.0%
528
 
1.0%
Other values (537) 23893
45.7%
Latin
ValueCountFrequency (%)
S 298
24.3%
K 251
20.5%
I 117
 
9.5%
C 112
 
9.1%
G 47
 
3.8%
e 46
 
3.7%
f 45
 
3.7%
l 45
 
3.7%
s 44
 
3.6%
H 34
 
2.8%
Other values (32) 188
15.3%
Common
ValueCountFrequency (%)
) 1295
31.1%
( 1292
31.1%
1289
31.0%
2 85
 
2.0%
1 66
 
1.6%
- 22
 
0.5%
0 22
 
0.5%
3 18
 
0.4%
4 17
 
0.4%
5 16
 
0.4%
Other values (9) 38
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51949
90.1%
ASCII 5384
 
9.3%
None 350
 
0.6%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8604
 
16.6%
7644
 
14.7%
7355
 
14.2%
1007
 
1.9%
813
 
1.6%
688
 
1.3%
625
 
1.2%
607
 
1.2%
533
 
1.0%
528
 
1.0%
Other values (536) 23545
45.3%
ASCII
ValueCountFrequency (%)
) 1295
24.1%
( 1292
24.0%
1289
23.9%
S 298
 
5.5%
K 251
 
4.7%
I 117
 
2.2%
C 112
 
2.1%
2 85
 
1.6%
1 66
 
1.2%
G 47
 
0.9%
Other values (48) 532
9.9%
None
ValueCountFrequency (%)
348
99.4%
1
 
0.3%
1
 
0.3%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct4814
Distinct (%)62.3%
Missing0
Missing (%)0.0%
Memory size60.5 KiB
2024-03-23T04:44:25.706002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length44
Mean length22.334153
Min length13

Characters and Unicode

Total characters172576
Distinct characters463
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3247 ?
Unique (%)42.0%

Sample

1st row대구광역시 북구 팔달로41 (노원동3가)
2nd row경기도 동두천시 평화로3059
3rd row광주광역시 서구 상무시민로142 (치평동)
4th row전라남도 나주시 산남로141 (산포면)
5th row경상북도 상주시 청리면 남상주로 1044
ValueCountFrequency (%)
경기도 1433
 
4.4%
경상북도 830
 
2.6%
경상남도 769
 
2.4%
충청남도 647
 
2.0%
전라북도 586
 
1.8%
충청북도 548
 
1.7%
강원도 535
 
1.6%
전라남도 519
 
1.6%
서울특별시 417
 
1.3%
부산광역시 290
 
0.9%
Other values (7404) 25868
79.7%
2024-03-23T04:44:27.185682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24715
 
14.3%
7304
 
4.2%
( 7104
 
4.1%
) 7045
 
4.1%
6300
 
3.7%
6145
 
3.6%
4996
 
2.9%
1 4748
 
2.8%
3570
 
2.1%
3326
 
1.9%
Other values (453) 97323
56.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 107524
62.3%
Decimal Number 25586
 
14.8%
Space Separator 24715
 
14.3%
Open Punctuation 7104
 
4.1%
Close Punctuation 7045
 
4.1%
Dash Punctuation 508
 
0.3%
Other Punctuation 84
 
< 0.1%
Uppercase Letter 6
 
< 0.1%
Lowercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7304
 
6.8%
6300
 
5.9%
6145
 
5.7%
4996
 
4.6%
3570
 
3.3%
3326
 
3.1%
3167
 
2.9%
2767
 
2.6%
2711
 
2.5%
2511
 
2.3%
Other values (426) 64727
60.2%
Decimal Number
ValueCountFrequency (%)
1 4748
18.6%
2 3320
13.0%
3 2692
10.5%
4 2509
9.8%
5 2344
9.2%
7 2201
8.6%
6 2069
8.1%
8 1950
7.6%
0 1888
 
7.4%
9 1865
 
7.3%
Other Punctuation
ValueCountFrequency (%)
, 58
69.0%
· 15
 
17.9%
. 9
 
10.7%
/ 1
 
1.2%
: 1
 
1.2%
Uppercase Letter
ValueCountFrequency (%)
G 2
33.3%
L 1
16.7%
B 1
16.7%
C 1
16.7%
S 1
16.7%
Lowercase Letter
ValueCountFrequency (%)
s 2
50.0%
g 1
25.0%
k 1
25.0%
Space Separator
ValueCountFrequency (%)
24715
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7104
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7045
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 508
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 107524
62.3%
Common 65042
37.7%
Latin 10
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7304
 
6.8%
6300
 
5.9%
6145
 
5.7%
4996
 
4.6%
3570
 
3.3%
3326
 
3.1%
3167
 
2.9%
2767
 
2.6%
2711
 
2.5%
2511
 
2.3%
Other values (426) 64727
60.2%
Common
ValueCountFrequency (%)
24715
38.0%
( 7104
 
10.9%
) 7045
 
10.8%
1 4748
 
7.3%
2 3320
 
5.1%
3 2692
 
4.1%
4 2509
 
3.9%
5 2344
 
3.6%
7 2201
 
3.4%
6 2069
 
3.2%
Other values (9) 6295
 
9.7%
Latin
ValueCountFrequency (%)
s 2
20.0%
G 2
20.0%
L 1
10.0%
B 1
10.0%
C 1
10.0%
g 1
10.0%
S 1
10.0%
k 1
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 107524
62.3%
ASCII 65037
37.7%
None 15
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24715
38.0%
( 7104
 
10.9%
) 7045
 
10.8%
1 4748
 
7.3%
2 3320
 
5.1%
3 2692
 
4.1%
4 2509
 
3.9%
5 2344
 
3.6%
7 2201
 
3.4%
6 2069
 
3.2%
Other values (16) 6290
 
9.7%
Hangul
ValueCountFrequency (%)
7304
 
6.8%
6300
 
5.9%
6145
 
5.7%
4996
 
4.6%
3570
 
3.3%
3326
 
3.1%
3167
 
2.9%
2767
 
2.6%
2711
 
2.5%
2511
 
2.3%
Other values (426) 64727
60.2%
None
ValueCountFrequency (%)
· 15
100.0%

Interactions

2024-03-23T04:44:16.987929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T04:44:27.501714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도구분
연도1.0000.269
구분0.2691.000
2024-03-23T04:44:27.721017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도구분
연도1.0000.170
구분0.1701.000

Missing values

2024-03-23T04:44:17.592008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T04:44:18.357649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도변동사유발생연월일판매업종류구분업체명소재지
020152015-01-02주유소휴업(주)유림에너지 대기주유소대구광역시 북구 팔달로41 (노원동3가)
120152015-01-05주유소신규등록광나는 주유소경기도 동두천시 평화로3059
220152015-01-05주유소폐업㈜소모석유 상무제일주유소광주광역시 서구 상무시민로142 (치평동)
320152015-01-05주유소휴업활주로주유소전라남도 나주시 산남로141 (산포면)
420152015-01-06주유소신규등록남상주농협주유소경상북도 상주시 청리면 남상주로 1044
520152015-01-06주유소휴업팔공산한마음주유소대구광역시 동구 팔공산로283 (덕곡동)
620152015-01-06주유소휴업태양주유소부산광역시 부산진구 성지로60 (연지동)
720152015-01-06주유소휴업몽촌주유소서울특별시 송파구 백제고분로475 (방이동)
820152015-01-06주유소폐업쇄노재주유소전라남도 해남군 북평면 백도로 531
920152015-01-06주유소휴업삽교호주유소충청남도 당진시 삽교천길88
연도변동사유발생연월일판매업종류구분업체명소재지
771720232023-12-28주유소휴업중앙주유소충청북도 충주시 탄금대로107 (봉방동)
771820232023-12-28주유소휴업국도주유소충청북도 진천군 진광로62 (진천읍)
771920232023-12-28주유소휴업남군산주유소전라북도 군산시 상평로16 (옥구읍)
772020232023-12-29주유소휴업HD현대오일뱅크(주)직영 대동현대셀프주유소대전광역시 동구 계족로126 (대동)
772120232023-12-29주유소휴업은마석유(주) 노포주유소부산광역시 금정구 중앙대로2191 (노포동)
772220232023-12-29주유소휴업고래주유소울산광역시 남구 산업로138 (상개동)
772320232023-12-29주유소휴업(주)유웨이직영 사이다셀프충청남도 천안시 동남구 천안대로784 (신부동)
772420232023-12-29주유소폐업첨단주유소전라북도 전주시 덕진구 기린대로894 (여의동2가)
772520232023-12-29주유소폐업새한에너지경기도 양주시 은현로482 (은현면)
772620232023-12-29주유소휴업천봉주유소충청북도 보은군 남부로5084 (보은읍)