Overview

Dataset statistics

Number of variables4
Number of observations1289
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory40.4 KiB
Average record size in memory32.1 B

Variable types

Text2
Categorical1
Boolean1

Dataset

Description현재 영업중인 전국의 알뜰주유소의 현황 정보이며 자영알뜰주유소, EX알뜰주유소(한국도로공사 운영), NH알뜰주유소(농협 운영) 등으로 구분됨.
Author한국석유공사
URLhttps://www.data.go.kr/data/15076635/fileData.do

Alerts

상표 is highly overall correlated with 셀프여부High correlation
셀프여부 is highly overall correlated with 상표High correlation

Reproduction

Analysis started2024-03-14 16:53:16.500742
Analysis finished2024-03-14 16:53:17.608611
Duration1.11 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct1263
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2024-03-15T01:53:18.507934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length9.1442979
Min length3

Characters and Unicode

Total characters11787
Distinct characters396
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1242 ?
Unique (%)96.4%

Sample

1st row당진농협주유소
2nd row안동농협주유소
3rd row여천농협주유소
4th row원주농협충전소
5th row수곡농협
ValueCountFrequency (%)
주유소 11
 
0.7%
한국도로공사 8
 
0.5%
클린주유소 7
 
0.5%
우리주유소 6
 
0.4%
대보건설(주 4
 
0.3%
주식회사 4
 
0.3%
시카프관광개발(주 3
 
0.2%
그린주유소 3
 
0.2%
풀무원식품(주 3
 
0.2%
서창산업(주 3
 
0.2%
Other values (1369) 1426
96.5%
2024-03-15T01:53:19.939260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1435
 
12.2%
1339
 
11.4%
1280
 
10.9%
691
 
5.9%
691
 
5.9%
) 315
 
2.7%
( 315
 
2.7%
189
 
1.6%
177
 
1.5%
150
 
1.3%
Other values (386) 5205
44.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10748
91.2%
Close Punctuation 315
 
2.7%
Open Punctuation 315
 
2.7%
Space Separator 189
 
1.6%
Other Punctuation 95
 
0.8%
Other Symbol 48
 
0.4%
Uppercase Letter 38
 
0.3%
Decimal Number 28
 
0.2%
Lowercase Letter 10
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1435
 
13.4%
1339
 
12.5%
1280
 
11.9%
691
 
6.4%
691
 
6.4%
177
 
1.6%
150
 
1.4%
149
 
1.4%
132
 
1.2%
126
 
1.2%
Other values (357) 4578
42.6%
Uppercase Letter
ValueCountFrequency (%)
I 10
26.3%
C 9
23.7%
K 5
13.2%
S 4
 
10.5%
Y 3
 
7.9%
M 3
 
7.9%
H 2
 
5.3%
E 1
 
2.6%
N 1
 
2.6%
Decimal Number
ValueCountFrequency (%)
2 12
42.9%
1 7
25.0%
3 3
 
10.7%
4 2
 
7.1%
0 2
 
7.1%
5 1
 
3.6%
8 1
 
3.6%
Lowercase Letter
ValueCountFrequency (%)
f 2
20.0%
l 2
20.0%
e 2
20.0%
s 2
20.0%
a 1
10.0%
r 1
10.0%
Other Punctuation
ValueCountFrequency (%)
/ 94
98.9%
, 1
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 315
100.0%
Open Punctuation
ValueCountFrequency (%)
( 315
100.0%
Space Separator
ValueCountFrequency (%)
189
100.0%
Other Symbol
ValueCountFrequency (%)
48
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10796
91.6%
Common 943
 
8.0%
Latin 48
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1435
 
13.3%
1339
 
12.4%
1280
 
11.9%
691
 
6.4%
691
 
6.4%
177
 
1.6%
150
 
1.4%
149
 
1.4%
132
 
1.2%
126
 
1.2%
Other values (358) 4626
42.8%
Latin
ValueCountFrequency (%)
I 10
20.8%
C 9
18.8%
K 5
10.4%
S 4
 
8.3%
Y 3
 
6.2%
M 3
 
6.2%
f 2
 
4.2%
l 2
 
4.2%
e 2
 
4.2%
s 2
 
4.2%
Other values (5) 6
12.5%
Common
ValueCountFrequency (%)
) 315
33.4%
( 315
33.4%
189
20.0%
/ 94
 
10.0%
2 12
 
1.3%
1 7
 
0.7%
3 3
 
0.3%
4 2
 
0.2%
0 2
 
0.2%
5 1
 
0.1%
Other values (3) 3
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10748
91.2%
ASCII 991
 
8.4%
None 48
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1435
 
13.4%
1339
 
12.5%
1280
 
11.9%
691
 
6.4%
691
 
6.4%
177
 
1.6%
150
 
1.4%
149
 
1.4%
132
 
1.2%
126
 
1.2%
Other values (357) 4578
42.6%
ASCII
ValueCountFrequency (%)
) 315
31.8%
( 315
31.8%
189
19.1%
/ 94
 
9.5%
2 12
 
1.2%
I 10
 
1.0%
C 9
 
0.9%
1 7
 
0.7%
K 5
 
0.5%
S 4
 
0.4%
Other values (18) 31
 
3.1%
None
ValueCountFrequency (%)
48
100.0%

주소
Text

Distinct1287
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2024-03-15T01:53:21.345201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length29
Mean length19.672614
Min length12

Characters and Unicode

Total characters25358
Distinct characters365
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1285 ?
Unique (%)99.7%

Sample

1st row충남 당진시 동부1로 15
2nd row경북 안동시 경동로 173
3rd row전남 여수시 도원로 163
4th row강원 원주시 호저로 230
5th row경남 진주시 수곡면 곤수로 922
ValueCountFrequency (%)
경북 174
 
2.7%
경남 154
 
2.4%
전남 150
 
2.3%
경기 148
 
2.3%
전북 128
 
2.0%
충남 116
 
1.8%
강원 96
 
1.5%
충북 93
 
1.4%
청주시 36
 
0.6%
대구 35
 
0.5%
Other values (2859) 5320
82.5%
2024-03-15T01:53:22.893938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5183
 
20.4%
1168
 
4.6%
1 809
 
3.2%
754
 
3.0%
702
 
2.8%
639
 
2.5%
586
 
2.3%
547
 
2.2%
2 545
 
2.1%
519
 
2.0%
Other values (355) 13906
54.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15467
61.0%
Space Separator 5183
 
20.4%
Decimal Number 4115
 
16.2%
Close Punctuation 226
 
0.9%
Open Punctuation 226
 
0.9%
Dash Punctuation 138
 
0.5%
Uppercase Letter 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1168
 
7.6%
754
 
4.9%
702
 
4.5%
639
 
4.1%
586
 
3.8%
547
 
3.5%
519
 
3.4%
414
 
2.7%
369
 
2.4%
362
 
2.3%
Other values (338) 9407
60.8%
Decimal Number
ValueCountFrequency (%)
1 809
19.7%
2 545
13.2%
3 440
10.7%
4 397
9.6%
5 370
9.0%
7 354
8.6%
6 318
 
7.7%
0 305
 
7.4%
8 302
 
7.3%
9 275
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
K 1
50.0%
S 1
50.0%
Space Separator
ValueCountFrequency (%)
5183
100.0%
Close Punctuation
ValueCountFrequency (%)
) 226
100.0%
Open Punctuation
ValueCountFrequency (%)
( 226
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 138
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15467
61.0%
Common 9889
39.0%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1168
 
7.6%
754
 
4.9%
702
 
4.5%
639
 
4.1%
586
 
3.8%
547
 
3.5%
519
 
3.4%
414
 
2.7%
369
 
2.4%
362
 
2.3%
Other values (338) 9407
60.8%
Common
ValueCountFrequency (%)
5183
52.4%
1 809
 
8.2%
2 545
 
5.5%
3 440
 
4.4%
4 397
 
4.0%
5 370
 
3.7%
7 354
 
3.6%
6 318
 
3.2%
0 305
 
3.1%
8 302
 
3.1%
Other values (5) 866
 
8.8%
Latin
ValueCountFrequency (%)
K 1
50.0%
S 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15467
61.0%
ASCII 9891
39.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5183
52.4%
1 809
 
8.2%
2 545
 
5.5%
3 440
 
4.4%
4 397
 
4.0%
5 370
 
3.7%
7 354
 
3.6%
6 318
 
3.2%
0 305
 
3.1%
8 302
 
3.1%
Other values (7) 868
 
8.8%
Hangul
ValueCountFrequency (%)
1168
 
7.6%
754
 
4.9%
702
 
4.5%
639
 
4.1%
586
 
3.8%
547
 
3.5%
519
 
3.4%
414
 
2.7%
369
 
2.4%
362
 
2.3%
Other values (338) 9407
60.8%

상표
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
농협알뜰
692 
자영알뜰
404 
도로공사알뜰
193 

Length

Max length6
Median length4
Mean length4.2994569
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row농협알뜰
2nd row농협알뜰
3rd row농협알뜰
4th row농협알뜰
5th row농협알뜰

Common Values

ValueCountFrequency (%)
농협알뜰 692
53.7%
자영알뜰 404
31.3%
도로공사알뜰 193
 
15.0%

Length

2024-03-15T01:53:23.349064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T01:53:23.697187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
농협알뜰 692
53.7%
자영알뜰 404
31.3%
도로공사알뜰 193
 
15.0%

셀프여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
True
674 
False
615 
ValueCountFrequency (%)
True 674
52.3%
False 615
47.7%
2024-03-15T01:53:23.997385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T01:53:24.187806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상표셀프여부
상표1.0000.360
셀프여부0.3601.000
2024-03-15T01:53:24.425105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
셀프여부상표
셀프여부1.0000.573
상표0.5731.000
2024-03-15T01:53:24.652503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상표셀프여부
상표1.0000.573
셀프여부0.5731.000

Missing values

2024-03-15T01:53:17.300177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T01:53:17.548497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호주소상표셀프여부
0당진농협주유소충남 당진시 동부1로 15농협알뜰N
1안동농협주유소경북 안동시 경동로 173농협알뜰N
2여천농협주유소전남 여수시 도원로 163농협알뜰N
3원주농협충전소강원 원주시 호저로 230농협알뜰N
4수곡농협경남 진주시 수곡면 곤수로 922농협알뜰N
5평택농협주유소경기 평택시 만세로 1649농협알뜰N
6서세종농협주유소세종 연서면 함박로 122농협알뜰N
7양산농협주유소경남 양산시 양산대로 951농협알뜰N
8제천농협주유소충북 제천시 의림대로 394농협알뜰N
9순천원예농협주유소전남 순천시 남산로 39농협알뜰N
상호주소상표셀프여부
1279역전셀프주유소전북 전주시 덕진구 동부대로 748 (우아동3가)자영알뜰Y
1280카(Car)놀라유주유소충북 청주시 흥덕구 1순환로 615 (봉명동)자영알뜰Y
1281예루살렘셀프주유소경기도 포천시 일동면 금강로 4193 (일동면)자영알뜰Y
1282착한주유소 주식회사 상일물류충북 제천시 단양로 3400 (신백동)자영알뜰Y
1283(주)신마산주유소경남 창원시 마산합포구 밤밭고개로 390(월영동)자영알뜰Y
1284(주)메이저플러스 대전IC주유소대전 대덕구 동서대로 1787 (송촌동)자영알뜰Y
1285씨엘무역동두천지점 주식회사경기 동두천시 평화로 3122 (하봉암동)자영알뜰Y
1286(주)원일유통 춘천강원 춘천시 동내면 영서로 1432 춘천휴게소(내)SK주유소자영알뜰Y
1287대영에너지㈜세종지점 다온알뜰주유소세종특별자치시 세종로 1328 (아름동)자영알뜰Y
1288신덕계클린주유소 경동제이에이치개발경남 양산시 덕명로 48 (덕계동 177-34)자영알뜰Y