Overview

Dataset statistics

Number of variables6
Number of observations40
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory51.3 B

Variable types

Text3
Categorical3

Dataset

Description이 데이터는 충청남도 금산군 내 주유소 현황으로 주유소명, 주유소위치, 주유소연락처, 영업상태 등의 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/15118432/fileData.do

Alerts

데이터기준일 has constant value ""Constant
영업구분 is highly imbalanced (71.4%)Imbalance
상호 has unique valuesUnique
영업소 도로명 주소 has unique valuesUnique
영업소전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:18:43.164884
Analysis finished2023-12-12 04:18:43.573308
Duration0.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

UNIQUE 

Distinct40
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
2023-12-12T13:18:43.712860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length5
Mean length6.6
Min length5

Characters and Unicode

Total characters264
Distinct characters81
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)100.0%

Sample

1st row꿀벌주유소
2nd row부리농협주유소
3rd row금산알뜰주유소
4th row진산농협주유소
5th row대둔산주유소
ValueCountFrequency (%)
꿀벌주유소 1
 
2.3%
평화주유소 1
 
2.3%
서대산주유소 1
 
2.3%
삼삼주유소 1
 
2.3%
일등주유소 1
 
2.3%
금산농협주유소 1
 
2.3%
삼마주유소 1
 
2.3%
아인주유소 1
 
2.3%
가까운주유소 1
 
2.3%
군북주유소 1
 
2.3%
Other values (33) 33
76.7%
2023-12-12T13:18:44.070590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
43
16.3%
38
 
14.4%
38
 
14.4%
13
 
4.9%
8
 
3.0%
7
 
2.7%
( 6
 
2.3%
) 6
 
2.3%
5
 
1.9%
4
 
1.5%
Other values (71) 96
36.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 247
93.6%
Open Punctuation 6
 
2.3%
Close Punctuation 6
 
2.3%
Space Separator 3
 
1.1%
Uppercase Letter 2
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
43
17.4%
38
15.4%
38
15.4%
13
 
5.3%
8
 
3.2%
7
 
2.8%
5
 
2.0%
4
 
1.6%
3
 
1.2%
3
 
1.2%
Other values (66) 85
34.4%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
K 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 247
93.6%
Common 15
 
5.7%
Latin 2
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
43
17.4%
38
15.4%
38
15.4%
13
 
5.3%
8
 
3.2%
7
 
2.8%
5
 
2.0%
4
 
1.6%
3
 
1.2%
3
 
1.2%
Other values (66) 85
34.4%
Common
ValueCountFrequency (%)
( 6
40.0%
) 6
40.0%
3
20.0%
Latin
ValueCountFrequency (%)
S 1
50.0%
K 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 247
93.6%
ASCII 17
 
6.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
43
17.4%
38
15.4%
38
15.4%
13
 
5.3%
8
 
3.2%
7
 
2.8%
5
 
2.0%
4
 
1.6%
3
 
1.2%
3
 
1.2%
Other values (66) 85
34.4%
ASCII
ValueCountFrequency (%)
( 6
35.3%
) 6
35.3%
3
17.6%
S 1
 
5.9%
K 1
 
5.9%
Distinct2
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
개인
35 
법인

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개인
2nd row법인
3rd row개인
4th row개인
5th row개인

Common Values

ValueCountFrequency (%)
개인 35
87.5%
법인 5
 
12.5%

Length

2023-12-12T13:18:44.234423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:18:44.353035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개인 35
87.5%
법인 5
 
12.5%

영업구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
영업개시
38 
휴지사업재개
 
2

Length

Max length6
Median length4
Mean length4.1
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업개시
2nd row영업개시
3rd row영업개시
4th row영업개시
5th row휴지사업재개

Common Values

ValueCountFrequency (%)
영업개시 38
95.0%
휴지사업재개 2
 
5.0%

Length

2023-12-12T13:18:44.486573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:18:44.605618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업개시 38
95.0%
휴지사업재개 2
 
5.0%
Distinct40
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
2023-12-12T13:18:44.840627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length29
Mean length21.25
Min length18

Characters and Unicode

Total characters850
Distinct characters61
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)100.0%

Sample

1st row충청남도 금산군 추부면 서대산로 540
2nd row충청남도 금산군 부리면 무금로 1585, 부리농협주유소
3rd row충청남도 금산군 제원면 군북로 464
4th row충청남도 금산군 진산면 대둔산로 596
5th row충청남도 금산군 진산면 태고사로 450
ValueCountFrequency (%)
충청남도 40
19.3%
금산군 40
19.3%
금산로 11
 
5.3%
추부면 8
 
3.9%
복수면 7
 
3.4%
군북면 6
 
2.9%
금산읍 6
 
2.9%
다복로 5
 
2.4%
진산면 4
 
1.9%
복수로 3
 
1.4%
Other values (58) 77
37.2%
2023-12-12T13:18:45.296451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
167
19.6%
68
 
8.0%
63
 
7.4%
48
 
5.6%
42
 
4.9%
40
 
4.7%
40
 
4.7%
40
 
4.7%
39
 
4.6%
34
 
4.0%
Other values (51) 269
31.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 545
64.1%
Space Separator 167
 
19.6%
Decimal Number 130
 
15.3%
Open Punctuation 3
 
0.4%
Close Punctuation 3
 
0.4%
Dash Punctuation 1
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
68
12.5%
63
11.6%
48
8.8%
42
 
7.7%
40
 
7.3%
40
 
7.3%
40
 
7.3%
39
 
7.2%
34
 
6.2%
15
 
2.8%
Other values (36) 116
21.3%
Decimal Number
ValueCountFrequency (%)
1 26
20.0%
4 23
17.7%
2 14
10.8%
7 13
10.0%
3 12
9.2%
6 11
8.5%
5 10
 
7.7%
0 9
 
6.9%
8 7
 
5.4%
9 5
 
3.8%
Space Separator
ValueCountFrequency (%)
167
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 545
64.1%
Common 305
35.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
68
12.5%
63
11.6%
48
8.8%
42
 
7.7%
40
 
7.3%
40
 
7.3%
40
 
7.3%
39
 
7.2%
34
 
6.2%
15
 
2.8%
Other values (36) 116
21.3%
Common
ValueCountFrequency (%)
167
54.8%
1 26
 
8.5%
4 23
 
7.5%
2 14
 
4.6%
7 13
 
4.3%
3 12
 
3.9%
6 11
 
3.6%
5 10
 
3.3%
0 9
 
3.0%
8 7
 
2.3%
Other values (5) 13
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 545
64.1%
ASCII 305
35.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
167
54.8%
1 26
 
8.5%
4 23
 
7.5%
2 14
 
4.6%
7 13
 
4.3%
3 12
 
3.9%
6 11
 
3.6%
5 10
 
3.3%
0 9
 
3.0%
8 7
 
2.3%
Other values (5) 13
 
4.3%
Hangul
ValueCountFrequency (%)
68
12.5%
63
11.6%
48
8.8%
42
 
7.7%
40
 
7.3%
40
 
7.3%
40
 
7.3%
39
 
7.2%
34
 
6.2%
15
 
2.8%
Other values (36) 116
21.3%
Distinct40
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
2023-12-12T13:18:45.562795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters480
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)100.0%

Sample

1st row041-752-5826
2nd row041-754-5188
3rd row041-752-2125
4th row041-752-4238
5th row041-752-9734
ValueCountFrequency (%)
041-752-5826 1
 
2.5%
041-754-5188 1
 
2.5%
041-754-5123 1
 
2.5%
041-754-8200 1
 
2.5%
041-754-1155 1
 
2.5%
041-751-3377 1
 
2.5%
041-754-5185 1
 
2.5%
041-753-8686 1
 
2.5%
041-752-0236 1
 
2.5%
041-754-5116 1
 
2.5%
Other values (30) 30
75.0%
2023-12-12T13:18:45.960160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 80
16.7%
1 75
15.6%
5 67
14.0%
0 60
12.5%
4 59
12.3%
7 45
9.4%
2 29
 
6.0%
3 29
 
6.0%
8 19
 
4.0%
6 10
 
2.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 400
83.3%
Dash Punctuation 80
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 75
18.8%
5 67
16.8%
0 60
15.0%
4 59
14.8%
7 45
11.2%
2 29
 
7.2%
3 29
 
7.2%
8 19
 
4.8%
6 10
 
2.5%
9 7
 
1.8%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 480
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 80
16.7%
1 75
15.6%
5 67
14.0%
0 60
12.5%
4 59
12.3%
7 45
9.4%
2 29
 
6.0%
3 29
 
6.0%
8 19
 
4.0%
6 10
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 480
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 80
16.7%
1 75
15.6%
5 67
14.0%
0 60
12.5%
4 59
12.3%
7 45
9.4%
2 29
 
6.0%
3 29
 
6.0%
8 19
 
4.0%
6 10
 
2.1%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
2023-08-11
40 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-11
2nd row2023-08-11
3rd row2023-08-11
4th row2023-08-11
5th row2023-08-11

Common Values

ValueCountFrequency (%)
2023-08-11 40
100.0%

Length

2023-12-12T13:18:46.166833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:18:46.299009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-11 40
100.0%

Correlations

2023-12-12T13:18:46.387393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상호개인법인구분영업구분영업소 도로명 주소영업소전화번호
상호1.0001.0001.0001.0001.000
개인법인구분1.0001.0000.0001.0001.000
영업구분1.0000.0001.0001.0001.000
영업소 도로명 주소1.0001.0001.0001.0001.000
영업소전화번호1.0001.0001.0001.0001.000
2023-12-12T13:18:46.492536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
개인법인구분영업구분
개인법인구분1.0000.000
영업구분0.0001.000
2023-12-12T13:18:46.590846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
개인법인구분영업구분
개인법인구분1.0000.000
영업구분0.0001.000

Missing values

2023-12-12T13:18:43.442699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:18:43.534861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호개인법인구분영업구분영업소 도로명 주소영업소전화번호데이터기준일
0꿀벌주유소개인영업개시충청남도 금산군 추부면 서대산로 540041-752-58262023-08-11
1부리농협주유소법인영업개시충청남도 금산군 부리면 무금로 1585, 부리농협주유소041-754-51882023-08-11
2금산알뜰주유소개인영업개시충청남도 금산군 제원면 군북로 464041-752-21252023-08-11
3진산농협주유소개인영업개시충청남도 금산군 진산면 대둔산로 596041-752-42382023-08-11
4대둔산주유소개인휴지사업재개충청남도 금산군 진산면 태고사로 450041-752-97342023-08-11
5범아주유소개인영업개시충청남도 금산군 군북면 어필각로 241041-753-66282023-08-11
6대둔주유소개인영업개시충청남도 금산군 복수면 다복로 121041-754-31032023-08-11
7금정주유소법인영업개시충청남도 금산군 추부면 금산로 2341041-751-51892023-08-11
8우영주유소개인영업개시충청남도 금산군 복수면 다복로 437041-753-18182023-08-11
9(주)부일에너지 깻잎셀프주유소개인휴지사업재개충청남도 금산군 추부면 금산로 2426041-753-51332023-08-11
상호개인법인구분영업구분영업소 도로명 주소영업소전화번호데이터기준일
30진산자연휴양림개인영업개시충청남도 금산군 진산면 대둔산로 6041-752-09332023-08-11
31추정주유소개인영업개시충청남도 금산군 추부면 금산로 2284041-751-18802023-08-11
32새말주유소개인영업개시충청남도 금산군 군북면 금산로 1746041-752-23892023-08-11
33봉황주유소개인영업개시충청남도 금산군 제원면 금강로 172041-754-51012023-08-11
34하나SK주유소개인영업개시충청남도 금산군 금성면 진산로 142041-751-02542023-08-11
35공단주유소개인영업개시충청남도 금산군 복수면 다복로 477041-753-39332023-08-11
36진산주유소개인영업개시충청남도 금산군 진산면 대둔산로 410041-752-40472023-08-11
37금산주유소개인영업개시충청남도 금산군 금산읍 금산로 1486041-752-21222023-08-11
38금산흥국주유소개인영업개시충청남도 금산군 금산읍 금산로 1544041-753-62502023-08-11
39마전주유소개인영업개시충청남도 금산군 추부면 마전로 20 (외 1필지)041-752-50102023-08-11