Overview

Dataset statistics

Number of variables: 5
Number of observations: 45
Missing cells: 45
Missing cells (%): 20.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.9 KiB
Average record size in memory: 43.9 B
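
The missing-cell percentage follows from the counts above: 45 missing cells out of 5 x 45 = 225 cells. A minimal sketch of that arithmetic (the variable names are illustrative, not taken from the report):

```python
# Figures copied from the dataset statistics above.
n_variables = 5
n_observations = 45
missing_cells = 45

total_cells = n_variables * n_observations        # 5 * 45 = 225 cells
missing_pct = 100 * missing_cells / total_cells   # share of empty cells, in percent

print(f"{missing_pct:.1f}%")
```

Because exactly one of the five columns is entirely empty, the share is one fifth of all cells.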

Variable types

Categorical: 1
Text: 3
Unsupported: 1

Dataset

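Description: LPG filling stations (LPG충전소), high-pressure gas manufacturers (고압가스제조사업자), and CNG filling operators (CNG충전사업자)
Author: Yongin-si, Gyeonggi-do (경기도 용인시)
URL: https://www.data.go.kr/data/15044239/fileData.do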

Alerts

구분 is highly imbalanced (52.2%) [Imbalance]
Unnamed: 4 has 45 (100.0%) missing values [Missing]
Unnamed: 4 is an unsupported type; check whether it needs cleaning or further analysis [Unsupported]
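
The report does not state how the 52.2% imbalance figure is derived. One formula that reproduces it from the value counts reported for 구분 (38, 5, 2) is one minus the normalized Shannon entropy. The sketch below assumes that formula, and the function name is hypothetical:

```python
import math

# Value counts reported for the 구분 column: LPG충전소, 고압가스제조사업자, CNG충전사업자.
counts = [38, 5, 2]

def imbalance_score(class_counts):
    """Return 1 - (Shannon entropy / maximum possible entropy) for class counts."""
    total = sum(class_counts)
    probs = [c / total for c in class_counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    max_entropy = math.log2(len(class_counts))
    # A single class has zero maximum entropy; treat that case as a score of 0.
    return 1 - entropy / max_entropy if max_entropy > 0 else 0.0

score = imbalance_score(counts)
print(f"{score:.1%}")
```

A score of 0 would mean perfectly balanced classes, and a score near 1 means that almost every row falls into a single class.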

Reproduction

Analysis started: 2023-12-12 08:20:08.449530
Analysis finished: 2023-12-12 08:20:08.995067
Duration: 0.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
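
The duration is the difference between the two timestamps above. A small sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
started = datetime.fromisoformat("2023-12-12 08:20:08.449530")
finished = datetime.fromisoformat("2023-12-12 08:20:08.995067")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```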

Variables

구분
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 492.0 B

Length

Max length: 9
Median length: 6
Mean length: 6.4222222
Min length: 6
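
These length statistics can be reproduced from the three category values and their reported counts (38, 5, 2). A minimal sketch, assuming length means the number of Unicode code points in each value:

```python
import statistics

# Category values of 구분 with the counts given in the report.
value_counts = {"LPG충전소": 38, "고압가스제조사업자": 5, "CNG충전사업자": 2}

# One length entry per row of the column (45 rows in total).
lengths = [len(value) for value, n in value_counts.items() for _ in range(n)]

max_len = max(lengths)
min_len = min(lengths)
median_len = statistics.median(lengths)
mean_len = statistics.mean(lengths)   # (38*6 + 5*9 + 2*8) / 45
```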

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: LPG충전소
2nd row: LPG충전소
3rd row: LPG충전소
4th row: LPG충전소
5th row: LPG충전소

Common Values

Value | Count | Frequency (%)
LPG충전소 | 38 | 84.4%
고압가스제조사업자 | 5 | 11.1%
CNG충전사업자 | 2 | 4.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of common values]
Value | Count | Frequency (%)
lpg충전소 | 38 | 84.4%
고압가스제조사업자 | 5 | 11.1%
cng충전사업자 | 2 | 4.4%

상 호
Text

Distinct: 43
Distinct (%): 95.6%
Missing: 0
Missing (%): 0.0%
Memory size: 492.0 B

Length

Max length: 25
Median length: 21
Mean length: 9.3111111
Min length: 5

Characters and Unicode

Total characters: 419
Distinct characters: 96
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
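
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one value from this column. It does not cover scripts or blocks, which the standard library does not expose directly, so Hangul is detected by character name as a stand-in:

```python
import unicodedata
from collections import Counter

# One value from this column (the 2nd sample row).
sample = "(주)성림가스 마북LPG충전소"

# Two-letter general categories: Lo = Other Letter, Lu = Uppercase Letter,
# Ps = Open Punctuation, Pe = Close Punctuation, Zs = Space Separator.
categories = Counter(unicodedata.category(ch) for ch in sample)

# Approximate the script check by looking at the official character name.
hangul_count = sum(1 for ch in sample if unicodedata.name(ch, "").startswith("HANGUL"))
```

Applying the same counting to every row of a column gives per-category tables like the ones in this report.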

Unique

Unique: 41
Unique (%): 91.1%

Sample

1st row: (주)대호에너지
2nd row: (주)성림가스 마북LPG충전소
3rd row: (주)원일석유 용인(하)고속도로 주유소.충전소
4th row: (주)한미석유 기흥(하) LPG 충전소
5th row: (주)한유에너지 수지충전소

Value | Count | Frequency (%)
충전소 | 4 | 6.9%
주)린데코리아 | 2 | 3.4%
lpg | 2 | 3.4%
프렉스에어코리아(주 | 2 | 3.4%
유방동lpg충전소 | 1 | 1.7%
신한에너지 | 1 | 1.7%
서일 | 1 | 1.7%
선봉대에너지 | 1 | 1.7%
수원 | 1 | 1.7%
ic | 1 | 1.7%
Other values (42) | 42 | 72.4%

Most occurring characters

Value | Count | Frequency (%)
(not preserved) | 32 | 7.6%
(not preserved) | 31 | 7.4%
(not preserved) | 30 | 7.2%
( | 19 | 4.5%
) | 19 | 4.5%
(not preserved) | 18 | 4.3%
L | 14 | 3.3%
P | 14 | 3.3%
(not preserved) | 13 | 3.1%
(not preserved) | 13 | 3.1%
Other values (86) | 216 | 51.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 320 | 76.4%
Uppercase Letter | 43 | 10.3%
Open Punctuation | 19 | 4.5%
Close Punctuation | 19 | 4.5%
Space Separator | 13 | 3.1%
Other Punctuation | 4 | 1.0%
Dash Punctuation | 1 | 0.2%

Most frequent character per category

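Other Letter
Value | Count | Frequency (%)
(not preserved) | 32 | 10.0%
(not preserved) | 31 | 9.7%
(not preserved) | 30 | 9.4%
(not preserved) | 18 | 5.6%
(not preserved) | 13 | 4.1%
(not preserved) | 12 | 3.8%
(not preserved) | 8 | 2.5%
(not preserved) | 7 | 2.2%
(not preserved) | 7 | 2.2%
(not preserved) | 7 | 2.2%
Other values (75) | 155 | 48.4%

Uppercase Letter
Value | Count | Frequency (%)
L | 14 | 32.6%
P | 14 | 32.6%
G | 13 | 30.2%
C | 1 | 2.3%
I | 1 | 2.3%

Other Punctuation
Value | Count | Frequency (%)
. | 3 | 75.0%
/ | 1 | 25.0%

Open Punctuation
Value | Count | Frequency (%)
( | 19 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 19 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 13 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%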

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 320 | 76.4%
Common | 56 | 13.4%
Latin | 43 | 10.3%

Most frequent character per script

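Hangul
Value | Count | Frequency (%)
(not preserved) | 32 | 10.0%
(not preserved) | 31 | 9.7%
(not preserved) | 30 | 9.4%
(not preserved) | 18 | 5.6%
(not preserved) | 13 | 4.1%
(not preserved) | 12 | 3.8%
(not preserved) | 8 | 2.5%
(not preserved) | 7 | 2.2%
(not preserved) | 7 | 2.2%
(not preserved) | 7 | 2.2%
Other values (75) | 155 | 48.4%

Common
Value | Count | Frequency (%)
( | 19 | 33.9%
) | 19 | 33.9%
(space) | 13 | 23.2%
. | 3 | 5.4%
- | 1 | 1.8%
/ | 1 | 1.8%

Latin
Value | Count | Frequency (%)
L | 14 | 32.6%
P | 14 | 32.6%
G | 13 | 30.2%
C | 1 | 2.3%
I | 1 | 2.3%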

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 320 | 76.4%
ASCII | 99 | 23.6%

Most frequent character per block

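Hangul
Value | Count | Frequency (%)
(not preserved) | 32 | 10.0%
(not preserved) | 31 | 9.7%
(not preserved) | 30 | 9.4%
(not preserved) | 18 | 5.6%
(not preserved) | 13 | 4.1%
(not preserved) | 12 | 3.8%
(not preserved) | 8 | 2.5%
(not preserved) | 7 | 2.2%
(not preserved) | 7 | 2.2%
(not preserved) | 7 | 2.2%
Other values (75) | 155 | 48.4%

ASCII
Value | Count | Frequency (%)
( | 19 | 19.2%
) | 19 | 19.2%
L | 14 | 14.1%
P | 14 | 14.1%
(space) | 13 | 13.1%
G | 13 | 13.1%
. | 3 | 3.0%
- | 1 | 1.0%
C | 1 | 1.0%
I | 1 | 1.0%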

주 소 지
Text

Distinct: 44
Distinct (%): 97.8%
Missing: 0
Missing (%): 0.0%
Memory size: 492.0 B

Length

Max length: 29
Median length: 28
Mean length: 25.066667
Min length: 18

Characters and Unicode

Total characters: 1128
Distinct characters: 78
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 43
Unique (%): 95.6%

Sample

1st row: 경기도 용인시기흥구 영덕동 104-6외4필지호
2nd row: 경기도 용인시 기흥구 용구대로 2294 (마북동)
3rd row: 경기도 용인시처인구 양지면 주북리 958-6호
4th row: 경기도 용인시 기흥구 공세로 173 (공세동)
5th row: 경기도 용인시 기흥구 신정로 609 (보정동)

Value | Count | Frequency (%)
경기도 | 45 | 17.8%
용인시 | 37 | 14.6%
처인구 | 20 | 7.9%
기흥구 | 17 | 6.7%
백옥대로 | 7 | 2.8%
용인시기흥구 | 5 | 2.0%
중부대로 | 5 | 2.0%
영덕동 | 4 | 1.6%
양지면 | 4 | 1.6%
농서동 | 4 | 1.6%
Other values (83) | 105 | 41.5%

Most occurring characters

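Value | Count | Frequency (%)
(space) | 210 | 18.6%
(not preserved) | 68 | 6.0%
(not preserved) | 67 | 5.9%
(not preserved) | 48 | 4.3%
(not preserved) | 47 | 4.2%
(not preserved) | 45 | 4.0%
(not preserved) | 45 | 4.0%
(not preserved) | 45 | 4.0%
(not preserved) | 37 | 3.3%
1 | 34 | 3.0%
Other values (68) | 482 | 42.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 694 | 61.5%
Space Separator | 210 | 18.6%
Decimal Number | 166 | 14.7%
Open Punctuation | 23 | 2.0%
Close Punctuation | 23 | 2.0%
Dash Punctuation | 12 | 1.1%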

Most frequent character per category

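Other Letter
Value | Count | Frequency (%)
(not preserved) | 68 | 9.8%
(not preserved) | 67 | 9.7%
(not preserved) | 48 | 6.9%
(not preserved) | 47 | 6.8%
(not preserved) | 45 | 6.5%
(not preserved) | 45 | 6.5%
(not preserved) | 45 | 6.5%
(not preserved) | 37 | 5.3%
(not preserved) | 34 | 4.9%
(not preserved) | 23 | 3.3%
Other values (54) | 235 | 33.9%

Decimal Number
Value | Count | Frequency (%)
1 | 34 | 20.5%
2 | 25 | 15.1%
6 | 21 | 12.7%
5 | 17 | 10.2%
4 | 15 | 9.0%
7 | 13 | 7.8%
0 | 11 | 6.6%
8 | 11 | 6.6%
9 | 10 | 6.0%
3 | 9 | 5.4%

Space Separator
Value | Count | Frequency (%)
(space) | 210 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 23 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 23 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 12 | 100.0%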

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 694 | 61.5%
Common | 434 | 38.5%

Most frequent character per script

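Hangul
Value | Count | Frequency (%)
(not preserved) | 68 | 9.8%
(not preserved) | 67 | 9.7%
(not preserved) | 48 | 6.9%
(not preserved) | 47 | 6.8%
(not preserved) | 45 | 6.5%
(not preserved) | 45 | 6.5%
(not preserved) | 45 | 6.5%
(not preserved) | 37 | 5.3%
(not preserved) | 34 | 4.9%
(not preserved) | 23 | 3.3%
Other values (54) | 235 | 33.9%

Common
Value | Count | Frequency (%)
(space) | 210 | 48.4%
1 | 34 | 7.8%
2 | 25 | 5.8%
( | 23 | 5.3%
) | 23 | 5.3%
6 | 21 | 4.8%
5 | 17 | 3.9%
4 | 15 | 3.5%
7 | 13 | 3.0%
- | 12 | 2.8%
Other values (4) | 41 | 9.4%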

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 694 | 61.5%
ASCII | 434 | 38.5%

Most frequent character per block

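ASCII
Value | Count | Frequency (%)
(space) | 210 | 48.4%
1 | 34 | 7.8%
2 | 25 | 5.8%
( | 23 | 5.3%
) | 23 | 5.3%
6 | 21 | 4.8%
5 | 17 | 3.9%
4 | 15 | 3.5%
7 | 13 | 3.0%
- | 12 | 2.8%
Other values (4) | 41 | 9.4%

Hangul
Value | Count | Frequency (%)
(not preserved) | 68 | 9.8%
(not preserved) | 67 | 9.7%
(not preserved) | 48 | 6.9%
(not preserved) | 47 | 6.8%
(not preserved) | 45 | 6.5%
(not preserved) | 45 | 6.5%
(not preserved) | 45 | 6.5%
(not preserved) | 37 | 5.3%
(not preserved) | 34 | 4.9%
(not preserved) | 23 | 3.3%
Other values (54) | 235 | 33.9%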

전화번호
Text

Distinct: 44
Distinct (%): 97.8%
Missing: 0
Missing (%): 0.0%
Memory size: 492.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 540
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 43
Unique (%): 95.6%

Sample

1st row: 031-281-4446
2nd row: 031-284-4166
3rd row: 031-332-3221
4th row: 031-286-5182
5th row: 031-265-5159
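
Every value in this column is 12 characters long and uses only digits and dashes, which suggests a fixed 3-3-4 digit layout. The sketch below checks the five sample rows against that inferred pattern. The pattern is an assumption drawn from the statistics above, not something the report states:

```python
import re

# Inferred layout: three digits, dash, three digits, dash, four digits.
PHONE_PATTERN = re.compile(r"\d{3}-\d{3}-\d{4}")

samples = [
    "031-281-4446",
    "031-284-4166",
    "031-332-3221",
    "031-286-5182",
    "031-265-5159",
]

all_match = all(PHONE_PATTERN.fullmatch(s) is not None for s in samples)
lengths = {len(s) for s in samples}   # distinct string lengths among the samples
```

A check like this could flag rows that deviate from the layout before the column is parsed further.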

Value | Count | Frequency (%)
031-286-2493 | 2 | 4.4%
031-281-4446 | 1 | 2.2%
031-339-2262 | 1 | 2.2%
031-338-9583 | 1 | 2.2%
031-265-8844 | 1 | 2.2%
031-285-2245 | 1 | 2.2%
031-321-2260 | 1 | 2.2%
031-338-3686 | 1 | 2.2%
031-334-5858 | 1 | 2.2%
031-206-8581 | 1 | 2.2%
Other values (34) | 34 | 75.6%

Most occurring characters

Value | Count | Frequency (%)
3 | 103 | 19.1%
- | 90 | 16.7%
1 | 79 | 14.6%
0 | 71 | 13.1%
2 | 60 | 11.1%
8 | 35 | 6.5%
5 | 31 | 5.7%
4 | 28 | 5.2%
6 | 21 | 3.9%
9 | 12 | 2.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 450 | 83.3%
Dash Punctuation | 90 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 103 | 22.9%
1 | 79 | 17.6%
0 | 71 | 15.8%
2 | 60 | 13.3%
8 | 35 | 7.8%
5 | 31 | 6.9%
4 | 28 | 6.2%
6 | 21 | 4.7%
9 | 12 | 2.7%
7 | 10 | 2.2%

Dash Punctuation
Value | Count | Frequency (%)
- | 90 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 540 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
3 | 103 | 19.1%
- | 90 | 16.7%
1 | 79 | 14.6%
0 | 71 | 13.1%
2 | 60 | 11.1%
8 | 35 | 6.5%
5 | 31 | 5.7%
4 | 28 | 5.2%
6 | 21 | 3.9%
9 | 12 | 2.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 540 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
3 | 103 | 19.1%
- | 90 | 16.7%
1 | 79 | 14.6%
0 | 71 | 13.1%
2 | 60 | 11.1%
8 | 35 | 6.5%
5 | 31 | 5.7%
4 | 28 | 5.2%
6 | 21 | 3.9%
9 | 12 | 2.2%

Unnamed: 4
Unsupported

MISSING, REJECTED, UNSUPPORTED

Missing: 45
Missing (%): 100.0%
Memory size: 537.0 B
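
A column named `Unnamed: 4` that is empty in every row is commonly what a CSV reader produces when each line ends with a trailing delimiter. That explanation is an assumption about the source file, and the `raw` text below is a hypothetical reconstruction built from two sample rows. The sketch drops any column whose header and cells are all empty, using only the standard library:

```python
import csv
import io

# Hypothetical file content: note the trailing comma at the end of every line.
raw = (
    "구분,상 호,주 소 지,전화번호,\n"
    "LPG충전소,(주)대호에너지,경기도 용인시기흥구 영덕동 104-6외4필지호,031-281-4446,\n"
    "LPG충전소,(주)한진가스,경기도 용인시 처인구 중부대로 1148 (삼가동),031-336-4343,\n"
)

rows = list(csv.reader(io.StringIO(raw)))
header, data = rows[0], rows[1:]

# Keep a column if it has a header name or at least one non-empty cell.
keep = [
    i for i, name in enumerate(header)
    if name.strip() or any(row[i].strip() for row in data)
]
clean_header = [header[i] for i in keep]
clean_data = [[row[i] for i in keep] for row in data]
```

Removing the empty column before profiling would also bring the dataset-level missing-cell share down from 20.0% to 0%.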

Correlations

 | 구분 | 상 호 | 주 소 지 | 전화번호
구분 | 1.000 | 1.000 | 1.000 | 1.000
상 호 | 1.000 | 1.000 | 0.996 | 1.000
주 소 지 | 1.000 | 0.996 | 1.000 | 0.998
전화번호 | 1.000 | 1.000 | 0.998 | 1.000
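
The report does not say which association measure produced this matrix. For pairs of categorical columns a common choice is Cramér's V, and values at or near 1.000 are expected whenever a column is almost unique per row, as 상 호, 주 소 지, and 전화번호 are here. The sketch below implements the plain, uncorrected Cramér's V on toy data to show that effect. It is an illustration under that assumption, not a reproduction of the report's exact numbers:

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length categorical sequences."""
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    observed = Counter(zip(xs, ys))
    chi2 = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n
            chi2 += (observed[(r, c)] - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Toy data: a category column paired with a column that is unique per row.
category = ["a", "a", "b", "c"]
unique_id = ["1", "2", "3", "4"]
v_unique = cramers_v(category, unique_id)

# Toy data: two columns that are statistically independent.
v_independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
```

Because a per-row-unique column determines every other column exactly, a high value in this matrix says more about cardinality than about a meaningful relationship.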

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

 | 구분 | 상 호 | 주 소 지 | 전화번호 | Unnamed: 4
0 | LPG충전소 | (주)대호에너지 | 경기도 용인시기흥구 영덕동 104-6외4필지호 | 031-281-4446 | <NA>
1 | LPG충전소 | (주)성림가스 마북LPG충전소 | 경기도 용인시 기흥구 용구대로 2294 (마북동) | 031-284-4166 | <NA>
2 | LPG충전소 | (주)원일석유 용인(하)고속도로 주유소.충전소 | 경기도 용인시처인구 양지면 주북리 958-6호 | 031-332-3221 | <NA>
3 | LPG충전소 | (주)한미석유 기흥(하) LPG 충전소 | 경기도 용인시 기흥구 공세로 173 (공세동) | 031-286-5182 | <NA>
4 | LPG충전소 | (주)한유에너지 수지충전소 | 경기도 용인시 기흥구 신정로 609 (보정동) | 031-265-5159 | <NA>
5 | LPG충전소 | (주)한진가스 | 경기도 용인시 처인구 중부대로 1148 (삼가동) | 031-336-4343 | <NA>
6 | LPG충전소 | 경안충전소 | 경기도 용인시 처인구 모현면 백옥대로 2547 | 031-334-2427 | <NA>
7 | LPG충전소 | 고려LPG충전소 | 경기도 용인시 처인구 중부대로 1558 (마평동) | 031-321-1248 | <NA>
8 | LPG충전소 | 구갈동충전소 | 경기도 용인시 기흥구 중부대로 514 (구갈동) | 031-284-1891 | <NA>
9 | LPG충전소 | 구성그린LPG충전소 | 경기도 용인시기흥구 마북동 502-146호 | 031-283-7474 | <NA>

 | 구분 | 상 호 | 주 소 지 | 전화번호 | Unnamed: 4
35 | LPG충전소 | 유림충전소 | 경기도 용인시 처인구 백옥대로 1214 (유방동) | 031-337-3800 | <NA>
36 | LPG충전소 | 유방동LPG충전소 | 경기도 용인시 처인구 백옥대로 1260 (유방동) | 031-335-5186 | <NA>
37 | LPG충전소 | 제이에스에너지(주) | 경기도 용인시 기흥구 용구대로 1875 (보라동) | 031-285-0497 | <NA>
38 | 고압가스제조사업자 | 프렉스에어코리아(주) | 경기도 용인시 기흥구 삼성2로96번길 20 | 031-260-3000 | <NA>
39 | 고압가스제조사업자 | (주)린데코리아 | 경기도 용인시 기흥구 삼성2로96번길 23 (농서동) | 031-286-2493 | <NA>
40 | 고압가스제조사업자 | 프렉스에어코리아(주) | 경기도 용인시 기흥구 농서로 60 (농서동) | 031-260-3024 | <NA>
41 | 고압가스제조사업자 | 에어프로덕츠코리아(주) | 경기도 용인시 기흥구 농서로 48 | 031-280-2000 | <NA>
42 | 고압가스제조사업자 | (주)린데코리아 | 경기도 용인시 기흥구 농서로 60 (농서동) | 031-286-2493 | <NA>
43 | CNG충전사업자 | (주)경남씨엔지 | 경기도 용인시 처인구 남동 476-2번지 | 031-251-3721 | <NA>
44 | CNG충전사업자 | 죽전씨엔지충전소-(주)경기고속/(주)대원고속 | 경기도 용인시 처인구 모현면 오산리 274-4번지 | 031-332-1019 | <NA>