Overview

Dataset statistics

Number of variables4
Number of observations23
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory868.0 B
Average record size in memory37.7 B

Variable types

Categorical2
Text2

Dataset

Description인천광역시 동구 관내에 위치한 가스사업자 현황에 대한 데이터로, 사업종류, 구분, 상호, 주소 등의 항목을 게시하였습니다.
URLhttps://www.data.go.kr/data/15006345/fileData.do

Alerts

사업종류 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 사업종류High correlation

Reproduction

Analysis started2023-12-12 16:50:01.313470
Analysis finished2023-12-12 16:50:01.715133
Duration0.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업종류
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)8.7%
Missing0
Missing (%)0.0%
Memory size316.0 B
고압가스
19 
액화석유가스

Length

Max length6
Median length4
Mean length4.3478261
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고압가스
2nd row고압가스
3rd row고압가스
4th row고압가스
5th row고압가스

Common Values

ValueCountFrequency (%)
고압가스 19
82.6%
액화석유가스 4
 
17.4%

Length

2023-12-13T01:50:01.799511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:50:01.922626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고압가스 19
82.6%
액화석유가스 4
 
17.4%

구분
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)39.1%
Missing0
Missing (%)0.0%
Memory size316.0 B
운반자
판매소
냉동제조
저장소
충전소
Other values (4)

Length

Max length13
Median length3
Mean length4
Min length3

Unique

Unique4 ?
Unique (%)17.4%

Sample

1st row특정제조/냉동제조/저장소
2nd row특정제조/냉동제조
3rd row냉동제조
4th row일반제조/판매
5th row저장소

Common Values

ValueCountFrequency (%)
운반자 9
39.1%
판매소 4
17.4%
냉동제조 2
 
8.7%
저장소 2
 
8.7%
충전소 2
 
8.7%
특정제조/냉동제조/저장소 1
 
4.3%
특정제조/냉동제조 1
 
4.3%
일반제조/판매 1
 
4.3%
용품제조 1
 
4.3%

Length

2023-12-13T01:50:02.022977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:50:02.153718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
운반자 9
39.1%
판매소 4
17.4%
냉동제조 2
 
8.7%
저장소 2
 
8.7%
충전소 2
 
8.7%
특정제조/냉동제조/저장소 1
 
4.3%
특정제조/냉동제조 1
 
4.3%
일반제조/판매 1
 
4.3%
용품제조 1
 
4.3%

상호
Text

Distinct20
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Memory size316.0 B
2023-12-13T01:50:02.415111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length7.4347826
Min length5

Characters and Unicode

Total characters171
Distinct characters65
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)73.9%

Sample

1st row현대제철(주)
2nd row동국제강(주)
3rd rowHD현대인프라코어(주)
4th row엠에스인천가스(주)
5th row인천연료전지(주)
ValueCountFrequency (%)
한성특수가스 2
 
8.3%
대성가스텍 2
 
8.3%
대한제일연합가스 2
 
8.3%
디에이치솔루션㈜ 1
 
4.2%
현대제철(주 1
 
4.2%
엠에스인천가스㈜ 1
 
4.2%
송현충전소 1
 
4.2%
인천개인택시복지제1충전소 1
 
4.2%
㈜태성로지스 1
 
4.2%
물류 1
 
4.2%
Other values (11) 11
45.8%
2023-12-13T01:50:02.820235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
 
6.4%
8
 
4.7%
8
 
4.7%
8
 
4.7%
6
 
3.5%
6
 
3.5%
6
 
3.5%
6
 
3.5%
( 5
 
2.9%
5
 
2.9%
Other values (55) 102
59.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 151
88.3%
Other Symbol 6
 
3.5%
Open Punctuation 5
 
2.9%
Close Punctuation 5
 
2.9%
Uppercase Letter 2
 
1.2%
Space Separator 1
 
0.6%
Decimal Number 1
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
7.3%
8
 
5.3%
8
 
5.3%
8
 
5.3%
6
 
4.0%
6
 
4.0%
6
 
4.0%
5
 
3.3%
5
 
3.3%
5
 
3.3%
Other values (48) 83
55.0%
Uppercase Letter
ValueCountFrequency (%)
D 1
50.0%
H 1
50.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 157
91.8%
Common 12
 
7.0%
Latin 2
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
7.0%
8
 
5.1%
8
 
5.1%
8
 
5.1%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
5
 
3.2%
5
 
3.2%
Other values (49) 88
56.1%
Common
ValueCountFrequency (%)
( 5
41.7%
) 5
41.7%
1
 
8.3%
1 1
 
8.3%
Latin
ValueCountFrequency (%)
D 1
50.0%
H 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 151
88.3%
ASCII 14
 
8.2%
None 6
 
3.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11
 
7.3%
8
 
5.3%
8
 
5.3%
8
 
5.3%
6
 
4.0%
6
 
4.0%
6
 
4.0%
5
 
3.3%
5
 
3.3%
5
 
3.3%
Other values (48) 83
55.0%
None
ValueCountFrequency (%)
6
100.0%
ASCII
ValueCountFrequency (%)
( 5
35.7%
) 5
35.7%
1
 
7.1%
1 1
 
7.1%
D 1
 
7.1%
H 1
 
7.1%

주소
Text

Distinct20
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Memory size316.0 B
2023-12-13T01:50:03.053720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length25
Mean length20.956522
Min length15

Characters and Unicode

Total characters482
Distinct characters47
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)73.9%

Sample

1st row인천광역시 동구 중봉대로 63
2nd row인천광역시 동구 중봉대로 15
3rd row인천광역시 동구 인중로 489
4th row인천광역시 동구 방축로9번길 40
5th row인천광역시 동구 방축로 42
ValueCountFrequency (%)
인천광역시 23
23.0%
동구 23
23.0%
방축로 4
 
4.0%
중봉대로 4
 
4.0%
40 3
 
3.0%
염전로40번길 3
 
3.0%
만석로 2
 
2.0%
45 2
 
2.0%
방축로9번길 2
 
2.0%
11 2
 
2.0%
Other values (29) 32
32.0%
2023-12-13T01:50:03.447336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
78
16.2%
32
 
6.6%
24
 
5.0%
23
 
4.8%
23
 
4.8%
23
 
4.8%
23
 
4.8%
23
 
4.8%
23
 
4.8%
3 17
 
3.5%
Other values (37) 193
40.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 292
60.6%
Decimal Number 94
 
19.5%
Space Separator 78
 
16.2%
Close Punctuation 7
 
1.5%
Open Punctuation 7
 
1.5%
Other Punctuation 3
 
0.6%
Dash Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
11.0%
24
 
8.2%
23
 
7.9%
23
 
7.9%
23
 
7.9%
23
 
7.9%
23
 
7.9%
23
 
7.9%
13
 
4.5%
13
 
4.5%
Other values (22) 72
24.7%
Decimal Number
ValueCountFrequency (%)
3 17
18.1%
4 16
17.0%
1 15
16.0%
2 11
11.7%
0 10
10.6%
6 7
7.4%
7 6
 
6.4%
9 5
 
5.3%
5 4
 
4.3%
8 3
 
3.2%
Space Separator
ValueCountFrequency (%)
78
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 292
60.6%
Common 190
39.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
11.0%
24
 
8.2%
23
 
7.9%
23
 
7.9%
23
 
7.9%
23
 
7.9%
23
 
7.9%
23
 
7.9%
13
 
4.5%
13
 
4.5%
Other values (22) 72
24.7%
Common
ValueCountFrequency (%)
78
41.1%
3 17
 
8.9%
4 16
 
8.4%
1 15
 
7.9%
2 11
 
5.8%
0 10
 
5.3%
) 7
 
3.7%
6 7
 
3.7%
( 7
 
3.7%
7 6
 
3.2%
Other values (5) 16
 
8.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 292
60.6%
ASCII 190
39.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
78
41.1%
3 17
 
8.9%
4 16
 
8.4%
1 15
 
7.9%
2 11
 
5.8%
0 10
 
5.3%
) 7
 
3.7%
6 7
 
3.7%
( 7
 
3.7%
7 6
 
3.2%
Other values (5) 16
 
8.4%
Hangul
ValueCountFrequency (%)
32
11.0%
24
 
8.2%
23
 
7.9%
23
 
7.9%
23
 
7.9%
23
 
7.9%
23
 
7.9%
23
 
7.9%
13
 
4.5%
13
 
4.5%
Other values (22) 72
24.7%

Correlations

2023-12-13T01:50:03.887609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업종류구분상호주소
사업종류1.0000.7860.4551.000
구분0.7861.0000.9490.869
상호0.4550.9491.0000.976
주소1.0000.8690.9761.000
2023-12-13T01:50:03.971590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분사업종류
구분1.0000.655
사업종류0.6551.000
2023-12-13T01:50:04.059726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업종류구분
사업종류1.0000.655
구분0.6551.000

Missing values

2023-12-13T01:50:01.564031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:50:01.669767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업종류구분상호주소
0고압가스특정제조/냉동제조/저장소현대제철(주)인천광역시 동구 중봉대로 63
1고압가스특정제조/냉동제조동국제강(주)인천광역시 동구 중봉대로 15
2고압가스냉동제조HD현대인프라코어(주)인천광역시 동구 인중로 489
3고압가스일반제조/판매엠에스인천가스(주)인천광역시 동구 방축로9번길 40
4고압가스저장소인천연료전지(주)인천광역시 동구 방축로 42
5고압가스저장소인천광역시의료원인천광역시 동구 방축로 217
6고압가스판매소한성특수가스인천광역시 동구 염전로40번길 44
7고압가스판매소대성가스텍인천광역시 동구 방축로167번길 11
8고압가스판매소한국에어텍인천광역시 동구 만석로 45
9고압가스냉동제조인천광역시동구청인천광역시 동구 금곡로 67
사업종류구분상호주소
13고압가스운반자대한제일연합가스인천광역시 동구 방축로 23번길 40
14고압가스운반자대성가스텍인천광역시 동구 방축로 167번길 11
15고압가스운반자㈜한국에어텍인천광역시 동구 만석로 45
16고압가스운반자디에이치솔루션㈜인천광역시 동구 중봉대로 91, 1층(송현동)
17고압가스운반자㈜케이디에이치 물류인천광역시 동구 방축로83번길 23, 26동 3층 312호(송림동)
18고압가스운반자㈜태성로지스인천광역시 동구 방축로83번길 23, 10동236호
19액화석유가스충전소인천개인택시복지제1충전소인천광역시 동구 방축로157번길 12(송림동)
20액화석유가스충전소송현충전소인천광역시 동구 중봉대로 93(송현동)
21액화석유가스판매소대한제일연합가스인천광역시 동구 방축로23번길 40(송현동)
22액화석유가스용품제조씨앤에이치인천광역시 동구 염전로40번길 40-23 (송림동)