Overview

Dataset statistics

Number of variables6
Number of observations123
Missing cells30
Missing cells (%)4.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.0 KiB
Average record size in memory50.1 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description가스사업자에 대한 정보이며 업종(액화석유가스, 고압가스, 도시가스), 업소명, 소재지, 전화번호를 안내합니다.
URLhttps://www.data.go.kr/data/15001656/fileData.do

Alerts

사업의종류 is highly overall correlated with 세부종류High correlation
세부종류 is highly overall correlated with 사업의종류High correlation
전화번호 has 30 (24.4%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:14:46.516134
Analysis finished2023-12-12 17:14:47.447550
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct123
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean66.487805
Minimum1
Maximum130
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2023-12-13T02:14:47.508895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.1
Q132.5
median69
Q399.5
95-th percentile123.9
Maximum130
Range129
Interquartile range (IQR)67

Descriptive statistics

Standard deviation38.329966
Coefficient of variation (CV)0.57649619
Kurtosis-1.2493019
Mean66.487805
Median Absolute Deviation (MAD)34
Skewness-0.056461244
Sum8178
Variance1469.1863
MonotonicityStrictly increasing
2023-12-13T02:14:47.631926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.8%
86 1
 
0.8%
99 1
 
0.8%
98 1
 
0.8%
97 1
 
0.8%
96 1
 
0.8%
95 1
 
0.8%
94 1
 
0.8%
93 1
 
0.8%
92 1
 
0.8%
Other values (113) 113
91.9%
ValueCountFrequency (%)
1 1
0.8%
2 1
0.8%
3 1
0.8%
4 1
0.8%
5 1
0.8%
6 1
0.8%
7 1
0.8%
8 1
0.8%
9 1
0.8%
10 1
0.8%
ValueCountFrequency (%)
130 1
0.8%
129 1
0.8%
128 1
0.8%
127 1
0.8%
126 1
0.8%
125 1
0.8%
124 1
0.8%
123 1
0.8%
122 1
0.8%
121 1
0.8%

사업의종류
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
고압가스 제조
37 
고압가스 판매
27 
판매사업
23 
고압가스 저장소
15 
충전사업
12 

Length

Max length8
Median length7
Mean length6.3414634
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고압가스 저장소
2nd row고압가스 제조
3rd row고압가스 제조
4th row고압가스 제조
5th row고압가스 제조

Common Values

ValueCountFrequency (%)
고압가스 제조 37
30.1%
고압가스 판매 27
22.0%
판매사업 23
18.7%
고압가스 저장소 15
12.2%
충전사업 12
 
9.8%
가스용품제조사업 9
 
7.3%

Length

2023-12-13T02:14:47.766558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:14:47.881476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고압가스 79
39.1%
제조 37
18.3%
판매 27
 
13.4%
판매사업 23
 
11.4%
저장소 15
 
7.4%
충전사업 12
 
5.9%
가스용품제조사업 9
 
4.5%

세부종류
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
<NA>
59 
일반
27 
냉동
20 
충전
17 

Length

Max length4
Median length2
Mean length2.9593496
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row냉동
3rd row냉동
4th row냉동
5th row냉동

Common Values

ValueCountFrequency (%)
<NA> 59
48.0%
일반 27
22.0%
냉동 20
 
16.3%
충전 17
 
13.8%

Length

2023-12-13T02:14:48.006738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:14:48.104314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 59
48.0%
일반 27
22.0%
냉동 20
 
16.3%
충전 17
 
13.8%
Distinct98
Distinct (%)79.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T02:14:48.400579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length7.8455285
Min length4

Characters and Unicode

Total characters965
Distinct characters143
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique82 ?
Unique (%)66.7%

Sample

1st row(주)부성철강산업
2nd row(주)특수건설 시점부
3rd row(주)특수건설 종점부
4th row(주)진양화성
5th row(주)진양화성
ValueCountFrequency (%)
주식회사 6
 
4.2%
주)임창 4
 
2.8%
협신산업가스(주 4
 
2.8%
금강화학 3
 
2.1%
주)대유코아 3
 
2.1%
부산지점 3
 
2.1%
가스텍코리아(주 3
 
2.1%
주)엠에스동남산업가스 3
 
2.1%
한성산업(주 3
 
2.1%
상일가스 2
 
1.4%
Other values (99) 110
76.4%
2023-12-13T02:14:48.850968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
71
 
7.4%
( 64
 
6.6%
) 64
 
6.6%
62
 
6.4%
51
 
5.3%
45
 
4.7%
25
 
2.6%
23
 
2.4%
23
 
2.4%
21
 
2.2%
Other values (133) 516
53.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 802
83.1%
Open Punctuation 64
 
6.6%
Close Punctuation 64
 
6.6%
Space Separator 21
 
2.2%
Decimal Number 7
 
0.7%
Uppercase Letter 7
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
71
 
8.9%
62
 
7.7%
51
 
6.4%
45
 
5.6%
25
 
3.1%
23
 
2.9%
23
 
2.9%
21
 
2.6%
20
 
2.5%
20
 
2.5%
Other values (122) 441
55.0%
Uppercase Letter
ValueCountFrequency (%)
M 2
28.6%
S 2
28.6%
L 1
14.3%
P 1
14.3%
G 1
14.3%
Decimal Number
ValueCountFrequency (%)
1 4
57.1%
9 2
28.6%
2 1
 
14.3%
Open Punctuation
ValueCountFrequency (%)
( 64
100.0%
Close Punctuation
ValueCountFrequency (%)
) 64
100.0%
Space Separator
ValueCountFrequency (%)
21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 802
83.1%
Common 156
 
16.2%
Latin 7
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
71
 
8.9%
62
 
7.7%
51
 
6.4%
45
 
5.6%
25
 
3.1%
23
 
2.9%
23
 
2.9%
21
 
2.6%
20
 
2.5%
20
 
2.5%
Other values (122) 441
55.0%
Common
ValueCountFrequency (%)
( 64
41.0%
) 64
41.0%
21
 
13.5%
1 4
 
2.6%
9 2
 
1.3%
2 1
 
0.6%
Latin
ValueCountFrequency (%)
M 2
28.6%
S 2
28.6%
L 1
14.3%
P 1
14.3%
G 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 802
83.1%
ASCII 163
 
16.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
71
 
8.9%
62
 
7.7%
51
 
6.4%
45
 
5.6%
25
 
3.1%
23
 
2.9%
23
 
2.9%
21
 
2.6%
20
 
2.5%
20
 
2.5%
Other values (122) 441
55.0%
ASCII
ValueCountFrequency (%)
( 64
39.3%
) 64
39.3%
21
 
12.9%
1 4
 
2.5%
M 2
 
1.2%
S 2
 
1.2%
9 2
 
1.2%
2 1
 
0.6%
L 1
 
0.6%
P 1
 
0.6%
Distinct88
Distinct (%)71.5%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2023-12-13T02:14:49.174026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length33
Mean length26.593496
Min length21

Characters and Unicode

Total characters3271
Distinct characters89
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)53.7%

Sample

1st row부산광역시 사상구 장인로37번길 105 (감전동)
2nd row부산광역시 사상구 학감대로 242, 사상구청 (감전동)
3rd row부산광역시 사상구 학감대로 242, 사상구청 (감전동)
4th row부산광역시 사상구 학장로 237-10 (학장동)
5th row부산광역시 사상구 학장로 237-10 (학장동)
ValueCountFrequency (%)
부산광역시 121
19.3%
사상구 121
19.3%
감전동 44
 
7.0%
학장동 37
 
5.9%
낙동대로 18
 
2.9%
모라동 9
 
1.4%
삼락동 9
 
1.4%
학감대로 8
 
1.3%
주례동 8
 
1.3%
덕포동 7
 
1.1%
Other values (135) 246
39.2%
2023-12-13T02:14:49.652998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
512
 
15.7%
159
 
4.9%
140
 
4.3%
138
 
4.2%
127
 
3.9%
125
 
3.8%
) 124
 
3.8%
( 124
 
3.8%
123
 
3.8%
122
 
3.7%
Other values (79) 1577
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1973
60.3%
Space Separator 512
 
15.7%
Decimal Number 506
 
15.5%
Close Punctuation 124
 
3.8%
Open Punctuation 124
 
3.8%
Dash Punctuation 15
 
0.5%
Other Punctuation 10
 
0.3%
Uppercase Letter 7
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
159
 
8.1%
140
 
7.1%
138
 
7.0%
127
 
6.4%
125
 
6.3%
123
 
6.2%
122
 
6.2%
121
 
6.1%
121
 
6.1%
121
 
6.1%
Other values (59) 676
34.3%
Decimal Number
ValueCountFrequency (%)
1 95
18.8%
2 70
13.8%
3 50
9.9%
7 50
9.9%
4 48
9.5%
6 45
8.9%
8 44
8.7%
5 39
7.7%
9 34
 
6.7%
0 31
 
6.1%
Uppercase Letter
ValueCountFrequency (%)
K 3
42.9%
S 2
28.6%
G 1
 
14.3%
E 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
, 9
90.0%
& 1
 
10.0%
Space Separator
ValueCountFrequency (%)
512
100.0%
Close Punctuation
ValueCountFrequency (%)
) 124
100.0%
Open Punctuation
ValueCountFrequency (%)
( 124
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1973
60.3%
Common 1291
39.5%
Latin 7
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
159
 
8.1%
140
 
7.1%
138
 
7.0%
127
 
6.4%
125
 
6.3%
123
 
6.2%
122
 
6.2%
121
 
6.1%
121
 
6.1%
121
 
6.1%
Other values (59) 676
34.3%
Common
ValueCountFrequency (%)
512
39.7%
) 124
 
9.6%
( 124
 
9.6%
1 95
 
7.4%
2 70
 
5.4%
3 50
 
3.9%
7 50
 
3.9%
4 48
 
3.7%
6 45
 
3.5%
8 44
 
3.4%
Other values (6) 129
 
10.0%
Latin
ValueCountFrequency (%)
K 3
42.9%
S 2
28.6%
G 1
 
14.3%
E 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1973
60.3%
ASCII 1298
39.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
512
39.4%
) 124
 
9.6%
( 124
 
9.6%
1 95
 
7.3%
2 70
 
5.4%
3 50
 
3.9%
7 50
 
3.9%
4 48
 
3.7%
6 45
 
3.5%
8 44
 
3.4%
Other values (10) 136
 
10.5%
Hangul
ValueCountFrequency (%)
159
 
8.1%
140
 
7.1%
138
 
7.0%
127
 
6.4%
125
 
6.3%
123
 
6.2%
122
 
6.2%
121
 
6.1%
121
 
6.1%
121
 
6.1%
Other values (59) 676
34.3%

전화번호
Text

MISSING 

Distinct76
Distinct (%)81.7%
Missing30
Missing (%)24.4%
Memory size1.1 KiB
2023-12-13T02:14:49.936329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1116
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique63 ?
Unique (%)67.7%

Sample

1st row051-327-6995
2nd row031-455-5858
3rd row031-455-5858
4th row051-317-2777
5th row051-314-7070
ValueCountFrequency (%)
051-324-5030 4
 
4.3%
051-327-2261 3
 
3.2%
051-305-5353 3
 
3.2%
051-323-2133 2
 
2.2%
051-313-4631 2
 
2.2%
051-324-7001 2
 
2.2%
051-311-2706 2
 
2.2%
051-366-5385 2
 
2.2%
051-312-2825 2
 
2.2%
051-327-7726 2
 
2.2%
Other values (66) 69
74.2%
2023-12-13T02:14:50.443927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 186
16.7%
0 179
16.0%
1 174
15.6%
5 168
15.1%
3 133
11.9%
2 97
8.7%
7 52
 
4.7%
8 38
 
3.4%
6 37
 
3.3%
4 36
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 930
83.3%
Dash Punctuation 186
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 179
19.2%
1 174
18.7%
5 168
18.1%
3 133
14.3%
2 97
10.4%
7 52
 
5.6%
8 38
 
4.1%
6 37
 
4.0%
4 36
 
3.9%
9 16
 
1.7%
Dash Punctuation
ValueCountFrequency (%)
- 186
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1116
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 186
16.7%
0 179
16.0%
1 174
15.6%
5 168
15.1%
3 133
11.9%
2 97
8.7%
7 52
 
4.7%
8 38
 
3.4%
6 37
 
3.3%
4 36
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1116
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 186
16.7%
0 179
16.0%
1 174
15.6%
5 168
15.1%
3 133
11.9%
2 97
8.7%
7 52
 
4.7%
8 38
 
3.4%
6 37
 
3.3%
4 36
 
3.2%

Interactions

2023-12-13T02:14:47.160598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:14:50.583454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업의종류세부종류업소명사업소소재지전화번호
순번1.0000.7280.5160.8670.8140.940
사업의종류0.7281.0000.8660.8780.7600.853
세부종류0.5160.8661.0000.8860.8700.939
업소명0.8670.8780.8861.0001.0001.000
사업소소재지0.8140.7600.8701.0001.0000.999
전화번호0.9400.8530.9391.0000.9991.000
2023-12-13T02:14:50.713992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세부종류사업의종류
세부종류1.0000.557
사업의종류0.5571.000
2023-12-13T02:14:50.807455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업의종류세부종류
순번1.0000.4860.388
사업의종류0.4861.0000.557
세부종류0.3880.5571.000

Missing values

2023-12-13T02:14:47.287270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:14:47.413631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번사업의종류세부종류업소명사업소소재지전화번호
01고압가스 저장소<NA>(주)부성철강산업부산광역시 사상구 장인로37번길 105 (감전동)051-327-6995
12고압가스 제조냉동(주)특수건설 시점부부산광역시 사상구 학감대로 242, 사상구청 (감전동)<NA>
23고압가스 제조냉동(주)특수건설 종점부부산광역시 사상구 학감대로 242, 사상구청 (감전동)<NA>
34고압가스 제조냉동(주)진양화성부산광역시 사상구 학장로 237-10 (학장동)<NA>
45고압가스 제조냉동(주)진양화성부산광역시 사상구 학장로 237-10 (학장동)<NA>
56고압가스 제조냉동스마트레일주식회사서울특별시 종로구 인사동7길 32, SK건설빌딩 (관훈동)031-455-5858
67고압가스 제조냉동스마트레일주식회사서울특별시 종로구 인사동7길 32, SK건설빌딩 (관훈동)031-455-5858
78고압가스 저장소일반화성종합가스부산광역시 사상구 장인로78번길 13 (학장동)<NA>
89고압가스 제조충전대도하이젠(주) (복합수소충전소)부산광역시 사상구 학감대로 167 (학장동)051-317-2777
910고압가스 제조충전(주)대광선박설비부산광역시 사상구 낙동대로970번길 78 (감전동)051-314-7070
순번사업의종류세부종류업소명사업소소재지전화번호
113121판매사업<NA>한국가스부산광역시 사상구 가야대로 157 (감전동)<NA>
114122판매사업<NA>칠성에너지기업사부산광역시 사상구 낙동대로 878 (감전동)051-325-1010
115123충전사업<NA>팔성가스충전소부산광역시 사상구 가야대로 75 (감전동)051-314-3555
116124판매사업<NA>부산가스부산광역시 사상구 가야대로282번길 44 (주례동)<NA>
117125충전사업<NA>극동유화(주)감전충전부산광역시 사상구 낙동대로 878 (감전동)<NA>
118126충전사업<NA>극동유화(주)주례충전소부산광역시 사상구 가야대로 203 (주례동)<NA>
119127충전사업<NA>주식회사 엠에스가스부산광역시 사상구 가야대로176번길 92 (학장동)<NA>
120128판매사업<NA>태광가스상사부산광역시 사상구 사상로 568 (모라동)<NA>
121129판매사업<NA>충렬가스부산광역시 사상구 사상로 74 (주례동)<NA>
122130충전사업<NA>광신가스(주)부산광역시 사상구 낙동대로 939 (감전동)051-311-0131