Overview

Dataset statistics

Number of variables6
Number of observations159
Missing cells31
Missing cells (%)3.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.7 KiB
Average record size in memory49.8 B

Variable types

Numeric1
Text2
Categorical2
DateTime1

Dataset

Description경기도 평택시 고압가스 저장소 현황에 대한 데이터로 상호명, 소재지지번주소, 소재지도로명주소, 고압가스종류 등 의 정보를 제공합니다. ※문의 : 평택시 일자리경제과(031-8024-3586)
URLhttps://www.data.go.kr/data/15033480/fileData.do

Alerts

데이터기준일 has constant value ""Constant
연번 is highly overall correlated with 소재지지번주소High correlation
소재지지번주소 is highly overall correlated with 연번High correlation
소재지지번주소 is highly imbalanced (66.4%)Imbalance
소재지도로명주소 has 31 (19.5%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 00:19:34.995686
Analysis finished2023-12-12 00:19:35.699247
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct159
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean80
Minimum1
Maximum159
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-12T09:19:35.781750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.9
Q140.5
median80
Q3119.5
95-th percentile151.1
Maximum159
Range158
Interquartile range (IQR)79

Descriptive statistics

Standard deviation46.043458
Coefficient of variation (CV)0.57554322
Kurtosis-1.2
Mean80
Median Absolute Deviation (MAD)40
Skewness0
Sum12720
Variance2120
MonotonicityStrictly increasing
2023-12-12T09:19:36.008092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.6%
2 1
 
0.6%
103 1
 
0.6%
104 1
 
0.6%
105 1
 
0.6%
106 1
 
0.6%
107 1
 
0.6%
108 1
 
0.6%
109 1
 
0.6%
110 1
 
0.6%
Other values (149) 149
93.7%
ValueCountFrequency (%)
1 1
0.6%
2 1
0.6%
3 1
0.6%
4 1
0.6%
5 1
0.6%
6 1
0.6%
7 1
0.6%
8 1
0.6%
9 1
0.6%
10 1
0.6%
ValueCountFrequency (%)
159 1
0.6%
158 1
0.6%
157 1
0.6%
156 1
0.6%
155 1
0.6%
154 1
0.6%
153 1
0.6%
152 1
0.6%
151 1
0.6%
150 1
0.6%
Distinct122
Distinct (%)76.7%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
2023-12-12T09:19:36.314917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length20
Mean length12.559748
Min length4

Characters and Unicode

Total characters1997
Distinct characters214
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique101 ?
Unique (%)63.5%

Sample

1st row삼성전자(주) [4라인 복합동 소화설비]
2nd row삼성전자(주) [제2방재센터]
3rd row린데코리아(주) 평택공장
4th row삼성전자(주) [4라인 FAB동 소화설비]
5th row삼성전자(주) [4라인 154kv동 소화설비]
ValueCountFrequency (%)
삼성전자(주 27
 
10.0%
소화설비 16
 
5.9%
2라인 12
 
4.4%
린데코리아(주 10
 
3.7%
에어프로덕츠코리아(주 9
 
3.3%
평택사업장 8
 
3.0%
주)원익아이피에스 7
 
2.6%
주식회사 6
 
2.2%
한국서부발전(주 5
 
1.8%
3라인 5
 
1.8%
Other values (121) 166
61.3%
2023-12-12T09:19:36.757352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
142
 
7.1%
( 140
 
7.0%
) 140
 
7.0%
112
 
5.6%
55
 
2.8%
[ 47
 
2.4%
] 47
 
2.4%
45
 
2.3%
43
 
2.2%
41
 
2.1%
Other values (204) 1185
59.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1397
70.0%
Open Punctuation 187
 
9.4%
Close Punctuation 187
 
9.4%
Space Separator 112
 
5.6%
Uppercase Letter 55
 
2.8%
Decimal Number 48
 
2.4%
Lowercase Letter 8
 
0.4%
Other Number 2
 
0.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
142
 
10.2%
55
 
3.9%
45
 
3.2%
43
 
3.1%
41
 
2.9%
36
 
2.6%
35
 
2.5%
34
 
2.4%
33
 
2.4%
31
 
2.2%
Other values (172) 902
64.6%
Uppercase Letter
ValueCountFrequency (%)
C 8
14.5%
H 5
9.1%
T 5
9.1%
F 5
9.1%
S 5
9.1%
V 4
 
7.3%
O 4
 
7.3%
K 3
 
5.5%
U 3
 
5.5%
N 3
 
5.5%
Other values (4) 10
18.2%
Lowercase Letter
ValueCountFrequency (%)
k 2
25.0%
e 2
25.0%
n 1
12.5%
a 1
12.5%
v 1
12.5%
i 1
12.5%
Decimal Number
ValueCountFrequency (%)
2 21
43.8%
4 10
20.8%
3 8
 
16.7%
1 6
 
12.5%
5 3
 
6.2%
Open Punctuation
ValueCountFrequency (%)
( 140
74.9%
[ 47
 
25.1%
Close Punctuation
ValueCountFrequency (%)
) 140
74.9%
] 47
 
25.1%
Space Separator
ValueCountFrequency (%)
112
100.0%
Other Number
ValueCountFrequency (%)
2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1397
70.0%
Common 537
 
26.9%
Latin 63
 
3.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
142
 
10.2%
55
 
3.9%
45
 
3.2%
43
 
3.1%
41
 
2.9%
36
 
2.6%
35
 
2.5%
34
 
2.4%
33
 
2.4%
31
 
2.2%
Other values (172) 902
64.6%
Latin
ValueCountFrequency (%)
C 8
12.7%
H 5
 
7.9%
T 5
 
7.9%
F 5
 
7.9%
S 5
 
7.9%
V 4
 
6.3%
O 4
 
6.3%
K 3
 
4.8%
U 3
 
4.8%
N 3
 
4.8%
Other values (10) 18
28.6%
Common
ValueCountFrequency (%)
( 140
26.1%
) 140
26.1%
112
20.9%
[ 47
 
8.8%
] 47
 
8.8%
2 21
 
3.9%
4 10
 
1.9%
3 8
 
1.5%
1 6
 
1.1%
5 3
 
0.6%
Other values (2) 3
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1397
70.0%
ASCII 598
29.9%
None 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
142
 
10.2%
55
 
3.9%
45
 
3.2%
43
 
3.1%
41
 
2.9%
36
 
2.6%
35
 
2.5%
34
 
2.4%
33
 
2.4%
31
 
2.2%
Other values (172) 902
64.6%
ASCII
ValueCountFrequency (%)
( 140
23.4%
) 140
23.4%
112
18.7%
[ 47
 
7.9%
] 47
 
7.9%
2 21
 
3.5%
4 10
 
1.7%
C 8
 
1.3%
3 8
 
1.3%
1 6
 
1.0%
Other values (21) 59
9.9%
None
ValueCountFrequency (%)
2
100.0%

소재지지번주소
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct16
Distinct (%)10.1%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
<NA>
128 
경기도 평택시 고덕면 방축리 162 외 30필지(저장소 위치 : 고덕면 방축리 산163)
 
8
경기도 평택시 고덕면 여염리 1673
 
8
경기도 평택시 고덕동 1695 삼성전자(주) 평택캠퍼스
 
3
경기도 평택시 칠괴동 577-4
 
1
Other values (11)
 
11

Length

Max length59
Median length4
Mean length9.245283
Min length4

Unique

Unique12 ?
Unique (%)7.5%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 128
80.5%
경기도 평택시 고덕면 방축리 162 외 30필지(저장소 위치 : 고덕면 방축리 산163) 8
 
5.0%
경기도 평택시 고덕면 여염리 1673 8
 
5.0%
경기도 평택시 고덕동 1695 삼성전자(주) 평택캠퍼스 3
 
1.9%
경기도 평택시 칠괴동 577-4 1
 
0.6%
경기도 평택시 고덕면 방축리 산 162 외 30필지(저장소 설치위치 : 평택시 지제동 산49 외 3필지) 1
 
0.6%
경기도 평택시 포승읍 내기리 680-1 1
 
0.6%
경기도 평택시 청북읍 고잔리 1361-3 외 2필지 1
 
0.6%
경기도 평택시 합정동 883 1
 
0.6%
경기도 평택시 칠괴동 585-2 1
 
0.6%
Other values (6) 6
 
3.8%

Length

2023-12-12T09:19:36.908738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 128
36.6%
평택시 32
 
9.1%
경기도 31
 
8.9%
고덕면 25
 
7.1%
방축리 17
 
4.9%
11
 
3.1%
30필지(저장소 9
 
2.6%
9
 
2.6%
162 9
 
2.6%
위치 8
 
2.3%
Other values (39) 71
20.3%
Distinct65
Distinct (%)50.8%
Missing31
Missing (%)19.5%
Memory size1.4 KiB
2023-12-12T09:19:37.157657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length35
Mean length25.695312
Min length18

Characters and Unicode

Total characters3289
Distinct characters116
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)35.2%

Sample

1st row경기도 평택시 삼성로 114, 삼성전자(주) 평택캠퍼스 (고덕동)
2nd row경기도 평택시 삼성로 114, 삼성전자(주) 평택캠퍼스 (고덕동)
3rd row경기도 평택시 삼성1로 86 (고덕동)
4th row경기도 평택시 삼성로 114, 삼성전자(주) 평택캠퍼스 (고덕동)
5th row경기도 평택시 삼성로 114, 삼성전자(주) 평택캠퍼스 (고덕동)
ValueCountFrequency (%)
경기도 128
18.1%
평택시 128
18.1%
포승읍 29
 
4.1%
고덕동 25
 
3.5%
114 24
 
3.4%
삼성로 24
 
3.4%
평택캠퍼스 24
 
3.4%
삼성전자(주 24
 
3.4%
진위면 23
 
3.2%
경기대로 13
 
1.8%
Other values (124) 266
37.6%
2023-12-12T09:19:37.620668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
581
 
17.7%
159
 
4.8%
159
 
4.8%
1 143
 
4.3%
141
 
4.3%
141
 
4.3%
130
 
4.0%
128
 
3.9%
110
 
3.3%
( 78
 
2.4%
Other values (106) 1519
46.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2052
62.4%
Space Separator 581
 
17.7%
Decimal Number 444
 
13.5%
Open Punctuation 78
 
2.4%
Close Punctuation 78
 
2.4%
Other Punctuation 34
 
1.0%
Dash Punctuation 22
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
159
 
7.7%
159
 
7.7%
141
 
6.9%
141
 
6.9%
130
 
6.3%
128
 
6.2%
110
 
5.4%
60
 
2.9%
57
 
2.8%
50
 
2.4%
Other values (91) 917
44.7%
Decimal Number
ValueCountFrequency (%)
1 143
32.2%
2 64
14.4%
4 47
 
10.6%
5 38
 
8.6%
3 35
 
7.9%
7 32
 
7.2%
9 27
 
6.1%
8 26
 
5.9%
0 23
 
5.2%
6 9
 
2.0%
Space Separator
ValueCountFrequency (%)
581
100.0%
Open Punctuation
ValueCountFrequency (%)
( 78
100.0%
Close Punctuation
ValueCountFrequency (%)
) 78
100.0%
Other Punctuation
ValueCountFrequency (%)
, 34
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2052
62.4%
Common 1237
37.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
159
 
7.7%
159
 
7.7%
141
 
6.9%
141
 
6.9%
130
 
6.3%
128
 
6.2%
110
 
5.4%
60
 
2.9%
57
 
2.8%
50
 
2.4%
Other values (91) 917
44.7%
Common
ValueCountFrequency (%)
581
47.0%
1 143
 
11.6%
( 78
 
6.3%
) 78
 
6.3%
2 64
 
5.2%
4 47
 
3.8%
5 38
 
3.1%
3 35
 
2.8%
, 34
 
2.7%
7 32
 
2.6%
Other values (5) 107
 
8.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2052
62.4%
ASCII 1237
37.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
581
47.0%
1 143
 
11.6%
( 78
 
6.3%
) 78
 
6.3%
2 64
 
5.2%
4 47
 
3.8%
5 38
 
3.1%
3 35
 
2.8%
, 34
 
2.7%
7 32
 
2.6%
Other values (5) 107
 
8.6%
Hangul
ValueCountFrequency (%)
159
 
7.7%
159
 
7.7%
141
 
6.9%
141
 
6.9%
130
 
6.3%
128
 
6.2%
110
 
5.4%
60
 
2.9%
57
 
2.8%
50
 
2.4%
Other values (91) 917
44.7%
Distinct24
Distinct (%)15.1%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
질소
47 
기타
46 
탄산
산소
수소
Other values (19)
41 

Length

Max length14
Median length2
Mean length2.9811321
Min length2

Unique

Unique11 ?
Unique (%)6.9%

Sample

1st row질소
2nd row기타
3rd row기타
4th row질소
5th row질소

Common Values

ValueCountFrequency (%)
질소 47
29.6%
기타 46
28.9%
탄산 9
 
5.7%
산소 8
 
5.0%
수소 8
 
5.0%
아르곤 7
 
4.4%
액화암모니아 4
 
2.5%
탄산가스+기타 4
 
2.5%
질소+아르곤 4
 
2.5%
질소+기타 3
 
1.9%
Other values (14) 19
11.9%

Length

2023-12-12T09:19:37.768182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
질소 47
29.6%
기타 46
28.9%
탄산 9
 
5.7%
산소 8
 
5.0%
수소 8
 
5.0%
아르곤 7
 
4.4%
액화암모니아 4
 
2.5%
탄산가스+기타 4
 
2.5%
질소+아르곤 4
 
2.5%
질소+기타 3
 
1.9%
Other values (14) 19
11.9%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 KiB
Minimum2023-07-07 00:00:00
Maximum2023-07-07 00:00:00
2023-12-12T09:19:37.883568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:19:37.970942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T09:19:35.368454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:19:38.039159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소재지지번주소소재지도로명주소고압가스종류
연번1.0000.9120.9060.538
소재지지번주소0.9121.000NaN0.734
소재지도로명주소0.906NaN1.0000.000
고압가스종류0.5380.7340.0001.000
2023-12-12T09:19:38.134889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지지번주소고압가스종류
소재지지번주소1.0000.322
고압가스종류0.3221.000
2023-12-12T09:19:38.217235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소재지지번주소고압가스종류
연번1.0000.5780.220
소재지지번주소0.5781.0000.322
고압가스종류0.2200.3221.000

Missing values

2023-12-12T09:19:35.520424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:19:35.654454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호명소재지지번주소소재지도로명주소고압가스종류데이터기준일
01삼성전자(주) [4라인 복합동 소화설비]<NA>경기도 평택시 삼성로 114, 삼성전자(주) 평택캠퍼스 (고덕동)질소2023-07-07
12삼성전자(주) [제2방재센터]<NA>경기도 평택시 삼성로 114, 삼성전자(주) 평택캠퍼스 (고덕동)기타2023-07-07
23린데코리아(주) 평택공장<NA>경기도 평택시 삼성1로 86 (고덕동)기타2023-07-07
34삼성전자(주) [4라인 FAB동 소화설비]<NA>경기도 평택시 삼성로 114, 삼성전자(주) 평택캠퍼스 (고덕동)질소2023-07-07
45삼성전자(주) [4라인 154kv동 소화설비]<NA>경기도 평택시 삼성로 114, 삼성전자(주) 평택캠퍼스 (고덕동)질소2023-07-07
56삼성전자(주) [4라인 인프라복합동]<NA>경기도 평택시 삼성로 114, 삼성전자(주) 평택캠퍼스 (고덕동)질소2023-07-07
67삼성전자(주) [3라인 질소탱크]<NA>경기도 평택시 삼성로 114, 삼성전자(주) 평택캠퍼스 (고덕동)기타2023-07-07
78삼성전자(주) [4라인 그린동 소화설비]<NA>경기도 평택시 삼성로 114, 삼성전자(주) 평택캠퍼스 (고덕동)질소2023-07-07
89(주)원익홀딩스 [헬륨]<NA>경기도 평택시 진위면 마산12로 21, 원익홀딩스 진위사업장기타2023-07-07
910(주)원익홀딩스 [액화산소]<NA>경기도 평택시 진위면 마산12로 21, 원익홀딩스 진위사업장산소2023-07-07
연번상호명소재지지번주소소재지도로명주소고압가스종류데이터기준일
149150에이지(주)<NA>경기도 평택시 팽성읍 추팔산단1길 213, 에이지(주)수소2023-07-07
150151SK가스(주)<NA>경기도 평택시 포승읍 남양만로 138질소2023-07-07
151152신흥특수기계경기도 평택시 모곡동 441-7<NA>탄산2023-07-07
152153송탄가스<NA>경기도 평택시 서정남로 34-10 (서정동)기타2023-07-07
153154유천정수장<NA>경기도 평택시 경기대로 18-34 (유천동)액화염소2023-07-07
154155아시아첨가제<NA>경기도 평택시 세교산단로22번길 21 (세교동)질소2023-07-07
155156송탄정수장<NA>경기도 평택시 진위면 신리길 214액화염소2023-07-07
156157좌동가스경기도 평택시 서정동 271-6<NA>아세틸렌2023-07-07
157158매일유업<NA>경기도 평택시 진위면 진위서로 63질소2023-07-07
158159연흥가스경기도 평택시 지산동 460-1<NA>산소2023-07-07