Overview

Dataset statistics

Number of variables6
Number of observations107
Missing cells9
Missing cells (%)1.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.2 KiB
Average record size in memory50.2 B

Variable types

Numeric1
Categorical1
Text3
DateTime1

Dataset

Description인천광역시 서구에 위치한 고압가스사업자 현황에 관한 데이터셋입니다. 인천광역시 서구에 위치한 고압가스사업자 현황의 인허가 구분, 상호, 도로명 주소, 전화번호에 관한 정보를 포함하고 있습니다.
Author인천광역시 서구
URLhttps://www.data.go.kr/data/15090764/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 연번High correlation
전화번호 has 9 (8.4%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:34:01.466669
Analysis finished2023-12-12 19:34:02.670655
Duration1.2 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct107
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54
Minimum1
Maximum107
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-13T04:34:02.770408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.3
Q127.5
median54
Q380.5
95-th percentile101.7
Maximum107
Range106
Interquartile range (IQR)53

Descriptive statistics

Standard deviation31.032241
Coefficient of variation (CV)0.57467114
Kurtosis-1.2
Mean54
Median Absolute Deviation (MAD)27
Skewness0
Sum5778
Variance963
MonotonicityStrictly increasing
2023-12-13T04:34:02.935988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
69 1
 
0.9%
80 1
 
0.9%
79 1
 
0.9%
78 1
 
0.9%
77 1
 
0.9%
76 1
 
0.9%
75 1
 
0.9%
74 1
 
0.9%
73 1
 
0.9%
Other values (97) 97
90.7%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
107 1
0.9%
106 1
0.9%
105 1
0.9%
104 1
0.9%
103 1
0.9%
102 1
0.9%
101 1
0.9%
100 1
0.9%
99 1
0.9%
98 1
0.9%

구분
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size988.0 B
고압가스 저장
24 
고압가스 제조
21 
용기 등 제조
20 
고압가스 냉동제조 허가시설
13 
고압가스 냉동제조 신고시설
11 
Other values (2)
18 

Length

Max length19
Median length7
Mean length9.5607477
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고압가스 제조
2nd row고압가스 제조
3rd row고압가스 제조
4th row고압가스 제조
5th row고압가스 제조

Common Values

ValueCountFrequency (%)
고압가스 저장 24
22.4%
고압가스 제조 21
19.6%
용기 등 제조 20
18.7%
고압가스 냉동제조 허가시설 13
12.1%
고압가스 냉동제조 신고시설 11
10.3%
고압가스 판매소 10
9.3%
고압가스 냉동제조 허가 및 신고시설 8
 
7.5%

Length

2023-12-13T04:34:03.100230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:34:03.238814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고압가스 87
30.9%
제조 41
14.5%
냉동제조 32
 
11.3%
저장 24
 
8.5%
용기 20
 
7.1%
20
 
7.1%
신고시설 19
 
6.7%
허가시설 13
 
4.6%
판매소 10
 
3.5%
허가 8
 
2.8%

상호
Text

Distinct92
Distinct (%)86.0%
Missing0
Missing (%)0.0%
Memory size988.0 B
2023-12-13T04:34:03.578000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length8.2616822
Min length4

Characters and Unicode

Total characters884
Distinct characters179
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique79 ?
Unique (%)73.8%

Sample

1st rowSK인천석유화학(주)
2nd row(주)엘피지뱅크
3rd row이화산소(주)
4th row삼정가스공업(주)
5th row동서화학(주)
ValueCountFrequency (%)
주)세안 3
 
2.6%
포스코에너지(주 3
 
2.6%
한국중부발전(주 2
 
1.7%
주)가스텍 2
 
1.7%
주)동성엔지니어링 2
 
1.7%
주)머큐리 2
 
1.7%
국제성모병원 2
 
1.7%
인천발전본부 2
 
1.7%
주)한화 2
 
1.7%
kg동부제철(주 2
 
1.7%
Other values (89) 94
81.0%
2023-12-13T04:34:04.148402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 80
 
9.0%
) 80
 
9.0%
80
 
9.0%
30
 
3.4%
22
 
2.5%
16
 
1.8%
16
 
1.8%
15
 
1.7%
14
 
1.6%
14
 
1.6%
Other values (169) 517
58.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 697
78.8%
Open Punctuation 80
 
9.0%
Close Punctuation 80
 
9.0%
Space Separator 10
 
1.1%
Other Symbol 9
 
1.0%
Uppercase Letter 8
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
80
 
11.5%
30
 
4.3%
22
 
3.2%
16
 
2.3%
16
 
2.3%
15
 
2.2%
14
 
2.0%
14
 
2.0%
14
 
2.0%
14
 
2.0%
Other values (162) 462
66.3%
Uppercase Letter
ValueCountFrequency (%)
K 4
50.0%
S 2
25.0%
G 2
25.0%
Open Punctuation
ValueCountFrequency (%)
( 80
100.0%
Close Punctuation
ValueCountFrequency (%)
) 80
100.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Other Symbol
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 706
79.9%
Common 170
 
19.2%
Latin 8
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
80
 
11.3%
30
 
4.2%
22
 
3.1%
16
 
2.3%
16
 
2.3%
15
 
2.1%
14
 
2.0%
14
 
2.0%
14
 
2.0%
14
 
2.0%
Other values (163) 471
66.7%
Common
ValueCountFrequency (%)
( 80
47.1%
) 80
47.1%
10
 
5.9%
Latin
ValueCountFrequency (%)
K 4
50.0%
S 2
25.0%
G 2
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 697
78.8%
ASCII 178
 
20.1%
None 9
 
1.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 80
44.9%
) 80
44.9%
10
 
5.6%
K 4
 
2.2%
S 2
 
1.1%
G 2
 
1.1%
Hangul
ValueCountFrequency (%)
80
 
11.5%
30
 
4.3%
22
 
3.2%
16
 
2.3%
16
 
2.3%
15
 
2.2%
14
 
2.0%
14
 
2.0%
14
 
2.0%
14
 
2.0%
Other values (162) 462
66.3%
None
ValueCountFrequency (%)
9
100.0%
Distinct96
Distinct (%)89.7%
Missing0
Missing (%)0.0%
Memory size988.0 B
2023-12-13T04:34:04.520282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length27
Mean length24.084112
Min length15

Characters and Unicode

Total characters2577
Distinct characters95
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85 ?
Unique (%)79.4%

Sample

1st row인천광역시 서구 봉수대로 415 (원창동)
2nd row인천광역시 서구 길주로45번길 11 (석남동)
3rd row인천광역시 서구 가정로58번길 32 (가좌동)
4th row인천광역시 서구 봉수대로501번길 3 (신현동)
5th row인천광역시 서구 길주로45번길 17-2 (석남동)
ValueCountFrequency (%)
인천광역시 107
20.3%
서구 107
20.3%
가좌동 21
 
4.0%
오류동 20
 
3.8%
원창동 16
 
3.0%
석남동 11
 
2.1%
경서동 8
 
1.5%
42 7
 
1.3%
백범로 6
 
1.1%
검단로 6
 
1.1%
Other values (155) 219
41.5%
2023-12-13T04:34:05.127268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
447
 
17.3%
119
 
4.6%
109
 
4.2%
108
 
4.2%
107
 
4.2%
107
 
4.2%
107
 
4.2%
107
 
4.2%
106
 
4.1%
104
 
4.0%
Other values (85) 1156
44.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1515
58.8%
Space Separator 447
 
17.3%
Decimal Number 399
 
15.5%
Close Punctuation 100
 
3.9%
Open Punctuation 99
 
3.8%
Dash Punctuation 12
 
0.5%
Other Punctuation 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
119
 
7.9%
109
 
7.2%
108
 
7.1%
107
 
7.1%
107
 
7.1%
107
 
7.1%
107
 
7.1%
106
 
7.0%
104
 
6.9%
58
 
3.8%
Other values (70) 483
31.9%
Decimal Number
ValueCountFrequency (%)
1 62
15.5%
2 57
14.3%
4 54
13.5%
5 52
13.0%
3 42
10.5%
6 32
8.0%
8 30
7.5%
7 25
6.3%
9 24
 
6.0%
0 21
 
5.3%
Space Separator
ValueCountFrequency (%)
447
100.0%
Close Punctuation
ValueCountFrequency (%)
) 100
100.0%
Open Punctuation
ValueCountFrequency (%)
( 99
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1515
58.8%
Common 1062
41.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
119
 
7.9%
109
 
7.2%
108
 
7.1%
107
 
7.1%
107
 
7.1%
107
 
7.1%
107
 
7.1%
106
 
7.0%
104
 
6.9%
58
 
3.8%
Other values (70) 483
31.9%
Common
ValueCountFrequency (%)
447
42.1%
) 100
 
9.4%
( 99
 
9.3%
1 62
 
5.8%
2 57
 
5.4%
4 54
 
5.1%
5 52
 
4.9%
3 42
 
4.0%
6 32
 
3.0%
8 30
 
2.8%
Other values (5) 87
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1515
58.8%
ASCII 1062
41.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
447
42.1%
) 100
 
9.4%
( 99
 
9.3%
1 62
 
5.8%
2 57
 
5.4%
4 54
 
5.1%
5 52
 
4.9%
3 42
 
4.0%
6 32
 
3.0%
8 30
 
2.8%
Other values (5) 87
 
8.2%
Hangul
ValueCountFrequency (%)
119
 
7.9%
109
 
7.2%
108
 
7.1%
107
 
7.1%
107
 
7.1%
107
 
7.1%
107
 
7.1%
106
 
7.0%
104
 
6.9%
58
 
3.8%
Other values (70) 483
31.9%

전화번호
Text

MISSING 

Distinct94
Distinct (%)95.9%
Missing9
Missing (%)8.4%
Memory size988.0 B
2023-12-13T04:34:05.463685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.040816
Min length12

Characters and Unicode

Total characters1180
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)91.8%

Sample

1st row032-543-2770
2nd row032-870-7337
3rd row032-870-3254
4th row032-575-1441
5th row032-876-0100
ValueCountFrequency (%)
032-570-4283 2
 
2.0%
032-562-5963 2
 
2.0%
032-550-8267 2
 
2.0%
032-571-1955 2
 
2.0%
032-812-9376 1
 
1.0%
032-584-4600 1
 
1.0%
032-772-5494 1
 
1.0%
032-290-2851 1
 
1.0%
032-563-7004 1
 
1.0%
032-590-2030 1
 
1.0%
Other values (84) 84
85.7%
2023-12-13T04:34:05.988934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 196
16.6%
0 178
15.1%
2 160
13.6%
3 144
12.2%
5 126
10.7%
7 99
8.4%
8 67
 
5.7%
1 63
 
5.3%
6 54
 
4.6%
9 47
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 984
83.4%
Dash Punctuation 196
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 178
18.1%
2 160
16.3%
3 144
14.6%
5 126
12.8%
7 99
10.1%
8 67
 
6.8%
1 63
 
6.4%
6 54
 
5.5%
9 47
 
4.8%
4 46
 
4.7%
Dash Punctuation
ValueCountFrequency (%)
- 196
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1180
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 196
16.6%
0 178
15.1%
2 160
13.6%
3 144
12.2%
5 126
10.7%
7 99
8.4%
8 67
 
5.7%
1 63
 
5.3%
6 54
 
4.6%
9 47
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1180
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 196
16.6%
0 178
15.1%
2 160
13.6%
3 144
12.2%
5 126
10.7%
7 99
8.4%
8 67
 
5.7%
1 63
 
5.3%
6 54
 
4.6%
9 47
 
4.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size988.0 B
Minimum2022-09-01 00:00:00
Maximum2022-09-01 00:00:00
2023-12-13T04:34:06.162597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:34:06.293331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T04:34:02.366822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:34:06.416387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분상호도로명 주소전화번호
연번1.0000.9160.4650.6410.716
구분0.9161.0000.6230.7750.958
상호0.4650.6231.0000.9991.000
도로명 주소0.6410.7750.9991.0000.998
전화번호0.7160.9581.0000.9981.000
2023-12-13T04:34:06.550330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분
연번1.0000.774
구분0.7741.000

Missing values

2023-12-13T04:34:02.495529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:34:02.619938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구분상호도로명 주소전화번호데이터기준일자
01고압가스 제조SK인천석유화학(주)인천광역시 서구 봉수대로 415 (원창동)032-543-27702022-09-01
12고압가스 제조(주)엘피지뱅크인천광역시 서구 길주로45번길 11 (석남동)032-870-73372022-09-01
23고압가스 제조이화산소(주)인천광역시 서구 가정로58번길 32 (가좌동)032-870-32542022-09-01
34고압가스 제조삼정가스공업(주)인천광역시 서구 봉수대로501번길 3 (신현동)032-575-14412022-09-01
45고압가스 제조동서화학(주)인천광역시 서구 길주로45번길 17-2 (석남동)032-876-01002022-09-01
56고압가스 제조(주)세안인천광역시 서구 길주로44번길 24 (석남동)032-880-21742022-09-01
67고압가스 제조KG동부제철(주)인천광역시 서구 백범로 789 (가좌동)032-570-42832022-09-01
78고압가스 제조(주)제일가스인천광역시 서구 두루물로8번길 46 (오류동)032-567-60502022-09-01
89고압가스 제조(주)한화인천광역시 서구 북항로 268 (원창동)032-255-69952022-09-01
910고압가스 제조선도산업(주)인천광역시 서구 사렴로65번길 42 (경서동)032-588-45042022-09-01
연번구분상호도로명 주소전화번호데이터기준일자
9798용기 등 제조그린냉동공조(주)인천광역시 서구 검단로 326번길 35-72 (왕길동)032-822-15112022-09-01
9899용기 등 제조(주)동성엔지니어링인천광역시 서구 검단로92번길 9, 1층 (오류동)032-562-59632022-09-01
99100용기 등 제조(주)거봉한진인천광역시 서구 북항로 177번길 68032-682-20722022-09-01
100101용기 등 제조한주기계공업(주)인천광역시 서구 보듬로 173, 1동031-999-85692022-09-01
101102용기 등 제조신동해(주)인천광역시 서구 보듬2로 20, 가동 (오류동)032-563-02752022-09-01
102103용기 등 제조신도열기(주)인천광역시 서구 검단로 54번길 25 (오류동)032-563-70092022-09-01
103104용기 등 제조(주)플러스냉동설비인천광역시 서구 건지로 136 (가좌동)032-584-92212022-09-01
104105용기 등 제조㈜동인써모인천광역시 서구 봉화로 223번안길 29(금곡동)032-565-91512022-09-01
105106용기 등 제조㈜삼원이엔지인천광역시 서구 보듬4로 12(오류동)032-553-52762022-09-01
106107용기 등 제조㈜제이에이치기업인천광역시 서구 로봇랜드로 249번길 94(청라동)032-716-63372022-09-01