Overview

Dataset statistics

Number of variables15
Number of observations45
Missing cells203
Missing cells (%)30.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.6 KiB
Average record size in memory126.9 B

Variable types

Text6
DateTime1
Categorical6
Numeric2

Dataset

Description충청북도 충주시 고압가스 저장설비 설치현황에 대한 데이터 제공(업체명, 주소, 허가일자, 사용가스, 사용가스 저장설비 크기, 사용가스 단위 등 )
URLhttps://www.data.go.kr/data/15029720/fileData.do

Alerts

사용가스3 단위 has constant value ""Constant
사용가스4 has constant value ""Constant
사용가스2 단위 is highly overall correlated with 사용가스1 저장설비 크기 and 5 other fieldsHigh correlation
사용가스2 is highly overall correlated with 사용가스1 and 3 other fieldsHigh correlation
사용가스3 저장설비 크기 is highly overall correlated with 사용가스1 저장설비 크기 and 6 other fieldsHigh correlation
사용가스1 단위 is highly overall correlated with 사용가스1 저장설비 크기 and 2 other fieldsHigh correlation
사용가스4 저장설비 크기 is highly overall correlated with 사용가스1 저장설비 크기 and 6 other fieldsHigh correlation
사용가스1 is highly overall correlated with 사용가스2 and 3 other fieldsHigh correlation
사용가스1 저장설비 크기 is highly overall correlated with 사용가스1 단위 and 3 other fieldsHigh correlation
사용가스2 저장설비 크기 is highly overall correlated with 사용가스2 단위 and 2 other fieldsHigh correlation
사용가스1 단위 is highly imbalanced (51.3%)Imbalance
사용가스3 저장설비 크기 is highly imbalanced (80.6%)Imbalance
사용가스4 저장설비 크기 is highly imbalanced (80.6%)Imbalance
사용가스2 저장설비 크기 has 31 (68.9%) missing valuesMissing
사용가스3 has 43 (95.6%) missing valuesMissing
사용가스3 단위 has 43 (95.6%) missing valuesMissing
사용가스4 has 43 (95.6%) missing valuesMissing
사용가스4 단위 has 43 (95.6%) missing valuesMissing

Reproduction

Analysis started2023-12-12 23:08:30.427083
Analysis finished2023-12-12 23:08:32.212137
Duration1.79 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct38
Distinct (%)84.4%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-13T08:08:32.354040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length13
Mean length10.533333
Min length4

Characters and Unicode

Total characters474
Distinct characters135
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)73.3%

Sample

1st row㈜메디오젠 충주공장
2nd row주식회사 재세능원 충주지점
3rd row한국수자원공사 한강권역부문 충주권지사
4th row이연제약(주)
5th row이연제약(주)
ValueCountFrequency (%)
주식회사 7
 
10.3%
롯데칠성음료(주 4
 
5.9%
주류비지 4
 
5.9%
충주공장 4
 
5.9%
코스모신소재(주 2
 
2.9%
현대모비스(주 2
 
2.9%
보성파워텍(주 2
 
2.9%
이연제약(주 2
 
2.9%
한국수자원공사 2
 
2.9%
공군제19전투비행단 1
 
1.5%
Other values (38) 38
55.9%
2023-12-13T08:08:32.649399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52
 
11.0%
( 28
 
5.9%
) 28
 
5.9%
23
 
4.9%
13
 
2.7%
11
 
2.3%
11
 
2.3%
10
 
2.1%
10
 
2.1%
10
 
2.1%
Other values (125) 278
58.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 382
80.6%
Open Punctuation 28
 
5.9%
Close Punctuation 28
 
5.9%
Space Separator 23
 
4.9%
Uppercase Letter 8
 
1.7%
Decimal Number 2
 
0.4%
Other Punctuation 2
 
0.4%
Other Symbol 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
52
 
13.6%
13
 
3.4%
11
 
2.9%
11
 
2.9%
10
 
2.6%
10
 
2.6%
10
 
2.6%
9
 
2.4%
7
 
1.8%
7
 
1.8%
Other values (110) 242
63.4%
Uppercase Letter
ValueCountFrequency (%)
P 2
25.0%
T 1
12.5%
N 1
12.5%
B 1
12.5%
F 1
12.5%
E 1
12.5%
S 1
12.5%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
9 1
50.0%
Other Punctuation
ValueCountFrequency (%)
& 1
50.0%
. 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Space Separator
ValueCountFrequency (%)
23
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 383
80.8%
Common 83
 
17.5%
Latin 8
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
13.6%
13
 
3.4%
11
 
2.9%
11
 
2.9%
10
 
2.6%
10
 
2.6%
10
 
2.6%
9
 
2.3%
7
 
1.8%
7
 
1.8%
Other values (111) 243
63.4%
Common
ValueCountFrequency (%)
( 28
33.7%
) 28
33.7%
23
27.7%
1 1
 
1.2%
9 1
 
1.2%
& 1
 
1.2%
. 1
 
1.2%
Latin
ValueCountFrequency (%)
P 2
25.0%
T 1
12.5%
N 1
12.5%
B 1
12.5%
F 1
12.5%
E 1
12.5%
S 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 382
80.6%
ASCII 91
 
19.2%
None 1
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
52
 
13.6%
13
 
3.4%
11
 
2.9%
11
 
2.9%
10
 
2.6%
10
 
2.6%
10
 
2.6%
9
 
2.4%
7
 
1.8%
7
 
1.8%
Other values (110) 242
63.4%
ASCII
ValueCountFrequency (%)
( 28
30.8%
) 28
30.8%
23
25.3%
P 2
 
2.2%
T 1
 
1.1%
N 1
 
1.1%
1 1
 
1.1%
9 1
 
1.1%
B 1
 
1.1%
& 1
 
1.1%
Other values (4) 4
 
4.4%
None
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct39
Distinct (%)86.7%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-13T08:08:32.795359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length33
Mean length24.777778
Min length17

Characters and Unicode

Total characters1115
Distinct characters96
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)77.8%

Sample

1st row충청북도 충주시 대소원면 메가폴리스3로 16
2nd row충청북도 충주시 대소원면 영평리 774
3rd row충청북도 충주시 용탄동 305-3 한국수자원공사충주권관리단
4th row충청북도 충주시 대소원면 영평리 530
5th row충청북도 충주시 대소원면 영평리 530
ValueCountFrequency (%)
충청북도 45
19.5%
충주시 45
19.5%
대소원면 20
 
8.7%
주덕읍 8
 
3.5%
용탄동 6
 
2.6%
메가폴리스로 5
 
2.2%
기업도시1로 5
 
2.2%
87 4
 
1.7%
롯데칠성음료(주)충주2공장 4
 
1.7%
영평리 4
 
1.7%
Other values (68) 85
36.8%
2023-12-13T08:08:33.035566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
196
17.6%
105
 
9.4%
73
 
6.5%
51
 
4.6%
50
 
4.5%
46
 
4.1%
45
 
4.0%
27
 
2.4%
27
 
2.4%
27
 
2.4%
Other values (86) 468
42.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 741
66.5%
Space Separator 196
 
17.6%
Decimal Number 136
 
12.2%
Close Punctuation 14
 
1.3%
Open Punctuation 14
 
1.3%
Other Punctuation 7
 
0.6%
Dash Punctuation 4
 
0.4%
Uppercase Letter 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
105
 
14.2%
73
 
9.9%
51
 
6.9%
50
 
6.7%
46
 
6.2%
45
 
6.1%
27
 
3.6%
27
 
3.6%
27
 
3.6%
26
 
3.5%
Other values (68) 264
35.6%
Decimal Number
ValueCountFrequency (%)
1 22
16.2%
3 20
14.7%
2 18
13.2%
6 13
9.6%
8 12
8.8%
7 12
8.8%
5 11
8.1%
4 10
7.4%
0 10
7.4%
9 8
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
S 1
33.3%
P 1
33.3%
E 1
33.3%
Space Separator
ValueCountFrequency (%)
196
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Other Punctuation
ValueCountFrequency (%)
, 7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 741
66.5%
Common 371
33.3%
Latin 3
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
105
 
14.2%
73
 
9.9%
51
 
6.9%
50
 
6.7%
46
 
6.2%
45
 
6.1%
27
 
3.6%
27
 
3.6%
27
 
3.6%
26
 
3.5%
Other values (68) 264
35.6%
Common
ValueCountFrequency (%)
196
52.8%
1 22
 
5.9%
3 20
 
5.4%
2 18
 
4.9%
) 14
 
3.8%
( 14
 
3.8%
6 13
 
3.5%
8 12
 
3.2%
7 12
 
3.2%
5 11
 
3.0%
Other values (5) 39
 
10.5%
Latin
ValueCountFrequency (%)
S 1
33.3%
P 1
33.3%
E 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 741
66.5%
ASCII 374
33.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
196
52.4%
1 22
 
5.9%
3 20
 
5.3%
2 18
 
4.8%
) 14
 
3.7%
( 14
 
3.7%
6 13
 
3.5%
8 12
 
3.2%
7 12
 
3.2%
5 11
 
2.9%
Other values (8) 42
 
11.2%
Hangul
ValueCountFrequency (%)
105
 
14.2%
73
 
9.9%
51
 
6.9%
50
 
6.7%
46
 
6.2%
45
 
6.1%
27
 
3.6%
27
 
3.6%
27
 
3.6%
26
 
3.5%
Other values (68) 264
35.6%
Distinct40
Distinct (%)88.9%
Missing0
Missing (%)0.0%
Memory size492.0 B
Minimum1987-08-21 00:00:00
Maximum2022-12-01 00:00:00
2023-12-13T08:08:33.137261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:33.236209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)

사용가스1
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
질소
14 
산소
11 
탄산가스
수소
기타
Other values (4)

Length

Max length6
Median length2
Mean length2.6
Min length2

Unique

Unique2 ?
Unique (%)4.4%

Sample

1st row액화암모니아
2nd row산소
3rd row탄산가스
4th row산소
5th row질소

Common Values

ValueCountFrequency (%)
질소 14
31.1%
산소 11
24.4%
탄산가스 7
15.6%
수소 5
 
11.1%
기타 2
 
4.4%
천연가스 2
 
4.4%
액화염소 2
 
4.4%
액화암모니아 1
 
2.2%
아르곤 1
 
2.2%

Length

2023-12-13T08:08:33.342528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:08:33.449203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
질소 14
31.1%
산소 11
24.4%
탄산가스 7
15.6%
수소 5
 
11.1%
기타 2
 
4.4%
천연가스 2
 
4.4%
액화염소 2
 
4.4%
액화암모니아 1
 
2.2%
아르곤 1
 
2.2%

사용가스1 저장설비 크기
Real number (ℝ)

HIGH CORRELATION 

Distinct44
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1749.7614
Minimum0.002
Maximum32220.3
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-13T08:08:33.558997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.002
5-th percentile0.7022
Q19.747
median15.062
Q343.63
95-th percentile12512.4
Maximum32220.3
Range32220.298
Interquartile range (IQR)33.883

Descriptive statistics

Standard deviation5641.9238
Coefficient of variation (CV)3.2243961
Kurtosis20.187538
Mean1749.7614
Median Absolute Deviation (MAD)8.778
Skewness4.2615488
Sum78739.262
Variance31831304
MonotonicityNot monotonic
2023-12-13T08:08:33.658286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=44)
ValueCountFrequency (%)
29.929 2
 
4.4%
4.9 1
 
2.2%
4.905 1
 
2.2%
10.7 1
 
2.2%
21.23 1
 
2.2%
8.93 1
 
2.2%
2597.26 1
 
2.2%
6.89 1
 
2.2%
9186.0 1
 
2.2%
10.0 1
 
2.2%
Other values (34) 34
75.6%
ValueCountFrequency (%)
0.002 1
2.2%
0.004 1
2.2%
0.005 1
2.2%
3.491 1
2.2%
4.9 1
2.2%
4.905 1
2.2%
6.284 1
2.2%
6.89 1
2.2%
6.976 1
2.2%
8.0 1
2.2%
ValueCountFrequency (%)
32220.3 1
2.2%
14543.0 1
2.2%
13344.0 1
2.2%
9186.0 1
2.2%
4227.23 1
2.2%
2597.26 1
2.2%
1065.0 1
2.2%
623.28 1
2.2%
324.58 1
2.2%
59.504 1
2.2%

사용가스1 단위
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size492.0 B
37 
Kg
 
1

Length

Max length2
Median length1
Mean length1.0222222
Min length1

Unique

Unique1 ?
Unique (%)2.2%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
37
82.2%
7
 
15.6%
Kg 1
 
2.2%

Length

2023-12-13T08:08:33.756984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:08:33.835093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
37
82.2%
7
 
15.6%
kg 1
 
2.2%

사용가스2
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size492.0 B
<NA>
31 
질소
탄산가스
 
3
기타
 
3
액화염소
 
1

Length

Max length4
Median length4
Mean length3.5777778
Min length2

Unique

Unique2 ?
Unique (%)4.4%

Sample

1st row<NA>
2nd row질소
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 31
68.9%
질소 6
 
13.3%
탄산가스 3
 
6.7%
기타 3
 
6.7%
액화염소 1
 
2.2%
아르곤 1
 
2.2%

Length

2023-12-13T08:08:33.933182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:08:34.034501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 31
68.9%
질소 6
 
13.3%
탄산가스 3
 
6.7%
기타 3
 
6.7%
액화염소 1
 
2.2%
아르곤 1
 
2.2%

사용가스2 저장설비 크기
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct14
Distinct (%)100.0%
Missing31
Missing (%)68.9%
Infinite0
Infinite (%)0.0%
Mean432.28993
Minimum0.828
Maximum4006.8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-13T08:08:34.120277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.828
5-th percentile2.5817
Q19.9215
median13.712
Q3113.61
95-th percentile2062.1664
Maximum4006.8
Range4005.972
Interquartile range (IQR)103.6885

Descriptive statistics

Standard deviation1075.0705
Coefficient of variation (CV)2.48692
Kurtosis11.154887
Mean432.28993
Median Absolute Deviation (MAD)11.535
Skewness3.2608564
Sum6052.059
Variance1155776.5
MonotonicityNot monotonic
2023-12-13T08:08:34.210002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
14.174 1
 
2.2%
3.527 1
 
2.2%
9.898 1
 
2.2%
3.526 1
 
2.2%
4006.8 1
 
2.2%
730.128 1
 
2.2%
1015.056 1
 
2.2%
43.74 1
 
2.2%
132.0 1
 
2.2%
0.828 1
 
2.2%
Other values (4) 4
 
8.9%
(Missing) 31
68.9%
ValueCountFrequency (%)
0.828 1
2.2%
3.526 1
2.2%
3.527 1
2.2%
9.898 1
2.2%
9.992 1
2.2%
10.7 1
2.2%
13.25 1
2.2%
14.174 1
2.2%
43.74 1
2.2%
58.44 1
2.2%
ValueCountFrequency (%)
4006.8 1
2.2%
1015.056 1
2.2%
730.128 1
2.2%
132.0 1
2.2%
58.44 1
2.2%
43.74 1
2.2%
14.174 1
2.2%
13.25 1
2.2%
10.7 1
2.2%
9.992 1
2.2%

사용가스2 단위
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)8.9%
Missing0
Missing (%)0.0%
Memory size492.0 B
<NA>
31 
10 
 
3
Kg
 
1

Length

Max length4
Median length4
Mean length3.0888889
Min length1

Unique

Unique1 ?
Unique (%)2.2%

Sample

1st row<NA>
2nd row
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 31
68.9%
10
 
22.2%
3
 
6.7%
Kg 1
 
2.2%

Length

2023-12-13T08:08:34.341147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:08:34.464214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 31
68.9%
10
 
22.2%
3
 
6.7%
kg 1
 
2.2%

사용가스3
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing43
Missing (%)95.6%
Memory size492.0 B
2023-12-13T08:08:34.600851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length2.5
Mean length2.5
Min length2

Characters and Unicode

Total characters5
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row질소
2nd row아르곤
ValueCountFrequency (%)
질소 1
50.0%
아르곤 1
50.0%
2023-12-13T08:08:34.845682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

사용가스3 저장설비 크기
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size492.0 B
<NA>
43 
42.0
 
1
45.4
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique2 ?
Unique (%)4.4%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 43
95.6%
42.0 1
 
2.2%
45.4 1
 
2.2%

Length

2023-12-13T08:08:35.026347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:08:35.131854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 43
95.6%
42.0 1
 
2.2%
45.4 1
 
2.2%

사용가스3 단위
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)50.0%
Missing43
Missing (%)95.6%
Memory size492.0 B
2023-12-13T08:08:35.204062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters2
Distinct characters1
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
ValueCountFrequency (%)
2
100.0%
2023-12-13T08:08:35.364054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
100.0%

Most occurring categories

ValueCountFrequency (%)
Other Symbol 2
100.0%

Most frequent character per category

Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
CJK Compat 2
100.0%

Most frequent character per block

CJK Compat
ValueCountFrequency (%)
2
100.0%

사용가스4
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)50.0%
Missing43
Missing (%)95.6%
Memory size492.0 B
2023-12-13T08:08:35.446136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters4
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타
2nd row기타
ValueCountFrequency (%)
기타 2
100.0%
2023-12-13T08:08:35.642215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2
50.0%
2
50.0%

사용가스4 저장설비 크기
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size492.0 B
<NA>
43 
120.0
 
1
14.2
 
1

Length

Max length5
Median length4
Mean length4.0222222
Min length4

Unique

Unique2 ?
Unique (%)4.4%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 43
95.6%
120.0 1
 
2.2%
14.2 1
 
2.2%

Length

2023-12-13T08:08:35.763403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:08:35.866087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 43
95.6%
120.0 1
 
2.2%
14.2 1
 
2.2%

사용가스4 단위
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing43
Missing (%)95.6%
Memory size492.0 B
2023-12-13T08:08:35.986139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length1.5
Mean length1.5
Min length1

Characters and Unicode

Total characters3
Distinct characters3
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st rowKg
2nd row
ValueCountFrequency (%)
kg 1
50.0%
1
50.0%
2023-12-13T08:08:36.243234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
K 1
33.3%
g 1
33.3%
1
33.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 1
33.3%
Lowercase Letter 1
33.3%
Other Symbol 1
33.3%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
K 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
g 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2
66.7%
Common 1
33.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
K 1
50.0%
g 1
50.0%
Common
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2
66.7%
CJK Compat 1
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
K 1
50.0%
g 1
50.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-13T08:08:31.289752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:31.113621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:31.407011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:08:31.190574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:08:36.336787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명주소허가일자사용가스1사용가스1 저장설비 크기사용가스1 단위사용가스2사용가스2 저장설비 크기사용가스2 단위사용가스3사용가스3 저장설비 크기사용가스4 저장설비 크기사용가스4 단위
업체명1.0001.0000.9970.9180.8010.8371.0000.0001.0000.0000.0000.0000.000
주소1.0001.0000.9970.8910.7080.8141.0000.0001.0000.0000.0000.0000.000
허가일자0.9970.9971.0000.9751.0001.0001.0000.0001.0000.0000.0000.0000.000
사용가스10.9180.8910.9751.0000.0000.8160.7400.7520.662NaNNaNNaNNaN
사용가스1 저장설비 크기0.8010.7081.0000.0001.0000.6790.4770.0000.6140.0000.0000.0000.000
사용가스1 단위0.8370.8141.0000.8160.6791.0000.4230.0000.231NaNNaNNaNNaN
사용가스21.0001.0001.0000.7400.4770.4231.0000.3111.0000.0000.0000.0000.000
사용가스2 저장설비 크기0.0000.0000.0000.7520.0000.0000.3111.0000.590NaNNaNNaNNaN
사용가스2 단위1.0001.0001.0000.6620.6140.2311.0000.5901.0000.0000.0000.0000.000
사용가스30.0000.0000.000NaN0.000NaN0.000NaN0.0001.0000.0000.0000.000
사용가스3 저장설비 크기0.0000.0000.000NaN0.000NaN0.000NaN0.0000.0001.0000.0000.000
사용가스4 저장설비 크기0.0000.0000.000NaN0.000NaN0.000NaN0.0000.0000.0001.0000.000
사용가스4 단위0.0000.0000.000NaN0.000NaN0.000NaN0.0000.0000.0000.0001.000
2023-12-13T08:08:36.500483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용가스2 단위사용가스2사용가스3 저장설비 크기사용가스1 단위사용가스4 저장설비 크기사용가스1
사용가스2 단위1.0000.9051.0000.3471.0000.649
사용가스20.9051.0001.0000.4261.0000.637
사용가스3 저장설비 크기1.0001.0001.0001.0001.0001.000
사용가스1 단위0.3470.4261.0001.0001.0000.482
사용가스4 저장설비 크기1.0001.0001.0001.0001.0001.000
사용가스10.6490.6371.0000.4821.0001.000
2023-12-13T08:08:36.615252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용가스1 저장설비 크기사용가스2 저장설비 크기사용가스1사용가스1 단위사용가스2사용가스2 단위사용가스3 저장설비 크기사용가스4 저장설비 크기
사용가스1 저장설비 크기1.000-0.1250.0000.6370.3540.5921.0001.000
사용가스2 저장설비 크기-0.1251.0000.3650.0000.1830.5641.0001.000
사용가스10.0000.3651.0000.4820.6370.6491.0001.000
사용가스1 단위0.6370.0000.4821.0000.4260.3471.0001.000
사용가스20.3540.1830.6370.4261.0000.9051.0001.000
사용가스2 단위0.5920.5640.6490.3470.9051.0001.0001.000
사용가스3 저장설비 크기1.0001.0001.0001.0001.0001.0001.0001.000
사용가스4 저장설비 크기1.0001.0001.0001.0001.0001.0001.0001.000

Missing values

2023-12-13T08:08:31.548865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:08:31.726046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T08:08:32.120156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업체명주소허가일자사용가스1사용가스1 저장설비 크기사용가스1 단위사용가스2사용가스2 저장설비 크기사용가스2 단위사용가스3사용가스3 저장설비 크기사용가스3 단위사용가스4사용가스4 저장설비 크기사용가스4 단위
0㈜메디오젠 충주공장충청북도 충주시 대소원면 메가폴리스3로 162022-12-01액화암모니아4.9<NA><NA><NA><NA><NA><NA><NA><NA><NA>
1주식회사 재세능원 충주지점충청북도 충주시 대소원면 영평리 7742022-02-24산소324.58질소14.174<NA><NA><NA><NA><NA><NA>
2한국수자원공사 한강권역부문 충주권지사충청북도 충주시 용탄동 305-3 한국수자원공사충주권관리단2022-02-24탄산가스39.862<NA><NA><NA><NA><NA><NA><NA><NA><NA>
3이연제약(주)충청북도 충주시 대소원면 영평리 5302021-05-04산소9.747<NA><NA><NA><NA><NA><NA><NA><NA><NA>
4이연제약(주)충청북도 충주시 대소원면 영평리 5302021-05-04질소9.963<NA><NA><NA><NA><NA><NA><NA><NA><NA>
5(주)중원신소재 영평공장충청북도 충주시 대소원면 영평리 5762021-01-28질소19.999<NA><NA><NA><NA><NA><NA><NA><NA><NA>
6태성EPS충청북도 충주시 대소원면 메가폴리스2로 36, 태성EPS2020-09-25탄산가스49.062<NA><NA><NA><NA><NA><NA><NA><NA><NA>
7코스모신소재(주)충청북도 충주시 충주호수로 36 (목행동)2020-09-22산소59.504질소3.527<NA><NA><NA><NA><NA><NA>
8현대모비스(주)충청북도 충주시 대소원면 기업도시1로 472019-09-19수소32220.3<NA><NA><NA><NA><NA><NA><NA><NA><NA>
9주식회사 에이치제이에프충청북도 충주시 대소원면 메가폴리스로 422018-09-13질소21.1<NA><NA><NA><NA><NA><NA><NA><NA><NA>
업체명주소허가일자사용가스1사용가스1 저장설비 크기사용가스1 단위사용가스2사용가스2 저장설비 크기사용가스2 단위사용가스3사용가스3 저장설비 크기사용가스3 단위사용가스4사용가스4 저장설비 크기사용가스4 단위
35보성파워텍(주)충주공장충청북도 충주시 주덕읍 대창길 532007-08-13산소12.26탄산가스9.992<NA><NA><NA><NA><NA><NA>
36중부수산영어조합법인충청북도 충주시 금가면 연합강변길 922007-06-07산소6.976<NA><NA><NA><NA><NA><NA><NA><NA><NA>
37건국대학교충주병원충청북도 충주시 국원대로 82 (교현동)2007-06-05산소9.764<NA><NA><NA><NA><NA><NA><NA><NA><NA>
38(주)세아특수강 충주공장충청북도 충주시 충주산단2로 103 (용탄동)2005-07-27수소13344.0질소58.44<NA><NA><NA><NA><NA><NA>
39(주)TNP충청북도 충주시 충주산단1로 185 (용탄동)2005-04-30수소1065.0질소10.7아르곤45.4기타14.2
40현대성우캐스팅(주)충주공장충청북도 충주시 충주호수로 344 (용탄동)2000-12-21아르곤6.284<NA><NA><NA><NA><NA><NA><NA><NA><NA>
41(주)신한에스엔지 충주공장충청북도 충주시 주덕읍 충청대로 2195-311999-05-06산소12.264탄산가스13.25<NA><NA><NA><NA><NA><NA>
42한국수자원공사 충주권관리단충청북도 충주시 용탄동 3051998-11-17액화염소12.0<NA><NA><NA><NA><NA><NA><NA><NA><NA>
43써니전자(주)충주지점충청북도 충주시 목행산단2로 59 (목행동)1991-04-28질소21.6<NA><NA><NA><NA><NA><NA><NA><NA><NA>
44충주시청 상수도과충청북도 충주시 중원대로 3036 (단월동)1987-08-21액화염소8.0<NA><NA><NA><NA><NA><NA><NA><NA><NA>