Overview

Dataset statistics

Number of variables: 3
Number of observations: 74
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.9 KiB
Average record size in memory: 25.8 B

Variable types

Text: 3

Dataset

Description: Status data on high-pressure gas storage facilities in Dangjin-si, Chungcheongnam-do. The columns are business name, address, and scale.
Author: Chungcheongnam-do
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=438&beforeMenuCd=DOM_000000201001001000&publicdatapk=15029694

Reproduction

Analysis started: 2024-01-09 21:34:28.201729
Analysis finished: 2024-01-09 21:34:28.614432
Duration: 0.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

법인명(상호) (corporate or business name)

Distinct: 62
Distinct (%): 83.8%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B

Length

Max length: 26
Median length: 15.5
Mean length: 9.8243243
Min length: 4
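
The length figures above can be reproduced with a few lines of standard-library Python. This is a minimal sketch, assuming length is measured in Unicode code points (what `len` returns for a Python string); the helper name `length_summary` is hypothetical, and the input is only the five sample rows listed for this variable, not the full 74 observations.

```python
from statistics import mean, median

def length_summary(values):
    """Return max, median, mean and min string length in code points."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }

# The five sample rows listed for this variable (not the full column).
sample = ["(주)수석", "(주)하이테크이엔브이", "엔씨케이(주)", "당진종합병원", "(주)신한씨에스"]
summary = length_summary(sample)
```

Running the same function over the whole column would be the way to check the reported values.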

Characters and Unicode

Total characters: 727
Distinct characters: 144
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
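
As a rough illustration of how such character properties can be read, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the report's own implementation; the helper name `category_counts` is hypothetical. `unicodedata.category` returns two-letter codes (for example `Lo` for Other Letter, `Ps` for Open Punctuation, `Pe` for Close Punctuation, `Zs` for Space Separator). The standard library does not expose script or block properties, so those would need another data source.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

# Two of the sample values shown for this variable.
counts = category_counts(["(주)수석", "엔씨케이(주)"])
```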

Unique

Unique: 58
Unique (%): 78.4%
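
The distinction between the two counts can be sketched as follows, assuming "Distinct" means the number of different values and "Unique" means the number of values that occur exactly once. Both percentages in this report are relative to the 74 observations (62/74 = 83.8%, 58/74 = 78.4%). The helper name `distinct_and_unique` is hypothetical.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    freq = Counter(values)
    distinct = len(freq)
    unique = sum(1 for count in freq.values() if count == 1)
    return distinct, unique

# Toy input: "a" repeats, so it is distinct but not unique.
result = distinct_and_unique(["a", "a", "b", "c"])
```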

Sample

1st row: (주)수석
2nd row: (주)하이테크이엔브이
3rd row: 엔씨케이(주)
4th row: 당진종합병원
5th row: (주)신한씨에스

| Value | Count | Frequency (%) |
| 현대제철(주 | 10 | 10.3% |
| 케이지스틸(주 | 4 | 4.1% |
| 한국동서발전(주 | 4 | 4.1% |
| 당진화력본부 | 4 | 4.1% |
| 한국동서발전(주)당진화력본부 | 4 | 4.1% |
| 일관 | 2 | 2.1% |
| 현대하이스코(주 | 2 | 2.1% |
| 삼원중공업(주 | 1 | 1.0% |
| 고로1기 | 1 | 1.0% |
| 제강사무실 | 1 | 1.0% |
| Other values (64) | 64 | 66.0% |

Most occurring characters

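| Value | Count | Frequency (%) |
|  | 66 | 9.1% |
| ( | 64 | 8.8% |
| ) | 64 | 8.8% |
| (space) | 23 | 3.2% |
|  | 22 | 3.0% |
|  | 21 | 2.9% |
|  | 20 | 2.8% |
|  | 20 | 2.8% |
|  | 19 | 2.6% |
|  | 19 | 2.6% |
| Other values (134) | 389 | 53.5% |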

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 556 | 76.5% |
| Open Punctuation | 64 | 8.8% |
| Close Punctuation | 64 | 8.8% |
| Space Separator | 23 | 3.2% |
| Math Symbol | 6 | 0.8% |
| Uppercase Letter | 6 | 0.8% |
| Decimal Number | 5 | 0.7% |
| Other Punctuation | 3 | 0.4% |

Most frequent character per category

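Other Letter
| Value | Count | Frequency (%) |
|  | 66 | 11.9% |
|  | 22 | 4.0% |
|  | 21 | 3.8% |
|  | 20 | 3.6% |
|  | 20 | 3.6% |
|  | 19 | 3.4% |
|  | 19 | 3.4% |
|  | 17 | 3.1% |
|  | 16 | 2.9% |
|  | 12 | 2.2% |
| Other values (119) | 324 | 58.3% |

Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | 16.7% |
| A | 1 | 16.7% |
| P | 1 | 16.7% |
| M | 1 | 16.7% |
| I | 1 | 16.7% |
| S | 1 | 16.7% |

Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | 40.0% |
| 2 | 2 | 40.0% |
| 3 | 1 | 20.0% |

Math Symbol
| Value | Count | Frequency (%) |
| < | 3 | 50.0% |
| > | 3 | 50.0% |

Open Punctuation
| Value | Count | Frequency (%) |
| ( | 64 | 100.0% |

Close Punctuation
| Value | Count | Frequency (%) |
| ) | 64 | 100.0% |

Space Separator
| Value | Count | Frequency (%) |
| (space) | 23 | 100.0% |

Other Punctuation
| Value | Count | Frequency (%) |
| , | 3 | 100.0% |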

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 556 | 76.5% |
| Common | 165 | 22.7% |
| Latin | 6 | 0.8% |

Most frequent character per script

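Hangul
| Value | Count | Frequency (%) |
|  | 66 | 11.9% |
|  | 22 | 4.0% |
|  | 21 | 3.8% |
|  | 20 | 3.6% |
|  | 20 | 3.6% |
|  | 19 | 3.4% |
|  | 19 | 3.4% |
|  | 17 | 3.1% |
|  | 16 | 2.9% |
|  | 12 | 2.2% |
| Other values (119) | 324 | 58.3% |

Common
| Value | Count | Frequency (%) |
| ( | 64 | 38.8% |
| ) | 64 | 38.8% |
| (space) | 23 | 13.9% |
| , | 3 | 1.8% |
| < | 3 | 1.8% |
| > | 3 | 1.8% |
| 1 | 2 | 1.2% |
| 2 | 2 | 1.2% |
| 3 | 1 | 0.6% |

Latin
| Value | Count | Frequency (%) |
| C | 1 | 16.7% |
| A | 1 | 16.7% |
| P | 1 | 16.7% |
| M | 1 | 16.7% |
| I | 1 | 16.7% |
| S | 1 | 16.7% |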

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 556 | 76.5% |
| ASCII | 171 | 23.5% |

Most frequent character per block

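Hangul
| Value | Count | Frequency (%) |
|  | 66 | 11.9% |
|  | 22 | 4.0% |
|  | 21 | 3.8% |
|  | 20 | 3.6% |
|  | 20 | 3.6% |
|  | 19 | 3.4% |
|  | 19 | 3.4% |
|  | 17 | 3.1% |
|  | 16 | 2.9% |
|  | 12 | 2.2% |
| Other values (119) | 324 | 58.3% |

ASCII
| Value | Count | Frequency (%) |
| ( | 64 | 37.4% |
| ) | 64 | 37.4% |
| (space) | 23 | 13.5% |
| , | 3 | 1.8% |
| < | 3 | 1.8% |
| > | 3 | 1.8% |
| 1 | 2 | 1.2% |
| 2 | 2 | 1.2% |
| 3 | 1 | 0.6% |
| C | 1 | 0.6% |
| Other values (5) | 5 | 2.9% |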

사업소소재지 (business site location)

Distinct: 56
Distinct (%): 75.7%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B

Length

Max length: 40
Median length: 34
Mean length: 23.972973
Min length: 17

Characters and Unicode

Total characters: 1774
Distinct characters: 115
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 49
Unique (%): 66.2%

Sample

1st row: 충청남도 당진시 합덕읍 인더스파크로 21
2nd row: 충청남도 당진시 정미면 4.4만세로 574
3rd row: 충청남도 당진시 송산면 동곡리 381-5
4th row: 충청남도 당진시 반촌로 5-15, 당진종합병원 (시곡동)
5th row: 충청남도 당진시 합덕읍 면천로 1339

| Value | Count | Frequency (%) |
| 충청남도 | 74 | 18.8% |
| 당진시 | 74 | 18.8% |
| 송악읍 | 27 | 6.9% |
| 북부산업로 | 15 | 3.8% |
| 석문면 | 13 | 3.3% |
| 1480 | 10 | 2.5% |
| 송산면 | 10 | 2.5% |
| 교로길 | 8 | 2.0% |
| 30 | 8 | 2.0% |
| 합덕읍 | 7 | 1.8% |
| Other values (110) | 147 | 37.4% |

Most occurring characters

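| Value | Count | Frequency (%) |
| (space) | 336 | 18.9% |
|  | 78 | 4.4% |
|  | 77 | 4.3% |
|  | 76 | 4.3% |
|  | 74 | 4.2% |
|  | 74 | 4.2% |
|  | 74 | 4.2% |
|  | 74 | 4.2% |
| 1 | 53 | 3.0% |
|  | 47 | 2.6% |
| Other values (105) | 811 | 45.7% |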

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 1102 | 62.1% |
| Space Separator | 336 | 18.9% |
| Decimal Number | 279 | 15.7% |
| Dash Punctuation | 22 | 1.2% |
| Open Punctuation | 11 | 0.6% |
| Close Punctuation | 11 | 0.6% |
| Other Punctuation | 10 | 0.6% |
| Uppercase Letter | 3 | 0.2% |

Most frequent character per category

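Other Letter
| Value | Count | Frequency (%) |
|  | 78 | 7.1% |
|  | 77 | 7.0% |
|  | 76 | 6.9% |
|  | 74 | 6.7% |
|  | 74 | 6.7% |
|  | 74 | 6.7% |
|  | 74 | 6.7% |
|  | 47 | 4.3% |
|  | 47 | 4.3% |
|  | 41 | 3.7% |
| Other values (87) | 440 | 39.9% |

Decimal Number
| Value | Count | Frequency (%) |
| 1 | 53 | 19.0% |
| 4 | 38 | 13.6% |
| 3 | 35 | 12.5% |
| 0 | 32 | 11.5% |
| 2 | 27 | 9.7% |
| 8 | 26 | 9.3% |
| 6 | 24 | 8.6% |
| 7 | 17 | 6.1% |
| 5 | 17 | 6.1% |
| 9 | 10 | 3.6% |

Other Punctuation
| Value | Count | Frequency (%) |
| , | 9 | 90.0% |
| . | 1 | 10.0% |

Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | 66.7% |
| C | 1 | 33.3% |

Space Separator
| Value | Count | Frequency (%) |
| (space) | 336 | 100.0% |

Dash Punctuation
| Value | Count | Frequency (%) |
| - | 22 | 100.0% |

Open Punctuation
| Value | Count | Frequency (%) |
| ( | 11 | 100.0% |

Close Punctuation
| Value | Count | Frequency (%) |
| ) | 11 | 100.0% |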

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 1102 | 62.1% |
| Common | 669 | 37.7% |
| Latin | 3 | 0.2% |

Most frequent character per script

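Hangul
| Value | Count | Frequency (%) |
|  | 78 | 7.1% |
|  | 77 | 7.0% |
|  | 76 | 6.9% |
|  | 74 | 6.7% |
|  | 74 | 6.7% |
|  | 74 | 6.7% |
|  | 74 | 6.7% |
|  | 47 | 4.3% |
|  | 47 | 4.3% |
|  | 41 | 3.7% |
| Other values (87) | 440 | 39.9% |

Common
| Value | Count | Frequency (%) |
| (space) | 336 | 50.2% |
| 1 | 53 | 7.9% |
| 4 | 38 | 5.7% |
| 3 | 35 | 5.2% |
| 0 | 32 | 4.8% |
| 2 | 27 | 4.0% |
| 8 | 26 | 3.9% |
| 6 | 24 | 3.6% |
| - | 22 | 3.3% |
| 7 | 17 | 2.5% |
| Other values (6) | 59 | 8.8% |

Latin
| Value | Count | Frequency (%) |
| A | 2 | 66.7% |
| C | 1 | 33.3% |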

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 1102 | 62.1% |
| ASCII | 672 | 37.9% |

Most frequent character per block

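ASCII
| Value | Count | Frequency (%) |
| (space) | 336 | 50.0% |
| 1 | 53 | 7.9% |
| 4 | 38 | 5.7% |
| 3 | 35 | 5.2% |
| 0 | 32 | 4.8% |
| 2 | 27 | 4.0% |
| 8 | 26 | 3.9% |
| 6 | 24 | 3.6% |
| - | 22 | 3.3% |
| 7 | 17 | 2.5% |
| Other values (8) | 62 | 9.2% |

Hangul
| Value | Count | Frequency (%) |
|  | 78 | 7.1% |
|  | 77 | 7.0% |
|  | 76 | 6.9% |
|  | 74 | 6.7% |
|  | 74 | 6.7% |
|  | 74 | 6.7% |
|  | 74 | 6.7% |
|  | 47 | 4.3% |
|  | 47 | 4.3% |
|  | 41 | 3.7% |
| Other values (87) | 440 | 39.9% |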

취급가스정보 (handled gas information)

Distinct: 61
Distinct (%): 82.4%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B

Length

Max length: 232
Median length: 35
Mean length: 20.864865
Min length: 6

Characters and Unicode

Total characters: 1544
Distinct characters: 43
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 50
Unique (%): 67.6%

Sample

1st row: 산소 (60.141톤)
2nd row: 산소 (10.2Kg), 질소 (30Kg/㎠)
3rd row: 질소 (14.54톤)
4th row: 산소 (12.093톤)
5th row: 산소 (15.416톤), 탄산가스 (14.96톤)

| Value | Count | Frequency (%) |
| 0kg/㎠ | 43 | 15.8% |
| 산소 | 34 | 12.5% |
| 기타 | 31 | 11.4% |
|  | 25 | 9.2% |
| 탄산가스 | 21 | 7.7% |
| 질소 | 19 | 7.0% |
| 아르곤 | 11 | 4.0% |
| 수소 | 11 | 4.0% |
| 액화암모니아 | 7 | 2.6% |
| 0 | 6 | 2.2% |
| Other values (54) | 64 | 23.5% |

Most occurring characters

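| Value | Count | Frequency (%) |
| (space) | 272 | 17.6% |
| ( | 136 | 8.8% |
| ) | 136 | 8.8% |
| 0 | 80 | 5.2% |
| K | 68 | 4.4% |
| g | 68 | 4.4% |
| / | 66 | 4.3% |
|  | 66 | 4.3% |
|  | 64 | 4.1% |
| , | 62 | 4.0% |
| Other values (33) | 526 | 34.1% |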

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 381 | 24.7% |
| Space Separator | 272 | 17.6% |
| Decimal Number | 249 | 16.1% |
| Other Punctuation | 155 | 10.0% |
| Open Punctuation | 136 | 8.8% |
| Close Punctuation | 136 | 8.8% |
| Other Symbol | 79 | 5.1% |
| Uppercase Letter | 68 | 4.4% |
| Lowercase Letter | 68 | 4.4% |

Most frequent character per category

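Other Letter
| Value | Count | Frequency (%) |
|  | 64 | 16.8% |
|  | 55 | 14.4% |
|  | 31 | 8.1% |
|  | 31 | 8.1% |
|  | 24 | 6.3% |
|  | 22 | 5.8% |
|  | 22 | 5.8% |
|  | 21 | 5.5% |
|  | 19 | 5.0% |
|  | 19 | 5.0% |
| Other values (13) | 73 | 19.2% |

Decimal Number
| Value | Count | Frequency (%) |
| 0 | 80 | 32.1% |
| 1 | 35 | 14.1% |
| 9 | 24 | 9.6% |
| 4 | 22 | 8.8% |
| 2 | 19 | 7.6% |
| 6 | 17 | 6.8% |
| 7 | 15 | 6.0% |
| 8 | 14 | 5.6% |
| 3 | 12 | 4.8% |
| 5 | 11 | 4.4% |

Other Punctuation
| Value | Count | Frequency (%) |
| / | 66 | 42.6% |
| , | 62 | 40.0% |
| . | 27 | 17.4% |

Other Symbol
| Value | Count | Frequency (%) |
|  | 66 | 83.5% |
|  | 13 | 16.5% |

Space Separator
| Value | Count | Frequency (%) |
| (space) | 272 | 100.0% |

Open Punctuation
| Value | Count | Frequency (%) |
| ( | 136 | 100.0% |

Close Punctuation
| Value | Count | Frequency (%) |
| ) | 136 | 100.0% |

Uppercase Letter
| Value | Count | Frequency (%) |
| K | 68 | 100.0% |

Lowercase Letter
| Value | Count | Frequency (%) |
| g | 68 | 100.0% |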

Most occurring scripts

| Value | Count | Frequency (%) |
| Common | 1027 | 66.5% |
| Hangul | 381 | 24.7% |
| Latin | 136 | 8.8% |

Most frequent character per script

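Hangul
| Value | Count | Frequency (%) |
|  | 64 | 16.8% |
|  | 55 | 14.4% |
|  | 31 | 8.1% |
|  | 31 | 8.1% |
|  | 24 | 6.3% |
|  | 22 | 5.8% |
|  | 22 | 5.8% |
|  | 21 | 5.5% |
|  | 19 | 5.0% |
|  | 19 | 5.0% |
| Other values (13) | 73 | 19.2% |

Common
| Value | Count | Frequency (%) |
| (space) | 272 | 26.5% |
| ( | 136 | 13.2% |
| ) | 136 | 13.2% |
| 0 | 80 | 7.8% |
| / | 66 | 6.4% |
|  | 66 | 6.4% |
| , | 62 | 6.0% |
| 1 | 35 | 3.4% |
| . | 27 | 2.6% |
| 9 | 24 | 2.3% |
| Other values (8) | 123 | 12.0% |

Latin
| Value | Count | Frequency (%) |
| K | 68 | 50.0% |
| g | 68 | 50.0% |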

Most occurring blocks

| Value | Count | Frequency (%) |
| ASCII | 1084 | 70.2% |
| Hangul | 381 | 24.7% |
| CJK Compat | 79 | 5.1% |

Most frequent character per block

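ASCII
| Value | Count | Frequency (%) |
| (space) | 272 | 25.1% |
| ( | 136 | 12.5% |
| ) | 136 | 12.5% |
| 0 | 80 | 7.4% |
| K | 68 | 6.3% |
| g | 68 | 6.3% |
| / | 66 | 6.1% |
| , | 62 | 5.7% |
| 1 | 35 | 3.2% |
| . | 27 | 2.5% |
| Other values (8) | 134 | 12.4% |

CJK Compat
| Value | Count | Frequency (%) |
|  | 66 | 83.5% |
|  | 13 | 16.5% |

Hangul
| Value | Count | Frequency (%) |
|  | 64 | 16.8% |
|  | 55 | 14.4% |
|  | 31 | 8.1% |
|  | 31 | 8.1% |
|  | 24 | 6.3% |
|  | 22 | 5.8% |
|  | 22 | 5.8% |
|  | 21 | 5.5% |
|  | 19 | 5.0% |
|  | 19 | 5.0% |
| Other values (13) | 73 | 19.2% |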

Correlations

|  | 법인명(상호) | 사업소소재지 | 취급가스정보 |
| 법인명(상호) | 1.000 | 0.996 | 0.365 |
| 사업소소재지 | 0.996 | 1.000 | 0.816 |
| 취급가스정보 | 0.365 | 0.816 | 1.000 |
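
The report does not name the association measure behind these coefficients. One common choice for pairs of categorical columns is Cramér's V, which ranges from 0 (no association) to 1 (each value of one column determines the other). The sketch below is an illustration under that assumption, not the report's own computation: it implements the plain, uncorrected statistic, so its results need not match the table, and the function name `cramers_v` is hypothetical.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equally long sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed cell counts
    row = Counter(xs)              # marginal counts for xs
    col = Counter(ys)              # marginal counts for ys
    chi2 = 0.0
    for x, row_total in row.items():
        for y, col_total in col.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row), len(col)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return math.sqrt(chi2 / (n * k))

perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
```

With 74 rows and mostly distinct text values, a measure like this tends to approach 1 regardless of any real relationship, so high coefficients between such columns deserve caution.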

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

|  | 법인명(상호) | 사업소소재지 | 취급가스정보 |
| 0 | (주)수석 | 충청남도 당진시 합덕읍 인더스파크로 21 | 산소 (60.141톤) |
| 1 | (주)하이테크이엔브이 | 충청남도 당진시 정미면 4.4만세로 574 | 산소 (10.2Kg), 질소 (30Kg/㎠) |
| 2 | 엔씨케이(주) | 충청남도 당진시 송산면 동곡리 381-5 | 질소 (14.54톤) |
| 3 | 당진종합병원 | 충청남도 당진시 반촌로 5-15, 당진종합병원 (시곡동) | 산소 (12.093톤) |
| 4 | (주)신한씨에스 | 충청남도 당진시 합덕읍 면천로 1339 | 산소 (15.416톤), 탄산가스 (14.96톤) |
| 5 | 엔아이스틸(주) | 충청남도 당진시 송악읍 순성로 777-57, 한두철강(주) | 산소 (4.896톤), 탄산가스 (4.899톤) |
| 6 | 대아에너지(주) | 충청남도 당진시 석문면 산단3로10길 72 | 질소 () |
| 7 | (주)대상이엔지 | 충청남도 당진시 석문면 통정리 1322 | 산소 (), 아르곤 (), 탄산가스 () |
| 8 | 당진파머스 | 충청남도 당진시 석문면 교로길 30, 관사 | 액화암모니아 () |
| 9 | 삼남엔지니어링(주) | 충청남도 당진시 신평면 신평로 646 | 산소 (), 탄산가스 () |

Last rows

|  | 법인명(상호) | 사업소소재지 | 취급가스정보 |
| 64 | (주)휴스틸 | 충청남도 당진시 송악읍 부곡공단로 131 | 질소 (1397Kg/㎠) |
| 65 | 우신공업(주) | 충청남도 당진시 면천면 아미로 94 | 산소 (8Kg/㎠), 아르곤 (0Kg/㎠) |
| 66 | 당진에너지 | 충청남도 당진시 정미면 정미로 569 | 수소 (0Kg/㎠) |
| 67 | 국민종합가스 | 충청남도 당진시 송산면 동곡리 166-28 | 수소 (0Kg/㎠) |
| 68 | 대정종합가스 | 충청남도 당진시 합덕읍 성동리 273 | 수소 (0Kg/㎠) |
| 69 | 동우에이치에스티(주) | 충청남도 당진시 순성면 틀모시로 46-5 | 액화암모니아 (0톤), 질소 (0톤) |
| 70 | 생고뱅이소바코리아(주) | 충청남도 당진시 송악읍 부곡공단1길 70 | 산소 (0Kg/㎠) |
| 71 | (주)원당철강 | 충청남도 당진시 면천면 산업단지길 40 | 산소 (7Kg/㎠), 탄산가스 () |
| 72 | 신평현대가스 | 충청남도 당진시 신평면 금천리 산 345-40 | 산소 (0Kg/㎠) |
| 73 | 당진산소 | 충청남도 당진시 송악읍 송악로 418 | 산소 (0Kg/㎠) |
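
The 취급가스정보 values in these rows pack one or more `gas (amount unit)` entries into a single string, and some parentheses are empty. A minimal sketch for splitting such a string into structured entries follows. The function name `parse_gas_info` and both regular expressions are illustrative assumptions based only on the patterns visible in the sample rows, so other rows in the dataset may need extra handling.

```python
import re

# One entry looks like "<gas name> (<amount><unit>)"; the parentheses may be empty.
ENTRY = re.compile(r"\s*([^,()]+?)\s*\(([^)]*)\)")
# Inside the parentheses: a decimal number followed by an optional unit.
AMOUNT = re.compile(r"^\s*(\d+(?:\.\d+)?)\s*(.*?)\s*$")

def parse_gas_info(text):
    """Return a list of (gas name, amount or None, unit) tuples."""
    entries = []
    for name, inside in ENTRY.findall(text):
        match = AMOUNT.match(inside)
        if match:
            entries.append((name.strip(), float(match.group(1)), match.group(2)))
        else:
            # Empty parentheses: the amount is not stated in the source row.
            entries.append((name.strip(), None, ""))
    return entries

two_gases = parse_gas_info("산소 (10.2Kg), 질소 (30Kg/㎠)")
no_amount = parse_gas_info("질소 ()")
```

Splitting the field this way would let amounts be compared per gas and per unit instead of being profiled as free text.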