Overview

Dataset statistics

Number of variables5
Number of observations35
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory44.8 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description경상북도 구미시 관내의 가스판매업소 현황에 대한 데이터로 업소의 상호 및 소재지, 전화 번호 정보 등을 제공하고 있습니다.
Author경상북도 구미시
URLhttps://www.data.go.kr/data/3069584/fileData.do

Alerts

번호 has unique valuesUnique
상호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:40:42.168153
Analysis finished2023-12-12 14:40:42.711260
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18
Minimum1
Maximum35
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size447.0 B
2023-12-12T23:40:42.782419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.7
Q19.5
median18
Q326.5
95-th percentile33.3
Maximum35
Range34
Interquartile range (IQR)17

Descriptive statistics

Standard deviation10.246951
Coefficient of variation (CV)0.56927504
Kurtosis-1.2
Mean18
Median Absolute Deviation (MAD)9
Skewness0
Sum630
Variance105
MonotonicityStrictly increasing
2023-12-12T23:40:42.941843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
1 1
 
2.9%
2 1
 
2.9%
21 1
 
2.9%
22 1
 
2.9%
23 1
 
2.9%
24 1
 
2.9%
25 1
 
2.9%
26 1
 
2.9%
27 1
 
2.9%
28 1
 
2.9%
Other values (25) 25
71.4%
ValueCountFrequency (%)
1 1
2.9%
2 1
2.9%
3 1
2.9%
4 1
2.9%
5 1
2.9%
6 1
2.9%
7 1
2.9%
8 1
2.9%
9 1
2.9%
10 1
2.9%
ValueCountFrequency (%)
35 1
2.9%
34 1
2.9%
33 1
2.9%
32 1
2.9%
31 1
2.9%
30 1
2.9%
29 1
2.9%
28 1
2.9%
27 1
2.9%
26 1
2.9%

상호
Text

UNIQUE 

Distinct35
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-12T23:40:43.173601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length10
Mean length5.8857143
Min length4

Characters and Unicode

Total characters206
Distinct characters72
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)100.0%

Sample

1st row경북가스
2nd row경북산업에너지가스
3rd row국제신라가스
4th row구미연합가스
5th row국제신라종합가스
ValueCountFrequency (%)
주식회사 2
 
5.4%
경북가스 1
 
2.7%
유성가스 1
 
2.7%
현대가스 1
 
2.7%
한양가스 1
 
2.7%
대원가스텍 1
 
2.7%
구미종합가스 1
 
2.7%
중앙가스 1
 
2.7%
금오상신종합가스 1
 
2.7%
한일에너지 1
 
2.7%
Other values (26) 26
70.3%
2023-12-12T23:40:43.557133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30
 
14.6%
30
 
14.6%
10
 
4.9%
9
 
4.4%
6
 
2.9%
6
 
2.9%
6
 
2.9%
5
 
2.4%
5
 
2.4%
5
 
2.4%
Other values (62) 94
45.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 199
96.6%
Space Separator 2
 
1.0%
Uppercase Letter 2
 
1.0%
Other Symbol 1
 
0.5%
Open Punctuation 1
 
0.5%
Close Punctuation 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
15.1%
30
 
15.1%
10
 
5.0%
9
 
4.5%
6
 
3.0%
6
 
3.0%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
Other values (56) 87
43.7%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
K 1
50.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 200
97.1%
Common 4
 
1.9%
Latin 2
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
15.0%
30
 
15.0%
10
 
5.0%
9
 
4.5%
6
 
3.0%
6
 
3.0%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
Other values (57) 88
44.0%
Common
ValueCountFrequency (%)
2
50.0%
( 1
25.0%
) 1
25.0%
Latin
ValueCountFrequency (%)
S 1
50.0%
K 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 199
96.6%
ASCII 6
 
2.9%
None 1
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
30
 
15.1%
30
 
15.1%
10
 
5.0%
9
 
4.5%
6
 
3.0%
6
 
3.0%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
Other values (56) 87
43.7%
ASCII
ValueCountFrequency (%)
2
33.3%
S 1
16.7%
K 1
16.7%
( 1
16.7%
) 1
16.7%
None
ValueCountFrequency (%)
1
100.0%
Distinct28
Distinct (%)80.0%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-12T23:40:43.820050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length25
Mean length21.657143
Min length20

Characters and Unicode

Total characters758
Distinct characters66
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)71.4%

Sample

1st row경상북도 구미시 고아읍 대평20길 17
2nd row경상북도 구미시 고아읍 선산대로 518
3rd row경상북도 구미시 고아읍 선산대로 1211
4th row경상북도 구미시 고아읍 선산대로 518
5th row경상북도 구미시 고아읍 선산대로 518
ValueCountFrequency (%)
경상북도 35
21.5%
구미시 35
21.5%
고아읍 12
 
7.4%
선산대로 8
 
4.9%
518 6
 
3.7%
장천면 6
 
3.7%
강동로 6
 
3.7%
617 2
 
1.2%
임은5길 2
 
1.2%
4-10(임은동 2
 
1.2%
Other values (48) 49
30.1%
2023-12-12T23:40:44.642152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
128
 
16.9%
38
 
5.0%
36
 
4.7%
36
 
4.7%
36
 
4.7%
35
 
4.6%
35
 
4.6%
35
 
4.6%
1 35
 
4.6%
24
 
3.2%
Other values (56) 320
42.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 474
62.5%
Space Separator 128
 
16.9%
Decimal Number 121
 
16.0%
Close Punctuation 13
 
1.7%
Open Punctuation 13
 
1.7%
Dash Punctuation 9
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
8.0%
36
 
7.6%
36
 
7.6%
36
 
7.6%
35
 
7.4%
35
 
7.4%
35
 
7.4%
24
 
5.1%
21
 
4.4%
16
 
3.4%
Other values (42) 162
34.2%
Decimal Number
ValueCountFrequency (%)
1 35
28.9%
5 16
13.2%
8 14
 
11.6%
2 13
 
10.7%
6 10
 
8.3%
3 10
 
8.3%
4 8
 
6.6%
7 7
 
5.8%
0 6
 
5.0%
9 2
 
1.7%
Space Separator
ValueCountFrequency (%)
128
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 474
62.5%
Common 284
37.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
8.0%
36
 
7.6%
36
 
7.6%
36
 
7.6%
35
 
7.4%
35
 
7.4%
35
 
7.4%
24
 
5.1%
21
 
4.4%
16
 
3.4%
Other values (42) 162
34.2%
Common
ValueCountFrequency (%)
128
45.1%
1 35
 
12.3%
5 16
 
5.6%
8 14
 
4.9%
2 13
 
4.6%
) 13
 
4.6%
( 13
 
4.6%
6 10
 
3.5%
3 10
 
3.5%
- 9
 
3.2%
Other values (4) 23
 
8.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 474
62.5%
ASCII 284
37.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
128
45.1%
1 35
 
12.3%
5 16
 
5.6%
8 14
 
4.9%
2 13
 
4.6%
) 13
 
4.6%
( 13
 
4.6%
6 10
 
3.5%
3 10
 
3.5%
- 9
 
3.2%
Other values (4) 23
 
8.1%
Hangul
ValueCountFrequency (%)
38
 
8.0%
36
 
7.6%
36
 
7.6%
36
 
7.6%
35
 
7.4%
35
 
7.4%
35
 
7.4%
24
 
5.1%
21
 
4.4%
16
 
3.4%
Other values (42) 162
34.2%
Distinct33
Distinct (%)94.3%
Missing0
Missing (%)0.0%
Memory size412.0 B
2023-12-12T23:40:44.886843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters420
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)88.6%

Sample

1st row054-453-6633
2nd row054-481-8900
3rd row054-481-3333
4th row054-458-8888
5th row054-452-7777
ValueCountFrequency (%)
054-462-1555 2
 
5.7%
054-456-7777 2
 
5.7%
054-453-6633 1
 
2.9%
054-463-8600 1
 
2.9%
054-473-8282 1
 
2.9%
054-471-7733 1
 
2.9%
054-472-8000 1
 
2.9%
054-464-7778 1
 
2.9%
054-464-8285 1
 
2.9%
054-472-9797 1
 
2.9%
Other values (23) 23
65.7%
2023-12-12T23:40:45.304634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 88
21.0%
- 70
16.7%
5 65
15.5%
0 55
13.1%
7 36
8.6%
8 22
 
5.2%
2 19
 
4.5%
3 19
 
4.5%
6 16
 
3.8%
1 15
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 350
83.3%
Dash Punctuation 70
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 88
25.1%
5 65
18.6%
0 55
15.7%
7 36
10.3%
8 22
 
6.3%
2 19
 
5.4%
3 19
 
5.4%
6 16
 
4.6%
1 15
 
4.3%
9 15
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 70
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 420
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 88
21.0%
- 70
16.7%
5 65
15.5%
0 55
13.1%
7 36
8.6%
8 22
 
5.2%
2 19
 
4.5%
3 19
 
4.5%
6 16
 
3.8%
1 15
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 420
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 88
21.0%
- 70
16.7%
5 65
15.5%
0 55
13.1%
7 36
8.6%
8 22
 
5.2%
2 19
 
4.5%
3 19
 
4.5%
6 16
 
3.8%
1 15
 
3.6%

비고
Categorical

Distinct2
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size412.0 B
LPG
22 
LPG, 고압가스
13 

Length

Max length9
Median length3
Mean length5.2285714
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowLPG
2nd rowLPG
3rd rowLPG
4th rowLPG
5th rowLPG, 고압가스

Common Values

ValueCountFrequency (%)
LPG 22
62.9%
LPG, 고압가스 13
37.1%

Length

2023-12-12T23:40:45.490073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:40:45.608103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
lpg 35
72.9%
고압가스 13
 
27.1%

Interactions

2023-12-12T23:40:42.456683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:40:45.702890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호상호소재지전화번호비고
번호1.0001.0000.8390.9630.427
상호1.0001.0001.0001.0001.000
소재지0.8391.0001.0000.8620.879
전화번호0.9631.0000.8621.0000.623
비고0.4271.0000.8790.6231.000
2023-12-12T23:40:45.834947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호비고
번호1.0000.398
비고0.3981.000

Missing values

2023-12-12T23:40:42.576577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:40:42.677317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호상호소재지전화번호비고
01경북가스경상북도 구미시 고아읍 대평20길 17054-453-6633LPG
12경북산업에너지가스경상북도 구미시 고아읍 선산대로 518054-481-8900LPG
23국제신라가스경상북도 구미시 고아읍 선산대로 1211054-481-3333LPG
34구미연합가스경상북도 구미시 고아읍 선산대로 518054-458-8888LPG
45국제신라종합가스경상북도 구미시 고아읍 선산대로 518054-452-7777LPG, 고압가스
56동양종합가스경상북도 구미시 고아읍 봉한4안길 15-7054-455-5599LPG, 고압가스
67린나이종합가스경상북도 구미시 고아읍 선산대로 518054-444-9444LPG
78동부가스경상북도 구미시 고아읍 선산대로 1215054-481-8399LPG
89시민삼성가스경상북도 구미시 고아읍 황산길 116054-482-7221LPG
910유성가스경상북도 구미시 고아읍 선산대로 518054-442-4000LPG
번호상호소재지전화번호비고
2526금오상신종합가스경상북도 구미시 임은5길 4-10(임은동)054-463-5335LPG, 고압가스
2627한진좋은종합가스경상북도 구미시 임은5길 4-10(임은동)054-464-7778LPG, 고압가스
2728인동가스경상북도 구미시 수출대로23길 21(황상동)054-472-8000LPG
2829한국가스경상북도 구미시 수출대로28길 6-1(인의동)054-471-7733LPG
2930신라에너지경상북도 구미시 거양길 280(양호동)054-456-7777LPG, 고압가스
3031대명종합가스경상북도 구미시 거양8길 15(거의동)054-473-8282LPG, 고압가스
3132태양에너지경상북도 구미시 장천면 강동로 581054-462-1555LPG
3233바이오가스솔루션 주식회사경상북도 구미시 장천면 강동로 625054-462-1555LPG, 고압가스
3334㈜신라충전소경상북도 구미시 낙동대로 830(하장리 762-1)054-456-7777LPG, 고압가스
3435한국가스텍경상북도 구미시 장천면 강동로 583054-455-0111LPG