Overview

Dataset statistics

Number of variables5
Number of observations36
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory44.7 B

Variable types

Text3
Numeric1
Categorical1

Dataset

Description대구광역시 서구 내에 위치한 주유소들의 정보가 포함된 데이터 자료이다. 칼럼으로는 사업자명, 도로명주소, 면적, 전화번호 등을 포함하고 있다.
URLhttps://www.data.go.kr/data/15088627/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
사업장명 has unique valuesUnique
소재지도로명주소 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 03:13:29.132363
Analysis finished2023-12-12 03:13:29.520516
Duration0.39 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업장명
Text

UNIQUE 

Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-12T12:13:29.671775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length18
Mean length7.75
Min length4

Characters and Unicode

Total characters279
Distinct characters83
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row화성제일주유소
2nd row미리내주유소
3rd row아름다운주유소
4th row새방골주유소
5th row대영셀프주유소
ValueCountFrequency (%)
지에스칼텍스(주 2
 
4.9%
화성제일주유소 1
 
2.4%
달구벌대로주유소 1
 
2.4%
광명주유소 1
 
2.4%
나혜주유소 1
 
2.4%
주)세아에너지 1
 
2.4%
행복제1주유소 1
 
2.4%
주)대일종합에너지 1
 
2.4%
서부지점 1
 
2.4%
행복주유소 1
 
2.4%
Other values (30) 30
73.2%
2023-12-12T12:13:30.058275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
40
 
14.3%
34
 
12.2%
33
 
11.8%
9
 
3.2%
) 7
 
2.5%
7
 
2.5%
( 7
 
2.5%
6
 
2.2%
6
 
2.2%
5
 
1.8%
Other values (73) 125
44.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 252
90.3%
Close Punctuation 7
 
2.5%
Open Punctuation 7
 
2.5%
Space Separator 5
 
1.8%
Uppercase Letter 4
 
1.4%
Decimal Number 3
 
1.1%
Lowercase Letter 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
 
15.9%
34
 
13.5%
33
 
13.1%
9
 
3.6%
7
 
2.8%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
Other values (65) 103
40.9%
Decimal Number
ValueCountFrequency (%)
2 2
66.7%
1 1
33.3%
Uppercase Letter
ValueCountFrequency (%)
C 2
50.0%
I 2
50.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 252
90.3%
Common 22
 
7.9%
Latin 5
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
 
15.9%
34
 
13.5%
33
 
13.1%
9
 
3.6%
7
 
2.8%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
Other values (65) 103
40.9%
Common
ValueCountFrequency (%)
) 7
31.8%
( 7
31.8%
5
22.7%
2 2
 
9.1%
1 1
 
4.5%
Latin
ValueCountFrequency (%)
C 2
40.0%
I 2
40.0%
e 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 252
90.3%
ASCII 27
 
9.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
40
 
15.9%
34
 
13.5%
33
 
13.1%
9
 
3.6%
7
 
2.8%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
4
 
1.6%
Other values (65) 103
40.9%
ASCII
ValueCountFrequency (%)
) 7
25.9%
( 7
25.9%
5
18.5%
2 2
 
7.4%
C 2
 
7.4%
I 2
 
7.4%
1 1
 
3.7%
e 1
 
3.7%
Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-12T12:13:30.322483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length25
Mean length22.916667
Min length21

Characters and Unicode

Total characters825
Distinct characters52
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row대구광역시 서구 국채보상로 355-1 (비산동)
2nd row대구광역시 서구 새방로 101 (상리동)
3rd row대구광역시 서구 달서로 67 (비산동)
4th row대구광역시 서구 국채보상로 43 (이현동)
5th row대구광역시 서구 팔달로 78 (비산동)
ValueCountFrequency (%)
대구광역시 36
20.0%
서구 36
20.0%
평리동 14
 
7.8%
북비산로 10
 
5.6%
이현동 8
 
4.4%
서대구로 6
 
3.3%
국채보상로 6
 
3.3%
비산동 5
 
2.8%
중리동 4
 
2.2%
와룡로 3
 
1.7%
Other values (45) 52
28.9%
2023-12-12T12:13:30.783862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
144
17.5%
80
 
9.7%
45
 
5.5%
44
 
5.3%
36
 
4.4%
36
 
4.4%
36
 
4.4%
) 36
 
4.4%
36
 
4.4%
36
 
4.4%
Other values (42) 296
35.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 505
61.2%
Space Separator 144
 
17.5%
Decimal Number 103
 
12.5%
Close Punctuation 36
 
4.4%
Open Punctuation 36
 
4.4%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
80
15.8%
45
 
8.9%
44
 
8.7%
36
 
7.1%
36
 
7.1%
36
 
7.1%
36
 
7.1%
36
 
7.1%
22
 
4.4%
17
 
3.4%
Other values (28) 117
23.2%
Decimal Number
ValueCountFrequency (%)
1 24
23.3%
7 13
12.6%
3 11
10.7%
2 11
10.7%
4 10
9.7%
0 10
9.7%
8 8
 
7.8%
5 8
 
7.8%
6 5
 
4.9%
9 3
 
2.9%
Space Separator
ValueCountFrequency (%)
144
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 505
61.2%
Common 320
38.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
80
15.8%
45
 
8.9%
44
 
8.7%
36
 
7.1%
36
 
7.1%
36
 
7.1%
36
 
7.1%
36
 
7.1%
22
 
4.4%
17
 
3.4%
Other values (28) 117
23.2%
Common
ValueCountFrequency (%)
144
45.0%
) 36
 
11.2%
( 36
 
11.2%
1 24
 
7.5%
7 13
 
4.1%
3 11
 
3.4%
2 11
 
3.4%
4 10
 
3.1%
0 10
 
3.1%
8 8
 
2.5%
Other values (4) 17
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 505
61.2%
ASCII 320
38.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
144
45.0%
) 36
 
11.2%
( 36
 
11.2%
1 24
 
7.5%
7 13
 
4.1%
3 11
 
3.4%
2 11
 
3.4%
4 10
 
3.1%
0 10
 
3.1%
8 8
 
2.5%
Other values (4) 17
 
5.3%
Hangul
ValueCountFrequency (%)
80
15.8%
45
 
8.9%
44
 
8.7%
36
 
7.1%
36
 
7.1%
36
 
7.1%
36
 
7.1%
36
 
7.1%
22
 
4.4%
17
 
3.4%
Other values (28) 117
23.2%

소재지면적
Real number (ℝ)

Distinct35
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean990.13056
Minimum414
Maximum1967
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size456.0 B
2023-12-12T12:13:31.098701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum414
5-th percentile530.75
Q1706
median958.85
Q31179.5
95-th percentile1566
Maximum1967
Range1553
Interquartile range (IQR)473.5

Descriptive statistics

Standard deviation350.81176
Coefficient of variation (CV)0.35430859
Kurtosis0.29019258
Mean990.13056
Median Absolute Deviation (MAD)252.85
Skewness0.70176358
Sum35644.7
Variance123068.89
MonotonicityNot monotonic
2023-12-12T12:13:31.266223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
706.0 2
 
5.6%
414.0 1
 
2.8%
623.0 1
 
2.8%
813.0 1
 
2.8%
934.0 1
 
2.8%
669.0 1
 
2.8%
860.0 1
 
2.8%
1147.0 1
 
2.8%
972.0 1
 
2.8%
966.0 1
 
2.8%
Other values (25) 25
69.4%
ValueCountFrequency (%)
414.0 1
2.8%
527.0 1
2.8%
532.0 1
2.8%
622.0 1
2.8%
623.0 1
2.8%
669.0 1
2.8%
686.0 1
2.8%
688.0 1
2.8%
706.0 2
5.6%
712.0 1
2.8%
ValueCountFrequency (%)
1967.0 1
2.8%
1623.0 1
2.8%
1547.0 1
2.8%
1477.0 1
2.8%
1407.0 1
2.8%
1376.0 1
2.8%
1298.0 1
2.8%
1288.0 1
2.8%
1238.0 1
2.8%
1160.0 1
2.8%

전화번호
Text

UNIQUE 

Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-12T12:13:31.547119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters432
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row053-523-5100
2nd row053-566-5182
3rd row053-557-0900
4th row053-573-5511
5th row053-357-0089
ValueCountFrequency (%)
053-523-5100 1
 
2.8%
053-566-5182 1
 
2.8%
053-356-1010 1
 
2.8%
053-555-5563 1
 
2.8%
053-556-5551 1
 
2.8%
053-551-8113 1
 
2.8%
053-356-1951 1
 
2.8%
053-571-8251 1
 
2.8%
053-566-1778 1
 
2.8%
053-555-6688 1
 
2.8%
Other values (26) 26
72.2%
2023-12-12T12:13:32.006968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 119
27.5%
- 72
16.7%
0 55
12.7%
3 54
12.5%
1 34
 
7.9%
6 26
 
6.0%
2 23
 
5.3%
8 19
 
4.4%
7 16
 
3.7%
9 9
 
2.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 360
83.3%
Dash Punctuation 72
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 119
33.1%
0 55
15.3%
3 54
15.0%
1 34
 
9.4%
6 26
 
7.2%
2 23
 
6.4%
8 19
 
5.3%
7 16
 
4.4%
9 9
 
2.5%
4 5
 
1.4%
Dash Punctuation
ValueCountFrequency (%)
- 72
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 432
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 119
27.5%
- 72
16.7%
0 55
12.7%
3 54
12.5%
1 34
 
7.9%
6 26
 
6.0%
2 23
 
5.3%
8 19
 
4.4%
7 16
 
3.7%
9 9
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 432
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 119
27.5%
- 72
16.7%
0 55
12.7%
3 54
12.5%
1 34
 
7.9%
6 26
 
6.0%
2 23
 
5.3%
8 19
 
4.4%
7 16
 
3.7%
9 9
 
2.1%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-06-05
36 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06-05
2nd row2023-06-05
3rd row2023-06-05
4th row2023-06-05
5th row2023-06-05

Common Values

ValueCountFrequency (%)
2023-06-05 36
100.0%

Length

2023-12-12T12:13:32.183972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:13:32.333796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-06-05 36
100.0%

Interactions

2023-12-12T12:13:29.284832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:13:32.440716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업장명소재지도로명주소소재지면적전화번호
사업장명1.0001.0001.0001.000
소재지도로명주소1.0001.0001.0001.000
소재지면적1.0001.0001.0001.000
전화번호1.0001.0001.0001.000

Missing values

2023-12-12T12:13:29.400358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:13:29.485994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명소재지도로명주소소재지면적전화번호데이터기준일자
0화성제일주유소대구광역시 서구 국채보상로 355-1 (비산동)414.0053-523-51002023-06-05
1미리내주유소대구광역시 서구 새방로 101 (상리동)1118.0053-566-51822023-06-05
2아름다운주유소대구광역시 서구 달서로 67 (비산동)706.0053-557-09002023-06-05
3새방골주유소대구광역시 서구 국채보상로 43 (이현동)686.0053-573-55112023-06-05
4대영셀프주유소대구광역시 서구 팔달로 78 (비산동)706.0053-357-00892023-06-05
5대원석유(주)이현대원 주유소대구광역시 서구 북비산로 107 (이현동)712.0053-525-51822023-06-05
6서대구터미널주유소대구광역시 서구 북비산로 49 (이현동)730.0053-527-51512023-06-05
7서대구IC주유소대구광역시 서구 북비산로 48 (이현동)1477.0053-566-00092023-06-05
8(주)기분좋은주유소대구광역시 서구 와룡로 420 (이현동)1407.0053-563-22222023-06-05
9서대구공단주유소대구광역시 서구 와룡로 358 (중리동)1298.0053-568-00512023-06-05
사업장명소재지도로명주소소재지면적전화번호데이터기준일자
26합천주유소대구광역시 서구 염색공단천로 78 (비산동)1147.0053-356-10102023-06-05
27명조주유소대구광역시 서구 평리로 156 (중리동)972.0053-555-66882023-06-05
28꽉주유소대구광역시 서구 국채보상로 317 (평리동)623.0053-565-51862023-06-05
29e편한주유소대구광역시 서구 평리로 317 (평리동)966.0053-556-99422023-06-05
30채움셀프주유소대구광역시 서구 달구벌대로 1833 (내당동)527.0053-526-55582023-06-05
31평리주유소대구광역시 서구 서대구로 227 (평리동)1238.0053-526-28282023-06-05
32베스트오일(주)브라보주유소대구광역시 서구 북비산로 95 (이현동)1160.0053-551-88512023-06-05
33이현공단주유소대구광역시 서구 국채보상로 181 (평리동)1067.0053-565-08892023-06-05
34달서주유소대구광역시 서구 달서천로 176 (평리동)532.0053-352-40012023-06-05
35태양주유소대구광역시 서구 국채보상로 372 (비산동)622.0053-573-83032023-06-05