Overview

Dataset statistics

Number of variables: 4
Number of observations: 47
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 KiB
Average record size in memory: 35.8 B

Variable types

Numeric: 1
Categorical: 1
Text: 2

Dataset

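Description: Information file listing the business name, address, and related details of establishments permitted to sell petroleum in Seo-gu, Daejeon: 40 gas stations (주유소) and 7 general retail outlets (일반판매소).
Author: Seo-gu, Daejeon Metropolitan City (대전광역시 서구)
URL: https://www.data.go.kr/data/15008933/fileData.do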

Alerts

연번 is highly overall correlated with 사업구분 (High correlation)
사업구분 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
영업소소재지(도로명) has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 05:28:21.396957
Analysis finished: 2023-12-12 05:28:21.972479
Duration: 0.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
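
The reported duration can be checked directly from the two timestamps above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the "Analysis started" / "Analysis finished" lines.
started = datetime.fromisoformat("2023-12-12 05:28:21.396957")
finished = datetime.fromisoformat("2023-12-12 05:28:21.972479")

# Elapsed time in seconds, formatted to two decimals as in the report.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```

The difference is 0.575522 seconds, which rounds to the 0.58 seconds shown.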

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 47
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 24.425532
Minimum: 1
Maximum: 49
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 555.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.3
Q1: 12.5
median: 24
Q3: 36.5
95-th percentile: 46.7
Maximum: 49
Range: 48
Interquartile range (IQR): 24
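
These quantiles are consistent with linear interpolation between neighbouring order statistics. The sketch below assumes the 연번 column holds the integers 1 to 49 with 35 and 42 absent. That reconstruction is an inference from the reported sum (1148), the listed extreme values, and the sample rows; the report does not state it. The helper name `quantile_linear` is hypothetical.

```python
import math

# Assumed reconstruction of the 연번 column: 1..49 without 35 and 42
# (inferred from the report's statistics, not stated in it).
values = sorted(v for v in range(1, 50) if v not in (35, 42))

def quantile_linear(sorted_vals, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_vals) - 1)
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    frac = pos - lo
    return sorted_vals[lo] + frac * (sorted_vals[hi] - sorted_vals[lo])

q1 = quantile_linear(values, 0.25)
q3 = quantile_linear(values, 0.75)
summary = {
    "5%": quantile_linear(values, 0.05),
    "Q1": q1,
    "median": quantile_linear(values, 0.50),
    "Q3": q3,
    "95%": quantile_linear(values, 0.95),
    "range": values[-1] - values[0],
    "IQR": q3 - q1,
}
for name, val in summary.items():
    print(f"{name}: {val:.1f}")
```

Under that assumption, every figure in the quantile block is reproduced.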

Descriptive statistics

Standard deviation: 14.291598
Coefficient of variation (CV): 0.58510897
Kurtosis: -1.165065
Mean: 24.425532
Median Absolute Deviation (MAD): 12
Skewness: 0.080024299
Sum: 1148
Variance: 204.24977
Monotonicity: Strictly increasing
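
Most of these descriptive statistics can be recomputed with the Python standard library. The sketch below again assumes the 연번 column is the integers 1 to 49 without 35 and 42. That is an inference from the reported sum and extreme values, not a fact stated by the report. Skewness and kurtosis are left out because their exact estimator is not given here.

```python
import statistics

# Assumed reconstruction of the 연번 column (see the lead-in): 47 values.
values = [v for v in range(1, 50) if v not in (35, 42)]

total = sum(values)
mean = statistics.fmean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, round(mean, 6), round(variance, 5), round(std, 6), round(cv, 8), mad)
```

The agreement with the reported variance suggests the report uses the sample (n - 1) form rather than the population form.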
Histogram with fixed size bins (bins=47)
Common values

Value | Count | Frequency (%)
1 | 1 | 2.1%
2 | 1 | 2.1%
27 | 1 | 2.1%
28 | 1 | 2.1%
29 | 1 | 2.1%
30 | 1 | 2.1%
31 | 1 | 2.1%
32 | 1 | 2.1%
33 | 1 | 2.1%
34 | 1 | 2.1%
Other values (37) | 37 | 78.7%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 2.1%
2 | 1 | 2.1%
3 | 1 | 2.1%
4 | 1 | 2.1%
5 | 1 | 2.1%
6 | 1 | 2.1%
7 | 1 | 2.1%
8 | 1 | 2.1%
9 | 1 | 2.1%
10 | 1 | 2.1%

Maximum 10 values

Value | Count | Frequency (%)
49 | 1 | 2.1%
48 | 1 | 2.1%
47 | 1 | 2.1%
46 | 1 | 2.1%
45 | 1 | 2.1%
44 | 1 | 2.1%
43 | 1 | 2.1%
41 | 1 | 2.1%
40 | 1 | 2.1%
39 | 1 | 2.1%

사업구분
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Memory size: 508.0 B
주유소: 40
일반판매소: 7
Length

Max length: 5
Median length: 3
Mean length: 3.2978723
Min length: 3
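
The length statistics follow from the two category labels and their counts. The sketch below rebuilds the column from the counts reported under Common Values (40 and 7) and measures length in Unicode code points. It is an illustration, not the profiler's own code.

```python
import statistics

# Category counts taken from the report: 40 x "주유소", 7 x "일반판매소".
column = ["주유소"] * 40 + ["일반판매소"] * 7

# len() on a Python str counts code points, so each Hangul syllable counts as 1.
lengths = [len(s) for s in column]

length_stats = {
    "max": max(lengths),
    "median": statistics.median(lengths),
    "mean": statistics.fmean(lengths),
    "min": min(lengths),
}
print(length_stats)
```

The mean is (40 × 3 + 7 × 5) / 47 = 155 / 47 ≈ 3.2978723, which matches the reported value.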

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 주유소
2nd row: 주유소
3rd row: 주유소
4th row: 주유소
5th row: 주유소

Common Values

Value | Count | Frequency (%)
주유소 | 40 | 85.1%
일반판매소 | 7 | 14.9%
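
The frequency column is each count divided by the 47 observations. A minimal sketch with the standard library, using the counts from the table above:

```python
from collections import Counter

# Rebuild the categorical column from the reported counts.
column = ["주유소"] * 40 + ["일반판매소"] * 7

counts = Counter(column)
total = sum(counts.values())

# Percentage share of each category, rounded to one decimal as in the report.
frequency_pct = {
    value: round(100 * n / total, 1) for value, n in counts.most_common()
}
print(frequency_pct)
```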

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
주유소 | 40 | 85.1%
일반판매소 | 7 | 14.9%

상호
Text

Distinct: 46
Distinct (%): 97.9%
Missing: 0
Missing (%): 0.0%
Memory size: 508.0 B

Length

Max length: 26
Median length: 18
Mean length: 9.5319149
Min length: 4

Characters and Unicode

Total characters: 448
Distinct characters: 112
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
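
As an illustration of those character properties, the sketch below classifies the characters of one sample 상호 value with Python's `unicodedata` module. The standard library does not expose the Unicode Script property, so the Hangul count relies on the character-name prefix as a stand-in. That choice is an assumption of this example and says nothing about how the profiler itself works.

```python
import unicodedata
from collections import Counter

# One of the sample 상호 values shown in this report, normalized to NFC so
# that each Hangul syllable is a single precomposed code point.
name = unicodedata.normalize("NFC", "(주)에너비즈 명품주유소")

# General category code per character:
# Lo = Other Letter, Ps = Open Punctuation,
# Pe = Close Punctuation, Zs = Space Separator.
category_counts = Counter(unicodedata.category(ch) for ch in name)

# Stand-in for the script property: count characters whose Unicode name
# starts with "HANGUL".
hangul_count = sum(unicodedata.name(ch, "").startswith("HANGUL") for ch in name)

print(category_counts, hangul_count)
```

For this 13-character value, 10 characters are Hangul letters. The other three are the two parentheses and the space.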

Unique

Unique: 45
Unique (%): 95.7%
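
The gap between Distinct (46) and Unique (45) is consistent with one name occurring twice, on the reading that "distinct" counts different values and "unique" counts values that occur exactly once. The sketch below shows the two counts on the seven 상호 values from the last rows of the Sample section. It covers only that subset, not the full column.

```python
from collections import Counter

# 상호 values from the last seven sample rows of this report.
names = [
    "경동석유", "이현석유", "용일석유", "경동석유",
    "충남석유", "대성석유 에너지상사", "구봉석유",
]

counts = Counter(names)
n_distinct = len(counts)                                # different values
n_unique = sum(1 for c in counts.values() if c == 1)    # values seen exactly once
repeated = [value for value, c in counts.items() if c > 1]

print(n_distinct, n_unique, repeated)
```

In this subset, 경동석유 appears twice, giving 6 distinct and 5 unique values out of 7 rows.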

Sample

1st row: 우리하나에너지(주)우리하나셀프주유소
2nd row: 지에스칼텍스(주) 구봉산셀프주유소
3rd row: 씨앤에스유통(주) 구도일주유소 도안신도시
4th row: (주)덕성에너지 대전지점
5th row: (주)에너비즈 명품주유소

Value | Count | Frequency (%)
경동석유 | 2 | 3.3%
둔산주유소 | 2 | 3.3%
주)에너비즈 | 2 | 3.3%
코끼리셀프주유소 | 1 | 1.6%
괴정주유소 | 1 | 1.6%
대성석유 | 1 | 1.6%
에너지상사 | 1 | 1.6%
갈마주유소 | 1 | 1.6%
주)탄방주유소 | 1 | 1.6%
우리주유소 | 1 | 1.6%
Other values (48) | 48 | 78.7%

Most occurring characters

ValueCountFrequency (%)
60
 
13.4%
53
 
11.8%
38
 
8.5%
( 20
 
4.5%
) 20
 
4.5%
14
 
3.1%
14
 
3.1%
11
 
2.5%
10
 
2.2%
10
 
2.2%
Other values (102) 198
44.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 392 | 87.5%
Open Punctuation | 20 | 4.5%
Close Punctuation | 20 | 4.5%
Space Separator | 14 | 3.1%
Uppercase Letter | 2 | 0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
60
 
15.3%
53
 
13.5%
38
 
9.7%
14
 
3.6%
11
 
2.8%
10
 
2.6%
10
 
2.6%
9
 
2.3%
9
 
2.3%
8
 
2.0%
Other values (97) 170
43.4%
Uppercase Letter
Value | Count | Frequency (%)
K | 1 | 50.0%
H | 1 | 50.0%
Open Punctuation
Value | Count | Frequency (%)
( | 20 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 20 | 100.0%
Space Separator
Value | Count | Frequency (%)
(space) | 14 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 392 | 87.5%
Common | 54 | 12.1%
Latin | 2 | 0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
60
 
15.3%
53
 
13.5%
38
 
9.7%
14
 
3.6%
11
 
2.8%
10
 
2.6%
10
 
2.6%
9
 
2.3%
9
 
2.3%
8
 
2.0%
Other values (97) 170
43.4%
Common
Value | Count | Frequency (%)
( | 20 | 37.0%
) | 20 | 37.0%
(space) | 14 | 25.9%
Latin
Value | Count | Frequency (%)
K | 1 | 50.0%
H | 1 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 392 | 87.5%
ASCII | 56 | 12.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
60
 
15.3%
53
 
13.5%
38
 
9.7%
14
 
3.6%
11
 
2.8%
10
 
2.6%
10
 
2.6%
9
 
2.3%
9
 
2.3%
8
 
2.0%
Other values (97) 170
43.4%
ASCII
Value | Count | Frequency (%)
( | 20 | 35.7%
) | 20 | 35.7%
(space) | 14 | 25.0%
K | 1 | 1.8%
H | 1 | 1.8%

영업소소재지(도로명)
Text

Distinct: 47
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 508.0 B

Length

Max length: 37
Median length: 29
Mean length: 23.510638
Min length: 20

Characters and Unicode

Total characters: 1105
Distinct characters: 79
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 47
Unique (%): 100.0%

Sample

1st row: 대전광역시 서구 동서대로 1120 (변동)
2nd row: 대전광역시 서구 구봉로 208 (관저동)
3rd row: 대전광역시 서구 도안동로 277 (도안동)
4th row: 대전광역시 서구 배재로 205 (도마동,417번지)
5th row: 대전광역시 서구 배재로 218 (도마동)

Value | Count | Frequency (%)
대전광역시 | 47 | 19.8%
서구 | 47 | 19.8%
계백로 | 8 | 3.4%
정림동 | 6 | 2.5%
도마동 | 6 | 2.5%
동서대로 | 5 | 2.1%
변동 | 5 | 2.1%
도산로 | 5 | 2.1%
갈마동 | 3 | 1.3%
둔산동 | 3 | 1.3%
Other values (88) | 102 | 43.0%

Most occurring characters

ValueCountFrequency (%)
190
17.2%
57
 
5.2%
54
 
4.9%
52
 
4.7%
48
 
4.3%
47
 
4.3%
47
 
4.3%
47
 
4.3%
47
 
4.3%
( 47
 
4.3%
Other values (69) 469
42.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 627 | 56.7%
Space Separator | 190 | 17.2%
Decimal Number | 179 | 16.2%
Open Punctuation | 47 | 4.3%
Close Punctuation | 47 | 4.3%
Other Punctuation | 8 | 0.7%
Dash Punctuation | 5 | 0.5%
Uppercase Letter | 2 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
9.1%
54
 
8.6%
52
 
8.3%
48
 
7.7%
47
 
7.5%
47
 
7.5%
47
 
7.5%
47
 
7.5%
41
 
6.5%
19
 
3.0%
Other values (52) 168
26.8%
Decimal Number
Value | Count | Frequency (%)
1 | 41 | 22.9%
3 | 24 | 13.4%
2 | 23 | 12.8%
8 | 15 | 8.4%
4 | 15 | 8.4%
7 | 14 | 7.8%
6 | 13 | 7.3%
0 | 13 | 7.3%
9 | 11 | 6.1%
5 | 10 | 5.6%
Uppercase Letter
Value | Count | Frequency (%)
K | 1 | 50.0%
S | 1 | 50.0%
Space Separator
Value | Count | Frequency (%)
(space) | 190 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 47 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 47 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
, | 8 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 5 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 627 | 56.7%
Common | 476 | 43.1%
Latin | 2 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
9.1%
54
 
8.6%
52
 
8.3%
48
 
7.7%
47
 
7.5%
47
 
7.5%
47
 
7.5%
47
 
7.5%
41
 
6.5%
19
 
3.0%
Other values (52) 168
26.8%
Common
Value | Count | Frequency (%)
(space) | 190 | 39.9%
( | 47 | 9.9%
) | 47 | 9.9%
1 | 41 | 8.6%
3 | 24 | 5.0%
2 | 23 | 4.8%
8 | 15 | 3.2%
4 | 15 | 3.2%
7 | 14 | 2.9%
6 | 13 | 2.7%
Other values (5) | 47 | 9.9%
Latin
Value | Count | Frequency (%)
K | 1 | 50.0%
S | 1 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 627 | 56.7%
ASCII | 478 | 43.3%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 190 | 39.7%
( | 47 | 9.8%
) | 47 | 9.8%
1 | 41 | 8.6%
3 | 24 | 5.0%
2 | 23 | 4.8%
8 | 15 | 3.1%
4 | 15 | 3.1%
7 | 14 | 2.9%
6 | 13 | 2.7%
Other values (7) | 49 | 10.3%
Hangul
ValueCountFrequency (%)
57
 
9.1%
54
 
8.6%
52
 
8.3%
48
 
7.7%
47
 
7.5%
47
 
7.5%
47
 
7.5%
47
 
7.5%
41
 
6.5%
19
 
3.0%
Other values (52) 168
26.8%

Interactions


Correlations

Variable | 연번 | 사업구분 | 상호 | 영업소소재지(도로명)
연번 | 1.000 | 0.984 | 0.924 | 1.000
사업구분 | 0.984 | 1.000 | 1.000 | 1.000
상호 | 0.924 | 1.000 | 1.000 | 1.000
영업소소재지(도로명) | 1.000 | 1.000 | 1.000 | 1.000

Variable | 연번 | 사업구분
연번 | 1.000 | 0.807
사업구분 | 0.807 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 연번 | 사업구분 | 상호 | 영업소소재지(도로명)
0 | 1 | 주유소 | 우리하나에너지(주)우리하나셀프주유소 | 대전광역시 서구 동서대로 1120 (변동)
1 | 2 | 주유소 | 지에스칼텍스(주) 구봉산셀프주유소 | 대전광역시 서구 구봉로 208 (관저동)
2 | 3 | 주유소 | 씨앤에스유통(주) 구도일주유소 도안신도시 | 대전광역시 서구 도안동로 277 (도안동)
3 | 4 | 주유소 | (주)덕성에너지 대전지점 | 대전광역시 서구 배재로 205 (도마동,417번지)
4 | 5 | 주유소 | (주)에너비즈 명품주유소 | 대전광역시 서구 배재로 218 (도마동)
5 | 6 | 주유소 | 디에스주유소 | 대전광역시 서구 배재로 210 (도마동,14-20)
6 | 7 | 주유소 | 씨앤에스유통(주)호수공원주유소 | 대전광역시 서구 계백로 1259 (정림동)
7 | 8 | 주유소 | 그린주유소(주) | 대전광역시 서구 동서대로 953 (내동)
8 | 9 | 주유소 | 황금주유소 | 대전광역시 서구 동서대로 1081 (내동)
9 | 10 | 주유소 | 혜천셀프주유소 | 대전광역시 서구 혜천로 28 (정림동)

Last rows

Index | 연번 | 사업구분 | 상호 | 영업소소재지(도로명)
37 | 39 | 주유소 | 중동주유소 | 대전광역시 서구 계룡로 344 (갈마동)
38 | 40 | 주유소 | 지에스칼텍스(주)정림주유소 | 대전광역시 서구 계백로 1277 (정림동)
39 | 41 | 주유소 | 중도석유(주)도마동주유소 | 대전광역시 서구 계백로 1371 (도마동)
40 | 43 | 일반판매소 | 경동석유 | 대전광역시 서구 계백로1307번길 23 (정림동)
41 | 44 | 일반판매소 | 이현석유 | 대전광역시 서구 흑석4길 2 (흑석동)
42 | 45 | 일반판매소 | 용일석유 | 대전광역시 서구 조달청길 174 (도마동)
43 | 46 | 일반판매소 | 경동석유 | 대전광역시 서구 대신3길 6 (도마동)
44 | 47 | 일반판매소 | 충남석유 | 대전광역시 서구 도마5길 33 (도마동)
45 | 48 | 일반판매소 | 대성석유 에너지상사 | 대전광역시 서구 벌곡로1349번길 39 (가수원동)
46 | 49 | 일반판매소 | 구봉석유 | 대전광역시 서구 갈마로85번길 35 (갈마동)