Overview

Dataset statistics

Number of variables4
Number of observations44
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory36.0 B

Variable types

Numeric1
Categorical1
Text2

Dataset

Description대전광역시 중구 관내에 위치한 석유판매업에 대한 데이터입니다. 사업구분, 상호, 영업소 소재지(도로명)를 제공합니다.
URLhttps://www.data.go.kr/data/15035689/fileData.do

Alerts

연번 is highly overall correlated with 사업구분High correlation
사업구분 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
상호 has unique valuesUnique
영업소소재지(도로명) has unique valuesUnique

Reproduction

Analysis started2023-12-12 03:42:01.427767
Analysis finished2023-12-12 03:42:01.983182
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct44
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24.545455
Minimum1
Maximum47
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size528.0 B
2023-12-12T12:42:02.518441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.15
Q113.75
median24.5
Q336.25
95-th percentile44.85
Maximum47
Range46
Interquartile range (IQR)22.5

Descriptive statistics

Standard deviation13.601393
Coefficient of variation (CV)0.55413082
Kurtosis-1.1298828
Mean24.545455
Median Absolute Deviation (MAD)11.5
Skewness-0.039661832
Sum1080
Variance184.99789
MonotonicityStrictly increasing
2023-12-12T12:42:02.725301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=44)
ValueCountFrequency (%)
1 1
 
2.3%
26 1
 
2.3%
28 1
 
2.3%
29 1
 
2.3%
30 1
 
2.3%
31 1
 
2.3%
32 1
 
2.3%
33 1
 
2.3%
35 1
 
2.3%
36 1
 
2.3%
Other values (34) 34
77.3%
ValueCountFrequency (%)
1 1
2.3%
2 1
2.3%
3 1
2.3%
4 1
2.3%
5 1
2.3%
7 1
2.3%
9 1
2.3%
10 1
2.3%
11 1
2.3%
12 1
2.3%
ValueCountFrequency (%)
47 1
2.3%
46 1
2.3%
45 1
2.3%
44 1
2.3%
43 1
2.3%
42 1
2.3%
41 1
2.3%
40 1
2.3%
39 1
2.3%
38 1
2.3%

사업구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.5%
Missing0
Missing (%)0.0%
Memory size484.0 B
석유판매업(주유소)
31 
석유판매업(일반판매소)
13 

Length

Max length12
Median length10
Mean length10.590909
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row석유판매업(주유소)
2nd row석유판매업(주유소)
3rd row석유판매업(주유소)
4th row석유판매업(주유소)
5th row석유판매업(주유소)

Common Values

ValueCountFrequency (%)
석유판매업(주유소) 31
70.5%
석유판매업(일반판매소) 13
29.5%

Length

2023-12-12T12:42:02.961342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:42:03.148755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
석유판매업(주유소 31
70.5%
석유판매업(일반판매소 13
29.5%

상호
Text

UNIQUE 

Distinct44
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size484.0 B
2023-12-12T12:42:03.559376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length7.2954545
Min length4

Characters and Unicode

Total characters321
Distinct characters96
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)100.0%

Sample

1st row에이치디현대오일뱅크㈜ 중앙로셀프주유소
2nd row진성주유소
3rd row충남주유소
4th row금성주유소
5th row동건셀프주유소
ValueCountFrequency (%)
하나로주유소 2
 
3.8%
에이치디현대오일뱅크㈜ 1
 
1.9%
sk청정주유소 1
 
1.9%
소망주유소 1
 
1.9%
충남석유판매소 1
 
1.9%
동물원주유소 1
 
1.9%
오월드주유소 1
 
1.9%
명품주유소 1
 
1.9%
㈜제트에너지 1
 
1.9%
동서로점 1
 
1.9%
Other values (41) 41
78.8%
2023-12-12T12:42:04.159853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
 
13.1%
38
 
11.8%
34
 
10.6%
11
 
3.4%
8
 
2.5%
6
 
1.9%
6
 
1.9%
6
 
1.9%
6
 
1.9%
5
 
1.6%
Other values (86) 159
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 297
92.5%
Space Separator 8
 
2.5%
Open Punctuation 4
 
1.2%
Close Punctuation 4
 
1.2%
Other Symbol 4
 
1.2%
Uppercase Letter 4
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
14.1%
38
 
12.8%
34
 
11.4%
11
 
3.7%
6
 
2.0%
6
 
2.0%
6
 
2.0%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (78) 138
46.5%
Uppercase Letter
ValueCountFrequency (%)
I 1
25.0%
C 1
25.0%
S 1
25.0%
K 1
25.0%
Space Separator
ValueCountFrequency (%)
8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 301
93.8%
Common 16
 
5.0%
Latin 4
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
14.0%
38
 
12.6%
34
 
11.3%
11
 
3.7%
6
 
2.0%
6
 
2.0%
6
 
2.0%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (79) 142
47.2%
Latin
ValueCountFrequency (%)
I 1
25.0%
C 1
25.0%
S 1
25.0%
K 1
25.0%
Common
ValueCountFrequency (%)
8
50.0%
( 4
25.0%
) 4
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 297
92.5%
ASCII 20
 
6.2%
None 4
 
1.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
 
14.1%
38
 
12.8%
34
 
11.4%
11
 
3.7%
6
 
2.0%
6
 
2.0%
6
 
2.0%
6
 
2.0%
5
 
1.7%
5
 
1.7%
Other values (78) 138
46.5%
ASCII
ValueCountFrequency (%)
8
40.0%
( 4
20.0%
) 4
20.0%
I 1
 
5.0%
C 1
 
5.0%
S 1
 
5.0%
K 1
 
5.0%
None
ValueCountFrequency (%)
4
100.0%
Distinct44
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size484.0 B
2023-12-12T12:42:04.598433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length27
Mean length23.363636
Min length21

Characters and Unicode

Total characters1028
Distinct characters72
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)100.0%

Sample

1st row대전광역시 중구 중앙로 64 (대흥동)
2nd row대전광역시 중구 대종로 122 (호동)
3rd row대전광역시 중구 대흥로 80 (대흥동)
4th row대전광역시 중구 동서대로 1183 (태평동)
5th row대전광역시 중구 문화로 97 (유천동)
ValueCountFrequency (%)
대전광역시 44
20.0%
중구 44
20.0%
대종로 6
 
2.7%
대둔산로 6
 
2.7%
안영동 5
 
2.3%
태평동 5
 
2.3%
동서대로 4
 
1.8%
유천동 4
 
1.8%
오류동 3
 
1.4%
문화동 3
 
1.4%
Other values (76) 96
43.6%
2023-12-12T12:42:05.202212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
176
17.1%
71
 
6.9%
52
 
5.1%
47
 
4.6%
45
 
4.4%
( 44
 
4.3%
44
 
4.3%
44
 
4.3%
44
 
4.3%
44
 
4.3%
Other values (62) 417
40.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 615
59.8%
Space Separator 176
 
17.1%
Decimal Number 148
 
14.4%
Open Punctuation 44
 
4.3%
Close Punctuation 44
 
4.3%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
71
11.5%
52
 
8.5%
47
 
7.6%
45
 
7.3%
44
 
7.2%
44
 
7.2%
44
 
7.2%
44
 
7.2%
44
 
7.2%
15
 
2.4%
Other values (48) 165
26.8%
Decimal Number
ValueCountFrequency (%)
1 32
21.6%
3 18
12.2%
5 16
10.8%
2 16
10.8%
4 15
10.1%
0 13
8.8%
7 11
 
7.4%
8 11
 
7.4%
9 8
 
5.4%
6 8
 
5.4%
Space Separator
ValueCountFrequency (%)
176
100.0%
Open Punctuation
ValueCountFrequency (%)
( 44
100.0%
Close Punctuation
ValueCountFrequency (%)
) 44
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 615
59.8%
Common 413
40.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
71
11.5%
52
 
8.5%
47
 
7.6%
45
 
7.3%
44
 
7.2%
44
 
7.2%
44
 
7.2%
44
 
7.2%
44
 
7.2%
15
 
2.4%
Other values (48) 165
26.8%
Common
ValueCountFrequency (%)
176
42.6%
( 44
 
10.7%
) 44
 
10.7%
1 32
 
7.7%
3 18
 
4.4%
5 16
 
3.9%
2 16
 
3.9%
4 15
 
3.6%
0 13
 
3.1%
7 11
 
2.7%
Other values (4) 28
 
6.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 615
59.8%
ASCII 413
40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
176
42.6%
( 44
 
10.7%
) 44
 
10.7%
1 32
 
7.7%
3 18
 
4.4%
5 16
 
3.9%
2 16
 
3.9%
4 15
 
3.6%
0 13
 
3.1%
7 11
 
2.7%
Other values (4) 28
 
6.8%
Hangul
ValueCountFrequency (%)
71
11.5%
52
 
8.5%
47
 
7.6%
45
 
7.3%
44
 
7.2%
44
 
7.2%
44
 
7.2%
44
 
7.2%
44
 
7.2%
15
 
2.4%
Other values (48) 165
26.8%

Interactions

2023-12-12T12:42:01.666757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:42:05.356931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업구분상호영업소소재지(도로명)
연번1.0001.0001.0001.000
사업구분1.0001.0001.0001.000
상호1.0001.0001.0001.000
영업소소재지(도로명)1.0001.0001.0001.000
2023-12-12T12:42:05.473764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사업구분
연번1.0000.900
사업구분0.9001.000

Missing values

2023-12-12T12:42:01.808028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:42:01.930556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업구분상호영업소소재지(도로명)
01석유판매업(주유소)에이치디현대오일뱅크㈜ 중앙로셀프주유소대전광역시 중구 중앙로 64 (대흥동)
12석유판매업(주유소)진성주유소대전광역시 중구 대종로 122 (호동)
23석유판매업(주유소)충남주유소대전광역시 중구 대흥로 80 (대흥동)
34석유판매업(주유소)금성주유소대전광역시 중구 동서대로 1183 (태평동)
45석유판매업(주유소)동건셀프주유소대전광역시 중구 문화로 97 (유천동)
57석유판매업(주유소)(유)리치주유소대전광역시 중구 보문로 117 (대사동)
69석유판매업(주유소)옥계셀프주유소대전광역시 중구 대종로 103 (옥계동)
710석유판매업(주유소)(주)중촌주유소대전광역시 중구 동서대로 1434 (선화동)
811석유판매업(주유소)이화주유소대전광역시 중구 대종로 166 (석교동)
912석유판매업(주유소)천지인주유소대전광역시 중구 대종로 44 (옥계동)
연번사업구분상호영업소소재지(도로명)
3438석유판매업(일반판매소)동아석유대전광역시 중구 돌다리로 29 (석교동)
3539석유판매업(일반판매소)형제석유판매소대전광역시 중구 모암로13번길 1 (호동)
3640석유판매업(일반판매소)서대전농업협동조합석유판매소대전광역시 중구 산서로 384 (목달동)
3741석유판매업(일반판매소)충대에너지대전광역시 중구 천근로 41-2 (문화동)
3842석유판매업(일반판매소)금강석유대전광역시 중구 대둔산로466번길 23 (산성동)
3943석유판매업(일반판매소)그린에너지대전광역시 중구 유천로18번길 20 (유천동)
4044석유판매업(일반판매소)대전석유대전광역시 중구 어덕마을로10번길 30 (용두동)
4145석유판매업(일반판매소)한국석유판매소대전광역시 중구 문화로 135 (유천동)
4246석유판매업(일반판매소)재일석유판매소대전광역시 중구 보문로162번길 44 (대사동)
4347석유판매업(일반판매소)원 오일대전광역시 중구 동서대로1185번길 3 (태평동)