Overview

Dataset statistics

Number of variables5
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.1 KiB
Average record size in memory42.3 B

Variable types

Categorical2
Text2
Numeric1

Dataset

Description서산시내 석유판매업소(주유소) 등록현황으로 상호명, 주소, 상표, 전화번호, 셀프여부에 대한 정보를 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=447&beforeMenuCd=DOM_000000201001001000&publicdatapk=3080555

Alerts

데이터기준일 has constant value ""Constant
주유기수 is highly overall correlated with 사업구분High correlation
사업구분 is highly overall correlated with 주유기수High correlation
사업구분 is highly imbalanced (59.8%)Imbalance
상호 has unique valuesUnique
영업 소재지 has unique valuesUnique

Reproduction

Analysis started2024-01-09 20:56:02.300320
Analysis finished2024-01-09 20:56:02.742843
Duration0.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
주유소
92 
일반판매소
 
8

Length

Max length5
Median length3
Mean length3.16
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반판매소
2nd row일반판매소
3rd row일반판매소
4th row일반판매소
5th row일반판매소

Common Values

ValueCountFrequency (%)
주유소 92
92.0%
일반판매소 8
 
8.0%

Length

2024-01-10T05:56:02.808808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:56:02.897075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주유소 92
92.0%
일반판매소 8
 
8.0%

상호
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2024-01-10T05:56:03.122391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length17
Mean length6.75
Min length4

Characters and Unicode

Total characters675
Distinct characters149
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row음암농협
2nd row대영석유상사
3rd row대산석유
4th row현대석유판매
5th row그린에너지
ValueCountFrequency (%)
현대오일뱅크(주)직영 3
 
2.8%
음암농협 1
 
0.9%
대영석유상사 1
 
0.9%
죽성주유소 1
 
0.9%
장등주유소 1
 
0.9%
인지주유소 1
 
0.9%
음암주유소 1
 
0.9%
음암농협주유소 1
 
0.9%
운산주유소 1
 
0.9%
운산농협주유소 1
 
0.9%
Other values (94) 94
88.7%
2024-01-10T05:56:03.504123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
102
 
15.1%
93
 
13.8%
90
 
13.3%
21
 
3.1%
18
 
2.7%
18
 
2.7%
( 15
 
2.2%
) 15
 
2.2%
11
 
1.6%
11
 
1.6%
Other values (139) 281
41.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 630
93.3%
Open Punctuation 15
 
2.2%
Close Punctuation 15
 
2.2%
Space Separator 6
 
0.9%
Uppercase Letter 6
 
0.9%
Decimal Number 2
 
0.3%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
102
16.2%
93
 
14.8%
90
 
14.3%
21
 
3.3%
18
 
2.9%
18
 
2.9%
11
 
1.7%
11
 
1.7%
11
 
1.7%
8
 
1.3%
Other values (130) 247
39.2%
Uppercase Letter
ValueCountFrequency (%)
C 2
33.3%
I 2
33.3%
K 1
16.7%
S 1
16.7%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Decimal Number
ValueCountFrequency (%)
2 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 631
93.5%
Common 38
 
5.6%
Latin 6
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
102
16.2%
93
 
14.7%
90
 
14.3%
21
 
3.3%
18
 
2.9%
18
 
2.9%
11
 
1.7%
11
 
1.7%
11
 
1.7%
8
 
1.3%
Other values (131) 248
39.3%
Common
ValueCountFrequency (%)
( 15
39.5%
) 15
39.5%
6
 
15.8%
2 2
 
5.3%
Latin
ValueCountFrequency (%)
C 2
33.3%
I 2
33.3%
K 1
16.7%
S 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 630
93.3%
ASCII 44
 
6.5%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
102
16.2%
93
 
14.8%
90
 
14.3%
21
 
3.3%
18
 
2.9%
18
 
2.9%
11
 
1.7%
11
 
1.7%
11
 
1.7%
8
 
1.3%
Other values (130) 247
39.2%
ASCII
ValueCountFrequency (%)
( 15
34.1%
) 15
34.1%
6
 
13.6%
C 2
 
4.5%
I 2
 
4.5%
2 2
 
4.5%
K 1
 
2.3%
S 1
 
2.3%
None
ValueCountFrequency (%)
1
100.0%

영업 소재지
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2024-01-10T05:56:03.839339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length29
Mean length25.87
Min length20

Characters and Unicode

Total characters2587
Distinct characters97
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row충청남도 서산시 음암면 도당리 1296 번지1 호
2nd row충청남도 서산시 석남동 22 번지3 호
3rd row충청남도 서산시 대산읍 대산리 728 번지1 호
4th row충청남도 서산시 읍내동 327 번지11 호
5th row충청남도 서산시 고북면 신송리 787 번지2 호
ValueCountFrequency (%)
충청남도 100
 
15.4%
서산시 100
 
15.4%
73
 
11.2%
번지 21
 
3.2%
번지1 15
 
2.3%
음암면 13
 
2.0%
대산읍 13
 
2.0%
번지3 13
 
2.0%
번지2 9
 
1.4%
부석면 8
 
1.2%
Other values (179) 284
43.8%
2024-01-10T05:56:04.292898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
680
26.3%
137
 
5.3%
112
 
4.3%
104
 
4.0%
104
 
4.0%
100
 
3.9%
100
 
3.9%
100
 
3.9%
100
 
3.9%
99
 
3.8%
Other values (87) 951
36.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1502
58.1%
Space Separator 680
26.3%
Decimal Number 396
 
15.3%
Dash Punctuation 8
 
0.3%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
137
 
9.1%
112
 
7.5%
104
 
6.9%
104
 
6.9%
100
 
6.7%
100
 
6.7%
100
 
6.7%
100
 
6.7%
99
 
6.6%
76
 
5.1%
Other values (74) 470
31.3%
Decimal Number
ValueCountFrequency (%)
1 74
18.7%
2 49
12.4%
6 46
11.6%
3 45
11.4%
4 41
10.4%
5 39
9.8%
7 34
8.6%
8 29
 
7.3%
9 23
 
5.8%
0 16
 
4.0%
Space Separator
ValueCountFrequency (%)
680
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1502
58.1%
Common 1085
41.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
137
 
9.1%
112
 
7.5%
104
 
6.9%
104
 
6.9%
100
 
6.7%
100
 
6.7%
100
 
6.7%
100
 
6.7%
99
 
6.6%
76
 
5.1%
Other values (74) 470
31.3%
Common
ValueCountFrequency (%)
680
62.7%
1 74
 
6.8%
2 49
 
4.5%
6 46
 
4.2%
3 45
 
4.1%
4 41
 
3.8%
5 39
 
3.6%
7 34
 
3.1%
8 29
 
2.7%
9 23
 
2.1%
Other values (3) 25
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1502
58.1%
ASCII 1085
41.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
680
62.7%
1 74
 
6.8%
2 49
 
4.5%
6 46
 
4.2%
3 45
 
4.1%
4 41
 
3.8%
5 39
 
3.6%
7 34
 
3.1%
8 29
 
2.7%
9 23
 
2.1%
Other values (3) 25
 
2.3%
Hangul
ValueCountFrequency (%)
137
 
9.1%
112
 
7.5%
104
 
6.9%
104
 
6.9%
100
 
6.7%
100
 
6.7%
100
 
6.7%
100
 
6.7%
99
 
6.6%
76
 
5.1%
Other values (74) 470
31.3%

주유기수
Real number (ℝ)

HIGH CORRELATION 

Distinct18
Distinct (%)18.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.13
Minimum2
Maximum26
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-01-10T05:56:04.417711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2
Q16
median10
Q314
95-th percentile20
Maximum26
Range24
Interquartile range (IQR)8

Descriptive statistics

Standard deviation5.0425563
Coefficient of variation (CV)0.49778443
Kurtosis0.16721429
Mean10.13
Median Absolute Deviation (MAD)4
Skewness0.54864446
Sum1013
Variance25.427374
MonotonicityNot monotonic
2024-01-10T05:56:04.521065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%)
10 14
14.0%
6 14
14.0%
14 12
12.0%
12 8
8.0%
2 7
 
7.0%
7 6
 
6.0%
20 5
 
5.0%
8 5
 
5.0%
5 5
 
5.0%
9 5
 
5.0%
Other values (8) 19
19.0%
ValueCountFrequency (%)
2 7
7.0%
3 3
 
3.0%
5 5
 
5.0%
6 14
14.0%
7 6
6.0%
8 5
 
5.0%
9 5
 
5.0%
10 14
14.0%
11 4
 
4.0%
12 8
8.0%
ValueCountFrequency (%)
26 1
 
1.0%
22 1
 
1.0%
20 5
5.0%
18 3
 
3.0%
16 3
 
3.0%
15 2
 
2.0%
14 12
12.0%
13 2
 
2.0%
12 8
8.0%
11 4
 
4.0%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2016-03-15
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2016-03-15
2nd row2016-03-15
3rd row2016-03-15
4th row2016-03-15
5th row2016-03-15

Common Values

ValueCountFrequency (%)
2016-03-15 100
100.0%

Length

2024-01-10T05:56:04.623941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:56:04.704864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2016-03-15 100
100.0%

Interactions

2024-01-10T05:56:02.546660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:56:04.762505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업구분상호영업 소재지주유기수
사업구분1.0001.0001.0000.978
상호1.0001.0001.0001.000
영업 소재지1.0001.0001.0001.000
주유기수0.9781.0001.0001.000
2024-01-10T05:56:04.846815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주유기수사업구분
주유기수1.0000.836
사업구분0.8361.000

Missing values

2024-01-10T05:56:02.632066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:56:02.710506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업구분상호영업 소재지주유기수데이터기준일
0일반판매소음암농협충청남도 서산시 음암면 도당리 1296 번지1 호32016-03-15
1일반판매소대영석유상사충청남도 서산시 석남동 22 번지3 호22016-03-15
2일반판매소대산석유충청남도 서산시 대산읍 대산리 728 번지1 호22016-03-15
3일반판매소현대석유판매충청남도 서산시 읍내동 327 번지11 호22016-03-15
4일반판매소그린에너지충청남도 서산시 고북면 신송리 787 번지2 호22016-03-15
5일반판매소대성에너지충청남도 서산시 고북면 신상리 40 번지3 호22016-03-15
6일반판매소대신석유충청남도 서산시 오남동 159 번지7 호22016-03-15
7일반판매소장군에너지충청남도 서산시 부석면 취평리 407 번지7 호22016-03-15
8주유소서정주유소충청남도 서산시 운산면 갈산리 768 번지3 호32016-03-15
9주유소(주)당진엘피지 대산에너지충청남도 서산시 대산읍 영탑리 463 번지13 호102016-03-15
사업구분상호영업 소재지주유기수데이터기준일
90주유소행복한주유소충청남도 서산시 인지면 둔당리 49 번지2 호152016-03-15
91주유소현대오일뱅크(주)직영 대산주유소충청남도 서산시 대산읍 화곡리 808 번지1 호132016-03-15
92주유소현대오일뱅크(주)직영 서강주유소충청남도 서산시 잠홍동 745외2필지 번지142016-03-15
93주유소현대오일뱅크(주)직영 황금산주유소충청남도 서산시 대산읍 독곳리 69 번지92016-03-15
94주유소현대주유소충청남도 서산시 음암면 도당리 1552 번지1 호82016-03-15
95주유소형제주유소충청남도 서산시 음암면 탑곡리 212 번지4 호72016-03-15
96주유소희망주유소충청남도 서산시 갈산동 157 번지11 호142016-03-15
97주유소예천동주유소충청남도 서산시 예천동 621-2번지142016-03-15
98주유소하늘빛주유소충청남도 서산시 온석동 344-1번지102016-03-15
99주유소오토밸리㈜충청남도 서산시 지곡면 무장산업로 183262016-03-15