Overview

Dataset statistics

Number of variables4
Number of observations100
Missing cells6
Missing cells (%)1.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.4 KiB
Average record size in memory34.3 B

Variable types

Numeric1
Text3

Dataset

Description경상북도 구미시에 등록된 주유소 현황 데이터로 주유소의 상호명, 도로명주소, 전화번호등의 데이터를 제공하고 있습니다.
Author경상북도 구미시
URLhttps://www.data.go.kr/data/3069586/fileData.do

Alerts

전화번호 has 5 (5.0%) missing valuesMissing
연번 has unique valuesUnique
상호명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:30:55.201760
Analysis finished2023-12-12 12:30:55.928221
Duration0.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T21:30:56.032487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-12T21:30:56.220345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

상호명
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-12T21:30:56.522493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length18
Mean length7.33
Min length5

Characters and Unicode

Total characters733
Distinct characters158
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row해평농협주유소
2nd row(주)지엠옥계주유소
3rd row(주)도개휴게주유소
4th row4공단주유소
5th row드림2주유소
ValueCountFrequency (%)
선산산업(주 2
 
1.9%
선산주유소 2
 
1.9%
주유소 2
 
1.9%
주)지엠옥계주유소 1
 
0.9%
광평세종주유소 1
 
0.9%
황상동양주유소 1
 
0.9%
정한주유소 1
 
0.9%
인동명품주유소 1
 
0.9%
제이엠8주유소 1
 
0.9%
원남주유소 1
 
0.9%
Other values (95) 95
88.0%
2023-12-12T21:30:56.979003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
115
 
15.7%
104
 
14.2%
101
 
13.8%
16
 
2.2%
( 15
 
2.0%
15
 
2.0%
) 15
 
2.0%
14
 
1.9%
14
 
1.9%
11
 
1.5%
Other values (148) 313
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 675
92.1%
Open Punctuation 15
 
2.0%
Close Punctuation 15
 
2.0%
Uppercase Letter 9
 
1.2%
Space Separator 8
 
1.1%
Decimal Number 7
 
1.0%
Lowercase Letter 3
 
0.4%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
115
17.0%
104
 
15.4%
101
 
15.0%
16
 
2.4%
15
 
2.2%
14
 
2.1%
14
 
2.1%
11
 
1.6%
10
 
1.5%
7
 
1.0%
Other values (132) 268
39.7%
Decimal Number
ValueCountFrequency (%)
2 2
28.6%
1 2
28.6%
8 1
14.3%
4 1
14.3%
3 1
14.3%
Uppercase Letter
ValueCountFrequency (%)
K 5
55.6%
S 2
 
22.2%
C 1
 
11.1%
I 1
 
11.1%
Lowercase Letter
ValueCountFrequency (%)
l 1
33.3%
e 1
33.3%
f 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 676
92.2%
Common 45
 
6.1%
Latin 12
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
115
17.0%
104
 
15.4%
101
 
14.9%
16
 
2.4%
15
 
2.2%
14
 
2.1%
14
 
2.1%
11
 
1.6%
10
 
1.5%
7
 
1.0%
Other values (133) 269
39.8%
Common
ValueCountFrequency (%)
( 15
33.3%
) 15
33.3%
8
17.8%
2 2
 
4.4%
1 2
 
4.4%
8 1
 
2.2%
4 1
 
2.2%
3 1
 
2.2%
Latin
ValueCountFrequency (%)
K 5
41.7%
S 2
 
16.7%
l 1
 
8.3%
C 1
 
8.3%
I 1
 
8.3%
e 1
 
8.3%
f 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 675
92.1%
ASCII 57
 
7.8%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
115
17.0%
104
 
15.4%
101
 
15.0%
16
 
2.4%
15
 
2.2%
14
 
2.1%
14
 
2.1%
11
 
1.6%
10
 
1.5%
7
 
1.0%
Other values (132) 268
39.7%
ASCII
ValueCountFrequency (%)
( 15
26.3%
) 15
26.3%
8
14.0%
K 5
 
8.8%
S 2
 
3.5%
2 2
 
3.5%
1 2
 
3.5%
8 1
 
1.8%
l 1
 
1.8%
C 1
 
1.8%
Other values (5) 5
 
8.8%
None
ValueCountFrequency (%)
1
100.0%
Distinct99
Distinct (%)100.0%
Missing1
Missing (%)1.0%
Memory size932.0 B
2023-12-12T21:30:57.322026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length27
Mean length22.333333
Min length20

Characters and Unicode

Total characters2211
Distinct characters106
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique99 ?
Unique (%)100.0%

Sample

1st row경상북도 구미시 해평면 강동로 1613
2nd row경상북도 구미시 산호대로 585 (양호동)
3rd row경상북도 구미시 도개면 낙동대로 3878
4th row경상북도 구미시 옥계신당로 6 (옥계동)
5th row경상북도 구미시 고아읍 선산대로 503
ValueCountFrequency (%)
경상북도 99
20.0%
구미시 99
20.0%
선산대로 15
 
3.0%
고아읍 12
 
2.4%
도량동 10
 
2.0%
야은로 9
 
1.8%
강동로 7
 
1.4%
장천면 7
 
1.4%
오태동 6
 
1.2%
인동가산로 6
 
1.2%
Other values (150) 226
45.6%
2023-12-12T21:30:57.821362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
397
18.0%
114
 
5.2%
106
 
4.8%
105
 
4.7%
103
 
4.7%
99
 
4.5%
99
 
4.5%
99
 
4.5%
98
 
4.4%
93
 
4.2%
Other values (96) 898
40.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1384
62.6%
Space Separator 397
 
18.0%
Decimal Number 306
 
13.8%
Close Punctuation 60
 
2.7%
Open Punctuation 60
 
2.7%
Dash Punctuation 3
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
114
 
8.2%
106
 
7.7%
105
 
7.6%
103
 
7.4%
99
 
7.2%
99
 
7.2%
99
 
7.2%
98
 
7.1%
93
 
6.7%
45
 
3.3%
Other values (81) 423
30.6%
Decimal Number
ValueCountFrequency (%)
1 57
18.6%
2 43
14.1%
3 37
12.1%
5 31
10.1%
6 29
9.5%
8 26
8.5%
9 24
7.8%
0 23
7.5%
4 19
 
6.2%
7 17
 
5.6%
Space Separator
ValueCountFrequency (%)
397
100.0%
Close Punctuation
ValueCountFrequency (%)
) 60
100.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1384
62.6%
Common 827
37.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
114
 
8.2%
106
 
7.7%
105
 
7.6%
103
 
7.4%
99
 
7.2%
99
 
7.2%
99
 
7.2%
98
 
7.1%
93
 
6.7%
45
 
3.3%
Other values (81) 423
30.6%
Common
ValueCountFrequency (%)
397
48.0%
) 60
 
7.3%
( 60
 
7.3%
1 57
 
6.9%
2 43
 
5.2%
3 37
 
4.5%
5 31
 
3.7%
6 29
 
3.5%
8 26
 
3.1%
9 24
 
2.9%
Other values (5) 63
 
7.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1384
62.6%
ASCII 827
37.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
397
48.0%
) 60
 
7.3%
( 60
 
7.3%
1 57
 
6.9%
2 43
 
5.2%
3 37
 
4.5%
5 31
 
3.7%
6 29
 
3.5%
8 26
 
3.1%
9 24
 
2.9%
Other values (5) 63
 
7.6%
Hangul
ValueCountFrequency (%)
114
 
8.2%
106
 
7.7%
105
 
7.6%
103
 
7.4%
99
 
7.2%
99
 
7.2%
99
 
7.2%
98
 
7.1%
93
 
6.7%
45
 
3.3%
Other values (81) 423
30.6%

전화번호
Text

MISSING 

Distinct95
Distinct (%)100.0%
Missing5
Missing (%)5.0%
Memory size932.0 B
2023-12-12T21:30:58.080186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1140
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique95 ?
Unique (%)100.0%

Sample

1st row054-474-4205
2nd row054-471-5959
3rd row054-471-5900
4th row054-463-5189
5th row054-453-8080
ValueCountFrequency (%)
054-474-4205 1
 
1.1%
054-482-6014 1
 
1.1%
054-482-6034 1
 
1.1%
054-452-7424 1
 
1.1%
054-455-9330 1
 
1.1%
054-473-2101 1
 
1.1%
054-482-0077 1
 
1.1%
054-472-0117 1
 
1.1%
054-482-5149 1
 
1.1%
054-458-1114 1
 
1.1%
Other values (85) 85
89.5%
2023-12-12T21:30:58.422236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 246
21.6%
- 190
16.7%
5 187
16.4%
0 166
14.6%
1 76
 
6.7%
2 67
 
5.9%
6 55
 
4.8%
7 49
 
4.3%
8 39
 
3.4%
3 36
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 950
83.3%
Dash Punctuation 190
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 246
25.9%
5 187
19.7%
0 166
17.5%
1 76
 
8.0%
2 67
 
7.1%
6 55
 
5.8%
7 49
 
5.2%
8 39
 
4.1%
3 36
 
3.8%
9 29
 
3.1%
Dash Punctuation
ValueCountFrequency (%)
- 190
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1140
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 246
21.6%
- 190
16.7%
5 187
16.4%
0 166
14.6%
1 76
 
6.7%
2 67
 
5.9%
6 55
 
4.8%
7 49
 
4.3%
8 39
 
3.4%
3 36
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1140
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 246
21.6%
- 190
16.7%
5 187
16.4%
0 166
14.6%
1 76
 
6.7%
2 67
 
5.9%
6 55
 
4.8%
7 49
 
4.3%
8 39
 
3.4%
3 36
 
3.2%

Interactions

2023-12-12T21:30:55.519948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:30:58.515017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상호명도로명주소전화번호
연번1.0001.0001.0001.000
상호명1.0001.0001.0001.000
도로명주소1.0001.0001.0001.000
전화번호1.0001.0001.0001.000

Missing values

2023-12-12T21:30:55.644783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:30:55.778700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T21:30:55.871042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번상호명도로명주소전화번호
01해평농협주유소경상북도 구미시 해평면 강동로 1613054-474-4205
12(주)지엠옥계주유소경상북도 구미시 산호대로 585 (양호동)<NA>
23(주)도개휴게주유소경상북도 구미시 도개면 낙동대로 3878<NA>
344공단주유소경상북도 구미시 옥계신당로 6 (옥계동)<NA>
45드림2주유소경상북도 구미시 고아읍 선산대로 503<NA>
56양포주유소경상북도 구미시 옥계2공단로 548 (옥계동)<NA>
67산동농협주유소경상북도 구미시 장천면 강동로 301054-471-5959
78케이케이(주)KK중앙1주유소경상북도 구미시 장천면 산호대로 1394054-471-5900
89대원석유(주)공단대원주유소경상북도 구미시 비산로 121 (공단동)054-463-5189
910대경주유소경상북도 구미시 고아읍 들성로 269054-453-8080
연번상호명도로명주소전화번호
9091신원셀프주유소경상북도 구미시 야은로 713 (원평동)054-461-5160
9192아리랑주유소경상북도 구미시 금오대로 382 (오태동)054-465-0055
9293성원셀프주유소경상북도 구미시 인동가산로 395 (구평동)054-471-0700
9394구포주유소경상북도 구미시 옥계2공단로 268 (구포동)054-475-0506
9495명문주유소경상북도 구미시 고아읍 선산대로 1075054-481-3100
9596서부주유소경상북도 구미시 선산읍 김선로 979054-481-2428
9697소망주유소경상북도 구미시 고아읍 선산대로 486054-452-5189
9798하나주유소경상북도 구미시 해평면 강동로 1970054-474-5488
9899문수주유소경상북도 구미시 장천면 강동로 640054-471-3281
99100장천주유소경상북도 구미시 장천면 강동로 231054-471-5613