Overview

Dataset statistics

Number of variables: 4
Number of observations: 473
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 15.4 KiB
Average record size in memory: 33.3 B

Variable types

Numeric: 1
Categorical: 1
Text: 2

Dataset

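Description: Data on gas stations located in Seoul. It provides the district (자치구), gas station name, and road-name address of each station.
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://www.data.go.kr/data/15098386/fileData.do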

Alerts

연번 is highly overall correlated with 자치구명 (High correlation)
자치구명 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
주소 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 16:36:40.924299
Analysis finished: 2023-12-12 16:36:41.518296
Duration: 0.59 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
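
A report like this one could be regenerated along the following lines. This is a minimal sketch, not the invocation actually used for this report: the `profile_csv` helper, the CSV file name, the report title, and the `cp949` encoding are assumptions for illustration. `ProfileReport` and its `to_file` method are the public ydata-profiling entry points.

```python
def profile_csv(csv_path, html_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so this module still loads where the
    # third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is illustrative; the source does not state one.
    report = ProfileReport(df, title="Seoul gas stations")
    report.to_file(html_path)
```

Calling `profile_csv("seoul_gas_stations.csv")` (a hypothetical file name) would write `report.html` next to the script, provided pandas and ydata-profiling are installed.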

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 473
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 237
Minimum: 1
Maximum: 473
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 24.6
Q1: 119
Median: 237
Q3: 355
95-th percentile: 449.4
Maximum: 473
Range: 472
Interquartile range (IQR): 236

Descriptive statistics

Standard deviation: 136.6876
Coefficient of variation (CV): 0.57674093
Kurtosis: -1.2
Mean: 237
Median Absolute Deviation (MAD): 118
Skewness: 0
Sum: 112101
Variance: 18683.5
Monotonicity: Strictly increasing
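
The figures above are what one would expect if 연번 is simply the row counter 1 to 473. The sketch below checks that reading with the standard library only. It assumes the values are exactly the integers 1..473, which the minimum, maximum, distinct count, and strict monotonicity suggest but the report does not state outright. The `quantile` helper is written here for illustration and uses linear interpolation.

```python
import math
import statistics

# Assumption: 연번 is the run of integers 1..473.
values = list(range(1, 474))


def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two closest ranks."""
    position = q * (len(sorted_values) - 1)
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    low_value = sorted_values[lower]
    high_value = sorted_values[upper]
    return low_value + fraction * (high_value - low_value)


mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std_dev = statistics.stdev(values)      # square root of the sample variance
median = statistics.median(values)
# Median absolute deviation: median distance from the median.
mad = statistics.median(abs(v - median) for v in values)
cv = std_dev / mean                     # coefficient of variation

summary = {
    "sum": sum(values),
    "mean": mean,
    "variance": variance,
    "std": round(std_dev, 4),
    "cv": round(cv, 6),
    "mad": mad,
    "p5": round(quantile(values, 0.05), 1),
    "q1": quantile(values, 0.25),
    "q3": quantile(values, 0.75),
    "p95": round(quantile(values, 0.95), 1),
}
```

Under that assumption the computed sum, mean, variance, standard deviation, MAD, and percentiles agree with the values listed in the report.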
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.2%
312 | 1 | 0.2%
324 | 1 | 0.2%
323 | 1 | 0.2%
322 | 1 | 0.2%
321 | 1 | 0.2%
320 | 1 | 0.2%
319 | 1 | 0.2%
318 | 1 | 0.2%
317 | 1 | 0.2%
Other values (463) | 463 | 97.9%
Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%
Maximum 10 values
Value | Count | Frequency (%)
473 | 1 | 0.2%
472 | 1 | 0.2%
471 | 1 | 0.2%
470 | 1 | 0.2%
469 | 1 | 0.2%
468 | 1 | 0.2%
467 | 1 | 0.2%
466 | 1 | 0.2%
465 | 1 | 0.2%
464 | 1 | 0.2%

자치구명
Categorical

HIGH CORRELATION 

Distinct: 25
Distinct (%): 5.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

강남구: 35
서초구: 33
송파구: 33
강서구: 32
영등포구: 29
Other values (20): 311

Length

Max length: 4
Median length: 3
Mean length: 3.1099366
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 용산구
2nd row: 용산구
3rd row: 용산구
4th row: 용산구
5th row: 용산구

Common Values

Value | Count | Frequency (%)
강남구 | 35 | 7.4%
서초구 | 33 | 7.0%
송파구 | 33 | 7.0%
강서구 | 32 | 6.8%
영등포구 | 29 | 6.1%
양천구 | 25 | 5.3%
성북구 | 24 | 5.1%
구로구 | 21 | 4.4%
동대문구 | 19 | 4.0%
도봉구 | 18 | 3.8%
Other values (15) | 204 | 43.1%

Length

Histogram of lengths of the category

주유소명
Text

Distinct: 466
Distinct (%): 98.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

Length

Max length: 21
Median length: 17
Mean length: 10.72093
Min length: 5

Characters and Unicode

Total characters: 5071
Distinct characters: 320
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
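
As a rough illustration of how such per-category counts can be derived, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. The `category_counts` helper is a name chosen here for illustration, and the sample string is one of the station names shown in the report. The two-letter codes map to the report's labels as follows: `Lo` is Other Letter, `So` is Other Symbol, and `Zs` is Space Separator.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category (for example 'Lo', 'Zs')."""
    return Counter(unicodedata.category(ch) for ch in text)


# One station name taken from the report's sample rows.
sample = "현대오일뱅크㈜ 직영갈월동주유소"
counts = category_counts(sample)

# Hangul syllables fall under 'Lo', the enclosed company mark
# under 'So', and the plain space under 'Zs'.
for category, count in counts.most_common():
    print(category, count)
```

Summing such counters over every value of a column would give a category table of the kind shown for this variable; whether ydata-profiling computes it exactly this way is not stated in the report.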

Unique

Unique: 459
Unique (%): 97.0%
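
The "Distinct" and "Unique" figures answer different questions. The sketch below assumes the usual reading: distinct is the number of different values, and unique is the number of values that occur exactly once. The `distinct_and_unique` helper and the toy data are illustrative and are not taken from the dataset.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Toy data: "B" and "C" repeat, "A" and "D" occur once.
names = ["A", "B", "B", "C", "C", "D"]
result = distinct_and_unique(names)
```

For this variable, 466 distinct names of which 459 are unique means that 7 names are shared by more than one row.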

Sample

1st row: 현대오일뱅크(주) 직영소월길주유소
2nd row: 선익상사(주) 동자동주유소
3rd row: 현대오일뱅크㈜ 직영갈월동주유소
4th row: 서계주유소
5th row: ㈜영원에너지 풍기주유소
Value | Count | Frequency (%)
현대오일뱅크㈜직영 | 20 | 2.8%
셀프 | 16 | 2.3%
직영 | 11 | 1.5%
현대오일뱅크㈜ | 11 | 1.5%
주식회사 | 10 | 1.4%
지에스칼텍스(주 | 10 | 1.4%
현대오일뱅크(주)직영 | 9 | 1.3%
sk에너지㈜ | 9 | 1.3%
구도일주유소 | 8 | 1.1%
지에스칼텍스㈜ | 6 | 0.8%
Other values (546) | 601 | 84.5%

Most occurring characters

ValueCountFrequency (%)
535
 
10.6%
489
 
9.6%
443
 
8.7%
256
 
5.0%
167
 
3.3%
132
 
2.6%
128
 
2.5%
123
 
2.4%
) 116
 
2.3%
( 116
 
2.3%
Other values (310) 2566
50.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4320 | 85.2%
Space Separator | 256 | 5.0%
Other Symbol | 167 | 3.3%
Close Punctuation | 116 | 2.3%
Open Punctuation | 116 | 2.3%
Uppercase Letter | 76 | 1.5%
Decimal Number | 11 | 0.2%
Lowercase Letter | 6 | 0.1%
Dash Punctuation | 3 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
535
 
12.4%
489
 
11.3%
443
 
10.3%
132
 
3.1%
128
 
3.0%
123
 
2.8%
105
 
2.4%
98
 
2.3%
81
 
1.9%
81
 
1.9%
Other values (287) 2105
48.7%
Uppercase Letter
Value | Count | Frequency (%)
K | 28 | 36.8%
S | 25 | 32.9%
H | 6 | 7.9%
J | 5 | 6.6%
G | 4 | 5.3%
L | 2 | 2.6%
C | 2 | 2.6%
P | 2 | 2.6%
Q | 1 | 1.3%
I | 1 | 1.3%
Lowercase Letter
Value | Count | Frequency (%)
s | 2 | 33.3%
k | 1 | 16.7%
f | 1 | 16.7%
l | 1 | 16.7%
e | 1 | 16.7%
Decimal Number
Value | Count | Frequency (%)
2 | 8 | 72.7%
1 | 2 | 18.2%
3 | 1 | 9.1%
Space Separator
ValueCountFrequency (%)
256
100.0%
Other Symbol
ValueCountFrequency (%)
167
100.0%
Close Punctuation
ValueCountFrequency (%)
) 116
100.0%
Open Punctuation
ValueCountFrequency (%)
( 116
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4487 | 88.5%
Common | 502 | 9.9%
Latin | 82 | 1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
535
 
11.9%
489
 
10.9%
443
 
9.9%
167
 
3.7%
132
 
2.9%
128
 
2.9%
123
 
2.7%
105
 
2.3%
98
 
2.2%
81
 
1.8%
Other values (288) 2186
48.7%
Latin
Value | Count | Frequency (%)
K | 28 | 34.1%
S | 25 | 30.5%
H | 6 | 7.3%
J | 5 | 6.1%
G | 4 | 4.9%
L | 2 | 2.4%
s | 2 | 2.4%
C | 2 | 2.4%
P | 2 | 2.4%
Q | 1 | 1.2%
Other values (5) | 5 | 6.1%
Common
ValueCountFrequency (%)
256
51.0%
) 116
23.1%
( 116
23.1%
2 8
 
1.6%
- 3
 
0.6%
1 2
 
0.4%
3 1
 
0.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4320 | 85.2%
ASCII | 584 | 11.5%
None | 167 | 3.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
535
 
12.4%
489
 
11.3%
443
 
10.3%
132
 
3.1%
128
 
3.0%
123
 
2.8%
105
 
2.4%
98
 
2.3%
81
 
1.9%
81
 
1.9%
Other values (287) 2105
48.7%
ASCII
ValueCountFrequency (%)
256
43.8%
) 116
19.9%
( 116
19.9%
K 28
 
4.8%
S 25
 
4.3%
2 8
 
1.4%
H 6
 
1.0%
J 5
 
0.9%
G 4
 
0.7%
- 3
 
0.5%
Other values (12) 17
 
2.9%
None
ValueCountFrequency (%)
167
100.0%

주소
Text

UNIQUE 

Distinct: 473
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

Length

Max length: 35
Median length: 33
Mean length: 23.114165
Min length: 15

Characters and Unicode

Total characters: 10933
Distinct characters: 234
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 473
Unique (%): 100.0%

Sample

1st row: 서울특별시 용산구 소월로66
2nd row: 서울특별시 용산구 한강대로 104길 6
3rd row: 서울특별시 용산구 한강대로 322
4th row: 서울특별시 용산구 청파로 367
5th row: 서울특별시 용산구 원효로178
Value | Count | Frequency (%)
서울특별시 | 473 | 22.6%
강남구 | 35 | 1.7%
서초구 | 33 | 1.6%
송파구 | 33 | 1.6%
강서구 | 31 | 1.5%
영등포구 | 29 | 1.4%
양천구 | 25 | 1.2%
성북구 | 24 | 1.1%
구로구 | 21 | 1.0%
남부순환로 | 20 | 1.0%
Other values (802) | 1368 | 65.4%

Most occurring characters

ValueCountFrequency (%)
1639
 
15.0%
567
 
5.2%
514
 
4.7%
510
 
4.7%
499
 
4.6%
490
 
4.5%
474
 
4.3%
474
 
4.3%
474
 
4.3%
( 408
 
3.7%
Other values (224) 4884
44.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6727 | 61.5%
Decimal Number | 1688 | 15.4%
Space Separator | 1639 | 15.0%
Open Punctuation | 408 | 3.7%
Close Punctuation | 408 | 3.7%
Dash Punctuation | 62 | 0.6%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
567
 
8.4%
514
 
7.6%
510
 
7.6%
499
 
7.4%
490
 
7.3%
474
 
7.0%
474
 
7.0%
474
 
7.0%
115
 
1.7%
105
 
1.6%
Other values (209) 2505
37.2%
Decimal Number
Value | Count | Frequency (%)
1 | 318 | 18.8%
2 | 240 | 14.2%
3 | 178 | 10.5%
4 | 171 | 10.1%
5 | 153 | 9.1%
6 | 150 | 8.9%
7 | 136 | 8.1%
8 | 124 | 7.3%
9 | 116 | 6.9%
0 | 102 | 6.0%
Space Separator
ValueCountFrequency (%)
1639
100.0%
Open Punctuation
ValueCountFrequency (%)
( 408
100.0%
Close Punctuation
ValueCountFrequency (%)
) 408
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 62
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6727 | 61.5%
Common | 4206 | 38.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
567
 
8.4%
514
 
7.6%
510
 
7.6%
499
 
7.4%
490
 
7.3%
474
 
7.0%
474
 
7.0%
474
 
7.0%
115
 
1.7%
105
 
1.6%
Other values (209) 2505
37.2%
Common
ValueCountFrequency (%)
1639
39.0%
( 408
 
9.7%
) 408
 
9.7%
1 318
 
7.6%
2 240
 
5.7%
3 178
 
4.2%
4 171
 
4.1%
5 153
 
3.6%
6 150
 
3.6%
7 136
 
3.2%
Other values (5) 405
 
9.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6727 | 61.5%
ASCII | 4206 | 38.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1639
39.0%
( 408
 
9.7%
) 408
 
9.7%
1 318
 
7.6%
2 240
 
5.7%
3 178
 
4.2%
4 171
 
4.1%
5 153
 
3.6%
6 150
 
3.6%
7 136
 
3.2%
Other values (5) 405
 
9.6%
Hangul
ValueCountFrequency (%)
567
 
8.4%
514
 
7.6%
510
 
7.6%
499
 
7.4%
490
 
7.3%
474
 
7.0%
474
 
7.0%
474
 
7.0%
115
 
1.7%
105
 
1.6%
Other values (209) 2505
37.2%

Interactions


Correlations

Variable | 연번 | 자치구명
연번 | 1.000 | 0.987
자치구명 | 0.987 | 1.000

Variable | 연번 | 자치구명
연번 | 1.000 | 0.873
자치구명 | 0.873 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
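
Both views can be approximated in a few lines of plain Python. The sketch below is illustrative only: `missing_counts` and `nullity_matrix` are names chosen here, the first row is taken from the report's sample, and the second row is a hypothetical incomplete record added to show what a gap looks like, since this dataset itself has no missing cells.

```python
def missing_counts(rows, columns):
    """Return the number of missing (None or empty string) cells per column."""
    return {
        col: sum(1 for row in rows if row.get(col) in (None, ""))
        for col in columns
    }


def nullity_matrix(rows, columns):
    """Render one string per row: '#' for a present value, '.' for a missing one."""
    return [
        "".join("." if row.get(col) in (None, "") else "#" for col in columns)
        for row in rows
    ]


columns = ["연번", "자치구명", "주유소명", "주소"]
rows = [
    # Taken from the report's sample rows.
    {"연번": 4, "자치구명": "용산구", "주유소명": "서계주유소",
     "주소": "서울특별시 용산구 청파로 367"},
    # Hypothetical incomplete record, not part of the dataset.
    {"연번": 999, "자치구명": "용산구", "주유소명": "", "주소": None},
]

counts = missing_counts(rows, columns)
matrix = nullity_matrix(rows, columns)
```

Here `counts` plays the role of the per-column nullity chart and `matrix` the role of the nullity matrix, with one character per cell.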

Sample

First rows
Index | 연번 | 자치구명 | 주유소명 | 주소
0 | 1 | 용산구 | 현대오일뱅크(주) 직영소월길주유소 | 서울특별시 용산구 소월로66
1 | 2 | 용산구 | 선익상사(주) 동자동주유소 | 서울특별시 용산구 한강대로 104길 6
2 | 3 | 용산구 | 현대오일뱅크㈜ 직영갈월동주유소 | 서울특별시 용산구 한강대로 322
3 | 4 | 용산구 | 서계주유소 | 서울특별시 용산구 청파로 367
4 | 5 | 용산구 | ㈜영원에너지 풍기주유소 | 서울특별시 용산구 원효로178
5 | 6 | 용산구 | ㈜신태성주유소 | 서울특별시 용산구 원효로 147
6 | 7 | 용산구 | 현대오일뱅크㈜ 직영강변주유소 | 서울특별시 용산구 원효로 9
7 | 8 | 용산구 | 한국석유공업㈜ 한석주유소 | 서울특별시 용산구 이촌로 164
8 | 9 | 용산구 | (주)중앙에너비스 한남지점 | 서울특별시 용산구 한남대로 21길 4
9 | 10 | 용산구 | 한남제3한강주유소 주식회사 | 서울특별시 용산구 한남대로 45

Last rows
Index | 연번 | 자치구명 | 주유소명 | 주소
463 | 464 | 도봉구 | 구도일주유소 파크빌 | 서울특별시 도봉구 해등로3길 86 (창동)
464 | 465 | 도봉구 | 대성산업㈜ 신창주유소 | 서울특별시 도봉구 덕릉로 267 (창동)
465 | 466 | 도봉구 | 동일석유㈜ 창동주유소 | 서울특별시 도봉구 도봉로 434 (창동)
466 | 467 | 도봉구 | 극동유화㈜ 대안주유소 | 서울특별시 도봉구 마들로 574 (창동)
467 | 468 | 도봉구 | 한이에너지㈜ KLP제1주유소 | 서울특별시 도봉구 도봉로 596 (창동)
468 | 469 | 도봉구 | 현대오일뱅크㈜직영 도봉현대셀프주유소 | 서울특별시 도봉구 도봉로 941 (도봉동)
469 | 470 | 도봉구 | GS칼텍스㈜ 도봉주유소 | 서울특별시 도봉구 도봉로 895 (도봉동)
470 | 471 | 도봉구 | (주)송만에너지 도봉제일주유소 | 서울특별시 도봉구 도봉로 783 (도봉동)
471 | 472 | 도봉구 | 노원교주유소 | 서울특별시 도봉구 마들로 776 (도봉동)
472 | 473 | 도봉구 | 오복주유소 | 서울특별시 도봉구 방학로 43 (방학동)