Overview

Dataset statistics

Number of variables3
Number of observations55
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory27.4 B

Variable types

Text2
Numeric1

Dataset

Description부산광역시연제구_가로등현황_20230801
Author부산광역시 연제구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15028785

Alerts

위치 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:08:17.938233
Analysis finished2023-12-10 17:08:18.846216
Duration0.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct49
Distinct (%)89.1%
Missing0
Missing (%)0.0%
Memory size572.0 B
2023-12-11T02:08:19.081368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length5.0363636
Min length3

Characters and Unicode

Total characters277
Distinct characters70
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)78.2%

Sample

1st row거제대로
2nd row연수로
3rd row반송로
4th row아시아드대로
5th row종합운동장로
ValueCountFrequency (%)
00번길 9
 
13.8%
중앙대로 3
 
4.6%
월드컵대로 3
 
4.6%
연수로 3
 
4.6%
해맞이로 2
 
3.1%
아시아드대로 2
 
3.1%
쌍미천로 2
 
3.1%
과정로 2
 
3.1%
고분로 2
 
3.1%
법원로 2
 
3.1%
Other values (34) 35
53.8%
2023-12-11T02:08:19.694850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
19.5%
35
 
12.6%
0 19
 
6.9%
11
 
4.0%
11
 
4.0%
10
 
3.6%
10
 
3.6%
7
 
2.5%
5
 
1.8%
5
 
1.8%
Other values (60) 110
39.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 223
80.5%
Space Separator 35
 
12.6%
Decimal Number 19
 
6.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
24.2%
11
 
4.9%
11
 
4.9%
10
 
4.5%
10
 
4.5%
7
 
3.1%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
Other values (58) 100
44.8%
Space Separator
ValueCountFrequency (%)
35
100.0%
Decimal Number
ValueCountFrequency (%)
0 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 223
80.5%
Common 54
 
19.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
24.2%
11
 
4.9%
11
 
4.9%
10
 
4.5%
10
 
4.5%
7
 
3.1%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
Other values (58) 100
44.8%
Common
ValueCountFrequency (%)
35
64.8%
0 19
35.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 223
80.5%
ASCII 54
 
19.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
24.2%
11
 
4.9%
11
 
4.9%
10
 
4.5%
10
 
4.5%
7
 
3.1%
5
 
2.2%
5
 
2.2%
5
 
2.2%
5
 
2.2%
Other values (58) 100
44.8%
ASCII
ValueCountFrequency (%)
35
64.8%
0 19
35.2%

위치
Text

UNIQUE 

Distinct55
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size572.0 B
2023-12-11T02:08:20.004605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length19
Mean length14.254545
Min length6

Characters and Unicode

Total characters784
Distinct characters182
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)100.0%

Sample

1st row하마정(진구경계)~교대역
2nd row양정(진구 경계)~망미(수영구 경계)
3rd row연산로타리~연안교 일원
4th row거제삼거리~운동장교차로
5th row사직테니스장(동래구경계)~월드컵대로
ValueCountFrequency (%)
일대 5
 
5.2%
일원 2
 
2.1%
정문 2
 
2.1%
경계 2
 
2.1%
2
 
2.1%
하마정(진구경계)~교대역 1
 
1.0%
홈플러스(일원 1
 
1.0%
1
 
1.0%
올라가는 1
 
1.0%
동덕현대아파트 1
 
1.0%
Other values (79) 79
81.4%
2023-12-11T02:08:20.539765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
~ 54
 
6.9%
42
 
5.4%
27
 
3.4%
25
 
3.2%
22
 
2.8%
20
 
2.6%
18
 
2.3%
17
 
2.2%
16
 
2.0%
15
 
1.9%
Other values (172) 528
67.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 644
82.1%
Math Symbol 54
 
6.9%
Space Separator 42
 
5.4%
Decimal Number 18
 
2.3%
Open Punctuation 10
 
1.3%
Close Punctuation 10
 
1.3%
Uppercase Letter 6
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
27
 
4.2%
25
 
3.9%
22
 
3.4%
20
 
3.1%
18
 
2.8%
17
 
2.6%
16
 
2.5%
15
 
2.3%
13
 
2.0%
13
 
2.0%
Other values (158) 458
71.1%
Decimal Number
ValueCountFrequency (%)
1 8
44.4%
3 3
 
16.7%
2 3
 
16.7%
0 2
 
11.1%
4 1
 
5.6%
7 1
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
G 2
33.3%
L 2
33.3%
C 1
16.7%
S 1
16.7%
Math Symbol
ValueCountFrequency (%)
~ 54
100.0%
Space Separator
ValueCountFrequency (%)
42
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 644
82.1%
Common 134
 
17.1%
Latin 6
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
 
4.2%
25
 
3.9%
22
 
3.4%
20
 
3.1%
18
 
2.8%
17
 
2.6%
16
 
2.5%
15
 
2.3%
13
 
2.0%
13
 
2.0%
Other values (158) 458
71.1%
Common
ValueCountFrequency (%)
~ 54
40.3%
42
31.3%
( 10
 
7.5%
) 10
 
7.5%
1 8
 
6.0%
3 3
 
2.2%
2 3
 
2.2%
0 2
 
1.5%
4 1
 
0.7%
7 1
 
0.7%
Latin
ValueCountFrequency (%)
G 2
33.3%
L 2
33.3%
C 1
16.7%
S 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 644
82.1%
ASCII 140
 
17.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
~ 54
38.6%
42
30.0%
( 10
 
7.1%
) 10
 
7.1%
1 8
 
5.7%
3 3
 
2.1%
2 3
 
2.1%
G 2
 
1.4%
L 2
 
1.4%
0 2
 
1.4%
Other values (4) 4
 
2.9%
Hangul
ValueCountFrequency (%)
27
 
4.2%
25
 
3.9%
22
 
3.4%
20
 
3.1%
18
 
2.8%
17
 
2.6%
16
 
2.5%
15
 
2.3%
13
 
2.0%
13
 
2.0%
Other values (158) 458
71.1%

가로등(등)
Real number (ℝ)

Distinct45
Distinct (%)81.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean80.472727
Minimum11
Maximum392
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size627.0 B
2023-12-11T02:08:20.741187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11
5-th percentile12
Q122
median44
Q388.5
95-th percentile252.9
Maximum392
Range381
Interquartile range (IQR)66.5

Descriptive statistics

Standard deviation84.784444
Coefficient of variation (CV)1.0535799
Kurtosis2.8731352
Mean80.472727
Median Absolute Deviation (MAD)28
Skewness1.7617111
Sum4426
Variance7188.402
MonotonicityNot monotonic
2023-12-11T02:08:20.975048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
15 3
 
5.5%
22 2
 
3.6%
12 2
 
3.6%
11 2
 
3.6%
36 2
 
3.6%
25 2
 
3.6%
165 2
 
3.6%
19 2
 
3.6%
86 2
 
3.6%
65 1
 
1.8%
Other values (35) 35
63.6%
ValueCountFrequency (%)
11 2
3.6%
12 2
3.6%
14 1
 
1.8%
15 3
5.5%
16 1
 
1.8%
18 1
 
1.8%
19 2
3.6%
21 1
 
1.8%
22 2
3.6%
24 1
 
1.8%
ValueCountFrequency (%)
392 1
1.8%
297 1
1.8%
262 1
1.8%
249 1
1.8%
222 1
1.8%
216 1
1.8%
210 1
1.8%
183 1
1.8%
165 2
3.6%
143 1
1.8%

Interactions

2023-12-11T02:08:18.358659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T02:08:21.138772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도로명위치가로등(등)
도로명1.0001.0000.000
위치1.0001.0001.000
가로등(등)0.0001.0001.000

Missing values

2023-12-11T02:08:18.623680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:08:18.785277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

도로명위치가로등(등)
0거제대로하마정(진구경계)~교대역297
1연수로양정(진구 경계)~망미(수영구 경계)143
2반송로연산로타리~연안교 일원116
3아시아드대로거제삼거리~운동장교차로77
4종합운동장로사직테니스장(동래구경계)~월드컵대로86
5명륜로교육대학 앞~세병교39
6과정로과정사거리~망미동 경계210
7중앙대로시청(진구경계)~구 송월타올(동래구경계)222
8월드컵대로연산로터리~월드컵대로(시립도서관 진구경계)392
9미남로법원 ~ 법원어귀교차로 일대46
도로명위치가로등(등)
45중앙대로 00번길부산시청 맞은편~동원비스타 정문12
46중앙천로부산은행~연산2치안센터(일원)69
47중앙천로중앙천로~월드컵대로73번길25
48토곡로동서교회~현대오일뱅크22
49토현로농협토곡지점~토현중19
50톳고개로화신사이버대학교~동서그린아파트15
51해맞이로거제유림아시아드~거제초등학교24
52해맞이로거제유림아시아드~화신데파트79
53화지로거제2글로벌화임빌~거제4부산정보고교60
54황새알로교대 테니스장 ~ 거제 자이 11동 앞12