Overview

Dataset statistics

Number of variables6
Number of observations29
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)3.4%
Total size in memory1.5 KiB
Average record size in memory53.6 B

Variable types

Text1
Categorical4
Numeric1

Dataset

Description부산광역시북구하천점용현황_20230413
Author부산광역시 북구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15007459

Alerts

Dataset has 1 (3.4%) duplicate rowsDuplicates
점용시작일 is highly overall correlated with 점용종료일High correlation
점용종료일 is highly overall correlated with 점용시작일High correlation
하천명 is highly overall correlated with 점용목적High correlation
점용목적 is highly overall correlated with 하천명High correlation

Reproduction

Analysis started2023-12-10 16:50:15.348731
Analysis finished2023-12-10 16:50:18.462391
Duration3.11 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct17
Distinct (%)58.6%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-12-11T01:50:18.644122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length31
Mean length23.103448
Min length19

Characters and Unicode

Total characters670
Distinct characters32
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)34.5%

Sample

1st row부산광역시 북구 구포동 1141번지 112호 (745-11번지선)
2nd row부산광역시 북구 구포동 1141번지 120호
3rd row부산광역시 북구 구포동 166번지
4th row부산광역시 북구 구포동 620번지 6호
5th row부산광역시 북구 구포동 1141번지 112호
ValueCountFrequency (%)
부산광역시 29
21.0%
북구 29
21.0%
만덕동 15
10.9%
783번지 11
 
8.0%
구포동 8
 
5.8%
54호 7
 
5.1%
화명동 5
 
3.6%
775번지 4
 
2.9%
1141번지 4
 
2.9%
12호 3
 
2.2%
Other values (17) 23
16.7%
2023-12-11T01:50:19.418532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
149
22.2%
37
 
5.5%
1 36
 
5.4%
31
 
4.6%
31
 
4.6%
29
 
4.3%
29
 
4.3%
29
 
4.3%
29
 
4.3%
29
 
4.3%
Other values (22) 241
36.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 374
55.8%
Space Separator 149
 
22.2%
Decimal Number 142
 
21.2%
Open Punctuation 2
 
0.3%
Close Punctuation 2
 
0.3%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
9.9%
31
8.3%
31
8.3%
29
7.8%
29
7.8%
29
7.8%
29
7.8%
29
7.8%
29
7.8%
29
7.8%
Other values (8) 72
19.3%
Decimal Number
ValueCountFrequency (%)
1 36
25.4%
7 20
14.1%
5 17
12.0%
4 16
11.3%
2 16
11.3%
8 12
 
8.5%
3 11
 
7.7%
6 10
 
7.0%
0 3
 
2.1%
9 1
 
0.7%
Space Separator
ValueCountFrequency (%)
149
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 374
55.8%
Common 296
44.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
9.9%
31
8.3%
31
8.3%
29
7.8%
29
7.8%
29
7.8%
29
7.8%
29
7.8%
29
7.8%
29
7.8%
Other values (8) 72
19.3%
Common
ValueCountFrequency (%)
149
50.3%
1 36
 
12.2%
7 20
 
6.8%
5 17
 
5.7%
4 16
 
5.4%
2 16
 
5.4%
8 12
 
4.1%
3 11
 
3.7%
6 10
 
3.4%
0 3
 
1.0%
Other values (4) 6
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 374
55.8%
ASCII 296
44.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
149
50.3%
1 36
 
12.2%
7 20
 
6.8%
5 17
 
5.7%
4 16
 
5.4%
2 16
 
5.4%
8 12
 
4.1%
3 11
 
3.7%
6 10
 
3.4%
0 3
 
1.0%
Other values (4) 6
 
2.0%
Hangul
ValueCountFrequency (%)
37
9.9%
31
8.3%
31
8.3%
29
7.8%
29
7.8%
29
7.8%
29
7.8%
29
7.8%
29
7.8%
29
7.8%
Other values (8) 72
19.3%

점용시작일
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)48.3%
Missing0
Missing (%)0.0%
Memory size364.0 B
2012-01-01
2011-01-01
2013-01-01
2015-01-01
2021-05-17
Other values (9)
10 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique8 ?
Unique (%)27.6%

Sample

1st row2015-01-01
2nd row2011-01-01
3rd row2012-02-06
4th row2013-01-01
5th row2021-05-17

Common Values

ValueCountFrequency (%)
2012-01-01 7
24.1%
2011-01-01 4
13.8%
2013-01-01 4
13.8%
2015-01-01 2
 
6.9%
2021-05-17 2
 
6.9%
2018-01-01 2
 
6.9%
2012-02-06 1
 
3.4%
2021-08-26 1
 
3.4%
2021-09-13 1
 
3.4%
2022-06-10 1
 
3.4%
Other values (4) 4
13.8%

Length

2023-12-11T01:50:19.666563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2012-01-01 7
24.1%
2011-01-01 4
13.8%
2013-01-01 4
13.8%
2015-01-01 2
 
6.9%
2021-05-17 2
 
6.9%
2018-01-01 2
 
6.9%
2012-02-06 1
 
3.4%
2021-08-26 1
 
3.4%
2021-09-13 1
 
3.4%
2022-06-10 1
 
3.4%
Other values (4) 4
13.8%

점용종료일
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)27.6%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-12-31
15 
2025-12-31
2024-12-31
2026-12-31
영구
 
1
Other values (3)

Length

Max length10
Median length10
Mean length9.7241379
Min length2

Unique

Unique4 ?
Unique (%)13.8%

Sample

1st row2024-12-31
2nd row2023-12-31
3rd row2023-12-31
4th row2023-12-31
5th row2026-12-31

Common Values

ValueCountFrequency (%)
2023-12-31 15
51.7%
2025-12-31 4
 
13.8%
2024-12-31 3
 
10.3%
2026-12-31 3
 
10.3%
영구 1
 
3.4%
2026-06-10 1
 
3.4%
2026-05-31 1
 
3.4%
2024-05-31 1
 
3.4%

Length

2023-12-11T01:50:19.921386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:50:20.137879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-12-31 15
51.7%
2025-12-31 4
 
13.8%
2024-12-31 3
 
10.3%
2026-12-31 3
 
10.3%
영구 1
 
3.4%
2026-06-10 1
 
3.4%
2026-05-31 1
 
3.4%
2024-05-31 1
 
3.4%

하천명
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)24.1%
Missing0
Missing (%)0.0%
Memory size364.0 B
지방2급대리천
사기천
지방2급대천천
지방2급덕천천
병풍천
Other values (2)

Length

Max length7
Median length7
Mean length5.2068966
Min length3

Unique

Unique2 ?
Unique (%)6.9%

Sample

1st row지방2급대리천
2nd row지방2급대천천
3rd row지방2급대리천
4th row지방2급대리천
5th row지방2급대리천

Common Values

ValueCountFrequency (%)
지방2급대리천 7
24.1%
사기천 7
24.1%
지방2급대천천 5
17.2%
지방2급덕천천 4
13.8%
병풍천 4
13.8%
용을천 1
 
3.4%
용두천 1
 
3.4%

Length

2023-12-11T01:50:20.406051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:50:20.624520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지방2급대리천 7
24.1%
사기천 7
24.1%
지방2급대천천 5
17.2%
지방2급덕천천 4
13.8%
병풍천 4
13.8%
용을천 1
 
3.4%
용두천 1
 
3.4%

점용목적
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)13.8%
Missing0
Missing (%)0.0%
Memory size364.0 B
기타 목적을 위한 점용
15 
일반 공작물
관로등 매설을 위한 점용
주거를 목적으로 하는 점용

Length

Max length14
Median length12
Mean length11.413793
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주거를 목적으로 하는 점용
2nd row기타 목적을 위한 점용
3rd row일반 공작물
4th row기타 목적을 위한 점용
5th row일반 공작물

Common Values

ValueCountFrequency (%)
기타 목적을 위한 점용 15
51.7%
일반 공작물 5
 
17.2%
관로등 매설을 위한 점용 5
 
17.2%
주거를 목적으로 하는 점용 4
 
13.8%

Length

2023-12-11T01:50:20.869070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:50:21.048828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
점용 24
22.6%
위한 20
18.9%
기타 15
14.2%
목적을 15
14.2%
일반 5
 
4.7%
공작물 5
 
4.7%
관로등 5
 
4.7%
매설을 5
 
4.7%
주거를 4
 
3.8%
목적으로 4
 
3.8%
Distinct26
Distinct (%)89.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74.516207
Minimum0.16
Maximum621
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-11T01:50:21.252127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.16
5-th percentile0.876
Q14
median17
Q363
95-th percentile305
Maximum621
Range620.84
Interquartile range (IQR)59

Descriptive statistics

Standard deviation137.24519
Coefficient of variation (CV)1.8418166
Kurtosis8.7097152
Mean74.516207
Median Absolute Deviation (MAD)14
Skewness2.8047987
Sum2160.97
Variance18836.241
MonotonicityNot monotonic
2023-12-11T01:50:21.461549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
0.16 2
 
6.9%
17.0 2
 
6.9%
4.0 2
 
6.9%
203.0 1
 
3.4%
2.0 1
 
3.4%
30.0 1
 
3.4%
3.9 1
 
3.4%
621.0 1
 
3.4%
3.0 1
 
3.4%
63.0 1
 
3.4%
Other values (16) 16
55.2%
ValueCountFrequency (%)
0.16 2
6.9%
1.95 1
3.4%
2.0 1
3.4%
3.0 1
3.4%
3.6 1
3.4%
3.9 1
3.4%
4.0 2
6.9%
6.0 1
3.4%
7.0 1
3.4%
10.0 1
3.4%
ValueCountFrequency (%)
621.0 1
3.4%
343.0 1
3.4%
248.0 1
3.4%
240.0 1
3.4%
203.0 1
3.4%
85.7 1
3.4%
83.0 1
3.4%
63.0 1
3.4%
42.0 1
3.4%
31.0 1
3.4%

Interactions

2023-12-11T01:50:17.956400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:50:21.630232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부과대상점용시작일점용종료일하천명점용목적점용면적(제곱미터)
부과대상1.0000.9410.9221.0001.0000.747
점용시작일0.9411.0000.9660.9010.7620.626
점용종료일0.9220.9661.0000.4520.5930.453
하천명1.0000.9010.4521.0000.7070.176
점용목적1.0000.7620.5930.7071.0000.000
점용면적(제곱미터)0.7470.6260.4530.1760.0001.000
2023-12-11T01:50:21.817402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
점용종료일점용목적점용시작일하천명
점용종료일1.0000.2570.7360.235
점용목적0.2571.0000.4100.535
점용시작일0.7360.4101.0000.458
하천명0.2350.5350.4581.000
2023-12-11T01:50:21.967248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
점용면적(제곱미터)점용시작일점용종료일하천명점용목적
점용면적(제곱미터)1.0000.2800.2630.0560.000
점용시작일0.2801.0000.7360.4580.410
점용종료일0.2630.7361.0000.2350.257
하천명0.0560.4580.2351.0000.535
점용목적0.0000.4100.2570.5351.000

Missing values

2023-12-11T01:50:18.191859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:50:18.347089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

부과대상점용시작일점용종료일하천명점용목적점용면적(제곱미터)
0부산광역시 북구 구포동 1141번지 112호 (745-11번지선)2015-01-012024-12-31지방2급대리천주거를 목적으로 하는 점용203.0
1부산광역시 북구 구포동 1141번지 120호2011-01-012023-12-31지방2급대천천기타 목적을 위한 점용83.0
2부산광역시 북구 구포동 166번지2012-02-062023-12-31지방2급대리천일반 공작물3.6
3부산광역시 북구 구포동 620번지 6호2013-01-012023-12-31지방2급대리천기타 목적을 위한 점용7.0
4부산광역시 북구 구포동 1141번지 112호2021-05-172026-12-31지방2급대리천일반 공작물26.3
5부산광역시 북구 구포동 1141번지 112호2021-05-17영구지방2급대리천일반 공작물31.0
6부산광역시 북구 구포동 156번지 11호2021-08-262026-12-31지방2급대리천관로등 매설을 위한 점용0.16
7부산광역시 북구 구포동 156번지 11호2021-09-132026-12-31지방2급대리천관로등 매설을 위한 점용0.16
8부산광역시 북구 만덕동 783번지 12호2013-01-012023-12-31지방2급덕천천주거를 목적으로 하는 점용22.0
9부산광역시 북구 만덕동 783번지 12호2013-01-012023-12-31지방2급덕천천주거를 목적으로 하는 점용42.0
부과대상점용시작일점용종료일하천명점용목적점용면적(제곱미터)
19부산광역시 북구 만덕동 775번지2018-01-012023-12-31병풍천기타 목적을 위한 점용10.0
20부산광역시 북구 만덕동 775번지2012-01-012023-12-31병풍천기타 목적을 위한 점용17.0
21부산광역시 북구 만덕동 775번지2011-01-012025-12-31병풍천기타 목적을 위한 점용6.0
22부산광역시 북구 만덕동 783번지 4호 (546번지선)2011-02-142025-12-31용을천기타 목적을 위한 점용63.0
23부산광역시 북구 만덕동 783번지 54호2012-01-012023-12-31사기천기타 목적을 위한 점용3.0
24부산광역시 북구 만덕동 775번지2011-01-012025-12-31병풍천기타 목적을 위한 점용17.0
25부산광역시 북구 만덕동 783번지 54호2012-01-012023-12-31사기천기타 목적을 위한 점용4.0
26부산광역시 북구 만덕동 783번지 54호2019-01-012024-05-31사기천기타 목적을 위한 점용621.0
27부산광역시 북구 화명동 2126번지2020-12-162024-12-31용두천일반 공작물3.9
28부산광역시 북구 만덕동 783번지 54호2012-01-012023-12-31사기천기타 목적을 위한 점용30.0

Duplicate rows

Most frequently occurring

부과대상점용시작일점용종료일하천명점용목적점용면적(제곱미터)# duplicates
0부산광역시 북구 만덕동 783번지 54호2012-01-012023-12-31사기천기타 목적을 위한 점용4.02