Overview

Dataset statistics

Number of variables: 5
Number of observations: 806
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 22
Duplicate rows (%): 2.7%
Total size in memory: 32.4 KiB
Average record size in memory: 41.2 B

Variable types

Numeric: 1
Categorical: 3
Text: 1

Dataset

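Description: 부산광역시해운대구_재정정보공개시스템_부서정보_20210112 (Haeundae-gu, Busan Metropolitan City; fiscal information disclosure system; department information)
Author: 부산광역시 해운대구 (Haeundae-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15050176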

Alerts

Dataset has 22 (2.7%) duplicate rows (Duplicates)
부서구분명 is highly correlated overall with 관서명 and 1 other field (High correlation)
실국명 is highly correlated overall with 부서구분명 (High correlation)
관서명 is highly correlated overall with 부서구분명 (High correlation)
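
The duplicate-row alert can be checked by hand. The sketch below is a minimal illustration, not the profiler's own implementation: it assumes that a "duplicate row" means a distinct combination of values that occurs more than once, and it uses a hypothetical helper `duplicate_summary` on a five-row toy table built from rows shown elsewhere in this report.

```python
from collections import Counter


def duplicate_summary(rows):
    """Return (distinct row combinations seen more than once, percent of all rows).

    Assumption: a duplicate is counted once per repeated combination,
    not once per extra copy.
    """
    counts = Counter(tuple(r) for r in rows)
    n_dup = sum(1 for c in counts.values() if c > 1)
    pct = 100.0 * n_dup / len(rows) if rows else 0.0
    return n_dup, pct


# Toy data (회계연도, 관서명, 부서구분명, 실국명, 부서명); not the full dataset.
toy_rows = [
    (2008, "본청", "본청", "일자리산업국", "관광문화과"),
    (2008, "본청", "본청", "일자리산업국", "관광문화과"),
    (2008, "본청", "본청", "행정관리국", "재무과"),
    (2008, "본청", "본청", "행정관리국", "재무과"),
    (2008, "본청", "본청", "안전도시국", "재난안전과"),
]
n_dup, pct = duplicate_summary(toy_rows)
print(n_dup, pct)

# The reported percentage follows from the reported counts: 22 of 806 rows.
reported_pct = round(100.0 * 22 / 806, 1)
print(reported_pct)
```

Because every duplicate listed in this report occurs exactly twice, counting repeated combinations and counting extra copies would give the same figure here.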

Reproduction

Analysis started: 2023-12-10 16:40:26.584364
Analysis finished: 2023-12-10 16:40:27.297647
Duration: 0.71 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

회계연도 (fiscal year)
Real number (ℝ)

Distinct: 13
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2013.8201
Minimum: 2008
Maximum: 2020
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 7.2 KiB

Quantile statistics

Minimum: 2008
5-th percentile: 2008
Q1: 2011
Median: 2014
Q3: 2017
95-th percentile: 2020
Maximum: 2020
Range: 12
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 3.6970551
Coefficient of variation (CV): 0.0018358418
Kurtosis: -1.1833789
Mean: 2013.8201
Median Absolute Deviation (MAD): 3
Skewness: 0.049436095
Sum: 1623139
Variance: 13.668217
Monotonicity: Not monotonic
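
Several of these figures are derived from one another, which makes them easy to cross-check. The sketch below uses only numbers printed in this report and the standard definitions (mean = sum / count, CV = standard deviation / mean, variance = standard deviation squared, range = max - min, IQR = Q3 - Q1); the variable names are illustrative.

```python
# Values copied from the report.
n_observations = 806
total = 1623139
std_dev = 3.6970551
minimum, maximum = 2008, 2020
q1, q3 = 2011, 2017

mean = total / n_observations   # reported as 2013.8201
cv = std_dev / mean             # reported as 0.0018358418
variance = std_dev ** 2         # reported as 13.668217
value_range = maximum - minimum # reported as 12
iqr = q3 - q1                   # reported as 6

print(round(mean, 4), cv, round(variance, 6), value_range, iqr)
```

The very small CV reflects that the values are calendar years: a spread of a few years is tiny relative to a mean near 2014.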
Histogram with fixed size bins (bins=13)

Common values

Value             Count  Frequency (%)
2009              65     8.1%
2008              65     8.1%
2015              64     7.9%
2014              64     7.9%
2013              64     7.9%
2012              64     7.9%
2011              64     7.9%
2010              64     7.9%
2016              63     7.8%
2017              60     7.4%
Other values (3)  169    21.0%

Minimum 10 values

Value  Count  Frequency (%)
2008   65     8.1%
2009   65     8.1%
2010   64     7.9%
2011   64     7.9%
2012   64     7.9%
2013   64     7.9%
2014   64     7.9%
2015   64     7.9%
2016   63     7.8%
2017   60     7.4%

Maximum 10 values

Value  Count  Frequency (%)
2020   54     6.7%
2019   56     6.9%
2018   59     7.3%
2017   60     7.4%
2016   63     7.8%
2015   64     7.9%
2014   64     7.9%
2013   64     7.9%
2012   64     7.9%
2011   64     7.9%

관서명 (office name)
Categorical

HIGH CORRELATION 

Distinct: 27
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

Category           Count
본청               460
보건소             26
송정동             13
우2동              13
우1동              13
Other values (22)  281

Length

Max length: 9
Median length: 2
Mean length: 2.9230769
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 본청
2nd row: 본청
3rd row: 본청
4th row: 본청
5th row: 본청

Common Values

Value              Count  Frequency (%)
본청               460    57.1%
보건소             26     3.2%
송정동             13     1.6%
우2동              13     1.6%
우1동              13     1.6%
재송어린이도서관   13     1.6%
반여도서관         13     1.6%
해운대문화회관     13     1.6%
관광시설관리사업소 13     1.6%
의회사무국         13     1.6%
Other values (17)  216    26.8%

Length

Histogram of lengths of the category

Value              Count  Frequency (%)
본청               460    57.1%
보건소             26     3.2%
좌3동              13     1.6%
좌4동              13     1.6%
인문학도서관       13     1.6%
우3동              13     1.6%
반여4동            13     1.6%
반송1동            13     1.6%
반송2동            13     1.6%
재송1동            13     1.6%
Other values (17)  216    26.8%

부서구분명 (department type name)
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

Category  Count
본청      460
읍면동    242
사업소    65
직속기관  26
외청      13

Length

Max length: 4
Median length: 2
Mean length: 2.4454094
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 본청
2nd row: 본청
3rd row: 본청
4th row: 본청
5th row: 본청

Common Values

Value     Count  Frequency (%)
본청      460    57.1%
읍면동    242    30.0%
사업소    65     8.1%
직속기관  26     3.2%
외청      13     1.6%

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the common values; counts as in the Common Values table]

실국명 (bureau name)
Categorical

HIGH CORRELATION 

Distinct: 20
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

Category           Count
동사무소           242
주민생활지원국     73
일자리산업국       70
사업소             65
행정관리국         65
Other values (15)  291

Length

Max length: 8
Median length: 7
Mean length: 4.8548387
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 교통건설국
2nd row: 교통건설국
3rd row: 교통건설국
4th row: 미래도시국
5th row: 행정지원국

Common Values

Value              Count  Frequency (%)
동사무소           242    30.0%
주민생활지원국     73     9.1%
일자리산업국       70     8.7%
사업소             65     8.1%
행정관리국         65     8.1%
안전도시국         57     7.1%
관광경제국         56     6.9%
주민복지국         44     5.5%
보건소             26     3.2%
행정지원국         21     2.6%
Other values (10)  87     10.8%

Length

Histogram of lengths of the category

부서명 (department name)
Text

Distinct: 67
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Memory size: 6.4 KiB

Length

Max length: 9
Median length: 8
Mean length: 4.7109181
Min length: 3

Characters and Unicode

Total characters: 3797
Distinct characters: 103
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
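
As a minimal sketch of how such a category breakdown can be obtained, Python's standard `unicodedata` module returns the general category of each code point: `Lo` corresponds to "Other Letter" and `Nd` to "Decimal Number". The helper `category_counts` and the short sample list are illustrative only; the values are department names that appear in this report, not the whole column.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in values:
        for ch in text:
            counts[unicodedata.category(ch)] += 1
    return counts


# A handful of 부서명 values taken from the sample rows of this report.
sample = ["건설과", "건축과", "토지정보과", "안전총괄과", "행정지원과", "세무2과", "우3동"]
counts = category_counts(sample)
print(counts["Lo"], counts["Nd"])
```

Hangul syllables fall under `Lo`, while the ASCII digits embedded in names such as 세무2과 fall under `Nd`, which is consistent with the two categories the report lists.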

Unique

Unique: 2
Unique (%): 0.2%

Sample

1st row: 건설과
2nd row: 건축과
3rd row: 토지정보과
4th row: 안전총괄과
5th row: 행정지원과

Value              Count  Frequency (%)
관광문화과         33     4.1%
교통행정과         23     2.9%
청소행정과         22     2.7%
경제진흥과         22     2.7%
늘푸른과           22     2.7%
재무과             15     1.9%
민원여권과         13     1.6%
건설과             13     1.6%
우3동              13     1.6%
재송1동            13     1.6%
Other values (57)  617    76.6%

Most occurring characters

Value              Count  Frequency (%)
[missing]          429    11.3%
[missing]          242    6.4%
[missing]          125    3.3%
[missing]          111    2.9%
[missing]          99     2.6%
2                  91     2.4%
1                  91     2.4%
[missing]          88     2.3%
[missing]          86     2.3%
[missing]          78     2.1%
Other values (93)  2357   62.1%

Most occurring categories

Value           Count  Frequency (%)
Other Letter    3542   93.3%
Decimal Number  255    6.7%

Most frequent character per category

Other Letter

Value              Count  Frequency (%)
[missing]          429    12.1%
[missing]          242    6.8%
[missing]          125    3.5%
[missing]          111    3.1%
[missing]          99     2.8%
[missing]          88     2.5%
[missing]          86     2.4%
[missing]          78     2.2%
[missing]          78     2.2%
[missing]          73     2.1%
Other values (89)  2133   60.2%

Decimal Number

Value  Count  Frequency (%)
2      91     35.7%
1      91     35.7%
3      47     18.4%
4      26     10.2%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  3542   93.3%
Common  255    6.7%

Most frequent character per script

Hangul

Value              Count  Frequency (%)
[missing]          429    12.1%
[missing]          242    6.8%
[missing]          125    3.5%
[missing]          111    3.1%
[missing]          99     2.8%
[missing]          88     2.5%
[missing]          86     2.4%
[missing]          78     2.2%
[missing]          78     2.2%
[missing]          73     2.1%
Other values (89)  2133   60.2%

Common

Value  Count  Frequency (%)
2      91     35.7%
1      91     35.7%
3      47     18.4%
4      26     10.2%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  3542   93.3%
ASCII   255    6.7%

Most frequent character per block

Hangul

Value              Count  Frequency (%)
[missing]          429    12.1%
[missing]          242    6.8%
[missing]          125    3.5%
[missing]          111    3.1%
[missing]          99     2.8%
[missing]          88     2.5%
[missing]          86     2.4%
[missing]          78     2.2%
[missing]          78     2.2%
[missing]          73     2.1%
Other values (89)  2133   60.2%

ASCII

Value  Count  Frequency (%)
2      91     35.7%
1      91     35.7%
3      47     18.4%
4      26     10.2%

Interactions

[Figure: interactions plot]

Correlations

            회계연도  관서명  부서구분명  실국명  부서명
회계연도    1.000     0.000   0.000       0.000   0.000
관서명      0.000     1.000   1.000       0.865   1.000
부서구분명  0.000     1.000   1.000       1.000   1.000
실국명      0.000     0.865   1.000       1.000   0.995
부서명      0.000     1.000   1.000       0.995   1.000

            부서구분명  실국명  관서명
부서구분명  1.000       0.991   0.986
실국명      0.991       1.000   0.427
관서명      0.986       0.427   1.000

            회계연도  관서명  부서구분명  실국명
회계연도    1.000     0.000   0.000       0.000
관서명      0.000     1.000   0.986       0.427
부서구분명  0.000     0.986   1.000       0.991
실국명      0.000     0.427   0.991       1.000
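
The report does not state which association measure each matrix uses. As one common choice for pairs of categorical columns, the sketch below computes plain (uncorrected) Cramér's V from a contingency table; this is an assumption for illustration and may differ from what the profiler computed, for example through a bias correction. The function name `cramers_v` and the toy inputs are hypothetical.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Pearson chi-squared statistic over every cell of the contingency table.
    chi2 = 0.0
    for a, row_n in row_totals.items():
        for b, col_n in col_totals.items():
            expected = row_n * col_n / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    return math.sqrt(chi2 / (n * k))


# Toy case 1: each 관서명 value maps to exactly one 부서구분명 value.
office = ["본청", "본청", "보건소", "보건소", "중1동", "중1동"]
dept_type = ["본청", "본청", "직속기관", "직속기관", "읍면동", "읍면동"]
v_dependent = cramers_v(office, dept_type)

# Toy case 2: every year has the same mix of department types.
year = [2008, 2008, 2009, 2009]
dept_type_by_year = ["본청", "읍면동", "본청", "읍면동"]
v_independent = cramers_v(year, dept_type_by_year)

print(v_dependent, v_independent)
```

The first toy case gives a value of 1 because one column fully determines the other, which is the pattern behind the high-correlation alerts; the second gives 0, matching the 0.000 entries for 회계연도.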

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

     회계연도  관서명            부서구분명  실국명          부서명
0    2020      본청              본청        교통건설국      건설과
1    2020      본청              본청        교통건설국      건축과
2    2020      본청              본청        교통건설국      토지정보과
3    2020      본청              본청        미래도시국      안전총괄과
4    2020      본청              본청        행정지원국      행정지원과
5    2020      본청              본청        행정지원국      세무2과
6    2020      본청              본청        관광경제국      자원순환과
7    2020      본청              본청        주민생활지원국  행복나눔과
8    2020      본청              본청        안전도시국      도시디자인과
9    2020      중1동             읍면동      동사무소        중1동

Last rows

     회계연도  관서명            부서구분명  실국명          부서명
796  2008      본청              본청        기획감사실      기획감사실
797  2008      본청              본청        관광경제국      늘푸른과
798  2008      본청              본청        행정관리국      세무1과
799  2008      반여1동           읍면동      동사무소        반여1동
800  2008      반여도서관        사업소      사업소          반여도서관
801  2008      본청              본청        안전도시국      재난안전과
802  2008      재송어린이도서관  사업소      사업소          재송어린이도서관
803  2008      반여3동           읍면동      동사무소        반여3동
804  2008      재송2동           읍면동      동사무소        재송2동
805  2008      본청              본청        관광경제국      청소행정과

Duplicate rows

Most frequently occurring

   회계연도  관서명  부서구분명  실국명          부서명      # duplicates
0  2008      본청    본청        일자리산업국    관광문화과  2
1  2008      본청    본청        행정관리국      재무과      2
2  2009      본청    본청        일자리산업국    관광문화과  2
3  2009      본청    본청        행정관리국      재무과      2
4  2010      본청    본청        일자리산업국    관광문화과  2
5  2010      본청    본청        주민생활지원국  청소행정과  2
6  2011      본청    본청        일자리산업국    관광문화과  2
7  2011      본청    본청        주민생활지원국  청소행정과  2
8  2012      본청    본청        일자리산업국    관광문화과  2
9  2012      본청    본청        주민생활지원국  청소행정과  2