Overview

Dataset statistics

Number of variables: 7
Number of observations: 3469
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 200.0 KiB
Average record size in memory: 59.0 B
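As a quick consistency check, the average record size can be recovered from the total memory figure and the number of observations. The sketch below assumes 1 KiB = 1024 bytes and uses the rounded 200.0 KiB shown above, so it only reproduces the value to the displayed precision. The function name is illustrative and not part of the report.

```python
def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Average bytes per row, assuming 1 KiB = 1024 bytes."""
    return total_kib * 1024 / n_rows


# Figures taken from the dataset statistics above.
avg_bytes = average_record_size_bytes(total_kib=200.0, n_rows=3469)
print(f"{avg_bytes:.1f} B")
```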

Variable types

Categorical: 5
Numeric: 2

Dataset

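Description: 부산광역시_부산시인터넷지방세청(사이버지방세청)_지방세등납부현황_20200630 (Busan Metropolitan City, Busan Internet Local Tax Office (Cyber Local Tax Office), payment status of local taxes and other charges, 20200630)
Author: 부산광역시 (Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15061359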

Alerts

건수 is highly overall correlated with 금액 (High correlation)
금액 is highly overall correlated with 건수 (High correlation)

Reproduction

Analysis started: 2024-03-13 13:19:27.735208
Analysis finished: 2024-03-13 13:19:29.162078
Duration: 1.43 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
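The reported duration is simply the difference between the two timestamps. A minimal sketch of that arithmetic, assuming both timestamps are in the same time zone (the report does not state one):

```python
from datetime import datetime

# Timestamp layout used in the "Analysis started/finished" lines.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-03-13 13:19:27.735208", FMT)
finished = datetime.strptime("2024-03-13 13:19:29.162078", FMT)

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```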

Variables

기관명 (institution name)
Categorical

Distinct: 18
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 27.2 KiB

기타  269
사하구  200
부산진구  199
수영구  198
북구  198
Other values (13)  2405

Length

Max length: 4
Median length: 3
Mean length: 2.720957
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 시청
2nd row: 시청
3rd row: 시청
4th row: 시청
5th row: 시청

Common Values

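Value  Count  Frequency (%)
기타 (other)  269  7.8%
사하구 (Saha-gu)  200  5.8%
부산진구 (Busanjin-gu)  199  5.7%
수영구 (Suyeong-gu)  198  5.7%
북구 (Buk-gu)  198  5.7%
사상구 (Sasang-gu)  196  5.7%
영도구 (Yeongdo-gu)  195  5.6%
해운대구 (Haeundae-gu)  194  5.6%
연제구 (Yeonje-gu)  194  5.6%
동래구 (Dongnae-gu)  194  5.6%
Other values (8)  1432  41.3%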
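The frequency column is each count divided by the 3469 observations, and the "Other values" row is whatever remains after the ten listed values. A small sketch of that arithmetic using the counts from the table above; the variable and function names are illustrative:

```python
N_ROWS = 3469  # number of observations reported in the overview

# Ten most frequent 기관명 values, copied from the Common Values table.
top_counts = {
    "기타": 269, "사하구": 200, "부산진구": 199, "수영구": 198, "북구": 198,
    "사상구": 196, "영도구": 195, "해운대구": 194, "연제구": 194, "동래구": 194,
}


def frequency_pct(count: int, total: int) -> float:
    """Share of rows as a percentage, rounded to one decimal place."""
    return round(100 * count / total, 1)


# Rows not covered by the ten listed values.
other_count = N_ROWS - sum(top_counts.values())

print(frequency_pct(top_counts["기타"], N_ROWS))
print(other_count, frequency_pct(other_count, N_ROWS))
```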

Length

Histogram of lengths of the category

납부년도 (payment year)
Categorical

Distinct: 4
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 27.2 KiB

2019  989
2017  867
2018  845
2020  768

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2017
2nd row: 2017
3rd row: 2017
4th row: 2017
5th row: 2017

Common Values

Value  Count  Frequency (%)
2019  989  28.5%
2017  867  25.0%
2018  845  24.4%
2020  768  22.1%

Length

Histogram of lengths of the category

Common Values (Plot)


구분 (charge category)
Categorical

Distinct: 8
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 27.2 KiB

지방세  1112
환경개선부담금  724
표준세외수입  583
주정차위반과태료  512
교통유발부담금  256
Other values (3)  282

Length

Max length: 11
Median length: 9
Mean length: 5.8345344
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 버스전용차로위반과태료
2nd row: 버스전용차로위반과태료
3rd row: 버스전용차로위반과태료
4th row: 버스전용차로위반과태료
5th row: 버스전용차로위반과태료

Common Values

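Value  Count  Frequency (%)
지방세 (local tax)  1112  32.1%
환경개선부담금 (environmental improvement charge)  724  20.9%
표준세외수입 (standard non-tax revenue)  583  16.8%
주정차위반과태료 (parking and stopping violation fine)  512  14.8%
교통유발부담금 (traffic inducement charge)  256  7.4%
주거지전용주차요금 (residential-only parking fee)  196  5.7%
상하수도요금 (water and sewerage fee)  52  1.5%
버스전용차로위반과태료 (bus-only lane violation fine)  34  1.0%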

Length

Histogram of lengths of the category

Common Values (Plot)


기분 (assessment type)
Categorical

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 27.2 KiB

정기분  1598
수시분  1338
자납분  533

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수시분
2nd row: 수시분
3rd row: 수시분
4th row: 수시분
5th row: 수시분

Common Values

Value  Count  Frequency (%)
정기분 (regular assessment)  1598  46.1%
수시분 (occasional assessment)  1338  38.6%
자납분 (self-assessed payment)  533  15.4%

Length

Histogram of lengths of the category

Common Values (Plot)


수납매체 (payment channel)
Categorical

Distinct: 9
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 27.2 KiB

위택스(금결원)  833
신용카드  807
가상계좌  798
휴대폰소액결제  478
카카오QR납부  169
Other values (4)  384

Length

Max length: 8
Median length: 7
Mean length: 5.6656097
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 위택스(금결원)
2nd row: 신용카드
3rd row: 가상계좌
4th row: 휴대폰소액결제
5th row: 편의점 계좌이체

Common Values

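Value  Count  Frequency (%)
위택스(금결원) (Wetax, via KFTC)  833  24.0%
신용카드 (credit card)  807  23.3%
가상계좌 (virtual account)  798  23.0%
휴대폰소액결제 (mobile phone micropayment)  478  13.8%
카카오QR납부 (Kakao QR payment)  169  4.9%
편의점 계좌이체 (convenience store account transfer)  121  3.5%
OCR  112  3.2%
부산은행BPR (Busan Bank BPR)  87  2.5%
충당 (offset)  64  1.8%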

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
위택스(금결원  833  23.2%
신용카드  807  22.5%
가상계좌  798  22.2%
휴대폰소액결제  478  13.3%
카카오qr납부  169  4.7%
편의점  121  3.4%
계좌이체  121  3.4%
ocr  112  3.1%
부산은행bpr  87  2.4%
충당  64  1.8%

These counts appear to be per lowercased, whitespace-separated token rather than per value: 편의점 계좌이체 is counted as two entries, so the percentages are relative to 3590 tokens instead of 3469 rows.
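The table above differs from the Common Values table for 수납매체 in a way that is consistent with token-level counting. The sketch below is an assumption about how such counts could arise (lowercasing, splitting on whitespace, and trimming surrounding punctuation); it is not taken from the report or from the profiling library's source, and the function name is hypothetical.

```python
import string
from collections import Counter

# Value counts copied from the Common Values table for 수납매체.
value_counts = {
    "위택스(금결원)": 833, "신용카드": 807, "가상계좌": 798,
    "휴대폰소액결제": 478, "카카오QR납부": 169, "편의점 계좌이체": 121,
    "OCR": 112, "부산은행BPR": 87, "충당": 64,
}


def token_counts(counts: dict[str, int]) -> Counter:
    """Count lowercased whitespace tokens, trimming edge punctuation (assumed rule)."""
    tokens: Counter = Counter()
    for value, count in counts.items():
        for token in value.lower().split():
            tokens[token.strip(string.punctuation)] += count
    return tokens


tokens = token_counts(value_counts)
total_tokens = sum(tokens.values())

for token, count in tokens.most_common():
    print(token, count, f"{100 * count / total_tokens:.1f}%")
```

Under these assumptions the token total is 3469 + 121 = 3590, which is the denominator that matches the percentages in the table.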

건수 (number of cases)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 2042
Distinct (%): 58.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10870.773
Minimum: 1
Maximum: 412469
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 30.6 KiB

Quantile statistics

Minimum: 1
5th percentile: 1
Q1: 30
Median: 602
Q3: 5414
95th percentile: 52061.2
Maximum: 412469
Range: 412468
Interquartile range (IQR): 5384
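The range and interquartile range follow directly from the other quantile figures. A minimal sketch using the 건수 values listed above; the helper name is illustrative:

```python
def spread(minimum: float, q1: float, q3: float, maximum: float) -> dict[str, float]:
    """Range is max minus min; IQR is Q3 minus Q1."""
    return {"range": maximum - minimum, "iqr": q3 - q1}


# Quantile statistics reported for 건수.
count_spread = spread(minimum=1, q1=30, q3=5414, maximum=412469)
print(count_spread)
```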

Descriptive statistics

Standard deviation: 37726.516
Coefficient of variation (CV): 3.4704539
Kurtosis: 42.29316
Mean: 10870.773
Median Absolute Deviation (MAD): 600
Skewness: 6.0211077
Sum: 37710711
Variance: 1.42329 × 10^9
Monotonicity: Not monotonic
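Several of these descriptive statistics are tied together by simple identities: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. The sketch below checks the reported 건수 figures against each other, to the precision shown in the report:

```python
N_ROWS = 3469          # number of observations
TOTAL = 37710711       # reported sum of 건수
STD = 37726.516        # reported standard deviation

mean = TOTAL / N_ROWS  # mean = sum / n
cv = STD / mean        # coefficient of variation = std / mean
variance = STD ** 2    # variance = std squared

print(f"mean     = {mean:.3f}")
print(f"CV       = {cv:.7f}")
print(f"variance = {variance:.5e}")
```

The variance comes out on the order of 10^9, which is how the exponent in the variance line should be read.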
Histogram with fixed size bins (bins=50)

Common Values

Value  Count  Frequency (%)
1  193  5.6%
2  116  3.3%
3  63  1.8%
4  59  1.7%
6  37  1.1%
5  36  1.0%
7  34  1.0%
9  31  0.9%
8  29  0.8%
15  27  0.8%
Other values (2032)  2844  82.0%

Minimum 10 values

Value  Count  Frequency (%)
1  193  5.6%
2  116  3.3%
3  63  1.8%
4  59  1.7%
5  36  1.0%
6  37  1.1%
7  34  1.0%
8  29  0.8%
9  31  0.9%
10  19  0.5%

Maximum 10 values

Value  Count  Frequency (%)
412469  1  < 0.1%
405249  1  < 0.1%
403771  1  < 0.1%
397229  1  < 0.1%
396719  1  < 0.1%
374137  1  < 0.1%
371600  1  < 0.1%
313436  1  < 0.1%
311625  1  < 0.1%
302435  1  < 0.1%

금액 (amount)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 3378
Distinct (%): 97.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.790668 × 10^9
Minimum: 100
Maximum: 5.31206 × 10^11
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 30.6 KiB

Quantile statistics

Minimum: 100
5th percentile: 60510
Q1: 2993950
Median: 81292380
Q3: 6.7181125 × 10^8
95th percentile: 1.9161228 × 10^10
Maximum: 5.31206 × 10^11
Range: 5.31206 × 10^11
Interquartile range (IQR): 6.688173 × 10^8

Descriptive statistics

Standard deviation: 2.4667287 × 10^10
Coefficient of variation (CV): 5.1490286
Kurtosis: 154.2089
Mean: 4.790668 × 10^9
Median Absolute Deviation (MAD): 81185590
Skewness: 10.884505
Sum: 1.6618827 × 10^13
Variance: 6.0847504 × 10^20
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common Values

Value  Count  Frequency (%)
64000  11  0.3%
32000  9  0.3%
80000  8  0.2%
12000  7  0.2%
192000  6  0.2%
16000  5  0.1%
48410  4  0.1%
160000  4  0.1%
256000  4  0.1%
168000  3  0.1%
Other values (3368)  3408  98.2%

Minimum 10 values

Value  Count  Frequency (%)
100  1  < 0.1%
2810  1  < 0.1%
3490  1  < 0.1%
3940  1  < 0.1%
4580  1  < 0.1%
4650  1  < 0.1%
5140  1  < 0.1%
5280  1  < 0.1%
6020  1  < 0.1%
6050  1  < 0.1%

Maximum 10 values

Value  Count  Frequency (%)
531206000000  1  < 0.1%
452623000000  1  < 0.1%
347688000000  1  < 0.1%
321068000000  1  < 0.1%
316560000000  1  < 0.1%
304113000000  1  < 0.1%
285414000000  1  < 0.1%
275688000000  1  < 0.1%
273238000000  1  < 0.1%
271294000000  1  < 0.1%

Interactions

[Figures: interaction plots for the numeric variables 건수 and 금액]

Correlations

          기관명  납부년도  구분   기분   수납매체  건수   금액
기관명    1.000  0.108  0.610  0.422  0.508  0.163  0.207
납부년도  0.108  1.000  0.000  0.000  0.203  0.049  0.000
구분      0.610  0.000  1.000  0.492  0.395  0.243  0.126
기분      0.422  0.000  0.492  1.000  0.386  0.354  0.364
수납매체  0.508  0.203  0.395  0.386  1.000  0.248  0.209
건수      0.163  0.049  0.243  0.354  0.248  1.000  0.702
금액      0.207  0.000  0.126  0.364  0.209  0.702  1.000

          수납매체  기관명  납부년도  기분   구분
수납매체  1.000  0.192  0.131  0.184  0.206
기관명    0.192  1.000  0.059  0.214  0.309
납부년도  0.131  0.059  1.000  0.000  0.000
기분      0.184  0.214  0.000  1.000  0.359
구분      0.206  0.309  0.000  0.359  1.000

          건수   금액   기관명  납부년도  구분   기분   수납매체
건수      1.000  0.936  0.055  0.031  0.121  0.167  0.082
금액      0.936  1.000  0.070  0.000  0.062  0.172  0.068
기관명    0.055  0.070  1.000  0.059  0.309  0.214  0.192
납부년도  0.031  0.000  0.059  1.000  0.000  0.000  0.131
구분      0.121  0.062  0.309  0.000  1.000  0.359  0.206
기분      0.167  0.172  0.214  0.000  0.359  1.000  0.184
수납매체  0.082  0.068  0.192  0.131  0.206  0.184  1.000
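The two "High correlation" alerts are consistent with scanning a correlation matrix for off-diagonal entries above a cut-off. The sketch below applies that idea to the last matrix above (the one in which 건수 and 금액 have a coefficient of 0.936). The 0.5 threshold and the function name are assumptions for illustration; the report itself does not state which matrix or cut-off produced the alerts.

```python
THRESHOLD = 0.5  # assumed cut-off, not stated in the report

names = ["건수", "금액", "기관명", "납부년도", "구분", "기분", "수납매체"]
matrix = [
    [1.000, 0.936, 0.055, 0.031, 0.121, 0.167, 0.082],
    [0.936, 1.000, 0.070, 0.000, 0.062, 0.172, 0.068],
    [0.055, 0.070, 1.000, 0.059, 0.309, 0.214, 0.192],
    [0.031, 0.000, 0.059, 1.000, 0.000, 0.000, 0.131],
    [0.121, 0.062, 0.309, 0.000, 1.000, 0.359, 0.206],
    [0.167, 0.172, 0.214, 0.000, 0.359, 1.000, 0.184],
    [0.082, 0.068, 0.192, 0.131, 0.206, 0.184, 1.000],
]


def highly_correlated_pairs(names, matrix, threshold):
    """Return (a, b, coefficient) for each pair above the threshold, upper triangle only."""
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            if abs(matrix[i][j]) > threshold:
                pairs.append((names[i], names[j], matrix[i][j]))
    return pairs


pairs = highly_correlated_pairs(names, matrix, THRESHOLD)
print(pairs)
```

With these inputs only the 건수/금액 pair exceeds the assumed threshold, which matches the pair named in the Alerts section.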

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Row  기관명  납부년도  구분  기분  수납매체  건수금액
0  시청  2017  버스전용차로위반과태료  수시분  위택스(금결원)  10146525339490
1  시청  2017  버스전용차로위반과태료  수시분  신용카드  2832157103600
2  시청  2017  버스전용차로위반과태료  수시분  가상계좌  222661200327140
3  시청  2017  버스전용차로위반과태료  수시분  휴대폰소액결제  301553450
4  시청  2017  버스전용차로위반과태료  수시분  편의점 계좌이체  6301500
5  시청  2017  버스전용차로위반과태료  정기분  위택스(금결원)  2494124730000
6  시청  2017  버스전용차로위반과태료  정기분  신용카드  60930470000
7  시청  2017  버스전용차로위반과태료  정기분  가상계좌  259541299370000
8  시청  2017  버스전용차로위반과태료  정기분  휴대폰소액결제  4200000
9  시청  2017  상하수도요금  수시분  위택스(금결원)  520730305225588

Row  기관명  납부년도  구분  기분  수납매체  건수금액
3459  기장군  2020  표준세외수입  정기분  가상계좌  600213150160
3460  기장군  2020  환경개선부담금  수시분  위택스(금결원)  1864584240
3461  기장군  2020  환경개선부담금  수시분  신용카드  1493558390
3462  기장군  2020  환경개선부담금  수시분  가상계좌  47811666980
3463  기장군  2020  환경개선부담금  자납분  위택스(금결원)  1115193910
3464  기장군  2020  환경개선부담금  자납분  신용카드  13550880
3465  기장군  2020  환경개선부담금  자납분  가상계좌  855268520
3466  기장군  2020  환경개선부담금  정기분  위택스(금결원)  4004107395820
3467  기장군  2020  환경개선부담금  정기분  신용카드  41910255180
3468  기장군  2020  환경개선부담금  정기분  가상계좌  4185110647490