Overview

Dataset statistics

Number of variables7
Number of observations102
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.2 KiB
Average record size in memory62.3 B

Variable types

DateTime1
Text1
Numeric2
Categorical3

Dataset

Description부산광역시_통합자금관리시스템예상수입액수기등록_20130611
Author부산광역시
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15121626

Alerts

자치단체코드 has constant value ""Constant
회계년도 has constant value ""Constant
회계코드 is highly imbalanced (77.0%)Imbalance
일련번호 has unique valuesUnique

Reproduction

Analysis started2024-04-21 10:42:24.499205
Analysis finished2024-04-21 10:42:25.766309
Duration1.27 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct35
Distinct (%)34.3%
Missing0
Missing (%)0.0%
Memory size944.0 B
Minimum2013-04-01 00:00:00
Maximum2013-06-11 00:00:00
2024-04-21T19:42:25.866639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T19:42:26.083338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)

비고
Text

Distinct91
Distinct (%)89.2%
Missing0
Missing (%)0.0%
Memory size944.0 B
2024-04-21T19:42:26.751509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length8.5980392
Min length3

Characters and Unicode

Total characters877
Distinct characters215
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)81.4%

Sample

1st row지역자율형사회서비스
2nd row취득세 감면분 보전
3rd row생계급여
4th row주거급여
5th row장애인연금
ValueCountFrequency (%)
지방교부세 5
 
3.4%
긴급복지 2
 
1.4%
자살예방 2
 
1.4%
장애아동가족지원 2
 
1.4%
지원 2
 
1.4%
사회서비스 2
 
1.4%
지역자율형 2
 
1.4%
영유아보육료 2
 
1.4%
자동차세(주행분 2
 
1.4%
운영 2
 
1.4%
Other values (117) 122
84.1%
2024-04-21T19:42:28.013427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45
 
5.1%
44
 
5.0%
28
 
3.2%
23
 
2.6%
19
 
2.2%
17
 
1.9%
16
 
1.8%
16
 
1.8%
14
 
1.6%
13
 
1.5%
Other values (205) 642
73.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 800
91.2%
Space Separator 45
 
5.1%
Uppercase Letter 11
 
1.3%
Close Punctuation 6
 
0.7%
Open Punctuation 6
 
0.7%
Decimal Number 4
 
0.5%
Dash Punctuation 3
 
0.3%
Other Punctuation 1
 
0.1%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
5.5%
28
 
3.5%
23
 
2.9%
19
 
2.4%
17
 
2.1%
16
 
2.0%
16
 
2.0%
14
 
1.8%
13
 
1.6%
13
 
1.6%
Other values (188) 597
74.6%
Uppercase Letter
ValueCountFrequency (%)
T 3
27.3%
C 2
18.2%
S 2
18.2%
M 1
 
9.1%
I 1
 
9.1%
A 1
 
9.1%
V 1
 
9.1%
Decimal Number
ValueCountFrequency (%)
2 1
25.0%
3 1
25.0%
5 1
25.0%
1 1
25.0%
Space Separator
ValueCountFrequency (%)
45
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 800
91.2%
Common 66
 
7.5%
Latin 11
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
5.5%
28
 
3.5%
23
 
2.9%
19
 
2.4%
17
 
2.1%
16
 
2.0%
16
 
2.0%
14
 
1.8%
13
 
1.6%
13
 
1.6%
Other values (188) 597
74.6%
Common
ValueCountFrequency (%)
45
68.2%
) 6
 
9.1%
( 6
 
9.1%
- 3
 
4.5%
, 1
 
1.5%
2 1
 
1.5%
3 1
 
1.5%
~ 1
 
1.5%
5 1
 
1.5%
1 1
 
1.5%
Latin
ValueCountFrequency (%)
T 3
27.3%
C 2
18.2%
S 2
18.2%
M 1
 
9.1%
I 1
 
9.1%
A 1
 
9.1%
V 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 800
91.2%
ASCII 77
 
8.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
45
58.4%
) 6
 
7.8%
( 6
 
7.8%
T 3
 
3.9%
- 3
 
3.9%
C 2
 
2.6%
S 2
 
2.6%
, 1
 
1.3%
M 1
 
1.3%
I 1
 
1.3%
Other values (7) 7
 
9.1%
Hangul
ValueCountFrequency (%)
44
 
5.5%
28
 
3.5%
23
 
2.9%
19
 
2.4%
17
 
2.1%
16
 
2.0%
16
 
2.0%
14
 
1.8%
13
 
1.6%
13
 
1.6%
Other values (188) 597
74.6%

예상수입액
Real number (ℝ)

Distinct97
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.317598 × 109
Minimum1.25 × 108
Maximum8.8376 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-21T19:42:28.255946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.25 × 108
5-th percentile2.0025 × 108
Q16.305 × 108
median1.8685 × 109
Q31.072425 × 1010
95-th percentile3.99866 × 1010
Maximum8.8376 × 1010
Range8.8251 × 1010
Interquartile range (IQR)1.009375 × 1010

Descriptive statistics

Standard deviation1.6456472 × 1010
Coefficient of variation (CV)1.7661711
Kurtosis9.7986259
Mean9.317598 × 109
Median Absolute Deviation (MAD)1.553 × 109
Skewness2.8865982
Sum9.50395 × 1011
Variance2.7081548 × 1020
MonotonicityNot monotonic
2024-04-21T19:42:28.527093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
22975000000 2
 
2.0%
15878000000 2
 
2.0%
1399000000 2
 
2.0%
438000000 2
 
2.0%
272000000 2
 
2.0%
3390000000 1
 
1.0%
3578000000 1
 
1.0%
1819000000 1
 
1.0%
12146000000 1
 
1.0%
154000000 1
 
1.0%
Other values (87) 87
85.3%
ValueCountFrequency (%)
125000000 1
1.0%
139000000 1
1.0%
154000000 1
1.0%
186000000 1
1.0%
193000000 1
1.0%
200000000 1
1.0%
205000000 1
1.0%
250000000 1
1.0%
259000000 1
1.0%
272000000 2
2.0%
ValueCountFrequency (%)
88376000000 1
1.0%
88367000000 1
1.0%
54661000000 1
1.0%
53020000000 1
1.0%
41355000000 1
1.0%
40300000000 1
1.0%
34032000000 1
1.0%
32715000000 1
1.0%
30400000000 1
1.0%
30000000000 1
1.0%

일련번호
Real number (ℝ)

UNIQUE 

Distinct102
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean51.5
Minimum1
Maximum102
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-21T19:42:28.795914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.05
Q126.25
median51.5
Q376.75
95-th percentile96.95
Maximum102
Range101
Interquartile range (IQR)50.5

Descriptive statistics

Standard deviation29.588849
Coefficient of variation (CV)0.57454076
Kurtosis-1.2
Mean51.5
Median Absolute Deviation (MAD)25.5
Skewness0
Sum5253
Variance875.5
MonotonicityNot monotonic
2024-04-21T19:42:29.057901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9 1
 
1.0%
14 1
 
1.0%
37 1
 
1.0%
36 1
 
1.0%
35 1
 
1.0%
34 1
 
1.0%
33 1
 
1.0%
32 1
 
1.0%
31 1
 
1.0%
30 1
 
1.0%
Other values (92) 92
90.2%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
102 1
1.0%
101 1
1.0%
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size944.0 B
6260000
102 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row6260000
2nd row6260000
3rd row6260000
4th row6260000
5th row6260000

Common Values

ValueCountFrequency (%)
6260000 102
100.0%

Length

2024-04-21T19:42:29.291739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T19:42:29.461264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6260000 102
100.0%

회계년도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size944.0 B
2013
102 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2013
2nd row2013
3rd row2013
4th row2013
5th row2013

Common Values

ValueCountFrequency (%)
2013 102
100.0%

Length

2024-04-21T19:42:29.631769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T19:42:29.790745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2013 102
100.0%

회계코드
Categorical

IMBALANCE 

Distinct5
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size944.0 B
100
94 
235
 
4
260
 
2
220
 
1
230
 
1

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row100
2nd row100
3rd row100
4th row100
5th row100

Common Values

ValueCountFrequency (%)
100 94
92.2%
235 4
 
3.9%
260 2
 
2.0%
220 1
 
1.0%
230 1
 
1.0%

Length

2024-04-21T19:42:29.992298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T19:42:30.173472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
100 94
92.2%
235 4
 
3.9%
260 2
 
2.0%
220 1
 
1.0%
230 1
 
1.0%

Interactions

2024-04-21T19:42:25.161976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T19:42:24.892823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T19:42:25.296520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T19:42:25.022037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T19:42:30.296271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준일자비고예상수입액일련번호회계코드
기준일자1.0000.0000.0000.9800.000
비고0.0001.0000.0000.7360.910
예상수입액0.0000.0001.0000.0870.532
일련번호0.9800.7360.0871.0000.421
회계코드0.0000.9100.5320.4211.000
2024-04-21T19:42:30.462734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
예상수입액일련번호회계코드
예상수입액1.000-0.3590.357
일련번호-0.3591.0000.180
회계코드0.3570.1801.000

Missing values

2024-04-21T19:42:25.484725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T19:42:25.689145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준일자비고예상수입액일련번호자치단체코드회계년도회계코드
02013-04-18지역자율형사회서비스2099000000962600002013100
12013-04-05취득세 감면분 보전40300000000762600002013100
22013-05-14생계급여327150000004462600002013100
32013-05-14주거급여75750000004562600002013100
42013-05-14장애인연금24450000004662600002013100
52013-05-14전통시장26290000004762600002013100
62013-05-14지역자율형 사회서비스13990000004862600002013100
72013-05-14재해위험지역 정비10000000004962600002013100
82013-05-23연안정비사업37800000006562600002013100
92013-05-21어린이안전영상정보인프라구축15500000006662600002013100
기준일자비고예상수입액일련번호자치단체코드회계년도회계코드
922013-05-28친환경농자재(유기질비료, 토양개량제)6820000008062600002013100
932013-05-28관광특구 활성화 지원사업8000000008162600002013100
942013-06-03부산교통공단 채무상환 지원18570000009062600002013235
952013-06-03부산-김해 경전철 국고보조금8670000009162600002013100
962013-06-03아이돌봄지원사업1390000009262600002013100
972013-06-04천연가스자동차 보급9000000009362600002013100
982013-06-11장안-임랑국지도건설13000000009962600002013100
992013-06-11지역자율형 사회서비스 투자사업139900000010062600002013100
1002013-06-11장애아동가족지원43800000010162600002013100
1012013-06-11희망키움통장38200000010262600002013100