Overview

Dataset statistics

Number of variables13
Number of observations1933
Missing cells1933
Missing cells (%)7.7%
Duplicate rows75
Duplicate rows (%)3.9%
Total size in memory202.1 KiB
Average record size in memory107.1 B

Variable types

Categorical3
Text9
Unsupported1

Dataset

Description부산광역시 상수도본부 공사예산년간집계정보(신청) 입니다. 공사와 관련된 예산에 대한 신청상태의 년간집계정보 제공(예산종류, 예산년도, 예산과목명 등)
Author부산광역시
URLhttps://www.data.go.kr/data/15083552/fileData.do

Alerts

예산년도 has constant value ""Constant
Dataset has 75 (3.9%) duplicate rowsDuplicates
Unnamed: 12 has 1933 (100.0%) missing valuesMissing
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 12:44:13.122698
Analysis finished2023-12-12 12:44:14.196283
Duration1.07 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

예산종류
Categorical

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
1
1391 
2
541 
3
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 1391
72.0%
2 541
 
28.0%
3 1
 
0.1%

Length

2023-12-12T21:44:14.252407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:44:14.360399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 1391
72.0%
2 541
 
28.0%
3 1
 
0.1%

예산년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2020
1933 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 1933
100.0%

Length

2023-12-12T21:44:14.462352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:44:14.548852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 1933
100.0%
Distinct180
Distinct (%)9.3%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-12T21:44:14.834759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters9665
Distinct characters17
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)1.9%

Sample

1st row11111
2nd row11111
3rd row11111
4th row11111
5th row11111
ValueCountFrequency (%)
22142 172
 
8.9%
11176 133
 
6.9%
22152 109
 
5.6%
22176 91
 
4.7%
12559 82
 
4.2%
12259 50
 
2.6%
11630 49
 
2.5%
12511 37
 
1.9%
11117 35
 
1.8%
11800 33
 
1.7%
Other values (170) 1142
59.1%
2023-12-12T21:44:15.260289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 3382
35.0%
2 2753
28.5%
5 1035
 
10.7%
3 572
 
5.9%
4 513
 
5.3%
6 475
 
4.9%
7 346
 
3.6%
0 271
 
2.8%
9 215
 
2.2%
8 62
 
0.6%
Other values (7) 41
 
0.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 9624
99.6%
Uppercase Letter 41
 
0.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 3382
35.1%
2 2753
28.6%
5 1035
 
10.8%
3 572
 
5.9%
4 513
 
5.3%
6 475
 
4.9%
7 346
 
3.6%
0 271
 
2.8%
9 215
 
2.2%
8 62
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
F 16
39.0%
B 13
31.7%
E 8
19.5%
A 1
 
2.4%
G 1
 
2.4%
H 1
 
2.4%
I 1
 
2.4%

Most occurring scripts

ValueCountFrequency (%)
Common 9624
99.6%
Latin 41
 
0.4%

Most frequent character per script

Common
ValueCountFrequency (%)
1 3382
35.1%
2 2753
28.6%
5 1035
 
10.8%
3 572
 
5.9%
4 513
 
5.3%
6 475
 
4.9%
7 346
 
3.6%
0 271
 
2.8%
9 215
 
2.2%
8 62
 
0.6%
Latin
ValueCountFrequency (%)
F 16
39.0%
B 13
31.7%
E 8
19.5%
A 1
 
2.4%
G 1
 
2.4%
H 1
 
2.4%
I 1
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9665
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 3382
35.0%
2 2753
28.5%
5 1035
 
10.7%
3 572
 
5.9%
4 513
 
5.3%
6 475
 
4.9%
7 346
 
3.6%
0 271
 
2.8%
9 215
 
2.2%
8 62
 
0.6%
Other values (7) 41
 
0.4%
Distinct842
Distinct (%)43.6%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-12T21:44:15.581913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length3.682359
Min length1

Characters and Unicode

Total characters7118
Distinct characters16
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique660 ?
Unique (%)34.1%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-
ValueCountFrequency (%)
204
 
10.6%
6461 17
 
0.9%
6315 16
 
0.8%
6902 13
 
0.7%
0015 12
 
0.6%
6005 12
 
0.6%
6201 12
 
0.6%
6202 12
 
0.6%
6131 12
 
0.6%
0943 12
 
0.6%
Other values (828) 1611
83.3%
2023-12-12T21:44:16.063014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1539
21.6%
1 855
12.0%
6 754
10.6%
2 645
9.1%
3 575
 
8.1%
8 541
 
7.6%
5 481
 
6.8%
4 449
 
6.3%
9 412
 
5.8%
7 298
 
4.2%
Other values (6) 569
 
8.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6549
92.0%
Uppercase Letter 301
 
4.2%
Dash Punctuation 204
 
2.9%
Lowercase Letter 63
 
0.9%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1539
23.5%
1 855
13.1%
6 754
11.5%
2 645
9.8%
3 575
 
8.8%
8 541
 
8.3%
5 481
 
7.3%
4 449
 
6.9%
9 412
 
6.3%
7 298
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
B 188
62.5%
A 60
 
19.9%
C 53
 
17.6%
Dash Punctuation
ValueCountFrequency (%)
- 204
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 63
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6754
94.9%
Latin 364
 
5.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1539
22.8%
1 855
12.7%
6 754
11.2%
2 645
9.5%
3 575
 
8.5%
8 541
 
8.0%
5 481
 
7.1%
4 449
 
6.6%
9 412
 
6.1%
7 298
 
4.4%
Other values (2) 205
 
3.0%
Latin
ValueCountFrequency (%)
B 188
51.6%
c 63
 
17.3%
A 60
 
16.5%
C 53
 
14.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7118
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1539
21.6%
1 855
12.0%
6 754
10.6%
2 645
9.1%
3 575
 
8.1%
8 541
 
7.6%
5 481
 
6.8%
4 449
 
6.3%
9 412
 
5.8%
7 298
 
4.2%
Other values (6) 569
 
8.0%
Distinct116
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-12T21:44:16.273520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length23
Mean length5.819969
Min length2

Characters and Unicode

Total characters11250
Distinct characters135
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)1.3%

Sample

1st row가정용
2nd row가정용
3rd row가정용
4th row가정용
5th row가정용
ValueCountFrequency (%)
시설비 317
 
16.3%
수선유지비 155
 
8.0%
기타수수료수익 133
 
6.8%
자산취득비 102
 
5.2%
보수 58
 
3.0%
변상금및위약금수익 49
 
2.5%
일반재료비 48
 
2.5%
공공운영비 43
 
2.2%
기타복리후생비 40
 
2.1%
감리비 39
 
2.0%
Other values (108) 962
49.4%
2023-12-12T21:44:16.616109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1238
 
11.0%
1218
 
10.8%
491
 
4.4%
402
 
3.6%
359
 
3.2%
348
 
3.1%
338
 
3.0%
300
 
2.7%
239
 
2.1%
222
 
2.0%
Other values (125) 6095
54.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11103
98.7%
Dash Punctuation 128
 
1.1%
Space Separator 13
 
0.1%
Other Punctuation 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1238
 
11.2%
1218
 
11.0%
491
 
4.4%
402
 
3.6%
359
 
3.2%
348
 
3.1%
338
 
3.0%
300
 
2.7%
239
 
2.2%
222
 
2.0%
Other values (122) 5948
53.6%
Dash Punctuation
ValueCountFrequency (%)
- 128
100.0%
Space Separator
ValueCountFrequency (%)
13
100.0%
Other Punctuation
ValueCountFrequency (%)
· 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11103
98.7%
Common 147
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1238
 
11.2%
1218
 
11.0%
491
 
4.4%
402
 
3.6%
359
 
3.2%
348
 
3.1%
338
 
3.0%
300
 
2.7%
239
 
2.2%
222
 
2.0%
Other values (122) 5948
53.6%
Common
ValueCountFrequency (%)
- 128
87.1%
13
 
8.8%
· 6
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11103
98.7%
ASCII 141
 
1.3%
None 6
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1238
 
11.2%
1218
 
11.0%
491
 
4.4%
402
 
3.6%
359
 
3.2%
348
 
3.1%
338
 
3.0%
300
 
2.7%
239
 
2.2%
222
 
2.0%
Other values (122) 5948
53.6%
ASCII
ValueCountFrequency (%)
- 128
90.8%
13
 
9.2%
None
ValueCountFrequency (%)
· 6
100.0%
Distinct741
Distinct (%)38.3%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-12T21:44:16.838423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length31
Mean length10.870667
Min length1

Characters and Unicode

Total characters21013
Distinct characters431
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique597 ?
Unique (%)30.9%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-
ValueCountFrequency (%)
204
 
4.9%
122
 
2.9%
교체 93
 
2.2%
주변 78
 
1.9%
구입 55
 
1.3%
구입(대체 44
 
1.1%
사용료 43
 
1.0%
구경별기본요금 42
 
1.0%
유지관리 40
 
1.0%
상수도관 38
 
0.9%
Other values (1158) 3401
81.8%
2023-12-12T21:44:17.246661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2234
 
10.6%
862
 
4.1%
625
 
3.0%
577
 
2.7%
490
 
2.3%
( 404
 
1.9%
) 404
 
1.9%
395
 
1.9%
377
 
1.8%
355
 
1.7%
Other values (421) 14290
68.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17056
81.2%
Space Separator 2234
 
10.6%
Open Punctuation 404
 
1.9%
Close Punctuation 404
 
1.9%
Decimal Number 373
 
1.8%
Dash Punctuation 214
 
1.0%
Lowercase Letter 181
 
0.9%
Other Punctuation 72
 
0.3%
Uppercase Letter 53
 
0.3%
Math Symbol 20
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
862
 
5.1%
625
 
3.7%
577
 
3.4%
490
 
2.9%
395
 
2.3%
377
 
2.2%
355
 
2.1%
335
 
2.0%
318
 
1.9%
289
 
1.7%
Other values (380) 12433
72.9%
Uppercase Letter
ValueCountFrequency (%)
C 8
15.1%
S 7
13.2%
T 5
9.4%
I 4
 
7.5%
A 4
 
7.5%
E 4
 
7.5%
L 3
 
5.7%
V 3
 
5.7%
P 3
 
5.7%
D 2
 
3.8%
Other values (8) 10
18.9%
Decimal Number
ValueCountFrequency (%)
2 153
41.0%
3 69
18.5%
1 58
 
15.5%
5 58
 
15.5%
4 11
 
2.9%
0 10
 
2.7%
7 7
 
1.9%
8 4
 
1.1%
6 2
 
0.5%
9 1
 
0.3%
Other Punctuation
ValueCountFrequency (%)
, 38
52.8%
· 33
45.8%
. 1
 
1.4%
Lowercase Letter
ValueCountFrequency (%)
m 180
99.4%
e 1
 
0.6%
Math Symbol
ValueCountFrequency (%)
~ 18
90.0%
2
 
10.0%
Control
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
2234
100.0%
Open Punctuation
ValueCountFrequency (%)
( 404
100.0%
Close Punctuation
ValueCountFrequency (%)
) 404
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 214
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17056
81.2%
Common 3723
 
17.7%
Latin 234
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
862
 
5.1%
625
 
3.7%
577
 
3.4%
490
 
2.9%
395
 
2.3%
377
 
2.2%
355
 
2.1%
335
 
2.0%
318
 
1.9%
289
 
1.7%
Other values (380) 12433
72.9%
Common
ValueCountFrequency (%)
2234
60.0%
( 404
 
10.9%
) 404
 
10.9%
- 214
 
5.7%
2 153
 
4.1%
3 69
 
1.9%
1 58
 
1.6%
5 58
 
1.6%
, 38
 
1.0%
· 33
 
0.9%
Other values (11) 58
 
1.6%
Latin
ValueCountFrequency (%)
m 180
76.9%
C 8
 
3.4%
S 7
 
3.0%
T 5
 
2.1%
I 4
 
1.7%
A 4
 
1.7%
E 4
 
1.7%
L 3
 
1.3%
V 3
 
1.3%
P 3
 
1.3%
Other values (10) 13
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 17056
81.2%
ASCII 3922
 
18.7%
None 33
 
0.2%
Math Operators 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2234
57.0%
( 404
 
10.3%
) 404
 
10.3%
- 214
 
5.5%
m 180
 
4.6%
2 153
 
3.9%
3 69
 
1.8%
1 58
 
1.5%
5 58
 
1.5%
, 38
 
1.0%
Other values (29) 110
 
2.8%
Hangul
ValueCountFrequency (%)
862
 
5.1%
625
 
3.7%
577
 
3.4%
490
 
2.9%
395
 
2.3%
377
 
2.2%
355
 
2.1%
335
 
2.0%
318
 
1.9%
289
 
1.7%
Other values (380) 12433
72.9%
None
ValueCountFrequency (%)
· 33
100.0%
Math Operators
ValueCountFrequency (%)
2
100.0%
Distinct64
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-12T21:44:17.468566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length2
Mean length2.0719089
Min length1

Characters and Unicode

Total characters4005
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique62 ?
Unique (%)3.2%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0 1519
78.6%
352
 
18.2%
220308000 1
 
0.1%
23535590 1
 
0.1%
47695000 1
 
0.1%
290206610 1
 
0.1%
301006090 1
 
0.1%
256744000 1
 
0.1%
310744000 1
 
0.1%
934000000 1
 
0.1%
Other values (54) 54
 
2.8%
2023-12-12T21:44:17.868260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1733
43.3%
1581
39.5%
- 352
 
8.8%
2 51
 
1.3%
1 44
 
1.1%
4 43
 
1.1%
3 39
 
1.0%
6 36
 
0.9%
7 35
 
0.9%
9 33
 
0.8%
Other values (2) 58
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2072
51.7%
Space Separator 1581
39.5%
Dash Punctuation 352
 
8.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1733
83.6%
2 51
 
2.5%
1 44
 
2.1%
4 43
 
2.1%
3 39
 
1.9%
6 36
 
1.7%
7 35
 
1.7%
9 33
 
1.6%
5 31
 
1.5%
8 27
 
1.3%
Space Separator
ValueCountFrequency (%)
1581
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 352
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4005
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1733
43.3%
1581
39.5%
- 352
 
8.8%
2 51
 
1.3%
1 44
 
1.1%
4 43
 
1.1%
3 39
 
1.0%
6 36
 
0.9%
7 35
 
0.9%
9 33
 
0.8%
Other values (2) 58
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4005
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1733
43.3%
1581
39.5%
- 352
 
8.8%
2 51
 
1.3%
1 44
 
1.1%
4 43
 
1.1%
3 39
 
1.0%
6 36
 
0.9%
7 35
 
0.9%
9 33
 
0.8%
Other values (2) 58
 
1.4%
Distinct927
Distinct (%)48.0%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-12T21:44:18.082599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length7.4247284
Min length1

Characters and Unicode

Total characters14352
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique754 ?
Unique (%)39.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0 240
 
12.4%
191
 
9.9%
3000000 23
 
1.2%
1000000 22
 
1.1%
5000000 19
 
1.0%
3135000 18
 
0.9%
150000000 16
 
0.8%
20000000 15
 
0.8%
28860000 14
 
0.7%
300000000 13
 
0.7%
Other values (917) 1362
70.5%
2023-12-12T21:44:18.411116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 7814
54.4%
1742
 
12.1%
1 808
 
5.6%
2 631
 
4.4%
5 560
 
3.9%
3 507
 
3.5%
4 504
 
3.5%
6 467
 
3.3%
8 456
 
3.2%
7 366
 
2.6%
Other values (2) 497
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 12419
86.5%
Space Separator 1742
 
12.1%
Dash Punctuation 191
 
1.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 7814
62.9%
1 808
 
6.5%
2 631
 
5.1%
5 560
 
4.5%
3 507
 
4.1%
4 504
 
4.1%
6 467
 
3.8%
8 456
 
3.7%
7 366
 
2.9%
9 306
 
2.5%
Space Separator
ValueCountFrequency (%)
1742
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 191
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 14352
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 7814
54.4%
1742
 
12.1%
1 808
 
5.6%
2 631
 
4.4%
5 560
 
3.9%
3 507
 
3.5%
4 504
 
3.5%
6 467
 
3.3%
8 456
 
3.2%
7 366
 
2.6%
Other values (2) 497
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14352
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 7814
54.4%
1742
 
12.1%
1 808
 
5.6%
2 631
 
4.4%
5 560
 
3.9%
3 507
 
3.5%
4 504
 
3.5%
6 467
 
3.3%
8 456
 
3.2%
7 366
 
2.6%
Other values (2) 497
 
3.5%
Distinct937
Distinct (%)48.5%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-12T21:44:18.643888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length6.5188826
Min length1

Characters and Unicode

Total characters12601
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique832 ?
Unique (%)43.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
352
 
18.2%
0 324
 
16.8%
3135000 18
 
0.9%
5000000 15
 
0.8%
3000000 14
 
0.7%
20000000 11
 
0.6%
1800000 10
 
0.5%
28920000 10
 
0.5%
50000000 10
 
0.5%
10000000 10
 
0.5%
Other values (927) 1159
60.0%
2023-12-12T21:44:18.996147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6320
50.2%
1581
 
12.5%
1 746
 
5.9%
2 588
 
4.7%
5 517
 
4.1%
3 509
 
4.0%
4 467
 
3.7%
6 429
 
3.4%
8 411
 
3.3%
- 352
 
2.8%
Other values (2) 681
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10668
84.7%
Space Separator 1581
 
12.5%
Dash Punctuation 352
 
2.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6320
59.2%
1 746
 
7.0%
2 588
 
5.5%
5 517
 
4.8%
3 509
 
4.8%
4 467
 
4.4%
6 429
 
4.0%
8 411
 
3.9%
7 349
 
3.3%
9 332
 
3.1%
Space Separator
ValueCountFrequency (%)
1581
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 352
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12601
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6320
50.2%
1581
 
12.5%
1 746
 
5.9%
2 588
 
4.7%
5 517
 
4.1%
3 509
 
4.0%
4 467
 
3.7%
6 429
 
3.4%
8 411
 
3.3%
- 352
 
2.8%
Other values (2) 681
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12601
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6320
50.2%
1581
 
12.5%
1 746
 
5.9%
2 588
 
4.7%
5 517
 
4.1%
3 509
 
4.0%
4 467
 
3.7%
6 429
 
3.4%
8 411
 
3.3%
- 352
 
2.8%
Other values (2) 681
 
5.4%
Distinct419
Distinct (%)21.7%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-12T21:44:19.187834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length2
Mean length3.8520435
Min length1

Characters and Unicode

Total characters7446
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique360 ?
Unique (%)18.6%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0 1054
54.5%
352
 
18.2%
20000000 11
 
0.6%
90000000 9
 
0.5%
50000000 8
 
0.4%
300000000 7
 
0.4%
80000000 7
 
0.4%
40000000 6
 
0.3%
100000000 6
 
0.3%
150000000 6
 
0.3%
Other values (385) 467
24.2%
2023-12-12T21:44:19.842120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3612
48.5%
1581
21.2%
- 650
 
8.7%
1 290
 
3.9%
5 219
 
2.9%
2 200
 
2.7%
4 180
 
2.4%
3 177
 
2.4%
8 149
 
2.0%
9 143
 
1.9%
Other values (2) 245
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5215
70.0%
Space Separator 1581
 
21.2%
Dash Punctuation 650
 
8.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3612
69.3%
1 290
 
5.6%
5 219
 
4.2%
2 200
 
3.8%
4 180
 
3.5%
3 177
 
3.4%
8 149
 
2.9%
9 143
 
2.7%
6 128
 
2.5%
7 117
 
2.2%
Space Separator
ValueCountFrequency (%)
1581
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 650
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7446
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3612
48.5%
1581
21.2%
- 650
 
8.7%
1 290
 
3.9%
5 219
 
2.9%
2 200
 
2.7%
4 180
 
2.4%
3 177
 
2.4%
8 149
 
2.0%
9 143
 
1.9%
Other values (2) 245
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7446
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3612
48.5%
1581
21.2%
- 650
 
8.7%
1 290
 
3.9%
5 219
 
2.9%
2 200
 
2.7%
4 180
 
2.4%
3 177
 
2.4%
8 149
 
2.0%
9 143
 
1.9%
Other values (2) 245
 
3.3%
Distinct256
Distinct (%)13.2%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-12T21:44:20.114996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length2
Mean length2.9570616
Min length1

Characters and Unicode

Total characters5716
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique220 ?
Unique (%)11.4%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0 1267
65.5%
352
 
18.2%
60000 12
 
0.6%
50000000 7
 
0.4%
100000000 7
 
0.4%
20000000 6
 
0.3%
3000000 6
 
0.3%
2000000 5
 
0.3%
100000 5
 
0.3%
200000000 4
 
0.2%
Other values (211) 262
 
13.6%
2023-12-12T21:44:20.501877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2710
47.4%
1581
27.7%
- 485
 
8.5%
1 167
 
2.9%
2 139
 
2.4%
3 111
 
1.9%
6 105
 
1.8%
5 97
 
1.7%
8 88
 
1.5%
4 87
 
1.5%
Other values (2) 146
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3650
63.9%
Space Separator 1581
27.7%
Dash Punctuation 485
 
8.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2710
74.2%
1 167
 
4.6%
2 139
 
3.8%
3 111
 
3.0%
6 105
 
2.9%
5 97
 
2.7%
8 88
 
2.4%
4 87
 
2.4%
7 81
 
2.2%
9 65
 
1.8%
Space Separator
ValueCountFrequency (%)
1581
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 485
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5716
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2710
47.4%
1581
27.7%
- 485
 
8.5%
1 167
 
2.9%
2 139
 
2.4%
3 111
 
1.9%
6 105
 
1.8%
5 97
 
1.7%
8 88
 
1.5%
4 87
 
1.5%
Other values (2) 146
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5716
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2710
47.4%
1581
27.7%
- 485
 
8.5%
1 167
 
2.9%
2 139
 
2.4%
3 111
 
1.9%
6 105
 
1.8%
5 97
 
1.7%
8 88
 
1.5%
4 87
 
1.5%
Other values (2) 146
 
2.6%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
0
1172 
-
761 

Length

Max length2
Median length2
Mean length1.6063114
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
0 1172
60.6%
- 761
39.4%

Length

2023-12-12T21:44:20.637276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:44:20.770905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 1172
60.6%
761
39.4%

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1933
Missing (%)100.0%
Memory size17.1 KiB

Correlations

2023-12-12T21:44:20.832602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
예산종류이월예산계획변경,취소
예산종류1.0000.2870.021
이월예산0.2871.0000.765
계획변경,취소0.0210.7651.000
2023-12-12T21:44:20.926582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
계획변경,취소예산종류
계획변경,취소1.0000.035
예산종류0.0351.000
2023-12-12T21:44:21.020631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
예산종류계획변경,취소
예산종류1.0000.035
계획변경,취소0.0351.000

Missing values

2023-12-12T21:44:13.840063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:44:14.102071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

예산종류예산년도예산코드분류코드예산과목명분류코드명이월예산본예산배정예산추경예산전용예산계획변경,취소Unnamed: 12
01202011111-가정용-00000-<NA>
11202011111-가정용-00000-<NA>
21202011111-가정용-00000-<NA>
31202011111-가정용-00000-<NA>
41202011111-가정용-00000-<NA>
51202011111-가정용-00000-<NA>
61202011111-가정용-00000-<NA>
71202011111-가정용-00000-<NA>
81202011111-가정용-00000-<NA>
91202011111-가정용-00000-<NA>
예산종류예산년도예산코드분류코드예산과목명분류코드명이월예산본예산배정예산추경예산전용예산계획변경,취소Unnamed: 12
19232202022413c019감리비당감2배수지 설치공사 건설사업관리용역(이월)24470000-000-<NA>
19242202022414A805시설부대비사직배수지 설치공사 시설부대비080000008000000000<NA>
19252202022414c020시설부대비당감2배수지 설치공사 시설부대비(이월)0-000-<NA>
19262202022612A911국고보조금반환금덕산정수장 태양광발전장치 설치 집행잔액 반환금0-19000000190000000-<NA>
19272202022612A912국고보조금반환금매리취수장 태양광발전장치 설치 집행잔액 반환금0-1197000001197000000-<NA>
19282202022612B075국고보조금반환금사상가압장 비효율 펌프모터 교체 집행잔액 반환0-478800047880000-<NA>
19292202022612B322국고보조금반환금물금취수장 취수펌프 제작교체 집행잔액 반환0-39511000395110000-<NA>
19302202022612B442국고보조금반환금매리취수장 고압펌프모터 제작교체 집행잔액 반환0-34204000342040000-<NA>
193122020227110999예비비자본예산 예비비030000000000-19000000000-<NA>
193232020331104000자금교부자금교부0-000-<NA>

Duplicate rows

Most frequently occurring

예산종류예산년도예산코드분류코드예산과목명분류코드명이월예산본예산배정예산추경예산전용예산계획변경,취소# duplicates
181202011514-기타이자수익-00000-16
271202011800-기타영업외수익-00000-16
251202011630-변상금및위약금수익-00000-15
642202021521-영업미수금-00000-14
672202021522-기타미수금-00000-14
81202011176-기타수수료수익-00000-13
311202011921-전기손익수정이익-00000-13
11202011117-일반용-00000-12
51202011151-신설공사수익-00000-12
61202011161-개조공사수익-00000-12