Overview

Dataset statistics

Number of variables: 14
Number of observations: 10000
Missing cells: 14653
Missing cells (%): 10.5%
Duplicate rows: 604
Duplicate rows (%): 6.0%
Total size in memory: 1.2 MiB
Average record size in memory: 125.0 B
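
The percentages above follow from the raw counts. A quick check, assuming the missing-cell percentage is taken over all variables × observations cells and that the per-column missing counts reported further down in this report (10000, 4648 and 5) are the only ones:

```python
# Counts copied from the dataset statistics and the per-variable sections.
n_variables = 14
n_observations = 10_000
duplicate_rows = 604

# Missing values per column: 계약대장관리번호, 사업자등록번호, 지급명령일자.
missing_per_column = [10_000, 4_648, 5]
missing_cells = sum(missing_per_column)

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells: {missing_cells} ({missing_pct:.1f}%)")
print(f"Duplicate rows: {duplicate_rows} ({duplicate_pct:.1f}%)")
```

The three per-column counts add up to the reported total of 14653 missing cells.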

Variable types

Categorical: 6
Text: 4
Numeric: 3
Unsupported: 1

Dataset

Description: Gyeonggi-do expenditure execution status (routine expenses) [경기도 지출집행 현황(일상경비)]
Author: Gyeonggi-do (경기도)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=LJ9Z18Z5KLJ1M32VO1BK27178834&infSeq=1

Alerts

회계연도 has constant value "2024" (Constant)
회계구분명 has constant value "일반회계" (Constant)
부서구분명 has constant value "본청" (Constant)
경비구분명 has constant value "일상경비" (Constant)
Dataset has 604 (6.0%) duplicate rows (Duplicates)
계약대장관리번호 has 10000 (100.0%) missing values (Missing)
사업자등록번호 has 4648 (46.5%) missing values (Missing)
지출금액 is highly skewed (γ1 = 42.91766329) (Skewed)
계약대장관리번호 is an unsupported type; check whether it needs cleaning or further analysis (Unsupported)
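
The Constant and Missing alerts above can be approximated with a few lines of standard-library Python. This is only a sketch of the idea, not the profiler's own logic; `column_alerts` and the three-row excerpt are hypothetical, and missing values are represented as `None`.

```python
from typing import Any, Dict, List


def column_alerts(rows: List[Dict[str, Any]]) -> Dict[str, str]:
    """Label each column 'constant', 'all missing', or 'ok' (None = missing)."""
    alerts: Dict[str, str] = {}
    columns = list(rows[0].keys()) if rows else []
    for col in columns:
        present = [row[col] for row in rows if row[col] is not None]
        if not present:
            alerts[col] = "all missing"
        elif len(set(present)) == 1:
            alerts[col] = "constant"
        else:
            alerts[col] = "ok"
    return alerts


# Hypothetical excerpt shaped like the profiled table (only three columns).
rows = [
    {"회계연도": "2024", "실국명": "자치행정국", "계약대장관리번호": None},
    {"회계연도": "2024", "실국명": "보건건강국", "계약대장관리번호": None},
    {"회계연도": "2024", "실국명": "보건건강국", "계약대장관리번호": None},
]
print(column_alerts(rows))
```

Columns flagged this way are natural candidates to drop before further analysis, since they carry no information that distinguishes one row from another.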

Reproduction

Analysis started: 2024-05-10 21:18:17.504015
Analysis finished: 2024-05-10 21:18:28.205127
Duration: 10.7 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

회계연도
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

2024 | 10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024
2nd row: 2024
3rd row: 2024
4th row: 2024
5th row: 2024

Common Values

Value | Count | Frequency (%)
2024 | 10000 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
2024 | 10000 | 100.0%

회계구분명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

일반회계 | 10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반회계
2nd row: 일반회계
3rd row: 일반회계
4th row: 일반회계
5th row: 일반회계

Common Values

Value | Count | Frequency (%)
일반회계 | 10000 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
일반회계 | 10000 | 100.0%

부서구분명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

본청 | 10000

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 본청
2nd row: 본청
3rd row: 본청
4th row: 본청
5th row: 본청

Common Values

Value | Count | Frequency (%)
본청 | 10000 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
본청 | 10000 | 100.0%

실국명
Categorical

Distinct: 19
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

보건건강국 | 3315
자치행정국 | 1586
기획조정실 | 742
여성가족국 | 622
안전관리실 | 616
Other values (14) | 3119

Length

Max length: 9
Median length: 5
Mean length: 5.1797
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 자치행정국
2nd row: 미래성장산업국
3rd row: 자치행정국
4th row: 보건건강국
5th row: 보건건강국

Common Values

Value | Count | Frequency (%)
보건건강국 | 3315 | 33.1%
자치행정국 | 1586 | 15.9%
기획조정실 | 742 | 7.4%
여성가족국 | 622 | 6.2%
안전관리실 | 616 | 6.2%
기후환경에너지국 | 446 | 4.5%
복지국 | 425 | 4.2%
문화체육관광국 | 409 | 4.1%
도시주택실 | 319 | 3.2%
평생교육국 | 259 | 2.6%
Other values (9) | 1261 | 12.6%

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
보건건강국 | 3315 | 33.1%
자치행정국 | 1586 | 15.9%
기획조정실 | 742 | 7.4%
여성가족국 | 622 | 6.2%
안전관리실 | 616 | 6.2%
기후환경에너지국 | 446 | 4.5%
복지국 | 425 | 4.2%
문화체육관광국 | 409 | 4.1%
도시주택실 | 319 | 3.2%
평생교육국 | 259 | 2.6%
Other values (9) | 1261 | 12.6%

부서명
Text

Distinct: 87
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 9
Median length: 5
Mean length: 5.2457
Min length: 3

Characters and Unicode

Total characters: 52457
Distinct characters: 154
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 인사과
2nd row: 디지털혁신과
3rd row: 세정과
4th row: 질병정책과
5th row: 질병정책과

Value | Count | Frequency (%)
질병정책과 | 3099 | 31.0%
인사과 | 490 | 4.9%
총무과 | 483 | 4.8%
특별사법경찰단 | 438 | 4.4%
아동돌봄과 | 353 | 3.5%
자산관리과 | 271 | 2.7%
법무담당관 | 225 | 2.2%
여성정책과 | 199 | 2.0%
장애인복지과 | 185 | 1.8%
문화유산과 | 182 | 1.8%
Other values (77) | 4075 | 40.8%

Most occurring characters

ValueCountFrequency (%)
7912
 
15.1%
4697
 
9.0%
3972
 
7.6%
3128
 
6.0%
3099
 
5.9%
1923
 
3.7%
1359
 
2.6%
1332
 
2.5%
1332
 
2.5%
987
 
1.9%
Other values (144) 22716
43.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52359
99.8%
Uppercase Letter 98
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7912
 
15.1%
4697
 
9.0%
3972
 
7.6%
3128
 
6.0%
3099
 
5.9%
1923
 
3.7%
1359
 
2.6%
1332
 
2.5%
1332
 
2.5%
987
 
1.9%
Other values (142) 22618
43.2%
Uppercase Letter
ValueCountFrequency (%)
I 49
50.0%
A 49
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 52359
99.8%
Latin 98
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7912
 
15.1%
4697
 
9.0%
3972
 
7.6%
3128
 
6.0%
3099
 
5.9%
1923
 
3.7%
1359
 
2.6%
1332
 
2.5%
1332
 
2.5%
987
 
1.9%
Other values (142) 22618
43.2%
Latin
ValueCountFrequency (%)
I 49
50.0%
A 49
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 52359
99.8%
ASCII 98
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7912
 
15.1%
4697
 
9.0%
3972
 
7.6%
3128
 
6.0%
3099
 
5.9%
1923
 
3.7%
1359
 
2.6%
1332
 
2.5%
1332
 
2.5%
987
 
1.9%
Other values (142) 22618
43.2%
ASCII
ValueCountFrequency (%)
I 49
50.0%
A 49
50.0%

경비구분명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

일상경비 | 10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일상경비
2nd row: 일상경비
3rd row: 일상경비
4th row: 일상경비
5th row: 일상경비

Common Values

Value | Count | Frequency (%)
일상경비 | 10000 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
일상경비 | 10000 | 100.0%

세부사업명
Text

Distinct: 311
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 38
Median length: 36
Mean length: 18.5742
Min length: 2

Characters and Unicode

Total characters: 185742
Distinct characters: 334
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 41
Unique (%): 0.4%

Sample

1st row: 인사업무추진(자체/직접)
2nd row: 여비
3rd row: 여비
4th row: 코로나바이러스감염증-19 격리입원치료비(국비/직접)
5th row: 코로나바이러스감염증-19 격리입원치료비(국비/직접)

Value | Count | Frequency (%)
코로나바이러스감염증-19 | 3048 | 14.3%
격리입원치료비(국비/직접 | 3048 | 14.3%
지원(자체/직접 | 1428 | 6.7%
운영(자체/직접 | 1070 | 5.0%
일반운영비 | 881 | 4.1%
여비 | 839 | 3.9%
활성화 | 506 | 2.4%
민생안전사법경찰활동 | 401 | 1.9%
추진(자체/직접 | 384 | 1.8%
업무추진비 | 341 | 1.6%
Other values (538) | 9440 | 44.1%

Most occurring characters

ValueCountFrequency (%)
11391
 
6.1%
8590
 
4.6%
( 8140
 
4.4%
) 8140
 
4.4%
7880
 
4.2%
7877
 
4.2%
/ 7870
 
4.2%
5516
 
3.0%
5331
 
2.9%
4883
 
2.6%
Other values (324) 110124
59.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 140659
75.7%
Space Separator 11391
 
6.1%
Open Punctuation 8140
 
4.4%
Close Punctuation 8140
 
4.4%
Other Punctuation 7967
 
4.3%
Decimal Number 6283
 
3.4%
Dash Punctuation 3048
 
1.6%
Uppercase Letter 114
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8590
 
6.1%
7880
 
5.6%
7877
 
5.6%
5516
 
3.9%
5331
 
3.8%
4883
 
3.5%
3722
 
2.6%
3400
 
2.4%
3224
 
2.3%
3204
 
2.3%
Other values (305) 87032
61.9%
Decimal Number
ValueCountFrequency (%)
1 3100
49.3%
9 3053
48.6%
6 72
 
1.1%
3 36
 
0.6%
0 11
 
0.2%
2 11
 
0.2%
Uppercase Letter
ValueCountFrequency (%)
I 26
22.8%
A 26
22.8%
T 22
19.3%
V 22
19.3%
G 18
15.8%
Other Punctuation
ValueCountFrequency (%)
/ 7870
98.8%
· 89
 
1.1%
, 6
 
0.1%
. 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
11391
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8140
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8140
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3048
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 140659
75.7%
Common 44969
 
24.2%
Latin 114
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8590
 
6.1%
7880
 
5.6%
7877
 
5.6%
5516
 
3.9%
5331
 
3.8%
4883
 
3.5%
3722
 
2.6%
3400
 
2.4%
3224
 
2.3%
3204
 
2.3%
Other values (305) 87032
61.9%
Common
ValueCountFrequency (%)
11391
25.3%
( 8140
18.1%
) 8140
18.1%
/ 7870
17.5%
1 3100
 
6.9%
9 3053
 
6.8%
- 3048
 
6.8%
· 89
 
0.2%
6 72
 
0.2%
3 36
 
0.1%
Other values (4) 30
 
0.1%
Latin
ValueCountFrequency (%)
I 26
22.8%
A 26
22.8%
T 22
19.3%
V 22
19.3%
G 18
15.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 140659
75.7%
ASCII 44994
 
24.2%
None 89
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11391
25.3%
( 8140
18.1%
) 8140
18.1%
/ 7870
17.5%
1 3100
 
6.9%
9 3053
 
6.8%
- 3048
 
6.8%
6 72
 
0.2%
3 36
 
0.1%
I 26
 
0.1%
Other values (8) 118
 
0.3%
Hangul
ValueCountFrequency (%)
8590
 
6.1%
7880
 
5.6%
7877
 
5.6%
5516
 
3.9%
5331
 
3.8%
4883
 
3.5%
3722
 
2.6%
3400
 
2.4%
3224
 
2.3%
3204
 
2.3%
Other values (305) 87032
61.9%
None
ValueCountFrequency (%)
· 89
100.0%

통계목명
Categorical

Distinct: 20
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

의료 및 회복비 | 3050
사무관리비 | 2571
국내여비 | 1245
사회보장적수혜금(취약계층, 지방재원) | 889
시책추진업무추진비 | 710
Other values (15) | 1535

Length

Max length: 20
Median length: 14
Mean length: 7.6815
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사무관리비
2nd row: 국내여비
3rd row: 국내여비
4th row: 의료 및 회복비
5th row: 의료 및 회복비

Common Values

Value | Count | Frequency (%)
의료 및 회복비 | 3050 | 30.5%
사무관리비 | 2571 | 25.7%
국내여비 | 1245 | 12.4%
사회보장적수혜금(취약계층, 지방재원) | 889 | 8.9%
시책추진업무추진비 | 710 | 7.1%
공공운영비 | 395 | 4.0%
특정업무경비 | 270 | 2.7%
기관운영업무추진비 | 213 | 2.1%
기타보상금 | 182 | 1.8%
기간제근로자등보수 | 141 | 1.4%
Other values (10) | 334 | 3.3%

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
의료 | 3050 | 17.8%
및 | 3050 | 17.8%
회복비 | 3050 | 17.8%
사무관리비 | 2571 | 15.0%
국내여비 | 1245 | 7.3%
사회보장적수혜금(취약계층 | 889 | 5.2%
지방재원 | 889 | 5.2%
시책추진업무추진비 | 710 | 4.2%
공공운영비 | 395 | 2.3%
특정업무경비 | 270 | 1.6%
Other values (14) | 981 | 5.7%

지출개요
Text

Distinct: 3435
Distinct (%): 34.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 96
Median length: 74
Mean length: 29.1598
Min length: 7

Characters and Unicode

Total characters: 291598
Distinct characters: 615
Distinct categories: 15
Distinct scripts: 4
Distinct blocks: 10
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2226
Unique (%): 22.3%

Sample

1st row: 2024년 제11차 위원회 참석수당 등 지급
2nd row: `24년 2월 디지털혁신과 출장여비 지출
3rd row: 여비내역(2.1~2.15)
4th row: 코로나바이러스감염증-19 격리입원치료비 지급(351차)
5th row: 코로나바이러스감염증-19 격리입원치료비 지급(350차)

Value | Count | Frequency (%)
코로나바이러스감염증-19 | 3048 | 6.5%
격리입원치료비 | 3048 | 6.5%
지급 | 2859 | 6.1%
2024년 | 2442 | 5.2%
지급(351차 | 1144 | 2.4%
건의 | 925 | 2.0%
지출 | 893 | 1.9%
종사자 | 827 | 1.8%
지급(356차 | 589 | 1.3%
처우개선비 | 586 | 1.2%
Other values (4471) | 30543 | 65.1%

Most occurring characters

ValueCountFrequency (%)
37010
 
12.7%
10695
 
3.7%
2 10273
 
3.5%
1 8523
 
2.9%
8075
 
2.8%
7672
 
2.6%
) 7253
 
2.5%
( 7230
 
2.5%
4 5916
 
2.0%
3 5824
 
2.0%
Other values (605) 183127
62.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 183949
63.1%
Decimal Number 44526
 
15.3%
Space Separator 37010
 
12.7%
Close Punctuation 7339
 
2.5%
Open Punctuation 7318
 
2.5%
Other Punctuation 6231
 
2.1%
Dash Punctuation 3227
 
1.1%
Math Symbol 1362
 
0.5%
Uppercase Letter 443
 
0.2%
Lowercase Letter 76
 
< 0.1%
Other values (5) 117
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10695
 
5.8%
8075
 
4.4%
7672
 
4.2%
5478
 
3.0%
3975
 
2.2%
3653
 
2.0%
3632
 
2.0%
3312
 
1.8%
3297
 
1.8%
3228
 
1.8%
Other values (527) 130932
71.2%
Uppercase Letter
ValueCountFrequency (%)
T 60
13.5%
A 54
12.2%
P 41
9.3%
O 39
8.8%
I 36
8.1%
G 35
7.9%
C 29
 
6.5%
E 25
 
5.6%
V 24
 
5.4%
F 15
 
3.4%
Other values (12) 85
19.2%
Lowercase Letter
ValueCountFrequency (%)
t 12
15.8%
k 7
9.2%
p 7
9.2%
a 5
 
6.6%
s 5
 
6.6%
o 5
 
6.6%
h 5
 
6.6%
e 5
 
6.6%
g 4
 
5.3%
r 4
 
5.3%
Other values (9) 17
22.4%
Decimal Number
ValueCountFrequency (%)
2 10273
23.1%
1 8523
19.1%
4 5916
13.3%
3 5824
13.1%
0 5093
11.4%
9 3875
 
8.7%
5 3284
 
7.4%
6 1120
 
2.5%
8 319
 
0.7%
7 299
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 5064
81.3%
, 759
 
12.2%
' 218
 
3.5%
· 160
 
2.6%
/ 12
 
0.2%
: 10
 
0.2%
? 7
 
0.1%
& 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 1354
99.4%
+ 5
 
0.4%
2
 
0.1%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 7253
98.8%
61
 
0.8%
] 25
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 7230
98.8%
61
 
0.8%
[ 27
 
0.4%
Modifier Symbol
ValueCountFrequency (%)
` 19
67.9%
˙ 8
28.6%
˚ 1
 
3.6%
Space Separator
ValueCountFrequency (%)
37010
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3227
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 72
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%
Final Punctuation
ValueCountFrequency (%)
6
100.0%
Initial Punctuation
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 183936
63.1%
Common 107130
36.7%
Latin 520
 
0.2%
Han 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10695
 
5.8%
8075
 
4.4%
7672
 
4.2%
5478
 
3.0%
3975
 
2.2%
3653
 
2.0%
3632
 
2.0%
3312
 
1.8%
3297
 
1.8%
3228
 
1.8%
Other values (524) 130919
71.2%
Latin
ValueCountFrequency (%)
T 60
 
11.5%
A 54
 
10.4%
P 41
 
7.9%
O 39
 
7.5%
I 36
 
6.9%
G 35
 
6.7%
C 29
 
5.6%
E 25
 
4.8%
V 24
 
4.6%
F 15
 
2.9%
Other values (32) 162
31.2%
Common
ValueCountFrequency (%)
37010
34.5%
2 10273
 
9.6%
1 8523
 
8.0%
) 7253
 
6.8%
( 7230
 
6.7%
4 5916
 
5.5%
3 5824
 
5.4%
0 5093
 
4.8%
. 5064
 
4.7%
9 3875
 
3.6%
Other values (27) 11069
 
10.3%
Han
ValueCountFrequency (%)
11
91.7%
1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 183858
63.1%
ASCII 107338
36.8%
None 283
 
0.1%
Compat Jamo 78
 
< 0.1%
CJK 12
 
< 0.1%
Modifier Letters 9
 
< 0.1%
Punctuation 9
 
< 0.1%
Geometric Shapes 8
 
< 0.1%
Math Operators 2
 
< 0.1%
Arrows 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
37010
34.5%
2 10273
 
9.6%
1 8523
 
7.9%
) 7253
 
6.8%
( 7230
 
6.7%
4 5916
 
5.5%
3 5824
 
5.4%
0 5093
 
4.7%
. 5064
 
4.7%
9 3875
 
3.6%
Other values (58) 11277
 
10.5%
Hangul
ValueCountFrequency (%)
10695
 
5.8%
8075
 
4.4%
7672
 
4.2%
5478
 
3.0%
3975
 
2.2%
3653
 
2.0%
3632
 
2.0%
3312
 
1.8%
3297
 
1.8%
3228
 
1.8%
Other values (523) 130841
71.2%
None
ValueCountFrequency (%)
· 160
56.5%
61
 
21.6%
61
 
21.6%
º 1
 
0.4%
Compat Jamo
ValueCountFrequency (%)
78
100.0%
CJK
ValueCountFrequency (%)
11
91.7%
1
 
8.3%
Geometric Shapes
ValueCountFrequency (%)
8
100.0%
Modifier Letters
ValueCountFrequency (%)
˙ 8
88.9%
˚ 1
 
11.1%
Punctuation
ValueCountFrequency (%)
6
66.7%
3
33.3%
Math Operators
ValueCountFrequency (%)
2
100.0%
Arrows
ValueCountFrequency (%)
1
100.0%

지출금액
Real number (ℝ)

SKEWED 

Distinct: 4063
Distinct (%): 40.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 392824.78
Minimum: -971480
Maximum: 2.0098297 × 10^8
Zeros: 0
Zeros (%): 0.0%
Negative: 12
Negative (%): 0.1%
Memory size: 166.0 KiB

Quantile statistics

Minimum: -971480
5-th percentile: 10000
Q1: 50000
Median: 112500
Q3: 258850
95-th percentile: 1047287.5
Maximum: 2.0098297 × 10^8
Range: 2.0195445 × 10^8
Interquartile range (IQR): 208850

Descriptive statistics

Standard deviation: 3198247
Coefficient of variation (CV): 8.1416631
Kurtosis: 2199.7379
Mean: 392824.78
Median Absolute Deviation (MAD): 78500
Skewness: 42.917663
Sum: 3.9282478 × 10^9
Variance: 1.0228784 × 10^13
Monotonicity: Not monotonic
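
Several of the 지출금액 statistics are tied together by simple identities, which gives a cheap consistency check on the report. The sketch below re-derives the range, IQR, coefficient of variation and variance from the reported values; because those inputs are already rounded, agreement is only expected up to rounding. The variable names are illustrative.

```python
# Values copied from the 지출금액 statistics (already rounded by the report).
minimum, maximum = -971_480, 200_982_970
q1, q3 = 50_000, 258_850
mean, std = 392_824.78, 3_198_247

value_range = maximum - minimum  # reported as 2.0195445 × 10^8
iqr = q3 - q1                    # reported as 208850
cv = std / mean                  # reported as 8.1416631
variance = std ** 2              # reported as 1.0228784 × 10^13

print(value_range, iqr, f"{cv:.7f}", f"{variance:.7e}")
```

A standard deviation roughly eight times the mean, together with a median (112500) far below the mean, is consistent with the Skewed alert: a handful of very large payments dominate the moments.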
[Figure: Histogram with fixed size bins (bins=50)]
Value | Count | Frequency (%)
50000 | 677 | 6.8%
200000 | 439 | 4.4%
100000 | 319 | 3.2%
150000 | 177 | 1.8%
37500 | 157 | 1.6%
300000 | 141 | 1.4%
273600 | 141 | 1.4%
80000 | 102 | 1.0%
20000 | 98 | 1.0%
182400 | 97 | 1.0%
Other values (4053) | 7652 | 76.5%

Value | Count | Frequency (%)
-971480 | 1 | < 0.1%
-330000 | 1 | < 0.1%
-200000 | 1 | < 0.1%
-170000 | 1 | < 0.1%
-119230 | 1 | < 0.1%
-100000 | 1 | < 0.1%
-55170 | 2 | < 0.1%
-42850 | 1 | < 0.1%
-20000 | 1 | < 0.1%
-12300 | 1 | < 0.1%

Value | Count | Frequency (%)
200982970 | 1 | < 0.1%
127876520 | 1 | < 0.1%
125692730 | 1 | < 0.1%
101745130 | 1 | < 0.1%
89974800 | 1 | < 0.1%
33837000 | 1 | < 0.1%
27700000 | 1 | < 0.1%
26660000 | 1 | < 0.1%
22617530 | 1 | < 0.1%
22000000 | 1 | < 0.1%

지급명령일자
Real number (ℝ)

Distinct: 85
Distinct (%): 0.9%
Missing: 5
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 20240327
Minimum: 20240108
Maximum: 20240513
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 20240108
5-th percentile: 20240125
Q1: 20240229
Median: 20240320
Q3: 20240415
95-th percentile: 20240508
Maximum: 20240513
Range: 405
Interquartile range (IQR): 186

Descriptive statistics

Standard deviation: 108.84876
Coefficient of variation (CV): 5.3778162 × 10^-6
Kurtosis: -0.59959481
Mean: 20240327
Median Absolute Deviation (MAD): 92
Skewness: -0.11453157
Sum: 2.0230207 × 10^11
Variance: 11848.053
Monotonicity: Not monotonic
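
지급명령일자 holds dates encoded as YYYYMMDD integers (for example 20240320), so numeric statistics such as the mean or the range of 405 are not calendar quantities. A minimal sketch, assuming the values should be read as calendar dates, parses them with the standard library and compares the numeric range with the elapsed days; `parse_yyyymmdd` is an illustrative helper, not part of the profiler.

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Interpret an integer such as 20240320 as the date 2024-03-20."""
    return datetime.strptime(str(value), "%Y%m%d").date()


earliest = parse_yyyymmdd(20240108)  # reported minimum
latest = parse_yyyymmdd(20240513)    # reported maximum

numeric_range = 20240513 - 20240108       # what the report lists as "Range"
calendar_span = (latest - earliest).days  # elapsed days between the two dates

print(numeric_range, calendar_span)
```

Parsing the column as dates before profiling would let quantiles and histograms be expressed on a real time axis instead of on integer codes with gaps at every month boundary.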
[Figure: Histogram with fixed size bins (bins=50)]
Value | Count | Frequency (%)
20240320 | 1338 | 13.4%
20240508 | 674 | 6.7%
20240314 | 575 | 5.8%
20240312 | 557 | 5.6%
20240223 | 243 | 2.4%
20240125 | 233 | 2.3%
20240307 | 197 | 2.0%
20240325 | 175 | 1.8%
20240424 | 166 | 1.7%
20240425 | 164 | 1.6%
Other values (75) | 5673 | 56.7%

Value | Count | Frequency (%)
20240108 | 2 | < 0.1%
20240109 | 5 | 0.1%
20240110 | 15 | 0.1%
20240111 | 11 | 0.1%
20240112 | 3 | < 0.1%
20240115 | 18 | 0.2%
20240116 | 19 | 0.2%
20240117 | 19 | 0.2%
20240118 | 48 | 0.5%
20240119 | 133 | 1.3%

Value | Count | Frequency (%)
20240513 | 4 | < 0.1%
20240510 | 142 | 1.4%
20240509 | 104 | 1.0%
20240508 | 674 | 6.7%
20240507 | 60 | 0.6%
20240503 | 110 | 1.1%
20240502 | 159 | 1.6%
20240430 | 131 | 1.3%
20240429 | 98 | 1.0%
20240426 | 111 | 1.1%

거래처명
Text

Distinct: 3740
Distinct (%): 37.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 47
Median length: 3
Mean length: 4.2886
Min length: 2

Characters and Unicode

Total characters: 42886
Distinct characters: 650
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3242
Unique (%): 32.4%

Sample

1st row: ***
2nd row: ***
3rd row: ***
4th row: 박하준
5th row: 정태윤

Value | Count | Frequency (%)
(value missing in source) | 4899 | 45.4%
광교점 | 121 | 1.1%
주식회 | 61 | 0.6%
엠에스리테일 | 55 | 0.5%
주식회사 | 55 | 0.5%
광교 | 35 | 0.3%
우사 | 32 | 0.3%
얌샘김밥 | 31 | 0.3%
힘난다버거 | 28 | 0.3%
영통구청 | 27 | 0.3%
Other values (3884) | 5454 | 50.5%

Most occurring characters

ValueCountFrequency (%)
* 17337
40.4%
911
 
2.1%
798
 
1.9%
743
 
1.7%
678
 
1.6%
475
 
1.1%
( 454
 
1.1%
) 443
 
1.0%
428
 
1.0%
399
 
0.9%
Other values (640) 20220
47.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 23180
54.1%
Other Punctuation 17421
40.6%
Space Separator 798
 
1.9%
Open Punctuation 454
 
1.1%
Close Punctuation 443
 
1.0%
Uppercase Letter 364
 
0.8%
Lowercase Letter 100
 
0.2%
Decimal Number 93
 
0.2%
Other Symbol 24
 
0.1%
Dash Punctuation 4
 
< 0.1%
Other values (2) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
911
 
3.9%
743
 
3.2%
678
 
2.9%
475
 
2.0%
428
 
1.8%
399
 
1.7%
399
 
1.7%
348
 
1.5%
313
 
1.4%
294
 
1.3%
Other values (576) 18192
78.5%
Uppercase Letter
ValueCountFrequency (%)
S 67
18.4%
C 46
12.6%
K 39
10.7%
T 28
7.7%
G 26
 
7.1%
O 23
 
6.3%
M 20
 
5.5%
F 19
 
5.2%
A 17
 
4.7%
I 17
 
4.7%
Other values (13) 62
17.0%
Lowercase Letter
ValueCountFrequency (%)
i 15
15.0%
e 13
13.0%
l 12
12.0%
r 9
9.0%
s 7
7.0%
p 7
7.0%
n 6
 
6.0%
o 5
 
5.0%
g 5
 
5.0%
m 4
 
4.0%
Other values (10) 17
17.0%
Decimal Number
ValueCountFrequency (%)
9 20
21.5%
2 18
19.4%
1 17
18.3%
3 14
15.1%
0 9
9.7%
4 8
 
8.6%
5 4
 
4.3%
6 2
 
2.2%
8 1
 
1.1%
Other Punctuation
ValueCountFrequency (%)
* 17337
99.5%
/ 33
 
0.2%
& 28
 
0.2%
. 21
 
0.1%
, 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
798
100.0%
Open Punctuation
ValueCountFrequency (%)
( 454
100.0%
Close Punctuation
ValueCountFrequency (%)
) 443
100.0%
Other Symbol
ValueCountFrequency (%)
24
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Math Symbol
ValueCountFrequency (%)
+ 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 23204
54.1%
Common 19218
44.8%
Latin 464
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
911
 
3.9%
743
 
3.2%
678
 
2.9%
475
 
2.0%
428
 
1.8%
399
 
1.7%
399
 
1.7%
348
 
1.5%
313
 
1.3%
294
 
1.3%
Other values (577) 18216
78.5%
Latin
ValueCountFrequency (%)
S 67
14.4%
C 46
 
9.9%
K 39
 
8.4%
T 28
 
6.0%
G 26
 
5.6%
O 23
 
5.0%
M 20
 
4.3%
F 19
 
4.1%
A 17
 
3.7%
I 17
 
3.7%
Other values (33) 162
34.9%
Common
ValueCountFrequency (%)
* 17337
90.2%
798
 
4.2%
( 454
 
2.4%
) 443
 
2.3%
/ 33
 
0.2%
& 28
 
0.1%
. 21
 
0.1%
9 20
 
0.1%
2 18
 
0.1%
1 17
 
0.1%
Other values (10) 49
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 23180
54.1%
ASCII 19682
45.9%
None 24
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 17337
88.1%
798
 
4.1%
( 454
 
2.3%
) 443
 
2.3%
S 67
 
0.3%
C 46
 
0.2%
K 39
 
0.2%
/ 33
 
0.2%
& 28
 
0.1%
T 28
 
0.1%
Other values (53) 409
 
2.1%
Hangul
ValueCountFrequency (%)
911
 
3.9%
743
 
3.2%
678
 
2.9%
475
 
2.0%
428
 
1.8%
399
 
1.7%
399
 
1.7%
348
 
1.5%
313
 
1.4%
294
 
1.3%
Other values (576) 18192
78.5%
None
ValueCountFrequency (%)
24
100.0%

계약대장관리번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

사업자등록번호
Real number (ℝ)

MISSING 

Distinct: 1214
Distinct (%): 22.7%
Missing: 4648
Missing (%): 46.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.8420256 × 10^9
Minimum: 1.010973 × 10^9
Maximum: 8.9903025 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1.010973 × 10^9
5-th percentile: 1.1682022 × 10^9
Q1: 1.2882064 × 10^9
Median: 1.3782025 × 10^9
Q3: 4.1218019 × 10^9
95-th percentile: 7.9685022 × 10^9
Maximum: 8.9903025 × 10^9
Range: 7.9793296 × 10^9
Interquartile range (IQR): 2.8335955 × 10^9

Descriptive statistics

Standard deviation: 2.2625735 × 10^9
Coefficient of variation (CV): 0.79611299
Kurtosis: 0.15036034
Mean: 2.8420256 × 10^9
Median Absolute Deviation (MAD): 1.9993781 × 10^8
Skewness: 1.2511733
Sum: 1.5210521 × 10^13
Variance: 5.119239 × 10^18
Monotonicity: Not monotonic
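
사업자등록번호 (business registration number) is an identifier rather than a quantity, so the mean, variance and skewness above carry no real meaning. A hedged sketch of treating it as text instead, assuming the usual 10-digit layout written as 3-2-5 digit groups; `format_brn` is a hypothetical helper and missing values are passed through as `None`.

```python
from typing import Optional, Union


def format_brn(value: Optional[Union[int, float]]) -> Optional[str]:
    """Render a registration number as 'XXX-XX-XXXXX'; keep missing values as None."""
    if value is None:
        return None
    # Numeric parsing drops leading zeros, so pad back to 10 digits.
    digits = str(int(value)).zfill(10)
    if len(digits) != 10:
        raise ValueError(f"expected at most 10 digits, got {value!r}")
    return f"{digits[:3]}-{digits[3:5]}-{digits[5:]}"


print(format_brn(1258204368))
print(format_brn(None))
```

Stored as strings, the values would be profiled as categorical or text, and the frequency table would remain the informative part of this section.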
[Figure: Histogram with fixed size bins (bins=50)]
Value | Count | Frequency (%)
1258204368 | 312 | 3.1%
1328207422 | 191 | 1.9%
6254300295 | 102 | 1.0%
3449500361 | 96 | 1.0%
1358212074 | 94 | 0.9%
1248300269 | 59 | 0.6%
1328632822 | 59 | 0.6%
1349271900 | 57 | 0.6%
1288208418 | 55 | 0.5%
8078200237 | 52 | 0.5%
Other values (1204) | 4275 | 42.8%
(Missing) | 4648 | 46.5%

Value | Count | Frequency (%)
1010972950 | 1 | < 0.1%
1011126202 | 1 | < 0.1%
1011184240 | 1 | < 0.1%
1018302925 | 2 | < 0.1%
1018547682 | 4 | < 0.1%
1019467642 | 1 | < 0.1%
1020892383 | 1 | < 0.1%
1022708343 | 1 | < 0.1%
1028111670 | 3 | < 0.1%
1028132035 | 1 | < 0.1%

Value | Count | Frequency (%)
8990302511 | 1 | < 0.1%
8973700206 | 1 | < 0.1%
8973600679 | 6 | 0.1%
8949601431 | 1 | < 0.1%
8919100383 | 1 | < 0.1%
8908501791 | 1 | < 0.1%
8875400735 | 2 | < 0.1%
8870502754 | 1 | < 0.1%
8867800382 | 2 | < 0.1%
8855000920 | 1 | < 0.1%

Interactions

[Figures: interaction plots]

Correlations

[Figure: correlation heatmap]

Variable | 실국명 | 부서명 | 통계목명 | 지출금액 | 지급명령일자 | 사업자등록번호
실국명 | 1.000 | 1.000 | 0.759 | 0.000 | 0.513 | 0.282
부서명 | 1.000 | 1.000 | 0.898 | 0.000 | 0.615 | 0.413
통계목명 | 0.759 | 0.898 | 1.000 | 0.484 | 0.554 | 0.364
지출금액 | 0.000 | 0.000 | 0.484 | 1.000 | 0.000 | 0.000
지급명령일자 | 0.513 | 0.615 | 0.554 | 0.000 | 1.000 | 0.271
사업자등록번호 | 0.282 | 0.413 | 0.364 | 0.000 | 0.271 | 1.000

[Figure: correlation heatmap]

Variable | 통계목명 | 실국명
통계목명 | 1.000 | 0.328
실국명 | 0.328 | 1.000

[Figure: correlation heatmap]

Variable | 지출금액 | 지급명령일자 | 사업자등록번호 | 실국명 | 통계목명
지출금액 | 1.000 | -0.055 | -0.106 | 0.000 | 0.248
지급명령일자 | -0.055 | 1.000 | 0.044 | 0.267 | 0.295
사업자등록번호 | -0.106 | 0.044 | 1.000 | 0.109 | 0.144
실국명 | 0.000 | 0.267 | 0.109 | 1.000 | 0.328
통계목명 | 0.248 | 0.295 | 0.144 | 0.328 | 1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
[Figure] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
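
Nullity correlation can be read as the Pearson correlation between per-column missingness indicators (1 where a value is missing, 0 otherwise). The sketch below illustrates that reading on a hypothetical five-row excerpt; the function name and the data are assumptions for illustration, and a column that is always or never missing has no defined correlation, so the function returns `None` for it.

```python
from math import sqrt
from typing import Optional, Sequence


def nullity_correlation(a: Sequence[object], b: Sequence[object]) -> Optional[float]:
    """Pearson correlation of the 'is missing' indicators of two columns (None = missing)."""
    x = [1.0 if value is None else 0.0 for value in a]
    y = [1.0 if value is None else 0.0 for value in b]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    var_x = sum((xi - mean_x) ** 2 for xi in x)
    var_y = sum((yi - mean_y) ** 2 for yi in y)
    if var_x == 0 or var_y == 0:
        return None  # constant indicator: correlation is undefined
    return cov / sqrt(var_x * var_y)


# Hypothetical excerpt: two columns missing on the same rows, one always missing.
payment_date = [None, 20240320, None, 20240314, None]
registration_no = [None, 1378208431, None, 1358200113, None]
contract_no = [None, None, None, None, None]

print(nullity_correlation(payment_date, registration_no))
print(nullity_correlation(registration_no, contract_no))
```

In this dataset 계약대장관리번호 is missing in every row, so it falls into the undefined case and contributes nothing to a nullity heatmap.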

Sample

(index) | 회계연도 | 회계구분명 | 부서구분명 | 실국명 | 부서명 | 경비구분명 | 세부사업명 | 통계목명 | 지출개요 | 지출금액 | 지급명령일자 | 거래처명 | 계약대장관리번호 | 사업자등록번호
33170 | 2024 | 일반회계 | 본청 | 자치행정국 | 인사과 | 일상경비 | 인사업무추진(자체/직접) | 사무관리비 | 2024년 제11차 위원회 참석수당 등 지급 | 740000 | 20240409 | *** | <NA> | <NA>
27860 | 2024 | 일반회계 | 본청 | 미래성장산업국 | 디지털혁신과 | 일상경비 | 여비 | 국내여비 | `24년 2월 디지털혁신과 출장여비 지출 | 228000 | 20240318 | *** | <NA> | <NA>
32348 | 2024 | 일반회계 | 본청 | 자치행정국 | 세정과 | 일상경비 | 여비 | 국내여비 | 여비내역(2.1~2.15) | 20000 | 20240307 | *** | <NA> | <NA>
18876 | 2024 | 일반회계 | 본청 | 보건건강국 | 질병정책과 | 일상경비 | 코로나바이러스감염증-19 격리입원치료비(국비/직접) | 의료 및 회복비 | 코로나바이러스감염증-19 격리입원치료비 지급(351차) | 102790 | 20240320 | 박하준 | <NA> | 1378208431
19720 | 2024 | 일반회계 | 본청 | 보건건강국 | 질병정책과 | 일상경비 | 코로나바이러스감염증-19 격리입원치료비(국비/직접) | 의료 및 회복비 | 코로나바이러스감염증-19 격리입원치료비 지급(350차) | 23270 | 20240314 | 정태윤 | <NA> | 1358200113
27602 | 2024 | 일반회계 | 본청 | 감사관 | 계약심사담당관 | 일상경비 | 여비 | 국내여비 | 국내여비 지출(4.25.) | 37500 | 20240502 | *** | <NA> | <NA>
15799 | 2024 | 일반회계 | 본청 | 보건건강국 | 질병정책과 | 일상경비 | 코로나바이러스감염증-19 격리입원치료비(국비/직접) | 의료 및 회복비 | 코로나바이러스감염증-19 격리입원치료비 지급(351차) | 33490 | 20240320 | 신승근 | <NA> | 1328207422
57073 | 2024 | 일반회계 | 본청 | 도시주택실 | 도시재생과 | 일상경비 | 위원회 운영(자체/직접) | 행사실비지원금 | 도시재생사업 공모 선정 제고를 위한 컨설팅 운영비 지출 건의 | 24000 | 20240326 | 두부품은 육개장(양주 | <NA> | 1975500584
5970 | 2024 | 일반회계 | 본청 | 안전관리실 | 안전기획과 | 일상경비 | 여비 | 국내여비 | 2월 국내여비 지급(2.1.~2.29.) | 10000 | 20240314 | *** | <NA> | <NA>
10660 | 2024 | 일반회계 | 본청 | 보건건강국 | 질병정책과 | 일상경비 | 코로나바이러스감염증-19 격리입원치료비(국비/직접) | 의료 및 회복비 | 코로나바이러스감염증-19 격리입원치료비 지급(355차) | 370 | 20240502 | 장남경 | <NA> | 2091254203

(index) | 회계연도 | 회계구분명 | 부서구분명 | 실국명 | 부서명 | 경비구분명 | 세부사업명 | 통계목명 | 지출개요 | 지출금액 | 지급명령일자 | 거래처명 | 계약대장관리번호 | 사업자등록번호
1366 | 2024 | 일반회계 | 본청 | 평생교육국 | 도서관정책과 | 일상경비 | 여비 | 국내여비 | 직원 출장여비 지급(4.1.~4.21.) | 50000 | 20240426 | *** | <NA> | <NA>
29104 | 2024 | 일반회계 | 본청 | 자치행정국 | 총무과 | 일상경비 | 일반운영비 | 사무관리비 | 2024년 4월 당직비 지급(4.1~4.30) | 52000 | 20240508 | *** | <NA> | <NA>
24682 | 2024 | 일반회계 | 본청 | 보건건강국 | 질병정책과 | 일상경비 | 코로나바이러스감염증-19 격리입원치료비(국비/직접) | 의료 및 회복비 | 코로나바이러스감염증-19 격리입원치료비 지급(347차) | 3437240 | 20240305 | 황민호(아이원병원) | <NA> | 8449300246
15508 | 2024 | 일반회계 | 본청 | 보건건강국 | 질병정책과 | 일상경비 | 코로나바이러스감염증-19 격리입원치료비(국비/직접) | 의료 및 회복비 | 코로나바이러스감염증-19 격리입원치료비 지급(351차) | 309130 | 20240320 | 선봉규 | <NA> | 1359012521
2081 | 2024 | 일반회계 | 본청 | 홍보기획관 | 도민소통담당관 | 일상경비 | 광고제작 및 확산(자체/직접) | 사무관리비 | 제안서 평가위원회 수당 지급(2024년 경기도 및 도 정책사업 통합 마케팅 용역) | 300000 | 20240124 | *** | <NA> | <NA>
17031 | 2024 | 일반회계 | 본청 | 보건건강국 | 질병정책과 | 일상경비 | 코로나바이러스감염증-19 격리입원치료비(국비/직접) | 의료 및 회복비 | 코로나바이러스감염증-19 격리입원치료비 지급(351차) | 643290 | 20240320 | 이창수 | <NA> | 1378208431
5281 | 2024 | 일반회계 | 본청 | 안전관리실 | 안전특별점검단 | 일상경비 | 안전점검 및 안전감찰을 통한 재난예방 추진(자체/직접) | 국내여비 | 2월 2차 국내여비 지급(2. 16. ~ 2. 29.) | 224500 | 20240312 | ****** | <NA> | <NA>
39208 | 2024 | 일반회계 | 본청 | 문화체육관광국 | 문화유산과 | 일상경비 | 도 무형문화재(개인) 전승(자체/직접) | 기타보상금 | 도 무형문화재(개인) 전승지원금 지급(2024년 3월) | 250000 | 20240320 | *** | <NA> | <NA>
9665 | 2024 | 일반회계 | 본청 | 보건건강국 | 질병정책과 | 일상경비 | 코로나바이러스감염증-19 격리입원치료비(국비/직접) | 의료 및 회복비 | 코로나바이러스감염증-19 격리입원치료비 지급(356차) | 3500 | 20240508 | 장선아 | <NA> | 6254300295
34896 | 2024 | 일반회계 | 본청 | 자치행정국 | 인사과 | 일상경비 | 공정한 시험업무 추진(자체/직접) | 사무관리비 | 2024년 제1회 개방형직위 임용시험 사전서류전형·면접·선발시험위원 및 시험관리관 수당 지급 건의 | 60000 | 20240216 | *** | <NA> | <NA>

Duplicate rows

Most frequently occurring

(index) | 회계연도 | 회계구분명 | 부서구분명 | 실국명 | 부서명 | 경비구분명 | 세부사업명 | 통계목명 | 지출개요 | 지출금액 | 지급명령일자 | 거래처명 | 사업자등록번호 | # duplicates
439 | 2024 | 일반회계 | 본청 | 자치행정국 | 열린민원실 | 일상경비 | 경기사랑 도민 참여단 운영(자체/직접) | 기타보상금 | 2024년 제1차 경기사랑 도민 참여단 도정 의견수렴 보상금 지급 | 30000 | 20240312 | *** | <NA> | 34
329 | 2024 | 일반회계 | 본청 | 안전관리실 | 특별사법경찰단 | 일상경비 | 민생안전사법경찰활동 활성화 지원(자체/직접) | 특정업무경비 | 2024년 1월 특정업무경비 지급 | 200000 | 20240131 | *** | <NA> | 27
333 | 2024 | 일반회계 | 본청 | 안전관리실 | 특별사법경찰단 | 일상경비 | 민생안전사법경찰활동 활성화 지원(자체/직접) | 특정업무경비 | 2024년 4월 특정업무경비 지급 | 200000 | 20240430 | *** | <NA> | 21
331 | 2024 | 일반회계 | 본청 | 안전관리실 | 특별사법경찰단 | 일상경비 | 민생안전사법경찰활동 활성화 지원(자체/직접) | 특정업무경비 | 2024년 2월 특정업무경비 지급 | 200000 | 20240229 | *** | <NA> | 20
379 | 2024 | 일반회계 | 본청 | 여성가족국 | 아동돌봄과 | 일상경비 | 아동일시보호소 지원(자체/직접) | 사회보장적수혜금(취약계층, 지방재원) | 2024년 1월 아동일시보호소 종사자 처우개선비 지급 | 50000 | 20240125 | *** | <NA> | 20
382 | 2024 | 일반회계 | 본청 | 여성가족국 | 아동돌봄과 | 일상경비 | 아동일시보호소 지원(자체/직접) | 사회보장적수혜금(취약계층, 지방재원) | 2024년 2월 아동일시보호소 종사자 처우개선비 지급(경기도, 북부, 남부) | 50000 | 20240223 | *** | <NA> | 20
390 | 2024 | 일반회계 | 본청 | 여성가족국 | 아동돌봄과 | 일상경비 | 아동일시보호소 지원(자체/직접) | 사회보장적수혜금(취약계층, 지방재원) | 2024년 4월 아동일시보호소 종사자 처우개선비 지급 | 50000 | 20240425 | *** | <NA> | 20
417 | 2024 | 일반회계 | 본청 | 여성가족국 | 여성정책과 | 일상경비 | 해바라기센터 운영지원(자체/직접) | 사회보장적수혜금(취약계층, 지방재원) | 2024년 4월 해바라기센터 종사자 처우개선비 지급 | 50000 | 20240429 | *** | <NA> | 20
332 | 2024 | 일반회계 | 본청 | 안전관리실 | 특별사법경찰단 | 일상경비 | 민생안전사법경찰활동 활성화 지원(자체/직접) | 특정업무경비 | 2024년 3월 특정업무경비 지급 | 200000 | 20240329 | *** | <NA> | 19
581 | 2024 | 일반회계 | 본청 | 평생교육국 | 청소년과 | 일상경비 | 도 청소년시설 종사자 처우개선(자체/직접) | 사회보장적수혜금(취약계층, 지방재원) | 2024년 4월 경기도청소년수련원 종사자 처우개선비 지급 | 50000 | 20240424 | *** | <NA> | 18
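
Counts like those in the `# duplicates` column can be reproduced by grouping on the full row. A minimal sketch using only the standard library, on a hypothetical handful of rows reduced to three columns; whether the profiler's headline figure counts every occurrence or only the repeats is not stated in this report, so both numbers are computed.

```python
from collections import Counter

# Hypothetical rows as (부서명, 지출개요, 지출금액) tuples; the real table has more columns.
rows = [
    ("특별사법경찰단", "2024년 1월 특정업무경비 지급", 200000),
    ("특별사법경찰단", "2024년 1월 특정업무경비 지급", 200000),
    ("특별사법경찰단", "2024년 1월 특정업무경비 지급", 200000),
    ("아동돌봄과", "2024년 1월 아동일시보호소 종사자 처우개선비 지급", 50000),
    ("아동돌봄과", "2024년 1월 아동일시보호소 종사자 처우개선비 지급", 50000),
    ("인사과", "2024년 제11차 위원회 참석수당 등 지급", 740000),
]

counts = Counter(rows)
# Groups that occur more than once, most frequent first.
duplicate_groups = [(row, n) for row, n in counts.most_common() if n > 1]
# Rows that repeat an earlier row (the first occurrence is not counted).
repeated_rows = sum(n - 1 for _, n in duplicate_groups)
# Rows that belong to any duplicated group (the first occurrence is counted).
rows_in_groups = sum(n for _, n in duplicate_groups)

print(duplicate_groups[0][1], repeated_rows, rows_in_groups)
```

Many of the duplicated rows in this dataset are fixed monthly allowances paid to masked recipients (거래처명 shown as ***), so identical rows may be distinct payments rather than data-entry errors; dropping them blindly would understate total spending.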