Overview

Dataset statistics

Number of variables23
Number of observations10000
Missing cells6561
Missing cells (%)2.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 MiB
Average record size in memory199.0 B

Variable types

Categorical6
Text7
Numeric5
Boolean3
DateTime2

Dataset

Description한국노인인력개발원 시장형, 공익활동형 노인일자리 사업의 사업명, 내용, 기간, 수치 등 통합 정보를 제공하는 데이터입니다.
URLhttps://www.data.go.kr/data/15050148/fileData.do

Alerts

사업년도 has constant value ""Constant
특수사업명코드 has constant value ""Constant
비예산여부 is highly imbalanced (98.6%)Imbalance
사업계획서상태코드 is highly imbalanced (76.4%)Imbalance
최근승인첨부파일 has 6557 (65.6%) missing valuesMissing
사업번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:55:15.619567
Analysis finished2023-12-12 08:55:18.064254
Duration2.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업유형
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
공익활동형
5859 
사회서비스형
2251 
시장형
1890 

Length

Max length6
Median length5
Mean length4.8471
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공익활동형
2nd row공익활동형
3rd row공익활동형
4th row사회서비스형
5th row공익활동형

Common Values

ValueCountFrequency (%)
공익활동형 5859
58.6%
사회서비스형 2251
 
22.5%
시장형 1890
 
18.9%

Length

2023-12-12T17:55:18.137607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:55:18.264360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공익활동형 5859
58.6%
사회서비스형 2251
 
22.5%
시장형 1890
 
18.9%

사업번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:55:18.578891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters100000
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st row2022-08423
2nd row2022-06930
3rd row2022-01372
4th row2022-00382
5th row2022-02984
ValueCountFrequency (%)
2022-08423 1
 
< 0.1%
2022-13445 1
 
< 0.1%
2022-06535 1
 
< 0.1%
2022-07488 1
 
< 0.1%
2022-06812 1
 
< 0.1%
2022-09450 1
 
< 0.1%
2022-05464 1
 
< 0.1%
2022-02361 1
 
< 0.1%
2022-08120 1
 
< 0.1%
2022-08577 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-12T17:55:19.066120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 33966
34.0%
0 23007
23.0%
- 10000
 
10.0%
1 5685
 
5.7%
5 4004
 
4.0%
3 3972
 
4.0%
4 3961
 
4.0%
7 3921
 
3.9%
8 3883
 
3.9%
6 3852
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 90000
90.0%
Dash Punctuation 10000
 
10.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 33966
37.7%
0 23007
25.6%
1 5685
 
6.3%
5 4004
 
4.4%
3 3972
 
4.4%
4 3961
 
4.4%
7 3921
 
4.4%
8 3883
 
4.3%
6 3852
 
4.3%
9 3749
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 10000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 33966
34.0%
0 23007
23.0%
- 10000
 
10.0%
1 5685
 
5.7%
5 4004
 
4.0%
3 3972
 
4.0%
4 3961
 
4.0%
7 3921
 
3.9%
8 3883
 
3.9%
6 3852
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 33966
34.0%
0 23007
23.0%
- 10000
 
10.0%
1 5685
 
5.7%
5 4004
 
4.0%
3 3972
 
4.0%
4 3961
 
4.0%
7 3921
 
3.9%
8 3883
 
3.9%
6 3852
 
3.9%

사업계획변경순번
Real number (ℝ)

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.1215
Minimum0
Maximum13
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:55:19.231654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q33
95-th percentile5
Maximum13
Range13
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.4127765
Coefficient of variation (CV)0.66593281
Kurtosis4.6532614
Mean2.1215
Median Absolute Deviation (MAD)1
Skewness1.8589998
Sum21215
Variance1.9959373
MonotonicityNot monotonic
2023-12-12T17:55:19.368952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
1 4267
42.7%
2 2929
29.3%
3 1441
 
14.4%
4 680
 
6.8%
5 352
 
3.5%
6 163
 
1.6%
7 87
 
0.9%
8 42
 
0.4%
9 22
 
0.2%
10 12
 
0.1%
Other values (3) 5
 
0.1%
ValueCountFrequency (%)
0 1
 
< 0.1%
1 4267
42.7%
2 2929
29.3%
3 1441
 
14.4%
4 680
 
6.8%
5 352
 
3.5%
6 163
 
1.6%
7 87
 
0.9%
8 42
 
0.4%
9 22
 
0.2%
ValueCountFrequency (%)
13 1
 
< 0.1%
11 3
 
< 0.1%
10 12
 
0.1%
9 22
 
0.2%
8 42
 
0.4%
7 87
 
0.9%
6 163
 
1.6%
5 352
 
3.5%
4 680
6.8%
3 1441
14.4%

사업년도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 10000
100.0%

Length

2023-12-12T17:55:19.545232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:55:19.714220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 10000
100.0%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
True
8125 
False
1875 
ValueCountFrequency (%)
True 8125
81.2%
False 1875
 
18.8%
2023-12-12T17:55:19.805929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

계속사업시작연도
Real number (ℝ)

Distinct22
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2018.619
Minimum2000
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:55:19.947027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2000
5-th percentile2009
Q12017
median2020
Q32022
95-th percentile2022
Maximum2022
Range22
Interquartile range (IQR)5

Descriptive statistics

Standard deviation4.1393662
Coefficient of variation (CV)0.0020505931
Kurtosis1.4739148
Mean2018.619
Median Absolute Deviation (MAD)2
Skewness-1.4570287
Sum20186190
Variance17.134352
MonotonicityNot monotonic
2023-12-12T17:55:20.100348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
2022 3352
33.5%
2021 1278
 
12.8%
2019 1024
 
10.2%
2020 984
 
9.8%
2016 555
 
5.5%
2018 487
 
4.9%
2017 484
 
4.8%
2015 334
 
3.3%
2014 243
 
2.4%
2012 233
 
2.3%
Other values (12) 1026
 
10.3%
ValueCountFrequency (%)
2000 2
 
< 0.1%
2002 1
 
< 0.1%
2003 1
 
< 0.1%
2004 23
 
0.2%
2005 20
 
0.2%
2006 150
1.5%
2007 107
1.1%
2008 110
1.1%
2009 168
1.7%
2010 122
1.2%
ValueCountFrequency (%)
2022 3352
33.5%
2021 1278
 
12.8%
2020 984
 
9.8%
2019 1024
 
10.2%
2018 487
 
4.9%
2017 484
 
4.8%
2016 555
 
5.5%
2015 334
 
3.3%
2014 243
 
2.4%
2013 195
 
1.9%
Distinct64
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:55:20.706131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length19
Mean length14.8827
Min length8

Characters and Unicode

Total characters148827
Distinct characters158
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주정차질서 계도 봉사(A-17)
2nd row도서관 봉사(A-12)
3rd row지역아동센터 봉사(A-11)
4th row안전관리지원(B-10)
5th row지역사회 환경개선 봉사(A-16)
ValueCountFrequency (%)
1226
 
5.1%
1071
 
4.5%
봉사(a-15 932
 
3.9%
공공의료 932
 
3.9%
복지시설 932
 
3.9%
지역사회 930
 
3.9%
환경개선 930
 
3.9%
봉사(a-16 930
 
3.9%
공원 926
 
3.9%
공공시설 926
 
3.9%
Other values (108) 14216
59.4%
2023-12-12T17:55:21.265381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13951
 
9.4%
( 10378
 
7.0%
) 10378
 
7.0%
- 10000
 
6.7%
1 7091
 
4.8%
A 5859
 
3.9%
5580
 
3.7%
5311
 
3.6%
4955
 
3.3%
0 4350
 
2.9%
Other values (148) 70974
47.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 74104
49.8%
Decimal Number 20000
 
13.4%
Space Separator 13951
 
9.4%
Open Punctuation 10378
 
7.0%
Close Punctuation 10378
 
7.0%
Uppercase Letter 10016
 
6.7%
Dash Punctuation 10000
 
6.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5580
 
7.5%
5311
 
7.2%
4955
 
6.7%
4106
 
5.5%
3335
 
4.5%
3295
 
4.4%
3133
 
4.2%
1782
 
2.4%
1458
 
2.0%
1442
 
1.9%
Other values (128) 39707
53.6%
Decimal Number
ValueCountFrequency (%)
1 7091
35.5%
0 4350
21.8%
3 1338
 
6.7%
2 1315
 
6.6%
5 1266
 
6.3%
6 1243
 
6.2%
9 1189
 
5.9%
8 1083
 
5.4%
7 745
 
3.7%
4 380
 
1.9%
Uppercase Letter
ValueCountFrequency (%)
A 5859
58.5%
B 2251
 
22.5%
E 1890
 
18.9%
C 8
 
0.1%
T 4
 
< 0.1%
V 4
 
< 0.1%
Space Separator
ValueCountFrequency (%)
13951
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10378
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10378
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 74104
49.8%
Common 64707
43.5%
Latin 10016
 
6.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5580
 
7.5%
5311
 
7.2%
4955
 
6.7%
4106
 
5.5%
3335
 
4.5%
3295
 
4.4%
3133
 
4.2%
1782
 
2.4%
1458
 
2.0%
1442
 
1.9%
Other values (128) 39707
53.6%
Common
ValueCountFrequency (%)
13951
21.6%
( 10378
16.0%
) 10378
16.0%
- 10000
15.5%
1 7091
11.0%
0 4350
 
6.7%
3 1338
 
2.1%
2 1315
 
2.0%
5 1266
 
2.0%
6 1243
 
1.9%
Other values (4) 3397
 
5.2%
Latin
ValueCountFrequency (%)
A 5859
58.5%
B 2251
 
22.5%
E 1890
 
18.9%
C 8
 
0.1%
T 4
 
< 0.1%
V 4
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 74723
50.2%
Hangul 74104
49.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13951
18.7%
( 10378
13.9%
) 10378
13.9%
- 10000
13.4%
1 7091
9.5%
A 5859
7.8%
0 4350
 
5.8%
B 2251
 
3.0%
E 1890
 
2.5%
3 1338
 
1.8%
Other values (10) 7237
9.7%
Hangul
ValueCountFrequency (%)
5580
 
7.5%
5311
 
7.2%
4955
 
6.7%
4106
 
5.5%
3335
 
4.5%
3295
 
4.4%
3133
 
4.2%
1782
 
2.4%
1458
 
2.0%
1442
 
1.9%
Other values (128) 39707
53.6%

비예산여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
9987 
True
 
13
ValueCountFrequency (%)
False 9987
99.9%
True 13
 
0.1%
2023-12-12T17:55:21.416352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

특수사업명코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
105001
10000 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row105001
2nd row105001
3rd row105001
4th row105001
5th row105001

Common Values

ValueCountFrequency (%)
105001 10000
100.0%

Length

2023-12-12T17:55:21.553859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:55:21.685316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
105001 10000
100.0%
Distinct7279
Distinct (%)72.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:55:21.985147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length34
Mean length9.53
Min length2

Characters and Unicode

Total characters95300
Distinct characters858
Distinct categories18 ?
Distinct scripts4 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6259 ?
Unique (%)62.6%

Sample

1st row불법주정차계도사업
2nd row사서도우미
3rd row지역아동센터환경정화
4th row수요처안전모니터링
5th row우리마을가꾸기
ValueCountFrequency (%)
노노케어 283
 
2.0%
시니어 195
 
1.4%
188
 
1.3%
사업 146
 
1.0%
지원 130
 
0.9%
경로당 127
 
0.9%
도우미 105
 
0.7%
사업단 84
 
0.6%
공공시설 74
 
0.5%
봉사 70
 
0.5%
Other values (7490) 13026
90.3%
2023-12-12T17:55:22.555642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4559
 
4.8%
4366
 
4.6%
4239
 
4.4%
2996
 
3.1%
2751
 
2.9%
2327
 
2.4%
1860
 
2.0%
1690
 
1.8%
1687
 
1.8%
1686
 
1.8%
Other values (848) 67139
70.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 85969
90.2%
Space Separator 4559
 
4.8%
Close Punctuation 1256
 
1.3%
Open Punctuation 1254
 
1.3%
Decimal Number 972
 
1.0%
Other Punctuation 370
 
0.4%
Uppercase Letter 352
 
0.4%
Dash Punctuation 260
 
0.3%
Lowercase Letter 211
 
0.2%
Letter Number 33
 
< 0.1%
Other values (8) 64
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4366
 
5.1%
4239
 
4.9%
2996
 
3.5%
2751
 
3.2%
2327
 
2.7%
1860
 
2.2%
1690
 
2.0%
1687
 
2.0%
1686
 
2.0%
1652
 
1.9%
Other values (758) 60715
70.6%
Uppercase Letter
ValueCountFrequency (%)
E 42
11.9%
M 37
10.5%
S 36
10.2%
C 34
9.7%
G 26
 
7.4%
K 22
 
6.2%
T 22
 
6.2%
O 20
 
5.7%
A 17
 
4.8%
B 12
 
3.4%
Other values (13) 84
23.9%
Lowercase Letter
ValueCountFrequency (%)
e 50
23.7%
a 28
13.3%
f 20
 
9.5%
r 14
 
6.6%
m 14
 
6.6%
n 11
 
5.2%
i 11
 
5.2%
c 10
 
4.7%
o 10
 
4.7%
h 6
 
2.8%
Other values (11) 37
17.5%
Other Punctuation
ValueCountFrequency (%)
' 116
31.4%
, 92
24.9%
" 60
16.2%
. 28
 
7.6%
& 25
 
6.8%
! 17
 
4.6%
: 12
 
3.2%
· 11
 
3.0%
/ 4
 
1.1%
# 4
 
1.1%
Decimal Number
ValueCountFrequency (%)
2 303
31.2%
1 188
19.3%
0 143
14.7%
3 69
 
7.1%
9 65
 
6.7%
6 57
 
5.9%
8 53
 
5.5%
5 45
 
4.6%
7 27
 
2.8%
4 22
 
2.3%
Letter Number
ValueCountFrequency (%)
19
57.6%
12
36.4%
1
 
3.0%
1
 
3.0%
Math Symbol
ValueCountFrequency (%)
+ 4
50.0%
~ 2
25.0%
> 1
 
12.5%
< 1
 
12.5%
Close Punctuation
ValueCountFrequency (%)
) 1235
98.3%
] 20
 
1.6%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1233
98.3%
[ 20
 
1.6%
1
 
0.1%
Final Punctuation
ValueCountFrequency (%)
12
75.0%
4
 
25.0%
Initial Punctuation
ValueCountFrequency (%)
12
75.0%
4
 
25.0%
Space Separator
ValueCountFrequency (%)
4559
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 260
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 16
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 4
100.0%
Other Number
ValueCountFrequency (%)
2
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 85908
90.1%
Common 8735
 
9.2%
Latin 596
 
0.6%
Han 61
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4366
 
5.1%
4239
 
4.9%
2996
 
3.5%
2751
 
3.2%
2327
 
2.7%
1860
 
2.2%
1690
 
2.0%
1687
 
2.0%
1686
 
2.0%
1652
 
1.9%
Other values (742) 60654
70.6%
Latin
ValueCountFrequency (%)
e 50
 
8.4%
E 42
 
7.0%
M 37
 
6.2%
S 36
 
6.0%
C 34
 
5.7%
a 28
 
4.7%
G 26
 
4.4%
K 22
 
3.7%
T 22
 
3.7%
O 20
 
3.4%
Other values (38) 279
46.8%
Common
ValueCountFrequency (%)
4559
52.2%
) 1235
 
14.1%
( 1233
 
14.1%
2 303
 
3.5%
- 260
 
3.0%
1 188
 
2.2%
0 143
 
1.6%
' 116
 
1.3%
, 92
 
1.1%
3 69
 
0.8%
Other values (32) 537
 
6.1%
Han
ValueCountFrequency (%)
29
47.5%
11
 
18.0%
3
 
4.9%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
1
 
1.6%
1
 
1.6%
Other values (6) 6
 
9.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 85908
90.1%
ASCII 9250
 
9.7%
CJK 48
 
0.1%
Number Forms 33
 
< 0.1%
Punctuation 32
 
< 0.1%
CJK Compat Ideographs 13
 
< 0.1%
None 13
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4559
49.3%
) 1235
 
13.4%
( 1233
 
13.3%
2 303
 
3.3%
- 260
 
2.8%
1 188
 
2.0%
0 143
 
1.5%
' 116
 
1.3%
, 92
 
1.0%
3 69
 
0.7%
Other values (67) 1052
 
11.4%
Hangul
ValueCountFrequency (%)
4366
 
5.1%
4239
 
4.9%
2996
 
3.5%
2751
 
3.2%
2327
 
2.7%
1860
 
2.2%
1690
 
2.0%
1687
 
2.0%
1686
 
2.0%
1652
 
1.9%
Other values (742) 60654
70.6%
CJK
ValueCountFrequency (%)
29
60.4%
3
 
6.2%
2
 
4.2%
2
 
4.2%
2
 
4.2%
2
 
4.2%
1
 
2.1%
1
 
2.1%
1
 
2.1%
1
 
2.1%
Other values (4) 4
 
8.3%
Number Forms
ValueCountFrequency (%)
19
57.6%
12
36.4%
1
 
3.0%
1
 
3.0%
Punctuation
ValueCountFrequency (%)
12
37.5%
12
37.5%
4
 
12.5%
4
 
12.5%
CJK Compat Ideographs
ValueCountFrequency (%)
11
84.6%
2
 
15.4%
None
ValueCountFrequency (%)
· 11
84.6%
1
 
7.7%
1
 
7.7%
Enclosed Alphanum
ValueCountFrequency (%)
2
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%

관활시도명
Categorical

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도
1611 
서울특별시
1160 
전라북도
908 
부산광역시
897 
경상남도
787 
Other values (12)
4637 

Length

Max length7
Median length5
Mean length4.2102
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광주광역시
2nd row인천광역시
3rd row전라남도
4th row경기도
5th row인천광역시

Common Values

ValueCountFrequency (%)
경기도 1611
16.1%
서울특별시 1160
11.6%
전라북도 908
9.1%
부산광역시 897
9.0%
경상남도 787
7.9%
전라남도 643
 
6.4%
경상북도 601
 
6.0%
충청남도 587
 
5.9%
강원도 573
 
5.7%
인천광역시 542
 
5.4%
Other values (7) 1691
16.9%

Length

2023-12-12T17:55:22.738569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 1611
16.1%
서울특별시 1160
11.6%
전라북도 908
9.1%
부산광역시 897
9.0%
경상남도 787
7.9%
전라남도 643
 
6.4%
경상북도 601
 
6.0%
충청남도 587
 
5.9%
강원도 573
 
5.7%
인천광역시 542
 
5.4%
Other values (7) 1691
16.9%

시군구코드
Real number (ℝ)

Distinct239
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.6402253 × 109
Minimum1.1009 × 109
Maximum5.013 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:55:22.950074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.1009 × 109
5-th percentile1.138 × 109
Q12.814 × 109
median4.146 × 109
Q34.521 × 109
95-th percentile4.831 × 109
Maximum5.013 × 109
Range3.9121 × 109
Interquartile range (IQR)1.707 × 109

Descriptive statistics

Standard deviation1.1742684 × 109
Coefficient of variation (CV)0.32258123
Kurtosis-0.29451301
Mean3.6402253 × 109
Median Absolute Deviation (MAD)5.77 × 108
Skewness-0.93951551
Sum3.6402253 × 1013
Variance1.3789062 × 1018
MonotonicityNot monotonic
2023-12-12T17:55:23.203154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4511000000 218
 
2.2%
4812000000 158
 
1.6%
4128000000 143
 
1.4%
4311000000 135
 
1.4%
4514000000 133
 
1.3%
4113000000 125
 
1.2%
4119000000 110
 
1.1%
4111000000 109
 
1.1%
2817000000 109
 
1.1%
1138000000 102
 
1.0%
Other values (229) 8658
86.6%
ValueCountFrequency (%)
1100900000 5
 
0.1%
1111000000 46
0.5%
1114000000 44
0.4%
1117000000 24
0.2%
1120000000 38
0.4%
1121500000 32
0.3%
1123000000 29
0.3%
1126000000 31
0.3%
1129000000 35
0.4%
1130500000 16
 
0.2%
ValueCountFrequency (%)
5013000000 48
0.5%
5011000000 72
0.7%
4889000000 63
0.6%
4888000000 35
0.4%
4887000000 35
0.4%
4886000000 20
 
0.2%
4885000000 25
 
0.2%
4884000000 17
 
0.2%
4882000000 33
0.3%
4874000000 30
0.3%
Distinct214
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:55:23.677542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length2.9275
Min length2

Characters and Unicode

Total characters29275
Distinct characters136
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row북구
2nd row계양구
3rd row여수시
4th row구리시
5th row강화군
ValueCountFrequency (%)
동구 282
 
2.8%
서구 275
 
2.7%
중구 273
 
2.7%
남구 252
 
2.5%
북구 225
 
2.2%
전주시 218
 
2.2%
창원시 158
 
1.6%
고양시 143
 
1.4%
청주시 139
 
1.4%
익산시 133
 
1.3%
Other values (204) 7912
79.0%
2023-12-12T17:55:24.271668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4391
 
15.0%
3819
 
13.0%
2177
 
7.4%
1068
 
3.6%
890
 
3.0%
753
 
2.6%
746
 
2.5%
739
 
2.5%
729
 
2.5%
709
 
2.4%
Other values (126) 13254
45.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29265
> 99.9%
Space Separator 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4391
 
15.0%
3819
 
13.0%
2177
 
7.4%
1068
 
3.6%
890
 
3.0%
753
 
2.6%
746
 
2.5%
739
 
2.5%
729
 
2.5%
709
 
2.4%
Other values (125) 13244
45.3%
Space Separator
ValueCountFrequency (%)
10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29265
> 99.9%
Common 10
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4391
 
15.0%
3819
 
13.0%
2177
 
7.4%
1068
 
3.6%
890
 
3.0%
753
 
2.6%
746
 
2.5%
739
 
2.5%
729
 
2.5%
709
 
2.4%
Other values (125) 13244
45.3%
Common
ValueCountFrequency (%)
10
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29265
> 99.9%
ASCII 10
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4391
 
15.0%
3819
 
13.0%
2177
 
7.4%
1068
 
3.6%
890
 
3.0%
753
 
2.6%
746
 
2.5%
739
 
2.5%
729
 
2.5%
709
 
2.4%
Other values (125) 13244
45.3%
ASCII
ValueCountFrequency (%)
10
100.0%

기관ID
Real number (ℝ)

Distinct1279
Distinct (%)12.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39234.122
Minimum2
Maximum701083
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:55:24.431726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile85
Q1586
median1652
Q32693
95-th percentile330083
Maximum701083
Range701081
Interquartile range (IQR)2107

Descriptive statistics

Standard deviation108012.84
Coefficient of variation (CV)2.7530333
Kurtosis9.6767103
Mean39234.122
Median Absolute Deviation (MAD)1052
Skewness3.1681803
Sum3.9234122 × 108
Variance1.1666775 × 1010
MonotonicityNot monotonic
2023-12-12T17:55:24.573811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1652 55
 
0.5%
744 47
 
0.5%
1308 45
 
0.4%
2739 42
 
0.4%
163 41
 
0.4%
178 39
 
0.4%
171 39
 
0.4%
2496 38
 
0.4%
109083 38
 
0.4%
25083 38
 
0.4%
Other values (1269) 9578
95.8%
ValueCountFrequency (%)
2 7
0.1%
3 5
0.1%
4 3
 
< 0.1%
5 1
 
< 0.1%
6 9
0.1%
7 6
0.1%
8 3
 
< 0.1%
9 7
0.1%
10 2
 
< 0.1%
11 4
< 0.1%
ValueCountFrequency (%)
701083 1
 
< 0.1%
690083 1
 
< 0.1%
687085 1
 
< 0.1%
635083 1
 
< 0.1%
634083 1
 
< 0.1%
633083 1
 
< 0.1%
632083 5
 
0.1%
584084 16
0.2%
570083 5
 
0.1%
568083 2
 
< 0.1%
Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기도
1611 
서울특별시
1160 
전라북도
908 
부산광역시
897 
경상남도
787 
Other values (12)
4637 

Length

Max length7
Median length5
Mean length4.2102
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광주광역시
2nd row인천광역시
3rd row전라남도
4th row경기도
5th row인천광역시

Common Values

ValueCountFrequency (%)
경기도 1611
16.1%
서울특별시 1160
11.6%
전라북도 908
9.1%
부산광역시 897
9.0%
경상남도 787
7.9%
전라남도 643
 
6.4%
경상북도 601
 
6.0%
충청남도 587
 
5.9%
강원도 573
 
5.7%
인천광역시 542
 
5.4%
Other values (7) 1691
16.9%

Length

2023-12-12T17:55:24.739711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 1611
16.1%
서울특별시 1160
11.6%
전라북도 908
9.1%
부산광역시 897
9.0%
경상남도 787
7.9%
전라남도 643
 
6.4%
경상북도 601
 
6.0%
충청남도 587
 
5.9%
강원도 573
 
5.7%
인천광역시 542
 
5.4%
Other values (7) 1691
16.9%
Distinct214
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T17:55:25.160715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length2.9275
Min length2

Characters and Unicode

Total characters29275
Distinct characters136
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row북구
2nd row계양구
3rd row여수시
4th row구리시
5th row강화군
ValueCountFrequency (%)
동구 282
 
2.8%
서구 275
 
2.7%
중구 273
 
2.7%
남구 252
 
2.5%
북구 225
 
2.2%
전주시 218
 
2.2%
창원시 158
 
1.6%
고양시 143
 
1.4%
청주시 139
 
1.4%
익산시 133
 
1.3%
Other values (204) 7912
79.0%
2023-12-12T17:55:25.653393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4391
 
15.0%
3819
 
13.0%
2177
 
7.4%
1068
 
3.6%
890
 
3.0%
753
 
2.6%
746
 
2.5%
739
 
2.5%
729
 
2.5%
709
 
2.4%
Other values (126) 13254
45.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29265
> 99.9%
Space Separator 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4391
 
15.0%
3819
 
13.0%
2177
 
7.4%
1068
 
3.6%
890
 
3.0%
753
 
2.6%
746
 
2.5%
739
 
2.5%
729
 
2.5%
709
 
2.4%
Other values (125) 13244
45.3%
Space Separator
ValueCountFrequency (%)
10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29265
> 99.9%
Common 10
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4391
 
15.0%
3819
 
13.0%
2177
 
7.4%
1068
 
3.6%
890
 
3.0%
753
 
2.6%
746
 
2.5%
739
 
2.5%
729
 
2.5%
709
 
2.4%
Other values (125) 13244
45.3%
Common
ValueCountFrequency (%)
10
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29265
> 99.9%
ASCII 10
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4391
 
15.0%
3819
 
13.0%
2177
 
7.4%
1068
 
3.6%
890
 
3.0%
753
 
2.6%
746
 
2.5%
739
 
2.5%
729
 
2.5%
709
 
2.4%
Other values (125) 13244
45.3%
ASCII
ValueCountFrequency (%)
10
100.0%
Distinct83
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-01-01 00:00:00
Maximum2022-12-01 00:00:00
2023-12-12T17:55:25.813613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:55:25.988671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct54
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-01-31 00:00:00
Maximum2022-12-31 00:00:00
2023-12-12T17:55:26.147725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:55:26.283321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업계획서상태코드
Categorical

IMBALANCE 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
승인완료
8837 
임시
 
864
반려
 
172
삭제요청
 
66
변경심사요청
 
41
Other values (2)
 
20

Length

Max length6
Median length4
Mean length3.8028
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row승인완료
2nd row승인완료
3rd row임시
4th row승인완료
5th row임시

Common Values

ValueCountFrequency (%)
승인완료 8837
88.4%
임시 864
 
8.6%
반려 172
 
1.7%
삭제요청 66
 
0.7%
변경심사요청 41
 
0.4%
조건부승인 18
 
0.2%
심사요청 2
 
< 0.1%

Length

2023-12-12T17:55:26.430251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:55:26.546080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
승인완료 8837
88.4%
임시 864
 
8.6%
반려 172
 
1.7%
삭제요청 66
 
0.7%
변경심사요청 41
 
0.4%
조건부승인 18
 
0.2%
심사요청 2
 
< 0.1%

목표일자리수
Real number (ℝ)

Distinct424
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69.4731
Minimum0
Maximum3558
Zeros9
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T17:55:26.680992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile6
Q115
median35
Q385
95-th percentile220
Maximum3558
Range3558
Interquartile range (IQR)70

Descriptive statistics

Standard deviation112.05944
Coefficient of variation (CV)1.6129903
Kurtosis168.58079
Mean69.4731
Median Absolute Deviation (MAD)25
Skewness8.9595275
Sum694731
Variance12557.317
MonotonicityNot monotonic
2023-12-12T17:55:26.857558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 627
 
6.3%
20 605
 
6.0%
30 409
 
4.1%
40 335
 
3.4%
50 328
 
3.3%
12 307
 
3.1%
15 260
 
2.6%
8 252
 
2.5%
60 248
 
2.5%
6 220
 
2.2%
Other values (414) 6409
64.1%
ValueCountFrequency (%)
0 9
 
0.1%
1 20
 
0.2%
2 90
 
0.9%
3 61
 
0.6%
4 162
1.6%
5 99
 
1.0%
6 220
2.2%
7 84
 
0.8%
8 252
2.5%
9 75
 
0.8%
ValueCountFrequency (%)
3558 1
< 0.1%
2642 1
< 0.1%
2029 1
< 0.1%
1800 1
< 0.1%
1700 1
< 0.1%
1541 1
< 0.1%
1500 1
< 0.1%
1499 1
< 0.1%
1493 1
< 0.1%
1406 1
< 0.1%
Distinct9064
Distinct (%)90.7%
Missing4
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T17:55:27.182130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length75
Median length59
Mean length25.926971
Min length6

Characters and Unicode

Total characters259166
Distinct characters824
Distinct categories16 ?
Distinct scripts4 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8435 ?
Unique (%)84.4%

Sample

1st row2022년+노인+공익활동+사업계획서(주정차).hwp
2nd row사서도우미.hwp
3rd row(공익활동)2022년 지역아동센터환경정화 사업계획서.hwp
4th row2022년 수요처안전모니터링 사업계획서.hwp
5th row2022년+노인+공익활동+사업계획서+(우리마을가꾸기)+210명.hwp
ValueCountFrequency (%)
2022년 4860
 
19.1%
사업계획서.hwp 2622
 
10.3%
공익활동 622
 
2.4%
사회서비스형 427
 
1.7%
노인 415
 
1.6%
사업계획서 380
 
1.5%
2022 283
 
1.1%
22년 242
 
0.9%
196
 
0.8%
168
 
0.7%
Other values (9841) 15260
59.9%
2023-12-12T17:55:27.767979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 27127
 
10.5%
15569
 
6.0%
12890
 
5.0%
. 11145
 
4.3%
10554
 
4.1%
p 9994
 
3.9%
h 9841
 
3.8%
w 9837
 
3.8%
9626
 
3.7%
0 8998
 
3.5%
Other values (814) 133585
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 144894
55.9%
Decimal Number 40026
 
15.4%
Lowercase Letter 30251
 
11.7%
Space Separator 15569
 
6.0%
Other Punctuation 11360
 
4.4%
Close Punctuation 4572
 
1.8%
Open Punctuation 4481
 
1.7%
Math Symbol 4128
 
1.6%
Connector Punctuation 1731
 
0.7%
Dash Punctuation 1614
 
0.6%
Other values (6) 540
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12890
 
8.9%
10554
 
7.3%
9626
 
6.6%
8408
 
5.8%
8273
 
5.7%
7956
 
5.5%
3858
 
2.7%
2924
 
2.0%
2920
 
2.0%
2597
 
1.8%
Other values (724) 74888
51.7%
Lowercase Letter
ValueCountFrequency (%)
p 9994
33.0%
h 9841
32.5%
w 9837
32.5%
f 148
 
0.5%
d 139
 
0.5%
x 55
 
0.2%
e 55
 
0.2%
i 29
 
0.1%
r 26
 
0.1%
v 22
 
0.1%
Other values (13) 105
 
0.3%
Uppercase Letter
ValueCountFrequency (%)
A 76
18.0%
E 65
15.4%
B 52
12.3%
M 36
8.5%
S 31
7.3%
C 29
 
6.9%
T 24
 
5.7%
O 22
 
5.2%
K 18
 
4.3%
G 13
 
3.1%
Other values (12) 56
13.3%
Decimal Number
ValueCountFrequency (%)
2 27127
67.8%
0 8998
 
22.5%
1 1875
 
4.7%
3 446
 
1.1%
6 360
 
0.9%
5 303
 
0.8%
4 298
 
0.7%
7 230
 
0.6%
8 224
 
0.6%
9 165
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 11145
98.1%
, 106
 
0.9%
' 70
 
0.6%
& 14
 
0.1%
# 7
 
0.1%
! 7
 
0.1%
@ 5
 
< 0.1%
§ 3
 
< 0.1%
· 3
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 4477
97.9%
] 90
 
2.0%
4
 
0.1%
} 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
+ 4118
99.8%
~ 6
 
0.1%
3
 
0.1%
= 1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
91
95.8%
2
 
2.1%
1
 
1.1%
1
 
1.1%
Open Punctuation
ValueCountFrequency (%)
( 4387
97.9%
[ 90
 
2.0%
4
 
0.1%
Connector Punctuation
ValueCountFrequency (%)
_ 1727
99.8%
_ 4
 
0.2%
Letter Number
ValueCountFrequency (%)
7
53.8%
6
46.2%
Initial Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Final Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
15569
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1614
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 144871
55.9%
Common 83586
32.3%
Latin 30686
 
11.8%
Han 23
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12890
 
8.9%
10554
 
7.3%
9626
 
6.6%
8408
 
5.8%
8273
 
5.7%
7956
 
5.5%
3858
 
2.7%
2924
 
2.0%
2920
 
2.0%
2597
 
1.8%
Other values (718) 74865
51.7%
Latin
ValueCountFrequency (%)
p 9994
32.6%
h 9841
32.1%
w 9837
32.1%
f 148
 
0.5%
d 139
 
0.5%
A 76
 
0.2%
E 65
 
0.2%
x 55
 
0.2%
e 55
 
0.2%
B 52
 
0.2%
Other values (37) 424
 
1.4%
Common
ValueCountFrequency (%)
2 27127
32.5%
15569
18.6%
. 11145
13.3%
0 8998
 
10.8%
) 4477
 
5.4%
( 4387
 
5.2%
+ 4118
 
4.9%
1 1875
 
2.2%
_ 1727
 
2.1%
- 1614
 
1.9%
Other values (33) 2549
 
3.0%
Han
ValueCountFrequency (%)
18
78.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 144871
55.9%
ASCII 114137
44.0%
Misc Symbols 91
 
< 0.1%
CJK 22
 
< 0.1%
None 18
 
< 0.1%
Number Forms 13
 
< 0.1%
Geometric Shapes 6
 
< 0.1%
Punctuation 6
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 27127
23.8%
15569
13.6%
. 11145
9.8%
p 9994
 
8.8%
h 9841
 
8.6%
w 9837
 
8.6%
0 8998
 
7.9%
) 4477
 
3.9%
( 4387
 
3.8%
+ 4118
 
3.6%
Other values (64) 8644
 
7.6%
Hangul
ValueCountFrequency (%)
12890
 
8.9%
10554
 
7.3%
9626
 
6.6%
8408
 
5.8%
8273
 
5.7%
7956
 
5.5%
3858
 
2.7%
2924
 
2.0%
2920
 
2.0%
2597
 
1.8%
Other values (718) 74865
51.7%
Misc Symbols
ValueCountFrequency (%)
91
100.0%
CJK
ValueCountFrequency (%)
18
81.8%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Number Forms
ValueCountFrequency (%)
7
53.8%
6
46.2%
None
ValueCountFrequency (%)
_ 4
22.2%
4
22.2%
4
22.2%
§ 3
16.7%
· 3
16.7%
Geometric Shapes
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Punctuation
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct3426
Distinct (%)99.5%
Missing6557
Missing (%)65.6%
Memory size156.2 KiB
2023-12-12T17:55:28.086232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length53
Mean length28.713041
Min length8

Characters and Unicode

Total characters98859
Distinct characters667
Distinct categories17 ?
Distinct scripts4 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3411 ?
Unique (%)99.1%

Sample

1st row2022년+노인+공익활동+사업계획서(주정차).hwp
2nd row2022년+수요처안전모니터링+사업계획서(수정).hwp
3rd row2022년+정담식당사업계획서-2.hwp
4th row2022년+노인+공익활동+사업계획서(노선생이간다) (3).hwp
5th row수정_2022년 노인 공익활동 사업계획서(A-07 초등학교급식도우미-6명)_신안군20200104.hwp
ValueCountFrequency (%)
2022년 1307
 
15.4%
사업계획서.hwp 316
 
3.7%
사업계획서 165
 
1.9%
공익활동 148
 
1.7%
사회서비스형 109
 
1.3%
노인 99
 
1.2%
변경 75
 
0.9%
74
 
0.9%
22년 66
 
0.8%
사업계획서(변경).hwp 66
 
0.8%
Other values (4304) 6039
71.3%
2023-12-12T17:55:28.622512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 9746
 
9.9%
5046
 
5.1%
4436
 
4.5%
. 4223
 
4.3%
3599
 
3.6%
p 3443
 
3.5%
0 3391
 
3.4%
h 3375
 
3.4%
w 3375
 
3.4%
3276
 
3.3%
Other values (657) 54949
55.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53841
54.5%
Decimal Number 15937
 
16.1%
Lowercase Letter 10423
 
10.5%
Space Separator 5046
 
5.1%
Other Punctuation 4286
 
4.3%
Close Punctuation 2586
 
2.6%
Open Punctuation 2548
 
2.6%
Math Symbol 2382
 
2.4%
Connector Punctuation 920
 
0.9%
Dash Punctuation 709
 
0.7%
Other values (7) 181
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4436
 
8.2%
3599
 
6.7%
3276
 
6.1%
2860
 
5.3%
2826
 
5.2%
2577
 
4.8%
1325
 
2.5%
1312
 
2.4%
1005
 
1.9%
991
 
1.8%
Other values (582) 29634
55.0%
Lowercase Letter
ValueCountFrequency (%)
p 3443
33.0%
h 3375
32.4%
w 3375
32.4%
f 69
 
0.7%
d 64
 
0.6%
v 14
 
0.1%
x 13
 
0.1%
e 13
 
0.1%
i 7
 
0.1%
r 7
 
0.1%
Other values (12) 43
 
0.4%
Uppercase Letter
ValueCountFrequency (%)
A 19
16.8%
E 15
13.3%
M 13
11.5%
T 12
10.6%
C 12
10.6%
B 12
10.6%
K 9
8.0%
O 7
 
6.2%
S 7
 
6.2%
V 3
 
2.7%
Other values (3) 4
 
3.5%
Decimal Number
ValueCountFrequency (%)
2 9746
61.2%
0 3391
 
21.3%
1 1444
 
9.1%
3 307
 
1.9%
4 219
 
1.4%
5 204
 
1.3%
6 199
 
1.2%
8 162
 
1.0%
9 137
 
0.9%
7 128
 
0.8%
Other Punctuation
ValueCountFrequency (%)
. 4223
98.5%
, 35
 
0.8%
' 12
 
0.3%
# 12
 
0.3%
& 3
 
0.1%
! 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
+ 2371
99.5%
~ 5
 
0.2%
5
 
0.2%
= 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 2568
99.3%
] 17
 
0.7%
} 1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
49
92.5%
3
 
5.7%
1
 
1.9%
Other Number
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Open Punctuation
ValueCountFrequency (%)
( 2531
99.3%
[ 17
 
0.7%
Connector Punctuation
ValueCountFrequency (%)
_ 918
99.8%
_ 2
 
0.2%
Letter Number
ValueCountFrequency (%)
4
57.1%
3
42.9%
Space Separator
ValueCountFrequency (%)
5046
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 709
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53830
54.5%
Common 34475
34.9%
Latin 10543
 
10.7%
Han 11
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4436
 
8.2%
3599
 
6.7%
3276
 
6.1%
2860
 
5.3%
2826
 
5.2%
2577
 
4.8%
1325
 
2.5%
1312
 
2.4%
1005
 
1.9%
991
 
1.8%
Other values (577) 29623
55.0%
Common
ValueCountFrequency (%)
2 9746
28.3%
5046
14.6%
. 4223
12.2%
0 3391
 
9.8%
) 2568
 
7.4%
( 2531
 
7.3%
+ 2371
 
6.9%
1 1444
 
4.2%
_ 918
 
2.7%
- 709
 
2.1%
Other values (28) 1528
 
4.4%
Latin
ValueCountFrequency (%)
p 3443
32.7%
h 3375
32.0%
w 3375
32.0%
f 69
 
0.7%
d 64
 
0.6%
A 19
 
0.2%
E 15
 
0.1%
v 14
 
0.1%
x 13
 
0.1%
M 13
 
0.1%
Other values (27) 143
 
1.4%
Han
ValueCountFrequency (%)
7
63.6%
1
 
9.1%
1
 
9.1%
1
 
9.1%
1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53830
54.5%
ASCII 44944
45.5%
Misc Symbols 52
 
0.1%
CJK 10
 
< 0.1%
Number Forms 7
 
< 0.1%
Arrows 5
 
< 0.1%
Enclosed Alphanum 5
 
< 0.1%
None 2
 
< 0.1%
Punctuation 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 9746
21.7%
5046
11.2%
. 4223
9.4%
p 3443
 
7.7%
0 3391
 
7.5%
h 3375
 
7.5%
w 3375
 
7.5%
) 2568
 
5.7%
( 2531
 
5.6%
+ 2371
 
5.3%
Other values (53) 4875
10.8%
Hangul
ValueCountFrequency (%)
4436
 
8.2%
3599
 
6.7%
3276
 
6.1%
2860
 
5.3%
2826
 
5.2%
2577
 
4.8%
1325
 
2.5%
1312
 
2.4%
1005
 
1.9%
991
 
1.8%
Other values (577) 29623
55.0%
Misc Symbols
ValueCountFrequency (%)
49
94.2%
3
 
5.8%
CJK
ValueCountFrequency (%)
7
70.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
Arrows
ValueCountFrequency (%)
5
100.0%
Number Forms
ValueCountFrequency (%)
4
57.1%
3
42.9%
Enclosed Alphanum
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
None
ValueCountFrequency (%)
_ 2
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
8886 
True
1114 
ValueCountFrequency (%)
False 8886
88.9%
True 1114
 
11.1%
2023-12-12T17:55:28.764858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Sample

사업유형사업번호사업계획변경순번사업년도계속사업여부계속사업시작연도사업유형코드비예산여부특수사업명코드사업명관활시도명시군구코드관할시군구기관ID수행기관시도명수행기관시군구사업기간시작일사업기간종료일사업계획서상태코드목표일자리수최초등록첨부파일최근승인첨부파일삭제여부
3915공익활동형2022-0842322022Y2017주정차질서 계도 봉사(A-17)N105001불법주정차계도사업광주광역시2917000000북구49광주광역시북구2022-01-042022-12-30승인완료3322022년+노인+공익활동+사업계획서(주정차).hwp2022년+노인+공익활동+사업계획서(주정차).hwpN
3504공익활동형2022-0693022022Y2018도서관 봉사(A-12)N105001사서도우미인천광역시2824500000계양구6066인천광역시계양구2022-01-102022-12-16승인완료120사서도우미.hwp<NA>N
10346공익활동형2022-0137212022Y2013지역아동센터 봉사(A-11)N105001지역아동센터환경정화전라남도4613000000여수시173전라남도여수시2022-01-012022-11-30임시169(공익활동)2022년 지역아동센터환경정화 사업계획서.hwp<NA>Y
10787사회서비스형2022-0038252022Y2022안전관리지원(B-10)N105001수요처안전모니터링경기도4131000000구리시1871경기도구리시2022-01-032022-12-09승인완료102022년 수요처안전모니터링 사업계획서.hwp2022년+수요처안전모니터링+사업계획서(수정).hwpN
8047공익활동형2022-0298412022Y2022지역사회 환경개선 봉사(A-16)N105001우리마을가꾸기인천광역시2871000000강화군338083인천광역시강화군2022-01-242022-12-30임시2102022년+노인+공익활동+사업계획서+(우리마을가꾸기)+210명.hwp<NA>Y
2751공익활동형2022-1018012022Y2022공공의료 및 복지시설 봉사(A-15)N105001경로당 알리미부산광역시2626000000동래구830부산광역시동래구2022-01-032022-12-30승인완료1522년 경로당알리미 계획서.hwp<NA>N
7080공익활동형2022-0157222022Y2021노노케어(A-01)N105001노노케어경기도4157000000김포시1912경기도김포시2022-01-032022-12-31승인완료22(수정)2. 2022년 노노케어 사업계획서(A-01).hwp<NA>N
2382공익활동형2022-3407112022N2022지역사회 환경개선 봉사(A-16)N105001아름다운마을가꾸기 Ⅳ전라북도4519000000남원시890전라북도남원시2022-03-072022-12-31승인완료802022년 아름다운마을가꾸기4사업단 (공익형 사업계획서.hwp<NA>N
5065시장형2022-0765542022Y2021음식점(E-09)N105001정담식당경상남도4882000000고성군246083경상남도고성군2022-01-032022-12-31승인완료182022년 정담식당사업계획서.hwp2022년+정담식당사업계획서-2.hwpN
10052공익활동형2022-0288922022Y2015건강체조 취미생활 지도(A-20)N105001노선생이간다충청남도4477000000서천군2455충청남도서천군2022-02-012022-12-31승인완료152022년 노인 공익활동 사업계획서(노선생이간다).hwp2022년+노인+공익활동+사업계획서(노선생이간다) (3).hwpN
사업유형사업번호사업계획변경순번사업년도계속사업여부계속사업시작연도사업유형코드비예산여부특수사업명코드사업명관활시도명시군구코드관할시군구기관ID수행기관시도명수행기관시군구사업기간시작일사업기간종료일사업계획서상태코드목표일자리수최초등록첨부파일최근승인첨부파일삭제여부
9456시장형2022-0414132022Y2020카페(E-08)N105001품애경상북도4775000000청송군1793경상북도청송군2022-01-032022-12-30승인완료62022년 시장형사업단 품애 사업계획서.hwp<NA>N
385공익활동형2022-1036712022Y2020장애인 봉사(A-02)N105001장애아동청소년 특수학교 지원사업경기도4128000000고양시218083경기도고양시2022-01-012022-12-31승인완료152022년 노인 공익활동 사업계획서(장애특수학교지원사업).hwp<NA>N
7510시장형2022-0332832022Y2012카페(E-08)N105001더카페 목포이랜드점전라남도4611000000목포시2386전라남도목포시2022-01-012022-12-31승인완료162022년 사업계획서(시장형 더카페사업).hwp<NA>N
339공익활동형2022-1011012022Y2020생활시설이용자 지원봉사(A-06)N105001행복한 밥상전라북도4521000000김제시2356전라북도김제시2022-01-032022-11-30승인완료1022_ 행복한 밥상.hwp<NA>N
2948공익활동형2022-1058512022Y2015공공의료 및 복지시설 봉사(A-15)N105001복지시설관리지원전라북도4521000000김제시520전라북도김제시2022-01-032022-12-30승인완료1092022년 노인 공익활동 사업계획서(복지시설).hwp<NA>N
3093공익활동형2022-5105612022N2022지역사회 환경개선 봉사(A-16)N105001종이팩재활용사업제주특별자치도5011000000제주시178제주특별자치도제주시2022-05-012022-12-30승인완료2422-종이팩재활용사업.hwp<NA>N
6381공익활동형2022-0697212022Y2007지역사회 환경개선 봉사(A-16)N1050012022노인사회활동(거리환경지킴이)경기도4146000000용인시1159경기도용인시2022-02-012022-12-31승인완료1102022-거리환경지킴이 사업계획서.hwp<NA>N
3728공익활동형2022-0542922022Y2021공원 놀이터 등 공공시설 봉사(A-13)N105001공중전화관리지원경상북도4721000000영주시2617경상북도영주시2022-01-032022-12-31승인완료100공중전화관리지원(2022.hwp공중전화관리지원(2022).hwpN
4420공익활동형2022-0750512022Y2014공공의료 및 복지시설 봉사(A-15)N105001복지시설봉사대전광역시3017000000서구1085대전광역시서구2022-01-172022-12-30승인완료9222년 복지시설.hwp<NA>N
7396공익활동형2022-0231812022Y2022지역사회 환경개선 봉사(A-16)N105001아름다운 농촌마을만들기전라북도4514000000익산시2656전라북도익산시2022-01-102022-11-30승인완료1002022년-아름다운농촌마을만들기.hwp<NA>N