Overview

Dataset statistics

Number of variables: 15
Number of observations: 2361
Missing cells: 177
Missing cells (%): 0.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 283.7 KiB
Average record size in memory: 123.1 B
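
The missing-cell percentage can be re-derived from the other figures in this report. The sketch below assumes that the only columns with missing cells are the three named in the Missing alerts (59 each); the variable names are illustrative, not part of the report.

```python
# Figures copied from the overview table and the Missing alerts.
n_variables = 15
n_observations = 2361
missing_per_column = {
    "이호조정책사업명": 59,
    "이호조단위사업명": 59,
    "이호조세부사업명": 59,
}

total_cells = n_variables * n_observations
missing_cells = sum(missing_per_column.values())
missing_pct = 100 * missing_cells / total_cells

print(total_cells)            # 35415
print(missing_cells)          # 177
print(f"{missing_pct:.1f}%")  # 0.5%
```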

Variable types

Numeric: 3
Boolean: 4
Text: 4
DateTime: 2
Categorical: 2

Dataset

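Description: Data on the subsidy projects and subsidy recipients managed in Dangjin City's Integrated Local Subsidy Management System (당진시 지방보조금통합관리시스템). It provides fields such as 사업자관리번호 (recipient management number), 연차사업진행여부 (multi-year project in progress), 회계연도 (fiscal year), 사업명 (project name), and 부서코드 (department code).
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=322&beforeMenuCd=DOM_000000201001001000&publicdatapk=15091599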

Alerts

마감취소요청여부 has constant value "" [Constant]
사업관리번호 is highly overall correlated with 회계연도 [High correlation]
회계연도 is highly overall correlated with 사업관리번호 [High correlation]
부서코드 is highly overall correlated with 이호조정책사업부서명 [High correlation]
마감여부 is highly overall correlated with 이호조정책사업부서명 [High correlation]
이호조정책사업부서명 is highly overall correlated with 부서코드 and 1 other field [High correlation]
연차사업진행여부 is highly imbalanced (93.2%) [Imbalance]
삭제요청여부 is highly imbalanced (99.5%) [Imbalance]
이호조정책사업명 has 59 (2.5%) missing values [Missing]
이호조단위사업명 has 59 (2.5%) missing values [Missing]
이호조세부사업명 has 59 (2.5%) missing values [Missing]
사업관리번호 has unique values [Unique]
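
The imbalance percentages in the alerts are consistent with a normalized-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the class shares and k is the number of classes. The sketch below is an assumption about how the score is derived, not a copy of the profiler's code; `imbalance_score` is a hypothetical helper. It reproduces both reported values from the True/False counts listed for the two Boolean variables.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts (k >= 2).

    0.0 means perfectly balanced classes; values near 1.0 mean one
    class dominates.
    """
    k = len(counts)
    if k < 2:
        raise ValueError("need at least two classes")
    total = sum(counts)
    # Shannon entropy in bits; empty classes contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# False/True counts taken from the variable sections of this report.
print(f"{imbalance_score([2342, 19]):.1%}")  # 연차사업진행여부 -> 93.2%
print(f"{imbalance_score([2360, 1]):.1%}")   # 삭제요청여부 -> 99.5%
```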

Reproduction

Analysis started: 2024-01-09 23:08:24.610019
Analysis finished: 2024-01-09 23:08:27.376713
Duration: 2.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
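
As a quick consistency check, the reported duration follows from the two timestamps. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-01-09 23:08:24.610019")
finished = datetime.fromisoformat("2024-01-09 23:08:27.376713")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 2.77 seconds
```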

Variables

사업관리번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 2361
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1494.8344
Minimum: 36
Maximum: 2861
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 20.9 KiB

Quantile statistics

Minimum: 36
5th percentile: 234
Q1: 765
Median: 1531
Q3: 2170
95th percentile: 2711
Maximum: 2861
Range: 2825
Interquartile range (IQR): 1405

Descriptive statistics

Standard deviation: 801.04446
Coefficient of variation (CV): 0.53587505
Kurtosis: -1.1944269
Mean: 1494.8344
Median Absolute Deviation (MAD): 704
Skewness: -0.063087624
Sum: 3529304
Variance: 641672.23
Monotonicity: Not monotonic
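
Several of the figures for 사업관리번호 are derived from one another. The sketch below recomputes them from the reported minimum, maximum, quartiles, sum, and standard deviation, using the usual definitions (range = max - min, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared). The variable names are illustrative.

```python
# Inputs copied from the statistics reported for 사업관리번호.
minimum, maximum = 36, 2861
q1, q3 = 765, 2170
total, n = 3529304, 2361
std = 801.04446

value_range = maximum - minimum  # 2825
iqr = q3 - q1                    # 1405
mean = total / n                 # about 1494.8344
cv = std / mean                  # about 0.535875
variance = std ** 2              # about 641672.23

print(value_range, iqr)
print(f"{mean:.4f} {cv:.6f} {variance:.2f}")
```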
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
187 | 1 | < 0.1%
1969 | 1 | < 0.1%
1953 | 1 | < 0.1%
1954 | 1 | < 0.1%
1955 | 1 | < 0.1%
1956 | 1 | < 0.1%
1957 | 1 | < 0.1%
1958 | 1 | < 0.1%
1959 | 1 | < 0.1%
1960 | 1 | < 0.1%
Other values (2351) | 2351 | 99.6%

Value | Count | Frequency (%)
36 | 1 | < 0.1%
41 | 1 | < 0.1%
42 | 1 | < 0.1%
43 | 1 | < 0.1%
45 | 1 | < 0.1%
46 | 1 | < 0.1%
59 | 1 | < 0.1%
79 | 1 | < 0.1%
83 | 1 | < 0.1%
84 | 1 | < 0.1%

Value | Count | Frequency (%)
2861 | 1 | < 0.1%
2860 | 1 | < 0.1%
2859 | 1 | < 0.1%
2858 | 1 | < 0.1%
2857 | 1 | < 0.1%
2856 | 1 | < 0.1%
2855 | 1 | < 0.1%
2854 | 1 | < 0.1%
2853 | 1 | < 0.1%
2852 | 1 | < 0.1%

연차사업진행여부
Boolean

IMBALANCE 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Value | Count | Frequency (%)
False | 2342 | 99.2%
True | 19 | 0.8%

회계연도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 8
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2018.0203
Minimum: 2014
Maximum: 2021
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 20.9 KiB

Quantile statistics

Minimum: 2014
5th percentile: 2016
Q1: 2016
Median: 2018
Q3: 2019
95th percentile: 2021
Maximum: 2021
Range: 7
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.6432997
Coefficient of variation (CV): 0.00081431277
Kurtosis: -1.0736462
Mean: 2018.0203
Median Absolute Deviation (MAD): 1
Skewness: 0.1509265
Sum: 4764546
Variance: 2.700434
Monotonicity: Not monotonic
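
Because 회계연도 has only eight distinct values, its sum, mean, variance, and standard deviation can be rebuilt from the per-year counts listed for this variable. The sketch below assumes the reported variance is the sample variance (divisor n - 1), which matches the reported figure; `year_counts` simply restates the report's frequency table.

```python
import math

# Per-year counts copied from the 회계연도 frequency table.
year_counts = {
    2014: 3, 2015: 32, 2016: 583, 2017: 304,
    2018: 528, 2019: 369, 2020: 369, 2021: 173,
}

n = sum(year_counts.values())
total = sum(year * count for year, count in year_counts.items())
mean = total / n
# Sample variance: squared deviations weighted by count, divided by n - 1.
variance = sum(c * (y - mean) ** 2 for y, c in year_counts.items()) / (n - 1)
std = math.sqrt(variance)

print(n, total)                                  # 2361 4764546
print(f"{mean:.4f} {variance:.4f} {std:.4f}")
```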
Histogram with fixed size bins (bins=8)
Value | Count | Frequency (%)
2016 | 583 | 24.7%
2018 | 528 | 22.4%
2019 | 369 | 15.6%
2020 | 369 | 15.6%
2017 | 304 | 12.9%
2021 | 173 | 7.3%
2015 | 32 | 1.4%
2014 | 3 | 0.1%

Value | Count | Frequency (%)
2014 | 3 | 0.1%
2015 | 32 | 1.4%
2016 | 583 | 24.7%
2017 | 304 | 12.9%
2018 | 528 | 22.4%
2019 | 369 | 15.6%
2020 | 369 | 15.6%
2021 | 173 | 7.3%

Value | Count | Frequency (%)
2021 | 173 | 7.3%
2020 | 369 | 15.6%
2019 | 369 | 15.6%
2018 | 528 | 22.4%
2017 | 304 | 12.9%
2016 | 583 | 24.7%
2015 | 32 | 1.4%
2014 | 3 | 0.1%

사업명
Text

Distinct: 1979
Distinct (%): 83.8%
Missing: 0
Missing (%): 0.0%
Memory size: 18.6 KiB

Length

Max length: 38
Median length: 31
Mean length: 15.098687
Min length: 1

Characters and Unicode

Total characters: 35648
Distinct characters: 613
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
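
The category labels used in the character tables (Other Letter, Decimal Number, Open Punctuation, and so on) correspond to Unicode general categories. The sketch below shows one way to tally them with Python's standard `unicodedata` module, applied to the first sample value of this variable. `category_counts` and `CATEGORY_LABELS` are illustrative helpers, not the profiler's own code.

```python
import unicodedata
from collections import Counter

# A few two-letter general-category codes and the labels used in this report.
CATEGORY_LABELS = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Zs": "Space Separator",
}

def category_counts(text):
    """Count characters in `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

name = "2016년딸기고설배드양액재배시설지원사업"
counts = category_counts(name)
for code, count in counts.most_common():
    print(CATEGORY_LABELS.get(code, code), count)
```

Hangul syllables fall under "Lo" (Other Letter) and ASCII digits under "Nd" (Decimal Number), which is why those two categories dominate the tables for this variable.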

Unique

Unique: 1724
Unique (%): 73.0%
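
"Distinct" and "Unique" measure different things: distinct counts how many different values occur at all, while unique counts the values that occur exactly once. The sketch below illustrates the difference on a small made-up list and then re-derives the two percentages from the reported counts. `distinct_and_unique` is a hypothetical helper.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    freq = Counter(values)
    distinct = len(freq)
    unique = sum(1 for count in freq.values() if count == 1)
    return distinct, unique

# Toy data: "a" repeats, so it is distinct but not unique.
print(distinct_and_unique(["a", "a", "b", "c"]))  # (3, 2)

# Percentages for 사업명, from the counts reported above.
n_observations = 2361
distinct_pct = 100 * 1979 / n_observations
unique_pct = 100 * 1724 / n_observations
print(f"{distinct_pct:.1f}% {unique_pct:.1f}%")   # 83.8% 73.0%
```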

Sample

1st row: 2016년딸기고설배드양액재배시설지원사업
2nd row: 경로당도비보조기능보강사업
3rd row: 과수원예농가저온저장고설치지원사업
4th row: 건강가정지원센터운영
5th row: 다문화가족지원센터운영지원
Value | Count | Frequency (%)
원예작물병해충종합방제시범 | 9 | 0.4%
고추품종비교시범사업 | 6 | 0.3%
농업경영인회역량강화교육지원 | 5 | 0.2%
4-h회원영농정착지원시범 | 5 | 0.2%
청년농업인4-h회원영농정착지원시범 | 5 | 0.2%
바르게살기운동당진2동위원회운영비지원 | 4 | 0.2%
한우우량정액지원 | 4 | 0.2%
젖소고온면역증강제지원 | 4 | 0.2%
면천두견주보존단체전승지원 | 4 | 0.2%
한국예총당진지회운영지원 | 4 | 0.2%
Other values (1968) | 2311 | 97.9%

Most occurring characters

ValueCountFrequency (%)
1478
 
4.1%
1316
 
3.7%
1150
 
3.2%
926
 
2.6%
890
 
2.5%
791
 
2.2%
2 598
 
1.7%
570
 
1.6%
1 519
 
1.5%
0 510
 
1.4%
Other values (603) 26900
75.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32360
90.8%
Decimal Number 2204
 
6.2%
Close Punctuation 377
 
1.1%
Open Punctuation 376
 
1.1%
Uppercase Letter 197
 
0.6%
Other Punctuation 59
 
0.2%
Dash Punctuation 40
 
0.1%
Lowercase Letter 30
 
0.1%
Math Symbol 4
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1478
 
4.6%
1316
 
4.1%
1150
 
3.6%
926
 
2.9%
890
 
2.8%
791
 
2.4%
570
 
1.8%
487
 
1.5%
422
 
1.3%
416
 
1.3%
Other values (546) 23914
73.9%
Uppercase Letter
ValueCountFrequency (%)
H 46
23.4%
C 35
17.8%
A 21
10.7%
T 20
10.2%
P 20
10.2%
I 12
 
6.1%
R 9
 
4.6%
M 7
 
3.6%
V 6
 
3.0%
G 5
 
2.5%
Other values (9) 16
 
8.1%
Lowercase Letter
ValueCountFrequency (%)
a 4
13.3%
c 4
13.3%
l 3
10.0%
e 3
10.0%
r 3
10.0%
p 2
6.7%
h 2
6.7%
o 2
6.7%
i 2
6.7%
v 1
 
3.3%
Other values (4) 4
13.3%
Decimal Number
ValueCountFrequency (%)
2 598
27.1%
1 519
23.5%
0 510
23.1%
6 146
 
6.6%
8 126
 
5.7%
7 97
 
4.4%
4 67
 
3.0%
9 67
 
3.0%
3 42
 
1.9%
5 32
 
1.5%
Other Punctuation
ValueCountFrequency (%)
. 41
69.5%
· 8
 
13.6%
" 7
 
11.9%
; 1
 
1.7%
# 1
 
1.7%
& 1
 
1.7%
Close Punctuation
ValueCountFrequency (%)
) 376
99.7%
] 1
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 375
99.7%
[ 1
 
0.3%
Math Symbol
ValueCountFrequency (%)
+ 3
75.0%
~ 1
 
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32360
90.8%
Common 3061
 
8.6%
Latin 227
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1478
 
4.6%
1316
 
4.1%
1150
 
3.6%
926
 
2.9%
890
 
2.8%
791
 
2.4%
570
 
1.8%
487
 
1.5%
422
 
1.3%
416
 
1.3%
Other values (546) 23914
73.9%
Latin
ValueCountFrequency (%)
H 46
20.3%
C 35
15.4%
A 21
9.3%
T 20
8.8%
P 20
8.8%
I 12
 
5.3%
R 9
 
4.0%
M 7
 
3.1%
V 6
 
2.6%
G 5
 
2.2%
Other values (23) 46
20.3%
Common
ValueCountFrequency (%)
2 598
19.5%
1 519
17.0%
0 510
16.7%
) 376
12.3%
( 375
12.3%
6 146
 
4.8%
8 126
 
4.1%
7 97
 
3.2%
4 67
 
2.2%
9 67
 
2.2%
Other values (14) 180
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32360
90.8%
ASCII 3280
 
9.2%
None 8
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1478
 
4.6%
1316
 
4.1%
1150
 
3.6%
926
 
2.9%
890
 
2.8%
791
 
2.4%
570
 
1.8%
487
 
1.5%
422
 
1.3%
416
 
1.3%
Other values (546) 23914
73.9%
ASCII
ValueCountFrequency (%)
2 598
18.2%
1 519
15.8%
0 510
15.5%
) 376
11.5%
( 375
11.4%
6 146
 
4.5%
8 126
 
3.8%
7 97
 
3.0%
4 67
 
2.0%
9 67
 
2.0%
Other values (46) 399
12.2%
None
ValueCountFrequency (%)
· 8
100.0%

부서코드
Real number (ℝ)

HIGH CORRELATION 

Distinct: 30
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1031950.5
Minimum: 1001002
Maximum: 2105028
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 20.9 KiB

Quantile statistics

Minimum: 1001002
5th percentile: 1003002
Q1: 1005060
Median: 1005154
Q3: 1009007
95th percentile: 1105023
Maximum: 2105028
Range: 1104026
Interquartile range (IQR): 3947

Descriptive statistics

Standard deviation: 86440.887
Coefficient of variation (CV): 0.083764569
Kurtosis: 118.17124
Mean: 1031950.5
Median Absolute Deviation (MAD): 2850
Skewness: 9.8374628
Sum: 2.4364351 × 10^9
Variance: 7.472027 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=30)
Value | Count | Frequency (%)
1009007 | 390 | 16.5%
1005040 | 378 | 16.0%
1105020 | 370 | 15.7%
1005060 | 332 | 14.1%
1005154 | 189 | 8.0%
1005070 | 98 | 4.2%
1001002 | 94 | 4.0%
1105023 | 86 | 3.6%
1008004 | 74 | 3.1%
1003002 | 56 | 2.4%
Other values (20) | 294 | 12.5%

Value | Count | Frequency (%)
1001002 | 94 | 4.0%
1003002 | 56 | 2.4%
1005010 | 14 | 0.6%
1005030 | 20 | 0.8%
1005040 | 378 | 16.0%
1005050 | 18 | 0.8%
1005060 | 332 | 14.1%
1005070 | 98 | 4.2%
1005080 | 28 | 1.2%
1005090 | 1 | < 0.1%

Value | Count | Frequency (%)
2105028 | 2 | 0.1%
2105024 | 10 | 0.4%
1105024 | 22 | 0.9%
1105023 | 86 | 3.6%
1105020 | 370 | 15.7%
1105010 | 3 | 0.1%
1017001 | 28 | 1.2%
1013001 | 2 | 0.1%
1012003 | 50 | 2.1%
1010001 | 5 | 0.2%

사업시작일
DateTime

Distinct: 465
Distinct (%): 19.7%
Missing: 0
Missing (%): 0.0%
Memory size: 18.6 KiB
Minimum: 2013-01-01 00:00:00
Maximum: 2021-09-01 00:00:00
Histogram with fixed size bins (bins=50)

사업종료일
DateTime

Distinct: 302
Distinct (%): 12.8%
Missing: 0
Missing (%): 0.0%
Memory size: 18.6 KiB
Minimum: 2014-12-31 00:00:00
Maximum: 2021-12-31 00:00:00
Histogram with fixed size bins (bins=50)

마감여부
Boolean

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Value | Count | Frequency (%)
False | 1602 | 67.9%
True | 759 | 32.1%

마감취소요청여부
Boolean

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Value | Count | Frequency (%)
False | 2361 | 100.0%

삭제요청여부
Boolean

IMBALANCE 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Value | Count | Frequency (%)
False | 2360 | > 99.9%
True | 1 | < 0.1%

이호조정책사업명
Text

Distinct: 55
Distinct (%): 2.4%
Missing: 59
Missing (%): 2.5%
Memory size: 18.6 KiB

Length

Max length: 23
Median length: 9
Mean length: 10.126412
Min length: 4

Characters and Unicode

Total characters: 23311
Distinct characters: 156
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9
Unique (%): 0.4%

Sample

1st row: 농업 경쟁력 강화
2nd row: 노인복지증진
3rd row: 농업 경쟁력 강화
4th row: 보육.가족지원 및 여성복지 증진
5th row: 보육.가족지원 및 여성복지 증진
ValueCountFrequency (%)
394
 
6.2%
강화 381
 
6.0%
농업 374
 
5.9%
경쟁력 374
 
5.9%
축산경쟁력강화 343
 
5.4%
문화예술 267
 
4.2%
도시육성 267
 
4.2%
조성 231
 
3.6%
새마을 228
 
3.6%
정신 228
 
3.6%
Other values (115) 3291
51.6%

Most occurring characters

ValueCountFrequency (%)
4076
 
17.5%
1180
 
5.1%
1076
 
4.6%
938
 
4.0%
772
 
3.3%
734
 
3.1%
717
 
3.1%
717
 
3.1%
559
 
2.4%
531
 
2.3%
Other values (146) 12011
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19222
82.5%
Space Separator 4076
 
17.5%
Other Punctuation 9
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1180
 
6.1%
1076
 
5.6%
938
 
4.9%
772
 
4.0%
734
 
3.8%
717
 
3.7%
717
 
3.7%
559
 
2.9%
531
 
2.8%
517
 
2.7%
Other values (140) 11481
59.7%
Other Punctuation
ValueCountFrequency (%)
. 4
44.4%
· 3
33.3%
/ 2
22.2%
Space Separator
ValueCountFrequency (%)
4076
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19222
82.5%
Common 4089
 
17.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1180
 
6.1%
1076
 
5.6%
938
 
4.9%
772
 
4.0%
734
 
3.8%
717
 
3.7%
717
 
3.7%
559
 
2.9%
531
 
2.8%
517
 
2.7%
Other values (140) 11481
59.7%
Common
ValueCountFrequency (%)
4076
99.7%
. 4
 
0.1%
· 3
 
0.1%
) 2
 
< 0.1%
/ 2
 
< 0.1%
( 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19203
82.4%
ASCII 4086
 
17.5%
Compat Jamo 19
 
0.1%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4076
99.8%
. 4
 
0.1%
) 2
 
< 0.1%
/ 2
 
< 0.1%
( 2
 
< 0.1%
Hangul
ValueCountFrequency (%)
1180
 
6.1%
1076
 
5.6%
938
 
4.9%
772
 
4.0%
734
 
3.8%
717
 
3.7%
717
 
3.7%
559
 
2.9%
531
 
2.8%
517
 
2.7%
Other values (139) 11462
59.7%
Compat Jamo
ValueCountFrequency (%)
19
100.0%
None
ValueCountFrequency (%)
· 3
100.0%

이호조단위사업명
Text

Distinct: 106
Distinct (%): 4.6%
Missing: 59
Missing (%): 2.5%
Memory size: 18.6 KiB

Length

Max length: 23
Median length: 19
Mean length: 10.43788
Min length: 4

Characters and Unicode

Total characters: 24028
Distinct characters: 199
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 21
Unique (%): 0.9%

Sample

1st row: 원예특작 생산기반 구축
2nd row: 노인단체 지원 및 경로당 활성화 지원
3rd row: 원예특작 생산기반 구축
4th row: 건강가정 지원
5th row: 다문화가족 지원
ValueCountFrequency (%)
385
 
6.5%
지원 273
 
4.6%
기술보급 257
 
4.3%
구축 238
 
4.0%
육성 233
 
3.9%
생산기반 229
 
3.9%
정신 228
 
3.8%
도의 228
 
3.8%
고취 228
 
3.8%
문화예술 200
 
3.4%
Other values (178) 3445
58.0%

Most occurring characters

ValueCountFrequency (%)
3644
 
15.2%
954
 
4.0%
845
 
3.5%
791
 
3.3%
785
 
3.3%
777
 
3.2%
776
 
3.2%
629
 
2.6%
549
 
2.3%
524
 
2.2%
Other values (189) 13754
57.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20269
84.4%
Space Separator 3644
 
15.2%
Other Punctuation 83
 
0.3%
Decimal Number 12
 
< 0.1%
Close Punctuation 10
 
< 0.1%
Open Punctuation 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
954
 
4.7%
845
 
4.2%
791
 
3.9%
785
 
3.9%
777
 
3.8%
776
 
3.8%
629
 
3.1%
549
 
2.7%
524
 
2.6%
488
 
2.4%
Other values (183) 13151
64.9%
Decimal Number
ValueCountFrequency (%)
6 9
75.0%
3 3
 
25.0%
Space Separator
ValueCountFrequency (%)
3644
100.0%
Other Punctuation
ValueCountFrequency (%)
, 83
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20269
84.4%
Common 3759
 
15.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
954
 
4.7%
845
 
4.2%
791
 
3.9%
785
 
3.9%
777
 
3.8%
776
 
3.8%
629
 
3.1%
549
 
2.7%
524
 
2.6%
488
 
2.4%
Other values (183) 13151
64.9%
Common
ValueCountFrequency (%)
3644
96.9%
, 83
 
2.2%
) 10
 
0.3%
( 10
 
0.3%
6 9
 
0.2%
3 3
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20269
84.4%
ASCII 3759
 
15.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3644
96.9%
, 83
 
2.2%
) 10
 
0.3%
( 10
 
0.3%
6 9
 
0.2%
3 3
 
0.1%
Hangul
ValueCountFrequency (%)
954
 
4.7%
845
 
4.2%
791
 
3.9%
785
 
3.9%
777
 
3.8%
776
 
3.8%
629
 
3.1%
549
 
2.7%
524
 
2.6%
488
 
2.4%
Other values (183) 13151
64.9%

이호조세부사업명
Text

Distinct: 609
Distinct (%): 26.5%
Missing: 59
Missing (%): 2.5%
Memory size: 18.6 KiB

Length

Max length: 35
Median length: 29
Mean length: 12.342311
Min length: 4

Characters and Unicode

Total characters: 28412
Distinct characters: 421
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 299
Unique (%): 13.0%

Sample

1st row: 딸기 재배영농자재 지원
2nd row: 경로당 도비보조 기능보강사업
3rd row: FTA대응 과수사업농가지원
4th row: 건강가정지원센터 운영
5th row: 다문화가족지원센터 운영 지원
ValueCountFrequency (%)
지원 537
 
9.1%
육성 140
 
2.4%
122
 
2.1%
활성화 112
 
1.9%
농업인 100
 
1.7%
바르게살기운동단체 99
 
1.7%
고품질 95
 
1.6%
사업 89
 
1.5%
운영 89
 
1.5%
시범사업 69
 
1.2%
Other values (984) 4438
75.3%

Most occurring characters

ValueCountFrequency (%)
3588
 
12.6%
1378
 
4.9%
1201
 
4.2%
769
 
2.7%
645
 
2.3%
634
 
2.2%
592
 
2.1%
544
 
1.9%
531
 
1.9%
488
 
1.7%
Other values (411) 18042
63.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 23890
84.1%
Space Separator 3588
 
12.6%
Close Punctuation 290
 
1.0%
Open Punctuation 290
 
1.0%
Uppercase Letter 167
 
0.6%
Decimal Number 102
 
0.4%
Dash Punctuation 47
 
0.2%
Other Punctuation 37
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1378
 
5.8%
1201
 
5.0%
769
 
3.2%
645
 
2.7%
634
 
2.7%
592
 
2.5%
544
 
2.3%
531
 
2.2%
488
 
2.0%
477
 
2.0%
Other values (378) 16631
69.6%
Uppercase Letter
ValueCountFrequency (%)
H 52
31.1%
A 27
16.2%
C 22
13.2%
T 17
 
10.2%
F 14
 
8.4%
P 10
 
6.0%
I 7
 
4.2%
V 5
 
3.0%
R 3
 
1.8%
O 2
 
1.2%
Other values (5) 8
 
4.8%
Decimal Number
ValueCountFrequency (%)
4 46
45.1%
1 15
 
14.7%
3 14
 
13.7%
0 8
 
7.8%
6 6
 
5.9%
5 4
 
3.9%
2 3
 
2.9%
9 3
 
2.9%
7 2
 
2.0%
8 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
, 33
89.2%
/ 2
 
5.4%
. 2
 
5.4%
Space Separator
ValueCountFrequency (%)
3588
100.0%
Close Punctuation
ValueCountFrequency (%)
) 290
100.0%
Open Punctuation
ValueCountFrequency (%)
( 290
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 47
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 23890
84.1%
Common 4355
 
15.3%
Latin 167
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1378
 
5.8%
1201
 
5.0%
769
 
3.2%
645
 
2.7%
634
 
2.7%
592
 
2.5%
544
 
2.3%
531
 
2.2%
488
 
2.0%
477
 
2.0%
Other values (378) 16631
69.6%
Common
ValueCountFrequency (%)
3588
82.4%
) 290
 
6.7%
( 290
 
6.7%
- 47
 
1.1%
4 46
 
1.1%
, 33
 
0.8%
1 15
 
0.3%
3 14
 
0.3%
0 8
 
0.2%
6 6
 
0.1%
Other values (8) 18
 
0.4%
Latin
ValueCountFrequency (%)
H 52
31.1%
A 27
16.2%
C 22
13.2%
T 17
 
10.2%
F 14
 
8.4%
P 10
 
6.0%
I 7
 
4.2%
V 5
 
3.0%
R 3
 
1.8%
O 2
 
1.2%
Other values (5) 8
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 23890
84.1%
ASCII 4522
 
15.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3588
79.3%
) 290
 
6.4%
( 290
 
6.4%
H 52
 
1.1%
- 47
 
1.0%
4 46
 
1.0%
, 33
 
0.7%
A 27
 
0.6%
C 22
 
0.5%
T 17
 
0.4%
Other values (23) 110
 
2.4%
Hangul
ValueCountFrequency (%)
1378
 
5.8%
1201
 
5.0%
769
 
3.2%
645
 
2.7%
634
 
2.7%
592
 
2.5%
544
 
2.3%
531
 
2.2%
488
 
2.0%
477
 
2.0%
Other values (378) 16631
69.6%

이호조통계목명
Categorical

Distinct: 11
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 18.6 KiB

Length

Max length: 14
Median length: 8
Mean length: 10.179161
Min length: 4

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 민간자본사업보조
2nd row: 민간자본보조
3rd row: 민간자본사업보조
4th row: 사회복지시설법정운영비보조
5th row: 사회복지시설법정운영비보조

Common Values

Value | Count | Frequency (%)
민간경상사업보조 | 518 | 21.9%
민간행사사업보조 | 433 | 18.3%
민간자본사업보조(이전재원) | 405 | 17.2%
민간자본사업보조(자체재원) | 385 | 16.3%
민간자본사업보조 | 311 | 13.2%
민간단체법정운영비보조 | 145 | 6.1%
<NA> | 59 | 2.5%
사회복지사업보조 | 58 | 2.5%
사회복지시설법정운영비보조 | 43 | 1.8%
민간자본보조 | 3 | 0.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
민간경상사업보조 518
21.9%
민간행사사업보조 433
18.3%
민간자본사업보조(이전재원 405
17.2%
민간자본사업보조(자체재원 385
16.3%
민간자본사업보조 311
13.2%
민간단체법정운영비보조 145
 
6.1%
na 59
 
2.5%
사회복지사업보조 58
 
2.5%
사회복지시설법정운영비보조 43
 
1.8%
민간자본보조 3
 
0.1%

이호조정책사업부서명
Categorical

HIGH CORRELATION 

Distinct: 45
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 18.6 KiB

Length

Max length: 9
Median length: 5
Mean length: 5.2698009
Min length: 3

Unique

Unique: 2
Unique (%): 0.1%

Sample

1st row: 농업정책과
2nd row: 여성가족과
3rd row: 농업정책과
4th row: 여성가족과
5th row: 여성가족과

Common Values

Value | Count | Frequency (%)
농업정책과 | 373 | 15.8%
문화관광과 | 332 | 14.1%
농업기술센터 | 220 | 9.3%
축산과 | 176 | 7.5%
평생교육새마을과 | 168 | 7.1%
축산지원과 | 167 | 7.1%
농촌진흥과 | 131 | 5.5%
사회복지과 | 98 | 4.2%
기술보급과 | 86 | 3.6%
<NA> | 59 | 2.5%
Other values (35) | 551 | 23.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
농업정책과 373
15.8%
문화관광과 332
14.1%
농업기술센터 220
 
9.3%
축산과 176
 
7.5%
평생교육새마을과 168
 
7.1%
축산지원과 167
 
7.1%
농촌진흥과 131
 
5.5%
사회복지과 98
 
4.2%
기술보급과 86
 
3.6%
na 59
 
2.5%
Other values (35) 551
23.3%

Interactions


Correlations

Variable | 사업관리번호 | 연차사업진행여부 | 회계연도 | 부서코드 | 마감여부 | 삭제요청여부 | 이호조정책사업명 | 이호조통계목명 | 이호조정책사업부서명
사업관리번호 | 1.000 | 0.021 | 0.893 | 0.096 | 0.457 | 0.000 | 0.675 | 0.675 | 0.803
연차사업진행여부 | 0.021 | 1.000 | 0.042 | 0.000 | 0.000 | 0.000 | 0.494 | 0.059 | 0.515
회계연도 | 0.893 | 0.042 | 1.000 | 0.000 | 0.344 | 0.000 | 0.612 | 0.531 | 0.799
부서코드 | 0.096 | 0.000 | 0.000 | 1.000 | 0.085 | 0.000 | 1.000 | 0.000 | 1.000
마감여부 | 0.457 | 0.000 | 0.344 | 0.085 | 1.000 | 0.000 | 0.689 | 0.595 | 0.743
삭제요청여부 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.218 | 0.000 | 0.163
이호조정책사업명 | 0.675 | 0.494 | 0.612 | 1.000 | 0.689 | 0.218 | 1.000 | 0.834 | 0.996
이호조통계목명 | 0.675 | 0.059 | 0.531 | 0.000 | 0.595 | 0.000 | 0.834 | 1.000 | 0.815
이호조정책사업부서명 | 0.803 | 0.515 | 0.799 | 1.000 | 0.743 | 0.163 | 0.996 | 0.815 | 1.000

Variable | 이호조통계목명 | 마감여부 | 삭제요청여부 | 이호조정책사업부서명 | 연차사업진행여부
이호조통계목명 | 1.000 | 0.459 | 0.000 | 0.440 | 0.045
마감여부 | 0.459 | 1.000 | 0.000 | 0.603 | 0.000
삭제요청여부 | 0.000 | 0.000 | 1.000 | 0.129 | 0.000
이호조정책사업부서명 | 0.440 | 0.603 | 0.129 | 1.000 | 0.409
연차사업진행여부 | 0.045 | 0.000 | 0.000 | 0.409 | 1.000

Variable | 사업관리번호 | 회계연도 | 부서코드 | 연차사업진행여부 | 마감여부 | 삭제요청여부 | 이호조통계목명 | 이호조정책사업부서명
사업관리번호 | 1.000 | 0.968 | 0.232 | 0.016 | 0.350 | 0.000 | 0.266 | 0.424
회계연도 | 0.968 | 1.000 | 0.235 | 0.040 | 0.368 | 0.000 | 0.477 | 0.441
부서코드 | 0.232 | 0.235 | 1.000 | 0.000 | 0.056 | 0.000 | 0.000 | 0.991
연차사업진행여부 | 0.016 | 0.040 | 0.000 | 1.000 | 0.000 | 0.000 | 0.045 | 0.409
마감여부 | 0.350 | 0.368 | 0.056 | 0.000 | 1.000 | 0.000 | 0.459 | 0.603
삭제요청여부 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.129
이호조통계목명 | 0.266 | 0.477 | 0.000 | 0.045 | 0.459 | 0.000 | 1.000 | 0.440
이호조정책사업부서명 | 0.424 | 0.441 | 0.991 | 0.409 | 0.603 | 0.129 | 0.440 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
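
Nullity correlation can be sketched as the Pearson correlation between two columns' present/missing indicators. The code below is a minimal standard-library illustration under that assumption, not the profiler's implementation. The toy columns are made up, and `pearson` and `nullity_correlation` are hypothetical helpers.

```python
import math

def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # A column that is never (or always) missing has no nullity variation.
        return float("nan")
    return cov / math.sqrt(var_x * var_y)

def nullity_correlation(col_a, col_b):
    """Correlate the presence (1.0) / absence (0.0) patterns of two columns."""
    present_a = [0.0 if v is None else 1.0 for v in col_a]
    present_b = [0.0 if v is None else 1.0 for v in col_b]
    return pearson(present_a, present_b)

# Toy columns: the first two are missing in exactly the same rows.
col_1 = ["a", None, "b", None, "c"]
col_2 = ["x", None, "y", None, "z"]
col_3 = [1, 2, 3, 4, 5]  # never missing

r_same = nullity_correlation(col_1, col_2)
r_full = nullity_correlation(col_1, col_3)
print(r_same, r_full)
```

A value near 1 means the two columns tend to be missing in the same rows. A column with no missing values yields no defined correlation, returned here as NaN.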

Sample

Index | 사업관리번호 | 연차사업진행여부 | 회계연도 | 사업명 | 부서코드 | 사업시작일 | 사업종료일 | 마감여부 | 마감취소요청여부 | 삭제요청여부 | 이호조정책사업명 | 이호조단위사업명 | 이호조세부사업명 | 이호조통계목명 | 이호조정책사업부서명
0 | 187 | N | 2016 | 2016년딸기고설배드양액재배시설지원사업 | 1005040 | 2016-02-01 | 2016-11-30 | N | N | N | 농업 경쟁력 강화 | 원예특작 생산기반 구축 | 딸기 재배영농자재 지원 | 민간자본사업보조 | 농업정책과
1 | 36 | N | 2014 | 경로당도비보조기능보강사업 | 1008001 | 2014-01-01 | 2014-12-31 | N | N | N | 노인복지증진 | 노인단체 지원 및 경로당 활성화 지원 | 경로당 도비보조 기능보강사업 | 민간자본보조 | 여성가족과
2 | 110 | N | 2016 | 과수원예농가저온저장고설치지원사업 | 1005040 | 2016-02-01 | 2016-12-30 | N | N | N | 농업 경쟁력 강화 | 원예특작 생산기반 구축 | FTA대응 과수사업농가지원 | 민간자본사업보조 | 농업정책과
3 | 41 | N | 2015 | 건강가정지원센터운영 | 1008001 | 2015-01-01 | 2015-12-31 | N | N | N | 보육.가족지원 및 여성복지 증진 | 건강가정 지원 | 건강가정지원센터 운영 | 사회복지시설법정운영비보조 | 여성가족과
4 | 42 | N | 2015 | 다문화가족지원센터운영지원 | 1008001 | 2015-01-01 | 2015-12-31 | N | N | N | 보육.가족지원 및 여성복지 증진 | 다문화가족 지원 | 다문화가족지원센터 운영 지원 | 사회복지시설법정운영비보조 | 여성가족과
5 | 43 | N | 2015 | 다문화대축제행사 | 1008001 | 2015-05-01 | 2015-06-30 | N | N | N | 보육.가족지원 및 여성복지 증진 | 다문화가족 지원 | 다문화대축제 행사지원 | 민간행사사업보조 | 여성가족과
6 | 45 | N | 2015 | 합덕읍소재지종합정비사업 | 1005100 | 2015-01-01 | 2015-12-31 | N | N | N | 도시개발 | 소도읍 육성 | 소도읍 육성사업 | 민간경상사업보조 | 도시과
7 | 46 | N | 2015 | 송악읍소재지종합정비사업추진위원회운영경비 | 1005100 | 2015-01-01 | 2017-12-29 | N | N | N | 도시개발 | 읍소재지종합정비사업 | 송악읍소재지종합정비사업 | 민간경상사업보조 | 도시과
8 | 59 | N | 2014 | 축산농가소독시설지원 | 1009007 | 2014-01-01 | 2014-12-31 | N | N | N | 축산경쟁력강화 | 가축위생관리 | 가축방역시설사업 | 민간자본보조 | 축산과
9 | 79 | N | 2014 | 시설채소재배단지조성 | 1105020 | 2015-01-01 | 2015-12-31 | N | N | N | 농업기술 보급 | 과수 및 특용작물 기술보급 | 고품질 특용작물 생력화 시범사업 | 민간자본보조 | 농업기술센터

Index | 사업관리번호 | 연차사업진행여부 | 회계연도 | 사업명 | 부서코드 | 사업시작일 | 사업종료일 | 마감여부 | 마감취소요청여부 | 삭제요청여부 | 이호조정책사업명 | 이호조단위사업명 | 이호조세부사업명 | 이호조통계목명 | 이호조정책사업부서명
2351 | 2852 | N | 2021 | 2021년모범운전자회교통지도활동근무복및장비지원 | 1009002 | 2021-01-01 | 2021-12-31 | N | N | N | 편리한 교통 체계 구축 | 운수사업 지원 | 브랜드택시 지원 | 민간경상사업보조 | 교통과
2352 | 2853 | Y | 2021 | 제15회당진시우수광고물전시회보조금지원 | 1005161 | 2021-05-31 | 2021-11-30 | N | N | N | 주택 건설 및 운영 | 도시경관조성 | 옥외광고물 관리 | 민간행사사업보조 | 건축과
2353 | 2854 | N | 2021 | 바르게살기운동회원전국대회보조금지원 | 1001002 | 2021-07-01 | 2021-12-31 | N | N | N | 새마을 정신 함양 | 도의 정신 고취 | 바르게살기운동단체 지원 | 민간행사사업보조 | 공동체새마을과
2354 | 2855 | N | 2021 | 바르게살기운동회원수련대회보조금지원 | 1001002 | 2021-09-01 | 2021-12-31 | N | N | N | 새마을 정신 함양 | 도의 정신 고취 | 바르게살기운동단체 지원 | 민간행사사업보조 | 공동체새마을과
2355 | 2856 | N | 2021 | 당진시택시콜센터구축사업 | 1009002 | 2021-08-01 | 2021-12-31 | N | N | N | 편리한 교통 체계 구축 | 운수사업 지원 | 택시 관련사업 지원 | 민간자본사업보조(이전재원) | 교통과
2356 | 2857 | N | 2021 | 당진시택시장비개선확충사업 | 1009002 | 2021-08-01 | 2021-12-31 | N | N | N | 편리한 교통 체계 구축 | 운수사업 지원 | 택시 관련사업 지원 | 민간자본사업보조(이전재원) | 교통과
2357 | 2858 | N | 2021 | 국제안전도시시민참여사업 | 1008005 | 2021-07-01 | 2021-12-31 | N | N | N | 시민안전복지 향상 | 시민안전관리 | 행복한 안전도시 조성 | 민간경상사업보조 | 안전총괄과
2358 | 2859 | N | 2021 | 2021년대통령기국민독서경진대회및독서골든벨 | 1001002 | 2021-08-01 | 2021-12-20 | N | N | N | 새마을 정신 함양 | 도의 정신 고취 | 새마을단체 지원 | 민간행사사업보조 | 공동체새마을과
2359 | 2860 | N | 2021 | 당진시장애인가족지원센터종사자처우개선비 | 1012003 | 2021-01-01 | 2021-12-31 | N | N | N | 장애인 복지 증진 및 취약계층 보호 | 장애인지역사회재활시설 지원 | 장애인이용시설 종사자 처우개선비 | 사회복지시설법정운영비보조 | 경로장애인과
2360 | 2861 | N | 2021 | 충남학프로그램운영 | 1005154 | 2021-02-01 | 2021-12-31 | N | N | N | 평생학습체제 구축 | 평생학습 운영지원 | 충남학 프로그램 운영 | 민간경상사업보조 | 평생학습과