Overview

Dataset statistics

Number of variables5
Number of observations9582
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)< 0.1%
Total size in memory374.4 KiB
Average record size in memory40.0 B

Variable types

Text4
DateTime1

Dataset

Description진천군에서 생산한 홍보 보도자료 목록으로 온나라 프로그램에 등록된 자료를 추출하여 등록하였습니다.생산 기간은 2016년부터 2023년까지 자료입니다.
Author충청북도 진천군
URLhttps://www.data.go.kr/data/15127537/fileData.do

Alerts

Dataset has 2 (< 0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2024-04-21 02:33:48.388984
Analysis finished2024-04-21 02:33:50.387173
Duration2 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9017
Distinct (%)94.1%
Missing0
Missing (%)0.0%
Memory size75.0 KiB
2024-04-21T11:33:50.592150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length52
Mean length30.551033
Min length4

Characters and Unicode

Total characters292740
Distinct characters1023
Distinct categories16 ?
Distinct scripts4 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8712 ?
Unique (%)90.9%

Sample

1st row보도자료 송부[진천군, 2024년 진천사랑상품권 300억 발행]
2nd row보도자료 송부(안정환,‘생거진천’브랜드 전속모델 발탁)
3rd row보도자료 송부(2024년 생거진천 뿌리내리기 지원사업)
4th row보도자료 송부(2024 고향사랑기부금 제1호 기탁식)
5th row보도자료 송부(이웃돕기 기탁)
ValueCountFrequency (%)
보도자료 7991
 
14.5%
송부 1410
 
2.6%
송부(진천군 836
 
1.5%
개최 743
 
1.4%
실시 684
 
1.2%
기탁 645
 
1.2%
507
 
0.9%
제출 416
 
0.8%
진천군 393
 
0.7%
운영 383
 
0.7%
Other values (14677) 41027
74.5%
2024-04-21T11:33:51.026856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45490
 
15.5%
11122
 
3.8%
11119
 
3.8%
10696
 
3.7%
9865
 
3.4%
( 8965
 
3.1%
) 8953
 
3.1%
8208
 
2.8%
7646
 
2.6%
4480
 
1.5%
Other values (1013) 166196
56.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 212079
72.4%
Space Separator 45491
 
15.5%
Open Punctuation 9848
 
3.4%
Close Punctuation 9834
 
3.4%
Decimal Number 9069
 
3.1%
Other Punctuation 3569
 
1.2%
Uppercase Letter 1035
 
0.4%
Lowercase Letter 429
 
0.1%
Dash Punctuation 385
 
0.1%
Initial Punctuation 380
 
0.1%
Other values (6) 621
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11122
 
5.2%
11119
 
5.2%
10696
 
5.0%
9865
 
4.7%
8208
 
3.9%
7646
 
3.6%
4480
 
2.1%
3904
 
1.8%
3900
 
1.8%
3561
 
1.7%
Other values (903) 137578
64.9%
Uppercase Letter
ValueCountFrequency (%)
C 111
 
10.7%
S 85
 
8.2%
H 82
 
7.9%
T 71
 
6.9%
E 61
 
5.9%
L 55
 
5.3%
O 54
 
5.2%
N 50
 
4.8%
D 49
 
4.7%
G 48
 
4.6%
Other values (16) 369
35.7%
Lowercase Letter
ValueCountFrequency (%)
k 52
12.1%
o 50
11.7%
g 45
10.5%
a 42
9.8%
e 33
 
7.7%
p 32
 
7.5%
l 29
 
6.8%
y 20
 
4.7%
t 18
 
4.2%
r 16
 
3.7%
Other values (14) 92
21.4%
Other Punctuation
ValueCountFrequency (%)
, 2264
63.4%
' 379
 
10.6%
· 353
 
9.9%
! 206
 
5.8%
. 202
 
5.7%
" 63
 
1.8%
% 32
 
0.9%
& 24
 
0.7%
; 17
 
0.5%
# 15
 
0.4%
Other values (4) 14
 
0.4%
Decimal Number
ValueCountFrequency (%)
2 2778
30.6%
0 2051
22.6%
1 1678
18.5%
3 537
 
5.9%
9 433
 
4.8%
7 403
 
4.4%
8 399
 
4.4%
4 277
 
3.1%
6 276
 
3.0%
5 237
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 8965
91.0%
[ 334
 
3.4%
323
 
3.3%
215
 
2.2%
6
 
0.1%
3
 
< 0.1%
{ 1
 
< 0.1%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 8953
91.0%
] 334
 
3.4%
323
 
3.3%
213
 
2.2%
6
 
0.1%
3
 
< 0.1%
} 1
 
< 0.1%
1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
< 29
27.9%
> 29
27.9%
~ 26
25.0%
10
 
9.6%
+ 5
 
4.8%
4
 
3.8%
1
 
1.0%
Space Separator
ValueCountFrequency (%)
45490
> 99.9%
  1
 
< 0.1%
Initial Punctuation
ValueCountFrequency (%)
224
58.9%
156
41.1%
Final Punctuation
ValueCountFrequency (%)
201
59.5%
137
40.5%
Other Symbol
ValueCountFrequency (%)
49
98.0%
1
 
2.0%
Letter Number
ValueCountFrequency (%)
11
78.6%
3
 
21.4%
Dash Punctuation
ValueCountFrequency (%)
- 385
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 112
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 212061
72.4%
Common 79134
 
27.0%
Latin 1478
 
0.5%
Han 67
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11122
 
5.2%
11119
 
5.2%
10696
 
5.0%
9865
 
4.7%
8208
 
3.9%
7646
 
3.6%
4480
 
2.1%
3904
 
1.8%
3900
 
1.8%
3561
 
1.7%
Other values (873) 137560
64.9%
Common
ValueCountFrequency (%)
45490
57.5%
( 8965
 
11.3%
) 8953
 
11.3%
2 2778
 
3.5%
, 2264
 
2.9%
0 2051
 
2.6%
1 1678
 
2.1%
3 537
 
0.7%
9 433
 
0.5%
7 403
 
0.5%
Other values (47) 5582
 
7.1%
Latin
ValueCountFrequency (%)
C 111
 
7.5%
S 85
 
5.8%
H 82
 
5.5%
T 71
 
4.8%
E 61
 
4.1%
L 55
 
3.7%
O 54
 
3.7%
k 52
 
3.5%
o 50
 
3.4%
N 50
 
3.4%
Other values (42) 807
54.6%
Han
ValueCountFrequency (%)
24
35.8%
5
 
7.5%
3
 
4.5%
3
 
4.5%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
Other values (21) 21
31.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 212008
72.4%
ASCII 78414
 
26.8%
None 1498
 
0.5%
Punctuation 719
 
0.2%
CJK 67
 
< 0.1%
Arrows 15
 
< 0.1%
Number Forms 14
 
< 0.1%
Compat Jamo 4
 
< 0.1%
Geometric Shapes 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
45490
58.0%
( 8965
 
11.4%
) 8953
 
11.4%
2 2778
 
3.5%
, 2264
 
2.9%
0 2051
 
2.6%
1 1678
 
2.1%
3 537
 
0.7%
9 433
 
0.6%
7 403
 
0.5%
Other values (75) 4862
 
6.2%
Hangul
ValueCountFrequency (%)
11122
 
5.2%
11119
 
5.2%
10696
 
5.0%
9865
 
4.7%
8208
 
3.9%
7646
 
3.6%
4480
 
2.1%
3904
 
1.8%
3900
 
1.8%
3561
 
1.7%
Other values (871) 137507
64.9%
None
ValueCountFrequency (%)
· 353
23.6%
323
21.6%
323
21.6%
215
14.4%
213
14.2%
49
 
3.3%
6
 
0.4%
6
 
0.4%
3
 
0.2%
3
 
0.2%
Other values (4) 4
 
0.3%
Punctuation
ValueCountFrequency (%)
224
31.2%
201
28.0%
156
21.7%
137
19.1%
1
 
0.1%
CJK
ValueCountFrequency (%)
24
35.8%
5
 
7.5%
3
 
4.5%
3
 
4.5%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
2
 
3.0%
1
 
1.5%
Other values (21) 21
31.3%
Number Forms
ValueCountFrequency (%)
11
78.6%
3
 
21.4%
Arrows
ValueCountFrequency (%)
10
66.7%
4
 
26.7%
1
 
6.7%
Compat Jamo
ValueCountFrequency (%)
4
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Distinct61
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size75.0 KiB
2024-04-21T11:33:51.241319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length5
Mean length4.7927364
Min length3

Characters and Unicode

Total characters45924
Distinct characters111
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row경제과
2nd row축산유통과
3rd row통합일자리지원단
4th row행정지원과
5th row주민복지과
ValueCountFrequency (%)
주민복지과 1210
 
12.6%
행정지원과 756
 
7.9%
여성가족과 479
 
5.0%
보건소 359
 
3.7%
평생학습센터 357
 
3.7%
진천읍 336
 
3.5%
건강증진과 315
 
3.3%
이월면 293
 
3.1%
보건행정과 280
 
2.9%
농업기술센터 256
 
2.7%
Other values (51) 4941
51.6%
2024-04-21T11:33:51.563462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6124
 
13.3%
2662
 
5.8%
1791
 
3.9%
1511
 
3.3%
1399
 
3.0%
1317
 
2.9%
1210
 
2.6%
1210
 
2.6%
1036
 
2.3%
1009
 
2.2%
Other values (101) 26655
58.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45924
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6124
 
13.3%
2662
 
5.8%
1791
 
3.9%
1511
 
3.3%
1399
 
3.0%
1317
 
2.9%
1210
 
2.6%
1210
 
2.6%
1036
 
2.3%
1009
 
2.2%
Other values (101) 26655
58.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45924
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6124
 
13.3%
2662
 
5.8%
1791
 
3.9%
1511
 
3.3%
1399
 
3.0%
1317
 
2.9%
1210
 
2.6%
1210
 
2.6%
1036
 
2.3%
1009
 
2.2%
Other values (101) 26655
58.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45924
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6124
 
13.3%
2662
 
5.8%
1791
 
3.9%
1511
 
3.3%
1399
 
3.0%
1317
 
2.9%
1210
 
2.6%
1210
 
2.6%
1036
 
2.3%
1009
 
2.2%
Other values (101) 26655
58.0%
Distinct73
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size75.0 KiB
2024-04-21T11:33:51.768622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length11.830202
Min length7

Characters and Unicode

Total characters113357
Distinct characters114
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row진천군 문화경제국 경제과
2nd row진천군 농업기술센터 축산유통과
3rd row진천군 통합일자리지원단
4th row진천군 복지행정국 행정지원과
5th row진천군 복지행정국 주민복지과
ValueCountFrequency (%)
진천군 9582
39.7%
복지행정국 2416
 
10.0%
주민복지과 1210
 
5.0%
보건소 954
 
4.0%
농업기술센터 875
 
3.6%
행정지원과 756
 
3.1%
미래도시국 745
 
3.1%
문화경제국 571
 
2.4%
여성가족과 479
 
2.0%
평생학습센터 357
 
1.5%
Other values (55) 6165
25.6%
2024-04-21T11:33:52.085936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14528
 
12.8%
10330
 
9.1%
9918
 
8.7%
9582
 
8.5%
6124
 
5.4%
5078
 
4.5%
4207
 
3.7%
3732
 
3.3%
3626
 
3.2%
3452
 
3.0%
Other values (104) 42780
37.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 98829
87.2%
Space Separator 14528
 
12.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10330
 
10.5%
9918
 
10.0%
9582
 
9.7%
6124
 
6.2%
5078
 
5.1%
4207
 
4.3%
3732
 
3.8%
3626
 
3.7%
3452
 
3.5%
1912
 
1.9%
Other values (103) 40868
41.4%
Space Separator
ValueCountFrequency (%)
14528
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 98829
87.2%
Common 14528
 
12.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10330
 
10.5%
9918
 
10.0%
9582
 
9.7%
6124
 
6.2%
5078
 
5.1%
4207
 
4.3%
3732
 
3.8%
3626
 
3.7%
3452
 
3.5%
1912
 
1.9%
Other values (103) 40868
41.4%
Common
ValueCountFrequency (%)
14528
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 98829
87.2%
ASCII 14528
 
12.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
14528
100.0%
Hangul
ValueCountFrequency (%)
10330
 
10.5%
9918
 
10.0%
9582
 
9.7%
6124
 
6.2%
5078
 
5.1%
4207
 
4.3%
3732
 
3.8%
3626
 
3.7%
3452
 
3.5%
1912
 
1.9%
Other values (103) 40868
41.4%
Distinct816
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Memory size75.0 KiB
2024-04-21T11:33:52.392662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.9982258
Min length2

Characters and Unicode

Total characters28729
Distinct characters189
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique125 ?
Unique (%)1.3%

Sample

1st row유원상
2nd row신현정
3rd row정준호
4th row김민형
5th row조은별
ValueCountFrequency (%)
김은경 151
 
1.6%
이재철 112
 
1.2%
임정희 108
 
1.1%
서지현 101
 
1.1%
이진범 99
 
1.0%
이은정 88
 
0.9%
하지현 85
 
0.9%
이영희 83
 
0.9%
이세영 80
 
0.8%
정준호 77
 
0.8%
Other values (806) 8598
89.7%
2024-04-21T11:33:52.809850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1994
 
6.9%
1838
 
6.4%
1471
 
5.1%
1012
 
3.5%
953
 
3.3%
833
 
2.9%
724
 
2.5%
690
 
2.4%
677
 
2.4%
653
 
2.3%
Other values (179) 17884
62.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28729
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1994
 
6.9%
1838
 
6.4%
1471
 
5.1%
1012
 
3.5%
953
 
3.3%
833
 
2.9%
724
 
2.5%
690
 
2.4%
677
 
2.4%
653
 
2.3%
Other values (179) 17884
62.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28729
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1994
 
6.9%
1838
 
6.4%
1471
 
5.1%
1012
 
3.5%
953
 
3.3%
833
 
2.9%
724
 
2.5%
690
 
2.4%
677
 
2.4%
653
 
2.3%
Other values (179) 17884
62.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28729
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1994
 
6.9%
1838
 
6.4%
1471
 
5.1%
1012
 
3.5%
953
 
3.3%
833
 
2.9%
724
 
2.5%
690
 
2.4%
677
 
2.4%
653
 
2.3%
Other values (179) 17884
62.3%
Distinct8077
Distinct (%)84.3%
Missing0
Missing (%)0.0%
Memory size75.0 KiB
Minimum2016-07-01 16:16:00
Maximum2023-12-29 17:49:00
2024-04-21T11:33:52.960413image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:33:53.077118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Correlations

2024-04-21T11:33:53.153645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기안자부서부서상세명
기안자부서1.0001.000
부서상세명1.0001.000

Missing values

2024-04-21T11:33:50.215285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T11:33:50.340238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

문서제목기안자부서부서상세명기안자접수일자
0보도자료 송부[진천군, 2024년 진천사랑상품권 300억 발행]경제과진천군 문화경제국 경제과유원상2023-12-29 17:49
1보도자료 송부(안정환,‘생거진천’브랜드 전속모델 발탁)축산유통과진천군 농업기술센터 축산유통과신현정2023-12-29 16:03
2보도자료 송부(2024년 생거진천 뿌리내리기 지원사업)통합일자리지원단진천군 통합일자리지원단정준호2023-12-29 13:48
3보도자료 송부(2024 고향사랑기부금 제1호 기탁식)행정지원과진천군 복지행정국 행정지원과김민형2023-12-29 9:14
4보도자료 송부(이웃돕기 기탁)주민복지과진천군 복지행정국 주민복지과조은별2023-12-28 18:00
5보도자료 송부(2023 직장운동경기부 우수 운영팀 지원 공모 3년연속 선정)체육진흥지원단진천군 체육진흥지원단안진영2023-12-28 15:46
6보도자료 송부(진천군 모범음식점 지정)식산업자원과진천군 문화경제국 식산업자원과김기중2023-12-28 15:46
7보도자료 송부(백곡면 해맞이 행사, 엽돈재 정상에서 개최)백곡면진천군 백곡면전연주2023-12-28 15:46
8보도자료 송부(진천군, 조기발주를 위한 합동측량·설계반 운영)지역개발과진천군 미래도시국 지역개발과홍영기2023-12-28 14:50
9보도자료 송부('24년도 생활밀착형 도시재생 스마트기술 지원사업 선정)지역개발과진천군 미래도시국 지역개발과김규완2023-12-28 14:00
문서제목기안자부서부서상세명기안자접수일자
9572보도자료 송부(농기 2016-66호, 진천 정보화농업인, 내 농장 알리기에 나섰다)농업기술센터진천군 농업기술센터김은경2016-07-05 8:23
9573보도자료 제출(주민세 재산분 신고납부 홍보)세정과진천군 세정과김이섭2016-07-04 17:48
9574보도자료(인터넷·스마트폰 중독 예방·해소 프로그램 운영) 송부주민복지과진천군 주민복지과임종옥2016-07-04 16:57
9575한의학건강증진사업 중풍예방교실 보도자료 제출보건소진천군 보건소곽재영2016-07-04 16:56
9576세무조사관련 보도자료 제출세정과진천군 세정과박용복2016-07-04 16:56
9577보도자료 의뢰(진천봉화로타리클럽 사랑의 생거진천 쌀 전달)진천읍진천군 진천읍황진식2016-07-04 16:32
9578보도자료 송부(명품도시추진단 현판식)명품도시추진단진천군 명품도시추진단김만희2016-07-04 13:59
9579보도자료 송부(진천군 종합안전교육체험관 개관)안전건설과진천군 안전건설과민경환2016-07-04 8:11
9580보도자료 제출(체납차량 번호판 영치 추진)세정과진천군 세정과김해경2016-07-01 16:21
9581보도자료(장애인복지관 충북장애인기능경기대회 참가) 제출주민복지과진천군 주민복지과차근우2016-07-01 16:16

Duplicate rows

Most frequently occurring

문서제목기안자부서부서상세명기안자접수일자# duplicates
0보도자료 송부기획조정실진천군 기획조정실성민중2017-08-01 9:402
1보도자료 송부(장학기금 기탁)행정지원과진천군 행정지원과류효선2017-12-14 13:242