Overview

Dataset statistics

Number of variables: 4
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): < 0.1%
Total size in memory: 390.6 KiB
Average record size in memory: 40.0 B
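
The average record size can be cross-checked from the total size and the number of observations. The snippet below is a minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics listed above.
total_size_kib = 390.6      # Total size in memory (KiB)
n_observations = 10_000     # Number of observations

# Assumption: 1 KiB = 1024 bytes, so convert before dividing by the row count.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")
```

Rounded to one decimal place, the result agrees with the 40.0 B shown in the report.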

Variable types

Categorical: 1
DateTime: 1
Text: 2

Dataset

Description: 경기도_보도자료 현황 (Gyeonggi-do press release status)
Author: 경기도뉴스포털 (Gyeonggi-do News Portal)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=APXKY127QXG0TVVR1Y6E28683342&infSeq=1

Alerts

Duplicates: Dataset has 1 (< 0.1%) duplicate rows

Reproduction

Analysis started: 2024-05-03 19:39:50.364203
Analysis finished: 2024-05-03 19:39:53.514352
Duration: 3.15 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
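
The reported duration is the difference between the two timestamps. A small sketch of that check follows, using only the Python standard library; the format string is an assumption based on how the timestamps are printed here.

```python
from datetime import datetime

# Assumed timestamp layout, matching the values printed in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-05-03 19:39:50.364203", FMT)
finished = datetime.strptime("2024-05-03 19:39:53.514352", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```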

Variables

기관명
Categorical

Distinct: 16
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value | Count
경기도 | 4572
포천시 | 1141
광주시 | 965
성남시 | 765
오산시 | 733
Other values (11) | 1824

Length

Max length: 8
Median length: 3
Mean length: 3.0672
Min length: 3

Unique

Unique: 3
Unique (%): < 0.1%

Sample

1st row: 경기도
2nd row: 성남시
3rd row: 경기도
4th row: 동두천시
5th row: 고양시

Common Values

Value | Count | Frequency (%)
경기도 | 4572 | 45.7%
포천시 | 1141 | 11.4%
광주시 | 965 | 9.7%
성남시 | 765 | 7.6%
오산시 | 733 | 7.3%
동두천시 | 667 | 6.7%
군포시 | 316 | 3.2%
고양시 | 305 | 3.0%
양주시 | 253 | 2.5%
안양시 | 118 | 1.2%
Other values (6) | 165 | 1.7%
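
Each frequency in the Common Values table is the count divided by the number of observations. The sketch below recomputes the shares from the counts listed above; the dictionary and variable names are illustrative. Two decimals are printed, whereas the report rounds to one.

```python
# Counts copied from the Common Values table for 기관명.
counts = {
    "경기도": 4572, "포천시": 1141, "광주시": 965, "성남시": 765,
    "오산시": 733, "동두천시": 667, "군포시": 316, "고양시": 305,
    "양주시": 253, "안양시": 118, "Other values (6)": 165,
}

# The counts should add up to the number of observations.
total = sum(counts.values())

# Share of each value as a percentage of all rows.
shares = {value: 100 * count / total for value, count in counts.items()}

for value, pct in shares.items():
    print(f"{value}: {pct:.2f}%")
```

The counts sum to 10000, which matches the number of observations and the 0 missing values reported for this variable.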

Length

Histogram of lengths of the category

일자
Date

Distinct: 2354
Distinct (%): 23.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2016-04-19 00:00:00
Maximum: 2024-05-03 00:00:00
Histogram with fixed size bins (bins=50)
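
As a rough illustration of fixed-size binning over this date range, the sketch below splits the interval between the reported minimum and maximum into 50 equal-width bins and maps a date to its bin. It is an assumption-laden sketch, not necessarily how the report computes its histogram; `bin_index` is a hypothetical helper.

```python
from datetime import date

# Minimum and maximum of 일자 as reported above.
lo, hi = date(2016, 4, 19), date(2024, 5, 3)
BINS = 50

# Width of one bin as a timedelta (total span divided evenly).
width = (hi - lo) / BINS

def bin_index(d: date) -> int:
    """Return the 0-based bin for d; the maximum is placed in the last bin."""
    return min(int((d - lo) / width), BINS - 1)

print((hi - lo).days, "days in total, bin width:", width)
print(bin_index(date(2020, 7, 17)))
```

Clamping with `min` keeps the right edge inside the final bin, which is a common convention for closed histogram ranges.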

제목
Text

Distinct: 9986
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 89
Median length: 73
Mean length: 32.4484
Min length: 8

Characters and Unicode

Total characters: 324484
Distinct characters: 1283
Distinct categories: 16
Distinct scripts: 4
Distinct blocks: 11
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
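
To make the idea concrete, the sketch below counts Unicode general categories for one of the sample titles using Python's standard `unicodedata` module. The two-letter codes correspond to the category names used in this report (for example `Lo` is Other Letter, `Zs` is Space Separator, `Po` is Other Punctuation, `Pi` and `Pf` are Initial and Final Punctuation). This is an illustration only, not the profiler's own implementation.

```python
import unicodedata
from collections import Counter

# 4th sample row of 제목; the curly quotes are written as escapes for clarity.
title = "동두천시, \u201c새 마음, 새 꿈\u201d의 날개를 단 새마을지도자"

# Map every character to its general category and tally the results.
categories = Counter(unicodedata.category(ch) for ch in title)

for code, count in categories.most_common():
    print(code, count)
```

The standard library does not expose script or block properties, so reproducing those parts of the report would need an additional data source.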

Unique

Unique: 9972
Unique (%): 99.7%

Sample

1st row: 경기도, ‘코로나19’ 확산방지 위해 주한미군에 적극적 협조 당부
2nd row: 성남시 “주방용 오물분쇄기 인증 제품만 쓰세요”
3rd row: 경기 연천, 구제역 발생 24일만에 이동제한 조치 해제
4th row: 동두천시, “새 마음, 새 꿈”의 날개를 단 새마을지도자
5th row: 고양시, 민원·의료 등 추석 연휴 종합 대책 추진

Value | Count | Frequency (%)
경기도 | 1434 | 2.0%
개최 | 1034 | 1.5%
(lost) | 1015 | 1.4%
광주시 | 773 | 1.1%
포천시 | 756 | 1.1%
실시 | 628 | 0.9%
성남시 | 510 | 0.7%
위한 | 509 | 0.7%
동두천시 | 465 | 0.7%
오산시 | 413 | 0.6%
Other values (26335) | 63339 | 89.4%

Most occurring characters

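Value | Count | Frequency (%)
(space) | 61727 | 19.0%
(lost) | 7604 | 2.3%
, | 7483 | 2.3%
(lost) | 6317 | 1.9%
(lost) | 5158 | 1.6%
(lost) | 3775 | 1.2%
2 | 3732 | 1.2%
(lost) | 3644 | 1.1%
1 | 3479 | 1.1%
(lost) | 3453 | 1.1%
Other values (1273) | 218112 | 67.2%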

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 222398 | 68.5%
Space Separator | 61727 | 19.0%
Decimal Number | 16201 | 5.0%
Other Punctuation | 12055 | 3.7%
Initial Punctuation | 3026 | 0.9%
Final Punctuation | 3006 | 0.9%
Uppercase Letter | 2287 | 0.7%
Open Punctuation | 1188 | 0.4%
Close Punctuation | 1185 | 0.4%
Lowercase Letter | 638 | 0.2%
Other values (6) | 773 | 0.2%

Most frequent character per category

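Other Letter
Value | Count | Frequency (%)
(lost) | 7604 | 3.4%
(lost) | 6317 | 2.8%
(lost) | 5158 | 2.3%
(lost) | 3775 | 1.7%
(lost) | 3644 | 1.6%
(lost) | 3453 | 1.6%
(lost) | 2696 | 1.2%
(lost) | 2562 | 1.2%
(lost) | 2428 | 1.1%
(lost) | 2424 | 1.1%
Other values (1158) | 182337 | 82.0%
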
Uppercase Letter
Value | Count | Frequency (%)
A | 277 | 12.1%
S | 231 | 10.1%
I | 179 | 7.8%
T | 134 | 5.9%
C | 131 | 5.7%
F | 129 | 5.6%
G | 116 | 5.1%
D | 112 | 4.9%
M | 111 | 4.9%
E | 102 | 4.5%
Other values (16) | 765 | 33.4%

Lowercase Letter
Value | Count | Frequency (%)
e | 81 | 12.7%
t | 71 | 11.1%
a | 62 | 9.7%
o | 56 | 8.8%
l | 43 | 6.7%
r | 40 | 6.3%
n | 38 | 6.0%
g | 35 | 5.5%
i | 29 | 4.5%
y | 26 | 4.1%
Other values (14) | 157 | 24.6%

Other Punctuation
Value | Count | Frequency (%)
, | 7483 | 62.1%
. | 1378 | 11.4%
· | 1002 | 8.3%
(lost) | 640 | 5.3%
! | 450 | 3.7%
(lost) | 265 | 2.2%
& | 178 | 1.5%
? | 171 | 1.4%
% | 159 | 1.3%
; | 145 | 1.2%
Other values (6) | 184 | 1.5%

Decimal Number
Value | Count | Frequency (%)
2 | 3732 | 23.0%
1 | 3479 | 21.5%
0 | 3152 | 19.5%
3 | 1079 | 6.7%
9 | 955 | 5.9%
8 | 870 | 5.4%
5 | 766 | 4.7%
4 | 755 | 4.7%
7 | 751 | 4.6%
6 | 662 | 4.1%

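Math Symbol
Value | Count | Frequency (%)
~ | 164 | 74.5%
(lost) | 19 | 8.6%
(lost) | 15 | 6.8%
+ | 11 | 5.0%
(lost) | 5 | 2.3%
(lost) | 4 | 1.8%
(lost) | 1 | 0.5%
× | 1 | 0.5%

Close Punctuation
Value | Count | Frequency (%)
) | 984 | 83.0%
(lost) | 94 | 7.9%
] | 62 | 5.2%
(lost) | 41 | 3.5%
(lost) | 2 | 0.2%
(lost) | 1 | 0.1%
(lost) | 1 | 0.1%

Open Punctuation
Value | Count | Frequency (%)
( | 984 | 82.8%
(lost) | 96 | 8.1%
[ | 63 | 5.3%
(lost) | 41 | 3.5%
(lost) | 2 | 0.2%
(lost) | 1 | 0.1%
(lost) | 1 | 0.1%

Other Symbol
Value | Count | Frequency (%)
(lost) | 62 | 79.5%
(lost) | 10 | 12.8%
(lost) | 2 | 2.6%
(lost) | 2 | 2.6%
(lost) | 1 | 1.3%
° | 1 | 1.3%

Letter Number
Value | Count | Frequency (%)
(lost) | 3 | 60.0%
(lost) | 1 | 20.0%
(lost) | 1 | 20.0%

Initial Punctuation
Value | Count | Frequency (%)
(lost) | 2378 | 78.6%
(lost) | 648 | 21.4%

Final Punctuation
Value | Count | Frequency (%)
(lost) | 2366 | 78.7%
(lost) | 640 | 21.3%

Space Separator
Value | Count | Frequency (%)
(space) | 61727 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 347 | 100.0%

Modifier Symbol
Value | Count | Frequency (%)
` | 109 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 14 | 100.0%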

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 222239 | 68.5%
Common | 99092 | 30.5%
Latin | 2930 | 0.9%
Han | 223 | 0.1%

Most frequent character per script

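Hangul
Value | Count | Frequency (%)
(lost) | 7604 | 3.4%
(lost) | 6317 | 2.8%
(lost) | 5158 | 2.3%
(lost) | 3775 | 1.7%
(lost) | 3644 | 1.6%
(lost) | 3453 | 1.6%
(lost) | 2696 | 1.2%
(lost) | 2562 | 1.2%
(lost) | 2428 | 1.1%
(lost) | 2424 | 1.1%
Other values (1059) | 182178 | 82.0%

Han
Value | Count | Frequency (%)
(lost) | 28 | 12.6%
(lost) | 19 | 8.5%
(lost) | 16 | 7.2%
(lost) | 13 | 5.8%
(lost) | 10 | 4.5%
(lost) | 5 | 2.2%
(lost) | 5 | 2.2%
(lost) | 4 | 1.8%
(lost) | 4 | 1.8%
(lost) | 3 | 1.3%
Other values (91) | 116 | 52.0%

Common
Value | Count | Frequency (%)
(space) | 61727 | 62.3%
, | 7483 | 7.6%
2 | 3732 | 3.8%
1 | 3479 | 3.5%
0 | 3152 | 3.2%
(lost) | 2378 | 2.4%
(lost) | 2366 | 2.4%
. | 1378 | 1.4%
3 | 1079 | 1.1%
· | 1002 | 1.0%
Other values (50) | 11316 | 11.4%

Latin
Value | Count | Frequency (%)
A | 277 | 9.5%
S | 231 | 7.9%
I | 179 | 6.1%
T | 134 | 4.6%
C | 131 | 4.5%
F | 129 | 4.4%
G | 116 | 4.0%
D | 112 | 3.8%
M | 111 | 3.8%
E | 102 | 3.5%
Other values (43) | 1408 | 48.1%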

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 222153 | 68.5%
ASCII | 93736 | 28.9%
Punctuation | 6937 | 2.1%
None | 1351 | 0.4%
CJK | 223 | 0.1%
Arrows | 43 | < 0.1%
Compat Jamo | 22 | < 0.1%
CJK Compat | 12 | < 0.1%
Number Forms | 5 | < 0.1%
Letterlike Symbols | 1 | < 0.1%

Most frequent character per block

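ASCII
Value | Count | Frequency (%)
(space) | 61727 | 65.9%
, | 7483 | 8.0%
2 | 3732 | 4.0%
1 | 3479 | 3.7%
0 | 3152 | 3.4%
. | 1378 | 1.5%
3 | 1079 | 1.2%
) | 984 | 1.0%
( | 984 | 1.0%
9 | 955 | 1.0%
Other values (71) | 8783 | 9.4%

Hangul
Value | Count | Frequency (%)
(lost) | 7604 | 3.4%
(lost) | 6317 | 2.8%
(lost) | 5158 | 2.3%
(lost) | 3775 | 1.7%
(lost) | 3644 | 1.6%
(lost) | 3453 | 1.6%
(lost) | 2696 | 1.2%
(lost) | 2562 | 1.2%
(lost) | 2428 | 1.1%
(lost) | 2424 | 1.1%
Other values (1055) | 182092 | 82.0%
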
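Punctuation
Value | Count | Frequency (%)
(lost) | 2378 | 34.3%
(lost) | 2366 | 34.1%
(lost) | 648 | 9.3%
(lost) | 640 | 9.2%
(lost) | 640 | 9.2%
(lost) | 265 | 3.8%

None
Value | Count | Frequency (%)
· | 1002 | 74.2%
(lost) | 96 | 7.1%
(lost) | 94 | 7.0%
(lost) | 62 | 4.6%
(lost) | 41 | 3.0%
(lost) | 41 | 3.0%
(lost) | 2 | 0.1%
(lost) | 2 | 0.1%
(lost) | 2 | 0.1%
(lost) | 2 | 0.1%
Other values (7) | 7 | 0.5%

CJK
Value | Count | Frequency (%)
(lost) | 28 | 12.6%
(lost) | 19 | 8.5%
(lost) | 16 | 7.2%
(lost) | 13 | 5.8%
(lost) | 10 | 4.5%
(lost) | 5 | 2.2%
(lost) | 5 | 2.2%
(lost) | 4 | 1.8%
(lost) | 4 | 1.8%
(lost) | 3 | 1.3%
Other values (91) | 116 | 52.0%
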
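Compat Jamo
Value | Count | Frequency (%)
(lost) | 21 | 95.5%
(lost) | 1 | 4.5%

Arrows
Value | Count | Frequency (%)
(lost) | 19 | 44.2%
(lost) | 15 | 34.9%
(lost) | 5 | 11.6%
(lost) | 4 | 9.3%

CJK Compat
Value | Count | Frequency (%)
(lost) | 10 | 83.3%
(lost) | 2 | 16.7%

Number Forms
Value | Count | Frequency (%)
(lost) | 3 | 60.0%
(lost) | 1 | 20.0%
(lost) | 1 | 20.0%

Letterlike Symbols
Value | Count | Frequency (%)
(lost) | 1 | 100.0%

Math Operators
Value | Count | Frequency (%)
(lost) | 1 | 100.0%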

링크URL
Text

Distinct: 9999
Distinct (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 78
Median length: 78
Mean length: 77.5895
Min length: 77

Characters and Unicode

Total characters: 775895
Distinct characters: 41
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9998
Unique (%): > 99.9%

Sample

1st row: https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=45050
2nd row: https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=95385
3rd row: https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=33195
4th row: https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=92590
5th row: https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=73819

Value | Count | Frequency (%)
https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?bs_code=s017&number=59192 | 2 | < 0.1%
https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?bs_code=s017&number=33519 | 1 | < 0.1%
https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?bs_code=s017&number=31613 | 1 | < 0.1%
https://gnews.gg.go.kr/briefing/brief_sigun_view.do?bs_code=s003&number=82884 | 1 | < 0.1%
https://gnews.gg.go.kr/briefing/brief_sigun_view.do?bs_code=s003&number=107536 | 1 | < 0.1%
https://gnews.gg.go.kr/briefing/brief_sigun_view.do?bs_code=s003&number=75696 | 1 | < 0.1%
https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?bs_code=s017&number=40223 | 1 | < 0.1%
https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?bs_code=s017&number=40686 | 1 | < 0.1%
https://gnews.gg.go.kr/briefing/brief_sigun_view.do?bs_code=s003&number=98285 | 1 | < 0.1%
https://gnews.gg.go.kr/briefing/brief_sigun_view.do?bs_code=s003&number=94547 | 1 | < 0.1%
Other values (9989) | 9989 | 99.9%

Most occurring characters

Value | Count | Frequency (%)
g | 64572 | 8.3%
e | 50000 | 6.4%
i | 45428 | 5.9%
/ | 40000 | 5.2%
n | 40000 | 5.2%
. | 40000 | 5.2%
r | 40000 | 5.2%
b | 34572 | 4.5%
_ | 30000 | 3.9%
o | 29144 | 3.8%
Other values (31) | 362179 | 46.7%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 464572 | 59.9%
Other Punctuation | 110000 | 14.2%
Decimal Number | 81323 | 10.5%
Uppercase Letter | 70000 | 9.0%
Connector Punctuation | 30000 | 3.9%
Math Symbol | 20000 | 2.6%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
g | 64572 | 13.9%
e | 50000 | 10.8%
i | 45428 | 9.8%
n | 40000 | 8.6%
r | 40000 | 8.6%
b | 34572 | 7.4%
o | 29144 | 6.3%
s | 25428 | 5.5%
w | 20000 | 4.3%
f | 20000 | 4.3%
Other values (8) | 95428 | 20.5%

Decimal Number
Value | Count | Frequency (%)
0 | 20699 | 25.5%
3 | 10885 | 13.4%
1 | 9909 | 12.2%
7 | 9810 | 12.1%
4 | 5578 | 6.9%
5 | 5427 | 6.7%
8 | 5407 | 6.6%
9 | 5357 | 6.6%
6 | 4276 | 5.3%
2 | 3975 | 4.9%

Uppercase Letter
Value | Count | Frequency (%)
S | 20000 | 28.6%
D | 10000 | 14.3%
E | 10000 | 14.3%
O | 10000 | 14.3%
C | 10000 | 14.3%
B | 10000 | 14.3%

Other Punctuation
Value | Count | Frequency (%)
/ | 40000 | 36.4%
. | 40000 | 36.4%
& | 10000 | 9.1%
? | 10000 | 9.1%
: | 10000 | 9.1%

Connector Punctuation
Value | Count | Frequency (%)
_ | 30000 | 100.0%

Math Symbol
Value | Count | Frequency (%)
= | 20000 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 534572 | 68.9%
Common | 241323 | 31.1%

Most frequent character per script

Latin
Value | Count | Frequency (%)
g | 64572 | 12.1%
e | 50000 | 9.4%
i | 45428 | 8.5%
n | 40000 | 7.5%
r | 40000 | 7.5%
b | 34572 | 6.5%
o | 29144 | 5.5%
s | 25428 | 4.8%
w | 20000 | 3.7%
f | 20000 | 3.7%
Other values (14) | 165428 | 30.9%

Common
Value | Count | Frequency (%)
/ | 40000 | 16.6%
. | 40000 | 16.6%
_ | 30000 | 12.4%
0 | 20699 | 8.6%
= | 20000 | 8.3%
3 | 10885 | 4.5%
& | 10000 | 4.1%
? | 10000 | 4.1%
: | 10000 | 4.1%
1 | 9909 | 4.1%
Other values (7) | 39830 | 16.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 775895 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
g | 64572 | 8.3%
e | 50000 | 6.4%
i | 45428 | 5.9%
/ | 40000 | 5.2%
n | 40000 | 5.2%
. | 40000 | 5.2%
r | 40000 | 5.2%
b | 34572 | 4.5%
_ | 30000 | 3.9%
o | 29144 | 3.8%
Other values (31) | 362179 | 46.7%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 기관명 | 일자 | 제목 | 링크URL
29236 | 경기도 | 2020-07-17 | 경기도, ‘코로나19’ 확산방지 위해 주한미군에 적극적 협조 당부 | https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=45050
30011 | 성남시 | 2020-06-11 | 성남시 “주방용 오물분쇄기 인증 제품만 쓰세요” | https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=95385
59990 | 경기도 | 2017-03-03 | 경기 연천, 구제역 발생 24일만에 이동제한 조치 해제 | https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=33195
35434 | 동두천시 | 2019-09-30 | 동두천시, “새 마음, 새 꿈”의 날개를 단 새마을지도자 | https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=92590
64302 | 고양시 | 2016-09-12 | 고양시, 민원·의료 등 추석 연휴 종합 대책 추진 | https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=73819
32128 | 경기도 | 2020-03-03 | 경기도 코로나19 발생 현황(2020.03.03) 10시 | https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=43551
60775 | 동두천시 | 2017-02-03 | 동두천시, 겨울철 안전 주제로 캠페인 실시 | https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=76194
8629 | 경기도 | 2023-03-12 | 경기도, 중소기업 협업·기술 융합에 최대 5천만 원 지원 | https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=56463
9077 | 군포시 | 2023-02-20 | 군포시청년청책협의회 열려...군포시 청년정책은 청년들이 직접 만든다 | https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=104881
29048 | 광주시 | 2020-07-27 | 무공수훈자회 광주시지회, 국가유공자 장례 선양사업 | https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=95831

Index | 기관명 | 일자 | 제목 | 링크URL
19551 | 포천시 | 2021-09-15 | 포천시, 365일 스마트 무인 도서관 운영 | https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=100091
12413 | 경기도 | 2022-09-02 | 경기도, ‘산업재해 사망사고 감축’ 위한 건설 현장 책임자 교육 개최 | https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=54304
34959 | 경기도 | 2019-10-27 | 경기도-노보시비리스크주, 보건의료 발전 위해 ‘맞손’ | https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=42204
5894 | 경기도 | 2023-07-18 | ‘경기도형 스마트공장’ 구축지원 65개 사 선정. 하반기부터 공장 구축 들어가 | https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=58127
58751 | 동두천시 | 2017-04-14 | 동두천시, "Geen동두천 환경교실" 개최 | https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=77463
51308 | 오산시 | 2018-01-09 | 2018 제1회 오산시협회장배 자선 농구대회 성료 | https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=82515
46404 | 포천시 | 2018-07-24 | 2018 포천시 디딤돌 취업박람회 개최 | https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=85744
55102 | 동두천시 | 2017-08-25 | 동두천시, 송내동 지역사회보장협의체 복지사각지대 지원방향 모색 | https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=79892
41242 | 경기도 | 2019-02-11 | 도, 지난해 47초당 1번씩 119구급차 출동 … 7월·50대·고혈압이 많았다 | https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=39520
33691 | 오산시 | 2019-12-17 | 오산시 129개 빅데이터 구축해 행정 활용한다 | https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=93518

Duplicate rows

Most frequently occurring

Index | 기관명 | 일자 | 제목 | 링크URL | # duplicates
0 | 경기도 | 2023-10-24 | 경기도, 세계 2위 전기차용 전력반도체 기업 미국 온세미 신소재 연구소ㆍ제조시설 준공. 지역 내 1천 명 고용 기대 | https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=59192 | 2
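
A duplicate row is one whose values match another row in every column. The sketch below shows one way to count such rows with the standard library, using a toy subset of rows taken from this report. The titles are omitted for brevity, so the tuples are an illustrative assumption rather than the full records.

```python
from collections import Counter

# Toy subset: (기관명, 일자, 링크URL). The first row is repeated on purpose,
# mirroring the single duplicate reported above.
rows = [
    ("경기도", "2023-10-24",
     "https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=59192"),
    ("성남시", "2020-06-11",
     "https://gnews.gg.go.kr/briefing/brief_sigun_view.do?BS_CODE=S003&number=95385"),
    ("경기도", "2023-10-24",
     "https://gnews.gg.go.kr/briefing/brief_gongbo_view.do?BS_CODE=S017&number=59192"),
]

# Count identical rows, then keep only those that occur more than once.
row_counts = Counter(rows)
duplicates = {row: n for row, n in row_counts.items() if n > 1}

for row, n in duplicates.items():
    print(n, row)
```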