Overview

Dataset statistics

Number of variables4
Number of observations2697
Missing cells14
Missing cells (%)0.1%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory84.4 KiB
Average record size in memory32.0 B

Variable types

DateTime1
Categorical1
Text2

Dataset

Description경기도 전자책 소장목록 현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=KAPA7HYXL2UXII9C4PE631433764&infSeq=1

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-12 23:47:23.832253
Analysis finished2024-03-12 23:47:24.475503
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1273
Distinct (%)47.2%
Missing0
Missing (%)0.0%
Memory size21.2 KiB
Minimum2006-09-25 00:00:00
Maximum2023-12-13 00:00:00
2024-03-13T08:47:24.533557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T08:47:24.640821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

분류내용
Categorical

Distinct29
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size21.2 KiB
본청
810 
의회/사업소/직속기관
715 
출연기관
192 
홍보
189 
업무계획,지침/편람
186 
Other values (24)
605 

Length

Max length11
Median length10
Mean length5.5813867
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row본청
2nd row의회/사업소/직속기관
3rd row의회/사업소/직속기관
4th row홍보
5th row본청

Common Values

ValueCountFrequency (%)
본청 810
30.0%
의회/사업소/직속기관 715
26.5%
출연기관 192
 
7.1%
홍보 189
 
7.0%
업무계획,지침/편람 186
 
6.9%
공약/비전 100
 
3.7%
디자인 88
 
3.3%
안내 등 79
 
2.9%
70
 
2.6%
경기도가족여성연구원 46
 
1.7%
Other values (19) 222
 
8.2%

Length

2024-03-13T08:47:24.743033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
본청 810
28.8%
의회/사업소/직속기관 715
25.4%
출연기관 192
 
6.8%
홍보 189
 
6.7%
업무계획,지침/편람 186
 
6.6%
공약/비전 100
 
3.6%
디자인 88
 
3.1%
안내 79
 
2.8%
79
 
2.8%
70
 
2.5%
Other values (22) 302
 
10.7%
Distinct2645
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size21.2 KiB
2024-03-13T08:47:24.956938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length43
Mean length15.732666
Min length3

Characters and Unicode

Total characters42431
Distinct characters643
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2601 ?
Unique (%)96.4%

Sample

1st row경기도 대기환경관리 시행계획(변경)
2nd row월간입법동향 11월호
3rd row의회소식지 229호
4th row재활용품 분리배출 가이드라인
5th rowG Life 2018년 11월호
ValueCountFrequency (%)
경기도 494
 
6.2%
의회소식지 131
 
1.6%
g 114
 
1.4%
life 114
 
1.4%
월간입법동향 106
 
1.3%
연구 99
 
1.2%
83
 
1.0%
나의 58
 
0.7%
위한 46
 
0.6%
입법동향 45
 
0.6%
Other values (3401) 6675
83.8%
2024-03-13T08:47:25.296092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5268
 
12.4%
2 1805
 
4.3%
0 1707
 
4.0%
1497
 
3.5%
1 1444
 
3.4%
1298
 
3.1%
1085
 
2.6%
849
 
2.0%
658
 
1.6%
638
 
1.5%
Other values (633) 26182
61.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27182
64.1%
Decimal Number 6993
 
16.5%
Space Separator 5268
 
12.4%
Lowercase Letter 953
 
2.2%
Uppercase Letter 571
 
1.3%
Open Punctuation 437
 
1.0%
Close Punctuation 407
 
1.0%
Other Punctuation 254
 
0.6%
Math Symbol 169
 
0.4%
Connector Punctuation 82
 
0.2%
Other values (2) 115
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1497
 
5.5%
1298
 
4.8%
1085
 
4.0%
849
 
3.1%
658
 
2.4%
638
 
2.3%
564
 
2.1%
500
 
1.8%
469
 
1.7%
435
 
1.6%
Other values (551) 19189
70.6%
Uppercase Letter
ValueCountFrequency (%)
G 174
30.5%
L 118
20.7%
I 53
 
9.3%
T 26
 
4.6%
V 25
 
4.4%
F 20
 
3.5%
E 19
 
3.3%
N 18
 
3.2%
O 16
 
2.8%
A 15
 
2.6%
Other values (14) 87
15.2%
Lowercase Letter
ValueCountFrequency (%)
i 211
22.1%
e 161
16.9%
f 117
12.3%
o 76
 
8.0%
l 51
 
5.4%
r 47
 
4.9%
s 39
 
4.1%
c 29
 
3.0%
u 29
 
3.0%
n 28
 
2.9%
Other values (12) 165
17.3%
Decimal Number
ValueCountFrequency (%)
2 1805
25.8%
0 1707
24.4%
1 1444
20.6%
3 395
 
5.6%
9 333
 
4.8%
4 315
 
4.5%
5 285
 
4.1%
6 253
 
3.6%
8 234
 
3.3%
7 222
 
3.2%
Other Punctuation
ValueCountFrequency (%)
. 100
39.4%
, 77
30.3%
& 19
 
7.5%
? 19
 
7.5%
: 12
 
4.7%
· 10
 
3.9%
! 6
 
2.4%
" 6
 
2.4%
' 4
 
1.6%
/ 1
 
0.4%
Math Symbol
ValueCountFrequency (%)
> 69
40.8%
< 69
40.8%
~ 28
16.6%
+ 2
 
1.2%
1
 
0.6%
Letter Number
ValueCountFrequency (%)
12
36.4%
11
33.3%
9
27.3%
1
 
3.0%
Open Punctuation
ValueCountFrequency (%)
( 436
99.8%
[ 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 406
99.8%
1
 
0.2%
Space Separator
ValueCountFrequency (%)
5268
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 82
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 82
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27091
63.8%
Common 13692
32.3%
Latin 1557
 
3.7%
Han 91
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1497
 
5.5%
1298
 
4.8%
1085
 
4.0%
849
 
3.1%
658
 
2.4%
638
 
2.4%
564
 
2.1%
500
 
1.8%
469
 
1.7%
435
 
1.6%
Other values (533) 19098
70.5%
Latin
ValueCountFrequency (%)
i 211
13.6%
G 174
 
11.2%
e 161
 
10.3%
L 118
 
7.6%
f 117
 
7.5%
o 76
 
4.9%
I 53
 
3.4%
l 51
 
3.3%
r 47
 
3.0%
s 39
 
2.5%
Other values (40) 510
32.8%
Common
ValueCountFrequency (%)
5268
38.5%
2 1805
 
13.2%
0 1707
 
12.5%
1 1444
 
10.5%
( 436
 
3.2%
) 406
 
3.0%
3 395
 
2.9%
9 333
 
2.4%
4 315
 
2.3%
5 285
 
2.1%
Other values (22) 1298
 
9.5%
Han
ValueCountFrequency (%)
18
19.8%
18
19.8%
18
19.8%
15
16.5%
4
 
4.4%
3
 
3.3%
3
 
3.3%
2
 
2.2%
1
 
1.1%
1
 
1.1%
Other values (8) 8
8.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27091
63.8%
ASCII 15204
35.8%
CJK 91
 
0.2%
Number Forms 33
 
0.1%
None 12
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5268
34.6%
2 1805
 
11.9%
0 1707
 
11.2%
1 1444
 
9.5%
( 436
 
2.9%
) 406
 
2.7%
3 395
 
2.6%
9 333
 
2.2%
4 315
 
2.1%
5 285
 
1.9%
Other values (65) 2810
18.5%
Hangul
ValueCountFrequency (%)
1497
 
5.5%
1298
 
4.8%
1085
 
4.0%
849
 
3.1%
658
 
2.4%
638
 
2.4%
564
 
2.1%
500
 
1.8%
469
 
1.7%
435
 
1.6%
Other values (533) 19098
70.5%
CJK
ValueCountFrequency (%)
18
19.8%
18
19.8%
18
19.8%
15
16.5%
4
 
4.4%
3
 
3.3%
3
 
3.3%
2
 
2.2%
1
 
1.1%
1
 
1.1%
Other values (8) 8
8.8%
Number Forms
ValueCountFrequency (%)
12
36.4%
11
33.3%
9
27.3%
1
 
3.0%
None
ValueCountFrequency (%)
· 10
83.3%
1
 
8.3%
1
 
8.3%

URL
Text

Distinct2679
Distinct (%)99.9%
Missing14
Missing (%)0.5%
Memory size21.2 KiB
2024-03-13T08:47:25.480128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length30
Mean length30.438315
Min length21

Characters and Unicode

Total characters81666
Distinct characters32
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2675 ?
Unique (%)99.7%

Sample

1st rowebook.gg.go.kr/20181126_094455
2nd rowebook.gg.go.kr/20181112_150321
3rd rowebook.gg.go.kr/20181107_104641
4th rowebook.gg.go.kr/20181107_113319
5th rowebook.gg.go.kr/20181102_101350
ValueCountFrequency (%)
ebook.gg.go.kr/20220610_100054 2
 
0.1%
ebook.gg.go.kr/20230720_104839 2
 
0.1%
ebook.gg.go.kr/20220603_173843 2
 
0.1%
ebook.gg.go.kr/20221101_101638 2
 
0.1%
ebook.gg.go.kr/20181126_094455 1
 
< 0.1%
ebook.gg.go.kr/20061018_180224_4 1
 
< 0.1%
ebook.gg.go.kr/20061018_141929_4 1
 
< 0.1%
ebook.gg.go.kr/20061018_133004_1 1
 
< 0.1%
ebook.gg.go.kr/20061018_141929_3 1
 
< 0.1%
ebook.gg.go.kr/20061018_133004_2 1
 
< 0.1%
Other values (2669) 2669
99.5%
2024-03-13T08:47:25.765595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 9091
11.1%
1 8303
10.2%
g 8065
9.9%
o 8055
9.9%
. 8049
9.9%
2 6934
 
8.5%
k 5368
 
6.6%
_ 3171
 
3.9%
e 2687
 
3.3%
r 2685
 
3.3%
Other values (22) 19258
23.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 38186
46.8%
Lowercase Letter 29577
36.2%
Other Punctuation 10732
 
13.1%
Connector Punctuation 3171
 
3.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
g 8065
27.3%
o 8055
27.2%
k 5368
18.1%
e 2687
 
9.1%
r 2685
 
9.1%
b 2684
 
9.1%
n 6
 
< 0.1%
a 4
 
< 0.1%
i 3
 
< 0.1%
m 3
 
< 0.1%
Other values (9) 17
 
0.1%
Decimal Number
ValueCountFrequency (%)
0 9091
23.8%
1 8303
21.7%
2 6934
18.2%
3 2580
 
6.8%
4 2385
 
6.2%
5 2271
 
5.9%
6 2015
 
5.3%
9 1655
 
4.3%
7 1649
 
4.3%
8 1303
 
3.4%
Other Punctuation
ValueCountFrequency (%)
. 8049
75.0%
/ 2683
 
25.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3171
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 52089
63.8%
Latin 29577
36.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
g 8065
27.3%
o 8055
27.2%
k 5368
18.1%
e 2687
 
9.1%
r 2685
 
9.1%
b 2684
 
9.1%
n 6
 
< 0.1%
a 4
 
< 0.1%
i 3
 
< 0.1%
m 3
 
< 0.1%
Other values (9) 17
 
0.1%
Common
ValueCountFrequency (%)
0 9091
17.5%
1 8303
15.9%
. 8049
15.5%
2 6934
13.3%
_ 3171
 
6.1%
/ 2683
 
5.2%
3 2580
 
5.0%
4 2385
 
4.6%
5 2271
 
4.4%
6 2015
 
3.9%
Other values (3) 4607
8.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 81666
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 9091
11.1%
1 8303
10.2%
g 8065
9.9%
o 8055
9.9%
. 8049
9.9%
2 6934
 
8.5%
k 5368
 
6.6%
_ 3171
 
3.9%
e 2687
 
3.3%
r 2685
 
3.3%
Other values (22) 19258
23.6%

Missing values

2024-03-13T08:47:24.150519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T08:47:24.448516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

제작일자분류내용자료명URL
02018-11-26본청경기도 대기환경관리 시행계획(변경)ebook.gg.go.kr/20181126_094455
12018-11-12의회/사업소/직속기관월간입법동향 11월호ebook.gg.go.kr/20181112_150321
22018-11-07의회/사업소/직속기관의회소식지 229호ebook.gg.go.kr/20181107_104641
32018-11-07홍보재활용품 분리배출 가이드라인ebook.gg.go.kr/20181107_113319
42018-11-02본청G Life 2018년 11월호ebook.gg.go.kr/20181102_101350
52018-10-30알기쉬운 2017 경기도 결산정보ebook.gg.go.kr/20181030_091750
62018-10-26의회/사업소/직속기관월간입법동향 10월호ebook.gg.go.kr/20181026_113539
72018-10-20의회/사업소/직속기관물은 자원이다(2017 수자원백서)ebook.gg.go.kr/20181020_092945
82018-10-17의회/사업소/직속기관제9대경기도의회 의정백서(양면합본)2권ebook.gg.go.kr/20181017_090001
92018-10-16의회/사업소/직속기관제9대경기도의회 의정백서(양면합본)1권ebook.gg.go.kr/20181016_173049
제작일자분류내용자료명URL
26872021-07-22의회/사업소/직속기관기후변화에 따른 경기도 농업분야 영향도 분석 및 농정 추진 전략 수립 연구ebook.gg.go.kr/20210722_150739
26882021-07-22의회/사업소/직속기관경기도 중소제조기업의 산업재해 예방 및 정책적 지원방안 연구ebook.gg.go.kr/20210722_141458
26892021-07-22의회/사업소/직속기관재외한인사회 및 지역 지방정부와 경기도의 교류 활성화 방안연구ebook.gg.go.kr/20210722_135426
26902021-07-20의회/사업소/직속기관경기도 내 범죄에 의한 사회적 비용 추정 연구ebook.gg.go.kr/20210720_181552
26912021-07-20의회/사업소/직속기관포스트코로나 대비 경기MICE 유니크베뉴 발굴 및 활성화전략 연구ebook.gg.go.kr/20210720_175836
26922021-07-20의회/사업소/직속기관경기도의회 운영혁신 방안에 관한 연구ebook.gg.go.kr/20210720_173007
26932021-07-20의회/사업소/직속기관행정구역 분리가 북부지역 시·군의 균등한 복지행정 서비스와 생활복지 SOC시설 지원에 미치는 영향ebook.gg.go.kr/20210720_171924
26942021-07-20의회/사업소/직속기관경기도 소비자 정책 방안 연구ebook.gg.go.kr/20210720_171125
26952021-07-20의회/사업소/직속기관소방업무 유형 분석과 업무성과 제고를 위한 연구ebook.gg.go.kr/20210720_165546
26962021-07-20의회/사업소/직속기관경기도 육아종합지원센터의 어린이집 지원사업 활성화 방안 연구ebook.gg.go.kr/20210720_163829

Duplicate rows

Most frequently occurring

제작일자분류내용자료명URL# duplicates
02022-06-03의회/사업소/직속기관의회소식지 262호ebook.gg.go.kr/20220603_1738432