Overview

Dataset statistics

Number of variables5
Number of observations4376
Missing cells1221
Missing cells (%)5.6%
Duplicate rows240
Duplicate rows (%)5.5%
Total size in memory171.1 KiB
Average record size in memory40.0 B

Variable types

Text4
DateTime1

Dataset

Description여성사전시관에서 관리하고 있는 도서 정보를 제공합니다. (도서명, 저자명, 출판사명, 출판연도, 데이터기준일자)
Author여성가족부
URLhttps://www.data.go.kr/data/15085795/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 240 (5.5%) duplicate rowsDuplicates
저자명 has 789 (18.0%) missing valuesMissing
출판사명 has 432 (9.9%) missing valuesMissing

Reproduction

Analysis started2023-12-12 06:59:43.751890
Analysis finished2023-12-12 06:59:45.312256
Duration1.56 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct3987
Distinct (%)91.1%
Missing0
Missing (%)0.0%
Memory size34.3 KiB
2023-12-12T15:59:45.592319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length116
Median length73
Mean length17.280393
Min length1

Characters and Unicode

Total characters75619
Distinct characters1243
Distinct categories17 ?
Distinct scripts7 ?
Distinct blocks12 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3735 ?
Unique (%)85.4%

Sample

1st row 독립을 향한 여성영웅들의 행진
2nd row 토지주택박물관 전시도록
3rd row"다문화 가정의 현화과 정책방안"-경제, 여성, 자녀교육문제 중심으로-
4th row"다시함께"와 함께걷기
5th row"다시함께"와 함께걷기 2
ValueCountFrequency (%)
280
 
1.6%
여성 140
 
0.8%
연구 138
 
0.8%
역사 103
 
0.6%
한국 103
 
0.6%
이야기 94
 
0.5%
위한 90
 
0.5%
1 79
 
0.5%
정책보고서 73
 
0.4%
2 71
 
0.4%
Other values (8301) 16186
93.3%
2023-12-12T15:59:46.186621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13415
 
17.7%
1657
 
2.2%
1586
 
2.1%
1523
 
2.0%
1293
 
1.7%
1 1137
 
1.5%
1079
 
1.4%
0 1069
 
1.4%
2 935
 
1.2%
861
 
1.1%
Other values (1233) 51064
67.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 51727
68.4%
Space Separator 13419
 
17.7%
Decimal Number 4789
 
6.3%
Lowercase Letter 1841
 
2.4%
Other Punctuation 1512
 
2.0%
Uppercase Letter 1253
 
1.7%
Dash Punctuation 443
 
0.6%
Close Punctuation 184
 
0.2%
Open Punctuation 183
 
0.2%
Math Symbol 170
 
0.2%
Other values (7) 98
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1657
 
3.2%
1586
 
3.1%
1523
 
2.9%
1293
 
2.5%
1079
 
2.1%
861
 
1.7%
852
 
1.6%
718
 
1.4%
687
 
1.3%
603
 
1.2%
Other values (1127) 40868
79.0%
Uppercase Letter
ValueCountFrequency (%)
E 125
 
10.0%
I 106
 
8.5%
A 96
 
7.7%
T 81
 
6.5%
O 80
 
6.4%
F 77
 
6.1%
N 75
 
6.0%
R 68
 
5.4%
S 67
 
5.3%
C 58
 
4.6%
Other values (17) 420
33.5%
Lowercase Letter
ValueCountFrequency (%)
e 263
14.3%
o 179
9.7%
n 177
9.6%
i 147
 
8.0%
t 140
 
7.6%
a 127
 
6.9%
r 126
 
6.8%
s 100
 
5.4%
l 83
 
4.5%
u 77
 
4.2%
Other values (15) 422
22.9%
Other Punctuation
ValueCountFrequency (%)
, 494
32.7%
: 380
25.1%
' 295
19.5%
. 177
 
11.7%
/ 54
 
3.6%
? 38
 
2.5%
" 34
 
2.2%
! 26
 
1.7%
4
 
0.3%
& 4
 
0.3%
Other values (3) 6
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 1137
23.7%
0 1069
22.3%
2 935
19.5%
3 341
 
7.1%
5 267
 
5.6%
9 266
 
5.6%
4 241
 
5.0%
6 195
 
4.1%
7 173
 
3.6%
8 165
 
3.4%
Math Symbol
ValueCountFrequency (%)
~ 60
35.3%
50
29.4%
> 26
15.3%
< 26
15.3%
+ 6
 
3.5%
× 1
 
0.6%
= 1
 
0.6%
Letter Number
ValueCountFrequency (%)
21
50.0%
9
21.4%
5
 
11.9%
5
 
11.9%
1
 
2.4%
1
 
2.4%
Dash Punctuation
ValueCountFrequency (%)
- 438
98.9%
4
 
0.9%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 174
94.6%
9
 
4.9%
1
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 173
94.5%
9
 
4.9%
1
 
0.5%
Space Separator
ValueCountFrequency (%)
13415
> 99.9%
  4
 
< 0.1%
Other Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Connector Punctuation
ValueCountFrequency (%)
_ 44
100.0%
Spacing Mark
ValueCountFrequency (%)
3
100.0%
Final Punctuation
ValueCountFrequency (%)
3
100.0%
Modifier Letter
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51110
67.6%
Common 20753
27.4%
Latin 3135
 
4.1%
Han 529
 
0.7%
Katakana 49
 
0.1%
Hiragana 42
 
0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1657
 
3.2%
1586
 
3.1%
1523
 
3.0%
1293
 
2.5%
1079
 
2.1%
861
 
1.7%
852
 
1.7%
718
 
1.4%
687
 
1.3%
603
 
1.2%
Other values (876) 40251
78.8%
Han
ValueCountFrequency (%)
30
 
5.7%
26
 
4.9%
24
 
4.5%
19
 
3.6%
18
 
3.4%
14
 
2.6%
14
 
2.6%
13
 
2.5%
11
 
2.1%
10
 
1.9%
Other values (202) 350
66.2%
Latin
ValueCountFrequency (%)
e 263
 
8.4%
o 179
 
5.7%
n 177
 
5.6%
i 147
 
4.7%
t 140
 
4.5%
a 127
 
4.1%
r 126
 
4.0%
E 125
 
4.0%
I 106
 
3.4%
s 100
 
3.2%
Other values (47) 1645
52.5%
Common
ValueCountFrequency (%)
13415
64.6%
1 1137
 
5.5%
0 1069
 
5.2%
2 935
 
4.5%
, 494
 
2.4%
- 438
 
2.1%
: 380
 
1.8%
3 341
 
1.6%
' 295
 
1.4%
5 267
 
1.3%
Other values (37) 1982
 
9.6%
Hiragana
ValueCountFrequency (%)
7
16.7%
4
 
9.5%
4
 
9.5%
4
 
9.5%
3
 
7.1%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
Other values (11) 11
26.2%
Katakana
ValueCountFrequency (%)
5
 
10.2%
4
 
8.2%
4
 
8.2%
4
 
8.2%
4
 
8.2%
4
 
8.2%
3
 
6.1%
3
 
6.1%
2
 
4.1%
2
 
4.1%
Other values (9) 14
28.6%
Greek
ValueCountFrequency (%)
Ι 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51093
67.6%
ASCII 23752
31.4%
CJK 498
 
0.7%
Katakana 51
 
0.1%
Math Operators 50
 
0.1%
Number Forms 42
 
0.1%
Hiragana 42
 
0.1%
CJK Compat Ideographs 31
 
< 0.1%
None 29
 
< 0.1%
Punctuation 15
 
< 0.1%
Other values (2) 16
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13415
56.5%
1 1137
 
4.8%
0 1069
 
4.5%
2 935
 
3.9%
, 494
 
2.1%
- 438
 
1.8%
: 380
 
1.6%
3 341
 
1.4%
' 295
 
1.2%
5 267
 
1.1%
Other values (72) 4981
 
21.0%
Hangul
ValueCountFrequency (%)
1657
 
3.2%
1586
 
3.1%
1523
 
3.0%
1293
 
2.5%
1079
 
2.1%
861
 
1.7%
852
 
1.7%
718
 
1.4%
687
 
1.3%
603
 
1.2%
Other values (861) 40234
78.7%
Math Operators
ValueCountFrequency (%)
50
100.0%
CJK
ValueCountFrequency (%)
30
 
6.0%
26
 
5.2%
19
 
3.8%
18
 
3.6%
14
 
2.8%
14
 
2.8%
13
 
2.6%
11
 
2.2%
10
 
2.0%
10
 
2.0%
Other values (195) 333
66.9%
CJK Compat Ideographs
ValueCountFrequency (%)
24
77.4%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
Number Forms
ValueCountFrequency (%)
21
50.0%
9
21.4%
5
 
11.9%
5
 
11.9%
1
 
2.4%
1
 
2.4%
None
ValueCountFrequency (%)
9
31.0%
9
31.0%
  4
13.8%
3
 
10.3%
1
 
3.4%
1
 
3.4%
× 1
 
3.4%
Ι 1
 
3.4%
Hiragana
ValueCountFrequency (%)
7
16.7%
4
 
9.5%
4
 
9.5%
4
 
9.5%
3
 
7.1%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
Other values (11) 11
26.2%
Katakana
ValueCountFrequency (%)
5
 
9.8%
4
 
7.8%
4
 
7.8%
4
 
7.8%
4
 
7.8%
4
 
7.8%
3
 
5.9%
3
 
5.9%
2
 
3.9%
2
 
3.9%
Other values (10) 16
31.4%
Punctuation
ValueCountFrequency (%)
4
26.7%
4
26.7%
3
20.0%
2
13.3%
1
 
6.7%
1
 
6.7%
Compat Jamo
ValueCountFrequency (%)
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (4) 4
28.6%
Enclosed Alphanum
ValueCountFrequency (%)
1
50.0%
1
50.0%

저자명
Text

MISSING 

Distinct2252
Distinct (%)62.8%
Missing789
Missing (%)18.0%
Memory size34.3 KiB
2023-12-12T15:59:46.472957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length100
Median length36
Mean length7.9163647
Min length1

Characters and Unicode

Total characters28396
Distinct characters707
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1809 ?
Unique (%)50.4%

Sample

1st row
2nd row다시함께센터
3rd row다시함께센터
4th row다시함께센터
5th row정선희
ValueCountFrequency (%)
296
 
4.4%
158
 
2.4%
142
 
2.1%
그림 101
 
1.5%
58
 
0.9%
숙명여자대학교 58
 
0.9%
한국여성연구소 45
 
0.7%
45
 
0.7%
퍼블릭아트 43
 
0.6%
옮김 38
 
0.6%
Other values (3340) 5692
85.3%
2023-12-12T15:59:46.959816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3247
 
11.4%
779
 
2.7%
, 717
 
2.5%
617
 
2.2%
583
 
2.1%
534
 
1.9%
494
 
1.7%
420
 
1.5%
405
 
1.4%
378
 
1.3%
Other values (697) 20222
71.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 23008
81.0%
Space Separator 3247
 
11.4%
Other Punctuation 1134
 
4.0%
Lowercase Letter 532
 
1.9%
Uppercase Letter 282
 
1.0%
Decimal Number 126
 
0.4%
Open Punctuation 21
 
0.1%
Close Punctuation 21
 
0.1%
Other Symbol 11
 
< 0.1%
Math Symbol 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
779
 
3.4%
617
 
2.7%
583
 
2.5%
534
 
2.3%
494
 
2.1%
420
 
1.8%
405
 
1.8%
378
 
1.6%
357
 
1.6%
350
 
1.5%
Other values (625) 18091
78.6%
Uppercase Letter
ValueCountFrequency (%)
A 31
 
11.0%
M 27
 
9.6%
B 26
 
9.2%
S 22
 
7.8%
E 21
 
7.4%
R 15
 
5.3%
C 13
 
4.6%
K 12
 
4.3%
T 12
 
4.3%
N 12
 
4.3%
Other values (14) 91
32.3%
Lowercase Letter
ValueCountFrequency (%)
e 77
14.5%
n 53
10.0%
a 47
 
8.8%
r 44
 
8.3%
o 42
 
7.9%
i 33
 
6.2%
s 32
 
6.0%
u 31
 
5.8%
t 27
 
5.1%
l 24
 
4.5%
Other values (13) 122
22.9%
Decimal Number
ValueCountFrequency (%)
1 24
19.0%
3 23
18.3%
4 19
15.1%
0 15
11.9%
5 13
10.3%
2 12
9.5%
8 9
 
7.1%
7 5
 
4.0%
9 4
 
3.2%
6 2
 
1.6%
Other Punctuation
ValueCountFrequency (%)
, 717
63.2%
/ 358
31.6%
. 49
 
4.3%
' 5
 
0.4%
: 4
 
0.4%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
< 3
33.3%
> 3
33.3%
2
22.2%
+ 1
 
11.1%
Space Separator
ValueCountFrequency (%)
3247
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Other Symbol
ValueCountFrequency (%)
11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22910
80.7%
Common 4563
 
16.1%
Latin 814
 
2.9%
Han 109
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
779
 
3.4%
617
 
2.7%
583
 
2.5%
534
 
2.3%
494
 
2.2%
420
 
1.8%
405
 
1.8%
378
 
1.6%
357
 
1.6%
350
 
1.5%
Other values (590) 17993
78.5%
Latin
ValueCountFrequency (%)
e 77
 
9.5%
n 53
 
6.5%
a 47
 
5.8%
r 44
 
5.4%
o 42
 
5.2%
i 33
 
4.1%
s 32
 
3.9%
u 31
 
3.8%
A 31
 
3.8%
t 27
 
3.3%
Other values (37) 397
48.8%
Han
ValueCountFrequency (%)
27
24.8%
6
 
5.5%
4
 
3.7%
4
 
3.7%
4
 
3.7%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (26) 49
45.0%
Common
ValueCountFrequency (%)
3247
71.2%
, 717
 
15.7%
/ 358
 
7.8%
. 49
 
1.1%
1 24
 
0.5%
3 23
 
0.5%
( 21
 
0.5%
) 21
 
0.5%
4 19
 
0.4%
0 15
 
0.3%
Other values (14) 69
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22899
80.6%
ASCII 5374
 
18.9%
CJK 106
 
0.4%
None 11
 
< 0.1%
CJK Compat Ideographs 3
 
< 0.1%
Math Operators 2
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3247
60.4%
, 717
 
13.3%
/ 358
 
6.7%
e 77
 
1.4%
n 53
 
1.0%
. 49
 
0.9%
a 47
 
0.9%
r 44
 
0.8%
o 42
 
0.8%
i 33
 
0.6%
Other values (59) 707
 
13.2%
Hangul
ValueCountFrequency (%)
779
 
3.4%
617
 
2.7%
583
 
2.5%
534
 
2.3%
494
 
2.2%
420
 
1.8%
405
 
1.8%
378
 
1.7%
357
 
1.6%
350
 
1.5%
Other values (589) 17982
78.5%
CJK
ValueCountFrequency (%)
27
25.5%
6
 
5.7%
4
 
3.8%
4
 
3.8%
4
 
3.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
3
 
2.8%
Other values (24) 46
43.4%
None
ValueCountFrequency (%)
11
100.0%
Math Operators
ValueCountFrequency (%)
2
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
2
66.7%
1
33.3%
Punctuation
ValueCountFrequency (%)
1
100.0%

출판사명
Text

MISSING 

Distinct1514
Distinct (%)38.4%
Missing432
Missing (%)9.9%
Memory size34.3 KiB
2023-12-12T15:59:47.274232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length37
Mean length6.3651116
Min length1

Characters and Unicode

Total characters25104
Distinct characters641
Distinct categories11 ?
Distinct scripts6 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique972 ?
Unique (%)24.6%

Sample

1st row국립여성사전시관
2nd row토지주택박물관
3rd row한국여성경제학회
4th row다시함께센터
5th row다시함께센터
ValueCountFrequency (%)
한국여성정책연구원 97
 
2.1%
국립민속박물관 87
 
1.9%
재)경기도가족여성연구원 78
 
1.7%
대한민국역사박물관 45
 
1.0%
여성가족부 40
 
0.9%
국립여성사전시관 38
 
0.8%
창비 37
 
0.8%
국학자료원 36
 
0.8%
문화 35
 
0.8%
한울 31
 
0.7%
Other values (1655) 4100
88.7%
2023-12-12T15:59:47.746387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1052
 
4.2%
763
 
3.0%
756
 
3.0%
724
 
2.9%
685
 
2.7%
595
 
2.4%
550
 
2.2%
538
 
2.1%
513
 
2.0%
464
 
1.8%
Other values (631) 18464
73.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22602
90.0%
Lowercase Letter 745
 
3.0%
Space Separator 725
 
2.9%
Uppercase Letter 459
 
1.8%
Other Punctuation 168
 
0.7%
Decimal Number 127
 
0.5%
Open Punctuation 125
 
0.5%
Close Punctuation 125
 
0.5%
Other Symbol 23
 
0.1%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1052
 
4.7%
763
 
3.4%
756
 
3.3%
685
 
3.0%
595
 
2.6%
550
 
2.4%
538
 
2.4%
513
 
2.3%
464
 
2.1%
453
 
2.0%
Other values (564) 16233
71.8%
Lowercase Letter
ValueCountFrequency (%)
e 100
13.4%
u 75
10.1%
r 71
9.5%
n 64
8.6%
o 63
8.5%
s 62
8.3%
a 55
7.4%
t 50
 
6.7%
i 39
 
5.2%
m 35
 
4.7%
Other values (14) 131
17.6%
Uppercase Letter
ValueCountFrequency (%)
M 52
 
11.3%
A 44
 
9.6%
E 35
 
7.6%
C 30
 
6.5%
F 27
 
5.9%
I 25
 
5.4%
S 24
 
5.2%
N 24
 
5.2%
O 22
 
4.8%
L 22
 
4.8%
Other values (13) 154
33.6%
Decimal Number
ValueCountFrequency (%)
0 44
34.6%
1 41
32.3%
2 16
 
12.6%
3 12
 
9.4%
5 8
 
6.3%
8 3
 
2.4%
7 2
 
1.6%
6 1
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 119
70.8%
/ 18
 
10.7%
. 13
 
7.7%
& 13
 
7.7%
' 5
 
3.0%
Space Separator
ValueCountFrequency (%)
724
99.9%
  1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 125
100.0%
Close Punctuation
ValueCountFrequency (%)
) 125
100.0%
Other Symbol
ValueCountFrequency (%)
23
100.0%
Math Symbol
ValueCountFrequency (%)
4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22393
89.2%
Common 1275
 
5.1%
Latin 1204
 
4.8%
Han 222
 
0.9%
Hiragana 6
 
< 0.1%
Katakana 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1052
 
4.7%
763
 
3.4%
756
 
3.4%
685
 
3.1%
595
 
2.7%
550
 
2.5%
538
 
2.4%
513
 
2.3%
464
 
2.1%
453
 
2.0%
Other values (473) 16024
71.6%
Han
ValueCountFrequency (%)
13
 
5.9%
11
 
5.0%
11
 
5.0%
9
 
4.1%
9
 
4.1%
9
 
4.1%
9
 
4.1%
8
 
3.6%
7
 
3.2%
7
 
3.2%
Other values (74) 129
58.1%
Latin
ValueCountFrequency (%)
e 100
 
8.3%
u 75
 
6.2%
r 71
 
5.9%
n 64
 
5.3%
o 63
 
5.2%
s 62
 
5.1%
a 55
 
4.6%
M 52
 
4.3%
t 50
 
4.2%
A 44
 
3.7%
Other values (37) 568
47.2%
Common
ValueCountFrequency (%)
724
56.8%
( 125
 
9.8%
) 125
 
9.8%
, 119
 
9.3%
0 44
 
3.5%
1 41
 
3.2%
/ 18
 
1.4%
2 16
 
1.3%
. 13
 
1.0%
& 13
 
1.0%
Other values (9) 37
 
2.9%
Hiragana
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%
Katakana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22370
89.1%
ASCII 2474
 
9.9%
CJK 214
 
0.9%
None 24
 
0.1%
CJK Compat Ideographs 8
 
< 0.1%
Hiragana 6
 
< 0.1%
Math Operators 4
 
< 0.1%
Katakana 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1052
 
4.7%
763
 
3.4%
756
 
3.4%
685
 
3.1%
595
 
2.7%
550
 
2.5%
538
 
2.4%
513
 
2.3%
464
 
2.1%
453
 
2.0%
Other values (472) 16001
71.5%
ASCII
ValueCountFrequency (%)
724
29.3%
( 125
 
5.1%
) 125
 
5.1%
, 119
 
4.8%
e 100
 
4.0%
u 75
 
3.0%
r 71
 
2.9%
n 64
 
2.6%
o 63
 
2.5%
s 62
 
2.5%
Other values (54) 946
38.2%
None
ValueCountFrequency (%)
23
95.8%
  1
 
4.2%
CJK
ValueCountFrequency (%)
13
 
6.1%
11
 
5.1%
11
 
5.1%
9
 
4.2%
9
 
4.2%
9
 
4.2%
9
 
4.2%
8
 
3.7%
7
 
3.3%
7
 
3.3%
Other values (71) 121
56.5%
CJK Compat Ideographs
ValueCountFrequency (%)
5
62.5%
2
 
25.0%
1
 
12.5%
Math Operators
ValueCountFrequency (%)
4
100.0%
Hiragana
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%
Katakana
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct65
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size34.3 KiB
2023-12-12T15:59:48.015705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length3.9968007
Min length2

Characters and Unicode

Total characters17490
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)0.1%

Sample

1st row2015
2nd row2015
3rd row2010
4th row2004
5th row2005
ValueCountFrequency (%)
2011 309
 
7.2%
2008 279
 
6.5%
2009 256
 
6.0%
2010 246
 
5.7%
2006 241
 
5.6%
2007 215
 
5.0%
2012 215
 
5.0%
2003 210
 
4.9%
2005 197
 
4.6%
2002 181
 
4.2%
Other values (52) 1935
45.2%
2023-12-12T15:59:48.409242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6135
35.1%
2 4131
23.6%
1 2724
15.6%
9 1462
 
8.4%
8 585
 
3.3%
7 493
 
2.8%
6 453
 
2.6%
3 403
 
2.3%
4 379
 
2.2%
5 371
 
2.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 17136
98.0%
Space Separator 354
 
2.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6135
35.8%
2 4131
24.1%
1 2724
15.9%
9 1462
 
8.5%
8 585
 
3.4%
7 493
 
2.9%
6 453
 
2.6%
3 403
 
2.4%
4 379
 
2.2%
5 371
 
2.2%
Space Separator
ValueCountFrequency (%)
354
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 17490
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6135
35.1%
2 4131
23.6%
1 2724
15.6%
9 1462
 
8.4%
8 585
 
3.3%
7 493
 
2.8%
6 453
 
2.6%
3 403
 
2.3%
4 379
 
2.2%
5 371
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17490
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6135
35.1%
2 4131
23.6%
1 2724
15.6%
9 1462
 
8.4%
8 585
 
3.3%
7 493
 
2.8%
6 453
 
2.6%
3 403
 
2.3%
4 379
 
2.2%
5 371
 
2.1%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size34.3 KiB
Minimum2021-08-06 00:00:00
Maximum2021-08-06 00:00:00
2023-12-12T15:59:48.572414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:59:48.667417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2023-12-12T15:59:45.034917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:59:45.145677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T15:59:45.254058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

도서명저자명출판사명출판연도데이터기준일자
0독립을 향한 여성영웅들의 행진<NA>국립여성사전시관20152021-08-06
1토지주택박물관 전시도록<NA>토지주택박물관20152021-08-06
2"다문화 가정의 현화과 정책방안"-경제, 여성, 자녀교육문제 중심으로-한국여성경제학회20102021-08-06
3"다시함께"와 함께걷기다시함께센터다시함께센터20042021-08-06
4"다시함께"와 함께걷기 2다시함께센터다시함께센터20052021-08-06
5"다시함께"와 함께걷기3다시함께센터다시함께센터20062021-08-06
6"이익을 만들고 행복을 나누는" 사회적 기업정선희다우20042021-08-06
7(자료로 본) 한국영화사 1권정종화열화당19972021-08-06
8(자료로 본) 한국영화사 2권정종화열화당19972021-08-06
91% 리더만 아는 유머대화법임붕영미래지식20122021-08-06
도서명저자명출판사명출판연도데이터기준일자
4366희망을 키우는 착한 소비: 커피, 바나나, 청바지에 담긴 공정무역의 역사니코 로전, 프란스 판 데어 호프 / 김영중 역서해문집20082021-08-06
4367흰곰 가족의 5층짜리 신발 가게오오데 유카코, 김영주 역북스토리아이20162021-08-06
4368흰둥이네 할머니송언 글/김성민 그림현암사20032021-08-06
4369金南日報 重要事件 20年史 1952年~1960年(上券)<NA>金南日報社2021-08-06
4370女四書김종권 역명문당19872021-08-06
4371女性史硏究入門<NA>歷史科学協議会2021-08-06
4372女性學新論 改訂版李花女子大學校 韓國女性硏究所<NA>2021-08-06
4373龍仁 瑞峰寺 (용인 서봉사지)수원대학교박물관용인시∙수원대학교박물관20092021-08-06
4374李聖子, 예술과 삶이지은, 강영주, 정영목, 심삼용생각의 나무20072021-08-06
4375梨花百年史 1886~1986심치선이화여자고등학교19942021-08-06

Duplicate rows

Most frequently occurring

도서명저자명출판사명출판연도데이터기준일자# duplicates
41구석구석 젠더정치남윤인순해피스토리20142021-08-0611
192013 구술자료집 살아있는 여성사 2002~2013<NA>국립여성사전시관20132021-08-066
83서간도에 들꽃 피다 <3>이윤옥얼레빗20112021-08-066
118여성사 강좌 1 新여성<NA>여성사 전시관20052021-08-066
170조선성자 방애인배은희두인2021-08-066
190페미니스트 저널 IF도서출판이프<NA>20032021-08-066
40교사를 위한 '청소년 노동인권 교육'국립여성사전시관국립여성사전시관20092021-08-065
48나를 만든 위대한 유산여성사전시관여성사전시관20082021-08-065
53대한민국의 미래 여성이 품다 국립여성사박물관<NA>국립여성사박물관 건립추진위원회20122021-08-065
105여성 60년사, 그 삶의 발자취여성부여성부20082021-08-065