Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory478.5 KiB
Average record size in memory49.0 B

Variable types

Text4
Numeric1

Dataset

Description광주광역시 인재교육원에서 보유중인 도서 정보 데이터입니다. 서명, 저자, 출판사, 발행년도 등의 항목을 제공합니다.
Author광주광역시
URLhttps://www.data.go.kr/data/15001667/fileData.do

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:20:33.787881
Analysis finished2023-12-12 17:20:36.164947
Duration2.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T02:20:36.441775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters70000
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowED10407
2nd rowED07461
3rd rowED10056
4th rowED10194
5th rowED00651
ValueCountFrequency (%)
ed10407 1
 
< 0.1%
ed11207 1
 
< 0.1%
ed07116 1
 
< 0.1%
ed08766 1
 
< 0.1%
ed00456 1
 
< 0.1%
ed02294 1
 
< 0.1%
ed05182 1
 
< 0.1%
ed06757 1
 
< 0.1%
ed05875 1
 
< 0.1%
ed10071 1
 
< 0.1%
Other values (9990) 9990
99.9%
2023-12-13T02:20:36.894171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 12435
17.8%
E 10000
14.3%
D 10000
14.3%
1 6956
9.9%
2 4491
 
6.4%
4 3900
 
5.6%
3 3859
 
5.5%
5 3854
 
5.5%
8 3749
 
5.4%
7 3715
 
5.3%
Other values (2) 7041
10.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 50000
71.4%
Uppercase Letter 20000
 
28.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 12435
24.9%
1 6956
13.9%
2 4491
 
9.0%
4 3900
 
7.8%
3 3859
 
7.7%
5 3854
 
7.7%
8 3749
 
7.5%
7 3715
 
7.4%
6 3693
 
7.4%
9 3348
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
E 10000
50.0%
D 10000
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 50000
71.4%
Latin 20000
 
28.6%

Most frequent character per script

Common
ValueCountFrequency (%)
0 12435
24.9%
1 6956
13.9%
2 4491
 
9.0%
4 3900
 
7.8%
3 3859
 
7.7%
5 3854
 
7.7%
8 3749
 
7.5%
7 3715
 
7.4%
6 3693
 
7.4%
9 3348
 
6.7%
Latin
ValueCountFrequency (%)
E 10000
50.0%
D 10000
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 70000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 12435
17.8%
E 10000
14.3%
D 10000
14.3%
1 6956
9.9%
2 4491
 
6.4%
4 3900
 
5.6%
3 3859
 
5.5%
5 3854
 
5.5%
8 3749
 
5.4%
7 3715
 
5.3%
Other values (2) 7041
10.1%

서명
Text

Distinct9066
Distinct (%)90.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T02:20:37.297819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length207
Median length132
Mean length18.7627
Min length1

Characters and Unicode

Total characters187627
Distinct characters1958
Distinct categories18 ?
Distinct scripts7 ?
Distinct blocks13 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8674 ?
Unique (%)86.7%

Sample

1st row각방 예찬
2nd row(합격의 힘 EBS 고졸검정고시) 국사
3rd row아들아 세상을 살아가는 지혜를 배우렴
4th row대륙의 딸. 하
5th row3ds Max 9.X 현장 실무 테크닉 57선
ValueCountFrequency (%)
2392
 
5.2%
2 473
 
1.0%
1 464
 
1.0%
320
 
0.7%
장편소설 279
 
0.6%
합격의 267
 
0.6%
위한 236
 
0.5%
이야기 201
 
0.4%
연구 175
 
0.4%
3 172
 
0.4%
Other values (19003) 40611
89.1%
2023-12-13T02:20:38.097598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35890
 
19.1%
3926
 
2.1%
: 2632
 
1.4%
2496
 
1.3%
2185
 
1.2%
. 1969
 
1.0%
1957
 
1.0%
1905
 
1.0%
1759
 
0.9%
1696
 
0.9%
Other values (1948) 131212
69.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 120118
64.0%
Space Separator 35890
 
19.1%
Lowercase Letter 12858
 
6.9%
Decimal Number 6634
 
3.5%
Other Punctuation 6388
 
3.4%
Uppercase Letter 1905
 
1.0%
Open Punctuation 1592
 
0.8%
Close Punctuation 1592
 
0.8%
Math Symbol 482
 
0.3%
Dash Punctuation 143
 
0.1%
Other values (8) 25
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3926
 
3.3%
2496
 
2.1%
2185
 
1.8%
1957
 
1.6%
1905
 
1.6%
1759
 
1.5%
1696
 
1.4%
1618
 
1.3%
1607
 
1.3%
1490
 
1.2%
Other values (1837) 99479
82.8%
Lowercase Letter
ValueCountFrequency (%)
e 1435
11.2%
n 1130
 
8.8%
a 1107
 
8.6%
o 1074
 
8.4%
t 1064
 
8.3%
i 1053
 
8.2%
r 882
 
6.9%
s 753
 
5.9%
l 530
 
4.1%
d 500
 
3.9%
Other values (16) 3330
25.9%
Uppercase Letter
ValueCountFrequency (%)
S 249
13.1%
E 210
 
11.0%
B 201
 
10.6%
A 130
 
6.8%
T 123
 
6.5%
C 109
 
5.7%
I 87
 
4.6%
M 78
 
4.1%
W 70
 
3.7%
O 68
 
3.6%
Other values (16) 580
30.4%
Other Punctuation
ValueCountFrequency (%)
: 2632
41.2%
. 1969
30.8%
, 1200
18.8%
· 190
 
3.0%
? 140
 
2.2%
! 87
 
1.4%
' 80
 
1.3%
/ 28
 
0.4%
& 17
 
0.3%
16
 
0.3%
Other values (7) 29
 
0.5%
Decimal Number
ValueCountFrequency (%)
1 1653
24.9%
2 1486
22.4%
0 1432
21.6%
3 542
 
8.2%
5 386
 
5.8%
4 344
 
5.2%
9 251
 
3.8%
6 211
 
3.2%
7 181
 
2.7%
8 148
 
2.2%
Open Punctuation
ValueCountFrequency (%)
( 1577
99.1%
[ 10
 
0.6%
2
 
0.1%
1
 
0.1%
1
 
0.1%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1577
99.1%
] 10
 
0.6%
2
 
0.1%
1
 
0.1%
1
 
0.1%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
= 446
92.5%
~ 14
 
2.9%
+ 13
 
2.7%
> 4
 
0.8%
< 4
 
0.8%
× 1
 
0.2%
Dash Punctuation
ValueCountFrequency (%)
- 142
99.3%
1
 
0.7%
Modifier Symbol
ValueCountFrequency (%)
´ 4
80.0%
` 1
 
20.0%
Other Symbol
ValueCountFrequency (%)
1
50.0%
1
50.0%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
35890
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 12
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Private Use
ValueCountFrequency (%)
1
100.0%
Other Number
ValueCountFrequency (%)
² 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 116820
62.3%
Common 52743
28.1%
Latin 14765
 
7.9%
Han 3242
 
1.7%
Hiragana 36
 
< 0.1%
Katakana 20
 
< 0.1%
Unknown 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3926
 
3.4%
2496
 
2.1%
2185
 
1.9%
1957
 
1.7%
1905
 
1.6%
1759
 
1.5%
1696
 
1.5%
1618
 
1.4%
1607
 
1.4%
1490
 
1.3%
Other values (1170) 96181
82.3%
Han
ValueCountFrequency (%)
99
 
3.1%
92
 
2.8%
82
 
2.5%
74
 
2.3%
60
 
1.9%
58
 
1.8%
54
 
1.7%
51
 
1.6%
41
 
1.3%
40
 
1.2%
Other values (614) 2591
79.9%
Common
ValueCountFrequency (%)
35890
68.0%
: 2632
 
5.0%
. 1969
 
3.7%
1 1653
 
3.1%
( 1577
 
3.0%
) 1577
 
3.0%
2 1486
 
2.8%
0 1432
 
2.7%
, 1200
 
2.3%
3 542
 
1.0%
Other values (46) 2785
 
5.3%
Latin
ValueCountFrequency (%)
e 1435
 
9.7%
n 1130
 
7.7%
a 1107
 
7.5%
o 1074
 
7.3%
t 1064
 
7.2%
i 1053
 
7.1%
r 882
 
6.0%
s 753
 
5.1%
l 530
 
3.6%
d 500
 
3.4%
Other values (44) 5237
35.5%
Hiragana
ValueCountFrequency (%)
6
16.7%
3
 
8.3%
3
 
8.3%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
1
 
2.8%
1
 
2.8%
1
 
2.8%
Other values (13) 13
36.1%
Katakana
ValueCountFrequency (%)
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (10) 10
50.0%
Unknown
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 116816
62.3%
ASCII 67263
35.8%
CJK 3158
 
1.7%
None 238
 
0.1%
CJK Compat Ideographs 84
 
< 0.1%
Hiragana 36
 
< 0.1%
Katakana 20
 
< 0.1%
Compat Jamo 4
 
< 0.1%
Punctuation 3
 
< 0.1%
Number Forms 2
 
< 0.1%
Other values (3) 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
35890
53.4%
: 2632
 
3.9%
. 1969
 
2.9%
1 1653
 
2.5%
( 1577
 
2.3%
) 1577
 
2.3%
2 1486
 
2.2%
e 1435
 
2.1%
0 1432
 
2.1%
, 1200
 
1.8%
Other values (77) 16412
24.4%
Hangul
ValueCountFrequency (%)
3926
 
3.4%
2496
 
2.1%
2185
 
1.9%
1957
 
1.7%
1905
 
1.6%
1759
 
1.5%
1696
 
1.5%
1618
 
1.4%
1607
 
1.4%
1490
 
1.3%
Other values (1169) 96177
82.3%
None
ValueCountFrequency (%)
· 190
79.8%
16
 
6.7%
11
 
4.6%
´ 4
 
1.7%
4
 
1.7%
2
 
0.8%
2
 
0.8%
1
 
0.4%
1
 
0.4%
1
 
0.4%
Other values (6) 6
 
2.5%
CJK
ValueCountFrequency (%)
99
 
3.1%
92
 
2.9%
82
 
2.6%
74
 
2.3%
60
 
1.9%
58
 
1.8%
54
 
1.7%
51
 
1.6%
41
 
1.3%
40
 
1.3%
Other values (590) 2507
79.4%
CJK Compat Ideographs
ValueCountFrequency (%)
18
21.4%
14
16.7%
8
9.5%
6
 
7.1%
5
 
6.0%
4
 
4.8%
4
 
4.8%
4
 
4.8%
2
 
2.4%
2
 
2.4%
Other values (14) 17
20.2%
Hiragana
ValueCountFrequency (%)
6
16.7%
3
 
8.3%
3
 
8.3%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
1
 
2.8%
1
 
2.8%
1
 
2.8%
Other values (13) 13
36.1%
Compat Jamo
ValueCountFrequency (%)
4
100.0%
Katakana
ValueCountFrequency (%)
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (10) 10
50.0%
Punctuation
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
PUA
ValueCountFrequency (%)
1
100.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%

저자
Text

Distinct7062
Distinct (%)70.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T02:20:38.452571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length107
Median length79
Mean length11.2935
Min length1

Characters and Unicode

Total characters112935
Distinct characters1388
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6086 ?
Unique (%)60.9%

Sample

1st row카우프만 장클로드
2nd row김은미 편저
3rd row필립 체스터필드
4th row장융
5th row강일웅, 웰기획 지음
ValueCountFrequency (%)
지음 5080
 
15.6%
2957
 
9.1%
옮김 1964
 
6.0%
735
 
2.3%
편저 643
 
2.0%
518
 
1.6%
ebs 324
 
1.0%
검정고시선생님 303
 
0.9%
253
 
0.8%
그림 214
 
0.7%
Other values (9835) 19521
60.0%
2023-12-13T02:20:38.961298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22565
 
20.0%
6139
 
5.4%
5807
 
5.1%
3777
 
3.3%
; 3348
 
3.0%
2303
 
2.0%
2029
 
1.8%
1595
 
1.4%
1465
 
1.3%
1392
 
1.2%
Other values (1378) 62515
55.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 81356
72.0%
Space Separator 22566
 
20.0%
Other Punctuation 5155
 
4.6%
Uppercase Letter 1901
 
1.7%
Lowercase Letter 664
 
0.6%
Close Punctuation 614
 
0.5%
Open Punctuation 613
 
0.5%
Decimal Number 42
 
< 0.1%
Dash Punctuation 14
 
< 0.1%
Math Symbol 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6139
 
7.5%
5807
 
7.1%
3777
 
4.6%
2303
 
2.8%
2029
 
2.5%
1595
 
2.0%
1465
 
1.8%
1392
 
1.7%
1028
 
1.3%
953
 
1.2%
Other values (1299) 54868
67.4%
Uppercase Letter
ValueCountFrequency (%)
S 480
25.2%
B 479
25.2%
E 448
23.6%
A 53
 
2.8%
J 53
 
2.8%
M 52
 
2.7%
C 47
 
2.5%
R 40
 
2.1%
K 31
 
1.6%
D 29
 
1.5%
Other values (16) 189
 
9.9%
Lowercase Letter
ValueCountFrequency (%)
o 72
10.8%
a 71
10.7%
r 63
 
9.5%
e 63
 
9.5%
i 62
 
9.3%
n 43
 
6.5%
l 35
 
5.3%
c 33
 
5.0%
t 32
 
4.8%
s 29
 
4.4%
Other values (14) 161
24.2%
Other Punctuation
ValueCountFrequency (%)
; 3348
64.9%
, 1029
 
20.0%
. 511
 
9.9%
· 250
 
4.8%
& 10
 
0.2%
: 4
 
0.1%
? 1
 
< 0.1%
/ 1
 
< 0.1%
1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 12
28.6%
2 9
21.4%
3 8
19.0%
1 6
14.3%
8 2
 
4.8%
9 2
 
4.8%
4 1
 
2.4%
6 1
 
2.4%
7 1
 
2.4%
Close Punctuation
ValueCountFrequency (%)
] 599
97.6%
) 12
 
2.0%
3
 
0.5%
Open Punctuation
ValueCountFrequency (%)
[ 598
97.6%
( 12
 
2.0%
3
 
0.5%
Space Separator
ValueCountFrequency (%)
22565
> 99.9%
  1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
< 5
50.0%
> 5
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 78881
69.8%
Common 29014
 
25.7%
Latin 2565
 
2.3%
Han 2475
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6139
 
7.8%
5807
 
7.4%
3777
 
4.8%
2303
 
2.9%
2029
 
2.6%
1595
 
2.0%
1465
 
1.9%
1392
 
1.8%
1028
 
1.3%
953
 
1.2%
Other values (829) 52393
66.4%
Han
ValueCountFrequency (%)
325
 
13.1%
108
 
4.4%
89
 
3.6%
69
 
2.8%
61
 
2.5%
50
 
2.0%
46
 
1.9%
44
 
1.8%
44
 
1.8%
35
 
1.4%
Other values (460) 1604
64.8%
Latin
ValueCountFrequency (%)
S 480
18.7%
B 479
18.7%
E 448
17.5%
o 72
 
2.8%
a 71
 
2.8%
r 63
 
2.5%
e 63
 
2.5%
i 62
 
2.4%
A 53
 
2.1%
J 53
 
2.1%
Other values (40) 721
28.1%
Common
ValueCountFrequency (%)
22565
77.8%
; 3348
 
11.5%
, 1029
 
3.5%
] 599
 
2.1%
[ 598
 
2.1%
. 511
 
1.8%
· 250
 
0.9%
- 14
 
< 0.1%
) 12
 
< 0.1%
( 12
 
< 0.1%
Other values (19) 76
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 78881
69.8%
ASCII 31321
 
27.7%
CJK 2348
 
2.1%
None 258
 
0.2%
CJK Compat Ideographs 127
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
22565
72.0%
; 3348
 
10.7%
, 1029
 
3.3%
] 599
 
1.9%
[ 598
 
1.9%
. 511
 
1.6%
S 480
 
1.5%
B 479
 
1.5%
E 448
 
1.4%
o 72
 
0.2%
Other values (64) 1192
 
3.8%
Hangul
ValueCountFrequency (%)
6139
 
7.8%
5807
 
7.4%
3777
 
4.8%
2303
 
2.9%
2029
 
2.6%
1595
 
2.0%
1465
 
1.9%
1392
 
1.8%
1028
 
1.3%
953
 
1.2%
Other values (829) 52393
66.4%
CJK
ValueCountFrequency (%)
325
 
13.8%
108
 
4.6%
89
 
3.8%
61
 
2.6%
50
 
2.1%
46
 
2.0%
44
 
1.9%
44
 
1.9%
35
 
1.5%
34
 
1.4%
Other values (439) 1512
64.4%
None
ValueCountFrequency (%)
· 250
96.9%
3
 
1.2%
3
 
1.2%
1
 
0.4%
  1
 
0.4%
CJK Compat Ideographs
ValueCountFrequency (%)
69
54.3%
12
 
9.4%
7
 
5.5%
6
 
4.7%
5
 
3.9%
4
 
3.1%
4
 
3.1%
3
 
2.4%
2
 
1.6%
2
 
1.6%
Other values (11) 13
 
10.2%
Distinct2901
Distinct (%)29.0%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-13T02:20:39.255806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length29
Mean length4.6333633
Min length1

Characters and Unicode

Total characters46329
Distinct characters907
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1717 ?
Unique (%)17.2%

Sample

1st row행성B잎새
2nd row지식과미래
3rd row글고은
4th row까치
5th row성안당
ValueCountFrequency (%)
지식과미래 503
 
4.8%
문학동네 169
 
1.6%
해냄 141
 
1.4%
국립방재연구소 123
 
1.2%
시공사 122
 
1.2%
김영사 122
 
1.2%
21세기북스 118
 
1.1%
한국지방행정연구원 116
 
1.1%
민음사 104
 
1.0%
광주광역시 90
 
0.9%
Other values (2933) 8772
84.5%
2023-12-13T02:20:39.685794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2098
 
4.5%
1146
 
2.5%
1081
 
2.3%
1024
 
2.2%
868
 
1.9%
843
 
1.8%
782
 
1.7%
740
 
1.6%
696
 
1.5%
676
 
1.5%
Other values (897) 36375
78.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43811
94.6%
Other Punctuation 551
 
1.2%
Uppercase Letter 546
 
1.2%
Lowercase Letter 461
 
1.0%
Space Separator 382
 
0.8%
Decimal Number 361
 
0.8%
Open Punctuation 106
 
0.2%
Close Punctuation 102
 
0.2%
Dash Punctuation 8
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2098
 
4.8%
1146
 
2.6%
1081
 
2.5%
1024
 
2.3%
868
 
2.0%
843
 
1.9%
782
 
1.8%
740
 
1.7%
696
 
1.6%
676
 
1.5%
Other values (825) 33857
77.3%
Uppercase Letter
ValueCountFrequency (%)
B 83
15.2%
K 81
14.8%
I 66
12.1%
L 58
10.6%
M 31
 
5.7%
O 29
 
5.3%
S 27
 
4.9%
H 25
 
4.6%
C 22
 
4.0%
R 22
 
4.0%
Other values (13) 102
18.7%
Lowercase Letter
ValueCountFrequency (%)
o 90
19.5%
s 47
10.2%
e 43
9.3%
k 35
 
7.6%
i 34
 
7.4%
a 30
 
6.5%
n 29
 
6.3%
b 26
 
5.6%
r 24
 
5.2%
t 15
 
3.3%
Other values (12) 88
19.1%
Other Punctuation
ValueCountFrequency (%)
: 402
73.0%
· 63
 
11.4%
. 21
 
3.8%
, 21
 
3.8%
& 19
 
3.4%
14
 
2.5%
; 6
 
1.1%
# 3
 
0.5%
1
 
0.2%
1
 
0.2%
Decimal Number
ValueCountFrequency (%)
2 159
44.0%
1 154
42.7%
3 11
 
3.0%
0 9
 
2.5%
9 8
 
2.2%
6 7
 
1.9%
4 4
 
1.1%
5 4
 
1.1%
8 4
 
1.1%
7 1
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 78
73.6%
[ 28
 
26.4%
Close Punctuation
ValueCountFrequency (%)
) 74
72.5%
] 28
 
27.5%
Space Separator
ValueCountFrequency (%)
382
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41848
90.3%
Han 1964
 
4.2%
Common 1510
 
3.3%
Latin 1007
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2098
 
5.0%
1146
 
2.7%
1081
 
2.6%
1024
 
2.4%
868
 
2.1%
843
 
2.0%
782
 
1.9%
740
 
1.8%
696
 
1.7%
676
 
1.6%
Other values (602) 31894
76.2%
Han
ValueCountFrequency (%)
287
 
14.6%
110
 
5.6%
101
 
5.1%
80
 
4.1%
66
 
3.4%
65
 
3.3%
59
 
3.0%
55
 
2.8%
52
 
2.6%
50
 
2.5%
Other values (214) 1039
52.9%
Latin
ValueCountFrequency (%)
o 90
 
8.9%
B 83
 
8.2%
K 81
 
8.0%
I 66
 
6.6%
L 58
 
5.8%
s 47
 
4.7%
e 43
 
4.3%
k 35
 
3.5%
i 34
 
3.4%
M 31
 
3.1%
Other values (35) 439
43.6%
Common
ValueCountFrequency (%)
: 402
26.6%
382
25.3%
2 159
 
10.5%
1 154
 
10.2%
( 78
 
5.2%
) 74
 
4.9%
· 63
 
4.2%
[ 28
 
1.9%
] 28
 
1.9%
. 21
 
1.4%
Other values (16) 121
 
8.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41847
90.3%
ASCII 2438
 
5.3%
CJK 1961
 
4.2%
None 80
 
0.2%
CJK Compat Ideographs 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2098
 
5.0%
1146
 
2.7%
1081
 
2.6%
1024
 
2.4%
868
 
2.1%
843
 
2.0%
782
 
1.9%
740
 
1.8%
696
 
1.7%
676
 
1.6%
Other values (601) 31893
76.2%
ASCII
ValueCountFrequency (%)
: 402
16.5%
382
15.7%
2 159
 
6.5%
1 154
 
6.3%
o 90
 
3.7%
B 83
 
3.4%
K 81
 
3.3%
( 78
 
3.2%
) 74
 
3.0%
I 66
 
2.7%
Other values (57) 869
35.6%
CJK
ValueCountFrequency (%)
287
 
14.6%
110
 
5.6%
101
 
5.2%
80
 
4.1%
66
 
3.4%
65
 
3.3%
59
 
3.0%
55
 
2.8%
52
 
2.7%
50
 
2.5%
Other values (212) 1036
52.8%
None
ValueCountFrequency (%)
· 63
78.8%
14
 
17.5%
1
 
1.2%
1
 
1.2%
1
 
1.2%
CJK Compat Ideographs
ValueCountFrequency (%)
2
66.7%
1
33.3%

발행년도
Real number (ℝ)

Distinct56
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2007.341
Minimum1961
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T02:20:39.850157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1961
5-th percentile1994
Q12000
median2008
Q32014
95-th percentile2021
Maximum2023
Range62
Interquartile range (IQR)14

Descriptive statistics

Standard deviation8.9973448
Coefficient of variation (CV)0.0044822204
Kurtosis-0.28268305
Mean2007.341
Median Absolute Deviation (MAD)7
Skewness-0.26820862
Sum20073410
Variance80.952214
MonotonicityNot monotonic
2023-12-13T02:20:39.985708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2011 621
 
6.2%
2006 491
 
4.9%
1995 457
 
4.6%
2007 454
 
4.5%
1996 425
 
4.2%
2010 413
 
4.1%
2009 393
 
3.9%
2014 365
 
3.6%
2008 348
 
3.5%
1999 342
 
3.4%
Other values (46) 5691
56.9%
ValueCountFrequency (%)
1961 1
 
< 0.1%
1963 3
< 0.1%
1970 1
 
< 0.1%
1971 1
 
< 0.1%
1972 2
 
< 0.1%
1973 3
< 0.1%
1974 1
 
< 0.1%
1975 1
 
< 0.1%
1976 4
< 0.1%
1977 5
0.1%
ValueCountFrequency (%)
2023 250
2.5%
2022 247
2.5%
2021 248
2.5%
2020 298
3.0%
2019 243
2.4%
2018 243
2.4%
2017 277
2.8%
2016 296
3.0%
2015 298
3.0%
2014 365
3.6%

Interactions

2023-12-13T02:20:35.879118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T02:20:36.002604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:20:36.109672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호서명저자출판사발행년도
9435ED10407각방 예찬카우프만 장클로드행성B잎새2017
6955ED07461(합격의 힘 EBS 고졸검정고시) 국사김은미 편저지식과미래2014
9088ED10056아들아 세상을 살아가는 지혜를 배우렴필립 체스터필드글고은2015
9225ED10194대륙의 딸. 하장융까치2016
643ED006513ds Max 9.X 현장 실무 테크닉 57선강일웅, 웰기획 지음성안당2007
7644ED08159광주통계연보. 51회광주광역시 편광주직할시2011
1474ED01575眞理가 너희를 自由케 하리라 = (The)truth shall make you free김경선 저여운사1996
7859ED08374남북정치공동체 형성방안김동수 지음통일부통일교육원2015
4013ED04235춘호. 1, 그들만의 둥지 = Chunho : 이진수 신작 장편소설이진수 지음열매출판사2002
6858ED07357퇴계와 고봉, 편지를 쓰다퇴계 ; 고봉 지음 ; 김영두 옮김소나무2003
번호서명저자출판사발행년도
10487ED11481그림자 없는 남자조이스 캐럴 오츠위즈덤하우스2019
3392ED03578탐그루. 5김상현 저명상1998
9095ED10063여자가 절대 포기하지 말아야 할 것들박금선갤리온2016
7244ED07750(합격의 힘 고입검정고시특강) 사회EBS 검정고시선생님 편저지식과미래2010
3980ED04202일본여자를 말한다유재순 지음창해,1998
9005ED09567몽점일지진사원 글 ; 김재두 譯注은행나무2008
3782ED03982(사진과 함께 읽는) 삼국유사일연 ; 리상호 옮김 ; 강운구 사진까치2009
8057ED085721등 광주건설 5개년계획광주광역시 편광주광역시2006
7325ED07831(합격의 힘 고입검정고시특강) 국어EBS 검정고시선생님 편저지식과미래2013
4165ED04398핫나경. 3전준상 지음핫나경2004