Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells5
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory468.8 KiB
Average record size in memory48.0 B

Variable types

Text5

Dataset

Description전라남도 인재개발원 자료실(도서실)에서 보유하고 있는 자료들의 목록입니다. 자료 구성은 행정자료_영상자료(DVD)_일반도서로 구분되어 있습니다.
Author전라남도
URLhttps://www.data.go.kr/data/15067344/fileData.do

Alerts

등록번호 has unique valuesUnique

Reproduction

Analysis started2024-04-20 19:30:04.154088
Analysis finished2024-04-20 19:30:08.837527
Duration4.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T04:30:09.532967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters120000
Distinct characters15
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowIM0000020309
2nd rowIM0000008326
3rd rowIM0000028565
4th rowPA0000000252
5th rowIM0000007262
ValueCountFrequency (%)
im0000020309 1
 
< 0.1%
im0000005409 1
 
< 0.1%
im0000018201 1
 
< 0.1%
im0000024584 1
 
< 0.1%
im0000003518 1
 
< 0.1%
im0000026122 1
 
< 0.1%
im0000021467 1
 
< 0.1%
im0000003891 1
 
< 0.1%
im0000006198 1
 
< 0.1%
im0000003600 1
 
< 0.1%
Other values (9990) 9990
99.9%
2024-04-21T04:30:10.757197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 57636
48.0%
I 9612
 
8.0%
M 9612
 
8.0%
2 7360
 
6.1%
1 7211
 
6.0%
3 4774
 
4.0%
5 3942
 
3.3%
6 3919
 
3.3%
7 3911
 
3.3%
8 3886
 
3.2%
Other values (5) 8137
 
6.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 100000
83.3%
Uppercase Letter 20000
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 57636
57.6%
2 7360
 
7.4%
1 7211
 
7.2%
3 4774
 
4.8%
5 3942
 
3.9%
6 3919
 
3.9%
7 3911
 
3.9%
8 3886
 
3.9%
4 3825
 
3.8%
9 3536
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
I 9612
48.1%
M 9612
48.1%
A 388
 
1.9%
P 270
 
1.4%
V 118
 
0.6%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
83.3%
Latin 20000
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 57636
57.6%
2 7360
 
7.4%
1 7211
 
7.2%
3 4774
 
4.8%
5 3942
 
3.9%
6 3919
 
3.9%
7 3911
 
3.9%
8 3886
 
3.9%
4 3825
 
3.8%
9 3536
 
3.5%
Latin
ValueCountFrequency (%)
I 9612
48.1%
M 9612
48.1%
A 388
 
1.9%
P 270
 
1.4%
V 118
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 57636
48.0%
I 9612
 
8.0%
M 9612
 
8.0%
2 7360
 
6.1%
1 7211
 
6.0%
3 4774
 
4.0%
5 3942
 
3.3%
6 3919
 
3.3%
7 3911
 
3.3%
8 3886
 
3.2%
Other values (5) 8137
 
6.8%

서명
Text

Distinct9664
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-21T04:30:11.965555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length180
Median length89
Mean length17.3089
Min length1

Characters and Unicode

Total characters173089
Distinct characters2107
Distinct categories17 ?
Distinct scripts5 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9377 ?
Unique (%)93.8%

Sample

1st row꿈꾸는 씨앗_영혼을 위한 101가지 이야기
2nd row主觀式 勞動法演習
3rd row세라 이야기
4th row全南水産
5th row서양문화사
ValueCountFrequency (%)
1 461
 
1.2%
2 449
 
1.2%
장편소설 275
 
0.7%
3 222
 
0.6%
이야기 180
 
0.5%
위한 155
 
0.4%
5 121
 
0.3%
4 116
 
0.3%
the 98
 
0.3%
of 97
 
0.2%
Other values (20252) 36672
94.4%
2024-04-21T04:30:13.721003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28862
 
16.7%
4148
 
2.4%
_ 3549
 
2.1%
2515
 
1.5%
. 2415
 
1.4%
2331
 
1.3%
2037
 
1.2%
1878
 
1.1%
1622
 
0.9%
, 1622
 
0.9%
Other values (2097) 122110
70.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 117250
67.7%
Space Separator 28862
 
16.7%
Lowercase Letter 8735
 
5.0%
Decimal Number 5785
 
3.3%
Other Punctuation 4251
 
2.5%
Connector Punctuation 3549
 
2.1%
Uppercase Letter 2483
 
1.4%
Close Punctuation 984
 
0.6%
Open Punctuation 984
 
0.6%
Dash Punctuation 115
 
0.1%
Other values (7) 91
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4148
 
3.5%
2515
 
2.1%
2331
 
2.0%
2037
 
1.7%
1878
 
1.6%
1622
 
1.4%
1559
 
1.3%
1489
 
1.3%
1422
 
1.2%
1416
 
1.2%
Other values (1995) 96833
82.6%
Lowercase Letter
ValueCountFrequency (%)
e 1069
12.2%
o 842
 
9.6%
i 711
 
8.1%
a 682
 
7.8%
n 680
 
7.8%
t 627
 
7.2%
r 574
 
6.6%
s 516
 
5.9%
h 406
 
4.6%
l 402
 
4.6%
Other values (16) 2226
25.5%
Uppercase Letter
ValueCountFrequency (%)
T 257
 
10.4%
S 228
 
9.2%
E 191
 
7.7%
A 185
 
7.5%
C 147
 
5.9%
I 124
 
5.0%
O 122
 
4.9%
M 120
 
4.8%
N 115
 
4.6%
B 106
 
4.3%
Other values (16) 888
35.8%
Other Punctuation
ValueCountFrequency (%)
. 2415
56.8%
, 1622
38.2%
· 138
 
3.2%
/ 36
 
0.8%
& 20
 
0.5%
; 7
 
0.2%
7
 
0.2%
4
 
0.1%
@ 1
 
< 0.1%
1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 1504
26.0%
2 1060
18.3%
0 795
13.7%
3 568
 
9.8%
9 432
 
7.5%
5 382
 
6.6%
4 368
 
6.4%
6 233
 
4.0%
7 230
 
4.0%
8 213
 
3.7%
Math Symbol
ValueCountFrequency (%)
~ 23
40.4%
+ 13
22.8%
> 9
 
15.8%
< 9
 
15.8%
| 2
 
3.5%
= 1
 
1.8%
Close Punctuation
ValueCountFrequency (%)
) 969
98.5%
] 9
 
0.9%
4
 
0.4%
1
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 969
98.5%
[ 9
 
0.9%
4
 
0.4%
1
 
0.1%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
12
52.2%
6
26.1%
2
 
8.7%
2
 
8.7%
1
 
4.3%
Other Symbol
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
28862
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3549
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 115
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 4
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Other Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 112182
64.8%
Common 44598
 
25.8%
Latin 11241
 
6.5%
Han 5059
 
2.9%
Hiragana 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4148
 
3.7%
2515
 
2.2%
2331
 
2.1%
2037
 
1.8%
1878
 
1.7%
1622
 
1.4%
1559
 
1.4%
1489
 
1.3%
1422
 
1.3%
1416
 
1.3%
Other values (1181) 91765
81.8%
Han
ValueCountFrequency (%)
144
 
2.8%
111
 
2.2%
98
 
1.9%
96
 
1.9%
90
 
1.8%
78
 
1.5%
74
 
1.5%
73
 
1.4%
70
 
1.4%
67
 
1.3%
Other values (798) 4158
82.2%
Latin
ValueCountFrequency (%)
e 1069
 
9.5%
o 842
 
7.5%
i 711
 
6.3%
a 682
 
6.1%
n 680
 
6.0%
t 627
 
5.6%
r 574
 
5.1%
s 516
 
4.6%
h 406
 
3.6%
l 402
 
3.6%
Other values (47) 4732
42.1%
Common
ValueCountFrequency (%)
28862
64.7%
_ 3549
 
8.0%
. 2415
 
5.4%
, 1622
 
3.6%
1 1504
 
3.4%
2 1060
 
2.4%
) 969
 
2.2%
( 969
 
2.2%
0 795
 
1.8%
3 568
 
1.3%
Other values (35) 2285
 
5.1%
Hiragana
ValueCountFrequency (%)
3
33.3%
2
22.2%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 112172
64.8%
ASCII 55647
32.1%
CJK 4896
 
2.8%
CJK Compat Ideographs 163
 
0.1%
None 155
 
0.1%
Number Forms 23
 
< 0.1%
Punctuation 11
 
< 0.1%
Compat Jamo 10
 
< 0.1%
Hiragana 9
 
< 0.1%
Misc Symbols 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28862
51.9%
_ 3549
 
6.4%
. 2415
 
4.3%
, 1622
 
2.9%
1 1504
 
2.7%
e 1069
 
1.9%
2 1060
 
1.9%
) 969
 
1.7%
( 969
 
1.7%
o 842
 
1.5%
Other values (72) 12786
23.0%
Hangul
ValueCountFrequency (%)
4148
 
3.7%
2515
 
2.2%
2331
 
2.1%
2037
 
1.8%
1878
 
1.7%
1622
 
1.4%
1559
 
1.4%
1489
 
1.3%
1422
 
1.3%
1416
 
1.3%
Other values (1174) 91755
81.8%
CJK
ValueCountFrequency (%)
144
 
2.9%
111
 
2.3%
98
 
2.0%
96
 
2.0%
90
 
1.8%
78
 
1.6%
74
 
1.5%
73
 
1.5%
70
 
1.4%
67
 
1.4%
Other values (769) 3995
81.6%
None
ValueCountFrequency (%)
· 138
89.0%
4
 
2.6%
4
 
2.6%
4
 
2.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
1
 
0.6%
CJK Compat Ideographs
ValueCountFrequency (%)
30
18.4%
29
17.8%
19
11.7%
15
9.2%
13
 
8.0%
7
 
4.3%
5
 
3.1%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (19) 33
20.2%
Number Forms
ValueCountFrequency (%)
12
52.2%
6
26.1%
2
 
8.7%
2
 
8.7%
1
 
4.3%
Punctuation
ValueCountFrequency (%)
7
63.6%
2
 
18.2%
2
 
18.2%
Compat Jamo
ValueCountFrequency (%)
4
40.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
Hiragana
ValueCountFrequency (%)
3
33.3%
2
22.2%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
Misc Symbols
ValueCountFrequency (%)
1
50.0%
1
50.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Distinct7932
Distinct (%)79.3%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-21T04:30:14.924020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length342
Median length221
Mean length10.470847
Min length2

Characters and Unicode

Total characters104698
Distinct characters1490
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6948 ?
Unique (%)69.5%

Sample

1st row브라이언 카바노프지음 ; 주연진 옮김
2nd row金致善
3rd row프랜시스 호즈슨 버넷 글 ; 타샤 튜더 그림 ; 햇살과 나무꾼 옮김
4th row全羅南道 編.
5th row문석홍 지음
ValueCountFrequency (%)
지음 6039
 
19.3%
2624
 
8.4%
옮김 1771
 
5.7%
666
 
2.1%
430
 
1.4%
415
 
1.3%
엮음 149
 
0.5%
그림 137
 
0.4%
131
 
0.4%
123
 
0.4%
Other values (10839) 18852
60.2%
2024-04-21T04:30:16.653482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22213
 
21.2%
6630
 
6.3%
6277
 
6.0%
3628
 
3.5%
; 2611
 
2.5%
2221
 
2.1%
1809
 
1.7%
, 1449
 
1.4%
1004
 
1.0%
991
 
0.9%
Other values (1480) 55865
53.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 75931
72.5%
Space Separator 22213
 
21.2%
Other Punctuation 4681
 
4.5%
Uppercase Letter 693
 
0.7%
Lowercase Letter 485
 
0.5%
Close Punctuation 274
 
0.3%
Open Punctuation 273
 
0.3%
Connector Punctuation 73
 
0.1%
Decimal Number 50
 
< 0.1%
Math Symbol 18
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6630
 
8.7%
6277
 
8.3%
3628
 
4.8%
2221
 
2.9%
1809
 
2.4%
1004
 
1.3%
991
 
1.3%
876
 
1.2%
860
 
1.1%
783
 
1.0%
Other values (1404) 50852
67.0%
Uppercase Letter
ValueCountFrequency (%)
J 79
 
11.4%
S 57
 
8.2%
B 48
 
6.9%
C 47
 
6.8%
A 44
 
6.3%
K 43
 
6.2%
E 43
 
6.2%
H 41
 
5.9%
R 41
 
5.9%
M 33
 
4.8%
Other values (14) 217
31.3%
Lowercase Letter
ValueCountFrequency (%)
e 69
14.2%
n 56
11.5%
r 52
10.7%
a 42
8.7%
i 40
8.2%
s 34
 
7.0%
o 33
 
6.8%
t 29
 
6.0%
l 17
 
3.5%
u 15
 
3.1%
Other values (13) 98
20.2%
Decimal Number
ValueCountFrequency (%)
2 8
16.0%
1 7
14.0%
3 6
12.0%
4 6
12.0%
9 5
10.0%
8 4
8.0%
6 4
8.0%
0 4
8.0%
5 3
 
6.0%
7 3
 
6.0%
Other Punctuation
ValueCountFrequency (%)
; 2611
55.8%
, 1449
31.0%
. 541
 
11.6%
· 73
 
1.6%
/ 4
 
0.1%
1
 
< 0.1%
& 1
 
< 0.1%
\ 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
] 203
74.1%
) 70
 
25.5%
1
 
0.4%
Open Punctuation
ValueCountFrequency (%)
[ 202
74.0%
( 70
 
25.6%
1
 
0.4%
Math Symbol
ValueCountFrequency (%)
> 9
50.0%
< 9
50.0%
Space Separator
ValueCountFrequency (%)
22213
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 73
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 72262
69.0%
Common 27589
 
26.4%
Han 3669
 
3.5%
Latin 1178
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6630
 
9.2%
6277
 
8.7%
3628
 
5.0%
2221
 
3.1%
1809
 
2.5%
1004
 
1.4%
991
 
1.4%
876
 
1.2%
860
 
1.2%
783
 
1.1%
Other values (815) 47183
65.3%
Han
ValueCountFrequency (%)
484
 
13.2%
170
 
4.6%
157
 
4.3%
151
 
4.1%
94
 
2.6%
62
 
1.7%
60
 
1.6%
56
 
1.5%
49
 
1.3%
43
 
1.2%
Other values (579) 2343
63.9%
Latin
ValueCountFrequency (%)
J 79
 
6.7%
e 69
 
5.9%
S 57
 
4.8%
n 56
 
4.8%
r 52
 
4.4%
B 48
 
4.1%
C 47
 
4.0%
A 44
 
3.7%
K 43
 
3.7%
E 43
 
3.7%
Other values (37) 640
54.3%
Common
ValueCountFrequency (%)
22213
80.5%
; 2611
 
9.5%
, 1449
 
5.3%
. 541
 
2.0%
] 203
 
0.7%
[ 202
 
0.7%
· 73
 
0.3%
_ 73
 
0.3%
( 70
 
0.3%
) 70
 
0.3%
Other values (19) 84
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 72262
69.0%
ASCII 28691
 
27.4%
CJK 3532
 
3.4%
CJK Compat Ideographs 137
 
0.1%
None 76
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
22213
77.4%
; 2611
 
9.1%
, 1449
 
5.1%
. 541
 
1.9%
] 203
 
0.7%
[ 202
 
0.7%
J 79
 
0.3%
_ 73
 
0.3%
( 70
 
0.2%
) 70
 
0.2%
Other values (62) 1180
 
4.1%
Hangul
ValueCountFrequency (%)
6630
 
9.2%
6277
 
8.7%
3628
 
5.0%
2221
 
3.1%
1809
 
2.5%
1004
 
1.4%
991
 
1.4%
876
 
1.2%
860
 
1.2%
783
 
1.1%
Other values (815) 47183
65.3%
CJK
ValueCountFrequency (%)
484
 
13.7%
170
 
4.8%
157
 
4.4%
151
 
4.3%
62
 
1.8%
60
 
1.7%
56
 
1.6%
49
 
1.4%
43
 
1.2%
41
 
1.2%
Other values (557) 2259
64.0%
CJK Compat Ideographs
ValueCountFrequency (%)
94
68.6%
6
 
4.4%
5
 
3.6%
4
 
2.9%
4
 
2.9%
4
 
2.9%
2
 
1.5%
2
 
1.5%
2
 
1.5%
2
 
1.5%
Other values (12) 12
 
8.8%
None
ValueCountFrequency (%)
· 73
96.1%
1
 
1.3%
1
 
1.3%
1
 
1.3%
Distinct3178
Distinct (%)31.8%
Missing3
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-21T04:30:17.924276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length38
Mean length4.4886466
Min length1

Characters and Unicode

Total characters44873
Distinct characters978
Distinct categories10 ?
Distinct scripts5 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1936 ?
Unique (%)19.4%

Sample

1st row동해출판
2nd row博英社
3rd row시공주니어
4th row전라남도
5th row서울대학교출판부
ValueCountFrequency (%)
박영사 151
 
1.4%
김영사 133
 
1.3%
문학동네 109
 
1.0%
전라남도지방공무원교육원 103
 
1.0%
위즈덤하우스 94
 
0.9%
민음사 90
 
0.9%
고려원 83
 
0.8%
전라남도 83
 
0.8%
한길사 81
 
0.8%
법문사 72
 
0.7%
Other values (3233) 9432
90.4%
2024-04-21T04:30:19.608174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3195
 
7.1%
1425
 
3.2%
1083
 
2.4%
946
 
2.1%
923
 
2.1%
817
 
1.8%
676
 
1.5%
643
 
1.4%
637
 
1.4%
609
 
1.4%
Other values (968) 33919
75.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42420
94.5%
Lowercase Letter 654
 
1.5%
Uppercase Letter 581
 
1.3%
Space Separator 442
 
1.0%
Decimal Number 209
 
0.5%
Close Punctuation 190
 
0.4%
Open Punctuation 189
 
0.4%
Connector Punctuation 108
 
0.2%
Other Punctuation 78
 
0.2%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3195
 
7.5%
1425
 
3.4%
1083
 
2.6%
946
 
2.2%
923
 
2.2%
817
 
1.9%
676
 
1.6%
643
 
1.5%
637
 
1.5%
609
 
1.4%
Other values (898) 31466
74.2%
Uppercase Letter
ValueCountFrequency (%)
B 95
16.4%
M 64
11.0%
H 48
 
8.3%
P 47
 
8.1%
K 43
 
7.4%
O 34
 
5.9%
C 30
 
5.2%
R 28
 
4.8%
S 28
 
4.8%
I 25
 
4.3%
Other values (14) 139
23.9%
Lowercase Letter
ValueCountFrequency (%)
o 121
18.5%
n 59
 
9.0%
e 56
 
8.6%
i 54
 
8.3%
k 44
 
6.7%
s 43
 
6.6%
a 39
 
6.0%
b 31
 
4.7%
r 30
 
4.6%
u 26
 
4.0%
Other values (13) 151
23.1%
Decimal Number
ValueCountFrequency (%)
2 94
45.0%
1 90
43.1%
0 13
 
6.2%
3 5
 
2.4%
8 2
 
1.0%
9 2
 
1.0%
4 2
 
1.0%
7 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
& 27
34.6%
· 21
26.9%
. 10
 
12.8%
10
 
12.8%
@ 4
 
5.1%
/ 3
 
3.8%
, 2
 
2.6%
; 1
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 146
77.2%
[ 43
 
22.8%
Close Punctuation
ValueCountFrequency (%)
) 146
76.8%
] 44
 
23.2%
Space Separator
ValueCountFrequency (%)
442
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 108
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39326
87.6%
Han 3088
 
6.9%
Latin 1235
 
2.8%
Common 1218
 
2.7%
Katakana 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3195
 
8.1%
1425
 
3.6%
1083
 
2.8%
946
 
2.4%
923
 
2.3%
817
 
2.1%
676
 
1.7%
643
 
1.6%
637
 
1.6%
609
 
1.5%
Other values (609) 28372
72.1%
Han
ValueCountFrequency (%)
493
 
16.0%
214
 
6.9%
94
 
3.0%
94
 
3.0%
90
 
2.9%
72
 
2.3%
68
 
2.2%
57
 
1.8%
55
 
1.8%
49
 
1.6%
Other values (273) 1802
58.4%
Latin
ValueCountFrequency (%)
o 121
 
9.8%
B 95
 
7.7%
M 64
 
5.2%
n 59
 
4.8%
e 56
 
4.5%
i 54
 
4.4%
H 48
 
3.9%
P 47
 
3.8%
k 44
 
3.6%
K 43
 
3.5%
Other values (37) 604
48.9%
Common
ValueCountFrequency (%)
442
36.3%
( 146
 
12.0%
) 146
 
12.0%
_ 108
 
8.9%
2 94
 
7.7%
1 90
 
7.4%
] 44
 
3.6%
[ 43
 
3.5%
& 27
 
2.2%
· 21
 
1.7%
Other values (13) 57
 
4.7%
Katakana
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39323
87.6%
CJK 3072
 
6.8%
ASCII 2422
 
5.4%
None 31
 
0.1%
CJK Compat Ideographs 16
 
< 0.1%
Katakana 6
 
< 0.1%
Compat Jamo 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3195
 
8.1%
1425
 
3.6%
1083
 
2.8%
946
 
2.4%
923
 
2.3%
817
 
2.1%
676
 
1.7%
643
 
1.6%
637
 
1.6%
609
 
1.5%
Other values (607) 28369
72.1%
CJK
ValueCountFrequency (%)
493
 
16.0%
214
 
7.0%
94
 
3.1%
94
 
3.1%
90
 
2.9%
72
 
2.3%
68
 
2.2%
57
 
1.9%
55
 
1.8%
49
 
1.6%
Other values (267) 1786
58.1%
ASCII
ValueCountFrequency (%)
442
18.2%
( 146
 
6.0%
) 146
 
6.0%
o 121
 
5.0%
_ 108
 
4.5%
B 95
 
3.9%
2 94
 
3.9%
1 90
 
3.7%
M 64
 
2.6%
n 59
 
2.4%
Other values (58) 1057
43.6%
None
ValueCountFrequency (%)
· 21
67.7%
10
32.3%
CJK Compat Ideographs
ValueCountFrequency (%)
10
62.5%
2
 
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Compat Jamo
ValueCountFrequency (%)
2
66.7%
1
33.3%
Katakana
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Distinct72
Distinct (%)0.7%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-21T04:30:20.423105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length4
Mean length4.0008001
Min length3

Characters and Unicode

Total characters40004
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)0.1%

Sample

1st row2011
2nd row1992
3rd row2004
4th row1984
5th row1988
ValueCountFrequency (%)
1988 542
 
5.4%
1986 474
 
4.7%
1989 465
 
4.7%
1987 406
 
4.1%
1990 395
 
4.0%
2009 331
 
3.3%
2021 327
 
3.3%
1991 314
 
3.1%
1993 299
 
3.0%
2010 292
 
2.9%
Other values (57) 6154
61.5%
2024-04-21T04:30:21.416322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 8541
21.4%
1 8435
21.1%
0 7560
18.9%
2 6570
16.4%
8 4001
10.0%
7 1273
 
3.2%
6 1031
 
2.6%
3 1021
 
2.6%
4 816
 
2.0%
5 747
 
1.9%
Other values (3) 9
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 39995
> 99.9%
Open Punctuation 4
 
< 0.1%
Close Punctuation 4
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 8541
21.4%
1 8435
21.1%
0 7560
18.9%
2 6570
16.4%
8 4001
10.0%
7 1273
 
3.2%
6 1031
 
2.6%
3 1021
 
2.6%
4 816
 
2.0%
5 747
 
1.9%
Open Punctuation
ValueCountFrequency (%)
[ 4
100.0%
Close Punctuation
ValueCountFrequency (%)
] 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40004
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
9 8541
21.4%
1 8435
21.1%
0 7560
18.9%
2 6570
16.4%
8 4001
10.0%
7 1273
 
3.2%
6 1031
 
2.6%
3 1021
 
2.6%
4 816
 
2.0%
5 747
 
1.9%
Other values (3) 9
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40004
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 8541
21.4%
1 8435
21.1%
0 7560
18.9%
2 6570
16.4%
8 4001
10.0%
7 1273
 
3.2%
6 1031
 
2.6%
3 1021
 
2.6%
4 816
 
2.0%
5 747
 
1.9%
Other values (3) 9
 
< 0.1%

Missing values

2024-04-21T04:30:08.072381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T04:30:08.391026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-21T04:30:08.684346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

등록번호서명저작자발행자발행년도
17921IM0000020309꿈꾸는 씨앗_영혼을 위한 101가지 이야기브라이언 카바노프지음 ; 주연진 옮김동해출판2011
8159IM0000008326主觀式 勞動法演習金致善博英社1992
26076IM0000028565세라 이야기프랜시스 호즈슨 버넷 글 ; 타샤 튜더 그림 ; 햇살과 나무꾼 옮김시공주니어2004
30433PA0000000252全南水産全羅南道 編.전라남도1984
7331IM0000007262서양문화사문석홍 지음서울대학교출판부1988
25349IM0000027837은행의 사생활_서민들만 모르는 은행거래의 비밀박혜정 지음다산북스2009
22164IM0000024642우리가 있기에 내가 있습니다_world culture open홍석현쌤앤파커스2016
29515IM0000032046살인자의 쇼핑목록강지영 지음네오픽션2022
4989IM0000004810현대사상과비평. 7김형석 지음민주원사1981
29794IM0000032325골린이 4주 만에 필드 나가기_골프장 부킹부터 용품, 스윙 방법, 점수 계산까지김정락황금부엉이2022
등록번호서명저작자발행자발행년도
9268IM0000010610革命이데올로기와 葛藤_ 思想·運動·體系·歷史金永俊 著亞細亞文化社1982
21093IM0000023547마르크스의 역사적 유물론과 인간론김창호 지음竹山1991
9198IM0000010482자유의조건. 2아르만도 발라다레스 저 ; 정성호 역나남1987
9014IM0000010247성냥갑속의 여자신달자 지음자유문학사1993
21007IM0000023458잠 도둑들스탠리 코렌 지음 ; 안인희 옮김황금가지1997
3082IM0000002847외국의 지방재정조정제도, 연구자료집한국지방행정연구원 지음한국지방행정연구원1997
30264PA0000000083(2012년도) 교육훈련계획. 2012전라남도지방공무원교육원 편전라남도지방공무원교육원2012
4816IM0000004635국역동문선. 4민족문화추진회 지음민족문화추진회1986
5737IM0000005625대한국사. 12이경근 지음신태양사1973
20445IM0000022881(구조조정기의)국가와 노동최영기, 이장원 편저나무와 숲1998