Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 556.6 KiB
Average record size in memory: 57.0 B
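The dataset statistics above can be cross-checked with a few lines of arithmetic. The sketch below is a minimal illustration, not the profiler's own implementation: `dataset_statistics` and `toy_rows` are hypothetical names, the rows are made up, and counting every repeat after the first occurrence as a duplicate is an assumption.

```python
from collections import Counter


def dataset_statistics(rows):
    """Summarise a list of equal-length row tuples; None marks a missing cell."""
    n_obs = len(rows)
    n_vars = len(rows[0]) if rows else 0
    n_cells = n_obs * n_vars
    missing = sum(cell is None for row in rows for cell in row)
    # Assumption: a duplicate is any repeat of a row after its first occurrence.
    duplicates = sum(count - 1 for count in Counter(rows).values())
    return {
        "variables": n_vars,
        "observations": n_obs,
        "missing_cells": missing,
        "missing_cells_pct": 100 * missing / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicates,
        "duplicate_rows_pct": 100 * duplicates / n_obs if n_obs else 0.0,
    }


# Hypothetical toy rows, not taken from the dataset.
toy_rows = [
    (1, "A001", "Title one"),
    (2, "A002", None),
    (2, "A002", None),
    (3, "A003", "Title three"),
]
stats = dataset_statistics(toy_rows)

# Average record size from the reported figures (1 KiB = 1024 bytes).
average_record_size = 556.6 * 1024 / 10000
print(stats)
print(f"{average_record_size:.1f} B")
```

With the reported total of 556.6 KiB spread over 10000 observations, the last line reproduces the 57.0 B average record size.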

Variable types

Numeric: 1
Text: 5

Dataset

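Description: 부산광역시연제구_자료관도서목록_20230411 (Yeonje-gu, Busan Metropolitan City: archive book list, 2023-04-11)
Author: 부산광역시 연제구 (Yeonje-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15048049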

Alerts

번호 has unique values (Unique)
등록번호 has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 16:13:03.882830
Analysis finished: 2023-12-10 16:13:06.940320
Duration: 3.06 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 46015.229
Minimum: 4
Maximum: 91789
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 4
5th percentile: 4608.75
Q1: 23296.25
Median: 45744
Q3: 69115.25
95th percentile: 87440.45
Maximum: 91789
Range: 91785
Interquartile range (IQR): 45819
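The quantile statistics are linked by simple identities: Range = Maximum − Minimum and IQR = Q3 − Q1. The sketch below illustrates them with a percentile function that uses linear interpolation between the closest ranks. That interpolation rule is an assumption (it is consistent with the fractional values reported above, but the report does not state it), and `percentile` and `toy` are hypothetical names. The toy data are the ten smallest 번호 values listed later in this report, not the full column.

```python
import math


def percentile(sorted_values, q):
    """q-th percentile (0-100) by linear interpolation between closest ranks."""
    if not sorted_values:
        raise ValueError("need at least one value")
    pos = (len(sorted_values) - 1) * q / 100
    lo = math.floor(pos)
    hi = math.ceil(pos)
    frac = pos - lo
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * frac


# Ten smallest 번호 values from the report, used only as toy input.
toy = sorted([4, 12, 18, 24, 25, 28, 38, 42, 47, 64])

q1 = percentile(toy, 25)
median = percentile(toy, 50)
q3 = percentile(toy, 75)
iqr = q3 - q1
value_range = toy[-1] - toy[0]
print(q1, median, q3, iqr, value_range)

# The same identities applied to the reported figures.
reported_iqr = 69115.25 - 23296.25   # Q3 - Q1
reported_range = 91789 - 4           # Maximum - Minimum
print(reported_iqr, reported_range)
```

Applied to the reported quartiles, the two identities give 45819 and 91785, matching the IQR and Range lines above.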

Descriptive statistics

Standard deviation: 26456.761
Coefficient of variation (CV): 0.57495662
Kurtosis: -1.1951469
Mean: 46015.229
Median Absolute Deviation (MAD): 22927
Skewness: 0.00012559612
Sum: 4.6015229 × 10^8
Variance: 6.9996018 × 10^8
Monotonicity: Not monotonic
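Several of the descriptive statistics follow from one another: CV is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below checks those relations on the reported figures and computes the same measures for a hypothetical toy column. `describe` and `toy` are illustrative names, and the skewness and kurtosis here use plain population moments, which is an assumption; a profiler may apply bias corrections that differ slightly.

```python
import statistics


def describe(values):
    """Mean, sample standard deviation, CV, skewness and excess kurtosis."""
    n = len(values)
    mean = statistics.fmean(values)
    std = statistics.stdev(values)  # sample standard deviation (n - 1)
    # Assumption: uncorrected population moments for skewness and kurtosis.
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    m4 = sum((x - mean) ** 4 for x in values) / n
    return {
        "mean": mean,
        "std": std,
        "cv": std / mean,
        "skewness": m3 / m2 ** 1.5,
        "kurtosis": m4 / m2 ** 2 - 3,  # excess kurtosis
    }


# Hypothetical evenly spread identifiers standing in for the real column.
toy = list(range(1, 1001))
summary = describe(toy)
print(summary)

# Cross-checks on the reported figures.
reported_mean, reported_std, n_rows = 46015.229, 26456.761, 10000
cv = reported_std / reported_mean   # compare with the reported CV
variance = reported_std ** 2        # compare with the reported variance
total = reported_mean * n_rows      # compare with the reported sum
print(cv, variance, total)
```

Evenly spread values give a skewness near 0 and an excess kurtosis near -1.2, which is consistent with the reported skewness and kurtosis of 번호.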
Histogram with fixed size bins (bins=50)
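A histogram with fixed size bins splits the interval from the minimum to the maximum into equal-width slices and counts the values in each. The following is a minimal sketch of that idea, assuming the last bin is closed on the right so the maximum is counted; `fixed_width_histogram` is a hypothetical helper and the toy values are not from the dataset.

```python
def fixed_width_histogram(values, bins=50):
    """Return (edges, counts) for equal-width bins spanning min..max."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # Degenerate case: every value is identical, so use a single bin.
        return [lo, hi], [len(values)]
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # Clamp so the maximum lands in the last bin rather than past it.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    edges = [lo + i * width for i in range(bins + 1)]
    return edges, counts


# Hypothetical toy values: 0..99 split into 10 bins.
edges, counts = fixed_width_histogram(list(range(100)), bins=10)
print(counts)

# Bin width implied by the reported minimum, maximum and bins=50.
bin_width = (91789 - 4) / 50
print(bin_width)
```

For 번호, the reported range of 91785 divided into 50 bins gives a width of about 1835.7 per bin.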
Common values

Value | Count | Frequency (%)
75892 | 1 | < 0.1%
90223 | 1 | < 0.1%
48117 | 1 | < 0.1%
26892 | 1 | < 0.1%
13869 | 1 | < 0.1%
66604 | 1 | < 0.1%
72619 | 1 | < 0.1%
39497 | 1 | < 0.1%
71676 | 1 | < 0.1%
4346 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
4 | 1 | < 0.1%
12 | 1 | < 0.1%
18 | 1 | < 0.1%
24 | 1 | < 0.1%
25 | 1 | < 0.1%
28 | 1 | < 0.1%
38 | 1 | < 0.1%
42 | 1 | < 0.1%
47 | 1 | < 0.1%
64 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
91789 | 1 | < 0.1%
91783 | 1 | < 0.1%
91779 | 1 | < 0.1%
91773 | 1 | < 0.1%
91762 | 1 | < 0.1%
91747 | 1 | < 0.1%
91744 | 1 | < 0.1%
91739 | 1 | < 0.1%
91737 | 1 | < 0.1%
91735 | 1 | < 0.1%

등록번호
Text

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 120000
Distinct characters: 13
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
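As an illustration of how character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using the five 등록번호 sample values from this report. `category_counts` is a hypothetical helper, not the profiler's own code. The standard module reports two-letter codes (`Lu` for Uppercase Letter, `Nd` for Decimal Number) and does not expose scripts or blocks, so only categories are shown.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over all characters in the values."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)


# The five 등록번호 sample values shown in this report.
samples = [
    "ABN000063728",
    "ABN000121630",
    "ABN000090195",
    "ABN000054975",
    "ABN000072929",
]
counts = category_counts(samples)
total = sum(counts.values())
for code, count in counts.most_common():
    print(code, count, f"{100 * count / total:.1f}%")
```

Each value has three uppercase letters and nine digits, so the shares come out at 25% and 75%, in line with the category table for this variable.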

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: ABN000063728
2nd row: ABN000121630
3rd row: ABN000090195
4th row: ABN000054975
5th row: ABN000072929
Value | Count | Frequency (%)
abn000063728 | 1 | < 0.1%
abn000104306 | 1 | < 0.1%
abn000054846 | 1 | < 0.1%
abn000108452 | 1 | < 0.1%
abn000095053 | 1 | < 0.1%
abn000122026 | 1 | < 0.1%
abn000138335 | 1 | < 0.1%
abn000073421 | 1 | < 0.1%
abn000067208 | 1 | < 0.1%
abn000048766 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%
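The values in this table are lowercased (`abn000063728`, while the sample rows show `ABN000063728`), which suggests the words are normalised before counting. The sketch below assumes lowercasing followed by a split on whitespace; that rule is inferred from the table, not confirmed by the report, and `word_counts` is a hypothetical helper.

```python
from collections import Counter


def word_counts(values):
    """Count whitespace-separated tokens after lowercasing each value."""
    counts = Counter()
    for value in values:
        counts.update(value.lower().split())
    return counts


# The five 등록번호 sample values shown in this report.
samples = [
    "ABN000063728",
    "ABN000121630",
    "ABN000090195",
    "ABN000054975",
    "ABN000072929",
]
counts = word_counts(samples)
total = sum(counts.values())
for token, count in counts.most_common(3):
    print(token, count, f"{100 * count / total:.1f}%")
```

Because every 등록번호 is a single token without spaces, each word occurs once, which is why every listed row has a count of 1.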

Most occurring characters

ValueCountFrequency (%)
0 40292
33.6%
A 10000
 
8.3%
B 10000
 
8.3%
N 10000
 
8.3%
1 9389
 
7.8%
5 5407
 
4.5%
4 5127
 
4.3%
8 5049
 
4.2%
6 5040
 
4.2%
3 5030
 
4.2%
Other values (3) 14666
 
12.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 90000
75.0%
Uppercase Letter 30000
 
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 40292
44.8%
1 9389
 
10.4%
5 5407
 
6.0%
4 5127
 
5.7%
8 5049
 
5.6%
6 5040
 
5.6%
3 5030
 
5.6%
2 4953
 
5.5%
9 4861
 
5.4%
7 4852
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
A 10000
33.3%
B 10000
33.3%
N 10000
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common 90000
75.0%
Latin 30000
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 40292
44.8%
1 9389
 
10.4%
5 5407
 
6.0%
4 5127
 
5.7%
8 5049
 
5.6%
6 5040
 
5.6%
3 5030
 
5.6%
2 4953
 
5.5%
9 4861
 
5.4%
7 4852
 
5.4%
Latin
ValueCountFrequency (%)
A 10000
33.3%
B 10000
33.3%
N 10000
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 40292
33.6%
A 10000
 
8.3%
B 10000
 
8.3%
N 10000
 
8.3%
1 9389
 
7.8%
5 5407
 
4.5%
4 5127
 
4.3%
8 5049
 
4.2%
6 5040
 
4.2%
3 5030
 
4.2%
Other values (3) 14666
 
12.2%

서명
Text

Distinct: 9858
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 302
Median length: 92
Mean length: 23.7966
Min length: 1

Characters and Unicode

Total characters: 237966
Distinct characters: 1608
Distinct categories: 17
Distinct scripts: 5
Distinct blocks: 12
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9720
Unique (%): 97.2%
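Distinct and Unique measure different things: Distinct counts how many different values occur, while Unique counts the values that occur exactly once. A minimal sketch of the distinction follows, with a hypothetical helper `distinct_and_unique` and made-up toy titles; the last lines apply the same reasoning to the figures reported for 서명.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Hypothetical toy titles: "B" appears twice and "C" three times.
titles = ["A", "B", "B", "C", "C", "C", "D"]
distinct, unique = distinct_and_unique(titles)
print(distinct, unique)

# Figures reported for 서명: 10000 rows, 9858 distinct, 9720 unique.
repeated_values = 9858 - 9720    # titles that occur more than once
rows_with_repeats = 10000 - 9720  # rows holding those repeated titles
print(repeated_values, rows_with_repeats)
```

For 서명 this means 138 titles appear more than once, and together they account for 280 of the 10000 rows.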

Sample

1st row: 밤을 건너는 소년
2nd row: 천 개의 죽음이 내게 말해준 것들
3rd row: 쉬엄쉬엄 가도 괜찮아요
4th row: 졸업선물 : 성공이 아닌 성장을 위한 이야기
5th row: 한국 산문선. 3, 위험한 백성
ValueCountFrequency (%)
4394
 
7.3%
이야기 439
 
0.7%
1 301
 
0.5%
위한 293
 
0.5%
2 291
 
0.5%
221
 
0.4%
the 172
 
0.3%
172
 
0.3%
우리 163
 
0.3%
나는 150
 
0.2%
Other values (24566) 53885
89.1%

Most occurring characters

ValueCountFrequency (%)
50530
 
21.2%
4439
 
1.9%
: 4223
 
1.8%
4138
 
1.7%
3509
 
1.5%
2378
 
1.0%
, 2232
 
0.9%
2183
 
0.9%
2133
 
0.9%
2076
 
0.9%
Other values (1598) 160125
67.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 148539
62.4%
Space Separator 50530
 
21.2%
Lowercase Letter 16863
 
7.1%
Other Punctuation 9764
 
4.1%
Decimal Number 4593
 
1.9%
Uppercase Letter 2347
 
1.0%
Open Punctuation 2245
 
0.9%
Close Punctuation 2245
 
0.9%
Math Symbol 621
 
0.3%
Dash Punctuation 170
 
0.1%
Other values (7) 49
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4439
 
3.0%
4138
 
2.8%
3509
 
2.4%
2378
 
1.6%
2183
 
1.5%
2133
 
1.4%
2076
 
1.4%
1974
 
1.3%
1964
 
1.3%
1960
 
1.3%
Other values (1474) 121785
82.0%
Lowercase Letter
ValueCountFrequency (%)
e 1966
11.7%
o 1515
 
9.0%
a 1419
 
8.4%
i 1339
 
7.9%
t 1249
 
7.4%
n 1197
 
7.1%
r 1142
 
6.8%
s 1096
 
6.5%
l 734
 
4.4%
h 723
 
4.3%
Other values (17) 4483
26.6%
Uppercase Letter
ValueCountFrequency (%)
S 268
 
11.4%
T 229
 
9.8%
C 140
 
6.0%
M 134
 
5.7%
B 130
 
5.5%
P 123
 
5.2%
D 118
 
5.0%
W 114
 
4.9%
A 113
 
4.8%
G 107
 
4.6%
Other values (16) 871
37.1%
Other Punctuation
ValueCountFrequency (%)
: 4223
43.3%
, 2232
22.9%
. 1458
 
14.9%
! 665
 
6.8%
? 473
 
4.8%
· 284
 
2.9%
' 260
 
2.7%
; 34
 
0.3%
& 29
 
0.3%
" 22
 
0.2%
Other values (11) 84
 
0.9%
Decimal Number
ValueCountFrequency (%)
1 1163
25.3%
0 903
19.7%
2 825
18.0%
3 394
 
8.6%
5 331
 
7.2%
4 283
 
6.2%
6 200
 
4.4%
9 187
 
4.1%
8 157
 
3.4%
7 150
 
3.3%
Math Symbol
ValueCountFrequency (%)
= 522
84.1%
~ 47
 
7.6%
+ 18
 
2.9%
× 9
 
1.4%
< 8
 
1.3%
> 8
 
1.3%
| 5
 
0.8%
2
 
0.3%
1
 
0.2%
1
 
0.2%
Other Number
ValueCountFrequency (%)
2
28.6%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Open Punctuation
ValueCountFrequency (%)
( 1739
77.5%
[ 476
 
21.2%
22
 
1.0%
5
 
0.2%
3
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1739
77.5%
] 476
 
21.2%
22
 
1.0%
5
 
0.2%
3
 
0.1%
Letter Number
ValueCountFrequency (%)
10
41.7%
6
25.0%
4
 
16.7%
4
 
16.7%
Other Symbol
ValueCountFrequency (%)
5
62.5%
2
 
25.0%
1
 
12.5%
Modifier Symbol
ValueCountFrequency (%)
´ 2
66.7%
` 1
33.3%
Space Separator
ValueCountFrequency (%)
50530
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 170
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Final Punctuation
ValueCountFrequency (%)
3
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 148313
62.3%
Common 70193
29.5%
Latin 19231
 
8.1%
Han 226
 
0.1%
Greek 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4439
 
3.0%
4138
 
2.8%
3509
 
2.4%
2378
 
1.6%
2183
 
1.5%
2133
 
1.4%
2076
 
1.4%
1974
 
1.3%
1964
 
1.3%
1960
 
1.3%
Other values (1317) 121559
82.0%
Han
ValueCountFrequency (%)
11
 
4.9%
7
 
3.1%
6
 
2.7%
6
 
2.7%
4
 
1.8%
4
 
1.8%
4
 
1.8%
4
 
1.8%
4
 
1.8%
3
 
1.3%
Other values (147) 173
76.5%
Common
ValueCountFrequency (%)
50530
72.0%
: 4223
 
6.0%
, 2232
 
3.2%
( 1739
 
2.5%
) 1739
 
2.5%
. 1458
 
2.1%
1 1163
 
1.7%
0 903
 
1.3%
2 825
 
1.2%
! 665
 
0.9%
Other values (57) 4716
 
6.7%
Latin
ValueCountFrequency (%)
e 1966
 
10.2%
o 1515
 
7.9%
a 1419
 
7.4%
i 1339
 
7.0%
t 1249
 
6.5%
n 1197
 
6.2%
r 1142
 
5.9%
s 1096
 
5.7%
l 734
 
3.8%
h 723
 
3.8%
Other values (46) 6851
35.6%
Greek
ValueCountFrequency (%)
π 3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 148297
62.3%
ASCII 88974
37.4%
None 401
 
0.2%
CJK 219
 
0.1%
Number Forms 24
 
< 0.1%
Compat Jamo 16
 
< 0.1%
Punctuation 12
 
< 0.1%
Misc Symbols 7
 
< 0.1%
CJK Compat Ideographs 7
 
< 0.1%
Enclosed Alphanum 6
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
50530
56.8%
: 4223
 
4.7%
, 2232
 
2.5%
e 1966
 
2.2%
( 1739
 
2.0%
) 1739
 
2.0%
o 1515
 
1.7%
. 1458
 
1.6%
a 1419
 
1.6%
i 1339
 
1.5%
Other values (79) 20814
23.4%
Hangul
ValueCountFrequency (%)
4439
 
3.0%
4138
 
2.8%
3509
 
2.4%
2378
 
1.6%
2183
 
1.5%
2133
 
1.4%
2076
 
1.4%
1974
 
1.3%
1964
 
1.3%
1960
 
1.3%
Other values (1309) 121543
82.0%
None
ValueCountFrequency (%)
· 284
70.8%
22
 
5.5%
22
 
5.5%
20
 
5.0%
16
 
4.0%
× 9
 
2.2%
5
 
1.2%
5
 
1.2%
3
 
0.7%
π 3
 
0.7%
Other values (9) 12
 
3.0%
CJK
ValueCountFrequency (%)
11
 
5.0%
7
 
3.2%
6
 
2.7%
6
 
2.7%
4
 
1.8%
4
 
1.8%
4
 
1.8%
4
 
1.8%
4
 
1.8%
3
 
1.4%
Other values (140) 166
75.8%
Number Forms
ValueCountFrequency (%)
10
41.7%
6
25.0%
4
 
16.7%
4
 
16.7%
Punctuation
ValueCountFrequency (%)
8
66.7%
3
 
25.0%
1
 
8.3%
Misc Symbols
ValueCountFrequency (%)
5
71.4%
2
 
28.6%
Compat Jamo
ValueCountFrequency (%)
3
18.8%
3
18.8%
3
18.8%
2
12.5%
2
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Enclosed Alphanum
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Arrows
ValueCountFrequency (%)
2
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct: 8907
Distinct (%): 89.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 156
Median length: 111
Mean length: 15.9349
Min length: 2

Characters and Unicode

Total characters: 159349
Distinct characters: 1034
Distinct categories: 12
Distinct scripts: 5
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 8169
Unique (%): 81.7%

Sample

1st row: 최양선 지음
2nd row: 고칸 메구미 지음 ; 오시연 옮김
3rd row: 서정홍 지음
4th row: 신영준 글 ; 서동민 그림
5th row: 이종묵, 장유승 [공]편역
ValueCountFrequency (%)
6815
 
14.6%
지음 5649
 
12.1%
옮김 3020
 
6.5%
그림 2511
 
5.4%
2020
 
4.3%
글·그림 606
 
1.3%
by 572
 
1.2%
공]지음 385
 
0.8%
242
 
0.5%
illustrated 165
 
0.4%
Other values (13489) 24708
52.9%

Most occurring characters

ValueCountFrequency (%)
36764
23.1%
7013
 
4.4%
; 6812
 
4.3%
6298
 
4.0%
5550
 
3.5%
3430
 
2.2%
3325
 
2.1%
3124
 
2.0%
3117
 
2.0%
2904
 
1.8%
Other values (1024) 81012
50.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 97677
61.3%
Space Separator 36764
 
23.1%
Lowercase Letter 11305
 
7.1%
Other Punctuation 9399
 
5.9%
Uppercase Letter 1917
 
1.2%
Open Punctuation 1082
 
0.7%
Close Punctuation 1080
 
0.7%
Dash Punctuation 56
 
< 0.1%
Decimal Number 52
 
< 0.1%
Math Symbol 14
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7013
 
7.2%
6298
 
6.4%
5550
 
5.7%
3430
 
3.5%
3325
 
3.4%
3124
 
3.2%
3117
 
3.2%
2904
 
3.0%
1573
 
1.6%
1545
 
1.6%
Other values (936) 59798
61.2%
Lowercase Letter
ValueCountFrequency (%)
a 1131
10.0%
e 1117
9.9%
t 910
 
8.0%
r 896
 
7.9%
l 893
 
7.9%
i 856
 
7.6%
y 819
 
7.2%
n 775
 
6.9%
b 650
 
5.7%
o 619
 
5.5%
Other values (16) 2639
23.3%
Uppercase Letter
ValueCountFrequency (%)
M 179
 
9.3%
J 167
 
8.7%
S 148
 
7.7%
A 138
 
7.2%
B 124
 
6.5%
C 124
 
6.5%
K 113
 
5.9%
R 112
 
5.8%
D 89
 
4.6%
L 87
 
4.5%
Other values (16) 636
33.2%
Other Punctuation
ValueCountFrequency (%)
; 6812
72.5%
, 1260
 
13.4%
· 721
 
7.7%
. 510
 
5.4%
: 72
 
0.8%
& 8
 
0.1%
? 4
 
< 0.1%
4
 
< 0.1%
/ 3
 
< 0.1%
" 2
 
< 0.1%
Other values (2) 3
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 12
23.1%
1 10
19.2%
3 8
15.4%
0 7
13.5%
4 4
 
7.7%
7 3
 
5.8%
6 3
 
5.8%
9 2
 
3.8%
5 2
 
3.8%
8 1
 
1.9%
Open Punctuation
ValueCountFrequency (%)
[ 1068
98.7%
( 9
 
0.8%
3
 
0.3%
2
 
0.2%
Close Punctuation
ValueCountFrequency (%)
] 1066
98.7%
) 9
 
0.8%
3
 
0.3%
2
 
0.2%
Math Symbol
ValueCountFrequency (%)
< 7
50.0%
> 7
50.0%
Space Separator
ValueCountFrequency (%)
36764
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 56
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 97647
61.3%
Common 48450
30.4%
Latin 13222
 
8.3%
Han 28
 
< 0.1%
Katakana 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7013
 
7.2%
6298
 
6.4%
5550
 
5.7%
3430
 
3.5%
3325
 
3.4%
3124
 
3.2%
3117
 
3.2%
2904
 
3.0%
1573
 
1.6%
1545
 
1.6%
Other values (912) 59768
61.2%
Latin
ValueCountFrequency (%)
a 1131
 
8.6%
e 1117
 
8.4%
t 910
 
6.9%
r 896
 
6.8%
l 893
 
6.8%
i 856
 
6.5%
y 819
 
6.2%
n 775
 
5.9%
b 650
 
4.9%
o 619
 
4.7%
Other values (42) 4556
34.5%
Common
ValueCountFrequency (%)
36764
75.9%
; 6812
 
14.1%
, 1260
 
2.6%
[ 1068
 
2.2%
] 1066
 
2.2%
· 721
 
1.5%
. 510
 
1.1%
: 72
 
0.1%
- 56
 
0.1%
2 12
 
< 0.1%
Other values (26) 109
 
0.2%
Han
ValueCountFrequency (%)
4
 
14.3%
2
 
7.1%
2
 
7.1%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
Other values (13) 13
46.4%
Katakana
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 97640
61.3%
ASCII 60933
38.2%
None 737
 
0.5%
CJK 27
 
< 0.1%
Compat Jamo 7
 
< 0.1%
Katakana 2
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
36764
60.3%
; 6812
 
11.2%
, 1260
 
2.1%
a 1131
 
1.9%
e 1117
 
1.8%
[ 1068
 
1.8%
] 1066
 
1.7%
t 910
 
1.5%
r 896
 
1.5%
l 893
 
1.5%
Other values (70) 9016
 
14.8%
Hangul
ValueCountFrequency (%)
7013
 
7.2%
6298
 
6.5%
5550
 
5.7%
3430
 
3.5%
3325
 
3.4%
3124
 
3.2%
3117
 
3.2%
2904
 
3.0%
1573
 
1.6%
1545
 
1.6%
Other values (908) 59761
61.2%
None
ValueCountFrequency (%)
· 721
97.8%
4
 
0.5%
3
 
0.4%
3
 
0.4%
2
 
0.3%
2
 
0.3%
2
 
0.3%
Compat Jamo
ValueCountFrequency (%)
4
57.1%
1
 
14.3%
1
 
14.3%
1
 
14.3%
CJK
ValueCountFrequency (%)
4
 
14.8%
2
 
7.4%
2
 
7.4%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
Other values (12) 12
44.4%
Katakana
ValueCountFrequency (%)
2
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
2
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

발행자
Text

Distinct: 2859
Distinct (%): 28.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 67
Median length: 47
Mean length: 4.9513
Min length: 1

Characters and Unicode

Total characters: 49513
Distinct characters: 777
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1524
Unique (%): 15.2%

Sample

1st row: 사계절
2nd row: 웅진지식하우스:웅진씽크빅
3rd row: 단비
4th row: 로크미디어
5th row: 민음사
Value | Count | Frequency (%)
문학동네 | 177 | 1.7%
창비 | 158 | 1.5%
위즈덤하우스 | 109 | 1.0%
비룡소 | 100 | 0.9%
books | 97 | 0.9%
서울문화사 | 89 | 0.8%
김영사 | 75 | 0.7%
사계절 | 64 | 0.6%
민음사 | 64 | 0.6%
자음과모음 | 63 | 0.6%
Other values (2914) | 9652 | 90.6%

Most occurring characters

ValueCountFrequency (%)
1716
 
3.5%
1351
 
2.7%
1349
 
2.7%
1287
 
2.6%
876
 
1.8%
775
 
1.6%
753
 
1.5%
o 735
 
1.5%
648
 
1.3%
622
 
1.3%
Other values (767) 39401
79.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41094
83.0%
Lowercase Letter 5360
 
10.8%
Uppercase Letter 1413
 
2.9%
Other Punctuation 705
 
1.4%
Space Separator 648
 
1.3%
Decimal Number 141
 
0.3%
Close Punctuation 64
 
0.1%
Open Punctuation 64
 
0.1%
Dash Punctuation 23
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1716
 
4.2%
1351
 
3.3%
1349
 
3.3%
1287
 
3.1%
876
 
2.1%
775
 
1.9%
753
 
1.8%
622
 
1.5%
617
 
1.5%
540
 
1.3%
Other values (689) 31208
75.9%
Uppercase Letter
ValueCountFrequency (%)
B 239
16.9%
H 118
 
8.4%
S 116
 
8.2%
P 105
 
7.4%
C 93
 
6.6%
R 91
 
6.4%
M 82
 
5.8%
A 76
 
5.4%
K 69
 
4.9%
L 67
 
4.7%
Other values (16) 357
25.3%
Lowercase Letter
ValueCountFrequency (%)
o 735
13.7%
e 519
 
9.7%
s 488
 
9.1%
r 438
 
8.2%
i 378
 
7.1%
n 374
 
7.0%
a 345
 
6.4%
l 280
 
5.2%
k 255
 
4.8%
t 242
 
4.5%
Other values (15) 1306
24.4%
Other Punctuation
ValueCountFrequency (%)
: 571
81.0%
& 38
 
5.4%
' 36
 
5.1%
· 18
 
2.6%
. 15
 
2.1%
9
 
1.3%
# 7
 
1.0%
, 7
 
1.0%
; 3
 
0.4%
" 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
2 59
41.8%
1 57
40.4%
0 7
 
5.0%
3 7
 
5.0%
5 3
 
2.1%
6 2
 
1.4%
4 2
 
1.4%
8 2
 
1.4%
9 1
 
0.7%
7 1
 
0.7%
Close Punctuation
ValueCountFrequency (%)
) 63
98.4%
] 1
 
1.6%
Open Punctuation
ValueCountFrequency (%)
( 63
98.4%
[ 1
 
1.6%
Space Separator
ValueCountFrequency (%)
648
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 23
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41057
82.9%
Latin 6773
 
13.7%
Common 1646
 
3.3%
Han 37
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1716
 
4.2%
1351
 
3.3%
1349
 
3.3%
1287
 
3.1%
876
 
2.1%
775
 
1.9%
753
 
1.8%
622
 
1.5%
617
 
1.5%
540
 
1.3%
Other values (663) 31171
75.9%
Latin
ValueCountFrequency (%)
o 735
 
10.9%
e 519
 
7.7%
s 488
 
7.2%
r 438
 
6.5%
i 378
 
5.6%
n 374
 
5.5%
a 345
 
5.1%
l 280
 
4.1%
k 255
 
3.8%
t 242
 
3.6%
Other values (41) 2719
40.1%
Common
ValueCountFrequency (%)
648
39.4%
: 571
34.7%
) 63
 
3.8%
( 63
 
3.8%
2 59
 
3.6%
1 57
 
3.5%
& 38
 
2.3%
' 36
 
2.2%
- 23
 
1.4%
· 18
 
1.1%
Other values (17) 70
 
4.3%
Han
ValueCountFrequency (%)
5
 
13.5%
3
 
8.1%
3
 
8.1%
2
 
5.4%
2
 
5.4%
2
 
5.4%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
Other values (16) 16
43.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41057
82.9%
ASCII 8392
 
16.9%
CJK 37
 
0.1%
None 27
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1716
 
4.2%
1351
 
3.3%
1349
 
3.3%
1287
 
3.1%
876
 
2.1%
775
 
1.9%
753
 
1.8%
622
 
1.5%
617
 
1.5%
540
 
1.3%
Other values (663) 31171
75.9%
ASCII
ValueCountFrequency (%)
o 735
 
8.8%
648
 
7.7%
: 571
 
6.8%
e 519
 
6.2%
s 488
 
5.8%
r 438
 
5.2%
i 378
 
4.5%
n 374
 
4.5%
a 345
 
4.1%
l 280
 
3.3%
Other values (66) 3616
43.1%
None
ValueCountFrequency (%)
· 18
66.7%
9
33.3%
CJK
ValueCountFrequency (%)
5
 
13.5%
3
 
8.1%
3
 
8.1%
2
 
5.4%
2
 
5.4%
2
 
5.4%
1
 
2.7%
1
 
2.7%
1
 
2.7%
1
 
2.7%
Other values (16) 16
43.2%

청구기호
Text

Distinct: 9998
Distinct (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 19
Median length: 16
Mean length: 10.4287
Min length: 5

Characters and Unicode

Total characters: 104287
Distinct characters: 41
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9996
Unique (%): > 99.9%

Sample

1st row: 808.31-3-108
2nd row: 512.804-11
3rd row: 811.7-960
4th row: 818-563
5th row: 814.5-1-3
Value | Count | Frequency (%)
아동 | 2242 | 15.8%
그림책 | 920 | 6.5%
영어 | 337 | 2.4%
더책 | 234 | 1.6%
시니어 | 162 | 1.1%
큰글자 | 146 | 1.0%
mom | 80 | 0.6%
보드북 | 75 | 0.5%
참고 | 19 | 0.1%
점자 | 6 | < 0.1%
Other values (9944) | 10004 | 70.3%

Most occurring characters

ValueCountFrequency (%)
- 13123
12.6%
1 12530
12.0%
3 9709
9.3%
8 9607
9.2%
2 8442
 
8.1%
. 6097
 
5.8%
4 6093
 
5.8%
5 5151
 
4.9%
0 5025
 
4.8%
9 4808
 
4.6%
Other values (31) 23702
22.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 69701
66.8%
Dash Punctuation 13123
 
12.6%
Other Letter 9596
 
9.2%
Other Punctuation 6097
 
5.8%
Space Separator 4225
 
4.1%
Math Symbol 1298
 
1.2%
Uppercase Letter 243
 
0.2%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2243
23.4%
2243
23.4%
1154
12.0%
920
9.6%
920
9.6%
499
 
5.2%
337
 
3.5%
234
 
2.4%
162
 
1.7%
162
 
1.7%
Other values (12) 722
 
7.5%
Decimal Number
ValueCountFrequency (%)
1 12530
18.0%
3 9709
13.9%
8 9607
13.8%
2 8442
12.1%
4 6093
8.7%
5 5151
7.4%
0 5025
7.2%
9 4808
 
6.9%
7 4672
 
6.7%
6 3664
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
M 160
65.8%
O 80
32.9%
A 3
 
1.2%
Dash Punctuation
ValueCountFrequency (%)
- 13123
100.0%
Other Punctuation
ValueCountFrequency (%)
. 6097
100.0%
Space Separator
ValueCountFrequency (%)
4225
100.0%
Math Symbol
ValueCountFrequency (%)
= 1298
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 2
100.0%
Close Punctuation
ValueCountFrequency (%)
] 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 94448
90.6%
Hangul 9596
 
9.2%
Latin 243
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2243
23.4%
2243
23.4%
1154
12.0%
920
9.6%
920
9.6%
499
 
5.2%
337
 
3.5%
234
 
2.4%
162
 
1.7%
162
 
1.7%
Other values (12) 722
 
7.5%
Common
ValueCountFrequency (%)
- 13123
13.9%
1 12530
13.3%
3 9709
10.3%
8 9607
10.2%
2 8442
8.9%
. 6097
6.5%
4 6093
6.5%
5 5151
 
5.5%
0 5025
 
5.3%
9 4808
 
5.1%
Other values (6) 13863
14.7%
Latin
ValueCountFrequency (%)
M 160
65.8%
O 80
32.9%
A 3
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 94691
90.8%
Hangul 9596
 
9.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 13123
13.9%
1 12530
13.2%
3 9709
10.3%
8 9607
10.1%
2 8442
8.9%
. 6097
6.4%
4 6093
6.4%
5 5151
 
5.4%
0 5025
 
5.3%
9 4808
 
5.1%
Other values (9) 14106
14.9%
Hangul
ValueCountFrequency (%)
2243
23.4%
2243
23.4%
1154
12.0%
920
9.6%
920
9.6%
499
 
5.2%
337
 
3.5%
234
 
2.4%
162
 
1.7%
162
 
1.7%
Other values (12) 722
 
7.5%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
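Both views rest on the same underlying table of present and missing cells. The sketch below shows one minimal way to derive it, assuming missing cells are represented as `None`; `nullity_matrix`, `present_per_column`, and the toy rows are hypothetical and not taken from the dataset.

```python
def nullity_matrix(rows):
    """True where a cell is present, False where it is missing (None)."""
    return [[cell is not None for cell in row] for row in rows]


def present_per_column(rows, columns):
    """Number of non-missing cells in each column (the per-column bar heights)."""
    matrix = nullity_matrix(rows)
    return {name: sum(row[i] for row in matrix) for i, name in enumerate(columns)}


# Hypothetical toy rows using three of the report's column names.
columns = ["번호", "등록번호", "서명"]
toy_rows = [
    (1, "X001", "first title"),
    (2, None, "second title"),
    (3, "X003", None),
]
matrix = nullity_matrix(toy_rows)
present = present_per_column(toy_rows, columns)
print(matrix)
print(present)
```

In this dataset the report lists 0 missing cells, so every column would show the full 10000 observations and the matrix would contain no gaps.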

Sample

First rows

(row index) | 번호 | 등록번호 | 서명 | 저자 | 발행자 | 청구기호
75891 | 75892 | ABN000063728 | 밤을 건너는 소년 | 최양선 지음 | 사계절 | 808.31-3-108
27287 | 27288 | ABN000121630 | 천 개의 죽음이 내게 말해준 것들 | 고칸 메구미 지음 ; 오시연 옮김 | 웅진지식하우스:웅진씽크빅 | 512.804-11
51088 | 51089 | ABN000090195 | 쉬엄쉬엄 가도 괜찮아요 | 서정홍 지음 | 단비 | 811.7-960
84314 | 84315 | ABN000054975 | 졸업선물 : 성공이 아닌 성장을 위한 이야기 | 신영준 글 ; 서동민 그림 | 로크미디어 | 818-563
67062 | 67063 | ABN000072929 | 한국 산문선. 3, 위험한 백성 | 이종묵, 장유승 [공]편역 | 민음사 | 814.5-1-3
44403 | 44404 | ABN000099376 | 유전과 게놈 : '개성'은 어떻게 유전되는가? : 유전의 기본부터 맞춤의료, 게놈 편집까지 | [뉴턴프레스 편] ; 강금희 ; 이세영 [공] 번역 | 아이뉴턴 | 408-2-117
10987 | 10988 | ABN000141216 | (더책)아빠나무 | 김미영 글·그림 | 고래뱃속 | 더책 813.8-515
83202 | 83203 | ABN000056168 | 하나야 놀자 두리야 놀자 | 김녹두 글 ; 김진화 그림 | 문학동네 | 아동 813.8-1492
63862 | 63863 | ABN000076270 | 희망 | 앙드레 말로 지음 ; 김웅권 옮김 | 문학동네 | 808.31-2-163
19904 | 19905 | ABN000131545 | (더책)아빠가 둘이야? | 임지형 글 ; 윤태규 그림 | 키다리 | 더책 813.8-474

Last rows

(row index) | 번호 | 등록번호 | 서명 | 저자 | 발행자 | 청구기호
46 | 47 | ABN000154706 | 외계에서 온 펀자이씨 | 엄유진 글·그림 | 문학동네 | 818-1919-2
51780 | 51781 | ABN000089501 | (청소년)북유럽 신화=. 3Norse mythology | 노경실 지음 | 자음과모음 | 219.23-9-3
28721 | 28722 | ABN000120195 | 국어를 좋아해 : 의성어·의태어 | 기린교육연구소 글·기획 ; 김소희 그림 | 기린미디어 | 아동 710-26-3
54928 | 54929 | ABN000086150 | 익숙한 길의 왼쪽 | 황선미 지음 | 미디어창비 | 814.7-680
11352 | 11353 | ABN000140851 | 빛이 매혹이 될 때 : 빛의 물리학은 어떻게 예술과 우리의 세계를 확장시켰나 | 서민아 지음 | 인플루엔셜 | 425-4=3
11362 | 11363 | ABN000140841 | 성공한 나라 불안한 시민 : 대전환 시대, 한국 복지국가의 새판 짜기 | 이태수 외 지음 | 헤이북스 | 338.15-16=3
70336 | 70337 | ABN000069546 | (최열 아저씨의)지구 온난화 이야기 | 최열 글 ; 조원희 그림 | 도요새:환경재단 | 아동 539.9-41
8353 | 8354 | ABN000144951 | 세계를 품은 외교관 : 외교관을 꿈꾸는 이들을 위한 스토리 가이드북 | 민동석 지음 | 이담Books | 321.55-65
42643 | 42644 | ABN000101151 | 수탉과 독재자 | 카르멘 애그라 디디 글 ; 유진 옐친 그림 ; 김경희 옮김 | 길벗어린이 | 그림책 843-1189
91618 | 91619 | ABN000047336 | (한현조·천종현 선생님의)천하무적 창의수학 연구소. 1, 수편 | 한헌조 ; 천종현 [공]지음 ; 배소미 스토리 ; 김영진 그림 | 보랏빛소 | 아동 410-98-1