Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory634.8 KiB
Average record size in memory65.0 B

Variable types

Numeric1
Text6

Dataset

Description충청남도 청양군 정산면에 소재하는 정산도서관의 도서 목록에 관한 데이터로 등록번호, 청구기호, 서명, 저작자, 발행자, 발행년에 관한 데이터를 제공합니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=409&beforeMenuCd=DOM_000000201001001000&publicdatapk=3062732

Alerts

순번 has unique valuesUnique
등록번호 has unique valuesUnique

Reproduction

Analysis started2024-01-09 19:43:09.052561
Analysis finished2024-01-09 19:43:12.295652
Duration3.24 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19129.005
Minimum1
Maximum38522
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T04:43:12.362702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1795.6
Q19356.25
median19187.5
Q328781.75
95-th percentile36569.1
Maximum38522
Range38521
Interquartile range (IQR)19425.5

Descriptive statistics

Standard deviation11154.501
Coefficient of variation (CV)0.58311976
Kurtosis-1.2067748
Mean19129.005
Median Absolute Deviation (MAD)9723
Skewness0.0082152513
Sum1.9129005 × 108
Variance1.2442289 × 108
MonotonicityNot monotonic
2024-01-10T04:43:12.479861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35049 1
 
< 0.1%
34135 1
 
< 0.1%
24624 1
 
< 0.1%
21754 1
 
< 0.1%
5434 1
 
< 0.1%
30552 1
 
< 0.1%
4284 1
 
< 0.1%
16078 1
 
< 0.1%
20477 1
 
< 0.1%
3532 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
3 1
< 0.1%
10 1
< 0.1%
15 1
< 0.1%
16 1
< 0.1%
20 1
< 0.1%
22 1
< 0.1%
26 1
< 0.1%
45 1
< 0.1%
46 1
< 0.1%
ValueCountFrequency (%)
38522 1
< 0.1%
38519 1
< 0.1%
38518 1
< 0.1%
38507 1
< 0.1%
38502 1
< 0.1%
38500 1
< 0.1%
38494 1
< 0.1%
38493 1
< 0.1%
38491 1
< 0.1%
38479 1
< 0.1%

등록번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T04:43:12.692088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters120000
Distinct characters17
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st rowEM0000009180
2nd rowGM0000011087
3rd rowGM0000020943
4th rowEM0000009251
5th rowEM0000008867
ValueCountFrequency (%)
em0000009180 1
 
< 0.1%
gm0000009013 1
 
< 0.1%
gm0000010139 1
 
< 0.1%
em0000007135 1
 
< 0.1%
gm0000013321 1
 
< 0.1%
nb0000000648 1
 
< 0.1%
em0000001413 1
 
< 0.1%
gm0000016038 1
 
< 0.1%
em0000000696 1
 
< 0.1%
gm0000018536 1
 
< 0.1%
Other values (9990) 9990
99.9%
2024-01-10T04:43:12.981438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 61531
51.3%
M 9638
 
8.0%
1 6978
 
5.8%
G 5677
 
4.7%
2 4436
 
3.7%
3 4180
 
3.5%
4 4065
 
3.4%
5 3940
 
3.3%
6 3765
 
3.1%
9 3744
 
3.1%
Other values (7) 12046
 
10.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 100000
83.3%
Uppercase Letter 20000
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 61531
61.5%
1 6978
 
7.0%
2 4436
 
4.4%
3 4180
 
4.2%
4 4065
 
4.1%
5 3940
 
3.9%
6 3765
 
3.8%
9 3744
 
3.7%
8 3722
 
3.7%
7 3639
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
M 9638
48.2%
G 5677
28.4%
E 2663
 
13.3%
C 1461
 
7.3%
N 199
 
1.0%
B 199
 
1.0%
S 163
 
0.8%

Most occurring scripts

ValueCountFrequency (%)
Common 100000
83.3%
Latin 20000
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 61531
61.5%
1 6978
 
7.0%
2 4436
 
4.4%
3 4180
 
4.2%
4 4065
 
4.1%
5 3940
 
3.9%
6 3765
 
3.8%
9 3744
 
3.7%
8 3722
 
3.7%
7 3639
 
3.6%
Latin
ValueCountFrequency (%)
M 9638
48.2%
G 5677
28.4%
E 2663
 
13.3%
C 1461
 
7.3%
N 199
 
1.0%
B 199
 
1.0%
S 163
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 61531
51.3%
M 9638
 
8.0%
1 6978
 
5.8%
G 5677
 
4.7%
2 4436
 
3.7%
3 4180
 
3.5%
4 4065
 
3.4%
5 3940
 
3.3%
6 3765
 
3.1%
9 3744
 
3.1%
Other values (7) 12046
 
10.0%
Distinct9832
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T04:43:13.220085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length18
Mean length12.4172
Min length7

Characters and Unicode

Total characters124172
Distinct characters627
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9689 ?
Unique (%)96.9%

Sample

1st row아동 498-정22ㅇ
2nd row327.04-정67ㅇ
3rd row517.52-이25ㅆ
4th row아동 443.1-버878ㄹ
5th row아동 813.8-박293ㅇ
ValueCountFrequency (%)
아동 2565
 
18.0%
유아 1397
 
9.8%
dvd 199
 
1.4%
오디오 39
 
0.3%
참고 34
 
0.2%
388-호295 5
 
< 0.1%
843-m685a 5
 
< 0.1%
813.8-동95ㄱ 5
 
< 0.1%
747-b881a 4
 
< 0.1%
747-와68ㅌ 4
 
< 0.1%
Other values (9819) 9980
70.1%
2024-01-10T04:43:13.551974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 13791
 
11.1%
8 11553
 
9.3%
. 8773
 
7.1%
1 8773
 
7.1%
2 7425
 
6.0%
3 7234
 
5.8%
4 6573
 
5.3%
9 6056
 
4.9%
5 6012
 
4.8%
7 5536
 
4.5%
Other values (617) 42446
34.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 68320
55.0%
Other Letter 25229
 
20.3%
Dash Punctuation 13791
 
11.1%
Other Punctuation 8775
 
7.1%
Space Separator 4237
 
3.4%
Lowercase Letter 2766
 
2.2%
Uppercase Letter 829
 
0.7%
Math Symbol 225
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4066
 
16.1%
2620
 
10.4%
1534
 
6.1%
1496
 
5.9%
1061
 
4.2%
772
 
3.1%
768
 
3.0%
764
 
3.0%
540
 
2.1%
517
 
2.0%
Other values (555) 11091
44.0%
Uppercase Letter
ValueCountFrequency (%)
D 410
49.5%
V 199
24.0%
S 33
 
4.0%
M 30
 
3.6%
B 27
 
3.3%
C 17
 
2.1%
L 11
 
1.3%
R 11
 
1.3%
H 9
 
1.1%
J 9
 
1.1%
Other values (14) 73
 
8.8%
Lowercase Letter
ValueCountFrequency (%)
v 2530
91.5%
s 25
 
0.9%
a 25
 
0.9%
w 19
 
0.7%
m 17
 
0.6%
b 16
 
0.6%
p 14
 
0.5%
h 13
 
0.5%
d 13
 
0.5%
t 13
 
0.5%
Other values (13) 81
 
2.9%
Decimal Number
ValueCountFrequency (%)
8 11553
16.9%
1 8773
12.8%
2 7425
10.9%
3 7234
10.6%
4 6573
9.6%
9 6056
8.9%
5 6012
8.8%
7 5536
8.1%
6 5040
7.4%
0 4118
 
6.0%
Other Punctuation
ValueCountFrequency (%)
. 8773
> 99.9%
· 2
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 13791
100.0%
Space Separator
ValueCountFrequency (%)
4237
100.0%
Math Symbol
ValueCountFrequency (%)
= 225
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 95348
76.8%
Hangul 25229
 
20.3%
Latin 3595
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4066
 
16.1%
2620
 
10.4%
1534
 
6.1%
1496
 
5.9%
1061
 
4.2%
772
 
3.1%
768
 
3.0%
764
 
3.0%
540
 
2.1%
517
 
2.0%
Other values (555) 11091
44.0%
Latin
ValueCountFrequency (%)
v 2530
70.4%
D 410
 
11.4%
V 199
 
5.5%
S 33
 
0.9%
M 30
 
0.8%
B 27
 
0.8%
s 25
 
0.7%
a 25
 
0.7%
w 19
 
0.5%
C 17
 
0.5%
Other values (37) 280
 
7.8%
Common
ValueCountFrequency (%)
- 13791
14.5%
8 11553
12.1%
. 8773
9.2%
1 8773
9.2%
2 7425
7.8%
3 7234
7.6%
4 6573
6.9%
9 6056
6.4%
5 6012
6.3%
7 5536
5.8%
Other values (5) 13622
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 98941
79.7%
Hangul 17953
 
14.5%
Compat Jamo 7276
 
5.9%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 13791
13.9%
8 11553
11.7%
. 8773
8.9%
1 8773
8.9%
2 7425
7.5%
3 7234
7.3%
4 6573
 
6.6%
9 6056
 
6.1%
5 6012
 
6.1%
7 5536
 
5.6%
Other values (51) 17215
17.4%
Hangul
ValueCountFrequency (%)
4066
22.6%
2620
 
14.6%
1496
 
8.3%
772
 
4.3%
764
 
4.3%
322
 
1.8%
203
 
1.1%
199
 
1.1%
159
 
0.9%
150
 
0.8%
Other values (536) 7202
40.1%
Compat Jamo
ValueCountFrequency (%)
1534
21.1%
1061
14.6%
768
10.6%
540
 
7.4%
517
 
7.1%
466
 
6.4%
455
 
6.3%
424
 
5.8%
421
 
5.8%
285
 
3.9%
Other values (9) 805
11.1%
None
ValueCountFrequency (%)
· 2
100.0%

서명
Text

Distinct9895
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T04:43:13.863121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length120
Median length80
Mean length21.9167
Min length1

Characters and Unicode

Total characters219167
Distinct characters1798
Distinct categories18 ?
Distinct scripts6 ?
Distinct blocks15 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9797 ?
Unique (%)98.0%

Sample

1st row어서 와, 여기는 꾸룩새 연구소야 : 새박사 다미의 부엉이 펠릿 탐구생활
2nd row(적게 벌어도 잘사는)여자의 습관
3rd row쏘팟의 하나만 빼고 다 먹는 다이어트 : 맘껏 먹으면서 평생 날씬하게
4th row루시와 우주로 날아간 라이카
5th row(119 소방관 아저씨의)연탄꽃이 활짝 피었습니다
ValueCountFrequency (%)
4626
 
8.3%
이야기 394
 
0.7%
장편소설 382
 
0.7%
1 287
 
0.5%
2 271
 
0.5%
220
 
0.4%
위한 209
 
0.4%
우리 167
 
0.3%
the 147
 
0.3%
비밀 126
 
0.2%
Other values (23216) 49034
87.8%
2024-01-10T04:43:14.278277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45882
 
20.9%
: 4594
 
2.1%
3879
 
1.8%
3869
 
1.8%
2862
 
1.3%
e 2300
 
1.0%
2105
 
1.0%
( 2046
 
0.9%
) 2046
 
0.9%
1960
 
0.9%
Other values (1788) 147624
67.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 133141
60.7%
Space Separator 45882
 
20.9%
Lowercase Letter 18812
 
8.6%
Other Punctuation 8912
 
4.1%
Decimal Number 4201
 
1.9%
Uppercase Letter 2994
 
1.4%
Open Punctuation 2097
 
1.0%
Close Punctuation 2097
 
1.0%
Math Symbol 817
 
0.4%
Dash Punctuation 166
 
0.1%
Other values (8) 48
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3879
 
2.9%
3869
 
2.9%
2862
 
2.1%
2105
 
1.6%
1960
 
1.5%
1885
 
1.4%
1862
 
1.4%
1811
 
1.4%
1733
 
1.3%
1665
 
1.3%
Other values (1673) 109510
82.3%
Lowercase Letter
ValueCountFrequency (%)
e 2300
12.2%
o 1630
 
8.7%
a 1620
 
8.6%
i 1411
 
7.5%
n 1368
 
7.3%
t 1339
 
7.1%
r 1247
 
6.6%
s 1181
 
6.3%
l 865
 
4.6%
h 856
 
4.6%
Other values (17) 4995
26.6%
Uppercase Letter
ValueCountFrequency (%)
S 337
 
11.3%
T 315
 
10.5%
C 231
 
7.7%
D 162
 
5.4%
A 157
 
5.2%
P 157
 
5.2%
B 151
 
5.0%
M 149
 
5.0%
E 147
 
4.9%
N 140
 
4.7%
Other values (16) 1048
35.0%
Other Punctuation
ValueCountFrequency (%)
: 4594
51.5%
, 1830
 
20.5%
. 1351
 
15.2%
! 651
 
7.3%
· 199
 
2.2%
' 153
 
1.7%
73
 
0.8%
19
 
0.2%
& 14
 
0.2%
/ 9
 
0.1%
Other values (6) 19
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 1101
26.2%
0 836
19.9%
2 714
17.0%
3 399
 
9.5%
5 291
 
6.9%
4 244
 
5.8%
7 185
 
4.4%
9 148
 
3.5%
6 145
 
3.5%
8 138
 
3.3%
Math Symbol
ValueCountFrequency (%)
= 714
87.4%
~ 38
 
4.7%
+ 24
 
2.9%
18
 
2.2%
< 10
 
1.2%
> 10
 
1.2%
2
 
0.2%
× 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 2046
97.6%
[ 40
 
1.9%
7
 
0.3%
3
 
0.1%
1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 2046
97.6%
] 40
 
1.9%
7
 
0.3%
3
 
0.1%
1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
12
80.0%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Letter Number
ValueCountFrequency (%)
12
50.0%
8
33.3%
4
 
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 165
99.4%
1
 
0.6%
Currency Symbol
ValueCountFrequency (%)
1
50.0%
$ 1
50.0%
Other Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
45882
100.0%
Control
ValueCountFrequency (%)
2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 132454
60.4%
Common 64196
29.3%
Latin 21830
 
10.0%
Han 564
 
0.3%
Hiragana 83
 
< 0.1%
Katakana 40
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3879
 
2.9%
3869
 
2.9%
2862
 
2.2%
2105
 
1.6%
1960
 
1.5%
1885
 
1.4%
1862
 
1.4%
1811
 
1.4%
1733
 
1.3%
1665
 
1.3%
Other values (1307) 108823
82.2%
Han
ValueCountFrequency (%)
14
 
2.5%
14
 
2.5%
13
 
2.3%
11
 
2.0%
9
 
1.6%
8
 
1.4%
8
 
1.4%
8
 
1.4%
7
 
1.2%
7
 
1.2%
Other values (295) 465
82.4%
Common
ValueCountFrequency (%)
45882
71.5%
: 4594
 
7.2%
( 2046
 
3.2%
) 2046
 
3.2%
, 1830
 
2.9%
. 1351
 
2.1%
1 1101
 
1.7%
0 836
 
1.3%
= 714
 
1.1%
2 714
 
1.1%
Other values (49) 3082
 
4.8%
Latin
ValueCountFrequency (%)
e 2300
 
10.5%
o 1630
 
7.5%
a 1620
 
7.4%
i 1411
 
6.5%
n 1368
 
6.3%
t 1339
 
6.1%
r 1247
 
5.7%
s 1181
 
5.4%
l 865
 
4.0%
h 856
 
3.9%
Other values (46) 8013
36.7%
Hiragana
ValueCountFrequency (%)
6
 
7.2%
6
 
7.2%
5
 
6.0%
4
 
4.8%
4
 
4.8%
4
 
4.8%
4
 
4.8%
4
 
4.8%
3
 
3.6%
3
 
3.6%
Other values (25) 40
48.2%
Katakana
ValueCountFrequency (%)
3
 
7.5%
3
 
7.5%
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
Other values (16) 17
42.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 132446
60.4%
ASCII 85631
39.1%
CJK 551
 
0.3%
None 329
 
0.2%
Hiragana 83
 
< 0.1%
Katakana 40
 
< 0.1%
Number Forms 24
 
< 0.1%
Math Operators 18
 
< 0.1%
Enclosed Alphanum 14
 
< 0.1%
CJK Compat Ideographs 13
 
< 0.1%
Other values (5) 18
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
45882
53.6%
: 4594
 
5.4%
e 2300
 
2.7%
( 2046
 
2.4%
) 2046
 
2.4%
, 1830
 
2.1%
o 1630
 
1.9%
a 1620
 
1.9%
i 1411
 
1.6%
n 1368
 
1.6%
Other values (76) 20904
24.4%
Hangul
ValueCountFrequency (%)
3879
 
2.9%
3869
 
2.9%
2862
 
2.2%
2105
 
1.6%
1960
 
1.5%
1885
 
1.4%
1862
 
1.4%
1811
 
1.4%
1733
 
1.3%
1665
 
1.3%
Other values (1303) 108815
82.2%
None
ValueCountFrequency (%)
· 199
60.5%
73
 
22.2%
19
 
5.8%
đ 9
 
2.7%
7
 
2.1%
7
 
2.1%
3
 
0.9%
3
 
0.9%
2
 
0.6%
2
 
0.6%
Other values (5) 5
 
1.5%
Math Operators
ValueCountFrequency (%)
18
100.0%
CJK
ValueCountFrequency (%)
14
 
2.5%
14
 
2.5%
13
 
2.4%
11
 
2.0%
9
 
1.6%
8
 
1.5%
8
 
1.5%
8
 
1.5%
7
 
1.3%
7
 
1.3%
Other values (287) 452
82.0%
Number Forms
ValueCountFrequency (%)
12
50.0%
8
33.3%
4
 
16.7%
Enclosed Alphanum
ValueCountFrequency (%)
12
85.7%
1
 
7.1%
1
 
7.1%
Hiragana
ValueCountFrequency (%)
6
 
7.2%
6
 
7.2%
5
 
6.0%
4
 
4.8%
4
 
4.8%
4
 
4.8%
4
 
4.8%
4
 
4.8%
3
 
3.6%
3
 
3.6%
Other values (25) 40
48.2%
Punctuation
ValueCountFrequency (%)
4
57.1%
1
 
14.3%
1
 
14.3%
1
 
14.3%
Katakana
ValueCountFrequency (%)
3
 
7.5%
3
 
7.5%
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
Other values (16) 17
42.5%
CJK Compat Ideographs
ValueCountFrequency (%)
3
23.1%
2
15.4%
2
15.4%
2
15.4%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Compat Jamo
ValueCountFrequency (%)
2
25.0%
2
25.0%
2
25.0%
2
25.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%
Distinct8885
Distinct (%)88.9%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-01-10T04:43:14.571699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length112
Median length92
Mean length16.260526
Min length2

Characters and Unicode

Total characters162589
Distinct characters1160
Distinct categories13 ?
Distinct scripts6 ?
Distinct blocks10 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8217 ?
Unique (%)82.2%

Sample

1st row정다미 글 ; 이장미 그림
2nd row정은길 지음
3rd row이동훈 지음
4th row윌 버킹엄 글 ; 모미카 아르날도 그림 ; 정화진 옮김
5th row박래균 글·그림
ValueCountFrequency (%)
8285
 
16.6%
지음 4658
 
9.3%
옮김 2959
 
5.9%
그림 2780
 
5.6%
2409
 
4.8%
글·그림 556
 
1.1%
369
 
0.7%
공]지음 337
 
0.7%
by 319
 
0.6%
감수 238
 
0.5%
Other values (13370) 26964
54.1%
2024-01-10T04:43:15.024088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39966
24.6%
; 8285
 
5.1%
5928
 
3.6%
5417
 
3.3%
5352
 
3.3%
3682
 
2.3%
3576
 
2.2%
3297
 
2.0%
3203
 
2.0%
3049
 
1.9%
Other values (1150) 80834
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 100982
62.1%
Space Separator 39966
 
24.6%
Other Punctuation 9593
 
5.9%
Lowercase Letter 7495
 
4.6%
Uppercase Letter 1841
 
1.1%
Open Punctuation 1293
 
0.8%
Close Punctuation 1293
 
0.8%
Dash Punctuation 48
 
< 0.1%
Math Symbol 34
 
< 0.1%
Decimal Number 26
 
< 0.1%
Other values (3) 18
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5928
 
5.9%
5417
 
5.4%
5352
 
5.3%
3682
 
3.6%
3576
 
3.5%
3297
 
3.3%
3203
 
3.2%
3049
 
3.0%
1561
 
1.5%
1556
 
1.5%
Other values (1059) 64361
63.7%
Lowercase Letter
ValueCountFrequency (%)
e 775
 
10.3%
a 696
 
9.3%
i 617
 
8.2%
n 577
 
7.7%
r 563
 
7.5%
l 503
 
6.7%
t 503
 
6.7%
o 472
 
6.3%
y 466
 
6.2%
s 368
 
4.9%
Other values (17) 1955
26.1%
Uppercase Letter
ValueCountFrequency (%)
S 188
 
10.2%
B 174
 
9.5%
M 154
 
8.4%
J 137
 
7.4%
A 110
 
6.0%
C 105
 
5.7%
K 96
 
5.2%
T 88
 
4.8%
E 87
 
4.7%
L 84
 
4.6%
Other values (16) 618
33.6%
Other Punctuation
ValueCountFrequency (%)
; 8285
86.4%
· 830
 
8.7%
. 400
 
4.2%
: 24
 
0.3%
, 24
 
0.3%
16
 
0.2%
/ 6
 
0.1%
' 3
 
< 0.1%
" 2
 
< 0.1%
& 2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 6
23.1%
0 4
15.4%
3 4
15.4%
5 3
11.5%
8 2
 
7.7%
2 2
 
7.7%
6 2
 
7.7%
4 2
 
7.7%
9 1
 
3.8%
Open Punctuation
ValueCountFrequency (%)
[ 1280
99.0%
( 5
 
0.4%
4
 
0.3%
3
 
0.2%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
] 1280
99.0%
) 5
 
0.4%
4
 
0.3%
3
 
0.2%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
> 17
50.0%
< 17
50.0%
Other Symbol
ValueCountFrequency (%)
7
87.5%
1
 
12.5%
Space Separator
ValueCountFrequency (%)
39966
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 48
100.0%
Control
ValueCountFrequency (%)
8
100.0%
Modifier Symbol
ValueCountFrequency (%)
˙ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 100520
61.8%
Common 52271
32.1%
Latin 9336
 
5.7%
Han 424
 
0.3%
Katakana 25
 
< 0.1%
Hiragana 13
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5928
 
5.9%
5417
 
5.4%
5352
 
5.3%
3682
 
3.7%
3576
 
3.6%
3297
 
3.3%
3203
 
3.2%
3049
 
3.0%
1561
 
1.6%
1556
 
1.5%
Other values (884) 63899
63.6%
Han
ValueCountFrequency (%)
33
 
7.8%
14
 
3.3%
11
 
2.6%
10
 
2.4%
9
 
2.1%
9
 
2.1%
8
 
1.9%
8
 
1.9%
8
 
1.9%
8
 
1.9%
Other values (137) 306
72.2%
Latin
ValueCountFrequency (%)
e 775
 
8.3%
a 696
 
7.5%
i 617
 
6.6%
n 577
 
6.2%
r 563
 
6.0%
l 503
 
5.4%
t 503
 
5.4%
o 472
 
5.1%
y 466
 
5.0%
s 368
 
3.9%
Other values (43) 3796
40.7%
Common
ValueCountFrequency (%)
39966
76.5%
; 8285
 
15.9%
[ 1280
 
2.4%
] 1280
 
2.4%
· 830
 
1.6%
. 400
 
0.8%
- 48
 
0.1%
: 24
 
< 0.1%
, 24
 
< 0.1%
> 17
 
< 0.1%
Other values (28) 117
 
0.2%
Katakana
ValueCountFrequency (%)
4
16.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (7) 7
28.0%
Hiragana
ValueCountFrequency (%)
2
15.4%
2
15.4%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 100509
61.8%
ASCII 60734
37.4%
None 863
 
0.5%
CJK 424
 
0.3%
Katakana 25
 
< 0.1%
Hiragana 13
 
< 0.1%
Compat Jamo 11
 
< 0.1%
Enclosed Alphanum 7
 
< 0.1%
Modifier Letters 2
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39966
65.8%
; 8285
 
13.6%
[ 1280
 
2.1%
] 1280
 
2.1%
e 775
 
1.3%
a 696
 
1.1%
i 617
 
1.0%
n 577
 
1.0%
r 563
 
0.9%
l 503
 
0.8%
Other values (69) 6192
 
10.2%
Hangul
ValueCountFrequency (%)
5928
 
5.9%
5417
 
5.4%
5352
 
5.3%
3682
 
3.7%
3576
 
3.6%
3297
 
3.3%
3203
 
3.2%
3049
 
3.0%
1561
 
1.6%
1556
 
1.5%
Other values (883) 63888
63.6%
None
ValueCountFrequency (%)
· 830
96.2%
16
 
1.9%
4
 
0.5%
4
 
0.5%
3
 
0.3%
3
 
0.3%
đ 1
 
0.1%
1
 
0.1%
1
 
0.1%
CJK
ValueCountFrequency (%)
33
 
7.8%
14
 
3.3%
11
 
2.6%
10
 
2.4%
9
 
2.1%
9
 
2.1%
8
 
1.9%
8
 
1.9%
8
 
1.9%
8
 
1.9%
Other values (137) 306
72.2%
Compat Jamo
ValueCountFrequency (%)
11
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
7
100.0%
Katakana
ValueCountFrequency (%)
4
16.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (7) 7
28.0%
Hiragana
ValueCountFrequency (%)
2
15.4%
2
15.4%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
Modifier Letters
ValueCountFrequency (%)
˙ 2
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct2695
Distinct (%)27.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T04:43:15.362520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length40
Mean length5.3474
Min length1

Characters and Unicode

Total characters53474
Distinct characters783
Distinct categories10 ?
Distinct scripts5 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1470 ?
Unique (%)14.7%

Sample

1st row한겨레아이들
2nd row다산북스
3rd row21세기북스
4th row청어람아이
5th row주니어김영사
ValueCountFrequency (%)
웅진씽크빅:웅진다책 494
 
4.8%
창비 148
 
1.4%
문학동네 143
 
1.4%
비룡소 113
 
1.1%
신원문화사 111
 
1.1%
21세기북스 86
 
0.8%
위즈덤하우스 75
 
0.7%
서울문화사 73
 
0.7%
사계절 60
 
0.6%
웅진씽크빅 54
 
0.5%
Other values (2729) 8982
86.9%
2024-01-10T04:43:15.744139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1773
 
3.3%
: 1738
 
3.3%
1437
 
2.7%
1389
 
2.6%
1339
 
2.5%
1213
 
2.3%
1178
 
2.2%
886
 
1.7%
853
 
1.6%
822
 
1.5%
Other values (773) 40846
76.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45629
85.3%
Lowercase Letter 4036
 
7.5%
Other Punctuation 1846
 
3.5%
Uppercase Letter 1366
 
2.6%
Space Separator 340
 
0.6%
Decimal Number 236
 
0.4%
Dash Punctuation 10
 
< 0.1%
Open Punctuation 5
 
< 0.1%
Close Punctuation 5
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1773
 
3.9%
1437
 
3.1%
1389
 
3.0%
1339
 
2.9%
1213
 
2.7%
1178
 
2.6%
886
 
1.9%
853
 
1.9%
822
 
1.8%
780
 
1.7%
Other values (698) 33959
74.4%
Lowercase Letter
ValueCountFrequency (%)
o 606
15.0%
s 362
 
9.0%
r 330
 
8.2%
a 325
 
8.1%
i 314
 
7.8%
e 304
 
7.5%
n 279
 
6.9%
l 200
 
5.0%
k 194
 
4.8%
t 139
 
3.4%
Other values (16) 983
24.4%
Uppercase Letter
ValueCountFrequency (%)
B 220
16.1%
M 149
10.9%
H 136
 
10.0%
S 110
 
8.1%
K 83
 
6.1%
P 81
 
5.9%
C 76
 
5.6%
R 70
 
5.1%
T 52
 
3.8%
D 46
 
3.4%
Other values (14) 343
25.1%
Other Punctuation
ValueCountFrequency (%)
: 1738
94.1%
61
 
3.3%
. 17
 
0.9%
& 13
 
0.7%
, 5
 
0.3%
' 5
 
0.3%
/ 2
 
0.1%
! 2
 
0.1%
# 1
 
0.1%
· 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
2 111
47.0%
1 109
46.2%
0 8
 
3.4%
3 3
 
1.3%
4 2
 
0.8%
8 2
 
0.8%
9 1
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 3
60.0%
[ 2
40.0%
Close Punctuation
ValueCountFrequency (%)
) 3
60.0%
] 2
40.0%
Space Separator
ValueCountFrequency (%)
340
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45394
84.9%
Latin 5402
 
10.1%
Common 2443
 
4.6%
Han 215
 
0.4%
Katakana 20
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1773
 
3.9%
1437
 
3.2%
1389
 
3.1%
1339
 
2.9%
1213
 
2.7%
1178
 
2.6%
886
 
2.0%
853
 
1.9%
822
 
1.8%
780
 
1.7%
Other values (619) 33724
74.3%
Han
ValueCountFrequency (%)
33
15.3%
32
 
14.9%
31
 
14.4%
8
 
3.7%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
Other values (53) 81
37.7%
Latin
ValueCountFrequency (%)
o 606
 
11.2%
s 362
 
6.7%
r 330
 
6.1%
a 325
 
6.0%
i 314
 
5.8%
e 304
 
5.6%
n 279
 
5.2%
B 220
 
4.1%
l 200
 
3.7%
k 194
 
3.6%
Other values (40) 2268
42.0%
Common
ValueCountFrequency (%)
: 1738
71.1%
340
 
13.9%
2 111
 
4.5%
1 109
 
4.5%
61
 
2.5%
. 17
 
0.7%
& 13
 
0.5%
- 10
 
0.4%
0 8
 
0.3%
, 5
 
0.2%
Other values (15) 31
 
1.3%
Katakana
ValueCountFrequency (%)
3
15.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (6) 6
30.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45394
84.9%
ASCII 7783
 
14.6%
CJK 215
 
0.4%
None 62
 
0.1%
Katakana 20
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1773
 
3.9%
1437
 
3.2%
1389
 
3.1%
1339
 
2.9%
1213
 
2.7%
1178
 
2.6%
886
 
2.0%
853
 
1.9%
822
 
1.8%
780
 
1.7%
Other values (619) 33724
74.3%
ASCII
ValueCountFrequency (%)
: 1738
22.3%
o 606
 
7.8%
s 362
 
4.7%
340
 
4.4%
r 330
 
4.2%
a 325
 
4.2%
i 314
 
4.0%
e 304
 
3.9%
n 279
 
3.6%
B 220
 
2.8%
Other values (63) 2965
38.1%
None
ValueCountFrequency (%)
61
98.4%
· 1
 
1.6%
CJK
ValueCountFrequency (%)
33
15.3%
32
 
14.9%
31
 
14.4%
8
 
3.7%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
Other values (53) 81
37.7%
Katakana
ValueCountFrequency (%)
3
15.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (6) 6
30.0%
Distinct57
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T04:43:15.888653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length4
Mean length4.0029
Min length4

Characters and Unicode

Total characters40029
Distinct characters14
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)0.2%

Sample

1st row2018
2nd row2013
3rd row2020
4th row2018
5th row2018
ValueCountFrequency (%)
2011 2136
21.4%
2010 1415
14.1%
2014 812
 
8.1%
2013 795
 
8.0%
2012 753
 
7.5%
2016 716
 
7.2%
2015 654
 
6.5%
2017 491
 
4.9%
2020 405
 
4.0%
2018 343
 
3.4%
Other values (43) 1480
14.8%
2024-01-10T04:43:16.156834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 12691
31.7%
2 11146
27.8%
1 10835
27.1%
4 871
 
2.2%
3 869
 
2.2%
9 839
 
2.1%
6 804
 
2.0%
5 733
 
1.8%
7 696
 
1.7%
8 516
 
1.3%
Other values (4) 29
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 40000
99.9%
Open Punctuation 11
 
< 0.1%
Close Punctuation 11
 
< 0.1%
Lowercase Letter 5
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 12691
31.7%
2 11146
27.9%
1 10835
27.1%
4 871
 
2.2%
3 869
 
2.2%
9 839
 
2.1%
6 804
 
2.0%
5 733
 
1.8%
7 696
 
1.7%
8 516
 
1.3%
Open Punctuation
ValueCountFrequency (%)
[ 11
100.0%
Close Punctuation
ValueCountFrequency (%)
] 11
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 5
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40024
> 99.9%
Latin 5
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 12691
31.7%
2 11146
27.8%
1 10835
27.1%
4 871
 
2.2%
3 869
 
2.2%
9 839
 
2.1%
6 804
 
2.0%
5 733
 
1.8%
7 696
 
1.7%
8 516
 
1.3%
Other values (3) 24
 
0.1%
Latin
ValueCountFrequency (%)
c 5
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40029
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 12691
31.7%
2 11146
27.8%
1 10835
27.1%
4 871
 
2.2%
3 869
 
2.2%
9 839
 
2.1%
6 804
 
2.0%
5 733
 
1.8%
7 696
 
1.7%
8 516
 
1.3%
Other values (4) 29
 
0.1%

Interactions

2024-01-10T04:43:11.942356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T04:43:16.226513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번발행년
순번1.0000.924
발행년0.9241.000

Missing values

2024-01-10T04:43:12.124987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T04:43:12.228719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번등록번호청구기호서명저작자발행자발행년
3504335049EM0000009180아동 498-정22ㅇ어서 와, 여기는 꾸룩새 연구소야 : 새박사 다미의 부엉이 펠릿 탐구생활정다미 글 ; 이장미 그림한겨레아이들2018
2135621360GM0000011087327.04-정67ㅇ(적게 벌어도 잘사는)여자의 습관정은길 지음다산북스2013
3761137617GM0000020943517.52-이25ㅆ쏘팟의 하나만 빼고 다 먹는 다이어트 : 맘껏 먹으면서 평생 날씬하게이동훈 지음21세기북스2020
3548535491EM0000009251아동 443.1-버878ㄹ루시와 우주로 날아간 라이카윌 버킹엄 글 ; 모미카 아르날도 그림 ; 정화진 옮김청어람아이2018
3319233198EM0000008867아동 813.8-박293ㅇ(119 소방관 아저씨의)연탄꽃이 활짝 피었습니다박래균 글·그림주니어김영사2018
95239524GM0000005181843.6-브294ㅅ스펜스 기숙학교의 마녀들리바 브레이 지음 ; 이원경 옮김문학동네2011
2771127716GM0000014968813.7-이894ㅇ엉겅퀴 칸타타 : 이평재 장편소설이평재 지음 ; 윤후명 옮김폭스코너2015
2400624010CM0000003399유아 808-사14-v.42=2커졌다!서현 글·그림사계절2014
1888218885EM0000005044아동 811.8-정23ㅂ바다가 그린 그림 : 정대성 동시정대성 지음 ; 이규경 그림아동문예2013
94519452GM0000005162848-컬297ㅅ솔리튜드 : 고독로버트 컬 지음 ; 정연희 옮김Human & Books2011
순번등록번호청구기호서명저작자발행자발행년
50415042EM0000000944아동 833.8-대66-v.17마루 밑 아리에티메리 노튼 원작 ; 미야자키 하야오 기획·각본 ; 니와 케이코 각본 ; 요네바야시 히로마사 감독 ; 서은정 번역대원씨아이2010
72187219EM0000002584아동 199.4-엠48ㅅ세상을 구하라MBC희망특강파랑새 글 ; 박영숙 그림리젬2011
1561115614GM0000008641373.4-사68ㄷ다시 공부하고 싶은 나이, 서른 : 직장인을 위한 14일 스터디플래너사이토 다카시 지음 ; 한성례 옮김비전코리아2012
77597760GM0000003474181.37-황195ㅁ몰입 = Think harder!황농문 지음랜덤하우스2011
1700417007EM0000004089아동 410.4-송25ㅅ-v.20(코믹 메이플스토리)수학도둑. 20송도수 글 ; 서정은 그림 ; 여운방 감수서울문화사2012
2222222226GM0000011689656.2-쿠65ㅇ어썸 스케치북 : 동물줄리아 쿠오 지음 ; 이종 편집부 [옮김]이종2014
2559925603GM0000013579188.5-남14ㄴ내 안에 인생코드 : 음양오행으로 보는 운명과 체질남경우 지음굿플러스북2015
3556335569GM0000019628큰813.7-조93ㅂ-1빛의 호위 : 조해진 소설집. 1조해진 지음창비2018
2713427139GM0000014387372.68-최66ㅁ명문대로 가는 인성·진로 코칭 : 학생부 종합전형 대비최원호 지음푸른영토2014
1849818501NB0000000546DVD 688.6-이74ㅎ홍길동 2084이정인 감독디에스미디어2012