Overview

Dataset statistics

Number of variables8
Number of observations2441
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory157.5 KiB
Average record size in memory66.1 B

Variable types

Numeric2
Text5
Categorical1

Dataset

Description2022년 2분기(4~6월) 대구광역시립달성도서관에 새로 들어온 신착도서 목록입니다.
Author대구광역시교육청 대구광역시립달성도서관
URLhttps://www.data.go.kr/data/15102562/fileData.do

Alerts

발행년 is highly imbalanced (68.6%)Imbalance
번호 has unique valuesUnique
등록번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 15:04:34.810292
Analysis finished2023-12-12 15:04:37.015304
Duration2.21 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct2441
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1221
Minimum1
Maximum2441
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.6 KiB
2023-12-13T00:04:37.102060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile123
Q1611
median1221
Q31831
95-th percentile2319
Maximum2441
Range2440
Interquartile range (IQR)1220

Descriptive statistics

Standard deviation704.80033
Coefficient of variation (CV)0.57723204
Kurtosis-1.2
Mean1221
Median Absolute Deviation (MAD)610
Skewness0
Sum2980461
Variance496743.5
MonotonicityStrictly increasing
2023-12-13T00:04:37.254269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
1631 1
 
< 0.1%
1624 1
 
< 0.1%
1625 1
 
< 0.1%
1626 1
 
< 0.1%
1627 1
 
< 0.1%
1628 1
 
< 0.1%
1629 1
 
< 0.1%
1630 1
 
< 0.1%
1632 1
 
< 0.1%
Other values (2431) 2431
99.6%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2441 1
< 0.1%
2440 1
< 0.1%
2439 1
< 0.1%
2438 1
< 0.1%
2437 1
< 0.1%
2436 1
< 0.1%
2435 1
< 0.1%
2434 1
< 0.1%
2433 1
< 0.1%
2432 1
< 0.1%

등록번호
Text

UNIQUE 

Distinct2441
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size19.2 KiB
2023-12-13T00:04:37.517565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters29292
Distinct characters17
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2441 ?
Unique (%)100.0%

Sample

1st rowBPG000233453
2nd rowBPG000233490
3rd rowBPG000233462
4th rowBPG000233461
5th rowBPG000233471
ValueCountFrequency (%)
bpg000233453 1
 
< 0.1%
bpg000233744 1
 
< 0.1%
bpg000233834 1
 
< 0.1%
bpg000233848 1
 
< 0.1%
bpg000233910 1
 
< 0.1%
bpg000233664 1
 
< 0.1%
bpg000233640 1
 
< 0.1%
bpg000233625 1
 
< 0.1%
bpg000233891 1
 
< 0.1%
bpg000233731 1
 
< 0.1%
Other values (2431) 2431
99.6%
2023-12-13T00:04:37.947593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8288
28.3%
3 4057
13.9%
2 3664
12.5%
B 2441
 
8.3%
P 2433
 
8.3%
G 2343
 
8.0%
4 1391
 
4.7%
5 914
 
3.1%
6 773
 
2.6%
1 766
 
2.6%
Other values (7) 2222
 
7.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 21969
75.0%
Uppercase Letter 7323
 
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 8288
37.7%
3 4057
18.5%
2 3664
16.7%
4 1391
 
6.3%
5 914
 
4.2%
6 773
 
3.5%
1 766
 
3.5%
7 729
 
3.3%
9 704
 
3.2%
8 683
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
B 2441
33.3%
P 2433
33.2%
G 2343
32.0%
M 90
 
1.2%
N 8
 
0.1%
S 7
 
0.1%
T 1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common 21969
75.0%
Latin 7323
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 8288
37.7%
3 4057
18.5%
2 3664
16.7%
4 1391
 
6.3%
5 914
 
4.2%
6 773
 
3.5%
1 766
 
3.5%
7 729
 
3.3%
9 704
 
3.2%
8 683
 
3.1%
Latin
ValueCountFrequency (%)
B 2441
33.3%
P 2433
33.2%
G 2343
32.0%
M 90
 
1.2%
N 8
 
0.1%
S 7
 
0.1%
T 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29292
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8288
28.3%
3 4057
13.9%
2 3664
12.5%
B 2441
 
8.3%
P 2433
 
8.3%
G 2343
 
8.0%
4 1391
 
4.7%
5 914
 
3.1%
6 773
 
2.6%
1 766
 
2.6%
Other values (7) 2222
 
7.6%
Distinct2380
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size19.2 KiB
2023-12-13T00:04:38.214858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length11.188447
Min length6

Characters and Unicode

Total characters27311
Distinct characters405
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2343 ?
Unique (%)96.0%

Sample

1st row189.24-가831ㅈ
2nd row813.7-정72ㅂ
3rd rowJ 340-이73ㅇ
4th row331.54-오223ㅇ
5th row375.441-조79ㅅ
ValueCountFrequency (%)
j 656
 
18.6%
mc 181
 
5.1%
wj 90
 
2.6%
81
 
2.3%
54
 
1.5%
408-자64 13
 
0.4%
교(j 11
 
0.3%
人文 8
 
0.2%
998-미232 6
 
0.2%
375.1-라69ㅅ 4
 
0.1%
Other values (2373) 2419
68.7%
2023-12-13T00:04:38.619435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 2761
 
10.1%
3 2520
 
9.2%
8 2466
 
9.0%
1 2312
 
8.5%
2 1641
 
6.0%
. 1611
 
5.9%
9 1381
 
5.1%
5 1356
 
5.0%
4 1332
 
4.9%
7 1202
 
4.4%
Other values (395) 8729
32.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 15742
57.6%
Other Letter 4681
 
17.1%
Dash Punctuation 2761
 
10.1%
Other Punctuation 1611
 
5.9%
Uppercase Letter 1299
 
4.8%
Space Separator 1082
 
4.0%
Lowercase Letter 78
 
0.3%
Math Symbol 35
 
0.1%
Open Punctuation 11
 
< 0.1%
Close Punctuation 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
496
 
10.6%
281
 
6.0%
227
 
4.8%
224
 
4.8%
200
 
4.3%
166
 
3.5%
153
 
3.3%
153
 
3.3%
147
 
3.1%
133
 
2.8%
Other values (342) 2501
53.4%
Uppercase Letter
ValueCountFrequency (%)
J 760
58.5%
M 193
 
14.9%
C 184
 
14.2%
W 94
 
7.2%
B 16
 
1.2%
P 11
 
0.8%
S 7
 
0.5%
F 6
 
0.5%
D 6
 
0.5%
H 5
 
0.4%
Other values (10) 17
 
1.3%
Lowercase Letter
ValueCountFrequency (%)
b 11
14.1%
t 11
14.1%
m 9
11.5%
a 7
9.0%
w 6
7.7%
s 5
 
6.4%
p 4
 
5.1%
r 4
 
5.1%
c 4
 
5.1%
l 3
 
3.8%
Other values (7) 14
17.9%
Decimal Number
ValueCountFrequency (%)
3 2520
16.0%
8 2466
15.7%
1 2312
14.7%
2 1641
10.4%
9 1381
8.8%
5 1356
8.6%
4 1332
8.5%
7 1202
7.6%
6 940
 
6.0%
0 592
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 2761
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1611
100.0%
Space Separator
ValueCountFrequency (%)
1082
100.0%
Math Symbol
ValueCountFrequency (%)
= 35
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 21253
77.8%
Hangul 4665
 
17.1%
Latin 1377
 
5.0%
Han 16
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
496
 
10.6%
281
 
6.0%
227
 
4.9%
224
 
4.8%
200
 
4.3%
166
 
3.6%
153
 
3.3%
153
 
3.3%
147
 
3.2%
133
 
2.9%
Other values (340) 2485
53.3%
Latin
ValueCountFrequency (%)
J 760
55.2%
M 193
 
14.0%
C 184
 
13.4%
W 94
 
6.8%
B 16
 
1.2%
b 11
 
0.8%
t 11
 
0.8%
P 11
 
0.8%
m 9
 
0.7%
a 7
 
0.5%
Other values (27) 81
 
5.9%
Common
ValueCountFrequency (%)
- 2761
13.0%
3 2520
11.9%
8 2466
11.6%
1 2312
10.9%
2 1641
7.7%
. 1611
7.6%
9 1381
6.5%
5 1356
6.4%
4 1332
6.3%
7 1202
5.7%
Other values (6) 2671
12.6%
Han
ValueCountFrequency (%)
8
50.0%
8
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 22630
82.9%
Hangul 2498
 
9.1%
Compat Jamo 2167
 
7.9%
CJK 16
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 2761
12.2%
3 2520
11.1%
8 2466
10.9%
1 2312
10.2%
2 1641
7.3%
. 1611
 
7.1%
9 1381
 
6.1%
5 1356
 
6.0%
4 1332
 
5.9%
7 1202
 
5.3%
Other values (43) 4048
17.9%
Compat Jamo
ValueCountFrequency (%)
496
22.9%
281
13.0%
227
10.5%
166
 
7.7%
153
 
7.1%
153
 
7.1%
147
 
6.8%
133
 
6.1%
115
 
5.3%
68
 
3.1%
Other values (9) 228
10.5%
Hangul
ValueCountFrequency (%)
224
 
9.0%
200
 
8.0%
101
 
4.0%
81
 
3.2%
65
 
2.6%
54
 
2.2%
53
 
2.1%
45
 
1.8%
37
 
1.5%
33
 
1.3%
Other values (321) 1605
64.3%
CJK
ValueCountFrequency (%)
8
50.0%
8
50.0%

서명
Text

Distinct2427
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size19.2 KiB
2023-12-13T00:04:39.016536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length95
Median length63
Mean length25.37034
Min length1

Characters and Unicode

Total characters61929
Distinct characters1153
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2416 ?
Unique (%)99.0%

Sample

1st row자식을 미치게 만드는 부모들 : 상처주고 공격하고 지배하려는 부모와 그로부터 벗어나는 법
2nd row보헤미안 랩소디 : 정재민 장편소설
3rd row알아 두면 세상이 보이는 선거와 정치 30
4th row아무것도 하지 않는 법
5th row선을 넘는 초등수학 공부법 : 수학 1등급을 만드는 초등 6년 완전 학습
ValueCountFrequency (%)
1410
 
8.5%
장편소설 113
 
0.7%
이야기 108
 
0.6%
위한 106
 
0.6%
1 71
 
0.4%
70
 
0.4%
2 67
 
0.4%
the 54
 
0.3%
큰글자책 53
 
0.3%
3 40
 
0.2%
Other values (8343) 14593
87.5%
2023-12-13T00:04:39.567688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14247
 
23.0%
: 1406
 
2.3%
1152
 
1.9%
1119
 
1.8%
975
 
1.6%
665
 
1.1%
558
 
0.9%
549
 
0.9%
535
 
0.9%
515
 
0.8%
Other values (1143) 40208
64.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39598
63.9%
Space Separator 14247
 
23.0%
Lowercase Letter 3611
 
5.8%
Other Punctuation 2053
 
3.3%
Decimal Number 1138
 
1.8%
Uppercase Letter 489
 
0.8%
Open Punctuation 320
 
0.5%
Close Punctuation 320
 
0.5%
Math Symbol 127
 
0.2%
Dash Punctuation 24
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1152
 
2.9%
1119
 
2.8%
975
 
2.5%
665
 
1.7%
558
 
1.4%
549
 
1.4%
535
 
1.4%
515
 
1.3%
504
 
1.3%
498
 
1.3%
Other values (1054) 32528
82.1%
Lowercase Letter
ValueCountFrequency (%)
e 455
12.6%
o 323
 
8.9%
a 259
 
7.2%
n 258
 
7.1%
i 248
 
6.9%
t 241
 
6.7%
r 238
 
6.6%
s 227
 
6.3%
h 197
 
5.5%
l 156
 
4.3%
Other values (16) 1009
27.9%
Uppercase Letter
ValueCountFrequency (%)
T 69
14.1%
S 49
 
10.0%
A 40
 
8.2%
B 34
 
7.0%
M 32
 
6.5%
I 25
 
5.1%
W 22
 
4.5%
P 22
 
4.5%
D 20
 
4.1%
C 20
 
4.1%
Other values (16) 156
31.9%
Other Punctuation
ValueCountFrequency (%)
: 1406
68.5%
. 331
 
16.1%
! 163
 
7.9%
' 66
 
3.2%
· 59
 
2.9%
& 13
 
0.6%
# 5
 
0.2%
% 5
 
0.2%
2
 
0.1%
; 1
 
< 0.1%
Other values (2) 2
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 287
25.2%
0 235
20.7%
2 207
18.2%
3 111
 
9.8%
5 78
 
6.9%
4 65
 
5.7%
9 49
 
4.3%
6 39
 
3.4%
8 34
 
3.0%
7 33
 
2.9%
Open Punctuation
ValueCountFrequency (%)
( 312
97.5%
4
 
1.2%
[ 3
 
0.9%
1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 312
97.5%
4
 
1.2%
] 3
 
0.9%
1
 
0.3%
Math Symbol
ValueCountFrequency (%)
= 112
88.2%
~ 7
 
5.5%
+ 6
 
4.7%
× 2
 
1.6%
Space Separator
ValueCountFrequency (%)
14247
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39589
63.9%
Common 18231
29.4%
Latin 4100
 
6.6%
Han 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1152
 
2.9%
1119
 
2.8%
975
 
2.5%
665
 
1.7%
558
 
1.4%
549
 
1.4%
535
 
1.4%
515
 
1.3%
504
 
1.3%
498
 
1.3%
Other values (1046) 32519
82.1%
Latin
ValueCountFrequency (%)
e 455
 
11.1%
o 323
 
7.9%
a 259
 
6.3%
n 258
 
6.3%
i 248
 
6.0%
t 241
 
5.9%
r 238
 
5.8%
s 227
 
5.5%
h 197
 
4.8%
l 156
 
3.8%
Other values (42) 1498
36.5%
Common
ValueCountFrequency (%)
14247
78.1%
: 1406
 
7.7%
. 331
 
1.8%
( 312
 
1.7%
) 312
 
1.7%
1 287
 
1.6%
0 235
 
1.3%
2 207
 
1.1%
! 163
 
0.9%
= 112
 
0.6%
Other values (27) 619
 
3.4%
Han
ValueCountFrequency (%)
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39579
63.9%
ASCII 22258
35.9%
None 73
 
0.1%
Compat Jamo 10
 
< 0.1%
CJK 9
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
14247
64.0%
: 1406
 
6.3%
e 455
 
2.0%
. 331
 
1.5%
o 323
 
1.5%
( 312
 
1.4%
) 312
 
1.4%
1 287
 
1.3%
a 259
 
1.2%
n 258
 
1.2%
Other values (72) 4068
 
18.3%
Hangul
ValueCountFrequency (%)
1152
 
2.9%
1119
 
2.8%
975
 
2.5%
665
 
1.7%
558
 
1.4%
549
 
1.4%
535
 
1.4%
515
 
1.3%
504
 
1.3%
498
 
1.3%
Other values (1038) 32509
82.1%
None
ValueCountFrequency (%)
· 59
80.8%
4
 
5.5%
4
 
5.5%
× 2
 
2.7%
2
 
2.7%
1
 
1.4%
1
 
1.4%
Compat Jamo
ValueCountFrequency (%)
3
30.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
CJK
ValueCountFrequency (%)
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

저자
Text

Distinct2247
Distinct (%)92.1%
Missing0
Missing (%)0.0%
Memory size19.2 KiB
2023-12-13T00:04:39.977807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length98
Median length63
Mean length16.0127
Min length3

Characters and Unicode

Total characters39087
Distinct characters732
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2127 ?
Unique (%)87.1%

Sample

1st row가타다 다마미 지음 ; 김수정 옮김
2nd row정재민 지음
3rd row이정호 글 ; 원정민 그림
4th row제니 오델 지음 ; 김하현 옮김
5th row조지희(깔루아) 지음
ValueCountFrequency (%)
1567
 
14.3%
지음 1132
 
10.3%
옮김 596
 
5.4%
그림 588
 
5.4%
504
 
4.6%
지은이 430
 
3.9%
옮긴이 154
 
1.4%
글·그림 117
 
1.1%
by 97
 
0.9%
illustrated 47
 
0.4%
Other values (4006) 5739
52.3%
2023-12-13T00:04:40.545326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8542
21.9%
1838
 
4.7%
; 1563
 
4.0%
1360
 
3.5%
1202
 
3.1%
1184
 
3.0%
: 1136
 
2.9%
788
 
2.0%
752
 
1.9%
743
 
1.9%
Other values (722) 19979
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24423
62.5%
Space Separator 8542
 
21.9%
Other Punctuation 2917
 
7.5%
Lowercase Letter 2631
 
6.7%
Uppercase Letter 429
 
1.1%
Close Punctuation 61
 
0.2%
Open Punctuation 61
 
0.2%
Decimal Number 14
 
< 0.1%
Dash Punctuation 7
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1838
 
7.5%
1360
 
5.6%
1202
 
4.9%
1184
 
4.8%
788
 
3.2%
752
 
3.1%
743
 
3.0%
690
 
2.8%
630
 
2.6%
405
 
1.7%
Other values (652) 14831
60.7%
Lowercase Letter
ValueCountFrequency (%)
e 330
12.5%
t 255
9.7%
a 236
9.0%
r 229
8.7%
l 227
 
8.6%
n 208
 
7.9%
i 202
 
7.7%
y 140
 
5.3%
s 127
 
4.8%
b 108
 
4.1%
Other values (16) 569
21.6%
Uppercase Letter
ValueCountFrequency (%)
S 50
11.7%
B 42
9.8%
A 39
 
9.1%
C 37
 
8.6%
M 32
 
7.5%
R 28
 
6.5%
J 28
 
6.5%
P 26
 
6.1%
T 26
 
6.1%
D 21
 
4.9%
Other values (13) 100
23.3%
Other Punctuation
ValueCountFrequency (%)
; 1563
53.6%
: 1136
38.9%
· 136
 
4.7%
. 72
 
2.5%
& 4
 
0.1%
' 4
 
0.1%
/ 2
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 5
35.7%
0 3
21.4%
6 2
 
14.3%
9 2
 
14.3%
2 1
 
7.1%
3 1
 
7.1%
Close Punctuation
ValueCountFrequency (%)
] 54
88.5%
) 7
 
11.5%
Open Punctuation
ValueCountFrequency (%)
[ 54
88.5%
( 7
 
11.5%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Space Separator
ValueCountFrequency (%)
8542
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24423
62.5%
Common 11604
29.7%
Latin 3060
 
7.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1838
 
7.5%
1360
 
5.6%
1202
 
4.9%
1184
 
4.8%
788
 
3.2%
752
 
3.1%
743
 
3.0%
690
 
2.8%
630
 
2.6%
405
 
1.7%
Other values (652) 14831
60.7%
Latin
ValueCountFrequency (%)
e 330
 
10.8%
t 255
 
8.3%
a 236
 
7.7%
r 229
 
7.5%
l 227
 
7.4%
n 208
 
6.8%
i 202
 
6.6%
y 140
 
4.6%
s 127
 
4.2%
b 108
 
3.5%
Other values (39) 998
32.6%
Common
ValueCountFrequency (%)
8542
73.6%
; 1563
 
13.5%
: 1136
 
9.8%
· 136
 
1.2%
. 72
 
0.6%
] 54
 
0.5%
[ 54
 
0.5%
- 7
 
0.1%
( 7
 
0.1%
) 7
 
0.1%
Other values (11) 26
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24423
62.5%
ASCII 14528
37.2%
None 136
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8542
58.8%
; 1563
 
10.8%
: 1136
 
7.8%
e 330
 
2.3%
t 255
 
1.8%
a 236
 
1.6%
r 229
 
1.6%
l 227
 
1.6%
n 208
 
1.4%
i 202
 
1.4%
Other values (59) 1600
 
11.0%
Hangul
ValueCountFrequency (%)
1838
 
7.5%
1360
 
5.6%
1202
 
4.9%
1184
 
4.8%
788
 
3.2%
752
 
3.1%
743
 
3.0%
690
 
2.8%
630
 
2.6%
405
 
1.7%
Other values (652) 14831
60.7%
None
ValueCountFrequency (%)
· 136
100.0%
Distinct1182
Distinct (%)48.4%
Missing0
Missing (%)0.0%
Memory size19.2 KiB
2023-12-13T00:04:40.882679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length29
Mean length6.1581319
Min length1

Characters and Unicode

Total characters15032
Distinct characters601
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique742 ?
Unique (%)30.4%

Sample

1st rowWillcompany(윌컴퍼니)
2nd row나무옆의자
3rd row푸른날개
4th row필로우
5th row책밥
ValueCountFrequency (%)
문학동네 50
 
1.8%
books 50
 
1.8%
서울문화사 31
 
1.1%
비룡소 30
 
1.1%
민음사 29
 
1.1%
창비 21
 
0.8%
김영사 20
 
0.7%
위즈덤하우스 18
 
0.7%
아울북 17
 
0.6%
rhk(알에이치코리아 17
 
0.6%
Other values (1254) 2437
89.6%
2023-12-13T00:04:41.353209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
481
 
3.2%
448
 
3.0%
415
 
2.8%
392
 
2.6%
: 373
 
2.5%
o 326
 
2.2%
279
 
1.9%
238
 
1.6%
213
 
1.4%
198
 
1.3%
Other values (591) 11669
77.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11501
76.5%
Lowercase Letter 1779
 
11.8%
Uppercase Letter 621
 
4.1%
Other Punctuation 410
 
2.7%
Space Separator 279
 
1.9%
Open Punctuation 189
 
1.3%
Close Punctuation 189
 
1.3%
Decimal Number 62
 
0.4%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
481
 
4.2%
448
 
3.9%
415
 
3.6%
392
 
3.4%
238
 
2.1%
213
 
1.9%
198
 
1.7%
196
 
1.7%
174
 
1.5%
164
 
1.4%
Other values (518) 8582
74.6%
Lowercase Letter
ValueCountFrequency (%)
o 326
18.3%
s 167
9.4%
r 147
8.3%
k 142
8.0%
e 138
7.8%
a 137
7.7%
n 122
 
6.9%
i 111
 
6.2%
l 107
 
6.0%
d 56
 
3.1%
Other values (15) 326
18.3%
Uppercase Letter
ValueCountFrequency (%)
B 142
22.9%
K 49
 
7.9%
M 46
 
7.4%
S 42
 
6.8%
H 36
 
5.8%
C 33
 
5.3%
R 33
 
5.3%
P 32
 
5.2%
E 31
 
5.0%
O 23
 
3.7%
Other values (15) 154
24.8%
Decimal Number
ValueCountFrequency (%)
2 28
45.2%
1 24
38.7%
6 2
 
3.2%
4 2
 
3.2%
8 1
 
1.6%
7 1
 
1.6%
9 1
 
1.6%
5 1
 
1.6%
3 1
 
1.6%
0 1
 
1.6%
Other Punctuation
ValueCountFrequency (%)
: 373
91.0%
& 10
 
2.4%
. 10
 
2.4%
' 7
 
1.7%
# 7
 
1.7%
· 2
 
0.5%
1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 188
99.5%
[ 1
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 188
99.5%
] 1
 
0.5%
Space Separator
ValueCountFrequency (%)
279
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11499
76.5%
Latin 2400
 
16.0%
Common 1131
 
7.5%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
481
 
4.2%
448
 
3.9%
415
 
3.6%
392
 
3.4%
238
 
2.1%
213
 
1.9%
198
 
1.7%
196
 
1.7%
174
 
1.5%
164
 
1.4%
Other values (516) 8580
74.6%
Latin
ValueCountFrequency (%)
o 326
 
13.6%
s 167
 
7.0%
r 147
 
6.1%
k 142
 
5.9%
B 142
 
5.9%
e 138
 
5.8%
a 137
 
5.7%
n 122
 
5.1%
i 111
 
4.6%
l 107
 
4.5%
Other values (40) 861
35.9%
Common
ValueCountFrequency (%)
: 373
33.0%
279
24.7%
( 188
16.6%
) 188
16.6%
2 28
 
2.5%
1 24
 
2.1%
& 10
 
0.9%
. 10
 
0.9%
' 7
 
0.6%
# 7
 
0.6%
Other values (13) 17
 
1.5%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11498
76.5%
ASCII 3528
 
23.5%
None 3
 
< 0.1%
CJK 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
481
 
4.2%
448
 
3.9%
415
 
3.6%
392
 
3.4%
238
 
2.1%
213
 
1.9%
198
 
1.7%
196
 
1.7%
174
 
1.5%
164
 
1.4%
Other values (515) 8579
74.6%
ASCII
ValueCountFrequency (%)
: 373
 
10.6%
o 326
 
9.2%
279
 
7.9%
( 188
 
5.3%
) 188
 
5.3%
s 167
 
4.7%
r 147
 
4.2%
k 142
 
4.0%
B 142
 
4.0%
e 138
 
3.9%
Other values (61) 1438
40.8%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

발행년
Categorical

IMBALANCE 

Distinct22
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size19.2 KiB
2021
1855 
2022
289 
2020
 
117
2019
 
36
2017
 
35
Other values (17)
 
109

Length

Max length6
Median length4
Mean length4.0016387
Min length4

Unique

Unique6 ?
Unique (%)0.2%

Sample

1st row2020
2nd row2021
3rd row2022
4th row2021
5th row2022

Common Values

ValueCountFrequency (%)
2021 1855
76.0%
2022 289
 
11.8%
2020 117
 
4.8%
2019 36
 
1.5%
2017 35
 
1.4%
2015 24
 
1.0%
2018 17
 
0.7%
2014 17
 
0.7%
2016 14
 
0.6%
2012 7
 
0.3%
Other values (12) 30
 
1.2%

Length

2023-12-13T00:04:41.492695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2021 1856
76.0%
2022 289
 
11.8%
2020 118
 
4.8%
2019 36
 
1.5%
2017 35
 
1.4%
2015 24
 
1.0%
2018 17
 
0.7%
2014 17
 
0.7%
2016 14
 
0.6%
2013 7
 
0.3%
Other values (10) 28
 
1.1%

가격
Real number (ℝ)

Distinct107
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15662.474
Minimum0
Maximum60000
Zeros8
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size21.6 KiB
2023-12-13T00:04:41.613231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile10500
Q113000
median15000
Q317000
95-th percentile25000
Maximum60000
Range60000
Interquartile range (IQR)4000

Descriptive statistics

Standard deviation4978.1647
Coefficient of variation (CV)0.31784025
Kurtosis10.330281
Mean15662.474
Median Absolute Deviation (MAD)2000
Skewness2.3123743
Sum38232100
Variance24782124
MonotonicityNot monotonic
2023-12-13T00:04:41.776029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15000 287
 
11.8%
12000 280
 
11.5%
13000 262
 
10.7%
14000 165
 
6.8%
16000 163
 
6.7%
18000 107
 
4.4%
17000 88
 
3.6%
11000 63
 
2.6%
10000 63
 
2.6%
15800 63
 
2.6%
Other values (97) 900
36.9%
ValueCountFrequency (%)
0 8
0.3%
1800 1
 
< 0.1%
6500 1
 
< 0.1%
7000 2
 
0.1%
7500 1
 
< 0.1%
7900 1
 
< 0.1%
8000 5
0.2%
8500 3
 
0.1%
8900 1
 
< 0.1%
9000 9
0.4%
ValueCountFrequency (%)
60000 1
 
< 0.1%
55000 1
 
< 0.1%
50000 2
0.1%
48000 1
 
< 0.1%
45000 1
 
< 0.1%
43000 1
 
< 0.1%
40000 4
0.2%
39000 3
0.1%
38000 2
0.1%
36000 1
 
< 0.1%

Interactions

2023-12-13T00:04:36.547195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:04:36.355055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:04:36.655345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:04:36.450961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:04:41.871341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호발행년가격
번호1.0000.4480.497
발행년0.4481.0000.299
가격0.4970.2991.000
2023-12-13T00:04:41.949832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호가격발행년
번호1.0000.1920.182
가격0.1921.0000.115
발행년0.1820.1151.000

Missing values

2023-12-13T00:04:36.806789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:04:36.956815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호등록번호청구기호서명저자발행자발행년가격
01BPG000233453189.24-가831ㅈ자식을 미치게 만드는 부모들 : 상처주고 공격하고 지배하려는 부모와 그로부터 벗어나는 법가타다 다마미 지음 ; 김수정 옮김Willcompany(윌컴퍼니)202013500
12BPG000233490813.7-정72ㅂ보헤미안 랩소디 : 정재민 장편소설정재민 지음나무옆의자202114000
23BPG000233462J 340-이73ㅇ알아 두면 세상이 보이는 선거와 정치 30이정호 글 ; 원정민 그림푸른날개202213000
34BPG000233461331.54-오223ㅇ아무것도 하지 않는 법제니 오델 지음 ; 김하현 옮김필로우202116000
45BPG000233471375.441-조79ㅅ선을 넘는 초등수학 공부법 : 수학 1등급을 만드는 초등 6년 완전 학습조지희(깔루아) 지음책밥202218000
56BPG000233475600.04-랭65ㅇ이상한 날씨 : 위기가 범람하는 세계 속 예술이 하는 일올리비아 랭 지음 ; 이동교 옮김어크로스(어크로스출판그룹)202117000
67BPG000233496812.68-설69ㅇ-2악의 마음을 읽는 자들 : 설이나 대본집. 2설이나 지음21세기북스202219800
78BPG000233495812.68-설69ㅇ-1악의 마음을 읽는 자들 : 설이나 대본집. 1설이나 지음21세기북스202219800
89BPG000233455029-루69ㅊ책 읽는 삶 : 타인의 눈으로 새로운 세계를 보는 독서의 즐거움지은이: C. S. 루이스 ; 옮긴이: 윤종석두란노(두란노서원)202110000
910BPG000233473375.1-김14ㅇ유치원의 힘 : 처음 학교가 마지막 학교를 결정한다김경란 지음EBS Books:한국교육방송공사202017000
번호등록번호청구기호서명저자발행자발행년가격
24312432BPG000235743814.7-윤53ㅎ=2헌책방 기담 수집가지은이: 윤성근 ; 그림: 남서연황정하프시케의숲202115000
24322433BPG000235740810.907-권98ㅈ정화된 밤 : 권희철 평론집지은이: 권희철문학동네202222000
24332434BPG000235724MC 843-테69ㄱ간다아아!글·그림: 코리 R. 테이버 ; 옮김: 노은정대교북스 주니어:키즈스콜레202213000
24342435BPG000235738MC 843-시92ㅇ용이지만 괜찮아!리사 시핸 글·그림 ; 고정아 옮김지학사아르볼202214000
24352436BPG000235731MC 813.8-김67ㅁ마음버스글: 김유 ; 그림: 소복이천개의바람202213000
24362437BPG000235725MC 813.8-조64ㄱ감자아이 : 조영지 그림책지음: 조영지키위북스202215000
24372438BPG000235707480.4-신94ㅅ=2식물학자의 노트=Notes of a botanist : 식물이 내게 들려준 이야기신혜우 글·그림김영사202119800
24382439BPG000235696199.1-글42ㄱ고민의 답글배우 지음강한별202215800
24392440BPG000235736818-정53ㅇ아끼고 아낀 말 : 정세운 청춘 에세이지은이: 정세운위즈덤하우스202216000
24402441BPG000235712525.765-야58ㅅ생활 속의 그린테리어 : 한눈에 보는 식물 고르기 꾸미기 키우기야스모토 사치에 지음 ; 심수정 옮김시그마북스202216000