Overview

Dataset statistics

Number of variables9
Number of observations961
Missing cells483
Missing cells (%)5.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory68.6 KiB
Average record size in memory73.1 B

Variable types

Numeric1
Text6
Categorical2

Dataset

Description중랑숲어린이도서관신착자료 12월 분
Author중랑구시설관리공단
URLhttps://www.data.go.kr/data/15044313/fileData.do

Alerts

2014년 12월 신착자료목록 is highly overall correlated with Unnamed: 3High correlation
Unnamed: 3 is highly overall correlated with 2014년 12월 신착자료목록 and 1 other fieldsHigh correlation
Unnamed: 8 is highly overall correlated with Unnamed: 3High correlation
Unnamed: 3 is highly imbalanced (52.7%)Imbalance
Unnamed: 6 has 476 (49.5%) missing valuesMissing

Reproduction

Analysis started2023-12-12 04:43:24.832626
Analysis finished2023-12-12 04:43:26.422666
Duration1.59 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

2014년 12월 신착자료목록
Real number (ℝ)

HIGH CORRELATION 

Distinct959
Distinct (%)100.0%
Missing2
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean480
Minimum1
Maximum959
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.6 KiB
2023-12-12T13:43:26.508879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile48.9
Q1240.5
median480
Q3719.5
95-th percentile911.1
Maximum959
Range958
Interquartile range (IQR)479

Descriptive statistics

Standard deviation276.98375
Coefficient of variation (CV)0.57704949
Kurtosis-1.2
Mean480
Median Absolute Deviation (MAD)240
Skewness0
Sum460320
Variance76720
MonotonicityStrictly increasing
2023-12-12T13:43:26.669730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 1
 
0.1%
633 1
 
0.1%
634 1
 
0.1%
635 1
 
0.1%
636 1
 
0.1%
637 1
 
0.1%
638 1
 
0.1%
639 1
 
0.1%
640 1
 
0.1%
641 1
 
0.1%
Other values (949) 949
98.8%
(Missing) 2
 
0.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
959 1
0.1%
958 1
0.1%
957 1
0.1%
956 1
0.1%
955 1
0.1%
954 1
0.1%
953 1
0.1%
952 1
0.1%
951 1
0.1%
950 1
0.1%
Distinct960
Distinct (%)100.0%
Missing1
Missing (%)0.1%
Memory size7.6 KiB
2023-12-12T13:43:26.944650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.991667
Min length4

Characters and Unicode

Total characters11512
Distinct characters17
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique960 ?
Unique (%)100.0%

Sample

1st row등록번호
2nd rowAM0000001106
3rd rowAM0000001107
4th rowAM0000001108
5th rowAM0000001109
ValueCountFrequency (%)
등록번호 1
 
0.1%
am0000001106 1
 
0.1%
cm0000031959 1
 
0.1%
cm0000031944 1
 
0.1%
cm0000031932 1
 
0.1%
cm0000031933 1
 
0.1%
cm0000031934 1
 
0.1%
cm0000031935 1
 
0.1%
cm0000031936 1
 
0.1%
cm0000031937 1
 
0.1%
Other values (950) 950
99.0%
2023-12-12T13:43:27.402696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 5674
49.3%
1 1090
 
9.5%
M 959
 
8.3%
3 666
 
5.8%
2 614
 
5.3%
A 589
 
5.1%
C 370
 
3.2%
4 296
 
2.6%
5 294
 
2.6%
9 291
 
2.5%
Other values (7) 669
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 9590
83.3%
Uppercase Letter 1918
 
16.7%
Other Letter 4
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 5674
59.2%
1 1090
 
11.4%
3 666
 
6.9%
2 614
 
6.4%
4 296
 
3.1%
5 294
 
3.1%
9 291
 
3.0%
6 281
 
2.9%
8 198
 
2.1%
7 186
 
1.9%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Uppercase Letter
ValueCountFrequency (%)
M 959
50.0%
A 589
30.7%
C 370
 
19.3%

Most occurring scripts

ValueCountFrequency (%)
Common 9590
83.3%
Latin 1918
 
16.7%
Hangul 4
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 5674
59.2%
1 1090
 
11.4%
3 666
 
6.9%
2 614
 
6.4%
4 296
 
3.1%
5 294
 
3.1%
9 291
 
3.0%
6 281
 
2.9%
8 198
 
2.1%
7 186
 
1.9%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Latin
ValueCountFrequency (%)
M 959
50.0%
A 589
30.7%
C 370
 
19.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 11508
> 99.9%
Hangul 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 5674
49.3%
1 1090
 
9.5%
M 959
 
8.3%
3 666
 
5.8%
2 614
 
5.3%
A 589
 
5.1%
C 370
 
3.2%
4 296
 
2.6%
5 294
 
2.6%
9 291
 
2.5%
Other values (3) 665
 
5.8%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct959
Distinct (%)99.9%
Missing1
Missing (%)0.1%
Memory size7.6 KiB
2023-12-12T13:43:27.721481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length19
Mean length15.246875
Min length4

Characters and Unicode

Total characters14637
Distinct characters96
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique958 ?
Unique (%)99.8%

Sample

1st row청구기호
2nd rowBS 843-B395j
3rd rowBS 843-S951b
4th rowBS 843-B468b
5th rowBS 843-M216b
ValueCountFrequency (%)
cs 633
33.0%
bs 326
 
17.0%
747-s958c-v.4-9 2
 
0.1%
029.85-ㅇ732ㄴ 1
 
0.1%
710-ㅇ597ㅅ-v.3=3 1
 
0.1%
808.9-ㅍ84ㅋ-v.149 1
 
0.1%
710-ㅇ597ㅅ-v.5=4 1
 
0.1%
989-ㅅ926ㅅ 1
 
0.1%
843-ㅋ458ㅇ 1
 
0.1%
031-ㅈ224ㄸ-v.46 1
 
0.1%
Other values (951) 951
49.6%
2023-12-12T13:43:28.158514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1650
 
11.3%
8 1275
 
8.7%
S 1071
 
7.3%
4 986
 
6.7%
959
 
6.6%
3 925
 
6.3%
7 805
 
5.5%
. 751
 
5.1%
C 660
 
4.5%
1 660
 
4.5%
Other values (86) 4895
33.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6941
47.4%
Uppercase Letter 2507
 
17.1%
Dash Punctuation 1650
 
11.3%
Lowercase Letter 1077
 
7.4%
Space Separator 959
 
6.6%
Other Punctuation 751
 
5.1%
Other Letter 744
 
5.1%
Math Symbol 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
143
19.2%
100
13.4%
94
12.6%
65
8.7%
57
 
7.7%
51
 
6.9%
46
 
6.2%
34
 
4.6%
29
 
3.9%
24
 
3.2%
Other values (24) 101
13.6%
Uppercase Letter
ValueCountFrequency (%)
S 1071
42.7%
C 660
26.3%
B 368
 
14.7%
P 91
 
3.6%
D 53
 
2.1%
R 50
 
2.0%
G 37
 
1.5%
M 35
 
1.4%
H 23
 
0.9%
A 18
 
0.7%
Other values (14) 101
 
4.0%
Lowercase Letter
ValueCountFrequency (%)
v 485
45.0%
p 113
 
10.5%
c 69
 
6.4%
m 62
 
5.8%
d 55
 
5.1%
b 39
 
3.6%
r 29
 
2.7%
s 28
 
2.6%
h 27
 
2.5%
l 27
 
2.5%
Other values (14) 143
 
13.3%
Decimal Number
ValueCountFrequency (%)
8 1275
18.4%
4 986
14.2%
3 925
13.3%
7 805
11.6%
1 660
9.5%
2 595
8.6%
9 576
8.3%
5 494
 
7.1%
6 418
 
6.0%
0 207
 
3.0%
Dash Punctuation
ValueCountFrequency (%)
- 1650
100.0%
Space Separator
ValueCountFrequency (%)
959
100.0%
Other Punctuation
ValueCountFrequency (%)
. 751
100.0%
Math Symbol
ValueCountFrequency (%)
= 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 10309
70.4%
Latin 3584
 
24.5%
Hangul 744
 
5.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
S 1071
29.9%
C 660
18.4%
v 485
13.5%
B 368
 
10.3%
p 113
 
3.2%
P 91
 
2.5%
c 69
 
1.9%
m 62
 
1.7%
d 55
 
1.5%
D 53
 
1.5%
Other values (38) 557
15.5%
Hangul
ValueCountFrequency (%)
143
19.2%
100
13.4%
94
12.6%
65
8.7%
57
 
7.7%
51
 
6.9%
46
 
6.2%
34
 
4.6%
29
 
3.9%
24
 
3.2%
Other values (24) 101
13.6%
Common
ValueCountFrequency (%)
- 1650
16.0%
8 1275
12.4%
4 986
9.6%
959
9.3%
3 925
9.0%
7 805
7.8%
. 751
7.3%
1 660
 
6.4%
2 595
 
5.8%
9 576
 
5.6%
Other values (4) 1127
10.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13893
94.9%
Compat Jamo 715
 
4.9%
Hangul 29
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1650
 
11.9%
8 1275
 
9.2%
S 1071
 
7.7%
4 986
 
7.1%
959
 
6.9%
3 925
 
6.7%
7 805
 
5.8%
. 751
 
5.4%
C 660
 
4.8%
1 660
 
4.8%
Other values (52) 4151
29.9%
Compat Jamo
ValueCountFrequency (%)
143
20.0%
100
14.0%
94
13.1%
65
9.1%
57
 
8.0%
51
 
7.1%
46
 
6.4%
34
 
4.8%
29
 
4.1%
24
 
3.4%
Other values (8) 72
10.1%
Hangul
ValueCountFrequency (%)
7
24.1%
3
10.3%
3
10.3%
2
 
6.9%
2
 
6.9%
2
 
6.9%
1
 
3.4%
1
 
3.4%
1
 
3.4%
1
 
3.4%
Other values (6) 6
20.7%

Unnamed: 3
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
[중랑숲]어린이자료실
633 
[중랑숲]유아자료실
326 
자료실명
 
1
<NA>
 
1

Length

Max length11
Median length11
Mean length10.646202
Min length4

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row자료실명
2nd row[중랑숲]유아자료실
3rd row[중랑숲]유아자료실
4th row[중랑숲]유아자료실
5th row[중랑숲]유아자료실

Common Values

ValueCountFrequency (%)
[중랑숲]어린이자료실 633
65.9%
[중랑숲]유아자료실 326
33.9%
자료실명 1
 
0.1%
<NA> 1
 
0.1%

Length

2023-12-12T13:43:28.318116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:43:28.438822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중랑숲]어린이자료실 633
65.9%
중랑숲]유아자료실 326
33.9%
자료실명 1
 
0.1%
na 1
 
0.1%
Distinct958
Distinct (%)99.8%
Missing1
Missing (%)0.1%
Memory size7.6 KiB
2023-12-12T13:43:28.901171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length85
Median length45
Mean length18.91875
Min length1

Characters and Unicode

Total characters18162
Distinct characters654
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique956 ?
Unique (%)99.6%

Sample

1st row서명
2nd rowJourney
3rd rowBall
4th rowBaa Moo What Will We Do
5th rowBubble Trouble
ValueCountFrequency (%)
109
 
2.9%
the 95
 
2.6%
and 57
 
1.5%
이야기 37
 
1.0%
is 34
 
0.9%
a 33
 
0.9%
of 31
 
0.8%
my 26
 
0.7%
little 22
 
0.6%
day 20
 
0.5%
Other values (2193) 3256
87.5%
2023-12-12T13:43:29.495635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2839
 
15.6%
e 1065
 
5.9%
a 748
 
4.1%
o 725
 
4.0%
t 693
 
3.8%
s 671
 
3.7%
r 665
 
3.7%
i 585
 
3.2%
n 585
 
3.2%
h 467
 
2.6%
Other values (644) 9119
50.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 9042
49.8%
Other Letter 4322
23.8%
Space Separator 2839
 
15.6%
Uppercase Letter 983
 
5.4%
Other Punctuation 513
 
2.8%
Close Punctuation 171
 
0.9%
Open Punctuation 171
 
0.9%
Decimal Number 79
 
0.4%
Dash Punctuation 28
 
0.2%
Math Symbol 11
 
0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
161
 
3.7%
101
 
2.3%
96
 
2.2%
79
 
1.8%
74
 
1.7%
71
 
1.6%
66
 
1.5%
59
 
1.4%
57
 
1.3%
54
 
1.2%
Other values (564) 3504
81.1%
Lowercase Letter
ValueCountFrequency (%)
e 1065
11.8%
a 748
 
8.3%
o 725
 
8.0%
t 693
 
7.7%
s 671
 
7.4%
r 665
 
7.4%
i 585
 
6.5%
n 585
 
6.5%
h 467
 
5.2%
l 397
 
4.4%
Other values (16) 2441
27.0%
Uppercase Letter
ValueCountFrequency (%)
T 125
12.7%
M 106
 
10.8%
B 97
 
9.9%
W 73
 
7.4%
S 68
 
6.9%
A 54
 
5.5%
H 53
 
5.4%
D 49
 
5.0%
L 48
 
4.9%
F 45
 
4.6%
Other values (15) 265
27.0%
Decimal Number
ValueCountFrequency (%)
1 19
24.1%
0 14
17.7%
2 13
16.5%
3 8
10.1%
5 7
 
8.9%
4 6
 
7.6%
8 4
 
5.1%
7 3
 
3.8%
6 3
 
3.8%
9 2
 
2.5%
Other Punctuation
ValueCountFrequency (%)
! 119
23.2%
, 105
20.5%
: 99
19.3%
' 75
14.6%
. 65
12.7%
? 46
 
9.0%
" 2
 
0.4%
; 1
 
0.2%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 169
98.8%
] 2
 
1.2%
Open Punctuation
ValueCountFrequency (%)
( 169
98.8%
[ 2
 
1.2%
Math Symbol
ValueCountFrequency (%)
= 10
90.9%
~ 1
 
9.1%
Space Separator
ValueCountFrequency (%)
2839
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 10025
55.2%
Hangul 4320
23.8%
Common 3815
 
21.0%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
161
 
3.7%
101
 
2.3%
96
 
2.2%
79
 
1.8%
74
 
1.7%
71
 
1.6%
66
 
1.5%
59
 
1.4%
57
 
1.3%
54
 
1.2%
Other values (563) 3502
81.1%
Latin
ValueCountFrequency (%)
e 1065
 
10.6%
a 748
 
7.5%
o 725
 
7.2%
t 693
 
6.9%
s 671
 
6.7%
r 665
 
6.6%
i 585
 
5.8%
n 585
 
5.8%
h 467
 
4.7%
l 397
 
4.0%
Other values (41) 3424
34.2%
Common
ValueCountFrequency (%)
2839
74.4%
) 169
 
4.4%
( 169
 
4.4%
! 119
 
3.1%
, 105
 
2.8%
: 99
 
2.6%
' 75
 
2.0%
. 65
 
1.7%
? 46
 
1.2%
- 28
 
0.7%
Other values (19) 101
 
2.6%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13838
76.2%
Hangul 4319
 
23.8%
CJK 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%
Punctuation 1
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2839
20.5%
e 1065
 
7.7%
a 748
 
5.4%
o 725
 
5.2%
t 693
 
5.0%
s 671
 
4.8%
r 665
 
4.8%
i 585
 
4.2%
n 585
 
4.2%
h 467
 
3.4%
Other values (68) 4795
34.7%
Hangul
ValueCountFrequency (%)
161
 
3.7%
101
 
2.3%
96
 
2.2%
79
 
1.8%
74
 
1.7%
71
 
1.6%
66
 
1.5%
59
 
1.4%
57
 
1.3%
54
 
1.3%
Other values (562) 3501
81.1%
CJK
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
1
100.0%
Distinct795
Distinct (%)82.8%
Missing1
Missing (%)0.1%
Memory size7.6 KiB
2023-12-12T13:43:29.912764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length157
Median length64
Mean length30.2875
Min length3

Characters and Unicode

Total characters29076
Distinct characters421
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique724 ?
Unique (%)75.4%

Sample

1st row저작자
2nd rowby Aaron Becker
3rd rowword and pictures by Mary Sullivan
4th rowby A. H. Benjamin ; illustrated by Jane Chapman
5th rowby Margaret Mahy ; illustrated by Polly Dunbar
ValueCountFrequency (%)
by 949
 
15.6%
767
 
12.6%
illustrated 274
 
4.5%
그림 269
 
4.4%
235
 
3.9%
옮김 110
 
1.8%
written 89
 
1.5%
지음 65
 
1.1%
글ㆍ그림 47
 
0.8%
joy 45
 
0.7%
Other values (1736) 3249
53.3%
2023-12-12T13:43:30.549333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5168
17.8%
e 1551
 
5.3%
a 1496
 
5.1%
t 1437
 
4.9%
l 1374
 
4.7%
i 1332
 
4.6%
y 1303
 
4.5%
r 1261
 
4.3%
b 1081
 
3.7%
n 1020
 
3.5%
Other values (411) 12053
41.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 16426
56.5%
Space Separator 5168
 
17.8%
Other Letter 4399
 
15.1%
Uppercase Letter 2128
 
7.3%
Other Punctuation 876
 
3.0%
Close Punctuation 30
 
0.1%
Open Punctuation 30
 
0.1%
Dash Punctuation 18
 
0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
327
 
7.4%
322
 
7.3%
290
 
6.6%
250
 
5.7%
168
 
3.8%
110
 
2.5%
102
 
2.3%
73
 
1.7%
69
 
1.6%
68
 
1.5%
Other values (348) 2620
59.6%
Lowercase Letter
ValueCountFrequency (%)
e 1551
9.4%
a 1496
9.1%
t 1437
 
8.7%
l 1374
 
8.4%
i 1332
 
8.1%
y 1303
 
7.9%
r 1261
 
7.7%
b 1081
 
6.6%
n 1020
 
6.2%
o 857
 
5.2%
Other values (16) 3714
22.6%
Uppercase Letter
ValueCountFrequency (%)
M 208
 
9.8%
J 208
 
9.8%
S 185
 
8.7%
C 174
 
8.2%
D 146
 
6.9%
B 136
 
6.4%
A 129
 
6.1%
P 122
 
5.7%
R 106
 
5.0%
L 94
 
4.4%
Other values (15) 620
29.1%
Other Punctuation
ValueCountFrequency (%)
; 766
87.4%
. 55
 
6.3%
, 48
 
5.5%
" 2
 
0.2%
· 2
 
0.2%
' 2
 
0.2%
& 1
 
0.1%
Space Separator
ValueCountFrequency (%)
5168
100.0%
Close Punctuation
ValueCountFrequency (%)
] 30
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 30
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Decimal Number
ValueCountFrequency (%)
7 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 18554
63.8%
Common 6123
 
21.1%
Hangul 4399
 
15.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
327
 
7.4%
322
 
7.3%
290
 
6.6%
250
 
5.7%
168
 
3.8%
110
 
2.5%
102
 
2.3%
73
 
1.7%
69
 
1.6%
68
 
1.5%
Other values (348) 2620
59.6%
Latin
ValueCountFrequency (%)
e 1551
 
8.4%
a 1496
 
8.1%
t 1437
 
7.7%
l 1374
 
7.4%
i 1332
 
7.2%
y 1303
 
7.0%
r 1261
 
6.8%
b 1081
 
5.8%
n 1020
 
5.5%
o 857
 
4.6%
Other values (41) 5842
31.5%
Common
ValueCountFrequency (%)
5168
84.4%
; 766
 
12.5%
. 55
 
0.9%
, 48
 
0.8%
] 30
 
0.5%
[ 30
 
0.5%
- 18
 
0.3%
" 2
 
< 0.1%
· 2
 
< 0.1%
' 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 24675
84.9%
Hangul 4350
 
15.0%
Compat Jamo 49
 
0.2%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5168
20.9%
e 1551
 
6.3%
a 1496
 
6.1%
t 1437
 
5.8%
l 1374
 
5.6%
i 1332
 
5.4%
y 1303
 
5.3%
r 1261
 
5.1%
b 1081
 
4.4%
n 1020
 
4.1%
Other values (52) 7652
31.0%
Hangul
ValueCountFrequency (%)
327
 
7.5%
322
 
7.4%
290
 
6.7%
250
 
5.7%
168
 
3.9%
110
 
2.5%
102
 
2.3%
73
 
1.7%
69
 
1.6%
68
 
1.6%
Other values (347) 2571
59.1%
Compat Jamo
ValueCountFrequency (%)
49
100.0%
None
ValueCountFrequency (%)
· 2
100.0%

Unnamed: 6
Text

MISSING 

Distinct178
Distinct (%)36.7%
Missing476
Missing (%)49.5%
Memory size7.6 KiB
2023-12-12T13:43:31.020033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.2618557
Min length2

Characters and Unicode

Total characters2067
Distinct characters15
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique97 ?
Unique (%)20.0%

Sample

1st row권차
2nd rowv.1
3rd rowv.2
4th rowv.3
5th rowv.4
ValueCountFrequency (%)
v.2 23
 
4.7%
v.1 19
 
3.9%
v.7 19
 
3.9%
v.8 19
 
3.9%
v.3 18
 
3.7%
v.4 18
 
3.7%
v.6 18
 
3.7%
v.5 17
 
3.5%
v.9 12
 
2.5%
v.14 9
 
1.9%
Other values (168) 313
64.5%
2023-12-12T13:43:31.716187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
v 484
23.4%
. 484
23.4%
1 216
10.4%
- 207
10.0%
2 150
 
7.3%
3 111
 
5.4%
4 98
 
4.7%
5 90
 
4.4%
7 49
 
2.4%
6 48
 
2.3%
Other values (5) 130
 
6.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 890
43.1%
Lowercase Letter 484
23.4%
Other Punctuation 484
23.4%
Dash Punctuation 207
 
10.0%
Other Letter 2
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 216
24.3%
2 150
16.9%
3 111
12.5%
4 98
11.0%
5 90
10.1%
7 49
 
5.5%
6 48
 
5.4%
0 48
 
5.4%
8 43
 
4.8%
9 37
 
4.2%
Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%
Lowercase Letter
ValueCountFrequency (%)
v 484
100.0%
Other Punctuation
ValueCountFrequency (%)
. 484
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 207
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1581
76.5%
Latin 484
 
23.4%
Hangul 2
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
. 484
30.6%
1 216
13.7%
- 207
13.1%
2 150
 
9.5%
3 111
 
7.0%
4 98
 
6.2%
5 90
 
5.7%
7 49
 
3.1%
6 48
 
3.0%
0 48
 
3.0%
Other values (2) 80
 
5.1%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%
Latin
ValueCountFrequency (%)
v 484
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2065
99.9%
Hangul 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
v 484
23.4%
. 484
23.4%
1 216
10.5%
- 207
10.0%
2 150
 
7.3%
3 111
 
5.4%
4 98
 
4.7%
5 90
 
4.4%
7 49
 
2.4%
6 48
 
2.3%
Other values (3) 128
 
6.2%
Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct285
Distinct (%)29.7%
Missing1
Missing (%)0.1%
Memory size7.6 KiB
2023-12-12T13:43:32.372472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length31
Mean length12.026042
Min length2

Characters and Unicode

Total characters11545
Distinct characters235
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique164 ?
Unique (%)17.1%

Sample

1st row발행자
2nd rowCandlewick Press
3rd rowHoughton Mifflin Harcourt
4th rowLittle Tiger Press
5th rowHoughton Mifflin Harcourt
ValueCountFrequency (%)
books 125
 
7.1%
readers 83
 
4.7%
young 83
 
4.7%
puffin 76
 
4.3%
compass 53
 
3.0%
press 51
 
2.9%
dorling 44
 
2.5%
media 43
 
2.4%
kindersley,inc 42
 
2.4%
picture 40
 
2.3%
Other values (280) 1124
63.7%
2023-12-12T13:43:32.840034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
804
 
7.0%
o 793
 
6.9%
e 778
 
6.7%
r 707
 
6.1%
s 697
 
6.0%
n 665
 
5.8%
i 664
 
5.8%
a 515
 
4.5%
l 426
 
3.7%
d 405
 
3.5%
Other values (225) 5091
44.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 7728
66.9%
Other Letter 1480
 
12.8%
Uppercase Letter 1412
 
12.2%
Space Separator 804
 
7.0%
Other Punctuation 120
 
1.0%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
70
 
4.7%
66
 
4.5%
53
 
3.6%
52
 
3.5%
47
 
3.2%
45
 
3.0%
42
 
2.8%
41
 
2.8%
40
 
2.7%
40
 
2.7%
Other values (171) 984
66.5%
Lowercase Letter
ValueCountFrequency (%)
o 793
10.3%
e 778
10.1%
r 707
9.1%
s 697
 
9.0%
n 665
 
8.6%
i 664
 
8.6%
a 515
 
6.7%
l 426
 
5.5%
d 405
 
5.2%
u 332
 
4.3%
Other values (14) 1746
22.6%
Uppercase Letter
ValueCountFrequency (%)
P 238
16.9%
B 145
10.3%
C 144
10.2%
H 113
 
8.0%
R 100
 
7.1%
Y 80
 
5.7%
S 67
 
4.7%
I 66
 
4.7%
M 63
 
4.5%
K 55
 
3.9%
Other values (12) 341
24.2%
Other Punctuation
ValueCountFrequency (%)
, 49
40.8%
' 36
30.0%
. 19
 
15.8%
& 14
 
11.7%
/ 1
 
0.8%
; 1
 
0.8%
Space Separator
ValueCountFrequency (%)
804
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 9140
79.2%
Hangul 1480
 
12.8%
Common 925
 
8.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
70
 
4.7%
66
 
4.5%
53
 
3.6%
52
 
3.5%
47
 
3.2%
45
 
3.0%
42
 
2.8%
41
 
2.8%
40
 
2.7%
40
 
2.7%
Other values (171) 984
66.5%
Latin
ValueCountFrequency (%)
o 793
 
8.7%
e 778
 
8.5%
r 707
 
7.7%
s 697
 
7.6%
n 665
 
7.3%
i 664
 
7.3%
a 515
 
5.6%
l 426
 
4.7%
d 405
 
4.4%
u 332
 
3.6%
Other values (36) 3158
34.6%
Common
ValueCountFrequency (%)
804
86.9%
, 49
 
5.3%
' 36
 
3.9%
. 19
 
2.1%
& 14
 
1.5%
- 1
 
0.1%
/ 1
 
0.1%
; 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10065
87.2%
Hangul 1480
 
12.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
804
 
8.0%
o 793
 
7.9%
e 778
 
7.7%
r 707
 
7.0%
s 697
 
6.9%
n 665
 
6.6%
i 664
 
6.6%
a 515
 
5.1%
l 426
 
4.2%
d 405
 
4.0%
Other values (44) 3611
35.9%
Hangul
ValueCountFrequency (%)
70
 
4.7%
66
 
4.5%
53
 
3.6%
52
 
3.5%
47
 
3.2%
45
 
3.0%
42
 
2.8%
41
 
2.8%
40
 
2.7%
40
 
2.7%
Other values (171) 984
66.5%

Unnamed: 8
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size7.6 KiB
2014
312 
2009
118 
2013
109 
2011
82 
2005
62 
Other values (22)
278 

Length

Max length4
Median length4
Mean length3.9989594
Min length3

Unique

Unique8 ?
Unique (%)0.8%

Sample

1st row발행년
2nd row2013
3rd row2013
4th row2003
5th row2010

Common Values

ValueCountFrequency (%)
2014 312
32.5%
2009 118
 
12.3%
2013 109
 
11.3%
2011 82
 
8.5%
2005 62
 
6.5%
2012 62
 
6.5%
2007 43
 
4.5%
2010 41
 
4.3%
2008 34
 
3.5%
2006 21
 
2.2%
Other values (17) 77
 
8.0%

Length

2023-12-12T13:43:32.986793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2014 312
32.5%
2009 118
 
12.3%
2013 109
 
11.3%
2011 82
 
8.5%
2005 62
 
6.5%
2012 62
 
6.5%
2007 43
 
4.5%
2010 41
 
4.3%
2008 34
 
3.5%
2006 21
 
2.2%
Other values (17) 77
 
8.0%

Interactions

2023-12-12T13:43:25.856898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:43:33.071052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2014년 12월 신착자료목록Unnamed: 3Unnamed: 8
2014년 12월 신착자료목록1.0000.9110.714
Unnamed: 30.9111.0000.883
Unnamed: 80.7140.8831.000
2023-12-12T13:43:33.168223image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 3Unnamed: 8
Unnamed: 31.0000.717
Unnamed: 80.7171.000
2023-12-12T13:43:33.265951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2014년 12월 신착자료목록Unnamed: 3Unnamed: 8
2014년 12월 신착자료목록1.0000.7480.340
Unnamed: 30.7481.0000.717
Unnamed: 80.3400.7171.000

Missing values

2023-12-12T13:43:25.994499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:43:26.152175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T13:43:26.308865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

2014년 12월 신착자료목록Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8
0<NA>등록번호청구기호자료실명서명저작자권차발행자발행년
11AM0000001106BS 843-B395j[중랑숲]유아자료실Journeyby Aaron Becker<NA>Candlewick Press2013
22AM0000001107BS 843-S951b[중랑숲]유아자료실Ballword and pictures by Mary Sullivan<NA>Houghton Mifflin Harcourt2013
33AM0000001108BS 843-B468b[중랑숲]유아자료실Baa Moo What Will We Doby A. H. Benjamin ; illustrated by Jane Chapman<NA>Little Tiger Press2003
44AM0000001109BS 843-M216b[중랑숲]유아자료실Bubble Troubleby Margaret Mahy ; illustrated by Polly Dunbar<NA>Houghton Mifflin Harcourt2010
55AM0000001110BS 843-B942b[중랑숲]유아자료실Bunny Loves To WriteWritten by Buster Bunny<NA>Peter Bently2014
66AM0000001111BS 843-S817b[중랑숲]유아자료실Busy boatsby Susan Steggall<NA>Frances Lincoln Children's2010
77AM0000001112BS 843-C255b-v.1[중랑숲]유아자료실Biscuit loves father's dayby Alyssa Satin Capucilli ; illustrated by Pat Schoriesv.1HarperFestival2009
88AM0000001113BS 843-C255b-v.2[중랑숲]유아자료실Biscuit loves mother's dayby Alyssa Satin Capucilli ; illustrated by Pat Schoriesv.2HarperFestival2009
99AM0000001114BS 843-C255b-v.3[중랑숲]유아자료실Biscuit visits the doctorstory by Alyssa Satin Capucilli ; pictures by Pat Schoriesv.3HarperFestival2008
2014년 12월 신착자료목록Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8
951951CM0000032250CS 481.9911-ㅂ944ㄴ[중랑숲]어린이자료실나무야 궁금해북부지방산림청 지음<NA>생각쉼표2014
952952CM0000032251BS 813.8-ㅇ775ㅇ[중랑숲]유아자료실이 세상에서 가장 소중한 보물찾기이수진 글 ; 조용호 그림<NA>휴먼컬처아리랑2014
953953CM0000032252BS 813.8-ㅈ658ㅁ[중랑숲]유아자료실무섭지 않아요!조용호 글ㆍ그림<NA>휴먼컬처아리랑2014
954954CM0000032253CS 811.8-ㅅ946ㅎ[중랑숲]어린이자료실황금똥 : 신현창 동시화집신현창 지음<NA>JMG2014
955955CM0000032254CS 823.5-ㅂ236ㅇ[중랑숲]어린이자료실(이야기로 배우는)인성교과서, 삼국지 편박동석 글 ; 정지혜 그림<NA>M&Kids2014
956956CM0000032255CS 813.8-ㅂ446ㄴ[중랑숲]어린이자료실내 친구 토토는 경찰이예요박인경 지음 ; 봄 그림<NA>M&Kids2014
957957CM0000032256CS 029.85-ㅇ732ㄴ[중랑숲]어린이자료실나만의 독서록 비법 알려 줄까?이미영 글 ; 김화빈 그림<NA>M&Kids2014
958958CM0000032257CS 911.03-ㅇ965ㅅ[중랑숲]어린이자료실(어린이와 청소년을 위한) 삼국유사일연 원저 ; 강숙인 편저<NA>보물창고2014
959959CM0000032258CS 813.8-ㅅ894ㅎ[중랑숲]어린이자료실히말라야 청소부신자은 지음 ; 김상인 그림<NA>학고재2014
960<NA><NA><NA><NA><NA><NA><NA><NA><NA>