Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells3691
Missing cells (%)5.3%
Duplicate rows48
Duplicate rows (%)0.5%
Total size in memory625.0 KiB
Average record size in memory64.0 B

Variable types

Categorical1
Text5
Numeric1

Dataset

Description경기도 안양시 도서관 폐기자료(관리구분, 제목, 저자명, 출판사, 출판년도, 국제표준도서번호, 청구기호) 정보를 제공합니다.
Author경기도 안양시
URLhttps://www.data.go.kr/data/15069877/fileData.do

Alerts

Dataset has 48 (0.5%) duplicate rowsDuplicates
출판년도 has 194 (1.9%) missing valuesMissing
국제표준도서번호 has 3423 (34.2%) missing valuesMissing
출판년도 is highly skewed (γ1 = 66.75050372)Skewed

Reproduction

Analysis started2023-12-23 06:34:10.285705
Analysis finished2023-12-23 06:34:22.808557
Duration12.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

관리구분
Categorical

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
안양시평촌도서관
2776 
안양시만안도서관
2187 
안양시석수도서관
1940 
안양시호계도서관
1549 
안양어린이도서관
605 
Other values (4)
943 

Length

Max length9
Median length8
Mean length8.0304
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안양시박달도서관
2nd row안양시평촌도서관
3rd row안양시만안도서관
4th row안양시립비산도서관
5th row안양어린이도서관

Common Values

ValueCountFrequency (%)
안양시평촌도서관 2776
27.8%
안양시만안도서관 2187
21.9%
안양시석수도서관 1940
19.4%
안양시호계도서관 1549
15.5%
안양어린이도서관 605
 
6.0%
안양시박달도서관 533
 
5.3%
안양시립비산도서관 310
 
3.1%
안양시벌말도서관 94
 
0.9%
안양삼덕도서관 6
 
0.1%

Length

2023-12-23T06:34:23.214494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-23T06:34:23.950768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
안양시평촌도서관 2776
27.8%
안양시만안도서관 2187
21.9%
안양시석수도서관 1940
19.4%
안양시호계도서관 1549
15.5%
안양어린이도서관 605
 
6.0%
안양시박달도서관 533
 
5.3%
안양시립비산도서관 310
 
3.1%
안양시벌말도서관 94
 
0.9%
안양삼덕도서관 6
 
0.1%

제목
Text

Distinct9163
Distinct (%)91.6%
Missing2
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-23T06:34:25.747063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length192
Median length128
Mean length13.815163
Min length1

Characters and Unicode

Total characters138124
Distinct characters2135
Distinct categories18 ?
Distinct scripts6 ?
Distinct blocks15 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8556 ?
Unique (%)85.6%

Sample

1st row마법 새 찌루
2nd row(그건 네 잘못이 아니라...)네 성격 탓이야
3rd row쥐 이야기
4th row카오스
5th row사람놀이
ValueCountFrequency (%)
1 336
 
1.0%
2 274
 
0.8%
232
 
0.7%
이야기 226
 
0.7%
109
 
0.3%
3 108
 
0.3%
the 92
 
0.3%
of 90
 
0.3%
위한 78
 
0.2%
우리 75
 
0.2%
Other values (17438) 31152
95.1%
2023-12-23T06:34:28.711850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23368
 
16.9%
2645
 
1.9%
2470
 
1.8%
1577
 
1.1%
1458
 
1.1%
1450
 
1.0%
: 1436
 
1.0%
) 1317
 
1.0%
( 1317
 
1.0%
1211
 
0.9%
Other values (2125) 99875
72.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 90056
65.2%
Space Separator 23368
 
16.9%
Lowercase Letter 10158
 
7.4%
Decimal Number 4127
 
3.0%
Other Punctuation 4016
 
2.9%
Uppercase Letter 2980
 
2.2%
Close Punctuation 1375
 
1.0%
Open Punctuation 1375
 
1.0%
Math Symbol 390
 
0.3%
Dash Punctuation 228
 
0.2%
Other values (8) 51
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2645
 
2.9%
2470
 
2.7%
1577
 
1.8%
1458
 
1.6%
1450
 
1.6%
1211
 
1.3%
1150
 
1.3%
1087
 
1.2%
1050
 
1.2%
1038
 
1.2%
Other values (2010) 74920
83.2%
Lowercase Letter
ValueCountFrequency (%)
e 1180
11.6%
o 934
 
9.2%
i 846
 
8.3%
n 793
 
7.8%
a 788
 
7.8%
t 779
 
7.7%
r 684
 
6.7%
s 606
 
6.0%
l 506
 
5.0%
h 435
 
4.3%
Other values (16) 2607
25.7%
Uppercase Letter
ValueCountFrequency (%)
S 264
 
8.9%
A 253
 
8.5%
E 237
 
8.0%
C 232
 
7.8%
T 227
 
7.6%
I 162
 
5.4%
O 161
 
5.4%
B 141
 
4.7%
M 139
 
4.7%
P 134
 
4.5%
Other values (16) 1030
34.6%
Other Punctuation
ValueCountFrequency (%)
: 1436
35.8%
. 1164
29.0%
, 667
16.6%
! 249
 
6.2%
· 191
 
4.8%
/ 173
 
4.3%
' 63
 
1.6%
& 17
 
0.4%
; 17
 
0.4%
15
 
0.4%
Other values (8) 24
 
0.6%
Decimal Number
ValueCountFrequency (%)
1 1107
26.8%
2 734
17.8%
0 583
14.1%
3 416
 
10.1%
5 293
 
7.1%
9 269
 
6.5%
4 226
 
5.5%
6 184
 
4.5%
8 158
 
3.8%
7 157
 
3.8%
Math Symbol
ValueCountFrequency (%)
= 312
80.0%
+ 30
 
7.7%
~ 21
 
5.4%
10
 
2.6%
> 7
 
1.8%
< 7
 
1.8%
3
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 1317
95.8%
] 55
 
4.0%
1
 
0.1%
1
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1317
95.8%
[ 55
 
4.0%
1
 
0.1%
1
 
0.1%
1
 
0.1%
Other Number
ValueCountFrequency (%)
7
63.6%
2
 
18.2%
1
 
9.1%
½ 1
 
9.1%
Other Symbol
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Letter Number
ValueCountFrequency (%)
14
51.9%
11
40.7%
2
 
7.4%
Space Separator
ValueCountFrequency (%)
23368
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 228
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 3
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 86789
62.8%
Common 34903
25.3%
Latin 13165
 
9.5%
Han 3244
 
2.3%
Hiragana 19
 
< 0.1%
Katakana 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2645
 
3.0%
2470
 
2.8%
1577
 
1.8%
1458
 
1.7%
1450
 
1.7%
1211
 
1.4%
1150
 
1.3%
1087
 
1.3%
1050
 
1.2%
1038
 
1.2%
Other values (1202) 71653
82.6%
Han
ValueCountFrequency (%)
80
 
2.5%
76
 
2.3%
57
 
1.8%
47
 
1.4%
41
 
1.3%
38
 
1.2%
36
 
1.1%
36
 
1.1%
36
 
1.1%
34
 
1.0%
Other values (782) 2763
85.2%
Common
ValueCountFrequency (%)
23368
67.0%
: 1436
 
4.1%
) 1317
 
3.8%
( 1317
 
3.8%
. 1164
 
3.3%
1 1107
 
3.2%
2 734
 
2.1%
, 667
 
1.9%
0 583
 
1.7%
3 416
 
1.2%
Other values (50) 2794
 
8.0%
Latin
ValueCountFrequency (%)
e 1180
 
9.0%
o 934
 
7.1%
i 846
 
6.4%
n 793
 
6.0%
a 788
 
6.0%
t 779
 
5.9%
r 684
 
5.2%
s 606
 
4.6%
l 506
 
3.8%
h 435
 
3.3%
Other values (45) 5614
42.6%
Hiragana
ValueCountFrequency (%)
5
26.3%
2
 
10.5%
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (3) 3
15.8%
Katakana
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 86782
62.8%
ASCII 47788
34.6%
CJK 3165
 
2.3%
None 224
 
0.2%
CJK Compat Ideographs 79
 
0.1%
Number Forms 27
 
< 0.1%
Hiragana 19
 
< 0.1%
Enclosed Alphanum 11
 
< 0.1%
Math Operators 10
 
< 0.1%
Compat Jamo 7
 
< 0.1%
Other values (5) 12
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23368
48.9%
: 1436
 
3.0%
) 1317
 
2.8%
( 1317
 
2.8%
e 1180
 
2.5%
. 1164
 
2.4%
1 1107
 
2.3%
o 934
 
2.0%
i 846
 
1.8%
n 793
 
1.7%
Other values (78) 14326
30.0%
Hangul
ValueCountFrequency (%)
2645
 
3.0%
2470
 
2.8%
1577
 
1.8%
1458
 
1.7%
1450
 
1.7%
1211
 
1.4%
1150
 
1.3%
1087
 
1.3%
1050
 
1.2%
1038
 
1.2%
Other values (1198) 71646
82.6%
None
ValueCountFrequency (%)
· 191
85.3%
15
 
6.7%
4
 
1.8%
3
 
1.3%
3
 
1.3%
1
 
0.4%
1
 
0.4%
1
 
0.4%
1
 
0.4%
½ 1
 
0.4%
Other values (3) 3
 
1.3%
CJK
ValueCountFrequency (%)
80
 
2.5%
76
 
2.4%
57
 
1.8%
47
 
1.5%
41
 
1.3%
38
 
1.2%
36
 
1.1%
36
 
1.1%
36
 
1.1%
34
 
1.1%
Other values (755) 2684
84.8%
Number Forms
ValueCountFrequency (%)
14
51.9%
11
40.7%
2
 
7.4%
CJK Compat Ideographs
ValueCountFrequency (%)
11
13.9%
10
12.7%
8
 
10.1%
6
 
7.6%
4
 
5.1%
4
 
5.1%
4
 
5.1%
3
 
3.8%
3
 
3.8%
2
 
2.5%
Other values (17) 24
30.4%
Math Operators
ValueCountFrequency (%)
10
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
7
63.6%
2
 
18.2%
1
 
9.1%
1
 
9.1%
Hiragana
ValueCountFrequency (%)
5
26.3%
2
 
10.5%
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Other values (3) 3
15.8%
Compat Jamo
ValueCountFrequency (%)
4
57.1%
1
 
14.3%
1
 
14.3%
1
 
14.3%
Katakana
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Punctuation
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%
Geometric Shapes
ValueCountFrequency (%)
1
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct8726
Distinct (%)87.9%
Missing68
Missing (%)0.7%
Memory size156.2 KiB
2023-12-23T06:34:30.382635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length126
Median length79
Mean length12.939489
Min length2

Characters and Unicode

Total characters128515
Distinct characters1555
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7881 ?
Unique (%)79.3%

Sample

1st row김학선 글;, 원혜영 그림
2nd row에이브러햄 J. 트워스키 지음;, 찰스 M. 슐츠 그림;, 최한림 옮김
3rd row정 위엔지에 지음;, 심봉희 옮김;, 이형진 그림
4th row제임스 글리크 지음;, 박배식;, 성하운 공역
5th row키무라 유이치 글;, 초 신타 그림;, 한수연 옮김
ValueCountFrequency (%)
지음 3642
 
10.9%
옮김 1846
 
5.5%
그림 1695
 
5.1%
1321
 
3.9%
1002
 
3.0%
530
 
1.6%
엮음 504
 
1.5%
편집부 278
 
0.8%
242
 
0.7%
글·그림 240
 
0.7%
Other values (12409) 22215
66.3%
2023-12-23T06:34:32.314002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23694
 
18.4%
, 5627
 
4.4%
; 5017
 
3.9%
4367
 
3.4%
4259
 
3.3%
3802
 
3.0%
2480
 
1.9%
2228
 
1.7%
2179
 
1.7%
1906
 
1.5%
Other values (1545) 72956
56.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 83031
64.6%
Space Separator 23694
 
18.4%
Other Punctuation 11847
 
9.2%
Lowercase Letter 4999
 
3.9%
Uppercase Letter 1635
 
1.3%
Close Punctuation 1608
 
1.3%
Open Punctuation 1608
 
1.3%
Dash Punctuation 40
 
< 0.1%
Decimal Number 40
 
< 0.1%
Math Symbol 8
 
< 0.1%
Other values (2) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4367
 
5.3%
4259
 
5.1%
3802
 
4.6%
2480
 
3.0%
2228
 
2.7%
2179
 
2.6%
1906
 
2.3%
1810
 
2.2%
1529
 
1.8%
1226
 
1.5%
Other values (1463) 57245
68.9%
Lowercase Letter
ValueCountFrequency (%)
e 516
10.3%
a 487
 
9.7%
r 432
 
8.6%
i 393
 
7.9%
n 373
 
7.5%
t 350
 
7.0%
l 336
 
6.7%
o 305
 
6.1%
s 283
 
5.7%
y 257
 
5.1%
Other values (16) 1267
25.3%
Uppercase Letter
ValueCountFrequency (%)
J 135
 
8.3%
S 126
 
7.7%
B 121
 
7.4%
A 106
 
6.5%
K 103
 
6.3%
H 100
 
6.1%
M 95
 
5.8%
C 92
 
5.6%
R 90
 
5.5%
E 83
 
5.1%
Other values (15) 584
35.7%
Other Punctuation
ValueCountFrequency (%)
, 5627
47.5%
; 5017
42.3%
. 808
 
6.8%
· 326
 
2.8%
: 38
 
0.3%
/ 9
 
0.1%
7
 
0.1%
' 6
 
0.1%
& 6
 
0.1%
3
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 14
35.0%
3 8
20.0%
1 8
20.0%
0 4
 
10.0%
4 2
 
5.0%
5 1
 
2.5%
6 1
 
2.5%
8 1
 
2.5%
7 1
 
2.5%
Close Punctuation
ValueCountFrequency (%)
] 1586
98.6%
) 19
 
1.2%
3
 
0.2%
Open Punctuation
ValueCountFrequency (%)
[ 1586
98.6%
( 19
 
1.2%
3
 
0.2%
Math Symbol
ValueCountFrequency (%)
> 4
50.0%
< 4
50.0%
Space Separator
ValueCountFrequency (%)
23694
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%
Control
ValueCountFrequency (%)
4
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 80314
62.5%
Common 38850
30.2%
Latin 6634
 
5.2%
Han 2709
 
2.1%
Katakana 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4367
 
5.4%
4259
 
5.3%
3802
 
4.7%
2480
 
3.1%
2228
 
2.8%
2179
 
2.7%
1906
 
2.4%
1810
 
2.3%
1529
 
1.9%
1226
 
1.5%
Other values (853) 54528
67.9%
Han
ValueCountFrequency (%)
314
 
11.6%
105
 
3.9%
94
 
3.5%
75
 
2.8%
75
 
2.8%
51
 
1.9%
34
 
1.3%
33
 
1.2%
30
 
1.1%
24
 
0.9%
Other values (592) 1874
69.2%
Latin
ValueCountFrequency (%)
e 516
 
7.8%
a 487
 
7.3%
r 432
 
6.5%
i 393
 
5.9%
n 373
 
5.6%
t 350
 
5.3%
l 336
 
5.1%
o 305
 
4.6%
s 283
 
4.3%
y 257
 
3.9%
Other values (41) 2902
43.7%
Common
ValueCountFrequency (%)
23694
61.0%
, 5627
 
14.5%
; 5017
 
12.9%
] 1586
 
4.1%
[ 1586
 
4.1%
. 808
 
2.1%
· 326
 
0.8%
- 40
 
0.1%
: 38
 
0.1%
) 19
 
< 0.1%
Other values (21) 109
 
0.3%
Katakana
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 80310
62.5%
ASCII 45142
35.1%
CJK 2575
 
2.0%
None 342
 
0.3%
CJK Compat Ideographs 134
 
0.1%
Katakana 8
 
< 0.1%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23694
52.5%
, 5627
 
12.5%
; 5017
 
11.1%
] 1586
 
3.5%
[ 1586
 
3.5%
. 808
 
1.8%
e 516
 
1.1%
a 487
 
1.1%
r 432
 
1.0%
i 393
 
0.9%
Other values (67) 4996
 
11.1%
Hangul
ValueCountFrequency (%)
4367
 
5.4%
4259
 
5.3%
3802
 
4.7%
2480
 
3.1%
2228
 
2.8%
2179
 
2.7%
1906
 
2.4%
1810
 
2.3%
1529
 
1.9%
1226
 
1.5%
Other values (852) 54524
67.9%
None
ValueCountFrequency (%)
· 326
95.3%
7
 
2.0%
3
 
0.9%
3
 
0.9%
3
 
0.9%
CJK
ValueCountFrequency (%)
314
 
12.2%
105
 
4.1%
94
 
3.7%
75
 
2.9%
51
 
2.0%
34
 
1.3%
33
 
1.3%
30
 
1.2%
24
 
0.9%
23
 
0.9%
Other values (566) 1792
69.6%
CJK Compat Ideographs
ValueCountFrequency (%)
75
56.0%
9
 
6.7%
6
 
4.5%
6
 
4.5%
5
 
3.7%
4
 
3.0%
3
 
2.2%
3
 
2.2%
2
 
1.5%
2
 
1.5%
Other values (16) 19
 
14.2%
Compat Jamo
ValueCountFrequency (%)
4
100.0%
Katakana
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Distinct3390
Distinct (%)33.9%
Missing4
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-23T06:34:33.114679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length32
Mean length4.3883553
Min length1

Characters and Unicode

Total characters43866
Distinct characters971
Distinct categories10 ?
Distinct scripts5 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2022 ?
Unique (%)20.2%

Sample

1st row대교출판
2nd row미래사
3rd row비룡소
4th row동문사
5th row시공사
ValueCountFrequency (%)
계몽사 98
 
0.9%
비룡소 90
 
0.9%
우암 90
 
0.9%
예림당 83
 
0.8%
고려원 80
 
0.8%
삼성출판사 77
 
0.7%
김영사 77
 
0.7%
금성출판사 69
 
0.7%
지경사 65
 
0.6%
민음사 62
 
0.6%
Other values (3483) 9628
92.4%
2023-12-23T06:34:34.701737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2945
 
6.7%
1152
 
2.6%
1150
 
2.6%
1054
 
2.4%
847
 
1.9%
669
 
1.5%
650
 
1.5%
644
 
1.5%
618
 
1.4%
565
 
1.3%
Other values (961) 33572
76.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40224
91.7%
Lowercase Letter 1977
 
4.5%
Uppercase Letter 837
 
1.9%
Space Separator 446
 
1.0%
Other Punctuation 214
 
0.5%
Decimal Number 81
 
0.2%
Open Punctuation 37
 
0.1%
Close Punctuation 37
 
0.1%
Dash Punctuation 11
 
< 0.1%
Modifier Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2945
 
7.3%
1152
 
2.9%
1150
 
2.9%
1054
 
2.6%
847
 
2.1%
669
 
1.7%
650
 
1.6%
644
 
1.6%
618
 
1.5%
565
 
1.4%
Other values (883) 29930
74.4%
Lowercase Letter
ValueCountFrequency (%)
o 239
12.1%
s 187
 
9.5%
e 185
 
9.4%
a 156
 
7.9%
i 155
 
7.8%
n 146
 
7.4%
r 137
 
6.9%
t 95
 
4.8%
l 90
 
4.6%
u 71
 
3.6%
Other values (15) 516
26.1%
Uppercase Letter
ValueCountFrequency (%)
B 121
14.5%
M 88
 
10.5%
S 77
 
9.2%
P 63
 
7.5%
C 60
 
7.2%
E 43
 
5.1%
O 41
 
4.9%
I 37
 
4.4%
R 36
 
4.3%
N 30
 
3.6%
Other values (15) 241
28.8%
Other Punctuation
ValueCountFrequency (%)
: 84
39.3%
. 32
 
15.0%
31
 
14.5%
& 30
 
14.0%
· 15
 
7.0%
, 8
 
3.7%
' 6
 
2.8%
; 3
 
1.4%
" 2
 
0.9%
! 1
 
0.5%
Other values (2) 2
 
0.9%
Decimal Number
ValueCountFrequency (%)
1 33
40.7%
2 31
38.3%
9 8
 
9.9%
0 2
 
2.5%
8 2
 
2.5%
7 2
 
2.5%
4 1
 
1.2%
3 1
 
1.2%
6 1
 
1.2%
Open Punctuation
ValueCountFrequency (%)
( 34
91.9%
[ 3
 
8.1%
Close Punctuation
ValueCountFrequency (%)
) 34
91.9%
] 3
 
8.1%
Space Separator
ValueCountFrequency (%)
446
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 38179
87.0%
Latin 2814
 
6.4%
Han 2042
 
4.7%
Common 828
 
1.9%
Katakana 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2945
 
7.7%
1152
 
3.0%
1150
 
3.0%
1054
 
2.8%
847
 
2.2%
669
 
1.8%
650
 
1.7%
644
 
1.7%
618
 
1.6%
565
 
1.5%
Other values (605) 27885
73.0%
Han
ValueCountFrequency (%)
274
 
13.4%
135
 
6.6%
88
 
4.3%
72
 
3.5%
63
 
3.1%
63
 
3.1%
40
 
2.0%
40
 
2.0%
40
 
2.0%
37
 
1.8%
Other values (265) 1190
58.3%
Latin
ValueCountFrequency (%)
o 239
 
8.5%
s 187
 
6.6%
e 185
 
6.6%
a 156
 
5.5%
i 155
 
5.5%
n 146
 
5.2%
r 137
 
4.9%
B 121
 
4.3%
t 95
 
3.4%
l 90
 
3.2%
Other values (40) 1303
46.3%
Common
ValueCountFrequency (%)
446
53.9%
: 84
 
10.1%
( 34
 
4.1%
) 34
 
4.1%
1 33
 
4.0%
. 32
 
3.9%
2 31
 
3.7%
31
 
3.7%
& 30
 
3.6%
· 15
 
1.8%
Other values (18) 58
 
7.0%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 38179
87.0%
ASCII 3596
 
8.2%
CJK 2030
 
4.6%
None 46
 
0.1%
CJK Compat Ideographs 12
 
< 0.1%
Katakana 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2945
 
7.7%
1152
 
3.0%
1150
 
3.0%
1054
 
2.8%
847
 
2.2%
669
 
1.8%
650
 
1.7%
644
 
1.7%
618
 
1.6%
565
 
1.5%
Other values (605) 27885
73.0%
ASCII
ValueCountFrequency (%)
446
 
12.4%
o 239
 
6.6%
s 187
 
5.2%
e 185
 
5.1%
a 156
 
4.3%
i 155
 
4.3%
n 146
 
4.1%
r 137
 
3.8%
B 121
 
3.4%
t 95
 
2.6%
Other values (66) 1729
48.1%
CJK
ValueCountFrequency (%)
274
 
13.5%
135
 
6.7%
88
 
4.3%
72
 
3.5%
63
 
3.1%
63
 
3.1%
40
 
2.0%
40
 
2.0%
40
 
2.0%
37
 
1.8%
Other values (259) 1178
58.0%
None
ValueCountFrequency (%)
31
67.4%
· 15
32.6%
CJK Compat Ideographs
ValueCountFrequency (%)
6
50.0%
2
 
16.7%
1
 
8.3%
1
 
8.3%
1
 
8.3%
1
 
8.3%
Katakana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

출판년도
Real number (ℝ)

MISSING  SKEWED 

Distinct59
Distinct (%)0.6%
Missing194
Missing (%)1.9%
Infinite0
Infinite (%)0.0%
Mean1999.3038
Minimum1952
Maximum5000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-23T06:34:35.157987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1952
5-th percentile1985
Q11994
median1999
Q32004
95-th percentile2010
Maximum5000
Range3048
Interquartile range (IQR)10

Descriptive statistics

Standard deviation43.546495
Coefficient of variation (CV)0.021780829
Kurtosis4598.7574
Mean1999.3038
Median Absolute Deviation (MAD)5
Skewness66.750504
Sum19605173
Variance1896.2972
MonotonicityNot monotonic
2023-12-23T06:34:35.997996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2003 645
 
6.5%
1994 640
 
6.4%
2004 571
 
5.7%
2002 532
 
5.3%
1993 531
 
5.3%
1997 523
 
5.2%
1995 510
 
5.1%
1996 460
 
4.6%
1999 411
 
4.1%
1992 368
 
3.7%
Other values (49) 4615
46.2%
ValueCountFrequency (%)
1952 1
 
< 0.1%
1958 2
 
< 0.1%
1960 1
 
< 0.1%
1963 1
 
< 0.1%
1966 7
0.1%
1968 1
 
< 0.1%
1969 2
 
< 0.1%
1970 5
0.1%
1971 2
 
< 0.1%
1972 4
< 0.1%
ValueCountFrequency (%)
5000 2
 
< 0.1%
2020 1
 
< 0.1%
2019 4
 
< 0.1%
2018 10
 
0.1%
2017 14
 
0.1%
2016 10
 
0.1%
2015 38
 
0.4%
2014 50
 
0.5%
2013 104
1.0%
2012 127
1.3%
Distinct4960
Distinct (%)75.4%
Missing3423
Missing (%)34.2%
Memory size156.2 KiB
2023-12-23T06:34:37.637221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length10
Mean length10.624449
Min length8

Characters and Unicode

Total characters69877
Distinct characters46
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4628 ?
Unique (%)70.4%

Sample

1st row8939515587
2nd row8970875573
3rd row8949130297 74820:
4th row8970610065
5th row8952746414(172)
ValueCountFrequency (%)
9.79e+12 1232
 
17.5%
03810 63
 
0.9%
73810 30
 
0.4%
9.78e+12 19
 
0.3%
93560 19
 
0.3%
04810 15
 
0.2%
03840 11
 
0.2%
77810 9
 
0.1%
74810 8
 
0.1%
03800 6
 
0.1%
Other values (5115) 5608
79.9%
2023-12-23T06:34:39.794799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 11538
16.5%
8 10095
14.4%
0 7010
10.0%
1 6400
9.2%
7 6061
8.7%
2 5445
7.8%
3 4762
6.8%
5 4314
 
6.2%
4 4160
 
6.0%
6 3341
 
4.8%
Other values (36) 6751
9.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 63126
90.3%
Uppercase Letter 1666
 
2.4%
Other Punctuation 1511
 
2.2%
Math Symbol 1260
 
1.8%
Open Punctuation 872
 
1.2%
Close Punctuation 872
 
1.2%
Space Separator 443
 
0.6%
Lowercase Letter 50
 
0.1%
Other Letter 39
 
0.1%
Dash Punctuation 36
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
10.3%
4
10.3%
4
10.3%
4
10.3%
3
 
7.7%
3
 
7.7%
3
 
7.7%
2
 
5.1%
2
 
5.1%
1
 
2.6%
Other values (9) 9
23.1%
Decimal Number
ValueCountFrequency (%)
9 11538
18.3%
8 10095
16.0%
0 7010
11.1%
1 6400
10.1%
7 6061
9.6%
2 5445
8.6%
3 4762
7.5%
5 4314
 
6.8%
4 4160
 
6.6%
6 3341
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
E 1260
75.6%
X 404
 
24.2%
A 2
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
x 48
96.0%
g 1
 
2.0%
v 1
 
2.0%
Other Punctuation
ValueCountFrequency (%)
. 1261
83.5%
: 250
 
16.5%
Open Punctuation
ValueCountFrequency (%)
( 871
99.9%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 871
99.9%
] 1
 
0.1%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Math Symbol
ValueCountFrequency (%)
+ 1260
100.0%
Space Separator
ValueCountFrequency (%)
443
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 36
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 68120
97.5%
Latin 1718
 
2.5%
Hangul 39
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
9 11538
16.9%
8 10095
14.8%
0 7010
10.3%
1 6400
9.4%
7 6061
8.9%
2 5445
8.0%
3 4762
7.0%
5 4314
 
6.3%
4 4160
 
6.1%
6 3341
 
4.9%
Other values (9) 4994
7.3%
Hangul
ValueCountFrequency (%)
4
10.3%
4
10.3%
4
10.3%
4
10.3%
3
 
7.7%
3
 
7.7%
3
 
7.7%
2
 
5.1%
2
 
5.1%
1
 
2.6%
Other values (9) 9
23.1%
Latin
ValueCountFrequency (%)
E 1260
73.3%
X 404
 
23.5%
x 48
 
2.8%
A 2
 
0.1%
g 1
 
0.1%
1
 
0.1%
v 1
 
0.1%
1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 69836
99.9%
Hangul 39
 
0.1%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 11538
16.5%
8 10095
14.5%
0 7010
10.0%
1 6400
9.2%
7 6061
8.7%
2 5445
7.8%
3 4762
6.8%
5 4314
 
6.2%
4 4160
 
6.0%
6 3341
 
4.8%
Other values (15) 6710
9.6%
Hangul
ValueCountFrequency (%)
4
10.3%
4
10.3%
4
10.3%
4
10.3%
3
 
7.7%
3
 
7.7%
3
 
7.7%
2
 
5.1%
2
 
5.1%
1
 
2.6%
Other values (9) 9
23.1%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct9880
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-23T06:34:41.458662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length20
Mean length10.2992
Min length2

Characters and Unicode

Total characters102992
Distinct characters718
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9764 ?
Unique (%)97.6%

Sample

1st row아808.9눈19589대65
2nd row182.12트66ㄴ
3rd row823.8정66ㅈ
4th row429글298ㅋ
5th row유808.9네44ㅅ1722
ValueCountFrequency (%)
j808주198ㄱ 4
 
< 0.1%
813.6 4
 
< 0.1%
004.76박52플 3
 
< 0.1%
j490.8시887ㅎ 3
 
< 0.1%
아863생8841ㅇ 2
 
< 0.1%
813.6김64ㄱ 2
 
< 0.1%
wj740.7잉1729웅 2
 
< 0.1%
325.04정76ㅅ 2
 
< 0.1%
818김52ㅂ 2
 
< 0.1%
유808.9네44ㅅ1172 2
 
< 0.1%
Other values (9875) 9982
99.7%
2023-12-23T06:34:43.655023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8 10750
 
10.4%
1 10068
 
9.8%
2 8324
 
8.1%
3 7786
 
7.6%
6 7151
 
6.9%
9 6837
 
6.6%
. 6655
 
6.5%
4 6407
 
6.2%
5 5880
 
5.7%
0 5724
 
5.6%
Other values (708) 27410
26.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 73718
71.6%
Other Letter 20911
 
20.3%
Other Punctuation 6701
 
6.5%
Uppercase Letter 1243
 
1.2%
Dash Punctuation 260
 
0.3%
Lowercase Letter 126
 
0.1%
Space Separator 10
 
< 0.1%
Close Punctuation 8
 
< 0.1%
Open Punctuation 8
 
< 0.1%
Math Symbol 5
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2120
 
10.1%
1196
 
5.7%
813
 
3.9%
784
 
3.7%
779
 
3.7%
716
 
3.4%
695
 
3.3%
695
 
3.3%
513
 
2.5%
424
 
2.0%
Other values (642) 12176
58.2%
Lowercase Letter
ValueCountFrequency (%)
o 18
14.3%
s 16
12.7%
h 11
 
8.7%
p 9
 
7.1%
j 8
 
6.3%
t 7
 
5.6%
a 7
 
5.6%
g 7
 
5.6%
f 6
 
4.8%
l 5
 
4.0%
Other values (13) 32
25.4%
Uppercase Letter
ValueCountFrequency (%)
J 682
54.9%
R 84
 
6.8%
S 67
 
5.4%
T 54
 
4.3%
M 47
 
3.8%
D 47
 
3.8%
O 46
 
3.7%
V 38
 
3.1%
P 37
 
3.0%
W 32
 
2.6%
Other values (12) 109
 
8.8%
Decimal Number
ValueCountFrequency (%)
8 10750
14.6%
1 10068
13.7%
2 8324
11.3%
3 7786
10.6%
6 7151
9.7%
9 6837
9.3%
4 6407
8.7%
5 5880
8.0%
0 5724
7.8%
7 4791
6.5%
Other Punctuation
ValueCountFrequency (%)
. 6655
99.3%
, 46
 
0.7%
Close Punctuation
ValueCountFrequency (%)
) 4
50.0%
] 4
50.0%
Open Punctuation
ValueCountFrequency (%)
( 4
50.0%
[ 4
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 260
100.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Format
ValueCountFrequency (%)
­ 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 80712
78.4%
Hangul 20911
 
20.3%
Latin 1369
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2120
 
10.1%
1196
 
5.7%
813
 
3.9%
784
 
3.7%
779
 
3.7%
716
 
3.4%
695
 
3.3%
695
 
3.3%
513
 
2.5%
424
 
2.0%
Other values (642) 12176
58.2%
Latin
ValueCountFrequency (%)
J 682
49.8%
R 84
 
6.1%
S 67
 
4.9%
T 54
 
3.9%
M 47
 
3.4%
D 47
 
3.4%
O 46
 
3.4%
V 38
 
2.8%
P 37
 
2.7%
W 32
 
2.3%
Other values (35) 235
 
17.2%
Common
ValueCountFrequency (%)
8 10750
13.3%
1 10068
12.5%
2 8324
10.3%
3 7786
9.6%
6 7151
8.9%
9 6837
8.5%
. 6655
8.2%
4 6407
7.9%
5 5880
7.3%
0 5724
7.1%
Other values (11) 5130
6.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 82080
79.7%
Hangul 14869
 
14.4%
Compat Jamo 6042
 
5.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 10750
13.1%
1 10068
12.3%
2 8324
10.1%
3 7786
9.5%
6 7151
8.7%
9 6837
8.3%
. 6655
8.1%
4 6407
7.8%
5 5880
7.2%
0 5724
7.0%
Other values (55) 6498
7.9%
Hangul
ValueCountFrequency (%)
2120
 
14.3%
813
 
5.5%
784
 
5.3%
695
 
4.7%
424
 
2.9%
293
 
2.0%
237
 
1.6%
221
 
1.5%
176
 
1.2%
147
 
1.0%
Other values (623) 8959
60.3%
Compat Jamo
ValueCountFrequency (%)
1196
19.8%
779
12.9%
716
11.9%
695
11.5%
513
8.5%
400
 
6.6%
343
 
5.7%
339
 
5.6%
276
 
4.6%
237
 
3.9%
Other values (9) 548
9.1%
None
ValueCountFrequency (%)
­ 1
100.0%

Interactions

2023-12-23T06:34:20.155524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-23T06:34:44.188586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리구분출판년도
관리구분1.0000.000
출판년도0.0001.000
2023-12-23T06:34:44.546318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출판년도관리구분
출판년도1.0000.337
관리구분0.3371.000

Missing values

2023-12-23T06:34:21.030015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-23T06:34:21.906195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-23T06:34:22.432342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

관리구분제목저자명출판사출판년도국제표준도서번호청구기호
58178안양시박달도서관마법 새 찌루김학선 글;, 원혜영 그림대교출판20028939515587아808.9눈19589대65
70885안양시평촌도서관(그건 네 잘못이 아니라...)네 성격 탓이야에이브러햄 J. 트워스키 지음;, 찰스 M. 슐츠 그림;, 최한림 옮김미래사20048970875573182.12트66ㄴ
68025안양시만안도서관쥐 이야기정 위엔지에 지음;, 심봉희 옮김;, 이형진 그림비룡소19968949130297 74820:823.8정66ㅈ
95871안양시립비산도서관카오스제임스 글리크 지음;, 박배식;, 성하운 공역동문사19938970610065429글298ㅋ
6644안양어린이도서관사람놀이키무라 유이치 글;, 초 신타 그림;, 한수연 옮김시공사20068952746414(172)유808.9네44ㅅ1722
60974안양시평촌도서관國史大辭典國史大辭典編集委員會 編아름출판사1991<NA>참913.003국52ㄱ1
47037안양시평촌도서관누가 내 치즈를 옮겼을까스펜서 존슨 저;, 이영진 번역진명20028980103034843.8존57ㄴ
10199안양시호계도서관電子·通信無線工學大辭典사전연구사 편집부 [편]한국사전연구사1996<NA>R560.3사7419ㅈ2
67565안양시박달도서관베토벤 심리 상담 보고서: 아이에게 부모는 무엇일까김태형 지음부키20089.79E+12186.3김883베
74331안양시평촌도서관정신지체인 직업적응프로그램 개발한국장애인고용촉진공단한국장애인고용1997<NA>338.3한16저
관리구분제목저자명출판사출판년도국제표준도서번호청구기호
64852안양시만안도서관각하! 이제 마쳤습니다: 靑巖 朴泰俊 글모음조용경 엮음寒松19958986320037 03300991.1박883ㅊ
20323안양어린이도서관울지 않는 개구리정순 글;, 차정인 그림교원20099.79E+12유808.9이63ㄱ13
38752안양시만안도서관안네의 일기안네 프랑크범우사1992<NA>686300
11072안양시석수도서관사랑있는 믿음이외에는 기억치 않겠노라현선영아멘 스코프1994<NA>234.8현53ㅅ
41700안양시평촌도서관웅진위인전기웅진출판 편집부 [편]웅진출판주식회사1987<NA>J990.8웅78ㅇ7
90812안양시평촌도서관(영한완역대본)스파이더맨데이비드 코프 시나리오애플리스외국어사20038995358416747코897ㅅ
29613안양시석수도서관단군신화이형구 글;, 홍성찬 그림보림20048943300808J380.8솔14보122
36562안양시석수도서관慶熙宮址: 경희궁지 종합정비계획서울시 종로구청 문화공보과 편서울시 종로구청 문화공보과20149.79E+12참911.65서6629경
87331안양시만안도서관별이 빛나는 밤에CBS 엮음대명사1982<NA>763130
57907안양시만안도서관친밀한 적아쉬스 나디 저신구문화사<NA><NA>482960

Duplicate rows

Most frequently occurring

관리구분제목저자명출판사출판년도국제표준도서번호청구기호# duplicates
34안양시석수도서관주니어 세계문학금성출판사 편집부 엮음금성출판사1993<NA>J808주198ㄱ4
30안양시석수도서관시튼·파브르 선집시튼 지음한국프뢰벨1992<NA>J490.8시887ㅎ3
35안양시석수도서관플래시 MX: 애니메이션 & 게임 & 뮤직비디오 만들기: for creative Web animation박상훈;, 조진호 공저영진닷컴20028931421729004.76박52플3
0안양시립비산도서관(유지현의)한자 없는 중국어유지현 지음웅진씽크빅20109.79E+12727.5유78ㅎ2
1안양시립비산도서관1승 9패 유니클로처럼김성호 지음위즈덤하우스20109.79E+12325.1김54ㅇ2
2안양시만안도서관(最新)經濟學의 構造: 수리분석유진 실버버그 原著;, 노응원;, 신봉호 共譯진영사1993<NA>320.1실44ㄱ2
3안양시만안도서관(돈이 솔솔 굴러 들어오는) 창업마케팅김광희 지음미래와경영19998987988236326.17김156ㄷ2
4안양시만안도서관(만화로 보는)세계선교 발달사김종두 글·그림생명의말씀사19948904152410235.4김756ㅅ2
5안양시만안도서관(콕콕 찍어주는)꼬꼬 생활영어: 첫걸음편. 1김완수 지음국제어학연구소20038985972960(1)747김65ㄲ2
6안양시만안도서관20세기 사람들한겨레신문 문화부 편한겨레신문사19958985505351 04900990.99한14ㅇ22