Overview

Dataset statistics

Number of variables: 8
Number of observations: 10000
Missing cells: 9999
Missing cells (%): 12.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 712.9 KiB
Average record size in memory: 73.0 B
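
The two derived figures in this block can be checked by hand. The sketch below recomputes them from the other values in the block; it assumes (this is not stated in the report) that the missing-cell percentage is taken over all cells, i.e. variables times observations, and that 1 KiB is 1024 bytes.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 8
n_observations = 10_000
missing_cells = 9_999
total_size_kib = 712.9

# Assumption: the percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both values agree with the report after rounding to one decimal place.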

Variable types

Numeric: 1
Text: 4
Categorical: 2
DateTime: 1

Dataset

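Description: New-arrivals listing for the Haeundae Humanities Library, Banyeo Library, Jaesong Children's Library, Small Humanities Library, and U 2-dong Children's Small Library in Busan Metropolitan City. Includes title, author, publisher, publication year, call number, and reading-room information.
Author: Haeundae-gu, Busan Metropolitan City
URL: https://www.data.go.kr/data/3075601/fileData.do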

Alerts

Unnamed: 7 has constant value "" (Constant)
번호 is highly overall correlated with 자료실 (High correlation)
자료실 is highly overall correlated with 번호 (High correlation)
발행년 is highly imbalanced (55.8%) (Imbalance)
Unnamed: 7 has 9999 (> 99.9%) missing values (Missing)
번호 has unique values (Unique)

Reproduction

Analysis started: 2024-03-14 17:09:36.894793
Analysis finished: 2024-03-14 17:09:41.020335
Duration: 4.13 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8029.7958
Minimum: 1
Maximum: 16040
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 819.95
Q1: 4017.5
Median: 8042.5
Q3: 12010.25
95-th percentile: 15254.05
Maximum: 16040
Range: 16039
Interquartile range (IQR): 7992.75

Descriptive statistics

Standard deviation: 4628.9475
Coefficient of variation (CV): 0.57647138
Kurtosis: -1.1943352
Mean: 8029.7958
Median Absolute Deviation (MAD): 3999.5
Skewness: 0.0022623935
Sum: 80297958
Variance: 21427155
Monotonicity: Not monotonic
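
Several of the 번호 statistics are simple functions of the others, which makes them easy to sanity-check. The sketch below uses only the rounded figures printed in this report, so small differences in the last digits are expected; the formulas are the standard textbook definitions, not taken from the profiler's source.

```python
# Figures copied from the 번호 statistics in this report.
n = 10_000
mean = 8029.7958
std = 4628.9475
minimum, maximum = 1, 16040
q1, q3 = 4017.5, 12010.25

value_range = maximum - minimum   # report: 16039
iqr = q3 - q1                     # report: 7992.75
cv = std / mean                   # report: 0.57647138
variance = std ** 2               # report: 21427155
total = mean * n                  # report: 80297958

print(value_range, iqr, round(cv, 6), round(variance), round(total))
```

The recomputed values match the reported ones to within the rounding of the inputs.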
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
12529 | 1 | < 0.1%
5100 | 1 | < 0.1%
8396 | 1 | < 0.1%
6638 | 1 | < 0.1%
1128 | 1 | < 0.1%
362 | 1 | < 0.1%
8086 | 1 | < 0.1%
11125 | 1 | < 0.1%
3547 | 1 | < 0.1%
10671 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%
12 | 1 | < 0.1%

Value | Count | Frequency (%)
16040 | 1 | < 0.1%
16039 | 1 | < 0.1%
16038 | 1 | < 0.1%
16037 | 1 | < 0.1%
16036 | 1 | < 0.1%
16034 | 1 | < 0.1%
16033 | 1 | < 0.1%
16031 | 1 | < 0.1%
16030 | 1 | < 0.1%
16029 | 1 | < 0.1%

서명
Text

Distinct: 8713
Distinct (%): 87.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 87
Median length: 63
Mean length: 16.0023
Min length: 1

Characters and Unicode

Total characters: 160023
Distinct characters: 1505
Distinct categories: 17
Distinct scripts: 7
Distinct blocks: 13
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
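
As a rough illustration of the category counts in this section, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, the three titles are taken from the sample rows of this variable, and the profiler's own implementation may differ. The two-letter codes map to the names used in the report: `Lo` is "Other Letter" and `Zs` is "Space Separator".

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in values:
        for ch in text:
            counts[unicodedata.category(ch)] += 1
    return counts

# Three titles from the sample rows of this variable.
titles = ["데이지 존스 앤 더 식스", "미래에는", "신화와 정신분석"]
counts = category_counts(titles)
print(counts)
```

The standard library exposes the general category but not the script or block of a character, so the script and block tables would need an additional data source.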

Unique

Unique: 7656
Unique (%): 76.6%

Sample

1st row: 데이지 존스 앤 더 식스
2nd row: 미래에는
3rd row: 신화와 정신분석
4th row: 알아차림에 대한 알아차림
5th row: 인간의 자리
ValueCountFrequency (%)
1232
 
2.9%
the 316
 
0.8%
1 287
 
0.7%
루카 268
 
0.6%
2 261
 
0.6%
이야기 252
 
0.6%
비디오녹화자료 204
 
0.5%
158
 
0.4%
3 152
 
0.4%
위한 148
 
0.4%
Other values (14961) 38504
92.2%
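
The word-frequency table above can be approximated with a small tokenizer. The sketch below is an assumption about how the words were produced, inferred from the table itself (entries such as "the" are lower-cased, and bracketed tokens appear without their brackets); the helper name `word_counts` is hypothetical and the profiler's actual rules may differ.

```python
import string
from collections import Counter

def word_counts(values):
    """Lower-case, split on whitespace, and strip ASCII punctuation from token ends."""
    counts = Counter()
    for text in values:
        for token in text.lower().split():
            token = token.strip(string.punctuation)
            if token:  # skip tokens that were punctuation only
                counts[token] += 1
    return counts

# Titles taken from the sample rows of this report.
titles = [
    "Groundhog day from the Black Lagoon",
    "[루카] The gingerbread man",
    "신화와 정신분석",
]
counts = word_counts(titles)
print(counts.most_common(3))
```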

Most occurring characters

ValueCountFrequency (%)
31784
 
19.9%
2968
 
1.9%
2751
 
1.7%
, 1922
 
1.2%
1894
 
1.2%
. 1848
 
1.2%
e 1806
 
1.1%
1403
 
0.9%
1393
 
0.9%
1389
 
0.9%
Other values (1495) 110865
69.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 98422
61.5%
Space Separator 31784
 
19.9%
Lowercase Letter 13915
 
8.7%
Other Punctuation 6097
 
3.8%
Decimal Number 3587
 
2.2%
Uppercase Letter 2504
 
1.6%
Close Punctuation 1705
 
1.1%
Open Punctuation 1704
 
1.1%
Dash Punctuation 216
 
0.1%
Math Symbol 65
 
< 0.1%
Other values (7) 24
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2968
 
3.0%
2751
 
2.8%
1894
 
1.9%
1403
 
1.4%
1393
 
1.4%
1389
 
1.4%
1355
 
1.4%
1264
 
1.3%
1232
 
1.3%
1203
 
1.2%
Other values (1352) 81570
82.9%
Lowercase Letter
ValueCountFrequency (%)
e 1806
13.0%
a 1150
 
8.3%
o 1142
 
8.2%
t 952
 
6.8%
n 934
 
6.7%
r 922
 
6.6%
s 910
 
6.5%
i 903
 
6.5%
h 717
 
5.2%
l 591
 
4.2%
Other values (42) 3888
27.9%
Uppercase Letter
ValueCountFrequency (%)
T 304
 
12.1%
A 263
 
10.5%
R 257
 
10.3%
S 164
 
6.5%
B 145
 
5.8%
G 132
 
5.3%
M 130
 
5.2%
P 127
 
5.1%
I 101
 
4.0%
L 100
 
4.0%
Other values (22) 781
31.2%
Other Punctuation
ValueCountFrequency (%)
, 1922
31.5%
. 1848
30.3%
: 1183
19.4%
! 864
14.2%
· 122
 
2.0%
' 98
 
1.6%
12
 
0.2%
& 12
 
0.2%
10
 
0.2%
" 8
 
0.1%
Other values (7) 18
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 901
25.1%
2 649
18.1%
0 446
12.4%
3 412
11.5%
4 320
 
8.9%
5 266
 
7.4%
9 162
 
4.5%
6 159
 
4.4%
7 137
 
3.8%
8 135
 
3.8%
Math Symbol
ValueCountFrequency (%)
~ 31
47.7%
= 22
33.8%
5
 
7.7%
+ 4
 
6.2%
× 2
 
3.1%
1
 
1.5%
Letter Number
ValueCountFrequency (%)
5
45.5%
2
 
18.2%
2
 
18.2%
1
 
9.1%
1
 
9.1%
Close Punctuation
ValueCountFrequency (%)
) 942
55.2%
] 761
44.6%
1
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 941
55.2%
[ 761
44.7%
1
 
0.1%
1
 
0.1%
Modifier Symbol
ValueCountFrequency (%)
` 3
60.0%
´ 1
 
20.0%
˙ 1
 
20.0%
Other Symbol
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Other Number
ValueCountFrequency (%)
² 1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
31784
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 216
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 98274
61.4%
Common 45171
28.2%
Latin 16277
 
10.2%
Cyrillic 153
 
0.1%
Han 92
 
0.1%
Hiragana 49
 
< 0.1%
Katakana 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2968
 
3.0%
2751
 
2.8%
1894
 
1.9%
1403
 
1.4%
1393
 
1.4%
1389
 
1.4%
1355
 
1.4%
1264
 
1.3%
1232
 
1.3%
1203
 
1.2%
Other values (1236) 81422
82.9%
Han
ValueCountFrequency (%)
3
 
3.3%
3
 
3.3%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (70) 70
76.1%
Latin
ValueCountFrequency (%)
e 1806
 
11.1%
a 1150
 
7.1%
o 1142
 
7.0%
t 952
 
5.8%
n 934
 
5.7%
r 922
 
5.7%
s 910
 
5.6%
i 903
 
5.5%
h 717
 
4.4%
l 591
 
3.6%
Other values (47) 6250
38.4%
Common
ValueCountFrequency (%)
31784
70.4%
, 1922
 
4.3%
. 1848
 
4.1%
: 1183
 
2.6%
) 942
 
2.1%
( 941
 
2.1%
1 901
 
2.0%
! 864
 
1.9%
] 761
 
1.7%
[ 761
 
1.7%
Other values (44) 3264
 
7.2%
Cyrillic
ValueCountFrequency (%)
а 17
 
11.1%
и 16
 
10.5%
н 14
 
9.2%
е 13
 
8.5%
к 10
 
6.5%
о 8
 
5.2%
р 8
 
5.2%
л 7
 
4.6%
ы 6
 
3.9%
с 6
 
3.9%
Other values (22) 48
31.4%
Hiragana
ValueCountFrequency (%)
3
 
6.1%
3
 
6.1%
3
 
6.1%
3
 
6.1%
3
 
6.1%
3
 
6.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
Other values (19) 23
46.9%
Katakana
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 98256
61.4%
ASCII 61263
38.3%
None 155
 
0.1%
Cyrillic 153
 
0.1%
CJK 92
 
0.1%
Hiragana 49
 
< 0.1%
Compat Jamo 18
 
< 0.1%
Punctuation 14
 
< 0.1%
Number Forms 11
 
< 0.1%
Katakana 7
 
< 0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
31784
51.9%
, 1922
 
3.1%
. 1848
 
3.0%
e 1806
 
2.9%
: 1183
 
1.9%
a 1150
 
1.9%
o 1142
 
1.9%
t 952
 
1.6%
) 942
 
1.5%
( 941
 
1.5%
Other values (74) 17593
28.7%
Hangul
ValueCountFrequency (%)
2968
 
3.0%
2751
 
2.8%
1894
 
1.9%
1403
 
1.4%
1393
 
1.4%
1389
 
1.4%
1355
 
1.4%
1264
 
1.3%
1232
 
1.3%
1203
 
1.2%
Other values (1225) 81404
82.8%
None
ValueCountFrequency (%)
· 122
78.7%
10
 
6.5%
5
 
3.2%
5
 
3.2%
3
 
1.9%
× 2
 
1.3%
´ 1
 
0.6%
² 1
 
0.6%
1
 
0.6%
1
 
0.6%
Other values (4) 4
 
2.6%
Cyrillic
ValueCountFrequency (%)
а 17
 
11.1%
и 16
 
10.5%
н 14
 
9.2%
е 13
 
8.5%
к 10
 
6.5%
о 8
 
5.2%
р 8
 
5.2%
л 7
 
4.6%
ы 6
 
3.9%
с 6
 
3.9%
Other values (22) 48
31.4%
Punctuation
ValueCountFrequency (%)
12
85.7%
1
 
7.1%
1
 
7.1%
Number Forms
ValueCountFrequency (%)
5
45.5%
2
 
18.2%
2
 
18.2%
1
 
9.1%
1
 
9.1%
Hiragana
ValueCountFrequency (%)
3
 
6.1%
3
 
6.1%
3
 
6.1%
3
 
6.1%
3
 
6.1%
3
 
6.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
Other values (19) 23
46.9%
CJK
ValueCountFrequency (%)
3
 
3.3%
3
 
3.3%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (70) 70
76.1%
Compat Jamo
ValueCountFrequency (%)
3
16.7%
3
16.7%
3
16.7%
2
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Enclosed Alphanum
ValueCountFrequency (%)
1
50.0%
1
50.0%
Misc Symbols
ValueCountFrequency (%)
1
50.0%
1
50.0%
Modifier Letters
ValueCountFrequency (%)
˙ 1
100.0%
Katakana
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

저자
Text

Distinct: 7043
Distinct (%): 70.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 91
Median length: 77
Mean length: 16.9265
Min length: 3

Characters and Unicode

Total characters: 169265
Distinct characters: 1115
Distinct categories: 10
Distinct scripts: 7
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5635
Unique (%): 56.4%

Sample

1st row: 테일러 젠킨스 리드 지음 ; 최세희 옮김
2nd row: 허아성 글·그림
3rd row: 이창재 지음
4th row: 루퍼트 스파이라 지음 ; 김주환 옮김
5th row: 박한선 지음
ValueCountFrequency (%)
7658
 
15.2%
지음 4763
 
9.5%
그림 3692
 
7.3%
3060
 
6.1%
옮김 2626
 
5.2%
by 1012
 
2.0%
글·그림 542
 
1.1%
원작 364
 
0.7%
illustrated 274
 
0.5%
186
 
0.4%
Other values (10318) 26174
52.0%

Most occurring characters

ValueCountFrequency (%)
40370
23.9%
; 7657
 
4.5%
5923
 
3.5%
5072
 
3.0%
4979
 
2.9%
4482
 
2.6%
4414
 
2.6%
3827
 
2.3%
3022
 
1.8%
2670
 
1.6%
Other values (1105) 86849
51.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 97027
57.3%
Space Separator 40370
23.9%
Lowercase Letter 17578
 
10.4%
Other Punctuation 9449
 
5.6%
Uppercase Letter 3050
 
1.8%
Open Punctuation 846
 
0.5%
Close Punctuation 844
 
0.5%
Dash Punctuation 51
 
< 0.1%
Decimal Number 33
 
< 0.1%
Math Symbol 17
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5923
 
6.1%
5072
 
5.2%
4979
 
5.1%
4482
 
4.6%
4414
 
4.5%
3827
 
3.9%
3022
 
3.1%
2670
 
2.8%
1712
 
1.8%
1492
 
1.5%
Other values (989) 59434
61.3%
Lowercase Letter
ValueCountFrequency (%)
a 1679
 
9.6%
e 1673
 
9.5%
y 1404
 
8.0%
r 1370
 
7.8%
t 1352
 
7.7%
i 1277
 
7.3%
n 1236
 
7.0%
b 1145
 
6.5%
l 1121
 
6.4%
s 946
 
5.4%
Other values (40) 4375
24.9%
Uppercase Letter
ValueCountFrequency (%)
S 308
 
10.1%
J 243
 
8.0%
B 223
 
7.3%
T 211
 
6.9%
M 203
 
6.7%
P 196
 
6.4%
D 175
 
5.7%
C 162
 
5.3%
R 158
 
5.2%
E 148
 
4.9%
Other values (28) 1023
33.5%
Other Punctuation
ValueCountFrequency (%)
; 7657
81.0%
, 867
 
9.2%
· 633
 
6.7%
. 223
 
2.4%
& 31
 
0.3%
: 29
 
0.3%
' 6
 
0.1%
/ 3
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
2 9
27.3%
3 6
18.2%
1 5
15.2%
4 4
12.1%
9 4
12.1%
7 3
 
9.1%
0 2
 
6.1%
Open Punctuation
ValueCountFrequency (%)
[ 822
97.2%
( 20
 
2.4%
3
 
0.4%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
] 820
97.2%
) 20
 
2.4%
3
 
0.4%
1
 
0.1%
Math Symbol
ValueCountFrequency (%)
> 8
47.1%
< 8
47.1%
× 1
 
5.9%
Space Separator
ValueCountFrequency (%)
40370
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 51
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 96853
57.2%
Common 51610
30.5%
Latin 20388
 
12.0%
Cyrillic 240
 
0.1%
Han 93
 
0.1%
Katakana 43
 
< 0.1%
Hiragana 38
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5923
 
6.1%
5072
 
5.2%
4979
 
5.1%
4482
 
4.6%
4414
 
4.6%
3827
 
4.0%
3022
 
3.1%
2670
 
2.8%
1712
 
1.8%
1492
 
1.5%
Other values (871) 59260
61.2%
Han
ValueCountFrequency (%)
6
 
6.5%
6
 
6.5%
6
 
6.5%
4
 
4.3%
3
 
3.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (57) 58
62.4%
Latin
ValueCountFrequency (%)
a 1679
 
8.2%
e 1673
 
8.2%
y 1404
 
6.9%
r 1370
 
6.7%
t 1352
 
6.6%
i 1277
 
6.3%
n 1236
 
6.1%
b 1145
 
5.6%
l 1121
 
5.5%
s 946
 
4.6%
Other values (42) 7185
35.2%
Cyrillic
ValueCountFrequency (%)
о 22
 
9.2%
а 22
 
9.2%
и 20
 
8.3%
р 17
 
7.1%
е 16
 
6.7%
с 16
 
6.7%
л 14
 
5.8%
н 12
 
5.0%
в 10
 
4.2%
т 10
 
4.2%
Other values (26) 81
33.8%
Katakana
ValueCountFrequency (%)
3
 
7.0%
3
 
7.0%
3
 
7.0%
3
 
7.0%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
1
 
2.3%
Other values (20) 20
46.5%
Common
ValueCountFrequency (%)
40370
78.2%
; 7657
 
14.8%
, 867
 
1.7%
[ 822
 
1.6%
] 820
 
1.6%
· 633
 
1.2%
. 223
 
0.4%
- 51
 
0.1%
& 31
 
0.1%
: 29
 
0.1%
Other values (18) 107
 
0.2%
Hiragana
ValueCountFrequency (%)
6
15.8%
5
13.2%
3
 
7.9%
3
 
7.9%
2
 
5.3%
2
 
5.3%
2
 
5.3%
2
 
5.3%
1
 
2.6%
1
 
2.6%
Other values (11) 11
28.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 96852
57.2%
ASCII 71356
42.2%
None 642
 
0.4%
Cyrillic 240
 
0.1%
CJK 93
 
0.1%
Katakana 43
 
< 0.1%
Hiragana 38
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
40370
56.6%
; 7657
 
10.7%
a 1679
 
2.4%
e 1673
 
2.3%
y 1404
 
2.0%
r 1370
 
1.9%
t 1352
 
1.9%
i 1277
 
1.8%
n 1236
 
1.7%
b 1145
 
1.6%
Other values (64) 12193
 
17.1%
Hangul
ValueCountFrequency (%)
5923
 
6.1%
5072
 
5.2%
4979
 
5.1%
4482
 
4.6%
4414
 
4.6%
3827
 
4.0%
3022
 
3.1%
2670
 
2.8%
1712
 
1.8%
1492
 
1.5%
Other values (870) 59259
61.2%
None
ValueCountFrequency (%)
· 633
98.6%
3
 
0.5%
3
 
0.5%
1
 
0.2%
× 1
 
0.2%
1
 
0.2%
Cyrillic
ValueCountFrequency (%)
о 22
 
9.2%
а 22
 
9.2%
и 20
 
8.3%
р 17
 
7.1%
е 16
 
6.7%
с 16
 
6.7%
л 14
 
5.8%
н 12
 
5.0%
в 10
 
4.2%
т 10
 
4.2%
Other values (26) 81
33.8%
Hiragana
ValueCountFrequency (%)
6
15.8%
5
13.2%
3
 
7.9%
3
 
7.9%
2
 
5.3%
2
 
5.3%
2
 
5.3%
2
 
5.3%
1
 
2.6%
1
 
2.6%
Other values (11) 11
28.9%
CJK
ValueCountFrequency (%)
6
 
6.5%
6
 
6.5%
6
 
6.5%
4
 
4.3%
3
 
3.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
2
 
2.2%
Other values (57) 58
62.4%
Katakana
ValueCountFrequency (%)
3
 
7.0%
3
 
7.0%
3
 
7.0%
3
 
7.0%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
1
 
2.3%
Other values (20) 20
46.5%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

발행자
Text

Distinct: 1917
Distinct (%): 19.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 100
Median length: 40
Mean length: 5.2551
Min length: 1

Characters and Unicode

Total characters: 52551
Distinct characters: 753
Distinct categories: 12
Distinct scripts: 7
Distinct blocks: 9
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 884
Unique (%): 8.8%

Sample

1st row: 다산책방
2nd row: 리틀씨앤톡
3rd row: 아를
4th row: 퍼블리온
5th row: 바다
ValueCountFrequency (%)
키즈스콜레 357
 
3.2%
아울북 188
 
1.7%
서울문화사 149
 
1.3%
다산어린이 137
 
1.2%
위즈덤하우스 134
 
1.2%
문학동네 116
 
1.0%
books 112
 
1.0%
창비 105
 
0.9%
좋은책어린이 105
 
0.9%
scholastic 93
 
0.8%
Other values (1987) 9580
86.5%

Most occurring characters

ValueCountFrequency (%)
1907
 
3.6%
1659
 
3.2%
1216
 
2.3%
1142
 
2.2%
o 1122
 
2.1%
1076
 
2.0%
958
 
1.8%
882
 
1.7%
e 842
 
1.6%
813
 
1.5%
Other values (743) 40934
77.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39106
74.4%
Lowercase Letter 8948
 
17.0%
Uppercase Letter 2104
 
4.0%
Space Separator 1076
 
2.0%
Open Punctuation 431
 
0.8%
Close Punctuation 431
 
0.8%
Other Punctuation 280
 
0.5%
Decimal Number 146
 
0.3%
Connector Punctuation 23
 
< 0.1%
Dash Punctuation 3
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1907
 
4.9%
1659
 
4.2%
1216
 
3.1%
1142
 
2.9%
958
 
2.4%
882
 
2.3%
813
 
2.1%
788
 
2.0%
656
 
1.7%
635
 
1.6%
Other values (640) 28450
72.8%
Lowercase Letter
ValueCountFrequency (%)
o 1122
12.5%
e 842
 
9.4%
s 798
 
8.9%
i 778
 
8.7%
n 750
 
8.4%
a 608
 
6.8%
l 567
 
6.3%
r 542
 
6.1%
c 388
 
4.3%
h 380
 
4.2%
Other values (35) 2173
24.3%
Uppercase Letter
ValueCountFrequency (%)
S 314
14.9%
B 263
12.5%
H 243
11.5%
K 231
11.0%
P 164
7.8%
R 160
7.6%
C 129
 
6.1%
E 91
 
4.3%
M 70
 
3.3%
I 53
 
2.5%
Other values (21) 386
18.3%
Other Punctuation
ValueCountFrequency (%)
' 120
42.9%
· 88
31.4%
& 18
 
6.4%
. 18
 
6.4%
/ 10
 
3.6%
# 8
 
2.9%
, 8
 
2.9%
7
 
2.5%
; 3
 
1.1%
Decimal Number
ValueCountFrequency (%)
1 66
45.2%
2 53
36.3%
7 8
 
5.5%
4 5
 
3.4%
8 4
 
2.7%
6 4
 
2.7%
9 3
 
2.1%
5 2
 
1.4%
3 1
 
0.7%
Open Punctuation
ValueCountFrequency (%)
( 260
60.3%
[ 171
39.7%
Close Punctuation
ValueCountFrequency (%)
) 260
60.3%
] 171
39.7%
Space Separator
ValueCountFrequency (%)
1076
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39029
74.3%
Latin 10974
 
20.9%
Common 2393
 
4.6%
Cyrillic 78
 
0.1%
Han 64
 
0.1%
Katakana 10
 
< 0.1%
Hiragana 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1907
 
4.9%
1659
 
4.3%
1216
 
3.1%
1142
 
2.9%
958
 
2.5%
882
 
2.3%
813
 
2.1%
788
 
2.0%
656
 
1.7%
635
 
1.6%
Other values (593) 28373
72.7%
Latin
ValueCountFrequency (%)
o 1122
 
10.2%
e 842
 
7.7%
s 798
 
7.3%
i 778
 
7.1%
n 750
 
6.8%
a 608
 
5.5%
l 567
 
5.2%
r 542
 
4.9%
c 388
 
3.5%
h 380
 
3.5%
Other values (42) 4199
38.3%
Han
ValueCountFrequency (%)
12
18.8%
9
 
14.1%
9
 
14.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
1
 
1.6%
1
 
1.6%
1
 
1.6%
1
 
1.6%
Other values (24) 24
37.5%
Common
ValueCountFrequency (%)
1076
45.0%
( 260
 
10.9%
) 260
 
10.9%
] 171
 
7.1%
[ 171
 
7.1%
' 120
 
5.0%
· 88
 
3.7%
1 66
 
2.8%
2 53
 
2.2%
_ 23
 
1.0%
Other values (17) 105
 
4.4%
Cyrillic
ValueCountFrequency (%)
о 10
12.8%
с 9
11.5%
т 9
11.5%
е 8
10.3%
м 6
7.7%
к 6
7.7%
д 5
 
6.4%
в 5
 
6.4%
а 3
 
3.8%
Э 3
 
3.8%
Other values (14) 14
17.9%
Katakana
ValueCountFrequency (%)
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
Hiragana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39027
74.3%
ASCII 13271
 
25.3%
None 95
 
0.2%
Cyrillic 78
 
0.1%
CJK 64
 
0.1%
Katakana 10
 
< 0.1%
Hiragana 3
 
< 0.1%
Compat Jamo 2
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1907
 
4.9%
1659
 
4.3%
1216
 
3.1%
1142
 
2.9%
958
 
2.5%
882
 
2.3%
813
 
2.1%
788
 
2.0%
656
 
1.7%
635
 
1.6%
Other values (591) 28371
72.7%
ASCII
ValueCountFrequency (%)
o 1122
 
8.5%
1076
 
8.1%
e 842
 
6.3%
s 798
 
6.0%
i 778
 
5.9%
n 750
 
5.7%
a 608
 
4.6%
l 567
 
4.3%
r 542
 
4.1%
c 388
 
2.9%
Other values (66) 5800
43.7%
None
ValueCountFrequency (%)
· 88
92.6%
7
 
7.4%
CJK
ValueCountFrequency (%)
12
18.8%
9
 
14.1%
9
 
14.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
1
 
1.6%
1
 
1.6%
1
 
1.6%
1
 
1.6%
Other values (24) 24
37.5%
Cyrillic
ValueCountFrequency (%)
о 10
12.8%
с 9
11.5%
т 9
11.5%
е 8
10.3%
м 6
7.7%
к 6
7.7%
д 5
 
6.4%
в 5
 
6.4%
а 3
 
3.8%
Э 3
 
3.8%
Other values (14) 14
17.9%
Punctuation
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
Katakana
ValueCountFrequency (%)
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
Hiragana
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

발행년
Categorical

IMBALANCE 

Distinct: 34
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
2023: 5969
2022: 1313
2021: 667
2019: 436
2020: 333
Other values (29): 1282

Length

Max length: 10
Median length: 4
Mean length: 4.0022
Min length: 4

Unique

Unique: 7
Unique (%): 0.1%

Sample

1st row: 2023
2nd row: 2023
3rd row: 2023
4th row: 2023
5th row: 2023

Common Values

ValueCountFrequency (%)
2023 5969
59.7%
2022 1313
 
13.1%
2021 667
 
6.7%
2019 436
 
4.4%
2020 333
 
3.3%
2017 309
 
3.1%
2015 271
 
2.7%
2018 196
 
2.0%
2016 140
 
1.4%
2012 59
 
0.6%
Other values (24) 307
 
3.1%
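
The IMBALANCE flag on this variable reflects how strongly the counts concentrate in one category. The sketch below shows one common way to score that: one minus the Shannon entropy normalized by its maximum. That this is the formula behind the 55.8% figure is an assumption, not something the report states, and the function name `imbalance_score` is hypothetical. Only the ten listed counts are used, because the remaining 24 categories are aggregated in the table, so the printed value is not expected to reproduce 55.8%.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform distribution, near 1 when one class dominates."""
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

# The ten individually listed counts from the Common Values table.
top_counts = [5969, 1313, 667, 436, 333, 309, 271, 196, 140, 59]
print(round(imbalance_score(top_counts), 3))
```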

Length

Histogram of lengths of the category
ValueCountFrequency (%)
2023 5970
59.7%
2022 1313
 
13.1%
2021 667
 
6.7%
2019 436
 
4.4%
2020 333
 
3.3%
2017 309
 
3.1%
2015 271
 
2.7%
2018 197
 
2.0%
2016 140
 
1.4%
2012 59
 
0.6%
Other values (22) 305
 
3.0%

청구기호
Text

Distinct: 9821
Distinct (%): 98.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 21
Median length: 17
Mean length: 11.4546
Min length: 5

Characters and Unicode

Total characters: 114546
Distinct characters: 246
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 9668
Unique (%): 96.7%

Sample

1st row: 843-5
2nd row: 그림책 808.3-3-38
3rd row: 185.5-62
4th row: 189.1-275
5th row: 471.2-2
ValueCountFrequency (%)
아동 2540
 
16.1%
유아 804
 
5.1%
루카도서 673
 
4.3%
그림책 644
 
4.1%
영어 465
 
2.9%
dvd 178
 
1.1%
아동부록 118
 
0.7%
아동참고 103
 
0.7%
ar도서 99
 
0.6%
어린이 91
 
0.6%
Other values (9543) 10103
63.9%

Most occurring characters

ValueCountFrequency (%)
- 14680
12.8%
1 12348
10.8%
8 10935
 
9.5%
3 10033
 
8.8%
2 7498
 
6.5%
. 6385
 
5.6%
4 6259
 
5.5%
5818
 
5.1%
0 4938
 
4.3%
5 4912
 
4.3%
Other values (236) 30740
26.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 69980
61.1%
Dash Punctuation 14680
 
12.8%
Other Letter 14443
 
12.6%
Other Punctuation 6387
 
5.6%
Space Separator 5818
 
5.1%
Math Symbol 987
 
0.9%
Uppercase Letter 781
 
0.7%
Open Punctuation 716
 
0.6%
Close Punctuation 716
 
0.6%
Connector Punctuation 38
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3606
25.0%
2762
19.1%
813
 
5.6%
809
 
5.6%
779
 
5.4%
678
 
4.7%
673
 
4.7%
651
 
4.5%
645
 
4.5%
644
 
4.5%
Other values (206) 2383
16.5%
Uppercase Letter
ValueCountFrequency (%)
D 383
49.0%
V 180
23.0%
A 105
 
13.4%
R 100
 
12.8%
M 5
 
0.6%
T 2
 
0.3%
K 2
 
0.3%
F 1
 
0.1%
I 1
 
0.1%
P 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 12348
17.6%
8 10935
15.6%
3 10033
14.3%
2 7498
10.7%
4 6259
8.9%
0 4938
 
7.1%
5 4912
 
7.0%
9 4734
 
6.8%
7 4663
 
6.7%
6 3660
 
5.2%
Other Punctuation
ValueCountFrequency (%)
. 6385
> 99.9%
' 2
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 985
99.8%
~ 2
 
0.2%
Dash Punctuation
ValueCountFrequency (%)
- 14680
100.0%
Space Separator
ValueCountFrequency (%)
5818
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 716
100.0%
Close Punctuation
ValueCountFrequency (%)
] 716
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 99322
86.7%
Hangul 14443
 
12.6%
Latin 781
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3606
25.0%
2762
19.1%
813
 
5.6%
809
 
5.6%
779
 
5.4%
678
 
4.7%
673
 
4.7%
651
 
4.5%
645
 
4.5%
644
 
4.5%
Other values (206) 2383
16.5%
Common
ValueCountFrequency (%)
- 14680
14.8%
1 12348
12.4%
8 10935
11.0%
3 10033
10.1%
2 7498
7.5%
. 6385
6.4%
4 6259
6.3%
5818
 
5.9%
0 4938
 
5.0%
5 4912
 
4.9%
Other values (9) 15516
15.6%
Latin
ValueCountFrequency (%)
D 383
49.0%
V 180
23.0%
A 105
 
13.4%
R 100
 
12.8%
M 5
 
0.6%
T 2
 
0.3%
K 2
 
0.3%
F 1
 
0.1%
I 1
 
0.1%
P 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100103
87.4%
Hangul 14443
 
12.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 14680
14.7%
1 12348
12.3%
8 10935
10.9%
3 10033
10.0%
2 7498
7.5%
. 6385
 
6.4%
4 6259
 
6.3%
5818
 
5.8%
0 4938
 
4.9%
5 4912
 
4.9%
Other values (20) 16297
16.3%
Hangul
ValueCountFrequency (%)
3606
25.0%
2762
19.1%
813
 
5.6%
809
 
5.6%
779
 
5.4%
678
 
4.7%
673
 
4.7%
651
 
4.5%
645
 
4.5%
644
 
4.5%
Other values (206) 2383
16.5%

자료실
Categorical

HIGH CORRELATION 

Distinct: 18
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
[해운대인문학]종합자료실: 2265
[해운대인문학]어린이자료실: 2166
[반여]어린이실: 1096
[재송]아동자료실(2층): 991
[반여]종합실: 802
Other values (13): 2680

Length

Max length: 22
Median length: 21
Mean length: 12.371
Min length: 2

Unique

Unique: 2
Unique (%): < 0.1%

Sample

1st row: [반여]종합실
2nd row: [반여]어린이실
3rd row: [해운대인문학]종합자료실
4th row: [해운대인문학]종합자료실
5th row: [반여]종합실

Common Values

ValueCountFrequency (%)
[해운대인문학]종합자료실 2265
22.7%
[해운대인문학]어린이자료실 2166
21.7%
[반여]어린이실 1096
11.0%
[재송]아동자료실(2층) 991
9.9%
[반여]종합실 802
 
8.0%
[재송]유아자료실(1층) 667
 
6.7%
[해운대인문학]유아자료실 581
 
5.8%
[해운대인문학]스마트도서관(센텀시티역) 447
 
4.5%
일반 369
 
3.7%
[해운대인문학]스마트도서관(문화복합센터) 256
 
2.6%
Other values (8) 360
 
3.6%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
해운대인문학]종합자료실 2265
22.5%
해운대인문학]어린이자료실 2166
21.6%
반여]어린이실 1096
10.9%
재송]아동자료실(2층 991
9.9%
반여]종합실 802
 
8.0%
재송]유아자료실(1층 667
 
6.6%
해운대인문학]유아자료실 581
 
5.8%
해운대인문학]스마트도서관(센텀시티역 447
 
4.4%
일반 369
 
3.7%
해운대인문학]스마트도서관(문화복합센터 256
 
2.5%
Other values (9) 411
 
4.1%

Unnamed: 7
Date

CONSTANT  MISSING 

Distinct: 1
Distinct (%): 100.0%
Missing: 9999
Missing (%): > 99.9%
Memory size: 156.2 KiB
Minimum: 2024-01-02 00:00:00
Maximum: 2024-01-02 00:00:00
Histogram with fixed size bins (bins=1)
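
The combination of CONSTANT and MISSING, together with "Distinct (%)" of 100.0%, can look contradictory at first. The sketch below rebuilds a stand-in column with the same shape (one date, 9,999 empty cells) and assumes that distinct counts are taken over non-missing values only; that assumption is inferred from the figures above rather than stated in the report.

```python
from datetime import datetime

# Hypothetical stand-in for the column: one date and 9,999 empty cells.
column = [datetime(2024, 1, 2)] + [None] * 9_999

present = [v for v in column if v is not None]
n_missing = len(column) - len(present)
n_distinct = len(set(present))

missing_pct = 100 * n_missing / len(column)
# Assumption: the distinct percentage is relative to non-missing values.
distinct_pct = 100 * n_distinct / len(present)
is_constant = n_distinct == 1

print(n_missing, f"{missing_pct:.2f}%", n_distinct, f"{distinct_pct:.1f}%", is_constant)
```

A missing share of 99.99% is consistent with the "> 99.9%" shown in the report, and a single non-missing value is trivially both 100% distinct and constant.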

Interactions


Correlations

Variable | 번호 | 발행년 | 자료실
번호 | 1.000 | 0.479 | 0.860
발행년 | 0.479 | 1.000 | 0.458
자료실 | 0.860 | 0.458 | 1.000

Variable | 발행년 | 자료실
발행년 | 1.000 | 0.139
자료실 | 0.139 | 1.000

Variable | 번호 | 발행년 | 자료실
번호 | 1.000 | 0.189 | 0.555
발행년 | 0.189 | 1.000 | 0.139
자료실 | 0.555 | 0.139 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 번호 | 서명 | 저자 | 발행자 | 발행년 | 청구기호 | 자료실 | Unnamed: 7
12528 | 12529 | 데이지 존스 앤 더 식스 | 테일러 젠킨스 리드 지음 ; 최세희 옮김 | 다산책방 | 2023 | 843-5 | [반여]종합실 | <NA>
13792 | 13793 | 미래에는 | 허아성 글·그림 | 리틀씨앤톡 | 2023 | 그림책 808.3-3-38 | [반여]어린이실 | <NA>
5693 | 5694 | 신화와 정신분석 | 이창재 지음 | 아를 | 2023 | 185.5-62 | [해운대인문학]종합자료실 | <NA>
3378 | 3379 | 알아차림에 대한 알아차림 | 루퍼트 스파이라 지음 ; 김주환 옮김 | 퍼블리온 | 2023 | 189.1-275 | [해운대인문학]종합자료실 | <NA>
12706 | 12707 | 인간의 자리 | 박한선 지음 | 바다 | 2023 | 471.2-2 | [반여]종합실 | <NA>
6027 | 6028 | 어느 날 은유가 찾아왔다 | 박이강 지음 | 교유서가 | 2023 | 813.7-3091 | [해운대인문학]종합자료실 | <NA>
14601 | 14602 | 나는 왜 네 말을 흘려듣지 못할까 | 미키 이치타로 지음 ; 김주희 옮김 | 갤리온 | 2023 | 189.2-6 | [반여]종합실 | <NA>
13035 | 13036 | 진짜 진짜 재밌는 거미 그림책 | 클라우디아 마틴 지음 ; 앤드류 이스턴 일러스트 ; 김맑아 ,김경덕 [공]옮김 | 라이카미 | 2023 | 아동 495.17-1 | [반여]어린이실 | <NA>
12021 | 12022 | 신비아파트 [비디오녹화자료] : 고스트볼Z 어둠의 퇴마사. 2 | 유재운 감독 | Cj Enm(씨에지 이앤엠)[제공] | 2023 | D아 688.6-1063-2 | [재송]디지털자료실(1층) | <NA>
15693 | 15694 | 노화 공부 : 텔로미어부터 노화 세포, 호르몬, 활성산소, 미토콘드리아까지 우리 몸을 나이 들게 하는 것들 | 이덕철 지음 | 위즈덤하우스 | 2023 | 511.1687-이24노 | 일반 | <NA>

(index) | 번호 | 서명 | 저자 | 발행자 | 발행년 | 청구기호 | 자료실 | Unnamed: 7
5794 | 5795 | 거꾸로 흐르는 강. 한나와 천년의 새장 | 클로드 무를르바 지음 ; 임상훈 옮김 | 문학세계사 | 2023 | 863-534-2 | [해운대인문학]종합자료실 | <NA>
1696 | 1697 | Groundhog day from the Black Lagoon | by Mike Thaler ; illustrated by Jared Lee | Scholastic | 2022 | 영어 843-2492-29 | [해운대인문학]어린이자료실 | <NA>
3350 | 3351 | 처음 우주에 간 고양이, 피자를 맛보다 | 맥 바넷 글 ; 숀 해리스 그림 ; 이숙희 옮김 | 나무의말 | 2023 | 아동 843-682-1 | [해운대인문학]어린이자료실 | <NA>
12327 | 12328 | [루카] The gingerbread man | 캠벨 북스 글 ; 맥밀란 그림 | Kids' Schole(키즈스콜레) | 2019 | 루카도서 747-1-[15] | [재송]유아자료실(1층) | <NA>
5527 | 5528 | 알잖아! 플라스틱을 왜 줄여야 하는지 | 이기규 글 ; 김창호 그림 | 새숲 | 2023 | 아동 539.9-90 | [해운대인문학]어린이자료실 | <NA>
3071 | 3072 | 코코와 아기양말 | 찰리 지음 | 옐로스톤 | 2022 | 유아 813.8-2679 | [해운대인문학]유아자료실 | <NA>
9647 | 9648 | 소리를 보는 아이 | 김희철 글 ; 이소영 그림 | 가문비어린이 | 2019 | 아동 808.9-157-97 | [재송]아동자료실(2층) | <NA>
2467 | 2468 | 그림책 사용 설명서 | 박희연 외 지음 | 초록서재 | 2022 | 029.8-122 | [해운대인문학]종합자료실 | <NA>
11405 | 11406 | 레이크사이드 | 히가시노 게이고 지음 ; 민경욱 옮김 | 하빌리스 | 2023 | 833.6-602 | [재송]아동자료실(2층) | <NA>
5836 | 5837 | 경제기사 궁금증 300문 300답 | 곽해선 지음 ; 추덕영 그림 | 혜다 | 2023 | 320-90-개정판 | [해운대인문학]종합자료실 | <NA>