Overview

Dataset statistics

Number of variables7
Number of observations642
Missing cells1280
Missing cells (%)28.5%
Duplicate rows1
Duplicate rows (%)0.2%
Total size in memory35.9 KiB
Average record size in memory57.2 B

Variable types

Categorical2
Text5

Dataset

Description개방도서열람실도서목록
Author전라북도
URLhttps://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=201658

Alerts

Dataset has 1 (0.2%) duplicate rowsDuplicates
도서분류 is highly overall correlated with Unnamed: 6High correlation
Unnamed: 6 is highly overall correlated with 도서분류High correlation
Unnamed: 6 is highly imbalanced (89.4%)Imbalance
부출서명 has 522 (81.3%) missing valuesMissing
저자명 has 383 (59.7%) missing valuesMissing
출판사 has 373 (58.1%) missing valuesMissing

Reproduction

Analysis started2024-03-14 01:46:47.023858
Analysis finished2024-03-14 01:46:47.962943
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

도서분류
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
어린이
362 
화집
62 
서울시립미술관
 
32
대구시립미술관
 
25
대전시립미술관
 
23
Other values (8)
138 

Length

Max length10
Median length3
Mean length4.3239875
Min length2

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row국공립 도서관 도록
2nd row대전시립미술관
3rd row대전시립미술관
4th row대전시립미술관
5th row대전시립미술관

Common Values

ValueCountFrequency (%)
어린이 362
56.4%
화집 62
 
9.7%
서울시립미술관 32
 
5.0%
대구시립미술관 25
 
3.9%
대전시립미술관 23
 
3.6%
광주시립미술관2 23
 
3.6%
포항시립미술관 22
 
3.4%
경기도립미술관 21
 
3.3%
부산시립미술관 21
 
3.3%
국립현대미술관 20
 
3.1%
Other values (3) 31
 
4.8%

Length

2024-03-14T10:46:48.017914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
어린이 362
56.2%
화집 62
 
9.6%
서울시립미술관 32
 
5.0%
대구시립미술관 25
 
3.9%
대전시립미술관 23
 
3.6%
광주시립미술관2 23
 
3.6%
포항시립미술관 22
 
3.4%
경기도립미술관 21
 
3.3%
부산시립미술관 21
 
3.3%
국립현대미술관 20
 
3.1%
Other values (5) 33
 
5.1%
Distinct462
Distinct (%)72.1%
Missing1
Missing (%)0.2%
Memory size5.1 KiB
2024-03-14T10:46:48.165243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length41
Mean length12.031201
Min length3

Characters and Unicode

Total characters7712
Distinct characters160
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique445 ?
Unique (%)69.4%

Sample

1st row606.9/대74ㅇ c553
2nd row606.9/대74ㅈ c.2 C199
3rd row606.9/대74ㅈ c637
4th rowㄱ 606.9/대74ㅇ c.1 c216
5th rowㄱ 606.9/대74ㅁ c.2 c466
ValueCountFrequency (%)
606.9 177
 
12.3%
번호없음 123
 
8.5%
608 49
 
3.4%
657 28
 
1.9%
600.9 28
 
1.9%
800/식13 27
 
1.9%
800 24
 
1.7%
2014 22
 
1.5%
18
 
1.2%
2013 17
 
1.2%
Other values (720) 927
64.4%
2024-03-14T10:46:48.468883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 929
 
12.0%
822
 
10.7%
0 799
 
10.4%
1 432
 
5.6%
9 431
 
5.6%
2 403
 
5.2%
8 393
 
5.1%
5 374
 
4.8%
7 331
 
4.3%
. 272
 
3.5%
Other values (150) 2526
32.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4551
59.0%
Other Letter 1487
 
19.3%
Space Separator 822
 
10.7%
Other Punctuation 518
 
6.7%
Uppercase Letter 254
 
3.3%
Lowercase Letter 37
 
0.5%
Open Punctuation 21
 
0.3%
Close Punctuation 21
 
0.3%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
126
 
8.5%
126
 
8.5%
126
 
8.5%
126
 
8.5%
107
 
7.2%
66
 
4.4%
47
 
3.2%
46
 
3.1%
43
 
2.9%
41
 
2.8%
Other values (114) 633
42.6%
Uppercase Letter
ValueCountFrequency (%)
C 196
77.2%
A 37
 
14.6%
T 6
 
2.4%
S 3
 
1.2%
D 3
 
1.2%
N 2
 
0.8%
J 2
 
0.8%
Q 1
 
0.4%
R 1
 
0.4%
M 1
 
0.4%
Other values (2) 2
 
0.8%
Decimal Number
ValueCountFrequency (%)
6 929
20.4%
0 799
17.6%
1 432
9.5%
9 431
9.5%
2 403
8.9%
8 393
8.6%
5 374
8.2%
7 331
 
7.3%
4 243
 
5.3%
3 216
 
4.7%
Other Punctuation
ValueCountFrequency (%)
. 272
52.5%
/ 194
37.5%
; 26
 
5.0%
, 24
 
4.6%
? 2
 
0.4%
Lowercase Letter
ValueCountFrequency (%)
c 22
59.5%
v 12
32.4%
s 1
 
2.7%
f 1
 
2.7%
z 1
 
2.7%
Space Separator
ValueCountFrequency (%)
822
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5934
76.9%
Hangul 1487
 
19.3%
Latin 291
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
126
 
8.5%
126
 
8.5%
126
 
8.5%
126
 
8.5%
107
 
7.2%
66
 
4.4%
47
 
3.2%
46
 
3.1%
43
 
2.9%
41
 
2.8%
Other values (114) 633
42.6%
Common
ValueCountFrequency (%)
6 929
15.7%
822
13.9%
0 799
13.5%
1 432
7.3%
9 431
7.3%
2 403
6.8%
8 393
6.6%
5 374
6.3%
7 331
 
5.6%
. 272
 
4.6%
Other values (9) 748
12.6%
Latin
ValueCountFrequency (%)
C 196
67.4%
A 37
 
12.7%
c 22
 
7.6%
v 12
 
4.1%
T 6
 
2.1%
S 3
 
1.0%
D 3
 
1.0%
N 2
 
0.7%
J 2
 
0.7%
s 1
 
0.3%
Other values (7) 7
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6225
80.7%
Hangul 1046
 
13.6%
Compat Jamo 441
 
5.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 929
14.9%
822
13.2%
0 799
12.8%
1 432
6.9%
9 431
6.9%
2 403
 
6.5%
8 393
 
6.3%
5 374
 
6.0%
7 331
 
5.3%
. 272
 
4.4%
Other values (26) 1039
16.7%
Hangul
ValueCountFrequency (%)
126
 
12.0%
126
 
12.0%
126
 
12.0%
126
 
12.0%
47
 
4.5%
43
 
4.1%
41
 
3.9%
32
 
3.1%
27
 
2.6%
24
 
2.3%
Other values (98) 328
31.4%
Compat Jamo
ValueCountFrequency (%)
107
24.3%
66
15.0%
46
10.4%
41
 
9.3%
37
 
8.4%
33
 
7.5%
23
 
5.2%
19
 
4.3%
17
 
3.9%
16
 
3.6%
Other values (6) 36
 
8.2%
Distinct625
Distinct (%)97.5%
Missing1
Missing (%)0.2%
Memory size5.1 KiB
2024-03-14T10:46:48.699012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length38
Mean length11.775351
Min length2

Characters and Unicode

Total characters7548
Distinct characters633
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique610 ?
Unique (%)95.2%

Sample

1st row웃음이 난다(sense of humor)
2nd row전환의 봄(Conversion's spring)
3rd row전혁림
4th row얼굴, 표정
5th row모든 경계에는 꽃이 핀다
ValueCountFrequency (%)
21세기 15
 
0.8%
먼나라 14
 
0.8%
2014 13
 
0.7%
이야기 13
 
0.7%
art 12
 
0.7%
바람의 11
 
0.6%
파이터 10
 
0.5%
9
 
0.5%
9
 
0.5%
동화 9
 
0.5%
Other values (1337) 1715
93.7%
2024-03-14T10:46:49.046574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1205
 
16.0%
150
 
2.0%
0 130
 
1.7%
126
 
1.7%
2 116
 
1.5%
1 109
 
1.4%
90
 
1.2%
77
 
1.0%
75
 
1.0%
A 70
 
0.9%
Other values (623) 5400
71.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4585
60.7%
Space Separator 1205
 
16.0%
Uppercase Letter 720
 
9.5%
Decimal Number 486
 
6.4%
Lowercase Letter 446
 
5.9%
Other Punctuation 52
 
0.7%
Dash Punctuation 24
 
0.3%
Math Symbol 8
 
0.1%
Open Punctuation 6
 
0.1%
Close Punctuation 6
 
0.1%
Other values (2) 10
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
150
 
3.3%
126
 
2.7%
90
 
2.0%
77
 
1.7%
75
 
1.6%
68
 
1.5%
64
 
1.4%
62
 
1.4%
60
 
1.3%
57
 
1.2%
Other values (544) 3756
81.9%
Uppercase Letter
ValueCountFrequency (%)
A 70
 
9.7%
N 67
 
9.3%
E 64
 
8.9%
O 63
 
8.8%
I 57
 
7.9%
S 49
 
6.8%
T 37
 
5.1%
R 34
 
4.7%
U 33
 
4.6%
G 27
 
3.8%
Other values (16) 219
30.4%
Lowercase Letter
ValueCountFrequency (%)
i 45
10.1%
t 44
9.9%
e 41
 
9.2%
o 40
 
9.0%
a 38
 
8.5%
r 38
 
8.5%
n 37
 
8.3%
s 27
 
6.1%
u 17
 
3.8%
g 16
 
3.6%
Other values (15) 103
23.1%
Decimal Number
ValueCountFrequency (%)
0 130
26.7%
2 116
23.9%
1 109
22.4%
4 27
 
5.6%
5 23
 
4.7%
3 22
 
4.5%
9 17
 
3.5%
7 15
 
3.1%
8 15
 
3.1%
6 12
 
2.5%
Other Punctuation
ValueCountFrequency (%)
, 29
55.8%
: 8
 
15.4%
? 7
 
13.5%
& 3
 
5.8%
! 2
 
3.8%
' 1
 
1.9%
1
 
1.9%
. 1
 
1.9%
Math Symbol
ValueCountFrequency (%)
< 3
37.5%
> 3
37.5%
~ 2
25.0%
Other Number
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
Space Separator
ValueCountFrequency (%)
1205
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4583
60.7%
Common 1797
 
23.8%
Latin 1166
 
15.4%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
150
 
3.3%
126
 
2.7%
90
 
2.0%
77
 
1.7%
75
 
1.6%
68
 
1.5%
64
 
1.4%
62
 
1.4%
60
 
1.3%
57
 
1.2%
Other values (542) 3754
81.9%
Latin
ValueCountFrequency (%)
A 70
 
6.0%
N 67
 
5.7%
E 64
 
5.5%
O 63
 
5.4%
I 57
 
4.9%
S 49
 
4.2%
i 45
 
3.9%
t 44
 
3.8%
e 41
 
3.5%
o 40
 
3.4%
Other values (41) 626
53.7%
Common
ValueCountFrequency (%)
1205
67.1%
0 130
 
7.2%
2 116
 
6.5%
1 109
 
6.1%
, 29
 
1.6%
4 27
 
1.5%
- 24
 
1.3%
5 23
 
1.3%
3 22
 
1.2%
9 17
 
0.9%
Other values (18) 95
 
5.3%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4583
60.7%
ASCII 2957
39.2%
Enclosed Alphanum 5
 
0.1%
CJK 2
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1205
40.8%
0 130
 
4.4%
2 116
 
3.9%
1 109
 
3.7%
A 70
 
2.4%
N 67
 
2.3%
E 64
 
2.2%
O 63
 
2.1%
I 57
 
1.9%
S 49
 
1.7%
Other values (66) 1027
34.7%
Hangul
ValueCountFrequency (%)
150
 
3.3%
126
 
2.7%
90
 
2.0%
77
 
1.7%
75
 
1.6%
68
 
1.5%
64
 
1.4%
62
 
1.4%
60
 
1.3%
57
 
1.2%
Other values (542) 3754
81.9%
Enclosed Alphanum
ValueCountFrequency (%)
4
80.0%
1
 
20.0%
None
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

부출서명
Text

MISSING 

Distinct83
Distinct (%)69.2%
Missing522
Missing (%)81.3%
Memory size5.1 KiB
2024-03-14T10:46:49.297326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length22
Mean length10.583333
Min length2

Characters and Unicode

Total characters1270
Distinct characters250
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)62.5%

Sample

1st row포항시립개관기념전
2nd row피카소가 모나리자를 그린다면?
3rd row수학 그림동화
4th row초등학생이 보는 지식정보 그림책
5th row좋은 그림동화
ValueCountFrequency (%)
그림동화 27
 
8.8%
한국전래 25
 
8.2%
보는 6
 
2.0%
들려주는 5
 
1.6%
함게 5
 
1.6%
김미진 4
 
1.3%
선생님이 4
 
1.3%
미술동화 4
 
1.3%
남북 4
 
1.3%
어린이가 4
 
1.3%
Other values (187) 218
71.2%
2024-03-14T10:46:49.698847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
225
 
17.7%
44
 
3.5%
39
 
3.1%
36
 
2.8%
35
 
2.8%
34
 
2.7%
33
 
2.6%
33
 
2.6%
33
 
2.6%
32
 
2.5%
Other values (240) 726
57.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1005
79.1%
Space Separator 225
 
17.7%
Decimal Number 34
 
2.7%
Uppercase Letter 3
 
0.2%
Other Punctuation 2
 
0.2%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
4.4%
39
 
3.9%
36
 
3.6%
35
 
3.5%
34
 
3.4%
33
 
3.3%
33
 
3.3%
33
 
3.3%
32
 
3.2%
25
 
2.5%
Other values (226) 661
65.8%
Decimal Number
ValueCountFrequency (%)
1 11
32.4%
0 9
26.5%
3 6
17.6%
2 4
 
11.8%
4 2
 
5.9%
8 1
 
2.9%
5 1
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
I 1
33.3%
B 1
33.3%
F 1
33.3%
Other Punctuation
ValueCountFrequency (%)
? 1
50.0%
! 1
50.0%
Space Separator
ValueCountFrequency (%)
225
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1005
79.1%
Common 262
 
20.6%
Latin 3
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
4.4%
39
 
3.9%
36
 
3.6%
35
 
3.5%
34
 
3.4%
33
 
3.3%
33
 
3.3%
33
 
3.3%
32
 
3.2%
25
 
2.5%
Other values (226) 661
65.8%
Common
ValueCountFrequency (%)
225
85.9%
1 11
 
4.2%
0 9
 
3.4%
3 6
 
2.3%
2 4
 
1.5%
4 2
 
0.8%
? 1
 
0.4%
8 1
 
0.4%
5 1
 
0.4%
~ 1
 
0.4%
Latin
ValueCountFrequency (%)
I 1
33.3%
B 1
33.3%
F 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1005
79.1%
ASCII 265
 
20.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
225
84.9%
1 11
 
4.2%
0 9
 
3.4%
3 6
 
2.3%
2 4
 
1.5%
4 2
 
0.8%
? 1
 
0.4%
8 1
 
0.4%
5 1
 
0.4%
~ 1
 
0.4%
Other values (4) 4
 
1.5%
Hangul
ValueCountFrequency (%)
44
 
4.4%
39
 
3.9%
36
 
3.6%
35
 
3.5%
34
 
3.4%
33
 
3.3%
33
 
3.3%
33
 
3.3%
32
 
3.2%
25
 
2.5%
Other values (226) 661
65.8%

저자명
Text

MISSING 

Distinct150
Distinct (%)57.9%
Missing383
Missing (%)59.7%
Memory size5.1 KiB
2024-03-14T10:46:50.015068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length3
Mean length4.8494208
Min length1

Characters and Unicode

Total characters1256
Distinct characters247
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique115 ?
Unique (%)44.4%

Sample

1st row표트르 바르소니
2nd row토미 웅게러
3rd row에릭 나이트
4th row엘리자베르 보르헤르스
5th row로지 디킨즈
ValueCountFrequency (%)
허영만 27
 
7.6%
이원복 15
 
4.2%
방학기 15
 
4.2%
이정민 6
 
1.7%
이주헌 6
 
1.7%
대교 5
 
1.4%
어린이 5
 
1.4%
tv 5
 
1.4%
장진영 4
 
1.1%
v.노스이스트 4
 
1.1%
Other values (204) 265
74.2%
2024-03-14T10:46:50.423209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
105
 
8.4%
52
 
4.1%
37
 
2.9%
32
 
2.5%
29
 
2.3%
27
 
2.1%
27
 
2.1%
23
 
1.8%
20
 
1.6%
20
 
1.6%
Other values (237) 884
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1116
88.9%
Space Separator 105
 
8.4%
Other Punctuation 19
 
1.5%
Uppercase Letter 16
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
52
 
4.7%
37
 
3.3%
32
 
2.9%
29
 
2.6%
27
 
2.4%
27
 
2.4%
23
 
2.1%
20
 
1.8%
20
 
1.8%
19
 
1.7%
Other values (227) 830
74.4%
Other Punctuation
ValueCountFrequency (%)
; 6
31.6%
, 5
26.3%
. 4
21.1%
/ 3
15.8%
? 1
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
V 9
56.2%
T 5
31.2%
Y 1
 
6.2%
L 1
 
6.2%
Space Separator
ValueCountFrequency (%)
105
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1116
88.9%
Common 124
 
9.9%
Latin 16
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
4.7%
37
 
3.3%
32
 
2.9%
29
 
2.6%
27
 
2.4%
27
 
2.4%
23
 
2.1%
20
 
1.8%
20
 
1.8%
19
 
1.7%
Other values (227) 830
74.4%
Common
ValueCountFrequency (%)
105
84.7%
; 6
 
4.8%
, 5
 
4.0%
. 4
 
3.2%
/ 3
 
2.4%
? 1
 
0.8%
Latin
ValueCountFrequency (%)
V 9
56.2%
T 5
31.2%
Y 1
 
6.2%
L 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1116
88.9%
ASCII 140
 
11.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
105
75.0%
V 9
 
6.4%
; 6
 
4.3%
T 5
 
3.6%
, 5
 
3.6%
. 4
 
2.9%
/ 3
 
2.1%
Y 1
 
0.7%
L 1
 
0.7%
? 1
 
0.7%
Hangul
ValueCountFrequency (%)
52
 
4.7%
37
 
3.3%
32
 
2.9%
29
 
2.6%
27
 
2.4%
27
 
2.4%
23
 
2.1%
20
 
1.8%
20
 
1.8%
19
 
1.7%
Other values (227) 830
74.4%

출판사
Text

MISSING 

Distinct69
Distinct (%)25.7%
Missing373
Missing (%)58.1%
Memory size5.1 KiB
2024-03-14T10:46:50.914352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length3
Mean length3.8624535
Min length2

Characters and Unicode

Total characters1039
Distinct characters154
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)16.4%

Sample

1st row내인생의책
2nd row동쪽나라
3rd row시공주니어
4th row비룡소
5th row예경
ValueCountFrequency (%)
사계절 46
14.8%
김영사 44
 
14.2%
비룡소 26
 
8.4%
한성닷컴 25
 
8.1%
길찾기 14
 
4.5%
다빈치 13
 
4.2%
기프트 9
 
2.9%
나무숲 8
 
2.6%
대원씨아이 6
 
1.9%
애니북스 6
 
1.9%
Other values (71) 113
36.5%
2024-03-14T10:46:51.248545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
100
 
9.6%
47
 
4.5%
46
 
4.4%
46
 
4.4%
44
 
4.2%
42
 
4.0%
36
 
3.5%
28
 
2.7%
27
 
2.6%
26
 
2.5%
Other values (144) 597
57.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 982
94.5%
Space Separator 42
 
4.0%
Uppercase Letter 8
 
0.8%
Other Punctuation 3
 
0.3%
Decimal Number 2
 
0.2%
Open Punctuation 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
100
 
10.2%
47
 
4.8%
46
 
4.7%
46
 
4.7%
44
 
4.5%
36
 
3.7%
28
 
2.9%
27
 
2.7%
26
 
2.6%
26
 
2.6%
Other values (133) 556
56.6%
Uppercase Letter
ValueCountFrequency (%)
T 3
37.5%
V 3
37.5%
M 1
 
12.5%
B 1
 
12.5%
Other Punctuation
ValueCountFrequency (%)
? 2
66.7%
& 1
33.3%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
2 1
50.0%
Space Separator
ValueCountFrequency (%)
42
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 982
94.5%
Common 49
 
4.7%
Latin 8
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
100
 
10.2%
47
 
4.8%
46
 
4.7%
46
 
4.7%
44
 
4.5%
36
 
3.7%
28
 
2.9%
27
 
2.7%
26
 
2.6%
26
 
2.6%
Other values (133) 556
56.6%
Common
ValueCountFrequency (%)
42
85.7%
? 2
 
4.1%
1 1
 
2.0%
& 1
 
2.0%
2 1
 
2.0%
( 1
 
2.0%
) 1
 
2.0%
Latin
ValueCountFrequency (%)
T 3
37.5%
V 3
37.5%
M 1
 
12.5%
B 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 982
94.5%
ASCII 57
 
5.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
100
 
10.2%
47
 
4.8%
46
 
4.7%
46
 
4.7%
44
 
4.5%
36
 
3.7%
28
 
2.9%
27
 
2.7%
26
 
2.6%
26
 
2.6%
Other values (133) 556
56.6%
ASCII
ValueCountFrequency (%)
42
73.7%
T 3
 
5.3%
V 3
 
5.3%
? 2
 
3.5%
M 1
 
1.8%
1 1
 
1.8%
B 1
 
1.8%
& 1
 
1.8%
2 1
 
1.8%
( 1
 
1.8%

Unnamed: 6
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
<NA>
633 
0
 
9

Length

Max length4
Median length4
Mean length3.9579439
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 633
98.6%
0 9
 
1.4%

Length

2024-03-14T10:46:51.385708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T10:46:51.486593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 633
98.6%
0 9
 
1.4%

Correlations

2024-03-14T10:46:51.545310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서분류부출서명출판사
도서분류1.0001.0001.000
부출서명1.0001.0001.000
출판사1.0001.0001.000
2024-03-14T10:46:51.645833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서분류Unnamed: 6
도서분류1.0001.000
Unnamed: 61.0001.000
2024-03-14T10:46:51.716174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도서분류Unnamed: 6
도서분류1.0001.000
Unnamed: 61.0001.000

Missing values

2024-03-14T10:46:47.731894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T10:46:47.821698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T10:46:47.907331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

도서분류청구기호도서명부출서명저자명출판사Unnamed: 6
0국공립 도서관 도록<NA><NA><NA><NA><NA><NA>
1대전시립미술관606.9/대74ㅇ c553웃음이 난다(sense of humor)<NA><NA><NA><NA>
2대전시립미술관606.9/대74ㅈ c.2 C199전환의 봄(Conversion's spring)<NA><NA><NA><NA>
3대전시립미술관606.9/대74ㅈ c637전혁림<NA><NA><NA><NA>
4대전시립미술관ㄱ 606.9/대74ㅇ c.1 c216얼굴, 표정<NA><NA><NA><NA>
5대전시립미술관ㄱ 606.9/대74ㅁ c.2 c466모든 경계에는 꽃이 핀다<NA><NA><NA><NA>
6대전시립미술관ㄱ 606.9/대74ㄷ c.1 c186, c187(2권)대전미디어아트 2000<NA><NA><NA><NA>
7대전시립미술관606.9/대74ㅅ 2014 C001389미술관속사진페스티벌<NA><NA><NA><NA>
8대전시립미술관606.9/대74ㄷ 2014 C0014322014 대전미술의 지평 NAMO<NA><NA><NA><NA>
9대전시립미술관606.9/대74ㄷ 2014 C0014332014 대전미술의 지평 정장직<NA><NA><NA><NA>
도서분류청구기호도서명부출서명저자명출판사Unnamed: 6
632어린이번호없음황금거위<NA>그림 형제<NA><NA>
633어린이657 이26ㅁ 734머털도사와 벌레대왕어린이 만화세상04청년사<NA>
634어린이번호없음이야기로 배우는 경제영재들의 1등급 경제교실<NA>김상규<NA>
635어린이608 김75ㅁ 2011 v.1 3336① 서양 미술사미술 100장면대원키즈김윤수<NA>
636어린이608 박54ㅁ 2011 v.2 3337② 한국 미술사미술 100장면대원키즈김윤수<NA>
637어린이657 애198ㅇ에니메이션 교실애니메이션의 기초 지식과 작화의 실제<NA>조경수<NA>
638어린이번호없음어린이를 위한 마시멜로 이야기<NA>깊은산속옹달샘호아킴 데 포사다<NA>
639어린이657/전64ㅁ마요의 해피 쿡<NA>전영재스크린 M&B<NA>
640어린이800/김63ㄷ두근두근 내인생<NA>김애란참비<NA>
641어린이660 김62ㄱ A1703아타 김 ON-AIR EIGHTHOUR<NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

도서분류청구기호도서명부출서명저자명출판사Unnamed: 6# duplicates
0어린이800 박95ㅎ학교 가는 길을 개척할 거야<NA>박효미사계절<NA>2