Overview

Dataset statistics

Number of variables: 7
Number of observations: 6309
Missing cells: 17
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 345.2 KiB
Average record size in memory: 56.0 B
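
The derived figures in this block follow from the raw counts. The sketch below recomputes the missing-cell share and the average record size from the numbers listed above, assuming 1 KiB = 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the dataset statistics above.
n_variables = 7
n_observations = 6309
missing_cells = 17
total_size_kib = 345.2

# Share of missing cells across the whole table (rows x columns).
missing_pct = missing_cells / (n_variables * n_observations) * 100

# Average bytes per record, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.3f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The missing share comes out well below 0.1%, and the record size rounds to the 56.0 B shown in the report.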

Variable types

Categorical: 1
Text: 4
DateTime: 1
Boolean: 1

Dataset

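Description: Cheonan City Library makes the e-book materials it has purchased available online; e-books can be borrowed or subscribed to through the mobile service and the library website.
URL: https://www.data.go.kr/data/15090702/fileData.do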

Alerts

Imbalance: 서비스 유무 is highly imbalanced (92.8%)
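
The 92.8% figure is consistent with an imbalance score defined as one minus the normalized Shannon entropy of the class frequencies. The sketch below assumes that definition (the report itself does not state the formula) and applies it to the True/False counts listed for 서비스 유무 in the Variables section; `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2 or total == 0:
        return 0.0
    # Entropy in bits; classes with zero count contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)

# Counts reported for 서비스 유무: True = 6254, False = 55.
score = imbalance_score([6254, 55])
print(f"{score:.1%}")
```

A score of 0 would mean perfectly balanced classes, and a score near 1 means almost all rows fall into one class.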

Reproduction

Analysis started: 2023-12-12 09:12:07.164501
Analysis finished: 2023-12-12 09:12:09.068794
Duration: 1.9 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

도서공급사
Categorical

Distinct: 4
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 49.4 KiB

(주)북큐브네트웍스: 3122
교보전자책: 1373
북레일전자책: 1062
우리전자책: 752

Length

Max length: 10
Median length: 6
Mean length: 7.6425741
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%
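
Distinct and Unique measure different things: Distinct counts how many different values occur, while Unique counts the values that occur exactly once. That is why 도서공급사 has 4 distinct values but 0 unique ones, since every supplier appears many times. The sketch below illustrates the distinction on a hypothetical miniature column; `distinct_and_unique` is an illustrative helper, not part of the profiling tool.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique): different values seen, and values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

# Hypothetical miniature column, not taken from the dataset.
suppliers = ["A", "A", "B", "B", "B", "C"]
distinct, unique = distinct_and_unique(suppliers)
print(distinct, unique)
```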

Sample

1st row: (주)북큐브네트웍스
2nd row: (주)북큐브네트웍스
3rd row: (주)북큐브네트웍스
4th row: (주)북큐브네트웍스
5th row: (주)북큐브네트웍스

Common Values

Value | Count | Frequency (%)
(주)북큐브네트웍스 | 3122 | 49.5%
교보전자책 | 1373 | 21.8%
북레일전자책 | 1062 | 16.8%
우리전자책 | 752 | 11.9%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
(주)북큐브네트웍스 | 3122 | 49.5%
교보전자책 | 1373 | 21.8%
북레일전자책 | 1062 | 16.8%
우리전자책 | 752 | 11.9%

출판사
Text

Distinct: 997
Distinct (%): 15.8%
Missing: 0
Missing (%): 0.0%
Memory size: 49.4 KiB

Length

Max length: 16
Median length: 14
Mean length: 4.4964337
Min length: 1

Characters and Unicode

Total characters: 28368
Distinct characters: 545
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
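
As a minimal sketch of how such per-category counts can be obtained, the example below uses Python's standard `unicodedata` module to tally the general category of each character in a string. The input is a book title that appears in this report's sample rows; `category_counts` is a hypothetical helper, and the profiling tool's own implementation may differ.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters in `text` by Unicode general category code (e.g. 'Lo', 'Lu')."""
    return Counter(unicodedata.category(ch) for ch in text)

# Hangul syllables are 'Lo' (Other Letter), Latin capitals are 'Lu'
# (Uppercase Letter), and the ASCII space is 'Zs' (Space Separator).
counts = category_counts("챗GPT 혁명")
print(counts)
```

The two-letter codes map onto the category names used in the tables that follow, for example `Lo` for Other Letter and `Nd` for Decimal Number.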

Unique

Unique: 433
Unique (%): 6.9%

Sample

1st row: 베가북스
2nd row: 웅진지식하우스
3rd row: 웅진지식하우스
4th row: 위즈덤하우스
5th row: 갤리온
Value | Count | Frequency (%)
위즈덤하우스 | 296 | 4.6%
지혜의숲 | 134 | 2.1%
rhk | 133 | 2.1%
성현사 | 124 | 1.9%
21세기북스 | 121 | 1.9%
동도서기 | 108 | 1.7%
문학동네 | 106 | 1.7%
광보사 | 102 | 1.6%
웅진지식하우스 | 93 | 1.4%
나무생각 | 91 | 1.4%
Other values (990) | 5106 | 79.6%

Most occurring characters

ValueCountFrequency (%)
1821
 
6.4%
1136
 
4.0%
708
 
2.5%
623
 
2.2%
494
 
1.7%
469
 
1.7%
468
 
1.6%
444
 
1.6%
419
 
1.5%
391
 
1.4%
Other values (535) 21395
75.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 26023
91.7%
Uppercase Letter 815
 
2.9%
Lowercase Letter 661
 
2.3%
Decimal Number 324
 
1.1%
Close Punctuation 206
 
0.7%
Open Punctuation 206
 
0.7%
Space Separator 105
 
0.4%
Other Punctuation 26
 
0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1821
 
7.0%
1136
 
4.4%
708
 
2.7%
623
 
2.4%
494
 
1.9%
469
 
1.8%
468
 
1.8%
444
 
1.7%
419
 
1.6%
391
 
1.5%
Other values (478) 19050
73.2%
Uppercase Letter
ValueCountFrequency (%)
K 165
20.2%
H 145
17.8%
R 139
17.1%
B 63
 
7.7%
D 57
 
7.0%
O 49
 
6.0%
M 43
 
5.3%
I 39
 
4.8%
P 30
 
3.7%
S 23
 
2.8%
Other values (12) 62
 
7.6%
Lowercase Letter
ValueCountFrequency (%)
e 122
18.5%
t 86
13.0%
r 84
12.7%
a 76
11.5%
o 70
10.6%
s 47
 
7.1%
n 28
 
4.2%
y 23
 
3.5%
k 23
 
3.5%
l 20
 
3.0%
Other values (9) 82
12.4%
Decimal Number
ValueCountFrequency (%)
2 134
41.4%
1 131
40.4%
0 16
 
4.9%
4 16
 
4.9%
3 11
 
3.4%
9 8
 
2.5%
8 7
 
2.2%
6 1
 
0.3%
Other Punctuation
ValueCountFrequency (%)
. 18
69.2%
& 4
 
15.4%
# 3
 
11.5%
: 1
 
3.8%
Close Punctuation
ValueCountFrequency (%)
) 206
100.0%
Open Punctuation
ValueCountFrequency (%)
( 206
100.0%
Space Separator
ValueCountFrequency (%)
105
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 26021
91.7%
Latin 1476
 
5.2%
Common 867
 
3.1%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1821
 
7.0%
1136
 
4.4%
708
 
2.7%
623
 
2.4%
494
 
1.9%
469
 
1.8%
468
 
1.8%
444
 
1.7%
419
 
1.6%
391
 
1.5%
Other values (475) 19048
73.2%
Latin
ValueCountFrequency (%)
K 165
 
11.2%
H 145
 
9.8%
R 139
 
9.4%
e 122
 
8.3%
t 86
 
5.8%
r 84
 
5.7%
a 76
 
5.1%
o 70
 
4.7%
B 63
 
4.3%
D 57
 
3.9%
Other values (31) 469
31.8%
Common
ValueCountFrequency (%)
) 206
23.8%
( 206
23.8%
2 134
15.5%
1 131
15.1%
105
12.1%
. 18
 
2.1%
0 16
 
1.8%
4 16
 
1.8%
3 11
 
1.3%
9 8
 
0.9%
Other values (5) 16
 
1.8%
Han
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 26019
91.7%
ASCII 2343
 
8.3%
CJK 3
 
< 0.1%
None 2
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1821
 
7.0%
1136
 
4.4%
708
 
2.7%
623
 
2.4%
494
 
1.9%
469
 
1.8%
468
 
1.8%
444
 
1.7%
419
 
1.6%
391
 
1.5%
Other values (474) 19046
73.2%
ASCII
ValueCountFrequency (%)
) 206
 
8.8%
( 206
 
8.8%
K 165
 
7.0%
H 145
 
6.2%
R 139
 
5.9%
2 134
 
5.7%
1 131
 
5.6%
e 122
 
5.2%
105
 
4.5%
t 86
 
3.7%
Other values (46) 904
38.6%
None
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

도서분철
Text

Distinct: 6283
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Memory size: 49.4 KiB

Length

Max length: 92
Median length: 68
Mean length: 17.655254
Min length: 1

Characters and Unicode

Total characters: 111387
Distinct characters: 1283
Distinct categories: 15
Distinct scripts: 4
Distinct blocks: 8
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6257
Unique (%): 99.2%

Sample

1st row: 챗GPT 혁명
2nd row: 공부하고 있다는 착각
3rd row: 그럴 수 있어
4th row: 내 아이가 낯설어진 부모들에게
5th row: 스몰 트라우마
ValueCountFrequency (%)
1507
 
4.9%
위한 185
 
0.6%
2 170
 
0.6%
1 157
 
0.5%
이야기 152
 
0.5%
137
 
0.4%
나는 136
 
0.4%
87
 
0.3%
나를 85
 
0.3%
79
 
0.3%
Other values (12857) 27984
91.2%

Most occurring characters

ValueCountFrequency (%)
24401
 
21.9%
2278
 
2.0%
2144
 
1.9%
2143
 
1.9%
1365
 
1.2%
1196
 
1.1%
1189
 
1.1%
1135
 
1.0%
1132
 
1.0%
1127
 
1.0%
Other values (1273) 73277
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 78279
70.3%
Space Separator 24401
 
21.9%
Decimal Number 2632
 
2.4%
Other Punctuation 1913
 
1.7%
Lowercase Letter 1061
 
1.0%
Uppercase Letter 954
 
0.9%
Dash Punctuation 701
 
0.6%
Open Punctuation 665
 
0.6%
Close Punctuation 664
 
0.6%
Math Symbol 57
 
0.1%
Other values (5) 60
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2278
 
2.9%
2144
 
2.7%
2143
 
2.7%
1365
 
1.7%
1196
 
1.5%
1189
 
1.5%
1135
 
1.4%
1132
 
1.4%
1127
 
1.4%
1084
 
1.4%
Other values (1162) 63486
81.1%
Uppercase Letter
ValueCountFrequency (%)
S 106
 
11.1%
T 103
 
10.8%
A 75
 
7.9%
E 70
 
7.3%
I 65
 
6.8%
O 59
 
6.2%
N 53
 
5.6%
M 44
 
4.6%
B 44
 
4.6%
C 43
 
4.5%
Other values (16) 292
30.6%
Lowercase Letter
ValueCountFrequency (%)
e 196
18.5%
o 84
 
7.9%
i 79
 
7.4%
n 76
 
7.2%
t 74
 
7.0%
h 67
 
6.3%
a 65
 
6.1%
r 52
 
4.9%
l 52
 
4.9%
s 48
 
4.5%
Other values (15) 268
25.3%
Other Punctuation
ValueCountFrequency (%)
: 922
48.2%
, 580
30.3%
! 127
 
6.6%
. 101
 
5.3%
? 95
 
5.0%
· 28
 
1.5%
' 14
 
0.7%
% 12
 
0.6%
& 9
 
0.5%
5
 
0.3%
Other values (8) 20
 
1.0%
Decimal Number
ValueCountFrequency (%)
1 674
25.6%
0 600
22.8%
2 469
17.8%
3 254
 
9.7%
5 169
 
6.4%
4 141
 
5.4%
9 97
 
3.7%
7 80
 
3.0%
8 74
 
2.8%
6 74
 
2.8%
Open Punctuation
ValueCountFrequency (%)
( 341
51.3%
[ 300
45.1%
14
 
2.1%
6
 
0.9%
3
 
0.5%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 340
51.2%
] 300
45.2%
14
 
2.1%
6
 
0.9%
3
 
0.5%
1
 
0.2%
Math Symbol
ValueCountFrequency (%)
15
26.3%
+ 15
26.3%
~ 13
22.8%
| 10
17.5%
> 2
 
3.5%
< 2
 
3.5%
Other Number
ValueCountFrequency (%)
2
25.0%
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Final Punctuation
ValueCountFrequency (%)
16
88.9%
2
 
11.1%
Initial Punctuation
ValueCountFrequency (%)
15
88.2%
2
 
11.8%
Space Separator
ValueCountFrequency (%)
24401
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 701
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 11
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 78233
70.2%
Common 31093
 
27.9%
Latin 2015
 
1.8%
Han 46
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2278
 
2.9%
2144
 
2.7%
2143
 
2.7%
1365
 
1.7%
1196
 
1.5%
1189
 
1.5%
1135
 
1.5%
1132
 
1.4%
1127
 
1.4%
1084
 
1.4%
Other values (1120) 63440
81.1%
Common
ValueCountFrequency (%)
24401
78.5%
: 922
 
3.0%
- 701
 
2.3%
1 674
 
2.2%
0 600
 
1.9%
, 580
 
1.9%
2 469
 
1.5%
( 341
 
1.1%
) 340
 
1.1%
[ 300
 
1.0%
Other values (50) 1765
 
5.7%
Latin
ValueCountFrequency (%)
e 196
 
9.7%
S 106
 
5.3%
T 103
 
5.1%
o 84
 
4.2%
i 79
 
3.9%
n 76
 
3.8%
A 75
 
3.7%
t 74
 
3.7%
E 70
 
3.5%
h 67
 
3.3%
Other values (41) 1085
53.8%
Han
ValueCountFrequency (%)
3
 
6.5%
2
 
4.3%
2
 
4.3%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Other values (32) 32
69.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 78227
70.2%
ASCII 32960
29.6%
None 102
 
0.1%
CJK 45
 
< 0.1%
Punctuation 38
 
< 0.1%
Enclosed Alphanum 8
 
< 0.1%
Compat Jamo 6
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24401
74.0%
: 922
 
2.8%
- 701
 
2.1%
1 674
 
2.0%
0 600
 
1.8%
, 580
 
1.8%
2 469
 
1.4%
( 341
 
1.0%
) 340
 
1.0%
[ 300
 
0.9%
Other values (76) 3632
 
11.0%
Hangul
ValueCountFrequency (%)
2278
 
2.9%
2144
 
2.7%
2143
 
2.7%
1365
 
1.7%
1196
 
1.5%
1189
 
1.5%
1135
 
1.5%
1132
 
1.4%
1127
 
1.4%
1084
 
1.4%
Other values (1119) 63434
81.1%
None
ValueCountFrequency (%)
· 28
27.5%
15
14.7%
14
13.7%
14
13.7%
6
 
5.9%
6
 
5.9%
5
 
4.9%
4
 
3.9%
3
 
2.9%
3
 
2.9%
Other values (4) 4
 
3.9%
Punctuation
ValueCountFrequency (%)
16
42.1%
15
39.5%
3
 
7.9%
2
 
5.3%
2
 
5.3%
Compat Jamo
ValueCountFrequency (%)
6
100.0%
CJK
ValueCountFrequency (%)
3
 
6.7%
2
 
4.4%
2
 
4.4%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
1
 
2.2%
Other values (31) 31
68.9%
Enclosed Alphanum
ValueCountFrequency (%)
2
25.0%
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

도서카테고리
Text

Distinct: 103
Distinct (%): 1.6%
Missing: 0
Missing (%): 0.0%
Memory size: 49.4 KiB

Length

Max length: 22
Median length: 18
Mean length: 10.006023
Min length: 4

Characters and Unicode

Total characters: 63128
Distinct characters: 199
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 11
Unique (%): 0.2%

Sample

1st row: 경제/비즈니스/경제/경영
2nd row: 인문/인문학산책
3rd row: 에세이/산문/산문집
4th row: 가정/생활/자녀교육
5th row: 인문/심리/정신분석
Value | Count | Frequency (%)
문학/한국소설 | 967 | 15.3%
경제/비즈니스/성공철학/자기계발 | 608 | 9.6%
문학/외국소설 | 470 | 7.4%
에세이/산문/산문집 | 416 | 6.6%
경제/비즈니스/경제/경영 | 383 | 6.1%
인문/인문학산책 | 323 | 5.1%
에세이/산문/에세이 | 262 | 4.1%
경제/비즈니스/재테크/투자 | 250 | 4.0%
인문/심리/정신분석 | 194 | 3.1%
에세이/산문/자기계발 | 163 | 2.6%
Other values (97) | 2292 | 36.2%

Most occurring characters

ValueCountFrequency (%)
/ 11496
 
18.2%
4065
 
6.4%
3399
 
5.4%
2173
 
3.4%
1763
 
2.8%
1752
 
2.8%
1736
 
2.7%
1649
 
2.6%
1497
 
2.4%
1497
 
2.4%
Other values (189) 32101
50.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 51453
81.5%
Other Punctuation 11503
 
18.2%
Uppercase Letter 77
 
0.1%
Lowercase Letter 40
 
0.1%
Decimal Number 24
 
< 0.1%
Space Separator 19
 
< 0.1%
Math Symbol 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4065
 
7.9%
3399
 
6.6%
2173
 
4.2%
1763
 
3.4%
1752
 
3.4%
1736
 
3.4%
1649
 
3.2%
1497
 
2.9%
1497
 
2.9%
1424
 
2.8%
Other values (167) 30498
59.3%
Lowercase Letter
ValueCountFrequency (%)
o 10
25.0%
u 5
12.5%
k 5
12.5%
e 5
12.5%
i 5
12.5%
t 5
12.5%
l 5
12.5%
Decimal Number
ValueCountFrequency (%)
2 5
20.8%
6 5
20.8%
1 5
20.8%
5 5
20.8%
3 2
 
8.3%
4 2
 
8.3%
Uppercase Letter
ValueCountFrequency (%)
S 31
40.3%
F 26
33.8%
B 10
 
13.0%
M 5
 
6.5%
E 5
 
6.5%
Other Punctuation
ValueCountFrequency (%)
/ 11496
99.9%
& 7
 
0.1%
Space Separator
ValueCountFrequency (%)
19
100.0%
Math Symbol
ValueCountFrequency (%)
~ 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51453
81.5%
Common 11558
 
18.3%
Latin 117
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4065
 
7.9%
3399
 
6.6%
2173
 
4.2%
1763
 
3.4%
1752
 
3.4%
1736
 
3.4%
1649
 
3.2%
1497
 
2.9%
1497
 
2.9%
1424
 
2.8%
Other values (167) 30498
59.3%
Latin
ValueCountFrequency (%)
S 31
26.5%
F 26
22.2%
o 10
 
8.5%
B 10
 
8.5%
M 5
 
4.3%
E 5
 
4.3%
u 5
 
4.3%
k 5
 
4.3%
e 5
 
4.3%
i 5
 
4.3%
Other values (2) 10
 
8.5%
Common
ValueCountFrequency (%)
/ 11496
99.5%
19
 
0.2%
~ 12
 
0.1%
& 7
 
0.1%
2 5
 
< 0.1%
6 5
 
< 0.1%
1 5
 
< 0.1%
5 5
 
< 0.1%
3 2
 
< 0.1%
4 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51453
81.5%
ASCII 11675
 
18.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 11496
98.5%
S 31
 
0.3%
F 26
 
0.2%
19
 
0.2%
~ 12
 
0.1%
o 10
 
0.1%
B 10
 
0.1%
& 7
 
0.1%
M 5
 
< 0.1%
2 5
 
< 0.1%
Other values (12) 54
 
0.5%
Hangul
ValueCountFrequency (%)
4065
 
7.9%
3399
 
6.6%
2173
 
4.2%
1763
 
3.4%
1752
 
3.4%
1736
 
3.4%
1649
 
3.2%
1497
 
2.9%
1497
 
2.9%
1424
 
2.8%
Other values (167) 30498
59.3%

도서출판일자
DateTime

Distinct: 2311
Distinct (%): 36.7%
Missing: 17
Missing (%): 0.3%
Memory size: 49.4 KiB
Minimum: 1996-12-12 00:00:00
Maximum: 2023-07-28 00:00:00
Histogram with fixed size bins (bins=50)

저자
Text

Distinct: 4199
Distinct (%): 66.6%
Missing: 0
Missing (%): 0.0%
Memory size: 49.4 KiB

Length

Max length: 102
Median length: 3
Mean length: 5.1390078
Min length: 1

Characters and Unicode

Total characters: 32422
Distinct characters: 822
Distinct categories: 13
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3393
Unique (%): 53.8%

Sample

1st row: 권기대
2nd row: 대니엘 윌링햄
3rd row: 양희은
4th row: 최정미
5th row: 멕 애럴
ValueCountFrequency (%)
채만식 110
 
1.2%
이효석 86
 
0.9%
김동인 85
 
0.9%
46
 
0.5%
최서해 41
 
0.4%
편집부 36
 
0.4%
그림 34
 
0.4%
33
 
0.3%
33
 
0.3%
나도향 31
 
0.3%
Other values (5476) 8954
94.4%

Most occurring characters

ValueCountFrequency (%)
3182
 
9.8%
1274
 
3.9%
871
 
2.7%
691
 
2.1%
, 558
 
1.7%
537
 
1.7%
454
 
1.4%
373
 
1.2%
347
 
1.1%
321
 
1.0%
Other values (812) 23814
73.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27526
84.9%
Space Separator 3182
 
9.8%
Other Punctuation 706
 
2.2%
Lowercase Letter 390
 
1.2%
Uppercase Letter 357
 
1.1%
Close Punctuation 101
 
0.3%
Open Punctuation 101
 
0.3%
Decimal Number 37
 
0.1%
Math Symbol 8
 
< 0.1%
Dash Punctuation 7
 
< 0.1%
Other values (3) 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1274
 
4.6%
871
 
3.2%
691
 
2.5%
537
 
2.0%
454
 
1.6%
373
 
1.4%
347
 
1.3%
321
 
1.2%
315
 
1.1%
298
 
1.1%
Other values (733) 22045
80.1%
Uppercase Letter
ValueCountFrequency (%)
S 50
14.0%
B 40
11.2%
E 34
 
9.5%
K 27
 
7.6%
J 21
 
5.9%
C 21
 
5.9%
M 20
 
5.6%
A 19
 
5.3%
L 18
 
5.0%
T 14
 
3.9%
Other values (13) 93
26.1%
Lowercase Letter
ValueCountFrequency (%)
a 48
12.3%
e 41
10.5%
i 38
 
9.7%
n 33
 
8.5%
l 28
 
7.2%
r 25
 
6.4%
o 22
 
5.6%
t 21
 
5.4%
h 17
 
4.4%
d 17
 
4.4%
Other values (13) 100
25.6%
Decimal Number
ValueCountFrequency (%)
2 9
24.3%
1 8
21.6%
4 5
13.5%
9 4
10.8%
0 4
10.8%
8 2
 
5.4%
7 2
 
5.4%
5 2
 
5.4%
6 1
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 558
79.0%
. 124
 
17.6%
& 7
 
1.0%
; 7
 
1.0%
? 4
 
0.6%
# 3
 
0.4%
: 2
 
0.3%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 90
89.1%
9
 
8.9%
1
 
1.0%
1
 
1.0%
Open Punctuation
ValueCountFrequency (%)
( 90
89.1%
9
 
8.9%
1
 
1.0%
1
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
85.7%
1
 
14.3%
Math Symbol
ValueCountFrequency (%)
> 4
50.0%
< 4
50.0%
Space Separator
ValueCountFrequency (%)
3182
100.0%
Final Punctuation
ValueCountFrequency (%)
4
100.0%
Initial Punctuation
ValueCountFrequency (%)
2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27506
84.8%
Common 4149
 
12.8%
Latin 747
 
2.3%
Han 20
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1274
 
4.6%
871
 
3.2%
691
 
2.5%
537
 
2.0%
454
 
1.7%
373
 
1.4%
347
 
1.3%
321
 
1.2%
315
 
1.1%
298
 
1.1%
Other values (721) 22025
80.1%
Latin
ValueCountFrequency (%)
S 50
 
6.7%
a 48
 
6.4%
e 41
 
5.5%
B 40
 
5.4%
i 38
 
5.1%
E 34
 
4.6%
n 33
 
4.4%
l 28
 
3.7%
K 27
 
3.6%
r 25
 
3.3%
Other values (36) 383
51.3%
Common
ValueCountFrequency (%)
3182
76.7%
, 558
 
13.4%
. 124
 
3.0%
) 90
 
2.2%
( 90
 
2.2%
2 9
 
0.2%
9
 
0.2%
9
 
0.2%
1 8
 
0.2%
& 7
 
0.2%
Other values (23) 63
 
1.5%
Han
ValueCountFrequency (%)
2
10.0%
2
10.0%
2
10.0%
2
10.0%
2
10.0%
2
10.0%
2
10.0%
2
10.0%
1
5.0%
姿 1
5.0%
Other values (2) 2
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27506
84.8%
ASCII 4866
 
15.0%
None 24
 
0.1%
CJK 20
 
0.1%
Punctuation 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3182
65.4%
, 558
 
11.5%
. 124
 
2.5%
) 90
 
1.8%
( 90
 
1.8%
S 50
 
1.0%
a 48
 
1.0%
e 41
 
0.8%
B 40
 
0.8%
i 38
 
0.8%
Other values (59) 605
 
12.4%
Hangul
ValueCountFrequency (%)
1274
 
4.6%
871
 
3.2%
691
 
2.5%
537
 
2.0%
454
 
1.7%
373
 
1.4%
347
 
1.3%
321
 
1.2%
315
 
1.1%
298
 
1.1%
Other values (721) 22025
80.1%
None
ValueCountFrequency (%)
9
37.5%
9
37.5%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
Punctuation
ValueCountFrequency (%)
4
66.7%
2
33.3%
CJK
ValueCountFrequency (%)
2
10.0%
2
10.0%
2
10.0%
2
10.0%
2
10.0%
2
10.0%
2
10.0%
2
10.0%
1
5.0%
姿 1
5.0%
Other values (2) 2
10.0%

서비스 유무
Boolean

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.3 KiB

True: 6254
False: 55

Value | Count | Frequency (%)
True | 6254 | 99.1%
False | 55 | 0.9%

Correlations

Variable | 도서공급사 | 서비스 유무
도서공급사 | 1.000 | 0.304
서비스 유무 | 0.304 | 1.000

Variable | 도서공급사 | 서비스 유무
도서공급사 | 1.000 | 0.203
서비스 유무 | 0.203 | 1.000

Variable | 도서공급사 | 서비스 유무
도서공급사 | 1.000 | 0.203
서비스 유무 | 0.203 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
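
The per-column nullity shown in these plots boils down to counting missing entries in each column. A minimal sketch, assuming rows are held as dictionaries and `None` stands in for `<NA>`; the two rows are abbreviated from the Sample section and `missing_per_column` is a hypothetical helper.

```python
def missing_per_column(rows, columns):
    """Count None values per column in a list of dict rows."""
    return {col: sum(1 for row in rows if row.get(col) is None) for col in columns}

# Two abbreviated rows from the Sample section; None stands in for <NA>.
rows = [
    {"출판사": "베가북스", "도서출판일자": "2023-07-28", "저자": "권기대"},
    {"출판사": "이북코리아", "도서출판일자": None, "저자": "앙리 파브르"},
]
counts = missing_per_column(rows, ["출판사", "도서출판일자", "저자"])
print(counts)
```

In the full dataset, all 17 missing cells fall in 도서출판일자, so every other column would report zero.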

Sample

Index | 도서공급사 | 출판사 | 도서분철 | 도서카테고리 | 도서출판일자 | 저자 | 서비스 유무
0 | (주)북큐브네트웍스 | 베가북스 | 챗GPT 혁명 | 경제/비즈니스/경제/경영 | 2023-07-28 | 권기대 | Y
1 | (주)북큐브네트웍스 | 웅진지식하우스 | 공부하고 있다는 착각 | 인문/인문학산책 | 2023-07-27 | 대니엘 윌링햄 | Y
2 | (주)북큐브네트웍스 | 웅진지식하우스 | 그럴 수 있어 | 에세이/산문/산문집 | 2023-07-27 | 양희은 | Y
3 | (주)북큐브네트웍스 | 위즈덤하우스 | 내 아이가 낯설어진 부모들에게 | 가정/생활/자녀교육 | 2023-07-27 | 최정미 | Y
4 | (주)북큐브네트웍스 | 갤리온 | 스몰 트라우마 | 인문/심리/정신분석 | 2023-07-27 | 멕 애럴 | Y
5 | (주)북큐브네트웍스 | 카시오페아 | 엄마가 되고 내면아이를 만났다 | 가정/생활/자녀교육 | 2023-07-27 | 안정희 | Y
6 | (주)북큐브네트웍스 | 갤리온 | 힘든 일을 먼저 하라 | 경제/비즈니스/성공철학/자기계발 | 2023-07-27 | 스콧 앨런 | Y
7 | (주)북큐브네트웍스 | 열린책들 | 꿀벌의 예언 1 | 문학/외국소설 | 2023-07-17 | 베르나르 베르베르 | Y
8 | (주)북큐브네트웍스 | 열린책들 | 꿀벌의 예언 2 | 문학/외국소설 | 2023-07-17 | 베르나르 베르베르 | Y
9 | (주)북큐브네트웍스 | 글항아리 | 한국전쟁의 기원 1 | 역사/한국사 | 2023-07-14 | 브루스 커밍스 | Y

Index | 도서공급사 | 출판사 | 도서분철 | 도서카테고리 | 도서출판일자 | 저자 | 서비스 유무
6299 | 북레일전자책 | 이북코리아 | [디지털 구연동화 환경동화] 들판의 청소부 송장벌레 | 어린이/멀티동화 | <NA> | 앙리 파브르 | N
6300 | 북레일전자책 | 이북코리아 | [디지털 구연동화 환경동화] 마음씨 착한 매미 | 어린이/멀티동화 | <NA> | 앙리 파브르 | N
6301 | 북레일전자책 | 이북코리아 | [디지털 구연동화 환경동화] 마취총을 쏘는 왕노래기벌 | 어린이/멀티동화 | <NA> | 앙리 파브르 | N
6302 | 북레일전자책 | 이북코리아 | [디지털 구연동화 환경동화] 음악가 수염풍뎅이 | 어린이/멀티동화 | <NA> | 앙리 파브르 | N
6303 | 북레일전자책 | 이북코리아 | [디지털 구연동화 환경동화] 행진하는 불개미 | 어린이/멀티동화 | <NA> | 앙리 파브르 | N
6304 | 북레일전자책 | 이북코리아 | [디지털 영어동화 Level3] Colum moor and the elves | 어린이/멀티동화 | <NA> | KidSing사업부 | N
6305 | 북레일전자책 | 이북코리아 | [디지털 영어동화 Level3] Frank`s big fish | 어린이/멀티동화 | <NA> | KidSing사업부 | N
6306 | 북레일전자책 | 이북코리아 | [디지털 영어동화 Level3] Ginger and Pickles | 어린이/멀티동화 | <NA> | KidSing사업부 | N
6307 | 북레일전자책 | 이북코리아 | [디지털 영어동화 Level3] Jack and the Beanstalk | 어린이/멀티동화 | <NA> | KidSing사업부 | N
6308 | 북레일전자책 | 이북코리아 | [디지털 영어동화 Level3] The greedy cat | 어린이/멀티동화 | <NA> | KidSing사업부 | N