Overview

Dataset statistics

Number of variables11
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory966.8 KiB
Average record size in memory99.0 B

Variable types

Text4
Categorical4
Numeric1
DateTime2

Dataset

Description부천시 공공 도서관의 전차책 대출 이력에 대한 데이터 2019, 2020, 2021년 3간 대출한 전자책에 대하여 도서명, 대출일시, 반납일시등에 대한 정보 제공
Author경기도 부천시
URLhttps://www.data.go.kr/data/15111938/fileData.do

Reproduction

Analysis started2024-04-17 09:41:31.993309
Analysis finished2024-04-17 09:41:33.552302
Duration1.56 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct3593
Distinct (%)35.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-17T18:41:33.704120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length15
Mean length15
Min length15

Characters and Unicode

Total characters150000
Distinct characters16
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1719 ?
Unique (%)17.2%

Sample

1st rowbef888000294456
2nd row9dc725830006727
3rd row1726c8301013569
4th row998a3e770139619
5th row6d35832f0059840
ValueCountFrequency (%)
5775576e0323632 84
 
0.8%
2b42538e0013345 51
 
0.5%
2d1ec49d0066485 36
 
0.4%
8b8b3fab0004748 36
 
0.4%
ba8cfcc40034403 30
 
0.3%
a7a118a30347420 27
 
0.3%
53b74b490073754 26
 
0.3%
aeba62a60125537 23
 
0.2%
542170341831439 23
 
0.2%
040d5f512586981 22
 
0.2%
Other values (3583) 9642
96.4%
2024-04-17T18:41:34.016448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 21604
14.4%
1 12608
8.4%
3 12326
8.2%
2 12281
8.2%
5 10558
 
7.0%
4 10472
 
7.0%
6 10368
 
6.9%
7 10360
 
6.9%
8 9884
 
6.6%
9 9685
 
6.5%
Other values (6) 29854
19.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 120146
80.1%
Lowercase Letter 29854
 
19.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 21604
18.0%
1 12608
10.5%
3 12326
10.3%
2 12281
10.2%
5 10558
8.8%
4 10472
8.7%
6 10368
8.6%
7 10360
8.6%
8 9884
8.2%
9 9685
8.1%
Lowercase Letter
ValueCountFrequency (%)
d 5205
17.4%
f 5179
17.3%
b 5011
16.8%
e 4918
16.5%
a 4788
16.0%
c 4753
15.9%

Most occurring scripts

ValueCountFrequency (%)
Common 120146
80.1%
Latin 29854
 
19.9%

Most frequent character per script

Common
ValueCountFrequency (%)
0 21604
18.0%
1 12608
10.5%
3 12326
10.3%
2 12281
10.2%
5 10558
8.8%
4 10472
8.7%
6 10368
8.6%
7 10360
8.6%
8 9884
8.2%
9 9685
8.1%
Latin
ValueCountFrequency (%)
d 5205
17.4%
f 5179
17.3%
b 5011
16.8%
e 4918
16.5%
a 4788
16.0%
c 4753
15.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 150000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 21604
14.4%
1 12608
8.4%
3 12326
8.2%
2 12281
8.2%
5 10558
 
7.0%
4 10472
 
7.0%
6 10368
 
6.9%
7 10360
 
6.9%
8 9884
 
6.6%
9 9685
 
6.5%
Other values (6) 29854
19.9%

성별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
6716 
0
3284 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row1
4th row0
5th row1

Common Values

ValueCountFrequency (%)
1 6716
67.2%
0 3284
32.8%

Length

2024-04-17T18:41:34.133748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:41:34.219997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 6716
67.2%
0 3284
32.8%

대출연령
Real number (ℝ)

Distinct71
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean38.2896
Minimum5
Maximum79
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-17T18:41:34.327406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile23
Q131
median38
Q345
95-th percentile55
Maximum79
Range74
Interquartile range (IQR)14

Descriptive statistics

Standard deviation10.444177
Coefficient of variation (CV)0.27276799
Kurtosis0.3913142
Mean38.2896
Median Absolute Deviation (MAD)7
Skewness0.21977858
Sum382896
Variance109.08084
MonotonicityNot monotonic
2024-04-17T18:41:34.447796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
39 440
 
4.4%
40 435
 
4.3%
45 431
 
4.3%
38 416
 
4.2%
37 406
 
4.1%
41 398
 
4.0%
31 388
 
3.9%
36 370
 
3.7%
32 356
 
3.6%
46 351
 
3.5%
Other values (61) 6009
60.1%
ValueCountFrequency (%)
5 16
0.2%
6 3
 
< 0.1%
8 5
 
0.1%
9 3
 
< 0.1%
10 10
 
0.1%
11 22
0.2%
12 15
0.1%
13 27
0.3%
14 14
0.1%
15 17
0.2%
ValueCountFrequency (%)
79 2
 
< 0.1%
78 1
 
< 0.1%
77 13
 
0.1%
73 7
 
0.1%
72 12
 
0.1%
71 7
 
0.1%
70 2
 
< 0.1%
69 1
 
< 0.1%
68 11
 
0.1%
67 37
0.4%
Distinct4239
Distinct (%)42.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-17T18:41:34.718994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length82
Median length46
Mean length12.3708
Min length1

Characters and Unicode

Total characters123708
Distinct characters1097
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2026 ?
Unique (%)20.3%

Sample

1st row백수의 1만 권 독서법
2nd rowI'm Your BooK: 네이티브가 사용하는 영어패턴은 따로 있다
3rd row화성에서 살 생각인가?
4th row생각의 보폭
5th row도덕경
ValueCountFrequency (%)
1 399
 
1.2%
나는 355
 
1.1%
2 243
 
0.7%
149
 
0.4%
어떻게 131
 
0.4%
기술 122
 
0.4%
이야기 110
 
0.3%
110
 
0.3%
했다 109
 
0.3%
108
 
0.3%
Other values (7585) 31967
94.6%
2024-04-17T18:41:35.145319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24022
 
19.4%
2612
 
2.1%
2428
 
2.0%
2228
 
1.8%
1817
 
1.5%
1509
 
1.2%
1402
 
1.1%
1374
 
1.1%
1 1319
 
1.1%
1248
 
1.0%
Other values (1087) 83749
67.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 89636
72.5%
Space Separator 24022
 
19.4%
Decimal Number 4180
 
3.4%
Other Punctuation 2089
 
1.7%
Lowercase Letter 1608
 
1.3%
Uppercase Letter 833
 
0.7%
Open Punctuation 579
 
0.5%
Close Punctuation 579
 
0.5%
Dash Punctuation 115
 
0.1%
Math Symbol 49
 
< 0.1%
Other values (3) 18
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2612
 
2.9%
2428
 
2.7%
2228
 
2.5%
1817
 
2.0%
1509
 
1.7%
1402
 
1.6%
1374
 
1.5%
1248
 
1.4%
1206
 
1.3%
1174
 
1.3%
Other values (999) 72638
81.0%
Lowercase Letter
ValueCountFrequency (%)
e 214
13.3%
o 164
 
10.2%
i 139
 
8.6%
r 133
 
8.3%
t 128
 
8.0%
n 120
 
7.5%
s 81
 
5.0%
a 79
 
4.9%
d 62
 
3.9%
y 61
 
3.8%
Other values (15) 427
26.6%
Uppercase Letter
ValueCountFrequency (%)
S 88
 
10.6%
E 65
 
7.8%
T 57
 
6.8%
A 55
 
6.6%
O 49
 
5.9%
B 48
 
5.8%
H 44
 
5.3%
W 41
 
4.9%
I 40
 
4.8%
N 39
 
4.7%
Other values (15) 307
36.9%
Other Punctuation
ValueCountFrequency (%)
. 853
40.8%
, 614
29.4%
: 301
 
14.4%
? 160
 
7.7%
! 106
 
5.1%
& 14
 
0.7%
' 13
 
0.6%
% 12
 
0.6%
· 9
 
0.4%
/ 5
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 1319
31.6%
0 894
21.4%
2 695
16.6%
3 311
 
7.4%
9 218
 
5.2%
5 210
 
5.0%
4 186
 
4.4%
8 141
 
3.4%
7 106
 
2.5%
6 100
 
2.4%
Math Symbol
ValueCountFrequency (%)
+ 28
57.1%
~ 19
38.8%
> 1
 
2.0%
< 1
 
2.0%
Open Punctuation
ValueCountFrequency (%)
( 536
92.6%
[ 39
 
6.7%
4
 
0.7%
Close Punctuation
ValueCountFrequency (%)
) 536
92.6%
] 39
 
6.7%
4
 
0.7%
Final Punctuation
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Initial Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
24022
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 115
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 89608
72.4%
Common 31631
 
25.6%
Latin 2441
 
2.0%
Han 28
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2612
 
2.9%
2428
 
2.7%
2228
 
2.5%
1817
 
2.0%
1509
 
1.7%
1402
 
1.6%
1374
 
1.5%
1248
 
1.4%
1206
 
1.3%
1174
 
1.3%
Other values (988) 72610
81.0%
Latin
ValueCountFrequency (%)
e 214
 
8.8%
o 164
 
6.7%
i 139
 
5.7%
r 133
 
5.4%
t 128
 
5.2%
n 120
 
4.9%
S 88
 
3.6%
s 81
 
3.3%
a 79
 
3.2%
E 65
 
2.7%
Other values (40) 1230
50.4%
Common
ValueCountFrequency (%)
24022
75.9%
1 1319
 
4.2%
0 894
 
2.8%
. 853
 
2.7%
2 695
 
2.2%
, 614
 
1.9%
( 536
 
1.7%
) 536
 
1.7%
3 311
 
1.0%
: 301
 
1.0%
Other values (28) 1550
 
4.9%
Han
ValueCountFrequency (%)
4
14.3%
4
14.3%
4
14.3%
4
14.3%
4
14.3%
3
10.7%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 89608
72.4%
ASCII 34048
 
27.5%
CJK 28
 
< 0.1%
None 17
 
< 0.1%
Punctuation 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24022
70.6%
1 1319
 
3.9%
0 894
 
2.6%
. 853
 
2.5%
2 695
 
2.0%
, 614
 
1.8%
( 536
 
1.6%
) 536
 
1.6%
3 311
 
0.9%
: 301
 
0.9%
Other values (71) 3967
 
11.7%
Hangul
ValueCountFrequency (%)
2612
 
2.9%
2428
 
2.7%
2228
 
2.5%
1817
 
2.0%
1509
 
1.7%
1402
 
1.6%
1374
 
1.5%
1248
 
1.4%
1206
 
1.3%
1174
 
1.3%
Other values (988) 72610
81.0%
None
ValueCountFrequency (%)
· 9
52.9%
4
23.5%
4
23.5%
CJK
ValueCountFrequency (%)
4
14.3%
4
14.3%
4
14.3%
4
14.3%
4
14.3%
3
10.7%
1
 
3.6%
1
 
3.6%
1
 
3.6%
1
 
3.6%
Punctuation
ValueCountFrequency (%)
3
42.9%
2
28.6%
1
 
14.3%
1
 
14.3%
Distinct3139
Distinct (%)31.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-17T18:41:35.465543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length3
Mean length4.8555
Min length2

Characters and Unicode

Total characters48555
Distinct characters721
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1316 ?
Unique (%)13.2%

Sample

1st row김병완
2nd rowJaymax Lee (이충훈)
3rd row이사카 고타로
4th row모리 히로시
5th row노자
ValueCountFrequency (%)
김영하 166
 
1.1%
게이고 156
 
1.1%
히가시노 156
 
1.1%
김진명 81
 
0.6%
유시민 81
 
0.6%
박시백 79
 
0.5%
다카시 73
 
0.5%
베르나르 69
 
0.5%
베르베르 69
 
0.5%
원작 59
 
0.4%
Other values (3928) 13520
93.2%
2024-04-17T18:41:35.880617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4509
 
9.3%
1777
 
3.7%
1261
 
2.6%
1130
 
2.3%
900
 
1.9%
823
 
1.7%
614
 
1.3%
599
 
1.2%
568
 
1.2%
553
 
1.1%
Other values (711) 35821
73.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42167
86.8%
Space Separator 4509
 
9.3%
Uppercase Letter 635
 
1.3%
Lowercase Letter 528
 
1.1%
Open Punctuation 223
 
0.5%
Close Punctuation 223
 
0.5%
Other Punctuation 212
 
0.4%
Decimal Number 29
 
0.1%
Math Symbol 24
 
< 0.1%
Dash Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1777
 
4.2%
1261
 
3.0%
1130
 
2.7%
900
 
2.1%
823
 
2.0%
614
 
1.5%
599
 
1.4%
568
 
1.3%
553
 
1.3%
546
 
1.3%
Other values (647) 33396
79.2%
Lowercase Letter
ValueCountFrequency (%)
e 90
17.0%
a 69
13.1%
r 57
10.8%
l 42
8.0%
o 38
 
7.2%
i 38
 
7.2%
s 30
 
5.7%
n 29
 
5.5%
m 22
 
4.2%
h 15
 
2.8%
Other values (13) 98
18.6%
Uppercase Letter
ValueCountFrequency (%)
B 92
14.5%
S 76
12.0%
K 63
 
9.9%
E 56
 
8.8%
T 31
 
4.9%
C 31
 
4.9%
L 30
 
4.7%
A 28
 
4.4%
J 28
 
4.4%
M 27
 
4.3%
Other values (12) 173
27.2%
Decimal Number
ValueCountFrequency (%)
2 8
27.6%
7 5
17.2%
3 5
17.2%
1 4
13.8%
6 3
 
10.3%
5 3
 
10.3%
4 1
 
3.4%
Other Punctuation
ValueCountFrequency (%)
. 183
86.3%
, 13
 
6.1%
& 9
 
4.2%
' 7
 
3.3%
Open Punctuation
ValueCountFrequency (%)
( 221
99.1%
[ 2
 
0.9%
Close Punctuation
ValueCountFrequency (%)
) 221
99.1%
] 2
 
0.9%
Math Symbol
ValueCountFrequency (%)
< 12
50.0%
> 12
50.0%
Space Separator
ValueCountFrequency (%)
4509
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42161
86.8%
Common 5225
 
10.8%
Latin 1163
 
2.4%
Han 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1777
 
4.2%
1261
 
3.0%
1130
 
2.7%
900
 
2.1%
823
 
2.0%
614
 
1.5%
599
 
1.4%
568
 
1.3%
553
 
1.3%
546
 
1.3%
Other values (644) 33390
79.2%
Latin
ValueCountFrequency (%)
B 92
 
7.9%
e 90
 
7.7%
S 76
 
6.5%
a 69
 
5.9%
K 63
 
5.4%
r 57
 
4.9%
E 56
 
4.8%
l 42
 
3.6%
o 38
 
3.3%
i 38
 
3.3%
Other values (35) 542
46.6%
Common
ValueCountFrequency (%)
4509
86.3%
( 221
 
4.2%
) 221
 
4.2%
. 183
 
3.5%
, 13
 
0.2%
< 12
 
0.2%
> 12
 
0.2%
& 9
 
0.2%
2 8
 
0.2%
' 7
 
0.1%
Other values (9) 30
 
0.6%
Han
ValueCountFrequency (%)
2
33.3%
2
33.3%
2
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42161
86.8%
ASCII 6388
 
13.2%
CJK 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4509
70.6%
( 221
 
3.5%
) 221
 
3.5%
. 183
 
2.9%
B 92
 
1.4%
e 90
 
1.4%
S 76
 
1.2%
a 69
 
1.1%
K 63
 
1.0%
r 57
 
0.9%
Other values (54) 807
 
12.6%
Hangul
ValueCountFrequency (%)
1777
 
4.2%
1261
 
3.0%
1130
 
2.7%
900
 
2.1%
823
 
2.0%
614
 
1.5%
599
 
1.4%
568
 
1.3%
553
 
1.3%
546
 
1.3%
Other values (644) 33390
79.2%
CJK
ValueCountFrequency (%)
2
33.3%
2
33.3%
2
33.3%
Distinct944
Distinct (%)9.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-17T18:41:36.161864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length29
Mean length4.6551
Min length1

Characters and Unicode

Total characters46551
Distinct characters551
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique257 ?
Unique (%)2.6%

Sample

1st row아템포
2nd row삼영서관
3rd row아르테(arte)
4th row마인드빌딩
5th row현대지성
ValueCountFrequency (%)
문학동네 733
 
7.3%
위즈덤하우스 351
 
3.5%
알에이치코리아 280
 
2.8%
21세기북스 247
 
2.5%
비즈니스북스 193
 
1.9%
웅진지식하우스 155
 
1.5%
펭귄클래식코리아 115
 
1.1%
아르테(arte 113
 
1.1%
다산북스 112
 
1.1%
열린책들 106
 
1.1%
Other values (945) 7674
76.1%
2024-04-17T18:41:36.554168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3418
 
7.3%
2081
 
4.5%
1180
 
2.5%
1141
 
2.5%
1086
 
2.3%
1046
 
2.2%
825
 
1.8%
819
 
1.8%
799
 
1.7%
798
 
1.7%
Other values (541) 33358
71.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43820
94.1%
Lowercase Letter 1047
 
2.2%
Decimal Number 624
 
1.3%
Uppercase Letter 453
 
1.0%
Open Punctuation 233
 
0.5%
Close Punctuation 233
 
0.5%
Space Separator 79
 
0.2%
Other Punctuation 60
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3418
 
7.8%
2081
 
4.7%
1180
 
2.7%
1141
 
2.6%
1086
 
2.5%
1046
 
2.4%
825
 
1.9%
819
 
1.9%
799
 
1.8%
798
 
1.8%
Other values (481) 30627
69.9%
Uppercase Letter
ValueCountFrequency (%)
B 76
16.8%
O 64
14.1%
K 51
11.3%
P 47
10.4%
S 31
 
6.8%
L 23
 
5.1%
I 21
 
4.6%
M 18
 
4.0%
H 16
 
3.5%
R 16
 
3.5%
Other values (14) 90
19.9%
Lowercase Letter
ValueCountFrequency (%)
e 200
19.1%
a 156
14.9%
t 139
13.3%
r 135
12.9%
o 88
8.4%
s 59
 
5.6%
b 43
 
4.1%
k 35
 
3.3%
l 35
 
3.3%
h 30
 
2.9%
Other values (11) 127
12.1%
Decimal Number
ValueCountFrequency (%)
2 258
41.3%
1 255
40.9%
0 37
 
5.9%
3 27
 
4.3%
4 21
 
3.4%
6 15
 
2.4%
5 8
 
1.3%
9 3
 
0.5%
Other Punctuation
ValueCountFrequency (%)
. 45
75.0%
& 14
 
23.3%
: 1
 
1.7%
Open Punctuation
ValueCountFrequency (%)
( 233
100.0%
Close Punctuation
ValueCountFrequency (%)
) 233
100.0%
Space Separator
ValueCountFrequency (%)
79
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43820
94.1%
Latin 1500
 
3.2%
Common 1231
 
2.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3418
 
7.8%
2081
 
4.7%
1180
 
2.7%
1141
 
2.6%
1086
 
2.5%
1046
 
2.4%
825
 
1.9%
819
 
1.9%
799
 
1.8%
798
 
1.8%
Other values (481) 30627
69.9%
Latin
ValueCountFrequency (%)
e 200
13.3%
a 156
 
10.4%
t 139
 
9.3%
r 135
 
9.0%
o 88
 
5.9%
B 76
 
5.1%
O 64
 
4.3%
s 59
 
3.9%
K 51
 
3.4%
P 47
 
3.1%
Other values (35) 485
32.3%
Common
ValueCountFrequency (%)
2 258
21.0%
1 255
20.7%
( 233
18.9%
) 233
18.9%
79
 
6.4%
. 45
 
3.7%
0 37
 
3.0%
3 27
 
2.2%
4 21
 
1.7%
6 15
 
1.2%
Other values (5) 28
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43820
94.1%
ASCII 2731
 
5.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3418
 
7.8%
2081
 
4.7%
1180
 
2.7%
1141
 
2.6%
1086
 
2.5%
1046
 
2.4%
825
 
1.9%
819
 
1.9%
799
 
1.8%
798
 
1.8%
Other values (481) 30627
69.9%
ASCII
ValueCountFrequency (%)
2 258
 
9.4%
1 255
 
9.3%
( 233
 
8.5%
) 233
 
8.5%
e 200
 
7.3%
a 156
 
5.7%
t 139
 
5.1%
r 135
 
4.9%
o 88
 
3.2%
79
 
2.9%
Other values (50) 955
35.0%

카테고리
Categorical

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
소설/희곡
2300 
자기계발
1553 
경영/경제
1200 
인문
1063 
시/에세이
1012 
Other values (19)
2872 

Length

Max length11
Median length5
Mean length4.6562
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자기계발
2nd row국어/외국어
3rd row소설/희곡
4th row인문
5th row인문

Common Values

ValueCountFrequency (%)
소설/희곡 2300
23.0%
자기계발 1553
15.5%
경영/경제 1200
12.0%
인문 1063
10.6%
시/에세이 1012
10.1%
아동 477
 
4.8%
가족/생활/요리 408
 
4.1%
역사/풍속/신화 355
 
3.5%
여행/취미 302
 
3.0%
국어/외국어 260
 
2.6%
Other values (14) 1070
10.7%

Length

2024-04-17T18:41:36.664986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
소설/희곡 2300
23.0%
자기계발 1553
15.5%
경영/경제 1200
12.0%
인문 1063
10.6%
시/에세이 1012
10.1%
아동 477
 
4.8%
가족/생활/요리 408
 
4.1%
역사/풍속/신화 355
 
3.5%
여행/취미 302
 
3.0%
국어/외국어 260
 
2.6%
Other values (14) 1070
10.7%
Distinct9952
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2019-01-01 00:02:40
Maximum2020-02-12 00:36:48
2024-04-17T18:41:36.763916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T18:41:36.881125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct7856
Distinct (%)78.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2019-01-01 02:11:19
Maximum2020-03-10 00:05:32
2024-04-17T18:41:36.998062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T18:41:37.124064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

대출코드
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
S
7136 
T
1814 
W
1050 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowS
2nd rowS
3rd rowS
4th rowS
5th rowT

Common Values

ValueCountFrequency (%)
S 7136
71.4%
T 1814
 
18.1%
W 1050
 
10.5%

Length

2024-04-17T18:41:37.239155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:41:37.520752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
s 7136
71.4%
t 1814
 
18.1%
w 1050
 
10.5%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
7952 
1
2048 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row1
5th row0

Common Values

ValueCountFrequency (%)
0 7952
79.5%
1 2048
 
20.5%

Length

2024-04-17T18:41:37.607806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T18:41:37.701149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 7952
79.5%
1 2048
 
20.5%

Interactions

2024-04-17T18:41:33.217753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T18:41:37.758902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
성별대출연령카테고리대출코드대출연장횟수
성별1.0000.3460.2580.0380.119
대출연령0.3461.0000.2690.1740.135
카테고리0.2580.2691.0000.2400.163
대출코드0.0380.1740.2401.0000.013
대출연장횟수0.1190.1350.1630.0131.000
2024-04-17T18:41:37.840753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대출코드대출연장횟수성별카테고리
대출코드1.0000.0210.0640.113
대출연장횟수0.0211.0000.0760.129
성별0.0640.0761.0000.204
카테고리0.1130.1290.2041.000
2024-04-17T18:41:37.921551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대출연령성별카테고리대출코드대출연장횟수
대출연령1.0000.2650.1020.1050.104
성별0.2651.0000.2040.0640.076
카테고리0.1020.2041.0000.1130.129
대출코드0.1050.0640.1131.0000.021
대출연장횟수0.1040.0760.1290.0211.000

Missing values

2024-04-17T18:41:33.346021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T18:41:33.488833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

아이디성별대출연령도서명저자명출판사명카테고리대출일시반납일시대출코드대출연장횟수
13991bef888000294456031백수의 1만 권 독서법김병완아템포자기계발2019-03-10 23:33:392019-03-24 00:02:12S0
433279dc725830006727134I'm Your BooK: 네이티브가 사용하는 영어패턴은 따로 있다Jaymax Lee (이충훈)삼영서관국어/외국어2019-08-02 18:22:562019-08-16 00:02:11S0
220751726c8301013569150화성에서 살 생각인가?이사카 고타로아르테(arte)소설/희곡2019-04-18 13:52:582019-05-16 00:02:14S1
80446998a3e770139619034생각의 보폭모리 히로시마인드빌딩인문2020-01-25 15:08:492020-02-22 00:05:36S1
675336d35832f0059840150도덕경노자현대지성인문2019-11-29 11:43:052019-12-13 00:05:07T0
75843895eedda0416387029인류를 구한 12가지 약 이야기정승규반니역사/풍속/신화2020-01-06 23:20:532020-01-20 00:06:02S0
291830293a18a1398920131버티는 삶에 관하여허지웅문학동네시/에세이2019-05-25 02:21:212019-05-26 01:43:15S0
18724fc4d52240321458044이동진 독서법이동진예담인문2019-04-02 09:00:282019-04-30 00:02:14S1
67578229be6db002981603635년. 4: 1926-1930 학생 대중아 궐기하자박시백비아북역사/풍속/신화2019-11-29 18:18:172019-12-26 22:55:48S1
451173610f6ac0077317134여자들은 모르는 남자들의 심리이성현21세기북스시/에세이2019-08-11 14:32:472019-08-25 00:02:07S0
아이디성별대출연령도서명저자명출판사명카테고리대출일시반납일시대출코드대출연장횟수
13334d2dac7c80681410149생각의 힘이나모리 가즈오한국경제신문경영/경제2019-03-07 16:15:272019-03-14 15:29:10S0
11969e73517e20023961132이것이 중국의 역사다. 1: 고대부터 위진남북조 시대까지홍이애플북스역사/풍속/신화2019-02-28 02:51:292019-03-14 00:02:34S0
83905e46cd60025631143[필독서 따라잡기] 니코마코스 윤리학(아리스토텔레스)베리타스알파베리타스알파청소년교양2019-01-04 22:39:372019-01-18 00:02:48S0
750027c96f27c0369687126FBI 행동의 심리학조 내버로리더스북자기계발2020-01-03 09:47:172020-01-03 12:35:01W0
3945768f990571081980035시골의사의 주식투자란 무엇인가. 1 통찰 편박경철리더스북경영/경제2019-07-15 17:16:592019-07-29 00:02:08W0
702148f9ff1c90365734122오늘처럼 내가 싫었던 날은 없다글배우21세기북스인문2019-12-12 15:39:362019-12-26 00:05:45S0
27560ee67d7210037927135버텨내어 좋은 일 투성이설레다엔트리시/에세이2019-05-16 14:44:312019-05-30 00:02:10S0
7878108f1d8811531465040살인자의 기억법김영하문학동네소설/희곡2020-01-19 00:34:502020-01-19 22:00:57S0
10790fc0b5a140060120053고구려. 4 사유와 무 (고국원왕편)김진명새움소설/희곡2019-02-22 13:20:352019-02-27 13:48:37S0
7506f113df890280964147무슨 일이 일어났는지는 아무도김영하문학동네소설/희곡2019-02-05 22:51:332019-02-17 22:38:43S1