Overview

Dataset statistics

Number of variables9
Number of observations10000
Missing cells2344
Missing cells (%)2.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory800.8 KiB
Average record size in memory82.0 B

Variable types

Numeric1
Text4
DateTime1
Categorical3

Dataset

Description임대주택 입주민들이 이용할 수 있는 한국토지주택공사 디지털도서관 내 소장 중인 전자책의 제목, 저자, 발행처, 출판일 등에 관한 데이터를 제공합니다.
Author한국토지주택공사
URLhttps://www.data.go.kr/data/15092159/fileData.do

Alerts

대분류 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
중분류 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
순번 is highly overall correlated with 대분류 and 1 other fieldsHigh correlation
저자 has 2135 (21.3%) missing valuesMissing
출판일 has 209 (2.1%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:39:44.572706
Analysis finished2023-12-12 12:39:47.456928
Duration2.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8750.8087
Minimum3
Maximum17561
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T21:39:47.532604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile873.95
Q14357.5
median8762.5
Q313109.25
95-th percentile16669.25
Maximum17561
Range17558
Interquartile range (IQR)8751.75

Descriptive statistics

Standard deviation5056.8141
Coefficient of variation (CV)0.57786821
Kurtosis-1.1925013
Mean8750.8087
Median Absolute Deviation (MAD)4376
Skewness0.0098417664
Sum87508087
Variance25571369
MonotonicityNot monotonic
2023-12-12T21:39:47.681711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11376 1
 
< 0.1%
4296 1
 
< 0.1%
12363 1
 
< 0.1%
2385 1
 
< 0.1%
7451 1
 
< 0.1%
11391 1
 
< 0.1%
4268 1
 
< 0.1%
14612 1
 
< 0.1%
12452 1
 
< 0.1%
3065 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
3 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
10 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
15 1
< 0.1%
16 1
< 0.1%
17 1
< 0.1%
ValueCountFrequency (%)
17561 1
< 0.1%
17558 1
< 0.1%
17557 1
< 0.1%
17556 1
< 0.1%
17554 1
< 0.1%
17553 1
< 0.1%
17552 1
< 0.1%
17546 1
< 0.1%
17543 1
< 0.1%
17542 1
< 0.1%
Distinct9951
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T21:39:47.971477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length72
Median length53
Mean length15.2218
Min length1

Characters and Unicode

Total characters152218
Distinct characters1332
Distinct categories17 ?
Distinct scripts4 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9905 ?
Unique (%)99.1%

Sample

1st row올에프 선생님
2nd row레오나르도 다 빈치
3rd row마법의 성에서 꺼내온 따끈따끈한 이야기 2
4th row그림형제 동화집17
5th rowBlack Heart And White Heart
ValueCountFrequency (%)
1601
 
4.0%
the 523
 
1.3%
1 357
 
0.9%
2 331
 
0.8%
of 296
 
0.7%
이야기 274
 
0.7%
위한 264
 
0.7%
세계 218
 
0.5%
읽는 182
 
0.5%
나는 164
 
0.4%
Other values (15888) 35721
89.5%
2023-12-12T21:39:48.489037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30043
 
19.7%
2831
 
1.9%
2364
 
1.6%
e 2227
 
1.5%
2166
 
1.4%
- 1829
 
1.2%
1582
 
1.0%
1386
 
0.9%
1347
 
0.9%
o 1339
 
0.9%
Other values (1322) 105104
69.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 92555
60.8%
Space Separator 30043
 
19.7%
Lowercase Letter 15058
 
9.9%
Decimal Number 4917
 
3.2%
Uppercase Letter 4567
 
3.0%
Other Punctuation 1921
 
1.3%
Dash Punctuation 1829
 
1.2%
Open Punctuation 590
 
0.4%
Close Punctuation 589
 
0.4%
Math Symbol 83
 
0.1%
Other values (7) 66
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2831
 
3.1%
2364
 
2.6%
2166
 
2.3%
1582
 
1.7%
1386
 
1.5%
1347
 
1.5%
1302
 
1.4%
1235
 
1.3%
1229
 
1.3%
1210
 
1.3%
Other values (1216) 75903
82.0%
Lowercase Letter
ValueCountFrequency (%)
e 2227
14.8%
o 1339
 
8.9%
a 1171
 
7.8%
r 1155
 
7.7%
n 1149
 
7.6%
i 1019
 
6.8%
t 933
 
6.2%
s 920
 
6.1%
h 903
 
6.0%
l 757
 
5.0%
Other values (16) 3485
23.1%
Uppercase Letter
ValueCountFrequency (%)
T 729
16.0%
A 392
 
8.6%
S 374
 
8.2%
O 354
 
7.8%
M 257
 
5.6%
I 236
 
5.2%
C 207
 
4.5%
E 207
 
4.5%
B 197
 
4.3%
P 191
 
4.2%
Other values (16) 1423
31.2%
Other Punctuation
ValueCountFrequency (%)
, 988
51.4%
? 208
 
10.8%
. 206
 
10.7%
! 194
 
10.1%
: 165
 
8.6%
' 72
 
3.7%
% 37
 
1.9%
· 18
 
0.9%
& 15
 
0.8%
; 11
 
0.6%
Other values (4) 7
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 1217
24.8%
2 961
19.5%
0 901
18.3%
3 445
 
9.1%
4 341
 
6.9%
5 285
 
5.8%
6 237
 
4.8%
7 187
 
3.8%
9 174
 
3.5%
8 169
 
3.4%
Open Punctuation
ValueCountFrequency (%)
( 387
65.6%
121
 
20.5%
[ 77
 
13.1%
5
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 387
65.7%
121
 
20.5%
] 77
 
13.1%
4
 
0.7%
Math Symbol
ValueCountFrequency (%)
~ 53
63.9%
+ 21
 
25.3%
× 7
 
8.4%
÷ 2
 
2.4%
Letter Number
ValueCountFrequency (%)
10
52.6%
5
26.3%
2
 
10.5%
2
 
10.5%
Other Number
ValueCountFrequency (%)
3
37.5%
2
25.0%
2
25.0%
1
 
12.5%
Final Punctuation
ValueCountFrequency (%)
11
73.3%
4
 
26.7%
Initial Punctuation
ValueCountFrequency (%)
4
57.1%
3
42.9%
Modifier Symbol
ValueCountFrequency (%)
` 3
75.0%
´ 1
 
25.0%
Space Separator
ValueCountFrequency (%)
30043
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1829
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 12
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 92520
60.8%
Common 40019
26.3%
Latin 19644
 
12.9%
Han 35
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2831
 
3.1%
2364
 
2.6%
2166
 
2.3%
1582
 
1.7%
1386
 
1.5%
1347
 
1.5%
1302
 
1.4%
1235
 
1.3%
1229
 
1.3%
1210
 
1.3%
Other values (1182) 75868
82.0%
Latin
ValueCountFrequency (%)
e 2227
 
11.3%
o 1339
 
6.8%
a 1171
 
6.0%
r 1155
 
5.9%
n 1149
 
5.8%
i 1019
 
5.2%
t 933
 
4.7%
s 920
 
4.7%
h 903
 
4.6%
l 757
 
3.9%
Other values (46) 8071
41.1%
Common
ValueCountFrequency (%)
30043
75.1%
- 1829
 
4.6%
1 1217
 
3.0%
, 988
 
2.5%
2 961
 
2.4%
0 901
 
2.3%
3 445
 
1.1%
( 387
 
1.0%
) 387
 
1.0%
4 341
 
0.9%
Other values (40) 2520
 
6.3%
Han
ValueCountFrequency (%)
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (24) 24
68.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 92465
60.7%
ASCII 59330
39.0%
None 280
 
0.2%
Compat Jamo 55
 
< 0.1%
CJK 35
 
< 0.1%
Punctuation 25
 
< 0.1%
Number Forms 19
 
< 0.1%
Enclosed Alphanum 8
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30043
50.6%
e 2227
 
3.8%
- 1829
 
3.1%
o 1339
 
2.3%
1 1217
 
2.1%
a 1171
 
2.0%
r 1155
 
1.9%
n 1149
 
1.9%
i 1019
 
1.7%
, 988
 
1.7%
Other values (73) 17193
29.0%
Hangul
ValueCountFrequency (%)
2831
 
3.1%
2364
 
2.6%
2166
 
2.3%
1582
 
1.7%
1386
 
1.5%
1347
 
1.5%
1302
 
1.4%
1235
 
1.3%
1229
 
1.3%
1210
 
1.3%
Other values (1181) 75813
82.0%
None
ValueCountFrequency (%)
121
43.2%
121
43.2%
· 18
 
6.4%
× 7
 
2.5%
5
 
1.8%
4
 
1.4%
÷ 2
 
0.7%
1
 
0.4%
´ 1
 
0.4%
Compat Jamo
ValueCountFrequency (%)
55
100.0%
Punctuation
ValueCountFrequency (%)
11
44.0%
4
 
16.0%
4
 
16.0%
3
 
12.0%
3
 
12.0%
Number Forms
ValueCountFrequency (%)
10
52.6%
5
26.3%
2
 
10.5%
2
 
10.5%
Enclosed Alphanum
ValueCountFrequency (%)
3
37.5%
2
25.0%
2
25.0%
1
 
12.5%
CJK
ValueCountFrequency (%)
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (24) 24
68.6%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%

저자
Text

MISSING 

Distinct5120
Distinct (%)65.1%
Missing2135
Missing (%)21.3%
Memory size156.2 KiB
2023-12-12T21:39:48.882778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length82
Median length3
Mean length5.5605849
Min length1

Characters and Unicode

Total characters43734
Distinct characters848
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4130 ?
Unique (%)52.5%

Sample

1st row미야모토 마사하루
2nd row그림형제
3rd row이화득
4th row이경윤
5th row작자미상 외
ValueCountFrequency (%)
편집부 182
 
1.5%
그림 88
 
0.7%
76
 
0.6%
72
 
0.6%
ebook 58
 
0.5%
korea 58
 
0.5%
한국교육방송공사 54
 
0.4%
이효석 46
 
0.4%
구인환 45
 
0.4%
이태준 45
 
0.4%
Other values (6652) 11573
94.1%
2023-12-12T21:39:49.536871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4433
 
10.1%
1628
 
3.7%
1092
 
2.5%
, 949
 
2.2%
875
 
2.0%
681
 
1.6%
587
 
1.3%
511
 
1.2%
490
 
1.1%
406
 
0.9%
Other values (838) 32082
73.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35331
80.8%
Space Separator 4433
 
10.1%
Lowercase Letter 1650
 
3.8%
Other Punctuation 1121
 
2.6%
Uppercase Letter 818
 
1.9%
Close Punctuation 156
 
0.4%
Open Punctuation 156
 
0.4%
Decimal Number 42
 
0.1%
Dash Punctuation 12
 
< 0.1%
Math Symbol 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1628
 
4.6%
1092
 
3.1%
875
 
2.5%
681
 
1.9%
587
 
1.7%
511
 
1.4%
490
 
1.4%
406
 
1.1%
366
 
1.0%
350
 
1.0%
Other values (760) 28345
80.2%
Lowercase Letter
ValueCountFrequency (%)
e 237
14.4%
o 233
14.1%
a 206
12.5%
r 145
8.8%
n 119
 
7.2%
i 116
 
7.0%
k 95
 
5.8%
h 69
 
4.2%
l 69
 
4.2%
t 62
 
3.8%
Other values (15) 299
18.1%
Uppercase Letter
ValueCountFrequency (%)
B 105
12.8%
K 103
12.6%
M 66
 
8.1%
S 65
 
7.9%
H 50
 
6.1%
R 48
 
5.9%
T 48
 
5.9%
C 43
 
5.3%
A 34
 
4.2%
O 31
 
3.8%
Other values (14) 225
27.5%
Other Punctuation
ValueCountFrequency (%)
, 949
84.7%
. 154
 
13.7%
" 6
 
0.5%
\ 3
 
0.3%
; 2
 
0.2%
' 2
 
0.2%
2
 
0.2%
& 2
 
0.2%
? 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 9
21.4%
2 8
19.0%
5 7
16.7%
3 6
14.3%
9 4
9.5%
0 3
 
7.1%
4 2
 
4.8%
8 2
 
4.8%
7 1
 
2.4%
Other Symbol
ValueCountFrequency (%)
2
40.0%
2
40.0%
1
20.0%
Close Punctuation
ValueCountFrequency (%)
) 145
92.9%
11
 
7.1%
Open Punctuation
ValueCountFrequency (%)
( 145
92.9%
11
 
7.1%
Math Symbol
ValueCountFrequency (%)
| 9
90.0%
1
 
10.0%
Space Separator
ValueCountFrequency (%)
4433
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35309
80.7%
Common 5934
 
13.6%
Latin 2468
 
5.6%
Han 23
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1628
 
4.6%
1092
 
3.1%
875
 
2.5%
681
 
1.9%
587
 
1.7%
511
 
1.4%
490
 
1.4%
406
 
1.1%
366
 
1.0%
350
 
1.0%
Other values (748) 28323
80.2%
Latin
ValueCountFrequency (%)
e 237
 
9.6%
o 233
 
9.4%
a 206
 
8.3%
r 145
 
5.9%
n 119
 
4.8%
i 116
 
4.7%
B 105
 
4.3%
K 103
 
4.2%
k 95
 
3.8%
h 69
 
2.8%
Other values (39) 1040
42.1%
Common
ValueCountFrequency (%)
4433
74.7%
, 949
 
16.0%
. 154
 
2.6%
) 145
 
2.4%
( 145
 
2.4%
- 12
 
0.2%
11
 
0.2%
11
 
0.2%
1 9
 
0.2%
| 9
 
0.2%
Other values (18) 56
 
0.9%
Han
ValueCountFrequency (%)
3
13.0%
3
13.0%
3
13.0%
3
13.0%
2
8.7%
2
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (3) 3
13.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35308
80.7%
ASCII 8373
 
19.1%
None 25
 
0.1%
CJK 23
 
0.1%
Geometric Shapes 2
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4433
52.9%
, 949
 
11.3%
e 237
 
2.8%
o 233
 
2.8%
a 206
 
2.5%
. 154
 
1.8%
r 145
 
1.7%
) 145
 
1.7%
( 145
 
1.7%
n 119
 
1.4%
Other values (61) 1607
 
19.2%
Hangul
ValueCountFrequency (%)
1628
 
4.6%
1092
 
3.1%
875
 
2.5%
681
 
1.9%
587
 
1.7%
511
 
1.4%
490
 
1.4%
406
 
1.1%
366
 
1.0%
350
 
1.0%
Other values (747) 28322
80.2%
None
ValueCountFrequency (%)
11
44.0%
11
44.0%
2
 
8.0%
1
 
4.0%
CJK
ValueCountFrequency (%)
3
13.0%
3
13.0%
3
13.0%
3
13.0%
2
8.7%
2
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (3) 3
13.0%
Geometric Shapes
ValueCountFrequency (%)
2
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
2
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Distinct1062
Distinct (%)10.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T21:39:49.866737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length5.0733
Min length1

Characters and Unicode

Total characters50733
Distinct characters570
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique321 ?
Unique (%)3.2%

Sample

1st row다산에듀
2nd row푸른들
3rd rowebook Korea
4th row옹달샘
5th rowFantasien
ValueCountFrequency (%)
작가문화 378
 
3.5%
엑스트라클래스 374
 
3.5%
신원문화사 313
 
2.9%
ebook 183
 
1.7%
korea 183
 
1.7%
도서출판 166
 
1.6%
위즈덤하우스 157
 
1.5%
자음과모음 152
 
1.4%
ybm시사닷컴 144
 
1.3%
들녘 113
 
1.1%
Other values (1064) 8529
79.8%
2023-12-12T21:39:50.401827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2396
 
4.7%
1389
 
2.7%
1243
 
2.5%
o 1120
 
2.2%
991
 
2.0%
e 894
 
1.8%
881
 
1.7%
a 873
 
1.7%
862
 
1.7%
i 737
 
1.5%
Other values (560) 39347
77.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 38938
76.8%
Lowercase Letter 7895
 
15.6%
Uppercase Letter 2366
 
4.7%
Space Separator 692
 
1.4%
Decimal Number 262
 
0.5%
Open Punctuation 221
 
0.4%
Close Punctuation 221
 
0.4%
Other Punctuation 81
 
0.2%
Other Symbol 57
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2396
 
6.2%
1389
 
3.6%
1243
 
3.2%
991
 
2.5%
881
 
2.3%
862
 
2.2%
714
 
1.8%
713
 
1.8%
596
 
1.5%
567
 
1.5%
Other values (499) 28586
73.4%
Uppercase Letter
ValueCountFrequency (%)
C 306
12.9%
B 293
12.4%
K 290
12.3%
L 200
8.5%
P 193
8.2%
M 184
7.8%
Y 144
 
6.1%
E 131
 
5.5%
A 113
 
4.8%
H 97
 
4.1%
Other values (13) 415
17.5%
Lowercase Letter
ValueCountFrequency (%)
o 1120
14.2%
e 894
11.3%
a 873
11.1%
i 737
9.3%
r 700
8.9%
s 579
 
7.3%
n 362
 
4.6%
b 346
 
4.4%
c 299
 
3.8%
l 249
 
3.2%
Other values (11) 1736
22.0%
Decimal Number
ValueCountFrequency (%)
1 95
36.3%
2 92
35.1%
0 35
 
13.4%
3 22
 
8.4%
4 11
 
4.2%
6 3
 
1.1%
5 3
 
1.1%
7 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 32
39.5%
# 31
38.3%
: 11
 
13.6%
& 6
 
7.4%
? 1
 
1.2%
Space Separator
ValueCountFrequency (%)
692
100.0%
Open Punctuation
ValueCountFrequency (%)
( 221
100.0%
Close Punctuation
ValueCountFrequency (%)
) 221
100.0%
Other Symbol
ValueCountFrequency (%)
57
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 38970
76.8%
Latin 10261
 
20.2%
Common 1477
 
2.9%
Han 25
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2396
 
6.1%
1389
 
3.6%
1243
 
3.2%
991
 
2.5%
881
 
2.3%
862
 
2.2%
714
 
1.8%
713
 
1.8%
596
 
1.5%
567
 
1.5%
Other values (495) 28618
73.4%
Latin
ValueCountFrequency (%)
o 1120
 
10.9%
e 894
 
8.7%
a 873
 
8.5%
i 737
 
7.2%
r 700
 
6.8%
s 579
 
5.6%
n 362
 
3.5%
b 346
 
3.4%
C 306
 
3.0%
c 299
 
2.9%
Other values (34) 4045
39.4%
Common
ValueCountFrequency (%)
692
46.9%
( 221
 
15.0%
) 221
 
15.0%
1 95
 
6.4%
2 92
 
6.2%
0 35
 
2.4%
. 32
 
2.2%
# 31
 
2.1%
3 22
 
1.5%
: 11
 
0.7%
Other values (6) 25
 
1.7%
Han
ValueCountFrequency (%)
11
44.0%
11
44.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 38913
76.7%
ASCII 11738
 
23.1%
None 57
 
0.1%
CJK 24
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2396
 
6.2%
1389
 
3.6%
1243
 
3.2%
991
 
2.5%
881
 
2.3%
862
 
2.2%
714
 
1.8%
713
 
1.8%
596
 
1.5%
567
 
1.5%
Other values (494) 28561
73.4%
ASCII
ValueCountFrequency (%)
o 1120
 
9.5%
e 894
 
7.6%
a 873
 
7.4%
i 737
 
6.3%
r 700
 
6.0%
692
 
5.9%
s 579
 
4.9%
n 362
 
3.1%
b 346
 
2.9%
C 306
 
2.6%
Other values (50) 5129
43.7%
None
ValueCountFrequency (%)
57
100.0%
CJK
ValueCountFrequency (%)
11
45.8%
11
45.8%
1
 
4.2%
1
 
4.2%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%

출판일
Date

MISSING 

Distinct2655
Distinct (%)27.1%
Missing209
Missing (%)2.1%
Memory size156.2 KiB
Minimum2000-01-01 00:00:00
Maximum2020-10-08 00:00:00
2023-12-12T21:39:50.543920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:39:50.696377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
10
4842 
3
3530 
5
1624 
10000
 
4

Length

Max length5
Median length1
Mean length1.4858
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row10
3rd row10
4th row10
5th row10

Common Values

ValueCountFrequency (%)
10 4842
48.4%
3 3530
35.3%
5 1624
 
16.2%
10000 4
 
< 0.1%

Length

2023-12-12T21:39:50.901034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:39:51.058128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
10 4842
48.4%
3 3530
35.3%
5 1624
 
16.2%
10000 4
 
< 0.1%

대분류
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반인
4850 
청소년
2584 
어린이
2566 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row청소년
2nd row어린이
3rd row어린이
4th row어린이
5th row일반인

Common Values

ValueCountFrequency (%)
일반인 4850
48.5%
청소년 2584
25.8%
어린이 2566
25.7%

Length

2023-12-12T21:39:51.180824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:39:51.277947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반인 4850
48.5%
청소년 2584
25.8%
어린이 2566
25.7%

중분류
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
아동
2456 
문학
1832 
청소년교양
1697 
경영 경제
1284 
학습지 참고서
887 
Other values (12)
1844 

Length

Max length10
Median length9
Mean length3.9969
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row청소년교양
2nd row아동
3rd row아동
4th row유아
5th row문학

Common Values

ValueCountFrequency (%)
아동 2456
24.6%
문학 1832
18.3%
청소년교양 1697
17.0%
경영 경제 1284
12.8%
학습지 참고서 887
 
8.9%
가족생활 여행취미 400
 
4.0%
인문 390
 
3.9%
외국어 190
 
1.9%
역사 풍속 신화 174
 
1.7%
건강 의학 168
 
1.7%
Other values (7) 522
 
5.2%

Length

2023-12-12T21:39:51.398962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
아동 2456
18.0%
문학 1832
13.4%
청소년교양 1697
12.4%
경영 1284
9.4%
경제 1284
9.4%
학습지 887
 
6.5%
참고서 887
 
6.5%
가족생활 400
 
2.9%
여행취미 400
 
2.9%
인문 390
 
2.9%
Other values (21) 2140
15.7%
Distinct91
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T21:39:51.767589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length7.5958
Min length2

Characters and Unicode

Total characters75958
Distinct characters182
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)0.1%

Sample

1st row문학 고전 소설
2nd row역사 인물 지리
3rd row동화(한국,외국,멀티)
4th row그림책
5th row판타지
ValueCountFrequency (%)
동화(한국,외국,멀티 1708
 
8.9%
고전 1595
 
8.3%
소설 1414
 
7.4%
문학 1414
 
7.4%
공부방법 819
 
4.3%
학습일반 819
 
4.3%
외국소설 679
 
3.5%
성공 669
 
3.5%
처세 669
 
3.5%
자기계발 669
 
3.5%
Other values (134) 8692
45.4%
2023-12-12T21:39:52.230658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9149
 
12.0%
4614
 
6.1%
3812
 
5.0%
, 3416
 
4.5%
2436
 
3.2%
2322
 
3.1%
2322
 
3.1%
2021
 
2.7%
1987
 
2.6%
1807
 
2.4%
Other values (172) 42072
55.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59926
78.9%
Space Separator 9149
 
12.0%
Other Punctuation 3416
 
4.5%
Close Punctuation 1708
 
2.2%
Open Punctuation 1708
 
2.2%
Uppercase Letter 48
 
0.1%
Lowercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4614
 
7.7%
3812
 
6.4%
2436
 
4.1%
2322
 
3.9%
2322
 
3.9%
2021
 
3.4%
1987
 
3.3%
1807
 
3.0%
1710
 
2.9%
1710
 
2.9%
Other values (162) 35185
58.7%
Uppercase Letter
ValueCountFrequency (%)
S 15
31.2%
F 15
31.2%
C 6
 
12.5%
E 6
 
12.5%
O 6
 
12.5%
Space Separator
ValueCountFrequency (%)
9149
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3416
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1708
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1708
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59926
78.9%
Common 15981
 
21.0%
Latin 51
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4614
 
7.7%
3812
 
6.4%
2436
 
4.1%
2322
 
3.9%
2322
 
3.9%
2021
 
3.4%
1987
 
3.3%
1807
 
3.0%
1710
 
2.9%
1710
 
2.9%
Other values (162) 35185
58.7%
Latin
ValueCountFrequency (%)
S 15
29.4%
F 15
29.4%
C 6
 
11.8%
E 6
 
11.8%
O 6
 
11.8%
e 3
 
5.9%
Common
ValueCountFrequency (%)
9149
57.2%
, 3416
 
21.4%
) 1708
 
10.7%
( 1708
 
10.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59926
78.9%
ASCII 16032
 
21.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9149
57.1%
, 3416
 
21.3%
) 1708
 
10.7%
( 1708
 
10.7%
S 15
 
0.1%
F 15
 
0.1%
C 6
 
< 0.1%
E 6
 
< 0.1%
O 6
 
< 0.1%
e 3
 
< 0.1%
Hangul
ValueCountFrequency (%)
4614
 
7.7%
3812
 
6.4%
2436
 
4.1%
2322
 
3.9%
2322
 
3.9%
2021
 
3.4%
1987
 
3.3%
1807
 
3.0%
1710
 
2.9%
1710
 
2.9%
Other values (162) 35185
58.7%

Interactions

2023-12-12T21:39:46.962752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:39:52.333447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번동시이용 가능 수대분류중분류소분류
순번1.0000.6300.9440.9280.966
동시이용 가능 수0.6301.0000.2900.6120.738
대분류0.9440.2901.0001.0001.000
중분류0.9280.6121.0001.0001.000
소분류0.9660.7381.0001.0001.000
2023-12-12T21:39:52.431150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동시이용 가능 수대분류중분류
동시이용 가능 수1.0000.2780.389
대분류0.2781.0000.999
중분류0.3890.9991.000
2023-12-12T21:39:52.536595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번동시이용 가능 수대분류중분류
순번1.0000.4310.9320.716
동시이용 가능 수0.4311.0000.2780.389
대분류0.9320.2781.0000.999
중분류0.7160.3890.9991.000

Missing values

2023-12-12T21:39:47.126340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:39:47.289407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T21:39:47.399252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번도서명저자출판사출판일동시이용 가능 수대분류중분류소분류
1137511376올에프 선생님미야모토 마사하루다산에듀2014-02-053청소년청소년교양문학 고전 소설
1387413875레오나르도 다 빈치<NA>푸른들2003-02-0210어린이아동역사 인물 지리
1318013181마법의 성에서 꺼내온 따끈따끈한 이야기 2<NA>ebook Korea2001-03-0910어린이아동동화(한국,외국,멀티)
1304913050그림형제 동화집17그림형제옹달샘2003-02-0210어린이유아그림책
36713672Black Heart And White Heart<NA>Fantasien2003-02-0110일반인문학판타지
76037604The Miracle Mongers, An Expose'<NA>Classic house2003-02-0110일반인가족생활 여행취미취미서
80288029이화득의 유럽 자동차 여행 1 : 여행계획편이화득황금열쇠2017-05-315일반인가족생활 여행취미여행일반 국내여행
35843585Eight Hundred Leagues On The Amazon<NA>Balloon2003-02-0110일반인문학SF 밀리터리
1591815919박태준처럼이경윤FKI미디어(오이북)2013-06-123어린이아동역사 인물 지리
1207312074청소년이 읽어야 할 고전소설 2작자미상 외크리에이트플러스2014-07-153청소년학습지 참고서공부방법 학습일반
순번도서명저자출판사출판일동시이용 가능 수대분류중분류소분류
1502915030영어로 읽는 세계 명작 - 보물섬 4로버트 루이 스티븐슨YBM시사닷컴2002-01-2910어린이아동동화(한국,외국,멀티)
1025410255고갱 II<NA>초승달2003-04-2010청소년청소년교양문학 고전 소설
48514852라스베가스의 불빛은 아직도 어둡다배상환책나무출판사2016-01-223일반인문학시 에세이
32953296Four Beasts In One<NA>Ex Libris2003-02-0110일반인문학외국소설
1516사기진작 & 복리후생현진욱라이터스2005-08-1210일반인경영 경제기업실무관리
92609261혁명가의 안해 상이광수작가문화2003-04-0410청소년청소년교양문학 고전 소설
44584459비밀정원 - 잃어버린 엄마의 첫사랑을 찾아서박혜영다산책방2014-10-163일반인문학한국소설
1736317364산타를 기다리는 할머니이경선훈민2018-02-203어린이아동동화(한국,외국,멀티)
1579015791맛있는 말놀이 그림책 1노경실아울북2011-04-113어린이아동초등학년
45624563나는 오늘부터 말을 하지 않기로 했다편석환가디언2015-06-293일반인문학시 에세이