Overview

Dataset statistics

Number of variables11
Number of observations10000
Missing cells10
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory976.6 KiB
Average record size in memory100.0 B

Variable types

Numeric4
Categorical3
Text4

Dataset

Description서울특별시 종로구청에서 관리하는 도서관이 소장중인 전자책 목록으로 책이름, 저자 출판일등이 기재되어 있음을 말씀드립니다. http://elib.jongno.go.kr/ebook/ 에서도 전자책 목록을 검색하실수 있으십니다.
URLhttps://www.data.go.kr/data/15112922/fileData.do

Alerts

상품종류 has constant value ""Constant
기준일 has constant value ""Constant
연번 is highly overall correlated with 상품번호 and 1 other fieldsHigh correlation
상품번호 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
출판일 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 has unique valuesUnique
상품번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 00:28:22.811626
Analysis finished2023-12-12 00:28:27.550044
Duration4.74 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6974.7884
Minimum1
Maximum13976
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T09:28:27.625694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile702.95
Q13439.75
median6971.5
Q310506.25
95-th percentile13268.05
Maximum13976
Range13975
Interquartile range (IQR)7066.5

Descriptive statistics

Standard deviation4047.4292
Coefficient of variation (CV)0.58029419
Kurtosis-1.2109543
Mean6974.7884
Median Absolute Deviation (MAD)3533.5
Skewness0.0077953294
Sum69747884
Variance16381683
MonotonicityNot monotonic
2023-12-12T09:28:27.761133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13780 1
 
< 0.1%
456 1
 
< 0.1%
416 1
 
< 0.1%
5885 1
 
< 0.1%
1000 1
 
< 0.1%
12376 1
 
< 0.1%
6939 1
 
< 0.1%
8158 1
 
< 0.1%
4439 1
 
< 0.1%
8704 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
8 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
ValueCountFrequency (%)
13976 1
< 0.1%
13975 1
< 0.1%
13974 1
< 0.1%
13973 1
< 0.1%
13972 1
< 0.1%
13969 1
< 0.1%
13968 1
< 0.1%
13966 1
< 0.1%
13965 1
< 0.1%
13964 1
< 0.1%

상품종류
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
전자책
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전자책
2nd row전자책
3rd row전자책
4th row전자책
5th row전자책

Common Values

ValueCountFrequency (%)
전자책 10000
100.0%

Length

2023-12-12T09:28:27.905015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:28:27.991984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전자책 10000
100.0%

상품번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean59773563
Minimum3808792
Maximum1.1560801 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T09:28:28.127324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3808792
5-th percentile7006079.9
Q130705968
median64093146
Q389394144
95-th percentile1.0881812 × 108
Maximum1.1560801 × 108
Range1.1179922 × 108
Interquartile range (IQR)58688176

Descriptive statistics

Standard deviation33340428
Coefficient of variation (CV)0.55777883
Kurtosis-1.1494159
Mean59773563
Median Absolute Deviation (MAD)26157974
Skewness-0.25246767
Sum5.9773563 × 1011
Variance1.1115842 × 1015
MonotonicityNot monotonic
2023-12-12T09:28:28.269603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
108255044 1
 
< 0.1%
7006010 1
 
< 0.1%
7005966 1
 
< 0.1%
90186463 1
 
< 0.1%
7015741 1
 
< 0.1%
102116615 1
 
< 0.1%
66810315 1
 
< 0.1%
59271554 1
 
< 0.1%
68882961 1
 
< 0.1%
8645452 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
3808792 1
< 0.1%
3907826 1
< 0.1%
3999160 1
< 0.1%
3999172 1
< 0.1%
4081373 1
< 0.1%
4081395 1
< 0.1%
4081431 1
< 0.1%
4135080 1
< 0.1%
4254910 1
< 0.1%
4261493 1
< 0.1%
ValueCountFrequency (%)
115608012 1
< 0.1%
115587472 1
< 0.1%
115577566 1
< 0.1%
115561734 1
< 0.1%
115542216 1
< 0.1%
115470196 1
< 0.1%
115460697 1
< 0.1%
115457495 1
< 0.1%
115457122 1
< 0.1%
115456670 1
< 0.1%

제목
Text

Distinct9950
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T09:28:28.614028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length137
Median length82
Mean length22.7719
Min length1

Characters and Unicode

Total characters227719
Distinct characters1796
Distinct categories16 ?
Distinct scripts6 ?
Distinct blocks12 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9901 ?
Unique (%)99.0%

Sample

1st row엔드 오브 라이프
2nd row녹주석 왕관 : 짜릿하게 즐기는 명탐정 셜록 홈즈 013
3rd row엔지니어의 엄지손가락 : 짜릿하게 즐기는 명탐정 셜록 홈즈 011
4th row신의 아이 2
5th row유사역사학 비판 : 『환단고기』와 일그러진 고대사
ValueCountFrequency (%)
3963
 
6.7%
the 715
 
1.2%
읽는 449
 
0.8%
of 373
 
0.6%
위한 324
 
0.5%
이야기 271
 
0.5%
영어 247
 
0.4%
역사 241
 
0.4%
세계 238
 
0.4%
원서 233
 
0.4%
Other values (20657) 51913
88.0%
2023-12-12T09:28:29.137224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
48967
 
21.5%
3747
 
1.6%
3506
 
1.5%
e 3316
 
1.5%
3175
 
1.4%
: 3006
 
1.3%
2249
 
1.0%
2123
 
0.9%
o 2052
 
0.9%
2031
 
0.9%
Other values (1786) 153547
67.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 133408
58.6%
Space Separator 48967
 
21.5%
Lowercase Letter 22102
 
9.7%
Decimal Number 7245
 
3.2%
Uppercase Letter 5858
 
2.6%
Other Punctuation 5575
 
2.4%
Open Punctuation 1500
 
0.7%
Close Punctuation 1498
 
0.7%
Dash Punctuation 1199
 
0.5%
Math Symbol 231
 
0.1%
Other values (6) 136
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3747
 
2.8%
3506
 
2.6%
3175
 
2.4%
2249
 
1.7%
2123
 
1.6%
2031
 
1.5%
1895
 
1.4%
1783
 
1.3%
1772
 
1.3%
1755
 
1.3%
Other values (1671) 109372
82.0%
Lowercase Letter
ValueCountFrequency (%)
e 3316
15.0%
o 2052
9.3%
a 1808
 
8.2%
n 1658
 
7.5%
r 1651
 
7.5%
i 1549
 
7.0%
t 1528
 
6.9%
s 1357
 
6.1%
h 1220
 
5.5%
l 1007
 
4.6%
Other values (17) 4956
22.4%
Uppercase Letter
ValueCountFrequency (%)
T 984
16.8%
S 555
 
9.5%
B 449
 
7.7%
A 358
 
6.1%
E 343
 
5.9%
P 305
 
5.2%
M 285
 
4.9%
C 265
 
4.5%
L 262
 
4.5%
R 236
 
4.0%
Other values (17) 1816
31.0%
Other Punctuation
ValueCountFrequency (%)
: 3006
53.9%
, 1177
 
21.1%
' 344
 
6.2%
! 329
 
5.9%
? 271
 
4.9%
. 202
 
3.6%
/ 64
 
1.1%
· 63
 
1.1%
; 39
 
0.7%
& 26
 
0.5%
Other values (9) 54
 
1.0%
Decimal Number
ValueCountFrequency (%)
1 1692
23.4%
0 1477
20.4%
2 1020
14.1%
3 685
9.5%
5 517
 
7.1%
4 483
 
6.7%
6 417
 
5.8%
9 328
 
4.5%
8 319
 
4.4%
7 307
 
4.2%
Math Symbol
ValueCountFrequency (%)
+ 140
60.6%
~ 46
 
19.9%
26
 
11.3%
| 6
 
2.6%
× 4
 
1.7%
= 3
 
1.3%
3
 
1.3%
3
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 1341
89.4%
[ 154
 
10.3%
2
 
0.1%
2
 
0.1%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1339
89.4%
] 154
 
10.3%
2
 
0.1%
2
 
0.1%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
4
57.1%
2
28.6%
1
 
14.3%
Dash Punctuation
ValueCountFrequency (%)
- 1197
99.8%
2
 
0.2%
Final Punctuation
ValueCountFrequency (%)
53
93.0%
4
 
7.0%
Initial Punctuation
ValueCountFrequency (%)
47
92.2%
4
 
7.8%
Other Number
ValueCountFrequency (%)
² 1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
48967
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 18
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 132649
58.3%
Common 66344
29.1%
Latin 27967
 
12.3%
Han 685
 
0.3%
Hiragana 39
 
< 0.1%
Katakana 35
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3747
 
2.8%
3506
 
2.6%
3175
 
2.4%
2249
 
1.7%
2123
 
1.6%
2031
 
1.5%
1895
 
1.4%
1783
 
1.3%
1772
 
1.3%
1755
 
1.3%
Other values (1205) 108613
81.9%
Han
ValueCountFrequency (%)
13
 
1.9%
13
 
1.9%
11
 
1.6%
11
 
1.6%
10
 
1.5%
9
 
1.3%
8
 
1.2%
8
 
1.2%
6
 
0.9%
6
 
0.9%
Other values (412) 590
86.1%
Common
ValueCountFrequency (%)
48967
73.8%
: 3006
 
4.5%
1 1692
 
2.6%
0 1477
 
2.2%
( 1341
 
2.0%
) 1339
 
2.0%
- 1197
 
1.8%
, 1177
 
1.8%
2 1020
 
1.5%
3 685
 
1.0%
Other values (48) 4443
 
6.7%
Latin
ValueCountFrequency (%)
e 3316
 
11.9%
o 2052
 
7.3%
a 1808
 
6.5%
n 1658
 
5.9%
r 1651
 
5.9%
i 1549
 
5.5%
t 1528
 
5.5%
s 1357
 
4.9%
h 1220
 
4.4%
l 1007
 
3.6%
Other values (47) 10821
38.7%
Katakana
ValueCountFrequency (%)
5
 
14.3%
3
 
8.6%
2
 
5.7%
2
 
5.7%
2
 
5.7%
2
 
5.7%
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (14) 14
40.0%
Hiragana
ValueCountFrequency (%)
11
28.2%
3
 
7.7%
2
 
5.1%
2
 
5.1%
2
 
5.1%
2
 
5.1%
2
 
5.1%
2
 
5.1%
2
 
5.1%
1
 
2.6%
Other values (10) 10
25.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 132639
58.2%
ASCII 94061
41.3%
CJK 669
 
0.3%
None 127
 
0.1%
Punctuation 114
 
0.1%
Hiragana 39
 
< 0.1%
Katakana 35
 
< 0.1%
CJK Compat Ideographs 16
 
< 0.1%
Compat Jamo 10
 
< 0.1%
Number Forms 7
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
48967
52.1%
e 3316
 
3.5%
: 3006
 
3.2%
o 2052
 
2.2%
a 1808
 
1.9%
1 1692
 
1.8%
n 1658
 
1.8%
r 1651
 
1.8%
i 1549
 
1.6%
t 1528
 
1.6%
Other values (76) 26834
28.5%
Hangul
ValueCountFrequency (%)
3747
 
2.8%
3506
 
2.6%
3175
 
2.4%
2249
 
1.7%
2123
 
1.6%
2031
 
1.5%
1895
 
1.4%
1783
 
1.3%
1772
 
1.3%
1755
 
1.3%
Other values (1200) 108603
81.9%
None
ValueCountFrequency (%)
· 63
49.6%
26
20.5%
10
 
7.9%
× 4
 
3.1%
3
 
2.4%
3
 
2.4%
2
 
1.6%
2
 
1.6%
2
 
1.6%
2
 
1.6%
Other values (8) 10
 
7.9%
Punctuation
ValueCountFrequency (%)
53
46.5%
47
41.2%
4
 
3.5%
4
 
3.5%
4
 
3.5%
2
 
1.8%
CJK
ValueCountFrequency (%)
13
 
1.9%
13
 
1.9%
11
 
1.6%
11
 
1.6%
10
 
1.5%
9
 
1.3%
8
 
1.2%
8
 
1.2%
6
 
0.9%
6
 
0.9%
Other values (399) 574
85.8%
Hiragana
ValueCountFrequency (%)
11
28.2%
3
 
7.7%
2
 
5.1%
2
 
5.1%
2
 
5.1%
2
 
5.1%
2
 
5.1%
2
 
5.1%
2
 
5.1%
1
 
2.6%
Other values (10) 10
25.6%
Compat Jamo
ValueCountFrequency (%)
5
50.0%
2
 
20.0%
1
 
10.0%
1
 
10.0%
1
 
10.0%
Katakana
ValueCountFrequency (%)
5
 
14.3%
3
 
8.6%
2
 
5.7%
2
 
5.7%
2
 
5.7%
2
 
5.7%
2
 
5.7%
1
 
2.9%
1
 
2.9%
1
 
2.9%
Other values (14) 14
40.0%
Number Forms
ValueCountFrequency (%)
4
57.1%
2
28.6%
1
 
14.3%
CJK Compat Ideographs
ValueCountFrequency (%)
3
18.8%
2
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (3) 3
18.8%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%

저자
Text

Distinct5940
Distinct (%)59.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T09:28:29.436570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length50
Mean length10.3423
Min length2

Characters and Unicode

Total characters103423
Distinct characters980
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4904 ?
Unique (%)49.0%

Sample

1st row사사 료코 저/천감재 역
2nd row아서 코난 도일 저
3rd row아서 코난 도일 저
4th row야쿠마루 가쿠 저/이정민 역
5th row이문영 저
ValueCountFrequency (%)
7598
26.4%
1688
 
5.9%
그림 203
 
0.7%
이종하 165
 
0.6%
공저 161
 
0.6%
아서 144
 
0.5%
코난 142
 
0.5%
도일 142
 
0.5%
방정환 125
 
0.4%
김동인 108
 
0.4%
Other values (8517) 18285
63.6%
2023-12-12T09:28:29.918144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18761
 
18.1%
9698
 
9.4%
2346
 
2.3%
/ 2241
 
2.2%
1880
 
1.8%
e 1810
 
1.8%
1641
 
1.6%
a 1509
 
1.5%
r 1342
 
1.3%
1324
 
1.3%
Other values (970) 60871
58.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62125
60.1%
Space Separator 18761
 
18.1%
Lowercase Letter 14116
 
13.6%
Other Punctuation 3711
 
3.6%
Uppercase Letter 3543
 
3.4%
Open Punctuation 500
 
0.5%
Close Punctuation 498
 
0.5%
Decimal Number 134
 
0.1%
Dash Punctuation 26
 
< 0.1%
Math Symbol 6
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9698
 
15.6%
2346
 
3.8%
1880
 
3.0%
1641
 
2.6%
1324
 
2.1%
1024
 
1.6%
790
 
1.3%
724
 
1.2%
623
 
1.0%
593
 
1.0%
Other values (886) 41482
66.8%
Lowercase Letter
ValueCountFrequency (%)
e 1810
12.8%
a 1509
10.7%
r 1342
9.5%
n 1223
 
8.7%
o 1139
 
8.1%
i 926
 
6.6%
l 907
 
6.4%
s 781
 
5.5%
t 743
 
5.3%
h 578
 
4.1%
Other values (16) 3158
22.4%
Uppercase Letter
ValueCountFrequency (%)
S 313
 
8.8%
C 259
 
7.3%
B 255
 
7.2%
H 240
 
6.8%
A 227
 
6.4%
J 216
 
6.1%
D 207
 
5.8%
M 205
 
5.8%
W 193
 
5.4%
T 178
 
5.0%
Other values (16) 1250
35.3%
Decimal Number
ValueCountFrequency (%)
2 58
43.3%
1 43
32.1%
0 7
 
5.2%
7 6
 
4.5%
6 5
 
3.7%
5 5
 
3.7%
3 3
 
2.2%
9 3
 
2.2%
4 2
 
1.5%
8 2
 
1.5%
Other Punctuation
ValueCountFrequency (%)
/ 2241
60.4%
, 1045
28.2%
. 395
 
10.6%
? 7
 
0.2%
: 7
 
0.2%
& 6
 
0.2%
' 4
 
0.1%
· 4
 
0.1%
* 2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 476
95.2%
[ 18
 
3.6%
5
 
1.0%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 474
95.2%
] 18
 
3.6%
5
 
1.0%
1
 
0.2%
Space Separator
ValueCountFrequency (%)
18761
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Math Symbol
ValueCountFrequency (%)
| 6
100.0%
Final Punctuation
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 61853
59.8%
Common 23639
 
22.9%
Latin 17659
 
17.1%
Han 210
 
0.2%
Hiragana 62
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9698
 
15.7%
2346
 
3.8%
1880
 
3.0%
1641
 
2.7%
1324
 
2.1%
1024
 
1.7%
790
 
1.3%
724
 
1.2%
623
 
1.0%
593
 
1.0%
Other values (779) 41210
66.6%
Han
ValueCountFrequency (%)
14
 
6.7%
14
 
6.7%
14
 
6.7%
14
 
6.7%
14
 
6.7%
7
 
3.3%
6
 
2.9%
5
 
2.4%
5
 
2.4%
5
 
2.4%
Other values (78) 112
53.3%
Latin
ValueCountFrequency (%)
e 1810
 
10.2%
a 1509
 
8.5%
r 1342
 
7.6%
n 1223
 
6.9%
o 1139
 
6.4%
i 926
 
5.2%
l 907
 
5.1%
s 781
 
4.4%
t 743
 
4.2%
h 578
 
3.3%
Other values (42) 6701
37.9%
Common
ValueCountFrequency (%)
18761
79.4%
/ 2241
 
9.5%
, 1045
 
4.4%
( 476
 
2.0%
) 474
 
2.0%
. 395
 
1.7%
2 58
 
0.2%
1 43
 
0.2%
- 26
 
0.1%
[ 18
 
0.1%
Other values (22) 102
 
0.4%
Hiragana
ValueCountFrequency (%)
5
 
8.1%
4
 
6.5%
4
 
6.5%
4
 
6.5%
4
 
6.5%
4
 
6.5%
4
 
6.5%
4
 
6.5%
4
 
6.5%
4
 
6.5%
Other values (9) 21
33.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 61853
59.8%
ASCII 41279
39.9%
CJK 209
 
0.2%
Hiragana 62
 
0.1%
None 16
 
< 0.1%
Punctuation 3
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18761
45.4%
/ 2241
 
5.4%
e 1810
 
4.4%
a 1509
 
3.7%
r 1342
 
3.3%
n 1223
 
3.0%
o 1139
 
2.8%
, 1045
 
2.5%
i 926
 
2.2%
l 907
 
2.2%
Other values (67) 10376
25.1%
Hangul
ValueCountFrequency (%)
9698
 
15.7%
2346
 
3.8%
1880
 
3.0%
1641
 
2.7%
1324
 
2.1%
1024
 
1.7%
790
 
1.3%
724
 
1.2%
623
 
1.0%
593
 
1.0%
Other values (779) 41210
66.6%
CJK
ValueCountFrequency (%)
14
 
6.7%
14
 
6.7%
14
 
6.7%
14
 
6.7%
14
 
6.7%
7
 
3.3%
6
 
2.9%
5
 
2.4%
5
 
2.4%
5
 
2.4%
Other values (77) 111
53.1%
None
ValueCountFrequency (%)
5
31.2%
5
31.2%
· 4
25.0%
1
 
6.2%
1
 
6.2%
Hiragana
ValueCountFrequency (%)
5
 
8.1%
4
 
6.5%
4
 
6.5%
4
 
6.5%
4
 
6.5%
4
 
6.5%
4
 
6.5%
4
 
6.5%
4
 
6.5%
4
 
6.5%
Other values (9) 21
33.9%
Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct1251
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T09:28:30.251394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length21
Mean length6.215
Min length1

Characters and Unicode

Total characters62150
Distinct characters629
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique532 ?
Unique (%)5.3%

Sample

1st row스튜디오오드리
2nd row바로이북
3rd row바로이북
4th row몽실북스
5th row역사비평사
ValueCountFrequency (%)
project 837
 
7.6%
gutenberg 837
 
7.6%
한국저작권위원회 648
 
5.8%
유페이퍼 399
 
3.6%
디오네 165
 
1.5%
u-paper(유페이퍼 157
 
1.4%
알에이치코리아(rhk 108
 
1.0%
내츄럴 108
 
1.0%
bookmaker 105
 
0.9%
위즈덤하우스 92
 
0.8%
Other values (1271) 7629
68.8%
2023-12-12T09:28:30.768328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 2991
 
4.8%
r 2019
 
3.2%
1913
 
3.1%
1825
 
2.9%
1788
 
2.9%
t 1730
 
2.8%
o 1403
 
2.3%
1085
 
1.7%
u 1040
 
1.7%
b 991
 
1.6%
Other values (619) 45365
73.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40375
65.0%
Lowercase Letter 15215
 
24.5%
Uppercase Letter 3511
 
5.6%
Space Separator 1085
 
1.7%
Close Punctuation 733
 
1.2%
Open Punctuation 733
 
1.2%
Decimal Number 282
 
0.5%
Dash Punctuation 160
 
0.3%
Other Punctuation 55
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1913
 
4.7%
1825
 
4.5%
1788
 
4.4%
964
 
2.4%
838
 
2.1%
779
 
1.9%
762
 
1.9%
757
 
1.9%
756
 
1.9%
746
 
1.8%
Other values (552) 29247
72.4%
Lowercase Letter
ValueCountFrequency (%)
e 2991
19.7%
r 2019
13.3%
t 1730
11.4%
o 1403
9.2%
u 1040
 
6.8%
b 991
 
6.5%
n 872
 
5.7%
g 857
 
5.6%
c 855
 
5.6%
j 837
 
5.5%
Other values (14) 1620
10.6%
Uppercase Letter
ValueCountFrequency (%)
P 854
24.3%
G 843
24.0%
B 208
 
5.9%
K 181
 
5.2%
L 147
 
4.2%
M 144
 
4.1%
R 128
 
3.6%
H 118
 
3.4%
I 118
 
3.4%
S 117
 
3.3%
Other values (13) 653
18.6%
Decimal Number
ValueCountFrequency (%)
1 113
40.1%
2 99
35.1%
9 22
 
7.8%
4 13
 
4.6%
6 11
 
3.9%
8 10
 
3.5%
7 9
 
3.2%
5 3
 
1.1%
3 2
 
0.7%
Other Punctuation
ValueCountFrequency (%)
# 28
50.9%
& 15
27.3%
. 5
 
9.1%
/ 4
 
7.3%
' 2
 
3.6%
? 1
 
1.8%
Space Separator
ValueCountFrequency (%)
1085
100.0%
Close Punctuation
ValueCountFrequency (%)
) 733
100.0%
Open Punctuation
ValueCountFrequency (%)
( 733
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 160
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40361
64.9%
Latin 18726
30.1%
Common 3049
 
4.9%
Han 14
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1913
 
4.7%
1825
 
4.5%
1788
 
4.4%
964
 
2.4%
838
 
2.1%
779
 
1.9%
762
 
1.9%
757
 
1.9%
756
 
1.9%
746
 
1.8%
Other values (547) 29233
72.4%
Latin
ValueCountFrequency (%)
e 2991
16.0%
r 2019
 
10.8%
t 1730
 
9.2%
o 1403
 
7.5%
u 1040
 
5.6%
b 991
 
5.3%
n 872
 
4.7%
g 857
 
4.6%
c 855
 
4.6%
P 854
 
4.6%
Other values (37) 5114
27.3%
Common
ValueCountFrequency (%)
1085
35.6%
) 733
24.0%
( 733
24.0%
- 160
 
5.2%
1 113
 
3.7%
2 99
 
3.2%
# 28
 
0.9%
9 22
 
0.7%
& 15
 
0.5%
4 13
 
0.4%
Other values (10) 48
 
1.6%
Han
ValueCountFrequency (%)
4
28.6%
4
28.6%
4
28.6%
1
 
7.1%
1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40360
64.9%
ASCII 21775
35.0%
CJK 10
 
< 0.1%
CJK Compat Ideographs 4
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 2991
 
13.7%
r 2019
 
9.3%
t 1730
 
7.9%
o 1403
 
6.4%
1085
 
5.0%
u 1040
 
4.8%
b 991
 
4.6%
n 872
 
4.0%
g 857
 
3.9%
c 855
 
3.9%
Other values (57) 7932
36.4%
Hangul
ValueCountFrequency (%)
1913
 
4.7%
1825
 
4.5%
1788
 
4.4%
964
 
2.4%
838
 
2.1%
779
 
1.9%
762
 
1.9%
757
 
1.9%
756
 
1.9%
746
 
1.8%
Other values (546) 29232
72.4%
CJK
ValueCountFrequency (%)
4
40.0%
4
40.0%
1
 
10.0%
1
 
10.0%
CJK Compat Ideographs
ValueCountFrequency (%)
4
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

출판일
Real number (ℝ)

HIGH CORRELATION 

Distinct2209
Distinct (%)22.1%
Missing10
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean20176016
Minimum20010430
Maximum20221205
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T09:28:30.960483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20010430
5-th percentile20120502
Q120160822
median20180816
Q320200227
95-th percentile20220411
Maximum20221205
Range210775
Interquartile range (IQR)39405

Descriptive statistics

Standard deviation29899.795
Coefficient of variation (CV)0.0014819474
Kurtosis-0.21477055
Mean20176016
Median Absolute Deviation (MAD)19604
Skewness-0.66181053
Sum2.015584 × 1011
Variance8.9399776 × 108
MonotonicityNot monotonic
2023-12-12T09:28:31.119335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20171107 248
 
2.5%
20170929 239
 
2.4%
20120501 236
 
2.4%
20120502 228
 
2.3%
20171204 168
 
1.7%
20120503 152
 
1.5%
20120430 140
 
1.4%
20160425 75
 
0.8%
20120523 50
 
0.5%
20161121 42
 
0.4%
Other values (2199) 8412
84.1%
ValueCountFrequency (%)
20010430 1
< 0.1%
20021210 1
< 0.1%
20040710 1
< 0.1%
20040901 1
< 0.1%
20041120 1
< 0.1%
20050314 1
< 0.1%
20050510 1
< 0.1%
20071005 1
< 0.1%
20071201 1
< 0.1%
20080101 1
< 0.1%
ValueCountFrequency (%)
20221205 1
 
< 0.1%
20221130 3
< 0.1%
20221122 1
 
< 0.1%
20221121 1
 
< 0.1%
20221118 2
< 0.1%
20221117 4
< 0.1%
20221115 4
< 0.1%
20221111 4
< 0.1%
20221110 1
 
< 0.1%
20221109 2
< 0.1%

보유 종수
Real number (ℝ)

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.3577
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T09:28:31.261724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile2
Maximum12
Range11
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.68410343
Coefficient of variation (CV)0.50386936
Kurtosis30.821236
Mean1.3577
Median Absolute Deviation (MAD)0
Skewness3.9403991
Sum13577
Variance0.46799751
MonotonicityNot monotonic
2023-12-12T09:28:31.423703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1 7002
70.0%
2 2734
 
27.3%
3 112
 
1.1%
5 94
 
0.9%
4 40
 
0.4%
6 6
 
0.1%
7 5
 
0.1%
10 3
 
< 0.1%
8 2
 
< 0.1%
12 2
 
< 0.1%
ValueCountFrequency (%)
1 7002
70.0%
2 2734
 
27.3%
3 112
 
1.1%
4 40
 
0.4%
5 94
 
0.9%
6 6
 
0.1%
7 5
 
0.1%
8 2
 
< 0.1%
10 3
 
< 0.1%
12 2
 
< 0.1%
ValueCountFrequency (%)
12 2
 
< 0.1%
10 3
 
< 0.1%
8 2
 
< 0.1%
7 5
 
0.1%
6 6
 
0.1%
5 94
 
0.9%
4 40
 
0.4%
3 112
 
1.1%
2 2734
 
27.3%
1 7002
70.0%

대분류
Categorical

Distinct15
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
문학
3195 
인문/사회
1564 
자기관리
1008 
가정과생활
871 
해외eBook
837 
Other values (10)
2525 

Length

Max length7
Median length6
Mean length4.4752
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row자기관리
2nd row문학
3rd row문학
4th row문학
5th row인문/사회

Common Values

ValueCountFrequency (%)
문학 3195
31.9%
인문/사회 1564
15.6%
자기관리 1008
 
10.1%
가정과생활 871
 
8.7%
해외eBook 837
 
8.4%
어린이/청소년 732
 
7.3%
비즈니스와경제 698
 
7.0%
국어와외국어 626
 
6.3%
자연과과학 210
 
2.1%
예술/대중문화 148
 
1.5%
Other values (5) 111
 
1.1%

Length

2023-12-12T09:28:31.584880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
문학 3195
31.9%
인문/사회 1564
15.6%
자기관리 1008
 
10.1%
가정과생활 871
 
8.7%
해외ebook 837
 
8.4%
어린이/청소년 732
 
7.3%
비즈니스와경제 698
 
7.0%
국어와외국어 626
 
6.3%
자연과과학 210
 
2.1%
예술/대중문화 148
 
1.5%
Other values (5) 111
 
1.1%
Distinct69
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T09:28:31.805146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length4.5518
Min length2

Characters and Unicode

Total characters45518
Distinct characters156
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.1%

Sample

1st row성공학/경력관리
2nd row소설
3rd row소설
4th row소설
5th row역사/종교
ValueCountFrequency (%)
소설 2497
24.6%
인문/사회 1075
10.6%
구텐베르크프로젝트 837
 
8.2%
가정과생활 655
 
6.4%
역사/종교 489
 
4.8%
영어 461
 
4.5%
에세이/산문 457
 
4.5%
어린이 425
 
4.2%
처세술/삶의자세 358
 
3.5%
성공학/경력관리 337
 
3.3%
Other values (64) 2579
25.4%
2023-12-12T09:28:32.169706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 3923
 
8.6%
2776
 
6.1%
2497
 
5.5%
1620
 
3.6%
1609
 
3.5%
1310
 
2.9%
1194
 
2.6%
1191
 
2.6%
1025
 
2.3%
914
 
2.0%
Other values (146) 27459
60.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41115
90.3%
Other Punctuation 3932
 
8.6%
Uppercase Letter 301
 
0.7%
Space Separator 170
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2776
 
6.8%
2497
 
6.1%
1620
 
3.9%
1609
 
3.9%
1310
 
3.2%
1194
 
2.9%
1191
 
2.9%
1025
 
2.5%
914
 
2.2%
883
 
2.1%
Other values (136) 26096
63.5%
Uppercase Letter
ValueCountFrequency (%)
E 89
29.6%
C 89
29.6%
O 89
29.6%
I 16
 
5.3%
T 16
 
5.3%
F 1
 
0.3%
S 1
 
0.3%
Other Punctuation
ValueCountFrequency (%)
/ 3923
99.8%
& 9
 
0.2%
Space Separator
ValueCountFrequency (%)
170
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41115
90.3%
Common 4102
 
9.0%
Latin 301
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2776
 
6.8%
2497
 
6.1%
1620
 
3.9%
1609
 
3.9%
1310
 
3.2%
1194
 
2.9%
1191
 
2.9%
1025
 
2.5%
914
 
2.2%
883
 
2.1%
Other values (136) 26096
63.5%
Latin
ValueCountFrequency (%)
E 89
29.6%
C 89
29.6%
O 89
29.6%
I 16
 
5.3%
T 16
 
5.3%
F 1
 
0.3%
S 1
 
0.3%
Common
ValueCountFrequency (%)
/ 3923
95.6%
170
 
4.1%
& 9
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41115
90.3%
ASCII 4403
 
9.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 3923
89.1%
170
 
3.9%
E 89
 
2.0%
C 89
 
2.0%
O 89
 
2.0%
I 16
 
0.4%
T 16
 
0.4%
& 9
 
0.2%
F 1
 
< 0.1%
S 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
2776
 
6.8%
2497
 
6.1%
1620
 
3.9%
1609
 
3.9%
1310
 
3.2%
1194
 
2.9%
1191
 
2.9%
1025
 
2.5%
914
 
2.2%
883
 
2.1%
Other values (136) 26096
63.5%

기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-03-23
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-03-23
2nd row2023-03-23
3rd row2023-03-23
4th row2023-03-23
5th row2023-03-23

Common Values

ValueCountFrequency (%)
2023-03-23 10000
100.0%

Length

2023-12-12T09:28:32.357205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:28:32.465613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-03-23 10000
100.0%

Interactions

2023-12-12T09:28:26.582581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:25.324733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:25.778838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:26.189468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:26.683210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:25.457993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:25.883945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:26.288464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:26.776277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:25.579189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:25.986169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:26.408576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:26.871451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:25.681196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:26.086414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:28:26.498076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:28:32.541725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상품번호출판일보유 종수대분류중분류
연번1.0000.8880.7970.2080.7130.773
상품번호0.8881.0000.9320.1950.6400.727
출판일0.7970.9321.0000.1510.6310.698
보유 종수0.2080.1950.1511.0000.1140.000
대분류0.7130.6400.6310.1141.0000.999
중분류0.7730.7270.6980.0000.9991.000
2023-12-12T09:28:32.647717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상품번호출판일보유 종수대분류
연번1.0000.5530.550-0.3050.357
상품번호0.5531.0000.998-0.1300.299
출판일0.5500.9981.000-0.1260.292
보유 종수-0.305-0.130-0.1261.0000.047
대분류0.3570.2990.2920.0471.000

Missing values

2023-12-12T09:28:27.024578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:28:27.476019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상품종류상품번호제목저자출판사출판일보유 종수대분류중분류기준일
1377913780전자책108255044엔드 오브 라이프사사 료코 저/천감재 역스튜디오오드리202202182자기관리성공학/경력관리2023-03-23
49214922전자책78882613녹주석 왕관 : 짜릿하게 즐기는 명탐정 셜록 홈즈 013아서 코난 도일 저바로이북201909201문학소설2023-03-23
44254426전자책78882615엔지니어의 엄지손가락 : 짜릿하게 즐기는 명탐정 셜록 홈즈 011아서 코난 도일 저바로이북201909201문학소설2023-03-23
26302631전자책71534556신의 아이 2야쿠마루 가쿠 저/이정민 역몽실북스201903201문학소설2023-03-23
51355136전자책86025571유사역사학 비판 : 『환단고기』와 일그러진 고대사이문영 저역사비평사201912201인문/사회역사/종교2023-03-23
52105211전자책80772814차이나는 클라스 : 과학·문화·미래 편 : 불통不通의 시대, 교양을 넘어 생존을 위한 질문을 던져라JTBC [차이나는 클라스] 제작팀 저중앙북스(books)201910181인문/사회인문/사회2023-03-23
165166전자책7002015Ballads of a CheechakoRobert W. Service 저Project Gutenberg201204302해외eBook구텐베르크프로젝트2023-03-23
1014210143전자책32382734화성의 공주 A Princess of Mars (영어 원서 읽기)Edgar Rice Burroughs (에드거 라이스 버로스) 저u-paper(유페이퍼)201609191국어와외국어영어2023-03-23
81388139전자책66274990강변에 남긴 말진영영 저도디드201810241가정과생활가정과생활2023-03-23
1349613497전자책111380354김종훈의 세계 현대건축 여행김종훈 저클라우드나인202208021인문/사회인문/사회2023-03-23
연번상품종류상품번호제목저자출판사출판일보유 종수대분류중분류기준일
1378313784전자책110563035호모 아딕투스김병규 저다산북스202207072비즈니스와경제경영2023-03-23
59185919전자책65564129어른에게도 어른이 필요하다박산호 저북라이프201810251문학에세이/산문2023-03-23
70397040전자책30555756취미의 유전 - 일본 중단편 고전문학 016나쓰메 소세키 저현인201608171문학소설2023-03-23
51345135전자책68859409엄마의 첫 심리 공부 : 자녀 관계, 부부 관계부터 고독감, 자존감까지누다심(강현식) 저유노북스201901211인문/사회인문/사회2023-03-23
67916792전자책43617976궁극의 걷기 여행 코스 서울 청계천 : 물 따라 걷는 도심 여행이민학 저북탐201706201가정과생활취미/여행2023-03-23
29462947전자책89017005모두를 위한 성평등 공부서울특별시 여성정책담당관 기획/이나영 편/이나영,최윤정,안재희,한채윤,김소라,김수아 저프로젝트P202002201인문/사회인문/사회2023-03-23
1036410365전자책40762266일본 문학 BEST 원서 81~90위 작품 읽기! (靑空文庫: 전자책 ebook 다운로드 81~90위)이즈미 쿄카,시마자키 도손 외 5명 저마음생각201704191국어와외국어일본어2023-03-23
43364337전자책59737678자기 돌봄 (개정판) : 누구보다 사랑하고 싶은 나를 위한 자기 치유법타라 브랙 저/이재석 역/김선경 편생각정원201804131인문/사회인문/사회2023-03-23
73177318전자책43861420마담 보바리 Frau Bovary ('독일어+영어+독일어/영어 오디오북' 1석 4조 함께 원서 읽기!)귀스타브 플로베르 (Gustave Flaubert) 저컨트롤V201706211국어와외국어기타 동양언어2023-03-23
48204821전자책77131918탐정, 마틴 휴잇 1 : 렌턴 크로프트 도난 사건아서 모리슨 저엔플래닛(Nplanet)201907221문학소설2023-03-23