Overview

Dataset statistics

Number of variables: 8
Number of observations: 7874
Missing cells: 3
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 515.3 KiB
Average record size in memory: 67.0 B
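
The derived figures in this table can be cross-checked from the raw counts. The sketch below assumes that "missing cells (%)" is missing cells divided by variables × observations and that the average record size is total memory divided by the number of observations. The variable names are illustrative only.

```python
# Figures copied from the dataset statistics table.
n_variables = 8
n_observations = 7874
missing_cells = 3
total_size_kib = 515.3

# Assumed definitions (not stated in the report itself).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the missing share is well below the 0.1% threshold shown in the table, and the record size rounds to the reported 67.0 B.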

Variable types

Numeric: 2
Text: 2
Unsupported: 3
Categorical: 1

Dataset

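Description: Book information (title, author, publication year, price, publisher) for the 7,874 books purchased by Gumi Library in 2022.
Author: Gyeongsangbuk-do Office of Education, Gumi Library (경상북도교육청 경상북도교육청구미도서관)
URL: https://www.data.go.kr/data/3068857/fileData.do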

Alerts

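Column names are kept in Korean as they appear in the data: 번호 (number), 서명 (title), 저자 (author), 발행자 (publisher), 발행년 (publication year), 책수 (number of copies), 단가 (unit price), 금액 (amount).

High correlation: 금액 is highly overall correlated with 책수
High correlation: 책수 is highly overall correlated with 금액
Imbalance: 책수 is highly imbalanced (58.5%)
Unique: 번호 has unique values
Unsupported: 발행자 is an unsupported type; check whether it needs cleaning or further analysis
Unsupported: 발행년 is an unsupported type; check whether it needs cleaning or further analysis
Unsupported: 단가 is an unsupported type; check whether it needs cleaning or further analysis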

Reproduction

Analysis started: 2024-04-17 10:57:15.477479
Analysis finished: 2024-04-17 10:57:17.304771
Duration: 1.83 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
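
The reported duration follows from the two timestamps. A minimal sketch using only the standard library (the format string is an assumption about how the timestamps are laid out):

```python
from datetime import datetime

# Timestamp layout assumed from the two values shown in the report.
FORMAT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-04-17 10:57:15.477479", FORMAT)
finished = datetime.strptime("2024-04-17 10:57:17.304771", FORMAT)

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```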

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 7874
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3937.5
Minimum: 1
Maximum: 7874
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 69.3 KiB

Quantile statistics

Minimum: 1
5th percentile: 394.65
Q1: 1969.25
Median: 3937.5
Q3: 5905.75
95th percentile: 7480.35
Maximum: 7874
Range: 7873
Interquartile range (IQR): 3936.5

Descriptive statistics

Standard deviation: 2273.1723
Coefficient of variation (CV): 0.57731361
Kurtosis: -1.2
Mean: 3937.5
Median Absolute Deviation (MAD): 1968.5
Skewness: 0
Sum: 31003875
Variance: 5167312.5
Monotonicity: Strictly increasing
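
Because 번호 is unique, strictly increasing, and spans 1 to 7874, it behaves like a plain row counter, and its statistics can be reproduced without the data. The sketch below assumes the column is exactly the sequence 1..7874, that the variance uses the sample (n - 1) denominator, and that percentiles use linear interpolation. Those assumptions reproduce the reported figures but are not stated in the report.

```python
import math
import statistics

values = range(1, 7875)  # assumed: 번호 is exactly 1, 2, ..., 7874
n = len(values)

total = sum(values)
mean = total / n
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation


def percentile(p):
    """Linear-interpolation percentile for the evenly spaced sequence 1..n."""
    return 1 + p * (n - 1)


print(total, mean, variance)
print(round(std, 4), round(cv, 8))
print(percentile(0.05), percentile(0.25), percentile(0.75), percentile(0.95))
```

A column with this profile carries no information beyond row order, so it is usually treated as an index rather than as a feature.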
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
5261 | 1 | < 0.1%
5259 | 1 | < 0.1%
5258 | 1 | < 0.1%
5257 | 1 | < 0.1%
5256 | 1 | < 0.1%
5255 | 1 | < 0.1%
5254 | 1 | < 0.1%
5253 | 1 | < 0.1%
5252 | 1 | < 0.1%
Other values (7864) | 7864 | 99.9%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
7874 | 1 | < 0.1%
7873 | 1 | < 0.1%
7872 | 1 | < 0.1%
7871 | 1 | < 0.1%
7870 | 1 | < 0.1%
7869 | 1 | < 0.1%
7868 | 1 | < 0.1%
7867 | 1 | < 0.1%
7866 | 1 | < 0.1%
7865 | 1 | < 0.1%

서명
Text

Distinct: 7777
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Memory size: 61.6 KiB

Length

Max length: 132
Median length: 72
Mean length: 23.418339
Min length: 1

Characters and Unicode

Total characters: 184396
Distinct characters: 1659
Distinct categories: 17
Distinct scripts: 8
Distinct blocks: 14
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
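
As an illustration of those character properties, the sketch below counts the Unicode general categories in one of the 서명 sample titles using Python's standard `unicodedata` module. This is not necessarily how the profiling tool computes its tables, and the standard library does not expose script or block names, so only categories are shown. The helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters in `text` by two-letter Unicode general category."""
    return Counter(unicodedata.category(ch) for ch in text)


title = "아빠, 쿠키 주세요"  # second sample row of 서명
counts = category_counts(title)
print(dict(counts))
```

The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter (Hangul syllables fall here), `Zs` is Space Separator, and `Po` is Other Punctuation.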

Unique

Unique: 7680
Unique (%): 97.5%
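
The percentages and the mean length for 서명 follow from the counts reported for this variable. The sketch assumes each ratio is taken over the 7874 non-missing rows and that mean length is total characters divided by row count.

```python
# Counts copied from the 서명 section of the report.
n_rows = 7874
n_distinct = 7777        # values that occur at least once
n_unique = 7680          # values that occur exactly once
total_characters = 184396

distinct_pct = 100 * n_distinct / n_rows
unique_pct = 100 * n_unique / n_rows
mean_length = total_characters / n_rows
repeated_values = n_distinct - n_unique  # titles appearing more than once

print(f"{distinct_pct:.1f}% distinct, {unique_pct:.1f}% unique")
print(f"mean length {mean_length:.6f}, {repeated_values} repeated titles")
```

The gap between distinct and unique counts shows that a small number of titles were purchased under more than one record.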

Sample

1st row: 사랑한다고 말할 용기
2nd row: 아빠, 쿠키 주세요
3rd row: 숨바꼭질!
4th row: 소년과 살쾡이
5th row: 세종대왕을 찾아라
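
Most frequent words

Value | Count | Frequency (%)
(value lost in extraction) | 4151 | 8.5%
위한 | 320 | 0.7%
이야기 | 316 | 0.6%
2 | 251 | 0.5%
1 | 241 | 0.5%
(value lost in extraction) | 174 | 0.4%
과학 | 173 | 0.4%
3 | 133 | 0.3%
읽는 | 126 | 0.3%
나는 | 121 | 0.2%
Other values (18344) | 42975 | 87.7%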

Most occurring characters

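Value | Count | Frequency (%)
(space) | 45114 | 24.5%
(character lost in extraction) | 3393 | 1.8%
(character lost in extraction) | 3290 | 1.8%
(character lost in extraction) | 2816 | 1.5%
: | 2749 | 1.5%
(character lost in extraction) | 1857 | 1.0%
(character lost in extraction) | 1689 | 0.9%
(character lost in extraction) | 1648 | 0.9%
(character lost in extraction) | 1569 | 0.9%
(character lost in extraction) | 1551 | 0.8%
Other values (1649) | 118720 | 64.4%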

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 115711 | 62.8%
Space Separator | 45114 | 24.5%
Lowercase Letter | 7735 | 4.2%
Other Punctuation | 6209 | 3.4%
Decimal Number | 3511 | 1.9%
Uppercase Letter | 2393 | 1.3%
Open Punctuation | 1599 | 0.9%
Close Punctuation | 1599 | 0.9%
Math Symbol | 335 | 0.2%
Nonspacing Mark | 106 | 0.1%
Other values (7) | 84 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3393
 
2.9%
3290
 
2.8%
2816
 
2.4%
1857
 
1.6%
1689
 
1.5%
1648
 
1.4%
1569
 
1.4%
1551
 
1.3%
1519
 
1.3%
1499
 
1.3%
Other values (1494) 94880
82.0%
Lowercase Letter
ValueCountFrequency (%)
e 895
11.6%
o 720
 
9.3%
a 623
 
8.1%
i 602
 
7.8%
n 579
 
7.5%
t 532
 
6.9%
s 516
 
6.7%
r 492
 
6.4%
h 361
 
4.7%
l 318
 
4.1%
Other values (52) 2097
27.1%
Uppercase Letter
ValueCountFrequency (%)
E 203
 
8.5%
S 198
 
8.3%
I 188
 
7.9%
T 185
 
7.7%
A 157
 
6.6%
L 148
 
6.2%
N 111
 
4.6%
V 106
 
4.4%
O 101
 
4.2%
B 99
 
4.1%
Other values (17) 897
37.5%
Other Punctuation
ValueCountFrequency (%)
: 2749
44.3%
. 1188
19.1%
, 965
 
15.5%
! 532
 
8.6%
? 383
 
6.2%
· 174
 
2.8%
' 130
 
2.1%
& 28
 
0.5%
% 22
 
0.4%
# 10
 
0.2%
Other values (6) 28
 
0.5%
Nonspacing Mark
ValueCountFrequency (%)
22
20.8%
17
16.0%
14
13.2%
13
12.3%
9
8.5%
9
8.5%
7
 
6.6%
5
 
4.7%
5
 
4.7%
́ 2
 
1.9%
Other values (2) 3
 
2.8%
Decimal Number
ValueCountFrequency (%)
1 832
23.7%
2 682
19.4%
0 630
17.9%
3 391
11.1%
5 255
 
7.3%
4 251
 
7.1%
6 141
 
4.0%
9 123
 
3.5%
7 113
 
3.2%
8 93
 
2.6%
Math Symbol
ValueCountFrequency (%)
= 290
86.6%
+ 18
 
5.4%
~ 14
 
4.2%
× 7
 
2.1%
< 3
 
0.9%
> 3
 
0.9%
Open Punctuation
ValueCountFrequency (%)
( 1394
87.2%
[ 193
 
12.1%
7
 
0.4%
4
 
0.3%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1394
87.2%
] 193
 
12.1%
7
 
0.4%
4
 
0.3%
1
 
0.1%
Other Symbol
ValueCountFrequency (%)
8
80.0%
1
 
10.0%
1
 
10.0%
Other Number
ValueCountFrequency (%)
3
50.0%
3
50.0%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
45114
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 115141 | 62.4%
Common | 58449 | 31.7%
Latin | 10130 | 5.5%
Thai | 425 | 0.2%
Han | 160 | 0.1%
Hiragana | 66 | < 0.1%
Katakana | 21 | < 0.1%
Inherited | 4 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3393
 
2.9%
3290
 
2.9%
2816
 
2.4%
1857
 
1.6%
1689
 
1.5%
1648
 
1.4%
1569
 
1.4%
1551
 
1.3%
1519
 
1.3%
1499
 
1.3%
Other values (1276) 94310
81.9%
Han
ValueCountFrequency (%)
6
 
3.8%
4
 
2.5%
3
 
1.9%
3
 
1.9%
3
 
1.9%
2
 
1.2%
2
 
1.2%
2
 
1.2%
2
 
1.2%
2
 
1.2%
Other values (123) 131
81.9%
Latin
ValueCountFrequency (%)
e 895
 
8.8%
o 720
 
7.1%
a 623
 
6.2%
i 602
 
5.9%
n 579
 
5.7%
t 532
 
5.3%
s 516
 
5.1%
r 492
 
4.9%
h 361
 
3.6%
l 318
 
3.1%
Other values (81) 4492
44.3%
Common
ValueCountFrequency (%)
45114
77.2%
: 2749
 
4.7%
( 1394
 
2.4%
) 1394
 
2.4%
. 1188
 
2.0%
, 965
 
1.7%
1 832
 
1.4%
2 682
 
1.2%
0 630
 
1.1%
! 532
 
0.9%
Other values (42) 2969
 
5.1%
Thai
ValueCountFrequency (%)
27
 
6.4%
27
 
6.4%
22
 
5.2%
22
 
5.2%
19
 
4.5%
18
 
4.2%
17
 
4.0%
17
 
4.0%
16
 
3.8%
16
 
3.8%
Other values (38) 224
52.7%
Hiragana
ValueCountFrequency (%)
6
 
9.1%
5
 
7.6%
4
 
6.1%
4
 
6.1%
4
 
6.1%
3
 
4.5%
3
 
4.5%
3
 
4.5%
3
 
4.5%
3
 
4.5%
Other values (20) 28
42.4%
Katakana
ValueCountFrequency (%)
3
14.3%
2
 
9.5%
2
 
9.5%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Other values (7) 7
33.3%
Inherited
ValueCountFrequency (%)
́ 2
50.0%
̂ 2
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 115130 | 62.4%
ASCII | 68258 | 37.0%
Thai | 425 | 0.2%
None | 255 | 0.1%
CJK | 160 | 0.1%
Hiragana | 66 | < 0.1%
Latin Ext Additional | 43 | < 0.1%
Katakana | 21 | < 0.1%
Compat Jamo | 11 | < 0.1%
Misc Symbols | 8 | < 0.1%
Other values (4) | 19 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
45114
66.1%
: 2749
 
4.0%
( 1394
 
2.0%
) 1394
 
2.0%
. 1188
 
1.7%
, 965
 
1.4%
e 895
 
1.3%
1 832
 
1.2%
o 720
 
1.1%
2 682
 
1.0%
Other values (76) 12325
 
18.1%
Hangul
ValueCountFrequency (%)
3393
 
2.9%
3290
 
2.9%
2816
 
2.4%
1857
 
1.6%
1689
 
1.5%
1648
 
1.4%
1569
 
1.4%
1551
 
1.3%
1519
 
1.3%
1499
 
1.3%
Other values (1266) 94299
81.9%
None
ValueCountFrequency (%)
· 174
68.2%
7
 
2.7%
7
 
2.7%
× 7
 
2.7%
à 6
 
2.4%
á 5
 
2.0%
đ 5
 
2.0%
4
 
1.6%
4
 
1.6%
ô 4
 
1.6%
Other values (16) 32
 
12.5%
Thai
ValueCountFrequency (%)
27
 
6.4%
27
 
6.4%
22
 
5.2%
22
 
5.2%
19
 
4.5%
18
 
4.2%
17
 
4.0%
17
 
4.0%
16
 
3.8%
16
 
3.8%
Other values (38) 224
52.7%
Misc Symbols
ValueCountFrequency (%)
8
100.0%
Hiragana
ValueCountFrequency (%)
6
 
9.1%
5
 
7.6%
4
 
6.1%
4
 
6.1%
4
 
6.1%
3
 
4.5%
3
 
4.5%
3
 
4.5%
3
 
4.5%
3
 
4.5%
Other values (20) 28
42.4%
CJK
ValueCountFrequency (%)
6
 
3.8%
4
 
2.5%
3
 
1.9%
3
 
1.9%
3
 
1.9%
2
 
1.2%
2
 
1.2%
2
 
1.2%
2
 
1.2%
2
 
1.2%
Other values (123) 131
81.9%
Latin Ext Additional
ValueCountFrequency (%)
4
 
9.3%
4
 
9.3%
ế 4
 
9.3%
3
 
7.0%
3
 
7.0%
3
 
7.0%
2
 
4.7%
2
 
4.7%
2
 
4.7%
2
 
4.7%
Other values (11) 14
32.6%
Punctuation
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
Katakana
ValueCountFrequency (%)
3
14.3%
2
 
9.5%
2
 
9.5%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Other values (7) 7
33.3%
Enclosed Alphanum
ValueCountFrequency (%)
3
37.5%
3
37.5%
1
 
12.5%
1
 
12.5%
Compat Jamo
ValueCountFrequency (%)
2
18.2%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
Diacriticals
ValueCountFrequency (%)
́ 2
50.0%
̂ 2
50.0%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%

저자
Text

Distinct: 5974
Distinct (%): 75.9%
Missing: 0
Missing (%): 0.0%
Memory size: 61.6 KiB

Length

Max length: 69
Median length: 50
Mean length: 7.6534163
Min length: 2

Characters and Unicode

Total characters: 60263
Distinct characters: 1030
Distinct categories: 14
Distinct scripts: 7
Distinct blocks: 11
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 4987
Unique (%): 63.3%

Sample

1st row: 황선우 지음
2nd row: 데이비드 에즈라 스테인 지음
3rd row: Lolita SECHAN
4th row: 우상구 지음
5th row: 김진 지음
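
Most frequent words

Value | Count | Frequency (%)
지음 | 6012 | 32.5%
(value lost in extraction) | 515 | 2.8%
(value lost in extraction) | 185 | 1.0%
원작 | 144 | 0.8%
글·그림 | 121 | 0.7%
by | 73 | 0.4%
편집 | 59 | 0.3%
한자교연 | 59 | 0.3%
편집부 | 51 | 0.3%
엮음 | 45 | 0.2%
Other values (6941) | 11257 | 60.8%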

Most occurring characters

ValueCountFrequency (%)
10647
 
17.7%
6392
 
10.6%
6076
 
10.1%
1435
 
2.4%
883
 
1.5%
841
 
1.4%
747
 
1.2%
679
 
1.1%
535
 
0.9%
433
 
0.7%
Other values (1020) 31595
52.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 45594 | 75.7%
Space Separator | 10647 | 17.7%
Lowercase Letter | 2443 | 4.1%
Uppercase Letter | 654 | 1.1%
Other Punctuation | 364 | 0.6%
Open Punctuation | 204 | 0.3%
Close Punctuation | 203 | 0.3%
Nonspacing Mark | 71 | 0.1%
Dash Punctuation | 34 | 0.1%
Decimal Number | 26 | < 0.1%
Other values (4) | 23 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6392
 
14.0%
6076
 
13.3%
1435
 
3.1%
883
 
1.9%
841
 
1.8%
747
 
1.6%
679
 
1.5%
535
 
1.2%
433
 
0.9%
410
 
0.9%
Other values (917) 27163
59.6%
Lowercase Letter
ValueCountFrequency (%)
e 272
11.1%
a 243
 
9.9%
n 209
 
8.6%
i 194
 
7.9%
r 179
 
7.3%
t 171
 
7.0%
o 160
 
6.5%
l 153
 
6.3%
y 118
 
4.8%
s 104
 
4.3%
Other values (28) 640
26.2%
Uppercase Letter
ValueCountFrequency (%)
S 70
 
10.7%
B 63
 
9.6%
M 62
 
9.5%
D 43
 
6.6%
T 41
 
6.3%
H 37
 
5.7%
A 35
 
5.4%
K 31
 
4.7%
J 30
 
4.6%
C 28
 
4.3%
Other values (16) 214
32.7%
Nonspacing Mark
ValueCountFrequency (%)
19
26.8%
12
16.9%
9
12.7%
8
11.3%
7
 
9.9%
5
 
7.0%
4
 
5.6%
4
 
5.6%
2
 
2.8%
1
 
1.4%
Decimal Number
ValueCountFrequency (%)
0 6
23.1%
3 5
19.2%
1 5
19.2%
2 5
19.2%
7 1
 
3.8%
6 1
 
3.8%
4 1
 
3.8%
5 1
 
3.8%
8 1
 
3.8%
Other Punctuation
ValueCountFrequency (%)
· 146
40.1%
. 138
37.9%
: 34
 
9.3%
; 31
 
8.5%
, 12
 
3.3%
/ 2
 
0.5%
& 1
 
0.3%
Math Symbol
ValueCountFrequency (%)
< 9
45.0%
> 9
45.0%
+ 1
 
5.0%
× 1
 
5.0%
Open Punctuation
ValueCountFrequency (%)
[ 196
96.1%
( 8
 
3.9%
Close Punctuation
ValueCountFrequency (%)
] 195
96.1%
) 8
 
3.9%
Space Separator
ValueCountFrequency (%)
10647
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 34
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 45279 | 75.1%
Common | 11501 | 19.1%
Latin | 3097 | 5.1%
Thai | 256 | 0.4%
Han | 100 | 0.2%
Hiragana | 23 | < 0.1%
Katakana | 7 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6392
 
14.1%
6076
 
13.4%
1435
 
3.2%
883
 
2.0%
841
 
1.9%
747
 
1.6%
679
 
1.5%
535
 
1.2%
433
 
1.0%
410
 
0.9%
Other values (789) 26848
59.3%
Han
ValueCountFrequency (%)
13
 
13.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (63) 66
66.0%
Latin
ValueCountFrequency (%)
e 272
 
8.8%
a 243
 
7.8%
n 209
 
6.7%
i 194
 
6.3%
r 179
 
5.8%
t 171
 
5.5%
o 160
 
5.2%
l 153
 
4.9%
y 118
 
3.8%
s 104
 
3.4%
Other values (54) 1294
41.8%
Thai
ValueCountFrequency (%)
29
 
11.3%
24
 
9.4%
22
 
8.6%
19
 
7.4%
15
 
5.9%
12
 
4.7%
10
 
3.9%
9
 
3.5%
8
 
3.1%
8
 
3.1%
Other values (33) 100
39.1%
Common
ValueCountFrequency (%)
10647
92.6%
[ 196
 
1.7%
] 195
 
1.7%
· 146
 
1.3%
. 138
 
1.2%
: 34
 
0.3%
- 34
 
0.3%
; 31
 
0.3%
, 12
 
0.1%
< 9
 
0.1%
Other values (19) 59
 
0.5%
Hiragana
ValueCountFrequency (%)
3
13.0%
2
 
8.7%
2
 
8.7%
2
 
8.7%
2
 
8.7%
2
 
8.7%
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (5) 5
21.7%
Katakana
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 45271 | 75.1%
ASCII | 14424 | 23.9%
Thai | 256 | 0.4%
None | 159 | 0.3%
CJK | 100 | 0.2%
Hiragana | 23 | < 0.1%
Latin Ext Additional | 12 | < 0.1%
Compat Jamo | 8 | < 0.1%
Katakana | 7 | < 0.1%
Punctuation | 2 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10647
73.8%
e 272
 
1.9%
a 243
 
1.7%
n 209
 
1.4%
[ 196
 
1.4%
] 195
 
1.4%
i 194
 
1.3%
r 179
 
1.2%
t 171
 
1.2%
o 160
 
1.1%
Other values (65) 1958
 
13.6%
Hangul
ValueCountFrequency (%)
6392
 
14.1%
6076
 
13.4%
1435
 
3.2%
883
 
2.0%
841
 
1.9%
747
 
1.7%
679
 
1.5%
535
 
1.2%
433
 
1.0%
410
 
0.9%
Other values (787) 26840
59.3%
None
ValueCountFrequency (%)
· 146
91.8%
é 3
 
1.9%
ơ 2
 
1.3%
ư 2
 
1.3%
à 2
 
1.3%
ñ 1
 
0.6%
ä 1
 
0.6%
á 1
 
0.6%
× 1
 
0.6%
Thai
ValueCountFrequency (%)
29
 
11.3%
24
 
9.4%
22
 
8.6%
19
 
7.4%
15
 
5.9%
12
 
4.7%
10
 
3.9%
9
 
3.5%
8
 
3.1%
8
 
3.1%
Other values (33) 100
39.1%
CJK
ValueCountFrequency (%)
13
 
13.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (63) 66
66.0%
Compat Jamo
ValueCountFrequency (%)
7
87.5%
1
 
12.5%
Hiragana
ValueCountFrequency (%)
3
13.0%
2
 
8.7%
2
 
8.7%
2
 
8.7%
2
 
8.7%
2
 
8.7%
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (5) 5
21.7%
Latin Ext Additional
ValueCountFrequency (%)
3
25.0%
3
25.0%
2
16.7%
ế 2
16.7%
1
 
8.3%
1
 
8.3%
Katakana
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%

발행자
Unsupported

REJECTED  UNSUPPORTED 

Missing: 2
Missing (%): < 0.1%
Memory size: 61.6 KiB

발행년
Unsupported

REJECTED  UNSUPPORTED 

Missing: 0
Missing (%): 0.0%
Memory size: 61.6 KiB

책수
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 61.6 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 3
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 5544 | 70.4%
2 | 2228 | 28.3%
3 | 83 | 1.1%
4 | 15 | 0.2%
5 | 4 | 0.1%
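
The 58.5% imbalance figure in the alerts can be reproduced from these counts if the score is taken to be one minus the Shannon entropy of the class shares, normalised by the logarithm of the number of classes. That definition is an assumption here, not something the report states, although it agrees with the reported value.

```python
import math

# Counts of 책수 copied from the common values table.
counts = {"1": 5544, "2": 2228, "3": 83, "4": 15, "5": 4}

total = sum(counts.values())
shares = {value: count / total for value, count in counts.items()}

# Shannon entropy of the class distribution (natural log).
entropy = -sum(p * math.log(p) for p in shares.values())

# Assumed imbalance score: 0 for a uniform distribution, 1 for a single class.
imbalance = 1 - entropy / math.log(len(counts))

print(f"share of value 1: {100 * shares['1']:.1f}%")
print(f"imbalance: {100 * imbalance:.1f}%")
```

A score this high reflects that two values (1 and 2) account for almost all rows, so any model or summary that treats 책수 as categorical will see very few examples of 3, 4, or 5.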

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: common values of 책수; image not preserved]

단가
Unsupported

REJECTED  UNSUPPORTED 

Missing: 0
Missing (%): 0.0%
Memory size: 61.6 KiB

금액
Real number (ℝ)

HIGH CORRELATION 

Distinct: 221
Distinct (%): 2.8%
Missing: 1
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 20394.752
Minimum: 0
Maximum: 220014
Zeros: 1
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 69.3 KiB

Quantile statistics

Minimum: 0
5th percentile: 11000
Q1: 14800
Median: 18000
Q3: 26000
95th percentile: 34000
Maximum: 220014
Range: 220014
Interquartile range (IQR): 11200

Descriptive statistics

Standard deviation: 8495.5874
Coefficient of variation (CV): 0.41655752
Kurtosis: 46.943726
Mean: 20394.752
Median Absolute Deviation (MAD): 5000
Skewness: 3.2474052
Sum: 1.6056788 × 10^8
Variance: 72175005
Monotonicity: Not monotonic
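
The summary figures for 금액 are mutually consistent, which can be checked from the rounded values alone. The sketch below also computes an upper outlier fence with Tukey's 1.5 × IQR rule. That rule is not part of the report and is used here only to put the maximum of 220014 in context.

```python
# Rounded figures copied from the 금액 statistics.
mean = 20394.752
std = 8495.5874
total = 1.6056788e8
q1, q3 = 14800, 26000
maximum = 220014

cv = std / mean                # coefficient of variation
variance = std ** 2
non_missing = total / mean     # number of values behind the mean
iqr = q3 - q1

# Tukey's rule (an added convention, not from the report).
upper_fence = q3 + 1.5 * iqr

print(f"CV {cv:.8f}, variance {variance:.0f}")
print(f"non-missing values: {non_missing:.1f}")
print(f"IQR {iqr}, upper fence {upper_fence}, max above fence: {maximum > upper_fence}")
```

The implied count of 7873 non-missing values matches the one missing cell reported for this variable, and the strong positive skewness and kurtosis are in line with a maximum far above the fence.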
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
15000 | 536 | 6.8%
16000 | 471 | 6.0%
12000 | 434 | 5.5%
18000 | 428 | 5.4%
24000 | 420 | 5.3%
26000 | 411 | 5.2%
13000 | 385 | 4.9%
28000 | 355 | 4.5%
30000 | 331 | 4.2%
17000 | 313 | 4.0%
Other values (211) | 3789 | 48.1%

Minimum 10 values

Value | Count | Frequency (%)
0 | 1 | < 0.1%
2400 | 1 | < 0.1%
3000 | 4 | 0.1%
5000 | 1 | < 0.1%
5500 | 1 | < 0.1%
6000 | 3 | < 0.1%
6500 | 14 | 0.2%
6600 | 59 | 0.7%
7000 | 2 | < 0.1%
7500 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
220014 | 1 | < 0.1%
125000 | 1 | < 0.1%
112000 | 1 | < 0.1%
98000 | 1 | < 0.1%
96000 | 1 | < 0.1%
92000 | 1 | < 0.1%
81000 | 1 | < 0.1%
72000 | 1 | < 0.1%
70000 | 1 | < 0.1%
67500 | 1 | < 0.1%

Interactions

[Four interaction plots; images not preserved]

Correlations

[Correlation heatmap; image not preserved]

 | 번호 | 책수 | 금액
번호 | 1.000 | 0.196 | 0.100
책수 | 0.196 | 1.000 | 0.666
금액 | 0.100 | 0.666 | 1.000

[Correlation heatmap; image not preserved]

 | 번호 | 금액 | 책수
번호 | 1.000 | 0.036 | 0.082
금액 | 0.036 | 1.000 | 0.510
책수 | 0.082 | 0.510 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

(index) | 번호 | 서명 | 저자 | 발행자 | 발행년 | 책수 | 단가 | 금액
0 | 1 | 사랑한다고 말할 용기 | 황선우 지음 | 책읽는수요일 | 2021 | 3 | 14800 | 44400
1 | 2 | 아빠, 쿠키 주세요 | 데이비드 에즈라 스테인 지음 | 시공주니어 | 2021 | 1 | 13000 | 13000
2 | 3 | 숨바꼭질! | Lolita SECHAN | 바둑이하우스 | 2021 | 1 | 15000 | 15000
3 | 4 | 소년과 살쾡이 | 우상구 지음 | 청어람주니어 | 2015 | 1 | 9800 | 9800
4 | 5 | 세종대왕을 찾아라 | 김진 지음 | 천개의바람 | 2021 | 1 | 13000 | 13000
5 | 6 | 자동 물시계 자격루 | 김명희 지음 | 푸른숲주니어 | 2021 | 1 | 14800 | 14800
6 | 7 | (주식 단기투자 필독서) 주식의 道 | 생존재테크 지음 | 트러스트북스 | 2021 | 1 | 18800 | 18800
7 | 8 | 몸의 기분 | 마숑 지음 | fifo | 2021 | 1 | 13800 | 13800
8 | 9 | 내 인생도 편집이 되나요? | 이지은 지음 | | 2021 | 1 | 15000 | 15000
9 | 10 | 들어 봐, 우릴 위해 만든 노래야 | 이환희 | 후마니타스 | 2021 | 1 | 18000 | 18000

Last rows

(index) | 번호 | 서명 | 저자 | 발행자 | 발행년 | 책수 | 단가 | 금액
7864 | 7865 | Oliver twist | retold from the story by Charles Dickens | Kiddo | 2021 | 1 | 33400 | 33400
7865 | 7866 | (The)wizard of Oz | retold from the story by L. Frank Baum | Kiddo | 2021 | 1 | 33400 | 33400
7866 | 7867 | Black beauty | retold from the story by Anna Seweell | Kiddo | 2021 | 1 | 33400 | 33400
7867 | 7868 | Anne of green gables | retold from the story by L. M. Montgomery | Kiddo | 2021 | 1 | 33400 | 33400
7868 | 7869 | My home | Mind Books | Mind Books | 2021 | 1 | 12600 | 12600
7869 | 7870 | Body | Mind Books | Mind Books | 2021 | 1 | 12600 | 12600
7870 | 7871 | Puwede po ba tayong magbasa ng aklat | kuwento ni Lawrence Schimel | Kahel Press | 2021 | 1 | 15900 | 15900
7871 | 7872 | Kindness | Agnes de Bezenac | Lampara Books | 2021 | 1 | 15900 | 15900
7872 | 7873 | Respect | Agnes de Bezenac | Lampara Books | 2021 | 1 | 15900 | 15900
7873 | 7874 | Pitong tsinelas | kuwento ni Divine Gil Reyes | Tahanan Books | 2021 | 1 | 15900 | 15900
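
In the sample rows, 금액 appears to equal 책수 × 단가, which would account for the high correlation reported between 금액 and 책수. This relationship is inferred from the sample only and is not stated in the report. The sketch below checks it on a few rows copied by hand; the helper `amount_matches` is hypothetical.

```python
# (번호, 책수, 단가, 금액) copied from the first and last sample rows.
sample_rows = [
    (1, 3, 14800, 44400),
    (2, 1, 13000, 13000),
    (4, 1, 9800, 9800),
    (7865, 1, 33400, 33400),
    (7874, 1, 15900, 15900),
]


def amount_matches(copies, unit_price, amount):
    """Return True when amount equals copies multiplied by unit price."""
    return copies * unit_price == amount


# Collect the 번호 of any row that breaks the assumed relationship.
mismatches = [row[0] for row in sample_rows if not amount_matches(*row[1:])]
print(mismatches)
```

Running the same check over the full file would first require cleaning 단가, which the report flags as an unsupported type.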