Overview

Dataset statistics

Number of variables10
Number of observations707
Missing cells2
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory59.5 KiB
Average record size in memory86.2 B

Variable types

Numeric5
Categorical2
Text3

Dataset

Description충청남도 보령시립도서관의 2023년도 희망도서 구입목록 데이터입니다. 구입 월, 구분(성인/아동), 분류, 도서명, 저자명, 발행자, 단가, 수량, 금액 정보를 제공합니다. (기존 보령시 중앙도서관의 명칭이 "보령시립도서관"으로 변경되었습니다.)
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=97&beforeMenuCd=DOM_000000201001001000&publicdatapk=15104393

Alerts

연번 is highly overall correlated with 구입 월High correlation
구입 월 is highly overall correlated with 연번High correlation
단가 is highly overall correlated with 금액High correlation
금액 is highly overall correlated with 단가 and 1 other fieldsHigh correlation
수량 is highly overall correlated with 금액High correlation
수량 is highly imbalanced (97.1%)Imbalance
연번 has unique valuesUnique
도서명 has unique valuesUnique
분류 has 23 (3.3%) zerosZeros

Reproduction

Analysis started2024-01-09 20:48:49.187857
Analysis finished2024-01-09 20:48:52.104107
Duration2.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct707
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean354
Minimum1
Maximum707
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.3 KiB
2024-01-10T05:48:52.165236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile36.3
Q1177.5
median354
Q3530.5
95-th percentile671.7
Maximum707
Range706
Interquartile range (IQR)353

Descriptive statistics

Standard deviation204.23761
Coefficient of variation (CV)0.57694239
Kurtosis-1.2
Mean354
Median Absolute Deviation (MAD)177
Skewness0
Sum250278
Variance41713
MonotonicityStrictly increasing
2024-01-10T05:48:52.561584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
444 1
 
0.1%
468 1
 
0.1%
469 1
 
0.1%
470 1
 
0.1%
471 1
 
0.1%
472 1
 
0.1%
473 1
 
0.1%
474 1
 
0.1%
475 1
 
0.1%
Other values (697) 697
98.6%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
707 1
0.1%
706 1
0.1%
705 1
0.1%
704 1
0.1%
703 1
0.1%
702 1
0.1%
701 1
0.1%
700 1
0.1%
699 1
0.1%
698 1
0.1%

구입 월
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.7666195
Minimum1
Maximum11
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.3 KiB
2024-01-10T05:48:52.668654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q38
95-th percentile11
Maximum11
Range10
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.167311
Coefficient of variation (CV)0.54924916
Kurtosis-1.2239839
Mean5.7666195
Median Absolute Deviation (MAD)3
Skewness0.10017138
Sum4077
Variance10.031859
MonotonicityIncreasing
2024-01-10T05:48:52.760176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
4 105
14.9%
1 74
10.5%
8 72
10.2%
3 68
9.6%
6 62
8.8%
9 62
8.8%
2 61
8.6%
11 61
8.6%
7 60
8.5%
10 50
7.1%
ValueCountFrequency (%)
1 74
10.5%
2 61
8.6%
3 68
9.6%
4 105
14.9%
5 32
 
4.5%
6 62
8.8%
7 60
8.5%
8 72
10.2%
9 62
8.8%
10 50
7.1%
ValueCountFrequency (%)
11 61
8.6%
10 50
7.1%
9 62
8.8%
8 72
10.2%
7 60
8.5%
6 62
8.8%
5 32
 
4.5%
4 105
14.9%
3 68
9.6%
2 61
8.6%

구분
Categorical

Distinct3
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
일반
508 
아동
162 
유아
 
37

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row아동
4th row아동
5th row아동

Common Values

ValueCountFrequency (%)
일반 508
71.9%
아동 162
 
22.9%
유아 37
 
5.2%

Length

2024-01-10T05:48:52.856091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:48:52.958361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 508
71.9%
아동 162
 
22.9%
유아 37
 
5.2%

분류
Real number (ℝ)

ZEROS 

Distinct10
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean577.51061
Minimum0
Maximum900
Zeros23
Zeros (%)3.3%
Negative0
Negative (%)0.0%
Memory size6.3 KiB
2024-01-10T05:48:53.066621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile100
Q1300
median800
Q3800
95-th percentile800
Maximum900
Range900
Interquartile range (IQR)500

Descriptive statistics

Standard deviation274.43586
Coefficient of variation (CV)0.47520488
Kurtosis-1.0476515
Mean577.51061
Median Absolute Deviation (MAD)100
Skewness-0.63881683
Sum408300
Variance75315.041
MonotonicityNot monotonic
2024-01-10T05:48:53.182624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
800 339
47.9%
300 109
 
15.4%
100 63
 
8.9%
400 52
 
7.4%
600 40
 
5.7%
500 40
 
5.7%
900 33
 
4.7%
0 23
 
3.3%
200 4
 
0.6%
700 4
 
0.6%
ValueCountFrequency (%)
0 23
 
3.3%
100 63
 
8.9%
200 4
 
0.6%
300 109
 
15.4%
400 52
 
7.4%
500 40
 
5.7%
600 40
 
5.7%
700 4
 
0.6%
800 339
47.9%
900 33
 
4.7%
ValueCountFrequency (%)
900 33
 
4.7%
800 339
47.9%
700 4
 
0.6%
600 40
 
5.7%
500 40
 
5.7%
400 52
 
7.4%
300 109
 
15.4%
200 4
 
0.6%
100 63
 
8.9%
0 23
 
3.3%

도서명
Text

UNIQUE 

Distinct707
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
2024-01-10T05:48:53.515567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length34
Mean length12.015559
Min length2

Characters and Unicode

Total characters8495
Distinct characters725
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique707 ?
Unique (%)100.0%

Sample

1st row파이어족의 재테크
2nd row유튜브 이야기
3rd row비밀의 보석 가게 마석관.2
4th row기묘한 모모한약방 3
5th row유령고양이 후쿠코.3
ValueCountFrequency (%)
27
 
1.2%
2 13
 
0.6%
이야기 11
 
0.5%
1 11
 
0.5%
법칙 10
 
0.4%
위한 10
 
0.4%
3 10
 
0.4%
정글의 9
 
0.4%
가게 9
 
0.4%
나는 8
 
0.3%
Other values (1740) 2180
94.9%
2024-01-10T05:48:53.955496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1600
 
18.8%
203
 
2.4%
171
 
2.0%
134
 
1.6%
113
 
1.3%
96
 
1.1%
91
 
1.1%
86
 
1.0%
85
 
1.0%
84
 
1.0%
Other values (715) 5832
68.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6305
74.2%
Space Separator 1600
 
18.8%
Decimal Number 273
 
3.2%
Other Punctuation 162
 
1.9%
Uppercase Letter 45
 
0.5%
Open Punctuation 28
 
0.3%
Close Punctuation 28
 
0.3%
Dash Punctuation 26
 
0.3%
Lowercase Letter 25
 
0.3%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
203
 
3.2%
171
 
2.7%
134
 
2.1%
113
 
1.8%
96
 
1.5%
91
 
1.4%
86
 
1.4%
85
 
1.3%
84
 
1.3%
83
 
1.3%
Other values (666) 5159
81.8%
Uppercase Letter
ValueCountFrequency (%)
T 9
20.0%
S 6
13.3%
C 5
11.1%
P 5
11.1%
V 3
 
6.7%
D 3
 
6.7%
I 3
 
6.7%
L 3
 
6.7%
M 2
 
4.4%
X 2
 
4.4%
Other values (4) 4
8.9%
Lowercase Letter
ValueCountFrequency (%)
i 4
16.0%
e 4
16.0%
t 3
12.0%
v 3
12.0%
o 2
8.0%
a 2
8.0%
s 2
8.0%
w 1
 
4.0%
h 1
 
4.0%
p 1
 
4.0%
Other values (2) 2
8.0%
Decimal Number
ValueCountFrequency (%)
1 77
28.2%
2 59
21.6%
3 46
16.8%
0 27
 
9.9%
5 22
 
8.1%
4 17
 
6.2%
6 9
 
3.3%
8 7
 
2.6%
9 6
 
2.2%
7 3
 
1.1%
Other Punctuation
ValueCountFrequency (%)
. 77
47.5%
, 34
21.0%
: 25
 
15.4%
! 21
 
13.0%
& 2
 
1.2%
? 2
 
1.2%
/ 1
 
0.6%
Space Separator
ValueCountFrequency (%)
1600
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6305
74.2%
Common 2120
 
25.0%
Latin 70
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
203
 
3.2%
171
 
2.7%
134
 
2.1%
113
 
1.8%
96
 
1.5%
91
 
1.4%
86
 
1.4%
85
 
1.3%
84
 
1.3%
83
 
1.3%
Other values (666) 5159
81.8%
Latin
ValueCountFrequency (%)
T 9
 
12.9%
S 6
 
8.6%
C 5
 
7.1%
P 5
 
7.1%
i 4
 
5.7%
e 4
 
5.7%
V 3
 
4.3%
t 3
 
4.3%
D 3
 
4.3%
I 3
 
4.3%
Other values (16) 25
35.7%
Common
ValueCountFrequency (%)
1600
75.5%
1 77
 
3.6%
. 77
 
3.6%
2 59
 
2.8%
3 46
 
2.2%
, 34
 
1.6%
( 28
 
1.3%
) 28
 
1.3%
0 27
 
1.3%
- 26
 
1.2%
Other values (13) 118
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6305
74.2%
ASCII 2189
 
25.8%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1600
73.1%
1 77
 
3.5%
. 77
 
3.5%
2 59
 
2.7%
3 46
 
2.1%
, 34
 
1.6%
( 28
 
1.3%
) 28
 
1.3%
0 27
 
1.2%
- 26
 
1.2%
Other values (38) 187
 
8.5%
Hangul
ValueCountFrequency (%)
203
 
3.2%
171
 
2.7%
134
 
2.1%
113
 
1.8%
96
 
1.5%
91
 
1.4%
86
 
1.4%
85
 
1.3%
84
 
1.3%
83
 
1.3%
Other values (666) 5159
81.8%
Punctuation
ValueCountFrequency (%)
1
100.0%
Distinct558
Distinct (%)78.9%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
2024-01-10T05:48:54.262186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length5.0806223
Min length2

Characters and Unicode

Total characters3592
Distinct characters446
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique484 ?
Unique (%)68.5%

Sample

1st row신현정+신영주
2nd row고몽
3rd row히로시마레이코
4th row히로시마레이코
5th row히로시마레이코
ValueCountFrequency (%)
23
 
2.2%
흔한남매 10
 
1.0%
제작팀 9
 
0.9%
레이코 8
 
0.8%
sbs 8
 
0.8%
정글의 8
 
0.8%
법칙 8
 
0.8%
원작 8
 
0.8%
이수지 8
 
0.8%
히로시마 8
 
0.8%
Other values (746) 944
90.6%
2024-01-10T05:48:54.693768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
335
 
9.3%
160
 
4.5%
98
 
2.7%
80
 
2.2%
74
 
2.1%
55
 
1.5%
44
 
1.2%
43
 
1.2%
41
 
1.1%
38
 
1.1%
Other values (436) 2624
73.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3088
86.0%
Space Separator 335
 
9.3%
Uppercase Letter 65
 
1.8%
Lowercase Letter 41
 
1.1%
Math Symbol 26
 
0.7%
Other Punctuation 21
 
0.6%
Close Punctuation 8
 
0.2%
Open Punctuation 8
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
160
 
5.2%
98
 
3.2%
80
 
2.6%
74
 
2.4%
55
 
1.8%
44
 
1.4%
43
 
1.4%
41
 
1.3%
38
 
1.2%
38
 
1.2%
Other values (396) 2417
78.3%
Uppercase Letter
ValueCountFrequency (%)
S 20
30.8%
B 10
15.4%
J 7
 
10.8%
T 4
 
6.2%
K 3
 
4.6%
R 3
 
4.6%
N 3
 
4.6%
D 3
 
4.6%
G 2
 
3.1%
E 2
 
3.1%
Other values (7) 8
 
12.3%
Lowercase Letter
ValueCountFrequency (%)
z 6
14.6%
e 5
12.2%
n 4
9.8%
o 4
9.8%
a 3
7.3%
s 3
7.3%
r 3
7.3%
t 3
7.3%
v 2
 
4.9%
m 2
 
4.9%
Other values (5) 6
14.6%
Math Symbol
ValueCountFrequency (%)
+ 24
92.3%
< 1
 
3.8%
> 1
 
3.8%
Other Punctuation
ValueCountFrequency (%)
. 16
76.2%
, 5
 
23.8%
Space Separator
ValueCountFrequency (%)
335
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3088
86.0%
Common 398
 
11.1%
Latin 106
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
160
 
5.2%
98
 
3.2%
80
 
2.6%
74
 
2.4%
55
 
1.8%
44
 
1.4%
43
 
1.4%
41
 
1.3%
38
 
1.2%
38
 
1.2%
Other values (396) 2417
78.3%
Latin
ValueCountFrequency (%)
S 20
18.9%
B 10
 
9.4%
J 7
 
6.6%
z 6
 
5.7%
e 5
 
4.7%
T 4
 
3.8%
n 4
 
3.8%
o 4
 
3.8%
a 3
 
2.8%
s 3
 
2.8%
Other values (22) 40
37.7%
Common
ValueCountFrequency (%)
335
84.2%
+ 24
 
6.0%
. 16
 
4.0%
) 8
 
2.0%
( 8
 
2.0%
, 5
 
1.3%
< 1
 
0.3%
> 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3088
86.0%
ASCII 504
 
14.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
335
66.5%
+ 24
 
4.8%
S 20
 
4.0%
. 16
 
3.2%
B 10
 
2.0%
) 8
 
1.6%
( 8
 
1.6%
J 7
 
1.4%
z 6
 
1.2%
, 5
 
1.0%
Other values (30) 65
 
12.9%
Hangul
ValueCountFrequency (%)
160
 
5.2%
98
 
3.2%
80
 
2.6%
74
 
2.4%
55
 
1.8%
44
 
1.4%
43
 
1.4%
41
 
1.3%
38
 
1.2%
38
 
1.2%
Other values (396) 2417
78.3%
Distinct365
Distinct (%)51.8%
Missing2
Missing (%)0.3%
Memory size5.7 KiB
2024-01-10T05:48:54.949965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length4.3659574
Min length1

Characters and Unicode

Total characters3078
Distinct characters372
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique242 ?
Unique (%)34.3%

Sample

1st row쌤앤파커스
2nd row성안당
3rd row길벗스쿨
4th row미래엔아이세움
5th row주니어김영사
ValueCountFrequency (%)
영상출판미디어 24
 
3.4%
미래엔아이세움 19
 
2.7%
위즈덤하우스 16
 
2.3%
주니어김영사 15
 
2.1%
문학동네 14
 
2.0%
비룡소 12
 
1.7%
김영사 10
 
1.4%
알에이치코리아 9
 
1.3%
창비 9
 
1.3%
민음사 9
 
1.3%
Other values (357) 570
80.6%
2024-01-10T05:48:55.326884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
158
 
5.1%
100
 
3.2%
96
 
3.1%
92
 
3.0%
91
 
3.0%
79
 
2.6%
67
 
2.2%
61
 
2.0%
57
 
1.9%
53
 
1.7%
Other values (362) 2224
72.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3017
98.0%
Uppercase Letter 30
 
1.0%
Lowercase Letter 14
 
0.5%
Decimal Number 12
 
0.4%
Space Separator 2
 
0.1%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
158
 
5.2%
100
 
3.3%
96
 
3.2%
92
 
3.0%
91
 
3.0%
79
 
2.6%
67
 
2.2%
61
 
2.0%
57
 
1.9%
53
 
1.8%
Other values (335) 2163
71.7%
Uppercase Letter
ValueCountFrequency (%)
B 6
20.0%
O 6
20.0%
S 4
13.3%
K 3
10.0%
E 2
 
6.7%
X 1
 
3.3%
F 1
 
3.3%
T 1
 
3.3%
N 1
 
3.3%
I 1
 
3.3%
Other values (4) 4
13.3%
Lowercase Letter
ValueCountFrequency (%)
i 5
35.7%
n 3
21.4%
e 2
 
14.3%
d 2
 
14.3%
r 1
 
7.1%
s 1
 
7.1%
Decimal Number
ValueCountFrequency (%)
2 6
50.0%
1 5
41.7%
3 1
 
8.3%
Space Separator
ValueCountFrequency (%)
2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Other Punctuation
ValueCountFrequency (%)
: 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3017
98.0%
Latin 44
 
1.4%
Common 17
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
158
 
5.2%
100
 
3.3%
96
 
3.2%
92
 
3.0%
91
 
3.0%
79
 
2.6%
67
 
2.2%
61
 
2.0%
57
 
1.9%
53
 
1.8%
Other values (335) 2163
71.7%
Latin
ValueCountFrequency (%)
B 6
13.6%
O 6
13.6%
i 5
11.4%
S 4
 
9.1%
n 3
 
6.8%
K 3
 
6.8%
e 2
 
4.5%
E 2
 
4.5%
d 2
 
4.5%
X 1
 
2.3%
Other values (10) 10
22.7%
Common
ValueCountFrequency (%)
2 6
35.3%
1 5
29.4%
2
 
11.8%
) 1
 
5.9%
( 1
 
5.9%
: 1
 
5.9%
3 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3017
98.0%
ASCII 61
 
2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
158
 
5.2%
100
 
3.3%
96
 
3.2%
92
 
3.0%
91
 
3.0%
79
 
2.6%
67
 
2.2%
61
 
2.0%
57
 
1.9%
53
 
1.8%
Other values (335) 2163
71.7%
ASCII
ValueCountFrequency (%)
B 6
 
9.8%
O 6
 
9.8%
2 6
 
9.8%
1 5
 
8.2%
i 5
 
8.2%
S 4
 
6.6%
n 3
 
4.9%
K 3
 
4.9%
e 2
 
3.3%
E 2
 
3.3%
Other values (17) 19
31.1%

단가
Real number (ℝ)

HIGH CORRELATION 

Distinct64
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15884.441
Minimum6500
Maximum56000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.3 KiB
2024-01-10T05:48:55.457750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6500
5-th percentile10000
Q113000
median15000
Q316950
95-th percentile26000
Maximum56000
Range49500
Interquartile range (IQR)3950

Descriptive statistics

Standard deviation5285.1908
Coefficient of variation (CV)0.33272752
Kurtosis9.8820281
Mean15884.441
Median Absolute Deviation (MAD)2000
Skewness2.5092207
Sum11230300
Variance27933242
MonotonicityNot monotonic
2024-01-10T05:48:55.578520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15000 79
 
11.2%
12000 77
 
10.9%
16000 62
 
8.8%
14000 52
 
7.4%
13000 49
 
6.9%
18000 25
 
3.5%
16800 24
 
3.4%
17000 22
 
3.1%
11000 22
 
3.1%
14800 18
 
2.5%
Other values (54) 277
39.2%
ValueCountFrequency (%)
6500 3
 
0.4%
8000 3
 
0.4%
8500 3
 
0.4%
9000 9
1.3%
9800 4
 
0.6%
10000 15
2.1%
10500 5
 
0.7%
10800 4
 
0.6%
11000 22
3.1%
11200 1
 
0.1%
ValueCountFrequency (%)
56000 1
 
0.1%
45000 1
 
0.1%
43000 1
 
0.1%
42000 1
 
0.1%
39000 1
 
0.1%
38000 2
0.3%
36000 3
0.4%
35000 1
 
0.1%
34000 1
 
0.1%
33000 2
0.3%

수량
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size5.7 KiB
1
703 
2
 
2
4
 
1
10
 
1

Length

Max length2
Median length1
Mean length1.0014144
Min length1

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 703
99.4%
2 2
 
0.3%
4 1
 
0.1%
10 1
 
0.1%

Length

2024-01-10T05:48:55.688012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:48:55.784589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 703
99.4%
2 2
 
0.3%
4 1
 
0.1%
10 1
 
0.1%

금액
Real number (ℝ)

HIGH CORRELATION 

Distinct67
Distinct (%)9.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16176.521
Minimum6500
Maximum120000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.3 KiB
2024-01-10T05:48:55.885689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6500
5-th percentile10000
Q113000
median15000
Q317000
95-th percentile26560
Maximum120000
Range113500
Interquartile range (IQR)4000

Descriptive statistics

Standard deviation7134.8832
Coefficient of variation (CV)0.44106415
Kurtosis75.02549
Mean16176.521
Median Absolute Deviation (MAD)2000
Skewness6.5541065
Sum11436800
Variance50906558
MonotonicityNot monotonic
2024-01-10T05:48:56.002244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15000 79
 
11.2%
12000 76
 
10.7%
16000 62
 
8.8%
14000 52
 
7.4%
13000 49
 
6.9%
18000 25
 
3.5%
16800 24
 
3.4%
17000 22
 
3.1%
11000 22
 
3.1%
14800 18
 
2.5%
Other values (57) 278
39.3%
ValueCountFrequency (%)
6500 3
 
0.4%
8000 3
 
0.4%
8500 3
 
0.4%
9000 9
1.3%
9800 4
 
0.6%
10000 15
2.1%
10500 5
 
0.7%
10800 4
 
0.6%
11000 22
3.1%
11200 1
 
0.1%
ValueCountFrequency (%)
120000 1
 
0.1%
80000 1
 
0.1%
56000 1
 
0.1%
54000 1
 
0.1%
45000 1
 
0.1%
43000 1
 
0.1%
42000 1
 
0.1%
39000 1
 
0.1%
38000 2
0.3%
36000 3
0.4%

Interactions

2024-01-10T05:48:51.521281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:49.977019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.380006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.751996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:51.149390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:51.603832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.058812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.459418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.839151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:51.232084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:51.677787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.142083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.532928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.927206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:51.306333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:51.750287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.219008image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.604092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.999007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:51.376150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:51.825225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.296820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:50.676505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:51.072819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:51.448322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:48:56.083362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구입 월구분분류단가수량금액
연번1.0000.9890.2550.5120.1790.0000.135
구입 월0.9891.0000.3320.5410.1610.1680.211
구분0.2550.3321.0000.3840.4460.0690.253
분류0.5120.5410.3841.0000.3930.0880.329
단가0.1790.1610.4460.3931.0000.0620.865
수량0.0000.1680.0690.0880.0621.0000.904
금액0.1350.2110.2530.3290.8650.9041.000
2024-01-10T05:48:56.170679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분수량
구분1.0000.065
수량0.0651.000
2024-01-10T05:48:56.247399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구입 월분류단가금액구분수량
연번1.0000.9950.0330.0580.0510.1560.000
구입 월0.9951.0000.0320.0480.0400.1890.095
분류0.0330.0321.000-0.387-0.3760.2490.052
단가0.0580.048-0.3871.0000.9880.2230.040
금액0.0510.040-0.3760.9881.0000.1760.863
구분0.1560.1890.2490.2230.1761.0000.065
수량0.0000.0950.0520.0400.8630.0651.000

Missing values

2024-01-10T05:48:51.930509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:48:52.057985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구입 월구분분류도서명저자명발행자단가수량금액
011일반300파이어족의 재테크신현정+신영주쌤앤파커스15000115000
121일반300유튜브 이야기고몽성안당23000123000
231아동800비밀의 보석 가게 마석관.2히로시마레이코길벗스쿨12000112000
341아동800기묘한 모모한약방 3히로시마레이코미래엔아이세움12000112000
451아동800유령고양이 후쿠코.3히로시마레이코주니어김영사12000112000
561아동800블라인드.2잠뜰TV서울문화사12000112000
671일반900둠 : 재앙의 정치학니얼 퍼거슨21세기북스38000138000
781일반800무너진 다리천선란그래비티북스16000116000
891일반800루시 (저메이카 킨케이드장편소설)저메이카 킨케이드문학동네12000112000
9101일반400에이다, 당신이군요. 최초의 프로그래머시드니 파두아곰출판20000120000
연번구입 월구분분류도서명저자명발행자단가수량금액
69769811일반100감정의 발견마크 브래킷북라이프16800116800
69869911일반800빛과 물질에 관한 이론앤드루 포터문학동네13800113800
69970011일반800식물의 은밀한 감정디디에 반 코뵐라르트연금술사19500119500
70070111일반800닿고 싶다는 말전새벽김영사14800114800
70170211일반100비터스위트수전 케인알에이치코리아18000118000
70270311일반400감정의 뇌과학레오나르드 믈로디노프까치19000119000
70370411일반800무뎌진 감정이 말을 걸어올 때김소영책발전소X테라코타15800115800
70470511일반700영어 감정 표현 사전sam norris길벗이지톡27000127000
70570611일반800그림자밟기 여관의 괴담오시마 기요아키현대문학15500115500
70670711일반800우선 이것부터 먹고하라다 히카하빌리스15000115000