Overview

Dataset statistics

Number of variables: 10
Number of observations: 480
Missing cells: 1
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 40.4 KiB
Average record size in memory: 86.3 B
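
The two derived figures above can be cross-checked from the raw counts. The sketch below assumes that the missing-cell percentage is missing cells divided by (variables × observations) and that the average record size is total memory divided by the number of observations; both definitions and all variable names are illustrative assumptions. Because 40.4 KiB is already rounded, the result only approximates the reported 86.3 B.

```python
# Cross-check of the derived dataset statistics (a sketch, not the profiler's own code).
n_variables = 10
n_observations = 480
missing_cells = 1
total_size_kib = 40.4  # rounded value as reported

# Assumed definition: share of missing cells among all cells.
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Assumed definition: total memory divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"missing cells: {missing_pct:.3f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

A single missing cell out of 4,800 is well under 0.1%, which is why the report shows "< 0.1%" rather than a rounded zero.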

Variable types

Numeric: 5
Categorical: 2
Text: 3

Dataset

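Description: Purchase list of patron-requested books (희망도서) at Boryeong Municipal Library (보령시립도서관), Chungcheongnam-do, for 2023. It provides the purchase month (구입 월), category (구분: adult/children), classification (분류), book title (도서명), author (저자명), publisher (발행자), unit price (단가), quantity (수량), and amount (금액). (The former Boryeong Central Library has been renamed "Boryeong Municipal Library".)
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=97&beforeMenuCd=DOM_000000201001001000&publicdatapk=15104393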

Alerts

연번 is highly overall correlated with 구입 월 (High correlation)
구입 월 is highly overall correlated with 연번 (High correlation)
단가 is highly overall correlated with 금액 (High correlation)
금액 is highly overall correlated with 단가 and 1 other field (High correlation)
수량 is highly overall correlated with 금액 (High correlation)
구분 is highly imbalanced (58.8%) (Imbalance)
수량 is highly imbalanced (93.5%) (Imbalance)
연번 has unique values (Unique)
분류 has 6 (1.2%) zeros (Zeros)
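
The two imbalance percentages can be reproduced from the category counts listed later in the report. The sketch below assumes the score is one minus the Shannon entropy of the class distribution, normalised by the maximum entropy for that number of classes; this formula is an assumption that happens to match the reported 58.8% and 93.5%, and the function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform distribution, near 1 for a dominant class.

    Assumed formula; it reproduces the values shown in the alerts above.
    """
    total = sum(counts)
    k = len(counts)
    if k <= 1:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Counts taken from the 구분 and 수량 frequency tables in this report.
gubun_counts = [400, 60, 13, 7]       # 일반, 아동, 유아, 청소년
quantity_counts = [472, 4, 2, 1, 1]   # quantities 1, 2, 3, 10, 4

gubun_score = imbalance_score(gubun_counts)
quantity_score = imbalance_score(quantity_counts)
print(f"구분: {gubun_score:.1%}, 수량: {quantity_score:.1%}")
```

Under this definition 수량 scores much higher than 구분 because 472 of its 480 rows share a single value.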

Reproduction

Analysis started: 2024-01-09 20:49:04.831033
Analysis finished: 2024-01-09 20:49:07.548115
Duration: 2.72 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 480
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 240.5
Minimum: 1
Maximum: 480
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 24.95
Q1: 120.75
median: 240.5
Q3: 360.25
95-th percentile: 456.05
Maximum: 480
Range: 479
Interquartile range (IQR): 239.5

Descriptive statistics

Standard deviation: 138.70833
Coefficient of variation (CV): 0.5767498
Kurtosis: -1.2
Mean: 240.5
Median Absolute Deviation (MAD): 120
Skewness: 0
Sum: 115440
Variance: 19240
Monotonicity: Strictly increasing
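
These figures are what one would expect if 연번 is simply the row counter 1 to 480. The sketch below rebuilds that sequence (an assumption consistent with the minimum, maximum, distinct count and monotonicity above) and recomputes the statistics with the standard library. The `percentile` helper is a hypothetical linear-interpolation implementation, and the variance uses the n − 1 denominator.

```python
import statistics

ids = list(range(1, 481))  # assumed: 연번 is the sequence 1..480

def percentile(sorted_values, q):
    """Linear-interpolation percentile at fraction q in [0, 1]."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    return sorted_values[lower] + (pos - lower) * (sorted_values[upper] - sorted_values[lower])

mean = statistics.mean(ids)
variance = statistics.variance(ids)   # sample variance (n - 1 denominator)
std_dev = statistics.stdev(ids)
cv = std_dev / mean                   # coefficient of variation
p05 = percentile(ids, 0.05)
q1 = percentile(ids, 0.25)
q3 = percentile(ids, 0.75)
strictly_increasing = all(a < b for a, b in zip(ids, ids[1:]))

print(mean, variance, round(std_dev, 5), round(cv, 7))
print(p05, q1, q3, q3 - q1, strictly_increasing)
```

Because the column is an evenly spaced counter, its skewness is 0 and its distribution is flat, which also explains the negative kurtosis.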
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.2%
242 | 1 | 0.2%
330 | 1 | 0.2%
329 | 1 | 0.2%
328 | 1 | 0.2%
327 | 1 | 0.2%
326 | 1 | 0.2%
325 | 1 | 0.2%
324 | 1 | 0.2%
323 | 1 | 0.2%
Other values (470) | 470 | 97.9%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
480 1
0.2%
479 1
0.2%
478 1
0.2%
477 1
0.2%
476 1
0.2%
475 1
0.2%
474 1
0.2%
473 1
0.2%
472 1
0.2%
471 1
0.2%

구입 월
Real number (ℝ)

HIGH CORRELATION 

Distinct: 9
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.5895833
Minimum: 1
Maximum: 10
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
median: 4
Q3: 6
95-th percentile: 10
Maximum: 10
Range: 9
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 2.503504
Coefficient of variation (CV): 0.54547522
Kurtosis: -0.5342734
Mean: 4.5895833
Median Absolute Deviation (MAD): 2
Skewness: 0.43991523
Sum: 2203
Variance: 6.2675322
Monotonicity: Increasing
Histogram with fixed size bins (bins=9)
Value | Count | Frequency (%)
4 | 107 | 22.3%
5 | 68 | 14.2%
2 | 65 | 13.5%
1 | 57 | 11.9%
7 | 47 | 9.8%
8 | 39 | 8.1%
3 | 33 | 6.9%
6 | 33 | 6.9%
10 | 31 | 6.5%
ValueCountFrequency (%)
1 57
11.9%
2 65
13.5%
3 33
 
6.9%
4 107
22.3%
5 68
14.2%
6 33
 
6.9%
7 47
9.8%
8 39
 
8.1%
10 31
 
6.5%
ValueCountFrequency (%)
10 31
 
6.5%
8 39
 
8.1%
7 47
9.8%
6 33
 
6.9%
5 68
14.2%
4 107
22.3%
3 33
 
6.9%
2 65
13.5%
1 57
11.9%
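
The summary statistics for 구입 월 can be recomputed from its frequency table alone. The sketch below is a minimal cross-check using the counts listed above; the variable names are illustrative, and the variance uses the n − 1 (sample) denominator, which is an assumption that matches the reported 6.2675322.

```python
import math

# Purchase month -> number of rows, copied from the frequency table above.
month_counts = {1: 57, 2: 65, 3: 33, 4: 107, 5: 68, 6: 33, 7: 47, 8: 39, 10: 31}

n = sum(month_counts.values())
total = sum(month * count for month, count in month_counts.items())
mean = total / n

# Sample variance from grouped data: sum of count * squared deviation, over n - 1.
variance = sum(count * (month - mean) ** 2 for month, count in month_counts.items()) / (n - 1)
std_dev = math.sqrt(variance)

print(n, total, round(mean, 7), round(variance, 7), round(std_dev, 6))
```

Month 9 does not appear in the table at all, which is why the column has 9 distinct values over a range of 1 to 10.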

구분
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.9 KiB

일반: 400
아동: 60
유아: 13
청소년: 7

Length

Max length: 3
Median length: 2
Mean length: 2.0145833
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반
2nd row: 일반
3rd row: 아동
4th row: 아동
5th row: 일반

Common Values

Value | Count | Frequency (%)
일반 | 400 | 83.3%
아동 | 60 | 12.5%
유아 | 13 | 2.7%
청소년 | 7 | 1.5%

Length

Histogram of lengths of the category


분류
Real number (ℝ)

ZEROS 

Distinct: 10
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 587.70833
Minimum: 0
Maximum: 900
Zeros: 6
Zeros (%): 1.2%
Negative: 0
Negative (%): 0.0%
Memory size: 4.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 100
Q1: 300
median: 800
Q3: 800
95-th percentile: 900
Maximum: 900
Range: 900
Interquartile range (IQR): 500

Descriptive statistics

Standard deviation: 269.21939
Coefficient of variation (CV): 0.45808333
Kurtosis: -1.1461191
Mean: 587.70833
Median Absolute Deviation (MAD): 100
Skewness: -0.62291625
Sum: 282100
Variance: 72479.08
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
800 | 238 | 49.6%
300 | 93 | 19.4%
100 | 44 | 9.2%
500 | 34 | 7.1%
900 | 25 | 5.2%
600 | 18 | 3.8%
400 | 16 | 3.3%
0 | 6 | 1.2%
700 | 3 | 0.6%
200 | 3 | 0.6%
ValueCountFrequency (%)
0 6
 
1.2%
100 44
 
9.2%
200 3
 
0.6%
300 93
 
19.4%
400 16
 
3.3%
500 34
 
7.1%
600 18
 
3.8%
700 3
 
0.6%
800 238
49.6%
900 25
 
5.2%
ValueCountFrequency (%)
900 25
 
5.2%
800 238
49.6%
700 3
 
0.6%
600 18
 
3.8%
500 34
 
7.1%
400 16
 
3.3%
300 93
 
19.4%
200 3
 
0.6%
100 44
 
9.2%
0 6
 
1.2%

도서명
Text

Distinct: 479
Distinct (%): 99.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.9 KiB

Length

Max length: 67
Median length: 28
Mean length: 12.185417
Min length: 2

Characters and Unicode

Total characters: 5849
Distinct characters: 636
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
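
As an illustration of these character properties, the sketch below counts Unicode general categories for one of the sample titles using Python's standard `unicodedata` module. The two-letter codes correspond to the category names used in this report (`Lo` is Other Letter, `Zs` is Space Separator, `Po` is Other Punctuation, `Nd` is Decimal Number). Script and block lookups are not part of the standard library and are left out here.

```python
import unicodedata
from collections import Counter

# One of the sample book titles shown in this report.
title = "테일즈런너 수학킹왕짱!.30"

# General category of every character, e.g. 'Lo' for a Hangul syllable.
category_counts = Counter(unicodedata.category(ch) for ch in title)

for category, count in category_counts.most_common():
    print(category, count)
```

Summing such counts over all 480 titles is, in spirit, how a per-category table like the one in this section could be produced.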

Unique

Unique: 478
Unique (%): 99.6%

Sample

1st row: 이것이 모든 것을 바꾼다
2nd row: 삶이라는 우주를 건너는 너에게
3rd row: 테일즈런너 수학킹왕짱!.30
4th row: 최재천의 동물대탐험.1
5th row: 비트코인 지혜의 족보(개정판)
ValueCountFrequency (%)
29
 
1.8%
위한 10
 
0.6%
1 10
 
0.6%
2 9
 
0.6%
모든 8
 
0.5%
이상한 7
 
0.4%
7
 
0.4%
찰리 7
 
0.4%
수업 6
 
0.4%
한국사 6
 
0.4%
Other values (1224) 1517
93.9%

Most occurring characters

ValueCountFrequency (%)
1138
 
19.5%
144
 
2.5%
102
 
1.7%
81
 
1.4%
78
 
1.3%
75
 
1.3%
71
 
1.2%
63
 
1.1%
57
 
1.0%
1 55
 
0.9%
Other values (626) 3985
68.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4264
72.9%
Space Separator 1138
 
19.5%
Decimal Number 191
 
3.3%
Other Punctuation 73
 
1.2%
Uppercase Letter 53
 
0.9%
Open Punctuation 37
 
0.6%
Lowercase Letter 37
 
0.6%
Close Punctuation 36
 
0.6%
Dash Punctuation 17
 
0.3%
Math Symbol 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
144
 
3.4%
102
 
2.4%
81
 
1.9%
78
 
1.8%
75
 
1.8%
71
 
1.7%
63
 
1.5%
57
 
1.3%
51
 
1.2%
49
 
1.1%
Other values (568) 3493
81.9%
Uppercase Letter
ValueCountFrequency (%)
P 8
15.1%
T 8
15.1%
G 7
13.2%
Z 4
 
7.5%
M 4
 
7.5%
I 3
 
5.7%
S 3
 
5.7%
A 2
 
3.8%
V 2
 
3.8%
W 2
 
3.8%
Other values (9) 10
18.9%
Lowercase Letter
ValueCountFrequency (%)
r 6
16.2%
e 5
13.5%
o 4
10.8%
n 4
10.8%
t 4
10.8%
i 3
8.1%
s 2
 
5.4%
d 2
 
5.4%
c 1
 
2.7%
y 1
 
2.7%
Other values (5) 5
13.5%
Decimal Number
ValueCountFrequency (%)
1 55
28.8%
2 35
18.3%
3 24
12.6%
0 22
 
11.5%
6 15
 
7.9%
4 11
 
5.8%
5 10
 
5.2%
7 9
 
4.7%
9 8
 
4.2%
8 2
 
1.0%
Other Punctuation
ValueCountFrequency (%)
. 25
34.2%
: 21
28.8%
, 13
17.8%
! 10
 
13.7%
' 2
 
2.7%
% 1
 
1.4%
& 1
 
1.4%
Open Punctuation
ValueCountFrequency (%)
( 36
97.3%
[ 1
 
2.7%
Close Punctuation
ValueCountFrequency (%)
) 35
97.2%
] 1
 
2.8%
Space Separator
ValueCountFrequency (%)
1138
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Math Symbol
ValueCountFrequency (%)
+ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4264
72.9%
Common 1495
 
25.6%
Latin 90
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
144
 
3.4%
102
 
2.4%
81
 
1.9%
78
 
1.8%
75
 
1.8%
71
 
1.7%
63
 
1.5%
57
 
1.3%
51
 
1.2%
49
 
1.1%
Other values (568) 3493
81.9%
Latin
ValueCountFrequency (%)
P 8
 
8.9%
T 8
 
8.9%
G 7
 
7.8%
r 6
 
6.7%
e 5
 
5.6%
o 4
 
4.4%
n 4
 
4.4%
Z 4
 
4.4%
M 4
 
4.4%
t 4
 
4.4%
Other values (24) 36
40.0%
Common
ValueCountFrequency (%)
1138
76.1%
1 55
 
3.7%
( 36
 
2.4%
2 35
 
2.3%
) 35
 
2.3%
. 25
 
1.7%
3 24
 
1.6%
0 22
 
1.5%
: 21
 
1.4%
- 17
 
1.1%
Other values (14) 87
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4264
72.9%
ASCII 1585
 
27.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1138
71.8%
1 55
 
3.5%
( 36
 
2.3%
2 35
 
2.2%
) 35
 
2.2%
. 25
 
1.6%
3 24
 
1.5%
0 22
 
1.4%
: 21
 
1.3%
- 17
 
1.1%
Other values (48) 177
 
11.2%
Hangul
ValueCountFrequency (%)
144
 
3.4%
102
 
2.4%
81
 
1.9%
78
 
1.8%
75
 
1.8%
71
 
1.7%
63
 
1.5%
57
 
1.3%
51
 
1.2%
49
 
1.1%
Other values (568) 3493
81.9%

저자명
Text

Distinct: 415
Distinct (%): 86.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.9 KiB

Length

Max length: 21
Median length: 3
Mean length: 4.6833333
Min length: 2

Characters and Unicode

Total characters: 2248
Distinct characters: 365
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 374
Unique (%): 77.9%

Sample

1st row: 나오미 클라인
2nd row: 김민형
3rd row: 이강안
4th row: 황혜영
5th row: 오태민
ValueCountFrequency (%)
20
 
3.0%
레온 7
 
1.0%
이미지 7
 
1.0%
히로시마 6
 
0.9%
레이코 6
 
0.9%
설민석 5
 
0.7%
교고쿠 4
 
0.6%
4
 
0.6%
윤병무 4
 
0.6%
김지우 4
 
0.6%
Other values (540) 604
90.0%

Most occurring characters

ValueCountFrequency (%)
191
 
8.5%
79
 
3.5%
53
 
2.4%
49
 
2.2%
46
 
2.0%
39
 
1.7%
34
 
1.5%
28
 
1.2%
28
 
1.2%
26
 
1.2%
Other values (355) 1675
74.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1996
88.8%
Space Separator 191
 
8.5%
Uppercase Letter 32
 
1.4%
Other Punctuation 13
 
0.6%
Math Symbol 9
 
0.4%
Lowercase Letter 5
 
0.2%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
79
 
4.0%
53
 
2.7%
49
 
2.5%
46
 
2.3%
39
 
2.0%
34
 
1.7%
28
 
1.4%
28
 
1.4%
26
 
1.3%
24
 
1.2%
Other values (324) 1590
79.7%
Uppercase Letter
ValueCountFrequency (%)
T 4
12.5%
S 3
 
9.4%
N 3
 
9.4%
K 2
 
6.2%
I 2
 
6.2%
O 2
 
6.2%
J 2
 
6.2%
V 2
 
6.2%
C 2
 
6.2%
D 1
 
3.1%
Other values (9) 9
28.1%
Math Symbol
ValueCountFrequency (%)
< 3
33.3%
> 3
33.3%
+ 2
22.2%
~ 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 7
53.8%
. 5
38.5%
! 1
 
7.7%
Lowercase Letter
ValueCountFrequency (%)
v 2
40.0%
t 2
40.0%
n 1
20.0%
Space Separator
ValueCountFrequency (%)
191
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1996
88.8%
Common 215
 
9.6%
Latin 37
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
79
 
4.0%
53
 
2.7%
49
 
2.5%
46
 
2.3%
39
 
2.0%
34
 
1.7%
28
 
1.4%
28
 
1.4%
26
 
1.3%
24
 
1.2%
Other values (324) 1590
79.7%
Latin
ValueCountFrequency (%)
T 4
 
10.8%
S 3
 
8.1%
N 3
 
8.1%
K 2
 
5.4%
I 2
 
5.4%
O 2
 
5.4%
v 2
 
5.4%
t 2
 
5.4%
J 2
 
5.4%
V 2
 
5.4%
Other values (12) 13
35.1%
Common
ValueCountFrequency (%)
191
88.8%
, 7
 
3.3%
. 5
 
2.3%
< 3
 
1.4%
> 3
 
1.4%
- 2
 
0.9%
+ 2
 
0.9%
! 1
 
0.5%
~ 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1996
88.8%
ASCII 252
 
11.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
191
75.8%
, 7
 
2.8%
. 5
 
2.0%
T 4
 
1.6%
S 3
 
1.2%
< 3
 
1.2%
> 3
 
1.2%
N 3
 
1.2%
- 2
 
0.8%
K 2
 
0.8%
Other values (21) 29
 
11.5%
Hangul
ValueCountFrequency (%)
79
 
4.0%
53
 
2.7%
49
 
2.5%
46
 
2.3%
39
 
2.0%
34
 
1.7%
28
 
1.4%
28
 
1.4%
26
 
1.3%
24
 
1.2%
Other values (324) 1590
79.7%

발행자
Text

Distinct: 292
Distinct (%): 61.0%
Missing: 1
Missing (%): 0.2%
Memory size: 3.9 KiB

Length

Max length: 10
Median length: 9
Mean length: 4.1127349
Min length: 1

Characters and Unicode

Total characters: 1970
Distinct characters: 343
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 203
Unique (%): 42.4%

Sample

1st row: 열린책들
2nd row: 웅진지식하우스
3rd row: 거북이북스
4th row: 다산어린이
5th row: 케이디북스
Value | Count | Frequency (%)
문학동네 | 11 | 2.3%
창비 | 11 | 2.3%
열린책들 | 9 | 1.9%
밝은미래 | 7 | 1.4%
위즈덤하우스 | 7 | 1.4%
민음사 | 7 | 1.4%
김영사 | 6 | 1.2%
주니어김영사 | 6 | 1.2%
손안의책 | 6 | 1.2%
미디어숲 | 5 | 1.0%
Other values (285) | 408 | 84.5%

Most occurring characters

ValueCountFrequency (%)
104
 
5.3%
71
 
3.6%
56
 
2.8%
53
 
2.7%
45
 
2.3%
41
 
2.1%
37
 
1.9%
34
 
1.7%
32
 
1.6%
31
 
1.6%
Other values (333) 1466
74.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1939
98.4%
Uppercase Letter 14
 
0.7%
Lowercase Letter 7
 
0.4%
Decimal Number 6
 
0.3%
Space Separator 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
104
 
5.4%
71
 
3.7%
56
 
2.9%
53
 
2.7%
45
 
2.3%
41
 
2.1%
37
 
1.9%
34
 
1.8%
32
 
1.7%
31
 
1.6%
Other values (313) 1435
74.0%
Uppercase Letter
ValueCountFrequency (%)
B 4
28.6%
O 2
14.3%
S 2
14.3%
Z 1
 
7.1%
I 1
 
7.1%
F 1
 
7.1%
K 1
 
7.1%
E 1
 
7.1%
P 1
 
7.1%
Lowercase Letter
ValueCountFrequency (%)
b 1
14.3%
s 1
14.3%
d 1
14.3%
n 1
14.3%
e 1
14.3%
i 1
14.3%
r 1
14.3%
Decimal Number
ValueCountFrequency (%)
2 3
50.0%
1 2
33.3%
6 1
 
16.7%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1939
98.4%
Latin 21
 
1.1%
Common 10
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
104
 
5.4%
71
 
3.7%
56
 
2.9%
53
 
2.7%
45
 
2.3%
41
 
2.1%
37
 
1.9%
34
 
1.8%
32
 
1.7%
31
 
1.6%
Other values (313) 1435
74.0%
Latin
ValueCountFrequency (%)
B 4
19.0%
O 2
 
9.5%
S 2
 
9.5%
b 1
 
4.8%
Z 1
 
4.8%
I 1
 
4.8%
F 1
 
4.8%
s 1
 
4.8%
d 1
 
4.8%
n 1
 
4.8%
Other values (6) 6
28.6%
Common
ValueCountFrequency (%)
4
40.0%
2 3
30.0%
1 2
20.0%
6 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1939
98.4%
ASCII 31
 
1.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
104
 
5.4%
71
 
3.7%
56
 
2.9%
53
 
2.7%
45
 
2.3%
41
 
2.1%
37
 
1.9%
34
 
1.8%
32
 
1.7%
31
 
1.6%
Other values (313) 1435
74.0%
ASCII
ValueCountFrequency (%)
4
 
12.9%
B 4
 
12.9%
2 3
 
9.7%
O 2
 
6.5%
S 2
 
6.5%
1 2
 
6.5%
b 1
 
3.2%
6 1
 
3.2%
Z 1
 
3.2%
I 1
 
3.2%
Other values (10) 10
32.3%

단가
Real number (ℝ)

HIGH CORRELATION 

Distinct: 70
Distinct (%): 14.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17665.417
Minimum: 5000
Maximum: 55500
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.3 KiB

Quantile statistics

Minimum: 5000
5-th percentile: 12000
Q1: 14000
median: 16000
Q3: 18800
95-th percentile: 32000
Maximum: 55500
Range: 50500
Interquartile range (IQR): 4800

Descriptive statistics

Standard deviation: 6337.4431
Coefficient of variation (CV): 0.35874858
Kurtosis: 7.3841125
Mean: 17665.417
Median Absolute Deviation (MAD): 2000
Skewness: 2.3336556
Sum: 8479400
Variance: 40163186
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
15000 | 54 | 11.2%
16000 | 40 | 8.3%
13000 | 36 | 7.5%
16800 | 28 | 5.8%
12000 | 27 | 5.6%
14000 | 26 | 5.4%
17000 | 26 | 5.4%
18000 | 25 | 5.2%
22000 | 17 | 3.5%
20000 | 13 | 2.7%
Other values (60) | 188 | 39.2%
ValueCountFrequency (%)
5000 1
 
0.2%
5800 1
 
0.2%
6000 1
 
0.2%
7000 1
 
0.2%
7200 1
 
0.2%
9000 3
0.6%
10000 1
 
0.2%
10500 1
 
0.2%
11000 4
0.8%
11800 2
0.4%
ValueCountFrequency (%)
55500 1
 
0.2%
51000 1
 
0.2%
48000 1
 
0.2%
45000 1
 
0.2%
40000 3
0.6%
39000 3
0.6%
38500 1
 
0.2%
38000 2
0.4%
37000 1
 
0.2%
36000 2
0.4%

수량
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 5
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.9 KiB

1: 472
2: 4
3: 2
10: 1
4: 1

Length

Max length: 2
Median length: 1
Mean length: 1.0020833
Min length: 1

Unique

Unique: 2
Unique (%): 0.4%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 472 | 98.3%
2 | 4 | 0.8%
3 | 2 | 0.4%
10 | 1 | 0.2%
4 | 1 | 0.2%

Length

Histogram of lengths of the category


금액
Real number (ℝ)

HIGH CORRELATION 

Distinct: 74
Distinct (%): 15.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 18127.5
Minimum: 5000
Maximum: 58000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.3 KiB

Quantile statistics

Minimum: 5000
5-th percentile: 12000
Q1: 14000
median: 16000
Q3: 19000
95-th percentile: 33600
Maximum: 58000
Range: 53000
Interquartile range (IQR): 5000

Descriptive statistics

Standard deviation: 7189.8062
Coefficient of variation (CV): 0.39662426
Kurtosis: 7.7953777
Mean: 18127.5
Median Absolute Deviation (MAD): 2000
Skewness: 2.4907103
Sum: 8701200
Variance: 51693313
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
15000 | 54 | 11.2%
16000 | 40 | 8.3%
13000 | 33 | 6.9%
12000 | 27 | 5.6%
17000 | 26 | 5.4%
14000 | 26 | 5.4%
16800 | 26 | 5.4%
18000 | 24 | 5.0%
22000 | 16 | 3.3%
20000 | 13 | 2.7%
Other values (64) | 195 | 40.6%
ValueCountFrequency (%)
5000 1
 
0.2%
6000 1
 
0.2%
7000 1
 
0.2%
7200 1
 
0.2%
9000 3
 
0.6%
10000 1
 
0.2%
10500 1
 
0.2%
11000 4
 
0.8%
11800 2
 
0.4%
12000 27
5.6%
ValueCountFrequency (%)
58000 1
 
0.2%
55500 1
 
0.2%
54000 1
 
0.2%
52000 1
 
0.2%
51000 1
 
0.2%
48000 1
 
0.2%
45000 1
 
0.2%
44000 1
 
0.2%
40000 3
0.6%
39000 4
0.8%

Interactions

[Interaction plots omitted: 25 figures.]

Correlations

 | 연번 | 구입 월 | 구분 | 분류 | 단가 | 수량 | 금액
연번 | 1.000 | 0.921 | 0.323 | 0.437 | 0.328 | 0.061 | 0.261
구입 월 | 0.921 | 1.000 | 0.301 | 0.267 | 0.180 | 0.140 | 0.084
구분 | 0.323 | 0.301 | 1.000 | 0.368 | 0.392 | 0.163 | 0.419
분류 | 0.437 | 0.267 | 0.368 | 1.000 | 0.526 | 0.376 | 0.562
단가 | 0.328 | 0.180 | 0.392 | 0.526 | 1.000 | 0.280 | 0.989
수량 | 0.061 | 0.140 | 0.163 | 0.376 | 0.280 | 1.000 | 0.842
금액 | 0.261 | 0.084 | 0.419 | 0.562 | 0.989 | 0.842 | 1.000

 | 구분 | 수량
구분 | 1.000 | 0.133
수량 | 0.133 | 1.000

 | 연번 | 구입 월 | 분류 | 단가 | 금액 | 구분 | 수량
연번 | 1.000 | 0.990 | 0.038 | 0.045 | 0.044 | 0.197 | 0.024
구입 월 | 0.990 | 1.000 | 0.043 | 0.030 | 0.029 | 0.195 | 0.080
분류 | 0.038 | 0.043 | 1.000 | -0.321 | -0.303 | 0.227 | 0.164
단가 | 0.045 | 0.030 | -0.321 | 1.000 | 0.957 | 0.242 | 0.104
금액 | 0.044 | 0.029 | -0.303 | 0.957 | 1.000 | 0.262 | 0.502
구분 | 0.197 | 0.195 | 0.227 | 0.242 | 0.262 | 1.000 | 0.133
수량 | 0.024 | 0.080 | 0.164 | 0.104 | 0.502 | 0.133 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

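(index) | 연번 | 구입 월 | 구분 | 분류 | 도서명 | 저자명 | 발행자 | 단가 | 수량 | 금액
0 | 1 | 1 | 일반 | 300 | 이것이 모든 것을 바꾼다 | 나오미 클라인 | 열린책들 | 33000 | 1 | 33000
1 | 2 | 1 | 일반 | 100 | 삶이라는 우주를 건너는 너에게 | 김민형 | 웅진지식하우스 | 16800 | 1 | 16800
2 | 3 | 1 | 아동 | 400 | 테일즈런너 수학킹왕짱!.30 | 이강안 | 거북이북스 | 10500 | 1 | 10500
3 | 4 | 1 | 아동 | 400 | 최재천의 동물대탐험.1 | 황혜영 | 다산어린이 | 15000 | 1 | 15000
4 | 5 | 1 | 일반 | 300 | 비트코인 지혜의 족보(개정판) | 오태민 | 케이디북스 | 24500 | 1 | 24500
5 | 6 | 1 | 아동 | 800 | 소능력자들.7 | 김하연 | 마술피리 | 12000 | 1 | 12000
6 | 7 | 1 | 일반 | 500 | 오지라퍼 선생님의 초등 학부모 수업 | 김현경 | 책소유 | 16000 | 1 | 16000
7 | 8 | 1 | 일반 | 100 | 저 많은 돼지고기는 어디서 왔을까 | 후루사와 고유 | 나무를심는사람들 | 13500 | 1 | 13500
8 | 9 | 1 | 일반 | 300 | 마흔 살의 정리법 | 사카오카 요코 | 이아소 | 12000 | 1 | 12000
9 | 10 | 1 | 일반 | 300 | 박 회계사의 사업보고서 분석법 | 박동흠 | 부크온 | 22000 | 1 | 22000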
연번구입 월구분분류도서명저자명발행자단가수량금액
47047110일반800해저도시 타코야키김청귤래빗홀15000115000
47147210일반800치유를 파는 찻집모리사와 아키오북플라자16800116800
47247310일반300나폴레온 힐 성공의 법칙나폴레온 힐중앙경제평론사32000132000
47347410일반800쓰쿠모주쿠마이조 오타로b16000116000
47447510일반800W의 비극나쓰키 시즈코손안의책500015000
47547610일반300래리 윌리엄스 좋은 주식은 때가 있다래리 윌리엄스페이지2북스18000118000
47647710일반600드루이드가 되고 싶은 당신을 위한 안내서프로개드루이드아일랜드24000124000
47747810일반300현대 사회문제론현외성창지사27000127000
47847910일반800그곳엔 평화가 있다지미이야기모란단14000114000
47948010일반800염매처럼 신들리는 것미쓰다 신조비채14000114000
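
The strong correlation of 금액 with both 단가 and 수량 is what one would expect if the amount is the unit price multiplied by the quantity. The sketch below checks that relationship on the first five sample rows only; the relationship itself is an assumption suggested by the column names, it has not been verified here against all 480 rows, and the variable names are illustrative.

```python
# (단가, 수량, 금액) for the first five sample rows of this report.
sample_rows = [
    (33000, 1, 33000),
    (16800, 1, 16800),
    (10500, 1, 10500),
    (15000, 1, 15000),
    (24500, 1, 24500),
]

# Rows where the assumed rule 금액 == 단가 * 수량 does not hold.
mismatches = [
    (price, quantity, amount)
    for price, quantity, amount in sample_rows
    if price * quantity != amount
]

print(f"{len(sample_rows) - len(mismatches)} of {len(sample_rows)} rows satisfy amount == price * quantity")
```

With 472 of 480 rows having a quantity of 1, 금액 equals 단가 for almost every record, so one of the two columns carries little extra information for modelling.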