Overview

Dataset statistics

Number of variables10
Number of observations575
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory48.4 KiB
Average record size in memory86.2 B

Variable types

Numeric5
Categorical2
Text3

Dataset

Description충청남도 보령시립도서관의 2023년도 희망도서 구입목록 데이터입니다. 구입 월, 구분(성인/아동), 분류, 도서명, 저자명, 발행자, 단가, 수량, 금액 정보를 제공합니다. (기존 보령시 중앙도서관의 명칭이 "보령시립도서관"으로 변경되었습니다.)
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=97&beforeMenuCd=DOM_000000201001001000&publicdatapk=15104393

Alerts

연번 is highly overall correlated with 구입 월High correlation
구입 월 is highly overall correlated with 연번High correlation
단가 is highly overall correlated with 금액High correlation
금액 is highly overall correlated with 단가 and 1 other fieldsHigh correlation
수량 is highly overall correlated with 금액High correlation
수량 is highly imbalanced (94.4%)Imbalance
연번 has unique valuesUnique
분류 has 7 (1.2%) zerosZeros

Reproduction

Analysis started2024-01-09 20:49:34.096959
Analysis finished2024-01-09 20:49:37.215549
Duration3.12 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct575
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean288
Minimum1
Maximum575
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.2 KiB
2024-01-10T05:49:37.276669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile29.7
Q1144.5
median288
Q3431.5
95-th percentile546.3
Maximum575
Range574
Interquartile range (IQR)287

Descriptive statistics

Standard deviation166.13248
Coefficient of variation (CV)0.57684888
Kurtosis-1.2
Mean288
Median Absolute Deviation (MAD)144
Skewness0
Sum165600
Variance27600
MonotonicityStrictly increasing
2024-01-10T05:49:37.615589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
2 1
 
0.2%
381 1
 
0.2%
382 1
 
0.2%
383 1
 
0.2%
384 1
 
0.2%
385 1
 
0.2%
386 1
 
0.2%
387 1
 
0.2%
388 1
 
0.2%
Other values (565) 565
98.3%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
575 1
0.2%
574 1
0.2%
573 1
0.2%
572 1
0.2%
571 1
0.2%
570 1
0.2%
569 1
0.2%
568 1
0.2%
567 1
0.2%
566 1
0.2%

구입 월
Real number (ℝ)

HIGH CORRELATION 

Distinct10
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.6486957
Minimum1
Maximum11
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.2 KiB
2024-01-10T05:49:37.722996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q38
95-th percentile11
Maximum11
Range10
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.3026851
Coefficient of variation (CV)0.58468102
Kurtosis-1.078385
Mean5.6486957
Median Absolute Deviation (MAD)2
Skewness0.3756868
Sum3248
Variance10.907729
MonotonicityIncreasing
2024-01-10T05:49:37.815236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
4 107
18.6%
11 95
16.5%
5 68
11.8%
2 65
11.3%
1 57
9.9%
7 47
8.2%
8 39
 
6.8%
3 33
 
5.7%
6 33
 
5.7%
10 31
 
5.4%
ValueCountFrequency (%)
1 57
9.9%
2 65
11.3%
3 33
 
5.7%
4 107
18.6%
5 68
11.8%
6 33
 
5.7%
7 47
8.2%
8 39
 
6.8%
10 31
 
5.4%
11 95
16.5%
ValueCountFrequency (%)
11 95
16.5%
10 31
 
5.4%
8 39
 
6.8%
7 47
8.2%
6 33
 
5.7%
5 68
11.8%
4 107
18.6%
3 33
 
5.7%
2 65
11.3%
1 57
9.9%

구분
Categorical

Distinct4
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size4.6 KiB
일반
436 
아동
94 
유아
 
34
청소년
 
11

Length

Max length3
Median length2
Mean length2.0191304
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row아동
4th row아동
5th row일반

Common Values

ValueCountFrequency (%)
일반 436
75.8%
아동 94
 
16.3%
유아 34
 
5.9%
청소년 11
 
1.9%

Length

2024-01-10T05:49:37.923172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:49:38.012321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 436
75.8%
아동 94
 
16.3%
유아 34
 
5.9%
청소년 11
 
1.9%

분류
Real number (ℝ)

ZEROS 

Distinct10
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean593.56522
Minimum0
Maximum900
Zeros7
Zeros (%)1.2%
Negative0
Negative (%)0.0%
Memory size5.2 KiB
2024-01-10T05:49:38.098832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile100
Q1300
median800
Q3800
95-th percentile800
Maximum900
Range900
Interquartile range (IQR)500

Descriptive statistics

Standard deviation270.52083
Coefficient of variation (CV)0.45575586
Kurtosis-1.090209
Mean593.56522
Median Absolute Deviation (MAD)0
Skewness-0.68402756
Sum341300
Variance73181.518
MonotonicityNot monotonic
2024-01-10T05:49:38.194718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
800 299
52.0%
300 104
 
18.1%
100 56
 
9.7%
500 36
 
6.3%
900 28
 
4.9%
400 20
 
3.5%
600 19
 
3.3%
0 7
 
1.2%
700 3
 
0.5%
200 3
 
0.5%
ValueCountFrequency (%)
0 7
 
1.2%
100 56
 
9.7%
200 3
 
0.5%
300 104
 
18.1%
400 20
 
3.5%
500 36
 
6.3%
600 19
 
3.3%
700 3
 
0.5%
800 299
52.0%
900 28
 
4.9%
ValueCountFrequency (%)
900 28
 
4.9%
800 299
52.0%
700 3
 
0.5%
600 19
 
3.3%
500 36
 
6.3%
400 20
 
3.5%
300 104
 
18.1%
200 3
 
0.5%
100 56
 
9.7%
0 7
 
1.2%
Distinct574
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size4.6 KiB
2024-01-10T05:49:38.522823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length28
Mean length11.946087
Min length2

Characters and Unicode

Total characters6869
Distinct characters677
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique573 ?
Unique (%)99.7%

Sample

1st row이것이 모든 것을 바꾼다
2nd row삶이라는 우주를 건너는 너에게
3rd row테일즈런너 수학킹왕짱!.30
4th row최재천의 동물대탐험.1
5th row비트코인 지혜의 족보(개정판)
ValueCountFrequency (%)
29
 
1.5%
2 15
 
0.8%
1 14
 
0.7%
위한 12
 
0.6%
산타 9
 
0.5%
9
 
0.5%
모든 8
 
0.4%
나는 7
 
0.4%
이상한 7
 
0.4%
찰리 7
 
0.4%
Other values (1421) 1801
93.9%
2024-01-10T05:49:39.000084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1345
 
19.6%
162
 
2.4%
119
 
1.7%
97
 
1.4%
87
 
1.3%
87
 
1.3%
85
 
1.2%
72
 
1.0%
70
 
1.0%
1 62
 
0.9%
Other values (667) 4683
68.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5038
73.3%
Space Separator 1345
 
19.6%
Decimal Number 222
 
3.2%
Other Punctuation 78
 
1.1%
Uppercase Letter 56
 
0.8%
Open Punctuation 37
 
0.5%
Lowercase Letter 37
 
0.5%
Close Punctuation 36
 
0.5%
Dash Punctuation 17
 
0.2%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
162
 
3.2%
119
 
2.4%
97
 
1.9%
87
 
1.7%
87
 
1.7%
85
 
1.7%
72
 
1.4%
70
 
1.4%
57
 
1.1%
54
 
1.1%
Other values (609) 4148
82.3%
Uppercase Letter
ValueCountFrequency (%)
P 9
16.1%
T 9
16.1%
G 8
14.3%
M 4
 
7.1%
Z 4
 
7.1%
S 3
 
5.4%
I 3
 
5.4%
A 2
 
3.6%
V 2
 
3.6%
E 2
 
3.6%
Other values (9) 10
17.9%
Lowercase Letter
ValueCountFrequency (%)
r 6
16.2%
e 5
13.5%
o 4
10.8%
t 4
10.8%
n 4
10.8%
i 3
8.1%
d 2
 
5.4%
s 2
 
5.4%
l 1
 
2.7%
h 1
 
2.7%
Other values (5) 5
13.5%
Decimal Number
ValueCountFrequency (%)
1 62
27.9%
2 44
19.8%
0 26
11.7%
3 25
11.3%
6 19
 
8.6%
4 14
 
6.3%
5 11
 
5.0%
7 9
 
4.1%
9 8
 
3.6%
8 4
 
1.8%
Other Punctuation
ValueCountFrequency (%)
. 25
32.1%
: 24
30.8%
, 14
17.9%
! 11
14.1%
' 2
 
2.6%
% 1
 
1.3%
& 1
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 36
97.3%
[ 1
 
2.7%
Close Punctuation
ValueCountFrequency (%)
) 35
97.2%
] 1
 
2.8%
Space Separator
ValueCountFrequency (%)
1345
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Math Symbol
ValueCountFrequency (%)
+ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5038
73.3%
Common 1738
 
25.3%
Latin 93
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
162
 
3.2%
119
 
2.4%
97
 
1.9%
87
 
1.7%
87
 
1.7%
85
 
1.7%
72
 
1.4%
70
 
1.4%
57
 
1.1%
54
 
1.1%
Other values (609) 4148
82.3%
Latin
ValueCountFrequency (%)
P 9
 
9.7%
T 9
 
9.7%
G 8
 
8.6%
r 6
 
6.5%
e 5
 
5.4%
o 4
 
4.3%
M 4
 
4.3%
Z 4
 
4.3%
t 4
 
4.3%
n 4
 
4.3%
Other values (24) 36
38.7%
Common
ValueCountFrequency (%)
1345
77.4%
1 62
 
3.6%
2 44
 
2.5%
( 36
 
2.1%
) 35
 
2.0%
0 26
 
1.5%
3 25
 
1.4%
. 25
 
1.4%
: 24
 
1.4%
6 19
 
1.1%
Other values (14) 97
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5038
73.3%
ASCII 1831
 
26.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1345
73.5%
1 62
 
3.4%
2 44
 
2.4%
( 36
 
2.0%
) 35
 
1.9%
0 26
 
1.4%
3 25
 
1.4%
. 25
 
1.4%
: 24
 
1.3%
6 19
 
1.0%
Other values (48) 190
 
10.4%
Hangul
ValueCountFrequency (%)
162
 
3.2%
119
 
2.4%
97
 
1.9%
87
 
1.7%
87
 
1.7%
85
 
1.7%
72
 
1.4%
70
 
1.4%
57
 
1.1%
54
 
1.1%
Other values (609) 4148
82.3%
Distinct495
Distinct (%)86.1%
Missing0
Missing (%)0.0%
Memory size4.6 KiB
2024-01-10T05:49:39.302641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length3
Mean length4.6434783
Min length2

Characters and Unicode

Total characters2670
Distinct characters391
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique444 ?
Unique (%)77.2%

Sample

1st row나오미 클라인
2nd row김민형
3rd row이강안
4th row황혜영
5th row오태민
ValueCountFrequency (%)
23
 
2.9%
히로시마 8
 
1.0%
레이코 8
 
1.0%
레온 7
 
0.9%
이미지 7
 
0.9%
설민석 5
 
0.6%
김지우 4
 
0.5%
윤병무 4
 
0.5%
4
 
0.5%
교고쿠 4
 
0.5%
Other values (647) 729
90.8%
2024-01-10T05:49:39.721087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
228
 
8.5%
93
 
3.5%
65
 
2.4%
61
 
2.3%
57
 
2.1%
48
 
1.8%
37
 
1.4%
36
 
1.3%
31
 
1.2%
31
 
1.2%
Other values (381) 1983
74.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2379
89.1%
Space Separator 228
 
8.5%
Uppercase Letter 32
 
1.2%
Other Punctuation 13
 
0.5%
Math Symbol 9
 
0.3%
Lowercase Letter 5
 
0.2%
Dash Punctuation 2
 
0.1%
Decimal Number 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
93
 
3.9%
65
 
2.7%
61
 
2.6%
57
 
2.4%
48
 
2.0%
37
 
1.6%
36
 
1.5%
31
 
1.3%
31
 
1.3%
30
 
1.3%
Other values (348) 1890
79.4%
Uppercase Letter
ValueCountFrequency (%)
T 4
12.5%
N 3
 
9.4%
S 3
 
9.4%
V 2
 
6.2%
I 2
 
6.2%
K 2
 
6.2%
J 2
 
6.2%
O 2
 
6.2%
C 2
 
6.2%
B 1
 
3.1%
Other values (9) 9
28.1%
Math Symbol
ValueCountFrequency (%)
> 3
33.3%
< 3
33.3%
+ 2
22.2%
~ 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
, 7
53.8%
. 5
38.5%
! 1
 
7.7%
Lowercase Letter
ValueCountFrequency (%)
t 2
40.0%
v 2
40.0%
n 1
20.0%
Decimal Number
ValueCountFrequency (%)
6 1
50.0%
5 1
50.0%
Space Separator
ValueCountFrequency (%)
228
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2379
89.1%
Common 254
 
9.5%
Latin 37
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
93
 
3.9%
65
 
2.7%
61
 
2.6%
57
 
2.4%
48
 
2.0%
37
 
1.6%
36
 
1.5%
31
 
1.3%
31
 
1.3%
30
 
1.3%
Other values (348) 1890
79.4%
Latin
ValueCountFrequency (%)
T 4
 
10.8%
N 3
 
8.1%
S 3
 
8.1%
V 2
 
5.4%
I 2
 
5.4%
K 2
 
5.4%
J 2
 
5.4%
O 2
 
5.4%
C 2
 
5.4%
t 2
 
5.4%
Other values (12) 13
35.1%
Common
ValueCountFrequency (%)
228
89.8%
, 7
 
2.8%
. 5
 
2.0%
> 3
 
1.2%
< 3
 
1.2%
+ 2
 
0.8%
- 2
 
0.8%
6 1
 
0.4%
5 1
 
0.4%
! 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2379
89.1%
ASCII 291
 
10.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
228
78.4%
, 7
 
2.4%
. 5
 
1.7%
T 4
 
1.4%
N 3
 
1.0%
S 3
 
1.0%
> 3
 
1.0%
< 3
 
1.0%
V 2
 
0.7%
+ 2
 
0.7%
Other values (23) 31
 
10.7%
Hangul
ValueCountFrequency (%)
93
 
3.9%
65
 
2.7%
61
 
2.6%
57
 
2.4%
48
 
2.0%
37
 
1.6%
36
 
1.5%
31
 
1.3%
31
 
1.3%
30
 
1.3%
Other values (348) 1890
79.4%
Distinct332
Distinct (%)57.8%
Missing1
Missing (%)0.2%
Memory size4.6 KiB
2024-01-10T05:49:39.978491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length4.130662
Min length1

Characters and Unicode

Total characters2371
Distinct characters362
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique229 ?
Unique (%)39.9%

Sample

1st row열린책들
2nd row웅진지식하우스
3rd row거북이북스
4th row다산어린이
5th row케이디북스
ValueCountFrequency (%)
창비 17
 
2.9%
문학동네 14
 
2.4%
위즈덤하우스 10
 
1.7%
주니어김영사 9
 
1.5%
열린책들 9
 
1.5%
밝은미래 8
 
1.4%
민음사 7
 
1.2%
길벗스쿨 7
 
1.2%
시공사 6
 
1.0%
웅진지식하우스 6
 
1.0%
Other values (326) 488
84.0%
2024-01-10T05:49:40.338076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
121
 
5.1%
87
 
3.7%
67
 
2.8%
65
 
2.7%
62
 
2.6%
46
 
1.9%
41
 
1.7%
39
 
1.6%
38
 
1.6%
36
 
1.5%
Other values (352) 1769
74.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2328
98.2%
Uppercase Letter 20
 
0.8%
Decimal Number 9
 
0.4%
Space Separator 7
 
0.3%
Lowercase Letter 7
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
121
 
5.2%
87
 
3.7%
67
 
2.9%
65
 
2.8%
62
 
2.7%
46
 
2.0%
41
 
1.8%
39
 
1.7%
38
 
1.6%
36
 
1.5%
Other values (330) 1726
74.1%
Uppercase Letter
ValueCountFrequency (%)
B 4
20.0%
K 3
15.0%
H 2
10.0%
R 2
10.0%
S 2
10.0%
O 2
10.0%
I 1
 
5.0%
Z 1
 
5.0%
E 1
 
5.0%
F 1
 
5.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
14.3%
e 1
14.3%
n 1
14.3%
d 1
14.3%
s 1
14.3%
i 1
14.3%
r 1
14.3%
Decimal Number
ValueCountFrequency (%)
2 5
55.6%
1 3
33.3%
6 1
 
11.1%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2328
98.2%
Latin 27
 
1.1%
Common 16
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
121
 
5.2%
87
 
3.7%
67
 
2.9%
65
 
2.8%
62
 
2.7%
46
 
2.0%
41
 
1.8%
39
 
1.7%
38
 
1.6%
36
 
1.5%
Other values (330) 1726
74.1%
Latin
ValueCountFrequency (%)
B 4
14.8%
K 3
 
11.1%
H 2
 
7.4%
R 2
 
7.4%
S 2
 
7.4%
O 2
 
7.4%
I 1
 
3.7%
Z 1
 
3.7%
b 1
 
3.7%
E 1
 
3.7%
Other values (8) 8
29.6%
Common
ValueCountFrequency (%)
7
43.8%
2 5
31.2%
1 3
18.8%
6 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2328
98.2%
ASCII 43
 
1.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
121
 
5.2%
87
 
3.7%
67
 
2.9%
65
 
2.8%
62
 
2.7%
46
 
2.0%
41
 
1.8%
39
 
1.7%
38
 
1.6%
36
 
1.5%
Other values (330) 1726
74.1%
ASCII
ValueCountFrequency (%)
7
16.3%
2 5
11.6%
B 4
 
9.3%
K 3
 
7.0%
1 3
 
7.0%
H 2
 
4.7%
R 2
 
4.7%
S 2
 
4.7%
O 2
 
4.7%
I 1
 
2.3%
Other values (12) 12
27.9%

단가
Real number (ℝ)

HIGH CORRELATION 

Distinct74
Distinct (%)12.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17330.087
Minimum5000
Maximum55500
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.2 KiB
2024-01-10T05:49:40.470109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5000
5-th percentile12000
Q114000
median16000
Q318000
95-th percentile29650
Maximum55500
Range50500
Interquartile range (IQR)4000

Descriptive statistics

Standard deviation5983.1106
Coefficient of variation (CV)0.34524412
Kurtosis8.4918786
Mean17330.087
Median Absolute Deviation (MAD)2000
Skewness2.4372052
Sum9964800
Variance35797612
MonotonicityNot monotonic
2024-01-10T05:49:40.589219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15000 63
 
11.0%
13000 48
 
8.3%
16000 41
 
7.1%
14000 37
 
6.4%
17000 36
 
6.3%
12000 34
 
5.9%
18000 33
 
5.7%
16800 31
 
5.4%
22000 19
 
3.3%
15800 15
 
2.6%
Other values (64) 218
37.9%
ValueCountFrequency (%)
5000 1
 
0.2%
5800 1
 
0.2%
6000 1
 
0.2%
7000 1
 
0.2%
7200 1
 
0.2%
7500 1
 
0.2%
9000 3
0.5%
9500 1
 
0.2%
9800 2
0.3%
10000 2
0.3%
ValueCountFrequency (%)
55500 1
 
0.2%
51000 1
 
0.2%
48000 1
 
0.2%
45000 1
 
0.2%
40000 3
0.5%
39000 3
0.5%
38500 1
 
0.2%
38000 2
0.3%
37000 1
 
0.2%
36000 2
0.3%

수량
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size4.6 KiB
1
567 
2
 
4
3
 
2
10
 
1
4
 
1

Length

Max length2
Median length1
Mean length1.0017391
Min length1

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 567
98.6%
2 4
 
0.7%
3 2
 
0.3%
10 1
 
0.2%
4 1
 
0.2%

Length

2024-01-10T05:49:40.697663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:49:40.786832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 567
98.6%
2 4
 
0.7%
3 2
 
0.3%
10 1
 
0.2%
4 1
 
0.2%

금액
Real number (ℝ)

HIGH CORRELATION 

Distinct78
Distinct (%)13.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17715.826
Minimum5000
Maximum58000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.2 KiB
2024-01-10T05:49:40.889492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5000
5-th percentile12000
Q114000
median16000
Q318500
95-th percentile32000
Maximum58000
Range53000
Interquartile range (IQR)4500

Descriptive statistics

Standard deviation6760.7761
Coefficient of variation (CV)0.38162353
Kurtosis9.1815836
Mean17715.826
Median Absolute Deviation (MAD)2000
Skewness2.6393234
Sum10186600
Variance45708094
MonotonicityNot monotonic
2024-01-10T05:49:41.025119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15000 63
 
11.0%
13000 45
 
7.8%
16000 41
 
7.1%
14000 37
 
6.4%
17000 36
 
6.3%
12000 34
 
5.9%
18000 32
 
5.6%
16800 29
 
5.0%
22000 18
 
3.1%
15800 15
 
2.6%
Other values (68) 225
39.1%
ValueCountFrequency (%)
5000 1
 
0.2%
6000 1
 
0.2%
7000 1
 
0.2%
7200 1
 
0.2%
7500 1
 
0.2%
9000 3
0.5%
9500 1
 
0.2%
9800 2
0.3%
10000 2
0.3%
10500 1
 
0.2%
ValueCountFrequency (%)
58000 1
 
0.2%
55500 1
 
0.2%
54000 1
 
0.2%
52000 1
 
0.2%
51000 1
 
0.2%
48000 1
 
0.2%
45000 1
 
0.2%
44000 1
 
0.2%
40000 3
0.5%
39000 4
0.7%

Interactions

2024-01-10T05:49:36.614723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:34.881183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:35.302700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:35.749341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:36.214592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:36.692268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:34.962862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:35.395731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:35.846104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:36.315722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:36.774667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:35.034167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:35.482275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:35.946384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:36.402563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:36.861948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:35.114889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:35.566072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:36.031607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:36.472020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:36.934735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:35.205473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:35.654670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:36.118294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:49:36.540294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:49:41.101352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구입 월구분분류단가수량금액
연번1.0000.9850.4140.3400.3150.1220.285
구입 월0.9851.0000.4420.3700.2740.2150.195
구분0.4140.4421.0000.3990.4090.0740.420
분류0.3400.3700.3991.0000.5200.3920.565
단가0.3150.2740.4090.5201.0000.1690.989
수량0.1220.2150.0740.3920.1691.0000.844
금액0.2850.1950.4200.5650.9890.8441.000
2024-01-10T05:49:41.187170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분수량
구분1.0000.060
수량0.0601.000
2024-01-10T05:49:41.257881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구입 월분류단가금액구분수량
연번1.0000.9920.058-0.026-0.0350.2580.050
구입 월0.9921.0000.063-0.041-0.0490.2800.087
분류0.0580.0631.000-0.344-0.3290.2480.172
단가-0.026-0.041-0.3441.0000.9650.2540.060
금액-0.035-0.049-0.3290.9651.0000.2620.505
구분0.2580.2800.2480.2540.2621.0000.060
수량0.0500.0870.1720.0600.5050.0601.000

Missing values

2024-01-10T05:49:37.044835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:49:37.170589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구입 월구분분류도서명저자명발행자단가수량금액
011일반300이것이 모든 것을 바꾼다나오미 클라인열린책들33000133000
121일반100삶이라는 우주를 건너는 너에게김민형웅진지식하우스16800116800
231아동400테일즈런너 수학킹왕짱!.30이강안거북이북스10500110500
341아동400최재천의 동물대탐험.1황혜영다산어린이15000115000
451일반300비트코인 지혜의 족보(개정판)오태민케이디북스24500124500
561아동800소능력자들.7김하연마술피리12000112000
671일반500오지라퍼 선생님의 초등 학부모 수업김현경책소유16000116000
781일반100저 많은 돼지고기는 어디서 왔을까후루사와 고유나무를심는사람들13500113500
891일반300마흔 살의 정리법사카오카 요코이아소12000112000
9101일반300박 회계사의 사업보고서 분석법박동흠부크온22000122000
연번구입 월구분분류도서명저자명발행자단가수량금액
56556611아동800프리워터아미나 루크먼 도슨밝은미래17500117500
56656711일반100하루 10분, 철학이 필요한 시간위저쥔알레25000125000
56756811아동800한여름 산타 할머니박서진바람의아이들14000114000
56856911아동800해님 달님 떡집김리리비룡소13000113000
56957011일반800헌치백이치가와 사오허블12000112000
57057111유아800호박 목욕탕시바타 케이코위즈덤하우스16700116700
57157211일반800황금종이 1조정래해냄출판사18500118500
57257311일반800황금종이 2조정래해냄출판사18500118500
57357411아동400흔한남매 과학 탐험대 8: 생물 2흔한남매주니어김영사14000114000
57457511아동900흔한남매 별난 세계 여행 2: 고대도시흔한남매미래엔아이세움14000114000