Overview

Dataset statistics

Number of variables10
Number of observations596
Missing cells2
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory50.2 KiB
Average record size in memory86.2 B

Variable types

Numeric5
Categorical2
Text3

Dataset

Description충청남도 보령시립도서관의 2023년도 희망도서 구입목록 데이터입니다. 구입 월, 구분(성인/아동), 분류, 도서명, 저자명, 발행자, 단가, 수량, 금액 정보를 제공합니다. (기존 보령시 중앙도서관의 명칭이 "보령시립도서관"으로 변경되었습니다.)
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=97&beforeMenuCd=DOM_000000201001001000&publicdatapk=15104393

Alerts

연번 is highly overall correlated with 구입 월High correlation
구입 월 is highly overall correlated with 연번High correlation
단가 is highly overall correlated with 금액High correlation
금액 is highly overall correlated with 단가 and 1 other fieldsHigh correlation
수량 is highly overall correlated with 금액High correlation
수량 is highly imbalanced (96.6%)Imbalance
연번 has unique valuesUnique
도서명 has unique valuesUnique
분류 has 23 (3.9%) zerosZeros

Reproduction

Analysis started2024-01-09 20:48:34.569539
Analysis finished2024-01-09 20:48:37.787325
Duration3.22 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct596
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean298.5
Minimum1
Maximum596
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.4 KiB
2024-01-10T05:48:37.864139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile30.75
Q1149.75
median298.5
Q3447.25
95-th percentile566.25
Maximum596
Range595
Interquartile range (IQR)297.5

Descriptive statistics

Standard deviation172.19466
Coefficient of variation (CV)0.57686652
Kurtosis-1.2
Mean298.5
Median Absolute Deviation (MAD)149
Skewness0
Sum177906
Variance29651
MonotonicityStrictly increasing
2024-01-10T05:48:38.004310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
394 1
 
0.2%
396 1
 
0.2%
397 1
 
0.2%
398 1
 
0.2%
399 1
 
0.2%
400 1
 
0.2%
401 1
 
0.2%
402 1
 
0.2%
403 1
 
0.2%
Other values (586) 586
98.3%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
596 1
0.2%
595 1
0.2%
594 1
0.2%
593 1
0.2%
592 1
0.2%
591 1
0.2%
590 1
0.2%
589 1
0.2%
588 1
0.2%
587 1
0.2%

구입 월
Real number (ℝ)

HIGH CORRELATION 

Distinct9
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.8758389
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.4 KiB
2024-01-10T05:48:38.103629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median4
Q37
95-th percentile9
Maximum9
Range8
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.6066534
Coefficient of variation (CV)0.53460614
Kurtosis-1.2540775
Mean4.8758389
Median Absolute Deviation (MAD)2
Skewness0.088454155
Sum2906
Variance6.7946422
MonotonicityIncreasing
2024-01-10T05:48:38.200490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
4 105
17.6%
1 74
12.4%
8 72
12.1%
3 68
11.4%
6 62
10.4%
9 62
10.4%
2 61
10.2%
7 60
10.1%
5 32
 
5.4%
ValueCountFrequency (%)
1 74
12.4%
2 61
10.2%
3 68
11.4%
4 105
17.6%
5 32
 
5.4%
6 62
10.4%
7 60
10.1%
8 72
12.1%
9 62
10.4%
ValueCountFrequency (%)
9 62
10.4%
8 72
12.1%
7 60
10.1%
6 62
10.4%
5 32
 
5.4%
4 105
17.6%
3 68
11.4%
2 61
10.2%
1 74
12.4%

구분
Categorical

Distinct3
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
일반
424 
아동
136 
유아
 
36

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row아동
4th row아동
5th row아동

Common Values

ValueCountFrequency (%)
일반 424
71.1%
아동 136
 
22.8%
유아 36
 
6.0%

Length

2024-01-10T05:48:38.561661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:48:38.643204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 424
71.1%
아동 136
 
22.8%
유아 36
 
6.0%

분류
Real number (ℝ)

ZEROS 

Distinct10
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean575.83893
Minimum0
Maximum900
Zeros23
Zeros (%)3.9%
Negative0
Negative (%)0.0%
Memory size5.4 KiB
2024-01-10T05:48:38.727280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile100
Q1300
median800
Q3800
95-th percentile900
Maximum900
Range900
Interquartile range (IQR)500

Descriptive statistics

Standard deviation280.73158
Coefficient of variation (CV)0.48751754
Kurtosis-1.1224058
Mean575.83893
Median Absolute Deviation (MAD)100
Skewness-0.61423462
Sum343200
Variance78810.219
MonotonicityNot monotonic
2024-01-10T05:48:38.818930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
800 294
49.3%
300 99
 
16.6%
100 51
 
8.6%
400 44
 
7.4%
500 33
 
5.5%
900 31
 
5.2%
0 23
 
3.9%
600 15
 
2.5%
200 4
 
0.7%
700 2
 
0.3%
ValueCountFrequency (%)
0 23
 
3.9%
100 51
 
8.6%
200 4
 
0.7%
300 99
 
16.6%
400 44
 
7.4%
500 33
 
5.5%
600 15
 
2.5%
700 2
 
0.3%
800 294
49.3%
900 31
 
5.2%
ValueCountFrequency (%)
900 31
 
5.2%
800 294
49.3%
700 2
 
0.3%
600 15
 
2.5%
500 33
 
5.5%
400 44
 
7.4%
300 99
 
16.6%
200 4
 
0.7%
100 51
 
8.6%
0 23
 
3.9%

도서명
Text

UNIQUE 

Distinct596
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2024-01-10T05:48:39.080799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length33
Mean length11.946309
Min length2

Characters and Unicode

Total characters7120
Distinct characters679
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique596 ?
Unique (%)100.0%

Sample

1st row파이어족의 재테크
2nd row유튜브 이야기
3rd row비밀의 보석 가게 마석관.2
4th row기묘한 모모한약방 3
5th row유령고양이 후쿠코.3
ValueCountFrequency (%)
25
 
1.3%
법칙 10
 
0.5%
2 10
 
0.5%
정글의 9
 
0.5%
이야기 9
 
0.5%
1 8
 
0.4%
과학 7
 
0.4%
나는 7
 
0.4%
위한 7
 
0.4%
7
 
0.4%
Other values (1488) 1831
94.9%
2024-01-10T05:48:39.480510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1342
 
18.8%
172
 
2.4%
139
 
2.0%
120
 
1.7%
102
 
1.4%
88
 
1.2%
80
 
1.1%
73
 
1.0%
69
 
1.0%
69
 
1.0%
Other values (669) 4866
68.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5291
74.3%
Space Separator 1342
 
18.8%
Decimal Number 234
 
3.3%
Other Punctuation 132
 
1.9%
Uppercase Letter 27
 
0.4%
Lowercase Letter 25
 
0.4%
Dash Punctuation 24
 
0.3%
Open Punctuation 21
 
0.3%
Close Punctuation 21
 
0.3%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
172
 
3.3%
139
 
2.6%
120
 
2.3%
102
 
1.9%
88
 
1.7%
80
 
1.5%
73
 
1.4%
69
 
1.3%
69
 
1.3%
62
 
1.2%
Other values (623) 4317
81.6%
Uppercase Letter
ValueCountFrequency (%)
T 9
33.3%
D 3
 
11.1%
V 3
 
11.1%
L 3
 
11.1%
I 2
 
7.4%
S 1
 
3.7%
B 1
 
3.7%
M 1
 
3.7%
X 1
 
3.7%
N 1
 
3.7%
Other values (2) 2
 
7.4%
Lowercase Letter
ValueCountFrequency (%)
e 4
16.0%
i 4
16.0%
v 3
12.0%
t 3
12.0%
s 2
8.0%
o 2
8.0%
a 2
8.0%
w 1
 
4.0%
h 1
 
4.0%
d 1
 
4.0%
Other values (2) 2
8.0%
Decimal Number
ValueCountFrequency (%)
1 68
29.1%
2 48
20.5%
3 37
15.8%
0 25
 
10.7%
5 19
 
8.1%
4 15
 
6.4%
8 7
 
3.0%
9 6
 
2.6%
6 6
 
2.6%
7 3
 
1.3%
Other Punctuation
ValueCountFrequency (%)
. 68
51.5%
, 29
22.0%
! 19
 
14.4%
: 14
 
10.6%
& 1
 
0.8%
/ 1
 
0.8%
Space Separator
ValueCountFrequency (%)
1342
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5291
74.3%
Common 1777
 
25.0%
Latin 52
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
172
 
3.3%
139
 
2.6%
120
 
2.3%
102
 
1.9%
88
 
1.7%
80
 
1.5%
73
 
1.4%
69
 
1.3%
69
 
1.3%
62
 
1.2%
Other values (623) 4317
81.6%
Latin
ValueCountFrequency (%)
T 9
17.3%
e 4
 
7.7%
i 4
 
7.7%
D 3
 
5.8%
v 3
 
5.8%
t 3
 
5.8%
V 3
 
5.8%
L 3
 
5.8%
s 2
 
3.8%
o 2
 
3.8%
Other values (14) 16
30.8%
Common
ValueCountFrequency (%)
1342
75.5%
. 68
 
3.8%
1 68
 
3.8%
2 48
 
2.7%
3 37
 
2.1%
, 29
 
1.6%
0 25
 
1.4%
- 24
 
1.4%
( 21
 
1.2%
) 21
 
1.2%
Other values (12) 94
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5291
74.3%
ASCII 1828
 
25.7%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1342
73.4%
. 68
 
3.7%
1 68
 
3.7%
2 48
 
2.6%
3 37
 
2.0%
, 29
 
1.6%
0 25
 
1.4%
- 24
 
1.3%
( 21
 
1.1%
) 21
 
1.1%
Other values (35) 145
 
7.9%
Hangul
ValueCountFrequency (%)
172
 
3.3%
139
 
2.6%
120
 
2.3%
102
 
1.9%
88
 
1.7%
80
 
1.5%
73
 
1.4%
69
 
1.3%
69
 
1.3%
62
 
1.2%
Other values (623) 4317
81.6%
Punctuation
ValueCountFrequency (%)
1
100.0%
Distinct466
Distinct (%)78.2%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2024-01-10T05:48:39.763215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length5.0805369
Min length2

Characters and Unicode

Total characters3028
Distinct characters411
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique399 ?
Unique (%)66.9%

Sample

1st row신현정+신영주
2nd row고몽
3rd row히로시마레이코
4th row히로시마레이코
5th row히로시마레이코
ValueCountFrequency (%)
17
 
1.9%
흔한남매 10
 
1.1%
제작팀 9
 
1.0%
정글의 8
 
0.9%
법칙 8
 
0.9%
원작 8
 
0.9%
이수지 8
 
0.9%
sbs 8
 
0.9%
정보라 7
 
0.8%
세이 6
 
0.7%
Other values (625) 793
89.9%
2024-01-10T05:48:40.164976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
286
 
9.4%
142
 
4.7%
87
 
2.9%
66
 
2.2%
63
 
2.1%
47
 
1.6%
37
 
1.2%
35
 
1.2%
35
 
1.2%
32
 
1.1%
Other values (401) 2198
72.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2601
85.9%
Space Separator 286
 
9.4%
Uppercase Letter 58
 
1.9%
Math Symbol 26
 
0.9%
Lowercase Letter 24
 
0.8%
Other Punctuation 17
 
0.6%
Close Punctuation 8
 
0.3%
Open Punctuation 8
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
142
 
5.5%
87
 
3.3%
66
 
2.5%
63
 
2.4%
47
 
1.8%
37
 
1.4%
35
 
1.3%
35
 
1.3%
32
 
1.2%
30
 
1.2%
Other values (368) 2027
77.9%
Uppercase Letter
ValueCountFrequency (%)
S 20
34.5%
B 10
17.2%
J 6
 
10.3%
D 3
 
5.2%
R 3
 
5.2%
T 3
 
5.2%
V 2
 
3.4%
E 2
 
3.4%
N 2
 
3.4%
K 2
 
3.4%
Other values (5) 5
 
8.6%
Lowercase Letter
ValueCountFrequency (%)
z 6
25.0%
e 4
16.7%
n 3
12.5%
v 2
 
8.3%
t 2
 
8.3%
c 2
 
8.3%
o 2
 
8.3%
a 1
 
4.2%
u 1
 
4.2%
l 1
 
4.2%
Math Symbol
ValueCountFrequency (%)
+ 24
92.3%
< 1
 
3.8%
> 1
 
3.8%
Other Punctuation
ValueCountFrequency (%)
. 15
88.2%
, 2
 
11.8%
Space Separator
ValueCountFrequency (%)
286
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2601
85.9%
Common 345
 
11.4%
Latin 82
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
142
 
5.5%
87
 
3.3%
66
 
2.5%
63
 
2.4%
47
 
1.8%
37
 
1.4%
35
 
1.3%
35
 
1.3%
32
 
1.2%
30
 
1.2%
Other values (368) 2027
77.9%
Latin
ValueCountFrequency (%)
S 20
24.4%
B 10
12.2%
J 6
 
7.3%
z 6
 
7.3%
e 4
 
4.9%
D 3
 
3.7%
R 3
 
3.7%
n 3
 
3.7%
T 3
 
3.7%
V 2
 
2.4%
Other values (15) 22
26.8%
Common
ValueCountFrequency (%)
286
82.9%
+ 24
 
7.0%
. 15
 
4.3%
) 8
 
2.3%
( 8
 
2.3%
, 2
 
0.6%
< 1
 
0.3%
> 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2601
85.9%
ASCII 427
 
14.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
286
67.0%
+ 24
 
5.6%
S 20
 
4.7%
. 15
 
3.5%
B 10
 
2.3%
) 8
 
1.9%
( 8
 
1.9%
J 6
 
1.4%
z 6
 
1.4%
e 4
 
0.9%
Other values (23) 40
 
9.4%
Hangul
ValueCountFrequency (%)
142
 
5.5%
87
 
3.3%
66
 
2.5%
63
 
2.4%
47
 
1.8%
37
 
1.4%
35
 
1.3%
35
 
1.3%
32
 
1.2%
30
 
1.2%
Other values (368) 2027
77.9%
Distinct311
Distinct (%)52.4%
Missing2
Missing (%)0.3%
Memory size4.8 KiB
2024-01-10T05:48:40.404669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length4.3451178
Min length1

Characters and Unicode

Total characters2581
Distinct characters340
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique205 ?
Unique (%)34.5%

Sample

1st row쌤앤파커스
2nd row성안당
3rd row길벗스쿨
4th row미래엔아이세움
5th row주니어김영사
ValueCountFrequency (%)
영상출판미디어 24
 
4.0%
미래엔아이세움 16
 
2.7%
주니어김영사 15
 
2.5%
위즈덤하우스 13
 
2.2%
문학동네 12
 
2.0%
비룡소 10
 
1.7%
김영사 9
 
1.5%
창비 9
 
1.5%
알에이치코리아 8
 
1.3%
황금가지 7
 
1.2%
Other values (301) 471
79.3%
2024-01-10T05:48:40.781558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
128
 
5.0%
85
 
3.3%
83
 
3.2%
80
 
3.1%
79
 
3.1%
68
 
2.6%
57
 
2.2%
55
 
2.1%
55
 
2.1%
48
 
1.9%
Other values (330) 1843
71.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2540
98.4%
Uppercase Letter 20
 
0.8%
Decimal Number 10
 
0.4%
Lowercase Letter 8
 
0.3%
Other Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
128
 
5.0%
85
 
3.3%
83
 
3.3%
80
 
3.1%
79
 
3.1%
68
 
2.7%
57
 
2.2%
55
 
2.2%
55
 
2.2%
48
 
1.9%
Other values (308) 1802
70.9%
Uppercase Letter
ValueCountFrequency (%)
O 4
20.0%
B 4
20.0%
S 2
10.0%
K 2
10.0%
P 1
 
5.0%
A 1
 
5.0%
C 1
 
5.0%
I 1
 
5.0%
N 1
 
5.0%
E 1
 
5.0%
Other values (2) 2
10.0%
Lowercase Letter
ValueCountFrequency (%)
i 4
50.0%
n 2
25.0%
e 1
 
12.5%
d 1
 
12.5%
Decimal Number
ValueCountFrequency (%)
2 5
50.0%
1 4
40.0%
3 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
: 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2540
98.4%
Latin 28
 
1.1%
Common 13
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
128
 
5.0%
85
 
3.3%
83
 
3.3%
80
 
3.1%
79
 
3.1%
68
 
2.7%
57
 
2.2%
55
 
2.2%
55
 
2.2%
48
 
1.9%
Other values (308) 1802
70.9%
Latin
ValueCountFrequency (%)
i 4
14.3%
O 4
14.3%
B 4
14.3%
n 2
 
7.1%
S 2
 
7.1%
K 2
 
7.1%
P 1
 
3.6%
A 1
 
3.6%
C 1
 
3.6%
I 1
 
3.6%
Other values (6) 6
21.4%
Common
ValueCountFrequency (%)
2 5
38.5%
1 4
30.8%
: 1
 
7.7%
3 1
 
7.7%
) 1
 
7.7%
( 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2540
98.4%
ASCII 41
 
1.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
128
 
5.0%
85
 
3.3%
83
 
3.3%
80
 
3.1%
79
 
3.1%
68
 
2.7%
57
 
2.2%
55
 
2.2%
55
 
2.2%
48
 
1.9%
Other values (308) 1802
70.9%
ASCII
ValueCountFrequency (%)
2 5
12.2%
i 4
 
9.8%
1 4
 
9.8%
O 4
 
9.8%
B 4
 
9.8%
n 2
 
4.9%
S 2
 
4.9%
K 2
 
4.9%
P 1
 
2.4%
: 1
 
2.4%
Other values (12) 12
29.3%

단가
Real number (ℝ)

HIGH CORRELATION 

Distinct62
Distinct (%)10.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15710.067
Minimum6500
Maximum45000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.4 KiB
2024-01-10T05:48:40.912085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6500
5-th percentile10000
Q112800
median15000
Q316800
95-th percentile25000
Maximum45000
Range38500
Interquartile range (IQR)4000

Descriptive statistics

Standard deviation5147.6681
Coefficient of variation (CV)0.32766684
Kurtosis7.3120403
Mean15710.067
Median Absolute Deviation (MAD)2000
Skewness2.2572847
Sum9363200
Variance26498487
MonotonicityNot monotonic
2024-01-10T05:48:41.035399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15000 72
 
12.1%
12000 68
 
11.4%
16000 55
 
9.2%
14000 41
 
6.9%
13000 37
 
6.2%
11000 22
 
3.7%
18000 20
 
3.4%
16800 20
 
3.4%
17000 17
 
2.9%
10000 15
 
2.5%
Other values (52) 229
38.4%
ValueCountFrequency (%)
6500 3
 
0.5%
8000 3
 
0.5%
8500 3
 
0.5%
9000 9
1.5%
9800 4
 
0.7%
10000 15
2.5%
10500 5
 
0.8%
10800 4
 
0.7%
11000 22
3.7%
11500 3
 
0.5%
ValueCountFrequency (%)
45000 1
 
0.2%
43000 1
 
0.2%
42000 1
 
0.2%
39000 1
 
0.2%
38000 2
0.3%
36000 3
0.5%
35000 1
 
0.2%
34000 1
 
0.2%
33000 2
0.3%
32000 3
0.5%

수량
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
1
592 
2
 
2
4
 
1
10
 
1

Length

Max length2
Median length1
Mean length1.0016779
Min length1

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 592
99.3%
2 2
 
0.3%
4 1
 
0.2%
10 1
 
0.2%

Length

2024-01-10T05:48:41.151402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:48:41.239836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 592
99.3%
2 2
 
0.3%
4 1
 
0.2%
10 1
 
0.2%

금액
Real number (ℝ)

HIGH CORRELATION 

Distinct64
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16056.544
Minimum6500
Maximum120000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.4 KiB
2024-01-10T05:48:41.340780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6500
5-th percentile10000
Q112800
median15000
Q316800
95-th percentile25000
Maximum120000
Range113500
Interquartile range (IQR)4000

Descriptive statistics

Standard deviation7338.9194
Coefficient of variation (CV)0.4570672
Kurtosis78.648671
Mean16056.544
Median Absolute Deviation (MAD)2000
Skewness6.8448079
Sum9569700
Variance53859739
MonotonicityNot monotonic
2024-01-10T05:48:41.459768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15000 72
 
12.1%
12000 67
 
11.2%
16000 55
 
9.2%
14000 41
 
6.9%
13000 37
 
6.2%
11000 22
 
3.7%
18000 20
 
3.4%
16800 20
 
3.4%
17000 17
 
2.9%
10000 15
 
2.5%
Other values (54) 230
38.6%
ValueCountFrequency (%)
6500 3
 
0.5%
8000 3
 
0.5%
8500 3
 
0.5%
9000 9
1.5%
9800 4
 
0.7%
10000 15
2.5%
10500 5
 
0.8%
10800 4
 
0.7%
11000 22
3.7%
11500 2
 
0.3%
ValueCountFrequency (%)
120000 1
 
0.2%
80000 1
 
0.2%
54000 1
 
0.2%
45000 1
 
0.2%
43000 1
 
0.2%
42000 1
 
0.2%
39000 1
 
0.2%
38000 2
0.3%
36000 3
0.5%
35000 1
 
0.2%

Interactions

2024-01-10T05:48:37.039185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:35.326712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:35.727400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:36.154621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:36.542052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:37.143133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:35.402654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:35.809859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:36.233138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:36.629452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:37.253432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:35.485774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:35.900795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:36.313049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:36.727301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:37.340643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:35.562961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:35.981745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:36.390595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:36.824501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:37.447570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:35.647456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:36.074369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:36.471729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:48:36.934345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:48:41.546505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구입 월구분분류단가수량금액
연번1.0000.9490.2970.4130.1680.0000.076
구입 월0.9491.0000.4160.2840.1480.1490.151
구분0.2970.4161.0000.3550.5430.0670.262
분류0.4130.2840.3551.0000.5050.0800.304
단가0.1680.1480.5430.5051.0000.2300.870
수량0.0000.1490.0670.0800.2301.0000.931
금액0.0760.1510.2620.3040.8700.9311.000
2024-01-10T05:48:41.640303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분수량
구분1.0000.063
수량0.0631.000
2024-01-10T05:48:41.724958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구입 월분류단가금액구분수량
연번1.0000.9920.0610.005-0.0020.1870.000
구입 월0.9921.0000.061-0.008-0.0150.2010.095
분류0.0610.0611.000-0.381-0.3690.2270.047
단가0.005-0.008-0.3811.0000.9870.3830.139
금액-0.002-0.015-0.3690.9871.0000.1830.910
구분0.1870.2010.2270.3830.1831.0000.063
수량0.0000.0950.0470.1390.9100.0631.000

Missing values

2024-01-10T05:48:37.579895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:48:37.730067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구입 월구분분류도서명저자명발행자단가수량금액
011일반300파이어족의 재테크신현정+신영주쌤앤파커스15000115000
121일반300유튜브 이야기고몽성안당23000123000
231아동800비밀의 보석 가게 마석관.2히로시마레이코길벗스쿨12000112000
341아동800기묘한 모모한약방 3히로시마레이코미래엔아이세움12000112000
451아동800유령고양이 후쿠코.3히로시마레이코주니어김영사12000112000
561아동800블라인드.2잠뜰TV서울문화사12000112000
671일반900둠 : 재앙의 정치학니얼 퍼거슨21세기북스38000138000
781일반800무너진 다리천선란그래비티북스16000116000
891일반800루시 (저메이카 킨케이드장편소설)저메이카 킨케이드문학동네12000112000
9101일반400에이다, 당신이군요. 최초의 프로그래머시드니 파두아곰출판20000120000
연번구입 월구분분류도서명저자명발행자단가수량금액
5865879일반600영어 읽기 독립 로드맵이설희사람in16500116500
5875889유아800우유에 녹아든 설탕처럼스리티 움리가웅진주니어14000114000
5885899일반800스타피시리사 핍스아르테18500118500
5895909일반100쓰는 습관이시카와 유키뜨인돌출판사14000114000
5905919아동800푸른 사자 와니니 4이현창비12000112000
5915929아동800푸른 사자 와니니 5이현창비12000112000
5925939아동800책 먹는 여우의 여름 이야기프란치스카 비어만주니어김영사13000113000
5935949아동800윙페더 사가.2앤드루 피터슨다산책방22000122000
5945959아동300열두 살 경제학교권오상카시오페아15000115000
5955969아동800마트 사장 구드래곤박현숙다산어린이13000113000