Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 7874 |
Missing cells | 3 |
Missing cells (%) | < 0.1% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 515.3 KiB |
Average record size in memory | 67.0 B |
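The derived figures in this table can be rechecked from the raw counts. The sketch below only uses numbers copied from the table; it assumes that KiB means 1024 bytes and that the missing-cell percentage is taken over all cells (variables × observations).

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 8
n_observations = 7874
missing_cells = 3
total_size_kib = 515.3

# Assumption: percentages are taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the missing share is far below 0.1% and the average record size rounds to the reported 67.0 B.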
Variable types
Numeric | 2 |
---|---|
Text | 2 |
Unsupported | 3 |
Categorical | 1 |
Dataset
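Description | Book information (title, author, publication year, price, publisher) for the 7,874 books purchased by Gumi Library in 2022. |
---|---|
Author | Gyeongsangbuk-do Office of Education, Gumi Library (경상북도교육청 경상북도교육청구미도서관) |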
URL | https://www.data.go.kr/data/3068857/fileData.do |
금액 is highly overall correlated with 책수 | High correlation |
책수 is highly overall correlated with 금액 | High correlation |
책수 is highly imbalanced (58.5%) | Imbalance |
번호 has unique values | Unique |
발행자 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
발행년 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
단가 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
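The report does not say why 발행자, 발행년 and 단가 were rejected as unsupported. One common cause, assumed here and not confirmed by the report, is a column that mixes numbers with text such as `"13,000"` or `"N/A"`. A minimal cleaning sketch for a price-like column follows; `coerce_number` and the sample inputs are hypothetical.

```python
from typing import Optional


def coerce_number(value: object) -> Optional[float]:
    """Best-effort conversion of a raw cell to a float; None if not numeric."""
    if value is None or isinstance(value, bool):
        return None
    if isinstance(value, (int, float)):
        return float(value)
    # Strip surrounding whitespace and thousands separators before parsing.
    text = str(value).strip().replace(",", "")
    try:
        return float(text)
    except ValueError:
        return None


# Hypothetical raw cells, not taken from the dataset.
raw_prices = [14800, "13,000", " 15000 ", "N/A", None]
cleaned = [coerce_number(v) for v in raw_prices]
print(cleaned)
```

Cells that cannot be parsed come back as `None`, so they can be counted and inspected before the column is profiled again.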
Reproduction
Analysis started | 2024-04-17 10:57:15.477479 |
---|---|
Analysis finished | 2024-04-17 10:57:17.304771 |
Duration | 1.83 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
번호
Real number (ℝ)
UNIQUE
 
Distinct | 7874 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3937.5 |
Minimum | 1 |
---|---|
Maximum | 7874 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 69.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 394.65 |
Q1 | 1969.25 |
median | 3937.5 |
Q3 | 5905.75 |
95-th percentile | 7480.35 |
Maximum | 7874 |
Range | 7873 |
Interquartile range (IQR) | 3936.5 |
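Because 번호 is simply the sequence 1 to 7874, the quantiles above can be reproduced by hand. The sketch assumes quantiles are computed by linear interpolation between neighbouring ranks; the report does not state the method, but this choice matches every value in the table. The helper name `quantile` is illustrative.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest ranks."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


values = list(range(1, 7875))  # 번호 runs from 1 to 7874

q05 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
q95 = quantile(values, 0.95)
iqr = q3 - q1

print(q05, q1, median, q3, q95, iqr)
```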
Descriptive statistics
Standard deviation | 2273.1723 |
---|---|
Coefficient of variation (CV) | 0.57731361 |
Kurtosis | -1.2 |
Mean | 3937.5 |
Median Absolute Deviation (MAD) | 1968.5 |
Skewness | 0 |
Sum | 31003875 |
Variance | 5167312.5 |
Monotonicity | Strictly increasing |
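These descriptive statistics follow directly from 번호 being a plain row counter. For the integers 1 to n the sum is n(n+1)/2 and the sample variance (dividing by n - 1) is n(n+1)/12. The sketch below rechecks the table with the standard library, assuming the report uses the sample variance.

```python
import statistics

values = list(range(1, 7875))  # 번호 runs from 1 to 7874
n = len(values)

total = sum(values)
mean = total / n
variance = statistics.variance(values)  # sample variance, divides by n - 1
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation

# Strictly increasing means every value is larger than the one before it.
strictly_increasing = all(b > a for a, b in zip(values, values[1:]))

print(total, mean, variance, round(std, 4), round(cv, 8), strictly_increasing)
```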
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
5261 | 1 | < 0.1% |
5259 | 1 | < 0.1% |
5258 | 1 | < 0.1% |
5257 | 1 | < 0.1% |
5256 | 1 | < 0.1% |
5255 | 1 | < 0.1% |
5254 | 1 | < 0.1% |
5253 | 1 | < 0.1% |
5252 | 1 | < 0.1% |
Other values (7864) | 7864 | 99.9% |
Value | Count | Frequency (%) |
1 | 1 | < 0.1% |
2 | 1 | < 0.1% |
3 | 1 | < 0.1% |
4 | 1 | < 0.1% |
5 | 1 | < 0.1% |
6 | 1 | < 0.1% |
7 | 1 | < 0.1% |
8 | 1 | < 0.1% |
9 | 1 | < 0.1% |
10 | 1 | < 0.1% |
Value | Count | Frequency (%) |
7874 | 1 | < 0.1% |
7873 | 1 | < 0.1% |
7872 | 1 | < 0.1% |
7871 | 1 | < 0.1% |
7870 | 1 | < 0.1% |
7869 | 1 | < 0.1% |
7868 | 1 | < 0.1% |
7867 | 1 | < 0.1% |
7866 | 1 | < 0.1% |
7865 | 1 | < 0.1% |
서명
Text
Distinct | 7777 |
---|---|
Distinct (%) | 98.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 61.6 KiB |
Length
Max length | 132 |
---|---|
Median length | 72 |
Mean length | 23.418339 |
Min length | 1 |
Characters and Unicode
Total characters | 184396 |
---|---|
Distinct characters | 1659 |
Distinct categories | 17 ? |
Distinct scripts | 8 ? |
Distinct blocks | 14 ? |
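The length figures can be cross-checked against the character totals. The mean length should equal total characters divided by the number of rows. In addition, lengths are non-negative and at least half of the rows are as long as the median, so the mean can never fall below half the median. The sketch applies both checks to the figures reported for 서명 here and for 저자 later in the report.

```python
# Figures copied from the report for the two text columns.
columns = {
    "서명": {"total_chars": 184396, "mean": 23.418339, "median": 72},
    "저자": {"total_chars": 60263, "mean": 7.6534163, "median": 50},
}
n_rows = 7874  # neither column has missing values

checks = {}
for name, stats in columns.items():
    recomputed_mean = stats["total_chars"] / n_rows
    # For non-negative lengths: mean >= median / 2 must always hold.
    bound_holds = recomputed_mean >= stats["median"] / 2
    checks[name] = (recomputed_mean, bound_holds)
    print(name, round(recomputed_mean, 6), bound_holds)
```

The recomputed means agree with the report, but the bound fails for both columns, so the reported median lengths (72 and 50) look unreliable and are best recomputed from the raw data before use.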
Unique
Unique | 7680 ? |
---|---|
Unique (%) | 97.5% |
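"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. The sketch shows the difference on a small hypothetical list and then rechecks the two percentages reported for 서명; `distinct_and_unique` is an illustrative helper.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Hypothetical titles: "B" and "C" repeat, "A" and "D" appear once.
titles = ["A", "B", "B", "C", "C", "C", "D"]
distinct, unique = distinct_and_unique(titles)
print(distinct, unique)

# Percentages for 서명, from the counts in the report.
n_rows = 7874
distinct_pct = round(100 * 7777 / n_rows, 1)
unique_pct = round(100 * 7680 / n_rows, 1)
print(distinct_pct, unique_pct)
```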
Sample
1st row | 사랑한다고 말할 용기 |
---|---|
2nd row | 아빠, 쿠키 주세요 |
3rd row | 숨바꼭질! |
4th row | 소년과 살쾡이 |
5th row | 세종대왕을 찾아라 |
Value | Count | Frequency (%) |
(blank) | 4151 | 8.5% |
위한 | 320 | 0.7% |
이야기 | 316 | 0.6% |
2 | 251 | 0.5% |
1 | 241 | 0.5% |
내 | 174 | 0.4% |
과학 | 173 | 0.4% |
3 | 133 | 0.3% |
읽는 | 126 | 0.3% |
나는 | 121 | 0.2% |
Other values (18344) | 42975 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 45114 | 24.5% |
의 | 3393 | 1.8% |
이 | 3290 | 1.8% |
는 | 2816 | 1.5% |
: | 2749 | 1.5% |
기 | 1857 | 1.0% |
가 | 1689 | 0.9% |
한 | 1648 | 0.9% |
지 | 1569 | 0.9% |
리 | 1551 | 0.8% |
Other values (1649) | 118720 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 115711 | 62.8% |
Space Separator | 45114 | 24.5% |
Lowercase Letter | 7735 | 4.2% |
Other Punctuation | 6209 | 3.4% |
Decimal Number | 3511 | 1.9% |
Uppercase Letter | 2393 | 1.3% |
Open Punctuation | 1599 | 0.9% |
Close Punctuation | 1599 | 0.9% |
Math Symbol | 335 | 0.2% |
Nonspacing Mark | 106 | 0.1% |
Other values (7) | 84 | < 0.1% |
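The category names in this table are Unicode general categories. As a rough illustration, Python's `unicodedata` module reports the same categories by their two-letter codes (`Lo` is Other Letter, `Zs` is Space Separator, `Po` is Other Punctuation). This is a sketch, not the profiler's own implementation, applied to the second sample title.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


title = "아빠, 쿠키 주세요"  # second sample row of 서명
counts = category_counts(title)
print(dict(counts))
```

Running the same count over every title and summing the results would give a table of the same shape as the one above.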
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
의 | 3393 | 2.9% |
이 | 3290 | 2.8% |
는 | 2816 | 2.4% |
기 | 1857 | 1.6% |
가 | 1689 | 1.5% |
한 | 1648 | 1.4% |
지 | 1569 | 1.4% |
리 | 1551 | 1.3% |
다 | 1519 | 1.3% |
사 | 1499 | 1.3% |
Other values (1494) | 94880 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 895 | |
o | 720 | 9.3% |
a | 623 | 8.1% |
i | 602 | 7.8% |
n | 579 | 7.5% |
t | 532 | 6.9% |
s | 516 | 6.7% |
r | 492 | 6.4% |
h | 361 | 4.7% |
l | 318 | 4.1% |
Other values (52) | 2097 |
Uppercase Letter
Value | Count | Frequency (%) |
E | 203 | 8.5% |
S | 198 | 8.3% |
I | 188 | 7.9% |
T | 185 | 7.7% |
A | 157 | 6.6% |
L | 148 | 6.2% |
N | 111 | 4.6% |
V | 106 | 4.4% |
O | 101 | 4.2% |
B | 99 | 4.1% |
Other values (17) | 897 |
Other Punctuation
Value | Count | Frequency (%) |
: | 2749 | |
. | 1188 | |
, | 965 | 15.5% |
! | 532 | 8.6% |
? | 383 | 6.2% |
· | 174 | 2.8% |
' | 130 | 2.1% |
& | 28 | 0.5% |
% | 22 | 0.4% |
# | 10 | 0.2% |
Other values (6) | 28 | 0.5% |
Nonspacing Mark
Value | Count | Frequency (%) |
้ | 22 | |
ี | 17 | |
่ | 14 | |
ั | 13 | |
ู | 9 | |
ิ | 9 | |
ุ | 7 | 6.6% |
์ | 5 | 4.7% |
ื | 5 | 4.7% |
́ | 2 | 1.9% |
Other values (2) | 3 | 2.8% |
Decimal Number
Value | Count | Frequency (%) |
1 | 832 | |
2 | 682 | |
0 | 630 | |
3 | 391 | |
5 | 255 | 7.3% |
4 | 251 | 7.1% |
6 | 141 | 4.0% |
9 | 123 | 3.5% |
7 | 113 | 3.2% |
8 | 93 | 2.6% |
Math Symbol
Value | Count | Frequency (%) |
= | 290 | |
+ | 18 | 5.4% |
~ | 14 | 4.2% |
× | 7 | 2.1% |
< | 3 | 0.9% |
> | 3 | 0.9% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1394 | |
[ | 193 | 12.1% |
『 | 7 | 0.4% |
「 | 4 | 0.3% |
《 | 1 | 0.1% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1394 | |
] | 193 | 12.1% |
』 | 7 | 0.4% |
」 | 4 | 0.3% |
》 | 1 | 0.1% |
Other Symbol
Value | Count | Frequency (%) |
★ | 8 | |
ⓔ | 1 | 10.0% |
ⓛ | 1 | 10.0% |
Other Number
Value | Count | Frequency (%) |
② | 3 | |
① | 3 |
Letter Number
Value | Count | Frequency (%) |
Ⅱ | 1 | |
Ⅰ | 1 |
Space Separator
Value | Count | Frequency (%) |
(space) | 45114 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 60 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 4 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 1 |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 115141 | 62.4% |
Common | 58449 | 31.7% |
Latin | 10130 | 5.5% |
Thai | 425 | 0.2% |
Han | 160 | 0.1% |
Hiragana | 66 | < 0.1% |
Katakana | 21 | < 0.1% |
Inherited | 4 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
의 | 3393 | 2.9% |
이 | 3290 | 2.9% |
는 | 2816 | 2.4% |
기 | 1857 | 1.6% |
가 | 1689 | 1.5% |
한 | 1648 | 1.4% |
지 | 1569 | 1.4% |
리 | 1551 | 1.3% |
다 | 1519 | 1.3% |
사 | 1499 | 1.3% |
Other values (1276) | 94310 |
Han
Value | Count | Frequency (%) |
的 | 6 | 3.8% |
学 | 4 | 2.5% |
子 | 3 | 1.9% |
詩 | 3 | 1.9% |
家 | 3 | 1.9% |
小 | 2 | 1.2% |
寻 | 2 | 1.2% |
个 | 2 | 1.2% |
天 | 2 | 1.2% |
神 | 2 | 1.2% |
Other values (123) | 131 |
Latin
Value | Count | Frequency (%) |
e | 895 | 8.8% |
o | 720 | 7.1% |
a | 623 | 6.2% |
i | 602 | 5.9% |
n | 579 | 5.7% |
t | 532 | 5.3% |
s | 516 | 5.1% |
r | 492 | 4.9% |
h | 361 | 3.6% |
l | 318 | 3.1% |
Other values (81) | 4492 |
Common
Value | Count | Frequency (%) |
(space) | 45114 | 77.2% |
: | 2749 | 4.7% |
( | 1394 | 2.4% |
) | 1394 | 2.4% |
. | 1188 | 2.0% |
, | 965 | 1.7% |
1 | 832 | 1.4% |
2 | 682 | 1.2% |
0 | 630 | 1.1% |
! | 532 | 0.9% |
Other values (42) | 2969 | 5.1% |
Thai
Value | Count | Frequency (%) |
ร | 27 | 6.4% |
น | 27 | 6.4% |
้ | 22 | 5.2% |
อ | 22 | 5.2% |
า | 19 | 4.5% |
ก | 18 | 4.2% |
ม | 17 | 4.0% |
ี | 17 | 4.0% |
ง | 16 | 3.8% |
ล | 16 | 3.8% |
Other values (38) | 224 |
Hiragana
Value | Count | Frequency (%) |
の | 6 | 9.1% |
が | 5 | 7.6% |
と | 4 | 6.1% |
す | 4 | 6.1% |
う | 4 | 6.1% |
か | 3 | 4.5% |
あ | 3 | 4.5% |
た | 3 | 4.5% |
い | 3 | 4.5% |
ま | 3 | 4.5% |
Other values (20) | 28 |
Katakana
Value | Count | Frequency (%) |
ン | 3 | |
イ | 2 | 9.5% |
グ | 2 | 9.5% |
ス | 1 | 4.8% |
セ | 1 | 4.8% |
ッ | 1 | 4.8% |
エ | 1 | 4.8% |
ミ | 1 | 4.8% |
ラ | 1 | 4.8% |
ロ | 1 | 4.8% |
Other values (7) | 7 |
Inherited
Value | Count | Frequency (%) |
́ | 2 | |
̂ | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 115130 | 62.4% |
ASCII | 68258 | 37.0% |
Thai | 425 | 0.2% |
None | 255 | 0.1% |
CJK | 160 | 0.1% |
Hiragana | 66 | < 0.1% |
Latin Ext Additional | 43 | < 0.1% |
Katakana | 21 | < 0.1% |
Compat Jamo | 11 | < 0.1% |
Misc Symbols | 8 | < 0.1% |
Other values (4) | 19 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 45114 | 66.1% |
: | 2749 | 4.0% |
( | 1394 | 2.0% |
) | 1394 | 2.0% |
. | 1188 | 1.7% |
, | 965 | 1.4% |
e | 895 | 1.3% |
1 | 832 | 1.2% |
o | 720 | 1.1% |
2 | 682 | 1.0% |
Other values (76) | 12325 | 18.1% |
Hangul
Value | Count | Frequency (%) |
의 | 3393 | 2.9% |
이 | 3290 | 2.9% |
는 | 2816 | 2.4% |
기 | 1857 | 1.6% |
가 | 1689 | 1.5% |
한 | 1648 | 1.4% |
지 | 1569 | 1.4% |
리 | 1551 | 1.3% |
다 | 1519 | 1.3% |
사 | 1499 | 1.3% |
Other values (1266) | 94299 |
None
Value | Count | Frequency (%) |
· | 174 | |
『 | 7 | 2.7% |
』 | 7 | 2.7% |
× | 7 | 2.7% |
à | 6 | 2.4% |
á | 5 | 2.0% |
đ | 5 | 2.0% |
「 | 4 | 1.6% |
」 | 4 | 1.6% |
ô | 4 | 1.6% |
Other values (16) | 32 | 12.5% |
Thai
Value | Count | Frequency (%) |
ร | 27 | 6.4% |
น | 27 | 6.4% |
้ | 22 | 5.2% |
อ | 22 | 5.2% |
า | 19 | 4.5% |
ก | 18 | 4.2% |
ม | 17 | 4.0% |
ี | 17 | 4.0% |
ง | 16 | 3.8% |
ล | 16 | 3.8% |
Other values (38) | 224 |
Misc Symbols
Value | Count | Frequency (%) |
★ | 8 |
Hiragana
Value | Count | Frequency (%) |
の | 6 | 9.1% |
が | 5 | 7.6% |
と | 4 | 6.1% |
す | 4 | 6.1% |
う | 4 | 6.1% |
か | 3 | 4.5% |
あ | 3 | 4.5% |
た | 3 | 4.5% |
い | 3 | 4.5% |
ま | 3 | 4.5% |
Other values (20) | 28 |
CJK
Value | Count | Frequency (%) |
的 | 6 | 3.8% |
学 | 4 | 2.5% |
子 | 3 | 1.9% |
詩 | 3 | 1.9% |
家 | 3 | 1.9% |
小 | 2 | 1.2% |
寻 | 2 | 1.2% |
个 | 2 | 1.2% |
天 | 2 | 1.2% |
神 | 2 | 1.2% |
Other values (123) | 131 |
Latin Ext Additional
Value | Count | Frequency (%) |
ậ | 4 | 9.3% |
ủ | 4 | 9.3% |
ế | 4 | 9.3% |
ờ | 3 | 7.0% |
ớ | 3 | 7.0% |
ố | 3 | 7.0% |
ầ | 2 | 4.7% |
ể | 2 | 4.7% |
ệ | 2 | 4.7% |
ợ | 2 | 4.7% |
Other values (11) | 14 |
Punctuation
Value | Count | Frequency (%) |
… | 3 | |
’ | 1 | 20.0% |
‘ | 1 | 20.0% |
Katakana
Value | Count | Frequency (%) |
ン | 3 | |
イ | 2 | 9.5% |
グ | 2 | 9.5% |
ス | 1 | 4.8% |
セ | 1 | 4.8% |
ッ | 1 | 4.8% |
エ | 1 | 4.8% |
ミ | 1 | 4.8% |
ラ | 1 | 4.8% |
ロ | 1 | 4.8% |
Other values (7) | 7 |
Enclosed Alphanum
Value | Count | Frequency (%) |
② | 3 | |
① | 3 | |
ⓔ | 1 | 12.5% |
ⓛ | 1 | 12.5% |
Compat Jamo
Value | Count | Frequency (%) |
ㅈ | 2 | |
ㄱ | 1 | |
ㄴ | 1 | |
ㄷ | 1 | |
ㄹ | 1 | |
ㅣ | 1 | |
ㆍ | 1 | |
ㅡ | 1 | |
ㅎ | 1 | |
ㅅ | 1 |
Diacriticals
Value | Count | Frequency (%) |
́ | 2 | |
̂ | 2 |
Number Forms
Value | Count | Frequency (%) |
Ⅱ | 1 | |
Ⅰ | 1 |
저자
Text
Distinct | 5974 |
---|---|
Distinct (%) | 75.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 61.6 KiB |
Length
Max length | 69 |
---|---|
Median length | 50 |
Mean length | 7.6534163 |
Min length | 2 |
Characters and Unicode
Total characters | 60263 |
---|---|
Distinct characters | 1030 |
Distinct categories | 14 ? |
Distinct scripts | 7 ? |
Distinct blocks | 11 ? |
Unique
Unique | 4987 ? |
---|---|
Unique (%) | 63.3% |
Sample
1st row | 황선우 지음 |
---|---|
2nd row | 데이비드 에즈라 스테인 지음 |
3rd row | Lolita SECHAN |
4th row | 우상구 지음 |
5th row | 김진 지음 |
Value | Count | Frequency (%) |
지음 | 6012 | 32.5% |
글 | 515 | 2.8% |
외 | 185 | 1.0% |
원작 | 144 | 0.8% |
글·그림 | 121 | 0.7% |
by | 73 | 0.4% |
편집 | 59 | 0.3% |
한자교연 | 59 | 0.3% |
편집부 | 51 | 0.3% |
엮음 | 45 | 0.2% |
Other values (6941) | 11257 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 10647 | 17.7% |
지 | 6392 | 10.6% |
음 | 6076 | 10.1% |
이 | 1435 | 2.4% |
김 | 883 | 1.5% |
스 | 841 | 1.4% |
리 | 747 | 1.2% |
글 | 679 | 1.1% |
정 | 535 | 0.9% |
영 | 433 | 0.7% |
Other values (1020) | 31595 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 45594 | 75.7% |
Space Separator | 10647 | 17.7% |
Lowercase Letter | 2443 | 4.1% |
Uppercase Letter | 654 | 1.1% |
Other Punctuation | 364 | 0.6% |
Open Punctuation | 204 | 0.3% |
Close Punctuation | 203 | 0.3% |
Nonspacing Mark | 71 | 0.1% |
Dash Punctuation | 34 | 0.1% |
Decimal Number | 26 | < 0.1% |
Other values (4) | 23 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
지 | 6392 | 14.0% |
음 | 6076 | 13.3% |
이 | 1435 | 3.1% |
김 | 883 | 1.9% |
스 | 841 | 1.8% |
리 | 747 | 1.6% |
글 | 679 | 1.5% |
정 | 535 | 1.2% |
영 | 433 | 0.9% |
미 | 410 | 0.9% |
Other values (917) | 27163 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 272 | |
a | 243 | 9.9% |
n | 209 | 8.6% |
i | 194 | 7.9% |
r | 179 | 7.3% |
t | 171 | 7.0% |
o | 160 | 6.5% |
l | 153 | 6.3% |
y | 118 | 4.8% |
s | 104 | 4.3% |
Other values (28) | 640 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 70 | 10.7% |
B | 63 | 9.6% |
M | 62 | 9.5% |
D | 43 | 6.6% |
T | 41 | 6.3% |
H | 37 | 5.7% |
A | 35 | 5.4% |
K | 31 | 4.7% |
J | 30 | 4.6% |
C | 28 | 4.3% |
Other values (16) | 214 |
Nonspacing Mark
Value | Count | Frequency (%) |
ี | 19 | |
์ | 12 | |
ิ | 9 | |
่ | 8 | |
ื | 7 | 9.9% |
ู | 5 | 7.0% |
ั | 4 | 5.6% |
้ | 4 | 5.6% |
ุ | 2 | 2.8% |
ึ | 1 | 1.4% |
Decimal Number
Value | Count | Frequency (%) |
0 | 6 | |
3 | 5 | |
1 | 5 | |
2 | 5 | |
7 | 1 | 3.8% |
6 | 1 | 3.8% |
4 | 1 | 3.8% |
5 | 1 | 3.8% |
8 | 1 | 3.8% |
Other Punctuation
Value | Count | Frequency (%) |
· | 146 | |
. | 138 | |
: | 34 | 9.3% |
; | 31 | 8.5% |
, | 12 | 3.3% |
/ | 2 | 0.5% |
& | 1 | 0.3% |
Math Symbol
Value | Count | Frequency (%) |
< | 9 | |
> | 9 | |
+ | 1 | 5.0% |
× | 1 | 5.0% |
Open Punctuation
Value | Count | Frequency (%) |
[ | 196 | |
( | 8 | 3.9% |
Close Punctuation
Value | Count | Frequency (%) |
] | 195 | |
) | 8 | 3.9% |
Space Separator
Value | Count | Frequency (%) |
(space) | 10647 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 34 |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 1 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 1 |
Other Symbol
Value | Count | Frequency (%) |
ⓔ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 45279 | 75.1% |
Common | 11501 | 19.1% |
Latin | 3097 | 5.1% |
Thai | 256 | 0.4% |
Han | 100 | 0.2% |
Hiragana | 23 | < 0.1% |
Katakana | 7 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
지 | 6392 | 14.1% |
음 | 6076 | 13.4% |
이 | 1435 | 3.2% |
김 | 883 | 2.0% |
스 | 841 | 1.9% |
리 | 747 | 1.6% |
글 | 679 | 1.5% |
정 | 535 | 1.2% |
영 | 433 | 1.0% |
미 | 410 | 0.9% |
Other values (789) | 26848 |
Han
Value | Count | Frequency (%) |
著 | 13 | 13.0% |
编 | 3 | 3.0% |
斯 | 3 | 3.0% |
文 | 3 | 3.0% |
日 | 2 | 2.0% |
子 | 2 | 2.0% |
美 | 2 | 2.0% |
拉 | 2 | 2.0% |
語 | 2 | 2.0% |
尼 | 2 | 2.0% |
Other values (63) | 66 |
Latin
Value | Count | Frequency (%) |
e | 272 | 8.8% |
a | 243 | 7.8% |
n | 209 | 6.7% |
i | 194 | 6.3% |
r | 179 | 5.8% |
t | 171 | 5.5% |
o | 160 | 5.2% |
l | 153 | 4.9% |
y | 118 | 3.8% |
s | 104 | 3.4% |
Other values (54) | 1294 |
Thai
Value | Count | Frequency (%) |
ร | 29 | 11.3% |
เ | 24 | 9.4% |
อ | 22 | 8.6% |
ี | 19 | 7.4% |
ง | 15 | 5.9% |
์ | 12 | 4.7% |
ย | 10 | 3.9% |
ิ | 9 | 3.5% |
่ | 8 | 3.1% |
น | 8 | 3.1% |
Other values (33) | 100 |
Common
Value | Count | Frequency (%) |
(space) | 10647 | 92.6% |
[ | 196 | 1.7% |
] | 195 | 1.7% |
· | 146 | 1.3% |
. | 138 | 1.2% |
: | 34 | 0.3% |
- | 34 | 0.3% |
; | 31 | 0.3% |
, | 12 | 0.1% |
< | 9 | 0.1% |
Other values (19) | 59 | 0.5% |
Hiragana
Value | Count | Frequency (%) |
さ | 3 | |
か | 2 | 8.7% |
と | 2 | 8.7% |
し | 2 | 8.7% |
く | 2 | 8.7% |
り | 2 | 8.7% |
え | 2 | 8.7% |
い | 1 | 4.3% |
ば | 1 | 4.3% |
つ | 1 | 4.3% |
Other values (5) | 5 |
Katakana
Value | Count | Frequency (%) |
ヤ | 1 | |
フ | 1 | |
ミ | 1 | |
ユ | 1 | |
ウ | 1 | |
セ | 1 | |
イ | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 45271 | 75.1% |
ASCII | 14424 | 23.9% |
Thai | 256 | 0.4% |
None | 159 | 0.3% |
CJK | 100 | 0.2% |
Hiragana | 23 | < 0.1% |
Latin Ext Additional | 12 | < 0.1% |
Compat Jamo | 8 | < 0.1% |
Katakana | 7 | < 0.1% |
Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 10647 | 73.8% |
e | 272 | 1.9% |
a | 243 | 1.7% |
n | 209 | 1.4% |
[ | 196 | 1.4% |
] | 195 | 1.4% |
i | 194 | 1.3% |
r | 179 | 1.2% |
t | 171 | 1.2% |
o | 160 | 1.1% |
Other values (65) | 1958 | 13.6% |
Hangul
Value | Count | Frequency (%) |
지 | 6392 | 14.1% |
음 | 6076 | 13.4% |
이 | 1435 | 3.2% |
김 | 883 | 2.0% |
스 | 841 | 1.9% |
리 | 747 | 1.7% |
글 | 679 | 1.5% |
정 | 535 | 1.2% |
영 | 433 | 1.0% |
미 | 410 | 0.9% |
Other values (787) | 26840 |
None
Value | Count | Frequency (%) |
· | 146 | |
é | 3 | 1.9% |
ơ | 2 | 1.3% |
ư | 2 | 1.3% |
à | 2 | 1.3% |
ñ | 1 | 0.6% |
ä | 1 | 0.6% |
á | 1 | 0.6% |
× | 1 | 0.6% |
Thai
Value | Count | Frequency (%) |
ร | 29 | 11.3% |
เ | 24 | 9.4% |
อ | 22 | 8.6% |
ี | 19 | 7.4% |
ง | 15 | 5.9% |
์ | 12 | 4.7% |
ย | 10 | 3.9% |
ิ | 9 | 3.5% |
่ | 8 | 3.1% |
น | 8 | 3.1% |
Other values (33) | 100 |
CJK
Value | Count | Frequency (%) |
著 | 13 | 13.0% |
编 | 3 | 3.0% |
斯 | 3 | 3.0% |
文 | 3 | 3.0% |
日 | 2 | 2.0% |
子 | 2 | 2.0% |
美 | 2 | 2.0% |
拉 | 2 | 2.0% |
語 | 2 | 2.0% |
尼 | 2 | 2.0% |
Other values (63) | 66 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 7 | |
ㄷ | 1 | 12.5% |
Hiragana
Value | Count | Frequency (%) |
さ | 3 | |
か | 2 | 8.7% |
と | 2 | 8.7% |
し | 2 | 8.7% |
く | 2 | 8.7% |
り | 2 | 8.7% |
え | 2 | 8.7% |
い | 1 | 4.3% |
ば | 1 | 4.3% |
つ | 1 | 4.3% |
Other values (5) | 5 |
Latin Ext Additional
Value | Count | Frequency (%) |
ờ | 3 | |
ạ | 3 | |
ầ | 2 | |
ế | 2 | |
ẹ | 1 | 8.3% |
ả | 1 | 8.3% |
Katakana
Value | Count | Frequency (%) |
ヤ | 1 | |
フ | 1 | |
ミ | 1 | |
ユ | 1 | |
ウ | 1 | |
セ | 1 | |
イ | 1 |
Punctuation
Value | Count | Frequency (%) |
‘ | 1 | |
’ | 1 |
Enclosed Alphanum
Value | Count | Frequency (%) |
ⓔ | 1 |
발행자
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 2 |
---|---|
Missing (%) | < 0.1% |
Memory size | 61.6 KiB |
발행년
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 61.6 KiB |
책수
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 61.6 KiB |
Value | Count |
---|---|
1 | 5544 |
2 | 2228 |
3 | 83 |
4 | 15 |
5 | 4 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 3 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 5544 | 70.4% |
2 | 2228 | 28.3% |
3 | 83 | 1.1% |
4 | 15 | 0.2% |
5 | 4 | 0.1% |
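The 58.5% imbalance reported for 책수 is consistent with a score of one minus the normalized Shannon entropy of the class counts: 0 for perfectly balanced classes, approaching 1 when one class dominates. The sketch below assumes that definition and reproduces the figure from the counts above; `imbalance_score` is an illustrative name.

```python
import math


def imbalance_score(counts):
    """1 - H / log(k), where H is the Shannon entropy of the class shares."""
    if len(counts) < 2:
        return 0.0  # a single class has no balance to measure
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log(p) for p in shares)
    return 1 - entropy / math.log(len(counts))


counts = [5544, 2228, 83, 15, 4]  # 책수 values 1 to 5
score = imbalance_score(counts)
print(f"{score:.1%}")
```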
단가
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 61.6 KiB |
금액
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 221 |
---|---|
Distinct (%) | 2.8% |
Missing | 1 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20394.752 |
Minimum | 0 |
---|---|
Maximum | 220014 |
Zeros | 1 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 69.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 11000 |
Q1 | 14800 |
median | 18000 |
Q3 | 26000 |
95-th percentile | 34000 |
Maximum | 220014 |
Range | 220014 |
Interquartile range (IQR) | 11200 |
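The quartiles give a quick way to judge how extreme the largest amounts are. The sketch uses Tukey's 1.5 × IQR fences, a rule of thumb that the report itself does not apply, and compares them with the ten largest 금액 values listed in the report.

```python
# Quartiles copied from the quantile statistics for 금액.
q1 = 14800
q3 = 26000
iqr = q3 - q1

# Tukey's rule of thumb (an assumption here, not part of the report).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The ten largest 금액 values listed in the report.
largest = [220014, 125000, 112000, 98000, 96000, 92000, 81000, 72000, 70000, 67500]
flagged = [v for v in largest if v > upper_fence]

print(iqr, lower_fence, upper_fence, len(flagged))
```

All ten of the largest amounts sit above the upper fence, while the minimum of 0 is still inside the lower one.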
Descriptive statistics
Standard deviation | 8495.5874 |
---|---|
Coefficient of variation (CV) | 0.41655752 |
Kurtosis | 46.943726 |
Mean | 20394.752 |
Median Absolute Deviation (MAD) | 5000 |
Skewness | 3.2474052 |
Sum | 1.6056788 × 10⁸ |
Variance | 72175005 |
Monotonicity | Not monotonic |
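Several of these statistics are tied together and can be checked against each other: the coefficient of variation is the standard deviation divided by the mean, the variance is the standard deviation squared, and the sum is the mean times the number of non-missing rows. The sketch assumes the single missing 금액 value is excluded from all of them.

```python
# Figures copied from the 금액 statistics.
mean = 20394.752
std = 8495.5874
n_valid = 7874 - 1  # assumption: the one missing value is excluded

cv = std / mean
variance = std ** 2
total = mean * n_valid

print(round(cv, 8), round(variance), f"{total:.7e}")
```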
Value | Count | Frequency (%) |
15000 | 536 | 6.8% |
16000 | 471 | 6.0% |
12000 | 434 | 5.5% |
18000 | 428 | 5.4% |
24000 | 420 | 5.3% |
26000 | 411 | 5.2% |
13000 | 385 | 4.9% |
28000 | 355 | 4.5% |
30000 | 331 | 4.2% |
17000 | 313 | 4.0% |
Other values (211) | 3789 | 48.1% |
Value | Count | Frequency (%) |
0 | 1 | < 0.1% |
2400 | 1 | < 0.1% |
3000 | 4 | 0.1% |
5000 | 1 | < 0.1% |
5500 | 1 | < 0.1% |
6000 | 3 | < 0.1% |
6500 | 14 | 0.2% |
6600 | 59 | 0.7% |
7000 | 2 | < 0.1% |
7500 | 1 | < 0.1% |
Value | Count | Frequency (%) |
220014 | 1 | < 0.1% |
125000 | 1 | < 0.1% |
112000 | 1 | < 0.1% |
98000 | 1 | < 0.1% |
96000 | 1 | < 0.1% |
92000 | 1 | < 0.1% |
81000 | 1 | < 0.1% |
72000 | 1 | < 0.1% |
70000 | 1 | < 0.1% |
67500 | 1 | < 0.1% |
Variable | 번호 | 책수 | 금액 |
---|---|---|---|
번호 | 1.000 | 0.196 | 0.100 |
책수 | 0.196 | 1.000 | 0.666 |
금액 | 0.100 | 0.666 | 1.000 |
Variable | 번호 | 금액 | 책수 |
---|---|---|---|
번호 | 1.000 | 0.036 | 0.082 |
금액 | 0.036 | 1.000 | 0.510 |
책수 | 0.082 | 0.510 | 1.000 |
Index | 번호 | 서명 | 저자 | 발행자 | 발행년 | 책수 | 단가 | 금액 |
---|---|---|---|---|---|---|---|---|
0 | 1 | 사랑한다고 말할 용기 | 황선우 지음 | 책읽는수요일 | 2021 | 3 | 14800 | 44400 |
1 | 2 | 아빠, 쿠키 주세요 | 데이비드 에즈라 스테인 지음 | 시공주니어 | 2021 | 1 | 13000 | 13000 |
2 | 3 | 숨바꼭질! | Lolita SECHAN | 바둑이하우스 | 2021 | 1 | 15000 | 15000 |
3 | 4 | 소년과 살쾡이 | 우상구 지음 | 청어람주니어 | 2015 | 1 | 9800 | 9800 |
4 | 5 | 세종대왕을 찾아라 | 김진 지음 | 천개의바람 | 2021 | 1 | 13000 | 13000 |
5 | 6 | 자동 물시계 자격루 | 김명희 지음 | 푸른숲주니어 | 2021 | 1 | 14800 | 14800 |
6 | 7 | (주식 단기투자 필독서) 주식의 道 | 생존재테크 지음 | 트러스트북스 | 2021 | 1 | 18800 | 18800 |
7 | 8 | 몸의 기분 | 마숑 지음 | fifo | 2021 | 1 | 13800 | 13800 |
8 | 9 | 내 인생도 편집이 되나요? | 이지은 지음 | 달 | 2021 | 1 | 15000 | 15000 |
9 | 10 | 들어 봐, 우릴 위해 만든 노래야 | 이환희 | 후마니타스 | 2021 | 1 | 18000 | 18000 |
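In these rows 금액 (amount) looks like 책수 (copies) multiplied by 단가 (unit price), which would also explain the high-correlation alert between 금액 and 책수. That relationship is inferred from the sample, not stated by the source. A small check over the ten rows above:

```python
# (책수, 단가, 금액) copied from the ten rows shown above.
rows = [
    (3, 14800, 44400),
    (1, 13000, 13000),
    (1, 15000, 15000),
    (1, 9800, 9800),
    (1, 13000, 13000),
    (1, 14800, 14800),
    (1, 18800, 18800),
    (1, 13800, 13800),
    (1, 15000, 15000),
    (1, 18000, 18000),
]

# Assumed rule: amount = copies * unit price.
mismatches = [
    (copies, price, amount)
    for copies, price, amount in rows
    if copies * price != amount
]

print(f"{len(rows) - len(mismatches)} of {len(rows)} rows consistent")
```

Applied to the full dataset, the same check would surface rows such as the zero amount or the 220014 maximum for manual review.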
Index | 번호 | 서명 | 저자 | 발행자 | 발행년 | 책수 | 단가 | 금액 |
---|---|---|---|---|---|---|---|---|
7864 | 7865 | Oliver twist | retold from the story by Charles Dickens | Kiddo | 2021 | 1 | 33400 | 33400 |
7865 | 7866 | (The)wizard of Oz | retold from the story by L. Frank Baum | Kiddo | 2021 | 1 | 33400 | 33400 |
7866 | 7867 | Black beauty | retold from the story by Anna Seweell | Kiddo | 2021 | 1 | 33400 | 33400 |
7867 | 7868 | Anne of green gables | retold from the story by L. M. Montgomery | Kiddo | 2021 | 1 | 33400 | 33400 |
7868 | 7869 | My home | Mind Books | Mind Books | 2021 | 1 | 12600 | 12600 |
7869 | 7870 | Body | Mind Books | Mind Books | 2021 | 1 | 12600 | 12600 |
7870 | 7871 | Puwede po ba tayong magbasa ng aklat | kuwento ni Lawrence Schimel | Kahel Press | 2021 | 1 | 15900 | 15900 |
7871 | 7872 | Kindness | Agnes de Bezenac | Lampara Books | 2021 | 1 | 15900 | 15900 |
7872 | 7873 | Respect | Agnes de Bezenac | Lampara Books | 2021 | 1 | 15900 | 15900 |
7873 | 7874 | Pitong tsinelas | kuwento ni Divine Gil Reyes | Tahanan Books | 2021 | 1 | 15900 | 15900 |