Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.3 KiB |
Average record size in memory | 54.3 B |
Variable types
Categorical | 4 |
---|---|
Text | 1 |
Numeric | 1 |
Dataset
Description | Sample |
---|---|
Author | 국립중앙도서관 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=f183c145-fb5a-49da-a72e-35e16e3de833 |
Reproduction
Analysis started | 2023-12-10 09:48:15.462957 |
---|---|
Analysis finished | 2023-12-10 09:48:16.338418 |
Duration | 0.88 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
isbn13
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
9791191739015 | |
---|---|
9791138007238 | |
9791165881320 | 3 |
Length
Max length | 13 |
---|---|
Median length | 13 |
Mean length | 13 |
Min length | 13 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 9791138007238 |
---|---|
2nd row | 9791165881320 |
3rd row | 9791138007238 |
4th row | 9791138007238 |
5th row | 9791138007238 |
Common Values
Value | Count | Frequency (%) |
9791191739015 | 49 | |
9791138007238 | 48 | |
9791165881320 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
9791191739015 | 49 | |
9791138007238 | 48 | |
9791165881320 | 3 | 3.0% |
term
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
셜록 | 2 | 1.8% |
집중 | 1 | 0.9% |
장아찌 | 1 | 0.9% |
맛내기 | 1 | 0.9% |
가지 | 1 | 0.9% |
채소 | 1 | 0.9% |
한식대첩 | 1 | 0.9% |
초보 | 1 | 0.9% |
기본 | 1 | 0.9% |
배추김치 | 1 | 0.9% |
Other values (98) | 98 |
Most occurring characters
Value | Count | Frequency (%) |
10 | 3.3% | |
치 | 8 | 2.6% |
스 | 8 | 2.6% |
기 | 8 | 2.6% |
김 | 6 | 2.0% |
정 | 5 | 1.7% |
한 | 5 | 1.7% |
지 | 5 | 1.7% |
명 | 5 | 1.7% |
사 | 4 | 1.3% |
Other values (172) | 239 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 272 | |
Uppercase Letter | 15 | 5.0% |
Space Separator | 10 | 3.3% |
Lowercase Letter | 6 | 2.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
치 | 8 | 2.9% |
스 | 8 | 2.9% |
기 | 8 | 2.9% |
김 | 6 | 2.2% |
정 | 5 | 1.8% |
한 | 5 | 1.8% |
지 | 5 | 1.8% |
명 | 5 | 1.8% |
사 | 4 | 1.5% |
양 | 4 | 1.5% |
Other values (153) | 214 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 2 | |
E | 2 | |
N | 2 | |
A | 1 | |
J | 1 | |
K | 1 | |
C | 1 | |
O | 1 | |
L | 1 | |
R | 1 | |
Other values (2) | 2 |
Lowercase Letter
Value | Count | Frequency (%) |
v | 1 | |
i | 1 | |
d | 1 | |
n | 1 | |
y | 1 | |
a | 1 |
Space Separator
Value | Count | Frequency (%) |
10 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 272 | |
Latin | 21 | 6.9% |
Common | 10 | 3.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
치 | 8 | 2.9% |
스 | 8 | 2.9% |
기 | 8 | 2.9% |
김 | 6 | 2.2% |
정 | 5 | 1.8% |
한 | 5 | 1.8% |
지 | 5 | 1.8% |
명 | 5 | 1.8% |
사 | 4 | 1.5% |
양 | 4 | 1.5% |
Other values (153) | 214 |
Latin
Value | Count | Frequency (%) |
P | 2 | 9.5% |
E | 2 | 9.5% |
N | 2 | 9.5% |
v | 1 | 4.8% |
i | 1 | 4.8% |
d | 1 | 4.8% |
A | 1 | 4.8% |
n | 1 | 4.8% |
y | 1 | 4.8% |
a | 1 | 4.8% |
Other values (8) | 8 |
Common
Value | Count | Frequency (%) |
10 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 272 | |
ASCII | 31 | 10.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
10 | ||
P | 2 | 6.5% |
E | 2 | 6.5% |
N | 2 | 6.5% |
v | 1 | 3.2% |
i | 1 | 3.2% |
d | 1 | 3.2% |
A | 1 | 3.2% |
n | 1 | 3.2% |
y | 1 | 3.2% |
Other values (9) | 9 |
Hangul
Value | Count | Frequency (%) |
치 | 8 | 2.9% |
스 | 8 | 2.9% |
기 | 8 | 2.9% |
김 | 6 | 2.2% |
정 | 5 | 1.8% |
한 | 5 | 1.8% |
지 | 5 | 1.8% |
명 | 5 | 1.8% |
사 | 4 | 1.5% |
양 | 4 | 1.5% |
Other values (153) | 214 |
freq
Real number (ℝ)
Distinct | 7 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.56 |
Minimum | 1 |
---|---|
Maximum | 27 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 3 |
95-th percentile | 5 |
Maximum | 27 |
Range | 26 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 2.8152543 |
---|---|
Coefficient of variation (CV) | 1.0997087 |
Kurtosis | 58.230916 |
Mean | 2.56 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 6.8922617 |
Sum | 256 |
Variance | 7.9256566 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 44 | |
1 | 26 | |
3 | 15 | 15.0% |
4 | 6 | 6.0% |
5 | 6 | 6.0% |
8 | 2 | 2.0% |
27 | 1 | 1.0% |
Value | Count | Frequency (%) |
1 | 26 | |
2 | 44 | |
3 | 15 | 15.0% |
4 | 6 | 6.0% |
5 | 6 | 6.0% |
8 | 2 | 2.0% |
27 | 1 | 1.0% |
Value | Count | Frequency (%) |
27 | 1 | 1.0% |
8 | 2 | 2.0% |
5 | 6 | 6.0% |
4 | 6 | 6.0% |
3 | 15 | 15.0% |
2 | 44 | |
1 | 26 |
boost1
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 100 |
boost2
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 100 |
boost3
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 100 |
isbn13 | term | freq | |
---|---|---|---|
isbn13 | 1.000 | 1.000 | 0.103 |
term | 1.000 | 1.000 | 1.000 |
freq | 0.103 | 1.000 | 1.000 |
freq | isbn13 | |
---|---|---|
freq | 1.000 | 0.095 |
isbn13 | 0.095 | 1.000 |
isbn13 | term | freq | boost1 | boost2 | boost3 | |
---|---|---|---|---|---|---|
0 | 9791138007238 | 셜록 | 8 | 0 | 0 | 0 |
1 | 9791165881320 | 희망 | 1 | 0 | 0 | 0 |
2 | 9791138007238 | 사건 | 4 | 0 | 0 | 0 |
3 | 9791138007238 | 블로그 | 4 | 0 | 0 | 0 |
4 | 9791138007238 | 셜록 SHERLOCK | 3 | 0 | 0 | 0 |
5 | 9791138007238 | 어벤져스 | 3 | 0 | 0 | 0 |
6 | 9791138007238 | 닥터스트레인지 | 3 | 0 | 0 | 0 |
7 | 9791165881320 | 치료 | 1 | 0 | 0 | 0 |
8 | 9791138007238 | Jay | 2 | 0 | 0 | 0 |
9 | 9791138007238 | 영상출판미디어 주 | 2 | 0 | 0 | 0 |
isbn13 | term | freq | boost1 | boost2 | boost3 | |
---|---|---|---|---|---|---|
90 | 9791191739015 | 무김치 | 2 | 0 | 0 | 0 |
91 | 9791191739015 | 고추 | 2 | 0 | 0 | 0 |
92 | 9791191739015 | 경험 | 2 | 0 | 0 | 0 |
93 | 9791191739015 | 김치냉장고 | 2 | 0 | 0 | 0 |
94 | 9791191739015 | 알토란 | 2 | 0 | 0 | 0 |
95 | 9791191739015 | 실패 | 2 | 0 | 0 | 0 |
96 | 9791191739015 | 활동 | 2 | 0 | 0 | 0 |
97 | 9791191739015 | 비결 | 2 | 0 | 0 | 0 |
98 | 9791191739015 | 한식 | 2 | 0 | 0 | 0 |
99 | 9791191739015 | 빼기 | 2 | 0 | 0 | 0 |