Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 4.4 KiB |
Average record size in memory | 45.3 B |
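The overview counts above can be reproduced with a few lines of standard-library Python. This is a minimal sketch, not the profiling tool's own implementation: `dataset_statistics` is a hypothetical helper, missing cells are assumed to be represented as `None`, and each repeat of a row after its first occurrence is assumed to count as one duplicate. The three rows are copied from the sample shown in this report.

```python
from collections import Counter

# Three rows copied from the sample in this report (the full table has 100 rows).
rows = [
    ("2021", "11", 1, "다양", 505),
    ("2021", "11", 3, "어린이", 422),
    ("2021", "11", 4, "사람", 398),
]


def dataset_statistics(rows):
    """Return overview counts for a list of equal-length row tuples."""
    n_obs = len(rows)
    n_vars = len(rows[0]) if rows else 0
    total_cells = n_obs * n_vars
    # Assumption for this sketch: a missing cell is stored as None.
    missing = sum(cell is None for row in rows for cell in row)
    # Assumption: every repeat of a row after its first occurrence is a duplicate.
    duplicates = sum(count - 1 for count in Counter(rows).values())
    return {
        "variables": n_vars,
        "observations": n_obs,
        "missing_cells": missing,
        "missing_cells_pct": 100 * missing / total_cells if total_cells else 0.0,
        "duplicate_rows": duplicates,
        "duplicate_rows_pct": 100 * duplicates / n_obs if n_obs else 0.0,
    }


stats = dataset_statistics(rows)
print(stats)
```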
Variable types
Categorical | 2 |
---|---|
Numeric | 2 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | National Library of Korea (국립중앙도서관) |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=4f88c8c0-3e8c-11eb-af9a-4b03f0a582d6 |
Alerts
anals_trget_year has constant value "2021" | Constant |
---|---|
anals_trget_mt has constant value "11" | Constant |
all_kwrd_rank_co is highly overall correlated with fq_co | High correlation |
fq_co is highly overall correlated with all_kwrd_rank_co | High correlation |
kwrd_nm has unique values | Unique |
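The "Constant" and "Unique" alerts come down to counting distinct values per column. The following is a rough sketch of that idea under stated assumptions: `column_alerts` is a hypothetical helper, the alert wording merely imitates the lines above, and the column values are the first five rows of the sample in this report.

```python
def column_alerts(name, values):
    """Return alert strings for a single column given as a list of values."""
    distinct = set(values)
    alerts = []
    # One distinct value across all rows: the column carries no information.
    if len(distinct) == 1:
        alerts.append(f'{name} has constant value "{values[0]}"')
    # As many distinct values as rows: every value occurs exactly once.
    if len(values) > 1 and len(distinct) == len(values):
        alerts.append(f"{name} has unique values")
    return alerts


# First five rows of the sample table in this report.
columns = {
    "anals_trget_year": ["2021", "2021", "2021", "2021", "2021"],
    "anals_trget_mt": ["11", "11", "11", "11", "11"],
    "kwrd_nm": ["다양", "폭발", "어린이", "사람", "독자"],
    "fq_co": [505, 25, 422, 398, 386],
}

for name, values in columns.items():
    for alert in column_alerts(name, values):
        print(alert)
```

Note that on only five rows `fq_co` also looks unique, whereas over the full 100 rows the report finds 71 distinct values. Alerts of this kind depend on how much of the data is examined.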
Reproduction
Analysis started | 2023-12-10 10:01:50.487717 |
---|---|
Analysis finished | 2023-12-10 10:01:51.834412 |
Duration | 1.35 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
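As a quick arithmetic check, the reported duration follows from the two timestamps above. The variable names in this small sketch are illustrative only.

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2023-12-10 10:01:50.487717")
finished = datetime.fromisoformat("2023-12-10 10:01:51.834412")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```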
anals_trget_year
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2021 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021 |
---|---|
2nd row | 2021 |
3rd row | 2021 |
4th row | 2021 |
5th row | 2021 |
Common Values
Value | Count | Frequency (%) |
2021 | 100 | 100.0% |
anals_trget_mt
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
11 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 11 |
---|---|
2nd row | 11 |
3rd row | 11 |
4th row | 11 |
5th row | 11 |
Common Values
Value | Count | Frequency (%) |
11 | 100 | 100.0% |
all_kwrd_rank_co
Real number (ℝ)
HIGH CORRELATION
Distinct | 71 |
---|---|
Distinct (%) | 71.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 79.22 |
Minimum | 1 |
---|---|
Maximum | 982 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 6.95 |
Q1 | 27.75 |
median | 53.5 |
Q3 | 78.25 |
95-th percentile | 98.05 |
Maximum | 982 |
Range | 981 |
Interquartile range (IQR) | 50.5 |
Descriptive statistics
Standard deviation | 162.00078 |
---|---|
Coefficient of variation (CV) | 2.044948 |
Kurtosis | 27.965859 |
Mean | 79.22 |
Median Absolute Deviation (MAD) | 25.5 |
Skewness | 5.3302158 |
Sum | 7922 |
Variance | 26244.254 |
Monotonicity | Not monotonic |
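Several of the figures above are derived from one another, so they can be cross-checked with simple arithmetic. The sketch below uses only the numbers reported for `all_kwrd_rank_co`. The variable names are illustrative, and small differences in the last digits are expected because the reported standard deviation is itself rounded.

```python
import math

# Values as reported for all_kwrd_rank_co in this report.
n = 100
total = 7922
std = 162.00078
q1, q3 = 27.75, 78.25
minimum, maximum = 1, 982

mean = total / n                 # sum divided by number of observations
variance = std ** 2              # variance is the squared standard deviation
cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"mean={mean}, variance={variance:.3f}, cv={cv:.6f}, "
      f"iqr={iqr}, range={value_range}")

# Compare against the reported figures, allowing for rounding.
assert math.isclose(variance, 26244.254, rel_tol=1e-5)
assert math.isclose(cv, 2.044948, rel_tol=1e-5)
```

The mean (79.22) lies well above the median (53.5), which is consistent with the large positive skewness: the three observations with value 982 pull the mean upward.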
Common Values
Value | Count | Frequency (%) |
75 | 3 | 3.0% |
92 | 3 | 3.0% |
86 | 3 | 3.0% |
83 | 3 | 3.0% |
69 | 3 | 3.0% |
982 | 3 | 3.0% |
73 | 2 | 2.0% |
35 | 2 | 2.0% |
62 | 2 | 2.0% |
38 | 2 | 2.0% |
Other values (61) | 74 | 74.0% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
3 | 1 | 1.0% |
4 | 1 | 1.0% |
5 | 1 | 1.0% |
6 | 1 | 1.0% |
7 | 1 | 1.0% |
9 | 1 | 1.0% |
10 | 1 | 1.0% |
11 | 1 | 1.0% |
12 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
982 | 3 | 3.0% |
99 | 2 | 2.0% |
98 | 1 | 1.0% |
97 | 1 | 1.0% |
95 | 2 | 2.0% |
92 | 3 | 3.0% |
91 | 1 | 1.0% |
89 | 2 | 2.0% |
86 | 3 | 3.0% |
83 | 3 | 3.0% |
kwrd_nm
Text
UNIQUE
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
다양 | 1 | 1.0% |
원리 | 1 | 1.0% |
읽기 | 1 | 1.0% |
가족 | 1 | 1.0% |
이용 | 1 | 1.0% |
어른 | 1 | 1.0% |
초등학교 | 1 | 1.0% |
시대 | 1 | 1.0% |
사실 | 1 | 1.0% |
마지막 | 1 | 1.0% |
Other values (90) | 90 | 90.0% |
Most occurring characters
Value | Count | Frequency (%) |
이 | 9 | 4.0% |
상 | 6 | 2.7% |
학 | 5 | 2.2% |
사 | 5 | 2.2% |
리 | 5 | 2.2% |
생 | 4 | 1.8% |
기 | 4 | 1.8% |
시 | 4 | 1.8% |
고 | 3 | 1.3% |
습 | 3 | 1.3% |
Other values (129) | 176 | 78.6% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 224 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 224 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 224 | 100.0% |
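The character-level tables for `kwrd_nm` can be approximated with `collections.Counter` and the standard `unicodedata` module. This is a minimal sketch, not the profiling tool's own code. It runs on only the ten keywords listed in the table at the top of the `kwrd_nm` section, so its counts differ from the 224-character totals reported for the full column. The Unicode general category `Lo` corresponds to the "Other Letter" label used in the report.

```python
import unicodedata
from collections import Counter

# The ten keywords listed in the kwrd_nm value table of this report.
words = ["다양", "원리", "읽기", "가족", "이용", "어른", "초등학교", "시대", "사실", "마지막"]

# Frequency of each individual character across all keywords.
char_counts = Counter(ch for word in words for ch in word)
total_chars = sum(char_counts.values())

# Frequency of each Unicode general category ("Lo" means Letter, other).
category_counts = Counter(
    unicodedata.category(ch) for word in words for ch in word
)

for ch, count in char_counts.most_common(3):
    print(ch, count, f"{100 * count / total_chars:.1f}%")
print(dict(category_counts))
```

Because every character here is a precomposed Hangul syllable, the category, script, and block breakdowns all collapse to a single row, as in the report.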
fq_co
Real number (ℝ)
HIGH CORRELATION
Distinct | 71 |
---|---|
Distinct (%) | 71.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 192.67 |
Minimum | 25 |
---|---|
Maximum | 505 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 25 |
---|---|
5-th percentile | 128.95 |
Q1 | 139.5 |
median | 172 |
Q3 | 222.5 |
95-th percentile | 334.1 |
Maximum | 505 |
Range | 480 |
Interquartile range (IQR) | 83 |
Descriptive statistics
Standard deviation | 78.213261 |
---|---|
Coefficient of variation (CV) | 0.40594416 |
Kurtosis | 2.8897134 |
Mean | 192.67 |
Median Absolute Deviation (MAD) | 36 |
Skewness | 1.2586475 |
Sum | 19267 |
Variance | 6117.3142 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
144 | 3 | 3.0% |
132 | 3 | 3.0% |
135 | 3 | 3.0% |
136 | 3 | 3.0% |
151 | 3 | 3.0% |
25 | 3 | 3.0% |
145 | 2 | 2.0% |
208 | 2 | 2.0% |
158 | 2 | 2.0% |
204 | 2 | 2.0% |
Other values (61) | 74 | 74.0% |
Minimum 10 values
Value | Count | Frequency (%) |
25 | 3 | 3.0% |
128 | 2 | 2.0% |
129 | 1 | 1.0% |
130 | 1 | 1.0% |
131 | 2 | 2.0% |
132 | 3 | 3.0% |
133 | 1 | 1.0% |
134 | 2 | 2.0% |
135 | 3 | 3.0% |
136 | 3 | 3.0% |
Maximum 10 values
Value | Count | Frequency (%) |
505 | 1 | 1.0% |
422 | 1 | 1.0% |
398 | 1 | 1.0% |
386 | 1 | 1.0% |
374 | 1 | 1.0% |
332 | 1 | 1.0% |
328 | 1 | 1.0% |
318 | 1 | 1.0% |
303 | 1 | 1.0% |
302 | 1 | 1.0% |
Correlations
Variable | all_kwrd_rank_co | kwrd_nm | fq_co |
---|---|---|---|
all_kwrd_rank_co | 1.000 | 1.000 | 1.000 |
kwrd_nm | 1.000 | 1.000 | 1.000 |
fq_co | 1.000 | 1.000 | 1.000 |
Variable | all_kwrd_rank_co | fq_co |
---|---|---|
all_kwrd_rank_co | 1.000 | -1.000 |
fq_co | -1.000 | 1.000 |
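A coefficient of exactly -1.000 between `all_kwrd_rank_co` and `fq_co` is what a rank-based measure gives when one variable strictly decreases as the other increases. The report does not state which coefficient this table uses, so the sketch below assumes a Spearman rank correlation (Pearson correlation computed on average ranks). The helper functions are written from scratch for illustration, and the data are the first ten rows of the sample in this report.

```python
import math


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# First ten rows of the sample: keyword rank and keyword frequency.
rank = [1, 982, 3, 4, 5, 6, 7, 982, 9, 10]
freq = [505, 25, 422, 398, 386, 374, 332, 25, 328, 318]

print(f"spearman = {spearman(rank, freq):.3f}")
print(f"pearson  = {pearson(rank, freq):.3f}")
```

On these rows the rank-based coefficient is -1 up to floating-point error, because a higher frequency always goes with a smaller rank number and ties line up on both sides. The plain Pearson coefficient on the raw values is weaker, since the relationship is monotonic but not linear.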
First rows
Row index | anals_trget_year | anals_trget_mt | all_kwrd_rank_co | kwrd_nm | fq_co |
---|---|---|---|---|---|
0 | 2021 | 11 | 1 | 다양 | 505 |
1 | 2021 | 11 | 982 | 폭발 | 25 |
2 | 2021 | 11 | 3 | 어린이 | 422 |
3 | 2021 | 11 | 4 | 사람 | 398 |
4 | 2021 | 11 | 5 | 독자 | 386 |
5 | 2021 | 11 | 6 | 시리즈 | 374 |
6 | 2021 | 11 | 7 | 세계 | 332 |
7 | 2021 | 11 | 982 | 기법 | 25 |
8 | 2021 | 11 | 9 | 사랑 | 328 |
9 | 2021 | 11 | 10 | 아이 | 318 |
Last rows
Row index | anals_trget_year | anals_trget_mt | all_kwrd_rank_co | kwrd_nm | fq_co |
---|---|---|---|---|---|
90 | 2021 | 11 | 91 | 게임 | 133 |
91 | 2021 | 11 | 92 | 사이 | 132 |
92 | 2021 | 11 | 92 | 순간 | 132 |
93 | 2021 | 11 | 92 | 머리 | 132 |
94 | 2021 | 11 | 95 | 인간 | 131 |
95 | 2021 | 11 | 95 | 선물 | 131 |
96 | 2021 | 11 | 97 | 생생 | 130 |
97 | 2021 | 11 | 98 | 질문 | 129 |
98 | 2021 | 11 | 99 | 학교 | 128 |
99 | 2021 | 11 | 99 | 그림책 | 128 |