Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 10.8 KiB |
Average record size in memory | 110.3 B |
Variable types
Numeric | 4 |
---|---|
Text | 3 |
Categorical | 6 |
Dataset
Description | Sample |
---|---|
Author | 데이터마케팅코리아 (Data Marketing Korea) |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=82969390-5b0b-48c6-a048-815cdd08c745 |
Alerts
CHNNEL_CL_NM has constant value "news" | Constant |
UPPER_CTGRY_NM has constant value "생활/문화" (Life/Culture) | Constant |
LWPRT_CTGRY_NM has constant value "책" (Books) | Constant |
SEQ_NO is highly overall correlated with NTCE_DT | High correlation |
DPI_VALUE is highly overall correlated with RSPN_CO and 1 other fields | High correlation |
RSPN_CO is highly overall correlated with DPI_VALUE and 2 other fields | High correlation |
COMMENT_CO is highly overall correlated with DPI_VALUE and 1 other fields | High correlation |
CHNNEL_NM is highly overall correlated with RSPN_CO | High correlation |
NTCE_DT is highly overall correlated with SEQ_NO and 1 other fields | High correlation |
RECOMEND_CO is highly overall correlated with NTCE_DT | High correlation |
RECOMEND_CO is highly imbalanced (80.6%) | Imbalance |
SEQ_NO has unique values | Unique |
CNTNTS_ID has unique values | Unique |
CNTNTS_URL has unique values | Unique |
DPI_VALUE has 32 (32.0%) zeros | Zeros |
RSPN_CO has 36 (36.0%) zeros | Zeros |
COMMENT_CO has 62 (62.0%) zeros | Zeros |
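The Constant, Unique and Zeros alerts listed above can be approximated with a few counts per column. The following is a minimal sketch, assuming each column is a plain Python list; the helper name `basic_alerts` is hypothetical, the toy values are the first five sample rows of three columns, and ydata-profiling's own rules and thresholds may differ.

```python
from collections import Counter


def basic_alerts(values):
    """Return simple alert labels for one column (hypothetical helper)."""
    counts = Counter(values)
    n = len(values)
    alerts = []
    if len(counts) == 1:
        alerts.append("Constant")
    if n > 1 and len(counts) == n:
        alerts.append("Unique")
    # Count numeric zeros only; strings such as "0" are not treated as zeros.
    zeros = sum(
        1
        for v in values
        if isinstance(v, (int, float)) and not isinstance(v, bool) and v == 0
    )
    if zeros:
        alerts.append(f"Zeros: {zeros} ({zeros / n:.1%})")
    return alerts


# First five sample rows of three columns from this report.
toy = {
    "LWPRT_CTGRY_NM": ["책"] * 5,
    "SEQ_NO": [58741, 58788, 58792, 58779, 58783],
    "COMMENT_CO": [0, 64, 0, 1, 3],
}
for name, column in toy.items():
    print(name, basic_alerts(column))
```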
Reproduction
Analysis started | 2023-12-10 09:50:15.853252 |
---|---|
Analysis finished | 2023-12-10 09:50:21.440575 |
Duration | 5.59 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
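A report of this kind can be regenerated with ydata-profiling roughly as follows. This is a sketch under assumptions: the CSV path, the output path and the report title are hypothetical, and the source does not say how the data were loaded or which settings `config.json` contains, so the output may not match this report exactly.

```python
def build_report(csv_path, html_path="report.html"):
    """Profile a CSV file and write an HTML report (paths are hypothetical)."""
    # Imported lazily so this sketch can be loaded without the dependencies installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
    return html_path
```

Calling `build_report("sample.csv")` would write `report.html` next to the script, provided both pandas and ydata-profiling are installed.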
SEQ_NO
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 59574.3 |
Minimum | 58459 |
---|---|
Maximum | 60666 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 58459 |
---|---|
5-th percentile | 58582.75 |
Q1 | 59017 |
median | 59572 |
Q3 | 60083 |
95-th percentile | 60556.6 |
Maximum | 60666 |
Range | 2207 |
Interquartile range (IQR) | 1066 |
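The 5th and 95th percentiles above are consistent with linear interpolation between neighbouring ranks. The sketch below checks this using only the ten smallest and ten largest SEQ_NO values listed for this variable; the function name `linear_percentile` is hypothetical, and the interpolation rule is an assumption that happens to reproduce the reported figures.

```python
import math

N = 100  # number of observations reported for SEQ_NO

# Ten smallest (ascending) and ten largest (descending) SEQ_NO values
# as listed in this report.
smallest = [58459, 58462, 58497, 58537, 58559, 58584, 58589, 58596, 58621, 58630]
largest = [60666, 60647, 60608, 60601, 60587, 60555, 60554, 60512, 60466, 60461]

# Map zero-based ascending rank -> value for the ranks we know.
known = {rank: value for rank, value in enumerate(smallest)}
known.update({N - 1 - i: value for i, value in enumerate(largest)})


def linear_percentile(q):
    """Percentile by linear interpolation between neighbouring ranks."""
    position = q * (N - 1)
    lower = math.floor(position)
    upper = min(lower + 1, N - 1)
    fraction = position - lower
    return known[lower] + fraction * (known[upper] - known[lower])


p5 = linear_percentile(0.05)
p95 = linear_percentile(0.95)
value_range = known[N - 1] - known[0]
print(round(p5, 2), round(p95, 2), value_range)
```

Only ranks close to either end are available here, so the sketch cannot recompute Q1, the median or Q3.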
Descriptive statistics
Standard deviation | 646.75117 |
---|---|
Coefficient of variation (CV) | 0.010856211 |
Kurtosis | -1.1626905 |
Mean | 59574.3 |
Median Absolute Deviation (MAD) | 559 |
Skewness | -0.049311479 |
Sum | 5957430 |
Variance | 418287.08 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
58741 | 1 | 1.0% |
59967 | 1 | 1.0% |
60207 | 1 | 1.0% |
60224 | 1 | 1.0% |
60164 | 1 | 1.0% |
59987 | 1 | 1.0% |
59924 | 1 | 1.0% |
59922 | 1 | 1.0% |
60009 | 1 | 1.0% |
60056 | 1 | 1.0% |
Other values (90) | 90 |
Minimum 10 values
Value | Count | Frequency (%) |
58459 | 1 | 1.0% |
58462 | 1 | 1.0% |
58497 | 1 | 1.0% |
58537 | 1 | 1.0% |
58559 | 1 | 1.0% |
58584 | 1 | 1.0% |
58589 | 1 | 1.0% |
58596 | 1 | 1.0% |
58621 | 1 | 1.0% |
58630 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
60666 | 1 | 1.0% |
60647 | 1 | 1.0% |
60608 | 1 | 1.0% |
60601 | 1 | 1.0% |
60587 | 1 | 1.0% |
60555 | 1 | 1.0% |
60554 | 1 | 1.0% |
60512 | 1 | 1.0% |
60466 | 1 | 1.0% |
60461 | 1 | 1.0% |
CNTNTS_ID
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 32 |
---|---|
Median length | 32 |
Mean length | 32 |
Min length | 32 |
Characters and Unicode
Total characters | 3200 |
---|---|
Distinct characters | 16 |
Distinct categories | 2 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 100 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | f26b5facf43611eba7b3002b67f7b0e1 |
---|---|
2nd row | f26b5fdbf43611eba11a002b67f7b0e1 |
3rd row | f26b8675f43611eb9e02002b67f7b0e1 |
4th row | f26b5fd2f43611ebbd4b002b67f7b0e1 |
5th row | f26b5fd6f43611ebb253002b67f7b0e1 |
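Each sample ID is 32 hexadecimal characters, the length of a UUID written without hyphens. Whether the IDs really are UUIDs is an assumption, since the source does not say; the sketch below only shows that Python's standard `uuid` module parses them and reports the version field of each.

```python
import uuid

# First three sample values of CNTNTS_ID from this report.
sample_ids = [
    "f26b5facf43611eba7b3002b67f7b0e1",
    "f26b5fdbf43611eba11a002b67f7b0e1",
    "f26b8675f43611eb9e02002b67f7b0e1",
]

# uuid.UUID accepts a 32-character hex string via the `hex` argument.
parsed = [uuid.UUID(hex=value) for value in sample_ids]
for item in parsed:
    print(item, "version field:", item.version)
```

A matching version field is a hint about the ID layout, not proof of how the IDs were generated.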
Value | Count | Frequency (%) |
f26b5facf43611eba7b3002b67f7b0e1 | 1 | 1.0% |
f26c4a20f43611eb869d002b67f7b0e1 | 1 | 1.0% |
f26c97fef43611eb9518002b67f7b0e1 | 1 | 1.0% |
f26c717bf43611ebb1cf002b67f7b0e1 | 1 | 1.0% |
f26c4a97f43611eb9db8002b67f7b0e1 | 1 | 1.0% |
f26c4a58f43611ebb440002b67f7b0e1 | 1 | 1.0% |
f26c4a56f43611eb95f0002b67f7b0e1 | 1 | 1.0% |
f26c70e0f43611eb860f002b67f7b0e1 | 1 | 1.0% |
f26c710ff43611eba196002b67f7b0e1 | 1 | 1.0% |
f26c4a87f43611ebabe8002b67f7b0e1 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
b | 396 | |
0 | 343 | |
f | 330 | |
1 | 330 | |
6 | 328 | |
2 | 251 | |
e | 246 | |
7 | 229 | |
4 | 155 | 4.8% |
3 | 133 | 4.2% |
Other values (6) | 459 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 2025 | |
Lowercase Letter | 1175 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 343 | |
1 | 330 | |
6 | 328 | |
2 | 251 | |
7 | 229 | |
4 | 155 | |
3 | 133 | 6.6% |
9 | 94 | 4.6% |
5 | 85 | 4.2% |
8 | 77 | 3.8% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 396 | |
f | 330 | |
e | 246 | |
c | 92 | 7.8% |
a | 64 | 5.4% |
d | 47 | 4.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2025 | |
Latin | 1175 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 343 | |
1 | 330 | |
6 | 328 | |
2 | 251 | |
7 | 229 | |
4 | 155 | |
3 | 133 | 6.6% |
9 | 94 | 4.6% |
5 | 85 | 4.2% |
8 | 77 | 3.8% |
Latin
Value | Count | Frequency (%) |
b | 396 | |
f | 330 | |
e | 246 | |
c | 92 | 7.8% |
a | 64 | 5.4% |
d | 47 | 4.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3200 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
b | 396 | |
0 | 343 | |
f | 330 | |
1 | 330 | |
6 | 328 | |
2 | 251 | |
e | 246 | |
7 | 229 | |
4 | 155 | 4.8% |
3 | 133 | 4.2% |
Other values (6) | 459 |
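The category labels in the tables above ("Decimal Number", "Lowercase Letter") correspond to the Unicode general categories `Nd` and `Ll`. A minimal sketch of that kind of count, using Python's standard `unicodedata` module on the first sample ID; the helper name `category_counts` is hypothetical and this is not necessarily how the profiler computes it.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category (e.g. 'Nd', 'Ll')."""
    return Counter(unicodedata.category(ch) for ch in text)


first_id = "f26b5facf43611eba7b3002b67f7b0e1"
counts = category_counts(first_id)
print(dict(counts))
```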
CHNNEL_CL_NM
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
news |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | news |
---|---|
2nd row | news |
3rd row | news |
4th row | news |
5th row | news |
Common Values
Value | Count | Frequency (%) |
news | 100 |
Words
Value | Count | Frequency (%) |
news | 100 |
CHNNEL_NM
Categorical
HIGH CORRELATION
 
Distinct | 26 |
---|---|
Distinct (%) | 26.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
연합뉴스 | 12 |
---|---|
뉴스1 | 9 |
매일경제 | 9 |
뉴시스 | 8 |
중앙SUNDAY | 6 |
Other values (21) | 56 |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 4.16 |
Min length | 3 |
Unique
Unique | 8 |
---|---|
Unique (%) | 8.0% |
Sample
1st row | 동아일보 |
---|---|
2nd row | 머니투데이 |
3rd row | 매일경제 |
4th row | 경향신문 |
5th row | 매일경제 |
Common Values
Value | Count | Frequency (%) |
연합뉴스 | 12 | 12.0% |
뉴스1 | 9 | 9.0% |
매일경제 | 9 | 9.0% |
뉴시스 | 8 | 8.0% |
중앙SUNDAY | 6 | 6.0% |
서울경제 | 6 | 6.0% |
동아일보 | 5 | 5.0% |
경향신문 | 5 | 5.0% |
오마이뉴스 | 5 | 5.0% |
한겨레 | 5 | 5.0% |
Other values (16) | 30 |
Words
Value | Count | Frequency (%) |
연합뉴스 | 12 | 12.0% |
매일경제 | 9 | 9.0% |
뉴스1 | 9 | 9.0% |
뉴시스 | 8 | 8.0% |
중앙sunday | 6 | 6.0% |
서울경제 | 6 | 6.0% |
동아일보 | 5 | 5.0% |
경향신문 | 5 | 5.0% |
오마이뉴스 | 5 | 5.0% |
한겨레 | 5 | 5.0% |
Other values (16) | 30 |
TITLE_NM
Text
Distinct | 99 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 47 |
---|---|
Median length | 37 |
Mean length | 27.89 |
Min length | 12 |
Characters and Unicode
Total characters | 2789 |
---|---|
Distinct characters | 463 |
Distinct categories | 12 |
Distinct scripts | 4 |
Distinct blocks | 6 |
Unique
Unique | 98 |
---|---|
Unique (%) | 98.0% |
Sample
1st row | [신춘문예 2021/동화 가작]니들이 사춘기를 알아 |
---|---|
2nd row | “코로나 완치자인데 여전히 아프다”…후유증 외면하는 ‘불안한 시스템’ |
3rd row | 이주의 새책 (1월 9일자) |
4th row | 바삭바삭 구운 바닥 위에 바람 한 꼬집…세상에서 가장 맛있는 낮잠 레시피 [그림 책] |
5th row | 코로나 발생 76일…우한에선 무슨 일이 있었나 |
Value | Count | Frequency (%) |
신간 | 9 | 1.4% |
책 | 6 | 0.9% |
책꽂이 | 5 | 0.8% |
코로나 | 4 | 0.6% |
세계 | 3 | 0.5% |
발간 | 3 | 0.5% |
새책 | 3 | 0.5% |
미래 | 3 | 0.5% |
1월 | 3 | 0.5% |
이승우 | 3 | 0.5% |
Other values (551) | 607 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 550 | 19.7% |
이 | 57 | 2.0% |
' | 50 | 1.8% |
의 | 49 | 1.8% |
는 | 39 | 1.4% |
가 | 38 | 1.4% |
, | 37 | 1.3% |
] | 35 | 1.3% |
[ | 35 | 1.3% |
다 | 34 | 1.2% |
Other values (453) | 1865 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1880 | |
Space Separator | 550 | 19.7% |
Other Punctuation | 163 | 5.8% |
Decimal Number | 63 | 2.3% |
Close Punctuation | 38 | 1.4% |
Open Punctuation | 38 | 1.4% |
Final Punctuation | 21 | 0.8% |
Initial Punctuation | 20 | 0.7% |
Uppercase Letter | 9 | 0.3% |
Math Symbol | 5 | 0.2% |
Other values (2) | 2 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 57 | 3.0% |
의 | 49 | 2.6% |
는 | 39 | 2.1% |
가 | 38 | 2.0% |
다 | 34 | 1.8% |
한 | 32 | 1.7% |
문 | 27 | 1.4% |
책 | 27 | 1.4% |
시 | 25 | 1.3% |
로 | 25 | 1.3% |
Other values (411) | 1527 |
Other Punctuation
Value | Count | Frequency (%) |
' | 50 | |
, | 37 | |
· | 28 | |
… | 21 | |
. | 9 | 5.5% |
" | 9 | 5.5% |
? | 4 | 2.5% |
! | 2 | 1.2% |
/ | 2 | 1.2% |
: | 1 | 0.6% |
Decimal Number
Value | Count | Frequency (%) |
1 | 16 | |
2 | 15 | |
0 | 14 | |
6 | 4 | 6.3% |
3 | 3 | 4.8% |
8 | 3 | 4.8% |
5 | 3 | 4.8% |
9 | 2 | 3.2% |
7 | 2 | 3.2% |
4 | 1 | 1.6% |
Uppercase Letter
Value | Count | Frequency (%) |
D | 2 | |
B | 1 | |
T | 1 | |
S | 1 | |
K | 1 | |
P | 1 | |
A | 1 | |
I | 1 |
Math Symbol
Value | Count | Frequency (%) |
< | 2 | |
> | 2 | |
\| | 1 |
Close Punctuation
Value | Count | Frequency (%) |
] | 35 | |
) | 3 | 7.9% |
Open Punctuation
Value | Count | Frequency (%) |
[ | 35 | |
( | 3 | 7.9% |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 15 | |
“ | 5 | 25.0% |
Final Punctuation
Value | Count | Frequency (%) |
’ | 15 | |
” | 6 | 28.6% |
Space Separator
Value | Count | Frequency (%) |
(space) | 550 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1 |
Other Number
Value | Count | Frequency (%) |
① | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1877 | |
Common | 900 | |
Latin | 9 | 0.3% |
Han | 3 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 57 | 3.0% |
의 | 49 | 2.6% |
는 | 39 | 2.1% |
가 | 38 | 2.0% |
다 | 34 | 1.8% |
한 | 32 | 1.7% |
문 | 27 | 1.4% |
책 | 27 | 1.4% |
시 | 25 | 1.3% |
로 | 25 | 1.3% |
Other values (409) | 1524 |
Common
Value | Count | Frequency (%) |
(space) | 550 | 61.1% |
' | 50 | 5.6% |
, | 37 | 4.1% |
] | 35 | 3.9% |
[ | 35 | 3.9% |
· | 28 | 3.1% |
… | 21 | 2.3% |
1 | 16 | 1.8% |
‘ | 15 | 1.7% |
’ | 15 | 1.7% |
Other values (24) | 98 | 10.9% |
Latin
Value | Count | Frequency (%) |
D | 2 | |
B | 1 | |
T | 1 | |
S | 1 | |
K | 1 | |
P | 1 | |
A | 1 | |
I | 1 |
Han
Value | Count | Frequency (%) |
外 | 2 | |
美 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1877 | |
ASCII | 818 | |
Punctuation | 62 | 2.2% |
None | 28 | 1.0% |
CJK | 3 | 0.1% |
Enclosed Alphanum | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 550 | 67.2% |
' | 50 | 6.1% |
, | 37 | 4.5% |
] | 35 | 4.3% |
[ | 35 | 4.3% |
1 | 16 | 2.0% |
2 | 15 | 1.8% |
0 | 14 | 1.7% |
. | 9 | 1.1% |
" | 9 | 1.1% |
Other values (25) | 48 | 5.9% |
Hangul
Value | Count | Frequency (%) |
이 | 57 | 3.0% |
의 | 49 | 2.6% |
는 | 39 | 2.1% |
가 | 38 | 2.0% |
다 | 34 | 1.8% |
한 | 32 | 1.7% |
문 | 27 | 1.4% |
책 | 27 | 1.4% |
시 | 25 | 1.3% |
로 | 25 | 1.3% |
Other values (409) | 1524 |
None
Value | Count | Frequency (%) |
· | 28 |
Punctuation
Value | Count | Frequency (%) |
… | 21 | |
‘ | 15 | |
’ | 15 | |
” | 6 | 9.7% |
“ | 5 | 8.1% |
CJK
Value | Count | Frequency (%) |
外 | 2 | |
美 | 1 |
Enclosed Alphanum
Value | Count | Frequency (%) |
① | 1 |
NTCE_DT
Categorical
HIGH CORRELATION
 
Distinct | 39 |
---|---|
Distinct (%) | 39.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2021-01-29 00:00:00 | 8 |
---|---|
2021-01-23 00:00:00 | 6 |
2021-01-14 00:00:00 | 6 |
2021-01-01 00:00:00 | 6 |
2021-01-26 00:00:00 | 5 |
Other values (34) | 69 |
Length
Max length | 19 |
---|---|
Median length | 19 |
Mean length | 19 |
Min length | 19 |
Unique
Unique | 18 |
---|---|
Unique (%) | 18.0% |
Sample
1st row | 2021-01-01 00:00:00 |
---|---|
2nd row | 2021-01-01 00:00:00 |
3rd row | 2021-01-01 00:00:00 |
4th row | 2021-01-01 00:00:00 |
5th row | 2021-01-01 00:00:00 |
Common Values
Value | Count | Frequency (%) |
2021-01-29 00:00:00 | 8 | 8.0% |
2021-01-23 00:00:00 | 6 | 6.0% |
2021-01-14 00:00:00 | 6 | 6.0% |
2021-01-01 00:00:00 | 6 | 6.0% |
2021-01-26 00:00:00 | 5 | 5.0% |
2021-01-21 00:00:00 | 5 | 5.0% |
2021-01-19 00:00:00 | 5 | 5.0% |
2021-01-11 00:00:00 | 5 | 5.0% |
2021-01-20 00:00:00 | 4 | 4.0% |
2021-01-09 00:00:00 | 4 | 4.0% |
Other values (29) | 46 |
Words
Value | Count | Frequency (%) |
00:00:00 | 89 | 44.5% |
2021-01-29 | 8 | 4.0% |
2021-01-01 | 8 | 4.0% |
2021-01-23 | 6 | 3.0% |
2021-01-14 | 6 | 3.0% |
2021-01-05 | 6 | 3.0% |
2021-01-26 | 5 | 2.5% |
2021-01-21 | 5 | 2.5% |
2021-01-19 | 5 | 2.5% |
2021-01-11 | 5 | 2.5% |
Other values (30) | 57 |
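NTCE_DT is profiled as Categorical, but its values are timestamps in `YYYY-MM-DD HH:MM:SS` form, and the word counts above show that not every value falls on midnight. A minimal sketch of parsing such strings with the standard library before grouping by day; the four input strings are taken from the sample rows of this report, and treating the column as a timestamp is a suggestion rather than something the source does.

```python
from collections import Counter
from datetime import datetime

# NTCE_DT values copied from sample rows of this report.
raw = [
    "2021-01-01 00:00:00",
    "2021-01-01 16:28:00",
    "2021-01-02 09:09:00",
    "2021-01-29 00:00:00",
]

parsed = [datetime.strptime(value, "%Y-%m-%d %H:%M:%S") for value in raw]

# Group by calendar day and count entries with a real time of day.
per_day = Counter(ts.date().isoformat() for ts in parsed)
non_midnight = sum(
    1 for ts in parsed if (ts.hour, ts.minute, ts.second) != (0, 0, 0)
)
print(dict(per_day), non_midnight)
```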
DPI_VALUE
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 25 |
---|---|
Distinct (%) | 25.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.402 |
Minimum | 0 |
---|---|
Maximum | 66.2 |
Zeros | 32 |
Zeros (%) | 32.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0.4 |
Q3 | 1.4 |
95-th percentile | 14.02 |
Maximum | 66.2 |
Range | 66.2 |
Interquartile range (IQR) | 1.4 |
Descriptive statistics
Standard deviation | 7.6527935 |
---|---|
Coefficient of variation (CV) | 3.186009 |
Kurtosis | 49.968565 |
Mean | 2.402 |
Median Absolute Deviation (MAD) | 0.4 |
Skewness | 6.4782191 |
Sum | 240.2 |
Variance | 58.565248 |
Monotonicity | Not monotonic |
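Several of the figures above are tied together by simple identities: mean = sum / n, variance = standard deviation squared, and CV = standard deviation / mean. The sketch below checks the reported DPI_VALUE numbers against those identities; the tolerance is an assumption chosen to absorb the rounding in the printed values.

```python
import math

# Figures as reported for DPI_VALUE in this report.
reported = {
    "n": 100,
    "sum": 240.2,
    "mean": 2.402,
    "std": 7.6527935,
    "variance": 58.565248,
    "cv": 3.186009,
}

mean = reported["sum"] / reported["n"]
variance = reported["std"] ** 2
cv = reported["std"] / reported["mean"]

# Relative tolerance that allows for rounding of the printed figures.
TOL = 1e-6
checks = {
    "mean": math.isclose(mean, reported["mean"], rel_tol=TOL),
    "variance": math.isclose(variance, reported["variance"], rel_tol=TOL),
    "cv": math.isclose(cv, reported["cv"], rel_tol=TOL),
}
print(checks)
```

A CV above 3 together with a median (0.4) far below the mean (2.402) reflects the heavy right tail visible in the skewness and kurtosis figures.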
Common Values
Value | Count | Frequency (%) |
0.0 | 32 | 32.0% |
0.2 | 15 | 15.0% |
0.6 | 10 | 10.0% |
0.8 | 7 | 7.0% |
0.4 | 6 | 6.0% |
3.2 | 3 | 3.0% |
1.0 | 3 | 3.0% |
2.0 | 3 | 3.0% |
4.8 | 2 | 2.0% |
2.4 | 2 | 2.0% |
Other values (15) | 17 |
Minimum 10 values
Value | Count | Frequency (%) |
0.0 | 32 | 32.0% |
0.2 | 15 | 15.0% |
0.4 | 6 | 6.0% |
0.6 | 10 | 10.0% |
0.8 | 7 | 7.0% |
1.0 | 3 | 3.0% |
1.2 | 1 | 1.0% |
1.4 | 2 | 2.0% |
1.6 | 2 | 2.0% |
2.0 | 3 | 3.0% |
Maximum 10 values
Value | Count | Frequency (%) |
66.2 | 1 | 1.0% |
21.8 | 1 | 1.0% |
20.0 | 1 | 1.0% |
18.8 | 1 | 1.0% |
18.2 | 1 | 1.0% |
13.8 | 1 | 1.0% |
7.6 | 1 | 1.0% |
5.2 | 1 | 1.0% |
4.8 | 2 | 2.0% |
3.8 | 1 | 1.0% |
RSPN_CO
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 19 |
---|---|
Distinct (%) | 19.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.77 |
Minimum | 0 |
---|---|
Maximum | 203 |
Zeros | 36 |
Zeros (%) | 36.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 4 |
95-th percentile | 35.2 |
Maximum | 203 |
Range | 203 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 22.435927 |
---|---|
Coefficient of variation (CV) | 3.3140217 |
Kurtosis | 60.413708 |
Mean | 6.77 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 7.2161554 |
Sum | 677 |
Variance | 503.37081 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 36 | 36.0% |
1 | 21 | 21.0% |
4 | 7 | 7.0% |
3 | 7 | 7.0% |
2 | 7 | 7.0% |
10 | 3 | 3.0% |
12 | 3 | 3.0% |
8 | 3 | 3.0% |
7 | 2 | 2.0% |
11 | 2 | 2.0% |
Other values (9) | 9 | 9.0% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 36 | 36.0% |
1 | 21 | 21.0% |
2 | 7 | 7.0% |
3 | 7 | 7.0% |
4 | 7 | 7.0% |
5 | 1 | 1.0% |
6 | 1 | 1.0% |
7 | 2 | 2.0% |
8 | 3 | 3.0% |
10 | 3 | 3.0% |
Maximum 10 values
Value | Count | Frequency (%) |
203 | 1 | 1.0% |
59 | 1 | 1.0% |
54 | 1 | 1.0% |
46 | 1 | 1.0% |
39 | 1 | 1.0% |
35 | 1 | 1.0% |
20 | 1 | 1.0% |
12 | 3 | 3.0% |
11 | 2 | 2.0% |
10 | 3 | 3.0% |
COMMENT_CO
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 15 |
---|---|
Distinct (%) | 15.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.59 |
Minimum | 0 |
---|---|
Maximum | 64 |
Zeros | 62 |
Zeros (%) | 62.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 17.3 |
Maximum | 64 |
Range | 64 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 8.0917654 |
---|---|
Coefficient of variation (CV) | 3.1242337 |
Kurtosis | 34.975482 |
Mean | 2.59 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.3774936 |
Sum | 259 |
Variance | 65.476667 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
0 | 62 | 62.0% |
1 | 18 | 18.0% |
3 | 6 | 6.0% |
2 | 2 | 2.0% |
6 | 2 | 2.0% |
64 | 1 | 1.0% |
7 | 1 | 1.0% |
9 | 1 | 1.0% |
25 | 1 | 1.0% |
4 | 1 | 1.0% |
Other values (5) | 5 | 5.0% |
Minimum 10 values
Value | Count | Frequency (%) |
0 | 62 | 62.0% |
1 | 18 | 18.0% |
2 | 2 | 2.0% |
3 | 6 | 6.0% |
4 | 1 | 1.0% |
6 | 2 | 2.0% |
7 | 1 | 1.0% |
8 | 1 | 1.0% |
9 | 1 | 1.0% |
17 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
64 | 1 | 1.0% |
26 | 1 | 1.0% |
25 | 1 | 1.0% |
24 | 1 | 1.0% |
23 | 1 | 1.0% |
17 | 1 | 1.0% |
9 | 1 | 1.0% |
8 | 1 | 1.0% |
7 | 1 | 1.0% |
6 | 2 | 2.0% |
RECOMEND_CO
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
0 | 97 |
---|---|
1 | 3 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 97 | 97.0% |
1 | 3 | 3.0% |
Words
Value | Count | Frequency (%) |
0 | 97 | 97.0% |
1 | 3 | 3.0% |
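The 80.6% imbalance reported for RECOMEND_CO can be reproduced from the 97/3 split with a normalised-entropy score, 1 - H / log2(k), where H is the Shannon entropy of the class shares and k the number of classes. This is a sketch of one formula that matches the reported figure; whether it is exactly what the profiler computes internally is an assumption, and the function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """1 - normalised Shannon entropy: 0 is balanced, 1 is fully one-sided."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # Not defined for a single class; returning 0.0 is an arbitrary choice.
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


score = imbalance_score([97, 3])
print(f"{score:.1%}")
```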
CNTNTS_URL
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 97 |
---|---|
Median length | 97 |
Mean length | 96.78 |
Min length | 95 |
Characters and Unicode
Total characters | 9678 |
---|---|
Distinct characters | 34 |
Distinct categories | 5 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 100 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=020&aid=0003329728 |
---|---|
2nd row | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=008&aid=0004522420 |
3rd row | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=009&aid=0004726650 |
4th row | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=032&aid=0003051941 |
5th row | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=009&aid=0004726641 |
Value | Count | Frequency (%) |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=243&oid=020&aid=0003329728 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=243&oid=001&aid=0012149257 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=243&oid=020&aid=0003334269 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=243&oid=020&aid=0003333971 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=243&oid=015&aid=0004487606 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=243&oid=469&aid=0000573806 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=243&oid=015&aid=0004487620 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=243&oid=015&aid=0004487603 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=243&oid=016&aid=0001782294 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=243&oid=001&aid=0012150815 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
d | 700 | 7.2% |
= | 600 | 6.2% |
i | 600 | 6.2% |
0 | 584 | 6.0% |
s | 500 | 5.2% |
& | 500 | 5.2% |
m | 500 | 5.2% |
e | 489 | 5.1% |
a | 489 | 5.1% |
2 | 429 | 4.4% |
Other values (24) | 4287 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 5178 | |
Decimal Number | 2200 | |
Other Punctuation | 1400 | 14.5% |
Math Symbol | 600 | 6.2% |
Uppercase Letter | 300 | 3.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
d | 700 | |
i | 600 | |
s | 500 | |
m | 500 | |
e | 489 | |
a | 489 | |
n | 411 | |
o | 300 | |
r | 289 | |
h | 211 | 4.1% |
Other values (5) | 689 |
Decimal Number
Value | Count | Frequency (%) |
0 | 584 | |
2 | 429 | |
1 | 327 | |
3 | 313 | |
4 | 180 | 8.2% |
5 | 85 | 3.9% |
7 | 76 | 3.5% |
6 | 73 | 3.3% |
8 | 69 | 3.1% |
9 | 64 | 2.9% |
Other Punctuation
Value | Count | Frequency (%) |
& | 500 | |
/ | 400 | |
. | 300 | |
? | 100 | 7.1% |
: | 100 | 7.1% |
Uppercase Letter
Value | Count | Frequency (%) |
L | 100 | |
S | 100 | |
D | 100 |
Math Symbol
Value | Count | Frequency (%) |
= | 600 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 5478 | |
Common | 4200 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
d | 700 | |
i | 600 | |
s | 500 | |
m | 500 | |
e | 489 | |
a | 489 | |
n | 411 | |
o | 300 | 5.5% |
r | 289 | 5.3% |
h | 211 | 3.9% |
Other values (8) | 989 |
Common
Value | Count | Frequency (%) |
= | 600 | |
0 | 584 | |
& | 500 | |
2 | 429 | |
/ | 400 | |
1 | 327 | |
3 | 313 | |
. | 300 | |
4 | 180 | 4.3% |
? | 100 | 2.4% |
Other values (6) | 467 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 9678 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
d | 700 | 7.2% |
= | 600 | 6.2% |
i | 600 | 6.2% |
0 | 584 | 6.0% |
s | 500 | 5.2% |
& | 500 | 5.2% |
m | 500 | 5.2% |
e | 489 | 5.1% |
a | 489 | 5.1% |
2 | 429 | 4.4% |
Other values (24) | 4287 |
UPPER_CTGRY_NM
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
생활/문화 |
---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 생활/문화 |
---|---|
2nd row | 생활/문화 |
3rd row | 생활/문화 |
4th row | 생활/문화 |
5th row | 생활/문화 |
Common Values
Value | Count | Frequency (%) |
생활/문화 | 100 |
Words
Value | Count | Frequency (%) |
생활/문화 | 100 |
LWPRT_CTGRY_NM
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
책 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 책 |
---|---|
2nd row | 책 |
3rd row | 책 |
4th row | 책 |
5th row | 책 |
Common Values
Value | Count | Frequency (%) |
책 | 100 |
Words
Value | Count | Frequency (%) |
책 | 100 |
Correlations
Variable | SEQ_NO | CNTNTS_ID | CHNNEL_NM | TITLE_NM | NTCE_DT | DPI_VALUE | RSPN_CO | COMMENT_CO | RECOMEND_CO | CNTNTS_URL |
---|---|---|---|---|---|---|---|---|---|---|
SEQ_NO | 1.000 | 1.000 | 0.460 | 0.941 | 0.980 | 0.000 | 0.237 | 0.309 | 0.587 | 1.000 |
CNTNTS_ID | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
CHNNEL_NM | 0.460 | 1.000 | 1.000 | 1.000 | 0.853 | 0.792 | 0.847 | 0.739 | 0.000 | 1.000 |
TITLE_NM | 0.941 | 1.000 | 1.000 | 1.000 | 0.962 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
NTCE_DT | 0.980 | 1.000 | 0.853 | 0.962 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 |
DPI_VALUE | 0.000 | 1.000 | 0.792 | 1.000 | 0.000 | 1.000 | 0.897 | 0.888 | 0.000 | 1.000 |
RSPN_CO | 0.237 | 1.000 | 0.847 | 1.000 | 0.000 | 0.897 | 1.000 | 1.000 | 0.000 | 1.000 |
COMMENT_CO | 0.309 | 1.000 | 0.739 | 1.000 | 0.000 | 0.888 | 1.000 | 1.000 | 0.000 | 1.000 |
RECOMEND_CO | 0.587 | 1.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 |
CNTNTS_URL | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
Variable | NTCE_DT | RECOMEND_CO | CHNNEL_NM |
---|---|---|---|
NTCE_DT | 1.000 | 0.789 | 0.302 |
RECOMEND_CO | 0.789 | 1.000 | 0.000 |
CHNNEL_NM | 0.302 | 0.000 | 1.000 |
Variable | SEQ_NO | DPI_VALUE | RSPN_CO | COMMENT_CO | CHNNEL_NM | NTCE_DT | RECOMEND_CO |
---|---|---|---|---|---|---|---|
SEQ_NO | 1.000 | -0.121 | -0.028 | -0.134 | 0.156 | 0.703 | 0.401 |
DPI_VALUE | -0.121 | 1.000 | 0.937 | 0.815 | 0.464 | 0.000 | 0.000 |
RSPN_CO | -0.028 | 0.937 | 1.000 | 0.645 | 0.550 | 0.000 | 0.000 |
COMMENT_CO | -0.134 | 0.815 | 0.645 | 1.000 | 0.392 | 0.000 | 0.000 |
CHNNEL_NM | 0.156 | 0.464 | 0.550 | 0.392 | 1.000 | 0.302 | 0.000 |
NTCE_DT | 0.703 | 0.000 | 0.000 | 0.000 | 0.302 | 1.000 | 0.789 |
RECOMEND_CO | 0.401 | 0.000 | 0.000 | 0.000 | 0.000 | 0.789 | 1.000 |
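The High correlation alerts near the top of the report can be recovered from the last matrix above by listing, for each variable, the other variables whose absolute coefficient exceeds a cut-off. The sketch below uses 0.5 as that cut-off; the value is an assumption that happens to reproduce the alert list exactly, and the function name `high_correlations` is hypothetical.

```python
names = ["SEQ_NO", "DPI_VALUE", "RSPN_CO", "COMMENT_CO",
         "CHNNEL_NM", "NTCE_DT", "RECOMEND_CO"]

# Coefficients copied from the 7 x 7 matrix in this report.
matrix = [
    [1.000, -0.121, -0.028, -0.134, 0.156, 0.703, 0.401],
    [-0.121, 1.000, 0.937, 0.815, 0.464, 0.000, 0.000],
    [-0.028, 0.937, 1.000, 0.645, 0.550, 0.000, 0.000],
    [-0.134, 0.815, 0.645, 1.000, 0.392, 0.000, 0.000],
    [0.156, 0.464, 0.550, 0.392, 1.000, 0.302, 0.000],
    [0.703, 0.000, 0.000, 0.000, 0.302, 1.000, 0.789],
    [0.401, 0.000, 0.000, 0.000, 0.000, 0.789, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not stated in the source


def high_correlations(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| > threshold."""
    result = {}
    for i, row in enumerate(matrix):
        partners = [
            names[j]
            for j, value in enumerate(row)
            if i != j and abs(value) > threshold
        ]
        if partners:
            result[names[i]] = partners
    return result


high = high_correlations(names, matrix, THRESHOLD)
for variable, partners in high.items():
    print(variable, "->", ", ".join(partners))
```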
First rows
Row | SEQ_NO | CNTNTS_ID | CHNNEL_CL_NM | CHNNEL_NM | TITLE_NM | NTCE_DT | DPI_VALUE | RSPN_CO | COMMENT_CO | RECOMEND_CO | CNTNTS_URL | UPPER_CTGRY_NM | LWPRT_CTGRY_NM |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 58741 | f26b5facf43611eba7b3002b67f7b0e1 | news | 동아일보 | [신춘문예 2021/동화 가작]니들이 사춘기를 알아 | 2021-01-01 00:00:00 | 0.2 | 1 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=020&aid=0003329728 | 생활/문화 | 책 |
1 | 58788 | f26b5fdbf43611eba11a002b67f7b0e1 | news | 머니투데이 | “코로나 완치자인데 여전히 아프다”…후유증 외면하는 ‘불안한 시스템’ | 2021-01-01 00:00:00 | 66.2 | 203 | 64 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=008&aid=0004522420 | 생활/문화 | 책 |
2 | 58792 | f26b8675f43611eb9e02002b67f7b0e1 | news | 매일경제 | 이주의 새책 (1월 9일자) | 2021-01-01 00:00:00 | 0.0 | 0 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=009&aid=0004726650 | 생활/문화 | 책 |
3 | 58779 | f26b5fd2f43611ebbd4b002b67f7b0e1 | news | 경향신문 | 바삭바삭 구운 바닥 위에 바람 한 꼬집…세상에서 가장 맛있는 낮잠 레시피 [그림 책] | 2021-01-01 00:00:00 | 1.2 | 4 | 1 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=032&aid=0003051941 | 생활/문화 | 책 |
4 | 58783 | f26b5fd6f43611ebb253002b67f7b0e1 | news | 매일경제 | 코로나 발생 76일…우한에선 무슨 일이 있었나 | 2021-01-01 00:00:00 | 3.2 | 10 | 3 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=009&aid=0004726641 | 생활/문화 | 책 |
5 | 58789 | f26b5fdcf43611eb9527002b67f7b0e1 | news | 매일경제 | 포스트코로나, 아시아의 재발견 | 2021-01-01 00:00:00 | 1.0 | 3 | 1 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=009&aid=0004726655 | 생활/문화 | 책 |
6 | 58462 | af882823558911ebaa8570c94e625020 | news | 매일경제 | 코로나 발생 76일…우한에선 무슨 일이 있었나 | 2021-01-01 16:28:00 | 3.2 | 10 | 3 | 0 | https://news.naver.com/main/read.nhn?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=009&aid=0004726641 | 생활/문화 | 책 |
7 | 58459 | ae227144558911ebbf9370c94e625020 | news | 매일경제 | 신간 다이제스트 (1월 2일자) | 2021-01-01 16:45:00 | 0.0 | 0 | 0 | 0 | https://news.naver.com/main/read.nhn?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=009&aid=0004726653 | 생활/문화 | 책 |
8 | 58751 | f26b5fb6f43611ebb9fb002b67f7b0e1 | news | 동아일보 | [책의 향기] 로봇 안내원, 인공육 판매… 미래의 상점은 이런 모습? | 2021-01-02 00:00:00 | 5.2 | 12 | 7 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=020&aid=0003329853 | 생활/문화 | 책 |
9 | 58497 | bff30704558911eb95e670c94e625020 | news | 뉴스1 | [신간] 콧방울 점 있으면 부자되는 이유…김동완의 '관상 심리학' | 2021-01-02 09:09:00 | 0.8 | 2 | 0 | 1 | https://news.naver.com/main/read.nhn?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=421&aid=0005083875 | 생활/문화 | 책 |
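Rows 4 and 6 above carry the same title and the same counts but different SEQ_NO, CNTNTS_ID and NTCE_DT values, and their URLs differ only in the path (`read.naver` versus `read.nhn`). This explains why TITLE_NM has 99 distinct values while the duplicate-row check reports none. A minimal sketch of a comparison key built from the `oid` and `aid` query parameters; treating that pair as an article identifier is an assumption, and the function name `article_key` is hypothetical.

```python
from urllib.parse import parse_qs, urlsplit


def article_key(url):
    """Return the (oid, aid) query parameters of a URL as a comparison key."""
    query = parse_qs(urlsplit(url).query)
    return query["oid"][0], query["aid"][0]


# CNTNTS_URL values of rows 4 and 6 from this report.
url_row4 = "https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=009&aid=0004726641"
url_row6 = "https://news.naver.com/main/read.nhn?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=009&aid=0004726641"

same_article = article_key(url_row4) == article_key(url_row6)
print(article_key(url_row4), same_article)
```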
Last rows
Row | SEQ_NO | CNTNTS_ID | CHNNEL_CL_NM | CHNNEL_NM | TITLE_NM | NTCE_DT | DPI_VALUE | RSPN_CO | COMMENT_CO | RECOMEND_CO | CNTNTS_URL | UPPER_CTGRY_NM | LWPRT_CTGRY_NM |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
90 | 60601 | f26ce662f43611ebaded002b67f7b0e1 | news | 한겨레 | ‘친밀함이 두려운 인간’이 대세가 된다 | 2021-01-29 00:00:00 | 18.2 | 39 | 26 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=028&aid=0002530734 | 생활/문화 | 책 |
91 | 60608 | f26ce669f43611eb8b9b002b67f7b0e1 | news | 한겨레 | 타인을 하찮은 존재로 만드는, 편견 | 2021-01-29 00:00:00 | 0.0 | 0 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=028&aid=0002530724 | 생활/문화 | 책 |
92 | 60587 | f26ce654f43611ebae69002b67f7b0e1 | news | 오마이뉴스 | 섭이수씨가 '섭씨'를 만들었다는 사실, 알고 계셨나요? | 2021-01-29 00:00:00 | 0.4 | 2 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=047&aid=0002300596 | 생활/문화 | 책 |
93 | 60554 | f26ce633f43611eb9a61002b67f7b0e1 | news | 서울경제 | "올해는 365일 독서"...밀리의서재, 29일 저녁 온라인 독서 토크 | 2021-01-29 00:00:00 | 0.0 | 0 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=011&aid=0003864653 | 생활/문화 | 책 |
94 | 59983 | f26c4a93f43611eb8d08002b67f7b0e1 | news | 매일경제 | 이주의 새책 (1월 30일자) | 2021-01-29 00:00:00 | 0.6 | 1 | 1 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=009&aid=0004741523 | 생활/문화 | 책 |
95 | 59979 | f26c4a8ff43611eba066002b67f7b0e1 | news | 매일경제 | 기업혁신 111가지 실천전략…'세일즈포스'의 성공비결 | 2021-01-29 00:00:00 | 0.8 | 2 | 1 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=009&aid=0004741525 | 생활/문화 | 책 |
96 | 60440 | f26cbf84f43611ebb7c7002b67f7b0e1 | news | 한국일보 | 신예작가가 그리는 섬뜩하지만 날카로운 미래 사회, ‘인간교’ | 2021-01-29 00:00:00 | 0.2 | 1 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=469&aid=0000575921 | 생활/문화 | 책 |
97 | 60555 | f26ce634f43611eb9b93002b67f7b0e1 | news | 연합뉴스 | [게시판] 박경림·이동진, 밀리의 서재 온라인 토크 콘서트 | 2021-01-29 00:00:00 | 0.8 | 4 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=001&aid=0012171359 | 생활/문화 | 책 |
98 | 60666 | f26ce6a3f43611ebac2f002b67f7b0e1 | news | 중앙SUNDAY | [책꽂이] 미친 세상을 이해하는 척 하는 방법 外 | 2021-01-30 00:00:00 | 0.0 | 0 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=353&aid=0000038847 | 생활/문화 | 책 |
99 | 60647 | f26ce690f43611eb90ae002b67f7b0e1 | news | 조선일보 | 아빠가 된 ‘몬스터’가 말했어요 “너는 너일 때 가장 아름답단다” | 2021-01-30 00:00:00 | 0.4 | 2 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=243&oid=023&aid=0003593222 | 생활/문화 | 책 |
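In all 20 sample rows shown in this report, DPI_VALUE equals (RSPN_CO + 2 × COMMENT_CO + 2 × RECOMEND_CO) / 5, and the reported column sums agree with the same relation. That would account for the high correlation between DPI_VALUE, RSPN_CO and COMMENT_CO. This is an empirical observation rather than a documented formula: the source does not define DPI_VALUE, and the RECOMEND_CO weight rests on a single sample row with a non-zero value. The function name `weighted_score` is hypothetical.

```python
import math

# (DPI_VALUE, RSPN_CO, COMMENT_CO, RECOMEND_CO) for the 20 sample rows shown.
rows = [
    (0.2, 1, 0, 0), (66.2, 203, 64, 0), (0.0, 0, 0, 0), (1.2, 4, 1, 0),
    (3.2, 10, 3, 0), (1.0, 3, 1, 0), (3.2, 10, 3, 0), (0.0, 0, 0, 0),
    (5.2, 12, 7, 0), (0.8, 2, 0, 1),
    (18.2, 39, 26, 0), (0.0, 0, 0, 0), (0.4, 2, 0, 0), (0.0, 0, 0, 0),
    (0.6, 1, 1, 0), (0.8, 2, 1, 0), (0.2, 1, 0, 0), (0.8, 4, 0, 0),
    (0.0, 0, 0, 0), (0.4, 2, 0, 0),
]


def weighted_score(responses, comments, recommends):
    """Candidate relation fitted by eye to the sample rows (an assumption)."""
    return (responses + 2 * comments + 2 * recommends) / 5


all_match = all(
    math.isclose(dpi, weighted_score(r, c, rec), abs_tol=1e-9)
    for dpi, r, c, rec in rows
)

# Column sums reported in this report: RSPN_CO 677, COMMENT_CO 259,
# RECOMEND_CO 3 (three rows with value 1), DPI_VALUE 240.2.
sum_matches = math.isclose(weighted_score(677, 259, 3), 240.2, abs_tol=1e-9)
print(all_match, sum_matches)
```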